BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022859
         (291 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255548834|ref|XP_002515473.1| Protein C2orf4, putative [Ricinus communis]
 gi|223545417|gb|EEF46922.1| Protein C2orf4, putative [Ricinus communis]
          Length = 291

 Score =  532 bits (1370), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 259/291 (89%), Positives = 276/291 (94%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           MEKIRRASHAGSWYTDN K+L EELDGWL+ A L KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1   MEKIRRASHAGSWYTDNPKKLDEELDGWLKAAALEKSPDVRGVIAPHAGYSYSGRAAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDP +ISR+FLLGPSHHYYTPKCALSTATVYKTPIGDLP+DLEVIEELKATGKFELM
Sbjct: 61  FGNIDPANISRIFLLGPSHHYYTPKCALSTATVYKTPIGDLPIDLEVIEELKATGKFELM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVFEGH +K+VPILVGA+NA+NEAMYG+L  KYVDDP+NFF
Sbjct: 121 DLRVDEAEHSMEMHLPYLAKVFEGHQVKVVPILVGALNADNEAMYGKLLGKYVDDPTNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRFNYMHYDKK+G IHKSIEALDKMGMDIIETGDPDAFK+YL EYDNTIC
Sbjct: 181 SVSSDFCHWGSRFNYMHYDKKYGAIHKSIEALDKMGMDIIETGDPDAFKQYLQEYDNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVF+HML NC+T IKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFVHMLRNCATVIKIKFLRYEQSSQCKTTRDSSVSYASAAAKVDA 291


>gi|359475871|ref|XP_002285449.2| PREDICTED: protein MEMO1-like [Vitis vinifera]
 gi|296082111|emb|CBI21116.3| unnamed protein product [Vitis vinifera]
          Length = 291

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 256/291 (87%), Positives = 277/291 (95%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME IRRASHAGSWYTDN ++LAEELDGWLR +GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1   MEMIRRASHAGSWYTDNPRKLAEELDGWLRASGLAKSPDVRGVIAPHAGYSYSGRAAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDP+SISRVFLLGPSHHYYTPKCALS ATVYKTP+GDL +DLEV+EELKATGKFELM
Sbjct: 61  FGNIDPSSISRVFLLGPSHHYYTPKCALSRATVYKTPVGDLQIDLEVVEELKATGKFELM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVF+G+ +KI+PILVGA+NAENEAMYGR+ AKYVDDP+NFF
Sbjct: 121 DLDVDEAEHSMEMHLPYLAKVFDGYPVKIIPILVGALNAENEAMYGRMLAKYVDDPNNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRFNY+HYDKK G IHKSIEALD+MGMDIIETGDPDAFK+YLLE+DNTIC
Sbjct: 181 SVSSDFCHWGSRFNYVHYDKKFGAIHKSIEALDRMGMDIIETGDPDAFKRYLLEFDNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFLHML NCSTKIKI FLRYEQSSQCKT RDSSVSYASAAA+VDA
Sbjct: 241 GRHPISVFLHMLRNCSTKIKINFLRYEQSSQCKTMRDSSVSYASAAAQVDA 291


>gi|449441896|ref|XP_004138718.1| PREDICTED: protein MEMO1-like [Cucumis sativus]
 gi|449493342|ref|XP_004159261.1| PREDICTED: protein MEMO1-like [Cucumis sativus]
          Length = 291

 Score =  523 bits (1348), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 256/291 (87%), Positives = 277/291 (95%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+KIRRASHAGSWYTDNSK+LAEELDGWL  +GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1   MDKIRRASHAGSWYTDNSKKLAEELDGWLSASGLSKSPDVRGVIAPHAGYSYSGRAAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISR+FLLGPSHHYYTPKCALSTAT+Y+TP+GDLP+DLEVIEELKATGKFELM
Sbjct: 61  FGNIDPTNISRIFLLGPSHHYYTPKCALSTATIYQTPVGDLPIDLEVIEELKATGKFELM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVFEGH +K+VPILVGAV+AE+EA YGRL AKYVDDP NFF
Sbjct: 121 DMHVDEAEHSMEMHLPYLAKVFEGHPVKVVPILVGAVSAESEATYGRLLAKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           S+SSDFCHWGSRF+YMHYDKKHG I+KSIEALD+MGMDIIETGD +AFKKYL EYDNTIC
Sbjct: 181 SISSDFCHWGSRFSYMHYDKKHGPIYKSIEALDRMGMDIIETGDAEAFKKYLQEYDNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFLHML + STKIKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMLKHSSTKIKIKFLRYEQSSQCKTTRDSSVSYASAAAKVDA 291


>gi|356552504|ref|XP_003544607.1| PREDICTED: protein MEMO1-like [Glycine max]
          Length = 296

 Score =  520 bits (1339), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 252/288 (87%), Positives = 273/288 (94%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K+RR SHAGSWYTDN K+L+EEL+GWL+  GL KS +VRGVIAPHAGYSYSGRAAAYAFG
Sbjct: 4   KVRRPSHAGSWYTDNPKQLSEELEGWLQSCGLTKSSDVRGVIAPHAGYSYSGRAAAYAFG 63

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           NIDP++I+RVFLLGPSHHYYTPKCAL+TATVYKTPIGDLP+DLEV EELKATGKFELMDI
Sbjct: 64  NIDPSNITRVFLLGPSHHYYTPKCALTTATVYKTPIGDLPIDLEVTEELKATGKFELMDI 123

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
            VDEAEHSMEMHLPYLAKVFEGH +KIVPILVGAV+AENEAMYG++ AKYVDD +NFFS+
Sbjct: 124 RVDEAEHSMEMHLPYLAKVFEGHQVKIVPILVGAVSAENEAMYGQILAKYVDDSNNFFSI 183

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWGSRFNYMHYDKKHG I+KSIEALDKMGMDIIETGDPD+FK+YLLEYDNTICGR
Sbjct: 184 SSDFCHWGSRFNYMHYDKKHGPIYKSIEALDKMGMDIIETGDPDSFKQYLLEYDNTICGR 243

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           HPISVFLHML NCSTKIKIKFLRYEQSSQCK+ RDSSVSYASAAAK D
Sbjct: 244 HPISVFLHMLRNCSTKIKIKFLRYEQSSQCKSNRDSSVSYASAAAKTD 291


>gi|224143207|ref|XP_002324881.1| predicted protein [Populus trichocarpa]
 gi|118483368|gb|ABK93585.1| unknown [Populus trichocarpa]
 gi|222866315|gb|EEF03446.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  516 bits (1330), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 254/291 (87%), Positives = 271/291 (93%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+KIRR SHAGSWYTDN K+L EEL+GWL   GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1   MDKIRRPSHAGSWYTDNPKKLEEELEGWLGATGLTKSPDVRGVIAPHAGYSYSGRAAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+I+RVFLLGPSHHYYTPKCALSTATVYKTPIGDLP+DLEVIEELKATGKFELM
Sbjct: 61  FGNIDPTNITRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPIDLEVIEELKATGKFELM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYL K+FEGH +K+VPILVGA+N +NEAMYGRL AKYVDDP+NFF
Sbjct: 121 DLQVDEAEHSMEMHLPYLVKIFEGHPVKVVPILVGALNTDNEAMYGRLLAKYVDDPTNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRF+Y HYDKK G IHKSIEALDKMGMDIIETG+ DAFK+YL EYDNTIC
Sbjct: 181 SVSSDFCHWGSRFHYTHYDKKCGPIHKSIEALDKMGMDIIETGNADAFKQYLSEYDNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFLHM  NCSTKIKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMSSNCSTKIKIKFLRYEQSSQCKTMRDSSVSYASAAAKVDA 291


>gi|255647372|gb|ACU24152.1| unknown [Glycine max]
          Length = 296

 Score =  516 bits (1330), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 250/288 (86%), Positives = 272/288 (94%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K+RR SHAGSWYTDN K+L+EEL+GWL+  GL KS +VRGVIAPHAGYSYSGRAAAYAFG
Sbjct: 4   KVRRPSHAGSWYTDNPKQLSEELEGWLQSCGLTKSSDVRGVIAPHAGYSYSGRAAAYAFG 63

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           NIDP++I+RVFLLGPSHHYYTPKCAL+TATVYKTPIGDLP+DLEV EELKATGKFELMDI
Sbjct: 64  NIDPSNITRVFLLGPSHHYYTPKCALTTATVYKTPIGDLPIDLEVTEELKATGKFELMDI 123

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
            VDEAEHSME+HLPYLAKVFEGH +KIVPILVGAV+AENEAMYG++ AKYVDD +NFFS+
Sbjct: 124 RVDEAEHSMEVHLPYLAKVFEGHQVKIVPILVGAVSAENEAMYGQILAKYVDDSNNFFSI 183

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWGSRFNYMHYDKKHG I+KSIEALDKMGMDIIETGDPD+FK+YLLEYDNTICGR
Sbjct: 184 SSDFCHWGSRFNYMHYDKKHGPIYKSIEALDKMGMDIIETGDPDSFKQYLLEYDNTICGR 243

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           HPISVFLHML NCSTKIKIKF RYEQSSQCK+ RDSSVSYASAAAK D
Sbjct: 244 HPISVFLHMLRNCSTKIKIKFFRYEQSSQCKSNRDSSVSYASAAAKTD 291


>gi|224092526|ref|XP_002309646.1| predicted protein [Populus trichocarpa]
 gi|222855622|gb|EEE93169.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 252/291 (86%), Positives = 270/291 (92%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           MEKIRRASHAGSWYTD+ K+L EEL+GWL + GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1   MEKIRRASHAGSWYTDDPKKLEEELEGWLSDTGLTKSPDVRGVIAPHAGYSYSGRAAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+I+RVFLLGPSHHYYTPKCALSTA VYKTPIGDLP+DLEVIEELKATGKFELM
Sbjct: 61  FGNIDPTNITRVFLLGPSHHYYTPKCALSTAAVYKTPIGDLPIDLEVIEELKATGKFELM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDE EHSMEMHLPYLAK+FEGH +K+VPILVGAV+A+NEAMYG L AKYVDDP+NFF
Sbjct: 121 DLQVDEHEHSMEMHLPYLAKIFEGHPVKVVPILVGAVSADNEAMYGCLLAKYVDDPTNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRF+Y  YDKK G IHKSIEALDKMGMDIIETG+ DA K+YL EY NTIC
Sbjct: 181 SVSSDFCHWGSRFHYTRYDKKCGPIHKSIEALDKMGMDIIETGNADALKQYLSEYGNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFLHML NCSTKIKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMLRNCSTKIKIKFLRYEQSSQCKTMRDSSVSYASAAAKVDA 291


>gi|326489895|dbj|BAJ94021.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326513924|dbj|BAJ92112.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 298

 Score =  497 bits (1279), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 238/291 (81%), Positives = 267/291 (91%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME++RRASHAGSWYT+N+ +L EELDGWL  A L KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1   MERVRRASHAGSWYTNNASKLEEELDGWLSAAALTKSPDVRAVIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ ATVY TPIGDLP+DLEVIEELKATGKFE M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALTRATVYSTPIGDLPVDLEVIEELKATGKFEFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYL+KVF+GH +K+VPILVGAVN+++EAMYG+L AKYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLSKVFQGHNVKVVPILVGAVNSQSEAMYGQLLAKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRF+Y +YDK HG IHKSIEALD+MGM+IIETGDPDAFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYDKNHGAIHKSIEALDRMGMEIIETGDPDAFKQYLQEYENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISV LHML +C TKIK+ F+RYEQSSQCK+ RDSSVSYASAAAKVD+
Sbjct: 241 GRHPISVLLHMLKHCLTKIKVGFVRYEQSSQCKSMRDSSVSYASAAAKVDS 291


>gi|258644468|dbj|BAI39726.1| unknown protein [Oryza sativa Indica Group]
          Length = 298

 Score =  493 bits (1270), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 233/291 (80%), Positives = 268/291 (92%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME++RRASHAGSWYT+N+++L EELDGWLR AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1   MERVRRASHAGSWYTNNARKLDEELDGWLRAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ AT+Y TPIGDLP+D EVIEEL ATGKF+ M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALTRATIYSTPIGDLPVDHEVIEELNATGKFDFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYL+KVF+GH +K+VPILVGA+N++NEAMYG+L +KY+DDP NFF
Sbjct: 121 DLSVDEAEHSMEMHLPYLSKVFQGHNVKVVPILVGALNSQNEAMYGQLLSKYLDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           S+SSDFCHWG+RF+Y +YDK HG IHKSIEALD+MGM+IIETG+PDAFK+YL EY+NTIC
Sbjct: 181 SISSDFCHWGTRFSYTYYDKSHGAIHKSIEALDRMGMEIIETGNPDAFKQYLQEYENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFL ML +CSTKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD 
Sbjct: 241 GRHPISVFLSMLKHCSTKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 291


>gi|115475776|ref|NP_001061484.1| Os08g0299000 [Oryza sativa Japonica Group]
 gi|50508226|dbj|BAD31730.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623453|dbj|BAF23398.1| Os08g0299000 [Oryza sativa Japonica Group]
 gi|215678729|dbj|BAG95166.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215692469|dbj|BAG87889.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215708833|dbj|BAG94102.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222640303|gb|EEE68435.1| hypothetical protein OsJ_26808 [Oryza sativa Japonica Group]
          Length = 298

 Score =  490 bits (1262), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 232/291 (79%), Positives = 267/291 (91%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME++RRASHAGSWYT+N+++L EELDGWLR AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1   MERVRRASHAGSWYTNNARKLDEELDGWLRAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ AT+Y TPIGDLP+D EVIEEL ATGKF+ M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALTRATIYSTPIGDLPVDHEVIEELNATGKFDFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYL+KVF+GH +K+VPILVGA+N++NEAMYG+L +KY+DDP NFF
Sbjct: 121 DLSVDEAEHSMEMHLPYLSKVFQGHNVKVVPILVGALNSQNEAMYGQLLSKYLDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           S+SSDFCHWG+RF+Y +YDK HG IHKSIEALD+MGM+IIETG+PDAFK+YL EY+NTIC
Sbjct: 181 SISSDFCHWGTRFSYTYYDKSHGAIHKSIEALDRMGMEIIETGNPDAFKQYLQEYENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFL ML +C TKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD 
Sbjct: 241 GRHPISVFLSMLKHCLTKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 291


>gi|242055595|ref|XP_002456943.1| hypothetical protein SORBIDRAFT_03g046030 [Sorghum bicolor]
 gi|241928918|gb|EES02063.1| hypothetical protein SORBIDRAFT_03g046030 [Sorghum bicolor]
          Length = 299

 Score =  489 bits (1258), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 234/291 (80%), Positives = 268/291 (92%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME++RRASHAGSWYT+N+++L EEL+GWL  AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1   MERVRRASHAGSWYTNNARKLEEELNGWLGAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATG+FE M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGRFEFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGA+++++EAMYG+L +KYVDDP NFF
Sbjct: 121 DLSVDEAEHSMEMHLPYLAKVFQGHTVKVVPILVGALSSQSEAMYGQLLSKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRF+Y +Y++KHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYEEKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFLHML +CSTKIKI F+RYEQSSQCK  RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNFRDSSVSYASAAAKVDA 291


>gi|297825573|ref|XP_002880669.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326508|gb|EFH56928.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 234/291 (80%), Positives = 264/291 (90%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           MEK+R+ +HAGSWYTDN  +L+ +L+ WL   GL KSP VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1   MEKVRQPTHAGSWYTDNPTKLSSDLEEWLSATGLTKSPHVRGVIAPHAGYSYSGRAAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           F NIDPT+ISR+FLLGPSHH+YTPKCALSTATVYKTPIGDLP+D+E+I+E++A GKF +M
Sbjct: 61  FANIDPTNISRIFLLGPSHHFYTPKCALSTATVYKTPIGDLPVDVEMIKEIRAMGKFGMM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGAV+AENEAMYG L AKYVDDP NFF
Sbjct: 121 DLRVDEAEHSMEMHLPYLAKVFKGHDVKVVPILVGAVSAENEAMYGELLAKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRFNYMHYD  HG IHKSIEALDK GMDIIETGDPDAFKKYLLE++NTIC
Sbjct: 181 SVSSDFCHWGSRFNYMHYDNTHGAIHKSIEALDKKGMDIIETGDPDAFKKYLLEFENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPIS+FLHML + S+KIKI FLRYEQSSQC+T RDSSVSYASAAAK++ 
Sbjct: 241 GRHPISIFLHMLKHSSSKIKINFLRYEQSSQCQTMRDSSVSYASAAAKLET 291


>gi|357145523|ref|XP_003573672.1| PREDICTED: protein MEMO1-like [Brachypodium distachyon]
          Length = 298

 Score =  486 bits (1252), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 231/291 (79%), Positives = 266/291 (91%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME++R+ASHAGSWYT+N+++L EELDGWLR AGL KSP+VR +IAPHAGYSYSGR AAYA
Sbjct: 1   MERVRKASHAGSWYTNNARKLEEELDGWLRAAGLTKSPDVRAIIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCALS  TVY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALSRTTVYSTPIGDLPVDQEVIEELSATGKFEFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D  VDEAEHSMEMHLPYL+KVF+G+ +K+VPILVGA+++ENEAMYG+L +KYVDDP NFF
Sbjct: 121 DHNVDEAEHSMEMHLPYLSKVFQGYNVKVVPILVGALSSENEAMYGQLLSKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           S+SSDFCHWGSRF+Y +YDK HG IHKSIEALD++GM+IIE+GDPDAFK+YL EY+NTIC
Sbjct: 181 SISSDFCHWGSRFSYTYYDKSHGAIHKSIEALDRLGMEIIESGDPDAFKQYLREYENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISV LHML +CSTK K+ F+RYEQSSQCK+ RDSSVSYASAAAK+DA
Sbjct: 241 GRHPISVLLHMLKHCSTKSKVGFVRYEQSSQCKSMRDSSVSYASAAAKIDA 291


>gi|148910301|gb|ABR18230.1| unknown [Picea sitchensis]
          Length = 291

 Score =  486 bits (1250), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 225/291 (77%), Positives = 255/291 (87%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME+ RRA HAGSWYTDN+ +L EELD WLR  G  KSP+VRGVIAPHAGYSYSGR AA+A
Sbjct: 1   MERARRAVHAGSWYTDNAGQLGEELDRWLRACGQSKSPDVRGVIAPHAGYSYSGRCAAFA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           F NIDP   SRVFLLGPSHHYYTPKCALS ATVYKTP+GDLP+DL+V +EL+A+GKFE M
Sbjct: 61  FANIDPQRTSRVFLLGPSHHYYTPKCALSKATVYKTPLGDLPIDLDVNDELRASGKFEDM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVF+G  +KIVPILVGA++ E+EAMYGRL AKYVDDP NFF
Sbjct: 121 DLQVDEAEHSMEMHLPYLAKVFQGWPVKIVPILVGALSPESEAMYGRLLAKYVDDPRNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRFNY++YD+K G I+KSIE LD+ GMDIIE+GD D+FK YLLEYDNTIC
Sbjct: 181 SVSSDFCHWGSRFNYVYYDRKQGAIYKSIETLDRKGMDIIESGDADSFKDYLLEYDNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPISVFLHML NC  ++KI FL+YEQSSQC++ RDSSVSYASA AK +A
Sbjct: 241 GRHPISVFLHMLKNCGVRVKISFLQYEQSSQCRSMRDSSVSYASAVAKAEA 291


>gi|18400785|ref|NP_565590.1| uncharacterized protein [Arabidopsis thaliana]
 gi|4567249|gb|AAD23663.1| expressed protein [Arabidopsis thaliana]
 gi|16604563|gb|AAL24083.1| unknown protein [Arabidopsis thaliana]
 gi|20259161|gb|AAM14296.1| unknown protein [Arabidopsis thaliana]
 gi|21554336|gb|AAM63443.1| unknown [Arabidopsis thaliana]
 gi|330252585|gb|AEC07679.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 291

 Score =  486 bits (1250), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 233/291 (80%), Positives = 264/291 (90%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           MEKIR+ +HAGSWYTDN  +L+ +L+ WL   GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1   MEKIRQPTHAGSWYTDNPTKLSSDLEEWLNATGLTKSPDVRGVIAPHAGYSYSGRAAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           F NIDPT+ISR+FLLGPSHH+YTPKCALSTATVYKTPIG+LP+D+E+I+E++A GKF +M
Sbjct: 61  FANIDPTNISRIFLLGPSHHFYTPKCALSTATVYKTPIGNLPVDVEMIKEIRAMGKFGMM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVFEG+ +K+VPILVGAV+ ENEAMYG L AKYVDDP NFF
Sbjct: 121 DLRVDEAEHSMEMHLPYLAKVFEGNNVKVVPILVGAVSPENEAMYGELLAKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRFNYMHYD  HG IHKSIEALDK GMDIIETGDPDAFKKYLLE++NTIC
Sbjct: 181 SVSSDFCHWGSRFNYMHYDNTHGAIHKSIEALDKKGMDIIETGDPDAFKKYLLEFENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           GRHPIS+FLHML + S+KIKI FLRYEQSSQC+T RDSSVSYASAAAK++ 
Sbjct: 241 GRHPISIFLHMLKHSSSKIKINFLRYEQSSQCQTMRDSSVSYASAAAKLET 291


>gi|226492415|ref|NP_001141056.1| uncharacterized protein LOC100273137 [Zea mays]
 gi|194702440|gb|ACF85304.1| unknown [Zea mays]
 gi|195654227|gb|ACG46581.1| hypothetical protein [Zea mays]
 gi|413951291|gb|AFW83940.1| hypothetical protein ZEAMMB73_880994 [Zea mays]
          Length = 300

 Score =  482 bits (1241), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 221/277 (79%), Positives = 253/277 (91%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME+ RRASHAGSWYT+++++L EEL+GWL  AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1   MERARRASHAGSWYTNDARKLEEELNGWLGAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGKFEFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGA++++NEA+YG+L +KYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLAKVFQGHNVKVVPILVGALSSQNEALYGQLLSKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRF+Y +Y+KKHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYEKKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
           GRHPISVFLHML +CSTKIKI F+RYEQSSQCK  RD
Sbjct: 241 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNMRD 277


>gi|219887901|gb|ACL54325.1| unknown [Zea mays]
          Length = 300

 Score =  480 bits (1236), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 220/277 (79%), Positives = 252/277 (90%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME+ RRASHAGSWYT+++++L EEL+GWL  AGL KSP+ R VIAPHAGYSYSGR AAYA
Sbjct: 1   MERARRASHAGSWYTNDARKLEEELNGWLGAAGLTKSPDARAVIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGKFEFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGA++++NEA+YG+L +KYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLAKVFQGHNVKVVPILVGALSSQNEALYGQLLSKYVDDPKNFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRF+Y +Y+KKHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYEKKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
           GRHPISVFLHML +CSTKIKI F+RYEQSSQCK  RD
Sbjct: 241 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNMRD 277


>gi|294462702|gb|ADE76896.1| unknown [Picea sitchensis]
          Length = 291

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 219/290 (75%), Positives = 254/290 (87%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME +RRA+HAGSWYT N+  L EEL+ WLR  G  KS +VRGVIAPHAGY YSGR AA+A
Sbjct: 1   MESVRRAAHAGSWYTSNASELGEELERWLRACGQSKSSDVRGVIAPHAGYRYSGRCAAFA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           F +IDP   SRVFLLGPSHHYYTPKCALS+ATVYKTP+GDLP+DLEV EELKATGKFE+M
Sbjct: 61  FAHIDPQQTSRVFLLGPSHHYYTPKCALSSATVYKTPLGDLPIDLEVYEELKATGKFEIM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           DI VDE EHSMEMHLPYLAKVF+G  +KIVPILVG+++ E+EAMYGR  +KYV+DP+NFF
Sbjct: 121 DIQVDEEEHSMEMHLPYLAKVFQGWPVKIVPILVGSLSTESEAMYGRFLSKYVNDPANFF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           S+SSDFCHWGSRFNY++YDKK G I+KSI+ALDK GMDI+ +GDP+AFK YLL+Y NTIC
Sbjct: 181 SISSDFCHWGSRFNYVYYDKKEGAIYKSIKALDKKGMDIVASGDPNAFKDYLLQYRNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           GRHPISVFLHML NCSTKIK+ F++YEQSS+ KT RDSSVSYASA A+V+
Sbjct: 241 GRHPISVFLHMLQNCSTKIKVAFVQYEQSSKSKTMRDSSVSYASAVARVE 290


>gi|125561007|gb|EAZ06455.1| hypothetical protein OsI_28693 [Oryza sativa Indica Group]
          Length = 409

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 219/275 (79%), Positives = 252/275 (91%)

Query: 17  NSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLG 76
           ++++L EELDGWLR AGL KSP+VR VIAPHAGYSYSGR AAYAFGNIDPT+ISRVFLLG
Sbjct: 128 DTRKLDEELDGWLRAAGLTKSPDVRAVIAPHAGYSYSGRCAAYAFGNIDPTNISRVFLLG 187

Query: 77  PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLP 136
           PSHHYYTPKCAL+ AT+Y TPIGDLP+D EVIEEL ATGKF+ MD+ VDEAEHSMEMHLP
Sbjct: 188 PSHHYYTPKCALTRATIYSTPIGDLPVDHEVIEELNATGKFDFMDLSVDEAEHSMEMHLP 247

Query: 137 YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM 196
           YL+KVF+GH +K+VPILVGA+N++NEAMYG+L +KY+DDP NFFS+SSDFCHWG+RF+Y 
Sbjct: 248 YLSKVFQGHNVKVVPILVGALNSQNEAMYGQLLSKYLDDPKNFFSISSDFCHWGTRFSYT 307

Query: 197 HYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS 256
           +YDK HG IHKSIEALD+MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CS
Sbjct: 308 YYDKSHGAIHKSIEALDRMGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCS 367

Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           TKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD 
Sbjct: 368 TKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 402


>gi|413951290|gb|AFW83939.1| hypothetical protein ZEAMMB73_880994 [Zea mays]
          Length = 286

 Score =  452 bits (1163), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 212/277 (76%), Positives = 240/277 (86%), Gaps = 14/277 (5%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME+ RRASHAGSWYT+++++L EEL+GWL  AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1   MERARRASHAGSWYTNDARKLEEELNGWLGAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61  FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGKFEFM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+ VDEAEHSMEMHLPYLAKVF+G              ++NEA+YG+L +KYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLAKVFQG--------------SQNEALYGQLLSKYVDDPKNFF 166

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SVSSDFCHWGSRF+Y +Y+KKHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 167 SVSSDFCHWGSRFSYTYYEKKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 226

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
           GRHPISVFLHML +CSTKIKI F+RYEQSSQCK  RD
Sbjct: 227 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNMRD 263


>gi|302785594|ref|XP_002974568.1| hypothetical protein SELMODRAFT_232363 [Selaginella moellendorffii]
 gi|300157463|gb|EFJ24088.1| hypothetical protein SELMODRAFT_232363 [Selaginella moellendorffii]
          Length = 289

 Score =  445 bits (1144), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 206/286 (72%), Positives = 245/286 (85%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+ASHAGSWY+D++  L EELDGWL+ +GLPK P+ R VIAPHAGY YSGR AA+AF NI
Sbjct: 3   RQASHAGSWYSDDALELREELDGWLKGSGLPKKPDARAVIAPHAGYRYSGRCAAFAFANI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP  +SRVF+LGPSHH+YT KCALS  +VYKTP+GDLP+DL+V  ELK TG+FE M + V
Sbjct: 63  DPQVVSRVFILGPSHHHYTRKCALSRTSVYKTPLGDLPIDLDVYRELKETGQFEEMRVDV 122

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DEAEHSMEMHLPYLAKVF G  +KIVPILVGA+++E+EA+YGRL AKY+DDPSNFFSVSS
Sbjct: 123 DEAEHSMEMHLPYLAKVFSGVPVKIVPILVGALSSESEALYGRLLAKYLDDPSNFFSVSS 182

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWGSRF Y +YDKK G I+KSIEALDK GMDIIE+ +P+AFK YL EY+NTICGRHP
Sbjct: 183 DFCHWGSRFGYTYYDKKRGPIYKSIEALDKCGMDIIESKNPEAFKSYLDEYENTICGRHP 242

Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           I V LHM+ +C+TK++++FL+YEQSS CKT RDSSVSYASA   V+
Sbjct: 243 IGVLLHMIQHCNTKLELRFLQYEQSSHCKTSRDSSVSYASAVITVE 288


>gi|302759687|ref|XP_002963266.1| hypothetical protein SELMODRAFT_438442 [Selaginella moellendorffii]
 gi|300168534|gb|EFJ35137.1| hypothetical protein SELMODRAFT_438442 [Selaginella moellendorffii]
          Length = 289

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 205/286 (71%), Positives = 244/286 (85%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+ASHAGSWY+D++  L EELDGWL+ +GLPK P+ R VIAPHAGY YSGR AA+AF NI
Sbjct: 3   RQASHAGSWYSDDALELREELDGWLKGSGLPKKPDARAVIAPHAGYRYSGRCAAFAFANI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP  +SRVF+LGPSHH+YT KCALS  +VYKTP+GDLP+DL+V  ELK TG+FE M + V
Sbjct: 63  DPQVVSRVFILGPSHHHYTRKCALSRTSVYKTPLGDLPIDLDVYRELKETGQFEEMRVDV 122

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DEAEHSMEMHLPYLAKVF G  +KIVPILVG +++E+EA+YGRL AKY+DDPSNFFSVSS
Sbjct: 123 DEAEHSMEMHLPYLAKVFSGVPVKIVPILVGTLSSESEALYGRLLAKYLDDPSNFFSVSS 182

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWGSRF Y +YDKK G I+KSIEALDK GMDIIE+ +P+AFK YL EY+NTICGRHP
Sbjct: 183 DFCHWGSRFGYTYYDKKRGPIYKSIEALDKCGMDIIESKNPEAFKSYLDEYENTICGRHP 242

Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           I V LHM+ +C+TK++++FL+YEQSS CKT RDSSVSYASA   V+
Sbjct: 243 IGVLLHMIQDCNTKLELRFLQYEQSSHCKTSRDSSVSYASAVITVE 288


>gi|168010446|ref|XP_001757915.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690792|gb|EDQ77157.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 290

 Score =  442 bits (1137), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 201/281 (71%), Positives = 236/281 (83%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A HAG+WYTD+++ L +ELD WLR +GLPK    R VIAPHAGY YSG  AAYAF NI
Sbjct: 4   REAFHAGTWYTDDAEELTQELDQWLRSSGLPKLANARAVIAPHAGYRYSGHCAAYAFANI 63

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP  + RVFLLGPSHH++T KC LS  +VYKTP+GDLP+D EV EEL+A+G+FE MD+ V
Sbjct: 64  DPQRVQRVFLLGPSHHHFTRKCCLSKVSVYKTPLGDLPIDNEVYEELRASGRFEDMDVSV 123

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DEAEHSMEMHLPY+AKVF+G  +KIVPI+VGA     EA YGRL AKY+DDP+NFFS+SS
Sbjct: 124 DEAEHSMEMHLPYIAKVFQGTSVKIVPIMVGATTPATEATYGRLLAKYIDDPANFFSISS 183

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWGSRF Y +Y+KK G I+KSIEALDK GMDIIETGDPDAF++YL E++NTICGRHP
Sbjct: 184 DFCHWGSRFGYQYYEKKRGPIYKSIEALDKCGMDIIETGDPDAFQEYLSEFENTICGRHP 243

Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           I VFLHML +CST+ +IKFL+YEQSSQCK+ RDSSVSYASA
Sbjct: 244 IGVFLHMLKHCSTRTRIKFLQYEQSSQCKSMRDSSVSYASA 284


>gi|168017790|ref|XP_001761430.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687436|gb|EDQ73819.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 290

 Score =  433 bits (1114), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 196/281 (69%), Positives = 239/281 (85%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A+HAGSWY+D+++ L +ELD WLR +GLPK  + R +IAPHAGY YSGR AAYAF NI
Sbjct: 4   REATHAGSWYSDDAEELVQELDYWLRSSGLPKLTDARAIIAPHAGYWYSGRCAAYAFANI 63

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           D   + RVFLLGPSHH++T KCALSTA+VYKTP+GDLP+D EV +EL+A+G+FE M + V
Sbjct: 64  DSQRVQRVFLLGPSHHHFTRKCALSTASVYKTPLGDLPIDHEVYDELRASGRFEDMSMSV 123

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DEAEHSMEMHLP+LAKVF+G  +KIVPI++GA   E EA YGRL AKY+DDP+NFFS+SS
Sbjct: 124 DEAEHSMEMHLPFLAKVFQGTNVKIVPIMIGACPPETEATYGRLLAKYIDDPNNFFSISS 183

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWGSRF Y +Y+KK G I+KSIEALD+ GMDIIE+GDPDAF++YL E+ NTICGRHP
Sbjct: 184 DFCHWGSRFGYQYYEKKRGPIYKSIEALDQRGMDIIESGDPDAFQEYLSEFKNTICGRHP 243

Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           I VF H+L +C+T+++IKFL+YEQSSQCK  RDSSVSYASA
Sbjct: 244 IGVFFHILRHCATRMRIKFLQYEQSSQCKNMRDSSVSYASA 284


>gi|222631062|gb|EEE63194.1| hypothetical protein OsJ_18003 [Oryza sativa Japonica Group]
          Length = 778

 Score =  348 bits (892), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 176/274 (64%), Positives = 201/274 (73%), Gaps = 52/274 (18%)

Query: 18  SKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGP 77
           +++L EELDGWLR AGL KSP+V  VIAPHAGYSYSGR AAYAFGNIDPT+ISRVFLLGP
Sbjct: 3   ARKLDEELDGWLRAAGLTKSPDVSAVIAPHAGYSYSGRCAAYAFGNIDPTNISRVFLLGP 62

Query: 78  SHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPY 137
           SHHYYTPKCAL+ AT+Y TPIGDLP+D E                               
Sbjct: 63  SHHYYTPKCALTRATIYSTPIGDLPVDHE------------------------------- 91

Query: 138 LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
                                 +NEAMYG+L ++Y+DDP NFFS+SSDFCHWG+RF+Y +
Sbjct: 92  ---------------------DQNEAMYGQLLSRYLDDPKNFFSISSDFCHWGTRFSYTY 130

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
           YDK HG IHKSIEALD MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CST
Sbjct: 131 YDKSHGAIHKSIEALDHMGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCST 190

Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           KIKI+F+RYEQSSQCK+ RDSSVSYASAAAKVD 
Sbjct: 191 KIKIRFVRYEQSSQCKSMRDSSVSYASAAAKVDT 224


>gi|384250635|gb|EIE24114.1| UPF0103-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 294

 Score =  343 bits (879), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 154/278 (55%), Positives = 203/278 (73%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RR +HAGSWY D+  RLA +++ W+     P+    R +I+PHAGYSY G   A+A+  I
Sbjct: 6   RRPAHAGSWYEDDGTRLASQIENWIGAVPTPRGAHARAIISPHAGYSYCGHVMAHAYKQI 65

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           +P  +SRVF+LGPSHHYYT +C LS A+ Y TP+G + + L++ +EL+ATG+F+++DI V
Sbjct: 66  NPERVSRVFILGPSHHYYTRRCCLSPASSYDTPLGSVSIALDIYQELQATGQFDVIDIDV 125

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DEAEHS+E+ + YLA +  G   ++VPI+VG++  E EA YGRL A Y+DDPSN F +SS
Sbjct: 126 DEAEHSLELQMGYLAHIMRGRQFQLVPIMVGSLTPEREAAYGRLLAGYLDDPSNVFIISS 185

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWGSRF+Y   D   G I+K+IE LD  GMDIIE GDP AF +YL  Y NTICGRHP
Sbjct: 186 DFCHWGSRFSYTFNDSSKGPIYKAIEWLDTQGMDIIEQGDPKAFTEYLRRYGNTICGRHP 245

Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
           ISVFL+M+ +C TK +I F  Y+QS++  T RDSSVSY
Sbjct: 246 ISVFLNMVRSCRTKFEINFENYDQSARVATFRDSSVSY 283


>gi|332027147|gb|EGI67240.1| Protein MEMO1 [Acromyrmex echinatior]
          Length = 296

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 163/293 (55%), Positives = 207/293 (70%), Gaps = 8/293 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IRRASHAGSWY+DN   L+++L+GWL  A L   P  R +IAPHAGYSY G  A +A
Sbjct: 1   MALIRRASHAGSWYSDNGSELSKQLEGWLSAADLSHGP-ARAIIAPHAGYSYCGACAGFA 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P  + R+F+LGPSHH   P CALS+A++Y+TP+ DL +D +V  EL+ TG FE M
Sbjct: 60  YRQISPVVVRRIFILGPSHHVRLPGCALSSASIYRTPLYDLLIDQQVRRELEETGHFECM 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D+  DE EHS+EM LP++AKV EG      I+PILVG+++ E EA+YGRL A Y+ DP  
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEREALYGRLLAPYMADPQT 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCHWG RF Y +YD+  G IH+SI+ LDKMGMDIIET +P  F +YL +Y NT
Sbjct: 180 LFVISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPLMFTEYLKKYGNT 239

Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ICGRHPI V L  +    GN +  ++ +KFL+Y QSSQC    DSSVSYASA+
Sbjct: 240 ICGRHPIGVLLQAIHSLKGNTNGQRMNLKFLKYAQSSQCNNMNDSSVSYASAS 292


>gi|322779500|gb|EFZ09692.1| hypothetical protein SINV_14380 [Solenopsis invicta]
          Length = 310

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 163/293 (55%), Positives = 207/293 (70%), Gaps = 8/293 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IRRASHAGSWY+DN   L+++L+GWL  A L   P  R +IAPHAGYSY G  A +A
Sbjct: 15  MALIRRASHAGSWYSDNGSELSKQLEGWLSAADLSHGP-ARAIIAPHAGYSYCGACAGFA 73

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P  + R+F+LGPSHH   P CALS+A++Y+TP+ DL +D +V  EL+ TG FE M
Sbjct: 74  YRQISPVVVRRIFILGPSHHVRLPGCALSSASIYRTPLYDLHIDQQVRRELEETGHFECM 133

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D+  DE EHS+EM LP++AKV EG      I+PILVG+++ E EA+YGRL A Y+ DP  
Sbjct: 134 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEREALYGRLLAPYMADPQT 193

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCHWG RF Y +YD+  G IH+SI+ LDKMGMDIIET +P  F +YL +Y NT
Sbjct: 194 LFIISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPPMFTEYLKKYGNT 253

Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ICGRHPI V L  +    GN +  ++ +KFL+Y QSSQC    DSSVSYASA+
Sbjct: 254 ICGRHPIGVLLQAIHSLKGNTNGQRMNLKFLKYAQSSQCNNMNDSSVSYASAS 306


>gi|125551770|gb|EAY97479.1| hypothetical protein OsI_19410 [Oryza sativa Indica Group]
          Length = 234

 Score =  340 bits (871), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 176/274 (64%), Positives = 200/274 (72%), Gaps = 52/274 (18%)

Query: 18  SKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGP 77
           +++L EELDGWLR AGL KSP+V  VIAPHAGYSYSGR AAYAFGNIDPT+ISRVFLLGP
Sbjct: 3   ARKLDEELDGWLRAAGLTKSPDVSAVIAPHAGYSYSGRCAAYAFGNIDPTNISRVFLLGP 62

Query: 78  SHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPY 137
           SHHYYTPKCAL+ AT+Y TPIGDLP+D E                               
Sbjct: 63  SHHYYTPKCALTRATIYSTPIGDLPVDHE------------------------------- 91

Query: 138 LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
                                 +NEAMYG+L ++Y+DDP NFFS+SSDFCHWG+RF+Y +
Sbjct: 92  ---------------------DQNEAMYGQLLSRYLDDPKNFFSISSDFCHWGTRFSYTY 130

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
           YDK HG IHKSIEALD MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CST
Sbjct: 131 YDKSHGAIHKSIEALDHMGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCST 190

Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           KIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD 
Sbjct: 191 KIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 224


>gi|326427143|gb|EGD72713.1| MEMO1 protein [Salpingoeca sp. ATCC 50818]
          Length = 291

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 162/290 (55%), Positives = 206/290 (71%), Gaps = 1/290 (0%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   RRA+HAGSWYTD+ K L+ +LD WL + G    P  R +IAPHAGYSYSG  AAYA
Sbjct: 1   MSAHRRATHAGSWYTDDGKELSGQLDAWLGKVGEAIMPGARAIIAPHAGYSYSGPTAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P ++ RVF+LGPSHH Y   CAL++ T Y TPIG+L +D +  ++L+ATG+FE M
Sbjct: 61  YKQIVPDNVRRVFVLGPSHHVYIQGCALTSTTTYDTPIGNLKIDRDTNKDLEATGQFEHM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           DI  DE EHS+EMHLPY+AKV EG    +VP+LVGA++++ E  YG++FA Y++DPSN F
Sbjct: 121 DIQTDEDEHSIEMHLPYVAKVMEGKDFTVVPVLVGALDSKLEQEYGKIFAPYLEDPSNLF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDFCHWG RF Y +    H  I+K IE LDK GMDIIE  D   F+ YL E +NTIC
Sbjct: 181 VISSDFCHWGRRFGYTYVPPGHDAIYKGIEHLDKQGMDIIEKQDAKGFRAYLQECENTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           GR+PIS+ L+ L     + K +KFL Y QS+ CK+ RDSSVSYASAA  +
Sbjct: 241 GRNPISILLNALQVLKDRQKEVKFLHYAQSNPCKSTRDSSVSYASAAVLI 290


>gi|307166154|gb|EFN60403.1| Protein MEMO1 [Camponotus floridanus]
          Length = 296

 Score =  337 bits (864), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 161/293 (54%), Positives = 206/293 (70%), Gaps = 8/293 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IRRASHAGSWY+DN   L+++L+GWL  A L   P  R +IAPHAGYSY G  A +A
Sbjct: 1   MALIRRASHAGSWYSDNGPELSKQLEGWLGAADLSHGP-ARAIIAPHAGYSYCGACAGFA 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P  + R+F+LGPSHH   P CALS+ ++Y+TP+ DL +D +V  EL+ TG FE M
Sbjct: 60  YRQISPVVVRRIFILGPSHHVRLPGCALSSTSIYRTPLYDLHIDQQVRRELEETGHFECM 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D+  DE EHS+EM LP++AKV EG      I+PILVG+++ + EA+YGRL A Y+ DP  
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPDREALYGRLLAPYMADPQT 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCHWG RF Y +YD+  G IH+SI+ LDKMGMDIIET +P  F +YL +Y NT
Sbjct: 180 LFVISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPAMFTEYLKKYGNT 239

Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ICGRHPI V L  +    GN +  ++ +KFL+Y QSSQC    DSSVSYASA+
Sbjct: 240 ICGRHPIGVLLQAIHSLKGNTNGQRMNLKFLKYAQSSQCNNMNDSSVSYASAS 292


>gi|346469415|gb|AEO34552.1| hypothetical protein [Amblyomma maculatum]
          Length = 298

 Score =  336 bits (861), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 162/292 (55%), Positives = 202/292 (69%), Gaps = 5/292 (1%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ASHAGSWYTD+++ L  +LD WL   G P     R VIAPHAGY Y G  AA+A+  
Sbjct: 7   VRKASHAGSWYTDSARELRYQLDNWLSAVGPPTFGPARAVIAPHAGYQYCGACAAHAYKQ 66

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +DP+ + RVF+LGPSHH     C LS A  Y+TP  DL +D EV EEL  TG FE + I 
Sbjct: 67  VDPSVVRRVFILGPSHHARLGGCGLSPAKQYRTPFYDLTIDQEVYEELYETGAFEEVSIH 126

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
           VDE EHS+EMHLPY+AKV E     IVPI+VG+++ ENEA YGRL +KY+ D  N F +S
Sbjct: 127 VDENEHSLEMHLPYIAKVMENQEFTIVPIIVGSLSPENEAFYGRLLSKYLADADNLFVIS 186

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
           SDFCHWG+RF+Y +YDK  G IH+SIE LDK GM IIE   P AF  YL +Y NTICGRH
Sbjct: 187 SDFCHWGARFHYQYYDKSWGNIHQSIEKLDKQGMSIIEELSPTAFTAYLKKYGNTICGRH 246

Query: 244 PISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           PI V L+ +    +     ++ +KFL+Y QSSQC +  DSSVSYASAA +++
Sbjct: 247 PIGVLLNAVDTLQSSGNGHRMVLKFLKYAQSSQCMSMGDSSVSYASAALRLE 298


>gi|427788107|gb|JAA59505.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 298

 Score =  336 bits (861), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 160/292 (54%), Positives = 202/292 (69%), Gaps = 5/292 (1%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ASHAGSWYTD+++ L  +L+ WL   G P     R +IAPHAGY Y G  AA+A+  
Sbjct: 7   VRKASHAGSWYTDSARELRYQLENWLSAVGPPTFGPARAIIAPHAGYQYCGACAAHAYKQ 66

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +DPT++ RVF+LGPSHH     C LS A  Y+TP+ DL ++ EV EEL  TG FE + I 
Sbjct: 67  VDPTAVRRVFILGPSHHARIGGCGLSPAKTYRTPLYDLTINQEVYEELYETGAFEEVSIH 126

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
           VDE EHS+EMHLPY+AKV E     IVPI+VG+++ ENEA YGRL +KY+ D  N F +S
Sbjct: 127 VDENEHSLEMHLPYIAKVMENQDFTIVPIIVGSLSPENEAFYGRLLSKYLADADNLFVIS 186

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
           SDFCHWG+RF+Y  YDK  G IH+SIE LDK GM IIE   P AF  YL +Y NTICGRH
Sbjct: 187 SDFCHWGARFHYQFYDKSWGNIHQSIEKLDKQGMSIIEELSPTAFTAYLKKYGNTICGRH 246

Query: 244 PISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           PI V L+ +          ++ +KFL+Y QSSQC +  DSSVSYASAA +++
Sbjct: 247 PIGVLLNAVDTLQNSGNGHRMALKFLKYAQSSQCMSMSDSSVSYASAALRLE 298


>gi|302853948|ref|XP_002958486.1| hypothetical protein VOLCADRAFT_99750 [Volvox carteri f.
           nagariensis]
 gi|300256214|gb|EFJ40486.1| hypothetical protein VOLCADRAFT_99750 [Volvox carteri f.
           nagariensis]
          Length = 293

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 156/291 (53%), Positives = 205/291 (70%), Gaps = 4/291 (1%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAA 58
           M KIRR SHA SWY  ++  L  E+D W  +A +   P V  R +I PHAGYSY G   A
Sbjct: 1   MPKIRRPSHASSWYQGDADALNAEIDNW--KAAVQTEPRVSPRAIIGPHAGYSYCGHVMA 58

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           YA+ +ID    +RVFLLGPSHH +T KC LS+ ++Y TP+G + +D EV  +LKATG+FE
Sbjct: 59  YAYKHIDANKFTRVFLLGPSHHVFTRKCMLSSQSLYDTPLGSMEIDQEVYGQLKATGQFE 118

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           +M   VDEAEHS+E+H PY+     G   K+VPI+VGA+ AE EA+YG+L   Y+DDPSN
Sbjct: 119 VMSRDVDEAEHSLELHTPYIVHTMRGQSYKLVPIMVGALTAEGEALYGKLLGPYLDDPSN 178

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F VSSDFCHWGSRF+Y +Y+++ G I +SI+ LD++G+  IE G+P  F  YL +Y NT
Sbjct: 179 LFVVSSDFCHWGSRFSYTYYNREQGQIWQSIKWLDELGIQAIEGGNPADFTAYLEQYRNT 238

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           ICGRHPI VFL+ML +   + KIKF +Y+QSS+C ++  SSVSYA+A   V
Sbjct: 239 ICGRHPIGVFLNMLQHSKLRHKIKFTKYDQSSKCTSQSHSSVSYAAAIVTV 289


>gi|260837153|ref|XP_002613570.1| hypothetical protein BRAFLDRAFT_114006 [Branchiostoma floridae]
 gi|229298955|gb|EEN69579.1| hypothetical protein BRAFLDRAFT_114006 [Branchiostoma floridae]
          Length = 294

 Score =  330 bits (846), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 155/292 (53%), Positives = 206/292 (70%), Gaps = 7/292 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRA+HAGSWYT + + L ++L+GWL  A    SP  R +IAPHAGY+Y G   A+A+  
Sbjct: 3   VRRATHAGSWYTSSVQDLNDQLEGWLAAAQPTLSP-ARAIIAPHAGYAYCGACGAHAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +DP +I RVF+LGPSHH     CA++   VY TP+ DL +D  + +EL  TG+FE+MDI 
Sbjct: 62  VDPRNIKRVFILGPSHHVRLSGCAVTATQVYHTPLYDLTIDTAINQELLQTGQFEVMDIN 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            DE EHS+EMHLPY+AKV E       +VP+LVG ++ + EA YGRLF+K++ DP N F 
Sbjct: 122 TDEDEHSIEMHLPYVAKVMESKKGQFTVVPVLVGGLSVDQEAKYGRLFSKFLADPQNLFV 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF Y++YD+  G I++SIEALD+MGMDIIE  +P  F  YL +Y+NTICG
Sbjct: 182 ISSDFCHWGQRFKYVYYDRSQGDIYQSIEALDRMGMDIIEHMNPAEFTSYLRKYENTICG 241

Query: 242 RHPISVFLHMLGNCST----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           RHPI+V L+ +    T     + ++FL Y QSS+C+ K DSSVSYA+AA  V
Sbjct: 242 RHPIAVLLNAVKYLKTANSHTLSLRFLNYAQSSRCRRKEDSSVSYAAAALVV 293


>gi|289741463|gb|ADD19479.1| putative dioxygenase [Glossina morsitans morsitans]
          Length = 294

 Score =  329 bits (843), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 159/287 (55%), Positives = 199/287 (69%), Gaps = 6/287 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+ASHAGSWYT ++  L  +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  +
Sbjct: 4   RKASHAGSWYTSSANELRRQLDNWLASACLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  Y+TP+ DL +D+EV  EL+ TG F  MD+  
Sbjct: 63  SPAVVKRIFILGPSHHVRLRGCALSVAKKYETPLYDLKIDIEVNAELEKTGHFSWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E   +   IVP+LVG++N + EA+YGRLF KY  +P N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEEFKNQFSIVPVLVGSLNPDQEALYGRLFTKYFLEPENLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RFNY +YDK  G I+KSIE+LDK GM+IIET DP +F  YL EY+NTICGR
Sbjct: 183 SSDFCHWGQRFNYTYYDKSSGPIYKSIESLDKRGMEIIETLDPASFTSYLREYNNTICGR 242

Query: 243 HPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L+ + +   K      KFL+Y QSSQCK   DSSVSYA+ +
Sbjct: 243 HPIGVLLNAVKSLQEKGCNTNFKFLKYAQSSQCKHMDDSSVSYAAGS 289


>gi|383852946|ref|XP_003701986.1| PREDICTED: protein MEMO1-like [Megachile rotundata]
          Length = 296

 Score =  328 bits (842), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 157/297 (52%), Positives = 203/297 (68%), Gaps = 8/297 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IRRA+HAGSWY  +   L ++L+GWL  A L   P  R +I+PHAGYSY G  A +A
Sbjct: 1   MALIRRATHAGSWYAGSGLELNKQLEGWLGAADLSHGP-ARAIISPHAGYSYCGACAGFA 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P  + R+F+LGPSHH     CALS+A++Y+TP+ DL +D +V  EL+ +G FE M
Sbjct: 60  YRQISPVVVRRIFILGPSHHVRLAGCALSSASIYQTPLYDLHVDQQVCRELEESGHFEWM 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D+  DE EHS+EM LP++AKV EG      I+PILVG+++ E EA YGRL A Y+ DP  
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEREAFYGRLLAPYMADPQT 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCHWG RF Y +YD+  G IH+SI+ LDKMGMDIIET +P  F  YL +Y NT
Sbjct: 180 LFVISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPAMFTDYLKKYGNT 239

Query: 239 ICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           ICGRHPISV L ++ N        ++ +KFL+Y QS+QC    DSSVSYASA+  V+
Sbjct: 240 ICGRHPISVLLQIIHNLKGNTNGQRMNLKFLKYAQSNQCNNMSDSSVSYASASLVVE 296


>gi|307106589|gb|EFN54834.1| hypothetical protein CHLNCDRAFT_48888 [Chlorella variabilis]
          Length = 318

 Score =  328 bits (840), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 154/311 (49%), Positives = 200/311 (64%), Gaps = 26/311 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M +IR  +HAG+WY++ + +L ++L  WL EA   +    R +IAPHAGY YSG  AAYA
Sbjct: 1   MPRIRATTHAGTWYSNQAGQLQQQLKTWLSEAEAVEGQHARAIIAPHAGYRYSGHVAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  IDPT + RVFLLGPSHH+Y+  C LS A  Y TP+G   +D EV  EL+ATGKF+ +
Sbjct: 61  YKQIDPTQVRRVFLLGPSHHFYSKHCLLSPADAYATPLGSATIDAEVYAELRATGKFQEL 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
               DEAEHS+E+HLPY+  +  G    +VPI+VGA++AE+EA YGRL A Y+DD SN F
Sbjct: 121 KAAADEAEHSLELHLPYIVHMMAGRPFTLVPIVVGAISAESEAAYGRLLAPYLDDSSNLF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKH--------------------------GVIHKSIEALDK 214
            VSSDFCHWG RF+Y  YD                             G IH SIE LD+
Sbjct: 181 IVSSDFCHWGKRFSYTFYDPAQASAAGGGGQGRPGRGQQQQLSGGALLGPIHASIERLDR 240

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
            GM +IE  D   F  YL ++ NTICGRHP ++ LH L +C+TK  ++F +Y+QS +C T
Sbjct: 241 QGMALIEAQDAAGFTAYLKQHGNTICGRHPTALLLHALRHCATKHSVRFTKYDQSHRCTT 300

Query: 275 KRDSSVSYASA 285
           ++DSSVSYASA
Sbjct: 301 EQDSSVSYASA 311


>gi|195108689|ref|XP_001998925.1| GI24231 [Drosophila mojavensis]
 gi|193915519|gb|EDW14386.1| GI24231 [Drosophila mojavensis]
          Length = 295

 Score =  328 bits (840), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 161/289 (55%), Positives = 195/289 (67%), Gaps = 7/289 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRA+HAGSWYTD++  L+ +LD WL  A L   P  R +IAPHAGY+Y G   AYA+  
Sbjct: 3   VRRATHAGSWYTDSASELSRQLDRWLGAADLSHGP-ARAIIAPHAGYAYCGACGAYAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P  + R+F+LGPSHH     CALS    YKTP+ DL +D EV  EL+ TG+F  MDI 
Sbjct: 62  VSPVVVKRIFILGPSHHVRLRSCALSVTKKYKTPLYDLKIDTEVNAELENTGQFTWMDIK 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            DE EHS+EMHLPY+AKV E       IVPILVG++N E EA YG L AKY  DP+N F 
Sbjct: 122 TDEDEHSIEMHLPYIAKVMEDFKDQFTIVPILVGSLNPEQEAQYGSLLAKYFMDPTNLFV 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF+Y +YD+  G IHK IE LDK GMDIIET  P AF +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGQRFSYTYYDRSCGPIHKCIEQLDKQGMDIIETLSPTAFTEYLRKYNNTICG 241

Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           RHPI V L  +     +    +  KFL+Y QSSQC    DSSVSYAS +
Sbjct: 242 RHPIGVMLGAVKALQDQGFKNMSFKFLKYAQSSQCLDMEDSSVSYASGS 290


>gi|195446372|ref|XP_002070750.1| GK10855 [Drosophila willistoni]
 gi|194166835|gb|EDW81736.1| GK10855 [Drosophila willistoni]
          Length = 295

 Score =  327 bits (839), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 159/288 (55%), Positives = 197/288 (68%), Gaps = 7/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+S  L+ +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  +
Sbjct: 4   RRATHAGSWYTDSSSELSRQLDRWLSAAELSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  YKTP+ DL +D ++  EL+ TG F  MD+  
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVAKKYKTPLYDLKIDTQINAELEKTGHFTWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSSYLMDPTNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF+Y +YD+  G IHKSIE LDK GMDIIET  P AF +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGPIHKSIEQLDKQGMDIIETLTPSAFTEYLRKYNNTICGR 242

Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L  +         K+  KFL+Y QSSQC+   DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQEQGYNKMSFKFLKYAQSSQCQDMDDSSVSYASGS 290


>gi|390332252|ref|XP_791245.2| PREDICTED: protein MEMO1-like [Strongylocentrotus purpuratus]
          Length = 299

 Score =  327 bits (837), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 156/296 (52%), Positives = 207/296 (69%), Gaps = 11/296 (3%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  +RRA+H+G+WY+ + K L+ EL+GWL  A    SP  R +IAPHAGYSY G   A+A
Sbjct: 1   MSTMRRATHSGTWYSSSGKTLSGELEGWLSNAERSGSP-ARAIIAPHAGYSYCGACGAHA 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  ++P +I RVF+LGPSHH Y P CAL+T T Y+TP+ DL +D +V ++L ++  FE M
Sbjct: 60  YKQVNPANIERVFILGPSHHAYLPGCALTTCTKYQTPLYDLDVDQKVNQDLMSSKMFETM 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            + VDE EHS+EMHLPY+AKV E    K  IVP+LVGA++   E  +GRLF+ Y+ DP N
Sbjct: 120 SVQVDEDEHSIEMHLPYIAKVMESKRGKFTIVPVLVGALDINKEQNFGRLFSTYLADPRN 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F VSSDFCHWG RF + +YDKK+  I++SIEA+D+MGMD+IE  DP  F  YL +Y NT
Sbjct: 180 LFVVSSDFCHWGKRFRFTYYDKKYPKIYQSIEAVDRMGMDLIEQLDPQGFAAYLKKYSNT 239

Query: 239 ICGRHPISVFLHML--------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ICGRHPI+V L+ +               ++KFL+Y QS+QCK + DSSVSYA+AA
Sbjct: 240 ICGRHPIAVLLNAITIAISERQNGSGLHYEMKFLQYAQSNQCKLQSDSSVSYAAAA 295


>gi|195500657|ref|XP_002097466.1| GE24472 [Drosophila yakuba]
 gi|194183567|gb|EDW97178.1| GE24472 [Drosophila yakuba]
          Length = 295

 Score =  326 bits (835), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 156/288 (54%), Positives = 198/288 (68%), Gaps = 7/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  +
Sbjct: 4   RRATHAGSWYTDSGAELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  Y+TP+ DL +D ++  EL+ TG+F  MD+  
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDTQINAELEKTGQFSWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSSYLMDPTNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF+Y +YD+  G IHKSIE LDK GMDIIET  P +F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGAIHKSIEKLDKQGMDIIETLSPHSFTEYLRKYNNTICGR 242

Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L  +         K+  KFL+Y+QSSQC+   DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYDQSSQCQDMEDSSVSYASGS 290


>gi|125774497|ref|XP_001358507.1| GA20778 [Drosophila pseudoobscura pseudoobscura]
 gi|54638244|gb|EAL27646.1| GA20778 [Drosophila pseudoobscura pseudoobscura]
          Length = 295

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 158/288 (54%), Positives = 196/288 (68%), Gaps = 7/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD    L+ +LD WL  A L   P  R +IAPHAGY+Y G  +A+A+  I
Sbjct: 4   RRATHAGSWYTDAGSELSRQLDRWLGAAELSHGP-ARAIIAPHAGYTYCGACSAFAYRQI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  YKTP+ DL +D+++  EL+ TG+F  MD+  
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVAKKYKTPLYDLKIDIQINAELEKTGQFSWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG L + Y  DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSTYFMDPTNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF+Y +YD+  G IHKSIE LDK GMD+IET  P AF +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGQIHKSIEKLDKQGMDLIETLSPTAFTEYLRKYNNTICGR 242

Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L  +         K+  KFL+Y QSSQC    DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLQYAQSSQCLDMEDSSVSYASGS 290


>gi|21357419|ref|NP_650252.1| CG8031 [Drosophila melanogaster]
 gi|7299707|gb|AAF54889.1| CG8031 [Drosophila melanogaster]
 gi|16769746|gb|AAL29092.1| LP04475p [Drosophila melanogaster]
 gi|220944484|gb|ACL84785.1| CG8031-PA [synthetic construct]
 gi|220954280|gb|ACL89683.1| CG8031-PA [synthetic construct]
          Length = 295

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 157/288 (54%), Positives = 198/288 (68%), Gaps = 7/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 4   RRATHAGSWYTDSGAELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACAAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  Y+TP+ DL +D ++  EL+ TGKF  MD+  
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDAQINSELEKTGKFSWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSSYLMDPTNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF+Y +YD   G IHKSIE LDK GMDIIE+ +P +F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGHRFSYTYYDSSCGAIHKSIEKLDKQGMDIIESLNPHSFTEYLRKYNNTICGR 242

Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L  +         K+  KFL+Y QSSQC+   DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDIEDSSVSYASGS 290


>gi|194901576|ref|XP_001980328.1| GG17083 [Drosophila erecta]
 gi|190652031|gb|EDV49286.1| GG17083 [Drosophila erecta]
          Length = 295

 Score =  324 bits (830), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 156/288 (54%), Positives = 197/288 (68%), Gaps = 7/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  +
Sbjct: 4   RRATHAGSWYTDSGTELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  Y+TP+ DL +D ++  EL+ TG+F  MD+  
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDSQINAELEKTGQFSWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSPYLMDPTNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF+Y +YD+  G IHKSIE LDK GMDIIET  P +F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGAIHKSIEKLDKQGMDIIETLSPHSFTEYLRKYNNTICGR 242

Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L  +         K+  KFL+Y QSSQC+   DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDMEDSSVSYASGS 290


>gi|259155194|ref|NP_001158838.1| protein MEMO1 [Salmo salar]
 gi|223647658|gb|ACN10587.1| MEMO1 [Salmo salar]
          Length = 297

 Score =  324 bits (830), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 160/290 (55%), Positives = 199/290 (68%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +A    SP  R +IAPHAGYSY G  AA+A+  I
Sbjct: 8   REASHAGSWYTASGSQLNTQLEGWLSQAQSTVSP-ARAIIAPHAGYSYCGACAAHAYKQI 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   RVF+LGPSHH    +CALS+A +YKTP+ DL +D +V  +L  TG FE M +  
Sbjct: 67  DPSVTRRVFILGPSHHVPLSRCALSSAEIYKTPLYDLRIDQKVYADLWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H   + IVP+LVGA++   E  YG+L ++Y+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDELSIVPVLVGALSESKEQDYGKLLSRYLADPSNLFII 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G IH+SIE LDKMGM IIET DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFCYTYYDEAQGEIHRSIEHLDKMGMGIIETLDPMSFTNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L    T++   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAVSELRKSGTEMNFSFLNYAQSSQCRNWQDSSVSYAAGALIV 296


>gi|195037481|ref|XP_001990189.1| GH19198 [Drosophila grimshawi]
 gi|193894385|gb|EDV93251.1| GH19198 [Drosophila grimshawi]
          Length = 295

 Score =  323 bits (829), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 159/292 (54%), Positives = 194/292 (66%), Gaps = 7/292 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  +
Sbjct: 4   RRATHAGSWYTDSGSDLSRQLDRWLGAADLSHGP-ARAIIAPHAGYAYCGACGAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS    YKTP+ DL +D EV  EL+ TG+F  MDI  
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVTKKYKTPLYDLKIDTEVNAELEKTGQFTWMDIKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E       IVPILVG++N E EA YG L A Y  DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDFKDQFTIVPILVGSLNPEQEAQYGSLLATYFMDPTNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF+Y +YD+  G IHK IE LDK GMDIIET  P AF +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSSGPIHKCIEQLDKQGMDIIETLSPTAFTEYLRKYNNTICGR 242

Query: 243 HPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           HPI V L  +     +    +  KFL+Y QSSQC    DSSVSYAS +  ++
Sbjct: 243 HPIGVMLSAIKALQDQGFKNMSFKFLKYAQSSQCLDMEDSSVSYASGSLIIE 294


>gi|194743160|ref|XP_001954068.1| GF18089 [Drosophila ananassae]
 gi|190627105|gb|EDV42629.1| GF18089 [Drosophila ananassae]
          Length = 295

 Score =  322 bits (824), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 155/288 (53%), Positives = 195/288 (67%), Gaps = 7/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  +
Sbjct: 4   RRATHAGSWYTDSGSELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  Y+TP+ DL +D ++  EL+ TG+F  MD+  
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDTQINAELEKTGQFSWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E +     IVPILVG++N + EA YG L + Y  DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPDQEAQYGSLLSTYFMDPTNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF+Y +YD+  G IHKSIE LDK GMDIIE+  P AF +YL +Y NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGPIHKSIEKLDKQGMDIIESLSPQAFTEYLRKYSNTICGR 242

Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L  +         K+  KFL+Y QSSQC+   DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDMEDSSVSYASGS 290


>gi|340721535|ref|XP_003399175.1| PREDICTED: protein MEMO1-like [Bombus terrestris]
 gi|350407005|ref|XP_003487952.1| PREDICTED: protein MEMO1-like [Bombus impatiens]
          Length = 296

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 156/293 (53%), Positives = 201/293 (68%), Gaps = 8/293 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IRRA+HAGSWY  +   L ++L+GWL  A L   P  R +IAPHAGYSY G  A +A
Sbjct: 1   MALIRRATHAGSWYAGSGLDLNKQLEGWLGAADLSHGP-ARAIIAPHAGYSYCGACAGFA 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P  + R+F+LGPSHH     CALS+A++Y+TP+ DL +D +V  EL+ T  FE M
Sbjct: 60  YRQISPVVVRRIFILGPSHHVRLAGCALSSASIYQTPLYDLHIDQQVYRELEETRHFEWM 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D+  DE EHS+EM LP++AKV EG      I+PILVG+++ E EA+YGRL A Y+ DP  
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEKEALYGRLLAPYMADPQT 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              +SSDFCHWG RF Y +YD+  G I++SI+ LDKMGMDIIET +P  F  YL +Y NT
Sbjct: 180 LLVISSDFCHWGQRFRYTYYDRSCGPIYRSIQNLDKMGMDIIETLNPTMFTDYLKKYGNT 239

Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ICGRHPISV L  +    GN +  ++ +KFL+Y QS+QC    DSSVSYASA+
Sbjct: 240 ICGRHPISVLLQAIHTLKGNTNGQRMNLKFLKYAQSNQCNNMNDSSVSYASAS 292


>gi|52345758|ref|NP_001004925.1| protein MEMO1 [Xenopus (Silurana) tropicalis]
 gi|82183493|sp|Q6DJ03.1|MEMO1_XENTR RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
           cell motility 1; Short=Memo-1
 gi|49522376|gb|AAH75382.1| MGC89105 protein [Xenopus (Silurana) tropicalis]
 gi|89269896|emb|CAJ83509.1| novel protein orthologous to C2orf4 [Xenopus (Silurana) tropicalis]
          Length = 297

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 155/290 (53%), Positives = 197/290 (67%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L+ +LDGWL +A   K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGSQLSAQLDGWLSQAQTSKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   RVF+LGPSHH    +CALST  +Y+TP+ DL +D +V  +L  TG FE M +  
Sbjct: 67  DPSVTRRVFILGPSHHVALSRCALSTVDIYRTPLYDLHIDQKVYGDLWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H   + IVP+LVGA++   E  +G+LF+KY+ DP+N F +
Sbjct: 127 DEDEHSIEMHLPYTAKTMESHKDDLTIVPVLVGALSESKEQEFGKLFSKYLADPTNLFVI 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP  F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIENLDKMGMGIIEQLDPVQFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAATELQKNGVNMSFSFLNYAQSSQCRSWQDSSVSYAAGALVV 296


>gi|242019097|ref|XP_002430002.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515060|gb|EEB17264.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 296

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 161/295 (54%), Positives = 197/295 (66%), Gaps = 11/295 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HA SWY+D++K L  +L  WL +A L   P  R +IAPHAGY Y G  AA+A+  I
Sbjct: 4   RRATHANSWYSDSAKELERQLQTWLGQAELTHGP-ARAIIAPHAGYQYCGSCAAFAYRQI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     C LS A   +TP  DL +D +V +EL ATG+F +M I +
Sbjct: 63  SPAVVKRIFILGPSHHVRFSGCYLSPALKCETPFYDLTVDQQVHKELMATGEFNVMGISI 122

Query: 125 DEAEHSMEMHLPYLAKV---FEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           DEAEHS+EMHLPY+AKV   F+ +   IVPILVG+ NA+ EA+YG + A Y+ DP N F 
Sbjct: 123 DEAEHSIEMHLPYIAKVMAEFKDYF-TIVPILVGSTNAKEEALYGSILAPYLADPQNLFI 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           VSSDFCHWG RF Y  YDK  G IH+SI+ LDKMGMDIIE  +P  F  YL +Y NTICG
Sbjct: 182 VSSDFCHWGQRFRYTFYDKSWGEIHQSIQTLDKMGMDIIEKLNPTEFTDYLKKYGNTICG 241

Query: 242 RHPISVFLHMLG------NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           RHPI V L  +       N   KI +KFL+Y QSSQC    DSSVSYASAA   +
Sbjct: 242 RHPIGVLLQAIAYLKNSSNFGNKINLKFLKYVQSSQCCRMSDSSVSYASAAVTFE 296


>gi|156551830|ref|XP_001604210.1| PREDICTED: protein MEMO1-like [Nasonia vitripennis]
          Length = 296

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 155/297 (52%), Positives = 199/297 (67%), Gaps = 8/297 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   RRASH+GSWY++N   L ++L  WL  A L   P  R +I+PHAGYSY G  AA+A
Sbjct: 1   MALARRASHSGSWYSENGFELNKQLGDWLNAADLSHGP-ARAIISPHAGYSYCGACAAFA 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P  + ++F+LGPSHH   P CAL+ A VYKTP+ DL +D  + +EL  TG FE M
Sbjct: 60  YRQISPVVVRKIFILGPSHHVRLPGCALTQAQVYKTPLYDLHVDQRINKELADTGHFEWM 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D+  DE EHS+EM LPY+AKV E       ++PILVG+++ E EAMYGRL A Y+ DP  
Sbjct: 120 DMDTDEDEHSIEMQLPYIAKVMEDFKDSFTVIPILVGSLSPEREAMYGRLLAPYMADPQT 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCHWG RF Y +YD+  G IHKSI+ LD+MGM+IIE   P +F +YL +Y NT
Sbjct: 180 LFIISSDFCHWGQRFRYTYYDRSCGAIHKSIKNLDQMGMNIIENMTPSSFTEYLRKYGNT 239

Query: 239 ICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           ICGRHPI V L  + +        ++ +KFL+Y QSSQC    DSSVSYASA+  V+
Sbjct: 240 ICGRHPIGVLLQAIQSLKETTNGQRMTLKFLKYAQSSQCNNMNDSSVSYASASMVVE 296


>gi|148223639|ref|NP_001085839.1| protein MEMO1 [Xenopus laevis]
 gi|82184294|sp|Q6GNT9.1|MEMO1_XENLA RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
           cell motility 1; Short=Memo-1
 gi|49116700|gb|AAH73413.1| MGC80879 protein [Xenopus laevis]
          Length = 297

 Score =  321 bits (822), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 154/290 (53%), Positives = 197/290 (67%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWY+ +  +L+ +LDGWL +A   K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYSASGSQLSAQLDGWLSQAQTSKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   RVF+LGPSHH    +CALST  +Y+TP+ DL +D +V  EL  TG FE M +  
Sbjct: 67  DPSVTRRVFILGPSHHVALSRCALSTVDIYRTPLYDLHVDQKVYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H   + IVP+LVGA++   E  +G++F+KY+ DP+N F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDDLTIVPVLVGALSESKEQEFGKVFSKYLADPTNLFVI 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP  F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIENLDKMGMSIIEQLDPVQFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAATELQKNGVNMSFSFLNYAQSSQCRSWQDSSVSYAAGALVV 296


>gi|195571271|ref|XP_002103627.1| GD20526 [Drosophila simulans]
 gi|194199554|gb|EDX13130.1| GD20526 [Drosophila simulans]
          Length = 295

 Score =  320 bits (820), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 157/289 (54%), Positives = 197/289 (68%), Gaps = 7/289 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G  AA+A+  
Sbjct: 3   VRRATHAGSWYTDSGAELSGQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACAAFAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P  + R+F+LGPSHH     CALS A  YKTP+ DL +D ++  EL+ TG+F  M + 
Sbjct: 62  VSPVVVKRIFILGPSHHVRLRVCALSAAKKYKTPLYDLKIDSQINAELEKTGEFTWMSMK 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG L A Y+ DP+N F 
Sbjct: 122 TDEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLAPYLMDPTNLFV 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF+Y +YD   G IHKSIE LDK GMDIIE+  P +F +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGHRFSYTYYDSTCGAIHKSIEKLDKQGMDIIESLSPHSFTEYLRKYNNTICG 241

Query: 242 RHPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           RHPI V L  +         K+  KFL+Y QSSQC+   DSSVSYAS +
Sbjct: 242 RHPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDIDDSSVSYASGS 290


>gi|195329376|ref|XP_002031387.1| GM25966 [Drosophila sechellia]
 gi|194120330|gb|EDW42373.1| GM25966 [Drosophila sechellia]
          Length = 295

 Score =  319 bits (818), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 157/289 (54%), Positives = 197/289 (68%), Gaps = 7/289 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G  AA+A+  
Sbjct: 3   VRRATHAGSWYTDSGAELSGQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACAAFAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P  + R+F+LGPSHH     CALS A  YKTP+ DL +D ++  EL+ TG+F  M + 
Sbjct: 62  VSPVVVKRIFILGPSHHVRLRGCALSGAKKYKTPLYDLKIDSQINAELEKTGEFTWMSLK 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG L A Y+ DP+N F 
Sbjct: 122 TDEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLAPYLMDPTNLFV 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF+Y +YD   G IHKSIE LDK GMDIIE+  P +F +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGHRFSYTYYDSTCGAIHKSIEKLDKQGMDIIESLSPHSFTEYLRKYNNTICG 241

Query: 242 RHPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           RHPI V L  +         K+  KFL+Y QSSQC+   DSSVSYAS +
Sbjct: 242 RHPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDIDDSSVSYASGS 290


>gi|443695939|gb|ELT96731.1| hypothetical protein CAPTEDRAFT_154277 [Capitella teleta]
          Length = 297

 Score =  319 bits (818), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 158/300 (52%), Positives = 208/300 (69%), Gaps = 15/300 (5%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAY 59
           M ++R+ASHAGSWYT + + L +EL  WL +A  P   E  R +IAPHAGYSY G  +A+
Sbjct: 1   MSRVRKASHAGSWYTGDGEGLNKELGKWLSKA--PAGGEHARAIIAPHAGYSYCGACSAH 58

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+  IDP+ + RVF+LGPSHH     CA++ A  Y+TP+ DL +D ++ EEL +T  FE 
Sbjct: 59  AYKQIDPSRVHRVFILGPSHHVRLNGCAITAAEKYQTPLYDLTIDAQISEELYSTSMFEK 118

Query: 120 MDICVDEAEHSMEMHLPYLAKVFE---GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           M    DE EHS+EMHLPY+AKV +   GH   ++P+LVG++N + EA YGR+F++Y+ DP
Sbjct: 119 MSNQTDEDEHSIEMHLPYVAKVMQSRQGHFT-VIPVLVGSLNPDKEAAYGRIFSRYLADP 177

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
           +N F +SSDFCHWG RF Y +YDK  G I+KSIE LD+MGM++IE  DP AF  YL +Y 
Sbjct: 178 NNLFVISSDFCHWGQRFRYTYYDKSKGDIYKSIEHLDRMGMNLIEEMDPPAFTAYLEQYG 237

Query: 237 NTICGRHPISVFLHML-------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           NTICGRHPI V L+ +       GN   +  + FL+Y QSSQC T +DSSVSYASAA  +
Sbjct: 238 NTICGRHPIGVLLNSIKCLRESRGN-GHRYSLHFLKYAQSSQCLTNKDSSVSYASAALTI 296


>gi|281204139|gb|EFA78335.1| hypothetical protein PPL_08986 [Polysphondylium pallidum PN500]
          Length = 289

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 153/290 (52%), Positives = 192/290 (66%), Gaps = 1/290 (0%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR  +HAGSWY+ N+  L+++LD WL E         R +I PHAGYSYSGRAA+  
Sbjct: 1   MATIRSCTHAGSWYSRNASELSKQLDNWLGETK-SIGENTRAIIVPHAGYSYSGRAASRG 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           + N+D +   R+F+LGPSHH Y   C L+  T ++TPIG+L +D EVI++L  T  F   
Sbjct: 60  YINLDASKYKRIFILGPSHHVYMKTCGLTKLTHFETPIGNLEVDTEVIKKLDDTSIFTWT 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
              VDE EHS+E+HLPY+AK+ +   IKIVPI+VG+++ + E  YG++ A Y DDP NFF
Sbjct: 120 TKDVDEDEHSIELHLPYVAKMIQDKKIKIVPIMVGSLSKDTEEQYGKVLAPYFDDPENFF 179

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDFCHWG RF Y HY++  G IHKSIE LD+ GM +IETGDP  F  YL E  NTIC
Sbjct: 180 VISSDFCHWGKRFGYTHYNQSAGPIHKSIEELDRTGMSMIETGDPLQFSTYLKETKNTIC 239

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           GR+PI+V L    N   K  IK L YEQSSQC    DSSVSY   A K D
Sbjct: 240 GRNPIAVMLWTAKNSRNKHSIKSLHYEQSSQCMNMNDSSVSYGVLAIKQD 289


>gi|91094281|ref|XP_970827.1| PREDICTED: similar to CG8031 CG8031-PA [Tribolium castaneum]
 gi|270014397|gb|EFA10845.1| hypothetical protein TcasGA2_TC001622 [Tribolium castaneum]
          Length = 295

 Score =  318 bits (816), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 154/290 (53%), Positives = 200/290 (68%), Gaps = 8/290 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+A HAGSWYTD+   L+ +L+ WL +A L   P  R +IAPHAGY Y G    +A+  
Sbjct: 3   VRKAIHAGSWYTDSGPELSRQLEYWLHQADLIHGP-ARAIIAPHAGYQYCGACGGHAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P  + R+F+LGPSHH     CALSTA  YKTP+ DL +D +V  EL+ TG+FE MD+ 
Sbjct: 62  VSPVVVRRIFILGPSHHVRLSGCALSTARKYKTPLYDLHVDSQVNGELEMTGQFEWMDLD 121

Query: 124 VDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            DE EHS+EMHLPY+AKV E   H   IVPILVG+++ E EA YGR+ + Y+ DP N F 
Sbjct: 122 TDENEHSIEMHLPYIAKVMENYKHQFTIVPILVGSLSPEREACYGRILSSYLGDPQNLFI 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF Y +YD+ +G I++SIE LD++GM+IIE  +P  F  YL ++ NTICG
Sbjct: 182 ISSDFCHWGHRFRYTYYDRSYGSIYQSIEELDRLGMNIIENLNPTDFTNYLKKFGNTICG 241

Query: 242 RHPISVFLH----MLGNCSTK-IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           RHPI + L     +L N S +   +KFL+Y QSSQC+   DSSVSYASAA
Sbjct: 242 RHPIGILLQAVQELLRNDSRRNANLKFLKYAQSSQCRNMNDSSVSYASAA 291


>gi|320167720|gb|EFW44619.1| cell motility mediator [Capsaspora owczarzaki ATCC 30864]
          Length = 300

 Score =  317 bits (813), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 159/296 (53%), Positives = 192/296 (64%), Gaps = 9/296 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE---VRGVIAPHAGYSYSGRAAAYA 60
           +R ASHAGSWY +   RL  +LDGWL  A           R +IAPHAGYSYSG  AA+A
Sbjct: 5   VREASHAGSWYDEGGARLESQLDGWLAAANSDADSNHTPARAIIAPHAGYSYSGPTAAFA 64

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I P ++ RVF+LGPSHH Y   CALS  T Y TP+G+L LD  +I EL AT  FE M
Sbjct: 65  YKQIIPDNVQRVFILGPSHHVYLKGCALSRCTEYATPLGNLKLDAAIIAELNATQSFEFM 124

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
              VDE EHS+EMHLPY+ KVF  +    IVPILVGA+  E EA+YG++ A Y+ +P+N 
Sbjct: 125 TKSVDEEEHSIEMHLPYIRKVFNKNANATIVPILVGALATEKEAVYGKILAPYLANPANI 184

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           F VSSDFCHWGSRFNY +       IH+SI  LD  GM +IE  D D F  YL    NTI
Sbjct: 185 FVVSSDFCHWGSRFNYTYTTSSQVPIHESINILDHQGMALIERLDCDGFAAYLKSTKNTI 244

Query: 240 CGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           CGRHPI V L  +    T     +  +KF++Y QSS CKT+RDSSVSYASA+ +VD
Sbjct: 245 CGRHPIGVLLQAIRAVRTTDTAFQPNLKFVKYAQSSACKTQRDSSVSYASASLRVD 300


>gi|198433196|ref|XP_002131141.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 296

 Score =  316 bits (810), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 148/288 (51%), Positives = 197/288 (68%), Gaps = 6/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRASHAGSWY    + L  +L  WL +A        R +I+PHAGYSY G  AA+AF  I
Sbjct: 4   RRASHAGSWYEGKGEDLDMQLSKWLSDASGSNHRPARAIISPHAGYSYCGSCAAHAFSQI 63

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP ++ +VF+LGPSHH+Y  +CALS  + Y TP+ +L +D EV +EL  TGKF  M   V
Sbjct: 64  DPNTVEKVFILGPSHHFYLKECALSPTSTYITPLYNLKIDQEVYQELYGTGKFRTMSQSV 123

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EM LPY+AKV E       I+PILVG+++  +E +YG + AKY+++P+N F +
Sbjct: 124 DEDEHSIEMQLPYIAKVMESKKGQFTIIPILVGSLDQASEKLYGEILAKYMNNPNNVFVI 183

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y  YD++HG I+KSIEALD+ GM +IE  DP+ F +YL  Y NTICGR
Sbjct: 184 SSDFCHWGRRFQYTRYDEQHGEIYKSIEALDRKGMQLIENLDPEGFYEYLAAYKNTICGR 243

Query: 243 HPISVFLHMLGNCST----KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI+V L+ +         +++ KFL Y+QSS+C+   DSSVSYA+ +
Sbjct: 244 HPIAVMLNAVKTAQNGNGLRLEFKFLDYKQSSKCRVHSDSSVSYAAGS 291


>gi|71361623|ref|NP_001025088.1| protein MEMO1 [Rattus norvegicus]
 gi|81918134|sp|Q4QQR9.1|MEMO1_RAT RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
           cell motility 1; Short=Memo-1
 gi|67678458|gb|AAH98061.1| Mediator of cell motility 1 [Rattus norvegicus]
          Length = 297

 Score =  316 bits (809), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKSGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 296


>gi|19526994|ref|NP_598532.1| protein MEMO1 [Mus musculus]
 gi|62899882|sp|Q91VH6.1|MEMO1_MOUSE RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
           cell motility 1; Short=Memo-1
 gi|17046305|gb|AAL34463.1|AF363447_1 C21orf19-like protein [Mus musculus]
 gi|15489467|gb|AAH13819.1| Memo1 protein [Mus musculus]
          Length = 297

 Score =  315 bits (808), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSVTRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 296


>gi|344288747|ref|XP_003416108.1| PREDICTED: protein MEMO1-like isoform 1 [Loxodonta africana]
          Length = 297

 Score =  315 bits (807), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 296


>gi|327262347|ref|XP_003215986.1| PREDICTED: protein MEMO1-like isoform 1 [Anolis carolinensis]
          Length = 297

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 153/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L+ +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLSAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D +V  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKVYAELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     IVP+LVGA++   E  +G+LF+KY+ DP N F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIVPVLVGALSETKEQEFGKLFSKYLADPCNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 296


>gi|193697516|ref|XP_001942616.1| PREDICTED: protein MEMO1-like [Acyrthosiphon pisum]
          Length = 291

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 200/290 (68%), Gaps = 4/290 (1%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRASH+G+WY +++K L+ +L+ WL  A L   P  R +IAPHAGY Y G  AA+A+  
Sbjct: 3   VRRASHSGTWYNESAKELSTQLENWLGAAELTHGP-ARAIIAPHAGYQYCGACAAFAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I P  + R+F+LGPSH+     CALS+A  Y TP  DL +D EV EEL++TG FE M + 
Sbjct: 62  ISPFVVQRIFILGPSHNVRLSGCALSSAVKYSTPFYDLQIDNEVYEELESTGHFERMSMK 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           VDE EHS+EM LPY+AKV + +  K  IVP++VG+++ E E++YG +FA+Y+ DP N F 
Sbjct: 122 VDEDEHSIEMQLPYIAKVMQDYKDKFTIVPVMVGSLSTEKESLYGHIFAQYLADPKNLFI 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF Y  Y+K  G IH+SI  LD  GMD IET  P AF +YL  + NTICG
Sbjct: 182 ISSDFCHWGQRFRYTFYEKSWGEIHQSITTLDHKGMDTIETLKPAAFVEYLKTFSNTICG 241

Query: 242 RHPISVFLHMLGNCST-KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           RHPISV L    +  T K+ +KFL+Y QSS+C    DSSVSYASA+  ++
Sbjct: 242 RHPISVLLQAANHLRTQKLSLKFLKYAQSSKCYNLGDSSVSYASASLIIE 291


>gi|7705720|ref|NP_057039.1| protein MEMO1 isoform 1 [Homo sapiens]
 gi|349585097|ref|NP_001231806.1| mediator of cell motility 1 [Sus scrofa]
 gi|114576869|ref|XP_001164110.1| PREDICTED: protein MEMO1 isoform 3 [Pan troglodytes]
 gi|126303134|ref|XP_001371460.1| PREDICTED: protein MEMO1-like isoform 1 [Monodelphis domestica]
 gi|402890505|ref|XP_003908527.1| PREDICTED: protein MEMO1 isoform 1 [Papio anubis]
 gi|402890507|ref|XP_003908528.1| PREDICTED: protein MEMO1 isoform 2 [Papio anubis]
 gi|7388490|sp|Q9Y316.1|MEMO1_HUMAN RecName: Full=Protein MEMO1; AltName: Full=C21orf19-like protein;
           AltName: Full=Hepatitis C virus NS5A-transactivated
           protein 7; Short=HCV NS5A-transactivated protein 7;
           AltName: Full=Mediator of ErbB2-driven cell motility 1;
           Short=Mediator of cell motility 1; Short=Memo-1
 gi|75076342|sp|Q4R6D9.1|MEMO1_MACFA RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
           cell motility 1; Short=Memo-1
 gi|4680693|gb|AAD27736.1|AF132961_1 CGI-27 protein [Homo sapiens]
 gi|17046303|gb|AAL34462.1|AF363446_1 C21orf19-like protein [Homo sapiens]
 gi|17511762|gb|AAH18733.1| Mediator of cell motility 1 [Homo sapiens]
 gi|33328304|gb|AAQ09602.1| NS5ATP7 [Homo sapiens]
 gi|56377669|dbj|BAD74066.1| C2orf4 [Homo sapiens]
 gi|66267174|gb|AAH94681.1| Mediator of cell motility 1 [Homo sapiens]
 gi|67969978|dbj|BAE01336.1| unnamed protein product [Macaca fascicularis]
 gi|119620872|gb|EAX00467.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
 gi|119620874|gb|EAX00469.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
 gi|119620875|gb|EAX00470.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
 gi|119620877|gb|EAX00472.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
 gi|123982438|gb|ABM82960.1| chromosome 2 open reading frame 4 [synthetic construct]
 gi|123997099|gb|ABM86151.1| chromosome 2 open reading frame 4 [synthetic construct]
 gi|380785185|gb|AFE64468.1| protein MEMO1 isoform 1 [Macaca mulatta]
 gi|383411277|gb|AFH28852.1| protein MEMO1 isoform 1 [Macaca mulatta]
 gi|410209454|gb|JAA01946.1| mediator of cell motility 1 [Pan troglodytes]
 gi|410249514|gb|JAA12724.1| mediator of cell motility 1 [Pan troglodytes]
 gi|410298038|gb|JAA27619.1| mediator of cell motility 1 [Pan troglodytes]
 gi|410352083|gb|JAA42645.1| mediator of cell motility 1 [Pan troglodytes]
 gi|417398534|gb|JAA46300.1| Putative dioxygenase [Desmodus rotundus]
          Length = 297

 Score =  314 bits (805), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296


>gi|195395150|ref|XP_002056199.1| GJ10804 [Drosophila virilis]
 gi|194142908|gb|EDW59311.1| GJ10804 [Drosophila virilis]
          Length = 295

 Score =  314 bits (805), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 154/289 (53%), Positives = 192/289 (66%), Gaps = 7/289 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  
Sbjct: 3   VRRATHAGSWYTDSGSDLSRQLDRWLGAADLSHGP-ARAIIAPHAGYAYCGACGAFAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P  + R+F+LGPSHH     CALS A   +TP+ DL +D +V  EL+ TG+F  MD+ 
Sbjct: 62  VSPVVVKRIFILGPSHHVRLRGCALSVAKKCRTPLYDLKIDTQVNAELEKTGQFSWMDMK 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            DE EHS+EMHLPY+AKV E       IVPI+VG++N E EA YG L A Y  DP+N F 
Sbjct: 122 SDEDEHSIEMHLPYIAKVMEDFKDQFTIVPIIVGSLNPEQEAQYGSLLANYFMDPTNLFV 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF+Y +YD+  G IHK IE LDK GMDIIE   P AF +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGQRFSYTYYDRSCGPIHKCIEQLDKQGMDIIEKLSPTAFTEYLRKYNNTICG 241

Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           RHPI V L  +     +    +  KFL+Y QSSQC    DSSVSYAS +
Sbjct: 242 RHPIGVMLGAIKALQDQGFKNMSFKFLKYAQSSQCLDMEDSSVSYASGS 290


>gi|387016908|gb|AFJ50572.1| Protein MEMO1-like [Crotalus adamanteus]
          Length = 297

 Score =  314 bits (805), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L+ +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLSAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D +V  +L  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLQIDQKVYTDLWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     IVP+LVGA++   E  +G+LF+KY+ DP N F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIVPVLVGALSETKEQEFGKLFSKYLADPCNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM+IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDETQGEIYRSIEHLDKMGMNIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 296


>gi|161172368|pdb|3BD0|A Chain A, Crystal Structure Of Memo, Form Ii
 gi|161172369|pdb|3BD0|B Chain B, Crystal Structure Of Memo, Form Ii
 gi|161172370|pdb|3BD0|C Chain C, Crystal Structure Of Memo, Form Ii
 gi|161172371|pdb|3BD0|D Chain D, Crystal Structure Of Memo, Form Ii
          Length = 293

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 4   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 63  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 123 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 183 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 242

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 243 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 292


>gi|47123293|gb|AAH70046.1| MEMO1 protein [Homo sapiens]
          Length = 297

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 152/290 (52%), Positives = 193/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +   L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPHLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296


>gi|306482629|ref|NP_001182352.1| protein MEMO1 [Gallus gallus]
          Length = 297

 Score =  313 bits (801), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 150/290 (51%), Positives = 194/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP    ++F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPNITRKIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSEAKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM+IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMNIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHMLGNCS---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+ +         +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAINELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 296


>gi|428176181|gb|EKX45067.1| hypothetical protein GUITHDRAFT_71667 [Guillardia theta CCMP2712]
          Length = 295

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 153/293 (52%), Positives = 209/293 (71%), Gaps = 9/293 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR---GVIAPHAGYSYSGRAAAYA 60
           +R ASHAGSWY      L ++LD WL++  + KS E++   G+IAPHAGYSYSG  AAYA
Sbjct: 3   VRVASHAGSWYAGRGDELKKQLDRWLQD--VKKSDELQPSCGLIAPHAGYSYSGPTAAYA 60

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  +DP+ I R+F+LGPSHHYY   CALS  + Y TP+G+L +D +V++EL ++G FE M
Sbjct: 61  YRYMDPSRIKRIFVLGPSHHYYLTGCALSKHSAYATPLGNLEIDTDVVKELHSSGLFEYM 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
              VDE EHS+EM +PY+ KV E      KIVP+LVG ++ + E +YG LF++Y++DPS 
Sbjct: 121 SEEVDEDEHSLEMQMPYIYKVMEQASKSYKIVPVLVGNLSPKKEDVYGSLFSRYLNDPST 180

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
           FF +SSDFCHWGSRF Y +YD   G I++SIEALD+ GM  IE+  P+ F+ Y  +++NT
Sbjct: 181 FFVISSDFCHWGSRFRYTYYDPSFGEIYQSIEALDRQGMQEIESLSPENFQNYQAKFNNT 240

Query: 239 ICGRHPISVFLHML--GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           ICGRHPI+V L+ L   + +++  I+F+RY QSS C+ + DSSVSYASA  K 
Sbjct: 241 ICGRHPIAVLLNALKASSAASQASIQFVRYAQSSPCRCQADSSVSYASAVVKT 293


>gi|158254942|dbj|BAF83442.1| unnamed protein product [Homo sapiens]
          Length = 297

 Score =  312 bits (799), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 151/290 (52%), Positives = 193/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSH     +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHRVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296


>gi|328774127|gb|EGF84164.1| hypothetical protein BATDEDRAFT_18471 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 299

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 154/286 (53%), Positives = 192/286 (67%), Gaps = 5/286 (1%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+ASHAGSWY+ + ++L+ +LD W+  A +  +P VR +IAPHAGYSYSG AAA+A+G I
Sbjct: 9   RKASHAGSWYSASQEQLSSQLDSWISTADITPNPNVRAIIAPHAGYSYSGPAAAWAYGAI 68

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           D   I RVF+LGPSHH Y   CALS  T+YKTP+GDL +D +V+  LKATG+FE + I  
Sbjct: 69  DSHQIKRVFILGPSHHVYIGGCALSPCTIYKTPLGDLTIDTDVVASLKATGQFETLSIQA 128

Query: 125 DEAEHSMEMHLPYLAKVFEGHLI--KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPYL K+         IVPILVGA  + ++  Y  +FA Y+ D +N F V
Sbjct: 129 DEDEHSVEMHLPYLYKIMSKKTTPYTIVPILVGASRSSDQHNYATIFAPYLADSTNLFIV 188

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG  F Y +       IH SIE LD+  M +IETG+   F+ YL    NTICGR
Sbjct: 189 SSDFCHWGPNFGYTYQFDPSLQIHASIEKLDREAMRVIETGNSKDFEAYLKRTKNTICGR 248

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +PI V LH    L N   K+ IKF  Y QSS+  TKRD+SVSYASA
Sbjct: 249 NPIGVLLHALECLENMGKKVDIKFTHYAQSSRALTKRDNSVSYASA 294


>gi|160774203|gb|AAI55285.1| Mediator of cell motility 1 [Danio rerio]
          Length = 297

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 154/290 (53%), Positives = 192/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +A     P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGSQLNAQLEGWLSQAQSIAGP-ARAIIAPHAGYTYCGACAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   RVF+LGPSHH    +CALS A VY+TP+ DL +D +V  +L  TG FE M +  
Sbjct: 67  DPSITRRVFILGPSHHVPLSRCALSPAEVYRTPLYDLRIDQKVYADLWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     IVP+LVGA++   E  YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMENHKDEFSIVPVLVGALSESKEQEYGKLLSKYLADPSNLFII 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPISFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+ +         +   FL Y QSSQC+   DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAVAELKKNGIDMNFSFLNYAQSSQCRNWSDSSVSYAAGALIV 296


>gi|157167680|ref|XP_001661638.1| hypothetical protein AaeL_AAEL011394 [Aedes aegypti]
 gi|108872303|gb|EAT36528.1| AAEL011394-PA [Aedes aegypti]
          Length = 293

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 153/287 (53%), Positives = 195/287 (67%), Gaps = 6/287 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYTD+   L  +L  WL +A L   P  R +IAPHAGY Y G   A+A+  I
Sbjct: 4   REASHAGSWYTDSGSDLNRQLSKWLDDADLTFGP-ARAIIAPHAGYRYCGACGAWAYRQI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + RVF+LGPSHH    +CALS A   +TP+ DL +D EV  EL+ATG F+ MD   
Sbjct: 63  SPAVVKRVFILGPSHHVRLSRCALSAAQYCRTPLYDLKVDQEVNAELEATGHFKWMDQKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E       IVPI+VG+++ + E  YG++ A Y+ DP N F +
Sbjct: 123 DEDEHSIEMHLPYVAKVMEDFRDQFTIVPIMVGSISNDWEETYGKILAPYLADPQNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG+RF Y +Y+  HG I+K IE LDKMGMD+IET  P++F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGARFRYTYYEDSHGPIYKWIEVLDKMGMDLIETLKPESFGEYLRKYNNTICGR 242

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L     L     ++  KFL+Y+QS+QC  K+DSSVSYAS +
Sbjct: 243 HPIGVLLQSVEELKRRGYRMSFKFLKYDQSNQCCDKKDSSVSYASGS 289


>gi|159483599|ref|XP_001699848.1| hypothetical protein CHLREDRAFT_182241 [Chlamydomonas reinhardtii]
 gi|158281790|gb|EDP07544.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 285

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 150/286 (52%), Positives = 193/286 (67%), Gaps = 2/286 (0%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAY 59
           M KIRR SHA SWYT     L  E++ W      LP  P  R +I PHAGYSY G   A+
Sbjct: 1   MPKIRRPSHAHSWYTGEGDALDAEINKWKAAVTPLPGLP-ARAIIGPHAGYSYCGHVMAH 59

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+ +I+P +  RVFLLGPSHH ++ KC LS+ T Y+TP+G + +D EV  +L+ +GKF+ 
Sbjct: 60  AYAHINPANFKRVFLLGPSHHVFSRKCLLSSQTHYETPLGSMEIDQEVYAQLRDSGKFDS 119

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           M   VDE EHS+E+H PY+     G   K+VPI+VGA+  + EA YG L   Y+DDP+N 
Sbjct: 120 MSQDVDEDEHSLELHTPYIVHTMRGRDFKLVPIMVGALTTDAEAAYGSLLGPYLDDPANL 179

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           F VSSDFCHWGSRF+Y  YDK  G I +SI+ LD+ GM IIE+ +P  F  YL +Y NTI
Sbjct: 180 FVVSSDFCHWGSRFSYTFYDKSQGQIWESIKWLDETGMKIIESAEPSKFASYLEQYKNTI 239

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           CGRHPI VFL+ML + S + +I F +Y+QSS+C T   SSVSYA+A
Sbjct: 240 CGRHPIGVFLNMLQHSSLRHRISFTKYDQSSKCTTPSHSSVSYAAA 285


>gi|170037682|ref|XP_001846685.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880969|gb|EDS44352.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 293

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 151/287 (52%), Positives = 196/287 (68%), Gaps = 6/287 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYTD+   L  +L  WL ++ L   P  R +IAPHAGY Y G   A+A+  I
Sbjct: 4   REASHAGSWYTDSGSELNRQLSKWLDDSDLTFGP-ARAIIAPHAGYRYCGACGAWAYRQI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + RVF+LGPSHH    +CALS A   +TP+ DL +D +V  EL+ATG F+ MD   
Sbjct: 63  SPAVVKRVFILGPSHHVRLSRCALSAAQFCRTPLYDLKVDQQVNAELEATGHFKWMDQKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E       IVPI+VG+++ + E  YG++ A Y+ DP N F +
Sbjct: 123 DEDEHSIEMHLPYVAKVMEDFRDQFTIVPIMVGSISNDWEETYGKILAPYLADPQNLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG+RF Y +Y++ HG I+K IE LDKMGMD+IET  P++F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGARFRYTYYEESHGPIYKWIEVLDKMGMDLIETLKPESFAEYLRKYNNTICGR 242

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V L     L     ++  KFL+Y+QS+QC  K+DSSVSYAS +
Sbjct: 243 HPIGVLLQSVEELKRRGYRMSFKFLKYDQSNQCCDKKDSSVSYASGS 289


>gi|348507064|ref|XP_003441077.1| PREDICTED: protein MEMO1-like isoform 1 [Oreochromis niloticus]
          Length = 297

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 153/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWY+ +  +L  +L+GWL +A     P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYSASGSQLNAQLEGWLSQAQSTIRP-ARAIIAPHAGYTYCGACAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   RVF+LGPSHH    +CALS A +Y+TP+ DL +D +V  +L  TG FE M +  
Sbjct: 67  DPSITRRVFILGPSHHVPLSRCALSPADIYRTPLYDLRIDQKVYADLWKTGLFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     IVP+LVGA++   E  YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFSIVPVLVGALSESKEQEYGKLLSKYLADPSNLFII 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPMSFTNYLKKYRNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L     ++   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAVAELRKSGLEMNFTFLNYAQSSQCRNWQDSSVSYAAGALIV 296


>gi|343962447|dbj|BAK62811.1| protein C2orf4 [Pan troglodytes]
          Length = 297

 Score =  310 bits (794), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 151/290 (52%), Positives = 193/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQ +  +DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQRRNWQDSSVSYAAGALTV 296


>gi|41054231|ref|NP_956088.1| protein MEMO1 [Danio rerio]
 gi|82177098|sp|Q803S3.1|MEMO1_DANRE RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
           cell motility 1; Short=Memo-1
 gi|27882485|gb|AAH44360.1| Mediator of cell motility 1 [Danio rerio]
          Length = 297

 Score =  310 bits (793), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 153/290 (52%), Positives = 191/290 (65%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +A     P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGSQLNAQLEGWLSQAQSIAGP-ARAIIAPHAGYTYCGACAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   RVF+LGPSHH    +CALS A VY+TP+ DL +D +V  +L  TG FE M +  
Sbjct: 67  DPSITRRVFILGPSHHVPLSRCALSPAEVYRTPLYDLRIDQKVYADLWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     IVP+LVGA++   E  YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMENHKDEFSIVPVLVGALSGSKEQEYGKLLSKYLADPSNLFII 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           S DFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SPDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPISFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+ +         +   FL Y QSSQC+   DSSVSYA+ A  V
Sbjct: 247 HPIGVLLNAVAELKKNGIDMNFSFLNYAQSSQCRNWSDSSVSYAAGALIV 296


>gi|74198004|dbj|BAE35184.1| unnamed protein product [Mus musculus]
          Length = 302

 Score =  309 bits (792), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 152/295 (51%), Positives = 195/295 (66%), Gaps = 11/295 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSVTRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHW-----GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
           SSDFCHW     G RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y N
Sbjct: 187 SSDFCHWGYLFQGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHN 246

Query: 238 TICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           TICGRHPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 247 TICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 301


>gi|328872880|gb|EGG21247.1| hypothetical protein DFA_01125 [Dictyostelium fasciculatum]
          Length = 288

 Score =  307 bits (787), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 145/279 (51%), Positives = 188/279 (67%), Gaps = 1/279 (0%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A H G+WY  +S +L ++L GWL E    K    + +I PHAGY YSGRAA+  + N+
Sbjct: 4   RAAYHGGNWYASDSNKLGQQLSGWLGETKSIKK-NTKALIVPHAGYDYSGRAASRGYINL 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
             T+  R+F+LGPSHH YT  C LS  T  +TP+GDL +D+E+  +L  TG+F  M   +
Sbjct: 63  SDTTFKRIFVLGPSHHVYTKSCGLSKHTHLETPVGDLKVDVEITNKLNLTGQFTWMTKEI 122

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DE EHS+EMH PY+A+V +G  I IVP+LVG V  EN   Y  +FA Y+DDP NFF +SS
Sbjct: 123 DEEEHSLEMHFPYIAQVTKGRDITIVPVLVGNVPKENLKTYADIFAPYLDDPDNFFIISS 182

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWGSRF+Y  +D   G IH+ IE +DK GM  IE+GDP  F  YL E  NTICGR+P
Sbjct: 183 DFCHWGSRFDYSPHDPSCGAIHQFIEKMDKNGMKAIESGDPSVFADYLKETKNTICGRNP 242

Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           IS+FL ++ +   K +I+ L YEQSS+C + RDSSVSY 
Sbjct: 243 ISLFLWIIHSSKLKFEIQSLYYEQSSKCNSMRDSSVSYG 281


>gi|58392176|ref|XP_319155.2| AGAP010011-PA [Anopheles gambiae str. PEST]
 gi|55236230|gb|EAA13930.2| AGAP010011-PA [Anopheles gambiae str. PEST]
          Length = 295

 Score =  307 bits (786), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 146/287 (50%), Positives = 195/287 (67%), Gaps = 6/287 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A+HAGSWYTD+   L  +L+GWL +A L   P  R +IAPHAGY Y G   A+A+  I
Sbjct: 6   RLATHAGSWYTDSGAELNRQLNGWLNDADLTFGP-ARAIIAPHAGYRYCGACGAWAYRQI 64

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + RVF+LGPSHH    +CALS      TP+ +L +D ++ ++L++TG F+ MD   
Sbjct: 65  SPAVVKRVFILGPSHHVRLSRCALSAVQYCNTPLYNLKVDQQINQQLESTGHFKWMDQKT 124

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY+AKV E +     IVP++VG++  E E  YG++FA Y+ DP N F +
Sbjct: 125 DEDEHSIEMHLPYVAKVMEDYKDQFTIVPVMVGSITNEWEETYGKIFAPYLADPQNLFVI 184

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +Y++  G I+K IE LDKMGMD+IET   D+F +YL +Y+NTICGR
Sbjct: 185 SSDFCHWGQRFRYTYYEEGSGPIYKWIETLDKMGMDLIETLKADSFSEYLRKYNNTICGR 244

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI V +     L     ++  KFL+Y+QS+QC  K+DSSVSYAS +
Sbjct: 245 HPIGVLMQAVEELKRRGHRMSFKFLKYDQSNQCCDKKDSSVSYASGS 291


>gi|393248178|gb|EJD55685.1| UPF0103-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 305

 Score =  304 bits (779), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 162/298 (54%), Positives = 187/298 (62%), Gaps = 17/298 (5%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+S  L  +L  WL  AG       + +IAPHAGYSYSG AAA+A+  I
Sbjct: 4   RRATHAGSWYTDDSDELQGQLTSWLASAGSAPIHGCKAIIAPHAGYSYSGPAAAWAYKCI 63

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           D T I RVF+LGPSHHYY P CALS    Y TP+G+LP+D   + EL+A+GKFE M    
Sbjct: 64  DTTGIRRVFVLGPSHHYYLPGCALSVLDAYNTPVGNLPIDKTTVAELRASGKFEDMSRTT 123

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DE EHS+EMHLPY+ K FEG  I IVP+LVGA++ E E  YG L A Y+  P  F  VSS
Sbjct: 124 DEDEHSLEMHLPYVRKAFEGLDIVIVPVLVGALDTEGERSYGALLAPYLSQPDTFTVVSS 183

Query: 185 DFCHWGSRFNY-MHYDK---------KHGV------IHKSIEALDKMGMDIIETGDPDAF 228
           DFCHWG RF Y +HY             G       IHKSIEALD+  M +IE      F
Sbjct: 184 DFCHWGIRFQYTLHYPNPPPSALAKLSRGARPPAHPIHKSIEALDREAMGVIEGRQHARF 243

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKI-KIKFLRYEQSSQCKTKRDSSVSYASA 285
             YL    NTICGRHPI V L  L        +IKF+RYEQSSQC    DSSVSYASA
Sbjct: 244 MDYLRRTRNTICGRHPIGVLLAALETLGGDDHEIKFVRYEQSSQCVDVNDSSVSYASA 301


>gi|410901597|ref|XP_003964282.1| PREDICTED: protein MEMO1-like isoform 1 [Takifugu rubripes]
          Length = 297

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 150/290 (51%), Positives = 193/290 (66%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASH+GSWY+ +  +L  +L+ WL +A   + P  R +IAPHAGYSY G  AA+A+  +
Sbjct: 8   REASHSGSWYSASGSQLNAQLEDWLSKANSKRRP-ARAIIAPHAGYSYCGACAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DPT   RVF+LGPSHH +   CALS+A +Y+TP+ D+ +D +V  EL  TG F+ M+I  
Sbjct: 67  DPTVTRRVFILGPSHHVHLTCCALSSAEIYRTPLYDMRIDQKVYAELWKTGLFDRMNIKT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E       IVPILVGA++   E  YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESRKDDFSIVPILVGALSENKEHEYGKLLSKYLADPSNLFVI 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG+RF+Y +YD+  G I++SIE LDKMGM IIE  DP +F  Y+ +Y NTICGR
Sbjct: 187 SSDFCHWGNRFHYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPPSFSNYIKKYRNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QS QC+    SSVSYA+ A  V
Sbjct: 247 HPIGVLLNAVAELRKSGLDMNFSFLNYAQSGQCRNWEQSSVSYAAGALYV 296


>gi|391341233|ref|XP_003744935.1| PREDICTED: protein MEMO1-like [Metaseiulus occidentalis]
          Length = 300

 Score =  304 bits (778), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 151/292 (51%), Positives = 188/292 (64%), Gaps = 6/292 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A+HAGSWYT+    L  +LDGWL+  G       R +IAPHAGY YSG  AA+A+  I
Sbjct: 6   RAATHAGSWYTEKENELRNQLDGWLQLVGNATHSPARAIIAPHAGYRYSGSTAAFAYKQI 65

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP  + ++F+LGPSHH +   CALS AT YKTP+ DL +D E+ +EL  TG FE M +  
Sbjct: 66  DPDRVRKIFILGPSHHVFLEGCALSPATTYKTPLMDLTIDKEIYDELYKTGLFEEMSLGT 125

Query: 125 DEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           DE EHS+EM LPY+AKV E        I+PI+VGA++ E EA+YGR  AKY+      F 
Sbjct: 126 DEDEHSIEMQLPYIAKVMERKTNGPFTIIPIMVGAISTEKEAIYGRALAKYLGREDVCFV 185

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           VSSDFCHWGSRF Y HYD+  G I +SI  LD  GM +IE   P +F  YL    NTICG
Sbjct: 186 VSSDFCHWGSRFRYQHYDEAQGDIWESISKLDHDGMKLIEDLSPTSFASYLKSTGNTICG 245

Query: 242 RHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           RHPI V L+    L     +   +FLRY QS++  +++DSSVSYASAA  VD
Sbjct: 246 RHPIGVLLNSIATLQQSPIRFNCEFLRYAQSNRVTSRQDSSVSYASAAFLVD 297


>gi|161172364|pdb|3BCZ|A Chain A, Crystal Structure Of Memo
 gi|161172365|pdb|3BCZ|B Chain B, Crystal Structure Of Memo
 gi|161172366|pdb|3BCZ|C Chain C, Crystal Structure Of Memo
 gi|161172367|pdb|3BCZ|D Chain D, Crystal Structure Of Memo
          Length = 293

 Score =  302 bits (774), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 148/290 (51%), Positives = 189/290 (65%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 4   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE   +  
Sbjct: 63  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGXFERXSLQT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+E HLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 123 DEDEHSIEXHLPYTAKAXESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDK G  IIE  DP +F  YL +Y NTICGR
Sbjct: 183 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKXGXSIIEQLDPVSFSNYLKKYHNTICGR 242

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L          FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 243 HPIGVLLNAITELQKNGXNXSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 292


>gi|227204473|dbj|BAH57088.1| AT2G25280 [Arabidopsis thaliana]
          Length = 185

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 150/185 (81%), Positives = 168/185 (90%)

Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
           +I+E++A GKF +MD+ VDEAEHSMEMHLPYLAKVFEG+ +K+VPILVGAV+ ENEAMYG
Sbjct: 1   MIKEIRAMGKFGMMDLRVDEAEHSMEMHLPYLAKVFEGNNVKVVPILVGAVSPENEAMYG 60

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
            L AKYVDDP NFFSVSSDFCHWGSRFNYMHYD  HG IHKSIEALDK GMDIIETGDPD
Sbjct: 61  ELLAKYVDDPKNFFSVSSDFCHWGSRFNYMHYDNTHGAIHKSIEALDKKGMDIIETGDPD 120

Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           AFKKYLLE++NTICGRHPIS+FLHML + S+KIKI FLRYEQSSQC+T RDSSVSYASAA
Sbjct: 121 AFKKYLLEFENTICGRHPISIFLHMLKHSSSKIKINFLRYEQSSQCQTMRDSSVSYASAA 180

Query: 287 AKVDA 291
           AK++ 
Sbjct: 181 AKLET 185


>gi|380028966|ref|XP_003698154.1| PREDICTED: protein MEMO1-like [Apis florea]
          Length = 303

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 145/273 (53%), Positives = 190/273 (69%), Gaps = 8/273 (2%)

Query: 21  LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
           L ++L+GWL  A L   P  R +IAPHAGYSY G  AA+A+  I P  + R+F+LGPSHH
Sbjct: 28  LNKQLEGWLNAADLSHGP-ARAIIAPHAGYSYCGACAAFAYRQISPVVVRRIFILGPSHH 86

Query: 81  YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAK 140
                CALS+A++Y+T + DL +D +V  EL+ T  FE MD+  DE EHS+EM LP++AK
Sbjct: 87  VRLAGCALSSASIYQTLLYDLHIDQQVYRELEETRHFEWMDLNTDEEEHSIEMQLPFIAK 146

Query: 141 VFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
           V EG      I+PILVG+++ E EA+YGRL A Y+ DP   F +SSDFCHWG RF Y +Y
Sbjct: 147 VMEGFKDSFTIIPILVGSLSPEKEALYGRLLAPYMADPQTLFVISSDFCHWGQRFRYTYY 206

Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GN 254
           D+  G IH+SI+ LDKMGMDIIET +P  F  YL +Y NT+CGRHPIS+ L ++    GN
Sbjct: 207 DRSCGPIHRSIQNLDKMGMDIIETLNPTMFNDYLKKYGNTVCGRHPISILLQVIHSLKGN 266

Query: 255 CS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            +  ++ +KFL+Y QS+QC    DSSVSYASA+
Sbjct: 267 TNGQRMNLKFLKYVQSNQCNNMNDSSVSYASAS 299


>gi|299755585|ref|XP_002912116.1| hypothetical protein CC1G_13648 [Coprinopsis cinerea okayama7#130]
 gi|298411291|gb|EFI28622.1| hypothetical protein CC1G_13648 [Coprinopsis cinerea okayama7#130]
          Length = 321

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 164/319 (51%), Positives = 202/319 (63%), Gaps = 35/319 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG-----LPKSPEVRGVIAPHAGYSYSGRAAA 58
           IR ASHAGSWYT + + L +EL GWL++        P + + + VIAPHAGYSYSG AAA
Sbjct: 3   IREASHAGSWYTADGRELDQELSGWLQKVNPTDESFPIT-KSKAVIAPHAGYSYSGPAAA 61

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +A+  ID T I RVF+LGPSHH+Y   CALS  T Y+TPIG+LPLDLE I+ELKA+G+F 
Sbjct: 62  WAYKAIDTTGIKRVFILGPSHHFYLNGCALSPCTEYETPIGNLPLDLETIQELKASGRFA 121

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            M    DE EHS+EMHLPY+ K+F+G  IKIVPI+VG++N   EA +G L A Y++    
Sbjct: 122 NMSRKADEDEHSLEMHLPYIRKIFQGKDIKIVPIVVGSINQRQEAEFGALLAPYLERQDT 181

Query: 179 FFSVSSDFCHWGSRFNYMHY----DKKHGV---------------IHKSIEALDKMGMDI 219
           F  VSSDFCHWG+RF+Y +Y    D   G                IHKSIE LD   M +
Sbjct: 182 FTVVSSDFCHWGTRFSYTYYYPEADTPVGSGVSLNRSTAPTSSHPIHKSIERLDHEAMQL 241

Query: 220 IE------TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC----STKIKIKFLRYEQS 269
           +       T   +AF KYL +  NTICGRHPI V    L +     S    +K++RYEQS
Sbjct: 242 LSVPPGTATKAHEAFAKYLKQTGNTICGRHPIGVLFGALADLEKTKSLTPIVKWVRYEQS 301

Query: 270 SQCKTKRDSSVSYASAAAK 288
           S C T RDSSVSYASA  K
Sbjct: 302 SHCHTVRDSSVSYASAYIK 320


>gi|82802793|gb|ABB92446.1| rcC2orf4 [Gorilla gorilla]
          Length = 297

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 146/290 (50%), Positives = 189/290 (65%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y    AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYAYCRSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CAL++  +Y+TP+ DL +D ++  EL  TG F  M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALASVDIYRTPLNDLLIDQKIYGELWKTGMFGRMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EH +EMHLPY AK  E H     I+P+ VGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHGIEMHLPYTAKAMESHKDEFTIIPLPVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HP  V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPTGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296


>gi|239789321|dbj|BAH71291.1| ACYPI001056 [Acyrthosiphon pisum]
          Length = 291

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 148/290 (51%), Positives = 194/290 (66%), Gaps = 4/290 (1%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRASH+G+WY +++K L+ +L+ WL  A L   P  R +IAPHAGY Y G  AA+A+  
Sbjct: 3   VRRASHSGTWYNESAKELSTQLENWLGAAELTHGP-ARAIIAPHAGYQYCGACAAFAYRQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I P  + R+F+LGPSH+     CALS+A  Y TP  DL +D EV EEL++TG FE M + 
Sbjct: 62  ISPFVVQRIFILGPSHNVRLSGCALSSAVKYSTPFYDLQIDNEVYEELESTGHFERMSMK 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           VDE EHS+EM LPY+AKV + +  K  IVP++VG+++ E E++YG     Y+ DP N F 
Sbjct: 122 VDEDEHSIEMQLPYIAKVMQDYKDKFTIVPVMVGSLSTEKESLYGPYIRSYLADPKNLFI 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG  F Y  Y+K  G IH+SI  LD  GMD IET  P AF +YL  + NTICG
Sbjct: 182 ISSDFCHWGQGFRYTFYEKSWGEIHQSITTLDHKGMDTIETLKPAAFVEYLKTFSNTICG 241

Query: 242 RHPISVFLHMLGNCST-KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           RHPISV L    +  T K+ +KFL+Y QSS+C    DSSVSYA A+  ++
Sbjct: 242 RHPISVLLQAANHLRTQKLSLKFLKYAQSSKCYNLGDSSVSYAFASLIIE 291


>gi|156368571|ref|XP_001627766.1| predicted protein [Nematostella vectensis]
 gi|156214686|gb|EDO35666.1| predicted protein [Nematostella vectensis]
          Length = 296

 Score =  301 bits (770), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 145/290 (50%), Positives = 192/290 (66%), Gaps = 9/290 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWY+ +S+ L  +LDGWL +  +   P  R +IAPHAGYSY G   AYA+  +
Sbjct: 4   RRATHAGSWYSSHSRDLNSQLDGWLSKVDVNYRP-ARAIIAPHAGYSYCGSCGAYAYKQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DPT + RVF+LGPSHH +   CAL+  ++Y+TP+ +L +D  V  EL  TG F+ M    
Sbjct: 63  DPTVVKRVFILGPSHHVHLRGCALTKNSIYQTPLYNLEVDKAVNAELMGTGFFDEMSSKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+E+HLPY+AK  E       +VPILVG+++ + E  YG+L + Y+ +P N F +
Sbjct: 123 DEDEHSIELHLPYIAKAMESKRGQYTVVPILVGSLSPDKEKKYGKLLSTYLMNPENLFVI 182

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF + +YD ++  IH+SIEALDK GMD IE  DP  F +YL +Y NTICGR
Sbjct: 183 SSDFCHWGKRFKFYYYDSRYDRIHQSIEALDKKGMDFIERLDPQGFSEYLEKYGNTICGR 242

Query: 243 HPISVFLH------MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           HPI VFL+       L   S  +  +FL Y QS+QC+   DSSVSYA+  
Sbjct: 243 HPIGVFLNALSALSQLNGMSPSMSFRFLHYSQSNQCEDFDDSSVSYAAGV 292


>gi|66811492|ref|XP_639926.1| hypothetical protein DDB_G0284869 [Dictyostelium discoideum AX4]
 gi|74854058|sp|Q54NZ1.1|MEMO1_DICDI RecName: Full=Protein MEMO1 homolog
 gi|60466854|gb|EAL64898.1| hypothetical protein DDB_G0284869 [Dictyostelium discoideum AX4]
          Length = 290

 Score =  300 bits (769), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 147/282 (52%), Positives = 190/282 (67%), Gaps = 3/282 (1%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  +HAGSWY DN+++L ++L  WL EA   ++  V+ +IAPHAGYSYSGRAAAYA+ N
Sbjct: 2   IRNCTHAGSWYLDNARKLEKQLSDWLSEAS-RQNQNVKSIIAPHAGYSYSGRAAAYAYIN 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P +  RVF+LGPSHH Y   C L+    ++TPIG+L +D +   +L  TG F      
Sbjct: 61  LIPENYKRVFILGPSHHVYMKTCGLTKLDTWETPIGNLKVDKDTTNKLFDTGSFIWNTKS 120

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           VDE EHS+E+ LPY+AKV E  L++  IVPI+VG+++ + E +YG++ A Y DDP NFF 
Sbjct: 121 VDEDEHSLELQLPYIAKVAEKRLLELSIVPIMVGSLSIDLEELYGKILAPYFDDPENFFV 180

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF Y  Y+ K   I+K IE LD+ GM+IIE+GDP  F KYL E  NTICG
Sbjct: 181 ISSDFCHWGERFGYTRYENKEVPIYKYIEELDRKGMEIIESGDPVQFSKYLKETKNTICG 240

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           R PI+V L +  N   K  I  L YEQSS+     DSSVSY 
Sbjct: 241 RCPIAVMLWITKNSQFKYNISSLYYEQSSKVIKSSDSSVSYG 282


>gi|340374122|ref|XP_003385587.1| PREDICTED: protein MEMO1-like [Amphimedon queenslandica]
          Length = 299

 Score =  300 bits (768), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 148/279 (53%), Positives = 186/279 (66%), Gaps = 8/279 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
           M   R+ASHAGSWY+ +  +L  +L+ WL +A   G   +P  R +IAPHAGYSYSG  A
Sbjct: 1   MTSTRKASHAGSWYSGSGDKLTSQLEEWLGKATSSGAKHAP-ARAIIAPHAGYSYSGPCA 59

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           AYA+  I P  I RVF+LGPSHH Y P CALS  + Y+TP+ +L +D E+   L +TGKF
Sbjct: 60  AYAYNQIIPAGIKRVFILGPSHHKYMPDCALSACSHYETPLYNLEIDREIYHSLYSTGKF 119

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDD 175
             MD+  DE EHS+EMHLPY+AKV E       IVPILVGA+  + EA YG+L  KY+ D
Sbjct: 120 SQMDVRTDEDEHSIEMHLPYIAKVMESKRGSFTIVPILVGALKPDKEAEYGQLLHKYLLD 179

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
             N F +SSDFCHWG RF + +YDK+HG IH SIEALD+MGM +IE  D   F+KYL  Y
Sbjct: 180 EQNLFVISSDFCHWGKRFQFTYYDKEHGPIHSSIEALDRMGMSLIEALDTKGFQKYLQTY 239

Query: 236 DNTICGRHPISVFLHML--GNCSTKIKIKFLRYEQSSQC 272
            NTICGRHPI V L+ +          +KF +Y QS++C
Sbjct: 240 RNTICGRHPIGVLLNAIDAAQGEKSFVLKFTQYAQSNRC 278


>gi|255075349|ref|XP_002501349.1| predicted protein [Micromonas sp. RCC299]
 gi|226516613|gb|ACO62607.1| predicted protein [Micromonas sp. RCC299]
          Length = 298

 Score =  300 bits (768), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 158/296 (53%), Positives = 195/296 (65%), Gaps = 10/296 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWY+ +  RL++ELDG+L  AG   S   R +I PHAGYSYSG AAA+ + N+
Sbjct: 4   REASHAGSWYSSDRARLSKELDGFLDAAGSHGSTP-RALIVPHAGYSYSGPAAAWGYKNV 62

Query: 65  DP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           D    + RVFLLGPSHH +  +CALS    Y TP+GDL +D  V EEL+ATG F  MD+ 
Sbjct: 63  DAGAGVRRVFLLGPSHHVFLRRCALSKCAAYATPLGDLAVDRGVYEELRATGHFVDMDVD 122

Query: 124 VDEAEHSMEMHLPYLAKVF----EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           VDEAEHS+E+HLPY+ K F    E     +VPI+VGA++   EA YG + A Y+DD +N 
Sbjct: 123 VDEAEHSLELHLPYIFKCFEVLPERDRPTLVPIMVGALSTNAEATYGEILAPYLDDDANL 182

Query: 180 FSVSSDFCHWGSRFNYMHYD--KKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
           F VSSDFCHWG RF Y  +D  +K   +H+ IE +D+ GM+IIE+ D DAF  YL E  N
Sbjct: 183 FIVSSDFCHWGKRFGYQPWDEGRKDVGLHEQIEEMDRRGMEIIESKDADAFAAYLKETGN 242

Query: 238 TICGRHPISVFLHMLGNCS--TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           TICGRHPI V L  L       K +++F RYEQSS      DSSVSYASA    DA
Sbjct: 243 TICGRHPIGVLLQTLRRSGGFDKTRVEFTRYEQSSAATGWNDSSVSYASAVVFRDA 298


>gi|390604307|gb|EIN13698.1| UPF0103-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 333

 Score =  300 bits (767), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 162/327 (49%), Positives = 201/327 (61%), Gaps = 45/327 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----PEVRG---VIAPHAGYSYSGRA 56
           +R ASHAGSWYTD+ +RL  +L GWL++    +     P + G   +I PHAGYSYSG A
Sbjct: 3   VREASHAGSWYTDDRQRLNTQLSGWLQDVKPQEDDNFHPPIEGCKAIIGPHAGYSYSGPA 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+ +ID   I RVF+LGPSHH Y   CALS    Y+TPIGDLPLDLE I++L+ TGK
Sbjct: 63  AAWAYKSIDTVGIRRVFVLGPSHHVYLDGCALSRCNGYETPIGDLPLDLETIQDLRNTGK 122

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           F  MD+  DE EHS+EMHLPY+ KVFEG  IK+VPILVGA++ E E ++G L A Y+   
Sbjct: 123 FSDMDLETDENEHSIEMHLPYIRKVFEGQNIKLVPILVGAIDTEKETLFGELIAPYLLRD 182

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDK-----------------------KHGVIHKSIEALD 213
             F  VSSDFCHWG+RF Y +Y                         +H  IH SI ALD
Sbjct: 183 DTFCVVSSDFCHWGARFRYTNYYPDPALVPSSVRATQLSTSTKATMYQHHPIHASISALD 242

Query: 214 KMGMDII----ETGDPDA---FKKYLLEYDNTICGRHPISVFLHML--------GNCSTK 258
              M+I+     T   DA   F  YL +  NTICGRHPI V L  L        G+    
Sbjct: 243 HEAMNILTIPANTATKDAHNVFAAYLAKTKNTICGRHPIGVLLGALAFIDSTKSGDQHNS 302

Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASA 285
           I ++++RYEQSS+C + RDSSVSYASA
Sbjct: 303 ITMRWVRYEQSSKCASPRDSSVSYASA 329


>gi|119630156|gb|EAX09751.1| hCG401083 [Homo sapiens]
          Length = 295

 Score =  300 bits (767), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 149/290 (51%), Positives = 190/290 (65%), Gaps = 8/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE E S+EMHLPY AK  E H     I+P+LVGA++ +N      LF+KY+ DPSN F V
Sbjct: 127 DEDERSIEMHLPYTAKAMESHKDEFTIIPLLVGALSQKNRN--SELFSKYLADPSNLFVV 184

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF + +YD+  G I +SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 185 SSDFCHWGQRFRHSYYDESQGEICRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 244

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 245 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 294


>gi|388496392|gb|AFK36262.1| unknown [Lotus japonicus]
          Length = 177

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 148/172 (86%), Positives = 162/172 (94%)

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           +MDI VDEAEHSMEMHLPYLAKVFEGH +KIVPILVGAV+AENEAMYG + AKYVDDP+N
Sbjct: 1   MMDIRVDEAEHSMEMHLPYLAKVFEGHPVKIVPILVGAVSAENEAMYGEILAKYVDDPTN 60

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
           FFS+SSDFCHWG+RFNY HYDKKHG I+KSIEALDKMGMDIIE GDPD+FK+YL EYDNT
Sbjct: 61  FFSISSDFCHWGARFNYQHYDKKHGPIYKSIEALDKMGMDIIEIGDPDSFKRYLSEYDNT 120

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           ICGRHPISVFL+ML NCSTKIKIKFLRYEQSSQC++ RDSSVSYASAAAK+D
Sbjct: 121 ICGRHPISVFLYMLRNCSTKIKIKFLRYEQSSQCQSTRDSSVSYASAAAKID 172


>gi|307197875|gb|EFN78974.1| Protein MEMO1 [Harpegnathos saltator]
          Length = 285

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 144/278 (51%), Positives = 190/278 (68%), Gaps = 10/278 (3%)

Query: 21  LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
           L ++L+ WL  A L   P  R +IAPHAGYSY G  A +A+  I P  + R+F+LGPSHH
Sbjct: 10  LRKQLEDWLSAADLSHGP-ARAIIAPHAGYSYCGECAGFAYRQISPVVVRRIFILGPSHH 68

Query: 81  YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAK 140
                CALS+ +VY+TP+ DL +D +V +EL+ TG F+ MD+  DE EHS+EM LP++AK
Sbjct: 69  VRLTSCALSSVSVYQTPLYDLRIDQKVRKELEETGHFDWMDVNTDEEEHSIEMQLPFIAK 128

Query: 141 VFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
           V E       I+PILVG+++ E EA+YGRL A Y+ DP   F +SSDFCHWG RF Y++Y
Sbjct: 129 VMEEFKDSFTIIPILVGSLSPEKEALYGRLLAPYMADPQALFVISSDFCHWGQRFRYIYY 188

Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC--- 255
           D+  G IH+SI+ LDKMGM+IIET +P  F  YL +Y NTICGRHPI + L ++ NC   
Sbjct: 189 DRSCGPIHRSIQNLDKMGMNIIETLNPAMFTDYLKKYGNTICGRHPIGILLQLI-NCLKG 247

Query: 256 ---STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
                ++ +KFL+Y QSSQC    DSSVSYASA+  V+
Sbjct: 248 RTNGHRMNLKFLKYAQSSQCNNMNDSSVSYASASLVVE 285


>gi|225718386|gb|ACO15039.1| MEMO1 [Caligus clemensi]
          Length = 300

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 153/291 (52%), Positives = 192/291 (65%), Gaps = 10/291 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAFG 62
           R ASHAGSWY+ + K L +E+ GWL +   P S     R +I PHAG+ YSG  AAYA+ 
Sbjct: 7   RMASHAGSWYSGDGKVLQKEMSGWLDKVS-PDSATFPARAIIGPHAGFRYSGPTAAYAYK 65

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            I P  I RVF+LGPSHH     C LS  +VY+TP+ DL +D EV  EL     F +  +
Sbjct: 66  QIQPQGIKRVFILGPSHHITVNGCLLSGCSVYETPLYDLLVDKEVNRELMDAKGFGMATL 125

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
             DE EHS+E+ LPY+AK  E       IVPILVG+++ + EA YG++ AKY+ DPSN F
Sbjct: 126 SADEDEHSLELRLPYIAKAMESRKGEFTIVPILVGSLSPDKEAKYGKILAKYLMDPSNLF 185

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDFCHWG RF+Y +YDK+ G IH+SI +LD  GMDIIE  D DAF  YL +Y NTIC
Sbjct: 186 VISSDFCHWGERFDYTYYDKEAGEIHESISSLDHKGMDIIENLDHDAFAAYLKKYRNTIC 245

Query: 241 GRHPISVFLHMLGNCSTK-----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           GRHPI VFL M+     +     + +KFL+Y QS +C+T  DSSVSYASAA
Sbjct: 246 GRHPIGVFLGMVKAIRQRGDAPEMCLKFLKYAQSDKCRTLSDSSVSYASAA 296


>gi|426392964|ref|XP_004062805.1| PREDICTED: protein MEMO1-like isoform 1 [Gorilla gorilla gorilla]
 gi|426392966|ref|XP_004062806.1| PREDICTED: protein MEMO1-like isoform 2 [Gorilla gorilla gorilla]
          Length = 297

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 145/290 (50%), Positives = 188/290 (64%), Gaps = 6/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y    AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYAYCRSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+ +LGPSHH    +CAL++  +Y+TP+ DL +D ++  EL  TG F  M +  
Sbjct: 67  DPSITRRILILGPSHHVPLSRCALASVDIYRTPLNDLLIDQKIYGELWKTGMFGRMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EH +EMHLPY AK  E H     I+P+ VGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHGIEMHLPYTAKAMESHKDEFTIIPLPVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HP  V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 HPTGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296


>gi|29841028|gb|AAP06041.1| similar to GenBank Accession Number BC018733 CGI-27 protein in Homo
           sapiens [Schistosoma japonicum]
          Length = 298

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 145/294 (49%), Positives = 193/294 (65%), Gaps = 9/294 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRASHAGSWY+ +  RL  +L  WL E  + + P  R +I+PHAGY YSG  AA+A+  
Sbjct: 5   VRRASHAGSWYSGDRARLNNQLQNWLSEVTVIRQP-ARAIISPHAGYDYSGPCAAFAYKQ 63

Query: 64  IDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           IDP  I RVF+LGP+H+     KCALSTA +++TPI  L +D +V  +L  TG+F  + +
Sbjct: 64  IDPRLIKRVFVLGPAHYMSLRGKCALSTADLFETPIYSLSIDRDVYRDLDKTGEFVSLSL 123

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
             DE EHS+EM +PY+AKV E H     +VPILVG ++ E EA+YG++FA+Y+ DP NFF
Sbjct: 124 DRDEEEHSIEMQMPYVAKVMERHQGGFSVVPILVGYLSPEREAVYGQIFARYLSDPENFF 183

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDFCHWG RF Y +YD+  G I +SI+ALD+ GM IIE      F  YL +Y NTIC
Sbjct: 184 VISSDFCHWGKRFQYTYYDQSKGPIWQSIQALDEAGMQIIERLAASEFTSYLEQYGNTIC 243

Query: 241 GRHPISVFLHMLGNCSTKI-----KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           GRHPI V L  + +  T        +KF+RY QS +C     SSVSYA+   ++
Sbjct: 244 GRHPIGVLLQTVASLRTNAPNNNYSMKFVRYTQSERCNNLNQSSVSYAAGVLQI 297


>gi|298714279|emb|CBJ27395.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 351

 Score =  296 bits (758), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 137/283 (48%), Positives = 195/283 (68%), Gaps = 2/283 (0%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAFG 62
           RRASHAG+WY+D+   L ++L  WL  +     P   VR +IAPHAG+SYS + AA A+ 
Sbjct: 61  RRASHAGTWYSDSGPLLGQQLTDWLGASAHTTKPGSAVRAIIAPHAGFSYSAQTAASAYA 120

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           ++DP+ ++RVF+LGPSHH +  +CA+STA+  +TPIGDL +D +V  +L ATG F+ M  
Sbjct: 121 HMDPSKLARVFVLGPSHHVHMRRCAVSTASRLETPIGDLAIDQDVTRQLLATGDFDAMTA 180

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
            +DE EHS+EMHLP++ KV EG    +V I++GA++AE E  YGR+ A Y++D +NFF +
Sbjct: 181 RMDEDEHSIEMHLPFIKKVMEGRPCTVVAIMIGALSAELEEHYGRILAPYLEDLTNFFII 240

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           S+DFCHWGSRF Y      H  IH+ I  LD  GM +I++ D   F +YL E+ NTICGR
Sbjct: 241 STDFCHWGSRFRYQPLGAAHSAIHEHISWLDHEGMKLIQSQDAVGFHRYLAEHGNTICGR 300

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           HPI+ F++ +   +  I + ++ Y QS++  + +DSSVSYASA
Sbjct: 301 HPIATFINAIEASTVCICVTWVEYAQSTKVVSPQDSSVSYASA 343


>gi|328857497|gb|EGG06613.1| hypothetical protein MELLADRAFT_48375 [Melampsora larici-populina
           98AG31]
          Length = 325

 Score =  296 bits (757), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 150/316 (47%), Positives = 194/316 (61%), Gaps = 36/316 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS------------------PEVRGVIAP 46
           R A+HAGSWYT +  +L  E+D WL EA  PK+                     + VIAP
Sbjct: 4   RSATHAGSWYTSSPNKLRNEIDTWL-EAAHPKAETTEETQDWRKVAMPVPISNCKAVIAP 62

Query: 47  HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
           HAGYSYSG  AA+A+  IDP+ I RVF+LGPSHH Y   CALS    Y+TP+GDL +D++
Sbjct: 63  HAGYSYSGSTAAWAYKTIDPSVIKRVFILGPSHHVYLDGCALSGCEAYETPLGDLEIDVQ 122

Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
           V  EL  +G FE MDI  DEAEHS+E+HLPY+  +F+   I IVPIL+G++++E E +YG
Sbjct: 123 VTSELVQSGLFEEMDIETDEAEHSIELHLPYVRHIFKDQTITIVPILIGSISSEKELIYG 182

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-----------------DKKHGVIHKSI 209
            +   Y+  P N F +SSDFCHWGSRF+Y +Y                       I++SI
Sbjct: 183 EVLRPYLLSPENLFVISSDFCHWGSRFSYTYYKDPTQENEPARKLSSSASPPATPIYQSI 242

Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
           EALD+  ++I+E+     F +YL +  NTICGRHPI V L  L   +   K+KF  Y QS
Sbjct: 243 EALDREALNILESLSHSEFSRYLSQTKNTICGRHPIGVLLGALQTITEPAKLKFTCYAQS 302

Query: 270 SQCKTKRDSSVSYASA 285
           SQC + RDSSVSYASA
Sbjct: 303 SQCVSARDSSVSYASA 318


>gi|82802790|gb|ABB92445.1| rcC2orf4 [Homo sapiens]
          Length = 295

 Score =  295 bits (754), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 147/290 (50%), Positives = 189/290 (65%), Gaps = 8/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  ++  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQVNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYFGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LG SHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGLSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE E S+EMHLPY AK  E H     I+P+LVGA++ +N      LF+KY+ DPSN F V
Sbjct: 127 DEDERSIEMHLPYTAKAMESHKDEFTIIPLLVGALSQKNRN--SELFSKYLADPSNLFVV 184

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF + +YD+  G I +SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 185 SSDFCHWGQRFRHSYYDESQGEICRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 244

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 245 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 294


>gi|395828933|ref|XP_003787616.1| PREDICTED: protein MEMO1 [Otolemur garnettii]
          Length = 324

 Score =  294 bits (753), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 145/285 (50%), Positives = 187/285 (65%), Gaps = 8/285 (2%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
            GS+Y     +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+  
Sbjct: 42  TGSYY--QGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSIT 98

Query: 70  SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
            R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EH
Sbjct: 99  RRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEH 158

Query: 130 SMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFC 187
           S+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFC
Sbjct: 159 SIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFC 218

Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
           HWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V
Sbjct: 219 HWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGV 278

Query: 248 FLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 279 LLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 323


>gi|345781967|ref|XP_540144.3| PREDICTED: protein MEMO1 [Canis lupus familiaris]
          Length = 332

 Score =  294 bits (752), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 58  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 116

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 117 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 176

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 177 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 236

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 237 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 296

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 297 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 331


>gi|148706476|gb|EDL38423.1| RIKEN cDNA 0610016J10, isoform CRA_b [Mus musculus]
          Length = 277

 Score =  294 bits (752), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 184/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 3   QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSVTRRIFILGPSH 61

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 62  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 121

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 122 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 181

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 182 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 241

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 242 NGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 276


>gi|324517153|gb|ADY46739.1| Unknown [Ascaris suum]
          Length = 302

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 146/295 (49%), Positives = 193/295 (65%), Gaps = 8/295 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR A+HAGSWY+DN K L  +L  WL  AG   +   R +I+PHAGYSY G  AAYAF  
Sbjct: 5   IRNATHAGSWYSDNPKELNRQLSEWLAAAGPRVAGPARAIISPHAGYSYCGSTAAYAFKQ 64

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I P  I  +F+LGPSH      CALS+ + Y+TP+GDL +D  V  EL+ TG FE MD  
Sbjct: 65  IVPDHIETIFVLGPSHVVCLSGCALSSCSRYRTPLGDLYIDQRVNAELRETGAFETMDFR 124

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLI---KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
            +EAEHS+EM LP++AKV E        IVPILVG++++  +  YG++FAKYV DP N F
Sbjct: 125 SEEAEHSLEMQLPFIAKVMEKQPAGSYNIVPILVGSLSSSRQISYGKIFAKYVADPRNLF 184

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            +SSDFCHWG+RF++  Y+   G+ IH+ I ALDK GM+ I   +P AF +YL +  NTI
Sbjct: 185 VISSDFCHWGNRFHFTPYEHNTGLPIHEQIAALDKQGMEAISRLEPSAFNEYLKKTQNTI 244

Query: 240 CGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           CGR+PI V L    H     +   +++FL+Y QS+QC+   DSSVSYA+ A  ++
Sbjct: 245 CGRNPICVMLQAAEHFRQMNNHSAELRFLKYSQSNQCRNLSDSSVSYAAGALFIN 299


>gi|355702140|gb|AES01834.1| mediator of cell motility 1 [Mustela putorius furo]
          Length = 275

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 2   QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 60

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 61  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 120

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 121 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 180

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 181 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 240

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 241 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 275


>gi|229595124|ref|XP_001019799.3| hypothetical protein TTHERM_00138170 [Tetrahymena thermophila]
 gi|225566377|gb|EAR99554.3| hypothetical protein TTHERM_00138170 [Tetrahymena thermophila
           SB210]
          Length = 295

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 139/290 (47%), Positives = 193/290 (66%), Gaps = 5/290 (1%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAFG 62
           R ASHA SWYT N K L +EL G+L +A   +    +V+ +I PHAGY YSG  AA+ + 
Sbjct: 6   RLASHANSWYTGNQKELDQELSGYLSKANKQQLNIQKVKALIGPHAGYYYSGPTAAWGYQ 65

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            + P    RVFLLGP HH Y   C +S    ++TP+GD+ LD + I+ELK TGKF+++  
Sbjct: 66  YLQPIENLRVFLLGPCHHIYMNGCGISGLEKFQTPLGDITLDKQTIQELKNTGKFDVVSK 125

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
             +E EHS+EMHLPY+ K+      K+VPI+VG ++ + E  YG+L +KY DD +  F +
Sbjct: 126 KNEEDEHSLEMHLPYIQKILGQQQFKLVPIMVGNLSYQREQEYGQLLSKYFDDDNTVFII 185

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWGSRF+Y +Y+K+ G I +SI+ LD  GM++IE  D + F KYL + +NTICGR
Sbjct: 186 SSDFCHWGSRFDYQYYNKQDGEIWQSIQKLDTAGMELIENHDCEKFNKYLEQTENTICGR 245

Query: 243 HPISVFLHMLGNCST---KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L ++ +      ++K KF+ YEQS Q KT++ SSVSYA++   V
Sbjct: 246 HPIGVLLQVIKSTQKYAGQLKTKFMHYEQSEQVKTQKSSSVSYATSITYV 295


>gi|194220833|ref|XP_001917913.1| PREDICTED: protein MEMO1-like [Equus caballus]
          Length = 282

 Score =  293 bits (751), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 8   QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 66

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 67  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 126

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 127 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 186

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 187 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 246

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 247 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 281


>gi|395507123|ref|XP_003757877.1| PREDICTED: protein MEMO1 [Sarcophilus harrisii]
          Length = 299

 Score =  293 bits (751), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 143/275 (52%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DPT   R+F+LGPSH
Sbjct: 25  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPTITRRIFILGPSH 83

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 84  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 143

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 144 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 203

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 204 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 263

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 264 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 298


>gi|114576871|ref|XP_001164069.1| PREDICTED: protein MEMO1 isoform 2 [Pan troglodytes]
 gi|397513836|ref|XP_003827213.1| PREDICTED: protein MEMO1-like [Pan paniscus]
 gi|56377671|dbj|BAD74067.1| C2orf4 [Homo sapiens]
          Length = 300

 Score =  293 bits (751), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 26  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 84

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 85  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 144

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 145 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 204

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 205 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 264

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 265 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 299


>gi|281349207|gb|EFB24791.1| hypothetical protein PANDA_019259 [Ailuropoda melanoleuca]
 gi|351701080|gb|EHB03999.1| Protein MEMO1, partial [Heterocephalus glaber]
 gi|432096762|gb|ELK27340.1| Protein MEMO1, partial [Myotis davidii]
          Length = 277

 Score =  293 bits (751), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 3   QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 61

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 62  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 121

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 122 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 181

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 182 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 241

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 242 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 276


>gi|109102576|ref|XP_001105478.1| PREDICTED: protein MEMO1-like [Macaca mulatta]
          Length = 297

 Score =  293 bits (751), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 23  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 81

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 82  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 141

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 142 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 201

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 202 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 261

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 262 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296


>gi|441662627|ref|XP_003262755.2| PREDICTED: protein MEMO1 [Nomascus leucogenys]
          Length = 339

 Score =  293 bits (750), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 65  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 123

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 124 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 183

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 184 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 243

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 244 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 303

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 304 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 338


>gi|426226442|ref|XP_004007352.1| PREDICTED: protein MEMO1 [Ovis aries]
          Length = 439

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 165 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 223

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 224 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 283

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 284 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 343

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 344 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 403

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 404 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 438


>gi|431911965|gb|ELK14109.1| Protein MEMO1 [Pteropus alecto]
          Length = 297

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 141/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 23  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 81

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE +HS+EMHLPY A
Sbjct: 82  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDQHSIEMHLPYTA 141

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 142 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 201

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 202 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 261

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 262 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296


>gi|403307071|ref|XP_003944034.1| PREDICTED: protein MEMO1 [Saimiri boliviensis boliviensis]
          Length = 310

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 36  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 94

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 95  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 154

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 155 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 214

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 215 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 274

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 275 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 309


>gi|300676818|gb|ADK26694.1| mediator of cell motility 1 [Zonotrichia albicollis]
          Length = 276

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 140/275 (50%), Positives = 182/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP    ++F+LGPSH
Sbjct: 2   QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKIFILGPSH 60

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 61  HVPLSRCALSSVDIYRTPLYDLRIDQKIYAELWKTGMFERMSLQTDEDEHSIEMHLPYTA 120

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 121 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 180

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+ +     
Sbjct: 181 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 240

Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 241 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 275


>gi|301787491|ref|XP_002929161.1| PREDICTED: protein MEMO1-like [Ailuropoda melanoleuca]
          Length = 463

 Score =  292 bits (747), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 142/279 (50%), Positives = 185/279 (66%), Gaps = 6/279 (2%)

Query: 16  DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLL 75
           ++  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+L
Sbjct: 185 EDRPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFIL 243

Query: 76  GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHL 135
           GPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHL
Sbjct: 244 GPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHL 303

Query: 136 PYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRF 193
           PY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF
Sbjct: 304 PYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRF 363

Query: 194 NYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM-- 251
            Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+   
Sbjct: 364 RYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAIT 423

Query: 252 -LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 424 ELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 462


>gi|440908306|gb|ELR58341.1| Protein MEMO1, partial [Bos grunniens mutus]
          Length = 277

 Score =  292 bits (747), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 141/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 3   QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 61

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 62  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 121

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ +PSN F VSSDFCHWG RF Y +
Sbjct: 122 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLAEPSNLFVVSSDFCHWGQRFRYSY 181

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 182 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 241

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 242 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 276


>gi|395334280|gb|EJF66656.1| hypothetical protein DICSQDRAFT_95267 [Dichomitus squalens LYAD-421
           SS1]
          Length = 326

 Score =  292 bits (747), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 160/319 (50%), Positives = 190/319 (59%), Gaps = 38/319 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLRE-------AGLPKSPEVRGVIAPHAGYSYSGRAA 57
           R+A+HA SWYT N  +L +EL GWL E       A  P  P  + +IAPHAGYSYSGRAA
Sbjct: 4   RQATHADSWYTGNGSKLDQELSGWLGEVKPASEDAFNPPVPGTKAIIAPHAGYSYSGRAA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+ +ID T I RVF+LGPSHH Y   CAL+    Y+TPIG LP+D+E  +ELK TG+F
Sbjct: 64  AWAYKSIDTTGIKRVFVLGPSHHVYLDGCALTRCAQYETPIGALPIDVETTKELKQTGQF 123

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             M +  DE EHS+EMHLPY+ K FEG  IKIVPILVGA+N + E  YG L A Y+    
Sbjct: 124 SDMSLQTDEDEHSIEMHLPYIRKTFEGLDIKIVPILVGAINYDKEVSYGHLLAPYLARED 183

Query: 178 NFFSVSSDFCHWGSRFNYMHY----------------------DKKHGVIHKSIEALDKM 215
             F VSSDFCHWG RF Y  Y                        K   IH SI ALD  
Sbjct: 184 TIFVVSSDFCHWGLRFQYTFYYPQPPPSSAAPIRLSRSETPPASLKTHPIHASISALDHE 243

Query: 216 GMDIIETGDPDA------FKKYLLEYDNTICGRHPISVF---LHMLGNCSTKIKIKFLRY 266
           GM+++      A      F  YL    NTICGRHPI V    L  L    T  K+K++RY
Sbjct: 244 GMELLTLPPFTASQAHRTFADYLARTKNTICGRHPIGVLFGALAALEQQGTSPKLKWVRY 303

Query: 267 EQSSQCKTKRDSSVSYASA 285
           EQSS C+T +DSSVSYASA
Sbjct: 304 EQSSACETVKDSSVSYASA 322


>gi|312084389|ref|XP_003144256.1| hypothetical protein LOAG_08678 [Loa loa]
 gi|307760580|gb|EFO19814.1| hypothetical protein LOAG_08678 [Loa loa]
          Length = 305

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 147/297 (49%), Positives = 198/297 (66%), Gaps = 12/297 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R ASHAGSWYT++ ++L  EL  WL  A    +   R +I+PHAGYSYSGR AA+AF  
Sbjct: 8   VRNASHAGSWYTNDPRKLHRELTEWLGAADSRHAQSARAIISPHAGYSYSGRIAAFAFKQ 67

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I P ++S +F+LGPSH      CALST + Y+TPIGDL +D     ELK TG F LMD+ 
Sbjct: 68  IIPETVSTIFVLGPSHVMSLDTCALSTCSRYRTPIGDLQIDQRTNTELKETGAFSLMDLR 127

Query: 124 VDEAEHSMEMHLPYLAKVFE---GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
            +EAEHS+EM LPY+AK+ E    +   IVP+LVG+++   +A YG++F+KY+ DP   F
Sbjct: 128 SEEAEHSIEMQLPYIAKIMEKQSANGYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVF 187

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            VSSDFCHWGSRF++M +D   GV I++ I A+DK GMD I + +P AF +YL    NTI
Sbjct: 188 VVSSDFCHWGSRFHFMPHDNTTGVPIYEQIAAMDKQGMDAISSLNPVAFGEYLKRTQNTI 247

Query: 240 CGRHPISVFLH------MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           CGR+PISV L+       + N     +  FL+Y QS+Q ++  DSSVSYA+ A  ++
Sbjct: 248 CGRNPISVILYAAEYFRQMNN--HLAEFVFLKYAQSNQSRSLNDSSVSYAAGALFIN 302


>gi|443898721|dbj|GAC76055.1| predicted dioxygenase [Pseudozyma antarctica T-34]
          Length = 347

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 164/336 (48%), Positives = 197/336 (58%), Gaps = 54/336 (16%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL-------------------REAGLPKS-PEVRGV 43
           IR+ASHAGSWYT++   L E L  WL                   R   LP   P  R +
Sbjct: 8   IRQASHAGSWYTNDRAELDESLTEWLSDVKADQLPSPLAVCDHKPRSGSLPLPIPTCRAI 67

Query: 44  IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
           I PHAGY+YSG AAAYA+  ID  +I RVF+LGPSHH Y   CA+S  + Y TP+G+L +
Sbjct: 68  IGPHAGYAYSGPAAAYAYKCIDIAAIERVFILGPSHHVYLDGCAVSRCSQYATPLGNLEV 127

Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
           D +V EELKATGKFE M    DE EHSMEMHLPY+ KVFEG  I+IVPILVGA+N + E 
Sbjct: 128 DRQVTEELKATGKFETMTQSQDEDEHSMEMHLPYIYKVFEGRPIRIVPILVGAINTKTED 187

Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH------------------GVI 205
           +YG+L A Y++D  NFF VSSDFCHWG+RF Y +Y                        I
Sbjct: 188 VYGKLLAPYLNDSRNFFVVSSDFCHWGTRFRYTYYKPAEDSPATMLSARTPRSAYEGAPI 247

Query: 206 HKSIEALDKMGMDII-----ETGDP-------DAFKKYLLEYDNTICGRHPISVFLHMLG 253
           H+SI ALD+ G+  I     + G+         AF +YL    NT+CGRHPI V L  L 
Sbjct: 248 HQSIRALDEQGIFAITYPWTQGGESKTAAEARKAFAEYLSSTKNTVCGRHPIGVLLAALA 307

Query: 254 NCSTKIKIK----FLRYEQSSQCKTKRDSSVSYASA 285
                  IK    F RYEQSSQC + +DSSVSYASA
Sbjct: 308 ELEKAAGIKSECRFTRYEQSSQCTSPQDSSVSYASA 343


>gi|326915459|ref|XP_003204035.1| PREDICTED: protein MEMO1-like [Meleagris gallopavo]
          Length = 290

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 140/275 (50%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP    ++F+LGPSH
Sbjct: 16  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKIFILGPSH 74

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 75  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 134

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 135 KAMESHKDEFTIIPVLVGALSEAKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 194

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
           YD+  G I++SIE LDKMGM+IIE  DP +F  YL +Y NTICGRHPI V L+ +     
Sbjct: 195 YDESQGEIYRSIEHLDKMGMNIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 254

Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 255 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 289


>gi|449277723|gb|EMC85796.1| Protein MEMO1 [Columba livia]
          Length = 287

 Score =  291 bits (745), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 141/275 (51%), Positives = 182/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP    +VF+LGPSH
Sbjct: 13  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKVFILGPSH 71

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 72  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 131

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 132 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 191

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+ +     
Sbjct: 192 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 251

Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 252 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 286


>gi|354480709|ref|XP_003502547.1| PREDICTED: protein MEMO1-like [Cricetulus griseus]
          Length = 428

 Score =  291 bits (745), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP    R+F+LGPSH
Sbjct: 154 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPAITRRIFILGPSH 212

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 213 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 272

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 273 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 332

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+    L  
Sbjct: 333 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 392

Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 393 NGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 427


>gi|449496796|ref|XP_002189221.2| PREDICTED: protein MEMO1 [Taeniopygia guttata]
          Length = 360

 Score =  290 bits (742), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 140/275 (50%), Positives = 182/275 (66%), Gaps = 6/275 (2%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP    ++F+LGPSH
Sbjct: 86  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKIFILGPSH 144

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 145 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 204

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 205 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 264

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
           YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+ +     
Sbjct: 265 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 324

Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 325 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 359


>gi|393218450|gb|EJD03938.1| UPF0103-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 391

 Score =  290 bits (741), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 165/339 (48%), Positives = 198/339 (58%), Gaps = 56/339 (16%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA--------GLPKSPEVRGVIAPHAGYSY 52
           +  I RA+HAGSWYTD ++RL +EL  WL +A         LP     + +IAPHAGYSY
Sbjct: 51  LNTITRATHAGSWYTDRAQRLNDELSTWLNDAQPDAEDGFALPVK-GCKAIIAPHAGYSY 109

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           SG AA +A+ +ID T I RVF+LGPSHH Y   CALS    Y TPIG+LPLD+E IEEL+
Sbjct: 110 SGPAAGWAYKSIDTTGIKRVFILGPSHHVYLDCCALSRFQEYATPIGNLPLDIETIEELR 169

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
            TG FE M    DE EHS+EMHLPY+ K FEG  IKIVPILVGA++ E+E  +G++ A Y
Sbjct: 170 KTGSFEDMRSDTDEDEHSIEMHLPYVRKAFEGVDIKIVPILVGAISKESEETFGKILAPY 229

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV----------------IH 206
                 F  VSSDFCHWG+RF Y  Y            + G+                IH
Sbjct: 230 FTREDTFCVVSSDFCHWGTRFQYTFYYPSLPPTSGTVGESGIRLSRTNISSSLSPDHEIH 289

Query: 207 KSIEALDKMGMDIIETGDP-------DAFKKYLLEYDNTICGRHPISVFLHMLGNCST-- 257
           +SI ALD   MDI+ T  P       DAF  YL    NTICGRHPI V L  L    +  
Sbjct: 290 ESISALDHQAMDIL-TLPPSTAQESHDAFSVYLARTKNTICGRHPIGVLLGALAVLESAV 348

Query: 258 -----------KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
                      K  +K++RYEQSSQC+T  DSSVSYASA
Sbjct: 349 TESEVPNGEPRKATVKWVRYEQSSQCRTISDSSVSYASA 387


>gi|302697823|ref|XP_003038590.1| hypothetical protein SCHCODRAFT_47866 [Schizophyllum commune H4-8]
 gi|300112287|gb|EFJ03688.1| hypothetical protein SCHCODRAFT_47866, partial [Schizophyllum
           commune H4-8]
          Length = 317

 Score =  289 bits (740), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 160/317 (50%), Positives = 194/317 (61%), Gaps = 36/317 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL-----REAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           R ASHAGSWYT++   L E LD WL      E   P     + VIAPHAGYSYSG AAA+
Sbjct: 1   RAASHAGSWYTNDGDELNERLDAWLAAVEPTEDAYPIKGS-KAVIAPHAGYSYSGPAAAW 59

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+ +ID T I RVF+LGPSHH Y   CALS    Y TPIG+LPLDL+ I+EL  TGKFE 
Sbjct: 60  AYKSIDTTGIKRVFVLGPSHHIYLDGCALSRCKEYATPIGNLPLDLDAIKELGNTGKFEY 119

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           +DI  DEAEHS+EMHLPY+ K+F+G   I+IVPI+VGA+N   E  +G L A Y+  P  
Sbjct: 120 LDINDDEAEHSLEMHLPYVRKIFQGFRDIRIVPIMVGAINERTEKEFGALLAPYLAQPDT 179

Query: 179 FFSVSSDFCHWGSRFNYMHY-----------------DKKHGVIHKSIEALDKMGMDII- 220
           F  VSSDFCHWG+RF Y  Y                   +   IH+SI  LD+  MDI+ 
Sbjct: 180 FCVVSSDFCHWGTRFQYTFYYPSPTAESGLRLSRSEPPSQDHPIHESITRLDREAMDILT 239

Query: 221 ------ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC----STKIKIKFLRYEQSS 270
                  T  P+ F +YL    NTICGRHPI V    +        T+  +K++RYE SS
Sbjct: 240 IPPGKASTAHPE-FARYLSRTKNTICGRHPIGVLFGAIAKLEETQGTEATVKWVRYEHSS 298

Query: 271 QCKTKRDSSVSYASAAA 287
           +C+T RDSSVSYASA A
Sbjct: 299 ECETIRDSSVSYASAWA 315


>gi|405968067|gb|EKC33170.1| Protein MEMO1 [Crassostrea gigas]
          Length = 384

 Score =  289 bits (740), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 146/280 (52%), Positives = 187/280 (66%), Gaps = 11/280 (3%)

Query: 16  DNSKR-LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFL 74
           D  +R L+ +LD WL +A    SP  R +IAPHAGY Y G    +A+  IDP++I R+F+
Sbjct: 103 DQKRRELSSQLDEWLSKAQPSYSP-ARAIIAPHAGYVYCGACGGHAYRQIDPSNIKRIFI 161

Query: 75  LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMH 134
           LGPSHH     CAL+  + Y+TP+ DL +D ++ EEL AT  FE M +  DE EHS+EMH
Sbjct: 162 LGPSHHVRLSGCALTETSHYQTPLYDLTIDQKINEELFATKAFEKMKMSTDEDEHSIEMH 221

Query: 135 LPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSR 192
           LPY+AKV E       I+P+LVG+++A+ E +YG +F++Y+ DP NFF +SSDFCHWG R
Sbjct: 222 LPYIAKVMERRRGQFTIIPVLVGSLSADKEKLYGSIFSQYLADPENFFVISSDFCHWGQR 281

Query: 193 FNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           F Y  YDK  G I +SIEALDKMGMD IE  DP  F  YL EY+NTICGR+PI V L  +
Sbjct: 282 FRYTFYDKSCGDIWQSIEALDKMGMDAIEQMDPAKFSHYLKEYENTICGRYPIGVLLSAI 341

Query: 253 ------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
                 GN   ++  KF+ Y QSS+CK   DSSVSYAS A
Sbjct: 342 DALRRNGN-GRRMSFKFMNYAQSSRCKKMSDSSVSYASGA 380



 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 52/104 (50%), Positives = 72/104 (69%), Gaps = 1/104 (0%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M K+RRA+HAGSWYT++ + L+ +LD WL +A    SP  R +IAPHAGY Y G    +A
Sbjct: 1   MSKVRRATHAGSWYTNDGRELSSQLDEWLSKAQPSYSP-ARAIIAPHAGYVYCGACGGHA 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
           +  IDP++I R+F+LGPSHH     CAL+  + Y+TP+ DL +D
Sbjct: 60  YRQIDPSNIKRIFILGPSHHVRLSGCALTETSHYQTPLYDLTID 103


>gi|343429785|emb|CBQ73357.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 344

 Score =  289 bits (740), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 161/338 (47%), Positives = 193/338 (57%), Gaps = 53/338 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLRE---AGLP-------KSPEV---------- 40
           M   R ASHAGSWYTD++  L   L  WL     + LP       +SP            
Sbjct: 1   MAPTRTASHAGSWYTDDADELDAALSSWLEAVTPSSLPAPLSICERSPTTEALPLPVAGS 60

Query: 41  RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
           R +I PHAGYSYSG AAAYA+  ID T IS VF+LGPSHH Y   CA+S    Y TP+G 
Sbjct: 61  RAIIGPHAGYSYSGPAAAYAYRTIDTTRISTVFILGPSHHVYLDGCAVSACAQYDTPLGS 120

Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
           L +D ++ E+LKATGKF +M +  DE EHS+EMHLPY+ KVFEG  ++IVPILVGA+N  
Sbjct: 121 LQIDRKITEDLKATGKFSVMSLAEDEDEHSIEMHLPYVFKVFEGRDVRIVPILVGAINMS 180

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------- 204
            E  YGRL A Y+ DPS FF VSSDFCHWG+RF Y HY                      
Sbjct: 181 KENEYGRLLAPYLRDPSTFFVVSSDFCHWGTRFRYTHYTPSGSSVPTHLSTRSPASLFEA 240

Query: 205 --IHKSIEALDKMGMDIIE---TGDPD---------AFKKYLLEYDNTICGRHPISVFLH 250
             IH+SI  LD+ G+  I    T D           AF  YL    NT+CGRHPI V L 
Sbjct: 241 KPIHQSIRELDQAGILAITYPWTHDTQPKSAEHARLAFAHYLASTKNTVCGRHPIGVLLA 300

Query: 251 MLGNCST---KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            L    +   + + +F RYEQSS+C   +DSSVSYASA
Sbjct: 301 ALAELESNGVQSECRFTRYEQSSECVRVQDSSVSYASA 338


>gi|325184143|emb|CCA18601.1| memolike protein putative [Albugo laibachii Nc14]
 gi|325186056|emb|CCA20558.1| memolike protein putative [Albugo laibachii Nc14]
          Length = 302

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 139/290 (47%), Positives = 193/290 (66%), Gaps = 7/290 (2%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRAA 57
           +IR+A+HA +WY D+  +L  +LD WL +A        +S  V G+IAPHAG+ +SG  A
Sbjct: 4   RIRKATHAATWYIDDPSKLGPQLDAWLEDAVHEGKDARESKRVNGIIAPHAGFRFSGSTA 63

Query: 58  AYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           A+A+ ++ + T I RVF+LGPSHH Y   CAL++A+ Y+TP G LP+D E+ E L  TGK
Sbjct: 64  AHAYCHLLERTDIKRVFVLGPSHHVYLEGCALTSASHYETPFGMLPVDEEINEILMKTGK 123

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           F  M + VDEAEHS+EMHLP++A+  +G  + +VPILVG  N  N   YGRL A++++D 
Sbjct: 124 FRRMSMSVDEAEHSIEMHLPFIARTLKGQSLSLVPILVGNTNQNNNLEYGRLLAQFMNDQ 183

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
           SNFF +SSDFCHWG+RF Y  +D  +G IH  I+ LD   + ++E  +   F  YL    
Sbjct: 184 SNFFVISSDFCHWGARFRYQPHDASYGEIHDYIKHLDHEAIKLLEDLNATGFATYLESTK 243

Query: 237 NTICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           NTICG+HPIS+ +  +L     +  I+F++Y QS  CK K +SSVSYASA
Sbjct: 244 NTICGQHPISIIMQAVLALDGLQPAIRFVKYAQSGACKKKSESSVSYASA 293


>gi|388853823|emb|CCF52544.1| uncharacterized protein [Ustilago hordei]
          Length = 353

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 158/336 (47%), Positives = 194/336 (57%), Gaps = 55/336 (16%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--------------------EVRGVI 44
           R+ASHAGSWY+DN K L   L  WL +  +   P                      R +I
Sbjct: 12  RQASHAGSWYSDNQKDLDASLSEWLEDVDVKSLPTPLSVCEHKPSAESLPLPITNCRALI 71

Query: 45  APHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
            PHAGYSYSG AAAYA+  ID + I R+F+LGPSHH Y   CA+S    Y+TP+G+L +D
Sbjct: 72  GPHAGYSYSGPAAAYAYRCIDNSQIERIFILGPSHHVYLDGCAVSGCEEYETPLGNLKID 131

Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM 164
            ++ E LKATGKF  M    DE EHS+EMHLPY+ K+F+G  I+IVPILVGA+N   E  
Sbjct: 132 RQITEALKATGKFGTMTQSQDEDEHSIEMHLPYIYKMFKGKSIQIVPILVGAINTSKEDA 191

Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK------------------KHGVIH 206
           YG+L A Y+ DP NFF VSSDFCHWGSRF+Y +Y                    +   IH
Sbjct: 192 YGKLLAPYLKDPRNFFVVSSDFCHWGSRFSYTYYRPAGSSLATTLSNRSPRSAYEQPPIH 251

Query: 207 KSIEALDKMGMDII--------ETGDPD-----AFKKYLLEYDNTICGRHPISVFLHMLG 253
           +SI  LD+ G++ I        +TG        AF +YL    NT+CGRHPI V L  L 
Sbjct: 252 QSIRELDESGIEAITHPWTKGGKTGKSAEEARLAFAEYLSSTKNTVCGRHPIGVLLAALA 311

Query: 254 NCSTKIKI----KFLRYEQSSQCKTKRDSSVSYASA 285
                 KI    +F RYEQSSQC T +DSSVSYASA
Sbjct: 312 ELEKTDKIQTECRFTRYEQSSQCLTPQDSSVSYASA 347


>gi|291222620|ref|XP_002731317.1| PREDICTED: C21orf19-like protein-like [Saccoglossus kowalevskii]
          Length = 255

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 140/250 (56%), Positives = 176/250 (70%), Gaps = 5/250 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAFG 62
           RRA+HAGSWYT + K L+ ELD WL+ A L    SP  R +IAPHAGY+Y G+  A+A+ 
Sbjct: 6   RRATHAGSWYTRSGKDLSAELDQWLQGAPLDSHYSP-ARAIIAPHAGYAYCGKCGAHAYK 64

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            +DPT++ +VF+LGPSHH     CALS    Y TP+ +LP+D +V  EL +TG+FE M I
Sbjct: 65  QVDPTNVKKVFILGPSHHVRLSGCALSRTVTYDTPLYNLPIDTQVYGELMSTGQFEKMSI 124

Query: 123 CVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
             DE EHS+EMHLPY+AKV E       IVPILVGA++ + E  YGR+ AKY++DP N F
Sbjct: 125 ETDEDEHSIEMHLPYVAKVMESKRDQFTIVPILVGALSTDKEQKYGRILAKYLEDPHNLF 184

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDFCHWG RF+Y +YD+  G I+KSI+ LD  GM+ IE  DP  F KYL  Y NTIC
Sbjct: 185 VISSDFCHWGERFHYTYYDESCGEIYKSIDVLDHKGMNHIENLDPSGFSKYLKMYGNTIC 244

Query: 241 GRHPISVFLH 250
           GRHPI VFL+
Sbjct: 245 GRHPIGVFLN 254


>gi|115497104|ref|NP_001069863.1| protein MEMO1 [Bos taurus]
 gi|118572779|sp|Q2HJH7.1|MEMO1_BOVIN RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
           cell motility 1; Short=Memo-1
 gi|88682865|gb|AAI05374.1| Mediator of cell motility 1 [Bos taurus]
          Length = 285

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 136/269 (50%), Positives = 181/269 (67%), Gaps = 4/269 (1%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ +PSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLAEPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPISVFLHMLGNCSTK-IKIKFLRYEQSS 270
           HPI V L+ +       + + F  ++Q+S
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFFQQAS 275


>gi|157104420|ref|XP_001648401.1| hypothetical protein AaeL_AAEL014294 [Aedes aegypti]
 gi|108869206|gb|EAT33431.1| AAEL014294-PA, partial [Aedes aegypti]
          Length = 276

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 142/271 (52%), Positives = 183/271 (67%), Gaps = 6/271 (2%)

Query: 21  LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
           L  +L  WL +A L   P  R +IAPHAGY Y G   A+A+  I P  + RVF+LGPSHH
Sbjct: 3   LNRQLSKWLDDADLTFGP-ARAIIAPHAGYRYCGACGAWAYRQISPAVVKRVFILGPSHH 61

Query: 81  YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAK 140
               +CALS A   +TP+ DL +D EV  EL+ATG F+ MD   DE EHS+EMHLPY+AK
Sbjct: 62  VRLSRCALSAAQYCRTPLYDLKVDQEVNAELEATGHFKWMDQKTDEDEHSIEMHLPYVAK 121

Query: 141 VFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
           V E       IVPI+VG+++ + E  YG++ A Y+ DP N F +SSDFCHWG+RF Y +Y
Sbjct: 122 VMEDFRDQFTIVPIMVGSISNDWEETYGKILAPYLADPQNLFVISSDFCHWGARFRYTYY 181

Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNC 255
           +  HG I+K IE LDKMGMD+IET  P++F +YL +Y+NTICGRHPI V L     L   
Sbjct: 182 EDSHGPIYKWIEVLDKMGMDLIETLKPESFGEYLRKYNNTICGRHPIGVLLQSVEELKRR 241

Query: 256 STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
             ++  KFL+Y+QS+QC  K+DSSVSYAS +
Sbjct: 242 GYRMSFKFLKYDQSNQCCDKKDSSVSYASGS 272


>gi|225708900|gb|ACO10296.1| MEMO1 [Caligus rogercresseyi]
          Length = 302

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 145/294 (49%), Positives = 196/294 (66%), Gaps = 9/294 (3%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYA 60
           E+ R ASHAGSWY+ + + L  E+ GWL       ++   R +I PHAGY YSG  AAYA
Sbjct: 5   EQRRCASHAGSWYSGDERVLQREMSGWLEAVKTDSETFPARAIIGPHAGYRYSGPTAAYA 64

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFEL 119
           +  I+P  I RVF+LGPSHH     C LS  +VY+TP+ DL +D E+ +EL ++ G F+ 
Sbjct: 65  YKQINPEGIKRVFILGPSHHIRLNGCLLSGCSVYETPLYDLVVDQELNKELMESKGGFDK 124

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             +  DE EHS+E+HLPY+AK  E       +VPILVG+++ + E  YG++ AKY+ DPS
Sbjct: 125 ATLQADEEEHSIELHLPYIAKAMESRKGQFTVVPILVGSLSPDKEYKYGKILAKYLMDPS 184

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
           N F +SSDFCHWG RF+Y +YD++ G IH+SI  LD  GM++IE+ + DAF  YL +  N
Sbjct: 185 NLFVISSDFCHWGERFDYTYYDEEAGEIHESISKLDHKGMELIESLNHDAFAAYLKKTSN 244

Query: 238 TICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           TICGRHPI VFL M+          ++++KFL+Y QS +C+   DSSVSYASAA
Sbjct: 245 TICGRHPIGVFLGMVKAILQHGEPPRMQLKFLKYAQSDKCRKIADSSVSYASAA 298


>gi|301122825|ref|XP_002909139.1| memo-like protein [Phytophthora infestans T30-4]
 gi|262099901|gb|EEY57953.1| memo-like protein [Phytophthora infestans T30-4]
          Length = 305

 Score =  286 bits (733), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 143/289 (49%), Positives = 191/289 (66%), Gaps = 6/289 (2%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
           ++R A+HAGSWY+ +  +L EEL GWL  A       K P +R +IAPHAG+ YSG  AA
Sbjct: 11  RVRLATHAGSWYSHDEHKLEEELSGWLGAAEGNDTQEKQPSIRAIIAPHAGFRYSGPTAA 70

Query: 59  YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           YA+  + +   I RVF+LGPSHH+Y   CA+STA  Y+TP+G++ +D EV E+L  +GKF
Sbjct: 71  YAYHQVLNLDRIKRVFILGPSHHFYLRGCAVSTAHEYETPLGNISIDHEVNEKLVDSGKF 130

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             M + VDE EHS+EMHLP++ KV  G     VPILVG   ++ +  YG++ A Y+++  
Sbjct: 131 ATMSMDVDEDEHSIEMHLPFIFKVMNGRKFTAVPILVGNTKSKTDEEYGKILAPYLENDE 190

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
           N F +SSDFCHWG RF Y  +D  +G IH+ I+ LD  GM  IE  D   F +YL E +N
Sbjct: 191 NLFVISSDFCHWGPRFRYQPHDSTYGEIHEYIKYLDHQGMGFIERLDAAGFTRYLDETNN 250

Query: 238 TICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           TICGRHPIS+ L+ +L +   K  +KF++Y QSS C    DSSVSYASA
Sbjct: 251 TICGRHPISLLLNTILASKKLKCTLKFVKYAQSSACMRHGDSSVSYASA 299


>gi|241122968|ref|XP_002403743.1| protein MEMO1, putative [Ixodes scapularis]
 gi|215493518|gb|EEC03159.1| protein MEMO1, putative [Ixodes scapularis]
          Length = 297

 Score =  286 bits (732), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 142/291 (48%), Positives = 186/291 (63%), Gaps = 6/291 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+ASHAGSWYTD+ + L  +L+ WL  AG+P     R VIAPHAGY Y G  AAYA+  +
Sbjct: 8   RKASHAGSWYTDSPRELKYQLENWLSAAGMPSFGPARAVIAPHAGYQYCGACAAYAYKQV 67

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP  +   F +            ++ A         + + + + EEL  TG FE + + V
Sbjct: 68  DPDHVRFFFFVMTLDLLLRCLLLVNNANRQSAEYDAVSV-VPIYEELFETGAFEEVSLHV 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           DE EHS+EMHLPY+AKV E     IVPI+VG++  ENEA YGRL +KY+ +P N F +SS
Sbjct: 127 DENEHSLEMHLPYIAKVMEDREFTIVPIIVGSLTPENEAFYGRLLSKYLAEPDNLFVISS 186

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWG+RF+Y  +DK  G IH+SIE LDK GM+IIE   P AF  YL +Y NTICGRHP
Sbjct: 187 DFCHWGARFHYQFHDKSWGNIHQSIERLDKQGMNIIEELCPTAFTAYLKKYGNTICGRHP 246

Query: 245 ISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           I V L+ +    +     ++ +KFL+Y QSSQC +  DSSVSYASAA +++
Sbjct: 247 IGVLLNAVDTLQSSGNGHRMALKFLKYAQSSQCMSTSDSSVSYASAALRLE 297


>gi|71017747|ref|XP_759104.1| hypothetical protein UM02957.1 [Ustilago maydis 521]
 gi|46098896|gb|EAK84129.1| hypothetical protein UM02957.1 [Ustilago maydis 521]
          Length = 346

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 158/342 (46%), Positives = 198/342 (57%), Gaps = 58/342 (16%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------------------GLP 35
           M   R+A+HAGSWYTD+ + L E L  WL +                           LP
Sbjct: 1   MALTRKATHAGSWYTDDPRELDECLTEWLSDVQPDNLLSPLSVCSDSSCSSSSSESLALP 60

Query: 36  KSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYK 95
            S   R +I PHAGYSYSG AAAYA+  ID ++I  VF+LGPSHH Y   CA+S  + Y+
Sbjct: 61  ISSN-RAIIGPHAGYSYSGPAAAYAYRTIDTSAIKTVFILGPSHHVYLDGCAVSACSSYE 119

Query: 96  TPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
           TP+G+LP++  V  EL +TG+F  M    DE EHS+EMHLPY+ KVF+G  I+IVPILVG
Sbjct: 120 TPLGNLPINRSVTHELLSTGRFSTMSKTEDEDEHSIEMHLPYIYKVFKGTGIQIVPILVG 179

Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK--------------- 200
           A+N   E  +G+L AKY++DP NFF VSSDFCHWGSRF Y +Y                 
Sbjct: 180 AINTARENEFGKLLAKYLNDPENFFVVSSDFCHWGSRFRYTYYKPCGSNIAMNLTSRSSR 239

Query: 201 ---KHGVIHKSIEALDKMGM---------DIIETGDPD--AFKKYLLEYDNTICGRHPIS 246
              +   IH+SI  LD+ G+         D  +T +    AF KYL E  NT+CGRHPI 
Sbjct: 240 SMFEGKPIHQSIRELDEAGILAITYPWSRDRQKTAEDARLAFAKYLSETKNTVCGRHPIG 299

Query: 247 VFLHMLGNC---STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           V L  L        K + +F RYEQSSQC T ++SSVSYASA
Sbjct: 300 VLLAALAELERRGQKTECRFTRYEQSSQCTTPQESSVSYASA 341


>gi|331236660|ref|XP_003330988.1| hypothetical protein PGTG_12951 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309309978|gb|EFP86569.1| hypothetical protein PGTG_12951 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 335

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 195/332 (58%), Gaps = 45/332 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------------EVRGVIAPH 47
           R A+HAGSWY  +  RL ++L GWL +A   + P                 + R +IAPH
Sbjct: 4   RSATHAGSWYPASKTRLEDDLKGWLSDAHANEQPVNHAQAWKSVCMPVPIKDCRAIIAPH 63

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AGYSYSGRAAA+A+  ID   I RVF+LGPSHH Y   CALS  T Y TP+GDLP+D  +
Sbjct: 64  AGYSYSGRAAAWAYKLIDTDQIKRVFILGPSHHVYLDCCALSKCTQYATPLGDLPIDTSI 123

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
            EELKA  +F  M +  D+ EHS+E+HLPY+  +F+ H IKIVPILVG++    E  YG 
Sbjct: 124 NEELKAKNRFGEMSLQTDQEEHSIELHLPYIRHIFKNHDIKIVPILVGSITFAEELEYGA 183

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV----------------IHKSIEA 211
           L A Y+ DP N F VSSDFCHWG+RF+Y +Y                      IH+SIE 
Sbjct: 184 LLAPYLADPENLFVVSSDFCHWGTRFSYTYYQDPRSTEPAKRLSPGSPPPEYPIHQSIEN 243

Query: 212 LDKMGMDIIETGDP----DAFKKYLLEYDNTICGRHPISVFL-HMLGNCSTK-------I 259
           LDK  ++ I   DP    + FK YL    NTICGRHPI V L  M+    T+        
Sbjct: 244 LDKEAIEAISLQDPSQAHETFKGYLKRTKNTICGRHPIGVLLGSMIKLQQTRSTPPPPQQ 303

Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           +++ +RYEQSSQC T  DSSVSY SA  ++ A
Sbjct: 304 QLQLVRYEQSSQCFTLSDSSVSYVSAFLRLPA 335


>gi|301089179|ref|XP_002894922.1| memo-like protein [Phytophthora infestans T30-4]
 gi|262105088|gb|EEY63140.1| memo-like protein [Phytophthora infestans T30-4]
          Length = 305

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 142/289 (49%), Positives = 191/289 (66%), Gaps = 6/289 (2%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
           ++R A+HAGSWY+ +  +L EEL GWL  A       K P +R +IAPHAG+ YSG  AA
Sbjct: 11  RVRLATHAGSWYSHDEHKLEEELSGWLGAAEGNDTQEKQPSIRAIIAPHAGFRYSGPTAA 70

Query: 59  YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           YA+  + +   I RVF+LGPSHH+Y   CA+STA  Y+TP+G++ +D EV E+L  +GKF
Sbjct: 71  YAYHQVLNLDRIKRVFILGPSHHFYLRGCAVSTAHEYETPLGNISIDHEVNEKLVDSGKF 130

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             M + VDE EHS+EMHLP++ KV  G     VPILVG   ++ +  YG++ A Y+++  
Sbjct: 131 ATMSMDVDEDEHSIEMHLPFIFKVMNGRKFTAVPILVGNTKSKTDEEYGKILAPYLENDE 190

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
           N F +SSDFCHWG RF Y  +D  +G IH+ I+ LD  GM  IE  D   F +YL + +N
Sbjct: 191 NLFVISSDFCHWGPRFRYQPHDSTYGEIHEYIKYLDHQGMGFIERLDAAGFTRYLDKTNN 250

Query: 238 TICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           TICGRHPIS+ L+ +L +   K  +KF++Y QSS C    DSSVSYASA
Sbjct: 251 TICGRHPISLLLNTILASKKLKCTLKFVKYAQSSACMRHGDSSVSYASA 299


>gi|195997049|ref|XP_002108393.1| hypothetical protein TRIADDRAFT_37128 [Trichoplax adhaerens]
 gi|190589169|gb|EDV29191.1| hypothetical protein TRIADDRAFT_37128 [Trichoplax adhaerens]
          Length = 297

 Score =  285 bits (728), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 147/289 (50%), Positives = 192/289 (66%), Gaps = 8/289 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+A+HAGSWY+ +   L+++L GWL +     +P  + +IAPHAGY+Y G  AAYA+  I
Sbjct: 4   RQATHAGSWYSSSGTELSKQLGGWLSKVANGNTP-AKAIIAPHAGYAYCGACAAYAYKQI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP  I R+F+LGPSHHYY  KCA+S  +   TP+  L +D E+  +L  TGKF+ M   V
Sbjct: 63  DPRGIKRIFILGPSHHYYLTKCAISMTSQCNTPLYPLQVDTEINNQLLKTGKFDKMSKSV 122

Query: 125 DEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           DE EHS+EMHLPY+A+V          IVPILVGA+    E MYG +F++Y+ DP N F 
Sbjct: 123 DEDEHSIEMHLPYIAQVMSSRSPDSFTIVPILVGALGYREELMYGEIFSQYLVDPENLFV 182

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF + +Y+K H  I  SIEALD+ GMD+IE  DP  F++YL +Y NTICG
Sbjct: 183 ISSDFCHWGKRFRFTYYEKSHKHIFSSIEALDRKGMDLIEQLDPAGFREYLDKYQNTICG 242

Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           RHPI V L  +     K    +K++FL Y QSSQC +  DSSVSYA+  
Sbjct: 243 RHPIGVLLQAIDAVRKKQRSNMKMRFLNYAQSSQCLSANDSSVSYAAGV 291


>gi|353237123|emb|CCA69103.1| hypothetical protein PIIN_03003 [Piriformospora indica DSM 11827]
          Length = 323

 Score =  283 bits (723), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 153/317 (48%), Positives = 194/317 (61%), Gaps = 36/317 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVR---GVIAPHAGYSYSGRAAAY 59
           R+ASHA SWY+ ++++L  +LD W+   E+    SP +R    +I PHAGYSYSG  AA+
Sbjct: 3   RQASHADSWYSGDARKLNSQLDQWINAVESSETYSPPIRQSKAIIGPHAGYSYSGPNAAW 62

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+ +ID T I RVF+LGPSHH Y   CALS    Y TP+G LPLD E IEEL  TGKFE 
Sbjct: 63  AYKSIDTTGIKRVFVLGPSHHVYLRSCALSACETYATPLGSLPLDRETIEELHNTGKFET 122

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           M +  DE EHS+E+HLPY+ K+F+G  IKIVPILVGA++ + EA YG+L A Y      F
Sbjct: 123 MALSTDEDEHSIELHLPYVRKIFQGMDIKIVPILVGALDKQGEATYGKLLAPYFAREDTF 182

Query: 180 FSVSSDFCHWGSRFNYMHY-------------------DKKHGVIHKSIEALDKMGMDII 220
             VSSDFCHWG RF+Y  Y                     K   IH+SI ALDK  MDI+
Sbjct: 183 TVVSSDFCHWGRRFSYTFYYPSPPPCEGIKLSRSTPSSTFKSFPIHESITALDKEAMDIL 242

Query: 221 ETG-----DPD----AFKKYLLEYDNTICGRHPISVFLHMLGNCSTK---IKIKFLRYEQ 268
                    P+     F  YL    NTICGRHPI V L  L + + +   + I++++Y+ 
Sbjct: 243 TLPRESDVSPNEVHTQFADYLARTRNTICGRHPIGVLLGALSSVAEQGKSVTIQWVKYDH 302

Query: 269 SSQCKTKRDSSVSYASA 285
           SS+C+T  DSSVSYASA
Sbjct: 303 SSECETVEDSSVSYASA 319


>gi|389751361|gb|EIM92434.1| UPF0103-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 331

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 155/324 (47%), Positives = 194/324 (59%), Gaps = 43/324 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
           R A+HA SWYT N  +L  EL GW        ++   P   + + +IAPHAGYSYSG AA
Sbjct: 4   RIATHADSWYTGNGTKLDSELKGWFNAVKPSAQDNYHPPIAKCKAIIAPHAGYSYSGPAA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+ +ID T I RVF+LGPSHH Y   CALS    Y+TPIGDLPLDLE I EL+++G+F
Sbjct: 64  AWAYKSIDTTGIKRVFILGPSHHVYLDGCALSECETYQTPIGDLPLDLETIAELRSSGEF 123

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             MD+  DE EHS+EMHLPY+ KVFEG  IKIVPILVGA++ E E  +GR  A Y+    
Sbjct: 124 SDMDLQTDEDEHSIEMHLPYVRKVFEGMDIKIVPILVGAIDKEKEVAFGRTLAPYLARDD 183

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHG---------------------VIHKSIEALDKMG 216
            F  VSSDFCHWG+RF+Y  Y  +                        I++SI +LD+  
Sbjct: 184 TFCVVSSDFCHWGTRFSYTFYYPERPPTALDPIRLSRTSSPPNLHKFPIYESISSLDREA 243

Query: 217 MDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHML---------GNCSTKIKI 261
           M+++      A      F +YL    NTICGRHPI V L  L         G      K+
Sbjct: 244 MELLTLPPCTARQAHTYFAEYLARTKNTICGRHPIGVLLGALSAVEKGEVEGVEKRMAKL 303

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
           +++RYEQSSQC T +DSSVSYASA
Sbjct: 304 QWVRYEQSSQCLTVKDSSVSYASA 327


>gi|392571540|gb|EIW64712.1| UPF0103-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 325

 Score =  282 bits (721), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 159/322 (49%), Positives = 189/322 (58%), Gaps = 39/322 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPK-----SPEVRG---VIAPHAGYSYSGRA 56
           R A+HA SWY+ ++ +L  EL GWL E   P      SP V G   VIAPHAGY+YSGR 
Sbjct: 4   RNATHADSWYSGDASQLDRELSGWL-EGVSPSAEDEFSPPVSGTKAVIAPHAGYAYSGRT 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+ +ID T I RVF+LGPSHH Y   CAL+  T Y+TP+G LP+D+E   ELK TG+
Sbjct: 63  AAWAYKSIDTTGIKRVFVLGPSHHVYVDGCALTRCTQYETPLGVLPIDVETTRELKNTGQ 122

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           F  MD   DE EHS+EMHLPY+ KVFEG  I IVP+LVGA+N   E  YG+L A Y+   
Sbjct: 123 FTEMDQQTDEDEHSIEMHLPYVRKVFEGKDISIVPVLVGAINYGKEVTYGKLLAPYLARE 182

Query: 177 SNFFSVSSDFCHWGSRFNYM---------------------HYDKKHGVIHKSIEALDKM 215
             FF VSSDFCHWG RF Y                      H   K   IH+SI ALD  
Sbjct: 183 DTFFVVSSDFCHWGLRFQYTFYYPKPPPSSVSPVRLSRADSHTSVKENPIHESISALDHE 242

Query: 216 GMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI---KFLRY 266
            M+I+      A      F  YL    NTICGRHPI V L  L     + K+   K++RY
Sbjct: 243 AMEILTMPPCTAAQAHHDFADYLARTKNTICGRHPIGVLLGALTVLEKEGKVPKLKWVRY 302

Query: 267 EQSSQCKTKRDSSVSYASAAAK 288
           EQSS C   +DSSVSYASA  K
Sbjct: 303 EQSSACVAIKDSSVSYASAFVK 324


>gi|402588404|gb|EJW82337.1| hypothetical protein WUBG_06754, partial [Wuchereria bancrofti]
          Length = 283

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 142/276 (51%), Positives = 183/276 (66%), Gaps = 8/276 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R ASHAGSWYTD+ ++L  EL  WL  AG   S   R +I+PHAGYSYSGR AA+AF  
Sbjct: 8   VRNASHAGSWYTDDPRKLHRELTEWLGAAGSRHSQSARAIISPHAGYSYSGRVAAFAFKQ 67

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I P ++S +F+LGPSH      CALST   Y+TPIGDL +D     ELK TG F LMD+ 
Sbjct: 68  IIPETVSIIFVLGPSHVMSLDTCALSTCWRYRTPIGDLQIDQRTNMELKETGAFSLMDLR 127

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK---IVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
            +EAEHS+EM LPY+AK+ E        IVP+LVG+++   +A YG++F+KY+ DP   F
Sbjct: 128 SEEAEHSIEMQLPYIAKIMEKQSTNSYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVF 187

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            VSSDFCHWGSRF++M +D   GV I++ I A+DK GMD I + +P AF +YL    NTI
Sbjct: 188 VVSSDFCHWGSRFHFMPHDNSTGVPIYEQIAAMDKQGMDAISSLNPAAFGEYLKRTQNTI 247

Query: 240 CGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQ 271
           CGR+PISV L    +     S   +  FL+Y QS+Q
Sbjct: 248 CGRNPISVILYAAEYFRQMNSHLAEFVFLKYAQSNQ 283


>gi|348675910|gb|EGZ15728.1| hypothetical protein PHYSODRAFT_351598 [Phytophthora sojae]
          Length = 305

 Score =  281 bits (719), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 141/289 (48%), Positives = 190/289 (65%), Gaps = 6/289 (2%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
           ++R A++AGSWY+ +   L  EL GWL  A       K P +R +I PHAG+ YSG  AA
Sbjct: 11  RVRLATYAGSWYSHDEHELEGELAGWLDAAEDNDTQEKQPSIRAIIGPHAGFRYSGPTAA 70

Query: 59  YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           YA+ ++ +   I RVF+LGPSHH+Y   CA+STA  Y+TP+G++ +D EV E+L  +GKF
Sbjct: 71  YAYHHLLNLDRIKRVFILGPSHHFYLRGCAVSTAHEYETPLGNIVIDHEVNEKLVDSGKF 130

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             M + VDE EHS+EMHLP++ KV  G     VPILVG   ++ +  YG++ A Y+++  
Sbjct: 131 ATMSMDVDEDEHSIEMHLPFIYKVMNGRKFTAVPILVGNTKSKMDEEYGKILAPYLENDE 190

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
           N F +SSDFCHWG RF Y  +D  +G IH+ I+ LD  GM  IE  D + F +YL E +N
Sbjct: 191 NLFVISSDFCHWGPRFRYQPHDSTYGEIHEYIKYLDHQGMGFIERLDAEGFTRYLEETNN 250

Query: 238 TICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           TICGRHPIS+ LH +L +   K  +KF++Y QSS C    DSSVSYASA
Sbjct: 251 TICGRHPISLLLHSILASKKLKCTLKFVKYAQSSACMRHGDSSVSYASA 299


>gi|296482698|tpg|DAA24813.1| TPA: protein MEMO1 [Bos taurus]
          Length = 250

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 131/243 (53%), Positives = 169/243 (69%), Gaps = 3/243 (1%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ +PSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLAEPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246

Query: 243 HPI 245
           HPI
Sbjct: 247 HPI 249


>gi|336383353|gb|EGO24502.1| hypothetical protein SERLADRAFT_467923 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 348

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 154/322 (47%), Positives = 194/322 (60%), Gaps = 37/322 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSPEVRGVIAPHAGYSYSGR 55
           M+  RRA+HAGSWY+ + + L  EL  WL      E   P     + +IAPHAGYSYSG 
Sbjct: 23  MQPSRRATHAGSWYSSSGRTLDAELTQWLAAVTSDEDFAPPITGCKAIIAPHAGYSYSGP 82

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           AAA+A+ +ID T I  VF+LGPSHH Y   CALS    Y+TP+G+LPLDL+VI +L+ TG
Sbjct: 83  AAAWAYKSIDTTDIKCVFILGPSHHVYLDGCALSNCEQYETPLGELPLDLQVIRDLRDTG 142

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           +FE MDI  DE EHS+EMHL Y+ KVFE   I IVP+LVGA++ + EA YG++ A Y+  
Sbjct: 143 EFEDMDIQTDEDEHSIEMHLSYVRKVFENSDISIVPVLVGAISKDKEAAYGQILAPYLAR 202

Query: 176 PSNFFSVSSDFCHWGSRFNYMHY------DKKHGV----------------IHKSIEALD 213
               F +SSDFCHWG+RF+Y +Y          G+                IH+SI  LD
Sbjct: 203 EDTLFVISSDFCHWGTRFSYTYYYPEPLPSNTPGIRLSRSGPPPSTLSSRPIHESISDLD 262

Query: 214 KMGMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLG----NCSTKIKIKF 263
              M ++      A      F  YL    NTICGRHPI V L  L     + +  +KIK+
Sbjct: 263 HEAMQLLTIPPGSAVNAHQEFASYLGRTKNTICGRHPIGVLLGALSVLEQDETHSVKIKW 322

Query: 264 LRYEQSSQCKTKRDSSVSYASA 285
           +RYEQSSQC   RDSSVSYASA
Sbjct: 323 VRYEQSSQCFNIRDSSVSYASA 344


>gi|195145810|ref|XP_002013883.1| GL23150 [Drosophila persimilis]
 gi|194102826|gb|EDW24869.1| GL23150 [Drosophila persimilis]
          Length = 269

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 141/286 (49%), Positives = 175/286 (61%), Gaps = 29/286 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD    L+ +LD WL  A L   P  R +IAPHAGY+Y G  +A+A+  I
Sbjct: 4   RRATHAGSWYTDAGSELSRQLDRWLGAAELSHGP-ARAIIAPHAGYTYCGACSAFAYRQI 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  YKTP+ DL +D+++  EL+ T +F   D   
Sbjct: 63  SPVVVKRIFILGPSHHVRLRGCALSVAKKYKTPLYDLKIDIQINTELEKTPQFSYKD--- 119

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
                                   IVPILVG++N E EA YG L + Y  DP+N F +SS
Sbjct: 120 ---------------------QFTIVPILVGSLNPEQEAQYGSLLSTYFMDPTNLFVISS 158

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWG RF+Y +YD+  G IHKSIE LDK GMD+IET  P AF +YL +Y+NTICGRHP
Sbjct: 159 DFCHWGQRFSYTYYDRSCGQIHKSIEKLDKQGMDLIETLSPTAFTEYLRKYNNTICGRHP 218

Query: 245 ISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           I V L  +         K+  KFL+Y QSSQC    DSSVSYAS +
Sbjct: 219 IGVMLGAVKALQDQGYDKMSFKFLQYAQSSQCLDMEDSSVSYASGS 264


>gi|344288749|ref|XP_003416109.1| PREDICTED: protein MEMO1-like isoform 2 [Loxodonta africana]
          Length = 274

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 141/290 (48%), Positives = 179/290 (61%), Gaps = 29/290 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAP                  
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                 R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 50  ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 103

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 163

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 273


>gi|212286159|ref|NP_001131074.1| protein MEMO1 isoform 2 [Homo sapiens]
 gi|114576873|ref|XP_001163989.1| PREDICTED: protein MEMO1 isoform 1 [Pan troglodytes]
 gi|334312891|ref|XP_003339794.1| PREDICTED: protein MEMO1-like isoform 2 [Monodelphis domestica]
 gi|410209452|gb|JAA01945.1| mediator of cell motility 1 [Pan troglodytes]
          Length = 274

 Score =  277 bits (708), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 141/290 (48%), Positives = 178/290 (61%), Gaps = 29/290 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAP                  
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                 R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 50  ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 103

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 163

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 273


>gi|327262349|ref|XP_003215987.1| PREDICTED: protein MEMO1-like isoform 2 [Anolis carolinensis]
          Length = 274

 Score =  277 bits (708), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 142/290 (48%), Positives = 178/290 (61%), Gaps = 29/290 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L+ +L+GWL +    K P  R +IAP                  
Sbjct: 8   REASHAGSWYTASGPQLSAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                 R+F+LGPSHH    +CALS+  +Y+TP+ DL +D +V  EL  TG FE M +  
Sbjct: 50  ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKVYAELWKTGMFERMSLQT 103

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     IVP+LVGA++   E  +G+LF+KY+ DP N F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIVPVLVGALSETKEQEFGKLFSKYLADPCNLFVV 163

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 273


>gi|449669566|ref|XP_004207060.1| PREDICTED: protein MEMO1-like [Hydra magnipapillata]
          Length = 291

 Score =  276 bits (707), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 142/286 (49%), Positives = 193/286 (67%), Gaps = 4/286 (1%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
            RRASHAGSWY+D+  +L ++L+ WL E  +  +P  R +I+PHAGY+Y G  AAYA+  
Sbjct: 3   FRRASHAGSWYSDSGDQLNKQLEQWLSEVNVKSTP-ARALISPHAGYAYCGACAAYAYKQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I+P +I R+F+LGPSHH   P CA++  T Y+TP+ +L +D  +  EL  TGKF++M   
Sbjct: 62  INPMTIKRIFILGPSHHVALPGCAVTQTTSYETPLYNLKIDNLINNELLGTGKFDIMSKE 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            DE EHS+EM LP++AKV E +     +VPILVG+ + E E +YG +F+KY+ +P N F 
Sbjct: 122 TDENEHSIEMQLPFIAKVMESNKDNFTVVPILVGSTSHEQERLYGVIFSKYLKNPENLFV 181

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF +  YDK  G I++SIE LDKMGM  IE  D ++F KYL ++ NTICG
Sbjct: 182 ISSDFCHWGKRFRFTPYDKSKGEIYESIEDLDKMGMKYIEQLDTNSFYKYLEKFSNTICG 241

Query: 242 RHPISVFLHMLGNC-STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           RHPI + L+ +    S     KF  Y QSS+C+   DSSVSYA+  
Sbjct: 242 RHPIGILLNAVNEVRSADASFKFHYYSQSSKCRRFEDSSVSYAAGT 287


>gi|345307803|ref|XP_001509250.2| PREDICTED: protein MEMO1-like [Ornithorhynchus anatinus]
          Length = 273

 Score =  275 bits (702), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 132/256 (51%), Positives = 173/256 (67%), Gaps = 7/256 (2%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI 98
           E++G+   HAGY+Y G  AA+A+  +DP+   R+F+LGPSHH    +CALS+  +Y+TP+
Sbjct: 19  EIKGL--SHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSHHVPLSRCALSSVDIYRTPL 76

Query: 99  GDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGA 156
            DL +D ++  EL  TG FE M +  DE EHS+EMHLPY AK  E H     I+P+LVGA
Sbjct: 77  YDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGA 136

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
           ++   E  +G+LF+KY+ DPSN F +SSDFCHWG RF Y +YD+  G I++SIE LDKMG
Sbjct: 137 LSESKEQEFGKLFSKYLADPSNLFVISSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMG 196

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCK 273
           M+IIE  DP +F  YL +Y NTICGRHPI V L+    L      +   FL Y QSSQC+
Sbjct: 197 MNIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQKNGVNMSFSFLNYAQSSQCR 256

Query: 274 TKRDSSVSYASAAAKV 289
             +DSSVSYA+ A  V
Sbjct: 257 NWQDSSVSYAAGALTV 272


>gi|403415251|emb|CCM01951.1| predicted protein [Fibroporia radiculosa]
          Length = 326

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 152/319 (47%), Positives = 188/319 (58%), Gaps = 38/319 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-------EAGLPKSPEVRGVIAPHAGYSYSGRAA 57
           RRA+HAGSWY  +  +L  EL   L+       EA  P     + +IAPHAGYSYSG  A
Sbjct: 4   RRATHAGSWYDADGGKLDTELSSNLQSVEPSLEEAFAPPVKGSKAIIAPHAGYSYSGSTA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+ +ID   I RVF+LGP+HH Y   CALST   Y+TP+G LPLDL+ I+EL+ TGKF
Sbjct: 64  AWAYKSIDTAGIKRVFILGPAHHVYLDGCALSTCERYETPLGTLPLDLDTIQELRDTGKF 123

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             MD+  DE EHS+EMHLPY+ K+FEG  I IVPILVGA+N   EA +G + A Y+    
Sbjct: 124 SDMDVDTDEDEHSIEMHLPYVRKIFEGLNISIVPILVGAINYNKEATFGTILAPYLARDD 183

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHG----------------------VIHKSIEALDKM 215
            F  VSSDFCHWG+RF Y  Y  K                         I+ SI ALD  
Sbjct: 184 TFCVVSSDFCHWGTRFQYTFYYPKAPPTSIPAIRLSKADPSPPSLTTHPIYSSISALDHE 243

Query: 216 GMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI---KFLRY 266
            M+++      A      F +YL    NTICGRHPI V L  L     + K+   K++RY
Sbjct: 244 AMELLTMPPSTAAQSHHEFAEYLARTKNTICGRHPIGVLLGALAVLQKQGKVPHLKWVRY 303

Query: 267 EQSSQCKTKRDSSVSYASA 285
           EQSS+C T +DSSVSYASA
Sbjct: 304 EQSSECVTIKDSSVSYASA 322


>gi|339247511|ref|XP_003375389.1| protein MEMO1 [Trichinella spiralis]
 gi|316971269|gb|EFV55071.1| protein MEMO1 [Trichinella spiralis]
          Length = 301

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 136/291 (46%), Positives = 180/291 (61%), Gaps = 7/291 (2%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           E  R  SHAGSWY+ N + L  +L+ WL        P  + +I+PHAGY Y G +AAYA+
Sbjct: 6   ESCRPCSHAGSWYSANEEDLRSQLEDWLSAVKTNHGP-AKAIISPHAGYCYCGASAAYAY 64

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             IDP ++ R+F+LGPSHH     CA+S    Y TP  +L +D +V  EL  TG FE M+
Sbjct: 65  KQIDPNTVDRIFILGPSHHVCLSGCAVSKFNSYGTPFYNLTVDQQVNNELIETGLFEKME 124

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           +  D AEHS+EM LPY+A V +       IVPILVG++    ++ YGR+ ++Y+ D  N 
Sbjct: 125 LLTDTAEHSIEMQLPYIAHVMQSRKGAFTIVPILVGSLTPSRQSAYGRILSRYLADDRNL 184

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           F +S+DFCHWG RF++  YDK  G IH SIE LD+ GM IIE  D  AF  YL    NTI
Sbjct: 185 FIISTDFCHWGHRFHFTKYDKTGGEIHSSIERLDREGMKIIENLDSAAFNDYLKRTGNTI 244

Query: 240 CGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           CGRHPISV L    H+    + +   +FL Y QS+ C++  DSSVSYA+ +
Sbjct: 245 CGRHPISVLLQATEHLHEINNRRGDFQFLHYSQSNHCRSMHDSSVSYAAGS 295


>gi|341887659|gb|EGT43594.1| CBN-TAG-253 protein [Caenorhabditis brenneri]
          Length = 302

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 138/289 (47%), Positives = 186/289 (64%), Gaps = 6/289 (2%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +  R ASHAGSWY+ N + L  +L  WL  AG  +    R +IAPHAGYSY G  AAYAF
Sbjct: 8   DHTRAASHAGSWYSGNQRDLDRQLTKWLDNAG-DRYGTARALIAPHAGYSYCGETAAYAF 66

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +  +++ RVF+LGPSH      CA++T + Y+TP+GD+ +D  + EEL+AT  F+LMD
Sbjct: 67  KQVVSSAVDRVFILGPSHVVALSGCAITTCSKYRTPLGDMTVDHTINEELRATRHFDLMD 126

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
              +EAEHS+EM LP+LAKV       IVPILVG++       YG +FA Y++DP N F 
Sbjct: 127 RRDEEAEHSLEMQLPFLAKVMGSRRYTIVPILVGSLPGSRAQTYGGIFAHYMEDPRNLFV 186

Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           +SSDFCHWG RF++  YD+   V I++ I +LDK GM+ IET +P  F  YL +  NTIC
Sbjct: 187 ISSDFCHWGDRFSFSPYDRNSNVPIYEQITSLDKQGMNAIETLNPTVFNDYLKKTQNTIC 246

Query: 241 GRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           GR+PI + L    H   + +   + KFL Y QS++ ++  DSSVSYAS 
Sbjct: 247 GRNPILIMLQAAEHFRISNNHTHEFKFLHYTQSNKVRSPSDSSVSYASG 295


>gi|194386136|dbj|BAG59632.1| unnamed protein product [Homo sapiens]
          Length = 274

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 140/290 (48%), Positives = 177/290 (61%), Gaps = 29/290 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAP                  
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                 R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 50  ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 103

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+ F+KY+ DPSN F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKPFSKYLADPSNLFVV 163

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 273


>gi|348507066|ref|XP_003441078.1| PREDICTED: protein MEMO1-like isoform 2 [Oreochromis niloticus]
          Length = 274

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 142/290 (48%), Positives = 178/290 (61%), Gaps = 29/290 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWY+ +  +L  +L+GWL +A     P  R +IAP                  
Sbjct: 8   REASHAGSWYSASGSQLNAQLEGWLSQAQSTIRP-ARAIIAPR----------------- 49

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                 RVF+LGPSHH    +CALS A +Y+TP+ DL +D +V  +L  TG FE M +  
Sbjct: 50  ------RVFILGPSHHVPLSRCALSPADIYRTPLYDLRIDQKVYADLWKTGLFERMSLQT 103

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     IVP+LVGA++   E  YG+L +KY+ DPSN F +
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFSIVPVLVGALSESKEQEYGKLLSKYLADPSNLFII 163

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPMSFTNYLKKYRNTICGR 223

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L     ++   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 224 HPIGVLLNAVAELRKSGLEMNFTFLNYAQSSQCRNWQDSSVSYAAGALIV 273


>gi|336370572|gb|EGN98912.1| hypothetical protein SERLA73DRAFT_122776 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 327

 Score =  273 bits (697), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 151/323 (46%), Positives = 193/323 (59%), Gaps = 38/323 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSPEVRGVIAPHAGYSYSGR 55
           M+  RRA+HAGSWY+ + + L  EL  WL      E   P     + +IAPHAGYSYSG 
Sbjct: 1   MQPSRRATHAGSWYSSSGRTLDAELTQWLAAVTSDEDFAPPITGCKAIIAPHAGYSYSGP 60

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           AAA+A+ +ID T I  VF+LGPSHH Y   CALS    Y+TP+G+LPLDL+VI +L+ TG
Sbjct: 61  AAAWAYKSIDTTDIKCVFILGPSHHVYLDGCALSNCEQYETPLGELPLDLQVIRDLRDTG 120

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVD 174
           +FE MDI  DE EHS+EMHL Y+ K++     I IVP+LVGA++ + EA YG++ A Y+ 
Sbjct: 121 EFEDMDIQTDEDEHSIEMHLSYVRKIYTSSSDISIVPVLVGAISKDKEAAYGQILAPYLA 180

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHY------DKKHGV----------------IHKSIEAL 212
                F +SSDFCHWG+RF+Y +Y          G+                IH+SI  L
Sbjct: 181 REDTLFVISSDFCHWGTRFSYTYYYPEPLPSNTPGIRLSRSGPPPSTLSSRPIHESISDL 240

Query: 213 DKMGMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLG----NCSTKIKIK 262
           D   M ++      A      F  YL    NTICGRHPI V L  L     + +  +KIK
Sbjct: 241 DHEAMQLLTIPPGSAVNAHQEFASYLGRTKNTICGRHPIGVLLGALSVLEQDETHSVKIK 300

Query: 263 FLRYEQSSQCKTKRDSSVSYASA 285
           ++RYEQSSQC   RDSSVSYASA
Sbjct: 301 WVRYEQSSQCFNIRDSSVSYASA 323


>gi|146186006|ref|XP_001032869.2| hypothetical protein TTHERM_00486570 [Tetrahymena thermophila]
 gi|146143075|gb|EAR85206.2| hypothetical protein TTHERM_00486570 [Tetrahymena thermophila
           SB210]
          Length = 296

 Score =  272 bits (695), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 181/288 (62%), Gaps = 4/288 (1%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAY 59
           ++ R+A+ AGSWY  N  +L +EL+ +L+ A L  SP  +++ +I PHAGY YSG   A+
Sbjct: 5   QQTRKAAFAGSWYEGNQAKLNQELNNYLQNAQLQISPIQKIKAMIGPHAGYRYSGPTQAW 64

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           ++  I+P    R+FLLGPSH      C LS+     TP G + +D +VI+EL  TG F  
Sbjct: 65  SYKYINPQEHQRIFLLGPSHRQRFQGCGLSSCNSLDTPFGQIQVDTDVIQELIKTGSFVQ 124

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
            D   +EAEHS+EMHLPY+ K+       IVPI+VG  + + E  YG + +KY DD    
Sbjct: 125 TDKQTEEAEHSLEMHLPYIKKIMGDKPFTIVPIMVGQTSYDLEQKYGGILSKYFDDEKTL 184

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           F +SSDFCHWG RFNY ++D + G I++SI+ LDK GMD IE+     F +YL E  NTI
Sbjct: 185 FIISSDFCHWGERFNYQYHDPQAGKIYQSIQKLDKEGMDSIESHSSKKFAQYLNETSNTI 244

Query: 240 CGRHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           CGR  I + LH++     S  +K KF++Y QSSQ   + DSSVSYASA
Sbjct: 245 CGRRGIGILLHIIEQSALSKTLKTKFVQYAQSSQVVDEDDSSVSYASA 292


>gi|47229018|emb|CAG09533.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 308

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 144/304 (47%), Positives = 183/304 (60%), Gaps = 35/304 (11%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+ WL +A   + P  R +IAPHAGYSY G  AA+A+  +DPT   RVF+LGPSH
Sbjct: 5   QLNAQLEDWLSKAQSTRRP-ARAIIAPHAGYSYCGACAAHAYKQVDPTVTRRVFILGPSH 63

Query: 80  HYYTPKCALSTATVYKTPIGDLPLD-----------------------------LEVIEE 110
           H +   CALS+A +Y+TP+ D+ +D                             L V  E
Sbjct: 64  HVHLTCCALSSAEIYRTPLYDMRIDQKGLAHFYMPRERLSACAAKAVHAKLDCCLPVYAE 123

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRL 168
           L  TG F+ M+I  DE EHS+EMHLPY AK  E       IVPILVGA++   E  YG+L
Sbjct: 124 LWKTGLFDRMNIKTDEDEHSIEMHLPYTAKAMESRKDDFSIVPILVGALSENKEHEYGKL 183

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
            +KY+ DPSN F VSSDFCHWG+RF+Y +YD+  G I++SIE LDKMGM IIE  DP +F
Sbjct: 184 LSKYLADPSNLFVVSSDFCHWGNRFHYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPPSF 243

Query: 229 KKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
             Y+ +Y NTICGRHPI V L+    L     ++   FL Y QS QC+    SSVSYA+ 
Sbjct: 244 SNYIKKYRNTICGRHPIGVLLNAVAELRKSGLEMNFSFLNYAQSGQCRNWEQSSVSYAAG 303

Query: 286 AAKV 289
           A  V
Sbjct: 304 ALYV 307


>gi|409051530|gb|EKM61006.1| hypothetical protein PHACADRAFT_180158 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 333

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 152/327 (46%), Positives = 187/327 (57%), Gaps = 45/327 (13%)

Query: 4   IRRASHAGSWYTDNSKRLA-----EELDGWL---------REAGLPKSPEVRGVIAPHAG 49
           +RRA+HAGSWYTD+ K  A     + LD  L          E   P     + VIAPHAG
Sbjct: 3   VRRATHAGSWYTDDRKLTAHSCRWQTLDNELARNLAAVQSSEEFTPPVAGCKAVIAPHAG 62

Query: 50  YSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           YSYSG AAA+A+  ID T I RVF+LGPSHH Y   C LS    Y+TPIGDLPLD+E I 
Sbjct: 63  YSYSGPAAAWAYKTIDTTGIKRVFILGPSHHVYLDGCDLSQCKTYQTPIGDLPLDIETIA 122

Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
           ELK +GKF +M +  DE EHS+E+HLPY+ KVFE   I +VPILVGA++ + E  YG L 
Sbjct: 123 ELKKSGKFGVMSVETDEDEHSIELHLPYVRKVFERKDISVVPILVGAIDKDKEDAYGELL 182

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRF----------------------NYMHYDKKHGVIHK 207
           A Y+      F VSSDFCHWG RF                      N    D  +  +H+
Sbjct: 183 APYLAREDTIFVVSSDFCHWGQRFQYTYYYPEPPPTHRPAIRLSRSNASPADLANHPVHQ 242

Query: 208 SIEALDKMGMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLGNCSTK--- 258
           SI ALD+  M+++      A      F  YL    NTICGRHPI V L  L     +   
Sbjct: 243 SIRALDQEAMELLTMPPASATSAHYDFTDYLRRTKNTICGRHPIGVLLGALSALEQEGRV 302

Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASA 285
            K++++RYEQSS C++  DSSVSYASA
Sbjct: 303 PKLQWVRYEQSSACQSVNDSSVSYASA 329


>gi|407922334|gb|EKG15436.1| UPF0103/Mediator of ErbB2-driven cell motility (Memo-related)
           [Macrophomina phaseolina MS6]
          Length = 327

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 188/324 (58%), Gaps = 41/324 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL----------REAG------LPKSPEVRGVIAPH 47
           +R ASHAGSWY+ N  +L+ +LD WL          R A       +P S   R +IAPH
Sbjct: 3   VREASHAGSWYSANKSQLSAQLDQWLDAVPASATGIRPASQDDLIDIPSS-GARAIIAPH 61

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AGY+YSG AAA+A+  +D +   R+FLLGPSHHYY   CALS    Y+TP+GDL +D E 
Sbjct: 62  AGYAYSGPAAAWAYKALDLSKAKRIFLLGPSHHYYLTGCALSKCDAYETPLGDLQIDKET 121

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAEN 161
           + EL+    F+ M    DE EHS+EMHLPY+ KV             +VPILVGA +A  
Sbjct: 122 VAELRKKASFDTMTQSQDEEEHSLEMHLPYIYKVLSRQFSDPSEFPTLVPILVGATSAAT 181

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG---------------VIH 206
           E  +G L A Y+ DP++ F +SSDFCHWGSRF Y +Y++  G                IH
Sbjct: 182 ERQFGELLAPYLADPTSVFIISSDFCHWGSRFRYTYYEQAPGQVRNLRSSDKSPSSPAIH 241

Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKF 263
           +SI A+D   MD +ET   + F   L E  NT+CGRHPI V +     L     K K +F
Sbjct: 242 ESIAAVDGYCMDAVETKSHEKFLGILEETGNTVCGRHPIGVVMAAVEALSLAEDKGKFRF 301

Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
           +RYE+SS C + +DSSVSY SA A
Sbjct: 302 IRYERSSDCVSPKDSSVSYCSAVA 325


>gi|268555180|ref|XP_002635578.1| C. briggsae CBR-TAG-253 protein [Caenorhabditis briggsae]
          Length = 391

 Score =  271 bits (692), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 135/290 (46%), Positives = 185/290 (63%), Gaps = 8/290 (2%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           E  R ASHAGSWY  N + L  +L  WL  AG  +    R +I+PHAGYSY G  AAYAF
Sbjct: 97  EHARSASHAGSWYNGNQRDLDRQLTKWLDNAG-DRFGTARALISPHAGYSYCGETAAYAF 155

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +  +++ RVF+LGPSH      CA++T + Y+TP+GDL +D ++ EEL+AT  F+LMD
Sbjct: 156 KQVVSSAVERVFILGPSHIVALNGCAITTCSKYRTPLGDLSVDHKINEELRATRHFDLMD 215

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
              +E+EHS+EM LP++AKV       I+P+LVG++    +  YG +FA Y++DP N F 
Sbjct: 216 RRDEESEHSIEMQLPFIAKVMGNRRYTIIPVLVGSLPGSRQQTYGNIFAHYMEDPRNLFV 275

Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           +SSDFCHWG RF++  YD+   + I++ I  LDK GM  IET +P  F  YL +  NTIC
Sbjct: 276 ISSDFCHWGDRFSFSPYDRNSNLPIYEQITNLDKQGMSAIETLNPTVFNDYLKKTQNTIC 335

Query: 241 GRHPISVFLH-----MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           GR+PI + L       L N  T  + +FL Y QS++ ++  DSSVSYAS 
Sbjct: 336 GRNPILIMLQAAEGFRLSNNHTH-EFRFLHYTQSNKVRSPSDSSVSYASG 384


>gi|308506181|ref|XP_003115273.1| CRE-TAG-253 protein [Caenorhabditis remanei]
 gi|308255808|gb|EFO99760.1| CRE-TAG-253 protein [Caenorhabditis remanei]
          Length = 411

 Score =  270 bits (691), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 138/295 (46%), Positives = 188/295 (63%), Gaps = 9/295 (3%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           E +R ASHAGSWY+ N + L  +L  WL  AG  +    R +I+PHAGYSY G  AAYAF
Sbjct: 116 EHVRSASHAGSWYSGNQRDLDRQLTKWLDSAG-ERFGTARALISPHAGYSYCGETAAYAF 174

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             I P+++ RVF+LGPSH      CA++T + Y+TP+GDL +D +V E+L+AT  F+LMD
Sbjct: 175 KQIVPSAVERVFILGPSHVVALNGCAITTCSKYRTPLGDLIVDQKVTEDLRATRHFDLMD 234

Query: 122 ICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
              +E+EHS+EM LP++AKV        IVP+LVG++    +  YG +FA Y++DP N F
Sbjct: 235 RRDEESEHSIEMQLPFIAKVMGPTRRYTIVPVLVGSLPGSRQQTYGNIFAHYMEDPKNLF 294

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            +SSDFCHWG RF++  +D+   + I + I  +DK GM  IET +P  F  YL +  NTI
Sbjct: 295 VISSDFCHWGDRFSFSPFDRNSNLPIFEQITNMDKQGMAAIETLNPTVFNDYLKKTQNTI 354

Query: 240 CGRHPISVFLHM-----LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           CGR+PI + L       L N  T  + KFL Y QS++ ++  DSSVSYAS    V
Sbjct: 355 CGRNPILIMLQAAEHFRLSNNHTH-EFKFLHYTQSNKVRSPSDSSVSYASGVLFV 408


>gi|345571372|gb|EGX54186.1| hypothetical protein AOL_s00004g219 [Arthrobotrys oligospora ATCC
           24927]
          Length = 330

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 193/328 (58%), Gaps = 41/328 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEV--RGVIAPHAGYSYS 53
           M  IR ASHAGSWY+ +   L  EL G+L +      G+ + P    R +IAPHAGYSYS
Sbjct: 1   MTSIRPASHAGSWYSKDKNTLDSELSGYLSQVPSTIDGIGEIPPAGARVIIAPHAGYSYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G AAA+A+ +++ T+I RVF+LGPSHH Y   CA+S+   Y TP+G LP+D +   +L  
Sbjct: 61  GPAAAWAYKSLNLTNIKRVFILGPSHHIYINGCAVSSHGAYATPLGPLPIDTKATLDLIK 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFE-----GHLIKIVPILVGAVNAENEAMYGRL 168
           T  F  M+  VD  EHS+EMHLPY  K+         +  IVPILVGA+N E E  YG++
Sbjct: 121 TNSFSFMNQVVDSDEHSIEMHLPYTYKMLSTFFGPTDIPPIVPILVGAINTETEEKYGKI 180

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIE 210
            + ++DDP N F VSSDFCHWGSRF+Y +Y  + G                   I KSIE
Sbjct: 181 LSTHLDDPENAFIVSSDFCHWGSRFSYQYYIPQPGKDGMSLKSRGHRVPEGGQEIWKSIE 240

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-----------GNCSTKI 259
            LD+  ++ IE+G  + F  YL    NT+CGRHPI V +  L            +   K 
Sbjct: 241 GLDQAAIEAIESGKHENFVDYLSHTRNTVCGRHPIGVIMAGLEEVLKIRAERGEDVEGKG 300

Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAA 287
             KF+RYEQSSQCK+  DSSVSYASA A
Sbjct: 301 LFKFVRYEQSSQCKSVEDSSVSYASAFA 328


>gi|313221650|emb|CBY36133.1| unnamed protein product [Oikopleura dioica]
          Length = 292

 Score =  270 bits (689), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 130/288 (45%), Positives = 187/288 (64%), Gaps = 8/288 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRASHAGSWY+ + + L  +L  W+  +      + + VI PHAGY+YS   AA++F  +
Sbjct: 4   RRASHAGSWYSGDPRELENQLKNWI--SKAKFEKKAKAVIVPHAGYAYSAPTAAWSFLQL 61

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           D  +  ++F++GPSHH Y P CAL      +TP+G+L +D +++ EL ATG F  MD+  
Sbjct: 62  DAQTTKKIFVIGPSHHVYLPNCALPVVKECETPLGNLRIDKDIVTELHATGLFCTMDVPT 121

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EM LP+LA +F+  L  + +VPI+VG++  E EA Y ++FAKY++DP   F +
Sbjct: 122 DEEEHSIEMQLPFLAHIFKNRLEEVSVVPIMVGSIKQEKEADYAQVFAKYLNDPEVVFVI 181

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG RF Y +  +++  I +SI+A+D+ GMD I + +   F  YL    NTICGR
Sbjct: 182 SSDFCHWGKRFRYTYRLEEYEQIFESIDAVDREGMDHIASKNLSNFHNYLRRTKNTICGR 241

Query: 243 HPISVFLHMLGNCSTKI----KIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +PI + L  +     K+    +IKFL+Y QSSQ ++  DSSVSYASA 
Sbjct: 242 NPICLLLATINLLEQKVNMQSQIKFLKYAQSSQVRSPNDSSVSYASAV 289


>gi|189201744|ref|XP_001937208.1| hypothetical protein PTRG_06875 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187984307|gb|EDU49795.1| hypothetical protein PTRG_06875 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 377

 Score =  270 bits (689), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 142/329 (43%), Positives = 191/329 (58%), Gaps = 47/329 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
           R ASHAGSWYTDN + L+++LDGWL                  E  +P +P  R +IAPH
Sbjct: 48  RSASHAGSWYTDNGQLLSQQLDGWLEAVPNSTTPIGTASSQQGEVTIP-TPNARAIIAPH 106

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AG+SYSG AAA+A+ + D  +  RVFLLGPSHHYY    A++    Y+TP+GDL +D E+
Sbjct: 107 AGFSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAAITGCDKYRTPLGDLTIDTEL 166

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
           ++E++     E M    DE EHS+EMHLPY+ K+          +   + ++PI+VG  +
Sbjct: 167 VQEIQTEWDLETMSKRTDEDEHSLEMHLPYIYKMLSLKNASFQNDTTSVPLIPIMVGNTD 226

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------- 204
           A  E  YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y   +G               
Sbjct: 227 AAAEVHYGSLLAPYLSDPANIFVISSDFCHWGSRFRYTYYQPANGSAATQLRTSSRVPSD 286

Query: 205 --IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GNCSTK 258
             IH+SI A+D+  MD +E+G    F   L +  NT+CGRHPI +F+  +    G     
Sbjct: 287 YPIHESIAAVDQESMDAVESGSHRQFLDQLRKTGNTVCGRHPIGLFMAAVERAEGLGEGS 346

Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            K KFLRYE+SS  +  +DSSVSY SA A
Sbjct: 347 GKFKFLRYERSSLVEEVKDSSVSYCSAFA 375


>gi|330920737|ref|XP_003299128.1| hypothetical protein PTT_10063 [Pyrenophora teres f. teres 0-1]
 gi|311327315|gb|EFQ92775.1| hypothetical protein PTT_10063 [Pyrenophora teres f. teres 0-1]
          Length = 333

 Score =  269 bits (688), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 142/329 (43%), Positives = 190/329 (57%), Gaps = 47/329 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
           R+A+HAGSWYTDN + L+++LDGWL                  E  +P +P  R +IAPH
Sbjct: 4   RKATHAGSWYTDNGQLLSQQLDGWLEAVPSSTTPIGTTSSQQGEVTIP-TPNARAIIAPH 62

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AG+SYSG AAA+A+ + D  +  RVFLLGPSHHYY    A +    Y+TP+GDL +D E+
Sbjct: 63  AGFSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAATTGCDTYRTPLGDLAIDTEL 122

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
           + E++A    E M    DE EHS+EMHLPY+ K+          +   + ++PIL+G  +
Sbjct: 123 VHEIQAAWDLETMTKRTDEDEHSLEMHLPYIYKMLSLNNTSFQNDTTSVPLIPILIGNTD 182

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------- 204
           A  EA YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y   +                
Sbjct: 183 AATEAHYGSLLAPYLSDPANVFVISSDFCHWGSRFRYTYYQPANASPATHLKTSSRVPSD 242

Query: 205 --IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GNCSTK 258
             IH+SI A+D   MD +E+G    F   L +  NT+CGRHPI +F+  +    G     
Sbjct: 243 YPIHESIAAVDHESMDAVESGSHRQFLDQLRKTGNTVCGRHPIGLFMAAVEQAEGLGQGS 302

Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            K KFLRYE+SS  +  +DSSVSY SA A
Sbjct: 303 GKFKFLRYERSSLVQDVKDSSVSYCSAFA 331


>gi|406864123|gb|EKD17169.1| DUF52 domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 330

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 152/327 (46%), Positives = 192/327 (58%), Gaps = 46/327 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--------VRGVIAPHAGYSYSGRA 56
           R ASHAGSWY+    +L+ ELD WL  A +P S +         R +IAPHAGYSYSG A
Sbjct: 4   RSASHAGSWYSSGKDKLSNELDQWL--AQVPDSIDGTQLPVAGARVIIAPHAGYSYSGPA 61

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+ ++D ++  R+FLLGPSH  Y P CA S  + Y TP+GDL +D E++++L+ TGK
Sbjct: 62  AAWAYKSLDLSNAQRIFLLGPSHALYLPGCATSKHSRYATPLGDLVIDTEIVKKLQDTGK 121

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGRLFA 170
           FE M    DE EHS+EMHLPY+ K+             IVPILVG  +A +E  YG + A
Sbjct: 122 FEKMSTDADETEHSLEMHLPYIYKLCSQSFNSPAEFPPIVPILVGNTSATSEKAYGDVLA 181

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------DKKHGVIHKS 208
            Y+ DP++ F VSSDFCHWG RF Y +Y                      D     IH+S
Sbjct: 182 PYLADPTSVFIVSSDFCHWGLRFQYTYYLPASPSAAAGTAGGYSLKQRHDDPTDPPIHES 241

Query: 209 IEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML------GNCST--KIK 260
           I  LDK+ MD IETG  D F   L E  NT+CGRHPI V +  +      G  S   K K
Sbjct: 242 IGRLDKLAMDAIETGKHDVFLGNLKETGNTVCGRHPIGVVMAAIEVLEKEGKVSPEGKGK 301

Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAA 287
            KF+RYE+SS+ +   DSSVSYASA A
Sbjct: 302 FKFVRYERSSEVEEISDSSVSYASAYA 328


>gi|402225258|gb|EJU05319.1| UPF0103-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 327

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 149/321 (46%), Positives = 185/321 (57%), Gaps = 36/321 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRG---VIAPHAGYSYSGR 55
           M  +RRA+HA SWY+    RL  +L GWL   E     SP ++G   +I PHAGY YSG 
Sbjct: 2   MSGMRRATHANSWYSGARSRLDHQLKGWLSAVEPSSDYSPPIKGCKAIIGPHAGYDYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            AA+A+  ID T I RVFLLGPSHH Y   CAL+    Y TP+G+LP+DL    ELKATG
Sbjct: 62  TAAWAYKAIDVTGIKRVFLLGPSHHVYLDGCALTPFATYDTPLGELPVDLATTAELKATG 121

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           KF +M    DE EHS+E+HLPY+ K FEG  IK+VPILVGA+N + EA +G + A+Y++ 
Sbjct: 122 KFSMMSNKTDENEHSIELHLPYVRKTFEGCDIKVVPILVGAINDKKEADFGGILAQYLEQ 181

Query: 176 PSNFFSVSSDFCHWGSRFNYMHY-----------------DKKHG--VIHKSIEALDKMG 216
               F VSSDFCHWG RF++  Y                 D   G   IH+SI  LD   
Sbjct: 182 EDTMFVVSSDFCHWGERFDFTFYYPGAPSPTSSIRLEDDSDIPRGGQPIHESISRLDHEA 241

Query: 217 MDIIETGDPDAFKK--------YLLEYDNTICGRHPISVFLHMLGNCST----KIKIKFL 264
           ++++      +  K        YL    NTICGRHPI V L  L         K  +KF+
Sbjct: 242 INMLTLPTSTSSAKQAHASLSSYLRRTGNTICGRHPIGVLLGALATLEEDNKHKASLKFV 301

Query: 265 RYEQSSQCKTKRDSSVSYASA 285
           +Y QSS CK   DSSVSYASA
Sbjct: 302 KYAQSSPCKHIEDSSVSYASA 322


>gi|405978340|gb|EKC42740.1| Protein MEMO1 [Crassostrea gigas]
          Length = 272

 Score =  267 bits (683), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 132/248 (53%), Positives = 168/248 (67%), Gaps = 9/248 (3%)

Query: 47  HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
           HAGY Y G    +A+  IDP++I R+F+LGPSHH     CAL+  + Y+TP+ DL +D +
Sbjct: 22  HAGYVYCGACGGHAYRQIDPSNIKRIFILGPSHHVRLSGCALTETSHYQTPLYDLTIDQK 81

Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAM 164
           + EEL AT  FE M +  DE EHS+EMHLPY+AKV E       I+P+LVG+++A+ E +
Sbjct: 82  INEELFATKAFEKMKMSTDEDEHSIEMHLPYIAKVMERRRGQFTIIPVLVGSLSADKEKL 141

Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
           YG +F++Y+ DP NFF +SSDFCHWG RF Y  YDK  G I +SIEALDKMGMD IE  D
Sbjct: 142 YGSIFSQYLADPENFFVISSDFCHWGQRFRYTFYDKSCGDIWQSIEALDKMGMDAIEQMD 201

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHML------GNCSTKIKIKFLRYEQSSQCKTKRDS 278
           P  F  YL EY+NTICGR+PI V L  +      GN   ++  KF+ Y QSS+CK   DS
Sbjct: 202 PAKFSHYLKEYENTICGRYPIGVLLSAIDALRRNGN-GRRMSFKFMNYAQSSRCKKMSDS 260

Query: 279 SVSYASAA 286
           SVSYAS A
Sbjct: 261 SVSYASGA 268


>gi|347836555|emb|CCD51127.1| similar to DUF52 domain protein [Botryotinia fuckeliana]
          Length = 331

 Score =  266 bits (681), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 146/331 (44%), Positives = 188/331 (56%), Gaps = 46/331 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
           M   R ASHAGSWYT N  +L+ ELD WL +  +P +        P  R +IAPHAGYSY
Sbjct: 1   MGSTREASHAGSWYTSNPSKLSSELDNWLSQ--VPSTISDTKLPIPGARVIIAPHAGYSY 58

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           SG AAA+A+  +D ++  RVFLLGPSH +Y  +CALS  + Y TP+GDL +D  +++EL 
Sbjct: 59  SGPAAAWAYATLDLSTAKRVFLLGPSHAWYLTECALSKHSKYATPLGDLTIDTALVQELS 118

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYG 166
           ATG+F+ M    DE EHS+E+HLPY+ K+             ++PILVG  NA  E  +G
Sbjct: 119 ATGEFKRMSTDQDETEHSLELHLPYIYKLLSLNFPSPSSFPPLIPILVGNTNAATERKFG 178

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV------------ 204
            + A Y+ DPSN F +SSDFCHWGSRF Y +Y          DK  G             
Sbjct: 179 SILAPYLSDPSNIFIISSDFCHWGSRFQYTYYLPASPSAEVRDKSGGYSLSRRDKAPTQP 238

Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-------S 256
            IH+SI  LD++ M  I  G    F   L E  NT+CGRHPI V +  +           
Sbjct: 239 PIHESIGRLDQLSMSAISGGVHQEFLDNLEETGNTVCGRHPIGVIMAAIEKVKEEKSLDG 298

Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            K K KF+RYE+S++     D SVSYASA A
Sbjct: 299 EKGKFKFVRYERSNEVTRVDDGSVSYASAYA 329


>gi|169596398|ref|XP_001791623.1| hypothetical protein SNOG_00959 [Phaeosphaeria nodorum SN15]
 gi|160701301|gb|EAT92454.2| hypothetical protein SNOG_00959 [Phaeosphaeria nodorum SN15]
          Length = 340

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 145/336 (43%), Positives = 193/336 (57%), Gaps = 54/336 (16%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
           R ASHAGSWYT N K+L+++LDGWL                  +  +P +P  R +IAPH
Sbjct: 4   REASHAGSWYTSNGKQLSQQLDGWLEAVPSSTTPIGTASSEQGDVSIP-TPNARAIIAPH 62

Query: 48  AGYSYSGRAAAYAFGNIDPTSIS-------------RVFLLGPSHHYYTPKCALSTATVY 94
           AGYSYSG AAA+A+ + D  + S             RVFLLGPSHH+Y    A +    Y
Sbjct: 63  AGYSYSGPAAAWAYKSADWANASGVYMNITDHKSSKRVFLLGPSHHHYLSGAATTACDKY 122

Query: 95  KTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH----LIKIV 150
            TP+GDL +D  +++E+K     E M   VDEAEHS+EMHLPY+ K+   H     + +V
Sbjct: 123 ATPLGDLIIDTALVQEIKQEWGLETMSQDVDEAEHSLEMHLPYIYKMLSLHNNPSSVPLV 182

Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------ 204
           PI++G  +   E+ YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y+   G       
Sbjct: 183 PIMIGNTSPSTESKYGSLLAPYLSDPTNIFVISSDFCHWGSRFRYTYYESPDGASATQLT 242

Query: 205 ----------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN 254
                     IH+SI+A+DK  MD +E+G    F + L E  NT+CGRHPI VF+  + +
Sbjct: 243 RKSKIDEDWPIHESIKAVDKESMDAVESGHHKRFLEQLKETGNTVCGRHPIGVFMAAVES 302

Query: 255 CST---KIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
                 K + KF+RYE+SS  +   DSSVSY SA A
Sbjct: 303 ADVGEGKGRFKFVRYERSSLVEDYGDSSVSYCSAFA 338


>gi|440633615|gb|ELR03534.1| hypothetical protein GMDG_01285 [Geomyces destructans 20631-21]
          Length = 327

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 191/324 (58%), Gaps = 43/324 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
           R ASHAGSWY+     L+ ELDGWL  A +P S        P  R +IAPHAGYSYSG A
Sbjct: 4   RAASHAGSWYSSAPATLSSELDGWL--AQVPDSIDGNKLPVPGARVIIAPHAGYSYSGPA 61

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+ ++D + + RVFLLGPSH  Y   CA+S+   Y TP+G+L LD   I+EL+AT K
Sbjct: 62  AAWAYKSLDLSKVKRVFLLGPSHTLYLSGCAISSQEYYATPLGNLKLDQATIKELQATSK 121

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLFA 170
           F+      +EAEHS+EMHLPY+ KV            ++VP+LVGA +   E  YG++FA
Sbjct: 122 FDPWKAKSEEAEHSLEMHLPYIYKVLANTFKSPDDFPQLVPVLVGATSGPTERSYGQIFA 181

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY-----------------DK--KHGVIHKSIEA 211
            Y+ DP++ F VSSDFCHWG RF Y +Y                 DK   +  IH+SI  
Sbjct: 182 PYLADPTSVFVVSSDFCHWGERFQYTYYLPASPSANSDGYSLRRRDKTPTNPWIHESIGK 241

Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI--------KF 263
           +DK+ MD IE G    F   L +  NT+CGRHPI + +  L     + KI        KF
Sbjct: 242 IDKLAMDAIEQGTHQGFLDNLEDTGNTVCGRHPIGLVMAALEVLKQEGKISAENGGRFKF 301

Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
           +RYE+SS  K  +DSSVSYASA A
Sbjct: 302 VRYERSSDVKDIKDSSVSYASAVA 325


>gi|330797324|ref|XP_003286711.1| hypothetical protein DICPUDRAFT_94236 [Dictyostelium purpureum]
 gi|325083309|gb|EGC36765.1| hypothetical protein DICPUDRAFT_94236 [Dictyostelium purpureum]
          Length = 277

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 139/285 (48%), Positives = 175/285 (61%), Gaps = 16/285 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  SHAG+WY DN+ RL ++L  WL+ A    S  V+ VIAPHAGY+YSGR A++A+ N
Sbjct: 2   IRNCSHAGTWYFDNASRLEKQLSDWLKVAKRLNS-NVKSVIAPHAGYTYSGRTASHAYIN 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P    RVF+LGPSHH Y   C L+    ++TPIG+L +D E+ + L  TG F      
Sbjct: 61  LVPEKFKRVFILGPSHHVYMKTCGLTKLDTWETPIGNLKVDREISDTLYNTGDFVWNSKS 120

Query: 124 VDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           VDE EHS+EM LPY+AKV E  L  +KIVPI+VG ++   E  YG++ A Y DDP NFF 
Sbjct: 121 VDEDEHSLEMQLPYIAKVAENKLSDLKIVPIMVGNLSLNLEEKYGKILAPYFDDPDNFFV 180

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSDFCHWG RF Y  Y+ +   I++ IE LDK  M IIETGDP  F KYL      I  
Sbjct: 181 ISSDFCHWGERFGYTKYENQQVPIYQYIEELDKQAMSIIETGDPVQFDKYL-----KILK 235

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            H        L +   +  I  + YEQSS+    RDSSVSYA  A
Sbjct: 236 IH--------LWSLPNRYDISSVHYEQSSKVVQPRDSSVSYAVLA 272


>gi|410901599|ref|XP_003964283.1| PREDICTED: protein MEMO1-like isoform 2 [Takifugu rubripes]
          Length = 274

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 137/290 (47%), Positives = 177/290 (61%), Gaps = 29/290 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASH+GSWY+ +  +L  +L+ WL +A   + P  R +IAP                  
Sbjct: 8   REASHSGSWYSASGSQLNAQLEDWLSKANSKRRP-ARAIIAPR----------------- 49

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                 RVF+LGPSHH +   CALS+A +Y+TP+ D+ +D +V  EL  TG F+ M+I  
Sbjct: 50  ------RVFILGPSHHVHLTCCALSSAEIYRTPLYDMRIDQKVYAELWKTGLFDRMNIKT 103

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E       IVPILVGA++   E  YG+L +KY+ DPSN F +
Sbjct: 104 DEDEHSIEMHLPYTAKAMESRKDDFSIVPILVGALSENKEHEYGKLLSKYLADPSNLFVI 163

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHWG+RF+Y +YD+  G I++SIE LDKMGM IIE  DP +F  Y+ +Y NTICGR
Sbjct: 164 SSDFCHWGNRFHYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPPSFSNYIKKYRNTICGR 223

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QS QC+    SSVSYA+ A  V
Sbjct: 224 HPIGVLLNAVAELRKSGLDMNFSFLNYAQSGQCRNWEQSSVSYAAGALYV 273


>gi|396468359|ref|XP_003838155.1| similar to DUF52 domain protein [Leptosphaeria maculans JN3]
 gi|312214722|emb|CBX94676.1| similar to DUF52 domain protein [Leptosphaeria maculans JN3]
          Length = 332

 Score =  264 bits (674), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 139/327 (42%), Positives = 187/327 (57%), Gaps = 44/327 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLP----------------KSPEVRGVIAPHA 48
           R  SHAGSWY+DN + L+ +LDGWL     P                 +P  R +IAPHA
Sbjct: 4   RSPSHAGSWYSDNKELLSRQLDGWLEAVPSPARPIGTSSSQQGEVSIPTPNARAIIAPHA 63

Query: 49  GYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           GYSYSG AAA+A+  +D T+  RVFLLGPSHHYY    A +    Y TP+GDL +D  ++
Sbjct: 64  GYSYSGPAAAWAYKTVDWTNAKRVFLLGPSHHYYLTGAATTGCDNYGTPLGDLIVDTALV 123

Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH---------LIKIVPILVGAVNA 159
           +E++   + E+M   VDE EHS+EMHLPY+ K+   +          + +VPI++G  + 
Sbjct: 124 KEIQTKWQLEVMSKSVDEDEHSLEMHLPYIHKMLSLNNPSFQSSPSSVPLVPIMIGNTDP 183

Query: 160 ENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV--------------- 204
             E+ YG L A Y+ DPSN F VSSDFCHWGSRF Y +Y    G                
Sbjct: 184 STESHYGALLAPYLSDPSNIFVVSSDFCHWGSRFRYTYYQPADGSAAMTLRSSSRVPPEY 243

Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST---KIK 260
            IH+SI A+D+  M  +E+G+   F + L +  NT+CGRHPI VF+  +        K +
Sbjct: 244 PIHESIAAVDRESMAAVESGEHTRFLEQLRKTGNTVCGRHPIGVFMAAVEKAEMDGGKGR 303

Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAA 287
            KFLRYE+S   ++ +DSSVSY S  A
Sbjct: 304 FKFLRYERSGLVESVKDSSVSYCSGFA 330


>gi|145514724|ref|XP_001443267.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410645|emb|CAK75870.1| unnamed protein product [Paramecium tetraurelia]
          Length = 294

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 128/284 (45%), Positives = 179/284 (63%), Gaps = 4/284 (1%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           KIR ASHAGSWY  + K+L  +L+ +L +A     P ++ +I PHAG+SYSG  AA+A+ 
Sbjct: 7   KIREASHAGSWYIGDGKQLDAQLNDFLSKAKGETIPNIKAIIGPHAGFSYSGPTAAFAYQ 66

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--FELM 120
           ++      RVFLLGP HH Y     LS    Y+TP+G++ LD   I++L A  K  +   
Sbjct: 67  HLVQKERMRVFLLGPCHHTYIKGIGLSELEQYETPLGNIELDQPTIKQLSAELKKNYVFT 126

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           +  ++E EHS+EMHLP++ K+F     K++PI+VGA + E +A    +  KY  DP+  F
Sbjct: 127 NKDIEEQEHSLEMHLPFIYKIFPK--CKLIPIMVGATSEEQDAQVASVLVKYFVDPNTVF 184

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDFCHWG RF Y  Y+K+HG IH+SI  LD   + +IE+ +   F KYL E +NTIC
Sbjct: 185 VISSDFCHWGKRFQYTPYNKEHGEIHQSIAQLDGQAIKLIESHNIKEFYKYLDETENTIC 244

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           GRHPI V L+++     ++K +  RY QSSQ     DSSVSYA+
Sbjct: 245 GRHPICVLLNIINLSKLQLKTQLARYAQSSQVTKPNDSSVSYAA 288


>gi|451852596|gb|EMD65891.1| hypothetical protein COCSADRAFT_87269 [Cochliobolus sativus ND90Pr]
          Length = 333

 Score =  261 bits (666), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 187/329 (56%), Gaps = 47/329 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
           R+ASHAGSWYTDN + L+++LDGWL                  +  +P +P  R +IAPH
Sbjct: 4   RKASHAGSWYTDNGQLLSQQLDGWLGAVPASTTPIGSASSQKGQVSIP-TPSARAIIAPH 62

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AGYSYSG AAA+A+ + D  +  RVFLLGPSHHYY    A +    Y TP+GDL +D  +
Sbjct: 63  AGYSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAATTGCDKYHTPLGDLIVDTTL 122

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
           ++ ++     E+M    DE EHS+EMHLPY+ K+          +   + ++PI+VG  +
Sbjct: 123 VQTIQQEWDLEIMSKRTDEDEHSLEMHLPYIYKMLSLKNSDFQSDSSSVPLIPIMVGNTD 182

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------- 204
           A  E  YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y   +G               
Sbjct: 183 AAAETRYGSLLAPYLADPTNIFVISSDFCHWGSRFRYTYYQPPNGAAATQLKSASRVPSD 242

Query: 205 --IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST----K 258
             IH+SI A+D   MD IETG    F   L +  NT+CGRHPI +F+  + +       K
Sbjct: 243 YPIHESIAAVDHESMDAIETGSHRKFLDQLRKTGNTVCGRHPIGLFMAAVESTEALEEGK 302

Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
               FLRYE+SS  +  +DSSVSY SA A
Sbjct: 303 GIFSFLRYERSSLVEDLQDSSVSYCSAFA 331


>gi|451997153|gb|EMD89618.1| hypothetical protein COCHEDRAFT_1194970 [Cochliobolus
           heterostrophus C5]
          Length = 333

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 142/329 (43%), Positives = 188/329 (57%), Gaps = 47/329 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
           R+ASHAGSWYTDN + L+++LDGWL                  +  +P +P  R +IAPH
Sbjct: 4   RKASHAGSWYTDNGQLLSQQLDGWLGAVPASTTPIGSASSQKGQVSIP-TPNARAIIAPH 62

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AGYSYSG AAA+A+ + D  +  RVFLLGPSHHYY    A +    Y TP+GDL +D  +
Sbjct: 63  AGYSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAATTGCDKYHTPLGDLIVDTTL 122

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
           ++ ++     E+M    DE EHS+EMHLPY+ K+          +   + +VPI+VG  +
Sbjct: 123 VQTIQEEWDLEIMSKRTDEDEHSLEMHLPYIYKMLSLKNKDFQSDPSSVPLVPIMVGNTD 182

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------- 203
           A  EA YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y   +G               
Sbjct: 183 AAAEARYGSLLAPYLADPANIFVISSDFCHWGSRFRYTYYQPPNGGAATQLKSASRVPSD 242

Query: 204 -VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST----K 258
             IH+SI A+D   MD IETG    F   L +  NT+CGRHPI +F+  + +       K
Sbjct: 243 YPIHESIAAVDHESMDAIETGSHRKFLDQLRKTGNTVCGRHPIGLFMAAVESAEALEEGK 302

Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
               FLRYE+SS  +  +DSSVSY SA A
Sbjct: 303 GIFSFLRYERSSLVEDLQDSSVSYCSAFA 331


>gi|32566861|ref|NP_741570.2| Protein TAG-253, isoform a [Caenorhabditis elegans]
 gi|351059096|emb|CCD66949.1| Protein TAG-253, isoform a [Caenorhabditis elegans]
          Length = 302

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 127/275 (46%), Positives = 177/275 (64%), Gaps = 6/275 (2%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           E  R ASHAGSWY  N + L  +L  WL  AG P+    R +I+PHAGYSY G  AAYAF
Sbjct: 8   EHTRSASHAGSWYNANQRDLDRQLTKWLDNAG-PRIGTARALISPHAGYSYCGETAAYAF 66

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +  +++ RVF+LGPSH      CA++T + Y+TP+GDL +D ++ EEL+AT  F+LMD
Sbjct: 67  KQVVSSAVERVFILGPSHVVALNGCAITTCSKYRTPLGDLIVDHKINEELRATRHFDLMD 126

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
              +E+EHS+EM LP++AKV       IVP+LVG++    +  YG +FA Y++DP N F 
Sbjct: 127 RRDEESEHSIEMQLPFIAKVMGSKRYTIVPVLVGSLPGSRQQTYGNIFAHYMEDPRNLFV 186

Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           +SSDFCHWG RF++  YD+   + I++ I  +DK GM  IET +P AF  YL +  NTIC
Sbjct: 187 ISSDFCHWGERFSFSPYDRHSSIPIYEQITNMDKQGMSAIETLNPAAFNDYLKKTQNTIC 246

Query: 241 GRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQ 271
           GR+PI + L    H   + +   + +FL Y QS++
Sbjct: 247 GRNPILIMLQAAEHFRISNNHTHEFRFLHYTQSNK 281


>gi|403333881|gb|EJY66071.1| Cell motility mediator [Oxytricha trifallax]
          Length = 309

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 137/299 (45%), Positives = 182/299 (60%), Gaps = 16/299 (5%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLP--------KSPEVRGVIAPHAGYSYSGRA 56
           R ASHAGSWY  + K+L  +L  +L +A +            ++R +IAPHAG  YSG+ 
Sbjct: 7   RLASHAGSWYPSDKKKLDLQLTQYLEKAKIDLGDRIKDNSRGKIRAIIAPHAGIDYSGQV 66

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AAYA+  +DP    RV LLGPSHH Y   CAL+  + Y+TP+GDL +D E  E+L     
Sbjct: 67  AAYAYCQLDPQQYKRVILLGPSHHVYLESCALTLCSKYQTPLGDLVIDKEFNEQLLKENS 126

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV-DD 175
           F+ M   +DE EHS+EMHLPY+ KVF G  I++VPILVG +  + EA+YG+  AKY+ D 
Sbjct: 127 FKQMPKQIDEEEHSLEMHLPYIKKVF-GDQIQLVPILVGNLTKDKEALYGKTLAKYLRDK 185

Query: 176 PSNFFSVSSDFCHWGSRFNYMHY----DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
             N F +SSDFCHWG  F+YM+     DK    I K IE +DK G++ I   D + F +Y
Sbjct: 186 DENLFIISSDFCHWGKDFDYMYLVEKVDKDQNTISKQIERIDKDGINHIIGQDVEKFNEY 245

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTK-IKIKFLRYEQSSQ-CKTKRDSSVSYASAAAK 288
           L   DNTICGRHPI V L  L    T+  + +  +Y QS+     KR++SVSYAS   +
Sbjct: 246 LDITDNTICGRHPIGVLLECLKEMDTRTFETELAQYGQSTNIIDDKRETSVSYASIVTR 304


>gi|25146594|ref|NP_741571.1| Protein TAG-253, isoform b [Caenorhabditis elegans]
 gi|30316377|sp|Q22915.3|TG253_CAEEL RecName: Full=MEMO1 family protein tag-253
 gi|351059097|emb|CCD66950.1| Protein TAG-253, isoform b [Caenorhabditis elegans]
          Length = 350

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 127/275 (46%), Positives = 177/275 (64%), Gaps = 6/275 (2%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           E  R ASHAGSWY  N + L  +L  WL  AG P+    R +I+PHAGYSY G  AAYAF
Sbjct: 56  EHTRSASHAGSWYNANQRDLDRQLTKWLDNAG-PRIGTARALISPHAGYSYCGETAAYAF 114

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +  +++ RVF+LGPSH      CA++T + Y+TP+GDL +D ++ EEL+AT  F+LMD
Sbjct: 115 KQVVSSAVERVFILGPSHVVALNGCAITTCSKYRTPLGDLIVDHKINEELRATRHFDLMD 174

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
              +E+EHS+EM LP++AKV       IVP+LVG++    +  YG +FA Y++DP N F 
Sbjct: 175 RRDEESEHSIEMQLPFIAKVMGSKRYTIVPVLVGSLPGSRQQTYGNIFAHYMEDPRNLFV 234

Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           +SSDFCHWG RF++  YD+   + I++ I  +DK GM  IET +P AF  YL +  NTIC
Sbjct: 235 ISSDFCHWGERFSFSPYDRHSSIPIYEQITNMDKQGMSAIETLNPAAFNDYLKKTQNTIC 294

Query: 241 GRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQ 271
           GR+PI + L    H   + +   + +FL Y QS++
Sbjct: 295 GRNPILIMLQAAEHFRISNNHTHEFRFLHYTQSNK 329


>gi|417398158|gb|JAA46112.1| Putative dioxygenase [Desmodus rotundus]
          Length = 271

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 133/290 (45%), Positives = 171/290 (58%), Gaps = 32/290 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 8   REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           DP+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  
Sbjct: 67  DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126

Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSDFCHW                               E  DP +F  YL +Y NTICGR
Sbjct: 187 SSDFCHWXXX--------------------------XXEQLDPVSFSNYLKKYHNTICGR 220

Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           HPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 221 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 270


>gi|340503207|gb|EGR29819.1| hypothetical protein IMG5_148030 [Ichthyophthirius multifiliis]
          Length = 330

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 125/295 (42%), Positives = 180/295 (61%), Gaps = 7/295 (2%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           +IR +S+AGSWY++N   L  +++ WL  A   +     V+ +I PHAG+S+SG   AYA
Sbjct: 34  EIRESSYAGSWYSNNEHELNIQINVWLDMAKCEIQNIQAVKAIIVPHAGFSFSGPTQAYA 93

Query: 61  FGNIDP---TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           +  +         RVFL+GP H+ +  +C LS   VY+TP+G++ +D E I EL+    F
Sbjct: 94  YKYLKQYCHQKKLRVFLMGPFHYIFIRQCGLSGMKVYETPVGNIEIDQETINELRELAYF 153

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
           E  D   +E EHS+EM L +L K      IK++PI+VGA++   E +YG++F+K+VD   
Sbjct: 154 ETTDKDAEEEEHSLEMQLCFLIKTLGAENIKLIPIMVGALDQSQETLYGQIFSKFVDQDD 213

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
             F VS+DFCHWG ++ Y +++K  G I++ IE LD+ GM IIE  D   F  YL E +N
Sbjct: 214 TLFIVSTDFCHWGQKYKYTYFNKDDGEIYEQIEKLDQRGMTIIEKHDNQEFISYLKETEN 273

Query: 238 TICGRHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
            ICGRH +S+ LH   N   S  I+  F+RY  SS  K K D SVSYASA + ++
Sbjct: 274 NICGRHALSILLHAFNNSKYSKDIQTHFVRYALSSLVKDKSDQSVSYASAVSIIE 328


>gi|358060843|dbj|GAA93461.1| hypothetical protein E5Q_00102 [Mixia osmundae IAM 14324]
          Length = 339

 Score =  258 bits (659), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 153/330 (46%), Positives = 196/330 (59%), Gaps = 45/330 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG----------LPKSPEVRGVIAPHAGY 50
           M   R A+HAGSWY D+SK L E L  WL E             P  P ++ +I+PHAGY
Sbjct: 1   MSGARPATHAGSWYDDDSKTLDELLSDWLGEVDSTPCQEDKDFAPPVPNLKAIISPHAGY 60

Query: 51  SYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           +YSG AAA+A+  I+P +  RVF+LGPSHH Y   CALS  + YKTP+G+LPLDL+ I  
Sbjct: 61  AYSGAAAAWAYACIEPFAYKRVFILGPSHHVYLNGCALSQRSSYKTPLGNLPLDLDTIAR 120

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
           L+AT +FE M    D+ EHS+EM LPY+AKVF+GH + IVPI+VG+++   E+ +G+L A
Sbjct: 121 LRATRQFEDMSPSADDDEHSIEMQLPYIAKVFQGHAVNIVPIMVGSISTSKESAFGKLLA 180

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALD 213
            Y+ D    F +SSDFCHWG+RF Y +Y  K                    IHKSIE +D
Sbjct: 181 PYLADEDTLFVISSDFCHWGTRFGYTYYVPKEDSSAPVQLAKTNSAHADFPIHKSIELID 240

Query: 214 KMGMDIIE-TGDP-----DA---FKKYLLEYDNTICGRHPISVFLHML---------GNC 255
           + GM  +E T D      DA   F  YL    NTICGRHPI V L  +         G  
Sbjct: 241 REGMRHVELTKDSGKSAQDAHRDFASYLTRTKNTICGRHPIGVLLATIASLQDSKTAGFA 300

Query: 256 STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
              +++KF+RYE+SS  K   DSSVSYASA
Sbjct: 301 QDDVRLKFVRYERSSLVKKLSDSSVSYASA 330


>gi|145497196|ref|XP_001434587.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401714|emb|CAK67190.1| unnamed protein product [Paramecium tetraurelia]
          Length = 345

 Score =  256 bits (654), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 118/289 (40%), Positives = 184/289 (63%), Gaps = 7/289 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R A HAGSWY+  S  L  +++ WL +A   +    +++ ++ PHAGY+YSG  AA+++
Sbjct: 51  VREAVHAGSWYSSKSNELKIQINCWLEQAKAEVTTVAQLKALVVPHAGYAYSGPTAAFSY 110

Query: 62  GNID---PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
             +    P+   +VF+LGP H+ Y  +C L+   +Y+TP+G++ +DLE +++L   G FE
Sbjct: 111 KYLKKYPPSEKLKVFILGPCHYVYITQCCLTRQEIYETPLGNIKVDLETVKQLHEQGLFE 170

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             D   +E EHS+EM LP+LA +       I+PI+VG+++A++E  YGRL ++Y D    
Sbjct: 171 QSDKDAEEEEHSIEMQLPFLAHILGTDNFTIIPIMVGSIDAKSEEYYGRLLSEYFDMDDT 230

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +S+DFCHWG++F Y +Y+   G I +SIE LD+  M+ IE  D D F  YL EY+N 
Sbjct: 231 LFIISTDFCHWGTKFAYTYYNSADGEIFESIEKLDQKAMEHIELHDLDKFNDYLREYENN 290

Query: 239 ICGRHPISVFLHMLGNCSTK--IKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +CG+H I++ LH +        ++ KF+RY QS   + K+DSSVSYA+A
Sbjct: 291 VCGKHCIAILLHCIAMSQNTHMMETKFIRYAQSCLVRDKKDSSVSYAAA 339


>gi|145542007|ref|XP_001456691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424504|emb|CAK89294.1| unnamed protein product [Paramecium tetraurelia]
          Length = 344

 Score =  256 bits (654), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 118/289 (40%), Positives = 184/289 (63%), Gaps = 7/289 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R A HAGSWY+  S  L  +++ WL +A   +    +++ ++ PHAGY+YSG  AA+++
Sbjct: 50  VREAVHAGSWYSSKSNELKIQINCWLEQAKAEVTTVAQLKALVVPHAGYAYSGPTAAFSY 109

Query: 62  GNID---PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
             +    P+   +VF+LGP H+ Y  +C L+   +Y+TP+G++ +DLE +++L   G FE
Sbjct: 110 KYLKKYPPSEKLKVFILGPCHYVYITQCCLTRQEIYETPLGNIKVDLETVKQLHEQGLFE 169

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             D   +E EHS+EM LP+LA +       I+PI+VG+++A++E  YGRL ++Y D    
Sbjct: 170 QSDKDAEEEEHSIEMQLPFLAHILGTDNFTIIPIMVGSIDAKSEEYYGRLLSEYFDMDDT 229

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +S+DFCHWG++F Y +Y+   G I +SIE LD+  M+ IE  D D F  YL EY+N 
Sbjct: 230 LFIISTDFCHWGTKFAYTYYNSADGEIFESIEKLDQKAMEHIELHDLDKFNDYLREYENN 289

Query: 239 ICGRHPISVFLHMLGNCSTK--IKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +CG+H I++ LH +        ++ KF+RY QS   + K+DSSVSYA+A
Sbjct: 290 VCGKHCIAILLHCIAMSQNTHMMETKFIRYAQSCLVRDKKDSSVSYAAA 338


>gi|452988647|gb|EME88402.1| hypothetical protein MYCFIDRAFT_86111 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 336

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/330 (43%), Positives = 182/330 (55%), Gaps = 44/330 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----------------PEVRGVIAPH 47
           IR ASHAGSWY+D+  +L  +LDGWL     P S                P  R +IAPH
Sbjct: 7   IRDASHAGSWYSDSKSQLNSQLDGWLAAVDTPVSCIGPQSSQEVFPDVPVPSARVIIAPH 66

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AGYSYSG AAA+A+ + D +   RVFLLGPSHHYY  K ALS  T Y TPIG+L +D + 
Sbjct: 67  AGYSYSGPAAAWAYKSWDVSKAERVFLLGPSHHYYLSKAALSKCTQYATPIGNLTVDRQT 126

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-------LIKIVPILVGAVNAE 160
             +L  + +FE M   VDE EHS+EMHLPY+ K+   H       L  +VPI+VG  +A 
Sbjct: 127 TAKLYTSAQFEWMTQSVDEQEHSLEMHLPYIYKMLAKHFGGNPAGLPPLVPIMVGNTSAS 186

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------I 205
            E   G++ A Y+ DP+N F VSSDF HWG RF Y +Y    G                I
Sbjct: 187 TERALGQILAPYLADPANVFVVSSDFAHWGLRFRYTYYRPSTGSALHLSSSAKSPQSPPI 246

Query: 206 HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK----- 260
           H+SI+ +D   M   E+G    +   L E  NT+CGRHPI V +  +             
Sbjct: 247 HESIKTVDFECMGACESGSHSKWLSTLEETGNTVCGRHPIGVIMAAIEQVRKDSGEQHAG 306

Query: 261 -IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
             KF+RYE+SS+ K   DSSVSYASA A +
Sbjct: 307 LFKFVRYERSSEVKRIGDSSVSYASAFAVI 336


>gi|154289990|ref|XP_001545597.1| hypothetical protein BC1G_15807 [Botryotinia fuckeliana B05.10]
          Length = 323

 Score =  255 bits (652), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 139/323 (43%), Positives = 181/323 (56%), Gaps = 46/323 (14%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
           M   R ASHAGSWYT N  +L+ ELD WL +  +P +        P  R +IAPHAGYSY
Sbjct: 1   MGSTREASHAGSWYTSNPSKLSSELDNWLSQ--VPSTISDTKLPIPGARVIIAPHAGYSY 58

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           SG AAA+A+  +D ++  RVFLLGPSH +Y  +CALS  + Y TP+GDL +D  +++EL 
Sbjct: 59  SGPAAAWAYATLDLSTAKRVFLLGPSHAWYLTECALSKHSKYATPLGDLTIDTALVQELS 118

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYG 166
           ATG+F+ M    DE EHS+E+HLPY+ K+             ++PILVG  NA  E  +G
Sbjct: 119 ATGEFKRMSTDQDETEHSLELHLPYIYKLLSLNFPSPSSFPPLIPILVGNTNAATERKFG 178

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV------------ 204
            + A Y+ DPSN F +SSDFCHWGSRF Y +Y          DK  G             
Sbjct: 179 SILAPYLSDPSNIFIISSDFCHWGSRFQYTYYLPASPSAEVRDKSGGYSLSRRDKAPTQP 238

Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-------S 256
            IH+SI  LD++ M  I  G    F   L E  NT+CGRHPI V +  +           
Sbjct: 239 PIHESIGRLDQLSMSAISGGVHQEFLDNLEETGNTVCGRHPIGVIMAAIEKVKEEKSLDG 298

Query: 257 TKIKIKFLRYEQSSQCKTKRDSS 279
            K K KF+RYE+S++     D S
Sbjct: 299 EKGKFKFVRYERSNEVTRVDDGS 321


>gi|388582149|gb|EIM22455.1| UPF0103-domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 318

 Score =  254 bits (650), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 182/314 (57%), Gaps = 29/314 (9%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
           M+ IR  +HAG+WYTDNS +L ++L  W  +A   G+  +   + +IAPHAG S+SGR A
Sbjct: 1   MQNIREPTHAGAWYTDNSIKLNKQLQHWFDDAISDGIHNNSRSKVIIAPHAGLSFSGRTA 60

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           AYA+   + +   R+ +LGPSHH Y   CALS    Y TP+G L +D E+ +EL +T  F
Sbjct: 61  AYAYSAANISQYKRIIILGPSHHVYLKSCALSPFDAYDTPLGTLSIDKEINDELSSTKAF 120

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
           + M    DE EHS EMH P++ K+   H IKIVPILVG ++ E E  YG+ FAKY++DP 
Sbjct: 121 KRMSKVTDEDEHSFEMHTPFIKKLANEHDIKIVPILVGNISHEREVEYGKYFAKYINDPE 180

Query: 178 NFFSVSSDFCHWGSRFNYMHY-----------------DKKHGVIHKSIEALDKMGMDII 220
             F++SSDF HWGSRF Y +Y                 D+    IH+SI  LD    DI+
Sbjct: 181 TLFAISSDFSHWGSRFRYTYYRPSRNEPGGRLGKFDDVDETGDSIHESISGLDHEAWDIL 240

Query: 221 ETG--DP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCS---TKIKIKFLRYEQSSQ 271
                DP    ++F KYL E  NTICGRH   + L  L + +   T   +K+  Y QSS+
Sbjct: 241 SEALKDPKQTHESFTKYLSETGNTICGRHAFGILLGALHHANRDYTSGSVKWTHYTQSSK 300

Query: 272 CKTKRDSSVSYASA 285
                DSSVSY++ 
Sbjct: 301 ALLPTDSSVSYSAG 314


>gi|398398762|ref|XP_003852838.1| hypothetical protein MYCGRDRAFT_70695 [Zymoseptoria tritici IPO323]
 gi|339472720|gb|EGP87814.1| hypothetical protein MYCGRDRAFT_70695 [Zymoseptoria tritici IPO323]
          Length = 338

 Score =  254 bits (650), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 148/334 (44%), Positives = 183/334 (54%), Gaps = 46/334 (13%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----------------PEVRGVIA 45
           + IR  SHAGSWY+D+S +L  ELDGWL     P +                P  R +IA
Sbjct: 5   QTIRDCSHAGSWYSDSSSKLNAELDGWLAAVDAPVTCIGPRSEGEQVQQLPVPGARMIIA 64

Query: 46  PHAGYSYSGRAAAYAFGNIDPTSISR--VFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
           PHAGYSYSG AAA+A+   D +  +R  VFLLGPSHH+Y  K ALS  T Y TPIG+L +
Sbjct: 65  PHAGYSYSGPAAAWAYKAWDVSKANRKKVFLLGPSHHHYLTKAALSRCTQYATPIGNLTV 124

Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF-------EGHLIKIVPILVGA 156
           D E   EL ATG FE M   VDE EHS+EMHLPY+ K+          H   +VP+++G 
Sbjct: 125 DRETTAELHATGVFEWMSHSVDEQEHSLEMHLPYIYKMLSRTFGEDSAHFPPLVPLMIGN 184

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------ 204
            +   E   GR  A Y+ DPSN F +SSDF HWG RF Y +Y    G             
Sbjct: 185 TSPSTEKALGRRLAPYLADPSNAFVISSDFAHWGLRFRYTYYRPSTGTAVDLTSSSRSPK 244

Query: 205 ---IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-----GNCS 256
              IH SI+ +D   M   E+G  D +   L +  NT+CGRHPI V +  +     G  S
Sbjct: 245 EPAIHDSIKTVDFESMGACESGSHDEWLGQLEDTGNTVCGRHPIGVMMAAVEEVRKGAAS 304

Query: 257 TKI-KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
                 KF+RYE+SS+ K   DSSVSYASA A V
Sbjct: 305 QGTGAFKFVRYERSSEVKRVSDSSVSYASAFACV 338


>gi|56755259|gb|AAW25809.1| SJCHGC02434 protein [Schistosoma japonicum]
          Length = 304

 Score =  253 bits (645), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 121/295 (41%), Positives = 186/295 (63%), Gaps = 9/295 (3%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
            + +R +SH GSWY+ +  +L+ +L  WL   E  +     VR +I PHAGY +SG  AA
Sbjct: 5   FKSVRVSSHCGSWYSADRTQLSSQLSTWLESCENSVLSGYSVRAIIVPHAGYRHSGFCAA 64

Query: 59  YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           +A+  I+P  I R+F+LGPSH       CAL+  + Y+TP  +L +D ++  +LK    F
Sbjct: 65  HAYRQINPDKIERIFILGPSHRLDIGDTCALTCVSEYETPFCNLKIDTDIYSDLKKLSYF 124

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           +++    DEAEHS+EM LP++A + +G      IVP++VG ++ E + ++G+L + Y+ D
Sbjct: 125 KVLTKNQDEAEHSVEMQLPFIAYIMKGKKDQYSIVPVVVGCLSTERQELFGKLLSNYLLD 184

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
             N F +SSDFCHWG RF Y +YDK  G I +SIE LD +G+  I++  P++F +YL ++
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYDKSDGAIWQSIEKLDHLGLGAIQSLKPESFLQYLKKF 244

Query: 236 DNTICGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            NTICGR  I + L M+ +   K    +++K L Y QS++C++  DSSVSYA+ A
Sbjct: 245 SNTICGRRSIGLLLFMIDSIRQKQLFNLELKVLYYTQSNRCQSMEDSSVSYAACA 299


>gi|353229019|emb|CCD75190.1| hypothetical protein Smp_097240.2 [Schistosoma mansoni]
          Length = 304

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 127/292 (43%), Positives = 180/292 (61%), Gaps = 9/292 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           IR +SHAGSWY+ +   L+ +LD WL   E  +     VR +I PHAGY +SG  AA+A+
Sbjct: 8   IRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAAHAY 67

Query: 62  GNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             I+P  I  +F++GPSH       CAL+  + Y+TP  +L ++  V   LK+   F ++
Sbjct: 68  RLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCFSVL 127

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           +   DEAEHS+EM LPY+A + +    +  IVPI+VG+++ E +  +G L + Y+ D  N
Sbjct: 128 EKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLDEKN 187

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCHWG RF Y +Y+K  G I +SIE LD +G++ I +  P +F KYL EY NT
Sbjct: 188 LFVISSDFCHWGKRFRYQYYNKSDGAIWQSIEKLDHLGLNAIRSLKPKSFIKYLEEYSNT 247

Query: 239 ICGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ICGR  I V L M+     K    +++K L Y QSS C++  DSSVSYA+ A
Sbjct: 248 ICGRRSIGVLLFMIDCIRQKQPSNLELKILNYTQSSHCQSMEDSSVSYAACA 299


>gi|353229018|emb|CCD75189.1| hypothetical protein Smp_097240.1 [Schistosoma mansoni]
          Length = 303

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 127/291 (43%), Positives = 179/291 (61%), Gaps = 8/291 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           IR +SHAGSWY+ +   L+ +LD WL   E  +     VR +I PHAGY +SG  AA+A+
Sbjct: 8   IRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAAHAY 67

Query: 62  GNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             I+P  I  +F++GPSH       CAL+  + Y+TP  +L ++  V   LK+   F ++
Sbjct: 68  RLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCFSVL 127

Query: 121 DICVDEAEHSMEMHLPYLAKVFE-GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           +   DEAEHS+EM LPY+A + +      IVPI+VG+++ E +  +G L + Y+ D  N 
Sbjct: 128 EKSQDEAEHSVEMQLPYIAHIMKRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLDEKNL 187

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           F +SSDFCHWG RF Y +Y+K  G I +SIE LD +G++ I +  P +F KYL EY NTI
Sbjct: 188 FVISSDFCHWGKRFRYQYYNKSDGAIWQSIEKLDHLGLNAIRSLKPKSFIKYLEEYSNTI 247

Query: 240 CGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           CGR  I V L M+     K    +++K L Y QSS C++  DSSVSYA+ A
Sbjct: 248 CGRRSIGVLLFMIDCIRQKQPSNLELKILNYTQSSHCQSMEDSSVSYAACA 298


>gi|452821363|gb|EME28394.1| hypothetical protein Gasu_40900 [Galdieria sulphuraria]
          Length = 312

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 127/291 (43%), Positives = 180/291 (61%), Gaps = 14/291 (4%)

Query: 9   HAGSWYTDNSK---RLAEELDGWLREA-----GLPKSPE--VRGVIAPHAGYSYSGRAAA 58
           + G+WY ++ K    L  +LD WL++A      L +     ++ VI PHAGYSYSG  AA
Sbjct: 16  YEGTWYGEDEKDSIALKHQLDLWLQKAKENVGNLEQYSNGLLKAVIVPHAGYSYSGATAA 75

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +++  IDP+   RVF+LGPSH+Y    C ++     +TP G L +D++  E L AT KF 
Sbjct: 76  HSYARIDPSLYDRVFILGPSHYYRMSDCRVTPFERLETPFGPLVIDVQCCESLVATKKFT 135

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            +D  VD  EHS+E+  P+L  VF    +++V I+VG ++   +  YG+L A Y+ DP  
Sbjct: 136 FLDKGVDTQEHSLELQFPFLKAVFGSSNVQLVNIMVGVLDKAAKEEYGKLLAPYMKDPRT 195

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCHWG RF Y ++D++ G I KSIE LD++GM  IE      F  YL +Y NT
Sbjct: 196 LFVISSDFCHWGKRFGYQYHDEEDGEIFKSIEKLDRLGMTAIEHQSAKEFSSYLDKYQNT 255

Query: 239 ICGRHPISVFLHMLGNC----STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           ICGR+PI V LHM+       S +++  F+RY+QS +CKT +D SVSYA+A
Sbjct: 256 ICGRNPIGVLLHMIEYLKRTESMQLETLFVRYDQSMKCKTMQDLSVSYAAA 306


>gi|145514095|ref|XP_001442958.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410319|emb|CAK75561.1| unnamed protein product [Paramecium tetraurelia]
          Length = 294

 Score =  248 bits (632), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 123/285 (43%), Positives = 178/285 (62%), Gaps = 4/285 (1%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +KIR A+HAGSWY  + K+L  +L+ +L +A     P ++ +I PHAG+SYSG  AA+A+
Sbjct: 6   QKIREATHAGSWYIGDRKQLDAQLNDFLSKAKAETIPNIKAIIGPHAGFSYSGPTAAFAY 65

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--FEL 119
            ++      +VFLLGP HH Y     LS   +Y+TP+G++ LD   I++L A  K  F  
Sbjct: 66  QHLVQKEGMKVFLLGPCHHTYIKGIGLSELEIYETPLGNIELDQPTIKQLSAELKKNFIF 125

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
            +  V+E EHS+EMHLP++ KVF     K++PI+VGA   + +A    +  KY  DP+  
Sbjct: 126 TNKEVEEEEHSLEMHLPFIYKVFPK--CKLIPIMVGATTEQQDAQVASVLVKYFVDPNTV 183

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           F +SSDFCHWG  F Y  Y+K+HG IH+SI  LD   + +IE+ +   F KYL +  NTI
Sbjct: 184 FVISSDFCHWGKSFRYTPYNKEHGEIHQSITQLDGQAIKLIESHNIPEFYKYLEDTKNTI 243

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           CGRHPI V L+++     +++ +  +Y QS+Q     DSSVSYA+
Sbjct: 244 CGRHPICVLLNIINLSKLQLRTQLTKYNQSNQVTKPNDSSVSYAA 288


>gi|387219389|gb|AFJ69403.1| protein memo1 [Nannochloropsis gaditana CCMP526]
          Length = 254

 Score =  246 bits (629), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 123/233 (52%), Positives = 159/233 (68%), Gaps = 8/233 (3%)

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           ++DP+ I RVF+LGPSHH Y   CA+S A+V +TP+G L +D  V  EL A   F+ M  
Sbjct: 2   HLDPSVIQRVFILGPSHHVYLRACAVSGASVCETPVGPLSVDEAVRAELLADPLFQTMAQ 61

Query: 123 CVDEAEHSMEMHLPYLAK-VFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
            VDE EHS+EMHLPY+A  V E   + IVPILVG+++   EA+YGR+ AKY+ DP+N F 
Sbjct: 62  RVDEDEHSIEMHLPYVAHLVGEVQGVSIVPILVGSLDPTQEAVYGRILAKYLSDPANLFI 121

Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           VS+DFCHWG RF Y  +D+ +G  IH+ IE LD+ GM +IE GDP AF  YL +Y NTIC
Sbjct: 122 VSTDFCHWGRRFQYQPFDRAYGEDIHEYIEWLDRQGMALIEQGDPQAFTTYLRQYRNTIC 181

Query: 241 GRHPISVFLHML------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
           GRHP+ VFL         G     ++++F+RY QSS+C  + DSSVSYASA A
Sbjct: 182 GRHPLGVFLQAAQALREGGKEGLDVRVEFVRYAQSSRCMHETDSSVSYASAVA 234


>gi|171680871|ref|XP_001905380.1| hypothetical protein [Podospora anserina S mat+]
 gi|27764330|emb|CAD60610.1| unnamed protein product [Podospora anserina]
 gi|170940063|emb|CAP65290.1| unnamed protein product [Podospora anserina S mat+]
          Length = 335

 Score =  246 bits (629), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 143/333 (42%), Positives = 186/333 (55%), Gaps = 49/333 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R ASHAGSWY D+ K L+ +LDG+L          GLP  P  R +IAPHAGYSYSG  A
Sbjct: 4   RDASHAGSWYDDDEKELSSQLDGFLSRVPDQLDDHGLP-VPGARVIIAPHAGYSYSGPCA 62

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+  +D  +  RVF+LGPSH YY   CAL+T + Y TP GDL +D   I EL+ TGKF
Sbjct: 63  AWAYKALDLRAAKRVFILGPSHTYYLRGCALTTFSKYATPFGDLVVDRNTINELRETGKF 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGRLFA 170
             +    D  EHS+EMH+P+L K  +            IVPILVG  +AE E  +G+L +
Sbjct: 123 TDIPARRDVDEHSLEMHVPFLWKRLQQTFGDDSTKYPPIVPILVGDGSAEEEKAFGKLLS 182

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNY-----------MHYDKKHGV--------------- 204
            Y+ DP+  + VSSDFCHWGSRF+Y           M   +  GV               
Sbjct: 183 SYLKDPTTAWIVSSDFCHWGSRFSYRPHFSDGAIRDMDAPRSKGVRHEVLNVTPPDWSKL 242

Query: 205 --------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS 256
                   IH  I+ LD++ MD +E+G+   F K + +  NT+CGRHPI V +  L    
Sbjct: 243 GISSGEPEIHDVIKVLDQLAMDAVESGEHSEFYKVVQDSHNTVCGRHPIGVIMAALEAVD 302

Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            K K KF++Y++S+  K   D SVSYASA A V
Sbjct: 303 GKGKFKFVQYQRSNLVKKSFDFSVSYASAYAVV 335


>gi|296415955|ref|XP_002837648.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633527|emb|CAZ81839.1| unnamed protein product [Tuber melanosporum]
          Length = 329

 Score =  246 bits (628), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 136/321 (42%), Positives = 185/321 (57%), Gaps = 40/321 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSP--EVRGVIAPHAGYSYSGRAA 57
           R A+HA SWY  + ++L  +L+ W++     + G+P+ P    R ++APHAGYSYSG  A
Sbjct: 4   RIATHANSWYVADKRKLDNQLNAWIKIAGSEDCGIPEFPVSNARIIMAPHAGYSYSGATA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
            +A+   D +  +R+F+LGP+HH Y   C++S    Y TP+G+L LD E++ EL +TGKF
Sbjct: 64  GFAYKAWDLSQATRIFILGPAHHVYINGCSISGCDSYDTPLGELTLDSELLRELASTGKF 123

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE----GHL-IKIVPILVGAVNAENEAMYGRLFAKY 172
             + + VDE EHS+EMHLPY+ K  E     H  +KIVPI+VG+++   E  YG L   Y
Sbjct: 124 TWLSLKVDEDEHSIEMHLPYIYKAIERAGKSHKDVKIVPIVVGSISTAQEKAYGTLLKPY 183

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYD----------------KKHGV------IHKSIE 210
           ++DP N F +S+DFCHWG+RF+Y  Y                 K+HGV      +HKSI 
Sbjct: 184 LEDPQNAFVISTDFCHWGARFSYTGYFRELPTHDNPNPEFIHLKEHGVQAIRHPLHKSIR 243

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH----MLG--NCSTKIKIKFL 264
           ALD  GM  I TG    F  YL +  NTICGRHPI V +     M G    S   K ++L
Sbjct: 244 ALDHEGMVAIATGKHSEFSSYLSKTKNTICGRHPIGVIMAAAELMFGENKGSNLGKFRYL 303

Query: 265 RYEQSSQCKTKRDSSVSYASA 285
            Y QS   +    SSVSY S 
Sbjct: 304 YYAQSEALEDVNGSSVSYVSG 324


>gi|395731964|ref|XP_002812197.2| PREDICTED: protein MEMO1-like, partial [Pongo abelii]
          Length = 240

 Score =  246 bits (628), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 122/234 (52%), Positives = 155/234 (66%), Gaps = 7/234 (2%)

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           F  + P+   R+F+LGPSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M
Sbjct: 8   FCCVSPSR--RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERM 65

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            +  DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN
Sbjct: 66  SLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSN 125

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F VSSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NT
Sbjct: 126 LFVVSSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNT 185

Query: 239 ICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           ICGRHPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 186 ICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 239


>gi|389644230|ref|XP_003719747.1| hypothetical protein MGG_04091 [Magnaporthe oryzae 70-15]
 gi|351639516|gb|EHA47380.1| hypothetical protein MGG_04091 [Magnaporthe oryzae 70-15]
 gi|440466506|gb|ELQ35770.1| hypothetical protein OOU_Y34scaffold00689g3 [Magnaporthe oryzae
           Y34]
 gi|440477091|gb|ELQ58235.1| hypothetical protein OOW_P131scaffold01683g71 [Magnaporthe oryzae
           P131]
          Length = 338

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 146/340 (42%), Positives = 188/340 (55%), Gaps = 53/340 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWL-------REAGLP-KSPEVRGVIAPHAGYSY 52
           M  IRRASHAGSWYT +S +L  ELD WL        E+ +P K   V  +IAPHAGYSY
Sbjct: 1   MSGIRRASHAGSWYTHSSSKLNTELDEWLSEVPDKINESDVPIKGASV--IIAPHAGYSY 58

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           SG  AA+A+   D ++  R+F+LGPSH YY   CAL+T   Y+TP+G+L +DL+ I++L+
Sbjct: 59  SGPCAAWAYKVFDVSAAKRIFVLGPSHTYYLSGCALTTYATYETPLGNLRVDLDTIKQLR 118

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK------------IVPILVGAVNAE 160
            TGKF+ +    DE EHS+EMHLPYLAK                    +VPIL+G    +
Sbjct: 119 DTGKFKDIPRDNDEDEHSLEMHLPYLAKRLTQTFGGGSDGDGDASWPPVVPILIGDNKRD 178

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------- 204
            E  +G L   ++ DP N F VSSDFCHWG+RF+Y  Y     V                
Sbjct: 179 AEKAFGELLLPHLRDPDNAFIVSSDFCHWGNRFSYTKYTADGTVEGVRSLGRADRNLPVP 238

Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-----------G 253
           IH+ I  LD + MD IETG  DAF   L    NT+CGRHPI V +  L           G
Sbjct: 239 IHEGIRVLDHLAMDAIETGSHDAFYDNLKATGNTVCGRHPIGVVMAALEMLKKERAGGDG 298

Query: 254 NCSTKIK----IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
             S   K      F++Y++SS  +   DSSVSYA+A A +
Sbjct: 299 AASGSKKGDGVFTFVKYDRSSLVENVSDSSVSYAAAYAVI 338


>gi|156035813|ref|XP_001586018.1| hypothetical protein SS1G_13111 [Sclerotinia sclerotiorum 1980]
 gi|154698515|gb|EDN98253.1| hypothetical protein SS1G_13111 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 335

 Score =  245 bits (626), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 138/332 (41%), Positives = 181/332 (54%), Gaps = 46/332 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
           M  IR ASHAGSWYT N   L+ +LD WL +  +P S        P  R +IAPHAGYSY
Sbjct: 1   MGSIREASHAGSWYTSNPSTLSTQLDEWLSQ--VPSSIDDTKIPIPGARVIIAPHAGYSY 58

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           SG AAA+A+ ++D +S  R+FLLGPSH +Y  +CALS  + Y TP+GDL +D   + EL 
Sbjct: 59  SGPAAAWAYKSLDLSSAKRIFLLGPSHAWYLSECALSKHSTYATPLGDLRIDRATVSELA 118

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYG 166
            TG+F+ M+   DE EHS+EMHLPY+ K+             ++PILVG  NA  E  +G
Sbjct: 119 QTGEFKTMNSDQDETEHSLEMHLPYIYKMLSLNFDSPSDFPPLIPILVGNTNASTEKKFG 178

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV------------ 204
            + A Y+ D ++ F VSSDFCHWG RF Y +Y          D   G             
Sbjct: 179 AILAPYLADETSVFIVSSDFCHWGPRFQYTYYLPSDSSTDANDTSTGYSLSRWDKSPTQP 238

Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-------S 256
            IH+SI  LDK+ M+ I  G    F   L E  NT+CGRHPI + +  +          S
Sbjct: 239 PIHESIGKLDKLSMEAISGGKHQEFLNNLEETGNTVCGRHPIGITMAAIEKLKEDKLLQS 298

Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            K K KF+RYE+S++     D S    S   +
Sbjct: 299 DKGKFKFVRYERSNEVVKVDDGSFELYSKTTR 330


>gi|134076429|emb|CAK39657.1| unnamed protein product [Aspergillus niger]
          Length = 315

 Score =  243 bits (619), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 135/311 (43%), Positives = 176/311 (56%), Gaps = 33/311 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R  SHAGSWY+D+   L+ +LD WL+E        G    P  R +IAPHAGY+YSGR A
Sbjct: 4   REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
           A+A+  +D +   R+F++GPSHH+Y    AL   T Y TP+ D  LPLD E I +L++T 
Sbjct: 64  AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTR 123

Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
                    +F  M   VDEAEHS+E+HLPY+ ++ +             +VPILVGAV 
Sbjct: 124 AGSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSEYPPLVPILVGAVT 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
              E  +G L A Y+DDP N F +SSDFCHWG RF Y     +   IH+SI A+D   M 
Sbjct: 184 ESTEKAFGALLAPYIDDPENAFVISSDFCHWGQRFRYT---SREPPIHESISAVDLATMA 240

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKT 274
            I TG+   F   L    NT+CGRHPI V +     +  N   K +  F+RY++SS    
Sbjct: 241 AITTGEYARFSTILKNTGNTVCGRHPIGVIMAGIEEIRKNEGEKGRFHFIRYDRSSNVIE 300

Query: 275 KRDSSVSYASA 285
             DSSVSY SA
Sbjct: 301 VDDSSVSYVSA 311


>gi|426202099|gb|EKV52022.1| hypothetical protein AGABI2DRAFT_239501 [Agaricus bisporus var.
           bisporus H97]
          Length = 333

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 132/321 (41%), Positives = 176/321 (54%), Gaps = 35/321 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGYSYSGRAA 57
            R A+HAGSWYT N+ +L  +L GWL      +  E       + +IAPHAGYS+ G   
Sbjct: 13  FRGAAHAGSWYTSNAPQLTADLSGWLSLVRPRREDEEFPVQGCKAIIAPHAGYSHCGETG 72

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+ +IDP++  RVF++GPSH +    CALS    Y TPIG LP+D +V+  L  TG F
Sbjct: 73  AWAYKSIDPSTTRRVFVIGPSHFWRIDNCALSRCHTYDTPIGTLPVDTDVVVSLYETGLF 132

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             M++  DE EHS+EM LPYL +V EG  IKIVPI+VG +  E E  YGR+ A Y+ +  
Sbjct: 133 RWMNLTRDEREHSIEMQLPYLRRVCEGKDIKIVPIMVGDITKEQELEYGRILAPYLAEEG 192

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGV-------------------IHKSIEALDKMGMD 218
             F  SSDFCHWG  + Y  Y  K                      I +S   LD +GM 
Sbjct: 193 TVFIASSDFCHWGPEYYYTFYFPKPNCLPSEGYQATRATPPEKGYKIWQSTTQLDHIGMG 252

Query: 219 IIETGDPDA------FKKYLLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQ 268
           I+   D  A      F +YL+E DN+ICGRH I +      H+      K    +++Y+Q
Sbjct: 253 ILADQDRSALTAHRDFHRYLVETDNSICGRHAIGILYGALAHLEETTGRKKFCNWVKYDQ 312

Query: 269 SSQCKTKRDSSVSYASAAAKV 289
           S+Q     D+SVSYASA  K+
Sbjct: 313 SAQVTKITDNSVSYASAWIKI 333


>gi|19115252|ref|NP_594340.1| UPF0103 family [Schizosaccharomyces pombe 972h-]
 gi|1723280|sp|Q10212.1|YAY4_SCHPO RecName: Full=MEMO1 family protein C4H3.04c
 gi|1184017|emb|CAA93343.1| UPF0103 family [Schizosaccharomyces pombe]
          Length = 309

 Score =  240 bits (613), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 128/311 (41%), Positives = 188/311 (60%), Gaps = 29/311 (9%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAY 59
           + IR A+HAGSWY D+++ L ++L  +++      +PE   R VI+PHAGY YSG+ A+ 
Sbjct: 3   QAIREATHAGSWYLDDTELLTKQLKSFIKNP----TPETGKRFVISPHAGYMYSGKVASQ 58

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT-GKFE 118
            F  +D + I RVF+ GPSHH +T KC +S A++  TP+GDL +D ++ ++L A+   F+
Sbjct: 59  GFQQLDFSKIQRVFVFGPSHHIFTRKCLVSRASICSTPLGDLKVDEDLCQKLVASDNSFD 118

Query: 119 LMDICVDEAEHSMEMHLPYLA--KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD 174
            M + VDE+EHS+EM  P LA   + +G L  +KIVPI++GA+ +       +  ++Y+ 
Sbjct: 119 SMTLDVDESEHSLEMQFPLLAFHLLKQGCLGKVKIVPIMIGALTSTTMMAAAKFLSQYIK 178

Query: 175 DPSNFFSVSSDFCHWGSRFNY--------------MHYDKKHG----VIHKSIEALDKMG 216
           D SN F +SSDFCHWG RF Y              + Y ++ G     I++SI  LD +G
Sbjct: 179 DESNSFVISSDFCHWGRRFGYTLYLNDTNQLEDAVLKYKRRGGPTSPKIYESISNLDHIG 238

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
           M IIET   D F +YL    NTICGR+PI + +  +   +   + KF+ Y QSS  +   
Sbjct: 239 MKIIETKSSDDFSEYLKTTQNTICGRYPIELIMKSMECANFSERFKFISYAQSSHVELVT 298

Query: 277 DSSVSYASAAA 287
           DSSVSYA+A A
Sbjct: 299 DSSVSYATATA 309


>gi|367032250|ref|XP_003665408.1| hypothetical protein MYCTH_2309073 [Myceliophthora thermophila ATCC
           42464]
 gi|347012679|gb|AEO60163.1| hypothetical protein MYCTH_2309073 [Myceliophthora thermophila ATCC
           42464]
          Length = 335

 Score =  240 bits (612), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 140/333 (42%), Positives = 183/333 (54%), Gaps = 49/333 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLRE-------AGLPKSPEVRGVIAPHAGYSYSGRAA 57
           R ASHAGSWY D+ + L+ +LD +L           LP  P  R +IAPHAGYSYSG  A
Sbjct: 4   REASHAGSWYEDDPEELSSQLDDFLHRVPAELDFTPLP-IPGARVIIAPHAGYSYSGPCA 62

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+ ++D  S  RVF+LGPSH YY   CAL+T   Y+TP GDL +D     EL+ TG+F
Sbjct: 63  AWAYKSLDLRSAKRVFILGPSHTYYLRGCALTTFDKYETPFGDLVVDKPTTSELRKTGRF 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGRLFA 170
             M    +  EHS+EMH+PYL K  E            IVPILVG  + + E  +G L +
Sbjct: 123 SDMPARREVEEHSLEMHIPYLWKRLEQTFGTDVSKYPSIVPILVGNASEQEEKSWGELLS 182

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNY---------------------------MHYDKKHG 203
            Y+ DP   + VSSDFCHWGSRF+Y                              D +  
Sbjct: 183 PYLKDPETAWIVSSDFCHWGSRFSYRPEFHKGVVRDLDNKDDDLKVSPDWAQAAADPERP 242

Query: 204 VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-------GNCS 256
            IH+ I+ LD+M MD +E+G    F K + E  NT+CGRHPI V +  L       G  +
Sbjct: 243 EIHEVIKVLDQMAMDAVESGAHSEFYKVIRETRNTVCGRHPIGVTMAALETIAKEGGTNA 302

Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            K K +F++Y++S+  K +RD SVSYASA A V
Sbjct: 303 GKGKFRFVQYQRSNLVKKERDFSVSYASAYAVV 335


>gi|223999071|ref|XP_002289208.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974416|gb|EED92745.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 311

 Score =  240 bits (612), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 126/307 (41%), Positives = 185/307 (60%), Gaps = 24/307 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF-- 61
           +RRA HAGSWY++    L E L  +L +A +  S   +  I+PHAG+ YSG  AAY++  
Sbjct: 5   VRRAHHAGSWYSEEPNELDEALANYLAKAMVASS---KTSISPHAGFQYSGPTAAYSYLA 61

Query: 62  ---GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
                I   S+  V +L PSHH Y   CA+S A+  +TP+GDL +D  + E++ +TGKF+
Sbjct: 62  LKEALISNPSLRTVVVLHPSHHIYLDGCAVSGASTLETPLGDLVVDNNLREQMLSTGKFD 121

Query: 119 LMDICVDEAEHSMEMHLPYLAKVF----------EGHLIKIVPILVGAVNAENEAMYGRL 168
           +M+  VDE EHS EM  P++AKV           +   IK++PI++G++  + E ++G+L
Sbjct: 122 VMEKRVDEQEHSGEMQYPFIAKVINDVKKHTPEHDNLSIKVLPIMIGSIKQKKEELFGKL 181

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH--GVIHKSIEALDKMGMDIIETGDPD 226
            A ++ D   F  +SSDFCHWG RF+Y     +     +H+ IE LD+ GM +IE   P 
Sbjct: 182 LAPFLSDDGVFTVISSDFCHWGKRFSYTPQPSREVANEVHEYIEYLDRKGMSLIEMQRPG 241

Query: 227 AFKKYLLEYDNTICGRHPISVFLHML----GNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
           AF  YL EY NTICGRHPI+V+L+ +         ++ ++F +Y QS +  + RD SVSY
Sbjct: 242 AFADYLREYSNTICGRHPIAVWLNTVVTNREEGKERLDVRFAKYAQSGKATSLRDFSVSY 301

Query: 283 ASAAAKV 289
           ASA A+ 
Sbjct: 302 ASAVART 308


>gi|426335211|ref|XP_004029126.1| PREDICTED: protein MEMO1 [Gorilla gorilla gorilla]
          Length = 330

 Score =  239 bits (611), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 127/284 (44%), Positives = 168/284 (59%), Gaps = 26/284 (9%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LGPSH
Sbjct: 58  QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 116

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 117 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 176

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 177 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 236

Query: 198 YDKKHGVIHKSIEALDKM-GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM----- 251
           YD+  G I++SIE LDKM  +  I   D    K + +             VFL M     
Sbjct: 237 YDESQGEIYRSIEHLDKMVTIQKIILRDTSVLKIFKVN-----------KVFLDMVFSNI 285

Query: 252 ------LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
                 L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 286 INAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 329


>gi|453089768|gb|EMF17808.1| UPF0103-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 336

 Score =  238 bits (607), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 134/332 (40%), Positives = 184/332 (55%), Gaps = 44/332 (13%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----------------PEVRGVIA 45
           + +R ASHAGSWY+ +  +L  +L+GWL     P S                P  R +IA
Sbjct: 5   QTVRHASHAGSWYSSSKSQLDGQLEGWLSAVKPPVSCIGPQSAGQTYSELPVPGARVIIA 64

Query: 46  PHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL 105
           PHAGYSYSG AAA+A+   D +   R+FLLGPSHHY+  K ALS  T Y TP+G+L +D 
Sbjct: 65  PHAGYSYSGPAAAWAYQAWDVSKAKRIFLLGPSHHYHLTKAALSRCTHYATPLGNLVVDK 124

Query: 106 EVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVGAVN 158
           +   +L A+G+F+ M   VDE EHS+EMHLPY+ K+              +VPI+VG  +
Sbjct: 125 DTTAKLHASGQFQWMSQSVDEDEHSLEMHLPYIYKMLSKTFGDDPTEFPPLVPIMVGNTS 184

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------- 203
              E   GR+ A+++ DP+N F +SSDF HWGSRF Y +Y    G               
Sbjct: 185 QSTERSLGRILAEHLADPTNAFVISSDFAHWGSRFRYTYYKPSTGSAVDLKSSGPVPTSP 244

Query: 204 VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML------GNCST 257
            IH+SI+A+D   M+  E+G  + +   L +  NT+CGRHPI V +  +      G    
Sbjct: 245 AIHESIKAVDFECMEACESGSHERWLHALEDTGNTVCGRHPIGVIMAAVEEVRKSGGKEG 304

Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
               KF++Y++SS      DSSVSYASA A +
Sbjct: 305 IGAFKFVQYDRSSLVIRVSDSSVSYASAFAVI 336


>gi|367047497|ref|XP_003654128.1| hypothetical protein THITE_2116866 [Thielavia terrestris NRRL 8126]
 gi|347001391|gb|AEO67792.1| hypothetical protein THITE_2116866 [Thielavia terrestris NRRL 8126]
          Length = 341

 Score =  237 bits (604), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 142/339 (41%), Positives = 182/339 (53%), Gaps = 55/339 (16%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R ASHAGSWY D++  L+ +LD +L           LP  P  R +IAPHAGYSYSG  A
Sbjct: 4   REASHAGSWYDDDADVLSSQLDDFLNRVPATLNNNNLP-IPGARVIIAPHAGYSYSGPCA 62

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+  +D  S  RVF+LGPSH YY   CAL+T   Y+TP GDL +D     ELK TG+F
Sbjct: 63  AWAYKALDLRSAKRVFILGPSHTYYLRGCALTTFDKYETPFGDLVVDKATTSELKRTGRF 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGRLFA 170
             M    D  EHS+EMH+PYL K  E            IVPILVG  + E E  +G L +
Sbjct: 123 SNMPPGRDVDEHSLEMHIPYLWKRLEQTFGSDSTTYPPIVPILVGDGSEEEEKSFGELLS 182

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNY---------------------------------MH 197
           +Y+ DP   + VSSDFCHWGSRF+Y                                 + 
Sbjct: 183 RYLKDPETAWIVSSDFCHWGSRFSYRPEFHQGSIRNLDAPRSQGSKRDVVQAAPDWSKVA 242

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----- 252
            D     IH+ I+ALD+M MD +E+G    F K + +  NT+CGRHPI V +  L     
Sbjct: 243 EDPGAPQIHEVIKALDQMAMDAVESGTHSEFYKVIQDTHNTVCGRHPIGVIMAALELVAK 302

Query: 253 --GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
             G    K K +F++Y++S+  K   DSSVSYASA A +
Sbjct: 303 EQGLQEGKGKFRFIQYQRSNLVKKHWDSSVSYASAYAVI 341


>gi|213403404|ref|XP_002172474.1| MEMO1 [Schizosaccharomyces japonicus yFS275]
 gi|212000521|gb|EEB06181.1| MEMO1 [Schizosaccharomyces japonicus yFS275]
          Length = 309

 Score =  234 bits (598), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 124/305 (40%), Positives = 178/305 (58%), Gaps = 22/305 (7%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
            +R  +HAG+WYTD+   L ++LDG++  AG    P  R +I+PHAGY YSG  A   +G
Sbjct: 7   NVRPLTHAGTWYTDDFSALKKQLDGFM--AGTQIVPGTRMIISPHAGYLYSGPTAGKCYG 64

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-FELMD 121
           ++D +   RVF+LGPSHH YT  C +S+   Y T +G+LP+D EV + L ++ K  +L+ 
Sbjct: 65  SLDFSHCKRVFVLGPSHHMYTRDCLVSSFDAYATHLGNLPIDREVCDFLLSSSKHVQLLS 124

Query: 122 ICVDEAEHSMEMHLPYLAKVFEG----HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
              DEAEHS+EM +P LA+  +       + IVPI+VGA++  +E  +G   A +V +  
Sbjct: 125 ENADEAEHSLEMQIPILAQALQNQNALQHVSIVPIMVGALSHSHEQAFGETLAPFVANEE 184

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHG---------------VIHKSIEALDKMGMDIIET 222
           N F +SSDFCHWG RF Y  Y    G                I +S+  LD++ M  IE 
Sbjct: 185 NVFIISSDFCHWGLRFGYTGYLNNDGKFEMLAASSHSPRAPKIFQSVHQLDRLVMSAIEE 244

Query: 223 GDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
           GD + F + L +  NT+CG+HPI+V L  L   +   + +F+ Y+QSSQ     DSSVSY
Sbjct: 245 GDYNHFVRILSKTKNTVCGQHPIAVILAALTKANIDKRFRFIAYDQSSQVVDIEDSSVSY 304

Query: 283 ASAAA 287
           A+  A
Sbjct: 305 AAGLA 309


>gi|340931817|gb|EGS19350.1| hypothetical protein CTHT_0048080 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 337

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 139/335 (41%), Positives = 185/335 (55%), Gaps = 51/335 (15%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
           R ASHAGSWY D+   L+ +LD +L        ++ LP  P  R +IAPHAGYSYSG  A
Sbjct: 4   REASHAGSWYEDDPSVLSSQLDEFLGRVPASLNDSALP-IPGARVIIAPHAGYSYSGPCA 62

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+  +D ++  RVF+LGPSH YY   CAL+T   Y+TP GDL +D     ELK TG+F
Sbjct: 63  AWAYKTLDLSNAKRVFILGPSHTYYLRGCALTTFDKYETPFGDLIVDKATTAELKRTGRF 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVGAVNAENEAMYGRLFA 170
             M    D  EHS+EMH+PYL K  E            I+PILVG    E E  YG+L A
Sbjct: 123 SDMPARRDVDEHSLEMHIPYLWKRLEQTFGTDSAKYPPIIPILVGDGTEEQEQAYGQLLA 182

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYM------------------HYDKKHGV-------- 204
           +Y+ DP+  + +SSDFCHWGSRF+Y                   H + + G         
Sbjct: 183 QYLKDPTTAWIISSDFCHWGSRFSYRPVFDNGKIRNLDADRKDKHLELRPGWEQLVDNPD 242

Query: 205 ---IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST---- 257
              IH+ IE LDKM M+ +E+G    F   + +  NT+CGRHPI V +  L   +     
Sbjct: 243 NMKIHEVIEMLDKMAMNAVESGAHHKFYHVIQQTQNTVCGRHPIGVIMAALEVLTKDQRP 302

Query: 258 ---KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              + K+KF++Y++S+  +   D SVSYASA A V
Sbjct: 303 EDGRGKLKFVQYQRSNLVEKVWDFSVSYASAYAIV 337


>gi|321258823|ref|XP_003194132.1| hypothetical protein CGB_E1360W [Cryptococcus gattii WM276]
 gi|317460603|gb|ADV22345.1| conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 346

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 143/344 (41%), Positives = 184/344 (53%), Gaps = 55/344 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-------PKSPEVRGVIAPHAGYSYS 53
           M  +R A+HAGSWYT +   L  +L   L            P     + +IAPHAGYSYS
Sbjct: 1   MSGVREATHAGSWYTSSRSGLHNQLSQNLSAVKPISTLDYDPPVSNAKAIIAPHAGYSYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G AAA+A+  +    I RVFLLGPSHH Y P  ALS    Y+TP+GD+PLD++ I EL+A
Sbjct: 61  GPAAAWAYAAVPTEKIKRVFLLGPSHHAYLPGVALSKFETYETPLGDIPLDIDTINELRA 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGAVNAENEAMYGRLFAKY 172
           T  F  M    DE EHS+EMHLPY+  +F+G + +K+VPILVG  NA   A      AKY
Sbjct: 121 TRIFSDMKSSTDEDEHSLEMHLPYIRLIFQGRNDLKLVPILVGHPNASTSAKLSEALAKY 180

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY------------------DKKHGV---------- 204
             D   FF +SSDFCHWGSRF+   Y                      G+          
Sbjct: 181 WQDDETFFVISSDFCHWGSRFSCTPYYPNIPPLVNPVPPVKSSTSSTSGILSQPPELVKK 240

Query: 205 -----------IHKSIEALDKMGMDII----ETGDPDAFKKYLLEYDNTICGRHPISVFL 249
                      I KSIE +D  GMD++    E G  + +  YL    NTICGR+PI+V L
Sbjct: 241 FSSATSNPDVPIWKSIEYMDHEGMDLLRKPGEDGAVEKWHGYLERTKNTICGRNPITVLL 300

Query: 250 HMLG----NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +++     +   K +  F+RYEQSS+C T +DSSVSY S   +V
Sbjct: 301 NLVQFVYKDQQVKPEFIFVRYEQSSRCVTGKDSSVSYVSGVLRV 344


>gi|317140845|ref|XP_001818439.2| hypothetical protein AOR_1_2650174 [Aspergillus oryzae RIB40]
          Length = 366

 Score =  234 bits (596), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 139/363 (38%), Positives = 182/363 (50%), Gaps = 82/363 (22%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
           R ASHAGSWY+DN + L  +LDGWL  A +P S         P  R +IAPHAGYSYSG 
Sbjct: 4   REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPHAGYSYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKA 113
            AAYA+  +D +   R+F+LGPSHH+Y    AL   T Y TP+ D  LPLD E+I +L +
Sbjct: 62  CAAYAYKALDLSKAKRIFILGPSHHHYLSTLALPELTSYYTPLSDEPLPLDTELITKLLS 121

Query: 114 TG---------KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--------IVPILVGA 156
           T           F  M   +DE EHS+E+HLPY+ ++ + H           +VPI+VG+
Sbjct: 122 TKAVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTSQYPPLVPIMVGS 181

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------ 198
            +A  E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y                  
Sbjct: 182 TSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQAPKPGPKLPLSANSL 241

Query: 199 -----------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
                                          +   IH+SI A D   M  I TG    F 
Sbjct: 242 PQPGDDLSSVEEKMESVSTGQALQRRDRISSREPAIHESISAFDIATMAAISTGATANFL 301

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           + +    NT+CGRHPI V +  +   ++     K K  F+RYE+SS      DSSVSY S
Sbjct: 302 ETIQTTGNTVCGRHPIGVIMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSVSYVS 361

Query: 285 AAA 287
           A A
Sbjct: 362 AFA 364


>gi|320589421|gb|EFX01882.1| duf52 domain containing protein [Grosmannia clavigera kw1407]
          Length = 336

 Score =  234 bits (596), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 137/333 (41%), Positives = 178/333 (53%), Gaps = 47/333 (14%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG-------LPKSPEVRGVIAPHAGYSYS 53
           +   R A HAGSWYT + + L+EEL+GWL EA        LP +   R VIAPHAGY YS
Sbjct: 3   LRGTRPAEHAGSWYTADPRNLSEELNGWLEEASCAINGSLLPVT-NARAVIAPHAGYRYS 61

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G  AA+A+  +D     RVF+LGPSH YY   CAL+T   Y TP GD  +D +V   L+ 
Sbjct: 62  GPCAAWAYKCLDLNKARRVFVLGPSHTYYLRGCALTTFEKYATPFGDFQVDADVTARLRE 121

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL------IKIVPILVGAVNAENEAMYGR 167
           +G F+ +    D  EHS+EM LPY+ K+ E  L        +VPI+VG  +  +E  +G+
Sbjct: 122 SGFFKDIPRHNDVGEHSLEMQLPYIWKMLERTLGTGARWPSLVPIIVGDGSGADEREFGQ 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------DKK----HGVIHK 207
           L   Y++DP N F +SSDFCHWG  F + HY                D K       +H+
Sbjct: 182 LLVPYLEDPENAFVISSDFCHWGRNFEFWHYCPSSDFSKIRSLSPRGDSKPSPTEPPVHE 241

Query: 208 SIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI-------- 259
            I  LD+     IETG    F  YL   +NT+CGRHPISV +  L    T+         
Sbjct: 242 FIRVLDEAAFRAIETGSQAKFVDYLHVTENTVCGRHPISVAMAALELMGTRAPTSTSTNS 301

Query: 260 -----KIKFLRYEQSSQCKTKRDSSVSYASAAA 287
                K +F+RYE+S   +T  DSSVSY SA A
Sbjct: 302 PIGHPKFEFVRYERSDLAETTSDSSVSYGSAFA 334


>gi|405120620|gb|AFR95390.1| hypothetical protein CNAG_02444 [Cryptococcus neoformans var.
           grubii H99]
          Length = 346

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 140/344 (40%), Positives = 182/344 (52%), Gaps = 55/344 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-------PKSPEVRGVIAPHAGYSYS 53
           M  +R A+HAGSWYT +   L  +L   L            P     + +IAPHAGYSYS
Sbjct: 1   MSGVREATHAGSWYTSSRSGLRNQLSQNLSAVKPISTLDYDPPVSNAKAIIAPHAGYSYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G AAA+A+  +    I RVFLLGPSHH Y P  ALS    Y+TP+GD+PLD++ I+EL+A
Sbjct: 61  GPAAAWAYAAVPTEKIKRVFLLGPSHHAYLPGVALSKFEAYETPLGDIPLDIDTIDELRA 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-LIKIVPILVGAVNAENEAMYGRLFAKY 172
           TG F  M    DE EHS+EMHLPY+  +F+G   +K+VPILVG  +A          AKY
Sbjct: 121 TGIFSDMKSSTDEDEHSLEMHLPYIRLIFQGRDDLKLVPILVGHPSASTSVKLSEALAKY 180

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY---------------------------------- 198
             D   FF +SSDFCHWGSRF+   Y                                  
Sbjct: 181 WQDDETFFVISSDFCHWGSRFSCTPYYPNVPPLANPVPPVKSSTSVTSSTLTQPLELVKK 240

Query: 199 -----DKKHGVIHKSIEALDKMGMDII----ETGDPDAFKKYLLEYDNTICGRHPISVFL 249
                      I KSIE +D  GMD++    E G  + +  YL    NTICGR+PI+V L
Sbjct: 241 FSSASSNPDVPIWKSIEYMDHEGMDLLRKPGEEGAVEKWHGYLDRTKNTICGRNPITVLL 300

Query: 250 HMLG----NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +++     +   K +  F+RYEQSS+C T +DSSVSY S   ++
Sbjct: 301 NLVQLVYKDQPVKPEFTFVRYEQSSRCVTGKDSSVSYVSGVLRL 344


>gi|118376702|ref|XP_001021532.1| hypothetical protein TTHERM_00148970 [Tetrahymena thermophila]
 gi|89303299|gb|EAS01287.1| hypothetical protein TTHERM_00148970 [Tetrahymena thermophila
           SB210]
          Length = 333

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 112/288 (38%), Positives = 171/288 (59%), Gaps = 16/288 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R  ++ G WY+     L  +L+ WL  A   +     ++ +IAPHAG+S+SG  AA+A+
Sbjct: 54  VREPTYQGLWYSSQDYELNIQLNVWLDMAKCDITSIKSIKAIIAPHAGFSFSGPTAAWAY 113

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
                    R     P        CALS    Y+TP+G++ +D E I++L   G FE+ D
Sbjct: 114 ---------RYLKQNPPTQ---KSCALSGMKTYETPLGNIEIDQETIQQLLKEGSFEVSD 161

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
              +E EHS+EM L +L K      IK++P++VG++++  E  YG+LF+KY++     F 
Sbjct: 162 KDAEEEEHSIEMQLCFLIKTLGLENIKLIPVMVGSIDSTQEEKYGKLFSKYINQEDTLFV 221

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +S+DFCHWG +F Y +  K+   I++ IE LD+ GM+ IE  D DAF++Y+ E +N ICG
Sbjct: 222 ISTDFCHWGQKFKYTYSTKEDCEIYEQIEKLDQKGMEHIEMNDIDAFQQYIRETENNICG 281

Query: 242 RHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
           RH IS+ ++ +     S +I  KF+RY QSS  + K DSSVSYA+A +
Sbjct: 282 RHAISILMNAIAKSKISNQIDTKFIRYSQSSLVRDKNDSSVSYAAAVS 329


>gi|358368861|dbj|GAA85477.1| DUF52 domain protein [Aspergillus kawachii IFO 4308]
          Length = 365

 Score =  231 bits (588), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 180/358 (50%), Gaps = 77/358 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R  SHAGSWY+D+   L+ +LD WL+E        G    P  R +IAPHAGY+YSGR A
Sbjct: 4   REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
           A+A+  +D +   R+F++GPSHH+Y    AL   T Y TP+ D  LPLD E I +L++T 
Sbjct: 64  AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTK 123

Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
                    +F  M   VDEAEHS+E+HLPY+ ++ +             +VPILVGAV 
Sbjct: 124 AVSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSQYPPLVPILVGAVT 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD------------------- 199
              E  +G L A Y+DDP+N F +SSDFCHWG RF Y +Y                    
Sbjct: 184 ESTEKAFGTLLAPYIDDPANAFVISSDFCHWGQRFRYTYYTSQAPNPGPSIPVSAPGLPQ 243

Query: 200 ----------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                        +   IH+SI A+D   M  I TG+   F   
Sbjct: 244 PGIDGDSVTDQMEMVSAGRSLKSRERIRSREPPIHESISAVDIATMAAIATGEYARFSTI 303

Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           L    NT+CGRHPI V +     +  N   K + +F+RY++SS      DSSVSY SA
Sbjct: 304 LKNTGNTVCGRHPIGVIMAGIEDIRKNEGEKGRFRFIRYDRSSHVIEVDDSSVSYVSA 361


>gi|58267092|ref|XP_570702.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134111713|ref|XP_775392.1| hypothetical protein CNBE1080 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50258051|gb|EAL20745.1| hypothetical protein CNBE1080 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57226936|gb|AAW43395.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 346

 Score =  230 bits (587), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 184/344 (53%), Gaps = 55/344 (15%)

Query: 1   MEKIRRASHAGSWYTDNS----KRLAEELDGWLREAGLPKSPEV---RGVIAPHAGYSYS 53
           M  +R A+HAGSWYT +     K+L++ L      + L   P V   + +IAPHAGYSYS
Sbjct: 1   MSGVREATHAGSWYTSSRPGLHKQLSQNLSAVKPISTLDYDPPVSNAKAIIAPHAGYSYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G AAA+A+  +    I RVFLLGPSHH Y P  ALS    Y+TP+GD+PLD + I EL+ 
Sbjct: 61  GPAAAWAYAAVPTEKIKRVFLLGPSHHAYLPGVALSKFEAYETPLGDIPLDTDTINELRD 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-LIKIVPILVGAVNAENEAMYGRLFAKY 172
           TG F  M    DE EHS+EMHLPY+  +F+G   +K+VPILVG  +A   A      AKY
Sbjct: 121 TGIFSDMKSSTDEDEHSLEMHLPYIRLIFQGRDDLKLVPILVGHPSASTSAKLSEALAKY 180

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY---------------------------------- 198
             D   FF +SSDFCHWGSRF+   Y                                  
Sbjct: 181 WQDGETFFVISSDFCHWGSRFSCTPYYPNIPPLANPVPPVKSSTSATPGTLTQPPELVKK 240

Query: 199 -----DKKHGVIHKSIEALDKMGMDII----ETGDPDAFKKYLLEYDNTICGRHPISVFL 249
                      I KSIE +D  GMD++    E G  + +  YL    NTICGR+PI+V L
Sbjct: 241 FSSASSNPDVPIWKSIEYMDHEGMDLLRKPGEDGAVEKWHGYLERTKNTICGRNPITVLL 300

Query: 250 HMLG----NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +++     N   K +  F+RYEQSS+C   +DSSVSY S   ++
Sbjct: 301 NLVQFVYKNQPVKPEFVFVRYEQSSKCVNGKDSSVSYVSGVLRL 344


>gi|432906940|ref|XP_004077602.1| PREDICTED: protein MEMO1 [Oryzias latipes]
          Length = 394

 Score =  230 bits (587), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 114/228 (50%), Positives = 149/228 (65%), Gaps = 8/228 (3%)

Query: 45  APHAGYSYSGRAAA-----YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG 99
           A HAG  YS          +A+  +DP+   RVF+LGPSHH    +CALS A +Y+TP+ 
Sbjct: 10  ASHAGSWYSASXXXXXXXXHAYKQVDPSITRRVFILGPSHHVPLSRCALSPADIYRTPLY 69

Query: 100 DLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAV 157
           DL +D +V  +L  TG FE + + +DE EHS+EMHLPY AK  E H     IVP+LVGA+
Sbjct: 70  DLRIDQKVYADLWKTGLFERISMQIDEDEHSIEMHLPYTAKAMESHKDEFSIVPVLVGAL 129

Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
           +   E  YG+L +KY+ DP+N F +SSDFCHWG RF Y +YD+  G I++SIE LDKMGM
Sbjct: 130 SESKEQEYGKLLSKYLADPTNLFIISSDFCHWGQRFRYTYYDESQGEIYRSIENLDKMGM 189

Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK-IKIKFL 264
            IIE  DP +F  YL +Y NTICGRHPI V L +L    T+ + + F+
Sbjct: 190 SIIEQLDPMSFTNYLKKYRNTICGRHPIGVLLALLSEVRTEDVNVTFV 237


>gi|425769569|gb|EKV08060.1| hypothetical protein PDIP_70230 [Penicillium digitatum Pd1]
 gi|425771206|gb|EKV09656.1| hypothetical protein PDIG_60800 [Penicillium digitatum PHI26]
          Length = 367

 Score =  230 bits (586), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 138/362 (38%), Positives = 180/362 (49%), Gaps = 79/362 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R+ASH+GSWY+D+++ LA +LDGWL +        G   +P  R +IAPHAGYSYSG  A
Sbjct: 4   RQASHSGSWYSDSARTLARQLDGWLAQVPDTMEKVGSLPTPGARVIIAPHAGYSYSGPCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
           AYA+  +D +   R+F+LGPSHH      AL T T Y+TP+ D  LPLD E+I +L    
Sbjct: 64  AYAYKALDLSKAKRIFILGPSHHVSLATLALPTLTSYRTPLSDEPLPLDTELITQLFETQ 123

Query: 112 --KATG---KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
             K  G   +F  M   VDE EHS+EMHLPY+ ++ +             +VPI+VG  +
Sbjct: 124 ATKPNGTKIRFTTMSRSVDEDEHSIEMHLPYIHRLLQLQYPDSSASEYPPLVPIMVGNTS 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
           A  E  +G L A Y+ DP N F +SSDFCHWG RF Y +Y                    
Sbjct: 184 AATEQAFGALLAPYLADPENAFVISSDFCHWGLRFRYTYYVPQAPRPGPQLPLSGDILPQ 243

Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                        +   IH+SI A D   M  I TG   +F   
Sbjct: 244 PGMAASSVHQAVDMVSLGHSLRQRDRISSRGPAIHESISAFDMATMAAITTGSAKSFLDI 303

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKI------KIKFLRYEQSSQCKTKRDSSVSYASA 285
           +    NT+CGRHPI V +  L   +         +  FLRYE+SS  +   DSSVSY SA
Sbjct: 304 IERTGNTVCGRHPIGVIMAALEFIAASYAADQEGRFYFLRYERSSDVEEIDDSSVSYVSA 363

Query: 286 AA 287
            A
Sbjct: 364 FA 365


>gi|449298413|gb|EMC94428.1| hypothetical protein BAUCODRAFT_149584 [Baudoinia compniacensis
           UAMH 10762]
          Length = 336

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 138/330 (41%), Positives = 183/330 (55%), Gaps = 44/330 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SPEVRG-------------VIAPH 47
           +R ASHAGSWY+ ++ +L  +L+G L     P     P+  G             +I PH
Sbjct: 7   VRDASHAGSWYSASASQLDSQLEGHLAAVKPPLKCIGPQSEGQSVSELPISGARVIIGPH 66

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
           AGYSYSG AAA+A+   D +   RVFLLGPSHH+YT + AL+  T Y TP+G L +D  +
Sbjct: 67  AGYSYSGPAAAWAYRCWDVSKAKRVFLLGPSHHHYTSRAALTRCTHYATPLGHLTIDRAM 126

Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYL----AKVFEGHLIK---IVPILVGAVNAE 160
             EL  +GKFE M   VDE EHS+EMHLPY+    +K F G+  K   +VP++VG  +A 
Sbjct: 127 TTELYKSGKFEWMSQSVDEDEHSLEMHLPYIHKIFSKTFSGNAAKFPLLVPVMVGNTSAS 186

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK---------------KHGVI 205
            EA  G+L A Y+ D +N F +SSDF HWG RF Y  Y                  +  I
Sbjct: 187 TEATLGKLLAPYLADETNAFVISSDFAHWGLRFRYTLYRSPGASQVNLAAGAKAPTNPAI 246

Query: 206 HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKI--- 259
           H+SI+A+D   +   E+GD   + + L    NT+CGRHPI V +     L   S K    
Sbjct: 247 HESIKAVDFDCIASCESGDHSQWLETLESTGNTVCGRHPIGVMMAAVEELRRTSGKKDAG 306

Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
             KF+RYE+S + K    SSVSYASA A +
Sbjct: 307 AFKFVRYERSEEVKKVSQSSVSYASAFAVI 336


>gi|350635890|gb|EHA24251.1| hypothetical protein ASPNIDRAFT_209410 [Aspergillus niger ATCC
           1015]
          Length = 365

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 178/358 (49%), Gaps = 77/358 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R  SHAGSWY+D+   L+ +LD WL+E        G    P  R +IAPHAGY+YSGR A
Sbjct: 4   REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
           A+A+  +D +   R+F++GPSHH+Y    AL   T Y TP+ D  LPLD E I +L++T 
Sbjct: 64  AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTR 123

Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
                    +F  M   VDEAEHS+E+HLPY+ ++ +             +VPILVGAV 
Sbjct: 124 AGSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSEYPPLVPILVGAVT 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD------------------- 199
              E  +G L A Y+DDP N F +SSDFCHWG RF Y +Y                    
Sbjct: 184 ESTEKAFGALLAPYIDDPENAFVISSDFCHWGQRFRYTYYTPQAPNPGPSIPVSAPGLPQ 243

Query: 200 ----------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                        +   IH+SI A+D   M  I TG+   F   
Sbjct: 244 PGIDSDNVTDQMEMVSTGRSLKSRDRIRSREPPIHESISAVDLATMAAITTGEYARFSTI 303

Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           L    NT+CGRHPI V +     +  N   K +  F+RY++SS      DSSVSY SA
Sbjct: 304 LKNTGNTVCGRHPIGVIMAGIEEIRKNEGEKGRFHFIRYDRSSNVIEVDDSSVSYVSA 361


>gi|317029592|ref|XP_001391938.2| hypothetical protein ANI_1_1068064 [Aspergillus niger CBS 513.88]
          Length = 365

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 178/358 (49%), Gaps = 77/358 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R  SHAGSWY+D+   L+ +LD WL+E        G    P  R +IAPHAGY+YSGR A
Sbjct: 4   REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
           A+A+  +D +   R+F++GPSHH+Y    AL   T Y TP+ D  LPLD E I +L++T 
Sbjct: 64  AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTR 123

Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
                    +F  M   VDEAEHS+E+HLPY+ ++ +             +VPILVGAV 
Sbjct: 124 AGSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSEYPPLVPILVGAVT 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD------------------- 199
              E  +G L A Y+DDP N F +SSDFCHWG RF Y +Y                    
Sbjct: 184 ESTEKAFGALLAPYIDDPENAFVISSDFCHWGQRFRYTYYTPQAPNPGPSIPVSAPGLPQ 243

Query: 200 ----------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                        +   IH+SI A+D   M  I TG+   F   
Sbjct: 244 PGIDGDNVTDQMEMVSTGRSLKSRDRIRSREPPIHESISAVDLATMAAITTGEYARFSTI 303

Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           L    NT+CGRHPI V +     +  N   K +  F+RY++SS      DSSVSY SA
Sbjct: 304 LKNTGNTVCGRHPIGVIMAGIEEIRKNEGEKGRFHFIRYDRSSNVIEVDDSSVSYVSA 361


>gi|255943395|ref|XP_002562466.1| Pc18g06390 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211587199|emb|CAP94863.1| Pc18g06390 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 367

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 136/362 (37%), Positives = 178/362 (49%), Gaps = 79/362 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R ASH+GSWY+D+++ L  +LDGWL +        G   +P  R +IAPHAGYSYSG  A
Sbjct: 4   RHASHSGSWYSDSARTLTRQLDGWLAQVPDTMEKVGSLPTPGARVIIAPHAGYSYSGPCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
           AYA+  +D +   R+F+LGPSHH      AL T T Y+TP+ D  LPLD E+I +L  T 
Sbjct: 64  AYAYKALDLSKAKRIFILGPSHHVSLATLALPTLTSYRTPLSDEPLPLDTELITQLLKTQ 123

Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
                     F  M   VDE EHS+EMHLPY+ ++ +             +VPI+VG  +
Sbjct: 124 ATRPNGAKISFTTMSRSVDEDEHSIEMHLPYIHRLLQLQYPDSSASEYPPLVPIMVGNTS 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
           A  E  +G L A Y+ DP+N F +SSDFCHWG RF Y +Y                    
Sbjct: 184 AATEQAFGALLAPYLADPANAFVISSDFCHWGLRFRYTYYVPQAARPGPELPLSGDILPQ 243

Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                        +   IH+SI A D   M  I TG   +F   
Sbjct: 244 PGMDASSVDQVVEMVSAGHSLRQRDRISSREPAIHESISAFDMATMAAITTGSAKSFLDI 303

Query: 232 LLEYDNTICGRHPISVFLHMLGNCST------KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +    NT+CGRHPI V +  L   +       + +  FLRYE+SS  +   DSSVSY SA
Sbjct: 304 IERTGNTVCGRHPIGVIMAALEVVTASRAADQEGRFYFLRYERSSDVEDIDDSSVSYVSA 363

Query: 286 AA 287
            A
Sbjct: 364 FA 365


>gi|237834219|ref|XP_002366407.1| hypothetical protein TGME49_027840 [Toxoplasma gondii ME49]
 gi|211964071|gb|EEA99266.1| hypothetical protein TGME49_027840 [Toxoplasma gondii ME49]
 gi|221486634|gb|EEE24895.1| conserved hypothetical protein [Toxoplasma gondii GT1]
 gi|221508391|gb|EEE33978.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 302

 Score =  228 bits (580), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 125/294 (42%), Positives = 170/294 (57%), Gaps = 10/294 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           +RRA+HAGSWY+ +   L +  D  L  A   K+ E V  +I PHAGY+Y  R AA+A+ 
Sbjct: 3   VRRAAHAGSWYSSDPGELTQLFDRCLATAE--KTEENVIALICPHAGYAYCARTAAWAWR 60

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATV--YKTPIGDLPLDLEVIEELKATGKFELM 120
            + P ++ RVF+LGPSHH + P CAL  ++V  Y TP+GD+ LD  V+E L++   FE +
Sbjct: 61  QVSPENVRRVFVLGPSHHVFLPGCALPASSVRAYATPLGDISLDTAVLETLRSAKLFETL 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
            +  DE EHS+EM LP+L  +  G    +VPI+VG +     A   +    Y     N F
Sbjct: 121 ALRDDEEEHSIEMQLPFLRHILRGKSFTLVPIVVGDLRPSGHAAVAKALRPYFLQEGNLF 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
             SSDFCHWG RF Y +         I + I  LDK G  +IE  DP  F++Y     NT
Sbjct: 181 VFSSDFCHWGRRFRYSYLPPATASLPIFERIGILDKEGAALIEQQDPAGFQEYYERTGNT 240

Query: 239 ICGRHPISVFLHML---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           ICG +PIS+FLH+L   G   +  K K L Y QSSQ + +  SSVSYA+ A+ +
Sbjct: 241 ICGHNPISIFLHLLEASGRPRSAFKTKLLDYSQSSQVENESSSSVSYAAFASSL 294


>gi|70989359|ref|XP_749529.1| DUF52 domain protein [Aspergillus fumigatus Af293]
 gi|66847160|gb|EAL87491.1| DUF52 domain protein [Aspergillus fumigatus Af293]
 gi|159128940|gb|EDP54054.1| DUF52 domain protein [Aspergillus fumigatus A1163]
          Length = 402

 Score =  227 bits (579), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 138/361 (38%), Positives = 178/361 (49%), Gaps = 78/361 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R  SHAGSWY+DN   L  +LD WL          G    P  R +IAPHAGY+YSG  A
Sbjct: 40  REPSHAGSWYSDNRSTLTRQLDQWLAHVPNEIEGIGSLPVPGSRVIIAPHAGYAYSGPCA 99

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
           AYA+  +D +   R+F+LGPSHH+Y    AL   T Y TP+ D  LPLD E+I +L    
Sbjct: 100 AYAYRALDLSKAKRIFILGPSHHHYLSTLALPQLTSYYTPLSDEPLPLDTELIAKLLSAK 159

Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE-GHLIK-------IVPILVGAVN 158
                 +T  F  M   VDE EHS+E+HLPY+ ++ +  H  K       +VPILVG+ +
Sbjct: 160 AVKPNGSTVSFTTMTRSVDEDEHSIELHLPYIHRLLQLQHPTKRTSQYPPLVPILVGSTS 219

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
           A  E  +G L A Y++DPSN F +SSDFCHWG RF+Y +Y                    
Sbjct: 220 ASTEQAFGALLASYLEDPSNVFVISSDFCHWGLRFSYTYYVPQAPKPGPKLPLSADALPQ 279

Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                      + +   IH+SI A D   M  I TG+ + F   
Sbjct: 280 PSNDLDEFEERIELVSAGHSLQRRDRINSREPAIHESISAFDIATMAAIATGETENFLDV 339

Query: 232 LLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +    NT+CGRHPI V +  +    T     K    F+RYE+SS      DSSVSY SA 
Sbjct: 340 IQRTGNTVCGRHPIGVIMAAIEATRTQEDGKKGAFHFIRYERSSDAVNVTDSSVSYVSAF 399

Query: 287 A 287
           A
Sbjct: 400 A 400


>gi|119480005|ref|XP_001260031.1| hypothetical protein NFIA_080780 [Neosartorya fischeri NRRL 181]
 gi|119408185|gb|EAW18134.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 366

 Score =  226 bits (577), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 138/361 (38%), Positives = 178/361 (49%), Gaps = 78/361 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R  SHAGSWY+DN   L  +LD WL          G    P  R +IAPHAGY+YSG  A
Sbjct: 4   REPSHAGSWYSDNRSTLTRQLDQWLAHVPDEIEGIGSLPVPGSRVIIAPHAGYAYSGPCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
           AYA+  +D +   R+F+LGPSHH+Y    AL   T Y TP+ D  LPLD E+I +L    
Sbjct: 64  AYAYRALDLSKAKRIFILGPSHHHYLSTLALPQLTSYYTPLSDEPLPLDTELIAKLLSAK 123

Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE-GHLIK-------IVPILVGAVN 158
                 +T  F  M   VDE EHS+E+HLPY+ ++ +  H  K       +VPILVG+ +
Sbjct: 124 AVKPNGSTVSFTTMTRSVDEDEHSIELHLPYIHRLLQLQHPTKRTSQYPPLVPILVGSTS 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
           A  E  +G L A Y++DPSN F +SSDFCHWG RF+Y +Y                    
Sbjct: 184 ASTEQAFGALLASYMEDPSNVFVISSDFCHWGLRFSYTYYVPQAPKPGPKLPLSADVLPQ 243

Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                      + +   IH+SI A D   M  I TG+ + F   
Sbjct: 244 PSNDLDEFEEKIEQVSAGHSLQRRDRINSREPAIHESISAFDIATMAAIATGETENFLDA 303

Query: 232 LLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +    NT+CGRHPI V +  +    T     K    F+RYE+SS      DSSVSY SA 
Sbjct: 304 IQRTGNTVCGRHPIGVIMAAIEATRTQEDGKKGAFHFIRYERSSDAVNVTDSSVSYVSAF 363

Query: 287 A 287
           A
Sbjct: 364 A 364


>gi|443924045|gb|ELU43122.1| Memo-like domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 486

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 134/297 (45%), Positives = 167/297 (56%), Gaps = 44/297 (14%)

Query: 21  LAEELDGWLR-------EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVF 73
           L  +LDGWL        E   P     + +IAPHAGYSYSG AAA+A+ +ID T I RVF
Sbjct: 66  LDSQLDGWLALVQDGALEGFRPPVTGCKAIIAPHAGYSYSGPAAAWAYKSIDTTGIKRVF 125

Query: 74  LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEM 133
           +LGPSHH Y   CALS    Y+TPI         I +L+ TG FE M +  DE EHS+EM
Sbjct: 126 ILGPSHHVYLRSCALSKCQSYETPIA--------IAKLRETGAFEDMPLGTDEDEHSIEM 177

Query: 134 HLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHW---- 189
           HLPY+ K+F G  IK+VPILVGAV+ + EA +G L A Y  +P     VSSDFCH     
Sbjct: 178 HLPYVRKIFAGLDIKVVPILVGAVDQDQEADFGALLAPYFAEPDTICIVSSDFCHCYTFY 237

Query: 190 ---------GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE--TGDP---------DAFK 229
                     ++ +      +   I+KSI ALDK  MDI+   T  P         DAF 
Sbjct: 238 YPSPPPSATPTKLSRSIQPNEDYPIYKSITALDKQAMDILTLPTEAPDTKTAAQTHDAFA 297

Query: 230 KYLLEYDNTICGRHPISVFLHML-----GNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
           KYL E  NTICGRHPI V L  L     G+      +K++RYEQSS+C+  +DSS +
Sbjct: 298 KYLKETKNTICGRHPIGVLLGALAELERGDSPRASAVKWVRYEQSSECRKLQDSSTA 354


>gi|83766294|dbj|BAE56437.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 392

 Score =  225 bits (574), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 134/357 (37%), Positives = 177/357 (49%), Gaps = 82/357 (22%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
           R ASHAGSWY+DN + L  +LDGWL  A +P S         P  R +IAPHAGYSYSG 
Sbjct: 4   REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPHAGYSYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKA 113
            AAYA+  +D +   R+F+LGPSHH+Y    AL   T Y TP+ D  LPLD E+I +L +
Sbjct: 62  CAAYAYKALDLSKAKRIFILGPSHHHYLSTLALPELTSYYTPLSDEPLPLDTELITKLLS 121

Query: 114 TG---------KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--------IVPILVGA 156
           T           F  M   +DE EHS+E+HLPY+ ++ + H           +VPI+VG+
Sbjct: 122 TKAVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTSQYPPLVPIMVGS 181

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------ 198
            +A  E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y                  
Sbjct: 182 TSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQAPKPGPKLPLSANSL 241

Query: 199 -----------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
                                          +   IH+SI A D   M  I TG    F 
Sbjct: 242 PQPGDDLSSVEEKMESVSTGQALQRRDRISSREPAIHESISAFDIATMAAISTGATANFL 301

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVS 281
           + +    NT+CGRHPI V +  +   ++     K K  F+RYE+SS      DSS S
Sbjct: 302 ETIQTTGNTVCGRHPIGVIMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSFS 358


>gi|67526663|ref|XP_661393.1| hypothetical protein AN3789.2 [Aspergillus nidulans FGSC A4]
 gi|40740807|gb|EAA59997.1| hypothetical protein AN3789.2 [Aspergillus nidulans FGSC A4]
 gi|259481656|tpe|CBF75380.1| TPA: DUF52 domain protein (AFU_orthologue; AFUA_2G03970)
           [Aspergillus nidulans FGSC A4]
          Length = 366

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 143/364 (39%), Positives = 182/364 (50%), Gaps = 78/364 (21%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRA 56
            R ASHAGSWY+ N + LA +LD WL          G   +P  R +IAPHAGYSYSG  
Sbjct: 3   FRAASHAGSWYSGNRQTLALQLDQWLDRVPGTLEGLGSLPAPGARIIIAPHAGYSYSGPC 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKAT 114
           AAYA+  +D +   R+F+LGPSHH+Y    AL T T Y TP+ D  LPLD E+I +L++T
Sbjct: 63  AAYAYKALDLSKAKRIFVLGPSHHHYFTSLALPTLTGYYTPLSDDPLPLDTELIAKLRST 122

Query: 115 G---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAV 157
                      FE M   VDE EHS+E+HLPY+ ++ +             +VPILVG+ 
Sbjct: 123 PAAKSNGSTIAFEDMSQSVDEDEHSIELHLPYIHRLLQLQHPDKKAAEYPPLVPILVGST 182

Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------- 204
           +   EA +G L A Y++DPSN F VSSDFCHWG RF Y +Y  +  V             
Sbjct: 183 SKATEAAFGALLAPYLEDPSNAFVVSSDFCHWGLRFRYTYYVPQAPVPGPRLPLSSYDLP 242

Query: 205 ----------------------------------IHKSIEALDKMGMDIIETGDPDAFKK 230
                                             IH+SI + D   M  I TG  + F  
Sbjct: 243 QPEDDETEIESKIEVACAGYHLKRRDPIRRGKPLIHESISSFDIATMAAIATGKTERFTD 302

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCST----KIKIKFLRYEQSSQC-KTKRDSSVSYASA 285
            L +  NT+CGRHPI V +  +         K K  F RYE+SS   +   DSSVSY SA
Sbjct: 303 ILEKTGNTVCGRHPIGVIMAAIEAAGKQEGGKGKFHFARYERSSDIGEIVEDSSVSYVSA 362

Query: 286 AAKV 289
            A V
Sbjct: 363 FAVV 366


>gi|449018288|dbj|BAM81690.1| probable CGI-27 protein [Cyanidioschyzon merolae strain 10D]
          Length = 365

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 179/346 (51%), Gaps = 63/346 (18%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR---------------------------------- 30
           R A HAGSWY  +++RL++ L+ WLR                                  
Sbjct: 6   REAQHAGSWYQSDTERLSQALEQWLRRVPSQLTPLRPADDAILFRNLWGKRDHFLDDSNV 65

Query: 31  EAGLP--KSPEV---------RG----VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLL 75
           EA L   ++P+          RG    ++ PHAGY+YSG  AA+A+  +DP    RV ++
Sbjct: 66  EASLSSEQNPQQVQTQTQVQGRGTLEILVVPHAGYAYSGETAAFAYAQVDPARFDRVVII 125

Query: 76  GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT--GKFELMDICVDEAEHSMEM 133
           GPSHH Y  KC LS A    TP+GDL +D  +++         FE++    DE EHS+EM
Sbjct: 126 GPSHHVYMQKCGLSGAERLATPLGDLLVDTALVQSWSQLYPELFEILSKSTDEDEHSIEM 185

Query: 134 HLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGS 191
            LP+LAKVF+G L  ++ VPI+ GA++   E   GR  A+ V  P     VS+DFCHWG 
Sbjct: 186 QLPFLAKVFQGRLQQVRFVPIMCGALSPSKEKACGRFCAEQVLQPRTLLVVSTDFCHWGQ 245

Query: 192 RFNYM--------HYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           RF Y            ++ GV +   I ALD MGM +I   D + F+ YL +  NTICGR
Sbjct: 246 RFGYTPGCKDKLAQERQRLGVGLSDYIRALDSMGMYLISLKDAEGFRAYLQKTGNTICGR 305

Query: 243 HPISVFLHMLGNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
           HP++V L  +     +  ++ FLRY QSS C    DSSVSY + A 
Sbjct: 306 HPLAVMLWAVHFAEPSAWRLVFLRYAQSSACVDAHDSSVSYVAGAG 351


>gi|403348996|gb|EJY73945.1| Cell motility mediator [Oxytricha trifallax]
          Length = 323

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 124/300 (41%), Positives = 177/300 (59%), Gaps = 18/300 (6%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           IR+A+HAGSWYT++   L  EL  +L ++     + +  +++ +I PHAG+ +SG  AA+
Sbjct: 16  IRKANHAGSWYTEDPDDLNTELKQYLNQSQQTVDVNEQGQMKALIGPHAGFRFSGPTAAW 75

Query: 60  AFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
           A+ NI +P    RVFLLGPSH  Y    A +    ++TP+G+L +D   I EL   GK  
Sbjct: 76  AYINIKNPEQYKRVFLLGPSHKVYLDNIATTACDEWETPLGNLKIDHITITELIQNGKEQ 135

Query: 117 --FELMDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYV 173
             F+ +    +E EHS+EMH+P++ K+FEG   + +VP++VG +  +    Y  L   Y 
Sbjct: 136 ELFQQISKKYEENEHSLEMHIPFIRKMFEGREDVTLVPLMVGQIPDDKFDRYAELLMPYF 195

Query: 174 DDPSNFFSVSSDFCHWGSRFNY-MHYDK--KHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
            D    F VSSDFCHWG RF + + Y    K   IHKSIE LD  GM +I+  D + FK+
Sbjct: 196 LDQQTLFIVSSDFCHWGKRFQFTLRYQDQDKSEPIHKSIEKLDHEGMSLIQNHDFNGFKQ 255

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCST----KIKIKFLRYEQSSQCKT-KRDSSVSYASA 285
           YL    NTICGR+PI V L ++         K   KF++Y QSS+ +  + DSSVSYAS+
Sbjct: 256 YLKTLKNTICGRYPIQVLLAVINKVEQTQQYKCSTKFVKYAQSSEIQMDEDDSSVSYASS 315


>gi|336467325|gb|EGO55489.1| hypothetical protein NEUTE1DRAFT_131235 [Neurospora tetrasperma
           FGSC 2508]
 gi|350288045|gb|EGZ69281.1| UPF0103-domain-containing protein [Neurospora tetrasperma FGSC
           2509]
          Length = 355

 Score =  224 bits (570), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 181/336 (53%), Gaps = 52/336 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
            ++ R ASHAGSWY  N+ RL+ +LD ++           LP  P  R +IAPHAGYSYS
Sbjct: 17  FQRTRPASHAGSWYLGNAARLSSQLDEFMSRVPNKLDGRDLP-IPGARVIIAPHAGYSYS 75

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G  AA+A+  +D  ++ RVFLLGPSH +Y   CALST   Y TP GDL +D + ++EL  
Sbjct: 76  GPCAAWAYKILDLANVKRVFLLGPSHTFYLKGCALSTFGKYSTPFGDLVVDGKAVDELME 135

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYG 166
             KF  + +  D  EH +EMHLPYL K  E  L         IVP+LVG ++A+ E   G
Sbjct: 136 DQKFSPIPVEYDIREHCLEMHLPYLWKRLEQTLGSDSSQFPPIVPVLVGDLSADGEKAVG 195

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------------- 204
            L A Y+ DP N F +SSDFCHWG  ++Y     K  V                      
Sbjct: 196 SLLAPYLADPKNAFIISSDFCHWGKNYHYRPQLHKGLVRDQDAQWASLKIQPGFKTKADG 255

Query: 205 ---------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML--- 252
                    IH+ I+ALD + MD I+TGD   +   L   +NT+CGRHPI V L  L   
Sbjct: 256 SEPRDAEVPIHEVIKALDDLVMDSIKTGDHSDYYSILKGTNNTVCGRHPIGVVLAALEKM 315

Query: 253 ---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
               +  +K K +F++Y++S+  + + D SVSY SA
Sbjct: 316 GGAQSGGSKGKFQFVQYQRSNLVEKQSDFSVSYVSA 351


>gi|355565594|gb|EHH22023.1| hypothetical protein EGK_05205, partial [Macaca mulatta]
          Length = 240

 Score =  224 bits (570), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 115/272 (42%), Positives = 156/272 (57%), Gaps = 37/272 (13%)

Query: 20  RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
           +L  +L+GWL +    K P  R +I  HAGY+Y G  A +A+  +DP+   R+F+LGPSH
Sbjct: 3   QLNAQLEGWLSQVQSTKRP-ARAII--HAGYTYCGSCATHAYKQVDPSVTRRIFILGPSH 59

Query: 80  HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
           H    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLPY A
Sbjct: 60  HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 119

Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
           K  E H     I+P+LVGA++   E  + +LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 120 KAMESHKDEFTIIPVLVGALSESKEQEFRKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 179

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
           YD+  G I++SIE LDKM +  ++                                    
Sbjct: 180 YDESQGEIYRSIEHLDKMAITELQKN--------------------------------GM 207

Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 208 NMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 239


>gi|121710340|ref|XP_001272786.1| DUF52 domain protein [Aspergillus clavatus NRRL 1]
 gi|119400936|gb|EAW11360.1| DUF52 domain protein [Aspergillus clavatus NRRL 1]
          Length = 366

 Score =  224 bits (570), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 134/361 (37%), Positives = 180/361 (49%), Gaps = 78/361 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R ASHAGSWY+D+   L  +LD WL +        G    P  R +IAPHAGY+YSG  A
Sbjct: 4   REASHAGSWYSDSKTTLTRQLDQWLGQVPNTIDGIGDLPVPGSRVIIAPHAGYAYSGPCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
           AYA+  +D +   R+F+LGPSHH+Y    A+   T Y TP+ D  LPLD E+I +L +T 
Sbjct: 64  AYAYKALDLSQAKRIFILGPSHHHYLSTLAVPQLTSYYTPLSDEPLPLDTELISQLLSTK 123

Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE-GHLIK-------IVPILVGAVN 158
                     F  M   +DE EHS+E+HLPY+ ++ +  H  K       +VPILVG+ +
Sbjct: 124 AVKPNGSTVSFTTMTRSIDEDEHSIELHLPYIHRLLQLQHPTKRTSEYPLLVPILVGSTS 183

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
              E  +G L A Y++DP+N F +SSDFCHWGSRF+Y +Y                    
Sbjct: 184 TTTEKAFGALLAPYLEDPTNAFIISSDFCHWGSRFSYTYYVPQAPKPGPKLPLSANALPQ 243

Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                        +   IH+SI A D   M  I TG+ + F   
Sbjct: 244 PGEKPDEFEESVEQVSAGHALQRRDRISSREPAIHESISAFDIATMAAIATGETENFLDV 303

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIK-----FLRYEQSSQCKTKRDSSVSYASAA 286
           +    NT+CGRHPI V +  +   + +   K     F+RYE+SS      DSSVSY SA 
Sbjct: 304 IQRTGNTVCGRHPIGVIMAAIEETAAREDGKKGMFHFIRYERSSDAVDVTDSSVSYVSAF 363

Query: 287 A 287
           A
Sbjct: 364 A 364


>gi|242808341|ref|XP_002485142.1| DUF52 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218715767|gb|EED15189.1| DUF52 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 354

 Score =  223 bits (569), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 134/333 (40%), Positives = 170/333 (51%), Gaps = 68/333 (20%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL--------REAGLPKSPEVRGVIAPHAGYSYSGRA 56
           RR +HAGSWY D+S  L  ELDGWL        +   LP  P  R +I PHAGY+YSG  
Sbjct: 4   RRPTHAGSWYEDDSTTLKAELDGWLDAVPNEIDKLGALPI-PGARIIIGPHAGYAYSGPC 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELK-A 113
           AA+A+  +D +   R+FLLGPSHH+     AL   T Y TP+ D  LPLD E+I +++ A
Sbjct: 63  AAWAYKALDLSKAKRIFLLGPSHHHPLATIALPEVTSYATPLSDEPLPLDTEIINKIRTA 122

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAK----VFEGHLIK----IVPILVGAVNAENEAMY 165
           +  FE M   VDE EHSME+HLPY+ +     F G        +VPI+VG+ NAE E   
Sbjct: 123 SSAFETMSRRVDEREHSMELHLPYIHRKLQLTFPGRPASEYPPLVPIMVGSTNAETERAV 182

Query: 166 GRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK------------------------ 201
           G L A Y+ DPSN F +SSDFCHWG RF Y +Y +                         
Sbjct: 183 GALLATYLADPSNAFVISSDFCHWGQRFGYTYYVRDAPQQKPPLPLSFDSLPQPPTTVSA 242

Query: 202 -------------------HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
                              H  I+ SI A+D   M  I TG+   F K L    NT+CGR
Sbjct: 243 AEDTLQKVPGRSLRRNSELHAEIYDSISAVDIATMKAITTGNASQFLKSLRTTGNTVCGR 302

Query: 243 HPISVFLHMLGNCSTKI-----KIKFLRYEQSS 270
           HPI V +  L   ++       K  F+RYE+SS
Sbjct: 303 HPIGVIMAALEEYASVAGEEVGKFHFIRYERSS 335


>gi|296818439|ref|XP_002849556.1| DUF52 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238840009|gb|EEQ29671.1| DUF52 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 371

 Score =  223 bits (569), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 135/367 (36%), Positives = 182/367 (49%), Gaps = 83/367 (22%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYSGRA 56
           +RR SHAGSWY+D+   L  ++D WL          G    P  R +IAPHAGYSYSG  
Sbjct: 3   VRRPSHAGSWYSDDGATLQAQIDKWLDLVPNELEGVGSLPVPGARVIIAPHAGYSYSGPC 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKAT 114
           AA+A+  +D T   R+FL+GPSHH++    AL   T Y+TP+    LPLD +++ EL+A+
Sbjct: 63  AAFAYKCLDLTKAKRIFLIGPSHHHHLTTIALPELTGYRTPLSSSPLPLDTKLLSELRAS 122

Query: 115 G-----KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
                 +F  M   VDEAEHSME+HLPY+ ++ +             +VP++VG+ +A  
Sbjct: 123 TASSAKRFTAMSPSVDEAEHSMELHLPYIHRLLQRLYPSEPTSAYPPLVPMMVGSTSAAT 182

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------- 198
           E  +G + + Y+ D  N F VSSDFCHWG+RF Y +Y                       
Sbjct: 183 ERAFGSILSPYLADEENAFVVSSDFCHWGTRFGYTYYTTHAPGAPFDMPLTYPSLPVPSD 242

Query: 199 -----------------------DKKH-----GVIHKSIEALDKMGMDIIETGDPDAFKK 230
                                  DK +       IH+SI A D   M  I +GD D F  
Sbjct: 243 RLTAAKAAADVDAVSSGQMLREKDKSYRAADACAIHESISACDIACMSAIASGDTDVFLD 302

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCS----------TKIKIKFLRYEQSSQCKTKRDSSV 280
            L    NT+CGRHPI V +  +               K +  F+RYE+SS C + RDSSV
Sbjct: 303 ALKRTGNTVCGRHPIGVVMAGMEGLQQEQAAEKEGVQKGRFYFMRYERSSDCFSVRDSSV 362

Query: 281 SYASAAA 287
           SY SA A
Sbjct: 363 SYVSAFA 369


>gi|170575060|ref|XP_001893082.1| Hypothetical UPF0103 protein C37C3.8 in chromosome V [Brugia
           malayi]
 gi|158601091|gb|EDP38091.1| Hypothetical UPF0103 protein C37C3.8 in chromosome V, putative
           [Brugia malayi]
          Length = 261

 Score =  223 bits (568), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 113/233 (48%), Positives = 151/233 (64%), Gaps = 8/233 (3%)

Query: 66  PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVD 125
           P  +S +F+LGPSH      CALST + Y+TPIGDL +D     ELK TG F LMD+  +
Sbjct: 20  PKPVSIIFVLGPSHVMSLDTCALSTCSRYRTPIGDLQIDQRTNMELKETGAFSLMDLRSE 79

Query: 126 EAEHSMEMHLPYLAKVFEGHLIK---IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
           EAEHS+EM LPY+AK+ E   +    IVP+LVG+++   +A YG++F+KY+ DP   F V
Sbjct: 80  EAEHSIEMQLPYIAKIMEKKSMNSYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVFVV 139

Query: 183 SSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           SSDFCHWG+RF++M YD   GV I++ I A+DK GMD I + +P  F +YL    NTICG
Sbjct: 140 SSDFCHWGNRFHFMPYDNSTGVPIYEQIAAMDKQGMDAISSLNPAVFGEYLKRTQNTICG 199

Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           R+PISV LH             +  FL+Y QS+Q +   DSSVSYA+ A  ++
Sbjct: 200 RNPISVILHAAEXFRQMNNHFAEFIFLKYAQSNQSRNLNDSSVSYAAGALFIN 252


>gi|85092720|ref|XP_959514.1| hypothetical protein NCU11386 [Neurospora crassa OR74A]
 gi|28920949|gb|EAA30278.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 355

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 134/336 (39%), Positives = 181/336 (53%), Gaps = 52/336 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
            ++ R ASHAGSWY  N+ RL+ +LD ++           LP  P  R +IAPHAGYSYS
Sbjct: 17  FQRTRPASHAGSWYLGNAARLSSQLDEFMSRVPNKLDGRDLP-IPGARVIIAPHAGYSYS 75

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G  AA+A+  +D  ++ RVFLLGPSH +Y   CALST   Y TP GDL +D + ++EL  
Sbjct: 76  GPCAAWAYKILDLANVKRVFLLGPSHTFYLKGCALSTFGKYSTPFGDLVVDGKAVDELME 135

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYG 166
             KF  + +  D  EH +EMHLPYL K  E  L         IVP+LVG ++A+ E   G
Sbjct: 136 DQKFSPIPVEYDIREHCLEMHLPYLWKRLEQTLGGDSSQFPPIVPVLVGDLSADGEKAVG 195

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------------- 204
            + A Y+ DP N F +SSDFCHWG  ++Y     K  V                      
Sbjct: 196 SILAPYLADPKNAFIISSDFCHWGKNYHYRPQLHKGLVRDQDAQWASLKIQPGFKTKADG 255

Query: 205 ---------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML--- 252
                    IH+ I+ALD + MD I+TGD   +   L   +NT+CGRHPI V L  L   
Sbjct: 256 SEPRDAEVPIHEVIKALDDLVMDSIKTGDHSDYYSILKGTNNTVCGRHPIGVVLAALEKM 315

Query: 253 ---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
               +  +K K +F++Y++S+  + + D SVSY SA
Sbjct: 316 GGAQSGESKGKFQFVQYQRSNLVEKQSDFSVSYVSA 351


>gi|348574319|ref|XP_003472938.1| PREDICTED: protein MEMO1-like [Cavia porcellus]
          Length = 341

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 103/204 (50%), Positives = 139/204 (68%), Gaps = 3/204 (1%)

Query: 17  NSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLG 76
           N  +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+   R+F+LG
Sbjct: 120 NGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILG 178

Query: 77  PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLP 136
           PSHH    +CALS+  +Y+TP+ DL +D ++  EL  TG FE M +  DE EHS+EMHLP
Sbjct: 179 PSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLP 238

Query: 137 YLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFN 194
           Y AK  E H     I+P+LVGA++   E  +G+LF+KY+ DPSN F VSSDFCHWG RF 
Sbjct: 239 YTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFR 298

Query: 195 YMHYDKKHGVIHKSIEALDKMGMD 218
           Y +YD+  G I++SIE LDKM ++
Sbjct: 299 YSYYDESQGEIYRSIEHLDKMVLE 322


>gi|336260613|ref|XP_003345100.1| hypothetical protein SMAC_07390 [Sordaria macrospora k-hell]
 gi|380096552|emb|CCC06600.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 343

 Score =  221 bits (563), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 138/338 (40%), Positives = 183/338 (54%), Gaps = 53/338 (15%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSG 54
           ++ R ASHAGSWY  ++ RL+ +LD ++           LP  P  R +IAPHAGYSYSG
Sbjct: 5   QETRPASHAGSWYLGSAARLSFQLDEYMSRVPNKLDGKDLP-IPGARVIIAPHAGYSYSG 63

Query: 55  RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
             AA+A+  +D  ++ RVFLLGPSH +Y   CALS    Y +P GDL +D E I+EL  T
Sbjct: 64  PCAAWAYKVLDLANVKRVFLLGPSHTFYLKGCALSAFGKYSSPFGDLAVDGEAIDELMKT 123

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGR 167
            KF  +    +  EH +EMHLPYL K  E  L         IVPILVG ++A+ E   G 
Sbjct: 124 KKFSTIPADYEIQEHCLEMHLPYLWKRLEQTLGSDSSQFPPIVPILVGDLSADGEETIGA 183

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNY-----------------------MHYDKKHGV 204
           L A Y+ DP N F +SSDFCHWG  + Y                       +  +K  G 
Sbjct: 184 LLAPYLADPKNAFIISSDFCHWGKNYRYRPQLYKGLVRDQDAQWATLKIQPIFENKVDGT 243

Query: 205 --------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML---- 252
                   IH+ I+ALD M +D I+TGD   +   L    NT+CGRHPI V L  L    
Sbjct: 244 EPRDADVPIHEVIKALDDMVIDAIKTGDHSEYYSTLKGTHNTVCGRHPIGVVLAALEEFL 303

Query: 253 -GNCS--TKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            G+ S  +K K +F++Y++S+  + + D SVSY SA A
Sbjct: 304 GGDQSGGSKGKFQFIQYQRSNLVERQSDFSVSYVSAYA 341


>gi|409083485|gb|EKM83842.1| hypothetical protein AGABI1DRAFT_103914 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 503

 Score =  221 bits (562), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 128/325 (39%), Positives = 174/325 (53%), Gaps = 42/325 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGYSYSGRAAA 58
           R A+HAGS  +  + +L  +L GWL      +  E       + +IAPHAGYS+ G   A
Sbjct: 181 RGAAHAGSCLS--APQLTADLSGWLSLVRPRREDEEFPVQGCKAIIAPHAGYSHCGETGA 238

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +A+ +IDP++  RVF++GPSH +    CALS    Y TPIG LP+D +V+  L  TG F 
Sbjct: 239 WAYKSIDPSTTRRVFVIGPSHFWRIDNCALSRCHTYDTPIGTLPVDTDVVVSLYETGLFR 298

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHL-----IKIVPILVGAVNAENEAMYGRLFAKYV 173
            M++  DE EHS+EM LPYL +V EG +     IKIVPI+VG +  E E  YGR+ A Y+
Sbjct: 299 WMNLTRDEREHSIEMQLPYLRRVCEGRVMIRKDIKIVPIMVGDITKEQELEYGRILAPYL 358

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------------IHKSIEALDK 214
            +    F  SSDFCHWG  + Y  Y  K                      I +S   LD 
Sbjct: 359 AEEGTVFIASSDFCHWGPDYYYTFYFPKPNCAPSEGYQTTRATPPEKGYKIWQSTTQLDH 418

Query: 215 MGMDIIETGDPDAFK------KYLLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFL 264
           +GM I+   D  AF       +YL+E DN+ICGRH I +      H+      K    ++
Sbjct: 419 IGMGILADQDRSAFTAHRDFHRYLVETDNSICGRHAIGILYGALAHLEETTGRKKFCNWV 478

Query: 265 RYEQSSQCKTKRDSSVSYASAAAKV 289
           +Y+QS+Q     D+SVSYASA  K+
Sbjct: 479 KYDQSAQVTKITDNSVSYASAWIKI 503


>gi|50549243|ref|XP_502092.1| YALI0C21450p [Yarrowia lipolytica]
 gi|49647959|emb|CAG82412.1| YALI0C21450p [Yarrowia lipolytica CLIB122]
          Length = 319

 Score =  219 bits (559), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 117/312 (37%), Positives = 174/312 (55%), Gaps = 29/312 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAG--LPKS--PEVRGVIAPHAGYSYSGRAAAYA 60
           R A+HAGSWY+ N+ +L  E+D  L      L KS  P  R ++ PHAG +Y+G      
Sbjct: 5   RPATHAGSWYSSNATKLGAEVDRHLANGASVLGKSAIPGARVLVGPHAGLAYAGPQLGET 64

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +   D  +I R+F+LGPSHH Y    A S    Y+TP G++ +D+E  ++L  +G  + M
Sbjct: 65  YAAFDFKNIKRLFILGPSHHVYLEHAATSAFHSYETPFGNVNVDVETTQKLNDSGVTKYM 124

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
               D+ EHS EMH+P+L ++     +KIVPI+VG  + E E    +L   YV+DP+N F
Sbjct: 125 SATTDKDEHSFEMHMPFLKRLVGDQNVKIVPIMVGQTSQEYEKRLAKLLLPYVEDPTNAF 184

Query: 181 SVSSDFCHWGSRFNYMHY--------------------DKKHGVIHKSIEALDKMGMDII 220
            +S+DFCHWG+ F Y  Y                     + +  I+KSIE LDK GM++ 
Sbjct: 185 VISTDFCHWGNNFRYWGYADSENCDNVSQSREELRRALKRSNTPIYKSIEYLDKKGMEVA 244

Query: 221 ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK-----IKIKFLRYEQSSQCKTK 275
                D +K+Y  + DNTICGR P+++ + ML N + +     I ++++ Y QS+Q +  
Sbjct: 245 SLTSYDKWKEYCKKTDNTICGRKPLAILISMLENYAIEKGDKPISLEWIGYSQSNQVRDI 304

Query: 276 RDSSVSYASAAA 287
            + SVSYAS  A
Sbjct: 305 SEGSVSYASGFA 316


>gi|119178448|ref|XP_001240897.1| hypothetical protein CIMG_08060 [Coccidioides immitis RS]
 gi|392867144|gb|EAS29654.2| hypothetical protein CIMG_08060 [Coccidioides immitis RS]
          Length = 383

 Score =  218 bits (555), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 140/378 (37%), Positives = 185/378 (48%), Gaps = 95/378 (25%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+DN+  L  +LD W+        G+   P    R +IAPHAGY+YSG  A
Sbjct: 4   REATHAGSWYSDNAATLTRQLDEWMNRVPNEIEGIGSLPVAGARIIIAPHAGYAYSGPCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
           A+A+ ++D +   R+FLLGPSHH+   K AL   + Y TP+    LPLD E+I+EL    
Sbjct: 64  AFAYKSLDLSKAKRIFLLGPSHHHPFSKIALPELSSYSTPLSQEPLPLDREIIDELSTRT 123

Query: 112 -KATGKFELMDICVDEAEHSMEMHLPY----LAKVFEGHLI----KIVPILVGAVNAENE 162
              T +F  M+  +DEAEHS+E+HLPY    L +++ G       K+VP++VG+ +A  E
Sbjct: 124 ENGTVRFTTMNQAIDEAEHSLELHLPYIHYLLQRLYPGEPAASYPKLVPMMVGSTSAPTE 183

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
             +GR+ A Y+ +P N F VSSDFCHWG RF Y +Y                        
Sbjct: 184 QAFGRILAPYLANPENAFIVSSDFCHWGLRFAYAYYVPDAPAPGPVLPLSYAALPQPSEA 243

Query: 199 -------------------------DKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKY 231
                                     K  GV  IH+SI A D   M  I +G    F   
Sbjct: 244 LKLGAARPQISGMSKGRYLRAGDPLPKGAGVPSIHESISACDIACMSAIASGQTQTFLDA 303

Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIK------------------IKFLRYEQS 269
           L    NTICGRHPI V +     +LG     +K                    F+RYE+S
Sbjct: 304 LKSTGNTICGRHPIGVIMAAIEFVLGKDKENMKDLDIKAGDEAQTHLMRGAFNFVRYERS 363

Query: 270 SQCKTKRDSSVSYASAAA 287
           S C +  DSSVSY SA A
Sbjct: 364 SDCISVTDSSVSYVSAFA 381


>gi|406607120|emb|CCH41508.1| hypothetical protein BN7_1049 [Wickerhamomyces ciferrii]
          Length = 317

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 125/314 (39%), Positives = 174/314 (55%), Gaps = 32/314 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAYAFG 62
           +R ASHAG+WY+ N  +L ++LD +L+ A  P+S    R +I PHAGY+Y+G   A  + 
Sbjct: 3   VRAASHAGAWYSANPLKLNKQLDEYLKAA--PQSVAGARLLIGPHAGYTYAGSTLAETYQ 60

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             D + + RVF+LGPSHH Y    AL S    Y+TP G+LP+D EV++ LK    F+ M 
Sbjct: 61  AWDTSKVKRVFILGPSHHVYFKNVALLSGVDEYETPFGNLPVDREVVDSLKNNKIFKEMS 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEG---HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           + VDE EHS EMH PY+ K+ +     +  IVPI++   +         + ++Y+ D +N
Sbjct: 121 LEVDEDEHSFEMHAPYIYKLTQNIPQGIPSIVPIMISHSDENFNKKISSILSEYLKDEAN 180

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIEALDKMGMDII 220
            F +SSDFCHWGSRF Y  Y  K  +                  IHKSIE LD +   I 
Sbjct: 181 TFVISSDFCHWGSRFGYTAYTGKGTLDDLKDLSYTTKVPSGGLSIHKSIEFLDILATRIA 240

Query: 221 ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK-------IKIKFLRYEQSSQCK 273
            TG    +K Y+    NTICG+ PI++ L  +     +        K +++ Y QSSQ  
Sbjct: 241 STGSSKDWKDYIKVTGNTICGQRPIAILLETIEKVKEEGTYDDDWGKFEWIGYSQSSQAT 300

Query: 274 TKRDSSVSYASAAA 287
           + RDSSVSYASA A
Sbjct: 301 SARDSSVSYASAFA 314


>gi|303310391|ref|XP_003065208.1| hypothetical protein CPC735_020910 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240104868|gb|EER23063.1| hypothetical protein CPC735_020910 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 381

 Score =  217 bits (552), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 137/376 (36%), Positives = 184/376 (48%), Gaps = 93/376 (24%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+DN+  L  +LD W+        G+   P    R +IAPHAGY+YSG  A
Sbjct: 4   REATHAGSWYSDNAATLTRQLDEWMNRVPNEIEGIGSLPVAGARIIIAPHAGYAYSGPCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
           A+A+ ++D +   R+FLLGPSHH+   K AL   + Y TP+    LPLD E+I+EL    
Sbjct: 64  AFAYKSLDLSKAKRIFLLGPSHHHPFSKIALPELSSYSTPLSQEPLPLDREIIDELSTRT 123

Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYLAKVFEG--------HLIKIVPILVGAVNAENE 162
              T +F  M+  +DEAEHS+E+HLPY+  + +G           K+VP++VG+ +A  E
Sbjct: 124 ENGTVRFTTMNQAIDEAEHSLELHLPYIHYLLQGLYPGEPAASYPKLVPMMVGSTSAPTE 183

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
             +GR+ A Y+ +P N F VSSDFCHWG RF Y +Y                        
Sbjct: 184 QAFGRILAPYLANPENAFIVSSDFCHWGLRFAYAYYVPDAPAPGPVLPLSYAALPQPSEA 243

Query: 199 -------------------------DKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKY 231
                                     +  GV  IH+SI A D   M  I +G    F   
Sbjct: 244 LKLGAARPQISGMSKGRYLRAGDQLPRGAGVPSIHESISACDIACMSAIASGHTQTFLDA 303

Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIK----------------IKFLRYEQSSQ 271
           L    NTICGRHPI V +     +LG     ++                  F+RYE+SS 
Sbjct: 304 LKSTGNTICGRHPIGVIMAAIEFVLGKDKENMRDLDIKAGDETHLMRGAFNFVRYERSSD 363

Query: 272 CKTKRDSSVSYASAAA 287
           C +  DSSVSY SA A
Sbjct: 364 CISVTDSSVSYVSAFA 379


>gi|212537757|ref|XP_002149034.1| DUF52 domain protein [Talaromyces marneffei ATCC 18224]
 gi|210068776|gb|EEA22867.1| DUF52 domain protein [Talaromyces marneffei ATCC 18224]
          Length = 365

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 132/332 (39%), Positives = 167/332 (50%), Gaps = 66/332 (19%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           RR +HAGSWY D++  L  ELDGWL          G    P  R +I PHAGY+YSG  A
Sbjct: 15  RRPTHAGSWYEDDTTTLKAELDGWLDAVPGEIEPFGALPIPGARIIIGPHAGYAYSGPCA 74

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL-KAT 114
           A+A+  +D +   R+FLLGPSHHY     AL   T Y TP+ D  LPLD+EVI+++  A+
Sbjct: 75  AWAYKALDLSKAKRIFLLGPSHHYPLATIALPEVTSYATPVSDEPLPLDIEVIDKICNAS 134

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAK----VFEGHLI----KIVPILVGAVNAENEAMYG 166
             FE+M   +DE EHSME+HLPY+ +     F G        +VPI+VG+  AE E   G
Sbjct: 135 STFEIMSRRIDEREHSMELHLPYIHRKLQLTFPGRPAAEYPPLVPIMVGSTKAETERAVG 194

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------------------- 198
            L A Y+ D SN F +SSDFCHWG RF Y +Y                            
Sbjct: 195 ALLATYLADSSNAFIISSDFCHWGQRFGYTYYVPDAPQQKPPLPLSFDNLPQPPTTVSEA 254

Query: 199 -DKKHGV--------------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
            D    V              I+ SI A+D   M  I TG    F K L    NT+CGRH
Sbjct: 255 EDAIQKVSGRSLRSNKEIQTEIYDSISAVDIATMKAITTGSTTQFLKILRSTGNTVCGRH 314

Query: 244 PISVFLHMLGNCSTKI-----KIKFLRYEQSS 270
           PI V +  L   ++       K  F+RYE+SS
Sbjct: 315 PIGVIMAALEAYASASGEEVGKFHFIRYERSS 346


>gi|315041727|ref|XP_003170240.1| MEMO1 protein [Arthroderma gypseum CBS 118893]
 gi|311345274|gb|EFR04477.1| MEMO1 protein [Arthroderma gypseum CBS 118893]
          Length = 379

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 137/375 (36%), Positives = 178/375 (47%), Gaps = 91/375 (24%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYSGRA 56
           +RR +HAGSWY+DN   L  +LD WL        GL K   P  R +IAPHAGYSYSG  
Sbjct: 3   VRRPTHAGSWYSDNRATLQAQLDQWLDLVPGELEGLGKLPVPGARVIIAPHAGYSYSGPC 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELK-- 112
           AA+A+  +D +   R+FL+GPSHH +    AL   T Y TP+    LPLD +V+ EL+  
Sbjct: 63  AAFAYKCLDLSKAKRIFLIGPSHHRHLTTIALPELTGYLTPLSSTPLPLDTKVLGELESH 122

Query: 113 ---ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
              +  +F  M   VDEAEHSME+HLPY+ ++ +             +VP++VG+ +A  
Sbjct: 123 ASSSAKRFTTMSPSVDEAEHSMELHLPYIHRLLQRLYPSQPTSAYPPLVPMMVGSTSAAT 182

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD---------------------- 199
           E  +G +   Y+ D  N F +SSDFCHWGSRF Y +Y                       
Sbjct: 183 ERAFGSILGPYLADEENAFVISSDFCHWGSRFAYTYYTTHAADAPYKMPLTYPSLPVPSD 242

Query: 200 ------------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
                                         K    IH+SI A D   M  I +G+ D F 
Sbjct: 243 ELTAAKAAEEVEVVSSGRMLGEKDKSYYQAKDACAIHESISACDIACMSAIASGNTDVFL 302

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCS-----------------TKIKIKFLRYEQSSQC 272
             L    NT+CGRHPI V +  +                      K K  FLRYE+SS C
Sbjct: 303 DALRRTGNTVCGRHPIGVVMAGMEGLQHDQAETTTTTTTEDGGVRKGKFYFLRYERSSDC 362

Query: 273 KTKRDSSVSYASAAA 287
            + RDSSVSY SA A
Sbjct: 363 FSLRDSSVSYVSAFA 377


>gi|310795369|gb|EFQ30830.1| memo-like protein [Glomerella graminicola M1.001]
          Length = 325

 Score =  215 bits (548), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 176/320 (55%), Gaps = 36/320 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R     G WY  ++ RLA++LD +L +        GLP  P  R +IAPHAGYSYSG  A
Sbjct: 7   RPPGKQGEWYLGDADRLAKQLDEFLADVPDQIDGQGLP-IPGARVIIAPHAGYSYSGATA 65

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+  +D +   RVFLLGPSH YY   CA++T   Y TP G+L +D E  + ++     
Sbjct: 66  AWAYKALDLSKAKRVFLLGPSHTYYLRGCAVTTYKKYGTPWGELRIDEETTDAIRRRDGV 125

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGRLFAK 171
           E M +  ++ EHS+EMHLPYL K  +       +   +VPILVG  N   E   G+  A+
Sbjct: 126 EDMPVRNEDREHSLEMHLPYLFKRLQQTFGSPDNFPTLVPILVGDNNKAEEKEVGKWLAE 185

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNY-----------MHYDKKHGV------IHKSIEALDK 214
           Y+ DP N F VSSDFCHWGS F+Y           M   + HG       IH++++ +D 
Sbjct: 186 YLRDPDNAFIVSSDFCHWGSHFDYTVYCPDGDVTKMQKLRPHGSAPDGPPIHETVKLVDD 245

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS-----TKIKIKFLRYEQS 269
           + ++ I+TG   AF   L +  NT+CGRHPI V +  L          + + +F++Y++S
Sbjct: 246 LAIEAIKTGRHSAFYDNLKQTKNTVCGRHPIGVTMAALEELGGEGQEQQHRFRFIQYQRS 305

Query: 270 SQCKTKRDSSVSYASAAAKV 289
           S      DSSVSY SA A V
Sbjct: 306 SLVTGPGDSSVSYVSAYAVV 325


>gi|452847073|gb|EME49005.1| hypothetical protein DOTSEDRAFT_67898 [Dothistroma septosporum
           NZE10]
          Length = 276

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 120/271 (44%), Positives = 155/271 (57%), Gaps = 28/271 (10%)

Query: 43  VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
           +IAPHAGYSYSG AAA+A+   D +   +VFLLGPSHH+Y  K ALS  T Y TPIG+L 
Sbjct: 2   IIAPHAGYSYSGPAAAWAYKAWDVSKAKKVFLLGPSHHHYLTKAALSKCTEYATPIGNLK 61

Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVG 155
           ++ EV  EL +TG+FE M   VDE EHS+EMHLPY+ K           +   +VPI+VG
Sbjct: 62  VNREVTAELHSTGQFEWMSRSVDEDEHSLEMHLPYIYKRLSQTFGEDAANFPTLVPIMVG 121

Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG------------ 203
             +A  E   GR+ A Y+ D SN F VSSDF HWG RF Y +Y    G            
Sbjct: 122 NTSASTEKSLGRILAPYLADRSNVFVVSSDFAHWGLRFRYTYYQPATGNAIDLTSSAKLP 181

Query: 204 ---VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI- 259
               IH+SI+ +D   M   E+G  +A+   L +  NT+CGRHPI V +  +      + 
Sbjct: 182 KDPAIHESIKVVDFQCMGACESGSHEAWLNVLDKTGNTVCGRHPIGVMMAAVEKLREDLD 241

Query: 260 -----KIKFLRYEQSSQCKTKRDSSVSYASA 285
                  KF++Y++SS      DSSVSYASA
Sbjct: 242 RLDAGAFKFVKYDRSSLVNRVGDSSVSYASA 272


>gi|322702109|gb|EFY93857.1| DUF52 domain-containing protein [Metarhizium acridum CQMa 102]
          Length = 323

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 174/321 (54%), Gaps = 39/321 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGR 55
           +R A  AGSWY  +   L  ELD +L +  +P++        P  R +IAPHAGY++SG 
Sbjct: 3   LRSAGKAGSWYVASPNNLGSELDEYLSD--VPETVDGSTLPIPGARIIIAPHAGYTFSGP 60

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            AA+A+  +D +   RVF+LGPSH YY   CA++T + Y TP G+L +D ++I+ +K   
Sbjct: 61  CAAWAYKCLDLSKAKRVFVLGPSHTYYLDGCAVTTYSKYVTPFGNLTVDRDIIQRVKEAA 120

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLF 169
           + + +    D AEHS+EMHLPYL K  E       +   IVPIL+G  N + E   G++ 
Sbjct: 121 QMDDIPPSRDSAEHSLEMHLPYLYKRCEQTFGSPENFPTIVPILIGDNNRDEEKAVGQVL 180

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSIEAL 212
             Y+ D  N F +SSDFCHWG+ F YM Y   +                   IH++I  L
Sbjct: 181 VPYLKDEENAFIISSDFCHWGAHFQYMVYSPDNDPARLINLHKGDKAPIGPPIHETIRLL 240

Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS------TKIKIKFLRY 266
           DK  MD +E+G  DAF   L    NT+CGRHPI V +  L   S       K + +  +Y
Sbjct: 241 DKAAMDAVESGHHDAFVDNLKLTKNTVCGRHPIGVAMAALELLSGEVADVKKCRFRITQY 300

Query: 267 EQSSQCKTKRDSSVSYASAAA 287
           ++S+  +   D SVSY SA A
Sbjct: 301 QRSNLVEKPSDFSVSYVSAYA 321


>gi|402078969|gb|EJT74234.1| hypothetical protein GGTG_08077 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 332

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 141/333 (42%), Positives = 179/333 (53%), Gaps = 45/333 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYS 53
           M  +R ASHAG WYT     L  +LD +L        ++ LP     R +IAPHAGYSYS
Sbjct: 1   MSGVRTASHAGDWYTAEPDGLEHDLDRYLDAVPEQINDSDLPVK-GARVIIAPHAGYSYS 59

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G  AA+A+  +D  +  RVF+LGPSH YY    AL+T   Y TP+GDL +D++ +  L+A
Sbjct: 60  GPCAAWAYKALDLGAAKRVFILGPSHTYYLSGAALTTYAAYATPLGDLRVDVDTVAALRA 119

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL---------IKIVPILVGAVNAENEAM 164
           TG+F  +    DE EHS+EMHLPYLAK                 +VPI+VG    E E  
Sbjct: 120 TGRFSDVPRQRDEDEHSLEMHLPYLAKRLAQTFGPQDTPAAWPPVVPIVVGDNGREAERA 179

Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---DKKHGV---------------IH 206
            G L   Y+ D  + F VSSDFCHWGSRF+Y  Y     + GV               IH
Sbjct: 180 LGELLVPYLRDRDSAFIVSSDFCHWGSRFSYTAYTPDGTEAGVRSLTRRDAAPDLPVPIH 239

Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC------STKIK 260
           +SI ALD M M  IE+G  DAF   L    NT+CGRHPI V +  L         S+ +K
Sbjct: 240 ESISALDHMAMGAIESGSHDAFFDNLKATGNTVCGRHPIGVVMAGLEALRKEKAESSGLK 299

Query: 261 ----IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
                 F++Y++SS     RDSSVSY SA A +
Sbjct: 300 GDGVFTFVKYDRSSLVNGLRDSSVSYGSAYAII 332


>gi|302409576|ref|XP_003002622.1| MEMO1 [Verticillium albo-atrum VaMs.102]
 gi|261358655|gb|EEY21083.1| MEMO1 [Verticillium albo-atrum VaMs.102]
          Length = 329

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 128/328 (39%), Positives = 174/328 (53%), Gaps = 42/328 (12%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYS 53
            ++R+   AGSWY  ++KRLAE+L+G+L +  +P          P  R +IAPHAGYSYS
Sbjct: 4   REVRKPGKAGSWYLADAKRLAEQLEGFLDD--VPSQINSSDVPIPGARVIIAPHAGYSYS 61

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G  AA+A+ ++D +   RVFLLGPSH +Y   CA++T   Y TP G++ +D EV+  L+A
Sbjct: 62  GPTAAWAYKSLDLSQTRRVFLLGPSHTFYLKGCAVTTFKHYGTPFGNIRVDEEVVSTLRA 121

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGR 167
                 M +  D  EHS+EMHLPYL  +F            +VPILVG      E   G 
Sbjct: 122 ELSIPDMPLSNDNKEHSLEMHLPYLWTMFTKAFDSPDEFPTLVPILVGDGTKTAERAVGA 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSI 209
               Y+ DP+N F VSSDFCHWG  F+Y  Y     V                  IH++I
Sbjct: 182 WLLPYLRDPTNAFVVSSDFCHWGDNFSYTPYSPDAQVDDTLQHISSRSRVPAGRPIHETI 241

Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST--------KIKI 261
           + LD   +  I+TG+   F   L    NT+CGRHPI V +  L   +         K + 
Sbjct: 242 KELDDQAIAAIKTGEYAQFYDNLSLTKNTVCGRHPIGVMMAALELLAEEEGRGGEDKGRF 301

Query: 262 KFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +F+RYE+S+  +   D SVSY SA A V
Sbjct: 302 QFVRYERSNLVEDADDMSVSYVSAYAVV 329


>gi|71747482|ref|XP_822796.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832464|gb|EAN77968.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 323

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 127/316 (40%), Positives = 183/316 (57%), Gaps = 27/316 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG---LPKSPEVRGVIAPHAGYSYSGRAA 57
           M   RRA+HAGSWY  +   L   +D     A    L +S ++ GVI+PHAG SYSG  A
Sbjct: 1   MTYSRRATHAGSWYEGSPNALKALVDTLFSTASKSPLKESEKMIGVISPHAGISYSGNTA 60

Query: 58  AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL----E 106
           ++ + ++          SI+R+FLLGPSHH       +  A  Y+TP G L ++     E
Sbjct: 61  SHVYVHLRDYIYGHKGRSITRIFLLGPSHHKGFDGVEVCAAQRYETPFGPLVVNAKVGQE 120

Query: 107 VIEELKATG-KFELMDICVDEAEHSMEMHLPYLAKVF----EGH-----LIKIVPILVGA 156
           V +EL+A G     M    DE EHS+EM LP+++ +      G+      +++VP+L+G 
Sbjct: 121 VEKELRAAGVPVGTMHRMTDEDEHSIEMQLPFISHLLHYPPNGYKPAMDRVELVPLLIGG 180

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDK 214
            N + E + G + +KY+ D  NFF +SSDFCHWG+RF YM HY+K ++  I  +I ++D 
Sbjct: 181 TNRKMENLIGSVLSKYLKDNQNFFVISSDFCHWGARFQYMYHYEKAEYPDIGDAIISMDH 240

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
            GM ++E  D D + KYL   +NTICGR PISV +  L +    + ++FL Y QS++CK 
Sbjct: 241 EGMRLLEARDMDGWYKYLSTTNNTICGRRPISVLMAALDSKKEAV-VRFLHYSQSNRCKN 299

Query: 275 KRDSSVSYASAAAKVD 290
             DSSVSYA A   +D
Sbjct: 300 MSDSSVSYAGAIVTLD 315


>gi|342878232|gb|EGU79587.1| hypothetical protein FOXB_09870 [Fusarium oxysporum Fo5176]
          Length = 324

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 125/324 (38%), Positives = 172/324 (53%), Gaps = 39/324 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
           M   R AS AGSWY  + K L  EL  +L  A +P+S        P  R ++APHAGY +
Sbjct: 1   MTTTRPASKAGSWYEGDQKLLRRELQNYL--AAVPESFDGVALPIPGARVIVAPHAGYKF 58

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           SG  AA+A+  +D +   RV +LGPSH YY   CA +T   Y TP GDL +D E+  EL+
Sbjct: 59  SGPCAAWAYKTLDLSRAKRVIVLGPSHTYYLEGCAATTFEKYATPFGDLEIDTELATELE 118

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYL----AKVFEG--HLIKIVPILVGAVNAENEAMYG 166
                E M    +  EHS+EMH+PYL     + F+      KIVP+LVG+ N + E + G
Sbjct: 119 DAVAMEPMSRRGEVNEHSLEMHMPYLYLRCQETFDSPDKFPKIVPVLVGSNNRQEEKVIG 178

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD----------------KKHG-VIHKSI 209
           R    Y+ DP N F +SSDFCHWG  F+Y+ Y                 K +G  IH++I
Sbjct: 179 RALLPYLRDPENVFIISSDFCHWGDNFSYLPYSSTKSPSDLTQLQKESPKPNGPPIHETI 238

Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV------FLHMLGNCSTKIKIKF 263
             +D+  MD +++G  DAF   L +  NT+CGRHPI V      F+        K +   
Sbjct: 239 RVIDEAAMDAVKSGSHDAFLDSLKQTRNTVCGRHPIGVVMAALEFIRQEDAFQDKGRFSI 298

Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
           ++Y++S+  +   D SVSY S  A
Sbjct: 299 IKYDRSNLVEYPNDMSVSYVSGYA 322


>gi|261332591|emb|CBH15586.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 323

 Score =  213 bits (542), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 127/316 (40%), Positives = 183/316 (57%), Gaps = 27/316 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG---LPKSPEVRGVIAPHAGYSYSGRAA 57
           M   RRA+HAGSWY  +   L   +D     A    L +S ++ GVI+PHAG SYSG  A
Sbjct: 1   MTYSRRATHAGSWYEGSPNALKALVDTLFSTASKSPLKESEKMIGVISPHAGISYSGNTA 60

Query: 58  AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL----E 106
           ++ + ++          SI+R+FLLGPSHH       +  A  Y+TP G L ++     E
Sbjct: 61  SHVYVHLRDYIYGHKGRSITRIFLLGPSHHKGFDGVEVCAAQRYETPFGPLVVNAKVGQE 120

Query: 107 VIEELKATG-KFELMDICVDEAEHSMEMHLPYLAKVF----EGH-----LIKIVPILVGA 156
           V +EL+A G     M    DE EHS+EM LP+++ +      G+      +++VP+L+G 
Sbjct: 121 VEKELRAAGVPVGTMHRMTDEDEHSIEMQLPFISHLLHYPPNGYKPAMDRVELVPLLIGG 180

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDK 214
            N + E + G + +KY+ D  NFF +SSDFCHWG+RF YM HY+K ++  I  +I ++D 
Sbjct: 181 TNRKMENLIGSVLSKYLKDNQNFFVISSDFCHWGARFQYMYHYEKAEYPDIGDAIISMDH 240

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
            GM ++E  D D + KYL   +NTICGR PISV +  L +    + ++FL Y QS++CK 
Sbjct: 241 EGMRLLEARDMDGWYKYLSTTNNTICGRRPISVLMAALDSKKEAV-VRFLHYSQSNRCKN 299

Query: 275 KRDSSVSYASAAAKVD 290
             DSSVSYA A   +D
Sbjct: 300 MSDSSVSYAGAIVTLD 315


>gi|378726099|gb|EHY52558.1| hypothetical protein HMPREF1120_00769 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 388

 Score =  213 bits (542), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 139/384 (36%), Positives = 182/384 (47%), Gaps = 100/384 (26%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPK--SPEVRGVIAPHAGYSYSG 54
           +R ASHAGSWY+DNS++LA +LD WL          G+ K   P  R VIAPHAGY+YSG
Sbjct: 3   VREASHAGSWYSDNSRQLASQLDLWLSRVPEKDILPGIEKLPVPGARAVIAPHAGYAYSG 62

Query: 55  RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELK 112
             AA+A+  +D +   R+F+L PSHH +    AL     Y+TP+ D  LPLD E I EL 
Sbjct: 63  PCAAWAYKCLDLSQAKRIFILHPSHHRHLRTAALPVVDAYETPLSDQPLPLDRETIHELS 122

Query: 113 A---------TGKFELMDICVDEAEHSMEMHLPYLAKVF--------EGHLIKIVPILVG 155
           +         T KF  M   VDEAEHS EM LPY+ ++         E     +VPI+VG
Sbjct: 123 SLSATNENGETVKFTTMSKSVDEAEHSAEMQLPYIHRLLQKLYPGQPESSYPPLVPIMVG 182

Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------- 198
             +   E   G++ A Y+ D  N F +SSDFCHWGSRF Y +Y                 
Sbjct: 183 GTSVTTEQTLGKMLAPYIADEQNAFVISSDFCHWGSRFGYTYYVPDAPSPALSPLMLPNG 242

Query: 199 -------------DKKHGV---------------------IHKSIEALDKMGMDIIETGD 224
                        +  H V                     I++SI  +D+  M  I TG 
Sbjct: 243 VRGEFTTAKESTNEDVHKVPTLGQGEELRSNTRIPKGGPQIYESIAHVDRACMCAIATGS 302

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCS---------------------TKIKIKF 263
              F + L    NT+CGRHPI VF+  +                         T+ K +F
Sbjct: 303 HSEFCRVLRITGNTVCGRHPIGVFMAGVEEVERNKRDKGEHAKADTDPDTTDPTRGKFRF 362

Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
           +RYE+SS   + +DSSVSY SA A
Sbjct: 363 MRYERSSDVISPKDSSVSYVSAFA 386


>gi|322711962|gb|EFZ03535.1| hypothetical protein MAA_00609 [Metarhizium anisopliae ARSEF 23]
          Length = 323

 Score =  213 bits (542), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 123/321 (38%), Positives = 173/321 (53%), Gaps = 39/321 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGR 55
           +R A  AGSWY  +   L  ELD +L +  +P++        P  R +IAPHAGY++SG 
Sbjct: 3   LRTAGKAGSWYVASPDDLGSELDEYLSD--VPETVDDSTLPIPGARIIIAPHAGYTFSGP 60

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            AA+A+  +D +   RVF+LGPSH YY   CA++T + Y TP G+L +D ++I+ +K   
Sbjct: 61  CAAWAYKCLDLSKAKRVFVLGPSHTYYLDGCAVTTYSKYATPFGNLTVDRDIIQRVKEAA 120

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLF 169
           + + +    D AEHS+EMHLPYL K  E       +   IVPIL+G  N + E   G++ 
Sbjct: 121 QMDDIPPSRDSAEHSLEMHLPYLYKRCEQTFGSPENFPTIVPILIGDNNRDEEKAIGQVL 180

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSIEAL 212
             Y+ D  N F +SSDFCHWG+ F YM Y   +                   IH++I  L
Sbjct: 181 VPYLKDEENAFIISSDFCHWGAHFQYMVYSPDNDPAQLINLRRGDKAPAGPPIHETIRLL 240

Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS------TKIKIKFLRY 266
           DK  MD +E+G  D F   L    NT+CGRHPI V +  L   S       K + +  +Y
Sbjct: 241 DKAAMDAVESGHHDVFVDNLKLTKNTVCGRHPIGVAMAALELLSGEVADARKCRFRITQY 300

Query: 267 EQSSQCKTKRDSSVSYASAAA 287
           ++S+  +   D SVSY SA A
Sbjct: 301 QRSNLVEKPSDFSVSYVSAYA 321


>gi|340518150|gb|EGR48392.1| predicted protein [Trichoderma reesei QM6a]
          Length = 323

 Score =  213 bits (541), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 175/320 (54%), Gaps = 39/320 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
           R AS AGSWY     +L + LD +L  A +P++        P  R +IAPHAGY+YSG  
Sbjct: 4   RPASKAGSWYEKTPAKLMKTLDNYL--ADVPETVDGSSLPIPGARVIIAPHAGYAYSGPN 61

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+  +D +   RVF+LGPSH Y    CA++T + + TP GD  +D +V+E +K  G 
Sbjct: 62  AAWAYSCLDLSKAKRVFILGPSHTYGLSGCAVTTFSKWATPFGDFTVDRDVLERVKEAGG 121

Query: 117 FELMDICVDEAEHSMEMHLPYLAK----VFEG--HLIKIVPILVGAVNAENEAMYGRLFA 170
              +    D AEHS+EMHLPYL K     FE       IVPI+VG+++   E   GR+  
Sbjct: 122 MGDVPPENDVAEHSLEMHLPYLYKRCQQTFESPEDFPTIVPIIVGSLSRSREKDVGRVLL 181

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALD 213
            Y+ D  N F +SSDFCHWG+RF Y  Y     +                 I+++I  LD
Sbjct: 182 SYLKDEENAFIISSDFCHWGTRFGYAVYSPNGDIQRLTTLHDYSPKPSGPPIYETIRLLD 241

Query: 214 KMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLG------NCSTKIKIKFLRYE 267
           +  MD +++G  DAF + L    NT+CGRHPI V +  L       N   K + K ++Y+
Sbjct: 242 EAAMDAVKSGSHDAFVENLRHTGNTVCGRHPIGVAMAALELYAKELNEEDKSRFKIVKYD 301

Query: 268 QSSQCKTKRDSSVSYASAAA 287
           +SS+     DSSVSY SA A
Sbjct: 302 RSSEVVWPDDSSVSYVSAYA 321


>gi|358381478|gb|EHK19153.1| hypothetical protein TRIVIDRAFT_69842 [Trichoderma virens Gv29-8]
          Length = 323

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 128/320 (40%), Positives = 174/320 (54%), Gaps = 39/320 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
           R A+ AGSWY  + + LA ELD +L  A +P+         P  R +IAPHAGY+YSG  
Sbjct: 4   RPATKAGSWYEKDPEILAGELDEYL--AKVPEDVDGSSLPIPGARIIIAPHAGYAYSGPN 61

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+  +D +   RVF+LGPSH Y    CA++T + Y TP GD  +D + IE +K  G 
Sbjct: 62  AAWAYSCLDLSKAKRVFILGPSHTYGFSGCAVTTFSHYSTPFGDFTVDRDTIERVKEAGG 121

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLFA 170
            +      D AEHS+EMHLPYL K  E          K+VPI+VG+ +  +E   GRL  
Sbjct: 122 MKNAPPRNDVAEHSLEMHLPYLYKRCEQTFKSPEEFPKVVPIIVGSTSRADEKDIGRLLL 181

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALD 213
            Y+ D  N F +SSDFCHWGSRF+Y  Y     +                 I+++I  LD
Sbjct: 182 SYLKDEENAFIISSDFCHWGSRFDYAVYTPDGNIGKLSSLHDRSPKPSGPPIYETIRLLD 241

Query: 214 KMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLG------NCSTKIKIKFLRYE 267
           +  MD +++G  DAF   L    NT+CGRHPI + +  L       N     + K ++Y+
Sbjct: 242 EAAMDAVKSGSHDAFVDNLRLTGNTVCGRHPIGIAMAALELYAKELNDEGMSRFKVVKYD 301

Query: 268 QSSQCKTKRDSSVSYASAAA 287
           +SS+     DSSVSY SA A
Sbjct: 302 RSSEVIWPDDSSVSYVSAYA 321


>gi|326482860|gb|EGE06870.1| DUF52 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 377

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 132/375 (35%), Positives = 182/375 (48%), Gaps = 88/375 (23%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYS 53
           M  IR  +HAGSWY+D+   L  +LD WL        GL K   P  R +IAPHAGY+YS
Sbjct: 1   MMSIRSHTHAGSWYSDHGPTLEAQLDQWLDLVPGELEGLGKLPVPGARVIIAPHAGYAYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL 111
           G  AA+A+  +D +   R+FL+GPSHH +    A+   T Y TP+    LPLD E++ ++
Sbjct: 61  GPCAAFAYKCLDLSKAKRIFLIGPSHHLHLTTIAVPQLTEYLTPLSPMPLPLDTEMLAKI 120

Query: 112 K-----ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
           +     A   F  M   VDE EHSME+HLPY+ ++ +             +VP++VG+ +
Sbjct: 121 EHEASLADRPFSKMSRSVDEQEHSMELHLPYIHRLLQRLYPTMPTSAYPPLVPMMVGSTS 180

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------- 203
           A NE ++G +   Y+ D  N F +SSDFCHWGSRF Y +Y +  G               
Sbjct: 181 AANERVFGSILRPYLADEENAFVISSDFCHWGSRFAYTYYVQAPGPTTYKLPLTYPSLPV 240

Query: 204 -------------------------------------VIHKSIEALDKMGMDIIETGDPD 226
                                                 IH+S+ A D   M  I +G+ D
Sbjct: 241 PSDMLTADGAAEAVRAVSSGRMLRDNDKSYNKEKYMWAIHESVSACDMACMSAIASGNTD 300

Query: 227 AFKKYLLEYDNTICGRHPISVFL--------------HMLGNCSTKIKIKFLRYEQSSQC 272
            F++ L    NT+CGRHPI V +               M  +   K K  F+RYE+SS C
Sbjct: 301 VFQEALRRTGNTVCGRHPIGVVMAGIEGLQLQHQAEAAMTQDVVKKGKFYFIRYERSSDC 360

Query: 273 KTKRDSSVSYASAAA 287
            + +DSSVSY SA A
Sbjct: 361 FSVQDSSVSYVSAFA 375


>gi|146422342|ref|XP_001487111.1| hypothetical protein PGUG_00488 [Meyerozyma guilliermondii ATCC
           6260]
 gi|146388232|gb|EDK36390.1| hypothetical protein PGUG_00488 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 328

 Score =  211 bits (537), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 125/320 (39%), Positives = 170/320 (53%), Gaps = 41/320 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA--------GLPKSPEVRGVIAPHAGYSYSGR 55
           +R A+HAGSWY+ N+  LA +++ ++ +A        G P  P  R +I PHAGY+YSG 
Sbjct: 3   VRPATHAGSWYSANNASLASQMERFISKAQNNLKKSHGGPHVPGARVLIGPHAGYTYSGT 62

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
             A  +   D T + RVF+LGPSHH Y++    +S    Y+TP G+L +D +V  EL   
Sbjct: 63  QLAETYEAWDTTGVKRVFILGPSHHVYFSSTAKVSKFDSYQTPFGNLDVDTKVCSELVDK 122

Query: 115 GKFELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGRL--- 168
           G F  M    DE EHS EMH P++    K       KIVPI++ A+N   E +Y ++   
Sbjct: 123 GAFSYMTEEEDENEHSFEMHAPFIRYKTKDLPHGSPKIVPIMISAMN---ERLYNKIVKA 179

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK----HGV-------------------I 205
              Y  D SN F+VSSDFCHWG+RF Y  Y +K     G+                   I
Sbjct: 180 LEPYFADKSNTFAVSSDFCHWGARFGYTKYLQKIPDSEGITSQSLVSLKSSSQLVQSIPI 239

Query: 206 HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLR 265
           H+SIE LDK  M I   G    + +Y+ E  NTICG+ PISV L +L N        ++ 
Sbjct: 240 HRSIEILDKEAMKIASKGTHTDWNRYIDETQNTICGQKPISVVLRLLDNSGAHPGFSWIG 299

Query: 266 YEQSSQCKTKRDSSVSYASA 285
           Y QS +  +  +SSVSYAS 
Sbjct: 300 YSQSGEVTSPTESSVSYASG 319


>gi|302923736|ref|XP_003053739.1| hypothetical protein NECHADRAFT_98907 [Nectria haematococca mpVI
           77-13-4]
 gi|256734680|gb|EEU48026.1| hypothetical protein NECHADRAFT_98907 [Nectria haematococca mpVI
           77-13-4]
          Length = 324

 Score =  211 bits (536), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 173/324 (53%), Gaps = 39/324 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
           M   R A+ AGSWY  N+K L  EL   L  A +P+S        P  R +IAPHAGY +
Sbjct: 1   MVATRPAAKAGSWYKSNAKTLQAELLNNL--AAVPESLDGASLPIPGARIIIAPHAGYEF 58

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           SG  AA+A+  +D +   RVF+LGPSH YY   CA +T   Y+TP G+L +D  + ++ +
Sbjct: 59  SGPCAAWAYKTLDLSRAKRVFVLGPSHTYYLEGCAATTFDKYETPFGELTVDRALAKKFE 118

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYL----AKVF--EGHLIKIVPILVGAVNAENEAMYG 166
                +LM    +  EHS+EMH+PYL     + F       KIVP+LVG+ N   E   G
Sbjct: 119 DAASMDLMPQRNEVLEHSLEMHMPYLYLRCQQTFGSPDKFPKIVPVLVGSNNGPEEKEVG 178

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSI 209
           R    Y+ DP N F +SSDFCHWG  F+YM Y                       IH++I
Sbjct: 179 RALLPYLKDPENAFIISSDFCHWGHNFSYMVYSPTSSPGDLVKLRRQDRAPAGPPIHETI 238

Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVF---LHMLG---NCSTKIKIKF 263
            A+D+  MD +++G  DAF   L +  NT+CGRHPI V    L +LG       K + + 
Sbjct: 239 RAIDEAAMDGVKSGSHDAFLATLKQTKNTVCGRHPIGVMMAALELLGQEPQYQGKGRFQV 298

Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
           ++Y++S+  +   D SVSY SA A
Sbjct: 299 IQYDRSNLVQEPDDMSVSYVSAYA 322


>gi|116200592|ref|XP_001226108.1| hypothetical protein CHGG_10841 [Chaetomium globosum CBS 148.51]
 gi|88175555|gb|EAQ83023.1| hypothetical protein CHGG_10841 [Chaetomium globosum CBS 148.51]
          Length = 298

 Score =  210 bits (535), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 164/324 (50%), Gaps = 68/324 (20%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSW  ++                             HAGYSYSG  AA+A+  +
Sbjct: 4   REASHAGSWMRNS-----------------------------HAGYSYSGPCAAWAYKAL 34

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           D  S  RVF+LGPSH YY   CAL++   Y+TP GDL +D     EL+ T +F  M    
Sbjct: 35  DLRSAKRVFILGPSHTYYLRGCALTSFDKYETPFGDLVVDNATTSELRQTNRFSEMPKRS 94

Query: 125 DEAEHSMEMHLPYLAKVFE-----GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           D +EHS+EMHLPYL K  E     G    I+PILVG  + + E  +G+L +KY+ DP + 
Sbjct: 95  DVSEHSLEMHLPYLRKRLEQTFGDGDYPPIIPILVGDASGDEEKSWGQLLSKYLKDPESA 154

Query: 180 FSVSSDFCHWGSRFNY---------MHYDKKHGV------------------IHKSIEAL 212
           + VSSDFCHWGSRF Y            D   G                   IH+ IE L
Sbjct: 155 WIVSSDFCHWGSRFTYRPKFRDGVIRDLDNDRGALQVTPDWSQVAEDPESPEIHEVIEVL 214

Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-------GNCSTKIKIKFLR 265
           D+M +D +E+G    F K + +  NT+CGRHPI V +  L       G      K  F++
Sbjct: 215 DQMAIDAVESGSHSEFSKVIRQTQNTVCGRHPIGVMMAALEAVAEERGLEGKNGKFSFVQ 274

Query: 266 YEQSSQCKTKRDSSVSYASAAAKV 289
           Y++S+  K  +D SVSYASA A V
Sbjct: 275 YQRSNLVKRDKDFSVSYASAYAVV 298


>gi|326474665|gb|EGD98674.1| hypothetical protein TESG_06154 [Trichophyton tonsurans CBS 112818]
          Length = 376

 Score =  209 bits (533), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 131/372 (35%), Positives = 180/372 (48%), Gaps = 88/372 (23%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYSGRA 56
           IR  +HAGSWY+D+   L  +LD WL        GL K   P  R +IAPHAGY+YSG  
Sbjct: 3   IRSHTHAGSWYSDHGPTLEAQLDQWLDLVPGELEGLGKLPVPGARVIIAPHAGYAYSGPC 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELK-- 112
           AA+A+   D +   R+FL+GPSHH +    A+   T Y TP+    LPLD E++ +++  
Sbjct: 63  AAFAYKCFDLSKAKRIFLIGPSHHLHLTTIAVPQLTEYLTPLSPMPLPLDTEMLAKIEHE 122

Query: 113 ---ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
              A   F  M   VDE EHSME+HLPY+ ++ +             +VP++VG+ +A N
Sbjct: 123 ASLADRPFSKMSRSVDEQEHSMELHLPYIHRLLQRLYPTMPTSAYPPLVPMMVGSTSAAN 182

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG------------------ 203
           E ++G +   Y+ D  N F +SSDFCHWGSRF Y +Y +  G                  
Sbjct: 183 ERVFGSILRPYLADEENAFVISSDFCHWGSRFAYTYYVQAPGPTTYKLPLTYPSLPVPSD 242

Query: 204 ----------------------------------VIHKSIEALDKMGMDIIETGDPDAFK 229
                                              IH+S+ A D   M  I +G+ D F+
Sbjct: 243 MLTADGAAEAVRAVSSGRMLRDNDKSYNKEKYMWAIHESVSACDMACMSAIASGNTDVFQ 302

Query: 230 KYLLEYDNTICGRHPISVFL--------------HMLGNCSTKIKIKFLRYEQSSQCKTK 275
           + L    NT+CGRHPI V +               M  +   K K  F+RYE+SS C + 
Sbjct: 303 EALRRTGNTVCGRHPIGVVMAGIEGLQQQHQAEAAMTQDVVKKGKFYFIRYERSSDCFSV 362

Query: 276 RDSSVSYASAAA 287
           +DSSVSY SA A
Sbjct: 363 QDSSVSYVSAFA 374


>gi|358253785|dbj|GAA53778.1| hypothetical protein CLF_111105, partial [Clonorchis sinensis]
          Length = 729

 Score =  209 bits (533), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 101/217 (46%), Positives = 142/217 (65%), Gaps = 6/217 (2%)

Query: 76  GPSHHYYTPK-CALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMH 134
           GPSHH    K CA+   ++Y+TP+ DL +D  + +E   TG+F       DEAEHS+EM 
Sbjct: 508 GPSHHVDIGKYCAIPEVSIYQTPLADLEVDHSICDEFAKTGRFVKFSKAQDEAEHSIEMQ 567

Query: 135 LPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFN 194
           LPY+AKV +G+   IVP++VG +  E EA YG + A Y+ DP   F +SSDFCHWG RF 
Sbjct: 568 LPYIAKVMQGNPFTIVPLVVGTLTPEIEAAYGEILAPYLADPETVFVISSDFCHWGRRFR 627

Query: 195 YMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN 254
           + +YD+  G I +SIE LD  GM+ IE+ +P+ F  YL +Y NTICGR  I+V L+ +  
Sbjct: 628 FQYYDQADGEIWQSIEKLDLQGMNAIESLNPEEFTSYLRKYGNTICGRRGINVLLNTIQK 687

Query: 255 CS-----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            +     +  +++FL+Y QSSQC++  DSSVSYA+A+
Sbjct: 688 MNRDRSKSGHELRFLKYAQSSQCRSLDDSSVSYAAAS 724


>gi|357622477|gb|EHJ73941.1| hypothetical protein KGM_12231 [Danaus plexippus]
          Length = 302

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 122/307 (39%), Positives = 160/307 (52%), Gaps = 75/307 (24%)

Query: 21  LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
           L+ +LD WL +A L   P  R +IAPHAGYSY G  AA+A+  + P  + R+F+LGPSHH
Sbjct: 26  LSRQLDLWLSKADLTHGP-ARAIIAPHAGYSYCGACAAFAYRQVSPLVVKRIFILGPSHH 84

Query: 81  YYTPKCALSTATVYKTPIGDLPLD------------------------------------ 104
                CALS+   Y TP+ DL +D                                    
Sbjct: 85  VRLRGCALSSLDKYATPLYDLTIDKQNYDRPDGRMAIPFHSLCGIRNLLLHTACWFRGVS 144

Query: 105 -LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAEN 161
            + +  EL+ATG+FE MD+  DE EHS+EMHLPY+AKV E       I+PILVG+++ + 
Sbjct: 145 GVTLYAELEATGQFEFMDVQTDEDEHSIEMHLPYIAKVMEEFKTAFTIIPILVGSLSPDR 204

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDII- 220
           EA YG + A Y+ DP N   VSSDFCHWGSRF Y   D     IH++IE LD    + + 
Sbjct: 205 EAKYGAILAPYLADPQNLVVVSSDFCHWGSRFRYTWRDGNRN-IHQAIEWLDHQACEKLR 263

Query: 221 -ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS 279
            ++G P                                ++ +KFL+Y QSSQC T RDSS
Sbjct: 264 CQSGAP--------------------------------QMSMKFLKYAQSSQCMTMRDSS 291

Query: 280 VSYASAA 286
           VSYASA+
Sbjct: 292 VSYASAS 298


>gi|71029084|ref|XP_764185.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351139|gb|EAN31902.1| hypothetical protein, conserved [Theileria parva]
          Length = 290

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 110/288 (38%), Positives = 168/288 (58%), Gaps = 2/288 (0%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R A+HAGSWY+++S  L  E+    +         ++ +IAPHAGY+YS + AA+A+  
Sbjct: 1   MRVATHAGSWYSNSSGALESEIKSSFQGLTNAAYKTLKYIIAPHAGYAYSLKTAAHAYSQ 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           ID  S   +F+LGPSHH++   CA+   +  +TP+G L +D++++++L     F +++  
Sbjct: 61  IDSASYKTIFVLGPSHHFFLRGCAVDRFSSLQTPLGPLQVDVDIVDKLSNLKGFSVINNE 120

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
             E EHS+EMHLP L  VF+   +K+VPI+VG  +             Y +D    F  S
Sbjct: 121 ASEDEHSIEMHLPLLRFVFKKEPVKVVPIMVGDFSESLADELTSALVPYFNDERTLFVFS 180

Query: 184 SDFCHWGSRFNY--MHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           SDFCH+GSRF +    Y+ ++  +++ IE LDK G+D+I     D F  YL E +NTICG
Sbjct: 181 SDFCHFGSRFQFSITGYESENKPLYEKIEMLDKRGIDLIVNHKHDDFLWYLTETENTICG 240

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           R+PI + L +L   +  +  + L Y QSS+     DSSVSYA+    V
Sbjct: 241 RNPILLLLRLLAASNLNVTSRLLHYSQSSRITRVSDSSVSYAAIVGLV 288


>gi|358390568|gb|EHK39973.1| hypothetical protein TRIATDRAFT_152882 [Trichoderma atroviride IMI
           206040]
          Length = 323

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 119/322 (36%), Positives = 171/322 (53%), Gaps = 39/322 (12%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSG 54
           + R A  AGSWY      L  ++D +L  A +P++        P  R +IAPHAGY YSG
Sbjct: 2   ETRLAKKAGSWYKAAEVDLMRDIDKFL--AKVPETVDDSQLPVPGARIIIAPHAGYEYSG 59

Query: 55  RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
             AA+A+  +D +   RVF+LGPSH Y    CA++T + Y TP GD  +D + IE +K  
Sbjct: 60  ETAAWAYSCLDLSRAKRVFILGPSHTYGLSGCAVTTFSQYATPFGDFTVDRDTIERVKEA 119

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRL 168
              + +    DEAEHS+EMHLP L K  +           +VPI+VG+ +  NE   GR+
Sbjct: 120 ASMQNVPRRNDEAEHSLEMHLPLLYKRCQQTFKSPEEFPMVVPIIVGSTSRANEKDIGRV 179

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEA 211
              Y+ D  N F +SSDFCHWG RF+Y  Y     +                 I++SI+ 
Sbjct: 180 LLSYIKDEENAFIISSDFCHWGERFDYTAYTANGDIDSLKHLQPYSSKPSGPPIYESIQL 239

Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLG------NCSTKIKIKFLR 265
           +D+  MD +++G  DAF   +    NT+CGRHPI + +  L       N   + + + ++
Sbjct: 240 VDEAAMDAVKSGSHDAFVDNIKRTGNTVCGRHPIGIAMAALELYAEEVNDENRSRFRVVK 299

Query: 266 YEQSSQCKTKRDSSVSYASAAA 287
           Y++SS+     DSSVSY SA A
Sbjct: 300 YDRSSEVVWPDDSSVSYVSAYA 321


>gi|380485591|emb|CCF39262.1| memo-like protein, partial [Colletotrichum higginsianum]
          Length = 322

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 117/313 (37%), Positives = 170/313 (54%), Gaps = 32/313 (10%)

Query: 8   SHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ++ G WY  ++K+L E+LDG+L +         LP  P  R +IAPHAGY+YSG  AA+A
Sbjct: 11  ANRGEWYLGDAKKLREQLDGFLADVPDQIDGNALP-VPGARVIIAPHAGYTYSGATAAWA 69

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  +D +   RVFLLGPSH +Y   CA++  + Y TP G+L +D +V++ L+       +
Sbjct: 70  YKALDLSKAKRVFLLGPSHTFYLADCAVTGYSNYGTPWGNLRVDRQVVDRLRDELDIPKI 129

Query: 121 DICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
               D+ EHS+EMHLPYL    E            +VPIL+G  +   E   G+  A+Y+
Sbjct: 130 PTANDDREHSLEMHLPYLWVRLEQTFGASPDAFPPVVPILIGDNDEAGEKAVGKWLAEYL 189

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALDKMG 216
            DP N F VSSDFCHWG  F+Y  Y     +                 IH++I+ +D + 
Sbjct: 190 RDPDNAFIVSSDFCHWGRHFDYTVYSPDGTLDSLRKLRPVGPAPEGPPIHETIKLVDDLA 249

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
           ++ I TG   +F   L +  NT+CGRHPI V +  L       + KF++Y++S+      
Sbjct: 250 IEAIGTGRHGSFYNNLKQTKNTVCGRHPIGVTMAALEELGGGHRFKFIQYQRSNLVTVPG 309

Query: 277 DSSVSYASAAAKV 289
           D+SVSY SA A V
Sbjct: 310 DTSVSYVSAYAVV 322


>gi|353229020|emb|CCD75191.1| hypothetical protein Smp_175570 [Schistosoma mansoni]
          Length = 215

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 99/212 (46%), Positives = 139/212 (65%), Gaps = 10/212 (4%)

Query: 85  KCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEG 144
           KCALSTA  ++TP+  L +   V  +L+ TG+F  + +  DE EHS+EM +PY+AK+ EG
Sbjct: 6   KCALSTADFFETPLYSLSI---VYRDLEETGEFVSLPLDRDEEEHSIEMQMPYIAKMMEG 62

Query: 145 HLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH 202
           +  K  +VPILVG +  E EA+YG++F++Y+ +P NFF +SSDFCHWG RF Y +YD+  
Sbjct: 63  YQGKFSVVPILVGYLTPEREAVYGQIFSRYLSNPENFFVISSDFCHWGKRFQYTYYDQSK 122

Query: 203 GVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-----ST 257
           G I +SI+ALD+ GM++IE   P  F  YL +Y NTICGRHPI V L ++        + 
Sbjct: 123 GAIWQSIQALDETGMELIERLAPSEFTSYLEQYGNTICGRHPIGVLLQIVAYLRRNMPNN 182

Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              +KF+RY QS  C     SSVSYA+   ++
Sbjct: 183 NFNMKFVRYAQSEHCNNMNQSSVSYAAGVLQI 214


>gi|393906536|gb|EJD74311.1| hypothetical protein, variant [Loa loa]
          Length = 221

 Score =  206 bits (525), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 106/215 (49%), Positives = 143/215 (66%), Gaps = 12/215 (5%)

Query: 86  CALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH 145
           CALST + Y+TPIGDL +D     ELK TG F LMD+  +EAEHS+EM LPY+AK+ E  
Sbjct: 6   CALSTCSRYRTPIGDLQIDQRTNTELKETGAFSLMDLRSEEAEHSIEMQLPYIAKIMEKQ 65

Query: 146 LIK---IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH 202
                 IVP+LVG+++   +A YG++F+KY+ DP   F VSSDFCHWGSRF++M +D   
Sbjct: 66  SANGYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVFVVSSDFCHWGSRFHFMPHDNTT 125

Query: 203 GV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH------MLGNC 255
           GV I++ I A+DK GMD I + +P AF +YL    NTICGR+PISV L+       + N 
Sbjct: 126 GVPIYEQIAAMDKQGMDAISSLNPVAFGEYLKRTQNTICGRNPISVILYAAEYFRQMNN- 184

Query: 256 STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
               +  FL+Y QS+Q ++  DSSVSYA+ A  ++
Sbjct: 185 -HLAEFVFLKYAQSNQSRSLNDSSVSYAAGALFIN 218


>gi|346972214|gb|EGY15666.1| MEMO1 protein [Verticillium dahliae VdLs.17]
          Length = 343

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 124/322 (38%), Positives = 168/322 (52%), Gaps = 42/322 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGR 55
           +R+   AGSWY  ++K+LAE+L+G+L +  +P          P  R +IAPHAGYSYSG 
Sbjct: 6   VRKPGKAGSWYLADAKQLAEQLEGFLDD--VPSQINSSDVPIPGARVIIAPHAGYSYSGP 63

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            AA+A+ ++D +   RVFLLGPSH +Y   CA++T   Y TP G++ +D EV   L+   
Sbjct: 64  TAAWAYKSLDLSQTKRVFLLGPSHTFYLKGCAVTTFKHYGTPFGNIRVDEEVTSTLRTAL 123

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVF------EGHLIKIVPILVGAVNAENEAMYGRLF 169
               M    D  EHS+EMHLPYL  +F            +VPILVG      E   G   
Sbjct: 124 SLPDMPPANDNKEHSLEMHLPYLWTMFAKTFGSPDAFPTLVPILVGDGTKTAERAVGAWL 183

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIEA 211
             Y+ DP+N F VSSDFCHWG  F+Y  Y     V                  IH++I+ 
Sbjct: 184 LPYLRDPANAFVVSSDFCHWGDNFSYTPYSPHAKVDGSLSHISSRSRVPAGRPIHETIKE 243

Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST--------KIKIKF 263
           LD   +  I+TG+   F   L    NT+CGRHPI V +  L   +         K + +F
Sbjct: 244 LDDQAIAAIKTGEYARFYDNLSLTKNTVCGRHPIGVMMAALELLAEEEGRGGEDKGRFQF 303

Query: 264 LRYEQSSQCKTKRDSSVSYASA 285
           +RYE+S+  +   D SVSY SA
Sbjct: 304 VRYERSNLVEDADDMSVSYVSA 325


>gi|154419148|ref|XP_001582591.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121916827|gb|EAY21605.1| hypothetical protein TVAG_013700 [Trichomonas vaginalis G3]
          Length = 292

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 112/294 (38%), Positives = 162/294 (55%), Gaps = 12/294 (4%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYA 60
           K+  A HAGSWY    + L E LD     A + +  +  V+ +IAPHAGY YS   A+YA
Sbjct: 4   KVVPAYHAGSWYPIG-QELKEMLDESFSNANVSQDKKGIVKAIIAPHAGYVYSVATASYA 62

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL--KATGKFE 118
           +  IDP++  RV +LGPSH  Y  KC ++ A   +TP G +P+D +  +EL  K    F+
Sbjct: 63  YKAIDPSNFDRVVILGPSHRIYVKKCTIAAADGCETPYGTVPIDRKAADELLQKYPDSFQ 122

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           ++ I     EHS+EM LP L  VF      ++PI++G +               + DP  
Sbjct: 123 VLSIDQSAKEHSLEMQLPLLKYVFGDKPFSVIPIMIGDLKEAQHKQVVEALTPIISDPKT 182

Query: 179 FFSVSSDFCHWGSRFNYMHYDK---KHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
              +SSDFCHWG+ F+Y +  K   K   ++K IE LDKM  + ++  DP  F KY+ E 
Sbjct: 183 LLVISSDFCHWGNNFDYFYLPKEIEKSEPVYKRIERLDKMAWEYVKDHDPKGFTKYISET 242

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +NTICG  PI++ + +L N       +F  Y QSS  K+K DSSVSY++   ++
Sbjct: 243 ENTICGYVPITMAMEILVNYDA----EFPHYSQSSHAKSKHDSSVSYSAGILRI 292


>gi|219119433|ref|XP_002180477.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407950|gb|EEC47885.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 356

 Score =  204 bits (520), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/325 (38%), Positives = 169/325 (52%), Gaps = 47/325 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IRRA HAGSWY ++   L   L  +L  A    S     +I PHAGYSYSG  AAYA+  
Sbjct: 21  IRRAHHAGSWYENDPVTLRTTLQQYLDHAASDSS-----LIVPHAGYSYSGPTAAYAYQP 75

Query: 64  IDPTSISRV-------FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL----- 111
           +    +SRV        +L PSHH Y   CA+S +    TP+G+L  D  + EEL     
Sbjct: 76  LF-QELSRVDCPIQILLVLHPSHHVYLDGCAISNSHTINTPVGNLATDDGIREELLLLNH 134

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--GHL--------IKIVPILVGAVNAEN 161
                F +M    DE EHS EM  PY+A + +  G L        I+++PI+ GA++ + 
Sbjct: 135 NNKSIFTVMSQKEDEEEHSGEMQYPYIAHILQACGKLHNNGSNKPIRVLPIMCGALSNQQ 194

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------IHKSIEALDKM 215
           EA YG L  + +        VSSDFCHWG RF Y     K         +H+ I++LD+ 
Sbjct: 195 EASYGHLLQRVIAREDVLTIVSSDFCHWGPRFRYQPIPTKEKSYKDSMPLHEFIKSLDRQ 254

Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI-------------KIK 262
           GMD IE   P AF  YL    NTICGRH I+V++  +    T I             +++
Sbjct: 255 GMDAIEAQQPGAFANYLARTRNTICGRHAIAVWMQAIAASETTIGNKDDTDPTGELLRVR 314

Query: 263 FLRYEQSSQCKTKRDSSVSYASAAA 287
           F+RY QSS  ++ RD+SVSYA+A A
Sbjct: 315 FVRYAQSSPAESLRDNSVSYAAALA 339


>gi|399217242|emb|CCF73929.1| unnamed protein product [Babesia microti strain RI]
          Length = 290

 Score =  203 bits (517), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 113/291 (38%), Positives = 155/291 (53%), Gaps = 10/291 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE----VRGVIAPHAGYSYSGRAAAY 59
           IR A+HAGSWY+     L   +      + L   PE    V+ +I PHAGY YS + A Y
Sbjct: 3   IRIATHAGSWYSSVGNVLKTRI-----SSELSTHPENEVLVKYIITPHAGYDYSLKTALY 57

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+  I     S++F+LGPSHH Y   C L     Y+TPIG+L +D E +  L A   F  
Sbjct: 58  AYSKIRTFKYSKIFILGPSHHVYFEGCGLDKCIKYETPIGNLDIDTETVTNLLANDHFMN 117

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           +     E EHS+EM LP L  + EG    KI+PI++G ++ ++         KY +D  N
Sbjct: 118 ISKGTSEDEHSIEMQLPILKLILEGFPDAKIIPIMIGDIDEQSTIGIANSLLKYFEDKDN 177

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDFCH+G+RF +         IH+++E LDK G+D+I   +   F  YL +  NT
Sbjct: 178 LFVISSDFCHFGARFGFYKTPYPGIPIHEAVEKLDKDGIDLILKHNYKGFISYLDDTKNT 237

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           ICGR+PI V L +L      I      Y QS+     RDSSVSYA+    +
Sbjct: 238 ICGRNPIQVLLKLLTFSKLNIFSNLESYAQSNPATNIRDSSVSYAAVVGTI 288


>gi|225680304|gb|EEH18588.1| DUF52 domain protein [Paracoccidioides brasiliensis Pb03]
          Length = 439

 Score =  203 bits (516), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 134/374 (35%), Positives = 178/374 (47%), Gaps = 91/374 (24%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+D+   L+++LD WL +        G    P  R +IAPHAGY+YSG  A
Sbjct: 64  REATHAGSWYSDHEPTLSKQLDQWLAQVPDELPDIGPLPIPGARIIIAPHAGYAYSGPCA 123

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL---- 111
           A+A+  +D ++  R+FL+GPSHH+     AL   T Y TP+    LPLD E+I  L    
Sbjct: 124 AFAYKTLDLSNAERIFLIGPSHHHILSTLALPQLTSYLTPLSREPLPLDTELIAHLLTST 183

Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
             +  +F  M   VD AEHS+E+HLPY+  +               +VPI+VG+     E
Sbjct: 184 NASNHRFTTMSPAVDSAEHSLELHLPYIHHLLRRLYPTKPTAQYPSLVPIMVGSTLPATE 243

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
           A +G + A Y+ DPSN F +SSDFCHWG RF + +Y                        
Sbjct: 244 AAFGSVLAPYLADPSNAFIISSDFCHWGLRFGHTYYVPNARTPAEPLPLSYPQLPLPSTS 303

Query: 199 ---------------------DK---KHGV-----IHKSIEALDKMGMDIIETGDPDAFK 229
                                DK   + GV     IH+SI A D   M  I +G+   F 
Sbjct: 304 DAGVAAEAIEAVSGGISLKVKDKDRVRKGVAGVPLIHESISACDVACMTAIASGETRRFL 363

Query: 230 KYLLEYDNTICGRHPISVFLHML----GNCS------------TKIKIKFLRYEQSSQCK 273
             L +  NT+CGRHPI V +  L    GN               K +  F+RYE+SS   
Sbjct: 364 DALRKTGNTVCGRHPIGVVIAGLEIVRGNKEGGGKVDAVDGEMQKGRFHFIRYERSSDVV 423

Query: 274 TKRDSSVSYASAAA 287
              DSSVSY SA A
Sbjct: 424 AVGDSSVSYVSAFA 437


>gi|391870565|gb|EIT79745.1| putative dioxygenase [Aspergillus oryzae 3.042]
          Length = 390

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 125/357 (35%), Positives = 169/357 (47%), Gaps = 84/357 (23%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
           R ASHAGSWY+DN + L  +LDGWL  A +P S         P  R +IAP+   + S  
Sbjct: 4   REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPYVPENISAP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKA 113
               ++   DP  I R+F+LGPSHH+Y    AL   T Y TP+ D  LPLD E+I +L +
Sbjct: 62  CTVNSYA--DPLDIKRIFILGPSHHHYLSTLALPELTSYYTPLSDEPLPLDTELITKLLS 119

Query: 114 TG---------KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--------IVPILVGA 156
           T           F  M   +DE EHS+E+HLPY+ ++ + H           +VPI+VG+
Sbjct: 120 TKAVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTSQYPPLVPIMVGS 179

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------ 198
            +A  E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y                  
Sbjct: 180 TSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQAPKPGPKLPLSANSL 239

Query: 199 -----------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
                                          +   IH+SI A D   M  I TG    F 
Sbjct: 240 PQPGDDLSSVEEKMESVSTGQALQRRDRISSREPAIHESISAFDIATMAAISTGATANFL 299

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVS 281
           + +    NT+CGRHPI V +  +   ++     K K  F+RYE+SS      DSS S
Sbjct: 300 ETIQTTGNTVCGRHPIGVIMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSFS 356


>gi|254585889|ref|XP_002498512.1| ZYRO0G12034p [Zygosaccharomyces rouxii]
 gi|238941406|emb|CAR29579.1| ZYRO0G12034p [Zygosaccharomyces rouxii]
          Length = 319

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 120/319 (37%), Positives = 173/319 (54%), Gaps = 36/319 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A+HAGSWY+D+ + L  +L G+L   GL   P    +++PHAGY Y G+  A+A+ ++
Sbjct: 3   RPATHAGSWYSDDPRLLKSQLLGYLEATGLRGEPGANIIVSPHAGYRYCGKTMAHAYASL 62

Query: 65  DPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           D T++SRVF++GPSHH Y+  K  +S     +TP+G L +D E+ +EL  +  F  MD  
Sbjct: 63  DLTNVSRVFIMGPSHHVYFKNKVLVSGFDSLETPLGTLKIDRELGDELITSRHFAEMDPG 122

Query: 124 VDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           VD  EHS+EM    LA+  +        +K+VP++V   + E +   G +  KY+  P  
Sbjct: 123 VDMEEHSIEMQFSMLAQTLQWKNVPLDTVKVVPLMVSHNSKEVDWQIGNVLGKYL-GPGT 181

Query: 179 FFSVSSDFCHWGSRFNYMHY----DKKHGV----------------------IHKSIEAL 212
            F +SSDFCHWG RF Y  Y    D+ +                        I KSIE L
Sbjct: 182 IFIISSDFCHWGRRFGYTGYVGSQDELNDAIAQETEIEMLTSRSKLSHHQLDIWKSIELL 241

Query: 213 DKMGMDIIE-TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSS 270
           D+  MD+++  GD  D +KKYL    NTICG  PI++ L +L   +   + +F  Y QSS
Sbjct: 242 DRAAMDVLKLQGDKYDEWKKYLEVTGNTICGARPIALMLAILAQ-TRDARFQFPAYSQSS 300

Query: 271 QCKTKRDSSVSYASAAAKV 289
             +   DSSVSY S  A +
Sbjct: 301 HVRDLHDSSVSYGSGFATM 319


>gi|294910181|ref|XP_002777909.1| Protein C2orf4, putative [Perkinsus marinus ATCC 50983]
 gi|239885888|gb|EER09704.1| Protein C2orf4, putative [Perkinsus marinus ATCC 50983]
          Length = 341

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 115/282 (40%), Positives = 154/282 (54%), Gaps = 32/282 (11%)

Query: 37  SPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATV--Y 94
           S   + +I PHAGY Y G  AA A+  I   +++RVF+LGPSHH Y   CAL   ++  Y
Sbjct: 58  STTTKAIITPHAGYRYCGSVAANAYNTIS-DNVNRVFILGPSHHQYLDNCALPHPSIHQY 116

Query: 95  KTPIGDLPLDLEVIEELK-------ATGKF-ELMDICVDEAEHSMEMHLPYLAKVFEGHL 146
            TP G L LD +V+ EL+       + G+F   +    DE EHS+EM LP +    +  L
Sbjct: 117 DTPFGPLKLDEDVLGELRGLSKSSNSGGEFGTTLTKEEDEDEHSIEMQLPLIYHQLKHRL 176

Query: 147 ----IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH----- 197
               + IVPILVG ++   E + GRL AKY  DP   F +SSDFCHWG+RF Y       
Sbjct: 177 GIQDLTIVPILVGVLSPNVERVVGRLLAKYFKDPGTLFVISSDFCHWGTRFRYTQLQKDK 236

Query: 198 ---------YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVF 248
                    +D     I+  IEA+D+ GM++I   D + FK+YL    NTICGRHPISV 
Sbjct: 237 VKLAVKSIVFDPNTQPINAGIEAMDREGMELIVNQDSEGFKRYLQMTGNTICGRHPISVL 296

Query: 249 LHMLGNCSTKIKIKFLRYEQSS---QCKTKRDSSVSYASAAA 287
           L +L N   +  IKF+ Y QS+       + DS V+YA+   
Sbjct: 297 LEILRNVKEEYTIKFVDYNQSTLLPSVPCRNDSCVAYAAGVV 338


>gi|296004434|ref|XP_002808659.1| Memo-like protein [Plasmodium falciparum 3D7]
 gi|225631643|emb|CAX63929.1| Memo-like protein [Plasmodium falciparum 3D7]
          Length = 296

 Score =  201 bits (511), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 106/297 (35%), Positives = 164/297 (55%), Gaps = 8/297 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME  RRA H+GSWYT+  + L  +++   + A   K   V+  I PHAGY Y+    ++ 
Sbjct: 1   MENYRRAYHSGSWYTNKREVLKSKIEESFKRANAQKQ-NVKAAICPHAGYDYALETNSHV 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK---F 117
           + +ID  ++  +F+LGP+HH Y           Y+TP G L ++ ++I ++  +     +
Sbjct: 60  YASIDVENVKNIFILGPNHHIYNKGFLFPRVEKYETPFGFLQINKQIISDIIKSDTHNLY 119

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA--MYGRLFAKYVDD 175
             +D   DE EHS+EM LP +  + +   IKIVPI VG++  + +   ++     KY  D
Sbjct: 120 SFIDPDDDEEEHSIEMQLPLIKYIIKDKDIKIVPIYVGSIGNDLKKIDLFANPLKKYFQD 179

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHG--VIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
             N F  SSDFCH+G RF + +  +K+    I K IE +DK  ++II   D   F  YL 
Sbjct: 180 QHNLFLFSSDFCHYGPRFRFTNILQKYSDTFIFKQIENMDKDAVNIISHHDLTGFVDYLN 239

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           E  NTICG +PI + L++L + S  +  K + Y QS+  K++ DSSVSYA   + V+
Sbjct: 240 ETHNTICGSNPIKIMLNLLQHYSASVSTKLMHYSQSNHAKSRSDSSVSYAGVISTVN 296


>gi|46108134|ref|XP_381125.1| hypothetical protein FG00949.1 [Gibberella zeae PH-1]
          Length = 390

 Score =  201 bits (510), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 114/279 (40%), Positives = 152/279 (54%), Gaps = 33/279 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
           R A  AG WY   +  L  EL  ++  A +P+S        P  R VIAPHAGY YSG  
Sbjct: 4   RPAKKAGYWYKSEAAELKSELQQYM--AAVPESINGAPVPVPGARVVIAPHAGYEYSGPC 61

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+  +D +   RVF+LGPSH YY   CA +    Y TP GDL +D+++ +EL+    
Sbjct: 62  AAWAYKTLDLSCAKRVFVLGPSHTYYLEGCAATIFGKYATPFGDLEIDVDMAKELEDAIM 121

Query: 117 FELMDICVDEAEHSMEMHLPYL----AKVFEG--HLIKIVPILVGAVNAENEAMYGRLFA 170
            E M    +  EHS+EMH+PYL     + FE      KIVP+LVG+  A+ E + GR   
Sbjct: 122 MEKMPRQGEINEHSLEMHMPYLYLRCEETFETPDKFPKIVPVLVGSNTAKEEKVIGRALL 181

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSIEALD 213
            Y+ DP N F +SSDFCHWGS F+Y+ Y   +                   IH++I  +D
Sbjct: 182 PYLRDPENAFIISSDFCHWGSGFSYLPYSPTNSPSDLTQLKKRDPKPDGPPIHETIRVID 241

Query: 214 KMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           +  MD +ETG  DAF   L +  NT+CGRHPI V +  L
Sbjct: 242 QAAMDAVETGSHDAFISTLKQTRNTVCGRHPIGVTMAAL 280


>gi|149050674|gb|EDM02847.1| similar to RIKEN cDNA 0610016J10 gene, isoform CRA_a [Rattus
           norvegicus]
          Length = 180

 Score =  201 bits (510), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 98/178 (55%), Positives = 120/178 (67%), Gaps = 5/178 (2%)

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD 174
           FE M +  DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ 
Sbjct: 2   FERMSLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLA 61

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
           DPSN F VSSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +
Sbjct: 62  DPSNLFVVSSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKK 121

Query: 235 YDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           Y NTICGRHPI V L+    L      +   FL Y QSSQC++ +DSSVSYA+ A  V
Sbjct: 122 YHNTICGRHPIGVLLNAITELQKSGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 179


>gi|429853874|gb|ELA28918.1| duf52 domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 296

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 123/311 (39%), Positives = 161/311 (51%), Gaps = 47/311 (15%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           RR    G WYT +  RL+++LDG+L +        GLP  P  R +IAPHAGYSYSG  A
Sbjct: 7   RRPGKQGEWYTGDPTRLSKQLDGFLSDVPDQIDGDGLP-VPGARVIIAPHAGYSYSGPTA 65

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+  +D +   RVFLLGPSH YY                    L   V  E++A    
Sbjct: 66  AWAYKALDLSKAKRVFLLGPSHTYY--------------------LRGAVTAEIRARDGV 105

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF------EGHLIKIVPILVGAVNAENEAMYGRLFAK 171
             M +  D+ EHS+EMHLPYL K        E +   +VPILVG  N   E   G+  A 
Sbjct: 106 ADMPVRNDDKEHSLEMHLPYLFKRLQQTFGAEENFPTLVPILVGDNNKSEEKEVGKWLAP 165

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------IHKSIEALDKMGMDI 219
           Y+ DP N F VSSDFCHWGS F+Y  Y     V            IH++I+ +D + ++ 
Sbjct: 166 YLKDPENAFIVSSDFCHWGSHFDYTVYSPDGTVEGMQRLRGDSPPIHETIKMVDDLAIEA 225

Query: 220 IETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI-KFLRYEQSSQCKTKRDS 278
           I+TG    F   L +  NT+CGRHPI V +  L        + +F++Y++SS      DS
Sbjct: 226 IKTGKHSNFYDNLKQTKNTVCGRHPIGVTMAALEELGEGHPVFRFVQYQRSSMVTEPSDS 285

Query: 279 SVSYASAAAKV 289
           SVSY SA A V
Sbjct: 286 SVSYVSAYAVV 296


>gi|354548291|emb|CCE45027.1| hypothetical protein CPAR2_700310 [Candida parapsilosis]
          Length = 328

 Score =  200 bits (508), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 120/326 (36%), Positives = 173/326 (53%), Gaps = 40/326 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
           +R+A+HAGSWY+ + K L  ++   +  A    S  V G   ++ PHAGY+YSG     A
Sbjct: 3   VRQATHAGSWYSSDPKTLKAQVQTLIHVAQKSGSNGVSGARVLVGPHAGYTYSGERLGEA 62

Query: 61  FGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL------KA 113
           F   D +++ RVFLLGPSHH Y+  K  LS    Y+TP+GD+P+D E I +L      K 
Sbjct: 63  FNAWDTSTVKRVFLLGPSHHVYFKDKALLSPFDFYETPLGDIPVDRETINDLLKKRFKKK 122

Query: 114 TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
            G+  F+LM   +DE EHS EMH P++    +  +  + KIVPIL+  +++E +      
Sbjct: 123 HGQPVFKLMSEEIDEDEHSFEMHAPFIYHQGRKAKHGVPKIVPILISGMDSELQTELVDA 182

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV----------------------IH 206
              Y+ +  N F +SSDFCHWGSRF Y                               ++
Sbjct: 183 LYPYIQNDENHFIISSDFCHWGSRFGYTKVLTNKNTTLNTLETDLHSLRFSSAPNDTPVY 242

Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST---KIKIKF 263
           +SIE LD+M + I   G    +K+Y+    NTICG+ PI+V L +L    +   +    +
Sbjct: 243 ESIEWLDRMALKIASKGSSAEWKEYIRISGNTICGQKPIAVILQLLEQYRSNDGQDVFNW 302

Query: 264 LRYEQSSQCKTKRDSSVSYASAAAKV 289
           L Y QS+  K+ RDSSVSYAS   K+
Sbjct: 303 LGYSQSNPAKSSRDSSVSYASGYVKL 328


>gi|295657135|ref|XP_002789140.1| DUF52 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226284526|gb|EEH40092.1| DUF52 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 379

 Score =  200 bits (508), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 133/374 (35%), Positives = 177/374 (47%), Gaps = 91/374 (24%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+D+   L+++LD WL +        G    P  R +IAPHAGY+YSG  A
Sbjct: 4   REATHAGSWYSDHEPTLSKQLDQWLAQVPDELPDIGPLPIPGARIIIAPHAGYAYSGPCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL---- 111
           A+A+  +D ++  R+FL+GPSH++     AL   T Y TP+    LPLD E+I  L    
Sbjct: 64  AFAYKTLDLSNAERIFLIGPSHNHILSTLALPQLTCYLTPLSREPLPLDTELIAHLLTST 123

Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
             +  +F  M   VD AEHS+E+HLPY+  +               +VPI+VG+     E
Sbjct: 124 NASNHRFTTMSPAVDSAEHSLELHLPYIHHLLRRLYPTKPTAQYPPLVPIMVGSTLPAAE 183

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
           A +G + A Y+ DPSN F +SSDFCHWG RF + +Y                        
Sbjct: 184 AAFGSVLAPYLADPSNAFIISSDFCHWGLRFGHTYYVPNARKPAESLPLSYPQLPLPSTS 243

Query: 199 ---------------------DK---KHGV-----IHKSIEALDKMGMDIIETGDPDAFK 229
                                DK   + GV     IH+SI A D   M  I +G+   F 
Sbjct: 244 DAGVAAEAIEAVAGGISLKVKDKDRVRKGVAGVPLIHESISACDVACMTAIASGETRGFL 303

Query: 230 KYLLEYDNTICGRHPISVFLHML----------GNCST------KIKIKFLRYEQSSQCK 273
             L +  NTICGRHPI V +  L          G          K +  F+RYE+SS   
Sbjct: 304 DALRKTGNTICGRHPIGVVMAGLEIVRGSKEGGGKLDAVDGEMQKCRFHFIRYERSSDVV 363

Query: 274 TKRDSSVSYASAAA 287
              DSSVSY SA A
Sbjct: 364 AVGDSSVSYVSAFA 377


>gi|390474594|ref|XP_002757945.2| PREDICTED: protein MEMO1-like [Callithrix jacchus]
          Length = 180

 Score =  199 bits (507), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 98/178 (55%), Positives = 119/178 (66%), Gaps = 5/178 (2%)

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD 174
           FE M +  DE EHS+EMHLPY AK  E H     I+P+LVGA++   E  +G+LF+KY+ 
Sbjct: 2   FERMSLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLA 61

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
           DPSN F VSSDFCHWG RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +
Sbjct: 62  DPSNLFVVSSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKK 121

Query: 235 YDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           Y NTICGRHPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 122 YHNTICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 179


>gi|407834999|gb|EKF99095.1| hypothetical protein TCSYLVIO_010007 [Trypanosoma cruzi]
          Length = 315

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 118/307 (38%), Positives = 172/307 (56%), Gaps = 27/307 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGW---LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           RRASHA SWY  ++K L   +D     +++  L     + G+I+PHAG SYSG  AA+ +
Sbjct: 5   RRASHANSWYEGSAKVLKATVDDLFERVKDFQLGGDETLIGIISPHAGISYSGPTAAHVY 64

Query: 62  GNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI----EE 110
             +         ++++ +FLLGP HH       +S A  Y+TP G L +D  V+    + 
Sbjct: 65  KVLRNYIYGPKGSNVTHIFLLGPDHHKGFDGVEMSGAQQYETPFGALSIDTSVVSSVFQS 124

Query: 111 LKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE---------GHLIKIVPILVGAVNAE 160
           LKA G +   M    DE EHS+EM LP+++ +            + I++VP+LVG  + E
Sbjct: 125 LKAAGVQTTRMSQSTDEEEHSIEMQLPFISHILHHPPAGLTGAKNRIQLVPMLVGWTDRE 184

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNY-MHYDK-KHGVIHKSIEALDKMGMD 218
            E   G + + Y  D  N F +SSDFCHWGSRF Y  HY K ++  I  +I A+D  GM+
Sbjct: 185 TEERIGAVLSSYSQDSHNIFVLSSDFCHWGSRFQYKYHYQKSEYPAIADAIIAMDHEGME 244

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
            +E  D + + +YL +  NTICGR PIS+ +  + N      ++FL Y QS++C +  DS
Sbjct: 245 WLEKRDMNGWYEYLNKTHNTICGRRPISIGMEAVKN-DRSTTVRFLHYSQSNRCASLSDS 303

Query: 279 SVSYASA 285
           SVSYASA
Sbjct: 304 SVSYASA 310


>gi|254570361|ref|XP_002492290.1| Putative protein of unknown function [Komagataella pastoris GS115]
 gi|238032088|emb|CAY70010.1| Putative protein of unknown function [Komagataella pastoris GS115]
 gi|328353703|emb|CCA40101.1| Protein MEMO1 [Komagataella pastoris CBS 7435]
          Length = 314

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 115/317 (36%), Positives = 168/317 (52%), Gaps = 36/317 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAA 57
           M K+R A+HAGSWY+ N   L  +L  +L  A       V+G   ++ PHAGYS++G+  
Sbjct: 1   MVKVRPATHAGSWYSGNPNELNSQLSHYLSLAKSSGKTSVKGARVLVGPHAGYSFAGKTL 60

Query: 58  AYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           A  + + DPT I RVF+LGPSHH Y+  +   +    Y TP+G++P+D + I++L +  +
Sbjct: 61  AQTYNSFDPTGIKRVFILGPSHHIYFKDEVRTTRYGAYATPLGNVPVDTDTIKDLVSNAR 120

Query: 117 -FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL------- 168
             + M   VDE EHS E+H+P     ++  L K +      +        G+L       
Sbjct: 121 HIDYMSSSVDENEHSFELHMPLY---YKACLDKGLSTPPPIIPILISGFPGQLADSLTST 177

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIE 210
              Y +D  N F VS+DFCHWG RF Y  Y     +                  I++SIE
Sbjct: 178 LQPYFEDKENAFFVSTDFCHWGDRFGYTSYTPNGDLESMDDITVAFNGKSNSLKIYESIE 237

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSS 270
           A+DK GM++I  GD   +++YL   +NTICG +P++V L ++G   T    ++L Y QSS
Sbjct: 238 AVDKKGMELISKGDVSLWRQYLQATENTICGAYPLTVLLKLMGKADT---FEWLGYTQSS 294

Query: 271 QCKTKRDSSVSYASAAA 287
                 DSSVSYAS  A
Sbjct: 295 HVLDPYDSSVSYASGYA 311


>gi|407397346|gb|EKF27707.1| hypothetical protein MOQ_008560 [Trypanosoma cruzi marinkellei]
          Length = 315

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 120/311 (38%), Positives = 170/311 (54%), Gaps = 27/311 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
           M   RRASHA SWY  ++K L   +D     A          + G+IAPHAG SYSG  A
Sbjct: 1   MAYSRRASHANSWYEGSAKVLKATVDELFERAKDFQWGGDETLIGIIAPHAGISYSGLTA 60

Query: 58  AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI-- 108
           A+ +  +          +++ +FLLGP HH       +S A  Y+TP G L +D  V+  
Sbjct: 61  AHVYKVLRNYIYGPKGGNVTHIFLLGPDHHKGFDGVEMSGAQQYETPFGALSIDTSVVSS 120

Query: 109 --EELKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE---------GHLIKIVPILVGA 156
             + LKA G +   M    DE EHS+EM LP+++ +            + I++VP+LVG 
Sbjct: 121 VFQSLKAAGVQTTRMSQSTDEEEHSIEMQLPFISHILHYPPAGLTGAKNRIQLVPMLVGW 180

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNY-MHYDK-KHGVIHKSIEALDK 214
            + E E   G + + Y  +P N F +SSDFCHWGSRF Y  HY K ++  I  +I A+D 
Sbjct: 181 TDRETEERIGAVLSSYSQNPHNIFVLSSDFCHWGSRFQYKYHYQKSEYPAIADAIIAMDH 240

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
            GM+ +E  D + +  YL +  NTICGR PIS+ +  + N  +   ++FL Y QS++C +
Sbjct: 241 EGMEWLEKRDMNGWYDYLSKTHNTICGRRPISIGMEAVKNDKSA-SVRFLHYSQSNRCAS 299

Query: 275 KRDSSVSYASA 285
             DSSVSYASA
Sbjct: 300 LSDSSVSYASA 310


>gi|448534928|ref|XP_003870861.1| hypothetical protein CORT_0G00430 [Candida orthopsilosis Co 90-125]
 gi|380355217|emb|CCG24733.1| hypothetical protein CORT_0G00430 [Candida orthopsilosis]
          Length = 328

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 118/326 (36%), Positives = 175/326 (53%), Gaps = 40/326 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
           +R+A+HAGSWY+ + + L  ++   +  A    +  V G   +I PHAGY+YSG     A
Sbjct: 3   VRQATHAGSWYSSDPRTLKLQIQTLISAAQKSGNKSVNGARILIGPHAGYTYSGERLGEA 62

Query: 61  FGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL------KA 113
           F   D +++ R+FLLGPSHH Y+  K  LS    Y+TP+GD+P+D E I++L      K 
Sbjct: 63  FNVWDTSTVKRIFLLGPSHHVYFKDKALLSPFDFYETPLGDIPIDRETIDDLLKKRFKKK 122

Query: 114 TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
            G+  F+LM   VDE EHS EMH P++    +  +  + KIVPIL+  ++++ ++     
Sbjct: 123 HGQTVFKLMSEDVDEDEHSFEMHAPFIYHQGQKSKHGVPKIVPILISGMDSDLQSELVDA 182

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------DKKHGVIH 206
              YV +  N F +SSDFCHWGSRF Y                           K   ++
Sbjct: 183 LFPYVQNEENHFIISSDFCHWGSRFGYTKVLTTKNATLDSLESDLYSLRFSSASKDIPVY 242

Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK---IKF 263
           +SIE LD+M + I   G    +K+Y+    NTICG+ PI+V L +L    +        +
Sbjct: 243 ESIEVLDRMALQIASKGSAAEWKEYIRISGNTICGQKPIAVILQLLEQYRSNKGEDVFNW 302

Query: 264 LRYEQSSQCKTKRDSSVSYASAAAKV 289
           L Y QS+  K+ +DSSVSYA+   K+
Sbjct: 303 LGYSQSNPAKSSKDSSVSYAAGYVKL 328


>gi|340057164|emb|CCC51506.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 314

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 118/310 (38%), Positives = 175/310 (56%), Gaps = 26/310 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR--GVIAPHAGYSYSGRAAA 58
           M   R ASHAGSWY  +S  L   +   L      K  + R  GVI+PHAG  YSG  A 
Sbjct: 1   MSYSRCASHAGSWYEGSSATLKSTVGALLAVVQDHKLGDERLVGVISPHAGIRYSGGTAG 60

Query: 59  YAFGNIDP-------TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI--- 108
           + F  +          +++R+FL+GPSH+       +  A  Y+TP G L +  +V+   
Sbjct: 61  HVFAAVRDYVYGPRGGALTRMFLIGPSHYKSFSGVEVCGAKEYETPFGPLSVSAQVLKQL 120

Query: 109 -EELKATG-KFELMDICVDEAEHSMEMHLPYLAKV-----FEGH----LIKIVPILVGAV 157
            EE ++ G     M    DE EHS+E+ LP+++ +     F G      +++VP+L+G  
Sbjct: 121 AEEFRSAGVPVGTMTRATDEEEHSIELQLPFISHILHFPCFGGSPAKDRVQLVPLLIGGT 180

Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDKM 215
           +A  +   G + AKY  DP+N F +SSDFCHWGSRF Y  H+++ K+  I  +I A+D  
Sbjct: 181 DAAMDKAIGNVLAKYTRDPANLFVISSDFCHWGSRFQYTYHFERSKYPAIGDAIVAMDHA 240

Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTK 275
           GM ++E  +  A+ KYL E +NTICGR PI+V +  L + S+   ++F+ Y QS++C++ 
Sbjct: 241 GMQLLEQRNLQAWYKYLEETNNTICGRRPITVAMAALHHHSSA-AVRFVHYSQSNRCQSL 299

Query: 276 RDSSVSYASA 285
            DSSVSYA A
Sbjct: 300 SDSSVSYAGA 309


>gi|71410526|ref|XP_807553.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871580|gb|EAN85702.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 315

 Score =  197 bits (501), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 118/307 (38%), Positives = 168/307 (54%), Gaps = 27/307 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           RRASHA SWY  ++K L   +D     A    L     + G+I+PHAG SYSG  AA+ +
Sbjct: 5   RRASHANSWYEGSAKVLKATVDDLFERAKDFQLGGDETLIGIISPHAGISYSGPTAAHVY 64

Query: 62  GNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI----EE 110
             +          +++ +FLLGP HH       +S A  Y+TP G L +D  V+    + 
Sbjct: 65  KALRNYIYGPKGNNVTHIFLLGPDHHKGFDGVEMSGAQQYETPFGALSIDTSVVSSVFQS 124

Query: 111 LKATG-KFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVNAE 160
           LKA G +   M    DE EHS+EM LP+++ +            + I++VP+LVG  + +
Sbjct: 125 LKAAGVQTTRMSQSTDEEEHSIEMQLPFISHILYHPPAGLTGAKNRIQLVPMLVGWTDRK 184

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNY-MHYDK-KHGVIHKSIEALDKMGMD 218
            E   G + + Y  D  N F +SSDFCHWGSRF Y  HY K ++  I  +I A+D  GM+
Sbjct: 185 TEERIGAVLSSYSQDSHNIFVLSSDFCHWGSRFQYKYHYQKSEYPAIADAIIAMDHEGME 244

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
            +E  D + +  YL +  NTICGR PIS+ +  + N      ++FL Y QS++C +  DS
Sbjct: 245 WLEKRDMNGWYDYLNKTHNTICGRRPISIGMEAVKN-DRSTTVRFLHYSQSNRCASLSDS 303

Query: 279 SVSYASA 285
           SVSYASA
Sbjct: 304 SVSYASA 310


>gi|154345628|ref|XP_001568751.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066093|emb|CAM43881.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 328

 Score =  196 bits (499), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 121/324 (37%), Positives = 168/324 (51%), Gaps = 39/324 (12%)

Query: 1   MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPEVRG------VIAPHAGY 50
           M  +R A+HA      WY    +RL   +D +LRE       E RG      +I PHAG 
Sbjct: 1   MHFVRSATHAAPHGCGWYEAIPERLKATIDDYLREVTHHYEEESRGGARMMGLIVPHAGM 60

Query: 51  SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           SYSGR A  AF              + + R+F+LGPSH      C LS A+ Y+TP G L
Sbjct: 61  SYSGRTACEAFAVFREYLYAKGSKGSKVERIFILGPSHTKGFEGCELSAASAYETPFGPL 120

Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
            +D   ++ +     KA           DEAEHS+EM  PYL+ +               
Sbjct: 121 KVDTATVDRVITALCKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPAATGTSTQPA 180

Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-K 201
              + IVPI+VG  N ++E     +   Y+DD  NFF  SSDFCHWG RF+Y  HY + +
Sbjct: 181 AARVSIVPIIVGWTNRQDEKAICDVLKPYMDDARNFFIFSSDFCHWGDRFSYTYHYQRSQ 240

Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
           +  I  SI A+D   M+++E  D + +  YL    NTICGR PIS+ +    + ++K++I
Sbjct: 241 YPNIGDSIIAMDHAAMELLERRDLEGWYGYLRTTKNTICGRAPISIGMQYWMDEASKVQI 300

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
           KF+ Y QS++C +  DSSVSYA+A
Sbjct: 301 KFVHYSQSNKCLSVEDSSVSYAAA 324


>gi|429329595|gb|AFZ81354.1| hypothetical protein BEWA_007630 [Babesia equi]
          Length = 287

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 101/273 (36%), Positives = 156/273 (57%), Gaps = 9/273 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RR +H+GSWY  + + L  E+   L         +++ +IAPHAGY+YS + AAY++  
Sbjct: 1   MRRPTHSGSWYPSSPRVLGSEIRLALENVSDLTHHKLKYIIAPHAGYAYSLKTAAYSYAQ 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           ID T + R+F+LGPSH+ Y   C +       TP+G L +D E+I++L     F  +   
Sbjct: 61  IDATQVKRIFILGPSHYLYLLGCGIDKFAKLDTPLGHLDVDTEIIDQLSKVEGFSTIKND 120

Query: 124 VDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
             E EHS+EMHLP +  +  E   IK+VPI+VG  N + +         Y +D SN F +
Sbjct: 121 CSEEEHSIEMHLPLVKYITNENKDIKVVPIIVGDFNNKLKDHIANTLLPYFNDESNLFII 180

Query: 183 SSDFCHWGSRFNYMH--YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SSDFCH+GSRF +    Y+K++  +H+SIE LD+  ++ I + D   F+ Y+ E  NTIC
Sbjct: 181 SSDFCHFGSRFQFTKTGYEKENRPLHESIEMLDRKSIEYITSHDLIGFENYIDETGNTIC 240

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
           GR+PI + L +       I++ F+ ++     K
Sbjct: 241 GRNPIQILLKV------SIQLPFMNFKDDISIK 267


>gi|340501035|gb|EGR27856.1| memo family protein, putative [Ichthyophthirius multifiliis]
          Length = 220

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 86/198 (43%), Positives = 129/198 (65%), Gaps = 2/198 (1%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           KIR ASHA SWYT N ++L +EL+ +L+  +  +    +++G+I PHAG+ YSG  AA++
Sbjct: 5   KIRLASHANSWYTGNKQKLDQELNEYLQNSQVEIQDIKQIKGIIGPHAGFYYSGPTAAWS 64

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  + P    RVFLLGP HH Y   C  S    Y TP+G + LD ++IE+LK T +F+++
Sbjct: 65  YKYLCPQDNLRVFLLGPCHHIYLNSCGTSDLDFYDTPLGSIRLDKQIIEQLKQTEQFQVL 124

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           +   +E EHS+EMHLPY+ K        ++PI+VG +N + E  YG++ +++ DD +  F
Sbjct: 125 NKSDEEDEHSLEMHLPYIQKQLGSKPFTLIPIMVGNINFQQEKQYGQILSQFFDDENTVF 184

Query: 181 SVSSDFCHWGSRFNYMHY 198
            +SSDFCHWGSR+ Y  Y
Sbjct: 185 IISSDFCHWGSRYIYQFY 202


>gi|157876688|ref|XP_001686688.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68129763|emb|CAJ09069.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 370

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 168/324 (51%), Gaps = 39/324 (12%)

Query: 1   MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGY 50
           M  +R A+HA      WY    +RL   +D +  EA    S E      + G+IAPHAG 
Sbjct: 43  MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYSDEEQECARMTGLIAPHAGM 102

Query: 51  SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           SYSGR AA AF              + + R+F+LGPSH      C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFREYLYAKGSKGSELERIFILGPSHTRGFEGCELSAASAYETPFGPL 162

Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
            +D  V++ +     KA           DEAEHS+EM  PYL+ +               
Sbjct: 163 RVDTAVVDRVITDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPVQPA 222

Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-- 201
              + IVPI+VG  N ++E     +   Y+DD  NFF  SSDFCHWG RF+Y ++ K+  
Sbjct: 223 AGRVAIVPIIVGWTNRQDEKAICDVLKPYMDDARNFFICSSDFCHWGERFSYTYHYKRSE 282

Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
           +  I  SI A+D   M+++E  D + +  YL    NTICGR PIS+ +    +   K ++
Sbjct: 283 YPNIGDSIIAMDHAAMELLEKRDLERWYAYLQMTKNTICGRAPISIGMQRWASKGNKARV 342

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
           KF+ Y QS++CK   DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366


>gi|68481987|ref|XP_715011.1| hypothetical protein CaO19.7210 [Candida albicans SC5314]
 gi|46436613|gb|EAK95972.1| hypothetical protein CaO19.7210 [Candida albicans SC5314]
 gi|238878225|gb|EEQ41863.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 335

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 122/335 (36%), Positives = 176/335 (52%), Gaps = 51/335 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------EVRGVIAPHAGYSY 52
           IR A+HAGSWY++N  +L  +LD +  +A   K+P             R +I PHAGY+Y
Sbjct: 3   IRPATHAGSWYSNNPTKLKNQLDQFFAKARSLKTPLSGEDASNGIDGARILIGPHAGYTY 62

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           SG   A +F   D + + RVFLLGPSHH Y+     LS    Y+TP G++P+D+  I+EL
Sbjct: 63  SGERLAESFNAWDTSKVKRVFLLGPSHHVYFKNHALLSPFEYYETPFGNIPVDVSTIKEL 122

Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAEN 161
                K    F+ M   +DE EHS EMH P++   ++G      L KI+PIL+  ++ + 
Sbjct: 123 LDEQCKHGPVFKYMSEEIDEEEHSFEMHAPFI--YYKGETAIHGLPKIIPILISGMDGKL 180

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------------DK 200
           +         Y+D+  N F +SSDFCHWGSRF Y  Y                      +
Sbjct: 181 QEGIVNSLLPYLDNEENHFIISSDFCHWGSRFGYTKYLSSGETTLDTLSDNITSLGSLHR 240

Query: 201 KHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GNC 255
             G+ I+KSIE LDK+ + I   G   A+ KY+    NTICG+ PI + L +L     N 
Sbjct: 241 PKGLQIYKSIELLDKIALQIATNGSSAAWSKYIDISGNTICGQKPILIVLKLLEKYNNNS 300

Query: 256 STKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           S++     ++ Y QS+  +   +SSVSYAS   ++
Sbjct: 301 SSRGNTFNWIGYSQSNPVQRTSESSVSYASGYVRL 335


>gi|342184221|emb|CCC93702.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 316

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 120/317 (37%), Positives = 177/317 (55%), Gaps = 29/317 (9%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
           M   RRA+HAGSWY  +S  L   ++G+     ++ + ++  + GVI+PHAG SYSG  A
Sbjct: 1   MAYSRRATHAGSWYEASSAALKSAIEGYFAVASKSSVNENESLIGVISPHAGISYSGMTA 60

Query: 58  AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI-- 108
           A+A+  +         + ++R+F+LGPSH  +     +  A  Y+TP G + ++  V   
Sbjct: 61  AHAYVKMRDYIYGPKGSCVTRIFVLGPSHVKWFDGVEVCEARQYETPFGPVCVNSAVAGG 120

Query: 109 --EELKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE---------GHLIKIVPILVGA 156
             + L++ G     MD   DE EHS+EM LP+LA V              I++VP+L+G 
Sbjct: 121 VGQALRSAGVPVGTMDRSTDEEEHSIEMQLPFLAYVLNYPPNGCPPAKDRIELVPLLIGD 180

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDK 214
                E   G + A++  D  N F +SSDFCHWGSRF Y  HY++ K+  I  +I A+D 
Sbjct: 181 TTRAMEEAIGVVLAEHFRDKQNLFVMSSDFCHWGSRFRYTYHYEREKYPEIGDAIIAMDH 240

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST-KIKIKFLRYEQSSQCK 273
            GM ++E  D + + +YL    NTICGR PISV +  L  C+  K K+ F+ Y QS++C+
Sbjct: 241 EGMRLLEGRDVEGWYRYLSATKNTICGRRPISVAVAAL--CANKKAKVSFVHYSQSNRCR 298

Query: 274 TKRDSSVSYASAAAKVD 290
              DSSVSYA+A    D
Sbjct: 299 NLTDSSVSYAAAVITED 315


>gi|367014857|ref|XP_003681928.1| hypothetical protein TDEL_0E04740 [Torulaspora delbrueckii]
 gi|359749589|emb|CCE92717.1| hypothetical protein TDEL_0E04740 [Torulaspora delbrueckii]
          Length = 332

 Score =  193 bits (491), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 119/329 (36%), Positives = 175/329 (53%), Gaps = 43/329 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR A+HAGSWY+++   L+ +L G+L + G       R +I+PHAGY Y G   AY++ +
Sbjct: 3   IRPATHAGSWYSNHGNELSTQLRGYLDDTGKSAVRSARLIISPHAGYRYCGPTMAYSYAS 62

Query: 64  ID-PTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK---FE 118
           +D  +++ R+F+LGPSHH Y+  +  ++     +TP+GDL +D  + ++L        F 
Sbjct: 63  LDIDSNVKRIFILGPSHHVYFRNQIFVTKFESLETPLGDLQVDQALCQKLMKLQDGHLFV 122

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGRLFAKYV 173
            MD   D  EHS+EM  P L +  +        +K+VP+LV   +AE +   G++   Y+
Sbjct: 123 PMDHDADMGEHSLEMQFPMLVQTLKWRNVPLESVKVVPMLVSHNSAEVDLAVGQVLKTYM 182

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHGV-IHK 207
           DDPSN F +SSDFCHWG RF Y  Y                            H + I +
Sbjct: 183 DDPSNLFIISSDFCHWGRRFAYTGYVGSEDELKDALQQETEIEMLTARSKLSHHQIDIWQ 242

Query: 208 SIEALDKMGMDII-ETGDP---DAFKKYLLEYDNTICGRHPISVFLHMLGN---CSTKIK 260
           SIE LDKM M ++ +T +    +A+K+YL    NTICG  PI+V L  L      S KI 
Sbjct: 243 SIEILDKMAMKVLSDTANEARYNAWKQYLDITGNTICGARPIAVILSTLSQIDEASHKID 302

Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            ++  Y QSS   +   SSVSYA+  A +
Sbjct: 303 FQWPNYSQSSHVNSLDHSSVSYAAGYATL 331


>gi|255711188|ref|XP_002551877.1| KLTH0B01980p [Lachancea thermotolerans]
 gi|238933255|emb|CAR21439.1| KLTH0B01980p [Lachancea thermotolerans CBS 6340]
          Length = 322

 Score =  193 bits (491), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 117/322 (36%), Positives = 170/322 (52%), Gaps = 38/322 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R ASHAGSWY+ NS +L  +L  +L + GLP +   R +I+PHAGY+Y G      +  
Sbjct: 3   VRPASHAGSWYSTNSGKLGLQLQQYLGQ-GLPVAG-ARVIISPHAGYAYCGATMGKVYAA 60

Query: 64  IDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +D T++ R+F+LGPSHH Y+  +  +S      TP+G LP+D+  ++EL     FE MD 
Sbjct: 61  LDLTNVKRIFVLGPSHHIYFKNRAHVSNFKEVATPLGSLPVDIATVKELVEDEAFEYMDP 120

Query: 123 CVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
            VD  EHS+EM  P L  +      +   + +VPI++   +  ++   G++ ++Y+ D S
Sbjct: 121 EVDVEEHSLEMQFPMLRALLKLRGVDTASVGVVPIMISHNSTRSDQHLGKVLSEYLADQS 180

Query: 178 NFFSVSSDFCHWGSRFNYMHY--DKK-----------------------HGV-IHKSIEA 211
             F +SSDFCHWG RF++  Y  DK                        H V I KSIE 
Sbjct: 181 TVFIISSDFCHWGRRFSFTGYVGDKSEIQEALAEDTEIESLTARSKLSHHQVPIWKSIEI 240

Query: 212 LDKMGMDIIETGDP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYE 267
           LD+  M I+   +P    + +K YL    NTICG   ISV L  L + +     ++  Y 
Sbjct: 241 LDQYAMSILSQTEPRDKYELWKHYLEVTGNTICGEKAISVMLCALSHQAEPAVFQWPAYA 300

Query: 268 QSSQCKTKRDSSVSYASAAAKV 289
           QSSQ  +  DSSVSY     K+
Sbjct: 301 QSSQVTSVMDSSVSYGGGYCKI 322


>gi|255728913|ref|XP_002549382.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240133698|gb|EER33254.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 335

 Score =  193 bits (490), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 112/291 (38%), Positives = 161/291 (55%), Gaps = 42/291 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------EVRGVIAPHAGYSY 52
           +R A+HAG+WYT++  +L ++LD +  +A L KSP             R +I PHAGY+Y
Sbjct: 3   VRSATHAGTWYTNSPTKLRKQLDEFFHKAKLLKSPINGDDASNGVEGARILIGPHAGYAY 62

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           SG   A  F   D T + RVFLLGPSHH Y+     LS+   Y+TP+GD+P+D E+I++L
Sbjct: 63  SGERLAETFNAWDTTKVKRVFLLGPSHHVYFKSYAMLSSFDYYETPLGDIPVDKEIIKDL 122

Query: 112 --KATGK---FELMDICVDEAEHSMEMHLPYLAKVFEGH---LIKIVPILVGAVNAENEA 163
             K   +   F+ M   +DE EHS EMH P++    E     + KI+PIL+  ++ + ++
Sbjct: 123 INKKCSRGPIFKYMSDEIDEDEHSFEMHAPFIRYKGESSPHGVPKIIPILISGMDEKLQS 182

Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------- 204
                   ++D+  N F +SSDFCHWGSRF Y  Y     V                   
Sbjct: 183 DIVDSLLPFMDNEENHFIISSDFCHWGSRFGYTKYLSSSDVTLDNLEDEITSLGMFQRTK 242

Query: 205 ---IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
              I+KSIE LDK+ ++I  TG   A+K+Y+    NTICG+ PIS+ L +L
Sbjct: 243 GLEIYKSIELLDKLALEIASTGSSAAWKQYIDTTGNTICGQKPISIVLKLL 293


>gi|240276562|gb|EER40073.1| DUF52 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 416

 Score =  193 bits (490), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 128/369 (34%), Positives = 174/369 (47%), Gaps = 88/369 (23%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+D+   L+ +L+ WL +      G+ + P    R +IAPHAGY+YSG  A
Sbjct: 44  REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPHAGYAYSGPCA 103

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL---- 111
           A+A+  +D +   R+FLLGPSHH++    AL   T YKTP+    LPLD ++I  L    
Sbjct: 104 AWAYKALDLSKAKRIFLLGPSHHHHLSTLALPQLTSYKTPLSPDPLPLDTDLITHLLSTT 163

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEA 163
                F  M   VD AEHS+E+HLPY+  +              ++VP++VG+ +   EA
Sbjct: 164 STNTHFTTMSPPVDSAEHSLELHLPYIHHLLRTLYPTRAPAAYPRLVPMMVGSTSPATEA 223

Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------- 198
            +G L A Y+ D +N F VSSDFCHWG RF Y +Y                         
Sbjct: 224 AFGALLAPYLADDTNAFVVSSDFCHWGLRFGYTYYLPVPAASSTTTTTTRGTATGAAEAR 283

Query: 199 ---------------DKKHG------VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                           +  G      +IH+SI A D   M  I +G+ + F   L    N
Sbjct: 284 AAIDAVSGGGAFKARSRMLGGGAGVPLIHESISACDVACMRAIASGETELFLDALRRTGN 343

Query: 238 TICGRHPISVFLHMLGNCS---------------------TKIKIKFLRYEQSSQCKTKR 276
           T+CGRHPI V +  L                          K K  F+RYE+SS      
Sbjct: 344 TVCGRHPIGVVMAGLEAVRRSQGEGERDGDGDGDGDGDGLQKSKFYFVRYERSSDVVAAG 403

Query: 277 DSSVSYASA 285
           +SSVSY SA
Sbjct: 404 ESSVSYVSA 412


>gi|365984735|ref|XP_003669200.1| hypothetical protein NDAI_0C02970 [Naumovozyma dairenensis CBS 421]
 gi|343767968|emb|CCD23957.1| hypothetical protein NDAI_0C02970 [Naumovozyma dairenensis CBS 421]
          Length = 335

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 118/333 (35%), Positives = 174/333 (52%), Gaps = 44/333 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   R A+HAGSWY+ N+  L+ +L  +L  +  P     R +I+PHAGY Y G   AY+
Sbjct: 1   MSLSRPATHAGSWYSRNAAELSSQLQTYLNNSKKPIVKNSRIIISPHAGYRYCGSTMAYS 60

Query: 61  FGNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL---KATG 115
           + ++D   +I R+F+LGPSHH +++ +  +S      TP+GDL +D ++  +L   K +G
Sbjct: 61  YASLDLNENIKRIFILGPSHHIFFSNEIFVSAFDSISTPLGDLKVDKDLCSKLVRKKISG 120

Query: 116 K--FELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRL 168
           K  F  M+  VD  EHS+EM    L +      +     K++P++V     E +   G++
Sbjct: 121 KKIFSFMNQDVDTDEHSLEMQFSMLVQTLNWRKVPLDNVKVIPMMVSHNTKEFDMSVGKI 180

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHG 203
            ++Y++DPSN F +SSDFCHWG RF Y  Y                          K H 
Sbjct: 181 LSEYLNDPSNLFIISSDFCHWGRRFEYTGYVGSEDELKEATEEETEIEMLTSRSKLKHHQ 240

Query: 204 V-IHKSIEALDKMGM----DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC--S 256
           + I KSIE LDK  M    D   +   DA+K+YL    NTICG  PI V L  L     +
Sbjct: 241 IPIWKSIEILDKHAMRTLSDTQNSDRYDAWKQYLEITGNTICGERPIGVVLAALAYLPEN 300

Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +K+  ++  Y QSSQ ++  +SSVSY S    +
Sbjct: 301 SKVTFEWPHYSQSSQVQSLEESSVSYTSGYVTI 333


>gi|260940813|ref|XP_002615246.1| hypothetical protein CLUG_04128 [Clavispora lusitaniae ATCC 42720]
 gi|238850536|gb|EEQ40000.1| hypothetical protein CLUG_04128 [Clavispora lusitaniae ATCC 42720]
          Length = 304

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 116/303 (38%), Positives = 159/303 (52%), Gaps = 20/303 (6%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A+HAGSWYT + +RLA ++  +    G    P  R +I PHAGY+Y G   A  +   
Sbjct: 3   RPATHAGSWYTGDKRRLAAQIADFFAATG-SMVPGARVLIGPHAGYTYCGARLAETYAAW 61

Query: 65  DPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGK-FELMD 121
           D T + RVF+LGPSHH Y+     +S    Y+TP+G L +D EV ++L KA G  F  M 
Sbjct: 62  DTTGVRRVFILGPSHHVYFRETAKVSPFARYETPLGSLRVDTEVCDQLVKAPGSAFTYMT 121

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLI--KIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
              DE EHS EMH P++A   +   +   IVPI++  +            + Y+ DP+N 
Sbjct: 122 EDEDEDEHSFEMHAPFVAFRAQKDNVAPTIVPIMISGMGPRLRDRLVAQLSPYMADPANT 181

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGV-------------IHKSIEALDKMGMDIIETGDPD 226
           F VSSDFCHWG RF Y  Y  K  +             +H SIE LD+ GM     G   
Sbjct: 182 FVVSSDFCHWGRRFGYTQYTPKADLSLLAAYKHSSGQPLHASIEFLDRSGMRAASAGS-T 240

Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            ++ Y+    NTICG+ P+SV LH++         +++ Y QS Q     DSSVSYAS  
Sbjct: 241 VWEDYIESTGNTICGQKPVSVVLHLVEKHVDGPLFEWIGYSQSGQAVDASDSSVSYASGY 300

Query: 287 AKV 289
            ++
Sbjct: 301 VRM 303


>gi|146103116|ref|XP_001469487.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134073857|emb|CAM72596.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 370

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 120/324 (37%), Positives = 167/324 (51%), Gaps = 39/324 (12%)

Query: 1   MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGY 50
           M  +R A+HA      WY    +RL   +D +  EA      E      + G+IAPHAG 
Sbjct: 43  MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYDEEGHECARMTGLIAPHAGM 102

Query: 51  SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           SYSGR AA AF              + + R+F+LGPSH      C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFRDYLYAKGSKGSELERIFILGPSHTKGFEGCELSAASAYETPFGRL 162

Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
            +D  V++ +     KA           DEAEHS+EM  PYL+ +               
Sbjct: 163 RVDTAVVDRVMTDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPAQPA 222

Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-- 201
              + IVPI+VG  + ++E     +   Y+DD  NFF  SSDFCHWG RF+Y ++ K+  
Sbjct: 223 AGRVAIVPIIVGWTDRQDEKAICDVLKPYMDDARNFFIFSSDFCHWGERFSYTYHYKRSE 282

Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
           +  I  SI A+D   M+++E  D + +  YL    NTICGR PIS+ +    +   K ++
Sbjct: 283 YPNIGDSIIAMDHAAMELLEKRDLERWYAYLQMTKNTICGRAPISIGMQRWADEVNKARV 342

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
           KF+ Y QS++CK   DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366


>gi|392578607|gb|EIW71735.1| hypothetical protein TREMEDRAFT_56524, partial [Tremella
           mesenterica DSM 1558]
          Length = 304

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 112/293 (38%), Positives = 161/293 (54%), Gaps = 46/293 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPK---SP---EVRGVIAPHAGYSYSGRA 56
           +RRASHAGSWY  N  +L EEL+  L +   LP    SP   + + +I+PHAGY+YSG+A
Sbjct: 7   VRRASHAGSWYDANPIQLKEELESNLSQVVPLPTLDYSPPCQDAKAIISPHAGYTYSGQA 66

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA+A+ +I      +VF++GPSHH      ALS    Y TP G++ L  + I+ L+ TG+
Sbjct: 67  AAWAYASIPTDRYKKVFVIGPSHHQSFHGIALSPFKSYATPFGEILLCTDTIQALRETGE 126

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGH-LIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           F  M    DE EHS+EM +PYL  +F+G   +++VP++V    A     Y  + A Y +D
Sbjct: 127 FTQMRSAGDEEEHSLEMQMPYLRLIFQGRDDLRLVPLIVSHPTAAQYESYASILAPYWND 186

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKH------------------------------GVI 205
           P +FF  S+DFCHWGSRF+Y +Y  +                               G I
Sbjct: 187 PESFFVFSTDFCHWGSRFSYTNYYPQAPFPEDPVPPISSPGEKPAIPLTMRSTPGGGGQI 246

Query: 206 HKSIEALDKMGMDIIETGDP------DAFKKYLLEYDNTICGRHPISVFLHML 252
            +SI+ +D  G+ I+   DP      D +  YL    NTICGR+ +S+FLH++
Sbjct: 247 WQSIQYMDHEGIAILR--DPTSPTAVDDWNAYLNRTGNTICGRYGLSLFLHLV 297


>gi|398024154|ref|XP_003865238.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322503475|emb|CBZ38560.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 370

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 120/324 (37%), Positives = 167/324 (51%), Gaps = 39/324 (12%)

Query: 1   MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGY 50
           M  +R A+HA      WY    +RL   +D +  EA      E      + G+IAPHAG 
Sbjct: 43  MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYDEEGHECARMTGLIAPHAGM 102

Query: 51  SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           SYSGR AA AF              + + R+F+LGPSH      C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFRDYLYAKGSKGSELERIFILGPSHTKGFEGCELSAASAYETPFGRL 162

Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
            +D  V++ +     KA           DEAEHS+EM  PYL+ +               
Sbjct: 163 RVDTAVVDRVITDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPAQPA 222

Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-- 201
              + IVPI+VG  + ++E     +   Y+DD  NFF  SSDFCHWG RF+Y ++ K+  
Sbjct: 223 AGRVAIVPIIVGWTDRQDEKAICDVLKPYMDDARNFFIFSSDFCHWGERFSYTYHYKRSE 282

Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
           +  I  SI A+D   M+++E  D + +  YL    NTICGR PIS+ +    +   K ++
Sbjct: 283 YPNIGDSIIAMDHAAMELLEKRDLERWYAYLQMTKNTICGRAPISIGMQRWADEMNKARV 342

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
           KF+ Y QS++CK   DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366


>gi|320581608|gb|EFW95828.1| ATP-dependent helicase DHX8 , putative [Ogataea parapolymorpha
           DL-1]
          Length = 1522

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 105/267 (39%), Positives = 148/267 (55%), Gaps = 27/267 (10%)

Query: 12  SWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISR 71
           SWYT ++ RL  +LD WL +A  P +   R ++ PHAGY+Y+G   A  +  +D + I R
Sbjct: 399 SWYTGSALRLESQLDKWLSQASGPVA-GARLLVGPHAGYAYAGETLAQTYSALDASGIKR 457

Query: 72  VFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEEL--KATGKFELMDICVDEAE 128
           VF++GPSHH Y   C + S    Y+TP+G++P+D + I++L  K +  F LM   +D  E
Sbjct: 458 VFIMGPSHHVYFRGCVMTSNFDYYETPLGNVPVDKQTIKDLVAKDSKMFRLMSEEMDREE 517

Query: 129 HSMEMHLPYLAKVF---EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSD 185
           HS EMH+P+L +V     G + KI+PI++ A   + E         Y  D  N F +S+D
Sbjct: 518 HSFEMHMPFLYRVTSKARGSVPKIIPIMISATEEKFERKLAEHLTPYFQDHENAFVISTD 577

Query: 186 FCHWGSRFNYMHY----------DKKHGV----------IHKSIEALDKMGMDIIETGDP 225
           FCHWGSRF Y  Y          D ++G           I+KSIEALDK  M I      
Sbjct: 578 FCHWGSRFGYTSYTPSGKIADIQDLRYGTSVLKKSDALPIYKSIEALDKEAMKIASKTSY 637

Query: 226 DAFKKYLLEYDNTICGRHPISVFLHML 252
            AF+ YL   +NTICG  P+SV L ++
Sbjct: 638 KAFRDYLAATENTICGAKPLSVLLLLM 664


>gi|327298557|ref|XP_003233972.1| hypothetical protein TERG_05841 [Trichophyton rubrum CBS 118892]
 gi|326464150|gb|EGD89603.1| hypothetical protein TERG_05841 [Trichophyton rubrum CBS 118892]
          Length = 375

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 112/320 (35%), Positives = 159/320 (49%), Gaps = 74/320 (23%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYSGRA 56
           IRR +HAGSWY+D+   L  +LD WL     +  GL K   P  R +IAPHAGY+YSG  
Sbjct: 3   IRRHTHAGSWYSDHGPTLEAQLDQWLDLVPGKLEGLGKLPVPGARVIIAPHAGYAYSGPC 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELK-- 112
           AA+A+  +D +   R+FL+GPSHH +    A+   T Y TP+    LPLD +V+ +++  
Sbjct: 63  AAFAYKCLDLSKAKRIFLIGPSHHLHLTTIAVPQLTEYLTPLSPMPLPLDTDVLAKIEQE 122

Query: 113 ---ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
              A   F  M   VDE EHSME+HLPY+ ++ +             +VP++VG+ +A N
Sbjct: 123 ASLADKPFSRMSRSVDEQEHSMELHLPYIHRLLQRLYPSMPTSAYPPLVPMMVGSTSAAN 182

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG------------------ 203
           E ++G +   Y+ D  N F +SSDFCHWG+RF Y +Y +  G                  
Sbjct: 183 ERVFGSILRPYLADEENAFVISSDFCHWGTRFAYTYYVQAPGPTTYKLPLTYPSLPVPSD 242

Query: 204 ----------------------------------VIHKSIEALDKMGMDIIETGDPDAFK 229
                                              IH+S+ A D   M  I +G+ D F+
Sbjct: 243 MLTAEKAAEEVRAASSGRMLRENDKSYNKEKYMWAIHESVSACDMACMSAIASGNTDVFQ 302

Query: 230 KYLLEYDNTICGRHPISVFL 249
           + L    NT+CGRHPI V +
Sbjct: 303 EALRRTGNTVCGRHPIGVVM 322


>gi|444316904|ref|XP_004179109.1| hypothetical protein TBLA_0B07740 [Tetrapisispora blattae CBS 6284]
 gi|387512149|emb|CCH59590.1| hypothetical protein TBLA_0B07740 [Tetrapisispora blattae CBS 6284]
          Length = 335

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/332 (36%), Positives = 180/332 (54%), Gaps = 46/332 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IRRA+HAGSWY+++S  L+++L+  L +A   K+P  + R +++PHAGY Y G   AY++
Sbjct: 3   IRRATHAGSWYSNHSLELSQQLESCLSKADTMKAPIDKARIIVSPHAGYRYCGPTMAYSY 62

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--- 116
            +++ T ++ R+F+LGPSHH Y+  +  LS     +TP+G+L +D E+ E+L   GK   
Sbjct: 63  ASLNLTQNVKRIFILGPSHHLYFRNEILLSKFNQLETPLGNLTIDNELNEKLIKDGKKHS 122

Query: 117 --FELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRLF 169
             F  MD   D  EHS+EM  P L +      I     KI+P+LV     + +   G++ 
Sbjct: 123 NIFNYMDKDTDLDEHSLEMQYPMLLQTLNWRKISPDKVKIIPMLVSHNTKDVDMSLGKIL 182

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-------------------------HGV 204
             Y+ D  N F +SSDFCHWG RF +  Y                            H V
Sbjct: 183 LPYLKDEKNLFIISSDFCHWGRRFQFTGYVSDEKELEEAIEEETEIEMLTSRSKLPHHKV 242

Query: 205 -IHKSIEALDKMGMDII-ETGDPDAF---KKYLLEYDNTICGRHPISVFLHMLGNCSTK- 258
            I +SIE LDK  M ++ E+ + + +   KKYL    NT+CG +P SV L ++     + 
Sbjct: 243 PIWQSIEILDKHAMKVLTESANIEKYSNWKKYLEISGNTVCGANPCSVILAIIALLKDER 302

Query: 259 -IKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            IK ++ +Y QSS+     DSSVSYAS  AK+
Sbjct: 303 LIKFEWPKYSQSSKVTNVDDSSVSYASGFAKI 334


>gi|401884619|gb|EJT48772.1| spliceosome-related protein [Trichosporon asahii var. asahii CBS
            2479]
 gi|406694217|gb|EKC97549.1| spliceosome-related protein [Trichosporon asahii var. asahii CBS
            8904]
          Length = 1139

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 104/205 (50%), Positives = 129/205 (62%), Gaps = 7/205 (3%)

Query: 1    MEKIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSP---EVRGVIAPHAGYSYS 53
            M  +R ASHAGSWYT+NS RL  EL G+L +       P SP   E + +IAPHAG+SYS
Sbjct: 850  MSGVREASHAGSWYTNNSSRLTAELSGYLDKVHPLNNPPFSPPVKEAKAIIAPHAGFSYS 909

Query: 54   GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
            G AAA+A+  +    I RV LLGPSHH Y    ALS    Y TPIGD+PLDLE I +L+ 
Sbjct: 910  GPAAAWAYAAVPVDKIKRVVLLGPSHHVYLSGIALSKFASYGTPIGDIPLDLEAIADLRK 969

Query: 114  TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
            +  F  M   VDE EHS+EMHLPY+ +VF    IK+VP+LVG  + +         +KY 
Sbjct: 970  SRLFSDMKSSVDEDEHSLEMHLPYIRQVFGKRDIKLVPLLVGHPSEDKRKAISAALSKYW 1029

Query: 174  DDPSNFFSVSSDFCHWGSRFNYMHY 198
            +D   FF +SSDFCHWGSRF+   Y
Sbjct: 1030 NDDETFFIISSDFCHWGSRFSCTPY 1054


>gi|115390657|ref|XP_001212833.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114193757|gb|EAU35457.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 455

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 132/398 (33%), Positives = 174/398 (43%), Gaps = 116/398 (29%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R ASHAGSWY+D+ + L+ +LD WL +        G    P  R +IAP+ GY  S    
Sbjct: 57  RDASHAGSWYSDSQRTLSRQLDQWLAQVPDQIEGLGSMPIPGARIIIAPY-GYDMSSSGY 115

Query: 58  AYAF-------------------------GNIDPTSIS------------RVFLLGPSHH 80
           +  F                          N  P+S +            R+F+LGPSHH
Sbjct: 116 SADFLAMPATHTQVPAQLMPTKRWICRKRKNPKPSSFTTTTYGILPICSKRIFILGPSHH 175

Query: 81  YYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATGK---------FELMDICVDEAEH 129
           +Y    AL   T Y TP+ D  LPLD E+I +L AT           F  M   +DE EH
Sbjct: 176 HYLSTLALPELTSYSTPLSDDPLPLDTELINKLLATKATRPNGSSVGFTTMSRSIDEDEH 235

Query: 130 SMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           S+E+HLPY+ +  +             +VPI+VG+ +A  E  +G L A Y++DPSN F 
Sbjct: 236 SIELHLPYIHRRLQLQYPDKPTSEYPPLVPIMVGSTSASTENAFGALLAPYLEDPSNAFV 295

Query: 182 VSSDFCHWGSRFNYMHY------------------------------------------- 198
           +SSDFCHWGSRF Y +Y                                           
Sbjct: 296 ISSDFCHWGSRFRYTYYVPQAPKPGPRLPLSAEDLPQPGDEAKEVEEKVEMVSAGQSLQR 355

Query: 199 ----DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN 254
                 +   IH+SI A D   M  I TG    F   L +  NT+CGRHPI V +  + +
Sbjct: 356 GDRISSREPAIHESISAFDIATMAAITTGTTSKFLDILQKTGNTVCGRHPIGVIMAAIES 415

Query: 255 C-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
                 S K +  F+RYE+SS      DSSVSY SA A
Sbjct: 416 VTSQEESKKGRFHFIRYERSSDAVDVTDSSVSYVSAVA 453


>gi|150866198|ref|XP_001385707.2| hypothetical protein PICST_36516 [Scheffersomyces stipitis CBS
           6054]
 gi|149387454|gb|ABN67678.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 345

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 176/346 (50%), Gaps = 58/346 (16%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-------PEVRGVIAPHAGYSYS 53
           M  IR A+HAGSWY++N  +L  +L+ +  +A            P  R +I PHAG++YS
Sbjct: 1   MSYIRPATHAGSWYSNNPTKLGLQLEAYFHKAESHSGEDSRHIIPGARILIGPHAGFAYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G   A  F   D + + R+F+LGPSHH Y+     +S    Y+TP G++P+D E IE+L 
Sbjct: 61  GERLAETFTVWDTSKVKRIFMLGPSHHVYFKNSVMVSQFEWYETPFGNIPVDTETIEKLL 120

Query: 113 AT--------------GKFELMDICVDEAEHSMEMHLPYLAKV---FEGHLIKIVPILVG 155
            T                F+ M   +DE EHS EMH P++ +        + KI+PIL+ 
Sbjct: 121 HTKPQSHGHSLTHAKDSVFKYMSEEMDEDEHSFEMHAPFIYQKTHDLPQGIPKIIPILIS 180

Query: 156 AVNAE-NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK------------- 201
            ++ + N+ +   L   Y+++  N F +SSDFCHWGSRF Y  Y  +             
Sbjct: 181 GMDEKLNDEVVSALLP-YLENEENHFIISSDFCHWGSRFGYTKYVPQKVDSLQLLTENLS 239

Query: 202 ---HGV--------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
              H +        I+KSIE LDK  M+I  +G    +K Y+ +  NTICG+ PI+V L 
Sbjct: 240 SLGHSLRTKPNELPIYKSIEVLDKAAMEIASSGSYSDWKTYISQTGNTICGQKPIAVVLK 299

Query: 251 ML-------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           ++       G+       K++ Y QS+Q +   DSSVSYAS    +
Sbjct: 300 LIQKYRLAAGDTDKAAIFKWIGYSQSNQARRASDSSVSYASGYVTI 345


>gi|84996959|ref|XP_953201.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65304197|emb|CAI76576.1| hypothetical protein, conserved [Theileria annulata]
          Length = 297

 Score =  190 bits (482), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 102/245 (41%), Positives = 148/245 (60%), Gaps = 2/245 (0%)

Query: 47  HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
           HAGY+YS + AA+A+  ID TSI  +F+LGPSHH++   CA+   +  +TP+G L +D++
Sbjct: 51  HAGYAYSLKTAAHAYSQIDATSIKTIFVLGPSHHFFLRGCAVDRFSSLQTPLGVLQVDVD 110

Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
           ++E+L     F +++    E EHS+EMHLP L  VF+   +K+VPI+VG  +        
Sbjct: 111 IVEKLSDLKGFSVINNEASEDEHSIEMHLPLLKFVFKKEHVKVVPIMVGEFSESLADELT 170

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNY--MHYDKKHGVIHKSIEALDKMGMDIIETGD 224
                Y +D +  F +SSDFCH+GSRF +    Y+ ++  +++ IE LDK G+D+I    
Sbjct: 171 GALVPYFNDENTLFVISSDFCHFGSRFQFSITGYESENKPLYEKIEMLDKRGIDLIVNHK 230

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            D F  YL E +NTICGR+PI + L +L   +  I  K L Y QSS+     DSSVSYAS
Sbjct: 231 YDDFLWYLNETENTICGRNPILLLLRLLAASNLNISSKLLHYSQSSRITRVSDSSVSYAS 290

Query: 285 AAAKV 289
               V
Sbjct: 291 IVGLV 295


>gi|241950435|ref|XP_002417940.1| conserved hypothetical protein [Candida dubliniensis CD36]
 gi|223641278|emb|CAX45658.1| conserved hypothetical protein [Candida dubliniensis CD36]
          Length = 335

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 117/335 (34%), Positives = 174/335 (51%), Gaps = 51/335 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------EVRGVIAPHAGYSY 52
           IR A+HAGSWY++N  +L  +LD +  +A   K+P             R +I PHAGY+Y
Sbjct: 3   IRPATHAGSWYSNNPAKLRNQLDQFFAKARSLKTPLSGEDASNGIDGARIIIGPHAGYTY 62

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           SG   A +F   D + + RVFLLGPSHH Y+     LS    Y+TP G++P+D+  I++L
Sbjct: 63  SGERLAESFNAWDTSKVKRVFLLGPSHHVYFKNNALLSPFEYYETPFGNIPVDVSTIKDL 122

Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAEN 161
                K    F+ M   +DE EHS EMH P++   ++G      + KIVPIL+  ++++ 
Sbjct: 123 LDQKCKHGPVFKYMSEEIDEDEHSFEMHAPFI--YYKGETAVHGVPKIVPILISGMDSKL 180

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------D 199
           +         Y+++  N F +SSDFCHWGSRF Y  Y                       
Sbjct: 181 QDGIVNSLLPYLENEENHFIISSDFCHWGSRFGYTKYLLSGETTLDTLSDNITSLGSLHQ 240

Query: 200 KKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
            K   I+KSIE LDK+ + I   G   ++ KY+    NTICG+ PI + L +L   + + 
Sbjct: 241 PKGLEIYKSIELLDKIALQIATNGSSASWTKYIDISGNTICGQKPILIVLKLLEKYNNRS 300

Query: 260 KI-----KFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            +      ++ Y QS+  +   +SSVSYAS   ++
Sbjct: 301 LLHGNTFNWIDYSQSNPVQRTSESSVSYASGYVRL 335


>gi|294659081|ref|XP_461417.2| DEHA2F24794p [Debaryomyces hansenii CBS767]
 gi|202953600|emb|CAG89832.2| DEHA2F24794p [Debaryomyces hansenii CBS767]
          Length = 335

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 117/333 (35%), Positives = 170/333 (51%), Gaps = 48/333 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           IR ASHAG+WY +++ +L  +L+G+ ++A      G P     R +I PHAG++YSG   
Sbjct: 3   IRPASHAGTWYPNSTIKLGSQLEGFFKQAHNVSSNGKPVK-GARILIGPHAGFAYSGARL 61

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEEL----- 111
           A  F   D + + RVF+LGPSHH Y    AL S    Y+TP G LP+D+EV ++L     
Sbjct: 62  AETFTAWDTSKVKRVFILGPSHHVYFKSSALVSRYEYYETPFGKLPVDVEVTKQLTGIEG 121

Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGR 167
            K +  F+ M   VD+ EHS EMH P++   +K     +  I+PI++  +          
Sbjct: 122 SKGSRVFKYMSEDVDDDEHSFEMHTPFIYYKSKHLPQGVPSIIPIMISGMGDRLSTDIVD 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------DKKHGV-------- 204
               Y+DD  N F +SSDFCHWGSRF Y  Y               + +H          
Sbjct: 182 ALLPYLDDEENTFIISSDFCHWGSRFGYTKYLLQKPTKGIQEDDLINLRHSTKMKSTDLP 241

Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK---- 260
           I++SIE LD+  M+I   G  + +++Y+    NTICG+ PI + L +L       K    
Sbjct: 242 IYQSIELLDREAMNIASAGSYNLWQQYISVTSNTICGQKPIGIILRLLEEYEKSGKSIVT 301

Query: 261 ----IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
                 ++ Y QSS+     DSSVSYAS   ++
Sbjct: 302 GSKVFNWINYSQSSEVHKHHDSSVSYASGYVQL 334


>gi|401842455|gb|EJT44665.1| YJR008W-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 338

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 119/332 (35%), Positives = 178/332 (53%), Gaps = 51/332 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           +R A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   VRPATHAGSWYSNRTQELSQQLHTYLTKSSL-KGPIRNARVIICPHAGYKYCGPTMAYSY 61

Query: 62  GNIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-------K 112
            ++D +S + R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D ++ + L       K
Sbjct: 62  ASLDISSDVKRIFILGPSHHIYFKNQILISAFSELETPLGNLKVDTDLCKALVKKEHSEK 121

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
               F+LMD   D AEHS+EM  P L +  +        +KIVP++V   + + +   G 
Sbjct: 122 GKKLFKLMDQDTDMAEHSLEMQFPMLVETLKWRGLSLDEVKIVPMIVSHNSIDIDHSIGD 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ +P+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 VLSEYIKNPNNLFIVSSDFCHWGRRFQYTGYVGSKEELDEAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDII-ETGDPD---AFKKYLLEYDNTICGRHPISVFLHMLGNC-- 255
            V I +SIE +DK  M  + +T +P+   A+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDKYAMKTLADTANPERYGAWKQYLEITGNTICGEKPISVILSALSRIKD 301

Query: 256 --STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
             +  IK  +  Y QSS   +  DSSVSY S 
Sbjct: 302 ASALTIKFHWPHYSQSSHVTSIDDSSVSYTSG 333


>gi|365759923|gb|EHN01681.1| YJR008W-like protein [Saccharomyces cerevisiae x Saccharomyces
           kudriavzevii VIN7]
          Length = 338

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 119/337 (35%), Positives = 180/337 (53%), Gaps = 51/337 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           +R A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   VRPATHAGSWYSNRTQELSQQLHTYLTKSSL-KGPIRNARVIICPHAGYKYCGPTMAYSY 61

Query: 62  GNIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-------K 112
            ++D +S + R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D ++ + L       K
Sbjct: 62  ASLDISSDVKRIFILGPSHHIYFKNQILISAFSELETPLGNLKVDTDLCKALVKKEHSEK 121

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
               F+LMD   D AEHS+EM  P L +  +        +KIVP++V   + + +   G 
Sbjct: 122 GKKLFKLMDQDTDMAEHSLEMQFPMLVETLKWRGLSLDEVKIVPMIVSHNSIDIDHSIGD 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ +P+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 VLSEYIKNPNNLFIVSSDFCHWGRRFQYTGYVGSKEELDEAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDII-ETGDPD---AFKKYLLEYDNTICGRHPISVFLHMLGNC-- 255
            V I +SIE +DK  M  + +T +P+   A+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDKYAMKTLADTANPERYGAWKQYLEITGNTICGEKPISVILSALSRIKD 301

Query: 256 --STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
             +  IK  +  Y QSS   +  DSSVSY S    ++
Sbjct: 302 ASALSIKFHWPHYSQSSHVTSIDDSSVSYTSGYVTLE 338


>gi|401419944|ref|XP_003874461.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322490697|emb|CBZ25959.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 370

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 120/324 (37%), Positives = 163/324 (50%), Gaps = 39/324 (12%)

Query: 1   MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGL------PKSPEVRGVIAPHAGY 50
           M  +R A+HA      WY    +RL   +D +  EA         +   +  +IAPHAG 
Sbjct: 43  MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYDNEGKECARMTCLIAPHAGM 102

Query: 51  SYSGRAAAYAFGNIDP---------TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           SYSGR AA AF                + R+F+LGPSH      C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFREYLYAKGSKGCELERIFILGPSHTKGFEGCELSAASAYETPFGSL 162

Query: 102 PLDLE----VIEELKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
            +D      VI +L+  G          DEAEHS+EM  PYL+ +               
Sbjct: 163 KVDTAAVDLVITDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPAQPA 222

Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG 203
              + IVPI+VG    ++E     +   Y+DD  NFF  SSDFCHWG RF+Y ++ K+  
Sbjct: 223 AGRVSIVPIIVGWTTRQDEKAIYDVLKPYMDDARNFFIFSSDFCHWGERFSYTYHYKRSE 282

Query: 204 VIH--KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
             H   SI A+D   M+++E  D + +  YL    NTICGR PIS+ +    N   K ++
Sbjct: 283 YPHIGDSIIAMDHAAMELLEKRDLEKWYAYLQITKNTICGRAPISIGMQRWANEVNKAQV 342

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
           KF+ Y QS++CK   DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366


>gi|344232722|gb|EGV64595.1| hypothetical protein CANTEDRAFT_120277 [Candida tenuis ATCC 10573]
          Length = 326

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 118/322 (36%), Positives = 158/322 (49%), Gaps = 40/322 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A+H+GSWY+ +   L  ++ G+      P  P  R ++ PHAG+SYSG   A  F   
Sbjct: 4   RPATHSGSWYSGSRHALHTQITGFFANKVNPI-PRARVLVGPHAGFSYSGHRLAETFSAW 62

Query: 65  DPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKA---TGK--FE 118
           D   + RVF+LGPSHH Y+     +S    Y TPIG LP+D  +   L     +GK  F+
Sbjct: 63  DTDHVQRVFILGPSHHVYFRNHVKVSHYAFYDTPIGKLPVDTSINHHLTQGLHSGKRVFQ 122

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFE----GHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            MD   DE EHS EMH P++    +       + IVPI++  ++        R  A Y  
Sbjct: 123 YMDAETDEDEHSFEMHCPFIKHRLDIDQKSAGVTIVPIMISGMDGPCADAVCRALAPYFS 182

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKH------GV--------------------IHKS 208
           DP+N F VSSDFCHWG RFNY  Y  K       G+                    IH+S
Sbjct: 183 DPANTFVVSSDFCHWGERFNYTQYVPKPLDYTAVGIDDVVAELMTVRSNKQLDTTEIHES 242

Query: 209 IEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN---CSTKIKIKFLR 265
           IE LD++ M I   G    +K+Y+ +  NTICG  P+ V + +L      +      +L 
Sbjct: 243 IEILDRVAMHIASHGTWKDWKRYIEKTRNTICGEKPLMVVMRLLEGERAAAAGSAFHWLG 302

Query: 266 YEQSSQCKTKRDSSVSYASAAA 287
           Y QSS      DSSVSYAS  A
Sbjct: 303 YSQSSSVINVHDSSVSYASGYA 324


>gi|353229017|emb|CCD75188.1| hypothetical protein Smp_097240.4 [Schistosoma mansoni]
          Length = 240

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 135/220 (61%), Gaps = 5/220 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +  IR +SHAGSWY+ +   L+ +LD WL   E  +     VR +I PHAGY +SG  AA
Sbjct: 5   LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64

Query: 59  YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           +A+  I+P  I  +F++GPSH       CAL+  + Y+TP  +L ++  V   LK+   F
Sbjct: 65  HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDD 175
            +++   DEAEHS+EM LPY+A + +    +  IVPI+VG+++ E +  +G L + Y+ D
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLD 184

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
             N F +SSDFCHWG RF Y +Y+K  G I +SIE LD +
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYNKSDGAIWQSIEKLDHL 224


>gi|219564308|dbj|BAH03725.1| hypothetical protein [Saccharomyces pastorianus]
 gi|219564312|dbj|BAH03728.1| hypothetical protein [Saccharomyces pastorianus]
          Length = 338

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 122/337 (36%), Positives = 177/337 (52%), Gaps = 51/337 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++  + L+++L  +L ++   K P    R +I PHAGY Y G   A+++
Sbjct: 3   IRPATHAGSWYSNKPQELSQQLQTYLTKS-TTKGPIHNARIIICPHAGYRYCGHTMAHSY 61

Query: 62  GNIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-----KAT 114
            ++D  S   R+F+LGPSHH Y+  +  +S     +TP+G L +D ++ + L      A 
Sbjct: 62  ASLDLNSKTKRIFILGPSHHVYFRNQILVSAFNGLETPLGSLNVDTDLCKALVKREHPAN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+LMD   D AEHS+EM  P L +  +        +K+VP++V   + + +   G 
Sbjct: 122 GKKLFKLMDHDTDMAEHSLEMQFPMLVETLKWRGVSLEEVKVVPMMVSHNSTDIDQSIGE 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           +  +Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILFEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSREELNEAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDII-ETGDP---DAFKKYLLEYDNTICGRHPISVFLHML----G 253
            V I +SIE +D+  M  + ET +    DA+K+YL    NT+CG  PISV L  L    G
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSETPNSERYDAWKQYLEITGNTVCGEKPISVILSALSKIEG 301

Query: 254 NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
             +T IK ++  Y QSS   +  DSSVSY S    +D
Sbjct: 302 ARATNIKFQWPSYSQSSHVTSIDDSSVSYTSGYVTLD 338


>gi|448084446|ref|XP_004195606.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
 gi|359377028|emb|CCE85411.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
          Length = 344

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 119/340 (35%), Positives = 170/340 (50%), Gaps = 54/340 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS------------PEVRGVIAPHAGYS 51
           +R A+H G WY+++S +LA +++ + ++A   K                R +I PHAGYS
Sbjct: 3   VRPATHGGLWYSNSSSKLAHQMEAYFKKAASLKEQLYGDDDITGPISGARILIGPHAGYS 62

Query: 52  YSGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           YSG   A  F   D + + RVF+LGPSHH Y+     +S    Y+TP+G+LP+D  + +E
Sbjct: 63  YSGERLAETFTVWDTSKVKRVFILGPSHHVYFKNYAQISKYDYYETPLGNLPVDSSLSKE 122

Query: 111 LKA-----TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAE 160
           L +     TG   F+ M   VDE EHS EMH PY+   +K     +  I+PI++  ++  
Sbjct: 123 LVSYRGSRTGAPVFKYMSEEVDEDEHSFEMHAPYIYYRSKGLAQGVPSIIPIMISGMDDS 182

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK------------------- 201
                    + Y+ D  N F +SSDFCHWG RF +  Y KK                   
Sbjct: 183 TCDDLASALSPYLADEENTFIISSDFCHWGYRFGFTEYLKKVPSGTNISSNDIITLTPST 242

Query: 202 ----HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
                  IH+SIE LDK+ M I   G  DA+KKY+    NTICG+ PI V L  L    +
Sbjct: 243 TTSPSMPIHRSIEHLDKLAMLIASKGSHDAWKKYIALTGNTICGQRPIYVILKTLDKYKS 302

Query: 258 --------KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
                   +    ++ Y QSS  +  ++SSVSYAS   K+
Sbjct: 303 SKPDLFQGQSVFNWINYSQSSPIRVFKESSVSYASGYVKL 342


>gi|290991406|ref|XP_002678326.1| predicted protein [Naegleria gruberi]
 gi|284091938|gb|EFC45582.1| predicted protein [Naegleria gruberi]
          Length = 309

 Score =  187 bits (474), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 113/302 (37%), Positives = 170/302 (56%), Gaps = 20/302 (6%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPEVRGVIAPHAGYSYSGRAAA 58
           RR + AG+WY D+ + L+ EL+ +L +A       +    +V+ +I+PHAGY YS   +A
Sbjct: 5   RRDAFAGAWYQDDKEALSHELNDYLEKASSTSSTSIHHGKKVKAIISPHAGYRYSAPTSA 64

Query: 59  YAFGNIDPTS----ISRVFLLGPSHHYYTP-KCALSTATVYKTPIGDLPLDLEVIEEL-- 111
           + +  +  +S    I  VF+LGP H  Y   +C LS      TP+G L ++  +  EL  
Sbjct: 65  HGYSCVLNSSNYDKIRTVFVLGPCHRLYLDDQCVLSKCDQCNTPLGALKVNKSIQSELVS 124

Query: 112 KATGKFE-LMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRL 168
           K +  F  L D   +E EHS+E+  P++A +F   L  I+IVPI+VG +++E+ +     
Sbjct: 125 KYSDLFGYLKDKKNEEDEHSLELQFPFIAYLFREKLNQIQIVPIMVGDLSSESISRMAES 184

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDII--ETGDPD 226
              Y+D   + F +SSDFCHWG RF+Y +   K   + KSI+ LD +G+  I   + +P+
Sbjct: 185 LTPYLDSEESLFVISSDFCHWGKRFSYQYVINKDISLSKSIQQLDMLGVSKITEHSKNPE 244

Query: 227 AFKKYLLEYDNTICGRHPISVFLHML--GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            F  Y+ +Y NTICG++PI V L+ L   N     +  FL Y QSS  K+  DSSVSY S
Sbjct: 245 EFYNYMKKYKNTICGKNPILVMLNALQKSNNWKNHESLFLNYSQSSAAKSMEDSSVSYVS 304

Query: 285 AA 286
            A
Sbjct: 305 IA 306


>gi|344302348|gb|EGW32653.1| hypothetical protein SPAPADRAFT_61718 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 341

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/339 (34%), Positives = 167/339 (49%), Gaps = 53/339 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-----------PEVRGVIAPHAGYSY 52
           +R A+HAGSWYT++  +L  +LD +  +A   +            P  R +I PHAGY+Y
Sbjct: 3   VRSATHAGSWYTNSPTKLRAQLDSFFEKAKRNEKTIHGPIANDSIPGARILIGPHAGYAY 62

Query: 53  SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEEL 111
           SG   A  F   D + + RVFLLGPSHH Y    AL S+   Y TP G++P+D E I +L
Sbjct: 63  SGERLAETFTAWDTSKVKRVFLLGPSHHVYFRNYALVSSFDSYATPFGEIPVDKETISKL 122

Query: 112 --------KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH---LIKIVPILVGAVNAE 160
                        F+LM   VDE EHS EMH P++    E     + KI+PI++  ++ +
Sbjct: 123 LRKESKFDSRDAVFKLMSEEVDEDEHSFEMHAPFIHYRGEKAPLGMPKIIPIMISGMDEK 182

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------------- 198
                      Y+D   N F +SSDFCHWGSRF+Y  Y                      
Sbjct: 183 LNEDIVECLLPYMDHEENHFIISSDFCHWGSRFSYTKYLNTEDTSLKTVGHDLYSLGTFN 242

Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK 258
             K   I+KSIE LD+M +DI   G    +++Y+    NTICG+ PIS+ L ++     K
Sbjct: 243 KPKELEIYKSIELLDRMALDIATKGSSRDWQEYIKITGNTICGQKPISIVLRLVEEYHKK 302

Query: 259 IK--------IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
                       ++ Y QS+  +   +SSVSYAS   ++
Sbjct: 303 APRSSPVDGVFNWIGYSQSNPVRRSGESSVSYASGYVQL 341


>gi|448079961|ref|XP_004194508.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
 gi|359375930|emb|CCE86512.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
          Length = 344

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 172/344 (50%), Gaps = 62/344 (18%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS------------PEVRGVIAPHAGYS 51
           +R A+H G WY+++S +LA +++ +  +A   K                R +I PHAGYS
Sbjct: 3   VRPATHGGLWYSNSSSKLAHQMEAYFMKAASLKEQLYGDDDVTGPVSGARILIGPHAGYS 62

Query: 52  YSGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           YSG   A  F   D + + RVF+LGPSHH Y+     +S    Y+TP+G+LP+D  V + 
Sbjct: 63  YSGERLAETFTVWDTSKVKRVFILGPSHHVYFKNYAQISKYDYYETPLGNLPVDSTVSKA 122

Query: 111 LKA-----TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAE 160
           L +     TG   F+ M   VDE EHS EMH PY+   +K     +  I+PI++  ++  
Sbjct: 123 LASYRGNRTGAPVFKYMSEEVDEDEHSFEMHAPYIYYRSKGLAQGVPSIIPIMISGMDDG 182

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK--HGV-------------- 204
                    + Y+ D  N F VSSDFCHWG RF +  Y K+   G               
Sbjct: 183 TCDDLASALSPYLADEENTFIVSSDFCHWGYRFGFTEYLKRVPSGTNISSNDIVTLTPST 242

Query: 205 -------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
                  IH+SIE LDK+ M I   G  DA+KKY+    NTICG+ PI V L ML     
Sbjct: 243 ATSPSMPIHRSIEHLDKLAMLIASKGSHDAWKKYIALTGNTICGQRPIYVILKML----D 298

Query: 258 KIKIK------------FLRYEQSSQCKTKRDSSVSYASAAAKV 289
           K KI+            ++ Y QSS  +  ++SSVSYAS   K+
Sbjct: 299 KYKIQKPDLFQGQSVFNWINYSQSSPVRVFKESSVSYASGYVKL 342


>gi|320033881|gb|EFW15827.1| hypothetical protein CPSG_07454 [Coccidioides posadasii str.
           Silveira]
          Length = 274

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 133/217 (61%), Gaps = 24/217 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLRE--------AGLPKSPEVRGVIAPHAGYSYSGRA 56
           R A+HAGSWY+DN+  L  +LD W+            LP +   R +IAPHAGY+YSG  
Sbjct: 4   REATHAGSWYSDNAATLTRQLDEWMNRVPNEIEGIGSLPVA-GARIIIAPHAGYAYSGPC 62

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL--- 111
           AA+A+ ++D +   R+FLLGPSHH+   K AL   + Y TP+    LPLD E+I+EL   
Sbjct: 63  AAFAYKSLDLSKAKRIFLLGPSHHHPFSKIALPELSSYSTPLSQEPLPLDREIIDELSTR 122

Query: 112 --KATGKFELMDICVDEAEHSMEMHLPYLAKVFEG--------HLIKIVPILVGAVNAEN 161
               T +F  M+  +DEAEHS+E+HLPY+  + +G           K+VP++VG+ +A  
Sbjct: 123 TENGTVRFTTMNQAIDEAEHSLELHLPYIHYLLQGLYPGEPAASYPKLVPMMVGSTSAPT 182

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
           E  +GR+ A Y+ +P N F VSSDFCHWG RF Y +Y
Sbjct: 183 EQAFGRILAPYLANPENAFIVSSDFCHWGLRFAYAYY 219


>gi|50303155|ref|XP_451515.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640646|emb|CAH03103.1| KLLA0A11836p [Kluyveromyces lactis]
          Length = 321

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 121/323 (37%), Positives = 163/323 (50%), Gaps = 42/323 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           M  +R A+HAGSWY+ +S  L   L   L R+  +P +   + +I+PHAGY Y G   A 
Sbjct: 1   MSSVRPATHAGSWYS-SSPALITTLQQLLSRDEAVPGT---KHLISPHAGYKYCGDTMAL 56

Query: 60  AFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
            +  +D T++ R  ++GPSHH Y+  K  +++     TP GD  +D  +  EL   GK  
Sbjct: 57  GYSKLDFTNVKRCIIMGPSHHIYFKNKIQVTSFDEVNTPFGDFKVDGGLRNELLVGGKVV 116

Query: 119 LMDICVDEAEHSMEMHLPYL---AKV--FEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
           LM   VD  EHS+EM  P L   AK+   +   IKI+P+LV   + + +   G+L  KYV
Sbjct: 117 LMSKEVDMDEHSLEMQFPMLWAAAKLRGVDPTNIKILPLLVSHNSTKVDFELGKLLQKYV 176

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHY---------------------------DKKHGV-I 205
           DD    F +SSDFCHWG RF Y  Y                              H V I
Sbjct: 177 DDEETLFILSSDFCHWGRRFGYTGYVMESSDIDESLENDSEVEYLTQRSFNHKSSHNVPI 236

Query: 206 HKSIEALDKMGMDIIETGDPDAF---KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIK 262
           +KSIE LD+  MDI+     + +   KKY+    NTICG  PI VFL  +   +   +  
Sbjct: 237 YKSIEILDQYAMDILGKNTENKYYDWKKYINVTGNTICGERPIGVFLCAMSLLNKPYQFN 296

Query: 263 FLRYEQSSQCKTKRDSSVSYASA 285
           +  Y QSSQC    DSSVSYAS 
Sbjct: 297 WAHYSQSSQCVDLEDSSVSYASG 319


>gi|149239700|ref|XP_001525726.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146451219|gb|EDK45475.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 356

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 174/355 (49%), Gaps = 68/355 (19%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP----------------KSPEVRGVIAP 46
            +R A+HAG WY+ ++  L ++LD  + +A                   K P  R +I P
Sbjct: 2   NVRPATHAGLWYSGDAGALKQQLDALILKAQRSANTTTATTATTTTKNGKVPGARVLIGP 61

Query: 47  HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA-TVYKTPIGDLPLDL 105
           HAGY+YSG   A  +   D + + RVFLLGPSHH Y    A+ T    Y+TP+G+LP+D 
Sbjct: 62  HAGYTYSGERLAETYNVWDTSKVKRVFLLGPSHHVYFKNYAMLTPFEFYETPLGNLPVDT 121

Query: 106 EVIEELKATGK--------FELMDICVDEAEHSMEMHLPYL----AKVFEGHLIKIVPIL 153
             I++L +T          F++M   VDE EHS EMH P++         G+  KI+PI+
Sbjct: 122 ATIKKLLSTTSSSSISKPLFKVMAEDVDEDEHSFEMHAPFIYHQTVNTSGGNPPKIIPIM 181

Query: 154 VGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY--DKKHGV------- 204
           +  ++   ++        ++    N F +SSDFCHWGSRF Y       KH         
Sbjct: 182 ISGMDHATQSDLAEALLPFIASEENHFIISSDFCHWGSRFGYTKVLPSNKHTTFNTLEKD 241

Query: 205 ---------------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL 249
                          IH+S+  LDKM ++I   G  D +K+Y+    NTICG+ PIS+ L
Sbjct: 242 LVSLTSMSKLNSGTSIHESVANLDKMALEIATRGSSDMWKQYIKITGNTICGQKPISIVL 301

Query: 250 HML---GNCSTKIK------------IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            +L   G  +  IK             +++ Y QS+  K+ RDSSVSYAS   K+
Sbjct: 302 QLLEKYGRLNYNIKSEAGNTDSINNVFEWIGYSQSNPAKSVRDSSVSYASGYVKL 356


>gi|323308440|gb|EGA61685.1| YJR008W-like protein [Saccharomyces cerevisiae FostersO]
          Length = 338

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGSTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D +     + +E    
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +  +        +K++P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
            V I +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301

Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              + IK ++  Y QSS   +  DSSVSY S    +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337


>gi|365764811|gb|EHN06331.1| YJR008W-like protein [Saccharomyces cerevisiae x Saccharomyces
           kudriavzevii VIN7]
          Length = 338

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 118/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-----KAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D ++ + L        
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKVYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +  +        +K++P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
            V I +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301

Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              + IK ++  Y QSS   +  DSSVSY S    +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337


>gi|151945084|gb|EDN63335.1| conserved protein [Saccharomyces cerevisiae YJM789]
 gi|190409495|gb|EDV12760.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
 gi|207343905|gb|EDZ71221.1| YJR008Wp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|256270208|gb|EEU05430.1| YJR008W-like protein [Saccharomyces cerevisiae JAY291]
 gi|323332839|gb|EGA74242.1| YJR008W-like protein [Saccharomyces cerevisiae AWRI796]
 gi|323354328|gb|EGA86168.1| YJR008W-like protein [Saccharomyces cerevisiae VL3]
          Length = 338

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 118/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D +     + +E    
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +  +        +K++P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
            V I +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301

Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              + IK ++  Y QSS   +  DSSVSY S    +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337


>gi|6322467|ref|NP_012541.1| Mho1p [Saccharomyces cerevisiae S288c]
 gi|1353049|sp|P47085.1|YJX8_YEAST RecName: Full=MEMO1 family protein YJR008W
 gi|854581|emb|CAA60930.1| ORF YJR83.9 [Saccharomyces cerevisiae]
 gi|1015634|emb|CAA89530.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|285812901|tpg|DAA08799.1| TPA: Mho1p [Saccharomyces cerevisiae S288c]
 gi|392298436|gb|EIW09533.1| hypothetical protein CENPK1137D_1303 [Saccharomyces cerevisiae
           CEN.PK113-7D]
          Length = 338

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 119/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D +     + +E    
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +  +        +K+ P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVFPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
            V I +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301

Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              + IK ++  Y QSS   +  DSSVSYAS    +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYASGYVTI 337


>gi|258577287|ref|XP_002542825.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237903091|gb|EEP77492.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 383

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 119/349 (34%), Positives = 167/349 (47%), Gaps = 77/349 (22%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
           R A+H+GSWY+D++  L  +LD W+        G+   P    R +IAPHAG++YSGR A
Sbjct: 4   REAAHSGSWYSDDAATLTRQLDEWMGRVPNEIEGIGSLPVAGARVIIAPHAGFAYSGRCA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELKATG 115
           A+A+  +D +   R+FL+GPSHH+   K AL   + Y TP+    LPLD EVI EL    
Sbjct: 64  AFAYKCLDLSKAKRIFLIGPSHHHPFSKIALPEVSSYSTPLSPDPLPLDKEVIAELLNRA 123

Query: 116 -----KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
                +F  M   +DEAEHS+E+HLPY+  + +            K+VP++VG+ +A  E
Sbjct: 124 ENGHVRFCTMSQAIDEAEHSLELHLPYIHYLLQRLYPDEPAASYPKLVPMMVGSTSAPTE 183

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
             +GR+ A Y+ +P N F +SSDFCHWG RF Y +Y                        
Sbjct: 184 QAFGRILAPYLANPENAFIISSDFCHWGLRFAYAYYIDDVPSPGPVLPLSYDALPQPSEA 243

Query: 199 ----------------------DK--KHG---VIHKSIEALDKMGMDIIETGDPDAFKKY 231
                                 D+  KH     I++SI A D   M  I +G    F   
Sbjct: 244 LKLGSARRQITAVSSGRYLRAGDQLPKHADVPAIYESISACDIACMSAIASGHKQTFLDA 303

Query: 232 LLEYDNTICGRHPI----SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
           +    NTICGRHPI    S F  +LG     I+   ++    +Q    R
Sbjct: 304 IKSTGNTICGRHPIGLIMSAFEFVLGKDKENIRDLEIKAADETQTHLMR 352


>gi|349579198|dbj|GAA24361.1| K7_Yjr008wp [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 338

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 118/332 (35%), Positives = 177/332 (53%), Gaps = 51/332 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D +     + +E    
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +  +        +K++P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
            V I +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301

Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
              + IK ++  Y QSS   +  DSSVSY S 
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSG 333


>gi|145524772|ref|XP_001448213.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415747|emb|CAK80816.1| unnamed protein product [Paramecium tetraurelia]
          Length = 258

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 80/207 (38%), Positives = 132/207 (63%), Gaps = 5/207 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R A HAGSWY+  S  L  +++ WL +A   +    +++ ++ PHAGY+YSG  AA+++
Sbjct: 51  VREAVHAGSWYSSKSNELKIQINCWLEQAKAEVTTVAQLKALVVPHAGYAYSGPTAAFSY 110

Query: 62  GNID---PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
             +    P+   +VF+LGP H+ Y  +C L+   +Y+TP+G++ +DLE +++L   G FE
Sbjct: 111 KYLKKYPPSEKLKVFILGPCHYVYITQCCLTRQEIYETPLGNIKVDLETVKQLHEQGLFE 170

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             D   +E EHS+EM LP+LA +       I+PI+VG+++A++E  YG+L ++Y D    
Sbjct: 171 QSDKDAEEEEHSIEMQLPFLAHILGTDNFTIIPIMVGSIDAKSEEYYGKLLSEYFDMDDT 230

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVI 205
            F +S+DFCHWG++F Y +Y+   G I
Sbjct: 231 LFIISTDFCHWGTKFAYTYYNSADGEI 257


>gi|219564300|dbj|BAH03719.1| hypothetical protein [Saccharomyces pastorianus]
 gi|219564304|dbj|BAH03722.1| hypothetical protein [Saccharomyces pastorianus]
          Length = 338

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 118/336 (35%), Positives = 177/336 (52%), Gaps = 51/336 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D +     + +E    
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +  +        +K+ P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVFPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241

Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
            V I +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301

Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              + IK ++  Y QSS   +  DSSVSY S    +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337


>gi|290771211|emb|CBK33739.1| EC1118_1J11_2674p [Saccharomyces cerevisiae EC1118]
          Length = 336

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 115/329 (34%), Positives = 177/329 (53%), Gaps = 51/329 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D +     + +E    
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +  +        +K++P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY--------------------DKKHGVIH- 206
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                      +  ++H 
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLLHH 241

Query: 207 -----KSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
                +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L     
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301

Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSY 282
              + IK ++  Y QSS   +  DSSVSY
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSY 330


>gi|400595291|gb|EJP63096.1| memo-like protein [Beauveria bassiana ARSEF 2860]
          Length = 373

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 170/317 (53%), Gaps = 43/317 (13%)

Query: 13  WYTDNSKRLAEELDGWLREA-----GLPKS-PEVRGVIAPHAGYSYSGRAAAYAFGNIDP 66
           WY     +L  E D +L +      G P   P  R VIAPHAGY+YSGR +A+AF  +D 
Sbjct: 56  WYELAPFQLRWEFDEYLEDVPQTVDGSPLPIPGARAVIAPHAGYAYSGRCSAWAFRCLDL 115

Query: 67  TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF----ELMDI 122
           +   RVF+LGP+H++Y    ALST + Y TP+G   +D + + ++ A        E+ +I
Sbjct: 116 SQAKRVFVLGPTHYFYFSGLALSTFSQYSTPLGAFSVDQQTLRDIAAAAASAGAPEVRNI 175

Query: 123 CVDEA--EHSMEMHLPYLAKVFEGHLIK------IVPILVGAVNAENEAMYGRLFAKYVD 174
                  EHS+EM + +L +  E    +      IVP+LV   NA++E   GRL   Y+ 
Sbjct: 176 PRRRELDEHSLEMEIAFLYQRCEATFSRPEDFPTIVPMLVSG-NADDEKAIGRLLLPYLR 234

Query: 175 DPSNFFSVSSDFCHWGSRF-NYMHY----DKKHGV-------------IHKSIEALDKMG 216
           DP+N F VSSDFCHWG +F +Y  Y    D+K  V             IH+SI+ LD   
Sbjct: 235 DPTNAFVVSSDFCHWGRQFGDYRPYFPGGDRKRRVNLRDGDAPPTSPPIHESIKMLDDEA 294

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVF---LHMLGNCST---KIKIKFLRYEQSS 270
           ++ +ETG  DAF   L E DN++CGRHPI      L MLG  +    K K K +RY++S+
Sbjct: 295 IEAMETGVHDAFVANLEETDNSVCGRHPIGAMMAALEMLGENAPLEKKPKFKHVRYDRSA 354

Query: 271 QCKTKRDSSVSYASAAA 287
                  SSVSY SA A
Sbjct: 355 LLTDPTKSSVSYVSAYA 371


>gi|363752797|ref|XP_003646615.1| hypothetical protein Ecym_4787 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356890250|gb|AET39798.1| hypothetical protein Ecym_4787 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 329

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 120/327 (36%), Positives = 169/327 (51%), Gaps = 42/327 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYAF 61
           R A+H+GSWY  N   L+++L  +  EA       V+G   +I+PHAGYSY G   A A+
Sbjct: 3   RPATHSGSWYLSNPSSLSQQLGTFFTEAIKENGGPVKGTRAIISPHAGYSYCGETMAKAY 62

Query: 62  GNIDPT-SISRVFLLGPSHHYY-TPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-FE 118
             +D T  I RVF+LGPSHH+Y   K  +S   V +TP+G L +D+E +E+L +  + F 
Sbjct: 63  AKLDITDQIKRVFILGPSHHFYFKNKGLISRYEVLETPLGSLKVDVETVEKLLSNPELFA 122

Query: 119 LMDICVDEAEHSMEMHLP--YLAKVFEG---HLIKIVPILVGAVNAENEAMYGRLFAKYV 173
            +D   D+ EHS+EM  P  Y   +  G     I +VPIL+   + + +   G+   +Y+
Sbjct: 123 NLDQEADQDEHSLEMQYPMWYQTLIHNGLSPSKIGVVPILISHNSTKIDYTIGKCLLEYL 182

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------------------VIHK 207
               + F +SSDFCHWG RF Y  Y   H                            I+K
Sbjct: 183 KSGDSIFIISSDFCHWGRRFGYTGYVSSHEDIADAIAELTEIETLTTRSKLDHHCLKIYK 242

Query: 208 SIEALDKMGMDII--ETGDPD---AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIK 262
           SIE LD+  MDI+  + G  D   A+K+YL    NTICG  P  VFL +L   +T    K
Sbjct: 243 SIELLDRYAMDILAQKNGPKDTYAAWKEYLEVTGNTICGEKPTGVFLCLLSYLNTSHPFK 302

Query: 263 FLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +  Y QSS  ++  +SSVSYA+    V
Sbjct: 303 WSGYSQSSNVESLDESSVSYAAGYCVV 329


>gi|401409412|ref|XP_003884154.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325118572|emb|CBZ54123.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 254

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 113/294 (38%), Positives = 154/294 (52%), Gaps = 52/294 (17%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           +RRA+HA SWY+++   L +  D +L +A   K+ E V  ++ P                
Sbjct: 3   VRRAAHASSWYSNDPSELTQLFDRFLAKAE--KTEENVIALVCPR--------------- 45

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATV--YKTPIGDLPLDLEVIEELKATGKFELM 120
                   RVF+LGPSHH Y P CAL   +V  Y TP+G++PLD EV+E L+    FE +
Sbjct: 46  --------RVFVLGPSHHIYLPGCALPPPSVRAYATPLGEIPLDAEVLEALRTQNVFETI 97

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
            +  DE EHS+EM LP+L  V  G    +VPI+VG +                       
Sbjct: 98  ALRDDEEEHSIEMQLPFLRYVLRGQTFTLVPIVVGDLRRSGH------------------ 139

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            V+ DFCHWG RF Y +         I++ I  LDK G + IE  DP AF++Y  E  NT
Sbjct: 140 -VAVDFCHWGRRFRYTYLPPAPASLPIYERIRILDKEGANTIEQQDPAAFQEYYEETGNT 198

Query: 239 ICGRHPISVFLHML---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           ICG +PIS+FLH+L   G   +  K K L Y QSSQ + +  SSVSYA+ A+ +
Sbjct: 199 ICGHNPISIFLHLLEESGMPRSAFKTKLLHYSQSSQVEHESSSSVSYAAFASSL 252


>gi|256089011|ref|XP_002580612.1| hypothetical protein [Schistosoma mansoni]
          Length = 225

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 87/210 (41%), Positives = 128/210 (60%), Gaps = 5/210 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +  IR +SHAGSWY+ +   L+ +LD WL   E  +     VR +I PHAGY +SG  AA
Sbjct: 5   LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64

Query: 59  YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           +A+  I+P  I  +F++GPSH       CAL+  + Y+TP  +L ++  V   LK+   F
Sbjct: 65  HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDD 175
            +++   DEAEHS+EM LPY+A + +    +  IVPI+VG+++ E +  +G L + Y+ D
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLD 184

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVI 205
             N F +SSDFCHWG RF Y +Y+K  G I
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYNKSDGAI 214


>gi|256089009|ref|XP_002580611.1| hypothetical protein [Schistosoma mansoni]
          Length = 224

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 127/209 (60%), Gaps = 4/209 (1%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +  IR +SHAGSWY+ +   L+ +LD WL   E  +     VR +I PHAGY +SG  AA
Sbjct: 5   LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64

Query: 59  YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           +A+  I+P  I  +F++GPSH       CAL+  + Y+TP  +L ++  V   LK+   F
Sbjct: 65  HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE-GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
            +++   DEAEHS+EM LPY+A + +      IVPI+VG+++ E +  +G L + Y+ D 
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLDE 184

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVI 205
            N F +SSDFCHWG RF Y +Y+K  G I
Sbjct: 185 KNLFVISSDFCHWGKRFRYQYYNKSDGAI 213


>gi|256089013|ref|XP_002580613.1| hypothetical protein [Schistosoma mansoni]
          Length = 224

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 87/210 (41%), Positives = 128/210 (60%), Gaps = 5/210 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +  IR +SHAGSWY+ +   L+ +LD WL   E  +     VR +I PHAGY +SG  AA
Sbjct: 5   LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64

Query: 59  YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           +A+  I+P  I  +F++GPSH       CAL+  + Y+TP  +L ++  V   LK+   F
Sbjct: 65  HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDD 175
            +++   DEAEHS+EM LPY+A + +    +  IVPI+VG+++ E +  +G L + Y+ D
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLD 184

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVI 205
             N F +SSDFCHWG RF Y +Y+K  G I
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYNKSDGAI 214


>gi|38047939|gb|AAR09872.1| similar to Drosophila melanogaster CG8031, partial [Drosophila
           yakuba]
          Length = 166

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 86/164 (52%), Positives = 111/164 (67%), Gaps = 3/164 (1%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWYTD+   L+ +LD WL  A L   P  R +IAPHAGY+Y G   A+A+  +
Sbjct: 4   RRATHAGSWYTDSGAELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
            P  + R+F+LGPSHH     CALS A  Y+TP+ DL +D ++  EL+ TG+F  MD+  
Sbjct: 63  IPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDTQINAELEKTGQFSWMDMKT 122

Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYG 166
           DE EHS+EMHLPY+AKV E +     IVPILVG++N E EA YG
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYG 166


>gi|374110238|gb|AEY99143.1| FAGR321Wp [Ashbya gossypii FDAG1]
          Length = 330

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 118/324 (36%), Positives = 162/324 (50%), Gaps = 42/324 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
           IR A+HAGSWY  N  RL+++L  +L +A        RG   +I+PHAGYSY G   A  
Sbjct: 3   IRPATHAGSWYLSNPTRLSQQLSKFLNDATTEHGGVQRGTRAIISPHAGYSYCGETMAKC 62

Query: 61  FGNIDPT-SISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEELKATGK-F 117
           +  +D      RVF+LGPSHH++    AL S  T  +TP+G+L +D E + +L    + F
Sbjct: 63  YAKLDVNKDTKRVFVLGPSHHFHFRNIALVSRYTALETPLGNLQVDEETVSKLLGHARLF 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
             +D   D+ EHS+EM  P L +           +KIVPIL+   + E +   G++  +Y
Sbjct: 123 RPLDYDDDQKEHSLEMQYPMLHETLLQRGVRPSDVKIVPILISYNSTEVDYAIGKILIEY 182

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY-----DKKHGV---------------------IH 206
           +        VSSDFCHWG RF Y  Y     D +  V                     I+
Sbjct: 183 LQCNDTAVIVSSDFCHWGRRFGYTGYVSSVEDIQDAVADGTEIETLTTRSKMDHHSIQIY 242

Query: 207 KSIEALDKMGMDII-ETGDP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
           KSIE LD+  MDI+ +  D       +K YL    NTICG  PI V L      +T +  
Sbjct: 243 KSIELLDRYAMDILGQMADSREKYQIWKDYLEVTGNTICGEKPIGVLLCAFSYVNTPLSF 302

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
            +  Y QSS+ K+  DSSVSYA+ 
Sbjct: 303 HWCGYSQSSEVKSLEDSSVSYAAG 326


>gi|302309543|ref|NP_986987.2| AGR321Wp [Ashbya gossypii ATCC 10895]
 gi|299788408|gb|AAS54811.2| AGR321Wp [Ashbya gossypii ATCC 10895]
          Length = 330

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 118/324 (36%), Positives = 162/324 (50%), Gaps = 42/324 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
           IR A+HAGSWY  N  RL+++L  +L +A        RG   +I+PHAGYSY G   A  
Sbjct: 3   IRPATHAGSWYLSNPTRLSQQLSKFLNDATTEHGGVQRGTRAIISPHAGYSYCGETMAKC 62

Query: 61  FGNIDPT-SISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEELKATGK-F 117
           +  +D      RVF+LGPSHH++    AL S  T  +TP+G+L +D E + +L    + F
Sbjct: 63  YAKLDVNKDTKRVFVLGPSHHFHFRNIALVSRYTALETPLGNLQVDEETVSKLLGHARLF 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
             +D   D+ EHS+EM  P L +           +KIVPIL+   + E +   G++  +Y
Sbjct: 123 RPLDYDDDQKEHSLEMQYPMLHETLLQRGVRPSDVKIVPILISYNSTEVDYAIGKILIEY 182

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY-----DKKHGV---------------------IH 206
           +        VSSDFCHWG RF Y  Y     D +  V                     I+
Sbjct: 183 LQCDDTAVIVSSDFCHWGRRFGYTGYVSSVEDIQDAVADGTEIETLTTRSKMDHHSIQIY 242

Query: 207 KSIEALDKMGMDII-ETGDP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
           KSIE LD+  MDI+ +  D       +K YL    NTICG  PI V L      +T +  
Sbjct: 243 KSIELLDRYAMDILGQMADSREKYQIWKDYLEVTGNTICGEKPIGVLLCAFSYVNTPLSF 302

Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
            +  Y QSS+ K+  DSSVSYA+ 
Sbjct: 303 HWCGYSQSSEVKSLEDSSVSYAAG 326


>gi|238484865|ref|XP_002373671.1| DUF52 domain protein [Aspergillus flavus NRRL3357]
 gi|220701721|gb|EED58059.1| DUF52 domain protein [Aspergillus flavus NRRL3357]
          Length = 346

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 116/345 (33%), Positives = 161/345 (46%), Gaps = 66/345 (19%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
           R ASHAGSWY+DN + L  +LDGWL  A +P S         P  R +IAP+   + S  
Sbjct: 4   REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPYVPENISAP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYK-----TPIGDLPLDL----- 105
               ++   DP  +  +  LGP  H +T        + +      T I   PL       
Sbjct: 62  CTVNSYA--DPLDMRAIHTLGPVPHTHTRLWIFRKPSEFSFLDHLTIITSRPLLFLNLLR 119

Query: 106 ----------EVIEELKATGK---FELMDICVDEAEHSMEMHLPYLAKVFEGHLIK---- 148
                      +   +K+ G    F  M   +DE EHS+E+HLPY+ ++ + H       
Sbjct: 120 TIPRFPTSLFHLTRTVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTS 179

Query: 149 ----IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------ 198
               +VPI+VG+ +A  E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y      
Sbjct: 180 QYPPLVPIMVGSTSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQSVS 239

Query: 199 -----------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
                        +   IH+SI A D   M  I TG    F + +    NT+CGRHPI V
Sbjct: 240 TGQALQRRDRISSREPAIHESISAFDIATMAAISTGVTANFLETIQTTGNTVCGRHPIGV 299

Query: 248 FLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            +  +   ++     K K  F+RYE+SS      DSSVSY SA A
Sbjct: 300 IMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSVSYVSAFA 344


>gi|221053714|ref|XP_002258231.1| hypothetical protein in Apicomplexan species [Plasmodium knowlesi
           strain H]
 gi|193808064|emb|CAQ38768.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 296

 Score =  177 bits (448), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 102/297 (34%), Positives = 156/297 (52%), Gaps = 8/297 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   RRA HAGSWY++N   L   +D         K   V+  I PHAGY Y+ +  ++ 
Sbjct: 1   MTNYRRAYHAGSWYSNNGTALKNSIDALFESINCQKQ-NVKAAICPHAGYDYALKTNSHV 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I+  ++  +F+LGP+HH Y   C       Y+TP G L ++ EVI ++       L 
Sbjct: 60  YACINVENVKNIFILGPNHHIYNKGCLFPQVDKYETPFGFLQINKEVISDIMKNDVDHLF 119

Query: 121 DIC---VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM--YGRLFAKYVDD 175
           D      DE EHS+EM LP +  + +   IKI+PI VG +  + + +  + +   KY  D
Sbjct: 120 DFIGDEDDEEEHSIEMQLPLIKYIIKEKDIKIIPIYVGCIGNDIQKISRFCKPIKKYFQD 179

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH--KSIEALDKMGMDIIETGDPDAFKKYLL 233
             N F  SSDFCH+G RF++ +  +K+   H  K +E +DK   +II       F  YL 
Sbjct: 180 EENLFLFSSDFCHYGRRFSFTNILQKYNDTHIFKQVENMDKDAANIISRHSVADFIDYLN 239

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           +  NTICG +PI + L +L +    +  K + Y QS+Q ++  +SSVSYA   + ++
Sbjct: 240 KTHNTICGSNPIKMMLRLLEDFPGNVSTKLMHYSQSNQARSTNESSVSYAGMVSFIN 296


>gi|156098107|ref|XP_001615086.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148803960|gb|EDL45359.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 296

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 104/297 (35%), Positives = 158/297 (53%), Gaps = 8/297 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   R+A HAGSWY+ + + L   +D    E+   K   V+  I PHAGY Y+ +  ++ 
Sbjct: 1   MTSYRKAYHAGSWYSSSGRALKNSIDTHF-ESISCKKQSVKAAICPHAGYDYALQTNSHV 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I   ++  +F+LGP+HH Y   C       Y+TP G L ++ EVI E+       L 
Sbjct: 60  YACISVENVKNIFILGPNHHIYNKGCLFPHVEKYETPFGFLQINREVISEILQNDVDHLF 119

Query: 121 DIC---VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM--YGRLFAKYVDD 175
           D      DE EHS+EM LP +  + +   IKI+PI VG +  + + +  +     KY  D
Sbjct: 120 DFIGDEDDEEEHSIEMQLPLIKYIIKEKDIKIIPIYVGCIGNDIQKIDRFCNPLKKYFQD 179

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH--KSIEALDKMGMDIIETGDPDAFKKYLL 233
             N F  SSDFCH+G RF++ +  +K+   H  K +E +D+    II   D   F +YL 
Sbjct: 180 EGNLFLFSSDFCHYGRRFSFTNILQKYNDTHIFKQVENMDRDAASIISRHDIADFIEYLN 239

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           +  NTICG +PI + L +L +   K+  K + Y QS+Q ++  DSSVSYA   + ++
Sbjct: 240 KTHNTICGSNPIKMMLQLLQDLPGKVSTKLMHYSQSNQARSANDSSVSYAGMVSFIN 296


>gi|156844360|ref|XP_001645243.1| hypothetical protein Kpol_1060p41 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156115902|gb|EDO17385.1| hypothetical protein Kpol_1060p41 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 331

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/330 (34%), Positives = 172/330 (52%), Gaps = 47/330 (14%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV----RGVIAPHAGYSYSGRAAAYA 60
           R ASH+ SWY+D    L+++L G+L   G   S E+    R +I+PHAGY Y G   A++
Sbjct: 4   RPASHSSSWYSDKPDELSDQLQGYLN--GASSSAELKKDARLIISPHAGYKYCGSTMAHS 61

Query: 61  FGNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
           + ++D   +I R+F+LGPSH+ Y+  +  LS      TP+GDL +D    E L  +    
Sbjct: 62  YASLDLNENIERIFILGPSHNVYFRNQIFLSAFETLSTPLGDLNVDTAYCEMLINSNNSL 121

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHL-----IKIVPILVGAVNAENEAMYGRLFAK 171
           F  MD+  D +EHS+EM  P L K  +        IK++P++V     + +        K
Sbjct: 122 FAYMDMETDISEHSLEMQFPMLVKTIQWRKLSLDKIKVIPMMVSHNTEDIDTKIAEFILK 181

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHY------------DK-------------KHGV-I 205
              DP N F +SSDFCHWG RF +  Y            D+             +H V I
Sbjct: 182 KFRDPKNLFIISSDFCHWGRRFRFTGYVGSPEELQDSINDEVEIEMLTSRSKLPRHRVPI 241

Query: 206 HKSIEALDKMGMDIIE-TGDPDAF---KKYLLEYDNTICGRHPISVFLHM--LGNCSTKI 259
            +SI+ LD+  MD+++ + +P+ +   K+YL    NTICG  PI + L +  L     +I
Sbjct: 242 WESIKFLDEYAMDLLDKSENPEKYKDWKQYLELTGNTICGAKPIQLILKVIELEKYPERI 301

Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           + ++ +Y Q+SQ +   DSSVSYAS  A +
Sbjct: 302 RFEWPKYSQTSQVENVNDSSVSYASGYAII 331


>gi|366992726|ref|XP_003676128.1| hypothetical protein NCAS_0D01850 [Naumovozyma castellii CBS 4309]
 gi|342301994|emb|CCC69766.1| hypothetical protein NCAS_0D01850 [Naumovozyma castellii CBS 4309]
          Length = 331

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 115/314 (36%), Positives = 159/314 (50%), Gaps = 43/314 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR A+HAGSWY+ N+  L+ +L  +L     P     R +I+PHAGY Y G   AY+F +
Sbjct: 3   IRPATHAGSWYSKNATDLSHQLQTYLTSTKKPIVSNARVIISPHAGYRYCGHTMAYSFAS 62

Query: 64  IDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KAT-GK--F 117
           +  T  + RVF+LGPSHH Y+  +  +S     +TP+G+  +D +V ++L KA  GK  F
Sbjct: 63  LALTKKVKRVFILGPSHHIYFKNEVFVSGFDAIETPLGEFKVDKDVCKKLVKAKHGKKLF 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
             M+  VD  EHS+EM    LA+       E   +K++P++V   + E +   G   AKY
Sbjct: 123 AFMNQDVDMDEHSLEMQFSMLAQTLKWRDIELEQVKVIPMMVSHNSTEVDMEVGEELAKY 182

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHGV-IH 206
           + DP N F +SSDFCHWG RF Y  Y                          K H V I 
Sbjct: 183 LVDPENLFILSSDFCHWGRRFEYTGYVGSEGELKEAMEEETEIEMLTARSKLKHHQVPIW 242

Query: 207 KSIEALDKMGMDIIETGDPD----AFKKYLLEYDNTICGRHPISVFLHMLGNC--STKIK 260
           KSIE LD+  M  +     +    A+K+YL    NTICG  PI V L  L       K+ 
Sbjct: 243 KSIEILDRAAMHTLSESTNENRYKAWKRYLEITGNTICGERPIGVILAALSYLPKDEKVT 302

Query: 261 IKFLRYEQSSQCKT 274
            ++  Y QSSQ + 
Sbjct: 303 FEWPNYSQSSQVQN 316


>gi|50285415|ref|XP_445136.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49524439|emb|CAG58036.1| unnamed protein product [Candida glabrata]
          Length = 332

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 117/329 (35%), Positives = 166/329 (50%), Gaps = 45/329 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR A+HAG+WY+ ++K L+++L  +L  +      + R +I+PHAGY Y G   A+A
Sbjct: 1   MVSIRSATHAGTWYSAHAKDLSQQLLQYLESSSRECVSKARIIISPHAGYRYCGPTMAHA 60

Query: 61  FGNID-PTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL---KATG 115
           + ++D      R+F+LGPSHH Y+  +  +S     +TP+G+L +D EV E L      G
Sbjct: 61  YASLDLKKKPKRIFILGPSHHVYFKNEVLVSHFQEVETPLGNLNVDTEVCETLVGGSGDG 120

Query: 116 K--FELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRL 168
           +  F  MD   DE EHS+EM  P LA+      I     K+VP++V   +   +   G  
Sbjct: 121 RKLFRYMDKDTDEDEHSLEMQFPMLAQALLWRQIPLSEVKVVPLMVSHNSVNVDTSVGST 180

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHG 203
            AKY+ D  N F +SSDFCHWG RF+Y  Y                            H 
Sbjct: 181 LAKYMKDEENLFIISSDFCHWGRRFSYTGYVGSTEELENVIAEETEVEMLTTRSKLSHHQ 240

Query: 204 V-IHKSIEALDKMGMDIIETG--DPD----AFKKYLLEYDNTICGRHPISVFLHMLGNCS 256
           V I KSIE +DK  M ++      P+     +K+YL    NTICG  PISV    L +  
Sbjct: 241 VPIWKSIEIMDKYAMHVLSRNAKSPENLYSEWKQYLEISGNTICGEKPISVIACALASVG 300

Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            K   ++  Y QSS   +  +SSVSYA+ 
Sbjct: 301 YK-DFEWPAYSQSSNVNSINESSVSYAAG 328


>gi|2425141|gb|AAB70854.1| similar to C. elegans CEESS08F encoded by GenBank Accession Number
           U64857 [Dictyostelium discoideum]
          Length = 168

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 85/172 (49%), Positives = 116/172 (67%), Gaps = 5/172 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  +HAGSWY DN+++L ++L  WL EA    +  V+ +IAPHAGYSYSGRAAAYA+ N
Sbjct: 2   IRNCTHAGSWYLDNARKLEKQLSDWLSEAS-RLNQNVKSIIAPHAGYSYSGRAAAYAYIN 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P +  RVF+LGPSHH Y   C L+    ++TPIG+L +D +   +L  TG F      
Sbjct: 61  LIPENYKRVFILGPSHHVYMKTCGLTKLDTWETPIGNLKVDKDTTNKLFDTGSFIWNTKS 120

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           VDE EHS+E+ LPY+AKV E     IVPI+VG+++ + E +YG++ A Y DD
Sbjct: 121 VDEDEHSLELQLPYIAKVAEN----IVPIMVGSLSIDLEELYGKILAPYFDD 168


>gi|239615278|gb|EEQ92265.1| DUF52 domain-containing protein [Ajellomyces dermatitidis ER-3]
 gi|327353638|gb|EGE82495.1| DUF52 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 393

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 95/223 (42%), Positives = 129/223 (57%), Gaps = 30/223 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
           M   R A+HAGSWY+D+   L+ +L+ WL +        G    P  R +IAPHAGY+YS
Sbjct: 1   MSSSREATHAGSWYSDHEPTLSSQLNKWLSQVPNELPGLGRLPVPGARIIIAPHAGYAYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP--LDLEVIEEL 111
           G  AA+A+  +D +   R+FL+GPSHH+Y    AL   T Y TP+   P  LD ++I  L
Sbjct: 61  GPCAAWAYKMLDLSKAKRIFLIGPSHHHYLSTIALPQLTRYLTPLSPDPLLLDTDLITHL 120

Query: 112 KATG---KFELMDICVDEAEHSMEMHLPYLAKVFEGHLI-------------KIVPILVG 155
            +T    +F  M   VD AEHS+E+HLPY+      HLI             ++VP++VG
Sbjct: 121 LSTTTNPRFTTMSPAVDSAEHSLELHLPYI-----HHLIRTLYPTKPATAYPRLVPMMVG 175

Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
           + +A  EA +G L A Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 176 STSAATEAAFGALLAPYLADETNAFIVSSDFCHWGLRFGYTYY 218



 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 46/105 (43%), Gaps = 22/105 (20%)

Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST------- 257
           IH+SI A D   M  I +G+   F   L +  NT+CGRHPI V +  L            
Sbjct: 287 IHESISACDVACMKAIASGETALFLDALRKTGNTVCGRHPIGVVMAGLEAVRKGSGEGEG 346

Query: 258 ---------------KIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
                          K K  F+RYE+SS      DSSVSY SA A
Sbjct: 347 EEDNESGGNGDRELEKGKFYFVRYERSSDVVAVADSSVSYVSAFA 391


>gi|261187552|ref|XP_002620196.1| DUF52 domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239594159|gb|EEQ76740.1| DUF52 domain-containing protein [Ajellomyces dermatitidis SLH14081]
          Length = 340

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 95/223 (42%), Positives = 129/223 (57%), Gaps = 30/223 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
           M   R A+HAGSWY+D+   L+ +L+ WL +        G    P  R +IAPHAGY+YS
Sbjct: 1   MSSSREATHAGSWYSDHEPTLSSQLNKWLSQVPNELPGLGRLPVPGARIIIAPHAGYAYS 60

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP--LDLEVIEEL 111
           G  AA+A+  +D +   R+FL+GPSHH+Y    AL   T Y TP+   P  LD ++I  L
Sbjct: 61  GPCAAWAYKMLDLSKAKRIFLIGPSHHHYLSTIALPQLTRYLTPLSPDPLLLDTDLITHL 120

Query: 112 KATG---KFELMDICVDEAEHSMEMHLPYLAKVFEGHLI-------------KIVPILVG 155
            +T    +F  M   VD AEHS+E+HLPY+      HLI             ++VP++VG
Sbjct: 121 LSTTTNPRFTTMSPAVDSAEHSLELHLPYI-----HHLIRTLYPTKPATAYPRLVPMMVG 175

Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
           + +A  EA +G L A Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 176 STSAATEAAFGALLAPYLADETNAFIVSSDFCHWGLRFGYTYY 218



 Score = 41.2 bits (95), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPI 245
           IH+SI A D   M  I +G+   F   L +  NT+CGRHPI
Sbjct: 287 IHESISACDVACMKAIASGETALFLDALRKTGNTVCGRHPI 327


>gi|225556304|gb|EEH04593.1| DUF52 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 320

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 92/215 (42%), Positives = 128/215 (59%), Gaps = 21/215 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+D+   L+ +L+ WL +      G+ + P    R +IAPHAGY+YSG  A
Sbjct: 44  REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPHAGYAYSGPCA 103

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELKATG 115
           A+A+  +D +   R+FLLGPSHH++    AL   T YKTP+    LPLD ++I  L +T 
Sbjct: 104 AWAYKALDLSKAKRIFLLGPSHHHHLSTLALPQLTSYKTPLSPDPLPLDTDLITHLLSTT 163

Query: 116 K----FELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEA 163
                F  M   VD AEHS+E+HLPY+  +              ++VP++VG+ +   EA
Sbjct: 164 STNPHFTTMSPPVDSAEHSLELHLPYIHHLLRTLYPTRAPAAYPRLVPMMVGSTSPATEA 223

Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
            +G L A Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 224 AFGALLAPYLADDTNAFVVSSDFCHWGLRFGYTYY 258


>gi|123498609|ref|XP_001327441.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121910370|gb|EAY15218.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 286

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 93/256 (36%), Positives = 139/256 (54%), Gaps = 6/256 (2%)

Query: 35  PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVY 94
           P   +V+ VI+PHAGY +    A++AF  IDP+  SRV ++GPSH      C +S A  +
Sbjct: 33  PIEGKVKAVISPHAGYRHCAETASHAFATIDPSLYSRVIIMGPSHRLPIDYCTISEAKSF 92

Query: 95  KTPIGDLPLDLEVIEEL--KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI 152
           +TP   L +D  + EEL  K    F+ + I     EHS+E+ LP++  +F+G  + +VPI
Sbjct: 93  ETPTRSLEID-PIAEELTSKYGSIFKKLSIETSNREHSLELMLPWIDYIFKGKNVTVVPI 151

Query: 153 LVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEAL 212
           +VG ++             Y++DPS    +SSDF HWGSRF+Y +  +K G I + I A+
Sbjct: 152 MVGHLDQTKLEQAVSALKPYINDPSTLLVISSDFTHWGSRFSYTYLPEKDGEIWEKISAI 211

Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQC 272
           D   M+ I T     F+ Y+     TICGR+PI++ +          ++ +  Y QSS  
Sbjct: 212 DHGAMEAISTCKAKNFQDYIKSTRATICGRNPITIAMMAFDK---PYRVDWPHYSQSSHA 268

Query: 273 KTKRDSSVSYASAAAK 288
           K   DSSVSYA    +
Sbjct: 269 KNMADSSVSYAGGVIR 284


>gi|346320315|gb|EGX89916.1| DUF52 domain protein [Cordyceps militaris CM01]
          Length = 370

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 113/314 (35%), Positives = 159/314 (50%), Gaps = 43/314 (13%)

Query: 13  WYTDNSKRLAEELDGWLREA-----GLPKS-PEVRGVIAPHAGYSYSGRAAAYAFGNIDP 66
           WY     +L    D +L E      G P   P  R +IAPHAGY+YSGR +A+AF  +D 
Sbjct: 56  WYELAPFQLRWNFDEYLDEVPETIQGSPLPIPGARAIIAPHAGYAYSGRCSAWAFRCLDL 115

Query: 67  TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-------KFEL 119
               RVF+LGP+H++Y    ALST   Y TP+G   +D + + +++A             
Sbjct: 116 ARAKRVFVLGPTHYFYFRGLALSTFASYSTPLGAFRVDQQTLRDVRAAADALGGAPDVRN 175

Query: 120 MDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
           M    +  EHS+EM + +L +  E           IVP+LV   +A++E   GRL   Y+
Sbjct: 176 MPRRRELDEHSLEMEIAFLYQRCEAVFASPADFPAIVPMLVSG-DADDERAVGRLLLPYL 234

Query: 174 DDPSNFFSVSSDFCHWGSRF-NYMHY----DKKHGV------------IHKSIEALDKMG 216
            DP+N F VSSDFCHWG +F +Y  Y    D+   V            IH+SI+ LD   
Sbjct: 235 RDPANAFVVSSDFCHWGRQFGDYRPYFRGGDRARRVNLRDADAPPSPPIHESIKMLDDEA 294

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML------GNCSTKIKIKFLRYEQSS 270
           +  +ETG  DAF   L E +N++CGRHPI   +  L      G    + + K +RY++S+
Sbjct: 295 IAAMETGSHDAFVANLEETENSVCGRHPIGAMMAALELLGADGAPEDRPRFKHVRYDRSA 354

Query: 271 QCKTKRDSSVSYAS 284
                  SSVSY S
Sbjct: 355 LLTEPTKSSVSYNS 368


>gi|328853086|gb|EGG02227.1| hypothetical protein MELLADRAFT_38671 [Melampsora larici-populina
           98AG31]
          Length = 268

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 99/262 (37%), Positives = 142/262 (54%), Gaps = 34/262 (12%)

Query: 5   RRASHAGSWYTDNSK-RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           R A+HAGSWYT + +    EE   W + A        + VIAPH GY YSG  AA+A+  
Sbjct: 4   RSATHAGSWYTSSRQAETTEETQDWRKVAMPVPISHCKAVIAPHVGYLYSGLTAAWAYKT 63

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL---- 119
           IDP+ I       P  H Y   CALS    Y+ P+G+L +D++V  +  +   F L    
Sbjct: 64  IDPSVIY------PLSHVYLDGCALSGCEAYEIPLGNLEIDIQVSSQSVSIFIFLLGHLS 117

Query: 120 --------MDICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGAVNAENEAMYGRLFA 170
                   MDI  D+AEHS+E+HLPY+  + +    I IVPIL+G++++E E +YG++  
Sbjct: 118 YLGIGAMEMDIKTDKAEHSIELHLPYVRHISKSFQTITIVPILIGSISSEEELIYGKVLR 177

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY-DKKHGV----------IHKSIEALDKMGMDI 219
            Y+  P N F +SSDFCH    F+Y +Y D    +          I++SIEALD+  + I
Sbjct: 178 PYLLSPENLFVISSDFCHC---FSYTYYKDPARKLSSGASPPATPIYQSIEALDQEALKI 234

Query: 220 IETGDPDAFKKYLLEYDNTICG 241
           +E+     F +YL +  N ICG
Sbjct: 235 LESLSHSEFSRYLSQTKNNICG 256


>gi|323347913|gb|EGA82173.1| YJR008W-like protein [Saccharomyces cerevisiae Lalvin QA23]
          Length = 312

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 105/295 (35%), Positives = 159/295 (53%), Gaps = 47/295 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-----KAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D ++ + L        
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKXYPEN 121

Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
           GK  F+ MD   D AEHS+EM LP L +           +K++P++V   + + +   G 
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLXWREISLDTVKVIPMMVSHNSVDVDRCIGN 181

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
           + ++Y+ DP+N F VSSDFCHWG RF Y  Y                            H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLXHH 241

Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHML 252
            V I +SIE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSAL 296


>gi|226288183|gb|EEH43696.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 407

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 123/374 (32%), Positives = 161/374 (43%), Gaps = 114/374 (30%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+D+   L+++LD WL +        G    P  R +IAP           
Sbjct: 55  REATHAGSWYSDHEPTLSKQLDQWLAQVPDELPDIGPLPIPGARIIIAPE---------- 104

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELKATG 115
                        R+FL+GPSHH+     AL   T Y TP+    LPLD E+I  L  + 
Sbjct: 105 -------------RIFLIGPSHHHILSTLALPQLTSYLTPLSREPLPLDTELIAHLLTST 151

Query: 116 -----KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
                +F  M   VD AEHS+E+HLPY+  +               +VPI+VG+     E
Sbjct: 152 NASNHRFTTMSPAVDSAEHSLELHLPYIHHLLRRLYPTKPTAQYPSLVPIMVGSTLPATE 211

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
           A +G + A Y+ DPSN F +SSDFCHWG RF + +Y                        
Sbjct: 212 AAFGSVLAPYLADPSNAFIISSDFCHWGLRFGHTYYVPNARTPAEPLPLSYPQLPLPSTS 271

Query: 199 ---------------------DK---KHGV-----IHKSIEALDKMGMDIIETGDPDAFK 229
                                DK   + GV     IH+SI A D   M  I +G+   F 
Sbjct: 272 DAGVAAEAIEAVSGGISLKVKDKDRVRKGVAGVPLIHESISACDVACMTAIASGETRRFL 331

Query: 230 KYLLEYDNTICGRHPISVFLHML----GNCS------------TKIKIKFLRYEQSSQCK 273
             L +  NT+CGRHPI V +  L    GN               K +  F+RYE+SS   
Sbjct: 332 DALRKTGNTVCGRHPIGVVIAGLEIVRGNKEGGGKVDAVDGEMQKGRFHFIRYERSSDVV 391

Query: 274 TKRDSSVSYASAAA 287
              DSSVSY SA A
Sbjct: 392 AVGDSSVSYVSAFA 405


>gi|358253784|dbj|GAA53777.1| hypothetical protein CLF_111104 [Clonorchis sinensis]
          Length = 166

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 80/162 (49%), Positives = 105/162 (64%), Gaps = 8/162 (4%)

Query: 133 MHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWG 190
           M LPY+AKV E       IVPI+VG+++   EA+YGR+ A Y+ +P N F +SSDFCHWG
Sbjct: 1   MQLPYVAKVMERRRGAFTIVPIVVGSLSPAREAVYGRILAPYLLNPENLFVISSDFCHWG 60

Query: 191 SRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
            RF Y HYD+  G I +SI+ALD+ GM  IE   P  F  YL +Y NTICGRHPI + L 
Sbjct: 61  RRFQYTHYDQHKGAIWQSIQALDEEGMHAIERLSPADFTAYLDQYGNTICGRHPIGILLQ 120

Query: 251 MLGNC------STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           M+ +       ST+   KF++Y QS  C++  DSSVSYA+ +
Sbjct: 121 MVESLRKGSSPSTQFNFKFIKYTQSEHCQSMNDSSVSYAAGS 162


>gi|410079789|ref|XP_003957475.1| hypothetical protein KAFR_0E01860 [Kazachstania africana CBS 2517]
 gi|372464061|emb|CCF58340.1| hypothetical protein KAFR_0E01860 [Kazachstania africana CBS 2517]
          Length = 356

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 117/349 (33%), Positives = 174/349 (49%), Gaps = 63/349 (18%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           IR+ASHAG+WY  NS +L  +L   L    L +  +  R +I+PHAGY Y G   AY++ 
Sbjct: 7   IRKASHAGTWYLQNSTQLQTQLQSNLNATKLVQPIKGSRIIISPHAGYRYCGNVMAYSYA 66

Query: 63  NIDPTSIS-------RVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           ++D T          RVF+LGPSHH Y+  K  +S+    +TP+G+L +D EV++EL   
Sbjct: 67  SMDLTKFKNASDDKIRVFILGPSHHIYFRDKILVSSFEEVETPLGNLTVDREVVDELIKK 126

Query: 115 GK-----FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAM 164
                  FEL+D  VD  EHS+EM    LA+           +K++P+LV   +   ++ 
Sbjct: 127 NHNDKKVFELLDKDVDLDEHSLEMQFSMLAQTLHWRSVDLSKVKVIPMLVSHNSETVDSK 186

Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-----DKKHGV--------------- 204
                  Y++D  +FF +SSDFCHWG RF Y  Y     D    +               
Sbjct: 187 VAGFLKPYLNDGKSFFIISSDFCHWGRRFEYTGYVGNEDDLNEAMNEETEIEMLTSRSKL 246

Query: 205 ------IHKSIEALDKMGMDII---------ETGDPD-------AFKKYLLEYDNTICGR 242
                 I KSIE LDK  M ++         ++ +P+        +K+YL    NTICG+
Sbjct: 247 SHHQISIWKSIELLDKYAMKVLSGEEIKKETDSNEPNEDLTLYEKWKEYLDITGNTICGQ 306

Query: 243 HP--ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +P  + + L    N   K + ++L Y QSS+  +  +SSVSYAS   ++
Sbjct: 307 NPIKLILKLLEELNEDEKFEFEWLNYSQSSKVSSIEESSVSYASGCVRI 355


>gi|313227292|emb|CBY22438.1| unnamed protein product [Oikopleura dioica]
          Length = 176

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 82/173 (47%), Positives = 113/173 (65%), Gaps = 6/173 (3%)

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
           MD   DE EHS+EM LP+LA +F+  L  + +VPI+VG++  E EA Y ++FAKY++DP 
Sbjct: 1   MDAPTDEEEHSIEMQLPFLAHIFKNRLEEVSVVPIMVGSIKQEKEADYAQVFAKYLNDPE 60

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
             F +SSDFCHWG RF Y +  +++  I +SI+A+D+ GMD I + +   F  YL    N
Sbjct: 61  VVFVISSDFCHWGKRFRYTYRLEEYEQIFESIDAVDREGMDHIASKNLSNFHNYLRRTKN 120

Query: 238 TICGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           TICGR+PI + L  +     K     +IKFL+Y QSSQ ++  DSSVSYASA 
Sbjct: 121 TICGRNPICLLLATINLLEQKGQMQSQIKFLKYAQSSQVRSPNDSSVSYASAV 173


>gi|403218503|emb|CCK72993.1| hypothetical protein KNAG_0M01400 [Kazachstania naganishii CBS
           8797]
          Length = 334

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 110/325 (33%), Positives = 159/325 (48%), Gaps = 43/325 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRA+HAGSWY  +  +L + +    +  G       R V+APHAGY Y       A+ ++
Sbjct: 10  RRATHAGSWYPSDGGQLRDTV---FKHRGQGPRAGARVVVAPHAGYKYCSSVLWPAYESL 66

Query: 65  DPT----SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFE 118
           D +    S  RVF+LGPSHH Y+     +S  +   TP+G LP+D E  E L +  G F+
Sbjct: 67  DTSCPGSSHLRVFVLGPSHHAYFKDTFLVSGYSELDTPLGPLPVDTEYCESLVQGGGPFK 126

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLI-------KIVPILVGAVNAENEAMYGRLFAK 171
            M   VD AEHS+EM  P L  + E   +       ++VPI+V   + E  A    + A+
Sbjct: 127 RMSEAVDLAEHSLEMQYPALRCLLEQRGLSSGARGAQVVPIMVSHASREKYASLAAVLAE 186

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHY--------------------DKKHGVIH----- 206
            +        VSSDFCHWG RF Y  Y                      +  + H     
Sbjct: 187 QLASQRAIVVVSSDFCHWGRRFEYTGYVGSQEELTTALAEETEVEELTARSKLAHHRLPI 246

Query: 207 -KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK-IKFL 264
            +S+E LD+  M ++E+ D D + +YL    NT+CGR+P+ V L  L          +++
Sbjct: 247 WQSVELLDRGAMHVLESNDLDKWTEYLAITGNTVCGRNPLCVLLLALQQHRGGAPHWEWV 306

Query: 265 RYEQSSQCKTKRDSSVSYASAAAKV 289
           +Y QSS+ +    SSVSYASA  K+
Sbjct: 307 KYAQSSKVQDITQSSVSYASAYIKL 331


>gi|366998487|ref|XP_003683980.1| hypothetical protein TPHA_0A04710 [Tetrapisispora phaffii CBS 4417]
 gi|357522275|emb|CCE61546.1| hypothetical protein TPHA_0A04710 [Tetrapisispora phaffii CBS 4417]
          Length = 341

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 108/339 (31%), Positives = 164/339 (48%), Gaps = 54/339 (15%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           R A+H+GSWY+D+ +RL  EL   L +A   +      R +I+PHAGY Y G   AY++ 
Sbjct: 3   RPATHSGSWYSDDLRRLDAELSRHLVDASKNVNMVKGSRLIISPHAGYKYCGSTMAYSYA 62

Query: 63  NIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKA-----TG 115
           ++D  S + R+ ++GPSHH Y+  +  +S  +  +TP+G+L +D + IE+         G
Sbjct: 63  SLDLNSNVKRIIIMGPSHHIYFKNQIYVSGFSEVQTPLGNLHVDKDFIEDKLMGNSIEKG 122

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRLFA 170
            F  MD   D +EHS+EM  P + K  +   I     K+VP++V   +   +     +  
Sbjct: 123 LFAYMDEETDLSEHSLEMQFPMVVKTLQFRNIDVDKVKVVPMIVSHNSEAVDTKLANVLK 182

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY--------------------DKKHGVIHK--- 207
               +P   F VSSDFCHWG RF Y  Y                      +   +H+   
Sbjct: 183 NEFMNPETIFIVSSDFCHWGRRFQYTGYVGNSEELKDSLEEQTEIEMLTSRSKSLHREVK 242

Query: 208 ---SIEALDKMGMDIIETGDPD----AFKKYLLEYDNTICGRHPISVFLHML-------- 252
              SI  +DK GM ++   D       +K Y+    NT+CG +PI V L  +        
Sbjct: 243 IWESIMIIDKFGMKLLSDSDNSDKYKNWKNYIEISGNTVCGTNPIKVILKTIELIANEVV 302

Query: 253 --GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
             G+ S+KI   +  Y QSS+  +  +SSVSYAS  A +
Sbjct: 303 SSGSDSSKIDFYWPSYSQSSKVTSVNESSVSYASGYAII 341


>gi|340502344|gb|EGR29045.1| mediator of cell motility 1, putative [Ichthyophthirius
           multifiliis]
          Length = 230

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/234 (35%), Positives = 121/234 (51%), Gaps = 38/234 (16%)

Query: 21  LAEELDGWLREAGL--PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPS 78
           L  EL G+L +  L      +++ +I+PH G  Y G   AY++  I+P +I R+FLLGPS
Sbjct: 2   LEIELQGYLDKVNLNVQNIQKLKALISPHTGLKYCGAVQAYSYKYINPQNIQRIFLLGPS 61

Query: 79  HHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--FELMDICVDEAEHSMEMHLP 136
           H  +   C LS+   ++TP G++ +D+E+I EL    K  F  +D   ++ EHS+E+ LP
Sbjct: 62  HRIFIQGCGLSSMDFFETPFGNIEIDVEIINELYKNQKDYFVKLDQETEQNEHSLELQLP 121

Query: 137 YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM 196
            L K        ++PI+VG  N E +  YG++F+KY DD S  F +SSDFCHWG      
Sbjct: 122 MLKKQMGDKQFILIPIMVGQTNQELDVQYGKIFSKYFDDDSTLFIISSDFCHWG------ 175

Query: 197 HYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
                       IE                 F+ YL    NTICGR  I++ L+
Sbjct: 176 -----------QIE-----------------FQNYLNSTKNTICGRRGINILLN 201


>gi|123427666|ref|XP_001307302.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121888923|gb|EAX94372.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 310

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 90/289 (31%), Positives = 148/289 (51%), Gaps = 11/289 (3%)

Query: 4   IRRASHAGSWY-TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR+A+HAG WY TD+  ++  E      +        ++GVI+PH+ Y    R AAY+F 
Sbjct: 4   IRKATHAGEWYPTDDQLKIMLEASFHYAQPKPNADLGLKGVISPHSCYQVCLRTAAYSFS 63

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT--GKFELM 120
            I+P    R+ +LG  HH       +S AT  +TP G+L +D EV E+L        + M
Sbjct: 64  CINPDKFERIIILGTCHHIALKAGLVSHATEVETPFGNLQVDTEVTEKLATEYGEAIQWM 123

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D  VDE EHS+EM  P +  +++   +KI+P+L+G+++   E       +  + D   FF
Sbjct: 124 DQKVDENEHSLEMQYPLIKYIWQDRPVKIIPMLIGSLSEPREIEIAEALSPIITDEKTFF 183

Query: 181 SVSSDFCHWGSRFNYMHYDK-KHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            +SSDF HWG  F++      K   + + ++  D+  + II   + + F+    E   +I
Sbjct: 184 IISSDFTHWGEIFHHTPIQSTKKKQLSQQLQIADERSIGIIHQFNYEHFRFICEEIHGSI 243

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQC--KTKRDSSVSYASAA 286
           CG + I + L +L         +F  +++S  C  +T +D S+SY +A 
Sbjct: 244 CGCYSICLMLRILAE-----GYEFEVFDRSELCELRTMKDFSISYVAAG 287


>gi|384490136|gb|EIE81358.1| hypothetical protein RO3G_06063 [Rhizopus delemar RA 99-880]
          Length = 626

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/295 (35%), Positives = 147/295 (49%), Gaps = 65/295 (22%)

Query: 7   ASHAGSWYTDNSKRLAEELDGWLREA---------GLPKSPEVRGVIAPHAGYSYSGRAA 57
           ASHAGSWYT++ K L +EL+ +++++          LP    V+ +IAP           
Sbjct: 381 ASHAGSWYTNDKKALDKELNLYIKKSVEEMTADGLQLPIK-HVKAIIAP----------- 428

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
                        RVF+LGPSH   T +C L     Y+TP+G+LP+D  VIE+L  TG F
Sbjct: 429 ------------KRVFILGPSHVCATRECLLPNYDTYETPLGNLPVDWSVIEDLHKTGLF 476

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE---GHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            L D   +  EHS+EMHLP++ K FE    H +   PI   +V  ++  ++         
Sbjct: 477 GLADSEDEINEHSIEMHLPFIYKAFEKTQKHCLSSRPI---SVTDKDHGIH--------- 524

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
              N  + SS                    I++SI+ALD  G+  IE+   D F+ YL E
Sbjct: 525 --LNRNNASSTIVR---------------PIYESIQALDHQGIKAIESLSFDQFENYLNE 567

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            DNTICGR+PI V L  L         + L+Y QSS+C+T  DSSVSYAS   +V
Sbjct: 568 TDNTICGRNPIGVLLACLEEMKGDRSARCLKYAQSSKCRTLEDSSVSYASIYIQV 622


>gi|82541169|ref|XP_724845.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23479636|gb|EAA16410.1| Protein of unknown function [Plasmodium yoelii yoelii]
          Length = 264

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 89/257 (34%), Positives = 128/257 (49%), Gaps = 40/257 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   RRA H+GSWY+DNS  L   ++    +  LPK  +V+  I PHAGY+Y    +++ 
Sbjct: 1   MPTYRRAYHSGSWYSDNSNVLKNSIESLFEKINLPKQ-QVKAAICPHAGYAYCLETSSHV 59

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--------DLPLD-------- 104
           +  I+  +I  +F+LGP+HH Y   C L     Y+TP G        +LPL         
Sbjct: 60  YSCINVENIKNIFILGPNHHIYNKGCLLPQVDKYETPFGFLQINKDGNLPLATCHLPLAT 119

Query: 105 ---------------LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKI 149
                          +  I        ++ +D   DE EHS+EM LP +  + +   IKI
Sbjct: 120 YHLPLTTINVYIYMFISDIMNNDTQNLYDYIDEIDDEEEHSIEMQLPLIKYIIKEKDIKI 179

Query: 150 VPILVGA----VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-- 203
           VPI VG     VN  NE  +     KY  D +N F  SSDFCH+G RF++ +  +K+   
Sbjct: 180 VPIYVGCIGNDVNKINE--FSNPLKKYFQDKTNAFIFSSDFCHFGRRFSFTNILEKYNDK 237

Query: 204 VIHKSIEALDKMGMDII 220
            IHK IE +DK G+++I
Sbjct: 238 YIHKKIENMDKDGINVI 254


>gi|385305757|gb|EIF49708.1| atp-dependent helicase dhx8 [Dekkera bruxellensis AWRI1499]
          Length = 204

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 80/197 (40%), Positives = 111/197 (56%), Gaps = 9/197 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           IR A+HAG+WY      L  +LD +L   G  K  +  R ++ PHAGYSY+G   A A+G
Sbjct: 10  IREATHAGTWYVGTPSVLDRQLDQFLN--GKTKIIDGARILLGPHAGYSYAGPTLAKAYG 67

Query: 63  NIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL--KATGKFEL 119
             DP+ + RVF++GPSHH Y++     S    Y TP+G +P+D E I  L  K    F+ 
Sbjct: 68  XFDPSKVKRVFIMGPSHHVYFSGSVMTSKFDAYDTPLGKIPVDKETISXLMKKDPKMFKQ 127

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGH---LIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           M    DE EH  EMH+P+L KV E     + KI+PI++   N + E    +    Y ++ 
Sbjct: 128 MSFQADEDEHCFEMHMPFLYKVTENSPNGVPKIIPIMISDSNEKFERKLAKDLQPYFENK 187

Query: 177 SNFFSVSSDFCHWGSRF 193
            N F ++SDFCHWG RF
Sbjct: 188 ENAFIITSDFCHWGFRF 204


>gi|256089017|ref|XP_002580615.1| hypothetical protein [Schistosoma mansoni]
          Length = 145

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 67/142 (47%), Positives = 95/142 (66%), Gaps = 2/142 (1%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RRASHAGSWY+    +L  +L  WL E  + + P  R +I+PHAGY YSG  AA+A+  
Sbjct: 5   VRRASHAGSWYSAERAKLNNQLQKWLSEVTVDRQP-TRAIISPHAGYDYSGPCAAFAYKQ 63

Query: 64  IDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           IDP  I R+F+LGP+H+     KCALSTA  ++TP+  L +  +V  +L+ TG+F  + +
Sbjct: 64  IDPRHIKRIFVLGPAHYMSLRGKCALSTADFFETPLYSLSIGKDVYRDLEETGEFVSLPL 123

Query: 123 CVDEAEHSMEMHLPYLAKVFEG 144
             DE EHS+EM +PY+AK+ EG
Sbjct: 124 DRDEEEHSIEMQMPYIAKMMEG 145


>gi|403223866|dbj|BAM41996.1| predicted protein [Theileria orientalis strain Shintoku]
          Length = 197

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 66/174 (37%), Positives = 105/174 (60%), Gaps = 4/174 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R A+HAGSWY+D+S+ L  ++       GLP   +P ++ +IAPHAGY+YS   A +A+
Sbjct: 1   MRIATHAGSWYSDSSRALESQIKS--SSDGLPNVPNPTLKYIIAPHAGYAYSLSTAVHAY 58

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            +ID ++I  VF+LGPSHH++   CA+   +  +TP+G L +D+ +++ L     F +++
Sbjct: 59  SHIDASTIKTVFVLGPSHHFFLRGCAVDQFSSLQTPLGQLQVDVNIVDSLSNLKGFSVVN 118

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
               E EHS+EMHLP L  +F+   +K+VPI+VG  N             Y +D
Sbjct: 119 KQASEDEHSIEMHLPLLKFMFKKENVKVVPIIVGDFNETMRDQIASSLLPYFND 172


>gi|156084952|ref|XP_001609959.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797211|gb|EDO06391.1| conserved hypothetical protein [Babesia bovis]
          Length = 245

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 91/293 (31%), Positives = 139/293 (47%), Gaps = 58/293 (19%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RR+ HAGSWY+++++ L E++D  L +A  P    ++ +I+P+                
Sbjct: 1   MRRSIHAGSWYSNDAEILTEQIDQALSQAVEPFGSALKYIISPNT--------------- 45

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
                   VF+LGPSHH     CA+  ++  +TP G+L +D ++  EL     F+ +   
Sbjct: 46  --------VFILGPSHHLPLKGCAVDVSSTLQTPFGELQVDNDITTELLKGKCFKELSKR 97

Query: 124 VDEAEHSMEMHLP---YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
             E EHS+EM LP   Y+A       IK+VPI+VG +  E     G+    Y +     F
Sbjct: 98  NSEEEHSIEMQLPILHYVANKSNADHIKVVPIVVGYMLNEGLEDVGQALLPYFEKEDTIF 157

Query: 181 SVSSDFCHWGSRFNYMH--YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            +SSDFCH+G RF +    ++ +   I K+IE+LD  G+ +I   D +   KY       
Sbjct: 158 VISSDFCHFGKRFGFTRTGFEDQDMPIWKAIESLDLDGVKLIVEHDLEVSNKY------- 210

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
                               IKI  +   QSS+C  + DSSVSY S A  + A
Sbjct: 211 --------------------IKITLV---QSSKCSERSDSSVSYCSIAGMLSA 240


>gi|300176356|emb|CBK23667.2| unnamed protein product [Blastocystis hominis]
          Length = 195

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 65/167 (38%), Positives = 103/167 (61%), Gaps = 1/167 (0%)

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
           +E EHS+E+ LP++  +F G   K+V ++VGAV++  ++    + A ++ DP   F +SS
Sbjct: 4   EENEHSLEIELPFVYHLF-GSRAKVVLMMVGAVSSSYKSKIAPVLAPFLTDPKTVFVISS 62

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
           DFCHWG  F++++Y  +   I ++IE LDK G  +IE    DAF +Y+ + DNTICG + 
Sbjct: 63  DFCHWGYNFDFLYYKDRSKPIWQNIEELDKRGAALIEQQSADAFAEYIEDTDNTICGHNC 122

Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           I +FL  L     +   + LRY QSS+ ++  DSSVSY +    ++A
Sbjct: 123 IEIFLRALAASGLQTTTRMLRYGQSSRAESMADSSVSYCALKTALNA 169


>gi|302507880|ref|XP_003015901.1| hypothetical protein ARB_06213 [Arthroderma benhamiae CBS 112371]
 gi|291179469|gb|EFE35256.1| hypothetical protein ARB_06213 [Arthroderma benhamiae CBS 112371]
          Length = 272

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 85/268 (31%), Positives = 122/268 (45%), Gaps = 81/268 (30%)

Query: 101 LPLDLEVIEELK-----ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLI 147
           LPLD EV+ +++     A   F  M   VDE EHSME+HLPY+ ++ +            
Sbjct: 3   LPLDTEVLAKIEQEASSADRPFSRMSRSVDEQEHSMELHLPYIHRLLQRLYPTMPTSAYP 62

Query: 148 KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG---- 203
            +VP++VG+ +A NE ++G +   Y+ D  N F +SSDFCHWG+RF Y +Y +  G    
Sbjct: 63  PLVPMMVGSTSAANERVFGSILRPYLADEENAFVISSDFCHWGTRFAYTYYVQAPGPTTY 122

Query: 204 ------------------------------------------------VIHKSIEALDKM 215
                                                            IH+S+ A D  
Sbjct: 123 KLPLTYPSLPVPSDMLTAERAAEEVRAASSGRMLRENDKSYNKEKYMWAIHESVSACDMA 182

Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL-----------HMLGNCSTKIKIK-- 262
            M  I +G+ D F++ L    NT+CGRHPI V +             +G   T+  +K  
Sbjct: 183 CMSAIASGNTDVFQEALRRTGNTVCGRHPIGVVMAGIEGLQHQAETTMGAAVTQDGVKKG 242

Query: 263 ---FLRYEQSSQCKTKRDSSVSYASAAA 287
              F+RYE+SS C + +DSSVSY SA A
Sbjct: 243 KFYFIRYERSSDCFSVQDSSVSYVSAFA 270


>gi|159108244|ref|XP_001704394.1| Hypothetical protein GL50803_112932 [Giardia lamblia ATCC 50803]
 gi|157432456|gb|EDO76720.1| hypothetical protein GL50803_112932 [Giardia lamblia ATCC 50803]
          Length = 285

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/292 (32%), Positives = 149/292 (51%), Gaps = 25/292 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAF 61
           +R    +GSWY    K    ++  +LR+AG    P+  VR +IAPHAG  YSG  AA+++
Sbjct: 1   MRLCLKSGSWY-PLPKEAKAQVAAFLRKAGEAVHPDANVRFIIAPHAGLVYSGLTAAHSY 59

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI---GDLPLDLEVIEELKATGKFE 118
            +IDPT  + V +LG  H ++    + S    +  P+   G   L +E I  L +  K  
Sbjct: 60  SSIDPTRYTSVVMLGVCHAFHQRGLSTSPFASWANPLMEKGSPSLSMETIPGLPSCQK-- 117

Query: 119 LMDICVDEAEHSMEMHLPYLAKVF----EGHLIKIVPILV--GAVNAENEAMYGRLFAKY 172
             D C  E EHS+E+ +P+LA VF    E   +K   +    GA   E +++       Y
Sbjct: 118 --DDC--EEEHSLELQIPFLAHVFANQIEAGTVKFSAVYCSYGATRTEIDSLM-----DY 168

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           V + ++   VSSDFCH+G RF +    +     ++++  LD   ++ +  G  ++F++ L
Sbjct: 169 VTEHNSLIVVSSDFCHYGPRFQFTPMIQGK-TANETVTMLDNKCINGVMLG-ANSFEEAL 226

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            E  NT+CG + I   L +L      +K + L Y +S + +T RD SVSY S
Sbjct: 227 KETQNTVCGHYTILTCLRVLEGLKLDVKKELLFYTKSDEPRTARDHSVSYVS 278


>gi|253744686|gb|EET00850.1| Hypothetical protein GL50581_1896 [Giardia intestinalis ATCC 50581]
          Length = 285

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 91/292 (31%), Positives = 149/292 (51%), Gaps = 25/292 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R    +GSWY  + K    ++   LR+A   +   P V+ +I+PHAG  YSG  AA+++
Sbjct: 1   MRLCLKSGSWY-PSPKEAKAQVAALLRKAEETVHPDPNVKFIISPHAGLVYSGLTAAHSY 59

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI---GDLPLDLEVIEELKATGKFE 118
            +IDPT  S V +LG  H ++    + S    +  P+   G   L +E +  L +  K +
Sbjct: 60  SSIDPTRYSSVVMLGVCHAFHQRGLSTSPFASWANPLTEKGSPSLSMEGVPGLSSCQKSD 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHL------IKIVPILVGAVNAENEAMYGRLFAKY 172
                  E EHS+E+ +P+LA VF   +         V    GA  AE +++     A+Y
Sbjct: 120 C------EEEHSLELQIPFLAHVFANQIEAGTVTFSAVYCSYGATKAEIDSL-----AEY 168

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           + + ++   VSSDFCH+G RF +    +     ++++  LD   ++ +  G  D+F++ L
Sbjct: 169 IVEHNSLIVVSSDFCHYGPRFQFTPLIQGK-TANETVTMLDNKCINGVMLG-ADSFEEAL 226

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            E  NT+CG + I   L +L   +  +K + L Y +S + KT RD SVSY S
Sbjct: 227 KETQNTVCGHYTIKTCLCILEGLNLNVKKELLFYTKSDEPKTTRDHSVSYVS 278


>gi|301063138|ref|ZP_07203692.1| memo-like protein [delta proteobacterium NaphS2]
 gi|300442738|gb|EFK06949.1| memo-like protein [delta proteobacterium NaphS2]
          Length = 362

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 148/291 (50%), Gaps = 15/291 (5%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP---EVRGVIAPHAGYSYSGRAA 57
           ++ +  A+H G+WY  + KRL   L  +  +A  PK+     +  +I+PHAGY YSG  A
Sbjct: 56  LKHVFTATHGGTWYPGDKKRLEALLTKFFDQAH-PKTTAKGRLMALISPHAGYPYSGPTA 114

Query: 58  AYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           ++ +  +     ++  V ++G SH+      AL     ++TP+G + +  +++E+L    
Sbjct: 115 SFGYKLLKQRDQNVKLVVIIGTSHYVPFHGMALPDWDGFRTPLGTVGVATKIVEKLDKEP 174

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
            FE+ D      EHS+EM +P L   F     +IVPI+ G++N E+      L    ++ 
Sbjct: 175 PFEIFDQAFAR-EHSVEMQVPMLQ--FALKKFQIVPIVAGSLNMEDIQKAAELLRPLLNR 231

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P   F +SSDF H+G  + Y  +  +     ++++ L    +  I+  D  AF ++    
Sbjct: 232 PDVLFVISSDFTHYGPNYGYAPFTDERP---ENLKKLADQALGTIKRLDLRAFMEHWQTT 288

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            +TICG  PISV L ML   +T   ++ L ++ S +     ++SVSY S A
Sbjct: 289 GDTICGFFPISVLLAMLPQNTT---VEPLHFDTSGRITNNYENSVSYFSIA 336


>gi|308163228|gb|EFO65584.1| LigB protein [Giardia lamblia P15]
          Length = 285

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 92/292 (31%), Positives = 149/292 (51%), Gaps = 25/292 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAF 61
           +R    +GSWY    K    ++  +L++AG    P+  VR ++APHAG +YSG  AA+++
Sbjct: 1   MRLCLKSGSWY-PLPKEAKAQVAAFLQKAGETVHPDANVRFILAPHAGLAYSGLTAAHSY 59

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI---GDLPLDLEVIEELKATGKFE 118
            +IDPT  + V +LG  H ++    + S    +  P    G   L +E I  L +  K  
Sbjct: 60  SSIDPTRYTSVVMLGVCHAFHQRGLSTSPFASWANPFTEKGSPSLSMETIPGLPSCQK-- 117

Query: 119 LMDICVDEAEHSMEMHLPYLAKVF----EGHLIKIVPILV--GAVNAENEAMYGRLFAKY 172
             D C  E EHS+E+ +P+LA VF    E   +K   +    GA   E +++       Y
Sbjct: 118 --DDC--EEEHSLELQIPFLAHVFANQIEAGTVKFSAVYCSYGATRTEIDSLM-----DY 168

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           + + ++   VSSDFCH+G RF +    +     ++++  LD   ++ +  G  D+F++ L
Sbjct: 169 ITEHNSLIVVSSDFCHYGPRFQFTPMIQGK-TANETVTMLDNKCINGVMLG-ADSFEEAL 226

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            E  +T+CG + I   L +L      +K + L Y +S + KT RD SVSY S
Sbjct: 227 KETQSTVCGHYTILTCLRVLEGLKLDVKKELLFYAKSDEPKTARDHSVSYVS 278


>gi|70945450|ref|XP_742543.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521586|emb|CAH75500.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 187

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 70/183 (38%), Positives = 103/183 (56%), Gaps = 11/183 (6%)

Query: 47  HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
           HAGY+Y    +++ +  I+  +I  +F+LGP+HH Y   C L     Y+TP+G L ++  
Sbjct: 2   HAGYAYCLETSSHVYSCINVENIKNIFILGPNHHIYNKGCLLPQVDKYETPLGFLQINKN 61

Query: 107 VIEEL---KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA----VNA 159
           VI ++        ++ +D   DE EHS+EM LP +  + +   IKIVPI VG     VN 
Sbjct: 62  VISDIMNNDTQNLYDYIDELDDEEEHSIEMQLPLIKYIIKEKDIKIVPIYVGCIGNDVNK 121

Query: 160 ENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--VIHKSIEALDKMGM 217
            NE  +     KY  D +N F  SSDFCH+G RF++ +  +K+    IHK IE +DK G+
Sbjct: 122 INE--FCNPLKKYFQDETNAFIFSSDFCHYGRRFSFTNILEKYNDKYIHKKIENMDKDGI 179

Query: 218 DII 220
            II
Sbjct: 180 SII 182


>gi|164661966|ref|XP_001732105.1| hypothetical protein MGL_0698 [Malassezia globosa CBS 7966]
 gi|159106007|gb|EDP44891.1| hypothetical protein MGL_0698 [Malassezia globosa CBS 7966]
          Length = 200

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 77/189 (40%), Positives = 102/189 (53%), Gaps = 27/189 (14%)

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
           VDEAEHS+EMHLPY+ KV+    +KIVP+LVG +  +    Y   FA+Y  DP   F +S
Sbjct: 5   VDEAEHSIEMHLPYIYKVWGERDVKIVPVLVGHLPEQMNFAYALCFAQYFADPRTLFVIS 64

Query: 184 SDFCHWGSRFNYMHYD---KKHGV---------------IHKSIEALDKMGMDIIETGD- 224
           SDFCHWGSRF Y  Y       G+               I++SI+ LD  GM  I     
Sbjct: 65  SDFCHWGSRFQYTWYQPTSTSKGIMLSSANKSCIEPKMPIYQSIQNLDAEGMSAISFNKH 124

Query: 225 -----PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKR 276
                  AF  +L +  NTICGR+PI + L +L     +    + +F  Y+QS+ C   +
Sbjct: 125 GSRRARQAFAMHLTKTGNTICGRNPILLLLTILEILEDRGAMFECRFTHYKQSNACCHVQ 184

Query: 277 DSSVSYASA 285
           DSSVSYA+A
Sbjct: 185 DSSVSYATA 193


>gi|116748738|ref|YP_845425.1| hypothetical protein Sfum_1298 [Syntrophobacter fumaroxidans MPOB]
 gi|116697802|gb|ABK16990.1| protein of unknown function DUF52 [Syntrophobacter fumaroxidans
           MPOB]
          Length = 276

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 18/286 (6%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYA 60
           E+IR +  AGSWY DN   L  E+ G+L+ A   P    + G++APHAGY YSG  AA+A
Sbjct: 4   ERIRESVIAGSWYPDNPGALRNEIAGYLKRARPHPGRENLIGLVAPHAGYMYSGEVAAHA 63

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  ++ +   RV ++ PSH    P   +     ++TP+G +PLD+++++      +    
Sbjct: 64  YRQLEGSRFERVLIVAPSHRTRFPSSTIYHLGGFRTPLGVVPLDVDLVQAFLEHPQLRGY 123

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
               ++ EHS+E+ LP+L  V      K+VPI++G    E+      L  K   + +   
Sbjct: 124 YPQAEDQEHSLEIQLPFLQVVLGE--FKLVPIIMGNQTYEHCTELAELIEKLCRNKNVLL 181

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSD  H+ S   Y    +  G++   + A D  G+          F+  L +     C
Sbjct: 182 IASSDLSHYHS---YREAKQLDGIVTARVAAFDCQGL----------FRD-LNDGACEAC 227

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           G  P+   +   G     +  + L Y  S     +R   V Y +AA
Sbjct: 228 GAGPMVAVMLAAGKLGANVS-RVLHYANSGDVTGERSGVVGYMAAA 272


>gi|218200888|gb|EEC83315.1| hypothetical protein OsI_28694 [Oryza sativa Indica Group]
          Length = 84

 Score =  117 bits (294), Expect = 5e-24,   Method: Composition-based stats.
 Identities = 64/76 (84%), Positives = 71/76 (93%)

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
           MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CSTKIKI F+RYEQSSQCK+
Sbjct: 1   MGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCSTKIKIGFVRYEQSSQCKS 60

Query: 275 KRDSSVSYASAAAKVD 290
            RDSSVSYASAAAKVD
Sbjct: 61  MRDSSVSYASAAAKVD 76


>gi|373457187|ref|ZP_09548954.1| AMMECR1-domain protein [Caldithrix abyssi DSM 13497]
 gi|371718851|gb|EHO40622.1| AMMECR1-domain protein [Caldithrix abyssi DSM 13497]
          Length = 502

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 72/209 (34%), Positives = 114/209 (54%), Gaps = 15/209 (7%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR----GVIAPHAGYSYSGRAAA 58
           KIR+ + AG++Y  ++K L   ++ +L++A   +SPE++    G+IAPHAGY YSG  AA
Sbjct: 33  KIRKPAVAGAFYPSSAKALQSMIESFLKKA---QSPEIKDAITGIIAPHAGYVYSGGVAA 89

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKF 117
           YA+  +    I RV ++ PSH  Y    A+   T Y+TP+G +P+D E   +L K + + 
Sbjct: 90  YAYKVLQGRKIKRVVVISPSHVEYFSGAAVYDGTHYETPLGLIPVDAEFARKLAKKSDRL 149

Query: 118 ELMD-----ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
           EL +           EH++E+ LP+L  V +    ++VPI++G  + E     GR  A  
Sbjct: 150 ELSEHGHTINFQGRGEHALEVQLPFLQVVLDD--FQLVPIVMGEQSYETARALGRALADL 207

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKK 201
           ++D       SSD  H+ S    +  DKK
Sbjct: 208 IEDDRTIIVASSDLSHFHSYQKAIELDKK 236


>gi|349804847|gb|AEQ17896.1| putative protein memo1 [Hymenochirus curtipes]
          Length = 100

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 53/100 (53%), Positives = 71/100 (71%), Gaps = 1/100 (1%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWYT +  +L+ +L+GWL +A   K P  R +IAPHAGY+Y G  AA+A+  +
Sbjct: 1   REASHAGSWYTASGPQLSAQLEGWLSQAQTSKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 59

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
           DP+   RVF+LGPSHH    +CALST  +Y+TP+ DL +D
Sbjct: 60  DPSVTRRVFILGPSHHVALSRCALSTVDLYRTPLYDLHVD 99


>gi|361130005|gb|EHL01881.1| putative protein MEMO1 [Glarea lozoyensis 74030]
          Length = 201

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 96/200 (48%), Gaps = 59/200 (29%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ASHAGSWY  N   L+ EL+G+L  A +P + + + +  P A                
Sbjct: 4   RSASHAGSWYKANRTALSSELEGYL--AAVPAAFDEQQLPIPKA---------------- 45

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                               +  ++ ATV               EELKATGKF+ +   V
Sbjct: 46  --------------------RVIIAPATV---------------EELKATGKFDSIPRDV 70

Query: 125 DEAEHSMEMHLPYLAKVFEGHL------IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           DEAEHS+EMH PY+  +   H        K+VP++VG      E  YG+LF+ Y+ DP++
Sbjct: 71  DEAEHSLEMHCPYIYTMLSRHFDSPADYPKLVPVMVGNTAPAAEKEYGKLFSSYLADPTS 130

Query: 179 FFSVSSDFCHWGSRFNYMHY 198
            F +SSDFCHWG RF Y +Y
Sbjct: 131 VFVISSDFCHWGGRFRYTYY 150


>gi|390952709|ref|YP_006416468.1| putative dioxygenase [Thiocystis violascens DSM 198]
 gi|390429278|gb|AFL76343.1| putative dioxygenase [Thiocystis violascens DSM 198]
          Length = 261

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 129/286 (45%), Gaps = 31/286 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  +R+ + A  +Y  + + L   LDG+L  A  P++P  + VI PHAGY YSG  AA A
Sbjct: 1   MSLVRQPAVADRFYPGDPRELTRMLDGFLANAQ-PRTPPPKAVIVPHAGYIYSGPIAASA 59

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  + P   S++RV L+GPSH       A S+A  + TP+G +PLD   I +  A  +  
Sbjct: 60  YATLAPARESVTRVILMGPSHRVPFRGLATSSADWFDTPLGAVPLDRVAINQALALPQVR 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            MD      EHS+E+ LP+L +V       + P +VG  + E+ A    L       P  
Sbjct: 120 SMDEA-HALEHSLEVQLPFLQRVLVD--FSLAPFVVGDASPESVAEVLELLWG---GPET 173

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              +SSD  H+       H D       +S   LD      IE+  PD      + YD  
Sbjct: 174 LIVISSDLSHY-------HDD-------RSARRLDAATSAAIESLRPDD-----IGYDQA 214

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            CGR P++  L +      + +   LR   S      RD  V Y +
Sbjct: 215 -CGRTPVNGLLTVAKRRGMQARTLDLR--NSGDTAGSRDQVVGYGA 257


>gi|333984577|ref|YP_004513787.1| hypothetical protein [Methylomonas methanica MC09]
 gi|333808618|gb|AEG01288.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methylomonas methanica MC09]
          Length = 259

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 86/269 (31%), Positives = 130/269 (48%), Gaps = 41/269 (15%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           R  + AGS+Y  ++  L+  L+G+L R AG   +P  + +I PHAGY YSG  AA A+  
Sbjct: 3   RAPAVAGSFYPADAGTLSHMLNGFLDRTAGDEHAP--KAIIVPHAGYIYSGPIAASAYAR 60

Query: 64  IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           + P   S+SRV L+GPSH       A+S A  + TP+GD+ +D   I++L+       ++
Sbjct: 61  LKPVRDSVSRVVLIGPSHRVAFQGLAVSAADTFSTPLGDIAIDQSAIQQLQQLPFVGYLE 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-----AVNAENEAMYGRLFAKYVDDP 176
                 EHS+E+HLP+L +       K+VPI+ G      V+   E ++G         P
Sbjct: 121 EA-HALEHSLEVHLPFLQQTLSH--FKLVPIVAGDASPHQVSQVLEILWG--------GP 169

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                +SSD  H+        +D      + + + LDK   ++IE  D  A         
Sbjct: 170 ETLIVISSDLSHY--------HD------YATSQTLDKHTSELIEALDYQALSS------ 209

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLR 265
           +  CGR P+S  L +L   S  IK   LR
Sbjct: 210 DAACGRVPVSGLLKLLREKSLAIKTIDLR 238


>gi|119719191|ref|YP_919686.1| hypothetical protein Tpen_0273 [Thermofilum pendens Hrk 5]
 gi|119524311|gb|ABL77683.1| protein of unknown function DUF52 [Thermofilum pendens Hrk 5]
          Length = 287

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 87/295 (29%), Positives = 135/295 (45%), Gaps = 30/295 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPE--------VRGVIAPHAGYSYSGR 55
           R A HAG +Y    + L E L + +L E G  K PE        + GV+APHAGY YSG 
Sbjct: 6   RPAVHAGYFYPAEKRELIEALKNAFLHELGPGKLPEKPGSYTGNLIGVVAPHAGYIYSGH 65

Query: 56  AAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+A+  + +      VF+LGP+HH      AL    V++TP+GD+ +D  V +EL + 
Sbjct: 66  VAAHAYLALAEGGKPDVVFILGPNHHALGAPIALDENEVWETPLGDVEVDFRVSKELASR 125

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            +    D      EHS+E+ +P+L  VF G    IVPI +     E     G   A  + 
Sbjct: 126 EQIIRFDFQAHAYEHSIEVQVPFLQFVF-GEGFTIVPISMMLQTPEAARRVGEAIAGLIM 184

Query: 175 DPS--NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA-LDKMGMDIIETGDPDAFKKY 231
           +     +   SSD  H+               +   I A  DK+ ++ +   D +     
Sbjct: 185 EKGLRAYVVASSDMSHY---------------VESRIAAEKDKVAIERVLFLDAEGLYNA 229

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ++E D ++CG  P+   +++         ++ L+Y  S      + S V+YAS A
Sbjct: 230 VIEEDISMCGPGPVMALIYIAKKLGYA-NVRLLKYADSGDVTGDKSSVVAYASLA 283


>gi|91204537|emb|CAJ70765.1| conserved hypothetical protein [Candidatus Kuenenia
           stuttgartiensis]
          Length = 347

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 87/291 (29%), Positives = 134/291 (46%), Gaps = 25/291 (8%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFGNIDPTS 68
           AG +Y  N   L  +++ +L E    KS      +I+PHAGY YSG+ AAY +  I    
Sbjct: 37  AGRFYPGNEAALKGQINEFLNEVSSQKSNGRPLAIISPHAGYVYSGQVAAYGYSAIKGHG 96

Query: 69  ISRVFLLGPSHHYYTPKCA-LSTATVYKTPIGDLPLDLEVIEELKATG---------KFE 118
             RV +L PSH     + A +  AT YKTP+G + +D E  + L  T             
Sbjct: 97  FKRVIVLSPSHSGRRYRGASILKATSYKTPLGKISIDQEACDYLLNTSFTAESKNKRNSS 156

Query: 119 LMDICVD-----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
            + +  D     + EHS+EM LP+L          +VPI++G +   +           +
Sbjct: 157 PLKLFGDYDGAYKGEHSLEMQLPFLQMTLGD--FNLVPIMIGILIDNDFDKVAEAIRPLL 214

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
           DD +    VSSDF H+G  + Y+ + +    + ++I+ LD    + I   D D  ++Y  
Sbjct: 215 DDKT-LLVVSSDFTHYGDAYRYVPFREN---VEENIKILDYGAFEKILNKDFDGLREYRK 270

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +     CG  PIS+ L +L + +   K + L Y+ S         SVSYAS
Sbjct: 271 QTGINACGILPISILLKLLPDSA---KGQILHYDTSGHQTNNFVYSVSYAS 318


>gi|53802717|ref|YP_112612.1| hypothetical protein MCA0072 [Methylococcus capsulatus str. Bath]
 gi|53756478|gb|AAU90769.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath]
          Length = 267

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 90/290 (31%), Positives = 140/290 (48%), Gaps = 39/290 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  +R  + AG +Y  + ++L   L G+L EA    +P  + +IAPHAGY YSG  AA A
Sbjct: 1   MRSVRAPAVAGLFYPADRRQLHALLQGFLGEAKTAGTPP-KALIAPHAGYVYSGPVAASA 59

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  + P    I RV LLGPSH       ALS+   + TP+G++PLDLE    L A     
Sbjct: 60  YALLKPLRHRIRRVVLLGPSHRIAFRGLALSSTQSFATPLGEIPLDLEAQSVLAALPFVH 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD---- 174
           ++D      EHS+E+ LP+L +V E    K+VPI+VG      +A+ G++ A+ +D    
Sbjct: 120 VLDQA-HVLEHSLEVQLPFLQEVLED--FKLVPIVVG------DAVPGQV-AEAIDLVWG 169

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
                  VSSD  H+        +D      + +   +D+   D IE+  P+      + 
Sbjct: 170 GDETLIVVSSDLSHY--------HD------YATARRMDRATSDAIESLHPEE-----IG 210

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +++  CGR P++  L  +      +  K +    S      +DS V Y +
Sbjct: 211 FEDA-CGRLPVAGLL--IAARRRGLSAKTVDLRNSGDTAGSKDSVVGYGA 257


>gi|323304343|gb|EGA58116.1| YJR008W-like protein [Saccharomyces cerevisiae FostersB]
          Length = 210

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 72/209 (34%), Positives = 100/209 (47%), Gaps = 39/209 (18%)

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGRLFAKYVD 174
           MD   D AEHS+EM LP L +  +        +K++P++V   + + +   G + ++Y+ 
Sbjct: 1   MDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGNILSEYIK 60

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHGV-IHKS 208
           DP+N F VSSDFCHWG RF Y  Y                            H V I +S
Sbjct: 61  DPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIXEETEVEMLTARSKLSHHQVPIWQS 120

Query: 209 IEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS----TKIK 260
           IE +D+  M  +     G+  DA+K+YL    NTICG  PISV L  L        + IK
Sbjct: 121 IEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRDAGPSGIK 180

Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            ++  Y QSS   +  DSSVSY S    +
Sbjct: 181 FQWPNYSQSSHVTSIDDSSVSYTSGYVTI 209


>gi|84489659|ref|YP_447891.1| hypothetical protein Msp_0859 [Methanosphaera stadtmanae DSM 3091]
 gi|84372978|gb|ABC57248.1| conserved hypothetical protein [Methanosphaera stadtmanae DSM 3091]
          Length = 283

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 92/302 (30%), Positives = 142/302 (47%), Gaps = 38/302 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSP----EVRGVIAPHAGYSYSGRAA 57
           IR+ S AG +Y  N   L E +     + G  +P +P    + +  I PHAGY YSG+ A
Sbjct: 2   IRKPSVAGLFYESNKSLLEENIKNIFNKTGQNIPTTPSNTYKTKAAIVPHAGYIYSGKTA 61

Query: 58  AYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           +YA+G+I  + I   V ++GP+H  Y    +L+T+  ++TPIGD+ +D E   EL+    
Sbjct: 62  SYAYGDIARSGICDTVVIIGPNHTGYGDDISLTTSNTWQTPIGDVCVDSEFNNELEKINS 121

Query: 117 FELMDICVDEAEHSMEMHLPYLAKV--FEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
                      EHS+E+ LP+L  +   +    KIVPI++       +  +    A  + 
Sbjct: 122 NITFSPEAHIKEHSIEVELPFLQYISNIQKKSFKIVPIVI----TRQQKNFCVELAHSIY 177

Query: 175 DPS-----NFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
           D S     N   V S+D  H+ +  +  + D+K   I KSIE      MDI      D+ 
Sbjct: 178 DVSKKLNRNIMVVASTDLTHYENATSAKNKDEK---ILKSIE-----NMDI------DSL 223

Query: 229 KKYLLEYDNTICGRHP--ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              + +Y+ T+CG  P   ++    L N +  I    L Y  S +     +S V YASA 
Sbjct: 224 LNNINKYNITMCGYGPTITAIIYSKLVNANRSI---ILNYSNSGEVYNDYESVVGYASAV 280

Query: 287 AK 288
            K
Sbjct: 281 IK 282


>gi|294673326|ref|YP_003573942.1| hypothetical protein PRU_0573 [Prevotella ruminicola 23]
 gi|294474323|gb|ADE83712.1| conserved hypothetical protein [Prevotella ruminicola 23]
          Length = 464

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 13/282 (4%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           + ++R A+ A  +YT +++ L  E+D  L++        +  +I PHAGY +SG  AA A
Sbjct: 3   LGQVRPATQANRFYTGDAEELENEVDSLLKQHNSQVYDNLAALIVPHAGYYFSGNVAASA 62

Query: 61  FGNIDP-TSISRVFLLGPSHHYYTPKCALST-ATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  ID      R+FLLGPSHH +    +++T A  Y TP+G++ +D E   EL       
Sbjct: 63  YMTIDAKKEYKRIFLLGPSHHEWLNGASVNTEADYYATPLGNVKVDHETAIELTKADSIF 122

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
                    EH +E+ LP+L +   G +  IVPI++   +         +   Y  D  N
Sbjct: 123 AYHRSAHAQEHCLEVQLPFLQRKL-GEVPPIVPIIISTNDYYKLKRMADVLKPYFTD-EN 180

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            F +SSDF H+ S  +    D K G   K+IE  D          + ++ K+ L     +
Sbjct: 181 LFVISSDFSHYPSYQDAYEVDAKTG---KAIETGDVEEFIAAIEANANSGKRNLA---TS 234

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSV 280
            CG   I   + M+    ++  +K L Y+ S     +  S V
Sbjct: 235 ACGEFAIVTLMLMI---DSRYDVKHLMYQNSGDIDNRDLSRV 273


>gi|345878384|ref|ZP_08830100.1| putative dioxygenase [endosymbiont of Riftia pachyptila (vent
           Ph05)]
 gi|344224602|gb|EGV50989.1| putative dioxygenase [endosymbiont of Riftia pachyptila (vent
           Ph05)]
          Length = 271

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 85/291 (29%), Positives = 140/291 (48%), Gaps = 33/291 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  +R A+ AG +Y D    L   +D +L +AG  + P  + ++APHAG+ YSG  AA  
Sbjct: 8   MGGVRTAAVAGLFYPDAPAELRAMVDDYLGQAGEGRPP--KAIVAPHAGFVYSGPVAASV 65

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  + P +  ISRV LLGPSH       ALS+A  ++TP+G++PL+ +V +EL    +  
Sbjct: 66  YHRLTPVADRISRVVLLGPSHRVAFRGLALSSAQSFQTPLGEIPLE-QVSDELLRLPQLS 124

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           + D+   + EHS+E+ LP+L +V      +++PI+VG  +    A    +  +  D    
Sbjct: 125 VNDMAHAQ-EHSLEVQLPFLQRVLGE--FRLLPIVVGDASGAEVA---EVLEQVWDGEET 178

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              +SSD  H+    +Y+           + + +D+   D I    P+      L Y++ 
Sbjct: 179 LIVISSDLSHY---HDYL-----------TAQQMDRTTSDAILALQPER-----LHYEDA 219

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            CGR P+   L +        ++  LR   S      +D  V Y + A  V
Sbjct: 220 -CGRIPLQGLLEVAQRKGLDGELVDLR--NSGDTAGLKDRVVGYGAYAFAV 267


>gi|320160154|ref|YP_004173378.1| hypothetical protein ANT_07440 [Anaerolinea thermophila UNI-1]
 gi|319994007|dbj|BAJ62778.1| hypothetical protein ANT_07440 [Anaerolinea thermophila UNI-1]
          Length = 281

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 117/223 (52%), Gaps = 9/223 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYAFG 62
           +R +  AG+WY+ +++RL  +++ +L EA +P  + EV  V+APHAG+ YSGR A +AF 
Sbjct: 7   VRPSPIAGTWYSGDAQRLRRQVETYLSEAHIPSLNGEVVAVVAPHAGHRYSGRTAGHAFA 66

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD---LEVIEELKATGKFEL 119
            +   S   V +L P H  +  +   +    Y TP+G +P+D   L ++E+  A     L
Sbjct: 67  AVRGQSPEVVAVLSPFHAMHPAELLTTAHQAYATPLGLIPVDHEALLILEQDLAEQHLSL 126

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
             +  D +EHS+E+ LP+L    +G   +++P++V + + +     G+L A+ + D    
Sbjct: 127 TRVAYD-SEHSLEIELPFLQVALQGDF-RLLPLMVRSHSPQVLQTLGKLLAQVLKDRRAL 184

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
              S+D  H+         D +   + + +EA D + M   ET
Sbjct: 185 LVASTDLSHFYPEKIARQLDAE---MLRRLEAFDPLAMLEAET 224


>gi|313680677|ref|YP_004058416.1| hypothetical protein [Oceanithermus profundus DSM 14977]
 gi|313153392|gb|ADR37243.1| protein of unknown function DUF52 [Oceanithermus profundus DSM
           14977]
          Length = 260

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 128/283 (45%), Gaps = 30/283 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AGS+Y     +LA  +D  L  A  P +   + V+APHAGY YSG  AAY+F  
Sbjct: 2   VRKPAVAGSFYPAEPMQLARMVDELLAMATRPPAHAPKAVVAPHAGYVYSGPVAAYSFRA 61

Query: 64  IDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           ++P +     VFL+GP+H+      +  T T ++TP+G +P+D E +  L          
Sbjct: 62  LEPLAGKTPTVFLMGPAHYLAFEGVSTGTYTYWETPLGQVPVDTERVGALLERSALFTTA 121

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
               + EHS+E+ LP+L  V  G   K+VP+L G V+    A+ G L A  V  P +   
Sbjct: 122 DEPHQPEHSLEVELPFLQAVLGG--FKLVPLLFGLVD--PLAVAGHLEA--VLRPGDVLV 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           VS+D  H+         D       ++  ALD  G+   E                  CG
Sbjct: 176 VSTDLSHYHPDAEARKLD---AATLETALALDARGLSHREA-----------------CG 215

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           RHP +    +    S + ++  L Y  S      R   V YA+
Sbjct: 216 RHPWAALTALADRRSWRPEL--LAYATSGDTSGDRSRVVGYAA 256


>gi|119620873|gb|EAX00468.1| chromosome 2 open reading frame 4, isoform CRA_b [Homo sapiens]
          Length = 201

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 55/103 (53%), Positives = 66/103 (64%), Gaps = 3/103 (2%)

Query: 190 GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL 249
           G RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L
Sbjct: 98  GQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLL 157

Query: 250 HM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 158 NAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 200


>gi|154151753|ref|YP_001405371.1| hypothetical protein Mboo_2214 [Methanoregula boonei 6A8]
 gi|154000305|gb|ABS56728.1| protein of unknown function DUF52 [Methanoregula boonei 6A8]
          Length = 262

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 89/290 (30%), Positives = 138/290 (47%), Gaps = 37/290 (12%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K R  +  G +Y  +S  L + L+G+      P  P + G+++PHAGY YSG+ AAYAF 
Sbjct: 2   KTRSCAVCGMFYPRDSAHLTQILEGFFSRTA-PGEPAL-GIVSPHAGYIYSGQIAAYAFS 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            IDP       ++GPSHH Y      ++A  ++TP+G + +D E I+ L         DI
Sbjct: 60  RIDPGFSGTFVVIGPSHHGYLTS---ASAIPWETPLGLVEIDAEFIDAL---------DI 107

Query: 123 CVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR--LFAKYVDDP 176
            VDE     EHS+E+ LP++   F     ++VPI++G  +  + A      + A+ +   
Sbjct: 108 PVDEPSHEEEHSIEVQLPFIRHRFP--RARVVPIMMGEQDPAHAAAVAEKIVAAQRLTKK 165

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                 SSDF H+           K G ++ +IE L +  +DI E      F + + E  
Sbjct: 166 EIRVVASSDFSHY-----VPEAQAKTGDLY-AIEPLAR--LDIKE------FYRRIREKR 211

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            T CG  PIS  + +   C    + + +RY  S      R+  V YA+ A
Sbjct: 212 VTACGYGPISAMVAVC-QCMGAGRGELIRYATSGDVTNDRNQVVGYAAIA 260


>gi|302348701|ref|YP_003816339.1| hypothetical protein ASAC_0902 [Acidilobus saccharovorans 345-15]
 gi|302329113|gb|ADL19308.1| hypothetical protein ASAC_0902 [Acidilobus saccharovorans 345-15]
          Length = 288

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 142/301 (47%), Gaps = 28/301 (9%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAG---LPKSPEVR-----GVIAPHAGYS 51
           M++IR+ +HAGS+Y  +   L + ++  +L   G   LP++P  R     G + PHAGY 
Sbjct: 1   MQEIRKPAHAGSFYPADRDDLVKSIESSFLHRLGPGKLPQAPPSRNRTSVGYLVPHAGYM 60

Query: 52  YSGRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           YSG  AA+++ N+      +V ++ GP+H       +L     ++TP+G + +D EV + 
Sbjct: 61  YSGPVAAHSYYNMAQEGAPKVVVVAGPNHTGLGEAASLWQGDAWETPLGTVEVDREVEKL 120

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
           + +  K+   D      EHS+E+ +P+L  +F+ +  K+VPI++   N E          
Sbjct: 121 IISNTKYFTFDNEAHLYEHSVEVQVPFLQYIFDSNF-KLVPIVIKLQNPEVSKDLADALV 179

Query: 171 KYVDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
           K +++      F  SSD           HY+     + K  EAL       +E+ DP+  
Sbjct: 180 KIINENGVDLLFVASSDMN---------HYEPHDVTVSKDNEAL-----RYVESLDPEGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            + L   D T+CG  P    L+M G      +   L++  S     ++D  V YASA   
Sbjct: 226 FRVLEADDITMCGPGPAMTLLYM-GRSLGATRAVILKHATSGDVTGEKDWVVGYASAVVP 284

Query: 289 V 289
           +
Sbjct: 285 L 285


>gi|288940496|ref|YP_003442736.1| hypothetical protein Alvin_0757 [Allochromatium vinosum DSM 180]
 gi|288895868|gb|ADC61704.1| protein of unknown function DUF52 [Allochromatium vinosum DSM 180]
          Length = 267

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 84/293 (28%), Positives = 127/293 (43%), Gaps = 35/293 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLRE-----AGLPKSPEVRGVIAPHAGYSYSGR 55
           M  IR+ + A  +Y  +   L   LD  L +     A  P +   + +I PHAGY YSG 
Sbjct: 1   MPTIRQPAVADRFYPGDPVELGRMLDALLADSPASAASRPGTAPPKALIVPHAGYIYSGP 60

Query: 56  AAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
            AA A+  + P    I RV LLGPSH       A ++A V+ TP+G +P+D   +E    
Sbjct: 61  IAATAYATLAPVHDQIHRVVLLGPSHRLPFMGLAATSADVFATPLGPVPIDRAAVERTLT 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
             +  L+D      EHS+E+ LP+L +V +     +VP++VG    E+ A    L     
Sbjct: 121 LPQVRLLD-AAHAQEHSLEVQLPFLQRVLDD--FSLVPLVVGEAAPESVAEVLDLLWG-- 175

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
             P     VSSD  H      Y+ Y        ++ + +D    + IE   P+A      
Sbjct: 176 -GPETLILVSSDLTH------YLDY--------RTAQRIDAATSEAIEALRPEAIGS--- 217

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              +  CGR P++  L +       ++ + L    S      RD  V Y + A
Sbjct: 218 ---DQACGRAPLNGLLTL--ARRRGLQAETLDRRNSGDTAGSRDQVVGYGAYA 265


>gi|349605881|gb|AEQ00969.1| Protein MEMO1-like protein, partial [Equus caballus]
          Length = 103

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 54/101 (53%), Positives = 65/101 (64%), Gaps = 3/101 (2%)

Query: 192 RFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM 251
           RF Y +YD+  G I++SIE LDKMGM IIE  DP +F  YL +Y NTICGRHPI V L+ 
Sbjct: 2   RFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNA 61

Query: 252 ---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
              L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 62  ITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 102


>gi|344341515|ref|ZP_08772434.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thiocapsa marina 5811]
 gi|343798635|gb|EGV16590.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thiocapsa marina 5811]
          Length = 261

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 85/287 (29%), Positives = 130/287 (45%), Gaps = 33/287 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   RR + AG +Y  + + L   +DG+L EA    +P  + +IAPHAGY YSG  AA A
Sbjct: 1   MTLSRRPAVAGLFYPGDPRELKHMVDGYLAEAPARDTPP-KALIAPHAGYIYSGSIAASA 59

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  + P   +I+RV LLGP+H       A S+A  + TP+G + LD   I+   A  +  
Sbjct: 60  YATLLPAHAAITRVVLLGPAHRVPVRGLATSSAECFDTPLGSVDLDRAAIDSALALPQVR 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-AENEAMYGRLFAKYVDDPS 177
           +MD    + EHS+E+ LP+L +  +     +VP +VG  + AE   +   L+      P 
Sbjct: 120 IMDEAHAQ-EHSLEVQLPFLQEALD--RFTLVPFVVGDTSPAEVAEVLDLLWGG----PE 172

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
               VSSD  H+   ++Y    +       +IEAL+                   + Y+ 
Sbjct: 173 TLIVVSSDLSHY---YDYATAKRLDTATSTAIEALEPQD----------------IHYEQ 213

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             CGR P++  L +        +   LR   S      RD  V Y +
Sbjct: 214 A-CGRIPVNGLLELARRRGLHARTLDLR--NSGDTAGPRDQVVGYGA 257


>gi|386826852|ref|ZP_10113959.1| putative dioxygenase [Beggiatoa alba B18LD]
 gi|386427736|gb|EIJ41564.1| putative dioxygenase [Beggiatoa alba B18LD]
          Length = 264

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 36/286 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGR--AAAYAF 61
           IR  + AG +Y  N   L + +  +L+ A  P + + + +I PHAGY YSG   A AYA+
Sbjct: 6   IRPPAVAGMFYPANPAELQQMVKQFLQVATPPSAIQPKALIVPHAGYLYSGAIAATAYAY 65

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
                 +I RV LLGPSH Y     ALSTA  Y +P+G++ LD +  E L        +D
Sbjct: 66  WLSLARTIKRVVLLGPSHRYPLRGLALSTAQAYASPLGNVLLDHQTDETLLKLPFVHAID 125

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
               ++EHS+E+HLP+L ++F+     ++P++VG  + +     G++             
Sbjct: 126 -QAHQSEHSLEVHLPFLQQLFQQ--FTLIPLVVGDASPQQ---VGQVLELLWGGEETVIV 179

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSD  H      Y  Y     +  ++ +A++++  +              L Y++  CG
Sbjct: 180 ISSDLSH------YHPYATAQAMDARTSQAIEQLNFNA-------------LHYEDA-CG 219

Query: 242 RHPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYAS 284
           R+PI+  L     C  K   + I  +    S       D  V Y +
Sbjct: 220 RNPIAGLL-----CVAKQRHLTITTVDLRNSGDTAGSHDKVVGYGA 260


>gi|336398113|ref|ZP_08578913.1| AMMECR1 domain protein [Prevotella multisaccharivorax DSM 17128]
 gi|336067849|gb|EGN56483.1| AMMECR1 domain protein [Prevotella multisaccharivorax DSM 17128]
          Length = 509

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 82/269 (30%), Positives = 123/269 (45%), Gaps = 35/269 (13%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELD-------GWLREAGL--PKSPEVRGVIAPHAGYSY 52
           + +R+ + AG +Y  + + L  +L+       G L  +G+    S  V+ VI PHAGY +
Sbjct: 33  QNVRQMAVAGQFYAASRQDLQSDLEKCYAEGRGLLARSGVVVGDSARVQAVIVPHAGYVF 92

Query: 53  SGRAAAYAFGNI-DPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           SG  AA AF +I    S   +FLLGPSHH  +      S    Y+TP+GD+ +D ++ +E
Sbjct: 93  SGSVAASAFASIPKDASYQHIFLLGPSHHVAFDGVSVCSVFDAYRTPLGDVAVDTKLCKE 152

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
           L              + EH +E+ LP+L       L  IVP+++G  +         +  
Sbjct: 153 LMRRSPVFTYVSSAHDREHCLEVQLPFLQYRL-ASLPPIVPMIIGTQDFATLKKMAEVLM 211

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
            Y   P N F VSSDF H+ S  + +  D+  G              + I TG  D F +
Sbjct: 212 PYF-TPDNLFVVSSDFSHYPSYSDALTVDRNSG--------------EAITTGRLDKFVE 256

Query: 231 YLLE-----YDN---TICGRHPISVFLHM 251
            L E      DN   + CG+  I+V L M
Sbjct: 257 ALRENECRGVDNLYTSACGQSAIAVLLMM 285


>gi|194333116|ref|YP_002014976.1| hypothetical protein Paes_0272 [Prosthecochloris aestuarii DSM 271]
 gi|194310934|gb|ACF45329.1| protein of unknown function DUF52 [Prosthecochloris aestuarii DSM
           271]
          Length = 287

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 76/291 (26%), Positives = 139/291 (47%), Gaps = 17/291 (5%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE---VRGVIAPHAGYSYSGRAAA 58
           E +R  + A S+Y  + + L   L+  L E+    + E   +R ++ PHAGY++SGRA+A
Sbjct: 3   ENVRYPAVAESFYPSDREELDTFLESILSESTATNNSEKASIRALLVPHAGYAFSGRASA 62

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
            A+  +       VF+LG +H Y     AL T  ++++P+G +P++++  E+ +      
Sbjct: 63  EAYSRLAGNQYRTVFILGNAHAYRFNGIALDTHHIWQSPLGRIPINMDAAEQFRTAAPRL 122

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
            + +DI    ++H +E+ LP+L K  +     I+PIL G  NA++ ++        +  P
Sbjct: 123 IDYLDIA-HHSDHVLEVQLPFLQKTLKTGF-SILPILFGE-NAKDISLKTARILSDILQP 179

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
            +    SSD  H+ S      YD  + +  K+++A+  + M  +E    +  K+ +   D
Sbjct: 180 DDLLIASSDLSHYPS------YDDANAIDRKTLDAIVNIDMKGLERHVRNTMKQNVAHED 233

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTK-RDSSVSYASAA 286
              CG   +   L +          + L Y  S   + + RD+ V Y S A
Sbjct: 234 ALFCGPDGLKTLLRIA--TQRGWTAEELAYCNSGDAEWRDRDAVVGYGSVA 282


>gi|144900060|emb|CAM76924.1| protein containing DUF52 [Magnetospirillum gryphiswaldense MSR-1]
          Length = 452

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 88/297 (29%), Positives = 129/297 (43%), Gaps = 47/297 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDG-WLR-----EAGLPKSPEVRGVIAPHAGYSYSG 54
           M  IR ++ AG +Y      L  +LD  W +     E G+PK+     +IAPHAGY YSG
Sbjct: 1   MTSIRPSAVAGMFYPAEPAELMRQLDACWAQARPHPELGVPKA-----IIAPHAGYVYSG 55

Query: 55  RAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
             AA A+  +     +ISRV LLGPSH       A+STA  + +P+G + LD + +E  K
Sbjct: 56  AVAASAYDLLKTARGTISRVVLLGPSHRVGFRGMAVSTADAWASPLGAVRLDRDGVERAK 115

Query: 113 A---TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
               TG+ E         EHS+E+HLP+L ++       +VP++VG    E  A    L 
Sbjct: 116 TVPLTGELE----AAHAQEHSLEVHLPFLQRMLGD--FTLVPVVVGDAAPEQVAA---LL 166

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
                 P     +S+D  H+    +Y            +IEALD+  +D           
Sbjct: 167 DALWGGPETLIVISTDLSHY---LDYAACQDMDSRTAAAIEALDEGALD----------- 212

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
                  +  CGR P+   L      +  + IK +    S      +D  V Y + A
Sbjct: 213 ------RDQACGRIPVGGLLRT--AKARGMTIKTIDRRNSGDTAGPKDRVVGYGAWA 261


>gi|431932323|ref|YP_007245369.1| dioxygenase [Thioflavicoccus mobilis 8321]
 gi|431830626|gb|AGA91739.1| putative dioxygenase [Thioflavicoccus mobilis 8321]
          Length = 261

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 125/286 (43%), Gaps = 31/286 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   R  + AG +Y  +   L   +D +L +A +   P  + +IAPHAGY YSG  AA A
Sbjct: 1   MSLARPPAVAGRFYPSDPPALKRMVDDYLAKAKV-DGPPPKALIAPHAGYIYSGLIAATA 59

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  +     +IS+V LLGP+H  +    A S+A  ++TP+G +PLD   I+      +  
Sbjct: 60  YAPLVAARETISQVVLLGPAHRVFVRGLAASSAERFETPLGTVPLDRAAIDRTLTLPQVS 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           + D    E EHS+E+ LP+L  V +    K+VP +VG  + E  A    L       P  
Sbjct: 120 ISDAAHAE-EHSLEVQLPFLQVVLDE--FKLVPFVVGDASPEEVAEVLELLWS---GPET 173

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              VSSD  H+    +Y    +      K+IE L    +                 YD  
Sbjct: 174 LIVVSSDLSHY---HDYATAHRLDTATSKAIEGLHPEDIG----------------YDQA 214

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            CGR P++  L +      + K   LR   S      RD  V Y +
Sbjct: 215 -CGRQPVNGLLELARRRGLRAKTLDLR--NSGDTSGPRDQVVGYGA 257


>gi|386811290|ref|ZP_10098516.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386406014|dbj|GAB61397.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 529

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 87/304 (28%), Positives = 143/304 (47%), Gaps = 33/304 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE---VRGVIAPHAGYSYSGRAA 57
           ++++     AG +Y++N   L  +++ + +   +P+ P+      VI+PHAGY YSG+ A
Sbjct: 25  LQEVWEPQVAGRFYSENENVLKGQINTFFKN--IPEQPDKGKPLAVISPHAGYQYSGQVA 82

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPK---CALSTATVYKTPIGDLPLDLEVIEEL--- 111
           AY + +I     +RV LL PSH     +    ++     +KTP+G + +D E    L   
Sbjct: 83  AYGYKSIKGYEFTRVILLSPSHFSSGKRFRGVSILKTKNFKTPLGLITVDQEACNYLLTF 142

Query: 112 ----------KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN 161
                     K T  F   +      EHS+E  LP+L ++  G+   +VPI+VG +  ++
Sbjct: 143 AKGSSPTAPQKTTTLFGSYEGAYS-GEHSLETQLPFL-QLSLGNFT-LVPIMVGILVDDD 199

Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE 221
                      ++D +    VSSDF H+G  + Y+ +      I K+I+ALD    + I 
Sbjct: 200 FDQIAHAIRPLLNDKT-LVVVSSDFTHYGEGYGYVPFKTN---IEKNIKALDYGAFEKIL 255

Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLG-NCSTKIKIKFLRYEQSSQCKTKRDSSV 280
           + D    + Y  E     CG  PI++ L +L  N   +I    L Y+ S +       SV
Sbjct: 256 SKDFHGLRTYRKETGINTCGIIPIALLLKLLPENAQGEI----LNYDTSGRQSNNFSFSV 311

Query: 281 SYAS 284
           SYAS
Sbjct: 312 SYAS 315


>gi|410696082|gb|AFV75150.1| putative dioxygenase [Thermus oshimai JL-2]
          Length = 280

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 90/291 (30%), Positives = 128/291 (43%), Gaps = 36/291 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME +RR + AG +Y    + L  E++G L  A  P  P VRG +APHAGY YSGR AA A
Sbjct: 1   MELVRREAAAGVFYPREGETLRREVEGLLEAARTPPLPGVRGALAPHAGYLYSGRVAAEA 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
           F  +        RV LLGPSH       A      ++TP+G+ P+DLE    L A G  F
Sbjct: 61  FRALSAWKGRARRVLLLGPSHFVPFLGVAFHPYRAWRTPLGETPVDLEGGRRLLARGHPF 120

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
             ++    E EHS+E+ LP+L          I+P+L G V+     EA+   L       
Sbjct: 121 RALEEPFRE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPLGVAEALEEEL------G 171

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P +    SSD  H+                H    A  + G    +T +       L   
Sbjct: 172 PRDLVVASSDLSHY----------------HPEPVARARDG----KTLEKALALDPLALA 211

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ++  CGR P +    +      + ++  L Y  S++ K +R   V YA+ A
Sbjct: 212 ESEACGRLPWASLTALARGLGWRPRL--LAYATSAEAKGERARVVGYAAVA 260


>gi|153875832|ref|ZP_02003453.1| protein of unknown function DUF52 [Beggiatoa sp. PS]
 gi|152067703|gb|EDN66546.1| protein of unknown function DUF52 [Beggiatoa sp. PS]
          Length = 195

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 96/188 (51%), Gaps = 9/188 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y DN+  L+  +D +L++  +   P  + +IAPHAGY YSG  AA  +  
Sbjct: 5   VRTPAVAGMFYPDNAAELSNMVDQFLQDVKV-AGPVPKAIIAPHAGYIYSGPIAASVYAR 63

Query: 64  IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           + P    I RV LLGPSH       A ST   + TP+G +P+D   I+ + +  +  +++
Sbjct: 64  LTPVHDQIQRVVLLGPSHQVPLKGLAASTMLQFATPLGKIPVDKPAIDNILSLPQVSVLE 123

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EHS+E+ LP+L KV       ++P++VG    +  A    +  K    P     
Sbjct: 124 QA-HVNEHSLEVQLPFLQKVLGN--FSLIPLVVGEATPKQVA---EVLEKLWGGPETLIV 177

Query: 182 VSSDFCHW 189
           VSSD  H+
Sbjct: 178 VSSDLSHY 185


>gi|313227297|emb|CBY22443.1| unnamed protein product [Oikopleura dioica]
          Length = 113

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/111 (43%), Positives = 73/111 (65%), Gaps = 2/111 (1%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RRASHAGSWY+ + + L  +L+ W+ +A      + + VI PHAGY+YS   AA++F  +
Sbjct: 4   RRASHAGSWYSGDPRELENQLENWISKAK--FEKKAKAVIVPHAGYAYSAPTAAWSFLQL 61

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           D  +  ++F++GPSHH Y P CAL      +TP+G+L +D +++ EL ATG
Sbjct: 62  DAQTTKKIFVIGPSHHVYLPNCALPVVKECETPLGNLRIDKDIVTELHATG 112


>gi|429217048|ref|YP_007175038.1| dioxygenase [Caldisphaera lagunensis DSM 15908]
 gi|429133577|gb|AFZ70589.1| putative dioxygenase [Caldisphaera lagunensis DSM 15908]
          Length = 279

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 142/293 (48%), Gaps = 36/293 (12%)

Query: 8   SHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +HAGS+Y      L + ++         G L  A   +  E  G + PHAGY YSG  AA
Sbjct: 3   AHAGSFYPSKKDDLIKSIESSYLHKLGPGMLPNAERERKKESIGYMVPHAGYIYSGPIAA 62

Query: 59  YAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           +++ NI      +VF++ GP+H       ++    +++TP+G++ +D E+ + L +  K+
Sbjct: 63  HSYFNISKEGKPKVFIIAGPNHTGLGENASVWKEGIWETPLGEVEVDSEISKLLVSYSKY 122

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMYGRLFAKY 172
              D      EHS+E+ +P+L  +F+   IKIVPI++   N E       A+Y  +    
Sbjct: 123 FTFDEEAHIYEHSVEVQIPFLQYIFKD--IKIVPIVIKLQNYEVSLDLANALYRIITENG 180

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           VD     +  SSD  H      Y  YD    + +K     D++ +  IE+ D +   K L
Sbjct: 181 VD---LIYIASSDMNH------YEPYD----ITYKK----DEIALKKIESLDTEGLFKVL 223

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            E + T+CG  P+   +H+    ++K +I  L++  S     ++D  V YASA
Sbjct: 224 DENNITMCGPGPVGALIHLSKKLNSKAEI--LKHATSGDITGEKDWVVGYASA 274


>gi|254458451|ref|ZP_05071876.1| ammecr1 domain protein [Sulfurimonas gotlandica GD1]
 gi|373866308|ref|ZP_09602706.1| protein containing MEMO/AMMECR1 domain [Sulfurimonas gotlandica
           GD1]
 gi|207084759|gb|EDZ62046.1| ammecr1 domain protein [Sulfurimonas gotlandica GD1]
 gi|372468409|gb|EHP28613.1| protein containing MEMO/AMMECR1 domain [Sulfurimonas gotlandica
           GD1]
          Length = 485

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 69/278 (24%), Positives = 130/278 (46%), Gaps = 27/278 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AGS+YT+N   L  E++  LR+A       +  +I PHAGY +S   A+ A+  
Sbjct: 23  LRPTAVAGSFYTNNKPELQREVNTLLRDAKTFPKQNINALIVPHAGYVFSANVASTAYKT 82

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           ++      +FL+G SH+      ++     YKTP+G++ ++  +  +L    K+ +    
Sbjct: 83  LNK-KYKNIFLIGSSHYTSFDGASVYNIGDYKTPLGNVQVNRSIASKLIKDSKYFVFRAE 141

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
             E EH++E+ LP+L  ++ G  +KIVPI++   N +      +  + Y +D  N F +S
Sbjct: 142 AHEKEHTLEVQLPFLQTIY-GDDLKIVPIIIATSNLQTIISISKTLSPYFND-DNLFIIS 199

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN------ 237
           +D  H+ +               +    +DK  +D +    P  F + L + ++      
Sbjct: 200 TDLSHYPT--------------FRDANIVDKNILDSLVKNSPKKFVEALAKNEDAKIEAL 245

Query: 238 --TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
             + CG   +   L+M  + + K +   L Y+ S+  K
Sbjct: 246 QTSACGWTSLLTLLYMTQDKAYKYET--LEYKNSAHSK 281


>gi|357406969|ref|YP_004918893.1| hypothetical protein MEALZ_3652 [Methylomicrobium alcaliphilum 20Z]
 gi|351719634|emb|CCE25310.1| conserved protein of unknown function [Methylomicrobium
           alcaliphilum 20Z]
          Length = 259

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 84/286 (29%), Positives = 128/286 (44%), Gaps = 39/286 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR----EAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           RR + AG +Y D+  +L   +D +L     EA +PK+     +IAPHAGY YSG  AA A
Sbjct: 3   RRPAVAGLFYPDDPTKLRTMVDHYLNDAETEAKVPKA-----IIAPHAGYIYSGPVAASA 57

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  +     SI+RV L+GPSH       A++    Y TP+G++ +D   +E L      E
Sbjct: 58  YARLKQARDSITRVVLIGPSHRVAFRGLAVTRTDYYTTPLGNVVIDRTAVEALIKLPFVE 117

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            ++      EHS+E+HLP+L +       K+VPI+ G  +A+      R        P  
Sbjct: 118 YIEQA-HTHEHSLEVHLPFLQETLAD--FKLVPIVAGDASADQ---VSRALDLVWGGPET 171

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              +SSD  H+                + + + LD+    +IE        +Y       
Sbjct: 172 LIVISSDLSHYHD--------------YSTAKKLDQQTSTLIEQ------MQYEKLSSEA 211

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            CG+ P+S  L +L   S +IK   LR   S      +D  V Y +
Sbjct: 212 ACGKVPVSGLLKLLRQKSMRIKAVDLR--NSGDTAGDKDRVVGYGA 255


>gi|340781796|ref|YP_004748403.1| hypothetical protein Atc_1054 [Acidithiobacillus caldus SM-1]
 gi|340555949|gb|AEK57703.1| protein containing DUF52 [Acidithiobacillus caldus SM-1]
          Length = 266

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 82/287 (28%), Positives = 128/287 (44%), Gaps = 32/287 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR  + AG +Y      L  +LD ++  +   K    + +IAPHAGY YSG  A  A
Sbjct: 1   MSHIRPPAVAGRFYPARVTELQHQLDAFIPSSVASKQLHPKVLIAPHAGYVYSGPVAGKA 60

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  +      ISRV LLGP+H+      A+  A  + TP+G +P++ + +  +    +  
Sbjct: 61  YAQLQGLRGQISRVVLLGPAHYVRLRGLAIPEADAFATPLGTVPVNRDALNAISDLPQVI 120

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPS 177
             D+     EHS+E+HLP+L ++       +VP+LVG A   E   +  RL+      P 
Sbjct: 121 QSDVP-HNPEHSLEVHLPFLQRMLGD--FSVVPLLVGDATYTEVAEVLERLWG----GPE 173

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
               +S+D  H      Y+ Y         + + LD    +IIE   P+A     + Y+ 
Sbjct: 174 TLIVISTDLSH------YLDY--------ATAQRLDARTAEIIEALQPEA-----IHYEQ 214

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             CGR P+   L +      +I+   LR   S      RD  V Y +
Sbjct: 215 A-CGRIPLGGLLELARKRHMRIERVDLR--NSGDTTGPRDQVVGYGA 258


>gi|119946258|ref|YP_943938.1| hypothetical protein Ping_2620 [Psychromonas ingrahamii 37]
 gi|119864862|gb|ABM04339.1| hypothetical protein DUF52 [Psychromonas ingrahamii 37]
          Length = 282

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 103/192 (53%), Gaps = 14/192 (7%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K R A+ AGS+Y   + ++ +EL  +L  A    + + + +I PHAGY YSG  A YA+ 
Sbjct: 24  KYRTAAVAGSFYPATADQIDQELSVFLN-APSESTTQAKALIVPHAGYCYSGAVAGYAYS 82

Query: 63  NIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
            +   +  I+RV LLGPSH      CA+S+   + TPIG +P+D     +L      E +
Sbjct: 83  YLKNIAHNINRVILLGPSHRVALQGCAISSCDFFTTPIGPIPVDKSAYTQLLD----EKL 138

Query: 121 DICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
               D+A   EHS+E+ LP+L +  +  +  +VPI+VG  + ++ +    +    V++P 
Sbjct: 139 VTINDQAHLLEHSLEVQLPFLQRSLQNFV--LVPIVVGQCSVQHVSQILEILK--VNEPG 194

Query: 178 NFFSVSSDFCHW 189
               VSSD  H+
Sbjct: 195 TLVVVSSDLSHY 206


>gi|345873097|ref|ZP_08825017.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thiorhodococcus drewsii AZ1]
 gi|343917581|gb|EGV28376.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thiorhodococcus drewsii AZ1]
          Length = 261

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 128/293 (43%), Gaps = 41/293 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M +IR+ + A  +Y  N++ L   LD +L        P  + +IAPHAGY YSG  AA A
Sbjct: 1   MSQIRQPAVANRFYPGNAQELGVMLDLYLSSDRTGAQPP-KALIAPHAGYVYSGPIAASA 59

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  + P   +I RV LLGPSH       A S A+ + TP+G + LD   IE      + +
Sbjct: 60  YATLAPVRDAIRRVVLLGPSHRLPFQGLAASMASDFATPLGLVRLDRAAIESALKLPQVQ 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRLFAKYV 173
            ++      EHS+E+ LP+L +       K+VP +VG    E+     +A++G       
Sbjct: 120 PLE-SAHTFEHSLEVQLPFLQRALGD--FKLVPFVVGEATPESVAEVLDALWG------- 169

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
             P     +SSD  H      Y+ Y+  +         LD+     IET DP+       
Sbjct: 170 -GPETLILISSDLSH------YLDYETAN--------RLDRQTSTAIETLDPNGIGP--- 211

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              +  CG +P+   L +      + +   LR   S      RD  V Y + A
Sbjct: 212 ---DQACGHYPLRGLLTLAKRRGLQAETLDLR--NSGDTAGSRDQVVGYGAYA 259


>gi|307594859|ref|YP_003901176.1| hypothetical protein Vdis_0732 [Vulcanisaeta distributa DSM 14429]
 gi|307550060|gb|ADN50125.1| protein of unknown function DUF52 [Vulcanisaeta distributa DSM
           14429]
          Length = 286

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 139/299 (46%), Gaps = 29/299 (9%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPE-----VRGVIAPHAGY 50
           M ++R+ + AG++Y  +  +L ++++ W     L    L K P      V  VI PHAGY
Sbjct: 1   MVRVRKPAVAGAFYEADRDKLIKQIE-WSISHPLGPGKLVKQPREGFKAVPIVIVPHAGY 59

Query: 51  SYSGRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
            YSG  AA +F  I      R F+L GP+H+      A+     ++TP+G L +D E+ +
Sbjct: 60  IYSGPVAAMSFAEIYRFHSPRTFILVGPNHYGIGAPVAIYPEGAWETPLGTLEIDSEIAK 119

Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
           EL +  K+   D+     EHS+E+ LP++  +F G+ +KIVPI++     E     G   
Sbjct: 120 ELMSRVKYLEPDVYAFAQEHSLEVQLPFIQYIF-GNNVKIVPIIIWRQTKEVARDLGNAI 178

Query: 170 AKYVD--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
           A+ +   +P +   V+S         ++ HY+       K ++A+D      +   D D+
Sbjct: 179 AEVISSHEPGSIVYVASS--------DWNHYEPHEITTEKDMKAIDP-----VLRLDEDS 225

Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           F   +  YD + CG   I+  +        K  +  LR+  S         +V YAS A
Sbjct: 226 FFDVIERYDVSACGYGAIATAIVTAKRLGVK-NVVLLRHATSGDTSGYTLETVGYASIA 283


>gi|383764689|ref|YP_005443671.1| hypothetical protein CLDAP_37340 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381384957|dbj|BAM01774.1| hypothetical protein CLDAP_37340 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 270

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 80/288 (27%), Positives = 127/288 (44%), Gaps = 30/288 (10%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K+R+ + AG +Y  +   L   +  ++ +A +P    VR VIAPHAGY  SG  A  AF 
Sbjct: 5   KVRKPAVAGRFYPASPATLHNAVRQYIEQASIPALQGVRAVIAPHAGYVCSGAVAGAAFC 64

Query: 63  NIDPTSISR--VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
            +   S S   ++LLGP+H+      A+S+A  + TP+G +P+ ++ +  L  T     +
Sbjct: 65  ALQTASASEPVIYLLGPAHYKIVRGVAVSSADAFATPLGPVPVAIDAVTRLATTNHLAHI 124

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D      EH +E+ LP+L  +F G  + IVP+L+    A+   +   L  +  + P    
Sbjct: 125 DDWAHAPEHCLEVELPFLQYIF-GERLHIVPMLLDE-EADVTGVAEFLAQEVRERPDALI 182

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEAL--DKMGMDIIETGDPDAFKKYLLEYDNT 238
            VSSD  H+                H   EA+  D+  ++ I  GD              
Sbjct: 183 VVSSDLSHY----------------HPYAEAIRRDRSLLEAILAGDLARVAA------GE 220

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CGR PI   +H+    +   +   L Y  S      +   V YA+ A
Sbjct: 221 ACGRLPILCLMHVAQQLNW--QAHRLAYANSGDTCGPKHEVVGYAALA 266


>gi|335436406|ref|ZP_08559201.1| hypothetical protein HLRTI_04897 [Halorhabdus tiamatea SARL4B]
 gi|334897718|gb|EGM35847.1| hypothetical protein HLRTI_04897 [Halorhabdus tiamatea SARL4B]
          Length = 283

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 141/301 (46%), Gaps = 38/301 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
           M+  R+ + AG +Y  N   L E+++G    A  P S        PE+ G++APHAG  +
Sbjct: 1   MDSARQPAVAGRFYVRNEAPLREQVEGMFTHAVGPGSVPTATAGSPEIAGLVAPHAGLPF 60

Query: 53  SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           SG  AA+++  +  +     V +LGP+H       A+     ++TP+G + +D ++ E++
Sbjct: 61  SGPVAAHSYAALAESGTPETVVILGPNHTGVGAAVAVPGDDEWRTPLGSVQIDADLREQI 120

Query: 112 KATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
                 +  D  VD+    +EH+ E+ LP+L  +++   + ++PI +   +A+     G 
Sbjct: 121 -----VDSTDATVDDRAHASEHAAEVQLPFLQYLYDD--LAVLPISLRRQDADVSRQLGE 173

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
             A + D+ +   + SSDF H+          + H V   +IE  D++ +D IE  DP  
Sbjct: 174 ALAAHTDEETVVIA-SSDFTHY----------EPHDV---AIE-RDELALDRIEANDPAG 218

Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
             + +     ++CG   I+  L   G  S   ++    +  S      RD  V YAS A 
Sbjct: 219 LIETVEREGLSVCGYGAIAAMLWANGEDS---QVDVFAHATSGDTAGSRDDVVGYASVAV 275

Query: 288 K 288
           +
Sbjct: 276 R 276


>gi|383783463|ref|YP_005468029.1| hypothetical protein LFE_0171 [Leptospirillum ferrooxidans C2-3]
 gi|383082372|dbj|BAM05899.1| hypothetical protein LFE_0171 [Leptospirillum ferrooxidans C2-3]
          Length = 269

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 84/294 (28%), Positives = 140/294 (47%), Gaps = 37/294 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRG----VIAPHAGYSYSG 54
           M  +R  + AG +Y      L + ++G L EA   +P + + RG    +I PHAG  YSG
Sbjct: 1   MTIVRSPAVAGVFYPKEPDILLDMVEGMLEEALGKIPPAIQARGTPRALIVPHAGLIYSG 60

Query: 55  RAAAYAFGNIDPTSIS-RVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA A+  ++P+ +  R+F+LGPSH  +    A STAT + TP+G++P+D  ++++++ 
Sbjct: 61  PIAASAYCLMNPSFLPKRIFVLGPSHRIFFKGIATSTATSFGTPLGEIPVDRTLVDQMET 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
               +  D    + EHS+E+ LP+L KV +   + I+P++ G  +A + A   R+    +
Sbjct: 121 LPGIDTYDPAHTQ-EHSIEIQLPFL-KV-QMDQVPIIPLIAGESSASSVA---RVIKTAL 174

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD-KMGMDIIETGDPDAFKKYL 232
             P     +S+D  H      Y+  D+         + LD +   DII         +Y 
Sbjct: 175 SFPDTLVVISTDLSH------YLPPDEG--------QLLDARTASDIIS-------PEYR 213

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              +   CG  P++ FL        K +I  LR   S      ++  V Y S A
Sbjct: 214 PLTNEMACGSQPLNGFLMAAQEAGLKGEILDLR--NSGDTAGPKNEVVGYGSFA 265


>gi|385811190|ref|YP_005847586.1| dioxygenase [Ignavibacterium album JCM 16511]
 gi|383803238|gb|AFH50318.1| Putative dioxygenase [Ignavibacterium album JCM 16511]
          Length = 272

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 127/286 (44%), Gaps = 19/286 (6%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAY 59
           M+K+R    AG +Y  +   L  E+   L+EA   KS  ++ G+++PHAGY YSG+ AAY
Sbjct: 1   MKKVRPPHVAGYFYPADPVTLHNEIQRMLKEAKSEKSFDKIVGIVSPHAGYMYSGKTAAY 60

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+  +   +     ++ PSH  Y P  ++     Y+TP+G + +D E   +L    K   
Sbjct: 61  AYNTLKGKNYKTAIVISPSHAEYFPGISVYDGDAYETPLGIVEIDEERANKLVEGSKLIF 120

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
             I     EH++E+ +P+L  V +    KIVP+++G  ++       R  A+ +DD +  
Sbjct: 121 RGIQGHRREHALEVQIPFLQSVLDD--FKIVPVVMGDQSSVFVNELARKLAEVMDDKT-I 177

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSD  H+   +N    D+   ++ + I   D            +  +K L  +    
Sbjct: 178 IVASSDLSHF---YNASQADELDSIVEQRINNFDF-----------EQLQKDLDSHRCEA 223

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           CG  PI V +          K   LR   S      +   V Y SA
Sbjct: 224 CGGGPI-VAMMKAAALKNYDKATVLRRTDSGDVTGDKSEVVGYLSA 268


>gi|381159503|ref|ZP_09868735.1| putative dioxygenase [Thiorhodovibrio sp. 970]
 gi|380877567|gb|EIC19659.1| putative dioxygenase [Thiorhodovibrio sp. 970]
          Length = 267

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/298 (28%), Positives = 131/298 (43%), Gaps = 45/298 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----EVRGVIAPHAGYSYSGR 55
           M  IR A  AG++Y  +   L  E+D    +     +P      ++ +I PHAGY YSG 
Sbjct: 1   MTTIRHACVAGTFYPADPAHLRAEIDACFAQRAPANAPGLAGERIKALIVPHAGYVYSGP 60

Query: 56  AAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
            AA A+ ++ P +  I RV LLGP+H       ALS A  ++TP+G +P+     E+L  
Sbjct: 61  VAASAYASLQPQADEIRRVVLLGPAHRMPFQGLALSAADAWQTPLGSVPVARSACEQLLV 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRL 168
             + E +D   D  EHS+E+ LP+L ++      +++P+LVGA +A       E+++G  
Sbjct: 121 FPQVERLDAAFD-GEHSLEVQLPFLQRLLGD--FELIPLLVGAASAAEVAEVLESLWG-- 175

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
                        +SSD  H+              +   S  ALD      I   +P+  
Sbjct: 176 ------GDETLILISSDLSHF--------------LDDTSARALDAQTAAAILRLEPE-- 213

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
               L      CGR+P+   L +       +K + L    SSQ     D  V Y + A
Sbjct: 214 ----LIGPEQACGRYPMGGLLRI--ARMRGLKPEQLDLRNSSQTAGSPDRVVGYGAYA 265


>gi|296272331|ref|YP_003654962.1| hypothetical protein [Arcobacter nitrofigilis DSM 7299]
 gi|296096505|gb|ADG92455.1| protein of unknown function DUF52 [Arcobacter nitrofigilis DSM
           7299]
          Length = 265

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 129/285 (45%), Gaps = 37/285 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
            + + +G++Y D+ K L +  + +   E  +      R +I PHAGY YSG  A  A+  
Sbjct: 4   NKMNFSGTFYPDDKKELLKYFEVFTANEEKIKLDINPRAIIVPHAGYIYSGSVANVAYNL 63

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
              T   RV ++GPSH Y+    +++    Y+TP+GDL +D E   +L    K++ +   
Sbjct: 64  SKDTKPKRVIVIGPSHRYFLQGASIAQFDSYETPLGDLVIDKEFGSKL--IEKYDFLHFV 121

Query: 124 VD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
            D   EHS E  +P++   F+   +K++ I+ G ++A+  ++      K   D  NF  +
Sbjct: 122 EDAHMEHSTETQIPFIKNYFD---VKVLEIVYGKIDAKELSLLVLELLK---DKDNFLVI 175

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           S+D  H+ +                    LD + +D I+  D +   K     D   CG+
Sbjct: 176 STDLSHFYTL--------------DEANILDGICIDAIKNQDIEKLTK-----DCEACGK 216

Query: 243 HPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
             +   L      + K+   K+KFL Y+ S +    +   V YAS
Sbjct: 217 EGVKALLQ-----TAKVFDYKVKFLDYKTSYEITKDKSKVVGYAS 256


>gi|327311477|ref|YP_004338374.1| hypothetical protein TUZN_1595 [Thermoproteus uzoniensis 768-20]
 gi|326947956|gb|AEA13062.1| hypothetical protein TUZN_1595 [Thermoproteus uzoniensis 768-20]
          Length = 281

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 130/294 (44%), Gaps = 28/294 (9%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELD-------GWLREAGLPKSPEVRGVIAPHAGYSYS 53
           M +IR+ + AG +Y  + + L E +        G   E+  P+  +  G +APHAGY YS
Sbjct: 1   MARIRKPAVAGYFYEADREALVERIRWSIYHDLGPKTESLSPEGAKALGAVAPHAGYIYS 60

Query: 54  GRAAAYAFGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+A+  +        V ++GP+H+      A+  + +++TP+G L +D E  E L 
Sbjct: 61  GPVAAWAYAALRGYGRPDTVVVIGPNHYGVGAPVAIMKSGIWETPLGALEIDEEAAEFLS 120

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
           +  +    D      EHS+E+H+P++   F    +KIVP+ +           G   AK 
Sbjct: 121 SEYRSLEDDFYAFSKEHSIEVHIPFIQYFFGD--VKIVPVALWRQTPSTAKELGAALAKM 178

Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
           V +     +   SSD  H      Y H        H+     D++ +  I  GD + F  
Sbjct: 179 VANFKKRIYIIASSDLNH------YEH--------HEVTAKKDELAISKIVAGDVEGFFD 224

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            +  YD + CG  PI     M    +   K K L++  S      R+ +V YAS
Sbjct: 225 VISRYDISACGIGPIGAL--MKAAAALGFKAKLLKHATSGDTSGYREETVGYAS 276


>gi|328848465|gb|EGF97679.1| hypothetical protein MELLADRAFT_41191 [Melampsora larici-populina
           98AG31]
          Length = 144

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 55/136 (40%), Positives = 81/136 (59%), Gaps = 18/136 (13%)

Query: 120 MDICVDEAEHSMEMHLPY---LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           MDI  D+AEHS+E+HLPY   ++K F+   I  VPIL+G++++E E +YG++   Y+  P
Sbjct: 1   MDIKTDKAEHSIELHLPYVRHISKSFQT--ITNVPILIGSISSEKEFIYGKVLQPYLLSP 58

Query: 177 SNFFSVSSDFCHWGSRFNYMHY-----------DKKHGVIHKSIEALDKMGMDIIETGDP 225
            N F +SSDF   GSRF+Y +Y                 I++SIEALD+  + I+E+   
Sbjct: 59  ENLFVISSDF--LGSRFSYTYYKDLARKLSSGASPPATPIYQSIEALDQEALKILESLSH 116

Query: 226 DAFKKYLLEYDNTICG 241
             F +YL +  N ICG
Sbjct: 117 SEFSRYLSQTKNNICG 132


>gi|386359785|ref|YP_006058030.1| dioxygenase [Thermus thermophilus JL-18]
 gi|383508812|gb|AFH38244.1| putative dioxygenase [Thermus thermophilus JL-18]
          Length = 263

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 88/291 (30%), Positives = 122/291 (41%), Gaps = 36/291 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ +R    AG +Y      L EE++  L  A  P  P VRGV++PHAGY+Y+GR  A A
Sbjct: 1   MDLVRPPQVAGYFYPGEKAALEEEVEALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
           F  +        RVFLLGPSH    P  A      ++TP+G++ +DLE    L   G  F
Sbjct: 61  FRALSAWRGKARRVFLLGPSHFVAFPGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 120

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
                   E EHS+E+ LP+L          I+P+L G V+     EA+   L       
Sbjct: 121 RAYREPFLE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPGEVAEALLPEL------G 171

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P +    SSD  H+         D K     K   ALD  G+                  
Sbjct: 172 PKDLVVASSDLSHYHPDPVARRLDAK---TLKRALALDAEGVA----------------- 211

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
               CGR P S    +    +   K + L Y  S++ +  R+  V Y + A
Sbjct: 212 QGEACGRLPWSTLTAL--ARALGWKPRLLAYATSAEARGGRERVVGYGALA 260


>gi|373458175|ref|ZP_09549942.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Caldithrix abyssi DSM 13497]
 gi|371719839|gb|EHO41610.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Caldithrix abyssi DSM 13497]
          Length = 276

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 79/286 (27%), Positives = 128/286 (44%), Gaps = 23/286 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           IR+ + AG +Y+ N   L  E+  +L  +   K+   + GV+APHAGY YSG  AA A+ 
Sbjct: 6   IRKPAVAGMFYSGNRSTLEREVAVFLENSNQEKNVRHIYGVVAPHAGYMYSGGVAARAYR 65

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA-TGKFELMD 121
            +       V ++ PSHH Y    ++     Y+TP+G +P+D  +  +L     +  L  
Sbjct: 66  QVMDFEYEVVVVIAPSHHVYFETVSIYDGDFYETPMGLIPVDKNLCRQLADFDQRLTLSS 125

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           I  +  EH++E+ LP+L  +FE    K+VPI++G  + +N        A  +D+      
Sbjct: 126 IGHEGEEHALEVQLPFLQHIFEE--FKLVPIVMGDQSMKNIQALANALAAVLDNKKTLIV 183

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+ S              ++    LD + +  I     D   + L      +CG
Sbjct: 184 ASSDLSHYHS--------------YEEAVRLDSVVISHINNFKEDNLYEDLQSGLCEMCG 229

Query: 242 RHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
             P+   + M+  C  S   K K + Y  S     +R   V Y +A
Sbjct: 230 GGPV---IAMMKACRKSGADKAKVVLYRNSGDVTGERAPVVGYLAA 272


>gi|148656531|ref|YP_001276736.1| hypothetical protein RoseRS_2409 [Roseiflexus sp. RS-1]
 gi|148568641|gb|ABQ90786.1| protein of unknown function DUF52 [Roseiflexus sp. RS-1]
          Length = 284

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 83/288 (28%), Positives = 124/288 (43%), Gaps = 23/288 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
           IR +  AGSWY  N   L  E+D +L +A  P  P +V GV+APHAG  YSG  AA+AF 
Sbjct: 3   IRPSPIAGSWYPSNPAHLQHEIDRYLAQAEPPVLPGKVWGVLAPHAGVRYSGPIAAWAFA 62

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI----EELKATGKFE 118
            +   +   + +  P H         +  T Y+TP+G +P+D   I    E L+    F 
Sbjct: 63  CVRGRTPEIIVIASPWHRGGPTPLITTGHTAYETPLGIVPVDNNAIAQLDEALRRRAGFG 122

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           L     D+ EH++E+ LP+L +VF      ++P+++   +A   A  G   A+ +     
Sbjct: 123 LTPRRHDD-EHAVEIELPFLQRVFGSFW--LLPVMLADQSAVTSAALGAALAETLRGRDA 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSD  H+          +   V  +    LD   +  +E  DPDA      E    
Sbjct: 180 LLVASSDLSHY----------EPEPVARR----LDAGLLRRVEAFDPDAVIAAEAEGVGY 225

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CG   I+  L          ++  LRY  S   +      V Y +AA
Sbjct: 226 ACGSGAIAAVLWACRELGAN-RVTILRYGTSGDVERAAPWVVGYGAAA 272


>gi|427428692|ref|ZP_18918732.1| Putative dioxygenase [Caenispirillum salinarum AK4]
 gi|425881800|gb|EKV30484.1| Putative dioxygenase [Caenispirillum salinarum AK4]
          Length = 470

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 82/292 (28%), Positives = 133/292 (45%), Gaps = 33/292 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
           M  +R A+ AG++Y  ++  L   +D  +   R A   ++P  + +IAPHAGY +SG  A
Sbjct: 1   MTHMRPAAVAGAFYPGDAATLTRTVDDLMAEARAATPAETPVPKAIIAPHAGYMFSGPTA 60

Query: 58  AYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           A A+  + P    + RV LLGP+H       AL T   ++TP+G + LD + I ++    
Sbjct: 61  ALAYARLAPLKGRVKRVVLLGPAHRVPFRGLALPTVDSFETPLGPVALDTDAINDVAEHV 120

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA-ENEAMYGRLFAKYVD 174
                     + EHS+E+HLP+L  +  G   ++VP+ VG   A E  A+   L+     
Sbjct: 121 PGAGFSDKAHQQEHSLEVHLPFLRAIL-GPEPEVVPVCVGDAKADEVAALLDHLWGGE-- 177

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
                  +S+D  H      ++ YD    V   ++ A++ +        DP       L 
Sbjct: 178 --ETLIVISTDLSH------FLDYDTARRVDAGTVAAVEHL--------DPSR-----LT 216

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ++N  CG  P++  L  L     ++ I+ L    S      RD  V YA+ A
Sbjct: 217 HENA-CGLTPMAGLL--LSAKKRRMSIETLDARSSGDTAGPRDRVVGYAAWA 265


>gi|218295903|ref|ZP_03496683.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
 gi|218243641|gb|EED10169.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
          Length = 264

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 88/295 (29%), Positives = 132/295 (44%), Gaps = 36/295 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ +R A  AG +Y    + L EE+   L+ A  P  P VRGV+APHAGY YSG+ AA +
Sbjct: 1   MDLVRPAYVAGQFYPGEREALLEEVASLLKAAKTPPLPGVRGVLAPHAGYGYSGKVAAES 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
           F  +        R  LLGPSH       A      ++TP+G++ +DLE    L   G  F
Sbjct: 61  FQALSGFRGQAQRAILLGPSHFVPFLGVAFYPYRAWRTPLGEVAVDLEGGRRLLERGFPF 120

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
             ++    E EHS+E+ LP+L     G  + I+P+L G V+     EA+   L       
Sbjct: 121 LRLEEPFWE-EHSLEVLLPFLQVALPG--VPILPLLFGEVDPLRVAEALLPEL------S 171

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P +    SSD  H+       H D    V  +     D+  ++I  + D +A  +     
Sbjct: 172 PQDLVVASSDLSHY-------HPDP---VARER----DRRTLEIALSLDAEALSR----- 212

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
               CGR P +    +      + ++  L Y  S++    R   V YA+ A  +D
Sbjct: 213 -AEACGRLPWASLTALARELGWRPRL--LAYATSAEAFGDRRRVVGYAAVAYGLD 264


>gi|55981680|ref|YP_144977.1| hypothetical protein TTHA1711 [Thermus thermophilus HB8]
 gi|55773093|dbj|BAD71534.1| conserved hypothetical protein [Thermus thermophilus HB8]
          Length = 456

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/291 (30%), Positives = 121/291 (41%), Gaps = 36/291 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ +R    AG +Y      L EE+   L  A  P  P VRGV++PHAGY+Y+GR  A A
Sbjct: 1   MDLVRPPQVAGYFYPGEKAALKEEVKALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
           F  +        RVFLLGPSH    P  A      ++TP+G++ +DLE    L   G  F
Sbjct: 61  FRALSAWRGKARRVFLLGPSHFVAFPGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 120

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
                   E EHS+E+ LP+L          I+P+L G V+     EA+   L       
Sbjct: 121 RAYREPFLE-EHSLEVLLPFLQVALPQ--TPILPLLFGEVDPGEVAEALLPEL------G 171

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P +    SSD  H+         D K     K   ALD  G+                  
Sbjct: 172 PKDLVVASSDLSHYHPDPVARRLDAK---TLKRALALDAEGVA----------------- 211

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
               CGR P S    +    +   K + L Y  S++ +  R+  V Y + A
Sbjct: 212 QGEACGRLPWSTLTAL--ARALGWKPRLLAYATSAEARGGRERVVGYGALA 260


>gi|452944125|ref|YP_007500290.1| putative dioxygenase [Hydrogenobaculum sp. HO]
 gi|452882543|gb|AGG15247.1| putative dioxygenase [Hydrogenobaculum sp. HO]
          Length = 262

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 136/282 (48%), Gaps = 24/282 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  + AG +Y  +   L + ++  + E  + +  + +GVI PHAGY YSG  A   +  
Sbjct: 2   IREEAVAGLFYPGDKDILLQAIN-LVCENEVIQDVDAKGVIVPHAGYIYSGHTACSVYKR 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P     + ++GP+H    P  ++ ++  +KTP+G++P++ E+ +++ + G  E+ +  
Sbjct: 61  LKPKK--NIIMMGPNHTGQGPVISIDSSDAWKTPLGEVPINKELRDKIASFG-IEI-EPK 116

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
               EHS+E+ LP+L   F+     IVP+++G ++ E     G+  + +VDD      VS
Sbjct: 117 AHIKEHSLEVQLPFLQVYFKD--FTIVPMILGFLDYETIVKLGKFLSSFVDD-DTLILVS 173

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
           SD  H+ S+       KK   ++ +I  LD                +  ++Y+ T+CG  
Sbjct: 174 SDMSHYISQ---QEAQKKDSFLYDAISKLDS-----------KELYRRAIQYNITMCGFI 219

Query: 244 PISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           P +V +  L +   K+    + Y  S       +  V+Y S 
Sbjct: 220 PAAVTIESLKDKVKKVYP--VEYTNSGMVTKDFEKVVAYLSV 259


>gi|384431889|ref|YP_005641249.1| hypothetical protein [Thermus thermophilus SG0.5JP17-16]
 gi|333967357|gb|AEG34122.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thermus thermophilus SG0.5JP17-16]
          Length = 263

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/291 (30%), Positives = 120/291 (41%), Gaps = 36/291 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ +R    AG +Y      L EE+   L  A  P  P VRGV++PHAGY+Y+GR  A A
Sbjct: 1   MDLVRPPQVAGYFYPGEKAALKEEVKALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
           F  +        RVFLLGPSH    P  A      ++TP+G++ +DLE    L   G  F
Sbjct: 61  FRALSAWRGKARRVFLLGPSHSVAFPGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 120

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
                   E EHS+E+ LP+L          I+P+L G V+     EA+   L       
Sbjct: 121 RAYREPFLE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPGEVAEALLPEL------G 171

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P +    SSD  H+         D K     K   ALD  G+                  
Sbjct: 172 PKDLVVASSDLSHYHPDPVARRLDAK---TLKRALALDAEGVA----------------- 211

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
               CGR P S    +    +   K + L Y  S++    R+  V Y + A
Sbjct: 212 QGEACGRLPWSTLTAL--ARALGWKPRLLAYATSAEAGGGRERVVGYGALA 260


>gi|160902549|ref|YP_001568130.1| hypothetical protein Pmob_1088 [Petrotoga mobilis SJ95]
 gi|160360193|gb|ABX31807.1| protein of unknown function DUF52 [Petrotoga mobilis SJ95]
          Length = 274

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 78/260 (30%), Positives = 128/260 (49%), Gaps = 31/260 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV---RGVIAPHAGYSYSGRAA 57
           MEK+R    AG++Y  N ++L   L  +  ++    + ++    G I PHAGY YSG  A
Sbjct: 1   MEKVRNPVVAGTFYPSNPEKLKNLLKSFFGDSFKVNTEKLIPPMGAIVPHAGYIYSGETA 60

Query: 58  AYAFGNIDPTSIS-RVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           A A+  +    I+ RVFLLGP+H     K ++ T+  +KTP+G + +D       K  GK
Sbjct: 61  AKAYKKVFEKGIAKRVFLLGPNHTGLGSKISVFTSGSWKTPLGTINVD------GKTAGK 114

Query: 117 -FELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
             + +DI  DE+    EHS+E+ LP+L     G+  +IVPI +   + E     G + A 
Sbjct: 115 ILKELDIYNDESAHSREHSLEVQLPFLQYAI-GNDFEIVPICMMDQSLETSKNLGEILAD 173

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
            +++  +    SSD  H+ S    +  D+      K IE L  M +  +     D  ++Y
Sbjct: 174 IIEE-GDLIIASSDMNHYESHEKTLLKDE------KVIETLKNMNLQEMY----DTIRRY 222

Query: 232 LLEYDNTICGRHPISVFLHM 251
            +    ++CG  P++  L +
Sbjct: 223 NI----SMCGYGPVAALLSI 238


>gi|406911945|gb|EKD51642.1| hypothetical protein ACD_62C00211G0005 [uncultured bacterium]
          Length = 262

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 86/289 (29%), Positives = 128/289 (44%), Gaps = 41/289 (14%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           KIR+ + AG +Y    + L +++  +L +    A +PK+     +I PHAGY YSG+ AA
Sbjct: 4   KIRKPAVAGMFYPAEPQVLKQDIQTYLSQVQDCALVPKA-----IIVPHAGYIYSGQVAA 58

Query: 59  YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
            A+  +      I RV LLGPSH       AL+ A  Y T +GD+ LD    E L    K
Sbjct: 59  QAYAQVSTIKDKIKRVVLLGPSHRVGFRGAALTNARCYNTSLGDVFLDTTAHEVLAGLPK 118

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
             + D      EHS+E+ LP+L  V   H   +VP++VG    E     G +  +     
Sbjct: 119 VVVFDQA-HVPEHSLEVQLPFLQVVL--HDFVLVPVVVGDARPEE---VGEMLCRLWGGE 172

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                +S+D  H+                + S  A D     ++E    D+F     E+D
Sbjct: 173 ETLIVISTDLSHFED--------------YASACAHDARTCRLVEDMAYDSF-----EHD 213

Query: 237 NTICGRHPISVFLHMLGNCSTK-IKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +  CGR P+     ML   S K ++IK L  + S      +D  V Y +
Sbjct: 214 DA-CGRVPLG---GMLRAASQKGLRIKTLELKNSGDTAGTKDRVVGYGA 258


>gi|288869705|ref|ZP_05975843.2| conserved hypothetical protein [Methanobrevibacter smithii DSM
           2374]
 gi|288861209|gb|EFC93507.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
           2374]
          Length = 285

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 139/298 (46%), Gaps = 30/298 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPEVR---------GVIAPHAGYSYS 53
           +R+ + AG +Y D+S+ L + + D +L   G  K P++           V+ PHAG+ YS
Sbjct: 5   LRKPAVAGMFYPDDSEELVKTIEDCFLHSFGPGKIPDIESFEGNDYPVNVMVPHAGFQYS 64

Query: 54  GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+++  +       VF++ GP+H     + ++     + TP+G++ +D E  + L 
Sbjct: 65  GTIAAHSYCELAKNGFPEVFIIIGPNHTGLGSEVSVFNKGEWITPLGNIQIDEEFADTLI 124

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA--VNAENEAMYGRLFA 170
           +   F   D      EHS+E+ LP+L   +  +  KIVP+++G+  ++A N+     L A
Sbjct: 125 SFSDFASADFAAHMREHSIEVQLPFLQ--YFSNDFKIVPVVLGSQTISAANDLAAAILKA 182

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
               D S     SSD  H+ ++      +K  G + + IE +D+  +            +
Sbjct: 183 GEKLDKSYCVIASSDLSHFNTQ---ERANKVDGFVLEDIENMDEFKL-----------LE 228

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            +++Y+ T+CG  P+   + +   C      + L Y+ S        S V YAS   K
Sbjct: 229 EIIQYNITMCGYGPVMTTMILSKMCGKNTS-EILAYKTSGDISGDLSSVVGYASGIFK 285


>gi|195953308|ref|YP_002121598.1| hypothetical protein HY04AAS1_0933 [Hydrogenobaculum sp. Y04AAS1]
 gi|195932920|gb|ACG57620.1| protein of unknown function DUF52 [Hydrogenobaculum sp. Y04AAS1]
          Length = 262

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/280 (25%), Positives = 139/280 (49%), Gaps = 26/280 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y  +   L + ++  + E  + +  + +GVI PHAGY YSG  A   +  
Sbjct: 2   VREEAVAGLFYPGDKDILLQAIN-LVCENEVIQDVDAKGVIVPHAGYIYSGHTACSVYKR 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P     + ++GP+H    P  ++ ++  +KTP+G++P++ E+ +++ + G  E+ +  
Sbjct: 61  LKPKK--NIIMMGPNHTGQGPVISIDSSDAWKTPLGEVPINKELRDKIASFG-IEI-EPK 116

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
               EHS+E+ LP+L   F+     IVP+++G ++ E    +G+  + +VDD      VS
Sbjct: 117 AHIKEHSLEVQLPFLQVYFKD--FTIVPMILGFLDYETIVKFGKFLSSFVDD-DTLILVS 173

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
           SD  H+ S+       KK   ++ +I  LD   +            +  ++Y+ T+CG  
Sbjct: 174 SDMSHYISQ---EEAQKKDSFLYDTISRLDSKEL-----------YRRAIQYNITMCGFI 219

Query: 244 PISVFLHMLGNCSTKIKIKF-LRYEQSSQCKTKRDSSVSY 282
           P +V +  L +   K+K  + + Y  S       +  V+Y
Sbjct: 220 PAAVAIEALKD---KVKKAYAVEYTNSGMVTKDFEKVVAY 256


>gi|435851774|ref|YP_007313360.1| putative dioxygenase [Methanomethylovorans hollandica DSM 15978]
 gi|433662404|gb|AGB49830.1| putative dioxygenase [Methanomethylovorans hollandica DSM 15978]
          Length = 266

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 126/287 (43%), Gaps = 31/287 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+A+ AG +Y  N + L +ELD       +     ++GV+ PHAGY YSGR AA ++  
Sbjct: 1   MRQATVAGKFYPANPEALRKELDNCFSGTTIDPRSGIKGVVVPHAGYIYSGRVAARSYAV 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +       +F  GP+H  Y    ALST T + TP+G +  D E+ E+L  +       IC
Sbjct: 61  LPEADTYVIF--GPNHTGYGSPVALSTET-WSTPLGKVETDHELAEKLAGS------IIC 111

Query: 124 VDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN- 178
            DE     EHS+E+ +P+L   F+ H  +I+P+ +G  + E     G   A+ +      
Sbjct: 112 QDEVAHRFEHSIEVQIPFLQYRFK-HDFRILPVCIGMQDEEIALEVGMEIARAITAIGRK 170

Query: 179 -FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                SSDF          HY   H       E+ D+  +  I   + D     L   D 
Sbjct: 171 AVIIASSDFT---------HYQPAH-----VAESNDRYLIKAILDMNVDDLYNRLYARDI 216

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           + CG  PI+  +          K + L+Y  S +        V YAS
Sbjct: 217 SACGYGPIAATITAARKLGAS-KAELLKYATSGEVTGDDSGVVGYAS 262


>gi|148643498|ref|YP_001274011.1| dioxygenase [Methanobrevibacter smithii ATCC 35061]
 gi|206558195|sp|A5UN65.1|Y1438_METS3 RecName: Full=MEMO1 family protein Msm_1438
 gi|148552515|gb|ABQ87643.1| predicted dioxygenase [Methanobrevibacter smithii ATCC 35061]
          Length = 282

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 139/298 (46%), Gaps = 30/298 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPEVR---------GVIAPHAGYSYS 53
           +R+ + AG +Y D+S+ L + + D +L   G  K P++           V+ PHAG+ YS
Sbjct: 2   LRKPAVAGMFYPDDSEELVKTIEDCFLHSFGPGKIPDIESFEGNDYPVNVMVPHAGFQYS 61

Query: 54  GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+++  +       VF++ GP+H     + ++     + TP+G++ +D E  + L 
Sbjct: 62  GTIAAHSYCELAKNGFPEVFIIIGPNHTGLGSEVSVFNKGEWITPLGNIQVDEEFADTLI 121

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA--VNAENEAMYGRLFA 170
           +   F   D      EHS+E+ LP+L   +  +  KIVP+++G+  ++A N+     L A
Sbjct: 122 SFSDFASADFAAHMREHSIEVQLPFLQ--YFSNDFKIVPVVLGSQTISAANDLAAAILKA 179

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
               D S     SSD  H+ ++      +K  G + + IE +D+  +            +
Sbjct: 180 GEKLDKSYCVIASSDLSHFNTQ---ERANKVDGFVLEDIENMDEFKL-----------LE 225

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            +++Y+ T+CG  P+   + +   C      + L Y+ S        S V YAS   K
Sbjct: 226 EIIQYNITMCGYGPVMTTMILSKMCGKNTS-EILAYKTSGDISGDLSSVVGYASGIFK 282


>gi|325295386|ref|YP_004281900.1| hypothetical protein Dester_1205 [Desulfurobacterium
           thermolithotrophum DSM 11699]
 gi|325065834|gb|ADY73841.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfurobacterium thermolithotrophum DSM 11699]
          Length = 267

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 124/248 (50%), Gaps = 22/248 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y  NS+ L   L+ + R   +PK  + + +I PHAGY YSG+ A   +  
Sbjct: 2   VRYPAVAGQFYPGNSEELKLYLESFCRN-NIPKV-DAKAIIVPHAGYIYSGKVAGETYSR 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           ++  S++   ++GP+H       ++  + ++ TP+G++P++  +  +L     FE  D  
Sbjct: 60  VEIPSLN--IIMGPNHTGLGKSVSVYPSGIWITPLGEIPINEHITSKLLNNSPFE-ADTA 116

Query: 124 VDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
               EHS+E+ LP+L +   G+   + IVPI    ++  +    G + AK +++ +    
Sbjct: 117 AHIYEHSLEVQLPFL-QYCSGYREDLSIVPITYKYISYSDCIKAGEVLAKVLEEDNGLIV 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +S+DF H+ S+                 E  D + +D I + +P+   K +  Y+ ++CG
Sbjct: 176 ISTDFSHYISQ--------------SEAEKYDSLAIDAILSLNPEELYKRVFTYNISMCG 221

Query: 242 RHPISVFL 249
             P ++ L
Sbjct: 222 VIPATIGL 229


>gi|308050676|ref|YP_003914242.1| hypothetical protein Fbal_2966 [Ferrimonas balearica DSM 9799]
 gi|307632866|gb|ADN77168.1| protein of unknown function DUF52 [Ferrimonas balearica DSM 9799]
          Length = 257

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 96/188 (51%), Gaps = 9/188 (4%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           +IR A+ AG +Y   +  LA E+D  L +         RG+I PHAGY YSG  AA A+ 
Sbjct: 2   RIRPAAVAGYFYPGEANELAREVDRLLSQPKPALPAPARGLIVPHAGYVYSGAIAASAYL 61

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE-LMD 121
            +      RV L+GP+H       AL   T Y+TP+G + LD + I  L   G++E  +D
Sbjct: 62  TLPEQRFRRVILMGPAHTKAVRGIALPDWTHYQTPLGRVALDTDAINRLD--GQYESRID 119

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EH++E+ LP++     G L  ++P++VGA+ +E  A       +    P +   
Sbjct: 120 NLAHAREHALEVQLPFIQVRLSGCL--LLPMVVGAIRSETLAECLTALMQ----PDDLLI 173

Query: 182 VSSDFCHW 189
           VS+D  H+
Sbjct: 174 VSTDMSHF 181


>gi|222445004|ref|ZP_03607519.1| hypothetical protein METSMIALI_00620 [Methanobrevibacter smithii
           DSM 2375]
 gi|222434569|gb|EEE41734.1| Memo-like protein [Methanobrevibacter smithii DSM 2375]
          Length = 285

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 139/298 (46%), Gaps = 30/298 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPEVR---------GVIAPHAGYSYS 53
           +R+ + AG +Y D+S+ L + + D +L   G  K P++           V+ PHAG+ YS
Sbjct: 5   LRKPAVAGMFYPDDSEELVKTIEDCFLHSFGPGKIPDIESFEGNDYPVNVMVPHAGFQYS 64

Query: 54  GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+++  +       VF++ GP+H     + ++     + TP+G++ +D E  + L 
Sbjct: 65  GTIAAHSYCELARNGFPEVFIIIGPNHTGLGSEVSVFNKGEWITPLGNIQVDEEFADTLI 124

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA--VNAENEAMYGRLFA 170
           +   F   D      EHS+E+ LP+L   +  +  KIVP+++G+  ++A N+     L A
Sbjct: 125 SFSDFASADFAAHMREHSIEVQLPFLQ--YFSNDFKIVPVVLGSQTISAANDLAAAILKA 182

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
               D S     SSD  H+ ++      +K  G + + IE +D+  +            +
Sbjct: 183 GEKLDKSYCVIASSDLSHFNTQ---ERANKVDGFVLEDIENMDEFKL-----------LE 228

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            +++Y+ T+CG  P+   + +   C      + L Y+ S        S V YAS   K
Sbjct: 229 EIIQYNITMCGYGPVMTTMILSKMCGKNTS-EILAYKTSGDISGDLSSVVGYASGIFK 285


>gi|394987639|ref|ZP_10380478.1| hypothetical protein SCD_00035 [Sulfuricella denitrificans skB26]
 gi|393792858|dbj|GAB70117.1| hypothetical protein SCD_00035 [Sulfuricella denitrificans skB26]
          Length = 271

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 81/291 (27%), Positives = 133/291 (45%), Gaps = 34/291 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEV-RGVIAPHAGYSYSGRAA 57
           M++IR A+ AG++YT + + LA E+   L EA   + KS  + + VI PHAGY YSG  A
Sbjct: 1   MKQIRPAAVAGAFYTGDPRSLASEVAAMLEEAEQHMEKSATLPKAVIVPHAGYIYSGPVA 60

Query: 58  AYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           A A+  ++       RV LLGP H       AL +   + TP+G +PLDL+ + ++    
Sbjct: 61  ASAYALLEQGRNIYRRVVLLGPVHRVPVHGLALPSTEGFATPLGVVPLDLDAMAKIAGLP 120

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           +    D      EHS+E+ LP+L  V +    K+VP+ VG V  +  A    +  +    
Sbjct: 121 QVVEYD-AAHALEHSLEVQLPFLQAVLDD--FKLVPLAVGDVTPQEVA---EVLERLWGG 174

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P     +SSD  H      ++ Y +   +   + +A+    +D+          ++ + +
Sbjct: 175 PETLIVISSDLSH------FLTYREAQQIDSTTAQAI----LDL----------RWPVSH 214

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +   CG  P++  L  L      +K + L    S      R   V Y + A
Sbjct: 215 EQA-CGGTPVNGLL--LAAQHHHLKPRLLNLRNSGDTAGDRSRVVGYGAFA 262


>gi|352683068|ref|YP_004893592.1| putative dioxygenase [Thermoproteus tenax Kra 1]
 gi|350275867|emb|CCC82514.1| Predicted dioxygenase [Thermoproteus tenax Kra 1]
          Length = 281

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 135/296 (45%), Gaps = 32/296 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-------GVIAPHAGYSYS 53
           M K+RR + AG +Y    + L E +   +R    P S  ++       G + PHAGY YS
Sbjct: 1   MHKVRRPAVAGYFYEAEREALLERIKWAVRHEVGPGSETLKLEGTRALGAVVPHAGYIYS 60

Query: 54  GRAAAYAFGNIDP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+A+  +    +   V ++GP+H+      A+  + V++TP+G L +D E+ E L 
Sbjct: 61  GPVAAWAYAALRSFGAPDSVVIVGPNHYGVGSPVAIMRSGVWETPLGSLSVDEELAEGLI 120

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRL 168
              K    D      EHS+E+ LP+L   F    +KIVPI +     +V  E  +   R+
Sbjct: 121 REYKALEDDFYAFSREHSIEVQLPFLQYFFGD--VKIVPIALWRQTPSVAKELGSALARV 178

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
            +K+  +   +   SSDF H      Y H        H+     D++ +  I  GD D F
Sbjct: 179 ISKH--NKKVYVLASSDFNH------YEH--------HEITVKKDEVAIKEIVEGDVDRF 222

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            + L +YD + CG  P +V   M        K + L++  S       + +V YAS
Sbjct: 223 FEVLSKYDISACG--PGAVGALMRAASELGFKARLLKHATSGDTSGYVEETVGYAS 276


>gi|117925442|ref|YP_866059.1| hypothetical protein Mmc1_2152 [Magnetococcus marinus MC-1]
 gi|117609198|gb|ABK44653.1| protein of unknown function DUF52 [Magnetococcus marinus MC-1]
          Length = 481

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 71/224 (31%), Positives = 110/224 (49%), Gaps = 20/224 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SPEVRGVIAPHAGYSYSGRAAAY 59
           +IR A+ AG +Y   +  L + +   L++A  PK     E R  +APHAGY YSG  AAY
Sbjct: 31  RIRPAAVAGMFYPAQADALRQLVRSLLQQA--PKRHDQGEPRAFVAPHAGYRYSGLTAAY 88

Query: 60  AFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           A+  +      R   VFLLGPSH       +L     ++TP+G + +DL ++E + A   
Sbjct: 89  AYNTLQAAPKERPRRVFLLGPSHRVALHGASLGNYDAFETPLGLVEVDLPLVERMAAQES 148

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
             ++D      EHS+E+HLP+L +    H  ++VP++ G +     A    + AKY  + 
Sbjct: 149 DLVLDNAPHAQEHSLEVHLPFLQESL-AHF-RLVPMVFGRIEPSRVA---EILAKY-REA 202

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDII 220
            +    SSD  H      +  YD   G+ H   EA+  M +  +
Sbjct: 203 DDLIVGSSDLSH------FYDYDTAVGLDHTCNEAVLNMDLQAM 240


>gi|170725419|ref|YP_001759445.1| hypothetical protein Swoo_1057 [Shewanella woodyi ATCC 51908]
 gi|169810766|gb|ACA85350.1| protein of unknown function DUF52 [Shewanella woodyi ATCC 51908]
          Length = 268

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 67/205 (32%), Positives = 106/205 (51%), Gaps = 14/205 (6%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRG-----VIAPHAGYSYSGRAAA 58
           R+A+ AG +Y      L + ++ +L R A L   P+++      +I PHAGY YSG+ AA
Sbjct: 6   RQAAVAGRFYPAEPTALKQMINTYLGRCASLTNDPKLKSSPTKVIIVPHAGYIYSGQVAA 65

Query: 59  YAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
            AF  + P    I RV LLGP+H  +   CAL ++  ++TP+G   +D   + +L    K
Sbjct: 66  SAFALLAPRRKQIKRVVLLGPAHRVHLKGCALPSSESFETPLGQQVIDKVALRQLSEHSK 125

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
             + D+  +E EHS+E+ LP+L    +    +++PILVG ++    AM   L   +  D 
Sbjct: 126 ILISDLPHNE-EHSLEVELPFLQLCLDK--FELIPILVGDISP--HAMASILEQVWGGD- 179

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKK 201
                VSSD  H+ S       D++
Sbjct: 180 ETLIVVSSDLSHYHSYQEAAELDRE 204


>gi|375262828|ref|YP_005025058.1| hypothetical protein VEJY3_18321 [Vibrio sp. EJY3]
 gi|369843255|gb|AEX24083.1| hypothetical protein VEJY3_18321 [Vibrio sp. EJY3]
          Length = 267

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 65/190 (34%), Positives = 96/190 (50%), Gaps = 11/190 (5%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLRE-AGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           K R A+ AG +Y   +  L  +LDG+  E A L ++P  + +I PHAGY YSG  AA AF
Sbjct: 2   KYREAAVAGRFYPATANELITQLDGYFAEQALLTEAP--KALIVPHAGYFYSGEVAAKAF 59

Query: 62  GNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
             +  +  + SRV L GPSHH     CA+  + V+ TP G++ +D E +E+L +      
Sbjct: 60  RLLSNSKNTFSRVVLFGPSHHVALDGCAVPDSDVFLTPTGEVMIDREGVEDLLSQDLVAT 119

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
            D      EHS+E+ LP+L          ++PI+VG    +      R+       P+  
Sbjct: 120 SDQA-HHWEHSLEVQLPFLQYCLND--FTLLPIVVG---RDLHGYVKRILEAVTQTPNTL 173

Query: 180 FSVSSDFCHW 189
             VSSD  H+
Sbjct: 174 IVVSSDLSHY 183


>gi|251773044|gb|EES53600.1| protein of unknown function DUF52 [Leptospirillum
           ferrodiazotrophum]
          Length = 469

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 69/209 (33%), Positives = 105/209 (50%), Gaps = 40/209 (19%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG---------LPKSPEVRGVIAPHAGYS 51
           M+++R  + AGS+Y   +  L  +L+  L  A          LPK+     +I PHAGY 
Sbjct: 1   MDRVRYPAVAGSFYEGEASALRAKLESLLARARSEGSGGRKELPKA-----LIVPHAGYL 55

Query: 52  YSGRAAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           YSG  AA A+  ++     ISRV LLGP+H  Y    AL  A V++TP+GD+P+D E++ 
Sbjct: 56  YSGPVAARAYSLLEKAQGRISRVVLLGPAHRVYFHGLALPGADVFRTPLGDVPVDPELVH 115

Query: 110 ELKATGKFELMDIC----VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN---- 161
            +++     L  +C        EHS+E+ LP+L +V       ++P++VG  +A      
Sbjct: 116 RVES-----LPQVCERPEAHRQEHSLEVQLPFLQRVL--GRFGLLPLVVGEASAREVLEV 168

Query: 162 -EAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
            EA+ G         P     VSSD  H+
Sbjct: 169 LEAVRG--------GPETLLVVSSDLSHY 189


>gi|78358001|ref|YP_389450.1| mediator of ErbB2-driven cell motility-containing protein
           [Desulfovibrio alaskensis G20]
 gi|78220406|gb|ABB39755.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfovibrio alaskensis G20]
          Length = 298

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 126/302 (41%), Gaps = 37/302 (12%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           + IRR   AG +Y D+   L   L  +L EA  P        + PHAGY +SG  A    
Sbjct: 19  QHIRRPVVAGRFYPDSPAELQSMLRAYLDEAAAPPQKPTLLAMVPHAGYVFSGAVAGCTL 78

Query: 62  GN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
              + P ++   F+LGP+H       A+    V++TP+GD+P+D  +  E  A       
Sbjct: 79  AQAMLPQTL---FVLGPNHTGRGSGIAVWPEGVWRTPLGDVPVDNALAAEFCALCAPARP 135

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLF--AKYVD 174
           D     AEHS+E+ LP+L        ++IVP+ +G    AV     A   ++   A    
Sbjct: 136 DTLAHSAEHSLEVVLPFLQ--LRVPRVRIVPVSIGDPSLAVLTAAGAAMAQIIRRAAQTA 193

Query: 175 DPSN----FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
           DP         VSSD  H      ++  D+           LD M ++ +   +P     
Sbjct: 194 DPGGQNRIAMVVSSDMTH------FLPQDEAR--------RLDAMALEQVTALNPQGLYT 239

Query: 231 YLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            + E   ++CG  P++  L    +LG    ++     RY  S     K DS V YA A  
Sbjct: 240 TVREKRISMCGVLPMTAALEACRLLGATGARLT----RYATSGDITGKTDSVVGYAGALV 295

Query: 288 KV 289
            V
Sbjct: 296 DV 297


>gi|345863536|ref|ZP_08815746.1| putative dioxygenase [endosymbiont of Tevnia jerichonana (vent
           Tica)]
 gi|345125318|gb|EGW55188.1| putative dioxygenase [endosymbiont of Tevnia jerichonana (vent
           Tica)]
          Length = 241

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 76/267 (28%), Positives = 128/267 (47%), Gaps = 33/267 (12%)

Query: 25  LDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTS--ISRVFLLGPSHHYY 82
           +D +L +AG  + P  + ++APHAG+ YSG  AA  +  + P +  ISRV L+GPSH   
Sbjct: 2   VDDYLGQAGEGRPP--KAIVAPHAGFVYSGPVAASVYHRLTPVADRISRVVLVGPSHRVA 59

Query: 83  TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF 142
               ALS+A  ++TP+G++PL+ +V +EL    +  + D+     EHS+E+ LP+L +V 
Sbjct: 60  FRGLALSSAQSFQTPLGEIPLE-QVSDELLRLPQLSVNDMA-HAQEHSLEVQLPFLQRVL 117

Query: 143 EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH 202
                +++PI+VG  +    A    +  +  D       +SSD  H+    +Y+      
Sbjct: 118 GE--FRLLPIVVGDASG---AEVAEVLEQVWDGEETLIVISSDLSHY---HDYL------ 163

Query: 203 GVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIK 262
                + + +D+   D I    P+      L Y++  CGR P+   L +        ++ 
Sbjct: 164 -----TAQQMDRTTSDAILALQPER-----LHYEDA-CGRIPLQGLLEVAQRKGLDGELV 212

Query: 263 FLRYEQSSQCKTKRDSSVSYASAAAKV 289
            LR   S      +D  V Y + A  V
Sbjct: 213 DLR--NSGDTAGLKDRVVGYGAYAFAV 237


>gi|302665144|ref|XP_003024185.1| hypothetical protein TRV_01684 [Trichophyton verrucosum HKI 0517]
 gi|291188230|gb|EFE43574.1| hypothetical protein TRV_01684 [Trichophyton verrucosum HKI 0517]
          Length = 205

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 89/203 (43%), Gaps = 68/203 (33%)

Query: 153 LVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------- 198
           +VG+ +A NE ++G +   Y+ +  N F +SSDFCHWG+RF Y +Y              
Sbjct: 1   MVGSTSAANERVFGSILRPYLANEENAFVISSDFCHWGTRFAYTYYVQAPDPTTYKLPLT 60

Query: 199 --------------------------------DKKHG------VIHKSIEALDKMGMDII 220
                                           DK +        IH+S+ A D   M  I
Sbjct: 61  YPSLPVPSDMLTAEKAAEEVRAASSGRMLRENDKSYNKEKYMWAIHESVSACDMACMSAI 120

Query: 221 ETGDPDAFKKYLLEYDNTICGRHPISVFL-----------HMLGNCSTKIKIK-----FL 264
            +G+ D F++ L    NT+CGRHPI V +             +G   T+  +K     F+
Sbjct: 121 ASGNTDVFQEALRRTGNTVCGRHPIGVVMAGIEGLQHQAETTMGAAVTQDGVKKGKFYFI 180

Query: 265 RYEQSSQCKTKRDSSVSYASAAA 287
           RYE+SS C + +DSSVSY SA A
Sbjct: 181 RYERSSDCFSVQDSSVSYVSAFA 203


>gi|116749427|ref|YP_846114.1| AMMECR1 domain-containing protein [Syntrophobacter fumaroxidans
           MPOB]
 gi|116698491|gb|ABK17679.1| AMMECR1 domain protein [Syntrophobacter fumaroxidans MPOB]
          Length = 522

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/228 (28%), Positives = 109/228 (47%), Gaps = 17/228 (7%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYA 60
           E++R+ + AG+WY  +   L ++++G+L     PK   ++  +I+PHAG  YSG+ AAY 
Sbjct: 40  EQVRKPAIAGTWYPASPVELRKQIEGFLNRVPEPKPRGQLVALISPHAGTIYSGQVAAYG 99

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  ++    + V ++ PSH       A      ++TP+G +PLD ++IE L+   K    
Sbjct: 100 YKLLEKQKFASVIVISPSHRARFEGVATYELGGFQTPLGIVPLDRDLIEALRRRDKRIAH 159

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
              V   EH++E+ LP+L  V E    K+VP+++G  +           A  V +     
Sbjct: 160 RPEVHSEEHALEIQLPFLQTVLEE--FKLVPLIMGEQDFATCKRLAEAIADTVREKRVLV 217

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
             SSD  H      +  Y++         +ALDK+  D +   DP   
Sbjct: 218 IASSDLSH------FHPYERA--------KALDKVAADRVGALDPQGL 251


>gi|344941525|ref|ZP_08780813.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methylobacter tundripaludum SV96]
 gi|344262717|gb|EGW22988.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methylobacter tundripaludum SV96]
          Length = 259

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/264 (29%), Positives = 122/264 (46%), Gaps = 31/264 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           R+ + AGS+Y  N ++L   LD +L +A    K P  + +IAPHAGY YSG  AA A+  
Sbjct: 3   RQPAVAGSFYPANPEQLHLMLDQFLNDAATNEKVP--KAIIAPHAGYIYSGPIAASAYAR 60

Query: 64  IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           +      I+RV L+GPSH       A+S +  + TP+G +P+D   ++ L      E ++
Sbjct: 61  LKKAHDKITRVVLIGPSHRVAFRGLAVSRSETFTTPLGSVPVDQAAVQTLVQLPFVEYIE 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EHS+E+HLP+L ++ +    +IVPI+ G   A  E +   L A +  D      
Sbjct: 121 QA-HTYEHSLEVHLPFLQEMLDS--FEIVPIVAG--EASPEQVSQVLDALWGGD-ETLIV 174

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSD  H+         DK      ++IE L                 +Y      + CG
Sbjct: 175 ISSDLSHYHDYATAQQMDKS---TSQAIEQL-----------------QYERLAPESACG 214

Query: 242 RHPISVFLHMLGNCSTKIKIKFLR 265
           + P+S  L +    S  +    LR
Sbjct: 215 KVPVSGLLKLARKKSLSVTTIDLR 238


>gi|114778372|ref|ZP_01453219.1| hypothetical protein SPV1_08141 [Mariprofundus ferrooxydans PV-1]
 gi|114551335|gb|EAU53892.1| hypothetical protein SPV1_08141 [Mariprofundus ferrooxydans PV-1]
          Length = 256

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/291 (28%), Positives = 134/291 (46%), Gaps = 45/291 (15%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
            +R A+ AG +Y  +   L + +   L +  +  SP  R ++APHAGY YSG  AAYA+ 
Sbjct: 2   SVRPAAVAGLFYPADRAELCDMIARELAQVDV-VSPCPRALVAPHAGYIYSGPTAAYAYA 60

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            +  + +SRV + GP+H       A+++++ ++TP+GD+PL  +   EL        +D 
Sbjct: 61  LLSKSPVSRVVIAGPAHRVAFRGVAVASSSAFRTPLGDVPLAADG-RELAERFDCVAIDD 119

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRLFAKYVDDPS 177
                EHS+E+ LP+L  V      +++P+ VGAV A       E+++G        D  
Sbjct: 120 GAHAMEHSLEVQLPFLQTVLGS--FELLPLCVGAVEASELADVLESLWG--------DAQ 169

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA--LDKMGMDIIETGDPDAFKKYLLEY 235
             F +SSD  H+                H   EA  +D   +D I  G         +++
Sbjct: 170 MLFVISSDLSHY----------------HPYAEARRMDLTTIDRILEGQS-------IDH 206

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +   CG   I+  +++L      ++ + L Y  S      RD  V YAS A
Sbjct: 207 EQA-CGATGINA-MNILAK-QHGLQAQLLDYRNSGDTAGDRDRVVGYASLA 254


>gi|119872612|ref|YP_930619.1| hypothetical protein Pisl_1105 [Pyrobaculum islandicum DSM 4184]
 gi|166228883|sp|A1RTJ4.1|Y1105_PYRIL RecName: Full=MEMO1 family protein Pisl_1105
 gi|119674020|gb|ABL88276.1| protein of unknown function DUF52 [Pyrobaculum islandicum DSM 4184]
          Length = 281

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 137/295 (46%), Gaps = 33/295 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGR 55
           ++R+ + AG +Y  + ++L ++L+  ++    PK+P++        G +APHAGY YSG 
Sbjct: 2   RVRKPAVAGYFYESDREKLLQQLEWAIKHELGPKAPQIPKLGAETLGGVAPHAGYIYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+ +  +       VF++ GP+H+      A+  + V++TP+G + +D E+ E++   
Sbjct: 62  VAAWVYSALAGFGKPDVFIIIGPNHYGIGAPVAIMKSGVWETPLGRVEIDGELAEKIMRY 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            K    D      EHS+E+ +P++   F    +KIVPI +           G+  A  + 
Sbjct: 122 FKELEDDFHAFSREHSIEVQIPFIQYFFGD--VKIVPITIWRQTLSTSRELGKALANAIR 179

Query: 175 DPSN--FFSVSSDFCHWGSRFNYMHYD---KKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
           +     +   SSDF H      Y H+D   KK  +    I  LD+ G+  + +       
Sbjct: 180 EYGRRAYIIASSDFNH------YEHHDITTKKDEMAIGKILQLDETGLFEVAS------- 226

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
               ++D +ICG  PI   +        K  +  L++  S      R+ +V YAS
Sbjct: 227 ----KFDISICGIGPIGALIVAAKELGYK-NVTLLKHATSGDTSGYREETVGYAS 276


>gi|397779743|ref|YP_006544216.1| hypothetical protein BN140_0577 [Methanoculleus bourgensis MS2]
 gi|396938245|emb|CCJ35500.1| UPF0103 protein [Methanoculleus bourgensis MS2]
          Length = 262

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 84/293 (28%), Positives = 132/293 (45%), Gaps = 45/293 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           +R  S AG +Y    + L + L+ + ++    K+P  + RG++APHAGY YSG   A AF
Sbjct: 3   MRPCSVAGMFYPAEPRHLEQLLETFFQK----KAPGIDARGIVAPHAGYVYSGETGACAF 58

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             I P       ++GPSH  Y   CA  +A  ++TP+G + +D E ++ L         D
Sbjct: 59  STIPPDFDGTFVVIGPSHRGYM-TCA--SAVPWETPLGIVDVDTEFVDAL---------D 106

Query: 122 ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
           I +DEA    EHS+E+ +P +   F     +I P+++G  + E  A       + ++   
Sbjct: 107 IEIDEASHRNEHSIEVQVPIIKYRFP--RARIAPVMMGDQSYEAAASLAEHLLRAIEHTG 164

Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
               +  SSDF H+              V  ++    D   +D ++T D   F + L E 
Sbjct: 165 RNVRIVASSDFSHY--------------VPDETARRQDLYAIDALKTLDIPEFYRRLRET 210

Query: 236 DNTICGRHPISVFLHMLGNCST--KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
             T CG  PI+    M   C T    + + LRY  S       +  V YA+ A
Sbjct: 211 GATACGYGPIAA---MCITCRTLGATRGELLRYTTSGDVTGDFNQVVGYAAIA 260


>gi|384440528|ref|YP_005655252.1| hypothetical protein [Thermus sp. CCB_US3_UF1]
 gi|359291661|gb|AEV17178.1| hypothetical protein TCCBUS3UF1_21410 [Thermus sp. CCB_US3_UF1]
          Length = 263

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/290 (30%), Positives = 127/290 (43%), Gaps = 34/290 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ +R  + AG +Y    + L +E+   L EA  P  P +RG++ PHAGY YSGR AA  
Sbjct: 1   MDLVRPPAVAGYFYPQEGRALEQEVARLLGEAASPPGPRIRGLLLPHAGYRYSGRVAAKG 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
           F  + P      R  LLGPSH       A      + TP+G++ +DLE    L   G   
Sbjct: 61  FAALLPWRGRARRAILLGPSHFLAFRGVAFYPYRAWATPLGEVAVDLEGGRRLLRLGPPF 120

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
            E  +  ++  EHS+E+ LP+L  V   H   I+P+L G V+   EA  G L       P
Sbjct: 121 REYQEPFLE--EHSLEVLLPFLQAVLP-H-TPILPLLFGEVD-PGEAARGLL---ETLGP 172

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
            +    SSD  H+         D K   + +++ ALD  G+   E               
Sbjct: 173 EDLVLASSDLSHYHPDPMARTLDAK--TLERAL-ALDAQGVAGAEA-------------- 215

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              CGR P +  L +L   +   + + L Y  S++        V YA+ A
Sbjct: 216 ---CGRLPWAT-LTLLAR-AQGWRAQLLAYATSAEASGDAGRVVGYAAVA 260


>gi|90413323|ref|ZP_01221317.1| hypothetical protein P3TCK_17149 [Photobacterium profundum 3TCK]
 gi|90325724|gb|EAS42187.1| hypothetical protein P3TCK_17149 [Photobacterium profundum 3TCK]
          Length = 260

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 92/185 (49%), Gaps = 7/185 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R  + AG +Y  ++  L ++L+ W     +     +R +I PHAGY YSG  AA A+  +
Sbjct: 4   RPPAMAGRFYEKSASLLQQQLNDWCSPP-VTHRGVIRALIVPHAGYIYSGEVAAKAYCQL 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
              +I +V L+GPSH Y    CA+  +  + TP+G++ +D + I+ L      ++ D  V
Sbjct: 63  QAKNIKKVILIGPSHRYAFHGCAVPNSRYFSTPLGNVSIDTQSIDNLIKIDDIKVSDQ-V 121

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
              EH +E+ LP+L      H   ++P+L   +++ N A   R+             +SS
Sbjct: 122 HAQEHCLEVQLPFLQTCL--HHFTVLPLLTSDISSINVA---RIINDLWQQEDTLLVISS 176

Query: 185 DFCHW 189
           D  H+
Sbjct: 177 DLSHF 181


>gi|294139598|ref|YP_003555576.1| hypothetical protein SVI_0827 [Shewanella violacea DSS12]
 gi|293326067|dbj|BAJ00798.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 271

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/196 (34%), Positives = 100/196 (51%), Gaps = 13/196 (6%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-----RGVIAPHAGYSYSGR 55
           +  IR  + AG +Y  +   L+ EL  ++ +A   +  +V     + +I PHAGY YSG 
Sbjct: 2   LSSIRPPAVAGLFYPADPITLSTELTSYMDQAVQAQHLQVSNKAPKAIIVPHAGYIYSGL 61

Query: 56  AAAYAFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
            AAY F +I      I RV LLGP+H  Y   CAL  ++ + TP+GDLP+D     +L A
Sbjct: 62  VAAYGFFHIQRYKHKIKRVVLLGPAHTVYLQGCALPESSHFCTPLGDLPIDRAGAAKLAA 121

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
                + +I   E EHS+E+ LP+L    +    +++PILVG +    E M   L   + 
Sbjct: 122 DDAVTISNIPHKE-EHSLEVELPFLQHCLDE--FELLPILVGEIAP--EPMASLLNLVWG 176

Query: 174 DDPSNFFSVSSDFCHW 189
            D      +SSD  H+
Sbjct: 177 GD-ETLIVISSDLSHF 191


>gi|21227863|ref|NP_633785.1| hypothetical protein MM_1761 [Methanosarcina mazei Go1]
 gi|23822345|sp|Q8PW40.1|Y1761_METMA RecName: Full=MEMO1 family protein MM_1761
 gi|20906277|gb|AAM31457.1| conserved protein [Methanosarcina mazei Go1]
          Length = 267

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 80/288 (27%), Positives = 128/288 (44%), Gaps = 32/288 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAF 61
           ++R+ + AG +Y    + L  EL       GL  +  EV G + PHAGY YSG+ AA+ +
Sbjct: 2   EMRQPAVAGQFYPLRCENLENELKRCFE--GLEIREQEVLGAVCPHAGYMYSGKVAAHVY 59

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +       +F  GP+H  Y    ++S  T +KTP+G++ +DLE+ +     G     D
Sbjct: 60  ATLPEADTYVIF--GPNHTGYGSPVSVSRET-WKTPLGNIDVDLELADGF--LGSIVDAD 114

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
               + EHS+E+ LP+L   FE    KI+PI +G  + E     G L A  + +      
Sbjct: 115 ELGHKYEHSIEVQLPFLQYRFERDF-KILPICMGMQDEETAVEVGNLLADLISESGKRAV 173

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSDF H+ +               +  + +D   +D I   D       L   + ++
Sbjct: 174 IIASSDFTHYET--------------AERAKEIDSEVIDSILNFDISGMYDRLYRRNASV 219

Query: 240 CGRHPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
           CG  PI+  L      S K+   +   L+Y  S      +D+ V YA+
Sbjct: 220 CGYGPITAML----TASKKLGGSRATLLKYANSGDVSGDKDAVVGYAA 263


>gi|319955906|ref|YP_004167169.1| hypothetical protein Nitsa_0147 [Nitratifractor salsuginis DSM
           16511]
 gi|319418310|gb|ADV45420.1| protein of unknown function DUF52 [Nitratifractor salsuginis DSM
           16511]
          Length = 266

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 80/254 (31%), Positives = 119/254 (46%), Gaps = 28/254 (11%)

Query: 4   IRRASHAGSWYTDNS---KRLAEELDGWLREAGLPKS---PEVRGVIAPHAGYSYSGRAA 57
           IR A + G++Y  N    +++ E  +G L EA   KS    + R +I PHAGY YSG  A
Sbjct: 2   IRLAGNMGAFYPANCSEIEKMIEHWNGILDEALTDKSVLDEKPRAIIVPHAGYIYSGFTA 61

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
             A   +  +   RV ++GPSHH Y    + S    Y+TP G+LP+D   IEEL      
Sbjct: 62  NIAHRILANSRPKRVVVIGPSHHVYFEGVSASMQESYQTPCGNLPIDKAYIEELTKEYPL 121

Query: 118 ELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
           E     V EA   EHS E  +P++   +E    K+V ++ G ++         L    + 
Sbjct: 122 EF----VPEAHHKEHSTETQMPFIQH-YEPQA-KVVELIYGKIDYRQLV---PLIDTILS 172

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDK--KHGVIHKSIEALDK-------MGMD-IIETGD 224
           DP N   +SSD  H+ +       D     GV  +S+E L+K       +G+  I+E   
Sbjct: 173 DPENVVVISSDLSHFYTLEEAKQLDNICLAGVAEESVEILEKGCEACGIIGIKAIVEVAK 232

Query: 225 PDAFKKYLLEYDNT 238
              +K  LL+Y  +
Sbjct: 233 KHGWKTQLLDYRTS 246


>gi|384263323|ref|YP_005418511.1| hypothetical protein RSPPHO_02915 [Rhodospirillum photometricum DSM
           122]
 gi|378404425|emb|CCG09541.1| Putative uncharacterized protein [Rhodospirillum photometricum DSM
           122]
          Length = 272

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 86/289 (29%), Positives = 126/289 (43%), Gaps = 36/289 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  +R  + AG +Y D    LA  +D  L EA    +P  + +IAPHAGY YSG  AA A
Sbjct: 1   MLPLRPTAVAGIFYPDAPTALARAVDDLLAEAPGSDAPAPKALIAPHAGYVYSGPTAAAA 60

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA---TG 115
           +  + P   +I RV LLGPSH       AL+ A  Y +P+G + +D    + L+A    G
Sbjct: 61  YARLAPGRDTIERVVLLGPSHRVPFRGLALTEAQGYASPLGPVAIDHAWADRLRALPYAG 120

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
             E   +     EH++E+HLP+L ++       +VP++ G V+    A      A+    
Sbjct: 121 TLEDAHL----PEHALEVHLPFLHRMLT--RFTLVPVVCGQVSVAQMAAA---LAELWGG 171

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P     VSSD  H      Y+ YD+          ALD      IE  D +A        
Sbjct: 172 PETLIVVSSDLSH------YLPYDQA--------RALDDATAQTIEALDAEALGP----- 212

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
               CG  P++  L +      +++   LR   S      R+  V Y +
Sbjct: 213 -REACGHIPVAGLLALARAHGGRLERLDLR--SSGDTAGSRERVVGYGA 258


>gi|377575424|ref|ZP_09804418.1| hypothetical protein MOPEL_130_01900 [Mobilicoccus pelagius NBRC
           104925]
 gi|377536001|dbj|GAB49583.1| hypothetical protein MOPEL_130_01900 [Mobilicoccus pelagius NBRC
           104925]
          Length = 283

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 113/223 (50%), Gaps = 23/223 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPE-----VRGVIAPHAGYSYSGRA 56
           +R  + AG +Y D+   L   +DG L +A   +P +PE     VR ++APHAGY YSG  
Sbjct: 1   MRTPAVAGLFYDDDPDDLRASVDGLLDDARRRVPPAPEEDPATVRALLAPHAGYVYSGPT 60

Query: 57  AAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA  +  ++     ISRV LLGP H       A   A  + TP G++P++  +  EL+A+
Sbjct: 61  AAAGYLRVEACRDRISRVVLLGPVHRVPVRGLAHPEAAAFATPWGEVPVE-PLSPELRAS 119

Query: 115 GKFELMDICVDEA-EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
              +L+D  +  A EHS+E+H+P+L +V  G    ++P+ VG V ++  A          
Sbjct: 120 FP-QLLDSRLAHAQEHSLEVHVPFLQRVL-GEF-TLLPLAVGGVGSKAVADVLDAVTSGA 176

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---IHKSIEALD 213
           DD +    VSSD  H      Y+ YD    V       IEALD
Sbjct: 177 DDEATMVVVSSDLSH------YLPYDDARRVDAGTLAQIEALD 213


>gi|262198838|ref|YP_003270047.1| hypothetical protein [Haliangium ochraceum DSM 14365]
 gi|262082185|gb|ACY18154.1| protein of unknown function DUF52 [Haliangium ochraceum DSM 14365]
          Length = 275

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 124/264 (46%), Gaps = 25/264 (9%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYA 60
           E IR  + AG +Y  ++  L+ +++ +L+ AG  P  P V  V+ PHAG  YSG+ A   
Sbjct: 6   ETIREPAVAGRFYPHDAGELSTQVESFLQPAGFAPPQPAV-AVMVPHAGLVYSGKVAGQV 64

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           F  ++     RV +L P+H    P+ ++  +  ++ P G++ +D  +  EL A       
Sbjct: 65  FAEVEIP--ERVIILAPNHTGRGPQVSVMASGQWRMPGGEVAVDERLAAELLAAYPRAQA 122

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN-- 178
           D      EH++E+ LP+L     G  ++IVP+++G ++       G+  A  +    +  
Sbjct: 123 DTEAHAGEHAIEVELPFLVARRPG--VRIVPVVLGMLSELEAVHLGQALATAIAAVGDEV 180

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSD  H+ S               +    +D++ ++ +  G+P+   + + E+  +
Sbjct: 181 LVVASSDMSHFLS--------------EQQARRIDRVALEPLLEGEPEGLYRTVQEHRIS 226

Query: 239 ICGRHPISVFL---HMLGNCSTKI 259
           +CG  P +V L     LG  + ++
Sbjct: 227 MCGFIPATVMLAYARQLGRGTPRL 250


>gi|157960725|ref|YP_001500759.1| hypothetical protein Spea_0897 [Shewanella pealeana ATCC 700345]
 gi|157845725|gb|ABV86224.1| protein of unknown function DUF52 [Shewanella pealeana ATCC 700345]
          Length = 278

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 134/303 (44%), Gaps = 52/303 (17%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKS-----------PEVRGVIAPHAGY 50
           +R A+ AG +Y  ++  L   LDG+L +A   L +            P  + +I PHAGY
Sbjct: 5   VRPAAVAGLFYPASADELKAALDGYLTQASSILARKTDCQNSQDESYPSPKVLIVPHAGY 64

Query: 51  SYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
            YSG+ AAYA+  I P   +I +V L+GP+H  Y    AL  +  ++TP+G +P+  + +
Sbjct: 65  LYSGQVAAYAYALIQPLADTIKKVLLIGPAHRVYLQGGALPLSRYFETPLGQIPIAPDSV 124

Query: 109 EELKATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM 164
           E L          IC+ E     EHS+E+ LP+L    +    +++P+L+G    +  A+
Sbjct: 125 EILGCQ-----QCICISELAHQQEHSLEVQLPFLQHFLKE--FELLPLLIGESEPKEMAL 177

Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
              L  +          VS+D  H      +  YD+           LD++    I  G 
Sbjct: 178 ---LLEQVWGGNETLIVVSTDLSH------FHQYDEAL--------RLDRVTCQKILAGQ 220

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
                +         CG   ++  L +L     ++++  L Y+ S      ++  V YAS
Sbjct: 221 GTILSE-------QACGCSSLNALLPLLKR--HQLRLTQLSYQNSGDTAGDKNRVVGYAS 271

Query: 285 AAA 287
            A+
Sbjct: 272 FAS 274


>gi|384171487|ref|YP_005552864.1| hypothetical protein [Arcobacter sp. L]
 gi|345471097|dbj|BAK72547.1| conserved hypothetical protein [Arcobacter sp. L]
          Length = 263

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/190 (28%), Positives = 103/190 (54%), Gaps = 11/190 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRGVIAPHAGYSYSGRAA--AYA 60
           IR+A  +GS+Y    + + + ++ +   +  +    +++ +I PHAGY YSG  A  AY 
Sbjct: 3   IRKAVVSGSFYPQKKEEILKYINHFNSVKTNVESFEDIKAIIVPHAGYIYSGFTANLAYK 62

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             +     I RV +LGPSH  Y    +++    Y+TP+G+L +D E  ++L    K++ +
Sbjct: 63  LVSFSKKDIKRVVVLGPSHRVYLKGASVALYDEYETPLGNLKVDKEFSQKL--IDKYDFL 120

Query: 121 DICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           D  ++ E EHS E   P++   FE   +++V I+ G ++ ++     ++  + + D SNF
Sbjct: 121 DFNIECEFEHSTETQAPFIKHYFEN--VQLVEIVYGQIDYKD---LSKVIDEVLKDKSNF 175

Query: 180 FSVSSDFCHW 189
             +S+D  H+
Sbjct: 176 VVISTDLSHF 185


>gi|404496814|ref|YP_006720920.1| MEMO-like protein [Geobacter metallireducens GS-15]
 gi|418065236|ref|ZP_12702610.1| protein of unknown function DUF52 [Geobacter metallireducens RCH3]
 gi|78194419|gb|ABB32186.1| MEMO-like protein [Geobacter metallireducens GS-15]
 gi|373562414|gb|EHP88625.1| protein of unknown function DUF52 [Geobacter metallireducens RCH3]
          Length = 267

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 73/284 (25%), Positives = 126/284 (44%), Gaps = 23/284 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+ + AG +YTD  KRL  +L   +    + +   V+G++APHAGY YSG  A   +G I
Sbjct: 3   RQPAVAGQFYTDEPKRLRGDLQRMIPAETMKQ--RVKGIVAPHAGYVYSGHVAGAVYGAI 60

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           +  S   V +LGP+HH +    AL     + TP+G +P+D+ + + ++        ++  
Sbjct: 61  EIPST--VVILGPNHHGFGAAAALYPEGEWLTPLGSVPIDVRLSQLVQKHVPLVEPEMSA 118

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFSV 182
              EHS+E+ +P+L        + IVPI +G  + E   + G   A+ + +         
Sbjct: 119 HRFEHSLEVQVPFLQYCRAD--VSIVPICLGFGDYERCRLLGEGIARAIREFGEDVLIVA 176

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSD  H+ S                +  A D+  +  +   +P+   +     + T+CG 
Sbjct: 177 SSDMTHYES--------------ADAARAKDEQAIAEVLALNPEGLVQICRRKNITMCGV 222

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            P +V L +        K + + Y  S          V+YA+ A
Sbjct: 223 IPATVML-VTAKALGASKAELISYATSGDVTGDNRQVVAYAAMA 265


>gi|74317604|ref|YP_315344.1| hypothetical protein Tbd_1586 [Thiobacillus denitrificans ATCC
           25259]
 gi|74057099|gb|AAZ97539.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
           25259]
          Length = 464

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/194 (36%), Positives = 99/194 (51%), Gaps = 13/194 (6%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLRE--AGLPK-SPEVRGVIAPHAGYSYSGRAA 57
           M  +R A+ AG +Y D+   L + LD  L E  AG  + +P  + VI PHAGY YSG  A
Sbjct: 1   MPTVRPAAVAGLFYPDDPVVLTQTLDALLAEGEAGAERLTP--KAVIVPHAGYIYSGPIA 58

Query: 58  AYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           A A+  +      I RV LLGP+H  +    AL  A  + TP+G++ +D E IE L A  
Sbjct: 59  AAAYARLGDLKGRIRRVVLLGPAHRVFVRGLALPEAERFVTPLGEVTIDREAIELLAALP 118

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           +    D    + EHS+E+ LP+L +V       +VP+ VG   AE  A    L A +  D
Sbjct: 119 QVTRSD-AAHQMEHSLEVQLPFLQRVLGD--FTLVPLAVGQARAEEVAEV--LDALWGGD 173

Query: 176 PSNFFSVSSDFCHW 189
                 +SSD  H+
Sbjct: 174 -ETLIVISSDLSHF 186


>gi|13542189|ref|NP_111877.1| hypothetical protein TVN1358 [Thermoplasma volcanium GSS1]
 gi|23822367|sp|Q978N2.1|Y1383_THEVO RecName: Full=MEMO1 family protein TV1383
 gi|14325622|dbj|BAB60525.1| hypothetical protein [Thermoplasma volcanium GSS1]
          Length = 269

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 74/289 (25%), Positives = 135/289 (46%), Gaps = 30/289 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGW-LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR+ + AG +Y  N   L   +  + ++++ +   P   GV+ PHAG  YSGR A Y++ 
Sbjct: 2   IRKPAVAGYFYPSNRNELLSLISSFHVQQSEVSCQP--IGVVVPHAGIVYSGRTAMYSYN 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            +  +SI    ++GP+H   TP  ++  +  ++TP+G+  ++ E+  EL    ++ + D 
Sbjct: 60  ALRNSSIRDFIIIGPNHRPMTPYASIFPSGSWETPLGNAIINEELASELYKNSQYIVKDE 119

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLFAKYVDDPSN 178
                EHS+E+ +P+L  +F G+    VP+++G     V  +  +   RL   Y+     
Sbjct: 120 ESHSVEHSIEVQIPFLQYMF-GNSFTFVPVILGDQEKVVANDIASALMRLSKPYI----- 173

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSDF          HY++   V  K ++ + ++ +D+    D D F   +   + T
Sbjct: 174 -LIASSDFT---------HYERSDIVERKDMDLISRI-VDL----DIDGFYDTIERENVT 218

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            CG   I++ + +      KI +  L +  S       D  V Y+S  A
Sbjct: 219 ACGYGAIAILMIIAKKIGAKISL--LNHSNSGDVTNDYDEVVGYSSIVA 265


>gi|71907600|ref|YP_285187.1| hypothetical protein Daro_1971 [Dechloromonas aromatica RCB]
 gi|71847221|gb|AAZ46717.1| Protein of unknown function DUF52 [Dechloromonas aromatica RCB]
          Length = 267

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 95/191 (49%), Gaps = 8/191 (4%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   R  + AG++Y  + + L+  +D +L +A      + + +I PHAGY YSG  AA A
Sbjct: 1   MINTRPPAVAGTFYPGDPETLSTTVDRFLADASPQSGQQPKALIVPHAGYIYSGSTAANA 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  + P   +I RV LLGP+H       AL     + TP+G + LD + I  +    +  
Sbjct: 61  YATLKPWAKTIRRVILLGPTHRVAVEGIALPEVEAFSTPLGTIRLDAQAIASIAGLPQIV 120

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             +  V   EHS+E+HLP+L ++ E     +VP+ VG  +A  +A+   +       P  
Sbjct: 121 FSN-HVHAFEHSLEVHLPFLQRILEQ--FTLVPLAVG--DAAPDAV-AEILDLLWGGPET 174

Query: 179 FFSVSSDFCHW 189
              VSSD  H+
Sbjct: 175 LIVVSSDLSHF 185


>gi|305663860|ref|YP_003860148.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
 gi|304378429|gb|ADM28268.1| protein of unknown function DUF52 [Ignisphaera aggregans DSM 17230]
          Length = 299

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 122/255 (47%), Gaps = 21/255 (8%)

Query: 40  VRGVIAPHAGYSYSGRAAAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYK-- 95
           + G + PHAGY YSG  AA+ +  +     +I  V ++G +H  +  K  ++T T YK  
Sbjct: 51  IIGGVVPHAGYGYSGFCAAWLYKELGEHIDNIDVVVIMGTNHTGFGGK--ITTTTYYKRW 108

Query: 96  -TPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV 154
            TP+G++ +D+  I  LK        D      EHS+E+ LP+L  ++ G+   +VPI+V
Sbjct: 109 STPLGEIDVDINFINRLKNYYPNLDDDALAHTREHSVEVQLPFLQYLY-GNKFSLVPIVV 167

Query: 155 GAVNAENEAMYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEAL 212
             +  E    + +   K +++    F V  SSDF H G  + Y  + +    I  ++  L
Sbjct: 168 RDITEEEATTFSKALNKAINEDDRRFVVLASSDFTHHGDLYGYTIFYEN---IAANVRKL 224

Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI---KIKFLRYEQS 269
           D M ++ I   D   F   + +Y+ T+CG   I++ +       +KI   +   L+Y  S
Sbjct: 225 DLMFIEAIVNLDTRRFLSLIDKYNATVCGYGAIAILMEY-----SKINGARASLLKYYHS 279

Query: 270 SQCKTKRDSSVSYAS 284
           +      D  V YAS
Sbjct: 280 ADITGDEDIVVGYAS 294


>gi|88801036|ref|ZP_01116585.1| hypothetical protein MED297_05099 [Reinekea blandensis MED297]
 gi|88776239|gb|EAR07465.1| hypothetical protein MED297_05099 [Reinekea sp. MED297]
          Length = 261

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 56/162 (34%), Positives = 88/162 (54%), Gaps = 10/162 (6%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGL---PKSPEVRGVIAPHAGYSYSGRAAAYA 60
           IR  + AG+ Y  +++RL  +++ WL  A +     +P+V  +IAPH+G+ YSG +AA A
Sbjct: 3   IRTPAVAGTLYEADTQRLLVQMENWLESAPVKTTQSTPKV--LIAPHSGFHYSGESAARA 60

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  ++     I RV LLGP+H        L    V+ TP+G +PLD   +  L+      
Sbjct: 61  YQTLNAVYDRIRRVILLGPAHRTTVDHLVLPEDDVFATPLGQVPLDKTAVNWLRRQPGV- 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
           + D  +   EHS+EM LP+L    E     +VPI+VG V+ +
Sbjct: 120 ITDNTLHAPEHSLEMQLPFLQTALEDFF--LVPIIVGQVDPD 159


>gi|116753790|ref|YP_842908.1| hypothetical protein Mthe_0477 [Methanosaeta thermophila PT]
 gi|116665241|gb|ABK14268.1| protein of unknown function DUF52 [Methanosaeta thermophila PT]
          Length = 269

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 81/284 (28%), Positives = 130/284 (45%), Gaps = 22/284 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
           +R  + AG +Y   ++ L  EL    R  GL + P  V G + PHAGY YSG  AA  + 
Sbjct: 1   MRPPAAAGMFYPARAEALLAELSSAFR--GLKREPMPVIGAVVPHAGYIYSGYVAAEVYA 58

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            + P   + V L+GP+H       A+S  T ++TP+G +P+DLE+ + L+  G     D 
Sbjct: 59  RL-PERETYV-LIGPNHTGLGLPVAVSRET-WRTPLGSVPVDLELADALE--GTIVASDE 113

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
                EHS+E+ LP+L + F G   +I+PI +G  + E     G   A  +       +V
Sbjct: 114 MAHMEEHSIEVQLPFLQRCFSG--FRILPICMGMQDQETAIEVGEAVANAILKLGRSCTV 171

Query: 183 --SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSD  H      Y+ Y+    +    +E++ +M +D + +       +Y   Y + +C
Sbjct: 172 IASSDLTH------YLRYETARQLDTSILESVIRMDIDDLYSKIEMLDSRY---YGHGVC 222

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           G  P++  +         +  K LRY  S      R   V Y +
Sbjct: 223 GYGPMAATIAASRRLGASMG-KLLRYATSGDVSGDRGQVVGYGA 265


>gi|336476945|ref|YP_004616086.1| hypothetical protein [Methanosalsum zhilinae DSM 4017]
 gi|335930326|gb|AEH60867.1| protein of unknown function DUF52 [Methanosalsum zhilinae DSM 4017]
          Length = 265

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 80/283 (28%), Positives = 128/283 (45%), Gaps = 24/283 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R    AG +Y  N K L +E+    +   +  S ++ G +APHAGYS+SG  AA+ +  
Sbjct: 1   MREPVVAGQFYPSNPKELKKEIRKCFKGIEV-SSHDLIGAVAPHAGYSFSGHVAAHVYAR 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I   S     ++GP+H  Y    + S A  + TP+G++  D ++I+ L   G    +D  
Sbjct: 60  I--PSADTYIIIGPNHTGYGSPISAS-ADTWSTPLGEIRTDGDLIKAL--AGSIIDVDEL 114

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV- 182
               EHS+E+ LP+L    + H  +I+PI +G  + E  +  G    + V D      + 
Sbjct: 115 AHRFEHSIEVQLPFLQYTLD-HDFEILPICMGLQDEETASEVGHEIVRAVKDAGKKVVII 173

Query: 183 -SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSDF H+ S    + YD    +    IEA++KM        D     + ++E + + CG
Sbjct: 174 ASSDFTHYQSA--DIAYDTDQYI----IEAIEKM--------DVSEIYRRIMEKNASACG 219

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             PI+  +          K K L Y  S        S V Y +
Sbjct: 220 FGPIAAMI-TAAKLLDATKAKLLTYSTSGDVIGDMSSVVGYGA 261


>gi|381167018|ref|ZP_09876230.1| Predicted dioxygenase [Phaeospirillum molischianum DSM 120]
 gi|380683833|emb|CCG41042.1| Predicted dioxygenase [Phaeospirillum molischianum DSM 120]
          Length = 457

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 76/288 (26%), Positives = 121/288 (42%), Gaps = 29/288 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR  + AG +Y  +   +  ++   L  A   ++   + +I PHAG+ YSG  AA A
Sbjct: 1   MTAIRLPAFAGQFYPSDPIAIRHQIATMLATAPHARATTPKAIIVPHAGWVYSGPIAAAA 60

Query: 61  FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  + P    + RV LLGPSH       AL  A  + +P+G + LD +    L       
Sbjct: 61  YAQLVPARGRVRRVVLLGPSHRIALRGLALCDAEAWASPLGQVLLDRDSALRLTTLPDHS 120

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           + D    + EH++E+ LP+L  V  G   +++PILVG  +AE   +          D   
Sbjct: 121 ISDPAHAQ-EHALEVQLPFLQAVL-GDDFRLLPILVGEASAE---IVAATLDTVWGDSET 175

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              VS+D  H+    +Y    ++      +IE  D   + +                 ++
Sbjct: 176 LIVVSTDLSHY---LDYESCRRQDSGTAAAIERFDGTELAL-----------------DS 215

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CGR+P++  L  L      +KI  L    S      RD  V Y + A
Sbjct: 216 ACGRNPLAGLL--LAAQRRGMKIDRLDLRNSGDTAGSRDRVVGYGAWA 261


>gi|218961087|ref|YP_001740862.1| hypothetical protein CLOAM0773 [Candidatus Cloacamonas
           acidaminovorans]
 gi|167729744|emb|CAO80656.1| conserved hypothetical protein [Candidatus Cloacamonas
           acidaminovorans str. Evry]
          Length = 272

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 80/285 (28%), Positives = 124/285 (43%), Gaps = 21/285 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           IR   HAG++Y    +++  +++GW+  A  P S E   GVI PHAGY YSG  A     
Sbjct: 2   IRNPMHAGTFYPRFEQQIKRQIEGWISNAVTPLSSERALGVILPHAGYMYSGECATLGLH 61

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +I   +I    +L PSH       ++S    Y  P+G L LD+E+  ++      + + +
Sbjct: 62  SISHENIDCFIILHPSHQANYFDFSVSPYQEYVNPLGTLDLDVELYNKIAPEAD-QNIPL 120

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAK-YVDDPSNFF 180
            + + EHSME+ LP L   F     KI+PI+ G  + A  + +   L+   Y        
Sbjct: 121 ILHQEEHSMEIQLPILNYFFPK--AKILPIMFGNQIPAVAKRLADILYETIYSSTKRIVI 178

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSD  H+         D   G++ K + AL           DP+   + ++  D   C
Sbjct: 179 LCSSDLSHYHRARTAEEMD---GILVKDVTAL-----------DPEHLWQDIIHSDCEAC 224

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           G   I   L+      T  K+K ++Y  S +        V Y SA
Sbjct: 225 GIGGILCLLYY-AQHYTNAKMKVIQYTHSGKISGNDSQVVGYLSA 268


>gi|222056095|ref|YP_002538457.1| hypothetical protein Geob_3013 [Geobacter daltonii FRC-32]
 gi|221565384|gb|ACM21356.1| protein of unknown function DUF52 [Geobacter daltonii FRC-32]
          Length = 267

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 23/283 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR+ + AG +YTD+  +L  +L   +  A  P      GVI+PHAGY YSG  A   FG 
Sbjct: 2   IRQPAVAGQFYTDDPDKLRTQLKAMIAPA--PFKERALGVISPHAGYMYSGAVAGKLFGE 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +     S V +LGP+HH    + A+     ++TP+G  P++  +   +K        D  
Sbjct: 60  V--AVPSTVVILGPNHHGIGARAAIFPEGSWQTPLGTAPVNSSLATLIKKHSSIVEADQK 117

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
               EHS+E+ LP+L   F    + IVP+ +G  + ++  + G   A  + D  +     
Sbjct: 118 AHIFEHSLEVQLPFLQ--FLRPDVSIVPVCLGFRDFKSCLLLGSGIASAIRDYGDEVLIV 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+ +  +    D++       + ALD  G+ ++             + D T+CG
Sbjct: 176 ASSDMTHYETAASAKQKDEQA---INMVLALDAEGLLLV-----------CAQKDITMCG 221

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             P +V L +        K + ++Y  S          V+YA+
Sbjct: 222 VVPATVML-VAAKGLGATKARLVQYATSGDVTGDNRQVVAYAA 263


>gi|171184544|ref|YP_001793463.1| hypothetical protein Tneu_0059 [Pyrobaculum neutrophilum V24Sta]
 gi|170933756|gb|ACB39017.1| protein of unknown function DUF52 [Pyrobaculum neutrophilum V24Sta]
          Length = 282

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 130/291 (44%), Gaps = 28/291 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-------EVRGVIAPHAGYSYSGRA 56
           +R+ + AG +Y  + +RL ++++  ++    PK+P       E  G +APHAGY YSG  
Sbjct: 5   VRKPAVAGYFYEADRERLIQQIEWSIKHELGPKAPQLPKLGAEALGGVAPHAGYMYSGPV 64

Query: 57  AAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           AA+ +  +        F ++GP+H+      A+  +  ++TP+G + +D E+ EE+  +G
Sbjct: 65  AAWIYAALSGFGKPDAFVIVGPNHYGIGAPVAIMKSGAWETPLGRVEIDAELAEEIARSG 124

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
             E  D      EHS+E+ +P++   F    +KIVPI +           G+  A  +  
Sbjct: 125 AAE-EDPHAFSREHSIEVQIPFIQYFFGD--VKIVPIALWRQTPSASRELGKAIAAALKK 181

Query: 176 PSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
                +   SSDF H+          + H V  K     D+M +  I   D     +   
Sbjct: 182 RGGRIYVIASSDFNHY----------EPHDVTVKK----DEMAIGKILKLDEAGLFEVAS 227

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           ++D +ICG  PI V +              L++  S      RD +V YAS
Sbjct: 228 KFDISICGLGPIGVLITAAKELGYN-NATLLKHATSGDTSGYRDETVGYAS 277


>gi|452077450|gb|AGF93409.1| protein containing UPF0103/Mediator of ErbB2-driven cell motility
           (Memo), related domain protein [uncultured organism]
          Length = 283

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 102/210 (48%), Gaps = 10/210 (4%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVR-------GVIAPHAGYSY 52
           M +IR  + AG +Y      L  +L+   + E G  K PEV+       G+I+PHAGY Y
Sbjct: 1   MPEIRNPAVAGRFYRGTKTGLTNQLEECFKHECGPGKIPEVQEGPRRMIGLISPHAGYPY 60

Query: 53  SGRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           SG  AA+ F  + +      V ++GP+H    P  ++ ++ +++TP+G + L+ E+ E +
Sbjct: 61  SGPVAAHGFSKLAEDGKPDSVVIIGPNHSGVGPDVSVDSSDLWETPLGQIDLNKEIRERI 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
               +   +D      EHS+E+ LP+L  +F+     IVPI + A NAE     G+    
Sbjct: 121 LEESEVAELDSSSHLNEHSLEVQLPFLQYLFDEDF-DIVPICMKAQNAETSEDIGKAIQA 179

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKK 201
             +        S+D  H+  +      D+K
Sbjct: 180 AANGEDILIIASTDLTHYEPQQMAETKDRK 209


>gi|397689697|ref|YP_006526951.1| dioxygenase [Melioribacter roseus P3M]
 gi|395811189|gb|AFN73938.1| Putative dioxygenase [Melioribacter roseus P3M]
          Length = 271

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 105/195 (53%), Gaps = 8/195 (4%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAY 59
           M+K+R+ + AG +Y    + LA+E++  L  AG  ++PE + G++ PHAGY YSG+ AA 
Sbjct: 1   MKKVRQPAVAGMFYPSLPEELADEINEMLDNAGTEEAPENLYGIVVPHAGYIYSGQTAAK 60

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+  +       V ++ PSH  Y P  ++     Y TP+G++ +D E+ + +    K   
Sbjct: 61  AYKLLKDKKYKTVVVISPSHREYFPGISVFDGDAYLTPLGEIEIDSEMRDRIAEGSKIIF 120

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD--DPS 177
             +    +EH++E+ LP+L +V  G   K++P+++G    +++     L  +  D  D +
Sbjct: 121 KGVNGHRSEHALEVQLPFL-QVALGDF-KLLPVVMG---DQSKLFVDELSERLSDVVDEN 175

Query: 178 NFFSVSSDFCHWGSR 192
                SSD  H+ S+
Sbjct: 176 TLIVSSSDLSHYYSK 190


>gi|206901742|ref|YP_002251218.1| hypothetical protein DICTH_1399 [Dictyoglomus thermophilum H-6-12]
 gi|206740845|gb|ACI19903.1| conserved protein [Dictyoglomus thermophilum H-6-12]
          Length = 269

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 129/283 (45%), Gaps = 23/283 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K R  + AG +Y   S+ L   L  +++E  +P    + G I PHAGY YSG+ A   + 
Sbjct: 2   KKRLPAVAGYFYPATSRELLNMLSQFIKE--VPNKVPLWGCIVPHAGYIYSGKVAGAVYS 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            ++   ++   ++GP+H+      A+ +  ++ TP+GD+ +D  + +++     +   D 
Sbjct: 60  LMEVPDVA--IIIGPNHNGLGYPSAIYSEGIWTTPLGDVEIDSVLSQKILQESDYLKEDA 117

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
              + EHS+E+ +P+L  +     IKIVPI V   + E     G   ++ V D   +   
Sbjct: 118 LAHQPEHSLEVQVPFLQYINPN--IKIVPITVVDYSYEVLKNLGYAISQAVRDYEYNTVI 175

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSDF H+                H+  +  D   ++ I   D   F K ++E D +IC
Sbjct: 176 IASSDFSHYEP--------------HEIAKQKDLYAIESILNLDEKEFIKRVIEKDISIC 221

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           G  PI + +  +    +K + K + Y+ S       D  V YA
Sbjct: 222 GVGPIVITMIAVKLLGSK-RAKLVEYQTSGNVTRDFDQVVGYA 263


>gi|153004135|ref|YP_001378460.1| hypothetical protein Anae109_1268 [Anaeromyxobacter sp. Fw109-5]
 gi|152027708|gb|ABS25476.1| protein of unknown function DUF52 [Anaeromyxobacter sp. Fw109-5]
          Length = 265

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 76/284 (26%), Positives = 123/284 (43%), Gaps = 23/284 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  + AG +Y  +   LA E+D +L   G  +    RGV+ PHAGY YSG  A   +  
Sbjct: 2   IREPAVAGRFYPGDPTVLAGEVDRFLAAPG--ERLRARGVVVPHAGYVYSGAVAGAVYAR 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTAT-VYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           ++    +RV +LGP+H       AL  A+  ++TP+G +P+D E+   L A+      D 
Sbjct: 60  VEIP--ARVLVLGPNHTGIGAAAALWPASGTWRTPLGQVPVDAELTAALSAS-PLVTADT 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
                EHS+E+ +P+L +   G  + I  + +  +   +    GR  A+           
Sbjct: 117 RAHRLEHSLEVQVPFLERARPG--VSIAALCLTHLPYADCVEIGRTVAQAASAAGALVVA 174

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSD  H+         D      H++IE +  +        DP    + +   D ++CG 
Sbjct: 175 SSDMSHYLPAPVARQKD------HRAIERILAL--------DPRGLYEVVHAEDISMCGI 220

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            P +V L +        + + +RY  S +     D  V YA  A
Sbjct: 221 VPATVML-VAARLMGATRAELVRYAHSGEVSGDEDQVVGYAGIA 263


>gi|307352919|ref|YP_003893970.1| hypothetical protein Mpet_0758 [Methanoplanus petrolearius DSM
           11571]
 gi|307156152|gb|ADN35532.1| protein of unknown function DUF52 [Methanoplanus petrolearius DSM
           11571]
          Length = 266

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 130/286 (45%), Gaps = 27/286 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RR + AG +Y      L E+L  +     LP   +  G+++PHAG  YSG+AAAY+F  
Sbjct: 3   VRRVTLAGRFYPGEPASLKEQLAYFFSHVALPHPVDACGIVSPHAGTIYSGQAAAYSFSA 62

Query: 64  IDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           I P+  +  F+ + PSH  Y P C  ++   ++TP+G +  D  ++  L         D 
Sbjct: 63  I-PSDFNGTFIVVAPSHAGY-PDC--TSGLSWETPLGIIECDEALVSALDINRD----DF 114

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
            + + E+S+E+ +P++   F     K+VP+L+G    +          + V + S+   +
Sbjct: 115 AMAQPENSLEVQMPFIKFRFPDA--KVVPVLMGRQTLQGAEAMAEAIVRTVRETSSDVRI 172

Query: 183 --SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSDF H      Y+  ++   +   +IEAL  +        D + F + + E   + C
Sbjct: 173 VASSDFSH------YIPDEEARRLDGYAIEALFDL--------DSEEFFRRIREEGVSAC 218

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           G  PI+V +          K + L+Y  S          V YA+ A
Sbjct: 219 GYGPIAVMVEACRELFGAKKSELLKYMTSGDVSGDFSQVVGYAAVA 264


>gi|73668946|ref|YP_304961.1| hypothetical protein Mbar_A1422 [Methanosarcina barkeri str.
           Fusaro]
 gi|121722214|sp|Q46CL1.1|Y1422_METBF RecName: Full=MEMO1 family protein Mbar_A1422
 gi|72396108|gb|AAZ70381.1| conserved protein [Methanosarcina barkeri str. Fusaro]
          Length = 265

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 86/287 (29%), Positives = 136/287 (47%), Gaps = 32/287 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           +R+ + AG +Y      L +EL       GL  +   + G + PHAGY YSGR AA+ + 
Sbjct: 1   MRQPAVAGQFYPLRPDNLEKELKQCFE--GLEIRERNILGAVCPHAGYVYSGRVAAHVYA 58

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            + P + + V L GP+H  Y    ++ST T +KTP+G + +D E+ E L  TG    +D 
Sbjct: 59  -VLPKADTYV-LFGPNHTGYGSPVSVSTDT-WKTPLGIIEVDRELAEGL--TGSIVDVDE 113

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FF 180
                EHS+E+ LP+L   F+    +I+PI +G  + E     G L A  V        F
Sbjct: 114 IGHRYEHSIEVQLPFLQYRFDQDF-RILPICLGMQDEETVIEVGTLIANLVSKSGKKVAF 172

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSDF H      Y   +      ++ IEA+  + +D+     P  +++ L   + ++C
Sbjct: 173 IASSDFTH------YQPANLARETDNEIIEAI--LNLDV-----PGIYER-LYRRNASVC 218

Query: 241 GRHPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
           G  PIS  L      S K+   + + L Y  S +     ++ V YA+
Sbjct: 219 GYGPISAML----TASKKLGATRAELLNYSNSGEVSGDMNAVVGYAA 261


>gi|381153737|ref|ZP_09865606.1| putative dioxygenase [Methylomicrobium album BG8]
 gi|380885709|gb|EIC31586.1| putative dioxygenase [Methylomicrobium album BG8]
          Length = 260

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 84/281 (29%), Positives = 129/281 (45%), Gaps = 43/281 (15%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           R+ + AG++Y    ++L   LD +L +A   PK P  + +I PHAGY YSG  AA A+  
Sbjct: 3   RQPAVAGTFYPAEPQQLHLLLDQYLNDADTAPKVP--KAIIVPHAGYIYSGPIAATAYAR 60

Query: 64  IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           +      ISRV L+GPSH       A+++A  + TP+G + +D   +    A  +   +D
Sbjct: 61  LLKAHDQISRVVLIGPSHRVGFHGLAITSAQNFVTPLGSIEVDQRAVH---AIAQLPFVD 117

Query: 122 ICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA-MYGRLFAKYVDDPS 177
             +++A   EHS+E+HLP+L ++ +    KIVPI+ G   AE  A + G L+        
Sbjct: 118 -YLEQAHVMEHSLEVHLPFLQEMLDD--FKIVPIVTGDAPAEQVAQVLGMLWGG----EE 170

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEAL--DKMGMDIIETGDPDAFKKYLLEY 235
               VSSD  H+         DK       +IE L  + +G+D                 
Sbjct: 171 TLIVVSSDLSHYHDYATCQELDK---ATSAAIEKLQYESLGID----------------- 210

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
             + CGR  I   L +  + S  +K   LR    +    KR
Sbjct: 211 --SACGRVAIGGLLKLARDKSLAVKNIDLRNSGDTAGDKKR 249


>gi|126179754|ref|YP_001047719.1| hypothetical protein Memar_1811 [Methanoculleus marisnigri JR1]
 gi|125862548|gb|ABN57737.1| protein of unknown function DUF52 [Methanoculleus marisnigri JR1]
          Length = 262

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/293 (27%), Positives = 129/293 (44%), Gaps = 45/293 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAYAF 61
           +R  S AG +Y    + L + L+ + R+ G    P +  RG+++PHAGY YSG   A AF
Sbjct: 3   MRPCSVAGMFYPAEPRHLEQLLETFFRKRG----PGITSRGIVSPHAGYVYSGETGACAF 58

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             I P       ++GPSH  Y      ++A  ++TP+G + +D E I+          MD
Sbjct: 59  STIPPDFDGTFLVIGPSHRGYMTS---ASAVPWETPLGIVDVDTEFIDA---------MD 106

Query: 122 ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
           I +DEA    EHS+E+ +P++   F     +  P+L+G  + E          + V+   
Sbjct: 107 IEIDEASHQSEHSIEVQMPFIKYRFP--RARAAPVLMGEQSYEAAESLAEHLLQAVEHTK 164

Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
               +  SSDF H+              V  +     D   ++ ++T D   F + L E 
Sbjct: 165 RDVRIVASSDFSHY--------------VPDEVARKQDLYAIEALKTLDLPEFYRRLQET 210

Query: 236 DNTICGRHPISVFLHMLGNCST--KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
             T+CG  PI+    M   C +    + + LRY  S       +  V YA+ A
Sbjct: 211 RATVCGYGPIAT---MCIACRSLGATRGELLRYTTSGDVTEDYNQVVGYAAIA 260


>gi|56479181|ref|YP_160770.1| hypothetical protein ebA6549 [Aromatoleum aromaticum EbN1]
 gi|56315224|emb|CAI09869.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1]
          Length = 275

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/294 (28%), Positives = 129/294 (43%), Gaps = 46/294 (15%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP----KSPEVRGVIAPHAGYSYSGRAAA 58
            IR A+ AG +Y D+   L  ++ G +  A +P     +P  + +I PHAGY YSG  AA
Sbjct: 5   SIRPAAVAGYFYPDDPYALRAQI-GEMLAAAVPLETVSAP--KALIVPHAGYVYSGPVAA 61

Query: 59  YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
            A+ N+      + RV LLGP+H       AL  A  + TP+G +P+     E L+A   
Sbjct: 62  SAYANLAGLRDVVRRVVLLGPAHRMAVRGFALPAAQFFSTPLGQVPMSRGGWETLQAR-- 119

Query: 117 FELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
               D+ VD+    AEHS+E+ LP+L    +    ++VP+LVG  NA  EA+   +    
Sbjct: 120 ---PDVVVDDRPHAAEHSLEVQLPFLQMALD--TFELVPLLVG--NASAEAV-AEILETL 171

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
              P     +SSD  H+ S              +   ++ D+  +  I T       +  
Sbjct: 172 WGGPETLIVISSDLSHYHS--------------YGQAQSCDRATVGQILT------LRTG 211

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +E+D   CG  P++  L         ++   L    S      R   V YAS A
Sbjct: 212 IEHDQA-CGATPVNGLLR--AAAQHGLEPHLLDLRNSGDTAGDRSRVVGYASIA 262


>gi|390960737|ref|YP_006424571.1| hypothetical protein CL1_0570 [Thermococcus sp. CL1]
 gi|390519045|gb|AFL94777.1| hypothetical protein CL1_0570 [Thermococcus sp. CL1]
          Length = 293

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/294 (24%), Positives = 137/294 (46%), Gaps = 8/294 (2%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAY 59
           M ++R  + AGS+Y  + + L E L+ +  + G   S   +   +APHAGY +SG  A+ 
Sbjct: 1   MTEVRYPAVAGSFYPAD-ESLIEMLEEFFNDLGEEGSERRITAGVAPHAGYIFSGYTASR 59

Query: 60  AFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
            +  I    +   F +LGP+H       A+  +  + TP+G++ +D E+ + +       
Sbjct: 60  TYKAIFEDGLPETFVILGPNHTGLGSPIAVYPSGEWLTPLGEIEVDAEMAKAIARLSGIA 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
            +D      EHS+E+ +P++  + E  G  ++IVPI +G  + E     GR   +   + 
Sbjct: 120 DLDELAHRYEHSIEVQVPFIQYLAELAGKKVRIVPITLGIQDEEVSRALGRAIFEASQEL 179

Query: 177 SNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
                V  S+DF H+G+ + Y+ +  +   +   I+  D   +  I   D D   + L E
Sbjct: 180 GRDVVVIASTDFMHYGAMYGYVPFRVRADELPHRIKEWDFRLVRRILDFDVDGLFRELRE 239

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            ++T+CG   +   + +    +  ++ + L Y  S +     D+ V YAS   +
Sbjct: 240 MNHTMCGPGAVGTAI-VYSRSAGALEAELLHYTTSYEVSRSTDAVVGYASIVMR 292


>gi|337283757|ref|YP_004623231.1| hypothetical protein PYCH_02600 [Pyrococcus yayanosii CH1]
 gi|334899691|gb|AEH23959.1| hypothetical protein PYCH_02600 [Pyrococcus yayanosii CH1]
          Length = 292

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/294 (25%), Positives = 138/294 (46%), Gaps = 14/294 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
           IR  + AG +Y    + L + L  + R+ G P    ++   IAPHAGY +SG  A+  + 
Sbjct: 2   IRYPAVAGQFYP-LGEELVDMLKAFFRDLGEPGGERKITAGIAPHAGYVFSGYTASRTYK 60

Query: 63  NIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I    +   F++ GP+H       A+     + TP+G++ +D E+  E+        +D
Sbjct: 61  AIYEDGLPETFVIFGPNHTGLGSPIAVYPEGKWVTPLGEVEVDGELAREIVKNSGIADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAE-NEAMYGRLFAKYVDDPSN 178
               + EHS+E+ LP++  + E  G  ++IVPI +   + E +E + G +F    +   +
Sbjct: 121 EIAHKYEHSIEVQLPFIQYIAERAGKTVRIVPITLSIQDEEVSEDLGGAVFEAARELGRD 180

Query: 179 FFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
              + S+DF H+G+ + Y+ +  +   +   ++  D   +  I   D +   + + E D+
Sbjct: 181 VVVIASTDFMHYGTFYGYVPFRARANELPNLVKEWDLRVIRRILDFDVEGMFREIREMDH 240

Query: 238 TICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           T+CG   +    VF  ++G     ++ + L Y  S       D+ V YAS   +
Sbjct: 241 TMCGPGGVGASVVFSRLIG----AVEAELLHYTTSFDISRSTDAIVGYASIVMR 290


>gi|217970617|ref|YP_002355851.1| hypothetical protein Tmz1t_2215 [Thauera sp. MZ1T]
 gi|217507944|gb|ACK54955.1| protein of unknown function DUF52 [Thauera sp. MZ1T]
          Length = 274

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 83/293 (28%), Positives = 125/293 (42%), Gaps = 40/293 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           +  IR A+ AG +Y  + + L  +L   L  A     +P  + +I PHAGY YSG  AA 
Sbjct: 3   LASIRPAAVAGQFYPRDERVLRTQLAEMLSTAVAFESAPAPKAIIVPHAGYMYSGPVAAS 62

Query: 60  AFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+  + P    I RV +LGP+H       +L  A  + TP+G++P+     E L+A    
Sbjct: 63  AYALLAPLRERIRRVVMLGPTHRVAVRGFSLPAAQRFATPLGEVPVARGDWEALQARD-- 120

Query: 118 ELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
              D+ VD+     EH +E+ LP+L    E    +IVP+LVG  + E  A    L  +  
Sbjct: 121 ---DVLVDDRPHALEHCLEVQLPFLQVCLER--FEIVPVLVGDASPEATA---SLIERLW 172

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
             P     +SSD  H      Y+ Y        +  +  D+  +D +    P       L
Sbjct: 173 GGPETLILISSDLSH------YLSY--------REAQWSDRAAVDQVLELRPG------L 212

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +++   CG  PI+  L  L      ++   L    S      R   V Y S A
Sbjct: 213 DHEQA-CGATPINGLL--LAARKHDLQAHLLDLRNSGDTAGDRSRVVGYTSIA 262


>gi|54308444|ref|YP_129464.1| hypothetical protein PBPRA1251 [Photobacterium profundum SS9]
 gi|46912873|emb|CAG19662.1| conserved hypothetical protein [Photobacterium profundum SS9]
          Length = 260

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 94/190 (49%), Gaps = 15/190 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y +++  L ++LD W       +   +R +I PHAGY YSG  AA A+  
Sbjct: 3   MRLPAVAGRFYEESAPLLQKQLDDWCSPPTTHRD-LIRALIVPHAGYIYSGEVAAKAYCQ 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +   +I +V L+GPSH Y    CA+  +  + TP+G + +D++ I+ L      ++ DI 
Sbjct: 62  LQAETIKKVILIGPSHRYAFHGCAVPNSDYFSTPLGSVSIDVQSIDNL-----IKIDDIK 116

Query: 124 VDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           V E     EH +E+ LP+L      H   ++P+L   V+    A   ++           
Sbjct: 117 VSEQVHAQEHCLEVQLPFLQTCL--HQFTLLPLLTSNVSFIKVA---KIIDALWQQDDTL 171

Query: 180 FSVSSDFCHW 189
             +SSD  H+
Sbjct: 172 LVISSDLSHF 181


>gi|399157091|ref|ZP_10757158.1| hypothetical protein SclubSA_09246 [SAR324 cluster bacterium SCGC
           AAA001-C10]
          Length = 279

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/279 (26%), Positives = 125/279 (44%), Gaps = 52/279 (18%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLR----EAGLPKS-----PEVRGVIAPHAGYSYS 53
            IR  + A  +Y   +  L +E+  +L     E  +PKS      E+R +I PHAGY YS
Sbjct: 9   NIRPPAVASMFYPGKAAELRKEVQNYLSNAETEEDVPKSKKGETAELRTLIVPHAGYIYS 68

Query: 54  GRAAAYAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           G+ AA A+  +  +     RV LLGP+H  +    A      ++TP+G++ LD E+IE  
Sbjct: 69  GKIAACAYQLLQENQNQFKRVLLLGPAHRVWLQGAAFPETDAFETPLGEITLDKELIE-- 126

Query: 112 KATGKFELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
           K   +F  + +  DEA   EH +E+ LP+L +       +++P++VG    E+      +
Sbjct: 127 KMLEEFSWISVS-DEAHSEEHCLEVQLPFLQETLGD--FELLPLVVGETKTEH---ITEI 180

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
             K+ +D      +S+D  H      +  Y     +  ++  A++ +  + I T D    
Sbjct: 181 IQKFSEDTETLIVISTDLSH------FHDYQTAREIDARTANAIELLEQNRISTEDA--- 231

Query: 229 KKYLLEYDNTICGRHPISVFL----------HMLGNCST 257
                      CG +P+   L          H LG C++
Sbjct: 232 -----------CGAYPLRGALLAASQNHWNVHRLGLCNS 259


>gi|288817556|ref|YP_003431903.1| putative dioxygenase [Hydrogenobacter thermophilus TK-6]
 gi|384128321|ref|YP_005510934.1| hypothetical protein [Hydrogenobacter thermophilus TK-6]
 gi|288786955|dbj|BAI68702.1| putative dioxygenase [Hydrogenobacter thermophilus TK-6]
 gi|308751158|gb|ADO44641.1| protein of unknown function DUF52 [Hydrogenobacter thermophilus
           TK-6]
          Length = 264

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 67/218 (30%), Positives = 113/218 (51%), Gaps = 19/218 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           KI+R + AG +Y  + K+L + +   L+ + L P  P   G++APHAGY+YSG  A   +
Sbjct: 2   KIKRPNVAGIFYPSDPKKLRDTVLALLKNSKLFPLKP--VGLVAPHAGYTYSGAVAGAVY 59

Query: 62  GNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
             ++   +S+   V ++ PSH+++       +   ++TP+G + +D + IE    T K  
Sbjct: 60  KQLENLDLSKDWMVVIIAPSHYFFFEGITFGSYQAFETPLGQVEVDRKAIERFIDTRKSL 119

Query: 117 -FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
                DI  D+ EHS+E+ LP+L  + +    ++VP+L    +A+ E +   L   + + 
Sbjct: 120 RVSFSDIPYDK-EHSLEVQLPFLQVLLKS--FRLVPVLYS--DAKPEEIKEVL--NFFEG 172

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD 213
            +  F VSSD  H+ S  N   Y  K    H  IE+LD
Sbjct: 173 ENTLFVVSSDLSHYHSE-NTARY--KDSFCHAGIESLD 207


>gi|156937782|ref|YP_001435578.1| hypothetical protein Igni_0992 [Ignicoccus hospitalis KIN4/I]
 gi|166227550|sp|A8AB69.1|Y992_IGNH4 RecName: Full=MEMO1 family protein Igni_0992
 gi|156566766|gb|ABU82171.1| protein of unknown function DUF52 [Ignicoccus hospitalis KIN4/I]
          Length = 285

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 138/292 (47%), Gaps = 26/292 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPEVR-----GVIAPHAGYSYS 53
           +R  + AG++Y  +++ L   ++ W     L    +P+   VR     G + PHAGY YS
Sbjct: 3   VRPPAVAGTFYPADAEELIRLIE-WSFTHPLGPGEVPEVSPVRRKASVGYMVPHAGYIYS 61

Query: 54  GRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+++ ++    +   V ++GP+H    P  ++   ++++TP+G +  D E I EL 
Sbjct: 62  GPVAAWSYYHLAQEGAPETVVIIGPNHTGLGPAVSVMPPSIWETPLGGVKTDDEAISELL 121

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
                   D      EHS+E+ LP+L  +F G   +IVPI++            RL  + 
Sbjct: 122 KVSNVVEEDYSAHAYEHSLEVQLPFLQYLF-GDSFRIVPIVMKV----QTPSVARLLMQS 176

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           + +     ++  D+    S  +  HY+     + K + AL+K     I   DP+  ++ L
Sbjct: 177 IKE--AMENLGRDYVVLSSS-DLNHYEPHDITVEKDMLALEK-----IVNLDPEGLQEVL 228

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           ++YD ++CG  P+ V +++      +  I  L++  S      + + V YA+
Sbjct: 229 VKYDISMCGPGPVMVNMYLDKEYGAERAI-LLKHATSGDTSGDKSAVVGYAA 279


>gi|89093257|ref|ZP_01166207.1| hypothetical protein MED92_04232 [Neptuniibacter caesariensis]
 gi|89082553|gb|EAR61775.1| hypothetical protein MED92_04232 [Neptuniibacter caesariensis]
          Length = 260

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/198 (33%), Positives = 96/198 (48%), Gaps = 7/198 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R A+ AG +Y D+ + L  EL G +      +      ++ PHAGY YSG  AA A+  I
Sbjct: 4   RTAAVAGMFYPDDKECLLTELSGLISSQSEREGTPPSLLVVPHAGYQYSGTVAAQAYKQI 63

Query: 65  -DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
            D +   RV LLGPSH       ALS A  + +P+G+L LD E+I EL +       +  
Sbjct: 64  TDWSYYERVLLLGPSHRVPLRGMALSDADKFSSPLGELNLDTELIAELNSQ-DLAAYNSA 122

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
             E EHS+E+ LP+L   F    + I+P++VG    +  A   R+  +          VS
Sbjct: 123 AHELEHSLEVQLPFLQ--FLNCDLPIIPVVVGVAPRDEVASLIRIVEQSY---RILVIVS 177

Query: 184 SDFCHWGSRFNYMHYDKK 201
           +D  H+ S       D+K
Sbjct: 178 TDLSHFHSFEEACDIDRK 195


>gi|406904129|gb|EKD46005.1| hypothetical protein ACD_69C00050G0003 [uncultured bacterium]
          Length = 260

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 85/285 (29%), Positives = 127/285 (44%), Gaps = 46/285 (16%)

Query: 10  AGSWYTDNSKRLAEE----LDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID 65
           AG +Y DN   L ++    LD    +A LP     + +IAPHAGY YSG+ AA A+  + 
Sbjct: 8   AGKFYPDNPAVLRQDILNMLDNAADKANLPLP---KAIIAPHAGYIYSGQVAASAYVCLA 64

Query: 66  P--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
                I RV L  P+H Y     A + A  Y TP+G + +D E I  L     F  +++ 
Sbjct: 65  KAKNQIKRVVLFAPAHQYPVDGIATTNAKSYLTPLGQIAIDQETISSL----SFPYLNV- 119

Query: 124 VDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPSNF 179
           ++EA   EH++E+HLP+L       L  +VP+LVG A  +E E +   L+      P   
Sbjct: 120 IEEAFSSEHAVEVHLPFLQLTLGSFL--LVPLLVGSAAVSEVENILEELWG----GPETL 173

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
             +SSD  H+                +K  ++ DK   D I   +PD      L Y++  
Sbjct: 174 VVISSDLSHYHP--------------YKLAKSWDKDTADAIIALNPDG-----LSYEHA- 213

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           CG   I   L +      K+ +  L    S      +D+ V Y +
Sbjct: 214 CGATGIKALLGV--AAKKKMHVLQLDLRNSGDTAGSKDNVVGYGA 256


>gi|341582002|ref|YP_004762494.1| hypothetical protein GQS_04580 [Thermococcus sp. 4557]
 gi|340809660|gb|AEK72817.1| hypothetical protein GQS_04580 [Thermococcus sp. 4557]
          Length = 293

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 135/292 (46%), Gaps = 12/292 (4%)

Query: 1   MEKIRRASHAGSWYTDNSK--RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           M ++R  + AGS+Y        + EE  G L E G  +   +   +APHAGY +SG  A+
Sbjct: 1   MAEMRYPAVAGSFYPSGEALIEMLEEFLGDLGEEGSER--RITAGVAPHAGYVFSGYTAS 58

Query: 59  YAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
             +  I    +   F +LGP+H       A+  +  ++TP+GD+ +D  + + +      
Sbjct: 59  RTYKAIFEDGLPETFVILGPNHTGLGSPIAIYPSGKWRTPLGDVDVDSGMTKAIAKLSGI 118

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
             +D    + EHS+E+ LP++  + E  G  ++IVPI +G  + E     GR   +   +
Sbjct: 119 ADLDELAHKYEHSIEVQLPFIQYLSELAGKEVRIVPIALGIQDEEVSEDLGRAVFEASQE 178

Query: 176 PSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
                 V  S+DF H+G  + Y+ +  +   +   I+  D   +  I   D     + L 
Sbjct: 179 LGRDVVVIASTDFMHYGPMYGYVPFRARASELPHRIKEWDFRVIRRILDFDVKGMFRELR 238

Query: 234 EYDNTICGRHPISVFLHML-GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           E ++T+CG  P +V   M+    +  ++ + L Y  S +     D+ V YAS
Sbjct: 239 EMNHTMCG--PGAVGTAMVYSRSAGALEAELLHYTTSFEVSRSTDAIVGYAS 288


>gi|374628598|ref|ZP_09700983.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methanoplanus limicola DSM 2279]
 gi|373906711|gb|EHQ34815.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methanoplanus limicola DSM 2279]
          Length = 268

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 82/294 (27%), Positives = 127/294 (43%), Gaps = 39/294 (13%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLR---EAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           ++RR++ AGS+Y  +S  L ++L  +     +  L   PE  GV++PHAG  +SG+ +AY
Sbjct: 2   ELRRSTLAGSFYPADSSVLRKQLKSFFENIPDKSLSALPEPFGVVSPHAGIIFSGQTSAY 61

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+  I         ++GPSH  Y P C   +   ++TP+G L  D   I  L        
Sbjct: 62  AYSAIPKEFKGTFVVIGPSHAGY-PDCV--STDNWETPLGTLKNDNGFIRNL-------- 110

Query: 120 MDICVD-----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            DI VD     + E+S+E+ + ++   F     +IVP+LVG+    N         K ++
Sbjct: 111 -DIPVDNYAHRQRENSLEVQMQFIKYRFPD--CRIVPVLVGSQTRRNSLSLAEKLFKAIE 167

Query: 175 DPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
              N   +  SSDF H+              +  K    LD   +D +   D   F + +
Sbjct: 168 LSENPVKIVASSDFSHY--------------ISDKDARMLDNYAIDALLHLDTAEFYRRI 213

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            +   + CG  PI V   +      K  +K L Y  S      R   V YAS A
Sbjct: 214 NDEGISACGYGPIMVMAEVSKRLGAKKAVK-LHYSTSGDINGDRSQVVGYASLA 266


>gi|219564289|dbj|BAH03713.1| hypoyhetical protein [Saccharomyces pastorianus]
          Length = 139

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/138 (38%), Positives = 85/138 (61%), Gaps = 12/138 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
           IR A+HAGSWY++ ++ L+++L  +L ++ L K P    R +I PHAGY Y G   AY++
Sbjct: 3   IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGSTMAYSY 61

Query: 62  GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
            ++D   ++ R+F+LGPSHH Y+  +  +S  +  +TP+G+L +D +     + +E    
Sbjct: 62  ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121

Query: 115 GK--FELMDICVDEAEHS 130
           GK  F+ MD   D AEHS
Sbjct: 122 GKKLFKPMDHDTDMAEHS 139


>gi|219852949|ref|YP_002467381.1| hypothetical protein Mpal_2377 [Methanosphaerula palustris E1-9c]
 gi|219547208|gb|ACL17658.1| protein of unknown function DUF52 [Methanosphaerula palustris
           E1-9c]
          Length = 262

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 83/291 (28%), Positives = 133/291 (45%), Gaps = 39/291 (13%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           KIR  + AG +Y      L + L  +    G   S    G++APHAGY YSG  AA A+ 
Sbjct: 2   KIRTCAVAGMFYPGEPSHLEQLLGKFFAVTGHETS--ASGIVAPHAGYIYSGAVAALAYA 59

Query: 63  NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            + P+S    F ++GPSH  Y P C  ++A  ++TP+G +  D +++  L         D
Sbjct: 60  TL-PSSFDGTFVVIGPSHRGY-PTC--TSAVPWETPLGIVDTDTDLVSAL---------D 106

Query: 122 ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
           + V+E+    EHS+E+ +P++   F     +I P+L+G  +  +    GR+ A+   +  
Sbjct: 107 LPVNESYHADEHSLEVQMPFIKYRFP--RARIAPVLMGDQDLASSVRLGRMIARAARETR 164

Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
               V  SSDF H      Y+      G    +IEAL  + +++        F   L E 
Sbjct: 165 REIRVVASSDFSH------YVPETVAKGQDLYAIEALKSLEIEL--------FYNRLRER 210

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            ++ CG   I+  +        + K + L+Y  S      R S V YA+ A
Sbjct: 211 GSSACGYGTIAAMITASIEAGAE-KGELLKYATSGDVTGDRASVVGYAAIA 260


>gi|126465721|ref|YP_001040830.1| hypothetical protein Smar_0821 [Staphylothermus marinus F1]
 gi|126014544|gb|ABN69922.1| protein of unknown function DUF52 [Staphylothermus marinus F1]
          Length = 289

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 80/299 (26%), Positives = 132/299 (44%), Gaps = 40/299 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSG 54
           +R    AG +Y    ++L E ++         G L +    +  +  G +APHAGY YSG
Sbjct: 7   VRLPVVAGYFYPSEPEKLRELIEHSFKHPLGPGELPQVSSTRRKDALGYVAPHAGYIYSG 66

Query: 55  RAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
             AA+ + N+     P +I    ++G +H       ++  A  + TP+G+L +D E+  +
Sbjct: 67  PIAAHVYYNLALDGKPETI---IIIGTNHTGLGSLVSVYPAGKWITPLGELEVDAELARD 123

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMY 165
           + +      +D      EHS+E+ LP++  +F G+ +KIVPI++G    +     ++A+Y
Sbjct: 124 IVSNSDLAELDTYAHVEEHSVEVQLPFIQYLF-GNNVKIVPIVLGLHTPDVARDLSKAIY 182

Query: 166 GRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
             L     D        SSDF H+                H    A DK  +D I   D 
Sbjct: 183 ESLQTMGRD---TIIIASSDFNHYDP--------------HDITVAKDKKALDRILALDT 225

Query: 226 DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           D   K +LE   TICG   I   + ++     K + K L+Y  S      + + V YA+
Sbjct: 226 DGLYKVILEEPVTICGPGGIMTLIELV-KLYGKGEAKLLKYATSGDVSGDKSAVVGYAA 283


>gi|383789560|ref|YP_005474134.1| putative dioxygenase [Spirochaeta africana DSM 8902]
 gi|383106094|gb|AFG36427.1| putative dioxygenase [Spirochaeta africana DSM 8902]
          Length = 281

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 131/303 (43%), Gaps = 55/303 (18%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA--------GLPKSPE-----VRGVIAPHAG 49
            +R  + AG +Y  ++  L + + G L  A        GLP +       VR VIAPHAG
Sbjct: 5   SVRNPAVAGKFYPRSAAELDQVVAGLLTRASTRLQDVSGLPDAANRTAGTVRAVIAPHAG 64

Query: 50  YSYSGR------AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
           Y +SG       AAA  + +  P    RV LLGP+H+      AL  A  ++TP+GD P+
Sbjct: 65  YMFSGSIAALSFAAAAQYRDAPP---RRVILLGPAHYAAGITVALPQADSFRTPLGDHPV 121

Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
           D E ++ L  TG  E  +     AEHS+E+ LP++  V+    + I+P+L   +  E   
Sbjct: 122 DGEAMQRLLQTGLAE-TNGAAHSAEHSLEVMLPFIMAVWGA--LPILPLLCSRIAPE--- 175

Query: 164 MYGRLFAKYVDD--PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE 221
              RL    V++  P +   VSSD  H+                  + E LD   ++  E
Sbjct: 176 ---RLAEILVEEIGPGDLLVVSSDLSHFHPA--------------ATAEQLDNALLEAWE 218

Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
            GDP    +         CG+ PI   + +      + ++  L Y  S       +S V 
Sbjct: 219 QGDPHGVSQ------GEACGKLPILASMALARQQGWRHRV--LGYAHSGAAGIGSESVVG 270

Query: 282 YAS 284
           Y +
Sbjct: 271 YGA 273


>gi|337288628|ref|YP_004628100.1| hypothetical protein TOPB45_1081 [Thermodesulfobacterium sp. OPB45]
 gi|334902366|gb|AEH23172.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thermodesulfobacterium geofontis OPF15]
          Length = 268

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 75/284 (26%), Positives = 128/284 (45%), Gaps = 28/284 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           R  + AG +Y+DN + L   L   +  RE  +      +G+I PHAGY YSG  A   +G
Sbjct: 3   REPAVAGYFYSDNPRELKFHLSTLIKFRENKI----RAKGIIVPHAGYMYSGWVAGKVYG 58

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +I P   +   ++G +H     + +L     + TP+G++ +D E+ E+L  +      D+
Sbjct: 59  SIIPPDTA--IIIGTNHTGLGERISLFPGEAFITPLGEVSVDKEISEDLIKSVPLLSRDV 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
                EHS+E+ +P+L  +     +KIVPI +  ++ E     G+  A+ +   S+ + +
Sbjct: 117 MAHLHEHSIEVQVPFLQYI--NPKVKIVPICLSKLSLEEVKELGKGIAEVIKKHSDKYIL 174

Query: 183 ---SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSDF H+              V H+  +  D M +  I     + F K + E   + 
Sbjct: 175 IVGSSDFSHY--------------VPHEVAKKKDLMAIREILKLSEEDFLKVIYEEKISA 220

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           CG  PI+V ++   N     K + + Y+ S        S V Y 
Sbjct: 221 CGYIPIAVTINTCKNLGAS-KAELVEYQTSGDIIKDYSSVVGYG 263


>gi|218781792|ref|YP_002433110.1| AMMECR1 domain-containing protein [Desulfatibacillum alkenivorans
           AK-01]
 gi|218763176|gb|ACL05642.1| AMMECR1 domain protein [Desulfatibacillum alkenivorans AK-01]
          Length = 499

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 85/296 (28%), Positives = 129/296 (43%), Gaps = 49/296 (16%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           +E +R  + AGS+YT N   L  ++  +LREA   P    VR ++APHAGY YSG  AA 
Sbjct: 40  IENVREPAVAGSFYTANPSALKRQVKEFLREAKQTPVKGRVRALVAPHAGYVYSGVVAAA 99

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           ++  I P  + +V ++G SH       ++   + Y+TP+G +P+  E     ++ G   L
Sbjct: 100 SYEKI-PRDVRKVIIIGASHRARYRGASIPNVSAYRTPLGLVPIAPEAGRLAESPGFVRL 158

Query: 120 MDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILV-GAVNAEN-----EAMYGRLFA 170
                D+A   EHS+E+ LP+L  V      + +PIL+ G VN E      E +  R F 
Sbjct: 159 -----DQAHAQEHSIEVQLPFLQVVLGD--FEFIPILLGGGVNPEKLAEALEPLVQRGF- 210

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
                   F   S+D  H      Y  Y+        + + LD +    I   D +A K 
Sbjct: 211 --------FLIASTDLSH------YYPYE--------TAQNLDNICCQAIPDLDYEAMK- 247

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
                +   CG  P    + +  N      +  L Y  S      +D  V YA+ A
Sbjct: 248 -----NCKACGWVPTVAVMKVAENLGWTGTL--LDYRNSGDTAGSKDKVVGYAAIA 296


>gi|297527606|ref|YP_003669630.1| hypothetical protein Shell_1646 [Staphylothermus hellenicus DSM
           12710]
 gi|297256522|gb|ADI32731.1| protein of unknown function DUF52 [Staphylothermus hellenicus DSM
           12710]
          Length = 289

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 76/267 (28%), Positives = 122/267 (45%), Gaps = 31/267 (11%)

Query: 27  GWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI----DPTSISRVFLLGPSHHYY 82
           G L +    +  E  G +APHAGY YSG  AA+ + N+     P +I    ++G +H   
Sbjct: 39  GELPKVSNARRKETLGYVAPHAGYIYSGPIAAHVYYNLALDGKPETI---IIIGTNHSGL 95

Query: 83  TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF 142
               ++  A  + TP+G+L +D E+  ++        +D+     EHS+E+ LP++  +F
Sbjct: 96  GSLVSVYPAGKWITPLGELEVDAELARDIVNNSDLAELDVYAHVEEHSIEVQLPFIQYLF 155

Query: 143 EGHLIKIVPILVGAVNAE-----NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
            G  +KIVPI++G    +     ++A+Y  L +   D   +    SSDF H+        
Sbjct: 156 -GDNVKIVPIVLGLHTPDVARDLSKAIYESLQSTGRD---SIIIASSDFNHYDP------ 205

Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
                   H    A DK  ++ I   D D F K LLE   TICG   I   + ++     
Sbjct: 206 --------HDITVAKDKKALERILALDTDGFYKVLLEEHVTICGPGGIMTLIELV-KLYG 256

Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           K + K L+Y  S      + + V YA+
Sbjct: 257 KGEAKLLKYATSGDVSGDKSTVVGYAA 283


>gi|374340342|ref|YP_005097078.1| dioxygenase [Marinitoga piezophila KA3]
 gi|372101876|gb|AEX85780.1| putative dioxygenase [Marinitoga piezophila KA3]
          Length = 268

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 137/291 (47%), Gaps = 35/291 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR---EAGLP-KSPEVRGVIAPHAGYSYSGRAAAY 59
           +R    AG++Y  + + L +++  +     E  +P KS ++ G+I PHAGY +SG+ AAY
Sbjct: 1   MRYPVVAGTFYPADYEELKKQISSFFERFEEIEIPEKSIDLTGLIVPHAGYVFSGQTAAY 60

Query: 60  AFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE----VIEELKAT 114
            +  +     +    ++GP+H    P  ++     + TP G + +D E    ++ EL+  
Sbjct: 61  GYYELMKKGKVKTAIIIGPNHTGVGPSLSVYPEGSWLTPFGTIDVDTEISNFILSELRIK 120

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
           G     DI   E EHS+E+ LP+L  ++ G+  KIVPI++G    E      ++  K+  
Sbjct: 121 G-----DIIAHEYEHSIEVQLPFLQYLY-GNDFKIVPIIMGVQTLEMSKKLSKVIKKFAK 174

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
           D     + SSD  H+    N+    +K   I   IEAL +          P+   +Y+ +
Sbjct: 175 DGVVIIA-SSDLNHYE---NHEVTLEKGNFI---IEALKEK--------SPEKVYEYIKK 219

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           Y+ T CG   I+  L+     +    ++ L +  S +     + +V Y SA
Sbjct: 220 YNITACGFGAINTLLY-----TGFENVRILHHTTSGEIFGDYERTVGYLSA 265


>gi|346224900|ref|ZP_08846042.1| AMMECR1 domain-containing protein [Anaerophaga thermohalophila DSM
           12881]
          Length = 496

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 72/237 (30%), Positives = 111/237 (46%), Gaps = 25/237 (10%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP----EVRGVIAPHAGYSYSGRAAA 58
           K R  + AG +Y  +S  L E L      A   KSP    EV  +I PHAGY YSG  AA
Sbjct: 25  KDRVPAVAGMFYEGDSASLEEHLSSLFERA---KSPVCDGEVAALIVPHAGYVYSGGVAA 81

Query: 59  YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
             F  I + T    VFL+G SH       ++ T   Y TP+G + ++ E+ E+L  +   
Sbjct: 82  SGFNQIPEDTGYENVFLIGSSHRIAFNGASVYTKGDYLTPLGRVEVNRELAEKLVESSSV 141

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAKYVD 174
              +      EHS+E+ LP+L    + HL    K++P+++G  +A        +   Y  
Sbjct: 142 ISDNTAPHIEEHSLEVQLPFL----QYHLKKPFKLIPVVMGPQDASGAEKTAEVLKPYF- 196

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHG---------VIHKSIEALDKMGMDIIET 222
            P N F +S+DF H+    +    DKK           ++ K++E   K+G++ + T
Sbjct: 197 KPGNLFVISTDFSHYPGYEDAKEVDKKTAEAILKNDPRLLLKTLEENRKLGIEGLAT 253


>gi|407776168|ref|ZP_11123457.1| dioxygenase [Thalassospira profundimaris WP0211]
 gi|407280884|gb|EKF06451.1| dioxygenase [Thalassospira profundimaris WP0211]
          Length = 487

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 73/293 (24%), Positives = 126/293 (43%), Gaps = 33/293 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRA 56
           M  IR  + AG++Y  ++  L  E+DG +  A     +      + +I PHAG  +SG  
Sbjct: 1   MTIIRPPAVAGTFYPADADLLRSEIDGLMDAALHSDAVSGDASPKAIIVPHAGLMFSGSL 60

Query: 57  AAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA  F  +     +I R+ ++GP+H       AL+ A  + TP+G++  DL  +++  A 
Sbjct: 61  AALGFATVRALKDTIKRIVIIGPAHRMAFQGIALARADQFATPLGNMRCDLPALQKALAL 120

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
              +++D      EH +E+ LP++ ++F E   I IVP+LV   +         +  K  
Sbjct: 121 PHVQMLDDA-HTLEHGLEIELPFIQRLFGEDADIGIVPLLVSRCSPRQ---VHEVIEKLW 176

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
             P     +SSD  H      +  YD    + +++   ++    + I+T D         
Sbjct: 177 GGPETLIVISSDLSH------FHDYDTAQRMDNRTRTMIENFDAENIDTTD--------- 221

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
                 CG  P++  L +  N    +KIK L    S      +   V Y + A
Sbjct: 222 -----ACGALPVAGMLMVARN--RGMKIKTLGMRNSGDVTGDKSRVVGYGAWA 267


>gi|320451436|ref|YP_004203532.1| hypothetical protein TSC_c23830 [Thermus scotoductus SA-01]
 gi|320151605|gb|ADW22983.1| conserved hypothetical protein [Thermus scotoductus SA-01]
          Length = 270

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 82/290 (28%), Positives = 127/290 (43%), Gaps = 34/290 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ +R  + AG +Y   ++RL +E++G L++A     P VRG+++PHAGY YSG+ AA  
Sbjct: 1   MDLVRLPAVAGYFYPLEAERLRQEVEGLLQKAQASPDPRVRGLLSPHAGYFYSGKVAAEG 60

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           F  +        R+ LLGPSH       A      + TP+G++ +DLE    L A G   
Sbjct: 61  FKVLSAWRGRARRILLLGPSHFVPFLGAAFYPYRAWATPLGEVAVDLEGGSRLVAKGSPF 120

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDP 176
           L        EHS+E+ LP+L          I+P+L G  + +   EA+   L        
Sbjct: 121 LTYEEPFREEHSLEVLLPFLQVALPD--TPILPLLFGEADPKEVAEALLSEL------GE 172

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
            +    SSD  H+       + D+K   + K++ ALD  G+   E               
Sbjct: 173 GDLVVASSDLSHYHPDPVARNRDQK--TLEKAL-ALDVEGVAQGEA-------------- 215

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              CG  P +    + G+   K  +  L Y  S++        V YA+ A
Sbjct: 216 ---CGHLPWATLTALAGSLGWKPTL--LAYATSAEASGDTSRVVGYAAVA 260


>gi|46199650|ref|YP_005317.1| hypothetical protein TTC1348 [Thermus thermophilus HB27]
 gi|46197276|gb|AAS81690.1| hypothetical conserved protein [Thermus thermophilus HB27]
          Length = 280

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 87/291 (29%), Positives = 119/291 (40%), Gaps = 36/291 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ +R    AG +Y      L EE+   L  A  P  P VRGV++PHAGY+Y+GR  A A
Sbjct: 18  MDLVRLPRVAGYFYPGEKAALEEEVGALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 77

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
           F  +        RVFLLGPSH       A      ++TP+G++ +DLE    L   G  F
Sbjct: 78  FRALSAWRGKARRVFLLGPSHFVAFSGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 137

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
                   E EHS+E+ LP+L          I+P+L G V+     EA+   L  K    
Sbjct: 138 RAYREPFLE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPGEVAEALLPELGQK---- 190

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
             +    SSD  H+         D K     K   ALD  G+   E              
Sbjct: 191 --DLVVASSDLSHYHPDPVARKLDAK---TLKRALALDAEGVAQGEA------------- 232

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
               CGR P      +    +   K + L Y  S++    R+  V Y + A
Sbjct: 233 ----CGRLPWGTLTAL--ARALGWKPRLLAYATSAEAGGGRERVVGYGALA 277


>gi|91774303|ref|YP_566995.1| hypothetical protein Mbur_2394 [Methanococcoides burtonii DSM 6242]
 gi|121689012|sp|Q12TI1.1|Y2394_METBU RecName: Full=MEMO1 family protein Mbur_2394
 gi|91713318|gb|ABE53245.1| memo-like protein of unknown function DUF52 [Methanococcoides
           burtonii DSM 6242]
          Length = 264

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 124/287 (43%), Gaps = 25/287 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y  ++K L +E+        +  S +V G + PHAGY YSG  AA+AF  
Sbjct: 1   MRQPTVAGKFYPLSTKALRKEIVKCFHGLEI-MSEDVIGAVVPHAGYVYSGPVAAHAFAR 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +       +F  GP+H  Y    A+S   V+ TP GD+  D E+ + L   G    MD  
Sbjct: 60  LPKADTYVIF--GPNHTGYGSPVAMSQ-DVWNTPFGDVETDRELGKLL--AGTIIDMDEV 114

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
               EHS+E+ +P+L  +F G   K++PI +G  + +     G   A+ V +      F 
Sbjct: 115 AHRYEHSVEVQIPFLQYLF-GSDFKVLPICMGMQDEDTAVEVGLEVARAVKESGKKVVFI 173

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+              V  +  E  D   +D I   D     +   E D T CG
Sbjct: 174 ASSDLSHY--------------VPQEKAEKSDNYLIDAILDMDVPEIYRRKYEKDITACG 219

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
             PI+  L     C  K   + ++Y  S    T     V YA+   K
Sbjct: 220 YGPITAMLTAAKECGAK-NTELVKYGTSGDV-TGDPMVVGYAAIIVK 264


>gi|452965208|gb|EME70234.1| dioxygenase [Magnetospirillum sp. SO-1]
          Length = 456

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 81/292 (27%), Positives = 130/292 (44%), Gaps = 37/292 (12%)

Query: 1   MEKIRRASHAGSWY----TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRA 56
           M  IR  + AG +Y     + +++L   LDG        + P  + +IAPHAG+ YSG  
Sbjct: 1   MTAIRPTAVAGQFYPADFAEANRQLTAFLDGAAPAPCAGRRP--KALIAPHAGWVYSGPV 58

Query: 57  AAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA A+  + P     +RV LLGPSH       ALS+A  + +P+G  PLD +    L   
Sbjct: 59  AAGAYALLKPFRGGYARVVLLGPSHRLAFRGMALSSADQWASPLGPQPLDKD-WSRLAGV 117

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
               ++D    + EHS+E+H+P+L          ++P+++G  +A  E + G L A +  
Sbjct: 118 AGVGVLDQAHAQ-EHSLEVHVPFLQATL--GEFTLLPVVIG--DATPEMVAGLLDALWGG 172

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
           D      +S+D  H      Y+ Y++  G   ++  A+++M        DP A  +    
Sbjct: 173 D-ETLIVISTDLSH------YLPYEQCRGTDGQTAAAIERM--------DPVAISR---- 213

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
                CGR P+   L         ++I  L    S      +D  V Y + A
Sbjct: 214 --EGACGRLPVGGLLT--AAKRRGLEIVTLDVRNSGDTAGPKDRVVGYGAWA 261


>gi|350561283|ref|ZP_08930122.1| protein of unknown function DUF52 [Thioalkalivibrio thiocyanoxidans
           ARh 4]
 gi|349781390|gb|EGZ35698.1| protein of unknown function DUF52 [Thioalkalivibrio thiocyanoxidans
           ARh 4]
          Length = 267

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 95/192 (49%), Gaps = 10/192 (5%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAY 59
           E +R  + AG +Y D+   L   +   + +A  P +     + +IAPHAGY YSG  AA 
Sbjct: 5   ETVRDPAVAGLFYPDDPAALRRAVQALMADATGPAAGRSLPKALIAPHAGYRYSGPVAAS 64

Query: 60  AFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+  +   S  I RV LLGPSH       A + A  Y+TP+G + +D   +  ++     
Sbjct: 65  AYAALGNASDRIQRVVLLGPSHRVPFRGIAATGAGAYRTPLGTIAVDRPALASIRELPGV 124

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             +D+     EHS+E+HLP+L ++  G+   +VP++VG    E  A    +  +    P 
Sbjct: 125 VTLDLA-HGPEHSLEVHLPFL-QLLLGNF-NLVPLVVGDAGPEQVAA---VLERLWGGPE 178

Query: 178 NFFSVSSDFCHW 189
               VSSD  H+
Sbjct: 179 TLIVVSSDLSHY 190


>gi|167622889|ref|YP_001673183.1| hypothetical protein Shal_0949 [Shewanella halifaxensis HAW-EB4]
 gi|167352911|gb|ABZ75524.1| protein of unknown function DUF52 [Shewanella halifaxensis HAW-EB4]
          Length = 269

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 71/294 (24%), Positives = 130/294 (44%), Gaps = 38/294 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGL-------PKSPEVRGVIAPHAGYSYSGRA 56
           IR A+ AG +Y      L   L+G+L  A +        +  + + +I PHAGY YSG  
Sbjct: 5   IRPAAVAGLFYPAAPDELRAMLEGYLARARMSLISQANTQHSQAKVIIVPHAGYIYSGLV 64

Query: 57  AAYAFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA+A+  I+    ++++V L+GP+H  Y    AL  +  ++TP+G + +D   +E L++ 
Sbjct: 65  AAHAYVLIESMAATVNKVLLIGPAHRVYLQGGALPQSQYFETPLGQISIDKRSVEMLESN 124

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
               + D+   + EH +E+ LP+L         +++P+L+G  + +  A   +L  +   
Sbjct: 125 PHITVSDL-PHQQEHCLEVQLPFLQHCLNQ--FELIPLLIGESDPQETA---KLLEQLWG 178

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
                  VS+D  H+       H DK      K ++ L+ +                   
Sbjct: 179 GEETLVVVSTDLSHFHRYAEATHLDKL--TCQKILQGLETI------------------- 217

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           +    CG   ++    ML     ++ +  L Y+ S      ++  V YAS A++
Sbjct: 218 FPEQACGSFCLNAL--MLQLKRHQLCLTQLSYQNSGDTAGDKNRVVGYASFASR 269


>gi|389582562|dbj|GAB65300.1| hypothetical protein PCYB_053180 [Plasmodium cynomolgi strain B]
          Length = 149

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 68/120 (56%)

Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
           KY  D  N F  SSDFCH+G RF++ +  +K+   H   + +D+   +II   D   F  
Sbjct: 30  KYFQDEGNLFLFSSDFCHYGRRFSFTNILQKYSDTHIFKQNMDRDAANIISRHDIAGFID 89

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           YL +  NTICG +PI + L +L + S  +  K + Y QS+Q ++  DSSVSYA   + ++
Sbjct: 90  YLNKTHNTICGSNPIKMMLQLLQDFSGNVSTKLMHYSQSNQARSTNDSSVSYAGMVSFIN 149


>gi|116833213|gb|ABK29519.1| CG8031-PA-like protein [Helicoverpa armigera]
          Length = 82

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 40/80 (50%), Positives = 52/80 (65%), Gaps = 1/80 (1%)

Query: 11 GSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSIS 70
          GSWY +N   L+ +LD WL +A L   P  R +IAPHAGYSY G  AA+A+  + P  + 
Sbjct: 1  GSWYIENGNELSRQLDLWLSKADLTHGP-ARAIIAPHAGYSYCGACAAFAYRQVSPVVVK 59

Query: 71 RVFLLGPSHHYYTPKCALST 90
          R+F+LGPSHH     CALS+
Sbjct: 60 RIFILGPSHHVRLGGCALSS 79


>gi|297568555|ref|YP_003689899.1| protein of unknown function DUF52 [Desulfurivibrio alkaliphilus
           AHT2]
 gi|296924470|gb|ADH85280.1| protein of unknown function DUF52 [Desulfurivibrio alkaliphilus
           AHT2]
          Length = 270

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 72/284 (25%), Positives = 127/284 (44%), Gaps = 24/284 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           IR  + A  +Y   ++ L E L   +  R A  P  P  R V+ PHAGY +SGR A    
Sbjct: 2   IRTPAVAHQFYPGEAEVLRELLTRLIGRRAADQPGQP-ARAVVMPHAGYVFSGRVAGETV 60

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             ++      V +LGP+HH      AL ++  +  P+G++ ++ E+   ++  G   + D
Sbjct: 61  AQVEVPE--DVIVLGPNHHGLGAPAALMSSGSWAMPMGEVQINAELAGLIRRHGPEIVDD 118

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
                AEHS+E+ LP+L     G  ++IVP+ +  ++  +  + G+  A  ++D S    
Sbjct: 119 EQAHRAEHSLEVLLPFLQFCQPG--LRIVPLCLARLSPADCRLVGQALAAAIEDFSRPVL 176

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            + S+D  H+ SR               +  A D + ++ +   DP+   + + E   ++
Sbjct: 177 LAASTDMSHYESR--------------SAATAKDHLAIERVLALDPEGLYRTVAEQQISM 222

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           CG  P  + L M        + + +RY  S +        V YA
Sbjct: 223 CGVIPTVITL-MAALQLGATEAELVRYSDSGEASGDTAQVVGYA 265


>gi|161527723|ref|YP_001581549.1| hypothetical protein Nmar_0215 [Nitrosopumilus maritimus SCM1]
 gi|229554364|sp|A9A2Y3.1|Y215_NITMS RecName: Full=MEMO1 family protein Nmar_0215
 gi|160339024|gb|ABX12111.1| protein of unknown function DUF52 [Nitrosopumilus maritimus SCM1]
          Length = 275

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 73/286 (25%), Positives = 121/286 (42%), Gaps = 21/286 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-----KSPEVRGVIAPHAGYSYSGRAAA 58
           IR+   AG +Y      L E ++  ++    P      +  + GVI+PHAGY YSG  A 
Sbjct: 2   IRKPVVAGQFYPGTKNELEEMINSCIQHKFGPGNQIQNNEGIYGVISPHAGYVYSGPTAC 61

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           Y++  I   +   V +LGP+H       A      ++TP+G + +D E  +E+    K+ 
Sbjct: 62  YSYKAISSKNPELVIILGPNHFGVGKDVATMVNAQWETPLGLVDVDSEAAKEIANNSKYI 121

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            +D      +HS+E+ +P L  +F     KI+PI++   + E     G   A+     + 
Sbjct: 122 EIDEFSHSRDHSLEVQIPMLQSIFSEKF-KILPIILRDQSLEMAKDVGNAVAQIAKSRNT 180

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSDF H+    ++ H             + DK  ++ I   D + F   L+E   T
Sbjct: 181 MIVASSDFTHYEEN-SFAH-------------SQDKALIEPILEMDVEKFYSVLMEKRVT 226

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            CG   ++  +    N    +K + L Y  S        S V Y +
Sbjct: 227 ACGYGAMASVMIACKNLGA-VKGELLSYTTSGDVMGDTSSVVGYGA 271


>gi|406883043|gb|EKD30701.1| hypothetical protein ACD_77C00487G0011 [uncultured bacterium]
          Length = 476

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 132/285 (46%), Gaps = 19/285 (6%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAYAFGN 63
           R+ + AG +Y     +L + L+  L  A LPK    VR +I PHAGY +SG  AA AF  
Sbjct: 10  RKPAVAGMFYPAEPDKLQQTLES-LFAAALPKQCSNVRAIICPHAGYIFSGGVAASAFNQ 68

Query: 64  IDP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           ID  T+  RVF++  SH       A+     +  P G   +D E  + L  +G  E+   
Sbjct: 69  IDKNTNYKRVFIIASSHQVSFEGAAIYCDGDFLMPYGREKVDTEFGKML-VSGFPEIFTS 127

Query: 123 CVDE--AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
                  EHS+E+ LP+L  V + +   IVPI++GA +         +   Y   P N F
Sbjct: 128 HTGPHINEHSLEVQLPFLHYVLKTNYC-IVPIIIGASDPRICNNIALVLKPYF-TPDNLF 185

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDF H      Y +Y+    V  ++ EA+     + ++T       K +     ++C
Sbjct: 186 IISSDFSH------YPNYEDAKKVDAETKEAILSNNPETLQTCLKKNTAKQISHLSTSLC 239

Query: 241 GRHPISVFLHM-LGNCSTKIKIKFLRYEQS--SQCKTKRDSSVSY 282
           G   +   L+M  GN S  ++   + Y+ S  ++C  ++D  V Y
Sbjct: 240 GWTSVLTMLYMTTGNDS--LEYSGIEYKNSGDAECYGEKDKVVGY 282


>gi|83312805|ref|YP_423069.1| dioxygenase [Magnetospirillum magneticum AMB-1]
 gi|82947646|dbj|BAE52510.1| Predicted dioxygenase [Magnetospirillum magneticum AMB-1]
          Length = 456

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 82/293 (27%), Positives = 135/293 (46%), Gaps = 39/293 (13%)

Query: 1   MEKIRRASHAGSWY----TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRA 56
           M  IR  + AG +Y     + +++L   LDG +      + P  + +IAPHAG+ YSG  
Sbjct: 1   MTAIRPTAVAGQFYPADFAEANRQLTAFLDGAVAAPCAGRRP--KALIAPHAGWVYSGPV 58

Query: 57  AAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA A+  + P   S SRV LLGPSH       ALS+A  + +P+G +PLD +    L   
Sbjct: 59  AAGAYALLKPFRGSWSRVVLLGPSHRVAFQGMALSSADQWASPLGAVPLDKD-WSRLAGV 117

Query: 115 GKFELMDICVDEAEHSMEMHLPYL-AKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
               ++D    + EHS+E+H+P+L A + E     ++P+++G  ++  E + G L A + 
Sbjct: 118 AGVGVLDQAHAQ-EHSLEVHVPFLQATIGE---FTLLPVVIG--DSSPEMVAGLLEALWG 171

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
            D      +S+D  H      Y+ Y++      +++ A++ M        D  A  +   
Sbjct: 172 GD-ETLIVISTDLSH------YLPYEQCRSTDGQTVAAIEHM--------DAAALSR--- 213

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              +  CGR P+   L         ++I  L    S      +D  V Y + A
Sbjct: 214 ---DGACGRLPVGGLLTAAKR--RGLEIVTLDVRNSGDTAGPKDRVVGYGAWA 261


>gi|374855667|dbj|BAL58522.1| hypothetical conserved protein [uncultured candidate division OP1
           bacterium]
          Length = 290

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 78/280 (27%), Positives = 127/280 (45%), Gaps = 53/280 (18%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--PEVRGVIAPHAGYSYSGRAAA 58
           ME+IR A+ AGS+Y  +S+RL + ++  L +A  P++    +RG+I PHAGY YSG  AA
Sbjct: 2   MERIRPAAVAGSFYPASSERLRKTVEALLAQAQ-PQTITGRLRGLIVPHAGYIYSGPVAA 60

Query: 59  YAF----------------GNIDPTSIS-----------RVFLLGPSHHYYTPKCALSTA 91
             +                G   P S             R+ LLGP+H+ Y    A    
Sbjct: 61  TGYKLLKDLTPQPPSLKRKGEPSPPSSGEGPGERSAQKVRLLLLGPAHYVYFVGAATLDV 120

Query: 92  TVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVP 151
             ++TP+G +    ++I      G  +  +      EHS+E+ LP++  V      +I P
Sbjct: 121 DAWQTPLGLVECARDLIVRALERGDLQ-SNFDAHAPEHSLEVQLPFVQTVLPN--CEIFP 177

Query: 152 ILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA 211
           IL G  ++   A   R+ + Y+ D  +FF VSSD  H+   + Y    ++  + H++IE 
Sbjct: 178 ILTGECDSRELA---RVLSLYL-DAIDFFIVSSDLSHY---YPYDEAVQRDALAHRAIEN 230

Query: 212 LDKMGMD-------------IIETGDPDAFKKYLLEYDNT 238
            D   M+             ++       +K  LL+Y N+
Sbjct: 231 FDFTLMEDRVEACGKTAILTLMRIAHEQGWKAKLLDYRNS 270


>gi|152992570|ref|YP_001358291.1| hypothetical protein SUN_0977 [Sulfurovum sp. NBC37-1]
 gi|151424431|dbj|BAF71934.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
          Length = 267

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 116/249 (46%), Gaps = 27/249 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR--------EAGLPKSPEVRGVIAPHAGYSYSGRA 56
           R  + AG +Y  ++  + E  + + R        E  L   P  R VI PHAGY YS   
Sbjct: 4   RETAVAGQFYPSDANEIKEMFEHYNRIIEEHLQNEELLHMKP--RAVIVPHAGYVYSAFT 61

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           A  A   +  T   RV ++GPSH  Y    ++S    Y TP+G LP+D E++ ELK+  +
Sbjct: 62  ANVAMRLLGNTEAKRVVVIGPSHRVYLKGTSISDYDSYNTPLGALPIDRELVNELKS--R 119

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           F L  +     EHS E+ +P++ K ++     +V ++ G    E+ A    +    +DDP
Sbjct: 120 FGLQFVPDAHHEHSTEVQMPFV-KTYDTD-ASVVELVYG---DEDPARLAEVIDYLLDDP 174

Query: 177 SNFFSVSSDFCHW-----GSRFNYMHYDKKHGV----IHKSIEALDKMGMD-IIETGDPD 226
                +S+D  H+       + + +  D    +    +H+  EA  K+G++ ++ T    
Sbjct: 175 DTVVVISTDLSHYYDINKAKQLDSICLDAVQKLDTAELHQGCEACGKIGVEAMLITAKRR 234

Query: 227 AFKKYLLEY 235
             K  LL+Y
Sbjct: 235 GLKPLLLDY 243


>gi|217967891|ref|YP_002353397.1| hypothetical protein Dtur_1509 [Dictyoglomus turgidum DSM 6724]
 gi|217336990|gb|ACK42783.1| protein of unknown function DUF52 [Dictyoglomus turgidum DSM 6724]
          Length = 269

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 77/283 (27%), Positives = 127/283 (44%), Gaps = 23/283 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K R  S AG +Y    K L   L  +++E  +P+   + G + PHAGY YSG+ A   + 
Sbjct: 2   KRRLPSVAGYFYPAREKELLNMLSQFIKE--VPQKTPLLGCVVPHAGYIYSGKVAGLIYS 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            ++   ++   ++GP+H+      A+ +  V+ TP+GD+ +D  +  ++    K+   D 
Sbjct: 60  IMEVPEVA--IIIGPNHNGLGYPSAIYSEGVWATPLGDVEIDSILSRKIINNSKYLKEDF 117

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
                EHS+E+ +P+L   +    IKIVPI V   + E     G    K + D   +   
Sbjct: 118 VAHHPEHSLEVQVPFLQ--YLNFNIKIVPITVVDYSYEVLKDLGYAIGKSLLDYEKNTVI 175

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSDF H      Y  Y+        +IEA+  +        D   F K ++E D +IC
Sbjct: 176 IASSDFSH------YEPYEIAKQKDFYAIEAILNL--------DEKEFIKRVVEKDISIC 221

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           G  PI + +  +     + K + + Y+ S       D  V YA
Sbjct: 222 GVGPIVITIIAVKLLGGR-KAQLVAYQTSGDITKDIDQVVGYA 263


>gi|220935846|ref|YP_002514745.1| hypothetical protein Tgr7_2683 [Thioalkalivibrio sulfidophilus
           HL-EbGr7]
 gi|219997156|gb|ACL73758.1| conserved hypothetical protein [Thioalkalivibrio sulfidophilus
           HL-EbGr7]
          Length = 274

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 78/290 (26%), Positives = 122/290 (42%), Gaps = 38/290 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR---EAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           +R A+ AG +Y  ++  L   ++G L    EA  P  P  + +I PHAGY YSG  AA A
Sbjct: 13  VRPAAVAGHFYPADAGLLRTVVEGLLSDFAEAEPPARPP-KALIVPHAGYVYSGAVAARA 71

Query: 61  FGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE---ELKATGK 116
           +  +     I RV L+GP+H       A   A  ++TP+G +P+D   +E   EL +  +
Sbjct: 72  YNRLRGANGIRRVVLVGPAHRVGFYGIAAPEAAWFETPLGRVPVDQPAVEQACELSSVFR 131

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
           ++         EH +E+ LP+L +        +VP++VG     + A   R        P
Sbjct: 132 YD----AAHAPEHCLEVQLPFLQRTLSEDF-TLVPLVVGGATTNDMA---RTLEAVWGGP 183

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                +SSD  H      ++ YD    V  ++  A++ +  + IE G             
Sbjct: 184 ETLVVISSDLSH------HLDYDAARIVDERTSRAIETLDEEGIEAG------------- 224

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
              CGR PI   L +       +  + L    S      RD  V Y + A
Sbjct: 225 -MACGRVPIQGLLRLARQ--RGLAAERLDLRCSGDTAGPRDRVVGYGAYA 271


>gi|54294113|ref|YP_126528.1| hypothetical protein lpl1177 [Legionella pneumophila str. Lens]
 gi|53753945|emb|CAH15416.1| hypothetical protein lpl1177 [Legionella pneumophila str. Lens]
          Length = 447

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 121/251 (48%), Gaps = 31/251 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y D+   L + +  +L +A + K P  + ++ PHAGY YSG  AA A+ +
Sbjct: 3   VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61

Query: 64  I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
           +     SI+++ +LGP+H  Y    A      + TP+G++  D E++ + +     + L 
Sbjct: 62  LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           +    + EH +E+ LP+   +F     KI+P++VG  N ++ A   RL A+     +   
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDNTLL 174

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSD  H+         DKK  +   SI+ L+  G +I+                +  C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214

Query: 241 GRHPISVFLHM 251
           G  P+  FL+ 
Sbjct: 215 GYFPLRGFLYF 225


>gi|18312203|ref|NP_558870.1| hypothetical protein PAE0818 [Pyrobaculum aerophilum str. IM2]
 gi|23822329|sp|Q8ZYE1.1|Y818_PYRAE RecName: Full=MEMO1 family protein PAE0818
 gi|18159641|gb|AAL63052.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
          Length = 281

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 132/292 (45%), Gaps = 27/292 (9%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-------PEVRGVIAPHAGYSYSGR 55
           ++R+ + AG +Y    + L ++++  ++    PK+        E  G +APHAGY YSG 
Sbjct: 2   RVRKPAVAGYFYESGREELLQQIEWAVKHELGPKALQMPKLGGEALGGVAPHAGYMYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+ +  +       VF++ GP+H+      A+  +  ++TP+G + +D E+ E + + 
Sbjct: 62  VAAWLYSALAGYGKPDVFVIVGPNHYGIGAPVAIMKSGAWETPLGRVEVDRELAEVITSH 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            K    D      EHS+E+ +P++   F    +KIVPI++           GR  AK + 
Sbjct: 122 FKEVEDDFYAFSKEHSVEVQVPFIQYYFGD--VKIVPIVMWRQTLSTSRELGRAIAKALK 179

Query: 175 DPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           +     +   SSDF H+          + H +  +     D+M +  I   D     +  
Sbjct: 180 EYGRKAYVIASSDFNHY----------EPHDITTRK----DEMAISKILKLDEAGLFEIS 225

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            ++D +ICG  PI V +         I +  L++  S      +D +V YAS
Sbjct: 226 SKFDISICGIGPIGVLIAAAKELGY-INVTLLKHATSGDTSGYKDETVGYAS 276


>gi|409196691|ref|ZP_11225354.1| AMMECR1 domain-containing protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 481

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/197 (32%), Positives = 96/197 (48%), Gaps = 12/197 (6%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----EVRGVIAPHAGYSYSGR 55
           +++ R+ + AG +Y D+    A  L G L+E      P     EV  +I PHAGY YSG+
Sbjct: 6   LKRDRKPAVAGMFYEDD----ASLLHGHLKELFSRAKPRDTNDEVAAIIVPHAGYVYSGK 61

Query: 56  AAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA  F  I +   I R+FL+G SH       A+     + TP G + +D +V  EL   
Sbjct: 62  IAASGFNQIPEDAKIERIFLIGSSHRVAFDGAAVYMQGDFMTPFGPVEVDADVAGELVEQ 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
                 D  V   EHS+E+ +P+L +    H  K+VP+LVG  +A       +   K+  
Sbjct: 122 ESGIFSDPEVHAREHSLEVQVPFL-QFHLKHPFKLVPLLVGPHDASVTPRIAKTLEKWFT 180

Query: 175 DPSNFFSVSSDFCHWGS 191
              N F +S+DF H+ S
Sbjct: 181 S-GNLFVISTDFSHYPS 196


>gi|16081384|ref|NP_393717.1| hypothetical protein Ta0237 [Thermoplasma acidophilum DSM 1728]
 gi|13878848|sp|Q9HLJ1.1|Y237_THEAC RecName: Full=MEMO1 family protein Ta0237
 gi|10639380|emb|CAC11382.1| conserved hypothetical protein [Thermoplasma acidophilum]
          Length = 268

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 130/291 (44%), Gaps = 32/291 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SPEVRGVIAPHAGYSYSGRAAAYAF 61
           R+ + AG +Y +      +EL   L    +P+   S  + G + PHAG  YSGR A Y++
Sbjct: 3   RKPAVAGYFYPERK----DELYSLLSSFAIPEQHVSGTIIGAVVPHAGIIYSGRTAMYSY 58

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             I+ +++    ++GP+H   TP  +L     + TP+GD  ++  + E L     + + D
Sbjct: 59  RAIEKSAVRDFVIIGPNHRPLTPYASLYPEGEWSTPLGDALINDRMAEALYRDSNYIVKD 118

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EHS+E+ +P+L  +F G   + VP+++G    +     G    K ++DP  F +
Sbjct: 119 EESHLMEHSVEVQIPFLQYLF-GDGFRFVPVILGDQEIDVARDIGEAIMK-IEDPFIFIA 176

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE---TGDPDAFKKYLLEYDNT 238
            SSDF H+                 +  + ++K  MD+I    T D D F   L + + T
Sbjct: 177 -SSDFTHY-----------------EDAKRVEKKDMDLISAILTLDLDKFYSVLEKENVT 218

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
            CG   I+  ++         ++ FL +  S          V YAS  + +
Sbjct: 219 ACGYGAIAALMYYTKKRGG--RMIFLNHSNSGDVTGDYSEVVGYASLVSVI 267


>gi|298674293|ref|YP_003726043.1| hypothetical protein Metev_0320 [Methanohalobium evestigatum
           Z-7303]
 gi|298287281|gb|ADI73247.1| protein of unknown function DUF52 [Methanohalobium evestigatum
           Z-7303]
          Length = 265

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 79/287 (27%), Positives = 133/287 (46%), Gaps = 32/287 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y  +   L +E+     +  + +   + G + PHAGY +SG  AAYAF  
Sbjct: 1   MRNPTVAGQFYALDPSGLEKEISRCFGDIDISQQ-NIHGAVIPHAGYMFSGPVAAYAFAQ 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P + + + +LGP+H       A+S  T + TP+G++  D E+ + L  +    ++D  
Sbjct: 60  L-PEADTYI-ILGPNHTGLGSAVAISQDT-WSTPLGNVETDKELGKSLAGS----ILD-- 110

Query: 124 VDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN- 178
           VDE     EHS+E+ +P+L   F  H  K++PI +G  + E     G   +  + + +  
Sbjct: 111 VDEMAHMYEHSIEVQIPFLQYRFS-HGFKVLPICMGLQDEETALEVGNQISSAITESNKK 169

Query: 179 -FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
            F   SSD  H+          +   V + +    DK  ++ IE  D   F + + E   
Sbjct: 170 IFVIASSDLTHF----------EPENVANDT----DKYLIESIENMDVHEFYRRISERSA 215

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           T CG  PIS  L +  +   +   K L+Y  S+       S V YA+
Sbjct: 216 TACGYGPISAMLTVAKSLGGRTG-KLLKYGTSADVTGDSSSVVGYAA 261


>gi|372489024|ref|YP_005028589.1| putative dioxygenase [Dechlorosoma suillum PS]
 gi|359355577|gb|AEV26748.1| putative dioxygenase [Dechlorosoma suillum PS]
          Length = 268

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 81/297 (27%), Positives = 129/297 (43%), Gaps = 40/297 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAY 59
           M  +R A+ AG +Y  +  +L  ++D +L +A  P +   ++ +I PHAGY YS   AA 
Sbjct: 1   MPSLRPAAVAGLFYPADPAQLGADIDAYLAQARPPAAAGPIKALIVPHAGYLYSAPIAAT 60

Query: 60  AFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+  + P +  I RV LLGP+H       A+ T   + TP+G LP+D E  + +      
Sbjct: 61  AYAALAPQAGRIRRVILLGPTHRVAVRGLAVPTVDSFATPLGPLPIDAEARQRIA----- 115

Query: 118 ELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
           +L  +  D+     EHS+E+ LP+L ++       ++P+ VGA + E  A   ++  +  
Sbjct: 116 DLPQVVADDGAHALEHSLEVQLPFLQRLLGP--FTLLPLAVGAASGEAVA---QVLERLW 170

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
                   VSSD  H      Y  Y +         E LD+     I     D   +   
Sbjct: 171 GGEETLILVSSDLSH------YQPYAEA--------ERLDQATCARIAAARADLAPQ--- 213

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
                 CG  PI+  L  L      ++ + L    S      R+  V YA+ A + D
Sbjct: 214 ----EACGAIPINGLL--LAAARHGLRPRLLDRRNSGDTAGDRERVVGYAAFALEPD 264


>gi|307609923|emb|CBW99449.1| hypothetical protein LPW_12241 [Legionella pneumophila 130b]
          Length = 447

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 120/251 (47%), Gaps = 31/251 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y D+   L + +  +L +A + K P  + ++ PHAGY YSG  AA A+ +
Sbjct: 3   VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61

Query: 64  I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
           +     SI+++ +LGP+H  Y    A      + TP+G++  D E++ + +     + L 
Sbjct: 62  LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           +    + EH +E+ LP+   +F     KI+P++VG  N ++ A   RL A+         
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDDTLL 174

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSD  H+         DKK  +   SI+ L+  G +I+                +  C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214

Query: 241 GRHPISVFLHM 251
           G  P+  FL+ 
Sbjct: 215 GYFPLRGFLYF 225


>gi|54297126|ref|YP_123495.1| hypothetical protein lpp1171 [Legionella pneumophila str. Paris]
 gi|397663639|ref|YP_006505177.1| hypothetical protein LPO_1184 [Legionella pneumophila subsp.
           pneumophila]
 gi|397666836|ref|YP_006508373.1| hypothetical protein LPV_1323 [Legionella pneumophila subsp.
           pneumophila]
 gi|53750911|emb|CAH12322.1| hypothetical protein lpp1171 [Legionella pneumophila str. Paris]
 gi|395127050|emb|CCD05235.1| conserved protein of unknown function [Legionella pneumophila
           subsp. pneumophila]
 gi|395130247|emb|CCD08485.1| conserved protein of unknown function [Legionella pneumophila
           subsp. pneumophila]
          Length = 447

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 120/251 (47%), Gaps = 31/251 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y D+   L + +  +L +A + K P  + ++ PHAGY YSG  AA A+ +
Sbjct: 3   VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61

Query: 64  I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
           +     SI+++ +LGP+H  Y    A      + TP+G++  D E++ + +     + L 
Sbjct: 62  LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           +    + EH +E+ LP+   +F     KI+P++VG  N ++ A   RL A+         
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDDTLL 174

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSD  H+         DKK  +   SI+ L+  G +I+                +  C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214

Query: 241 GRHPISVFLHM 251
           G  P+  FL+ 
Sbjct: 215 GYFPLRGFLYF 225


>gi|157374143|ref|YP_001472743.1| hypothetical protein Ssed_1004 [Shewanella sediminis HAW-EB3]
 gi|157316517|gb|ABV35615.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
          Length = 270

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 95/199 (47%), Gaps = 16/199 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVR--GVIAPHAGYS 51
           +  IR+A+ A  +Y  +   L  ++  +L        E G+    + R   +I PHAGY 
Sbjct: 2   LSSIRQAAVADLFYPADPVELTAQIKQFLCATNDKCGEQGIKPGGKQRPKALIVPHAGYI 61

Query: 52  YSGRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           YSG  A  AF  + D   I  V LLGP+H  Y   CAL  A  + TP+G +P++ +  E 
Sbjct: 62  YSGAVAGRAFARLLDDNKIETVVLLGPAHRVYLQGCALPEANTFATPLGKIPIEQQNYEA 121

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
           L A     + D+     EH +E+ LP+L    +    K++PI+VG  +    A    LF+
Sbjct: 122 LSALDAVTISDL-PHLDEHCLEVQLPFLQLCLQR--FKLLPIVVGECSPSVVADILELFS 178

Query: 171 KYVDDPSNFFSVSSDFCHW 189
           +     +    VS+D  H+
Sbjct: 179 Q---RENTLIVVSTDLSHY 194


>gi|20089490|ref|NP_615565.1| hypothetical protein MA0601 [Methanosarcina acetivorans C2A]
 gi|23822315|sp|Q8TT38.1|Y601_METAC RecName: Full=MEMO1 family protein MA_0601
 gi|19914397|gb|AAM04045.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 267

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 84/288 (29%), Positives = 131/288 (45%), Gaps = 32/288 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAF 61
           ++R+ + AG +Y  + + L +EL       GL  +  EV G + PHAGY YSG+ AA+ +
Sbjct: 2   EMRQPAVAGQFYPLHCENLEKELTRCFE--GLEIREREVFGAVCPHAGYIYSGKVAAHVY 59

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             + P + + V L GP+H  Y    +LS  T +KTP+G + +DLE+ +     G     D
Sbjct: 60  ATL-PEADTYV-LFGPNHTGYGSPVSLSRET-WKTPLGTIDVDLELADGF--LGSIVDTD 114

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
                 EHS+E+ LP+L   F G   KI+PI +G  + +     G L A  V +      
Sbjct: 115 ELGHTYEHSIEVQLPFLQYRF-GRDFKILPICMGMQDKDTAVEVGSLVADLVSESGKRAV 173

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSDF H+ +  +    D +   +  +I  LD  GM              L   + ++
Sbjct: 174 IIASSDFTHYETAEHARETDSE---VIDAILKLDVPGM-----------YDSLYRRNASV 219

Query: 240 CGRHPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
           CG  PI+  L    + S K+   +   L Y  S        + V YA+
Sbjct: 220 CGYGPIAAML----SASQKLGGSRATLLEYANSGDVSGDMSAVVGYAA 263


>gi|312137117|ref|YP_004004454.1| hypothetical protein Mfer_0894 [Methanothermus fervidus DSM 2088]
 gi|311224836|gb|ADP77692.1| protein of unknown function DUF52 [Methanothermus fervidus DSM
           2088]
          Length = 280

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 82/288 (28%), Positives = 128/288 (44%), Gaps = 23/288 (7%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRG------VIAPHAGYSYSGRAA 57
           R+ + AG++Y  + K L   ++   + E G    P+V G      VIAPHAGY YSG  A
Sbjct: 3   RKPAVAGTFYESDPKALKNRIEWCFKHELGPGDFPKVGGDRNIVGVIAPHAGYIYSGPIA 62

Query: 58  AYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           A+A+  I        F +L P+H       +L     ++TP+G + +D E   EL    +
Sbjct: 63  AHAYYRIVEDGFPETFIILCPNHTGMGSSVSLMAEGEWETPLGSVKIDEEFSSELFKKSE 122

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
              +D      EHS E+H+P+L + F+    KIVPI +   + E      +     +   
Sbjct: 123 IIDIDDTAHSQEHSCEVHIPFL-QYFKKDF-KIVPICMWLQDLETTKEVAQAIVDTIK-- 178

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
               S++ D     S  ++ HY+ +     +  E+ DK  +D I   D     K + + +
Sbjct: 179 ----SLNKDIVLIAST-DFTHYEPQ-----EIAESTDKKIIDAILKLDEKVMYKRINDLN 228

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            T+CG  P+SV L          K + L+Y  S      R S V YAS
Sbjct: 229 ATMCGYGPVSVVLISSKKLGAN-KARLLKYGTSGDVTGDRSSVVGYAS 275


>gi|294496196|ref|YP_003542689.1| hypothetical protein Mmah_1548 [Methanohalophilus mahii DSM 5219]
 gi|292667195|gb|ADE37044.1| protein of unknown function DUF52 [Methanohalophilus mahii DSM
           5219]
          Length = 265

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 125/287 (43%), Gaps = 24/287 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y    K L +EL    R+  L  S +  G++ PHAGY +SG  AAYA+  
Sbjct: 1   MRQPTVAGQFYPLGIKPLKKELGHCFRDLEL-SSEDAYGIVVPHAGYMFSGPVAAYAYSK 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +       +F  GP+H  Y    ALST   + TP+GD+  D ++ E L   G    MD  
Sbjct: 60  LPKADTYVIF--GPNHTGYGSAVALST-QAWSTPLGDVEADEKMGEML--AGSIIDMDEI 114

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
               EHS+E+ LP+L   F G   KI+PI +G  + +     G   AK  +       F 
Sbjct: 115 AHMYEHSIEVQLPFLQYRF-GDSFKILPICMGLQDIDTAREVGLEVAKAAEKTGKKVVFI 173

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSDF H+                H++ +  D   +D +   D + F +   E   T CG
Sbjct: 174 ASSDFTHYQP--------------HETAKETDMYLIDALLNMDVEEFDRRREERSATACG 219

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
             PI + ++           + L+Y  S        + V YA+   K
Sbjct: 220 YGPI-IAMYTAAEYMGANTAELLKYATSGDMTGDYSAVVGYAAILVK 265


>gi|14520755|ref|NP_126230.1| hypothetical protein PAB0370 [Pyrococcus abyssi GE5]
 gi|23822337|sp|Q9V189.1|Y539_PYRAB RecName: Full=MEMO1 family protein PYRAB05390
 gi|5457971|emb|CAB49461.1| Predicted dioxygenase [Pyrococcus abyssi GE5]
 gi|380741294|tpe|CCE69928.1| TPA: Dioxygenase [Pyrococcus abyssi GE5]
          Length = 291

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 73/291 (25%), Positives = 133/291 (45%), Gaps = 8/291 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR  + AG +Y +  + L E L+ + R+ G    S ++   +APHAGY +SG  A+  + 
Sbjct: 2   IRYPAVAGQFYPEG-ETLVEMLEEFFRDLGEQGNSRKITAGVAPHAGYVFSGYTASRTYK 60

Query: 63  NIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I    +   F++ GP+H       A+     + TP+G + +D E+ +E+    K   +D
Sbjct: 61  AIYEDGLPETFVIFGPNHTGLGSPIAVYPEGDWVTPLGKVKIDSELAKEIVKLSKIADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
               + EHS+E+ LP++  + E  G   +IVPI +G  + +     GR   +  +     
Sbjct: 121 DLAHKYEHSIEVQLPFIQYIAEKAGTDFRIVPITLGIQDEDVSEALGRAVFEAAEALGRD 180

Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
             V  S+DF H+GS + Y+ +  +   +   ++  D   +  I   D     + + E D+
Sbjct: 181 VIVIASTDFMHYGSFYGYVPFRGRANELPNMVKEWDMRIIRRILDFDLKGMFEEIREMDH 240

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           T+CG   +   + +        + + L Y  S +     D+ V YAS   +
Sbjct: 241 TMCGPGGVGAGI-VYSRLMNAREAELLHYTTSFEVSRSTDAIVGYASIVMR 290


>gi|149188588|ref|ZP_01866880.1| hypothetical protein VSAK1_15917 [Vibrio shilonii AK1]
 gi|148837498|gb|EDL54443.1| hypothetical protein VSAK1_15917 [Vibrio shilonii AK1]
          Length = 267

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 21/197 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP------EVRGVIAPHAGYSYSGRAA 57
           IR  + AG +Y  ++++L  +++  L    +  +       ++RG+I PHAGY +SG  A
Sbjct: 3   IRPPAVAGRFYNKDTEQLTRQVNQLLTGTNVRDAETCNVQNDLRGLIVPHAGYVFSGETA 62

Query: 58  AYAFGNIDPTSIS--RVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
             A+  +   +    RV L+GPSH      CAL +   ++TP+G + +D + +E L    
Sbjct: 63  GLAYHQLQSVAQQFLRVILVGPSHRVAFHGCALPSVGAFETPLGRVSIDRDCVELLADNS 122

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD- 174
              + D      EHS+E+ LP+L  V +    +++PI+ G V+A          AK ++ 
Sbjct: 123 MVSINDQ-AHAQEHSLEVQLPFLQTVLDD--FQLLPIVTGQVSALE-------IAKLIEP 172

Query: 175 --DPSNFFSVSSDFCHW 189
             D      +S+D  H+
Sbjct: 173 IWDSKTLLVISTDLSHF 189


>gi|159042151|ref|YP_001541403.1| hypothetical protein Cmaq_1590 [Caldivirga maquilingensis IC-167]
 gi|189038957|sp|A8M9U2.1|Y1590_CALMQ RecName: Full=MEMO1 family protein Cmaq_1590
 gi|157920986|gb|ABW02413.1| protein of unknown function DUF52 [Caldivirga maquilingensis
           IC-167]
          Length = 282

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 136/294 (46%), Gaps = 30/294 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--------VRGVIAPHAGYSYSGRA 56
           R+ + AG +Y  +   L  ++   +     PKS          V  VI PHAGY YSG  
Sbjct: 4   RKPAVAGMFYEASKDGLINQIKWSIEHELGPKSTMNVKENRQYVLSVIVPHAGYVYSGPV 63

Query: 57  AAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           AA+A+  +      +VF ++GP+H+      A+ T+  ++TP+G + +D EV +++KA  
Sbjct: 64  AAHAYVEVGKYIKPKVFVIIGPNHYGVGSPAAIMTSGTWETPLGQVEIDEEVAKQIKAKV 123

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           K    D    E EHS+E+ +P++  +F G   +IVPI++     +     G   ++ +D 
Sbjct: 124 KDLAEDPIAFEREHSIEVQVPFIQYLFPGS--RIVPIVLWNQTIDLSRRLGSAISEVIDG 181

Query: 176 PSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
            +    V  SSD           HY+       K ++ +++     I   D + F   + 
Sbjct: 182 RAGEVVVVASSDLN---------HYEPHEVTTDKDMKVIER-----ILNMDEEGFYTVMD 227

Query: 234 EYDNTICGRHPI-SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +YD ++CG   I +  ++     ST +K+  L++  S         +V YAS A
Sbjct: 228 KYDVSVCGFGAIMTAIVYSRKQGSTGVKL--LKHATSGDTSGYLLETVGYASIA 279


>gi|430760607|ref|YP_007216464.1| protein of unknown function DUF52 [Thioalkalivibrio nitratireducens
           DSM 14787]
 gi|430010231|gb|AGA32983.1| protein of unknown function DUF52 [Thioalkalivibrio nitratireducens
           DSM 14787]
          Length = 267

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 80/288 (27%), Positives = 127/288 (44%), Gaps = 34/288 (11%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           E +R  + AG +Y D+   L  E+   +    A  P+ P  + +IAPHAGY YSG  AA 
Sbjct: 5   ETVREPAVAGLFYPDDPAALRREVRALMSGATAPAPERPAPKALIAPHAGYRYSGPVAAS 64

Query: 60  AFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+  +      I RV LLGPSH       A + A  Y+TP+G +P+D      ++     
Sbjct: 65  AYAALGAAVAHIRRVVLLGPSHRVPFRGIAATGAGAYRTPLGAIPVDQAANASIRDLPGV 124

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDDP 176
             +D+     EHS+E+HLP+L  + +     +VP++VG  +     A+  RL+      P
Sbjct: 125 VTLDLA-HGPEHSLEVHLPFLQLLLDD--FALVPLVVGDADPNQVTAVLERLWGG----P 177

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                VSSD  H+            HG  +++ ++LD      IE+      ++ +L  D
Sbjct: 178 ETLIVVSSDLSHY------------HG--YRTAQSLDAETCRAIES-----MREDMLTPD 218

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
              CG  P++  L      +  + +  L    S      R   V Y +
Sbjct: 219 RA-CGCRPLAGLLR--AGRAHGLTVHTLDLRNSGDTAGPRTEVVGYGA 263


>gi|441504807|ref|ZP_20986799.1| Putative dioxygenase [Photobacterium sp. AK15]
 gi|441427389|gb|ELR64859.1| Putative dioxygenase [Photobacterium sp. AK15]
          Length = 222

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 79/161 (49%), Gaps = 8/161 (4%)

Query: 43  VIAPHAGYSYSGRAA--AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
           +I PHAGY YSG  A  AY++       I RV L+GPSH Y+   CAL     + TP+G 
Sbjct: 1   MIVPHAGYIYSGEIASRAYSYLKQQADRIKRVILIGPSHRYFFQGCALPGVDYFATPLGQ 60

Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
           +P+D + +E L+     E+ D      EH +E+ LP+L +        ++PIL   V+ +
Sbjct: 61  IPIDTQSVEILRKIDDIEVSD-QAHAFEHCLEVQLPFLQRCLNE--FTLLPILTSDVSPK 117

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK 201
             A   R+      D +    +SSD  H+ S       D++
Sbjct: 118 TVA---RIINSVWQDENTLLVISSDLSHYHSYTEAQTLDQQ 155


>gi|325298560|ref|YP_004258477.1| AMMECR1 domain-containing protein [Bacteroides salanitronis DSM
           18170]
 gi|324318113|gb|ADY36004.1| AMMECR1 domain protein [Bacteroides salanitronis DSM 18170]
          Length = 496

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/282 (29%), Positives = 136/282 (48%), Gaps = 34/282 (12%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           +++R  + AG++Y   +  L + +  ++ +A  +    E++ VI PHAGY YSG  A+ A
Sbjct: 27  KRVRMPAVAGAFYPAKADTLRKMIVSFMSDASSVEPDKEMQAVIVPHAGYVYSGAIASKA 86

Query: 61  FGNIDP-TSISRVFLLGPSHHYYTPKCALSTATV------YKTPIGDLPLDLEVIEE-LK 112
              I    +  R+FLLGPSH     + A   A+V      Y TP+G++ +D E  +  L+
Sbjct: 87  IKAIPADAAYKRIFLLGPSH-----RAAFDGASVNGAFGSYATPLGEVQVDKEACDALLR 141

Query: 113 ATGKFELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
           A   F      V EA   EH +E+ LP L +V    +  IVPI++G  +        +  
Sbjct: 142 ADTVFTY----VPEAHVREHCLEVQLPLL-QVHLRDVPSIVPIIIGTQDLAKLRRIAQAL 196

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE--ALDKMGMDIIETGDPDA 227
             Y     N F +SSDF H+ S  +    DK  G   ++IE  +L K  +D +E    + 
Sbjct: 197 QPYFTS-DNLFVISSDFSHYPSYEDANRVDKATG---EAIESGSLRKF-LDALEMNAREG 251

Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
             + L     + CG+ PI+V L ++      ++++ L Y  S
Sbjct: 252 VPQLL----TSACGQSPIAVLLSLM-EGRKDLQVRHLAYCNS 288


>gi|15606539|ref|NP_213919.1| hypothetical protein aq_1336 [Aquifex aeolicus VF5]
 gi|7388474|sp|O67355.1|Y1336_AQUAE RecName: Full=MEMO1 family protein aq_1336
 gi|2983762|gb|AAC07322.1| hypothetical protein aq_1336 [Aquifex aeolicus VF5]
          Length = 374

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 85/289 (29%), Positives = 134/289 (46%), Gaps = 32/289 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAFG 62
           R  SHAG  Y +N     +EL  ++ E     S +   RG++ PH     +       + 
Sbjct: 103 REPSHAGQAYPEN----PQELKNFIEETVNKNSEKFKARGILVPHMDLRVASGVYGSVYS 158

Query: 63  NIDPTSISRVFLLGPSHHYY-TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM- 120
            I       V LLG SH+++ TP   L      +TP+GDL +D+E +EEL+    ++L  
Sbjct: 159 AIKENEYDTVVLLGVSHYFHETPFSVLPLDL--RTPLGDLKVDIERVEELQKMFDYDLSH 216

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D+   + EHS+E    +L  +F    +K++P +V   + ++         K ++D  N  
Sbjct: 217 DVLAYKNEHSIEFQTIFLKYLFPE--VKVIPAIVSYGDTKSLKEIAHKITKVLEDSQNPL 274

Query: 181 SVSS-DFCHWGSRFNYMH-YDKKHGVIHKSIEALDKMGMDII-ETGDPDAFKKYLLEYDN 237
            +SS DF H G +F   H YD             D+  ++++ E  + +AF   LL+ DN
Sbjct: 275 IISSVDFSHVGRKFGDPHSYDPS---------PRDREYINLLAELKNEEAFN--LLQSDN 323

Query: 238 T---ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
               I G+    VFL +L N   K + K L Y+   +  T  DS VSYA
Sbjct: 324 NRTRIDGQFTNFVFLEILKNLGVK-EGKLLDYDVYHEAPT--DSKVSYA 369


>gi|350534319|ref|ZP_08913260.1| hypothetical protein VrotD_24493 [Vibrio rotiferianus DAT722]
          Length = 264

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 61/203 (30%), Positives = 94/203 (46%), Gaps = 14/203 (6%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA--YAF 61
           +R  + AG +Y  + ++L+  +  W + +     P +R ++ PHAGY YSG  AA    F
Sbjct: 3   VRPPAVAGRFYPHSPQQLSSMIQSWEKPSACRFQPVIRAMVVPHAGYIYSGEVAAGVCQF 62

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
                  I RV LLGPSHH+    CAL +   + TP+G    ++EV  E +A    +   
Sbjct: 63  LKSQADRIHRVILLGPSHHFSFLGCALPSVDWFLTPLG----NIEVDREWRAILAQQQDV 118

Query: 122 ICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           +  D+A   EH +E+ LP L         +I+PIL   ++    A    L      D  +
Sbjct: 119 VVSDDAHAPEHCLEVQLPLLQTYLGA--FRILPILTSRISP---ARMAELLDPVWLDEES 173

Query: 179 FFSVSSDFCHWGSRFNYMHYDKK 201
              VSSD  H+ S       D++
Sbjct: 174 LLVVSSDLSHYHSYRQAQRLDQQ 196


>gi|291614303|ref|YP_003524460.1| hypothetical protein Slit_1844 [Sideroxydans lithotrophicus ES-1]
 gi|291584415|gb|ADE12073.1| protein of unknown function DUF52 [Sideroxydans lithotrophicus
           ES-1]
          Length = 263

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 78/288 (27%), Positives = 122/288 (42%), Gaps = 32/288 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  +R  + AG +Y      L+ E+  +L EA  P++   + +I PHAGY YSG  AA A
Sbjct: 1   MLAVRPPAVAGMFYPAEPGELSHEILEFLTEAQ-PRTLIPKALIVPHAGYVYSGAIAASA 59

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  +   S  I RV LLGP+H       AL     + TP+G + LD E++  + +  +  
Sbjct: 60  YATLKSISARIRRVVLLGPTHRVAVRGLALPDVEAFDTPLGRVMLDTELMRIISSLPQVT 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            +       EHS+E+ LP+L  V       ++P+ VG   AE  A    +          
Sbjct: 120 -VSAAAHAQEHSLEVQLPFLQSVLND--FTLLPLAVGMATAEEVA---EVLELAWGGEET 173

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              +SSD  H      Y+ Y     V  ++I        +I+E   P A ++        
Sbjct: 174 LIVISSDLSH------YLPYAAAQSVDGRTIR-------NILELKQPIAHEEA------- 213

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CG  PIS  +  +     ++K   L    S      +++ V Y S A
Sbjct: 214 -CGGTPISGLI--IVARKHQLKPHLLDLRNSGDTAGSKNNVVGYTSIA 258


>gi|344200325|ref|YP_004784651.1| mediator of ErbB2-driven cell motility-containing protein
           [Acidithiobacillus ferrivorans SS3]
 gi|343775769|gb|AEM48325.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Acidithiobacillus ferrivorans SS3]
          Length = 273

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 79/295 (26%), Positives = 134/295 (45%), Gaps = 40/295 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV------RGVIAPHAGYSYSG 54
           M+ +R A+ AG++Y   +  L  E+   L  AG   + E       + +I PHAGY YSG
Sbjct: 1   MQFVRPAAVAGTFYPAKASVLRAEVARLL--AGAESADEAAAVAWPKAIIVPHAGYVYSG 58

Query: 55  RAAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
             AA A+  +      I RV LLGP+H       AL      +TP+G + +D   +  L 
Sbjct: 59  AVAASAYALLAKGRGHIRRVVLLGPAHRVPFRGLALPGVQAMQTPLGVVAVDQAGVAAL- 117

Query: 113 ATGKFELMDICVDEA-EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
             G  +++++    A EH++E+ LP++ +V +G  + ++P++VG  +A  EA+   +  K
Sbjct: 118 -AGLPQVLEMPAAHAQEHALEVQLPFIQEVLDG--VSVIPLVVG--DARPEAV-AEVLEK 171

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                     +SSD  H      Y  Y +   + H ++E + +             F   
Sbjct: 172 LWGGEETVIVISSDLSH------YHPYTEARAIDHHTVEEILR-------------FDPL 212

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            +E+D   CG  PI+ FL +       ++ + +    S      RD+ V YA+ A
Sbjct: 213 PIEHDQA-CGATPINGFLPV--ARKHHLRPRLVDLCNSGDTAGSRDAVVGYAAVA 264


>gi|261403545|ref|YP_003247769.1| hypothetical protein Metvu_1434 [Methanocaldococcus vulcanius M7]
 gi|261370538|gb|ACX73287.1| protein of unknown function DUF52 [Methanocaldococcus vulcanius M7]
          Length = 287

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 133/301 (44%), Gaps = 33/301 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
           M  +R  + AG +Y  +   L E ++  +L + G PKS  VRG       ++ PHAGY Y
Sbjct: 1   MTNVRYPAVAGLFYPSHPDELIEMIEQCYLHKFG-PKSMPVRGTYEKPIGLLCPHAGYVY 59

Query: 53  SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           SG   A+++      +DP   +   +LGP+H       ++    +++TP+GD+  D E +
Sbjct: 60  SGPIQAHSYYELSKRVDPLEETTAVILGPNHTGLGSGVSVCNG-IWRTPLGDVKCDEEFV 118

Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
           EEL    +   +D      EHS+E+ LP+L   ++      KIVPI +   + E     G
Sbjct: 119 EELWRKCEIIDLDETAHMNEHSIEVQLPFLKHLELLNIAKFKIVPICMMFQDYETSVEVG 178

Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
              AK   +      +  SSD  H+  +       KK  ++ K          DI+E  D
Sbjct: 179 YFVAKVAKELGRRVVIIASSDLTHYEPQ---EVASKKDAIVIK----------DILEM-D 224

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             A  + ++ Y+ ++CG  P+   L  +     K   K L Y  S        S V YAS
Sbjct: 225 EKALYEDVINYNISMCGYGPVIAMLRAMKILGAK-NSKLLAYATSGDITGDYSSVVGYAS 283

Query: 285 A 285
           A
Sbjct: 284 A 284


>gi|148358756|ref|YP_001249963.1| hypothetical protein LPC_0635 [Legionella pneumophila str. Corby]
 gi|296106801|ref|YP_003618501.1| putative dioxygenase [Legionella pneumophila 2300/99 Alcoy]
 gi|148280529|gb|ABQ54617.1| hypothetical protein LPC_0635 [Legionella pneumophila str. Corby]
 gi|295648702|gb|ADG24549.1| Predicted dioxygenase [Legionella pneumophila 2300/99 Alcoy]
          Length = 447

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 120/251 (47%), Gaps = 31/251 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y D+   L + +  +L +A + K P  + ++ PHAGY YSG  AA A+ +
Sbjct: 3   VRQPAVAGYFYPDDPLILKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61

Query: 64  I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
           +     SI+++ +LGP+H  Y    A      + TP+G++  D E++ + +     + L 
Sbjct: 62  LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           +    + EH +E+ LP+   +F     KI+P++VG  N ++ A   RL A+         
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDDTLL 174

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSD  H+         DKK  +   SI+ L+  G +I+                +  C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214

Query: 241 GRHPISVFLHM 251
           G  P+  FL+ 
Sbjct: 215 GYFPLRGFLYF 225


>gi|390939216|ref|YP_006402953.1| putative dioxygenase [Sulfurospirillum barnesii SES-3]
 gi|390192323|gb|AFL67378.1| putative dioxygenase [Sulfurospirillum barnesii SES-3]
          Length = 266

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 72/253 (28%), Positives = 115/253 (45%), Gaps = 30/253 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAA-- 57
           M K R+AS AG +Y DN   L   +  + +      +    R +I PHAGY YSG  A  
Sbjct: 1   MHKYRKASVAGMFYPDNCSELKHYISHFDKATPTFTQDLFPRALIVPHAGYIYSGYTANL 60

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           AYA+       I RV ++GPSH  Y    +++    Y TP GD+ +DL     L+   KF
Sbjct: 61  AYAYTASRRCDIERVVVIGPSHRVYIAGASIAFFDAYHTPCGDITIDLSYAHALQK--KF 118

Query: 118 ELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR---LFAKYV 173
             +D   +   EHS E+ +P++   F+     +V I+ G +       YG    L  + +
Sbjct: 119 AFLDFHPNAHEEHSTEVQMPFIHHYFKK--ANVVEIVYGDIT------YGELSSLINEVL 170

Query: 174 DDPSNFFSVSSDFCHWGSR----------FNYMHYDKKHGVIHKSIEALDKMGMD-IIET 222
            D      +S+D  H+ +            N ++    H +  ++ EA  K+G+  ++ +
Sbjct: 171 KDDKTLIVISTDLSHFHTEEEANRRDTFCINAINTLDLHAL--ENCEACGKLGVKALVHS 228

Query: 223 GDPDAFKKYLLEY 235
               AFK Y L+Y
Sbjct: 229 ARLHAFKPYFLDY 241


>gi|315655805|ref|ZP_07908703.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333]
 gi|315489869|gb|EFU79496.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333]
          Length = 488

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 69/218 (31%), Positives = 103/218 (47%), Gaps = 28/218 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
           M  +R+ + AGS+Y  +S  L + +D  L++A             G P   ++ GV  PH
Sbjct: 1   MSFVRKPAVAGSFYPQDSGELQKTVDELLKKATALITNDVSRDSQGRPLR-QLMGVQVPH 59

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           AGY YSG  AA  F  +      RV + GP+H       ALS AT + TP+G++P+D   
Sbjct: 60  AGYIYSGLIAAVGFQLVSALRPRRVIIAGPTHRVGCHGIALSEATAFATPLGEVPVDQVA 119

Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
            + + E  +  KF      VD  EH++E+ LP+L  VF      IVP+ VG    E+  +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDIDFAIVPLAVGDCTPEDVCD 176

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
           ++   L     +       VSSD  H      Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLAYDE 208


>gi|403178136|ref|XP_003336578.2| hypothetical protein PGTG_17889 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375173354|gb|EFP92159.2| hypothetical protein PGTG_17889 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 293

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 66/167 (39%), Positives = 79/167 (47%), Gaps = 29/167 (17%)

Query: 147 IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-- 204
           IKIV ILVG +    E  Y  L A Y+ DP N F VSS     G  F + +Y   H +  
Sbjct: 86  IKIVLILVGLIMFAEELKYSALLAPYLADPENLFVVSSVLSLGGHDF-HTYYQDPHLIEL 144

Query: 205 --------------IHKSIEALDKMGMDIIETGDP----DAFKKYLLEYDNTICGRHPIS 246
                         IH+SIE L K  +  I   DP      FK YL    NTI  RHPIS
Sbjct: 145 AKQLSPGSPLPEYPIHQSIENLYKEAITAISLQDPKQAHKTFKSYLKRKKNTIGSRHPIS 204

Query: 247 VFL----HMLGNCST----KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           V L     +   C T    + +++ + YEQSS C T RDSSVSY SA
Sbjct: 205 VLLGSMIKLQQTCLTPPPHEQQLQLVCYEQSSPCFTVRDSSVSYVSA 251


>gi|15678074|ref|NP_275188.1| hypothetical protein MTH45 [Methanothermobacter thermautotrophicus
           str. Delta H]
 gi|7388365|sp|O26151.1|Y045_METTH RecName: Full=MEMO1 family protein MTH_45
 gi|2621080|gb|AAB84552.1| conserved protein [Methanothermobacter thermautotrophicus str.
           Delta H]
          Length = 277

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 124/291 (42%), Gaps = 29/291 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPEVR---GVIAPHAGYSYSGR 55
           IRR + AG++Y  +   L   ++ W     L    LP    +R   GVIAPHAGY YSG 
Sbjct: 2   IRRPAVAGAFYERDPAALRRRIE-WCFEHELGPGTLPAVGSMRRIKGVIAPHAGYMYSGP 60

Query: 56  AAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+A+  +    I   + ++ P+H       +L     ++TP+G + +D E+ E +   
Sbjct: 61  VAAHAYHELVSDGIPGTLVIICPNHTGMGSGVSLMQQGAWETPLGTVEIDSELAEAIVRE 120

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
                +D     AEHS E+H+P++    +    +IVP+ +     E  A  G   A  + 
Sbjct: 121 SGIIDLDETAHLAEHSCEVHVPFIQYFTDN--FRIVPVTMWMQGHETAADVGHAVASAIR 178

Query: 175 DPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           +     +V  S+DF H+  +                 EA D+  +D I   D       +
Sbjct: 179 ETGRDAAVIASTDFTHYSPQ--------------DIAEATDRRIIDRITAMDDTGMYGVI 224

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            E + T+CG  P++  + +        +   LRY  S      R S V YA
Sbjct: 225 SELNATMCGYGPVAATI-IASRILGATECDLLRYATSGDVTGDRSSVVGYA 274


>gi|52841403|ref|YP_095202.1| hypothetical protein lpg1169 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|378777079|ref|YP_005185516.1| hypothetical protein lp12_1149 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
 gi|52628514|gb|AAU27255.1| hypothetical protein lpg1169 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|364507893|gb|AEW51417.1| hypothetical protein lp12_1149 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
          Length = 453

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 72/284 (25%), Positives = 130/284 (45%), Gaps = 33/284 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y D+   L + +  +L +A + K P  + ++ PHAGY YSG  AA A+ +
Sbjct: 9   VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGAVAASAYAS 67

Query: 64  I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
           +     +I+++ LLGP+H  Y    A      + TP+G++  D E++ + +     + L 
Sbjct: 68  LRDKKDTINKIILLGPAHRLYFKGIAYDPVDKFATPLGEIDQDKELLTQIIDLPYVYSLP 127

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           +    + EH +E+ LP+   +F     KI+P+++G  N ++ A   RL A+         
Sbjct: 128 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVIGETNPQDVA---RLIARIWGGDDTLL 180

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSD  H+         DKK  +   SI+ L+  G +I+                +  C
Sbjct: 181 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 220

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           G  P+  FL+        +  + L    S      ++  V YA+
Sbjct: 221 GYFPLRGFLYF--GRQNHVYSRLLDLRNSGDTAGDKERVVGYAA 262


>gi|206890214|ref|YP_002248185.1| hypothetical protein THEYE_A0338 [Thermodesulfovibrio yellowstonii
           DSM 11347]
 gi|206742152|gb|ACI21209.1| conserved protein [Thermodesulfovibrio yellowstonii DSM 11347]
          Length = 269

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 73/288 (25%), Positives = 129/288 (44%), Gaps = 32/288 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR--GVIAPHAGYSYSGRAAAYAFG 62
           RR S AG +Y  NSK L +E++ ++     P + ++   G I PHAGY YSG  A   + 
Sbjct: 4   RRPSVAGYFYPSNSKELLQEIEEYM-----PTTLKIDAFGAICPHAGYVYSGSVAGAVYS 58

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            + P  I    L+GP+H  Y    AL T   ++ P+G + +  E+ +++         DI
Sbjct: 59  KLKPKEI--FILIGPNHTGYGENVALMTEGEWEIPLGSIKIHSELAKKIIEKSPLAKDDI 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF--- 179
                EHS+E+ LP++ K+      +IVPI +  ++ ++     +     V++  NF   
Sbjct: 117 HAHLHEHSLEVQLPFIYKL--NPEAQIVPITLKMLSLKDCLSLAQGIVLAVNE-LNFRDK 173

Query: 180 --FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                S+D  H+              +  +    +D + ++ I   DP+     +LE+  
Sbjct: 174 VVVVASTDMSHY--------------LPDELARKVDSLAVEKIRHFDPEGLYNTVLEHGI 219

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           ++CG  P    L+       K +I+ ++Y  S++     D  V Y  A
Sbjct: 220 SMCGFVPTVTMLYTTKLLGAK-EIQVIKYTTSAEVSRDYDKVVGYLGA 266


>gi|148264297|ref|YP_001231003.1| hypothetical protein Gura_2247 [Geobacter uraniireducens Rf4]
 gi|146397797|gb|ABQ26430.1| protein of unknown function DUF52 [Geobacter uraniireducens Rf4]
          Length = 267

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 128/285 (44%), Gaps = 27/285 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR+ + AG +YT++ K L  +L+  +      +  +V GVIAPHAGY YSG  A   +G+
Sbjct: 2   IRQPAVAGQFYTNDPKELRAQLERMVTVETFRE--KVLGVIAPHAGYMYSGAVAGRLYGS 59

Query: 64  ID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           ++  PT    V +LGP+H     + A+  A  + TP+G + ++  +   +K        +
Sbjct: 60  VEVPPT----VIILGPNHQGVGARAAIYPAGKWLTPLGAVSINSRLSALIKKHAPLVEEE 115

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNF 179
                 EHS+E+ +P+L   F    + IVP+ +G  +  +    G   A+ + +      
Sbjct: 116 TSAHHFEHSLEVQVPFLQ--FVRSDVTIVPLCLGFGDYASCKSLGSGIARAIREYGEEVL 173

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSD  H+ S    M  +K            D+  ++ I   +P+   +   E D T+
Sbjct: 174 IVASSDMTHYESAV--MAKEK------------DEHAINEILALNPEGLLRVCREKDITM 219

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           CG  P +V L  +       + K +RY  S          V+YA+
Sbjct: 220 CGVIPATVMLVAVNELGAT-QAKLVRYATSGDVTGDNRQVVAYAA 263


>gi|83591788|ref|YP_425540.1| hypothetical protein Rru_A0448 [Rhodospirillum rubrum ATCC 11170]
 gi|386348477|ref|YP_006046725.1| hypothetical protein F11_02290 [Rhodospirillum rubrum F11]
 gi|83574702|gb|ABC21253.1| Protein of unknown function DUF52 [Rhodospirillum rubrum ATCC
           11170]
 gi|346716913|gb|AEO46928.1| hypothetical protein F11_02290 [Rhodospirillum rubrum F11]
          Length = 282

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 121/285 (42%), Gaps = 41/285 (14%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSP---EVRGVIAPHAGYSYSGRAAAYAFGNIDP 66
           AG +Y  +  +LA+E+D  L  AG+  SP     + V+APHAG +YSG  AA+AF  +  
Sbjct: 22  AGRFYPGDPAQLAQEVDSLL--AGVKASPAPLRPKAVVAPHAGLAYSGPTAAHAFAPLAR 79

Query: 67  --TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD---LEVIEELKATGKFELMD 121
              +I RV +LGPSH       ALS    Y +P+G +P+D    + + +L   G+ E   
Sbjct: 80  HRDAIHRVVVLGPSHRLAFRGLALSGTRAYDSPLGPVPVDHSWQDRLSDLAFVGRLEQ-- 137

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EH++E+ LP+L ++       IVPI+ G   AE  A       +    P     
Sbjct: 138 --AHGPEHALEVELPFLQRILG--PFSIVPIVCGEAAAEEVAQA---LERVWGGPETLIV 190

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           VSSD  H+     Y    ++ G    +I  LD   +   E                  CG
Sbjct: 191 VSSDLSHY---LEYEDARRQDGRTATAIARLDGAPIGPREA-----------------CG 230

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
             P+   L +    +  +  + L    S      RD  V Y + A
Sbjct: 231 HVPLGGLLCL--ARAAGMTTRCLDLRNSGDTAGPRDRVVGYGAWA 273


>gi|163796279|ref|ZP_02190240.1| Predicted dioxygenase [alpha proteobacterium BAL199]
 gi|159178421|gb|EDP62963.1| Predicted dioxygenase [alpha proteobacterium BAL199]
          Length = 467

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 93/188 (49%), Gaps = 8/188 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R    AG +Y +      + L+ +L  A      + + ++APHAG+ +SG  A  A+  
Sbjct: 12  VRAPQVAGRFYPNGHNACIDLLERYLSAARPSGGVDAKVMVAPHAGWIFSGPIAGTAYAA 71

Query: 64  IDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             P +  ISRV LLGP+H       A ++A  + TP+G LP+D + +    A     +MD
Sbjct: 72  FLPKAEQISRVVLLGPAHRVPFKGIATTSADGWATPLGTLPVDWDALRTALAIPGVAVMD 131

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
               E EHS+E+H P++ +VF      IVP+LVG   A   ++   +  +    P     
Sbjct: 132 EAFAE-EHSLEVHAPFVQRVFP--RAAIVPLLVGDAPA---SLVSGVVERLWGGPETLIL 185

Query: 182 VSSDFCHW 189
           VSSD  H+
Sbjct: 186 VSSDLSHF 193


>gi|443695207|gb|ELT96159.1| hypothetical protein CAPTEDRAFT_96193, partial [Capitella teleta]
          Length = 156

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 84/154 (54%), Gaps = 6/154 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR+ + +G++Y D+   L   +   + +A  P+    + +I PHAG+ YSG  AA A+  
Sbjct: 3   IRQPAVSGTFYPDSPSALDHFVKELVNDAK-PRDFSPKALIVPHAGFIYSGPVAASAYRL 61

Query: 64  IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           +D    +I RV LLGPSH       AL     + TP+G +PLD   +EE+K   + +++D
Sbjct: 62  LDSMKENIRRVVLLGPSHRVPLQGMALPACEAFSTPLGTIPLDTAAMEEIKTFSQMQILD 121

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
                 EHS+E+  P++    +    K++PI+VG
Sbjct: 122 KA-HAYEHSLEVQCPFIQTCLDD--FKLIPIVVG 152


>gi|39996929|ref|NP_952880.1| MEMO-like protein [Geobacter sulfurreducens PCA]
 gi|39983817|gb|AAR35207.1| MEMO-like protein [Geobacter sulfurreducens PCA]
          Length = 267

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 71/252 (28%), Positives = 114/252 (45%), Gaps = 30/252 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +YTD+ +RL EEL   +R    P+   V GVIAPHAGY YSG  A   +G+
Sbjct: 2   VRQPAVAGQFYTDDPRRLREELGHLIRPVPAPR--RVTGVIAPHAGYMYSGAIAGAVYGS 59

Query: 64  IDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           I    I R V +LGP+HH      +L     + +P+G++P++  +   +         D+
Sbjct: 60  I---VIPRTVVILGPNHHGLGAAASLYPDGTWLSPLGEVPIEQRLSSLVLEHVPQAEPDV 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA-----VNAENEAMYGRLFAKYVDDPS 177
                EHS+E+ +P+L   +    + IVP+ +G           E +  R  A Y ++  
Sbjct: 117 IAHRFEHSLEVQVPFLR--YLNSDVAIVPMCLGGGGYGWCRQVGEGL-ARAIAAYGEE-- 171

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                SSD  H+ S               +S    D+  +  +   D +   K   +   
Sbjct: 172 VLIVASSDMTHYES--------------AESARLKDEAALSCVLALDAEGLLKVCRQRGI 217

Query: 238 TICGRHPISVFL 249
           T+CG  P +V L
Sbjct: 218 TMCGVIPSTVML 229


>gi|220923217|ref|YP_002498519.1| hypothetical protein Mnod_3293 [Methylobacterium nodulans ORS 2060]
 gi|219947824|gb|ACL58216.1| protein of unknown function DUF52 [Methylobacterium nodulans ORS
           2060]
          Length = 274

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 94/194 (48%), Gaps = 21/194 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA------GLPKSPEVRGVIAPHAGYSYSGRAAA 58
           R A+ AG +Y D+S RL   ++  L EA      G PK+     +IAPHAGYSYSG  AA
Sbjct: 11  RPAAAAGLFYPDDSHRLRSAVERLLAEAPRAAASGRPKA-----IIAPHAGYSYSGPVAA 65

Query: 59  YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
            AF  ++     I RV L+GP+H+       +     ++TP+G +PLDL  +        
Sbjct: 66  AAFAPLEGAEQDIERVILIGPAHYIRFRGICIPMVEGFETPLGRVPLDLNALAVSAEMPG 125

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDD 175
            +  D      EH++E+ LP+L  +        VP+LVG    E   A+  RL+      
Sbjct: 126 VQAAD-APHAPEHALEVELPFLQVLLS--RFAAVPLLVGDARPEEVAAVLDRLWGG---- 178

Query: 176 PSNFFSVSSDFCHW 189
           P     VSSD  H+
Sbjct: 179 PETVIVVSSDLSHF 192


>gi|409095141|ref|ZP_11215165.1| hypothetical protein TzilA_00680 [Thermococcus zilligii AN1]
          Length = 292

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 72/292 (24%), Positives = 134/292 (45%), Gaps = 8/292 (2%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAF 61
           ++R  + AGS+Y D+ + L   L+ + R+ G   S  ++   +APHAGY +SG  A+  +
Sbjct: 2   EVRYPAVAGSFYPDD-ETLIGMLERFFRDLGEEGSGRKITAGVAPHAGYVFSGYTASRTY 60

Query: 62  GNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             I    +   F +LGP+H       A+     + TP+G + +D E+ + +        +
Sbjct: 61  KAIFEDGLPETFVILGPNHTGLGSPIAVYPGGEWLTPLGGIEVDSEMAKAIARLSGIADL 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D      EHS+E+ +P++  + E  G  +KIVPI +G  + E     G+   +   +   
Sbjct: 121 DKRAHTYEHSIEVQVPFIQYIAEQAGKDVKIVPITLGIQDEEVARDLGKAIFEASKELGR 180

Query: 179 FFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
              V  S+DF H+G  + Y+ +  +   +   I+  D   +  I   D D     + + D
Sbjct: 181 DVVVIASTDFTHYGVVYGYVPFRARADELPHRIKEWDFQVIRHILDFDVDGMFNEVRKLD 240

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           +T+CG   +   + +    +  ++ + L Y  S +     D+ V YAS   +
Sbjct: 241 HTMCGPGGVGTAI-VYSRLAGAVEAELLHYTTSFEVSRSTDAVVGYASIVMR 291


>gi|48478030|ref|YP_023736.1| hypothetical protein PTO0958 [Picrophilus torridus DSM 9790]
 gi|48430678|gb|AAT43543.1| hypothetical conserved protein DUF52 [Picrophilus torridus DSM
           9790]
          Length = 268

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 124/284 (43%), Gaps = 19/284 (6%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR    AG++Y D+   L         E    K  ++ GV+ PHAGY YSG+ A  ++  
Sbjct: 2   IRDPYVAGAFYPDSESELLNYFKNLEPERFDIKFNKILGVVVPHAGYEYSGKIAWASYSI 61

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +   +  R  ++GP+H+ Y    A+ +   ++TP+GD  +D E+ E+L         D  
Sbjct: 62  LKEYNARRFLIIGPNHYGYPFYPAIYSNGSWRTPLGDSIIDNELSEQLIMKSGIIKNDPE 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
               EHS+E+ LP+L  +F+      VP+++G  + E     G         P      S
Sbjct: 122 THSTEHSIEVQLPFLQYIFKNQF-TFVPLILGDQSYEISRDLGETILSLDRIP--LIIAS 178

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
           SD  H+ S      YDK +       E  + +  DII     D F   + +Y  T CG  
Sbjct: 179 SDLNHYES------YDKNN-------EKDEIIINDIINLRIKDFFND-IYKYRITACGFG 224

Query: 244 PISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            I+V +++      KI +  LR+  S      R   V YA+  A
Sbjct: 225 AIAVLMYITKKNHGKIAL--LRHANSGDASGDRKRVVGYAAMLA 266


>gi|410670088|ref|YP_006922459.1| hypothetical protein Mpsy_0882 [Methanolobus psychrophilus R15]
 gi|409169216|gb|AFV23091.1| hypothetical protein Mpsy_0882 [Methanolobus psychrophilus R15]
          Length = 265

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 125/284 (44%), Gaps = 26/284 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y    K L +EL    ++  + K   + G + PHAGY YSG  AA+A+  
Sbjct: 1   MRQTTVAGQFYPRGPKDLKKELSRCFKDIEI-KPRNIIGAVVPHAGYVYSGEVAAHAYAL 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           + P + + VF  GP+H  Y    ALS  T + TP+G +  D E+ + L   G     D  
Sbjct: 60  L-PKADTYVF-FGPNHTGYGSAVALSQDT-WVTPLGVVDTDRELGKLL--AGSIVDYDEV 114

Query: 124 VDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FF 180
               EHS+E+ +P+L   F EG   KI+P+ +G  + E     G   A+          F
Sbjct: 115 AHRFEHSIEVQIPFLQHRFAEG--FKILPVCMGLQDEETAVEIGVEVARAARASGKKVIF 172

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSDF H+ S       D  H +I   I       +DI E      F +   E + T C
Sbjct: 173 IASSDFTHYQSEEKAS--DNDHYLIDPVIM------IDIPE------FYRRREERNITAC 218

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           G  PI+  +   G      K   L+Y  S      R   V YA+
Sbjct: 219 GFGPIAATMAAAGEYGAA-KASLLKYATSGDVTGDRSQVVGYAA 261


>gi|407461723|ref|YP_006773040.1| hypothetical protein NKOR_00910 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407045345|gb|AFS80098.1| hypothetical protein NKOR_00910 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 275

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 106/243 (43%), Gaps = 20/243 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-----KSPEVRGVIAPHAGYSYSGRAAA 58
           IR+   AG +Y +    L E +D  L+    P     K  +V G+I PHAGY YSG  A 
Sbjct: 2   IRKPVVAGQFYPETKDELEEMIDSCLQHKYGPGNQIQKDEKVYGIICPHAGYVYSGPTAC 61

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +++  I   +   V ++GP+H       A      ++TP+G + +D E  +E+    K+ 
Sbjct: 62  HSYKAISSKNPELVIIIGPNHFGIGKDVATMIDAQWETPLGLVEVDSEAAQEIANNSKYI 121

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            +D      +HS+E+ +P L         KI+PI++   + E     G   A+     + 
Sbjct: 122 EIDEFSHSKDHSLEVQIPMLQSTLSNKF-KILPIILRDQSLEMAKDVGNAVAQIAKSRNA 180

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSDF H+    N   Y +   +I   +E      M++      + F   L+E   T
Sbjct: 181 MIIASSDFTHY--EENSFAYSQDKALIEPILE------MNV------EKFYSVLMEKKIT 226

Query: 239 ICG 241
            CG
Sbjct: 227 ACG 229


>gi|149922583|ref|ZP_01911012.1| hypothetical protein PPSIR1_06693 [Plesiocystis pacifica SIR-1]
 gi|149816539|gb|EDM76035.1| hypothetical protein PPSIR1_06693 [Plesiocystis pacifica SIR-1]
          Length = 270

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 10/191 (5%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           + R A+ AG +Y      LA  LD  L     A    +   + +I PHAGY YSG  AA 
Sbjct: 6   RTRPAALAGRFYPGRPGELARTLDAMLDAASRARAAPARRPKALIVPHAGYVYSGPIAAS 65

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
            +  +    + +V LLGP+H  Y    AL  A  + TP+G++P+D  +   L+    F  
Sbjct: 66  GYAQLRGHGVDKVVLLGPAHRVYVEGLALPGADRFATPLGEVPVDAALAAALRPL-PFVE 124

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDDPSN 178
                   EHS+E+HLP+L  V +G    ++P++VG    E   A+  R++      P  
Sbjct: 125 TSAEAHALEHSLEVHLPFLQAVLDGPF-SLLPLVVGGATPEQVSAVLERVWGG----PET 179

Query: 179 FFSVSSDFCHW 189
              +SSD  H+
Sbjct: 180 LVVISSDLSHF 190


>gi|121998650|ref|YP_001003437.1| hypothetical protein Hhal_1871 [Halorhodospira halophila SL1]
 gi|121590055|gb|ABM62635.1| protein of unknown function DUF52 [Halorhodospira halophila SL1]
          Length = 268

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 95/193 (49%), Gaps = 14/193 (7%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAY 59
           K+R  + AG +Y   ++ L   +   L EAG P+    R    ++ PHAGY +SG AAA 
Sbjct: 7   KVRPPAVAGRFYPGEAETLRRAVRELLAEAGEPERDPTRAPHAMVLPHAGYPFSGAAAAR 66

Query: 60  AFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGK 116
            +  I P    +  V LLGP+H       AL  A    TP+G +P+   + E  L+  G 
Sbjct: 67  GYQRIVPIREQLRHVVLLGPAHFVDLSGIALPAADALATPLGTVPVSATLRERALEHPGV 126

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
              +D    E EHS+E+HLP+L  + +     ++P++VG   AE+    GRL  +   D 
Sbjct: 127 H--IDDSAHEREHSLEVHLPFLQTLLDD--FDVLPLVVGRGPAES---CGRLIEQLWQD- 178

Query: 177 SNFFSVSSDFCHW 189
                VSSD  H+
Sbjct: 179 DTLVVVSSDLSHF 191


>gi|451947838|ref|YP_007468433.1| putative dioxygenase [Desulfocapsa sulfexigens DSM 10523]
 gi|451907186|gb|AGF78780.1| putative dioxygenase [Desulfocapsa sulfexigens DSM 10523]
          Length = 273

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 83/288 (28%), Positives = 129/288 (44%), Gaps = 32/288 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLRE-AG---LPKSPE-VRGVIAPHAGYSYSGRAAA 58
           IR  + AG +Y  +    AE L   +RE AG   LP + + V   ++PHAGY YSG  AA
Sbjct: 5   IRSPAVAGRFYPGD----AESLRLAIRELAGNHPLPATKQKVLAALSPHAGYVYSGGVAA 60

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
              G ++      V LLGP+H       ALSTAT +  P+G +P++    ++L       
Sbjct: 61  ETLGRVEVP--ETVILLGPNHTGKGAPVALSTAT-WNMPMGSVPVNRHFAQDLLEATDII 117

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDP 176
             D      EHS+E+ +P+L  V     + IVPI +  V+ +     G   A+ +  +D 
Sbjct: 118 EEDELAHTYEHSLEVQIPFLQMVQPN--LSIVPITISHVSYQILNELGLALAEVIKQNDS 175

Query: 177 SNFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
            N   V S+D  H+  R      DKK   + K I  +           DP+   + ++ +
Sbjct: 176 GNILIVASTDMTHYEPREA---ADKKDHYVLKKIADM-----------DPNILYRSVIGH 221

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           + ++CG  P++V L          K + +RY  S +        V YA
Sbjct: 222 NISMCGIMPVTVALIAALELGAT-KTELVRYTDSGEISGDTKQVVGYA 268


>gi|308273082|emb|CBX29686.1| hypothetical protein N47_J06670 [uncultured Desulfobacterium sp.]
          Length = 493

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 129/292 (44%), Gaps = 36/292 (12%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRA 56
           + +R  S+AGS+Y      L + ++    +A      +P    ++ +I PHAGY YSG  
Sbjct: 27  DNVREPSYAGSFYPSAKYELEKTINDLTYQAKKTAFTVPSGKTLKALILPHAGYIYSGLT 86

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           A++A         S+V ++GP H      C LS A  Y+TP+G + +  +  +       
Sbjct: 87  ASHASLVFSENQFSKVIIMGPDHRVGFNNCVLSDADAYETPLGYVRIHKDAAKLRSEKEL 146

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN--AENEAMYGRLFAKYVD 174
           F  + +  D  EHS+E+ LP+L    +    +I+P+++G+ +  A  EA+   L      
Sbjct: 147 FRSLPVS-DINEHSIEVVLPFLQTYLKN--FEIIPVVIGSADYHAIAEAIDPLL------ 197

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
           D       SSD  H      Y+ Y        +++E  D   +++I   + +   K    
Sbjct: 198 DNKTLLVASSDLSH------YLKY-------RQAVEK-DNETINMIVNLENNKLSK---- 239

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            DN+ CG+ PI   + + G    K  +  + Y  S +   +R   V YA  A
Sbjct: 240 LDNSACGKIPILAIMDLAGKYGWKPVL--VHYSNSGETAGERSRVVGYAVIA 289


>gi|409912350|ref|YP_006890815.1| MEMO-like protein [Geobacter sulfurreducens KN400]
 gi|298505941|gb|ADI84664.1| MEMO-like protein [Geobacter sulfurreducens KN400]
          Length = 267

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 72/257 (28%), Positives = 115/257 (44%), Gaps = 40/257 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +YTD+ +RL EEL   +R    P+   V GVIAPHAGY YSG  A   +G+
Sbjct: 2   VRQPAVAGQFYTDDPRRLREELGHLIRPVPAPR--RVTGVIAPHAGYMYSGAIAGAVYGS 59

Query: 64  IDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           I    I R V +LGP+HH      +L     + +P+G++P++  +   +         D+
Sbjct: 60  I---VIPRTVVILGPNHHGLGAAASLYPDGTWLSPLGEVPIEQRLSSLVLEHVAQAEPDV 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG----------RLFAKY 172
                EHS+E+ +P+L   +    + IVP+ +G      +  YG          R  A Y
Sbjct: 117 IAHRFEHSLEVQVPFLR--YLNSDVAIVPMCLG------DGGYGWCRQVGEGLARAIAAY 168

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
            ++       SSD  H+ S               +S    D+  +  +   D +   K  
Sbjct: 169 GEE--VLIVASSDMTHYES--------------AESARLKDEAALSCVLALDAEGLLKVC 212

Query: 233 LEYDNTICGRHPISVFL 249
            +   T+CG  P +V L
Sbjct: 213 RQRGITMCGVIPSTVML 229


>gi|188996255|ref|YP_001930506.1| hypothetical protein SYO3AOP1_0307 [Sulfurihydrogenibium sp.
           YO3AOP1]
 gi|188931322|gb|ACD65952.1| protein of unknown function DUF52 [Sulfurihydrogenibium sp.
           YO3AOP1]
          Length = 265

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/198 (29%), Positives = 102/198 (51%), Gaps = 20/198 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAY 59
           M  +R+ + A  +YT N +RL   +  +L +A L    PE  G++ PHAGY YSG  AA 
Sbjct: 1   MLTVRKPAVANMFYTGNKERLLYTIKNYLNKAPLYDYIPE--GIVVPHAGYIYSGPVAAV 58

Query: 60  AFG---NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATG 115
           ++    N+DP+   ++ L+GPSHH Y    +      ++TP+G + ++ E I +  K   
Sbjct: 59  SYKQLLNLDPSKHYKILLIGPSHHVYFNGVSYGFYDYWETPLGKVKVNKEAIIKFSKEHP 118

Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA---ENEAMYGRLFAK 171
            F L ++    + EHS+E+ +P+L  V +    +I+P + G +++   EN         +
Sbjct: 119 DFPLTLNTLPHQREHSLEVQVPFLQVVLKD--FEILPFVYGDIDSSIVEN-------IIE 169

Query: 172 YVDDPSNFFSVSSDFCHW 189
           Y  D      +S+D  H+
Sbjct: 170 YFKDNDTVVIISTDLSHY 187


>gi|386001676|ref|YP_005919975.1| hypothetical protein Mhar_0982 [Methanosaeta harundinacea 6Ac]
 gi|357209732|gb|AET64352.1| hypothetical protein Mhar_0982 [Methanosaeta harundinacea 6Ac]
          Length = 260

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 125/282 (44%), Gaps = 30/282 (10%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
           AG +Y    ++L EEL      A   +   +RG + PHAGY YSG  AA  +G + P   
Sbjct: 2   AGQFYPLQPQKLLEELKAAFSGASGGEPLPIRGAVVPHAGYIYSGAVAAEVYGRL-PERE 60

Query: 70  SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
           + V L+GP+HH      A+S  + + TP+G +  D+E+ + L   G    +D      EH
Sbjct: 61  TFV-LIGPNHHGLGLPVAISRDS-WMTPLGTVESDVELADAL--AGSILEVDESAHLYEH 116

Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV--SSDFC 187
           S+E+ +P+L   F G   KI+PI +G  + E     G    +         ++  SSDF 
Sbjct: 117 SLEVQIPFLQARFSG--FKILPIAMGLQDEETAVEVGEAVGEAAKSLGRDCTIIASSDFT 174

Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
           H+  +      D       ++IEA+ +M        D  +  + +   + T CG  PIS 
Sbjct: 175 HYEPQEEARRKDA------QAIEAILRM--------DVPSVYRAVYGQNLTACGYGPISA 220

Query: 248 FL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            +    +LG  + K+    LRY  S          V Y + A
Sbjct: 221 TITAARILGAETGKL----LRYSTSGDVLGDYSQVVGYGAIA 258


>gi|340344183|ref|ZP_08667315.1| dioxygenase [Candidatus Nitrosoarchaeum koreensis MY1]
 gi|339519324|gb|EGP93047.1| dioxygenase [Candidatus Nitrosoarchaeum koreensis MY1]
          Length = 277

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/289 (27%), Positives = 123/289 (42%), Gaps = 24/289 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSP------EVRGVIAPHAGYSYSGR 55
           +IR  + AG +Y +  K L + + + +L   G  K P      ++ GVI PHAGY YSG 
Sbjct: 2   QIRTPAVAGMFYPNEKKELKKVIKECFLHNFGPGKIPPSNIKKKIFGVICPHAGYVYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            A  +F  I     +   ++GP+H       A  T T ++TP+G++ ++ E+ EE+    
Sbjct: 62  IACNSFYEISSDLPNLFIIIGPNHWGIGSSVATMTDTKWETPLGEIEVNSEIAEEISKLT 121

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
               +D      EHS+E+ +P L ++      KIVPI +   + E     G   AK    
Sbjct: 122 DVIEIDNFSHSREHSLEVQIPILQEIAAN--FKIVPIALINQSKEIAIKVGIAVAKIAQK 179

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
                  SSDF H+ S  N   Y++   +I   +E             D D F   L + 
Sbjct: 180 NKVMIIGSSDFTHYES--NKFAYEQDAALIEPILEL------------DVDRFYDILHKK 225

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           D + CG   I+  +          K + L+Y  S      + S V Y S
Sbjct: 226 DISACGYGAIASTMIACKEIGAT-KGELLKYATSGDISGDKSSVVGYGS 273


>gi|383788631|ref|YP_005473200.1| hypothetical protein CSE_09710 [Caldisericum exile AZM16c01]
 gi|381364268|dbj|BAL81097.1| hypothetical protein CSE_09710 [Caldisericum exile AZM16c01]
          Length = 264

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 133/287 (46%), Gaps = 33/287 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF- 61
           KIR  + AG++Y ++ + L   +  ++  A L  +  ++ +IAPHAGY YSG  A Y++ 
Sbjct: 2   KIRDTAVAGAFYPEDREELKSMIRRFVESAPLEDTDNIKAIIAPHAGYIYSGPIAGYSYK 61

Query: 62  --GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
              NID     +V ++ PSH+ Y    ++     YKTP+G + +  +  + L+      +
Sbjct: 62  QLTNIDYLKNIKVIIIAPSHYAYFHGASVGLFDAYKTPLGFVNVSKDAQKLLQLEEFHFI 121

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           +D  ++  EHS+E+ LP+L         +IVPIL   + +E   + G L    + D +  
Sbjct: 122 LDAHLE--EHSIEVQLPFLQYTLSH--FEIVPILYSEI-SEESLLRGIL---SIFDENTI 173

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
             VS+D  H+   + Y    KK     K+++ L+               KK++L  +   
Sbjct: 174 LVVSTDLSHY---YPYDIAIKKDSHCIKAVQELN---------------KKHILNCEA-- 213

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           CG+  I+  +      +  +K K L+Y  S      +   V Y SA 
Sbjct: 214 CGKTGIATVIDFAK--ANNLKSKVLKYATSGDTAGPKTQVVGYLSAV 258


>gi|304390953|ref|ZP_07372905.1| conserved hypothetical protein [Mobiluncus curtisii subsp. curtisii
           ATCC 35241]
 gi|304325836|gb|EFL93082.1| conserved hypothetical protein [Mobiluncus curtisii subsp. curtisii
           ATCC 35241]
          Length = 488

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 102/218 (46%), Gaps = 28/218 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
           M  +R+ + AGS+Y  +   L + +D  L++A             G P   ++ GV  PH
Sbjct: 1   MSFVRKPAVAGSFYPQDPGELQKTVDELLKKATALITNDVSRDSQGRPLR-QLMGVQVPH 59

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           AGY YSG  AA  F  +      RV + GP+H       ALS AT + TP+G++P+D   
Sbjct: 60  AGYIYSGLIAAVGFQLVSTLRPRRVIIAGPTHRVGCHGIALSEATAFATPLGEVPVDQVA 119

Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
            + + E  +  KF      VD  EH++E+ LP+L  VF      IVP+ VG    E+  +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDVDFAIVPMAVGDCTPEDVCD 176

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
           ++   L     +       VSSD  H      Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLGYDE 208


>gi|15668579|ref|NP_247377.1| hypothetical protein MJ_0403 [Methanocaldococcus jannaschii DSM
           2661]
 gi|2495960|sp|Q57846.1|Y403_METJA RecName: Full=MEMO1 family protein MJ0403
 gi|1591108|gb|AAB98390.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
           2661]
          Length = 287

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/304 (26%), Positives = 137/304 (45%), Gaps = 33/304 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
           M KIR  + AG +Y  +   L + ++  +L + G PKS  V G       ++ PHAGY Y
Sbjct: 1   MNKIRYPAVAGLFYPSHPDELIDMIEQCYLHKFG-PKSMPVHGTYEKPIGLLCPHAGYVY 59

Query: 53  SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           SG   A+++      +D    + V +LGP+H       ++    +++TP+GD+  D E +
Sbjct: 60  SGPIQAHSYYELSKRVDALEETTVVILGPNHTGLGSGVSVMDG-IWRTPLGDVKCDEEFV 118

Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
           EEL    +   +D      EHS+E+ LP+L   ++      KIVPI +   + E     G
Sbjct: 119 EELWRKCEIVDLDETAHLNEHSIEVQLPFLKHLELLNIAKFKIVPICMMFQDYETAVEVG 178

Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
              AK   + +    V  SSD  H+  +       KK  ++ K          DI+E  +
Sbjct: 179 YFIAKIAKELNRRIVVIASSDLTHYEPQ---EIASKKDAIVIK----------DILEMNE 225

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            + ++  ++ Y+ ++CG  P+   L  +     + K K L Y  S        + V YAS
Sbjct: 226 KELYED-VVNYNISMCGYGPVIAMLKAMKTLGAE-KAKLLAYATSGDITGDYSAVVGYAS 283

Query: 285 AAAK 288
           A  +
Sbjct: 284 AIVE 287


>gi|296109608|ref|YP_003616557.1| protein of unknown function DUF52 [methanocaldococcus infernus ME]
 gi|295434422|gb|ADG13593.1| protein of unknown function DUF52 [Methanocaldococcus infernus ME]
          Length = 287

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 79/297 (26%), Positives = 135/297 (45%), Gaps = 30/297 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEV-----RGVIAPHAGYSYSGRAA 57
           +R  + AG +Y  N + L E ++  +L E G  K P +      G++ PHAGY YSG   
Sbjct: 1   MRYPAVAGLFYPSNPEELLEMIESCYLHELGPKKLPSIGSYSILGLVCPHAGYIYSGPIQ 60

Query: 58  AYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           A+++  +    DP+      +LGP+H       ++S A +++TP+GD+  D E I+ L  
Sbjct: 61  AHSYLALSKHADPSEEITAVILGPNHTGLGSGVSVS-ADIWRTPLGDMEPDKEFIDLLWR 119

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
             +   +D      EHS+E+ LP+L   ++ E    KIVPI +   + E     G   AK
Sbjct: 120 ECEIIDLDETAHLQEHSIEVQLPFLKHLELLEIAKFKIVPICMMFQDYETAIEVGYFIAK 179

Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
              + +    V  SSD           HY+ +   + K     D++ ++ I   D     
Sbjct: 180 VAKELNRRVVVIASSDLS---------HYEPQEIAVEK-----DRIVINDIIKMDEKRLY 225

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           + +++Y+ ++CG  P+   +  L     + + K L Y  S        + V Y+S A
Sbjct: 226 EDVVKYNISMCGYGPVMAMIRALKELGGE-RAKLLAYATSGDVTGDYSAVVGYSSIA 281


>gi|14591400|ref|NP_143479.1| hypothetical protein PH1626 [Pyrococcus horikoshii OT3]
 gi|3258055|dbj|BAA30738.1| 294aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 294

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 81/296 (27%), Positives = 135/296 (45%), Gaps = 11/296 (3%)

Query: 1   MEKIRR-ASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAA 58
           M K+RR  + AG +Y +    L E L  + ++ G   +   +   +APHAGY +SG  A+
Sbjct: 1   MAKMRRYPAVAGQFYPEGDA-LIEMLSSFFKDLGEEGTKRTITAGVAPHAGYVFSGFTAS 59

Query: 59  YAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
             +  I    +  VF++ GP+H       AL     + TP+G + +D +  +E+      
Sbjct: 60  RTYKAIYEDGLPEVFVIFGPNHTGLGSPIALYPEGEWITPMGSIKVDSKFAKEIVKRSGI 119

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGR-LF-AKYV 173
             +D    + EHS+E+ LP++  + E  G  +KIVPI +G  + E     GR +F A   
Sbjct: 120 ADLDDLAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDEEVSRSLGRSIFEASTS 179

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
                    S+DF H+GS + Y+ +  +   +   +   D   +  I   D D     + 
Sbjct: 180 LGRDTIIIASTDFMHYGSFYGYVPFRGRPEELPNMVRDWDMRIIRRILDFDLDGMFSEIR 239

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           E ++T+CG  P  V   ++ +   K K  + L Y  S +     D+ V YAS   K
Sbjct: 240 EMNHTMCG--PGGVGAGIIYSRLMKAKEAELLHYTTSFEVSRSPDAIVGYASIIMK 293


>gi|315656270|ref|ZP_07909161.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii
           ATCC 35242]
 gi|315493272|gb|EFU82872.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii
           ATCC 35242]
          Length = 488

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 101/218 (46%), Gaps = 28/218 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
           M  +R+ + AGS+Y  +   L   +D  L++A             G P   ++ GV  PH
Sbjct: 1   MSFVRKPAVAGSFYPQDPGELQRTVDELLKKATALITNDVSRDSQGRPLR-QLMGVQVPH 59

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           AGY YSG  AA  F  +      RV + GP+H       ALS AT + TP+G++P+D   
Sbjct: 60  AGYIYSGLIAAVGFQLVSTLRPRRVIIAGPTHRVGCHGIALSEATAFATPLGEVPVDQVA 119

Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
            + + E  +  KF      VD  EH++E+ LP+L  VF      IVP+ VG    E+  +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDIDFAIVPLAVGDCTPEDVCD 176

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
           ++   L     +       VSSD  H      Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLAYDE 208


>gi|304314295|ref|YP_003849442.1| hypothetical protein MTBMA_c05340 [Methanothermobacter marburgensis
           str. Marburg]
 gi|302587754|gb|ADL58129.1| conserved hypothetical protein [Methanothermobacter marburgensis
           str. Marburg]
          Length = 280

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 129/294 (43%), Gaps = 35/294 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGW-----LREAGLP---KSPEVRGVIAPHAGYSYSGRA 56
           RR + AG++Y  +   L   ++ W     L   GLP    + +++GVIAPHAGY YSG  
Sbjct: 3   RRPAVAGAFYERDPAALRRRIE-WCFNHELGPGGLPARGSARKIKGVIAPHAGYMYSGPV 61

Query: 57  AAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           AA+A+  +    I   F ++ P+H       +L     ++TP+G + +D E+ E +    
Sbjct: 62  AAHAYHELVSDGIPETFVIICPNHTGMGSGVSLMQRGAWETPLGVVDIDSELAEVIVRES 121

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD- 174
               +D      EHS E+H+P++ + F     +IVPI +   + E     G   A  ++ 
Sbjct: 122 GIIDIDGTAHLGEHSCEVHVPFI-QYFTDRF-RIVPITMWMQDQETATDVGHAVAAAIER 179

Query: 175 -DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
            +       S+DF H      Y   D    +  + IE + +M        D       + 
Sbjct: 180 TERDAVVIASTDFTH------YSPADVAGSIDSRIIERITEM--------DDTGMYGVIT 225

Query: 234 EYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           E + T+CG  P++  +    MLG      + + L+Y  S      + S V YAS
Sbjct: 226 ELNATMCGYGPVAASITASRMLGAS----ECRLLKYATSGDITGDQSSVVGYAS 275


>gi|432329928|ref|YP_007248071.1| putative dioxygenase [Methanoregula formicicum SMSP]
 gi|432136637|gb|AGB01564.1| putative dioxygenase [Methanoregula formicicum SMSP]
          Length = 263

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/294 (27%), Positives = 124/294 (42%), Gaps = 44/294 (14%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           K+R  + AG +Y      L + L+ +       AG P      G+++PHAGY YSG+ AA
Sbjct: 2   KMRACAVAGMFYPREPSHLEQLLEKFFSAVPDVAGRPL-----GIVSPHAGYIYSGQVAA 56

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
            AFG I P       ++GPSH  Y   C   +   ++TP+G +  D E +  L       
Sbjct: 57  TAFGAIPPDFSGTFVIIGPSHRGYI-TCV--SEVPWETPLGVVDTDTEFVRAL------- 106

Query: 119 LMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG-RLFAKYV 173
             D+  DE     EHS+E+ +P++   F     +I P+++G  +       G ++FA   
Sbjct: 107 --DMETDELSHRDEHSIEVQVPFIKYRFP--RARIAPVMMGQQDYPGAVRLGEKIFAAIR 162

Query: 174 DDPSNFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
               +   V SSDF H+              V  K  +  D   +D + + D   F + +
Sbjct: 163 RTKRDVRIVASSDFSHY--------------VPEKKAKEEDLFAIDALRSLDTREFYRRI 208

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            E   T CG  PI+  +        K   + +RY  S +        V YAS A
Sbjct: 209 AEKRITACGYGPITAMVTACTGLGAK-AAELIRYATSGETTGDYREVVGYASIA 261


>gi|375082610|ref|ZP_09729663.1| hypothetical protein OCC_01034 [Thermococcus litoralis DSM 5473]
 gi|374742688|gb|EHR79073.1| hypothetical protein OCC_01034 [Thermococcus litoralis DSM 5473]
          Length = 292

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/291 (24%), Positives = 132/291 (45%), Gaps = 8/291 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR  + AGS+Y    + L   LD +  + G L +  ++   +APHAGY +SG  A+  + 
Sbjct: 2   IRYPAVAGSFYPSGGE-LKLMLDEFFSDLGELGEKRKITSGVAPHAGYIFSGYTASRTYK 60

Query: 63  NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I   S+   F ++GP+H       A+    +++TP+GD+ +D E+ + +        +D
Sbjct: 61  AIYEDSLPETFVIIGPNHTGLGSPVAIYPEGIWRTPMGDVEVDAELAKTIAKHSSLADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
               + EHS+E+ +P++  + E     +KIVPI +G  + E     GR   +   +    
Sbjct: 121 DFAHKYEHSLEVQVPFIQYISEKAEKEVKIVPIALGLQDEEVAEDLGRAIFEASQELGRD 180

Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
             V  S+D  H+G  + Y+ +  +   +   I   D   +  I   D       + + D+
Sbjct: 181 VVVIASTDMMHYGYAYGYVPFRARGDDLLGRIREWDFRVIQKILEFDYKGMFDEIRKMDH 240

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           T+CG   ++  + +    S  ++ + L Y  S +     D+ V Y S   +
Sbjct: 241 TMCGPGGVATAI-VFSRLSGAVEAEVLHYTTSFEVSRSTDAIVGYVSIVMR 290


>gi|297625218|ref|YP_003686981.1| hypothetical protein PFREUD_00190 [Propionibacterium freudenreichii
           subsp. shermanii CIRM-BIA1]
 gi|296920983|emb|CBL55520.1| Hypothetical protein PFREUD_00190 [Propionibacterium freudenreichii
           subsp. shermanii CIRM-BIA1]
          Length = 265

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/214 (31%), Positives = 103/214 (48%), Gaps = 15/214 (7%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SP-EVRGVIAPHAGYSYSGRAAAYA 60
           R  + AGS+Y     RL  ELD    +A  P    SP  V+ ++ PHAGY YSG  AA  
Sbjct: 4   RPTAVAGSFYPGQRGRLTRELDTLFEQADRPPAQVSPARVKAIVVPHAGYVYSGTTAATG 63

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  +    I+RV +LGP+H       AL+ A  + TP+G +P+D ++    +A     + 
Sbjct: 64  YELLRGRPINRVVVLGPTHRVGIRGMALAGADAFDTPLGSVPVDPDLTAIAEAV-PLVVT 122

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPSNF 179
              V   EHS+E+ LP++  V       ++P+ VG A+  E  A+   ++ +        
Sbjct: 123 RPDVHAREHSLEVQLPFIRTVLP--QASVLPVAVGDALPDEVAALLDAVWGES----DTA 176

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD 213
             +SSD  H+ S  +    D   G I K + ALD
Sbjct: 177 IVISSDLSHYHSYDDARKLDA--GTIDK-VLALD 207


>gi|18978010|ref|NP_579367.1| hypothetical protein PF1638 [Pyrococcus furiosus DSM 3638]
 gi|397652713|ref|YP_006493294.1| hypothetical protein PFC_10415 [Pyrococcus furiosus COM1]
 gi|23822355|sp|Q8U0F2.1|Y1638_PYRFU RecName: Full=MEMO1 family protein PF1638
 gi|18893791|gb|AAL81762.1| hypothetical protein PF1638 [Pyrococcus furiosus DSM 3638]
 gi|393190304|gb|AFN05002.1| hypothetical protein PFC_10415 [Pyrococcus furiosus COM1]
          Length = 292

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 132/291 (45%), Gaps = 8/291 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IRRA  AG +Y D+++ L E L  +  + G    S  +   +APHAGY +SG  A+  + 
Sbjct: 2   IRRAVVAGQFYPDDAE-LVEMLKRFFTDLGEEGNSRRITAGVAPHAGYIFSGYTASRTYK 60

Query: 63  NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I    +  VF +LGP+H       A+     ++TP+G + +D ++   +    +   +D
Sbjct: 61  AIYEDGLPEVFVILGPNHTGLGSPIAVYPKGEWETPLGRIKVDEKLARRITELSEIADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
               + EHS+E+ LP++  + E  G  +KIVPI +G  + E     G+   +   +    
Sbjct: 121 DLAHKYEHSIEVQLPFIQYLAELSGKDVKIVPITLGIQDEEVSYALGKAIYEASQELGRD 180

Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
             V  S+DF H+G  + Y+ +  +   +   ++  D   +  I   D +   + +   ++
Sbjct: 181 IVVIASTDFMHYGEFYGYVPFRARADELPNLVKEWDMRVIRRILDFDVEGMFEEINAMNH 240

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           T+C          +    +  I+ + L Y  S +     D+ V YAS   +
Sbjct: 241 TMC-GPGGVGVGIVYSKLAGAIEAELLHYTTSFEVSRSTDAIVGYASIVMR 290


>gi|223478425|ref|YP_002582779.1| phosphomevalonate decarboxylase [Thermococcus sp. AM4]
 gi|214033651|gb|EEB74478.1| phosphomevalonate decarboxylase [Thermococcus sp. AM4]
          Length = 291

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/291 (24%), Positives = 133/291 (45%), Gaps = 8/291 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           IR  + AGS+Y  + + L   L+ +  + G   S   +   +APHAGY +SG  A+  + 
Sbjct: 2   IRYPAVAGSFYPADEE-LVLMLERFFSDLGEEGSERRITAGVAPHAGYIFSGYTASRTYK 60

Query: 63  NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I    +   F +LGP+H       A+     + TP+G + +D E+ +E+        +D
Sbjct: 61  AIFEDGLPETFVILGPNHTGLGSPIAVYPEGEWLTPLGSIEVDAEMAKEIAKLSGIADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
               + EHS+E+ +P++  + E  G  ++IVPI +G  + E     G+   +  ++    
Sbjct: 121 DLAHKYEHSIEVQVPFIQYLAELAGKEVRIVPITLGIQDEEVSRALGKAIFEASEELGRD 180

Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
             V  S+DF H+G  + Y+ +  +   +   I+  D   +  I   D D   + L E  +
Sbjct: 181 VVVIASTDFMHYGPAYGYVPFRARADELPHRIKEWDFRLIRRILDFDVDGLFRELREMRH 240

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           T+CG   +   + +    +  ++ + L Y  S +     ++ V YAS   +
Sbjct: 241 TMCGPGAVGTAI-VYSRLAGALEAELLHYTTSYEVSRSTEAIVGYASIVMR 290


>gi|298345398|ref|YP_003718085.1| dioxygenase [Mobiluncus curtisii ATCC 43063]
 gi|298235459|gb|ADI66591.1| dioxygenase [Mobiluncus curtisii ATCC 43063]
          Length = 488

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 101/218 (46%), Gaps = 28/218 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
           M  +R+ + AGS+Y  +   L   +D  L++A             G P   ++ GV  PH
Sbjct: 1   MSFVRKPAVAGSFYPQDPGELQRTVDELLKKATALITNDVSRNSQGRPLR-QLMGVQVPH 59

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           AGY YSG  AA  F  +      RV + GP+H       ALS AT + TP+G++P+D   
Sbjct: 60  AGYIYSGLIAAVGFQLVSTLRPRRVIIAGPTHRVGCHGIALSEATAFVTPLGEVPVDQVA 119

Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
            + + E  +  KF      VD  EH++E+ LP+L  VF      IVP+ VG    E+  +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDVDFAIVPMAVGDCTPEDVCD 176

Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
           ++   L     +       VSSD  H      Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLGYDE 208


>gi|325969093|ref|YP_004245285.1| Memo domain containing protein [Vulcanisaeta moutnovskia 768-28]
 gi|323708296|gb|ADY01783.1| Memo domain containing protein [Vulcanisaeta moutnovskia 768-28]
          Length = 283

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 79/297 (26%), Positives = 132/297 (44%), Gaps = 31/297 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPE-----VRGVIAPHAGYSYS 53
           +R+ + AG++Y  +  +L ++++ W     L    L + P      V  VI PHAGY YS
Sbjct: 1   MRKPAVAGAFYEADKDKLIKQIE-WSISHSLGPGQLIRQPREGYKAVPIVIVPHAGYIYS 59

Query: 54  GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA +F  I      + F+L GP+H+      A+     ++TP+G + +D +V +EL 
Sbjct: 60  GPVAAMSFVEIYRFHKPKTFILIGPNHYGVGAPVAIYPEGTWETPLGVVEVDSDVTKELM 119

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
           +  K+   D+     EHS+E+ LP++  +F G+ +KIVPI++     E     G   A  
Sbjct: 120 SRVKYLEPDVYAFTQEHSIEVQLPFIQYIF-GNDVKIVPIIIWRQTKEVARDLGNAIADV 178

Query: 173 VDD---PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
           +      S  +  SSD+ H+                H+     D   +D +   D D+F 
Sbjct: 179 ISGHELGSIVYVASSDWNHYEP--------------HEITTEKDMRAIDPVLRLDEDSFF 224

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
             +  YD + CG   I+  +        +  +  LR+  S         +V YAS A
Sbjct: 225 DAIERYDVSACGYGAIATAIIAAKKLGVR-NVILLRHATSGDTSGYTLETVGYASIA 280


>gi|257094199|ref|YP_003167840.1| hypothetical protein CAP2UW1_2625 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
 gi|257046723|gb|ACV35911.1| protein of unknown function DUF52 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
          Length = 272

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 95/189 (50%), Gaps = 10/189 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R A+ AG +Y  +S  LA +L   L ++   +    + VI PHAGY YSG  AA  +  
Sbjct: 8   VRPAAVAGMFYPGSSSVLARDLRDLLNDSRPAQGRRPKAVIVPHAGYIYSGPIAASVYAP 67

Query: 64  IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           I     S+ RV LLGP+H       A   +T + TP+G +P+D + +  ++A  +  + D
Sbjct: 68  IAALRESVRRVVLLGPTHRVAVDGLAFPASTAFATPLGVVPVDAQALALIEALPQVVVSD 127

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPSNFF 180
                 EHS+E+ LP+L  V +     +VP+ VG A  AE   +   L+    D+     
Sbjct: 128 -AAHALEHSLEVQLPFLQTVLDE--FTLVPLAVGRASPAEVAEVLECLWGG--DE--TLI 180

Query: 181 SVSSDFCHW 189
            +SSD  H+
Sbjct: 181 VISSDLSHY 189


>gi|212223278|ref|YP_002306514.1| hypothetical protein TON_0132 [Thermococcus onnurineus NA1]
 gi|229557467|sp|B6YST0.1|Y132_THEON RecName: Full=MEMO1 family protein TON_0132
 gi|212008235|gb|ACJ15617.1| hypothetical protein, conserved, DUF52 family [Thermococcus
           onnurineus NA1]
          Length = 291

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 132/287 (45%), Gaps = 8/287 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
           +R  + AGS+Y  + + L E L+ + R+ G   S  ++   + PHAGY +SG  A+  F 
Sbjct: 2   VRYPAVAGSFYPGD-ETLIEMLEKFFRDLGEHGSERKITAGVVPHAGYVFSGYTASRTFK 60

Query: 63  NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I    +   F +LGP+H       A+  +  + TP+G++ +D E+ + +        +D
Sbjct: 61  AIYEDGLPETFVILGPNHTGIGSPIAVYPSGSWLTPLGEIEVDSEMAKTIAKLSGIADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD--DPS 177
               + EHS+E+ LP++  + E     ++IVPI +G  + E     G+   +  +  D  
Sbjct: 121 ELAHKYEHSIEVQLPFIQYLAEKARTDVRIVPITLGIQDEEVVEDLGKAIYEAANELDRD 180

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                S+DF H+G  + Y+ +  +   +   ++  D   +  I   D       L + D+
Sbjct: 181 VVIIASTDFMHYGPAYGYVPFRARADELPHRVKEWDFRVIQKILDFDVKGMFGELRKMDH 240

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           T+CG   +   + +    +  ++ + L Y  S +     D+ V YAS
Sbjct: 241 TMCGPGGVGTAI-VYSRLAGALEAELLHYTTSFEVSRSTDAIVGYAS 286


>gi|371778140|ref|ZP_09484462.1| AMMECR1 domain-containing protein [Anaerophaga sp. HS1]
          Length = 475

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 13/270 (4%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           K R+ S AG +Y      L   L     EA  G+  + +V  +I PHAGY YSG  AA  
Sbjct: 4   KDRQPSVAGMFYAGEPVALQNHLKSLFEEAQPGVGDA-QVAALIVPHAGYLYSGGVAASG 62

Query: 61  FGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           F  I+  +    +FL+G SH       ++ T   Y TP+G + +D  + ++L  +  +  
Sbjct: 63  FAQINENAHYKTIFLIGSSHRMAFNGASVYTQGDYLTPLGRVDVDKALAQKLVESSPYIT 122

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
                 + EHS+E+ LP+L    +    K+VPI++G  +A    M       Y   P N 
Sbjct: 123 DIFAPHKDEHSLEVELPFLQYRLKNSF-KLVPIVMGPHDAVGARMVAEALKPYF-KPGNL 180

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           F +S+DF H      Y  Y+    V   +++A+ K   D++        K  +     ++
Sbjct: 181 FVISTDFSH------YPKYEDAKKVDAITVDAILKNDPDLLLNTLEQNRKLGVTNLVTSL 234

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
           CG   +   L++  +  T +  + + Y  S
Sbjct: 235 CGWTSVLTLLYITQD-DTCLDFRHIEYRNS 263


>gi|358637773|dbj|BAL25070.1| hypothetical protein AZKH_2764 [Azoarcus sp. KH32C]
          Length = 275

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 81/291 (27%), Positives = 124/291 (42%), Gaps = 42/291 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAA--YA 60
           IR A+ AG +Y D+ + L  ++   L  A  L      + ++ PHAGY YSG  AA  YA
Sbjct: 6   IRPAAVAGYFYPDDPRVLQMQISEMLSTAVPLEVVHAPKALVVPHAGYVYSGPVAASGYA 65

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
                   + +V +LGP+H       AL  +  + TP+G + L     + L+A       
Sbjct: 66  CATELRQRVRKVVMLGPAHRAEVRGFALPASQSFATPLGMVKLHHSDWQALQARP----- 120

Query: 121 DICVD----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA-MYGRLFAKYVDD 175
           D+ VD    E EH +E+ LP+L  V +    +IVPILVG    E  A + G+L+      
Sbjct: 121 DVVVDDRPHEFEHCLEVQLPFLQTVLDA--FEIVPILVGGATGEQVADLLGQLWGG---- 174

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P     +SSD  H      Y  YD+       +++ + +M              K +L++
Sbjct: 175 PETLILISSDLSH------YHTYDQARSTDRHAVDQVLRM--------------KPVLDH 214

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
               CG  PI+  +      +  ++   L    S      R   V YAS A
Sbjct: 215 QQA-CGATPINGLIR--AAAARGLEPHLLDLRNSGDTAGDRSRVVGYASIA 262


>gi|254489889|ref|ZP_05103084.1| conserved hypothetical protein [Methylophaga thiooxidans DMS010]
 gi|224464974|gb|EEF81228.1| conserved hypothetical protein [Methylophaga thiooxydans DMS010]
          Length = 262

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 102/204 (50%), Gaps = 20/204 (9%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           ME +R  + AG +Y  ++ +L +++   +           + +I PHAGY YSG  AA A
Sbjct: 1   MEIVRPNAVAGLFYPADAVKLNKQVAADMAACHADPLLSPKALIVPHAGYMYSGAVAASA 60

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           + ++ P    I RV L+GPSH       A+STA  ++TP+G + +D +   ++ A     
Sbjct: 61  YQHLKPLKNIIKRVVLIGPSHRVPFDGLAISTADWFETPLGLIAVDRQAESQIIAIDGV- 119

Query: 119 LMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
              I +++A   EHS+E+ LP+L  + +    KIVPI+ G  + +   +  ++  K    
Sbjct: 120 ---IALEQAHAREHSLEVQLPFLQFLLDD--FKIVPIVAGHASPK---VIAKVLDKLWGG 171

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYD 199
           P     +SSD  H      Y+ YD
Sbjct: 172 PETLIVISSDLSH------YLDYD 189


>gi|339483947|ref|YP_004695733.1| hypothetical protein Nit79A3_2567 [Nitrosomonas sp. Is79A3]
 gi|338806092|gb|AEJ02334.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Nitrosomonas sp. Is79A3]
          Length = 260

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 127/292 (43%), Gaps = 40/292 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR  + AG +Y  ++ +L +++   L  A    + + + +I PHAGY YSG  AA A
Sbjct: 1   MTTIRSPAVAGLFYPADAWQLEQDVQHLLAMAEFHDT-KPKALIVPHAGYVYSGAIAATA 59

Query: 61  FGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           + +I     +I RV LLGP+H       AL    V+ TP+G + LD +++  +       
Sbjct: 60  YASIYSVAAAIRRVVLLGPAHRVAVQGLALPGVDVFDTPLGRVKLDTDLVNAIA-----H 114

Query: 119 LMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
           L  + V +     EHS+E+ LP+L KV       ++P+ VG  +AE  A    +  +   
Sbjct: 115 LPQVTVSKEAHVLEHSLEVQLPFLQKVLSD--FTLLPLAVGMTSAEAVA---EVLEQLWG 169

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
                  +SSD  H      Y+ Y     V  K++++       I++   P       ++
Sbjct: 170 GEETLIVISSDLSH------YLPYATAQRVDSKTVQS-------IVQLRQP-------ID 209

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           +D   CG   I+  +  +      +    L    S      RD  V YA+ A
Sbjct: 210 HDQA-CGSTAINGLI--IAAQQHHLTPYLLDLRNSGDTAGSRDQVVGYAAIA 258


>gi|85858129|ref|YP_460331.1| cytoplasmic protein [Syntrophus aciditrophicus SB]
 gi|85721220|gb|ABC76163.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
          Length = 278

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 122/283 (43%), Gaps = 18/283 (6%)

Query: 4   IRRASHAGSWYTDNSKRLAEE-LDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           I+R+  AGSWY  + + L+ + +D +    G      + G++APHAGY YSG+ AA+A+ 
Sbjct: 6   IKRSLIAGSWYPGSPRVLSRDIMDYFDNVPGKTVQGRILGLVAPHAGYMYSGQVAAHAYK 65

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            I   +   VF++GPSH  +    +L     Y+TP+G + +  ++   L           
Sbjct: 66  EIKGQTYDVVFVIGPSHRAFFRGVSLFKEGGYETPLGIVDVHEDMAARLLEQDPRIAFLP 125

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
            V   EHS+E+ LP+L +V  G     VP+++G  + E   +          +       
Sbjct: 126 DVHLQEHSVEIQLPFL-QVALGEF-SFVPLIMGDQDYETCRVLADAIVNCCGNKQVLIVG 183

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSD  H      Y  Y++   +  + +E L KM        D     + L       CG 
Sbjct: 184 SSDLSH------YHGYEQAVRMDSRILEHLRKM--------DECGLIRDLSSGTGEACGG 229

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            P +V + M+       K   L+Y  S      R   V YA+A
Sbjct: 230 GPAAVTM-MVARQLGADKAAVLKYANSGDVTGDRSGVVGYAAA 271


>gi|374327733|ref|YP_005085933.1| putative dioxygenase [Pyrobaculum sp. 1860]
 gi|356643002|gb|AET33681.1| putative dioxygenase [Pyrobaculum sp. 1860]
          Length = 281

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 131/292 (44%), Gaps = 27/292 (9%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-------PEVRGVIAPHAGYSYSGR 55
           ++R+ + AG +Y    + L +++D  ++    PK+        E  G +APHAGY YSG 
Sbjct: 2   RVRKPAVAGYFYESEREELLQQIDWAVKHELGPKALQMPKLGAEALGGVAPHAGYMYSGP 61

Query: 56  AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+ +  +        F ++GP+H+      A+  + V++TP+G L +D E+   + + 
Sbjct: 62  VAAWLYSYLAGFGKPDAFVVVGPNHYGIGAPVAIMKSGVWETPLGRLEIDEELAGLIASH 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            K    D      EHS+E+ +P++   F    +KIVPI +           G   A+ + 
Sbjct: 122 YKELEDDFYAFAKEHSLEVQMPFIQYYFGD--VKIVPITIWRQTLSTSRELGVAIARALR 179

Query: 175 DPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           +     +   SSDF H+          + H V  +     D+M +  I   D  +  +  
Sbjct: 180 EYGRRVYVIASSDFNHY----------EPHDVTTRK----DEMAISKILKLDEASLFEVA 225

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            ++D +ICG  PI V + ++      +    L++  S      R+ +V YAS
Sbjct: 226 SKFDISICGLGPIGVLI-VVAKELGYLNATLLKHATSGDTSGYREETVGYAS 276


>gi|403253107|ref|ZP_10919412.1| hypothetical protein EMP_04985 [Thermotoga sp. EMP]
 gi|402811869|gb|EJX26353.1| hypothetical protein EMP_04985 [Thermotoga sp. EMP]
          Length = 277

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 129/296 (43%), Gaps = 39/296 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
           R+ + AG +Y      L E+     LD  +    LP   E +     G+++PHAGY YSG
Sbjct: 3   RKPAVAGLFYPSRKDELIEQIRMCFLDKRIGPGKLPGPVETKLQNPIGLVSPHAGYIYSG 62

Query: 55  RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ F   +     S V ++GP+H        +     ++TP+G +P++   +E + +
Sbjct: 63  PVAAWGFLEAVKFGEPSVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNERAVEIVLS 122

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV----GAVNAENEAMYGRLF 169
             ++   D      EHS+E+ +P+L  VFE   + IVPI +     AV  +  +   +L 
Sbjct: 123 NSRYAEKDFMSHIREHSIEVQIPFLQFVFEE--VPIVPICLMDQSPAVAEDLASALAKLV 180

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
           A++   P      S+D           HY+ +   + K     D   ++ IE  DP    
Sbjct: 181 AEF---PGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIEGMDPSLLY 223

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +YL+  D ++CG   ++  L+M         ++ L++  S          V Y SA
Sbjct: 224 EYLVREDISMCGYGGVATLLNMDFE-----NVRILKHATSGDVSGDTLEVVGYLSA 274


>gi|451979656|ref|ZP_21928071.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
 gi|451763184|emb|CCQ89268.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
          Length = 267

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 121/282 (42%), Gaps = 23/282 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + +G +Y D  + L +E+   L  +G   +    GV+APHAG+ YSG  A   +  
Sbjct: 2   LRQPAVSGRFYPDRPQALIKEVKSHL--SGDRTAVRALGVVAPHAGFMYSGDVAGSVYER 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I+     R+ L+GP+H       A+ +   +  P+GD+ +D E+   ++        D  
Sbjct: 60  IEIP--ERILLIGPNHTGRGRPVAIQSHGAWVMPMGDVQIDTELAAAIRKHLPGVEEDDD 117

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
             E EHS+E  LP+L   +     K VP+ +  ++  +    G   A+ +++        
Sbjct: 118 AHENEHSLETQLPFLQ--YFKQPFKFVPLCLKRLSLNDCQKVGHALARSIEEVGEPVLVV 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+ S              H+S    D+  +D +   D       +     T+CG
Sbjct: 176 ASSDMTHYES--------------HESASLKDRRAIDRMLDLDAAGLHTTVHNERITMCG 221

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            +P +V L  + +   K  +   RY  S +     DS V YA
Sbjct: 222 VNPATVMLTTVQDLGAKQAV-LSRYMTSGEVSGDMDSVVGYA 262


>gi|240103783|ref|YP_002960092.1| hypothetical protein TGAM_1726 [Thermococcus gammatolerans EJ3]
 gi|259646472|sp|C5A7L6.1|Y1726_THEGJ RecName: Full=MEMO1 family protein TGAM_1726
 gi|239911337|gb|ACS34228.1| Dioxygenase, putative [Thermococcus gammatolerans EJ3]
          Length = 291

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 72/292 (24%), Positives = 131/292 (44%), Gaps = 10/292 (3%)

Query: 4   IRRASHAGSWYT--DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           IR  + AGS+Y   D    + E+    L E G  +   +   +APHAGY +SG  A+  +
Sbjct: 2   IRYPAVAGSFYPADDELILMLEKFFSDLGEEGSER--RITAGVAPHAGYIFSGYTASRTY 59

Query: 62  GNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             I    +   F +LGP+H       A+     + TP+G + +D E+ +E+        +
Sbjct: 60  KAIFEDGLPETFVILGPNHTGLGSPIAVYPEGEWLTPLGSIEVDAEMAKEIAKLSGIADL 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D    + EHS+E+ +P++  + E  G  +KIVPI +G  + +     G+   +  +    
Sbjct: 120 DELAHKYEHSIEVQVPFIQYLTELAGKEVKIVPITLGIQDEDVSRALGKAIFEASEKLGR 179

Query: 179 FFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
              V  S+DF H+G  + Y+ +  +   +   I+  D   +  I   D D   + L E  
Sbjct: 180 DVVVIASTDFMHYGPAYGYVPFRARADELPHRIKEWDFRLIRRILDFDVDGLFRELREMR 239

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           +T+CG   +   + +    +  ++ + L Y  S +     ++ V YAS   +
Sbjct: 240 HTMCGPGAVGTAI-VYSRLAGAVEAELLHYTTSYEISRSTEAVVGYASIVMR 290


>gi|225848005|ref|YP_002728168.1| hypothetical protein SULAZ_0171 [Sulfurihydrogenibium azorense
           Az-Fu1]
 gi|225643721|gb|ACN98771.1| conserved hypothetical protein [Sulfurihydrogenibium azorense
           Az-Fu1]
          Length = 265

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 102/195 (52%), Gaps = 14/195 (7%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAY 59
           M  +R+ + A  +YT + +RL   +  ++ +A L P  PE  G+I+PHAGY YSG  A  
Sbjct: 1   MLTVRKPAVANLFYTGDKERLYYTVKSYIEKAPLYPYKPE--GLISPHAGYMYSGIVAGV 58

Query: 60  AFG---NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATG 115
           ++    N+D        L+GPSH+ Y    +      ++TP+G++ +  E IE  +K+  
Sbjct: 59  SYKQLLNLDLDKHYTFLLIGPSHYVYLQGISFGYYDFWQTPLGNVKVAKEKIEAFIKSYP 118

Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            F + ++    + EHS+E+ +P+L  + +     I+P++ G +    +++Y +    +  
Sbjct: 119 NFPITLNTLPHQKEHSLEVQVPFLQVIMKN--FDIIPVVYGDI----DSIYVKKVIDFFK 172

Query: 175 DPSNFFSVSSDFCHW 189
           D +    +SSD  H+
Sbjct: 173 DENTVVIISSDLSHY 187


>gi|127511864|ref|YP_001093061.1| hypothetical protein Shew_0931 [Shewanella loihica PV-4]
 gi|126637159|gb|ABO22802.1| protein of unknown function DUF52 [Shewanella loihica PV-4]
          Length = 266

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 92/193 (47%), Gaps = 18/193 (9%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV----RGVIAPHAGYSYSGRAAA 58
           K R+A+ AG +Y      L ++L  +  EA     P V    + +I PHAGY YSG  AA
Sbjct: 2   KYRQAAVAGRFYPAEPGLLTQQLIHYFGEA-----PSVHIIPKALILPHAGYLYSGEVAA 56

Query: 59  YAFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
            A   +   P    RV LLGPSH+     CAL  +  + TP+G++P+D  V  EL    +
Sbjct: 57  KAVNLLRNHPDGYRRVVLLGPSHYVGLNGCALPRSDSFITPLGEIPID-RVGIELLLNRR 115

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
             +      + EH++E+ LP L    +  +  ++P++VG  + E          + V D 
Sbjct: 116 LAIASDMAHQREHALEVELPLLQFCLDDFI--LLPVVVGGASPEAVCQ----LIQAVADS 169

Query: 177 SNFFSVSSDFCHW 189
                VSSD  H+
Sbjct: 170 DTLIVVSSDLSHY 182


>gi|23831516|sp|O59292.2|Y1626_PYRHO RecName: Full=MEMO1 family protein PH1626
          Length = 291

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 131/291 (45%), Gaps = 10/291 (3%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFGN 63
           R  + AG +Y +    L E L  + ++ G   +   +   +APHAGY +SG  A+  +  
Sbjct: 3   RYPAVAGQFYPEGDA-LIEMLSSFFKDLGEEGTKRTITAGVAPHAGYVFSGFTASRTYKA 61

Query: 64  IDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           I    +  VF++ GP+H       AL     + TP+G + +D +  +E+        +D 
Sbjct: 62  IYEDGLPEVFVIFGPNHTGLGSPIALYPEGEWITPMGSIKVDSKFAKEIVKRSGIADLDD 121

Query: 123 CVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGR-LF-AKYVDDPSN 178
              + EHS+E+ LP++  + E  G  +KIVPI +G  + E     GR +F A        
Sbjct: 122 LAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDEEVSRSLGRSIFEASTSLGRDT 181

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               S+DF H+GS + Y+ +  +   +   +   D   +  I   D D     + E ++T
Sbjct: 182 IIIASTDFMHYGSFYGYVPFRGRPEELPNMVRDWDMRIIRRILDFDLDGMFSEIREMNHT 241

Query: 239 ICGRHPISVFLHMLGNCSTKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           +CG  P  V   ++ +   K K  + L Y  S +     D+ V YAS   K
Sbjct: 242 MCG--PGGVGAGIIYSRLMKAKEAELLHYTTSFEVSRSPDAIVGYASIIMK 290


>gi|387127441|ref|YP_006296046.1| dioxygenase [Methylophaga sp. JAM1]
 gi|386274503|gb|AFI84401.1| dioxygenase [Methylophaga sp. JAM1]
          Length = 262

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 122/286 (42%), Gaps = 30/286 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR ++ AG +Y   S  L + L     +         + +I PH GY YSG+ AA A
Sbjct: 1   MISIRPSAVAGLYYPARSTELEDLLAKQFDKPSDHIHHHAKALIVPHGGYFYSGQVAAKA 60

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           + ++   +  I RV L+GPSH       A+S A  + TPIG + +D ++  +L      E
Sbjct: 61  YQSLMEVADDIERVILIGPSHRTDFKGIAMSEADYFATPIGSVAVDKKIYPQLSRIKGVE 120

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             +   D  EH +E+ LP+L   +  +  +IVP+L G  N+   ++   +      D  +
Sbjct: 121 TYESPHDN-EHCLEVQLPFLQ--YSLNQFEIVPLLTGKANS---SLIADVLGTATQDVKS 174

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              +SSD  H      Y+ Y+    +   + +A+  M     +  D D F          
Sbjct: 175 LIVISSDLSH------YLDYETARKIDQYTSQAIISM-----DNHDIDKFHA-------- 215

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            CG   I  FL    N +   +I  L    S     K++S V Y +
Sbjct: 216 -CGCDAIRGFLEYARNANMSGQIMAL--SNSGDIVRKKESVVGYGA 258


>gi|150401805|ref|YP_001325571.1| hypothetical protein Maeo_1383 [Methanococcus aeolicus Nankai-3]
 gi|150014508|gb|ABR56959.1| protein of unknown function DUF52 [Methanococcus aeolicus Nankai-3]
          Length = 284

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 131/298 (43%), Gaps = 37/298 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD-GWLREAG---LPKSPEVR---GVIAPHAGYSYSGRA 56
           +R    AG++Y  +   L E ++  +  + G   +P   E +   GVIAPHAGY YSG  
Sbjct: 3   VRNPVVAGAFYPSDPNELIEIIEYCYFHKLGPNTIPIGGEYKKPVGVIAPHAGYIYSGAP 62

Query: 57  AAYAFGNI------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           AAY++  I      D T+I    ++GP+H       ++    ++KTP+GD+  D E I++
Sbjct: 63  AAYSYSAISERVSGDITAI----IIGPNHTGLGEGVSVMDG-IWKTPLGDVSTDTEFIDK 117

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLI--KIVPILVGAVNAENEAMYGRL 168
           L        +D      EHS+E+ LP+L  +     +  KIVPI +   + E     G  
Sbjct: 118 LWKECDVVELDELAHSREHSIEIQLPFLQHIALRQSVKFKIVPICMAMQDYETSMDVGYF 177

Query: 169 FAKYVDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
            AK   + +       S+DF H+  + N     KK  +I K+I A+D+  +        D
Sbjct: 178 IAKIAKELNRNVIIIASTDFSHYEPQEN---ASKKDALIIKNILAMDEKSL------YSD 228

Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           A     + Y+ ++CG  P    +  +         K L Y  S        S V Y S
Sbjct: 229 A-----ITYNISMCGYGPTMAMIRAMKELGATTS-KLLSYCTSGDITGDYSSVVGYGS 280


>gi|317153856|ref|YP_004121904.1| hypothetical protein Daes_2152 [Desulfovibrio aespoeensis Aspo-2]
 gi|316944107|gb|ADU63158.1| protein of unknown function DUF52 [Desulfovibrio aespoeensis
           Aspo-2]
          Length = 268

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 123/284 (43%), Gaps = 26/284 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+   AG +Y  +  RL  ++DG+L + G P+  +    + PHAGY YSG          
Sbjct: 3   RQPVVAGRFYDVDPARLFGQVDGFLSQ-GAPRGGQTLLAMVPHAGYVYSGAVCGQTLAQ- 60

Query: 65  DPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
              S++R V LLGP+H       AL     +  P G +P+D+E+   L         D  
Sbjct: 61  --ASLARTVLLLGPNHTGLGQPFALWPDGAWHIPGGSVPVDVELATALLGADPRIAPDTA 118

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS-- 181
               EHS+E+ LP+L ++    L+ IVP+ + +   E+ A  GR   + +    +  S  
Sbjct: 119 AHLREHSLEVVLPFLYRLDP--LVSIVPLAIASHVFEDVAGVGRAIGRAIKAFGHPVSIV 176

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           VSSD  H+              + H     LD + ++   + DP      +  +  ++CG
Sbjct: 177 VSSDMSHY--------------ISHDQARRLDALALEAAVSLDPRGLFDAVRAHSISMCG 222

Query: 242 RHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             P++  L+  L   +T+ ++    Y  S +     D  V YA 
Sbjct: 223 VLPMTAGLYAALEMGATRGELA--AYATSGEVSGDFDQVVGYAG 264


>gi|148269972|ref|YP_001244432.1| hypothetical protein Tpet_0837 [Thermotoga petrophila RKU-1]
 gi|166235097|sp|A5IKY3.1|Y837_THEP1 RecName: Full=MEMO1 family protein Tpet_0837
 gi|147735516|gb|ABQ46856.1| protein of unknown function DUF52 [Thermotoga petrophila RKU-1]
          Length = 277

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 127/296 (42%), Gaps = 39/296 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
           R+ + AG +Y      L E+     LD  +    LP   E +     G ++PHAGY YSG
Sbjct: 3   RKPAVAGLFYPSRRDELVEQIRICFLDKRIGPGELPDPSETKLQSPIGFVSPHAGYIYSG 62

Query: 55  RAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
             AA+ F  +    +P   S V ++GP+H        +     ++TP+G +P++    E 
Sbjct: 63  PVAAWGFLEVAKFGEP---SVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNQRAAEI 119

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI-LVGAVNAENEAMYGRLF 169
           +  + ++   D      EHS+E+ +P+L  VF    + IVPI L+    A  E +   L 
Sbjct: 120 ILNSSRYAEEDFMSHIREHSIEVQIPFLQFVFGD--VSIVPICLMDQSPAVAEDLANALT 177

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
               + PS     S+D           HY+ +   + K     D   M+ I   DP    
Sbjct: 178 KLVAEFPSVLIIASTDLN---------HYEDQRTTLRK-----DSYIMEAIRNKDPRLLY 223

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +YL++ D ++CG   ++  L+M  N       + L++  S      +   V Y SA
Sbjct: 224 EYLVKEDISMCGYGGVATLLNM--NFKNA---RILKHATSGDVSGDKLEVVGYLSA 274


>gi|70605939|ref|YP_254809.1| hypothetical protein Saci_0089 [Sulfolobus acidocaldarius DSM 639]
 gi|449066131|ref|YP_007433213.1| hypothetical protein SacN8_00420 [Sulfolobus acidocaldarius N8]
 gi|449068407|ref|YP_007435488.1| hypothetical protein SacRon12I_00420 [Sulfolobus acidocaldarius
           Ron12/I]
 gi|76363480|sp|Q4JCG3.1|Y089_SULAC RecName: Full=MEMO1 family protein Saci_0089
 gi|68566587|gb|AAY79516.1| conserved Archaeal protein [Sulfolobus acidocaldarius DSM 639]
 gi|449034639|gb|AGE70065.1| hypothetical protein SacN8_00420 [Sulfolobus acidocaldarius N8]
 gi|449036915|gb|AGE72340.1| hypothetical protein SacRon12I_00420 [Sulfolobus acidocaldarius
           Ron12/I]
          Length = 284

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 90/301 (29%), Positives = 138/301 (45%), Gaps = 41/301 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD-GWLREAG---LPKSPEV-------RGVIAPHAGYSY 52
           IR  + AGS+Y  +  RL ++++  +L + G   LP  P+        R  + PHAGY Y
Sbjct: 2   IRIPAVAGSFYEADPVRLRKQIEWSFLHDLGPKSLPSVPQNKPPQRSNRFFVVPHAGYMY 61

Query: 53  SGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV-IEE 110
           SG  AA+A+ ++    S   V +LGP+H       ++     +KTP+G++ +D E+ +E 
Sbjct: 62  SGPVAAHAYYHLSLEGSPDTVIILGPNHTGLGSYVSIWHKGKWKTPLGEVSVDDEISLEL 121

Query: 111 LKATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
           +K T   E++DI  DE     EHS+E+ +P+L  +F G   KIVPI++     +      
Sbjct: 122 VKLT---EIIDI--DERAHLYEHSIEVQIPFLQYLF-GQNFKIVPIVIMMQTPDVAESLA 175

Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
               K V        V  SSD  H      Y  +D       K+IE  D + +D I+  D
Sbjct: 176 EGIYKLVSSGKKDIVVLASSDLNH------YEPHD-------KTIEK-DNLAIDEIQKLD 221

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
                + + E D T CG  PI   L +      K  +  LR+  S      + S V Y S
Sbjct: 222 YKGLFRVVEEKDVTACGYGPIMTVLILAKKLGKKPYV--LRHATSGDTSGDKSSVVGYLS 279

Query: 285 A 285
            
Sbjct: 280 V 280


>gi|408403728|ref|YP_006861711.1| hypothetical protein Ngar_c11120 [Candidatus Nitrososphaera
           gargensis Ga9.2]
 gi|408364324|gb|AFU58054.1| uncharacterized protein family UPF0103 [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 282

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 127/293 (43%), Gaps = 31/293 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSP------EVRGVIAPHAGYSYSGRA 56
           R  + AG +Y DN + L   +D   R    G  ++P       + G+++PHAGY YSG  
Sbjct: 4   RAPAVAGMFYPDNPRELRGLVDESFRNQRFGPGRAPPSANKRRIYGIVSPHAGYVYSGAV 63

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           AA  F  +   +   V ++GP+H+      A      ++TP+G++ ++ E+ E + A   
Sbjct: 64  AANGFYEVSSINFQDVIMVGPNHYGIGSWVAAMKEGEWETPLGEVKVNSELAEGIAARSS 123

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRLFAK 171
               D      +H +E+ LP+L  + +    +IVPI++ +  +E       A+   +  K
Sbjct: 124 TLDFDNFAHSRDHCLEVQLPFLQYIKQD--FRIVPIVLVSQRSETAYDLGNAISETIVEK 181

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
            V D S     SSDF H+    N  H  +K   + K+I ALD            + F   
Sbjct: 182 RVAD-STLLIASSDFTHYEPN-NEAH--RKDAELIKAILALDV-----------NKFYAV 226

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           L   D + CG   I+  +    N     + + L+Y  S        + V Y+S
Sbjct: 227 LERLDVSACGYGAIATMMVAAKNLGAT-RGELLKYATSGDITGDTSAVVGYSS 278


>gi|379005390|ref|YP_005261062.1| putative dioxygenase [Pyrobaculum oguniense TE7]
 gi|375160843|gb|AFA40455.1| putative dioxygenase [Pyrobaculum oguniense TE7]
          Length = 281

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 131/295 (44%), Gaps = 33/295 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGR 55
           ++R+ + AG +Y    ++L +++D  ++    PK+ ++        G + PHAGY YSG 
Sbjct: 2   RVRKPAVAGYFYAAEKEKLIQQIDWSIKHELGPKALQMPKLGEKALGGVVPHAGYIYSGP 61

Query: 56  AAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
            AA+ +    G   P +I    ++GP+H+      A+  + V++TP+G + +D ++ E +
Sbjct: 62  VAAWLYSALAGYGKPDAI---IIIGPNHYGIGAPVAVMKSGVWETPLGRVEVDGDLAELI 118

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
               K    D      EHS+E+ +P++   F    ++IVPI+V           G+  A 
Sbjct: 119 MRHYKGVEDDFYAFSKEHSVEVQIPFIQYYFGD--VRIVPIVVWRQTLSTSRELGKAVAT 176

Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
            + +   +  V  SSDF          HY+     + K     D M +  I   D     
Sbjct: 177 AIREYGRYVYVLASSDFN---------HYEPHEVTVQK-----DDMAISKILKVDEAGLF 222

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +   ++D +ICG  PI   + +         +  L++  S      +D +V YAS
Sbjct: 223 EVASKFDISICGLGPIGALIVIAKELGFG-NVTLLKHATSGDTSGYKDETVGYAS 276


>gi|386283998|ref|ZP_10061221.1| hypothetical protein SULAR_02073 [Sulfurovum sp. AR]
 gi|385344901|gb|EIF51614.1| hypothetical protein SULAR_02073 [Sulfurovum sp. AR]
          Length = 268

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/289 (26%), Positives = 119/289 (41%), Gaps = 35/289 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEEL-------DGWLREAGLPKSPEVRGVIAPHAGYSYSGRAA 57
           R AS AG +Y  +   +   L       D  L+E       + R VI PHAGY YSG  A
Sbjct: 4   REASVAGQFYPSSPDEIHSMLEHYNQIIDEHLKEKDSVLHLKPRAVIVPHAGYVYSGFTA 63

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
             A   +  + + RV ++GPSH  Y    ++S    Y TP+G L +D  ++ +LK   +F
Sbjct: 64  NIALRLLSNSDVKRVVIIGPSHRVYLTGTSISEFDTYHTPLGALLIDKPLVLDLKE--RF 121

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
           ++  +     EHS E+ +P++        + +V ++ G    E+      +    +DDP 
Sbjct: 122 DIGFVPEAHHEHSTEVQVPFVKTYTPD--VSVVELVYG---DESPQKLAEVIEYLLDDPE 176

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
               +S+D  H+        YD K         ALD + +D +E           L    
Sbjct: 177 TAVVISTDLSHY--------YDIKKA------NALDSICLDAVEK-----LSTAELHEGC 217

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
             CG+  I V   +L      ++   L Y  S+         V Y SAA
Sbjct: 218 EACGK--IGVEAMLLAAKKNGLRSVLLDYRTSADASGDESQVVGYMSAA 264


>gi|333987636|ref|YP_004520243.1| hypothetical protein MSWAN_1428 [Methanobacterium sp. SWAN-1]
 gi|333825780|gb|AEG18442.1| UPF0103 protein [Methanobacterium sp. SWAN-1]
          Length = 286

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/295 (25%), Positives = 125/295 (42%), Gaps = 29/295 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPE-------VRGVIAPHAGYSYSGR 55
           IR+ + AG +Y  +   L E+++   + + G  K P+       + GVIAPHAGY YSG 
Sbjct: 2   IRKPAVAGLFYEVDPDSLNEQIEWCFKSQIGPGKVPKKIGNKRNITGVIAPHAGYIYSGP 61

Query: 56  AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+++  I        F +L P+H       +  T   ++TP+G++ +D E  + L   
Sbjct: 62  VAAHSYYKIAEDGFPETFVILSPNHTGLGSGVSAMTEGAWETPLGNVEIDEEFAQNLVRN 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAK 171
                 D      EHS E+ +P+L  +   +L    KIVP+ +   + E     G    K
Sbjct: 122 TSIMDSDPSAHMQEHSAEVQIPFLQYLKNKYLEKDFKIVPVCMWMQDIETAMEVGISIQK 181

Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
             +D      V  S+DF H+          K   + H      D   +D I + D +   
Sbjct: 182 TAEDLGRDVVVIASTDFTHY----------KPQDIAHDG----DMQVIDAITSMDENLMM 227

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             +   D T+CG  P++  + +        + +F +Y  S       +S V YAS
Sbjct: 228 ARVSALDVTMCGYGPVAATM-VASKLRGADRCEFEKYATSGDATEDYNSVVGYAS 281


>gi|284991034|ref|YP_003409588.1| hypothetical protein Gobs_2556 [Geodermatophilus obscurus DSM
           43160]
 gi|284064279|gb|ADB75217.1| protein of unknown function DUF52 [Geodermatophilus obscurus DSM
           43160]
          Length = 268

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/289 (27%), Positives = 124/289 (42%), Gaps = 41/289 (14%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           ++R  + AG +Y  + + L + +D  L E  G P+      V+APHAGY YSG  AA A+
Sbjct: 2   RVRPPAVAGRFYPADPEELHDLVDRLLDEVCGQPRPVPPAAVVAPHAGYRYSGAVAATAY 61

Query: 62  GNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
            ++   P +++RV LLGP+H +     A+       TP+G + +D     +   T    L
Sbjct: 62  AHLTAAPHAVTRVVLLGPAHFWPLDGMAVPAVDALATPLGSVDVD-----DDARTVAAAL 116

Query: 120 MDICVD----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
             + VD    + EHS+E  LP+L +V  G  + ++P+LVG    E+ A         VD 
Sbjct: 117 PGVAVDDRPHDGEHSLETQLPFLQRVL-GPEVAVLPVLVGRTQPESVAA----VLTAVD- 170

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
                 VS+D  H        H D+       S    D+   D +   D  A +      
Sbjct: 171 -GALAVVSTDLSH--------HLDE------PSARERDRRTADAVLARDAAALRP----- 210

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            +  CG  P+   LH        + ++ LR   S+     R   V Y +
Sbjct: 211 -DDACGHQPLRGPLHH--AAERHLSVELLRMATSADSGAGRARVVGYGA 256


>gi|395645086|ref|ZP_10432946.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methanofollis liminatans DSM 4140]
 gi|395441826|gb|EJG06583.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methanofollis liminatans DSM 4140]
          Length = 261

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/253 (27%), Positives = 114/253 (45%), Gaps = 36/253 (14%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           + R  + AG +Y      L + L+  +     P +P+  G+++PHAGY YSG   A AF 
Sbjct: 2   ETRPCTFAGMFYPGEPGHLEQFLE-TVTPVERP-APDALGIVSPHAGYPYSGAVGARAFS 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            I P       ++GPSH  +      ++A  ++TP+G + +D++ +  L         DI
Sbjct: 60  AIRPDFDGTFVVIGPSHRGF---LTCTSAIPWETPLGIVDVDVDFVRAL---------DI 107

Query: 123 CVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-AENEAMYGRLF-AKYVDDP 176
            VDE     EHS+E+  P++   F     +I PI++G  + A  E + G++  A      
Sbjct: 108 RVDEVSMEDEHSLEVQTPFIKHFFP--RAQIAPIMMGDQSPASAELVAGKIVRAIRATGR 165

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                 SSDF H+              +        D M ++ IE  D D   + ++EYD
Sbjct: 166 EVRIVASSDFSHY--------------IPESQARKRDAMAIEAIERLDVDGLYRSIVEYD 211

Query: 237 NTICGRHPISVFL 249
            + CG  PI+  +
Sbjct: 212 ISACGYGPIAAMI 224


>gi|347523464|ref|YP_004781034.1| hypothetical protein Pyrfu_0913 [Pyrolobus fumarii 1A]
 gi|343460346|gb|AEM38782.1| protein of unknown function DUF52 [Pyrolobus fumarii 1A]
          Length = 285

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 72/293 (24%), Positives = 125/293 (42%), Gaps = 28/293 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSG 54
           +RR + AG +Y  + + L  +++         G L   G        G ++PHAGY YSG
Sbjct: 5   VRRPAVAGMFYEADPEALRSQIEWCFTHTLGPGKLPPRGGGSERLSVGFVSPHAGYMYSG 64

Query: 55  RAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ +  +        V ++GP+H       ++    V++TP+G + +D E+ + +  
Sbjct: 65  PVAAHVYYQLALEKKPDTVVIVGPNHTGLGTLVSVMVEGVWETPLGRVEIDSELAKLIVK 124

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
                 +D      EHS+E+ +P+L  ++ G   +IVPI++           G   AK  
Sbjct: 125 YSDLADIDDKAHLFEHSVEVQVPFLQYIY-GDEFRIVPIVMWDQTPRAARDLGEAVAKAA 183

Query: 174 DDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
            +      +  SSDF H+          + H V  K     D++ +D I   DP+   + 
Sbjct: 184 AELGRDVIYIASSDFTHY----------EPHEVAAKK----DRLAIDKILALDPEGLHEV 229

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +  YD ++CG  P+   L+           + L+Y  S      R S V YA+
Sbjct: 230 IQRYDISMCGPGPVMSMLY-YARARGASNAQLLKYATSGDVTGDRSSVVGYAA 281


>gi|407771035|ref|ZP_11118398.1| dioxygenase [Thalassospira xiamenensis M-5 = DSM 17429]
 gi|407285924|gb|EKF11417.1| dioxygenase [Thalassospira xiamenensis M-5 = DSM 17429]
          Length = 495

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 21/201 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----PEVRGVIAPHAGYSYSGRA 56
           M  IR  + AG++Y      L  E+DG +  A   ++    P  + +I PHAG  +SG  
Sbjct: 1   MTIIRPPAIAGTFYPAAPDMLRSEIDGLIHAAVESQTENDRPIPKAIIVPHAGLMFSGAI 60

Query: 57  AAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           A   F  +     ++ R+ ++GP+H       A++ A  + TP+GD+  DL  ++   A 
Sbjct: 61  AGLGFATVRALKGTVKRIVIIGPAHRMAFQGIAIARADGFATPLGDVRCDLAGLQAALAW 120

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAEN-----EAMYGRL 168
            + +++D      EH +E+ LP++ ++F E   I IVP+LV   +A       EA++G  
Sbjct: 121 PQVQVLDEA-HRLEHGLEIELPFIQRLFGENADIGIVPLLVSRCSAWQVHEVIEALWG-- 177

Query: 169 FAKYVDDPSNFFSVSSDFCHW 189
                  P     +SSD  H+
Sbjct: 178 ------GPETLIVISSDLSHF 192


>gi|406892372|gb|EKD37739.1| hypothetical protein ACD_75C01015G0003 [uncultured bacterium]
          Length = 267

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 120/282 (42%), Gaps = 22/282 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + A  +Y  +++ LA  +   L      +  +   V++PHAGY YSG  AA  F +
Sbjct: 1   MRQPAVADRFYPGSAEALALAVKELLPAKSTSEKLKGLAVVSPHAGYVYSGALAAETFSS 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +       V +LGP+H       AL T T +  P+GD+P+D EV + + +      +D  
Sbjct: 61  V--VVPETVIILGPNHRGQGAPVALGT-TSWNMPLGDVPIDREVADLILSHSPQIKVDEM 117

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE--NEAMYGRLFAKYVDDPSNFFS 181
               EHS+E+ +P+L K+ E   + IVP+++  ++     E       A           
Sbjct: 118 AHRYEHSLEVQVPFLQKLQER--LCIVPLVISHISYPLCEEVANALALAIRQSGKEILIV 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+ SR              +S E  D++ +  IE  DP      + ++  T+CG
Sbjct: 176 ASSDMSHYESR--------------RSAEKKDRLALQCIERLDPYDLYHTVRDHRITMCG 221

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
             P+ + L     C    K   + Y  S          V YA
Sbjct: 222 VIPVVIALLAAKACGAS-KYHLIGYTDSGFVSGDTAQVVGYA 262


>gi|145590329|ref|YP_001152331.1| hypothetical protein Pars_0062 [Pyrobaculum arsenaticum DSM 13514]
 gi|166227788|sp|A4WH12.1|Y062_PYRAR RecName: Full=MEMO1 family protein Pars_0062
 gi|145282097|gb|ABP49679.1| protein of unknown function DUF52 [Pyrobaculum arsenaticum DSM
           13514]
          Length = 281

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 131/295 (44%), Gaps = 33/295 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGR 55
           ++R+ + AG +Y    ++L +++D  ++    PK+ ++        G + PHAGY YSG 
Sbjct: 2   RVRKPAVAGYFYPAEKEKLIQQIDWSIKHELGPKALQMPKLGEKALGGVVPHAGYIYSGP 61

Query: 56  AAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
            AA+ +    G   P +I    ++GP+H+      A+  + V++TP+G + +D ++ E +
Sbjct: 62  VAAWLYSALAGYGKPDAI---IIIGPNHYGIGAPVAVMKSGVWETPLGRVEVDGDLAELI 118

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
               K    D      EHS+E+ +P++   F    ++IVPI+V           G+  A 
Sbjct: 119 MRHYKGVEDDFYAFSKEHSVEVQIPFIQYYFGD--VRIVPIVVWRQTLSTSRELGKAVAT 176

Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
            + +   +  V  SSDF          HY+     + K     D M +  I   D     
Sbjct: 177 AIREYGRYVYVLASSDFN---------HYEPHEVTVQK-----DDMAISKILKVDEAGLF 222

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +   ++D +ICG  PI   + +         +  L++  S      +D +V YAS
Sbjct: 223 EVASKFDISICGLGPIGALIVIAKELGFG-NVTLLKHATSGDTSGYKDETVGYAS 276


>gi|410867018|ref|YP_006981629.1| MEMO_like multi-domain protein [Propionibacterium acidipropionici
           ATCC 4875]
 gi|410823659|gb|AFV90274.1| MEMO_like multi-domain protein [Propionibacterium acidipropionici
           ATCC 4875]
          Length = 275

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 123/301 (40%), Gaps = 54/301 (17%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-----RGVIAPHAGYSYSGR 55
            + IR  + AG +Y  +   L E LD  L  A  P   +V     R +I PHAGY YSG 
Sbjct: 7   QDAIRPPAVAGFFYPADPAELTETLDRLL-AASEPDEEDVDPDRLRTLIVPHAGYIYSGP 65

Query: 56  AAAYAFGNIDPTSI----SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
            AA+ +  +   +      RV +LGP+H       AL  A  Y TP+G  P+D       
Sbjct: 66  TAAHGYRLLARLAAHHPPRRVAVLGPTHRVAIRGLALPAAGGYATPLGVCPVD------- 118

Query: 112 KATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
            A+G  +L  + V  A    EHS+E+ +P+L ++     I I P+ VG    ++ A    
Sbjct: 119 -ASGLDDLPQVAVHAATHAEEHSLEVQVPFLQRILGD--IPITPLAVGLAEPDSVAQVIE 175

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
             A    DP  F  +SSD  H+ +       D     I + +E  D +  D         
Sbjct: 176 TLAA---DPDTFIVISSDLSHYHAHAEARRLDDH--TIARILERQDTIPAD--------- 221

Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS--VSYASA 285
                       CG HP++  L      S ++        +++   T  D S  V YAS 
Sbjct: 222 ----------RACGAHPLNGML----QASRRLGWAPHILARATSADTAGDPSRVVGYASL 267

Query: 286 A 286
           A
Sbjct: 268 A 268


>gi|257054055|ref|YP_003131888.1| hypothetical protein Huta_2994 [Halorhabdus utahensis DSM 12940]
 gi|256692818|gb|ACV13155.1| protein of unknown function DUF52 [Halorhabdus utahensis DSM 12940]
          Length = 283

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 70/301 (23%), Positives = 128/301 (42%), Gaps = 38/301 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
           M   R  + AG +Y      L E+++        P S        P + G++APHAG  +
Sbjct: 1   MASTRHPAVAGRFYVRTEPSLREQIEAAFTHEIGPGSVPTATAGPPAITGLVAPHAGLPF 60

Query: 53  SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           SG  AA+++  +  +     V ++GP+H       A+     ++TP+G +P+D  + +  
Sbjct: 61  SGPLAAHSYAALAKSGTPETVVIIGPNHTGVGAAVAVPGDDEWRTPLGSVPIDNGLRDRT 120

Query: 112 KATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
            A       D  VD+    +EH+ E+ LP+L  +++     ++PI +   +A+     G 
Sbjct: 121 VAE-----TDASVDDRAHASEHAAEVQLPFLQHLYDD--FAVLPISLRRQDADVAQALGD 173

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
           +   +  + +   + SSDF H+                H +    D++ +D I+  DP  
Sbjct: 174 VLETHAGEGAVVIA-SSDFTHYEP--------------HDTAIGRDELALDRIDATDPAG 218

Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
             + +     ++CG   I+  L   G  S   ++  L +  S      RD  V YAS A 
Sbjct: 219 LIETVEREGLSVCGYGAIAAMLWASGADS---QVDVLAHATSGDTAGSRDEVVGYASVAV 275

Query: 288 K 288
           +
Sbjct: 276 R 276


>gi|327398420|ref|YP_004339289.1| hypothetical protein Hipma_0253 [Hippea maritima DSM 10411]
 gi|327181049|gb|AEA33230.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Hippea maritima DSM 10411]
          Length = 267

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 97/200 (48%), Gaps = 9/200 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR+ + A  +Y  +  +L   LD  + +  +   P  + +I PHAGY YSG  AA A+  
Sbjct: 3   IRKPAVADMFYPGSPSQLKAYLDSVMFQPDVKVKP--KALIVPHAGYVYSGEVAAKAYSL 60

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           ID  S     ++GP+H     + ++    +Y+ P GD+  DLE+IE + A  ++  +D  
Sbjct: 61  ID--SFDTYIVMGPNHTGLGAEISIFDG-IYQMPFGDVEPDLELIESI-AKNEYAQIDYY 116

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
               EHS+E+ LP++  + +    KIVPI+VG  N           A+ + D        
Sbjct: 117 AHLQEHSLEVQLPFIDYISKKPY-KIVPIVVGTHNVAKLYSMAETIAETIKDAGKDVLIV 175

Query: 182 VSSDFCHWGSRFNYMHYDKK 201
           VS+D  H+  +   +  DK 
Sbjct: 176 VSTDMNHYEDQTTTLIKDKN 195


>gi|124028270|ref|YP_001013590.1| dioxygenase [Hyperthermus butylicus DSM 5456]
 gi|123978964|gb|ABM81245.1| predicted dioxygenase [Hyperthermus butylicus DSM 5456]
          Length = 301

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 74/261 (28%), Positives = 120/261 (45%), Gaps = 24/261 (9%)

Query: 35  PKSPEVRGVIAPHAGYSYSGRAAAYAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTAT 92
           P+   V GV+ PHAGYSYSG  AA+ +  +  +   +  V ++G +H  Y      +T T
Sbjct: 49  PRYTPVGGVV-PHAGYSYSGPCAAHLYLELAENAPEVDTVVVMGTNHTGY--GGVYTTTT 105

Query: 93  VYK---TPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKI 149
            YK   TP+G +  D+E IE LK        D      EHS+E+ LP+L  ++ G+  K+
Sbjct: 106 RYKAWATPLGTVETDIEFIELLKKLYPRLEDDYLAHMREHSIEVELPFLQYIY-GNNFKL 164

Query: 150 VPILVGAVNAENEAMYGRLFAKYVDDPSN------FFSVSSDFCHWGSRFNYMHYDKKHG 203
           VPI+V     E      R  A+ V   +           SSDF H G  ++Y+ + +   
Sbjct: 165 VPIVV----KEPSERMAREMAEAVKRAAEELGRRILVIASSDFTHHGYMYDYVLFTEN-- 218

Query: 204 VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKF 263
            + +++  LD   ++ I   D   F + +  Y  T+CG   I+  +        + K+  
Sbjct: 219 -VRENVAKLDMAIIEHILRLDTKGFLETIYRYGATVCGYGAIATLIEYAKLEGARAKL-- 275

Query: 264 LRYEQSSQCKTKRDSSVSYAS 284
           L+Y  S+       + V YA+
Sbjct: 276 LKYYNSADVTGDEAAVVGYAA 296


>gi|325095393|gb|EGC48703.1| DUF52 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 164

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/87 (43%), Positives = 52/87 (59%), Gaps = 8/87 (9%)

Query: 120 MDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEAMYGRLFAK 171
           M   VD AEHS+E+HLPY+  +              ++VP++VG+ +   EA +G L A 
Sbjct: 1   MSPPVDSAEHSLELHLPYIHHLLRTLYPTRAPAAYPRLVPMMVGSTSPATEAAFGALLAP 60

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHY 198
           Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 61  YLADDTNAFVVSSDFCHWGLRFGYTYY 87


>gi|393795899|ref|ZP_10379263.1| hypothetical protein CNitlB_06112 [Candidatus Nitrosoarchaeum
           limnia BG20]
          Length = 277

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/289 (26%), Positives = 124/289 (42%), Gaps = 24/289 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSP------EVRGVIAPHAGYSYSGR 55
           +IR  + AG +Y+   K L + + D +L + G  K P      ++ GVI PHAGY+YSG 
Sbjct: 2   QIRTPAVAGMFYSGEKKELKKSINDCFLHKFGPGKIPPSDNKKKIFGVICPHAGYAYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            A ++F  I         ++GP+H       A    + ++TP+G++ +D E  EE+    
Sbjct: 62  IACHSFYEISSNLPELFIIVGPNHWGIGSSVATMKDSKWQTPLGEVEVDSETAEEISNIT 121

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           K   +D      EHS+E+ +P L ++      +I+PI++   + E     G   A     
Sbjct: 122 KIIEVDNFSHSREHSLEVQIPILQEI--SKKFRILPIVLINQSKEVAIQLGSAIANIARK 179

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
                  SSDF H+    N   +++   +    IE + K+        D D F   L + 
Sbjct: 180 KKVMIIGSSDFTHY--EPNEFAHEQDAAL----IEPILKL--------DVDRFYDVLNKK 225

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           D + CG   I+  +          K + L+Y  S        S V Y S
Sbjct: 226 DISACGYGAIASTMIACKELGAT-KGELLKYATSGDITGDTSSVVGYGS 273


>gi|170290628|ref|YP_001737444.1| dioxygenase [Candidatus Korarchaeum cryptofilum OPF8]
 gi|170174708|gb|ACB07761.1| Predicted dioxygenase [Candidatus Korarchaeum cryptofilum OPF8]
          Length = 285

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 134/298 (44%), Gaps = 29/298 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--------EVRGVIAPHAGYSYSGR 55
           +R+ + AGS+Y   S  L + LD   R AG    P         +  ++ PHAGY YSG+
Sbjct: 5   LRKPAVAGSFYPSRSDDLLKLLDSLFRGAGSGGIPSPNPAGERRIASMMVPHAGYIYSGK 64

Query: 56  AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KA 113
            AA A+  +    I   F ++GP+H       + + +  ++TP+G + +DLE+ E + + 
Sbjct: 65  TAASAYSILAQDGIPETFVIIGPNHTGLGSAFSATKSKFWETPLGRVEVDLELAEAISRY 124

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
            G  +  D+     EHS+E+ +P+L  ++ G  +KI+PI++     E     G+  A   
Sbjct: 125 FGDLDFDDLA-HTWEHSIEVQVPFLQSIY-GASLKILPIVMAIQEPEPSIDLGKSIALAS 182

Query: 174 DDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
           +          SSD           HY  +   I +     D+  +D I   D  +  + 
Sbjct: 183 EKLGRDVLVIASSDMS---------HYLPEGEAIRR-----DRAAIDAILNMDVHSLFRI 228

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           + E D ++CG  P+SV +          K + +RY  S++    +   V YAS   ++
Sbjct: 229 IQELDVSMCGPGPVSVAIS-FAKLKGVSKGELVRYSTSAETSGDKSFVVGYASVVFRL 285


>gi|329765804|ref|ZP_08257370.1| hypothetical protein Nlim_1150 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329137647|gb|EGG41917.1| hypothetical protein Nlim_1150 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 277

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/289 (26%), Positives = 124/289 (42%), Gaps = 24/289 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSP------EVRGVIAPHAGYSYSGR 55
           +IR  + AG +Y+   K L + + D +L + G  K P      ++ GVI PHAGY+YSG 
Sbjct: 2   QIRTPAVAGMFYSGEKKELKKSINDCFLHKFGPGKIPPSDNKKKIFGVICPHAGYAYSGP 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            A ++F  I         ++GP+H       A    + ++TP+G++ +D E  EE+    
Sbjct: 62  IACHSFYEISSNLPELFIIVGPNHWGIGSSVATMKDSKWQTPLGEVEVDSETAEEISNIT 121

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           K   +D      EHS+E+ +P L ++      +I+PI++   + E     G   A     
Sbjct: 122 KIIEVDNFSHSREHSLEVQIPILQEI--SKKFRILPIVLINQSKEVAIQIGSAIANIARK 179

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
                  SSDF H+    N   +++   +    IE + K+        D D F   L + 
Sbjct: 180 KKVMIIGSSDFTHY--EPNEFAHEQDAAL----IEPILKL--------DVDRFYDVLNKK 225

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           D + CG   I+  +          K + L+Y  S        S V Y S
Sbjct: 226 DISACGYGAIASTMIACKELGAT-KGELLKYATSGDITGDTSSVVGYGS 273


>gi|126458656|ref|YP_001054934.1| hypothetical protein Pcal_0028 [Pyrobaculum calidifontis JCM 11548]
 gi|166227773|sp|A3MS51.1|Y028_PYRCJ RecName: Full=MEMO1 family protein Pcal_0028
 gi|126248377|gb|ABO07468.1| protein of unknown function DUF52 [Pyrobaculum calidifontis JCM
           11548]
          Length = 281

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 132/296 (44%), Gaps = 35/296 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-------EVRGVIAPHAGYSYSGR 55
           +IR+ + AG +Y   +  L   ++  ++    PK+P       E  G + PHAGY YSG 
Sbjct: 2   RIRKPAVAGYFYESRADELQARIEWSIKHEIGPKAPVLPKLGTEALGGVVPHAGYMYSGP 61

Query: 56  AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+ +  +        F ++GP+H+      A+  +  ++TP G + +D E +  L A+
Sbjct: 62  VAAWLYSALAGYGAPTTFVIIGPNHYGIGAPVAIMKSGAWETPFGRVEID-EELASLIAS 120

Query: 115 GKFELMDIC-VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
              E+ D       EHS+E+ LP++   F+   +K VPI V           G+  AK +
Sbjct: 121 NYREIEDDAHAFSKEHSIEVQLPFIQYYFKN--VKFVPIAVWRQTLSTSRELGKAIAKAL 178

Query: 174 DDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
            +     +   SSDF          HY+     + K   A+ K+ +++ E G  D   + 
Sbjct: 179 REYKRRVYLLASSDFN---------HYEPHDVTVKKDDMAIGKI-LELDEAGLFDVASR- 227

Query: 232 LLEYDNTICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
              +D +ICG  PI+   V    LG  +  +    L++  S      RD +V YAS
Sbjct: 228 ---FDISICGIGPIASVIVAAKELGFSNAAL----LKHATSGDTSGYRDETVGYAS 276


>gi|156743011|ref|YP_001433140.1| hypothetical protein Rcas_3068 [Roseiflexus castenholzii DSM 13941]
 gi|156234339|gb|ABU59122.1| protein of unknown function DUF52 [Roseiflexus castenholzii DSM
           13941]
          Length = 277

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/288 (26%), Positives = 122/288 (42%), Gaps = 23/288 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR +  AGSWY  N  RL   ++ +L +A  P  S  + G++APHAG  YSG  AA+AF 
Sbjct: 3   IRPSPIAGSWYPSNPARLRRAIEQYLAQATPPGLSGRIWGLLAPHAGLRYSGPVAAWAFA 62

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT----GKFE 118
            I  ++   + ++ P H         +    Y+TP+G + +D + I +L A       + 
Sbjct: 63  CIRGSTPDIIVIVSPWHRGGDTPLITTGHAAYETPLGVVEVDADAIVQLDAALCAHAGYG 122

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           L     D+ EH++E+ LP+L  V      +++P+++        A  G   A+ V     
Sbjct: 123 LARRRYDD-EHAIEIELPFLQCVLGS--FRLLPVMMADQRTTTAAALGAALAETVRGRDV 179

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSD  H+          +   V H+    LD   +  +   DPDA      E    
Sbjct: 180 LLIASSDLSHY----------EPASVAHR----LDAELLRRVAAFDPDAVIAAEYEGAGY 225

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CG   I+  L    +     ++  +RY  S   +      V Y +AA
Sbjct: 226 ACGSGAIAAVLWACRDLGAD-QVTIVRYGTSGDVEPDAPWVVGYGAAA 272


>gi|170288657|ref|YP_001738895.1| hypothetical protein TRQ2_0860 [Thermotoga sp. RQ2]
 gi|229890255|sp|B1LA64.1|Y860_THESQ RecName: Full=MEMO1 family protein TRQ2_0860
 gi|170176160|gb|ACB09212.1| protein of unknown function DUF52 [Thermotoga sp. RQ2]
          Length = 277

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 73/293 (24%), Positives = 128/293 (43%), Gaps = 33/293 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
           R+ + AG +Y      L E+     LD  +    LP   E +     G+++PHAGY YSG
Sbjct: 3   RKPAVAGLFYPSRRDELIEQIRMCFLDKRIGPGELPVPVETKLQNPIGLVSPHAGYIYSG 62

Query: 55  RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ F   +     S V ++GP+H        +     ++TP+G +P++   +E + +
Sbjct: 63  PVAAWGFLEAVKFGEPSVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNQRAVEVILS 122

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI-LVGAVNAENEAMYGRLFAKY 172
             ++   D      EHS+E+ +P+L  VF    + +VPI L+    A  E +   L    
Sbjct: 123 NSRYAEEDFMSHIREHSIEVQIPFLQFVFGE--VSMVPICLMDQSPAVAEDLASALMKLA 180

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
            + P      S+D           HY+ +   + K     D   ++ I++ DP    +YL
Sbjct: 181 AEFPGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIKSNDPRLLYEYL 226

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           ++ D ++CG   ++  L+M  N       + L++  S      +   V Y SA
Sbjct: 227 VKEDISMCGYGGVATLLNM--NFKNA---RILKHATSGDVSGDKLEVVGYLSA 274


>gi|152993349|ref|YP_001359070.1| hypothetical protein SUN_1766 [Sulfurovum sp. NBC37-1]
 gi|151425210|dbj|BAF72713.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
          Length = 273

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 97/195 (49%), Gaps = 14/195 (7%)

Query: 2   EKIRRASHAGSWYTDNSK---RLAEELDGWLREAGLPKSP-EVR--GVIAPHAGYSYSGR 55
           + IR+A+ AGS+Y +  +   R   E +       + K   ++R   +I PHAGY YSG 
Sbjct: 3   QGIRKAAVAGSFYPERCREIRRYIREFNAAFDRLSIKKEILDIRPGAIIVPHAGYIYSGF 62

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            A +A+  +  T   R+ ++GPSHHYY    +      ++TP G++ +D   +  L    
Sbjct: 63  TANFAYRFLKHTKPKRIIVIGPSHHYYFKGISAGHFENFETPCGEIEIDNPYLFAL--AK 120

Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
           +F +  D    E EHS E+ +P++   F     K++ ++ G V A+  A+   +    + 
Sbjct: 121 EFNIGFDPKAHEKEHSTEVQMPFIQHYFPK--AKVIELVYGDVPAKELAL---IITALLK 175

Query: 175 DPSNFFSVSSDFCHW 189
           +P N   +SSD  H+
Sbjct: 176 NPDNAVVISSDLSHF 190


>gi|302878778|ref|YP_003847342.1| hypothetical protein Galf_1560 [Gallionella capsiferriformans ES-2]
 gi|302581567|gb|ADL55578.1| protein of unknown function DUF52 [Gallionella capsiferriformans
           ES-2]
          Length = 266

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 32/288 (11%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR  + AG +Y  ++  L +E+  +L  A  P     + +I PHAGY YSG  AA A
Sbjct: 1   MPVIRPPAVAGLFYPADAGELKQEVQEYL-AAAQPFDLSPKALIVPHAGYIYSGAIAATA 59

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  +   +  I RV LLGP+H       AL     + TP+G + +D+   + ++   +  
Sbjct: 60  YATLRKLARKIRRVILLGPTHRVAIRGIALPGVDAFATPLGRIRIDVAAADAIRHLAQVT 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
                    EH++E+ LP+L  V       ++P+ VG  +A   A    +     D    
Sbjct: 120 TSS-HAHAQEHALEVQLPFLQSVLSD--FTVLPLAVGNASA---AEVAEVLECLWDGEET 173

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
              +SSD  H      Y+ Y     + +++++A       I+    P A +         
Sbjct: 174 LIVISSDLSH------YLPYATAQRIDYETVQA-------ILNLTQPIAHEHA------- 213

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CG  P+S    ++      ++   L  + S      RD  V YA+ A
Sbjct: 214 -CGGTPVSGL--IIAAAHHHLRPHLLALKNSGDTAGDRDRVVGYAALA 258


>gi|15642862|ref|NP_227903.1| hypothetical protein TM0087 [Thermotoga maritima MSB8]
 gi|418046024|ref|ZP_12684118.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thermotoga maritima MSB8]
 gi|7388377|sp|Q9WXU2.1|Y087_THEMA RecName: Full=MEMO1 family protein TM_0087
 gi|4980576|gb|AAD35181.1|AE001695_7 conserved hypothetical protein [Thermotoga maritima MSB8]
 gi|351675577|gb|EHA58737.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thermotoga maritima MSB8]
          Length = 277

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 74/296 (25%), Positives = 128/296 (43%), Gaps = 39/296 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
           R+ + AG +Y      L E+     LD  +    LP   E +     G+++PHAGY YSG
Sbjct: 3   RKPAVAGLFYPSRRDELIEQIRMCFLDKRIGPGKLPGPVETKLQNPIGLVSPHAGYIYSG 62

Query: 55  RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ F   +     S V ++GP+H        +     ++TP+G +P++   +E + +
Sbjct: 63  PVAAWGFLEAVKFGEPSVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNERAVEIVLS 122

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV----GAVNAENEAMYGRLF 169
             ++   D      EHS+E+ +P+L  VF    + IVPI +     AV  +  +   +L 
Sbjct: 123 NSRYAEEDFMSHIREHSIEVQIPFLQFVF--GEVSIVPICLMDQSPAVAEDLASALAKLV 180

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
           A++   P      S+D           HY+ +   + K     D   ++ IE  DP    
Sbjct: 181 AEF---PGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIEGMDPSLLY 223

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +YL+  D ++CG   ++  L+M         ++ L++  S          V Y SA
Sbjct: 224 EYLVREDISMCGYGGVATLLNMDFE-----NVRILKHATSGDVSGDTLEVVGYLSA 274


>gi|158522440|ref|YP_001530310.1| hypothetical protein Dole_2429 [Desulfococcus oleovorans Hxd3]
 gi|158511266|gb|ABW68233.1| protein of unknown function DUF52 [Desulfococcus oleovorans Hxd3]
          Length = 484

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 71/288 (24%), Positives = 121/288 (42%), Gaps = 31/288 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
           IR+   AG++Y  + + L+  +    R+A       P    +  +I PHAGY YSG  AA
Sbjct: 26  IRQPVAAGTFYPADPEALSRLITDLTRQAEKTAVKTPAEKPLCALILPHAGYVYSGLTAA 85

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +    +      RV ++ P H     +CA++ A   +TPIG +P+  +    LK +  F 
Sbjct: 86  HGALALKNRRFKRVVVMAPDHCAGIGQCAVTGADACRTPIGIIPVGTKTAARLKQSKLFV 145

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
                 +  EH++E+ LP+L    +     +VP++ G V  +   + G +    V D + 
Sbjct: 146 TTPPASENREHAVEVVLPFLQAWLKD--FSLVPVITGPVAPQR--LAGTI--DSVLDETT 199

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSD  H+ S+      D++               +++I  G   +    L+   + 
Sbjct: 200 LLVASSDLSHYLSQARAREKDRET--------------INMILQGRTGS----LVSEPDR 241

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CG  PI   LH+    + +  +  L Y  S       D  V YA+ A
Sbjct: 242 ACGAIPIETVLHLAKRHNWQPVL--LHYSNSGDTAGPTDRVVGYAAVA 287


>gi|157363361|ref|YP_001470128.1| hypothetical protein Tlet_0497 [Thermotoga lettingae TMO]
 gi|157313965|gb|ABV33064.1| protein of unknown function DUF52 [Thermotoga lettingae TMO]
          Length = 279

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 81/293 (27%), Positives = 124/293 (42%), Gaps = 35/293 (11%)

Query: 5   RRASHAGSWYTDNSKRLA----EELDGWLREAGLPKSPEV-----RGVIAPHAGYSYSGR 55
           R A  AGS+Y  + +RL     E     L    +P  P+        VI PHAGY YSG 
Sbjct: 3   RTAVAAGSFYPASPERLKFSIEEAFTSSLGPGHIPDKPDQPLKKHSSVIVPHAGYIYSGP 62

Query: 56  AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+A+  I        V L+GP+H  Y     +     + TP G+L +  +  E     
Sbjct: 63  VAAHAYAEISKLGKPELVVLVGPNHTGYGRPIGVYNRGTWVTPFGELHVSTDAAEIFLQY 122

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV---NAENEAMYGRLFAK 171
                 D     +EHS+E+ LP+L  +F+    +I+PI V  +   + E  A+     A+
Sbjct: 123 CDEANADFKSHISEHSIEVQLPFLQYLFDD--FQILPIAVFPIFIKSCEKIAIALDHIAE 180

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
             + PS  F  ++DF          HY+     + K     D+M +D I T DP      
Sbjct: 181 --NFPSTLFVFTTDF---------NHYESDEITVKK-----DQMVIDKILTKDPTGLYSV 224

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           + +   T+CG   ++ FL M      ++    L++  S     +R   V YAS
Sbjct: 225 VAKEKVTMCGLSVVTSFLFMKHFGLPRL----LKHATSGDVTGERSQVVGYAS 273


>gi|239618223|ref|YP_002941545.1| hypothetical protein Kole_1859 [Kosmotoga olearia TBF 19.5.1]
 gi|239507054|gb|ACR80541.1| protein of unknown function DUF52 [Kosmotoga olearia TBF 19.5.1]
          Length = 277

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/297 (26%), Positives = 129/297 (43%), Gaps = 41/297 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEEL----DGWLREAGLPK-----SPEVRGVIAPHAGYSYSGR 55
           R    AG +Y  N++ L +++    D  +    LP      + +  G+I PHAGY YSG 
Sbjct: 3   RNPVFAGRFYASNAEELKKQITACYDHPVGPGELPGPVFSFALKNAGLITPHAGYMYSGP 62

Query: 56  AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+ +  +      R + ++GP+H  Y  + ++     + TP+G L +D  ++ EL   
Sbjct: 63  VAAHGYLELSKIGKPRKIIIIGPNHTGYGARLSIWPEGSWHTPLGTLRIDESLVGELVRN 122

Query: 115 GKFELM-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPIL-----VGAVNAENEAMYGRL 168
            + EL  D      EHS+E+ LP++  +F+     IVPI+     + AV    EA+   L
Sbjct: 123 SQGELKPDTSAHLYEHSIEVQLPFIQHIFDNDP-TIVPIIMTDQSINAVRTLVEAISSIL 181

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
                 +   F   SSD           HYD     + K     D++ +  + T D D  
Sbjct: 182 ----KKEEGIFVIASSDMN---------HYDNHETTLRK-----DELLIKALLTKDIDRI 223

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            +   E   T CG  PI+V L +        ++K L++  S      +  +V Y SA
Sbjct: 224 YQVARENRITACGLGPIAVVLKLFD------ELKILKHATSGDVSGDKYHTVGYLSA 274


>gi|389860859|ref|YP_006363099.1| hypothetical protein TCELL_0536 [Thermogladius cellulolyticus 1633]
 gi|388525763|gb|AFK50961.1| hypothetical protein TCELL_0536 [Thermogladius cellulolyticus 1633]
          Length = 290

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 76/303 (25%), Positives = 134/303 (44%), Gaps = 38/303 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSP--------EVRGVIAPHAGYSYSGR 55
           R  + AG +Y  + K L   ++  +  + G  K P         V G + PHAGY YSG 
Sbjct: 7   RYPAVAGFFYPASRKELVVSIEQSFTHKLGPGKKPVVAASRNKSVVGFVVPHAGYMYSGP 66

Query: 56  AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+A+  +    +   V +LG +H  Y    ++     + TP+G++ +D E+ ++L   
Sbjct: 67  VAAHAYLELASAGVPETVVILGTNHTGYGALVSVYPGGTWVTPLGEVRVDSELAKKLVEF 126

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
             F  +D      EHS+E+ +P++  ++E   +KI+P+++G    +          K V 
Sbjct: 127 SGFAELDEEAHIEEHSVEVQVPFIQYMYEDR-VKILPVVIGLHTVDVARDLALSLKKAVS 185

Query: 175 DPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           +      V  SSDF          HY+ +H  + K +EA+     + I  GD + F + +
Sbjct: 186 ELGRDVVVLASSDFN---------HYEPQHVTVAKDMEAI-----NYILRGDSEGFYRAI 231

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKI------KIKFLRYEQSSQCKTKRDSSVSYASAA 286
            E + ++CG   I   +       TK+      +I  L++  S        + V YAS  
Sbjct: 232 TEKNISVCGPGGIMTLIEY-----TKLYEKYTPRITLLKHATSGDVTGDYSAVVGYASVK 286

Query: 287 AKV 289
            +V
Sbjct: 287 FEV 289


>gi|313673595|ref|YP_004051706.1| hypothetical protein Calni_1636 [Calditerrivibrio nitroreducens DSM
           19672]
 gi|312940351|gb|ADR19543.1| protein of unknown function DUF52 [Calditerrivibrio nitroreducens
           DSM 19672]
          Length = 265

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 110/240 (45%), Gaps = 19/240 (7%)

Query: 44  IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
           I PHAGY YSG  A      I+     RV L+GP+H  +  + ++     ++TP GD+ +
Sbjct: 40  IVPHAGYIYSGEVAFRVLSAINIKR--RVILMGPNHTGFGQRVSIYPGGSWETPFGDIDI 97

Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
           D E++E     G     D      EHS+E+ LP L  + E    KIVPI +  +  + E 
Sbjct: 98  DEELVERFVNVGL--KTDTLAHLKEHSLEVILPILKYLNEN--CKIVPITISYLRYD-EC 152

Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
           M   + A+ + D  N+F     F    S FN  H++ +   + K     DK  +D I   
Sbjct: 153 M---MVAEKILDVVNYFKDEVTFV-ISSDFN--HFEDEKTTLLK-----DKYAIDAILRL 201

Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           DP+     + + D ++CG  P ++ L +  N   + K + L +  S +     D  V YA
Sbjct: 202 DPEGLYNVVRDKDISMCGVIPSTIGLIVAKNLGAQ-KGELLIHTTSGRVSGDYDRVVGYA 260


>gi|319790446|ref|YP_004152079.1| hypothetical protein Theam_1477 [Thermovibrio ammonificans HB-1]
 gi|317114948|gb|ADU97438.1| protein of unknown function DUF52 [Thermovibrio ammonificans HB-1]
          Length = 268

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/286 (25%), Positives = 129/286 (45%), Gaps = 30/286 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y   ++ L   L  + R   +PK  + + VI PHAGY YSG  A   +  
Sbjct: 2   VRYPAVAGQFYPGTAQELKLYLSSFCRR-DVPKI-KAKAVIVPHAGYIYSGAVAGATYSR 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +    ++   ++GP+H       ++    V+ TP+G++P++ E+  EL +   +E  D  
Sbjct: 60  VVIPEVN--VIMGPNHTGLGRPASVYPEGVWVTPLGEVPVNGEIASELTSRYPYE-ADPS 116

Query: 124 VDEAEHSMEMHLPYL--AKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN-FF 180
               EHS+E+ +P+L     F   L  IVP++   +   +    G   A+ + D  +   
Sbjct: 117 AHIYEHSLEVQVPFLQFCSGFNEEL-SIVPVVFQHLPYRDCVAAGEALAQVLADREDALM 175

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +S+DF H+ S+              +  + LD + +D I   +P+   + +  Y+ T+C
Sbjct: 176 VISTDFSHYVSQ--------------EEAKRLDSLAIDAILDLNPEELYRRVHAYNITMC 221

Query: 241 GRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           G  P +V L    +LG    ++    + Y  S       D  V YA
Sbjct: 222 GVIPATVGLVAAKLLGAQGAEL----VDYRTSGDVTGDYDQVVGYA 263


>gi|146304886|ref|YP_001192202.1| hypothetical protein Msed_2139 [Metallosphaera sedula DSM 5348]
 gi|189039506|sp|A4YIM6.1|Y2139_METS5 RecName: Full=MEMO1 family protein Msed_2139
 gi|145703136|gb|ABP96278.1| protein of unknown function DUF52 [Metallosphaera sedula DSM 5348]
          Length = 282

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 73/291 (25%), Positives = 128/291 (43%), Gaps = 31/291 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSGR 55
           RR + AGS+Y D+S +L + ++         G +   G   S      I PHAGY YSG 
Sbjct: 3   RRPAVAGSFYEDDSAQLRKRIEWAFHHPIGPGGIPSVGSTGSRSNPIFIVPHAGYIYSGP 62

Query: 56  AAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            AA+++  +        V +LGP+H  Y  + ++     ++TP+G   ++ ++++EL + 
Sbjct: 63  VAAHSYYYLAQEGKPDIVIILGPNHTGYGSQVSIWPGGDWETPLGSAQVNAQLVKELVSV 122

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
            +   +D      EHS+E+ LP+L   F+   + I+P+++     E          +++ 
Sbjct: 123 SEVVDIDEKAHLYEHSIEVQLPFLQYFFDN--LSILPVVILMQTPEIAEFVAEGIWRFIQ 180

Query: 175 ---DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
              D       SSD  H+      M  D+   ++ + I+ +D  G+            K 
Sbjct: 181 RHSDKDIVVLASSDLNHYDPHDVTMTKDE---LVIRKIQDMDYKGL-----------YKV 226

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
           + EYD T+CG  PI   L +      K  I  L++  S      + S V Y
Sbjct: 227 VEEYDVTVCGYAPIMASLILAKKMHKKPYI--LKHATSGDTSGDKSSVVGY 275


>gi|20094399|ref|NP_614246.1| dioxygenase [Methanopyrus kandleri AV19]
 gi|23822318|sp|Q8TWR9.1|Y963_METKA RecName: Full=MEMO1 family protein MK0963
 gi|19887476|gb|AAM02176.1| Predicted dioxygenase [Methanopyrus kandleri AV19]
          Length = 283

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 124/292 (42%), Gaps = 29/292 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVR-------GVIAPHAGYSYSGRA 56
           R  + AG +Y  + + L + ++   R E G    PE         GV+APHAGY +SG  
Sbjct: 3   RSPAVAGQFYPADPEELRKMIEWCFRHELGPGDLPETNDGPCTLPGVVAPHAGYQFSGPV 62

Query: 57  AAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           AA+ +  + +  +   V +LGP+H       A  T   ++TP+G + +D E    L    
Sbjct: 63  AAHTYKVLAESGTPETVVILGPNHTGLGSAVATMTDGAWRTPLGSVEIDSEFATALVRKC 122

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD- 174
                D+     EHS+E+ LP+L  V+ G   + VP+ +   + +     G       + 
Sbjct: 123 GVMDDDLTAHANEHSIEVQLPFLQYVY-GESFRFVPVCMAMHDLQTAREVGEAIVDVAEE 181

Query: 175 -DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM-DIIETGDPDAFKKYL 232
            D +     S+DF H+         D+K   + + I ALD+ GM +I+E           
Sbjct: 182 LDRNTVVIASTDFTHYEPHDQAQKKDRK---VIERITALDEAGMIEIVE----------- 227

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             Y+ ++CG  P +  +  +       + + L+Y  S          V YA+
Sbjct: 228 -RYNVSMCGVGPTAATIVAVKAMGAS-EGELLKYATSGDVSGDYSQVVGYAA 277


>gi|344345281|ref|ZP_08776135.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Marichromatium purpuratum 984]
 gi|343803110|gb|EGV21022.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Marichromatium purpuratum 984]
          Length = 260

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 67/220 (30%), Positives = 103/220 (46%), Gaps = 15/220 (6%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M + R  + AG +Y  + + L  E+DG+L  A     P  + ++ PHAGY YSG  A +A
Sbjct: 1   MPQTRPPAVAGRFYPASPEALGAEIDGYLAVAA--TGPAPKAIVVPHAGYRYSGAIAGHA 58

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  +      I +V LLGP+H       A S AT + TP+G++ LD   I +  A     
Sbjct: 59  YAALGEARARIRQVVLLGPAHRVAVRGVATSAATAFATPLGEVALDQVAIAQALAFDWVH 118

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV-NAENEAMYGRLFAKYVDDPS 177
            ++    E EHS+E+ LP+L +  +     +VP++VG    AE  A+   L+        
Sbjct: 119 TLEAAHAE-EHSLEVQLPFLQRALDD--FSLVPLVVGETPTAELVALLEALW----GGAE 171

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
               VSSD  H+  R      D+       +IE LD   +
Sbjct: 172 TLIVVSSDLSHYQLRAEAQALDRA---TSTAIERLDAAAI 208


>gi|149050676|gb|EDM02849.1| similar to RIKEN cDNA 0610016J10 gene, isoform CRA_c [Rattus
           norvegicus]
          Length = 79

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 41/78 (52%), Positives = 49/78 (62%), Gaps = 3/78 (3%)

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQ 271
           MGM IIE  DP +F  YL +Y NTICGRHPI V L+    L      +   FL Y QSSQ
Sbjct: 1   MGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQKSGMNMSFSFLNYAQSSQ 60

Query: 272 CKTKRDSSVSYASAAAKV 289
           C++ +DSSVSYA+ A  V
Sbjct: 61  CRSWQDSSVSYAAGALTV 78


>gi|257076981|ref|ZP_05571342.1| hypothetical protein Faci_07956 [Ferroplasma acidarmanus fer1]
          Length = 271

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 72/289 (24%), Positives = 125/289 (43%), Gaps = 22/289 (7%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLP--KSPEVRGVIAPHAGYSYSGRAAA 58
           M+  R    +G +Y D+ + L + + G + +  +P  +  ++ G I PH  Y YS   AA
Sbjct: 1   MDNTREPHVSGGFYPDDYEALMDSIKGSM-DITMPDIRYKKLIGAIVPHGQYMYSSHTAA 59

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           YA+  I  +      ++GP+H  +    A+     + TP+G   ++ E+ E L       
Sbjct: 60  YAYKIIQESKKRTFLIIGPNHSKFPAYPAIYPDGYWSTPLGYAKVNSELSERLLLKSDII 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             D      EHS+E+ LP+L  +F+ +     P+++G   A              + P  
Sbjct: 120 HDDKEAHITEHSIEVQLPFLQYLFKSNF-TFTPLILGNQGAAIARNIAETIESLEEKP-- 176

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
           F  +SS+F H      Y+ YDK +         LD++ ++ I+      F + + +Y  T
Sbjct: 177 FVLISSNFNH------YLPYDKNN--------ELDQILINDIQEMRTLKFYQDIEKYGIT 222

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
            CG   I++   ML   + K KI  L +  S      R   V Y++  A
Sbjct: 223 ACGYGAIAIL--MLITKNMKGKIALLNHSNSGDYSKDRKRVVGYSAMIA 269


>gi|406894537|gb|EKD39328.1| hypothetical protein ACD_75C00439G0006 [uncultured bacterium]
          Length = 267

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 124/284 (43%), Gaps = 26/284 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG--VIAPHAGYSYSGRAAAYAF 61
           +R+ + A  +Y  ++K L   +   L    +P S +++G  V++PHAGY YSG  AA  F
Sbjct: 1   MRQPAVADRFYPGSAKALGLAVTELL--PAVPDSGKIKGLAVVSPHAGYVYSGALAAETF 58

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +       V +LGP+HH      ALST T +  P+G +P+D E+ + + A  +   +D
Sbjct: 59  SAV--VIPETVIILGPNHHGQGAPVALST-TSWDMPMGSVPIDRELTDLILAHSQHIKVD 115

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
               + EHS+E+ +P+L ++ E HL  IVP+++  +            AK +        
Sbjct: 116 ELAHKYEHSLEVQVPFLQRLQE-HL-SIVPLVISHITYPLCEDVAATLAKAIRISGRDIL 173

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSD  H+ SR                 E  D++ +  IE  +P      +     ++
Sbjct: 174 IVASSDMNHYESR--------------PKTEKKDRLALQCIEQLNPYDLYHTVHSQRISM 219

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           CG  P+ + +     C    K   + +  S       D  V YA
Sbjct: 220 CGVIPVVIAMLAALACGAS-KYHLVGHTDSGYVSGDTDQVVGYA 262


>gi|325982175|ref|YP_004294577.1| hypothetical protein NAL212_1534 [Nitrosomonas sp. AL212]
 gi|325531694|gb|ADZ26415.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Nitrosomonas sp. AL212]
          Length = 271

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 94/207 (45%), Gaps = 17/207 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  IR  +  G +Y  +  +L +++  +L E       + + +I PHAGY YSG  AA A
Sbjct: 9   MATIRSPAVTGLFYPADEHQLRQDVQ-FLLEKANQHDLKPKALIVPHAGYQYSGAIAATA 67

Query: 61  FGN--IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           + +  +   +I RV LLGP+H       AL    V+ TP+G + +D E++  +       
Sbjct: 68  YASLRVAAANIQRVVLLGPAHRLAMQGLALPGVDVFTTPLGGVNVDTELVNAIA-----N 122

Query: 119 LMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
           L  + ++ A    EHS+E+ LP+L  V        +P+ +G  +AE  A    +      
Sbjct: 123 LPQVSINRAAHALEHSLEVQLPFLQSVLNE--FTFLPLAIGRASAEEVA---EVLDYLWG 177

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKK 201
                  +SSD  H+       H D +
Sbjct: 178 GEETLIVISSDLSHFLPYATAQHIDNQ 204


>gi|296243113|ref|YP_003650600.1| hypothetical protein Tagg_1388 [Thermosphaera aggregans DSM 11486]
 gi|296095697|gb|ADG91648.1| protein of unknown function DUF52 [Thermosphaera aggregans DSM
           11486]
          Length = 287

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 76/294 (25%), Positives = 127/294 (43%), Gaps = 29/294 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGW----------LREAGLPKSPEVRGVIAPHAGYSYSG 54
           RR      ++  N K   E L  W          L E    +  E  G +APHAGY YSG
Sbjct: 3   RRHPAVAGYFYPNDKHELEALIKWSFTHKIGPGKLPETSPSRRRETIGYVAPHAGYVYSG 62

Query: 55  RAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ +  +    +   V ++G +H  Y    ++    V++TP+G L +D E+ + L  
Sbjct: 63  PVAAHTYYQMALDGVPETVVVIGTNHTGYGALVSVYPGGVWETPLGLLEVDSELAKALAD 122

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
                 +D      EHS+E+ LP++  ++ G  ++I+P+++G    +      R  AK +
Sbjct: 123 KSSIAELDEYAHLEEHSVEVQLPFIQYIYGGK-VRILPVVMGLHTPD----VAREVAKSL 177

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
            D ++  S+  D     S  ++ HYD     + K  EA+      +I   D   F   LL
Sbjct: 178 LDAAS--SLKRDIIIVASS-DFNHYDPHDVTVRKDGEAV-----SMILKLDSVGFYNTLL 229

Query: 234 EYDNTICGRHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             D ++CG   I V +    + G    + ++  L+Y  S      +   V YAS
Sbjct: 230 REDVSVCGPGGIMVLIEYAKLAGGGGARAEL--LKYATSGDVSGDKSHVVGYAS 281


>gi|315425791|dbj|BAJ47445.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
 gi|343484630|dbj|BAJ50284.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 284

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 66/293 (22%), Positives = 127/293 (43%), Gaps = 26/293 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSP-------EVRGVIAPHAGYSY 52
           M+  RR + AG +Y  + + L  +++  +L   G  K+P           +I+PHAG  Y
Sbjct: 1   MKPRRRPAVAGYFYEGSREALLRQVEHCFLSPHGPGKTPAREWGKRRAPALISPHAGLMY 60

Query: 53  SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
           SG  AA+ +  +   ++   V + GP+H+      ++     + TP+G++ +D ++  E+
Sbjct: 61  SGPVAAHGYYELTKYAVPESVVVFGPNHYGVGTVVSIYPGGSWVTPLGEVKIDEKLAAEI 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
               +F  +D      EHS+E+ LP+L  ++     + VPI +   + E     G    +
Sbjct: 121 AGQREFFYLDEVSHSREHSIEVQLPFLQYLYGD--FQFVPICINDQSLETCVEIGEAVGE 178

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
            VD  +     S+DF H+                H+++   D++ ++ IE  D +     
Sbjct: 179 VVDGRNILMIASTDFTHYEP--------------HETVLKKDRLALERIERLDVEGLYGV 224

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +  +D T+CG   ++  L         ++   L+   S        S V YA+
Sbjct: 225 IERHDITMCGYGAVAALLTAAKKLGA-VEATVLKQATSGDTGGDYGSVVGYAA 276


>gi|281412147|ref|YP_003346226.1| hypothetical protein Tnap_0717 [Thermotoga naphthophila RKU-10]
 gi|281373250|gb|ADA66812.1| protein of unknown function DUF52 [Thermotoga naphthophila RKU-10]
          Length = 277

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 130/299 (43%), Gaps = 45/299 (15%)

Query: 5   RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
           R+ + AG +Y      L E+     LD  +    LP   E +     G+++PHAGY YSG
Sbjct: 3   RKPAVAGLFYPSRKDELIEQIRMCFLDKRIGPGKLPVPVETKLQNPIGLVSPHAGYIYSG 62

Query: 55  RAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
             AA+ F       +P   S V ++GP+H        +     ++TP+G +P++   +E 
Sbjct: 63  PVAAWGFLEAAKFGEP---SVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNERAVEI 119

Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV----GAVNAENEAMYG 166
           +    ++   D      EHS+E+ +P+L  VF    + IVPI +     AV  +  +   
Sbjct: 120 ILNNSRYAEEDFMSHIREHSIEVQIPFLQFVFGE--VSIVPICLMDQSPAVAEDLASALA 177

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
           +L A++   P      S+D           HY+ +   + K     D   ++ IE  DP 
Sbjct: 178 KLVAEF---PGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIEGMDPS 220

Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
              +YL++ D ++CG   ++  L+M  N       + L++  S      +   V Y SA
Sbjct: 221 LLYEYLVKEDISMCGYGGVATLLNM--NFKNA---RILKHATSGDVSGDKLEVVGYLSA 274


>gi|212702288|ref|ZP_03310416.1| hypothetical protein DESPIG_00299 [Desulfovibrio piger ATCC 29098]
 gi|212674281|gb|EEB34764.1| Memo-like protein [Desulfovibrio piger ATCC 29098]
          Length = 289

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 70/297 (23%), Positives = 127/297 (42%), Gaps = 34/297 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--------EVRGVIAPHAGYSYSGR 55
           +R+ + AG +YTD    L  E++ +L++     +          + G++ PHAG+ YSGR
Sbjct: 3   LRQPAVAGRFYTDVPADLRAEVESFLKQGAALPASALAAEDAAHLAGLMLPHAGHVYSGR 62

Query: 56  AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
                  ++    + R VFLL P+H       ++  A  ++TP+G +P+D E+   L   
Sbjct: 63  VIGATLAHV---RLPRTVFLLCPNHTGLGTPLSVWPAGAWQTPLGPVPVDGEMARLLCEA 119

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV- 173
            +    D+     EHS+E+ LP+L        + +VP+ VG   A      G+  A+ + 
Sbjct: 120 DEDFSADVMGHVREHSIEVLLPFLQVCAGDAPLHVVPVCVGTGQAPVLRTAGQRLARVLE 179

Query: 174 -----DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
                D       VSSD           HY+ +   + K     D + ++    GD DA 
Sbjct: 180 RCRDRDGQRPLLLVSSDM---------NHYEDQQTTLRK-----DDLALEAALKGDADAL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCS--TKIKIKFLRYEQSSQCKTKRDSSVSYA 283
              +     ++CG  P+++  + L   +    ++   + +E S+      +  V YA
Sbjct: 226 LATVARAGISMCGAAPLALAFYALHAWTPDATVRAALVMHETSAAVSHDTEHVVGYA 282


>gi|83648587|ref|YP_437022.1| dioxygenase [Hahella chejuensis KCTC 2396]
 gi|83636630|gb|ABC32597.1| predicted dioxygenase [Hahella chejuensis KCTC 2396]
          Length = 259

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 98/189 (51%), Gaps = 11/189 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           +R+ + +G +Y  N++ L+E +  ++  +     SP  + +IAPHAGY YSG  A  A+ 
Sbjct: 3   VRKPAVSGLFYPANAEDLSETVSRYIATSPSFDHSP--KAIIAPHAGYVYSGAIAGVAYS 60

Query: 63  NIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
            +  ++  IS+V LLGPSH       A  ++  + TP+G + +D + + +L +  +   +
Sbjct: 61  ALHNSAKRISKVVLLGPSHRVGFRGIAAPSSDAFSTPLGAIAIDADNLVKLASLPQVVTL 120

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
           D      EHS+E+HLP+L +  +    ++ P+++G  +AE   +   +            
Sbjct: 121 D-SAHAQEHSLEVHLPFLQQCLD--CFELTPLVIGDADAE---LVAEVLELLWGGDETLI 174

Query: 181 SVSSDFCHW 189
            +S+D  H+
Sbjct: 175 VISTDLSHY 183


>gi|198282863|ref|YP_002219184.1| hypothetical protein Lferr_0726 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|198247384|gb|ACH82977.1| protein of unknown function DUF52 [Acidithiobacillus ferrooxidans
           ATCC 53993]
          Length = 282

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 73/289 (25%), Positives = 121/289 (41%), Gaps = 34/289 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL----REAGLPKSPEVRGVIAPHAGYSYSGRAAA- 58
           +R A+ AG +Y   +  L  E++  L    ++     +P  + +I PHAGY YSG  AA 
Sbjct: 11  VRPAAVAGMFYPGEAAVLRTEVERLLARAEQDGEAASAPWPKAIIVPHAGYIYSGAVAAS 70

Query: 59  -YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
            YA        I RV LLGP+H       AL      +TP+G + +D   +E L    + 
Sbjct: 71  GYALLAKGRGHIRRVVLLGPAHRLPFRGLALPGVQAMQTPLGTVAVDQAGVEALAGLPEV 130

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             M       EH++E+ LP++ +V     + +VP++VG    +  A   R+  K      
Sbjct: 131 REMP-AAHAQEHALEVQLPFIQEVLGD--VSVVPLVVGDARPDEVA---RVLEKLWGGEE 184

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
               +SSD  H      Y  Y +   + H ++E + +             F    ++++ 
Sbjct: 185 TVIVISSDLSH------YHPYAEARAIDHHTVEEILR-------------FDPTPIDHEQ 225

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
             CG  PI+  L +       +  + +    S      RD+ V YA+ A
Sbjct: 226 A-CGATPINGLLPV--ARKHHLHPRLVGLCNSGDTAGSRDAVVGYAAVA 271


>gi|57641412|ref|YP_183890.1| hypothetical protein TK1477 [Thermococcus kodakarensis KOD1]
 gi|119391268|sp|Q5JJC3.1|Y1477_PYRKO RecName: Full=MEMO1 family protein TK1477
 gi|57159736|dbj|BAD85666.1| hypothetical protein, conserved, DUF52 family [Thermococcus
           kodakarensis KOD1]
          Length = 291

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 129/288 (44%), Gaps = 10/288 (3%)

Query: 4   IRRASHAGSWYTDNSKR--LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R  + AGS+Y  + +   + E     L E G      +   +APHAGY +SG  A+  +
Sbjct: 2   VRYPAVAGSFYPADEELVLMLERFFSDLGEEG--NDRRITAGVAPHAGYIFSGYTASRTY 59

Query: 62  GNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             I    +   F +LGP+H       A+     + TP+G++ +D E+  E+        +
Sbjct: 60  KAIFEDGLPETFVILGPNHTGLGSPIAVHPPGTWITPLGEIEVDGELAREIAKISGIADL 119

Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           D    + EHS+E+ +P++  + E  G  +KIVPI +G  + +     GR   +   +   
Sbjct: 120 DDLAHKYEHSIEVQVPFIQYLAEKAGKEVKIVPITLGIQDEDVAEDLGRAIFEASRELGK 179

Query: 179 FFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
              V  S+DF H+G  + Y+ +  +   +   I+  D   +  I   D     + + E +
Sbjct: 180 DVVVIASTDFMHYGQIYGYVPFRARADELPHRIKEWDFRIIRRILDFDVRGMFEEIREMN 239

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +T+CG   +   + +    +  ++ + L Y  S +     ++ V YAS
Sbjct: 240 HTMCGPGGVGTAI-VYSRLAGAVEAELLHYTTSFEVSRSTEAVVGYAS 286


>gi|332157855|ref|YP_004423134.1| hypothetical protein PNA2_0212 [Pyrococcus sp. NA2]
 gi|331033318|gb|AEC51130.1| hypothetical protein PNA2_0212 [Pyrococcus sp. NA2]
          Length = 292

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 117/250 (46%), Gaps = 6/250 (2%)

Query: 44  IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLP 102
           IAPHAGY +SG  A+  +  I    I   F++ GP+H       A+  +  + TP+G + 
Sbjct: 42  IAPHAGYIFSGYTASRTYKAIYEDGIPETFVIFGPNHTGLGSPIAVYPSGEWVTPLGSVK 101

Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAE 160
           +D ++ +E+    +   +D    + EHS+E+ LP++  + E  G  +KIVPI +G  + +
Sbjct: 102 VDSKLAKEIVKVSRIADLDDLAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDED 161

Query: 161 -NEAMYGRLF-AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
            +E++   +F A            S+DF H+GS + Y+ +  +   +   +   D   + 
Sbjct: 162 ISESLGKAVFEASRSLGRDVIILASTDFMHYGSFYGYVPFTGRPDELPNLVREGDMRVIR 221

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
            I   D D   + +   ++T+CG   +   + +    S   + + L Y  S +     D+
Sbjct: 222 RILEFDLDGMFEEIRRMNHTMCGPGGVGAAI-VFSKLSGAERAELLHYTTSFEVSRSTDA 280

Query: 279 SVSYASAAAK 288
            V YAS   +
Sbjct: 281 IVGYASIVMR 290


>gi|222099579|ref|YP_002534147.1| hypothetical protein CTN_0605 [Thermotoga neapolitana DSM 4359]
 gi|254806533|sp|B9K748.1|Y605_THENN RecName: Full=MEMO1 family protein CTN_0605
 gi|221571969|gb|ACM22781.1| Hypothetical Protein CTN_0605 [Thermotoga neapolitana DSM 4359]
          Length = 277

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 80/296 (27%), Positives = 128/296 (43%), Gaps = 37/296 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYS 53
           IR    AG +Y      L E+     LD  +    LP   E       G+++PHAGY YS
Sbjct: 2   IREPVVAGLFYPSRKDELIEQIRICFLDRRIGPGELPGPVEKNLQNPVGLVSPHAGYIYS 61

Query: 54  GRAAAYAFGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+ F  +      S V ++GP+H        +     ++TP+G +P++ E +E L 
Sbjct: 62  GPVAAWGFLEVARIGKPSLVVIIGPNHTGLGKPVGIWPEGFWETPLGRVPVNEEAVEILL 121

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV---GAVNAENEAMYGRLF 169
            + ++   D      EHS+E+ LP+L  VF      IVP+ +       AE+ A   R  
Sbjct: 122 NSSRYAEEDTLSHLKEHSIEVQLPFLQFVFGD--FSIVPVCLMDQSPTVAEDLAFAVREL 179

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
            K      N   ++S         +  HY+ +   + K     D + ++ IE  D     
Sbjct: 180 MKSF---RNVLIIAST--------DLNHYEDQKTTLKK-----DYLVVEAIEKRDSRLLY 223

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           +YL++ D ++CG   ++V L+ LG  S +I    L++  S          V Y SA
Sbjct: 224 EYLVKEDISMCGYGGVAVLLN-LGFSSVRI----LKHATSGDVSGDTLEVVGYLSA 274


>gi|46200973|ref|ZP_00056132.2| COG1355: Predicted dioxygenase [Magnetospirillum magnetotacticum
           MS-1]
          Length = 456

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 128/292 (43%), Gaps = 37/292 (12%)

Query: 1   MEKIRRASHAGSWY----TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRA 56
           M  IR  + AG +Y     + +++L+  LD  +      + P  + +IAPHAG+ YSG  
Sbjct: 1   MTAIRPTAVAGQFYPADFAEANRQLSAFLDHAVAAPCAGRRP--KALIAPHAGWVYSGPV 58

Query: 57  AAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA A+  + P   S SRV LLGPSH       AL ++  + +P+G + LD +    L   
Sbjct: 59  AAGAYALLRPFRGSWSRVVLLGPSHRVGFQGMALCSSDQWASPLGAVALDKD-WSRLAGV 117

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
               ++D    + EHS+E+H+P+L          ++P+++G  +A  + + G L A +  
Sbjct: 118 AGVGVLDQAHAQ-EHSLEVHVPFLQATIGD--FTLLPVVIG--DASPDMVAGLLDALWGG 172

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
           D      +S+D  H      Y+ YD+      +++ A++              F    + 
Sbjct: 173 D-ETLIVISTDLSH------YLPYDQCRDTDGQTVAAIEH-------------FDALAIS 212

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            D   CGR P+   L         ++I  L    S      +D  V Y S A
Sbjct: 213 RDGA-CGRIPVGGLLTAAKR--RGLEIVTLDVRNSGDTAGPKDRVVGYGSWA 261


>gi|94970386|ref|YP_592434.1| hypothetical protein Acid345_3359 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94552436|gb|ABF42360.1| protein of unknown function DUF52 [Candidatus Koribacter versatilis
           Ellin345]
          Length = 271

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 71/286 (24%), Positives = 119/286 (41%), Gaps = 24/286 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  + AG +Y  N ++L  ++  +       K   + G + PHAGY YSG  A   +  
Sbjct: 5   IREPAVAGRFYPGNPEKLTADIGDYTTPTNAEKLAAI-GCVVPHAGYMYSGHVAGAVYER 63

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +D     R  +L P+H       A+     ++TP+GD  +D E+ ++L A       D  
Sbjct: 64  LDLPK--RFVILCPNHTGAGHPLAVMREGSWRTPLGDAAIDAELADQLLAAFPLTSEDAD 121

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV- 182
               EH++E+ LP+L  +      + VP+ VG    +  +  G   AK V   +    V 
Sbjct: 122 AHRTEHALEVQLPFLQILVPN--FRFVPVAVGTGRFDVLSALGESIAKVVQSAAERVMVI 179

Query: 183 -SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM-DIIETGDPDAFKKYLLEYDNTIC 240
            SSD  H+ +  +    D+   +  + + ALD  G+ D++             E + ++C
Sbjct: 180 ASSDMNHYENDADTRVKDR---LAIERLLALDAKGLYDVVH------------EKNISMC 224

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           G  P    L          + + ++Y  S      RD  V YA  A
Sbjct: 225 GYGPAVAMLTAAKRVGAS-RAELIKYATSGDVSGDRDMVVGYAGIA 269


>gi|390939154|ref|YP_006402892.1| putative dioxygenase [Desulfurococcus fermentans DSM 16532]
 gi|390192261|gb|AFL67317.1| putative dioxygenase [Desulfurococcus fermentans DSM 16532]
          Length = 283

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/301 (26%), Positives = 125/301 (41%), Gaps = 43/301 (14%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLRE---AGLPKSPEVR------GVIAPHAGYSYS 53
           K R    AG +Y D  + L   ++   +     G P SP         G +APHAGY YS
Sbjct: 2   KKRSPIVAGYFYPDKPEELRSVIEWSFKHRIGPGKPPSPSDTPTTNSIGYVAPHAGYIYS 61

Query: 54  GRAAAYAFG----NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           G  AA+ +     N  P +I    +LG +H       ++    V++TP+GDL +D E+  
Sbjct: 62  GPVAAHVYFDMALNKKPDTI---VILGTNHTGLGRPVSVYPEGVWETPLGDLVVDDEIGR 118

Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
            +    +    D      EHS+E+ LP++  ++ G  +KI PI++G ++  + A   R  
Sbjct: 119 LIVENSEIAEFDEYAHLEEHSIEVQLPFIVYIYGGD-VKITPIVIG-IHTPDVA---RDL 173

Query: 170 AKYVDDPSN------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
           AK +   S           SSDF H+ S              H+     D M +D I   
Sbjct: 174 AKSIHKASMSTGKRIIVIASSDFNHYES--------------HEETSRKDSMAIDRILKL 219

Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           D D     +L  D +ICG   I   +  +     K ++  L+Y  S          V YA
Sbjct: 220 DTDGLYNVILHNDISICGPGGIMTLMEYIKKLGGKAQL--LKYATSGDTSGDYSHVVGYA 277

Query: 284 S 284
           +
Sbjct: 278 A 278


>gi|332796340|ref|YP_004457840.1| hypothetical protein Ahos_0655 [Acidianus hospitalis W1]
 gi|332694075|gb|AEE93542.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 284

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 135/299 (45%), Gaps = 39/299 (13%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPEVRG------VIAPHAGYSYS 53
           R  + AG++Y  + + L + +  W     +    LP  P+ +G       I PHAGY YS
Sbjct: 3   RLPAVAGAFYEADPQELKDRI-VWSFTHPVGPGKLPTVPKEKGKRDNLFFIVPHAGYIYS 61

Query: 54  GRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+++  +        V +LGP+H  Y    +L     ++TP+GD+ +D E+  EL 
Sbjct: 62  GPVAAHSYYYLASEGKPDLVIILGPNHTGYGSYVSLWNKGCWETPLGDVKIDEEMAMELV 121

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMYGR 167
              +   +D      EHS+E+ +P+L   F+   IKI+PI++     E      E +Y R
Sbjct: 122 KYSEVIDIDEQAHLYEHSVEVQIPFLQFFFDSE-IKIIPIVILYQTPEIAEYIAEGIY-R 179

Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM-DIIETGDPD 226
           L  K+ +      + SSD  H+      + Y K    I K IE LD  G+ D++E     
Sbjct: 180 LMQKHPEKDIVVLA-SSDMNHYDPYD--ITYKKDEMAIEK-IEQLDYKGLYDVVENK--- 232

Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
                    D T+CG  P+ V + +    + K  I  L++  S      + S V Y +A
Sbjct: 233 ---------DVTMCGYAPVMVAMILAKKFNKKPYI--LKHATSGDTSGDKSSVVGYLAA 280


>gi|292493574|ref|YP_003529013.1| hypothetical protein Nhal_3603 [Nitrosococcus halophilus Nc4]
 gi|291582169|gb|ADE16626.1| protein of unknown function DUF52 [Nitrosococcus halophilus Nc4]
          Length = 261

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/215 (28%), Positives = 100/215 (46%), Gaps = 12/215 (5%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M  +R+ + AG +Y  +   L  ++  + +      +P  + +IAPHAGY YSG  AA A
Sbjct: 1   MATLRQPAVAGLFYPADLVSLQTQIQNFFKTNDARGNPP-KAIIAPHAGYRYSGPVAASA 59

Query: 61  FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +  +      I RV LLGPSH       A S A  + TP+G +P++ + +++  A  +  
Sbjct: 60  YACLRKVQGRIHRVILLGPSHRVPFYGIATSRANGFATPLGIVPVNQDDLQQALALPQVR 119

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
            +D      EHS+E+ LP+L ++       +VP++VG  + +       LF         
Sbjct: 120 ALDEP-HALEHSLEVQLPFLQEILGD--FSLVPLVVGHSSPQEVKEVLDLFW---GSEET 173

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD 213
              VSSD  H+         D+      K+IEAL 
Sbjct: 174 LIIVSSDLSHYHDYTTAQQLDR---ATTKAIEALQ 205


>gi|289193137|ref|YP_003459078.1| protein of unknown function DUF52 [Methanocaldococcus sp. FS406-22]
 gi|288939587|gb|ADC70342.1| protein of unknown function DUF52 [Methanocaldococcus sp. FS406-22]
          Length = 287

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 132/301 (43%), Gaps = 33/301 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
           M KIR  + AG +Y  +   L + ++  +L + G PKS  V G       ++ PHAGY Y
Sbjct: 1   MTKIRYPAVAGLFYPSHPDELIDMIEQCYLHKFG-PKSMPVHGTYEKPIGLLCPHAGYVY 59

Query: 53  SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           SG   A+++      +D        +LGP+H       ++    +++TP+GD+  D E +
Sbjct: 60  SGPIQAHSYYELSKRVDALEEITAVILGPNHTGLGSGVSVMDG-IWRTPLGDVRCDEEFV 118

Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
           EEL    +   +D      EHS+E+ LP+L   ++      KIVPI +   + E     G
Sbjct: 119 EELWTKCEIVDLDETAHLNEHSIEVQLPFLKHLELLNVAKFKIVPICMMFQDYETAVEVG 178

Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
              AK   + +    +  SSD  H+  +       KK  ++ K           I+E  +
Sbjct: 179 YFIAKIAKELNRRVVIIASSDLTHYEPQ---EIASKKDAIVIKH----------ILEMNE 225

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            + ++  ++ Y+ ++CG  P+   L  +     K K   L Y  S        + V YAS
Sbjct: 226 KELYED-VINYNISMCGYGPVIAMLKAMKTLGAK-KANLLAYATSGDITGDYSAVVGYAS 283

Query: 285 A 285
           A
Sbjct: 284 A 284


>gi|376294830|ref|YP_005166060.1| hypothetical protein DND132_0038 [Desulfovibrio desulfuricans
           ND132]
 gi|323457391|gb|EGB13256.1| protein of unknown function DUF52 [Desulfovibrio desulfuricans
           ND132]
          Length = 269

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 70/276 (25%), Positives = 114/276 (41%), Gaps = 21/276 (7%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
           AG +Y      L   +DG+L  A   ++ +    + PHAGY +SG       G  D  S 
Sbjct: 8   AGRFYDAQPDELYAMVDGFLGLAEKRQAEQTLLAMVPHAGYVFSGAVCGKTLGTADLAST 67

Query: 70  SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
             V LLGP+H     + AL     +  P G L +D  + E L A     + D      EH
Sbjct: 68  --VLLLGPNHTGRGERFALWPDGSWLIPGGSLAVDTGLAEALLAADPAIMADTAAHMGEH 125

Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS--VSSDFC 187
           S+E+ LP+L ++       IVP+ V A + ++    GR   + + D     S  VSSD  
Sbjct: 126 SLEVVLPFLFRLNPA--TTIVPVCVSAPSLDSLERVGRAVGRVLADYPEPVSIVVSSDMS 183

Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
           H+              + H+    +D M ++ + T  P      +     ++CG  P+++
Sbjct: 184 HY--------------ISHEDAREMDGMALEPMMTLAPARLYDTVRSRRISMCGVLPMTM 229

Query: 248 FLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            L   G    + + + + Y  S +     +  V YA
Sbjct: 230 GLFAAGELGAE-RAELVAYATSGEVSGDFEQVVGYA 264


>gi|193213539|ref|YP_001999492.1| hypothetical protein Cpar_1900 [Chlorobaculum parvum NCIB 8327]
 gi|193087016|gb|ACF12292.1| protein of unknown function DUF52 [Chlorobaculum parvum NCIB 8327]
          Length = 287

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 74/287 (25%), Positives = 119/287 (41%), Gaps = 17/287 (5%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAYAF 61
            IR  + AG++Y  +   L   L     +   P S P  + VI PHAGY +SGR AA AF
Sbjct: 7   NIRNPAVAGAFYPADPSELNTLLAELFEKTSQPSSQPSPKAVIVPHAGYRFSGRVAARAF 66

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV---IEELKATGKFE 118
             ++   I    LLG +H       A+     ++TP+G +PLD  +   + EL  T  + 
Sbjct: 67  STLEGQHIRTAVLLGNAHAALFDSIAIDPHDGWRTPLGTVPLDKMMRIRLIELDPT-LYH 125

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             DI   + +H +E+ LP L         KI+P+L G   A             + D   
Sbjct: 126 QSDIA-HQRDHVLEVQLPLLQHALASGF-KILPVLFGQNPAGTYWQCAEALLSLLSDDDL 183

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
             + SSD  H      Y  Y     +  +++E L ++ +  +E  + +  ++ +   +  
Sbjct: 184 LIA-SSDLSH------YPTYHDAQAIDTRTLEHLVELDIQGLEQHEQEVMQRRIPGLETV 236

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQC-KTKRDSSVSYAS 284
            CG   +   L +        K K L Y  S    +  RD+ V Y +
Sbjct: 237 FCGPDAVKTVLEI--GSRLGWKGKLLSYRNSGDADQASRDAVVGYGA 281


>gi|320354378|ref|YP_004195717.1| hypothetical protein Despr_2282 [Desulfobulbus propionicus DSM
           2032]
 gi|320122880|gb|ADW18426.1| protein of unknown function DUF52 [Desulfobulbus propionicus DSM
           2032]
          Length = 268

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 116/285 (40%), Gaps = 30/285 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R  + A  +Y  N  +L   +D  +      +  +   V+ PHAGY YSG  A      +
Sbjct: 3   RMPAVADRFYPGNPDQLRAAMDMLVPVVAEERKRDALAVVMPHAGYVYSGATAGATISRV 62

Query: 65  D-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
             P +   V ++GP+HH      AL T   ++ P+G +P+D ++   +  +      D  
Sbjct: 63  RVPET---VLIMGPNHHGRGQALALGTEE-WRMPLGTVPIDRQLATAILQSSSIIREDQE 118

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFFS 181
             + EHS+E+ +P+L +V     + IVP++V  +  +      R  A  +     S    
Sbjct: 119 AHQYEHSLEVQVPFLQQVQPN--LAIVPLVVSMIPFDLCQTVARELAVAIGSLRRSVLMV 176

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+ SR       KK           D++ +D I   DP      +L    ++CG
Sbjct: 177 ASSDMSHYESR---QQASKK-----------DRLAIDRILAMDPQGLYATVLGQHISMCG 222

Query: 242 RHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
             P ++ L     LG  +       +RY  S +        V YA
Sbjct: 223 VIPATIALLAAQELGATTA----DLIRYTDSGEASGDTSQVVGYA 263


>gi|67470020|ref|XP_650981.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56467653|gb|EAL45595.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449704833|gb|EMD45002.1| Memo family protein [Entamoeba histolytica KU27]
          Length = 284

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 104/249 (41%), Gaps = 30/249 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----VIAPHAGYSYSGRAAAY 59
           IR+A  AG W++ N   LA E+D ++  A L K P ++G     I+PHAG+ YSG+ A Y
Sbjct: 6   IRKAIGAGRWFSANGNELANEVDHYINNA-LNKLPSIQGKILGCISPHAGFRYSGQTAGY 64

Query: 60  AFGNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            F  +   S        VF+LG SH       A+       TPI    +D E I      
Sbjct: 65  DFAALKRDSEINGKPDVVFILGFSHSSRFDCAAVMDGKAISTPIATTEIDNEAITMFCEG 124

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
             +          EHS E  LP++ +   G  +K+V +L+G   +E      +       
Sbjct: 125 RNYLKCFYKPHNGEHSAENELPFVQRALPG--VKVVMVLIGTHKSEVLEQVSQGLQAVCS 182

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA-FKKYLL 233
               +   SSD  H          D+ H ++ K+    DK  + + E  D      K+  
Sbjct: 183 KKKMYVIASSDMLH----------DESHNLVEKT----DKETIQLTEKMDIKGLLSKW-- 226

Query: 234 EYDNTI-CG 241
            Y+N I CG
Sbjct: 227 SYENQIYCG 235


>gi|383320694|ref|YP_005381535.1| dioxygenase [Methanocella conradii HZ254]
 gi|379322064|gb|AFD01017.1| putative dioxygenase [Methanocella conradii HZ254]
          Length = 264

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/286 (28%), Positives = 123/286 (43%), Gaps = 34/286 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y  + + L +++         PK  +  GV+ PHAGY+YSG  AA  + +
Sbjct: 1   MRNPAVAGMFYPGSKEALRQQIASLAPRQPTPKI-DAMGVVVPHAGYAYSGGVAARVYAS 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I+        +LGP H +     A+ST   +KTP+G + +  + I EL   G  +  +I 
Sbjct: 60  IE--GAQTFIILGPRHGWEGSAIAVSTEP-WKTPLGVVDVAHDFI-ELLPPGIIDRDEIA 115

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE------NEAMYGRLFAKYVDDPS 177
               EHS+E+ +P+L   FEG   +IVPI +G  + E      NE M      KY     
Sbjct: 116 -HSREHSLEVQVPFLQYFFEG--FRIVPIAIGLQDYETVSEVANELMMA--LEKY--PKK 168

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                SSDF H+         D +   + + IE LD +G           F   +     
Sbjct: 169 AVIVASSDFSHYEPVEAAKRKDSR---LIERIEKLDVVG-----------FYDEIARLSA 214

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           T CG  PI+  +        K + + L Y  S      R+  V YA
Sbjct: 215 TCCGYGPIAAMMLCCKRMGAK-RAELLAYATSGDVTGDRE-VVGYA 258


>gi|284162154|ref|YP_003400777.1| hypothetical protein Arcpr_1045 [Archaeoglobus profundus DSM 5631]
 gi|284012151|gb|ADB58104.1| protein of unknown function DUF52 [Archaeoglobus profundus DSM
           5631]
          Length = 263

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 110/248 (44%), Gaps = 24/248 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +RR + AG +Y  N + L   L  ++          V   ++PHAGY YSGR A   +  
Sbjct: 1   MRRPAVAGMFYPSNRESLLAMLKEFV---DYHPDDSVIACVSPHAGYVYSGRTAGRVYSL 57

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I    +    ++GP+H  Y    ALST T + TP+G++ +D+E ++ +    K   +D  
Sbjct: 58  I--PQVETYVIVGPNHTGYGSPVALSTDT-WITPLGEVEVDMEFVKAMPK--KIIDLDET 112

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFFS 181
               EHS+E+ +P+L  +  G   KIVPI +G  + E   E     + AK          
Sbjct: 113 AHRFEHSLEVQVPFLQYINMGRKFKIVPICMGMQDEETAREVAEEIITAKEETGKEIVVV 172

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+              +  +     D++ ++ I + D   F   +     ++CG
Sbjct: 173 ASSDMHHY--------------LPDEECRRRDRIVIEAILSMDVSKFYDTIYRMQASVCG 218

Query: 242 RHPISVFL 249
             PI+V +
Sbjct: 219 YGPIAVAM 226


>gi|89899212|ref|YP_521683.1| hypothetical protein Rfer_0398 [Rhodoferax ferrireducens T118]
 gi|89343949|gb|ABD68152.1| protein of unknown function DUF52 [Rhodoferax ferrireducens T118]
          Length = 274

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 62/233 (26%), Positives = 101/233 (43%), Gaps = 35/233 (15%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
           M  +R  + AG++Y   +  L  ++   L +A      + P  + +I PHAGY YSG  A
Sbjct: 1   MSLVRPPAVAGTFYPGQASVLLNDVTTLLTQARPGANLREPAPKALIVPHAGYIYSGATA 60

Query: 58  AYAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           A A+  +     SI RV LLGP H       AL     + TP+G + +D + +  +    
Sbjct: 61  ARAYAELAGGRASIQRVVLLGPVHRVPVRGLALPGVDAFATPLGRIEVDQDAVAAIA--- 117

Query: 116 KFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYG 166
              L  + V  A    EHS+E+ LP+L  V +    K+VP+ VG           EA++G
Sbjct: 118 --HLPQVVVSRAAHAQEHSLEVQLPFLQAVLDD--FKLVPLAVGDATPAQVAEVIEALWG 173

Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
                          +SSD  H      ++ Y     V  ++++ + ++   I
Sbjct: 174 --------GDETLIVISSDLSH------FLPYGTAQAVDSETVQNMMQLNSSI 212


>gi|386875004|ref|ZP_10117208.1| memo-like protein [Candidatus Nitrosopumilus salaria BD31]
 gi|386807164|gb|EIJ66579.1| memo-like protein [Candidatus Nitrosopumilus salaria BD31]
          Length = 276

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 118/287 (41%), Gaps = 22/287 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR------GVIAPHAGYSYSGRAA 57
           IR    AG +Y      L   +   +     P S  V       GVI PHAGY YSG  A
Sbjct: 2   IREPVVAGQFYPRRKDDLENMIKYCMEHKYGPGSKPVLSNEKIFGVICPHAGYVYSGPTA 61

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
            +++ +    +   V ++GP+H       A      +KTP+G + +D E   ++    KF
Sbjct: 62  CHSYMSFASQNPELVIIIGPNHFGIGKDVATMVDAQWKTPLGMVQIDSESARKISEISKF 121

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
             +D      +HS+E+ +P L ++      +I+PI++ A + E     G   ++     +
Sbjct: 122 VEIDEYSHSQDHSLEVQIPMLQEILSSDF-QILPIILRAQDMETAMDVGNAVSEIAKRKN 180

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                SSDF H+          +++   HK  +AL +  +DI    + + F + L     
Sbjct: 181 TIIVASSDFTHY----------EENSFAHKQDKALIEPILDI----EVERFYQVLRNKRV 226

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           T CG   ++  +    N     K + L Y  S        S V Y++
Sbjct: 227 TACGYGAMASTMIACKNLGAT-KGELLSYATSGDVSGDTKSVVGYSA 272


>gi|152990853|ref|YP_001356575.1| hypothetical protein NIS_1109 [Nitratiruptor sp. SB155-2]
 gi|151422714|dbj|BAF70218.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2]
          Length = 264

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 97/208 (46%), Gaps = 17/208 (8%)

Query: 41  RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
           R +I+PHAGY YSG  A  A   +  T+  RV ++GPSH  Y    + S    ++TP GD
Sbjct: 45  RAIISPHAGYVYSGFTANVAHRLLQNTAAKRVVVIGPSHRVYLSGISGSFYDAFETPCGD 104

Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
           L +D   +E L    KF +  +     EHS E+ +P++          ++ ++ G V+  
Sbjct: 105 LLIDTAYLEMLAQ--KFGIGFVPEAHQEHSTEVQMPFIHHYLPN--TSVIELVYGDVDPR 160

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHW--GSRFNYMHYDKKHGV-------IHKSIEA 211
           N AM   +  + + DP N   +S+D  H+   S+   +      GV       +H   EA
Sbjct: 161 NVAM---ICEEVLSDPHNVVVISTDLSHYYPLSKAKELDMHCLQGVAELDITELHNGCEA 217

Query: 212 LDKMGMDIIETGDPD-AFKKYLLEYDNT 238
             K+G++ I     D   +  +L+Y  +
Sbjct: 218 CGKIGVEAIILAAKDLGLRPRILDYRTS 245


>gi|57233975|ref|YP_182025.1| hypothetical protein DET1313 [Dehalococcoides ethenogenes 195]
 gi|57224423|gb|AAW39480.1| protein of unknown function /conserved hypothetical protein
           TIGR00296 [Dehalococcoides ethenogenes 195]
          Length = 438

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 75/288 (26%), Positives = 125/288 (43%), Gaps = 35/288 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +R AS AG +Y  + +++       E+ G  +E  L       G + PHAGY YSG  A 
Sbjct: 2   LRPASAAGRYYPASKEKIRSVIAPMEVKGVAKERYL-------GAVMPHAGYPYSGGVAM 54

Query: 59  YAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
                +D P +   V +LGPSH     + A+  + +++TP+G++ +D  +   +    + 
Sbjct: 55  AVASRLDLPET---VIILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSTLAHSIMKYCRH 111

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
              D    + EHS+E+ LP L + F+   IKIVP+ V    AE  A  G   A  + +  
Sbjct: 112 IKADPSAHQYEHSIEVQLPIL-QYFKPD-IKIVPLTVSFGKAETLAEIGHGIASALRETG 169

Query: 178 N--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
                  SSD  H+ S+ +           H+     D + +D I   D     + + + 
Sbjct: 170 REAIIIASSDMTHYESQAD----------AHQK----DSLALDAIIKLDAAQMLERIQQN 215

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
             T+CG  P++  L  +     K + + + Y+ S       D  V YA
Sbjct: 216 HITMCGFAPVAAMLTAVKEMGAK-RARVVAYQTSGDVTHHLDQVVGYA 262


>gi|315229903|ref|YP_004070339.1| dioxygenase [Thermococcus barophilus MP]
 gi|315182931|gb|ADT83116.1| hypothetical dioxygenase [Thermococcus barophilus MP]
          Length = 292

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 69/291 (23%), Positives = 129/291 (44%), Gaps = 8/291 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR  + AGS+Y    + L   L+ +  + G L    ++   +APHAGY +SG  A+  + 
Sbjct: 2   IRYPAVAGSFYP-TGEELVIMLERFFSDLGELGSERKITAGVAPHAGYVFSGYTASRTYK 60

Query: 63  NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I    +   F ++GP+H       A+     + TP+G + +D E+ + +        +D
Sbjct: 61  AIYEDGLPETFVIIGPNHTGLGSPVAVYPEGEWITPLGGVEVDAELAKAIVKNSSIADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
               + EHS+E+ LP++  + +  G  IKIVPI +G  + E     G+   +   +    
Sbjct: 121 ELAHKYEHSIEVQLPFIQYIADKAGRKIKIVPITLGLQDEEVAEDLGKAIFEASQELGRD 180

Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
             V  S+D  H+G  + Y+ +  +   +   I+  D   +  I   D       + + D+
Sbjct: 181 VVVIASTDMMHYGYMYGYIPFRARGEDLLGRIKEWDFRVIQRILEFDYKGMFDEIRKMDH 240

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           T+CG   ++  + +    +   + + L Y  S +     D+ V YAS   +
Sbjct: 241 TMCGPGGVAAGI-VFSRLADANEAELLHYTTSFEVSRSTDAIVGYASVVMR 290


>gi|406911104|gb|EKD50972.1| hypothetical protein ACD_62C00382G0002 [uncultured bacterium]
          Length = 499

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 104/228 (45%), Gaps = 20/228 (8%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           E IR+   AG++Y ++   L  ++  +L          VR V+ PHAGY YSG  AA AF
Sbjct: 44  EVIRQPIAAGTFYPNDPTALYHDIKRYLGAEPSLGYTNVRAVLVPHAGYVYSGAVAASAF 103

Query: 62  GNIDPTSISRVFLLGPSHH---YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
             + P   +RVF+L  SHH         +L+  + Y  P   +PLD  V++ELK    F 
Sbjct: 104 REVSP-DFTRVFILA-SHHNGDVTLSGVSLTEVSHYAIPGAKIPLD-GVVDELKKNPLF- 159

Query: 119 LMDICVDEAEHSMEM---HLPY---LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
                 +   H+M+M    LP+   L  + E     I+P++VGA++        ++  KY
Sbjct: 160 ----VHEPLAHTMQMIEVELPFLHALKGLSEKPEFTIIPMIVGALSKTQTEELAQILNKY 215

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKK--HGVIHKSIEALDKMGMD 218
            +D    F  S D  H+ +    +  D +    ++ +   ALD    D
Sbjct: 216 -NDAKTLFVFSVDLSHYFNAKEALTLDARTIDALMGRDTTALDSATTD 262


>gi|88604124|ref|YP_504302.1| hypothetical protein Mhun_2891 [Methanospirillum hungatei JF-1]
 gi|88189586|gb|ABD42583.1| protein of unknown function DUF52 [Methanospirillum hungatei JF-1]
          Length = 264

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 132/294 (44%), Gaps = 39/294 (13%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K R ++ AG +Y     RL + L    R   +  S +  GV++PHAGY YSG+ AA AF 
Sbjct: 2   KSRASTVAGMFYPGEPGRLRQLLSELFRN--VTSSGDAAGVVSPHAGYVYSGKTAAKAFA 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
             D        ++GPSH  + P C   +   ++TP+G L  D+E+            + +
Sbjct: 60  AFDEKFDGTFLVIGPSHRGF-PTCI--SQVPWETPLGMLEPDIEL---------GSCIHL 107

Query: 123 CVDE------AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
            VDE      +E+S+E+ +P++   F     +IVP+++G    +       +    ++  
Sbjct: 108 PVDERAMSYGSENSLEVQMPFIQYRFPRS--RIVPLMMGHQTLQEVHRISSIIITALEKY 165

Query: 177 SNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
                +  SSDF H      Y+  +K +   H +IEAL +M +       P+ F + +  
Sbjct: 166 QKPVKIVASSDFSH------YVSAEKAYRDDHYAIEALLQMNV-------PEFFSR-IQS 211

Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           ++ T CG  PI   + +L     K +   L Y  S +     D  V YA+ A +
Sbjct: 212 HNITACGYGPIGCMMEVL-RTRGKTRCDLLEYTTSGETSGDYDQVVGYAALAVR 264


>gi|339499622|ref|YP_004697657.1| hypothetical protein Spica_0999 [Spirochaeta caldaria DSM 7334]
 gi|338833971|gb|AEJ19149.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Spirochaeta caldaria DSM 7334]
          Length = 291

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 73/278 (26%), Positives = 117/278 (42%), Gaps = 44/278 (15%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR------GVIAPHAGYSYSGRAAA 58
           R    AG +Y D    L   L+ +   +G+ + PE R      G+I PHA +  SG    
Sbjct: 15  RSPVMAGMFYPDTDAALRIALNRYYEISGM-QGPETRTDCNPLGIIVPHAAWELSGLLLG 73

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-F 117
            AF  +    I +V LLGP H        +S +  ++TPIGDL +    ++EL +    F
Sbjct: 74  RAFSLVRKRPIEQVVLLGPLHASTQEGIYVSESDSFETPIGDLWVHKNKVDELLSCSTIF 133

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP- 176
           EL DI    +EHS+E+ LP++A  F    + I+PIL+G     +     R       D  
Sbjct: 134 ELNDI-PHLSEHSLEVVLPWIADAFPQ--VSIIPILIGTSRTNHIHALSRALELCFSDEL 190

Query: 177 -SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD--KMGMDIIETGDPDAFKKYLL 233
               F  +++           HYD          EA +  +  +++++TGD     +   
Sbjct: 191 DKTLFIATTNVS--------AHYDND--------EAFEQGRRFLELVQTGDGAQLLQLAQ 234

Query: 234 EYDNTICGRHPISVFL-------------HMLGNCSTK 258
           +   T CG   ++  L             H+LG  S++
Sbjct: 235 QGHITACGAAGVAALLQCRFWEVAPLPKPHLLGQASSR 272


>gi|338730664|ref|YP_004660056.1| hypothetical protein Theth_0876 [Thermotoga thermarum DSM 5069]
 gi|335365015|gb|AEH50960.1| protein of unknown function DUF52 [Thermotoga thermarum DSM 5069]
          Length = 281

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 81/301 (26%), Positives = 134/301 (44%), Gaps = 47/301 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-----KSPEVR------GVIAPHAGYSY 52
           IR    AG++Y  +  +L + +   +     P     K P  +      G+I PHAGY Y
Sbjct: 2   IRYPVVAGTFYPGSPTKLRQTIMDLITSPLGPGNLDFKVPVTQPLNKNVGIIVPHAGYVY 61

Query: 53  SGRAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           SG  A +A+  +    + +   V L+GP+H     K  +     + TP+G + +D +  +
Sbjct: 62  SGPIAVHAY--VAAARLGKPNLVVLIGPNHTGRGAKVGVWDKGSWLTPLGKVEVDEQASK 119

Query: 110 ELKATGKFELMDICVDE-----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEA 163
            L     FE  ++C  +      EHS+E+ LP+L   F+    KI+PI +  V+ +  + 
Sbjct: 120 LL-----FENCEVCKADFDSHLLEHSLEVQLPFLQFFFDE--FKILPISIFPVSIDLCKK 172

Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
           +   L A   +  +  F VS+DF          HY+ +   I K     D+M ++ IE  
Sbjct: 173 IAAGLDAIASEYKNTLFVVSTDF---------NHYENQDVTIKK-----DQMAIEKIEAK 218

Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           DP    + + +YD ++CG   ++ FL+M     T  K K L +  S          V YA
Sbjct: 219 DPIGLVEVVDKYDISMCGVSAVASFLYM----KTFGKPKLLCHATSGDVSKDFLQVVGYA 274

Query: 284 S 284
           S
Sbjct: 275 S 275


>gi|392374695|ref|YP_003206528.1| hypothetical protein DAMO_1639 [Candidatus Methylomirabilis
           oxyfera]
 gi|258592388|emb|CBE68697.1| conserved protein of unknown function [Candidatus Methylomirabilis
           oxyfera]
          Length = 267

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 118/289 (40%), Gaps = 37/289 (12%)

Query: 4   IRRASHAGSWYTDNSKRL---AEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           IRRA+ AGS+Y    + L   A +L  W  +     +    G + PHAGY YSGR A   
Sbjct: 2   IRRAAVAGSFYPGTPELLRAQAADLITWDLQ-----TVRALGAVVPHAGYIYSGRVAGAV 56

Query: 61  FGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           +  +   +   VF +LGP+H        +     ++TP+G + +D E+   +    +   
Sbjct: 57  YARL---TFPDVFVILGPNHTGIGAGAGIMADGTWETPMGQVSIDTELARAILQNSRTIE 113

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPS 177
            D      EHS+E+ LP L     G     VPI + +         G   A+ +   D S
Sbjct: 114 DDDLGHRREHSIEVQLPLLQA--HGRPFSFVPICLFSSEYAVCQDVGLAVAQAIAGSDRS 171

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                S+D  H+ SR              +  +A D++ ++ I   DP    + +   D 
Sbjct: 172 VLMVASTDMSHYVSR--------------EQAKAKDRLAIEAIVACDPQRLYQVVRREDI 217

Query: 238 TICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            +CG HP +  L     LG  S ++    + Y  S+       S V YA
Sbjct: 218 AMCGFHPTTALLIAARELGATSGEL----ISYATSADVTRDDSSVVGYA 262


>gi|347755627|ref|YP_004863191.1| putative dioxygenase [Candidatus Chloracidobacterium thermophilum
           B]
 gi|347588145|gb|AEP12675.1| putative dioxygenase [Candidatus Chloracidobacterium thermophilum
           B]
          Length = 421

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 81/318 (25%), Positives = 134/318 (42%), Gaps = 40/318 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +R A+HAG  Y D  + L + LD  L        G  +  ++ GV+APH       RA A
Sbjct: 106 VRPAAHAGVSYPDEPEALRQRLDAILAHTPACPLGAARPEQLVGVVAPHIDLRVGERAYA 165

Query: 59  YAFGNID------PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
            A+  I+      P      F++  + HY      +++   Y TP+G +P D E ++ L+
Sbjct: 166 PAYRLIEQLAATLPDGEPVTFVVLGTSHYGGDGLFVASRKDYATPLGAMPCDREFLDRLE 225

Query: 113 AT-GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-----IKIVPILVGAVN---AENEA 163
              G     D      EH++E    +L  +F+ HL     +++VPIL  +++   A +EA
Sbjct: 226 TRLGASISADDTPHRQEHAIEFQAVFLRHIFDRHLEAGRPVRMVPILCTSLHELYAADEA 285

Query: 164 MYGRLFAKYV------------DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA 211
              R  A+Y                     V  D  H G +F    +D +       +E 
Sbjct: 286 CRERTQAEYRAFIEALQATLAEQTHRTLLLVGGDLAHVGPKFGD-RFDAQTRA--AELER 342

Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
            D   +D +  GD  A   ++   ++   +CG  P+  +L  L         + L YEQ 
Sbjct: 343 ADTELLDYVAAGDSAAVLDHIARDEDARRVCGFPPLMAYLDALAAFG-PTDGQVLHYEQW 401

Query: 270 SQCKTKRDSSVSYASAAA 287
            +  T+  S+V+Y +AAA
Sbjct: 402 QERPTR--SAVTYGAAAA 417


>gi|256811124|ref|YP_003128493.1| hypothetical protein Mefer_1184 [Methanocaldococcus fervens AG86]
 gi|256794324|gb|ACV24993.1| protein of unknown function DUF52 [Methanocaldococcus fervens AG86]
          Length = 287

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 80/301 (26%), Positives = 133/301 (44%), Gaps = 33/301 (10%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
           M KIR  + +G +Y  +   L + ++  +L + G PKS  V G       ++ PHAGY Y
Sbjct: 1   MNKIRYPAVSGLFYPSHPDELIDMIEQCYLHKYG-PKSMPVHGNYKKPVGLLCPHAGYIY 59

Query: 53  SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           SG   A+++      +D        +LGP+H       ++    +++TP+GD+  D E I
Sbjct: 60  SGPIKAHSYYELSKRVDALEEITAVILGPNHTGLGSGVSVMDG-IWRTPLGDVKTDEEFI 118

Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
           +EL    +   +D      EHS+E+ LP+L   ++      +IVPI +   + E     G
Sbjct: 119 DELWKKCEIIDLDETAHLNEHSIEVQLPFLKHLELLNIAKFRIVPITMMLQDYETAVEVG 178

Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
              AK   + +    +  SSD  H+  +       KK  ++ K          DI+E  D
Sbjct: 179 YFIAKIAMELNRRVVIIASSDLTHYEPQEV---ASKKDAIVIK----------DILEM-D 224

Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             A  + ++ Y+ ++CG  P+   L  +     + K K L Y  S        + V YAS
Sbjct: 225 ERALYEDVVNYNISMCGYGPVIAMLKAMKTLGAE-KSKLLAYATSGDMTGDYSAVVGYAS 283

Query: 285 A 285
           A
Sbjct: 284 A 284


>gi|307721110|ref|YP_003892250.1| hypothetical protein Saut_1190 [Sulfurimonas autotrophica DSM
           16294]
 gi|306979203|gb|ADN09238.1| protein of unknown function DUF52 [Sulfurimonas autotrophica DSM
           16294]
          Length = 266

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 123/291 (42%), Gaps = 38/291 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR--------EAGLPKSPEVRGVIAPHAGYSYSGR 55
           IR+ + AG +Y  + + + +    + +        EA L   P  R +I PHAGY YS  
Sbjct: 3   IRKDAVAGQFYPASKEEIEKMFSHYNKIIDESIKDEAILNLKP--RAIIVPHAGYVYSAF 60

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            A  AF  +  +   RV ++GPSH  Y    +++    Y+TP+G+L +D ++ +EL    
Sbjct: 61  TANIAFRLLKNSHAKRVVVIGPSHRVYLNGTSVAEYDSYETPLGNLSIDKKLADEL--IE 118

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
           KF+L       +EHS E+ +P++          +V ++ G    E+    G++    + D
Sbjct: 119 KFDLHFQADAHSEHSTEVQMPFVKNYLPN--ASVVELVYG---NEDPVNLGKVINYLLKD 173

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
                 +S+D  H      Y   DK + +    +EA+ K+    +  G            
Sbjct: 174 EDTTVVISTDLSH------YYDIDKANQLDSICLEAVAKLNPTELHQG------------ 215

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
               CG+  I V   ++      +K   L Y  S+     +   V Y SAA
Sbjct: 216 -CEACGK--IGVEAMLIAAKENGLKPTLLDYRTSADASGDKSQVVGYMSAA 263


>gi|357632292|ref|ZP_09130170.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfovibrio sp. FW1012B]
 gi|357580846|gb|EHJ46179.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfovibrio sp. FW1012B]
          Length = 288

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 72/284 (25%), Positives = 119/284 (41%), Gaps = 25/284 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIA--PHAGYSYSGRAAAYAFG 62
           R+   AG +Y   +  L  E   +L EA +P   +   ++A  PHAGY YSG  A    G
Sbjct: 20  RQPVVAGRFYPGAAPALRREAGAFLAEAEVPAEADGPTLLAMVPHAGYVYSGSVAGRTLG 79

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFELMD 121
                    V LLGP+H     + A+  +  +  P  D+P+  ++  +L +A  +    D
Sbjct: 80  AARLAGT--VLLLGPNHTGRGKRLAVWPSGAWAVPGCDVPVAADLARDLLRAEARLS-PD 136

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EHS+E+ LP+L    +   ++IVPI V   + +         A  +   S   S
Sbjct: 137 AAAHLEEHSLEVLLPFL--CVKNPAVRIVPIAVAEPDPDVLRQVAGTMAGVLGQRSEPVS 194

Query: 182 --VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
             VSSD  H+              V H++ +  D + +D +   DPD   + + E   T+
Sbjct: 195 LVVSSDMSHY--------------VPHETAKRRDALALDRVLALDPDGLYRVVREAGITM 240

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           CG  P+ + L++      +  +    Y  S +     +  V YA
Sbjct: 241 CGVLPMVLGLYLAKALGAREAV-LAAYATSGEASGDYNQVVGYA 283


>gi|327400495|ref|YP_004341334.1| hypothetical protein Arcve_0598 [Archaeoglobus veneficus SNP6]
 gi|327316003|gb|AEA46619.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Archaeoglobus veneficus SNP6]
          Length = 261

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 112/248 (45%), Gaps = 26/248 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R    AGS+Y  N   L   L  ++R +     P +   +APHAGY YSGR A   +  
Sbjct: 1   MRHPVVAGSFYPSNPDSLLAMLAEFVRPS---PDPTIAACVAPHAGYMYSGRTAGKIYSL 57

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I P + + V +LGP+H  Y    A+ST T + TP+G++  D+E IE +        MD  
Sbjct: 58  I-PKAETYV-ILGPNHTGYGSMVAVSTDT-WLTPLGEIEPDVEFIEAMPKV--IVDMDEI 112

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFFS 181
               EHS+E+ LP+L  V +    KIVPI +G  + E   E  +  L A+          
Sbjct: 113 AHRYEHSIEVQLPFLQYVHKD--FKIVPICLGMQDEETAREVAHEILTAEERTGRKIVVI 170

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H+              +  ++    D   +D I + D   + + + +   ++CG
Sbjct: 171 ASSDMHHY--------------LPDRTCREADAKVIDAILSMDVARYYQTIYDMQASVCG 216

Query: 242 RHPISVFL 249
              I V +
Sbjct: 217 YGAIGVVM 224


>gi|407039725|gb|EKE39785.1| Memo family protein [Entamoeba nuttalli P19]
          Length = 284

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 68/248 (27%), Positives = 101/248 (40%), Gaps = 28/248 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----VIAPHAGYSYSGRAAAY 59
           IR+A  AG W++ N   LA E+D ++  A   K P ++G     I+PHAG+ YSG+ A Y
Sbjct: 6   IRKAIGAGRWFSANGNELANEVDHYINNA-FNKLPSIQGKILGCISPHAGFRYSGQTAGY 64

Query: 60  AFGNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            F  +   S        VF+LG SH       A+       TPI    +D E I      
Sbjct: 65  DFAALKRDSEMNGKPDVVFILGFSHSSRFDCAAVMDGKAISTPIATTEIDNEAITMFCEG 124

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
             +          EHS E  LP++ +   G  +K+V +L+G   +E      +       
Sbjct: 125 RNYLKCFYKPHNGEHSAENELPFVQRALPG--VKVVMVLIGTHKSEVLEQVSQGLQAVCS 182

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
               +   SSD  H          D+ H ++ K+    DK  + + E  D          
Sbjct: 183 KKKMYVIASSDMLH----------DESHSLVEKT----DKETIQLTEKMDIKGLLTR-WS 227

Query: 235 YDNTI-CG 241
           Y+N I CG
Sbjct: 228 YENQIYCG 235


>gi|410995995|gb|AFV97460.1| hypothetical protein B649_05730 [uncultured Sulfuricurvum sp.
           RIFRC-1]
          Length = 266

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 116/251 (46%), Gaps = 30/251 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV---------RGVIAPHAGYSYSGR 55
           R+ S AGS+Y ++S  +   +D +     L   P+V           VI PHAG+ YSG 
Sbjct: 4   RKMSVAGSFYPESSIEITTMIDYF--NTILESHPDVAARFDALHGNAVIVPHAGWVYSGF 61

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            A  AF  +  +    + ++GPSH       +++ +  Y+TP+G+L +D  ++EELK   
Sbjct: 62  TANIAFRILSHSLPKTIIVIGPSHKVGFEGVSIADSEFYQTPLGELEIDTALVEELKK-- 119

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY-VD 174
           +F L        EHS E+ +P++    +   +K+V +    V A  + +       Y ++
Sbjct: 120 QFALTTFETAHHEHSTEVQMPFIKHYMDN--VKVVEL----VYAHADPLQISPIIDYLLN 173

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYD---------KKHGVIHKSIEALDKMGMD-IIETGD 224
            P     +S+D  H+ S       D         +   ++H+  EA  K+G++ +++  +
Sbjct: 174 QPDTAVVISTDLSHYYSLDEAKELDLICLEAIRHENSTMLHQGCEACGKIGVEAMLDVAN 233

Query: 225 PDAFKKYLLEY 235
               +  LL+Y
Sbjct: 234 KRTMEAILLDY 244


>gi|340359794|ref|ZP_08682267.1| hypothetical protein HMPREF9062_1392 [Actinomyces sp. oral taxon
           448 str. F0400]
 gi|339884084|gb|EGQ73906.1| hypothetical protein HMPREF9062_1392 [Actinomyces sp. oral taxon
           448 str. F0400]
          Length = 277

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 68/225 (30%), Positives = 99/225 (44%), Gaps = 20/225 (8%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           ++ R  + AG +Y     RL  +++  L   R            +I PHAGY YSG  AA
Sbjct: 9   QQTRPPAVAGLFYPAEPARLRADVESMLTGARADADAAPTPPAALIVPHAGYVYSGPTAA 68

Query: 59  YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
            A+   +     + RV LLGP+H       AL       TP+G  P+ +EV  +++A   
Sbjct: 69  LAWARAESLRGRVRRVVLLGPAHRMGVRALALPGHRAMDTPLG--PVTVEVPAQIEALAS 126

Query: 117 FELMDICVD--EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
             L+    D   AEHS+E+ LP+L  V       +VP+ VG V+A+  A   R F    D
Sbjct: 127 SSLVVARPDVHAAEHSLEVQLPFLLTVLPEA--SVVPLAVGRVDADRVAEAIRPFLGGSD 184

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
                  VSSD  H      Y+   +   V   +IE +  +  DI
Sbjct: 185 ---TLVVVSSDLSH------YLPQAEARRVDDATIERILALRADI 220


>gi|322419450|ref|YP_004198673.1| hypothetical protein GM18_1934 [Geobacter sp. M18]
 gi|320125837|gb|ADW13397.1| protein of unknown function DUF52 [Geobacter sp. M18]
          Length = 266

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 71/290 (24%), Positives = 128/290 (44%), Gaps = 34/290 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +YT +  RL EEL   + +    ++ +  G+IAPHAGY YSG+ A   +  
Sbjct: 2   VRQPAVAGRFYTGDPVRLREELSAMVPQG---EAQKAIGIIAPHAGYVYSGKVAGKVYSA 58

Query: 64  ID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +  P +   V +LGP+H       AL+  T + TP+G +P++  + + +         D 
Sbjct: 59  VRIPDT---VLILGPNHTGAGVAAALAPETQWLTPLGSVPVNRRLSKLILEHAPQVREDA 115

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV---GAVNAENEAMYGRLFAKYVDDPSNF 179
                EHS+E+ +P+L     G  I  + + +    +++   E +  R  A Y ++    
Sbjct: 116 AAHRFEHSLEVQVPFLQYRNPGVSIAAICLALPDFASISKIGEGI-ARAIAAYGEE--VL 172

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSD  H+ S                + +  D   +  +   +P+   +   E + T+
Sbjct: 173 IVASSDMTHYES--------------AAAAKVKDDQALAQLTELNPEGLLRVCREKEITM 218

Query: 240 CGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           CG  P +  L    +LG  S ++    + Y  S +      S V+YA+ A
Sbjct: 219 CGVIPATALLVAAKVLGATSCRL----VDYATSGEVNGDLASVVAYAALA 264


>gi|432329137|ref|YP_007247281.1| putative dioxygenase [Aciduliprofundum sp. MAR08-339]
 gi|432135846|gb|AGB05115.1| putative dioxygenase [Aciduliprofundum sp. MAR08-339]
          Length = 271

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 123/292 (42%), Gaps = 37/292 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP--------EVRGVIAPHAGYSYSG 54
           +R  + AG +Y    K L   L+  ++   G  + P        ++ G + PHAGY +SG
Sbjct: 1   MRYPAVAGQFYPGEKKELQIMLNQLFIHPLGPGEVPSLSPDGPRKIIGGVVPHAGYIFSG 60

Query: 55  RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
             AA+ +  +        F++   +HY        T   + TP+G + +D E+ +E+   
Sbjct: 61  PVAAHFYAALAKDGFPDTFIIIGPNHYGIGSGVAVTLEDFITPLGRVQIDRELAKEIAR- 119

Query: 115 GKFELMDI--CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
              E++DI       EHS+E+ LP+L   F    IK VPI +     E     G++    
Sbjct: 120 ---EMVDIDDYAHRYEHSIEVQLPFLQ--FFKREIKFVPISMLIQEYEIAVELGKIIKDA 174

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           ++        SSDF H+  R N   Y K            D + ++ I  GD       +
Sbjct: 175 IEGKDVVVIASSDFSHYIPRQN--AYRK------------DALAIEKILKGDTRGLYDVI 220

Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            ++  T+CG  PI+  L   G  +T      L+Y  S   +  +D  V YA+
Sbjct: 221 AKHSITMCGYGPITAMLTATGGKAT-----LLKYATSGDVQPMQD-VVGYAA 266


>gi|300087467|ref|YP_003757989.1| hypothetical protein Dehly_0349 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
 gi|299527200|gb|ADJ25668.1| protein of unknown function DUF52 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
          Length = 440

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 112/248 (45%), Gaps = 28/248 (11%)

Query: 42  GVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           GV++PHAGY YSG  AA  FG I+  +     ++GP+H       ++ T   ++TP+G++
Sbjct: 40  GVVSPHAGYIYSGGVAAAVFGRIE--TADTYVIIGPNHTGMGKPFSIMTVGAWQTPLGEV 97

Query: 102 PLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN 161
            +D  + + + A  K+   D    + EHS+E+ LP L   +    +KIVPI++     + 
Sbjct: 98  AIDSSLAQNILANSKYLQEDRTAHQGEHSVEVQLPLLQ--YHQPQLKIVPIVLAVATLDI 155

Query: 162 EAMYGRLFAKYVD---DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
               G   A+ +    D       SSD  H+ ++      D++     +SI ALD+  + 
Sbjct: 156 YREIGASIAQAIQESPDKKVVIVASSDMTHYEAQDIATAKDQRA---IESIIALDEAQL- 211

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTK 275
                      + +++   ++CG  P    +     LG  S ++    +RY+ S      
Sbjct: 212 ----------LERVIKERISMCGYAPTVTMMAAARALGARSAEL----VRYQTSGDASGD 257

Query: 276 RDSSVSYA 283
             + V YA
Sbjct: 258 YTAVVGYA 265


>gi|289432991|ref|YP_003462864.1| hypothetical protein DehalGT_1048 [Dehalococcoides sp. GT]
 gi|452205478|ref|YP_007485607.1| hypothetical protein btf_1189 [Dehalococcoides mccartyi BTF08]
 gi|288946711|gb|ADC74408.1| protein of unknown function DUF52 [Dehalococcoides sp. GT]
 gi|452112534|gb|AGG08265.1| hypothetical protein btf_1189 [Dehalococcoides mccartyi BTF08]
          Length = 438

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 124/287 (43%), Gaps = 33/287 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +R+AS AG +Y  + +++       E+ G  +E  L       G + PHAGY YSG  A 
Sbjct: 2   LRQASAAGRYYPASKEKIRSVITPMEVKGIAKERYL-------GAVMPHAGYPYSGGVAM 54

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
                +D   I+   +LGPSH     + A+  + +++TP+G++ +D  +   +    +  
Sbjct: 55  AVASRLDLPEIA--IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHI 112

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             D    + EHS+E+ +P L + F+   IKIVPI V    +E  A  G   A  + +   
Sbjct: 113 KADPSAHQYEHSIEVQIPIL-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGR 170

Query: 179 --FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                 SSD  H+ S+ +  H               D + +D I   D     + + +  
Sbjct: 171 EAIIIASSDMTHYESQSD-AHLK-------------DSLALDAIIKLDAAEMLERIQQNH 216

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            T+CG  P++  L  +     K + + + Y+ S       D  V YA
Sbjct: 217 ITMCGYAPVAAMLTAVKEMGAK-RARVVAYQTSGDVTHHLDQVVGYA 262


>gi|355570784|ref|ZP_09042054.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methanolinea tarda NOBI-1]
 gi|354826066|gb|EHF10282.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Methanolinea tarda NOBI-1]
          Length = 263

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 76/288 (26%), Positives = 130/288 (45%), Gaps = 36/288 (12%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           R+ + AG +Y  +   L + L+ + R  ++GL    + RG+++PHAGY YSG  +A A+G
Sbjct: 4   RKCAVAGMFYPRDPDHLEQLLEKFFRNKDSGL----DARGIVSPHAGYPYSGEVSAVAYG 59

Query: 63  NIDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I P++ S  F+ +GPSH  +   C   +   ++TP+G +  D     +L +    E+ +
Sbjct: 60  AI-PSTFSGTFIVIGPSHRGF---CTCISLVPWETPLGLVDND----SQLGSLLGVEVDE 111

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF-- 179
           +     E+S+E+ +P++   F     +IVP+L+G  + ++ A         +        
Sbjct: 112 VSHQYQENSLEVQIPFIKYRFP--RARIVPVLMGDQDLQSAADLAEKIVAGIRKSGRSDV 169

Query: 180 -FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSDF          HY  K   I   + A++ +      T D + F + L E   +
Sbjct: 170 RIVASSDFS---------HYVPKATAIENDLYAIEPL-----LTLDVEGFYRRLRERRVS 215

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            CG  PI+       +     + K LRY  S    T     V YA+ A
Sbjct: 216 ACGYGPIATMCLACRSLGAS-EGKLLRYATSGDV-TGDPEVVGYAAIA 261


>gi|229579952|ref|YP_002838351.1| hypothetical protein YG5714_2180 [Sulfolobus islandicus Y.G.57.14]
 gi|259646598|sp|C3N8S4.1|Y2180_SULIY RecName: Full=MEMO1 family protein YG5714_2180
 gi|228010667|gb|ACP46429.1| protein of unknown function DUF52 [Sulfolobus islandicus Y.G.57.14]
          Length = 284

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
           R  + AGS+Y  + K+L  +++   R       +PK    +        + PHAGY YSG
Sbjct: 3   RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62

Query: 55  RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
             AA+++  +   S  R   V +LGP+H       +      ++TP+G + +D E++ +L
Sbjct: 63  PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
               +   +D      EHS+E+ LP+L   F+    KIVPI++     E  E +   ++ 
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQHFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
               +P     V  SSD           HYD     + K +EA++K     I+  D    
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEAIEK-----IQQLDYKGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            + +   D T+CG  PI V L +      K  I  L++  S      +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280


>gi|154272045|ref|XP_001536875.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150408862|gb|EDN04318.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 137

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
           R A+HAGSWY+D+   L+ +L+ WL +      G+ + P    R +IAPHAGY+YSG  A
Sbjct: 44  REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPHAGYAYSGPCA 103

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYY 82
           A+A+  +D +    +FLLGPSHH++
Sbjct: 104 AWAYKALDLSKAKSIFLLGPSHHHH 128


>gi|430813553|emb|CCJ29100.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 130

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 46/110 (41%), Positives = 56/110 (50%), Gaps = 21/110 (19%)

Query: 148 KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK------- 200
           KIVPILVGA+  E E  YG+L A Y+ D  N F VS      G RF+Y +Y +       
Sbjct: 18  KIVPILVGAIGTEQEEWYGQLLAPYIGDTKNRFIVS------GQRFSYTYYAEAGEKGRR 71

Query: 201 --------KHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
                   +   I+KSIE  D+ GM  IE G    F +YL    NTICGR
Sbjct: 72  LWREEEPLRGEPIYKSIERCDREGMQKIEEGSHKGFVEYLSRTRNTICGR 121


>gi|197118676|ref|YP_002139103.1| MEMO-like protein [Geobacter bemidjiensis Bem]
 gi|197088036|gb|ACH39307.1| MEMO-like protein [Geobacter bemidjiensis Bem]
          Length = 266

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 131/288 (45%), Gaps = 34/288 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR+ + AG +Y+ + ++L EEL   + RE    +  +  GV+APHAGY YSG AA   + 
Sbjct: 2   IRQPAVAGKFYSADPEQLREELSRMIPRE----EPAKAIGVVAPHAGYVYSGGAAGKVYA 57

Query: 63  NID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            ++ P +   V +LGP+H       AL+ +  + TP+G +P++  + + +         D
Sbjct: 58  AVEVPDA---VIVLGPNHTGMGAAAALAPSGEWLTPLGPVPVNDRLSQLIMKYAPLVRED 114

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
                 EHS+E+ +P+L   +    + I  + +   + ++ +  G   A  + +  +   
Sbjct: 115 SAAHRFEHSLEVQVPFLQ--YRNPQVSIAALCLSLPDFDSISRVGTGIAHAIAEYGSEVL 172

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSD  H+ S                  +A D++ ++ +   DP+   +   + D ++
Sbjct: 173 IVASSDMTHYQS--------------AAEAKAKDELALERLANMDPEGLLRVCRDKDISM 218

Query: 240 CGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           CG  P +  L     LG  S+++    + Y  S +      + V+YA+
Sbjct: 219 CGVIPATAMLVAAKTLGATSSRL----ICYTNSGEVNGDLKAVVAYAA 262


>gi|336121517|ref|YP_004576292.1| hypothetical protein Metok_0534 [Methanothermococcus okinawensis
           IH1]
 gi|334856038|gb|AEH06514.1| UPF0103 protein [Methanothermococcus okinawensis IH1]
          Length = 284

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 126/296 (42%), Gaps = 33/296 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSG 54
           IR  + AG++Y      L E ++         G L   G+ K P   GV+ PHAGY YSG
Sbjct: 3   IRNPAVAGAFYPAEPNNLIEMIEYCYLHKIGPGTLPSKGVFKKP--VGVVCPHAGYIYSG 60

Query: 55  RAAAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
             AA+++  I   +   +   ++GP+H       + +   +++TP+G++  D E I+ L 
Sbjct: 61  PVAAHSYNAISKKTDGTITAVIIGPNHTGLGSGVS-TMKGIWRTPLGNMSSDDEFIDRLW 119

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
                  +D      EHS+E+ LP+L   ++      KIVPI +   + E     G   A
Sbjct: 120 NECDILDLDETAHIREHSIEVQLPFLQHLELLNAVKFKIVPICMMMQDYETAVEIGYFIA 179

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
           K   + +    V  SSDF H+  +       KK  ++ K          DI+   + + +
Sbjct: 180 KISKELNRRVVVIASSDFTHYEPQ---EIASKKDAIVIK----------DILNMNEKELY 226

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
              ++  + T+CG  P+   +  +   + K   + L Y  S        S V Y S
Sbjct: 227 AD-VINNNITMCGYGPVIAMIKAMKELNAK-NSRLLAYSTSGDITGNYSSVVGYGS 280


>gi|336324129|ref|YP_004604096.1| hypothetical protein Flexsi_1893 [Flexistipes sinusarabici DSM
           4947]
 gi|336107710|gb|AEI15528.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Flexistipes sinusarabici DSM 4947]
          Length = 262

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 115/253 (45%), Gaps = 27/253 (10%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTP 97
           E + +I PHAGY +SG  A      I  P +I    LLGP+H     + A+     +  P
Sbjct: 34  EAKMIIVPHAGYVFSGATAVKTISRIKLPKNI---ILLGPNHTGTGGRIAVYPGGKWSCP 90

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV 157
           +GD+P++  ++E+L   G FE  D      EHS+E+ LP L K F   L  IVPI    +
Sbjct: 91  LGDVPVNENMVEKLIDKG-FE-SDQPAHVKEHSLEVQLPIL-KYFRDDL-NIVPIAFKGL 146

Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
             ++    G++    VD+  +   VSSDF          H++       K  +A++++ +
Sbjct: 147 GFDDCRNAGKVLKDLVDETDSMIVVSSDFN---------HFEDIETTNEKDFDAINRI-L 196

Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
           D+    D       +L  + ++CG  P  V L  L     K  +K    E ++  +T  D
Sbjct: 197 DL----DSKGLYDTVLSKNISMCGIIPTVVALESL---EKKENLKASLVEHTTSAETSGD 249

Query: 278 SS--VSYASAAAK 288
           +S  V YA    K
Sbjct: 250 ASQVVGYAGIIIK 262


>gi|229581387|ref|YP_002839786.1| hypothetical protein YN1551_0739 [Sulfolobus islandicus Y.N.15.51]
 gi|259647115|sp|C3NMP4.1|Y739_SULIN RecName: Full=MEMO1 family protein YN1551_0739
 gi|228012103|gb|ACP47864.1| protein of unknown function DUF52 [Sulfolobus islandicus Y.N.15.51]
          Length = 284

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRG------VIAPHAGYSYSG 54
           R  + AGS+Y  + K+L  +++   R       +PK    +        + PHAGY YSG
Sbjct: 3   RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62

Query: 55  RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
             AA+++  +   S  R   V +LGP+H       +      ++TP+G + +D E++ +L
Sbjct: 63  PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
               +   +D      EHS+E+ LP+L   F+    KIVPI++     E  E +   ++ 
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
               +P     V  SSD           HYD     + K +EA++K     I+  D    
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEAIEK-----IQQLDYKGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            + +   D T+CG  PI V L +      K  I  L++  S      +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280


>gi|14601616|ref|NP_148156.1| hypothetical protein APE_1771 [Aeropyrum pernix K1]
 gi|7388527|sp|Q9YB24.1|Y1771_AERPE RecName: Full=MEMO1 family protein APE_1771
 gi|5105461|dbj|BAA80774.1| conserved hypothetical protein [Aeropyrum pernix K1]
          Length = 281

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 75/297 (25%), Positives = 128/297 (43%), Gaps = 35/297 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYS 53
           KIR  +HAG++Y    + L + ++         G L + G     +    I PHAGY YS
Sbjct: 2   KIRNPAHAGTFYPATREELVKSIESSFTHPLGPGRLPQRGGGSGEQAIAYIPPHAGYMYS 61

Query: 54  GRAAAYAFGNIDPTSISR----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           G  AA+ + ++   S+ R    V LLGP+H       +L    V++TP+G++ +D E   
Sbjct: 62  GPIAAHVYYDM---SLGRKPDVVVLLGPNHTGLGLAASLWDEGVWRTPLGEVEVDSEAGR 118

Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
            +         D      EHS+E+ LP+L  ++ G   +IVPI+V     +      R +
Sbjct: 119 LVVEYSGIVAPDDEGHIYEHSLEVQLPFLQYLYGGDF-RIVPIVVLHQTLDISIRIARAY 177

Query: 170 AKYVDDP--SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
            +  ++   +     +SD  H      Y  Y++         +  D + +  IE GDP+A
Sbjct: 178 HRLREENGVNAVLVATSDLNH------YEPYEEN--------KRKDLLLLKAIEEGDPEA 223

Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             K +  +  + CG  PI+  +         +K + L Y  S     ++   V Y +
Sbjct: 224 VFKTIEAHAISACGPSPIAAAVE--AGRLAGVKPRVLAYANSGDVTGEKAWVVGYPA 278


>gi|218884668|ref|YP_002429050.1| putative dioxygenase [Desulfurococcus kamchatkensis 1221n]
 gi|254800053|sp|B8D6F2.1|Y1357_DESK1 RecName: Full=MEMO1 family protein DKAM_1357
 gi|218766284|gb|ACL11683.1| Predicted dioxygenase [Desulfurococcus kamchatkensis 1221n]
          Length = 283

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 120/301 (39%), Gaps = 43/301 (14%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSP------EVRGVIAPHAGYSYS 53
           K R    AG +Y D    L   ++   +     G P SP         G +APHAGY YS
Sbjct: 2   KKRSPIVAGYFYPDKPGELRSVIEWSFKHGIGPGKPPSPSDIPATNSIGYVAPHAGYIYS 61

Query: 54  GRAAAYAFG----NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           G  AA+ +     N  P +I    +LG +H       ++    V++TP+GDL +D E+  
Sbjct: 62  GPVAAHVYFDMALNKKPDTI---VILGTNHTGLGRPVSVYPEGVWETPLGDLVVDSEIGR 118

Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
            +    +    D      EHS+E+ LP++  ++ G  +KI PI++G    +      R  
Sbjct: 119 LIVENSEIAEFDEYAHLEEHSIEVQLPFIVYIY-GEDVKITPIVIGIHTPD----IARDL 173

Query: 170 AKYVDDPSN------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
           AK + + S           SSDF H+                H+     D M +D I   
Sbjct: 174 AKSIYEASMSTGKRIIVIASSDFNHYEP--------------HEETSRKDSMAIDRILKL 219

Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           D D     +L  D +ICG   I   +        K ++  L+Y  S          V YA
Sbjct: 220 DTDGLYNVILHNDISICGPGGIMTLMEYTKKLGGKAQL--LKYATSGDTSGDYSHVVGYA 277

Query: 284 S 284
           +
Sbjct: 278 A 278


>gi|218667506|ref|YP_002425064.1| hypothetical protein AFE_0572 [Acidithiobacillus ferrooxidans ATCC
           23270]
 gi|218519719|gb|ACK80305.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC
           23270]
          Length = 264

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 73/275 (26%), Positives = 114/275 (41%), Gaps = 36/275 (13%)

Query: 15  TDNSKRLAE-ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA--YAFGNIDPTSISR 71
           T+  + LA  E DG    A  PK+     +I PHAGY YSG  AA  YA        I R
Sbjct: 12  TEVERLLARAEQDGEAASAPWPKA-----IIVPHAGYIYSGAVAASGYALLAKGRGHIRR 66

Query: 72  VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSM 131
           V LLGP+H       AL      +TP+G + +D   +E L    +   M       EH++
Sbjct: 67  VVLLGPAHRLPFRGLALPGVQAMQTPLGTVAVDQAGVEALAGLPEVREMP-AAHAQEHAL 125

Query: 132 EMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGS 191
           E+ LP++ +V     + +VP++VG    +  A   R+  K          +SSD  H   
Sbjct: 126 EVQLPFIQEVLGD--VSVVPLVVGDARPDEVA---RVLEKLWGGEETVIVISSDLSH--- 177

Query: 192 RFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM 251
              Y  Y +   + H ++E + +             F    ++++   CG  PI+  L +
Sbjct: 178 ---YHPYAEARAIDHHTVEEILR-------------FDPTPIDHEQA-CGATPINGLLPV 220

Query: 252 LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
                  +  + +    S      RD+ V YA+ A
Sbjct: 221 --ARKHHLHPRLVGLCNSGDTAGSRDAVVGYAAVA 253


>gi|227831071|ref|YP_002832851.1| hypothetical protein LS215_2219 [Sulfolobus islandicus L.S.2.15]
 gi|259646604|sp|C3MJQ3.1|Y2219_SULIL RecName: Full=MEMO1 family protein LS215_2219
 gi|227457519|gb|ACP36206.1| protein of unknown function DUF52 [Sulfolobus islandicus L.S.2.15]
          Length = 284

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRG------VIAPHAGYSYSG 54
           R  + AGS+Y  + K+L  +++   R       +PK    +        + PHAGY YSG
Sbjct: 3   RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62

Query: 55  RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
             AA+++  +   S  R   V +LGP+H       +      ++TP+G + +D E++ +L
Sbjct: 63  PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
               +   +D      EHS+E+ LP+L   F+    KIVPI++     E  E +   ++ 
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTLEIAEFLADAIYN 179

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
               +P     V  SSD           HYD     + K +EA++K     I+  D    
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEAIEK-----IQQLDYKGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            + +   D T+CG  PI V L +      K  I  L++  S      +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280


>gi|406896361|gb|EKD40670.1| hypothetical protein ACD_74C00263G0001 [uncultured bacterium]
          Length = 267

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/245 (24%), Positives = 105/245 (42%), Gaps = 23/245 (9%)

Query: 42  GVIAPHAGYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
            V++PHAGY YSG  A   F  ++ P  I    LLGP+HH Y    +L     +  P+G+
Sbjct: 38  AVVSPHAGYIYSGSVAGETFAAVEIPKDI---VLLGPNHHGYGAPVSLMHTGSWNMPLGE 94

Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
           + ++  + + L A       D      EHS+E+ +P+L + F   +  + P+++  ++ +
Sbjct: 95  IQINTVLAQTLLAQTDLIEPDTLAHRFEHSLEVQVPFL-QYFRPDM-TLTPMVISHLSFK 152

Query: 161 NEAMYGRLFAKYVD--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
                G   A+ +   +       SSD  H+ SR               S  A D + + 
Sbjct: 153 ACQEVGETLAEAIRKYNKPVLIVASSDMTHYESRV--------------SATAKDSLAIH 198

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
            +E  DP+     +L+   T+CG  P ++ L          + + +RY  S +     + 
Sbjct: 199 RLEALDPEGLYNTVLDKGITMCGIMPTTIALVAALRLGAT-QARLIRYTDSGEASGDTNQ 257

Query: 279 SVSYA 283
            V YA
Sbjct: 258 VVGYA 262


>gi|148658050|ref|YP_001278255.1| hypothetical protein RoseRS_3955 [Roseiflexus sp. RS-1]
 gi|148570160|gb|ABQ92305.1| protein of unknown function DUF52 [Roseiflexus sp. RS-1]
          Length = 405

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/273 (26%), Positives = 112/273 (41%), Gaps = 23/273 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           R+ + AG  Y  N   L   LDG++   G   P  P  RGV++PH  Y   GR  A  + 
Sbjct: 114 RQPALAGQSYPANPADLRRLLDGYVAAVGQVTPAPPTGRGVLSPHIDYERGGRVYAQVWQ 173

Query: 63  NIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK-ATG-KFE 118
                  +   V L+G  H  Y+P+    T   Y TP G LP D  +++ L  A G +  
Sbjct: 174 RAAEMVRAAEIVILIGTDH--YSPEPITLTRQRYATPFGVLPTDAAIVDALAHALGEEAA 231

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA----VNAENEAMYGRLFA---- 170
                    EHS+E+   +L  V       ++PILVG+    +  E+ A   RL A    
Sbjct: 232 FAGELYHRVEHSLELAAVWLQYVRGNPPCPVIPILVGSFARYIGGEDPATDRRLEALMTA 291

Query: 171 --KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
             + +D       +S D  H G  F      +       ++   D+  +D +  GD   F
Sbjct: 292 LRQVMDTRRTLVVISGDMSHVGPAFGGAPLSESD---KAALRRSDEQVIDRMRAGDAAGF 348

Query: 229 KKYLLEYDN--TICGRHPISVFLHMLGNCSTKI 259
            + + +  +   ICG  P  V L ++G    ++
Sbjct: 349 FRVIADTGDRQNICGLPPTYVALRLMGAVEGEL 381


>gi|436841594|ref|YP_007325972.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
           AM13 = DSM 14728]
 gi|432170500|emb|CCO23871.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
           AM13 = DSM 14728]
          Length = 270

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 127/287 (44%), Gaps = 32/287 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL----REAGLPKSPEVRGVIAPHAGYSYSGR--AAA 58
           R+   AG +YTDN ++L +EL  ++    ++A  P     R V+ PHAGY +SG      
Sbjct: 3   RKPVVAGRFYTDNPQQLKQELKQFIGPLNKKAAAPYD---RLVMLPHAGYMFSGEPCGKT 59

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
            A  N+ PT    + LLGP+H       ++     ++ P G L +D ++ ++L  +G   
Sbjct: 60  LAGANLAPT----IILLGPNHTGLGSPLSVWDCGSWEFPGGKLDVDEDLAQQLIDSGTGF 115

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
           + +      EHS+E+ +P+L   +    I+IVP+ V     +     G   A  +D  S 
Sbjct: 116 VENQAAHSREHSLEVIVPFLH--YLNPEIRIVPVCVSESAPKVLHKAGEAIADIIDAYSK 173

Query: 179 FFS--VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
             S  VSSD  H      ++  DK         + +D M ++ I   DP      +    
Sbjct: 174 PVSIVVSSDMSH------FIKADKA--------KKMDSMALEAIIRMDPADLYSIVSSNQ 219

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            ++CG  P+++ ++           + + Y  S +     +S V+YA
Sbjct: 220 ISMCGVLPMTMGMYAAKKLGANAG-RLIEYTNSGKVTGDFESVVAYA 265


>gi|15922382|ref|NP_378051.1| hypothetical protein ST2062 [Sulfolobus tokodaii str. 7]
 gi|23822365|sp|Q96YW6.1|Y2062_SULTO RecName: Full=MEMO1 family protein STK_20620
 gi|15623171|dbj|BAB67160.1| hypothetical protein STK_20620 [Sulfolobus tokodaii str. 7]
          Length = 284

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 130/300 (43%), Gaps = 39/300 (13%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG---------VIAPHAGYSYS 53
           IR  + AG++Y     +L ++++  +L   G  K P+V            I PHAGY YS
Sbjct: 2   IRLPAVAGAFYEGEEDKLKKQIEWSFLHPLGPGKIPQVPPQKSKRNNLFFIVPHAGYMYS 61

Query: 54  GRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+A+  +    I   V +LGP+H       +L     +KTP+G++ +D ++  +L 
Sbjct: 62  GPVAAHAYYYLASEGIPDTVIILGPNHTGLGSYVSLWPKGKWKTPLGEIEIDEQIAMDL- 120

Query: 113 ATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
              + E++DI  DE     EHS+E+ +P+L   F+    KIVPI++     E        
Sbjct: 121 -VRESEVIDI--DEKAHLYEHSIEVQVPFLQYFFDSK-TKIVPIVIMMQTPEISEYLAEG 176

Query: 169 FAKYVD---DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
            +K +    D       SSD  H+                 K+IE  D M ++ I + D 
Sbjct: 177 ISKIMQKYKDKDIVVIASSDMNHYEPH-------------EKTIEK-DNMAIEKILSLDY 222

Query: 226 DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
                 + E D T CG  P+   L +    + K  +  L++  S      + S V Y S 
Sbjct: 223 KGLFNVVEEKDVTACGFGPMMTVLMLAKKFNKKPYV--LKHATSGDTSGDKSSVVGYLSV 280


>gi|386393059|ref|ZP_10077840.1| putative dioxygenase [Desulfovibrio sp. U5L]
 gi|385733937|gb|EIG54135.1| putative dioxygenase [Desulfovibrio sp. U5L]
          Length = 288

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 119/284 (41%), Gaps = 25/284 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIA--PHAGYSYSGRAAAYAFG 62
           R+   AG +Y   +  L  E+  +L EA  P   +   ++A  PHAGY YSG  A    G
Sbjct: 20  RQPVVAGRFYPGAAPALRREVGAFLAEAEAPAEADGPTLLAMVPHAGYVYSGSVAGRTLG 79

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELMD 121
                    V LLGP+H     + A+  +  +  P  D+P+  ++  + L+A  +    D
Sbjct: 80  AARLADT--VLLLGPNHTGRGKRLAVWPSGAWAVPGCDVPVAADLARDFLRAEARLS-PD 136

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EHS+E+ LP+L    +   ++IVPI V   + E         A  +   S   S
Sbjct: 137 AAAHREEHSLEVLLPFL--CVKNPAVRIVPIAVAEPDPEALRQVAGAMAGVLGQRSEPVS 194

Query: 182 --VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
             VSSD  H+              V H++ +  D + +D +   DPD   + + E   T+
Sbjct: 195 IVVSSDMSHY--------------VPHETAKRRDALALDRVLALDPDGLYRVVREAGITM 240

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           CG  P+ + LH+      +  +    Y  S +     +  V YA
Sbjct: 241 CGVLPMVLGLHLAKALGARQAV-LAAYATSGEASGDYNQVVGYA 283


>gi|95928913|ref|ZP_01311659.1| protein of unknown function DUF52 [Desulfuromonas acetoxidans DSM
           684]
 gi|95135258|gb|EAT16911.1| protein of unknown function DUF52 [Desulfuromonas acetoxidans DSM
           684]
          Length = 267

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 90/188 (47%), Gaps = 8/188 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR+ + +G +YTD+   L ++++ +L     P  P   GV+ PHAGY YSG  A      
Sbjct: 2   IRQPAVSGQFYTDDPYELRQQVELFLT-TDQPGKP-AYGVMMPHAGYMYSGAIAGETLAG 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +D      V LLGP+H      CAL +   +KTP+G++P+   + + L         ++ 
Sbjct: 60  VDVPDT--VLLLGPNHRGIGHPCALYSQGSWKTPLGEVPIAETMAQRLLDAVPHLTGEMQ 117

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPSNFFS 181
             + EHS+E+ LP+L    +   + I+P+++G ++       G      +  D       
Sbjct: 118 SHQGEHSLEVLLPFLQ--VKNPNLSIIPLMLGPLSFSILQQLGAGIGAVLKEDGGRVLIV 175

Query: 182 VSSDFCHW 189
            SSD  H+
Sbjct: 176 ASSDMTHY 183


>gi|148706475|gb|EDL38422.1| RIKEN cDNA 0610016J10, isoform CRA_a [Mus musculus]
          Length = 77

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 39/76 (51%), Positives = 47/76 (61%), Gaps = 3/76 (3%)

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCK 273
           M IIE  DP +F  YL +Y NTICGRHPI V L+    L      +   FL Y QSSQC+
Sbjct: 1   MSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCR 60

Query: 274 TKRDSSVSYASAAAKV 289
           + +DSSVSYA+ A  V
Sbjct: 61  SWQDSSVSYAAGALTV 76


>gi|385805502|ref|YP_005841900.1| putative dioxygenase [Fervidicoccus fontis Kam940]
 gi|383795365|gb|AFH42448.1| putative dioxygenase [Fervidicoccus fontis Kam940]
          Length = 287

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 133/292 (45%), Gaps = 24/292 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD-------GWLREAGLPKSPEVR--GVIAPHAGYSYSG 54
           +R    AG +Y    ++L E++        G  +EA    S      G IAPHAGY YSG
Sbjct: 6   VRDPVVAGEFYELRKEKLLEQIRWAFLHPIGPGKEAIFSNSITKNNFGYIAPHAGYIYSG 65

Query: 55  RAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ + NI        F++ GP+H     + +L+    ++TP+G + +D+E+ + L  
Sbjct: 66  PVAAHTYYNISLEKKPDTFIIIGPNHTGLGAQVSLAPWKQWRTPLGLINVDIELRDYLIT 125

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
             K  + D      EHS+E+ LP+L  +F+    +I+PI+   +  +   +  ++  + +
Sbjct: 126 KSKVLVPDYNAHLYEHSIEVQLPFLQYIFQKDF-QILPIV---LMRQTPKVVEKIAEELL 181

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
           D   +   +  DF    S  +  HY+       K ++A      ++I+  D + F  YL 
Sbjct: 182 DATEH---LGRDFTIIASS-DMSHYEPYDVAKRKDLKAF-----EMIKKRDINLFYNYLE 232

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
             + ++CG   + + L  +       + + L+Y  S      +D+ V Y SA
Sbjct: 233 TENVSMCGPGGVMI-LMTISKMKNGREPEILKYATSGDLTGDKDAVVGYLSA 283


>gi|167379219|ref|XP_001735045.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165903085|gb|EDR28738.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 284

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/289 (26%), Positives = 116/289 (40%), Gaps = 36/289 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----VIAPHAGYSYSGRAAAY 59
           +R+A  AG W+  N   LA E+D ++  A L K P ++G     I+PHAG+ YSG  A Y
Sbjct: 6   VRKAVGAGRWFNANGNELANEIDHYINNA-LNKLPPIQGKILGCISPHAGFRYSGPTAGY 64

Query: 60  AFGNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            F  +   S        VF+LG SH       A+       TPI    +D E I      
Sbjct: 65  DFAALKRDSEINGKPDVVFILGFSHSSRFDYAAIMDGKAISTPIATTEIDNEAITIFCEG 124

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
             +          EHS E  LP++ +   G  +K+V +L+G   +E      +       
Sbjct: 125 RNYLKCFYKPHNGEHSAENELPFVQRALPG--VKVVMVLIGTHKSEVFEQVAQGLQAVAS 182

Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
               +   SSD  H          D+ H ++ K+    DK  + + E  D          
Sbjct: 183 KKKMYVIASSDMLH----------DESHNLVEKT----DKETILLTEKMDIKGLLTR-WS 227

Query: 235 YDNTI-CGRH---PISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS 279
           Y+N I CG     P  ++   +G C+  I +        S+  TKR +S
Sbjct: 228 YENQIYCGITAIIPTMMYAQSIG-CAKAITLDL----TDSETVTKRMNS 271


>gi|404493275|ref|YP_006717381.1| MEMO-like protein [Pelobacter carbinolicus DSM 2380]
 gi|77545333|gb|ABA88895.1| MEMO-like protein [Pelobacter carbinolicus DSM 2380]
          Length = 267

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 116/281 (41%), Gaps = 23/281 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R  + AG +Y  + K L   ++ +L +A   +S    G++ PHAGY +SG  A   FG +
Sbjct: 3   RMPAVAGQFYPADPKSLRSMVETYLEKAT--QSHPAIGLMVPHAGYVFSGAIAGQTFGCV 60

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           D    S+V ++GP+H  Y    AL     + TP+G++P+   + + +       + D   
Sbjct: 61  DIP--SKVLVIGPNHTGYGESLALFAKGSWVTPLGEVPIAEGLADRVLQAHPRLMADDLA 118

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPSNFFSV 182
              EHS+E+ +P+L        ++IVP+ +  V  E     G    + +  +        
Sbjct: 119 HRFEHSLEVQIPFLQ--VRAPDVQIVPLCLAPVPYEELLALGNAIGQVLAAEKEPVLLVA 176

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
           SSD           HY+       K     D M +  +   DP+   + +   + ++CG 
Sbjct: 177 SSDM---------THYEPGAAARKK-----DHMALKHVCDLDPEGLYRTVASENISMCGV 222

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
               V L     C    K + + Y  S +    +   V YA
Sbjct: 223 VSTVVMLAA-AQCLGAQKGRLVCYGNSGEVTGDQSEVVGYA 262


>gi|385776635|ref|YP_005649203.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323475383|gb|ADX85989.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 284

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 81/297 (27%), Positives = 129/297 (43%), Gaps = 35/297 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
           R  + AGS+Y  + K+L  +++   R       +PK    +        + PHAGY YSG
Sbjct: 3   RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62

Query: 55  RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
             AA+++  +   S  R   V +LGP+H       +      ++TP+G + +D E++ EL
Sbjct: 63  PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMEL 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
               +   +D      EHS+E+ LP+L   F+    KIVPI++     E  E +   ++ 
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
               +P     V  SSD           HYD     + K  EA++K     I+  D    
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDEEAIEK-----IQQLDYKGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            + +   D T+CG  PI V L +      K  I  L++  S      +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280


>gi|239907803|ref|YP_002954544.1| hypothetical protein DMR_31670 [Desulfovibrio magneticus RS-1]
 gi|239797669|dbj|BAH76658.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 292

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/288 (26%), Positives = 119/288 (41%), Gaps = 31/288 (10%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
            IR+   AG +Y  +   LA E   +L EAG P        +APHAG  YSG  A    G
Sbjct: 24  SIRQPVVAGRFYPADPAALARETAAYLAEAGAPSDKPTLLAMAPHAGAVYSGPVAGKTLG 83

Query: 63  --NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             N+  T    + LLGP+H     + A+ +   ++ P  D+P++  + E L A       
Sbjct: 84  AANLADT----LLLLGPNHTGRGGRLAVWSDGAWRIPGRDVPVEAALAEALLAAAPGLSP 139

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLFAKYVDDP 176
           D      EHS+E+ LP++  V  G   +IVP+ V      V AE       + A +   P
Sbjct: 140 DRAAHLGEHSLEVLLPFVTAVRPG--CRIVPVAVAESRPGVLAETAQAMAEVLAAW-SGP 196

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG-MDIIETGDPDAFKKYLLEY 235
            +   VSSD  H+          K+  +    + ALD  G +D++               
Sbjct: 197 VSII-VSSDMSHY---LPEAAAKKRDSLALARVLALDPQGLLDVVR------------RE 240

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           D ++CG  P+++ L ++         +   Y  S +     D  V YA
Sbjct: 241 DISMCGVLPMTLGL-LIAQALGATSARLAAYATSGEVSGDYDQVVGYA 287


>gi|302341544|ref|YP_003806073.1| hypothetical protein Deba_0100 [Desulfarculus baarsii DSM 2075]
 gi|301638157|gb|ADK83479.1| protein of unknown function DUF52 [Desulfarculus baarsii DSM 2075]
          Length = 267

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 72/137 (52%), Gaps = 4/137 (2%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+ + AG +Y   ++ LA E+D  L +   P  P +  VI PHAGY +SGR A   FG 
Sbjct: 2   VRQPAVAGRFYPAQAEALAAEVDRLL-QTDQPPQPAL-AVICPHAGYVFSGRVAGQVFGQ 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           +  +   RV L+GP+H       AL +   ++TP+G + LD ++ + +     +   D  
Sbjct: 60  V--SVPRRVLLMGPNHSGMGRPAALMSRGQWRTPLGLINLDDQLGQAIMDRAAYVEQDDL 117

Query: 124 VDEAEHSMEMHLPYLAK 140
               EHS+E+  P+L +
Sbjct: 118 AHRNEHSLEVQTPFLQR 134


>gi|333910057|ref|YP_004483790.1| hypothetical protein [Methanotorris igneus Kol 5]
 gi|333750646|gb|AEF95725.1| UPF0103 protein [Methanotorris igneus Kol 5]
          Length = 286

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 81/303 (26%), Positives = 135/303 (44%), Gaps = 44/303 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRGV-------IAPHAGYSYSGR 55
           IR  + AG++Y  +   L E ++  +L   G  + P   G+       + PHAGY YSG 
Sbjct: 2   IRHPAVAGAFYPADPNELIETIEYCYLHNLGPKELPSEGGIFEKPIGLVCPHAGYIYSGP 61

Query: 56  AAAYAFG----NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
            AA+++      +D        +LGP+H       A +   ++KTP+GDL +D E  + L
Sbjct: 62  IAAHSYNALSKRVDVEEEITAIILGPNHTGLGTGVA-TMKGIWKTPLGDLEIDDEFADRL 120

Query: 112 KATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMY 165
               + ++MD+  DE     EHS+E+ LP+L  +   ++   K VPI +   + E  A  
Sbjct: 121 WK--ECDIMDL--DETSHLHEHSIEVQLPFLQHLSMLNIAKFKFVPISMLLQDYETAADI 176

Query: 166 GRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM--DIIE 221
           G   AK   + +    +  S+DF H+  +       KK  +  K+I  +D+ G+  D++ 
Sbjct: 177 GYFIAKIAKELNRRVVIIASTDFTHYEPQEVAA---KKDAIAIKNILNMDEEGLYSDVV- 232

Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
                        Y+ ++CG  P+   +  +     K + K L Y  S        S V 
Sbjct: 233 ------------NYNISMCGCGPVMAMIKAIKLLGGK-EAKLLAYATSGDITKDYSSVVG 279

Query: 282 YAS 284
           YAS
Sbjct: 280 YAS 282


>gi|15897032|ref|NP_341637.1| hypothetical protein SSO0066 [Sulfolobus solfataricus P2]
 gi|284173376|ref|ZP_06387345.1| hypothetical protein Ssol98_01802 [Sulfolobus solfataricus 98/2]
 gi|384433532|ref|YP_005642890.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|7388414|sp|P95994.1|Y066_SULSO RecName: Full=MEMO1 family protein SSO0066
 gi|1707828|emb|CAA69536.1| orf c05005 [Sulfolobus solfataricus P2]
 gi|13813197|gb|AAK40427.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
 gi|261601686|gb|ACX91289.1| protein of unknown function DUF52 [Sulfolobus solfataricus 98/2]
          Length = 284

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----------VIAPHAGYSYSG 54
           R  + AGS+Y  + K+L  +++   R    P+    +            I PHAGY YSG
Sbjct: 3   RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQSYEKKKRDNLFFIVPHAGYIYSG 62

Query: 55  RAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+++  +        V +LGP+H       +      ++TP+G + +D EV+ +L  
Sbjct: 63  PVAAHSYYYLASEGKPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKVDEEVLMQLVM 122

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMYGRL 168
             +   ++      EHS+E+ LP+L   F+ +  KIVPI++     E      +A+Y ++
Sbjct: 123 ESEVIDLEEKSHLYEHSIEVQLPFLQYFFDDNF-KIVPIVIMMQTPEIAEFLADAIY-KV 180

Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
             KY D      + SSD           HYD     + K  EA++K     I+  D    
Sbjct: 181 IQKYSDKDIVVLA-SSDMN---------HYDPHEITMKKDEEAIEK-----IQQLDYRGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            + +   D T+CG  PI V L +      K  I  L++  S      +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVSLILAKKLGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280


>gi|284998567|ref|YP_003420335.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284446463|gb|ADB87965.1| protein of unknown function DUF52 [Sulfolobus islandicus L.D.8.5]
          Length = 284

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 79/297 (26%), Positives = 129/297 (43%), Gaps = 35/297 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
           R  + AGS+Y  + K+L  +++   R       +PK    +        + PHAGY YSG
Sbjct: 3   RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62

Query: 55  RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
             AA+++  +   S  R   V +LGP+H       +      ++TP+G + +D E++ +L
Sbjct: 63  PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
               +   +D      EHS+E+ LP+L   F+    KIVPI++     E  E +   ++ 
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
               +P     V  SSD           HYD     + K +E ++K     I+  D    
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEVIEK-----IQQLDYKGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            + +   D T+CG  PI V L +      K  I  L++  S      +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280


>gi|410721675|ref|ZP_11361005.1| putative dioxygenase [Methanobacterium sp. Maddingley MBC34]
 gi|410598421|gb|EKQ52994.1| putative dioxygenase [Methanobacterium sp. Maddingley MBC34]
          Length = 280

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/298 (25%), Positives = 127/298 (42%), Gaps = 32/298 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSP-------EVRGVIAPHAGYSYSG 54
           IR+ + AG +Y  +   L + +  W    E G  + P        ++G+I+PHAGYSYSG
Sbjct: 2   IRKPAVAGMFYESDEDSLRKRIK-WCYQHELGPGRLPGKMGNKRSIKGLISPHAGYSYSG 60

Query: 55  RAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA ++  +    +   V +L P+H       +  T   + TP+G++ +D +   EL  
Sbjct: 61  PVAACSYMELAEDGMPETVVILCPNHTGIGSGLSTMTEGSWLTPLGEVEIDHQFASELLN 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
                  D      EHS E+ LP+L ++      ++VP+ +   + E     G   +   
Sbjct: 121 YYPLLDDDPSAHLKEHSCEVQLPFLQEISSD--FQMVPVCMMMQDLETSRELGEAISHTA 178

Query: 174 DDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                   V  S+DF H+                H+  EA DK  ++ I   D     + 
Sbjct: 179 RKLGRDLVVIASTDFTHYQP--------------HEVAEAQDKKVLEAISAMDELEMMRT 224

Query: 232 LLEYDNTICGRHPISVFLHML-GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           + E++ T+CG  P++  +    G  +T++ +  L+Y  S        S V YASA  K
Sbjct: 225 IQEFNVTMCGYGPVAATIEASKGMGATEVSV--LQYATSGDTGGDYTSVVGYASAIFK 280


>gi|149193804|ref|ZP_01870902.1| hypothetical protein CMTB2_01928 [Caminibacter mediatlanticus TB-2]
 gi|149135757|gb|EDM24235.1| hypothetical protein CMTB2_01928 [Caminibacter mediatlanticus TB-2]
          Length = 263

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 78/149 (52%), Gaps = 8/149 (5%)

Query: 41  RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
           + +I PHAG+ YSG  A +A+     T+  RV ++GPSH +     + +   VY+TP G 
Sbjct: 46  KALIVPHAGWMYSGFTANFAYRIASNTNPKRVVVIGPSHRFPIKGISTTLEDVYETPCGL 105

Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
           LP+D+E  +EL     F++ ++ +   EHS E+ +P++   F    I +V ++ G    E
Sbjct: 106 LPIDIEFAKEL--IKNFDVQNLEMVHQEHSTEVQMPFIYHYFGK--IPVVELIYGDYAPE 161

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
                 +   KY  + ++   +SSD  H+
Sbjct: 162 KL----KEIIKYAIEDNSLVVISSDLSHY 186


>gi|218781935|ref|YP_002433253.1| hypothetical protein Dalk_4100 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218763319|gb|ACL05785.1| protein of unknown function DUF52 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 282

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 70/251 (27%), Positives = 114/251 (45%), Gaps = 33/251 (13%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVR--GVIAPHAGYSYSGRAAAY 59
           +IRRA   GSWY  +     E+++ ++  A LP+  PEV+  G I PHAG+ YSG+ AA 
Sbjct: 2   EIRRAQFMGSWYPASENECREKIEDFI--ADLPQVKPEVKRIGGIVPHAGWVYSGKIAAQ 59

Query: 60  AFGNI---DPTSISRVFL--LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
               +   +P  +  VF   LGP H  +     +  +  ++TP G L +  +V  +L   
Sbjct: 60  VIAALKKNNPPDVIAVFGMHLGPRHPNF-----IMASGAWETPFGPLAIAEDVAGQLAKG 114

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN---AENEAMYGRLFAK 171
            +FE+        ++++E+ LP++   F    ++IVPI V  V+   A  EA        
Sbjct: 115 FRFEIESPDRHVQDNTIELQLPFIKYFFPD--VRIVPIGVPPVSRSLAIGEA-------- 164

Query: 172 YVDDPSNFFSV-----SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
           +VD   +F        S+D  H+G+ + +         I       DK  ++ IE  DP 
Sbjct: 165 FVDIARHFGYTAKVIGSTDLTHYGANYGFSPVGSGPKAIQWVKNENDKKLVERIEAMDPQ 224

Query: 227 AFKKYLLEYDN 237
                 +E  N
Sbjct: 225 GVIDQAMENSN 235


>gi|147921497|ref|YP_684688.1| hypothetical protein LRC400 [Methanocella arvoryzae MRE50]
 gi|110620084|emb|CAJ35362.1| conserved hypothetical protein [Methanocella arvoryzae MRE50]
          Length = 264

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 126/294 (42%), Gaps = 49/294 (16%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
           +RR + AG +Y  +   +   +         P+   +  G++ PHAGY YSG  AA  + 
Sbjct: 1   MRRPAVAGQFYPGSETEVRSLIARLAPGTSEPRDKIKACGIVVPHAGYVYSGGVAADVYS 60

Query: 63  NID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +I+  PT +    LLGPSH+      A+ST   ++TP+G + +D + +          L+
Sbjct: 61  SIEGAPTFV----LLGPSHYGVGSPVAVSTQP-WETPLGKVEVDQDFV---------RLL 106

Query: 121 DICVD------EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKY 172
           D  +D      ++EHS+E+ +P+L   F+    +IVPI +G  + ++  E     + A  
Sbjct: 107 DGIIDRDEIAHQSEHSIEVQIPFLQYFFKD--FRIVPICLGMQDYDSVREVATELITALE 164

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYD---KKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
             D       SSDF H      Y H     KK   + K IE LD              F 
Sbjct: 165 KYDGQVVLVASSDFTH------YEHISVAKKKDMTLIKDIEKLDV-----------PTFY 207

Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
             L   + T CG  PI+  + + G      K + +RY  S    T     V YA
Sbjct: 208 DDLYRLNATACGYGPIATTMMVCGARGAS-KAQLVRYATSGDV-TGDSQVVGYA 259


>gi|149173820|ref|ZP_01852449.1| hypothetical protein PM8797T_05265 [Planctomyces maris DSM 8797]
 gi|148847350|gb|EDL61684.1| hypothetical protein PM8797T_05265 [Planctomyces maris DSM 8797]
          Length = 1051

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 68/286 (23%), Positives = 128/286 (44%), Gaps = 28/286 (9%)

Query: 3    KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
            +IR A  AG++Y  +  R+  +L G L    +   P    ++ PHAG+ YSG+ AA    
Sbjct: 784  EIRPAGVAGTFYPADPARMTAQL-GELFHDQVDAQPWAAAMV-PHAGWKYSGKIAARVLN 841

Query: 63   NID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             I  P++I    ++GP H       A++    ++ P G+L  D  + ++L        +D
Sbjct: 842  RIQLPSTI---IVIGPKHTREGVDWAVAPHQAWQLPGGNLNSDRALAQKLAEQIPGLELD 898

Query: 122  ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN----AENEAMYGRLFAKYVDDPS 177
                 +EH++E+ LP + ++      K++ I++G+ N     E  A   R+  +  + P 
Sbjct: 899  AAAHRSEHAIEVELPLIQRLAPD--AKVIGIVIGSGNLPRCEEFAAGLARVIQEMPEPP- 955

Query: 178  NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
                +SSD  H+ +       DK++         LD++ ++ + + DP+   + + EY  
Sbjct: 956  -LLLISSDMNHFAT-------DKEN-------RRLDELALEKMRSLDPEGLLETVREYHI 1000

Query: 238  TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            ++CG  P  + +  L       +I+ + Y  S          V YA
Sbjct: 1001 SMCGVLPAVIVMKTLQKMGKLNQIEQVGYATSGDVTGDSSRVVGYA 1046


>gi|330834015|ref|YP_004408743.1| hypothetical protein Mcup_0150 [Metallosphaera cuprina Ar-4]
 gi|329566154|gb|AEB94259.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
          Length = 263

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 66/246 (26%), Positives = 110/246 (44%), Gaps = 22/246 (8%)

Query: 44  IAPHAGYSYSGRAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
           I PHAGY YSG  A+ ++   +       V +LGP+H  Y    ++     ++TP+G   
Sbjct: 32  IVPHAGYIYSGPIASNSYYYLVQEGKPDLVIILGPNHTGYGSWVSIWPEGEWRTPLGSYN 91

Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENE 162
           +D ++++EL +  +   +D      EHS+E+ LP+L   F    ++I+PI+V     E  
Sbjct: 92  VDQKLVKELVSVSEVIDIDEKAHLYEHSIEVQLPFLQYFFND--LRILPIVVLMQTPEIA 149

Query: 163 AMYGRLFAKYVDDPSNFFSV---SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
                   ++++  S+   V   SSD           HYD      H+   A D + +  
Sbjct: 150 ESIAEGIWRFINRHSDLDIVVLASSDLN---------HYDP-----HEVTMAKDDLVIRK 195

Query: 220 IETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS 279
           +E  D     + + E D T+CG  PI V L +    + +  I  LR+  S      + S 
Sbjct: 196 VEELDHKGLYRVIEEQDVTVCGYAPIMVSLILAKKMNKRPYI--LRHATSGDTSGDKSSV 253

Query: 280 VSYASA 285
           V Y +A
Sbjct: 254 VGYLAA 259


>gi|158520316|ref|YP_001528186.1| hypothetical protein Dole_0299 [Desulfococcus oleovorans Hxd3]
 gi|158509142|gb|ABW66109.1| protein of unknown function DUF52 [Desulfococcus oleovorans Hxd3]
          Length = 282

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 100/226 (44%), Gaps = 9/226 (3%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR A++AGSWY D +     ++  +L E  L P   E  G I PHAG+ YSG  A  A  
Sbjct: 3   IRPATYAGSWYPDTAAECRRQIKAFLAEKTLTPPDREPCGAIVPHAGWIYSGGIACRALA 62

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            +  + +    ++   H Y      + +     TP+G L +D  +  +L     F+   +
Sbjct: 63  LLKGSQVPDTIVIYGHHLYPKSSPRILSEGSMATPLGMLAVDEALAAKLMDRFTFQAESV 122

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
                ++++E+ LP++  +F    +KIVP  VG          GR   +   D      V
Sbjct: 123 HHFAPDNTIELQLPFIRYLFGD--VKIVP--VGVPPDAMAVEIGRFTVQAARDLGLSVKV 178

Query: 183 --SSDFCHWGSRFNYMHYDKKHGVIHKSIEA-LDKMGMDIIETGDP 225
             S+D  H+G+ F   HY       H+ + A  D+  +D + T DP
Sbjct: 179 VGSTDLTHYGAHFGLSHYGLGEAA-HRQVRANEDRRIIDRMLTLDP 223


>gi|340624697|ref|YP_004743150.1| hypothetical protein GYY_07765 [Methanococcus maripaludis X1]
 gi|339904965|gb|AEK20407.1| hypothetical protein GYY_07765 [Methanococcus maripaludis X1]
          Length = 284

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 78/298 (26%), Positives = 125/298 (41%), Gaps = 30/298 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
           R    AG +Y      L E ++  +L   G  + P  RG       +++PHAGY YSG  
Sbjct: 3   RNPVVAGMFYPAEYHELLEMIEYCYLNPRGPKELPSKRGKYTKPLGIVSPHAGYIYSGPV 62

Query: 57  AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA+ +  I       +   +LGP+H       A    T +KTP GD+ +D E  + L   
Sbjct: 63  AAHGYKKISENISGEITAIILGPNHTGLGSGIATMKGT-WKTPFGDMEIDNEFADRLWKE 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
                MD      EHS+E+ LP+L  + + ++   K VPI +   + E+    G + AK 
Sbjct: 122 CDILDMDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPISMMMQDYESCIDVGYVIAKV 181

Query: 173 VDDPS--NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
             + +  +    S+DF H+  +       KK  V+ K          DI+E  D + F  
Sbjct: 182 TRELNRKSVIIASTDFSHYEPQ---EQASKKDAVVIK----------DILELNDEEIFTD 228

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            ++ ++ ++CG  P+   +  + +   K     L Y  S          V YAS   K
Sbjct: 229 -VVTHNISMCGYGPVIAMVKAMKDLGAKTSY-LLYYSTSGDVTKDYSEVVGYASLLIK 284


>gi|282164310|ref|YP_003356695.1| hypothetical protein MCP_1640 [Methanocella paludicola SANAE]
 gi|282156624|dbj|BAI61712.1| conserved hypothetical protein [Methanocella paludicola SANAE]
          Length = 263

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 85/289 (29%), Positives = 130/289 (44%), Gaps = 35/289 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG +Y  +++ L  ++        L    +  GV+ PHAGY YSG  AA  + +
Sbjct: 1   MRNPAVAGMFYPGSNEALRRQIAELAPSVSL--KLDAMGVVVPHAGYMYSGGVAAKVYAS 58

Query: 64  ID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           I+  PT I    +LGP H +     A+ST   +KTP+G + +D + I +L   G  +  D
Sbjct: 59  IEGAPTFI----ILGPRHSWEGSAVAVSTVP-WKTPLGIVDVDHDFI-DLLPPGIID-HD 111

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDDPSNFF 180
               + EHS+E+ +P+L   F+    +IVPI +G  + E    + G +       P    
Sbjct: 112 ETAHQREHSLEVQVPFLQYFFKD--FRIVPIALGLQDYETVREVAGEITMAIEKYPKKVV 169

Query: 181 SV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            V SSDF H+      +   K H +I K IE LD  G           F   +   + T 
Sbjct: 170 IVASSDFTHYEP--VEVAKRKDHILIEK-IEKLDVPG-----------FYDEVANLNATC 215

Query: 240 CGRHPISVFLHMLGNCSTK--IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           CG  PI+    M+ +C  +   K + L Y  S      R+  V YA  A
Sbjct: 216 CGYGPIAA---MMLSCKRRGAKKAELLAYATSGDVTGDRE-VVGYAGLA 260


>gi|268323602|emb|CBH37190.1| conserved hypothetical protein, Memo-like protein family
           [uncultured archaeon]
          Length = 266

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 120/285 (42%), Gaps = 27/285 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR--GVIAPHAGYSYSGRAAAYAF 61
           +RR   AG++Y      L  +L+     AG+ +  + R  G + PHAGY +S   AA A+
Sbjct: 1   MRRPIVAGAFYEGERAGLERQLNDCF--AGITREEDERVIGAVVPHAGYMHSCSVAASAY 58

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +    I    +LGP+H       A+ST T ++TP+G + +D   +E L    +   +D
Sbjct: 59  AKLPSADI--FVILGPNHQGIGSLVAVSTDT-WETPLGAVDIDEAFVEALPK--RIIDLD 113

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPSNF 179
                 EH++E+ LP+L  +F       VPI +   + +     G   A  +   D    
Sbjct: 114 ENAHRYEHAIEVQLPFLQFLFHEQF-TFVPICMCLSDEDTAKEVGNELADTIAKTDKKVV 172

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSDF H+          +  G+        D+  ++ I+  D   F   +   + T 
Sbjct: 173 MLASSDFTHY----------EPEGIARDK----DEYVIEAIKELDVSKFYNRVYTRNVTA 218

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           CG  PI+  +H       + + + + Y  S      R S V YA+
Sbjct: 219 CGIGPIAAVMHAAKRLGAR-EGELVTYATSGDSTGDRSSVVGYAA 262


>gi|317052576|ref|YP_004113692.1| hypothetical protein Selin_2421 [Desulfurispirillum indicum S5]
 gi|316947660|gb|ADU67136.1| protein of unknown function DUF52 [Desulfurispirillum indicum S5]
          Length = 264

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 74/287 (25%), Positives = 119/287 (41%), Gaps = 35/287 (12%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
            IR+++ AG +Y DN + +   LD  +  A  P   +   V+ PHAG+ YSG+ AA    
Sbjct: 2   TIRQSAVAGMFYPDNPREIRAFLDHHMAAATAPV--DACAVVVPHAGWIYSGQLAAQVLA 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            +       V ++GP+H       ++    +++TP+GD+P+       L A       D 
Sbjct: 60  RVRVPDT--VVIIGPNHTGLGSAISIFPPGLWRTPVGDIPVSPHA-RTLAARLGLTC-DT 115

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
              + EHS+E+ LP L   +    ++I+ I V   + E+                     
Sbjct: 116 AAHQREHSLEVLLPMLH--YRNPSLQIIAITVAGGSPEDARELAHTLEAEFQGEDVLLLA 173

Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN----T 238
           SSD  H+ S+      D+K+ +  + I ALD  G               LLE  N    +
Sbjct: 174 SSDMNHFESQAV---SDRKNALAMERIAALDSPG---------------LLEVVNRHRIS 215

Query: 239 ICGRHPISVFLH--MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           +CG  P++V L   +L   S       + Y  S+      DS V YA
Sbjct: 216 MCGAFPVAVALENALLRGAS---HASLVGYTDSAAVSGDHDSVVGYA 259


>gi|227828313|ref|YP_002830093.1| hypothetical protein M1425_2054 [Sulfolobus islandicus M.14.25]
 gi|229585543|ref|YP_002844045.1| hypothetical protein M1627_2134 [Sulfolobus islandicus M.16.27]
 gi|238620505|ref|YP_002915331.1| hypothetical protein M164_2061 [Sulfolobus islandicus M.16.4]
 gi|385773993|ref|YP_005646560.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|259646573|sp|C3MZ11.1|Y2054_SULIM RecName: Full=MEMO1 family protein M1425_2054
 gi|259646574|sp|C4KJ99.1|Y2061_SULIK RecName: Full=MEMO1 family protein M164_2061
 gi|259646592|sp|C3N060.1|Y2134_SULIA RecName: Full=MEMO1 family protein M1627_2134
 gi|227460109|gb|ACP38795.1| protein of unknown function DUF52 [Sulfolobus islandicus M.14.25]
 gi|228020593|gb|ACP56000.1| protein of unknown function DUF52 [Sulfolobus islandicus M.16.27]
 gi|238381575|gb|ACR42663.1| protein of unknown function DUF52 [Sulfolobus islandicus M.16.4]
 gi|323478108|gb|ADX83346.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 284

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 129/297 (43%), Gaps = 35/297 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
           R  + AGS+Y  + K+L  +++   R       +PK    +        + PHAGY YSG
Sbjct: 3   RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62

Query: 55  RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
             AA+++  +   S  R   V +LGP+H       +      ++TP+G + +D E++ +L
Sbjct: 63  PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
               +   +D      EHS+E+ LP+L   F+    KIVPI++     E  E +   ++ 
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
               +P     V  SSD           HYD     + K  EA++K     I+  D    
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDEEAIEK-----IQQLDYKGL 225

Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
            + +   D T+CG  PI V L +      K  I  L++  S      +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280


>gi|134045236|ref|YP_001096722.1| hypothetical protein MmarC5_0191 [Methanococcus maripaludis C5]
 gi|166227494|sp|A4FWD2.1|Y191_METM5 RecName: Full=MEMO1 family protein MmarC5_0191
 gi|132662861|gb|ABO34507.1| protein of unknown function DUF52 [Methanococcus maripaludis C5]
          Length = 284

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 76/298 (25%), Positives = 125/298 (41%), Gaps = 30/298 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
           R    AG +Y      L E ++  +L   G  + P  RG       +++PHAGY YSG  
Sbjct: 3   RNPVVAGMFYPAEYHELLEMIEYCYLSPRGPKELPSKRGNYTKPLGIVSPHAGYIYSGPV 62

Query: 57  AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA+ +  I       V   +LGP+H       + +   ++KTP GD+ +D E  + L   
Sbjct: 63  AAHGYKKISENINGEVTAIILGPNHTGLGSGIS-TMKGIWKTPFGDMEIDNEFADRLWKE 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
                +D      EHS+E+ LP+L  + + ++   K VPI +   + E     G   AK 
Sbjct: 122 CDVLDLDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPICMMMQDYETSMDVGYFIAKV 181

Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
             + +       S+DF H+ S+ +     KK  ++ K          DI+E  D + F  
Sbjct: 182 AKEMNRKIIIIASTDFSHYESQES---ASKKDALVIK----------DILELKDEEIFTD 228

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            ++ ++ ++CG  P+   +  + +   K     L Y  S          V YAS   K
Sbjct: 229 -VVTHNISMCGYGPVIAMIKAMKDLGAK-NSNLLYYSTSGDVTKDYSEVVGYASILVK 284


>gi|330508374|ref|YP_004384802.1| hypothetical protein MCON_2560 [Methanosaeta concilii GP6]
 gi|328929182|gb|AEB68984.1| Protein of unknown function (DUF52) [Methanosaeta concilii GP6]
          Length = 237

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 73/227 (32%), Positives = 111/227 (48%), Gaps = 27/227 (11%)

Query: 38  PEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTP 97
           P V GV+ PHAGY YSGR AA  +  + P   + V +LGP+HH      ALS  + ++T 
Sbjct: 8   PIVGGVV-PHAGYVYSGRIAAMVYSRL-PKRETYV-ILGPNHHGLGSPVALSRDS-WRTA 63

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV 157
           +G +  DLE+ + L  TG     D    + EHS+E+ +P+L K FE    KI+ I +G  
Sbjct: 64  LGVVTPDLELADLL--TGSIIDHDEAAHQHEHSIEVQIPFLQKRFED--FKILAISMGMQ 119

Query: 158 NAENEAMYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
           + E     G   A+         +V  SSDF H+  R      +    V  + IEA+  +
Sbjct: 120 DEETAVEVGEEIARAALKLGRKCTVIASSDFTHYEPR------EVARKVDSQVIEAI--L 171

Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKI 259
            MDI     P  + + +  ++ + CG  PI+  +    +LG  S K+
Sbjct: 172 NMDI-----PQIYSR-VYRFNASSCGIGPIAATIAASRLLGASSGKL 212


>gi|452203979|ref|YP_007484112.1| hypothetical protein dcmb_1170 [Dehalococcoides mccartyi DCMB5]
 gi|452111038|gb|AGG06770.1| hypothetical protein dcmb_1170 [Dehalococcoides mccartyi DCMB5]
          Length = 438

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 72/287 (25%), Positives = 122/287 (42%), Gaps = 33/287 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +R+AS AG +Y  + +++       E+ G  +E  L       G + PHAGY YSG  A 
Sbjct: 2   LRQASAAGRYYPASKEKIRSVITPMEVKGIAKERYL-------GAVMPHAGYPYSGGVAM 54

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
                +D   I+   +LGPSH     + A+  + +++TP+G++ +D  +   +    +  
Sbjct: 55  AVGSRLDLPEIA--IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHI 112

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             D    + EHS+E+ +P L + F+   IKIVPI V    +E  A  G   A  + +   
Sbjct: 113 KADPSAHQYEHSIEVQIPIL-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGR 170

Query: 179 --FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                 SSD  H+  + +  H               D + +D I   D     + +    
Sbjct: 171 EAIIIASSDMTHYEPQAD-AHLK-------------DSLALDAIIKLDAAEMLERIQANH 216

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            T+CG  P++  L  +     K + + + Y+ S       D  V YA
Sbjct: 217 ITMCGYAPVAAMLTAVKELGAK-RARVVAYQTSGDITHHLDQVVGYA 262


>gi|147669761|ref|YP_001214579.1| hypothetical protein DehaBAV1_1122 [Dehalococcoides sp. BAV1]
 gi|146270709|gb|ABQ17701.1| protein of unknown function DUF52 [Dehalococcoides sp. BAV1]
          Length = 438

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 72/287 (25%), Positives = 122/287 (42%), Gaps = 33/287 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +R+AS AG +Y  + +++       E+ G  +E  L       G + PHAGY YSG  A 
Sbjct: 2   LRQASAAGRYYPASKEKIRSVITPMEVKGIAKERYL-------GAVMPHAGYPYSGGVAM 54

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
                +D    +   +LGPSH     + A+  + +++TP+G++ +D  +   +    +  
Sbjct: 55  AVASRLDLPETA--IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHI 112

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
             D    + EHS+E+ +P L + F+   IKIVPI V    +E  A  G   A  + +   
Sbjct: 113 KADPSAHQYEHSVEVQIPIL-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGR 170

Query: 179 --FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                 SSD  H+ S+ +  H               D + +D I   D     + +    
Sbjct: 171 EAIIIASSDMTHYESQAD-AHLK-------------DSLALDAIIKLDAAEMLERIQANH 216

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            T+CG  P++  L  +     K + + + Y+ S       D  V YA
Sbjct: 217 ITMCGYAPVAAMLTAVKELGAK-RARVVAYQTSGDITHHLDQVVGYA 262


>gi|124027421|ref|YP_001012741.1| hypothetical protein Hbut_0536 [Hyperthermus butylicus DSM 5456]
 gi|123978115|gb|ABM80396.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
          Length = 297

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/293 (21%), Positives = 122/293 (41%), Gaps = 30/293 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSGR 55
           R  + AG +Y  +++ L  +++         G++ EA   +  E  G + PHAGY YSG 
Sbjct: 18  RLPAVAGLFYEADAEALKAQIEAAFRHPLGPGYVPEASSERRRESLGFLVPHAGYMYSGP 77

Query: 56  AAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
            A +A+  +     +  +++ GP+H       ++   T + TP+G+L +D E+   L   
Sbjct: 78  IAVHAYAKMALEGSAETYVIVGPNHTGLGASVSVYPGTAWSTPLGELQVDTELARVLVKA 137

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
             +  +D      EHS+E+ LP+L  +F    ++I+P++V     E          + V+
Sbjct: 138 SSYAELDEKAHLYEHSVEVQLPFLQYLFNAR-VRILPVVVYEQTPEVARDIAEALVEAVE 196

Query: 175 DPSN---FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                  F + S          N  HY+     + K     DK  +D I + + +   + 
Sbjct: 197 KTGRDVVFIATS----------NLTHYEPYEAAVEK-----DKKVIDAIISLNAEELYRI 241

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +     ++CG       ++          ++ L+Y  S      + S V YA+
Sbjct: 242 VTSTPVSMCGPAAAMALIYY-ARLRDARGVELLKYATSGDVAGDKKSVVGYAA 293


>gi|452851477|ref|YP_007493161.1| conserved protein of unknown function [Desulfovibrio piezophilus]
 gi|451895131|emb|CCH48010.1| conserved protein of unknown function [Desulfovibrio piezophilus]
          Length = 268

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 102/248 (41%), Gaps = 23/248 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R    AG +Y  +  RL   LDG+L  A     P +  ++ PHAGY YSG       G  
Sbjct: 3   REPVVAGRFYEADPTRLNTVLDGYLSGAVRRTEPTILAMV-PHAGYVYSGALCGQTLGT- 60

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
             +  S V LLGP+H     + +L     +  P G++P+D  +   L  +      D   
Sbjct: 61  -ASLASTVLLLGPNHTGRGKQFSLWPEGEWAIPGGNVPIDTSLATVLLESNSMLSADTEA 119

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV---DDPSNFFS 181
              EHS+E+ LP+L  +        VPI + A   E     GR     +   + P +   
Sbjct: 120 HIGEHSLEVVLPFLKHMNPD--TTFVPIALSANRFEAMEEIGRTIGDVIQAFERPVSII- 176

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           VSSD  H+              + H+  + +D M ++     DP A    + E   ++CG
Sbjct: 177 VSSDMSHY--------------ISHEKAQEMDSMALEACVELDPLALFTTVREKRISMCG 222

Query: 242 RHPISVFL 249
             P++  L
Sbjct: 223 VLPMTCGL 230


>gi|297619510|ref|YP_003707615.1| hypothetical protein Mvol_0985 [Methanococcus voltae A3]
 gi|297378487|gb|ADI36642.1| protein of unknown function DUF52 [Methanococcus voltae A3]
          Length = 286

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 73/261 (27%), Positives = 119/261 (45%), Gaps = 47/261 (18%)

Query: 42  GVIAPHAGYSYSGRAAAYAFGNI-----DPTSISRVFLLGPSHHYYTPKCALSTATVYKT 96
           G+I PHAGY YSG  A+Y++  +     D T+I    LL P+H     + A+S   +++T
Sbjct: 51  GLICPHAGYEYSGITASYSYYELSKRLGDETTI---ILLAPNHTGMGARVAISNE-IWET 106

Query: 97  PIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILV 154
           P+GD+  DLE+I+EL +   FEL DI     E+S+E+ LP++   ++      KIVPI  
Sbjct: 107 PLGDIKPDLELIDELISHDLFELDDIA-HLQEYSVEVQLPFIKHLELLNISKFKIVPICC 165

Query: 155 GAVNAENEAMYGRLFAKYVDDPSNF-----FSVSSDFCHWGSRFNYMHYDKKHGVIHKSI 209
            ++  ++   Y  + A   +  +          S+DF H+  +   +  D K   + K+I
Sbjct: 166 QSMEYDD---YVNMGASIYESATKLNKKVVIIASTDFSHYEPQETTIKKDAK---VIKNI 219

Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTI--CGRHPISVFLHMLGNCSTKI----KIKF 263
             +D+  +             Y   YDN +  CG   +   L     C+ K+      + 
Sbjct: 220 LEMDEKAI-------------YEAIYDNNVSMCGYGQVITML-----CALKLFGAESAEL 261

Query: 264 LRYEQSSQCKTKRDSSVSYAS 284
           L Y  S     +  S V Y S
Sbjct: 262 LNYMTSGDLTQEYTSVVGYGS 282


>gi|389852931|ref|YP_006355165.1| hypothetical protein Py04_1519 [Pyrococcus sp. ST04]
 gi|388250237|gb|AFK23090.1| hypothetical protein Py04_1519 [Pyrococcus sp. ST04]
          Length = 291

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/290 (23%), Positives = 122/290 (42%), Gaps = 8/290 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           R  + AG +Y +  + L E L+ +  + G       +   +APHAGY +SG  A+  +  
Sbjct: 3   RYPAVAGQFYPEGDE-LIEMLENFFSDLGEEGDRRRITAGVAPHAGYVFSGYTASRTYKA 61

Query: 64  IDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           I       VF++ GP+H       A+     ++TP+G + +D ++   +        +D 
Sbjct: 62  IYEDGFPEVFVIFGPNHTGMGSPIAVYPEGEWETPLGSVKVDSQMARTIVELSGIADLDD 121

Query: 123 CVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
              + EHS+E+ LP++  + E  G    IVPI +G  + +     GR   +         
Sbjct: 122 LAHKYEHSIEVQLPFIQYLAELAGKDFMIVPITLGIQDEDVANTLGRAVYEAAQSLGRDV 181

Query: 181 SV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
            V  S+DF H+G  + Y+ +  +   I   +   D   +  I   D +     + E ++T
Sbjct: 182 VVIASTDFMHYGYFYGYVPFTGRAEDIPNMVREWDMRVIRRILDFDVEGMFDEIREMNHT 241

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           +C          +    S  ++ + L Y  S +     D+ V YAS   K
Sbjct: 242 MC-GPGGVGVGIVYSKLSGAVEAELLHYTTSYEVSKSTDAIVGYASIVMK 290


>gi|23466270|ref|NP_696873.1| hypothetical protein BL1725 [Bifidobacterium longum NCC2705]
 gi|239621655|ref|ZP_04664686.1| conserved hypothetical protein [Bifidobacterium longum subsp.
           infantis CCUG 52486]
 gi|23327019|gb|AAN25509.1| widely conserved hypothetical protein with duf51 [Bifidobacterium
           longum NCC2705]
 gi|239515530|gb|EEQ55397.1| conserved hypothetical protein [Bifidobacterium longum subsp.
           infantis CCUG 52486]
          Length = 596

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 76/282 (26%), Positives = 120/282 (42%), Gaps = 64/282 (22%)

Query: 4   IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
           IR ++ AGS+Y               D  ++L ++L+  L  AG+P     R VI PHAG
Sbjct: 48  IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAGVP-----RAVIVPHAG 101

Query: 50  YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           Y YSG AAA A+  ++    S++R  ++GP+H       A STA  ++TP+G +P+D   
Sbjct: 102 YIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161

Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
                 L V E L+        A     +++      EH++E+ +P+L  V  G  + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220

Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
           P+  G    +     G +       P     +SSD  H+                H+   
Sbjct: 221 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           ALD   +        D    +L  +    CG +PI+  L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298


>gi|419847558|ref|ZP_14370727.1| AMMECR1 family protein [Bifidobacterium longum subsp. longum 1-6B]
 gi|386410763|gb|EIJ25536.1| AMMECR1 family protein [Bifidobacterium longum subsp. longum 1-6B]
          Length = 599

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 76/282 (26%), Positives = 120/282 (42%), Gaps = 64/282 (22%)

Query: 4   IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
           IR ++ AGS+Y               D  ++L ++L+  L  AG+P     R VI PHAG
Sbjct: 51  IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAGVP-----RAVIVPHAG 104

Query: 50  YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           Y YSG AAA A+  ++    S++R  ++GP+H       A STA  ++TP+G +P+D   
Sbjct: 105 YIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 164

Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
                 L V E L+        A     +++      EH++E+ +P+L  V  G  + IV
Sbjct: 165 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 223

Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
           P+  G    +     G +       P     +SSD  H+                H+   
Sbjct: 224 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 266

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           ALD   +        D    +L  +    CG +PI+  L +L
Sbjct: 267 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 301


>gi|124485503|ref|YP_001030119.1| hypothetical protein Mlab_0680 [Methanocorpusculum labreanum Z]
 gi|124363044|gb|ABN06852.1| protein of unknown function DUF52 [Methanocorpusculum labreanum Z]
          Length = 279

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 80/154 (51%), Gaps = 14/154 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R ++ +G +Y  N + L + L   L  A      +  G++ PHAGY YSG+ AAY +  I
Sbjct: 17  RLSTLSGRFYPKNEQEL-DALLSALFAATETSVSDPYGILVPHAGYVYSGKTAAYGYAAI 75

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTAT---VYKTPIGDLPLDLEVIEELKATGKFELMD 121
            P       LLGPSH        L T+T   +++TP+G++  D   IE L A  +  + +
Sbjct: 76  SPAFNGTFVLLGPSH------AGLETSTADMIWETPLGNVFPDSAFIEALSA--QIPVRN 127

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
             +   E+S+E+ LP++   F     +IVPIL+G
Sbjct: 128 DLISAEENSLEVQLPFIRYRFPKA--RIVPILMG 159


>gi|150402556|ref|YP_001329850.1| hypothetical protein MmarC7_0632 [Methanococcus maripaludis C7]
 gi|166228803|sp|A6VGX3.1|Y632_METM7 RecName: Full=MEMO1 family protein MmarC7_0632
 gi|150033586|gb|ABR65699.1| protein of unknown function DUF52 [Methanococcus maripaludis C7]
          Length = 284

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/298 (25%), Positives = 125/298 (41%), Gaps = 30/298 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
           R    AG +Y  +   L E ++  +L   G  + P  RG       +++PHAGY YSG  
Sbjct: 3   RNPVVAGMFYPADYHELLEMIEYCYLSPRGPKELPSKRGNYTKPLGIVSPHAGYIYSGPV 62

Query: 57  AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA+ +  I       V   +LGP+H       + +   ++KTP GD+ +D E  ++L   
Sbjct: 63  AAHGYKKISENVSGEVTAIILGPNHTGLGSGIS-TMKGIWKTPFGDMEIDNEFADKLWKE 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
                +D      EHS+E+ LP+L  + E ++   K VPI +   + E     G   AK 
Sbjct: 122 CDVLDLDENSHLREHSIEVQLPFLKHLEELNIAKFKFVPICMMMQDYETSMDVGYFIAKV 181

Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
             + +       S+DF H+  + +     KK  ++ K          DI+E  D + F  
Sbjct: 182 AKEMNRKIIIIASTDFSHYEPQES---ASKKDALVIK----------DILELNDEEIFTD 228

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            ++ ++ ++CG  P+   +  + +         L Y  S          V YAS   K
Sbjct: 229 -VVTHNISMCGYGPVIAMVKAMKDLGAT-NSNLLYYSTSGDVTKDYSEVVGYASILVK 284


>gi|154249943|ref|YP_001410768.1| hypothetical protein Fnod_1264 [Fervidobacterium nodosum Rt17-B1]
 gi|154153879|gb|ABS61111.1| protein of unknown function DUF52 [Fervidobacterium nodosum
           Rt17-B1]
          Length = 267

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/286 (23%), Positives = 127/286 (44%), Gaps = 31/286 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--PEVRGVIAPHAGYSYSGRAAAYAFG 62
           R+   +G +Y     +L +  + ++ E    K    +  G+I PHAGY YSG+ A    G
Sbjct: 3   RKPVVSGKFYPGTPIQLEKTCEAFVGERATNKDFFSQPVGLILPHAGYVYSGKTA--GMG 60

Query: 63  NIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
               T   R   + + GP+H  Y    ++ +  +++TP+G++ ++ E+ ++L        
Sbjct: 61  IKKATEYGRPKNIIIFGPNHTGYGELVSVWSEGIWQTPLGNIEVNSEIADKL-IDNTVIF 119

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI-LVGAVNAENEAMYGRLFAKYVDDPSN 178
            D      EHS+E+ LP L   F G   KI+P+ ++    +    +  +L     + P  
Sbjct: 120 SDEMAHLYEHSIEVQLPLLQYAF-GEF-KIIPVCMMDQRLSTVSKIVDKLKQIIKEYPDT 177

Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
               SSDF          HYD     + K     DK+ ++ I  GD +   + + +++ T
Sbjct: 178 LVVASSDFN---------HYDPHEITLEK-----DKLAIEKILEGDIEGLYERIKKHNIT 223

Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +CG  P++V   +  N      ++ + +  S++       +V YAS
Sbjct: 224 MCGPGPVAVVRSLFSN------VELVYHTTSAEFSQDYSYTVGYAS 263


>gi|15606223|ref|NP_213601.1| hypothetical protein aq_890 [Aquifex aeolicus VF5]
 gi|7388441|sp|O67039.1|Y890_AQUAE RecName: Full=MEMO1 family protein aq_890
 gi|2983422|gb|AAC07005.1| hypothetical protein aq_890 [Aquifex aeolicus VF5]
          Length = 267

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/246 (26%), Positives = 116/246 (47%), Gaps = 21/246 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAF 61
           ++R  + AG++Y  + + L + +D  L     PK   + + ++ PHAGY YSG+ A   +
Sbjct: 2   RVREPAVAGTFYPKDKEELNKLMD--LLCGFEPKEKIKPKAILVPHAGYIYSGKTACEVY 59

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             I+     +V LLGP+H       ++ +   ++TP G + +D E+ E++     +   D
Sbjct: 60  KRIEIP--EKVVLLGPNHTGLGKPISVYSGDAWETPYGVVEIDGELREKILKY-PYANPD 116

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EHS+E+ LP+L + +     KI+PI+V  V  E    +GR   + + +      
Sbjct: 117 EYAHLYEHSLEVQLPFLQR-YARREFKILPIVVTFVEYEVAKDFGRFLGEVLKEEDALIV 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +SSD  H+          KK  ++  ++E L+   +          FK   ++Y+ T+CG
Sbjct: 176 ISSDMSHY---VPAEEARKKDEILISAMERLNTEEL---------YFKA--VQYNITMCG 221

Query: 242 RHPISV 247
             P  V
Sbjct: 222 VVPAVV 227


>gi|242280444|ref|YP_002992573.1| hypothetical protein Desal_2982 [Desulfovibrio salexigens DSM 2638]
 gi|242123338|gb|ACS81034.1| protein of unknown function DUF52 [Desulfovibrio salexigens DSM
           2638]
          Length = 270

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 123/286 (43%), Gaps = 30/286 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGW--LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           R+   AG +Y D+  +L +EL  +   + AG  + P  R ++ PHAGY +SG        
Sbjct: 3   RQPVVAGRFYPDSPDQLRKELQMYSGTQNAGGDR-PVDRLIMVPHAGYMFSGEPCGKTLA 61

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
               +    VFLLGP+H       ++     ++ P G L +D E+  +L  +    + + 
Sbjct: 62  QSKLSDT--VFLLGPNHTGLGSPLSVWDKGSWEFPGGKLDVDEELAAKLIESRTGFVSNE 119

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS- 181
                EHS+E+ +P+L   +    +KIVP+ V   +  N    G   A+ + D S   S 
Sbjct: 120 KAHSREHSLEVIVPFLH--YLNPAMKIVPVCVSEASPSNLRKAGEAIAEIMKDSSQPISM 177

Query: 182 -VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            VSSD  H+              V     + LD M ++ I   DP      +     ++C
Sbjct: 178 VVSSDMSHF--------------VSADQAKKLDSMALEAIIRMDPSDLYSIVSSNQISMC 223

Query: 241 GRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           G  P+++ +     LG  S K+    ++Y  S Q     +  V+YA
Sbjct: 224 GVLPMTMAMFAAKKLGATSGKL----VQYTNSGQATGDYERVVAYA 265


>gi|310778291|ref|YP_003966624.1| hypothetical protein [Ilyobacter polytropus DSM 2926]
 gi|309747614|gb|ADO82276.1| protein of unknown function DUF52 [Ilyobacter polytropus DSM 2926]
          Length = 277

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 123/294 (41%), Gaps = 38/294 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-----GVIAPHAGYSYSGRAAA 58
           +R+   AGS+Y ++ + + E  +  L++       E++     G ++PHAGY Y  R A 
Sbjct: 3   VRKCGVAGSFYPNSPEEIKEIFEEALQKESDNIKTELKNKNIIGGVSPHAGYVYCVREAV 62

Query: 59  YAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
           + F  +         V L+ P+H  Y    ++ +   ++TP G + LD E  +EL     
Sbjct: 63  HLFEILREKGEKYDTVVLVNPNHTGYGEAVSIDSNKAWETPFGSIELDSEFGDELSFP-- 120

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
              ++      EHS E+ LPYL   F     KI+PI +       +    +  A+ + + 
Sbjct: 121 ---VEPMAQRFEHSGEVMLPYLY-YFIKKGFKILPICM----MRQDLKTAKNIAEKIKNA 172

Query: 177 SN------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
           S          +SSDF H+ S       D        +IE+L KM        D   F+ 
Sbjct: 173 SEKLNRKILILISSDFTHFHSSEEGAKLDS------YAIESLLKM--------DSGEFQD 218

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            +LE D +ICG  PI V L          K++ L+   S +     D  V Y S
Sbjct: 219 RILEKDISICGMGPIMVLLEYSKMILKNPKLEILKRGHSGEV-YPSDEVVDYVS 271


>gi|118580112|ref|YP_901362.1| hypothetical protein Ppro_1691 [Pelobacter propionicus DSM 2379]
 gi|118502822|gb|ABK99304.1| protein of unknown function DUF52 [Pelobacter propionicus DSM 2379]
          Length = 267

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 4/134 (2%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+   AG +Y  N  +L  +L   + +A  PK   VRG+IAPHAGY YSG  A  A+G I
Sbjct: 3   RQPVVAGQFYPGNPDQLRNDLARMIPQAD-PKR-RVRGIIAPHAGYVYSGAIAGQAYGAI 60

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
              + +   ++GP+HH      AL  A  + TP+G + ++  + + +     F  +D   
Sbjct: 61  QIPATA--LIIGPNHHGAGDPAALFPAGQWLTPLGPVTINSRLNDLIATHVPFVHLDSLA 118

Query: 125 DEAEHSMEMHLPYL 138
              EHS+E+ LP+L
Sbjct: 119 HRFEHSLEVQLPFL 132


>gi|374302050|ref|YP_005053689.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
 gi|332554986|gb|EGJ52030.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfovibrio africanus str. Walvis Bay]
          Length = 269

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 116/283 (40%), Gaps = 25/283 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RR   AG +Y   + +L   +  +L +AG          + PHAGY +SG  A    G  
Sbjct: 3   RRPVVAGQFYPGQADQLESMVGEYLSKAGAKAQERTILAMTPHAGYVFSGSVAGQTLGRA 62

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
           +      V LLGP+H     + A+ +  +++ P G L ++  + + +       + D   
Sbjct: 63  NLAKT--VLLLGPNHTGMGSRFAVWSDGIWELPGGGLNVNEGLAKAIIKADARLVADQTA 120

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLFAKYVDDPSNFF 180
              EHS+E+ LP+L  +       IVPI V      V  E  A  GR+ A +   P +  
Sbjct: 121 HAREHSLEVVLPFLRAIDPE--TTIVPIAVAEPRLEVLLEVGAAIGRVLASW-KHPVSMV 177

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            VSSD  H+              V H+  +  D M ++ I   +P    + + E   ++C
Sbjct: 178 -VSSDMSHY--------------VTHEEAKRWDSMALEPILALNPSEAYRVVRESGISMC 222

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           G  P++V L    +     K + + Y  S +        V YA
Sbjct: 223 GIMPMTVGLTAAESLEAS-KAELVAYATSGETSGDYTQVVGYA 264


>gi|86159016|ref|YP_465801.1| hypothetical protein Adeh_2594 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85775527|gb|ABC82364.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 265

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 84/163 (51%), Gaps = 9/163 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG++Y   +  LA E+DGWL   G P +P + GV+ PHAGY YSG  A   F  
Sbjct: 2   VREPAVAGAFYDARAATLAAEVDGWLSAGGAP-APAL-GVLVPHAGYVYSGAVAGATFAR 59

Query: 64  IDPTSISRVFLLGPSHHYYT-PKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +      R  +LGP+H        AL     ++TP+G +P+D E+   L A       D 
Sbjct: 60  V--AVPPRAIVLGPNHTGLGHAAAALWPGGAWRTPLGTVPVDAELTAALAAAPGVA-GDR 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-AENEAM 164
                EHS+E+ +P+L +   G  + I  + +G ++ AE EA+
Sbjct: 117 LAHLREHSLEVEVPFLQRARPG--VAIAALCLGPLSFAECEAL 157


>gi|91203620|emb|CAJ71273.1| conserved hypothetical protein [Candidatus Kuenenia
           stuttgartiensis]
          Length = 267

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 123/285 (43%), Gaps = 29/285 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR+ + AGS+Y+ +  RL  E+D ++ +    +S    G ++PHAGY YSG  A   + +
Sbjct: 2   IRQPAVAGSFYSGDKSRLQHEIDTFIIKDCEKQS--ALGAVSPHAGYMYSGSIAGSLYSH 59

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           I  T    V +L P+H  Y    ++     + TP G++ ++ E ++EL  +      D  
Sbjct: 60  I--TIPDLVVILSPNHTGYGKPYSIWPGGSWITPFGEIAVNEEAVDELVNSCHLIERDKE 117

Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFFS 181
               EH+ E+ +P++   +     +IV + + +   ++    G+  ++ +    P     
Sbjct: 118 AHLYEHAAEVQIPFIQ--YFNQKTEIVVMTIASRKIQDLKTIGKCMSQMLQKLHPDALVV 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSD  H   + +    +KK  +    I AL++           D     + E   ++CG
Sbjct: 176 ASSDMTHHEPQAS---ANKKDNIAINEILALNE-----------DGLYNKVRELRISMCG 221

Query: 242 RHPISVFLHMLGNCSTKIKIK---FLRYEQSSQCKTKRDSSVSYA 283
            +P  + L     CS +   K    +RY  S       D  V YA
Sbjct: 222 IYPAVIML----TCSKERGAKEAILVRYATSGDVTGDYDQVVGYA 262


>gi|291287744|ref|YP_003504560.1| hypothetical protein Dacet_1840 [Denitrovibrio acetiphilus DSM
           12809]
 gi|290884904|gb|ADD68604.1| protein of unknown function DUF52 [Denitrovibrio acetiphilus DSM
           12809]
          Length = 260

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 89/185 (48%), Gaps = 11/185 (5%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+A+ AG +Y  ++  + E +   L + G P+  +  GV+ PHAGY YSG  A      +
Sbjct: 3   RQAAVAGVFYPADAGSVTEFIKSSLPD-GAPE--KAVGVVVPHAGYIYSGATAVRTLARV 59

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                  V L+GP+H    P  ++     + TP+GD+P+D  ++++      F   D   
Sbjct: 60  KIPD--TVILVGPNHTGAGPSISVYPEGSWATPLGDVPVDSVLVDKFCENPLFN-KDTTA 116

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
             +EHS+E+ LP L K F    +K+V + V  +N E      +  A   D     F +SS
Sbjct: 117 HHSEHSLEVILPIL-KYFNPD-VKVVCVTVKYINLETAETAAKHIADVTD---ALFVISS 171

Query: 185 DFCHW 189
           D  H+
Sbjct: 172 DLNHF 176


>gi|320101532|ref|YP_004177124.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
 gi|319753884|gb|ADV65642.1| protein of unknown function DUF52 [Desulfurococcus mucosus DSM
           2162]
          Length = 283

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 123/309 (39%), Gaps = 59/309 (19%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVR--------GVIAPHAGYSYS 53
           K R A  AG +Y D  + L + ++   +    P S P V         G + PHAGY YS
Sbjct: 2   KRRTAVVAGYFYPDKREELVKVIEWSFKHGVGPGSTPSVSEAPVSNIIGYVVPHAGYIYS 61

Query: 54  GRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           G  AA+A+     N  P +I    +LG +H       ++    +++TPIGDL +D EV E
Sbjct: 62  GPVAAHAYYDMAVNGRPDTI---VILGTNHTGMGRIVSVYPGGLWETPIGDLTVDGEVAE 118

Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEG-----------HLIKIVPILVGAVN 158
            +         D      EHS+E+ LP++A ++ G           H   I   L  A+ 
Sbjct: 119 GIVKHSSVAEFDEYAHLEEHSVEVQLPFIAYLYNGRVRIVPVVIRVHTPDIARDLAEAIR 178

Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
              E+   ++              SSDF          HY+     + K + A+++    
Sbjct: 179 RVTESSGKKI----------IILASSDFN---------HYEPHDETVRKDMAAINE---- 215

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTK 275
            I   D D F   +L  D +ICG   I   +    MLG      K   L++  S      
Sbjct: 216 -ILKLDSDGFYNTMLRDDISICGPGGIMTLIEYTRMLGG-----KALLLKHATSGDTSGD 269

Query: 276 RDSSVSYAS 284
           R   V YAS
Sbjct: 270 RSHVVGYAS 278


>gi|254167087|ref|ZP_04873940.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
 gi|197623943|gb|EDY36505.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
          Length = 271

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 128/291 (43%), Gaps = 35/291 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP--------EVRGVIAPHAGYSYSG 54
           +R  + AG +Y    K L   L+  ++   G    P        +++G + PHAGY +SG
Sbjct: 1   MRYPAVAGQFYPAERKELEIMLNELFIHPIGPGSVPSLIPDGKRKIKGGVVPHAGYMFSG 60

Query: 55  RAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ +  +        F ++GP+H+      A++    + TP G + +D E+ +++ A
Sbjct: 61  PVAAHFYHALALDGFPESFIIIGPNHYGVGSGVAIALED-FLTPFGKVKVDRELAKDI-A 118

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
            G  ++ D      EHS+E+ LP+L   F    IK VPI +   + E     G +  + +
Sbjct: 119 KGVIDI-DGYAHRYEHSIEVQLPFLQ--FFKKEIKFVPITMLLQDYEIAIEVGEIIKEAI 175

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
            D       SSDF H+  + N  + +             D M ++ I  GD       + 
Sbjct: 176 ADKDVVIIASSDFSHYVPK-NKAYTN-------------DYMAIERIVNGDVKGLYDVIY 221

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +++ T+CG  P++  L  +G  +T      L+Y  S   +   +  V YA+
Sbjct: 222 KHNITMCGYGPVTAMLTAIGGKAT-----LLKYATSGDIQP-MNEVVGYAA 266


>gi|291303740|ref|YP_003515018.1| hypothetical protein Snas_6306 [Stackebrandtia nassauensis DSM
           44728]
 gi|290572960|gb|ADD45925.1| protein of unknown function DUF52 [Stackebrandtia nassauensis DSM
           44728]
          Length = 254

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 106/232 (45%), Gaps = 32/232 (13%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-RGVIAPHAGYSYSGRAAAY 59
           M  +R A+ AG +Y   ++ L   ++  +    +P S E+ R  + PHAGY YSG  AA+
Sbjct: 1   MSDVRPAAVAGRFYPGAAESLRTMIESMVDSIEVPDSDELARAYVVPHAGYRYSGPTAAH 60

Query: 60  AFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
            +  +      + RV L+GPSH       A S A  ++TP+G   +     E + A    
Sbjct: 61  VYARLRHHAARVKRVVLVGPSHFVPLQGLATSPAAGWQTPLGT--VTTPAAEGIPA---- 114

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
              +    E EHS+E+ LP+L +V  G  +++ PI+VG    E+ A         VDD +
Sbjct: 115 ---EEAPHEREHSLEVQLPFL-QVCVGD-VEVTPIVVGKSTIEDAAT---AINGLVDDQT 166

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
                S+D  H+        +D+      ++ + LD+   D I   +P   K
Sbjct: 167 VLL-CSTDLSHY--------HDE------ETAKRLDRATADAILAAEPRRIK 203


>gi|163782830|ref|ZP_02177826.1| hypothetical protein HG1285_15881 [Hydrogenivirga sp. 128-5-R1-1]
 gi|159881951|gb|EDP75459.1| hypothetical protein HG1285_15881 [Hydrogenivirga sp. 128-5-R1-1]
          Length = 266

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/281 (22%), Positives = 124/281 (44%), Gaps = 24/281 (8%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           R+ + AG++Y    ++L++ +D    E    K  + + ++ PHAG  YSG+ A   +  +
Sbjct: 3   RKPAVAGTFYPAEVEKLSKLVDLLCGEESAEKM-KAKAILVPHAGLIYSGKTACAVYKRV 61

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                 R+ LLGP+H       ++     +++P G++ +D E + ++     +   D   
Sbjct: 62  --YIPERIVLLGPNHTGMGTDISVYPGEAWESPYGEVQID-EELRDMVLKYPYASPDESA 118

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
              EHS+E+ LP+L +  E    +I+PI++G ++ +    +GR     +        +SS
Sbjct: 119 HIYEHSLEVQLPFLFRYAE-RPFRILPIILGLLDYDRARDFGRFLGSLLQGIDALIVISS 177

Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL--LEYDNTICGR 242
           D  H      Y+  D+        I A++++  D          + YL  ++Y+ T+CG 
Sbjct: 178 DMSH------YIPADEARRKDEILISAMERLQTD----------ELYLKRVQYNITMCGF 221

Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            P    +        +  I  + Y  S +     D  V+YA
Sbjct: 222 IPAVAGIEASKVLGARQGI-LIDYTNSGEVTGDYDRVVAYA 261


>gi|410465125|ref|ZP_11318489.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
           MBC34]
 gi|409981753|gb|EKO38278.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
           MBC34]
          Length = 278

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 78/288 (27%), Positives = 118/288 (40%), Gaps = 31/288 (10%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
            +R+   AG +Y  +   LA E   +L EAG P        +APHAG  YSG  A    G
Sbjct: 10  SLRQPVVAGRFYPADPAALARETAAYLAEAGKPSERPTILAMAPHAGAVYSGPVAGKTLG 69

Query: 63  --NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
             N+  T    + LLGP+H       A+     ++ P  D+P++  + E L A       
Sbjct: 70  AANLADT----LLLLGPNHTGRGAPMAVWPDGAWRIPGRDVPVEAALAEALLAAAPGLSP 125

Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV-----GAVNAENEAMYGRLFAKYVDD 175
           D      EHS+E+ LP+L  V  G   +IVP+ V     G +     AM G L A    +
Sbjct: 126 DQAAHLGEHSLEVLLPFLTAVRPG--CRIVPVAVAEPRPGVLAETARAMAGVLAAW--PE 181

Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
           P +   VSSD  H+          K+       + ALD  G+  +  G+           
Sbjct: 182 PISII-VSSDMSHY---LPQATAQKRDSAALARVLALDPQGLLEVVRGE----------- 226

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           D ++CG  P+++ L ++         +   Y  S +     +  V YA
Sbjct: 227 DISMCGVLPMTLGL-LIARALGATSARLAAYATSGEVSGDYEQVVGYA 273


>gi|374633849|ref|ZP_09706214.1| putative dioxygenase [Metallosphaera yellowstonensis MK1]
 gi|373523637|gb|EHP68557.1| putative dioxygenase [Metallosphaera yellowstonensis MK1]
          Length = 282

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 128/294 (43%), Gaps = 29/294 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRGV--------IAPHAGYSYSG 54
           IRR + AGS+Y D+ K L + ++  +L   G  + PE            I PHAGY YSG
Sbjct: 2   IRRPAVAGSFYEDDPKELRKRIEWSFLHHVGPGRLPEPSNAARRTSPFFIVPHAGYIYSG 61

Query: 55  RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+++ + +       V ++GP+H  Y    ++     ++TP+G +P+ +E ++E+ +
Sbjct: 62  PVAAHSYYHLVREGRPDLVIMVGPNHTGYGSSVSVWPGGHWETPLGKVPVPIEFVKEIVS 121

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG--RLFAK 171
             +    +      EHS+E+ +P+L   F    I  + IL+           G  R   +
Sbjct: 122 LSEVLDPEERAHLYEHSLEVQVPFLQYFFTEIPIIPITILLQTPETAEFVAEGIWRFMGR 181

Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
           +  D       SSD           HYD     I K  +A+ K    IIE  D     + 
Sbjct: 182 H-KDLDIVVLASSDMN---------HYDPHEMTIEKDEKAIKK----IIEL-DYRGLYRV 226

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
           + E D T+CG  P+ V + +      K  I  L++  S      + S V Y +A
Sbjct: 227 VEEEDVTMCGYAPVMVSMILARKAGKKPYI--LKHATSGDTSGDKSSVVGYLAA 278


>gi|45358950|ref|NP_988507.1| hypothetical protein MMP1387 [Methanococcus maripaludis S2]
 gi|48428699|sp|P61652.1|Y1387_METMP RecName: Full=MEMO1 family protein MMP1387
 gi|45047816|emb|CAF30943.1| Protein of unknown function DUF52 [Methanococcus maripaludis S2]
          Length = 284

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 124/298 (41%), Gaps = 30/298 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
           R    AG +Y      L E ++  +L   G  + P  RG       +++PHAGY YSG  
Sbjct: 3   RNPVVAGMFYPAEYHELLEMIEYCYLNPRGPRELPSKRGKYTKPLGIVSPHAGYIYSGPV 62

Query: 57  AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA+ +  I       +   +LGP+H       A    T +KTP GD+ +D E  + L   
Sbjct: 63  AAHGYKKISENISGEITAIILGPNHTGLGSGIATMKGT-WKTPFGDMEIDNEFADRLWKE 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
                MD      EHS+E+ LP+L  + + ++   K VPI +   + E+    G + AK 
Sbjct: 122 CDILDMDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPISMMMQDYESCIDVGYVIAKV 181

Query: 173 VDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
             + +    +  S+DF H+  +       KK  V+ K          DI+E  D + F  
Sbjct: 182 ARELNRKIVIIASTDFSHYEPQ---EQASKKDAVVIK----------DILELNDEEIFTD 228

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            ++ +  ++CG  P+   +  + +   +     L Y  S          V YAS   K
Sbjct: 229 -VVTHTISMCGYGPVIAMVKAMKDLGARTSY-LLYYSTSGDVTKDYSEVVGYASLLIK 284


>gi|254167561|ref|ZP_04874412.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
 gi|289596975|ref|YP_003483671.1| protein of unknown function DUF52 [Aciduliprofundum boonei T469]
 gi|197623370|gb|EDY35934.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
 gi|289534762|gb|ADD09109.1| protein of unknown function DUF52 [Aciduliprofundum boonei T469]
          Length = 271

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 128/291 (43%), Gaps = 35/291 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP--------EVRGVIAPHAGYSYSG 54
           +R  + AG +Y    K L   L+  ++   G    P        +++G + PHAGY +SG
Sbjct: 1   MRYPAVAGQFYPAERKELEIMLNELFIHPIGPGSVPSLIPDGKRKIKGGVVPHAGYMFSG 60

Query: 55  RAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA+ +  +        F ++GP+H+      A++    + TP G + +D E+ +++ A
Sbjct: 61  PVAAHFYHALALDGFPESFIIIGPNHYGVGSGVAIALED-FLTPFGKVKVDRELAKDI-A 118

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
            G  ++ D      EHS+E+ LP+L   F    IK VPI +   + E     G +  + +
Sbjct: 119 KGVIDI-DGYAHRYEHSIEVQLPFLQ--FFKKEIKFVPITMLLQDYEIAIEVGEIIKEAI 175

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
            D       SSDF H+  + N  + +             D M ++ I  GD       + 
Sbjct: 176 VDKDVVIIASSDFSHYVPK-NKAYTN-------------DYMAIERIVNGDVKGLYDVIY 221

Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           +++ T+CG  P++  L  +G  +T      L+Y  S   +   +  V YA+
Sbjct: 222 KHNITMCGYGPVAAMLTAIGGKTT-----LLKYATSGDIQP-MNEVVGYAA 266


>gi|374635034|ref|ZP_09706639.1| protein of unknown function DUF52 [Methanotorris formicicus
           Mc-S-70]
 gi|373563436|gb|EHP89630.1| protein of unknown function DUF52 [Methanotorris formicicus
           Mc-S-70]
          Length = 286

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 81/303 (26%), Positives = 130/303 (42%), Gaps = 44/303 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD-GWLR---------EAGLPKSPEVRGVIAPHAGYSYS 53
           IR    AG +Y  +S  L E ++  +L          E G+ K P   G+I PHAGY YS
Sbjct: 2   IRNPVVAGVFYPADSNELIELIEYCYLHKLGPKEIPSEGGIFKKP--VGLICPHAGYIYS 59

Query: 54  GRAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
           G  AA+++  +    D        ++GP+H       A +   ++KTP+G+L +D E ++
Sbjct: 60  GPIAAHSYNALSKRADIEDEITAVIIGPNHTGLGTGVA-TMKGIWKTPLGNLEIDNEFVD 118

Query: 110 ELKATGKFELMDICVDEA----EHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEA 163
            L    + ++MD+  DE     EHS+E+ LP+L    +      K VPI +   + E   
Sbjct: 119 RLWK--ECDIMDL--DETSHLHEHSIEVQLPFLQHLSILNIAKFKFVPISMLFQDYETSV 174

Query: 164 MYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE 221
             G   AK   + +    V  S+DF          HY+ +     K     D + +  I 
Sbjct: 175 DVGYFIAKIAKELNRRVVVIASTDFT---------HYEPQEVAAKK-----DAIAIKNIL 220

Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
             D +     ++ Y+ ++CG  P+   +  +     K + K L Y  S        S V 
Sbjct: 221 NMDEEELYSDVVNYNISMCGCGPVMAMIKAMKLLGGK-EAKLLSYATSGDITKDYSSVVG 279

Query: 282 YAS 284
           YAS
Sbjct: 280 YAS 282


>gi|296454703|ref|YP_003661846.1| dioxygenase [Bifidobacterium longum subsp. longum JDM301]
 gi|296184134|gb|ADH01016.1| dioxygenase [Bifidobacterium longum subsp. longum JDM301]
          Length = 595

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 76/282 (26%), Positives = 121/282 (42%), Gaps = 64/282 (22%)

Query: 4   IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
           IR  + AGS+Y               D S+ + ++L+  L  AG+PK+     VI PHAG
Sbjct: 48  IRPPAVAGSFYPADRTTLKQLITHQLDYSREVLQQLEPTL-PAGVPKA-----VIVPHAG 101

Query: 50  YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           Y YSG AAA A+  ++    S++R  ++GP+H       A STA  ++TP+G +P+D   
Sbjct: 102 YVYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161

Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
                 L V E L+        A+    +++      EH++E+ +P+L  V  G  + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGASAPAMIVNGPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220

Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
           P+  G    +     G +       P     +SSD  H+                H+   
Sbjct: 221 PLNAGDATPQE---AGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           ALD   +        D    +L  +    CG +PI+  L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298


>gi|159905669|ref|YP_001549331.1| hypothetical protein MmarC6_1286 [Methanococcus maripaludis C6]
 gi|229554358|sp|A9A9S6.1|Y1286_METM6 RecName: Full=MEMO1 family protein MmarC6_1286
 gi|159887162|gb|ABX02099.1| protein of unknown function DUF52 [Methanococcus maripaludis C6]
          Length = 284

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 75/298 (25%), Positives = 125/298 (41%), Gaps = 30/298 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
           R    AG +Y      L E ++  +L   G  + P  RG       +++PHAGY YSG  
Sbjct: 3   RNPVVAGMFYPAEYHELLEMIEYCYLSPRGPKELPSKRGNYTKPLGIVSPHAGYIYSGPV 62

Query: 57  AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
           AA+ +  I       V   +LGP+H       + +   ++KTP GD+ +D E  ++L   
Sbjct: 63  AAHGYKKISENVSGEVTAIILGPNHTGLGSGIS-TMKGIWKTPFGDMEIDNEFADKLWKE 121

Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
                +D      EHS+E+ LP+L  + + ++   K VPI +   + E     G   AK 
Sbjct: 122 CDVLDIDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPISMMMQDYETSIDVGYFIAKV 181

Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
             + +       S+DF H+  + +     KK  ++ K          DI+E  D + F  
Sbjct: 182 AKEMNRKIIIIASTDFSHYEPQES---ASKKDAIVIK----------DILELKDEEIFID 228

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
            ++ ++ ++CG  P+   +  + +   K     L Y  S          V YAS   K
Sbjct: 229 -VVTHNISMCGYGPVIAMVKAMKDLGAKTA-NLLYYSTSGDVTKDYSEVVGYASILVK 284


>gi|407464049|ref|YP_006774931.1| hypothetical protein NSED_00885 [Candidatus Nitrosopumilus sp. AR2]
 gi|407047237|gb|AFS81989.1| hypothetical protein NSED_00885 [Candidatus Nitrosopumilus sp. AR2]
          Length = 276

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 59/248 (23%), Positives = 109/248 (43%), Gaps = 29/248 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSPE-----VRGVIAPHAGYSYS 53
           IR    AG +Y +      E+L   +R     E G   +P+     + G+I PHAGY YS
Sbjct: 2   IREPVVAGQFYPEKE----EDLKNMIRYCIDHEYGPGNNPKTSNDKIYGIICPHAGYVYS 57

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G  A +++  I   +     ++GP+H       A    + ++TP+G + +D E  +++  
Sbjct: 58  GPTACHSYKAISSQNPELAIIIGPNHFGVGKDAATMIDSQWQTPLGMVSIDSESAKQVAE 117

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
             +F  +D      +HS+E+ +P L ++      +I+PI++ A + +     G    +  
Sbjct: 118 ISEFIEIDEYSHSQDHSLEVQIPMLQEMLSNEF-QILPIILRAQDMKTAMDVGNAVYEIA 176

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
              +     SSDF H+          +++   H+     DK  ++ I   + + F + L 
Sbjct: 177 KRKNAIIVASSDFTHY----------EENSFAHQQ----DKALIEPILEMNVERFYQVLY 222

Query: 234 EYDNTICG 241
           E   T CG
Sbjct: 223 ERRVTACG 230


>gi|242399201|ref|YP_002994625.1| hypothetical protein TSIB_1222 [Thermococcus sibiricus MM 739]
 gi|242265594|gb|ACS90276.1| hypothetical protein TSIB_1222 [Thermococcus sibiricus MM 739]
          Length = 292

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 67/295 (22%), Positives = 134/295 (45%), Gaps = 16/295 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           +R  + AGS+Y    + L   L  + R+ G L +  ++   +APHAGY +SG  A+  + 
Sbjct: 2   LRYPTVAGSFYP-TGEELKMMLKEFFRDLGELGEERKITAGVAPHAGYVFSGFTASRTYK 60

Query: 63  NIDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            I    +   F+ +GP+H       A+     + TP+G + +D ++ + +        +D
Sbjct: 61  AIYEDGLPETFVVIGPNHTGLGSPVAIYPEGKWITPMGGIKVDEDLAKAIARHSGIADLD 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
               + EHS+E+ +P++  + +     ++IVPI +G  + E     G+   +   +    
Sbjct: 121 ELAHKYEHSIEVQIPFIQYISQKAEEEVRIVPITLGLQDEEVAEDLGKAIFEASQELGRD 180

Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALD-KMGMDIIETGDPDAFKKYLLEYD 236
             V  S+D  H+G  + Y+ +  +   +   I+  D ++   I+E      F + + + +
Sbjct: 181 IVVIASTDMMHYGYAYGYVPFRARGDDLLGRIKEWDFRIIQKILEFDHKGMFDE-IRKMN 239

Query: 237 NTICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           +T+CG   ++   VF  + G   T++    L Y  S +     D+ V Y S   +
Sbjct: 240 HTMCGPGGVATAIVFSRLAGAVETEV----LHYTTSFEVSRSTDAIVGYVSIVMR 290


>gi|217076175|ref|YP_002333891.1| hypothetical protein THA_42 [Thermosipho africanus TCF52B]
 gi|419760738|ref|ZP_14287007.1| hypothetical protein H17ap60334_08093 [Thermosipho africanus
           H17ap60334]
 gi|217036028|gb|ACJ74550.1| hypothetical protein THA_42 [Thermosipho africanus TCF52B]
 gi|407514244|gb|EKF49089.1| hypothetical protein H17ap60334_08093 [Thermosipho africanus
           H17ap60334]
          Length = 262

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 117/284 (41%), Gaps = 32/284 (11%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN- 63
           R   +A  +Y  NS+ L +  + + +   +  +    G+I PHAGY YSG+ A YA    
Sbjct: 3   REMYYADKFYPKNSELLIKFFNQFCKVKEIKNTNSNFGLILPHAGYVYSGKTALYAICKA 62

Query: 64  IDPTSISRVFLLGPSHHYYTPK-CALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +D     R+ + G +H       C+L     + TP G + +D E+   L     FE  + 
Sbjct: 63  LDFGKPERIIIFGTNHTGIAGNICSLWPRGKWNTPFGYVGIDEELNNYLLKNEIFE-ENY 121

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
                EHS+E+ LP L   F     K VPI+    + EN      L  K +D  +     
Sbjct: 122 KAHILEHSIEVLLPILKYYFGS--FKFVPIIYNYQSYENTIKIVELLKK-IDLKNTLLVA 178

Query: 183 SSDFCHWGSRFNYMHYDKK--HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
           SSDF H+ S    +  D+K    ++ ++IE +                  Y L    + C
Sbjct: 179 SSDFNHYESHEITLRKDEKLIEKILERNIEGI------------------YSLRKSISAC 220

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           G  P+SV      N      +K + +  S++       +V YAS
Sbjct: 221 GIGPMSVLTMYFDN------VKLVYHTTSAEYSKDYTFTVGYAS 258


>gi|268678675|ref|YP_003303106.1| hypothetical protein Sdel_0033 [Sulfurospirillum deleyianum DSM
           6946]
 gi|268616706|gb|ACZ11071.1| protein of unknown function DUF52 [Sulfurospirillum deleyianum DSM
           6946]
          Length = 263

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 89/195 (45%), Gaps = 13/195 (6%)

Query: 3   KIRRASHAGSWYTDNS---KRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAA-- 57
           K R+++ AG +Y ++    K      +  ++E      P  + +I PHAGY YSG  A  
Sbjct: 2   KTRKSAVAGMFYPESCHDIKAFITHFESQIKEPTCTIVP--KALIVPHAGYVYSGYTANL 59

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           AY +     + I  V ++GPSH  Y    +++    Y TP G++ +DLE    L+    F
Sbjct: 60  AYHYTASKRSDIHCVVVIGPSHRIYLEGASIALYESYHTPCGEIAIDLEYSHALQKRFSF 119

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
                   E EHS E+ +P++   F+   +K+V I+ G +          L  + + +  
Sbjct: 120 LSFHPSAHE-EHSTEVQMPFIRHYFKH--VKVVEIVYGDI---THGELSNLMDEVLKEEG 173

Query: 178 NFFSVSSDFCHWGSR 192
               VS+D  H+ S 
Sbjct: 174 RLLVVSTDLSHFHSE 188


>gi|303248253|ref|ZP_07334516.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
 gi|302490391|gb|EFL50302.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
          Length = 280

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 21/283 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--PEVRGVIAPHAGYSYSGRAAAYAF 61
           IR+   AG +Y  +   L  E+ G+L +A  P+S  P +  ++ PHAGY +SG  A    
Sbjct: 13  IRQPVVAGRFYPGDPAGLRREVAGYLAQAA-PRSEKPTLLAMV-PHAGYVFSGLVAGRTL 70

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
           G        R+ LLGP+H     + A+     +  P  ++P+D  +  EL A       D
Sbjct: 71  GAA--RLADRLLLLGPNHTGQGARLAVWPEGAWLVPGCEVPVDAGLAGELMAAAPRLASD 128

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
           +     EHS+E+ LP+L  V  G   +IVPI V    AE +    R  A  +       +
Sbjct: 129 VAAHMGEHSLEVELPFLCAVNPGA--RIVPICV----AEPDPGVLREVAGAI------VA 176

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           V  +  H  S    +  D  H + H + +  D M +  +   DP+     +     T+CG
Sbjct: 177 VLRELSHPVSIV--VSSDMSHYISHTAAKKRDSMALSRVLALDPEGLCAVVRSEGITMCG 234

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             P+++ L +      +  +    Y  S +     +  V YA 
Sbjct: 235 VLPMALGLFVAIGLGARDAL-LAAYATSGEVTGDMEQVVGYAG 276


>gi|227546624|ref|ZP_03976673.1| dioxygenase [Bifidobacterium longum subsp. longum ATCC 55813]
 gi|227212941|gb|EEI80820.1| dioxygenase [Bifidobacterium longum subsp. infantis ATCC 55813]
          Length = 596

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 75/282 (26%), Positives = 119/282 (42%), Gaps = 64/282 (22%)

Query: 4   IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
           IR ++ AGS+Y               D  ++L ++L+  L  AG+P     R VI PHAG
Sbjct: 48  IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAGVP-----RAVIVPHAG 101

Query: 50  YSYSGRAAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           Y YSG AAA A+  ++     ++R  ++GP+H       A STA  ++TP+G +P+D   
Sbjct: 102 YIYSGTAAALAYALLERGRGRVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161

Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
                 L V E L+        A     +++      EH++E+ +P+L  V  G  + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220

Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
           P+  G    +     G +       P     +SSD  H+                H+   
Sbjct: 221 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           ALD   +        D    +L  +    CG +PI+  L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298


>gi|189345729|ref|YP_001942258.1| hypothetical protein Clim_0180 [Chlorobium limicola DSM 245]
 gi|189339876|gb|ACD89279.1| protein of unknown function DUF52 [Chlorobium limicola DSM 245]
          Length = 285

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 52/232 (22%), Positives = 113/232 (48%), Gaps = 19/232 (8%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGL--PKSPEVRGVIAPHAGYSYSGRAAAYA 60
           +IR  + A ++Y  + + L   +   L++      K  ++R ++ PH+GY YSG  AA A
Sbjct: 4   EIRYPAVAEAFYPADIRELESLITSLLQQVKPEHKKRKKIRAIVTPHSGYLYSGSIAACA 63

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  I  ++   VFL+G +H Y     AL +   + TP+G++P++ ++   L       + 
Sbjct: 64  YKAITGSTFKNVFLMGHAHAYLFEGVALDSHKAWHTPLGNVPVNTDMNARLIGLAPELVH 123

Query: 121 DICV-DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
           ++ +   ++H++E+ +P+L +  +     IVP+L G  NA +  +        + +P + 
Sbjct: 124 ELNIAHHSDHTLEVQIPFLLRTLKPDF-SIVPMLFGE-NAIDVHLKTADLLLPLLNPDDL 181

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
             VS+D  H+ +              ++    +D++ ++ I + D D  +++
Sbjct: 182 LVVSTDLSHYPA--------------YRDANTIDRLTLEHIVSSDIDGLEEH 219


>gi|413951293|gb|AFW83942.1| hypothetical protein ZEAMMB73_880994, partial [Zea mays]
          Length = 136

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 34/59 (57%), Positives = 41/59 (69%)

Query: 48  AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
           +G  +  R   +A   +     SRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D E
Sbjct: 69  SGSIHGAREEGFARRLLVHQRCSRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQE 127


>gi|372270868|ref|ZP_09506916.1| dioxygenase [Marinobacterium stanieri S30]
          Length = 267

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 94/195 (48%), Gaps = 18/195 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           +R+ + AG +Y      L + +D  L ++     LP+ P     + PHAG  YSG  AA 
Sbjct: 6   VRQPAVAGVFYPGTETDLNQLVDQLLVQSTEPVSLPRHP--CAFVVPHAGLVYSGPIAAL 63

Query: 60  AFGNID---PTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
            +  I+   P +  R V LLGP+H       A    T + TP+GD+ +DL++   L+A  
Sbjct: 64  VYRQIERWMPLAGWRQVVLLGPNHRVPLSGFAGVAETNWHTPLGDVAIDLDLERSLQA-- 121

Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
           +F+L +     + EH +E+ LP+L KV      +++P+LVG    E       L A+   
Sbjct: 122 RFDLPVRQDAHQLEHCLEVQLPFLQKVAPD--ARVLPLLVGHTPVET---VSALIAELWQ 176

Query: 175 DPSNFFSVSSDFCHW 189
                  VSSD  H+
Sbjct: 177 REDVLVLVSSDLSHF 191


>gi|298529330|ref|ZP_07016733.1| protein of unknown function DUF52 [Desulfonatronospira
           thiodismutans ASO3-1]
 gi|298510766|gb|EFI34669.1| protein of unknown function DUF52 [Desulfonatronospira
           thiodismutans ASO3-1]
          Length = 266

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 69/276 (25%), Positives = 115/276 (41%), Gaps = 24/276 (8%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
           AG +Y  ++  L E+L  +L+  G   S      + PHAGY +SG  A       +  S 
Sbjct: 8   AGQFYPGSAAALDEKLSRYLQ--GEQASARTLLAMVPHAGYPFSGPVAGKVLARSNLAS- 64

Query: 70  SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
            R+ LLGP+H     + A+     ++ P G++ +D +    +     +   D      EH
Sbjct: 65  -RIILLGPNHTGQGQRIAVWGDGSWRLPGGEVMVDEDTASVIGKLPGYSF-DYQAHLREH 122

Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS--VSSDFC 187
           S+E+ LP+LA+   G   KIVPI V   + +     G   A+ + D +   S  VS+D  
Sbjct: 123 SLEVILPFLARAVPG--CKIVPISVAEPDLQVLMRAGEDLARTIRDLALDVSLVVSTDMS 180

Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
           H+              V  +    LD M ++ I   DP    + + +   ++CG  P ++
Sbjct: 181 HF--------------VPQEQARRLDHMAIERILDLDPQGLHQVVKQNRISMCGVMPTTL 226

Query: 248 FLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            L  +     +   +F+ Y  S          V YA
Sbjct: 227 GLACVREMGAR-STEFIDYATSGDAIGDYSQVVGYA 261


>gi|270308481|ref|YP_003330539.1| hypothetical protein DhcVS_1094 [Dehalococcoides sp. VS]
 gi|270154373|gb|ACZ62211.1| hypothetical protein DhcVS_1094 [Dehalococcoides sp. VS]
          Length = 438

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 35/288 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
           +R AS AG +Y  + +++       E+ G  +E  L       G + PHAGY YSG  A 
Sbjct: 2   LRPASAAGRYYPASKEKIRSVITPLEIKGVTKERYL-------GAVMPHAGYPYSGGVAM 54

Query: 59  YAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
                +D P +   V +LGPSH     + A+  + +++TP+G++ +D  +   +    + 
Sbjct: 55  AVATRLDLPET---VIILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRH 111

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
              D    + EHS+E+ +P L + F+ + IKIVPI +     E     G   A  + +  
Sbjct: 112 IKADPSAHQYEHSIEVQIPIL-QYFKPN-IKIVPITLSFGKTETLEDIGHGIASALRETG 169

Query: 178 N--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
                  SSD  H+  +    H               D M +D I   D     + +   
Sbjct: 170 REAIIIASSDMTHYEPQTE-AHLK-------------DSMALDAIVKLDAADMLERIQAN 215

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
             T+CG  P++  L  +     K + + + Y+ S       D  V YA
Sbjct: 216 HITMCGYAPVAAMLTAVKEMGAK-RARVVAYQTSGDITHHMDQVVGYA 262


>gi|253700550|ref|YP_003021739.1| hypothetical protein GM21_1928 [Geobacter sp. M21]
 gi|251775400|gb|ACT17981.1| protein of unknown function DUF52 [Geobacter sp. M21]
          Length = 266

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 66/287 (22%), Positives = 128/287 (44%), Gaps = 34/287 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           IR+ + AG +Y+ + ++L EEL   + RE    +  +  GV+APHAGY YSG AA   + 
Sbjct: 2   IRQPAVAGKFYSADPEQLREELSRMIPRE----EPAKAIGVVAPHAGYVYSGGAAGKVYA 57

Query: 63  NID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
            ++ P +   V +LGP+H       AL+ +  + TP+G + ++  + + +         D
Sbjct: 58  AVEVPDA---VIVLGPNHTGMGAAAALAPSGEWLTPLGPVRVNDRLSQLIMKHAPLVRED 114

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
                 EHS+E+ +P+L   +    + I  + +   + ++ +  G   A  + +  +   
Sbjct: 115 SVAHRFEHSLEVQVPFLQ--YRNPQVSIAALCLSLPDFDSISRVGTGIAHAIAEYGSEVL 172

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
              SSD  H+ S                  +A D + ++ +   DP+   +   + D ++
Sbjct: 173 IVASSDMTHYES--------------AAQAKAKDDLALERMACMDPEGLLRVCRDKDISM 218

Query: 240 CGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           CG  P +  L     LG   +++    + Y  S +     ++ V+YA
Sbjct: 219 CGVIPATAMLVAAKTLGASGSRL----ICYTNSGEVNGDLNAVVAYA 261


>gi|374316993|ref|YP_005063421.1| putative dioxygenase [Sphaerochaeta pleomorpha str. Grapes]
 gi|359352637|gb|AEV30411.1| putative dioxygenase [Sphaerochaeta pleomorpha str. Grapes]
          Length = 301

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 133/302 (44%), Gaps = 33/302 (10%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAG---LPKS---PEVRGVIAPHAGYSYSGRA 56
           ++R A  AGSWY  ++  L   L+  L+EA    L K    PE R  I PHAG S+S R 
Sbjct: 9   RVRHAGFAGSWYPLDATELESFLEISLQEAKDRELQKGSFYPEGRFAILPHAGLSFSQRG 68

Query: 57  AAYAFGNIDPTSISRVFLLGPSHH--YYTPKCALSTATVYKTPIGDL---PLDLEVIEEL 111
            A  F +  P +  RV ++ PSH+   Y+ + ++ +   Y+TP+G+L    L   +    
Sbjct: 69  IAPFFLHFPPQA-ERVLIIAPSHYTSLYSDELSVGSFDEYETPLGNLKGFSLSFGIKGGE 127

Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEA------ 163
           +A            ++EH++EM +PYLA V +     + +   LV  ++  ++A      
Sbjct: 128 RAI-----------QSEHAVEMVMPYLAWVVKTRKQELAVSTALVSHLSDCSKAKEIAHR 176

Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-HGVIHKSIEALDKMGMDIIET 222
           +   L  + +         SSDF H+G RF +  Y+      I   ++  D     ++  
Sbjct: 177 LIDNLGEESLRSGKTLVIASSDFTHYGRRFAHTPYENNLPRAIFNRVKEDDLALSALLID 236

Query: 223 GDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
           G+      +   + +T+CG    ++ +  L  C          Y  +   + + D  V+Y
Sbjct: 237 GNIKEALAFCENHHSTVCGLASGAI-VATLATCFNAKGTLSEYYTSNDVFENEEDEFVTY 295

Query: 283 AS 284
           +S
Sbjct: 296 SS 297


>gi|256089007|ref|XP_002580610.1| hypothetical protein [Schistosoma mansoni]
          Length = 80

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 5/79 (6%)

Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSS 270
           GM++IE   P  F  YL +Y NTICGRHPI V L ++        +    +KF+RY QS 
Sbjct: 1   GMELIERLAPSEFTSYLEQYGNTICGRHPIGVLLQIVAYLRRNMPNNNFNMKFVRYAQSE 60

Query: 271 QCKTKRDSSVSYASAAAKV 289
            C     SSVSYA+   ++
Sbjct: 61  HCNNMNQSSVSYAAGVLQI 79


>gi|189424944|ref|YP_001952121.1| hypothetical protein Glov_1885 [Geobacter lovleyi SZ]
 gi|189421203|gb|ACD95601.1| protein of unknown function DUF52 [Geobacter lovleyi SZ]
          Length = 267

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 24/249 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR+A+ AG +Y  +   L  EL   L      K  +  GVI+PHAGY YSG AA      
Sbjct: 2   IRQAAVAGQFYPGSQSALIRELQILLPAVKTRK--KAIGVISPHAGYLYSGSAAGQLLAG 59

Query: 64  ID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           I+ P +   V +LGP+HH      A+S    ++TP+G +P++  +   ++        D 
Sbjct: 60  IELPRT---VLILGPNHHGTGALAAVSPDDGWQTPLGVVPIEKRLAGLIQQHVPRVEQDS 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
                EHS+E+ +P+L  +     + IVP+ +   +     + G   A  + D       
Sbjct: 117 SAHRHEHSLEVQVPFLQYLRPD--LNIVPLCLAFGDYAGCELLGNGLAAAIRDFGEEVLI 174

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSD  H+ S        +K  +  + I A D  G+  +E    +           T+C
Sbjct: 175 LASSDMTHYES---VATARRKDSLALERILAFDAQGL--VEVCRTERI---------TMC 220

Query: 241 GRHPISVFL 249
           G  P +V L
Sbjct: 221 GVIPAAVML 229


>gi|383787285|ref|YP_005471854.1| putative dioxygenase [Fervidobacterium pennivorans DSM 9078]
 gi|383110132|gb|AFG35735.1| putative dioxygenase [Fervidobacterium pennivorans DSM 9078]
          Length = 266

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 72/283 (25%), Positives = 117/283 (41%), Gaps = 26/283 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           RR   AG +Y     +L    + +L  E    +  +  G+I PHAGY YSG+ A      
Sbjct: 3   RRPVVAGKFYPGTPNQLTSMCEAFLGSEKPKNRLEKPLGLILPHAGYIYSGKTAGLGIKK 62

Query: 64  -IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            ++    S + +LGP+H  Y    ++     +  P G + ++  + EE+   GK    D 
Sbjct: 63  AVEFGKPSNIIILGPNHTGYGASVSVWKEGEWFVPNGSVRVNSIIAEEI-IDGKIIEEDE 121

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD-PSNFFS 181
                EHS+E+ LP L   F G   +IVPI +                K ++  P     
Sbjct: 122 SAHLYEHSIEVQLPLLIHAF-GEF-QIVPICMMDQRLTKARFIAERIRKILEKYPDTLVV 179

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
            SSDF          HYD     + K     D++ +  I + D +   + L +YD T+CG
Sbjct: 180 ASSDFN---------HYDPHELTMKK-----DEIAITRIISNDLEGLYEDLKKYDITMCG 225

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
             P++V   +  N       + + +  S++       +V YAS
Sbjct: 226 PGPVAVVRSLFENA------ELVYHTTSAEFSQDFSYTVGYAS 262


>gi|325958574|ref|YP_004290040.1| hypothetical protein Metbo_0817 [Methanobacterium sp. AL-21]
 gi|325330006|gb|ADZ09068.1| UPF0103 protein [Methanobacterium sp. AL-21]
          Length = 279

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 74/296 (25%), Positives = 132/296 (44%), Gaps = 37/296 (12%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEV------RGVIAPHAGYSYSGRA 56
           IR+ + AG +Y  +   L ++++   +    P S P++      +GV+ PHAGY YSG  
Sbjct: 2   IRKPAVAGIFYEKDPDMLRKQIEWCFKHNLGPGSIPDIGNERTIKGVVTPHAGYIYSGPV 61

Query: 57  AAYAFGNIDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           AA+++ +I        F+ L P+H       +      ++TP+G++ +D E   +L    
Sbjct: 62  AAHSYHDIAEDGFPETFIVLCPNHTGLGSGVSTMNQGSWETPLGNVEID-ETFADL-LIE 119

Query: 116 KFELMDICVDE--AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
           K  +MD   D    EHS E+ LP+L   +     K VP+ +   + +     G   A+  
Sbjct: 120 KTGIMDSNPDAHLREHSAEVQLPFLQ--YLDPDFKFVPVTMWMQDLQTSLEIGVSIAQAA 177

Query: 174 D--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
              D       S+DF H+  +      D +   + ++I+A+D+  M              
Sbjct: 178 KELDRDVIVIASTDFTHYQPKKQAYMQDMQ---VIEAIKAMDEKRM-----------MNV 223

Query: 232 LLEYDNTICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
           + E + T+CG  P++   V +  +G  S+++K    +Y  S        S V+YAS
Sbjct: 224 VAEQNVTMCGYGPVAATLVAVKQMGANSSELK----KYATSGDTTGDNSSVVAYAS 275


>gi|116620880|ref|YP_823036.1| hypothetical protein Acid_1761 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224042|gb|ABJ82751.1| protein of unknown function DUF52 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 404

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 130/302 (43%), Gaps = 34/302 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLRE-AGLPKSPEVRG---VIAPHAGYSYSGRAAAY 59
           +R  +HAGS Y       AEE++  +RE  G  + PE  G   + APH       ++   
Sbjct: 111 VREPAHAGSAYPGE----AEEVNAMMREYMGGVEPPEPDGLFAIAAPHVSPEGGWQSYRA 166

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG--KF 117
           A+G + P    R F++  + HY  P+    T   ++TP+GD   D  +++ L   G    
Sbjct: 167 AYGMLRPEHKDRTFVVLATSHYGEPEKFGLTRKAFRTPLGDARSDTALVDWLAKHGGAGV 226

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN--------AENEAMYGRLF 169
           ++ D C    EH++E+ + +L  +  G  +KI+P+L+G            E++      F
Sbjct: 227 KMEDYC-HSFEHTVELQVIFLQHML-GADVKILPVLIGGFAHSIYEGGMPEDDDKVRAFF 284

Query: 170 -----AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
                 +  +    F+ +  D  H G+R+      +      +++ + D+  ++ I   D
Sbjct: 285 DALSELREREGDKLFWVMGVDMAHMGARYQDSFAARAGEGEMEAVASRDEQRIERINALD 344

Query: 225 PDAFKKYLLEYDNTI--CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
              F + + E  + +  CG  P   FL      + K + + L YE   Q     +S VS+
Sbjct: 345 AGGFWELVRERQDDLKWCGSSPFYTFL----KTAPKARGELLSYE---QWNIDENSVVSF 397

Query: 283 AS 284
           A 
Sbjct: 398 AG 399


>gi|322691725|ref|YP_004221295.1| hypothetical protein BLLJ_1536 [Bifidobacterium longum subsp.
           longum JCM 1217]
 gi|320456581|dbj|BAJ67203.1| conserved hypothetical protein [Bifidobacterium longum subsp.
           longum JCM 1217]
          Length = 596

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 75/282 (26%), Positives = 119/282 (42%), Gaps = 64/282 (22%)

Query: 4   IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
           IR ++ AGS+Y               D  ++L ++L+  L  A +P     R VI PHAG
Sbjct: 48  IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAWVP-----RAVIVPHAG 101

Query: 50  YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
           Y YSG AAA A+  ++    S++R  ++GP+H       A STA  ++TP+G +P+D   
Sbjct: 102 YIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161

Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
                 L V E L+        A     +++      EH++E+ +P+L  V  G  + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220

Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
           P+  G    +     G +       P     +SSD  H+                H+   
Sbjct: 221 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           ALD   +        D    +L  +    CG +PI+  L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298


>gi|163785595|ref|ZP_02180153.1| hypothetical protein HG1285_07278 [Hydrogenivirga sp. 128-5-R1-1]
 gi|159879135|gb|EDP73081.1| hypothetical protein HG1285_07278 [Hydrogenivirga sp. 128-5-R1-1]
          Length = 250

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 64/252 (25%), Positives = 107/252 (42%), Gaps = 33/252 (13%)

Query: 41  RGVIAPHAGYSYSGRAAAYA---FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTP 97
             VI+PHAGY YSG+ A Y+   F N+D     +V L+GPSH       +      ++TP
Sbjct: 22  EAVISPHAGYIYSGQTATYSYKQFLNLDKNKHYKVLLIGPSHFVPFNGISFGYYDYWETP 81

Query: 98  IGDLPLDLEVIEE--LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
           +G + +D E IE+  +K        +    + EHS+E+ +P+L  V E     I+P++ G
Sbjct: 82  LGKVKVDKEEIEKFAMKNPDLPITFNTIPHQREHSLEVQVPFLQMVLED--FSIIPVVYG 139

Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDK 214
            ++     +  ++ A    D  +   V S+D  H+                +++   +DK
Sbjct: 140 QIDFR---IVEKIIADIKGDRDDVVVVISTDLSHYYP--------------YETARQIDK 182

Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
             +  +E  D    +K         CG   ++  +        K K K L Y  S     
Sbjct: 183 YCLLAVEEEDVSYLQK------CEACGEIGMAAIIDWASKA--KWKSKVLDYRTSGDTAG 234

Query: 275 KRDSSVSYASAA 286
            +D+ V Y SA 
Sbjct: 235 NKDAVVGYLSAV 246


>gi|255513873|gb|EET90138.1| protein of unknown function DUF52 [Candidatus Micrarchaeum
           acidiphilum ARMAN-2]
          Length = 278

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/289 (24%), Positives = 119/289 (41%), Gaps = 26/289 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAF 61
           +R    +G++Y      LA+ ++  +  A +       V+ ++APHAGY YSG  AAY++
Sbjct: 1   MRAMQFSGTFYASGKSELAKFIEEAVSHADVKAKHGSAVKAIVAPHAGYVYSGSIAAYSY 60

Query: 62  GNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
             +    +        +LGP+H       A+S    + TP G +  D ++ + +  + K+
Sbjct: 61  KALMEDREHGKPDTAIILGPNHTGLGEPIAVSMED-WATPFGTVKNDKKLSKAIIGSSKY 119

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
              D      EHS+E+ LP+L  +F         I  G  + +   +  +       +  
Sbjct: 120 ISEDESAHSEEHSIEVQLPFLKYLFPD--TAACFICQGDQSLDASILLSKAVIAAAKESG 177

Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
              +V  SSDF          HY+       K     D   +  IE  D D F   +LE 
Sbjct: 178 KRIAVIASSDF---------NHYEPASAAKEK-----DSRLLAAIEKLDSDEFNSSVLET 223

Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            ++ CG  PI+V +      +   K   LRY  S +      S V+Y++
Sbjct: 224 GDSACGIGPITVAME-YSKANGVPKATLLRYGNSGEATMDYSSVVAYSA 271


>gi|78778042|ref|YP_394357.1| hypothetical protein Suden_1848 [Sulfurimonas denitrificans DSM
           1251]
 gi|78498582|gb|ABB45122.1| Protein of unknown function DUF52 [Sulfurimonas denitrificans DSM
           1251]
          Length = 262

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 68/143 (47%), Gaps = 5/143 (3%)

Query: 5   RRASHAGSWYTDNS---KRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           R  S  GS+Y   +   +R  E       E  +    + R VI PHAGY YSG +A  A+
Sbjct: 3   REMSVVGSFYPARAVELERYFEHFSTTYDEENILPDIKSRVVIVPHAGYIYSGYSANVAY 62

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +  + + +  ++GPSH       +L   + Y+TP G +P  L+++EEL  T  F L  
Sbjct: 63  RVLKKSGVKKFLVIGPSHRVGFEGISLGDFSSYETPFGAIPASLDLVEELSNT--FLLSC 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEG 144
                 EHS E+  P++    EG
Sbjct: 121 YRDTHFEHSTEVQFPFIKYYIEG 143


>gi|288560176|ref|YP_003423662.1| phosphomevalonate decarboxylase [Methanobrevibacter ruminantium M1]
 gi|288542886|gb|ADC46770.1| phosphomevalonate decarboxylase [Methanobrevibacter ruminantium M1]
          Length = 282

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 69/296 (23%), Positives = 122/296 (41%), Gaps = 30/296 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEEL-DGWLREAG---------LPKSPEVRGVIAPHAGYSYS 53
           IR A  AG++Y +N K L E + D +    G         + K  EV  ++ PHAGY YS
Sbjct: 2   IREAVVAGTFYENNVKYLRESIEDCFTHRLGPGEIPKLSRVNKEKEVNAIMVPHAGYVYS 61

Query: 54  GRAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           G  AA+A+  +        F ++ P+H  +    ++     +  P G   +D E+   + 
Sbjct: 62  GPTAAHAYSKLVQDGYPETFVIIAPNHTGFGEHVSIFNEGSWIVPNGVADVDDELANAII 121

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
               +   D      EHS+E+ +P L K F+ +  KIVPI +   + +          + 
Sbjct: 122 NQSNYAKADFLAHRNEHSIEVQIPLL-KYFDSNF-KIVPICMMDQSPQASIDLANSIFEA 179

Query: 173 VDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
            +D +   ++  S+D  H+ S+              K+IE  D +  + +  G+ +   +
Sbjct: 180 ANDLNRKITLIDSTDLSHFKSQ-------------EKTIEH-DNLVFNEVINGNTEGLYQ 225

Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            + +   T+CG  P  V +      + K   + L++  S        S V Y S  
Sbjct: 226 VVKKEQITMCGYGPTMVSMEYCKKLNQK-NFELLQHSTSGDITLDYASVVGYGSGV 280


>gi|88813479|ref|ZP_01128714.1| hypothetical protein NB231_00440 [Nitrococcus mobilis Nb-231]
 gi|88789269|gb|EAR20401.1| hypothetical protein NB231_00440 [Nitrococcus mobilis Nb-231]
          Length = 264

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 122/287 (42%), Gaps = 33/287 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           K+R  + A  +Y   ++ L   L   + R    P +P  + +I PHAGY +SG  AA A+
Sbjct: 4   KVREPAVAELFYPGEAESLRRLLGELMSRPVAEPHTP--KALIVPHAGYVFSGSTAAAAY 61

Query: 62  GNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
             I P    I RV LLGP+H       A  T   + TP+G++ LD   +  L        
Sbjct: 62  ARIAPAHARIERVVLLGPAHRVRFAGLAAHTGDAFLTPLGEVALDGSALGRLADLAYVSF 121

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
            D    E EHS+E+ +P+L  + +    ++VP+ VG  +AE  A    +  +        
Sbjct: 122 SDKAHAE-EHSLEVQVPFLQYLLDD--FRLVPLAVGDASAEAVAA---VLERLWGGAETL 175

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
             +S+D  H+        +D      +++   LD    + I    P+A       Y +  
Sbjct: 176 VVISTDLSHY--------HD------YETARRLDNRTAENILGKRPEAIG-----YQDA- 215

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           CGR P+   L +      +++I+ LR   S      R   V Y + A
Sbjct: 216 CGRTPLVGLLEL--ARRRELRIEQLRLCSSGDTAGNRQRVVGYGAFA 260


>gi|119897971|ref|YP_933184.1| hypothetical protein azo1680 [Azoarcus sp. BH72]
 gi|119670384|emb|CAL94297.1| conserved hypothetical protein [Azoarcus sp. BH72]
          Length = 274

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 94/200 (47%), Gaps = 25/200 (12%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAA 58
           +  +R A+ AG +Y  +   L  ++ G +  + +P +     + ++ PHAGY YSG  AA
Sbjct: 3   IASVRPAAVAGQFYPADDLVLRTQI-GEMLSSAVPLNVVYAPKAIVVPHAGYIYSGPVAA 61

Query: 59  YAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
            AF  + P    ISRV LLGP+H       AL  A  + TP+G++         L     
Sbjct: 62  SAFAAVAPLRREISRVVLLGPAHRMAVDGFALPAAQSFATPLGEV--------RLSRPDW 113

Query: 117 FELMD---ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
             L D   + VD+     EH +E+ LP+L  V E    ++VP+LVG  ++E+ A    L 
Sbjct: 114 LTLQDHPGVVVDDRPHAFEHCLEVQLPFLQTVLES--FELVPLLVGNASSEDTAA---LL 168

Query: 170 AKYVDDPSNFFSVSSDFCHW 189
                       +SSD  H+
Sbjct: 169 ETLWGGHETLVVISSDLSHY 188


>gi|218886422|ref|YP_002435743.1| hypothetical protein DvMF_1326 [Desulfovibrio vulgaris str.
           'Miyazaki F']
 gi|218757376|gb|ACL08275.1| protein of unknown function DUF52 [Desulfovibrio vulgaris str.
           'Miyazaki F']
          Length = 282

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 63/238 (26%), Positives = 99/238 (41%), Gaps = 26/238 (10%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTS 68
           AG +Y D+   L   +   L  A    +P +  ++ PHAGY YSG  A    G  + P +
Sbjct: 19  AGMFYPDDPDLLRHTVREHLAAAPREAAPTLLAMV-PHAGYVYSGEVAGQTLGAANLPDT 77

Query: 69  ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAE 128
           I    LLGP+H       ++     + TP+G +P+D E+ + L  +      D      E
Sbjct: 78  I---VLLGPNHTGLGAPLSVWPGGHWHTPLGPVPVDEELADTLAESDAGFTRDTAAHMRE 134

Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVG-----AVNAENEAMYGRLFAKYVDDPSNFFSVS 183
           HS+E+ LP+L +      + IVP+ V      A+    +A+ G L             VS
Sbjct: 135 HSIEVVLPFLQEYTPD--LTIVPVAVAEPHPQALREAAQALAGALRRLEAAGRRVAMVVS 192

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           SD  H+              V H     LD++ +  IE  DP      + +   ++CG
Sbjct: 193 SDMSHY--------------VTHADAARLDRLALARIEALDPLGLYAIVRDRGISMCG 236


>gi|347732159|ref|ZP_08865241.1| memo-like family protein [Desulfovibrio sp. A2]
 gi|347518985|gb|EGY26148.1| memo-like family protein [Desulfovibrio sp. A2]
          Length = 282

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 117/283 (41%), Gaps = 33/283 (11%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTS 68
           AG +Y D+   L  E+   L  A    +P +  ++ PHAGY YSG  A    G  + P +
Sbjct: 19  AGMFYPDDPDLLRHEVRQHLAAAARDAAPTLLAMV-PHAGYVYSGEVAGQTLGAANLPDT 77

Query: 69  ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAE 128
           I    LLGP+H       ++     + TP+G +P+D E+ + L  +      D      E
Sbjct: 78  I---VLLGPNHTGLGAPLSVWPGGNWHTPLGPVPVDEELADALAESDAGFTRDTAAHVRE 134

Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA---KYVDDPSNFFS--VS 183
           HS+E+ LP+L +      + IVP+ V   + +      +  A   + V D     +  VS
Sbjct: 135 HSIEVILPFLQEYTPD--LSIVPVAVAEPHPQTLREAAQTLADVLRRVTDAGRRVAMVVS 192

Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
           SD  H+              V H      D++ +  IE  DP      + +   ++CG  
Sbjct: 193 SDMSHY--------------VTHAEAARRDRLALARIEALDPLGLYATVRDRGISMCGVL 238

Query: 244 PISVFLH---MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           P ++ L     LG  + ++    + Y  S +     +  V YA
Sbjct: 239 PATLGLFAALALGAQTARV----VAYATSGEVSGDMERVVGYA 277


>gi|150021612|ref|YP_001306966.1| hypothetical protein Tmel_1742 [Thermosipho melanesiensis BI429]
 gi|149794133|gb|ABR31581.1| protein of unknown function DUF52 [Thermosipho melanesiensis BI429]
          Length = 262

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 130/292 (44%), Gaps = 50/292 (17%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAA------A 58
           R  + A  +Y  + ++L +  + +  +  L  S    G+I PHAGY YSG+ A      +
Sbjct: 3   RTMTFADKFYPKSPEKLKKFFEVYAEKVKLEFSNNNFGMILPHAGYVYSGKTALLTLKES 62

Query: 59  YAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           + FG  +     RV +LG +H       C++     ++TP G++ +D ++ E+L    KF
Sbjct: 63  FRFGKPE-----RVIILGTNHTGTGIGICSVWERGEWETPFGNIAVDEKITEKLL---KF 114

Query: 118 ELM--DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL--FAKYV 173
           ++   D      EHS+E+ LP+L K + G  IK+VPI+    N  +     R+    K +
Sbjct: 115 DIFVSDYAAHTMEHSIEVILPFL-KYYYGD-IKLVPIV---YNFPSHLYTMRIVEILKKL 169

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDK--KHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
           +  S     SSD  H+      M  D+     ++ +++E L K+   I            
Sbjct: 170 ELKSTLIIASSDLNHYDPHDITMKKDEIFIENILKQNVEELFKLSKKI------------ 217

Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
                 T CG  PI+V L+   N      +K + +  S++     + +V YA
Sbjct: 218 ------TACGTGPIAVLLNYFSN------VKLVYHTTSAEFSNDYNFTVGYA 257


>gi|220905130|ref|YP_002480442.1| hypothetical protein Ddes_1868 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
 gi|219869429|gb|ACL49764.1| protein of unknown function DUF52 [Desulfovibrio desulfuricans
           subsp. desulfuricans str. ATCC 27774]
          Length = 298

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 75/312 (24%), Positives = 132/312 (42%), Gaps = 51/312 (16%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-----------PEVRGVIAPHAGYS 51
            IR    AG +Y  ++++L +E+  WL   G+P++             + G++ PHAGY 
Sbjct: 2   SIRHPIAAGRFYPADAEQLKKEIRMWLMSGGVPENLVPGEIPREANARLLGLMLPHAGYV 61

Query: 52  YSGRA------AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL 105
           Y GR       +A+           R+F+L P+H        + T   + TP+G + +D 
Sbjct: 62  YCGRVIGGTLTSAWDGSTAGANLPERLFILCPNHTGQGRALGVWTGGHWLTPLGPVVVDE 121

Query: 106 EVIEEL-KATGKFELMDICVDEAEHSMEMHLPYLAKV----FEGHLIKIVPILVGAVNAE 160
           ++ + L +A G F   ++C   +EHS+E+ LP+L  +      G    IVP+ VG  + +
Sbjct: 122 KLGQALVQAAGGFFEDELC-HVSEHSIEVLLPFLQSLPLPRESGR--SIVPVCVGTRSPD 178

Query: 161 NEAMYGRLFAKYV-----DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
                G   A+ +     +  S    VSSD           HY  +   +HK     D +
Sbjct: 179 ALRAAGLALARVIQAFEAEGHSVGIVVSSDL---------NHYQSQELTLHK-----DAL 224

Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQC 272
            +  +   DPD     +     T+CG   +++ L   H +G+   ++ +    Y+ S+  
Sbjct: 225 ALAQVLACDPDGLLTVVERESITMCGAGAMALALFTAHAMGSPWAELTL----YDTSAAA 280

Query: 273 KTKRDSSVSYAS 284
                  V YA 
Sbjct: 281 SGDTSRVVGYAG 292


>gi|237757257|ref|ZP_04585659.1| conserved hypothetical protein [Sulfurihydrogenibium yellowstonense
           SS-5]
 gi|237690581|gb|EEP59787.1| conserved hypothetical protein [Sulfurihydrogenibium yellowstonense
           SS-5]
          Length = 121

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 63/112 (56%), Gaps = 6/112 (5%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAY 59
           M  +R+ + A  +YT + +RL   +  +L +A L    PE  G++ PHAGY YSG  AA 
Sbjct: 1   MLTVRKPAVANMFYTGDKERLLYTIKNYLNKAPLYDYVPE--GIVVPHAGYMYSGPVAAV 58

Query: 60  AFG---NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           ++    N+DP    ++ L+GPSHH Y    +      ++TP+G + ++ E+I
Sbjct: 59  SYKQLLNLDPNKHYKILLIGPSHHVYFNGVSYGFYDYWETPLGKVKVNKEMI 110


>gi|325971983|ref|YP_004248174.1| hypothetical protein [Sphaerochaeta globus str. Buddy]
 gi|324027221|gb|ADY13980.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Sphaerochaeta globus str. Buddy]
          Length = 288

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 70/254 (27%), Positives = 111/254 (43%), Gaps = 23/254 (9%)

Query: 4   IRRASHAGSWYTDNSKRLA----EELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           IR+A  AGSWY  + + L     E ++  L+       P  R  + PHAG  YS +A   
Sbjct: 8   IRKAHFAGSWYPADKQTLRALINESIEVVLKNTTYQHCP-YRFAVLPHAGLFYS-KAGIA 65

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATV--YKTPIGDLPLDLEVIEELKATGKF 117
            F   D +++ R+ +L PSH+    + AL +A +   +TP+G L    E      A  K+
Sbjct: 66  PFFAADLSNVQRLVVLAPSHYANLSQDALVSAPLDGIETPLGML----ESQNLSSAQAKY 121

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI----LVGAVNAENEAMYGRLFAKYV 173
                   ++EH++EM LPY+A +     + +  +       AV    +A+   L  + +
Sbjct: 122 ----FSAIQSEHALEMVLPYIASLSRPPTVTLALVSHFSQPQAVKTIADALVAELGEQEI 177

Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
                    SSDF H+G RF Y  Y  + G   K  E    +   + E    +AF  +  
Sbjct: 178 QAGRTALIASSDFTHYGPRFGYTPY--QSGAPLKVREDDLALSHLLCEGRIEEAF-AFCS 234

Query: 234 EYDNTICGRHPISV 247
              +T+CG  P  V
Sbjct: 235 SKRSTVCGYAPAMV 248


>gi|11499891|ref|NP_071135.1| hypothetical protein AF2310 [Archaeoglobus fulgidus DSM 4304]
 gi|7388528|sp|O27974.1|Y2310_ARCFU RecName: Full=MEMO1 family protein AF_2310
 gi|2648211|gb|AAB88947.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 261

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 70/250 (28%), Positives = 112/250 (44%), Gaps = 28/250 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           +R    AGS+Y  N     E L   LRE   P K   V   ++PHAGY YSGR A     
Sbjct: 1   MRHPRVAGSFYPANP----ESLLAMLREYTYPAKDESVIACVSPHAGYVYSGRTAG-KVH 55

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           ++ P + + V ++GP+H  Y    A+ST T + TP+G++ +D E +E +         D 
Sbjct: 56  SLLPDAETFV-IVGPNHTGYGLPVAVSTDT-WLTPLGEVEVDTEFVEAMPKI--ITAPDE 111

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFF 180
                EHS+E+ +P+L  + +    KIVPI +G  + E   E     L A+         
Sbjct: 112 IAHRYEHSLEVQVPFLQYLHDD--FKIVPICLGMQDEETAMEVAEEILTAERETGRKVVV 169

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSD  H+              +  +    LD + +D I + D   + + +     ++C
Sbjct: 170 IASSDMHHY--------------LPDEECRRLDSIVIDAILSMDVKKYYETIYRLQASVC 215

Query: 241 GRHPISVFLH 250
           G   I+V ++
Sbjct: 216 GYGCIAVAMY 225


>gi|389843600|ref|YP_006345680.1| dioxygenase [Mesotoga prima MesG1.Ag.4.2]
 gi|387858346|gb|AFK06437.1| putative dioxygenase [Mesotoga prima MesG1.Ag.4.2]
          Length = 275

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 93/215 (43%), Gaps = 25/215 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELD--------------GWLREAGLPKSPEVRGVIAPHAG 49
           IRR   +G +Y DN + L  ++               GW +  G        G+I PHAG
Sbjct: 2   IRRPVVSGKFYADNEEELKSQIRNCFFHPVGPGSLPHGWEKNEG------PIGLIVPHAG 55

Query: 50  YSYSGRAAAYAFGNIDP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           Y  SG  AA+++         S V ++GP+H     K +L     ++TP+G + +D E  
Sbjct: 56  YMASGPVAAWSYWRAGALVKPSTVLIVGPNHTGLGTKLSLWLEGAWETPLGSVEIDHEFG 115

Query: 109 EELKATGKFELM-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
             L A     +M D      EHS+E+ LP+L  ++     KIVP+++     E       
Sbjct: 116 RSLMAESNGMIMPDTSAHLYEHSIEVQLPFLQFLYSD--FKIVPLIMTDQRVEVALSLSE 173

Query: 168 LFAKYVDDPSNFFSV-SSDFCHWGSRFNYMHYDKK 201
           +  K +   ++   + SSD  H+ +    M  D +
Sbjct: 174 IIKKELSRRNDVLVIASSDLNHYEAHEVTMEKDNE 208


>gi|225849677|ref|YP_002729911.1| hypothetical protein PERMA_0113 [Persephonella marina EX-H1]
 gi|225645192|gb|ACO03378.1| conserved hypothetical protein [Persephonella marina EX-H1]
          Length = 269

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 97/195 (49%), Gaps = 18/195 (9%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYA-- 60
           +R  + + ++Y  N + L + +  +L  A L    PE   V++PHAGY YSG  AA    
Sbjct: 5   VREPAVSNAFYPGNPEELRDLIRHYLSMAPLYDIKPE--AVVSPHAGYIYSGEVAAVGYK 62

Query: 61  -FGNIDPTSISRVFLLGPSHHYYTPKCALSTA--TVYKTPIGDLPLDLEVIEEL--KATG 115
            F N+D      + L+GPSH  Y P   +S      ++TP+G++ ++ +VIE+   +   
Sbjct: 63  QFLNLDRDKHYNILLIGPSH--YVPFAGISFGYYDFWETPLGEVRVNKKVIEDFVYRNPD 120

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
               ++      EHS+E+ +P+L  V +     I+P++ G V+ +   +  R+     DD
Sbjct: 121 IPVTLNTLPHSKEHSLEVQVPFLQVVLDN--FSIIPVIYGHVDYK---VLERVIDFIKDD 175

Query: 176 PSNFFSV-SSDFCHW 189
             +   V SSD  H+
Sbjct: 176 RDDVIVVISSDLSHY 190


>gi|251772877|gb|EES53436.1| conserved hypothetical protein [Leptospirillum ferrodiazotrophum]
          Length = 269

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 58/199 (29%), Positives = 92/199 (46%), Gaps = 15/199 (7%)

Query: 1   MEKIRRASHAGSWYTDNS----KRLAEELDGWLREAGLPK-SPE-VRGVIAPHAGYSYSG 54
           M   R  + AG++Y ++     +R+A  LD       LP+  P+ +R ++ PH     +G
Sbjct: 1   MTPERPPALAGNFYPEDPLELERRIAALLDSP-PPGNLPRLDPDRIRAIVVPHGDLLRAG 59

Query: 55  --RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
             +A AY          +R+ LLGP H   +    L T   ++ P G  P+D   I +L 
Sbjct: 60  EVQAIAYRLLQKRKKPPARILLLGPLHTGSSYGIVLPTHPSFRIPTGAFPVDRTTIRQLA 119

Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
           A  +    D    E EHS+E  LP+L  ++ G  I IVP  VG  +     ++ R+    
Sbjct: 120 AFAETLFSDES-HEFEHSLETQLPFLMGIWGG--IPIVP--VGYSDITAHVLF-RVLEPL 173

Query: 173 VDDPSNFFSVSSDFCHWGS 191
           ++DP      S+DF H+ S
Sbjct: 174 MNDPETVTICSADFSHYFS 192


>gi|386812428|ref|ZP_10099653.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386404698|dbj|GAB62534.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 267

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 136/283 (48%), Gaps = 25/283 (8%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  + AGS+Y+ ++ +L  +++ +  + G  K P V G+I+PHAGY YSGR A   +  
Sbjct: 2   IREPAVAGSFYSGDAIQLKRDIEDFSIK-GCEKQP-VLGIISPHAGYIYSGRVAGNLYSR 59

Query: 64  ID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           I+ P +   V +L P+H  Y    ++     + TP+G++ +D EV+EEL         D 
Sbjct: 60  IEIPNT---VVILAPNHTGYGVPYSVWPDGSWNTPLGNVKVDEEVVEELVQASDLIKKDK 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
               +EH+ E+ +P++   +    +KIV I++ + +  N    G+  ++ +    P    
Sbjct: 117 EAHFSEHAAEVQIPFIQ--YFNPSVKIVVIVISSGDIINLKDIGKNLSRVLQKLRPDALV 174

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSD  H+ S+ +    DK    IH+ I +L++  +        + + K++     T+C
Sbjct: 175 VASSDMTHYESQTSANRKDKI--AIHE-ILSLNESNLYY------EVYSKHI-----TMC 220

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
           G +P    L    +   K   + +RYE S       +  V YA
Sbjct: 221 GIYPTIAMLVCSKDRGAK-NAELVRYETSGDITGDYNQVVGYA 262


>gi|386346075|ref|YP_006044324.1| hypothetical protein [Spirochaeta thermophila DSM 6578]
 gi|339411042|gb|AEJ60607.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Spirochaeta thermophila DSM 6578]
          Length = 293

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 84/196 (42%), Gaps = 7/196 (3%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-GVIAPHAGYSYSGRAAAYA 60
           E +R     G +Y D +  L   LD  L        P     +I PHA + Y G     A
Sbjct: 13  ELVREPIVEGIFYPDAADTLEARLDALLSRPSPESPPPRPFALIVPHATWDYVGDLLGAA 72

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           F ++ P      RV L GP+H    P   L  +T++ TP+G L LDL   +E  A+    
Sbjct: 73  FAHVLPWREHFRRVVLWGPTHREPFPTLFLPGSTLFATPLGPLALDLPATQEALASSTAF 132

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA-VNAENEAMYGRLFAKYVDD-P 176
            +D      EH +E+  P+L  +       I+P+LVGA +    E +   L+  Y  +  
Sbjct: 133 AVDDLSHMEEHCLEVVFPFLRSILPH--APILPVLVGAPLLTLAETLARALYLIYATEWE 190

Query: 177 SNFFSVSSDFCHWGSR 192
              F VS++   +  R
Sbjct: 191 HTLFVVSTNLTPFAPR 206


>gi|307718050|ref|YP_003873582.1| dioxygenase [Spirochaeta thermophila DSM 6192]
 gi|306531775|gb|ADN01309.1| putative dioxygenase [Spirochaeta thermophila DSM 6192]
          Length = 293

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 84/196 (42%), Gaps = 7/196 (3%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-GVIAPHAGYSYSGRAAAYA 60
           E +R     G +Y D +  L   LD  L        P     +I PHA + Y G     A
Sbjct: 13  ELVREPIVEGIFYPDAADALEARLDALLSRPSPESPPPRPFALIVPHATWDYVGGLLGTA 72

Query: 61  FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           F ++ P      RV L GP+H    P   L  +T++ TP+G L LDL   +E  A+    
Sbjct: 73  FAHVVPWREHFRRVVLWGPTHREPFPSLFLPASTLFATPLGPLSLDLPATQEALASSTAF 132

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA-VNAENEAMYGRLFAKYVDD-P 176
            +D      EH +E+  P+L  +       I+P+LVGA +    E +   L+  Y  +  
Sbjct: 133 AVDDLSHMEEHCLEVVFPFLRSILPH--APILPVLVGAPLFTLVETLARALYLIYAAEWE 190

Query: 177 SNFFSVSSDFCHWGSR 192
              F VS++   +  R
Sbjct: 191 HTLFVVSTNLTPFAPR 206


>gi|150399448|ref|YP_001323215.1| hypothetical protein Mevan_0697 [Methanococcus vannielii SB]
 gi|166228835|sp|A6UQ31.1|Y697_METVS RecName: Full=MEMO1 family protein Mevan_0697
 gi|150012151|gb|ABR54603.1| protein of unknown function DUF52 [Methanococcus vannielii SB]
          Length = 284

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/249 (26%), Positives = 106/249 (42%), Gaps = 22/249 (8%)

Query: 42  GVIAPHAGYSYSGRAAAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIG 99
           GVI+PHAGY YSG  AA+++  I       +   ++GP+H       + +   ++KTP+G
Sbjct: 48  GVISPHAGYVYSGPIAAHSYKEISKKVSGNITAVIIGPNHSGMGSVVS-TMEGIWKTPLG 106

Query: 100 DLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAV 157
           +L +D E  E L        +D      EHS+E+ LP+L   ++      K VPI +   
Sbjct: 107 NLEIDNEFSERLWKECDIIDLDETAHLKEHSIEVQLPFLKHLELLNIAKFKFVPISMSLQ 166

Query: 158 NAENEAMYGRLFAKYVDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
           + +     G + AK   + +       SSDF H+          KK  +I K        
Sbjct: 167 DYDTAVGVGYMVAKVAKELNRKIIIIASSDFSHYEPE---EIASKKDAIIIK-------- 215

Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTK 275
             DI++  + + F   ++  + T+CG  PI   +  +     K + K L Y  S      
Sbjct: 216 --DILKMEEEEIFTD-VVTNNVTMCGYGPIIAMIKAMKVLGAK-ESKLLSYSTSGDVTKD 271

Query: 276 RDSSVSYAS 284
               V Y +
Sbjct: 272 YSEVVGYGA 280


>gi|307106373|gb|EFN54619.1| hypothetical protein CHLNCDRAFT_135146 [Chlorella variabilis]
          Length = 337

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 85/202 (42%), Gaps = 33/202 (16%)

Query: 72  VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSM 131
           V +LG +H    P   LS+A  ++TP+G  PL+ E+  +L A G    +D      EHS+
Sbjct: 112 VLILGTNHFTALPPACLSSAAAWRTPLGAAPLEPELCRQLAAAGL--PLDDAPHNLEHSI 169

Query: 132 EMHLPYLAKVF-------EGHLIKIVPILVGAVNAENEAMYGRL-------FAKYVDDPS 177
           E  LP+L  V              I P+ VG +   +EA  GRL        A+Y     
Sbjct: 170 ENQLPFLQHVLGSPGPSPGARPFAIAPVCVGWLG--SEAAAGRLARQLAAAVARYEQRRQ 227

Query: 178 N--FFSVSSDFCHWGSRFN------YMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
                  +SDF H G  +       +M   ++H V        D   +  I  G P  F 
Sbjct: 228 RQVLLIATSDFTHGGPSYGEAPAAPWMSL-REHCVWR------DSPLLHAICGGSPRGFL 280

Query: 230 KYLLEYDNTICGRHPISVFLHM 251
           +       ++CGR P++VF+ +
Sbjct: 281 QACDLVGASLCGRWPVAVFMQV 302


>gi|197121665|ref|YP_002133616.1| hypothetical protein AnaeK_1254 [Anaeromyxobacter sp. K]
 gi|196171514|gb|ACG72487.1| protein of unknown function DUF52 [Anaeromyxobacter sp. K]
          Length = 265

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/138 (32%), Positives = 67/138 (48%), Gaps = 6/138 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R A+ AG++Y   +  LA E+DGWL             V+ PHAGY YSG  A   F  
Sbjct: 2   VREAAVAGAFYDARAATLAAEVDGWLSAGA--APAPALAVMVPHAGYVYSGAVAGATFAR 59

Query: 64  IDPTSISRVFLLGPSHHYYT-PKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +     +R  +LGP+H        AL  A  ++TP+G +P+D E +    A       D 
Sbjct: 60  V--AVPARAIVLGPNHTGLGHAGAALWPAGAWRTPLGAVPVDPE-LTAALAAAPGVAADR 116

Query: 123 CVDEAEHSMEMHLPYLAK 140
                EHS+E+ +P+L +
Sbjct: 117 LAHLREHSLEVEVPFLQR 134


>gi|254456740|ref|ZP_05070168.1| conserved hypothetical protein [Sulfurimonas gotlandica GD1]
 gi|373867875|ref|ZP_09604273.1| protein containing DUF52 [Sulfurimonas gotlandica GD1]
 gi|207085532|gb|EDZ62816.1| conserved hypothetical protein [Sulfurimonas gotlandica GD1]
 gi|372469976|gb|EHP30180.1| protein containing DUF52 [Sulfurimonas gotlandica GD1]
          Length = 261

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 96/222 (43%), Gaps = 16/222 (7%)

Query: 5   RRASHAGSWYTDNS---KRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           R  S AGS+Y       +R  E  +    E       + R VI PHAGY YSG  A  A+
Sbjct: 3   RMMSVAGSFYPAREVEIERYFEHFNTVYEENFTLPDVKSRAVIVPHAGYIYSGYTANIAY 62

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
             +  + + +  ++GPSH       ++   T Y+TP  DL   ++V + L+   KF L  
Sbjct: 63  RVLQRSDVRKFVVIGPSHRVAFNGVSMCDFTTYETPFEDLKAAIDVAQRLRE--KFSLTR 120

Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
                 EHS E+  P++   F     KIV ++     A NE      F    +D      
Sbjct: 121 FENAHQEHSTEVQFPFIK--FYMPDAKIVELVYSGAGA-NEISNIIDFVLSQNDCGVI-- 175

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
           +S+D  H+   +N    +K   +    +EA+DK+    + TG
Sbjct: 176 ISTDLSHF---YNLEDANKLDNI---CLEAVDKLDTQKLHTG 211


>gi|220916461|ref|YP_002491765.1| hypothetical protein A2cp1_1355 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219954315|gb|ACL64699.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 265

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/138 (32%), Positives = 67/138 (48%), Gaps = 6/138 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  + AG++Y   +  LA E+DGWL             V+ PHAGY YSG  A   F  
Sbjct: 2   VREPAVAGAFYDARAATLAAEVDGWLSAGA--APAPALAVMVPHAGYVYSGAVAGATFAR 59

Query: 64  IDPTSISRVFLLGPSHHYYT-PKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           +     +RV +LGP+H        AL  A  ++TP+G +P+D E +    A       D 
Sbjct: 60  V--ALPARVIVLGPNHTGLGHAGAALWPAGAWRTPLGSVPVDPE-LTAALAAAPGVAADR 116

Query: 123 CVDEAEHSMEMHLPYLAK 140
                EHS+E+ +P+L +
Sbjct: 117 LAHLREHSLEVEVPFLQR 134


>gi|303249056|ref|ZP_07335299.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
 gi|302489521|gb|EFL49463.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
          Length = 401

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 75/279 (26%), Positives = 108/279 (38%), Gaps = 31/279 (11%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           +  R    AG  Y DN + L   LD  L  A    P S  +  VIAPH          A 
Sbjct: 108 QDTREPLFAGPAYPDNPELLVPYLDAILAGAPEASPGSGRILAVIAPHIDPEAGKAGYAA 167

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
           A+  +   S  RV +LG  H        L T   Y TP+GD+P D   +  L+  G    
Sbjct: 168 AYAPLRGLSPKRVVVLGVGHQILRGLYCL-TDKAYGTPLGDIPADAAAVARLRRAGG--- 223

Query: 120 MDICVDEA------EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
              CVD        EHS+E    +L  V  G    +VP+L G+  +   A+    F  Y 
Sbjct: 224 --ACVDPGDLQHRDEHSVEFQAVFLRHVLAGDF-SLVPVLCGSPRSVLPALSRSAFLDYA 280

Query: 174 D-----------DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
                       DP   F    DF H G +F    +DK    + ++  A D   +  +  
Sbjct: 281 GPFLDALRGMAADPDTLFVAGVDFSHIGPKFG---HDKPALELEQAAMAHDAALLAALAA 337

Query: 223 GDPDAFKKYLLEYDN--TICGRHPISVFLHMLGNCSTKI 259
           GDP+AF       D+   +CG   ++    ++  C+  +
Sbjct: 338 GDPEAFWAESARVDDGYNVCGLASLATLAEIVPACALTV 376


>gi|145524770|ref|XP_001448212.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415746|emb|CAK80815.1| unnamed protein product [Paramecium tetraurelia]
          Length = 79

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 32/73 (43%), Positives = 47/73 (64%), Gaps = 4/73 (5%)

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM----LGNCSTKIKIKFLRYEQSSQC 272
           M+ IE  D D F  YL EY+N +CG+H I++ LH+    +   +  ++ KF+RY QS   
Sbjct: 1   MEHIELHDLDKFNDYLREYENNVCGKHCIAILLHVICIAMSQNTHMMETKFIRYAQSCLV 60

Query: 273 KTKRDSSVSYASA 285
           + K+DSSVSYA+A
Sbjct: 61  RDKKDSSVSYAAA 73


>gi|440302594|gb|ELP94901.1| hypothetical protein EIN_249450 [Entamoeba invadens IP1]
          Length = 284

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 63/246 (25%), Positives = 101/246 (41%), Gaps = 26/246 (10%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA--GLPK-SPEVRGVIAPHAGYSYSGRAAAYAF 61
           R  +  G W+  ++  L  E++G++++A   LPK   +V G +APHAG+ YSG  A ++ 
Sbjct: 7   RHTAGDGRWFQSSTNGLKNEVNGYIQDAYKALPKLEGKVLGSVAPHAGFRYSGPTAGFSV 66

Query: 62  GNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
             +   S +      VF+LG SH  +    A+      K+PI    +D + I        
Sbjct: 67  AALLKDSETNGKPDVVFILGFSHSAHFEYAAIMDGAAIKSPICTSKIDTDAINIFMKDRP 126

Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
              +       EHS E  +P++   F     K+V +L+G   A+          +     
Sbjct: 127 NMKLKYNPHNGEHSAENEIPFVQVAFPE--TKVVMVLIGTHKAQVFKEVSDGLLEVSKTR 184

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
             +   SSD  H    FN              +E  D++  D+ E  D D   K    Y+
Sbjct: 185 KMYVVASSDMLH-DEDFNL-------------VEKTDRVTADLTENVDFDGLLKN-WSYE 229

Query: 237 NTI-CG 241
           N I CG
Sbjct: 230 NQIFCG 235


>gi|119620876|gb|EAX00471.1| chromosome 2 open reading frame 4, isoform CRA_c [Homo sapiens]
          Length = 82

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 32/74 (43%), Positives = 43/74 (58%), Gaps = 12/74 (16%)

Query: 5  RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAP-----------HAGYSYS 53
          R ASHAGSWYT +  +L  +L+GWL +    K P  R +IAP           HAGY+Y 
Sbjct: 8  REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPRDGVLPCWPGCHAGYTYC 66

Query: 54 GRAAAYAFGNIDPT 67
          G  AA+A+  +DP+
Sbjct: 67 GSCAAHAYKQVDPS 80


>gi|406880605|gb|EKD28916.1| Ammecr1 protein [uncultured bacterium]
          Length = 300

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 6/105 (5%)

Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           VSSDF H+GSRF +  ++     I ++I+ +D      IET D   F  Y  E   TICG
Sbjct: 2   VSSDFTHFGSRFGFKPFNSN---IKENIKNIDFKAFKYIETLDQKGFMDYCHESSITICG 58

Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           ++PIS+ L ML   +   K+  L+Y  S +     ++ VSY S  
Sbjct: 59  KNPISILLAML---NKDFKVTLLKYSTSGELTGDYNNCVSYISCV 100


>gi|118576793|ref|YP_876536.1| dioxygenase [Cenarchaeum symbiosum A]
 gi|118195314|gb|ABK78232.1| dioxygenase [Cenarchaeum symbiosum A]
          Length = 273

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 74/256 (28%), Positives = 103/256 (40%), Gaps = 28/256 (10%)

Query: 35  PKSPEVRGVIA---PHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA 91
           P  PE+ GVI    PHAGY YSG  A +   +I  TS     + GP+H       A   A
Sbjct: 36  PDPPELDGVIGMICPHAGYRYSGPVACHGLLSIRHTSPRLFVMAGPNHWGLGLGIAGIGA 95

Query: 92  TVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVP 151
             + TP G +  D     EL+  G  E  D      EHS+E+ +P L + F      I+P
Sbjct: 96  CRWITPAGYVETDDAGSVELERCGIKE--DFFAHSKEHSLEVIVPMLQEFF--GEFGILP 151

Query: 152 ILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA 211
           IL+     E  A  G   A+      +    SSD  H+ S  N   +++   +    IEA
Sbjct: 152 ILLSEQGEEQAAKVGGAMARAAKGRDSMLIGSSDLTHYES--NAFAHEQDGAL----IEA 205

Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQ 268
           +  + MD ++      F   L E   + CG   I+  +     LG      K   L+Y  
Sbjct: 206 I--LSMDTVK------FYAVLRERKVSACGYGAIAATMAASIELG----ATKGTLLKYAT 253

Query: 269 SSQCKTKRDSSVSYAS 284
           S        S V Y S
Sbjct: 254 SGDITGDEQSVVGYCS 269


>gi|291280086|ref|YP_003496921.1| hypothetical protein DEFDS_1709 [Deferribacter desulfuricans SSM1]
 gi|290754788|dbj|BAI81165.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1]
          Length = 262

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 104/247 (42%), Gaps = 29/247 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R+A+  G +Y D      EEL  +  +       +    I PHAGY YSG+ A     +
Sbjct: 2   LRKAAVKGYFYPDKK----EELIDFFEKNKTESKIDAFLAIVPHAGYIYSGKTAVKTLSS 57

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
           ++     +V L+GP+H  +  + AL     ++TP G + +D E+  +L    + E  DI 
Sbjct: 58  LNLKK--KVLLIGPNHTGFGERVALYPDGEWETPFGFVRVDSELNRKLSLIPEIE-EDII 114

Query: 124 VDEAEHSMEMHLP---YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
               EHS+E+ LP   Y  K      I ++P+           +Y     K + D     
Sbjct: 115 AHVREHSLEVILPILHYFKKDITFSAITMMPLKYNQCVKLASDIY-----KQIKDEDLNI 169

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
            +SSDF H+                 ++ +  D M ++ I + D       + + D ++C
Sbjct: 170 IISSDFNHYEDA--------------ETTDKKDLMAIEKILSLDTKGLYDTVFDNDISMC 215

Query: 241 GRHPISV 247
           G +P  V
Sbjct: 216 GIYPAIV 222


>gi|408381879|ref|ZP_11179426.1| hypothetical protein A994_05481 [Methanobacterium formicicum DSM
           3637]
 gi|407815327|gb|EKF85912.1| hypothetical protein A994_05481 [Methanobacterium formicicum DSM
           3637]
          Length = 280

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 72/298 (24%), Positives = 125/298 (41%), Gaps = 32/298 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLP----KSPEVRGVIAPHAGYSYSG 54
           IR+ + AG +Y  +   L + +  W     L    LP        ++G+IAPHAGY YSG
Sbjct: 2   IRKPAVAGMFYESDEDSLKKRIK-WCYQHKLGPGRLPGKIGSKRSIKGLIAPHAGYVYSG 60

Query: 55  RAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
             AA ++  +    +   V +L P+H       +  T   + TP+G++ +D +   EL  
Sbjct: 61  PVAACSYLELAEDGMPETVVILCPNHTGIGSGLSTMTEGSWLTPLGEVEIDGQFASELLN 120

Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
                  D      EHS E+ LP+L ++      ++VP+ +   + E     G       
Sbjct: 121 YYPLLDDDPSAHLNEHSCEVQLPFLQEISPD--FQLVPVCMMMQDLETSRELGEAITHTA 178

Query: 174 D--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
                      S+DF H+        +D++   + ++I A+D++ M              
Sbjct: 179 QKLKRDTVVIASTDFTHYQPHDVARAHDEE---VLEAIAAMDELEM-----------MNQ 224

Query: 232 LLEYDNTICGRHPISVFLHM-LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
           + +++ T+CG  P++  +    G  +T+  I  L+Y  S        S V YASA  K
Sbjct: 225 IQKFNVTMCGYGPVAATIEASRGMGATEATI--LKYATSGDTGGDYTSVVGYASAIFK 280


>gi|383762695|ref|YP_005441677.1| hypothetical protein CLDAP_17400 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381382963|dbj|BAL99779.1| hypothetical protein CLDAP_17400 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 411

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 108/272 (39%), Gaps = 26/272 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP----EVRGVIAPHAGYSYSGRAAAYA 60
           R  + AG+ Y D+ K L   L  +L  + +   P       G+++PH  Y   GR  A+ 
Sbjct: 113 RPPALAGAGYPDDEKALWSLLQDYLEASDVAADPIDWSAAIGLLSPHIDYPRGGRVYAHV 172

Query: 61  FGNIDPTSISR--VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           +      +     V LLG  H+   P     T   Y TP G LP +  +++ L A    E
Sbjct: 173 WKRAAQAAREAELVILLGVDHYGADPFTL--TRQNYATPYGVLPTETSIVDRLAAVIGEE 230

Query: 119 --LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV-----NAENEAMYGRLFA- 170
                      EHS+E+   +L  + EG  + +VPIL G+      N +  A    + A 
Sbjct: 231 AAFAGELRHRGEHSLELVAIWLHHLREGRPVPVVPILTGSFYPFMRNGDTPAADPTIGAV 290

Query: 171 -----KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
                K +         S D  H G  F     D       +++ A D+  ++ +  GD 
Sbjct: 291 LDVLRKAMAGRRTLIVASGDLAHVGPAFGGAPLDATG---RRTLRAADEELIEAMRRGDA 347

Query: 226 DAFKKYL--LEYDNTICGRHPISVFLHMLGNC 255
             F   +  ++  N +CG  PI + L +LG  
Sbjct: 348 TGFFNAIRRVQNRNNVCGVAPIYLTLRLLGQS 379


>gi|46581301|ref|YP_012109.1| hypothetical protein DVU2897 [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|387154520|ref|YP_005703456.1| hypothetical protein Deval_2676 [Desulfovibrio vulgaris RCH1]
 gi|46450722|gb|AAS97369.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|311234964|gb|ADP87818.1| protein of unknown function DUF52 [Desulfovibrio vulgaris RCH1]
          Length = 329

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 65/255 (25%), Positives = 99/255 (38%), Gaps = 32/255 (12%)

Query: 43  VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
           V+ PHAGY YSGR A      +    +  VF+LGP+H       A+     + TP+G +P
Sbjct: 94  VMLPHAGYVYSGRVAGRTLSQVRLAPV--VFMLGPNHTGRGAPLAVWPEGDWLTPLGSVP 151

Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENE 162
           +       L         +    E EHS+E+ LP L        + I+P+ V   +A   
Sbjct: 152 VHERAAAALLDKDGGYTANRTAHEGEHSLEVLLPLLQVRHPA--LSIIPVAVSEQDAGAL 209

Query: 163 AMYGRLFAKYVDD------PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
              G   A+ + +      PS+   +SSD  H+ +R                 E  D + 
Sbjct: 210 QRAGASLARTMQELAAAGVPSSIV-LSSDMSHYVTRTQ--------------AEERDALA 254

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCK 273
           +  +   DP+     +     T+CG  P  V LH    LG  S  +      Y+ S+   
Sbjct: 255 LGRMAALDPEGLYATVRHNRITMCGVLPAVVALHACRALGAESACLAA----YDTSASAS 310

Query: 274 TKRDSSVSYASAAAK 288
              +  V YA   AK
Sbjct: 311 GDHERVVGYAGMVAK 325


>gi|332295079|ref|YP_004437002.1| hypothetical protein Thena_0222 [Thermodesulfobium narugense DSM
           14796]
 gi|332178182|gb|AEE13871.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thermodesulfobium narugense DSM 14796]
          Length = 264

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 24/211 (11%)

Query: 34  LPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTAT 92
           +PK   V+GV+ PHAGY++SG  A   + +I+ P +     L+GP H   +    LS  T
Sbjct: 29  IPKR-RVKGVVVPHAGYNFSGSIAGKVYSSIECPDTF---LLIGPKHSMESDGIFLS-QT 83

Query: 93  VYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI 152
            + TP+G++  D ++ E L    +F  ++  +   EHS+E+ +P++  V      KIVP+
Sbjct: 84  SWATPLGEVMPDRDLGESLLHHCEFIHLNERIHANEHSLEVQVPFIKYVCPK--AKIVPV 141

Query: 153 LVGAVNAENEAMYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
            V   +    +  G+  A  + + +    V  SSD  H   +   +  D+K   + K++ 
Sbjct: 142 AVSTTSEGILSSTGKCIANVLKESNKSVVVVMSSDLNHHEPQEITLEKDEK---VIKNLM 198

Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
           +L           DP    K + E D ++CG
Sbjct: 199 SL-----------DPRGLLKTVYEEDVSMCG 218


>gi|224372878|ref|YP_002607250.1| protein of unknown function DUF52 [Nautilia profundicola AmH]
 gi|223589628|gb|ACM93364.1| protein of unknown function DUF52 [Nautilia profundicola AmH]
          Length = 263

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 24/183 (13%)

Query: 41  RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
           + +I PHAG+ YSG  A +A+     T+   + ++GPSH +     + +    Y+TP G+
Sbjct: 46  KALIVPHAGWMYSGFTANFAYRIAQNTTPKAIAVIGPSHKFAFEGISTTLENEYETPCGN 105

Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
           LP+D     EL     F++ ++     EHS E+ +P++   F       +P++       
Sbjct: 106 LPIDTATAYEL--INNFDVQNLEYVHVEHSTEVQMPFIKHYFNN-----IPVI------- 151

Query: 161 NEAMYGRLFAKYVDDPSNFF-------SVSSDFCHWGS--RFNYMHYDKKHGVIHKSIEA 211
            E +Y     K + +  N+         +SSD  H+      N + Y+    V ++ +  
Sbjct: 152 -ELIYSNYSPKKLKEIINYLIQKDILVVISSDLSHYYDIKTANALDYNCLEAVNNQDVLQ 210

Query: 212 LDK 214
           L+K
Sbjct: 211 LEK 213


>gi|120601519|ref|YP_965919.1| hypothetical protein Dvul_0469 [Desulfovibrio vulgaris DP4]
 gi|120561748|gb|ABM27492.1| protein of unknown function DUF52 [Desulfovibrio vulgaris DP4]
          Length = 329

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 65/255 (25%), Positives = 99/255 (38%), Gaps = 32/255 (12%)

Query: 43  VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
           V+ PHAGY YSGR A      +    +  VF+LGP+H       A+     + TP+G +P
Sbjct: 94  VMLPHAGYVYSGRVAGRTLSQVRLAPV--VFMLGPNHTGRGAPLAVWPEGDWLTPLGSVP 151

Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENE 162
           +       L         +    E EHS+E+ LP L        + I+P+ V   +A   
Sbjct: 152 VHERAAAALLDKDGGYTANRTAHEGEHSLEVLLPLLQ--VRHPALSIIPVAVSEQDAGAL 209

Query: 163 AMYGRLFAKYVDD------PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
              G   A+ + +      PS+   +SSD  H+ +R                 E  D + 
Sbjct: 210 QRAGASLARTMQELAAAGVPSSIV-LSSDMSHYVTRTQ--------------AEERDALA 254

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCK 273
           +  +   DP+     +     T+CG  P  V LH    LG  S  +      Y+ S+   
Sbjct: 255 LGRMAALDPEGLYATVRHNRITMCGVLPAVVALHACRALGAESACLAA----YDTSASAS 310

Query: 274 TKRDSSVSYASAAAK 288
              +  V YA   AK
Sbjct: 311 GDHERVVGYAGMVAK 325


>gi|156743567|ref|YP_001433696.1| hypothetical protein Rcas_3629 [Roseiflexus castenholzii DSM 13941]
 gi|156234895|gb|ABU59678.1| protein of unknown function DUF52 [Roseiflexus castenholzii DSM
           13941]
          Length = 405

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 109/285 (38%), Gaps = 61/285 (21%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           R  + AG  Y  +   L   LD ++   G   P  P  RGV++PH  Y+  GR  A  + 
Sbjct: 114 RPPALAGQSYPADPAELRRLLDDFIAAVGPVAPAPPTGRGVLSPHIDYARGGRVYAQVWQ 173

Query: 63  NIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
                  +   V L+G  H  Y+P+    T   Y TP+G LP D  V++ L A       
Sbjct: 174 RAAEMVRAAEIVLLIGTDH--YSPEPVTLTRQRYATPLGVLPTDTSVVDALAAA------ 225

Query: 121 DICVDEA---------EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
            I  D A         EHS+E+   +L  +       ++PIL G+            FA+
Sbjct: 226 -IGEDAAFAGELYHRVEHSLELVAVWLQYIRGDAPCPVIPILAGS------------FAR 272

Query: 172 YV--DDPSN--------------------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSI 209
           Y+  DDP+                        +S D  H G  F             +++
Sbjct: 273 YMDGDDPATDPRFEALITALRRIIASRHAVVIISGDMSHVGPAFGGAPLSNAD---KEAL 329

Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDN--TICGRHPISVFLHML 252
              D++ +D +  GD   F + + E  +   ICG  P  + L ++
Sbjct: 330 RRADELVIDRMRAGDAAGFFRVIAETGDRQNICGLPPTYLALRLM 374


>gi|258404346|ref|YP_003197088.1| hypothetical protein Dret_0208 [Desulfohalobium retbaense DSM 5692]
 gi|257796573|gb|ACV67510.1| protein of unknown function DUF52 [Desulfohalobium retbaense DSM
           5692]
          Length = 268

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 115/287 (40%), Gaps = 44/287 (15%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTS 68
           AG +Y        +++  +L +A  P+SP +  ++ PHAGY +SG  A    G    P +
Sbjct: 8   AGQFYPGTKTAWEQQVRTYLAQATAPQSPSLMAMV-PHAGYPFSGPVAGKTLGAASLPET 66

Query: 69  ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEA- 127
           +    LLGP+H       AL +   +  P   L ++    E+L          I  DEA 
Sbjct: 67  L---LLLGPNHTGLGKALALWSRGAWDIPGASLAIE----EKLAGAIVHGCPHIASDEAA 119

Query: 128 ---EHSMEMHLPYLAKVFEGHLIKIVPILVGA-------VNAEN-EAMYGRLFAKYVDDP 176
              EHS+E+ LP+L  +      +IVP+ V           AE+   + GR         
Sbjct: 120 HLREHSLEVILPFLWAL--NPRTRIVPLAVAEHHLPTLLETAEHLHTVLGR------QQQ 171

Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
                VSSD  H+ S       + KH         LD M +  I   DP+     +    
Sbjct: 172 QVGIVVSSDMSHFISE-----QEAKH---------LDDMALQRILDLDPEGLYNTVRNNR 217

Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            ++CG  P+++ L ++      +  K + Y  S+     R   V YA
Sbjct: 218 ISMCGVLPMTLGLQLVKRMGA-VSAKLVDYATSADATGDRSQVVGYA 263


>gi|288931699|ref|YP_003435759.1| hypothetical protein Ferp_1332 [Ferroglobus placidus DSM 10642]
 gi|288893947|gb|ADC65484.1| protein of unknown function DUF52 [Ferroglobus placidus DSM 10642]
          Length = 261

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 77/289 (26%), Positives = 125/289 (43%), Gaps = 31/289 (10%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
           +R  + AG +Y  N     E L   LRE    +S E V  +++PHAGY YSGR A  A  
Sbjct: 1   MRHPAVAGYFYPSNP----ESLLAMLREFAEKRSDEDVVAIVSPHAGYVYSGRTAG-AVH 55

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           ++ P  +    ++GP+H       A+S    + TP+G +  D E I+ +    K  ++D 
Sbjct: 56  SLLP-DVETYVIVGPNHTGLGAAVAVSYDD-WMTPLGVVESDREFIDAMPK--KICVVDE 111

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFF 180
                EHS+E+ +P+L  + +    KIV I +G  + E   E +   L A+         
Sbjct: 112 IAHREEHSLEVQVPFLQYLHKD--FKIVAICMGLQDKETAEEVVEEILEAQEKTGRRIAL 169

Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
             SSD  H+              +        D++ ++ I + D D +   + +   ++C
Sbjct: 170 VASSDMHHY--------------LPDDECRRRDEIVIEAILSMDVDKYYDTIYKLQASVC 215

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           G  PI+V   ML +     K + + Y  S     K    V YA  A +V
Sbjct: 216 GYGPIAVA--MLYSKELGAKAELVHYSTSGDVADK-SQVVGYAGIAFRV 261


>gi|322689777|ref|YP_004209511.1| hypothetical protein BLIF_1595 [Bifidobacterium longum subsp.
           infantis 157F]
 gi|320461113|dbj|BAJ71733.1| conserved hypothetical protein [Bifidobacterium longum subsp.
           infantis 157F]
          Length = 500

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 95/226 (42%), Gaps = 44/226 (19%)

Query: 46  PHAGYSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
           PHAGY YSG AAA A+  ++    S++R  ++GP+H       A STA  ++TP+G +P+
Sbjct: 2   PHAGYIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPV 61

Query: 104 D---------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL 146
           D         L V E L+        A     +++      EH++E+ +P+L  V  G  
Sbjct: 62  DIAAERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPD 120

Query: 147 IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH 206
           + IVP+  G    +     G +       P     +SSD  H+                H
Sbjct: 121 LTIVPLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------H 163

Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
           +   ALD   +        D    +L  +    CG +PI+  L +L
Sbjct: 164 EVARALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 202


>gi|430742361|ref|YP_007201490.1| dioxygenase [Singulisphaera acidiphila DSM 18658]
 gi|430014081|gb|AGA25795.1| putative dioxygenase [Singulisphaera acidiphila DSM 18658]
          Length = 413

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 38/313 (12%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPE-------VRGVIAPHAGYS 51
           E++R A+ AG  Y    + L  +L  +   A   G+P+ PE       +RGV++PH  + 
Sbjct: 106 EQVRPAALAGRSYAGTERALRSQLARYFAHADGSGVPR-PEPQTLPSRLRGVLSPHIDFQ 164

Query: 52  YSGRAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
             G    +++  +   S +  F +LG +H Y   + AL T   ++TP+G +  D E ++ 
Sbjct: 165 RGGLVYTWSYKELVEQSDADTFVILGVAHQYCRNRFAL-TRKDFETPLGRVRTDREYVDR 223

Query: 111 LKATGKFELM-DICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGA----VNAENEAM 164
           + A    +L  D      EHS+E  + +L  +  G     IVPILVG+    ++A  + +
Sbjct: 224 IAALAGQDLFEDELSHRTEHSIEFQVVFLQYLLGGVRDFSIVPILVGSFHDLMDAGTDPI 283

Query: 165 YGRLFAKYVD-----DPSNFFSVS----SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
                 ++V+     + +N   V+     D CH G  F     D     I   + + D  
Sbjct: 284 ESDDVRRFVEALRASEAANGRKVAYIGGIDLCHVGPEFG--DPDPLDPEILAEVRSFDTA 341

Query: 216 GMDIIETGDPDAFKKYLLEYDN--TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
            +      D   +     E  N   +CG       LH +G     +    L+Y+Q+    
Sbjct: 342 MLARAAAHDAAGWFGTAAEIGNRWRVCGLAAAYTMLHAMGPARGTL----LKYDQA--VD 395

Query: 274 TKRDSSVSYASAA 286
            +R   VS+AS A
Sbjct: 396 HERTCCVSFASLA 408


>gi|126728359|ref|ZP_01744175.1| Predicted dioxygenase [Sagittula stellata E-37]
 gi|126711324|gb|EBA10374.1| Predicted dioxygenase [Sagittula stellata E-37]
          Length = 450

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 74/172 (43%), Gaps = 13/172 (7%)

Query: 33  GLPKSPEVR-GVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA 91
           G P +PE    VI+PHAGY +SGR  A A       +   + +L PSH +     A  + 
Sbjct: 33  GAPTAPEPPVAVISPHAGYRFSGRLTARALATTREAAPKSIAVLSPSHRHAFDGIAAPSQ 92

Query: 92  TVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVP 151
             +  P G   +D+     + A G   + D   D+ EH +E+ LP L  +     + ++P
Sbjct: 93  DAFALPTGTQRIDIATRAAMVAAGLIHVEDAAHDQ-EHGVEVQLPVLHALHPD--VPVLP 149

Query: 152 ILVGAVNAENEAMYGRLFAKYVD--DPSNFFSVSSDFCHWGSRFNYMHYDKK 201
           +++G    +         A  VD         +SSD  H+ +R +    D +
Sbjct: 150 LVIGRTGNDR-------VAALVDALPEGTLIVLSSDLSHFLTRDDARAKDAR 194


>gi|116747526|ref|YP_844213.1| hypothetical protein Sfum_0075 [Syntrophobacter fumaroxidans MPOB]
 gi|116696590|gb|ABK15778.1| protein of unknown function DUF52 [Syntrophobacter fumaroxidans
           MPOB]
          Length = 414

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/272 (29%), Positives = 113/272 (41%), Gaps = 29/272 (10%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSP-------EVRGVIAPHAGYSYS 53
           + +RR  HAG  Y  +++ L   L+G+   E G P  P        V G++APH      
Sbjct: 109 DPVRRMRHAGQSYPSDAELLNRRLEGFFSAEDGGPGLPGAARDNRPVLGLVAPHIDIQAG 168

Query: 54  GRAAAYAF-GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
           GR  A+A+    D  S     +LG  H   +   AL TA  ++TP+G +  D E    L 
Sbjct: 169 GRCFAHAYKAAADSVSPRTWIVLGTGHELVSNYFAL-TAKDFETPLGLVGHDEECCAHLV 227

Query: 113 ATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAV--NAENEAMYG 166
            + K    DI   E     EH++E    +LA V  G   KIVP+L      + E +  Y 
Sbjct: 228 NSAK---RDILAGEYNHVREHTVEFQAVFLAYVQPG--AKIVPLLCSFSHEDLETDGEYI 282

Query: 167 RLFAKYVDDPSNFFSV----SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
             FA  + D     SV    S D  H G R+          V  K   A D+  ++ +  
Sbjct: 283 DHFAGLLRDLVLTRSVGILASVDLAHIGPRYGDRFQPTDSTV--KDHMASDRGLVESLRE 340

Query: 223 GDPDAFKKYLLEYDN--TICGRHPISVFLHML 252
            D +AF + +    N   ICG  P+ V    L
Sbjct: 341 CDAEAFIRQIRLEGNRRKICGVAPLYVLAQAL 372


>gi|386391427|ref|ZP_10076208.1| putative dioxygenase [Desulfovibrio sp. U5L]
 gi|385732305|gb|EIG52503.1| putative dioxygenase [Desulfovibrio sp. U5L]
          Length = 406

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 123/311 (39%), Gaps = 48/311 (15%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPH-------AGY--SYS 53
           +R A  AG  Y D    L   LD  L  A   P +  V  VIAPH       AGY  +Y 
Sbjct: 110 VREAVFAGQAYPDAPALLVPYLDAMLEAASARPAAGPVVAVIAPHIDPEAGKAGYGAAYG 169

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
              AA A G   P    RV +LG  H        L T   + TP+G++P D   +  L+ 
Sbjct: 170 ALRAALA-GRTPP---KRVLVLGVGHQVIKGLYCL-TDKAFATPLGEVPADGPAVAALRR 224

Query: 114 TGKFELMDICVDEA------EHSMEMHLPYLAKVFEGHLIKIVPILVGAV---------N 158
            G+       VD A      EHS+E    +L  V       +VPIL G+           
Sbjct: 225 AGQ-----ATVDPADLPHRSEHSVEFQAVFLRHVLGDAPFTLVPILCGSPLGALPRHSRQ 279

Query: 159 AENEAMYGRLFA--KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
           A  EA    L A  +   DP        D  H G +F +         + ++  A D   
Sbjct: 280 AFREAAGPFLSALREMAADPDTLVVAGVDLFHIGPKFGHAEPAAA---LEEAALAHDTAL 336

Query: 217 MDIIETGDPDAF--KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
           +D +  GD DAF  +   ++    +CG   ++    +L   +    +  L ++   + +T
Sbjct: 337 LDALARGDADAFWEESARVKDAYNVCGFTALATLAEILPPAT----MALLAHDVMREAET 392

Query: 275 KRDSSVSYASA 285
           +  S+VS+A A
Sbjct: 393 R--SAVSFAGA 401


>gi|237808838|ref|YP_002893278.1| hypothetical protein Tola_2093 [Tolumonas auensis DSM 9187]
 gi|237501099|gb|ACQ93692.1| protein of unknown function DUF52 [Tolumonas auensis DSM 9187]
          Length = 258

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 75/299 (25%), Positives = 121/299 (40%), Gaps = 56/299 (18%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV----RGVIAPHAGYSYSGRA 56
           M+ I + + A  +Y  +  RL +    W++   +  +PE     R +I PHAGY +SG  
Sbjct: 1   MQVIHQPAVADMFYPADPDRLRD----WMQH-HVKAAPETGRKPRMLILPHAGYRFSGAI 55

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA-TVYKTPIGDLPLDLEVIEELKATG 115
           AA  +  +      RV +L P+H  Y    A+        TP G +PLD   I  L +  
Sbjct: 56  AAQGYSLLQKGDFKRVIVLCPAHRVYLQGIAVPEHWDGEATPFGTIPLDKPAIASLLSLP 115

Query: 116 KFELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
                 I   EA   EH++E+ LP+L   ++     ++P++VG   AE          + 
Sbjct: 116 GV----IAATEAHRQEHAIEVQLPFLQ--YQLGDFSLIPLVVGDCPAET---VSHAIEQI 166

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
           + D      VSSD  H+ S+                         DI E  D D   + +
Sbjct: 167 MTD-DTLIIVSSDLSHYLSK-------------------------DIAELQD-DYTIRQI 199

Query: 233 LEYDNTI-----CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
           L++  T+     CG + ++  LH       ++ IK L    S      ++  V YA+ A
Sbjct: 200 LQFSETLTGDQACGCYALNGALHWAHQ--QQLDIKLLAKCNSGDTSHDKERVVGYAAFA 256


>gi|337286681|ref|YP_004626154.1| hypothetical protein Thein_1323 [Thermodesulfatator indicus DSM
           15286]
 gi|335359509|gb|AEH45190.1| protein of unknown function DUF52 [Thermodesulfatator indicus DSM
           15286]
          Length = 267

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 86/189 (45%), Gaps = 10/189 (5%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           IR  + AG +Y  N + L +E++ +L     PK P + G + PHAGY +SG  A   +  
Sbjct: 2   IRMPAVAGRFYEANPELLRKEIEAYLDPTA-PKEPAI-GAVCPHAGYMFSGHVAGAVYSR 59

Query: 64  -IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
            I P +   V +LGP+H       A+     ++ P G +P++ ++   +    +    D+
Sbjct: 60  LIIPDT---VVILGPNHTGLGHPAAIMAKGAWQMPFGTVPIEEKLAAFILQESQVLSHDV 116

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FF 180
                EHS+E+ +P+L  +     + IVPI +  +  E     G   AK +         
Sbjct: 117 EAHLYEHSLEVQVPFLQYLNPN--VAIVPICLSHLPYEALEDIGLAVAKGIAAYGGPVLI 174

Query: 181 SVSSDFCHW 189
             S+D  H+
Sbjct: 175 VASTDMSHY 183


>gi|403740291|ref|ZP_10952468.1| hypothetical protein AUCHE_22_00080 [Austwickia chelonae NBRC
           105200]
 gi|403190089|dbj|GAB79238.1| hypothetical protein AUCHE_22_00080 [Austwickia chelonae NBRC
           105200]
          Length = 271

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 74/153 (48%), Gaps = 9/153 (5%)

Query: 10  AGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID 65
           AG  Y   ++ L + +   L E      L   P  + V+  H+ Y + G  +A A+  ++
Sbjct: 10  AGRHYPAEARELTDLMRSQLAEGRADMALDPEPPPKAVLVAHSAYVHCGHGSAAAYARVE 69

Query: 66  PT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT-GKFELMDI 122
               ++ RV LLGP+         L   T + TP+GD+P+D + +  L A+  +   +D 
Sbjct: 70  TARGTVRRVVLLGPARQDPPRGITLPAWTTFTTPLGDVPVDQDAVRALTASMPEVVHVDD 129

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
               AE S+E+ LP+L +V  G    +VP+ VG
Sbjct: 130 GPHLAETSLEVQLPFLQEVL-GEF-TVVPLAVG 160


>gi|313682443|ref|YP_004060181.1| hypothetical protein Sulku_1318 [Sulfuricurvum kujiense DSM 16994]
 gi|313155303|gb|ADR33981.1| protein of unknown function DUF52 [Sulfuricurvum kujiense DSM
           16994]
          Length = 266

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 54/98 (55%), Gaps = 2/98 (2%)

Query: 41  RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
             +I PHAG+ YSG  A  A+  ++ +    V ++GPSH       ++   T Y+TP+GD
Sbjct: 47  NAIIVPHAGWVYSGFTANVAYRILNNSEPKTVVVIGPSHRVGFDGVSICDLTHYQTPLGD 106

Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYL 138
           L +D ++ +EL+   +F L        EHS E+ +P++
Sbjct: 107 LTIDPQLTKELRE--RFALPYFSEAHHEHSTEVQMPFI 142


>gi|193214202|ref|YP_001995401.1| hypothetical protein Ctha_0483 [Chloroherpeton thalassium ATCC
           35110]
 gi|193087679|gb|ACF12954.1| protein of unknown function DUF52 [Chloroherpeton thalassium ATCC
           35110]
          Length = 292

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 87/200 (43%), Gaps = 9/200 (4%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M+ IR       + +D  +++   L  +     LP+  ++ G + PHA + YSG+ AA  
Sbjct: 10  MQFIREPQAMDLYPSDCGQQIFAFLKKYTPSTELPE--KLIGAVVPHAAWKYSGKVAART 67

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFEL 119
              +   S   + +L  + H    K  L     +KTP+G LP+  E+   L  A  K   
Sbjct: 68  LHTLSERSSPDICVLLGADHIGLKKHTLLPDGTWKTPLGSLPIASELASALYSALPKILS 127

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPS 177
            D+   E EHS+E+  P +A  +    + I+PI++  V   N    G    + +     +
Sbjct: 128 NDMSAHELEHSLEVISPMIAYFWPK--LSILPIII--VPNANALEIGNALVQLLKASGKT 183

Query: 178 NFFSVSSDFCHWGSRFNYMH 197
             F  S+D  H+G  +   H
Sbjct: 184 AIFVASTDLTHYGRFYGNAH 203


>gi|146332000|gb|ABQ22506.1| memo-like protein [Callithrix jacchus]
          Length = 61

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 30/59 (50%), Positives = 36/59 (61%), Gaps = 3/59 (5%)

Query: 234 EYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
           +Y NTICGRHPI V L+    L      +   FL Y QSSQC+  +DSSVSYA+ A  V
Sbjct: 2   KYHNTICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 60


>gi|357634174|ref|ZP_09132052.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfovibrio sp. FW1012B]
 gi|357582728|gb|EHJ48061.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfovibrio sp. FW1012B]
          Length = 406

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 120/309 (38%), Gaps = 44/309 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPH-------AGYSYSGR 55
           +R A  AG  Y D    L   LD  L  A   P +  V  VIAPH       AGY  +  
Sbjct: 110 VRPAVFAGQAYPDAPALLVPYLDAMLEAASARPAAGPVVAVIAPHIDPEAGKAGYGAAYG 169

Query: 56  AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
           A   A     P    RV +LG  H        L T  V+ TP+G++P D   +  L+  G
Sbjct: 170 ALRAALDGRTPPK--RVVVLGVGHQVIKGLYCL-TDKVFATPLGEVPADGPAVAALRRAG 226

Query: 116 KFELMDICVDEA------EHSMEMHLPYLAKVFEGHLIKIVPILVGAV---------NAE 160
           +       VD A      EHS+E    +L  V       +VPIL G+           A 
Sbjct: 227 Q-----ATVDPADLPHRSEHSVEFQAVFLRHVLGDAPFTLVPILCGSPLGALPRHSRQAF 281

Query: 161 NEAMYGRLFA--KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
            EA    L A  +   DP        D  H G +F +    +    + ++  A D   +D
Sbjct: 282 REAAGPFLSALREMAADPDTLVVAGVDLFHIGPKFGHAEPAEA---LEEAALAHDTALLD 338

Query: 219 IIETGDPDAFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
            +  GD DA  +      +   +CG   ++    +L   +    +  L ++   + +T+ 
Sbjct: 339 ALARGDADAIWEESARVKDVYNVCGFTALATLAEILPPAT----MALLAHDVMREAETR- 393

Query: 277 DSSVSYASA 285
            S+VS+A A
Sbjct: 394 -SAVSFAGA 401


>gi|337286172|ref|YP_004625645.1| hypothetical protein Thein_0803 [Thermodesulfatator indicus DSM
           15286]
 gi|335359000|gb|AEH44681.1| protein of unknown function DUF52 [Thermodesulfatator indicus DSM
           15286]
          Length = 390

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/199 (29%), Positives = 89/199 (44%), Gaps = 18/199 (9%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAYA 60
           K+R +SHAG  Y      L   L+  L     PK P    R +IAPH  +    +  A A
Sbjct: 112 KVRPSSHAGHAYPLKPDELKTFLNQILNL--WPKRPNYNPRIIIAPHIDFRAGAQTFAAA 169

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
           +  +     +RV +LG  H   TP  +L+    ++TP+G +  D E + EL      +L 
Sbjct: 170 YQGLSWPKGARVIVLGTGHFLETP-VSLAYKD-FETPLGLVKYDREFVAELSKKIDEDLR 227

Query: 121 D-ICVDEAEHSMEMHLPYLAKVFEGHLI---KIVPILVGAVNAENEAMYGRLFAKYVD-- 174
                 ++EHS++  +     VF  HL+    +VPILV +    +   + +L     D  
Sbjct: 228 GHEWAHKSEHSIDFQV-----VFLKHLLGEFSLVPILVASPQG-HRNFFKKLAESLRDLL 281

Query: 175 DPSNFFSVSSDFCHWGSRF 193
           D   +  V  DFCH G R+
Sbjct: 282 DEKTYLVVGVDFCHLGLRY 300


>gi|406919694|gb|EKD57918.1| hypothetical protein ACD_57C00099G0002 [uncultured bacterium]
          Length = 300

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 6/150 (4%)

Query: 40  VRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG 99
           V G I PH G +   +  +  F  I   +I  + L+GP+H         ++   ++T  G
Sbjct: 75  VTGAIIPHHGLA--AKYFSSFFNVISGKNIKTIILIGPNHKLVGDNPVYTSDLTWETEFG 132

Query: 100 DLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA 159
            +  D E++++L  TGK  + D  + E EHS+   +PY+AK   G  +K+VP++   +N 
Sbjct: 133 QVNADYELVQKLTETGKVGIDDKII-EDEHSVATIMPYIAKYMPG--VKVVPLVCKEINL 189

Query: 160 ENEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
                   +    + +P      + DF H+
Sbjct: 190 NKIGELTEIITANL-EPGVIVISAVDFSHY 218


>gi|358340513|dbj|GAA48391.1| hypothetical protein CLF_101548 [Clonorchis sinensis]
          Length = 256

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 71/163 (43%), Gaps = 20/163 (12%)

Query: 148 KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHK 207
           K+VP+ +G +  +   +  R F  Y+++P N F  S+  CHWG  + +         I  
Sbjct: 75  KVVPVYIGTLAHDQLTVVARRFRDYLEEPENAFIFSASLCHWGDIYGFSAQIPDAPTILD 134

Query: 208 SIEALDKMGMDIIETGDPDAFKKYLLE-----YDNTICG------RHPISVFLHMLGNCS 256
           SI+A DK  +  +       F ++LL+      DN I        R  ++V LH L +  
Sbjct: 135 SIKAFDKRAIAAMRELRFRPFDEFLLDTKAKVLDNQIIALLLFMVRQLLNVNLHALISMD 194

Query: 257 TKIKI----KFLRYEQSSQC-----KTKRDSSVSYASAAAKVD 290
            +  +    K  R++   Q       T  DS +SY SA  ++D
Sbjct: 195 EEKVVNQMRKLARFKLMGQAWSYPDVTSADSCISYVSAVVELD 237


>gi|149390717|gb|ABR25376.1| unknown [Oryza sativa Indica Group]
          Length = 53

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 38/45 (84%), Positives = 41/45 (91%)

Query: 246 SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
           SVFL ML +CSTKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD
Sbjct: 1   SVFLSMLKHCSTKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVD 45


>gi|41615137|ref|NP_963635.1| hypothetical protein NEQ347 [Nanoarchaeum equitans Kin4-M]
 gi|40068861|gb|AAR39196.1| NEQ347 [Nanoarchaeum equitans Kin4-M]
          Length = 266

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 60/236 (25%), Positives = 101/236 (42%), Gaps = 15/236 (6%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
           AG +Y  N ++L   L+   +  GL K   ++  I PHAGY YSG+A A  + N D    
Sbjct: 5   AGKFYPSNREQLIALLNELFK--GLKKEYSLKAGIVPHAGYIYSGKAMAEFWINSDRNKT 62

Query: 70  SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
             +  +G +H        +S   + +TP+G   +D E        G F+  D      EH
Sbjct: 63  YAI--IGTNHTGLGSLANVSLMPI-ETPLGIAKIDEEAAMIFMKNG-FDYDDRPF-LYEH 117

Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
           S+E  +P+L  +   + + +  ++        E   G+  A  + +     + SSDF H+
Sbjct: 118 SVENQIPFLQYLHGDNFLIVPSVMFNVYRFAKEV--GKQLALELPERVRLVA-SSDFTHY 174

Query: 190 GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPI 245
           G  + Y  +        + ++ LD   +  I   D   F K ++    T+CG  PI
Sbjct: 175 GDIYGYKPFSDG-----RKVKELDMKLISYILKLDSLGFYKEIVRTGATVCGWGPI 225


>gi|256830987|ref|YP_003159715.1| hypothetical protein Dbac_3224 [Desulfomicrobium baculatum DSM
           4028]
 gi|256580163|gb|ACU91299.1| protein of unknown function DUF52 [Desulfomicrobium baculatum DSM
           4028]
          Length = 270

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 66/246 (26%), Positives = 97/246 (39%), Gaps = 27/246 (10%)

Query: 44  IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
           + PHAG+ +SG  A         T    V LLGP+H       A+     +  P   + +
Sbjct: 43  MVPHAGHVFSGGVAGQTLARAKLTDT--VLLLGPNHTGMGAPLAVWPDGKWLLPGAAMDV 100

Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
           D E+   L A       D      EHS+E+ LP+L    +   ++I+PI VG        
Sbjct: 101 DAELAAALLAAEPALTADRVAHLQEHSLEVVLPFLWA--KNPQMRIIPIAVGDPRPHK-- 156

Query: 164 MYGRLFAKYVDDPSNF-----FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
             G   AK  +  S         VSSD  H+ S       D+K  VI       D+  +D
Sbjct: 157 -LGGAAAKIAEALSALGREVSVVVSSDMNHFAS-------DEKTRVI-------DQHALD 201

Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
            I   DP  F   + E + ++CG  P+++ +H L N     K + + Y  S         
Sbjct: 202 RILALDPMGFYGKVREENISMCGVLPMTLGMH-LANIQGAKKAELVAYATSGDVNGDMSR 260

Query: 279 SVSYAS 284
            V YA 
Sbjct: 261 VVGYAG 266


>gi|386347394|ref|YP_006045643.1| hypothetical protein [Spirochaeta thermophila DSM 6578]
 gi|339412361|gb|AEJ61926.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Spirochaeta thermophila DSM 6578]
          Length = 275

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 56/203 (27%), Positives = 90/203 (44%), Gaps = 28/203 (13%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           K+R       WY D  + + EE+ GW R E  LP +P V   I PHAG+ +SG  A    
Sbjct: 2   KVRERMLPPGWYPDEKEEVLEEIAGWKRLERRLP-TPAV-ACIVPHAGWYFSGELAFMGI 59

Query: 62  GNIDPTSISRVFLLG----PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
            ++DP +     + G     S   Y P+  LS      TP+GDLP++  ++E L+     
Sbjct: 60  RSLDPEAEVVAVVGGHLSERSPLLYMPEDGLS------TPLGDLPVERPLVEALREHVPT 113

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLI---KIVPILVGAVNAENEAMYGRLFAKYVD 174
           E  D+  D   +++E+ LP +   F    +   +  P+L      E      R+   Y  
Sbjct: 114 E-PDVYPD---NTVEIQLPLVKYCFPHASVVGLRCPPLL------EVAERTARVLYDYGK 163

Query: 175 DPSNFFSV--SSDFCHWGSRFNY 195
           +      V  S+D  H+G ++ +
Sbjct: 164 ETGKKLVVIGSTDLTHYGPQYGF 186


>gi|374814620|ref|ZP_09718357.1| hypothetical protein TpriZ_12205 [Treponema primitia ZAS-1]
          Length = 283

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 76/161 (47%), Gaps = 12/161 (7%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG--VIAPHAGYSYSGRAAAYA 60
           K R    +G +Y D+      E+   +R  GL +    R   +IAPH  + +SG  AA A
Sbjct: 11  KGRSPVVSGLFYPDDRA----EMGTRIRALGLKRGVGGRASAIIAPHGAWDFSGSVAATA 66

Query: 61  FGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-F 117
           F         +SRV +LG  H    P    S +  + TP+G LP+D+E+ E L +    F
Sbjct: 67  FSAAAGRANEVSRVVILGNVHRMEEPGVFFSDSHYFDTPLGRLPVDMELSESLASCSTLF 126

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN 158
           E+ DI     E ++E+ LP     F G    IVPIL+G   
Sbjct: 127 EVNDIP-HLQEIAVEVLLPLTQFCFPGA--AIVPILMGGAQ 164


>gi|337288503|ref|YP_004627975.1| hypothetical protein TOPB45_0950 [Thermodesulfobacterium sp. OPB45]
 gi|334902241|gb|AEH23047.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thermodesulfobacterium geofontis OPF15]
          Length = 396

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/233 (24%), Positives = 99/233 (42%), Gaps = 14/233 (6%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
           +R  +HA   Y  ++      ++  L+      S   + +IAPH       +A A ++  
Sbjct: 111 LRPMAHANQAYPLSASEAKFFIEDILKLCNPDSSKPPKILIAPHIDIRAGAKAFAESYSR 170

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM-DI 122
               S SRV +LG  HH   P   L+      TP G +  D   +  L  + K EL  + 
Sbjct: 171 FKIPSGSRVIILGVGHHLDLPWSVLTKDVA--TPFGVVKNDRGGLLYLSNSKKIELFPNH 228

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD------DP 176
              + EHS+E  + +L  + +   + ++P L+G +    E     L  K+++      D 
Sbjct: 229 IAHKLEHSIEFQVLFLHHLLKDEFV-VLPFLIGPLIVFFEEKTKELLEKFIEALSELIDD 287

Query: 177 SNFFSVSSDFCHWGSRF-NYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
             +  +  DFCH G R+ +     + H  I K +E  DK  +++   G+ + F
Sbjct: 288 RTYIVLGIDFCHLGLRYGDPFEVSETH--IKKILET-DKRLLELTFNGNSEEF 337


>gi|392411412|ref|YP_006448019.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
 gi|390624548|gb|AFM25755.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
          Length = 413

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 74/307 (24%), Positives = 128/307 (41%), Gaps = 35/307 (11%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGRA 56
           +R  SHAG+ Y  + + L  +L  +    G P S +         G+IAPH      G+ 
Sbjct: 112 VRPCSHAGASYEADPESLHRQLSSFFSGDGGPGSIQSGTDPRRPLGLIAPHIDVRSGGKC 171

Query: 57  AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK-ATG 115
            A A+  +     S ++++  + H        +T   ++TP+G +  D E +  L    G
Sbjct: 172 FASAYHALAAGQPSDIYVIFGTGHSGVEGIFTATNLDFETPLGVVRTDREFLGHLTDRLG 231

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGAVNAENEAMYG------RL 168
                +  +   EH +E  + +L  +F   H   IVPIL  +++ E     G      R 
Sbjct: 232 HDPAAEEILHATEHVIEFQVIFLQHIFAARHPFTIVPILC-SLSPEMLHETGPFQDRKRK 290

Query: 169 FAKYVD---------DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
           F  +             S  F  S+D  H G R+       K G +++++E  D+  +  
Sbjct: 291 FDAFCSAVRSACQRIGKSVCFIASADLDHIGPRYGDGFIPHK-GTVNQTLEN-DRRMLGY 348

Query: 220 IETGDPDAFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
           +E  D   F +++     T  ICG  PI+   H +   +T+ ++  L Y Q       ++
Sbjct: 349 LEALDLPGFVRHVSSDQETTRICGFSPIAAMFHCMD--ATEGRLLALDYAQVD----DKN 402

Query: 278 SSVSYAS 284
           S VS+AS
Sbjct: 403 SFVSFAS 409


>gi|303325747|ref|ZP_07356190.1| hypothetical cytosolic protein [Desulfovibrio sp. 3_1_syn3]
 gi|302863663|gb|EFL86594.1| hypothetical cytosolic protein [Desulfovibrio sp. 3_1_syn3]
          Length = 298

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 68/310 (21%), Positives = 115/310 (37%), Gaps = 51/310 (16%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR---------------GVIAPHA 48
           IR    AG +Y      L  E+   L      + P V+               G++ PHA
Sbjct: 3   IRNPVAAGRFYPAAPADLIREVRACLDAGAAIRRPGVQDTSAPAETEAVSRLWGLMLPHA 62

Query: 49  GYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE- 106
           GY Y GR        ++ P ++    +L P+H        +     + TP+  LP+D + 
Sbjct: 63  GYVYCGRVLGTTLAGVELPRTL---VVLCPNHTGRGQALGVWPEGAWLTPLAPLPVDADL 119

Query: 107 ---VIEELKATGKFELMDICVDEAEHSMEMHLPYL-AKVFEGHLIKIVPILVG-----AV 157
              +IE     G F   D      EH++E+ LP+L     E   ++I P+ VG     A+
Sbjct: 120 AAALIERGDGNGGFA-ADTLSHLGEHAVEVVLPFLQVAAGEERPLRITPVCVGTQQPEAL 178

Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
            A  +A+   L             VSSD           HY+ +   + K     D + +
Sbjct: 179 RAAGQALADVLAGCRSKGQEVGVIVSSDM---------NHYEDERRTVEK-----DALAL 224

Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKT 274
           +     DP+           ++CG  P+++ L     LG    + + +   ++ S+    
Sbjct: 225 ERALACDPEGLLAVAARERISMCGAGPLALALFAARQLG----RARAELTAHDTSATASG 280

Query: 275 KRDSSVSYAS 284
             +  V YA 
Sbjct: 281 DTEHVVGYAG 290


>gi|345892602|ref|ZP_08843421.1| hypothetical protein HMPREF1022_02081 [Desulfovibrio sp.
           6_1_46AFAA]
 gi|345047094|gb|EGW50963.1| hypothetical protein HMPREF1022_02081 [Desulfovibrio sp.
           6_1_46AFAA]
          Length = 298

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 68/310 (21%), Positives = 115/310 (37%), Gaps = 51/310 (16%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR---------------GVIAPHA 48
           IR    AG +Y      L  E+   L      + P V+               G++ PHA
Sbjct: 3   IRNPVAAGRFYPAAPADLIREVRACLDAGAAIRRPGVQDASAPAETEAVSRPWGLMLPHA 62

Query: 49  GYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE- 106
           GY Y GR        ++ P ++    +L P+H        +     + TP+  LP+D + 
Sbjct: 63  GYVYCGRVLGTTLAGVELPRTL---VVLCPNHTGRGQALGVWPEGAWLTPLAPLPVDADL 119

Query: 107 ---VIEELKATGKFELMDICVDEAEHSMEMHLPYL-AKVFEGHLIKIVPILVG-----AV 157
              +IE     G F   D      EH++E+ LP+L     E   ++I P+ VG     A+
Sbjct: 120 AAALIERGDGNGGFA-ADTLSHLGEHAVEVVLPFLQVAAGEERPLRITPVCVGTQQPEAL 178

Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
            A  +A+   L             VSSD           HY+ +   + K     D + +
Sbjct: 179 RAAGQALADVLAGCRSKGQEVGVIVSSDM---------NHYEDERRTVEK-----DALAL 224

Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKT 274
           +     DP+           ++CG  P+++ L     LG    + + +   ++ S+    
Sbjct: 225 ERALACDPEGLLAVAARERISMCGAGPLALALFAARQLG----RARAELTAHDTSATASG 280

Query: 275 KRDSSVSYAS 284
             +  V YA 
Sbjct: 281 DTEHVVGYAG 290


>gi|307719287|ref|YP_003874819.1| dioxygenase [Spirochaeta thermophila DSM 6192]
 gi|306533013|gb|ADN02547.1| putative dioxygenase [Spirochaeta thermophila DSM 6192]
          Length = 299

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/201 (26%), Positives = 87/201 (43%), Gaps = 24/201 (11%)

Query: 3   KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           K+R       WY D  + + EE+ GW R      +P V   I PHAG+ +SG  A     
Sbjct: 26  KVRERMLPPGWYPDEKEEVLEEIAGWKRLERRLSTPAV-ACIVPHAGWYFSGELAFMGIR 84

Query: 63  NIDPTSISRVFLLG----PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
           ++DP +     + G     S   Y P+  LS      TP+GDLP++  ++E L+     E
Sbjct: 85  SLDPEAEVVAVVGGHLSERSPLLYMPEDGLS------TPLGDLPVERPLVETLREHVPTE 138

Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLI---KIVPIL-VGAVNAENEAMYGRLFAKYVD 174
             D+  D   +++E+ LP +   F    +   +  P+L V    A     YG+   K   
Sbjct: 139 -PDVYPD---NTVEIQLPLVKYCFPHASVVGLRCPPLLEVAERTARVLYDYGKETGK--- 191

Query: 175 DPSNFFSVSSDFCHWGSRFNY 195
                   S+D  H+G ++ +
Sbjct: 192 --KPVVIGSTDLTHYGPQYGF 210


>gi|94270265|ref|ZP_01291693.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
 gi|93450875|gb|EAT01892.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
          Length = 267

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/246 (23%), Positives = 99/246 (40%), Gaps = 27/246 (10%)

Query: 43  VIAPHAGYSYSG--RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
           V+ PHAGY +SG    A  A   I P     V +LGP+HH      A+     ++ P G 
Sbjct: 39  VVMPHAGYIFSGPVAGATVAAAQIPP----EVIVLGPNHHGLGATAAVMDQGAWEMPWGT 94

Query: 101 LPLDLEVIEE-LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA 159
           +P++  +  + L+    F+  D      EHS+E+ +P+L   +    ++IVPI +   + 
Sbjct: 95  VPINASLAAKVLEHCPDFQ-ADELAHRREHSLEVLVPFLH--YRQPELQIVPICLSRSDY 151

Query: 160 E--NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
           +    A  G   A          + S+D  H+ SR              ++    D + +
Sbjct: 152 QFCQRAGAGLAAAIKAWPEPVLLAASTDMSHFESR--------------EATTTKDNLAI 197

Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
             I   DP    + + E   ++CG  P  + L        + + + +RY  S +      
Sbjct: 198 GHILELDPAGLYRTVTEQRISMCGVIPTVISLIAALELGAR-RAELIRYADSGEASGDTA 256

Query: 278 SSVSYA 283
             V YA
Sbjct: 257 QVVGYA 262


>gi|239908973|ref|YP_002955715.1| hypothetical protein DMR_43380 [Desulfovibrio magneticus RS-1]
 gi|239798840|dbj|BAH77829.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 408

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 81/209 (38%), Gaps = 19/209 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPH----AGYSYSGRAAA 58
           R    AG  Y D    LA  LD  L EA         ++ VIAPH    AG +    A A
Sbjct: 113 RETVFAGQAYPDAGDELAAYLDAVLAEAPEACTAPGPIKAVIAPHIDPEAGRAAYAAAYA 172

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
                I   +  RV +LG  H        L T   + TP+GD+  D E +  L+A G   
Sbjct: 173 ALGAAIKAAAPKRVIVLGVGHQIIDGLFCL-TDKAFATPLGDVGADAEAVARLRAAGGRS 231

Query: 119 L-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF-------- 169
           +       +AEHS+E    +L  +       +VP+L G+     +A   + F        
Sbjct: 232 VDASTLPHKAEHSVEFQAVFLRHLLGDAPFAMVPVLCGSPAGVMDAPTRQAFRDCAGPFL 291

Query: 170 ---AKYVDDPSNFFSVSSDFCHWGSRFNY 195
              A+    P        DFCH G +F +
Sbjct: 292 DALAELASQPDTLIVAGVDFCHIGGKFGH 320


>gi|94265991|ref|ZP_01289714.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
 gi|93453457|gb|EAT03873.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
          Length = 267

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/246 (23%), Positives = 99/246 (40%), Gaps = 27/246 (10%)

Query: 43  VIAPHAGYSYSG--RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
           V+ PHAGY +SG    A  A   I P     V +LGP+HH      A+     ++ P G 
Sbjct: 39  VVMPHAGYIFSGPVAGATVAAAQIPP----EVIVLGPNHHGLGATAAVMDQGAWEMPWGT 94

Query: 101 LPLDLEVIEE-LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA 159
           +P++  +  + L+    F+  D      EHS+E+ +P+L   +    ++IVPI +   + 
Sbjct: 95  VPINASLAAKVLEHCPDFQ-ADELAHRREHSLEVLVPFLH--YRQPELQIVPICLSRSDY 151

Query: 160 E--NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
           +    A  G   A          + S+D  H+ SR              ++    D + +
Sbjct: 152 QFCQRAGAGLAAAIKAWPEPVLLAASTDMSHFESR--------------EATTTKDNLAI 197

Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
             I   DP    + + E   ++CG  P  + L        + + + +RY  S +      
Sbjct: 198 GHILELDPAGLYRTVTEQRISMCGVIPTVISLIAALELGAR-RAELIRYADSGEASGDTA 256

Query: 278 SSVSYA 283
             V YA
Sbjct: 257 QVVGYA 262


>gi|328950699|ref|YP_004368034.1| hypothetical protein Marky_1184 [Marinithermus hydrothermalis DSM
           14884]
 gi|328451023|gb|AEB11924.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Marinithermus hydrothermalis DSM 14884]
          Length = 381

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 59/215 (27%), Positives = 98/215 (45%), Gaps = 32/215 (14%)

Query: 40  VRGVIAPHAGYSYSGRAAAYAFGNI-----DPTSISRVFLLGPSHHYYTPKCALSTATVY 94
           VRGV+ PH       R     +G +           RV ++G +H     + A +    +
Sbjct: 139 VRGVVMPH----LEPRRVPEVYGAVLEALAQTPPPERVLVVGVAHRPIQ-EIAAALPLAF 193

Query: 95  KTPIGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPIL 153
           +TP+G + +DLE +  L A   FEL +  +    EHS+E    +L  V+     KI+P++
Sbjct: 194 ETPLGRVEVDLEALGALDALVPFELFNTPLAFREEHSVEFPAVFLKGVWPEASFKILPLI 253

Query: 154 VG----AVNAENEAMYGRLFAKYVDDPSNFFSVSS-DFCHWGSRFNYMHYDKKHGVIHKS 208
           V     A+  E  A    + AK  D P  F  V+S D  H G+RF        HG + ++
Sbjct: 254 VSGSEEAIPLEELARAVEVLAK--DYP--FLPVASVDLSHVGARFG-------HGPLTRA 302

Query: 209 I----EALDKMGMDIIETGDPD-AFKKYLLEYDNT 238
           +    +A+D+  ++++  G+ D AF    L  + T
Sbjct: 303 LATRAQAVDRRYLELVAGGEFDRAFLDVTLPKNPT 337


>gi|86159727|ref|YP_466512.1| hypothetical protein Adeh_3308 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85776238|gb|ABC83075.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 301

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 119/289 (41%), Gaps = 25/289 (8%)

Query: 10  AGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVI-APHAGYSYSGRAAAYAFGNIDP 66
           AG+ Y D    L   LD WL       +     RGV+ APH  Y+      A+A+  ++ 
Sbjct: 25  AGAVYPDAPGALRRALDRWLALPAGAPAAPPAPRGVVVAPHIDYARGAAGYAHAYRALE- 83

Query: 67  TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFELM-DICV 124
            S + +F++  + H   P+    T   Y TP+G +  D  +++ L  A G+  L+ D   
Sbjct: 84  ASRADLFVIFGTAHATPPRPFTLTRLDYGTPLGPVRTDRALVDALCGALGEDALLGDELC 143

Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-----AENEAMYGRLFAKYVDDPSNF 179
              EHS+E+    LA         ++P+L  A+      A   A +    A+ V   S  
Sbjct: 144 HRDEHSIELQAVVLAHRLR-RPFTVLPVLCSAIGHLADPAAATAPFLDALARAVAGRSVC 202

Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE--YDN 237
           +   +D  H G R  Y            ++ A D+  +  +E GDP  F +  +      
Sbjct: 203 WVAGADLAHVGPR--YGDARPPAPAELAALAAADRRTLRYVEAGDPAGFHRDAVRDGARR 260

Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
            +CG  PI   L   G  +     + L Y+Q     T    SVS+A+AA
Sbjct: 261 RLCGIAPIYAALRSAGAGA-----RLLHYQQ----WTDGVDSVSFAAAA 300


>gi|73748995|ref|YP_308234.1| hypothetical protein cbdb_A1253 [Dehalococcoides sp. CBDB1]
 gi|73660711|emb|CAI83318.1| conserved hypothetical protein [Dehalococcoides sp. CBDB1]
          Length = 385

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/213 (23%), Positives = 90/213 (42%), Gaps = 19/213 (8%)

Query: 73  FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSME 132
            +LGPSH     + A+  + +++TP+G++ +D  +   +    +    D    + EHS+E
Sbjct: 14  IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHIKADPSAHQYEHSIE 73

Query: 133 MHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFSVSSDFCHWG 190
           + +P L + F+   IKIVPI V    +E  A  G   A  + +         SSD  H+ 
Sbjct: 74  VQIPML-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGREAIIIASSDMTHYE 131

Query: 191 SRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
           S+ +  H               D + +D I   D     + + +   T+CG  P++  L 
Sbjct: 132 SQSD-AHLK-------------DSLALDAIIKLDAAEMLERIQQNHITMCGYAPVAAMLT 177

Query: 251 MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
            +     K + + + Y+ S       D  V YA
Sbjct: 178 AVKEMGAK-RARVVAYQTSGDVTHHLDQVVGYA 209


>gi|162452116|ref|YP_001614483.1| hypothetical protein sce3843 [Sorangium cellulosum So ce56]
 gi|50403845|gb|AAT76677.1| Dor2 [Sorangium cellulosum So ce56]
 gi|161162698|emb|CAN94003.1| hypothetical protein sce3843 [Sorangium cellulosum So ce56]
          Length = 422

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/300 (24%), Positives = 117/300 (39%), Gaps = 42/300 (14%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR------------GVIAPH---- 47
           IR A+HAG  Y  + ++LA+ ++    +   P+    R            G+ APH    
Sbjct: 111 IRMAAHAGGAYHGDRRKLADFIERKCLQVARPQGGATRAAASQGAPARMVGLCAPHMDLW 170

Query: 48  ---AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
              AGY ++  A   A        +    LLG SH       A+   T + TP+G L  D
Sbjct: 171 RAAAGYGHAYAALEQALAGPGLEQVDTFVLLGTSHAAMRRPYAVCEKT-FATPLGPLEPD 229

Query: 105 LEVIEELKATGKFELM-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-------- 155
            E+I EL A  +F++  D  + + EHS+E    ++  +  G    IVPIL G        
Sbjct: 230 REMIAELAAASRFDVREDQYLHKNEHSIEFQAVFVRHLLGGRAASIVPILCGLSECQARR 289

Query: 156 ---AVNAENEAMYGRLFAKYVDDPSNFFSVS-SDFCHWGSRF-NYMHYDKKHGVIHKSIE 210
              A +   E+    L       P     ++ +D  H G RF +    D++      ++ 
Sbjct: 290 RDPAQDDGAESFLRALRDALAKRPGRVLVIAGADLAHVGPRFGDPAPLDERQ---RTALR 346

Query: 211 ALDKMGMDIIETGDPDAFKKYLLE--YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQ 268
             D   ++   + D   F   +        +CG  PI   L  L   S   + + L YEQ
Sbjct: 347 DRDLASIERATSIDAPGFFVDVARDLASRRVCGLGPIYTLLRALPPSS---RGEMLHYEQ 403


>gi|242079151|ref|XP_002444344.1| hypothetical protein SORBIDRAFT_07g020505 [Sorghum bicolor]
 gi|241940694|gb|EES13839.1| hypothetical protein SORBIDRAFT_07g020505 [Sorghum bicolor]
          Length = 152

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/51 (68%), Positives = 41/51 (80%)

Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
           G + ++    ML +CSTKIKI F+RYEQSSQCK  RDSSVSYASAAAKVD+
Sbjct: 94  GGNGVAATNKMLKHCSTKIKIGFVRYEQSSQCKNFRDSSVSYASAAAKVDS 144


>gi|302037806|ref|YP_003798128.1| hypothetical protein NIDE2493 [Candidatus Nitrospira defluvii]
 gi|300605870|emb|CBK42203.1| conserved protein of unknown function DUF52 [Candidatus Nitrospira
           defluvii]
          Length = 420

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 63/300 (21%), Positives = 126/300 (42%), Gaps = 28/300 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSPE-----VRGVIAPHAGYSYSGRAA 57
           R    AG  Y  ++ +L  +++G+   +E    K  E     ++G++AP     ++G   
Sbjct: 125 RLPQFAGRSYEADAAKLRAQINGFFISKEGPEVKKSEHAGKLIKGLVAPTYELKHAGPIY 184

Query: 58  AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A+A+  +       +++L  + +         T   ++TP+G + +D  V + L+     
Sbjct: 185 AWAYKELQEAQQPDLYVLIGTAYSGLEHPVAMTDKDFETPLGLVNVDRTVTDRLREHIPS 244

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVG---------AVNAENEAMYGR 167
              D      EH++E  LP+L +   +     IVPIL            +  + E     
Sbjct: 245 AFTDELAHHNEHALEFQLPFLQESLGKDRPFTIVPILTSFSADSLRDPQIREQVETFLQA 304

Query: 168 LFAKYVDDPSNF-FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
           L    V    N+   V ++  H G R+            H+ ++ +D   +  +E  DP+
Sbjct: 305 LKDALVATGRNYCVVVGAELAHIGMRYGDSAPPTDFS-FHRCMQ-IDLEMLKHVENRDPE 362

Query: 227 AFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
            F K++ + ++   I G  PI   L ++   + ++    LRY++      + +S+V+YAS
Sbjct: 363 EFAKFIQKENDQRRISGFSPIYSLLRLIQAETGQV----LRYDRG--ITDQYNSTVTYAS 416


>gi|406886416|gb|EKD33450.1| hypothetical protein ACD_76C00027G0005 [uncultured bacterium]
          Length = 592

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/173 (28%), Positives = 80/173 (46%), Gaps = 12/173 (6%)

Query: 72  VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSM 131
           + ++GP+H     +      + ++TP G +  D  V++ L A+G        V   EHS+
Sbjct: 133 IVIVGPNHESIGAEIQTMRGS-FETPFGRVSTDNSVVDALVASGAAS-ASPDVFRTEHSI 190

Query: 132 EMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD-PSNFFSVSS-DFCHW 189
             H+PY+AK+F     KIVP+L  +     E    R+ +      PSN F VSS DF H 
Sbjct: 191 AFHVPYIAKLFPS--AKIVPVLYHSSVPSGEV--ARVLSNMRSALPSNSFFVSSIDFSHG 246

Query: 190 GSRFNYMHYDKKHGVI--HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
               +    D+K   I  ++S + LD +  + I++  P +   YL   D   C
Sbjct: 247 LDSAHSNFNDRKTWNIMQNRSFDELDALQPEFIDS--PPSASAYLQAIDAQKC 297


>gi|392413871|ref|YP_006450478.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
 gi|390627007|gb|AFM28214.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
          Length = 277

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 114/261 (43%), Gaps = 24/261 (9%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
           M   +RA  +G WY +++++   E++ +L     P     +G IAPHAG+ +SG+AAA  
Sbjct: 1   MSTRKRALPSG-WYPESAEQCRREIEEFLTGFKAPAG-SWKGGIAPHAGWYFSGKAAAKV 58

Query: 61  FGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
              +  ++   R+ L G   H  +    +     ++TP G  PLD  + +   ++G    
Sbjct: 59  MKTLAESAHPDRIVLFG--GHLPSGSPIIYADDEWETPFGMQPLDSHLAKTAVSSGNAVF 116

Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP--S 177
                + A++++E+ LP +   F      I  I V +   +  A+        +++   S
Sbjct: 117 AG--RNFADNTVEIQLPMIRMFFP----HIPVIAVHSPATKQAAILAESIHNLLNEKGLS 170

Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD-----PDAFKKYL 232
             +  S+D  H+G  + +         +    E  D+  +D   TGD      DAF+K+ 
Sbjct: 171 AIYIGSADLTHYGPNYGFSPKGTGASAVTWVREENDRSLIDKALTGDVQGVIDDAFQKH- 229

Query: 233 LEYDNTICGRHPISVFLHMLG 253
               NT C   PI+  +  +G
Sbjct: 230 ----NT-CSAGPIASVMASVG 245


>gi|384431098|ref|YP_005640458.1| hypothetical protein [Thermus thermophilus SG0.5JP17-16]
 gi|333966566|gb|AEG33331.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Thermus thermophilus SG0.5JP17-16]
          Length = 370

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 91/223 (40%), Gaps = 12/223 (5%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
           E R ++ PH   S        A   ++ T    R++L+G +H     + A +    ++TP
Sbjct: 136 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKERAA-ALPVPFQTP 194

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
            G    DL  ++ L A   FEL +  +    EHS+E+ L +L   F     +++P+LV  
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPKA--RVLPLLVAR 252

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
            +AE     G      + D      ++ D  H G RF      +    + +     D   
Sbjct: 253 RSAE----LGEALKVVLRDHPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
           ++ +  G+P+A   +L      I G   ++  L +L     K+
Sbjct: 306 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 348


>gi|313679973|ref|YP_004057712.1| hypothetical protein [Oceanithermus profundus DSM 14977]
 gi|313152688|gb|ADR36539.1| protein of unknown function DUF52 [Oceanithermus profundus DSM
           14977]
          Length = 384

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 80/173 (46%), Gaps = 9/173 (5%)

Query: 70  SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVD-EAE 128
           +R  +LG +H   + + A +  T  +TP+G LP+DLE ++ L A   FEL +  +    E
Sbjct: 169 ARAVVLGVAHRGLS-EVAAAWGTPLETPLGPLPVDLEALQALDALLPFELFNTPLAFREE 227

Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF---FSVSSD 185
           HS+E    +L   +     +I+P++V   + E +A    L          +      S D
Sbjct: 228 HSIEFPAVFLKAAWSDDAPRILPLIVSG-DPERKAELDELAQALALLAERYPLWPLASVD 286

Query: 186 FCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
             H G+RF +   D++     +++   D+  ++++  G  +A    L+   N+
Sbjct: 287 LSHVGARFGHPPLDRERVTQARTV---DRRYLELVAAGSFEAAWASLIPAGNS 336


>gi|320104323|ref|YP_004179914.1| hypothetical protein Isop_2799 [Isosphaera pallida ATCC 43644]
 gi|319751605|gb|ADV63365.1| protein of unknown function DUF52 [Isosphaera pallida ATCC 43644]
          Length = 318

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 58/230 (25%), Positives = 95/230 (41%), Gaps = 24/230 (10%)

Query: 10  AGSWYTDNSKRLAEELDGWLREA--GLPK-SPEVRGVIAPHAGYSYSGRAAA---YAFGN 63
           AG  Y ++ ++LA +LD W  +A   LP  S   R  + PH  ++  G   A    A  +
Sbjct: 15  AGLSYPNDPRQLAAQLDTWFNQALDNLPALSTTWRAALCPHIDFARGGAVYATVHRALNH 74

Query: 64  IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL-MDI 122
           ++  S S   + G SH  +      +T   + TP+G +  D   ++ L+     EL +D 
Sbjct: 75  LNQPS-STYLIYGVSHRVWCRHRFAATRRDFATPLGLVRTDQRFLDHLERVFGPELFVDA 133

Query: 123 CVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGA--------VNAENEAMYGRLFAKYV 173
              + E S+E    +L  +  G     IVPILVG+        V+   + +  R+    V
Sbjct: 134 DAHDPEWSIEFQAVWLQHLLGGRRDFTIVPILVGSFHDLMVRGVDPLEDPLVQRMVEAIV 193

Query: 174 D-----DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
                 D   F   S DF H G  F      + H +  + +   D+  +D
Sbjct: 194 RAERDHDRPVFHLASVDFSHIGPEFGDPEPLRDHDL--EQLRQFDRAMLD 241


>gi|410463514|ref|ZP_11317025.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
           MBC34]
 gi|409983368|gb|EKO39746.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
           MBC34]
          Length = 408

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 84/209 (40%), Gaps = 19/209 (9%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPH----AGYSYSGRAAA 58
           R    AG  Y D+   LA  LD  L EA    +    V+ VIAPH    AG +    A A
Sbjct: 113 RETVFAGQAYPDDGAELAAYLDAVLAEAPEACAAPGPVKAVIAPHIDPEAGRAAYAAAYA 172

Query: 59  YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
                I   +  RV +LG  H        L T   + TP+GD+  D E    L+  G   
Sbjct: 173 ALGAAIKAAAPKRVIVLGVGHQIIDGLYCL-TDKAFATPLGDVAADAEASARLRTAGGQA 231

Query: 119 LMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPIL----VGAVNAENEAMYGRLFAKYV 173
           +    +  +AEHS+E    +L ++       +VPIL    VG V+A     +      ++
Sbjct: 232 VDASSLPHKAEHSVEFQAVFLRRLLGDAPFAMVPILCGSPVGVVDAPTRRAFRAYAGPFL 291

Query: 174 D-------DPSNFFSVSSDFCHWGSRFNY 195
           D        P        DFCH G +F +
Sbjct: 292 DALAEIAAAPDTLIVAGVDFCHIGGKFGH 320


>gi|410696913|gb|AFV75981.1| putative dioxygenase [Thermus oshimai JL-2]
          Length = 367

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 70/159 (44%), Gaps = 9/159 (5%)

Query: 43  VIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           ++ PH   S    A   A  +++ T    R++LLG +H     K A +    + TP G  
Sbjct: 137 LLMPHLEPSRVPEAYGAALASLEKTPPPERIYLLGVAHRPLKEKAA-ALPVPFATPFGPA 195

Query: 102 PLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
             DL+ ++ L A   FEL +  +    EHS+E+ L +L  V+     K++P+L+G  + E
Sbjct: 196 EPDLKALQALDALLPFELFNTPLAFREEHSLELPLFFLKGVYPEA--KVLPLLIGERSPE 253

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD 199
                G      + +      ++ D  H G RF     D
Sbjct: 254 ----LGEALRVLMGEFPGLLVLAVDLSHVGPRFGDRPLD 288


>gi|386360570|ref|YP_006058815.1| dioxygenase [Thermus thermophilus JL-18]
 gi|383509597|gb|AFH39029.1| putative dioxygenase [Thermus thermophilus JL-18]
          Length = 370

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
           E R ++ PH   S        A   ++ T    R++L+G +H     + A +    ++TP
Sbjct: 136 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKERAA-ALPVPFQTP 194

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
            G    DL  ++ L A   FEL +  +    EHS+E+ L +L   F     +++P+LV  
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 252

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
            + E     G      + D      ++ D  H G RF      +    + +     D   
Sbjct: 253 RSPE----LGEALKVVLRDHPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
           ++ I  G+P+A   +L      I G   ++  L +L     K+
Sbjct: 306 LERIAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRGRKGKV 348


>gi|55980893|ref|YP_144190.1| hypothetical protein TTHA0924 [Thermus thermophilus HB8]
 gi|7388540|sp|Q56419.2|Y924_THET8 RecName: Full=MEMO1 family protein TTHA0924
 gi|295400|gb|AAA27496.1| The first start codon in ORFD is located at position 2046 [Thermus
           thermophilus HB8]
 gi|55772306|dbj|BAD70747.1| conserved hypothetical protein [Thermus thermophilus HB8]
          Length = 326

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
           E R ++ PH   S        A   ++ T    R++L+G +H     K A +    ++TP
Sbjct: 92  EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKEKAA-ALPVPFQTP 150

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
            G    DL  ++ L A   FEL +  +    EHS+E+ L +L   F     +++P+LV  
Sbjct: 151 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 208

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
            + E     G      + D      ++ D  H G RF      +    + +     D   
Sbjct: 209 RSPE----LGEALKVVLRDFPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 261

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
           ++ +  G+P+A   +L      I G   ++  L +L     K+
Sbjct: 262 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 304


>gi|281495|pir||S27738 hypothetical protein D - Thermus aquaticus
          Length = 339

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
           E R ++ PH   S        A   ++ T    R++L+G +H     K A +    ++TP
Sbjct: 105 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKEKAA-ALPVPFQTP 163

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
            G    DL  ++ L A   FEL +  +    EHS+E+ L +L   F     +++P+LV  
Sbjct: 164 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 221

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
            + E     G      + D      ++ D  H G RF      +    + +     D   
Sbjct: 222 RSPE----LGEALKVVLRDFPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 274

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
           ++ +  G+P+A   +L      I G   ++  L +L     K+
Sbjct: 275 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 317


>gi|46198863|ref|YP_004530.1| hypothetical protein TTC0555 [Thermus thermophilus HB27]
 gi|46196486|gb|AAS80903.1| hypothetical conserved protein [Thermus thermophilus HB27]
          Length = 370

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
           E R ++ PH   S        A   ++ T    R++L+G +H     K A +    ++TP
Sbjct: 136 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKEKAA-ALPVPFQTP 194

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
            G    DL  ++ L A   FEL +  +    EHS+E+ L +L   F     +++P+LV  
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 252

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
            + E     G      + D      ++ D  H G RF      +    + +     D   
Sbjct: 253 RSPE----LGEALKVVLRDFPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
           ++ +  G+P+A   +L      I G   ++  L +L     K+
Sbjct: 306 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 348


>gi|381189704|ref|ZP_09897229.1| hypothetical protein RLTM_01015 [Thermus sp. RL]
 gi|380452281|gb|EIA39880.1| hypothetical protein RLTM_01015 [Thermus sp. RL]
          Length = 370

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/223 (22%), Positives = 89/223 (39%), Gaps = 12/223 (5%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSIS-RVFLLGPSHHYYTPKCALSTATVYKTP 97
           E R ++ PH   S        A   ++ T    R++L+G +H       A +    ++TP
Sbjct: 136 EARXLLMPHLEPSRVPEVYGAALAALEKTPPPDRIYLVGVAHRPLKDXAA-ALPVPFQTP 194

Query: 98  IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
            G    DL  ++ L A   FEL +  +    EHS+E+ L +L   F     +++P+LV  
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRF--PXARVLPLLVAR 252

Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
            +AE     G      + D      ++ D  H G RF      +    + +     D   
Sbjct: 253 RSAE----LGEALKVVLRDXPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305

Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
           ++ +  G+P+A   +L        G   ++  L +L     K+
Sbjct: 306 LERLAEGEPEAALAFLGANPTRXDGVEVVASLLPLLRERKGKV 348


>gi|333999164|ref|YP_004531776.1| hypothetical protein TREPR_1172 [Treponema primitia ZAS-2]
 gi|333739456|gb|AEF84946.1| conserved hypothetical protein [Treponema primitia ZAS-2]
          Length = 285

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 75/166 (45%), Gaps = 14/166 (8%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA- 60
           EK R    AG +Y +    L  +L     E+G+    +   +IAPH  ++ SG  AA A 
Sbjct: 10  EKGRHPVVAGLFYPEEKIDLESQLCSLGLESGV--GGKAAAIIAPHGAWNLSGAPAAAAF 67

Query: 61  -------FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
                  +       ISRV +LG  H+        S +  ++TP+G + +D EV E L +
Sbjct: 68  SAAAGRAWQRNGGEGISRVVILGTIHNKEDQGIFFSDSQFFETPLGRMWIDSEVSETLAS 127

Query: 114 TGK-FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN 158
               FE  DI     E S+E+ LP +   F G    IVP+L+G   
Sbjct: 128 CSTLFETNDI-PHLKETSVEVLLPLVQFCFPGA--AIVPVLMGGAQ 170


>gi|302338446|ref|YP_003803652.1| hypothetical protein Spirs_1932 [Spirochaeta smaragdinae DSM 11293]
 gi|301635631|gb|ADK81058.1| protein of unknown function DUF52 [Spirochaeta smaragdinae DSM
           11293]
          Length = 283

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 52/227 (22%), Positives = 103/227 (45%), Gaps = 20/227 (8%)

Query: 2   EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
           +K+R+      WY D  + + + +  W          +    I PHAG+++SG  A    
Sbjct: 4   KKVRKRGLPVGWYPDVEREVLKRITDWEATDTESHLDKAFASIIPHAGWTFSGALAWKGI 63

Query: 62  GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE----VIEELKATGKF 117
             ++  + S + + G  H +      ++    + TP+G +P D+     ++EEL  +GK 
Sbjct: 64  RLLNEDAESVIVVGG--HLFAGSGILMALEDAFDTPLGLIPADIGFRDLLLEEL--SGKV 119

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP- 176
            + +    E ++S+E+HLP +   F     K++ + VG  + +   + G   A+   D  
Sbjct: 120 SVREDT--ETDNSVEIHLPLIKYHFPK--AKLIWLRVG--SGDESLVLGECAARLAMDEG 173

Query: 177 -SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
            +     S+D  H+G  +++M      G  ++ +EA D    ++IET
Sbjct: 174 GATVMIGSTDLTHYGPDYHFMPAGTG-GEAYRWVEANDA---EMIET 216


>gi|389580823|ref|ZP_10170850.1| putative dioxygenase [Desulfobacter postgatei 2ac9]
 gi|389402458|gb|EIM64680.1| putative dioxygenase [Desulfobacter postgatei 2ac9]
          Length = 294

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/207 (24%), Positives = 87/207 (42%), Gaps = 22/207 (10%)

Query: 4   IRRASHAGSWY---TDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAA 58
           +++ + AGSWY    D  +   E  +  LR     K+    V G++ PHAG+ YSG+ A 
Sbjct: 3   LKKMAFAGSWYPGSADQCRSALERFNDDLRAEADTKTLNNPVAGIV-PHAGWIYSGKLAC 61

Query: 59  YAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL---KA 113
             F  +     ++  + L G   H  +P   L   T   TP+G + +D      L    A
Sbjct: 62  RVFAALAHGRRAVETIVLFGVHMHADSPSFVLDC-TAVDTPLGAIEIDKAFTGRLVKQAA 120

Query: 114 TGKFELMDICVDE--AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM-YGRLFA 170
               +L  +       E+++E+  P++   F         I+V AV   + A   G +  
Sbjct: 121 AANVDLKQLTPARFPEENTLELQYPFIKYFFPK-----ARIVVCAVPPSDAAQALGIIVV 175

Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNY 195
           +   +     +V  S+D  H+G RF +
Sbjct: 176 EVAQELDRSVAVVGSTDMTHYGPRFGF 202


>gi|395731960|ref|XP_003775991.1| PREDICTED: protein MEMO1-like, partial [Pongo abelii]
          Length = 100

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 32/48 (66%), Gaps = 1/48 (2%)

Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPT 67
          +L  +L+GWL +    K P  R +IAPHAGY+Y G  AA+A+  +DP+
Sbjct: 52 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPS 98


>gi|384439816|ref|YP_005654540.1| hypothetical protein [Thermus sp. CCB_US3_UF1]
 gi|359290949|gb|AEV16466.1| hypothetical protein TCCBUS3UF1_14250 [Thermus sp. CCB_US3_UF1]
          Length = 370

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 9/154 (5%)

Query: 43  VIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
           ++ PH   S    A   A   ++ T    RV+L+G +H     + A +    ++TP G  
Sbjct: 140 LLLPHLEPSRVPEAYGAALAALEKTPPPERVYLVGVAHRPLKERAA-ALPVPFQTPFGPA 198

Query: 102 PLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
             DLE ++ L A   FEL +  +    EHS+E+ L +L   +     + +P+LVG  + E
Sbjct: 199 LPDLEALQALDALLPFELFNTPLAFREEHSLELPLLFLKGAYPEA--RFLPLLVGRRSPE 256

Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFN 194
                G      + D      ++ D  H G RF 
Sbjct: 257 ----LGEALKVVLRDYPGLLVLAVDLSHVGPRFG 286


>gi|114319687|ref|YP_741370.1| dioxygenase-like protein [Alkalilimnicola ehrlichii MLHE-1]
 gi|114226081|gb|ABI55880.1| dioxygenase-like protein [Alkalilimnicola ehrlichii MLHE-1]
          Length = 225

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 83/185 (44%), Gaps = 9/185 (4%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
           M ++RR   AG +Y  + + L + LD  LR    L   P  R ++ P  G + +G A A 
Sbjct: 1   MSEVRRPVAAGLFYPGHPELLRQSLDTLLRAHPTLGTRP--RALVLPPGGQARAGAAIAA 58

Query: 60  AFGNIDPTSI--SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
           A   +   +   +RV+LL  + H      A S    + TP+G L LD   IE L+     
Sbjct: 59  ACSLLQAAAAPPNRVYLLATTPHRTAEGPAFSGKRQFATPLGRLTLDAAGIERLQDDAGG 118

Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
            L D      EH +E  LPYL +V      ++VP+L+      + A  GR+    ++D +
Sbjct: 119 ALDDRA-HALEHRLEAPLPYLQRVLPP--FQLVPVLLPEAGTTSAAC-GRILQLALEDRA 174

Query: 178 NFFSV 182
               V
Sbjct: 175 GLLVV 179


>gi|224370218|ref|YP_002604382.1| putative dioxygenase [Desulfobacterium autotrophicum HRM2]
 gi|223692935|gb|ACN16218.1| putative dioxygenase [Desulfobacterium autotrophicum HRM2]
          Length = 324

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 95/247 (38%), Gaps = 18/247 (7%)

Query: 5   RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           + A+  G+WY  + +    E+  +L E     + +  G I PHAG+ +SG  A     ++
Sbjct: 34  KDAAFKGAWYPADGQACEREIVRFLNERTGTLTGDYVGGIVPHAGWVFSGSIACRVIASL 93

Query: 65  DPTS-------ISRVFLLGPSHHYYTPKCALSTA-TVYKTPIGDLPLDLEVIEEL-KATG 115
             T        +  V L G  HH +     L      ++TP GDL +   V   L KA G
Sbjct: 94  AGTGSGGAHGKVDLVLLFG--HHMHPLDAPLVMGRGAWETPFGDLVVHQGVAGALAKAVG 151

Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
              L      + E+++E+ LP++   F      IVP+  G   + +    G L  +    
Sbjct: 152 ATTLGPDAFPD-ENTIELQLPFVKYFFPD--AAIVPM--GVPPSAHAERIGILAVEAALK 206

Query: 176 PSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
                 V  S+D  H+G  + +    K    +       D   +  +   D D      L
Sbjct: 207 SGLSLRVIGSTDMTHYGPNYGFSPAGKGQAALDWVKNTNDSRAIKAMVEMDVDTILAQGL 266

Query: 234 EYDNTIC 240
           E+ N  C
Sbjct: 267 EHHNLCC 273


>gi|220918543|ref|YP_002493847.1| hypothetical protein A2cp1_3451 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219956397|gb|ACL66781.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 299

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 44/160 (27%), Positives = 72/160 (45%), Gaps = 7/160 (4%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGWLREAGLP---KSPEVRGVIAPHAGYSYSGRAAAYA 60
            R  + AG+ Y D    L   LDGWL          +P    V+APH  Y+      A+A
Sbjct: 17  FRPPACAGAVYPDAPGALRRALDGWLALPAGAPAASTPPPGVVVAPHIDYARGAAGYAHA 76

Query: 61  FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT-GKFEL 119
           +  ++  S + +F++  + H   P+    T   Y TP+G +  D  +++ L  T G+  L
Sbjct: 77  YRALE-ASRADLFVIFGTAHATPPRPFTLTRLDYGTPLGPVRTDRALVDALCTTLGEDAL 135

Query: 120 M-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN 158
           + D  V   EHS+E+    LA         ++P+L  A+ 
Sbjct: 136 LGDELVHRDEHSVELQAVILAHRLR-RPFTVLPVLCSAIG 174


>gi|189218196|ref|YP_001938838.1| dioxygenase [Methylacidiphilum infernorum V4]
 gi|189185054|gb|ACD82239.1| Predicted dioxygenase [Methylacidiphilum infernorum V4]
          Length = 320

 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 60/265 (22%), Positives = 109/265 (41%), Gaps = 23/265 (8%)

Query: 39  EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI 98
           E   ++APH  +  S +A  YAF      + +  F+L    H+   + ++     Y TP+
Sbjct: 58  ETLAILAPHIDFQVSPKAYTYAFSPWFSMAEADFFILLGVGHHSRIEWSIDPRD-YITPL 116

Query: 99  GDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK----IVPILV 154
           G      +++EE+     F L D    + EHS+E  + ++  +     I+     +P+L 
Sbjct: 117 GRAYNKRDLVEEINRKVDFCLTDPLAHQKEHSIEFPIVFMQALRYWMGIQKPLCFLPVLC 176

Query: 155 GAV-------NAENEAMYGRLFAKYVDDPSNFFS------VSSDFCHWGSRFNYMHYDKK 201
           G +       N   E +  +  A  + +  +F+       +S D CH G RF   H  K 
Sbjct: 177 GGLHELIIYDNGREELLRMKKLASILREILSFYKKKVGLVLSIDGCHIGPRFG--HPFKV 234

Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
              + K+ +  +K     +E  + + F  +L +  N        ++ L M      K+ +
Sbjct: 235 TEEMLKNTDLWEKELWRHVEEQNFEGFISHLQKEKNIRFFDGVGAIALTMEIFKDQKVSL 294

Query: 262 KFLRYEQSSQCKTKRDSSVSYASAA 286
           +   YEQ  +    RDSSV   S+ 
Sbjct: 295 RRTYYEQWFE---GRDSSVVTFSSG 316


>gi|415987739|ref|ZP_11559786.1| hypothetical protein GGI1_14396, partial [Acidithiobacillus sp.
           GGI-221]
 gi|339834966|gb|EGQ62687.1| hypothetical protein GGI1_14396 [Acidithiobacillus sp. GGI-221]
          Length = 209

 Score = 45.8 bits (107), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 43/200 (21%), Positives = 79/200 (39%), Gaps = 28/200 (14%)

Query: 87  ALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL 146
           AL      +TP+G + +D   +E L    +   M       EH++E+ LP++ +V     
Sbjct: 27  ALPGVQAMQTPLGTVAVDQAGVEALAGLPEVREMP-AAHAQEHALEVQLPFIQEVLGD-- 83

Query: 147 IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH 206
           + +VP++VG    +  A   R+  K          +SSD  H      Y  Y +   + H
Sbjct: 84  VSVVPLVVGDARPDEVA---RVLEKLWGGEETVIVISSDLSH------YHPYAEARAIDH 134

Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRY 266
            ++E + +             F    ++++   CG  PI+  L +       +  + +  
Sbjct: 135 HTVEEILR-------------FDPTPIDHEQA-CGATPINGLLPVARK--HHLHPRLVGL 178

Query: 267 EQSSQCKTKRDSSVSYASAA 286
             S      RD+ V YA+ A
Sbjct: 179 CNSGDTAGSRDAVVGYAAVA 198


>gi|218296601|ref|ZP_03497319.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
 gi|218242914|gb|EED09447.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
          Length = 370

 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 11/171 (6%)

Query: 70  SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVD-EAE 128
            R++L+G +H     K A +    + TP G +  DL  ++ L A   +EL +  +    E
Sbjct: 168 ERIYLVGVAHRPLEAKAA-ALPVPFHTPFGPMEPDLPALQALDALLPYELFNTPLAFREE 226

Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCH 188
           HS+E+ L +L   +     ++VP+LVG    E     G      + D      ++ D  H
Sbjct: 227 HSLELPLFFLKGRYPK--ARLVPLLVGRRGPE----LGEALRVVLKDHPGLVVLAVDLSH 280

Query: 189 WGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
            G RF      +    + +     D   ++ +  G+P+A   +L E    I
Sbjct: 281 VGPRFGDGPLSRP---LAEEARRRDLGFLERLAQGEPEAALAFLGENPTRI 328


>gi|328953314|ref|YP_004370648.1| hypothetical protein Desac_1616 [Desulfobacca acetoxidans DSM
           11109]
 gi|328453638|gb|AEB09467.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
           [Desulfobacca acetoxidans DSM 11109]
          Length = 281

 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 43/201 (21%), Positives = 84/201 (41%), Gaps = 17/201 (8%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAY 59
           M K+R  +    WY ++ +     L  ++     L    ++ G + PHAG+ +SGR AA 
Sbjct: 1   MMKVRPRTLPIGWYPESGRECQNRLTAFIAGVKPLAAGAKIYGGVMPHAGWYFSGRLAAQ 60

Query: 60  AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
            F     +   +V  +   H        L     ++TP+G++ L  E+ + L+       
Sbjct: 61  VFYLAAQSGQPQVVCIFGGHLGSGSPPLLVMEDAWETPLGEITLAKEMYQPLQEK----- 115

Query: 120 MDICVDE--AEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD- 175
              C  E   ++++E+ LP +   F +  L+ +      A +++   + G+      +D 
Sbjct: 116 -ITCAREYPGDNTIEIQLPMVKHFFPQAKLLALR-----APHSQQAVLLGQAVVAVANDL 169

Query: 176 -PSNFFSVSSDFCHWGSRFNY 195
             S     SSD  H+G  + +
Sbjct: 170 NLSLLAFGSSDLTHYGPNYGW 190


>gi|333997968|ref|YP_004530580.1| hypothetical protein TREPR_2499 [Treponema primitia ZAS-2]
 gi|333739632|gb|AEF85122.1| protein of unknown function DUF52 [Treponema primitia ZAS-2]
          Length = 303

 Score = 44.3 bits (103), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 65/274 (23%), Positives = 114/274 (41%), Gaps = 20/274 (7%)

Query: 4   IRRASHAGSWYTDNSKRLAEELDGW-LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
           +R+      WY  ++  +A+ L G  L     P +P     +APHAG+ YSG  AA A  
Sbjct: 3   LRKQRLQAGWYPFSASGVADFLGGISLDGEHYPSAP---AAVAPHAGWFYSGAIAARAVL 59

Query: 63  NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
           ++D  + +   + G       P  A   A   +TP+G+LP+D E    L        +D 
Sbjct: 60  SLDRDAETVAIIGGHLPAGMPPLIAQEDA--LETPLGELPVDREFALALG-----RALDC 112

Query: 123 CVDE-AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
             D   ++++E+ LP +   F     + + + + A  A  EA  GR  A+         +
Sbjct: 113 RGDRYTDNTVEVLLPMVHYFFP--RAQALWLRLPAEAASLEA--GREIARIAKKIGRRVA 168

Query: 182 V--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
           V  S+D  H+G  + +    K    +       D+  ++ + +GDP A  +   E + + 
Sbjct: 169 VLGSTDLTHYGENYEFSPRGKGEQALAWVRGVNDRRFIEAVLSGDPRAVLER-AEEERSA 227

Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
           C    +   +   G  S   K + L Y  S+  +
Sbjct: 228 CSAGAVLGAIGFAGE-SGAPKGELLAYGTSADAR 260


>gi|384914669|ref|ZP_10015421.1| Predicted dioxygenase [Methylacidiphilum fumariolicum SolV]
 gi|384527286|emb|CCG91289.1| Predicted dioxygenase [Methylacidiphilum fumariolicum SolV]
          Length = 314

 Score = 43.1 bits (100), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 118/281 (41%), Gaps = 31/281 (11%)

Query: 27  GWLREAGLPKSPEVR-GVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFL-LGPSHHYYTP 84
            W ++  + K  + R  V++PH  +  S +A  +AF +    + +  F+ LG  HH    
Sbjct: 39  AWPKKNTIGKEKKERLAVLSPHIDFQVSPKAYTHAFSHWFSRAEADFFIILGVGHHSRLE 98

Query: 85  KCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA--KVF 142
               S    Y TP+G     +E++E ++ +  F L D    + EHS+E  + ++   + +
Sbjct: 99  WSIDSRD--YITPLGRAYNRIEIVERIERSVNFSLADPDGHQREHSIEFPIVFMQALRYW 156

Query: 143 EG--HLIKIVPILVGAV------NAENEAM-YGRLFAKYVDDPSNFFS------VSSDFC 187
            G    ++ VPIL G +      N   EA+   R+ A  + +    +       +S D C
Sbjct: 157 MGIEKPLEFVPILCGGLHDLIIYNNGKEALSMMRMLASALREVLTAYGEKAALIISIDGC 216

Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT--ICGRHPI 245
           H G RF   H       + K  E  +K     +E  + + F  +L +  N     G   I
Sbjct: 217 HIGPRFG--HPLDLTKALLKDTELWEKELWKNVEEQNLEGFLAHLQKEKNIRFFDGVGAI 274

Query: 246 SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS-VSYASA 285
           ++ + +      K K  F +     Q    RDSS V+++S 
Sbjct: 275 ALLMEIF-----KDKPFFFKRTYYEQWFEPRDSSAVTFSSG 310


>gi|312134017|ref|YP_004001356.1| protein [Bifidobacterium longum subsp. longum BBMN68]
 gi|311773323|gb|ADQ02811.1| Hypothetical protein BBMN68_1758 [Bifidobacterium longum subsp.
           longum BBMN68]
          Length = 462

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 43/183 (23%), Positives = 70/183 (38%), Gaps = 42/183 (22%)

Query: 87  ALSTATVYKTPIGDLPLD---------LEVIEELK--------ATGKFELMDICVDEAEH 129
           A STA  ++TP+G +P+D         L V E L+        A     +++      EH
Sbjct: 7   ACSTAAAFETPLGTVPVDIAAERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEH 66

Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
           ++E+ +P+L  V  G  + IVP+  G    +     G +       P     +SSD  H+
Sbjct: 67  AVEVQIPFLQTVL-GPDLTIVPLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHY 122

Query: 190 GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL 249
                           H+   ALD   +        D    +L  +    CG +PI+  L
Sbjct: 123 HP--------------HEVARALDDQTIA-------DIAALHLPIHPRRACGAYPINGLL 161

Query: 250 HML 252
            +L
Sbjct: 162 DVL 164


>gi|88802626|ref|ZP_01118153.1| hypothetical protein PI23P_08550 [Polaribacter irgensii 23-P]
 gi|88781484|gb|EAR12662.1| hypothetical protein PI23P_08550 [Polaribacter irgensii 23-P]
          Length = 173

 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 60/123 (48%), Gaps = 12/123 (9%)

Query: 35  PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH----HYYTPKCALST 90
           P S   + VI+PH  Y Y+G+    +   I+  +   V L+G +H    +    K    +
Sbjct: 56  PNSKTWKTVISPHDDYKYAGKLYHTSLTGINAKT---VILIGVAHKARNYKLQDKIIFGS 112

Query: 91  ATVYKTPIGDLPL-DLEV-IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK 148
            T +KTP G L + DL   I+E  +   F + D  +   EHS+E  +P+L K  +  ++K
Sbjct: 113 FTHWKTPYGKLKVSDLNTKIQEKLSKESFVVHD-GMQTLEHSLEAIVPFLQK--KTAMLK 169

Query: 149 IVP 151
           + P
Sbjct: 170 LYP 172


>gi|21674191|ref|NP_662256.1| hypothetical protein CT1369 [Chlorobium tepidum TLS]
 gi|21647354|gb|AAM72598.1| conserved hypothetical protein [Chlorobium tepidum TLS]
          Length = 369

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 63/254 (24%), Positives = 103/254 (40%), Gaps = 54/254 (21%)

Query: 35  PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH----HYYTPKCALST 90
           P     R  I PH  Y+Y+G        NI   +   V + G +H    ++   +    +
Sbjct: 73  PAGTAWRAAICPHDDYTYAGWLYPAVLQNIKAKT---VIIFGVAHTAWRYHLENQLIFDS 129

Query: 91  ATVYKTPIGDL---PLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLI 147
            + ++ P G++   PL  E++E L   G   + D    E E+S+E  +P+L   ++   +
Sbjct: 130 FSSWRGPYGNVKVSPLRDEILERLP-RGMAIVHDPMQSE-EYSVEALIPFLQ--YQNRNV 185

Query: 148 KIVPILVGAVNAENEAMYGRLFAK---YVDDPSNF-------FSVSSDFCHWGSR----F 193
           +I+ ILV  ++ E   +  + FAK    V   +N          +SSD  H+G       
Sbjct: 186 EIISILVPFMDFERMQIVSQHFAKALFAVMKKNNLRWGKDVALLISSDAVHYGDEDWDGR 245

Query: 194 NYMHYDKKHGVIHKSIEALDKMGMDII------------------ETGDPDAFKKYLLEY 235
           N+ +Y    G  +   EA D+   +II                   T DP+ FKK    +
Sbjct: 246 NFAYYGTG-GKANALAEAHDR---EIISHSFESELTEKNIARFYASTTDPNDFKKSRWSW 301

Query: 236 DNTICGRHPISVFL 249
               CGR  I V L
Sbjct: 302 ----CGRFAIPVGL 311


>gi|406968882|gb|EKD93650.1| hypothetical protein ACD_28C00105G0005 [uncultured bacterium]
          Length = 285

 Score = 41.6 bits (96), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 71/173 (41%), Gaps = 18/173 (10%)

Query: 27  GWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYY-- 82
           G L  AG P   S  V G I PH         A Y         ++R+ L+ P+H  Y  
Sbjct: 17  GLLVSAGKPPTISERVAGAILPHHLLVEGYLEAFYQHLREQIPGVTRIILISPNHFNYGY 76

Query: 83  -----TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPY 137
                T    L++     TP    PLD+ VI  +  +    + +    + EH + +HLP+
Sbjct: 77  FKIRTTDGVFLNSELFPSTP----PLDMAVIRAVTDSSPLSIENTDFTK-EHGVMVHLPF 131

Query: 138 LAKVFEGHLIKIVPILVGAVNAENEA-MYGRLFAKYVDDPSNFFSVSSDFCHW 189
           +A+ F      +VPI++    +E +  M       Y DD +     S DF H+
Sbjct: 132 IAEHFPE--ASVVPIILKEGTSEAQLDMLASALIPYRDDHTLVLG-SIDFTHY 181


>gi|406913932|gb|EKD53215.1| hypothetical protein ACD_61C00108G0001, partial [uncultured
           bacterium]
          Length = 479

 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 45/191 (23%), Positives = 78/191 (40%), Gaps = 24/191 (12%)

Query: 16  DNSKRLAEELDGWLREAGLPKSPEVRGVIAPH--AGYSYSGRAAAYAFGNIDPTSISRVF 73
           DN   L    +  +R +G P     R +I PH     S   R  +   G+   T    V 
Sbjct: 42  DNRSFLPAVAENRIRPSGSPP----RVMILPHHLMASSIIARGISLLTGSPPKT----VI 93

Query: 74  LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDE----AEH 129
           +L P+H        +S+   + TP G + +D E+++     G      +C+D+    +EH
Sbjct: 94  ILSPNHANIGQCDIISSKNKWDTPFGRITIDEELLDLFLKDGS-----VCLDDQAMNSEH 148

Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS-DFCH 188
            +   LP++         KIVP+++      +  +      K +  P +  +++S DF H
Sbjct: 149 GIAGLLPFIKYYLPN--TKIVPLILR--KEIDPVLLTNFTQKLISSPEDVITIASIDFSH 204

Query: 189 WGSRFNYMHYD 199
             SR   M  D
Sbjct: 205 GLSRAKAMEAD 215


>gi|310287811|ref|YP_003939069.1| hypothetical protein BBIF_1290 [Bifidobacterium bifidum S17]
 gi|313140612|ref|ZP_07802805.1| predicted protein [Bifidobacterium bifidum NCIMB 41171]
 gi|421734885|ref|ZP_16173933.1| hypothetical protein B216_08181 [Bifidobacterium bifidum LMG
          13195]
 gi|421735802|ref|ZP_16174687.1| hypothetical protein B217_02108 [Bifidobacterium bifidum IPLA
          20015]
 gi|309251747|gb|ADO53495.1| conserved hypothetical protein [Bifidobacterium bifidum S17]
 gi|313133122|gb|EFR50739.1| predicted protein [Bifidobacterium bifidum NCIMB 41171]
 gi|407077199|gb|EKE50057.1| hypothetical protein B216_08181 [Bifidobacterium bifidum LMG
          13195]
 gi|407296907|gb|EKF16404.1| hypothetical protein B217_02108 [Bifidobacterium bifidum IPLA
          20015]
          Length = 96

 Score = 40.4 bits (93), Expect = 1.00,   Method: Composition-based stats.
 Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 4  IRRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPE--VRGVIAPHAGYSYSGR 55
          +R A+ AG++Y  N+  L   +D  +  A        P+ P+   + VI PHAGY YSG 
Sbjct: 15 VRPAAVAGAFYPANAGELQAMIDRQMNHARRLVSSLAPRLPQGAPKAVIVPHAGYVYSGS 74

Query: 56 AAAYAFGNIDPT 67
           AA A+  ++ T
Sbjct: 75 VAALAYALLERT 86


>gi|320450023|ref|YP_004202119.1| hypothetical protein TSC_c09450 [Thermus scotoductus SA-01]
 gi|320150192|gb|ADW21570.1| conserved hypothetical protein [Thermus scotoductus SA-01]
          Length = 371

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 14/200 (7%)

Query: 54  GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
           G A A   G  +P    R++L+G +H     K A +    ++TP G    DL  ++ L A
Sbjct: 156 GAALAALEGTPEP---QRIYLVGVAHRPLKEKAA-ALPVPFQTPFGPAEPDLAALQALDA 211

Query: 114 TGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
              +EL +  +    EHS+E+ L +L  VF     K++P+LVG  + E     G      
Sbjct: 212 LLPYELFNTPLAFREEHSLELPLFFLKGVFPKS--KVLPLLVGRRSPE----LGEALKVV 265

Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
             D      ++ D  H G RF      +    +       D   ++ +  G+P+A   +L
Sbjct: 266 RKDYPGLLVLAVDLSHVGPRFGDPPLSR---TLAAEARRRDLGFLERLAEGEPEAALAFL 322

Query: 233 LEYDNTICGRHPISVFLHML 252
                 I G   ++  L +L
Sbjct: 323 GANPTRIDGVEVVASLLPLL 342


>gi|431793613|ref|YP_007220518.1| dioxygenase [Desulfitobacterium dichloroeliminans LMG P-21439]
 gi|430783839|gb|AGA69122.1| putative dioxygenase [Desulfitobacterium dichloroeliminans LMG
           P-21439]
          Length = 299

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 89/201 (44%), Gaps = 13/201 (6%)

Query: 6   RASHAGSWYTDNSKRLAEELDGWLRE-AGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
           RA +A +     S    +EL   L   +G PK  E R V A    +  +GR    A   +
Sbjct: 34  RAQNAPTVAIHPSFVTQQELTTILASGSGKPKITERRAVSAVLPHHLLAGRLLVDAIEAL 93

Query: 65  DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
                + V L+GP+H  +  +  +S  + ++TP G + +D EV++ L A G   + D  V
Sbjct: 94  ALQEPNLVILVGPNHFNHGGRI-ISGFSGWQTPEGVMEVDAEVVDHLLAKG-LVVTDEEV 151

Query: 125 DEAEHSMEMHLPYLAKVF-EGHLIKIV---PILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
              EHS+    P L     E  ++ ++    I +  V+A  E +       +VDD +   
Sbjct: 152 LAKEHSIGALAPMLKHFLPEADIVPLILHHDISLKEVDALLEGL-----EVFVDDKAILI 206

Query: 181 SVSSDFCHWGSRFNYMHYDKK 201
           + S DF H+ +R      D++
Sbjct: 207 A-SVDFSHYLTRSEAQFKDQE 226


>gi|315427636|dbj|BAJ49233.1| conserved hypothetical protein, partial [Candidatus Caldiarchaeum
           subterraneum]
          Length = 119

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 9/119 (7%)

Query: 1   MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP-------EVRGVIAPHAGYSY 52
           M+  RR + AG +Y  + + L  +++  +L   G  K+P           +I+PHAG  Y
Sbjct: 1   MKPRRRPAVAGYFYEGSREALLRQVEHCFLSPHGPGKTPAREWGKRRAPALISPHAGLMY 60

Query: 53  SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
           SG  AA+ +  +   ++   V + GP+H+      ++     + TP+G++ +D ++  E
Sbjct: 61  SGPVAAHGYYELTKYAVPESVVVFGPNHYGVGTVVSIYPGGSWVTPLGEVKIDEKLAAE 119


>gi|325095395|gb|EGC48705.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 112

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 7/52 (13%)

Query: 5  RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAG 49
          R A+HAGSWY+D+   L+ +L+ WL +      G+ + P    R +IAP+A 
Sbjct: 44 REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPYAS 95


>gi|311064695|ref|YP_003971420.1| hypothetical protein BBPR_1333 [Bifidobacterium bifidum PRL2010]
 gi|310867014|gb|ADP36383.1| Conserved hypothetical protein [Bifidobacterium bifidum PRL2010]
          Length = 94

 Score = 38.9 bits (89), Expect = 2.9,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 8/72 (11%)

Query: 4  IRRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPE--VRGVIAPHAGYSYSGR 55
          +R A+ AG++Y  N+  L   +D  +  A        P+ P+   + VI PHAGY YSG 
Sbjct: 13 VRPAAVAGAFYPANAGELQAMIDRQMNHARRLVSSLAPRLPQGAPKAVIVPHAGYVYSGS 72

Query: 56 AAAYAFGNIDPT 67
           A  A+  ++ T
Sbjct: 73 VATLAYALLERT 84


>gi|390937225|ref|YP_006394784.1| hypothetical protein BBB_1313 [Bifidobacterium bifidum BGN4]
 gi|389890838|gb|AFL04905.1| hypothetical protein BBB_1313 [Bifidobacterium bifidum BGN4]
          Length = 94

 Score = 38.5 bits (88), Expect = 3.1,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 4  IRRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPE--VRGVIAPHAGYSYSGR 55
          +R A+ AG++Y  N+  L   +D  +  A        P+ P+   + +I PHAGY YSG 
Sbjct: 13 VRPAAVAGAFYPVNAGELQAMIDRQMNHARRLVSSLAPRLPQGAPKALIVPHAGYVYSGS 72

Query: 56 AAAYAFGNIDPT 67
           AA A+  ++ T
Sbjct: 73 VAALAYALLERT 84


>gi|374813154|ref|ZP_09716891.1| hypothetical protein TpriZ_04731 [Treponema primitia ZAS-1]
          Length = 283

 Score = 38.1 bits (87), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 44/195 (22%), Positives = 85/195 (43%), Gaps = 15/195 (7%)

Query: 49  GYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
           G+ YSG  AA AF ++    +  V ++G  H     +  ++     +TP+GD+P+D E  
Sbjct: 46  GWRYSGAIAARAFLSLK-RDVDTVLVIG-GHIPGGMRPLMAEEDAVETPLGDIPVDTEFR 103

Query: 109 EELKATGKFELMDICVDE-AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
             L  +     +D   D+ A++++E+ LP +   F     K++ +   A  +  E+  G+
Sbjct: 104 IALGQS-----LDCLPDQYADNTVEVQLPMVRYFFPAA--KLLWLRFPADESSFES--GK 154

Query: 168 LFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
              +  +       V  S+D  H+G+ + Y         +    +  D   +  +++GDP
Sbjct: 155 EITRIAEKIGRNLVVLGSTDLTHYGNNYGYSPQGGGKKALAWVRDVNDANFIAAVKSGDP 214

Query: 226 DAFKKYLLEYDNTIC 240
            A      E D + C
Sbjct: 215 RAILDR-AEKDRSAC 228


>gi|407940365|ref|YP_006856006.1| hypothetical protein C380_18370 [Acidovorax sp. KKS102]
 gi|407898159|gb|AFU47368.1| hypothetical protein C380_18370 [Acidovorax sp. KKS102]
          Length = 191

 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 7/85 (8%)

Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH-----KSIEALDKMGMDIIETGD 224
           A Y  DP++ F+ S +  H+G+  N   +DKK G +      KS +A     +  + TG 
Sbjct: 24  ANYAIDPTHTFA-SFEISHFGAAVNRGRFDKKEGTVQFDKAGKSGKAEITFDVTSVNTGT 82

Query: 225 PDAFKKYLLEYDNTICGRHPISVFL 249
           P AF K+L   D     +HP   F+
Sbjct: 83  P-AFDKHLQSADLFDAAKHPTMKFV 106


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.410 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,658,532,019
Number of Sequences: 23463169
Number of extensions: 196891449
Number of successful extensions: 468330
Number of sequences better than 100.0: 908
Number of HSP's better than 100.0 without gapping: 645
Number of HSP's successfully gapped in prelim test: 263
Number of HSP's that attempted gapping in prelim test: 465485
Number of HSP's gapped (non-prelim): 993
length of query: 291
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 150
effective length of database: 9,050,888,538
effective search space: 1357633280700
effective search space used: 1357633280700
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)