BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013382
(444 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224055474|ref|XP_002298511.1| predicted protein [Populus trichocarpa]
gi|222845769|gb|EEE83316.1| predicted protein [Populus trichocarpa]
Length = 543
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/417 (85%), Positives = 388/417 (93%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV PITPGQVSFLLG IPVFVAWIYSEFLEYKK SS KVHSD NL++LEKETIKEDD
Sbjct: 1 MVVSGPITPGQVSFLLGFIPVFVAWIYSEFLEYKKTSSPQKVHSDNNLLDLEKETIKEDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RAVLLEGGL RSASA+ SS+IK NLIRFMT+DD+FLLENRATLRAM+EFGA+L YFYIC
Sbjct: 61 RAVLLEGGLPRSASAKFHSSAIKMNLIRFMTLDDSFLLENRATLRAMSEFGAVLLYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+LG+STK+YNRDLF+FLY+LL+IVS+MTSL+KH DKS F+GK++ YLNRHQTEEWK
Sbjct: 121 DRTNILGESTKSYNRDLFVFLYILLIIVSSMTSLRKHTDKSAFTGKSMLYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFS+ RF+QMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSVARFSQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCI+LNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNE SVM VKIL+CFLVVIL
Sbjct: 241 FFVAFCCIILNNDYMLYYICPMHTLFTLMVYGALGIFNKYNENSSVMAVKILSCFLVVIL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
IWEIPGVFD WSPLTF+LGY+DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY+HP E
Sbjct: 301 IWEIPGVFDFLWSPLTFLLGYSDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYFHPNIE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
KWMEKLEESE K+KLS+K GIV V++ VGYLWYE IYKLDKV+YNKYHPYTSWIPIT
Sbjct: 361 KWMEKLEESETKKKLSMKTGIVAVSVSVGYLWYEYIYKLDKVSYNKYHPYTSWIPIT 417
>gi|359483835|ref|XP_002272126.2| PREDICTED: CAS1 domain-containing protein 1-like [Vitis vinifera]
Length = 545
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/439 (82%), Positives = 392/439 (89%), Gaps = 2/439 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV PITPGQVSFLLGIIPVFVAWIYSEFLEYKK SS +KVHSD NLVEL ETIKEDD
Sbjct: 1 MVVSSPITPGQVSFLLGIIPVFVAWIYSEFLEYKKSSSPSKVHSDNNLVELGSETIKEDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RA+LLEGGL++SASA+ SSSIK NLIRF+TMDD+FLLENR TLRAM+EFGAIL YFY+C
Sbjct: 61 RAILLEGGLTKSASAKFNSSSIKVNLIRFLTMDDSFLLENRLTLRAMSEFGAILTYFYVC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT LLGDSTKNYNRDLF+FLYLLLVIV MTSLKKH+DKS FSGK + YLNRHQTEEWK
Sbjct: 121 DRTELLGDSTKNYNRDLFIFLYLLLVIVCFMTSLKKHHDKSAFSGKALLYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RF QMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAAAEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFTQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNEI SVM VKILACFLVVIL
Sbjct: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFNKYNEIRSVMAVKILACFLVVIL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVFDIFWSP F+LGY+ DP+K LPRLHEWHFRSGLDRYIWIIGMIYAYYHP
Sbjct: 301 IWEIPGVFDIFWSPSAFLLGYSDPDPSKQGLPRLHEWHFRSGLDRYIWIIGMIYAYYHPN 360
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITY 418
EKWMEKLEE+E KR+L+IK IVTV +FVGYLWYE IYKLDKVTYNK+HPYTSWIPIT
Sbjct: 361 VEKWMEKLEETETKRRLTIKTSIVTVTVFVGYLWYEYIYKLDKVTYNKFHPYTSWIPITV 420
Query: 419 VLFIFYFFSLVKHLSGSLY 437
+ + F ++ S +L+
Sbjct: 421 YISLRNFTQQLRSYSLTLF 439
>gi|224116436|ref|XP_002317300.1| predicted protein [Populus trichocarpa]
gi|222860365|gb|EEE97912.1| predicted protein [Populus trichocarpa]
Length = 565
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/411 (84%), Positives = 381/411 (92%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLE 66
I+ GQV+FLLGIIP+FVAWIYSEFLEYKK SS +K+HSD NL++LEKETIKEDDRAVLLE
Sbjct: 9 ISAGQVAFLLGIIPIFVAWIYSEFLEYKKTSSLSKLHSDNNLLDLEKETIKEDDRAVLLE 68
Query: 67 GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLL 126
GGL RSASA+ SS+ K NLIRFMTMDD+FLLENR TLR M+EFGA+L YFYICDRTN+L
Sbjct: 69 GGLPRSASAKFHSSATKMNLIRFMTMDDSFLLENRTTLRVMSEFGAVLVYFYICDRTNIL 128
Query: 127 GDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVL 186
G+STKNYNRDLF+FLYLLL+IVSAMTSL+KH DKS F+GK+ YLNRHQTEEWKGWMQV+
Sbjct: 129 GESTKNYNRDLFVFLYLLLIIVSAMTSLRKHTDKSTFTGKSTLYLNRHQTEEWKGWMQVI 188
Query: 187 FLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFC 246
FLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFS+ RFAQMMWRLN FVAFC
Sbjct: 189 FLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSVARFAQMMWRLNLFVAFC 248
Query: 247 CIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPG 306
CIVLNNDYMLYYICPMHTLFT+MVYG +GIFNKYNE SV+ VKIL+CFL+VILIWE PG
Sbjct: 249 CIVLNNDYMLYYICPMHTLFTVMVYGVLGIFNKYNENSSVIAVKILSCFLMVILIWETPG 308
Query: 307 VFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKL 366
VFDI WSPLTF+LGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY+HP EKWMEKL
Sbjct: 309 VFDILWSPLTFLLGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYFHPNVEKWMEKL 368
Query: 367 EESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
EESE K+KLSIK G+V V+L VGYLWYECIYKLDKV+YNKYHPYTSWIPIT
Sbjct: 369 EESEIKKKLSIKTGLVAVSLSVGYLWYECIYKLDKVSYNKYHPYTSWIPIT 419
>gi|255557403|ref|XP_002519732.1| O-acetyltransferase, putative [Ricinus communis]
gi|223541149|gb|EEF42705.1| O-acetyltransferase, putative [Ricinus communis]
Length = 578
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/417 (86%), Positives = 380/417 (91%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M V ITPGQVSFLLGIIPVFVAW YSEFL+YKK SS +KVHSD NLVELEKETIKEDD
Sbjct: 1 MAVSGAITPGQVSFLLGIIPVFVAWAYSEFLDYKKFSSPSKVHSDNNLVELEKETIKEDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RAVLLEGGL R+AS + S+SIK NLIRFMTMDDAFLLE+RATL+AM+EFGAI+ +FYI
Sbjct: 61 RAVLLEGGLPRTASMKFHSASIKMNLIRFMTMDDAFLLEHRATLKAMSEFGAIMIFFYIS 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT LLGDS K+Y+RDLFLFLY LL+IVSAMTSLKKHNDKS FSGK I YLNRHQTEEWK
Sbjct: 121 DRTTLLGDSAKSYSRDLFLFLYFLLIIVSAMTSLKKHNDKSAFSGKAILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFC IVLNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNE SVM VKIL+CFLVVIL
Sbjct: 241 FFVAFCSIVLNNDYMLYYICPMHTLFTLMVYGALGIFNKYNENSSVMAVKILSCFLVVIL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
IWEIPGVFDI WSPL F+LGYTDPAKPDLPR+HEWHFRSGLDRYIWIIGMIYAYYHP E
Sbjct: 301 IWEIPGVFDILWSPLFFLLGYTDPAKPDLPRMHEWHFRSGLDRYIWIIGMIYAYYHPNIE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
KWMEKLEESE KR+LSIK GIV VA VG LWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 361 KWMEKLEESETKRRLSIKTGIVAVASLVGCLWYEYIYKLDKVTYNKYHPYTSWIPIT 417
>gi|356512163|ref|XP_003524790.1| PREDICTED: CAS1 domain-containing protein 1-like [Glycine max]
Length = 543
Score = 730 bits (1885), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/437 (81%), Positives = 394/437 (90%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV PITPGQVSFLLG+IPVFV+WIYSE+LEYKK+SS KVHSD +L EL K+ IKEDD
Sbjct: 1 MVVSGPITPGQVSFLLGVIPVFVSWIYSEYLEYKKLSSPPKVHSDASLEELGKDAIKEDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RA+LLE GL+RSASA+ S+K NLIRF+TMDD+FLLENRATLRAMAEFG ILFYFYIC
Sbjct: 61 RAILLESGLTRSASAKFHPPSVKLNLIRFLTMDDSFLLENRATLRAMAEFGLILFYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+LGDSTK+Y+RDLF+FLYLLL+IVSA++SLKKHND S FSGK I YLNRHQTEEWK
Sbjct: 121 DRTNILGDSTKSYSRDLFIFLYLLLIIVSALSSLKKHNDSSTFSGKNILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLPRFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLPRFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GI+NKYNEI SVM KILACFLVVIL
Sbjct: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIYNKYNEIPSVMAAKILACFLVVIL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WEIPG FDIFWSP LGYTDPAKPDLPR+HEWHFRSGLDRYIWIIGMIYAY+HP E
Sbjct: 301 VWEIPGFFDIFWSPFALFLGYTDPAKPDLPRMHEWHFRSGLDRYIWIIGMIYAYFHPNVE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVL 420
KWMEKLEES+ KR+++IK GIV+VALFVGYLWYE IYKLDKV+YNK HPYTSWIPIT +
Sbjct: 361 KWMEKLEESDTKRRVTIKTGIVSVALFVGYLWYEYIYKLDKVSYNKLHPYTSWIPITVYI 420
Query: 421 FIFYFFSLVKHLSGSLY 437
+ F +++ S +L+
Sbjct: 421 CLRNFSQRLRNFSLTLF 437
>gi|356528112|ref|XP_003532649.1| PREDICTED: CAS1 domain-containing protein 1-like [Glycine max]
Length = 545
Score = 727 bits (1877), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/437 (81%), Positives = 391/437 (89%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV PITPGQVSFLLG+IPVFV+WIYSE+LEYKK SS KVHSD L EL K+ IKEDD
Sbjct: 1 MVVSGPITPGQVSFLLGVIPVFVSWIYSEYLEYKKPSSPPKVHSDVCLEELGKDAIKEDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RA+LLE GL+RSASA+ SS+K NLIRF+TMDD+FLLENRATLRAMAEFG ILFYFYIC
Sbjct: 61 RAILLESGLTRSASAKFHPSSVKLNLIRFLTMDDSFLLENRATLRAMAEFGLILFYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+LGDSTK+Y+RDLF+FLYLLL+IVSA++SLKKHND S FSGK I YLNRHQTEEWK
Sbjct: 121 DRTNILGDSTKSYSRDLFIFLYLLLIIVSALSSLKKHNDSSTFSGKNILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GI+NKYNE+ SVM KILACFLVVIL
Sbjct: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIYNKYNEVPSVMAAKILACFLVVIL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WEIPG FDIFWSP LGYTDPAKPDLPR+HEWHFRSGLDRYIWIIGMIYAY+HP E
Sbjct: 301 VWEIPGFFDIFWSPFALFLGYTDPAKPDLPRMHEWHFRSGLDRYIWIIGMIYAYFHPNVE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVL 420
KWMEKLEESE KR+ +IK IV+VALFVGYLWYE IYKLDKV+YNK HPYTSWIPIT +
Sbjct: 361 KWMEKLEESEIKRRATIKTSIVSVALFVGYLWYEYIYKLDKVSYNKLHPYTSWIPITVYI 420
Query: 421 FIFYFFSLVKHLSGSLY 437
+ F +++ S +L+
Sbjct: 421 CLRNFSQRLRNFSLTLF 437
>gi|297823229|ref|XP_002879497.1| O-acetyltransferase family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297325336|gb|EFH55756.1| O-acetyltransferase family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 540
Score = 724 bits (1869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/413 (81%), Positives = 377/413 (91%)
Query: 5 RPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVL 64
PITPGQVSFLLG+IPVF+AWIYSEFLEYK+ S H+KVHSD NLVEL + KED+ AVL
Sbjct: 5 EPITPGQVSFLLGVIPVFIAWIYSEFLEYKRSSLHSKVHSDNNLVELGEVKNKEDEVAVL 64
Query: 65 LEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTN 124
LEGGL+RS S + +S IKTNLIRF+T++D+FL+ENRATLRAMAEFGAILFYFYI DRT+
Sbjct: 65 LEGGLARSVSTKFYNSPIKTNLIRFLTLEDSFLIENRATLRAMAEFGAILFYFYISDRTS 124
Query: 125 LLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQ 184
LLG+S KNYNRDLFLFLY LL+IV+ MTSLKKHNDKSP +GK+I YLNRHQTEEWKGWMQ
Sbjct: 125 LLGESKKNYNRDLFLFLYCLLIIVAYMTSLKKHNDKSPITGKSILYLNRHQTEEWKGWMQ 184
Query: 185 VLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVA 244
VLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RF QMMWRLN FVA
Sbjct: 185 VLFLMYHYFAAAEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFTQMMWRLNLFVA 244
Query: 245 FCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEI 304
F CI+LNNDYMLYYICPMHTLFT+MVYGA+GIF++YNEI SVM++KI +CFLVVI++WEI
Sbjct: 245 FSCIILNNDYMLYYICPMHTLFTLMVYGALGIFSRYNEIPSVMVLKIASCFLVVIVMWEI 304
Query: 305 PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWME 364
PGVF+IFWSPLTF+LGYTDPAKPDLP LHEWHFRSGLDRYIWIIGMIYAY+HPT E+WME
Sbjct: 305 PGVFEIFWSPLTFLLGYTDPAKPDLPLLHEWHFRSGLDRYIWIIGMIYAYFHPTVERWME 364
Query: 365 KLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
KLEE + KRK+SIK I+ ++ FVGYLWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 365 KLEEFDAKRKMSIKTSIIAISSFVGYLWYEYIYKLDKVTYNKYHPYTSWIPIT 417
>gi|42569609|ref|NP_180988.3| O-acetyltransferase-like protein [Arabidopsis thaliana]
gi|79324285|ref|NP_001031478.1| O-acetyltransferase-like protein [Arabidopsis thaliana]
gi|51536464|gb|AAU05470.1| At2g34410 [Arabidopsis thaliana]
gi|55733775|gb|AAV59284.1| At2g34410 [Arabidopsis thaliana]
gi|62320442|dbj|BAD94920.1| hypothetical protein [Arabidopsis thaliana]
gi|110737554|dbj|BAF00719.1| hypothetical protein [Arabidopsis thaliana]
gi|222423437|dbj|BAH19689.1| AT2G34410 [Arabidopsis thaliana]
gi|330253875|gb|AEC08969.1| O-acetyltransferase-like protein [Arabidopsis thaliana]
gi|330253876|gb|AEC08970.1| O-acetyltransferase-like protein [Arabidopsis thaliana]
Length = 540
Score = 724 bits (1868), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/413 (81%), Positives = 376/413 (91%)
Query: 5 RPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVL 64
+PITPGQVSFLLG+IPVF+AWIYSEFLEYK+ S H+KVHSD NLVEL + KED+ VL
Sbjct: 5 QPITPGQVSFLLGVIPVFIAWIYSEFLEYKRSSLHSKVHSDNNLVELGEVKNKEDEGVVL 64
Query: 65 LEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTN 124
LEGGL RS S + +S IKTNLIRF+T++D+FL+ENRATLRAMAEFGAILFYFYI DRT+
Sbjct: 65 LEGGLPRSVSTKFYNSPIKTNLIRFLTLEDSFLIENRATLRAMAEFGAILFYFYISDRTS 124
Query: 125 LLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQ 184
LLG+S KNYNRDLFLFLY LL+IVSAMTSLKKHNDKSP +GK+I YLNRHQTEEWKGWMQ
Sbjct: 125 LLGESKKNYNRDLFLFLYCLLIIVSAMTSLKKHNDKSPITGKSILYLNRHQTEEWKGWMQ 184
Query: 185 VLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVA 244
VLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RF QMMWRLN FVA
Sbjct: 185 VLFLMYHYFAAAEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFTQMMWRLNLFVA 244
Query: 245 FCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEI 304
F CI+LNNDYMLYYICPMHTLFT+MVYGA+GIF++YNEI SVM +KI +CFLVVI++WEI
Sbjct: 245 FSCIILNNDYMLYYICPMHTLFTLMVYGALGIFSRYNEIPSVMALKIASCFLVVIVMWEI 304
Query: 305 PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWME 364
PGVF+IFWSPLTF+LGYTDPAKP+LP LHEWHFRSGLDRYIWIIGMIYAY+HPT E+WME
Sbjct: 305 PGVFEIFWSPLTFLLGYTDPAKPELPLLHEWHFRSGLDRYIWIIGMIYAYFHPTVERWME 364
Query: 365 KLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
KLEE + KRK+SIK I+ ++ FVGYLWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 365 KLEECDAKRKMSIKTSIIAISSFVGYLWYEYIYKLDKVTYNKYHPYTSWIPIT 417
>gi|10177715|dbj|BAB11089.1| unnamed protein product [Arabidopsis thaliana]
Length = 436
Score = 720 bits (1858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/434 (78%), Positives = 381/434 (87%), Gaps = 5/434 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MV PITPGQVSFLLG+IP+FV WIYSE LEY+K K HSD NLVEL K+DD
Sbjct: 1 MVDPGPITPGQVSFLLGVIPIFVGWIYSELLEYRKSWVPLKPHSDNNLVELGDVAEKDDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+A LLEGGL+RS S + +SSI+TN+IRF++M+D+FLLE+RATLRAM+EFGAIL YFYIC
Sbjct: 61 KADLLEGGLARSPSVKFHNSSIRTNIIRFLSMEDSFLLEHRATLRAMSEFGAILIYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT LLGDSTKNYNRDLFLFLY+LL+IVSAMTSL+KHNDKSP SGK+I YLNRHQTEEWK
Sbjct: 121 DRTELLGDSTKNYNRDLFLFLYVLLIIVSAMTSLRKHNDKSPISGKSILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIRIFIAAYVWMTGFGNFSYYY+RKDFS+ RFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSVARFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIF+KYNEIGSVM +KI +CFLVV L
Sbjct: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFSKYNEIGSVMALKIFSCFLVVFL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WEIPG F+IFW PLTF+LGY DPAKPDL RLHEWHFRSGLDRYIWIIGMIYAYYHPT E
Sbjct: 301 LWEIPGAFEIFWGPLTFLLGYNDPAKPDLHRLHEWHFRSGLDRYIWIIGMIYAYYHPTVE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYV- 419
+WMEKLE+ E K++LSIKA IVT+ + VGY+WYECIYKLD+ +YN YHPYTSWIPIT V
Sbjct: 361 RWMEKLEDCETKKRLSIKAAIVTITVLVGYVWYECIYKLDRTSYNMYHPYTSWIPITSVH 420
Query: 420 ----LFIFYFFSLV 429
L +F F S V
Sbjct: 421 KTLFLLVFIFVSYV 434
>gi|15983464|gb|AAL11600.1|AF424606_1 AT5g46340/MPL12_14 [Arabidopsis thaliana]
Length = 540
Score = 719 bits (1856), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/437 (77%), Positives = 384/437 (87%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MV PITPGQVSFLLG+IP+FV WIYSE LEY+K K HSD NLVEL K+DD
Sbjct: 1 MVDPGPITPGQVSFLLGVIPIFVGWIYSELLEYRKSWVPLKPHSDNNLVELGDVAEKDDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+A LLEGGL+RS S + +SSI+TN+IRF++M+D+FLLE+RATLRAM+EFGAIL YFYIC
Sbjct: 61 KADLLEGGLARSPSVKFHNSSIRTNIIRFLSMEDSFLLEHRATLRAMSEFGAILIYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT LLGDSTKNYNRDLFLFLY+LL+IVSAMTSL+KHNDKSP SGK+I YLNRHQTEEWK
Sbjct: 121 DRTELLGDSTKNYNRDLFLFLYVLLIIVSAMTSLRKHNDKSPISGKSILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIRIFIAAYVWMTGFGNFSYYY+RKDFS+ RFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSVARFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIF+KYNEIGSVM +KI +CFLVV L
Sbjct: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFSKYNEIGSVMALKIFSCFLVVFL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WEIPG F+IFW PLTF+LGY DPAKPDL RLHEWHFRSGLDRYIWIIGMIYAYYHPT E
Sbjct: 301 LWEIPGAFEIFWGPLTFLLGYNDPAKPDLHRLHEWHFRSGLDRYIWIIGMIYAYYHPTVE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVL 420
+WMEKLE+ E K++LSIKA IVT+ + VGY+WYECIYKLD+ +YN YHPYTSWIPIT +
Sbjct: 361 RWMEKLEDCETKKRLSIKAAIVTITVLVGYVWYECIYKLDRTSYNMYHPYTSWIPITVYI 420
Query: 421 FIFYFFSLVKHLSGSLY 437
+ F ++ +S +L+
Sbjct: 421 CLRNFTHQLRSVSLTLF 437
>gi|18422663|ref|NP_568662.1| putative O-acetyltransferase [Arabidopsis thaliana]
gi|22531002|gb|AAM97005.1| putative protein [Arabidopsis thaliana]
gi|332007988|gb|AED95371.1| putative O-acetyltransferase [Arabidopsis thaliana]
Length = 540
Score = 719 bits (1856), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/437 (77%), Positives = 384/437 (87%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MV PITPGQVSFLLG+IP+FV WIYSE LEY+K K HSD NLVEL K+DD
Sbjct: 1 MVDPGPITPGQVSFLLGVIPIFVGWIYSELLEYRKSWVPLKPHSDNNLVELGDVAEKDDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+A LLEGGL+RS S + +SSI+TN+IRF++M+D+FLLE+RATLRAM+EFGAIL YFYIC
Sbjct: 61 KADLLEGGLARSPSVKFHNSSIRTNIIRFLSMEDSFLLEHRATLRAMSEFGAILIYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT LLGDSTKNYNRDLFLFLY+LL+IVSAMTSL+KHNDKSP SGK+I YLNRHQTEEWK
Sbjct: 121 DRTELLGDSTKNYNRDLFLFLYVLLIIVSAMTSLRKHNDKSPISGKSILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIRIFIAAYVWMTGFGNFSYYY+RKDFS+ RFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSVARFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIF+KYNEIGSVM +KI +CFLVV L
Sbjct: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFSKYNEIGSVMALKIFSCFLVVFL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WEIPG F+IFW PLTF+LGY DPAKPDL RLHEWHFRSGLDRYIWIIGMIYAYYHPT E
Sbjct: 301 LWEIPGAFEIFWGPLTFLLGYNDPAKPDLHRLHEWHFRSGLDRYIWIIGMIYAYYHPTVE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVL 420
+WMEKLE+ E K++LSIKA IVT+ + VGY+WYECIYKLD+ +YN YHPYTSWIPIT +
Sbjct: 361 RWMEKLEDCETKKRLSIKAAIVTITVLVGYVWYECIYKLDRTSYNMYHPYTSWIPITVYI 420
Query: 421 FIFYFFSLVKHLSGSLY 437
+ F ++ +S +L+
Sbjct: 421 CLRNFTHQLRSVSLTLF 437
>gi|449433583|ref|XP_004134577.1| PREDICTED: CAS1 domain-containing protein 1-like [Cucumis sativus]
Length = 542
Score = 712 bits (1837), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/433 (80%), Positives = 386/433 (89%), Gaps = 2/433 (0%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVEL--EKETIKEDDRAVL 64
I PGQVSFLLG+IP+FVAW+YSEFLEY+K +KVHSD NLVEL EK +DD A L
Sbjct: 7 INPGQVSFLLGVIPIFVAWVYSEFLEYQKSPLLSKVHSDNNLVELAEEKGDKVKDDEAAL 66
Query: 65 LEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTN 124
LEGGL+RSAS +L SSSIKTNLIRF+T+D++FLLENRATLRA++EFGAILFYFY+CDRTN
Sbjct: 67 LEGGLARSASVKLNSSSIKTNLIRFLTLDESFLLENRATLRALSEFGAILFYFYVCDRTN 126
Query: 125 LLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQ 184
+LGDSTKNYNRDLFLFLY+LL+IVSAMTSLKKH DKS FSGK I YLNRHQTEEWKGWMQ
Sbjct: 127 ILGDSTKNYNRDLFLFLYILLIIVSAMTSLKKHTDKSAFSGKAILYLNRHQTEEWKGWMQ 186
Query: 185 VLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVA 244
VLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RFAQMMWRLNFFVA
Sbjct: 187 VLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFAQMMWRLNFFVA 246
Query: 245 FCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEI 304
FCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNEI SVMI KI+ACFLVVI +WEI
Sbjct: 247 FCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFNKYNEIRSVMIGKIIACFLVVIAVWEI 306
Query: 305 PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWME 364
PGVF I WSP F+LGYTDPAK DLPRLHEWHFRSGLDRYIWIIGMIYAYYHP EKWME
Sbjct: 307 PGVFYIIWSPFKFLLGYTDPAKVDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPNVEKWME 366
Query: 365 KLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFY 424
+LEE+E ++++SIK IVTV++ GY+WYECIYKLDKVTYNKYHPYTSWIPIT + +
Sbjct: 367 RLEEAETRKRISIKTSIVTVSVIAGYMWYECIYKLDKVTYNKYHPYTSWIPITVYICLRN 426
Query: 425 FFSLVKHLSGSLY 437
F ++ S +L+
Sbjct: 427 FTHQLRSYSLTLF 439
>gi|297791045|ref|XP_002863407.1| hypothetical protein ARALYDRAFT_494338 [Arabidopsis lyrata subsp.
lyrata]
gi|297309242|gb|EFH39666.1| hypothetical protein ARALYDRAFT_494338 [Arabidopsis lyrata subsp.
lyrata]
Length = 540
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/437 (75%), Positives = 383/437 (87%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MV PITPGQVSFL G+IP+FV WIYSE LEY+K K HSD NLVEL K+DD
Sbjct: 1 MVDPGPITPGQVSFLFGVIPIFVGWIYSESLEYRKSLVPLKPHSDNNLVELGDVAEKDDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+A LLEGGL+R+ S R +SSI+TN++RF++M+D+FLLE+RATLRAM+EFGAIL YFYIC
Sbjct: 61 KADLLEGGLTRATSVRFHNSSIRTNIVRFLSMEDSFLLEHRATLRAMSEFGAILIYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT LLGDSTK+YNRDLFLFLY+LL+IVSAMTSL+KHNDKSP +GK+I YLNRHQTEEWK
Sbjct: 121 DRTELLGDSTKHYNRDLFLFLYVLLIIVSAMTSLRKHNDKSPITGKSILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIRIFIAAYVWMTGFGNFSYYY+RKDFS+ RFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSVARFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIF+KYN+IGSVM +KI +CFLVV L
Sbjct: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFSKYNDIGSVMALKIFSCFLVVFL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WE+PG F+IFW PLTF+LGY+DPAKPDL RLHEWHFRSGLDRYIWIIGMIYAYYHPT E
Sbjct: 301 LWEVPGAFEIFWGPLTFLLGYSDPAKPDLHRLHEWHFRSGLDRYIWIIGMIYAYYHPTVE 360
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVL 420
+W+EKLE+ E K++LSIK IVT+ + VGY+WYECIYKLDK +YN YHPYTSWIPIT +
Sbjct: 361 RWIEKLEDCETKKRLSIKTAIVTITVLVGYVWYECIYKLDKNSYNMYHPYTSWIPITVYI 420
Query: 421 FIFYFFSLVKHLSGSLY 437
+ F ++ +S +L+
Sbjct: 421 CLRNFTHQLRSVSLTLF 437
>gi|449479194|ref|XP_004155531.1| PREDICTED: LOW QUALITY PROTEIN: CAS1 domain-containing protein
1-like [Cucumis sativus]
Length = 542
Score = 709 bits (1830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/433 (79%), Positives = 385/433 (88%), Gaps = 2/433 (0%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVEL--EKETIKEDDRAVL 64
I PGQVSFLLG+IP+FVAW+YSEFLEY+K +KVHSD NLVEL EK +DD A L
Sbjct: 7 INPGQVSFLLGVIPIFVAWVYSEFLEYQKSPLLSKVHSDNNLVELAEEKGDKVKDDEAAL 66
Query: 65 LEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTN 124
LEGGL+RSAS +L SSSIKTNLIRF+T+D++FLLENRATLRA++EFGAILFYFY+CDRTN
Sbjct: 67 LEGGLARSASVKLNSSSIKTNLIRFLTLDESFLLENRATLRALSEFGAILFYFYVCDRTN 126
Query: 125 LLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQ 184
+LGDSTKNYNRDLFLFLY+LL+IVSAMTSLKKH DKS FSGK I YLNRHQTEEWKGWMQ
Sbjct: 127 ILGDSTKNYNRDLFLFLYILLIIVSAMTSLKKHTDKSAFSGKAILYLNRHQTEEWKGWMQ 186
Query: 185 VLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVA 244
VLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RFAQMMWRLN FVA
Sbjct: 187 VLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFAQMMWRLNXFVA 246
Query: 245 FCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEI 304
FCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNEI SVMI KI+ACFLVVI +WEI
Sbjct: 247 FCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFNKYNEIRSVMIGKIIACFLVVIAVWEI 306
Query: 305 PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWME 364
PGVF I WSP F+LGYTDPAK DLPRLHEWHFRSGLDRYIWIIGMIYAYYHP EKWME
Sbjct: 307 PGVFYIIWSPFKFLLGYTDPAKVDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPNVEKWME 366
Query: 365 KLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFY 424
+LEE+E ++++SIK IVTV++ GY+WYECIYKLDKVTYNKYHPYTSWIPIT + +
Sbjct: 367 RLEEAETRKRISIKTSIVTVSVIAGYMWYECIYKLDKVTYNKYHPYTSWIPITVYICLRN 426
Query: 425 FFSLVKHLSGSLY 437
F ++ S +L+
Sbjct: 427 FTHQLRSYSLTLF 439
>gi|297845910|ref|XP_002890836.1| hypothetical protein ARALYDRAFT_473199 [Arabidopsis lyrata subsp.
lyrata]
gi|297336678|gb|EFH67095.1| hypothetical protein ARALYDRAFT_473199 [Arabidopsis lyrata subsp.
lyrata]
Length = 582
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/459 (73%), Positives = 378/459 (82%), Gaps = 42/459 (9%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV +PITPGQVSFLLG+IP+ +AW+YSEFLEY++ S H KVHSD NLVELE T KED+
Sbjct: 1 MVVSQPITPGQVSFLLGVIPLMIAWLYSEFLEYRRSSLHAKVHSDKNLVELEMVTNKEDE 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
VL+EGGL RS S++ SSSIKTNL+RF+T++D+FLLENRATLRAMAEFGAIL YFYIC
Sbjct: 61 GTVLMEGGLPRSVSSKFYSSSIKTNLLRFLTLEDSFLLENRATLRAMAEFGAILLYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT+LLG S KNYNRDLFLFLY LL+IVSAMTSLKKH+DKSP +GK+I YLNRHQTEEWK
Sbjct: 121 DRTSLLGQSKKNYNRDLFLFLYCLLIIVSAMTSLKKHSDKSPITGKSILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFA------- 233
GWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSL RF
Sbjct: 181 GWMQVLFLMYHYFAAVEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFTQVRLTVL 240
Query: 234 ---------------------------------QMMWRLNFFVAFCCIVLNNDYMLYYIC 260
QMMWRLNFFVAFCCI+LNNDYMLYYIC
Sbjct: 241 LHHHTLFSLPCDMLLESIMSFKAQEFHESFYFIQMMWRLNFFVAFCCIILNNDYMLYYIC 300
Query: 261 PMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILG 320
PMHTLFT+MVYGA+GI+++YNEI SVM +KI +CFLVVI +WEIPGVF+IFWSPL F+LG
Sbjct: 301 PMHTLFTLMVYGALGIYSQYNEIASVMALKIASCFLVVIFLWEIPGVFEIFWSPLAFLLG 360
Query: 321 YTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTA--EKWMEKLEESEPKRKLSIK 378
YTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY+HPT E+WMEKLEE + KR++SIK
Sbjct: 361 YTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYFHPTVILERWMEKLEECDAKRRMSIK 420
Query: 379 AGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
I+ ++ FVGYLWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 421 TSIIAISSFVGYLWYEYIYKLDKVTYNKYHPYTSWIPIT 459
>gi|449517180|ref|XP_004165624.1| PREDICTED: probable O-acetyltransferase CAS1-like [Cucumis sativus]
Length = 540
Score = 694 bits (1792), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/431 (77%), Positives = 382/431 (88%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLE 66
ITPGQ+SFLLGI P+FV+WIYSEFLEY+K S+ K HSD NL +L T+KEDD+AVLLE
Sbjct: 7 ITPGQISFLLGISPIFVSWIYSEFLEYRKSSAPPKAHSDINLADLGGVTVKEDDQAVLLE 66
Query: 67 GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLL 126
GGL+R ASA++ SSSI TNLIRF T+DD FLLENR+TLRAM+EFGAIL YF++CDRT++L
Sbjct: 67 GGLARPASAKIHSSSITTNLIRFFTLDDTFLLENRSTLRAMSEFGAILLYFFVCDRTSIL 126
Query: 127 GDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVL 186
DS KNY+RDLFLFLY+LL+IVSA TSLKKH+DKS FSGK+I YLNRHQTEEWKGWMQVL
Sbjct: 127 ADSKKNYSRDLFLFLYILLIIVSAATSLKKHSDKSAFSGKSILYLNRHQTEEWKGWMQVL 186
Query: 187 FLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFC 246
FLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFS+ RFAQMMWRLNFFV FC
Sbjct: 187 FLMYHYFAAAEIYNAIRMFIAAYVWMTGFGNFSYYYIRKDFSVARFAQMMWRLNFFVIFC 246
Query: 247 CIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPG 306
CIVLNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNE SV+ KILACFLVVILIWE+PG
Sbjct: 247 CIVLNNDYMLYYICPMHTLFTLMVYGALGIFNKYNEKSSVIAAKILACFLVVILIWEVPG 306
Query: 307 VFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKL 366
VFD WSPLTF LGYTDPAKP LP+LHEWHFRSGLDRYIWI+GMIYAY+HP EKWMEKL
Sbjct: 307 VFDALWSPLTFFLGYTDPAKPQLPKLHEWHFRSGLDRYIWIVGMIYAYFHPNVEKWMEKL 366
Query: 367 EESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFF 426
EE++ ++++SIKA IVTVAL VGY+WYE IYKLDK++YNKYHPYTSWIPIT + + F
Sbjct: 367 EEADTRKRVSIKACIVTVALSVGYMWYEWIYKLDKISYNKYHPYTSWIPITVYICLRNFT 426
Query: 427 SLVKHLSGSLY 437
++ S +L+
Sbjct: 427 QQFRNYSLTLF 437
>gi|79318876|ref|NP_001031111.1| O-acetyltransferase-like protein [Arabidopsis thaliana]
gi|332193026|gb|AEE31147.1| O-acetyltransferase-like protein [Arabidopsis thaliana]
Length = 584
Score = 693 bits (1789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/456 (73%), Positives = 377/456 (82%), Gaps = 39/456 (8%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV +PITPGQVSFLLG+IP+ +AW+YSEFLEY++ S H KVHSD NLVELE T KED+
Sbjct: 6 MVVSQPITPGQVSFLLGVIPLMIAWLYSEFLEYRRSSFHAKVHSDKNLVELEMVTNKEDE 65
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
VL+EGGL RSAS++ SS IKTNLIRF+T++D+FLLENRATLRAMAEFGAIL YFYIC
Sbjct: 66 GTVLMEGGLPRSASSKFYSSPIKTNLIRFLTLEDSFLLENRATLRAMAEFGAILLYFYIC 125
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT+L+G S KNY+RDLFLFL+ LL+IVSAMTSLKKH DKSP +GK+I YLNRHQTEEWK
Sbjct: 126 DRTSLIGQSQKNYSRDLFLFLFCLLIIVSAMTSLKKHTDKSPITGKSILYLNRHQTEEWK 185
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFA------- 233
GWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSL RF
Sbjct: 186 GWMQVLFLMYHYFAAVEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFTQVRSTIF 245
Query: 234 --------------------------------QMMWRLNFFVAFCCIVLNNDYMLYYICP 261
QMMWRLNFFVAFCCI+LNNDYMLYYICP
Sbjct: 246 DHHSLFSLPCDVLLESTMSFKAQDFYESFYLIQMMWRLNFFVAFCCIILNNDYMLYYICP 305
Query: 262 MHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY 321
MHTLFT+MVYGA+GI+++YNEI SVM +KI +CFLVVIL+WEIPGVF+IFWSPL F+LGY
Sbjct: 306 MHTLFTLMVYGALGIYSQYNEIASVMALKIASCFLVVILMWEIPGVFEIFWSPLAFLLGY 365
Query: 322 TDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI 381
TDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY+HPT E+WMEKLEE + KR++SIK I
Sbjct: 366 TDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYFHPTVERWMEKLEECDAKRRMSIKTSI 425
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
+ ++ F GYLWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 426 IGISSFAGYLWYEYIYKLDKVTYNKYHPYTSWIPIT 461
>gi|334184680|ref|NP_001031479.2| O-acetyltransferase-like protein [Arabidopsis thaliana]
gi|330253877|gb|AEC08971.1| O-acetyltransferase-like protein [Arabidopsis thaliana]
Length = 527
Score = 689 bits (1779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/413 (78%), Positives = 364/413 (88%), Gaps = 13/413 (3%)
Query: 5 RPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVL 64
+PITPGQVSFLLG+IPVF+AWIYSEFLEYK+ S H+KVHSD NLVEL + KED+ VL
Sbjct: 5 QPITPGQVSFLLGVIPVFIAWIYSEFLEYKRSSLHSKVHSDNNLVELGEVKNKEDEGVVL 64
Query: 65 LEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTN 124
LEGGL RS S + +S IKTNLIRF+T++D+FL+ENRATLRA+ DRT+
Sbjct: 65 LEGGLPRSVSTKFYNSPIKTNLIRFLTLEDSFLIENRATLRAI-------------DRTS 111
Query: 125 LLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQ 184
LLG+S KNYNRDLFLFLY LL+IVSAMTSLKKHNDKSP +GK+I YLNRHQTEEWKGWMQ
Sbjct: 112 LLGESKKNYNRDLFLFLYCLLIIVSAMTSLKKHNDKSPITGKSILYLNRHQTEEWKGWMQ 171
Query: 185 VLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVA 244
VLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RF QMMWRLN FVA
Sbjct: 172 VLFLMYHYFAAAEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFTQMMWRLNLFVA 231
Query: 245 FCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEI 304
F CI+LNNDYMLYYICPMHTLFT+MVYGA+GIF++YNEI SVM +KI +CFLVVI++WEI
Sbjct: 232 FSCIILNNDYMLYYICPMHTLFTLMVYGALGIFSRYNEIPSVMALKIASCFLVVIVMWEI 291
Query: 305 PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWME 364
PGVF+IFWSPLTF+LGYTDPAKP+LP LHEWHFRSGLDRYIWIIGMIYAY+HPT E+WME
Sbjct: 292 PGVFEIFWSPLTFLLGYTDPAKPELPLLHEWHFRSGLDRYIWIIGMIYAYFHPTVERWME 351
Query: 365 KLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
KLEE + KRK+SIK I+ ++ FVGYLWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 352 KLEECDAKRKMSIKTSIIAISSFVGYLWYEYIYKLDKVTYNKYHPYTSWIPIT 404
>gi|359477106|ref|XP_003631938.1| PREDICTED: CAS1 domain-containing protein 1-like [Vitis vinifera]
gi|296083216|emb|CBI22852.3| unnamed protein product [Vitis vinifera]
Length = 544
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/419 (75%), Positives = 363/419 (86%), Gaps = 3/419 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M++ PITPGQV+F +G + VF AWIY+EFLEYKK + +K HSD NLVEL ET+KEDD
Sbjct: 1 MMIVGPITPGQVAFFIGFVSVFAAWIYAEFLEYKKNAFPSKTHSDLNLVEL-NETVKEDD 59
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RAVLLEGG +S S + SSS+ +++ RF+ M+++FL+E R TLRAM EFGA+L YFY+C
Sbjct: 60 RAVLLEGGGLQSVSPKARSSSVTSHIFRFLLMEESFLIEYRLTLRAMCEFGALLAYFYLC 119
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTNL GDS K+YNRDLF+FLY LL+IVSA+TS K H+DKS FSGK+I YLNRHQTEEWK
Sbjct: 120 DRTNLFGDSKKSYNRDLFIFLYFLLIIVSAVTSFKVHHDKSSFSGKSILYLNRHQTEEWK 179
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLN
Sbjct: 180 GWMQVLFLMYHYFAATEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 239
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
F V FCC+VLNN YMLYYICPMHTLFT+MVYGA+GI NKYNEIG V+ VKI+ACFLVV+L
Sbjct: 240 FLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGLVIAVKIVACFLVVVL 299
Query: 301 IWEIPGVFDIFWSPLTFILGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
+WEIPGVF+ WSPLTFILGYT DP+K RLHEWHFRSGLDRYIWIIGMIYAYYHPT
Sbjct: 300 LWEIPGVFEFVWSPLTFILGYTDPDPSKQKFSRLHEWHFRSGLDRYIWIIGMIYAYYHPT 359
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEE+E K +++IK I TVAL VGYLW+E IYKLDK+TYNKYHPYTSWIPI+
Sbjct: 360 VERWMEKLEETEVKLRVAIKMAIATVALTVGYLWFEHIYKLDKLTYNKYHPYTSWIPIS 418
>gi|115438729|ref|NP_001043644.1| Os01g0631100 [Oryza sativa Japonica Group]
gi|54291551|dbj|BAD61411.1| O-acetyltransferase-like [Oryza sativa Japonica Group]
gi|55297065|dbj|BAD68634.1| O-acetyltransferase-like [Oryza sativa Japonica Group]
gi|113533175|dbj|BAF05558.1| Os01g0631100 [Oryza sativa Japonica Group]
Length = 539
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/419 (77%), Positives = 368/419 (87%), Gaps = 6/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+T GQVSFLLG+ PV +AWIYSE LEY+K SS KVHSD+NL E T+KEDD
Sbjct: 1 MEVFGPVTAGQVSFLLGLFPVLIAWIYSEVLEYRK-SSSMKVHSDSNL---ENGTVKEDD 56
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+ VLLEGGLS+S S + +S K NLIRF+TMD++FLLENRA LRAMAEFG +L YFYIC
Sbjct: 57 KTVLLEGGLSKSPSTKFRINSTKANLIRFITMDESFLLENRAVLRAMAEFGIVLVYFYIC 116
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ +S K+YNRDLFLFLY+LL+I SA+TSLKKH+DKS FSGK+I YLNRHQTEEWK
Sbjct: 117 DRTNIFPESKKSYNRDLFLFLYILLIIASALTSLKKHHDKSAFSGKSILYLNRHQTEEWK 176
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYI+KDFSL RFAQMMWRLN
Sbjct: 177 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIKKDFSLARFAQMMWRLN 236
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+NDYMLYYICPMHTLFT+MVYG++G+FNKYNEI SVM +KI++CFL VIL
Sbjct: 237 FFVAFCCIVLDNDYMLYYICPMHTLFTLMVYGSLGLFNKYNEIPSVMAMKIVSCFLAVIL 296
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF++ WSP TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 297 IWEIPGVFELLWSPFTFLLGYKDPEPSKANLPLLHEWHFRSGLDRYIWIIGMIYAYFHPN 356
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +LSIK I++++L GYLWYE IYKLDK+TYNKYHPYTSWIPIT
Sbjct: 357 VERWMEKLEESETKVRLSIKGTIISISLVAGYLWYEYIYKLDKITYNKYHPYTSWIPIT 415
>gi|222618900|gb|EEE55032.1| hypothetical protein OsJ_02705 [Oryza sativa Japonica Group]
Length = 498
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/419 (77%), Positives = 368/419 (87%), Gaps = 6/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+T GQVSFLLG+ PV +AWIYSE LEY+K SS KVHSD+NL E T+KEDD
Sbjct: 1 MEVFGPVTAGQVSFLLGLFPVLIAWIYSEVLEYRK-SSSMKVHSDSNL---ENGTVKEDD 56
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+ VLLEGGLS+S S + +S K NLIRF+TMD++FLLENRA LRAMAEFG +L YFYIC
Sbjct: 57 KTVLLEGGLSKSPSTKFRINSTKANLIRFITMDESFLLENRAVLRAMAEFGIVLVYFYIC 116
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ +S K+YNRDLFLFLY+LL+I SA+TSLKKH+DKS FSGK+I YLNRHQTEEWK
Sbjct: 117 DRTNIFPESKKSYNRDLFLFLYILLIIASALTSLKKHHDKSAFSGKSILYLNRHQTEEWK 176
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYI+KDFSL RFAQMMWRLN
Sbjct: 177 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIKKDFSLARFAQMMWRLN 236
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+NDYMLYYICPMHTLFT+MVYG++G+FNKYNEI SVM +KI++CFL VIL
Sbjct: 237 FFVAFCCIVLDNDYMLYYICPMHTLFTLMVYGSLGLFNKYNEIPSVMAMKIVSCFLAVIL 296
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF++ WSP TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 297 IWEIPGVFELLWSPFTFLLGYKDPEPSKANLPLLHEWHFRSGLDRYIWIIGMIYAYFHPN 356
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +LSIK I++++L GYLWYE IYKLDK+TYNKYHPYTSWIPIT
Sbjct: 357 VERWMEKLEESETKVRLSIKGTIISISLVAGYLWYEYIYKLDKITYNKYHPYTSWIPIT 415
>gi|218188708|gb|EEC71135.1| hypothetical protein OsI_02953 [Oryza sativa Indica Group]
Length = 539
Score = 662 bits (1707), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/419 (76%), Positives = 367/419 (87%), Gaps = 6/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF +T GQVSFLLG+ PV +AWIYSE LEY+K SS KVHSD+NL E T+KEDD
Sbjct: 1 MEVFGAVTAGQVSFLLGLFPVLIAWIYSEVLEYRK-SSSMKVHSDSNL---ENGTVKEDD 56
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+ VLLEGGLS+S S + +S K NLIRF+TMD++FLLENRA LRAMAEFG +L YFYIC
Sbjct: 57 KTVLLEGGLSKSPSTKFRINSTKANLIRFITMDESFLLENRAVLRAMAEFGIVLVYFYIC 116
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ +S K+YNRDLFLFLY+LL+I SA+TSLKKH+DKS FSGK+I YLNRHQTEEWK
Sbjct: 117 DRTNIFPESKKSYNRDLFLFLYILLIIASALTSLKKHHDKSAFSGKSILYLNRHQTEEWK 176
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYI+KDFSL RFAQMMWRLN
Sbjct: 177 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIKKDFSLARFAQMMWRLN 236
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+NDYMLYYICPMHTLFT+MVYG++G+FNKYNEI SVM +KI++CFL VIL
Sbjct: 237 FFVAFCCIVLDNDYMLYYICPMHTLFTLMVYGSLGLFNKYNEIPSVMAMKIVSCFLAVIL 296
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF++ WSP TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 297 IWEIPGVFELLWSPFTFLLGYKDPEPSKANLPLLHEWHFRSGLDRYIWIIGMIYAYFHPN 356
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +LSIK I++++L GYLWYE IYKLDK+TYNKYHPYTSWIPIT
Sbjct: 357 VERWMEKLEESETKVRLSIKGTIISISLVAGYLWYEYIYKLDKITYNKYHPYTSWIPIT 415
>gi|242088987|ref|XP_002440326.1| hypothetical protein SORBIDRAFT_09g029750 [Sorghum bicolor]
gi|241945611|gb|EES18756.1| hypothetical protein SORBIDRAFT_09g029750 [Sorghum bicolor]
Length = 540
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/419 (75%), Positives = 363/419 (86%), Gaps = 5/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+TPGQVSFLLG+ PV +AWIYSE LEYKK SH KVHSDTNL + TIKEDD
Sbjct: 1 MEVFGPVTPGQVSFLLGLFPVLIAWIYSEILEYKKSLSHGKVHSDTNL---DNGTIKEDD 57
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
++VLLEGG +S S + + S K NL+RF+TMD++FLLENRA LRAMAEFG +L YFYIC
Sbjct: 58 KSVLLEGGQLKSPSTKFRNLSTKANLLRFITMDESFLLENRAVLRAMAEFGVVLVYFYIC 117
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ +S K+YNRDLFLFLY+LL+I SA+TSLKKH++KS FSGK+I YLNRHQTEEWK
Sbjct: 118 DRTNIFPESKKSYNRDLFLFLYILLIIASALTSLKKHHEKSAFSGKSILYLNRHQTEEWK 177
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIA YVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLN
Sbjct: 178 GWMQVLFLMYHYFAATEIYNAIRVFIACYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLN 237
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+ND MLYYICPMHTLFT+MVYG++G+FNKYNEI S+M +KI CFL VIL
Sbjct: 238 FFVAFCCIVLDNDLMLYYICPMHTLFTLMVYGSLGLFNKYNEIPSIMAIKIACCFLSVIL 297
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF++ W+P TF+LGY D P+K LP LHEWHFRSGLDRYIWI+GMIYAY+HP
Sbjct: 298 IWEIPGVFELLWAPFTFLLGYKDPSPSKAHLPLLHEWHFRSGLDRYIWIVGMIYAYFHPN 357
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +LSIK IVT++L G+LWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 358 VERWMEKLEESETKVRLSIKGTIVTLSLTAGFLWYEYIYKLDKVTYNKYHPYTSWIPIT 416
>gi|218197332|gb|EEC79759.1| hypothetical protein OsI_21144 [Oryza sativa Indica Group]
Length = 613
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/419 (75%), Positives = 362/419 (86%), Gaps = 5/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+TPGQVSFLLG+ PV + WIY+E LEY+K + KVHSD NL E ETIKEDD
Sbjct: 1 MEVFGPVTPGQVSFLLGLFPVLIGWIYAEILEYRKSLLYGKVHSDANL---ENETIKEDD 57
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+AVLLEGG S+S S +L + S K NLIRF+TMD++FLLENRA LRAMAE G IL YFYIC
Sbjct: 58 KAVLLEGGQSKSPSTKLRNMSTKANLIRFITMDESFLLENRAVLRAMAEVGIILVYFYIC 117
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ ++ K+YNRDLFLFLY+LL+I SA+TSLKKHN+KS F+GK+I YLNRHQTEEWK
Sbjct: 118 DRTNIFPETKKSYNRDLFLFLYILLIIASALTSLKKHNEKSAFTGKSILYLNRHQTEEWK 177
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLN
Sbjct: 178 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLN 237
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+NDYMLYYICPMHTLFT+MVYG++G+FNKYNE SVM +KI CFL VIL
Sbjct: 238 FFVAFCCIVLDNDYMLYYICPMHTLFTLMVYGSLGLFNKYNEKPSVMAIKIACCFLTVIL 297
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF+ W+P TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 298 IWEIPGVFEFLWAPFTFLLGYKDPEPSKANLPLLHEWHFRSGLDRYIWIIGMIYAYFHPN 357
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +L IK IVT++L GYLWYE IY+LDK+TYNKYHPYTSWIPIT
Sbjct: 358 VERWMEKLEESETKVRLFIKGAIVTLSLTAGYLWYEYIYRLDKITYNKYHPYTSWIPIT 416
>gi|356521737|ref|XP_003529508.1| PREDICTED: CAS1 domain-containing protein 1-like [Glycine max]
Length = 545
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/419 (75%), Positives = 358/419 (85%), Gaps = 2/419 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M++ P+TPGQVSFLLGIIPV VAWIYSE LEY+K S ++ SD NLVE+ + +K++D
Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSRAQSDINLVEMGSDVVKDED 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RAVLLEGG +S S + S + ++IRF+ MD+ FLLENR TLRAM+EFG IL YFY+C
Sbjct: 61 RAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYFYLC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT+ S K+YNRDLFLFLY LL+IVSAMTS K H+DKSP SGK+I YLNRHQTEEWK
Sbjct: 121 DRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFV FCCIVLNN YMLYYICPMHTLFT+MVYGA+GI +KYNEIGSV+ VKI+ACFLVVIL
Sbjct: 241 FFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLVVIL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
+WEIPGVF+ WSP TF LGYTD PAK L RLHEWHFRSGLDRYIWIIGMIYAYYHPT
Sbjct: 301 VWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYHPT 360
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEE+E KR++SIKA +V + VGYLW+E IYKLDK+ YNKYHPYTSWIPIT
Sbjct: 361 VERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIPIT 419
>gi|48475132|gb|AAT44201.1| unknown protein [Oryza sativa Japonica Group]
Length = 540
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/419 (75%), Positives = 362/419 (86%), Gaps = 5/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+TPGQVSFLLG+ PV + WIY+E LEY+K + KVHSD NL E ET+KEDD
Sbjct: 1 MEVFGPVTPGQVSFLLGLFPVLIGWIYAEILEYRKSLLYGKVHSDANL---ENETMKEDD 57
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+AVLLEGG S+S S +L + S K NLIRF+TMD++FLLENRA LRAMAE G IL YFYIC
Sbjct: 58 KAVLLEGGQSKSPSTKLRNMSTKANLIRFITMDESFLLENRAVLRAMAEVGIILVYFYIC 117
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ ++ K+YNRDLFLFLY+LL+I SA+TSLKKHN+KS F+GK+I YLNRHQTEEWK
Sbjct: 118 DRTNIFPETKKSYNRDLFLFLYILLIIASALTSLKKHNEKSAFTGKSILYLNRHQTEEWK 177
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLN
Sbjct: 178 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLN 237
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+NDYMLYYICPMHTLFT+MVYG++G+FNKYNE SVM +KI CFL VIL
Sbjct: 238 FFVAFCCIVLDNDYMLYYICPMHTLFTLMVYGSLGLFNKYNEKPSVMAIKIACCFLTVIL 297
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF+ W+P TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 298 IWEIPGVFEFLWAPFTFLLGYKDPEPSKANLPLLHEWHFRSGLDRYIWIIGMIYAYFHPN 357
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +L IK IVT++L GYLWYE IY+LDK+TYNKYHPYTSWIPIT
Sbjct: 358 VERWMEKLEESETKVRLFIKGAIVTLSLTAGYLWYEYIYRLDKITYNKYHPYTSWIPIT 416
>gi|147805322|emb|CAN69621.1| hypothetical protein VITISV_008604 [Vitis vinifera]
Length = 794
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/407 (77%), Positives = 345/407 (84%), Gaps = 24/407 (5%)
Query: 43 HSDTNLVELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRA 102
HSD NLVEL ETIKEDDRA+LLEGGL++SASA+ F+TMDD+FLLENR
Sbjct: 5 HSDNNLVELGSETIKEDDRAILLEGGLTKSASAK------------FLTMDDSFLLENRL 52
Query: 103 TLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSP 162
TLRAM+EFGAIL YFY+CDRT LLGDSTKNYNRDLF+FLYLLLVIV MTSLKKH+DKS
Sbjct: 53 TLRAMSEFGAILTYFYVCDRTELLGDSTKNYNRDLFIFLYLLLVIVCFMTSLKKHHDKSA 112
Query: 163 FSGKTIQYLNRHQTEEWKGWMQ----------VLFLMYHYFAATEIYNAIRIFIAAYVWM 212
FSGK + YLNRHQTEEWKGWMQ VLFLMYHYFAA EIYNAIR+FIAAYVWM
Sbjct: 113 FSGKALLYLNRHQTEEWKGWMQASFSKLNFNAVLFLMYHYFAAAEIYNAIRVFIAAYVWM 172
Query: 213 TGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYG 272
TGFGNFSYYYIRKDFSL RF QMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFT+MVYG
Sbjct: 173 TGFGNFSYYYIRKDFSLARFTQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTLMVYG 232
Query: 273 AVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYT--DPAKPDLP 330
A+GIFNKYNEI SVM VKILACFLVVILIWEIPGVFDIFWSP F+LGY+ DP+K LP
Sbjct: 233 ALGIFNKYNEIRSVMAVKILACFLVVILIWEIPGVFDIFWSPSAFLLGYSDPDPSKQGLP 292
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGY 390
RLHEWHFRSGLDRYIWIIGMIYAYYHP EKWMEKLEE+E KR+L+IK IVTV +FVGY
Sbjct: 293 RLHEWHFRSGLDRYIWIIGMIYAYYHPNVEKWMEKLEETETKRRLTIKTSIVTVTVFVGY 352
Query: 391 LWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLY 437
LWYE IYKLDKVTYNK+HPYTSWIPIT + + F ++ S +L+
Sbjct: 353 LWYEYIYKLDKVTYNKFHPYTSWIPITVYISLRNFTQQLRSYSLTLF 399
>gi|222632697|gb|EEE64829.1| hypothetical protein OsJ_19686 [Oryza sativa Japonica Group]
Length = 614
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/419 (75%), Positives = 362/419 (86%), Gaps = 5/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+TPGQVSFLLG+ PV + WIY+E LEY+K + KVHSD NL E ET+KEDD
Sbjct: 1 MEVFGPVTPGQVSFLLGLFPVLIGWIYAEILEYRKSLLYGKVHSDANL---ENETMKEDD 57
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+AVLLEGG S+S S +L + S K NLIRF+TMD++FLLENRA LRAMAE G IL YFYIC
Sbjct: 58 KAVLLEGGQSKSPSTKLRNMSTKANLIRFITMDESFLLENRAVLRAMAEVGIILVYFYIC 117
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ ++ K+YNRDLFLFLY+LL+I SA+TSLKKHN+KS F+GK+I YLNRHQTEEWK
Sbjct: 118 DRTNIFPETKKSYNRDLFLFLYILLIIASALTSLKKHNEKSAFTGKSILYLNRHQTEEWK 177
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLN
Sbjct: 178 GWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLN 237
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+NDYMLYYICPMHTLFT+MVYG++G+FNKYNE SVM +KI CFL VIL
Sbjct: 238 FFVAFCCIVLDNDYMLYYICPMHTLFTLMVYGSLGLFNKYNEKPSVMAIKIACCFLTVIL 297
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF+ W+P TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 298 IWEIPGVFEFLWAPFTFLLGYKDPEPSKANLPLLHEWHFRSGLDRYIWIIGMIYAYFHPN 357
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +L IK IVT++L GYLWYE IY+LDK+TYNKYHPYTSWIPIT
Sbjct: 358 VERWMEKLEESETKVRLFIKGAIVTLSLTAGYLWYEYIYRLDKITYNKYHPYTSWIPIT 416
>gi|357132410|ref|XP_003567823.1| PREDICTED: CAS1 domain-containing protein 1-like [Brachypodium
distachyon]
Length = 540
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/419 (74%), Positives = 359/419 (85%), Gaps = 5/419 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+TPGQVSFLLG+ PV +AW Y+E LEY+K SH KVHSD L E ET KED+
Sbjct: 1 MEVFGPVTPGQVSFLLGLFPVLIAWTYAEILEYRKSLSHGKVHSDATL---ENETTKEDE 57
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+A+L+EGG +S S + + S K NLIRF+TMD++FLLENRA LRAMAEFG +L YFYIC
Sbjct: 58 KAILIEGGQLKSPSVKFRNMSTKANLIRFITMDESFLLENRAVLRAMAEFGVVLVYFYIC 117
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ +S KNYNRDLFLFLY+LL+I SA+TSLKKH +KS FSGK+I YLNRHQTEEWK
Sbjct: 118 DRTNIFPESKKNYNRDLFLFLYILLIIASALTSLKKHQEKSAFSGKSILYLNRHQTEEWK 177
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIR+FIA YVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLN
Sbjct: 178 GWMQVLFLMYHYFAAFEIYNAIRVFIACYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLN 237
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVL+ND+MLYYICPMHTLFT+MVYG++G+FNKYNE+ SVM +KI CFL VIL
Sbjct: 238 FFVAFCCIVLDNDFMLYYICPMHTLFTLMVYGSLGLFNKYNEVPSVMAIKIACCFLSVIL 297
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
IWEIPGVF+I W+P TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 298 IWEIPGVFEILWAPFTFLLGYKDPEPSKSNLPLLHEWHFRSGLDRYIWIIGMIYAYFHPN 357
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +LSIK IVT+++ YLWYE IYKLDK+TYNKYHPYTSWIPIT
Sbjct: 358 VERWMEKLEESETKVRLSIKGTIVTLSVMAAYLWYEYIYKLDKITYNKYHPYTSWIPIT 416
>gi|115452663|ref|NP_001049932.1| Os03g0314200 [Oryza sativa Japonica Group]
gi|108707811|gb|ABF95606.1| O-acetyltransferase, putative, expressed [Oryza sativa Japonica
Group]
gi|113548403|dbj|BAF11846.1| Os03g0314200 [Oryza sativa Japonica Group]
gi|215704728|dbj|BAG94756.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 552
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/417 (72%), Positives = 349/417 (83%), Gaps = 1/417 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M +TPGQVS LLG + VF AW Y+E L Y+K ++ K HSD NL ++ + K +D
Sbjct: 13 MAASTSLTPGQVSALLGFLWVFTAWAYAEVLYYRKNAASIKAHSDVNLAVMDSSSNKGED 72
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+ +LLE G+ ++ + + +S+ + + R +D A +LENR TLRA++EFG L YFYIC
Sbjct: 73 QVMLLEEGV-QAPVQKPVYASLTSQMFRLFLLDQALILENRLTLRAISEFGGHLLYFYIC 131
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTNLLG+S KNY+RD+FLFLY LL+IV+AMTS K H DKS F+GK+I YLNRHQTEEWK
Sbjct: 132 DRTNLLGESAKNYSRDMFLFLYFLLIIVAAMTSFKVHQDKSSFTGKSILYLNRHQTEEWK 191
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYF A EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLN
Sbjct: 192 GWMQVLFLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 251
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDY LYYICPMHTLFT+MVYGA+GI NKYNEIGSVM +K +ACFLVVIL
Sbjct: 252 FFVAFCCIVLNNDYTLYYICPMHTLFTLMVYGALGILNKYNEIGSVMAIKFVACFLVVIL 311
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
IWEIPGVF+I WSP TF+LGYTDP+KPDLPRLHEWHFRSGLDRYIWI+GMIYAYYHPT E
Sbjct: 312 IWEIPGVFEIVWSPFTFLLGYTDPSKPDLPRLHEWHFRSGLDRYIWIVGMIYAYYHPTVE 371
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
KWMEKLEE+E K KL IKA IV++AL G LWYE IYKLDK+TYNKYHPYTSWIPIT
Sbjct: 372 KWMEKLEEAETKTKLYIKALIVSIALTAGCLWYEYIYKLDKITYNKYHPYTSWIPIT 428
>gi|356565093|ref|XP_003550779.1| PREDICTED: CAS1 domain-containing protein 1-like [Glycine max]
Length = 545
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/419 (74%), Positives = 354/419 (84%), Gaps = 2/419 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M++ P+TPGQVSFLLGIIPV VAWIYSE LEY+ ++ SD NLVE+ + +K++D
Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEMLEYRNNYVPSRAQSDINLVEIGSDVVKDED 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RA LLEGG +S S + S + ++IRF+ MD+ FLLENR TLRAM+EFG IL YFY+C
Sbjct: 61 RAALLEGGALQSGSPKARSLTASPSIIRFLLMDEYFLLENRLTLRAMSEFGLILAYFYLC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT+ S K+YNRDLFLFLY LL+IVSAMTS K H+DKSP SGK+I YLNRHQTEEWK
Sbjct: 121 DRTDFFASSKKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLN
Sbjct: 181 GWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 240
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFV F CIVLNN YMLYYICPMHTLFT+MVYGA+GI +KYNEIGSV+ VKI+ CFLVVIL
Sbjct: 241 FFVVFSCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIGCFLVVIL 300
Query: 301 IWEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
+WEIPGVF+ WSP TF LGYTD PAK L RLHEWHFRSGLDRYIWIIGMIYAYYHPT
Sbjct: 301 VWEIPGVFEWLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYHPT 360
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEE+E KR++SIKA +V + VGYLW+E IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 361 VERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKVTYNKYHPYTSWIPIT 419
>gi|224108786|ref|XP_002314967.1| predicted protein [Populus trichocarpa]
gi|222864007|gb|EEF01138.1| predicted protein [Populus trichocarpa]
Length = 567
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/414 (74%), Positives = 352/414 (85%), Gaps = 2/414 (0%)
Query: 6 PITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD-RAVL 64
P+T GQ SFLLGI+PVF AWIY+E+LEYKK ++ K HSD LVEL E +KEDD RAVL
Sbjct: 6 PVTRGQFSFLLGIVPVFAAWIYAEYLEYKKNNTLAKAHSDIGLVELGNEAVKEDDDRAVL 65
Query: 65 LEGGLS-RSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRT 123
LEGG + AS + + + + RF+ M++ FL++NR TLRA+ EFG + YFYICDRT
Sbjct: 66 LEGGGGLQPASPKARTPTSSFPIFRFLMMEEQFLIDNRLTLRAILEFGFFMAYFYICDRT 125
Query: 124 NLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWM 183
++LG S K+YNRDLFLFLY LL+IVSA+TS H+DKSPFSGK I YLNRHQTEEWKGWM
Sbjct: 126 DMLGSSKKSYNRDLFLFLYFLLIIVSAVTSFTIHHDKSPFSGKPILYLNRHQTEEWKGWM 185
Query: 184 QVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFV 243
QVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLNF V
Sbjct: 186 QVLFLMYHYFAATEIYNAIRMFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLNFLV 245
Query: 244 AFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWE 303
FCC+VL+N YMLYYICPMHTLFT+MVY A+GIFNKYNEIGSVM KI+ACF VVIL+WE
Sbjct: 246 LFCCVVLDNSYMLYYICPMHTLFTLMVYAALGIFNKYNEIGSVMAAKIIACFFVVILMWE 305
Query: 304 IPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWM 363
IPGVF++ WSP TF++GYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHP E WM
Sbjct: 306 IPGVFEVIWSPFTFLVGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPKVEGWM 365
Query: 364 EKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
EKLEE+E KR++ IK + T++L VGY WYE IYKLDK++YNKYHPYTSWIPIT
Sbjct: 366 EKLEETEAKRRIPIKTAVATISLAVGYTWYEYIYKLDKISYNKYHPYTSWIPIT 419
>gi|222624809|gb|EEE58941.1| hypothetical protein OsJ_10613 [Oryza sativa Japonica Group]
Length = 550
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/427 (70%), Positives = 349/427 (81%), Gaps = 11/427 (2%)
Query: 1 MVVFRPITPGQ----------VSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVE 50
M +TPGQ VS LLG + VF AW Y+E L Y+K ++ K HSD NL
Sbjct: 1 MAASTSLTPGQGCGSKGVNHQVSALLGFLWVFTAWAYAEVLYYRKNAASIKAHSDVNLAV 60
Query: 51 LEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEF 110
++ + K +D+ +LLE G+ ++ + + +S+ + + R +D A +LENR TLRA++EF
Sbjct: 61 MDSSSNKGEDQVMLLEEGV-QAPVQKPVYASLTSQMFRLFLLDQALILENRLTLRAISEF 119
Query: 111 GAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQY 170
G L YFYICDRTNLLG+S KNY+RD+FLFLY LL+IV+AMTS K H DKS F+GK+I Y
Sbjct: 120 GGHLLYFYICDRTNLLGESAKNYSRDMFLFLYFLLIIVAAMTSFKVHQDKSSFTGKSILY 179
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLP 230
LNRHQTEEWKGWMQVLFLMYHYF A EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL
Sbjct: 180 LNRHQTEEWKGWMQVLFLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLA 239
Query: 231 RFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVK 290
RFAQMMWRLNFFVAFCCIVLNNDY LYYICPMHTLFT+MVYGA+GI NKYNEIGSVM +K
Sbjct: 240 RFAQMMWRLNFFVAFCCIVLNNDYTLYYICPMHTLFTLMVYGALGILNKYNEIGSVMAIK 299
Query: 291 ILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGM 350
+ACFLVVILIWEIPGVF+I WSP TF+LGYTDP+KPDLPRLHEWHFRSGLDRYIWI+GM
Sbjct: 300 FVACFLVVILIWEIPGVFEIVWSPFTFLLGYTDPSKPDLPRLHEWHFRSGLDRYIWIVGM 359
Query: 351 IYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPY 410
IYAYYHPT EKWMEKLEE+E K KL IKA IV++AL G LWYE IYKLDK+TYNKYHPY
Sbjct: 360 IYAYYHPTVEKWMEKLEEAETKTKLYIKALIVSIALTAGCLWYEYIYKLDKITYNKYHPY 419
Query: 411 TSWIPIT 417
TSWIPIT
Sbjct: 420 TSWIPIT 426
>gi|297604974|ref|NP_001056436.2| Os05g0582100 [Oryza sativa Japonica Group]
gi|255676607|dbj|BAF18350.2| Os05g0582100, partial [Oryza sativa Japonica Group]
Length = 529
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 309/408 (75%), Positives = 353/408 (86%), Gaps = 5/408 (1%)
Query: 12 VSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSR 71
VSFLLG+ PV + WIY+E LEY+K + KVHSD NL E ET+KEDD+AVLLEGG S+
Sbjct: 1 VSFLLGLFPVLIGWIYAEILEYRKSLLYGKVHSDANL---ENETMKEDDKAVLLEGGQSK 57
Query: 72 SASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTK 131
S S +L + S K NLIRF+TMD++FLLENRA LRAMAE G IL YFYICDRTN+ ++ K
Sbjct: 58 SPSTKLRNMSTKANLIRFITMDESFLLENRAVLRAMAEVGIILVYFYICDRTNIFPETKK 117
Query: 132 NYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYH 191
+YNRDLFLFLY+LL+I SA+TSLKKHN+KS F+GK+I YLNRHQTEEWKGWMQVLFLMYH
Sbjct: 118 SYNRDLFLFLYILLIIASALTSLKKHNEKSAFTGKSILYLNRHQTEEWKGWMQVLFLMYH 177
Query: 192 YFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLN 251
YFAATEIYNAIR+FIAAYVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLNFFVAFCCIVL+
Sbjct: 178 YFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLNFFVAFCCIVLD 237
Query: 252 NDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIF 311
NDYMLYYICPMHTLFT+MVYG++G+FNKYNE SVM +KI CFL VILIWEIPGVF+
Sbjct: 238 NDYMLYYICPMHTLFTLMVYGSLGLFNKYNEKPSVMAIKIACCFLTVILIWEIPGVFEFL 297
Query: 312 WSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEES 369
W+P TF+LGY D P+K +LP LHEWHFRSGLDRYIWIIGMIYAY+HP E+WMEKLEES
Sbjct: 298 WAPFTFLLGYKDPEPSKANLPLLHEWHFRSGLDRYIWIIGMIYAYFHPNVERWMEKLEES 357
Query: 370 EPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E K +L IK IVT++L GYLWYE IY+LDK+TYNKYHPYTSWIPIT
Sbjct: 358 ETKVRLFIKGAIVTLSLTAGYLWYEYIYRLDKITYNKYHPYTSWIPIT 405
>gi|224101517|ref|XP_002312313.1| predicted protein [Populus trichocarpa]
gi|222852133|gb|EEE89680.1| predicted protein [Populus trichocarpa]
Length = 576
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 302/411 (73%), Positives = 348/411 (84%), Gaps = 3/411 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M + P+TPGQ SFLLGI+PVF AWIY+E+LEYKK ++ K HSD LVEL E +KEDD
Sbjct: 26 MPMLSPVTPGQFSFLLGIVPVFAAWIYTEYLEYKKNNTLAKAHSDVGLVELGNEAVKEDD 85
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RAVLLEGG+ +SAS + SS+ + RF TM++ FL++NR TLRA++EFG + YFYIC
Sbjct: 86 RAVLLEGGV-QSASPKARSSTSTFPIFRFFTMEEQFLIDNRLTLRAISEFGFFMVYFYIC 144
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT++LG S K+YNRDLFLFLY LL+IVSA+TS K H+DKSPFSGK I YLNRHQTEEWK
Sbjct: 145 DRTDILGSSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKPILYLNRHQTEEWK 204
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATE YNAIR+FIA+YVWMTGFGNFSYYY+RKDFSL RFAQMMWRLN
Sbjct: 205 GWMQVLFLMYHYFAATEFYNAIRVFIASYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 264
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
F V CC+VLNN YMLYYICPMHTLFT+MVY A+GIFNKYNEIGSVM KI+ACFLVVIL
Sbjct: 265 FLVLVCCVVLNNSYMLYYICPMHTLFTLMVYAALGIFNKYNEIGSVMAAKIIACFLVVIL 324
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WEIPGVF++ WSP TF+ GYTDPAKPDLPRLHEWHFRSGLDRYIWI+GMIYAYYHP E
Sbjct: 325 MWEIPGVFEVVWSPFTFLFGYTDPAKPDLPRLHEWHFRSGLDRYIWIVGMIYAYYHPMVE 384
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVT--YNKYHP 409
WMEKLEE+E KR++SIK + T++L VGY+WYE IYKLDK + K HP
Sbjct: 385 GWMEKLEETEAKRRISIKTAVATISLAVGYMWYEYIYKLDKCVHLFEKCHP 435
>gi|357479083|ref|XP_003609827.1| CAS1 domain-containing protein [Medicago truncatula]
gi|355510882|gb|AES92024.1| CAS1 domain-containing protein [Medicago truncatula]
Length = 569
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 300/443 (67%), Positives = 352/443 (79%), Gaps = 26/443 (5%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M + P+TPGQVSFLLG+ PV +AWIYSE LE++K S +K HSD LVE+ + +K+++
Sbjct: 1 MHILSPVTPGQVSFLLGLFPVIIAWIYSEILEFRKNSLTSKAHSDIGLVEVRTDVVKDEE 60
Query: 61 RAVLLEGGLSRSASA--RLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFY 118
VLLEGG + AS + S + T++IRF +D+ FL ENR TLRAM+EFG +L Y+Y
Sbjct: 61 TTVLLEGGALQPASPTPKARSFTASTSIIRFFFLDEHFLHENRLTLRAMSEFGLLLAYYY 120
Query: 119 ICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEE 178
+CDRT+ G S K+YNRDLF+FLY LL+IVSA+TS H+DKSPFSGK+I YLNRHQTEE
Sbjct: 121 LCDRTDFFGSSKKSYNRDLFIFLYFLLIIVSAITSFTIHHDKSPFSGKSILYLNRHQTEE 180
Query: 179 WKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWR 238
WKGWMQVLFLMYHYFAA+EIYN+IR+FIAAYVWMTGFGNFSYYYIRKDFS+ RFAQMMWR
Sbjct: 181 WKGWMQVLFLMYHYFAASEIYNSIRLFIAAYVWMTGFGNFSYYYIRKDFSMARFAQMMWR 240
Query: 239 LNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVV 298
LNF V FCC+VLNN YMLYYICPMHTLFT+MVYGA+GI NKYNE GSV+ KI ACFLVV
Sbjct: 241 LNFLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEFGSVIAAKIGACFLVV 300
Query: 299 ILIWEIPGVFDIFWSPLTFILGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH 356
IL+WEIPGVF+ WSP TF+LGYT DP+K RLHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 301 ILVWEIPGVFEWVWSPFTFMLGYTDPDPSKSHFTRLHEWHFRSGLDRYIWIIGMIYAYYH 360
Query: 357 PT----------------------AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYE 394
PT E+WMEKLEE+E KR++SIKA +V ++ +GYLW+E
Sbjct: 361 PTVSFSFYIFEQAKTNLTFTSILSVERWMEKLEETEIKRRISIKASVVLISSVMGYLWFE 420
Query: 395 CIYKLDKVTYNKYHPYTSWIPIT 417
IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 421 YIYKLDKVTYNKYHPYTSWIPIT 443
>gi|357112507|ref|XP_003558050.1| PREDICTED: CAS1 domain-containing protein 1-like [Brachypodium
distachyon]
Length = 551
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 298/417 (71%), Positives = 346/417 (82%), Gaps = 1/417 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M +TPGQ S LLG + V AW+Y E L +KK ++ K HSD NL E++ + K +D
Sbjct: 13 MASSTSVTPGQASALLGFLWVLAAWVYGEVLSHKKNAASIKTHSDINLPEMDNSSNKAED 72
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+ LLE G ++A A+ + +S + +IR MD + LLE+R TLRA++E G L YFYIC
Sbjct: 73 QTKLLEEG-GQAAGAKPVYASFASQMIRLFFMDQSLLLEHRLTLRAISELGGHLLYFYIC 131
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTNL G+S KNY+RDLFLFLY LL+IV+A+TS K H DKS F+GK++ YLNRHQTEEWK
Sbjct: 132 DRTNLFGESEKNYSRDLFLFLYFLLIIVAAITSFKVHQDKSTFTGKSVLYLNRHQTEEWK 191
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYF A EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLN
Sbjct: 192 GWMQVLFLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLGRFAQMMWRLN 251
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
FFVAFCCIVLNNDY LYYICPMHTLFT+MVYGA+GI NKYNEI SV+ +K +ACFLVVIL
Sbjct: 252 FFVAFCCIVLNNDYTLYYICPMHTLFTLMVYGALGILNKYNEIRSVIAIKFVACFLVVIL 311
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WEIPGVF++ WSP TF+LGY DP+KPDLPRLHEWHFRSGLDRYIWI+GMIYAYYHPT E
Sbjct: 312 VWEIPGVFEMVWSPFTFLLGYNDPSKPDLPRLHEWHFRSGLDRYIWIVGMIYAYYHPTVE 371
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
KWMEKLEE+E + KL IKA IVTV+L VGYLWYE IYKLDK++YNKYHPYTSWIPIT
Sbjct: 372 KWMEKLEEAETRTKLYIKASIVTVSLTVGYLWYEYIYKLDKISYNKYHPYTSWIPIT 428
>gi|242035931|ref|XP_002465360.1| hypothetical protein SORBIDRAFT_01g037160 [Sorghum bicolor]
gi|241919214|gb|EER92358.1| hypothetical protein SORBIDRAFT_01g037160 [Sorghum bicolor]
Length = 553
Score = 623 bits (1606), Expect = e-176, Method: Compositional matrix adjust.
Identities = 299/412 (72%), Positives = 346/412 (83%), Gaps = 2/412 (0%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKV-HSDTNLVELEKETIKEDDRAVLL 65
+TPGQVS +LG + VF AW Y+E L ++K ++ K HSD NL ++ ++K +D+ +LL
Sbjct: 19 VTPGQVSAILGFLWVFAAWAYAEVLFHRKNTASIKTRHSDVNLAVMDNISVKAEDQTMLL 78
Query: 66 EGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNL 125
E G ++ A+ +S+ + ++R + MD LLENR TLRA++EFG L YFYICDRTNL
Sbjct: 79 EEG-GQAVVAKPAYTSLTSQILRLIFMDQMLLLENRLTLRAISEFGGYLLYFYICDRTNL 137
Query: 126 LGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQV 185
LG+S KNY+RDLFLFLY LL+IV+AMTS K H DKS F+GK+I YLNRHQTEEWKGWMQV
Sbjct: 138 LGESAKNYSRDLFLFLYFLLIIVAAMTSFKVHQDKSAFTGKSILYLNRHQTEEWKGWMQV 197
Query: 186 LFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAF 245
LFLMYHYF A EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLNFFV F
Sbjct: 198 LFLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLGRFAQMMWRLNFFVIF 257
Query: 246 CCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIP 305
CCIVLNNDY LYYICPMHTLFT+MVYGA+GI NKYNEI SVM +K +ACFLVV+L+WE+P
Sbjct: 258 CCIVLNNDYTLYYICPMHTLFTLMVYGALGILNKYNEIRSVMAMKFVACFLVVVLVWEVP 317
Query: 306 GVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEK 365
GVFDI WSP TF+LGYTDP+KPDLPRLHEW FRSGLDRYIWIIGMIYAYYHPT EKWMEK
Sbjct: 318 GVFDIVWSPFTFLLGYTDPSKPDLPRLHEWQFRSGLDRYIWIIGMIYAYYHPTVEKWMEK 377
Query: 366 LEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
LEE+E + KL IKA IVTV+L GYLWYE IYKLDK+TYNK HPYTSWIPIT
Sbjct: 378 LEETEMRTKLYIKASIVTVSLMAGYLWYEYIYKLDKITYNKLHPYTSWIPIT 429
>gi|147773557|emb|CAN63274.1| hypothetical protein VITISV_000398 [Vitis vinifera]
Length = 529
Score = 619 bits (1597), Expect = e-175, Method: Compositional matrix adjust.
Identities = 298/419 (71%), Positives = 344/419 (82%), Gaps = 25/419 (5%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M++ PITPGQV+F +G + VF AWIY+EFLEYKK + +K HSD NLVEL ET+KEDD
Sbjct: 1 MMIVGPITPGQVAFFIGFVSVFAAWIYAEFLEYKKNAFPSKTHSDLNLVEL-NETVKEDD 59
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RAVLLEGG +S S + SSS+ +++ RF+ M+++FL+E R TLRAM EFGA+L YFY+C
Sbjct: 60 RAVLLEGGGLQSVSPKARSSSVTSHIFRFLLMEESFLIEYRLTLRAMCEFGALLAYFYLC 119
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTNL GDS K+YNRDLF+FLY LL+IVSA+TS K H+DKSPFSGK+I YLNRHQTEEWK
Sbjct: 120 DRTNLFGDSKKSYNRDLFIFLYFLLIIVSAVTSFKVHHDKSPFSGKSILYLNRHQTEEWK 179
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL
Sbjct: 180 GWMQVLFLMYHYFAATEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLA---------- 229
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
YMLYYICPMHTLFT+MVYGA+GI NKYNEIG V+ VKI+ACFLVV+L
Sbjct: 230 ------------SYMLYYICPMHTLFTLMVYGALGILNKYNEIGLVIAVKIVACFLVVVL 277
Query: 301 IWEIPGVFDIFWSPLTFILGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
+WEIPGVF+ WSPLTFILGYT DP+K RLHEWHFRSGLDRYIWIIGMIYAYYHPT
Sbjct: 278 LWEIPGVFEFVWSPLTFILGYTDPDPSKQKFSRLHEWHFRSGLDRYIWIIGMIYAYYHPT 337
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEE+E K +++IK I TVAL VGYLW+E IYKLDK+TYNKYHPYTSWIPI+
Sbjct: 338 VERWMEKLEETEVKLRVAIKMAIATVALTVGYLWFEHIYKLDKLTYNKYHPYTSWIPIS 396
>gi|297833418|ref|XP_002884591.1| hypothetical protein ARALYDRAFT_477961 [Arabidopsis lyrata subsp.
lyrata]
gi|297330431|gb|EFH60850.1| hypothetical protein ARALYDRAFT_477961 [Arabidopsis lyrata subsp.
lyrata]
Length = 545
Score = 619 bits (1595), Expect = e-174, Method: Compositional matrix adjust.
Identities = 296/419 (70%), Positives = 345/419 (82%), Gaps = 2/419 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKV-HSDTNLVELEKETIKED 59
M + P+TPG +S +LGI+PV VAW+YSE+L Y K S K HSD NLVE+ K+ +KED
Sbjct: 1 MAISSPVTPGLMSVVLGIVPVIVAWLYSEYLHYAKHSVSAKTRHSDVNLVEIAKDFVKED 60
Query: 60 DRAVLLE-GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFY 118
D+A+L+E GG +SAS R + + LIRF+ +D++FL+ENR TLRA+ EF ++ YFY
Sbjct: 61 DKALLIEDGGGLQSASPRAKGPTTHSPLIRFVLLDESFLVENRLTLRAIIEFALLMVYFY 120
Query: 119 ICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEE 178
ICDRT++ S K+YNRDLFLFLY LL+IVSA+TS HNDKSPFSGK I YLNRHQTEE
Sbjct: 121 ICDRTDVFNSSKKSYNRDLFLFLYFLLIIVSAITSFTIHNDKSPFSGKAIMYLNRHQTEE 180
Query: 179 WKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWR 238
WKGWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSLPRFAQMMWR
Sbjct: 181 WKGWMQVLFLMYHYFAAAEYYNAIRVFIACYVWMTGFGNFSYYYIRKDFSLPRFAQMMWR 240
Query: 239 LNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVV 298
LNF V F CIVLNN YMLYYICPMHTLFT+MVYGA+GI NKYNE+GSV+ K ACF+VV
Sbjct: 241 LNFLVIFSCIVLNNSYMLYYICPMHTLFTLMVYGALGIMNKYNEMGSVIAAKFFACFVVV 300
Query: 299 ILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
I++WEIPGVF+ WSP T ++GY DPAKP LP LHEWHFRSGLDRYIWIIGM+YAYYHPT
Sbjct: 301 IIVWEIPGVFEWIWSPFTLLMGYNDPAKPQLPLLHEWHFRSGLDRYIWIIGMLYAYYHPT 360
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E WM+KLEE+E K +++IK + +AL VGY WYE IYKLDK+TYNKYHPYTSWIPIT
Sbjct: 361 VESWMDKLEEAEMKFRMAIKTTVALIALTVGYFWYEYIYKLDKLTYNKYHPYTSWIPIT 419
>gi|186509847|ref|NP_001118592.1| O-acetyltransferase family protein [Arabidopsis thaliana]
gi|51970928|dbj|BAD44156.1| unknown protein [Arabidopsis thaliana]
gi|332640893|gb|AEE74414.1| O-acetyltransferase family protein [Arabidopsis thaliana]
Length = 544
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 291/418 (69%), Positives = 341/418 (81%), Gaps = 1/418 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M P+TPG +S + GI+PV VAW+YSE+L Y K S K HSD NLVE+ K+ +KEDD
Sbjct: 1 MASSSPVTPGLMSVVFGIVPVIVAWLYSEYLHYAKYSVSAKTHSDVNLVEIAKDFVKEDD 60
Query: 61 RAVLLE-GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYI 119
+A+L+E GG +SAS R + + LIRF+ +D++FL+ENR TLRA+ EF ++ YFYI
Sbjct: 61 KALLIEDGGGLQSASPRAKGPTTHSPLIRFVLLDESFLVENRLTLRAIIEFAVLMVYFYI 120
Query: 120 CDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEW 179
CDRT++ S K+YNRDLFLFLY LL+IVSA+TS H DKSPFSGK I YLNRHQTEEW
Sbjct: 121 CDRTDVFNSSKKSYNRDLFLFLYFLLIIVSAITSFTIHTDKSPFSGKAIMYLNRHQTEEW 180
Query: 180 KGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRL 239
KGWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSL RFAQMMWRL
Sbjct: 181 KGWMQVLFLMYHYFAAAEYYNAIRVFIACYVWMTGFGNFSYYYIRKDFSLARFAQMMWRL 240
Query: 240 NFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVI 299
NF V F CIVLNN YMLYYICPMHTLFT+MVYGA+GI +KYNE+GSV+ K ACF+VVI
Sbjct: 241 NFLVIFSCIVLNNSYMLYYICPMHTLFTLMVYGALGIMSKYNEMGSVIAAKFFACFVVVI 300
Query: 300 LIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTA 359
++WEIPGVF+ WSP T ++GY DPAKP LP LHEWHFRSGLDRYIWIIGM+YAYYHPT
Sbjct: 301 IVWEIPGVFEWIWSPFTLLMGYNDPAKPQLPLLHEWHFRSGLDRYIWIIGMLYAYYHPTV 360
Query: 360 EKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E WM+KLEE+E K +++IK + +AL VGY WYE IYK+DK+TYNKYHPYTSWIPIT
Sbjct: 361 ESWMDKLEEAEMKFRVAIKTSVALIALTVGYFWYEYIYKMDKLTYNKYHPYTSWIPIT 418
>gi|12322675|gb|AAG51327.1|AC020580_7 unknown protein; 43703-46116 [Arabidopsis thaliana]
gi|46518437|gb|AAS99700.1| At3g06550 [Arabidopsis thaliana]
Length = 418
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 291/418 (69%), Positives = 341/418 (81%), Gaps = 1/418 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M P+TPG +S + GI+PV VAW+YSE+L Y K S K HSD NLVE+ K+ +KEDD
Sbjct: 1 MASSSPVTPGLMSVVFGIVPVIVAWLYSEYLHYAKYSVSAKTHSDVNLVEIAKDFVKEDD 60
Query: 61 RAVLLE-GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYI 119
+A+L+E GG +SAS R + + LIRF+ +D++FL+ENR TLRA+ EF ++ YFYI
Sbjct: 61 KALLIEDGGGLQSASPRAKGPTTHSPLIRFVLLDESFLVENRLTLRAIIEFAVLMVYFYI 120
Query: 120 CDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEW 179
CDRT++ S K+YNRDLFLFLY LL+IVSA+TS H DKSPFSGK I YLNRHQTEEW
Sbjct: 121 CDRTDVFNSSKKSYNRDLFLFLYFLLIIVSAITSFTIHTDKSPFSGKAIMYLNRHQTEEW 180
Query: 180 KGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRL 239
KGWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSL RFAQMMWRL
Sbjct: 181 KGWMQVLFLMYHYFAAAEYYNAIRVFIACYVWMTGFGNFSYYYIRKDFSLARFAQMMWRL 240
Query: 240 NFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVI 299
NF V F CIVLNN YMLYYICPMHTLFT+MVYGA+GI +KYNE+GSV+ K ACF+VVI
Sbjct: 241 NFLVIFSCIVLNNSYMLYYICPMHTLFTLMVYGALGIMSKYNEMGSVIAAKFFACFVVVI 300
Query: 300 LIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTA 359
++WEIPGVF+ WSP T ++GY DPAKP LP LHEWHFRSGLDRYIWIIGM+YAYYHPT
Sbjct: 301 IVWEIPGVFEWIWSPFTLLMGYNDPAKPQLPLLHEWHFRSGLDRYIWIIGMLYAYYHPTV 360
Query: 360 EKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E WM+KLEE+E K +++IK + +AL VGY WYE IYK+DK+TYNKYHPYTSWIPIT
Sbjct: 361 ESWMDKLEEAEMKFRVAIKTSVALIALTVGYFWYEYIYKMDKLTYNKYHPYTSWIPIT 418
>gi|145338189|ref|NP_187307.3| O-acetyltransferase family protein [Arabidopsis thaliana]
gi|110741438|dbj|BAE98681.1| hypothetical protein [Arabidopsis thaliana]
gi|332640891|gb|AEE74412.1| O-acetyltransferase family protein [Arabidopsis thaliana]
Length = 545
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 291/419 (69%), Positives = 341/419 (81%), Gaps = 2/419 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKV-HSDTNLVELEKETIKED 59
M P+TPG +S + GI+PV VAW+YSE+L Y K S K HSD NLVE+ K+ +KED
Sbjct: 1 MASSSPVTPGLMSVVFGIVPVIVAWLYSEYLHYAKYSVSAKTRHSDVNLVEIAKDFVKED 60
Query: 60 DRAVLLE-GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFY 118
D+A+L+E GG +SAS R + + LIRF+ +D++FL+ENR TLRA+ EF ++ YFY
Sbjct: 61 DKALLIEDGGGLQSASPRAKGPTTHSPLIRFVLLDESFLVENRLTLRAIIEFAVLMVYFY 120
Query: 119 ICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEE 178
ICDRT++ S K+YNRDLFLFLY LL+IVSA+TS H DKSPFSGK I YLNRHQTEE
Sbjct: 121 ICDRTDVFNSSKKSYNRDLFLFLYFLLIIVSAITSFTIHTDKSPFSGKAIMYLNRHQTEE 180
Query: 179 WKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWR 238
WKGWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSL RFAQMMWR
Sbjct: 181 WKGWMQVLFLMYHYFAAAEYYNAIRVFIACYVWMTGFGNFSYYYIRKDFSLARFAQMMWR 240
Query: 239 LNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVV 298
LNF V F CIVLNN YMLYYICPMHTLFT+MVYGA+GI +KYNE+GSV+ K ACF+VV
Sbjct: 241 LNFLVIFSCIVLNNSYMLYYICPMHTLFTLMVYGALGIMSKYNEMGSVIAAKFFACFVVV 300
Query: 299 ILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
I++WEIPGVF+ WSP T ++GY DPAKP LP LHEWHFRSGLDRYIWIIGM+YAYYHPT
Sbjct: 301 IIVWEIPGVFEWIWSPFTLLMGYNDPAKPQLPLLHEWHFRSGLDRYIWIIGMLYAYYHPT 360
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E WM+KLEE+E K +++IK + +AL VGY WYE IYK+DK+TYNKYHPYTSWIPIT
Sbjct: 361 VESWMDKLEEAEMKFRVAIKTSVALIALTVGYFWYEYIYKMDKLTYNKYHPYTSWIPIT 419
>gi|413955894|gb|AFW88543.1| hypothetical protein ZEAMMB73_609012 [Zea mays]
Length = 552
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 293/411 (71%), Positives = 346/411 (84%), Gaps = 1/411 (0%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLE 66
++PGQVS +LG + VF AW Y+E L ++K ++ K HSD NL ++ ++K +D+ +LLE
Sbjct: 19 VSPGQVSAILGFLWVFAAWAYAEVLFHRKNTASIKTHSDVNLAVMDDSSVKAEDQTMLLE 78
Query: 67 GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLL 126
G ++ +A+ +S+ + ++R + MD LLENR TLRA++EFG L YFYICDRTNLL
Sbjct: 79 EG-GQAMAAKPAYTSLTSQILRLIFMDQLLLLENRLTLRALSEFGGYLLYFYICDRTNLL 137
Query: 127 GDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVL 186
G+S KNY+RDLFLFLY LL+IV+AMTS K H DKS F+GK++ YLNRHQTEEWKGWMQVL
Sbjct: 138 GESAKNYSRDLFLFLYFLLIIVAAMTSFKVHQDKSSFTGKSVLYLNRHQTEEWKGWMQVL 197
Query: 187 FLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFC 246
FLMYHYF A EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLNFFV FC
Sbjct: 198 FLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLGRFAQMMWRLNFFVIFC 257
Query: 247 CIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPG 306
CIVLNNDY LYYICPMHTLFT+MVYGA+GI NKYNEI SVM +K +ACFLVV+L+WE+PG
Sbjct: 258 CIVLNNDYTLYYICPMHTLFTLMVYGALGILNKYNEIRSVMAMKFVACFLVVVLVWEVPG 317
Query: 307 VFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKL 366
VFDI WSP TF+LGYTDP+KPDLPRLHEW FRSGLDRYIWI+GMIYAYYHPT EKW+EKL
Sbjct: 318 VFDIVWSPFTFLLGYTDPSKPDLPRLHEWQFRSGLDRYIWIVGMIYAYYHPTVEKWLEKL 377
Query: 367 EESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
EE+E + KL IK IVTV++ GYLWYE IYKLDK+TYNK HPYTSWIPIT
Sbjct: 378 EETEMRTKLYIKTSIVTVSMMAGYLWYEYIYKLDKITYNKLHPYTSWIPIT 428
>gi|145331988|ref|NP_001078116.1| O-acetyltransferase family protein [Arabidopsis thaliana]
gi|332640892|gb|AEE74413.1| O-acetyltransferase family protein [Arabidopsis thaliana]
Length = 568
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 291/419 (69%), Positives = 341/419 (81%), Gaps = 2/419 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKV-HSDTNLVELEKETIKED 59
M P+TPG +S + GI+PV VAW+YSE+L Y K S K HSD NLVE+ K+ +KED
Sbjct: 1 MASSSPVTPGLMSVVFGIVPVIVAWLYSEYLHYAKYSVSAKTRHSDVNLVEIAKDFVKED 60
Query: 60 DRAVLLE-GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFY 118
D+A+L+E GG +SAS R + + LIRF+ +D++FL+ENR TLRA+ EF ++ YFY
Sbjct: 61 DKALLIEDGGGLQSASPRAKGPTTHSPLIRFVLLDESFLVENRLTLRAIIEFAVLMVYFY 120
Query: 119 ICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEE 178
ICDRT++ S K+YNRDLFLFLY LL+IVSA+TS H DKSPFSGK I YLNRHQTEE
Sbjct: 121 ICDRTDVFNSSKKSYNRDLFLFLYFLLIIVSAITSFTIHTDKSPFSGKAIMYLNRHQTEE 180
Query: 179 WKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWR 238
WKGWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSL RFAQMMWR
Sbjct: 181 WKGWMQVLFLMYHYFAAAEYYNAIRVFIACYVWMTGFGNFSYYYIRKDFSLARFAQMMWR 240
Query: 239 LNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVV 298
LNF V F CIVLNN YMLYYICPMHTLFT+MVYGA+GI +KYNE+GSV+ K ACF+VV
Sbjct: 241 LNFLVIFSCIVLNNSYMLYYICPMHTLFTLMVYGALGIMSKYNEMGSVIAAKFFACFVVV 300
Query: 299 ILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
I++WEIPGVF+ WSP T ++GY DPAKP LP LHEWHFRSGLDRYIWIIGM+YAYYHPT
Sbjct: 301 IIVWEIPGVFEWIWSPFTLLMGYNDPAKPQLPLLHEWHFRSGLDRYIWIIGMLYAYYHPT 360
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E WM+KLEE+E K +++IK + +AL VGY WYE IYK+DK+TYNKYHPYTSWIPIT
Sbjct: 361 VESWMDKLEEAEMKFRVAIKTSVALIALTVGYFWYEYIYKMDKLTYNKYHPYTSWIPIT 419
>gi|302773051|ref|XP_002969943.1| hypothetical protein SELMODRAFT_146620 [Selaginella moellendorffii]
gi|300162454|gb|EFJ29067.1| hypothetical protein SELMODRAFT_146620 [Selaginella moellendorffii]
Length = 542
Score = 607 bits (1565), Expect = e-171, Method: Compositional matrix adjust.
Identities = 287/417 (68%), Positives = 336/417 (80%), Gaps = 1/417 (0%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MV P T GQV+ +LG IPV AW+YSEFLEY+K K HSD NL ELE ++++
Sbjct: 1 MVEISPPTTGQVALVLGFIPVLTAWLYSEFLEYRKQPVPGKAHSDINLSELEHGPRRDNE 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
+ LLE G S S + + S SI+ L +F T+++ FL+ENR+ LRA+AEFG +L YFYIC
Sbjct: 61 KDSLLENGFSVSGTLKG-SFSIRMQLFKFFTLNETFLVENRSLLRAIAEFGCLLCYFYIC 119
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRTN+ G+ KNY+RDLF+FLY LL+IVS++TSLKKH +KS SGK+I YLNRHQTEEWK
Sbjct: 120 DRTNVFGELKKNYSRDLFVFLYFLLIIVSSITSLKKHAEKSVASGKSILYLNRHQTEEWK 179
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQVLFLMYHYFAA EIYNAIR+FIA YVWMTGFGNFSYYY+RKDFSL RFAQMMWRLN
Sbjct: 180 GWMQVLFLMYHYFAAAEIYNAIRLFIAGYVWMTGFGNFSYYYVRKDFSLGRFAQMMWRLN 239
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
F V FCCIVLNN YMLYYICPMHTLFT+MVY ++GI NKYNE+ SV+ KI ACF VVIL
Sbjct: 240 FLVTFCCIVLNNSYMLYYICPMHTLFTLMVYCSLGILNKYNEVPSVIGAKIAACFAVVIL 299
Query: 301 IWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAE 360
+WE+PGVFD W P TF++ YTDP KPDLP LHEWHFRSGLDRYIWI GMI AY+HPT E
Sbjct: 300 VWEVPGVFDFVWRPFTFLVEYTDPGKPDLPVLHEWHFRSGLDRYIWIYGMICAYFHPTVE 359
Query: 361 KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
+W+EKLEE E +RK + K+ IV VA VGYLWY IYKLDK++YNK HPYTSWIPI+
Sbjct: 360 RWLEKLEELECRRKFTYKSVIVFVASLVGYLWYVHIYKLDKLSYNKLHPYTSWIPIS 416
>gi|413955893|gb|AFW88542.1| hypothetical protein ZEAMMB73_609012 [Zea mays]
Length = 553
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 293/412 (71%), Positives = 346/412 (83%), Gaps = 2/412 (0%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKV-HSDTNLVELEKETIKEDDRAVLL 65
++PGQVS +LG + VF AW Y+E L ++K ++ K HSD NL ++ ++K +D+ +LL
Sbjct: 19 VSPGQVSAILGFLWVFAAWAYAEVLFHRKNTASIKTRHSDVNLAVMDDSSVKAEDQTMLL 78
Query: 66 EGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNL 125
E G ++ +A+ +S+ + ++R + MD LLENR TLRA++EFG L YFYICDRTNL
Sbjct: 79 EEG-GQAMAAKPAYTSLTSQILRLIFMDQLLLLENRLTLRALSEFGGYLLYFYICDRTNL 137
Query: 126 LGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQV 185
LG+S KNY+RDLFLFLY LL+IV+AMTS K H DKS F+GK++ YLNRHQTEEWKGWMQV
Sbjct: 138 LGESAKNYSRDLFLFLYFLLIIVAAMTSFKVHQDKSSFTGKSVLYLNRHQTEEWKGWMQV 197
Query: 186 LFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAF 245
LFLMYHYF A EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLNFFV F
Sbjct: 198 LFLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLGRFAQMMWRLNFFVIF 257
Query: 246 CCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIP 305
CCIVLNNDY LYYICPMHTLFT+MVYGA+GI NKYNEI SVM +K +ACFLVV+L+WE+P
Sbjct: 258 CCIVLNNDYTLYYICPMHTLFTLMVYGALGILNKYNEIRSVMAMKFVACFLVVVLVWEVP 317
Query: 306 GVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEK 365
GVFDI WSP TF+LGYTDP+KPDLPRLHEW FRSGLDRYIWI+GMIYAYYHPT EKW+EK
Sbjct: 318 GVFDIVWSPFTFLLGYTDPSKPDLPRLHEWQFRSGLDRYIWIVGMIYAYYHPTVEKWLEK 377
Query: 366 LEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
LEE+E + KL IK IVTV++ GYLWYE IYKLDK+TYNK HPYTSWIPIT
Sbjct: 378 LEETEMRTKLYIKTSIVTVSMMAGYLWYEYIYKLDKITYNKLHPYTSWIPIT 429
>gi|414588192|tpg|DAA38763.1| TPA: hypothetical protein ZEAMMB73_588942 [Zea mays]
Length = 534
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 285/408 (69%), Positives = 334/408 (81%), Gaps = 5/408 (1%)
Query: 12 VSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSR 71
V FLLG+ V AWIYSEFL Y+ SSH KVHSD V + +TIKE+DRAVLLE G S+
Sbjct: 5 VPFLLGLSSVLAAWIYSEFLGYRASSSHEKVHSD---VHVGDKTIKENDRAVLLEEGESK 61
Query: 72 SASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTK 131
S + + S + LIRF+T+D++FL+ NRATLRA+AEFG IL YFY CDRTN+ +S K
Sbjct: 62 PPSTKAPNMSARAKLIRFITLDESFLVGNRATLRAIAEFGIILVYFYTCDRTNIFAESKK 121
Query: 132 NYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYH 191
NYNRD+FLFLY+LLV+ S +TSLKKH + S SGKT+ YLNR+QT+EW+GWMQVLFLMYH
Sbjct: 122 NYNRDMFLFLYILLVVASTITSLKKHPENSVVSGKTVFYLNRNQTDEWRGWMQVLFLMYH 181
Query: 192 YFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLN 251
YFAA+EIYNAIR+FIA YVWMTGFGNFSYYY +KDFS+ RFAQMMWR+NFF CC+VL+
Sbjct: 182 YFAASEIYNAIRVFIACYVWMTGFGNFSYYYKKKDFSIARFAQMMWRVNFFAMSCCMVLD 241
Query: 252 NDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIF 311
NDYMLYYI PMHTLFT+MVYG++ + NKYNEI SVM +KI C L VILIWEIPGVF+IF
Sbjct: 242 NDYMLYYIAPMHTLFTLMVYGSLFVLNKYNEIPSVMAIKIAGCLLTVILIWEIPGVFEIF 301
Query: 312 WSPLTFILGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEES 369
W+P TF+LGY +P+K +LP LHEW FRSGLDRYIWIIGMIYAY+HP E+WMEKLEES
Sbjct: 302 WAPFTFLLGYKNPEPSKMNLPLLHEWRFRSGLDRYIWIIGMIYAYFHPNVERWMEKLEES 361
Query: 370 EPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E K ++ IK IVT +L GYLWYE IYKLDK TYNKYHPYTSWIPIT
Sbjct: 362 ETKVRVLIKGTIVTASLMAGYLWYEYIYKLDKHTYNKYHPYTSWIPIT 409
>gi|302799308|ref|XP_002981413.1| hypothetical protein SELMODRAFT_114335 [Selaginella moellendorffii]
gi|300150953|gb|EFJ17601.1| hypothetical protein SELMODRAFT_114335 [Selaginella moellendorffii]
Length = 557
Score = 595 bits (1535), Expect = e-167, Method: Compositional matrix adjust.
Identities = 280/406 (68%), Positives = 330/406 (81%), Gaps = 1/406 (0%)
Query: 12 VSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSR 71
V+ +LG IPV AW+YSEFLEY+K K HSD NL ELE +++++ LLE G S
Sbjct: 1 VALVLGFIPVLTAWLYSEFLEYRKQPVPGKAHSDINLSELEHGPRRDNEKDSLLENGFSV 60
Query: 72 SASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTK 131
S + + S SI+ L +F T+++ FL+ENR+ LRA+AEFG +L YFYICDRTN+ G+ K
Sbjct: 61 SGTLKG-SFSIRMQLFKFFTLNETFLVENRSLLRAIAEFGCLLCYFYICDRTNVFGELKK 119
Query: 132 NYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYH 191
NY+RDLF+FLY LL+IVS++TSLKKH +KS SGK+I YLNRHQTEEWKGWMQVLFLMYH
Sbjct: 120 NYSRDLFVFLYFLLIIVSSITSLKKHAEKSVASGKSILYLNRHQTEEWKGWMQVLFLMYH 179
Query: 192 YFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLN 251
YFAA EIYNAIR+FIA YVWMTGFGNFSYYY+RKDFSL RFAQMMWRLNF V FCCIVLN
Sbjct: 180 YFAAAEIYNAIRLFIAGYVWMTGFGNFSYYYVRKDFSLGRFAQMMWRLNFLVTFCCIVLN 239
Query: 252 NDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIF 311
N YMLYYICPMHTLFT+MVY ++GI NKYNE+ SV+ KI ACF VVIL+WE+PGVFD
Sbjct: 240 NSYMLYYICPMHTLFTLMVYCSLGILNKYNEVPSVIGAKIAACFAVVILVWEVPGVFDFV 299
Query: 312 WSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEP 371
W P TF++ YTDP KPDLP LHEWHFRSGLDRYIWI GMI AY+HPT E+W+EKLEE E
Sbjct: 300 WRPFTFLVEYTDPGKPDLPVLHEWHFRSGLDRYIWIYGMICAYFHPTVERWLEKLEELEC 359
Query: 372 KRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
+RK + K+ IV VA VGYLWY IY+LDK++YNK HPYTSWIPI+
Sbjct: 360 RRKFTYKSVIVFVASLVGYLWYVHIYRLDKLSYNKLHPYTSWIPIS 405
>gi|42562402|ref|NP_174282.2| O-acetyltransferase-like protein [Arabidopsis thaliana]
gi|332193025|gb|AEE31146.1| O-acetyltransferase-like protein [Arabidopsis thaliana]
Length = 470
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 274/330 (83%), Positives = 305/330 (92%)
Query: 88 RFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVI 147
RF+T++D+FLLENRATLRAMAEFGAIL YFYICDRT+L+G S KNY+RDLFLFL+ LL+I
Sbjct: 18 RFLTLEDSFLLENRATLRAMAEFGAILLYFYICDRTSLIGQSQKNYSRDLFLFLFCLLII 77
Query: 148 VSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIA 207
VSAMTSLKKH DKSP +GK+I YLNRHQTEEWKGWMQVLFLMYHYFAA E YNAIR+FIA
Sbjct: 78 VSAMTSLKKHTDKSPITGKSILYLNRHQTEEWKGWMQVLFLMYHYFAAVEFYNAIRVFIA 137
Query: 208 AYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFT 267
YVWMTGFGNFSYYYIRKDFSL RF QMMWRLNFFVAFCCI+LNNDYMLYYICPMHTLFT
Sbjct: 138 GYVWMTGFGNFSYYYIRKDFSLARFTQMMWRLNFFVAFCCIILNNDYMLYYICPMHTLFT 197
Query: 268 IMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKP 327
+MVYGA+GI+++YNEI SVM +KI +CFLVVIL+WEIPGVF+IFWSPL F+LGYTDPAKP
Sbjct: 198 LMVYGALGIYSQYNEIASVMALKIASCFLVVILMWEIPGVFEIFWSPLAFLLGYTDPAKP 257
Query: 328 DLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALF 387
DLPRLHEWHFRSGLDRYIWIIGMIYAY+HPT E+WMEKLEE + KR++SIK I+ ++ F
Sbjct: 258 DLPRLHEWHFRSGLDRYIWIIGMIYAYFHPTVERWMEKLEECDAKRRMSIKTSIIGISSF 317
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
GYLWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 318 AGYLWYEYIYKLDKVTYNKYHPYTSWIPIT 347
>gi|167999823|ref|XP_001752616.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696147|gb|EDQ82487.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 598
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 271/429 (63%), Positives = 344/429 (80%), Gaps = 4/429 (0%)
Query: 11 QVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLS 70
+V+F LG++P+ VAWIY+E LE++K S KVH+D NL EL T K++++A LLE GLS
Sbjct: 50 KVAFFLGLMPLIVAWIYAEILEFRK-RSLNKVHNDVNLEELNDGTSKDEEKAALLEAGLS 108
Query: 71 RSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDST 130
S +++S++ +L++F T+++AFL+ENR TLRA+AEF +L ++Y+CDRTN S
Sbjct: 109 ASGVVPAVTASVQASLLKFCTLNEAFLIENRLTLRAIAEFSGLLCFYYLCDRTNFFSASR 168
Query: 131 KNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMY 190
++Y+RDLFLFLYLLLV+ SA+TSLKKHNDKS +GK+ YLNRHQTEEWKGWMQVLFLMY
Sbjct: 169 RHYSRDLFLFLYLLLVLASALTSLKKHNDKSAATGKSFLYLNRHQTEEWKGWMQVLFLMY 228
Query: 191 HYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVL 250
HYF A E YNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RFAQMMWRLNF V F CIVL
Sbjct: 229 HYFEAKEFYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLGRFAQMMWRLNFLVFFSCIVL 288
Query: 251 NNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDI 310
NNDYMLYYICPMHTLFT+MVYG +GI++KYNE+ +VM KI FLVVI++WEIPG+FDI
Sbjct: 289 NNDYMLYYICPMHTLFTLMVYGCLGIYSKYNEVPTVMATKIFISFLVVIIVWEIPGMFDI 348
Query: 311 FWSPLTFILGYTDP--AKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEE 368
WSPLT +L Y +P A P P LHEWHFRSGLDRYIWI+GM+YAY+HPT E+W+EKLEE
Sbjct: 349 LWSPLTILLAYQNPSHATPQ-PLLHEWHFRSGLDRYIWIVGMLYAYFHPTVERWLEKLEE 407
Query: 369 SEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSL 428
E K + +K+ +++ A+ +GYLWY + LD+V+YN+ HPYTSWIPIT + + F
Sbjct: 408 MESKSRAFVKSVVISAAVLMGYLWYVHVCTLDRVSYNRVHPYTSWIPITVYIVLRNFSQS 467
Query: 429 VKHLSGSLY 437
+++ S +L+
Sbjct: 468 LRNYSLTLF 476
>gi|9972357|gb|AAG10607.1|AC008030_7 Unknown protein [Arabidopsis thaliana]
Length = 398
Score = 560 bits (1442), Expect = e-157, Method: Compositional matrix adjust.
Identities = 278/377 (73%), Positives = 310/377 (82%), Gaps = 26/377 (6%)
Query: 55 TIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAIL 114
T KED+ VL+EGGL RSAS++ F+T++D+FLLENRATLRAMAEFGAIL
Sbjct: 3 TNKEDEGTVLMEGGLPRSASSK------------FLTLEDSFLLENRATLRAMAEFGAIL 50
Query: 115 FYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRH 174
YFYICDRT+L+G S KNY+RDLFLFL+ LL+IVSAMTSLKKH DKSP +GK+I YLNRH
Sbjct: 51 LYFYICDRTSLIGQSQKNYSRDLFLFLFCLLIIVSAMTSLKKHTDKSPITGKSILYLNRH 110
Query: 175 QTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQ 234
QTEEWKGWMQVLFLMYHYFAA E YNAIR+FIA YVWMTGFGNFSYYYIRKDFSL RF Q
Sbjct: 111 QTEEWKGWMQVLFLMYHYFAAVEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFTQ 170
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
MMWRLNFFVAFCCI+LNNDYMLYYICPMHTLFT+MVYGA+GI+++YNEI SVM +KI +C
Sbjct: 171 MMWRLNFFVAFCCIILNNDYMLYYICPMHTLFTLMVYGALGIYSQYNEIASVMALKIASC 230
Query: 295 FLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY 354
FLVVIL+WEIPGVF+IFWSPL F+LGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY
Sbjct: 231 FLVVILMWEIPGVFEIFWSPLAFLLGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY 290
Query: 355 YHPTA--EKWMEKLEESEPKRK----LSIKAGIVTVALFV--------GYLWYECIYKLD 400
+HPT E W ++ L I + +FV GYLWYE IYKLD
Sbjct: 291 FHPTVIPEPWCFHFFHISSRKTQLNILGISVFSTIILMFVSWFPNVQAGYLWYEYIYKLD 350
Query: 401 KVTYNKYHPYTSWIPIT 417
KVTYNKYHPYTSWIPIT
Sbjct: 351 KVTYNKYHPYTSWIPIT 367
>gi|168025452|ref|XP_001765248.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683567|gb|EDQ69976.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 535
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 270/419 (64%), Positives = 335/419 (79%), Gaps = 8/419 (1%)
Query: 19 IPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSRSASARLL 78
+P+ ++WIY+E L+YKK KVH D L L+ + K++++A LLE GL S + R
Sbjct: 1 MPLIISWIYAEILDYKKHPPLIKVHDDIKLEVLKDGSSKDEEKAALLEAGLPISGTGRAE 60
Query: 79 SSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLF 138
++S++ +LI+F+T+++ FL+ENR TLRA+AEF +L + Y+CDRTN S K+Y+RDLF
Sbjct: 61 AASVRASLIKFLTLNETFLVENRLTLRAIAEFLGLLCFLYLCDRTNTFSASRKHYSRDLF 120
Query: 139 LFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI 198
LFLYLLLV+ SA+TSLKKH DKS +GK+I YLNRHQTEEWKGWMQVLFLMYHYF A E
Sbjct: 121 LFLYLLLVLASALTSLKKHADKSAAAGKSIFYLNRHQTEEWKGWMQVLFLMYHYFEAKEF 180
Query: 199 YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYY 258
YNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RFAQMMWRLNF V FCCIVLNNDYMLYY
Sbjct: 181 YNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLGRFAQMMWRLNFLVFFCCIVLNNDYMLYY 240
Query: 259 ICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFI 318
ICPMHTLFT+MVYG +GI++KYNE+ +VM +K++ FLVV+++WEIPGVFD+ WSPLTF+
Sbjct: 241 ICPMHTLFTLMVYGCLGIYSKYNEVPAVMTIKMILSFLVVVIVWEIPGVFDLLWSPLTFL 300
Query: 319 LGYTDP-AKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSI 377
L Y DP AK P LHEWHFRSGLDRYIWIIGM+YAY+HPT E W+EKLEE E K K +
Sbjct: 301 LAYQDPLAKSPAPLLHEWHFRSGLDRYIWIIGMLYAYFHPTVETWLEKLEEMEYKIKSVV 360
Query: 378 KAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSL 436
K +V+ A+ +GYLWY IY LDK++YNK HPYTSWIPIT + ++++LS SL
Sbjct: 361 KFAVVSAAVLMGYLWYVHIYTLDKISYNKLHPYTSWIPIT-------VYIILRNLSQSL 412
>gi|168062596|ref|XP_001783265.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665269|gb|EDQ51960.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 527
Score = 543 bits (1398), Expect = e-152, Method: Compositional matrix adjust.
Identities = 264/400 (66%), Positives = 329/400 (82%), Gaps = 1/400 (0%)
Query: 19 IPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSRSASARLL 78
+P+ +AWIY+E L++KK + TK H D +L EL+ +IK++++A LLE LS S S R
Sbjct: 1 MPLIIAWIYAEILDFKKFPTSTKGHDDVDLEELKDGSIKDEEKATLLESSLSVSGSFRSG 60
Query: 79 SSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLF 138
+ ++T+LI+F+T+++ FL+ENR TLRA+AEF +L + Y+CDRTN+ S K+Y+RDLF
Sbjct: 61 PAPVRTSLIKFLTLNETFLVENRLTLRAIAEFLGLLCFLYLCDRTNIFSASRKHYSRDLF 120
Query: 139 LFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI 198
LFLYLLLV+ SA+TSLKKH DKS +GK+I YLNRHQTEEWKGWMQVLFLMYHYF A E
Sbjct: 121 LFLYLLLVLASALTSLKKHVDKSIATGKSILYLNRHQTEEWKGWMQVLFLMYHYFEAKEF 180
Query: 199 YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYY 258
YNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RFAQMMWRLNF V F CIVLNNDYMLYY
Sbjct: 181 YNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLGRFAQMMWRLNFLVFFACIVLNNDYMLYY 240
Query: 259 ICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFI 318
ICPMHTLFT++VYG +GI++KYNE+ SVM KI+ FLVV+++WEIPGVF++ WSPLTF+
Sbjct: 241 ICPMHTLFTLLVYGCLGIYSKYNEVPSVMFTKIVVSFLVVVIVWEIPGVFELLWSPLTFL 300
Query: 319 LGYTDP-AKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSI 377
L Y DP +K P LHEWHFRSGLDRYIWI+GM+YAY+HPT E+W+EK+EE E KR+ I
Sbjct: 301 LAYQDPLSKHPTPLLHEWHFRSGLDRYIWIVGMLYAYFHPTVERWLEKVEEMEYKRRTFI 360
Query: 378 KAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
K+ +++ + +GY+WY IY LDK +YNK HPYTSWIPIT
Sbjct: 361 KSAVISATMLMGYMWYVHIYTLDKTSYNKLHPYTSWIPIT 400
>gi|218192689|gb|EEC75116.1| hypothetical protein OsI_11296 [Oryza sativa Indica Group]
Length = 519
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 274/431 (63%), Positives = 315/431 (73%), Gaps = 50/431 (11%)
Query: 1 MVVFRPITPGQ----------VSFLLGIIPVFVAWIYSEFLEYKKVSSHTKV-HSDTNLV 49
M +TPGQ VS LLG + VF AW Y+E L Y+K ++ K HSD NL
Sbjct: 1 MAASTSLTPGQGCGSKGVNHQVSALLGFLWVFTAWAYAEVLYYRKNAASIKARHSDVNLA 60
Query: 50 ELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAE 109
++ + K ++ + ++ GG S+ K+ +FM +
Sbjct: 61 VMDSSSNKGEESSDVIGGGCP--------STQCKS---QFMLLSH--------------- 94
Query: 110 FGAILFYFYICDRT---NLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGK 166
+ C T + + NY+RD+FLFLY LL+IV+AMTS K H DKS F+GK
Sbjct: 95 --------HKCSGTYSRESIDVESNNYSRDMFLFLYFLLIIVAAMTSFKVHQDKSSFTGK 146
Query: 167 TIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKD 226
+I YLNRHQTEEWKGWMQVLFLMYHYF A EIYNAIR+FIAAYVWMTGFGNFSYYY+RKD
Sbjct: 147 SILYLNRHQTEEWKGWMQVLFLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKD 206
Query: 227 FSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSV 286
FSL RFAQMMWRLNFFVAFCCIVLNNDY LYYICPMHTLFT+MVYGA+GI NKYNEIGSV
Sbjct: 207 FSLARFAQMMWRLNFFVAFCCIVLNNDYTLYYICPMHTLFTLMVYGALGILNKYNEIGSV 266
Query: 287 MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIW 346
M +K +ACFLVVILIWEIPGVF+I WSP TF+LGYTDP+KPDLPRLHEWHFRSGLDRYIW
Sbjct: 267 MAIKFVACFLVVILIWEIPGVFEIVWSPFTFLLGYTDPSKPDLPRLHEWHFRSGLDRYIW 326
Query: 347 IIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNK 406
I+GMIYAYYHPT KWMEKLEE+E K KL IK IV++AL G LWYE IYKLDK+TYNK
Sbjct: 327 IVGMIYAYYHPT--KWMEKLEEAETKTKLYIKPLIVSIALTAGCLWYEYIYKLDKITYNK 384
Query: 407 YHPYTSWIPIT 417
YHPYTSWIPIT
Sbjct: 385 YHPYTSWIPIT 395
>gi|293336548|ref|NP_001170173.1| uncharacterized protein LOC100384114 [Zea mays]
gi|224034033|gb|ACN36092.1| unknown [Zea mays]
Length = 391
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 219/267 (82%), Positives = 240/267 (89%)
Query: 151 MTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYV 210
MTS K H DKS F+GK++ YLNRHQTEEWKGWMQVLFLMYHYF A EIYNAIR+FIAAYV
Sbjct: 1 MTSFKVHQDKSSFTGKSVLYLNRHQTEEWKGWMQVLFLMYHYFNAKEIYNAIRVFIAAYV 60
Query: 211 WMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMV 270
WMTGFGNFSYYY+RKDFSL RFAQMMWRLNFFV FCCIVLNNDY LYYICPMHTLFT+MV
Sbjct: 61 WMTGFGNFSYYYVRKDFSLGRFAQMMWRLNFFVIFCCIVLNNDYTLYYICPMHTLFTLMV 120
Query: 271 YGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLP 330
YGA+GI NKYNEI SVM +K +ACFLVV+L+WE+PGVFDI WSP TF+LGYTDP+KPDLP
Sbjct: 121 YGALGILNKYNEIRSVMAMKFVACFLVVVLVWEVPGVFDIVWSPFTFLLGYTDPSKPDLP 180
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGY 390
RLHEW FRSGLDRYIWI+GMIYAYYHPT EKW+EKLEE+E + KL IK IVTV++ GY
Sbjct: 181 RLHEWQFRSGLDRYIWIVGMIYAYYHPTVEKWLEKLEETEMRTKLYIKTSIVTVSMMAGY 240
Query: 391 LWYECIYKLDKVTYNKYHPYTSWIPIT 417
LWYE IYKLDK+TYNK HPYTSWIPIT
Sbjct: 241 LWYEYIYKLDKITYNKLHPYTSWIPIT 267
>gi|255556462|ref|XP_002519265.1| conserved hypothetical protein [Ricinus communis]
gi|223541580|gb|EEF43129.1| conserved hypothetical protein [Ricinus communis]
Length = 533
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 227/329 (68%), Positives = 264/329 (80%), Gaps = 8/329 (2%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKV--HSDTNLVELEKETIKE 58
M++ P+TPGQVS LGI+PV AWIYSEFLEY K ++ HSD L EL +KE
Sbjct: 1 MIISSPVTPGQVSLFLGIVPVIAAWIYSEFLEYNKNAAAAAAKAHSDIGLTELGNGIVKE 60
Query: 59 DD-RAVLLEGG-----LSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGA 112
DD RAVLLEGG S + R + + + RF MD+ FL++NR TLRA++EFGA
Sbjct: 61 DDDRAVLLEGGGGLQAASSPKATRTVLPASSPPIFRFFLMDEQFLIDNRLTLRAISEFGA 120
Query: 113 ILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLN 172
+L YFY+CDRT+ S K++NRD+F FLY LL++VSA+TS K H+DKSPFSGK I YLN
Sbjct: 121 LLGYFYVCDRTDFFNGSKKSFNRDIFWFLYSLLIVVSAITSFKIHHDKSPFSGKPILYLN 180
Query: 173 RHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRF 232
RHQTEEWKGWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RF
Sbjct: 181 RHQTEEWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARF 240
Query: 233 AQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKIL 292
AQMMWRLNF V FCCI+LNN Y+LYYICPMHTLFT+MVYGA+GI NKYNEIGSV+ VKI+
Sbjct: 241 AQMMWRLNFLVIFCCIILNNSYVLYYICPMHTLFTLMVYGALGIMNKYNEIGSVIAVKII 300
Query: 293 ACFLVVILIWEIPGVFDIFWSPLTFILGY 321
ACFLVVILIWEIPGVF++ WSP TF LG+
Sbjct: 301 ACFLVVILIWEIPGVFELLWSPFTFFLGW 329
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/22 (95%), Positives = 22/22 (100%)
Query: 396 IYKLDKVTYNKYHPYTSWIPIT 417
IYKLDK+TYNKYHPYTSWIPIT
Sbjct: 355 IYKLDKITYNKYHPYTSWIPIT 376
>gi|449467353|ref|XP_004151388.1| PREDICTED: CAS1 domain-containing protein 1-like, partial [Cucumis
sativus]
Length = 362
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 204/239 (85%), Positives = 222/239 (92%)
Query: 179 WKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWR 238
WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFS+ RFAQMMWR
Sbjct: 1 WKGWMQVLFLMYHYFAAAEIYNAIRMFIAAYVWMTGFGNFSYYYIRKDFSVARFAQMMWR 60
Query: 239 LNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVV 298
LNFFV FCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNE SV+ KILACFLVV
Sbjct: 61 LNFFVIFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFNKYNEKSSVIAAKILACFLVV 120
Query: 299 ILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPT 358
ILIWE+PGVFD WSPLTF LGYTDPAKP LP+LHEWHFRSGLDRYIWI+GMIYAY+HP
Sbjct: 121 ILIWEVPGVFDALWSPLTFFLGYTDPAKPQLPKLHEWHFRSGLDRYIWIVGMIYAYFHPN 180
Query: 359 AEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
EKWMEKLEE++ ++++SIKA IVTVAL VGY+WYE IYKLDK++YNKYHPYTSWIPIT
Sbjct: 181 VEKWMEKLEEADTRKRVSIKACIVTVALSVGYMWYEWIYKLDKISYNKYHPYTSWIPIT 239
>gi|297740647|emb|CBI30829.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 206/235 (87%), Positives = 217/235 (92%), Gaps = 2/235 (0%)
Query: 185 VLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVA 244
VLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYYIRKDFSL RF QMMWRLNFFVA
Sbjct: 7 VLFLMYHYFAAAEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFTQMMWRLNFFVA 66
Query: 245 FCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEI 304
FCCIVLNNDYMLYYICPMHTLFT+MVYGA+GIFNKYNEI SVM VKILACFLVVILIWEI
Sbjct: 67 FCCIVLNNDYMLYYICPMHTLFTLMVYGALGIFNKYNEIRSVMAVKILACFLVVILIWEI 126
Query: 305 PGVFDIFWSPLTFILGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKW 362
PGVFDIFWSP F+LGY+ DP+K LPRLHEWHFRSGLDRYIWIIGMIYAYYHP EKW
Sbjct: 127 PGVFDIFWSPSAFLLGYSDPDPSKQGLPRLHEWHFRSGLDRYIWIIGMIYAYYHPNVEKW 186
Query: 363 MEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
MEKLEE+E KR+L+IK IVTV +FVGYLWYE IYKLDKVTYNK+HPYTSWIPIT
Sbjct: 187 MEKLEETETKRRLTIKTSIVTVTVFVGYLWYEYIYKLDKVTYNKFHPYTSWIPIT 241
>gi|413946697|gb|AFW79346.1| hypothetical protein ZEAMMB73_780731 [Zea mays]
Length = 377
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 190/238 (79%), Positives = 213/238 (89%), Gaps = 2/238 (0%)
Query: 182 WMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNF 241
W+QVLFLMYHYFAA+EIYNAIR+FIA YVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLNF
Sbjct: 16 WLQVLFLMYHYFAASEIYNAIRVFIACYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLNF 75
Query: 242 FVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILI 301
FVAFCCIVL+ND MLYYICPMHTLFT+MVYG++G+FNKYNE+ S+M +KI CFL VILI
Sbjct: 76 FVAFCCIVLDNDLMLYYICPMHTLFTLMVYGSLGLFNKYNEVPSIMAIKIACCFLSVILI 135
Query: 302 WEIPGVFDIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTA 359
WEIPGVF+ W+P TF+LGY D P+K LP LHEWHFRSGLDRYIWIIGMIYAY+HP
Sbjct: 136 WEIPGVFESLWAPFTFLLGYKDPSPSKAHLPLLHEWHFRSGLDRYIWIIGMIYAYFHPNV 195
Query: 360 EKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
E+WMEKLEESE K +LSIK IVT++L VG+LWYE IYKLDKVTYNKYHPYTSWIPIT
Sbjct: 196 ERWMEKLEESETKVRLSIKGTIVTLSLTVGFLWYEYIYKLDKVTYNKYHPYTSWIPIT 253
>gi|307108605|gb|EFN56845.1| hypothetical protein CHLNCDRAFT_35000 [Chlorella variabilis]
Length = 571
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 212/445 (47%), Positives = 283/445 (63%), Gaps = 31/445 (6%)
Query: 6 PITPGQVSFLLGIIPVFVAWIYSEFLEYKKV---SSHTKVHSDTNLVELEKETIKEDD-- 60
P+T GQV+ L I W+ +E+L YK+ + T H++ V K ++ +
Sbjct: 14 PLTAGQVACLGAYITAIALWMCAEWLNYKRSHAKKAQTHAHTEAAAVPEGKPLLEMNGVG 73
Query: 61 RAVLLEGGLSRSASARLLSSS--------------IKTNLIRFMTMDDAFLLENRATLRA 106
AV +E +A A + SS + R + +D LL +R LR+
Sbjct: 74 SAVDVEATSGEAAGAGMPSSPPPGTDASGKPTYAWASSAFYRCLMLDGDALLSSREALRS 133
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGK 166
EFG IL +FY+ DRT L+ TK Y RD+ LF++L+L V+ SLK+H +
Sbjct: 134 AVEFGTILAWFYVADRTQLIAPGTKTYTRDVLLFIFLVLTAVAGGYSLKQH--------R 185
Query: 167 TIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKD 226
T+ L+R QTEEWKGWMQVLFL+YHY+ A EIYNAIR+FIAAYVWMTGFGNFSYYY D
Sbjct: 186 TL-LLHRTQTEEWKGWMQVLFLLYHYYNAKEIYNAIRVFIAAYVWMTGFGNFSYYYRTAD 244
Query: 227 FSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSV 286
F + RF QMMWRLNF V FCC+VLNN YMLYYICPMHTLFTIMVYGA+ +F KYN+ +
Sbjct: 245 FGVGRFFQMMWRLNFLVLFCCLVLNNSYMLYYICPMHTLFTIMVYGALAVFPKYNKSDAA 304
Query: 287 MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIW 346
+ +KI ACFL V + W++ VF W+P F++GYTDP +P +++EW FRS LDRYIW
Sbjct: 305 IWLKIGACFLTVFVFWDLKMVFYSVWTPFMFLVGYTDPRRPGGDKMYEWFFRSSLDRYIW 364
Query: 347 IIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNK 406
I GMI A+ HP EK+++ ++ E R+ + +A ++++ VGY WY +Y L KV YN+
Sbjct: 365 IYGMICAFMHPKVEKFLQAVDGMEAHRRRATRAVLISLFCAVGYAWYHFVYLLPKVEYNR 424
Query: 407 YHPYTSWIPIT---YVLFIFYFFSL 428
HPYTSWIPIT +L + +FF L
Sbjct: 425 IHPYTSWIPITGEAGILVVDFFFML 449
>gi|118481045|gb|ABK92476.1| unknown [Populus trichocarpa]
Length = 332
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 179/206 (86%), Positives = 193/206 (93%)
Query: 212 MTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
MTGFGNFSYYYIRKDFS+ RF+QMMWRLNFFVAFCCI+LNNDYMLYYICPMHTLFT+MVY
Sbjct: 1 MTGFGNFSYYYIRKDFSVARFSQMMWRLNFFVAFCCIILNNDYMLYYICPMHTLFTLMVY 60
Query: 272 GAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPR 331
GA+GIFNKYNE SVM VKIL+CFLVVILIWEIPGVFD WSPLTF+LGY+DPAKPDLPR
Sbjct: 61 GALGIFNKYNENSSVMAVKILSCFLVVILIWEIPGVFDFLWSPLTFLLGYSDPAKPDLPR 120
Query: 332 LHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYL 391
LHEWHFRSGLDRYIWIIGMIYAY+HP EKWMEKLEESE K+KLS+K GIV V++ VGYL
Sbjct: 121 LHEWHFRSGLDRYIWIIGMIYAYFHPNIEKWMEKLEESETKKKLSMKTGIVAVSVSVGYL 180
Query: 392 WYECIYKLDKVTYNKYHPYTSWIPIT 417
WYE IYKLDKV+YNKYHPYTSWIPIT
Sbjct: 181 WYEYIYKLDKVSYNKYHPYTSWIPIT 206
>gi|303291260|ref|XP_003064916.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453587|gb|EEH50896.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 451
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/325 (56%), Positives = 233/325 (71%), Gaps = 8/325 (2%)
Query: 93 DDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMT 152
D LL NR TLRA AEFG +L FY+ DRT + DS K+Y+RDLFL L+L T
Sbjct: 5 DKIALLANRLTLRAWAEFGLVLGMFYLADRTGGISDSGKSYDRDLFLTLFLTFAAYGWRT 64
Query: 153 SLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWM 212
SL K P LNR QTEEWKGWMQVLFL+YHYF A+E+YNAIRIFIAAYVWM
Sbjct: 65 SLGKTETYVP--------LNRKQTEEWKGWMQVLFLLYHYFKASEVYNAIRIFIAAYVWM 116
Query: 213 TGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYG 272
TG+GNFSYYY+RKDF +PRF QMMWRLNFFV F C+VL NDYMLYYICPMH+LF VY
Sbjct: 117 TGYGNFSYYYVRKDFGVPRFLQMMWRLNFFVFFTCVVLRNDYMLYYICPMHSLFACFVYF 176
Query: 273 AVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRL 332
+ ++ K+NE V+ K L C L+ ++WE+PGVF + +SP ++L YTDPAKPD+ +
Sbjct: 177 TLLVYKKHNEKNKVIATKFLVCVLMCYVLWEVPGVFKLVFSPFRWLLKYTDPAKPDMDPM 236
Query: 333 HEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLW 392
HEW FRSGLDRY+WI GM+ A+ HP + +++ +++ +++I+ IVT L V Y +
Sbjct: 237 HEWFFRSGLDRYVWIHGMLCAFCHPRYDAFLQWIDKKAKLTRVAIQTTIVTFTLLVVYWY 296
Query: 393 YECIYKLDKVTYNKYHPYTSWIPIT 417
+E +Y L K+ YNK+HPYTSWIPIT
Sbjct: 297 HEKVYVLPKLEYNKFHPYTSWIPIT 321
>gi|412986715|emb|CCO15141.1| predicted protein [Bathycoccus prasinos]
Length = 624
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/343 (52%), Positives = 236/343 (68%), Gaps = 9/343 (2%)
Query: 87 IRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLV 146
+R + D + R LR+ AE ILF FY+CDRTN + TK+Y+RD F+ +++ L
Sbjct: 179 MRILQGDADAIERARLILRSWAELFFILFVFYVCDRTNTFPERTKSYSRDFFIAIFVTLA 238
Query: 147 IVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFI 206
+L++ +P LNR QTEEWKGWMQVLFL+YHYF A+EIYNAIRIFI
Sbjct: 239 AYGCYQTLRQSKTSAP--------LNREQTEEWKGWMQVLFLLYHYFKASEIYNAIRIFI 290
Query: 207 AAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLF 266
AAYVWMTGFGNFSYY++RKDFS+ RF+QMMWRLNFFV F CI L NDY+LYYICPMHTLF
Sbjct: 291 AAYVWMTGFGNFSYYHVRKDFSVGRFSQMMWRLNFFVLFVCIALRNDYVLYYICPMHTLF 350
Query: 267 TIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAK 326
T+ VY ++ +F ++N ++++ K++ ++V +IWE+PGVF + + P ++L YTDP +
Sbjct: 351 TLFVYFSLLVFKEHNTSTNMIVAKMVILIVLVYVIWEVPGVFTVLFKPFEWLLSYTDPKR 410
Query: 327 PDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVAL 386
PD+ +HEW FRSGLDRY+WI GM+ A+ HP E ++ ++E K K I+ I+TV
Sbjct: 411 PDVNPMHEWFFRSGLDRYVWIYGMVCAFVHPKYEAFVRWVDEKPTKEKTVIQGLILTVNT 470
Query: 387 FVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLV 429
Y WY Y L K+ YN HPYTSWIPIT + IF FSL
Sbjct: 471 IAAYWWYTTYYVLPKLEYNVVHPYTSWIPIT-IFIIFRNFSLT 512
>gi|159472625|ref|XP_001694445.1| O-acetyltransferase-related protein [Chlamydomonas reinhardtii]
gi|158276669|gb|EDP02440.1| O-acetyltransferase-related protein [Chlamydomonas reinhardtii]
Length = 491
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 188/412 (45%), Positives = 253/412 (61%), Gaps = 53/412 (12%)
Query: 6 PITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLL 65
PIT GQ +FL+ + +AWI SE L + ++
Sbjct: 7 PITSGQATFLVAFVWSLLAWIASEVLNWLHIA---------------------------- 38
Query: 66 EGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNL 125
+ + + M LL++R TLR EFGA++ ++++CDRT +
Sbjct: 39 -----------------HSGFVDCLLMKRKALLKHRLTLRTTVEFGALMCWYFLCDRTTV 81
Query: 126 LGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQV 185
G+ K+Y+RDLF+FL+++L V+ +SL+ K P LNR QTEEWKGWMQV
Sbjct: 82 FGEGEKSYSRDLFVFLFVILTSVAVGSSLQAF--KMPL------LLNRPQTEEWKGWMQV 133
Query: 186 LFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAF 245
LFL+YHYF A E YNAIRIFIA YVWMTGFGNFSYYY DF + RFAQMMWRLNF V F
Sbjct: 134 LFLLYHYFEAREAYNAIRIFIAGYVWMTGFGNFSYYYKTGDFCIGRFAQMMWRLNFMVFF 193
Query: 246 CCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIP 305
C+VLNN YMLYYICPMHT+FT+ VY A+ I +YN+ ++KI ACFL + + W++
Sbjct: 194 TCVVLNNSYMLYYICPMHTIFTVFVYAALAIAPQYNQNNFWCLMKIAACFLFIFVTWDLK 253
Query: 306 GVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEK 365
VF W+P TF++GY DP KP LHEW+FRS LDRYIWI GM+ A+ HP ++
Sbjct: 254 TVFYAIWTPFTFLMGYNDPRKPTNDALHEWYFRSSLDRYIWIYGMLCAFMHPPVAALLKY 313
Query: 366 LEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
++E R++++++ I+++ GY +Y IY L K+ YN+ HPYTSWIPIT
Sbjct: 314 IDEMPIVRRVTVRSFIISLCGVAGYFYYVHIYCLPKLEYNQVHPYTSWIPIT 365
>gi|217074354|gb|ACJ85537.1| unknown [Medicago truncatula]
Length = 220
Score = 368 bits (944), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 182/220 (82%), Positives = 204/220 (92%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV PITPGQVSFLLG+IPVFV WIYSE+LEYK+ SS TKVHSD NL EL K+TIKEDD
Sbjct: 1 MVVSGPITPGQVSFLLGVIPVFVTWIYSEYLEYKRTSSPTKVHSDINLDELGKDTIKEDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RA+LLE GL+RSASA+L +SS+K NLIRF+TMDD+FLLENRATLRAMAEFG ILFYFYIC
Sbjct: 61 RAILLEAGLTRSASAKLHASSVKLNLIRFLTMDDSFLLENRATLRAMAEFGLILFYFYIC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT++LGDSTKNYNRDLF+FL++LL+IVSAMTSLKKHND S FS +++ YLNRHQTEEWK
Sbjct: 121 DRTDILGDSTKNYNRDLFIFLFILLLIVSAMTSLKKHNDTSSFSARSMLYLNRHQTEEWK 180
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSY 220
GWMQVLFLMYHYFAATEIYN+IR+FIAAYVWMTGFGNFSY
Sbjct: 181 GWMQVLFLMYHYFAATEIYNSIRVFIAAYVWMTGFGNFSY 220
>gi|384252364|gb|EIE25840.1| O-acetyltransferase-related protein, partial [Coccomyxa
subellipsoidea C-169]
Length = 468
Score = 364 bits (934), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 174/328 (53%), Positives = 226/328 (68%), Gaps = 8/328 (2%)
Query: 90 MTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVS 149
+ + LL++R LRA EFG+I+ +FY+ DRT TK+Y+RD+ F+++ L V+
Sbjct: 2 LMLKQKALLDSRLYLRAAVEFGSIMLWFYLVDRTTFFTYGTKSYSRDVLTFIFVTLTAVA 61
Query: 150 AMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAY 209
A S++K+ K+P LNR QTEEWKGWMQVLFL+YHYF A EIYNAIR+FIA Y
Sbjct: 62 AWKSMRKY--KAP------DMLNRWQTEEWKGWMQVLFLLYHYFNAREIYNAIRVFIAGY 113
Query: 210 VWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIM 269
VWMTG+GNF YYY KDF L RF QMMWRLNF V CIVL N YMLYYICPMHT+FTI
Sbjct: 114 VWMTGYGNFMYYYHYKDFCLGRFMQMMWRLNFLVTVVCIVLRNSYMLYYICPMHTIFTIF 173
Query: 270 VYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDL 329
VY +G+ KYN + ++VKI C L++ + W+I VF WSP F++GY DP KP
Sbjct: 174 VYATLGLGQKYNATNTGILVKIGLCVLLIYVCWDIKAVFYAIWSPFMFLVGYNDPRKPTD 233
Query: 330 PRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVG 389
LHEWHFRS LDRY+WI GM+ A+ P AE ++++++ + +++ + ++ L V
Sbjct: 234 DLLHEWHFRSSLDRYVWIHGMLCAFLKPWAEGFLQRVDSLAFRARVAARTAVIGCTLVVL 293
Query: 390 YLWYECIYKLDKVTYNKYHPYTSWIPIT 417
LWYE +YKL KV YNK HPYTSWIPI+
Sbjct: 294 ALWYEHVYKLPKVEYNKLHPYTSWIPIS 321
>gi|308810421|ref|XP_003082519.1| O-acetyltransferase (ISS) [Ostreococcus tauri]
gi|116060988|emb|CAL56376.1| O-acetyltransferase (ISS) [Ostreococcus tauri]
Length = 538
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 185/389 (47%), Positives = 240/389 (61%), Gaps = 22/389 (5%)
Query: 40 TKVHSDTNLVELEKETIKEDDRAVLLEGGLSRSA----------SARLLSSSIKTNLIRF 89
T D + V L E + A E GL+R R +S
Sbjct: 35 TSSQRDGDGVALVNAVGNEAEDA---ESGLARVDDAVEDDVDERGQRRSTSGGARGFAAC 91
Query: 90 MTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVS 149
+D L ENR ++A AE G I+F F+ CDR+ L+ D+ K Y+RDLFL +++ L
Sbjct: 92 AMLDPIALRENRPAIQAAAELGVIMFLFWFCDRSGLVRDAKKAYDRDLFLAVFVALAAYG 151
Query: 150 AMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAY 209
T+LKK K+ L+R QTEE KGWMQVLFL+YHYF A E+YN IR+FIAAY
Sbjct: 152 WKTALKK--------SKSTSALHREQTEEMKGWMQVLFLLYHYFDAGEMYNLIRVFIAAY 203
Query: 210 VWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIM 269
VWMTGFGNFS+YY++KDFS RFAQMMWRLNFFV C V+ NDYMLYYICP+HTLFT+
Sbjct: 204 VWMTGFGNFSFYYVKKDFSATRFAQMMWRLNFFVFVTCAVMKNDYMLYYICPLHTLFTLF 263
Query: 270 VYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDL 329
V+GA+ I + N + K CFL+ + +WE+PG+F + P FI GYT+PAKP
Sbjct: 264 VFGALAIGREKNSDPRWVYAKFACCFLLSVTLWEVPGLFKRVFRPFVFIFGYTNPAKPHG 323
Query: 330 PRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSI-KAGIVTVALFV 388
LHEW FRSGLDRY+WI GM+ AY HP E ++ +++ + R+ I + GI+ VA
Sbjct: 324 DPLHEWRFRSGLDRYVWIYGMMCAYVHPRYEAVLKWIDDRQSTRERYIYQGGIIIVASMT 383
Query: 389 GYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
W++ K+DK YN++HPYTSWIPIT
Sbjct: 384 LLWWWQTFMKMDKFAYNEWHPYTSWIPIT 412
>gi|145353177|ref|XP_001420899.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581135|gb|ABO99192.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 448
Score = 363 bits (932), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 175/324 (54%), Positives = 224/324 (69%), Gaps = 13/324 (4%)
Query: 97 LLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK 156
L E+R ++A AE G I+F F+ CDR+ + ++ K+Y+RDL ++ LL T+LK+
Sbjct: 7 LRESRRAVQAWAELGVIMFLFWFCDRSGAVREAKKSYDRDLLFAVFALLGTYGWKTTLKE 66
Query: 157 HNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFG 216
+P L+R QTEE KGWMQVLFL+YHYF A E+YN IR+FIAAYVWMTGFG
Sbjct: 67 SKTHAP--------LHREQTEEMKGWMQVLFLLYHYFDAGEMYNLIRVFIAAYVWMTGFG 118
Query: 217 NFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
NFS+YY++KDFS RFAQMMWRLNFFV C V+ NDYMLYYICPMHTLFT++VYGA+ I
Sbjct: 119 NFSFYYVKKDFSAVRFAQMMWRLNFFVFVTCAVMKNDYMLYYICPMHTLFTLLVYGALAI 178
Query: 277 FNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWH 336
+ N + K +ACF +V +WEIPG F ++P TF+ Y +PAKPD+ LHEW
Sbjct: 179 AREKNSEPKWIYGKFIACFAIVAAVWEIPGAFKRIFTPFTFLFRYVNPAKPDVHPLHEWQ 238
Query: 337 FRSGLDRYIWIIGMIYAYYHPTAE---KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWY 393
FRSGLDRY+WI GM AY+HP E KW++ E S + ++ + G+V +A Y W+
Sbjct: 239 FRSGLDRYVWIYGMACAYFHPKYELTMKWID--ERSTTRERMMYQGGVVALASGALYWWW 296
Query: 394 ECIYKLDKVTYNKYHPYTSWIPIT 417
+ LDKV YNKYHPYTSWIPIT
Sbjct: 297 VTFFTLDKVEYNKYHPYTSWIPIT 320
>gi|255076697|ref|XP_002502020.1| predicted protein [Micromonas sp. RCC299]
gi|226517285|gb|ACO63278.1| predicted protein [Micromonas sp. RCC299]
Length = 458
Score = 360 bits (924), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 171/321 (53%), Positives = 219/321 (68%), Gaps = 8/321 (2%)
Query: 97 LLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK 156
L+ +R TLRA E G +L F+I DR+ + D+ K+Y+RDLF ++ L T+ K+
Sbjct: 24 LIASRDTLRAWTELGCVLALFWIADRSGAVPDAEKHYDRDLFWAIFGALGAYGWFTTAKE 83
Query: 157 HNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFG 216
+P LNR QTEEWKGWMQVLFL+YHYF A+EIYNAIR+FIAAYVWMTGFG
Sbjct: 84 CRSNAP--------LNREQTEEWKGWMQVLFLLYHYFKASEIYNAIRVFIAAYVWMTGFG 135
Query: 217 NFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
NFSYYY+RKDFS RFAQMMWRLNFFV F C+ + NDY+LYYICPMHTLFT VY A+ +
Sbjct: 136 NFSYYYVRKDFSFHRFAQMMWRLNFFVFFVCLAMRNDYVLYYICPMHTLFTWAVYFALYV 195
Query: 277 FNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWH 336
++N +V++ K CF ++WEIPGVF + P + Y DP KPD+ +HEW
Sbjct: 196 GREHNTSTNVVLGKFAVCFAACYVLWEIPGVFTAVFGPFQGLFEYVDPKKPDVDPMHEWF 255
Query: 337 FRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECI 396
FRSGLDRY+WI GM A++HP E M+K++E + +R+++ + I L Y WY +
Sbjct: 256 FRSGLDRYVWIHGMACAFFHPRFESAMKKVDEMDQRRRVTAQCAIGAATLGAMYWWYSSV 315
Query: 397 YKLDKVTYNKYHPYTSWIPIT 417
Y L K+ YN HPYTSWIPIT
Sbjct: 316 YVLPKLEYNALHPYTSWIPIT 336
>gi|3128234|gb|AAC26714.1| hypothetical protein [Arabidopsis thaliana]
gi|20197158|gb|AAM14946.1| hypothetical protein [Arabidopsis thaliana]
Length = 230
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 169/244 (69%), Positives = 195/244 (79%), Gaps = 26/244 (10%)
Query: 151 MTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYV 210
MTSLKKHNDKSP +GK+I YLNRHQTEEWKGWMQVLFLMYHYFAA EIYNAIR+FIAAYV
Sbjct: 1 MTSLKKHNDKSPITGKSILYLNRHQTEEWKGWMQVLFLMYHYFAAAEIYNAIRVFIAAYV 60
Query: 211 WMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMV 270
WMTGFGNFSYYYIRKDFSL RF QMMWRLN FVAF CI+LNNDYMLYYICPMHTLFT+MV
Sbjct: 61 WMTGFGNFSYYYIRKDFSLARFTQMMWRLNLFVAFSCIILNNDYMLYYICPMHTLFTLMV 120
Query: 271 YGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLP 330
YGA+GIF++YNEI SVM +KI +CFLVVI++WEIPGVF+IFWSPLTF+LG
Sbjct: 121 YGALGIFSRYNEIPSVMALKIASCFLVVIVMWEIPGVFEIFWSPLTFLLG---------- 170
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFV-- 388
+++ L + + GM+ E+WMEKLEE + KRK+SIK I+ ++ FV
Sbjct: 171 -----QYQTLLISFT-VFGMV--------ERWMEKLEECDAKRKMSIKTSIIAISSFVSL 216
Query: 389 GYLW 392
G LW
Sbjct: 217 GSLW 220
>gi|302844179|ref|XP_002953630.1| hypothetical protein VOLCADRAFT_106065 [Volvox carteri f.
nagariensis]
gi|300261039|gb|EFJ45254.1| hypothetical protein VOLCADRAFT_106065 [Volvox carteri f.
nagariensis]
Length = 518
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 172/348 (49%), Positives = 227/348 (65%), Gaps = 8/348 (2%)
Query: 70 SRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDS 129
+ +L S + LI + L+++R TLRA+ EFG I+ ++++ DRT ++
Sbjct: 52 AHGHQGSILESMANSGLISCLLFKQPALIQHRDTLRAVVEFGLIMIWYFVADRTTVVPWG 111
Query: 130 TKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLM 189
K+Y+RD+F+FL+++L V+ TS++ K P LNR QTEEWKGWMQVLFL+
Sbjct: 112 EKSYSRDVFIFLFVVLTSVALGTSIRAF--KMPL------LLNRPQTEEWKGWMQVLFLL 163
Query: 190 YHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIV 249
YHYF A E+YNAIRIFIAAYVWMTGFGNFSYYY DF + RFAQM+WRLNF V F C+V
Sbjct: 164 YHYFEAKEVYNAIRIFIAAYVWMTGFGNFSYYYKTGDFCIGRFAQMLWRLNFLVFFACVV 223
Query: 250 LNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFD 309
L N YMLYYICPMHT+FT+ VY A+ I + N + VKIL CF + + W++ VF
Sbjct: 224 LRNSYMLYYICPMHTIFTVFVYLALAIAPQLNVNHGWLFVKILLCFAFIFVTWDLKFVFY 283
Query: 310 IFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEES 369
WSP FILGY+DP P LHEW+FRSGLDRYIW+ GM+ A HP A ++ ++E
Sbjct: 284 AIWSPFKFILGYSDPRNPTEDVLHEWYFRSGLDRYIWVYGMLCALVHPQAAAVLKYIDEL 343
Query: 370 EPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
R+ + + I+++ Y +Y +Y K YN HPYTSWIPIT
Sbjct: 344 PVIRRYTARTIILSLCGVASYYYYITVYSKPKYEYNTVHPYTSWIPIT 391
>gi|217074356|gb|ACJ85538.1| unknown [Medicago truncatula]
gi|388497864|gb|AFK36998.1| unknown [Medicago truncatula]
Length = 309
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 156/203 (76%), Positives = 178/203 (87%)
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFT+MVYGA+GI+NKYNEI SVM VK LAC
Sbjct: 1 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGIYNKYNEIASVMAVKFLAC 60
Query: 295 FLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY 354
FLVVILIWEIPG F++FWSP F LGYTDPAKPD+PR+HEWHFRSGLDRYIWI+GMIYAY
Sbjct: 61 FLVVILIWEIPGFFELFWSPFAFFLGYTDPAKPDVPRMHEWHFRSGLDRYIWIVGMIYAY 120
Query: 355 YHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWI 414
+HP EKWMEKLEESE KR+++IK IV+ ALFVGY+WYE IYKLDKV+YNK HPYTSWI
Sbjct: 121 FHPNVEKWMEKLEESETKRRVTIKTSIVSAALFVGYMWYEYIYKLDKVSYNKLHPYTSWI 180
Query: 415 PITYVLFIFYFFSLVKHLSGSLY 437
PIT + + F +++ S +L+
Sbjct: 181 PITVYICLRNFTQHLRNFSLTLF 203
>gi|297740646|emb|CBI30828.3| unnamed protein product [Vitis vinifera]
Length = 187
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 154/185 (83%), Positives = 166/185 (89%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
MVV PITPGQVSFLLGIIPVFVAWIYSEFLEYKK SS +KVHSD NLVEL ETIKEDD
Sbjct: 1 MVVSSPITPGQVSFLLGIIPVFVAWIYSEFLEYKKSSSPSKVHSDNNLVELGSETIKEDD 60
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
RA+LLEGGL++SASA+ SSSIK NLIRF+TMDD+FLLENR TLRAM+EFGAIL YFY+C
Sbjct: 61 RAILLEGGLTKSASAKFNSSSIKVNLIRFLTMDDSFLLENRLTLRAMSEFGAILTYFYVC 120
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT LLGDSTKNYNRDLF+FLYLLLVIV MTSLKKH+DKS FSGK + YLNRHQTEEWK
Sbjct: 121 DRTELLGDSTKNYNRDLFIFLYLLLVIVCFMTSLKKHHDKSAFSGKALLYLNRHQTEEWK 180
Query: 181 GWMQV 185
GWMQ
Sbjct: 181 GWMQA 185
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 297 bits (761), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 134/173 (77%), Positives = 152/173 (87%), Gaps = 2/173 (1%)
Query: 189 MYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCI 248
+YHYFAA+EIYNAIR+FIA YVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLNFFVAFCCI
Sbjct: 169 VYHYFAASEIYNAIRVFIACYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLNFFVAFCCI 228
Query: 249 VLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVF 308
VL+ND MLYYICPMHTLFT+MVYG++G+FNK NE+ S+M +KI CFL VILIWEIPGVF
Sbjct: 229 VLDNDLMLYYICPMHTLFTLMVYGSLGLFNKCNEVPSIMAIKIACCFLSVILIWEIPGVF 288
Query: 309 DIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTA 359
++ W P TF+L Y D P+K LP LHEWHFRSGLDRYI IIGMIYAY+HP A
Sbjct: 289 ELLWGPFTFLLDYKDPSPSKAHLPLLHEWHFRSGLDRYICIIGMIYAYFHPNA 341
>gi|407404748|gb|EKF30093.1| hypothetical protein MOQ_006101 [Trypanosoma cruzi marinkellei]
Length = 554
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 172/392 (43%), Positives = 224/392 (57%), Gaps = 26/392 (6%)
Query: 48 LVELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFL---------- 97
LV L ++ V G LS S ++++ + T ++ + MD F+
Sbjct: 41 LVVLVFASLVASSTTVFPTGPLS---SGQIMAFILITFIVAWAAMDFLFVQLNLMEESYM 97
Query: 98 -LENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK 156
+ +R LR AE GA L ++CDRT LL S K Y+ D F L L VS T LKK
Sbjct: 98 TVNSRLQLRGGAELGAYLLLMFVCDRTTLLPRSEKIYSMDFFWLLCAALFGVSLFT-LKK 156
Query: 157 HNDKSPF--SGKT--------IQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFI 206
S F SG + + L R QTEEWKGWMQVLFL YHYF IYN+IRIFI
Sbjct: 157 AKPPSSFVVSGGSESPVECFHVPPLTRSQTEEWKGWMQVLFLWYHYFHNVSIYNSIRIFI 216
Query: 207 AAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLF 266
AAYVWMTGFGNFSYYY+RKD+S R M WRLNF VA + L N YMLYYICP+HTLF
Sbjct: 217 AAYVWMTGFGNFSYYYVRKDYSFNRLCVMQWRLNFLVAAVSLTLGNQYMLYYICPLHTLF 276
Query: 267 TIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIP-GVFDIFWSPLTFILGYTDPA 325
T+ +Y ++ F N M VK+ FL LIW++ F + WSP ++++G+ +P
Sbjct: 277 TLFIYQSLYFFQDMNVTNRGMCVKVFLSFLFCGLIWDVSREFFYVIWSPFSWLVGFNNPY 336
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVA 385
+P L EW FR+ LD YIWI GMI AY HP ++ +L+E +K I A
Sbjct: 337 RPKQHILWEWFFRTSLDHYIWIYGMICAYSHPRYCHYLRRLDELPKYASWLMKFVITITA 396
Query: 386 LFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
L VG+++ + +Y + K+ YN HPYTS+IPIT
Sbjct: 397 LCVGFVYIKVVYLVPKLEYNTIHPYTSFIPIT 428
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 134/173 (77%), Positives = 152/173 (87%), Gaps = 2/173 (1%)
Query: 189 MYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCI 248
+YHYFAA+EIYNAIR+FIA YVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLNFFVAFCCI
Sbjct: 108 VYHYFAASEIYNAIRVFIACYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLNFFVAFCCI 167
Query: 249 VLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVF 308
VL+ND MLYYICPMHTLFT+MVYG++G+FNK NE+ S+M +KI CFL VILIWEIPGVF
Sbjct: 168 VLDNDLMLYYICPMHTLFTLMVYGSLGLFNKCNEVPSIMAIKIACCFLSVILIWEIPGVF 227
Query: 309 DIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTA 359
++ W P TF+L Y D P+K LP LHEWHFRSGLDRYI IIGMIYAY+HP A
Sbjct: 228 ELLWGPFTFLLDYKDPSPSKAHLPLLHEWHFRSGLDRYICIIGMIYAYFHPNA 280
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 133/180 (73%), Positives = 154/180 (85%), Gaps = 2/180 (1%)
Query: 189 MYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCI 248
+YHYFAA+EIYNAIR+FIA YVWMT FGNFSYYYI+KDFS+ RFAQMMWRLNFFVAFCCI
Sbjct: 136 VYHYFAASEIYNAIRVFIACYVWMTRFGNFSYYYIKKDFSIARFAQMMWRLNFFVAFCCI 195
Query: 249 VLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVF 308
VL+ND MLYYICPMHTLFT+MVYG++G+FNK N + S+M +KI FL VILIWEIPGVF
Sbjct: 196 VLDNDLMLYYICPMHTLFTLMVYGSLGLFNKCNVVPSIMAIKIACYFLSVILIWEIPGVF 255
Query: 309 DIFWSPLTFILGYTD--PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKL 366
++ W+P TF+LGY D P+K LP LHEWHFRSGL+RYIWIIGMIYAY+HP A + L
Sbjct: 256 ELLWAPFTFLLGYKDPSPSKAHLPLLHEWHFRSGLNRYIWIIGMIYAYFHPNALLGQDAL 315
>gi|156387820|ref|XP_001634400.1| predicted protein [Nematostella vectensis]
gi|156221483|gb|EDO42337.1| predicted protein [Nematostella vectensis]
Length = 388
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 197/318 (61%), Gaps = 21/318 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L+++A GAI+FYFYICD + T+ Y+RD+F+FLYL+L++V+ + ++ DK
Sbjct: 1 LKSIALLGAIMFYFYICDYDHFFPAGTRVYSRDVFVFLYLVLMVVAVCFTTRECKDKI-- 58
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYI 223
LNR QTEEWKG MQVLF+ YHYF A E YNAIR+FIAAYVWMTGFGNFS+++I
Sbjct: 59 -------LNRDQTEEWKGIMQVLFVWYHYFKAAETYNAIRVFIAAYVWMTGFGNFSFFWI 111
Query: 224 RKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEI 283
R+DFSL R +M++RLNF V CIV N+YMLYYICPMHT + + VY + K N
Sbjct: 112 RQDFSLYRMLKMLFRLNFLVVITCIVTTNEYMLYYICPMHTFWFLSVYAMMRPLYKRNAD 171
Query: 284 GSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDR 343
VM K F++V L+++ GV ++ + P IL Y LHEW FR+GLD
Sbjct: 172 PKVMAAKFGIYFVLVFLLFDASGVGEMVFKPFYPILSYRK-------SLHEWMFRAGLDH 224
Query: 344 YIWIIGMIYAYYHPTAEKWMEKLEESEPKRK-----LSIKAGIVTVALFVGYLWYECIYK 398
Y +GM+ AY++P EK+M LE RK L +K GI + W+ ++
Sbjct: 225 YATFLGMLCAYFYPNFEKFMSSLELEVCDRKTQVTHLLVKFGIAAALVTSVGFWWCHVFV 284
Query: 399 LDKVTYNKYHPYTSWIPI 416
L K YNK HPY S IP+
Sbjct: 285 LPKFDYNKLHPYYSCIPL 302
>gi|221132223|ref|XP_002156799.1| PREDICTED: CAS1 domain-containing protein 1-like [Hydra
magnipapillata]
Length = 511
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 201/312 (64%), Gaps = 16/312 (5%)
Query: 111 GAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQY 170
G I+ ++++CD + + + YNRDLF+FL ++L +V+ + G + +
Sbjct: 105 GWIMVFYFLCDYLHYFPKADRVYNRDLFIFLSIMLFLVALSFTWT--------VGPSGKL 156
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLP 230
LNR QTEEWKGWMQV+F+ YHYF A E YNA+R++IAAYV+MTGFGNFS++++RKD+SL
Sbjct: 157 LNRDQTEEWKGWMQVMFVWYHYFRAAETYNAVRVYIAAYVFMTGFGNFSFFWVRKDYSLR 216
Query: 231 RFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVK 290
R ++++RLNF VA +V +N Y+LYYIC MHT + + VY + F++ N + VM+ K
Sbjct: 217 RVLKLLFRLNFLVAVVVLVTDNQYILYYICGMHTFWFVTVYIMMRPFHELNHVKKVMLAK 276
Query: 291 ILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGM 350
+ F+V+ +I+++P V + P + +LGY DP P L+EW FRSGLD Y+ +IGM
Sbjct: 277 FVLYFVVIYVIFDVPNVAQYVFLPFSNLLGYGDP-----PTLNEWIFRSGLDHYVTLIGM 331
Query: 351 IYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPY 410
I AY+HP EK + + ++I IV + V LWY+ ++ L K YN HPY
Sbjct: 332 ICAYFHPNLEKQLNFINNHRNNIAINILISIVIFPILV--LWYKYVFVLSKYPYNVLHPY 389
Query: 411 TSWIP-ITYVLF 421
TS+IP I YV F
Sbjct: 390 TSFIPIIAYVYF 401
>gi|323449614|gb|EGB05501.1| hypothetical protein AURANDRAFT_1346 [Aureococcus anophagefferens]
Length = 404
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 138/319 (43%), Positives = 189/319 (59%), Gaps = 13/319 (4%)
Query: 103 TLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSP 162
+LR +EF + + Y+CDRT+L+ K+ + F L+L + V+A SL+
Sbjct: 1 SLRHGSEFALWIAFMYLCDRTDLVPRGAKHTDPRSFWLLWLA-ICVAAACSLR------- 52
Query: 163 FSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYY 222
S ++ + L R QTEEWKGWMQ++FL+YHYFA ++YNAIRI+IA YVWMTG+GNF YY
Sbjct: 53 -SCRSPKVLAREQTEEWKGWMQLMFLLYHYFAQGQLYNAIRIYIAGYVWMTGYGNFLYYR 111
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNE 282
DFS R Q ++RLNFFV C+ L N+YMLYYI PMHTLFT+ V+ A+ + N
Sbjct: 112 KSGDFSAVRMCQTLFRLNFFVVVVCVALRNEYMLYYIAPMHTLFTLFVWLALRVAKDRNG 171
Query: 283 IGSVMIVKILACFLVVILIWEIPGVFDIF--WSPLTFILGYTDPAKPDLP-RLHEWHFRS 339
+V KI+A L++++ VF W PL ++ + DP P+ LHEWHFRS
Sbjct: 172 DDAVAAGKIVATLAATALLYDVEPVFKAAFGWRPLRDLVAFHDPLHPEFSDELHEWHFRS 231
Query: 340 GLDRYIWIIGMIYAYYHPTAEKWMEKLEE-SEPKRKLSIKAGIVTVALFVGYLWYECIYK 398
GLDRYIWI GM A P E+ + L + R+++ VAL W ++
Sbjct: 232 GLDRYIWIFGMACALGLPYLERRLGALAALDDGARRVAAHGAAACVALAPAVAWVALVFL 291
Query: 399 LDKVTYNKYHPYTSWIPIT 417
DK YN HPYTSW+P+
Sbjct: 292 KDKYAYNALHPYTSWVPVA 310
>gi|291236991|ref|XP_002738420.1| PREDICTED: CG2938-like [Saccoglossus kowalevskii]
Length = 588
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 152/423 (35%), Positives = 218/423 (51%), Gaps = 62/423 (14%)
Query: 34 KKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMD 93
++ T+ L+ + + ++A+ + G ++ + L S+ K L++ +
Sbjct: 97 NSINDSTENAETQKFKALDAHVLDKIEKAID-DAGKDQNLNEELDSNEPKEPLLKNDKTN 155
Query: 94 DAFLLENRATLRAMAE----------------------FGAILFYFYICDRTNLLGDSTK 131
D +EN E FG I+ YFY+CD ++ +
Sbjct: 156 DQIKVENTEKNGKNGEKSSFSKIKPPPSFDDFLFYVIVFGGIMLYFYLCDYQHIFKSGER 215
Query: 132 NYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYH 191
Y+RDLF+FL LLL QTEEW+GWMQV+F+ YH
Sbjct: 216 TYSRDLFMFLVLLL----------------------------DQTEEWRGWMQVMFVWYH 247
Query: 192 YFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLN 251
YFAA E YN IRIFIA YVWMTGFGNFS+++IRKDF+ R +M WRLNF V CIV +
Sbjct: 248 YFAAKETYNYIRIFIACYVWMTGFGNFSFFWIRKDFTFWRMLKMQWRLNFLVTVVCIVTD 307
Query: 252 NDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIF 311
NDYMLYYIC MHT + + V+ + + N +N M K A F+ +I++IPGV ++
Sbjct: 308 NDYMLYYICAMHTYWFLTVWIFMRVLNSWNTDRKKMAAKFFAYFVFNAIIFDIPGVGNVV 367
Query: 312 WSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEP 371
+ P +IL Y D +HEW FRSGLD Y +GM+ AY +P E +++ LE +
Sbjct: 368 FQPFYYILRYQD-------SMHEWLFRSGLDHYATFLGMLCAYNYPHYENFLKYLERKDD 420
Query: 372 ----KRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFS 427
K + IKA I V + +W+ I DK YN YHPYTS+IPI ++ F
Sbjct: 421 LNHYKLGIVIKAIIGIVIGIIILIWHSTIMYRDKYEYNLYHPYTSFIPILAFIYFRNLFP 480
Query: 428 LVK 430
+++
Sbjct: 481 ILR 483
>gi|260827533|ref|XP_002608719.1| hypothetical protein BRAFLDRAFT_120589 [Branchiostoma floridae]
gi|229294071|gb|EEN64729.1| hypothetical protein BRAFLDRAFT_120589 [Branchiostoma floridae]
Length = 666
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/332 (43%), Positives = 202/332 (60%), Gaps = 21/332 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L+ + FG IL YFY+CD ++ + Y+RDLFLFL LL V+ +LK
Sbjct: 246 LKYLVVFGGILLYFYLCDYDHIFPRRERTYSRDLFLFLCFLLFAVAGGFTLKPC------ 299
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYI 223
T LNR QTEEWKGWMQV+F+ YHY+AA E YN IR+FIA YVWMTGFGNFS++++
Sbjct: 300 ---TSSVLNRDQTEEWKGWMQVMFVWYHYYAAKETYNYIRVFIACYVWMTGFGNFSFFWV 356
Query: 224 RKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEI 283
RKD+SL R +M++RLNF V C+V NN+YMLYYIC MHT + + VY + + + +N+
Sbjct: 357 RKDYSLWRLLKMLFRLNFLVVCVCVVTNNEYMLYYICAMHTYWFLSVYAFMRVLSSWNQH 416
Query: 284 GSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDR 343
M K L F+ +I+++P V + ++ FILGY +HEW FR+GLD
Sbjct: 417 RWKMAAKFLVYFVCNTVIFDVPHVGETLFTLFRFILGYKGG-------MHEWMFRAGLDH 469
Query: 344 YIWIIGMIYAYYHPTAEKWMEKLEE-----SEPKRKLSIKAGIVTVALFVGYLWYECIYK 398
+ +IGM+ AY +P EK ++ LE+ +E + + IK + TV L +W+ I
Sbjct: 470 HATLIGMLCAYNYPNYEKLLKYLEKKFADPNEQRLAVMIKFVLSTVFLVAAVIWHTSIAS 529
Query: 399 LDKVTYNKYHPYTSWIPITYVLFIFYFFSLVK 430
+K YN HPYTSWIPI +F F L++
Sbjct: 530 KEKFDYNAIHPYTSWIPILSYIFFRNLFPLMR 561
>gi|390331513|ref|XP_794454.3| PREDICTED: uncharacterized protein LOC589726 [Strongylocentrotus
purpuratus]
Length = 809
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 154/396 (38%), Positives = 216/396 (54%), Gaps = 39/396 (9%)
Query: 37 SSHTKVHSDTNLVELE------KETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFM 90
+S T H+ +N E E KE IK+ + S + S + SS+I ++
Sbjct: 327 ASETNGHA-SNREERESNGVLFKEKIKDRYASENTGNQKSETTSQNITSSTISA--VKSP 383
Query: 91 TMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSA 150
+ FLL G IL YF+ C+ + + YNRD F+F LL+ +V+
Sbjct: 384 PSLEKFLL-------YAVGLGVILLYFFFCEVWDEWPAGERIYNRDQFMFFALLMFLVAG 436
Query: 151 MTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYV 210
+ +++ DK +NR QTEEWKGWMQV+F+ YHYF A E YN +R +IA YV
Sbjct: 437 VFTVRTCPDK---------LINRDQTEEWKGWMQVMFVWYHYFRAAETYNLVRFYIACYV 487
Query: 211 WMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMV 270
WMTGFGNFS++++RKDFSL R +MM+RLNF V +V +N YMLYYIC MHT + + V
Sbjct: 488 WMTGFGNFSFFWVRKDFSLWRMMKMMFRLNFLVILVVMVTDNSYMLYYICAMHTYWFLTV 547
Query: 271 YGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLP 330
Y + F +NE M +K + + +I++ P + IF PL FIL Y
Sbjct: 548 YLFMFTFRSWNENPRWMAIKFVVYLICNAIIFDTPLLVYIF-RPLWFILSYEGS------ 600
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRK-----LSIKAGIVTVA 385
LHEW FR+GLD Y +GM+ AY +P E+WM L++ R+ +SIK I+ V
Sbjct: 601 -LHEWQFRAGLDHYACFVGMLCAYNYPYYERWMNYLDKKHIDRRDKFLSVSIKGFIIGVL 659
Query: 386 LFVGYLWYECIYKLDKVTYNKYHPYTSWIPI-TYVL 420
L + +WY +K YN HPY SW PI TY++
Sbjct: 660 LLLLVVWYREFMMKEKYAYNAIHPYISWFPILTYII 695
>gi|413946696|gb|AFW79345.1| hypothetical protein ZEAMMB73_006676 [Zea mays]
Length = 207
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 125/184 (67%), Positives = 151/184 (82%), Gaps = 3/184 (1%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M VF P+TPGQVSFLLG+ PV ++WIYSE LEYKK SH KVHSD NL + TIKED+
Sbjct: 1 MEVFGPVTPGQVSFLLGLFPVLISWIYSEILEYKKSLSHGKVHSDANL---DNGTIKEDE 57
Query: 61 RAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYIC 120
++VLLEGG +S S + + S K NL+RF+TMD++FLLENRA LRAMAEFG +L YFYIC
Sbjct: 58 KSVLLEGGQLKSPSTKFRNLSTKANLLRFITMDESFLLENRAVLRAMAEFGVVLVYFYIC 117
Query: 121 DRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWK 180
DRT++ +S K+YNRDLFLFLY+LL+I S +TSLKKH++KS FSGK+I YLNRHQTEEWK
Sbjct: 118 DRTDIFPESKKSYNRDLFLFLYILLIIASTLTSLKKHHEKSAFSGKSILYLNRHQTEEWK 177
Query: 181 GWMQ 184
GWMQ
Sbjct: 178 GWMQ 181
>gi|260827535|ref|XP_002608720.1| hypothetical protein BRAFLDRAFT_73941 [Branchiostoma floridae]
gi|229294072|gb|EEN64730.1| hypothetical protein BRAFLDRAFT_73941 [Branchiostoma floridae]
Length = 569
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 138/332 (41%), Positives = 190/332 (57%), Gaps = 44/332 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L+ + FG IL YFY+CD ++ + Y+RDLFLFL
Sbjct: 172 LKYLVLFGGILLYFYLCDYDHIFPRRERTYSRDLFLFL---------------------- 209
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYI 223
QTEEWKGWMQV+F+ YHY+AA E YN IR+FIA+YVWMTGFGNFS++++
Sbjct: 210 ----------DQTEEWKGWMQVMFVWYHYYAAKETYNYIRVFIASYVWMTGFGNFSFFWV 259
Query: 224 RKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEI 283
RKD+SL R +M++RLNF V C+V NN+YMLYYIC MHT + + VY + + + +N+
Sbjct: 260 RKDYSLWRLLKMLFRLNFLVVCVCVVTNNEYMLYYICAMHTYWFLSVYAFMRVLSSWNQH 319
Query: 284 GSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDR 343
M K L F+ +I+++P V + ++P FILGY +HEW FR+GLD
Sbjct: 320 RWKMAAKFLVYFVCNTVIFDVPHVGETLFTPFKFILGYKGG-------MHEWMFRAGLDH 372
Query: 344 YIWIIGMIYAYYHPTAEKWMEKLEE--SEP-KRKLSIKAGIVTVALFVG--YLWYECIYK 398
+ ++GM+ AY +P EK + LE + P K L++ A V F YLW I
Sbjct: 373 HATLLGMLCAYNYPNYEKLLNYLERKFTAPYKTVLAVAAKFVLSVAFAAALYLWQTNIMY 432
Query: 399 LDKVTYNKYHPYTSWIPITYVLFIFYFFSLVK 430
+K YN HPYTSWIPI +F F L++
Sbjct: 433 KEKFEYNAIHPYTSWIPILSYIFFRNLFPLMR 464
>gi|224002785|ref|XP_002291064.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972840|gb|EED91171.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
Length = 416
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 183/314 (58%), Gaps = 7/314 (2%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
FG IL Y YIC+ K Y+RD F F +L+V+V+ S++++ D GK IQ
Sbjct: 7 FGIILLYSYICEHHPPYPHDEKVYDRDEFFFWTILVVVVAGWNSVRRNTDVKN-RGKRIQ 65
Query: 170 Y--LNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDF 227
+ LNR+QTEEWKGWMQ FL+YHY ATE+YN IR+ I YVWMTGFGNFS++Y+ D+
Sbjct: 66 HVILNRNQTEEWKGWMQFTFLLYHYMHATEVYNGIRVMITCYVWMTGFGNFSFFYMTNDY 125
Query: 228 SLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVM 287
SLPR QM+WRLNF V F C+ N Y+LYYICP+HT F +MVY + + + N +
Sbjct: 126 SLPRVLQMLWRLNFLVLFLCLTHGNPYILYYICPLHTYFFLMVYAVMYVGKEKNYTKWWI 185
Query: 288 IVKILACFLVVILIWEI-PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIW 346
K+ ++ L+W++ G+F+ F LGY + + EW+FRS LD +
Sbjct: 186 RTKLGVLAFIIFLVWDVDSGIFERVHR--LFFLGYEPTTGAPMGSMWEWYFRSYLDHWST 243
Query: 347 IIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNK 406
++GMI+A P + KLE R+ K+ + L +W +DK+ YN
Sbjct: 244 LLGMIFAVNFPIVSLFYRKLEARSRLRQWLGKSAVAAGILCALAMWTRGPMMMDKIQYNS 303
Query: 407 YHPYTSWIP-ITYV 419
+PY +IP ITY+
Sbjct: 304 TNPYFGFIPLITYI 317
>gi|443683754|gb|ELT87905.1| hypothetical protein CAPTEDRAFT_145222, partial [Capitella teleta]
Length = 414
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/296 (45%), Positives = 185/296 (62%), Gaps = 10/296 (3%)
Query: 133 YNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHY 192
Y RD FLFL LL + + + S+ +DK I + R QTEEWKGWMQVLF+ YHY
Sbjct: 9 YTRDTFLFLVFLLFLAAIVGSVSDTSDKILNRYSPICFAYRDQTEEWKGWMQVLFVWYHY 68
Query: 193 FAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNN 252
FAA E YN IR++IA YVWMTGFGNFS+++I++D+SL RF +M +RLNF V C + N
Sbjct: 69 FAAVEWYNWIRVYIACYVWMTGFGNFSFFWIKQDYSLWRFMKMFFRLNFLVILVCATVGN 128
Query: 253 DYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFW 312
+YMLYYIC MHT + + VY + IF +N S+M +K+ +I++IPGV + +
Sbjct: 129 EYMLYYICAMHTYWFLSVYFTMAIFPSWNAQTSMMTLKLFIYAACNYVIFDIPGVASVLF 188
Query: 313 SPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK 372
P ILG D D+ +HEW FR+GLD +I +GM+ AY +P E+++ E+
Sbjct: 189 KPFWLILGLND-GHGDV--MHEWVFRAGLDHWICFVGMLCAYNYPHFEQFIAYTEKQNEV 245
Query: 373 RKLS------IKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPI-TYVLF 421
R L IKA + + L V Y WYE I LDK +YN HPYTS+IP+ +Y+ F
Sbjct: 246 RFLRIPSGNWIKAAVGGLILCVFYAWYEHILPLDKFSYNHLHPYTSFIPVMSYIYF 301
>gi|156375566|ref|XP_001630151.1| predicted protein [Nematostella vectensis]
gi|156217166|gb|EDO38088.1| predicted protein [Nematostella vectensis]
Length = 414
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 130/312 (41%), Positives = 188/312 (60%), Gaps = 23/312 (7%)
Query: 111 GAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQY 170
G I+ YF++CD ++ + K Y+RD+F+FL+ +LV+V+ + +++K +K
Sbjct: 1 GFIMLYFWLCDFQHIWPKTDKQYSRDMFVFLFSVLVLVAFVFTVRKTPEK---------L 51
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLP 230
LNR QTEEWKGWMQV F+ YHYF A E++N+IR +I AYVWMTGFGNFSY++I+KD+S+
Sbjct: 52 LNRDQTEEWKGWMQVQFVWYHYFDAQEVFNSIRCYIGAYVWMTGFGNFSYFWIKKDYSIF 111
Query: 231 RFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVK 290
R +MM+RLNF V V N++++ YYIC MHT + + VY + + ++N VM K
Sbjct: 112 RLLKMMFRLNFLVVMVMAVTNHEFVRYYICAMHTYWFLSVYVMMAVGKQHNANRKVMAAK 171
Query: 291 ILACFLVVILIWEIPGV-FDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIG 349
+ + +LI+++PG + IFW P FIL L W FRS LD +G
Sbjct: 172 FIIYLVFNLLIFDVPGASWKIFW-PFQFILNVKG-------NLRYWIFRSTLDHLATWVG 223
Query: 350 MIYAYYHPTAEK---WMEKLEES--EPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTY 404
M+ AY +P E+ W+++ ES E R +K I + L LW I LD+ TY
Sbjct: 224 MLCAYNYPYLERLLSWLDRSHESDREKHRAWLLKGAITLIVLGAVTLWSHYILMLDRKTY 283
Query: 405 NKYHPYTSWIPI 416
HP+TSWIPI
Sbjct: 284 MVIHPFTSWIPI 295
>gi|299470477|emb|CBN78469.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1059
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 113/311 (36%), Positives = 174/311 (55%), Gaps = 14/311 (4%)
Query: 101 RATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDK 160
RA ++ G ++ + Y+C+ K ++ D+F + L+L++ SA+ K
Sbjct: 94 RAFSWELSRLGVLITFAYMCEHHPPFAHGEKAHDMDMFWCVALMLLVSSALNVRKS---- 149
Query: 161 SPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSY 220
K LNR QTEEWKGWMQ +FLMYHYF+A E+YN+IR+FI AYVWMTGFGNFS+
Sbjct: 150 -----KGGDVLNREQTEEWKGWMQFMFLMYHYFSAHEVYNSIRVFITAYVWMTGFGNFSF 204
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
+Y++ D+ R QM+WRLNF V C+ + N Y+LYYICP+HT + +VY +
Sbjct: 205 FYLKGDYGAVRLLQMLWRLNFLVILLCMAMGNTYILYYICPLHTFYFFVVYAIMSPAKSA 264
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDI-FWSPLTFILGYTDPAKPDLPRLHEWHFRS 339
N + M K+L ++ +W+ +D+ + + F LG + EW+FR+
Sbjct: 265 NYTKNGMRWKLLVAGSIIFCVWD----WDLHIFEKIFFFLGREKVVGAGNGTMWEWYFRT 320
Query: 340 GLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKL 399
LD + +GMI+A +P +W++K+E R+ +IK + V L W I L
Sbjct: 321 SLDHWSTFLGMIFALNYPATAQWVKKIESLPFGRQWAIKGSVAAVLLSATAWWAANILPL 380
Query: 400 DKVTYNKYHPY 410
+K+ YN+ + Y
Sbjct: 381 EKLVYNQKNAY 391
>gi|242072204|ref|XP_002446038.1| hypothetical protein SORBIDRAFT_06g000800 [Sorghum bicolor]
gi|241937221|gb|EES10366.1| hypothetical protein SORBIDRAFT_06g000800 [Sorghum bicolor]
Length = 236
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 97/135 (71%), Positives = 116/135 (85%), Gaps = 2/135 (1%)
Query: 287 MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY--TDPAKPDLPRLHEWHFRSGLDRY 344
M VKI +CFL VILIWEIPGVF+I W+PLTF++GY +P+K +LP LHEWHFRSGLDRY
Sbjct: 1 MSVKIASCFLTVILIWEIPGVFEIVWAPLTFLIGYKNPEPSKVNLPLLHEWHFRSGLDRY 60
Query: 345 IWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTY 404
IWIIGMIYAY+HP E+WMEKLEESE K ++ IK IVT+++ +G+LWYE IYKLDK TY
Sbjct: 61 IWIIGMIYAYFHPNVERWMEKLEESEIKVRVLIKGTIVTISVMIGHLWYEYIYKLDKHTY 120
Query: 405 NKYHPYTSWIPITYV 419
NKYHPYTSWIPIT++
Sbjct: 121 NKYHPYTSWIPITWL 135
>gi|294950327|ref|XP_002786574.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239900866|gb|EER18370.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 621
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 171/310 (55%), Gaps = 31/310 (10%)
Query: 66 EGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNL 125
+G S+ + RL SS + MD L R+T + GAIL Y + C+ +
Sbjct: 62 DGEASKQSDDRLTSSFNGS-------MDSWKQLLLRST-----QLGAILLYAFACEWAPV 109
Query: 126 LGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQV 185
+ Y+RD+F FL+ + ++ S K ++ P K + + R+ +EE KGW+Q
Sbjct: 110 YPHGLREYSRDIFWFLFAVFIL--HCLSWGK-TERRPEEAKLVVF-GRNNSEELKGWLQF 165
Query: 186 LFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAF 245
LFL YHYF AT++YN IR+F++AYVWMTGFGNFSY+Y + D+SL R M+WRLN
Sbjct: 166 LFLAYHYFHATDVYNMIRVFVSAYVWMTGFGNFSYFYTQNDYSLGRLVSMLWRLNMSAVL 225
Query: 246 CCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIP 305
C+ LN Y+LYYI P+HT + ++ Y +G F N +M VK+ A L + LIW++
Sbjct: 226 LCLALNTTYILYYIVPLHTFYFLLTYVTMGAFRSANYHRWLMKVKLTALGLAIYLIWDVD 285
Query: 306 GV-FDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWME 364
FD+ + P IL T L+EWHFR+GLD + +P+ KWME
Sbjct: 286 HSWFDVVFGP---ILPSTALQGAKAGVLYEWHFRTGLD-----------HCYPSTTKWME 331
Query: 365 KLEESEPKRK 374
+E+ P R+
Sbjct: 332 MVEKLPPTRE 341
>gi|219118178|ref|XP_002179869.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408922|gb|EEC48855.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 469
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 186/324 (57%), Gaps = 16/324 (4%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTI- 168
FG IL Y Y+C+ K Y+RD F L+++++ +LK++ S +++
Sbjct: 73 FGCILLYAYLCEYHPPFPHGVKTYDRDEFF-FLTALLLLASAFTLKRNQPLGTGSSRSVA 131
Query: 169 ------QYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QTEEWKGWMQ +FL+YHYF A E+YNAIR+ I YV+MTG+GNFS++Y
Sbjct: 132 PSVEATEILNRDQTEEWKGWMQFMFLLYHYFHAEEVYNAIRVMITCYVFMTGYGNFSFFY 191
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNE 282
I+ D+S+ R QM+WRLNF V F C+ Y+LYYIC +HT F +MVY + I N
Sbjct: 192 IKGDYSIVRVMQMLWRLNFLVVFLCLSQGTTYILYYICLLHTYFFLMVYVTMKIGTDLNY 251
Query: 283 IGSVMIVKILACFLVVILIWEI-PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGL 341
+ +K+ L++ ++W++ G+F + P LG L EW+FRS L
Sbjct: 252 SKWGIRLKLGGLALLIFIVWDVDSGIFRLLHWPF---LGEVPVLGATSGSLWEWYFRSTL 308
Query: 342 DRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYEC-IYKLD 400
D + I+GMI+A P + KLE + ++ KA I+ +AL + W+ C ++L
Sbjct: 309 DHWSTILGMIFALNFPITSLFFRKLESLPFGQHVAAKA-ILGLALGGIFYWWVCWPFQLS 367
Query: 401 KVTYNKYHPYTSWIPITYVLFIFY 424
K YN+ + Y IP+ +++I+Y
Sbjct: 368 KFDYNQTNSYFGCIPV--LVYIYY 389
>gi|428167294|gb|EKX36256.1| hypothetical protein GUITHDRAFT_79030, partial [Guillardia theta
CCMP2712]
Length = 424
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/310 (35%), Positives = 172/310 (55%), Gaps = 14/310 (4%)
Query: 113 ILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLN 172
IL + Y + + K ++ D+F F LL + S K D L
Sbjct: 37 ILLFTYQSEYYPVFPHLAKEHDMDMFWFATLLFLAYSFTRWQKSRTD---------DLLG 87
Query: 173 RHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRF 232
R QTEEWKGWMQ +FL+YHY AA+E+YN IR+ I +Y+WMTGFGNFS++YI+KD+ + R
Sbjct: 88 REQTEEWKGWMQFMFLLYHYCAASEVYNIIRVMITSYLWMTGFGNFSFFYIKKDYGIVRV 147
Query: 233 AQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKIL 292
QMMWRLNFFV IV N Y+LYYICP+HT F ++ + + + + N + +K+
Sbjct: 148 LQMMWRLNFFVFLLMIVQGNTYILYYICPLHTFFFLVTWLTMRVCSNLNYTKWGVRIKLS 207
Query: 293 ACFLVVILIWE-IPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMI 351
+V+ +W+ P +FD+ + +G L+EW+FR+ LD + +GMI
Sbjct: 208 VVAVVIFAVWDGAPWLFDLIFGSF---MGRAPCKGATSGFLYEWYFRTTLDHWSTFLGMI 264
Query: 352 YAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYT 411
+A +P W ++L + + + + V+L + LW+ I+ L K+ YN + +
Sbjct: 265 FALNYPFTNMWFKELAKLPAASQGKVTLVVAAVSLTLMVLWFCFIFTLPKLQYNNTNAFL 324
Query: 412 SWIPI-TYVL 420
+ +P+ +Y+L
Sbjct: 325 APVPVLSYIL 334
>gi|449459412|ref|XP_004147440.1| PREDICTED: CAS1 domain-containing protein 1-like [Cucumis sativus]
Length = 151
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 89/126 (70%), Positives = 108/126 (85%)
Query: 7 ITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLE 66
ITPGQ+SFLLGI P+FV+WIYSEFLEY+K S+ K HSD NL +L T+KEDD+AVLLE
Sbjct: 7 ITPGQISFLLGISPIFVSWIYSEFLEYRKSSAPPKAHSDINLADLGGVTVKEDDQAVLLE 66
Query: 67 GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLL 126
GGL+R ASA++ SSSI TNLIRF T+DD FLLENR+TLRAM+EFGAIL YF++CDRT++L
Sbjct: 67 GGLARPASAKIHSSSITTNLIRFFTLDDTFLLENRSTLRAMSEFGAILLYFFVCDRTSIL 126
Query: 127 GDSTKN 132
DS K+
Sbjct: 127 ADSKKD 132
>gi|320163752|gb|EFW40651.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 708
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 179/351 (50%), Gaps = 70/351 (19%)
Query: 87 IRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLV 146
+R + A L E+ L+A+ +FGAI+ YFY+CD ++ ++YNRD++ + +L +
Sbjct: 82 LRLWQVSTATLQESVQLLQAVVQFGAIMLYFYVCDMDHIWDVGLRSYNRDVYASIVVLAL 141
Query: 147 I---------VSAMTSLKK----------HNDKSPFSGKTIQY----------------- 170
+ + ++ +K H P + + +
Sbjct: 142 LVGLLVSVKVIGGQSTNEKIRAAAEAAALHGANRPGADGLLPHHTAVAPAPAPAAARASD 201
Query: 171 ---------LNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYY 221
LNR Q++EWKGWMQ +F++YH+F A EIYN IR+F+AAYV++TGFGNFSY+
Sbjct: 202 IYAPGKPVILNRDQSDEWKGWMQCMFVLYHFFKAREIYNEIRVFVAAYVFLTGFGNFSYF 261
Query: 222 YIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYN 281
+ +KDFS+ R +M++RLNF V + NN Y+LYYICPMHT + +++Y + +
Sbjct: 262 WTKKDFSVFRLVKMLFRLNFLVLLVMVTTNNRYVLYYICPMHTFWFLVIYLCMFACKSHY 321
Query: 282 EIGSVMIVKILACFLVVILIWEIPGVFD------------------IFWSPLTFILGYTD 323
VM F + LIW+ P + +FW P FIL +D
Sbjct: 322 NNRLVMGTMFTVLFGLAFLIWDWPLSDEEQPKSVTSRSVSAIVGNVVFW-PFKFILSESD 380
Query: 324 PAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRK 374
L EW RSGLD + +GM++AY+HPT ++++ LEE +
Sbjct: 381 GT------LKEWMIRSGLDHFAAPVGMLFAYFHPTLQRFLVWLEEGSASAR 425
>gi|413925799|gb|AFW65731.1| hypothetical protein ZEAMMB73_861291 [Zea mays]
Length = 384
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/122 (71%), Positives = 102/122 (83%), Gaps = 7/122 (5%)
Query: 179 WKGWMQVLFL-------MYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPR 231
WK +V L +YHYFAA+EIYNAI +FIA YVWMTGFGNFSYYYI+KDFS+ R
Sbjct: 217 WKASSEVDILHGGQFARVYHYFAASEIYNAICVFIACYVWMTGFGNFSYYYIKKDFSIAR 276
Query: 232 FAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKI 291
FAQMMWRLNFFVAFCCIVL+ND MLYYICPMHTLFT+MVYG++G+FNK NE+ S+M +KI
Sbjct: 277 FAQMMWRLNFFVAFCCIVLDNDLMLYYICPMHTLFTLMVYGSLGLFNKCNEVPSIMAIKI 336
Query: 292 LA 293
+
Sbjct: 337 VC 338
>gi|413956402|gb|AFW89051.1| hypothetical protein ZEAMMB73_733469 [Zea mays]
Length = 290
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 83/105 (79%), Positives = 97/105 (92%)
Query: 189 MYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCI 248
+YHYFAA+EIYNAI +FIA YVWMTGFGNFSYYYI+KDFS+ RFAQMMWRLNFFVAFCCI
Sbjct: 140 VYHYFAASEIYNAICVFIACYVWMTGFGNFSYYYIKKDFSIARFAQMMWRLNFFVAFCCI 199
Query: 249 VLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILA 293
VL+ND MLYYICPMHTLFT+MVYG++G+FNK NE+ S+M +KI+
Sbjct: 200 VLDNDLMLYYICPMHTLFTLMVYGSLGLFNKCNEVPSIMAIKIVC 244
>gi|449673388|ref|XP_002169412.2| PREDICTED: uncharacterized protein LOC100207689, partial [Hydra
magnipapillata]
Length = 434
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 163/284 (57%), Gaps = 16/284 (5%)
Query: 41 KVHSDTNLVELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLEN 100
K++ +V +E I EG + + A+L K+ ++ T D
Sbjct: 156 KINEKNAIVPKTREEINLSSPKKDKEGAVFKVHLAKL-----KSGFLKLCTSHDKEQPSF 210
Query: 101 RATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDK 160
L M FG +FY ++ D ++ K Y+RD+F+FL+ +LV V+ + +++K DK
Sbjct: 211 EKFLLKMFIFGWFMFYIFLSDFLHIWPKVNKQYSRDMFVFLFFILVFVAVIFTIRKTKDK 270
Query: 161 SPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSY 220
+LNR QTEEWKGWMQV F+ YHYF A E +N+IR ++ AYVWMTGFGN+ Y
Sbjct: 271 ---------FLNRDQTEEWKGWMQVQFVWYHYFDANETFNSIRCYVGAYVWMTGFGNYIY 321
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVL-NNDYMLYYICPMHTLFTIMVYGAVGIFNK 279
+ +KD+S+ R +M++RLNF V FC + + N++++ YYIC MHT + + V+G + + N+
Sbjct: 322 FSSQKDYSIFRLLKMLFRLNFLV-FCVMAMANHEFVRYYICAMHTYWFLSVWGVMVVLNR 380
Query: 280 YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTD 323
YN+ M++K F + LI+ +PG D ++P +IL D
Sbjct: 381 YNDNPKFMLLKFFIYFAINALIFNLPGASDFVFAPFIWILHDKD 424
>gi|384497183|gb|EIE87674.1| hypothetical protein RO3G_12385 [Rhizopus delemar RA 99-880]
Length = 772
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 101/311 (32%), Positives = 169/311 (54%), Gaps = 22/311 (7%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
FG + Y Y DRT L G K+++ +F + + + I+ K F
Sbjct: 303 FGLCVIYMYFGDRTQLFGKIQKHFDVSMFTVMMIAIGILGVSKLQHKTEGDQGF------ 356
Query: 170 YLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKD 226
LNRHQT+EWKGWMQV+ L+YH+ A+ I YNA+RI +AAY++ TG+G+F ++Y + D
Sbjct: 357 -LNRHQTDEWKGWMQVIILVYHFCGASRISGIYNAVRILVAAYLFQTGYGHFFFFYKKAD 415
Query: 227 FSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSV 286
F + R +M RLN +++ DY+ YY P+ + + ++++ + +++N++ S
Sbjct: 416 FGMGRVLNVMVRLNLLTFVLQYLMDTDYLSYYFAPLVSFWFLVIWVVMYAGHQWNQVPSF 475
Query: 287 MIVKI-LACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYI 345
++VK+ ++CF +LI + PG+ + + L + A EW FR LD YI
Sbjct: 476 LLVKLAVSCFFTTVLI-KTPGILEFIFDLLQVVFNTHWNAA-------EWRFRLALDAYI 527
Query: 346 WIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYN 405
+GM+ AY A ++ K+ + + A +++V V Y W+E + +DK YN
Sbjct: 528 VYVGMLCAYAFIKAAEF--KIADHPKWSIMKRLAMVLSVLALVWYFWFE-LNCVDKFAYN 584
Query: 406 KYHPYTSWIPI 416
+ HP+ SWIPI
Sbjct: 585 RSHPFISWIPI 595
>gi|403172587|ref|XP_003331718.2| hypothetical protein PGTG_12883 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169912|gb|EFP87299.2| hypothetical protein PGTG_12883 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 893
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 179/324 (55%), Gaps = 27/324 (8%)
Query: 95 AFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIV---SAM 151
++L + L +MA FG + ++ DRT++ K ++R F L LL ++ SA+
Sbjct: 359 SYLFPSNDRLNSMAIFGGSIVLIFLSDRTSMFNKEQKQFDRLQFGTLNLLALVAGIWSAV 418
Query: 152 TSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAA 208
TS K + LNR QT+EWKGWMQ+ L+YHY AA++ IYN IRI +A+
Sbjct: 419 TSEKG----------DMGLLNREQTDEWKGWMQIAILIYHYLAASQVSGIYNPIRICVAS 468
Query: 209 YVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTI 268
Y++MTG+G+F+YYY++KDFS PR ++ RLN V++ DY+ YY P+ +++ +
Sbjct: 469 YLFMTGYGHFTYYYLKKDFSFPRILSVLVRLNLLTLVLAYVMDTDYLSYYFSPLVSMWFL 528
Query: 269 MVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPD 328
+++ + I +++N+ V ++ LAC VV+++W F PL ++ G+ +
Sbjct: 529 IIWATMFIGHQWND-RMVFLLPKLACS-VVLIVW-----FFKADKPLGWVFGFINLVFGT 581
Query: 329 LPRLHEWHFRSGLDRYIWIIGMIYA--YYHPTAEKWMEKLEESEPKRKLSIKAGIVTVAL 386
+EW FR LD +I GM+ + Y KW+E+ + + I I + L
Sbjct: 582 EWNANEWRFRVTLDMFIVYWGMLTSLIYLKVKEHKWLERDHSNRFDQLRKIAIWISALGL 641
Query: 387 FVGYLWYECIYKLDKVTYNKYHPY 410
V + W+E I + +K+ YN YHPY
Sbjct: 642 -VWFFWFE-ISRPNKLIYNLYHPY 663
>gi|303291252|ref|XP_003064912.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453583|gb|EEH50892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 266
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/286 (36%), Positives = 150/286 (52%), Gaps = 48/286 (16%)
Query: 70 SRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDS 129
++ +SA L + + ++R + D L +N TLRA AE G IL +
Sbjct: 17 TQKSSASTLQALGRLAMVRCLRGDPHALKQNHRTLRAWAELGVIL--------------T 62
Query: 130 TKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLM 189
NY RD+F L++ + I + TSL + +P SG HQTEEWKGWMQ++
Sbjct: 63 LINYERDIFSALFVAMSIFAFCTSLTPQHSATPVSG--------HQTEEWKGWMQII--- 111
Query: 190 YHYFAATEIYNAIRIFIAAYVWMTGFGNFSYY-YIRKDF--SLPRFAQMMWRLNFFVAFC 246
AI +FIA+YVWMTG G+FS+Y Y +K++ S+ RF Q+ WR NFFV+ C
Sbjct: 112 -----------AIPVFIASYVWMTGLGDFSFYLYSQKEYVHSVRRFLQVTWRFNFFVSLC 160
Query: 247 CI-VLNNDYMLYYICPMHTLF----TIMVYGA----VGIFNKYNEIGSVMIVKILACFLV 297
C+ V N L + P +T +Y A G +N+ N+ V++ K+ C +
Sbjct: 161 CLKVFNYAKRLSSVLPSNTFVYRSCAQTIYRAHLLLAGAWNEKNDNTVVLLQKLAVCASL 220
Query: 298 VILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDR 343
V + WEIPGVF + P TF+L YT+P + L E F SGLDR
Sbjct: 221 VYVTWEIPGVFHACFRPNTFLLKYTNPERQVDDPLRECFFSSGLDR 266
>gi|328859806|gb|EGG08914.1| hypothetical protein MELLADRAFT_22930 [Melampsora larici-populina
98AG31]
Length = 706
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 172/318 (54%), Gaps = 22/318 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L +A FG + ++ DRT K +N F L LL + V +TS + +DK
Sbjct: 317 LGPLAAFGYSIVLIFLADRTTFFNKEQKQFNGWWFGLLNLLGLAVGILTS--QVSDKGDL 374
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSY 220
LNR QT+EWKGWMQ+ L+YHY +A++I YN IR+ +A+Y++MTG+G+F++
Sbjct: 375 G-----LLNREQTDEWKGWMQIAILIYHYLSASKISGIYNPIRVCVASYLFMTGYGHFTF 429
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
+Y +KDF L R +M RLN +++ DY+ YY P+ +++ ++++ + + +++
Sbjct: 430 FYKKKDFGLSRIVGVMVRLNLLTLVLAYIMDTDYLSYYFSPLVSMWFMIIWVTMYVGHQW 489
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSG 340
N+ +IVK++ +V +++ +PL F + EW FR
Sbjct: 490 NDRLDFLIVKLIGSATLVTFLFQST-------TPLKFTFAVLNKVFQTQWFATEWAFRVT 542
Query: 341 LDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK--RKLSIKAGIVTVALFVGYLWYECIYK 398
LD YI GMI A + ++ KL E P+ +K+ + I++ V + W+E + +
Sbjct: 543 LDMYIVYWGMIAALVYIKVKE--SKLIERNPETWQKVWTASIILSGLGIVWFFWFE-LTR 599
Query: 399 LDKVTYNKYHPYTSWIPI 416
+K+ YN+ HPYTS IPI
Sbjct: 600 SNKLEYNQTHPYTSIIPI 617
>gi|401884477|gb|EJT48636.1| O-acetyltransferase [Trichosporon asahii var. asahii CBS 2479]
gi|406694076|gb|EKC97412.1| O-acetyltransferase [Trichosporon asahii var. asahii CBS 8904]
Length = 894
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 111/354 (31%), Positives = 179/354 (50%), Gaps = 46/354 (12%)
Query: 96 FLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLK 155
F+ +A L A++ FG + Y + DRT + K+++ +F L ++ ++ T
Sbjct: 378 FVPGTKAAL-AVSTFGLSVTYLWFTDRTTMFLKENKDWDPKVFTILSVVSLVAGLATVRN 436
Query: 156 KHNDKSPFSGKTIQYLNRHQTEEWKGWMQ-VLFLMYHYFAATEI---YNAIRIFIAAYVW 211
+ GK + +LNR T+EWKGWMQ V L+YH+ A++I YN IR+ +AAY++
Sbjct: 437 R--------GKDLGFLNRDLTDEWKGWMQTVAILIYHFVGASKISGIYNPIRVLVAAYLF 488
Query: 212 MTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
MTG+G+F +YY + DF L R A ++ RLN +N +Y+ YY P+ + + +++Y
Sbjct: 489 MTGYGHFFFYYKKADFGLQRVATVLVRLNLLSVVLPYTMNTNYVFYYFAPLVSWWYLIIY 548
Query: 272 GAVGIFNKYNEIGSVMIVKILA-----------CFLVVILIWEIPGVFDIFWSPLTFILG 320
G + + ++YN+ + ++ K++ FL+ L + VF I W+
Sbjct: 549 GVMALGSQYNDRAAFLLPKLVLSAVAIAAFMHYSFLMEYLFAFLNTVFKIQWT------- 601
Query: 321 YTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG 380
EW FR LD YI GM+ AY + A++ +L E +L A
Sbjct: 602 -----------AREWTFRVTLDLYIVWGGMLTAYAYIKAKE--HRLTEQPYFNQLRGGAC 648
Query: 381 IVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPI-TYVLFIFYFFSLVKHLS 433
+ +VA V Y WYE ++ K YNKYH Y S +PI YV+ L +H S
Sbjct: 649 VASVAALVWYFWYE-LHLESKFVYNKYHAYVSIVPILGYVVLRNASSRLRQHSS 701
>gi|58262436|ref|XP_568628.1| O-acetyltransferase [Cryptococcus neoformans var. neoformans JEC21]
gi|134118932|ref|XP_771969.1| hypothetical protein CNBN1490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817708|sp|P0CM57.1|CAS1_CRYNB RecName: Full=Probable O-acetyltransferase CAS1; AltName:
Full=Capsule synthesis protein 1
gi|338817709|sp|P0CM56.1|CAS1_CRYNJ RecName: Full=Probable O-acetyltransferase CAS1; AltName:
Full=Capsule synthesis protein 1
gi|17063556|gb|AAL35099.1|AF355592_1 O-acetyltransferase [Cryptococcus neoformans var. neoformans]
gi|50254573|gb|EAL17322.1| hypothetical protein CNBN1490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57230802|gb|AAW47111.1| O-acetyltransferase [Cryptococcus neoformans var. neoformans JEC21]
Length = 960
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 166/321 (51%), Gaps = 23/321 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ FG + Y ++ DRT++ K+Y+ +F + L V+ + ++K SG
Sbjct: 434 ALSTFGLAMGYLFLADRTHVFQKEQKDYDAVIFGMI-TLAAFVAGLLTIKN-------SG 485
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYY 222
K + +LNR T+EWKGWMQ+ L+YH+F A++I YN IR+ +A+Y++MTG+G+F +YY
Sbjct: 486 KDLGFLNRDITDEWKGWMQIAILIYHFFGASKISGIYNPIRVLVASYLFMTGYGHFFFYY 545
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNE 282
+ DF R ++ RLN +N DY YY P+ + + +++Y + I +KYN+
Sbjct: 546 KKADFGFQRVVMVLVRLNLLSVVLPYTMNTDYAFYYFAPLVSWWYLIIYATMAIGSKYND 605
Query: 283 IGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLD 342
+ ++ K+ C +V L P + + + L + AK EW FR LD
Sbjct: 606 RPAFLLTKLFTCAGLVTLFMHFPWLMEDVFKVLNTVFNIQWSAK-------EWSFRVTLD 658
Query: 343 RYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVAL-FVGYLWYECIYKLDK 401
+I +GM+ AY K+ E P + A +V L + Y W+E ++ K
Sbjct: 659 LFIVWVGMLCAYGF---VKFNEHQISDRPWFPVMRTATLVGSVLGMIWYFWFE-LHLASK 714
Query: 402 VTYNKYHPYTSWIPITYVLFI 422
YN+YH +PI +F+
Sbjct: 715 FVYNEYHAVVCIVPIMSFVFL 735
>gi|353238692|emb|CCA70630.1| related to O-acetyltransferase CAS1 EC=2.3.1.--Cryptococcus
neoformans [Piriformospora indica DSM 11827]
Length = 894
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/333 (32%), Positives = 163/333 (48%), Gaps = 49/333 (14%)
Query: 105 RAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFS 164
+ + FG++++ I DRT L S K +N +F L +L+++ T K D P
Sbjct: 361 KPLTIFGSVVWLCRIADRTGLWLKSQKQFNPWIFTMLSVLIIVACLATVKKGSKDLGP-- 418
Query: 165 GKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYY 221
LNR QT+EWKGWMQ+ L+YHY A+ I Y IR+ +A+Y++MTG+G+ YY
Sbjct: 419 ------LNRQQTDEWKGWMQLFILIYHYMGASRIAGIYAPIRVLVASYLFMTGYGHTMYY 472
Query: 222 YIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYN 281
+ DFS+ R AQ++ RLN ++ DY+ YY P+ + + ++VY + + +N
Sbjct: 473 LRKADFSILRVAQVLVRLNLLTIILAYTMDTDYIFYYFAPLVSFWYLVVYFTLLGASSHN 532
Query: 282 EIGSVMIVKILACFLVVILI----WEIPGVF-------DIFWSPLTFILGYTDPAKPDLP 330
++ KI+A FLVV +I W I GVF +I WSP
Sbjct: 533 TKPIFVLCKIIASFLVVAIIHSQVWLIEGVFRLLRTLANINWSP---------------- 576
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYY-----HPTAEKWMEKLEESEPKRKLSIKAGIVTVA 385
EW FR LD I +GMI A P + P + S + +A
Sbjct: 577 --KEWAFRVNLDYLIVYVGMIIALVLHTLDSPGSAGVGTARLVDHP--RWSWLRHTLLLA 632
Query: 386 LFVGYLWYEC--IYKLDKVTYNKYHPYTSWIPI 416
F+G +W+ + + DK YN +HPY S +P+
Sbjct: 633 SFIGLVWFAAFELSRKDKFAYNLWHPYVSVVPV 665
>gi|50547609|ref|XP_501274.1| YALI0C00187p [Yarrowia lipolytica]
gi|49647141|emb|CAG81569.1| YALI0C00187p [Yarrowia lipolytica CLIB122]
Length = 834
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/325 (31%), Positives = 165/325 (50%), Gaps = 26/325 (8%)
Query: 98 LENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKH 157
+++ L A A L Y + CDRT G S+K + F L LL ++ + S +
Sbjct: 368 IQDETLLTACVVIAASLAYSFFCDRTQFFGKSSKQFEASEFWVLILLFLVATGY-SFEPQ 426
Query: 158 NDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTG 214
ND S +LNRHQTEEWKGWMQ++ L+YH A++I Y +R+ +AAY++MTG
Sbjct: 427 NDNS--------FLNRHQTEEWKGWMQIIILIYHITGASKILPIYKFVRVLVAAYLFMTG 478
Query: 215 FGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAV 274
FG+ +++ R DFSL R +++R+NF V++ DY+ YY P+ + + +V+
Sbjct: 479 FGHATFFIKRGDFSLKRATSVLFRMNFLSILLAYVMDTDYLFYYFAPLVSFWFCVVWITF 538
Query: 275 GIFNKYNEIGSV---MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPR 331
+ +N I + ++ KI A +++ ++ F I ++ L ++
Sbjct: 539 RVLPSWNSIDTSVGPVLAKISASAVILNILVRFQLPFQIVFAVLKYLFNIQ-------WN 591
Query: 332 LHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYL 391
L EW FR LD YI IGM A ++ L+ R + G+V + F+ Y+
Sbjct: 592 LREWRFRLILDIYIVYIGMFAAVATLRYKQGNFPLKNLVTNR---LVLGVVALITFIVYI 648
Query: 392 WYECIYKLDKVTYNKYHPYTSWIPI 416
+ + K YN+ HPY SW+PI
Sbjct: 649 AVAASFGI-KQNYNQAHPYISWMPI 672
>gi|392574602|gb|EIW67738.1| hypothetical protein TREMEDRAFT_33393 [Tremella mesenterica DSM
1558]
Length = 974
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 169/335 (50%), Gaps = 22/335 (6%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
+++ FG + Y ++ DRT + K+++ +F L + ++V T + G
Sbjct: 457 SLSTFGLAIGYLFMADRTTVFLKEGKDFDPWIFASLIIAFLVVGLATMKNR--------G 508
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
K + +LNR T+EWKGWMQ+ L+YH+ A++ IYN IR+ +AAY++MTG+G+F +YY
Sbjct: 509 KDLGFLNREITDEWKGWMQIAILVYHFLGASKVSGIYNPIRVLVAAYLFMTGYGHFFFYY 568
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNE 282
+ DF R A ++ RLN +N DY YY P+ + + +++Y + + +YN
Sbjct: 569 KKADFGFDRVAAVLVRLNLLSVVLPYTMNTDYAFYYFAPLVSWWYLIIYATMALGRQYNP 628
Query: 283 IGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLD 342
+I K+L C V L P + + +S L I A+ EW FR LD
Sbjct: 629 RPVFLIPKLLLCAGAVTLFMRHPVLLEHIFSILHSIFRIQWSAR-------EWSFRVTLD 681
Query: 343 RYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKV 402
+I GM AY + +++ L E L I +++ F+ Y W+E L K
Sbjct: 682 LFIVWCGMFTAYGYIKIKEY--GLIEKPYFNSLRITTLVISFIGFIWYFWFEL--SLSKF 737
Query: 403 TYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLY 437
TYN YH + IPI +F+ ++ +S +L+
Sbjct: 738 TYNNYHAVVASIPILAFVFLRNANPRLRSVSSALF 772
>gi|342319811|gb|EGU11757.1| O-acetyltransferase [Rhodotorula glutinis ATCC 204091]
Length = 1196
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 159/311 (51%), Gaps = 23/311 (7%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
FG ++ DRTNL K Y+ F L LL V+ +T D +
Sbjct: 367 FGFSTTLLFVADRTNLFLKENKQYDALTFAVLCLLTVVAGVVTMKPPEKD--------LG 418
Query: 170 YLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKD 226
+LNR QT+EWKGWMQ+ L+YHY A++I YN IR+ +AAY++MTG+G+ S++ + D
Sbjct: 419 FLNRDQTDEWKGWMQIAILIYHYLGASKISGIYNPIRVLVAAYLFMTGYGHLSFFLKKAD 478
Query: 227 FSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSV 286
F R A ++ RLN +++ DY+ YY P+ T++ +++ + +KYN+ +
Sbjct: 479 FGFARVANILVRLNLLTVVLAYLMDTDYLSYYFSPLVTIWFGIIWVTLWAGHKYNDKPAF 538
Query: 287 MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIW 346
+I K++ + +EIPG + + + + + A EW FR LD +I
Sbjct: 539 LITKLVVAACLTAAFFEIPGPLEKTFELINTVFATSWNAA-------EWRFRQTLDMWIV 591
Query: 347 IIGMIYAYYHPTAEKWMEKLEESEPK-RKLSIKAGIVTVALFVGYLWYECIYKLDKVTYN 405
+G AY +++ + P ++ +I A VT+A GY +E + + K YN
Sbjct: 592 WVGAFTAYAFIKIKEYRVTDDLRWPSWQRWTIIASAVTMA---GYFVFE-LTRESKFVYN 647
Query: 406 KYHPYTSWIPI 416
YHPY S +P+
Sbjct: 648 AYHPYVSILPV 658
>gi|74622990|sp|Q8X226.1|CAS1_CRYNH RecName: Full=Probable O-acetyltransferase CAS1; AltName:
Full=Capsule synthesis protein 1
gi|17063558|gb|AAL35100.1|AF355593_1 O-acetyltransferase [Cryptococcus neoformans var. grubii]
gi|405123904|gb|AFR98667.1| O-acetyltransferase [Cryptococcus neoformans var. grubii H99]
Length = 959
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 166/335 (49%), Gaps = 35/335 (10%)
Query: 98 LENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKH 157
L + + A++ FG + Y ++ DRT++ K+Y+ +F + V+ + ++K
Sbjct: 425 LPSPSIAPALSTFGLAVGYLFLADRTHVFQKEQKDYDAVVFGVI-TFAAFVAGLLTIKN- 482
Query: 158 NDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTG 214
SGK + +LNR T+EWKGWMQ+ L+YH+F A++I YN IR+ +A+Y++MTG
Sbjct: 483 ------SGKDLGFLNRDITDEWKGWMQIAILIYHFFGASKISGIYNPIRVLVASYLFMTG 536
Query: 215 FGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAV 274
+G+F +YY + DF R ++ RLN +N DY YY P+ + + +++Y +
Sbjct: 537 YGHFFFYYKKADFGFQRVVMVLVRLNLLSVVLPYTMNTDYAFYYFAPLVSWWYLIIYATM 596
Query: 275 GIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHE 334
+KYN+ + ++ K+ C +V L P + + + L + AK E
Sbjct: 597 AFGSKYNDRPAFLLAKLFTCAGLVTLFMHFPWLMEDVFKVLNTVFNIQWSAK-------E 649
Query: 335 WHFRSGLDRYIWIIGMIYAY-------YHPTAEKWMEKLEESEPKRKLSIKAGIVTVALF 387
W FR LD +I GM+ AY Y + W + + I +V
Sbjct: 650 WSFRVTLDLFIVWAGMLCAYGFVKFKEYQISDRPWFPTMHTA---------TLIGSVLGM 700
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+ Y W+E ++ +K YN+YH +PI +F+
Sbjct: 701 IWYFWFE-LHLANKFVYNEYHAVVCIVPIISFIFL 734
>gi|409051775|gb|EKM61251.1| hypothetical protein PHACADRAFT_247738 [Phanerochaete carnosa
HHB-10118-sp]
Length = 799
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 158/307 (51%), Gaps = 32/307 (10%)
Query: 118 YICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTE 177
+I DRT K +N F FL +L +++ +T + D+ LNR QT+
Sbjct: 315 FIADRTGYWLKEHKQFNPWTFAFLCVLCLVIGLLTVKRADRDQG--------ILNRDQTD 366
Query: 178 EWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQ 234
EWKGWMQ+ L+YHY A++I YN IR+ +A+Y++MTG+G+ ++Y + DF R AQ
Sbjct: 367 EWKGWMQIAILIYHYLGASKISGIYNPIRVLVASYLFMTGYGHATFYLKKADFGFIRVAQ 426
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVK-ILA 293
++ RLN F +N DY+ YY P+ + + ++VYG + + +++N+ ++ K +L+
Sbjct: 427 ILVRLNLFTLVLAYTMNTDYLFYYFSPLVSWWYLIVYGTMFVGSRFNDSTPFLVTKWLLS 486
Query: 294 CFLVVILI---WEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGM 350
LV I + W + G F +F L + G A+ EW FR LD +I +GM
Sbjct: 487 MALVTIAMRATWLLEGAF-VF---LERVCGIHWSAR-------EWAFRVNLDIWIVYVGM 535
Query: 351 IYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECI-YKLDKVTYNKYHP 409
A K E P+ +++K T L +W+ ++K YN YHP
Sbjct: 536 FSAL---AVIKSREYRITDHPRWPMAVKVAAATSVL--ALIWFFAFELSMNKFDYNLYHP 590
Query: 410 YTSWIPI 416
SWIP+
Sbjct: 591 LASWIPV 597
>gi|392598149|gb|EIW87471.1| hypothetical protein CONPUDRAFT_79031 [Coniophora puteana
RWD-64-598 SS2]
Length = 854
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 167/345 (48%), Gaps = 49/345 (14%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
F ++ DRT+ K Y+ F L ++V + +LK+ + K +
Sbjct: 370 FSGAFAMIFVADRTSYWNKEQKYYDTWSFAALNACALLV-GLATLKRTD-------KDLG 421
Query: 170 YLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKD 226
+LNR QT+EWKGWMQ+ L+YHY A++I YN IR+ +AAY++MTG+G+ ++Y + D
Sbjct: 422 FLNRDQTDEWKGWMQIAILIYHYLGASKISGIYNPIRVLVAAYLFMTGYGHTTFYAKKAD 481
Query: 227 FSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSV 286
F R Q++ RLN +N DY+ YY P+ +++ +++Y + I +++N+ +
Sbjct: 482 FGFSRITQVLVRLNILTLILAYTMNTDYLSYYFAPLVSMWYLIIYATMAIGSQFNDRMPL 541
Query: 287 MIVKILA--CFLVVILIWEIPG---------VFDIFWSPLTFILGYTDPAKPDLPRLHEW 335
++ KI+ C + E P VFDI WS EW
Sbjct: 542 LLGKIVTSMCLFTAFMKSEWPLQLLFGFLEYVFDIRWS------------------AREW 583
Query: 336 HFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYEC 395
FR LD+YI GM+ A K + P L++K + L LW+
Sbjct: 584 TFRVTLDQYIVYFGMLAAL---AVIKVRDYRLTDHPLWPLTVKVSVGLSGL--AMLWF-F 637
Query: 396 IYKLD---KVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLY 437
+++LD K TYN +HPY S+IPIT + + L++ S Y
Sbjct: 638 VFELDQESKFTYNGWHPYVSFIPITAFVILRNASPLLRSCSSRAY 682
>gi|409083167|gb|EKM83524.1| hypothetical protein AGABI1DRAFT_66224 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 807
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 155/309 (50%), Gaps = 23/309 (7%)
Query: 111 GAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQY 170
GA+L Y + DRT L K ++ F FL L+ VI T K D + +
Sbjct: 312 GALLIY--VADRTGLWLKEQKQFDFWAFTFLSLVSVIFGVSTIRKGDKD--------MGF 361
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDF 227
LNR QT+EWKGWMQ+ L+YHYF A++I YN IR +A+Y++M+G+G+ ++Y + DF
Sbjct: 362 LNRDQTDEWKGWMQLAILIYHYFGASKISGIYNPIRTLVASYLFMSGYGHTTFYLRKADF 421
Query: 228 SLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVM 287
S R AQ+M RLN ++ DYM YY P+ +++ +++Y + ++YNE + +
Sbjct: 422 SFTRVAQIMVRLNLITVLLAYTMDTDYMFYYFAPLVSMWFLIIYATLFAGSRYNERTAFL 481
Query: 288 IVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWI 347
+ KI V +E ++ + L + AK EW FR LD +I
Sbjct: 482 LGKIFLSASFVTWFFEAKWPLELLFGLLKDVFNIQWSAK-------EWTFRVTLDLWIVY 534
Query: 348 IGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKY 407
+GM+ A+ K E P ++K V A + + + + + K TYN +
Sbjct: 535 VGMLTAF---AVIKIREYQLTDHPMWPATVKVVTVVSAFIILWFFAFELMQESKFTYNAW 591
Query: 408 HPYTSWIPI 416
HPY S P+
Sbjct: 592 HPYISPFPV 600
>gi|336367046|gb|EGN95391.1| hypothetical protein SERLA73DRAFT_76503 [Serpula lacrymans var.
lacrymans S7.3]
gi|336379771|gb|EGO20925.1| hypothetical protein SERLADRAFT_441316 [Serpula lacrymans var.
lacrymans S7.9]
Length = 765
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 163/319 (51%), Gaps = 37/319 (11%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
F A + +I DRT K + D+++F +L L ++ KH+DK P
Sbjct: 302 FSAAIGVIFIADRTGFWLKEQKEF--DVWIFTFLSLASLAVGLVKVKHSDKDP------G 353
Query: 170 YLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKD 226
+L+R QTEEWKGWMQ+ L+YHY A++I YN IR+ +A+Y++MTG+G+ ++Y + D
Sbjct: 354 FLSREQTEEWKGWMQISILIYHYLGASKIPAIYNPIRVLVASYLFMTGYGHTTFYIKKAD 413
Query: 227 FSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSV 286
F L R AQ+M RLN +N DY+ YY P+ +++ ++YG + + +++N
Sbjct: 414 FGLLRIAQVMIRLNILTLTLAYTMNTDYLSYYFSPLVSMWFFIIYGTMALGSRFNNYTPF 473
Query: 287 MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIW 346
++ KI+A +V ++ + ++ + L I A+ EW FR LD++I
Sbjct: 474 VVFKIIASMGIVTWFFKETWLLNVLFDFLERIFAIHWSAR-------EWTFRVTLDQFIV 526
Query: 347 IIGMIYAYYHPTAEKWMEKLEESE--------PKRKLSIKAGIVTVALFVGYLWYECIYK 398
GM+ A + K+ E P K++ + + F+ + + +
Sbjct: 527 YFGMLAALA-------VIKIREHRLTDHVYWPPLVKIANGLSAIILLCFLSF----ALTE 575
Query: 399 LDKVTYNKYHPYTSWIPIT 417
+KV YN +HPY S +PI
Sbjct: 576 KNKVAYNAWHPYISILPIA 594
>gi|426201782|gb|EKV51705.1| hypothetical protein AGABI2DRAFT_198117 [Agaricus bisporus var.
bisporus H97]
Length = 807
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 155/309 (50%), Gaps = 23/309 (7%)
Query: 111 GAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQY 170
GA+L Y + DRT L K ++ F FL L+ VI T K D + +
Sbjct: 312 GALLIY--VADRTGLWLKEQKQFDFWAFTFLSLVSVIFGVSTIRKGDKD--------MGF 361
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDF 227
LNR QT+EWKGWMQ+ L+YH+F A++I YN IR +A+Y++M+G+G+ ++Y + DF
Sbjct: 362 LNRDQTDEWKGWMQLAILIYHFFGASKISGIYNPIRTLVASYLFMSGYGHTTFYLRKADF 421
Query: 228 SLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVM 287
S R AQ+M RLN ++ DYM YY P+ +++ +++Y + ++YNE + +
Sbjct: 422 SFTRVAQIMVRLNLITVLLAYTMDTDYMFYYFAPLVSMWYLIIYATLFAGSRYNERTAFL 481
Query: 288 IVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWI 347
+ KI V +E ++ + L + AK EW FR LD +I
Sbjct: 482 LGKIFLSASFVTWFFEAKWPLELLFGLLKDVFNIQWSAK-------EWTFRVTLDLWIVY 534
Query: 348 IGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKY 407
+GM+ A+ K E P ++K V A + + + + + K TYN +
Sbjct: 535 VGMLTAF---AVIKIREYQLTDHPMWPATVKVVTVVSAFIILWFFAFELMQESKFTYNAW 591
Query: 408 HPYTSWIPI 416
HPY S P+
Sbjct: 592 HPYISPFPV 600
>gi|432882477|ref|XP_004074050.1| PREDICTED: CAS1 domain-containing protein 1-like [Oryzias latipes]
Length = 797
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 106/350 (30%), Positives = 176/350 (50%), Gaps = 45/350 (12%)
Query: 95 AFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLF---LFLYLLLVIVSAM 151
A L +A +A+ G I+ YFY+CDR ++ K Y F LF +L I +
Sbjct: 358 AVPLGQKAPFQALCRMGIIMGYFYLCDRADVFMKEHKFYTHSAFFIPLFYIFVLGIFYSE 417
Query: 152 TSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAA 208
S K + LNR QT+EWKGWMQ++ L+YH A+ +Y +R+ +AA
Sbjct: 418 NS------------KETKLLNREQTDEWKGWMQLVILIYHISGASAFIPVYMHVRVLVAA 465
Query: 209 YVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTI 268
Y++ TG+G+FS+++++ DF L R Q+++RLNF V C+V++ Y YY P+ T + +
Sbjct: 466 YLFQTGYGHFSFFWLKGDFGLNRVCQVLFRLNFLVLVLCVVMDRPYQFYYFVPLVTFWFV 525
Query: 269 MVYGAVGIFNK-------YNEIGSVMIVKILACFLVVILIWEIP-----GVFDIFWSPLT 316
++YG + ++ + N + + ++ L LV I ++ VF ++
Sbjct: 526 IIYGTMAVWPQILQKKANSNRMWHLAVLAKLLGLLVFICLFSFSQEFFESVFSVWPVSKL 585
Query: 317 FILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLS 376
F L + +HEW FR LDR++ I GM++A+ + +K + L ES+ + L
Sbjct: 586 FELDGS---------VHEWWFRWKLDRFVVIHGMLFAFIYLLLQK-CQVLSESKGEPLLP 635
Query: 377 IKAGIVTVAL----FVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
K + ++L FV Y + K K N+ HPY S + I + I
Sbjct: 636 PKISNLLLSLSIFSFVTYSIWASNCK-TKTECNEMHPYISVVQILAFILI 684
>gi|403417488|emb|CCM04188.1| predicted protein [Fibroporia radiculosa]
Length = 1430
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 164/328 (50%), Gaps = 28/328 (8%)
Query: 118 YICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTE 177
YI DRT K ++ +F FL +L +IV +T + ND + +LNR QT+
Sbjct: 946 YISDRTGFWLKEQKQFSPWIFTFLSVLSLIVGLLTVRQADND--------LGFLNREQTD 997
Query: 178 EWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQ 234
EWKGWMQ+ L+YHY A++I YN IR+ +AAY++MTG+G+ +YY + DF R AQ
Sbjct: 998 EWKGWMQIAILIYHYTGASKISGIYNVIRVLVAAYLYMTGYGHVTYYVKKADFGFTRVAQ 1057
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
++ RLN +N DY+ YY P+ + + +++YG + I ++YN+ ++ KIL
Sbjct: 1058 IIVRLNLLTLLLAYTMNTDYLSYYFAPLVSWWFLIIYGTMVIGSQYNDRTVFLVCKILFS 1117
Query: 295 FLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY 354
+V + + ++ L G A+ EW FR LD +I GM A
Sbjct: 1118 MGLVTWFMSESWLLENIFTFLERACGIHSSAR-------EWAFRVNLDLWIVYFGMFTAI 1170
Query: 355 YHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWI 414
K E P+ L +K + + + + +Y+ DK YN +HPY +++
Sbjct: 1171 ---AVMKIREHRLTDHPQWPLVVKGAAGASGIVLLWFFAFELYQPDKFAYNLWHPYIAFL 1227
Query: 415 PITYVLFIFYFFSLVKHLSGSLYMMACR 442
P+ F ++++ +G L + R
Sbjct: 1228 PVGA-------FVILRNANGILRSASSR 1248
>gi|302697721|ref|XP_003038539.1| hypothetical protein SCHCODRAFT_42660 [Schizophyllum commune H4-8]
gi|300112236|gb|EFJ03637.1| hypothetical protein SCHCODRAFT_42660, partial [Schizophyllum
commune H4-8]
Length = 705
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 155/308 (50%), Gaps = 33/308 (10%)
Query: 118 YICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTE 177
++ DRT+ K Y F FL LL + V + ++K+ + K + +LNR QT+
Sbjct: 334 FLADRTHFYLKEQKQYEPWSFAFLNLLTLGV-GLATVKRGD-------KDMGFLNRDQTD 385
Query: 178 EWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQ 234
EWKGWMQ+ L+YHY A++I YN IR+ +A+Y++MTG+G+ ++Y + DF R AQ
Sbjct: 386 EWKGWMQIAILIYHYTGASKISGIYNPIRVLVASYLFMTGYGHATFYIKKADFGFLRIAQ 445
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
++ R+N F ++N +Y++YY P+ + + +++Y + I +YN+ ++ K +
Sbjct: 446 VLVRINLFTCILAYIMNTNYIVYYFSPLVSFWFLIIYATMAIGARYNDRAPLLAGKFVTS 505
Query: 295 FLVVILI----WEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGM 350
+V ++ W + +FD+ L + AK EW FR LD +I GM
Sbjct: 506 AAIVTVVMKAGWPMQSLFDL----LEMLCNIHWSAK-------EWSFRVTLDLWIVYAGM 554
Query: 351 --IYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYH 408
A+ + E R G+V V FV L E K TYN +H
Sbjct: 555 FAALAFIKIKDHRLTEHPYWPHAHRAAVSLGGMVLVWFFVFELGQE-----SKFTYNAWH 609
Query: 409 PYTSWIPI 416
PY SW P+
Sbjct: 610 PYVSWAPV 617
>gi|405965981|gb|EKC31313.1| CAS1 domain-containing protein 1 [Crassostrea gigas]
Length = 804
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/332 (30%), Positives = 174/332 (52%), Gaps = 35/332 (10%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
+A+ G I+ Y ++CDRTN K Y FL + L+I+ ++ D++
Sbjct: 379 CLAKLGLIMAYTFLCDRTNFFMKENKYYTHVNFLLPFAYLMILGFF--FTENTDQT---- 432
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ L+R QT+EWKGWMQ++ ++YH A++ IY IR+ ++AY+++TG+G+FSY++
Sbjct: 433 ---KVLHRDQTDEWKGWMQLVIMIYHLTGASKVLPIYMHIRVLVSAYLFLTGYGHFSYFW 489
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNE 282
+ DFS+ R+ Q+M+RLNF V C V+N Y YY P+ + + +VY + I+ + ++
Sbjct: 490 NKSDFSVQRYCQVMFRLNFLVVVLCFVMNRPYQFYYFVPLVSFWFTVVYLTMAIWPRISD 549
Query: 283 IGS--------VMIVKILACFLVVILIWEIPGVFD--IFWSPLTFILGYTDPAKPDLPRL 332
MI+K + V+ L + +F+ P+ + +D + L
Sbjct: 550 ASCEASNLHYLYMIIKFIILTTVITLFFMSEVLFEKVFLMRPIKALFVRSDDS------L 603
Query: 333 HEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGY 390
HEW FR LDRY + GM++A+ YH +K + + ++ K S + +T+ F+G
Sbjct: 604 HEWRFRWTLDRYSVVYGMVFAFGYHLLLKKGI--IADNHTKSLFSTGVSLSLTILSFIGL 661
Query: 391 LWYECIYKL--DKVTYNKYHPYTSWIP-ITYV 419
Y L +K N H Y +P I+YV
Sbjct: 662 GSYAIFSFLCKNKTECNDTHSYLVALPLISYV 693
>gi|443703564|gb|ELU01043.1| hypothetical protein CAPTEDRAFT_167181 [Capitella teleta]
Length = 789
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 168/334 (50%), Gaps = 33/334 (9%)
Query: 100 NRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHND 159
+ L A+A+ G I+ YF++ DR N + K+Y+ F ++ V + S K
Sbjct: 357 KKDALVALAKLGLIMTYFFLADRNNYFMKTNKHYSHLHFFLAFIYFVFLGLFFSEKS--- 413
Query: 160 KSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFG 216
K + L+R QT+EWKGWMQ++ L+YHY A++ IY +R+ ++ Y++ +GFG
Sbjct: 414 ------KQTKVLHRDQTDEWKGWMQIVILIYHYTGASQVLPIYMQVRVLVSTYLFTSGFG 467
Query: 217 NFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
+F +++ + D+S+ RF +M+R+NF V C V+N Y YY P+ + + +++Y + +
Sbjct: 468 HFCFFWNKADYSIHRFCMVMFRMNFLVIVLCFVMNRPYQFYYFVPLVSFWFLVIYSTMAM 527
Query: 277 FNK--------YNEIGSV-MIVKILACFLVVILIWEIPGVFD--IFWSPLTFILGYTDPA 325
+ + Y+ G + M++K ++ L + +F+ P+ + D +
Sbjct: 528 WPRVSAATQGAYSSGGVLYMLLKFFLLVAIITLFYASEVLFEQVFLTQPIQSLFVSADSS 587
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEK---WMEKLEESEPKRKLSIKAGIV 382
+HEW FR LDR+ I GM++ + +K W + + ++ G V
Sbjct: 588 ------IHEWRFRWQLDRFSTIYGMVFGLFFILGQKVKLWDDSGDAGLFSVPVNAAVGFV 641
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPI 416
+ F+GY + + K + N H Y ++IPI
Sbjct: 642 ALIGFIGYSVFASTCE-SKPSCNHIHSYIAFIPI 674
>gi|400599365|gb|EJP67062.1| CAS1 domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 818
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 159/317 (50%), Gaps = 25/317 (7%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK----HNDKSPFSG 165
F L ++ DRT + +K + L LL+ V+ + S+++ +D
Sbjct: 370 FVTALLACFLADRTQVFAKGSKQFVGSELAAL-LLVTAVAGIVSIRRMKPLRSDSLTRPD 428
Query: 166 KTIQ---YLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFS 219
+ +Q L+R QT+EWKGWMQ L YH+ A++ +Y IR+ +AAY++ G+G+
Sbjct: 429 EKLQDAAPLSRDQTDEWKGWMQAAILAYHWTGASKSLPVYIFIRLLVAAYLFQMGYGHTI 488
Query: 220 YYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK 279
Y+ ++KDFSL R A +M RLN V+N +YMLYY P+ + + ++VY + I +
Sbjct: 489 YFLVKKDFSLKRVAAVMLRLNLLSCALPYVMNTNYMLYYFAPLASFWFVIVYLTLAIKSD 548
Query: 280 YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRS 339
YN+ + +++K+ V ++ V + ++ L + + D LHEW FR
Sbjct: 549 YNDTTTALLIKLAISASAVASVFLFTPVSEWIFTALRYAF------RIDW-DLHEWQFRV 601
Query: 340 GLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKL 399
LD I +GM+ + W L +S+ G++ +A+ GY +
Sbjct: 602 SLDSLIVYVGMLAGMASAKGKAWDRLLVQSK-------LPGLIGLAVLAGYCYLSSQIFQ 654
Query: 400 DKVTYNKYHPYTSWIPI 416
K YNK+HPY S++PI
Sbjct: 655 VKQDYNKWHPYVSFVPI 671
>gi|297288869|ref|XP_001097112.2| PREDICTED: CAS1 domain-containing protein 1 [Macaca mulatta]
Length = 877
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/334 (29%), Positives = 176/334 (52%), Gaps = 32/334 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 449 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 501
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 502 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 559
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 560 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 619
Query: 281 NEIGS------VMIVKILACFLVVILIWEIP-GVFDIFWS--PLTFILGYTDPAKPDLPR 331
+ + +++++IL C +++ + G F+ +S PL+
Sbjct: 620 IQKKANGKYTFLLMLQILTCTILIYFLDSFSQGAFEKIFSLWPLSKCFELKG-------N 672
Query: 332 LHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIVTVALFV 388
++EW FR LDRY+ GM++A+ + +K E + EP K+S ++V F+
Sbjct: 673 VYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFISVVSFL 732
Query: 389 GYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
Y + K +K N+ HP S + I + I
Sbjct: 733 TYSIWASSCK-NKAECNELHPSVSVVQILAFILI 765
>gi|321265714|ref|XP_003197573.1| O-acetyltransferase [Cryptococcus gattii WM276]
gi|317464053|gb|ADV25786.1| O-acetyltransferase [Cryptococcus gattii WM276]
Length = 960
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 160/321 (49%), Gaps = 23/321 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ FG + Y ++ DRT++ K+Y+ +F + L + +T SG
Sbjct: 434 ALSTFGLAMGYLFLADRTHVFQKEQKDYDAIIFGTITLAAFVAGLLTVRN--------SG 485
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYY 222
K + +LNR T+EWKGWMQ+ L+YH+F A++I YN IR+ +A+Y++MTG ++YY
Sbjct: 486 KDLGFLNRDITDEWKGWMQIAILIYHFFGASKISGIYNPIRVMVASYLFMTGCEYIAFYY 545
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNE 282
+ DFS R ++ RLN +N DY YY P+ + + +++Y + I ++YN+
Sbjct: 546 KKADFSFQRVIMVLVRLNLLSVVLPYTMNTDYAFYYFAPLVSWWYLIIYATMAIGSRYND 605
Query: 283 IGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLD 342
+ ++ K+ C +V L P + + L + AK EW FR LD
Sbjct: 606 RPAFLLPKLFICAGLVTLFMHFPWLMADVFKVLNTVFNIQWSAK-------EWSFRVTLD 658
Query: 343 RYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVAL-FVGYLWYECIYKLDK 401
+I GM+ AY K+ E P + + +V L + YLW+E ++ K
Sbjct: 659 LFIVWAGMLCAYGF---VKFKEHQISDRPWFPVMRTSTLVGSVLGMIWYLWFE-LHLPSK 714
Query: 402 VTYNKYHPYTSWIPITYVLFI 422
YN+YH +PI +F+
Sbjct: 715 FVYNEYHAVVCVVPIMSFVFL 735
>gi|198430770|ref|XP_002127948.1| PREDICTED: similar to CAS1 domain-containing protein 1 [Ciona
intestinalis]
Length = 803
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 102/344 (29%), Positives = 174/344 (50%), Gaps = 41/344 (11%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A A+ ILFYFYICDRT++ S K+Y F FL LL ++ + + S
Sbjct: 356 AAAKMSLILFYFYICDRTDVFMKSNKHYTNTRF-FLPLLYIVFLGIFGID--------ST 406
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSYYY 222
K +LNR QT+EWKGWMQ++ L+YH A+ IY +R+ +A Y++MTG+G+FSY++
Sbjct: 407 KQPVFLNRDQTDEWKGWMQLVILIYHVTGASVNVPIYMHVRLLVAMYLFMTGYGHFSYFW 466
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIF----- 277
+ DF + R +M+RLNF C+ ++ Y YY PM + + +++Y + ++
Sbjct: 467 NKGDFGVHRVFGVMFRLNFLTVMLCLTMDRSYQFYYFVPMCSFWFLVLYLFMALWPRAYM 526
Query: 278 ---NKYNEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY 321
NK + VM K+ ++L++ +F+ +S I +
Sbjct: 527 VATNKETTLSDGETKEVTFTSPIFVMCAKLSLLLFTIVLVFLSQELFESMFSWWPVIRLF 586
Query: 322 TDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI 381
P P + R EW FR LDR+ + G ++A+ + ++ + +++S LS K +
Sbjct: 587 ELP--PGMVR--EWWFRCHLDRFAMLHGAVFAFGYIILKR-LSIVDDSRQGCLLSTKVSL 641
Query: 382 VTVALFVGYLWYECIYKL---DKVTYNKYHPYTSWIPITYVLFI 422
+ V + + ++ L DK + N+ H + S IPI+ + I
Sbjct: 642 IAVTASILCTLFYSVWALQCSDKQSCNEVHSFASLIPISAFILI 685
>gi|449551006|gb|EMD41970.1| hypothetical protein CERSUDRAFT_41933 [Ceriporiopsis subvermispora
B]
Length = 811
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 159/303 (52%), Gaps = 23/303 (7%)
Query: 118 YICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTE 177
Y+ DRT K ++ F FL + +++ +T K DK + +LNR QT+
Sbjct: 318 YVADRTGFWLKEQKQFDSWTFGFLSIFTLVLGLLT--MKGGDKD------LGFLNREQTD 369
Query: 178 EWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQ 234
EWKGWMQ+ L+YHY A++I YN IR+ +A+Y++MTG+G+ ++Y + DF R AQ
Sbjct: 370 EWKGWMQLAILIYHYTGASKISGIYNPIRVLVASYLFMTGYGHTTFYIKKADFGFQRVAQ 429
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
+M RLN V+N DY+ YY P+ +++ I++Y + KYN+ ++ KIL
Sbjct: 430 VMVRLNLLTLLLAYVMNTDYISYYFAPLVSMWYIIIYLTMLAGAKYNDRTIFLVGKILVS 489
Query: 295 FLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY 354
++ P + ++ + L + G A+ EW FR LD +I GM A
Sbjct: 490 MALITWFMSEPILLEMAFEFLDRVCGIHWSAR-------EWAFRVNLDLWIVYFGMFAAL 542
Query: 355 YHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYECIYKLDKVTYNKYHPYTSW 413
K E P+ L++KA + + A+ + Y +E +Y+ DK YN +HPY S+
Sbjct: 543 ---AVIKIREYRLMDHPQWPLAVKAAVGASAAVMLWYFAFE-LYQPDKFVYNLWHPYVSF 598
Query: 414 IPI 416
+P+
Sbjct: 599 LPV 601
>gi|410911936|ref|XP_003969446.1| PREDICTED: CAS1 domain-containing protein 1-like [Takifugu
rubripes]
Length = 794
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 171/335 (51%), Gaps = 32/335 (9%)
Query: 101 RATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDK 160
+A +A+ G I+ YFY+CDR ++ K Y+ F + ++I+ S
Sbjct: 363 KAAFQALCRMGVIMAYFYLCDRADVFMKEQKFYSHSTFFIPLIYILILGFFYSE------ 416
Query: 161 SPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGN 217
+ K + LNR QT+EWKGWMQ++ L+YH A+ +Y +R+ +AAY++ TG+G+
Sbjct: 417 ---NSKETKLLNREQTDEWKGWMQLVILIYHISGASAFLPVYMHVRVLVAAYLFQTGYGH 473
Query: 218 FSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAV--- 274
FS+++++ DFSL R Q+++RLNF V C+V++ Y YY P+ T + ++YG +
Sbjct: 474 FSFFWLKGDFSLYRVCQVLFRLNFLVLVLCVVMDRPYQFYYFVPLVTFWFFIIYGTLVVW 533
Query: 275 -GIFNKYNEIGSV----MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDL 329
I K G V ++VK L L + F+ +S F +
Sbjct: 534 PQILQKKANSGGVWYMGVLVKFLGLLLFICFFAFSQSFFESIFSAWPFSKLFERNGS--- 590
Query: 330 PRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIK----AGIVTVA 385
+ EW FR LDR+ I GM++A+ + +K + L ES+ + S + +++V
Sbjct: 591 --VREWWFRWKLDRFAVIYGMLFAFIYLVLQK-RQVLSESKGEALFSTRISSLLLLLSVV 647
Query: 386 LFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVL 420
+ Y + K +K N+ HPY S I +++VL
Sbjct: 648 SVITYSIWASSCK-NKTECNEMHPYISVI-LSFVL 680
>gi|327274796|ref|XP_003222162.1| PREDICTED: CAS1 domain-containing protein 1-like [Anolis
carolinensis]
Length = 798
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 175/335 (52%), Gaps = 33/335 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L+ + + G I+ YFY+CDR NL K Y F F+ ++ ++V + +
Sbjct: 370 LQCLCKLGFIMSYFYLCDRANLFMKENKFYTHSSF-FIPIIYILVLGIFYAE-------- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 NTKETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 480
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ +++ +++Y + ++ +
Sbjct: 481 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRSYQSYYFVPLVSVWFMIIYITLALWPQI 540
Query: 281 NEIGS--------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPAKPDLP 330
+ + +++K + F+ + + G F+ +S PL+
Sbjct: 541 VQKKANGNCFWHFGLLLKFVFLFICIYFLAYSQGTFEKIFSIWPLSKCFELNG------- 593
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIVTVALF 387
++EW FR LDRY+ GM++A+ + T +K E + EP K+S ++V F
Sbjct: 594 SIYEWWFRWKLDRYVVFHGMLFAFIYLTFQKRQALTEGKGEPLFPNKVSNTLLFISVVSF 653
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+ Y + K +K N+ HP S + I + I
Sbjct: 654 LTYSIWASSCK-NKSECNEMHPSVSVVQILAFILI 687
>gi|406858717|gb|EKD11813.1| Cas1p-like protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 872
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 108/338 (31%), Positives = 172/338 (50%), Gaps = 26/338 (7%)
Query: 95 AFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSL 154
A L L A+A FG +L Y + DRT + K + FL + L+VI++ + S+
Sbjct: 381 ARFLPQPGVLGALAVFGLVLCYCFYADRTQIFDKGHKLFQHSQFL-IACLVVIIAGVLSI 439
Query: 155 KKHNDKSPFSGKTIQ---YLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAA 208
+K ++ P G IQ +L+R QT+EWKGWMQ + L+YHY A++ IY IR+ IAA
Sbjct: 440 RK--NREPAGGTKIQDLDFLSRDQTDEWKGWMQFIVLIYHYTHASQHLGIYQFIRLLIAA 497
Query: 209 YVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTI 268
Y++MTGFG+ ++ + D+S R A ++ RLN ++ DY+ YY P+ + + +
Sbjct: 498 YLFMTGFGHTVFFLKQVDYSFHRVAAVLVRLNLLSCVLPYMMRTDYLFYYFAPLVSFWFL 557
Query: 269 MVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPD 328
+VY + I N + ++ KIL + + +IPGV + + +L YT +
Sbjct: 558 VVYFTLKIACHKNSNFNFLLGKILVSATLTTALTKIPGVLEF----VATVLNYTCAISWN 613
Query: 329 LPRLHEWHFRSGLDRYIWIIGM----IYAYYHPTAEKWMEKLEESEPKRKLSI-----KA 379
+ EW FR+ LD YI +GM +Y Y + ++ +L+I KA
Sbjct: 614 VT---EWRFRTFLDMYIVYVGMLIAALYLRYSRIQSGAVTPNSITDYLIQLTITYRFFKA 670
Query: 380 GIVTVALFV-GYLWYECIYKLDKVTYNKYHPYTSWIPI 416
++ VAL + LW K YN + P S+IPI
Sbjct: 671 LLIAVALGIPPGLWVLLRKSHTKEDYNWWQPGISFIPI 708
>gi|354469202|ref|XP_003497019.1| PREDICTED: CAS1 domain-containing protein 1-like [Cricetulus
griseus]
Length = 763
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 171/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 334 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 386
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 387 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 444
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V C+V++ Y YY P+ T++ +++Y + ++ +
Sbjct: 445 FWIKGDFGIYRVCQVLFRLNFLVVVLCLVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 504
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 505 IQKKANGNCLWHLGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELNG-- 557
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + ++W E + EP K+S +
Sbjct: 558 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQRWQILSEGKGEPLFSTKISTFLLFI 612
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 613 SVVSFLTYSIWASSCK-NKAQCNELHPSVSVVQIVAFILI 651
>gi|348578551|ref|XP_003475046.1| PREDICTED: CAS1 domain-containing protein 1-like [Cavia porcellus]
Length = 923
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 171/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+
Sbjct: 494 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNE---- 544
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
S K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 545 STKETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 604
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 605 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 664
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 665 IQKKANGNCFWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELQG-- 717
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + T +K E + EP K+S V
Sbjct: 718 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLTLQKHQVLSEGKGEPLFSNKISNFLLFV 772
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 773 SVVSFLTYSIWASSCK-NKSECNELHPSVSVVQILAFILI 811
>gi|302405815|ref|XP_003000744.1| CAS1 domain-containing protein [Verticillium albo-atrum VaMs.102]
gi|261360701|gb|EEY23129.1| CAS1 domain-containing protein [Verticillium albo-atrum VaMs.102]
Length = 840
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 108/340 (31%), Positives = 166/340 (48%), Gaps = 40/340 (11%)
Query: 97 LLENRATLRA-----MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAM 151
+L+ RA RA F L Y DRT L ++K + D F L + V+V
Sbjct: 366 ILKARACRRAPFDMETGLFVTALLACYHADRTQFLAKASKMFVYDEFTVLAAICVLVFLF 425
Query: 152 T-----SLKKHNDKSPFSGKTIQYL-NRHQTEEWKGWMQVLFLMYHYFAATE---IYNAI 202
T + + + +P + L R QTEEWKGWMQ L+YH+ A++ IY I
Sbjct: 426 TIRRSRAAPQPDTIAPQKAPEVNVLLPRDQTEEWKGWMQAAILVYHWTGASKDLGIYIFI 485
Query: 203 RIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPM 262
R+ +A+Y++ TG+G+ ++ +KDF R A M RLN V+ DYM YY P+
Sbjct: 486 RLLVASYLFQTGYGHTIFFLKKKDFGFKRIAATMLRLNLLSCALPYVMGTDYMFYYFAPL 545
Query: 263 HTLFTIMVYGAVGIFNKYNE-IGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY 321
+ + ++VY + I ++N+ +G+++ +++ F+ +++ + P V + ++ L +
Sbjct: 546 VSFWFMVVYSTLAIGRQHNDNLGALLGKIVISAFITYLVMMKSP-VPEWSFTALRLLCNI 604
Query: 322 TDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH---PTAEKWMEKLEESEPKRKLSIK 378
LHEW FR GLD +I +GM+ A H P A W LS
Sbjct: 605 K-------WNLHEWTFRVGLDAFIVFVGMLTAIAHVRYPNACAW-----------ALSSY 646
Query: 379 AGIVT--VALFVGYLWYECIYKLDKVTYNKYHPYTSWIPI 416
AG V VA++ GY Y K YN +HPY SWIPI
Sbjct: 647 AGAVGGLVAMY-GYYHACGEYFPTKEIYNSWHPYISWIPI 685
>gi|338724280|ref|XP_001494057.3| PREDICTED: CAS1 domain-containing protein 1 [Equus caballus]
Length = 774
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 345 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 397
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y +R+ +AAY++ TG+G+FSY
Sbjct: 398 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHVRVLVAAYLFQTGYGHFSY 455
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 456 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDQPYQFYYFVPLVTVWFMVIYVTLALWPQI 515
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 516 IQKKANGNCFWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 568
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRYI GM++A+ + +K E + EP K+S +
Sbjct: 569 -----NVYEWWFRWRLDRYIVFHGMLFAFIYLALQKRQVLSEGKGEPLFSNKISNFLLFI 623
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 624 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 662
>gi|410952330|ref|XP_003982834.1| PREDICTED: LOW QUALITY PROTEIN: CAS1 domain-containing protein 1
[Felis catus]
Length = 757
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 170/341 (49%), Gaps = 45/341 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 328 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 380
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 381 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 438
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 439 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 498
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 499 IQKKANGNCFWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 551
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----I 381
++EW FR LDRY+ GM++A+ + +K + L E + + S K
Sbjct: 552 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQVLSEGKGEPXFSNKISNFLLF 605
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
++V F+ Y + K +K N+ HP S + I + I
Sbjct: 606 ISVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 645
>gi|389751529|gb|EIM92602.1| Cas1p-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 888
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/322 (29%), Positives = 163/322 (50%), Gaps = 24/322 (7%)
Query: 100 NRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHND 159
+ L + GA+ Y+ DR+ K ++ F FL L + + + ++K+ +
Sbjct: 368 DEGQLPPLVMSGAVAV-IYLADRSWFWLKEQKQFDSWAFGFLNFLFLAI-GLATVKRAD- 424
Query: 160 KSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFG 216
K + +LNR QT+EWKGWMQ+ L+YHY+ A++ IYN IR+ +A+Y++MTG+G
Sbjct: 425 ------KDLGFLNREQTDEWKGWMQIAILIYHYYGASKVSGIYNPIRVLVASYLFMTGYG 478
Query: 217 NFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
+ ++Y + DF R AQ+M RLN +N DY+ YY P+ +++ +++Y + I
Sbjct: 479 HTTFYVKKADFGFLRVAQIMVRLNLLTLLLAYTMNTDYISYYFAPLVSMWYLIIYATMAI 538
Query: 277 FNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWH 336
+++N+ + ++ K+ A +V P + + ++ L AK EW
Sbjct: 539 GSQFNDRTAFLVFKLFASMALVTWFMSEPWLLETIFAFLANFCAIHWSAK-------EWA 591
Query: 337 FRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVAL-FVGYLWYEC 395
FR LD +I GM A T K E P ++ I T AL + Y +E
Sbjct: 592 FRVNLDLWIVYFGMFSAL---TFIKIREHRLTEHPHWHHAVNVAIGTSALVMIWYFGFE- 647
Query: 396 IYKLDKVTYNKYHPYTSWIPIT 417
+ + DK YN +HPY S++P+
Sbjct: 648 LAQPDKFVYNTWHPYVSFLPVA 669
>gi|390604747|gb|EIN14138.1| Cas1p-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 853
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 95/307 (30%), Positives = 161/307 (52%), Gaps = 31/307 (10%)
Query: 118 YICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTE 177
+I DRT K+++ +F L +++ + ++K+ + K + +LNR QT+
Sbjct: 378 FIADRTGFWLKEHKHFDPWIFSIL-AFSALLAGLITIKRAD-------KDLGFLNRKQTD 429
Query: 178 EWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQ 234
EWKGWMQ+ L+YHYF A++I YN IR+ +A+Y++MTG+G+ ++Y + DF L R AQ
Sbjct: 430 EWKGWMQIAILIYHYFGASKIPGIYNPIRVLVASYLFMTGYGHTTFYVKKADFGLRRVAQ 489
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
++ RLN + V+N DY+ YY P+ + + +++Y + + + N+ +I KIL
Sbjct: 490 VLIRLNLYTLLLAYVMNTDYISYYFAPLVSSWYLIIYVTMALGSHLNDRTGFLIAKILLS 549
Query: 295 FLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY 354
V P + + + L G A+ EW FR LD +I +GM+ A
Sbjct: 550 MAAVTWFMREPRLLGMIFEFLERFCGIRWSAR-------EWAFRFTLDLWIVYVGMLTAL 602
Query: 355 YHPTAEKWMEKLEESEPKRKLSIKAGIV--TVALFVGYLWYECIYKLDKVT---YNKYHP 409
+ K E ++K +V TVAL LW+ +++L + T YN +HP
Sbjct: 603 AY---IKIREHRLTDHAHWPAAVKTALVLSTVAL----LWF-FVFELSQPTKFAYNAWHP 654
Query: 410 YTSWIPI 416
Y S++P+
Sbjct: 655 YISFVPV 661
>gi|281342929|gb|EFB18513.1| hypothetical protein PANDA_006698 [Ailuropoda melanoleuca]
Length = 797
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQVLSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|311264507|ref|XP_003130198.1| PREDICTED: CAS1 domain-containing protein 1 [Sus scrofa]
Length = 797
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/339 (28%), Positives = 169/339 (49%), Gaps = 41/339 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKP 327
+ + + + + CFL G F+ +S F +
Sbjct: 539 IQKKANGNCFWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPFSKCFELKGN- 592
Query: 328 DLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----IVT 383
++EW FR LDRY+ GM++A+ + +K + L E + + S K ++
Sbjct: 593 ----VYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQILSEGKGEPLFSNKVSNFLLFIS 647
Query: 384 VALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
V F+ Y + K +K N+ HP S + I + I
Sbjct: 648 VVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|351695701|gb|EHA98619.1| CAS1 domain-containing protein 1, partial [Heterocephalus glaber]
Length = 753
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 171/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 324 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 376
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y +R+ +AAY++ TG+G+FSY
Sbjct: 377 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHVRVLVAAYLFQTGYGHFSY 434
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 435 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 494
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 495 IQKKANGNCFWHLGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 547
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + T +K E + EP K+S +
Sbjct: 548 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLTLQKRQVLSEGKGEPLFSNKISNFLLFI 602
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 603 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 641
>gi|301765710|ref|XP_002918283.1| PREDICTED: LOW QUALITY PROTEIN: CAS1 domain-containing protein
1-like [Ailuropoda melanoleuca]
Length = 884
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 455 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 507
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 508 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 565
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 566 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 625
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 626 IQKKANGNCFWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 678
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 679 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQVLSEGKGEPLFSNKISNFLLFI 733
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 734 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 772
>gi|426228297|ref|XP_004008249.1| PREDICTED: CAS1 domain-containing protein 1 [Ovis aries]
Length = 768
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 170/341 (49%), Gaps = 45/341 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 339 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 391
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 392 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 449
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++++ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 450 FWVKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 509
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 510 IQKKANGNCLWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 562
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----I 381
++EW FR LDRY+ GM++A+ + +K + L E + + S K
Sbjct: 563 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQILSEGKGEPLFSSKISNFLLF 616
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
++V F+ Y + K +K N+ HP S + I + I
Sbjct: 617 ISVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 656
>gi|358411722|ref|XP_611653.5| PREDICTED: CAS1 domain-containing protein 1 isoform 1 [Bos taurus]
gi|359064525|ref|XP_002686702.2| PREDICTED: CAS1 domain-containing protein 1 [Bos taurus]
Length = 797
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 170/341 (49%), Gaps = 45/341 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++++ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWVKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 TQKKANGNCLWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----I 381
++EW FR LDRY+ GM++A+ + +K + L E + + S K
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQILSEGKGEPLFSSKISNFLLF 645
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
++V F+ Y + K +K N+ HP S + I + I
Sbjct: 646 ISVVSFLTYSIWASSCK-NKAQCNELHPSVSVVQILAFILI 685
>gi|440893298|gb|ELR46122.1| CAS1 domain-containing protein 1, partial [Bos grunniens mutus]
Length = 753
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 170/341 (49%), Gaps = 45/341 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 324 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 376
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 377 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 434
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++++ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 435 FWVKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 494
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 495 TQKKANGNCLWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 547
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----I 381
++EW FR LDRY+ GM++A+ + +K + L E + + S K
Sbjct: 548 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQILSEGKGEPLFSSKISNFLLF 601
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
++V F+ Y + K +K N+ HP S + I + I
Sbjct: 602 ISVVSFLTYSIWASSCK-NKAQCNELHPSVSVVQILAFILI 641
>gi|293346634|ref|XP_001053100.2| PREDICTED: CAS1 domain-containing protein 1-like, partial [Rattus
norvegicus]
Length = 680
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 169/346 (48%), Gaps = 55/346 (15%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 251 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 303
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 304 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 361
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIF--- 277
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++
Sbjct: 362 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYITLALWPQV 421
Query: 278 --NKYNEIG--------SVMIVKILACFLV--------VILIWEIPGVFDIFWSPLTFIL 319
K N G + + + CFL + +W + F++ S
Sbjct: 422 TQKKANGNGFWYLGLLLKLAFLLLCICFLAYSQGAFEKIFSLWPLSKCFELEGS------ 475
Query: 320 GYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEP--KRKLS 376
++EW FR LDRY+ + G ++A+ Y + + + EP K+S
Sbjct: 476 ------------VYEWWFRWRLDRYVVLHGALFAFIYLALQRRQILSEGKGEPLFSNKIS 523
Query: 377 IKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
V+V F+ Y + K +K N+ HP S + I + I
Sbjct: 524 NFLLFVSVVSFLTYSIWASSCK-NKAECNELHPSVSVVQIVAFILI 568
>gi|402221011|gb|EJU01081.1| Cas1p-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 875
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 184/373 (49%), Gaps = 39/373 (10%)
Query: 75 ARLLSSSIKTNLIR-FMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNY 133
ARL S + L+R F DA ++ FG + Y+ DRT L K +
Sbjct: 344 ARLWISKRERPLLRSFFPSGDA--------QHQLSIFGLSVLLCYLADRTPLWDKEQKQF 395
Query: 134 NRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYF 193
D F LLL+ + A K +K +LNR QT+EWKGWMQ++ L+YHY+
Sbjct: 396 --DTVWFFGLLLLGLGAGVGTLKWGEKD------AGFLNRDQTDEWKGWMQIIILVYHYY 447
Query: 194 AATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVL 250
++I YN +R+ +AAY++MTG+G+F +YY + D+S R AQ++ RLN V+
Sbjct: 448 GGSQISGIYNPVRVLVAAYLFMTGYGHFHFYYRKADYSFTRIAQVLVRLNMLTVALAYVM 507
Query: 251 NNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDI 310
N DY+ YY P+ + + +++Y + + KYN+ ++ K+ A +V L V +
Sbjct: 508 NTDYLFYYFAPLVSFWFLVIYFTMLVGAKYNDRLLFLLPKMAASAALVALSMYNAWVNEA 567
Query: 311 FWSPLTFILGYTDPAKPDLP-RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEES 369
++ L + LP EW FR LD +I +G + A T + +L E
Sbjct: 568 LFALLRKVF--------RLPWNAREWTFRVSLDLWIVWLGALTALC--TLKFTQHRLAEH 617
Query: 370 EPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLV 429
A ++++A + W+E + + K+ YN YHPY S++PIT F L+
Sbjct: 618 TFFPIAKRFAAVLSLATLAWFFWFE-LTRETKLVYNAYHPYVSFLPITA-------FVLL 669
Query: 430 KHLSGSLYMMACR 442
++L+ SL R
Sbjct: 670 RNLTPSLRSSHSR 682
>gi|296488693|tpg|DAA30806.1| TPA: CAS1 domain containing 1-like [Bos taurus]
Length = 940
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 170/341 (49%), Gaps = 45/341 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 511 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 563
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 564 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 621
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++++ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 622 FWVKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 681
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 682 TQKKANGNCLWHFGLLLKLAFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 734
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----I 381
++EW FR LDRY+ GM++A+ + +K + L E + + S K
Sbjct: 735 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQILSEGKGEPLFSSKISNFLLF 788
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
++V F+ Y + K +K N+ HP S + I + I
Sbjct: 789 ISVVSFLTYSIWASSCK-NKAQCNELHPSVSVVQILAFILI 828
>gi|345779885|ref|XP_532464.3| PREDICTED: CAS1 domain-containing protein 1 [Canis lupus
familiaris]
Length = 1008
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 579 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 631
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 632 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 689
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 690 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 749
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 750 IQKKANGNCFWHFGLLLKLAFLLLCICFLAYSQ-----GAFEKIFSLWPLSKCFELKG-- 802
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 803 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQVLSEGKGEPLFSNKISNFLLFI 857
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 858 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 896
>gi|426356961|ref|XP_004045818.1| PREDICTED: CAS1 domain-containing protein 1 [Gorilla gorilla
gorilla]
Length = 728
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 299 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 351
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 352 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 409
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 410 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 469
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 470 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 522
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 523 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 577
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 578 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 616
>gi|17016934|gb|AAL33538.1|AF355594_1 O-acetyltransferase [Homo sapiens]
Length = 797
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 171/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY++ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVFFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|390466738|ref|XP_002751640.2| PREDICTED: CAS1 domain-containing protein 1 [Callithrix jacchus]
Length = 797
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYITLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQVLSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|397476765|ref|XP_003809762.1| PREDICTED: CAS1 domain-containing protein 1 [Pan paniscus]
Length = 728
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 299 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 351
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 352 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 409
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 410 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 469
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 470 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 522
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 523 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 577
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 578 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 616
>gi|332206950|ref|XP_003252558.1| PREDICTED: CAS1 domain-containing protein 1 [Nomascus leucogenys]
Length = 728
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 299 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 351
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 352 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 409
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 410 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 469
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 470 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 522
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 523 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 577
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 578 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 616
>gi|355747840|gb|EHH52337.1| hypothetical protein EGM_12765, partial [Macaca fascicularis]
Length = 753
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 324 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 376
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 377 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 434
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 435 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 494
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 495 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 547
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 548 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 602
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 603 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 641
>gi|119597198|gb|EAW76792.1| CAS1 domain containing 1, isoform CRA_d [Homo sapiens]
Length = 933
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 171/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 504 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 556
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 557 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 614
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 615 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 674
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 675 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 727
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY++ GM++A+ + +K E + EP K+S +
Sbjct: 728 -----NVYEWWFRWRLDRYVFFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 782
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 783 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 821
>gi|410219456|gb|JAA06947.1| CAS1 domain containing 1 [Pan troglodytes]
gi|410264310|gb|JAA20121.1| CAS1 domain containing 1 [Pan troglodytes]
gi|410306602|gb|JAA31901.1| CAS1 domain containing 1 [Pan troglodytes]
gi|410350143|gb|JAA41675.1| CAS1 domain containing 1 [Pan troglodytes]
Length = 797
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|380815690|gb|AFE79719.1| CAS1 domain-containing protein 1 precursor [Macaca mulatta]
gi|383420873|gb|AFH33650.1| CAS1 domain-containing protein 1 precursor [Macaca mulatta]
gi|384948856|gb|AFI38033.1| CAS1 domain-containing protein 1 precursor [Macaca mulatta]
Length = 797
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|395818938|ref|XP_003782866.1| PREDICTED: CAS1 domain-containing protein 1 [Otolemur garnettii]
Length = 992
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 169/340 (49%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 563 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 615
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 616 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 673
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ + +Y + I+ +
Sbjct: 674 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMTIYVTLAIWPQI 733
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 734 IQKKANGNCFWHFGLLLKLAFLLLFICFLAYSQ-----GAFEKIFSLWPLSKCFELKG-- 786
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ Y ++ + + EP K+S +
Sbjct: 787 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQLLSEGKGEPLFPNKISNFLLFI 841
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 842 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 880
>gi|393248032|gb|EJD55539.1| Cas1p-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 863
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 158/315 (50%), Gaps = 25/315 (7%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGK 166
M+ FG + + DRT + K Y D ++F +L L + A + K DK
Sbjct: 381 MSIFGLAMGLTFFADRTGVWLKEQKAY--DPWVFAFLTLAALGAGLATMKSADKD----- 433
Query: 167 TIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYI 223
+ +LNR QT+EWKGWMQ+ L+YHY A++I YN IR +AAY++MTG+G+ ++Y
Sbjct: 434 -LGFLNREQTDEWKGWMQLAILIYHYLGASKISGIYNPIRALVAAYLFMTGYGHTTFYLK 492
Query: 224 RKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEI 283
+ DF R AQ++ R+N +N DY+ YY P+ + + +++Y + + YN
Sbjct: 493 KADFGFLRIAQVLIRINLLTVALAYTMNTDYISYYFAPLVSWWFVVIYVTLYLGKAYNNS 552
Query: 284 GSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDR 343
+++KI+ ++ + + + ++ L AK EW+FR LD
Sbjct: 553 AIFVLIKIILSVALMTWFMKTSWILEDLFALLKQFFRINWSAK-------EWNFRVSLDL 605
Query: 344 YIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYEC--IYKLDK 401
+I GM+ A + K+ E P+ + +A +V L + LWY + + DK
Sbjct: 606 WIVYFGMLAALGY---IKFRELRLGDHPRWPMVQRASLVLSGLTL--LWYVGFELSQPDK 660
Query: 402 VTYNKYHPYTSWIPI 416
YN +HPY + +P+
Sbjct: 661 FRYNVWHPYIAVLPV 675
>gi|119597195|gb|EAW76789.1| CAS1 domain containing 1, isoform CRA_a [Homo sapiens]
Length = 654
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 225 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 277
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 278 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 335
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 336 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 395
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 396 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 448
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 449 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 503
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 504 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 542
>gi|332866671|ref|XP_519208.3| PREDICTED: CAS1 domain-containing protein 1 isoform 3 [Pan
troglodytes]
Length = 933
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 504 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 556
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 557 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 614
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 615 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 674
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 675 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 727
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 728 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 782
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 783 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 821
>gi|403257579|ref|XP_003921383.1| PREDICTED: CAS1 domain-containing protein 1 [Saimiri boliviensis
boliviensis]
Length = 907
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 478 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 530
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 531 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 588
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 589 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMIIYITLSLWPQI 648
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 649 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 701
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 702 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQVLSEGKGEPLFSNKISNFLLFI 756
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 757 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 795
>gi|189067266|dbj|BAG36976.1| unnamed protein product [Homo sapiens]
Length = 797
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|37620157|ref|NP_663373.2| CAS1 domain-containing protein 1 precursor [Mus musculus]
gi|81894449|sp|Q7TN73.1|CASD1_MOUSE RecName: Full=CAS1 domain-containing protein 1; Flags: Precursor
gi|31376261|dbj|BAC77246.1| O-acetyltransferase [Mus musculus]
gi|116138481|gb|AAI25378.1| CAS1 domain containing 1 [Mus musculus]
gi|116138838|gb|AAI25380.1| CAS1 domain containing 1 [Mus musculus]
Length = 797
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 173/335 (51%), Gaps = 33/335 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGSV-----MIVKILACFLVVILIWEIP---GVFDIFWS--PLTFILGYTDPAKPDLP 330
+ + + +L L+++ IW + G F+ +S PL+
Sbjct: 539 TQKKANGNFFWYLGLLLKLGLLLLCIWFLAYSQGAFEKIFSLWPLSKCFELEG------- 591
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEP--KRKLSIKAGIVTVALF 387
++EW FR LDRY+ G+++A+ Y + + + EP K+S V+V F
Sbjct: 592 SVYEWWFRWRLDRYVVFHGVLFAFIYLALQRRQILSEGKGEPLFSNKISNFLLFVSVVSF 651
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+ Y + K +K N+ HP S + I + I
Sbjct: 652 LTYSIWASSCK-NKAECNELHPSVSVVQIVAFILI 685
>gi|148682035|gb|EDL13982.1| CAS1 domain containing 1 [Mus musculus]
Length = 754
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 173/335 (51%), Gaps = 33/335 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 325 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 377
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 378 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 435
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 436 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 495
Query: 281 NEIGSV-----MIVKILACFLVVILIWEIP---GVFDIFWS--PLTFILGYTDPAKPDLP 330
+ + + +L L+++ IW + G F+ +S PL+
Sbjct: 496 TQKKANGNFFWYLGLLLKLGLLLLCIWFLAYSQGAFEKIFSLWPLSKCFELEG------- 548
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEP--KRKLSIKAGIVTVALF 387
++EW FR LDRY+ G+++A+ Y + + + EP K+S V+V F
Sbjct: 549 SVYEWWFRWRLDRYVVFHGVLFAFIYLALQRRQILSEGKGEPLFSNKISNFLLFVSVVSF 608
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+ Y + K +K N+ HP S + I + I
Sbjct: 609 LTYSIWASSCK-NKAECNELHPSVSVVQIVAFILI 642
>gi|170784865|ref|NP_075051.4| CAS1 domain-containing protein 1 precursor [Homo sapiens]
gi|74717082|sp|Q96PB1.1|CASD1_HUMAN RecName: Full=CAS1 domain-containing protein 1; Flags: Precursor
gi|15987887|gb|AAK97479.2|AF397424_1 C7ORF12 [Homo sapiens]
gi|51094891|gb|EAL24136.1| O-acetyltransferase [Homo sapiens]
gi|193784822|dbj|BAG53975.1| unnamed protein product [Homo sapiens]
Length = 797
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|402864212|ref|XP_003896368.1| PREDICTED: CAS1 domain-containing protein 1 [Papio anubis]
Length = 867
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 505 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 557
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 558 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 615
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 616 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 675
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 676 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 728
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ Y ++ + + EP K+S +
Sbjct: 729 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 783
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 784 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 822
>gi|299755413|ref|XP_001828646.2| O-acetyltransferase [Coprinopsis cinerea okayama7#130]
gi|298411215|gb|EAU93150.2| O-acetyltransferase [Coprinopsis cinerea okayama7#130]
Length = 846
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 93/307 (30%), Positives = 161/307 (52%), Gaps = 32/307 (10%)
Query: 114 LFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNR 173
L Y+ DR+ L K ++ F FL LL V + + SLK+ ++ + +LNR
Sbjct: 364 LIAIYLSDRSWLWLKEHKQFDAINFTFLCLLAVGI-GLASLKRADND-------LGFLNR 415
Query: 174 HQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFA 233
QT+EWKGWMQ + IYN IR+ +A+Y++MTG+G+ ++Y + DF R A
Sbjct: 416 DQTDEWKGWMQ------RASKVSGIYNPIRVLVASYLFMTGYGHTTFYLRKADFGFKRLA 469
Query: 234 QMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKIL- 292
Q++ RLN V++ DY+ YY P+ +++ +++YG + I ++ N ++++KIL
Sbjct: 470 QVLIRLNLLTVLLAYVMDTDYISYYFSPLVSMWYLVIYGTMAIGSQLNSRTPILLLKILV 529
Query: 293 ACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIY 352
+ L+ +W+ + +F S L I G A+ EW FR LD +I +GM+
Sbjct: 530 SATLMTSFMWKSQPLEAVF-SFLELIFGIRWSAR-------EWSFRVNLDLWIVYVGMLT 581
Query: 353 AYYHPTAEKWMEKLEESE--P-KRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHP 409
+ + A + +L + P K++I I+ + F + + + K TYN++HP
Sbjct: 582 SIFVVKARE--NRLTDHRLWPLTTKIAIGGSILVLVWFFAFE----LCQDSKFTYNRWHP 635
Query: 410 YTSWIPI 416
Y S +P+
Sbjct: 636 YISLLPV 642
>gi|119597197|gb|EAW76791.1| CAS1 domain containing 1, isoform CRA_c [Homo sapiens]
Length = 933
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 170/340 (50%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 504 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 556
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 557 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 614
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 615 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 674
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 675 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 727
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 728 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 782
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 783 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 821
>gi|395334564|gb|EJF66940.1| O-acetyltransferase [Dichomitus squalens LYAD-421 SS1]
Length = 873
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/306 (30%), Positives = 153/306 (50%), Gaps = 29/306 (9%)
Query: 118 YICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTE 177
++ DRT K ++ F FL LL + V +T + D + +LNR QT+
Sbjct: 380 FLADRTGYWLKEQKQFDPWTFAFLSLLSLAVGLLTVTRADKD--------LGFLNREQTD 431
Query: 178 EWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQ 234
EWKGWMQ+ L+YHY A++I YN IR+ +AAY++MTG+G+ ++Y + DF L R AQ
Sbjct: 432 EWKGWMQIAILIYHYTGASKISGIYNPIRVLVAAYLFMTGYGHTTFYVKKADFGLLRVAQ 491
Query: 235 MMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
++ RLN +N DY+ YY P+ + + +++Y + + +YN+ ++VK+L
Sbjct: 492 VLIRLNLLTLLLAYTMNTDYLYYYFAPLVSQWYLIIYVTMAVGAQYNDRTPFLVVKLLLS 551
Query: 295 FLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY 354
+V + + D+ + L A+ EW FR LD +I GM A
Sbjct: 552 MAIVTVFMSQQWILDMVFEFLERFCNIRWDAR-------EWAFRVNLDLWIVYFGMFTAL 604
Query: 355 YHPTAEKWMEKLEESEP--KRKLSIKAGIVTVALFVGYLWYEC--IYKLDKVTYNKYHPY 410
K ++ P + I +G+ V L LW+ + + DK YN +HPY
Sbjct: 605 ---AVIKIRDQRLTDHPLWPQATKIASGLSAVVL----LWFFAFELSQPDKFAYNAWHPY 657
Query: 411 TSWIPI 416
S++P+
Sbjct: 658 VSFLPV 663
>gi|76879715|dbj|BAE45726.1| putative protein product of Nbla04196 [Homo sapiens]
Length = 463
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 171/341 (50%), Gaps = 45/341 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 34 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 86
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 87 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 144
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 145 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 204
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 205 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 257
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEE--SEP--KRKLSIKAGI 381
++EW FR LDRY+ GM++A+ + +K + L E EP K+S
Sbjct: 258 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQILSEGKGEPLFSNKISNFLLF 311
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
++V F+ Y + K +K N+ HP S + I + I
Sbjct: 312 ISVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 351
>gi|37748371|gb|AAH58953.1| Casd1 protein, partial [Mus musculus]
Length = 495
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 173/335 (51%), Gaps = 33/335 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 66 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 118
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 119 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 176
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 177 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 236
Query: 281 NEIGSV-----MIVKILACFLVVILIWEIP---GVFDIFWS--PLTFILGYTDPAKPDLP 330
+ + + +L L+++ IW + G F+ +S PL+
Sbjct: 237 TQKKANGNFFWYLGLLLKLGLLLLCIWFLAYSQGAFEKIFSLWPLSKCFELEG------- 289
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEP--KRKLSIKAGIVTVALF 387
++EW FR LDRY+ G+++A+ Y + + + EP K+S V+V F
Sbjct: 290 SVYEWWFRWRLDRYVVFHGVLFAFIYLALQRRQILSEGKGEPLFSNKISNFLLFVSVVSF 349
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+ Y + K +K N+ HP S + I + I
Sbjct: 350 LTYSIWASSCK-NKAECNELHPSVSVVQIVAFILI 383
>gi|392571169|gb|EIW64341.1| Cas1p-domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 869
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 156/311 (50%), Gaps = 23/311 (7%)
Query: 109 EFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTI 168
F A L Y + DRT K ++ F FL LL + ++ + ND +
Sbjct: 375 SFAAALIY--VADRTGFWLKEQKQFDPWTFGFLALLSLAAGLLSVKRADND--------L 424
Query: 169 QYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIRK 225
+LNR QT+EWKGWMQ+ L+YHY A++I YN IR+ +AAY++MTG+G+ ++Y +
Sbjct: 425 GFLNREQTDEWKGWMQIAILIYHYTGASKISGIYNPIRVLVAAYLFMTGYGHTTFYVKKA 484
Query: 226 DFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGS 285
DF L R AQ++ RLN +N DY+ YY P+ + + +++YG + + +KYN+
Sbjct: 485 DFGLLRVAQVLVRLNLLTLLLAYTMNTDYISYYFAPLVSQWYLIIYGTMFLGSKYNDRTI 544
Query: 286 VMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYI 345
+I KIL VV P + + + L A+ EW FR LD +I
Sbjct: 545 FLITKILLSMGVVTWFMSEPWLLEAVFQFLARFCNIQWSAR-------EWAFRVNLDLWI 597
Query: 346 WIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYN 405
GM A K E P+ L +++ V A+ + + + + + DK YN
Sbjct: 598 VYFGMFAAI---AVMKVREHRITDHPRWPLVVRSAAVLSAVVLLWFFSFELAQADKFAYN 654
Query: 406 KYHPYTSWIPI 416
+HPY S++P+
Sbjct: 655 LWHPYVSFLPV 665
>gi|320170086|gb|EFW46985.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 1008
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 142/277 (51%), Gaps = 38/277 (13%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGK 166
+A+ G +L + ++CDRT++ K Y F + + V+ A H +P
Sbjct: 547 VAQLGLVLVFAFLCDRTHIFPKERKIYTH--FDLIASIGVMFVAGWCTLTHVKTAP---- 600
Query: 167 TIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYI 223
+L+R QT+EWKGWMQ++FL+YH+ + + Y +R+F+A+Y+++TGFG+F+++Y
Sbjct: 601 --TFLSRDQTDEWKGWMQIVFLIYHFLGGSSVLPLYMLVRVFVASYLFLTGFGHFTFFYT 658
Query: 224 RKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEI 283
+K F + R Q++ RLN C+V++ Y YY P+ + + ++VY A+ F +
Sbjct: 659 KKQFGITRIIQVLTRLNVLTIGLCMVMDKPYQFYYFVPLSSFWFLVVY-AIMAFPSIPAV 717
Query: 284 -----GSVMIVKILACFLVVILIW---------EIPGVFDIF-----WSPLTFILGYTDP 324
G +ILA F + ++W F IF W P+ + +D
Sbjct: 718 QKRFPGLDAPARILALFTICSIVWLPFSGNYKSTSADDFPIFQFLFDWWPIKELFSVSD- 776
Query: 325 AKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEK 361
L+EW FRSGLD +I GM+ AY + A+
Sbjct: 777 ------SLYEWRFRSGLDCFIVAHGMLIAYAYQIAKS 807
>gi|302889163|ref|XP_003043467.1| hypothetical protein NECHADRAFT_88136 [Nectria haematococca mpVI
77-13-4]
gi|256724384|gb|EEU37754.1| hypothetical protein NECHADRAFT_88136 [Nectria haematococca mpVI
77-13-4]
Length = 856
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 96/322 (29%), Positives = 154/322 (47%), Gaps = 34/322 (10%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
F L Y Y+ DRT+L K + F L + IV A T +++ + + TI
Sbjct: 394 FPMALLYCYVADRTDLFSKGMKEFATLEFALLVAVYAIVFAAT-IRRTRLRVVAADATIT 452
Query: 170 Y--------LNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNF 218
L+R QTEEWKGWMQ L+YH+ A+ +Y IR+ +AAY++ TG+G+
Sbjct: 453 KPIKEDAGILSRDQTEEWKGWMQAAILIYHWTGASRNLPVYMFIRLLVAAYLFQTGYGHT 512
Query: 219 SYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFN 278
++ +KDFS R A ++ RLN +++ DYM YY P+ + + ++VY +GI +
Sbjct: 513 IFFLAKKDFSPRRIANVLLRLNMLSCALPYIMDTDYMFYYFAPLVSFWFLVVYATMGIGS 572
Query: 279 KYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFR 338
+YN+ +I KI ++V +I + V + + L I K D LHEW FR
Sbjct: 573 RYNDSTYAVISKICGSLVLVTVILKFTPVMEWVFVALKTIFRI----KWD---LHEWEFR 625
Query: 339 SGLDRYIWIIGMIYAYYHPTAEK---WMEKLEES-EPKRKLSIKAGIVTVALFVGYLWYE 394
LD +I +GM+ H E+ W + P + + + +
Sbjct: 626 INLDSFIVYVGMLAGVAHQRMERNSTWFTNYRNAIAPSLAILMICAVPCL---------- 675
Query: 395 CIYKLDKVTYNKYHPYTSWIPI 416
I+ +K Y HPY S++P+
Sbjct: 676 -IFCDEKRKYTMVHPYLSFLPV 696
>gi|219114524|ref|XP_002176432.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217402678|gb|EEC42668.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 460
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 104/324 (32%), Positives = 158/324 (48%), Gaps = 37/324 (11%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L A + +L Y+ + N +N N ++F + +++ A + KH+ S
Sbjct: 19 LLAQGQIAVVLVVAYVGN--NWPHSYPRNENAKPYMFWAMNALLLIAGIATLKHDGNS-- 74
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYI 223
S + +Q L+R QTEEWKGWMQ F+MYHY+ YNAIR+F++AYVWMTGFGNF Y+
Sbjct: 75 SSRGVQLLSRPQTEEWKGWMQWAFIMYHYYRYYGAYNAIRVFVSAYVWMTGFGNFQYFDK 134
Query: 224 RKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMH------TLFTIMVYGAVGIF 277
+ DFSL R M R+N+F + L LYY+ P+H T+ T + AV
Sbjct: 135 KSDFSLERAISMWLRINYFPILLSLFLTVPLELYYVVPLHTAAFFITMATCALAQAVESR 194
Query: 278 NKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHF 337
K++ S I I CFLV +L +E F+ ++D E++F
Sbjct: 195 KKWSRTHS-NIFAIAVCFLVHVLFYETKASH--------FLKLFSD----------EYYF 235
Query: 338 RSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSI----KAGIVTVALFVGYLWY 393
R DRY +G++ + + +++ E R ++ G+ +AL+ W
Sbjct: 236 RFTSDRYSAWVGILSGFSWGHFKSYIQWCYGGEQVRAGAMWMQRAGGVGLIALW----WI 291
Query: 394 ECIYKLDKVTYNKYHPYTSWIPIT 417
+ DK TYN HPY WIP+
Sbjct: 292 LFGHIQDKFTYNPIHPYVFWIPVA 315
>gi|391337842|ref|XP_003743273.1| PREDICTED: CAS1 domain-containing protein 1 [Metaseiulus
occidentalis]
Length = 821
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 174/331 (52%), Gaps = 37/331 (11%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
++++ G I+ YF ICDRTN K + + L +L + VSA+ + P
Sbjct: 395 SLSKLGLIMMYFLICDRTNFFLKENKYFTQ---LNFFLPICYVSALGLFFTED---PRCE 448
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYY 222
TI L+R QT+EWKGWMQ++ L+YH A++I Y +R+ + +Y+++TG+ +F++++
Sbjct: 449 STI--LHRKQTQEWKGWMQLVVLIYHMTGASKIVPIYMHVRVLVTSYLFLTGYNHFTHFW 506
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY-------GAVG 275
+ D S+ R A++++RLN V C+ +N Y YY P+ T T++V+
Sbjct: 507 MGGDASIFRLAKVVFRLNLLVFVLCLSMNRPYQFYYFVPLVTFCTLLVHLTMVLPPRVTA 566
Query: 276 IFNKYNEIGSV-MIVKILACFLVVILIWEIPGVFD-IFWSP---LTFILGYTDPAKPDLP 330
+ N I + M++K +A F + L++ +F+ IF P F++ ++ +
Sbjct: 567 ASAEQNPIQYMYMVLKFVALFSAITLLYMSQVLFEKIFLMPPWKALFVISASENVR---- 622
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKW---MEKLEESEPKRKLSI--KAGIVTVA 385
EW+FR +DRY GM++A+ +A+K ++ + E+ + +L I + ++
Sbjct: 623 ---EWYFRWKIDRYSTPCGMLFAFLLHSAKKHGLIIDSVSENVVRSRLLILFMTAVSVIS 679
Query: 386 LFVGYLWYECIYKLDKVTYNKYHPYTSWIPI 416
+ + Y +K N+ HPY SW+PI
Sbjct: 680 FMISTTF--AFYCGEKPDCNEVHPYISWLPI 708
>gi|344270703|ref|XP_003407183.1| PREDICTED: CAS1 domain-containing protein 1 [Loxodonta africana]
Length = 797
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 167/341 (48%), Gaps = 45/341 (13%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLAFLLFCICFLAY-----SQGAFEKIFSLWPLSKCFELRG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----I 381
++EW FR LDRY+ GM++A+ + +K + L E + + S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQK-RQVLSEGKGEPLFSNRVSNFLLF 645
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N HP S + I + I
Sbjct: 646 FSVVSFLTYSIWASSCK-NKAKCNDLHPSVSVVQILAFILI 685
>gi|302887853|ref|XP_003042814.1| hypothetical protein NECHADRAFT_86689 [Nectria haematococca mpVI
77-13-4]
gi|256723727|gb|EEU37101.1| hypothetical protein NECHADRAFT_86689 [Nectria haematococca mpVI
77-13-4]
Length = 854
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 100/326 (30%), Positives = 155/326 (47%), Gaps = 34/326 (10%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGK 166
+A F L Y DRT +K Y F L L I + L K G
Sbjct: 386 VATFVTGLLACYWADRTQSFAKGSKEYVTFEFNLLTALCFI-AGFAFLAKSKPPPARPGS 444
Query: 167 T---------IQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTG 214
T ++ L+R QT+EWKGWMQ + L+YH+ A+ IY +R+ +AAY++ TG
Sbjct: 445 TTPAPATLDDMKPLSRDQTDEWKGWMQAIILVYHWTGASRDLNIYVGVRLLVAAYLFQTG 504
Query: 215 FGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAV 274
+G+ ++ +KDFS R A ++ RLN ++N DYM YY P+ + + +++Y
Sbjct: 505 YGHAVFFSTKKDFSFKRVAAVLLRLNLLSCALPFIMNTDYMFYYFAPLVSFWFVIIYALF 564
Query: 275 GIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPR--- 331
I KYN+ ++VKI+ + PGV + W+P +L + A + R
Sbjct: 565 AIGKKYNDNTYALVVKIIISAAIC------PGV--MLWTP---VLEWVFAALSLVFRINW 613
Query: 332 -LHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGY 390
LHEW FR GLD I +G++ + + L +S ++ AGI+++ + Y
Sbjct: 614 DLHEWQFRLGLDGLIVYVGILMGIASVRTKAYNLILTKS---YGIAGIAGILSMPV---Y 667
Query: 391 LWYECIYKLDKVTYNKYHPYTSWIPI 416
W + K Y K HP S+IPI
Sbjct: 668 WWVAVSHAEKKQDYTKLHPIFSFIPI 693
>gi|38648760|gb|AAH63284.1| CAS1 domain containing 1 [Homo sapiens]
Length = 797
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 169/340 (49%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CD NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDSANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 478
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 479 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 538
Query: 281 NEIGS-------------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + + + CFL G F+ +S PL+
Sbjct: 539 IQKKANGNCFWHFGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 591
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 592 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 646
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 647 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 685
>gi|449280418|gb|EMC87736.1| CAS1 domain-containing protein 1, partial [Columba livia]
Length = 756
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 169/335 (50%), Gaps = 33/335 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L + + G I+ YFY+CDR NL K Y F + ++++ +
Sbjct: 327 LHSFCKLGLIMTYFYLCDRANLFMKENKFYTHSSFFIPIVYILVLGVFYTE--------- 377
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 378 NTKETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 437
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++++ DF + R Q+++RLNF V C+V++ Y YY P+ T++ +++Y + I+ +
Sbjct: 438 FWVKGDFGVYRVCQVLFRLNFLVVVLCVVMDRPYQFYYFVPLVTVWFMIIYATLAIWPQI 497
Query: 281 NEI---GSVM-----IVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPAKPDLP 330
+ G+ + ++K++ + + G F+ +S PL+
Sbjct: 498 VQKKANGNCLWHFGLLLKLICLLTCIYFLSYSQGAFEKIFSFWPLSKCFELNG------- 550
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESEP--KRKLSIKAGIVTVALF 387
++EW FR LDRY+ GM++A+ Y ++ M + +P ++S ++V F
Sbjct: 551 NVYEWWFRWKLDRYVVFHGMLFAFIYLALQKRQMISEGKGDPLFSNRVSNVLLFISVVSF 610
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
Y + K +K N+ HP S + I + I
Sbjct: 611 STYSIWASSCK-NKTECNELHPSVSVVQILAFILI 644
>gi|320202971|ref|NP_001038707.2| CAS1 domain-containing protein 1 precursor [Danio rerio]
gi|160017664|sp|Q1LW89.2|CASD1_DANRE RecName: Full=CAS1 domain-containing protein 1; Flags: Precursor
Length = 781
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/337 (28%), Positives = 167/337 (49%), Gaps = 41/337 (12%)
Query: 98 LENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKH 157
L + L A+ + I+ YFY+CDR ++ K Y F + + ++ S
Sbjct: 353 LNPKGPLLAIGKMSLIMLYFYLCDRADIFMKEQKFYTHSAFFIPLIYIFVLGVFYSE--- 409
Query: 158 NDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTG 214
+ K + LNR QT+EWKGWMQ++ L+YH A+ +Y +R+ +AAY++ TG
Sbjct: 410 ------NSKETKLLNREQTDEWKGWMQLVILIYHISGASAFIPVYMHVRVLVAAYLFQTG 463
Query: 215 FGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAV 274
+G+FS+++++ DF L R Q+++RLNF V C+V++ Y YY P+ T + ++Y +
Sbjct: 464 YGHFSFFWLKGDFGLYRVCQVLFRLNFLVVVLCLVMDRPYQFYYFVPLVTFWFAVIYATM 523
Query: 275 GI--------------FNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILG 320
+ +N + + ++ + F ++E G+F ++ PL+ +
Sbjct: 524 ALWPQILQKQANGSAFWNLALLLKLLGLLLFIGFFAYSQELFE--GIFSVW--PLSKLFE 579
Query: 321 YTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSI 377
+HEW FR LDR+ + GM++A+ + +K+ E + EP K+S
Sbjct: 580 LQG-------SIHEWWFRWKLDRFAVVNGMLFAFIYLLLQKYQLLSEGKGEPLFSNKISN 632
Query: 378 KAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWI 414
V+V F+ Y + K +K N+ HPY S I
Sbjct: 633 CLLFVSVVSFMTYSIWASGCK-NKSECNEMHPYISVI 668
>gi|363729948|ref|XP_001235382.2| PREDICTED: CAS1 domain-containing protein 1 [Gallus gallus]
Length = 801
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 169/338 (50%), Gaps = 39/338 (11%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L + G I+ YFY+CDR NL K Y F + ++++ +
Sbjct: 372 LHCFCKLGLIMTYFYLCDRANLFMKENKFYTHSSFFIPIVYILVLGVFYTE--------- 422
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 423 NTKETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 482
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 483 FWIKGDFGVYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMIIYATLAMWPQI 542
Query: 281 NEIGS-----------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPAKP 327
+ + + ++ +LAC + + G F+ +S PL+
Sbjct: 543 VQKKANGNCLWHFGLLLKLICLLAC---IYFLSYSQGAFEKVFSFWPLSKCFELNG---- 595
Query: 328 DLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEK-WMEKLEESEP--KRKLSIKAGIVTV 384
++EW FR LDRY+ GM++A+ + +K M + +P ++S +++
Sbjct: 596 ---NVYEWWFRWKLDRYVVFHGMLFAFIYLALQKHQMISEGKGDPLFSSRVSNVLLFISI 652
Query: 385 ALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
F+ Y + K +K N+ HP S + I + I
Sbjct: 653 VSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 689
>gi|346972781|gb|EGY16233.1| CAS1 domain-containing protein [Verticillium dahliae VdLs.17]
Length = 840
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 152/321 (47%), Gaps = 33/321 (10%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLK-----KHNDKSPFS 164
F L Y DRT L ++K + D F + + +V T + + + +P
Sbjct: 384 FVTALLACYHADRTQFLAKASKMFVYDEFTVMTAICALVFLFTIRRSRVAPQPDTIAPQK 443
Query: 165 GKTIQYL-NRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ L R QTEEWKGWMQ L+YH+ A++ IY IR+ +A+Y++ TG+G+ +
Sbjct: 444 APEVNVLLPRDQTEEWKGWMQAAILVYHWTGASKDLGIYIFIRLLVASYLFQTGYGHTIF 503
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
+ +KDF R A M RLN V+ DYM YY P+ + + ++VY + I ++
Sbjct: 504 FLKKKDFGFKRIAATMLRLNLLSCALPYVMGTDYMFYYFAPLVSFWFMVVYSTLAIGRQH 563
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSG 340
N+ ++ KI L+ ++ V + ++ L + K D LHEW FR G
Sbjct: 564 NDNLGALLGKISISALITYVVMMKSPVPEWGFTVLRLLCNI----KWD---LHEWTFRVG 616
Query: 341 LDRYIWIIGMIYAYYH---PTAEKWMEKLEESEPKRKLSIKAGIVT--VALFVGYLWYEC 395
LD +I +GM+ A H P A W LS G V VA++ GY
Sbjct: 617 LDAFIVFVGMLTAIAHVRYPNACAW-----------ALSSYVGAVGGLVAMY-GYYHACG 664
Query: 396 IYKLDKVTYNKYHPYTSWIPI 416
Y K YN +HPY SWIPI
Sbjct: 665 EYFPTKEIYNSWHPYISWIPI 685
>gi|342885185|gb|EGU85284.1| hypothetical protein FOXB_04200 [Fusarium oxysporum Fo5176]
Length = 932
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 160/328 (48%), Gaps = 39/328 (11%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSP------- 162
F L Y + DRT+L K + + F L + +I++ +T ++K ++P
Sbjct: 458 FAVSLLYCFAADRTHLFSKGMKEFVPNEFYLLIAICLIIAGLT-IRKTKFRAPRLPVAEA 516
Query: 163 -----FSGKTIQ----YLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYV 210
+ TI L+R QTEEWKGWMQ L+YH+ A IY IR+ +AAY+
Sbjct: 517 VAAPTTTTMTIDEDAGVLSRDQTEEWKGWMQAAILVYHWTGAIRDLPIYIFIRLLVAAYL 576
Query: 211 WMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMV 270
+ TG+G+ Y+ +KDFS R A +M RLN V+ DYM YY P+ + + ++V
Sbjct: 577 FQTGYGHTIYFLSKKDFSFRRIASVMLRLNILSCALPYVMGTDYMFYYFAPLVSFWFMVV 636
Query: 271 YGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLT-FILGYTDPAKPDL 329
Y + I + +N+ ++ KI F++V +I + ++PLT + D
Sbjct: 637 YATMAICSGFNDSVKIVTSKIFVSFIIVTVI--------LNFTPLTQWAFAILDVVFRIK 688
Query: 330 PRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFV- 388
L EW FR LD I +GM+ A++ +E+ + + KA I+ AL +
Sbjct: 689 WNLDEWMFRVTLDGAIVFVGMLAG----VAQQRLER----DSAWYTNYKAAIIPSALSIL 740
Query: 389 GYLWYECIYKLDKVTYNKYHPYTSWIPI 416
GY ++ C Y D+ Y HP + +PI
Sbjct: 741 GYAYF-CTYIGDRKAYIMMHPVIAAVPI 767
>gi|224044921|ref|XP_002195019.1| PREDICTED: CAS1 domain-containing protein 1 [Taeniopygia guttata]
Length = 758
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 170/335 (50%), Gaps = 33/335 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L + G I+ YFY+CDR NL K Y F + ++++ +
Sbjct: 329 LHCFCKLGLIMTYFYLCDRANLFMKENKFYTHSSFFIPIVYILVLGVFYTE--------- 379
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 380 NTKETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 439
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 440 FWIKGDFGVYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMIIYATLAVWPQI 499
Query: 281 NEI---GSVM-----IVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPAKPDLP 330
+ G+ + ++K++ + + + G F+ +S PL+
Sbjct: 500 VQKKANGNCLWHFGLLLKLICLLICIYFLSYSQGAFEKIFSFWPLSKCFELNG------- 552
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEK-WMEKLEESEP--KRKLSIKAGIVTVALF 387
++EW FR LDRY+ GM++A+ + +K M + +P ++S +++ F
Sbjct: 553 NVYEWWFRWKLDRYVVFHGMLFAFIYLALQKHQMISEGKGDPLFSNRVSNVLLFISIVSF 612
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+ Y + K +K N+ HP S + I + I
Sbjct: 613 LTYSIWASSCK-NKTECNELHPSVSVVQILAFILI 646
>gi|355560822|gb|EHH17508.1| hypothetical protein EGK_13929, partial [Macaca mulatta]
Length = 709
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 94/327 (28%), Positives = 158/327 (48%), Gaps = 33/327 (10%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 282 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 334
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + L R +T+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 335 --KETKVLKRERTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 392
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIF--- 277
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++
Sbjct: 393 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 452
Query: 278 -----NKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPAKPDLP 330
N G F+ +S PL+
Sbjct: 453 IQKKANXXXXXXXXXXXXXXXXXXXXXXXXXXXGAFEKIFSLWPLSKCFELKG------- 505
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIVTVALF 387
++EW FR LDRY+ GM++A+ + +K E + EP K+S ++V F
Sbjct: 506 NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFISVVSF 565
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWI 414
+ Y + K +K N+ HP S I
Sbjct: 566 LTYSIWASSCK-NKAECNELHPSVSVI 591
>gi|390343616|ref|XP_003725918.1| PREDICTED: CAS1 domain-containing protein 1-like
[Strongylocentrotus purpuratus]
Length = 343
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 158/323 (48%), Gaps = 42/323 (13%)
Query: 116 YFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQ 175
Y Y+CDRT+L K Y+ F F + ++S N K P TI LN Q
Sbjct: 3 YVYLCDRTDLFPKRQKYYSSFYFFFSLFMGFVISLFL---HANTKKP----TI--LNLDQ 53
Query: 176 TEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRF 232
T+EWKGWMQ+L L+YHY A++ IY +R+ +A+Y++MT +G F + + F L R
Sbjct: 54 TKEWKGWMQLLILIYHYLGASKVVPIYMHLRLIVASYLFMTAYGQFCASWDKNKFGLARV 113
Query: 233 AQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGS------- 285
+M+R+N V F C+V++ Y LYY P+ + + +++Y + IF + +
Sbjct: 114 CNVMFRMNLMVFFLCLVMDRQYQLYYFVPLVSFWFLIIYATMAIFPRVTRQKAEESSRCY 173
Query: 286 -VMIVKILACFLVVILIWEIPGVFDIF--WSPLTFILGYTDPAKPDLPRLHEWHFRSGLD 342
M+ K++ +++ + F++ W P+ + Y + ++EW FR LD
Sbjct: 174 LYMLAKLVILVVIITCLSFSQVTFNLMFEWWPMNQLFCYPGTS------IYEWWFRWNLD 227
Query: 343 RYIWIIGMIYAY--YHPTAEKWMEKLEESEPKRKLSIKAGIV------TVALFVGYLWYE 394
+YI GM + + + KW L++S S+ I+ V L Y Y+
Sbjct: 228 KYIIPYGMFFGFILLSSKSSKW---LDDSHSGDLFSLVKTILVYIIAAAVLLMYSYHLYQ 284
Query: 395 CIYKLDKVTYNKYHPYTSWIPIT 417
C K T N H YTS+IP+T
Sbjct: 285 C---ESKPTCNAVHSYTSFIPVT 304
>gi|154316245|ref|XP_001557444.1| hypothetical protein BC1G_03708 [Botryotinia fuckeliana B05.10]
gi|347836389|emb|CCD50961.1| hypothetical protein [Botryotinia fuckeliana]
Length = 902
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 176/372 (47%), Gaps = 34/372 (9%)
Query: 97 LLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK 156
LL L A+ F + Y Y DRT L K +++ L + V+V+ + S++
Sbjct: 383 LLPQPEVLGALMTFALAICYCYYADRTQLFEKEHKQFHKREVL-IASSAVLVAGLLSVRM 441
Query: 157 HNDKSPFSGK--------TIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIF 205
+N SP G +L+R QT+EWKGWMQ L L+YHY ++ ++ +R
Sbjct: 442 NNPSSPSKGNHEVRAHAFDYGFLSRDQTDEWKGWMQFLILIYHYTHGSKTLWLFEIMRNL 501
Query: 206 IAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTL 265
+A Y++MTG+G+ Y+ R+D+S R A ++ RLN ++ DY +Y P+ +
Sbjct: 502 VAGYLFMTGYGHTMYFLRREDYSFKRVASVLIRLNMLTCALSYMMRTDYTAHYFVPLVSF 561
Query: 266 FTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPA 325
+ ++VY + + + N ++ KI ++ +IPGV ++ L FIL YT
Sbjct: 562 WFLVVYLTLKVRQEKNSSIGFLLGKIFISAILTTAFIKIPGVLEV----LAFILKYTFAI 617
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYA--YYHPTAEKWMEKLEESEPKRKL-------- 375
+ + EW +R+G+D +I IGMI A Y T K + +S+ L
Sbjct: 618 SWN---MQEWRYRAGMDMFIVYIGMITAILYLRLTRIKAASVICKSKIDTLLRPVVRHPI 674
Query: 376 --SIKAGIVTVALFVGY-LWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHL 432
A + +V L G+ L E D+ Y +HPY S +PI + + S+++
Sbjct: 675 FFKTTAIVASVILLPGFRLLTE--RSPDREDYLWWHPYISAVPILAFVTLRNSHSILRSY 732
Query: 433 SGSLYMMACRYS 444
+++ R S
Sbjct: 733 HSTIFAWLGRCS 744
>gi|342875261|gb|EGU77064.1| hypothetical protein FOXB_12447 [Fusarium oxysporum Fo5176]
Length = 848
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 150/331 (45%), Gaps = 43/331 (12%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIV--SAMTSLKKH------- 157
+A F L Y DRT +K YN F + L IV + MT K
Sbjct: 383 VATFVTGLLACYWADRTQSFAKGSKEYNMFDFNLMSALCFIVGFAFMTKSKPPPPRPGAA 442
Query: 158 --------NDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFI 206
+D P L+R QT+EWKGWMQ L L+YH+ A+ IY IR+ +
Sbjct: 443 PAAAPATLDDAKP--------LSRDQTDEWKGWMQALILVYHWTGASRDLNIYVGIRLLV 494
Query: 207 AAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLF 266
AAY++ TGFG+ ++ +KDFS R A ++ RLN +N DYM YY P+ + +
Sbjct: 495 AAYLFQTGFGHGVFFSSKKDFSFKRVAAVLLRLNLLSCALPFFMNTDYMFYYFAPLVSFW 554
Query: 267 TIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSP-LTFILGYTDPA 325
+++Y I KYN+ ++ KI + PG + W+P L ++ +
Sbjct: 555 FLIIYALFAIGQKYNDNTWALMGKIAVSAAIC------PGA--MLWTPVLQWVFDALNMV 606
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVA 385
LHEW FR GLD I +G+I + + + L +S L+ AGI+++
Sbjct: 607 FRIEWDLHEWQFRLGLDGLIVYVGIIMGVASVRTKLYNKILTQS---YGLAGVAGILSIP 663
Query: 386 LFVGYLWYECIYKLDKVTYNKYHPYTSWIPI 416
L Y W K Y HP S+IPI
Sbjct: 664 L---YWWVAVSSAETKQDYTAIHPVFSFIPI 691
>gi|408398471|gb|EKJ77601.1| hypothetical protein FPSE_02099 [Fusarium pseudograminearum CS3096]
Length = 851
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 100/330 (30%), Positives = 153/330 (46%), Gaps = 36/330 (10%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG- 165
+A F L Y DRT +K Y+ F + L IV + K P G
Sbjct: 382 VATFVTGLLACYWADRTQSFAKGSKQYSMFDFNLMSALCFIV-GFALMTKSKPPPPRPGA 440
Query: 166 --------------KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAA 208
+ + L+R QT+EWKGWMQ L L+YH+ A+ IY IR+ +AA
Sbjct: 441 QAATAAAPAAPATLEDAKPLSRDQTDEWKGWMQALILVYHWTGASRDLNIYVGIRLLVAA 500
Query: 209 YVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTI 268
Y++ TGFG+ ++ +KDFS R A ++ RLN +N DYM YY P+ + + +
Sbjct: 501 YLFQTGFGHGVFFSSKKDFSFKRVAAVLLRLNLLSVALPFFMNTDYMFYYFAPLVSFWFL 560
Query: 269 MVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSP-LTFILGYTDPAKP 327
++Y I KYN+ ++ KI + PG + W+P L ++ +
Sbjct: 561 IIYALFAICPKYNDNTWALMGKIAISAAIC------PGA--MLWTPVLQWVFDALNIVFR 612
Query: 328 DLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALF 387
LHEW FR GLD I +G+I + + + L +S L+ AGI+++ L+
Sbjct: 613 IEWDLHEWQFRLGLDGLIVYVGIIMGVASVRTKLYNKILTQS---YGLAGIAGILSIPLY 669
Query: 388 VGYLWYECIYKLD-KVTYNKYHPYTSWIPI 416
W+ + K + K Y HP S+IPI
Sbjct: 670 ----WWVAVSKAEKKQDYTALHPIFSFIPI 695
>gi|321471896|gb|EFX82868.1| hypothetical protein DAPPUDRAFT_48914 [Daphnia pulex]
Length = 819
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 186/381 (48%), Gaps = 27/381 (7%)
Query: 48 LVELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAM 107
L+E E + + + D V++ G S A T R ++D +R + A+
Sbjct: 341 LMEAEDDEMAKKD--VVVANGHSTVVVAEGFDEK-ATTAHRKAAVEDV-ESSSRVVVTAL 396
Query: 108 AEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKT 167
+ G I+ YFY+CDRT K Y+ L ++ + V A+ N++S ++
Sbjct: 397 GKLGLIMAYFYLCDRTTFFMKENKYYSH---LNFWVPVGYVFALGLF--FNEES----RS 447
Query: 168 IQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYYYIR 224
+ L+R QT+EWKGWMQ++ L+YH A+++ Y +R+ +++Y+++TGFG+ S+++
Sbjct: 448 TKVLHRDQTDEWKGWMQLVILIYHMTGASQVIPLYMQMRVLVSSYLFLTGFGHLSFFWNG 507
Query: 225 KDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYN--- 281
S PR Q+++R+N C+ +N Y YY P+ + + ++VY + + K +
Sbjct: 508 GTASFPRLFQVLFRMNLMTVVICLCMNRPYQSYYFVPLVSFWYLVVYIVLAVPPKVSAAI 567
Query: 282 -EIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSG 340
+ S + I+ F +I I + ++F+ + I + +HEW FR
Sbjct: 568 CDANSFAYLYIIIKFCTLIGAITILYMSEVFFETIFLIRPWKALFVTSSDEIHEWWFRWS 627
Query: 341 LDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI---VTVALFVGYLWYEC-- 395
LDRY GMI+ + + A+K+ L + ++ L + I V +A +G Y
Sbjct: 628 LDRYSICYGMIFGFLYLNAQKF--GLIDDSTRQHLLSRTKIRFLVVLAALIGLGGYTAFT 685
Query: 396 IYKLDKVTYNKYHPYTSWIPI 416
I K N+ H Y +++PI
Sbjct: 686 ITCHSKPECNEVHSYLAFLPI 706
>gi|355675241|gb|AER95472.1| CAS1 domain containing 1 [Mustela putorius furo]
Length = 313
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 106/176 (60%), Gaps = 12/176 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 124 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSTFFIPIIYILVLGVF-----YNENT-- 176
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 177 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 234
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + +
Sbjct: 235 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLAL 290
>gi|10438077|dbj|BAB15163.1| unnamed protein product [Homo sapiens]
Length = 322
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 106/176 (60%), Gaps = 12/176 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 116 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 168
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 169 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 226
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + +
Sbjct: 227 FWIKGDFGIYRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLAL 282
>gi|90084681|dbj|BAE91182.1| unnamed protein product [Macaca fascicularis]
Length = 420
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 161/323 (49%), Gaps = 45/323 (13%)
Query: 116 YFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQ 175
YFY+CDR NL K Y F + ++++ +N+ + K + LNR Q
Sbjct: 3 YFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT----KETKVLNREQ 53
Query: 176 TEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRF 232
T+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY++I+ DF + R
Sbjct: 54 TDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSYFWIKGDFGIHRV 113
Query: 233 AQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGS------- 285
Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ + + +
Sbjct: 114 CQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQIIQKKANGNCFWH 173
Query: 286 ------VMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPAKPDLPRLHEWHF 337
+ + + CFL G F+ +S PL+ ++EW
Sbjct: 174 FGLLLKLGFLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-------NVYEWWL 221
Query: 338 RSGLDRYIWIIGMIYAYYHPTAEKWMEKLEE--SEP--KRKLSIKAGIVTVALFVGYLWY 393
R LDRY+ GM++A+ + +K + L E EP K+S ++V F+ Y +
Sbjct: 222 RWRLDRYVVFHGMLFAFIYLALQK-RQILSEGKGEPLFSNKISNFLLFISVVSFLTYSIW 280
Query: 394 ECIYKLDKVTYNKYHPYTSWIPI 416
K +K N+ HP S + I
Sbjct: 281 ASSCK-NKAECNELHPSVSVVQI 302
>gi|156037568|ref|XP_001586511.1| hypothetical protein SS1G_12498 [Sclerotinia sclerotiorum 1980]
gi|154697906|gb|EDN97644.1| hypothetical protein SS1G_12498 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 905
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 133/267 (49%), Gaps = 19/267 (7%)
Query: 98 LENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKH 157
L L A F ++ Y Y DRT L K + R FL ++I+ + S++
Sbjct: 380 LPQAEVLGAFMTFALVICYCYYADRTQLFEKEQKQFRRREFLIASSFVLII-GLLSIRST 438
Query: 158 NDKSPFSG------KTIQY--LNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFI 206
SP G + Y L+R QT+EWKGWMQ L L+YHY ++ ++ +R +
Sbjct: 439 TPSSPSKGIHEIGAHALDYGFLSRDQTDEWKGWMQFLILIYHYTHGSKTLWLFEIMRNLV 498
Query: 207 AAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLF 266
A Y++MTG+G+ Y+ R+D+S R A ++ RLN ++ DYM +Y P+ + +
Sbjct: 499 AGYLFMTGYGHTMYFLKREDYSFKRVASVLLRLNLLTCALSYMMRTDYMAHYFVPLVSFW 558
Query: 267 TIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAK 326
+++Y + I N ++ KI ++ + PGV ++ L F+L Y+
Sbjct: 559 FLVIYFTLKIRQTGNSSTGFLLGKIFISAILTTAFIKAPGVLEV----LAFLLKYSCAIS 614
Query: 327 PDLPRLHEWHFRSGLDRYIWIIGMIYA 353
+ + EW +R+G+D +I IGMI A
Sbjct: 615 WN---MKEWRYRAGMDMFIVYIGMITA 638
>gi|346970438|gb|EGY13890.1| CAS1 domain-containing protein [Verticillium dahliae VdLs.17]
Length = 911
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 174/346 (50%), Gaps = 54/346 (15%)
Query: 97 LLENRATLRAMA---EFGAILFYFYIC---DRTNLLGDSTKNYNRDLFLFLYLLLVIVSA 150
L+ R+T R A E G+ + +C DRT++LG K ++ F + + V A
Sbjct: 384 LIARRSTPRYAAFNMETGSFVMALLMCFFADRTHILGKGAKVWHYANFAIMCAPCLAV-A 442
Query: 151 MTSLKKHNDKSPFSGKTIQ----YLNRHQTEEWKGWMQVLFLMYHYFAA---TEIYNAIR 203
+ +++K G+ ++ +L+R QT+EWKGWMQ + L+YH+ AA T IY +R
Sbjct: 443 IVTIRKSKPPRAKPGQLVEADQPFLSRDQTDEWKGWMQFVILVYHWTAAARSTGIYIFVR 502
Query: 204 IFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMH 263
+ +AAY++ TG+G+ +++ ++KDFS R A +M RLN +N DYM YY P+
Sbjct: 503 LLVAAYLFQTGYGHTTFFLVKKDFSFKRMASVMLRLNLLSVSLAYFMNTDYMFYYFSPLV 562
Query: 264 TLFTIMVYGAVGIFNK-YNEIGSVMIVKI-LACFLVVILIWEIPGV----------FDIF 311
+ + ++VY + + +K YN+ +++ KI ++ +V +++ P F+I
Sbjct: 563 SFWFMVVYLTMAVGHKRYNDDIQLVLAKICISAVIVAVVVLVTPLTKWTFGFLRIFFNIK 622
Query: 312 WSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEP 371
WS L EW +R LD +I IGM+ A + K++E E
Sbjct: 623 WS------------------LEEWEYRVALDLFIVYIGMLSAIAY-------LKVKE-EL 656
Query: 372 KRKLSIKAGIVTVALFVGYLWYECIYKLDKV-TYNKYHPYTSWIPI 416
+ L +V + + GY Y C L + Y +HPY S++PI
Sbjct: 657 RLALRCSMALVGLVVMAGYA-YACGTTLASMGDYRLWHPYLSFVPI 701
>gi|358058627|dbj|GAA95590.1| hypothetical protein E5Q_02246 [Mixia osmundae IAM 14324]
Length = 1279
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/319 (28%), Positives = 156/319 (48%), Gaps = 24/319 (7%)
Query: 102 ATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKS 161
+ L ++ FG + + ++ DRT+L K ++ F L L + V T D
Sbjct: 766 SVLVPLSTFGFAIVFIFLGDRTSLFLKEAKQFSALQFAVLNLAALGVGLATMKPAEKD-- 823
Query: 162 PFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNF 218
+ +LNR QT+EWKGWMQ+ L+YHY A++ IYN IR +AAY++ +G+G+
Sbjct: 824 ------LGFLNRDQTDEWKGWMQIAILIYHYLGASKVSGIYNPIRTLVAAYLFQSGYGHT 877
Query: 219 SYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFN 278
++YY + D+ L R ++ RLN ++ DY+ YY P+ T+ ++ + + N
Sbjct: 878 TFYYKKGDYGLARVMSVVIRLNLLTIALTYAMDTDYLSYYFSPLVTMHFGFIWIIMFVGN 937
Query: 279 KYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFR 338
++N+ ++ KI+ + + +E+ + + + + A+ EW FR
Sbjct: 938 QWNKNAYFLLAKIIIGAALAAIYFELGKPLEYTFRAINAVFRTQWIAR-------EWSFR 990
Query: 339 SGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVAL-FVGYLWYECIY 397
LD YI GM A K+ E P + +AGI+ +L +G++ +E
Sbjct: 991 VTLDMYIVFWGMAAALAF---IKFQEHKIADRPDWPIFARAGIIVGSLGMLGFMIFELTQ 1047
Query: 398 KLDKVTYNKYHPYTSWIPI 416
+K YN YHP+ S IP+
Sbjct: 1048 --NKFDYNVYHPFISIIPV 1064
>gi|46114564|ref|XP_383300.1| hypothetical protein FG03124.1 [Gibberella zeae PH-1]
Length = 851
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 100/337 (29%), Positives = 154/337 (45%), Gaps = 50/337 (14%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNR-DLFLFLYLLLVIVSAMTSLKKH-------- 157
+A F L Y DRT +K Y+ D L L ++ A+ + K
Sbjct: 382 VATFVTGLLACYWADRTQSFAKGSKQYSMFDFNLMAALCFIVGFALMTKSKPPPPRPGAQ 441
Query: 158 -------------NDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNA 201
+D P L+R QT+EWKGWMQ L L+YH+ A+ IY
Sbjct: 442 AAAAAAPAAPATLDDAKP--------LSRDQTDEWKGWMQALILVYHWTGASRDLNIYVG 493
Query: 202 IRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICP 261
IR+ +AAY++ TGFG+ ++ +KDFS R A ++ RLN +N DYM YY P
Sbjct: 494 IRLLVAAYLFQTGFGHGVFFSSKKDFSFKRVAAVLLRLNLLSVALPFFMNTDYMFYYFAP 553
Query: 262 MHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSP-LTFILG 320
+ + + +++Y I KYN+ ++ KI + PG + W+P L ++
Sbjct: 554 LVSFWFLIIYSLFAICPKYNDNTWALMGKIAISAAIC------PGA--MLWTPVLQWVFD 605
Query: 321 YTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG 380
+ LHEW FR GLD I +G+I + + + L +S L+ AG
Sbjct: 606 ALNIVFRIEWDLHEWQFRLGLDGLIVYVGIIMGVASVRTKLYNKILTQS---YGLAGIAG 662
Query: 381 IVTVALFVGYLWYECIYKLD-KVTYNKYHPYTSWIPI 416
I+++ L+ W+ + K + K Y HP S+IPI
Sbjct: 663 ILSIPLY----WWVAVSKAEKKQDYTALHPIFSFIPI 695
>gi|395738731|ref|XP_002818293.2| PREDICTED: CAS1 domain-containing protein 1 [Pongo abelii]
Length = 935
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 167/340 (49%), Gaps = 43/340 (12%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+ DR NL K + L LF +N+ +
Sbjct: 506 LQSFCKLGLIMAYFYMGDRANLFMKENKFIHIHLSLFQLYTFWFGEYF-----YNENT-- 558
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 559 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 616
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK- 279
++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 617 FWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQI 676
Query: 280 ------------YNEIGSVMIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPA 325
+ + + ++ + CFL G F+ +S PL+
Sbjct: 677 IQKKANGNCFWHFGLLLKLGVLLLFICFLAY-----SQGAFEKIFSLWPLSKCFELKG-- 729
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIV 382
++EW FR LDRY+ GM++A+ + +K E + EP K+S +
Sbjct: 730 -----NVYEWWFRWRLDRYVVFHGMLFAFIYLALQKRQILSEGKGEPLFSNKISNFLLFI 784
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
+V F+ Y + K +K N+ HP S + I + I
Sbjct: 785 SVVSFLTYSIWASSCK-NKAECNELHPSVSVVQILAFILI 823
>gi|408389310|gb|EKJ68771.1| hypothetical protein FPSE_11039 [Fusarium pseudograminearum CS3096]
Length = 938
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 84/276 (30%), Positives = 130/276 (47%), Gaps = 32/276 (11%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTI- 168
F L Y Y DRT+L K + + F L + I +T ++K ++P I
Sbjct: 460 FAVALIYCYAADRTHLFSKGMKEFVSNEFYLLSGICAIFGGLT-IRKVQFRAPRPPPAIV 518
Query: 169 --------------------QYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIF 205
L R QTEEWKGWMQ L+YH+ A+ IY IR+
Sbjct: 519 ATEPESEPTPAPVTPVVQDAGILARDQTEEWKGWMQAAILVYHWTGASTSLPIYIFIRLL 578
Query: 206 IAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTL 265
+AAY++ TG+G+ Y+ +KDFS R A ++ RLN V+ DYM YY P+ +
Sbjct: 579 VAAYLFQTGYGHTIYFLSKKDFSFRRIASVLLRLNILSCALPYVMGTDYMFYYFAPLVSF 638
Query: 266 FTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPA 325
+ ++VY + + + +N+ V+ +KILA F+ L+ + + ++ L +
Sbjct: 639 WFLVVYATMAVCSSFNDSFKVVGIKILASFIFFTLVLNVTPLMKWLFAILELVFRI---- 694
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEK 361
K D L+EW FR LD I +GM+ H E+
Sbjct: 695 KWD---LNEWEFRVTLDGAIVFVGMLAGVVHQRVER 727
>gi|431908921|gb|ELK12512.1| CAS1 domain-containing protein 1, partial [Pteropus alecto]
Length = 612
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 163/321 (50%), Gaps = 36/321 (11%)
Query: 124 NLLGDSTKNYNRDLFLF-LYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGW 182
++L ST+N ++ +F + L+ + M SL + S +T + LNR QT+EWKGW
Sbjct: 194 SILNSSTRNSKSNVKMFSVSKLIAQETIMESLDGLHLPEK-SRETTKVLNREQTDEWKGW 252
Query: 183 MQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRL 239
MQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY++I+ DF + R Q+++RL
Sbjct: 253 MQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSYFWIKGDFGIHRVCQVLFRL 312
Query: 240 NFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK-------------YNEIGSV 286
NF V CIV++ Y YY P+ T++ +++Y + ++ + + + +
Sbjct: 313 NFLVVVLCIVMDRPYQFYYFVPLVTVWFMVIYVTLALWPQIIQKKANGNCFWHFGLLLKL 372
Query: 287 MIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDPAKPDLPRLHEWHFRSGLDRY 344
+ + CFL G F+ +S PL+ ++EW FR LDRY
Sbjct: 373 AFLLLCICFLAY-----SQGAFEKIFSLWPLSKCFELKG-------NVYEWWFRWRLDRY 420
Query: 345 IWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGIVTVALFVGYLWYECIYKLDK 401
+ GM++A+ + +K E + EP K+S ++V F+ Y + K +K
Sbjct: 421 VVFHGMLFAFIYLALQKRQVLSEGKGEPLFSNKISNVLLFISVVSFLTYSIWASSCK-NK 479
Query: 402 VTYNKYHPYTSWIPITYVLFI 422
N+ HP S + I + I
Sbjct: 480 AECNELHPSVSVVQILAFILI 500
>gi|390369855|ref|XP_782149.3| PREDICTED: CAS1 domain-containing protein 1-like
[Strongylocentrotus purpuratus]
Length = 387
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 138/281 (49%), Gaps = 39/281 (13%)
Query: 158 NDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTG 214
N K P LN QT+EWKGWMQ+L L+YHY A++ IY +R+ +A+Y++MT
Sbjct: 12 NTKKP------TILNLDQTKEWKGWMQLLILIYHYLGASKVVPIYMHLRLIVASYLFMTA 65
Query: 215 FGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAV 274
+G F + + F L R +M+R+N V F C+V++ Y LYY P+ + + +++Y +
Sbjct: 66 YGQFCASWDKNKFGLARVCNVMFRMNLMVFFLCLVMDRQYQLYYFVPLVSFWFLIIYATM 125
Query: 275 GIFNKYNEIGS--------VMIVKILACFLVVILIWEIPGVFDIF--WSPLTFILGYTDP 324
IF + + M+ K++ +++ + F++ W P+ + Y
Sbjct: 126 AIFPRVTRQKAEESSRCYLYMLAKLVILVVIITCLSFSQVTFNLMFEWWPMNQLFCYPGT 185
Query: 325 AKPDLPRLHEWHFRSGLDRYIWIIGMIYAY--YHPTAEKWMEKLEESEPKRKLSIKAGIV 382
+ ++EW FR LD+YI GM + + + KW L++S S+ I+
Sbjct: 186 S------IYEWWFRWNLDKYIIPYGMFFGFILLSSKSSKW---LDDSHSGDLFSLVKTIL 236
Query: 383 ------TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPIT 417
V L Y Y+C K T N H YTS+IP+T
Sbjct: 237 VYIIAAAVLLMYSYHLYQC---ESKPTCNAVHSYTSFIPVT 274
>gi|46127017|ref|XP_388062.1| hypothetical protein FG07886.1 [Gibberella zeae PH-1]
Length = 861
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 129/276 (46%), Gaps = 32/276 (11%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTI- 168
F L Y Y DRT+L K + + F L + I +T ++K ++P +
Sbjct: 392 FAVALIYCYAADRTHLFSKGMKEFVSNEFYLLSGICAIFGGLT-IRKVQFRAPRPPPAVV 450
Query: 169 --------------------QYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIF 205
L R QTEEWKGWMQ L+YH+ A+ IY IR+
Sbjct: 451 ATEPESEPTPAPVTPVVQDAGILARDQTEEWKGWMQAAILVYHWTGASTSLPIYIFIRLL 510
Query: 206 IAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTL 265
+AAY++ TG+G+ Y+ +KDFS R A ++ RLN V+ DYM YY P+ +
Sbjct: 511 VAAYLFQTGYGHTIYFLSKKDFSFRRIASVLLRLNILSCALPYVMGTDYMFYYFAPLVSF 570
Query: 266 FTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPA 325
+ ++VY + + + +N+ V+ KILA F+ L+ + + ++ L +
Sbjct: 571 WFLVVYVTMAVCSSFNDSFKVVGSKILASFIFFTLVLNVTPLMKWLFAILELVFRI---- 626
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEK 361
K D L+EW FR LD I +GM+ H E+
Sbjct: 627 KWD---LNEWEFRVTLDGAIVFVGMLAGVVHQRVER 659
>gi|126343465|ref|XP_001381322.1| PREDICTED: CAS1 domain-containing protein 1-like [Monodelphis
domestica]
Length = 444
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 126/221 (57%), Gaps = 27/221 (12%)
Query: 144 LLVIVSAMTSLKKHN--DKSPFSGKT-----IQYLNRHQTEEWKGWMQVLFLMYHYFAAT 196
++V+ +A S+K HN D++ +T + LNR QT+EWKGWMQ++ L+YH A+
Sbjct: 161 VVVVRAATWSIKAHNGSDEALAQYRTNITALTKVLNREQTDEWKGWMQLVILIYHMSGAS 220
Query: 197 E---IYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNND 253
+Y IR+ +AAY++ TG+G+FSY++I+ DF + R Q+++RLNF V CIV++
Sbjct: 221 SFLPVYMHIRVLVAAYLFQTGYGHFSYFWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRP 280
Query: 254 YMLYYICPMHTLFTIMVYGAVGIFNKYNEI---GSV-----MIVKILACFLVVILIWEIP 305
Y YY P+ T++ +++Y + ++ + + GS +++K+ + FL + +
Sbjct: 281 YQFYYFVPLVTVWFLVIYVTLALWPQITQKKANGSCVWHVGLLMKLASLFLCICFLAYSQ 340
Query: 306 GVFDIFWS--PLTFILGYTDPAKPDLPRLHEWHFRSGLDRY 344
G F+ +S PL+ ++EW FR LDRY
Sbjct: 341 GAFEKIFSLWPLSKCFELNG-------NIYEWWFRWKLDRY 374
>gi|395537653|ref|XP_003770809.1| PREDICTED: CAS1 domain-containing protein 1 [Sarcophilus harrisii]
Length = 635
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 148/296 (50%), Gaps = 34/296 (11%)
Query: 163 FSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFS 219
+ K + LNR QTEEWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FS
Sbjct: 331 LTQKLTKVLNREQTEEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFS 390
Query: 220 YYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK 279
Y++I+ DF + R Q+++RLNF V CIV++ Y YY P+ T++ +++Y + ++ +
Sbjct: 391 YFWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVPLVTMWFLVIYITLALWPQ 450
Query: 280 YNEI---GSV----------MIVKILACFLVVILIWEIPGVFDIFWS--PLTFILGYTDP 324
+ GS ++ + CFL G F+ +S PL+
Sbjct: 451 ITQKKANGSCFWHLGLLMKLGLLFLCICFLAYSQ-----GAFEKIFSLWPLSKCFELNG- 504
Query: 325 AKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEP--KRKLSIKAGI 381
++EW FR LDRY+ GM++A+ + +K E + EP ++S
Sbjct: 505 ------NVYEWWFRWKLDRYVVFHGMLFAFVYLALQKRQVLSEGKGEPLLPNRMSNLLLF 558
Query: 382 VTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLY 437
V+V F+ Y + K K N+ HP S + I + I V+ + S +
Sbjct: 559 VSVVSFLTYSIWASSCK-TKAECNELHPAVSVVQILAFVLIRNIPGYVRSVYSSFF 613
>gi|328716998|ref|XP_003246092.1| PREDICTED: CAS1 domain-containing protein 1-like [Acyrthosiphon
pisum]
Length = 792
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 164/327 (50%), Gaps = 32/327 (9%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
++A+ G I+ YFYICDRTN K Y+ D +L + V V + ++
Sbjct: 369 SLAKLGLIMIYFYICDRTNFFMKENKYYS-DASFWLPVGYVFVLGLFFTEE--------S 419
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAA---TEIYNAIRIFIAAYVWMTGFGNFSYYY 222
+ + L+R QT+EWKGWMQ++ L+Y+ A T I N ++I I+AY+++TG+G+F Y +
Sbjct: 420 RYTKVLHRDQTDEWKGWMQLIILIYNLTGASIKTSIANHVQILISAYLFLTGYGHFYYMW 479
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY----GAVGIFN 278
R D L R+ Q+++RLN C+ +N Y YY P+ + + +VY +
Sbjct: 480 HRSDAGLTRYFQILFRLNMLTVVLCVCMNRPYQFYYYIPLVSFWFTIVYLLLICPPRVTA 539
Query: 279 KYNEIGSV----MIVKILACFLVVILIWEIPGVFD-IFWS-PLTFILGYTDPAKPDLPRL 332
+EI +I+KILA F+ + +++ FD IF + P + TD +
Sbjct: 540 ASSEIRPAQYLYIILKILALFIFITILYMSEVFFDKIFLTRPWKALFVTTD------DDI 593
Query: 333 HEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLW 392
HEW +R L+R+ + G+I ++ A+++ + + L A + F+G L
Sbjct: 594 HEWWYRWKLNRFSVVNGVIVSFIIILAQRYNLIDDNNHSNLVLPRLAVFSSFVAFIG-LI 652
Query: 393 YECIYKL---DKVTYNKYHPYTSWIPI 416
+Y + +K + YTS IPI
Sbjct: 653 ASTVYNILCQNKTECYELLSYTSVIPI 679
>gi|241673512|ref|XP_002411502.1| conserved hypothetical protein [Ixodes scapularis]
gi|215504174|gb|EEC13668.1| conserved hypothetical protein [Ixodes scapularis]
Length = 815
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/328 (28%), Positives = 159/328 (48%), Gaps = 37/328 (11%)
Query: 109 EFGAILFYFYICDRTNLLGDSTKNYNR-DLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKT 167
+ G I+ YF++CDRTN K Y + FL + + + T +H
Sbjct: 392 QLGLIMGYFFLCDRTNFFMKENKYYTHLNFFLPVAYVFALGLFFTEETQHT--------- 442
Query: 168 IQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYYIR 224
Q L+R QT EWKGWMQ++ L+YH A++ IY +R+ + +Y+++TG+G+F+Y++
Sbjct: 443 -QVLHRDQTNEWKGWMQLIILIYHTTGASQVLPIYMHVRVLVTSYLFLTGYGHFTYFWHN 501
Query: 225 KDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIM-------VYGAVGIF 277
DF L R QM++R+N V C+ +N Y YY P+ + + ++ + V
Sbjct: 502 GDFGLYRLWQMLFRMNLLVVVLCLSMNRPYQFYYFVPLVSFWFVVVFVTMTSIPQVVAAS 561
Query: 278 NKYNEIGSVMIV-KILACFLVVILIWEIPGVFD-IFWS-PLTFILGYTDPAKPDLPRLHE 334
+ N + + IV K + F VV +++ F+ IF + P + TD + + E
Sbjct: 562 AEANPLQYLYIVLKFVGLFSVVTILYMSEVFFEKIFVTRPWKALFVTTDDS------IKE 615
Query: 335 WHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYE 394
W FR +DRY GM++A+ H +++ ++++ S + G A+ G L
Sbjct: 616 WWFRWKIDRYSVASGMLFAFAHYLLKQY-RVVDDNHHGNLFSRENGSSCTAVAHGVLLAF 674
Query: 395 CIYKL------DKVTYNKYHPYTSWIPI 416
Y K N+ H Y S++PI
Sbjct: 675 QFYTTFALVCRAKPDCNEIHAYVSFVPI 702
>gi|323450322|gb|EGB06204.1| hypothetical protein AURANDRAFT_65873 [Aureococcus anophagefferens]
Length = 1681
Score = 118 bits (296), Expect = 6e-24, Method: Composition-based stats.
Identities = 59/156 (37%), Positives = 86/156 (55%), Gaps = 11/156 (7%)
Query: 123 TNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK--HNDKSPFSGKTIQYLNRHQTEEWK 180
+ LLG+ N DL+ F LL +V + + ND + + NR Q EWK
Sbjct: 1198 SRLLGE-----NPDLWWFAMALLAVVCFAPPMVRSIQND----AADPALFFNRAQCNEWK 1248
Query: 181 GWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLN 240
GWMQV F+ YHY A +Y IR F++AYVW+TGFGN Y++ DFS RF +M+WR+N
Sbjct: 1249 GWMQVAFVAYHYANAQGVYVPIRWFVSAYVWLTGFGNARYFWKTSDFSGARFLKMVWRIN 1308
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
F A + ++ YY+ +HT+ + + A G+
Sbjct: 1309 CFAAPLSLATGTHWIAYYVVALHTVHFALCFAAFGL 1344
>gi|388581678|gb|EIM21985.1| Cas1p-domain-containing protein [Wallemia sebi CBS 633.66]
Length = 720
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 156/311 (50%), Gaps = 44/311 (14%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMT--SLKKHNDKSPFS 164
+A F A L + Y+ DR++ STK+++ + L LL ++ A+ S K+++D +
Sbjct: 312 VAIFSASLLFAYVADRSHFFAKSTKDFSSTMVL---LLFSVIGAIGWYSWKRNDDDT--- 365
Query: 165 GKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYY 221
KTI LNR QT+E KGWMQ++ L+YHY+ ++I Y IRI +A+Y+ + G+GN Y
Sbjct: 366 TKTIGVLNRPQTDEMKGWMQLMILIYHYYGGSKIPQFYIPIRILVASYLVLLGYGN-GRY 424
Query: 222 YIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYN 281
++ +L R+ M R N C +LN Y+ YY P+ +++ +++ I + N
Sbjct: 425 FLNNRPTLSRYICTMTRYNLLTLALCWILNGSYINYYFVPIISIWFTLIWLTFSINAEVN 484
Query: 282 EIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGL 341
++I+K+ ++ I + IP L+ I+ + E+ FR L
Sbjct: 485 SNLKLLIIKLAISLIIAISLLNIPQ--------LSAIIPF------------EYKFRLNL 524
Query: 342 DRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDK 401
D G++ + +++ K+ L A ++++ALFVG + K
Sbjct: 525 DILAPFFGVLLSIVANEQVEFVNKIT------TLISGATLLSLALFVG------VITPIK 572
Query: 402 VTYNKYHPYTS 412
+YN+YHP+ S
Sbjct: 573 FSYNQYHPFIS 583
>gi|46103822|ref|XP_380282.1| hypothetical protein FG00106.1 [Gibberella zeae PH-1]
Length = 1232
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 149/331 (45%), Gaps = 48/331 (14%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHN----DKSP 162
+ F L Y DRT ++ K + F L + + +T + + D S
Sbjct: 767 VGSFILALLMCYYADRTQMMAKGEKLWLPIDFAVLCAPCIAILLLTIRRSRSPISMDMSL 826
Query: 163 FSGKTIQ-YLNRHQTEEWKGWMQVLFLMYHYFAATE----IYNAIRIFIAAYVWMTGFGN 217
+ +T + +L+RHQTEEWKGWMQ + L+YH+ A + IY IR+ +AAY++ TG+G+
Sbjct: 827 LTKETNESFLSRHQTEEWKGWMQAVILIYHWTGAIKGSKSIYILIRLCVAAYLFQTGYGH 886
Query: 218 FSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIF 277
Y+ + DFS R A ++ RLN V++ DYM YY P+ + + ++VY + +
Sbjct: 887 TLYFVRKNDFSFRRVATVLLRLNVLSCSLAYVMDTDYMFYYFSPLVSFWFLVVYATMAVG 946
Query: 278 NK-YNEIGSVMIVKILACFLVVILIWE-----------IPGVFDIFWSPLTFILGYTDPA 325
K +N +++ KI L++ I+ + VF+I WS
Sbjct: 947 GKRFNSDPQIVLSKICISGLLISAIFMCTPFTQFVFGLLKTVFNIQWS------------ 994
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVA 385
W +R LD +I GM+ A H E + L ++ +
Sbjct: 995 ------YETWQYRVTLDMFIVYAGMLTAVVHN---------EMKQTSVHLGLRIILAFAG 1039
Query: 386 LFVGYLWYECIYKLDKVTYNKYHPYTSWIPI 416
LFV +++ L Y +HP S+IPI
Sbjct: 1040 LFVTMYYFKSTLHLRHSVYKTWHPLVSFIPI 1070
>gi|260807505|ref|XP_002598549.1| hypothetical protein BRAFLDRAFT_66940 [Branchiostoma floridae]
gi|229283822|gb|EEN54561.1| hypothetical protein BRAFLDRAFT_66940 [Branchiostoma floridae]
Length = 633
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 168/360 (46%), Gaps = 73/360 (20%)
Query: 101 RATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFL--FLYLLLVIVSAMTSLKKHN 158
R TL A+A+FG I+ YFY+CDRT L K+Y F F+++LLV L H+
Sbjct: 230 RTTLVALAKFGVIMAYFYLCDRTPLFMKENKHYTHLQFFVPFVWVLLV------GLFFHS 283
Query: 159 DKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNF 218
D T QYL IY IRI +A Y++ TG+G+F
Sbjct: 284 D-------TKQYL-------------------------PIYMHIRILVAMYLFQTGYGHF 311
Query: 219 SYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFN 278
Y++++ D+ + R Q+ +RLNF V F C+V++ Y YY P+ T + +VYG + +
Sbjct: 312 FYFWMKGDYGIVRLCQVNFRLNFLVVFLCMVMDRPYQFYYFVPLVTFWFFVVYGTMAVLP 371
Query: 279 KYNEIGS--------VMIVKILACFLVVILIWEIPGVFDIF--WSPLTFILGYTDPAKPD 328
+ S +M+ K+L ++ L+ + +F+ W P +
Sbjct: 372 RVTAKSSGDNSSGFVLMMFKLLVLCGIIALLASMQVLFESLFSWWPAVQLFQLEGS---- 427
Query: 329 LPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLS--IKAGIVT--V 384
+ EW FR LDRY GM +A+ + +K ++ +++S + + IVT V
Sbjct: 428 ---IREWWFRWQLDRYAVSYGMFFAFTYLGLKK-LQIIDDSFHGNLFTPCVTYLIVTLSV 483
Query: 385 ALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLYMMACRYS 444
+ +GY + KV N+ HPY S++PIT F LV+++ G L RYS
Sbjct: 484 CITLGYTIFMTTCS-SKVECNRLHPYISFLPITS-------FILVRNVPGYL---RSRYS 532
>gi|323448712|gb|EGB04607.1| hypothetical protein AURANDRAFT_67109 [Aureococcus anophagefferens]
Length = 1934
Score = 115 bits (288), Expect = 4e-23, Method: Composition-based stats.
Identities = 56/151 (37%), Positives = 86/151 (56%), Gaps = 7/151 (4%)
Query: 127 GDSTKNYNRDLFLFLYLLLVIVSA--MTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQ 184
G + + N DL+ F+ L + SA M L+ K + +++R Q EWKGWMQ
Sbjct: 1485 GTTVRTENVDLWTFVMASLALASAWQMRPLEVDGGK-----EAALFMSRAQANEWKGWMQ 1539
Query: 185 VLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVA 244
V F+ YHY A ++Y IR ++AYVW+TGFGN Y++ DFS RFAQ +WR+NF A
Sbjct: 1540 VAFVAYHYTNAQDVYVPIRWAVSAYVWLTGFGNGVYFWSSADFSPKRFAQQLWRMNFLCA 1599
Query: 245 FCCIVLNNDYMLYYICPMHTLFTIMVYGAVG 275
+ N ++ YY + T+ +++ ++G
Sbjct: 1600 LLAMATNTAWIDYYFVALATVHFGLIFLSLG 1630
>gi|427799029|gb|JAA64966.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 772
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 162/332 (48%), Gaps = 32/332 (9%)
Query: 101 RATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDK 160
++ L+ ++ G I+ YF++CDRTN K Y L FL + V + ++
Sbjct: 345 QSVLQQLSRLGLIMAYFFVCDRTNFFMRENKYYTH-LNFFLPVAYVFALGLFFTEE---- 399
Query: 161 SPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGN 217
+ Q L+R QT+EWKGWMQ++ L+YH A+++ Y +R ++AY+++ G+G+
Sbjct: 400 ----TQQTQVLHRDQTDEWKGWMQLVLLVYHMTGASQVLPVYVHVRALVSAYLFLAGYGH 455
Query: 218 FSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIF 277
FSY++ + DF L R A++++R N V C+ +N Y YY P+ + + ++V +G
Sbjct: 456 FSYFWHQADFGLLRLARVLFRTNLLVVLLCLCMNRPYQFYYFVPLVSFWFLVVAATLGSL 515
Query: 278 NKYNEIGS--------VMIVKILACFLVVILIWEIPGVFD-IFWS-PLTFILGYTDPAKP 327
+ + + +++K + F V+ +++ F+ IF + P + TD +
Sbjct: 516 PRISAASAEANPLHHLYVVLKFVGLFSVLTVLYMSEVFFEKIFVTRPWKALFVTTDDS-- 573
Query: 328 DLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK---RKLSIKAGIVTV 384
+ EW FR +DRY GM++ + H ++ + R LS+ +
Sbjct: 574 ----IKEWWFRWKIDRYSVASGMLFGFAHYLLRQYRVIGDNHHGNLFSRGLSLTVAFGAL 629
Query: 385 ALFVGYLWYECIYKLDKVTYNKYHPYTSWIPI 416
L +G+ K N+ H Y S++PI
Sbjct: 630 -LGLGFYTTFAFVCRSKPDCNEVHAYVSFVPI 660
>gi|408390360|gb|EKJ69762.1| hypothetical protein FPSE_10078 [Fusarium pseudograminearum CS3096]
Length = 882
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 149/324 (45%), Gaps = 48/324 (14%)
Query: 114 LFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKS---PFSGKTIQ- 169
L Y DRT ++ K + F LY V + ++ + + S P S K +
Sbjct: 423 LLMCYYADRTQMMAKGQKLWQPLDFALLYAPCVAILLLSIRRSGSPISMDMPLSVKELDE 482
Query: 170 -YLNRHQTEEWKGWMQVLFLMYHYFAATE----IYNAIRIFIAAYVWMTGFGNFSYYYIR 224
+L+RHQT+EWKGWMQ L L+ ++ AT IY +R+ +AAY++ TG+G+ Y+ +
Sbjct: 483 AFLSRHQTDEWKGWMQALILICNWTGATRESKSIYILVRLCVAAYLFQTGYGHTLYFLRK 542
Query: 225 KDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFN-KYNEI 283
DFS R ++ RLN +++ DYM YY P+ + + ++VY + + + N
Sbjct: 543 NDFSFRRVGAVLLRLNLLSCSLAYIMDTDYMFYYFSPLVSFWFLVVYATMSVGGERCNSD 602
Query: 284 GSVMIVKILACFLVVILIWE-----------IPGVFDIFWSPLTFILGYTDPAKPDLPRL 332
+++ KI LVV ++ + +F+I WS
Sbjct: 603 PQLVVSKIFLSGLVVSAVFMGTPFTEFVFGLLKAIFNIQWS------------------H 644
Query: 333 HEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLW 392
EW +R LD I +GM+ A A ME+ R + + AG++ Y +
Sbjct: 645 KEWQYRVTLDILIVYVGMLTA----VATNKMERSAVRLGFRIVLVVAGVLATT----YYF 696
Query: 393 YECIYKLDKVTYNKYHPYTSWIPI 416
Y ++ L K Y +HP S+IPI
Sbjct: 697 YTTLH-LRKKMYKIWHPLVSFIPI 719
>gi|397573298|gb|EJK48633.1| hypothetical protein THAOC_32549, partial [Thalassiosira oceanica]
Length = 619
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 138/295 (46%), Gaps = 38/295 (12%)
Query: 131 KNYNRDLFLF-LYLLLVIVSAMTSLKKHNDKSPFSGKT--IQYLNRHQTEEWKGWMQVLF 187
+N N+ +F L ++V +++ + G T I L R QTEEWKGWMQ F
Sbjct: 199 RNENKAPSMFWLVNAFLLVGTISTWTWKASSAGRGGGTPRIVCLGREQTEEWKGWMQWAF 258
Query: 188 LMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCC 247
+ YHY+ +YN IR+F++AYVWMTGFGNF Y+ + DFS+ RF M+ R+N+F
Sbjct: 259 IFYHYYRVHYVYNEIRVFVSAYVWMTGFGNFLYFDRKGDFSIERFVSMIIRINYFPLMLS 318
Query: 248 IVLNNDYMLYYICPMHT--LFTIMVYGAVGIFNKYNEIG----SVMIVKILACFLVVILI 301
L LYY+ P+HT M+ +G+ +G I +L FLV +
Sbjct: 319 YFLTVPLELYYVVPLHTTGFVVTMITCFIGL-QIERTLGMSYWKSRIAAVLLSFLVHVCF 377
Query: 302 WEIPGVFDIFWSPLTFILGYTDPAKPDLPRL--HEWHFRSGLDRYIWIIGMIYA-YYHPT 358
+E V +L +L E++FR D+Y +GM+ +
Sbjct: 378 YETSAV--------------------ELLKLFSEEYYFRFQADKYSAWLGMVGGLLWGKV 417
Query: 359 AE--KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKL-DKVTYNKYHPY 410
E W E + ++ SI IV VAL WY + DK TYN HPY
Sbjct: 418 GEYMNWAHGFENEQRRQMASIVQCIVGVALIA--FWYIFFGSISDKYTYNPIHPY 470
>gi|397617812|gb|EJK64620.1| hypothetical protein THAOC_14629 [Thalassiosira oceanica]
Length = 413
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 122/251 (48%), Gaps = 33/251 (13%)
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLP 230
L R QTEEWKGWMQ F+ YHY+ +YN IR+F++AYVWMTGFGNF Y+ + DFS+
Sbjct: 36 LGREQTEEWKGWMQWAFIFYHYYRVHYVYNEIRVFVSAYVWMTGFGNFLYFDRKGDFSIE 95
Query: 231 RFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHT--LFTIMVYGAVGI-FNKYNEIG--S 285
RF M+ R+N+F L LYY+ P+HT M+ +G+ F + +
Sbjct: 96 RFVSMIIRINYFPLMLSYFLTVPLELYYVVPLHTTGFVVTMITCFIGLQFERTLGMSYWK 155
Query: 286 VMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRL--HEWHFRSGLDR 343
I +L FLV + +E V +L +L E++FR D+
Sbjct: 156 SRIAAVLLSFLVHVCFYETSAV--------------------ELLKLFSEEYYFRFQADK 195
Query: 344 YIWIIGMIYA-YYHPTAE--KWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKL- 399
Y +GM+ + E W E + ++ SI IV VAL WY +
Sbjct: 196 YSAWLGMVGGLLWGKVGEYMNWAHGFENEQRRQMASIVQCIVGVALIA--FWYIFFGSIS 253
Query: 400 DKVTYNKYHPY 410
DK TYN HPY
Sbjct: 254 DKYTYNPIHPY 264
>gi|170034416|ref|XP_001845070.1| CAS1 domain containing 1 [Culex quinquefasciatus]
gi|167875703|gb|EDS39086.1| CAS1 domain containing 1 [Culex quinquefasciatus]
Length = 799
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 175/364 (48%), Gaps = 46/364 (12%)
Query: 99 ENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHN 158
+++ + A+A I+ YFY+CDRTN K Y+ F + ++ +
Sbjct: 369 SSQSPVMALASLAVIMVYFYLCDRTNFFMKENKYYSEFSFWIPVGYVFVLGLFFT----- 423
Query: 159 DKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGF 215
+ S F+ + L+R QT+E KGWMQ++ L+Y+ A+ IY I++ I+ Y++++G+
Sbjct: 424 EDSKFT----KVLHRDQTDELKGWMQLVILIYYMTGASHVLPIYMQIKVLISGYLFLSGY 479
Query: 216 GNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVG 275
G+F+Y + + L RF +M+++NFF C+ +N Y Y+ P+ + + M+Y +
Sbjct: 480 GHFTYCWQTGNLGLERFLSVMFKINFFTVVLCLCMNRPYQFYFFVPLLSFWFCMIYFLLS 539
Query: 276 I-------FNKYNEIGSVMIVKILACFLVVILIWEIPGVF--DIFWS-PLTFILGYTDPA 325
I ++ N + +V C L VI I + VF IF + P + TD
Sbjct: 540 IPPRITAQSSENNAYQYLYVVLKFVCILSVITILYMSEVFFERIFVTRPWKALFVTTD-- 597
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK---RKLSIKAGIV 382
+ W +R LDRY GMI+A A+K+ + + R++S+ +
Sbjct: 598 ----DDVTVWWYRWKLDRYTITFGMIFAAIFHIAQKYYIFDDNNHGNLFSRRISLSS--- 650
Query: 383 TVALFVGYLWYE--CIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLYMMA 440
T+A +G +Y + +K + H Y +IPI + L++++SG ++
Sbjct: 651 TLAAIIGIGFYTTWSFFCRNKQDCEEIHSYVVFIPIVG-------YILLRNISG---ILR 700
Query: 441 CRYS 444
RYS
Sbjct: 701 TRYS 704
>gi|157106627|ref|XP_001649411.1| hypothetical protein AaeL_AAEL004545 [Aedes aegypti]
gi|108879830|gb|EAT44055.1| AAEL004545-PA, partial [Aedes aegypti]
Length = 799
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 173/362 (47%), Gaps = 42/362 (11%)
Query: 99 ENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHN 158
+++ + A+A I+ YFY+CDRTN K Y+ F + ++ +
Sbjct: 369 NSQSPIIALASLAIIMAYFYLCDRTNFFMKENKYYSEFSFWIPVGYVFVLGLFFT----- 423
Query: 159 DKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGF 215
+ S F+ + L+R QT+E KGWMQ++ L+Y+ A+ IY I++ I+ Y++++G+
Sbjct: 424 EDSKFT----KVLHRDQTDELKGWMQLVILIYYMTGASHVLPIYMHIKVLISGYLFLSGY 479
Query: 216 GNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVG 275
+F+Y + + + RF +M+++NF C+ +N Y Y+ P+ + + M+Y +
Sbjct: 480 SHFTYCWQTGNSGIVRFLHVMFKINFLTVILCLCMNRPYQFYFFVPLLSFWYCMIYFLLS 539
Query: 276 IFNKYNEIGS-------VMIVKILACFLVVILIWEIPGVF--DIFWS-PLTFILGYTDPA 325
I K S + +V C L VI + + VF IF + P + TD
Sbjct: 540 IPPKITAQSSENNAYQYLYVVLKFVCMLSVITVLYMSEVFFERIFVTRPWKALFVTTD-- 597
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWM---EKLEESEPKRKLSIKAGIV 382
+HEW +R LDRY GMI+A A+++ + + R++S+ + +
Sbjct: 598 ----DDIHEWWYRWKLDRYTITYGMIFAAMFHIAQRYYFFDDNNHGNLFSRRISLTSTLA 653
Query: 383 TVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLYMMACR 442
+A +G+ + +K + H Y +IPI + L++++SG M+ R
Sbjct: 654 AIA-GIGFYTTWTFFCRNKQDCEEIHSYVVFIPIV-------GYILLRNISG---MLRTR 702
Query: 443 YS 444
YS
Sbjct: 703 YS 704
>gi|350417862|ref|XP_003491616.1| PREDICTED: CAS1 domain-containing protein 1-like [Bombus impatiens]
Length = 794
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 124/250 (49%), Gaps = 42/250 (16%)
Query: 25 WIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKT 84
W+Y +F +Y+ S++ V + +E E +E + + +T
Sbjct: 330 WVYRKFCQYRTEISYSHVAT------VEVENTEEANNS--------------------ET 363
Query: 85 NLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLL 144
I + D + L + ++A IL+YFY+CDRTN K Y+ F F L
Sbjct: 364 TQIEQPKVQDFYTL-----MTSLALLSIILYYFYLCDRTNFFMKENKYYSE--FSFWLPL 416
Query: 145 LVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNA 201
I++ + ++ P + LNR QT+EW+G MQ + L+YH A IY
Sbjct: 417 GYILALGLFFTEDRERGP------RTLNREQTDEWRGLMQAVVLIYHVTGAKNVLPIYMY 470
Query: 202 IRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICP 261
+R+ +AY++++G+G+F Y++ D SL RFAQ+M+RLNF C+ +N Y Y+ P
Sbjct: 471 LRLINSAYLFLSGYGHFCYFWQTGDVSLVRFAQVMFRLNFLTVSLCLCMNRPYQFYHFVP 530
Query: 262 MHTLFTIMVY 271
+ + + +++Y
Sbjct: 531 LVSFWFLVIY 540
>gi|340715722|ref|XP_003396358.1| PREDICTED: CAS1 domain-containing protein 1-like [Bombus
terrestris]
Length = 794
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 124/250 (49%), Gaps = 42/250 (16%)
Query: 25 WIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDDRAVLLEGGLSRSASARLLSSSIKT 84
W+Y +F +Y+ S++ V + +E E +E + + +T
Sbjct: 330 WVYRKFCQYRTEISYSHVTT------VEVENTEEANNS--------------------ET 363
Query: 85 NLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLL 144
I + D + L + ++A IL+YFY+CDRTN K Y+ F F L
Sbjct: 364 TQIEQPKVQDFYTL-----MTSLALLSIILYYFYLCDRTNFFMKENKYYSE--FSFWLPL 416
Query: 145 LVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNA 201
I++ + ++ P + LNR QT+EW+G MQ + L+YH A IY
Sbjct: 417 GYILALGLFFTEDRERGP------RTLNREQTDEWRGLMQAVVLIYHVTGAKNVLPIYMY 470
Query: 202 IRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICP 261
+R+ +AY++++G+G+F Y++ D SL RFAQ+M+RLNF C+ +N Y Y+ P
Sbjct: 471 LRLINSAYLFLSGYGHFCYFWQTGDVSLVRFAQVMFRLNFLTVSLCLCMNRPYQFYHFVP 530
Query: 262 MHTLFTIMVY 271
+ + + +++Y
Sbjct: 531 LVSFWFLVIY 540
>gi|453082782|gb|EMF10829.1| Cas1p-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 839
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 163/364 (44%), Gaps = 29/364 (7%)
Query: 100 NRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDL------FLFLYLLLVIVSAMTS 153
+R LRA F + +I DRT++ T+ DL F+ L +
Sbjct: 335 SRPVLRAFCFFTVAVSLQWIADRTHIFDQGTR---LDLVKSNLYFMIAAAFLFGFITIRR 391
Query: 154 LKKHNDKSPFSGKT-IQYLNRHQTEEWKGWMQVLFLMYHYFAA---TEIYNAIRIFIAAY 209
K SP K+ + L R QT EWKGWMQ L ++YHY A E + IR+ +++Y
Sbjct: 392 CKPLRTSSPGEIKSSLPCLPRDQTNEWKGWMQALIVIYHYNKAWKYDEFWQVIRLSVSSY 451
Query: 210 VWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIM 269
+++TGFG+ Y+ +KD+SL RF +M R N V+ ++LYY P+ T ++
Sbjct: 452 LFLTGFGHTLYFLQKKDYSLQRFTNVMIRTNLLPVTLSYVMRTRWLLYYYMPLSTFNFVV 511
Query: 270 VYGAVGIFNKYNEIGSVMIVKILA-CFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPD 328
VY + + +YN + ++ KI A FLV I + D+ P TF+ + +
Sbjct: 512 VYLTLALGRRYNAYTAFLLGKIAASAFLVHIFL----TTRDL---PTTFVRLFRITCNLN 564
Query: 329 LPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE-ESEPKRK-------LSIKAG 380
+ R D+YI +GM+ A + +E + + P R L A
Sbjct: 565 FDSGEFFGHRVEQDQYIVFVGMLAAMVYIWIRAILESRDRQDRPSRSFRAAWPVLKWSAI 624
Query: 381 IVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLYMMA 440
++ A G+ ++ + + + K PY + IPI + + ++++ + +
Sbjct: 625 VLAAATHAGFWYWIRTHIRTQAQFRKIQPYATGIPILTFIVLRNAHPILRNWHSAAFAWL 684
Query: 441 CRYS 444
+YS
Sbjct: 685 GKYS 688
>gi|342871827|gb|EGU74275.1| hypothetical protein FOXB_15203 [Fusarium oxysporum Fo5176]
Length = 1370
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 144/323 (44%), Gaps = 50/323 (15%)
Query: 114 LFYFYICDRTNLLGDSTKNYN-RDLFLF----LYLLLVIVSAMTSLKKHNDKSPFSGKTI 168
L Y DRT ++ +K + +DL + ++L + + S +
Sbjct: 406 LLMCYYADRTQMMAKGSKLWQLKDLVALCIPCIAIMLATIRRIKSPVPEDLSVDIQESNQ 465
Query: 169 QYLNRHQTEEWKGWMQVLFLMYHYFAAT--EIYNAIRIFIAAYVWMTGFGNFSYYYIRKD 226
+L+R QT+EWKGWMQ L+ ++ A IY IR+ +AAY++ TG+G+ Y+ + D
Sbjct: 466 LFLSRDQTDEWKGWMQFFILICYWTGAQGGSIYVFIRVCVAAYLFQTGYGHTLYFLNKND 525
Query: 227 FSLPRFAQMMWRLNFFVAFCCIV--LNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIG 284
FS R A + RLN CC+ ++ DYM YY + + + ++VY + I +YN
Sbjct: 526 FSFNRVAATLLRLNILS--CCLAYFMDTDYMFYYFPTLMSFWFLVVYATMAIRPRYNSDL 583
Query: 285 SVMIVKI-LACFLVVILIWEIP----------GVFDIFWSPLTFILGYTDPAKPDLPRLH 333
VM+ KI ++C +V +++ P VF I WS
Sbjct: 584 QVMLAKICMSCLIVSMILMGTPLTRWVFGILNTVFKIQWS------------------YK 625
Query: 334 EWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWY 393
+W+ R LD I +GM+ A A + ++ L ++ + +F ++
Sbjct: 626 QWYRRVTLDMLIVYVGMLTA----VANRHLKM------PIHLRLRVTLALAGVFATIHYF 675
Query: 394 ECIYKLDKVTYNKYHPYTSWIPI 416
L Y K+HPY S +P+
Sbjct: 676 YATSGLRMAAYAKWHPYVSLVPV 698
>gi|242009500|ref|XP_002425523.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212509384|gb|EEB12785.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 784
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 97/168 (57%), Gaps = 12/168 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A+A+ I+ YFYICDRTN TK +++ F + ++ V + N K
Sbjct: 361 ALAKLALIMTYFYICDRTNFFMKETKTFSQSAFWIPIVYILCVGLFFTEDSGNSK----- 415
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
L+R+QT+EWKGWMQ++ L+YH+ A + I+ R+ I+AYV+++G+G+F YY+
Sbjct: 416 ----VLHRNQTDEWKGWMQLVLLIYHWTGAQKVLPIFLLSRVIISAYVFLSGYGHFFYYW 471
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMV 270
D S+ RF ++++RLNF C+ +N Y Y P+ + + +M+
Sbjct: 472 HSGDGSIVRFFRVLFRLNFMQFVVCLCMNRPYQFYEFLPLVSFWFVMM 519
>gi|47201872|emb|CAF89002.1| unnamed protein product [Tetraodon nigroviridis]
Length = 253
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 105/191 (54%), Gaps = 23/191 (12%)
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAAT---EIYNAIRIFIAAYVWMTGFGNFSYYYIRKDF 227
LNR QT+EWKGWMQ++ L+YH A+ +Y +R+ +AAY++ TG+G+FS+++++ DF
Sbjct: 1 LNREQTDEWKGWMQLVILIYHVSGASVFIPVYMHVRVLVAAYLFQTGYGHFSFFWLKGDF 60
Query: 228 SLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIF-----NKYNE 282
SL R Q+++RLNF V C+V++ Y YY P+ T + ++YG + ++ K N
Sbjct: 61 SLYRVCQVLFRLNFLVLVLCVVMDRPYQFYYFVPLVTFWFFIIYGTLAMWPQILQKKANS 120
Query: 283 IG--------SVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHE 334
G ++ + + CF + E G F+ +S F + + E
Sbjct: 121 SGMWYMGVLVKLLGLLLFICFFAFSQVGE--GFFESIFSAWPFSKLFELNGS-----VRE 173
Query: 335 WHFRSGLDRYI 345
W FR LDR++
Sbjct: 174 WWFRWKLDRFV 184
>gi|66549058|ref|XP_395026.2| PREDICTED: CAS1 domain-containing protein 1-like [Apis mellifera]
Length = 791
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 96/171 (56%), Gaps = 11/171 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
+ ++A IL YFY+CDRTN K Y+ F F L I++ + ++ P
Sbjct: 375 MTSLALLSIILSYFYLCDRTNFFMKENKYYSE--FSFWLPLGYILALGLFFTEDRERGP- 431
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ LNR QT+EWKG MQ + L+YH A IY +R+ +AY++++G+G+F Y
Sbjct: 432 -----RTLNREQTDEWKGLMQAVVLIYHVTGAKNVLPIYMYLRLINSAYLFLSGYGHFCY 486
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
++ D SL RFA++M+RLNF C+ +N Y Y+ P+ + + +++Y
Sbjct: 487 FWQTGDVSLIRFARVMFRLNFLTVSLCLCMNRPYQFYHFVPLVSFWFLVIY 537
>gi|380019345|ref|XP_003693570.1| PREDICTED: CAS1 domain-containing protein 1-like [Apis florea]
Length = 791
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 96/171 (56%), Gaps = 11/171 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
+ ++A IL YFY+CDRTN K Y+ F F L I++ + ++ P
Sbjct: 375 MTSLALLSIILSYFYLCDRTNFFMKENKYYSE--FSFWLPLGYILALGLFFTEDRERGP- 431
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ LNR QT+EWKG MQ + L+YH A IY +R+ +AY++++G+G+F Y
Sbjct: 432 -----RTLNREQTDEWKGLMQAVVLIYHVTGAKNVLPIYMYLRLINSAYLFLSGYGHFCY 486
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
++ D SL RFA++M+RLNF C+ +N Y Y+ P+ + + +++Y
Sbjct: 487 FWQTGDVSLIRFARVMFRLNFLTVSLCLCMNRPYQFYHFVPLVSFWFLVIY 537
>gi|408395154|gb|EKJ74340.1| hypothetical protein FPSE_05486 [Fusarium pseudograminearum CS3096]
Length = 700
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 84/326 (25%), Positives = 141/326 (43%), Gaps = 55/326 (16%)
Query: 114 LFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMT----------SLKKHNDKSPF 163
L Y DRT ++ +K + F+ L L + + T S + N PF
Sbjct: 392 LLMCYYADRTQMMAKGSKLWQLGDFVALCLPCIAICLSTIRRSDPPWNLSFTQSNTDQPF 451
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE--IYNAIRIFIAAYVWMTGFGNFSYY 221
L+ Q +EWKGWMQV L+YH+ A I+ +R+ + AYV+ TG+ + +
Sbjct: 452 -------LSPDQIDEWKGWMQVFILIYHWAGAQGGLIHVLVRLCMGAYVFQTGYVHTLDF 504
Query: 222 YIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYN 281
KDFS A + RLN ++ +YM Y+ P+ + + ++VY + I + +N
Sbjct: 505 MNEKDFSFNHAASTLLRLNILPCLLAYFMDTEYMAYHFSPLLSFWFLVVYATMAIDSGHN 564
Query: 282 EIGSVMIVKI-LACFLVVILIWEIP----------GVFDIFWSPLTFILGYTDPAKPDLP 330
++VKI ++C ++ I+ P G+F I WS
Sbjct: 565 NELQFLLVKICVSCMIISIVFLATPFTSWTFYIFQGIFKIQWS----------------- 607
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGY 390
EW LD +I +GM+ A ++++ E +L ++ +V LF
Sbjct: 608 -AEEWQRSVTLDLFIAYVGMLAAVIG-------REMKKGEVSVRLGLRVCLVFGGLFSIL 659
Query: 391 LWYECIYKLDKVTYNKYHPYTSWIPI 416
+ + + +Y K+HPY S IPI
Sbjct: 660 HYLSFTSNITESSYMKWHPYVSVIPI 685
>gi|383853160|ref|XP_003702091.1| PREDICTED: CAS1 domain-containing protein 1-like [Megachile
rotundata]
Length = 794
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 96/171 (56%), Gaps = 11/171 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
+ ++A IL YFY+CDRTN K Y+ F F L I++ + ++ P
Sbjct: 378 MTSLALLSIILAYFYLCDRTNFFMKENKYYSE--FSFWLPLGYILALGLFFTEDRERGP- 434
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ LNR QT+EW+G MQ + L+YH A IY +R+ +AY++++G+G+F Y
Sbjct: 435 -----RALNREQTDEWRGLMQAVVLIYHVTGAKNVLPIYMYLRLINSAYLFLSGYGHFCY 489
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
++ D SL RFA++M+RLNF C+ +N Y Y+ P+ + + +++Y
Sbjct: 490 FWQTGDVSLVRFARVMFRLNFLTVSLCLCMNRPYQFYHFVPLVSFWFLVIY 540
>gi|307212835|gb|EFN88471.1| CAS1 domain-containing protein 1 [Harpegnathos saltator]
Length = 794
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 95/169 (56%), Gaps = 11/169 (6%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
++A F IL YFY+CDRTN K Y+ F F L I++ + ++ P +
Sbjct: 380 SLALFSIILAYFYLCDRTNFFMKENKYYSE--FSFWLPLGYILALGLFFTEDRERGPSA- 436
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
LNR QT+EW+G MQ + L+YH A IY +R+ +AY++++G+G+F Y++
Sbjct: 437 -----LNREQTDEWRGLMQAVVLIYHVTGAKNVLPIYMYLRLINSAYLFLSGYGHFCYFW 491
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
D SL RFA++M+RLN C+ +N Y Y+ P+ + + +++Y
Sbjct: 492 QTGDVSLVRFARVMFRLNLLTVSLCLCMNRPYQFYHFVPLVSFWFLVIY 540
>gi|452842205|gb|EME44141.1| hypothetical protein DOTSEDRAFT_71832 [Dothistroma septosporum
NZE10]
Length = 921
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 160/351 (45%), Gaps = 36/351 (10%)
Query: 87 IRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLV 146
+RF + D+ R+ +RA+ +I+ Y+ DRT++ + + L + +
Sbjct: 415 LRFSYLSDS----QRSVVRAICAVASIVALQYVADRTHVFEQVQRLPLQLSNLLSMIFIT 470
Query: 147 IVSAMTSLKKHNDK---SPFSGKTIQ-YLNRHQTEEWKGWMQVLFLMYHY----FAATEI 198
V + +L++ P K Q YL R QT+EWKGWMQ L ++YHY A
Sbjct: 471 TVVGLATLRRCRPARAIQPGENKPHQPYLPRDQTDEWKGWMQALIIIYHYNMAWRGADWF 530
Query: 199 YNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYY 258
+ IR+ +A+Y+++TGFG+ Y+ +KDFS RF +M R N V+ ++LYY
Sbjct: 531 WEIIRLTVASYLFLTGFGHTVYFLQKKDFSAKRFINVMVRTNLLPVTLAYVMRTRWVLYY 590
Query: 259 ICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFI 318
+ T + ++Y + + +N +++VK+ F+ + D+ P T +
Sbjct: 591 YMALSTFWYCVLYVTMAVKKDWNASTPLLLVKM---FVSAFAVHTFLNTKDL---PETVV 644
Query: 319 LGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIK 378
+ + +H R D+YI +GM+ A + W++ + S+ ++ S +
Sbjct: 645 KMFKITCRMSFDAGDFFHHRVDQDQYIVYVGMLVAMLY----VWVKDVLSSDERQNRSSR 700
Query: 379 A-------------GIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPI 416
A + T+A FV Y ++ +++K PY + PI
Sbjct: 701 AFRKAFPILKYFIIALATIA-FVYYFYWTNTTLDSTTSFSKLQPYLTITPI 750
>gi|307179514|gb|EFN67828.1| CAS1 domain-containing protein 1 [Camponotus floridanus]
Length = 813
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 96/171 (56%), Gaps = 11/171 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
+ ++A F IL YFY+CDRTN K Y+ F F L I++ + ++ P
Sbjct: 397 ITSLALFSIILAYFYLCDRTNFFMKENKYYSE--FSFWLPLGYILALGLFFTEDRERGP- 453
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ LNR QT+EW+G MQ + L+YH A IY +R+ +AY++++G+G+F Y
Sbjct: 454 -----RVLNREQTDEWRGLMQAVVLIYHVTGAKNVLPIYMYLRLINSAYLFLSGYGHFYY 508
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
++ D SL RFA++M+RLN C+ +N Y Y+ P+ + + +++Y
Sbjct: 509 FWQTGDVSLVRFARVMFRLNLLTVSLCLCMNRPYQFYHFVPLVSFWFLVIY 559
>gi|332022478|gb|EGI62785.1| CAS1 domain-containing protein 1 [Acromyrmex echinatior]
Length = 814
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 96/171 (56%), Gaps = 11/171 (6%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
+ ++A F IL YFY+CDRTN K Y+ F F L I++ + ++ P
Sbjct: 398 ITSLALFSIILAYFYLCDRTNFFMKENKYYSE--FSFWLPLGYILALGLFFTEDRERGP- 454
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ LNR QT+EW+G MQ + L+YH A IY +R+ +AY++++G+G+F Y
Sbjct: 455 -----RVLNREQTDEWRGLMQAVVLIYHVTGAKNVLPIYMYLRLINSAYLFLSGYGHFYY 509
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
++ D SL RFA++M+RLN C+ +N Y Y+ P+ + + +++Y
Sbjct: 510 FWQTGDVSLVRFARVMFRLNLLTVSLCLCMNRPYQFYHFVPLVSFWFLVIY 560
>gi|413936589|gb|AFW71140.1| hypothetical protein ZEAMMB73_668761 [Zea mays]
Length = 266
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 101/153 (66%), Gaps = 4/153 (2%)
Query: 51 LEKETIKEDDRAVLLEGGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEF 110
++ ++K +D+ +LLE G ++ +A+ +S+ + ++R + MD LLENR TLRA++EF
Sbjct: 1 MDDSSVKAEDQTMLLEEG-GQAMAAKPAYTSLTSQILRLIFMD-QLLLENRLTLRAISEF 58
Query: 111 GAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLK--KHNDKSPFSGKTI 168
G L YFYICDRTNLLG+S KNY+RDLFLFLY LL+IV+AMTS K K D+ + I
Sbjct: 59 GGYLLYFYICDRTNLLGESAKNYSRDLFLFLYFLLIIVAAMTSFKDIKQRDEFICVQREI 118
Query: 169 QYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNA 201
+ +H T+ K ++V+ + T++ NA
Sbjct: 119 RRQPQHWTDHAKKQLRVMQQLDEDHTLTQLTNA 151
>gi|326426882|gb|EGD72452.1| hypothetical protein PTSG_11589 [Salpingoeca sp. ATCC 50818]
Length = 695
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 89/163 (54%), Gaps = 18/163 (11%)
Query: 114 LFYFYICDRTNLLGDSTKNYNRDLF---LFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQY 170
+ + +I DRT + + K + F L L +VS TS K
Sbjct: 487 MIFIFIADRTGVFMVAAKEFQWSHFSLGCLAILALGLVSVRTSAKA------------GA 534
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYYIRKDF 227
LNR QT+EWKGWMQ+ FL+YHY A++ IY IR+ +AAY++MTGFG++ Y+ +K+F
Sbjct: 535 LNRDQTDEWKGWMQLCFLLYHYTGASQVLSIYVFIRVCVAAYIFMTGFGHYIYFTQKKEF 594
Query: 228 SLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMV 270
+ R ++ R+N F F + +N Y YY P+ T + +MV
Sbjct: 595 TWHRITMVVLRVNIFTLFLSMTMNRTYQFYYFVPLSTFWFLMV 637
>gi|452984897|gb|EME84654.1| hypothetical protein MYCFIDRAFT_173604 [Pseudocercospora fijiensis
CIRAD86]
Length = 873
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 169/393 (43%), Gaps = 63/393 (16%)
Query: 101 RATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNR-DLFLFLYLLLVIVSAMTSLKKHND 159
R L A A F + +I DR+++ G K+ R + L+L+ +I T +
Sbjct: 299 RPILYAFASFSIAILLVFIADRSHVFGHVRKDAFRYENGGMLFLISLIAGIATIRRGSTG 358
Query: 160 KSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHY-----------------FAATEIYNAI 202
KS + +L R Q+ EWKGWMQ++ + YHY AA +++ +
Sbjct: 359 KSQCANN--NFLPREQSSEWKGWMQLVIIFYHYGQHDASLSAFRGFDSMQVAALPVWHFL 416
Query: 203 RIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPM 262
R+ +++Y+++TGFG+ ++ +KD+SL RF ++ RLN + ++ YY P+
Sbjct: 417 RLCVSSYLFLTGFGHTVFFLQKKDYSLRRFVNIIIRLNLLAITLVYTMRTSWVTYYYIPL 476
Query: 263 HTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYT 322
T++ ++VY + + +N ++ KI A ++ + G+ + L F T
Sbjct: 477 CTIWFLIVYATLAVGRHHNSNTIFLLAKIAAAAFLLNKVLYADGLREAL---LAFFTAMT 533
Query: 323 DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYA-----------YYH--------------P 357
P + + R DRYI +GM A +H P
Sbjct: 534 KGYFP----VSFFDKRFNNDRYIVFVGMAVAPVYLWLTDLIKSHHGGHKSLLATANEKIP 589
Query: 358 TAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPI- 416
+ + KL+++ + IK V AL ++ + L K ++ YT+WIPI
Sbjct: 590 STRQNFTKLDKAVLQHWTLIKRAAVITALIGLLAFWRWGFTLTKSAFSAQQAYTNWIPII 649
Query: 417 ----------TYVLFIFYFFSLVKHLSGSLYMM 439
T + FF+ V + SG +Y++
Sbjct: 650 CFAILRNAHPTLRNYHSKFFAWVGNYSGEIYVL 682
>gi|347965821|ref|XP_321732.5| AGAP001402-PA [Anopheles gambiae str. PEST]
gi|333470338|gb|EAA01095.5| AGAP001402-PA [Anopheles gambiae str. PEST]
Length = 802
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 80/357 (22%), Positives = 171/357 (47%), Gaps = 30/357 (8%)
Query: 98 LENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKH 157
+E ++ + A+A I+ YFY+CDRTN K Y+ F ++ + V A+
Sbjct: 371 IETQSPVAALASLAIIMTYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLFFTE 427
Query: 158 NDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTG 214
+ K + L+R QT+E KGWMQ++ L+Y+ A+ I Y I++ I+ +++++G
Sbjct: 428 D------SKLTKVLHRDQTDELKGWMQIVILIYYMTGASHILPIYMHIKVLISGFLFLSG 481
Query: 215 FGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPM----HTLFTIMV 270
+ +F+Y++ + L RF +M+R+NF C+ +N Y Y+ P+ +++ +M+
Sbjct: 482 YAHFTYWWQTGNAGLVRFLNVMFRMNFLTVILCLCMNRPYQFYFFVPLLSFWYSIMYLML 541
Query: 271 YGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLP 330
I + E + ++ F+ ++ + + ++F+ + +
Sbjct: 542 SLPPRITAQSTEANPYQYLYVVIKFVTMLATVTVLYMSEVFFERIFVTRPWKALFVTTDD 601
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK---RKLSIKAGIVTVALF 387
+HEW +R LDRY GMI+A ++++ + + +++S+ + + +
Sbjct: 602 DIHEWWYRWKLDRYTITYGMIFAAIFQISQRFAVVDDNNHGNLFSKRISLTSTLAAITGI 661
Query: 388 VGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLYMMACRYS 444
Y+ + + ++ + H Y +IPI + L++++SG ++ RYS
Sbjct: 662 GCYMTW-TFFCRNRQDCEEVHSYVVFIPIV-------GYILLRNISG---ILRTRYS 707
>gi|348539668|ref|XP_003457311.1| PREDICTED: CAS1 domain-containing protein 1-like [Oreochromis
niloticus]
Length = 718
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/329 (26%), Positives = 142/329 (43%), Gaps = 94/329 (28%)
Query: 101 RATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDK 160
+A +A+ G I+ YFY+CDR ++ K Y F F+ L+ + V + ++D
Sbjct: 364 KAPFQALCRMGVIMGYFYLCDRADVFMKEQKFYTHSTF-FIPLIYIFVLGVF----YSDN 418
Query: 161 SPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGN 217
S K + LNR QT+EWKGWMQ++ L+YH A+ +Y +R+ +AAY++ TG+G+
Sbjct: 419 S----KEAKLLNREQTDEWKGWMQLVILIYHISGASAFIPVYMHVRVLVAAYLFQTGYGH 474
Query: 218 FSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIF 277
FS+++++ DF L Y +C GIF
Sbjct: 475 FSFFWLKGDFGL---------------------------YRVCQ-------------GIF 494
Query: 278 NKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHF 337
+ V +W I +F++ S +HEW F
Sbjct: 495 ER------------------VFSVWPISKLFELNGS------------------IHEWWF 518
Query: 338 RSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAG----IVTVALFVGYLWY 393
R LDR+ I GM++A+ + +K + L E + + S K ++V F+ Y +
Sbjct: 519 RWKLDRFAVIHGMVFAFIYLVLQK-RQVLSEGKGEALFSAKISNFLLFLSVVSFITYSIW 577
Query: 394 ECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
K K N+ HPY S + I + I
Sbjct: 578 ASSCK-TKTECNEMHPYISVVQILAFILI 605
>gi|224004760|ref|XP_002296031.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586063|gb|ACI64748.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 502
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 125/293 (42%), Gaps = 60/293 (20%)
Query: 171 LNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWM------------------ 212
L R QTEEWKGWMQ F+ YHY+ + +YN IR+F++AYVWM
Sbjct: 90 LGREQTEEWKGWMQWAFIFYHYYRVSYVYNEIRVFVSAYVWMVSEVRLLEEECFLCAQTN 149
Query: 213 ---------------TGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLY 257
TGFGNF Y+ + DFS+ RF M+ R+N+F L LY
Sbjct: 150 STTLDSPRIHRPNNKTGFGNFLYFDKKADFSVERFISMILRINYFPLLLSYFLTVPLELY 209
Query: 258 YICPMHT--LFTIMVYGAVGIFNKYNEIG----SVMIVKILACFLVVILIWEIPGVFDIF 311
Y+ P+HT M+ VG ++ ++G ++ L +L +E V
Sbjct: 210 YVVPLHTTGFVMTMISCYVG-YSFERKLGWSYWKSRTAAVVVSLLAHVLFYETAAV---- 264
Query: 312 WSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE--ES 369
F+L ++ E+H R D+Y +GM ++M+ E+
Sbjct: 265 ----NFLLLFSK----------EYHLRFQTDKYSAWMGMACGLLWGKVGEYMQWAHGFEN 310
Query: 370 EPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
E +R+ + A + A + +Y + DK YN HPY PI L I
Sbjct: 311 ERRRRNATIAQFLGGAFLIWIWYYFFGFISDKHVYNPVHPYVFIFPIVGWLMI 363
>gi|46138121|ref|XP_390751.1| hypothetical protein FG10575.1 [Gibberella zeae PH-1]
Length = 934
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 156/375 (41%), Gaps = 68/375 (18%)
Query: 78 LSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDL 137
LS+ + ++ + +A+ L N +R + F +L Y DRT ++ S+K + D
Sbjct: 425 LSACVVCEILDLTSAKEAWSLLN---MR-IGSFVLVLLMCYFSDRTQMMAKSSKLWEVDG 480
Query: 138 FLFLYLLLV----IVSAMTSLKKHNDKSPFSGKTIQ------------------YLNRHQ 175
F L + + T K H +T + +L+R Q
Sbjct: 481 FAILCAACLLPLLVTIRRTRPKSHQHLPSTEDETSEKLLPENPELEENYQQDEPFLSRTQ 540
Query: 176 TEEWKGWMQVLFLMYHYFAATE----IYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPR 231
TEEWKGWMQ L+Y + A + +Y R+ IAAY++ TG+G+ Y+ DFSL R
Sbjct: 541 TEEWKGWMQCFVLIYQWTGADQGPISLYILFRLCIAAYMFQTGYGHAVYFITTSDFSLKR 600
Query: 232 FAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKI 291
+ RLN F +N DYM YY P+ + + +++Y + I + N +++ K+
Sbjct: 601 VVTTLLRLNAFSCALAYSMNMDYMFYYSAPLASFWFLVIYATMAIGKQCNSDTQMVVAKV 660
Query: 292 LACFLVVILIWEIPG----------VFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGL 341
++V ++ P +F+I WS +W + L
Sbjct: 661 CISGVLVFAMFVTPLPRWIFNLFEIIFNIQWS------------------ADQWIRYATL 702
Query: 342 DRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDK 401
D +I IGM+ A + LS++ + +F ++ L +
Sbjct: 703 DMFIVYIGMVTAIVSQMGGTQI----------ILSLRLMLGLAGVFATCYYFIKGSTLSQ 752
Query: 402 VTYNKYHPYTSWIPI 416
+Y+ HPY S IPI
Sbjct: 753 SSYDSLHPYLSSIPI 767
>gi|346971264|gb|EGY14716.1| hypothetical protein VDAG_05880 [Verticillium dahliae VdLs.17]
Length = 824
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 162/374 (43%), Gaps = 51/374 (13%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLL-VIVSAMTSLKKHNDKSPFS 164
A+A AI Y Y+ DRT+L + K ++ + LLL V+ +A + +P
Sbjct: 315 AVASMLAIASYCYLADRTHLFDKNAKQFDASSVAWSSLLLAVLATASVRSSRAGFLTPVH 374
Query: 165 GKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGFGNFSYY 221
K + +L R QTEE KG MQ L+Y + A+EI + R+ ++AY++ +G+ Y+
Sbjct: 375 RKPVSFLGRVQTEELKGLMQGFLLLYDFHGASEIPAMHLPFRLSVSAYIFFLCYGHARYF 434
Query: 222 YIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYN 281
+DFS R A ++ RLN +L+ + Y + + + ++ Y + F N
Sbjct: 435 LRTEDFSFTRAAHILCRLNVLSCLLSFMLDAQWATQYFSALASFWFLVTYATLASFRHLN 494
Query: 282 EIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYT-DPAKPDLPRLHEWHFRSG 340
++++VK+L ++ +I E PGV L G D + + H
Sbjct: 495 NNLALLVVKVLTAGVMTSVIIETPGVVGGILLALNKCFGVVWD--------VQKVHLYVS 546
Query: 341 LDRYIWIIGMIYA---YYHPTAEKWMEKLEES-------------------------EPK 372
LDR+I + G+++A + ++ + +S K
Sbjct: 547 LDRFIPMFGVLFAAVVHRFTVLQQQQDDGGKSILVNTGSPLDRLNYALDRELVGMIYPGK 606
Query: 373 RKLSIK------AGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFF 426
+S+K +G+ VAL G L + D Y+ YHPY S + +L +
Sbjct: 607 DAISVKSIWIGFSGLYLVAL--GVLGFAANSDSDNRAYDAYHPYASPWAVLAILAVRNCH 664
Query: 427 SLV--KHLSGSLYM 438
+ +H++G++ +
Sbjct: 665 RGLRQRHMAGAIAL 678
>gi|46115520|ref|XP_383778.1| hypothetical protein FG03602.1 [Gibberella zeae PH-1]
Length = 1454
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/326 (23%), Positives = 140/326 (42%), Gaps = 55/326 (16%)
Query: 114 LFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMT----------SLKKHNDKSPF 163
L Y DRT ++ +K + F+ L L + + T SL + + PF
Sbjct: 346 LLMCYYADRTQMMAKGSKPWQIGDFIALCLPCIAICLSTIRRSDPPRYLSLTQPSTDQPF 405
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE--IYNAIRIFIAAYVWMTGFGNFSYY 221
L+ Q +EWKGW+Q L+ H+ A E I + + + AY++ TG+ + +
Sbjct: 406 -------LSLDQIDEWKGWIQAFILICHWTGAQEGSIQVLVSLCVGAYIFQTGYVHTLDF 458
Query: 222 YIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYN 281
+K FS A ++RLN ++ DYM Y++ P+ + + ++VY V + + N
Sbjct: 459 MNKKGFSFNHAASTLFRLNILSCLLAYFMDTDYMTYHLSPLLSFWFLVVYATVAVDRQQN 518
Query: 282 EIGSVMIVKIL-ACFLVVILIWEIP----------GVFDIFWSPLTFILGYTDPAKPDLP 330
++VKI +C ++ + + P G+F I W
Sbjct: 519 NELKFLLVKICTSCMIISFIFLDTPSTSWTFNILQGIFKIQW------------------ 560
Query: 331 RLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGY 390
R+ EW LD +I +GM+ A +++++E +L ++ + LF
Sbjct: 561 RVEEWQRSVTLDLFIAYVGMLAAVIG-------REMKKAEVSVRLGLRVCLAFGGLFSIL 613
Query: 391 LWYECIYKLDKVTYNKYHPYTSWIPI 416
+ + + +Y K+HPY S IPI
Sbjct: 614 HYLSFTSHVRESSYMKWHPYVSAIPI 639
>gi|119597196|gb|EAW76790.1| CAS1 domain containing 1, isoform CRA_b [Homo sapiens]
Length = 359
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 86/146 (58%), Gaps = 14/146 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 225 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 277
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 278 --KETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 335
Query: 221 YYIRKDFSLPRFAQMMWRLNFFVAFC 246
++I+ DF + R Q + ++ +A+C
Sbjct: 336 FWIKGDFGIYRVCQEI--VSGILAYC 359
>gi|358336687|dbj|GAA55143.1| CAS1 domain-containing protein 1 [Clonorchis sinensis]
Length = 715
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 154/368 (41%), Gaps = 67/368 (18%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGK 166
++ GA+L YFYICDRT + G + K+ +L +L + V + S++ S K
Sbjct: 247 LSRMGAVLIYFYICDRTTMFGKTHKD-PESFWLRGEVLCLTVLGLLSIR--------SSK 297
Query: 167 TIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYN---AIRIFIAAYVWMTGFGNFSYYYI 223
+ N T EWKGWMQ+ + YH+ + N ++R ++AY++++ +G+ Y++
Sbjct: 298 HTDFNNLDVTREWKGWMQLYIVTYHFVCGHSVVNGYISVRYLVSAYLFLSAYGHTCYFWR 357
Query: 224 R--------KDFS-------------LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPM 262
+ +D + + R+ ++M+R+N F C ++N YM YY P+
Sbjct: 358 KYAQAGGESQDGTNVAFKCIEEIWPLVRRYLEVMYRMNLFAVVLCFMMNQKYMFYYFVPL 417
Query: 263 ----HTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVI-------------LIWEIP 305
T ++ + I + ++ G + I + C VV L+ P
Sbjct: 418 ISFWFTYLSVTMPLFFQILMRNDDAGGLDIDRAKLCRFVVKLAAFALMALAPIELLHRTP 477
Query: 306 GVFD--IFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWM 363
+F F PL I + + W FR LDRY GMI + ++++
Sbjct: 478 FLFGRIFFTPPLRGIFAIYETWEHKKSSQEPWLFRWSLDRYTLAFGMILTVFLKWCQQFL 537
Query: 364 ------EKLEESEPKRKL--------SIKAGIVTVALFVGYLWYECIYKL-DKVTYNKYH 408
LEE+ + S+ G+ + L V +Y DK + H
Sbjct: 538 GTDVLPNALEETSRSQATSHKQSTWNSVAVGLTILGLTVTVAVAVSVYSCRDKTSAILLH 597
Query: 409 PYTSWIPI 416
PY IPI
Sbjct: 598 PYICIIPI 605
>gi|312373691|gb|EFR21390.1| hypothetical protein AND_17127 [Anopheles darlingi]
Length = 769
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 176/365 (48%), Gaps = 48/365 (13%)
Query: 99 ENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHN 158
E + + A+ I+ YFY+CDRTN K Y+ F ++ + V A+ +
Sbjct: 339 EKESPIAALTSLAIIMTYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLFFTED 395
Query: 159 DKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYVWMTGF 215
K + L+R QT+E KGWMQ++ L+Y+ A+ I Y I++ I+ +++++G+
Sbjct: 396 ------SKLTKVLHRDQTDELKGWMQIVILIYYMTGASHILPIYMHIKVLISGFLFLSGY 449
Query: 216 GNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVG 275
+F+ ++ + + + RF M+R+NF C+ +N Y Y+ P+ + + +++ +
Sbjct: 450 THFTCWWQQGETGVSRFLYRMFRMNFLTVLLCLCMNRPYQFYFFVPLLSFWYCIMFLTLS 509
Query: 276 IFNKYNEIGSV--------MIVKILACFLVVILIWEIPGVFD-IFWS-PLTFILGYTDPA 325
+ + + + +++KI+A ++ +++ F+ IF + P + TD
Sbjct: 510 LPPRLSAQSTESNPYHYLYLVLKIVAMLSIITVLYMSEVFFERIFVTRPWKALFVTTDDD 569
Query: 326 KPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK---RKLSIK---A 379
+HEW +R LDRY GMI+A A+++ + + +++S+ A
Sbjct: 570 ------IHEWWYRWKLDRYTVTYGMIFAALFQAAQRFSLVDDSNHGNLFSKRISLTSTLA 623
Query: 380 GIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLYMM 439
I + ++ + ++ C + D + H Y +IPI + L++++SG ++
Sbjct: 624 AITGIGCYITWTFF-CRNRQD---CEEVHSYVVFIPIV-------GYILLRNISG---VL 669
Query: 440 ACRYS 444
RYS
Sbjct: 670 RTRYS 674
>gi|322790711|gb|EFZ15455.1| hypothetical protein SINV_00416 [Solenopsis invicta]
Length = 812
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 95/179 (53%), Gaps = 21/179 (11%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
++A F IL YFY+CDRTN K Y+ F F L I++ + ++ P
Sbjct: 388 SLALFSLILAYFYLCDRTNFFMKENKYYSE--FSFWLPLGYILALGLFFTEDRERGP--- 442
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+EW+G MQ + L+YH A IY +R+ +AY++++G+G+F Y++
Sbjct: 443 ---RVLNREQTDEWRGLMQAVVLIYHVTGAKNVLPIYMYLRLINSAYLFLSGYGHFYYFW 499
Query: 223 IRKDFSLPRFA----------QMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVY 271
D SL RFA Q+M+RLN C+ +N Y Y+ P+ + + +++Y
Sbjct: 500 QTGDVSLVRFARVNSNLVVVFQVMFRLNLLTVSLCLCMNRPYQFYHFVPLVSFWFLVIY 558
>gi|453085416|gb|EMF13459.1| Cas1p-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 944
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 128/290 (44%), Gaps = 45/290 (15%)
Query: 99 ENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHN 158
+R +RA++ F A++ Y+ DRT++ T+ LF +++ V + S+++
Sbjct: 361 SSRPGVRALSAFTAVVAISYVADRTHVFEQVTRLPLNKFNLFSMIIIAGVIGVLSIRRSK 420
Query: 159 DKSPFSGKTIQYLN----------------------------RHQTEEWKGWMQVLFLMY 190
P R QT+EWKGWMQ+L ++Y
Sbjct: 421 GPPPPRRPLPAAPAAGVATAATSAASPPLPPPPPSPSFPFLPRDQTDEWKGWMQLLIIIY 480
Query: 191 HYFAA---TEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCC 247
HY A E + IR+ +++Y+++TGFG+ Y+ ++DFSL R ++ R N
Sbjct: 481 HYNMAFWYDEFWQIIRLCVSSYLFLTGFGHTVYFLQKRDFSLKRLVNVLIRTNLLPCTLA 540
Query: 248 IVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGV 307
V+ +++LYY P+ T + ++VY + I KYN ++ KI G+
Sbjct: 541 YVMRTNWLLYYYMPLSTFWFLIVYATLAIGQKYNHRTLFLLFKIAFS----------AGL 590
Query: 308 FDIFWS----PLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYA 353
F S P T + + K +H R +D+YI +GM+ A
Sbjct: 591 VHTFLSTKDLPETVVRFFVITCKMKFDTNEFFHHRVQVDQYIVYVGMVAA 640
>gi|189235703|ref|XP_001807591.1| PREDICTED: similar to CG2938 CG2938-PB [Tribolium castaneum]
Length = 796
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 150/323 (46%), Gaps = 24/323 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
++A+ I+ YF++CDRTN K Y+ F + L + + L D
Sbjct: 373 SLAKMALIMSYFFLCDRTNFFMKENKYYSE----FSFWLPIGYVTVLGLFFTED-----S 423
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNA---IRIFIAAYVWMTGFGNFSYYY 222
K + L+R Q EWKGWMQ++ L+YH A+ I I++ I+AY+++ G+ F +
Sbjct: 424 KYTKVLHRDQLNEWKGWMQLVILVYHITGASRILPINMHIKVLISAYLFLLGYEQFCCVW 483
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVG----IFN 278
R D + F +++++LNF C+ +N Y YY P+ + + +M Y + I
Sbjct: 484 QRGDIGIVSFFRVLFQLNFITVTLCLCMNRPYQFYYFVPLLSFWYLMTYCFLAFPPHITA 543
Query: 279 KYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFR 338
+ +E + +L F+V + I + ++F+ + + + EW FR
Sbjct: 544 QTSENNVMQYFYLLIKFVVFFTVITILFMSEVFFEKVFVTRPWKALFVTTDDDIREWWFR 603
Query: 339 SGLDRYIWIIGMIYAYYHPTAEKWM---EKLEESEPKRKLSIKAGIVTVALFVGYL--WY 393
LDRY I GM +A A+++ + + R L++ +V +A YL +
Sbjct: 604 WKLDRYTIIYGMGFAVILLLAQRYNIYDDNNHNNLFSRGLALTGILVAIAGIGCYLSITF 663
Query: 394 ECIYKLDKVTYNKYHPYTSWIPI 416
C +L+ ++ H Y +IPI
Sbjct: 664 LCSTELE---CSEIHSYIVFIPI 683
>gi|270003398|gb|EEZ99845.1| hypothetical protein TcasGA2_TC002627 [Tribolium castaneum]
Length = 831
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 150/323 (46%), Gaps = 24/323 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
++A+ I+ YF++CDRTN K Y+ F + L + + L D
Sbjct: 408 SLAKMALIMSYFFLCDRTNFFMKENKYYSE----FSFWLPIGYVTVLGLFFTED-----S 458
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNA---IRIFIAAYVWMTGFGNFSYYY 222
K + L+R Q EWKGWMQ++ L+YH A+ I I++ I+AY+++ G+ F +
Sbjct: 459 KYTKVLHRDQLNEWKGWMQLVILVYHITGASRILPINMHIKVLISAYLFLLGYEQFCCVW 518
Query: 223 IRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVG----IFN 278
R D + F +++++LNF C+ +N Y YY P+ + + +M Y + I
Sbjct: 519 QRGDIGIVSFFRVLFQLNFITVTLCLCMNRPYQFYYFVPLLSFWYLMTYCFLAFPPHITA 578
Query: 279 KYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFR 338
+ +E + +L F+V + I + ++F+ + + + EW FR
Sbjct: 579 QTSENNVMQYFYLLIKFVVFFTVITILFMSEVFFEKVFVTRPWKALFVTTDDDIREWWFR 638
Query: 339 SGLDRYIWIIGMIYAYYHPTAEKWM---EKLEESEPKRKLSIKAGIVTVALFVGYL--WY 393
LDRY I GM +A A+++ + + R L++ +V +A YL +
Sbjct: 639 WKLDRYTIIYGMGFAVILLLAQRYNIYDDNNHNNLFSRGLALTGILVAIAGIGCYLSITF 698
Query: 394 ECIYKLDKVTYNKYHPYTSWIPI 416
C +L+ ++ H Y +IPI
Sbjct: 699 LCSTELE---CSEIHSYIVFIPI 718
>gi|326921775|ref|XP_003207131.1| PREDICTED: CAS1 domain-containing protein 1-like, partial
[Meleagris gallopavo]
Length = 472
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L + G I+ YFY+CDR NL K Y F + ++++ +
Sbjct: 332 LHCFCKLGLIMTYFYLCDRANLFMKENKFYTHSSFFIPIVYILVLGVFYTE--------- 382
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ K + LNR QT+EWKGWMQ++ L+YH A+ +Y IR+ +AAY++ TG+G+FSY
Sbjct: 383 NTKETKVLNREQTDEWKGWMQLVILIYHISGASTFLPVYMHIRVLVAAYLFQTGYGHFSY 442
Query: 221 YYIRKDFSLPRFAQM 235
++I+ DF + R Q+
Sbjct: 443 FWIKGDFGVYRVCQV 457
>gi|408400060|gb|EKJ79148.1| hypothetical protein FPSE_00749 [Fusarium pseudograminearum CS3096]
Length = 922
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 128/289 (44%), Gaps = 64/289 (22%)
Query: 107 MAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLV-IVSAMTSLKKHNDKSPFSG 165
+ F +L Y DRT ++ S+K + +++ F L V +++ + ++++ KSP
Sbjct: 435 IGSFFLVLLMSYYSDRTQMMAKSSKLW--EMYGFGILCAVCLIALLITIRRTRPKSPEQL 492
Query: 166 KTIQ-------------------------YLNRHQTEEWKGWMQVLFLMYHYFAA----T 196
+ + +L+R QTEEWKGWMQ L+Y + A T
Sbjct: 493 SSTEDETSEKLLPENCPSELEEHGEQDEPFLSRKQTEEWKGWMQCFVLIYQWTGADQGPT 552
Query: 197 EIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYML 256
+Y R+ IAAY++ TG+G+ Y+ DFS R A + R N +N DYM
Sbjct: 553 SLYVLFRLCIAAYMFQTGYGHTYYFITTGDFSFKRVATTLLRFNILSCALAYSMNMDYMF 612
Query: 257 YYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILI------------WEI 304
YY P+ + + ++VY + I ++N ++I K+ ++V ++ +EI
Sbjct: 613 YYSAPLASFWFLVVYATMAIGKQHNNNTQMVIAKVFISGVLVSVVFMTSLTKWMFNFFEI 672
Query: 305 PGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYA 353
+F+I W +W + + LD I IGMI A
Sbjct: 673 --LFNIQWD------------------ADQWKYYANLDILIVYIGMITA 701
>gi|167522068|ref|XP_001745372.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776330|gb|EDQ89950.1| predicted protein [Monosiga brevicollis MX1]
Length = 422
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 117/224 (52%), Gaps = 22/224 (9%)
Query: 102 ATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKS 161
AT RA +L + DRT LL TK ++ L + ++ ++A+ +++ ++ +
Sbjct: 9 ATRRACGVLAFVLLLLMLADRTPLLLKQTKAWS-PWALGIGSSIIGLAALLWIEEVDEDT 67
Query: 162 PFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT------EIYNAIRIFIAAYVWMTGF 215
+++RHQT EWKGWMQ+ FL+YHY AA+ +Y+ IRI + +++++TG+
Sbjct: 68 --------FMSRHQTNEWKGWMQLTFLVYHYTAASGARQVLAVYSWIRILVGSFLFLTGY 119
Query: 216 GNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVG 275
G+F + +R+ L R ++ RLN + C+ ++ YM YY P+ T + + G
Sbjct: 120 GHF-LHALRQGVKLHRVGMVLVRLNLYAVALCLCMHRPYMAYYFIPLVTFWYLETIGLAV 178
Query: 276 IFNKYNEIGSVMIVKILACFLVVILIWEIPG----VFD-IFWSP 314
++ + V +LA L +L W G +FD IF +P
Sbjct: 179 VYPHLGRHRDAITV-VLALVLPSVLFWSPDGQATPIFDAIFNAP 221
>gi|429863446|gb|ELA37897.1| cas1 domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 787
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 110/213 (51%), Gaps = 8/213 (3%)
Query: 103 TLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSP 162
+L A A AI Y + DRT+LL K+++ F+ + LVI++ ++ + S
Sbjct: 288 SLLAAATLVAIAVYCLLADRTHLLIKHDKHFDVPDFILPVIGLVILAGLSLRSRPPVLSS 347
Query: 163 FSG-KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNF 218
G T +L+R QT+EWKGWMQ L+Y+Y AA+E +Y + +A Y++++ +G+
Sbjct: 348 AKGLATTSFLSREQTDEWKGWMQAFILLYNYHAASESLAMYKVHKFLVATYIFLSCYGHT 407
Query: 219 SYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFN 278
SY+ +DFSL R ++ RLN + ++++++Y P+ T + + + ++ F
Sbjct: 408 SYFLRTEDFSLRRATYVLVRLNLLSCILGLATSSEWIIYS-APLMTFWFGVTFASLACFK 466
Query: 279 KYNEIGSVMIVKIL---ACFLVVILIWEIPGVF 308
N +KI+ C ++ +P F
Sbjct: 467 TLNNNPVAFFLKIVVFATCTTFLVRSTHLPETF 499
>gi|195133848|ref|XP_002011351.1| GI16046 [Drosophila mojavensis]
gi|193907326|gb|EDW06193.1| GI16046 [Drosophila mojavensis]
Length = 853
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 92/332 (27%), Positives = 155/332 (46%), Gaps = 36/332 (10%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L A++ G IL YFY+CDRTN K Y+ F ++ + V A+ ++S F
Sbjct: 426 LVALSLLGLILAYFYLCDRTNFFMKENKYYSEFSF---WIPVAYVFALGLF--FTEESGF 480
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY 220
+ + LNRHQT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F+
Sbjct: 481 T----KVLNRHQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTD 536
Query: 221 YYIRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFN 278
+ RF Q M+RLNF C +N Y YY P+ + + ++Y + +
Sbjct: 537 IWQTGGSGSMFVRFFQEMFRLNFLCVLLCFCMNRPYQFYYFVPLLSFWVCVIYFVLSLPP 596
Query: 279 KYNEIGSV---------MIVKILACFLVVILIWEIPGVFD-IFWS-PLTFILGYTDPAKP 327
+ SV ++ K + C V+ +++ F+ IF + P + TD
Sbjct: 597 RIT-TASVDAYPLHYLYLVCKCIGCLGVITVLFMSEVFFERIFVTRPWKALFVTTDDD-- 653
Query: 328 DLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI-VTVAL 386
+HEW ++ LDRY GMIYA A+K+ +++ S + I VT+
Sbjct: 654 ----IHEWWYQWKLDRYTVTFGMIYAACFHIAQKY-SIFDDNNHGNLFSRRTSISVTLLA 708
Query: 387 FVGYLWYECIYKLDKVTYN--KYHPYTSWIPI 416
+G Y L + N + H Y +IPI
Sbjct: 709 LLGVGIYTSFSFLCRNVQNCEEIHSYILFIPI 740
>gi|194763260|ref|XP_001963751.1| GF21094 [Drosophila ananassae]
gi|190618676|gb|EDV34200.1| GF21094 [Drosophila ananassae]
Length = 875
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/364 (27%), Positives = 163/364 (44%), Gaps = 41/364 (11%)
Query: 67 GGLSRSASARLLSSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLL 126
GG +R A LS A + + + A++ G IL YFY+CDRTN
Sbjct: 426 GGRTRGAPGNALS---------------ALITDYGTPMVALSLLGLILAYFYLCDRTNFF 470
Query: 127 GDSTKNYNRDLFLFLYLLLVIVSAMTSLKKH-NDKSPFSGKTIQYLNRHQTEEWKGWMQV 185
K Y+ Y + V + +L + S F+ + LNR QT+E +GW+ +
Sbjct: 471 MKENKYYSE------YSFWIPVGYVFALGLFFTEDSRFT----KVLNRDQTDELRGWILL 520
Query: 186 LFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSY-YYIRKDFSL-PRFAQMMWRLN 240
+ L+Y+ A I+ I++ I+ Y ++TG+ +F++ ++ SL RF Q M+R N
Sbjct: 521 VVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHLWHTGSSGSLFVRFFQAMFRAN 580
Query: 241 FFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVIL 300
F C +N Y YY P+ + + +VY + + + + SV + +LV
Sbjct: 581 FLSVLLCFCMNRPYQFYYFVPLLSFWLCVVYFVLALPPRISS-ASVDANPLHYLYLVCKC 639
Query: 301 IWEIPGVFDIFWSPLTFILGY-TDPAKPDL----PRLHEWHFRSGLDRYIWIIGMIYAYY 355
I + G+ +F S + F + T P K LHEW + LDRY GMIYA
Sbjct: 640 IGCLGGITVLFMSEVFFERIFVTRPWKALFVTTDDDLHEWWHQWKLDRYTVAFGMIYAAC 699
Query: 356 HPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYECIYKLDKVTYN--KYHPYTS 412
A+K+ +++ S + I VT+ +G Y L + N + H Y
Sbjct: 700 FHIAQKY-NVFDDNNHGNLFSRRTSISVTLLALLGVGVYTAFSFLCRNVQNCEEIHSYIL 758
Query: 413 WIPI 416
+IPI
Sbjct: 759 FIPI 762
>gi|195399424|ref|XP_002058320.1| GJ15558 [Drosophila virilis]
gi|194150744|gb|EDW66428.1| GJ15558 [Drosophila virilis]
Length = 857
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 160/342 (46%), Gaps = 37/342 (10%)
Query: 95 AFLLENRAT-LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTS 153
A L+ + T L A++ G I+ YFY+CDRTN K Y+ F ++ + V A+
Sbjct: 420 AVLITDYGTPLVALSLLGLIMAYFYLCDRTNFFMKENKYYSEFSF---WIPVAYVFALGL 476
Query: 154 LKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYV 210
++S F+ + LNR QT+E +GW+ ++ L+Y+ A I + I++ I+ Y
Sbjct: 477 F--FTEESRFT----KVLNRQQTDELRGWILLVVLIYYMTGAQRILPIHMHIKLLISGYF 530
Query: 211 WMTGFGNFSYYYIRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTI 268
++TG+ +F++ + RF Q M+RLNF C +N Y YY P+ + +
Sbjct: 531 FLTGYTHFTHIWQTGGSGSMFVRFFQSMFRLNFLSILLCFCMNRPYQFYYFVPLLSFWLC 590
Query: 269 MVYGAVGIFNKYNEIGSV---------MIVKILACFLVVILIWEIPGVFD-IFWS-PLTF 317
++Y + + + SV ++ K + C V+ +++ F+ IF + P
Sbjct: 591 VIYFVLSLPPRITS-ASVDANPLHYLYLVCKCIGCLGVITVLFMSEVFFERIFVTRPWKA 649
Query: 318 ILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSI 377
+ TD +HEW ++ LDRY GMIYA A+K+ +++ S
Sbjct: 650 LFVTTDDD------IHEWWYQWKLDRYTVTFGMIYAACFHIAQKY-NVFDDNNHGNLFSR 702
Query: 378 KAGI-VTVALFVGYLWYECIYKLDKVTYN--KYHPYTSWIPI 416
+ I VT+ +G Y L + N + H Y +IPI
Sbjct: 703 RTSISVTLLALLGVGIYTAFSLLCRNVQNCEEIHSYILFIPI 744
>gi|357438359|ref|XP_003589455.1| hypothetical protein MTR_1g024790 [Medicago truncatula]
gi|355478503|gb|AES59706.1| hypothetical protein MTR_1g024790 [Medicago truncatula]
Length = 124
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 71/108 (65%), Gaps = 7/108 (6%)
Query: 1 MVVFRPITPGQVSFLLGIIPVFVAWIYSEFLEYKKVSSHTKVHSDTNLVELEKETIKEDD 60
M + P+TPGQVSFLLG+ PV +AWIYSE LE++K S +K HSD LVE+ + +K+++
Sbjct: 1 MHILSPVTPGQVSFLLGLFPVIIAWIYSEILEFRKNSLTSKAHSDIGLVEVRTDVVKDEE 60
Query: 61 RAVLLEGGLSRSAS----ARLLSSSIKTNLIRFMTMDDAFLLENRATL 104
VLLEGG + AS AR ++S T++IR D FLL + T+
Sbjct: 61 TTVLLEGGALQPASPTPKARSFTAS--TSIIR-SEFDHYFLLMDGKTI 105
>gi|194887926|ref|XP_001976832.1| GG18568 [Drosophila erecta]
gi|190648481|gb|EDV45759.1| GG18568 [Drosophila erecta]
Length = 866
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 150/324 (46%), Gaps = 24/324 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ G IL YFY+CDRTN K Y+ F ++ + V A+ + S F+
Sbjct: 441 ALSLLGLILAYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLF--FTEDSRFT- 494
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F++ +
Sbjct: 495 ---KVLNRDQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHMW 551
Query: 223 IRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
RF Q M+R NF C +N Y YY P+ + + +VY + + +
Sbjct: 552 QTGGSGSLFVRFFQAMFRANFLSVLLCFCMNRPYQFYYFVPLLSFWLCIVYFVLALPPRI 611
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY-TDPAKP----DLPRLHEW 335
+ SV + +LV I + G+ +F S + F + T P K LHEW
Sbjct: 612 SS-ASVDANPLHYLYLVCKCIGCLGGITVLFMSEVFFERIFVTRPWKALFVTTDDDLHEW 670
Query: 336 HFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYE 394
+ LDRY GMIYA A+K+ +++ S + I VT+ +G Y
Sbjct: 671 WHQWKLDRYTVAFGMIYAACFHIAQKY-NVFDDNNHGNLFSRRTSISVTLLALLGVGVYT 729
Query: 395 CIYKLDKVTYN--KYHPYTSWIPI 416
L + N + H Y +IPI
Sbjct: 730 SFSFLCRNVQNCEEIHSYILFIPI 753
>gi|195477260|ref|XP_002100146.1| GE16879 [Drosophila yakuba]
gi|194187670|gb|EDX01254.1| GE16879 [Drosophila yakuba]
Length = 868
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 150/324 (46%), Gaps = 24/324 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ G IL YFY+CDRTN K Y+ F ++ + V A+ + S F+
Sbjct: 443 ALSLLGLILAYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLF--FTEDSRFT- 496
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F++ +
Sbjct: 497 ---KVLNRDQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHMW 553
Query: 223 IRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
RF Q M+R NF C +N Y YY P+ + + +VY + + +
Sbjct: 554 QTGGSGSLFVRFFQAMFRANFLSVLLCFCMNRPYQFYYFVPLLSFWLCIVYFVLALPPRI 613
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY-TDPAKP----DLPRLHEW 335
+ SV + +LV I + G+ +F S + F + T P K LHEW
Sbjct: 614 SS-SSVDANPLHYLYLVCKCIGCLGGITVLFMSEVFFERIFVTRPWKALFVTTDDDLHEW 672
Query: 336 HFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYE 394
+ LDRY GMIYA A+K+ +++ S + I VT+ +G Y
Sbjct: 673 WHQWKLDRYTVAFGMIYAACFHIAQKY-NVFDDNNHGNLFSRRTSISVTLLALLGVGVYT 731
Query: 395 CIYKLDKVTYN--KYHPYTSWIPI 416
L + N + H Y +IPI
Sbjct: 732 SFTFLCRNVQNCEEIHSYILFIPI 755
>gi|195564968|ref|XP_002106079.1| GD16317 [Drosophila simulans]
gi|194203450|gb|EDX17026.1| GD16317 [Drosophila simulans]
Length = 859
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 150/324 (46%), Gaps = 24/324 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ G IL YFY+CDRTN K Y+ F ++ + V A+ + S F+
Sbjct: 434 ALSLLGLILAYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLF--FTEDSRFT- 487
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F++ +
Sbjct: 488 ---KVLNRDQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHMW 544
Query: 223 IRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
RF Q M+R NF C +N Y YY P+ + + +VY + + +
Sbjct: 545 QTGGSGSLFVRFFQAMFRANFLSVLLCFCMNRPYQFYYFVPLLSFWLCIVYFVLALPPRL 604
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTF-ILGYTDPAKP----DLPRLHEW 335
+ SV + +LV I + G+ +F S + F + T P K LHEW
Sbjct: 605 SS-ASVDANPLHYLYLVCKCIGCLGGITVLFMSEVFFERIFVTRPWKALFVTTDDDLHEW 663
Query: 336 HFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYE 394
+ LDRY GMIYA A+K+ +++ S + I VT+ +G Y
Sbjct: 664 WHQWKLDRYTVAFGMIYAACFHIAQKY-NVFDDNNHGNLFSRRTSISVTLLALLGVGVYT 722
Query: 395 CIYKLDKVTYN--KYHPYTSWIPI 416
L + N + H Y +IPI
Sbjct: 723 SFSFLCRNVQNCEEIHSYILFIPI 746
>gi|18543323|ref|NP_570085.1| CG2938, isoform B [Drosophila melanogaster]
gi|442615118|ref|NP_001259226.1| CG2938, isoform C [Drosophila melanogaster]
gi|7290452|gb|AAF45907.1| CG2938, isoform B [Drosophila melanogaster]
gi|17862336|gb|AAL39645.1| LD22456p [Drosophila melanogaster]
gi|220952842|gb|ACL88964.1| CG2938-PB [synthetic construct]
gi|440216421|gb|AGB95072.1| CG2938, isoform C [Drosophila melanogaster]
Length = 862
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 150/324 (46%), Gaps = 24/324 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ G IL YFY+CDRTN K Y+ F ++ + V A+ + S F+
Sbjct: 437 ALSLLGLILAYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLF--FTEDSRFT- 490
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F++ +
Sbjct: 491 ---KVLNRDQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHMW 547
Query: 223 IRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
RF Q M+R NF C +N Y YY P+ + + +VY + + +
Sbjct: 548 QTGGSGSLFVRFFQAMFRANFLSVLLCFCMNRPYQFYYFVPLLSFWLCIVYFVLALPPRI 607
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY-TDPAKP----DLPRLHEW 335
+ SV + +LV I + G+ +F S + F + T P K LHEW
Sbjct: 608 SS-ASVDANPLHYLYLVCKCIGCLGGITVLFMSEVFFERIFVTRPWKALFVTTDDDLHEW 666
Query: 336 HFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYE 394
+ LDRY GMIYA A+K+ +++ S + I VT+ +G Y
Sbjct: 667 WHQWKLDRYTVAFGMIYAACFHIAQKY-NVFDDNNHGNLFSRRTSISVTLLALLGVGVYT 725
Query: 395 CIYKLDKVTYN--KYHPYTSWIPI 416
L + N + H Y +IPI
Sbjct: 726 SFSFLCRNVQNCEEIHSYILFIPI 749
>gi|195340919|ref|XP_002037060.1| GM12710 [Drosophila sechellia]
gi|194131176|gb|EDW53219.1| GM12710 [Drosophila sechellia]
Length = 862
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 150/324 (46%), Gaps = 24/324 (7%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ G IL YFY+CDRTN K Y+ F ++ + V A+ + S F+
Sbjct: 437 ALSLLGLILAYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLF--FTEDSRFT- 490
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F++ +
Sbjct: 491 ---KVLNRDQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHMW 547
Query: 223 IRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
RF Q M+R NF C +N Y YY P+ + + +VY + + +
Sbjct: 548 QTGGSGSLFVRFFQAMFRANFLSVLLCFCMNRPYQFYYFVPLLSFWLCIVYFVLALPPRL 607
Query: 281 NEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGY-TDPAKP----DLPRLHEW 335
+ SV + +LV I + G+ +F S + F + T P K LHEW
Sbjct: 608 SS-ASVDANPLHYLYLVCKCIGCLGGITVLFMSEVFFERIFVTRPWKALFVTTDDDLHEW 666
Query: 336 HFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYE 394
+ LDRY GMIYA A+K+ +++ S + I VT+ +G Y
Sbjct: 667 WHQWKLDRYTVAFGMIYAACFHIAQKY-NVFDDNNHGNLFSRRTSISVTLLALLGVGVYT 725
Query: 395 CIYKLDKVTYN--KYHPYTSWIPI 416
L + N + H Y +IPI
Sbjct: 726 SFSFLCRNVQNCEEIHSYILFIPI 749
>gi|422293726|gb|EKU21026.1| hypothetical protein NGA_2075300, partial [Nannochloropsis gaditana
CCMP526]
Length = 63
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 39/62 (62%), Positives = 48/62 (77%)
Query: 197 EIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYML 256
E+YNAIRI I YVWMTGFGNFS++YI++DF + RF QM+WRLNF V C+ N Y+L
Sbjct: 2 EVYNAIRIMITCYVWMTGFGNFSFFYIKQDFGVVRFLQMLWRLNFLVILLCLSQGNTYIL 61
Query: 257 YY 258
YY
Sbjct: 62 YY 63
>gi|256083149|ref|XP_002577812.1| hypothetical protein [Schistosoma mansoni]
gi|353230313|emb|CCD76484.1| hypothetical protein Smp_157560 [Schistosoma mansoni]
Length = 1067
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 97/188 (51%), Gaps = 42/188 (22%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A+++FGAI+ YF+ICDRT L + K Y +L FL + V + FSG
Sbjct: 472 ALSKFGAIMMYFFICDRTILFMKANKRYT-NLSFFLPTIYCFVLGLF----------FSG 520
Query: 166 KTIQYLNRHQ--TEEWKGWMQVLFLMYHY---FAATEIYNAIRIFIAAYVWMTGFGNFSY 220
T + H T EWKGWMQ+ L+YH+ + T I+ + R+ +++Y++++GFG+F Y
Sbjct: 521 PTKRSQVNHLDITREWKGWMQLYLLIYHFTDSYRVTPIFMSARLVVSSYLFLSGFGHFCY 580
Query: 221 YYIR-----------------KDFS---------LPRFAQMMWRLNFFVAFCCIVLNNDY 254
++ + ++F L R+ +++R NFFV C+V+N Y
Sbjct: 581 FWRKPVPQINWLKLLNYRRCSREFCTAWKALWQILHRYLIVIYRFNFFVFGLCLVMNRGY 640
Query: 255 MLYYICPM 262
+ YY P+
Sbjct: 641 LAYYFIPL 648
>gi|195059887|ref|XP_001995716.1| GH17907 [Drosophila grimshawi]
gi|193896502|gb|EDV95368.1| GH17907 [Drosophila grimshawi]
Length = 907
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/342 (26%), Positives = 158/342 (46%), Gaps = 37/342 (10%)
Query: 95 AFLLENRAT-LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTS 153
A L+ + T L A++ G I+ YFY+CDRTN K Y+ F ++ + V A+
Sbjct: 470 AVLITDYGTPLVALSLLGVIMGYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGL 526
Query: 154 LKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI---YNAIRIFIAAYV 210
+ S F+ + LN+ QT+E +GW+ ++ L+Y+ A I + I++ I+ Y
Sbjct: 527 F--FTEDSRFT----KVLNQDQTDELRGWILLVVLIYYMTGAQRILPIHMHIKLLISGYF 580
Query: 211 WMTGFGNFSYYYIRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTI 268
++TG+ +F++ + RF Q M+R+NF C +N Y YY P+ + +
Sbjct: 581 FLTGYTHFTHVWQTGGSGSMFVRFFQAMFRVNFLSVLLCFCMNRPYQFYYFVPLLSFWLC 640
Query: 269 MVYGAVGIFNKYNEIGSV---------MIVKILACFLVVILIWEIPGVFD-IFWS-PLTF 317
+VY + + + SV ++ K + C V+ +++ F+ IF + P
Sbjct: 641 IVYFVLSLPPRITA-ASVEANPLHYLYLVCKCIGCLGVITVLFMSEVFFERIFVTRPWKA 699
Query: 318 ILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSI 377
+ TD +HEW ++ LDRY GMIYA A K+ +++ S
Sbjct: 700 LFVTTDDD------IHEWWYQWKLDRYTVTFGMIYAACFHIAHKY-NVFDDNNHGNLFSR 752
Query: 378 KAGI-VTVALFVGYLWYECIYKLDKVTYN--KYHPYTSWIPI 416
+ I VT+ +G Y L + N + H Y +IPI
Sbjct: 753 RTSISVTLLALLGVGIYTSFSFLCRNVQNCEEIHSYIVFIPI 794
>gi|198469952|ref|XP_001355164.2| GA15532 [Drosophila pseudoobscura pseudoobscura]
gi|198147113|gb|EAL32221.2| GA15532 [Drosophila pseudoobscura pseudoobscura]
Length = 866
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/332 (27%), Positives = 153/332 (46%), Gaps = 40/332 (12%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ G I+ YFY+CDRTN K Y+ F ++ + V A+ + S F+
Sbjct: 441 ALSLLGLIMVYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLF--FTEDSRFT- 494
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F++ +
Sbjct: 495 ---KVLNRDQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHLW 551
Query: 223 IRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
RF Q M+R+NF C +N Y YY P+ + + +VY + + +
Sbjct: 552 QTGGSGSLFVRFFQAMFRVNFLSVLLCFCMNRPYQFYYFVPLLSFWLCIVYFVLSLPPRI 611
Query: 281 NEIGSV---------MIVKILACFLVVILIWEIPGVFD-IFWS-PLTFILGYTDPAKPDL 329
+ SV ++ K + C V+ +++ F+ IF + P + TD
Sbjct: 612 SS-ASVDSNPLHYLYLVCKCIGCLGVITVLFMSEVFFERIFVTRPWKALFVTTDDD---- 666
Query: 330 PRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK---RKLSIKAGIVTVAL 386
LHEW + LDRY GM+YA A+K+ + + R+ SI VT+
Sbjct: 667 --LHEWWHQWKLDRYTVTFGMMYAACFHIAQKYNVFDDNNHGNLFARRTSIS---VTLLA 721
Query: 387 FVGYLWYECIYKLDKVTYN--KYHPYTSWIPI 416
+G Y L + N + H Y +IPI
Sbjct: 722 LLGVGIYTSFSFLCRNVQNCEEIHSYILFIPI 753
>gi|195163948|ref|XP_002022811.1| GL14546 [Drosophila persimilis]
gi|194104834|gb|EDW26877.1| GL14546 [Drosophila persimilis]
Length = 825
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 153/332 (46%), Gaps = 40/332 (12%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
A++ G I+ YFY+CDRTN K Y+ F ++ + V A+ + S F+
Sbjct: 441 ALSLLGLIMVYFYLCDRTNFFMKENKYYSEFSF---WIPVGYVFALGLF--FTEDSRFT- 494
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYY 222
+ LNR QT+E +GW+ ++ L+Y+ A I+ I++ I+ Y ++TG+ +F++ +
Sbjct: 495 ---KVLNRDQTDELRGWILLVVLIYYMTGAQRVLPIHMHIKLLISGYFFLTGYTHFTHLW 551
Query: 223 IRKDFS--LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKY 280
RF Q M+R+NF C +N Y YY P+ + + ++Y + + +
Sbjct: 552 QTGGSGSLFVRFFQAMFRVNFLSVLLCFCMNRPYQFYYFVPLLSFWLCIIYFVLSLPPRI 611
Query: 281 NEIGSV---------MIVKILACFLVVILIWEIPGVFD-IFWS-PLTFILGYTDPAKPDL 329
+ SV ++ K + C V+ +++ F+ IF + P + TD
Sbjct: 612 SS-ASVDSNPLHYLYLVCKCIGCLGVITVLFMSEVFFERIFVTRPWKALFVTTDDD---- 666
Query: 330 PRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLEESEPK---RKLSIKAGIVTVAL 386
LHEW + LDRY GM+YA A+K+ + + R+ SI VT+
Sbjct: 667 --LHEWWHQWKLDRYTVTFGMMYAACFHIAQKYNVFDDNNHGNLFARRTSIS---VTLLA 721
Query: 387 FVGYLWYECIYKLDKVTYN--KYHPYTSWIPI 416
+G Y L + N + H Y +IPI
Sbjct: 722 LLGVGIYTSFSFLCRNVQNCEEIHSYILFIPI 753
>gi|296417900|ref|XP_002838585.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634535|emb|CAZ82776.1| unnamed protein product [Tuber melanosporum]
Length = 838
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 152/313 (48%), Gaps = 26/313 (8%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
FG ++ Y + DRT + G S K ++ L+L++ + + + ++++ +P +
Sbjct: 369 FGLVVIYCFYADRTQVFGKSQKQFHSSDMLWLFVGTLFL-GLIAIRR---TAPAATSDQP 424
Query: 170 YLNRHQTEEWKGWMQVLFLMYHYFAATEIY--NAI-RIFIAAYVWMTGFGNFSYYYIR-K 225
L+R Q EWKG + L+ Y E NA+ R+ +AAY+++ G+G +S Y +R +
Sbjct: 425 LLSREQWYEWKGLLLATMLICDYTGGIESAKINAVYRLSVAAYLFLNGYG-YSLYLLRSQ 483
Query: 226 DFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGS 285
D++L R ++ RLNF A ++ +Y+ YY+ P+ T + +++Y + I K+N
Sbjct: 484 DYTLKRCVAVLIRLNFLSALLPYMMGTNYVFYYLAPLATYWFLIIYLTLWIGEKHNSNVK 543
Query: 286 VMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRYI 345
++ KI F+ I +PG+ +I + + G ++ D+ +R + +
Sbjct: 544 FLLGKI---FVSAIATRLLPGILEIVF----LVFGAAAGSQWDV-----LVWRDYIVSDL 591
Query: 346 WII--GMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVT 403
W++ GM+ + A E S L+ + + AL + + + +
Sbjct: 592 WMVYTGMVGSVLFSRAN---ENSYTSTRWFLLARRYAVAISALAILLFYLFLASRGEDGE 648
Query: 404 YNKYHPYTSWIPI 416
Y+ YHPY S+IP+
Sbjct: 649 YSAYHPYISFIPV 661
>gi|310798434|gb|EFQ33327.1| Cas1p-like protein [Glomerella graminicola M1.001]
Length = 889
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 91/390 (23%), Positives = 175/390 (44%), Gaps = 48/390 (12%)
Query: 77 LLSSSIKTNLIRFMTMDDAFLLENRAT-----LRAMAEFGAILFYFYICDRTNLLGDSTK 131
LL +I T I F++ +R + L A+A + Y + DRT++ K
Sbjct: 360 LLCLNIMTLPILFLSRQQGAFPSSRTSRVMDILTAVATLLVVAVYCLVADRTHVFAKQDK 419
Query: 132 NYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG-KTIQYLNRHQTEEWKGWMQVLFLMY 190
++ F+ + L +++A++ + + G T +L+R Q +EW+GWM V L+Y
Sbjct: 420 QFDVPDFIAPLIGLAVLAALSLRSRAPVLTSTRGLTTTSFLSREQADEWRGWMLVFVLVY 479
Query: 191 HYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCC 247
+Y AA+E +Y + +A ++++ +G+ Y+ +D+SL R A ++ RLN
Sbjct: 480 NYHAASESLAMYKVHKFLVATFIFLFVYGHTMYFLRTEDYSLRRVAYVLCRLNLLTCLLA 539
Query: 248 IVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGV 307
+++++++ Y P+ T + + Y ++ K N +KI+A + + E
Sbjct: 540 FTMSSEWII-YTAPLMTFWFGVTYASLACCKKANSNPVAFFLKIVAFATCTVYLVESTRA 598
Query: 308 FDIFWSPLTFILG-YTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYA------------Y 354
+ S L G Y D + + HF + DR+I G++ A
Sbjct: 599 PEWLMSMLKATCGIYWDLS------ILRGHFET--DRFIPWFGILTAAATHRVSVLRRRQ 650
Query: 355 YHPTAEKWMEK---------LEESEPKRK-LSIKAGIVT-----VALFVGYLWYECIYKL 399
+ A+ EK LE P+R + +K ++ + +F+ + +Y+
Sbjct: 651 HGINAQASFEKTNNALDHALLEIVYPERDAIPVKPIMILFSFAYLVIFIILAFVAEVYR- 709
Query: 400 DKVTYNKYHPYTS-WIPITYVLFIFYFFSL 428
+ +YN YHPYTS W ++ ++ F SL
Sbjct: 710 NNSSYNAYHPYTSPWFVLSAIIARNSFRSL 739
>gi|358340999|dbj|GAA48781.1| CAS1 domain-containing protein 1 [Clonorchis sinensis]
Length = 1128
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 93/197 (47%), Gaps = 43/197 (21%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
+++FG IL YF+ICDRT L + K + FL L ++ FSG
Sbjct: 374 TLSQFGLILVYFFICDRTVLFMKTNKGFTTMSFLLPMTYLFVLGLF-----------FSG 422
Query: 166 KTIQYLNRHQ--TEEWKGWMQVLFLMYHYFAATEIYNA---IRIFIAAYVWMTGFGNF-- 218
T + H T EWKGWMQ+ L+YH+ + + R F +AY++++GFG+F
Sbjct: 423 PTKETRLNHVDITREWKGWMQLYLLVYHFTGSYRVLPLRLFTRFFTSAYLFLSGFGHFYC 482
Query: 219 ----------SYYYIRKDFSLP--------------RFAQMMWRLNFFVAFCCIVLNNDY 254
+ +R FSL R+ +++RLNF V C+V+N DY
Sbjct: 483 LWHHPLPGSLIWEVLRLRFSLQGLYFTAQAWWAMLRRYVDVVFRLNFLVLGLCLVMNRDY 542
Query: 255 MLYYICPMHTL-FTIMV 270
M Y+ P+ T FT+ +
Sbjct: 543 MFYHFMPLVTFWFTVTM 559
>gi|443925755|gb|ELU44524.1| O-acetyltransferase [Rhizoctonia solani AG-1 IA]
Length = 832
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 54/314 (17%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQ 169
FG + DRT + K ++ F L L +V+ + ++K+ + K +
Sbjct: 373 FGVACSLLFFADRTGIWLKEHKQFDPWAFGGLSAL-ALVTGLATMKRAD-------KDLG 424
Query: 170 YLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSL 229
+LNR QT+EWKGWMQ T IY R F+ + F
Sbjct: 425 FLNREQTDEWKGWMQ----------RTHIYCRPR--------------FANSSLLSHFGF 460
Query: 230 PRFAQMMWRLNFFVAFCCIVLNNDY--MLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVM 287
R AQ++ RLN +N +YY P+ + + +++Y + + KYN+ + +
Sbjct: 461 SRVAQILVRLNLLTIALAYAMNTGQCGQIYYFAPLVSWWYLIIYFTLFVGAKYNDRTAFV 520
Query: 288 IVKILACFLVVILI----WEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDR 343
+ KI + W + +F+I TF + + EW+FR LD+
Sbjct: 521 LAKITVSMALHTWFMHESWILKELFEILEG--TFQIEWV---------AREWNFRVTLDQ 569
Query: 344 YIWIIGMIYAYYHPTAEKWMEKLEESEPKRKLSIKAGI-VTVALFVGYLWYECIYKLDKV 402
+I +GM+ A K E P L+ + I +V + W+E + + DK
Sbjct: 570 FIVYVGMLTAIAF---LKIREIRLTDHPSWPLASRCAIGASVFSLFWFFWFE-LTRTDKF 625
Query: 403 TYNKYHPYTSWIPI 416
YN +HPY S IP+
Sbjct: 626 AYNAWHPYVSAIPV 639
>gi|380496319|emb|CCF31802.1| Cas1p-like protein [Colletotrichum higginsianum]
Length = 812
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 108/201 (53%), Gaps = 11/201 (5%)
Query: 103 TLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKH--NDK 160
TL A+A + Y + DRT++L K+++ F+ + L +++A+ SL+ +
Sbjct: 310 TLMAVATLLVVAVYCLLADRTHVLAKQDKHFDVPDFVLPLVGLAVLAAL-SLRSRVPTTQ 368
Query: 161 SPFS-GKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFG 216
SP T +L+R QT+EWKGWM LMY+Y AA+E ++ A + +A ++++ +
Sbjct: 369 SPTRIPTTTSFLSREQTDEWKGWMLAFILMYNYHAASESSAMFKAHKFIVATFIFLFTYS 428
Query: 217 NFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI 276
+ Y+ +D+S R A ++ RLN +++++++ Y P+ T + + Y ++
Sbjct: 429 HTMYFLRTEDYSFRRVAYVLLRLNLLTCLLVFTMSSEWVI-YTAPLVTFWFGVTYTSLAC 487
Query: 277 FNKYNE--IG-SVMIVKILAC 294
F + N +G S+ IV AC
Sbjct: 488 FKRANSNPVGFSLKIVAFAAC 508
>gi|91992721|gb|AAH38009.1| Casd1 protein [Mus musculus]
Length = 340
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 118/234 (50%), Gaps = 21/234 (8%)
Query: 202 IRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICP 261
+R + AY++ TG+G+FSY++I+ DF + R Q+++RLNF V CIV++ Y YY P
Sbjct: 3 VRPRVRAYLFQTGYGHFSYFWIKGDFGIHRVCQVLFRLNFLVVVLCIVMDRPYQFYYFVP 62
Query: 262 MHTLFTIMVYGAVGIFNKYNEIGSV-----MIVKILACFLVVILIWEIP---GVFDIFWS 313
+ T++ +++Y + ++ + + + + +L L+++ IW + G F+ +S
Sbjct: 63 LVTVWFMVIYVTLALWPQITQKKANGNFFWYLGLLLKLGLLLLCIWFLAYSQGAFEKIFS 122
Query: 314 --PLTFILGYTDPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAY-YHPTAEKWMEKLEESE 370
PL+ ++EW FR LDRY+ G+++A+ Y + + + E
Sbjct: 123 LWPLSKCFELEG-------SVYEWWFRWRLDRYVVFHGVLFAFIYLALQRRQILSEGKGE 175
Query: 371 P--KRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
P K+S V+V F+ Y + K +K N+ HP S + I + I
Sbjct: 176 PLFSNKISNFLLFVSVVSFLTYSIWASSCK-NKAECNELHPSVSVVQIVAFILI 228
>gi|387212637|gb|AFJ69146.1| hypothetical protein NGATSA_2051500, partial [Nannochloropsis
gaditana CCMP526]
Length = 71
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 9/80 (11%)
Query: 119 ICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEE 178
+C++ L K+++ D++ FL LLL++ S MT K GK LNR QTEE
Sbjct: 1 MCEKYPLFPRGEKSWDGDMYWFLCLLLLVASVMTMRK---------GKCTDILNRDQTEE 51
Query: 179 WKGWMQVLFLMYHYFAATEI 198
WKGWMQ +FL+YHY+ A E+
Sbjct: 52 WKGWMQFMFLLYHYYKAEEV 71
>gi|402083792|gb|EJT78810.1| hypothetical protein, variant [Gaeumannomyces graminis var. tritici
R3-111a-1]
gi|402083793|gb|EJT78811.1| hypothetical protein GGTG_03908 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1004
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 93/194 (47%), Gaps = 9/194 (4%)
Query: 112 AILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYL 171
A L F + R G ++ N+ L + S+ +S + ++ T+ +L
Sbjct: 430 AALSAFSLRKRGQTRGQNSAGRNKPAS---TATLALRSSGSSARAMPGSPQYNENTV-FL 485
Query: 172 NRHQTEEWKGWMQVLFLMYHYFAATE---IYNAIRIFIAAYVWMTGFGNFSYYYIRKDFS 228
R ++EWKGWMQ L LM + ++ Y R+ AAY+++ +G+ Y D+S
Sbjct: 486 PREVSDEWKGWMQALVLMLSFQELSDQPWAYKLFRLISAAYLFICTYGHAMYLLRTCDYS 545
Query: 229 LPRFAQMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGI-FNKYNEIGSVM 287
L R A ++WRLN + + +D LY+ + + + + + + I + ++N S++
Sbjct: 546 LRRVATVVWRLNALPFIVSMAVGSDGTLYHFPRLASFWFFITWLTLRIGWRRFNGSPSIL 605
Query: 288 IVKIL-ACFLVVIL 300
I+K+ A F +L
Sbjct: 606 ILKVFAAAFFTTVL 619
>gi|440475955|gb|ELQ44601.1| hypothetical protein OOU_Y34scaffold00071g17 [Magnaporthe oryzae
Y34]
gi|440487788|gb|ELQ67563.1| hypothetical protein OOW_P131scaffold00314g136 [Magnaporthe oryzae
P131]
Length = 895
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 100/221 (45%), Gaps = 32/221 (14%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK----------HND 159
F A++FY Y+ DRT L + + + L + L++ A SL++
Sbjct: 322 FLAVIFYCYLADRTQLFPKTVRRLSAG-GLVISLIVGGAFAAASLRRWRPRLAPSHISER 380
Query: 160 KSPFSGKTIQ----------------YLNRHQTEEWKGWMQVLFLMYHYFAATEI--YNA 201
K P S ++ +L R +EEWKGWMQ L +++ Y ++ +N
Sbjct: 381 KRPGSVPPLKPRQRKLPRLEHDDDPGFLPRELSEEWKGWMQGLLVVFSYQELSDQLWFNK 440
Query: 202 I-RIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYIC 260
+ R+ AAY+++ +G+ +Y+ D S R ++WRLN + LN LY
Sbjct: 441 LSRLPPAAYLFILAYGHTTYFLRTDDLSFKRVVSVIWRLNALPFLVSMALNVTGGLYSFP 500
Query: 261 PMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILI 301
+ T + ++ Y + ++N +++++KI C V I
Sbjct: 501 RLATFWFVVAYFTLRCGQRFNSNPTLVLLKI--CLAAVATI 539
>gi|389629660|ref|XP_003712483.1| hypothetical protein MGG_04984 [Magnaporthe oryzae 70-15]
gi|351644815|gb|EHA52676.1| hypothetical protein MGG_04984 [Magnaporthe oryzae 70-15]
Length = 975
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 100/221 (45%), Gaps = 32/221 (14%)
Query: 110 FGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKK----------HND 159
F A++FY Y+ DRT L + + + L + L++ A SL++
Sbjct: 402 FLAVIFYCYLADRTQLFPKTVRRLSAG-GLVISLIVGGAFAAASLRRWRPRLAPSHISER 460
Query: 160 KSPFSGKTIQ----------------YLNRHQTEEWKGWMQVLFLMYHYFAATEI--YNA 201
K P S ++ +L R +EEWKGWMQ L +++ Y ++ +N
Sbjct: 461 KRPGSVPPLKPRQRKLPRLEHDDDPGFLPRELSEEWKGWMQGLLVVFSYQELSDQLWFNK 520
Query: 202 I-RIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFFVAFCCIVLNNDYMLYYIC 260
+ R+ AAY+++ +G+ +Y+ D S R ++WRLN + LN LY
Sbjct: 521 LSRLPPAAYLFILAYGHTTYFLRTDDLSFKRVVSVIWRLNALPFLVSMALNVTGGLYSFP 580
Query: 261 PMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILI 301
+ T + ++ Y + ++N +++++KI C V I
Sbjct: 581 RLATFWFVVAYFTLRCGQRFNSNPTLVLLKI--CLAAVATI 619
>gi|392347148|ref|XP_003749742.1| PREDICTED: CAS1 domain-containing protein 1-like [Rattus
norvegicus]
Length = 717
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 51/93 (54%), Gaps = 9/93 (9%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F + ++++ +N+ +
Sbjct: 368 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT-- 420
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAAT 196
K + LNR QT+EWKGWMQ++ L+YH A+
Sbjct: 421 --KETKVLNREQTDEWKGWMQLVILIYHISGAS 451
>gi|213982957|ref|NP_001135640.1| CAS1 domain containing 1 [Xenopus (Silurana) tropicalis]
gi|197245560|gb|AAI68493.1| Unknown (protein for MGC:173009) [Xenopus (Silurana) tropicalis]
Length = 345
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 9/97 (9%)
Query: 106 AMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSG 165
++++ G I+ YFY CDR NL K Y F + ++++ +N+ +
Sbjct: 234 SLSKLGLIMAYFYFCDRANLFMKENKFYTHSSFFIPIIYILVLGVF-----YNENT---- 284
Query: 166 KTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAI 202
K + LNR QT+EWKGWMQ++ L+YH A+ + ++
Sbjct: 285 KEAKLLNREQTDEWKGWMQLVILIYHISGASSFFASV 321
>gi|156406044|ref|XP_001641041.1| predicted protein [Nematostella vectensis]
gi|156228178|gb|EDO48978.1| predicted protein [Nematostella vectensis]
Length = 725
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 103/225 (45%), Gaps = 31/225 (13%)
Query: 234 QMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSV------- 286
++M R+N F C+V+ Y YY P+ + + +++Y + F + + SV
Sbjct: 462 EVMTRMNLFTVVLCLVMGRPYQFYYFVPLISFWFVVIYATMVFFPRVSA-SSVREDPKQY 520
Query: 287 --MIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDPAKPDLPRLHEWHFRSGLDRY 344
+ +K F + ++W P +FD +S + D + + EW FRS LDRY
Sbjct: 521 IFIWLKFFVLFGTIYILWSSPILFDWVFSQWAVKQLFID----ENDSVREWRFRSWLDRY 576
Query: 345 IWIIGMIYAYYHPTAEKW------MEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYK 398
I + GM++ + + TA+ + + K K+++ +V +A++ G + C
Sbjct: 577 IVLYGMVFGFAYHTAKHFKIFDDTLRKGLFKSLHSKVTMVMSVVALAVY-GIQAFTCS-- 633
Query: 399 LDKVTYNKYHPYTSWIPITYVLFIFYFFSLVKHLSGSLYMMACRY 443
+K + N H S IPIT + L++++ GS+ R+
Sbjct: 634 -NKPSCNATHSVASCIPITAYI-------LLRNVPGSMRSRFSRF 670
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMT 152
+R MA+ G I+ YFY+CDRTNL K Y+ F L +I+ A T
Sbjct: 408 MRYMAKLGIIMLYFYLCDRTNLFFKEQKQYSNTAFFLSMLGFLILGAYT 456
>gi|444708000|gb|ELW49128.1| CAS1 domain-containing protein 1 [Tupaia chinensis]
Length = 545
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 61/131 (46%), Gaps = 36/131 (27%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F F+ ++ ++V +
Sbjct: 337 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSF-FIPIIYILVLGV------------ 383
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYI 223
+ N + E FL +Y IR+ +AAY++ TG+G+FSY++I
Sbjct: 384 ------FYNENTKE---------FL--------PVYMHIRVLVAAYLFQTGYGHFSYFWI 420
Query: 224 RKDFSLPRFAQ 234
+ DF + R Q
Sbjct: 421 KGDFGIHRVCQ 431
>gi|302410845|ref|XP_003003256.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261358280|gb|EEY20708.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 575
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 1/98 (1%)
Query: 102 ATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLL-VIVSAMTSLKKHNDK 160
+ L A+A AI Y Y+ DRT+L K + + LLL V+ +A +
Sbjct: 469 SALIAVAYMLAIAGYCYLADRTHLFDKIAKQFETSSVAWSSLLLAVLATASVRSSRAGFL 528
Query: 161 SPFSGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEI 198
+P K + +L R QTEE KG MQ L+Y + A+EI
Sbjct: 529 APIHHKPVSFLGRDQTEELKGLMQGFLLLYDFHGASEI 566
>gi|344239621|gb|EGV95724.1| Collagen alpha-2(I) chain [Cricetulus griseus]
Length = 621
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 61/131 (46%), Gaps = 36/131 (27%)
Query: 104 LRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPF 163
L++ + G I+ YFY+CDR NL K Y F F+ ++ ++V +
Sbjct: 464 LQSFCKLGLIMAYFYMCDRANLFMKENKFYTHSSF-FIPIIYILVLGV------------ 510
Query: 164 SGKTIQYLNRHQTEEWKGWMQVLFLMYHYFAATEIYNAIRIFIAAYVWMTGFGNFSYYYI 223
+ N + E FL +Y IR+ +AAY++ TG+G+FSY++I
Sbjct: 511 ------FYNENTKE---------FL--------PVYMHIRVLVAAYLFQTGYGHFSYFWI 547
Query: 224 RKDFSLPRFAQ 234
+ DF + R Q
Sbjct: 548 KGDFGIYRVCQ 558
>gi|147820965|emb|CAN63518.1| hypothetical protein VITISV_017846 [Vitis vinifera]
Length = 328
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 31/49 (63%)
Query: 93 DDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLFLFL 141
D ENR+TLR M EFGAI YFY+CD LL S+K+ N D ++ +
Sbjct: 174 DSGHHHENRSTLRTMFEFGAIPTYFYVCDSVPLLLHSSKSPNEDPWILI 222
>gi|302422892|ref|XP_003009276.1| CAS1 domain-containing protein [Verticillium albo-atrum VaMs.102]
gi|261352422|gb|EEY14850.1| CAS1 domain-containing protein [Verticillium albo-atrum VaMs.102]
Length = 377
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 35/49 (71%)
Query: 194 AATEIYNAIRIFIAAYVWMTGFGNFSYYYIRKDFSLPRFAQMMWRLNFF 242
A+T IY +R+ +AAY++ TG+G+ +++ ++KDFS R A +M RLN
Sbjct: 326 ASTGIYIFVRLLVAAYLFQTGYGHTTFFLVKKDFSFKRMASVMLRLNLL 374
>gi|147801743|emb|CAN76873.1| hypothetical protein VITISV_017984 [Vitis vinifera]
Length = 316
Score = 45.4 bits (106), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 37/97 (38%), Gaps = 40/97 (41%)
Query: 265 LFTIMVYGAVGIFNKYNEIGSVMIVKILACFLVVILIWEIPGVFDIFWSPLTFILGYTDP 324
+ T + +G GI +V + C V + W W PL
Sbjct: 33 VLTALKFGVTGI-----------VVVVCMCSYVAVFAWS--------WGPLG-------- 65
Query: 325 AKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYHPTAEK 361
R GLDRYIWIIGMIYAYYH EK
Sbjct: 66 -------------RFGLDRYIWIIGMIYAYYHLNVEK 89
>gi|413919161|gb|AFW59093.1| hypothetical protein ZEAMMB73_800081 [Zea mays]
Length = 821
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 26/43 (60%), Gaps = 15/43 (34%)
Query: 360 EKWMEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKV 402
E+WMEKLEESE K VG+LWYE IYKLDKV
Sbjct: 348 ERWMEKLEESETK---------------VGFLWYEHIYKLDKV 375
>gi|414875740|tpg|DAA52871.1| TPA: putative RNA helicase family protein [Zea mays]
Length = 484
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 31/63 (49%), Gaps = 15/63 (23%)
Query: 363 MEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
MEKLEESE K VG+LWYE IYKLDK + H I T +++
Sbjct: 1 MEKLEESETK---------------VGFLWYEHIYKLDKGPETQMHTLAKNIAGTSLVYT 45
Query: 423 FYF 425
YF
Sbjct: 46 KYF 48
>gi|414875742|tpg|DAA52873.1| TPA: putative RNA helicase family protein [Zea mays]
Length = 940
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 31/63 (49%), Gaps = 15/63 (23%)
Query: 363 MEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
MEKLEESE K VG+LWYE IYKLDK + H I T +++
Sbjct: 1 MEKLEESETK---------------VGFLWYEHIYKLDKGPETQMHTLAKNIAGTSLVYT 45
Query: 423 FYF 425
YF
Sbjct: 46 KYF 48
>gi|414875741|tpg|DAA52872.1| TPA: putative RNA helicase family protein [Zea mays]
Length = 911
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 31/63 (49%), Gaps = 15/63 (23%)
Query: 363 MEKLEESEPKRKLSIKAGIVTVALFVGYLWYECIYKLDKVTYNKYHPYTSWIPITYVLFI 422
MEKLEESE K VG+LWYE IYKLDK + H I T +++
Sbjct: 1 MEKLEESETK---------------VGFLWYEHIYKLDKGPETQMHTLAKNIAGTSLVYT 45
Query: 423 FYF 425
YF
Sbjct: 46 KYF 48
>gi|387219833|gb|AFJ69625.1| hypothetical protein NGATSA_2045200, partial [Nannochloropsis
gaditana CCMP526]
Length = 93
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 47/94 (50%), Gaps = 5/94 (5%)
Query: 276 IFNKYNEIGSVMIVKILACFLVVILIWEI--PGVFDIFWSPLTFILGYTDPAKPDLPRLH 333
+ K+N + +K+L L++ W++ VF + +SP L L
Sbjct: 3 VLPKHNHGKWDVRLKLLGVGLLIYAAWDLFEGEVFKVLFSPF---LSTAPVIGAKAGTLW 59
Query: 334 EWHFRSGLDRYIWIIGMIYAYYHPTAEKWMEKLE 367
EW+FR+ LD + ++GMI+A P A +W+ KLE
Sbjct: 60 EWYFRTSLDHWSTLLGMIFALNFPMATRWLTKLE 93
>gi|392389975|ref|YP_006426578.1| cell division membrane protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390521053|gb|AFL96784.1| bacterial cell division membrane protein [Ornithobacterium
rhinotracheale DSM 15997]
Length = 399
Score = 41.2 bits (95), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 77/175 (44%), Gaps = 13/175 (7%)
Query: 10 GQVSFLLGIIPVFVAWIYSEFLEYKKVSS-HTKVHSDTNLVELEKETIKEDD-RAVLLEG 67
G LLG + VFVA Y + + +V + ++ TN E E+ +E + + ++EG
Sbjct: 195 GSTVVLLGFLFVFVALNYGDAIPNSRVHTWKNRIERFTNPSENSLESWQETNAKTAIVEG 254
Query: 68 GLSRSASARLLSSSIKTNLIRFMTMDD---AFLLENRATLRAMAEFGAILFYFYICDRTN 124
G++ + S+IK L + D A ++E L A AI FY I R
Sbjct: 255 GITGKGPGK---SAIKHTLPQ--ASSDFIFAIIVEEYGLLGGSA---AIFFYLLILWRIV 306
Query: 125 LLGDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEW 179
++ +N+ L +F L +I A+ ++ P +G+ + ++ T W
Sbjct: 307 VIATKVQNFFGTLLVFALGLPIIFQAIINIGVAVGLFPTTGQPLPMISFGGTSLW 361
>gi|397634780|gb|EJK71574.1| hypothetical protein THAOC_06967 [Thalassiosira oceanica]
Length = 1122
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 91/212 (42%), Gaps = 34/212 (16%)
Query: 79 SSSIKTNLIRFMTMDDAFLLENRATLRAMAEFGAILFYFYICDRTNLLGDSTKNYNRDLF 138
SSS + + +DD FL E + F +IL I D + D T N
Sbjct: 227 SSSRDRDNLEEKQLDDVFLDEGTS-----KNFHSILNT--IEDEIAEIRDVT---NASWL 276
Query: 139 LFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVLFLMYH------- 191
L LL + S KK + + + G + LN QT E KG++ V L+Y
Sbjct: 277 EKLLNLLGFDLDLQSKKKSDLREVWPGADL--LNSFQTSEMKGFLSVALLIYRFSSLGPS 334
Query: 192 ---YFAATEIYNAIRIFIA-----AYVWMTGFGNFSYYY-----IRKDFSLPRFAQMMWR 238
Y + I + ++++A A++++TG+ + SY+Y ++ +PR ++R
Sbjct: 335 NFDYHSDQSIDSEAKMYLAKVATTAFLFLTGYCHASYFYYSSSKKQEREEVPRLVGTVFR 394
Query: 239 LNFFVAFCCIVLNNDYMLYYICPMHTLFTIMV 270
+NF F +V+ Y P+ +IMV
Sbjct: 395 INFSAMFLSLVIGK--AEYIALPVLHTISIMV 424
>gi|323453019|gb|EGB08891.1| hypothetical protein AURANDRAFT_63390 [Aureococcus anophagefferens]
Length = 803
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 7/70 (10%)
Query: 127 GDSTKNYNRDLFLFLYLLLVIVSAMTSLKKHNDKSPFSGKTIQYLNRHQTEEWKGWMQVL 186
G + + N DL+ F+ LL +VS + + T +L+R Q EWKGWMQV
Sbjct: 234 GTTLDSENPDLWAFIMALLFLVSLL-------NVEVLEEGTEVFLSRAQANEWKGWMQVA 286
Query: 187 FLMYHYFAAT 196
F+ Y T
Sbjct: 287 FVAYRRGVRT 296
>gi|238573035|ref|XP_002387311.1| hypothetical protein MPER_14036 [Moniliophthora perniciosa FA553]
gi|215442103|gb|EEB88241.1| hypothetical protein MPER_14036 [Moniliophthora perniciosa FA553]
Length = 59
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/56 (28%), Positives = 32/56 (57%)
Query: 239 LNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNKYNEIGSVMIVKILAC 294
LN F +N DY+ YY P+ +++ +++Y + + + NE ++++KILA
Sbjct: 1 LNLFTLLLAYAMNTDYLFYYFAPLVSVWFVIIYATMALASHLNERTPILVIKILAS 56
>gi|47196421|emb|CAF89001.1| unnamed protein product [Tetraodon nigroviridis]
Length = 190
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 29/46 (63%)
Query: 234 QMMWRLNFFVAFCCIVLNNDYMLYYICPMHTLFTIMVYGAVGIFNK 279
Q+++RLNF V C+V++ Y YY P+ T + ++YG + ++ +
Sbjct: 1 QVLFRLNFLVLVLCVVMDRPYQFYYFVPLVTFWFFIIYGTLAMWPQ 46
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.330 0.143 0.458
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,134,007,142
Number of Sequences: 23463169
Number of extensions: 296674729
Number of successful extensions: 889625
Number of sequences better than 100.0: 303
Number of HSP's better than 100.0 without gapping: 280
Number of HSP's successfully gapped in prelim test: 23
Number of HSP's that attempted gapping in prelim test: 888609
Number of HSP's gapped (non-prelim): 390
length of query: 444
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 298
effective length of database: 8,933,572,693
effective search space: 2662204662514
effective search space used: 2662204662514
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 78 (34.7 bits)