BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 045952
(569 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297744222|emb|CBI37192.3| unnamed protein product [Vitis vinifera]
Length = 625
Score = 897 bits (2319), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/568 (72%), Positives = 479/568 (84%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFGC+CF W+ + + P + TFSLPAP+PTWPQGQGFASG INLGE+EV +ISRF F+W
Sbjct: 1 MFGCQCFQWSRIAELLPPDTKTFSLPAPIPTWPQGQGFASGVINLGELEVFQISRFEFVW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
NL Q KKK TFY+P GIP+GF+SLGHYCQ + +PL+GFVLVAR++A S E A N
Sbjct: 61 GSNLSQDKKKGVTFYKPVGIPNGFFSLGHYCQSNDQPLQGFVLVAREVACSNPEVAQICN 120
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
L KSP LQKPLDYTL+W D+G + NY+ C +FWLPQPP+GY++MGF+VT P++PELDE
Sbjct: 121 LDKSPPLQKPLDYTLLWSPDDGSEENYDSCGYFWLPQPPEGYEAMGFVVTNKPDRPELDE 180
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR DLTD CE HHLIF ISK S PF VWS RPC+RGMLG+G+ GTFFCSS W
Sbjct: 181 VRCVRADLTDSCETHHLIFKTISKLSKVPFRVWSLRPCHRGMLGKGIPTGTFFCSSYWNH 240
Query: 241 GQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY 300
G+ELNI CLKNL+P LHAMPN DQIHALI++YGPT+FFHP+E YLPSSV+WFF NGALLY
Sbjct: 241 GEELNIVCLKNLNPSLHAMPNLDQIHALIKHYGPTIFFHPNEAYLPSSVAWFFKNGALLY 300
Query: 301 KAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGG 360
K G+ G+AIDP G NLPSGG+NDGE+WIDLPS ++ +K GN+ESAKLYVHVKPA GG
Sbjct: 301 KKGESDGQAIDPEGLNLPSGGKNDGEYWIDLPSGDNKRSLKSGNLESAKLYVHVKPASGG 360
Query: 361 TFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH 420
TFTDIVMWVFCPFNGP TLKVG+MN+A SKIGQHVGDWEHFTLRI NFTGELWSIYFSQH
Sbjct: 361 TFTDIVMWVFCPFNGPATLKVGLMNIALSKIGQHVGDWEHFTLRISNFTGELWSIYFSQH 420
Query: 421 SGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVD 480
SGG WV A DLE+IEGNKAIVYSS++GHASFPHPG+Y+QGS LGIG+RNDAARSNLYVD
Sbjct: 421 SGGIWVNACDLEFIEGNKAIVYSSRSGHASFPHPGSYIQGSSKLGIGIRNDAARSNLYVD 480
Query: 481 SSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENA 540
SSI+YE++ AEYLG+GVV EP WLQ+MR+WGP IVYDS++ELDK+I LP M+RYSVEN
Sbjct: 481 SSIEYEIIGAEYLGDGVVTEPCWLQYMREWGPNIVYDSRSELDKMINFLPAMVRYSVENI 540
Query: 541 VSKLPLELYGEEGPTGPKEKNNWVGDER 568
+K PLEL GEEGPTGPKEK NW GDER
Sbjct: 541 FNKFPLELSGEEGPTGPKEKKNWAGDER 568
>gi|225437979|ref|XP_002272156.1| PREDICTED: uncharacterized protein LOC100259944 [Vitis vinifera]
Length = 569
Score = 896 bits (2316), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/568 (72%), Positives = 479/568 (84%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFGC+CF W+ + + P + TFSLPAP+PTWPQGQGFASG INLGE+EV +ISRF F+W
Sbjct: 1 MFGCQCFQWSRIAELLPPDTKTFSLPAPIPTWPQGQGFASGVINLGELEVFQISRFEFVW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
NL Q KKK TFY+P GIP+GF+SLGHYCQ + +PL+GFVLVAR++A S E A N
Sbjct: 61 GSNLSQDKKKGVTFYKPVGIPNGFFSLGHYCQSNDQPLQGFVLVAREVACSNPEVAQICN 120
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
L KSP LQKPLDYTL+W D+G + NY+ C +FWLPQPP+GY++MGF+VT P++PELDE
Sbjct: 121 LDKSPPLQKPLDYTLLWSPDDGSEENYDSCGYFWLPQPPEGYEAMGFVVTNKPDRPELDE 180
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR DLTD CE HHLIF ISK S PF VWS RPC+RGMLG+G+ GTFFCSS W
Sbjct: 181 VRCVRADLTDSCETHHLIFKTISKLSKVPFRVWSLRPCHRGMLGKGIPTGTFFCSSYWNH 240
Query: 241 GQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY 300
G+ELNI CLKNL+P LHAMPN DQIHALI++YGPT+FFHP+E YLPSSV+WFF NGALLY
Sbjct: 241 GEELNIVCLKNLNPSLHAMPNLDQIHALIKHYGPTIFFHPNEAYLPSSVAWFFKNGALLY 300
Query: 301 KAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGG 360
K G+ G+AIDP G NLPSGG+NDGE+WIDLPS ++ +K GN+ESAKLYVHVKPA GG
Sbjct: 301 KKGESDGQAIDPEGLNLPSGGKNDGEYWIDLPSGDNKRSLKSGNLESAKLYVHVKPASGG 360
Query: 361 TFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH 420
TFTDIVMWVFCPFNGP TLKVG+MN+A SKIGQHVGDWEHFTLRI NFTGELWSIYFSQH
Sbjct: 361 TFTDIVMWVFCPFNGPATLKVGLMNIALSKIGQHVGDWEHFTLRISNFTGELWSIYFSQH 420
Query: 421 SGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVD 480
SGG WV A DLE+IEGNKAIVYSS++GHASFPHPG+Y+QGS LGIG+RNDAARSNLYVD
Sbjct: 421 SGGIWVNACDLEFIEGNKAIVYSSRSGHASFPHPGSYIQGSSKLGIGIRNDAARSNLYVD 480
Query: 481 SSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENA 540
SSI+YE++ AEYLG+GVV EP WLQ+MR+WGP IVYDS++ELDK+I LP M+RYSVEN
Sbjct: 481 SSIEYEIIGAEYLGDGVVTEPCWLQYMREWGPNIVYDSRSELDKMINFLPAMVRYSVENI 540
Query: 541 VSKLPLELYGEEGPTGPKEKNNWVGDER 568
+K PLEL GEEGPTGPKEK NW GDER
Sbjct: 541 FNKFPLELSGEEGPTGPKEKKNWAGDER 568
>gi|147794066|emb|CAN77840.1| hypothetical protein VITISV_015561 [Vitis vinifera]
Length = 569
Score = 891 bits (2302), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/568 (72%), Positives = 478/568 (84%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFGC+CF W+ + + P + TFSLPAP+PTWPQGQGFASG INLGE+EV +ISRF F+W
Sbjct: 1 MFGCQCFQWSRIAELLPPDTKTFSLPAPIPTWPQGQGFASGVINLGELEVFQISRFEFVW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
NL Q KKK TF++P GIP+GF+SLG YCQ + +PL+GFVLVAR++A S E A N
Sbjct: 61 GSNLSQDKKKGVTFFKPVGIPNGFFSLGDYCQSNDQPLQGFVLVAREVACSNPEVAQICN 120
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
L KSP LQKPLDYTL+W D+G + NY+ C +FWLPQPP+GY++MGF+VT P++PELDE
Sbjct: 121 LDKSPPLQKPLDYTLLWSPDDGSEENYDSCGYFWLPQPPEGYEAMGFVVTNKPDRPELDE 180
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR DLTD CE HHLIF ISK S PF VWS RPC+RGMLG+G+ GTFFCSS W
Sbjct: 181 VRCVRADLTDSCETHHLIFKTISKLSKVPFRVWSLRPCHRGMLGKGIPTGTFFCSSYWNH 240
Query: 241 GQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY 300
G+ELNI CLKNL+P LHAMPN DQIHALI++YGPT+FFHP+E YLPSSV+WFF NGALLY
Sbjct: 241 GEELNIVCLKNLNPSLHAMPNLDQIHALIKHYGPTIFFHPNEAYLPSSVAWFFKNGALLY 300
Query: 301 KAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGG 360
K G+ G+AIDP G NLPSGG+NDGE+WIDLPS ++ +K GN+ESAKLYVHVKPA GG
Sbjct: 301 KKGESDGQAIDPEGLNLPSGGKNDGEYWIDLPSGDNKRSLKSGNLESAKLYVHVKPASGG 360
Query: 361 TFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH 420
TFTDIVMWVFCPFNGP TLKVG+MN+A SKIGQHVGDWEHFTLRI NFTGELWSIYFSQH
Sbjct: 361 TFTDIVMWVFCPFNGPATLKVGLMNIALSKIGQHVGDWEHFTLRISNFTGELWSIYFSQH 420
Query: 421 SGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVD 480
SGG W A DLE+IEGNKAIVYS+++GHASFPHPG+Y+QGS LGIG+RNDAARSNLYVD
Sbjct: 421 SGGIWANACDLEFIEGNKAIVYSARSGHASFPHPGSYIQGSSKLGIGIRNDAARSNLYVD 480
Query: 481 SSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENA 540
SSI+YE++ AEYLG+GVV EP WLQ+MR+WGPTIVYDS++ELDK+I LP M+RYSVEN
Sbjct: 481 SSIEYEIIGAEYLGDGVVTEPCWLQYMREWGPTIVYDSRSELDKMINFLPAMVRYSVENI 540
Query: 541 VSKLPLELYGEEGPTGPKEKNNWVGDER 568
+K PLEL GEEGPTGPKEK NW GDER
Sbjct: 541 FNKFPLELSGEEGPTGPKEKKNWAGDER 568
>gi|255571186|ref|XP_002526543.1| conserved hypothetical protein [Ricinus communis]
gi|223534104|gb|EEF35821.1| conserved hypothetical protein [Ricinus communis]
Length = 571
Score = 865 bits (2235), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/569 (70%), Positives = 475/569 (83%), Gaps = 1/569 (0%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
+FGCKCF+W ++N+ P+EP T+SLPA LP WP GQGFASGRI+LGEIEV +ISR FIW
Sbjct: 4 IFGCKCFHWKRIDNLLPSEPDTYSLPASLPDWPPGQGFASGRISLGEIEVIKISRLEFIW 63
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
+C L Q +KK +FY+PA +PDGF SLGH CQ +++PLR F+LVAR++A ++ E A S+
Sbjct: 64 TCKLPQDEKKGVSFYKPAAVPDGFNSLGHQCQINNQPLRSFLLVAREVAITKTEAAIFSS 123
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
SPAL+KP+DY LVW S +Y+GC FFWLPQPPDGYK +G+LVT P+KP+LDE
Sbjct: 124 PVNSPALRKPIDYILVWSSYSFNDESYDGCGFFWLPQPPDGYKPLGYLVTNNPDKPDLDE 183
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR DLTD C+ + I + SKFSS PF VWSTRP +RGM+G+GVSVGTFFC S S
Sbjct: 184 VRCVRADLTDGCQAYRPILNVYSKFSSFPFEVWSTRPSHRGMIGKGVSVGTFFCGSYCTS 243
Query: 241 GQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY 300
G+ELNIACL+N +P+LH+MPN +QIHALI +YGPTVFFHPDEVYLPSSVSWFF NGALLY
Sbjct: 244 GEELNIACLRNANPELHSMPNLEQIHALINHYGPTVFFHPDEVYLPSSVSWFFKNGALLY 303
Query: 301 KAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGG 360
+AG+L GE+ID SGSN P+GG ND FWIDLPSD R VKHGN+ESAKLYVHVKPA GG
Sbjct: 304 RAGNLTGESIDASGSNFPAGGTNDRAFWIDLPSDDHRDTVKHGNLESAKLYVHVKPAYGG 363
Query: 361 TFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH 420
TFTD+ MWVFCPFNGPGTLKVG ++ FSKIGQHVGDWEHFTLRICNF+GELWS+YFSQH
Sbjct: 364 TFTDLAMWVFCPFNGPGTLKVGPLSFPFSKIGQHVGDWEHFTLRICNFSGELWSMYFSQH 423
Query: 421 SGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVD 480
SGG+W+ AY+LEYIEGNK IVYSSKNGHAS+PHPGTY+QGS L IG+RNDAARSNL+VD
Sbjct: 424 SGGQWIEAYNLEYIEGNKPIVYSSKNGHASYPHPGTYIQGSAKLWIGIRNDAARSNLFVD 483
Query: 481 SSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENA 540
SS YE++A EY+ G + EP WLQ+MR+WGPTIVY+S+ ELDKI LP+M+RYSVEN
Sbjct: 484 SSTHYEIIAGEYIA-GDIIEPGWLQYMREWGPTIVYNSRNELDKIFNRLPVMLRYSVENI 542
Query: 541 VSKLPLELYGEEGPTGPKEKNNWVGDERG 569
KLP+ELYGEEGPTGPKEKNNWVGDERG
Sbjct: 543 FYKLPVELYGEEGPTGPKEKNNWVGDERG 571
>gi|224129536|ref|XP_002320610.1| predicted protein [Populus trichocarpa]
gi|222861383|gb|EEE98925.1| predicted protein [Populus trichocarpa]
Length = 568
Score = 830 bits (2143), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/568 (70%), Positives = 461/568 (81%), Gaps = 8/568 (1%)
Query: 2 FGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWS 61
F C C YW + + P EP TFSLP+PLP W QG+GFASGRINLG+IE +ISRF FIWS
Sbjct: 9 FWCNCLYWRKTAILLPPEPETFSLPSPLPDWSQGRGFASGRINLGKIEALKISRFEFIWS 68
Query: 62 CNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNL 121
NLLQ KKK +FY+P G+P+GFYSLGHYCQF+++PL GFVLV R++A E E A+
Sbjct: 69 SNLLQDKKKGVSFYKPVGVPNGFYSLGHYCQFNNKPLWGFVLVVREVACFEPEAAN---- 124
Query: 122 FKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEV 181
SP L KPLDYTLVW SD+ + Y GC FFWLPQPP+GYK +GFLVT P+KP+LDEV
Sbjct: 125 --SPTLLKPLDYTLVWSSDDESEEKYGGCGFFWLPQPPEGYKPLGFLVTNNPDKPDLDEV 182
Query: 182 RCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISG 241
RCVR DLTD+CE + L+ ++ SKF + P V STRP +RG+LG+GVSVGTFFC W S
Sbjct: 183 RCVRADLTDECEPYRLLLESYSKFLNLPVRVSSTRPSHRGVLGKGVSVGTFFCGY-WTSE 241
Query: 242 QELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK 301
+ELNIACLKNL+ +LHAMPN +QIHAL+++YGPTVFFHP+EVYLPSSV WF NGALLYK
Sbjct: 242 EELNIACLKNLN-QLHAMPNLEQIHALMKHYGPTVFFHPNEVYLPSSVPWFLKNGALLYK 300
Query: 302 AGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGT 361
AGD GE ID G+NLP GG NDG FWIDLPSDG R VK G++ESAKLYVHVKPA+GGT
Sbjct: 301 AGDSSGEPIDAEGTNLPGGGTNDGAFWIDLPSDGKRNTVKQGSLESAKLYVHVKPALGGT 360
Query: 362 FTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
FTD+ +WVF PFNGPGTLKVG +N++ KIGQHVGDWEHFTLRICNFTGELWSIYFSQHS
Sbjct: 361 FTDLAVWVFYPFNGPGTLKVGPLNISLGKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 420
Query: 422 GGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDS 481
GG+WV AYDLEYIEGNKAIVYSSKNGHASFPHPG Y+QGS LGIG+RNDAARSNLYVDS
Sbjct: 421 GGEWVDAYDLEYIEGNKAIVYSSKNGHASFPHPGCYIQGSTKLGIGIRNDAARSNLYVDS 480
Query: 482 SIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAV 541
S +YE++AAEYL EP WLQFM KWGPTIVY S+ ELDKII LP+ RYSV+N
Sbjct: 481 STRYEIIAAEYLEGSDFIEPCWLQFMGKWGPTIVYGSRIELDKIINHLPVGFRYSVKNIF 540
Query: 542 SKLPLELYGEEGPTGPKEKNNWVGDERG 569
P+EL GEEGPTGPK+KNNWVGDERG
Sbjct: 541 DGFPVELCGEEGPTGPKDKNNWVGDERG 568
>gi|225465569|ref|XP_002263468.1| PREDICTED: uncharacterized protein LOC100263054 [Vitis vinifera]
gi|147791031|emb|CAN61465.1| hypothetical protein VITISV_005001 [Vitis vinifera]
Length = 568
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/571 (66%), Positives = 452/571 (79%), Gaps = 7/571 (1%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFGC+CF WN + P EP FSLPAP+P WP+GQGFA+G+INLGEI V +I++F IW
Sbjct: 1 MFGCECFCWNRDSQYRPLEPQPFSLPAPIPEWPRGQGFATGQINLGEIVVLKITQFERIW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAE--GAHT 118
SCN L K TFY+P GIPDGF+SLGHYCQ + + LRG+VLVA+D A+S + H
Sbjct: 61 SCNQLHGKTTGVTFYKPVGIPDGFFSLGHYCQPNDQALRGYVLVAQDAAASRLKVGSVHD 120
Query: 119 SNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
S L SPAL +PL YTL+W +D +Y+GC +FWLP PP GYK+MGF+VT PN+P L
Sbjct: 121 SPLGDSPALIRPLSYTLMWNTDS----HYDGCGYFWLPNPPAGYKAMGFVVTDNPNQPAL 176
Query: 179 DEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNW 238
+EVRCVR DLT+ CE +++I + S FS PF VW+ RPC RGMLG VSVGTFFCS++
Sbjct: 177 EEVRCVRTDLTETCETYNMILNTGSNFSKYPFIVWNARPCKRGMLGNSVSVGTFFCSTHP 236
Query: 239 ISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGAL 298
+E NIACLKNLD LHAMPN QIHALI++YG TVFFHPD++Y+PSSV WFF NGAL
Sbjct: 237 SPDEEPNIACLKNLDSTLHAMPNLKQIHALIKHYGATVFFHPDDIYMPSSVPWFFKNGAL 296
Query: 299 LYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPA 357
+Y+ G L G+ ID GSNLPSGG+NDGEFWIDLP D R +K GN+ESA+LYVHVKPA
Sbjct: 297 VYQNGKLEGKPIDSRGSNLPSGGKNDGEFWIDLPDDDDERTYLKSGNIESAELYVHVKPA 356
Query: 358 VGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYF 417
+GGTFTDI MWVFCPFNGP T+KVGIM++ S+IGQHVGDWEHFTLR+ NFTGELWS+YF
Sbjct: 357 LGGTFTDIAMWVFCPFNGPATIKVGIMSIPMSRIGQHVGDWEHFTLRLSNFTGELWSVYF 416
Query: 418 SQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNL 477
SQHSGG+WV A +LE+IEGNK +VYSSK GHASFPHPG Y+QGS LGIGVRNDAARS
Sbjct: 417 SQHSGGEWVDAPNLEFIEGNKPVVYSSKGGHASFPHPGIYVQGSSKLGIGVRNDAARSKF 476
Query: 478 YVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSV 537
++DSS Y++VAAEYLG+ V EP+WLQ+MR+WGPTIVYDS+ EL+KII LLP+ R+SV
Sbjct: 477 FIDSSTNYQIVAAEYLGDTAVVEPNWLQYMREWGPTIVYDSRAELEKIISLLPVFFRFSV 536
Query: 538 ENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
EN P ELYGEEGPTGPKEKNNWV DER
Sbjct: 537 ENIFDLFPTELYGEEGPTGPKEKNNWVEDER 567
>gi|356540363|ref|XP_003538659.1| PREDICTED: uncharacterized protein LOC100811439 [Glycine max]
Length = 595
Score = 795 bits (2053), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/567 (67%), Positives = 447/567 (78%), Gaps = 9/567 (1%)
Query: 2 FGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWS 61
F F WN N + T P TFSLP+PLP WPQGQGFASG +NLGEIEVC+++ F F+W+
Sbjct: 37 FKFNHFPWNRKGN-TITLPQTFSLPSPLPQWPQGQGFASGIVNLGEIEVCKVTGFEFVWN 95
Query: 62 CNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNL 121
N+ K FY+P IPDGF LGHYCQ +PLRGFVL R++ ++ +E TSN
Sbjct: 96 SNI----GKPVAFYKPVRIPDGFRILGHYCQPSDKPLRGFVLGVREVETASSE---TSNC 148
Query: 122 FKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEV 181
PAL+ PLDY LVWCS+ G + G A+FW+PQ P+GY ++G+LVT P+KP LDE+
Sbjct: 149 HTLPALKNPLDYMLVWCSNAGSKELPIGSAYFWVPQLPEGYSALGYLVTNVPDKPNLDEM 208
Query: 182 RCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISG 241
CVR DLTDKCE + L+ DA PF VW+ RP +RGMLG GVSVGTFFC+S W G
Sbjct: 209 ICVRADLTDKCEPYRLMLDAAPVTPEFPFQVWNLRPRDRGMLGEGVSVGTFFCTSCWNKG 268
Query: 242 QELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK 301
+EL + CLKNL+P L AMP QIHALI +YGPTVFFHP+E YLPSSV WFF NGALLY+
Sbjct: 269 EELPVVCLKNLNPVLPAMPRLHQIHALIEHYGPTVFFHPEEAYLPSSVDWFFNNGALLYR 328
Query: 302 AGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGT 361
G GE ID +GSNLP GGRNDGEFWIDLPSD R VKHG+++SAKLYVHVK A+GGT
Sbjct: 329 KGVSTGETIDAAGSNLPGGGRNDGEFWIDLPSDDRRDFVKHGDLKSAKLYVHVKAALGGT 388
Query: 362 FTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
FTD+ MWVFCPFNGP TLK+GI + AFSK+G+HVGDWEHFTLRICNFTGELWSIYFSQHS
Sbjct: 389 FTDVAMWVFCPFNGPSTLKIGITSRAFSKVGEHVGDWEHFTLRICNFTGELWSIYFSQHS 448
Query: 422 GGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDS 481
GGKWV AY+LEYI+GNKA+VYSSKNGHAS+PHPGTYLQGS LGIG+RNDA RS+LYVDS
Sbjct: 449 GGKWVDAYELEYIDGNKAVVYSSKNGHASYPHPGTYLQGSSKLGIGIRNDATRSHLYVDS 508
Query: 482 SIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAV 541
SIQYELVAAEYLG+ VV EP WLQFMR+WGP IVYDSKTELDKI+ LP +R + N +
Sbjct: 509 SIQYELVAAEYLGD-VVREPQWLQFMREWGPKIVYDSKTELDKIMNALPRGLRNAFGNLI 567
Query: 542 SKLPLELYGEEGPTGPKEKNNWVGDER 568
KLP+ELYGEEGPTGPKEKNNW+GDER
Sbjct: 568 KKLPVELYGEEGPTGPKEKNNWIGDER 594
>gi|356576727|ref|XP_003556481.1| PREDICTED: uncharacterized protein LOC100814522 [Glycine max]
Length = 559
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/569 (64%), Positives = 447/569 (78%), Gaps = 14/569 (2%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFGCK W+ V + +P FSLP+PLP WPQG FA GRI LGEIEV ++++F +W
Sbjct: 1 MFGCKSLCWDSVPEFTDPDPLPFSLPSPLPQWPQGGSFACGRICLGEIEVIKVNKFEKVW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
C L K TFY P IP+GF+ LGHYCQ + +PLRG+VLVARD +S ++
Sbjct: 61 RCTSLNGKSLGFTFYRPLEIPEGFFCLGHYCQSNDQPLRGYVLVARDTSSD-------AS 113
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
+SPAL+KPL+Y+L+W D +++ C +FWLP PP GYK+MG +VT +PN+PE++E
Sbjct: 114 TLESPALEKPLNYSLIWSLD-----SHDECVYFWLPNPPTGYKAMGIVVTSSPNEPEVEE 168
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVRDDLT+ CE L+ SK+S F VW+T+PC+RGML RGV+VG FFC S +
Sbjct: 169 VRCVRDDLTESCETSDLLLTVKSKYSKDSFQVWNTQPCDRGMLARGVAVGAFFCGSTSVD 228
Query: 241 GQEL-NIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALL 299
+++ +IACLKNLD LHAMPN +QIHALI++YGPTV+FHPDE YLPSSV WFF NGA+L
Sbjct: 229 PEQVVDIACLKNLDSSLHAMPNQNQIHALIQHYGPTVYFHPDEKYLPSSVQWFFKNGAVL 288
Query: 300 YKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAV 358
+ AG+ G AID GSNLPSGG NDG FWIDLP+D R +K GN+ESA+LYVHVKPA+
Sbjct: 289 HAAGNKKGIAIDYQGSNLPSGGTNDGAFWIDLPTDADARNNLKKGNIESAELYVHVKPAL 348
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
GG +TDIVMWVFCPFNGP TLKV +MN+ +KIG+H+GDWEHFTLRI NFTGELWS+YFS
Sbjct: 349 GGAYTDIVMWVFCPFNGPATLKVALMNIEMNKIGEHIGDWEHFTLRISNFTGELWSVYFS 408
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
QHSGG WV A+DLE+I+GNK IVYSSK+GHASFPHPGTYLQGS LGIGVRNDAA S
Sbjct: 409 QHSGGGWVNAFDLEFIKGNKPIVYSSKDGHASFPHPGTYLQGSSKLGIGVRNDAAPSKFI 468
Query: 479 VDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVE 538
VDSSI+Y++VAAEYLG+GV+AEP WLQ+MR+WGPTIVYDS++E++KII LLPL +R+SVE
Sbjct: 469 VDSSIKYQIVAAEYLGDGVIAEPCWLQYMREWGPTIVYDSRSEIEKIINLLPLFVRFSVE 528
Query: 539 NAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
N P ELYGEEGPTGPKEK+NW+GDE
Sbjct: 529 NLFELFPTELYGEEGPTGPKEKDNWLGDE 557
>gi|255561981|ref|XP_002521999.1| conserved hypothetical protein [Ricinus communis]
gi|223538803|gb|EEF40403.1| conserved hypothetical protein [Ricinus communis]
Length = 569
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/571 (63%), Positives = 446/571 (78%), Gaps = 8/571 (1%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFGC+CFYWN+V + P FSLPAP+P WPQG+GFA+ RINLGE++V +I++F +W
Sbjct: 1 MFGCECFYWNQVTDYEFHNPQPFSLPAPIPKWPQGKGFATARINLGELDVIKITQFESVW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
SCNLL KKK TFY P GIPDGFY LG+YCQ + +PLRGFVLVA D + + E H N
Sbjct: 61 SCNLLHGKKKRITFYRPIGIPDGFYCLGYYCQSNDQPLRGFVLVASDSNAHKPEPDHIHN 120
Query: 121 LFKSP--ALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
KS AL+KP +Y+L+W ++ G+ +FWLP PP GYK+MG +VT +P +P++
Sbjct: 121 S-KSDCLALRKPHNYSLIWSTNPENDGD---TGYFWLPNPPKGYKAMGIVVTNSPEEPDV 176
Query: 179 DEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNW 238
+EVRCVR DLT+KC+ IF + SK + F VW+T PC RGM +GVSVGTF+C +
Sbjct: 177 EEVRCVRADLTEKCKTCDQIFSSESKTFKNQFQVWNTTPCKRGMCAKGVSVGTFYCGTYL 236
Query: 239 ISGQEL-NIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA 297
SG +L +ACLKNLD LHAMPN +QIHALI++YGPTV+FHPDE YLPSSV WFF NGA
Sbjct: 237 NSGDDLIEVACLKNLDSTLHAMPNLEQIHALIKHYGPTVYFHPDEEYLPSSVQWFFKNGA 296
Query: 298 LLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPS-DGGRQIVKHGNMESAKLYVHVKP 356
LLY+ GE+ID G NLPSGG NDGE+WIDLPS D R+ +K G++ESA+LYV+VKP
Sbjct: 297 LLYQDSKPTGESIDYRGLNLPSGGENDGEYWIDLPSKDEARENIKRGDLESAELYVNVKP 356
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
A+GGTFTDI MW+FCPFNGP TLKVG+M++ +KIGQHVGDWEH+TLRI NFTGELW +
Sbjct: 357 AIGGTFTDIAMWIFCPFNGPATLKVGLMSIPMTKIGQHVGDWEHYTLRISNFTGELWQAF 416
Query: 417 FSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
FS+HSGG+WV A DLE+IE NK IVYSSK+GHASFPHPGTYLQGS LGIGVRND ARS
Sbjct: 417 FSEHSGGRWVDASDLEFIEDNKIIVYSSKHGHASFPHPGTYLQGSTKLGIGVRNDVARSK 476
Query: 477 LYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYS 536
Y+DSS +Y+L+AAEYLG+GVV EP WLQ+MR+WGPTIVYDS++E DKII LLP +R+S
Sbjct: 477 YYIDSSTKYQLIAAEYLGDGVVKEPCWLQYMREWGPTIVYDSRSEADKIINLLPFFVRFS 536
Query: 537 VENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
V+ P ELYGEEGPTGPKEK+NW+GDE
Sbjct: 537 VDVLFDLFPTELYGEEGPTGPKEKDNWLGDE 567
>gi|357481157|ref|XP_003610864.1| hypothetical protein MTR_5g007860 [Medicago truncatula]
gi|355512199|gb|AES93822.1| hypothetical protein MTR_5g007860 [Medicago truncatula]
Length = 607
Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/572 (65%), Positives = 445/572 (77%), Gaps = 10/572 (1%)
Query: 4 CKCFYWNEVN--NMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWS 61
C CF+ ++ N + P TFSL PLP WPQGQ FASG +NLGEIEVC+I++F +W+
Sbjct: 38 CNCFFPSKPNPSDHHHLLPNTFSLNTPLPQWPQGQDFASGIVNLGEIEVCKITKFETVWN 97
Query: 62 CNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNL 121
N++ +K+ TFY+P GIPDGF+ LGHYCQ +PL GFVL A+ +A S N
Sbjct: 98 SNVMVEPRKAITFYKPVGIPDGFHILGHYCQPSYKPLWGFVLAAKQVADSSYNNICNQN- 156
Query: 122 FKSPALQKPLDYTLVWCSDEGGQG---NYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
K PAL+ PLD+ LVWC++ G + + A+FWLPQPP+GYK++G+LVT KP L
Sbjct: 157 -KLPALRNPLDFALVWCTNSGRKKIAMPVDSAAYFWLPQPPEGYKALGYLVTNNREKPNL 215
Query: 179 DEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSN- 237
DE+ CVR DLTDKCE + + DA + PF WS RPC+RGMLG+GVSVGTF+CS
Sbjct: 216 DEISCVRVDLTDKCEPYRTLLDAGNIIPEFPFWAWSLRPCDRGMLGKGVSVGTFYCSGCC 275
Query: 238 WISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA 297
W +G+EL + CLKNL+P AMP DQIHALI +YGPTVFFHP+E+YLPSSV WFF NGA
Sbjct: 276 WNNGEELPVVCLKNLNPTPSAMPWIDQIHALIEHYGPTVFFHPEEIYLPSSVDWFFNNGA 335
Query: 298 LLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGR-QIVKHGNMESAKLYVHVKP 356
LLY+ G GE ID GSNLP GG NDGEFWIDLP+D R + +KHG++ESAKLYVHVKP
Sbjct: 336 LLYRKGVSEGETIDEGGSNLPGGGTNDGEFWIDLPNDKSRREFIKHGDLESAKLYVHVKP 395
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
A GGTFTDIVMW+FCPFNGP TLK G+ ++AFSK+G+H+GDWEHFTLRI NFTGELWSIY
Sbjct: 396 AFGGTFTDIVMWIFCPFNGPSTLKFGMKSMAFSKVGEHIGDWEHFTLRISNFTGELWSIY 455
Query: 417 FSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
+SQHSGGKWV AY+LEYI+GNKAIVY+SKNGHAS+PHPGTY+QGS LGIG+RNDA RSN
Sbjct: 456 YSQHSGGKWVDAYELEYIDGNKAIVYASKNGHASYPHPGTYIQGSSKLGIGIRNDARRSN 515
Query: 477 LYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYS 536
L VDSS+ YE+VAA YLG+ VV EP WLQ+MR+WGP IVYDSKTELDKI+ LPL +R S
Sbjct: 516 LRVDSSVHYEIVAAAYLGD-VVKEPQWLQYMRQWGPKIVYDSKTELDKILNSLPLRLRSS 574
Query: 537 VENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
N KLP+ELYGEEGPTGPKEKNNW+GDER
Sbjct: 575 FGNLFRKLPVELYGEEGPTGPKEKNNWIGDER 606
>gi|357474233|ref|XP_003607401.1| hypothetical protein MTR_4g077620 [Medicago truncatula]
gi|355508456|gb|AES89598.1| hypothetical protein MTR_4g077620 [Medicago truncatula]
Length = 624
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/572 (64%), Positives = 443/572 (77%), Gaps = 10/572 (1%)
Query: 1 MFGCKCFYWNEVNNMSPTE---PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFN 57
M CF N + + P FSLP P+P WPQGQGFASG +NLGEIEV +++RF
Sbjct: 3 MLDSVCFCRNSSITFAQSSSPLPQIFSLPQPIPQWPQGQGFASGVVNLGEIEVSKVTRFE 62
Query: 58 FIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAH 117
FIWS +L + KK+ FY+P GIP F+ LGHYCQ +PLRGFVLVA++ +E
Sbjct: 63 FIWSTAIL-NPKKAVIFYKPVGIPHTFHILGHYCQPSDKPLRGFVLVAKETRIHFSE--- 118
Query: 118 TSNLFKSPALQKPLDYTLVWCSDEGGQGNYE-GCAFFWLPQPPDGYKSMGFLVTKTPNKP 176
+N+ K PAL+ PLD+ LVW S+ GC +FWLP+PP+GYK++G+LVT P KP
Sbjct: 119 -TNVEKLPALRNPLDFELVWSSNAPASLEISSGCGYFWLPEPPEGYKAVGYLVTDKPEKP 177
Query: 177 ELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSS 236
+LDE+ CVR DLTDKCE +H+ F SK F + S+RPC+RGMLG+GV VGTF CSS
Sbjct: 178 KLDEMCCVRADLTDKCEPYHVKFAVASKIPELSFQLSSSRPCDRGMLGKGVCVGTFLCSS 237
Query: 237 NWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNG 296
+G+E IACLKNL+ L +MPN QIH LI +YGPTVFFHP+E+YLPSSV WFF+NG
Sbjct: 238 GLSNGEENPIACLKNLNAALQSMPNLQQIHGLINHYGPTVFFHPEEIYLPSSVDWFFSNG 297
Query: 297 ALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKP 356
ALL + G GE ID +GSNLPSGG NDG+FWI+LPSD R +K G++ SAKLYVHVKP
Sbjct: 298 ALLCRKGMCTGEGIDATGSNLPSGGSNDGQFWIELPSDDRRDFIKRGDLNSAKLYVHVKP 357
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
A+GGTFTDI MW+FCPFNGP TLKVGI N+ SKIG+HVGDWEHFTLRICNF GEL+SIY
Sbjct: 358 ALGGTFTDIAMWIFCPFNGPATLKVGIKNIPLSKIGEHVGDWEHFTLRICNFAGELYSIY 417
Query: 417 FSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
FSQHSGG+W+ DL+YI+GNKAIVYSSK+GHAS+P PGTY+QGS LG+G+RNDA RSN
Sbjct: 418 FSQHSGGEWMDTCDLDYIDGNKAIVYSSKSGHASYPRPGTYIQGSSKLGLGIRNDAVRSN 477
Query: 477 LYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYS 536
LYVDSSI YE+VAAEYL E VVAEP+WLQFMRKWGP I+YDSKTELDK+I LP M++YS
Sbjct: 478 LYVDSSIHYEVVAAEYL-ENVVAEPNWLQFMRKWGPKIIYDSKTELDKVINALPRMLQYS 536
Query: 537 VENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
+ + ++KLP+ELYGEEGPTGPKEKNNW+ DER
Sbjct: 537 MRDLINKLPVELYGEEGPTGPKEKNNWIHDER 568
>gi|356535196|ref|XP_003536134.1| PREDICTED: uncharacterized protein LOC100788778 [Glycine max]
Length = 560
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/570 (63%), Positives = 444/570 (77%), Gaps = 15/570 (2%)
Query: 1 MFGCKCFYWNE-VNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFI 59
MFGC W+ V + +P FSLP+PLP WPQG FASGRI LGEIEV +++ F +
Sbjct: 1 MFGCNSLCWDSSVPEFTDPDPLPFSLPSPLPQWPQGGSFASGRICLGEIEVIKVNNFEKV 60
Query: 60 WSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTS 119
W C L K TFY P IP+GF+ LG+YCQ + +PLRG+VLVAR E + +
Sbjct: 61 WRCTSLNGKSLGFTFYRPLEIPEGFFCLGYYCQSNHQPLRGYVLVAR-------ETSFDA 113
Query: 120 NLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELD 179
++ +SPAL+KPL+Y+L+W D +++ C +FWLP PP GYK+MG +VT +P +PE++
Sbjct: 114 SVLESPALEKPLNYSLIWSLD-----SHDECVYFWLPNPPTGYKAMGIVVTSSPKEPEVE 168
Query: 180 EVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWI 239
EVRCVRDDLT+ CE L+ SK++ F VW+T+PC+RGML RGV VGTFFC S +
Sbjct: 169 EVRCVRDDLTETCETSDLLLTVKSKYAKDSFQVWNTQPCDRGMLARGVDVGTFFCGSTYF 228
Query: 240 -SGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGAL 298
S Q ++I CLKNLD LHAMPN +QIHALI++YGPTV+FHPDE YLPSSV WFF NGA+
Sbjct: 229 DSEQVVDIMCLKNLDSSLHAMPNQNQIHALIQHYGPTVYFHPDEKYLPSSVQWFFKNGAV 288
Query: 299 LYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPA 357
L+ AG+ G AID GSNLPSGG NDG FWIDLP+DG R +K GN+ESA+LYVHVKPA
Sbjct: 289 LHAAGNKKGIAIDYQGSNLPSGGTNDGAFWIDLPTDGDARNNLKKGNIESAELYVHVKPA 348
Query: 358 VGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYF 417
+GG FTDIVMWVFCPFNGP TLKV +MN+ SKIG+HVGDWEHFTLRI NFTGELWS+YF
Sbjct: 349 LGGAFTDIVMWVFCPFNGPATLKVALMNIEMSKIGEHVGDWEHFTLRISNFTGELWSVYF 408
Query: 418 SQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNL 477
SQHSGG W+ A+DLE+ +GNK IVYSSK+GHASFPHPGTYLQGS LGIGVRNDAA+S
Sbjct: 409 SQHSGGGWIHAFDLEFNKGNKPIVYSSKDGHASFPHPGTYLQGSSKLGIGVRNDAAQSKF 468
Query: 478 YVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSV 537
VDSS++Y++VAAEYLGEGV+ EP WLQ+MR+WGPT+VYDS++E++KII LLPL +R+SV
Sbjct: 469 IVDSSVKYQIVAAEYLGEGVITEPCWLQYMREWGPTVVYDSRSEIEKIINLLPLFVRFSV 528
Query: 538 ENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
EN P ELYGEEGPTGPKEK+NW+GDE
Sbjct: 529 ENLFELFPTELYGEEGPTGPKEKDNWLGDE 558
>gi|449445816|ref|XP_004140668.1| PREDICTED: uncharacterized protein LOC101209282 [Cucumis sativus]
Length = 571
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/568 (62%), Positives = 431/568 (75%), Gaps = 1/568 (0%)
Query: 3 GCKCFYWNEVNNMSPT-EPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWS 61
GC F W+ + + P+ EP FSLP+P P WPQG GFASG +LGEIEV +I++F IW
Sbjct: 4 GCNWFNWSNAHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVLKITQFVSIWG 63
Query: 62 CNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNL 121
CNL + TFY P +P+G++ LGHYCQ + RPL G++LVAR++ E H SN+
Sbjct: 64 CNLSRRGNNGVTFYRPLRMPEGYHCLGHYCQPNDRPLHGYLLVAREVDGYFQESDHISNI 123
Query: 122 FKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEV 181
K PAL +P+D+TL+W D+G + Y CA+ WLPQPPDGYKSMG+ VT KP + EV
Sbjct: 124 VKLPALVEPIDFTLIWSPDDGSEEKYGECAYIWLPQPPDGYKSMGYFVTNKLEKPVVGEV 183
Query: 182 RCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISG 241
RCVR DLTD+CE + L+F+ SK + +WSTR C+RGMLGRGV VGTF C S +
Sbjct: 184 RCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYKGTE 243
Query: 242 QELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK 301
+EL IACLKNL+ L MPN DQIH+LI +YGPTVFFHP E+YLPSSVSWFF NG LL++
Sbjct: 244 KELPIACLKNLNSTLPTMPNIDQIHSLINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHR 303
Query: 302 AGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGT 361
G GEAI G+NLP+ GRND W+DLP+DG R + +GN+ESAKLY HVKPA+GGT
Sbjct: 304 DGMSSGEAILVCGTNLPTDGRNDTVCWMDLPTDGCRDKIINGNLESAKLYAHVKPALGGT 363
Query: 362 FTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
FTDI MWVFCPFNGP TLK+GI+N++ KIGQHVGDWEH TLRICNFTGEL+SIYFSQHS
Sbjct: 364 FTDIAMWVFCPFNGPSTLKLGIVNISLGKIGQHVGDWEHITLRICNFTGELFSIYFSQHS 423
Query: 422 GGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDS 481
GG+WV AY+LE+IEGNKAIVYSSK+GHAS+P PG Y+QGS LGIG+RND ARS+L++DS
Sbjct: 424 GGEWVDAYNLEFIEGNKAIVYSSKSGHASYPRPGLYIQGSSKLGIGIRNDCARSHLFIDS 483
Query: 482 SIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAV 541
S YE+VAAE+L + EP WLQFMR+WGPTIVY S+T+LD I LPL IR+ V N
Sbjct: 484 STHYEIVAAEHLRRNDIVEPGWLQFMREWGPTIVYSSRTKLDNFIDRLPLKIRFPVANIF 543
Query: 542 SKLPLELYGEEGPTGPKEKNNWVGDERG 569
KLP EL+GE GPTGPKEKNNW GDERG
Sbjct: 544 RKLPAELFGEVGPTGPKEKNNWEGDERG 571
>gi|224123576|ref|XP_002319114.1| predicted protein [Populus trichocarpa]
gi|222857490|gb|EEE95037.1| predicted protein [Populus trichocarpa]
Length = 573
Score = 743 bits (1918), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/576 (62%), Positives = 442/576 (76%), Gaps = 14/576 (2%)
Query: 1 MFG--CKCFYWNEVN----NMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRIS 54
MFG C+CF W+ + + +P FSLPAPLP WPQGQGFA+GRINLGEIEV +I+
Sbjct: 1 MFGGYCECFCWSNQDFYDDDFYLPDPQPFSLPAPLPIWPQGQGFAAGRINLGEIEVAKIT 60
Query: 55 RFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAE 114
+F +WSC LL+ K K +FY P GIP+GF+ LG+YCQ + +PLRG+VLV D + + E
Sbjct: 61 KFESVWSCGLLRGKSKGVSFYRPVGIPEGFHCLGYYCQSNDQPLRGYVLVGCDSNAHKPE 120
Query: 115 GAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPN 174
H + L+KPL+Y+L+W S+ G G +FWLP PP+GYK+MG +VTK P
Sbjct: 121 VGHIHDSEVDSPLRKPLNYSLIWGSNSEKNG---GGGYFWLPNPPEGYKAMGIVVTKNPE 177
Query: 175 KPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFC 234
+P+++EVRCVR DLT++CE I + SK SS+PF VW+TRP RGM +GVSVGTF+C
Sbjct: 178 EPKVEEVRCVRADLTERCETSDQIISSDSK-SSNPFEVWNTRPHRRGMFAKGVSVGTFYC 236
Query: 235 SSNWISGQE--LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWF 292
++ ++S E ++IACLKNLD AMPN DQ+HALI++YGPTVFFHPDE LPSSV WF
Sbjct: 237 NA-YLSSDEVLIDIACLKNLDHSQPAMPNLDQVHALIKHYGPTVFFHPDEDCLPSSVQWF 295
Query: 293 FTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLP-SDGGRQIVKHGNMESAKLY 351
F NGALLY+ E ID GSNLPSGG NDGEFWIDLP D R V G++ESA+LY
Sbjct: 296 FKNGALLYQDCGQKCEPIDLRGSNLPSGGENDGEFWIDLPVDDAARNDVIAGDLESAELY 355
Query: 352 VHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGE 411
VH KPA GGTFTDI MW+FCPFNGP TLK+G+M + +KIGQHVGDWEH+TLR+ NFTGE
Sbjct: 356 VHAKPAFGGTFTDIAMWIFCPFNGPATLKIGLMTIPMTKIGQHVGDWEHYTLRVSNFTGE 415
Query: 412 LWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRND 471
LW ++FS+HSGG+WV A DLE+IEGN+ +VYSSK+GHASFPHPGTYLQGS LGIGVRND
Sbjct: 416 LWQVFFSEHSGGRWVDASDLEFIEGNRPVVYSSKHGHASFPHPGTYLQGSTKLGIGVRND 475
Query: 472 AARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPL 531
ARS YVDSS++Y+LVAAEYLG+GVV EP WLQ+MR+WGPTIVYDS++E+DKI+ LP
Sbjct: 476 VARSKYYVDSSLKYQLVAAEYLGDGVVTEPCWLQYMREWGPTIVYDSRSEIDKILDHLPF 535
Query: 532 MIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
+R+SVEN V P ELYGEEGP+GPKEK NWVGDE
Sbjct: 536 FVRFSVENLVDLFPTELYGEEGPSGPKEKYNWVGDE 571
>gi|449487421|ref|XP_004157618.1| PREDICTED: uncharacterized protein LOC101230031 [Cucumis sativus]
Length = 571
Score = 743 bits (1918), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/568 (62%), Positives = 431/568 (75%), Gaps = 1/568 (0%)
Query: 3 GCKCFYWNEVNNMSPT-EPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWS 61
GC F W+ + + P+ EP FSLP+P P WPQG GFASG +LGEIEV +I++F IW
Sbjct: 4 GCNWFNWSNAHYLLPSDEPDHFSLPSPTPEWPQGGGFASGIASLGEIEVLKITQFVSIWG 63
Query: 62 CNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNL 121
CNL + TFY P +P+G++ LGHYCQ + RPL G++LVAR++ E H SN+
Sbjct: 64 CNLSRRGNNGVTFYRPLRMPEGYHCLGHYCQPNDRPLHGYLLVAREVDGYFQESDHISNI 123
Query: 122 FKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEV 181
K PAL +P+D+TL+W D+G + Y CA+ WLPQPPDGYKSMG+ VT KP + EV
Sbjct: 124 VKLPALVEPIDFTLIWSPDDGSEEKYGECAYIWLPQPPDGYKSMGYFVTNKLEKPVVGEV 183
Query: 182 RCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISG 241
RCVR DLTD+CE + L+F+ SK + +WSTR C+RGMLGRGV VGTF C S +
Sbjct: 184 RCVRADLTDRCETYRLMFNISSKCKNFLVQIWSTRACHRGMLGRGVPVGTFHCGSYKGTE 243
Query: 242 QELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK 301
+EL IACLKNL+ L MPN DQIH+LI +YGPTVFFHP E+YLPSSVSWFF NG LL++
Sbjct: 244 KELPIACLKNLNSTLPTMPNIDQIHSLINHYGPTVFFHPKEIYLPSSVSWFFENGVLLHR 303
Query: 302 AGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGT 361
G GEAI G+NLP+ GRND W+DLP+DG R + +GN+ESA+LY HVKPA+GGT
Sbjct: 304 DGMSSGEAILVCGTNLPTDGRNDTVCWMDLPTDGCRDKIINGNLESAELYAHVKPALGGT 363
Query: 362 FTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
FTDI MWVFCPFNGP TLK+GI+N++ KIGQHVGDWEH TLRICNFTGEL+SIYFSQHS
Sbjct: 364 FTDIAMWVFCPFNGPSTLKLGIVNISLGKIGQHVGDWEHITLRICNFTGELFSIYFSQHS 423
Query: 422 GGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDS 481
GG+WV AY+LE+IEGNKAIVYSSK+GHAS+P PG Y+QGS LGIG+RND ARS+L++DS
Sbjct: 424 GGEWVDAYNLEFIEGNKAIVYSSKSGHASYPRPGLYIQGSSKLGIGIRNDCARSHLFIDS 483
Query: 482 SIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAV 541
S YE+VAAE+L + EP WLQFMR+WGPTIVY S+T+LD I LPL IR+ V N
Sbjct: 484 STHYEIVAAEHLRRNDIVEPGWLQFMREWGPTIVYSSRTKLDNFIDRLPLKIRFPVANIF 543
Query: 542 SKLPLELYGEEGPTGPKEKNNWVGDERG 569
KLP EL+GE GPTGPKEKNNW GDERG
Sbjct: 544 RKLPAELFGEVGPTGPKEKNNWEGDERG 571
>gi|356540920|ref|XP_003538932.1| PREDICTED: uncharacterized protein LOC100793026 [Glycine max]
Length = 543
Score = 742 bits (1915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/550 (65%), Positives = 431/550 (78%), Gaps = 14/550 (2%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
TFSLPAP+ WPQGQGFASG +NLGEI+V +++RF FI S +++ KK+ TF+ P G+P
Sbjct: 4 TFSLPAPISQWPQGQGFASGLVNLGEIQVSKVTRFEFISSSSVMLDTKKAVTFFRPVGVP 63
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
+ F+ LGHYCQ +PL GFVLVA+ + A+ P L+ PLD+ LVW S
Sbjct: 64 ESFHILGHYCQPSGKPLHGFVLVAKICSPQNADTI--------PPLKNPLDFKLVW-SHN 114
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
G +FWLP+PP+GYK++G+LVT +KP LDE+ CVR DLTDKCE + I A
Sbjct: 115 AASMEIPG-VYFWLPEPPEGYKALGYLVTNKHDKPLLDEMCCVRADLTDKCEPYRQILAA 173
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQEL-NIACLKNLDPKLHAMP 260
S+ F VWS RPC+RGMLG+GVSVGTFFCS+ W G+EL +ACLKNL+P L AMP
Sbjct: 174 GSRIPEFSFQVWSLRPCDRGMLGKGVSVGTFFCSNGWTMGEELLPVACLKNLNPVLPAMP 233
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSG 320
+ QIHALI++YGPTVFFHP E YLPSSV WFF NGA+L+K G GE ID SGSNLPSG
Sbjct: 234 DLQQIHALIKHYGPTVFFHPQEKYLPSSVDWFFNNGAMLHKKGVSKGEGIDASGSNLPSG 293
Query: 321 GRNDGEFWIDLPSDGGRQ-IVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
G NDG+FWIDLPSD R+ VK G+++S++LYVHVKPA+GGTFTDI MWVFCPFNGP TL
Sbjct: 294 GTNDGQFWIDLPSDHDRKNFVKRGDLKSSRLYVHVKPALGGTFTDIAMWVFCPFNGPATL 353
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG-NK 438
K+GI ++ SK+G+HVGDWEHFTLRICNF+GEL+SIYFSQHSGG+WV AYDL+YI+G NK
Sbjct: 354 KIGIKSIPLSKVGEHVGDWEHFTLRICNFSGELYSIYFSQHSGGEWVDAYDLDYIKGYNK 413
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV 498
A VYSSK GHAS+PHPGTY+QGS LGIG+RNDAARSNLYVDSS+ YELVAAEYL E V
Sbjct: 414 ATVYSSKCGHASYPHPGTYMQGSSKLGIGIRNDAARSNLYVDSSVHYELVAAEYL-ENDV 472
Query: 499 AEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPK 558
EP WLQFMR+WGP IVYD+KTELDK+I LP M+RYSV N +K P+ELYGEEGPTGPK
Sbjct: 473 TEPQWLQFMREWGPKIVYDTKTELDKVINALPRMLRYSVRNLFNKFPVELYGEEGPTGPK 532
Query: 559 EKNNWVGDER 568
EKNNW+ DER
Sbjct: 533 EKNNWIQDER 542
>gi|357441407|ref|XP_003590981.1| hypothetical protein MTR_1g080370 [Medicago truncatula]
gi|357441415|ref|XP_003590985.1| hypothetical protein MTR_1g080410 [Medicago truncatula]
gi|355480029|gb|AES61232.1| hypothetical protein MTR_1g080370 [Medicago truncatula]
gi|355480033|gb|AES61236.1| hypothetical protein MTR_1g080410 [Medicago truncatula]
Length = 560
Score = 741 bits (1912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/568 (61%), Positives = 438/568 (77%), Gaps = 11/568 (1%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFG +CF W+ ++P FSLP+PLP WPQG GFA GRI+LG+IEV ++++F +W
Sbjct: 1 MFGFECFCWDSFPEFFDSDPLPFSLPSPLPQWPQGGGFAGGRISLGKIEVTKVNKFEKVW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
C K TFY P IPDGF LG+YC + +PLRG VLVAR+ S S
Sbjct: 61 RCTNSNGKALGFTFYRPLEIPDGFSCLGYYCHSNDQPLRGHVLVARETTSKSQADCSES- 119
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
+SPAL+KPL+Y+L+WC D +++ C +FWLP PP GYK++G +VT P++P+ +E
Sbjct: 120 --ESPALKKPLNYSLIWCMD-----SHDECVYFWLPNPPKGYKAVGIVVTTNPDEPKAEE 172
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR DLT+ CE L+ SK +S F VW+T+PC+RGML RGVSVGTFFC + + S
Sbjct: 173 VRCVRTDLTEVCETSDLLLTIKSKKNS--FQVWNTQPCDRGMLARGVSVGTFFCGTYFDS 230
Query: 241 GQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY 300
Q +++ CLKNLD LHAMPN +QIHALI +YGPTV+FHPDE Y+PSSVSWFF NGA+LY
Sbjct: 231 EQVVDVVCLKNLDSLLHAMPNLNQIHALIEHYGPTVYFHPDEKYMPSSVSWFFKNGAILY 290
Query: 301 KAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVG 359
AG+ G+AID G+NLP GG NDG FWIDLP+D R +K GN+ESA+LYVHVKPA+G
Sbjct: 291 TAGNAKGKAIDYHGTNLPGGGYNDGAFWIDLPTDEDARSNLKKGNIESAELYVHVKPALG 350
Query: 360 GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
G FTDI MWVFCPFNGP TLKV +MN+ +KIG+HVGDWEHFTLR+ NFTGELWS++FS+
Sbjct: 351 GAFTDIAMWVFCPFNGPATLKVSLMNIEMNKIGEHVGDWEHFTLRVSNFTGELWSVFFSE 410
Query: 420 HSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYV 479
HSGGKWV A+DLE+I+ NK IVYSS++GHAS+PH GTYLQGS LGIGVRNDAA+SN +
Sbjct: 411 HSGGKWVNAFDLEFIKENKPIVYSSRHGHASYPHAGTYLQGSSKLGIGVRNDAAKSNFIL 470
Query: 480 DSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVEN 539
DSS +Y++VAAEYLG+GV+ EP WLQ+MR+WGPTIVYDS++E++KII +LP+ +R+SVEN
Sbjct: 471 DSSFRYKIVAAEYLGDGVITEPCWLQYMREWGPTIVYDSRSEIEKIIDMLPIFVRFSVEN 530
Query: 540 AVSKLPLELYGEEGPTGPKEKNNWVGDE 567
P EL GEEGPTGPKEK+NW+GDE
Sbjct: 531 LFELFPTELSGEEGPTGPKEKDNWLGDE 558
>gi|297843218|ref|XP_002889490.1| hypothetical protein ARALYDRAFT_333729 [Arabidopsis lyrata subsp.
lyrata]
gi|297335332|gb|EFH65749.1| hypothetical protein ARALYDRAFT_333729 [Arabidopsis lyrata subsp.
lyrata]
Length = 1328
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/579 (59%), Positives = 438/579 (75%), Gaps = 19/579 (3%)
Query: 1 MFGCKCFYWNEVNNMSP-TEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFI 59
M G KC +WN + ++ P +P TFSLPA +P WP GQGF SG INLG+++V +I+ F FI
Sbjct: 757 MLGYKCLHWNNLIDLPPLKDPETFSLPASIPHWPPGQGFGSGTINLGKLQVIKITDFEFI 816
Query: 60 WSCNLLQSKKKSATFYEPAGI-PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHT 118
W + K KS +FY+P G+ P F+ LGHYCQ DS PLRG++L ARDL S +
Sbjct: 817 WRYRSTE-KNKSISFYKPKGLFPKDFHCLGHYCQSDSHPLRGYLLAARDLVDSLEQE--- 872
Query: 119 SNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCA---FFWLPQPPDGYKSMGFLVTKTPNK 175
+ PAL +P+D+TLVW S++ + + +FWLPQPP+GY+S+GF+VTK+ K
Sbjct: 873 ----EKPALVEPVDFTLVWSSNDSVEDECSSKSERGYFWLPQPPEGYRSIGFVVTKSSVK 928
Query: 176 PELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCS 235
PEL+EVRCVR DLTDKCE H++I A+S+ P +W TRP +RGM GRGVS GTFFC
Sbjct: 929 PELNEVRCVRADLTDKCETHNVIVTAVSESLGVPLFIWRTRPSDRGMWGRGVSAGTFFCR 988
Query: 236 SNWISGQE---LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWF 292
+ + +E + IACLKNLD +HAMPN DQI ALI++YGPT+ FHP E YLPSSVSWF
Sbjct: 989 TRLVPAREDIGIGIACLKNLDTNVHAMPNVDQIQALIQHYGPTLVFHPGETYLPSSVSWF 1048
Query: 293 FTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLP-SDGGRQIVKHGNMESAKLY 351
F NGA+L + G+ + E I+ +GSNLP GG ND +FWIDLP D R VK GN+ES+KLY
Sbjct: 1049 FKNGAVLCEKGNPIDELINENGSNLPQGGSNDKQFWIDLPCDDQQRDFVKRGNLESSKLY 1108
Query: 352 VHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGE 411
+H+KPA+GGTFTD+V W+FCPFNGP TLK+G+++V+ IGQHV DWEHFT+RI NF+GE
Sbjct: 1109 IHIKPALGGTFTDLVYWIFCPFNGPATLKLGLVDVSLISIGQHVCDWEHFTVRISNFSGE 1168
Query: 412 LWSIYFSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRN 470
L+SIYFSQHSGG+W+ AYDLE+I G NKA+VYSSK+GHASFP GTYLQGS +LGIG+RN
Sbjct: 1169 LYSIYFSQHSGGEWIEAYDLEFIPGSNKAVVYSSKHGHASFPKAGTYLQGSTMLGIGIRN 1228
Query: 471 DAARSNLYVDSSIQYELVAAEYL-GEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLL 529
D ARS L+VDSS +YE++AAEYL G GVVAEP WLQ+MR+WGP +VYDS+ E+++++
Sbjct: 1229 DTARSELFVDSSSRYEIIAAEYLGGNGVVAEPPWLQYMREWGPKVVYDSREEIERLVNRF 1288
Query: 530 PLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
P +R S+ + KLP+EL GEEGPTGPKEKNNW GDER
Sbjct: 1289 PRTVRVSLATVLRKLPVELSGEEGPTGPKEKNNWYGDER 1327
>gi|3142300|gb|AAC16751.1| Contains similarity to pre-mRNA processing protein PRP39 gb|L29224
from S. cerevisiae. ESTs gb|R64908 and gb|T88158,
gb|N38703 and gb|AA651043 come from this gene
[Arabidopsis thaliana]
Length = 1345
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/579 (59%), Positives = 436/579 (75%), Gaps = 19/579 (3%)
Query: 1 MFGCKCFYWNEVNNMSP-TEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFI 59
M G KC +WN + ++ P +P TFSLP+ +P WP GQGF SG INLG+++V +I+ F FI
Sbjct: 774 MLGYKCLHWNNLIDLPPLKDPETFSLPSSIPHWPPGQGFGSGTINLGKLQVIKITDFEFI 833
Query: 60 WSCNLLQSKKKSATFYEPAGI-PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHT 118
W + KKK+ +FY+P G+ P F+ LGHYCQ DS PLRG+VL ARDL S +
Sbjct: 834 WRYRSTE-KKKNISFYKPKGLLPKDFHCLGHYCQSDSHPLRGYVLAARDLVDSLEQ---- 888
Query: 119 SNLFKSPALQKPLDYTLVWCSDEGGQ---GNYEGCAFFWLPQPPDGYKSMGFLVTKTPNK 175
+ PAL +P+D+TLVW S++ + + C +FWLPQPP+GY+S+GF+VTKT K
Sbjct: 889 ---VEKPALVEPVDFTLVWSSNDSAENECSSKSECGYFWLPQPPEGYRSIGFVVTKTSVK 945
Query: 176 PELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCS 235
PEL+EVRCVR DLTD CE H++I A+S+ P +W TRP +RGM G+GVS GTFFC
Sbjct: 946 PELNEVRCVRADLTDICEPHNVIVTAVSESLGVPLFIWRTRPSDRGMWGKGVSAGTFFCR 1005
Query: 236 SNWISGQE---LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWF 292
+ ++ +E + IACLKNLD LHAMPN DQI ALI++YGPT+ FHP E YLPSSVSWF
Sbjct: 1006 TRLVAAREDLGIGIACLKNLDLSLHAMPNVDQIQALIQHYGPTLVFHPGETYLPSSVSWF 1065
Query: 293 FTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLP-SDGGRQIVKHGNMESAKLY 351
F NGA+L + G+ + E ID +GSNLP GG ND +FWIDLP D R VK GN+ES+KLY
Sbjct: 1066 FKNGAVLCEKGNPIEEPIDENGSNLPQGGSNDKQFWIDLPCDDQQRDFVKRGNLESSKLY 1125
Query: 352 VHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGE 411
+H+KPA+GGTFTD+V W+FCPFNGP TLK+G+++++ IGQHV DWEHFTLRI NF+GE
Sbjct: 1126 IHIKPALGGTFTDLVFWIFCPFNGPATLKLGLVDISLISIGQHVCDWEHFTLRISNFSGE 1185
Query: 412 LWSIYFSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRN 470
L+SIY SQHSGG+W+ AYDLE I G NKA+VYSSK+GHASFP GTYLQGS +LGIG+RN
Sbjct: 1186 LYSIYLSQHSGGEWIEAYDLEIIPGSNKAVVYSSKHGHASFPRAGTYLQGSTMLGIGIRN 1245
Query: 471 DAARSNLYVDSSIQYELVAAEYL-GEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLL 529
D ARS L VDSS +YE++AAEYL G V+AEP WLQ+MR+WGP +VYDS+ E+++++
Sbjct: 1246 DTARSELLVDSSSRYEIIAAEYLSGNSVLAEPPWLQYMREWGPKVVYDSREEIERLVNRF 1305
Query: 530 PLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
P +R S+ + KLP+EL GEEGPTGPKEKNNW GDER
Sbjct: 1306 PRTVRVSLATVLRKLPVELSGEEGPTGPKEKNNWYGDER 1344
>gi|18396859|ref|NP_566224.1| uncharacterized protein [Arabidopsis thaliana]
gi|6721151|gb|AAF26779.1|AC016829_3 unknown protein [Arabidopsis thaliana]
gi|15450691|gb|AAK96617.1| AT3g04350/T6K12_3 [Arabidopsis thaliana]
gi|21360521|gb|AAM47376.1| AT3g04350/T6K12_3 [Arabidopsis thaliana]
gi|332640548|gb|AEE74069.1| uncharacterized protein [Arabidopsis thaliana]
Length = 567
Score = 712 bits (1839), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/578 (59%), Positives = 437/578 (75%), Gaps = 24/578 (4%)
Query: 1 MFGCKCFYWN----EVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRF 56
MFGC CFYW+ E+++ S +EP FSLPAPLP+WPQG+GFA+GRI+LGEIEV +I++F
Sbjct: 1 MFGCDCFYWSRGISELDSES-SEPKPFSLPAPLPSWPQGKGFATGRISLGEIEVVKITKF 59
Query: 57 NFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGA 116
+ +WS + K K ATFY IP+GF+ LGHYCQ +PLRG+VL AR + A+
Sbjct: 60 HRVWSSDSSHDKSKRATFYRADDIPEGFHCLGHYCQPTDQPLRGYVLAARTSKAVNAD-- 117
Query: 117 HTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKP 176
P L+KP+ Y+LVW +D G +FWLP PP GY++MG +VT P +P
Sbjct: 118 ------DFPPLKKPVSYSLVWSADSEKNGG----GYFWLPNPPVGYRAMGVIVTHEPGEP 167
Query: 177 ELDEVRCVRDDLTDKCEVHHLIFDAISKFSSS----PFSVWSTRPCNRGMLGRGVSVGTF 232
E +EVRCVR+DLT+ CE +I + S S+ PFSVWSTRPC RGML +GV+VG+F
Sbjct: 168 ETEEVRCVREDLTESCETSEMILEVGSSKKSNGSSSPFSVWSTRPCERGMLSQGVAVGSF 227
Query: 233 FCSSNWISGQEL--NIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVS 290
FC + +S + +I CLKNLDP LHAMPN DQ+HA+I ++GPTV+FHP+E Y+PSSV
Sbjct: 228 FCCTYDLSSERTVPDIGCLKNLDPTLHAMPNLDQVHAVIEHFGPTVYFHPEEAYMPSSVQ 287
Query: 291 WFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDG-GRQIVKHGNMESAK 349
WFF NGALLY++G G+ I+ +GSNLP+GG ND +FWIDLP D + +K GN+ES++
Sbjct: 288 WFFKNGALLYRSGKSEGQPINSTGSNLPAGGCNDMDFWIDLPEDEEAKSNLKKGNLESSE 347
Query: 350 LYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFT 409
LYVHVKPA+GGTFTDIVMW+FCPFNGP TLK+G+ + ++IG+HVGDWEHFT RICNF+
Sbjct: 348 LYVHVKPALGGTFTDIVMWIFCPFNGPATLKIGLFTLPMTRIGEHVGDWEHFTFRICNFS 407
Query: 410 GELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVR 469
GELW ++FSQHSGG WV A D+E+++ NK VYSSK+GHASFPHPG YLQGS LGIGVR
Sbjct: 408 GELWQMFFSQHSGGGWVDASDIEFVKDNKPAVYSSKHGHASFPHPGMYLQGSSKLGIGVR 467
Query: 470 NDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLL 529
ND A+S VDSS +Y +VAAEYLG+G V EP WLQ+MR+WGPTI YDS +E++KI+ LL
Sbjct: 468 NDVAKSKYIVDSSQRYVIVAAEYLGKGAVIEPCWLQYMREWGPTIAYDSGSEINKIMNLL 527
Query: 530 PLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
PL++R+S+EN V P+ LYGEEGPTGPKEK+NW GDE
Sbjct: 528 PLVVRFSIENIVDLFPIALYGEEGPTGPKEKDNWEGDE 565
>gi|15219644|ref|NP_171905.1| uncharacterized protein [Arabidopsis thaliana]
gi|63003838|gb|AAY25448.1| At1g04090 [Arabidopsis thaliana]
gi|133778830|gb|ABO38755.1| At1g04090 [Arabidopsis thaliana]
gi|332189534|gb|AEE27655.1| uncharacterized protein [Arabidopsis thaliana]
Length = 572
Score = 712 bits (1837), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/579 (59%), Positives = 435/579 (75%), Gaps = 19/579 (3%)
Query: 1 MFGCKCFYWNEVNNMSP-TEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFI 59
M G KC +WN + ++ P +P TFSLP+ +P WP GQGF SG INLG+++V +I+ F FI
Sbjct: 1 MLGYKCLHWNNLIDLPPLKDPETFSLPSSIPHWPPGQGFGSGTINLGKLQVIKITDFEFI 60
Query: 60 WSCNLLQSKKKSATFYEPAGI-PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHT 118
W + KKK+ +FY+P G+ P F+ LGHYCQ DS PLRG+VL ARDL S +
Sbjct: 61 WRYRSTE-KKKNISFYKPKGLLPKDFHCLGHYCQSDSHPLRGYVLAARDLVDSLEQ---- 115
Query: 119 SNLFKSPALQKPLDYTLVWCSDEGGQGNYEG---CAFFWLPQPPDGYKSMGFLVTKTPNK 175
+ PAL +P+D+TLVW S++ + C +FWLPQPP+GY+S+GF+VTKT K
Sbjct: 116 ---VEKPALVEPVDFTLVWSSNDSAENECSSKSECGYFWLPQPPEGYRSIGFVVTKTSVK 172
Query: 176 PELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCS 235
PEL+EVRCVR DLTD CE H++I A+S+ P +W TRP +RGM G+GVS GTFFC
Sbjct: 173 PELNEVRCVRADLTDICEPHNVIVTAVSESLGVPLFIWRTRPSDRGMWGKGVSAGTFFCR 232
Query: 236 SNWISGQE---LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWF 292
+ ++ +E + IACLKNLD LHAMPN DQI ALI++YGPT+ FHP E YLPSSVSWF
Sbjct: 233 TRLVAAREDLGIGIACLKNLDLSLHAMPNVDQIQALIQHYGPTLVFHPGETYLPSSVSWF 292
Query: 293 FTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLP-SDGGRQIVKHGNMESAKLY 351
F NGA+L + G+ + E ID +GSNLP GG ND +FWIDLP D R VK GN+ES+KLY
Sbjct: 293 FKNGAVLCEKGNPIEEPIDENGSNLPQGGSNDKQFWIDLPCDDQQRDFVKRGNLESSKLY 352
Query: 352 VHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGE 411
+H+KPA+GGTFTD+V W+FCPFNGP TLK+G+++++ IGQHV DWEHFTLRI NF+GE
Sbjct: 353 IHIKPALGGTFTDLVFWIFCPFNGPATLKLGLVDISLISIGQHVCDWEHFTLRISNFSGE 412
Query: 412 LWSIYFSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRN 470
L+SIY SQHSGG+W+ AYDLE I G NKA+VYSSK+GHASFP GTYLQGS +LGIG+RN
Sbjct: 413 LYSIYLSQHSGGEWIEAYDLEIIPGSNKAVVYSSKHGHASFPRAGTYLQGSTMLGIGIRN 472
Query: 471 DAARSNLYVDSSIQYELVAAEYL-GEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLL 529
D ARS L VDSS +YE++AAEYL G V+AEP WLQ+MR+WGP +VYDS+ E+++++
Sbjct: 473 DTARSELLVDSSSRYEIIAAEYLSGNSVLAEPPWLQYMREWGPKVVYDSREEIERLVNRF 532
Query: 530 PLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
P +R S+ + KLP+EL GEEGPTGPKEKNNW GDER
Sbjct: 533 PRTVRVSLATVLRKLPVELSGEEGPTGPKEKNNWYGDER 571
>gi|297791477|ref|XP_002863623.1| hypothetical protein ARALYDRAFT_494605 [Arabidopsis lyrata subsp.
lyrata]
gi|297309458|gb|EFH39882.1| hypothetical protein ARALYDRAFT_494605 [Arabidopsis lyrata subsp.
lyrata]
Length = 562
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/575 (59%), Positives = 433/575 (75%), Gaps = 21/575 (3%)
Query: 1 MFGCKCFYWNEVNNMSP-TEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFI 59
MFGCKC YWN + + P EP TFSLPA +P WP GQGF SGRINLG++E+ ++ F F+
Sbjct: 1 MFGCKCLYWNNLRELPPLKEPETFSLPASIPQWPSGQGFGSGRINLGDLELAEVTSFEFV 60
Query: 60 WSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTS 119
W + KKS +FY+P + + F+ LGHYCQ DS LRGF+LVAR + S
Sbjct: 61 WRYCSRRDNKKSVSFYKPDKLLEDFHCLGHYCQSDSHLLRGFLLVARQVNKS-------- 112
Query: 120 NLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELD 179
PAL +PLDYTLVW S++ + + G +FWLPQPP GYK++GFLVT +P+KPELD
Sbjct: 113 ---SEPALVQPLDYTLVWSSNDLSEESQSG--YFWLPQPPQGYKTIGFLVTTSPSKPELD 167
Query: 180 EVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWI 239
+VRCVR DLTDKCE H +I AIS S P +W TRP +RGM G+GVS GTFFC++
Sbjct: 168 QVRCVRADLTDKCEAHKVIITAISNSLSIPLFIWQTRPSDRGMWGKGVSTGTFFCTTQSP 227
Query: 240 SGQELN-IACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGAL 298
L+ IACLKNLD LHAMPN +QIHALI++YGP V+FHP+EVYLPSSVSWFF NGA+
Sbjct: 228 EEDHLSTIACLKNLDSSLHAMPNMEQIHALIQHYGPRVYFHPNEVYLPSSVSWFFRNGAV 287
Query: 299 LYKAGDL--VGEAIDPSGSNLPSGGRNDGEFWIDLPSDGG--RQIVKHGNMESAKLYVHV 354
L + + E ID +GSNLP GG ND +WIDLP + R+ +K GN+ES+KLYVHV
Sbjct: 288 LCSNSNSSDINEPIDETGSNLPHGGTNDKRYWIDLPINDQQRREFLKRGNLESSKLYVHV 347
Query: 355 KPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWS 414
KPA GGTFTD+ W+FCPFNGP TLK+G+MN++ +K GQHV DWEHFTLRI NF+GE++S
Sbjct: 348 KPAFGGTFTDLAFWIFCPFNGPATLKLGLMNISLAKTGQHVCDWEHFTLRISNFSGEIYS 407
Query: 415 IYFSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAA 473
IYFSQHSGG+W+ DLE+IEG NKA+VYSSK+GHASF G YLQGS++LGIG+RND A
Sbjct: 408 IYFSQHSGGEWIKPQDLEFIEGSNKAVVYSSKHGHASFSKSGLYLQGSDLLGIGIRNDTA 467
Query: 474 RSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMI 533
RS+ +VDSS++YE+VAAEYLG+ VV EP WL +MR+WGP IVY+S++E++K+ + LP +
Sbjct: 468 RSDFFVDSSLRYEIVAAEYLGDAVV-EPPWLGYMREWGPKIVYNSRSEIEKLNERLPWRM 526
Query: 534 RYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
R V+ + K+P+EL GEEGPTGPKEK+NW GDER
Sbjct: 527 RSWVDAVLRKIPVELSGEEGPTGPKEKDNWFGDER 561
>gi|297833120|ref|XP_002884442.1| hypothetical protein ARALYDRAFT_477694 [Arabidopsis lyrata subsp.
lyrata]
gi|297330282|gb|EFH60701.1| hypothetical protein ARALYDRAFT_477694 [Arabidopsis lyrata subsp.
lyrata]
Length = 567
Score = 709 bits (1831), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/577 (59%), Positives = 435/577 (75%), Gaps = 22/577 (3%)
Query: 1 MFGCKCFYWNE---VNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFN 57
MFGC CFYW+ V + +EP FSLP+PLP+WPQG+GFA+GRINLGEIEV +I++F+
Sbjct: 1 MFGCDCFYWSRGLSVLDSESSEPKPFSLPSPLPSWPQGKGFATGRINLGEIEVVKITKFH 60
Query: 58 FIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAH 117
+WS + L K K ATFY IP+GF+ LGHYCQ +PLRG+VL AR + A+
Sbjct: 61 RVWSSDSLHDKSKRATFYRAEEIPEGFHCLGHYCQPTDQPLRGYVLAARASKAVNADDL- 119
Query: 118 TSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPE 177
P L+KP+ Y+LVW +D G+ +FWLP PP GY++MG +VT P +PE
Sbjct: 120 -------PPLKKPVSYSLVWSADSEKNGS----GYFWLPNPPVGYRAMGVIVTPGPGEPE 168
Query: 178 LDEVRCVRDDLTDKCEVHHLIFDAISKFSSS----PFSVWSTRPCNRGMLGRGVSVGTFF 233
+EVRCVR+DLT+ CE +I + S +S+ PFSVWSTRPC RGML +GV+VG+FF
Sbjct: 169 PEEVRCVREDLTESCETSEMILEVGSSKNSNGSSSPFSVWSTRPCERGMLSQGVAVGSFF 228
Query: 234 CSSNWISGQEL--NIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSW 291
C + +S +I CLKNLDP LHAMPN DQ+HA+I ++GPTV+FHP+E Y+PSSV W
Sbjct: 229 CCTYDLSSDRTVPDIGCLKNLDPTLHAMPNLDQVHAVIEHFGPTVYFHPEEAYMPSSVQW 288
Query: 292 FFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDG-GRQIVKHGNMESAKL 350
FF NGALLY++G G+ I+ +GSNLP+GG ND EFWIDLP D + +K GN+ES++L
Sbjct: 289 FFKNGALLYRSGKSQGQPINSTGSNLPAGGCNDMEFWIDLPGDEEAKSNLKKGNLESSEL 348
Query: 351 YVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTG 410
YVHVKPA+GGTFTDIVMW+FCPFNGP TLK+G+ + ++IG+HVGDWEHFT RICNF+G
Sbjct: 349 YVHVKPALGGTFTDIVMWIFCPFNGPATLKIGLFTLPMTRIGEHVGDWEHFTFRICNFSG 408
Query: 411 ELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRN 470
ELW ++FSQHSGG WV A D+E+++ NK VYSSK+GHASFPHPG YLQGS LGIGVRN
Sbjct: 409 ELWQMFFSQHSGGGWVDASDIEFVKDNKPAVYSSKHGHASFPHPGMYLQGSSKLGIGVRN 468
Query: 471 DAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLP 530
D A+S VDSS +Y +VAAEYLG+ V EP WLQ+MR+WGPTI YDS +E++KI+ LLP
Sbjct: 469 DVAKSKYIVDSSQRYVIVAAEYLGKEAVIEPCWLQYMREWGPTIAYDSGSEINKIMNLLP 528
Query: 531 LMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
L++R+S+EN V P+ LYGEEGPTGPKEK+NW GDE
Sbjct: 529 LVVRFSIENIVDLFPIALYGEEGPTGPKEKDNWEGDE 565
>gi|15240055|ref|NP_199208.1| uncharacterized protein [Arabidopsis thaliana]
gi|9758554|dbj|BAB09055.1| unnamed protein product [Arabidopsis thaliana]
gi|16648997|gb|AAL24350.1| Unknown protein [Arabidopsis thaliana]
gi|20259924|gb|AAM13309.1| unknown protein [Arabidopsis thaliana]
gi|332007653|gb|AED95036.1| uncharacterized protein [Arabidopsis thaliana]
Length = 566
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/577 (59%), Positives = 427/577 (74%), Gaps = 21/577 (3%)
Query: 1 MFGCKCFYWNEVNNMSP-TEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFI 59
MFGCKC YWN + P EP TFSLPA LP WP GQGF GRINLGE+EV I+ F F+
Sbjct: 1 MFGCKCLYWNNLKEYPPLKEPETFSLPASLPQWPSGQGFGLGRINLGELEVAEITSFEFV 60
Query: 60 WSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTS 119
W + KKS +FY+P +P+ F+ LGHYCQ DS LRGF+LVAR + S
Sbjct: 61 WRYCSRRDNKKSVSFYKPDKLPEDFHCLGHYCQSDSHLLRGFLLVARQVNKS-------- 112
Query: 120 NLFKSPALQKPLDYTLVWCS-DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
PAL +PLDYTLVW S D + E +FWLPQPP GYK +G+LVT +P KPEL
Sbjct: 113 ---SEPALVQPLDYTLVWSSNDLSEERQSESYGYFWLPQPPQGYKPIGYLVTTSPAKPEL 169
Query: 179 DEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNW 238
D+VRCVR DLTDKCE H +I AIS S P +W TRP +RGM G+GVS GTFFC++
Sbjct: 170 DQVRCVRADLTDKCEAHKVIITAISDSLSIPMFIWKTRPSDRGMRGKGVSTGTFFCTTQS 229
Query: 239 ISGQELN-IACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA 297
L+ IACLKNLD LHAMPN +QIHA+I++YGP V+FHP+EVYLPSSVSWFF NGA
Sbjct: 230 PEEDHLSTIACLKNLDSSLHAMPNIEQIHAMIQHYGPRVYFHPNEVYLPSSVSWFFKNGA 289
Query: 298 LL---YKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGG--RQIVKHGNMESAKLYV 352
LL + + E ID +GSNLP GG ND +WIDLP + R+ +K G++ES+KLYV
Sbjct: 290 LLCSNSNSSVINNEPIDETGSNLPHGGTNDKRYWIDLPINDQQRREFIKRGDLESSKLYV 349
Query: 353 HVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGEL 412
HVKPA GGTFTD+ W+FCPFNGP TLK+G+M+++ +K GQHV DWEHFT+RI NF+GEL
Sbjct: 350 HVKPAFGGTFTDLAFWIFCPFNGPATLKLGLMDLSLAKTGQHVCDWEHFTVRISNFSGEL 409
Query: 413 WSIYFSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRND 471
+SIYFSQHSGG+W+ +LE++EG NKA+VYSSKNGHASF G YLQGS +LGIG+RND
Sbjct: 410 YSIYFSQHSGGEWIKPENLEFVEGSNKAVVYSSKNGHASFSKSGMYLQGSALLGIGIRND 469
Query: 472 AARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPL 531
+A+S+L+VDSS++YE+VAAEYL G V EP WL +MR+WGP IVY+S++E++K+ + LP
Sbjct: 470 SAKSDLFVDSSLKYEIVAAEYL-RGAVVEPPWLGYMREWGPKIVYNSRSEIEKLNERLPW 528
Query: 532 MIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
+R V+ + K+P+EL GEEGPTGPKEKNNW GDER
Sbjct: 529 RLRSWVDAVLRKIPVELSGEEGPTGPKEKNNWFGDER 565
>gi|297812029|ref|XP_002873898.1| hypothetical protein ARALYDRAFT_909860 [Arabidopsis lyrata subsp.
lyrata]
gi|297319735|gb|EFH50157.1| hypothetical protein ARALYDRAFT_909860 [Arabidopsis lyrata subsp.
lyrata]
Length = 553
Score = 694 bits (1792), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/572 (59%), Positives = 423/572 (73%), Gaps = 31/572 (5%)
Query: 4 CKCFYWNE---VNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
C CFYWN+ +E FSLP+PLP WPQG GFA+GRINLGEI+V +++ F+ +W
Sbjct: 3 CDCFYWNKSITELESESSESKPFSLPSPLPRWPQGSGFATGRINLGEIQVVKVTEFDRVW 62
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
C + K A+FY+P GIP+GF+ LGHYCQ +++PLRGFVL AR E H ++
Sbjct: 63 KCGRSRGKSSCASFYKPVGIPEGFHCLGHYCQPNNQPLRGFVLAAR-----ANEPGHLAD 117
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
+ P L+KP++Y+LVW SD +FWLP PP GY+++G +VT +PE+DE
Sbjct: 118 DHR-PPLKKPVNYSLVWSSDSD--------CYFWLPNPPVGYRAVGVIVTDGSEEPEVDE 168
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR+DLT+ CE + S F+VWST+PC G+ RGV VG+FFCS+N +S
Sbjct: 169 VRCVREDLTESCETGEKVLGMGS------FNVWSTKPCETGIWSRGVEVGSFFCSTNDMS 222
Query: 241 GQ---ELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA 297
+NIACLKNLDP LH MPN DQ+HALI++YGP V+FHP+E Y+PSSV WFF NGA
Sbjct: 223 SNYKAAMNIACLKNLDPSLHGMPNLDQVHALIQHYGPMVYFHPEETYMPSSVPWFFKNGA 282
Query: 298 LLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKP 356
LLY++G GE I+ +GSNLP+GG NDG FWIDLP D R +K GN+ES++LYVHVKP
Sbjct: 283 LLYRSGKSQGEPINSTGSNLPAGGENDGSFWIDLPEDDEVRSNLKKGNLESSELYVHVKP 342
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
A+GG FTDIVMW+FCPFNGP TLK+G++ V ++IG+HVGDWEHFT RI NF GEL ++
Sbjct: 343 ALGGIFTDIVMWIFCPFNGPATLKIGLLTVPMNRIGEHVGDWEHFTFRISNFNGELTQMF 402
Query: 417 FSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARS 475
FSQHSGG WV DLE+++G NK +VYSSK+GHASFPHPG YLQG LGIGVRND A+S
Sbjct: 403 FSQHSGGGWVDVSDLEFVKGSNKPVVYSSKHGHASFPHPGMYLQGPSKLGIGVRNDVAKS 462
Query: 476 NLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRY 535
VDSS +Y +VAAEYLGEG ++EP WLQFMR+WGPT+VYDS E++KII LLPL++RY
Sbjct: 463 KYAVDSSQRYRIVAAEYLGEGAISEPCWLQFMREWGPTLVYDSAAEINKIIDLLPLIVRY 522
Query: 536 SVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
S E S P+ELYGEEGPTGPKEK+NW GDE
Sbjct: 523 SFE---SLFPIELYGEEGPTGPKEKDNWEGDE 551
>gi|449449579|ref|XP_004142542.1| PREDICTED: uncharacterized protein LOC101216081 [Cucumis sativus]
Length = 555
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/574 (58%), Positives = 430/574 (74%), Gaps = 26/574 (4%)
Query: 1 MFGCKCFYWNEVNN-----MSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISR 55
MFG +C+ WN V + +S ++P FSLP+PLP WP G+GF++GRI+LGEIEV +IS+
Sbjct: 1 MFGWECWCWNGVVDPLDFCLSDSQP--FSLPSPLPKWPPGKGFSTGRISLGEIEVYKISK 58
Query: 56 FNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEG 115
+W C+ + A FY+P IPDGF+ LGHYCQ PLRG+VLVAR +SE +
Sbjct: 59 LKKVWRCS------QGAVFYKPQAIPDGFFCLGHYCQPSDHPLRGYVLVAR--GASEVDH 110
Query: 116 AHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNK 175
S + +SPAL++P++YTL+W S G F WLP P+GY++MGFLVT +
Sbjct: 111 VDNS-VRESPALKRPVNYTLIWSSGLNGVDP----GFIWLPNAPEGYRAMGFLVTDRSEE 165
Query: 176 PELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCS 235
P D++RCVR DLT++CE LI SK S F VW TRP RGM GVSVGTFFC
Sbjct: 166 PSRDDIRCVRADLTERCETGDLIVTIKSK--SQSFQVWETRPFERGMYKSGVSVGTFFCC 223
Query: 236 SNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTN 295
++ + LNI+CLKNL+ MPN +Q+ ALI +YGPTVFFHPDE + PSSV WFF N
Sbjct: 224 TSL--KEYLNISCLKNLNSTFEGMPNLNQVQALIGHYGPTVFFHPDEEFFPSSVPWFFKN 281
Query: 296 GALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPS-DGGRQIVKHGNMESAKLYVHV 354
GALLY+ G+ GE ID GSNLP GG NDG +WIDLP+ D R+ +K GN+++A+LYVHV
Sbjct: 282 GALLYRNGNTKGEPIDMRGSNLPCGGENDGAYWIDLPTNDNARENLKSGNIKTARLYVHV 341
Query: 355 KPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWS 414
KPA+GGTFTDIVMWVFCPFNGP +KV +N+ KIG+HV DWEHFTLRICNF+GELW
Sbjct: 342 KPALGGTFTDIVMWVFCPFNGPAAIKVSFLNIKLKKIGEHVSDWEHFTLRICNFSGELWQ 401
Query: 415 IYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR 474
+YFS+HSGGKWV A DLE+I GNK IVYSSK+GHAS+PHPG+YLQGS + GIGVRNDAAR
Sbjct: 402 VYFSEHSGGKWVDASDLEFIHGNKPIVYSSKHGHASYPHPGSYLQGS-VAGIGVRNDAAR 460
Query: 475 SNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIR 534
S +VDSS++YE++AAEYLG+G +AEP WLQ+MR+WGPT+ Y+S++E++++I LLP ++
Sbjct: 461 SKFFVDSSLKYEIIAAEYLGDGYIAEPDWLQYMREWGPTVKYNSRSEIERLIDLLPPFVQ 520
Query: 535 YSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
+S+E+ ++ P ELYGEEGPTGPKEKNNW GDER
Sbjct: 521 FSLEDLLALFPTELYGEEGPTGPKEKNNWFGDER 554
>gi|449479722|ref|XP_004155688.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101223927 [Cucumis sativus]
Length = 555
Score = 690 bits (1781), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/574 (58%), Positives = 429/574 (74%), Gaps = 26/574 (4%)
Query: 1 MFGCKCFYWNEVNN-----MSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISR 55
MFG +C+ WN V + +S ++P FSLP+PLP WP G+GF++GRI+LGEIEV +IS+
Sbjct: 1 MFGWECWCWNGVVDPLDFCLSDSQP--FSLPSPLPKWPPGKGFSTGRISLGEIEVYKISK 58
Query: 56 FNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEG 115
+W C+ + A FY+P IPDGF+ LGHYCQ PLRG+VLVAR +SE +
Sbjct: 59 LKKVWRCS------QGAVFYKPQAIPDGFFCLGHYCQPSDHPLRGYVLVAR--GASEVDH 110
Query: 116 AHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNK 175
S + +SPAL++P++YTL+W S G F WLP P+GY++MGFLVT +
Sbjct: 111 VDNS-VRESPALKRPVNYTLIWSSGLNGVDP----GFIWLPNAPEGYRAMGFLVTDRSEE 165
Query: 176 PELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCS 235
P D++RCVR DLT++CE LI SK S F VW TRP RGM GVSVGTFFC
Sbjct: 166 PSRDDIRCVRADLTERCETGDLIVTIKSK--SQSFQVWETRPFERGMYKSGVSVGTFFCC 223
Query: 236 SNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTN 295
++ + LNI+CLKNL+ MPN +Q+ ALI +YGPTVFFHPDE + PSSV WFF N
Sbjct: 224 TSL--KEYLNISCLKNLNSTFEGMPNLNQVQALIGHYGPTVFFHPDEEFFPSSVPWFFKN 281
Query: 296 GALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPS-DGGRQIVKHGNMESAKLYVHV 354
GALLY+ G+ GE ID GSNLP GG NDG +WIDLP+ D R+ +K GN+++A+LYVHV
Sbjct: 282 GALLYRNGNTKGEPIDMRGSNLPCGGENDGAYWIDLPTNDNARENLKSGNIKTARLYVHV 341
Query: 355 KPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWS 414
KPA+GGTFTDIVMWVFCPFNGP +KV +N+ KIG+HV DWEHFTLRICNF+GELW
Sbjct: 342 KPALGGTFTDIVMWVFCPFNGPAAIKVSFLNIKLKKIGEHVSDWEHFTLRICNFSGELWQ 401
Query: 415 IYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR 474
+YFS+HSGGKWV A DLE+I GNK IVYSSK+GHAS+PHPG+YLQGS + GIGVRNDAAR
Sbjct: 402 VYFSEHSGGKWVDASDLEFIHGNKPIVYSSKHGHASYPHPGSYLQGS-VAGIGVRNDAAR 460
Query: 475 SNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIR 534
S +VDSS++YE++AAEYLG+G +AEP WLQ+MR+WGPT+ Y+S++E++++I LLP ++
Sbjct: 461 SKFFVDSSLKYEIIAAEYLGDGYIAEPDWLQYMREWGPTVKYNSRSEIERLIDLLPPFVQ 520
Query: 535 YSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
+S+E+ ++ P ELYGEEGPTGPK KNNW GDER
Sbjct: 521 FSLEDLLALFPTELYGEEGPTGPKXKNNWFGDER 554
>gi|387965707|gb|AFK13836.1| putative pre-mRNA processing protein PRP39 [Beta vulgaris subsp.
vulgaris]
Length = 578
Score = 687 bits (1772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/551 (60%), Positives = 404/551 (73%), Gaps = 29/551 (5%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
P FSLPA LP WPQG+GFA+G+INLGE+EV +I++F +WSCN + K TFY+P
Sbjct: 54 PEPFSLPAHLPQWPQGEGFATGKINLGELEVAKITKFESVWSCNPSRRNSKGVTFYKPFA 113
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
IPDGF+ LGHYCQ + RPL GFVLVAR++ S SPAL++P +++LVW S
Sbjct: 114 IPDGFFCLGHYCQSNDRPLWGFVLVAREVKQSS----------DSPALKRPRNFSLVWSS 163
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
+ C F WLP P GY++MG LVT TP +P +D++RCVR DLT+ CE +I
Sbjct: 164 E---------CGFIWLPNAPTGYRAMGLLVTDTPVEPSVDDIRCVRCDLTENCEACDMIL 214
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFF-CSSNWISGQELNIACLKNLDPKLHA 258
D F+VW TRPC RGM RGV VGTFF C+ + + ACLKN DP L+A
Sbjct: 215 D-------DDFTVWKTRPCRRGMYARGVCVGTFFSCTYGILEEDVVTTACLKNRDPTLNA 267
Query: 259 MPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLP 318
MPN +QIHALI+ YGPTV+FHPDE+Y+PSSV WFF NGA L + G GE ID +GS LP
Sbjct: 268 MPNLEQIHALIKQYGPTVYFHPDEIYMPSSVQWFFDNGARLCENGKCDGERIDSNGSLLP 327
Query: 319 SGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPG 377
GG NDGE+WIDLP D R VK GN+ESA+LYVHVKPA GG FTDI MWVFCPFNGP
Sbjct: 328 KGGLNDGEYWIDLPKDDDKRDYVKRGNLESAELYVHVKPAFGGAFTDIAMWVFCPFNGPA 387
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN 437
TLKVG+ N + +++G+HVGDWEHFTLR+ NFTGELWSIYFS+HSGG+WV A +LEYI N
Sbjct: 388 TLKVGLANFSLTRVGEHVGDWEHFTLRVSNFTGELWSIYFSEHSGGEWVDASNLEYINEN 447
Query: 438 KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGV 497
KAIVYSSK GHAS+ H GTYL G LGIGVRND A+S +VDSS +Y++VAAEYLG+
Sbjct: 448 KAIVYSSKCGHASYAHSGTYLLGPSKLGIGVRNDVAKSKYFVDSSTKYKIVAAEYLGD-T 506
Query: 498 VAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGP 557
V EP WLQFMR+WGP + YD+ +EL+KI+ LP+ +R+S+EN + P ELYGEEGPTGP
Sbjct: 507 VEEPCWLQFMREWGPKLTYDACSELEKIMNHLPIFVRFSMENLLDLFPTELYGEEGPTGP 566
Query: 558 KEKNNWVGDER 568
KEK+NW GDER
Sbjct: 567 KEKDNWFGDER 577
>gi|21592775|gb|AAM64724.1| unknown [Arabidopsis thaliana]
Length = 553
Score = 686 bits (1771), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/572 (58%), Positives = 418/572 (73%), Gaps = 31/572 (5%)
Query: 4 CKCFYWNE---VNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
C CFYWN+ +E FSLP+PLP WPQG+GFA+GRINLGEI+V ++ F+ +W
Sbjct: 3 CDCFYWNKGFSELESESSESKPFSLPSPLPQWPQGRGFATGRINLGEIQVVNVTEFDRVW 62
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
C + K + A+FY+P GIP+GF+ LGHYCQ +++PLRGFVL AR H
Sbjct: 63 KCGTSRGKLRCASFYKPVGIPEGFHCLGHYCQPNNQPLRGFVLAARANEPGHLADDHL-- 120
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
P L+KPL+Y+LVW SD +FWLP PP GY+++G +VT +PE+DE
Sbjct: 121 ----PPLKKPLNYSLVWSSDSD--------CYFWLPNPPVGYRAVGVIVTDGSEEPEVDE 168
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR+DLT+ CE + S F+VWST+PC RG+ RGV VG+FFCS+N +S
Sbjct: 169 VRCVREDLTESCETGEKVLGVGS------FNVWSTKPCERGIWSRGVEVGSFFCSTNDLS 222
Query: 241 GQE---LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA 297
+NIACLKNLDP L MPN DQ+HALI +YGP V+FHP+E Y+PSSV WFF NGA
Sbjct: 223 SDNKAAMNIACLKNLDPSLQGMPNLDQVHALIHHYGPMVYFHPEETYMPSSVPWFFKNGA 282
Query: 298 LLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKP 356
LL++ GE I+ +GSNLP+GG NDG FWIDLP D R +K GN+ES++LYVHVKP
Sbjct: 283 LLHRFEKSQGEPINSTGSNLPAGGENDGSFWIDLPEDDEVRSNLKKGNIESSELYVHVKP 342
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
A+GG FTDIVMW+FCPFNGP TLK+G++ V +++G+HVGDWEHFT RI NF G+L ++
Sbjct: 343 ALGGIFTDIVMWIFCPFNGPATLKIGLLTVPMNRLGEHVGDWEHFTFRISNFNGDLTQMF 402
Query: 417 FSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARS 475
FSQHSGG WV DLE+++G NK +VYSSK+GHASFPHPG YLQG LGIGVRND A+S
Sbjct: 403 FSQHSGGGWVDVSDLEFVKGSNKPVVYSSKHGHASFPHPGMYLQGPSKLGIGVRNDVAKS 462
Query: 476 NLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRY 535
VDSS +Y++VAAEYLGEG V+EP WLQFMR+WGPTIVYDS E++KII LLPL++RY
Sbjct: 463 KYMVDSSQRYKIVAAEYLGEGAVSEPYWLQFMREWGPTIVYDSAAEINKIIDLLPLILRY 522
Query: 536 SVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
S E S P+ELYGEEGPTGPKEK+NW GDE
Sbjct: 523 SFE---SLFPIELYGEEGPTGPKEKDNWEGDE 551
>gi|15238839|ref|NP_197350.1| uncharacterized protein [Arabidopsis thaliana]
gi|15450864|gb|AAK96703.1| Unknown protein [Arabidopsis thaliana]
gi|31711974|gb|AAP68343.1| At5g18490 [Arabidopsis thaliana]
gi|332005187|gb|AED92570.1| uncharacterized protein [Arabidopsis thaliana]
Length = 553
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/572 (58%), Positives = 417/572 (72%), Gaps = 31/572 (5%)
Query: 4 CKCFYWNE---VNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
C CFYWN+ +E FSLP+PLP WPQG+GFA+GRI+LGEI+V +++ F+ +W
Sbjct: 3 CDCFYWNKGFSELESESSESKPFSLPSPLPQWPQGRGFATGRISLGEIQVVKVTEFDRVW 62
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120
C + K + A+FY+P GIP+GF+ LGHYCQ +++PLRGFVL AR H
Sbjct: 63 KCGTSRGKLRCASFYKPVGIPEGFHCLGHYCQPNNQPLRGFVLAARANKPGHLADDH--- 119
Query: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
P L+KPL+Y+LVW SD +FWLP PP GY+++G +VT +PE+DE
Sbjct: 120 ---RPPLKKPLNYSLVWSSDSD--------CYFWLPNPPVGYRAVGVIVTDGSEEPEVDE 168
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240
VRCVR+DLT+ CE + S F+VWST+PC RG+ RGV VG+F CS+N +S
Sbjct: 169 VRCVREDLTESCETGEKVLGVGS------FNVWSTKPCERGIWSRGVEVGSFVCSTNDLS 222
Query: 241 GQE---LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA 297
+NIACLKNLDP L MPN DQ+HALI +YGP V+FHP+E Y+PSSV WFF NGA
Sbjct: 223 SDNKAAMNIACLKNLDPSLQGMPNLDQVHALIHHYGPMVYFHPEETYMPSSVPWFFKNGA 282
Query: 298 LLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKP 356
LL++ G GE I+ +GSNLP+GG NDG FWIDLP D R +K GN+ES++LYVHVKP
Sbjct: 283 LLHRFGKSQGEPINSAGSNLPAGGENDGSFWIDLPEDEEVRSNLKKGNIESSELYVHVKP 342
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
A+GG FTD+VMW+FCPFNGP TLK+G++ V +++G+HVGDWEHFT RI NF G+L ++
Sbjct: 343 ALGGIFTDVVMWIFCPFNGPATLKIGLLTVPMNRLGEHVGDWEHFTFRISNFNGDLTQMF 402
Query: 417 FSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARS 475
FSQHSGG WV DLE+++G NK +VYSSK+GHASFPHPG YLQG LGIGVRND A+S
Sbjct: 403 FSQHSGGGWVDVSDLEFVKGSNKPVVYSSKHGHASFPHPGMYLQGPSKLGIGVRNDVAKS 462
Query: 476 NLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRY 535
VDSS +Y +VAAEYLGEG V+EP WLQFMR+WGPTIVYDS E++KII LLPL++R
Sbjct: 463 KYMVDSSQRYRIVAAEYLGEGAVSEPYWLQFMREWGPTIVYDSAAEINKIIDLLPLILRN 522
Query: 536 SVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
S E S P+ELYGEEGPTGPKEK+NW GDE
Sbjct: 523 SFE---SLFPIELYGEEGPTGPKEKDNWEGDE 551
>gi|242036963|ref|XP_002465876.1| hypothetical protein SORBIDRAFT_01g047420 [Sorghum bicolor]
gi|241919730|gb|EER92874.1| hypothetical protein SORBIDRAFT_01g047420 [Sorghum bicolor]
Length = 551
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/549 (58%), Positives = 405/549 (73%), Gaps = 20/549 (3%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
P F LPAPLP WPQG FA G I +GE++V I++F IW+C+ ATFYEP G
Sbjct: 22 PEPFQLPAPLPDWPQGGDFAKGTICIGELDVVNITKFRNIWTCS-------DATFYEPEG 74
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
IPDGF+ LGHY Q + RPL+GF+LVAR++AS + + PAL+KPLDY+LVW S
Sbjct: 75 IPDGFHCLGHYAQQNDRPLQGFLLVAREVASHQV-------INSKPALEKPLDYSLVWTS 127
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
+ + + C WLP PP+GYK++G++VTK P KP L+ VRCVRDDLTD C+ I
Sbjct: 128 ADLNEDDNSECGCIWLPCPPNGYKALGYVVTKGPKKPSLEAVRCVRDDLTDTCDNFRSIV 187
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+ + +W TRPC+RG+ G G+ VGTF C+++ E + CLKN+DP L +M
Sbjct: 188 NM-----DNACQIWKTRPCHRGVRGHGIPVGTFSCATDSTDSDESSTPCLKNVDPNLRSM 242
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPS 319
PN +QI+ALI++YGPTVFFHP E YLPSSVSWFF NGA LYK +G+AI P G NLP
Sbjct: 243 PNLEQINALIKHYGPTVFFHPQETYLPSSVSWFFENGATLYKKDAKMGDAILPGGLNLPV 302
Query: 320 GGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
GG NDGE+WIDLP D +++V+ GN++SA+LY HVKPA GGTFTDI MWVFCPFNGP T+
Sbjct: 303 GGTNDGEYWIDLPDDDRKELVRDGNLKSAELYAHVKPAYGGTFTDIAMWVFCPFNGPATI 362
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KV + A K+G+H+GDWEHFTLR+ NF+GEL SIYFSQHSGG+WV A DLE+I GNKA
Sbjct: 363 KVSFASFALQKVGRHIGDWEHFTLRVSNFSGELSSIYFSQHSGGEWVEACDLEFISGNKA 422
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
IVYSS+NGHAS+PHPG YL GSE LG+GVRND ARS+L VDSS QY++++A +LG+ VV
Sbjct: 423 IVYSSRNGHASYPHPGCYLMGSEKLGVGVRNDVARSDLSVDSSTQYKIISAGHLGDAVV- 481
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WLQ+MR+WGP + Y S++E+D + LP +RY+VE + LP ELY EEGPTGPKE
Sbjct: 482 EPCWLQYMREWGPAVTYSSRSEIDTALSFLPFFLRYTVEAIFNSLPAELYEEEGPTGPKE 541
Query: 560 KNNWVGDER 568
KNNW GDER
Sbjct: 542 KNNWEGDER 550
>gi|223945089|gb|ACN26628.1| unknown [Zea mays]
gi|414864759|tpg|DAA43316.1| TPA: hypothetical protein ZEAMMB73_151042 [Zea mays]
Length = 552
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/554 (58%), Positives = 407/554 (73%), Gaps = 21/554 (3%)
Query: 15 MSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATF 74
+ P EP F LPAPLP WPQG FA G I +GE++V I++F IW+C+ ATF
Sbjct: 19 LRPPEP--FQLPAPLPEWPQGGDFAKGTICIGEVDVTSITKFRSIWTCS-------DATF 69
Query: 75 YEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYT 134
YEP GIPDGF+ LGHY Q + RPL+GF+LVAR++AS + + PAL+KP DY+
Sbjct: 70 YEPEGIPDGFHCLGHYAQQNDRPLQGFLLVAREVASHQV-------IDTKPALEKPSDYS 122
Query: 135 LVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEV 194
LVW S + + + C WLP PP+GYK++G++VTK P KP L+ VRCVRDDLTD E
Sbjct: 123 LVWTSADANEDDNSECGCIWLPCPPNGYKALGYVVTKGPKKPSLEAVRCVRDDLTDTREN 182
Query: 195 HHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDP 254
H I + + +W TRPC+RG+ GRG+ VGTF C ++ E + CLKN+D
Sbjct: 183 FHSIANM-----DNACQIWKTRPCHRGVGGRGIPVGTFSCETDSTDSDESSTPCLKNVDS 237
Query: 255 KLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSG 314
L AMPN +QI+ALI++YGPTVFFHP E YLPSSVSWFF NGA L++ +G+AI P G
Sbjct: 238 NLGAMPNLEQINALIKHYGPTVFFHPQETYLPSSVSWFFENGATLHEKDAKMGDAILPGG 297
Query: 315 SNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN 374
NLP GG NDGE+WIDLP D +++V+ GN++SA+LY HVKPA GGTFTDI MWVFCPFN
Sbjct: 298 LNLPVGGTNDGEYWIDLPDDDRKELVRAGNLKSAELYAHVKPAHGGTFTDIAMWVFCPFN 357
Query: 375 GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYI 434
GP T+KVG + A K+G+H+GDWEHFTLR+ NF+G+L SIYFSQHSGG+WV A DLE+I
Sbjct: 358 GPATIKVGFASFALQKVGRHIGDWEHFTLRVSNFSGQLSSIYFSQHSGGEWVEACDLEFI 417
Query: 435 EGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLG 494
GNKAI+YSS+NGHAS+PHPG YL GSE LG+GVRND ARS+L VDSS +YE+++A +LG
Sbjct: 418 SGNKAIIYSSRNGHASYPHPGCYLMGSETLGVGVRNDVARSDLSVDSSTRYEIISAGHLG 477
Query: 495 EGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGP 554
E VAEP WLQ+MR+WGPTI Y S++E+D + LP +R++ E + LP ELY EEGP
Sbjct: 478 EDAVAEPCWLQYMREWGPTITYSSRSEIDTALSFLPFFLRFTAEAIFNSLPAELYEEEGP 537
Query: 555 TGPKEKNNWVGDER 568
TGPKEKNNW GDER
Sbjct: 538 TGPKEKNNWEGDER 551
>gi|226500272|ref|NP_001145283.1| uncharacterized protein LOC100278579 [Zea mays]
gi|195654083|gb|ACG46509.1| hypothetical protein [Zea mays]
Length = 552
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/554 (58%), Positives = 407/554 (73%), Gaps = 21/554 (3%)
Query: 15 MSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATF 74
+ P EP F LPAPLP WPQG FA G I +GE++V I++F IW+C+ +ATF
Sbjct: 19 LRPPEP--FQLPAPLPEWPQGGDFAKGTICIGEVDVTSITKFRSIWTCS-------NATF 69
Query: 75 YEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYT 134
YEP GIPDGF LGHY Q + RPL+GF+LVAR++AS + + PAL+KP DY+
Sbjct: 70 YEPEGIPDGFXCLGHYAQQNDRPLQGFLLVAREVASHQV-------IDTKPALEKPSDYS 122
Query: 135 LVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEV 194
LVW S + + + C WLP PP+GYK++G++VTK P KP L+ VRCVRDDLTD E
Sbjct: 123 LVWTSADANEDDNSECGCIWLPCPPNGYKALGYVVTKGPKKPSLEAVRCVRDDLTDTREN 182
Query: 195 HHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDP 254
H I + + +W TRPC+RG+ GRG+ VGTF C ++ E + CLKN+D
Sbjct: 183 FHSIANM-----DNACQIWKTRPCHRGVGGRGIPVGTFSCETDSTDSDESSTPCLKNVDS 237
Query: 255 KLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSG 314
L A+PN +QI+ALI++YGPTVFFHP E YLPSSVSWFF NGA L++ +G+AI P G
Sbjct: 238 NLGAIPNLEQINALIKHYGPTVFFHPQETYLPSSVSWFFENGATLHEKDAKMGDAILPGG 297
Query: 315 SNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN 374
NLP GG NDGE+WIDLP D +++V+ GN++SA+LY HVKPA GGTFTDI MWVFCPFN
Sbjct: 298 LNLPVGGTNDGEYWIDLPDDDRKELVRAGNLKSAELYAHVKPAHGGTFTDIAMWVFCPFN 357
Query: 375 GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYI 434
GP T+KVG + A K+G+H+GDWEHFTLR+ NF+G+L SIYFSQHSGG+WV A DLE+I
Sbjct: 358 GPATIKVGFASFALQKVGRHIGDWEHFTLRVSNFSGQLSSIYFSQHSGGEWVEACDLEFI 417
Query: 435 EGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLG 494
GNKAI+YSS+NGHAS+PHPG YL GSE LG+GVRND ARS+L VDSS +YE+++A +LG
Sbjct: 418 SGNKAIIYSSRNGHASYPHPGCYLMGSETLGVGVRNDVARSDLSVDSSTRYEIISAGHLG 477
Query: 495 EGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGP 554
E VAEP WLQ+MR+WGPTI Y S++E+D + LP +R++ E + LP ELY EEGP
Sbjct: 478 EDAVAEPCWLQYMREWGPTITYSSRSEIDTALSFLPFFLRFTAEAIFNSLPAELYEEEGP 537
Query: 555 TGPKEKNNWVGDER 568
TGPKEKNNW GDER
Sbjct: 538 TGPKEKNNWEGDER 551
>gi|357114093|ref|XP_003558835.1| PREDICTED: uncharacterized protein LOC100834654 [Brachypodium
distachyon]
Length = 550
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/549 (59%), Positives = 403/549 (73%), Gaps = 22/549 (4%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
P F LPAPLP WPQG FA G I +GE+EV I++F IWSC+ ATFYEP
Sbjct: 23 PEPFQLPAPLPAWPQGGDFAKGTICIGELEVVNITKFQNIWSCS-------GATFYEPKE 75
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
+P GF+ LGHY Q + RPL+G +LVAR++AS + + + PAL+KPLDYTLVW +
Sbjct: 76 VPGGFHCLGHYAQQNDRPLQGSLLVAREVASYQL-------MNREPALEKPLDYTLVWTN 128
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
+ + GC FW P PP GYK++G++VT+ KP L+ VRCVR DLTD CE + +
Sbjct: 129 ADLNEDGNSGC--FWSPSPPKGYKALGYVVTRGLKKPSLEAVRCVRHDLTDACENYRSVI 186
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+ + VW+TRPC+RGM GRG+ VG FFC + ++ +E +I CLKNLDPKL AM
Sbjct: 187 NM-----ENSCQVWNTRPCHRGMAGRGIPVGAFFCETGAVNSEESSIPCLKNLDPKLRAM 241
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPS 319
PN DQIHALI++YGPTVFFHP E YLPSSVSWFF NGA L++ G +G+ I GSNLP+
Sbjct: 242 PNLDQIHALIKHYGPTVFFHPQETYLPSSVSWFFENGATLHQKGIKMGDTILAGGSNLPA 301
Query: 320 GGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
GG ND E+WIDLP D VK GN++SA+LY HVKPA GGTFTDI MWVFCPFNGP T+
Sbjct: 302 GGTNDHEYWIDLPDDDRNGYVKVGNLKSAELYAHVKPAHGGTFTDIAMWVFCPFNGPATI 361
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
K+G + A K+G+H GDWEHFTLRI NF+GEL SIYFS+HSGG W A DLE+I GNKA
Sbjct: 362 KIGFASFALQKVGRHTGDWEHFTLRISNFSGELSSIYFSEHSGGGWTDACDLEFISGNKA 421
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
IVYSS+NGHAS+ HPG YLQGSE LG+GVRND ARS+L VDSS +Y++++AEYLG V+
Sbjct: 422 IVYSSRNGHASYAHPGCYLQGSEKLGVGVRNDVARSDLSVDSSTKYKIISAEYLGNAVI- 480
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WLQ+MR+WGPT+ Y+S++E+D ++ LP +R++ E LP+ELY EEGPTGPKE
Sbjct: 481 EPCWLQYMREWGPTVTYNSRSEVDTVLSFLPFFLRFTAEAIFDSLPVELYEEEGPTGPKE 540
Query: 560 KNNWVGDER 568
KNNW GDER
Sbjct: 541 KNNWEGDER 549
>gi|115450669|ref|NP_001048935.1| Os03g0142900 [Oryza sativa Japonica Group]
gi|108706125|gb|ABF93920.1| pre-mRNA processing protein PRP39, putative, expressed [Oryza
sativa Japonica Group]
gi|113547406|dbj|BAF10849.1| Os03g0142900 [Oryza sativa Japonica Group]
gi|125584875|gb|EAZ25539.1| hypothetical protein OsJ_09364 [Oryza sativa Japonica Group]
gi|215695078|dbj|BAG90269.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767784|dbj|BAH00013.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 548
Score = 674 bits (1738), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/550 (58%), Positives = 409/550 (74%), Gaps = 23/550 (4%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
P F LPAPLP WPQG F+ G I +GE+EV I++F IWSC+ ATFYEP
Sbjct: 22 PEPFQLPAPLPDWPQGGDFSKGTICIGELEVASITKFQSIWSCS-------GATFYEPQE 74
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
IPDGF+ LGHY Q + RPL+GF+LVAR+ AS ++ + PAL+KPLDYTLVW S
Sbjct: 75 IPDGFHCLGHYAQQNDRPLQGFLLVAREAASCQS-------INLKPALEKPLDYTLVWTS 127
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
+ + C FW P PPDGY+++G++VT+ P KP LD VRCVR DLTD+CE F
Sbjct: 128 TDLNDDDNSDCGCFWSPSPPDGYEALGYVVTRGPKKPSLDAVRCVRGDLTDECEN----F 183
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+I+ + + +W+TRPC+RGM GRG+ VGTFFC ++ +E +I CLKN D L +M
Sbjct: 184 KSITNMGGNCY-IWNTRPCHRGMAGRGIPVGTFFCGTD---TEESDIPCLKNFDSSLSSM 239
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPS 319
PN +QI ALI +YGPTVFFHP E+YLPSSVSWFF NGA L+K G +G+ I SGSNLP+
Sbjct: 240 PNLEQIKALIEHYGPTVFFHPQEIYLPSSVSWFFENGATLHKKGKEMGDVILASGSNLPA 299
Query: 320 GGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
GG NDGE+WID+P + VK GN++SA+LYVH+KPA GGTFTDI MWVFCPFNGP T+
Sbjct: 300 GGTNDGEYWIDIPDGDRNEYVKAGNLKSAELYVHIKPAHGGTFTDIAMWVFCPFNGPATI 359
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KVG + A K+G+H GDWEHFTLRI NF+GEL SIYFSQHSGG WV A DLE+I GNKA
Sbjct: 360 KVGFASFALQKVGRHTGDWEHFTLRISNFSGELSSIYFSQHSGGDWVDACDLEFISGNKA 419
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
IVYS+K+GHAS+ HPG YL GSE G+GVRNDAARS++ VDSS +Y++++A LG+ V+
Sbjct: 420 IVYSAKDGHASYAHPGCYLLGSEKAGVGVRNDAARSDILVDSSTRYKIISAGNLGDAVI- 478
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WLQ+MR+WGPT+ Y+S++E+D ++ LP +R++ E ++ LP+ELY EEGPTGPKE
Sbjct: 479 EPCWLQYMREWGPTVEYNSRSEIDAVLSFLPFFLRFTAEAILNSLPVELYEEEGPTGPKE 538
Query: 560 KNNWVGDERG 569
KNNW GDERG
Sbjct: 539 KNNWEGDERG 548
>gi|108706126|gb|ABF93921.1| pre-mRNA processing protein PRP39, putative, expressed [Oryza
sativa Japonica Group]
Length = 547
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/550 (58%), Positives = 409/550 (74%), Gaps = 24/550 (4%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
P F LPAPLP WPQG F+ G I +GE+EV I++F IWSC+ ATFYEP
Sbjct: 22 PEPFQLPAPLPDWPQGD-FSKGTICIGELEVASITKFQSIWSCS-------GATFYEPQE 73
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
IPDGF+ LGHY Q + RPL+GF+LVAR+ AS ++ + PAL+KPLDYTLVW S
Sbjct: 74 IPDGFHCLGHYAQQNDRPLQGFLLVAREAASCQS-------INLKPALEKPLDYTLVWTS 126
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
+ + C FW P PPDGY+++G++VT+ P KP LD VRCVR DLTD+CE F
Sbjct: 127 TDLNDDDNSDCGCFWSPSPPDGYEALGYVVTRGPKKPSLDAVRCVRGDLTDECEN----F 182
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+I+ + + +W+TRPC+RGM GRG+ VGTFFC ++ +E +I CLKN D L +M
Sbjct: 183 KSITNMGGNCY-IWNTRPCHRGMAGRGIPVGTFFCGTD---TEESDIPCLKNFDSSLSSM 238
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPS 319
PN +QI ALI +YGPTVFFHP E+YLPSSVSWFF NGA L+K G +G+ I SGSNLP+
Sbjct: 239 PNLEQIKALIEHYGPTVFFHPQEIYLPSSVSWFFENGATLHKKGKEMGDVILASGSNLPA 298
Query: 320 GGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
GG NDGE+WID+P + VK GN++SA+LYVH+KPA GGTFTDI MWVFCPFNGP T+
Sbjct: 299 GGTNDGEYWIDIPDGDRNEYVKAGNLKSAELYVHIKPAHGGTFTDIAMWVFCPFNGPATI 358
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KVG + A K+G+H GDWEHFTLRI NF+GEL SIYFSQHSGG WV A DLE+I GNKA
Sbjct: 359 KVGFASFALQKVGRHTGDWEHFTLRISNFSGELSSIYFSQHSGGDWVDACDLEFISGNKA 418
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
IVYS+K+GHAS+ HPG YL GSE G+GVRNDAARS++ VDSS +Y++++A LG+ V+
Sbjct: 419 IVYSAKDGHASYAHPGCYLLGSEKAGVGVRNDAARSDILVDSSTRYKIISAGNLGDAVI- 477
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WLQ+MR+WGPT+ Y+S++E+D ++ LP +R++ E ++ LP+ELY EEGPTGPKE
Sbjct: 478 EPCWLQYMREWGPTVEYNSRSEIDAVLSFLPFFLRFTAEAILNSLPVELYEEEGPTGPKE 537
Query: 560 KNNWVGDERG 569
KNNW GDERG
Sbjct: 538 KNNWEGDERG 547
>gi|125542356|gb|EAY88495.1| hypothetical protein OsI_09966 [Oryza sativa Indica Group]
Length = 547
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/550 (58%), Positives = 409/550 (74%), Gaps = 24/550 (4%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
P F LPAPLP WPQG F+ G I +GE+EV I++F IWSC+ ATFYEP
Sbjct: 22 PEPFQLPAPLPDWPQGD-FSKGTICIGELEVASITKFQSIWSCS-------GATFYEPQE 73
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
IPDGF+ LGHY Q + RPL+GF+LVAR+ AS ++ + PAL+KPLDYTLVW S
Sbjct: 74 IPDGFHCLGHYAQQNDRPLQGFLLVAREAASCQS-------INLKPALEKPLDYTLVWTS 126
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
+ + C FW P PPDGY+++G++VT+ P KP LD VRCV DLTD+CE F
Sbjct: 127 TDLNDDDNSDCGCFWSPSPPDGYEALGYVVTRGPKKPSLDAVRCVWGDLTDECEN----F 182
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+I+ + + +W+TRPC+RGM GRG+ VGTFFC ++ +E +I CLKN D L +M
Sbjct: 183 KSITNMGGNCY-IWNTRPCHRGMTGRGIPVGTFFCGTD---TEESDIPCLKNFDSSLSSM 238
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPS 319
PN +QI ALI +YGPTVFFHP E+YLPSSVSWFF NGA L+K G +G+ I SGSNLP+
Sbjct: 239 PNLEQIKALIEHYGPTVFFHPQEIYLPSSVSWFFENGATLHKKGKEMGDVILASGSNLPA 298
Query: 320 GGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
GG NDGE+WID+P + VK GN++SA+LYVH+KPA GGTFTDI MWVFCPFNGP T+
Sbjct: 299 GGTNDGEYWIDIPDGDRNEYVKAGNLKSAELYVHIKPAHGGTFTDIAMWVFCPFNGPATI 358
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KVG + A K+G+H GDWEHFTLRI NF+GEL SIYFSQHSGG WV A DLE+I GNKA
Sbjct: 359 KVGFASFALQKVGRHTGDWEHFTLRISNFSGELSSIYFSQHSGGDWVDACDLEFISGNKA 418
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
IVYS+K+GHAS+ HPG YL GSE G+GVRNDAARS++ VDSS +Y++++A +LG+ V+
Sbjct: 419 IVYSAKDGHASYAHPGCYLLGSEKAGVGVRNDAARSDILVDSSTRYKIISAGHLGDAVI- 477
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WLQ+MR+WGPT+ Y+S++E+D ++ LP +R++ E ++ LP+ELY EEGPTGPKE
Sbjct: 478 EPCWLQYMREWGPTVEYNSRSEIDAVLSFLPFFLRFTAEAILNSLPVELYEEEGPTGPKE 537
Query: 560 KNNWVGDERG 569
KNNW GDERG
Sbjct: 538 KNNWEGDERG 547
>gi|326499592|dbj|BAJ86107.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 553
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/549 (57%), Positives = 396/549 (72%), Gaps = 20/549 (3%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
P F LPAPLP WPQG+ FA G I +GE+++ I++F IWS A FYEP
Sbjct: 24 PEPFQLPAPLPAWPQGRDFAKGVICIGELDIANITQFRNIWS-------YSGAMFYEPEE 76
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
+PDGF+ LGHY Q + R L+G +LVA + AS + + PAL+KPLDYTLVW +
Sbjct: 77 VPDGFHCLGHYAQQNGRLLQGSLLVASEAASCQL-------INSRPALEKPLDYTLVWTN 129
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
+ + C FWLP PP GY+++G++VTK P KP L+ VRCVR DLTD CE +
Sbjct: 130 AGVYEDDNSECGCFWLPSPPKGYEALGYVVTKGPKKPSLEAVRCVRHDLTDPCENFRSVI 189
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+ VW TRPC+RG GRG+ VGTFFC +N ++ Q+ I CLKN DP L AM
Sbjct: 190 NL-----DRTCQVWKTRPCHRGTAGRGIPVGTFFCKTNSVNSQKSGIPCLKNCDPNLRAM 244
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPS 319
P+ +QIHALI++YGPTVFFHP E YLPSSVSWFF NGA L K +G+ I GSNLP+
Sbjct: 245 PDLEQIHALIKHYGPTVFFHPQEKYLPSSVSWFFENGATLNKKDMKMGDPILAGGSNLPA 304
Query: 320 GGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
GG NDGE+WIDLP D + VK GN++SA+LY HVKPA GGTFTDI MWVFCPFNGP T+
Sbjct: 305 GGTNDGEYWIDLPHDDMCEFVKVGNLKSAELYAHVKPAHGGTFTDIAMWVFCPFNGPATI 364
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KVG+ + A K+G+H GDWEHFTLRI NF+GEL S+Y+S+HSGG WV A DLE+I GNKA
Sbjct: 365 KVGLASFALQKVGRHTGDWEHFTLRISNFSGELSSVYYSEHSGGGWVDACDLEFISGNKA 424
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
IVYSS+NGHAS+ HPG YLQGSE LG+G+RND ARS+L +DSS +Y++++AEYLG+ VV
Sbjct: 425 IVYSSRNGHASYAHPGCYLQGSENLGVGLRNDVARSDLSIDSSTRYKIISAEYLGDAVV- 483
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WLQ+MR+WGPT+ Y+S++E+D ++ LP +R++ E LP ELY EEGPTGPKE
Sbjct: 484 EPCWLQYMREWGPTVTYNSRSEVDTVLSFLPFFLRFTAEAIFDSLPAELYEEEGPTGPKE 543
Query: 560 KNNWVGDER 568
K+NW GDER
Sbjct: 544 KDNWDGDER 552
>gi|357115423|ref|XP_003559488.1| PREDICTED: uncharacterized protein LOC100822588 [Brachypodium
distachyon]
Length = 587
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 316/580 (54%), Positives = 394/580 (67%), Gaps = 38/580 (6%)
Query: 20 PGTFSLPAPLPTWPQGQGFASGRINLG--EIEVCRISRFNFIWSCNLLQSKKK----SAT 73
P FS PLPTWPQG GFA GR+ +G E+E+ + F I C L S ++ S T
Sbjct: 14 PEPFSFAEPLPTWPQGNGFARGRMCVGGGELELAAATSFERI--CTLSPSPRRDCKSSVT 71
Query: 74 FYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDY 133
FY P +P+GF LGHYCQ DSRPL G +LVA+ A AH P L P DY
Sbjct: 72 FYRPLAVPEGFSVLGHYCQPDSRPLHGHLLVAKASAR-----AHHPPRSTQPPLCAPRDY 126
Query: 134 TLVWC--------------SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELD 179
L+W S G A+FWLP PP+GY+++GFLVT P KP L
Sbjct: 127 ELLWAFHASGGSAAAGGGRSSSSSSGYGRSDAYFWLPVPPEGYRALGFLVTADPGKPPLK 186
Query: 180 EVRCVRDDLTDKCEVH-HLIFDAISKFSSS----PFSVWSTRPCNRGMLGRGVSVGTFFC 234
+ CVR DLTD+CE H L+ +++ SSS F+V RP RGM +G+ GTF C
Sbjct: 187 DAGCVRTDLTDECEPHGSLLHLQLTRPSSSSPGEAFAVRGVRPLRRGMSEKGIGAGTFCC 246
Query: 235 S--SNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWF 292
+ + +E +ACL N++ L AMP +Q+HA+I++YGPTVFFHP EVYLPSSVSW+
Sbjct: 247 AVVGGCSTLREQGLACLNNVELDLSAMPTLEQLHAVIQHYGPTVFFHPKEVYLPSSVSWY 306
Query: 293 FTNGALLYKAG---DLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQI-VKHGNMESA 348
F NGA LY+ G D VGE ID GSNLP GG NDGE+WIDLP D R+ V G++ SA
Sbjct: 307 FKNGASLYRKGGGCDAVGEEIDGEGSNLPGGGCNDGEYWIDLPQDTQRRAAVCRGDIGSA 366
Query: 349 KLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNF 408
+LY HVKPA+GG TD+ MWVFCPFNGP LK+G++NV K GQHVGDWEHFTLR+ NF
Sbjct: 367 ELYAHVKPAMGGACTDVAMWVFCPFNGPARLKLGVLNVPLGKTGQHVGDWEHFTLRVSNF 426
Query: 409 TGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGV 468
TGEL ++YFSQHSGG+WV A LEY GN+ +VYSS+NGHAS+P+P YLQGS LGIG+
Sbjct: 427 TGELMAVYFSQHSGGRWVDASALEYAAGNRPVVYSSRNGHASYPYPAVYLQGSAALGIGI 486
Query: 469 RNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKL 528
RNDAARS L+VDSS++Y +VAAEYLG+GVVAEP WLQ+MR+WGPT+VY+S+ +++I+
Sbjct: 487 RNDAARSKLFVDSSVKYRIVAAEYLGDGVVAEPQWLQYMREWGPTVVYNSRKRMERIVGR 546
Query: 529 LPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
LP +R + EN ++K+P EL EEGPTGPKEKN W GDER
Sbjct: 547 LPRGLRDTAENILNKMPNELSKEEGPTGPKEKNMWEGDER 586
>gi|326519300|dbj|BAJ96649.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 585
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 307/574 (53%), Positives = 386/574 (67%), Gaps = 38/574 (6%)
Query: 23 FSLPAPLPTWPQGQGFASGRINL--GEIEVCRISRFNFIWSCNLLQSKK----KSATFYE 76
FS P PLP WP G GFA GRI + GE+E+ + F+ I C L S + ATFY
Sbjct: 21 FSFPEPLPAWPHGGGFAEGRIRVAGGELELAAATAFDKI--CTLSPSARLQRCDGATFYR 78
Query: 77 PAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLV 136
P G+P+GF LGHYCQ +SR L G +LVAR A P L+ P DY LV
Sbjct: 79 PRGVPEGFTVLGHYCQPNSRRLHGHLLVAR--------AADPPRSTTEPPLRAPRDYELV 130
Query: 137 WC----SDEGGQGNYEGC-------AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVR 185
W G GN C A+FWLP PP+GY+++G LVT P KP LDEV CVR
Sbjct: 131 WAFHATGGAGTSGNAGPCSGYGRSDAYFWLPVPPEGYRALGILVTTAPGKPALDEVGCVR 190
Query: 186 DDLTDKCEVHH--LIFDAISKFSSSP---FSVWSTRPCNRGMLGRGVSVGTFFCSS---- 236
DL D+CE H L S+SP F+V RP RGM +G+ GTF+C++
Sbjct: 191 ADLADECEPHGSLLRLQLTRPTSASPGKVFAVRGVRPLCRGMREKGIGAGTFWCAATDGC 250
Query: 237 NWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNG 296
+ + E +ACLKN+D L AMP +Q+HA+I++YGPT++FHP EVYLPSSV+W+F NG
Sbjct: 251 SSPTPSEQGLACLKNVDLDLSAMPTLEQVHAVIQHYGPTLYFHPKEVYLPSSVAWYFKNG 310
Query: 297 ALLYK-AGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGR-QIVKHGNMESAKLYVHV 354
A L+K G VGE ID GSNLP GG NDGE+W+D+P+D R Q + G++ESA+LY HV
Sbjct: 311 AALFKKGGGAVGEEIDAEGSNLPGGGWNDGEYWMDIPADSKRRQAILRGDIESAELYAHV 370
Query: 355 KPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWS 414
KPA+GGT TD+ MWVFCPFNGP LK+G +N+ GQHVGDWEHFTLR+ NFTGEL +
Sbjct: 371 KPAMGGTCTDVAMWVFCPFNGPARLKLGPINLPLGTTGQHVGDWEHFTLRVSNFTGELMA 430
Query: 415 IYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR 474
+Y+SQHSGG+WV A LEY GN+ VYSS+NGHAS+P G YLQGS LG+G+ N+AAR
Sbjct: 431 VYYSQHSGGRWVDAAKLEYAAGNRPAVYSSRNGHASYPRAGVYLQGSAALGVGILNEAAR 490
Query: 475 SNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIR 534
S L VDSS++Y +VAAEYLG+GVV EP WLQFMR+WGPT++Y S+T ++++K +P +
Sbjct: 491 SKLSVDSSVKYRVVAAEYLGDGVVVEPQWLQFMREWGPTVIYRSRTGTERMVKSMPQRLS 550
Query: 535 YSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
EN ++K+P EL EEGPTGPKEKN W GDER
Sbjct: 551 CPAENMLNKMPNELSKEEGPTGPKEKNMWEGDER 584
>gi|308080672|ref|NP_001183917.1| uncharacterized protein LOC100502510 [Zea mays]
gi|238015422|gb|ACR38746.1| unknown [Zea mays]
gi|413933236|gb|AFW67787.1| hypothetical protein ZEAMMB73_320181 [Zea mays]
Length = 579
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 307/556 (55%), Positives = 386/556 (69%), Gaps = 15/556 (2%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLG--EIEVCRISRFNFIWSCNLLQSKKKS---ATFYE 76
FS PLP+WP G GFA GRI +G E+E+ + F I C L ++ ATFY
Sbjct: 29 VFSFAEPLPSWPTGGGFARGRIRIGGGELELAAATAFQKI--CTLSSRRRGGGGSATFYR 86
Query: 77 PAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLV 136
P G+P+GF LG+YCQ + RPL G VLVAR A GA S P L KPLDYTL+
Sbjct: 87 PVGVPEGFSLLGYYCQPNCRPLHGHVLVAR--AGERPSGA--SPQPPPPPLCKPLDYTLI 142
Query: 137 W---CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCE 193
W S + G G A+FW+P PP+ Y+++G LVT P +P L+EV CVR+DLTD+CE
Sbjct: 143 WEFQASGDTGYGYGRSDAYFWIPVPPERYRALGCLVTTEPRRPPLEEVVCVREDLTDECE 202
Query: 194 VHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLD 253
H + SS F+V RP RG+ +GV VGTF C+++ + +E +ACL N++
Sbjct: 203 PHGSLLCLEQPRPSSFFTVRGVRPVRRGIREKGVCVGTFCCAADGLPPREHGMACLGNVE 262
Query: 254 PKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK-AGDLVGEAIDP 312
L AMP +Q HA+IR+YGPT+FFHP EVYLPSSVSWFF NGA LY+ GD GE ID
Sbjct: 263 LDLSAMPTLEQAHAVIRHYGPTLFFHPKEVYLPSSVSWFFKNGAALYRRGGDTAGEEIDG 322
Query: 313 SGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCP 372
GSNLP GG NDGEFWID+PS + V G+++SA+LY HVKPA+GGT TD+ MWVFCP
Sbjct: 323 EGSNLPGGGCNDGEFWIDVPSGERGRAVCRGSIDSAELYAHVKPAMGGTCTDVAMWVFCP 382
Query: 373 FNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE 432
FNGP LK+G +++ K G+HVGDWEHFT R+ N TGEL +Y+SQHSGG WV A LE
Sbjct: 383 FNGPARLKLGPISLPLGKTGRHVGDWEHFTFRVSNLTGELMGVYYSQHSGGHWVDASALE 442
Query: 433 YIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEY 492
YI+GN+ +VYSS+NGHAS+ +PG YLQGS LG+G+RNDAARS+L+VDSS +Y +VAA Y
Sbjct: 443 YIDGNRPVVYSSRNGHASYAYPGVYLQGSAALGVGIRNDAARSSLFVDSSARYRIVAAGY 502
Query: 493 LGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEE 552
LGEG V EP WLQFMR+WGPT+VY + ++ +I LP +R E+ ++K+P EL EE
Sbjct: 503 LGEGAVVEPQWLQFMREWGPTVVYKWRKRMEWVIGRLPPWLRCRAESMLNKMPNELSREE 562
Query: 553 GPTGPKEKNNWVGDER 568
GPTGPKEKNNW GDER
Sbjct: 563 GPTGPKEKNNWEGDER 578
>gi|115455005|ref|NP_001051103.1| Os03g0721300 [Oryza sativa Japonica Group]
gi|21321777|gb|AAM47300.1|AF377946_2 OSJNBa0031O09.02 [Oryza sativa Japonica Group]
gi|50540703|gb|AAT77860.1| expressed protein [Oryza sativa Japonica Group]
gi|62733587|gb|AAX95704.1| Plant protein of unknown function (DUF946) [Oryza sativa Japonica
Group]
gi|108710795|gb|ABF98590.1| pre-mRNA processing protein PRP39, putative, expressed [Oryza
sativa Japonica Group]
gi|113549574|dbj|BAF13017.1| Os03g0721300 [Oryza sativa Japonica Group]
gi|125587740|gb|EAZ28404.1| hypothetical protein OsJ_12384 [Oryza sativa Japonica Group]
gi|215701423|dbj|BAG92847.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 576
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 301/569 (52%), Positives = 380/569 (66%), Gaps = 26/569 (4%)
Query: 18 TEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEP 77
TEP F+ LP WPQG GFA+GRI +GE+E+ ++ F I + + + TFY P
Sbjct: 15 TEPARFAFADALPAWPQGGGFATGRICVGELELAAVTAFEKICALSATKGGGGGVTFYRP 74
Query: 78 AGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASS-EAEGAHTSNLFKSPALQKPLDYTLV 136
AG+P GF LGHYCQ ++RPL G +LVA+ +A E+E P L+ P DY LV
Sbjct: 75 AGVPGGFSVLGHYCQSNTRPLHGHLLVAKAVAGKPESE--------SLPPLRPPHDYELV 126
Query: 137 WCSDEGGQG-NYEGC-------AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
G G + + C A+FWLP P DGY+++G LVT P+KP L EV C R DL
Sbjct: 127 CAFRADGVGEDRKSCRGYGRTGAYFWLPVPTDGYRALGLLVTAEPDKPPLREVACARADL 186
Query: 189 TDKCEVH------HLIFDAISKFSSS---PFSVWSTRPCNRGMLGRGVSVGTFFCSSNWI 239
TD+CE H L+ + SS+ F++ RP +RGM GRG+ GTF C + +
Sbjct: 187 TDECEPHGSLLQLQLVGQSACWSSSTVPAAFALRGIRPTHRGMWGRGIGAGTFCCGAVGL 246
Query: 240 SGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALL 299
S +E +ACLKN+D L AMP +Q HA+IR+YGPT++FHP EVYLPSSVSWFF NGA L
Sbjct: 247 SPREQGMACLKNVDLDLSAMPTLEQAHAVIRHYGPTLYFHPKEVYLPSSVSWFFKNGAAL 306
Query: 300 YKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVG 359
K G+ +D GS+LP G NDGE+WI LP + + +G+++SA+LY HVKPA+G
Sbjct: 307 CKKGEDAAVELDVEGSHLPCGECNDGEYWIGLPDGKRGESIIYGDIDSAELYAHVKPAMG 366
Query: 360 GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
GT TD+ MWVFCPFNGP K+G + + K GQH+GDWEHFTLR+ NFTGEL ++YFSQ
Sbjct: 367 GTCTDVAMWVFCPFNGPARFKLGPITIPLGKTGQHIGDWEHFTLRVSNFTGELMAVYFSQ 426
Query: 420 HSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYV 479
HSGG+WV A LEY GNK VYSS+NGHAS+P PG YLQGS LGIG+RNDAARS L V
Sbjct: 427 HSGGRWVDASALEYTAGNKPAVYSSRNGHASYPFPGVYLQGSAALGIGIRNDAARSELAV 486
Query: 480 DSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVEN 539
DSS +Y +VAAEYLGEG V EP WL FMR WGPT+VY S+ ++++ + +R E
Sbjct: 487 DSSAKYRIVAAEYLGEGAVEEPRWLNFMRVWGPTVVYKSRQRMERMTSAMHRRLRSPAER 546
Query: 540 AVSKLPLELYGEEGPTGPKEKNNWVGDER 568
++KLP EL EEGPTGPKEKNNW GDER
Sbjct: 547 MLNKLPNELSREEGPTGPKEKNNWEGDER 575
>gi|125545546|gb|EAY91685.1| hypothetical protein OsI_13324 [Oryza sativa Indica Group]
Length = 576
Score = 601 bits (1550), Expect = e-169, Method: Compositional matrix adjust.
Identities = 301/569 (52%), Positives = 381/569 (66%), Gaps = 26/569 (4%)
Query: 18 TEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEP 77
TEP F+ LP WPQG GFA+GRI +GE+E+ ++ F I + + + TFY P
Sbjct: 15 TEPARFAFADALPAWPQGGGFATGRICVGELELAAVTAFEKICALSATKGGGGGVTFYRP 74
Query: 78 AGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASS-EAEGAHTSNLFKSPALQKPLDYTLV 136
AG+P+GF LGHYCQ ++RPL G +LVA+ +A E+E P L+ P DY LV
Sbjct: 75 AGVPEGFSVLGHYCQPNTRPLHGHLLVAKAVAGKPESE--------SLPPLRPPHDYELV 126
Query: 137 WCSDEGGQG-NYEGC-------AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
G G + + C A+FWLP P DGY+++G LVT P+KP L EV C R DL
Sbjct: 127 CAFRADGVGEDRKSCRGYGRTGAYFWLPVPTDGYRALGLLVTAEPDKPPLREVACARADL 186
Query: 189 TDKCEVH------HLIFDAISKFSSS---PFSVWSTRPCNRGMLGRGVSVGTFFCSSNWI 239
TD+CE H L+ + SS+ F++ RP +RGM GRG+ GTF C + +
Sbjct: 187 TDECEPHGSLLQLQLVGQSACWSSSTVPAAFALRGIRPTHRGMWGRGIGAGTFCCGAVGL 246
Query: 240 SGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALL 299
S +E +ACLKN+D L AMP +Q HA+IR+YGPT++FHP EVYLPSSVSWFF NGA L
Sbjct: 247 SPREQGMACLKNVDLDLSAMPTLEQAHAVIRHYGPTLYFHPKEVYLPSSVSWFFKNGAAL 306
Query: 300 YKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVG 359
K G+ +D GS+LP G NDGE+WI LP + + +G+++SA+LY HVKPA+G
Sbjct: 307 CKKGEDAAVELDGEGSHLPCGECNDGEYWIGLPDGKRGESIIYGDIDSAELYAHVKPAMG 366
Query: 360 GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
GT TD+ MWVFCPFNGP K+G + + K GQH+GDWEHFTLR+ NFTGEL ++YFSQ
Sbjct: 367 GTCTDVAMWVFCPFNGPARFKLGPITIPLGKTGQHIGDWEHFTLRVSNFTGELMAVYFSQ 426
Query: 420 HSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYV 479
HSGG+WV A LEY GNK VYSS+NGHAS+P PG YLQGS LGIG+RNDAARS L V
Sbjct: 427 HSGGRWVDASALEYTAGNKPAVYSSRNGHASYPFPGVYLQGSAALGIGIRNDAARSELAV 486
Query: 480 DSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVEN 539
DSS +Y +VAAEYLGEG V EP WL FMR WGPT+VY S+ ++++ + +R E
Sbjct: 487 DSSAKYRIVAAEYLGEGAVEEPRWLNFMRVWGPTVVYKSRQRMERMTSAMHRRLRSPAER 546
Query: 540 AVSKLPLELYGEEGPTGPKEKNNWVGDER 568
++KLP EL EEGPTGPKEKNNW GDER
Sbjct: 547 MLNKLPNELSREEGPTGPKEKNNWEGDER 575
>gi|242033149|ref|XP_002463969.1| hypothetical protein SORBIDRAFT_01g009850 [Sorghum bicolor]
gi|241917823|gb|EER90967.1| hypothetical protein SORBIDRAFT_01g009850 [Sorghum bicolor]
Length = 579
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 305/560 (54%), Positives = 380/560 (67%), Gaps = 16/560 (2%)
Query: 19 EPGTFSLPAPLPTWPQGQGFASGRINLG--EIEVCRISRFNFIWSCNLLQSKKKSATFYE 76
EP FS PLP WP G GFA GRI +G E+E+ + F I + + + SATFY
Sbjct: 25 EPEVFSFAHPLPFWPPGGGFAQGRIRIGGGELELAAATAFQKICTLSSRRRGGGSATFYR 84
Query: 77 PAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLV 136
P G+P GF LG+YCQ + RPL G VLVAR A + P L KPLDY L+
Sbjct: 85 PVGVPAGFSLLGYYCQPNCRPLHGHVLVAR------AGERPPGATPQPPPLAKPLDYALI 138
Query: 137 WCSDEGG------QGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
W GG G A+FW+P PP+GY ++G LVT P L+EV CVR+DLTD
Sbjct: 139 WEFHAGGSRANAGHGYGRSDAYFWVPVPPEGYWAIGCLVTTERQSPPLEEVACVREDLTD 198
Query: 191 KCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLK 250
+CE H + SS F+V RP RGM +GV VGTF C+++ + +E +ACL
Sbjct: 199 ECEPHGSLLRLQLPGPSSFFTVQGVRPVRRGMWRKGVCVGTFCCAADGLPPREHGMACLG 258
Query: 251 NLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAG--DLVGE 308
N++ L AMP +Q HA+IR+YGPT+FFHP EVYLPSSVSWFF NGA LYK G GE
Sbjct: 259 NVELDLSAMPTLEQAHAVIRHYGPTLFFHPKEVYLPSSVSWFFKNGAALYKRGGDTAAGE 318
Query: 309 AIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMW 368
ID GSNLP GG NDGE+WID+PS + V GN++SA+LY HVKPA+GGT TD+ MW
Sbjct: 319 EIDGEGSNLPGGGCNDGEYWIDVPSGERGRAVCRGNIDSAELYAHVKPAMGGTCTDVAMW 378
Query: 369 VFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAA 428
VFCPFNGP LK+G +++ K G+HVGDWEH T R+ N TGEL +Y+SQHSGG WV A
Sbjct: 379 VFCPFNGPARLKLGPISLPLGKTGRHVGDWEHSTFRVSNLTGELMGVYYSQHSGGHWVDA 438
Query: 429 YDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELV 488
LEYI+GN+ +VYSS+NGHAS+ +PG YLQGS LGIG+RNDAARS+L+VDSS +Y +V
Sbjct: 439 SALEYIDGNRPVVYSSRNGHASYAYPGVYLQGSAALGIGIRNDAARSSLFVDSSARYRIV 498
Query: 489 AAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLEL 548
AAEYLGEG V EP WLQFMR+WGPT+VY S+ ++ ++ LP ++ E+ ++K+P EL
Sbjct: 499 AAEYLGEGAVVEPQWLQFMREWGPTVVYKSRKRMEWMVGRLPPRLKCRAESMLNKMPNEL 558
Query: 549 YGEEGPTGPKEKNNWVGDER 568
EEGPTGPKEKNNW GDER
Sbjct: 559 SREEGPTGPKEKNNWEGDER 578
>gi|225450265|ref|XP_002269745.1| PREDICTED: uncharacterized protein LOC100242579 [Vitis vinifera]
Length = 559
Score = 543 bits (1398), Expect = e-151, Method: Compositional matrix adjust.
Identities = 270/549 (49%), Positives = 366/549 (66%), Gaps = 18/549 (3%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGI 80
+F LPAP+PTWP G+GFA G I+LG +EVC+IS F +W+ + ATF+EP+ +
Sbjct: 25 SFKLPAPMPTWPPGEGFAGGTIDLGGGLEVCQISSFTKVWATHEGGPDNLGATFFEPSPM 84
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
P+ F LG Y Q + +PL G+VL +D ++ GA L+KP+DYTLVW S
Sbjct: 85 PEEFCMLGCYSQPNKKPLFGWVLAGKDNTNNPLGGA----------LKKPIDYTLVWSS- 133
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFD 200
E + +G + WLP PPDGYK++G ++T P KP LD++RCVR DLTD CE I+
Sbjct: 134 ESLKIKQDGNGYVWLPTPPDGYKAVGHVITNLPEKPSLDKIRCVRSDLTDLCEADTWIWG 193
Query: 201 AISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMP 260
+++ F+V+ +P +RG VSVGTF + N + L++ACLKN+ L+AMP
Sbjct: 194 LDKVANANGFNVYDLQPSDRGAQALAVSVGTF-VAQNGGAASSLSLACLKNISSNLYAMP 252
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPS 319
N Q+ AL++ Y P V+ HPDE YLPSSVSWFF NGALLY+ G+ IDP+GSNLP
Sbjct: 253 NLAQVDALVQAYSPWVYLHPDEQYLPSSVSWFFQNGALLYQQGEESKPVNIDPTGSNLPQ 312
Query: 320 GGRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
GG NDG +W+DLP D ++ VK G+++ + Y+H+KP +G TFTDI +W+F PFNGP
Sbjct: 313 GGSNDGAYWLDLPVDAKAKETVKKGDLQDSMTYMHIKPMLGATFTDIAIWIFYPFNGPAR 372
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNK 438
KV ++N++ KIG+HVGDWEH TLR+ NF G+L +++FS+HS G WV A +LE+ GNK
Sbjct: 373 AKVELINISLGKIGEHVGDWEHVTLRVSNFNGKLMNVFFSEHSSGTWVNASELEFQNGNK 432
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV 498
+ Y+S +GHA +P PG LQGS +GI RND A+S + +D+ +Y +VAAEYLG VV
Sbjct: 433 VVSYASLHGHAFYPKPGLVLQGSGAMGI--RNDTAKSKMVMDTGTRYIVVAAEYLGSAVV 490
Query: 499 AEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPK 558
EP WL + RKWGP I YD E+ ++ KLLP ++ + E V LP E+ GEEGPTGPK
Sbjct: 491 -EPPWLNYYRKWGPKISYDLAAEIKEVEKLLPGKLKSAFEKLVKSLPNEILGEEGPTGPK 549
Query: 559 EKNNWVGDE 567
K NW GDE
Sbjct: 550 MKKNWDGDE 558
>gi|255565176|ref|XP_002523580.1| conserved hypothetical protein [Ricinus communis]
gi|223537142|gb|EEF38775.1| conserved hypothetical protein [Ricinus communis]
Length = 551
Score = 542 bits (1397), Expect = e-151, Method: Compositional matrix adjust.
Identities = 272/566 (48%), Positives = 364/566 (64%), Gaps = 23/566 (4%)
Query: 6 CFYWNEVNNMSPTEP--GTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCN 63
C +V+ T P +F LP+ LP WP G+GF +G I+LG ++VC+IS N +WS
Sbjct: 4 CLSIKDVSKKRKTLPIETSFKLPSSLPKWPPGEGFGNGTIDLGGLQVCQISSLNKVWSTQ 63
Query: 64 LLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFK 123
A+F+EP+ IP GF+ LG YCQ +SRPL G+VL +D A
Sbjct: 64 EGGPDNVGASFFEPSQIPQGFFMLGCYCQSNSRPLYGWVLAGKDDAGG------------ 111
Query: 124 SPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRC 183
AL+KP+DYTLVW S E + + + WLP PDGY+++G +VT +P KP LD++RC
Sbjct: 112 --ALEKPIDYTLVWSS-ESLKIKQDSIGYIWLPTAPDGYRAVGLVVTSSPEKPSLDKMRC 168
Query: 184 VRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQE 243
VR DLTD+CE ++ + + F+V+S RP NRG GV VGTF + N
Sbjct: 169 VRSDLTDQCEKDIWVWGPEKQSDPNGFNVFSLRPSNRGTQAMGVCVGTF-VAQNGNDNNP 227
Query: 244 LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAG 303
++ CLKN+ L MPN +QI A+ + Y P ++FHPDE YLPSSVSW+F NGALLYK G
Sbjct: 228 ASLVCLKNVSNNLSCMPNENQIQAIFQAYSPWIYFHPDEEYLPSSVSWYFNNGALLYKQG 287
Query: 304 DLVGEA-IDPSGSNLPSGGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGT 361
+ ++ +GSNLP GG NDG +W+DLP D + VK G+++ ++Y+H+KP G T
Sbjct: 288 EESNPVKVETNGSNLPQGGANDGVYWLDLPVDEKAKDRVKKGDLQETEVYLHIKPMFGAT 347
Query: 362 FTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
FTDIV+WVFCPFNGP K+ +NV KIG+HVGDWEH TLR+ NF ELWS+YFS+HS
Sbjct: 348 FTDIVIWVFCPFNGPAKAKIEFLNVPLGKIGEHVGDWEHVTLRVSNFNSELWSVYFSEHS 407
Query: 422 GGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDS 481
GG W+ A LE+ GNK + YSS +GHA + PG LQG+ GIG+RND A+S +D+
Sbjct: 408 GGSWINASQLEFQSGNKFVGYSSLHGHAMYSKPGLVLQGNN--GIGIRNDTAKSGKILDT 465
Query: 482 SIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAV 541
++ +VAAEYLG ++ EP W+ F RKWGP I Y+ E+ KI K+LP ++ + E V
Sbjct: 466 GAKFSVVAAEYLGMTII-EPPWINFFRKWGPKISYNIADEIKKIEKILPGKLKSAFEKVV 524
Query: 542 SKLPLELYGEEGPTGPKEKNNWVGDE 567
+ LP E+ GEEGPTGPK K NW GDE
Sbjct: 525 NGLPSEVLGEEGPTGPKVKGNWTGDE 550
>gi|356557046|ref|XP_003546829.1| PREDICTED: uncharacterized protein LOC100806731 [Glycine max]
Length = 548
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 272/549 (49%), Positives = 357/549 (65%), Gaps = 22/549 (4%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
TF LPA +P WPQG GFA+G IN+G +++ +IS FN +W A F+EPAGIP
Sbjct: 18 TFKLPANIPVWPQGGGFATGTINIGGLKLFQISTFNKVWKTLEGGPGDAGAAFFEPAGIP 77
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
+GF++LGHY Q +++PL G +LVA+D SS AL+KP+DYTLVW S +
Sbjct: 78 EGFFTLGHYSQPNNKPLFGSILVAKDETSSGDNNG---------ALKKPIDYTLVWSS-K 127
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ + + WLP PDGYK++G +VT TP KP LD++RCVR DLTD+CE I+
Sbjct: 128 SQKIKQDKDGYIWLPTAPDGYKTLGHVVTTTPEKPSLDKIRCVRSDLTDQCERSSWIWGP 187
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHA-MP 260
F+V+ RP NRG GV VGTFF + I L IACLKN + + MP
Sbjct: 188 DKSSDEKGFNVYEVRPSNRGTQAPGVLVGTFFAHNGEIP-SPLPIACLKNTNMNFSSSMP 246
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPS 319
N Q+ AL++ Y P ++ HPDE + P+S W+FTNGALL K G+ IDP+GSNLP
Sbjct: 247 NLPQVKALVQAYSPFMYLHPDEEFQPASTKWYFTNGALLVKKGEESKPVPIDPTGSNLPQ 306
Query: 320 GGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
GG NDGE+W+DLPSD ++ VK G+ +S + YVH KP GGTFTD+VMWVF PFNGPGT
Sbjct: 307 GGNNDGEYWLDLPSDKANKERVKKGDFKSCQAYVHAKPMFGGTFTDLVMWVFYPFNGPGT 366
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNK 438
KVG++++ KIG+H+GDWEH TLR+ NF GEL +Y SQHS G+WV A LE+ GNK
Sbjct: 367 AKVGLIDIPLGKIGEHIGDWEHVTLRVSNFDGELKKVYLSQHSNGQWVEASQLEFQSGNK 426
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV 498
++ YSS NGHA +P G +QG + GIG++N+ +S +D + +E+V+ EYLG +V
Sbjct: 427 SVCYSSLNGHAIYPKAGLVMQGLD--GIGIKNETKKSEKVIDMGVGFEVVSGEYLGSAIV 484
Query: 499 AEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPK 558
EP WL F+R+WGP I YD ELD + K+ P ++E LP EL G+EGPTGPK
Sbjct: 485 -EPPWLNFLRQWGPKITYDIAKELDNLAKVFP-----ALEALEDGLPNELLGQEGPTGPK 538
Query: 559 EKNNWVGDE 567
K NW GDE
Sbjct: 539 LKRNWQGDE 547
>gi|356528641|ref|XP_003532908.1| PREDICTED: uncharacterized protein LOC100792266 [Glycine max]
Length = 549
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 275/549 (50%), Positives = 360/549 (65%), Gaps = 21/549 (3%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
TF LPA +P WPQG FA+G IN+G +++ +IS FN +W ATF+EPAG+P
Sbjct: 18 TFKLPANIPVWPQGGEFATGTINIGGLKLFQISIFNKVWKTLEGGPGDAGATFFEPAGVP 77
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
+GF++LGHY Q +++PL G +LVA+D +SS + + AL+KP+DYTLVW S +
Sbjct: 78 EGFFTLGHYSQPNNKPLFGSILVAKDESSSSGD--------NNGALKKPVDYTLVWSS-K 128
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ + + WLP PDGYK++G +VT TP KP LD++RCVR DLTD+CE + I+
Sbjct: 129 SQKIKQDKDGYIWLPTAPDGYKTLGHVVTTTPEKPSLDKIRCVRSDLTDQCERNSWIWGP 188
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHA-MP 260
F+V RP NRG GV VGTFF + N + L IACLKN + MP
Sbjct: 189 AKSSDEKGFNVHEVRPSNRGTQAPGVLVGTFF-AHNCEAPSPLPIACLKNTSMNFSSSMP 247
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGE-AIDPSGSNLPS 319
N Q+ AL++ Y P ++ HPDE + P+S W+FTNGALL K G+ AIDP+GSNLP
Sbjct: 248 NLPQVRALVQAYSPFMYLHPDEDFQPASTKWYFTNGALLVKKGEESKPVAIDPTGSNLPQ 307
Query: 320 GGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
GG NDGEFW+DLPSD ++ VK G+ +S + YVH KP GGTFTD+VMWVF PFNGPGT
Sbjct: 308 GGNNDGEFWLDLPSDKANKERVKKGDFKSCQAYVHAKPMFGGTFTDLVMWVFYPFNGPGT 367
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNK 438
KVG++++ KIG+H+GDWEH TLR+ NF GEL +Y SQHS G+WV A LE+ GNK
Sbjct: 368 AKVGLIDIPLGKIGEHIGDWEHVTLRVSNFNGELKRVYLSQHSKGQWVEAPQLEFQSGNK 427
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV 498
A+ YSS NGHA +P G +QG + GIG++N+ RS +D + +E+V+ EYLG +V
Sbjct: 428 AVCYSSLNGHAIYPKVGLVMQGLD--GIGIKNETKRSEKVIDMGVGFEVVSGEYLGSAIV 485
Query: 499 AEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPK 558
EP WL F R+WGP I YD LDK+ K+ P +++ LP EL GEEGPTGPK
Sbjct: 486 -EPPWLNFFRQWGPKITYDIAKVLDKLEKVFP-----ALQGLEDSLPKELLGEEGPTGPK 539
Query: 559 EKNNWVGDE 567
K NW GDE
Sbjct: 540 LKRNWSGDE 548
>gi|224143636|ref|XP_002325025.1| predicted protein [Populus trichocarpa]
gi|222866459|gb|EEF03590.1| predicted protein [Populus trichocarpa]
Length = 546
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 268/548 (48%), Positives = 357/548 (65%), Gaps = 23/548 (4%)
Query: 23 FSLPAPLPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
F LP+PLP+ PQG+GF +G INLG ++VC++ N IW+ N ATF+EP+ +P
Sbjct: 18 FKLPSPLPSLPQGEGFGNGIINLGSGLQVCQVVSLNKIWATNEGGPDNLGATFFEPSQMP 77
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG Y Q ++R L G+VL +D AL KP+DYTLVW S E
Sbjct: 78 QGFSMLGCYSQPNNRMLHGWVLAGKDETGE--------------ALTKPVDYTLVWSS-E 122
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
Q EG + WLP P DGYK++G +VT +P KP L++VRCVR DLTD C++ I+
Sbjct: 123 SLQIKQEGVGYIWLPTPLDGYKALGCVVTTSPEKPSLEKVRCVRSDLTDDCQIDKWIWGQ 182
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
+ + F+V+S RP NRG GVSVGTF + + +++ACLKN++ L MP
Sbjct: 183 GKESDPNGFNVFSLRPSNRGTQAMGVSVGTFVAQNT--TSASISLACLKNIESNLSCMPT 240
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSG 320
DQ+ A+ + Y P ++FHPDE YLPSS W+FTNGALLYK + I+ +GSNLP G
Sbjct: 241 LDQVQAIFQAYSPRIYFHPDEEYLPSSARWYFTNGALLYKKDEESKPVPIESTGSNLPQG 300
Query: 321 GRNDGEFWIDLP-SDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
G NDG +W+DLP + R+ VK G+++ ++Y+H+KP +G TFTDIV W+F PFNGPG
Sbjct: 301 GSNDGSYWLDLPVEEAARERVKRGDLQDCQVYLHIKPMLGATFTDIVAWLFYPFNGPGRA 360
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KV ++ V K GQHVGDWEH TLRI NF G+L IY S HSGG W A +LE+ +GNKA
Sbjct: 361 KVDLITVPLGKTGQHVGDWEHVTLRISNFNGQLCGIYLSAHSGGTWYDASELEFQDGNKA 420
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
++YSS NGHA + PG LQGS IG+RND A+SN ++D+ ++ +V+AEY+G VV
Sbjct: 421 VIYSSLNGHAMYAKPGLVLQGSG--NIGIRNDTAKSNKFIDTGARFSVVSAEYMGSSVV- 477
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WL ++RKWGP I YD E + + KLLP ++ + + V LP E++GEEGPTGPK
Sbjct: 478 EPPWLNYLRKWGPNITYDLAEEAETVEKLLPGKLKSAFDKFVKSLPKEIFGEEGPTGPKL 537
Query: 560 KNNWVGDE 567
K NW GDE
Sbjct: 538 KRNWTGDE 545
>gi|356560867|ref|XP_003548708.1| PREDICTED: uncharacterized protein LOC100808166 [Glycine max]
Length = 556
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 282/552 (51%), Positives = 365/552 (66%), Gaps = 30/552 (5%)
Query: 23 FSLPAPLP--TWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGI 80
F LP P+ +WPQG FASG I+LG ++V +IS FN +W + T +E +GI
Sbjct: 27 FKLPVPVANSSWPQGGNFASGIIDLGGLQVSQISTFNKVWGTYGGGPDNQGFTMFEASGI 86
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
P GF+ LG YCQ +++PL G+VLVA+D++ S TSN P L++P+DYTLVW S
Sbjct: 87 PQGFFMLGSYCQPNNKPLFGWVLVAKDVSKS------TSN----PTLKQPIDYTLVWNS- 135
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFD 200
+ N +G + WLP PDGYK++G +VT TPNKP D++RCVR DLTD+CE LI++
Sbjct: 136 ASMKINQDGPIYVWLPTAPDGYKALGHVVTTTPNKPSFDKIRCVRLDLTDQCETSLLIWE 195
Query: 201 AISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKL-HAM 259
S F+V+ RP NRG+ GV VGTF + S + +IACLKN + M
Sbjct: 196 ------SGSFNVYDVRPSNRGIQAPGVRVGTFVAQNG--STEPPSIACLKNNTNAIPKYM 247
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGE-AIDPSGSNLP 318
PN QI A+++ Y PT+ HPDE YLPSSV WFF+NGALLYK G +I P+G+NLP
Sbjct: 248 PNLQQIKAIMQVYSPTMCLHPDEEYLPSSVDWFFSNGALLYKKGQESNPVSIAPNGTNLP 307
Query: 319 SGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPG 377
NDG +W+DLPSD ++ VK G+++SAK YVHVKP +GGTFTDI MWVF PFNGP
Sbjct: 308 QDHNNDGAYWLDLPSDAANKERVKKGDLQSAKSYVHVKPMLGGTFTDIAMWVFYPFNGPS 367
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE-- 435
KV +NV KIG+HVGDWEH TLR+ NF GELW +YFSQHS G WV + +E+
Sbjct: 368 RAKVEFINVKLGKIGEHVGDWEHVTLRVSNFNGELWQVYFSQHSKGAWVDSSQIEFQSGG 427
Query: 436 GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGE 495
GNK + YSS +GHAS+PH G LQG++ IG+RND A+ + +D +ELV+AEYLG
Sbjct: 428 GNKPLYYSSLHGHASYPHAGLNLQGTD--NIGIRNDTAKGDKMMDMG-AFELVSAEYLGY 484
Query: 496 GVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPT 555
V+ EP WL + R+WGP I Y EL KI K LP ++ ++E + LP E+ GEEGPT
Sbjct: 485 AVI-EPPWLNYFREWGPKIDYSIDVELKKIEKFLPKNLKDALEKIMRSLPSEVLGEEGPT 543
Query: 556 GPKEKNNWVGDE 567
GPK KNNW+GDE
Sbjct: 544 GPKVKNNWIGDE 555
>gi|168047159|ref|XP_001776039.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672697|gb|EDQ59231.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 537
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 258/542 (47%), Positives = 356/542 (65%), Gaps = 14/542 (2%)
Query: 35 GQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFD 94
G GFA+G I+LGE++V +++ F +W+ K ATFY+P +P G+ L HY Q +
Sbjct: 1 GTGFATGHISLGEVDVVQVATFEEVWNALEGGVANKGATFYKPVDVPVGYSILSHYAQNN 60
Query: 95 SRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFW 154
SR G+ +D + S + +S+LF PAL+ P+DY L+W S + G G + W
Sbjct: 61 SRARAGW----KDSSKSIVDFV-SSHLFAGPALKSPVDYKLIWESSKW-TGKKNGNGWVW 114
Query: 155 LPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSS---PFS 211
LP P+GY +G++VT T NKP EV CVR DLTD E+ F PFS
Sbjct: 115 LPIAPEGYSVLGYIVTNTENKPTFSEVTCVRTDLTDILEIDSKFFTTSGTSRQEIGFPFS 174
Query: 212 VWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRN 271
VW+TRP RG+ GV+VG+FFC + + L IACLKN+ L +P Q++++ +
Sbjct: 175 VWTTRPAVRGVSAAGVNVGSFFCDRSDQAETSLPIACLKNVLFNLGYLPTMAQVNSIQQF 234
Query: 272 YGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGRNDGEFWID 330
+GP+V FHP E +LPSSV+WFF +GA+LY + ++ GSNLP GG NDG FWID
Sbjct: 235 FGPSVIFHPTEQFLPSSVAWFFDSGAMLYTKDSTIPPVRVNSDGSNLPQGGTNDGAFWID 294
Query: 331 LPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSK 390
LP DG R VK GN+E+AK+++H+KP G T+TD+ W+FCPFNGP T KVG +N+ +
Sbjct: 295 LPKDGSRDTVKKGNLETAKVHIHLKPVYGATYTDLQSWLFCPFNGPSTAKVGKVNIPLGR 354
Query: 391 IGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHAS 450
+G+HVGDWEH+TLR+ NFTGE+ +IYFSQHS G+WV +DLE+ NK +VY +KNGH
Sbjct: 355 LGEHVGDWEHYTLRVSNFTGEMEAIYFSQHSKGQWVNIWDLEFDAVNKPLVYVAKNGHPC 414
Query: 451 FPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGE----GVVAEPSWLQF 506
+PH G +LQG E G+G+RND A S + ++SS+++++ +A+Y+ + EP+WLQ+
Sbjct: 415 YPHQGDHLQGDESRGVGLRNDTALSTIVLNSSVKFQVFSADYMHQRGDKDAPEEPAWLQY 474
Query: 507 MRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGD 566
MR+WGP I YD K LDK +K LP +R S+E + KLP E+ GEEGPTGPKEKN W GD
Sbjct: 475 MREWGPKIEYDKKKYLDKFLKFLPSKLRNSLEEILDKLPSEVMGEEGPTGPKEKNMWFGD 534
Query: 567 ER 568
ER
Sbjct: 535 ER 536
>gi|168060126|ref|XP_001782049.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666460|gb|EDQ53113.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 560
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 254/546 (46%), Positives = 351/546 (64%), Gaps = 20/546 (3%)
Query: 31 TWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHY 90
+W G GFA+GRI+LGE++V +++ F +WS + K ATFY P +P G+ L HY
Sbjct: 26 SWYAGTGFATGRISLGEVDVVQVATFEEVWSALEGGVENKGATFYRPVEVPAGYSILCHY 85
Query: 91 CQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGC 150
Q +SRP G++L ++ + + S AL+ P+D+ L+W S +G G
Sbjct: 86 AQNNSRPRAGWILAIKE-----------NGMTSSSALKSPVDFRLIWSSSNW-KGKKNGD 133
Query: 151 AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSS-- 208
+ WLP P+GY +G++ T T KP EV CVR DLTD VH F S
Sbjct: 134 VWVWLPVAPEGYSVLGYITTNTEKKPATSEVTCVRTDLTDTLAVHRKFFTTSGTTKSEFG 193
Query: 209 -PFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHA 267
PFS W+TRP RG+ GV+VG+FFC+ + + +L IACLKN+ L MP +Q+
Sbjct: 194 FPFSTWTTRPAVRGVSSAGVNVGSFFCARSDDAETDLPIACLKNVAFSLDYMPTMEQVKT 253
Query: 268 LIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY-KAGDLVGEAIDPSGSNLPSGGRNDGE 326
+ + +GPTV FHP E +LPSSV+WFF +GA+LY K + I GSNLP+GG NDG
Sbjct: 254 IHQFFGPTVVFHPAETFLPSSVAWFFDSGAMLYSKDSSIPPVRIKSDGSNLPTGGTNDGA 313
Query: 327 FWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNV 386
FWIDLP DG R VK GN+ESA+ Y H+KP G T+TD+ W+F PFNGP T KVG +N+
Sbjct: 314 FWIDLPEDGSRDSVKMGNLESAEAYTHLKPVYGATYTDLQSWLFFPFNGPSTAKVGKINI 373
Query: 387 AFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKN 446
++G+HVGDWEH+TLR+ NFTGEL ++YFSQHS G+WV A+DLE+ NK +VY +K+
Sbjct: 374 PLGRMGEHVGDWEHYTLRVSNFTGELEAVYFSQHSKGEWVLAWDLEFNSANKPLVYVAKS 433
Query: 447 GHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGE----GVVAEPS 502
GH +PH G +LQG + G+G+RND A S +DSS+++ + +A+Y+ + EP+
Sbjct: 434 GHPCYPHQGDHLQGDQEHGVGLRNDTALSTHLLDSSVKFRVFSADYMHQRGDPDAPVEPT 493
Query: 503 WLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNN 562
WL +MR WGP IVYD K + K++ L+P M+ S+ + + KLP E+ G+EGPTGP+EKN
Sbjct: 494 WLHYMRPWGPKIVYDRKKNVTKMLGLMPTMLHGSLGDVLGKLPHEVMGQEGPTGPREKNM 553
Query: 563 WVGDER 568
W GDER
Sbjct: 554 WFGDER 559
>gi|449467110|ref|XP_004151268.1| PREDICTED: uncharacterized protein LOC101218337 [Cucumis sativus]
Length = 553
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 258/551 (46%), Positives = 361/551 (65%), Gaps = 21/551 (3%)
Query: 22 TFSLPAPLPTWPQGQ-GFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
TF+ P+PLP +PQG+ FA G I+LG +++ IS FN IW+ + ATF+EP+
Sbjct: 19 TFNFPSPLPPFPQGKSAFAGGVIDLGGGLKIRLISSFNKIWTTHDGGPSNLGATFFEPSP 78
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
+P GF+SLGHYCQ +++P +LV RD + AL+KP+D+TLVW S
Sbjct: 79 LPQGFFSLGHYCQPNNKPFFARILVGRD------------DSLAGDALKKPVDFTLVWTS 126
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
E + + W P PPDGY+++G +VT + KP +D VRCVR DLT++ E I+
Sbjct: 127 -EKSNIKRDTDGYIWSPTPPDGYRAVGHVVTTSSVKPSVDRVRCVRADLTEQSEKETWIW 185
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+ F+++S RP R + GVSVGTF + L + CL+N AM
Sbjct: 186 GLKDSIDENGFNIFSFRPTRRDITAAGVSVGTFVALP--ATNSPLPLLCLRNSASISAAM 243
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY-KAGDLVGEAIDPSGSNLP 318
P+ QI L R Y P ++FHP E +LPSSV+W+F+NGALLY K+ + IDP+G+NLP
Sbjct: 244 PDVSQISTLFRAYAPLIYFHPKEKFLPSSVNWYFSNGALLYNKSVESKPVPIDPNGTNLP 303
Query: 319 SGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPG 377
GG+NDG FW+DLP DGG ++ VKHG+++S ++Y+ +KP +GGTFTDI +W+F PFNGP
Sbjct: 304 QGGQNDGGFWLDLPIDGGAKERVKHGDLQSCQVYLQIKPMIGGTFTDITIWIFFPFNGPA 363
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN 437
T KVGI+++ F KIG+HVGDWEH TLRI NFTGELW +YF+QHS G+W+ A LE+ +GN
Sbjct: 364 TAKVGIIDIPFRKIGEHVGDWEHITLRISNFTGELWKVYFAQHSKGEWIDASSLEFEKGN 423
Query: 438 KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGV 497
K + YSS NGHAS+ PG +QG IG++N+ A+S L +D+ +A EYL E
Sbjct: 424 KVVAYSSLNGHASYSKPGLVMQGGG--EIGLKNETAKSGLVLDTGASSVEIATEYLREEA 481
Query: 498 VAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGP 557
V EP+WL + R+WGP I Y E++K+ KLLP ++ + + +++LP E+ G+EGPTGP
Sbjct: 482 VTEPAWLNYFRQWGPKIEYQIAEEMEKVEKLLPGRLKEAFKQFMNRLPDEILGQEGPTGP 541
Query: 558 KEKNNWVGDER 568
K K++W GDER
Sbjct: 542 KLKDSWNGDER 552
>gi|224121106|ref|XP_002330905.1| predicted protein [Populus trichocarpa]
gi|222872727|gb|EEF09858.1| predicted protein [Populus trichocarpa]
Length = 587
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 268/579 (46%), Positives = 359/579 (62%), Gaps = 52/579 (8%)
Query: 21 GTFSLPAPLPTWPQGQ-----------------------------GFASGRINLGE-IEV 50
+F LPAPLP+WP G+ GF SG I+LG+ ++V
Sbjct: 28 ASFKLPAPLPSWPPGRFLFLFPFWSFLPSLMLSHCLYRPANLVCGGFGSGIIDLGDGLQV 87
Query: 51 CRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLAS 110
C+IS FN +W+ + A+F+EP+ +P GF LG Y Q ++R L G+VL RD
Sbjct: 88 CQISSFNKVWATHEGGPDDLGASFFEPSQLPQGFSMLGCYSQPNNRSLYGWVLAGRDETG 147
Query: 111 SEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVT 170
S AL++P+DYTLVW S E Q +G + WLP PPDGYK++G +VT
Sbjct: 148 S--------------ALKQPIDYTLVW-SSESLQIKQDGVGYIWLPTPPDGYKALGHVVT 192
Query: 171 KTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVG 230
K+P KP L ++RCVR DLT++CE ++ + + F+V+S P NRG GV VG
Sbjct: 193 KSPQKPPLGKIRCVRSDLTEQCEFDSWVWGLGKESDLNGFNVFSLFPSNRGTQAMGVCVG 252
Query: 231 TFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVS 290
TF + ++++CLKN+ L MPN DQI A+ + Y P ++FHPDE YLPSSV
Sbjct: 253 TFVAQKT--TTAPVSLSCLKNVVSNLSCMPNLDQIKAIFQAYSPWIYFHPDEEYLPSSVE 310
Query: 291 WFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGRNDGEFWIDLP-SDGGRQIVKHGNMESA 348
W+F NGALLY+ G+ I+ +GSNLP GG NDG +W+DLP +G + VK G+++ +
Sbjct: 311 WYFVNGALLYERGEESKPVPIESNGSNLPQGGSNDGAYWLDLPVEEGAKDRVKKGDLQDS 370
Query: 349 KLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNF 408
++Y+H+KP G FTDIV+WVF PFNGP KV +N+ KIG+HVGDWEH TLRI NF
Sbjct: 371 RVYLHIKPMFGAAFTDIVVWVFYPFNGPSKAKVEFINIPLGKIGEHVGDWEHLTLRISNF 430
Query: 409 TGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGV 468
GEL SIYFS+HSGG WV + +LE+ GNKA+ YSS +GHA + PG LQGS IG+
Sbjct: 431 NGELLSIYFSEHSGGTWVNSSELEFQNGNKAVTYSSLHGHAMYAKPGLVLQGSG--SIGI 488
Query: 469 RNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKL 528
RND A+S ++D+ +VAAEYLG + EP WL + RKWGP + YD E+ K+ KL
Sbjct: 489 RNDTAKSKKFIDTGTNSLVVAAEYLGMAIT-EPPWLNYFRKWGPKLTYDIAEEIKKVEKL 547
Query: 529 LPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
LP ++ + + V LP E+ GEEGPTGPK K NW GDE
Sbjct: 548 LPGKLKSAFDKFVRSLPSEVLGEEGPTGPKLKRNWTGDE 586
>gi|449501619|ref|XP_004161418.1| PREDICTED: uncharacterized protein LOC101228858 [Cucumis sativus]
Length = 553
Score = 514 bits (1323), Expect = e-143, Method: Compositional matrix adjust.
Identities = 257/551 (46%), Positives = 361/551 (65%), Gaps = 21/551 (3%)
Query: 22 TFSLPAPLPTWPQGQ-GFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAG 79
TF+ P+PLP +PQG+ FA G I+LG +++ IS FN IW+ + ATF+EP+
Sbjct: 19 TFNFPSPLPPFPQGKSAFAGGVIDLGGGLKIRLISSFNKIWTTHDGGPSNLGATFFEPSP 78
Query: 80 IPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCS 139
+P GF+SLGHYCQ +++P +LV RD + AL+KP+D+TLVW S
Sbjct: 79 LPQGFFSLGHYCQPNNKPFFARILVGRD------------DSLAGDALKKPVDFTLVWTS 126
Query: 140 DEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
E + + W P PPDGY+++G +VT + KP +D VRCVR DLT++ E I+
Sbjct: 127 -EKSNIKRDTDGYIWSPTPPDGYRAVGQVVTTSSVKPSVDRVRCVRADLTEQSEKETWIW 185
Query: 200 DAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAM 259
+ F++++ RP R + GVSVGTF + L + CL+N AM
Sbjct: 186 GLKDSIDENGFNIFNFRPTRRDITAAGVSVGTFVALP--ATNSPLPLLCLRNSASISAAM 243
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY-KAGDLVGEAIDPSGSNLP 318
P+ QI L R Y P ++FHP E +LPSSV+W+F+NGALLY K+ + IDP+G+NLP
Sbjct: 244 PDVSQISTLFRAYAPLIYFHPKEKFLPSSVNWYFSNGALLYNKSVESKPVPIDPNGTNLP 303
Query: 319 SGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPG 377
GG+NDG FW+DLP DGG ++ VKHG+++S ++Y+ +KP +GGTFTDI +W+F PFNGP
Sbjct: 304 QGGQNDGGFWLDLPIDGGAKERVKHGDLQSCQVYLQIKPMIGGTFTDITIWIFFPFNGPA 363
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN 437
KVGI+++ F KIG+HVGDWEH TLRI NFTGELW +YF+QHS G+W+ A LE+ +GN
Sbjct: 364 RAKVGIIDIPFRKIGEHVGDWEHITLRISNFTGELWKVYFAQHSKGEWIDASSLEFEKGN 423
Query: 438 KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGV 497
K + YSS NGHAS+ PG +QG IG++N+ A+S L +D+ +A EYL E
Sbjct: 424 KVVAYSSLNGHASYSKPGLVMQGGG--EIGLKNETAKSGLVLDTGASSVEIATEYLREEA 481
Query: 498 VAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGP 557
VAEP+WL + R+WGP I Y E++K+ KLLP ++ + + +++LP E+ G+EGPTGP
Sbjct: 482 VAEPAWLNYFRQWGPKIEYQIAEEMEKVEKLLPGRLKEAFKQFMNRLPDEILGQEGPTGP 541
Query: 558 KEKNNWVGDER 568
K K++W GDER
Sbjct: 542 KLKDSWNGDER 552
>gi|449531402|ref|XP_004172675.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218097
[Cucumis sativus]
Length = 560
Score = 510 bits (1313), Expect = e-142, Method: Compositional matrix adjust.
Identities = 267/563 (47%), Positives = 363/563 (64%), Gaps = 27/563 (4%)
Query: 17 PTEP--GTFSLPAP-LPTWPQGQ----GFASGRINLGE-IEVCRISRFNFIWSCNLLQSK 68
PT P FS P+P L T + GFASG I+LG + VC+IS FN IW+
Sbjct: 12 PTLPIDSKFSFPSPHLATHSETNTVNGGFASGTIDLGGGLHVCQISSFNKIWAARQGGPD 71
Query: 69 KKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQ 128
ATF+EP +P+GF+ LG++CQ + L G VL +D S + AL+
Sbjct: 72 NLGATFFEPNSLPEGFFVLGYFCQSNKNALFGSVLAGKDNGSDGED-----------ALK 120
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
KP+DYTLVW S E + +G + W P PPDGY+++G +VT +P KP +D++RCVR +L
Sbjct: 121 KPVDYTLVW-STESSKIKRDGNGYIWSPTPPDGYRAVGHVVTASPEKPSVDKIRCVRSEL 179
Query: 189 TDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQEL--NI 246
T++CE I+ + + F+++S+RP NRG+ G GVS G F +G +
Sbjct: 180 TEECENEAWIWGPMKSRDENGFNIYSSRPKNRGIPGTGVSTGGFLALPAPTTGNSPLPQL 239
Query: 247 ACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY-KAGDL 305
CLKNL+ AMP+ QI +L + Y P ++FHP E YLPSSV WFF+NGALLY K+ +
Sbjct: 240 FCLKNLNSISAAMPDLSQIDSLYQAYSPIIYFHPKEKYLPSSVDWFFSNGALLYDKSNES 299
Query: 306 VGEAIDPSGSNLPSGGRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAVGGTFTD 364
I+P G NLP GG NDG+FW++LP+D G++ +K G+++S + Y+HVKP +GGTFTD
Sbjct: 300 NPVPINPDGLNLPQGGSNDGQFWLNLPTDEEGKEKLKKGDLQSCRGYLHVKPMIGGTFTD 359
Query: 365 IVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK 424
I W+F PFNGP T KVGI+++ F+KIG+H+GDWEH TLRI NFTGEL +YF+QHS G+
Sbjct: 360 IATWIFFPFNGPATAKVGIIDIPFTKIGEHIGDWEHITLRISNFTGELRRVYFAQHSKGE 419
Query: 425 WVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQ 484
WV LE+ +GNK + YSS NGHAS+ PG LQG+ IG+RN+ A+S L VD+
Sbjct: 420 WVDPPSLEFEKGNKVVAYSSLNGHASYSKPGLVLQGA--AEIGIRNETAKSGLVVDTGTN 477
Query: 485 YELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKL 544
Y ++ AEYL EG V EP+W+ + R+WGP I Y E++K+ LLP ++ V KL
Sbjct: 478 YLVIGAEYL-EGXVVEPAWVNYTREWGPRIEYPIVEEIEKVENLLPGRLKEGFRGFVKKL 536
Query: 545 PLELYGEEGPTGPKEKNNWVGDE 567
P E+ GEEGPTGPK KN+W GDE
Sbjct: 537 PDEIRGEEGPTGPKMKNSWNGDE 559
>gi|357451131|ref|XP_003595842.1| hypothetical protein MTR_2g062430 [Medicago truncatula]
gi|124360092|gb|ABN08108.1| Protein of unknown function DUF946, plant [Medicago truncatula]
gi|355484890|gb|AES66093.1| hypothetical protein MTR_2g062430 [Medicago truncatula]
Length = 561
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 264/548 (48%), Positives = 356/548 (64%), Gaps = 25/548 (4%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
TF LP WP G GFA+G I+LG ++V ++S FN +W+ + AT +EP GIP
Sbjct: 36 TFKLPVAPNAWPPGSGFANGIIDLGGLQVSQVSTFNKVWATYDGGPDNQGATIFEPTGIP 95
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG Y Q +++PL G+VLVA+D++SS L+KP+DYTLV +
Sbjct: 96 QGFSMLGCYSQPNNKPLFGYVLVAKDVSSSTTNST----------LKKPIDYTLV-LNTS 144
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ N + + WLP PDGYK++G +VT T +KP LD++ CVR DLTD+CE I+
Sbjct: 145 TIKVNQDTTCYIWLPIAPDGYKALGHVVTATQDKPSLDKIMCVRSDLTDQCESSSWIW-- 202
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
S+ F+ + RP NRG GV VG F + N + +I+CLKNL+ MPN
Sbjct: 203 ----GSNDFNFYDVRPINRGTQAPGVHVGAF-VAQNGGTNIPPSISCLKNLNSISKIMPN 257
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSG 320
QI A+++ Y P ++ HPDE YLPSSV+WFF+NGALLYK G+ I +G NLP
Sbjct: 258 LVQIDAILKVYSPLMYLHPDEEYLPSSVNWFFSNGALLYKKGEESNPVPIAQNGINLPQD 317
Query: 321 GRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
NDG +W+DLPSD ++ VK GN++SA+ YVHVKP +GGTFTDI MW+F PFNGPG
Sbjct: 318 PNNDGIYWLDLPSDDANKERVKKGNLQSAESYVHVKPMLGGTFTDIAMWIFYPFNGPGRA 377
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KV +NV KIG+H+GDWEH TLR+ N G+LW +YFSQHS G+W+ + LE+ + N+
Sbjct: 378 KVKFINVKLGKIGEHIGDWEHVTLRVSNLDGKLWHLYFSQHSKGEWIDSSQLEF-QSNRP 436
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
+ YSS +GHAS+PH G LQG GIG+RND+A+S + +D + ++ L++A+YLG V+
Sbjct: 437 VFYSSLHGHASYPHEGLVLQGKN--GIGIRNDSAKSEMILDLT-KFVLISADYLGSFVI- 492
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
EP WL + RKWGP I Y+ EL K+ K+LP ++ EN + LP E+ GEEGPTGPK
Sbjct: 493 EPPWLHYFRKWGPKIDYNIDEELRKVEKILPGKLKDIFENIIRSLPEEVLGEEGPTGPKL 552
Query: 560 KNNWVGDE 567
KNNW GDE
Sbjct: 553 KNNWSGDE 560
>gi|449467108|ref|XP_004151267.1| PREDICTED: uncharacterized protein LOC101218097 [Cucumis sativus]
Length = 560
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 264/563 (46%), Positives = 358/563 (63%), Gaps = 27/563 (4%)
Query: 17 PTEP--GTFSLPAP-LPTWPQGQ----GFASGRINLGE-IEVCRISRFNFIWSCNLLQSK 68
PT P FS P+P L T + GFASG I+LG + VC+IS FN IW+
Sbjct: 12 PTLPIDSKFSFPSPHLATHSETNTVNGGFASGTIDLGGGLHVCQISSFNKIWAARQGGPD 71
Query: 69 KKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQ 128
ATF+EP +P+GF+ LG++CQ + L G VL +D S + AL+
Sbjct: 72 NLGATFFEPNSLPEGFFVLGYFCQSNKNALFGSVLAGKDNGSDGED-----------ALK 120
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
KP+DYTLVW S E + +G + W P PPDGY+++G +VT +P KP +D++RCVR +L
Sbjct: 121 KPVDYTLVW-STESSKIKRDGNGYIWSPTPPDGYRAVGHVVTASPEKPSVDKIRCVRSEL 179
Query: 189 TDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQEL--NI 246
T++CE I+ + + F+++S+RP NRG+ G GVS G F +G +
Sbjct: 180 TEECENEAWIWGPMKSRDENGFNIYSSRPKNRGITGTGVSTGGFLALPAPTTGNSPLPQL 239
Query: 247 ACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY-KAGDL 305
CLKNL+ AMP+ QI +L + Y P ++FHP E YLPSSV WFF+NGALLY K+ +
Sbjct: 240 FCLKNLNSISAAMPDLSQIDSLYQAYSPIIYFHPKEKYLPSSVDWFFSNGALLYDKSNES 299
Query: 306 VGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKH-GNMESAKLYVHVKPAVGGTFTD 364
I+P G NLP GG NDG+FW++LP+D + G+++S + Y+HVKP +GGTFTD
Sbjct: 300 NPVPINPDGLNLPQGGSNDGQFWLNLPTDEEEKEKLKKGDLQSCRGYLHVKPMIGGTFTD 359
Query: 365 IVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK 424
I W+F PFNGP T KVGI+++ F+KIG+H+GDWEH TLRI NFTGEL +YF+QHS G+
Sbjct: 360 IATWIFFPFNGPATAKVGIIDIPFTKIGEHIGDWEHITLRISNFTGELRRVYFAQHSKGE 419
Query: 425 WVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQ 484
WV L + +GNK + YSS NGHAS+ PG LQG+ IG+RN+ A+S L VD+
Sbjct: 420 WVDPPSLGFEKGNKVVAYSSLNGHASYSKPGLVLQGAA--EIGIRNETAKSGLVVDTGTN 477
Query: 485 YELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKL 544
Y ++ AEYL EG V EP+W+ + R+WGP I Y E++K+ LLP ++ V KL
Sbjct: 478 YLVIGAEYL-EGAVVEPAWVNYTREWGPRIEYPIVEEIEKVENLLPGRLKEGFRGFVKKL 536
Query: 545 PLELYGEEGPTGPKEKNNWVGDE 567
P E+ GEEGPTGPK KN+W GDE
Sbjct: 537 PDEIRGEEGPTGPKMKNSWNGDE 559
>gi|356555260|ref|XP_003545952.1| PREDICTED: uncharacterized protein LOC100820073 [Glycine max]
Length = 581
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 262/549 (47%), Positives = 348/549 (63%), Gaps = 25/549 (4%)
Query: 23 FSLPAPLP-TWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
F LP + +WP G FASG I+LG +++ S FN +W + + +EP+GIP
Sbjct: 55 FKLPVHVTNSWPPGGNFASGTIDLGGLQLYEASTFNKVWGTYSGGPDDRGFSIFEPSGIP 114
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG Y Q +++PL G+VLVA+D+++ +TSN P+L++PLDYTLVW S
Sbjct: 115 QGFSMLGSYSQPNNKPLFGYVLVAKDVST------NTSN----PSLKQPLDYTLVWNS-A 163
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ + +G + WLP P GYK++G++VT TP KP LD++RC R DLTD+CE + I+
Sbjct: 164 SLKIDQDGPIYVWLPTAPQGYKAVGYVVTTTPTKPSLDKIRCARLDLTDQCEANSFIW-- 221
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
S F+ + RP NRG GV VGTF + S +I CL+N + MPN
Sbjct: 222 ----GSDNFNFYDVRPSNRGTQAPGVRVGTFVAQNG--SPNPPSIVCLRNTNAIPKYMPN 275
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGE-AIDPSGSNLPSG 320
QI A+++ Y P + HPDE + PSSV WFF+NGALLYK G +I P+G+NLP
Sbjct: 276 LPQIKAILQVYSPVMSLHPDEEFFPSSVEWFFSNGALLYKKGQESKPVSISPNGANLPQD 335
Query: 321 GRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
DG +W+DLP+D ++ VK G+++SA YVHVKP +GGTFTDI MWVF PFNGP
Sbjct: 336 PNIDGAYWVDLPADSTNKERVKKGDLKSAISYVHVKPMLGGTFTDIAMWVFYPFNGPARA 395
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
KV + V KIG+HVGDWEH TLR+ NF GEL +YFSQHS G W + LE+ GNK
Sbjct: 396 KVEFLTVNLGKIGEHVGDWEHVTLRVSNFNGELKHVYFSQHSKGVWFDSSQLEFQSGNKP 455
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
+ YSS +GHAS+PH G L G + IG+RND A S+ +D ++LV+AEYLG VV
Sbjct: 456 LYYSSLHGHASYPHGGLNLLGED--KIGIRNDTAISDNVMDLG-AFQLVSAEYLGSDVVE 512
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
P WL + R+WGP I Y+ EL K+ K LP ++ ++EN V LP E+ GEEGPTGPK
Sbjct: 513 PPPWLNYFREWGPKIDYNVNDELRKLEKFLPGKLKSTLENIVKNLPSEVLGEEGPTGPKV 572
Query: 560 KNNWVGDER 568
K+NW GDER
Sbjct: 573 KDNWSGDER 581
>gi|18406495|ref|NP_030964.1| uncharacterized protein [Arabidopsis thaliana]
gi|15293039|gb|AAK93630.1| unknown protein [Arabidopsis thaliana]
gi|20197184|gb|AAC16076.2| expressed protein [Arabidopsis thaliana]
gi|22136978|gb|AAM91718.1| unknown protein [Arabidopsis thaliana]
gi|330255303|gb|AEC10397.1| uncharacterized protein [Arabidopsis thaliana]
Length = 553
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 252/550 (45%), Positives = 356/550 (64%), Gaps = 31/550 (5%)
Query: 23 FSLPAPLPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
F P+PLPT+ +G GFA G I+LG +EV ++S FN +WS ATF+EP+ IP
Sbjct: 29 FKFPSPLPTFTRGDGFAKGTIDLGGGLEVSQVSTFNKVWSTYEGGPDNLGATFFEPSSIP 88
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG+Y Q ++R L G+VL ARDL+S+ L+ P+DYTLV + E
Sbjct: 89 SGFSILGYYAQPNNRNLFGWVLTARDLSSN--------------TLKPPVDYTLV-GNTE 133
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ +G +FW P PPDGY+++G +VT KP LD++RC+R DLT++CE I+
Sbjct: 134 SLKIKQDGTGYFWQPVPPDGYQAVGLIVTNYSQKPPLDKLRCIRSDLTEQCEADTWIW-- 191
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
++ ++ + +P RG GV VGTF + + +++CLKN MPN
Sbjct: 192 ----GTNGVNISNLKPTTRGTQATGVYVGTFTWQTQ--NSSPPSLSCLKNTKLDFSTMPN 245
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGE-AIDPSGSNLPSG 320
QI L + + P ++FHPDE YLPSSV+W+F NGALLYK G+ I+ +GSNLP G
Sbjct: 246 GSQIEELFQTFSPCIYFHPDEEYLPSSVTWYFNNGALLYKKGEESKPIPIESNGSNLPQG 305
Query: 321 GRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
G NDG +W+DLP D G++ VK G+++S K+Y+H+KP +G TFTDI +W+F PFNGP
Sbjct: 306 GSNDGSYWLDLPIDKNGKERVKKGDLQSTKVYLHIKPMLGATFTDISIWIFYPFNGPAKA 365
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG--N 437
KV +N+ +IG+H+GDWEH TLRI NFTGELW ++ SQHSGG W+ A DLE+ +G N
Sbjct: 366 KVKFVNLPLGRIGEHIGDWEHTTLRISNFTGELWRVFLSQHSGGIWIDACDLEFQDGGNN 425
Query: 438 KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGV 497
K + Y+S +GHA +P PG LQG + G+G+RND + +D+ + YE++AAEY G GV
Sbjct: 426 KFVAYASLHGHAMYPKPGLVLQGDD--GVGIRNDTGKGKKVLDTGLGYEVIAAEYDGGGV 483
Query: 498 VAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGP 557
V EP W+++ RKWGP I Y+ E+ + ++LP +++ + V K+P E+YGE+GPTGP
Sbjct: 484 V-EPPWVKYFRKWGPKIDYNVDDEVKSVERILPGLLKKAFVKFVKKIPDEVYGEDGPTGP 542
Query: 558 KEKNNWVGDE 567
K K+NW GDE
Sbjct: 543 KLKSNWAGDE 552
>gi|18406488|ref|NP_030961.1| uncharacterized protein [Arabidopsis thaliana]
gi|3128170|gb|AAC16074.1| expressed protein [Arabidopsis thaliana]
gi|15450387|gb|AAK96487.1| At2g44230/F4I1.4 [Arabidopsis thaliana]
gi|20466109|gb|AAM19976.1| At2g44230/F4I1.4 [Arabidopsis thaliana]
gi|110741157|dbj|BAE98671.1| hypothetical protein [Arabidopsis thaliana]
gi|330255300|gb|AEC10394.1| uncharacterized protein [Arabidopsis thaliana]
Length = 542
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 247/551 (44%), Positives = 354/551 (64%), Gaps = 33/551 (5%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
TF+LP+PLP+WP G+GFA GRI+LG +EV ++ FN +W+ ATF+EP+ +P
Sbjct: 19 TFNLPSPLPSWPSGEGFAKGRIDLGGLEVSQVDTFNKVWTVYEGGQDNLGATFFEPSSVP 78
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
+GF LG Y Q ++R L G+ LV +DL+ +L+ P+DY L+W
Sbjct: 79 EGFSILGFYAQPNNRKLFGWTLVGKDLSGD--------------SLRPPVDYLLLWSGKS 124
Query: 142 GG-QGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFD 200
+ N +FW P PPDGY ++G +VT + KP LD++RCVR DLTD+ E LI++
Sbjct: 125 TKVENNKVETGYFWQPVPPDGYNAVGLIVTTSDEKPPLDKIRCVRSDLTDQSEPDALIWE 184
Query: 201 AISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMP 260
++ FSV S++P NRG GVSVGTFF +S + CLKN + MP
Sbjct: 185 ------TNGFSVSSSKPVNRGTQASGVSVGTFFSNS-----PNPALPCLKNNNFDFSCMP 233
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPS 319
+ QI AL + Y P ++FH DE YLPSSV+WFF+NGALLYK GD ++P+G NLP
Sbjct: 234 SKPQIDALFQTYAPWIYFHKDEKYLPSSVNWFFSNGALLYKKGDESNPVPVEPNGLNLPQ 293
Query: 320 GGRNDGEFWIDLP-SDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
G NDG +W+DLP + R+ V+ G+++S ++Y+H+KP GGTFTDI +W+F PFNGP
Sbjct: 294 GEFNDGLYWLDLPVASDARKRVQCGDLQSMEVYLHIKPVFGGTFTDIAVWMFYPFNGPSR 353
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYI-EGN 437
K+ ++ +IG+H+GDWEHFTLRI NF+G+L +Y SQHSGG W A ++E+ GN
Sbjct: 354 AKLKAASIPLGRIGEHIGDWEHFTLRISNFSGKLHRMYLSQHSGGSWADASEIEFQGGGN 413
Query: 438 KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGV 497
K + Y+S NGHA + PG LQG + +G+RND +S +D+++++ +VAAEY+ G
Sbjct: 414 KPVAYASLNGHAMYSKPGLVLQGKD--NVGIRNDTGKSEKVIDTAVRFRVVAAEYM-RGE 470
Query: 498 VAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLP-LMIRYSVENAVSKLPLELYGEEGPTG 556
+ EP+WL +MR WGP I Y + E+ + K++ ++ + +A+ LP E++GEEGPTG
Sbjct: 471 LEEPAWLNYMRHWGPKIDYGHENEIRGVEKIMVGESLKTTFRSAIKGLPNEVFGEEGPTG 530
Query: 557 PKEKNNWVGDE 567
PK K NW+GDE
Sbjct: 531 PKLKRNWLGDE 541
>gi|356528609|ref|XP_003532892.1| PREDICTED: uncharacterized protein LOC100783773 [Glycine max]
Length = 579
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 253/549 (46%), Positives = 343/549 (62%), Gaps = 27/549 (4%)
Query: 23 FSLPAPLP-TWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
F LPAP+ +WP G FASG I+LG ++VC S FN +W + + +EP+G+P
Sbjct: 55 FKLPAPVTNSWPPGGNFASGTIDLGGLQVCEASTFNKVWGTYGGGPDDQGFSIFEPSGVP 114
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG Y Q +++PL G+VLVA+D+++ +TSN P+L++PLDYTLVW S
Sbjct: 115 KGFSMLGSYSQPNNKPLSGYVLVAKDVST------NTSN----PSLKQPLDYTLVWNS-A 163
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ + +G + WLP P+GYK++G +VT TP KP L+++ CVR DLTD+CE + I+D
Sbjct: 164 SLEIDQDGPIYVWLPIAPNGYKAVGHVVTTTPTKPSLEKIMCVRLDLTDQCETNSFIWD- 222
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
S F+ RP NRG+ GV VGTF + S +IACL+N + MPN
Sbjct: 223 -----SDNFNFLDVRPSNRGIQAPGVRVGTFVAQN--ASLNSPSIACLRNTNAIPKYMPN 275
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSG 320
QI A+++ Y P ++ HPDE + PSSV WFF+NGALLYK G I P+G+NLP
Sbjct: 276 LPQIKAILQVYSPCLYLHPDEEFFPSSVEWFFSNGALLYKKGQESSPVPISPNGANLPQD 335
Query: 321 GRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
+DG +W+DLP+D + VK G+++SA YVHVKP +GGTFTDI MWVF PFNG
Sbjct: 336 PNDDGAYWVDLPADSTNKDRVKKGDLKSAISYVHVKPMLGGTFTDISMWVFYPFNGAARA 395
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
V + + KIG+HVGDWEH TLR+ NF GEL +YF QH G W + LE+ GNK
Sbjct: 396 IVDFLTIDLGKIGEHVGDWEHVTLRVSNFNGELKHVYFLQHRKGAWFDSSQLEFQSGNKP 455
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA 499
+ YSS + HAS+PH + IG+RND A S+ +D ++LV+AEYL VV
Sbjct: 456 LYYSSLHAHASYPHAARNITDK----IGIRNDTAMSDSVMDLG-AFQLVSAEYLVSDVVE 510
Query: 500 EPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKE 559
P WL + R+WGP I Y+ EL K+ K LP ++ ++EN V LP E+ +EGPTGPK
Sbjct: 511 PPPWLNYFREWGPKIYYNVDEELKKLEKFLPGKLKSTLENIVKNLPSEVLRQEGPTGPKV 570
Query: 560 KNNWVGDER 568
K++W GDER
Sbjct: 571 KDSWSGDER 579
>gi|124360094|gb|ABN08110.1| Protein of unknown function DUF946, plant [Medicago truncatula]
Length = 556
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 253/552 (45%), Positives = 342/552 (61%), Gaps = 28/552 (5%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
TF LP +WP+G GFA G ++LG ++V + S F+ +W + A+ YEP GIP
Sbjct: 26 TFKLPIAQTSWPKGNGFAGGFLDLGGLQVYQASTFSKVWGAYEGGLDNQGASVYEPTGIP 85
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG Y Q +++PL G+V VA+D++SS L+ P+DYTLV +
Sbjct: 86 KGFSMLGSYSQPNNKPLFGYVFVAKDVSSSTT----------GRTLKPPVDYTLV-SNTA 134
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ + + WLP P+GYK++G +VT T +KP LD++ CVR DLTD+CE I+
Sbjct: 135 SFKATQDSTLYIWLPIAPNGYKALGHVVTTTQDKPSLDKIMCVRSDLTDQCESSSWIW-- 192
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
S+ F+ ++ RP NRG GV VGTFF N + +I+CLKNL+ MPN
Sbjct: 193 ----GSNNFNFYNVRPINRGTQAPGVRVGTFFAQ-NGGNTNPPSISCLKNLNSISQIMPN 247
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSG 320
QI A+++ Y P ++ H DE YLPSSV+WFF+NGALLYK G+ I +G+NLP
Sbjct: 248 KKQIEAMLQVYSPFLYLHSDEEYLPSSVNWFFSNGALLYKKGEESNPVPIAQNGTNLPQD 307
Query: 321 GRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
DG +WIDLP+D ++ VK GN++SA+ YVHVKP GGTFTDI MWVF PFNGPG
Sbjct: 308 PNTDGAYWIDLPADAANKERVKKGNLQSAESYVHVKPMYGGTFTDIAMWVFYPFNGPGRA 367
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG--- 436
KV +NV KIG+HVGDWEH TLR+ N G+LW +YFSQH+ G W+ + LE+ +G
Sbjct: 368 KVEFLNVKLGKIGEHVGDWEHVTLRVSNLDGQLWHVYFSQHNLGSWIDSSQLEFQDGIGA 427
Query: 437 -NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGE 495
+ +VY+S +GHAS+PH G L G GIG R+D + +D ++ LV+AEYLG
Sbjct: 428 TKRPVVYASLHGHASYPHAGLVLLGKN--GIGARDDTNKGKNVMDMG-KFVLVSAEYLGS 484
Query: 496 GVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPT 555
V EP+WL + R+WGP + Y EL + KLLP ++ E+ + LP E GE GPT
Sbjct: 485 -EVKEPAWLNYFREWGPHVDYSLDDELKNVEKLLPGKLKNGFEDIIRSLPKEALGEAGPT 543
Query: 556 GPKEKNNWVGDE 567
GPK+KNNW GDE
Sbjct: 544 GPKQKNNWSGDE 555
>gi|357451127|ref|XP_003595840.1| hypothetical protein MTR_2g062410 [Medicago truncatula]
gi|355484888|gb|AES66091.1| hypothetical protein MTR_2g062410 [Medicago truncatula]
Length = 586
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 253/552 (45%), Positives = 343/552 (62%), Gaps = 28/552 (5%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
TF LP +WP+G GFA G ++LG ++V + S F+ +W + A+ YEP GIP
Sbjct: 56 TFKLPIAQTSWPKGNGFAGGFLDLGGLQVYQASTFSKVWGAYEGGLDNQGASVYEPTGIP 115
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG Y Q +++PL G+V VA+D++SS L+ P+DYTLV +
Sbjct: 116 KGFSMLGSYSQPNNKPLFGYVFVAKDVSSSTT----------GRTLKPPVDYTLV-SNTA 164
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ + + WLP P+GYK++G +VT T +KP LD++ CVR DLTD+CE I+
Sbjct: 165 SFKATQDSTLYIWLPIAPNGYKALGHVVTTTQDKPSLDKIMCVRSDLTDQCESSSWIW-- 222
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
S+ F+ ++ RP NRG GV VGTFF + N + +I+CLKNL+ MPN
Sbjct: 223 ----GSNNFNFYNVRPINRGTQAPGVRVGTFF-AQNGGNTNPPSISCLKNLNSISQIMPN 277
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSG 320
QI A+++ Y P ++ H DE YLPSSV+WFF+NGALLYK G+ I +G+NLP
Sbjct: 278 KKQIEAMLQVYSPFLYLHSDEEYLPSSVNWFFSNGALLYKKGEESNPVPIAQNGTNLPQD 337
Query: 321 GRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
DG +WIDLP+D ++ VK GN++SA+ YVHVKP GGTFTDI MWVF PFNGPG
Sbjct: 338 PNTDGAYWIDLPADAANKERVKKGNLQSAESYVHVKPMYGGTFTDIAMWVFYPFNGPGRA 397
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG--- 436
KV +NV KIG+HVGDWEH TLR+ N G+LW +YFSQH+ G W+ + LE+ +G
Sbjct: 398 KVEFLNVKLGKIGEHVGDWEHVTLRVSNLDGQLWHVYFSQHNLGSWIDSSQLEFQDGIGA 457
Query: 437 -NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGE 495
+ +VY+S +GHAS+PH G L G GIG R+D + +D ++ LV+AEYLG
Sbjct: 458 TKRPVVYASLHGHASYPHAGLVLLGKN--GIGARDDTNKGKNVMDMG-KFVLVSAEYLGS 514
Query: 496 GVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPT 555
V EP+WL + R+WGP + Y EL + KLLP ++ E+ + LP E GE GPT
Sbjct: 515 -EVKEPAWLNYFREWGPHVDYSLDDELKNVEKLLPGKLKNGFEDIIRSLPKEALGEAGPT 573
Query: 556 GPKEKNNWVGDE 567
GPK+KNNW GDE
Sbjct: 574 GPKQKNNWSGDE 585
>gi|326497913|dbj|BAJ94819.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 555
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 257/557 (46%), Positives = 338/557 (60%), Gaps = 36/557 (6%)
Query: 22 TFSLPAPLPTWPQG---QGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPA 78
F LPA +P+WP GFA G I+LG +EV +++ F +WS ATF++P+
Sbjct: 23 AFDLPAAMPSWPSSVSDGGFAKGSIDLGGMEVRQVTTFAKVWSTAQGGQDGLGATFFKPS 82
Query: 79 GIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC 138
+P GF++LGHY Q ++RPL G VLVARD + + A L PLDYTLVW
Sbjct: 83 PVPAGFHALGHYAQPNNRPLFGRVLVARDASGTGA------------LLAPPLDYTLVWS 130
Query: 139 SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLI 198
S + G G FFWLP PPDGYK++G VT T +KP+ EV CVR D TD CE +
Sbjct: 131 SGQDGAG------FFWLPTPPDGYKAVGVAVTATKDKPQPGEVMCVRADFTDVCEAEESV 184
Query: 199 FDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELN-IACLKNLDPKLH 257
+ + FS + RP RG+ RGV GTF SN ++ +ACLKN
Sbjct: 185 WGS----DKDGFSATALRPAVRGVDARGVHTGTFLAQSNVTPAASVSTLACLKNNSAAYT 240
Query: 258 A-MPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSN 316
+ MP+ Q++A++ Y P V+ HP++ YLPSS +WFF NGALLY+ G + GSN
Sbjct: 241 SCMPDLGQVNAVLAAYSPHVYLHPNDPYLPSSATWFFENGALLYQKGSQTPTPVAADGSN 300
Query: 317 LPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNG 375
LP GG ND +W+DLP D G ++ VK G++ SAK YV KP +GGT TD+ +W F PFNG
Sbjct: 301 LPQGGGNDDAYWLDLPVDNGQKERVKKGDLASAKAYVQAKPMLGGTATDLALWFFYPFNG 360
Query: 376 PGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYI- 434
P KVG + + IG+HVGDWEH TLR+ NF+GEL +YFSQHS G WV A LEY+
Sbjct: 361 PARAKVGPLTIPLGMIGEHVGDWEHLTLRVSNFSGELLRMYFSQHSTGAWVDASQLEYLG 420
Query: 435 --EGN-KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAE 491
GN + + Y+S +GHA +P G LQG+ LG+G+RND A+ + S + E+VAAE
Sbjct: 421 DGGGNRRPVAYASLHGHAFYPREGLVLQGNSKLGVGIRNDCAKGSRMDTSGGRCEVVAAE 480
Query: 492 YLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGE 551
YLG G VAEP+WL F R WGP YD E+++ ++LP I+ + V+KL L G
Sbjct: 481 YLGAGKVAEPAWLGFERGWGPREEYDIGREINRAARILPRAIKERLAQLVNKL---LVG- 536
Query: 552 EGPTGPKEKNNWVGDER 568
EGPTGPK K +W DER
Sbjct: 537 EGPTGPKMKGSWRNDER 553
>gi|30689674|ref|NP_850414.1| uncharacterized protein [Arabidopsis thaliana]
gi|330255304|gb|AEC10398.1| uncharacterized protein [Arabidopsis thaliana]
Length = 583
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 246/538 (45%), Positives = 347/538 (64%), Gaps = 31/538 (5%)
Query: 35 GQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQF 93
G GFA G I+LG +EV ++S FN +WS ATF+EP+ IP GF LG+Y Q
Sbjct: 71 GDGFAKGTIDLGGGLEVSQVSTFNKVWSTYEGGPDNLGATFFEPSSIPSGFSILGYYAQP 130
Query: 94 DSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFF 153
++R L G+VL ARDL+S+ L+ P+DYTLV + E + +G +F
Sbjct: 131 NNRNLFGWVLTARDLSSN--------------TLKPPVDYTLV-GNTESLKIKQDGTGYF 175
Query: 154 WLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVW 213
W P PPDGY+++G +VT KP LD++RC+R DLT++CE I+ ++ ++
Sbjct: 176 WQPVPPDGYQAVGLIVTNYSQKPPLDKLRCIRSDLTEQCEADTWIW------GTNGVNIS 229
Query: 214 STRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYG 273
+ +P RG GV VGTF + + +++CLKN MPN QI L + +
Sbjct: 230 NLKPTTRGTQATGVYVGTFTWQTQ--NSSPPSLSCLKNTKLDFSTMPNGSQIEELFQTFS 287
Query: 274 PTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGE-AIDPSGSNLPSGGRNDGEFWIDLP 332
P ++FHPDE YLPSSV+W+F NGALLYK G+ I+ +GSNLP GG NDG +W+DLP
Sbjct: 288 PCIYFHPDEEYLPSSVTWYFNNGALLYKKGEESKPIPIESNGSNLPQGGSNDGSYWLDLP 347
Query: 333 SD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKI 391
D G++ VK G+++S K+Y+H+KP +G TFTDI +W+F PFNGP KV +N+ +I
Sbjct: 348 IDKNGKERVKKGDLQSTKVYLHIKPMLGATFTDISIWIFYPFNGPAKAKVKFVNLPLGRI 407
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG--NKAIVYSSKNGHA 449
G+H+GDWEH TLRI NFTGELW ++ SQHSGG W+ A DLE+ +G NK + Y+S +GHA
Sbjct: 408 GEHIGDWEHTTLRISNFTGELWRVFLSQHSGGIWIDACDLEFQDGGNNKFVAYASLHGHA 467
Query: 450 SFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRK 509
+P PG LQG + G+G+RND + +D+ + YE++AAEY G GVV EP W+++ RK
Sbjct: 468 MYPKPGLVLQGDD--GVGIRNDTGKGKKVLDTGLGYEVIAAEYDGGGVV-EPPWVKYFRK 524
Query: 510 WGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
WGP I Y+ E+ + ++LP +++ + V K+P E+YGE+GPTGPK K+NW GDE
Sbjct: 525 WGPKIDYNVDDEVKSVERILPGLLKKAFVKFVKKIPDEVYGEDGPTGPKLKSNWAGDE 582
>gi|297824437|ref|XP_002880101.1| hypothetical protein ARALYDRAFT_483550 [Arabidopsis lyrata subsp.
lyrata]
gi|297325940|gb|EFH56360.1| hypothetical protein ARALYDRAFT_483550 [Arabidopsis lyrata subsp.
lyrata]
Length = 564
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 250/559 (44%), Positives = 352/559 (62%), Gaps = 38/559 (6%)
Query: 23 FSLPAPLPTWPQG---QGFASGRINLGEI--EVCRISRFNFIWSCNLLQSKKK----SAT 73
F P+P+PT+ QG F + L ++ V R + ++ L++ ++ AT
Sbjct: 29 FKFPSPMPTFSQGLYYHQFLLISLALSQLVMMVLRNEQLTWVVVSKLVKFRRGPDNLGAT 88
Query: 74 FYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDY 133
F+EP+ IP GF LG+Y Q ++R L G+VL ARDL+S+ L+ P+DY
Sbjct: 89 FFEPSSIPSGFCILGYYAQPNNRKLFGWVLTARDLSSN--------------TLKPPVDY 134
Query: 134 TLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCE 193
TLV + E + +G +FW P PPDGY+++G +VT + KP LD++ CVR DLT++CE
Sbjct: 135 TLV-GNTESLKIKQDGTGYFWQPVPPDGYQAVGLIVTNSSQKPSLDKLSCVRSDLTEQCE 193
Query: 194 VHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLD 253
I+ ++ ++ + RP RG GVSVGTF C + S ++CLKN
Sbjct: 194 ADTWIW------GTNGVNISNLRPTTRGTQATGVSVGTFTCQTQNSSLPPPALSCLKNTK 247
Query: 254 PKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDP 312
MPN QI L + Y P ++FHPDE YLPSSV+W+F NGALLYK G+ I+
Sbjct: 248 LDFSTMPNGSQIGELFQTYSPWIYFHPDEEYLPSSVNWYFNNGALLYKKGEESKPIPIES 307
Query: 313 SGSNLPSGGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFC 371
+GSNLP GG NDG +W+DLP D G++ VK G+++S K+Y+H+KP +G TFTDI +W+F
Sbjct: 308 NGSNLPQGGSNDGSYWLDLPIDKNGKERVKKGDLQSTKVYLHIKPMLGATFTDISIWIFY 367
Query: 372 PFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDL 431
PFNGP KV +N+ +IG+H+GDWEH TLRI NFTGELW ++ SQHSGG WV A DL
Sbjct: 368 PFNGPARAKVKFVNLPLGRIGEHIGDWEHTTLRISNFTGELWRVFLSQHSGGVWVDACDL 427
Query: 432 EYIEG---NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELV 488
E+ G NK + Y+S +GHA +P PG LQG + G+G+RND A+ +D+ + YE++
Sbjct: 428 EFQGGGGSNKFVAYASLHGHAMYPKPGLVLQGDD--GVGIRNDTAKGKKVIDTGLGYEVI 485
Query: 489 AAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLEL 548
AAEY G GVV EP W+ + RKWGP I Y+ E+ + ++LP +++ + V K+P E+
Sbjct: 486 AAEYDGGGVV-EPPWVNYFRKWGPKIDYNVDDEVKSVERILPGLLKKAFVKFVKKIPDEV 544
Query: 549 YGEEGPTGPKEKNNWVGDE 567
YGE+GPTGPK K+NW GDE
Sbjct: 545 YGEDGPTGPKLKSNWAGDE 563
>gi|218199888|gb|EEC82315.1| hypothetical protein OsI_26592 [Oryza sativa Indica Group]
Length = 538
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 259/557 (46%), Positives = 332/557 (59%), Gaps = 43/557 (7%)
Query: 22 TFSLPAPLPTWPQG---QGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPA 78
FSLPAPLP+WP GFA G I+LG +EV +++ F +WS ATF+ P
Sbjct: 12 AFSLPAPLPSWPSSVSDGGFAKGSIDLGGLEVRQVTTFAKVWSTG---QDGGGATFFRPE 68
Query: 79 GIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC 138
+P GF +LGHY Q + RPL G VLVARD++ L PLDY VW
Sbjct: 69 QVPAGFSALGHYAQRNDRPLFGHVLVARDVSGGGL-------------LAPPLDYAPVWS 115
Query: 139 SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLI 198
S +G A FWLP PPDGY+++G VT +P+KP DEV CVR D TD CE +
Sbjct: 116 SQDG-------AAHFWLPTPPDGYRAIGVAVTASPDKPPRDEVACVRADFTDACEAEATV 168
Query: 199 FDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTF-FCSSNWISGQELNIACLKNLDPKLH 257
+D FS + RP RG+ RGV GTF S+ + +ACLKN
Sbjct: 169 WD------KDGFSAVALRPAVRGVDARGVHAGTFVLARSDATAASASALACLKNNGAAYT 222
Query: 258 A-MPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSN 316
+ MP+ Q++AL+ Y P +F HPDE YLPSSV+WFF NGALLY+ G+ + GSN
Sbjct: 223 SCMPDLAQVNALLAAYAPQLFLHPDEPYLPSSVTWFFQNGALLYQKGNQTPTPVAADGSN 282
Query: 317 LPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNG 375
LP GG NDG +W+DLP D R+ VK G++ AK+YV KP +G T TD+ +W F PFNG
Sbjct: 283 LPQGGGNDGGYWLDLPVDNSQRERVKKGDLPGAKVYVQAKPMLGATATDLAVWFFYPFNG 342
Query: 376 PGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE 435
P KVG + + KIG+HVGDWEH TLR+ NF+GEL +YFSQHS G WV A LEY++
Sbjct: 343 PARAKVGPLTIPLGKIGEHVGDWEHVTLRVSNFSGELLRMYFSQHSAGAWVDAPQLEYLD 402
Query: 436 -GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR-SNLYVDSSIQYELVAAEYL 493
GN+ YSS +GHA +P G LQG LG+G+RND R S L + + E+V+AEYL
Sbjct: 403 GGNRPSAYSSLHGHALYPRAGLVLQGDARLGVGIRNDCDRGSRLDTGGAGRCEVVSAEYL 462
Query: 494 --GEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGE 551
G G VAEP+WL F R+WGP YD E++++ KLLP S + KL ++
Sbjct: 463 GGGGGGVAEPTWLLFDREWGPREEYDIGREINRVAKLLP----RSTRERLRKLVESVFVG 518
Query: 552 EGPTGPKEKNNWVGDER 568
EGPTGP+ K +W DER
Sbjct: 519 EGPTGPRMKGSWRNDER 535
>gi|357480211|ref|XP_003610391.1| hypothetical protein MTR_4g131710 [Medicago truncatula]
gi|355511446|gb|AES92588.1| hypothetical protein MTR_4g131710 [Medicago truncatula]
Length = 586
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 249/558 (44%), Positives = 341/558 (61%), Gaps = 34/558 (6%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIP 81
F LP +WP+G GFA G ++LG ++V S FN IW K A+ +EP GIP
Sbjct: 50 VFKLPTATTSWPKGNGFARGVLDLGGLQVSAASTFNKIWGAIDGGPDDKGASVFEPTGIP 109
Query: 82 DGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDE 141
GF LG Y Q +++PL G+VLVA+D++SS + L+ P+DYT V +
Sbjct: 110 QGFSMLGSYSQPNNKPLFGYVLVAKDVSSSTTKST----------LKPPIDYTFV-LKAK 158
Query: 142 GGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
+ WLP P+GY+++G +VTKT +KP LD++ CVR DLT+KCE LI++
Sbjct: 159 SYSATQVLPLYIWLPVAPNGYRALGHVVTKTRDKPPLDKIMCVRSDLTEKCETSSLIWE- 217
Query: 202 ISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPN 261
S F+ + RP +RG RG+ VGTF + ++ +I+CLKNL+ MPN
Sbjct: 218 -----SEDFNFYDVRPISRGTQARGIHVGTFVAQNGRLTNPP-SISCLKNLNSISKIMPN 271
Query: 262 CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVG-EAIDPSGSNLPSG 320
QI A+++ Y P V+ HPDE YLPSSV+WFF+NGALLYK G I +G+NLP G
Sbjct: 272 LQQIKAILKVYSPFVYLHPDEKYLPSSVNWFFSNGALLYKKGHESNPRPIAQNGTNLPQG 331
Query: 321 G--RNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPG 377
++D +WIDLP+D + VK GN++SA+ YVHVKP GGTFTD MW F PFNGPG
Sbjct: 332 PNPKHDRAYWIDLPADHANKDRVKQGNLQSAESYVHVKPMFGGTFTDFAMWTFYPFNGPG 391
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY---- 433
LK+G +N+ +IG+HVGDWEH TLR+ N G+LW +YFS H G WV + LE+
Sbjct: 392 RLKIGFINIKLERIGEHVGDWEHVTLRVSNLDGKLWKVYFSHHKTGSWVDSSQLEFQNDT 451
Query: 434 ----IEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVA 489
+ +V++S +GH+++PH G L G + GIG R+D +S+ +D +Y LV+
Sbjct: 452 SNIDFPTKRPVVHASLHGHSTYPHAGLVLLGKK--GIGARDDTDKSSKVMDMG-KYVLVS 508
Query: 490 AEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELY 549
AEYLG V EP+WL F R WGP + Y + EL+K+ KL +++ E + LP E+
Sbjct: 509 AEYLGSK-VKEPAWLNFYRTWGPHVDYKLEDELNKLKKLFLGKLKHVFEKVIRGLPKEML 567
Query: 550 GEEGPTGPKEKNNWVGDE 567
EEGPTGPKEK NW GDE
Sbjct: 568 QEEGPTGPKEKKNWNGDE 585
>gi|356555262|ref|XP_003545953.1| PREDICTED: uncharacterized protein LOC100820603 [Glycine max]
Length = 874
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 248/562 (44%), Positives = 338/562 (60%), Gaps = 63/562 (11%)
Query: 15 MSPTEPGTFSLPAPLPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSAT 73
+P TF LPA +P WP G GFA+ I+LG + + +IS FN +W+ AT
Sbjct: 12 QNPRIETTFKLPADIPVWPPGGGFATSIIDLGGGLLLSQISTFNKVWTTYEGGPNNLGAT 71
Query: 74 FYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDY 133
F+EP G+ +GF+ LG YCQ +++PL G+VLV +D +S+ + AL KP+DY
Sbjct: 72 FFEPTGLSEGFFMLGCYCQPNNKPLHGWVLVGKDNSST-----------LNGALAKPVDY 120
Query: 134 TLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCE 193
LVW + + + +G + WLP P+GYK +G +VT +P KP LD++RCVR DLTD+C
Sbjct: 121 KLVWNT-KSLKIKQDGQGYIWLPIAPEGYKPVGHVVTTSPEKPSLDKIRCVRSDLTDECT 179
Query: 194 VHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNW-ISGQELNIACLKNL 252
H + + + + F+V+ RP RG+ +GVSVGTF S + + L I+CLKN
Sbjct: 180 TCHSM--KLWRTENKRFNVYDVRPIKRGIEAQGVSVGTFLAQSGGGTNSKALPISCLKNT 237
Query: 253 DPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY--KAGDLVGEAI 310
MPN QI A+I+ Y P ++ HP E YLPSSV WFFTNGA+L + G + +I
Sbjct: 238 KGSFSYMPNLSQIKAMIKAYSPYMYLHPMEEYLPSSVDWFFTNGAVLIEKRKGVIRESSI 297
Query: 311 DPSGSNLPSGGRNDGE---FWIDLPSDGGRQI-VKHGNMESAKLYVHVKPAVGGTFTDIV 366
+P+GSNLP GG ND + +W+DLP D +++ +K G++ S++ YVHVKP +GGTFTDIV
Sbjct: 298 EPNGSNLPQGGSNDDDDVTYWLDLPLDETKRVSIKKGDLASSQAYVHVKPMLGGTFTDIV 357
Query: 367 MWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWV 426
MW+F PFNG KV N+ G+HVGDWEH TLR+ NF GELW +YFSQHS GKWV
Sbjct: 358 MWIFYPFNGGARAKVACTNIPLRTKGEHVGDWEHLTLRVSNFNGELWRVYFSQHSEGKWV 417
Query: 427 AAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYE 486
A +L++ GN+ YSS +GHA FP PG +QG + G+GVRNDAARS+ +D + +E
Sbjct: 418 DASELDFQNGNRPAAYSSLHGHALFPKPGLVMQG--MRGLGVRNDAARSDAVMDMATWFE 475
Query: 487 LVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPL 546
+VAAEYLG + EP WL + WGP
Sbjct: 476 IVAAEYLG-SQIREPPWLNYWMNWGP---------------------------------- 500
Query: 547 ELYGEEGPTGPKEKNNWVGDER 568
+EGP GPK+K+ W GDER
Sbjct: 501 ----KEGPKGPKQKDMWKGDER 518
>gi|222637330|gb|EEE67462.1| hypothetical protein OsJ_24853 [Oryza sativa Japonica Group]
Length = 538
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 258/557 (46%), Positives = 330/557 (59%), Gaps = 43/557 (7%)
Query: 22 TFSLPAPLPTWPQG---QGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPA 78
FSLPAPLP+WP GFA G I+LG +EV +++ F +WS ATF+ P
Sbjct: 12 AFSLPAPLPSWPSSVSDGGFAKGSIDLGGLEVRQVTTFAKVWSTG---QDGGGATFFRPE 68
Query: 79 GIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC 138
+P GF +LGHY Q + RPL G VLVARD++ L PLDY VW
Sbjct: 69 QVPAGFSALGHYAQRNDRPLFGHVLVARDVSGGGL-------------LAPPLDYAPVWS 115
Query: 139 SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLI 198
S + A FWLP PPDGY+++G VT +P+KP DEV CVR D TD CE +
Sbjct: 116 SQDD-------AAHFWLPTPPDGYRAIGVAVTASPDKPPRDEVACVRADFTDACEAEATV 168
Query: 199 FDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTF-FCSSNWISGQELNIACLKNLDPKLH 257
+D FS + RP RG+ RGV GTF S+ + +ACLKN
Sbjct: 169 WD------KDGFSAVALRPAVRGVDARGVHAGTFVLARSDATAASASALACLKNNGAAYT 222
Query: 258 A-MPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSN 316
+ MP+ Q++AL+ Y P +F HPDE YLPSSV+WFF NGALLY+ G + GSN
Sbjct: 223 SCMPDLAQVNALLAAYAPQLFLHPDEPYLPSSVTWFFQNGALLYQKGSQTPTPVAADGSN 282
Query: 317 LPSGGRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNG 375
LP GG NDG +W+DLP D R+ VK G++ AK+YV KP +G T TD+ +W F PFNG
Sbjct: 283 LPQGGGNDGGYWLDLPVDNFQRERVKKGDLAGAKVYVQAKPMLGATATDLAVWFFYPFNG 342
Query: 376 PGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE 435
P KVG + + KIG+HVGDWEH TLR+ NF+GEL +YFSQHS G WV A LEY++
Sbjct: 343 PARAKVGPLTIPLGKIGEHVGDWEHVTLRVSNFSGELLRMYFSQHSAGAWVDAPQLEYLD 402
Query: 436 -GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR-SNLYVDSSIQYELVAAEYL 493
GN+ YSS +GHA +P G LQG LG+G+RND R S L + + E+V+AEYL
Sbjct: 403 GGNRPSAYSSLHGHALYPRAGLVLQGDARLGVGIRNDCDRGSRLDTGGAGRCEVVSAEYL 462
Query: 494 --GEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGE 551
G G VAEP+WL F R+WGP YD E++++ KLLP S + KL ++
Sbjct: 463 GGGGGGVAEPTWLLFDREWGPREEYDIGREINRVAKLLP----RSTRERLRKLVESVFVG 518
Query: 552 EGPTGPKEKNNWVGDER 568
EGPTGP+ K +W DER
Sbjct: 519 EGPTGPRMKGSWRNDER 535
>gi|297824429|ref|XP_002880097.1| hypothetical protein ARALYDRAFT_483546 [Arabidopsis lyrata subsp.
lyrata]
gi|297325936|gb|EFH56356.1| hypothetical protein ARALYDRAFT_483546 [Arabidopsis lyrata subsp.
lyrata]
Length = 542
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 235/538 (43%), Positives = 335/538 (62%), Gaps = 33/538 (6%)
Query: 35 GQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFD 94
G+GF GRI+LG +EV ++ FN +W+ ATF+EP+ IP+GF LG Y Q +
Sbjct: 32 GEGFGQGRIDLGGLEVFQVEIFNKVWTVYEGGQDNLGATFFEPSSIPEGFTILGFYAQPN 91
Query: 95 SRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGG-QGNYEGCAFF 153
+R L G LV +DL+ +L+ P+D+ L+W + N FF
Sbjct: 92 NRKLFGRTLVGKDLSGD--------------SLRPPVDFLLLWSGKSTKVENNGVETGFF 137
Query: 154 WLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVW 213
W P PPDGY ++G +V + KP LD++RCVR DLTD+ E LI++ ++ FS+
Sbjct: 138 WQPVPPDGYNAVGLVVATSGEKPPLDKIRCVRSDLTDQSESDALIWE------TNGFSIS 191
Query: 214 STRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYG 273
S++P NRG GV +GTFF +S + CLKN MP+ QI AL + Y
Sbjct: 192 SSKPVNRGTQASGVCIGTFFSNS-----PTPTLHCLKNNKFDFSCMPSKPQIDALFQAYA 246
Query: 274 PTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGRNDGEFWIDLP 332
P ++FH DE YLPSSV+WFF+NGALLYK D ++P+G NLP G NDG +W+DLP
Sbjct: 247 PWIYFHKDEKYLPSSVNWFFSNGALLYKKDDESNPVPVEPNGLNLPQGESNDGLYWLDLP 306
Query: 333 -SDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKI 391
+ R+ V+ G+++S ++Y+H+KP GGTFTDI +W+F PFNGP K+ + KI
Sbjct: 307 VASDARKRVQGGDLQSMEVYLHIKPVFGGTFTDIAVWIFYPFNGPSRAKLKAATIPLGKI 366
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG-NKAIVYSSKNGHAS 450
G+H+GDWEHFTLRI NF G+L+ +Y SQHSGG W A ++E+ G NK + Y+S NGHA
Sbjct: 367 GEHIGDWEHFTLRISNFNGKLYRMYLSQHSGGSWTDASEIEFQGGRNKPVAYASLNGHAM 426
Query: 451 FPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKW 510
+ PG LQG + +G+RND +S +D+++++ +VAAEY+ G V EP+WL +MR W
Sbjct: 427 YSKPGLVLQGKD--NVGIRNDTGKSEKVIDTAVRFRVVAAEYM-RGEVEEPAWLNYMRHW 483
Query: 511 GPTIVYDSKTELDKIIKLLP-LMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
GP I Y + E+ + K++ ++ + +A+ LP E++GEEGPTGPK K NW+GDE
Sbjct: 484 GPKIDYGHENEIRGVEKIMVGESLKNTFRSAIKGLPNEVFGEEGPTGPKLKRNWLGDE 541
>gi|357451119|ref|XP_003595836.1| Vacuolar protein sorting-associated protein [Medicago truncatula]
gi|124360096|gb|ABN08112.1| Protein of unknown function DUF946, plant [Medicago truncatula]
gi|355484884|gb|AES66087.1| Vacuolar protein sorting-associated protein [Medicago truncatula]
Length = 621
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 245/543 (45%), Positives = 331/543 (60%), Gaps = 32/543 (5%)
Query: 35 GQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFD 94
G FA+ I+LGE+ V + S F+ +W + A+ YEP G+ G+ LG Y Q +
Sbjct: 100 GTSFANTVIDLGELHVMQASTFSKVWGAYESGPDDQGASVYEPTGLRYGYSMLGSYSQPN 159
Query: 95 SRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFW 154
++PL G+VL A+D++ S G L+ P+DYTLV + + + + W
Sbjct: 160 NKPLFGYVLAAKDISGSTTNGT----------LKPPVDYTLV-SNTASVMVDQDSPLYIW 208
Query: 155 LPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWS 214
LP PDGY ++G +VT T +KP D+V CVR DLT++CE I+ S+ + +
Sbjct: 209 LPIAPDGYHAVGHVVTTTQDKPSPDKVMCVRADLTEQCESSTWIW------GSNDLNFYD 262
Query: 215 TRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGP 274
RP NRG GV VGTF + N + +I+CLKNL+ MPN QI L+ +Y P
Sbjct: 263 VRPINRGSKAPGVRVGTF-VAQNGGNTNPPSISCLKNLNSISQIMPNQTQIETLLHSYSP 321
Query: 275 TVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGRNDGEFWIDLP- 332
++ HPDE YLPSSV+WFF+NGALLYK G+ I+ +G+NLP NDG +W+DLP
Sbjct: 322 FLYLHPDEEYLPSSVNWFFSNGALLYKKGEESNPVPIEQNGTNLPQDPNNDGAYWLDLPV 381
Query: 333 SDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIG 392
D ++ VK GN++SA+ YVHVKP GGTFTDI MWVF PFNGP K+ +NV K+G
Sbjct: 382 DDANKERVKQGNLQSAESYVHVKPMYGGTFTDIAMWVFYPFNGPAKAKIKFINVKLGKVG 441
Query: 393 QHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN--------KAIVYSS 444
+HVGDWEH TLR+ N G+LW +YFSQH+GG WV A LE+ N + +VY+S
Sbjct: 442 EHVGDWEHVTLRVSNLDGQLWHVYFSQHNGGSWVDASQLEFQMDNSSFDFPTQRPVVYAS 501
Query: 445 KNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWL 504
++GHAS+PH G +L G G+G R+D + + +D +Y LV+AEYL E V EP+WL
Sbjct: 502 RHGHASYPHGGLHLLGKN--GVGARDDTDKGSNVMDMG-KYVLVSAEYL-ESEVKEPAWL 557
Query: 505 QFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWV 564
F R+WGP + Y EL + KLLP ++ E + LP EL GEEGPTGPKEK NW
Sbjct: 558 NFFREWGPHVDYSLDDELKNVEKLLPGKLKDVFEKIIMGLPKELLGEEGPTGPKEKGNWN 617
Query: 565 GDE 567
GDE
Sbjct: 618 GDE 620
>gi|302771856|ref|XP_002969346.1| hypothetical protein SELMODRAFT_231324 [Selaginella moellendorffii]
gi|300162822|gb|EFJ29434.1| hypothetical protein SELMODRAFT_231324 [Selaginella moellendorffii]
Length = 514
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 251/538 (46%), Positives = 327/538 (60%), Gaps = 34/538 (6%)
Query: 36 QGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDS 95
+GF + I LGEIEV IW ++ A FY+P +PDGF++LG + +
Sbjct: 6 RGFGTRVIPLGEIEVLEAEHLEKIW-----ETSDGRALFYKPVQVPDGFHALGCHARRTE 60
Query: 96 RPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWL 155
P +LV S GA K L P+DY+LVW S + E FWL
Sbjct: 61 VPDSLLLLVR---GSGVTAGAS-----KPAPLAPPVDYSLVWKSPVRNEN--ETAMAFWL 110
Query: 156 PQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWST 215
P PP GYK++GF+ + T KP +DEV CVR+DLT + + I +S F VW T
Sbjct: 111 PLPPAGYKALGFVASVT--KPGVDEVACVREDLTSDATISNSIL-------ASSFYVWQT 161
Query: 216 RPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKL-HAMPNCDQIHALIRNYGP 274
RP G+ +G+S GTF+CSS+ S +IA L+N+ L +MP+ Q+ AL++ Y P
Sbjct: 162 RPREVGVTAKGLSTGTFYCSSS-SSSPLPSIATLQNVSFDLPSSMPSLQQLDALVKRYAP 220
Query: 275 TVFFHPDEVYLPSSVSWFFTNGALLY-KAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPS 333
+F HPDE + P SV WFF NGALLY K I +G+NLP G NDG +W+DLP
Sbjct: 221 IMFLHPDEDFFPCSVEWFFQNGALLYSKQNPSAPVQITANGANLPGGETNDGSYWLDLPR 280
Query: 334 DGGR-QIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIG 392
D + V+ GN+ESA Y+HVKP GGTFTD+ W + PFNGP TLKVG++NV KIG
Sbjct: 281 DAAAAEKVRRGNLESAITYLHVKPVFGGTFTDMQFWFYYPFNGPATLKVGLINVKLGKIG 340
Query: 393 QHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN--KAIVYSSKNGHAS 450
+HV DWEH TLR+ NFTG L S+Y HS G W DLE+ + VY+SK+GHA
Sbjct: 341 EHVSDWEHLTLRVDNFTGNLSSVYLGHHSSGTWYQPSDLEFDSTTTYRPFVYASKHGHAL 400
Query: 451 FPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKW 510
+ HPG LQG E G+G+RND RS+ D S +++L++A YLG V EP WLQFMR+W
Sbjct: 401 YAHPGDNLQGDEERGLGIRNDTKRSSFTWDVS-KHQLISAAYLG---VEEPPWLQFMREW 456
Query: 511 GPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
GP I Y+S+TEL+K+ K LP IR S+E+ ++ P EL GEEGPTGPK+KN W GDER
Sbjct: 457 GPKIQYNSRTELEKVFKFLPKKIRRSIEDIFNRFPNELSGEEGPTGPKQKNFWTGDER 514
>gi|356528607|ref|XP_003532891.1| PREDICTED: uncharacterized protein LOC100783235 [Glycine max]
Length = 530
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 249/555 (44%), Positives = 331/555 (59%), Gaps = 63/555 (11%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGI 80
TF LPA +P WP G GFA+G I+LG + V +IS FN +W+ ATF+EP G+
Sbjct: 19 TFKLPADIPVWPPGGGFATGIIDLGGGLLVSQISTFNKVWTTYEGGPNNLGATFFEPTGL 78
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
+GF+ LG YCQ +++PL G VLV +D +S TSN AL +P+DY LVW +
Sbjct: 79 SEGFFMLGCYCQPNNKPLHGCVLVGKDNSS-------TSN----GALAEPVDYKLVW-NT 126
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFD 200
+ + +G + WLP PDGY +G +VT +P KP LD++RCVR DLTD+ H +
Sbjct: 127 KSQKIKQDGHGYIWLPISPDGYNPVGHVVTTSPEKPSLDKIRCVRSDLTDESTTCHSM-- 184
Query: 201 AISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNW-ISGQELNIACLKNLDPKLHAM 259
+ + + F+V+ RP RG+ +GVSVGTF S + + I CLKN M
Sbjct: 185 KLWRTENKRFNVYDVRPIKRGIEAQGVSVGTFLAQSGGGTNSKAFPIFCLKNTKGSFSYM 244
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY--KAGDLVGEAIDPSGSNL 317
PN QI A+I+ Y P ++ HP E YLPSSV WFFTNGA+L + G + +I+P+G+NL
Sbjct: 245 PNLSQIKAMIKAYSPYMYLHPMEEYLPSSVDWFFTNGAVLIEKRKGVIRESSIEPNGTNL 304
Query: 318 PSGGRND---GEFWIDLPSDGGRQI-VKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPF 373
P G ND +W+DLP D +++ VK G++ S++ YVHVKP +GGTFTDIVMWVF PF
Sbjct: 305 PQGSSNDYDDATYWLDLPLDETKRVSVKKGDLASSQAYVHVKPMLGGTFTDIVMWVFYPF 364
Query: 374 NGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY 433
NG KV N+ G+HVGDWEH TLRI NF GELW +YFSQHS G+W A +LE+
Sbjct: 365 NGGARAKVACTNIPLRTKGEHVGDWEHLTLRISNFNGELWKVYFSQHSKGQWEDASELEF 424
Query: 434 IEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYL 493
GN+ + YSS +GHA FP PG +QG + G+GVRNDAA+S+ +D + +E+VAAEYL
Sbjct: 425 QNGNRPVAYSSLHGHALFPKPGLVMQG--MRGLGVRNDAAKSDAVMDMATWFEIVAAEYL 482
Query: 494 GEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEG 553
G + EP WL F WGP +EG
Sbjct: 483 GSQ-IREPPWLNFCMNWGP--------------------------------------KEG 503
Query: 554 PTGPKEKNNWVGDER 568
P GPK+K+ W GDER
Sbjct: 504 PKGPKQKDMWKGDER 518
>gi|242046078|ref|XP_002460910.1| hypothetical protein SORBIDRAFT_02g037340 [Sorghum bicolor]
gi|241924287|gb|EER97431.1| hypothetical protein SORBIDRAFT_02g037340 [Sorghum bicolor]
Length = 553
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 250/558 (44%), Positives = 329/558 (58%), Gaps = 42/558 (7%)
Query: 23 FSLPAPLPTWP----QGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPA 78
F PA LP P + GF G I+LG +EV +I+ F +WS ATF++P+
Sbjct: 24 FVPPAGLPAVPASGSEDGGFGKGSIDLGGLEVRQITTFAKVWSTTQGGQDGLGATFFKPS 83
Query: 79 GIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC 138
+P GF LGHY Q ++RPL G VLV RD + + A L PLDY LVW
Sbjct: 84 PVPAGFSVLGHYAQPNNRPLFGHVLVGRDTSGTGA------------VLAAPLDYALVWS 131
Query: 139 SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLI 198
S +G FWLP P+GYK++G +VT +KP DEVRCVR D TD E+
Sbjct: 132 SPDG-------AGHFWLPTAPEGYKAVGVVVTAAADKPSPDEVRCVRSDFTDASEIE--- 181
Query: 199 FDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWI-SGQELNIACLKNLDPK-L 256
D + FS + RP RG+ RGV GTF S+ +G ACLKN
Sbjct: 182 -DTVLTSDKDGFSAATLRPAVRGIDARGVHAGTFLAQSSATPAGASTTPACLKNNSASYT 240
Query: 257 HAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSN 316
AMP+ Q+++L+ Y P V+ HP+E YLPSSV+WFF N ALLY+ G + GSN
Sbjct: 241 SAMPDLAQVNSLLAAYAPHVYLHPNESYLPSSVTWFFENDALLYQKGSQTPTPVAADGSN 300
Query: 317 LPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNG 375
LP GG NDG +W+DLP D R+ VK G++ AK+YV KP +GGT TD+ +W+F PFNG
Sbjct: 301 LPQGGGNDGGYWLDLPVDNNQREKVKKGDLAGAKVYVQAKPMLGGTVTDLAVWIFFPFNG 360
Query: 376 PGTLKVG-IMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYI 434
P KVG I ++ KIG+HVGDWEH TLR+ NF+GEL +YFSQHS G WV A LEY+
Sbjct: 361 PARAKVGLIQSIPLGKIGEHVGDWEHVTLRVSNFSGELLRVYFSQHSAGTWVDASRLEYL 420
Query: 435 E----GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAA 490
GN+ + Y+S++GHA +P+ GT LQG+ LGIG+RND AR + + + E+V+
Sbjct: 421 ADGDGGNRPVAYASQHGHAFYPNAGTVLQGNTSLGIGIRNDCARGSRLDTGAGRCEVVSV 480
Query: 491 EYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYG 550
EYLG V EP+W+ F R+WGP YD E+++ ++LP +R + V K+ L G
Sbjct: 481 EYLG---VKEPAWVGFEREWGPREEYDIGREINRAARILPRSVRERLAKLVEKV---LVG 534
Query: 551 EEGPTGPKEKNNWVGDER 568
EGPTGPK NW DE+
Sbjct: 535 -EGPTGPKMHGNWRNDEK 551
>gi|357451113|ref|XP_003595833.1| hypothetical protein MTR_2g062340 [Medicago truncatula]
gi|124360100|gb|ABN08116.1| Protein of unknown function DUF946, plant [Medicago truncatula]
gi|355484881|gb|AES66084.1| hypothetical protein MTR_2g062340 [Medicago truncatula]
Length = 517
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 245/566 (43%), Positives = 330/566 (58%), Gaps = 62/566 (10%)
Query: 10 NEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSK 68
N SP +F LP+ +P P G GFASG I+LG + V +IS FN +W+ N
Sbjct: 7 NSKKKRSPLIETSFKLPSDIPVLPPGDGFASGIIDLGGGLLVSQISTFNKVWTTNEGGPD 66
Query: 69 KKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQ 128
T +EP G+ +GF+ LG+YCQ +++PL G+VLV +D +S+ AL+
Sbjct: 67 DLGVTIFEPTGLSEGFFVLGYYCQPNNKPLHGWVLVGKDNSSTTNR-----------ALK 115
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
KPL+Y LV C+ + Q + + WLP PDGYK +G +VT + KP LD++ CVR DL
Sbjct: 116 KPLNYKLV-CNIKSLQNRQDIRGYIWLPIAPDGYKVVGHVVTTSQEKPSLDKIMCVRSDL 174
Query: 189 TDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIAC 248
TD+C + I + + + F+V+ RP RG+ +GV VGTF + + L I C
Sbjct: 175 TDECVKYKSI--KLWRTENKRFNVYDVRPMKRGVEAKGVYVGTFLAQCGRKNSKTLPIVC 232
Query: 249 LKNLDP-KLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVG 307
LKN + K +MPN QI LI+ Y P ++ HP + YLPSSV WFF NG +L + D V
Sbjct: 233 LKNTNAIKFSSMPNLHQIETLIKAYSPYMYLHPMDKYLPSSVEWFFINGTILCEKRDGVI 292
Query: 308 EA--IDPSGSNLPSGGRNDG--EFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTF 362
I+P+GSNLP G + G +W+DLP D ++ VK G+++S+K YVHVKP +GGTF
Sbjct: 293 NVSPIEPTGSNLPQGCSSIGMSSYWLDLPMDEAAKERVKKGDLQSSKAYVHVKPMLGGTF 352
Query: 363 TDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSG 422
TD+VMWVF PFNG KV MN+ G+HVGDWEH TLR+ NF+GELW +Y SQHS
Sbjct: 353 TDLVMWVFYPFNGGARAKVAFMNIPLRTKGEHVGDWEHVTLRVSNFSGELWMVYLSQHSK 412
Query: 423 GKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSS 482
G+WV A DLE+ GN+ ++YSS +GHA FP PG +QG + G G+RNDA +S+L +D
Sbjct: 413 GQWVDACDLEFKNGNRPVLYSSLHGHALFPRPGCVMQG--VRGFGIRNDACKSDLVMDMV 470
Query: 483 IQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
YE+VAAEYLG + EP WL + WGP
Sbjct: 471 KGYEIVAAEYLGSE-IREPPWLNYEMNWGP------------------------------ 499
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDER 568
EGP GPK+K+ W GDER
Sbjct: 500 --------REGPKGPKQKDFWKGDER 517
>gi|302774525|ref|XP_002970679.1| hypothetical protein SELMODRAFT_94162 [Selaginella moellendorffii]
gi|300161390|gb|EFJ28005.1| hypothetical protein SELMODRAFT_94162 [Selaginella moellendorffii]
Length = 523
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 247/540 (45%), Positives = 325/540 (60%), Gaps = 29/540 (5%)
Query: 36 QGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDS 95
+GF + I LGEIEV IW ++ A FY+P +PDGF++LG + +
Sbjct: 6 RGFGTRVIPLGEIEVLEAEHLEKIW-----ETSDGRALFYKPVQVPDGFHALGCHARRTE 60
Query: 96 RPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWL 155
P +LV S GA K L P+DY+LVW S + E FWL
Sbjct: 61 VPDSLLLLVR---GSGVTAGAS-----KPAPLAPPVDYSLVWKSPVRSEN--ETAMAFWL 110
Query: 156 PQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF--DAISKFSSSPFSVW 213
P PP GYK++GF+ + KP +DEV CVR+DLT + + I+ + F VW
Sbjct: 111 PLPPAGYKALGFVASVA--KPGVDEVACVREDLTSDATISNSIWASSVADNNTVESFYVW 168
Query: 214 STRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKL-HAMPNCDQIHALIRNY 272
TRP G+ +G+S GTF+CSS S +IA L+N+ L +MP+ Q+ AL++ Y
Sbjct: 169 QTRPREVGVTAKGLSTGTFYCSSF-SSSPLPSIATLQNVSFDLPSSMPSLQQLDALVKRY 227
Query: 273 GPTVFFHPDEVYLPSSVSWFFTNGALLY-KAGDLVGEAIDPSGSNLPSGGRNDGEFWIDL 331
P +F HPDE + P SV WFF NGALLY K I +G+NLP G NDG +W+DL
Sbjct: 228 APIMFLHPDEDFFPCSVEWFFQNGALLYSKQNPSAPVQITANGANLPGGETNDGSYWLDL 287
Query: 332 PSDGGR-QIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSK 390
P D + V+ GN+E+A Y+HVKP GGTFTD+ W + PFNGP TLKVG+++V K
Sbjct: 288 PRDAAAAEKVRRGNLENAITYLHVKPVFGGTFTDMQFWFYYPFNGPATLKVGLISVKLGK 347
Query: 391 IGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN--KAIVYSSKNGH 448
IG+HV DWEH TLR+ NFTG L S+Y HS G W DLE+ + VY+SK+GH
Sbjct: 348 IGEHVSDWEHLTLRVDNFTGNLSSVYLGHHSSGTWYQPSDLEFDSTTTYRPFVYASKHGH 407
Query: 449 ASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMR 508
A + HPG LQG E G+G+RND RS+ D S +++L++A YLG V EP WLQFMR
Sbjct: 408 ALYAHPGDNLQGDEERGLGIRNDTKRSSFTWDVS-KHQLISAAYLG---VEEPPWLQFMR 463
Query: 509 KWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
+WGP I Y+S+TEL+K+ K LP IR S+E+ ++ P EL GEEGPTGPK+KN W GDER
Sbjct: 464 EWGPKIQYNSRTELEKVFKFLPKKIRRSIEDIFNRFPNELSGEEGPTGPKQKNFWTGDER 523
>gi|226493526|ref|NP_001145101.1| uncharacterized protein LOC100278316 [Zea mays]
gi|195651229|gb|ACG45082.1| hypothetical protein [Zea mays]
Length = 559
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 251/563 (44%), Positives = 338/563 (60%), Gaps = 46/563 (8%)
Query: 23 FSLPAPLPTWP----QGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPA 78
F PA LP+ P + GF G I+LG +EV +++ F +WS ATF++P+
Sbjct: 24 FVAPAALPSVPASDSEDGGFGKGSIDLGGLEVRQVTTFAKVWSTTQGGQDGLGATFFKPS 83
Query: 79 GIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC 138
+P GF LGHY Q ++RPL G VLV RD +S A GA L P+DY LVW
Sbjct: 84 PVPAGFSVLGHYAQPNNRPLFGHVLVGRD--TSGAGGA---------LLAAPVDYALVWS 132
Query: 139 SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLI 198
S +G FWLP P+GYK++G +VT +KP DEVRCVR D TD CE +
Sbjct: 133 SPDG-------AGHFWLPTAPEGYKAVGVVVTAAADKPSPDEVRCVRADFTDACETEESV 185
Query: 199 FDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWI----SGQELNIACLKNLDP 254
+ + FS + RP RG+ RGV GTF S+ +G +ACLKN
Sbjct: 186 WSS----DKDGFSASTLRPAVRGIEARGVHAGTFLAQSSATPAAAAGVSTTLACLKNNSA 241
Query: 255 K-LHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPS 313
+MP+ Q+++L+ Y P V+ HP+E Y PSSV+WFF NGAL+Y+ G P+
Sbjct: 242 SYTSSMPDLAQLNSLLAAYAPHVYLHPNEPYFPSSVAWFFENGALMYQKGSSQTPTPVPA 301
Query: 314 -GSNLPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFC 371
GSNLP GG NDG +W+DLP+DG R+ VK G++ AK+YV KP +GGT D+ +W+F
Sbjct: 302 DGSNLPQGGGNDGGYWLDLPADGNQREKVKRGDLAGAKVYVQAKPMLGGTVADLAVWIFY 361
Query: 372 PFNGPGTLKVGIM-NVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
PFNGP KVG++ ++ +IG+HVGDWEH TLR+ NF+GEL +YFSQHS G WV A
Sbjct: 362 PFNGPARAKVGLIPSIPLGEIGEHVGDWEHVTLRVSNFSGELLRMYFSQHSAGTWVDASR 421
Query: 431 LEYIE-----GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQY 485
LEY++ G++ + Y+S +GHA +P+ GT LQG+ LGIG+RND AR + + +
Sbjct: 422 LEYLDGGDGGGSRPVAYASLHGHAFYPNAGTVLQGNSGLGIGIRNDCARGSRLDTGAGRC 481
Query: 486 ELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLP 545
E+V+AEYLG V EP+WL F R+WGP YD E+++ +LLP +R + V K+
Sbjct: 482 EVVSAEYLG---VKEPAWLGFEREWGPRDEYDIGREINRAARLLPRSVRERLAKLVEKV- 537
Query: 546 LELYGEEGPTGPKEKNNWVGDER 568
L G EGPTGPK NW DER
Sbjct: 538 --LVG-EGPTGPKMHGNWRNDER 557
>gi|414887277|tpg|DAA63291.1| TPA: hypothetical protein ZEAMMB73_737791 [Zea mays]
Length = 559
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 250/563 (44%), Positives = 336/563 (59%), Gaps = 46/563 (8%)
Query: 23 FSLPAPLPTWP----QGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPA 78
F PA LP+ P + GF G I+LG +EV +++ F +WS ATF++P+
Sbjct: 24 FVAPAALPSVPASDSEDGGFGKGSIDLGGLEVRQVTTFAKVWSTTQGGQDGLGATFFKPS 83
Query: 79 GIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC 138
+P GF LGHY Q ++RPL G VLV RD +S A GA L P+DY LVW
Sbjct: 84 PVPAGFSVLGHYAQPNNRPLFGHVLVGRD--TSGAGGA---------LLAAPVDYALVWS 132
Query: 139 SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLI 198
S +G FWLP P+GYK++G +VT +KP DEVRCVR D TD CE
Sbjct: 133 SPDG-------AGHFWLPTAPEGYKAVGVVVTAAADKPSPDEVRCVRADFTDACEAE--- 182
Query: 199 FDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWI----SGQELNIACLKNLDP 254
D++ FS + RP RG+ RGV GTF S+ +G +ACLKN
Sbjct: 183 -DSVWSSDKDGFSASTLRPAVRGIEARGVHAGTFLAQSSATPAAAAGVSTTLACLKNNSA 241
Query: 255 K-LHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGD-LVGEAIDP 312
+MP+ Q+++L+ Y P V+ HP+E Y PSSV+WFF NGAL+Y+ G +
Sbjct: 242 SYTSSMPDLAQLNSLLAAYAPHVYLHPNEPYFPSSVAWFFENGALMYQKGSSQTPTPVAA 301
Query: 313 SGSNLPSGGRNDGEFWIDLPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFC 371
GSNLP GG NDG +W+DLP+DG R+ K G++ AK+YV KP +GGT D+ +W+F
Sbjct: 302 DGSNLPQGGGNDGGYWLDLPADGNQREKAKRGDLAGAKVYVQAKPMLGGTVADLAVWIFY 361
Query: 372 PFNGPGTLKVGIM-NVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
PFNGP KVG++ ++ +IG+HVGDWEH TLR+ NF+GEL +YFSQHS G WV A
Sbjct: 362 PFNGPARAKVGLIPSIPLGEIGEHVGDWEHVTLRVSNFSGELLRMYFSQHSAGTWVDASR 421
Query: 431 LEYIE-----GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQY 485
LEY++ G++ + Y+S +GHA +P+ GT LQG+ LGIG+RND AR + + +
Sbjct: 422 LEYLDGGDGGGSRPVAYASLHGHAFYPNAGTVLQGNSGLGIGIRNDCARGSRLDTGAGRC 481
Query: 486 ELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLP 545
E+V+AEYLG V EP+WL F R+WGP YD E+++ +LLP +R + V K+
Sbjct: 482 EVVSAEYLG---VKEPAWLGFEREWGPRDEYDIGREINRAARLLPRSVRERLAKLVEKV- 537
Query: 546 LELYGEEGPTGPKEKNNWVGDER 568
L G EGPTGPK NW DER
Sbjct: 538 --LVG-EGPTGPKMHGNWRNDER 557
>gi|115472967|ref|NP_001060082.1| Os07g0575900 [Oryza sativa Japonica Group]
gi|34393497|dbj|BAC83058.1| unknown protein [Oryza sativa Japonica Group]
gi|50509137|dbj|BAD30244.1| unknown protein [Oryza sativa Japonica Group]
gi|113611618|dbj|BAF21996.1| Os07g0575900 [Oryza sativa Japonica Group]
Length = 522
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 248/539 (46%), Positives = 319/539 (59%), Gaps = 40/539 (7%)
Query: 37 GFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSR 96
GFA G I+LG +EV +++ F +WS ATF+ P +P GF +LGHY Q + R
Sbjct: 14 GFAKGSIDLGGLEVRQVTTFAKVWSTG---QDGGGATFFRPEQVPAGFSALGHYAQRNDR 70
Query: 97 PLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLP 156
PL G VLVARD++ L PLDY VW S + A FWLP
Sbjct: 71 PLFGHVLVARDVSGGGL-------------LAPPLDYAPVWSSQDD-------AAHFWLP 110
Query: 157 QPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTR 216
PPDGY+++G VT +P+KP DEV CVR D TD CE ++D FS + R
Sbjct: 111 TPPDGYRAIGVAVTASPDKPPRDEVACVRADFTDACEAEATVWD------KDGFSAVALR 164
Query: 217 PCNRGMLGRGVSVGTF-FCSSNWISGQELNIACLKNLDPKLHA-MPNCDQIHALIRNYGP 274
P RG+ RGV GTF S+ + +ACLKN + MP+ Q++AL+ Y P
Sbjct: 165 PAVRGVDARGVHAGTFVLARSDATAASASALACLKNNGAAYTSCMPDLAQVNALLAAYAP 224
Query: 275 TVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSD 334
+F HPDE YLPSSV+WFF NGALLY+ G + GSNLP GG NDG +W+DLP D
Sbjct: 225 QLFLHPDEPYLPSSVTWFFQNGALLYQKGSQTPTPVAADGSNLPQGGGNDGGYWLDLPVD 284
Query: 335 G-GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQ 393
R+ VK G++ AK+YV KP +G T TD+ +W F PFNGP KVG + + KIG+
Sbjct: 285 NFQRERVKKGDLAGAKVYVQAKPMLGATATDLAVWFFYPFNGPARAKVGPLTIPLGKIGE 344
Query: 394 HVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE-GNKAIVYSSKNGHASFP 452
HVGDWEH TLR+ NF+GEL +YFSQHS G WV A LEY++ GN+ YSS +GHA +P
Sbjct: 345 HVGDWEHVTLRVSNFSGELLRMYFSQHSAGAWVDAPQLEYLDGGNRPSAYSSLHGHALYP 404
Query: 453 HPGTYLQGSEILGIGVRNDAAR-SNLYVDSSIQYELVAAEYL--GEGVVAEPSWLQFMRK 509
G LQG LG+G+RND R S L + + E+V+AEYL G G VAEP+WL F R+
Sbjct: 405 RAGLVLQGDARLGVGIRNDCDRGSRLDTGGAGRCEVVSAEYLGGGGGGVAEPTWLLFDRE 464
Query: 510 WGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
WGP YD E++++ KLLP S + KL ++ EGPTGP+ K +W DER
Sbjct: 465 WGPREEYDIGREINRVAKLLP----RSTRERLRKLVESVFVGEGPTGPRMKGSWRNDER 519
>gi|147858021|emb|CAN82512.1| hypothetical protein VITISV_002153 [Vitis vinifera]
Length = 913
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 219/460 (47%), Positives = 304/460 (66%), Gaps = 17/460 (3%)
Query: 29 LPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSL 87
+PTWP G+GFA G I+LG +EVC+IS F +W+ + ATF+EP+ +P+ F L
Sbjct: 1 MPTWPPGEGFAGGTIDLGGGLEVCQISSFTKVWATHEGGPDNLGATFFEPSPMPEEFCML 60
Query: 88 GHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNY 147
G Y Q + +PL G+VL +D ++ GA L+KP+DYTLVW S E +
Sbjct: 61 GCYSQPNKKPLFGWVLAGKDNTNNPLGGA----------LKKPIDYTLVWSS-ESLKIKQ 109
Query: 148 EGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSS 207
+G + WLP PPDGYK++G ++T P KP LD++RCVR DLTD CE I+ ++
Sbjct: 110 DGNGYVWLPTPPDGYKAVGHVITNLPEKPSLDKIRCVRSDLTDLCEADTWIWGLDKVANA 169
Query: 208 SPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHA 267
+ F+V+ +P +RG VSVGTF + N + L++ACLKN+ L+AMPN Q+ A
Sbjct: 170 NGFNVYDLQPSDRGAQALAVSVGTF-VAQNGGAASSLSLACLKNISSNLYAMPNLAQVDA 228
Query: 268 LIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGRNDGE 326
L++ Y P V+ HPDE YLPSSVSWFF NGALLY+ G+ IDP+GSNLP GG NDG
Sbjct: 229 LVQAYSPWVYLHPDEQYLPSSVSWFFQNGALLYQQGEESKPVNIDPTGSNLPQGGSNDGA 288
Query: 327 FWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMN 385
+W+DLP D ++ VK G+++ + Y+H+KP +G TFTDI +W+F PFNGP KV ++N
Sbjct: 289 YWLDLPVDAKAKETVKKGDLQDSMTYMHIKPMLGATFTDIAIWIFYPFNGPARAKVELIN 348
Query: 386 VAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSK 445
++ KIG+HVGDWEH TLR+ NF G+L +++FS+HS G WV A +LE+ GNK + Y+S
Sbjct: 349 ISLGKIGEHVGDWEHVTLRVSNFNGKLMNVFFSEHSSGTWVNASELEFQNGNKVVSYASL 408
Query: 446 NGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQY 485
+GHA +P PG LQGS GI RND A+S + +D+ +Y
Sbjct: 409 HGHAFYPKPGLVLQGSGAXGI--RNDTAKSKMVMDTGTRY 446
>gi|357451121|ref|XP_003595837.1| hypothetical protein MTR_2g062380 [Medicago truncatula]
gi|124360095|gb|ABN08111.1| Protein of unknown function DUF946, plant [Medicago truncatula]
gi|355484885|gb|AES66088.1| hypothetical protein MTR_2g062380 [Medicago truncatula]
Length = 560
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 242/546 (44%), Positives = 324/546 (59%), Gaps = 39/546 (7%)
Query: 35 GQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFD 94
G FA+ I+LGE+ V + S F+ +W + A+ YEP G+ G+ LG Y Q +
Sbjct: 40 GTSFANTIIDLGELHVMQASTFSKVWGTYEGGPDNQGASVYEPTGLRYGYSMLGSYSQPN 99
Query: 95 SRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFW 154
S+PL G+VLVA+D+ S + G L+ P+DY+LV + + + W
Sbjct: 100 SKPLFGYVLVAKDI-SGKTNGT----------LKPPVDYSLV-LNTASITVTQDSSLYIW 147
Query: 155 LPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWS 214
LP P+GY ++G +VTKT +KP ++V CVR DLT++CE LI+ S+ + +
Sbjct: 148 LPIAPNGYHAVGHVVTKTQDKPSTNKVMCVRSDLTEQCESSTLIW------GSNGLNFYD 201
Query: 215 TRPCNRGMLGRGVSVGTFFCSSNWISGQELN---IACLKNLDPKLHAMPNCDQIHALIRN 271
RP NRG+ GV VGTF + G +N I+CLKN+D MP+ QI AL++
Sbjct: 202 VRPINRGIKALGVRVGTFVAQN----GGNINPPSISCLKNIDSITQIMPSKKQIDALLQV 257
Query: 272 YGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGRNDGEFWID 330
P ++ H DE YLPSSV+WFF+NGALLYK G I +G+NLP DG W+D
Sbjct: 258 NAPFLYLHSDEEYLPSSVNWFFSNGALLYKKGRESNPVPIAQNGNNLPQDPNTDGSCWLD 317
Query: 331 LPSDGG-RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFS 389
LP D ++ VK GN++SA+ YVHVKP GGTFTDI MWVF PFNG G KV +++
Sbjct: 318 LPVDNANKERVKKGNLQSAESYVHVKPMYGGTFTDIAMWVFYPFNGAGRAKVEFIDIKLG 377
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY--------IEGNKAIV 441
K G+HVGDWEH TLR+ NF G+LW +YFSQH+ GKW+ + LE+ + IV
Sbjct: 378 KAGEHVGDWEHVTLRVSNFDGQLWHVYFSQHNAGKWIDSSQLEFQSDASSFDFPTKRPIV 437
Query: 442 YSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP 501
Y+S +GHAS+PH G L G G+G R+D + +D +Y LV AEYLG V EP
Sbjct: 438 YASLHGHASYPHAGLNLLGKN--GVGARDDTNKGRNVMDMG-KYVLVCAEYLGS-EVKEP 493
Query: 502 SWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
+WL + R+WGP I +D EL + K LP +R EN + LP E GEEGPTGPKEK
Sbjct: 494 AWLNYFREWGPHIDFDLDIELKNVAKALPGKLRDGFENIIRSLPKEALGEEGPTGPKEKG 553
Query: 562 NWVGDE 567
NW GDE
Sbjct: 554 NWNGDE 559
>gi|297832746|ref|XP_002884255.1| hypothetical protein ARALYDRAFT_896056 [Arabidopsis lyrata subsp.
lyrata]
gi|297330095|gb|EFH60514.1| hypothetical protein ARALYDRAFT_896056 [Arabidopsis lyrata subsp.
lyrata]
Length = 583
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 233/555 (41%), Positives = 329/555 (59%), Gaps = 38/555 (6%)
Query: 23 FSLPAPLPTWPQGQG-FASGRINLGEIEVCRIS----RFNFIWSCNLLQSKKKSATFYEP 77
F+ P+ LP P G G F GRI+LG +EV ++S +W T ++P
Sbjct: 57 FTFPSALPVIPSGGGNFGKGRIDLGGLEVIQVSISTSTSQRVWRTYEGGPNSMGLTIFQP 116
Query: 78 AGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVW 137
+P F++LG Y Q ++R L G+VL ARD++ + +L+ P+ Y V
Sbjct: 117 INLPPSFFTLGFYAQPNNRLLFGWVLAARDVSGN--------------SLRPPVGYVEV- 161
Query: 138 CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE--VRCVRDDLTDKCEVH 195
+ N +G A+FW P PDGY+++G VT +P KP L + + CVR DLT++ E
Sbjct: 162 INTTSMNINQDGAAYFWQPLCPDGYQAVGLYVTTSPLKPSLSQESISCVRSDLTEQSETD 221
Query: 196 HLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPK 255
++ ++ ++ S RP NRG GV GTF C + + CLKN
Sbjct: 222 TWVW------GTNEMTLSSLRPANRGTEATGVHTGTFSCQPLSVPPPPPPLFCLKNTKFD 275
Query: 256 LHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSG 314
L +MP+ +Q L ++Y P ++ HPDE +LPSSV+WFF+NGALL++ G+ + P G
Sbjct: 276 LSSMPSHNQTSVLFQSYSPWIYLHPDEDFLPSSVNWFFSNGALLFQRGNESNPVPVQPDG 335
Query: 315 SNLPSGGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPF 373
SNLP GG +DG FW+D P D ++ VK G++ + K+Y+H+KP GGTFTDIV+W+F PF
Sbjct: 336 SNLPQGGSDDGLFWLDYPVDKNAKEWVKRGDLGNTKVYLHIKPMFGGTFTDIVVWIFYPF 395
Query: 374 NGPGTLK-VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE 432
NG LK + I +++ IG+H+GDWEH TLRI NF GELW YFS+HSGG V A DLE
Sbjct: 396 NGNARLKFLFIKSLSLGDIGEHIGDWEHITLRISNFNGELWRAYFSEHSGGTLVEACDLE 455
Query: 433 YIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEY 492
+ GNK + YSS +GHA F PG LQG + G G+RND ARS+ + D+ + YELVA
Sbjct: 456 FQGGNKPVSYSSLHGHAMFSRPGLVLQGDD--GNGIRNDMARSDKFFDAGVAYELVA--- 510
Query: 493 LGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEE 552
G G + EP WL + RKWGP + +D + LD I K LP ++R ++K+P E+ E+
Sbjct: 511 -GPG-IEEPPWLNYFRKWGPFVRHDIQKNLDGIAKSLPGLLRKKFRKFINKMPREVLEED 568
Query: 553 GPTGPKEKNNWVGDE 567
GPTGPK K +W D+
Sbjct: 569 GPTGPKVKRSWTADD 583
>gi|6513944|gb|AAF14848.1|AC011664_30 hypothetical protein, 5' partial [Arabidopsis thaliana]
Length = 549
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 237/561 (42%), Positives = 329/561 (58%), Gaps = 51/561 (9%)
Query: 23 FSLPAPLPTWPQGQG-FASGRINLGEIEVCRIS----RFNFIWSCNLLQSKKKSATFYEP 77
F+ P+ LP P G G F GRI+LG +EV ++S +W + ++P
Sbjct: 24 FTFPSALPVIPSGGGNFGKGRIDLGGLEVIQVSISTSTSQRVWRTYEGGPDNMGLSIFQP 83
Query: 78 AGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVW 137
+P F +LG Y Q ++R L G+VL ARD++ + +L+ P+DY V
Sbjct: 84 INLPPSFSTLGFYGQPNNRLLFGWVLAARDVSGN--------------SLRPPVDYIQV- 128
Query: 138 CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE--VRCVRDDLTDKCEVH 195
+ N EG AFFW P P+GY+++G VT +P KP L + + CVR DLT++ E
Sbjct: 129 INTTSMNINQEGAAFFWQPLCPNGYQAVGLYVTTSPIKPSLSQESISCVRSDLTEQSETD 188
Query: 196 HLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIA------CL 249
++ + ++ S RP NRG GV GTF C Q LNI CL
Sbjct: 189 TWVW------GTEEMTLSSLRPANRGTEATGVHTGTFSC-------QPLNIPPPPPLFCL 235
Query: 250 KNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA 309
KN L +MP+ +Q L ++Y P ++ HPDE ++ SSV WFF+NGALL++ G+
Sbjct: 236 KNTKFDLSSMPSHNQTTVLFQSYSPWIYLHPDEDFISSSVDWFFSNGALLFQKGNESNPV 295
Query: 310 -IDPSGSNLPSGGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVM 367
+ P GSNLP GG +DG FW+D P+D ++ VK G++ K+Y+H+KP GGTFTDIV+
Sbjct: 296 PVQPDGSNLPQGGSDDGLFWLDYPADKNAKEWVKRGDLGHTKVYLHIKPMFGGTFTDIVV 355
Query: 368 WVFCPFNGPGTLK-VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWV 426
W+F PFNG LK + +++ IG+H+GDWEH TLRI NF GELW YFS+HSGG V
Sbjct: 356 WIFYPFNGNARLKFLFFKSLSLGDIGEHIGDWEHVTLRISNFNGELWRAYFSEHSGGTLV 415
Query: 427 AAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYE 486
A DLE+ GNK + YSS +GHA F PG LQG + G G+RND ARSN + D+ + YE
Sbjct: 416 EACDLEFQGGNKLVSYSSLHGHAMFSKPGLVLQGDD--GNGIRNDMARSNKFFDAGVAYE 473
Query: 487 LVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPL 546
LVA G G + EP WL + RKWGP + +D + L+ I K LP ++R N ++K+P
Sbjct: 474 LVA----GPG-IQEPPWLNYFRKWGPLVPHDIQKNLEGIAKSLPGLLRKKFRNLINKIPR 528
Query: 547 ELYGEEGPTGPKEKNNWVGDE 567
E+ E+GPTGPK K +W GD+
Sbjct: 529 EVLEEDGPTGPKVKRSWTGDD 549
>gi|15232207|ref|NP_186836.1| uncharacterized protein [Arabidopsis thaliana]
gi|6091727|gb|AAF03439.1|AC010797_15 hypothetical protein [Arabidopsis thaliana]
gi|71905473|gb|AAZ52714.1| hypothetical protein At3g01870 [Arabidopsis thaliana]
gi|93007366|gb|ABE97186.1| hypothetical protein At3g01870 [Arabidopsis thaliana]
gi|332640207|gb|AEE73728.1| uncharacterized protein [Arabidopsis thaliana]
Length = 583
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 237/561 (42%), Positives = 329/561 (58%), Gaps = 51/561 (9%)
Query: 23 FSLPAPLPTWPQGQG-FASGRINLGEIEVCRIS----RFNFIWSCNLLQSKKKSATFYEP 77
F+ P+ LP P G G F GRI+LG +EV ++S +W + ++P
Sbjct: 58 FTFPSALPVIPSGGGNFGKGRIDLGGLEVIQVSISTSTSQRVWRTYEGGPDNMGLSIFQP 117
Query: 78 AGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVW 137
+P F +LG Y Q ++R L G+VL ARD++ + +L+ P+DY V
Sbjct: 118 INLPPSFSTLGFYGQPNNRLLFGWVLAARDVSGN--------------SLRPPVDYIQV- 162
Query: 138 CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE--VRCVRDDLTDKCEVH 195
+ N EG AFFW P P+GY+++G VT +P KP L + + CVR DLT++ E
Sbjct: 163 INTTSMNINQEGAAFFWQPLCPNGYQAVGLYVTTSPIKPSLSQESISCVRSDLTEQSETD 222
Query: 196 HLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIA------CL 249
++ + ++ S RP NRG GV GTF C Q LNI CL
Sbjct: 223 TWVW------GTEEMTLSSLRPANRGTEATGVHTGTFSC-------QPLNIPPPPPLFCL 269
Query: 250 KNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA 309
KN L +MP+ +Q L ++Y P ++ HPDE ++ SSV WFF+NGALL++ G+
Sbjct: 270 KNTKFDLSSMPSHNQTTVLFQSYSPWIYLHPDEDFISSSVDWFFSNGALLFQKGNESNPV 329
Query: 310 -IDPSGSNLPSGGRNDGEFWIDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVM 367
+ P GSNLP GG +DG FW+D P+D ++ VK G++ K+Y+H+KP GGTFTDIV+
Sbjct: 330 PVQPDGSNLPQGGSDDGLFWLDYPADKNAKEWVKRGDLGHTKVYLHIKPMFGGTFTDIVV 389
Query: 368 WVFCPFNGPGTLK-VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWV 426
W+F PFNG LK + +++ IG+H+GDWEH TLRI NF GELW YFS+HSGG V
Sbjct: 390 WIFYPFNGNARLKFLFFKSLSLGDIGEHIGDWEHVTLRISNFNGELWRAYFSEHSGGTLV 449
Query: 427 AAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYE 486
A DLE+ GNK + YSS +GHA F PG LQG + G G+RND ARSN + D+ + YE
Sbjct: 450 EACDLEFQGGNKLVSYSSLHGHAMFSKPGLVLQGDD--GNGIRNDMARSNKFFDAGVAYE 507
Query: 487 LVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPL 546
LVA G G + EP WL + RKWGP + +D + L+ I K LP ++R N ++K+P
Sbjct: 508 LVA----GPG-IQEPPWLNYFRKWGPLVPHDIQKNLEGIAKSLPGLLRKKFRNLINKIPR 562
Query: 547 ELYGEEGPTGPKEKNNWVGDE 567
E+ E+GPTGPK K +W GD+
Sbjct: 563 EVLEEDGPTGPKVKRSWTGDD 583
>gi|296080941|emb|CBI18663.3| unnamed protein product [Vitis vinifera]
Length = 453
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 231/549 (42%), Positives = 305/549 (55%), Gaps = 124/549 (22%)
Query: 22 TFSLPAPLPTWPQGQGFASGRINLGE-IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGI 80
+F LPAP+PTWP G+GFA G I+LG +EVC+IS F +W+ + ATF+EP+ +
Sbjct: 25 SFKLPAPMPTWPPGEGFAGGTIDLGGGLEVCQISSFTKVWATHEGGPDNLGATFFEPSPM 84
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
P+ F LG Y Q + +PL G+VL +D +T+N PLD
Sbjct: 85 PEEFCMLGCYSQPNKKPLFGWVLAGKD---------NTNN---------PLD-------- 118
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFD 200
G + WLP PPDGYK++G ++T P KP LD++RCVR DLTD CE
Sbjct: 119 --------GNGYVWLPTPPDGYKAVGHVITNLPEKPSLDKIRCVRSDLTDLCEAD----- 165
Query: 201 AISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMP 260
+RG L N+ L+AMP
Sbjct: 166 -----------------TDRGA------------------------QALANISSNLYAMP 184
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPS 319
N Q+ AL++ Y P V+ HPDE YLPSSVSWFF NGALLY+ G+ IDP+GSNLP
Sbjct: 185 NLAQVDALVQAYSPWVYLHPDEQYLPSSVSWFFQNGALLYQQGEESKPVNIDPTGSNLPQ 244
Query: 320 GGRNDGEFWIDLPSDG-GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
GG NDG +W+DLP D ++ VK G+++ + Y+H+KP +G TFTDI +W+F PFNGP
Sbjct: 245 GGSNDGAYWLDLPVDAKAKETVKKGDLQDSMTYMHIKPMLGATFTDIAIWIFYPFNGPAR 304
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNK 438
KV ++N++ KIG+HVGDWEH TLR+ NF G+L +++FS+HS G WV A +LE+ GNK
Sbjct: 305 AKVELINISLGKIGEHVGDWEHVTLRVSNFNGKLMNVFFSEHSSGTWVNASELEFQNGNK 364
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV 498
+ Y+S +GHA +P PG LQGS +GI RND A+S + +D+ +Y +VAAEYLG VV
Sbjct: 365 VVSYASLHGHAFYPKPGLVLQGSGAMGI--RNDTAKSKMVMDTGTRYIVVAAEYLGSAVV 422
Query: 499 AEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPK 558
EP WL + RKWGP I PTGPK
Sbjct: 423 -EPPWLNYYRKWGPKIR--------------------------------------PTGPK 443
Query: 559 EKNNWVGDE 567
K NW GDE
Sbjct: 444 MKKNWDGDE 452
>gi|15232208|ref|NP_186837.1| uncharacterized protein [Arabidopsis thaliana]
gi|6091728|gb|AAF03440.1|AC010797_16 hypothetical protein [Arabidopsis thaliana]
gi|6513943|gb|AAF14847.1|AC011664_29 hypothetical protein [Arabidopsis thaliana]
gi|332640208|gb|AEE73729.1| uncharacterized protein [Arabidopsis thaliana]
Length = 592
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 237/558 (42%), Positives = 326/558 (58%), Gaps = 39/558 (6%)
Query: 22 TFSLPAPLPTWP-QGQGFASGRINLGEIEVCRISRFN----FIWSCNLLQSKKKSATFYE 76
+F P+PLP+ P G F I++G +EV +IS N +W + +E
Sbjct: 61 SFKFPSPLPSMPSDGGNFGKRSIDMGGLEVTQISISNSTSHRVWRTYEGGPDNMGVSIFE 120
Query: 77 PAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLV 136
P IP F+ LG Y Q ++R L G++LVA+D++ S L+ P+DYT V
Sbjct: 121 PTTIPRNFFKLGFYAQPNNRQLFGWILVAKDVSGSN--------------LRPPVDYTEV 166
Query: 137 WCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL--DEVRCVRDDLTDKCEV 194
+ EG A+FW P P+GY ++G VT +P KP L + + CVR DLT++ E
Sbjct: 167 -GNTTTLLIKQEGPAYFWQPLCPNGYHAVGLYVTTSPMKPSLGQNSISCVRSDLTEQSEA 225
Query: 195 HHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDP 254
++ ++ S RP RG+ GV GTF C + CLKN
Sbjct: 226 DTWVWRI------KDMTISSLRPATRGVEATGVFTGTFSCKQLNFLPHPPPLFCLKNTKF 279
Query: 255 KLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAG-DLVGEAIDPS 313
L +MP+ +Q L + Y P ++ HP E +LPSSV+W F NGALL+K G + + I P+
Sbjct: 280 DLSSMPSENQTRVLFKTYSPWIYLHPKEDFLPSSVNWVFANGALLHKKGNESIPVPIHPN 339
Query: 314 GSNLPSGGRNDGEFWID-LPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCP 372
GSNLP GG ND FW+D L R+ VK G++ES K+Y+H+KP G TFTDIV+W+F P
Sbjct: 340 GSNLPQGGCNDDLFWLDYLVDKKAREKVKRGDLESTKVYLHIKPMFGATFTDIVVWLFFP 399
Query: 373 FNGPGTLK-VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDL 431
+NG LK + I +++ IG+HVGDWEH TLRI NF GELW +YFS+HSGG V A DL
Sbjct: 400 YNGNAHLKFLFIKSLSLGNIGEHVGDWEHVTLRISNFNGELWRVYFSEHSGGTLVDACDL 459
Query: 432 EYIEG-NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAA 490
E+++G NK +VYSS +GHA F PG LQG GI RND ARS+ D+ I YE++A
Sbjct: 460 EFMQGGNKPVVYSSLHGHAMFSKPGVVLQGGGKSGI--RNDMARSDKCFDAGIGYEVIA- 516
Query: 491 EYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYG 550
G GVV EP WL + RKWGP + Y L+ + K+LP+ +R + ++K+PLE+ G
Sbjct: 517 ---GPGVV-EPPWLNYFRKWGPRVHYRIDIFLNSVAKILPIFLRKGLRKLINKIPLEMRG 572
Query: 551 EEGPTGPKEKNNWVGDER 568
++GPTGPK K W GDE+
Sbjct: 573 QDGPTGPKVKVTWTGDEQ 590
>gi|297832744|ref|XP_002884254.1| hypothetical protein ARALYDRAFT_340259 [Arabidopsis lyrata subsp.
lyrata]
gi|297330094|gb|EFH60513.1| hypothetical protein ARALYDRAFT_340259 [Arabidopsis lyrata subsp.
lyrata]
Length = 590
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 238/561 (42%), Positives = 328/561 (58%), Gaps = 41/561 (7%)
Query: 12 VNNMSPTEPGTFSLPAPLPTWP-QGQGFASGRINLGEIEVCRISRFN----FIWSCNLLQ 66
++N P E TF P+ LP+ P G F I+LG +EV ++S N +W
Sbjct: 52 LSNGLPVET-TFKSPSLLPSMPSDGGNFGKRSIDLGGLEVTQLSISNSTSHRVWRTYEGG 110
Query: 67 SKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPA 126
+ +EP +P F LG Y Q ++R L G++LVA+D+A S
Sbjct: 111 PNNMGVSIFEPTTLPRNFLKLGFYAQPNNRQLFGWILVAKDVAGSN-------------- 156
Query: 127 LQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL--DEVRCV 184
L+ P+DYT V + +G A+FW P P+GY+++G VT +P KP L D + CV
Sbjct: 157 LRPPVDYTEVGNTTSLISIKQDGPAYFWQPLCPNGYQAVGLYVTTSPVKPSLGQDSINCV 216
Query: 185 RDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQEL 244
R +LT+K E ++ ++ S RP RG+ GV GTF + + L
Sbjct: 217 RSELTEKSEADTWVWRI------KDMTISSLRPATRGVEATGVYTGTFSFKNLKLLPPPL 270
Query: 245 NIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGD 304
CLKN+ L +MP+ +Q L + Y P ++ HP E +LPSSV W F NGALLY+ G+
Sbjct: 271 --FCLKNIKFDLSSMPSENQTRVLFQTYSPWIYLHPQEDFLPSSVDWVFANGALLYQKGN 328
Query: 305 LVGEA-IDPSGSNLPSGGRNDGEFWID-LPSDGGRQIVKHGNMESAKLYVHVKPAVGGTF 362
I P+GSNLP GG ND FW+D L + R+ VK G++ S K+Y+H+KP G TF
Sbjct: 329 ESNPVLIHPNGSNLPQGGCNDDLFWLDYLVDEKAREKVKRGDLGSTKVYLHIKPMFGATF 388
Query: 363 TDIVMWVFCPFNGPGTLK-VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
TDIV+W+F P+NG LK + I +++ IG+HVGDWEH TLRI NF GELW +YFS+HS
Sbjct: 389 TDIVVWLFFPYNGNAHLKFLFIKSMSLGNIGEHVGDWEHVTLRISNFNGELWRVYFSEHS 448
Query: 422 GGKWVAAYDLEYIE-GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVD 480
GG V A DLE+I+ GNK +VYSS +GHA F PG LQG GI RND ARS+ + D
Sbjct: 449 GGTLVDACDLEFIQGGNKPVVYSSLHGHAMFSKPGVVLQGGGKSGI--RNDMARSDKFFD 506
Query: 481 SSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENA 540
+SI YE++A G GVV EP WL + RKWGP + Y+ L+ + K+LP+ +R +
Sbjct: 507 ASIGYEVIA----GPGVV-EPPWLNYFRKWGPRVHYNIDKFLNSVAKILPIFLRKGLRKL 561
Query: 541 VSKLPLELYGEEGPTGPKEKN 561
++K+PLE+ G+ GPTGPK KN
Sbjct: 562 INKIPLEVLGQNGPTGPKVKN 582
>gi|388493228|gb|AFK34680.1| unknown [Lotus japonicus]
Length = 256
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 109/253 (43%), Positives = 157/253 (62%), Gaps = 15/253 (5%)
Query: 87 LGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGN 146
LG Y Q +++PL G++LVA+D++S+ S AL+KPLDYTLVW S + +
Sbjct: 2 LGSYGQPNNQPLFGWILVAKDVSST------------SSALKKPLDYTLVWNS-ASVKVS 48
Query: 147 YEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFS 206
+ + WLP+ PDGYK++G +VT TP+KP LD+++CVR DLT++CE + I+
Sbjct: 49 QDSPGYVWLPKAPDGYKALGHVVTTTPDKPSLDKIKCVRQDLTEQCEAYSWIWGTGGDSD 108
Query: 207 SSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIH 266
+ F+ ++ RP NRG GV VG F + N + L+I CLKN + +MPN QI
Sbjct: 109 PNSFNFYAVRPSNRGTQALGVGVGA-FVAQNGGTNSSLSITCLKNTNAISKSMPNLKQIG 167
Query: 267 ALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGRNDG 325
AL++ Y P ++ HPDE + PSSV WFF+NGALLY+ G I P+G+NLP DG
Sbjct: 168 ALLQTYSPILYLHPDEEFQPSSVDWFFSNGALLYQRGKESNPVKIAPNGTNLPQDPHTDG 227
Query: 326 EFWIDLPSDGGRQ 338
+W+DLP+D +
Sbjct: 228 AYWLDLPADADNK 240
>gi|302143566|emb|CBI22319.3| unnamed protein product [Vitis vinifera]
Length = 350
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 115/259 (44%), Positives = 145/259 (55%), Gaps = 21/259 (8%)
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
K + +W ++ G G F+ PDG+ S+G PN L V D
Sbjct: 52 KITQFERIWSCNQL-HGKTTGVTFYKPVGIPDGFFSLGHYCQ--PNDQALRGYVLVAQDA 108
Query: 189 TDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNIAC 248
+ K S PF VW+ RPC RGMLG VSVGTFFCS++ +E NIAC
Sbjct: 109 AA----------SRLKVGSYPFIVWNARPCKRGMLGNSVSVGTFFCSTHPSPDEEPNIAC 158
Query: 249 LKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGE 308
LKNLD LHAMPN QIHALI++YG TVFFHPD++Y+PSSV WFF NGAL+Y+ G L G+
Sbjct: 159 LKNLDSTLHAMPNLKQIHALIKHYGATVFFHPDDIYMPSSVPWFFKNGALVYQNGKLEGK 218
Query: 309 AIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMW 368
ID GSNLPSGG+NDGEFWIDLP D + ++ K V+ +F ++
Sbjct: 219 PIDSRGSNLPSGGKNDGEFWIDLPDDDDERTY----LKRNKPVVYSSKGGHASFPHPGIY 274
Query: 369 VFCPFNGPGTLKVGIMNVA 387
V G L +G+ N A
Sbjct: 275 V----QGSSKLGIGVRNDA 289
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 69/112 (61%), Positives = 85/112 (75%)
Query: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60
MFGC+CF WN + P EP FSLPAP+P WP+GQGFA+G+INLGEI V +I++F IW
Sbjct: 1 MFGCECFCWNRDSQYRPLEPQPFSLPAPIPEWPRGQGFATGQINLGEIVVLKITQFERIW 60
Query: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
SCN L K TFY+P GIPDGF+SLGHYCQ + + LRG+VLVA+D A+S
Sbjct: 61 SCNQLHGKTTGVTFYKPVGIPDGFFSLGHYCQPNDQALRGYVLVAQDAAASR 112
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/157 (48%), Positives = 88/157 (56%), Gaps = 45/157 (28%)
Query: 421 SGGK-----WVAAYDLE----YIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRND 471
SGGK W+ D + Y++ NK +VYSSK GHASFPHPG Y+QGS LGIGVRND
Sbjct: 229 SGGKNDGEFWIDLPDDDDERTYLKRNKPVVYSSKGGHASFPHPGIYVQGSSKLGIGVRND 288
Query: 472 AARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPL 531
AARS ++DSS Y++ KII LLP+
Sbjct: 289 AARSKFFIDSSTNYQI------------------------------------KIISLLPV 312
Query: 532 MIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDER 568
R+SVEN P ELYGEEGPTGPKEKNNWV DER
Sbjct: 313 FFRFSVENIFDLFPTELYGEEGPTGPKEKNNWVEDER 349
>gi|383131849|gb|AFG46743.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131869|gb|AFG46753.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131877|gb|AFG46757.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
Length = 139
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 81/139 (58%), Positives = 110/139 (79%), Gaps = 5/139 (3%)
Query: 367 MWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWV 426
+W+F P NGP T+KVG +N+ K G+HVGDWEHFTLR+ NFTGELW +YFSQHSGG+WV
Sbjct: 2 IWLFYPLNGPITVKVGALNMPL-KYGEHVGDWEHFTLRVSNFTGELWKVYFSQHSGGQWV 60
Query: 427 AAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYE 486
A DLE+IEGN+ VY++K+GHA+FPH G +L+G LG+G+RNDA+RS ++D+S +Y+
Sbjct: 61 NASDLEHIEGNRIAVYAAKSGHATFPHAGNFLEGDRKLGVGIRNDASRSKYFLDTSRKYQ 120
Query: 487 LVAAEYL----GEGVVAEP 501
+VAAE+L + +V EP
Sbjct: 121 IVAAEHLEALGSKDIVVEP 139
>gi|383131847|gb|AFG46742.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131851|gb|AFG46744.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131853|gb|AFG46745.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131855|gb|AFG46746.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131857|gb|AFG46747.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131859|gb|AFG46748.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131861|gb|AFG46749.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131863|gb|AFG46750.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131865|gb|AFG46751.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131867|gb|AFG46752.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131871|gb|AFG46754.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131873|gb|AFG46755.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131875|gb|AFG46756.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131879|gb|AFG46758.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
gi|383131881|gb|AFG46759.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
Length = 139
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 81/139 (58%), Positives = 110/139 (79%), Gaps = 5/139 (3%)
Query: 367 MWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWV 426
+W+F P NGP T+KVG +N+ K G+HVGDWEHFTLR+ NFTGELW +YFS+HSGG+WV
Sbjct: 2 IWLFYPLNGPITVKVGALNMPL-KYGEHVGDWEHFTLRVSNFTGELWKVYFSRHSGGQWV 60
Query: 427 AAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYE 486
A DLE+IEGNK VY++K+GHA+FPH G +L+G LG+G+RNDA+RS ++D+S +Y+
Sbjct: 61 NASDLEHIEGNKIAVYAAKSGHATFPHAGNFLEGDRKLGVGIRNDASRSKYFLDTSKKYQ 120
Query: 487 LVAAEYL----GEGVVAEP 501
+VAAE+L + +V EP
Sbjct: 121 IVAAEHLEALGSKDIVVEP 139
>gi|361067189|gb|AEW07906.1| Pinus taeda anonymous locus 0_14113_01 genomic sequence
gi|383157398|gb|AFG61041.1| Pinus taeda anonymous locus 0_14113_01 genomic sequence
gi|383157400|gb|AFG61042.1| Pinus taeda anonymous locus 0_14113_01 genomic sequence
gi|383157402|gb|AFG61043.1| Pinus taeda anonymous locus 0_14113_01 genomic sequence
gi|383157404|gb|AFG61044.1| Pinus taeda anonymous locus 0_14113_01 genomic sequence
Length = 138
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 83/138 (60%), Positives = 109/138 (78%), Gaps = 4/138 (2%)
Query: 435 EGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYL- 493
EGNKAIVYSSK+GHASFPHPG +LQG G+G+RNDAA+S +D+S +Y++VAAEY+
Sbjct: 1 EGNKAIVYSSKSGHASFPHPGDFLQGDSKRGVGIRNDAAQSKYALDTSKKYQIVAAEYMQ 60
Query: 494 ---GEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYG 550
+ +EP WLQ+MR+WGPTIVY+S+ E+ KI+K LP +R++VE + ++P EL G
Sbjct: 61 SLPSHDIPSEPCWLQYMREWGPTIVYNSEAEIRKILKYLPSKLRHAVEEILDRMPYELGG 120
Query: 551 EEGPTGPKEKNNWVGDER 568
EEGPTGPKEK+NW GDER
Sbjct: 121 EEGPTGPKEKDNWEGDER 138
>gi|361068175|gb|AEW08399.1| Pinus taeda anonymous locus 2_8324_01 genomic sequence
Length = 139
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 78/139 (56%), Positives = 109/139 (78%), Gaps = 5/139 (3%)
Query: 367 MWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWV 426
+W+F P NGP T+KVG +N+ K G+H+GDWEHFTLR+ NFTGELW +YFSQHSGG+WV
Sbjct: 2 IWLFYPLNGPITVKVGALNMPL-KYGEHIGDWEHFTLRVSNFTGELWKVYFSQHSGGQWV 60
Query: 427 AAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYE 486
DLE+IEGN+ VY++K+GHA+FPH G +L+G LG+G+RNDA+RS ++D+S +Y+
Sbjct: 61 NTSDLEHIEGNRIAVYAAKSGHATFPHAGNFLEGDRKLGVGIRNDASRSKYFLDTSRKYQ 120
Query: 487 LVAAEYL----GEGVVAEP 501
+V+AE+L + +V EP
Sbjct: 121 IVSAEHLEALGSKDIVVEP 139
>gi|361127506|gb|EHK99473.1| hypothetical protein M7I_4630 [Glarea lozoyensis 74030]
Length = 439
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/321 (32%), Positives = 155/321 (48%), Gaps = 53/321 (16%)
Query: 232 FFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSW 291
+ + I E+ IA + L K I A I+ YGP ++ HP++ YLP SV W
Sbjct: 103 LYLNQKLIKIIEIRIAMSQGLSRK--------AIEAAIQKYGPVIYTHPEDQYLPCSVEW 154
Query: 292 FFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLY 351
F T+ L+ + + P + LP+G + +++D+ VK GNMESAK Y
Sbjct: 155 FLTHCTLVESKAP-GSKTVHPLETQLPTGPKEGTRWYLDIEDS-----VKPGNMESAKAY 208
Query: 352 VHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIM---------NVAFSKIGQHVGDWEHFT 402
WVF +NG GT K + NV + +G+HVGDWE+
Sbjct: 209 ---------------FWVFSAYNGHGTAKFDSLVFNKVERTGNVNLAPLGEHVGDWEYVG 253
Query: 403 LRICNFTGELWSIYFSQHSGG----KWVAAYDLEYIEGNKAIVYSSKNGHASFPHPG-TY 457
+R+ N T EL +I S+H K + + I+YSS NGHA+FP G Y
Sbjct: 254 IRVDNTTQELIAIILSEHGKNIVFDKAAITKSFTFQDTTHPIIYSSLNGHANFPSIGPNY 313
Query: 458 LQGSEILGI------GVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWG 511
+ ++LGI + N A +++S++Y+LV A YL E V P+W+ + +WG
Sbjct: 314 TEHRKVLGIPFGLEFNLLNTTAAGGPSINTSLKYQLVNAPYLTEDKVVSPAWVGYPYRWG 373
Query: 512 P---TIVYDSKTELDKIIKLL 529
P I D+KT L +IIK++
Sbjct: 374 PEGTAINMDAKT-LGEIIKVV 393
>gi|444919734|ref|ZP_21239705.1| hypothetical protein D187_03833 [Cystobacter fuscus DSM 2262]
gi|444708028|gb|ELW49153.1| hypothetical protein D187_03833 [Cystobacter fuscus DSM 2262]
Length = 1003
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 109/442 (24%), Positives = 175/442 (39%), Gaps = 60/442 (13%)
Query: 47 EIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVAR 106
+ V +S FN+++ + + + G G+YSLG P G V
Sbjct: 533 RLSVQAVSAFNWVYDDWDTGAHSDVSIWRPNLGQTPGYYSLGDI----GMPHHGEVPRMT 588
Query: 107 DLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMG 166
+ S E + L +P+DYT +W D+G G ++ A FW P PP GY +G
Sbjct: 589 FVVSGEGD-----------VLARPVDYTWIW-DDKGTGGEHD--ASFWHPVPPAGYTCLG 634
Query: 167 FLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRG 226
+ KP D +RCV+++ + ++D ++ ++W + L
Sbjct: 635 SVANLGYGKPSTDLIRCVKNEYVLPANI-GWVWDDSGSGGNNDITLWQANARDHRGLSTS 693
Query: 227 VSVGT-FFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYL 285
VG ++ + LN + L N P+L D + AL Y P ++ H DE Y
Sbjct: 694 TMVGQGYYGDPDGGRFWALNKSALAN--PELQGG-FVDDLSAL--QYAPRIWLHNDEYYW 748
Query: 286 PSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGR------NDGEFWIDLPSDGGRQI 339
PSS +F N + P G++L + + +F DG R
Sbjct: 749 PSSTEFFLPN--------------VHPDGTHLWTNEALGCDSCTNPQFL-----DGQRPD 789
Query: 340 VKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVA------FSKIGQ 393
H + A++ + V TDI+ W F P+N + +G+ +S G
Sbjct: 790 QTHVPVY-AQIVTRTQGGVATNITDIIYWTFYPYNNGKRVCIGLYTSGGGCAGFYSTFGN 848
Query: 394 HVGDWEHFTLRICNFTGELWSIYFSQHS-GGKWVAAYDLEYIEGNKAIVYSSKNGHASFP 452
HVGDWEH T+R + G +Y SQHS GG + G YS+ H +P
Sbjct: 849 HVGDWEHLTVRFVD--GRPSQVYMSQHSQGGTFTFGNKDMASVGFHPEAYSALGSHGLYP 906
Query: 453 HPGTYLQGSEILGIGVRNDAAR 474
+ G + +D +R
Sbjct: 907 DAARHTYERIFNGDTLNDDTSR 928
>gi|115375164|ref|ZP_01462431.1| hypothetical protein STIAU_1036 [Stigmatella aurantiaca DW4/3-1]
gi|310823135|ref|YP_003955493.1| hypothetical protein STAUR_5905 [Stigmatella aurantiaca DW4/3-1]
gi|115367815|gb|EAU66783.1| hypothetical protein STIAU_1036 [Stigmatella aurantiaca DW4/3-1]
gi|309396207|gb|ADO73666.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 589
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 95/356 (26%), Positives = 150/356 (42%), Gaps = 34/356 (9%)
Query: 127 LQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRD 186
L +P+ YT +W SD G G ++ FW P P GY +G + + + P + +RCV+
Sbjct: 185 LARPIGYTWIW-SDWGSGGTHD--VSFWAPVAPSGYTCLGSVAVQGYSAPSPELIRCVKS 241
Query: 187 DLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQELNI 246
+ + + D+ S + ++W P + LG V N +G+ +
Sbjct: 242 EYVLQASSGWVWNDSGSG-ADYDIALWQANPRDHRSLGASTLVAQGH-HGNPEAGRFWAL 299
Query: 247 ACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV 306
P+L P D AL Y P ++ H +E Y PSSV +F N V
Sbjct: 300 NKSATAHPELQGTP-VDATTAL--QYAPRIWLHHEEYYFPSSVEFFLPN----------V 346
Query: 307 GEAIDPSGSNLPSGGRN--DGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTD 364
EA +N P G + D +F DG R H A++ + + V TD
Sbjct: 347 HEAQGYLVTNQPLGCDSCTDPQF-----LDGQRPDQTH-VPAYAQIVIRTQGGVPTNITD 400
Query: 365 IVMWVFCPFNGPGTLKVGIMNV-----AFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
++ W F P+N + +G A+S G HVGDWEH T+R + G +Y SQ
Sbjct: 401 VIYWSFYPYNNGKRVCIGWYTSLGCVGAYSTFGNHVGDWEHLTVRFVD--GRPSQVYMSQ 458
Query: 420 HSGGKWVAAYDLEY-IEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR 474
H+ G+ D ++G YS+K H +P + + G + +D +R
Sbjct: 459 HANGQTFTFGDKAVALDGWHPEAYSAKGSHGLYPDAARHTYETLFNGDTLNDDTSR 514
>gi|15894080|ref|NP_347429.1| hypothetical protein CA_C0793 [Clostridium acetobutylicum ATCC 824]
gi|337736009|ref|YP_004635456.1| hypothetical protein SMB_G0809 [Clostridium acetobutylicum DSM
1731]
gi|384457518|ref|YP_005669938.1| hypothetical protein CEA_G0804 [Clostridium acetobutylicum EA 2018]
gi|15023680|gb|AAK78769.1|AE007594_6 Uncharacterized conserved protein of probably eukaryotic origin
[Clostridium acetobutylicum ATCC 824]
gi|325508207|gb|ADZ19843.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
gi|336290309|gb|AEI31443.1| hypothetical protein SMB_G0809 [Clostridium acetobutylicum DSM
1731]
Length = 348
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 127/283 (44%), Gaps = 34/283 (12%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDP--SGSNLPSGGRNDG- 325
I Y P V+FHPDE P +V F + D++ E + G + SG D
Sbjct: 42 IIKYAPIVYFHPDEKCFPITVEEFL-------ECTDVMNEKNEKFCDGKEILSGSFKDAL 94
Query: 326 ---EFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT---L 379
+F++ + V+ GN+ SAK YV+V+ G + D+ + F +NGP L
Sbjct: 95 GNQKFYLKITDTS----VREGNLNSAKCYVYVRKRNKGRYLDLQYFFFYGYNGPTINIHL 150
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA 439
N + S IG++ G+WEH T+ I + G + F QH + LEY +G+
Sbjct: 151 NKNKKNYSTSNIGENYGNWEHVTVVIDSTDGSILGAIFDQHGKSCFYDIDQLEYEKGH-I 209
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGV-----------RNDAARSNLYVDSSIQYELV 488
+VYS+ + HAS+P G + + +GV +DA D+S+ +LV
Sbjct: 210 VVYSALSSHASYPSAGKSVTEYDSQELGVPPVSVGIVKFELHDATDKGRRWDTSLVCQLV 269
Query: 489 AAEYLGEGVVAEPSWLQFMRKWG-PTIVYDSKTELDKIIKLLP 530
+ G V EP W+ F +WG P + D + I LP
Sbjct: 270 SYNITGTE-VEEPKWMAFNGRWGKPYDLTDKDISILYQISRLP 311
>gi|383455089|ref|YP_005369078.1| hypothetical protein COCOR_03102 [Corallococcus coralloides DSM
2259]
gi|380729001|gb|AFE05003.1| hypothetical protein COCOR_03102 [Corallococcus coralloides DSM
2259]
Length = 597
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 114/483 (23%), Positives = 181/483 (37%), Gaps = 73/483 (15%)
Query: 48 IEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARD 107
+EV SRF +++ + + A + GF+SLG P RG
Sbjct: 133 LEVRPTSRFTWVYDDHGTGATNDIAVWRPDLSQTPGFFSLGDVAM----PNRGQAPATTF 188
Query: 108 LASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGF 167
+ E + L +P +Y W D+ G G +F W P P GY +G
Sbjct: 189 VVRGEGD-----------LLARPSNYNWTW--DDSGSGGTHDVSF-WEPVAPAGYTCLGH 234
Query: 168 LVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGV 227
+ +KP D +RCVR + ++D + VW + RG+
Sbjct: 235 VAVLGYSKPSTDLIRCVRSEYVLPANP-AWVWDDRGSGADDDIGVWQAVARDH----RGL 289
Query: 228 SVGTFFCSSN---------WISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFF 278
TF + W+ LN + N + L +P Q + + P V+
Sbjct: 290 PASTFVSRPSHGDTGGNRYWV----LNKSATSNAE--LRGLPVDAQT---VAAFAPRVWL 340
Query: 279 HPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQ 338
HPDE Y PSS + N + + G LV + L D +F DG R
Sbjct: 341 HPDEAYFPSSTQFHLAN--VHEENGHLV------TNQALGCDSCTDPQFL-----DGQRP 387
Query: 339 IVKHGNMESAKLYVHVKPAVGG----TFTDIVMWVFCPFNGPGTLKVGIMNV-----AFS 389
N +Y V P G TD++ W F P+N + +G + +S
Sbjct: 388 -----NQTPVPVYAQVIPRTQGGLPTNVTDVLYWNFYPYNNGKRVCIGWYSPWGCVGGYS 442
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE-YIEGNKAIVYSSKNGH 448
G HVGDWEH T+R + G +Y SQH+ G+ D ++ G V+S+ H
Sbjct: 443 TFGNHVGDWEHLTVRFID--GRPAQVYLSQHASGQTFTFGDKAVFLAGWHPEVFSANGSH 500
Query: 449 ASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMR 508
+P ++ + G + +D + L D+ L+ + +G +W+
Sbjct: 501 GLYPDAARHIYETIFNGDFLADDTG-AGLAWDTWNNVVLIPWQPVGT-YTGSLAWMNLTA 558
Query: 509 KWG 511
WG
Sbjct: 559 YWG 561
>gi|384219504|ref|YP_005610670.1| hypothetical protein BJ6T_58270 [Bradyrhizobium japonicum USDA 6]
gi|354958403|dbj|BAL11082.1| hypothetical protein BJ6T_58270 [Bradyrhizobium japonicum USDA 6]
Length = 639
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 146/337 (43%), Gaps = 53/337 (15%)
Query: 268 LIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEF 327
L+ N+ P + DEV LPSS W+ G + Y+A G P N + ++ +
Sbjct: 9 LVSNFAPILHLDKDEVNLPSSTDWYI--GQVGYQAAG--GIYSGPGQWNWGTAEQSSPDD 64
Query: 328 WIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGT-FTDIVMWVFCPFNGPGTLKV----G 382
++ P ++K G + +A YVHV P T D+ W+F P+NG T +V G
Sbjct: 65 YLVAPQAIWASLMK-GYLPTATSYVHVLPVDDRTDMVDLQYWLFYPYNGQETFEVTGWTG 123
Query: 383 IMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD-LEYIEGNKAIV 441
A + H GDWE T+R+ N ++ ++FSQHSGG+W + D + G V
Sbjct: 124 STGQALMPLSIHWGDWECVTVRV-NIDQQVVGVFFSQHSGGQWCSPNDGFLSLSGTHPNV 182
Query: 442 YSSKNGHASFPHP--GTYLQGSEILGIGVRN------DAARS-----NLYVDSSIQYELV 488
Y + HA++P TY EI G+ V + D AR NL +
Sbjct: 183 YVASGSHANYPASSNSTY----EIGGVDVWDVRFAVADYARGGGPTVNLVTPNPPVVIQN 238
Query: 489 AAEYLGEGVVAEPSWLQFMRKWG----PTIVYDSKTELDKIIKL------LPLMIRYSVE 538
A+ + V P WL F +WG PT+ + E+ ++ L +++ +V+
Sbjct: 239 DAQAILPTPVTPPPWLAFKGRWGQPIEPTL---TDAEIGSLVAAAANGMQLNVLVEEAVQ 295
Query: 539 NAVSKLPLELYGEE-----------GPTGPKEKNNWV 564
L+ + E GPTGP +K +W
Sbjct: 296 YVAGSTTLQAWLSEVILPIAKGDLTGPTGPSQKGSWA 332
>gi|374308997|ref|YP_005055427.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358751007|gb|AEU34397.1| protein of unknown function DUF946 [Granulicella mallensis
MP5ACTX8]
Length = 368
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 151/352 (42%), Gaps = 69/352 (19%)
Query: 267 ALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRN--- 323
+ I+ Y PT++ + + P+SV FF+ L+ G+++ A+ P+ + N
Sbjct: 32 STIQQYAPTIYLNAYDNNHPTSVENFFSQSNLVDSNGNVITTAVTPASLAANTTSTNYLV 91
Query: 324 --DGEFWI---DLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
+G + D S + NM V+VK GT+ DI ++F +NG
Sbjct: 92 PSNGVYPTPTNDFESGDAAVASSNANMGQVNSPVYVKVLDFGTYIDIKYFLFYTWNGFQP 151
Query: 379 LKVGIM--------NVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
+VG++ N+ ++ H GDWEH T+RI L +++SQHSG +WV
Sbjct: 152 FQVGVITNFKTEPYNLDWAGFALHYGDWEHVTVRITEDQSSLIGVFYSQHSGNQWVTDPP 211
Query: 431 LEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEIL----------GIGVRNDAAR------ 474
L +G IVYS + HA++P G +Q ++I+ + V ++ A
Sbjct: 212 L---DGTHPIVYSGWDSHANYPTAG--IQVNDIILNSPGLPPVSWVKVVDNTANTGTFTV 266
Query: 475 ----SNLYVDSSI--QYELVAAEYLGEGVVAEPSWLQFMRKWGPTI---VYD-------S 518
+N Y + I ++ + L + WL F WGP + +YD +
Sbjct: 267 YHKPTNFYSNGVIWTPWQNTSQLVLLDNNATAAQWLSFNGHWGPALTSTIYDPPSLPSGA 326
Query: 519 KTELDKIIK---LLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKNNWVGDE 567
TELD + K + + Y+ EN GP GP+ ++ ++ E
Sbjct: 327 TTELDVLAKGANFIGALSSYTNEN-------------GPLGPETQDWYISTE 365
>gi|77456557|ref|YP_346062.1| hypothetical protein Pfl01_0329 [Pseudomonas fluorescens Pf0-1]
gi|77380560|gb|ABA72073.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1]
Length = 462
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 104/259 (40%), Gaps = 15/259 (5%)
Query: 43 INLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPD---GFYSLGHYCQFDSRPLR 99
I G + + S F+ IW S +F+ PA PD G++ LG +
Sbjct: 17 IRFGNLLINFTSEFHRIWDSR--GSGSAVGSFWRPAPAPDLLPGYFPLGDVAVTGYENVN 74
Query: 100 GFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPP 159
G +VA + E +G S +S AL P DY VW + G C W P PP
Sbjct: 75 GNRIVAV-VREGEPQGDAPS---RSNALSPPTDYKRVW--KDANSGAAADCTV-WRPIPP 127
Query: 160 DGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCN 219
GY +MG + + +KP L+ +RCVR+DL V LI+D + FS W P
Sbjct: 128 PGYVAMGLVCSNGRDKPLLNAIRCVREDLVMPATVGDLIWDDKGSGARQNFSAWDIEPAQ 187
Query: 220 RGMLGRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFH 279
+ GTFF + + L+ P+ ++P + + YG
Sbjct: 188 AAAGEIHFAAGTFFGVQSHSKPVNAIVHALRMQLPR-QSIPAPEAPE--LSGYGAPPELA 244
Query: 280 PDEVYLPSSVSWFFTNGAL 298
P +V + WF N L
Sbjct: 245 PAKVTQTVRIPWFAVNDQL 263
>gi|398865339|ref|ZP_10620860.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM78]
gi|398243657|gb|EJN29240.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM78]
Length = 433
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 82/185 (44%), Gaps = 17/185 (9%)
Query: 54 SRFNFIWSCNLLQSKKKSATFYEPAGIPD---GFYSLGHYCQ--FDSRPLRGFVLVARDL 108
+ F IW SK K A F+ P+ PD GF+ LG D+ + V+V
Sbjct: 28 TEFRRIWDST--GSKSKPAAFWRPSPAPDLLPGFFPLGDVVASGLDNINEKRMVMVV--- 82
Query: 109 ASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFL 168
EG +S+ K PAL+ P+D+ VW + G G C+ W P P GY ++G L
Sbjct: 83 ----CEGELSSDSGKEPALRAPIDFEQVWS--DAGSGARANCSI-WQPIAPSGYVALGQL 135
Query: 169 VTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVS 228
+ +KP ++RCVR DL + LI+D + FS W P + +
Sbjct: 136 CSNGRDKPLPSDIRCVRRDLVTPSFISDLIWDDKGSGARQAFSAWGIYPPDTAAGEILCA 195
Query: 229 VGTFF 233
GTF
Sbjct: 196 PGTFI 200
>gi|424920927|ref|ZP_18344288.1| hypothetical protein I1A_000355 [Pseudomonas fluorescens R124]
gi|404302087|gb|EJZ56049.1| hypothetical protein I1A_000355 [Pseudomonas fluorescens R124]
Length = 434
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 81/167 (48%), Gaps = 12/167 (7%)
Query: 54 SRFNFIWSCNLLQSKKKSATFYEPAGIPD---GFYSLGHYCQFDSRPLRGFVLVARDLAS 110
+ ++ IW + S+ K A F+ P PD G++ LG + G +VA
Sbjct: 28 TEYHRIW--DSAGSRAKPAAFWRPTPAPDVLPGYFPLGDVFIAGHENINGSTVVA---VV 82
Query: 111 SEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVT 170
EA+ +++ + PAL++P D+ L+W D G + +G W P P GY +MG + +
Sbjct: 83 CEAD-TPSADPTRGPALRRPDDFELIW-KDSGSKSKKDGA--VWRPLAPQGYVAMGAVCS 138
Query: 171 KTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRP 217
KP ++ VRCVR DL +V LI++ + FS WS P
Sbjct: 139 NDHEKPSVNAVRCVRADLVIASDVGELIWNDKGSGARQSFSAWSIAP 185
>gi|398975325|ref|ZP_10685473.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM25]
gi|398140549|gb|EJM29511.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM25]
Length = 466
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 79/171 (46%), Gaps = 20/171 (11%)
Query: 54 SRFNFIWSCNLLQSKKKSATFYEPAGIPD---GFYSLGHYC--QFDSRPLRGFVLVAR-- 106
+ F+ IW SK K +F+ P PD G++ LG D+ + + V R
Sbjct: 28 TEFHRIWDNT--GSKSKPGSFWRPTPAPDLLPGYFPLGDLVISGRDNINEKRVMAVVREG 85
Query: 107 DLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMG 166
DL ++EA K PAL +P DY VW SD G + N G W P PP+GY ++G
Sbjct: 86 DLQNAEAG--------KGPALSRPNDYQWVW-SDTGSRAN--GDCSIWRPIPPEGYVALG 134
Query: 167 FLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRP 217
+ + +KP + VRCVR DL LI+ + FS W+ P
Sbjct: 135 LVCSNDSSKPSFNAVRCVRADLVVASSASTLIWSDKGSGADRDFSTWTVSP 185
>gi|398962040|ref|ZP_10679060.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM30]
gi|398151563|gb|EJM40107.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM30]
Length = 434
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 92/221 (41%), Gaps = 20/221 (9%)
Query: 54 SRFNFIWSCNLLQSKKKSATFYEPAGIPD---GFYSLGHYCQFDSRPLRGFVLVARDLAS 110
S F+ IW S+ K A F+ P PD G++ LG + G +VA
Sbjct: 28 SEFHRIWDST--GSRAKPAAFWRPTPAPDVLPGYFPLGDVFTAGLHNINGSTVVA---VV 82
Query: 111 SEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVT 170
EA+ +++ PAL +P D+ L+W D G +G W P P GY +MG + +
Sbjct: 83 CEADFP-SADSMNGPALCRPEDFELIW-KDSGSGSKKDGA--VWRPLAPQGYVAMGAVCS 138
Query: 171 KTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVG 230
KP L+ VRCVR DL ++ LI++ + S WS P + G
Sbjct: 139 NDHEKPSLNAVRCVRADLVIASDIGELIWNDRGSGARQSLSAWSVTPPGAAPGEIHFAPG 198
Query: 231 TFFCSSNW--------ISGQELNIACLKNLDPKLHAMPNCD 263
TF ++ + + + I L P+L + D
Sbjct: 199 TFIAANTFNRPESNVGVYSLRMQITPLIKSSPQLPVLSAAD 239
>gi|307111823|gb|EFN60057.1| hypothetical protein CHLNCDRAFT_133315 [Chlorella variabilis]
Length = 422
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 92/215 (42%), Gaps = 23/215 (10%)
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNG----------ALLYKAGDLVGEAI 310
+ D + ++ + P F HP + ++P + +F + ALL G + +
Sbjct: 6 DLDWLCGVVARHAPAFFLHPADRFMPCTAEFFLQHSELRAEVDGRSALLLPQGSVAAPLL 65
Query: 311 DPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGG---TFTDIVM 367
+ +P G R W++L R+ + ++ ++ H K +G T + +
Sbjct: 66 LEAQRAVPPGCR----LWLNL-DPAARRGMPQEALDDVPIFAHPKLILGPDGCTVEALEL 120
Query: 368 WVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH--SGGKW 425
+ G +VG V K G H GDWEH T R+ TGEL ++++ H G W
Sbjct: 121 TYITLYAHNGAYRVG--GVGLIKTGDHDGDWEHCTARVHPTTGELLGMWYNAHRCRDGCW 178
Query: 426 VAAYDLEY-IEGNKAIVYSSKNGHASFPHPGTYLQ 459
VA + + + Y + +GH ++P PG L+
Sbjct: 179 VAGPQVPRDPQTGRVAAYVALHGHGTYPRPGRVLR 213
>gi|398990951|ref|ZP_10694113.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM24]
gi|399011530|ref|ZP_10713861.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM16]
gi|398117678|gb|EJM07424.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM16]
gi|398141839|gb|EJM30746.1| Plant hypothetical protein (DUF946) [Pseudomonas sp. GM24]
Length = 458
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 83/183 (45%), Gaps = 12/183 (6%)
Query: 54 SRFNFIWSCNLLQSKKKSATFYEPAGIPD---GFYSLGHYCQFDSRPLRGFVLVARDLAS 110
+ F+ IWS N SK K ATF+ P PD G++ LG + G +VA +
Sbjct: 28 TEFHRIWSTN--GSKAKPATFWRPTPAPDALPGYFPLGDVLIPGDTSING-EMVAAVVCE 84
Query: 111 SEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVT 170
+ +GA + N AL +P+D+ LVW E G + + W P P GY ++G + +
Sbjct: 85 KDMKGAVSPN---GKALARPVDFELVW--KETGAPSVTRMSI-WRPLAPVGYVALGLVCS 138
Query: 171 KTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVG 230
KP L+ VRCVR DL V LI++ + FS + P + G
Sbjct: 139 NDHYKPSLNSVRCVRLDLVIAANVGELIWNDKGSGAKLSFSAFGIEPSTAAAGDIHFAPG 198
Query: 231 TFF 233
TF
Sbjct: 199 TFV 201
>gi|395496107|ref|ZP_10427686.1| hypothetical protein PPAM2_08566 [Pseudomonas sp. PAMC 25886]
Length = 433
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 88/201 (43%), Gaps = 20/201 (9%)
Query: 43 INLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDG---FYSLGHYCQFDSRPL- 98
I ++ + + F+ +WS +ATF+ P D F SLG D R +
Sbjct: 4 IRFKDLLINFTTEFDLLWSAKD-SGADNAATFWRPNTTADSLNSFSSLGDVVASDYRTIN 62
Query: 99 -RGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQ 157
R FV V SEA+ + + AL+ P+ + L+W + G + E W P
Sbjct: 63 NRKFVAVV-----SEADPVNGT------ALRAPVGFNLIW-KNSGKKTRSEFS--VWKPV 108
Query: 158 PPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRP 217
PP+GY +MG + +KP L+ VRCVR DL + + I++ + FS W
Sbjct: 109 PPEGYVAMGMVCCIGYDKPTLNAVRCVRADLAVDAYIGNAIWNDKGSGARMDFSAWDVHT 168
Query: 218 CNRGMLGRGVSVGTFFCSSNW 238
+ ++ GTF+ + W
Sbjct: 169 PDVEPEAAYIATGTFYGTGGW 189
>gi|168018380|ref|XP_001761724.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687095|gb|EDQ73480.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 36/269 (13%)
Query: 260 PNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDP-SGSNLP 318
P ++ L + Y P + F E + PS+V F + L+ + G + DP S +NL
Sbjct: 129 PTDSRLDCLTKAYAPLLKFDSFESFFPSAVDDFLPSVKLINEDGSAIPNQPDPLSSANLV 188
Query: 319 SGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
S L + G I +S L+ D V + F PFN
Sbjct: 189 S--PTSKPLTTFLTTKGTVPIYTTIVRKSPILF------------DFVYFAFYPFNQ--- 231
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRIC--NFTGELWSIYFSQHSGGKWVAAYDLEYIEG 436
G + F+ G HVGDWE+ TLRI N +L + Y SQHS G + +L ++
Sbjct: 232 ---GKLIRNFTVWGNHVGDWEYVTLRIRFENPGPKLIAAYTSQHSSGDCLTPSNLIFVSN 288
Query: 437 NK-AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYEL---VAAEY 492
I+Y ++ H ++ PG + IL + +D N + Q+E V
Sbjct: 289 TAHPILYVARGSHGTYNKPGRTVYRRIILIGDLADDHDGGN-----AEQWETWNNVVVRV 343
Query: 493 LGEGVVAEPSWLQFMRKW----GPTIVYD 517
G+ + E +W F +W GPT V+D
Sbjct: 344 EGQPLTREYTWWGFNGRWGNKQGPTSVFD 372
>gi|149922458|ref|ZP_01910890.1| Uncharacterized conserved protein of probably eukaryotic origin
[Plesiocystis pacifica SIR-1]
gi|149816653|gb|EDM76145.1| Uncharacterized conserved protein of probably eukaryotic origin
[Plesiocystis pacifica SIR-1]
Length = 373
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 85/205 (41%), Gaps = 43/205 (20%)
Query: 263 DQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGR 322
D + Y P ++F +E Y PSS W F P P G
Sbjct: 80 DPRQTALETYAPRIYFPANEAYWPSSAEWAF------------------PFLERFPDG-- 119
Query: 323 NDGEFWI----DLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGT 378
G++W+ +L S G +E+A +Y GG D+V +++ P+N
Sbjct: 120 -QGQYWVRSAAELGSPSDVLPFFSGELETAPVYGFWADK-GGGVVDLVYFIYYPYNRGKE 177
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD-----LEY 433
L A + G HVGDWEH T+R+ + E +Y S HS G AYD +E
Sbjct: 178 L-------ADTVWGNHVGDWEHITVRLVD--DEPSQVYLSAHSFG---GAYDWDGGEVEL 225
Query: 434 IEGNKAIVYSSKNGHASFPHPGTYL 458
EG +VYS+ H + PG ++
Sbjct: 226 FEGTHPVVYSAWGSHGFWAQPGNHV 250
>gi|384246387|gb|EIE19877.1| hypothetical protein COCSUDRAFT_48717 [Coccomyxa subellipsoidea
C-169]
Length = 504
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 88/217 (40%), Gaps = 25/217 (11%)
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGAL--------------LYKAGDLV 306
++ ++I + P +FHP+E Y P +V WF L + + G L
Sbjct: 119 TVQELQSVIASNCPVFYFHPEERYYPCTVQWFLERCELQLIRKGWRRRVLRVIEQVGGLD 178
Query: 307 GEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVK----PAVGG-T 361
GE + + + + + L D + + + +Y H K P G T
Sbjct: 179 GETLKRAEAWFGAQPFKRSFMMLRL-VDPQHRSGQRNQINQVPIYAHAKELVDPQTGRRT 237
Query: 362 FTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS--Q 419
+I F +NG L + +G H DWEH T+R+ + +Y+S +
Sbjct: 238 ALEINYMKFLAYNGSYKL---FGWIPAGNLGAHDADWEHVTMRLTPDGRSVLGVYYSAHR 294
Query: 420 HSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGT 456
H G W +A D+ E + + + + NGH S+P GT
Sbjct: 295 HEDGVWRSAKDVPRSEKGRPLAHIAVNGHGSYPTAGT 331
>gi|290956784|ref|YP_003487966.1| hypothetical protein SCAB_22911 [Streptomyces scabiei 87.22]
gi|260646310|emb|CBG69405.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 328
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 102/250 (40%), Gaps = 15/250 (6%)
Query: 268 LIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEF 327
L R Y P ++ DE Y P++V F N L + + +P L + +
Sbjct: 40 LARRYAPRIWLQQDESYFPAAVEPFLANTHLETRT-----DNGNPHQQFLVTNQALGCDS 94
Query: 328 WIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNV- 386
+ P G++ + A++ TDI W F P+N + VG +
Sbjct: 95 CTNPPFLAGQRPGQTAVPAYAEVVHRTDNGNPTNITDINYWTFFPYNNGKRVCVGWFSRW 154
Query: 387 ----AFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY-IEGNKAIV 441
+S G HVGDWEH T+R + + QH GG+ E + G++ +V
Sbjct: 155 GCVGGYSTFGNHVGDWEHVTVRFVDDMPH--RVSMGQHDGGQTFPYGGSEVALAGDQPVV 212
Query: 442 YSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP 501
Y+++ H +P + + G + +D L+ D+ ++ A + +G E
Sbjct: 213 YAAQGSHGMYPDARRHTYRNLPNGDTLNDDTGAGTLW-DTRQALKVFAWQPVGS-YTGEW 270
Query: 502 SWLQFMRKWG 511
+WL + +WG
Sbjct: 271 AWLNYTGRWG 280
>gi|260810666|ref|XP_002600077.1| hypothetical protein BRAFLDRAFT_79665 [Branchiostoma floridae]
gi|229285362|gb|EEN56089.1| hypothetical protein BRAFLDRAFT_79665 [Branchiostoma floridae]
Length = 409
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 105/273 (38%), Gaps = 40/273 (14%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDP----------S 313
Q+ AL+ Y P V+ E Y PSSV + N +Y G + S
Sbjct: 116 QMEALVTKYAPKVWLARGEQYKPSSVDFHLRN-VKVYGGGRVYSSTASTLSSCSTSCYMS 174
Query: 314 GSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPF 373
+ S F+ P RQ + V + TDI W+F P+
Sbjct: 175 SNQRLSHASATLSFFSGEPVGPTRQPPVYA----------VWKRINSVTTDIFYWMFYPY 224
Query: 374 NGPGTLKVGIMNVA-----FSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS-GGK--W 425
N + +G +S G HVGDWEH T R+ + SIY S H+ GGK W
Sbjct: 225 NRGKRVCIGFRAWGSCIGGYSTFGHHVGDWEHMTARLVG--NQPSSIYVSAHNFGGKYDW 282
Query: 426 VAAY-------DLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
AA D EG ++YS+K H + +PGT+ +IL I D +
Sbjct: 283 DAASQTYKKGGDTVRTEGTHPVLYSAKGSHGLWSNPGTHTY-EKILKIEKLQDETSTGTV 341
Query: 479 VDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWG 511
++ + + G G SWL + +WG
Sbjct: 342 WNTWRNVPCI-KYHPGGGYTGSWSWLNYQGRWG 373
>gi|189202240|ref|XP_001937456.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984555|gb|EDU50043.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 351
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 106/260 (40%), Gaps = 29/260 (11%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ H E Y P+ + F N ++ G + + SNL G + W
Sbjct: 35 VLKYAPVVYLHSTETYFPTDIQTFLDNTTPRVNFTEVPGPSKPLTTSNLNQMG---SDVW 91
Query: 329 I----DLPSD----GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLK 380
+ D+ D G + G + A + G D F +N G
Sbjct: 92 LTSNDDVTKDPAWIKGTKPNAQGKTDGAITAAIIVNDKGNGNVDAFYMYFSAYNYGGE-- 149
Query: 381 VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNK 438
++ + G HVGDWEH +R N GE SI++SQH+ G+ A+ +E G++
Sbjct: 150 --VLGWSALNFGNHVGDWEHTMVRFLN--GEPQSIWYSQHANGQ---AFRYPTVEKSGDR 202
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGI--GVRNDAARSNLYVD---SSIQYELVA--AE 491
I YS+K HA++ GT+ L + GV D D S+ Y+ A +
Sbjct: 203 PIAYSAKGSHANYAISGTHDHTIPNLNLPGGVLEDYTDKGTLWDPLQSAWYYKFDAKTTQ 262
Query: 492 YLGEGVVAEPSWLQFMRKWG 511
+ A +WL + +WG
Sbjct: 263 FSAYDGAAPTAWLDWRGRWG 282
>gi|260840103|ref|XP_002613786.1| hypothetical protein BRAFLDRAFT_85327 [Branchiostoma floridae]
gi|229299176|gb|EEN69795.1| hypothetical protein BRAFLDRAFT_85327 [Branchiostoma floridae]
Length = 619
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 99/239 (41%), Gaps = 32/239 (13%)
Query: 192 CEVHHLIFDAISKFSSSPFSVWSTRPC-------NRGMLGRGVSVGTFFCS---SNWISG 241
C +F +K+ + V RPC N + +G G C+ S ++SG
Sbjct: 278 CPTGQTVFIDYAKYDRTATPVCPCRPCDADCRAANSLAVLKGACEGLQECTIRTSGYVSG 337
Query: 242 QELNIACLKNLDPKLHAMP----NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA 297
++ + L+P + DQ+ L+ Y P ++ E Y PSSV + N A
Sbjct: 338 DPCHVRHMY-LEPTYRCVTVTPVRDDQLEDLVTKYAPKIWLAKGERYKPSSVDFHLENVA 396
Query: 298 L-----LYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYV 352
+ +Y + S +G R E LP G +I G +Y
Sbjct: 397 VHDGHKVYSSNASTLPTCSESCHMSTTGWRRSDE--DSLPFFHGEEI---GPTRQPPVYA 451
Query: 353 HVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIG------QHVGDWEHFTLRI 405
V+P + TDI W+F P+NGP G+ ++ + +G +HVGDWEH TLR+
Sbjct: 452 IVRP-INSITTDIFYWMFYPYNGPDPACPGLWSLWGTCMGGMRGVLRHVGDWEHMTLRL 509
>gi|330917976|ref|XP_003298040.1| hypothetical protein PTT_08621 [Pyrenophora teres f. teres 0-1]
gi|311329012|gb|EFQ93887.1| hypothetical protein PTT_08621 [Pyrenophora teres f. teres 0-1]
Length = 351
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 105/260 (40%), Gaps = 29/260 (11%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ H E Y P+ + F + ++ G + + SNL G + W
Sbjct: 35 VLKYAPVVYLHSTETYFPTDIQTFLDHTTPRVNFTEVTGPSKPLTTSNLNQMG---SDVW 91
Query: 329 I----DLPSD----GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLK 380
+ D+ D G + G + A + G D F +N G
Sbjct: 92 LTSNDDVTKDPAWIKGTKPNAQGKTDGAITAAVIVNDKGNGNVDAFYMYFTAYNYGGE-- 149
Query: 381 VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNK 438
++ + G HVGDWEH +R N GE SI++SQH+ G+ A+ +E G++
Sbjct: 150 --VIGWSILNFGNHVGDWEHTMVRFLN--GEPQSIWYSQHANGQ---AFRYSTVEKSGDR 202
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGI--GVRNDAARSNLYVD---SSIQYELVA--AE 491
I YS+K HA++ GT+ L + GV D D S+ Y+ A +
Sbjct: 203 PIAYSAKGSHANYAIAGTHDHTIPNLNLPGGVLEDYTDKGTLWDPLQSAWYYKFDAKNTQ 262
Query: 492 YLGEGVVAEPSWLQFMRKWG 511
+ A WL + +WG
Sbjct: 263 FSAYDGAAPTGWLDWRGRWG 282
>gi|451852120|gb|EMD65415.1| hypothetical protein COCSADRAFT_139336 [Cochliobolus sativus
ND90Pr]
Length = 363
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 107/262 (40%), Gaps = 31/262 (11%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ H +E Y P+ + F T+ ++ G + SNL N W
Sbjct: 37 VLKYAPIVYLHSEEKYYPTDLQSFLTHSTPRVDFNEVPGPSKPLKTSNLDQMANN---VW 93
Query: 329 I----DLPSD----GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLK 380
+ D+ +D G + G E A + G D F +N G +
Sbjct: 94 LTSNDDVTTDPTWIKGTRPNASGKTEGATTAAIIVNDKGNGNVDAFYMYFYAYNYGGEV- 152
Query: 381 VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE----G 436
+G + F G HVGDWEH +R N G+ +I+FSQH+ G+ A+ +E G
Sbjct: 153 LGWKQLNF---GNHVGDWEHTMVRFSN--GQPTAIWFSQHANGQ---AFRYSAVERDATG 204
Query: 437 NKAIVYSSKNGHASFPHPGTYLQGSEILGI--GVRNDAARSNLYVD---SSIQYELVAAE 491
+ I YSSK HA++ GT+ L + GV D D S+ Y+ +
Sbjct: 205 IRPIAYSSKGSHANYAMGGTHDHTIPNLNLPGGVLEDYTNRGTVWDPLLSAWYYKFTPST 264
Query: 492 YLGEGV--VAEPSWLQFMRKWG 511
E A +WL F +WG
Sbjct: 265 NKFEAYDGQAPTAWLNFKGRWG 286
>gi|353243908|emb|CCA75388.1| hypothetical protein PIIN_09372 [Piriformospora indica DSM 11827]
Length = 471
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 75/170 (44%), Gaps = 21/170 (12%)
Query: 255 KLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK--AGDLVGEAIDP 312
++ A P DQ L + Y P FH DEVY PS+V +F NG + K +G +V +D
Sbjct: 18 QVRAAPT-DQNVLLAKKYAPQFRFHKDEVYFPSTVE-YFVNGPVTVKDSSGKVVATKMDT 75
Query: 313 SGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAK--LYVHVKPAVGGTFTDIVMWVF 370
+ N G + + ++ N S + YV + P G D+ W++
Sbjct: 76 ASDN------GVGTYMSTDVNANLNGFLRGQNPSSTQTSTYVFIAPKDNG-VVDLYYWIY 128
Query: 371 CPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH 420
CP+N + V +G HVGDWE T+R N G SI + H
Sbjct: 129 CPYNLGKKIPV------LGWMGNHVGDWERITIRTVN--GVATSIDYHAH 170
>gi|413933237|gb|AFW67788.1| hypothetical protein ZEAMMB73_320181, partial [Zea mays]
Length = 95
Score = 63.9 bits (154), Expect = 2e-07, Method: Composition-based stats.
Identities = 37/77 (48%), Positives = 47/77 (61%), Gaps = 7/77 (9%)
Query: 35 GQGFASGRINLG--EIEVCRISRFNFIWSCNLLQSKKKS---ATFYEPAGIPDGFYSLGH 89
G GFA GRI +G E+E+ + F I C L ++ ATFY P G+P+GF LG+
Sbjct: 9 GGGFARGRIRIGGGELELAAATAFQKI--CTLSSRRRGGGGSATFYRPVGVPEGFSLLGY 66
Query: 90 YCQFDSRPLRGFVLVAR 106
YCQ + RPL G VLVAR
Sbjct: 67 YCQPNCRPLHGHVLVAR 83
>gi|159464315|ref|XP_001690387.1| hypothetical protein CHLREDRAFT_144253 [Chlamydomonas reinhardtii]
gi|158279887|gb|EDP05646.1| predicted protein [Chlamydomonas reinhardtii]
Length = 391
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 86/219 (39%), Gaps = 36/219 (16%)
Query: 268 LIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLY------KAGDLVGEAIDPSGSN--LPS 319
++ Y P +F H + Y+P SV WF L Y G+L + P G LP+
Sbjct: 9 VLARYCPVLFLHHLDKYMPCSVEWFIQRAGLHYYNEGVSGCGELHEFSSVPGGVEELLPA 68
Query: 320 GGRNDGEFW---------------IDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTF-- 362
G + +D GG + + +YVH K +
Sbjct: 69 GQVTQEKLLAEQDRVLVPEHLSLTLDSMHFGGHDRRR---LSEVPIYVHAKLVLDQVHGR 125
Query: 363 ---TDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
+I F FNG + G+ G HVGDWEH T+R+ T EL ++++
Sbjct: 126 PEAYEINYITFYAFNGHYAVPFGL---PILMTGNHVGDWEHLTVRLDARTLELQGVWYNA 182
Query: 420 HSG--GKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGT 456
H G+W A + + I + + NGH +PH GT
Sbjct: 183 HRNIEGEWCPAAAVPRTPCGRIIGHVAINGHGIYPHCGT 221
>gi|302532087|ref|ZP_07284429.1| predicted protein [Streptomyces sp. C]
gi|302440982|gb|EFL12798.1| predicted protein [Streptomyces sp. C]
Length = 416
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 97/214 (45%), Gaps = 33/214 (15%)
Query: 45 LGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYC-----------QF 93
GE+E+ ++++ W + K +F+ P P GF++LG +
Sbjct: 7 FGELELAFTDQYSWRWD-DEDTGGKYWVSFWHPKP-PQGFHALGTVALPAWGSENLNPKT 64
Query: 94 DSRPLRGFV--LVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCA 151
+ ++ V L +D + +EG K P L P DYT+VW G +Y C
Sbjct: 65 PNTKVKEHVVSLCVKDGPAKASEG-------KKPPLAHPEDYTVVWKDVGSGGKHYGAC- 116
Query: 152 FFWLPQPPDGYKSMGFLVTKTP-NKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPF 210
W P P GY +MG ++++ +KP L EV CVR+DLT ++ +++ + F
Sbjct: 117 --WRPVAPGGYVAMGCVMSENSYDKPALTEVMCVREDLTHLADL-KWVYEDKGTGAKDYF 173
Query: 211 SVWSTR--PCNRGMLGRG----VSVGTFFCSSNW 238
SVW+ + P R G V+ TF +S+W
Sbjct: 174 SVWANQVPPAYRDGRDGGHRALVAPNTFTAASSW 207
>gi|451997556|gb|EMD90021.1| hypothetical protein COCHEDRAFT_1031382 [Cochliobolus
heterostrophus C5]
Length = 405
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 106/262 (40%), Gaps = 31/262 (11%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ H +E Y P+ + F + ++ G + SNL N W
Sbjct: 37 VLKYAPIVYLHSEEKYYPTDLQSFLDHTTPRVDFNEVPGPSKPLRTSNLDQMANN---VW 93
Query: 329 I----DLPSD----GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLK 380
+ D+ +D G + G E A + G D F +N G +
Sbjct: 94 LTSNDDVTTDPTWIKGTKPNASGKTEGATTAAIIVNDKGNGNVDAFYMYFYAYNYGGEV- 152
Query: 381 VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE----G 436
+G + F G HVGDWEH +R N G+ +I+FSQH+ G+ A+ +E G
Sbjct: 153 LGWKQLNF---GNHVGDWEHTMVRFSN--GQPTAIWFSQHANGQ---AFRYSAVERDAAG 204
Query: 437 NKAIVYSSKNGHASFPHPGTYLQGSEILGI--GVRNDAARSNLYVD---SSIQYELVAAE 491
+ I YSSK HA++ GT+ L + GV D D S+ Y+ +
Sbjct: 205 IRPIAYSSKGSHANYAMGGTHDHTIPNLNLPGGVLEDYTNRGTVWDPLLSAWYYKFTPST 264
Query: 492 YLGEGV--VAEPSWLQFMRKWG 511
E A +WL F +WG
Sbjct: 265 NKFEAYDGQAPTAWLNFRGRWG 286
>gi|440801430|gb|ELR22450.1| RhoGAP domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 478
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 85/190 (44%), Gaps = 22/190 (11%)
Query: 52 RISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQ--FDSRPLRGFVLVARDLA 109
R + ++ +W SK +F+ P P GF+ + Q + P + +
Sbjct: 12 RPAAYHKVWDDRKTGSKPHHVSFWRPVP-PRGFHRVSDLAQRTHEKEPAQMARCLVVAAH 70
Query: 110 SSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLV 169
S A+G+ SP L+ P+DYT VW +D+ G Y C+ W+P P+GY ++G +
Sbjct: 71 GSVADGS-------SPVLRPPVDYTCVW-TDKRSGGRYGNCSL-WVPVAPEGYCALGCVA 121
Query: 170 TKTPNKPELDEVRCVRDDLTDKCEVHH------LIFDAISKFSSSPFSVWSTRPCNRGML 223
KP L+EV CV L V L DA S S + S+W+ P +
Sbjct: 122 VVGYEKPGLNEVMCVHSSLCTPAGVVSEDPTGPLWKDAGSG-SVNDVSIWTMAPLE---I 177
Query: 224 GRGVSVGTFF 233
G G+S+ +F
Sbjct: 178 GEGISIDSFL 187
>gi|225559289|gb|EEH07572.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 351
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 100/238 (42%), Gaps = 28/238 (11%)
Query: 238 WISGQELN-------IACLK-NLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSV 289
WIS + L+ I CL+ L + D I + YGP ++ H ++ Y+PS+
Sbjct: 2 WISHRTLSMLVASLLIQCLRFVLLVDATCLAQYDGIPDFVLKYGPIIYLHSEDQYMPSAF 61
Query: 290 SWFFTNGALLYKAGDLVGEAIDPSGSNLPS----GGRN----DGEFWIDLPS--DGGRQI 339
+ N + + NL S GG++ E LP G R
Sbjct: 62 ATLLENSKPTVNYTPVADVPSPLTLDNLDSLNSLGGKDVFLTTNEGIRALPEWFSGTRPD 121
Query: 340 VKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWE 399
+++ + V+ G D + + +N G +GI + G HVGDWE
Sbjct: 122 KAGKTVDAVSSIIVVRDHGDGKMVDAFYFYYYGYN-QGNTVIGI------QFGNHVGDWE 174
Query: 400 HFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
H +R N GE ++++SQH+GG+ Y +G + + YS+ HA + PGT+
Sbjct: 175 HNMIRFEN--GEPQAVWYSQHAGGE-AFTYQATEKQGLRPVGYSANGSHAVYATPGTH 229
>gi|393238100|gb|EJD45638.1| hypothetical protein AURDEDRAFT_184584 [Auricularia delicata
TFB-10046 SS5]
Length = 555
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/278 (23%), Positives = 113/278 (40%), Gaps = 27/278 (9%)
Query: 245 NIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNG-ALLYKAG 303
I L P D AL R Y P + FH DE Y SSV +F + ++L G
Sbjct: 98 TIVTLLATSPLFARAAPTDANIALARKYAPQLKFHKDEKYWGSSVEYFLSGPISILDDNG 157
Query: 304 DLVGE-AIDPSG-SNLPSGGRNDGEFWIDLPSDGGRQI-----VKHGNMESAKLYVHVKP 356
V ++ PS +++P G N +I P + + K+Y + P
Sbjct: 158 VTVSTGSLTPSNLADVPGKGANT---YITTPMNAEAKAGFLAGQNPSKTNDGKIYTFIAP 214
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
G D+ W+F P+N T+ + ++G HVGDWE T+R N G S+
Sbjct: 215 KDNG-VVDLYYWLFTPYNLAKTVPL------LGEVGNHVGDWERMTVRTVN--GTATSVD 265
Query: 417 FSQHSG-GKWVAAYD--LEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAA 473
+ HS G +D ++ + + Y + H + GT+ + ++ +++
Sbjct: 266 YHAHSDTGSGTIPWDKVKKFDNDQRPVGYIADGSHGFWSSAGTFTYVNAVI-FKLQDKTG 324
Query: 474 RSNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWG 511
+ D+ + +V Y + + SWL + +WG
Sbjct: 325 DDGVAWDT--RDSIVPINY-PDTYSGDLSWLNYQGRWG 359
>gi|353243907|emb|CCA75387.1| hypothetical protein PIIN_09371 [Piriformospora indica DSM 11827]
Length = 472
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 71/159 (44%), Gaps = 15/159 (9%)
Query: 268 LIRNYGPTVFFHPDEVYLPSSVSWFFTNGALL--YKA-GDLVGEAIDPSGSNLPS-GGRN 323
L + Y P FH E Y PS++ +F G + Y A G + G + +NL R
Sbjct: 29 LAQKYAPQFRFHKSETYFPSTIEFFLAGGGGVKAYDANGPISGAPFPLTTANLGDLANRG 88
Query: 324 DGEFW-IDLPSD-GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKV 381
G + D+ ++ G + ++ N + YV + P G D+ W+FCP+N
Sbjct: 89 SGMYLTTDVKANLNGFLLGQNPNSGAGSTYVFIAPKANGV-VDLYYWIFCPYNQ------ 141
Query: 382 GIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH 420
G + +G H+GDWE T+R N G S+ + H
Sbjct: 142 GKKIIVLGYVGDHIGDWERITVRTVN--GVATSVDYHAH 178
>gi|261204675|ref|XP_002629551.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
gi|239587336|gb|EEQ69979.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
gi|239614125|gb|EEQ91112.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3]
gi|327353894|gb|EGE82751.1| vacuolar protein sorting-associated protein 62 [Ajellomyces
dermatitidis ATCC 18188]
Length = 352
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 91/226 (40%), Gaps = 20/226 (8%)
Query: 242 QELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK 301
Q L A L + P D + + Y P ++ H ++ Y+PS+ + N
Sbjct: 19 QCLRYALLVDATPASTCPAQHDSLPDFVLEYAPLIYLHSEDPYMPSAFATLLENSIPTIN 78
Query: 302 AGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVK----------HGNMESAKLY 351
+ + NL S GE +G R + + +++
Sbjct: 79 YTPVADVPSPLTLGNLDSLNILGGEDIFLTSKEGIRALPEWFRGTRPDRTGKTVDAVSSV 138
Query: 352 VHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGE 411
+ V+ G D+ + + +N G +GI + G HVGDWEH +R N G
Sbjct: 139 IVVRDHGDGKTVDVFYFYYFAYN-QGNTVIGI------QFGNHVGDWEHNMIRFEN--GA 189
Query: 412 LWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
+I++SQHS G+ Y+ +G + + YSS HA + PGT+
Sbjct: 190 PQAIWYSQHSAGQ-AFTYEATEKQGLRPVGYSSNGSHAVYSTPGTH 234
>gi|395797166|ref|ZP_10476457.1| hypothetical protein A462_17895 [Pseudomonas sp. Ag1]
gi|395338590|gb|EJF70440.1| hypothetical protein A462_17895 [Pseudomonas sp. Ag1]
Length = 434
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 59/134 (44%), Gaps = 5/134 (3%)
Query: 126 ALQKPLDYTLVW-CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCV 184
AL+ P+D+ VW S + N+ WLP P+GY +MG + +KP + +RCV
Sbjct: 80 ALRPPVDFERVWEHSGPRARANFS----IWLPTAPEGYVAMGLVCGLGNDKPPRNTIRCV 135
Query: 185 RDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQEL 244
R+DL + V LI+ + FS W P ++ GTF + +
Sbjct: 136 REDLVTQAYVDQLIWSDQGSGAHRDFSAWGVSPSQAAPGQAYLAPGTFTGAEGYAKPTRQ 195
Query: 245 NIACLKNLDPKLHA 258
+ LD LH+
Sbjct: 196 SPVHALRLDLALHS 209
>gi|225680022|gb|EEH18306.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 345
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 86/201 (42%), Gaps = 22/201 (10%)
Query: 268 LIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEF 327
+ Y P ++ H +E Y+PS+ N +VG A P + N G
Sbjct: 43 FVLKYAPLIYIHSEESYMPSNFETLVANCRPTVNYTQIVG-APSPLTLDNLDLLNNLGGK 101
Query: 328 WIDLPSDGGRQIV----------KHGNMESAKLYVHVKPAVG-GTFTDIVMWVFCPFNGP 376
+ L + G + + K G E A V V G G D+ + + +N
Sbjct: 102 NVFLTTKEGIRALPEWFRGTRPNKAGQTEDAVSSVVVLREHGDGKTADVFYFYYYAYN-Q 160
Query: 377 GTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG 436
G + G +++G HVGDWEH +R N + SI++SQH+ G+ Y+ +G
Sbjct: 161 GNMVFG------TQLGNHVGDWEHNMIRFVNEQPQ--SIWYSQHAAGQ-AFTYEATEKQG 211
Query: 437 NKAIVYSSKNGHASFPHPGTY 457
+ I YSS HA + PGT+
Sbjct: 212 FRPIGYSSNGSHAVYATPGTH 232
>gi|332707673|ref|ZP_08427701.1| protein of unknown function, DUF946 [Moorea producens 3L]
gi|332353582|gb|EGJ33094.1| protein of unknown function, DUF946 [Moorea producens 3L]
Length = 222
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 67/146 (45%), Gaps = 19/146 (13%)
Query: 43 INLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFV 102
++L ++ + + +F+ +W + ++ FY P P GF+ +G Y Q + G V
Sbjct: 40 LSLFDLNIQKCDKFDLVWY-DRGSGAERDGAFYRPVPTP-GFFVVGDYGQGNYGQPNGDV 97
Query: 103 LVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGY 162
L N K AL P+D+ L+W D G + +G FW P PP GY
Sbjct: 98 LCF--------------NPLKKSALANPIDFALIW-KDTGSGAHRDGS--FWRPIPPQGY 140
Query: 163 KSMGFLVTKTPNKPELDEVRCVRDDL 188
+G +V + KP + +C+ +DL
Sbjct: 141 SCIGDVVQEGYAKPYRPDYKCLHNDL 166
>gi|421143114|ref|ZP_15603073.1| hypothetical protein MHB_27167 [Pseudomonas fluorescens BBc6R8]
gi|404505683|gb|EKA19694.1| hypothetical protein MHB_27167 [Pseudomonas fluorescens BBc6R8]
Length = 434
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 58/134 (43%), Gaps = 5/134 (3%)
Query: 126 ALQKPLDYTLVW-CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCV 184
AL+ P+D+ VW S + N+ W P P+GY +MG + +KP + +RCV
Sbjct: 80 ALRPPVDFERVWEHSGPRARANFS----IWRPTAPEGYVAMGLVCGLGNDKPPRNTIRCV 135
Query: 185 RDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQEL 244
R+DL + V LI+ + FS W P ++ GTF + +
Sbjct: 136 REDLVTQAYVDQLIWSDQGSGAHRDFSAWGVSPSQAAPGQAYLAPGTFTGAEGYAKPTRQ 195
Query: 245 NIACLKNLDPKLHA 258
+ LD LH+
Sbjct: 196 SQVHALRLDLALHS 209
>gi|353242932|emb|CCA74530.1| hypothetical protein PIIN_08482 [Piriformospora indica DSM 11827]
Length = 449
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 72/141 (51%), Gaps = 20/141 (14%)
Query: 360 GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
G D+ + F PFN T V F+ G HVGDWEH +R C+ G +++ S
Sbjct: 176 GDILDVFYFFFYPFNQGNT-------VLFTHFGNHVGDWEHAMIRFCD--GVPQAMHLSA 226
Query: 420 HSGG---KWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G KW AA++ G + ++Y+++ HA +P G + S +L +G + + +
Sbjct: 227 HSDGNSWKW-AAFEK---RGERPVIYAARGSHAMYPSAGKH-DYSGVLLVGPSDYTSTGH 281
Query: 477 LYVDSSIQYELVAAEYLGEGV 497
L+ D ++ + VAA + GV
Sbjct: 282 LW-DPTLNF--VAARSIVGGV 299
>gi|156035687|ref|XP_001585955.1| hypothetical protein SS1G_13047 [Sclerotinia sclerotiorum 1980]
gi|154698452|gb|EDN98190.1| hypothetical protein SS1G_13047 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 362
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 93/214 (43%), Gaps = 28/214 (13%)
Query: 258 AMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNL 317
A PN + + + +Y P V+ E+Y PS + N Y L+ PS NL
Sbjct: 83 ARPNT-TLPSFVFDYAPLVWLDEAEMYFPSDMYAQILNTKP-YVNLTLIPNDTLPSPLNL 140
Query: 318 PSGGRNDGEFWIDLPSDGGRQIVKHGNME---SAKLYVHVKP--AVGGTFTDIVMWVFCP 372
+ +D ++ G + N++ S V P A G T I V
Sbjct: 141 EN---------LDFLNNWGLNVYLTSNIDVTTSPAYLTGVVPSSATGRTQNAISCSVILS 191
Query: 373 FNGPGTLKVGIM--------NVAF-SKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG 423
+G G + V M N F ++G H+GDWEH +R N T + ++++SQH+ G
Sbjct: 192 DHGDGLMDVFYMYFYAYNQGNTVFGQELGDHIGDWEHNMIRFVNGTPQ--AVWYSQHANG 249
Query: 424 KWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
+ Y + G + I YS++ HA++ PGT+
Sbjct: 250 Q-AFLYSVVEKSGKRPIAYSARGSHANYAIPGTH 282
>gi|154281487|ref|XP_001541556.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150411735|gb|EDN07123.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 656
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 72/160 (45%), Gaps = 20/160 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++N+ F G HVGDWEH +R N T + +IYFS
Sbjct: 423 GHGIVDAFWFYFYSFN----LGNKVLNIRF---GNHVGDWEHSLVRFHNGTPK--AIYFS 473
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HS G+ +YD G + ++YS+ HA + PGT+ + IL G+ +D
Sbjct: 474 EHSFGE-AFSYDAVEKMGKRPVIYSASGTHAMYATPGTH---AYILPWGLLHDRTDRGPL 529
Query: 479 VDSSIQYELVAAEYLGE---GVVAEP----SWLQFMRKWG 511
D + Y ++ + A P +W F WG
Sbjct: 530 WDPARNYHGYTYDHNNDVLRASTANPQSPTAWFYFAGHWG 569
>gi|325093070|gb|EGC46380.1| vacuolar sorting-associated protein [Ajellomyces capsulatus H88]
Length = 639
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 72/160 (45%), Gaps = 20/160 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++N+ F G HVGDWEH +R N T + +IYFS
Sbjct: 406 GHGIVDAFWFYFYSFN----LGNKVLNIRF---GNHVGDWEHSLVRFHNGTPK--AIYFS 456
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HS G+ +YD G + ++YS+ HA + PGT+ + IL G+ +D
Sbjct: 457 EHSFGE-AFSYDAVEKMGKRPVIYSASGTHAMYATPGTH---AYILPWGLLHDRTDRGPL 512
Query: 479 VDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D + Y ++ + + A +W F WG
Sbjct: 513 WDPARNYHGYTYDHNNDVLRASTVNPQSPTAWFYFAGHWG 552
>gi|154273957|ref|XP_001537830.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150415438|gb|EDN10791.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 313
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 93/225 (41%), Gaps = 21/225 (9%)
Query: 244 LNIACLK-NLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKA 302
L I CL+ L + D I + Y P ++ H ++ Y+PS+ + N
Sbjct: 15 LLIQCLRFVLLVDATCLAQYDGIPDFVLKYAPIIYLHSEDQYMPSAFATLLENSKPTVNY 74
Query: 303 GDLVGEAIDPSGSNLPS----GGRN----DGEFWIDLPS--DGGRQIVKHGNMESAKLYV 352
+ + NL S GG++ E LP G R + + +
Sbjct: 75 TPVADVPSPLTLDNLDSLNSLGGKDVFLTTNEGIRALPEWFSGTRPDKAGKTVHAVSSII 134
Query: 353 HVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGEL 412
V+ GT D + + +N G +GI + G HVGDWEH + N GE
Sbjct: 135 VVRDHGDGTMVDAFYFYYYGYN-QGNTVIGI------QFGNHVGDWEHNMILFEN--GEP 185
Query: 413 WSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
++++SQH+GG+ Y +G + + YS+ HA + PGT+
Sbjct: 186 QAVWYSQHAGGE-AFTYQATEKQGLRPVGYSANGSHAVYATPGTH 229
>gi|225563113|gb|EEH11392.1| vacuolar sorting-associated protein [Ajellomyces capsulatus G186AR]
Length = 658
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 72/160 (45%), Gaps = 20/160 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++N+ F G HVGDWEH +R N T + +IYFS
Sbjct: 425 GHGIVDAFWFYFYSFN----LGNKVLNIRF---GNHVGDWEHSLVRFHNGTPK--AIYFS 475
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HS G+ +YD G + ++YS+ HA + PGT+ + IL G+ +D
Sbjct: 476 EHSFGE-AFSYDAVEKMGKRPVIYSASGTHAMYATPGTH---AYILPWGLLHDRTDRGPL 531
Query: 479 VDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D + Y ++ + + A +W F WG
Sbjct: 532 WDPARNYHGYTYDHNNDVLRASTVNPQSPTAWFYFAGHWG 571
>gi|327348836|gb|EGE77693.1| vacuolar protein sorting protein 62 [Ajellomyces dermatitidis ATCC
18188]
Length = 706
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 72/160 (45%), Gaps = 20/160 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 505 GNGIVDAFWFYFYSFN----LGNIVLNVRF---GNHVGDWEHSLVRFHH--GKPKAVFFS 555
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HS G + LE I G + ++YS+ HA +P PG++ + IL +G+ D
Sbjct: 556 EHSFGDAYSYDALEKI-GKRPVIYSATGTHAMYPTPGSH---AYILPLGLLRDQTDRGPL 611
Query: 479 VDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D + Y ++ + + A W F WG
Sbjct: 612 WDPARNYHAYTYDHNNDVLRAANVTPHAPTEWFYFAGHWG 651
>gi|261195014|ref|XP_002623911.1| vacuolar protein sorting protein 62 [Ajellomyces dermatitidis
SLH14081]
gi|239587783|gb|EEQ70426.1| vacuolar protein sorting protein 62 [Ajellomyces dermatitidis
SLH14081]
Length = 655
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 72/160 (45%), Gaps = 20/160 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 454 GNGIVDAFWFYFYSFN----LGNIVLNVRF---GNHVGDWEHSLVRFHH--GKPKAVFFS 504
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HS G + LE I G + ++YS+ HA +P PG++ + IL +G+ D
Sbjct: 505 EHSFGDAYSYDALEKI-GKRPVIYSATGTHAMYPTPGSH---AYILPLGLLRDQTDRGPL 560
Query: 479 VDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D + Y ++ + + A W F WG
Sbjct: 561 WDPARNYHAYTYDHNNDVLRAANVTPHAPTEWFYFAGHWG 600
>gi|239610722|gb|EEQ87709.1| vacuolar protein sorting protein 62 [Ajellomyces dermatitidis ER-3]
Length = 620
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 72/160 (45%), Gaps = 20/160 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 419 GNGIVDAFWFYFYSFN----LGNIVLNVRF---GNHVGDWEHSLVRFHH--GKPKAVFFS 469
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HS G + LE I G + ++YS+ HA +P PG++ + IL +G+ D
Sbjct: 470 EHSFGDAYSYDALEKI-GKRPVIYSATGTHAMYPTPGSH---AYILPLGLLRDQTDRGPL 525
Query: 479 VDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D + Y ++ + + A W F WG
Sbjct: 526 WDPARNYHAYTYDHNNDVLRAANVTPHAPTEWFYFAGHWG 565
>gi|410081930|ref|XP_003958544.1| hypothetical protein KAFR_0G03760 [Kazachstania africana CBS 2517]
gi|372465132|emb|CCF59409.1| hypothetical protein KAFR_0G03760 [Kazachstania africana CBS 2517]
Length = 437
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 110/297 (37%), Gaps = 43/297 (14%)
Query: 246 IACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDL 305
I K ++ ++ N I + P V DE+Y P + + N + ++ +
Sbjct: 47 ILNAKQVNDSNKSLVNDSYIPQYVLENAPIVHLSDDELYFPQDIETYIENFKVTDESNNT 106
Query: 306 VGEAI--------------DPSGSNLPSG-----GRNDGEFWIDLPSDGGRQIVKHGNME 346
+ E + D S SN+ S + D D + G +
Sbjct: 107 ILEDLTLQQLENTFTFRNDDLSLSNVHSSKTFLTSKTDFTQNPDYMFGSLYEYKNTGEIR 166
Query: 347 SAKLYVHVKPAVGGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRI 405
+ V + G + D + F FN GP L G G H+GDWEH +R
Sbjct: 167 KSAPSVLIVVDKGNGWVDAFWFFFYAFNNGPTVLGNG-------PWGSHLGDWEHCVVRF 219
Query: 406 CNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEI 463
F G+ +IY SQHS G +AYD +E + +V+S+K H ++ PG +
Sbjct: 220 --FNGQPKAIYMSQHSTG---SAYDFNALEKFNGRPLVFSAKGTHGNYASPGRFSHDIPY 274
Query: 464 LGIGVRNDAA-RSNLYVDSSIQYELVAAEYLGEGVV--------AEPSWLQFMRKWG 511
+ ND R L+ +S Y Y V+ SWL + WG
Sbjct: 275 IAFKPLNDVTDRGPLWDPTSKVYSYKFNPYKDNEVIPINADAKKIGRSWLYYQGHWG 331
>gi|384251651|gb|EIE25128.1| hypothetical protein COCSUDRAFT_61368 [Coccomyxa subellipsoidea
C-169]
Length = 5009
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 87/200 (43%), Gaps = 27/200 (13%)
Query: 60 WSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTS 119
W + K +F+ P P G+ SLG + P + V++ L S AE HTS
Sbjct: 2525 WDKDARYPSKSGLSFWRPIP-PQGYISLGDCAEVGYDPPQSIVVL---LDSELAEMEHTS 2580
Query: 120 NLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELD 179
+P ++ P + L+W DE + + C W P P GY +MGF++ + P P +
Sbjct: 2581 ----TPVVRPPRGFELLW-RDESDRPDR--CLSIWRPVPFPGYVAMGFVIGRGPEPPSRN 2633
Query: 180 EVRCVRDDLTDKCEVHHLIFDAISKF----SSSPFSVWSTRPCNRGMLGRGVSVGTFFCS 235
+RC+R + ++ + A ++ + FSVW+ + TF S
Sbjct: 2634 AMRCIRANEAAAVDLRGV--RASTRLPQVGRRTGFSVWTG----------DERLSTFVIS 2681
Query: 236 SNWISGQELNIACLKNLDPK 255
S ++ ++ LK LD +
Sbjct: 2682 SVGAPPRDRDLQRLKILDSR 2701
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 54/133 (40%), Gaps = 26/133 (19%)
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
P G+ +LG P V RD +S + G + P L P+D+ LV+
Sbjct: 4839 PPGYGALGDVVSVGLDPPNAPVY--RDDSSEKGSG-------ERPRLAHPVDFRLVF--- 4886
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFD 200
+ N W P P+GY ++G +V P P+ EV C+R DL
Sbjct: 4887 ---RINGRSPVTMWKPVAPEGYVALGTIVQGAPLMPDTSEVLCLRADLA----------- 4932
Query: 201 AISKFSSSPFSVW 213
A ++F SP W
Sbjct: 4933 ASTRFFDSPIWRW 4945
>gi|240275696|gb|EER39209.1| vacuolar sorting protein [Ajellomyces capsulatus H143]
Length = 309
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 72/160 (45%), Gaps = 20/160 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++N+ F G HVGDWEH +R N T + +IYFS
Sbjct: 76 GHGIVDAFWFYFYSFN----LGNKVLNIRF---GNHVGDWEHSLVRFHNGTPK--AIYFS 126
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HS G+ +YD G + ++YS+ HA + PGT+ + IL G+ +D
Sbjct: 127 EHSFGE-AFSYDAVEKMGKRPVIYSASGTHAMYATPGTH---AYILPWGLLHDRTDRGPL 182
Query: 479 VDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D + Y ++ + + A +W F WG
Sbjct: 183 WDPARNYHGYTYDHNNDVLRASTVNPQSPTAWFYFAGHWG 222
>gi|146322838|ref|XP_749727.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|129556827|gb|EAL87689.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|159129135|gb|EDP54249.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 626
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 73/162 (45%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G H+GDWEH +R N G+ +++FS
Sbjct: 409 GNGIVDAFWFFFYSFN----LGNVVLNVRF---GNHIGDWEHCLVRFYN--GQPKALFFS 459
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G+ AY E +E G + ++YS+ HA + PG + S +L G+ +D
Sbjct: 460 AHSAGE---AYSYEAVEKIGQRPVIYSAVGTHAMYATPGVH---SYVLPWGLLHDQTDRG 513
Query: 477 LYVDSSIQYELVAAEYLGEGV-------VAEPSWLQFMRKWG 511
D + + +++ + + +A W F WG
Sbjct: 514 PLWDPLLNSHMYTYDHVNDTLRASTASPLAPTEWFYFNGHWG 555
>gi|226292315|gb|EEH47735.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 702
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 72/162 (44%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++N+ F G HVGDWEH +R N G+ +I+ S
Sbjct: 470 GNGVVDAFWFYFYSFN----LGNAVLNIRF---GNHVGDWEHSLVRFQN--GKPKAIFLS 520
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
+HS G+ AY E +E G + ++YS+ HA + PGT+ + IL G+ D
Sbjct: 521 EHSFGE---AYIYEAVEKVGKRPVIYSATGTHALYATPGTH---AYILPWGLLRDVTDRG 574
Query: 477 LYVDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D ++ Y ++ + + A W F WG
Sbjct: 575 PLWDPTMNYHGYTYDFTNDILRAANITPHSPTEWFYFAGHWG 616
>gi|225680640|gb|EEH18924.1| vacuolar sorting protein [Paracoccidioides brasiliensis Pb03]
Length = 702
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 72/162 (44%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++N+ F G HVGDWEH +R N G+ +I+ S
Sbjct: 470 GNGVVDAFWFYFYSFN----LGNAVLNIRF---GNHVGDWEHSLVRFQN--GKPKAIFLS 520
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
+HS G+ AY E +E G + ++YS+ HA + PGT+ + IL G+ D
Sbjct: 521 EHSFGE---AYIYEAVEKVGKRPVIYSATGTHALYATPGTH---AYILPWGLLRDVTDRG 574
Query: 477 LYVDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D ++ Y ++ + + A W F WG
Sbjct: 575 PLWDPTMNYHGYTYDFTNDILRAANITPHSPTEWFYFAGHWG 616
>gi|119480381|ref|XP_001260219.1| hypothetical protein NFIA_082700 [Neosartorya fischeri NRRL 181]
gi|119408373|gb|EAW18322.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 625
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 73/162 (45%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G H+GDWEH +R N G+ +++FS
Sbjct: 408 GNGIVDAFWFFFYSFN----LGNVVLNVRF---GNHIGDWEHCLVRFYN--GKPKALFFS 458
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G+ AY E +E G + ++YS+ HA + PG + S +L G+ +D
Sbjct: 459 AHSAGE---AYSYEAVEKIGQRPVIYSAVGTHAMYATPGVH---SYVLPWGLLHDQTDRG 512
Query: 477 LYVDSSIQYELVAAEYLGEGV-------VAEPSWLQFMRKWG 511
D + + +++ + + +A W F WG
Sbjct: 513 PLWDPLLNSHMYTYDHVNDTLRASTASPLAPTEWFYFNGHWG 554
>gi|392864169|gb|EAS35020.2| hypothetical protein CIMG_00369 [Coccidioides immitis RS]
Length = 363
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 105/262 (40%), Gaps = 32/262 (12%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ H ++ Y PS ++ N K + G + NL G
Sbjct: 44 VLKYAPLVWTHSEDPYQPSDIATHVENTIPQIKYEPVQGVPSPLTLDNLDELNALGGTDI 103
Query: 329 IDLPSDGGRQIV---------KHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
+G R + K+G E A + G D + F +N T
Sbjct: 104 FLTSKEGIRALPPWLQGVRPNKNGKTEGAVSSAIIVTERGDGVVDTFYFYFNAYNEGNT- 162
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GN 437
V ++ G HVGDWEH +R N G+ +I++SQH+ G A++ ++ GN
Sbjct: 163 ------VFGTQFGNHVGDWEHNMIRFEN--GQPQAIWYSQHASGH---AFEWNAVDKRGN 211
Query: 438 KAIVYSSKNGHASFPHPGTY---LQGSEILGIGVRNDAARSNLYVDSSI-----QYELVA 489
+ IV+SS HA + G + + G E L IG+ D D ++ +Y+ A
Sbjct: 212 RPIVFSSNGSHAVYATGGDHDHTIPGVE-LPIGLLVDKCDEGFLWDPTLSTYIFKYDRTA 270
Query: 490 AEYLGEGVVAEPSWLQFMRKWG 511
++ +WL F +WG
Sbjct: 271 KQFSTYDGHTPVNWLNFDGQWG 292
>gi|320036494|gb|EFW18433.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 345
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 105/262 (40%), Gaps = 32/262 (12%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ H ++ Y PS ++ N K + G + NL G
Sbjct: 26 VLKYAPLVWTHSEDPYQPSDIATHVENTIPQIKYEPVQGVPSPLTLDNLDELNALGGTDI 85
Query: 329 IDLPSDGGRQIV---------KHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTL 379
+G R + K+G E A + G D + F +N T
Sbjct: 86 FLTSKEGIRALPPWLQGVRPNKNGKTEGAVSSAIIVTERGDGVVDTFYFYFNAYNEGNT- 144
Query: 380 KVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GN 437
V ++ G HVGDWEH +R N G+ +I++SQH+ G A++ ++ GN
Sbjct: 145 ------VFGTQFGNHVGDWEHNMIRFEN--GQPQAIWYSQHASGH---AFEWNAVDKRGN 193
Query: 438 KAIVYSSKNGHASFPHPGTY---LQGSEILGIGVRNDAARSNLYVDSSI-----QYELVA 489
+ IV+SS HA + G + + G E L IG+ D D ++ +Y+ A
Sbjct: 194 RPIVFSSNGSHAVYATGGDHDHTIPGVE-LPIGLLVDKCDEGFLWDPTLSTYIFKYDRTA 252
Query: 490 AEYLGEGVVAEPSWLQFMRKWG 511
++ +WL F +WG
Sbjct: 253 KQFSTYDGHTPVNWLNFDGQWG 274
>gi|358367647|dbj|GAA84265.1| vacuolar protein sorting-associated protein 62 [Aspergillus
kawachii IFO 4308]
Length = 345
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 62/129 (48%), Gaps = 10/129 (7%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R N G +I++SQH+GG+ YD +G + I YS+ HA
Sbjct: 165 QFGDHVGDWEHNMIRFDN--GVPQAIWYSQHAGGE-AFTYDATEKQGKRPIAYSANGSHA 221
Query: 450 SFPHPGTYLQGSEILGIG---VRNDAARSNLY--VDSSIQYELVAAEYLGEGVVAE--PS 502
+ PGT+ L + V + R L+ + ++ Y AA + +
Sbjct: 222 VYAVPGTHDHAIPHLNLPFGFVMDYTDRGTLWDPIANAYAYSFDAASQTFQPYDRRYPAN 281
Query: 503 WLQFMRKWG 511
WL+F +WG
Sbjct: 282 WLEFNGQWG 290
>gi|448927213|gb|AGE50787.1| protein of unknown function (DUF946) [Paramecium bursaria Chlorella
virus CVB-1]
Length = 328
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 97/251 (38%), Gaps = 46/251 (18%)
Query: 267 ALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAI-DPSGSNLPSGGRNDG 325
+L + Y P V+ E Y P + + + N ++ G +V + I DP+ + DG
Sbjct: 6 SLAQRYAPHVYLDVREPYWPCTYADYIRNSRIIKADGTVVVDKIADPNIV-----AKYDG 60
Query: 326 EFWIDLPSD---GGRQIVKHGN------MESAKLYVHVKPAVGGTFTDIVMWVFCPFNGP 376
+ +DL D G R K + +E Y V DI + +NG
Sbjct: 61 NYRLDLKKDARAGMRDKTKFPSKLEMYCIEKRITYGDVDQPPSYDMIDIYYIMIYAYNG- 119
Query: 377 GTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG 436
+ H D E +RI N T ++++FS HSGG W D++
Sbjct: 120 -------------TLEPHDSDREFVMIRIRNNTP--YAMFFSHHSGGYWRWWKDVQKASD 164
Query: 437 NKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAA----------RSNLYVDSSIQYE 486
+ +Y++ HA F PGTY ILG G ND R V + Q +
Sbjct: 165 GRPNIYAAIESHAFFDAPGTY---RRILGFG--NDTVTPRATPLSFDRDYDLVMAGTQAD 219
Query: 487 LVAAEYLGEGV 497
L Y+G +
Sbjct: 220 LTKRPYMGSRI 230
>gi|344305481|gb|EGW35713.1| hypothetical protein SPAPADRAFT_58911 [Spathaspora passalidarum
NRRL Y-27907]
Length = 438
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 96/226 (42%), Gaps = 33/226 (14%)
Query: 254 PKLHAMPNCDQ-------IHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV 306
P + ++P DQ I + Y P V + +E YLP + F TN + YK G ++
Sbjct: 56 PPIVSLPKKDQRTLKDGEIPDYVLKYSPLVHLYSEERYLPYDIQKFVTNFHVTYKNGTII 115
Query: 307 -GEAIDPSGSNLPSGGRNDGEFWIDLPSD-----------GGRQIVKHGNMESAKLYVHV 354
G D + L ++ E ++ SD + + +G ++ A V +
Sbjct: 116 PGTEEDMTLEKLSKLPKHS-EIFLTSNSDFDKNPEWITGFKNKPNLINGEIKDAPA-VLI 173
Query: 355 KPAVGGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELW 413
G + D + F FN GP + G G HVGDWEH +R + GE
Sbjct: 174 TVDKGNGWVDSYWFYFYSFNLGPFVMGQG-------PYGNHVGDWEHSLVRF--YKGEPI 224
Query: 414 SIYFSQHSGGKWVAAYDLEY--IEGNKAIVYSSKNGHASFPHPGTY 457
++ S H GG +LE ++ I++S++ HA++P G Y
Sbjct: 225 IVWMSAHGGGGAFYYKNLEKYSLDEKHPIIFSARGTHANYPSVGQY 270
>gi|398396156|ref|XP_003851536.1| hypothetical protein MYCGRDRAFT_22198, partial [Zymoseptoria
tritici IPO323]
gi|339471416|gb|EGP86512.1| hypothetical protein MYCGRDRAFT_22198 [Zymoseptoria tritici IPO323]
Length = 454
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 98/259 (37%), Gaps = 46/259 (17%)
Query: 311 DPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVF 370
D + P R D + +D PS + G SA + V P G D + F
Sbjct: 226 DRTARPHPDLRRRDHDPTLDNPS----HKPQPGGKSSAPAILIVVPKPDGV-VDAFWFFF 280
Query: 371 CPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
FN L ++N+ F G HVGDWEH T+R N G+ I+ S+H+ G+ Y
Sbjct: 281 YSFN----LGQTVLNIRF---GNHVGDWEHTTVRFVN--GKPTQIFLSEHNFGQ-ALTYS 330
Query: 431 LEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAA 490
+ +VYS+ HA + PG + IL G+ +D D ++
Sbjct: 331 AVENRAKRPVVYSALGSHAMYATPGLH---PYILPFGLLHDETDRGPLWDPVQNFQAYTY 387
Query: 491 EYLGEGV-------VAEPSWLQFMRKWGPTIVYDSKTELDKIIKL-LPLMIRYSVENAVS 542
L V + W F WG DK +L P R++
Sbjct: 388 NPLTRTVRSSTLNPTSPTQWFDFAGHWG-----------DKFYELGDPRQYRFAG----- 431
Query: 543 KLPLELYGEEGPTGPKEKN 561
+ + GPTGPK KN
Sbjct: 432 ----QYHYVNGPTGPKFKN 446
>gi|297834544|ref|XP_002885154.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330994|gb|EFH61413.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 339
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 50/97 (51%), Gaps = 29/97 (29%)
Query: 441 VYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAE 500
V SSK+G FP PG S LG G+ SI+ V E
Sbjct: 233 VNSSKHGTCEFPSPGNV---SPRLGNGL-------------SIE-------------VKE 263
Query: 501 PSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSV 537
WL+ MR+WGPTI YDS +E++KI+ LLPL++ +S+
Sbjct: 264 LCWLEHMREWGPTIAYDSASEINKIMNLLPLLVGFSL 300
>gi|347841087|emb|CCD55659.1| hypothetical protein [Botryotinia fuckeliana]
Length = 401
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 91/214 (42%), Gaps = 28/214 (13%)
Query: 258 AMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNL 317
A+PN + + + Y P V+ E+Y PS + N Y L+ E PS NL
Sbjct: 88 ALPNT-TLPSYVFEYAPIVWLDQSEMYFPSDIYAQVLNTKP-YVNRSLIPEDTLPSPLNL 145
Query: 318 PSGGRNDGEFWIDLPSDGGRQIVKHGNME---SAKLYVHVKPA--VGGTFTDIVMWVFCP 372
+ ID ++ G + ++ S V P+ G T I V
Sbjct: 146 ET---------IDKLNNWGLDVYLTSTVDITTSPSYLTGVVPSSTTGRTQDAISCSVILT 196
Query: 373 FNGPGTLKVGIM---------NVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG 423
+G G + V M V ++G H+GDWEH +R N T + ++++SQH+ G
Sbjct: 197 DHGDGLVDVFYMYFYAYNQGNTVLGQELGNHIGDWEHNMIRFVNGTPQ--TVWYSQHANG 254
Query: 424 KWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
+ Y + G + I YS++ HA++ GT+
Sbjct: 255 Q-AFLYSVVEKNGKRPIAYSARGSHANYAISGTH 287
>gi|378733611|gb|EHY60070.1| hypothetical protein HMPREF1120_08042 [Exophiala dermatitidis
NIH/UT8656]
Length = 605
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 75/162 (46%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 387 GHGIVDAFWFFFYSFN----LGNVVLNVRF---GNHVGDWEHTAIRFHH--GKPKAVFFS 437
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
+H+ G +AY + +E G + ++YS+ HA + PGT+ IL G+ +D
Sbjct: 438 EHNFG---SAYSYDAVEKLGKRPVIYSAYGTHAMYSTPGTH---PYILPWGILHDVTDRG 491
Query: 477 LYVDSSIQYELVAAEYLGEGVVAE---PS----WLQFMRKWG 511
D ++ ++ + + A PS W F +WG
Sbjct: 492 PLWDPALNSHAYTYDFKADVLRASNITPSAPTEWFHFAGRWG 533
>gi|154293430|ref|XP_001547246.1| hypothetical protein BC1G_14341 [Botryotinia fuckeliana B05.10]
Length = 380
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 91/214 (42%), Gaps = 28/214 (13%)
Query: 258 AMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNL 317
A+PN + + + Y P V+ E+Y PS + N Y L+ E PS NL
Sbjct: 88 ALPNT-TLPSYVFEYAPIVWLDQSEMYFPSDIYAQVLNTKP-YVNRSLIPEDTLPSPLNL 145
Query: 318 PSGGRNDGEFWIDLPSDGGRQIVKHGNME---SAKLYVHVKPA--VGGTFTDIVMWVFCP 372
+ ID ++ G + ++ S V P+ G T I V
Sbjct: 146 ET---------IDKLNNWGLDVYLTSTVDITTSPSYLTGVVPSSTTGRTQDAISCSVILT 196
Query: 373 FNGPGTLKVGIM---------NVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG 423
+G G + V M V ++G H+GDWEH +R N T + ++++SQH+ G
Sbjct: 197 DHGDGLVDVFYMYFYAYNQGNTVLGQELGNHIGDWEHNMIRFVNGTPQ--TVWYSQHANG 254
Query: 424 KWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
+ Y + G + I YS++ HA++ GT+
Sbjct: 255 Q-AFLYSVVEKNGKRPIAYSARGSHANYAISGTH 287
>gi|67900934|ref|XP_680723.1| hypothetical protein AN7454.2 [Aspergillus nidulans FGSC A4]
gi|40742844|gb|EAA62034.1| hypothetical protein AN7454.2 [Aspergillus nidulans FGSC A4]
gi|259483760|tpe|CBF79414.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 641
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 72/162 (44%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G H+GDWEH +R + G +++FS
Sbjct: 421 GNGIVDAFWFYFYSFN----LGNVVLNVRF---GNHIGDWEHCLVRFHH--GIPKALFFS 471
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HSGG+ AY E +E G + ++YS+ HA + PG + S IL G+ +D
Sbjct: 472 AHSGGE---AYSYEALEKIGRRPVIYSATGTHAMYATPGVH---SYILPWGLLHDQTDRG 525
Query: 477 LYVDSSIQYELVAAEYLGEGV-------VAEPSWLQFMRKWG 511
D + + +++ + + A W F WG
Sbjct: 526 PLWDPLLNSHMYTYDHVSDTLRASTLSPTAPTEWFYFNGHWG 567
>gi|303313195|ref|XP_003066609.1| hypothetical protein CPC735_058340 [Coccidioides posadasii C735
delta SOWgp]
gi|240106271|gb|EER24464.1| hypothetical protein CPC735_058340 [Coccidioides posadasii C735
delta SOWgp]
Length = 354
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 80/181 (44%), Gaps = 23/181 (12%)
Query: 341 KHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEH 400
K+G E A + G D + F +N T V ++ G HVGDWEH
Sbjct: 116 KNGKTEGAVSSAIIVTERGDGVVDTFYFYFNAYNEGNT-------VFGTQFGNHVGDWEH 168
Query: 401 FTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY- 457
+R N G+ +I++SQH+ G A++ ++ GN+ IV+SS HA + G +
Sbjct: 169 NMIRFEN--GQPQAIWYSQHASGH---AFEWNAVDKRGNRPIVFSSNGSHAVYATGGDHD 223
Query: 458 --LQGSEILGIGVRNDAARSNLYVDSSI-----QYELVAAEYLGEGVVAEPSWLQFMRKW 510
+ G E L IG+ D D ++ +Y+ A ++ +WL F +W
Sbjct: 224 HTIPGVE-LPIGLLVDKCDEGFLWDPTLSTYIFKYDRTAKQFSTYDGHTPVNWLNFDGQW 282
Query: 511 G 511
G
Sbjct: 283 G 283
>gi|392863555|gb|EAS35687.2| vacuolar protein sorting protein 62 [Coccidioides immitis RS]
Length = 598
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 70/165 (42%), Gaps = 24/165 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L + N+ F G HVGDWEH +R GE +I+ S
Sbjct: 372 GDGIVDAFWFYFYSFN----LGNEVFNLRF---GNHVGDWEHSLVRFHR--GEPKAIFLS 422
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
+HSGG+ AY +E G + ++YS+ HA + +PG + S IL G+ D +
Sbjct: 423 EHSGGE---AYTYNAMEKLGKRPVIYSATGTHAIYANPGIH---SYILPWGLLRDQTDAG 476
Query: 477 LYVDSSIQYELVAAEYLGEGVV-------AEPSWLQFMRKWGPTI 514
D S+ A + + A W F WG I
Sbjct: 477 PLWDPSLNMHAYAYSPHNDTLYPSNRTPNAPTEWFSFRGHWGDKI 521
>gi|154317894|ref|XP_001558266.1| predicted protein [Botryotinia fuckeliana B05.10]
gi|347831543|emb|CCD47240.1| similar to similar to DUF946 domain-containing protein [Botryotinia
fuckeliana]
Length = 399
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 92/202 (45%), Gaps = 23/202 (11%)
Query: 44 NLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPD--GFYSLGHYCQFDSRPLR-G 100
+ G++ V S FN++W+ + K+ Y P D SLG + ++ R +
Sbjct: 7 DYGDLRVTMTSAFNWVWN-DKGSGASKNFEAYHPVSQGDLRPLGSLG-FSSYNERNGKFA 64
Query: 101 FVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPD 160
+LV ++ +SS + A+ P Y L+W DE G+++G FW P+ P
Sbjct: 65 ALLVGKNPSSS-----------GTAAVASPTGYDLIW-RDEKSGGDHDGS--FWRPRAPS 110
Query: 161 GYKSMGFLVTKTPNKPELDEVRCVRDDLTDKC-EVHHLIFDAISKFSSSPFSVWSTRPCN 219
GY S+G + + + + P D++ CVR DL ++D + S SVW +
Sbjct: 111 GYVSLGDVCSGSWSAPSNDKIWCVRADLVQNSNHFKAKVWDDQKSGAKSDCSVWDIGLPS 170
Query: 220 RGMLGRG---VSVGTFFCSSNW 238
G+ G +S TF +++W
Sbjct: 171 IGINGGEKIPISSETFRANNSW 192
>gi|260780786|ref|XP_002585521.1| hypothetical protein BRAFLDRAFT_111924 [Branchiostoma floridae]
gi|229270516|gb|EEN41532.1| hypothetical protein BRAFLDRAFT_111924 [Branchiostoma floridae]
Length = 926
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 71/174 (40%), Gaps = 22/174 (12%)
Query: 263 DQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGR 322
D++ L+ Y P V+ +E Y PSSV + N +Y G ++ + S LP+
Sbjct: 496 DELGVLVTKYAPKVWLAKNEKYNPSSVDFHLQN-VKVYDGG----KSYFSTPSTLPTCSE 550
Query: 323 N---DGEFWIDLPSD-----GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN 374
+ P GG Q+ G +Y VK + T TDI W+F +N
Sbjct: 551 TCYMSTAQPLRYPDSQLRFFGGEQV---GPTYQPPVYAVVK-RIDPTTTDIFYWMFYSYN 606
Query: 375 GPGTLKVGIMNVA-----FSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG 423
GP + +G HVGDWEH T+R+ +Y + SGG
Sbjct: 607 GPKKVCMGFRFFGKCAGGLKSFSHHVGDWEHMTIRLVGKQPRSIYVYSAHKSGG 660
>gi|303312275|ref|XP_003066149.1| hypothetical protein CPC735_053740 [Coccidioides posadasii C735
delta SOWgp]
gi|240105811|gb|EER24004.1| hypothetical protein CPC735_053740 [Coccidioides posadasii C735
delta SOWgp]
Length = 598
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 20/163 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L + N+ F G HVGDWEH +R GE +I+ S
Sbjct: 372 GDGIVDAFWFYFYSFN----LGNEVFNLRF---GNHVGDWEHSLVRF--HRGEPKAIFLS 422
Query: 419 QHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLY 478
+HSGG+ +E + G + ++YS+ HA + +PG + S IL G+ D +
Sbjct: 423 EHSGGEAYTYNAMEKL-GKRPVIYSATGTHAIYANPGIH---SYILPWGLLRDQTDAGPL 478
Query: 479 VDSSIQYELVAAEYLGEGVV-------AEPSWLQFMRKWGPTI 514
D S+ A + + A W F WG I
Sbjct: 479 WDPSLNMHAYAYSPHNDTLYPSNRTPNAPIEWFSFRGHWGDKI 521
>gi|345565906|gb|EGX48854.1| hypothetical protein AOL_s00079g493 [Arthrobotrys oligospora ATCC
24927]
Length = 347
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 110/266 (41%), Gaps = 41/266 (15%)
Query: 271 NYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWI- 329
+Y P V+ H D+ Y PS++ N + G + NL G+
Sbjct: 40 DYAPLVYLHSDDEYRPSNIRSMLRNTQPRVNFKPIPGIPQPVTLDNLDQLNDFGGDSVYL 99
Query: 330 ----DLPSDGGRQIVKHGNME---------SAKLYVHVKPAVGGTFTDIVMWVFCPFN-G 375
D+ +D + +K + SA + V+ K A T TD+ + F FN G
Sbjct: 100 TSVNDVAADSQQAWLKGVTPDATGLTNGAISATIIVNEKDA---TTTDVFYFYFYNFNSG 156
Query: 376 PGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE 435
P + K G HVGDWEH +R N G ++++SQH+ G+ A+ E +E
Sbjct: 157 PPVWGI--------KFGDHVGDWEHIMVRFQN--GIPSALWYSQHADGQ---AFTYEAVE 203
Query: 436 --GNKAIVYSSKNGHASFPHPGTY---LQGSEILGIGVRNDAARSNLYVDSSI-----QY 485
G + + YS+K HA++ G + + + G D + D ++ ++
Sbjct: 204 KLGKRPVGYSAKGSHANYAMGGLHDHTIPNFNLPGAFFLVDDTNKGILWDPTLSAYYYKF 263
Query: 486 ELVAAEYLGEGVVAEPSWLQFMRKWG 511
++ + + + +WL F KWG
Sbjct: 264 DVGTSSFTSYNPASPVNWLYFKGKWG 289
>gi|320040149|gb|EFW22083.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 598
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 70/165 (42%), Gaps = 24/165 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L + N+ F G HVGDWEH +R GE +I+ S
Sbjct: 372 GDGIVDAFWFYFYSFN----LGNEVFNLRF---GNHVGDWEHSLVRF--HRGEPKAIFLS 422
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
+HSGG+ AY +E G + ++YS+ HA + +PG + S IL G+ D +
Sbjct: 423 EHSGGE---AYTYNAMEKLGKRPVIYSATGTHAIYANPGIH---SYILPWGLLRDQTDAG 476
Query: 477 LYVDSSIQYELVAAEYLGEGVV-------AEPSWLQFMRKWGPTI 514
D S+ A + + A W F WG I
Sbjct: 477 PLWDPSLNMHAYAYSPHNDTLYPSNRTPNAPIEWFSFRGHWGDKI 521
>gi|134057456|emb|CAK37964.1| unnamed protein product [Aspergillus niger]
Length = 344
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 82/201 (40%), Gaps = 23/201 (11%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ + E Y+PS + + + + G A P N +G G
Sbjct: 40 VMEYAPLVWLYSKETYMPSDIGQQLVHTTPMVNYTAIKG-APSPLTLNNLAGLNKLGNTS 98
Query: 329 IDLPSDGGRQIV------------KHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGP 376
+ L S G ++G + A + G D + F +N
Sbjct: 99 VYLTSIEGIDATPEPAWFRGVKPDRNGKTKGAVSSTIITRDHGNGTVDAFYFYFYAYNM- 157
Query: 377 GTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG 436
G +G+ + G HVGDWEH +R N G ++++SQH+GG+ YD +G
Sbjct: 158 GNTVLGM------QFGDHVGDWEHNMIRFDN--GVPQALWYSQHAGGE-AFTYDATEKQG 208
Query: 437 NKAIVYSSKNGHASFPHPGTY 457
+ I YS+ HA + PGT+
Sbjct: 209 KRPIAYSANGSHAVYAVPGTH 229
>gi|317028078|ref|XP_001400511.2| vacuolar protein sorting-associated protein 62 [Aspergillus niger
CBS 513.88]
Length = 342
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 82/201 (40%), Gaps = 23/201 (11%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFW 328
+ Y P V+ + E Y+PS + + + + G A P N +G G
Sbjct: 40 VMEYAPLVWLYSKETYMPSDIGQQLVHTTPMVNYTAIKG-APSPLTLNNLAGLNKLGNTS 98
Query: 329 IDLPSDGGRQIV------------KHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGP 376
+ L S G ++G + A + G D + F +N
Sbjct: 99 VYLTSIEGIDATPEPAWFRGVKPDRNGKTKGAVSSTIITRDHGNGTVDAFYFYFYAYNM- 157
Query: 377 GTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG 436
G +G+ + G HVGDWEH +R N G ++++SQH+GG+ YD +G
Sbjct: 158 GNTVLGM------QFGDHVGDWEHNMIRFDN--GVPQALWYSQHAGGE-AFTYDATEKQG 208
Query: 437 NKAIVYSSKNGHASFPHPGTY 457
+ I YS+ HA + PGT+
Sbjct: 209 KRPIAYSANGSHAVYAVPGTH 229
>gi|297742188|emb|CBI33975.3| unnamed protein product [Vitis vinifera]
Length = 2801
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 95/228 (41%), Gaps = 34/228 (14%)
Query: 53 ISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
I+ F+ IW N S +K + + P +P G G P ++V D E
Sbjct: 659 IASFHLIW-WNQNSSSRKKLSIWRPV-VPRGMVYFGDIAVQGYEPPNTCIVV-HDTGDDE 715
Query: 113 AEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTK- 171
LFK+P LD+ LV Q E +F WLPQ P G+ S+G + K
Sbjct: 716 --------LFKAP-----LDFQLV--GQIKKQRGMESISF-WLPQAPPGFVSLGCIACKG 759
Query: 172 TPNKPELDEVRCVRDDL-TDKCEVHHLIFD-AISKFSSSPFSVWSTRPCNRGMLGRGVSV 229
TP + +RC+R D+ T + ++D + +K + PFS+W+ G +
Sbjct: 760 TPKPNDFSSLRCIRSDMVTGDQFLEESVWDTSDAKHTKEPFSIWAV----------GNDL 809
Query: 230 GTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVF 277
GTF S + + LK DP + + + I A I + +F
Sbjct: 810 GTFVVRSGF--KKPPKRFALKLADPNIPSGSDDTVIDAEISTFSAVLF 855
Score = 45.4 bits (106), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 35/166 (21%), Positives = 58/166 (34%), Gaps = 34/166 (20%)
Query: 65 LQSKKKSATFYEPAGIPDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFK 123
++S+ ++ + P P GF G Y D P +G V V A
Sbjct: 438 IESRDQTYALWRPRA-PPGFAVFGDYLTPLDKPPTKGVVAVNTSFAK------------- 483
Query: 124 SPALQKPLDYTLVWCSDEGGQ----------------GNYEGCAFFWLPQPPDGYKSMGF 167
+++P+ + L+W + G E W P+ PDGY ++G
Sbjct: 484 ---VKRPVSFKLIWPPSASEEISGSLGIDNVMPNPVLGEGESNCSIWFPEAPDGYVALGC 540
Query: 168 LVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVW 213
+V+ +P L C+ L C + I S S + W
Sbjct: 541 VVSPGRTRPPLSSAFCILASLVSPCALRDCITIGSGNMSHSRLAFW 586
>gi|325088360|gb|EGC41670.1| vacuolar sorting-associated protein [Ajellomyces capsulatus H88]
Length = 340
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 99/244 (40%), Gaps = 27/244 (11%)
Query: 238 WISGQELN-------IACLK-NLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSV 289
WIS + L+ I CL+ L + D I + Y P ++ H ++ Y+PS+
Sbjct: 2 WISHRTLSTLVASLLIQCLRFVLLVDATCLAQYDSIPDFVLKYAPLIYLHSEDQYMPSAF 61
Query: 290 SWFFTNGALLYKAGDLVGEAIDPSGSNLPS----GGRN----DGEFWIDLPS--DGGRQI 339
+ N + + NL S GG++ E LP G R
Sbjct: 62 ATLLENSKPTVNYTPVADVPSPLTLDNLDSLNSLGGKDVFLTTNEGIRALPEWFSGTRPD 121
Query: 340 VKHGNMESAKLYVHVKPAVGGTFTD-IVMWVFCPFNGPGTLKVGIM-----NVAFSKIGQ 393
+++ + V+ G D + + G ++ GI V + G
Sbjct: 122 KAGKTVDAVSSIIVVRDHGDGKMVDAFYFYYYGKACGNLSIDDGIRYNQGNTVIGIQFGN 181
Query: 394 HVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPH 453
HVGDWEH +R N G+ ++++SQH+GG+ Y +G + + YS+ HA +
Sbjct: 182 HVGDWEHNMIRFEN--GQPQAVWYSQHAGGE-AFTYRATEKQGLRPVGYSANGSHAVYAT 238
Query: 454 PGTY 457
PGT+
Sbjct: 239 PGTH 242
>gi|119193198|ref|XP_001247205.1| hypothetical protein CIMG_00976 [Coccidioides immitis RS]
Length = 573
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 13/132 (9%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R GE +I+ S+HSGG+ +E + G + ++YS+ HA
Sbjct: 365 RFGNHVGDWEHSLVRFHR--GEPKAIFLSEHSGGEAYTYNAMEKL-GKRPVIYSATGTHA 421
Query: 450 SFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV-------AEPS 502
+ +PG + S IL G+ D + D S+ A + + A
Sbjct: 422 IYANPGIH---SYILPWGLLRDQTDAGPLWDPSLNMHAYAYSPHNDTLYPSNRTPNAPTE 478
Query: 503 WLQFMRKWGPTI 514
W F WG I
Sbjct: 479 WFSFRGHWGDKI 490
>gi|392574168|gb|EIW67305.1| hypothetical protein TREMEDRAFT_33600, partial [Tremella
mesenterica DSM 1558]
Length = 450
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 49/101 (48%), Gaps = 13/101 (12%)
Query: 363 TDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSG 422
D+ W F PFN G F +G HV DWE FT+R N G S +S H G
Sbjct: 130 VDLWYWTFYPFN------YGKPVGPFGILGHHVADWERFTVRTIN--GTAVSADYSDHRG 181
Query: 423 GKWVAAY----DLEYIEGNKAIVYSSKNGHASFPHPGTYLQ 459
G++ A D+ +EG + + Y++ H +P PG +L+
Sbjct: 182 GRFSAGTVRWEDVTQVEG-RPVAYAAAGSHGIWPGPGEHLK 221
>gi|322705918|gb|EFY97501.1| vacuolar protein sorting-associated protein, putative [Metarhizium
anisopliae ARSEF 23]
Length = 401
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 105/251 (41%), Gaps = 45/251 (17%)
Query: 38 FASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG-----------IPDGFYS 86
A+ + G++ + S + ++W + ++S + P P+G++
Sbjct: 1 MANKTFDYGDLRITMTSGYRWVWD-DTGSGARRSVCLWTPNAQGNLCPVGDYAAPEGYWE 59
Query: 87 L-GHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQG 145
+ GH R +LV G + + K PA+ +P YT VW D G G
Sbjct: 60 INGH---------RASLLV----------GQNPNTTPKKPAVARPTGYTKVW-GDWGSGG 99
Query: 146 NYEGCAFFWLPQPPDGYKSMGFLVT-KTPNKPELDEVRCVRDDLTDKCE-VHHLIFDAIS 203
++G W P P GY ++G + T KP+++++ CVR+DL + + +D
Sbjct: 100 KHDG--IIWHPTAPAGYVALGDVGTYGYDTKPDVNKIWCVREDLVGYGKFLATSTWDDGG 157
Query: 204 KFSSSPFSVWSTRPCNRGMLGRG---VSVGTFFCSSNWIS--GQELNIACL---KNLDPK 255
+ S S+W+ +P + G+ G V TF SN+ G + + L K D
Sbjct: 158 SGAGSDCSLWAVQPGDVGVDGAERLPVLADTFVARSNYARPGGDQARVLLLPMPKKYDEY 217
Query: 256 LHAMPNCDQIH 266
A+P + H
Sbjct: 218 DVAVPKVEARH 228
>gi|115400755|ref|XP_001215966.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114191632|gb|EAU33332.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 617
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 71/162 (43%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G H+GDWEH +R + G+ +++FS
Sbjct: 400 GNGIVDAFWFYFYSFN----LGNVVLNVRF---GNHIGDWEHCLVRFHH--GKPKALFFS 450
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HSGG+ AY + +E G + ++YS+ HA + PG + IL G+ +D
Sbjct: 451 AHSGGE---AYSYDAVEKIGKRPVIYSAMGTHAMYATPGVH---DYILPWGLLHDQTDRG 504
Query: 477 LYVDSSIQYELVAAEYLGEGVVAEPS-------WLQFMRKWG 511
D + +Y + + A + W F WG
Sbjct: 505 PLWDPLLNSHSYTYDYDNDTLRASTASPDSPTEWFYFKGHWG 546
>gi|350635235|gb|EHA23597.1| hypothetical protein ASPNIDRAFT_52480 [Aspergillus niger ATCC 1015]
Length = 646
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 72/162 (44%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 430 GHGVVDAFWFYFYSFN----LGNVVLNVRF---GNHVGDWEHCLVRFHD--GKPKALFFS 480
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G+ AY E +E G + ++YS+ HA + PG + + IL G+ +D
Sbjct: 481 AHSAGE---AYSYEAVEKIGQRPVIYSAMGTHAMYATPGIH---AYILPWGLLHDQTDRG 534
Query: 477 LYVDSSIQYELVAAEYLGEGV---VAEPS----WLQFMRKWG 511
D + +Y + + A PS W F WG
Sbjct: 535 PLWDPLLNAHAYTYDYDKDALRASTATPSAPIEWFYFNGHWG 576
>gi|452841892|gb|EME43828.1| hypothetical protein DOTSEDRAFT_53114 [Dothistroma septosporum
NZE10]
Length = 411
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 78/188 (41%), Gaps = 24/188 (12%)
Query: 43 INLGEIEVCRISRFNFIWSCNLLQSKKKSATFY-EPAG--IPDGFYSLGHYCQFDSRPL- 98
++ G++ V + +++W + S + + PAG +P G S+G ++ P
Sbjct: 4 VDYGDLRVGMTTTLDWVWDDSSSGSNRDGNIYNPHPAGDLLPIG--SIGFRGHHNTEPRS 61
Query: 99 --RGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLP 156
RG +LV KS A + P Y L+W +D G G ++ W P
Sbjct: 62 GKRGVLLVGNVPG-------------KSAACKHPTGYNLIW-NDAGSGGKFD--VSIWRP 105
Query: 157 QPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTR 216
PDGY +MG + + KP ++ V C+R D T + +D + S W
Sbjct: 106 VAPDGYVAMGEYCSNSSTKPSVNAVWCIRQDYTADGKFGPSWWDDRGSGAKMNGSFWGVD 165
Query: 217 PCNRGMLG 224
P N + G
Sbjct: 166 PQNISLTG 173
>gi|296416127|ref|XP_002837732.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633615|emb|CAZ81923.1| unnamed protein product [Tuber melanosporum]
Length = 386
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 71/157 (45%), Gaps = 22/157 (14%)
Query: 363 TDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSG 422
TD + F FN T VA + G HVGDWEH +R + G SI++S+HSG
Sbjct: 161 TDAFYFYFYSFNLGNT-------VAGWRFGNHVGDWEHTAVRFID--GVPQSIFYSEHSG 211
Query: 423 GKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQG------SEILGIGVRNDAAR 474
G AY+ +E G + + YS+ HA++ PGT+ ++ G D +
Sbjct: 212 G---VAYEYGAVEKIGIRPVCYSAVGSHANYARPGTHYYAIPFHLLADHTDKGPLWDVTK 268
Query: 475 SNLYVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWG 511
++ + + ++++ G A W F +WG
Sbjct: 269 NSYMYNYDVDHDILKPSRDTPG--APVDWFHFGGRWG 303
>gi|317027267|ref|XP_001400559.2| vacuolar protein sorting protein 62 [Aspergillus niger CBS 513.88]
Length = 645
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 72/162 (44%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 426 GHGVVDAFWFYFYSFN----LGNVVLNVRF---GNHVGDWEHCLVRFHD--GKPKALFFS 476
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G+ AY E +E G + ++YS+ HA + PG + + IL G+ +D
Sbjct: 477 AHSAGE---AYSYEAVEKIGQRPVIYSAMGTHAMYATPGIH---AYILPWGLLHDQTDRG 530
Query: 477 LYVDSSIQYELVAAEYLGEGV---VAEPS----WLQFMRKWG 511
D + +Y + + A PS W F WG
Sbjct: 531 PLWDPLLNAHAYTYDYDKDALRASTATPSAPIEWFYFNGHWG 572
>gi|302690742|ref|XP_003035050.1| hypothetical protein SCHCODRAFT_105405 [Schizophyllum commune H4-8]
gi|300108746|gb|EFJ00148.1| hypothetical protein SCHCODRAFT_105405, partial [Schizophyllum
commune H4-8]
Length = 345
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 10/97 (10%)
Query: 360 GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
G F D+ + F +N + V +++G HVGDWEH +R N G +Y S
Sbjct: 144 GDFVDVFYFYFYSYNHG-------ITVLGTRLGNHVGDWEHSMVRFVN--GAPTHVYLSA 194
Query: 420 HSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGT 456
HSGG + YD G + +VY + HA++ GT
Sbjct: 195 HSGG-YAYTYDAVEKRGVRPVVYVADGSHANYATAGT 230
>gi|226291806|gb|EEH47234.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 337
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 59/123 (47%), Gaps = 11/123 (8%)
Query: 336 GRQIVKHGNMESAKLYVHVKPAVG-GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQH 394
G + K G E A V V G G D+ + + +N T V +++G H
Sbjct: 61 GTRPNKAGQTEDAVSSVVVLREHGDGKTADVFYFYYYAYNQGNT-------VFGTQLGNH 113
Query: 395 VGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHP 454
VGDWEH +R N + SI++SQH+ G+ Y+ +G + I YSS HA + P
Sbjct: 114 VGDWEHNMIRFVNEQPQ--SIWYSQHAAGQ-AFTYEATEKQGFRPIGYSSNGSHAVYATP 170
Query: 455 GTY 457
GT+
Sbjct: 171 GTH 173
>gi|295673210|ref|XP_002797151.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226282523|gb|EEH38089.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 659
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 79/184 (42%), Gaps = 24/184 (13%)
Query: 337 RQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVG 396
R++ + + S V + G D + F FN L ++N+ F G HVG
Sbjct: 405 REVKNNKSGHSKAPAVLIVVDKGNDVVDAFWFYFYSFN----LGNVVLNIRF---GNHVG 457
Query: 397 DWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHP 454
DWEH +R N G+ +I+ S+HS G+ AY E +E G + ++YS+ HA + P
Sbjct: 458 DWEHSLVRFHN--GKPKAIFLSEHSFGE---AYTYEAVEKIGKRPVIYSATGTHAIYATP 512
Query: 455 GTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPS-------WLQFM 507
GT+ + IL G+ D D + Y + + + A + W F
Sbjct: 513 GTH---AYILPWGLLRDVTDRGPLWDPIMNYHGYTYDSTNDILRAANTTPHSPTEWFYFA 569
Query: 508 RKWG 511
WG
Sbjct: 570 GHWG 573
>gi|358367614|dbj|GAA84232.1| vacuolar sorting-associated protein [Aspergillus kawachii IFO 4308]
Length = 654
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 72/162 (44%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 435 GHGVVDAFWFYFYSFN----LGNVVLNVRF---GNHVGDWEHCLVRFHD--GKPKALFFS 485
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G+ AY E +E G + ++YS+ HA + PG + + IL G+ +D
Sbjct: 486 AHSAGE---AYSYEAVEKIGQRPVIYSAMGTHAMYATPGIH---AYILPWGLLHDQTDRG 539
Query: 477 LYVDSSIQYELVAAEYLGEGV---VAEPS----WLQFMRKWG 511
D + +Y + + A PS W F WG
Sbjct: 540 PLWDPLLNAHAYTYDYDSDNLRASTATPSAPIEWFYFNGHWG 581
>gi|380475859|emb|CCF45029.1| vacuolar protein sorting-associated protein 62, partial
[Colletotrichum higginsianum]
Length = 362
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 86/209 (41%), Gaps = 37/209 (17%)
Query: 366 VMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK- 424
V V PF+ L G + G H+GDWEH +R + G+ +YFSQH G+
Sbjct: 161 VTQVLEPFD---RLVNGGKAASGMHFGDHIGDWEHNMIRFRD--GKPTGMYFSQHVDGES 215
Query: 425 --WVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGI---GVRNDAARSN-LY 478
W A L ++G + IVYS+ HA++P PG + + ++ G R D S Y
Sbjct: 216 YGWDDA-KLSKVDG-RPIVYSACGSHANYPTPGDQIHNAVLIDYCDEGKRWDPVLSAYFY 273
Query: 479 VDSSIQYELVAAEYLGEGVVAEP------SWLQFMRKWGPTIVYDSKTELDKIIKLLPLM 532
+ + L + E P S+ + +WG T DS
Sbjct: 274 RFDADSFTLTRLDPPDESYPTPPPPSNLTSFFYYSGRWGDTSYPDSDPR----------- 322
Query: 533 IRYSVENAVSKLPLELYGEEGPTGPKEKN 561
+ V + L + + GPTGP+ K+
Sbjct: 323 -----QETVPRFKLRRF-QTGPTGPRHKH 345
>gi|134057505|emb|CAK48859.1| unnamed protein product [Aspergillus niger]
Length = 628
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 72/162 (44%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 409 GHGVVDAFWFYFYSFN----LGNVVLNVRF---GNHVGDWEHCLVRFHD--GKPKALFFS 459
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G+ AY E +E G + ++YS+ HA + PG + + IL G+ +D
Sbjct: 460 AHSAGE---AYSYEAVEKIGQRPVIYSAMGTHAMYATPGIH---AYILPWGLLHDQTDRG 513
Query: 477 LYVDSSIQYELVAAEYLGEGV---VAEPS----WLQFMRKWG 511
D + +Y + + A PS W F WG
Sbjct: 514 PLWDPLLNAHAYTYDYDKDALRASTATPSAPIEWFYFNGHWG 555
>gi|452987190|gb|EME86946.1| hypothetical protein MYCFIDRAFT_151930 [Pseudocercospora fijiensis
CIRAD86]
Length = 353
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 43/76 (56%), Gaps = 9/76 (11%)
Query: 386 VAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDL----EYIEGNKAIV 441
+A +G HVGDWEH +R + + IY+SQHSGG+ A+D +Y +AI
Sbjct: 158 IAVGNVGNHVGDWEHNMIRFED--SKPTQIYYSQHSGGQ---AFDYSATEKYASTGRAIT 212
Query: 442 YSSKNGHASFPHPGTY 457
YS+ HA++ GT+
Sbjct: 213 YSANGTHANYATTGTH 228
>gi|342879295|gb|EGU80549.1| hypothetical protein FOXB_08927 [Fusarium oxysporum Fo5176]
Length = 499
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 57/117 (48%), Gaps = 14/117 (11%)
Query: 343 GNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFT 402
N SA V V G D + F +N L +MN+ F G HVGDWEH
Sbjct: 269 ANGYSAAPAVLVVVDKGSGIVDAFWFFFYSYN----LGQTVMNIRF---GNHVGDWEHCM 321
Query: 403 LRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
+R + G ++FS+H GG+ AY E +E G++ ++YS+ HA + PG +
Sbjct: 322 MRFEH--GIPRGVFFSEHEGGQ---AYAFEAVEKRGDRPVIYSAVGSHAMYALPGNH 373
>gi|388599108|ref|ZP_10157504.1| hypothetical protein VcamD_04334 [Vibrio campbellii DS40M4]
Length = 437
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 8/103 (7%)
Query: 123 KSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTK---TPNKPELD 179
+ PAL P+DY W S G EG W P PP GY ++G + + KP++
Sbjct: 98 QQPALAHPIDYEWRWDSVNSG-ATVEGS--IWRPIPPKGYVALGDVARNQYGSDAKPDVR 154
Query: 180 EVRCVRDDLTDK--CEVHHLIFDAISKFSSSPFSVWSTRPCNR 220
EV CVR+DLT + + I+D S+W RP R
Sbjct: 155 EVMCVREDLTADGILDGNTFIWDDAGSHGDHDVSLWEIRPDQR 197
>gi|302913207|ref|XP_003050867.1| hypothetical protein NECHADRAFT_61308 [Nectria haematococca mpVI
77-13-4]
gi|256731805|gb|EEU45154.1| hypothetical protein NECHADRAFT_61308 [Nectria haematococca mpVI
77-13-4]
Length = 492
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 14/101 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++N+ F G HVGDWEH +R + G+ ++FS
Sbjct: 278 GSGIVDAFWFFFYSYN----LGQTVLNIRF---GNHVGDWEHCMVRFEH--GKPRGVFFS 328
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
+H GG+ AY E +E G++ ++YS+ HA + PG +
Sbjct: 329 EHEGGQ---AYAFEAVEKRGDRPVIYSAVGSHAMYALPGVH 366
>gi|378729679|gb|EHY56138.1| hypothetical protein HMPREF1120_04235 [Exophiala dermatitidis
NIH/UT8656]
Length = 469
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 46/82 (56%), Gaps = 6/82 (7%)
Query: 389 SKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE---YIEGNKAIVYSSK 445
+ G HVGDWEH +R N G IYFSQH+ G+ V +D + +G + V+S++
Sbjct: 197 NHFGDHVGDWEHNMIRFKN--GRPTGIYFSQHASGQ-VCDWDDDGCFSKKGQRPFVFSAR 253
Query: 446 NGHASFPHPGTYLQGSEILGIG 467
HA++P G+++ ++ I
Sbjct: 254 GSHANYPSEGSHVHDEALVDIA 275
>gi|444427041|ref|ZP_21222438.1| hypothetical protein B878_13875 [Vibrio campbellii CAIM 519 = NBRC
15631]
gi|444239724|gb|ELU51282.1| hypothetical protein B878_13875 [Vibrio campbellii CAIM 519 = NBRC
15631]
Length = 437
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 8/103 (7%)
Query: 123 KSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTK---TPNKPELD 179
+ PAL P+DY W S G EG W P PP GY ++G + + KP++
Sbjct: 98 QQPALAHPIDYEWRWDSVNSG-ATVEGS--IWRPIPPKGYVALGDVARDQYGSDAKPDVR 154
Query: 180 EVRCVRDDLTDK--CEVHHLIFDAISKFSSSPFSVWSTRPCNR 220
EV CVR+DLT + + I+D S+W RP R
Sbjct: 155 EVMCVREDLTADGILDGNTFIWDDAGSHGDHDVSLWEIRPDQR 197
>gi|322710243|gb|EFZ01818.1| vacuolar sorting-associated protein [Metarhizium anisopliae ARSEF
23]
Length = 526
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R N G IYFS+H GG+ A+D G++ ++YS+ HA
Sbjct: 331 RFGNHVGDWEHSMVRFQN--GVPKGIYFSEHEGGQ-AYAWDAVEKRGDRPVIYSAVGSHA 387
Query: 450 SFPHPGTY 457
+ PG +
Sbjct: 388 MYATPGDH 395
>gi|255069963|ref|XP_002507063.1| predicted protein [Micromonas sp. RCC299]
gi|226522338|gb|ACO68321.1| predicted protein [Micromonas sp. RCC299]
Length = 5648
Score = 51.6 bits (122), Expect = 0.001, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 7/110 (6%)
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
P GF SLG + +P V + + SNL P + P+++ +VW D
Sbjct: 2911 PPGFVSLGDVAVAELQPPAS--TVVVQMRRVDRRRRRDSNL--RPPVAWPVNWEMVW-RD 2965
Query: 141 EG--GQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
G QG G FW P PPDGY +G + + + P LD C+R DL
Sbjct: 2966 SGWRAQGKKAGTISFWKPVPPDGYVPIGHVASASHAPPPLDTCACLRADL 3015
>gi|212544520|ref|XP_002152414.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210065383|gb|EEA19477.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 339
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 77/185 (41%), Gaps = 21/185 (11%)
Query: 336 GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHV 395
G Q HG E A V G D + F FN G + +G + G HV
Sbjct: 119 GTQPNSHGVTEGATSCAIVTVDHGDGTLDAFYFYFYSFN-KGDIVLGAV------WGDHV 171
Query: 396 GDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPG 455
GDWEH +R N GE ++++SQHSGG+ YD ++ + YS + HA++ G
Sbjct: 172 GDWEHNMVRFTN--GEPQALWYSQHSGGE-AFTYDAVLKINSRPVSYSGQGTHANYAIQG 228
Query: 456 TYLQGSEILGI----GVRNDAARSNLYVDSS-----IQYELVAAEYLGEGVVAEPSWLQF 506
+ I G+ G ND + D + Y+ V A + +WL F
Sbjct: 229 --VHDHTIPGVNLPFGPVNDYTSNGTLWDPTGNYYAYSYDNVTAVFTAYDSSYPVNWLYF 286
Query: 507 MRKWG 511
+WG
Sbjct: 287 NGQWG 291
>gi|255948158|ref|XP_002564846.1| Pc22g08330 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591863|emb|CAP98121.1| Pc22g08330 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 591
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 52/101 (51%), Gaps = 14/101 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R N G+ +++FS
Sbjct: 377 GNGIVDAFWFYFYSFN----LGNTVVNVRF---GNHVGDWEHCLVRFHN--GKPKALFFS 427
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
H GG+ AY E +E G + ++YS++ HA + G +
Sbjct: 428 AHQGGE---AYSYEAVEKIGQRPVIYSAEGSHAMYATAGVH 465
>gi|320582725|gb|EFW96942.1| Vacuolar protein sorting protein 62 [Ogataea parapolymorpha DL-1]
Length = 395
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 123/308 (39%), Gaps = 61/308 (19%)
Query: 243 ELNIACLKNLDPKLHAMPNCDQ-------IHALIRNYGPTVFFHPDEVYLPSSVSWFFT- 294
E+ ++ LK+L P L A P+ + + A + +Y P V+ + +E Y+P +++ F T
Sbjct: 5 EMRLSTLKSL-PPLKAWPDPHERTLRPGHVPAYVIDYAPLVYLYSEERYMPYNIAEFVTH 63
Query: 295 ------NGALLYKAGDLVG----EAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGN 344
NG L + G + EA+ S + E D D G G
Sbjct: 64 FHAELANGTKLDQFGPSLNLSALEALQGYHSPVNPVFMTANE---DFDKDPGWIT---GA 117
Query: 345 MESAKLYVHVKPAV---------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQH 394
Y P V G + D + F FN GP + G G H
Sbjct: 118 KNRPNFYTGELPDVPATLIVVDKGNGWVDAYWFYFYSFNEGPYVMGTG-------PFGDH 170
Query: 395 VGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHP 454
VGDWEH +R + GE ++ S H GG +LE ++ ++++++ HA++
Sbjct: 171 VGDWEHSLVRF--YKGEPVIVWMSAHGGGGAYFYTNLEKLD-QHPVIFAARGTHANYAST 227
Query: 455 GTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV----AEP-------SW 503
G Q + L + +D + S+ + +A Y GE V + P +W
Sbjct: 228 G---QHAHDLPYSILSDFTDRGPLWNPSLNF--LAYTYDGETVEYANGSHPGREEQLGNW 282
Query: 504 LQFMRKWG 511
L FM WG
Sbjct: 283 LHFMGYWG 290
>gi|322699668|gb|EFY91428.1| vacuolar sorting-associated protein [Metarhizium acridum CQMa 102]
Length = 528
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 3/68 (4%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R N G IYFS+H GG+ A+D G++ ++YS+ HA
Sbjct: 333 RFGNHVGDWEHSMVRFQN--GIPKGIYFSEHEGGQ-AYAWDAVEKRGDRPVIYSAVGSHA 389
Query: 450 SFPHPGTY 457
+ PG +
Sbjct: 390 MYATPGDH 397
>gi|340515349|gb|EGR45604.1| predicted protein [Trichoderma reesei QM6a]
Length = 522
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 7/70 (10%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNG 447
+ G HVGDWEH +R N G I+FS+H GG+ AY E +E G++ ++YS+
Sbjct: 322 RFGNHVGDWEHCMVRFQN--GVPRGIFFSEHEGGQ---AYAWEAVEKRGDRPVIYSAVGS 376
Query: 448 HASFPHPGTY 457
HA + PG++
Sbjct: 377 HAMYALPGSH 386
>gi|405120942|gb|AFR95712.1| hypothetical protein CNAG_02107 [Cryptococcus neoformans var.
grubii H99]
Length = 479
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 15/101 (14%)
Query: 363 TDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
D+ W F PFN G + GI+ G HV DWEH LR+ G S+ FS H
Sbjct: 180 VDLWYWAFYPFNFGKSASRFGIL-------GNHVADWEH--LRVRTVDGVPVSVDFSTHE 230
Query: 422 GGKWVAAY----DLEYIEGNKAIVYSSKNGHASFPHPGTYL 458
GG+ A D+E ++ ++ + Y + H +P PG ++
Sbjct: 231 GGRLSAGTVRWEDVEKVD-DRPVAYVAMGSHGVWPEPGDHV 270
>gi|350635200|gb|EHA23562.1| hypothetical protein ASPNIDRAFT_207448 [Aspergillus niger ATCC
1015]
Length = 319
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 60/135 (44%), Gaps = 22/135 (16%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R N G ++++SQH+GG+ +D +G + I YS+ HA
Sbjct: 140 QFGDHVGDWEHNMIRFDN--GVPQALWYSQHAGGE-AFTFDATEKQGKRPIAYSANGSHA 196
Query: 450 SFPHPGTYLQGSEILGI--GVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP------ 501
+ PGT+ L + G D Y D ++ +A Y A
Sbjct: 197 VYAVPGTHDHAIPHLNLPFGFVMD------YTDKGTLWDPIANAYAYSFDAASQTFQPYD 250
Query: 502 -----SWLQFMRKWG 511
SWL++ +WG
Sbjct: 251 RRYPVSWLEYNGQWG 265
>gi|302799906|ref|XP_002981711.1| hypothetical protein SELMODRAFT_444980 [Selaginella moellendorffii]
gi|300150543|gb|EFJ17193.1| hypothetical protein SELMODRAFT_444980 [Selaginella moellendorffii]
Length = 4331
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 66/166 (39%), Gaps = 24/166 (14%)
Query: 39 ASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGH-YCQFDSRP 97
A R +L + R + F+ IW + TF+ P +P G+ L + P
Sbjct: 1883 AVARFHLDRGVIHRCTHFDRIWVNKPGNTSSPEVTFWRPK-VPPGYVILSDCVTSGTAPP 1941
Query: 98 LRGFVLVARDLASSEAEGAHTSNLFKSP-ALQKPLDYTLVWCS---DEGGQGNYEGCAFF 153
+G V V F S ++KPL + LVW S N E C
Sbjct: 1942 SQGVVAV-----------------FNSHHRVKKPLKFDLVWSSYGNSSNSVLNEEPCCV- 1983
Query: 154 WLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
WLP P GYK++G + + + P L+ V CVR DL V +
Sbjct: 1984 WLPVAPPGYKAVGCVAERGTSPPSLNTVHCVRSDLLTSSAVTDCVL 2029
Score = 45.8 bits (107), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 151 AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDK 191
+ W P P GY ++G + T TP+ P LD VRCVR DL +
Sbjct: 2189 VYIWYPVAPAGYVALGCVATTTPDHPPLDMVRCVRMDLVSQ 2229
>gi|169613172|ref|XP_001800003.1| hypothetical protein SNOG_09717 [Phaeosphaeria nodorum SN15]
gi|160702667|gb|EAT82982.2| hypothetical protein SNOG_09717 [Phaeosphaeria nodorum SN15]
Length = 536
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 105/261 (40%), Gaps = 66/261 (25%)
Query: 336 GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHV 395
G+++V G ++ + V V G D + F +N L + NV F G HV
Sbjct: 283 GKKVVG-GRSDAPAVLVTVPKGDG--VVDAFWFFFYSYN----LGNAVFNVRF---GNHV 332
Query: 396 GDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPH 453
GDWEH +R N GE +++FS+HS G+ AY E +E G + + +S+ HA +
Sbjct: 333 GDWEHTVVRFHN--GEPKAVFFSEHSFGE---AYTYEAVEKIGKRPVGFSATGTHAMYAT 387
Query: 454 PGTY---LQGSEILGIGVRN---DAARSNLYVDSSIQYELVAAEYLGEGV--VAEPSWLQ 505
G + L G + + R D A+ N+Y S Y+ + L + A SW
Sbjct: 388 AGVHPYVLPGGILHDVTDRGPLWDPAQ-NMY---SFTYDYRTDKLLSSNLTPTAPTSWFY 443
Query: 506 FMRKWGPTIVYDSKTELDKIIKLL-PLMIRYSVENAVSKLPLELYGEEGPTGPKEKN--- 561
F WG DK L P R+ + + GP GP+ KN
Sbjct: 444 FAGHWG-----------DKFYPLSDPRQYRFVG---------QYHYVNGPLGPRFKNLGR 483
Query: 562 -------------NWVGDERG 569
W+GD RG
Sbjct: 484 QEICQGNGECVLKKWIGDRRG 504
>gi|260792448|ref|XP_002591227.1| hypothetical protein BRAFLDRAFT_106455 [Branchiostoma floridae]
gi|229276430|gb|EEN47238.1| hypothetical protein BRAFLDRAFT_106455 [Branchiostoma floridae]
Length = 970
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 71/174 (40%), Gaps = 22/174 (12%)
Query: 263 DQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGR 322
D++ L+ Y P V+ +E Y PSSV + N +Y G ++ + S LP+
Sbjct: 640 DELGVLVTKYAPKVWLAKNEKYNPSSVDFHLKN-VKVYDGG----KSYFSTPSTLPTCSE 694
Query: 323 N---DGEFWIDLPSD-----GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN 374
+ P GG + G +Y VK + T TDI W+F +N
Sbjct: 695 TCYMSTAQPLRYPDSQLRFFGGELV---GPTYQPPVYAIVK-RIDPTTTDIFYWMFYSYN 750
Query: 375 GPGTLKVGIM-----NVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG 423
GP + +GI HVGDWEH T+R+ +Y + SGG
Sbjct: 751 GPKKVCMGIRFWGKCAGGLKSFVHHVGDWEHMTIRLVGKQPRSIYVYSAHKSGG 804
>gi|367013736|ref|XP_003681368.1| hypothetical protein TDEL_0D05730 [Torulaspora delbrueckii]
gi|359749028|emb|CCE92157.1| hypothetical protein TDEL_0D05730 [Torulaspora delbrueckii]
Length = 459
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 34/175 (19%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G + D + F PFN +G + + G HVGDWEH +R F GE ++ S
Sbjct: 189 GNGWVDAFWFYFYPFN------LGPYVMGYGPWGNHVGDWEHSLVRF--FDGEPKYLWMS 240
Query: 419 QHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNGHASFPHPGTYLQGSE 462
H GG +AY + IE ++ +++SS+ HA++P G +
Sbjct: 241 AHGGG---SAYRFDAIEKVKKLRREHGKLTPDVIHRPLIFSSRGTHANYPSVGQHSHDVP 297
Query: 463 ILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP------SWLQFMRKWG 511
+ + + R ++ D S+ + + EP SWL F +WG
Sbjct: 298 FFFMPLSDFTDRGPMW-DPSLNFYAYTLDDQDVTPRGEPEEALGTSWLHFEGRWG 351
>gi|440640831|gb|ELR10750.1| hypothetical protein GMDG_05005 [Geomyces destructans 20631-21]
Length = 568
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 49/98 (50%), Gaps = 12/98 (12%)
Query: 361 TFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
T D+ F FN GP V ++G HVGDWEH +R + E +I+FSQ
Sbjct: 307 TVVDVFYTYFYSFNLGP--------TVLGQRLGNHVGDWEHNMIRFHSGVPE--AIWFSQ 356
Query: 420 HSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
H GG+ AYD G + + Y+++ HA++ G +
Sbjct: 357 HGGGQ-AFAYDAVEKIGKRPVGYAARETHANYARGGRH 393
>gi|258574459|ref|XP_002541411.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237901677|gb|EEP76078.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 537
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 65/143 (45%), Gaps = 24/143 (16%)
Query: 383 IMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAI 440
++NV F G HVGDWEH +R + GE +I+ S+H+GG+ AY +E G + +
Sbjct: 333 VLNVRF---GNHVGDWEHSLVRFHH--GEPKAIFLSEHAGGE---AYTYNAVEKSGKRPV 384
Query: 441 VYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAE 500
+YS+ HA + PG + + IL G+ D + D ++ L A Y
Sbjct: 385 IYSATGTHAMYATPGIH---AYILPWGLLRDRTDTGPLWDPALN--LHAYIYSVNNDTLY 439
Query: 501 PS---------WLQFMRKWGPTI 514
PS W F WG I
Sbjct: 440 PSSRTPHAPTEWFSFRGHWGDKI 462
>gi|67537274|ref|XP_662411.1| hypothetical protein AN4807.2 [Aspergillus nidulans FGSC A4]
gi|40741187|gb|EAA60377.1| hypothetical protein AN4807.2 [Aspergillus nidulans FGSC A4]
gi|259482346|tpe|CBF76741.1| TPA: hypothetical protein ANIA_04807 [Aspergillus nidulans FGSC A4]
Length = 397
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 3/66 (4%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R + G +I++SQHSGG+ YD +G + + YS HA
Sbjct: 164 EFGNHVGDWEHNMIRFQD--GAPQAIWYSQHSGGQ-AFTYDATEKQGKRPVAYSGNGTHA 220
Query: 450 SFPHPG 455
+ PG
Sbjct: 221 VYSTPG 226
>gi|358387922|gb|EHK25516.1| hypothetical protein TRIVIDRAFT_219282 [Trichoderma virens Gv29-8]
Length = 532
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 41/70 (58%), Gaps = 7/70 (10%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNG 447
+ G HVGDWEH +R N G I+FS+H GG+ AY E +E G++ ++YS+
Sbjct: 331 RFGNHVGDWEHCMVRFQN--GVPRGIFFSEHEGGQ---AYAWEAVEKRGDRPVIYSAVGS 385
Query: 448 HASFPHPGTY 457
HA + PG +
Sbjct: 386 HAMYALPGNH 395
>gi|238881334|gb|EEQ44972.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 442
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 88/208 (42%), Gaps = 24/208 (11%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRN 323
+I + Y P V + +E Y P V F TN + ++ G + + + + N
Sbjct: 79 EIPDYVIKYAPLVHLYSEERYFPYDVKKFVTNFHVTWRNGSIYPGTENYMNLDKLANLPN 138
Query: 324 DGEFWIDLPSD-----------GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCP 372
+ ++ SD + + +G ++ A + V G + D + F
Sbjct: 139 STDLFLTANSDFDSDPEWITGLKNKPSLINGEIKDAPATLIVVDK-GNGWVDAFWFYFYS 197
Query: 373 FN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDL 431
FN GP + G G HVGDWEH +R + GE ++ S HSGG +L
Sbjct: 198 FNLGPYVMGQG-------PYGNHVGDWEHSLVRY--YKGEPVIVWMSAHSGGGGYYYDNL 248
Query: 432 EY--IEGNKAIVYSSKNGHASFPHPGTY 457
E ++ N I++S++ HA++P G +
Sbjct: 249 EKYSLDPNHPIIFSARGTHANYPSVGQH 276
>gi|407924602|gb|EKG17635.1| Vacuolar protein sorting-associated protein 62 [Macrophomina
phaseolina MS6]
Length = 547
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 64/138 (46%), Gaps = 20/138 (14%)
Query: 383 IMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAI 440
++NV F G HVGDWEH +R + G+ +++FS+HS G+ AY E +E G + +
Sbjct: 343 VLNVRF---GNHVGDWEHTVVRFHH--GKPKAVFFSEHSFGE---AYAWEAVEKIGKRPV 394
Query: 441 VYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV-- 498
Y + HA + PGT+ S IL G+ +D D ++ +Y + +
Sbjct: 395 GYVATGTHAQYATPGTH---SYILPWGLLHDITDRGPLWDPALNVHSYTYDYQKDKLKSS 451
Query: 499 -----AEPSWLQFMRKWG 511
A W F WG
Sbjct: 452 TITPHAPIGWFYFQGHWG 469
>gi|407918477|gb|EKG11748.1| Vacuolar protein sorting-associated protein 62 [Macrophomina
phaseolina MS6]
Length = 353
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 88/206 (42%), Gaps = 23/206 (11%)
Query: 263 DQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGR 322
+ + + +Y P + H ++ Y PS + N ++ E + SG++ P
Sbjct: 38 ESVPQFVLDYAPLSYIHHEDAYFPSDLQAQVDN-----TQPEVDFEVV--SGASSPLDLN 90
Query: 323 NDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTD-IVMWVFCPFNGPGTLKV 381
N G ++ D K ++ VKP G D I V G GT+
Sbjct: 91 NLGT--LNSYGDNVYLTSKEDITKNPGWLNGVKPDCSGKTNDAITAAVIVNDKGNGTVDA 148
Query: 382 ----------GIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDL 431
G + + ++G HVGDWEH +R + GE +++SQHS G+ AY +
Sbjct: 149 FYMYFYAYNWGGLVLDTLQVGNHVGDWEHNMIRFED--GEPKWVWYSQHSNGQNF-AYSI 205
Query: 432 EYIEGNKAIVYSSKNGHASFPHPGTY 457
+ G++ I Y + HA++ PGT+
Sbjct: 206 LHKSGDRPINYVANGTHANYAIPGTH 231
>gi|121715250|ref|XP_001275234.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119403391|gb|EAW13808.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 621
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 14/101 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G +++FS
Sbjct: 407 GDGIVDAFWFFFYSFN----LGNVVLNVRF---GNHVGDWEHCLVRFHH--GRPKALFFS 457
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
HS G+ AY E +E G + ++YS+ HA + PG +
Sbjct: 458 AHSAGE---AYSYEAVEKIGQRPVIYSALGTHAMYATPGVH 495
>gi|357482269|ref|XP_003611420.1| Vacuolar protein sorting-associated protein 13A [Medicago truncatula]
gi|355512755|gb|AES94378.1| Vacuolar protein sorting-associated protein 13A [Medicago truncatula]
Length = 4721
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 22/166 (13%)
Query: 53 ISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
++ F IW L S+K+ + + P +P G G P +++
Sbjct: 2540 VASFRLIWWNQGLNSRKR-LSIWRPV-VPTGMVYFGDVAVKGYEPPNTCIVL-------- 2589
Query: 113 AEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTK- 171
+ N+FK+P LD+ LV Q E +F WLPQ P G+ S+G + K
Sbjct: 2590 -HDSRDENVFKTP-----LDFQLV--GQIKKQRGMESISF-WLPQAPPGFVSLGCVACKG 2640
Query: 172 TPNKPELDEVRCVRDDLT--DKCEVHHLIFDAISKFSSSPFSVWST 215
P + E +RC+R DL DK + + +K + PFS+W+
Sbjct: 2641 KPKQHEFSTLRCMRSDLVAGDKFLEESVWDTSDAKHVTEPFSIWTV 2686
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 57/156 (36%), Gaps = 29/156 (18%)
Query: 70 KSATFYEPAGIPDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQ 128
++ F+ P P GF LG Y D P +G + V + S ++
Sbjct: 2329 QTFAFWRPHA-PPGFAVLGDYLTPLDKPPTKGVLAVNTN----------------SITVK 2371
Query: 129 KPLDYTLVW-----------CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPE 177
+P+ + L+W SD + + W PQ P GY ++G +VT+ P
Sbjct: 2372 RPIHFRLIWPPLGTSGEEMDNSDLSWKTEVDDSCSIWFPQAPKGYVALGCIVTQGRTPPP 2431
Query: 178 LDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVW 213
L C+ C + I + SSS W
Sbjct: 2432 LSSALCIPSSSVSLCSLRDCIMIGMPNTSSSSVRFW 2467
>gi|302768787|ref|XP_002967813.1| hypothetical protein SELMODRAFT_440026 [Selaginella moellendorffii]
gi|300164551|gb|EFJ31160.1| hypothetical protein SELMODRAFT_440026 [Selaginella moellendorffii]
Length = 4754
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 66/169 (39%), Gaps = 24/169 (14%)
Query: 36 QGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDS 95
G A R +L + R + F+ IW + TF+ P +P G+ L +
Sbjct: 1885 HGDAVARFHLDRGVIHRCTHFDRIWVNKPGNTSSPEVTFWRPK-VPPGYVILSDCVTSGT 1943
Query: 96 RP-LRGFVLVARDLASSEAEGAHTSNLFKSP-ALQKPLDYTLVWCS---DEGGQGNYEGC 150
P +G V V F S ++KPL + LVW S N E C
Sbjct: 1944 APPSQGVVAV-----------------FNSHHRVKKPLKFDLVWSSYGNSSNSVFNEEPC 1986
Query: 151 AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLIF 199
WLP P GYK++G + + P L+ V CVR DL V +
Sbjct: 1987 CV-WLPVAPPGYKAVGCVAERGTFPPSLNTVHCVRSDLLTSSAVTDCVL 2034
Score = 45.8 bits (107), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 151 AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDK 191
+ W P P GY ++G + T TP+ P LD VRCVR DL +
Sbjct: 2194 VYIWYPVAPAGYVALGCVATTTPDHPPLDMVRCVRMDLVSQ 2234
>gi|308811993|ref|XP_003083304.1| A96521 protein F21D18.22 (ISS) [Ostreococcus tauri]
gi|116055184|emb|CAL57580.1| A96521 protein F21D18.22 (ISS) [Ostreococcus tauri]
Length = 2673
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 60/156 (38%), Gaps = 17/156 (10%)
Query: 44 NLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVL 103
N + VCR + IW Q K+ + + P L H F R + L
Sbjct: 988 NTPALVVCRTNDLVKIWWTRGKQKSIKTLSAWHPRAPSHTELELRHPF-FSERTEEDYKL 1046
Query: 104 VARDLASSEAEGAHTSNLF---------KSPALQKPLDYTLVWCSDEGGQGNYEGCAFFW 154
V A S L ++P P+D+ +W SD FW
Sbjct: 1047 VPFGTILVSGHNAPRSALMAVVLEYDSDQTPPTAYPIDFENIWTSDNKDVS-------FW 1099
Query: 155 LPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
P P GY ++G +VTK+ KP + V CVR+ LTD
Sbjct: 1100 KPIAPAGYAAIGNIVTKSIEKPSTECVVCVREALTD 1135
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 62/161 (38%), Gaps = 34/161 (21%)
Query: 64 LLQSKKKSATFYEPAGIPDGFYSLGH-YCQFDSRPLRGFVLVARDLASSEAEGAHTSNLF 122
+ SK + F+ P P G+ S GH + D+ PL V A +LF
Sbjct: 432 IASSKNSTIGFWAPVP-PTGYVSTGHCISKGDNPPLHTRVF------------AEDRDLF 478
Query: 123 KSP----ALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
+ P L P Y G+ C W P PP G+ S+G +VT PE
Sbjct: 479 QPPDSFEQLIPPRKYG----------GSQRLC--IWKPVPPHGFVSVGVIVTTEDEHPEF 526
Query: 179 DEVRCVRDDLTDKCEVHHLI----FDAISKFSSSPFSVWST 215
D + CVR DL + L F + ++SP W T
Sbjct: 527 DTIACVRADLVSRITHGQLTKYNSFWVDDEHAASPTHFWIT 567
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 41/86 (47%), Gaps = 5/86 (5%)
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPN-KPELDEVRCVRDDLTDKC--EVHHL 197
E +G+ + C FW P P G+ S+G + T + N +P LD VRC+R +L +
Sbjct: 287 ERVEGSAQPC--FWRPIAPKGFISLGLIATASENEEPSLDTVRCIRQELVTNIGKTSTRV 344
Query: 198 IFDAISKFSSSPFSVWSTRPCNRGML 223
F + + S ++W T G +
Sbjct: 345 DFTLLGDSTVSKLTLWKTNNAADGFV 370
>gi|68489428|ref|XP_711465.1| potential vacuolar targeting protein [Candida albicans SC5314]
gi|68489461|ref|XP_711448.1| potential vacuolar targeting protein [Candida albicans SC5314]
gi|46432751|gb|EAK92220.1| potential vacuolar targeting protein [Candida albicans SC5314]
gi|46432769|gb|EAK92237.1| potential vacuolar targeting protein [Candida albicans SC5314]
Length = 442
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 87/208 (41%), Gaps = 24/208 (11%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRN 323
+I + Y P V + +E Y P V F TN + ++ G + + + N
Sbjct: 79 EIPDYVIKYAPLVHLYSEERYFPYDVKKFVTNFHVTWRNGSIYPGTEKNMNLDKLANLPN 138
Query: 324 DGEFWIDLPSD-----------GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCP 372
+ ++ SD + + +G ++ A + V G + D + F
Sbjct: 139 STDLFLTANSDFDSDPEWITGLKNKPSLINGEIKDAPATLIVVDK-GNGWVDAFWFYFYS 197
Query: 373 FN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDL 431
FN GP + G G HVGDWEH +R + GE ++ S HSGG +L
Sbjct: 198 FNLGPYVMGQG-------PYGNHVGDWEHSLVRY--YKGEPVIVWMSAHSGGGGYYYDNL 248
Query: 432 EY--IEGNKAIVYSSKNGHASFPHPGTY 457
E ++ N I++S++ HA++P G +
Sbjct: 249 EKYSLDPNHPIIFSARGTHANYPSVGQH 276
>gi|409048126|gb|EKM57604.1| hypothetical protein PHACADRAFT_206496 [Phanerochaete carnosa
HHB-10118-sp]
Length = 332
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 72/181 (39%), Gaps = 30/181 (16%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R N G IY S H GG + Y +G +A+ Y + HA
Sbjct: 154 QFGDHVGDWEHSMVRFVN--GVPQDIYLSAHDGG-FAYTYGALPSQGGRAVTYVANGTHA 210
Query: 450 SFPHPGTYLQGSEIL----GIGVRNDAARS--NLYVDSSIQYELVA--AEYLGEGVVAE- 500
++ PG + +L G D + + D+S Q VA A GE +E
Sbjct: 211 NYATPGKHQHDLPLLDDYTDAGPLWDVTLNFRGYWFDNSTQTFTVANGANVGGEEEASEG 270
Query: 501 PSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEK 560
WL+F WG D+ LL E E +GPTGP K
Sbjct: 271 VGWLEFAGHWG-----------DQQYDLL-------FEGQYCVTATECKYVDGPTGPIAK 312
Query: 561 N 561
N
Sbjct: 313 N 313
>gi|121704912|ref|XP_001270719.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119398865|gb|EAW09293.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 343
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 84/201 (41%), Gaps = 25/201 (12%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRN 323
+I + + Y P V+ H +VY+PS + + + + G + + NL S N
Sbjct: 31 EIPSFVLEYAPLVWLHSQDVYMPSDIGQQLVHTTPMVDWKGISGASSPLTLDNLDS--LN 88
Query: 324 D-GEFWIDLPSDGG----------RQIV--KHGNMESAKLYVHVKPAVGGTFTDIVMWVF 370
D G + L S G R V + G E A V V D + F
Sbjct: 89 DLGNTSVYLTSHEGIDAQPQPPWFRGTVPDQQGKTEGAISSVIVVRDHNNGTLDAFYFYF 148
Query: 371 CPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
+N T ++ + F G H+GDWEH +R N G +I++SQH+GG+ Y+
Sbjct: 149 YAYNQGNT----VLGMEF---GDHIGDWEHNMIRFQN--GSPQAIWYSQHAGGQ-AFTYN 198
Query: 431 LEYIEGNKAIVYSSKNGHASF 451
+G + YS+ HA +
Sbjct: 199 ATEKQGKRPYAYSANGTHAVY 219
>gi|358390269|gb|EHK39675.1| hypothetical protein TRIATDRAFT_143172 [Trichoderma atroviride IMI
206040]
Length = 507
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 68/161 (42%), Gaps = 29/161 (18%)
Query: 306 VGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQ--IVKHGNMESAKLYVHVKPAV----- 358
V DP N P+ + PS G Q I+ G+ A Y +
Sbjct: 231 VHRISDPRVKNKPA--------YFPTPSPGKEQQRIIPRGHKPDADGYSKAPATLVLVDK 282
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++ + F G HVGDWEH +R N GE I+FS
Sbjct: 283 GSGIVDAFWFFFYSYN----LGQTVLGLRF---GNHVGDWEHCMVRFQN--GEPRGIFFS 333
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
+H GG+ AY IE G + ++YS+ HA + PG +
Sbjct: 334 EHEGGQ---AYAWSAIEKRGVRPVIYSAVGSHAMYALPGPH 371
>gi|241954870|ref|XP_002420156.1| vacuolar targetting protein, putative [Candida dubliniensis CD36]
gi|223643497|emb|CAX42376.1| vacuolar targetting protein, putative [Candida dubliniensis CD36]
Length = 441
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 88/210 (41%), Gaps = 28/210 (13%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAG----------DLVGEAIDPS 313
+I + Y P V + +E Y P V F TN + ++ G +L A P+
Sbjct: 78 EIPDYVIKYAPLVHLYSEERYFPYDVKKFVTNFHVTWRNGSIYPGTETNMNLDKLAHLPN 137
Query: 314 GSNLPSGGRNDGEF---WIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVF 370
++L +D + WI + I A L V K G + D + F
Sbjct: 138 STDLFLTANSDFDADPEWITGLKNKPSLINGEIKDAPATLIVVDK---GNGWVDAFWFYF 194
Query: 371 CPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAY 429
FN GP + G G HVGDWEH +R + GE ++ S HSGG
Sbjct: 195 YSFNLGPYVMGQG-------PYGNHVGDWEHSLVRY--YKGEPVIVWMSAHSGGGGYYYD 245
Query: 430 DLEY--IEGNKAIVYSSKNGHASFPHPGTY 457
+LE ++ N I++S++ HA++P G +
Sbjct: 246 NLEKYSLDPNHPIIFSARGTHANYPSVGQH 275
>gi|255729288|ref|XP_002549569.1| hypothetical protein CTRG_03866 [Candida tropicalis MYA-3404]
gi|240132638|gb|EER32195.1| hypothetical protein CTRG_03866 [Candida tropicalis MYA-3404]
Length = 445
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV-GEAID---------PS 313
+I + Y P V+ + +E Y P + F TN + +K G +V G D P+
Sbjct: 82 EIPDYVIKYAPLVYLYSEERYFPYDIKKFVTNFHVRWKNGTVVPGTEHDMNLEKLSKLPN 141
Query: 314 GSNLPSGGRNDGEF---WIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVF 370
++L +D + WI + I A L V K G + D + F
Sbjct: 142 STHLFLTANSDFDCDPEWITGLKNKPSLIDGEIKDAPATLIVVDK---GNGWVDAYWFYF 198
Query: 371 CPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAY 429
FN GP + G G HVGDWEH +R + G+ ++ S HSGG
Sbjct: 199 YSFNLGPYVMGQG-------PYGNHVGDWEHSLVRY--YKGKPIIVWMSAHSGGGGYYYQ 249
Query: 430 DLEY--IEGNKAIVYSSKNGHASFPHPGTY 457
LE ++ N I++S++ HA++P G +
Sbjct: 250 KLEKYAMDPNHPIIFSARGTHANYPSVGQH 279
>gi|254586525|ref|XP_002498830.1| ZYRO0G19602p [Zygosaccharomyces rouxii]
gi|238941724|emb|CAR29897.1| ZYRO0G19602p [Zygosaccharomyces rouxii]
Length = 445
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 103/266 (38%), Gaps = 63/266 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLR 404
H+K PAV G + D + F PFN +G + + G HVGDWEH +R
Sbjct: 161 HIKNGPAVLMVVDKGNGWVDAFWFYFYPFN------LGPFIMGYGPWGNHVGDWEHSLVR 214
Query: 405 ICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNGH 448
+ GE I+ S H GG +AY E +E ++ +++S+ H
Sbjct: 215 F--YRGEPRYIWMSAHGGG---SAYRFEAVEKVKKLRRRNGKLTPEIVHRPLIFSASGTH 269
Query: 449 ASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP------- 501
A++ G + + + + R ++ D S+ Y A + GE ++
Sbjct: 270 ANYASVGQHAHDVPFFFMPLSDFTDRGPMW-DPSMNY--YAYVFDGEKIIPSSDREESIG 326
Query: 502 -SWLQFMRKWGPTIVY--DSKTE---------------LDKIIKLLPLMIRYSVENAVSK 543
+WL F WG + DS+ LDK + + L R+ N
Sbjct: 327 VNWLHFRGHWGDKQLPWGDSRQRWCPVQWRYIDGPTGPLDKNLPRMSLCPRFVWWNFWKG 386
Query: 544 LPLELYGEEGPTGPKEKNNWVGDERG 569
P ++G EKN+ +GD G
Sbjct: 387 CPARRLIKKGEGLDAEKNDLIGDNCG 412
>gi|392571004|gb|EIW64176.1| hypothetical protein TRAVEDRAFT_138685 [Trametes versicolor
FP-101664 SS1]
Length = 336
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 87/213 (40%), Gaps = 35/213 (16%)
Query: 360 GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
G D + F F+ G ++++ F G HVGDWEH +R N G ++Y S
Sbjct: 129 GGILDAFFFYFYSFDHGGK----VLDIEF---GDHVGDWEHSMVRFVN--GAPTTLYLSA 179
Query: 420 HSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR----- 474
HSGG +D+ + Y + HA++ PG + + + + DA
Sbjct: 180 HSGGS-AYTFDVTEKTNGRPTTYIAGGTHANYATPGQHCHDLPLNLLCDQTDAGPLWDPT 238
Query: 475 ---SNLYVDSSIQYELVA--AEYLGEGVVAE-PSWLQFMRKWGPTIVYDSKTELDKIIKL 528
+ DS+ VA A+ G+ AE SWL F WG D+ K+
Sbjct: 239 LNFRGFWFDSATSTFSVAGGADVGGQEEPAEGASWLNFAGAWG-----------DEQYKI 287
Query: 529 LPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
L +Y ++ + + E GPTGP KN
Sbjct: 288 LE-HGQYCID--IPDVVDECKFVSGPTGPIAKN 317
>gi|83767066|dbj|BAE57206.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391863733|gb|EIT73033.1| hypothetical protein Ao3042_10839 [Aspergillus oryzae 3.042]
Length = 520
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 70/162 (43%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R + G+ +++FS
Sbjct: 304 GNGVVDAFWFYFYSFN----LGNVVLNVRF---GNHVGDWEHCLVRFHH--GKPKALFFS 354
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
HS G+ AY E +E G + ++YS+ HA + PG + IL G+ +D
Sbjct: 355 AHSAGE---AYSYEAVEKNGERPVIYSALGTHAMYATPGIH---DYILPWGLLHDQTDRG 408
Query: 477 LYVDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWG 511
D + +Y + ++A W + WG
Sbjct: 409 PLWDPLLNSHSYTYDYDNDTLLASTFSPDSPTEWFYYNGHWG 450
>gi|440796603|gb|ELR17712.1| hypothetical protein ACA1_064990 [Acanthamoeba castellanii str.
Neff]
Length = 549
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 127 LQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRD 186
L KP+D+ +VW EGG C W P PP + ++G ++T T NKP D V C+ +
Sbjct: 90 LAKPVDFEVVW-RHEGGVRGMLPCTV-WRPVPPPSFVAVGCIITLTHNKPSPDCVVCLHE 147
Query: 187 DLTDKCEVHHLI 198
L EV L+
Sbjct: 148 SLVVPAEVDALL 159
>gi|361128415|gb|EHL00350.1| putative Vacuolar protein sorting-associated protein 62 [Glarea
lozoyensis 74030]
Length = 351
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 13/103 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L +MNV F G HVGDWEH +R N G +++FS
Sbjct: 113 GDGIVDAFWFYFYSYN----LGTTVMNVRF---GNHVGDWEHSLIRFYN--GVPKAVFFS 163
Query: 419 QHSGGKWVAAYDLEYIEGN----KAIVYSSKNGHASFPHPGTY 457
HSGG + +E EG + ++YS+ HA + PG +
Sbjct: 164 AHSGGLAYSYDAVEKGEGRGREGRPVLYSAYGSHAMYATPGKH 206
>gi|406868455|gb|EKD21492.1| hypothetical protein MBM_00605 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 370
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 82/204 (40%), Gaps = 30/204 (14%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGS--NLPSGGRNDGE 326
+ +Y P V+ H EVY PS + F + +L PS + NL
Sbjct: 68 VLDYAPLVWLHAGEVYFPSDI---FAQVKNTHPNVNLTAIESPPSLTLENLDI------- 117
Query: 327 FWIDLPSDGGRQIVKHGNM----ESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVG 382
++ D GR + N+ E L V + G T + G GT+
Sbjct: 118 --LNAYGDIGRNVYLTSNIDVTTEPVWLTGIVPDSTGKTRNITSSAIIINDRGNGTVDAF 175
Query: 383 IMN---------VAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY 433
M V F ++G H+GDWEH +R + GE +++FSQH G+ Y
Sbjct: 176 YMYFYAFNQGNIVFFQELGDHIGDWEHNMIRFQD--GEPQTMWFSQHGNGQ-AFTYKAVE 232
Query: 434 IEGNKAIVYSSKNGHASFPHPGTY 457
E + I YS+ HA++ GT+
Sbjct: 233 KESMRPISYSAMGSHANYAVQGTH 256
>gi|212528828|ref|XP_002144571.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210073969|gb|EEA28056.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 556
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 68/162 (41%), Gaps = 24/162 (14%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L + +V F G HVGDWEH +R + G +++ S
Sbjct: 307 GNDIVDAFWFFFYSYN----LGNTVFDVRF---GNHVGDWEHTMVRF--YKGHPKALFLS 357
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
H+ G+ AY E IE G + +VYS+ HA + PG + +L G+ +D
Sbjct: 358 AHTAGE---AYSYEAIEKHGRRPVVYSATGTHAMYAKPGIH---EYVLPWGLLHDQTDRG 411
Query: 477 LYVDSSIQYELVAAEYLGEGVVA---EPS----WLQFMRKWG 511
D + YL + + A PS W F WG
Sbjct: 412 PLWDPLLNSHYYTYNYLTDVLRASNLNPSAPTEWFFFNGHWG 453
>gi|242765394|ref|XP_002340966.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218724162|gb|EED23579.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 544
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 18/137 (13%)
Query: 385 NVAFS-KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIV 441
N F + G HVGDWEH +R + G +++ S H+ G+ AY E +E G + +V
Sbjct: 321 NTVFDVRFGNHVGDWEHTMVRF--YKGHPKALFLSAHTAGE---AYSYEAVEKQGKRPVV 375
Query: 442 YSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA-- 499
YS+ HA + PG + +L G+ +D D + +YL + + A
Sbjct: 376 YSATGTHAMYATPGIH---EYVLPWGLLHDQTDRGPLWDPLLNSHYFTYDYLSDVLRASN 432
Query: 500 -EPS----WLQFMRKWG 511
PS W F WG
Sbjct: 433 LNPSAPTEWFFFNGHWG 449
>gi|406859906|gb|EKD12968.1| vacuolar protein sorting-associated protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 571
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 19/106 (17%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++NV F G HVGDWEH +R N G +++FS
Sbjct: 341 GSGILDAYWFYFYSYN----LGTTVLNVRF---GNHVGDWEHSLIRFHN--GVPKAVFFS 391
Query: 419 QHSGGKWVAAYDLEYIEGNKA-------IVYSSKNGHASFPHPGTY 457
HSGG AY +E K ++YS+ HA + PG++
Sbjct: 392 AHSGG---LAYSYAAVEKGKGKGREGRPVIYSALGSHAMYAQPGSH 434
>gi|451850194|gb|EMD63496.1| hypothetical protein COCSADRAFT_37275 [Cochliobolus sativus ND90Pr]
Length = 644
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 65/143 (45%), Gaps = 22/143 (15%)
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNK 438
L + NV F G HVGDWEH +R + GE +++FS+HS G+ A+D G +
Sbjct: 428 LGNAVFNVRF---GNHVGDWEHTVVRFHH--GEPKAVFFSEHSFGE-AYAWDAVEKSGKR 481
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARS--------NLYVDSSIQYELVAA 490
I +S+ HA + GT+ +L G+ +D N+Y S +Y+
Sbjct: 482 PIGFSATGTHAMYATEGTH---PYVLPGGILHDVTDRGPLWDPVLNMY---SYRYDYRTD 535
Query: 491 EYLGEGV--VAEPSWLQFMRKWG 511
+ + VA SW F WG
Sbjct: 536 RLVSSNLNPVAPTSWFYFAGHWG 558
>gi|356540571|ref|XP_003538761.1| PREDICTED: uncharacterized protein LOC100780088 [Glycine max]
Length = 4246
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 72/166 (43%), Gaps = 22/166 (13%)
Query: 53 ISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
++ F +W N + +K + + P +P G G P ++V
Sbjct: 2165 VASFELVW-WNQGSNSRKRLSIWRPV-VPMGMVYFGDIAVKGFEPPNTCIVV-------- 2214
Query: 113 AEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTK- 171
+ N+FK+P LD+ LV Q E +F WLPQ P G+ S+G +V K
Sbjct: 2215 -HDSRDENIFKTP-----LDFQLV--GQIKKQRGMESMSF-WLPQAPPGFVSLGCVVCKG 2265
Query: 172 TPNKPELDEVRCVRDDLT--DKCEVHHLIFDAISKFSSSPFSVWST 215
P + + +RC+R DL DK + + +K + PFS+W+
Sbjct: 2266 KPKQNDFSTLRCMRSDLVAGDKFLEESVWDTSDAKHVTEPFSIWAV 2311
>gi|168059862|ref|XP_001781919.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666635|gb|EDQ53284.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 4687
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 46/90 (51%), Gaps = 9/90 (10%)
Query: 127 LQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRD 186
L KPL Y + G+G + W P P GY ++G +VTK P P +D VRC+R
Sbjct: 2449 LAKPLRY--LQRMHTTGRGLED--VVVWFPVAPAGYVAVGCVVTKAPEMPSVDLVRCLRV 2504
Query: 187 DLTDKCEVHHLIFDAISKFSS--SPFSVWS 214
DL + L+ AI SS SP SVWS
Sbjct: 2505 DLVIQS---RLLKPAIWTLSSERSPNSVWS 2531
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 13/83 (15%)
Query: 128 QKPLDYTLVWCSD-----EGGQG------NYEGCAFFWLPQPPDGYKSMGFLVTKTPNKP 176
QKP+ + L+W + GG N E C W+P P GY S+G + + + P
Sbjct: 2211 QKPIGFELIWSTRGDAEPRGGSDAQKDDVNSECCV--WMPIAPPGYLSLGCVAERGLSPP 2268
Query: 177 ELDEVRCVRDDLTDKCEVHHLIF 199
L VRC+R D+ + I+
Sbjct: 2269 SLSSVRCIRSDMVTSGSLSDCIY 2291
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 80/191 (41%), Gaps = 30/191 (15%)
Query: 28 PLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGI------- 80
PL T Q G ++ G + IS F +W KSA + E + +
Sbjct: 4478 PLQTDVQHAGITKAYVHSGRL----ISDFKLLWW-------DKSAPWNENSKVSIWRPIP 4526
Query: 81 PDGFYSLGHYCQ--FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC 138
P G+ S+G Q +DS L V+V RD + L + L D+ ++
Sbjct: 4527 PSGYVSVGDVVQSSYDSPDL---VMVYRDDHDGKFVTPQGFELVGTSPLY--CDFVIMPS 4581
Query: 139 SD--EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHH 196
S+ + + W P+PP GY ++G ++ +P+L V CVR D + +
Sbjct: 4582 SEVWRDAEHSAREPITIWRPRPPLGYVALGCVIVPDYYEPDLGVVSCVRQDCVSQAPLKQ 4641
Query: 197 LIFDAISKFSS 207
++ISK+++
Sbjct: 4642 ---ESISKYTT 4649
>gi|380485453|emb|CCF39350.1| vacuolar protein sorting-associated protein [Colletotrichum
higginsianum]
Length = 521
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 14/101 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++ + F G HVGDWEH +R N GE I+FS
Sbjct: 313 GSGIVDAFWFFFYSYN----LGQTVLTIRF---GNHVGDWEHCMMRFEN--GEPRGIFFS 363
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
+H GG+ AY +E G + ++YS+ HA + G +
Sbjct: 364 EHEGGQ---AYTYNAVEKRGVRPVIYSAVGSHAMYAQAGDH 401
>gi|344232860|gb|EGV64733.1| hypothetical protein CANTEDRAFT_113515 [Candida tenuis ATCC 10573]
Length = 442
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 81/201 (40%), Gaps = 22/201 (10%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV-GEAIDPSGSNLPSGGR 322
++ I Y P V + +E YLP + F T+ + Y G V G + + L
Sbjct: 77 EVPDYILEYAPVVHLYSEERYLPYDIGKFVTHFHVEYDNGTTVPGTEANMNLKKLSELPE 136
Query: 323 NDGEFWI---DLPSDGGRQIVKHGNMESAKLYVHVKPAV------GGTFTDIVMWVFCPF 373
+ F D SD A + PAV G + D + F F
Sbjct: 137 EEDYFLTSNSDFDSDPEWITGIKNKPSLATGEIKDAPAVLIVVDKGNGWVDAYWFYFYSF 196
Query: 374 N-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE 432
N GP + G G HVGDWEH +R N GE ++ S H GG+ ++LE
Sbjct: 197 NLGPFVMGSG-------PYGNHVGDWEHSLVRYYN--GEPVIVWMSAHGGGQAFYYHNLE 247
Query: 433 YIEG--NKAIVYSSKNGHASF 451
E N I++S++ HA++
Sbjct: 248 KYESNPNHPILFSARGTHANY 268
>gi|323337498|gb|EGA78745.1| Vps62p [Saccharomyces cerevisiae Vin13]
Length = 310
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 100/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 25 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 77
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 78 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 132
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 133 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 190
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 191 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 250
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 251 GCPARRYIKRGEGLDAEKNDLVGDNCG 277
>gi|390595361|gb|EIN04767.1| hypothetical protein PUNSTDRAFT_92710 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 346
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 81/199 (40%), Gaps = 48/199 (24%)
Query: 385 NVAFSKI--GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD----LEYIEGNK 438
+ AF I G HVGDWEH +R N G IY S HSGG AAY + + +
Sbjct: 155 HAAFLDIPFGNHVGDWEHSMVRFIN--GTPTDIYLSAHSGG---AAYTYPALVAHNSSGR 209
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILG-------IGVRNDAARS--NLYVDSSIQYELVA 489
A + + HA++ G L ++G G D A++ + D+S Q VA
Sbjct: 210 AQTFIAVGSHANYATSGAQLYPLPVIGPLADHTNFGPLWDVAKNYRGFWFDNSTQTFSVA 269
Query: 490 A-------EYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
E GEG WL+F+ KWG Y E D+ Y V +
Sbjct: 270 GGVDIGGQELEGEGA----GWLEFLGKWGDE-QYPIGLEHDQ----------YCVSDD-- 312
Query: 543 KLPLELYGEEGPTGPKEKN 561
E + GPTGP KN
Sbjct: 313 ----ECHFVSGPTGPIAKN 327
>gi|346974821|gb|EGY18273.1| vacuolar protein sorting-associated protein [Verticillium dahliae
VdLs.17]
Length = 548
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 7/74 (9%)
Query: 384 MNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIV 441
VA + G HVGDWEH +R + G I+FS+H GG+ AY +E G++ ++
Sbjct: 337 QTVANIRFGNHVGDWEHCMMRFEH--GIPRGIFFSEHEGGQ---AYTYNAVEKRGDRPVL 391
Query: 442 YSSKNGHASFPHPG 455
YS+ HA + PG
Sbjct: 392 YSAVGSHAMYAQPG 405
>gi|302421612|ref|XP_003008636.1| vacuolar protein sorting-associated protein [Verticillium
albo-atrum VaMs.102]
gi|261351782|gb|EEY14210.1| vacuolar protein sorting-associated protein [Verticillium
albo-atrum VaMs.102]
Length = 548
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 7/74 (9%)
Query: 384 MNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIV 441
VA + G HVGDWEH +R + G I+FS+H GG+ AY +E G++ ++
Sbjct: 337 QTVANIRFGNHVGDWEHCMMRFEH--GIPRGIFFSEHEGGQ---AYTYNAVEKRGDRPVL 391
Query: 442 YSSKNGHASFPHPG 455
YS+ HA + PG
Sbjct: 392 YSAVGSHAMYAQPG 405
>gi|451993326|gb|EMD85800.1| hypothetical protein COCHEDRAFT_1148348 [Cochliobolus
heterostrophus C5]
Length = 674
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 64/143 (44%), Gaps = 22/143 (15%)
Query: 379 LKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNK 438
L + NV F G HVGDWEH +R + GE +++FS+HS G+ A+D G +
Sbjct: 458 LGNAVFNVRF---GNHVGDWEHTVVRFHH--GEPKAVFFSEHSFGE-AYAWDAVEKSGKR 511
Query: 439 AIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARS--------NLYVDSSIQYELVAA 490
I +S+ HA + GT+ +L G+ +D N+Y S +Y
Sbjct: 512 PIGFSATGTHAMYATEGTH---PYVLPGGILHDVTDRGPLWDPVLNMY---SYKYNYRTD 565
Query: 491 EYLGEGV--VAEPSWLQFMRKWG 511
+ + VA SW F WG
Sbjct: 566 RLVSSNLNPVAPTSWFYFAGHWG 588
>gi|341038461|gb|EGS23453.1| hypothetical protein CTHT_0001460 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 404
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 65/145 (44%), Gaps = 11/145 (7%)
Query: 44 NLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVL 103
+ G++ + S F++ W +++ A F+ P D +G F+ P
Sbjct: 7 DYGDLRITLTSEFDWRWDDTGTGARRDGA-FWHPKPQ-DDLRPVGSVV-FEGHPNPNNNW 63
Query: 104 VARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYK 163
A + S A SPA++ P Y VW SD+G G ++ W P P GY
Sbjct: 64 AALLIGDSRPPHAR-----HSPAVRSPERYDWVW-SDKGSGGKHD--VSVWRPVAPAGYV 115
Query: 164 SMGFLVTKTPNKPELDEVRCVRDDL 188
++G + + + P +DE+ CVR+DL
Sbjct: 116 ALGDVAVRGGSAPGIDEIYCVREDL 140
>gi|440798440|gb|ELR19508.1| zinc finger, C3HC4 type (RING finger) domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 407
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 127 LQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRD 186
L+ P D+ L+W + G + W P PP G+ ++G++VT NKP L+ + CV
Sbjct: 243 LRPPTDFELIWNDKKTGAKRH---CSIWRPIPPPGFVALGYVVTPHHNKPTLEAITCVHH 299
Query: 187 DLTDKCEV 194
C V
Sbjct: 300 SCVRACSV 307
>gi|302894953|ref|XP_003046357.1| hypothetical protein NECHADRAFT_75995 [Nectria haematococca mpVI
77-13-4]
gi|256727284|gb|EEU40644.1| hypothetical protein NECHADRAFT_75995 [Nectria haematococca mpVI
77-13-4]
Length = 415
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 9/77 (11%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE----YIEGNKAIVYSSKNG 447
G HVGDWEH +R + G+ IY+SQHS G AAY+ +E + +VYS+
Sbjct: 196 GCHVGDWEHNMVRFRD--GKPTGIYYSQHSSG---AAYEWNDTRLSLEDERPLVYSAYGS 250
Query: 448 HASFPHPGTYLQGSEIL 464
HA+F G ++ S +L
Sbjct: 251 HANFASSGDHVHDSVLL 267
>gi|310800992|gb|EFQ35885.1| vacuolar protein sorting-associated protein 62 [Glomerella
graminicola M1.001]
Length = 454
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 82/207 (39%), Gaps = 33/207 (15%)
Query: 366 VMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKW 425
V V PFN + G + G H+GDW H +R + G+ IYFSQH G
Sbjct: 164 VTQVLEPFN---HIVKGGEAASGMHFGDHIGDWYHNMIRFRD--GKPIGIYFSQHVDGSS 218
Query: 426 VAAYDLEYIEGN-KAIVYSSKNGHASFPHPGTYLQGSEILGI---GVRNDAARSN-LYVD 480
D E + + + IVYS+ HA++P PG + + ++ G R D S Y
Sbjct: 219 YNWDDPELSKTDGRPIVYSACGSHANYPTPGDQIHNAVLVDYCDQGKRWDPVLSAYFYRF 278
Query: 481 SSIQYELVAAEYLGEGVVAEP------SWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIR 534
+ L + G+ A P S+ + +WG DS
Sbjct: 279 DPKSFTLTRLDPPGQLTAAPPESSNLTSFFYYSGRWGDVSYPDSDPR------------- 325
Query: 535 YSVENAVSKLPLELYGEEGPTGPKEKN 561
+ V L+ + + GPTGP+ K+
Sbjct: 326 ---QKTVPWFKLKRF-QSGPTGPRYKH 348
>gi|156840798|ref|XP_001643777.1| hypothetical protein Kpol_480p6 [Vanderwaltozyma polyspora DSM
70294]
gi|156114402|gb|EDO15919.1| hypothetical protein Kpol_480p6 [Vanderwaltozyma polyspora DSM
70294]
Length = 462
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 79/357 (22%), Positives = 132/357 (36%), Gaps = 91/357 (25%)
Query: 257 HAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFT-------NGALLYKAGDLVGEA 309
++P I + I + P V+ H +E Y PS + + T NG ++ A +
Sbjct: 68 RSLPKNGSIPSYIVDNCPVVYLHSEERYWPSDIEDYVTHFSINDGNGNVIINASNDDPLK 127
Query: 310 IDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYV----------HVK--PA 357
+ S +N +++I S + N ++ ++ H+K P
Sbjct: 128 LKDLNSGYKINFKNGSKYYI---SSEDTYMTSLDNFDNDPKWLLGHKPEYGTGHIKKGPV 184
Query: 358 V------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTG 410
+ G + D + F PFN GP + G G HVGDWEH +R + G
Sbjct: 185 ILFAVDKGNGWVDAFWFYFYPFNWGPFIMGYG-------PWGNHVGDWEHSLVRF--YKG 235
Query: 411 ELWSIYFSQHSGGKWVAAYDLEYIEGNKA----------------IVYSSKNGHASFPHP 454
+ ++ S HSGG AY E IE K +++S++ HA++
Sbjct: 236 KPKYLWMSAHSGG---TAYKFEAIEKIKKLRHVNGRLTSELIERPVIFSARGTHANYAST 292
Query: 455 GTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEPS----------WL 504
G + + + + R ++ D + Y Y+ +G PS WL
Sbjct: 293 GQHPHDVPFFFMPLSDFTDRGPMW-DPVLNY----YSYVFDGTNIIPSSEKEEKLGIQWL 347
Query: 505 QFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
Q+ WG DK + R+ P++ +GPTGP KN
Sbjct: 348 QYHGTWG-----------DKQLAWKDPRQRWC--------PVQWKYIDGPTGPLYKN 385
>gi|46136941|ref|XP_390162.1| hypothetical protein FG09986.1 [Gibberella zeae PH-1]
Length = 581
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 10/109 (9%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIVYSSKNG 447
+ G HVGDWEH +R + G ++FS+H GG+ AY E +E G + ++YS+
Sbjct: 391 RFGNHVGDWEHCMVRFEH--GIPRGVFFSEHEGGQ---AYAFEAVEKRGERPVIYSAVGS 445
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEG 496
HA + PG + I+ + D D S+ +Y EG
Sbjct: 446 HAMYALPGDH---PYIIPFKLLKDVTDKGPLWDPSLNNYAYHYDYTKEG 491
>gi|452836977|gb|EME38920.1| hypothetical protein DOTSEDRAFT_75581 [Dothistroma septosporum
NZE10]
Length = 359
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 67/140 (47%), Gaps = 23/140 (16%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK-WVAAYDLEYIEGNKAIVYSSKNGH 448
++G HVGDWEH +R N T E ++++SQHS G+ + A + +G + +VY + H
Sbjct: 160 EVGDHVGDWEHTMVRFVNETPE--AVWYSQHSRGEAFTYAATEKGTDGVRPVVYVANGTH 217
Query: 449 ASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYL-------------GE 495
A++ G + I G+ V A + DS ++ +A Y G+
Sbjct: 218 ANYATTGNH--DHTIPGVNVP--AGPVEDHTDSGALWDPIANAYFYSYDMTSKNFSAYGD 273
Query: 496 GVVAEPSWLQFMRKWGPTIV 515
V +WL F+ ++G ++
Sbjct: 274 APV---NWLYFLGQYGDDMI 290
>gi|323304879|gb|EGA58637.1| Vps62p [Saccharomyces cerevisiae FostersB]
Length = 419
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 100/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 134 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 186
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 187 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 241
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 242 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 299
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 300 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 359
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 360 GCPARRYIKRGEGLDAEKNDLVGDNCG 386
>gi|156400991|ref|XP_001639075.1| predicted protein [Nematostella vectensis]
gi|156226201|gb|EDO47012.1| predicted protein [Nematostella vectensis]
Length = 140
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 19/113 (16%)
Query: 362 FTDIVMWVFCPFNGPGTLKVGIMNV------AFSKIGQHVGDWEHFTLRICNFTGELWSI 415
+ DI W+F P+N + +G+ +S G HVGDWEH T+R+ N+ E +
Sbjct: 1 YYDIYYWLFYPYNRGKRVCIGLKPKFIGCIGGYSTFGHHVGDWEHVTIRL-NWNMEPLQL 59
Query: 416 YFSQHSGGKWVAAYDLE---YIEGNKA--------IVYSSKNGHASFPHPGTY 457
Y S H+ G YD + GN+ +VYS+ H + PGT+
Sbjct: 60 YVSAHNFGT-RYNYDKHTGVFRSGNRVLAMHDSHPVVYSALGSHGMWDRPGTH 111
>gi|413925785|gb|AFW65717.1| hypothetical protein ZEAMMB73_601551 [Zea mays]
Length = 2676
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 121 LFKSPAL--QKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
LFK + +KP+ +T V D G FFW P PP GY S+G +VTKT P
Sbjct: 774 LFKCDTVVSEKPVQFTKVTQIDRKGLEEI----FFWYPVPPPGYASLGCIVTKTDEMPSK 829
Query: 179 DEVRCVRDDLTDKCEVHHLIFDAISKFSSSP----FSVWSTRPCNRGMLGR 225
D + C + L + + D IS SSS +S+W L R
Sbjct: 830 DSICCPKLSLVSQANMSE---DPISMSSSSKGPCCWSIWKVENQGCTFLAR 877
>gi|413925786|gb|AFW65718.1| hypothetical protein ZEAMMB73_601551 [Zea mays]
gi|413925787|gb|AFW65719.1| hypothetical protein ZEAMMB73_601551 [Zea mays]
Length = 2718
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 47/99 (47%), Gaps = 13/99 (13%)
Query: 121 LFKSPAL--QKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
LFK + +KP+ +T V D G FFW P PP GY S+G +VTKT P
Sbjct: 774 LFKCDTVVSEKPVQFTKVTQIDRKGLEEI----FFWYPVPPPGYASLGCIVTKTDEMPSK 829
Query: 179 DEVRCVRDDLTDKCEVHHLIFDAISKFSSSP----FSVW 213
D + C + L + + D IS SSS +S+W
Sbjct: 830 DSICCPKLSLVSQANMSE---DPISMSSSSKGPCCWSIW 865
>gi|156316804|ref|XP_001617993.1| hypothetical protein NEMVEDRAFT_v1g77984 [Nematostella vectensis]
gi|156196843|gb|EDO25893.1| predicted protein [Nematostella vectensis]
Length = 130
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 51/113 (45%), Gaps = 20/113 (17%)
Query: 73 TFYEPAGIPDGFYSLGHYC-QFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPL 131
+ + P G +GFY LG Y ++P LVA+D S AL KP+
Sbjct: 9 SIWRPHGA-EGFYRLGDYAVASHNKPSNPAALVAKDDGSG--------------ALAKPV 53
Query: 132 DYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCV 184
Y+ VW SD G + + W P GY+ +G L T P LD++RCV
Sbjct: 54 GYSRVW-SDAGSGADRD--VSLWKPTAAAGYQCLGLLATSGAT-PGLDDMRCV 102
>gi|308802177|ref|XP_003078402.1| C2 domain-containing protein (ISS) [Ostreococcus tauri]
gi|116056854|emb|CAL53143.1| C2 domain-containing protein (ISS) [Ostreococcus tauri]
Length = 4247
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 51/118 (43%), Gaps = 21/118 (17%)
Query: 72 ATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPL 131
A+F+ P P GF LG D +P V V D AEG L KP
Sbjct: 1895 ASFWRPIP-PRGFVLLGDCVTLDGKPPSHPVTVLTD-----AEG-----------LTKPP 1937
Query: 132 DYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLT 189
+ C EGG W P PPDGY + G +VT + P ++ +RC+R ++T
Sbjct: 1938 LWYDQVCKFEGGNST----TTVWRPVPPDGYYAFGCIVTTGEDLPAVEIMRCIRSEVT 1991
>gi|429859634|gb|ELA34410.1| vacuolar protein sorting protein 62 [Colletotrichum gloeosporioides
Nara gc5]
Length = 541
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 14/101 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++ + F G HVGDWEH +R N G I+FS
Sbjct: 313 GSGIVDAFWFFFYSYN----LGQTVLTIRF---GNHVGDWEHCMMRFEN--GVPRGIFFS 363
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
+H GG+ AY +E G++ ++YS+ HA + G +
Sbjct: 364 EHEGGQ---AYTYHAVEKRGDRPVIYSAVGSHAMYAQAGDH 401
>gi|392299394|gb|EIW10488.1| Vps62p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 419
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 100/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 134 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 186
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 187 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 241
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 242 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 299
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 300 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 359
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 360 GCPARRYIKRGEGLDAEKNDLVGDNCG 386
>gi|6321580|ref|NP_011657.1| Vps62p [Saccharomyces cerevisiae S288c]
gi|1723717|sp|P53285.1|VPS62_YEAST RecName: Full=Vacuolar protein sorting-associated protein 62
gi|1323236|emb|CAA97154.1| unnamed protein product [Saccharomyces cerevisiae]
gi|151943419|gb|EDN61730.1| vacuolar sorting protein [Saccharomyces cerevisiae YJM789]
gi|190406843|gb|EDV10110.1| vacuolar protein sorting protein 62 [Saccharomyces cerevisiae
RM11-1a]
gi|285812332|tpg|DAA08232.1| TPA: Vps62p [Saccharomyces cerevisiae S288c]
gi|323354827|gb|EGA86660.1| Vps62p [Saccharomyces cerevisiae VL3]
Length = 467
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 100/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 182 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 234
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 235 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 289
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 290 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 347
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 348 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 407
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 408 GCPARRYIKRGEGLDAEKNDLVGDNCG 434
>gi|349578348|dbj|GAA23514.1| K7_Vps62p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 467
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 100/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 182 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 234
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 235 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 289
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 290 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 347
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 348 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 407
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 408 GCPARRYIKRGEGLDAEKNDLVGDNCG 434
>gi|323309002|gb|EGA62232.1| Vps62p [Saccharomyces cerevisiae FostersO]
Length = 419
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 100/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 134 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 186
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 187 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 241
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 242 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 299
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 300 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 359
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 360 GCPARRYIKRGEGLDAEKNDLVGDNCG 386
>gi|207345068|gb|EDZ72008.1| YGR141Wp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256269040|gb|EEU04379.1| Vps62p [Saccharomyces cerevisiae JAY291]
gi|259146643|emb|CAY79900.1| Vps62p [Saccharomyces cerevisiae EC1118]
Length = 467
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 100/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 182 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 234
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 235 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 289
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 290 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 347
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 348 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 407
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 408 GCPARRYIKRGEGLDAEKNDLVGDNCG 434
>gi|302887593|ref|XP_003042684.1| hypothetical protein NECHADRAFT_52243 [Nectria haematococca mpVI
77-13-4]
gi|256723597|gb|EEU36971.1| hypothetical protein NECHADRAFT_52243 [Nectria haematococca mpVI
77-13-4]
Length = 940
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 93/229 (40%), Gaps = 49/229 (21%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGS-NLPSGG------ 321
+ NY P V+ H ++ Y+PS + L + + G AID S NL + G
Sbjct: 538 VANYAPLVWLHSEDPYMPSDLL-----AHLQHTTPTVQGHAIDGIPSINLGNLGTLNEFG 592
Query: 322 -------RNDGEF----WI--DLPSDGGR---------QIVKHGNMESAKLYVHVKPAVG 359
D F WI + P D GR +V+ ++ Y +
Sbjct: 593 DEDVALVSQDDPFSYPKWILGEAPDDAGRIHNATPCAVILVEKNEVDLDAFYFYFYSYNE 652
Query: 360 GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQ 419
G + V P N T + + F G HVGDWEH +R + G+ IY+SQ
Sbjct: 653 GP---NITQVLEPLNRLVTSEKASAGMHF---GNHVGDWEHNMVRFRD--GKPIGIYYSQ 704
Query: 420 HSGGKWVAAYDLE----YIEGNKAIVYSSKNGHASFPHPGTYLQGSEIL 464
H G+ YD G++ IVYS++ HA++ PG + + ++
Sbjct: 705 HVDGE---GYDWNDGAVSKAGDRPIVYSARGSHANYAVPGAPVHNAALV 750
>gi|345564587|gb|EGX47548.1| hypothetical protein AOL_s00083g357 [Arthrobotrys oligospora ATCC
24927]
Length = 529
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 43/74 (58%), Gaps = 8/74 (10%)
Query: 385 NVAFS-KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE--GNKAIV 441
NV F+ + G HVGDWEH +R + GE ++ SQH+GG AY E +E G + ++
Sbjct: 299 NVVFNVRFGNHVGDWEHSMVRFVD--GEPKYMFLSQHTGGD---AYVWEALEKKGKRPVI 353
Query: 442 YSSKNGHASFPHPG 455
YS+ HA++ G
Sbjct: 354 YSAVGTHANYATAG 367
>gi|385305170|gb|EIF49160.1| vacuolar protein sorting protein 62 [Dekkera bruxellensis AWRI1499]
Length = 294
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 86/205 (41%), Gaps = 24/205 (11%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G + D + F FN +G + F G HVGDWEH +R + GE ++ S
Sbjct: 71 GNGWVDSFWFYFYSFN------MGPYVMGFGPFGDHVGDWEHSLVRF--YKGEPVVVWMS 122
Query: 419 QHSGGKWVAAYDLE-YIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNL 477
H GG +LE Y +++S++ HA++P G Q + + + +D
Sbjct: 123 AHGGGGAYFYNNLEKYGNSKHPVIFSARGTHANYPSTG---QHAHDIPYSILSDFTDRGP 179
Query: 478 YVDSSIQYELVAAEYLGEGVVAEPSWLQFMR-KWGPTIVYDSKTELDKIIKLLPLMIRYS 536
D S+ Y +A Y + V A + F +G ++Y+ T DK++ +S
Sbjct: 180 LWDPSLNY--LAYTYNDDKVSAANGTVPFREASYGNWLLYNG-TWGDKMLAANDSRQHWS 236
Query: 537 VENAVSKLPLELYGEEGPTGPKEKN 561
P E +GP GP KN
Sbjct: 237 --------PFEWKYIDGPIGPLGKN 253
>gi|168005281|ref|XP_001755339.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693467|gb|EDQ79819.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 4849
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 63/145 (43%), Gaps = 26/145 (17%)
Query: 53 ISRFNFIWSCN-------LLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVA 105
+S N +WS N L + + +F+ P P G+ S+G P V+V
Sbjct: 4668 VSLGNLLWSYNKKSNGSCLGFTVADNISFWRPDP-PPGYVSVGDVAFTGDYPDNQTVIVY 4726
Query: 106 RDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSM 165
R+ +KPL + LVW + + G G+ W+P PDGY ++
Sbjct: 4727 RN---------------DEDKFEKPLGFNLVWRNWKDGSGS---PISIWMPIAPDGYLAV 4768
Query: 166 GFLVTKTPNKPELDEVRCVRDDLTD 190
G +V +P+LD V CV DLT+
Sbjct: 4769 GCVVCADYEEPQLDVVWCVHSDLTE 4793
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 14/72 (19%)
Query: 151 AFFWLPQPPDGYKSMGFLVTKT--PNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSS- 207
+FW P PP GY ++G + K+ P+K + +RCVR+DL V H F S +S+
Sbjct: 2712 VYFWNPVPPPGYSAIGCIAGKSSRPDKDVMRSIRCVRNDL-----VSHANFSESSPWSTK 2766
Query: 208 ------SPFSVW 213
P S+W
Sbjct: 2767 YLKSGQQPMSIW 2778
>gi|50308227|ref|XP_454114.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49643249|emb|CAG99201.1| KLLA0E03785p [Kluyveromyces lactis]
Length = 428
Score = 47.4 bits (111), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 86/225 (38%), Gaps = 53/225 (23%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G + D + F PFN +G + + G H+GDWEH +R F G+ I+ S
Sbjct: 158 GNGWVDAYWFYFYPFN------LGPFIMGYGPWGNHIGDWEHSLVRF--FKGDPQYIWMS 209
Query: 419 QHSGGKWVAAYDLEYIEG------------------NKAIVYSSKNGHASFPHPGTYLQG 460
H GG A+Y IE + +++S++ HA++ G +
Sbjct: 210 AHGGG---ASYTYRCIEKKDRWRVTPEGKFDKTQVIKRPLLFSARGTHANYASVGQHAHD 266
Query: 461 SEILGIGVRNDAARSNLYVDSSIQYELVAAE----YLGEGVVAEPSWLQFMRKWGPTIVY 516
+ + R L+ D S+++ + Y EG SWL F+ +WG +
Sbjct: 267 VPFFFSALSDFTDRGPLW-DPSLRFYGFTYDGKSFYEKEGQEIGTSWLYFLGQWGNKQLK 325
Query: 517 DSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
S D K P+ +Y +GP GP KN
Sbjct: 326 WS----DPRQKWCPVQWKYI---------------DGPKGPAAKN 351
>gi|149234513|ref|XP_001523136.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146453245|gb|EDK47501.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 458
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 46/209 (22%), Positives = 91/209 (43%), Gaps = 26/209 (12%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRN 323
++ + +Y P V+ + +E YLP ++ + T+ + ++ G +V D + ++
Sbjct: 93 EVPQYVLDYSPLVYLYSEERYLPYDIADYATHFHVTFENGTIVPGTEDSLTLKKLAKLKH 152
Query: 324 DGEFWI------DLPSDGGRQIVKHGNMESAKLYVHVKPAV------GGTFTDIVMWVFC 371
++ D+ D + N+ + + + PA+ G + D + F
Sbjct: 153 SKHLYMTSNEDFDVDPDWITGVHNKPNLITGE--IKNAPAILIVVDKGNGWVDAFWFYFY 210
Query: 372 PFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
FN GP + G G HVGDWEH +R + G+ ++ S H GG +
Sbjct: 211 SFNLGPFVMGQG-------PYGNHVGDWEHSLVRF--YKGKPVIVWMSAHGGGGAFYYKN 261
Query: 431 LE--YIEGNKAIVYSSKNGHASFPHPGTY 457
LE ++ I++S++ HA++P G +
Sbjct: 262 LEKWVMDERHPIIFSARGTHANYPSVGQH 290
>gi|365760606|gb|EHN02316.1| Vps62p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 382
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 99/266 (37%), Gaps = 62/266 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 97 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 149
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG AY E IE K +++S++
Sbjct: 150 RF--YKGEPQFLWMSAHGGG---TAYKFEAIEKIKRLRRIDGKLTNEVIKKPLIFSARGT 204
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQ---YELVAAEYL----GEGVVAE 500
HA + G + + + + R L+ D S+ Y + E + E
Sbjct: 205 HAHYASVGQHSHDVPFFFMPLSDFTDRGPLW-DPSLNFYAYTITVGETMTPCGAEEAKMG 263
Query: 501 PSWLQFMRKWGPTIV--YDSKTE---------------LDKIIKLLPLMIRYSVENAVSK 543
WL F WG + +D + + L K + + L R+ N
Sbjct: 264 LDWLSFKGAWGDKQLKSWDPRQKWCPFQWKYIDGPRGPLFKNMARVSLCQRFKWWNFWKG 323
Query: 544 LPLELYGEEGPTGPKEKNNWVGDERG 569
P Y ++G EKN+ VGD G
Sbjct: 324 CPARRYIKKGEGLDAEKNDLVGDNCG 349
>gi|408396783|gb|EKJ75937.1| hypothetical protein FPSE_03885 [Fusarium pseudograminearum CS3096]
Length = 499
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 61/140 (43%), Gaps = 17/140 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++ + F G HVGDWEH +R + G ++FS
Sbjct: 285 GSGIVDAFWFFFYSYN----LGQTVLGIRF---GNHVGDWEHCMVRFEH--GIPRGVFFS 335
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSN 476
+H GG+ AY E +E G + ++YS+ HA + PG + I+ + D
Sbjct: 336 EHEGGQ---AYAFEAVEKRGERPVIYSAVGSHAMYALPGDH---PYIIPFKLLKDVTDKG 389
Query: 477 LYVDSSIQYELVAAEYLGEG 496
D S+ +Y EG
Sbjct: 390 PLWDPSLNNYAYHYDYTKEG 409
>gi|171676483|ref|XP_001903194.1| hypothetical protein [Podospora anserina S mat+]
gi|170936308|emb|CAP60966.1| unnamed protein product [Podospora anserina S mat+]
Length = 542
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 14/104 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++N+ F G HVGDWEH +R N G +++ S
Sbjct: 318 GAGILDAFWFFFYSYN----LGQTVLNIRF---GNHVGDWEHCMVRFQN--GIPRAMFLS 368
Query: 419 QHSGGKWVAAYDLEYIEGN-----KAIVYSSKNGHASFPHPGTY 457
+H+GGK + +E N + ++YS+ HA + +PG +
Sbjct: 369 EHAGGKAYTWHAMEKRSQNHGKPARPVIYSAVGSHAMYANPGLH 412
>gi|83770246|dbj|BAE60379.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 193
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 65/147 (44%), Gaps = 21/147 (14%)
Query: 46 GEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGI----PDGFYSLGHYCQFDSRPLRGF 101
GE+ V S FN IW+ + + F+ P P G ++G++ + + + R
Sbjct: 53 GELVVTFTSSFNAIWN-DAGSGTTRDGGFWHPITQGTLRPLGSMAVGNFKELNGQ--RAA 109
Query: 102 VLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDG 161
+L+ A+ +SN P ++ P YT +W +D+G G FW P G
Sbjct: 110 LLIG-------AKSTSSSN----PPVKAPTSYTQLW-ADKGSGAKLNGS--FWRPIAAPG 155
Query: 162 YKSMGFLVTKTPNKPELDEVRCVRDDL 188
Y +MG +V P +V C+R DL
Sbjct: 156 YIAMGDVVQSGYTTPSTSKVWCLRSDL 182
>gi|340938937|gb|EGS19559.1| hypothetical protein CTHT_0040350 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 889
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 50/105 (47%), Gaps = 16/105 (15%)
Query: 359 GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYF 417
G D + F +N G LK+ + G HVGDWEH +R N G +++
Sbjct: 664 GAGILDAFWFFFYSYNLGQTVLKI--------RFGNHVGDWEHCMVRFQN--GIPRAMFL 713
Query: 418 SQHSGGKWVAAYDLEYIEGN-----KAIVYSSKNGHASFPHPGTY 457
S+H+GGK + LE N + ++YS+ HA + +PG +
Sbjct: 714 SEHAGGKAYTWHALEKRAQNDGKPPRPVIYSAVGSHAMYANPGLH 758
>gi|242812643|ref|XP_002486000.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218714339|gb|EED13762.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 321
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 59/129 (45%), Gaps = 14/129 (10%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASF 451
G HVGDWEH +R N GE ++++SQH+GG+ L I G + + YS + HA++
Sbjct: 150 GDHVGDWEHNMIRFQN--GEPQALWYSQHAGGEAFTYDALLKING-RPVSYSGRGTHANY 206
Query: 452 PHPGTYLQGSEILGI----GVRNDAARSNLYVDSS-----IQYELVAAEYLGEGVVAEPS 502
G + I G+ G ND + D + Y V+A + +
Sbjct: 207 AIEG--VHDHTIPGVNLPFGPLNDYTSNGTLWDPTGNYYAYSYNNVSAVFTAYDSSYPVN 264
Query: 503 WLQFMRKWG 511
WL F +WG
Sbjct: 265 WLYFDGQWG 273
>gi|453084329|gb|EMF12373.1| carbohydrate-binding module family 18 protein [Mycosphaerella
populorum SO2202]
Length = 636
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 101/284 (35%), Gaps = 60/284 (21%)
Query: 305 LVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTD 364
L G DP N P G D L +G + + G SA + V P G D
Sbjct: 316 LAGTCYDPY--NPPRGPHPDLRRRTTLDLNGDKHKPQQGGKSSAPAILIVVPKENG-IVD 372
Query: 365 IVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICN----------------F 408
+ F FN L ++N+ F G HVGDWEH +R N F
Sbjct: 373 AFWFFFYSFN----LGQTVLNIRF---GNHVGDWEHTCVRFRNGKPTEVFLSEHNFGQAF 425
Query: 409 TGELWSIYFSQHSG-GKWVAAYDLEYI--EGNKAIVYSSKNGHASFPHPGTYLQGSEILG 465
T + Y H G G + ++ E + +VYS+ HA + PG + I+
Sbjct: 426 TWKAMEKYIPSHDGSGTMIGSWSNETAGRMAKRPVVYSAIGSHAMYGTPGLH---PYIIP 482
Query: 466 IGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP-------SWLQFMRKWGPTIVYDS 518
G+ +D D + ++ + + + A WL + WG
Sbjct: 483 FGLLHDETDRGPLWDPAQNFQAYTYDPVNRTMRASNLNPQSPVGWLSYAGHWG------- 535
Query: 519 KTELDKIIKLL-PLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
DK L P R++ + + GPTGP+ KN
Sbjct: 536 ----DKYYPLSDPRQYRFAG---------QYHYVNGPTGPRFKN 566
>gi|397606868|gb|EJK59471.1| hypothetical protein THAOC_20301 [Thalassiosira oceanica]
Length = 4649
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 64/150 (42%), Gaps = 18/150 (12%)
Query: 47 EIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYS------LGHYCQ--FDSRPL 98
+++V R+ +F IW+ S+ K + +EPA F S LGHY +D+
Sbjct: 4427 KLKVKRVGKFTQIWTSQGSMSRLK-GSVWEPADHKSTFKSNRAYVYLGHYSGMGYDNPNR 4485
Query: 99 RG---FVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWL 155
G L D S G+ L +P Y LVW G Y W
Sbjct: 4486 DGTPRLTLEITDTTGSWVGGSSWLPFVLERYLPRPARYRLVWSITTGANPFYA-----WE 4540
Query: 156 PQPP-DGYKSMGFLVTKTPNKPELDEVRCV 184
P PP D Y ++GF+ TK+ P++ +RCV
Sbjct: 4541 PVPPSDDYVALGFIGTKSEKPPDVRMMRCV 4570
>gi|322707367|gb|EFY98946.1| hypothetical protein MAA_06085 [Metarhizium anisopliae ARSEF 23]
Length = 273
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 41/68 (60%), Gaps = 9/68 (13%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY----IEGNKAIVYSSKNG 447
G HVGDWEH +R + G+ IY+SQHSGG AAYD ++ ++ + +V+S+
Sbjct: 191 GDHVGDWEHNMVRFRD--GKPSGIYYSQHSGG---AAYDWDHADLSMKDGRPLVFSAYGS 245
Query: 448 HASFPHPG 455
HA++ G
Sbjct: 246 HANYASSG 253
>gi|255944113|ref|XP_002562824.1| Pc20g02700 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587559|emb|CAP85599.1| Pc20g02700 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 336
Score = 46.6 bits (109), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 83/200 (41%), Gaps = 21/200 (10%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFF--TNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGE 326
+ Y P V+ H E Y+PS + T + + A + V + + + + N
Sbjct: 34 VLEYAPLVWLHSQETYMPSDIQQQLDHTRPHVNWTALEGVESPLTLNNLDTLNSMGNTSV 93
Query: 327 FWIDL------PSDGGRQIVK---HGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPG 377
+ L P Q ++ G E+ + V G D + F FN G
Sbjct: 94 YLTSLEGIEADPQPAWFQGIRPDSQGRTENGTASIIVIADRGNGTVDAFYFYFYAFNKGG 153
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN 437
T ++ + F G H+GDWEH +R + G +I++SQH+ G+ +E +
Sbjct: 154 T----VLGLEF---GDHIGDWEHNMIRFVD--GSPDAIWYSQHASGQAFTYAAVEK-KDK 203
Query: 438 KAIVYSSKNGHASFPHPGTY 457
+ VYS+K HA++ G +
Sbjct: 204 RPYVYSAKGTHANYAIEGQH 223
>gi|448515185|ref|XP_003867271.1| hypothetical protein CORT_0B01130 [Candida orthopsilosis Co 90-125]
gi|380351610|emb|CCG21833.1| hypothetical protein CORT_0B01130 [Candida orthopsilosis]
Length = 439
Score = 46.6 bits (109), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 50/209 (23%), Positives = 92/209 (44%), Gaps = 26/209 (12%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV-GEAIDPSGSNLPSGGR 322
I + +Y P V + +E YLP + + TN + ++ G + G D + L S +
Sbjct: 74 HIPQYVIDYAPLVHLYSEERYLPYDIGEYVTNFHVTFENGSVYPGTEADMNLDKL-SKLK 132
Query: 323 NDGEFWIDLPSD-----------GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFC 371
+ + ++ SD + + +G +++A + V G + D + F
Sbjct: 133 HSCDLFLTSNSDFDKDPDWITGVKNKPSLINGEIKNAPAILIVVDK-GFGWVDAYWFYFY 191
Query: 372 PFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
FN GP + G G HVGDWEH +R + GE ++ S H GG ++
Sbjct: 192 SFNLGPFVMGQG-------PYGNHVGDWEHSLVRF--YKGEPVIVWMSAHGGGGAFYYHN 242
Query: 431 LEY--IEGNKAIVYSSKNGHASFPHPGTY 457
LE ++ +++S++ HA++P G +
Sbjct: 243 LEKYALDPRHPVIFSARGTHANYPSVGQH 271
>gi|294655800|ref|XP_457996.2| DEHA2C07216p [Debaryomyces hansenii CBS767]
gi|199430616|emb|CAG86054.2| DEHA2C07216p [Debaryomyces hansenii CBS767]
Length = 436
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 60/238 (25%), Positives = 98/238 (41%), Gaps = 28/238 (11%)
Query: 236 SNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTN 295
+N I+ + N+ + + K +I +Y P V + +E YLP + + TN
Sbjct: 43 NNLINSPDKNLPPISSWPKKEERTLKNGEIPQYAIDYAPLVHLYTEEKYLPYDIKDYVTN 102
Query: 296 GALLYKAGDLVG---EAIDPSG-SNLPS------GGRNDGEF---WIDLPSDGGRQIVKH 342
+ Y+ G +V E ++ S S LP+ R D + W+ + I
Sbjct: 103 FFVTYENGSVVKGTEEEMNLSKLSKLPNVPDIFLSSRTDFDKDPEWVTGAKNKPSLIDGE 162
Query: 343 GNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHF 401
A L V K G + D + F FN GP + G G HVGDWEH
Sbjct: 163 IKNAPATLIVVDK---GNGWVDAFWFYFYSFNLGPFVMGSG-------PYGNHVGDWEHS 212
Query: 402 TLRICNFTGELWSIYFSQHSGGKWVAAYDLEY--IEGNKAIVYSSKNGHASFPHPGTY 457
+R N + ++ S H GG ++LE I+ I++S++ HA++ G +
Sbjct: 213 LVRFYNGVPVI--VWMSAHGGGGAYFYHNLEKWEIDDTHPIIFSARGTHANYVSVGQH 268
>gi|327305945|ref|XP_003237664.1| hypothetical protein TERG_02381 [Trichophyton rubrum CBS 118892]
gi|326460662|gb|EGD86115.1| hypothetical protein TERG_02381 [Trichophyton rubrum CBS 118892]
Length = 384
Score = 46.6 bits (109), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 111/292 (38%), Gaps = 35/292 (11%)
Query: 239 ISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGAL 298
IS + IA + L P A + + Y P ++ H + Y PS + N
Sbjct: 26 ISAASVYIAAEQCLAPAPSAT-----VPDYVMKYAPLIWIHSQDPYQPSDIGAQLKNSIP 80
Query: 299 LYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIV----------KHGNMESA 348
+ + D + +NL + G + L S G Q + K G E A
Sbjct: 81 MINHVAIPNVPTDIALNNLDQLNKLGGNK-VCLTSKEGIQALPAWFRGVKPNKDGKTEGA 139
Query: 349 KLYVHVKPAVG--GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRIC 406
V G G D + F +N T ++ + F G HVGDWEH +R
Sbjct: 140 VSSAIVVREHGDEGKTVDAFYFYFYAYNQGNT----VLGIEF---GDHVGDWEHNMIRFR 192
Query: 407 NFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY---LQGSEI 463
+ G+ +I++SQH+ G+ Y G + I +S HA + GT+ + G ++
Sbjct: 193 D--GKPEAIWYSQHAAGQ-AFQYSATDKRGVRPIGFSGNGTHAVYATAGTHDHTIPGLDL 249
Query: 464 LGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP----SWLQFMRKWG 511
+ ++ L+ + Y + G +P +WL F +WG
Sbjct: 250 PAGFLEDETDEGTLWDPTLSAYVYSYSSTTGLFTPYDPNTPTNWLYFQGRWG 301
>gi|354547137|emb|CCE43870.1| hypothetical protein CPAR2_500960 [Candida parapsilosis]
Length = 439
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 50/209 (23%), Positives = 92/209 (44%), Gaps = 26/209 (12%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV-GEAIDPSGSNLPSGGR 322
I + +Y P V + +E YLP + + TN + ++ G + G D + L S +
Sbjct: 74 HIPQYVIDYAPLVHLYSEERYLPYDIGEYVTNFHVTFENGSVYPGTEADMNLDKL-SKLK 132
Query: 323 NDGEFWIDLPSD-----------GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFC 371
+ + ++ SD + + +G +++A + V G + D + F
Sbjct: 133 HSCDLFLTSNSDFDKDPDWITGVKNKPSLINGEIKNAPATLIVVDK-GFGWVDAYWFYFY 191
Query: 372 PFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYD 430
FN GP + G G HVGDWEH +R + GE ++ S H GG ++
Sbjct: 192 SFNLGPFVMGQG-------PYGNHVGDWEHSLVRF--YKGEPVIVWMSAHGGGGAFYYHN 242
Query: 431 LEY--IEGNKAIVYSSKNGHASFPHPGTY 457
LE ++ +++S++ HA++P G +
Sbjct: 243 LEKYALDPRHPVIFSARGTHANYPSVGQH 271
>gi|401838252|gb|EJT41969.1| VPS62-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 471
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 99/266 (37%), Gaps = 62/266 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 186 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 238
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG AY E IE K +++S++
Sbjct: 239 RF--YKGEPQFLWMSAHGGG---TAYKFEAIEKIKRLRRIDGKLTNEVIKKPLIFSARGT 293
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQ---YELVAAEYL----GEGVVAE 500
HA + G + + + + R L+ D S+ Y + E + E
Sbjct: 294 HAHYASVGQHSHDVPFFFMPLSDFTDRGPLW-DPSLNFYAYTITVGETMTPCGAEEAKMG 352
Query: 501 PSWLQFMRKWGPTIV--YDSKTE---------------LDKIIKLLPLMIRYSVENAVSK 543
WL F WG + +D + + L K + + L R+ N
Sbjct: 353 LDWLSFKGAWGDKQLKSWDPRQKWCPFQWKYIDGPRGPLFKNMARVSLCQRFKWWNFWKG 412
Query: 544 LPLELYGEEGPTGPKEKNNWVGDERG 569
P Y ++G EKN+ VGD G
Sbjct: 413 CPARRYIKKGEGLDAEKNDLVGDNCG 438
>gi|255717709|ref|XP_002555135.1| KLTH0G02222p [Lachancea thermotolerans]
gi|238936519|emb|CAR24698.1| KLTH0G02222p [Lachancea thermotolerans CBS 6340]
Length = 438
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 77/347 (22%), Positives = 128/347 (36%), Gaps = 82/347 (23%)
Query: 258 AMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWF-------FTNGALLYK----AGDLV 306
++P+ + + + P V + +E Y P+ +S F +NG++L + DL
Sbjct: 54 SLPHEGFVPQFVIDNCPLVHLYSEEEYWPADISDFVRHFNIQLSNGSVLQEHPLDLQDLK 113
Query: 307 GEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKH------GNMESAKLYVHVKPAVGG 360
+ P+G+ GR +D R ++ H G +++A + V G
Sbjct: 114 SKIQGPAGA---ISGRETYLSSVDDFDKDPRWLLGHEPDYGSGQIKNAPAILIVVDK-GN 169
Query: 361 TFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQH 420
+ D + F FN +G + + G HVGDWEH +R F GE ++ S H
Sbjct: 170 GWVDAFWFYFYSFN------LGPFIMGYGPWGNHVGDWEHSLVRF--FEGEPKYLWMSAH 221
Query: 421 SGGKWVAAYDLEYIEG----------------NKAIVYSSKNGHASFPHPGTYLQGSEIL 464
GG Y + IE + +++S++ HA++ G +
Sbjct: 222 GGG---GCYRFDAIEKKTRVQYSNNHRTSKILQRPLIFSARGTHANYASVGQHAHDVPFF 278
Query: 465 GIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAEP----------SWLQFMRKWGPTI 514
+ + R L+ D S+ Y Y G V P WL +M WG
Sbjct: 279 FSALSDFTDRGPLW-DPSLNY----LAYTYNGTVVTPVSDRERELGTKWLHYMGHWGDKQ 333
Query: 515 VYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
+ S D + P+ RY +GP GP KN
Sbjct: 334 LPWS----DPRQRWCPVQWRYI---------------DGPRGPLAKN 361
>gi|159490437|ref|XP_001703183.1| hypothetical protein CHLREDRAFT_180327 [Chlamydomonas reinhardtii]
gi|158270723|gb|EDO96559.1| predicted protein [Chlamydomonas reinhardtii]
Length = 5850
Score = 46.2 bits (108), Expect = 0.041, Method: Composition-based stats.
Identities = 30/100 (30%), Positives = 43/100 (43%), Gaps = 8/100 (8%)
Query: 97 PLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLP 156
P VL D A++ EG + P L +P++Y VW + + + W P
Sbjct: 3071 PTSVVVLRDSDPAAALTEG-------RPPPLARPINYVRVWHDEHRERLGEDTYLVLWRP 3123
Query: 157 QPPDGYKSMGFLVTKTPNKPELD-EVRCVRDDLTDKCEVH 195
PP GY +G + P PE +RCVR DL E+
Sbjct: 3124 VPPAGYVPLGLVARVGPAPPEPGLPIRCVRADLVAADELQ 3163
Score = 42.7 bits (99), Expect = 0.47, Method: Composition-based stats.
Identities = 43/164 (26%), Positives = 67/164 (40%), Gaps = 16/164 (9%)
Query: 56 FNFIW-SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAE 114
F +W S + + ++ +F+ P G P G+ LG P V + +D + A
Sbjct: 5656 FKLVWRSPSTRLASSRAVSFWRPVG-PPGYTMLGDVVVLGRDPPPRPVRMYKDAPALAAA 5714
Query: 115 GAHTSNLFKSPA-----LQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLV 169
+ S +PA L P+ +TLV+ + + + W P P GY MG +
Sbjct: 5715 AVNASGSGDAPASEGPRLAPPVGWTLVF------RDSSQPPLTLWRPVAPRGYVEMGCIA 5768
Query: 170 TKTPNKPELDEVRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVW 213
+P L VRC+R DL V A+ +SS S W
Sbjct: 5769 WPEIEEPPLGLVRCLRRDLAAPGRVFE---SAVWGAASSDNSFW 5809
>gi|452840256|gb|EME42194.1| carbohydrate-binding module family 18 protein [Dothistroma
septosporum NZE10]
Length = 610
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 109/287 (37%), Gaps = 68/287 (23%)
Query: 305 LVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTD 364
+ G+ DP N P G D +DL +D + G A + + V P G D
Sbjct: 306 ISGDCYDPY--NPPRGPHPDLRRGLDLNND--KNAPDPGGRSGAPVILVVVPKEDG-IVD 360
Query: 365 IVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG- 423
+ F FN L ++N+ F G HVGDWEH +R N G+ + ++ S+H G
Sbjct: 361 AFWFFFYSFN----LGQKVLNIRF---GNHVGDWEHTCVRFRN--GKPYQVFLSEHDFGQ 411
Query: 424 --------KWVAAYD--------------LEYIEGNKAIVYSSKNGHASFPHPGTYLQGS 461
K++ +YD +Y + + ++YS+ HA + PG LQ
Sbjct: 412 AYTWSALEKYIPSYDGSGTMIGSWSNETAAQYAK--RPVIYSAIGSHAMYGTPG--LQ-P 466
Query: 462 EILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAE-------PSWLQFMRKWGPTI 514
IL G+ +D D + + +Y + V A W + WG
Sbjct: 467 YILPFGLLHDETDRGPLWDPILNVQSYTYDYKEQKVRASTLNPNSPTGWFDYGGHWGD-- 524
Query: 515 VYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
K PL + ++ + + GPTGPK KN
Sbjct: 525 ------------KYYPLS-----DPRQYRIAGQYHYVNGPTGPKFKN 554
>gi|358053973|dbj|GAA99938.1| hypothetical protein E5Q_06641 [Mixia osmundae IAM 14324]
Length = 1207
Score = 46.2 bits (108), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 31/139 (22%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYI--EGNKAIVYSSKNG 447
+ G HVGDWE+ +R N G ++++S H G AY + + ++ I Y +
Sbjct: 168 RFGNHVGDWENSMIRFVN--GVPQAVFYSAHDDGD---AYTFDTVNQSNSRPIGYIAMGS 222
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEY--------------- 492
HA++P PG++ S+ +G + + Y DS ++ VAA Y
Sbjct: 223 HANYPTPGSH-NISQAIGQLLTD-------YTDSGTLWD-VAANYQAFWFSGSNDSPNFT 273
Query: 493 LGEGVVAEPSWLQFMRKWG 511
+ +G +P +L ++ KWG
Sbjct: 274 VADGSTFDPGFLYYIGKWG 292
>gi|393247863|gb|EJD55370.1| hypothetical protein AURDEDRAFT_179129 [Auricularia delicata
TFB-10046 SS5]
Length = 405
Score = 46.2 bits (108), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASF 451
G H+GDWEH +R N GE +++ S HS G E I+G A+ YS++ HA +
Sbjct: 192 GNHIGDWEHTMIRFKN--GEPQAMHLSAHSDGHAWKMSCFETIDGRPAM-YSARGSHAGY 248
Query: 452 PHPGT 456
P G+
Sbjct: 249 PRAGS 253
>gi|367026245|ref|XP_003662407.1| hypothetical protein MYCTH_2303000 [Myceliophthora thermophila ATCC
42464]
gi|347009675|gb|AEO57162.1| hypothetical protein MYCTH_2303000 [Myceliophthora thermophila ATCC
42464]
Length = 545
Score = 46.2 bits (108), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 14/104 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++N+ F G HVGDWEH +R + T +++ S
Sbjct: 322 GAGILDAFWFFFYSYN----LGQTVLNIRF---GNHVGDWEHCMVRFQHGTPR--AMFLS 372
Query: 419 QHSGGK---WVAAYDLEYIEGNKA--IVYSSKNGHASFPHPGTY 457
+H+GGK W A +G A ++YS+ HA + PG +
Sbjct: 373 EHAGGKAYLWKALEKRTQKDGKPARPVIYSAVGSHAMYASPGMH 416
>gi|358393115|gb|EHK42516.1| hypothetical protein TRIATDRAFT_137431 [Trichoderma atroviride IMI
206040]
Length = 439
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 44/77 (57%), Gaps = 9/77 (11%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE----YIEGNKAIVYSSKNG 447
G HVGDWEH +R + G+ IY+SQH+GG AAY+ ++ + +V+S+
Sbjct: 186 GDHVGDWEHNMIRFRD--GKPTGIYYSQHAGG---AAYEWNDNTLSMKSGRPLVFSAYGS 240
Query: 448 HASFPHPGTYLQGSEIL 464
HA++ G ++ S ++
Sbjct: 241 HANYATSGDHIHDSALI 257
>gi|363748142|ref|XP_003644289.1| hypothetical protein Ecym_1225 [Eremothecium cymbalariae
DBVPG#7215]
gi|356887921|gb|AET37472.1| hypothetical protein Ecym_1225 [Eremothecium cymbalariae
DBVPG#7215]
Length = 449
Score = 45.8 bits (107), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 69/287 (24%), Positives = 107/287 (37%), Gaps = 59/287 (20%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV--------GEAIDPSG- 314
+I I + P V + +EVYLP V+ + L + G++V + D +G
Sbjct: 75 EIPQYIVDSCPLVHLYSEEVYLPGDVADYVQYFELRDELGNIVKRGPLDIKADFSDITGE 134
Query: 315 -------SNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVM 367
++L N + D P G I K +A L V K G + D
Sbjct: 135 KSKNLYLTSLDDVTTNPKWLFGDKPDYGTGYIAK----AAANLIVVDK---GNGWVDAFW 187
Query: 368 WVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG---- 423
+ F PFN +G F G H+GDWEH +R + GE ++ S HS G
Sbjct: 188 FYFYPFN------LGPFVTGFGPWGNHIGDWEHSVVRYWD--GEPQYLWMSAHSDGGGYL 239
Query: 424 --------KW-VAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR 474
W V +E + +++S++ HA++P G Y + + R
Sbjct: 240 YSAVEKKDHWKVVDGKIEKSVVRRPLIFSARGTHANYPSIGQYAHDIPFFFSPLSDFTDR 299
Query: 475 SNLYVDSSIQYELVAAEYLGEGVVAEP----------SWLQFMRKWG 511
L+ D ++ + Y G A P WL F WG
Sbjct: 300 GTLW-DPAMNF----YAYTYNGSTATPHGDTEKRIGTDWLYFQGHWG 341
>gi|310792731|gb|EFQ28192.1| vacuolar protein sorting-associated protein [Glomerella graminicola
M1.001]
Length = 539
Score = 45.8 bits (107), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 14/101 (13%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L ++ + F G HVGDWEH +R N G I+FS
Sbjct: 311 GSGIVDAFWFFFYSYN----LGQTVLTIRF---GNHVGDWEHCMMRFEN--GVPRGIFFS 361
Query: 419 QHSGGKWVAAYDLEYIE--GNKAIVYSSKNGHASFPHPGTY 457
+H GG+ AY +E G + ++YS+ HA + G +
Sbjct: 362 EHEGGQ---AYTYNAVEKRGLRPVIYSAVGSHAMYAQAGDH 399
>gi|302690726|ref|XP_003035042.1| hypothetical protein SCHCODRAFT_256014 [Schizophyllum commune H4-8]
gi|300108738|gb|EFJ00140.1| hypothetical protein SCHCODRAFT_256014 [Schizophyllum commune H4-8]
Length = 333
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 34/65 (52%), Gaps = 3/65 (4%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASF 451
G HVGDWEH +R + GE +Y S HS G + YD EG + VY + HA++
Sbjct: 155 GNHVGDWEHSMVRFVD--GEPTHVYLSAHSSG-YAYTYDAIAKEGKRPKVYVADGTHANY 211
Query: 452 PHPGT 456
GT
Sbjct: 212 AITGT 216
>gi|317151396|ref|XP_001824635.2| vacuolar protein sorting-associated protein 62 [Aspergillus oryzae
RIB40]
Length = 349
Score = 45.8 bits (107), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 82/204 (40%), Gaps = 33/204 (16%)
Query: 265 IHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVG-EAID--PSGSNLPSGG 321
+ + + Y P V+ H +E Y+PS + G L +V + ID PS + L +
Sbjct: 40 VPSYVLEYAPMVWLHSEEAYMPSDI------GEQLVHTTPMVNWKPIDKAPSATTLDNLD 93
Query: 322 R--NDGEFWIDLPSD------------GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVM 367
+ N G + L S GG + + G + A + G D
Sbjct: 94 QFNNLGNTSVYLTSKEGIDADPQPSWFGGVKPDQDGRTQDAISSTIILRDHGDGTLDAFY 153
Query: 368 WVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVA 427
+ F FN T V + G H+GDWEH +R G +I++SQH+ G+
Sbjct: 154 FYFYAFNQGNT-------VLAMEFGDHIGDWEHNMIRFSE--GVPQAIWYSQHASGQAFT 204
Query: 428 AYDLEYIEGNKAIVYSSKNGHASF 451
E I G + I YS HA++
Sbjct: 205 YGATEKI-GKRPIAYSGNGTHANY 227
>gi|361125405|gb|EHK97450.1| putative vacuolar protein sorting-associated protein TDA6 [Glarea
lozoyensis 74030]
Length = 288
Score = 45.8 bits (107), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 9/128 (7%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
++G H+GDWEH +R + G +++SQHSGG+ E + G + + YS+K HA
Sbjct: 111 ELGDHIGDWEHNMIRFKD--GVPQHVWYSQHSGGQAFTYRATEKV-GKRPVSYSAKGSHA 167
Query: 450 SFPHPGTYLQGSEILGI--GVRNDAARSNLYVD---SSIQYELVAAEYLGEGVVAEP-SW 503
++ G + L + G+ D + D SS Y ++ EG+ P +
Sbjct: 168 NYAIGGKHDHTIPGLNLPDGLIVDHTSKGVSWDPTLSSYYYNFKPSDGSFEGINNSPVAP 227
Query: 504 LQFMRKWG 511
+ F KWG
Sbjct: 228 MNFKGKWG 235
>gi|254574542|ref|XP_002494380.1| Vacuolar protein sorting (VPS) protein required for cytoplasm to
vacuole targeting of proteins [Komagataella pastoris
GS115]
gi|238034179|emb|CAY72201.1| Vacuolar protein sorting (VPS) protein required for cytoplasm to
vacuole targeting of proteins [Komagataella pastoris
GS115]
Length = 424
Score = 45.4 bits (106), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 125/337 (37%), Gaps = 58/337 (17%)
Query: 252 LDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV--GEA 309
L P+ + Q+ + Y P V + +E Y P ++ + + ++ G V G+
Sbjct: 46 LFPEHKRIIQDGQVPYYVVEYAPLVHLYSEEKYFPCDIARYVDHFHAVFGNGSDVPGGQH 105
Query: 310 IDP---SGSNLPSGGRNDGEFWIDLPSDGGRQ----IVKHGNMESAKLYVHVKPAV---- 358
++ S N S + E ++ +D + +H + + PAV
Sbjct: 106 LNIGALSKLNHYSAAKEGSEVYLMANTDFDKNPDYITGRHNKPKYINGEIEEAPAVLIVV 165
Query: 359 --GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSI 415
G + D + F FN GP + G G HVGDWEH +R N G +
Sbjct: 166 DKGNGWVDAFWFYFYGFNLGPFVMGTG-------PFGNHVGDWEHSLVRFYN--GSPIIV 216
Query: 416 YFSQHSGGKWVAAYDLE--YIEGNKAIVYSSKNGHASFPHPGT------YLQGSEILGIG 467
+ S H GG +LE Y + + +++S++ HA++ G + S+ G
Sbjct: 217 WMSAHGGGGSYFYKNLEKHYNDDKRPVIFSARGTHANYASVGQQNHDLPWAMLSDFTDRG 276
Query: 468 VRNDAARSNL---YVDSSIQYELVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDK 524
D ++ L Y D I Y + E WL F +WG + S D+
Sbjct: 277 PLWDPTKNFLAYTYSDGRITYANGSHPKREE---RYGDWLYFEGRWGDNKLPSS----DE 329
Query: 525 IIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
K P +Y GPTGP KN
Sbjct: 330 RQKWSPFEWKYIA---------------GPTGPLSKN 351
>gi|448099225|ref|XP_004199090.1| Piso0_002497 [Millerozyma farinosa CBS 7064]
gi|359380512|emb|CCE82753.1| Piso0_002497 [Millerozyma farinosa CBS 7064]
Length = 438
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 82/197 (41%), Gaps = 28/197 (14%)
Query: 271 NYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDP-SGSNLPSGGRN------ 323
+Y P V + +E Y+P ++ F N Y G V + D ++LP R+
Sbjct: 80 DYAPLVHLYSEERYMPYDIAKFVKNFHATYPNGTKVKDTPDELQITDLPKLPRSPDIYLT 139
Query: 324 -DGEF-----WIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN-GP 376
+ +F WI + I + A L V K G + D + F FN GP
Sbjct: 140 SNSDFDKDPKWITGAHNVPSLIDGKIDKAPASLIVVDK---GNGWVDAFWFYFYSFNLGP 196
Query: 377 GTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE--YI 434
+ G G HVGDWEH +R + GE ++ S H GG +LE +
Sbjct: 197 FVMGSG-------PYGNHVGDWEHSLVRF--YKGEPIIVWMSAHGGGGAYFYRNLEKYKL 247
Query: 435 EGNKAIVYSSKNGHASF 451
+ I++S++ HA++
Sbjct: 248 DRRHPIIFSARGTHANY 264
>gi|440632905|gb|ELR02824.1| hypothetical protein GMDG_05760 [Geomyces destructans 20631-21]
Length = 515
Score = 45.4 bits (106), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 46/90 (51%), Gaps = 10/90 (11%)
Query: 364 DIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGG 423
D V++ F F L + V F G HVGDWEH ++ N E +++ SQH+GG
Sbjct: 190 DGVIFAFWFFFYSYNLGNSVFGVGF---GDHVGDWEHSVMKFRNGVPE--TMFISQHTGG 244
Query: 424 KWVAAYDLEYIE--GNKAIVYSSKNGHASF 451
+ Y +E G + ++YS+K HA +
Sbjct: 245 R---TYTFGAMEKYGKRPVIYSAKGSHAMY 271
>gi|154302312|ref|XP_001551566.1| hypothetical protein BC1G_09940 [Botryotinia fuckeliana B05.10]
gi|347828985|emb|CCD44682.1| similar to vacuolar sorting-associated protein [Botryotinia
fuckeliana]
Length = 537
Score = 45.4 bits (106), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 6/72 (8%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----NKAIVYSSK 445
+ G HVGDWEH +R N GE +++FS H GG +E +G + ++YS+
Sbjct: 331 RFGNHVGDWEHSLIRFYN--GEPKAVFFSAHFGGLGYHYKAVEKGQGPGREGRPVIYSAV 388
Query: 446 NGHASFPHPGTY 457
HA + PGT+
Sbjct: 389 GSHAMYATPGTH 400
>gi|449459858|ref|XP_004147663.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101213129
[Cucumis sativus]
Length = 4194
Score = 45.4 bits (106), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
KPL +T V + G+G E AFFW P P GY S G +V++T P LD V C R DL
Sbjct: 2261 KPLQFTKV--AHIFGKGFDE--AFFWYPIAPPGYASFGCVVSRTDEAPCLDSVCCPRMDL 2316
Query: 189 TDKCEVHHL 197
+ + +
Sbjct: 2317 VSQANIFEM 2325
>gi|146412768|ref|XP_001482355.1| hypothetical protein PGUG_05375 [Meyerozyma guilliermondii ATCC
6260]
gi|146393119|gb|EDK41277.1| hypothetical protein PGUG_05375 [Meyerozyma guilliermondii ATCC
6260]
Length = 442
Score = 45.4 bits (106), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 91/230 (39%), Gaps = 41/230 (17%)
Query: 254 PKLHAMPNCDQ-------IHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV 306
P + +MP Q I I +Y P V + +E YLP + F + Y G ++
Sbjct: 60 PPIISMPKKSQRTLELGEIPDYILDYAPLVHLYSEERYLPYDIKKFVRHLHAEYDNGTII 119
Query: 307 -----GEAID-----PSGSNLPSGGRNDGEF---WIDLPSDGGRQIVKHGNMESAKLYVH 353
G +ID P +L D WI + I A L V
Sbjct: 120 EGTEHGCSIDTLEELPRLPDLYLSSNTDFNTDPEWITGSHNKPSLIDGEIKDAPATLIVV 179
Query: 354 VKPAVGGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGEL 412
K G + D + F FN GP + G G HVGDWEH +R + G+
Sbjct: 180 DK---GNGWVDAFWFYFYSFNLGPFVMGSG-------PYGNHVGDWEHSLVRY--YKGKP 227
Query: 413 WSIYFSQHSGGKWVAAYDLEYIEG-----NKAIVYSSKNGHASFPHPGTY 457
++ S H GG A+ E++E N +++S++ HA++P G +
Sbjct: 228 IIVWMSAHGGG---GAFFYEHLEKWELDPNHPVIFSARGTHANYPSVGQH 274
>gi|395334903|gb|EJF67279.1| hypothetical protein DICSQDRAFT_142837 [Dichomitus squalens
LYAD-421 SS1]
Length = 335
Score = 45.1 bits (105), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 122/317 (38%), Gaps = 47/317 (14%)
Query: 263 DQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGA----LLYKAGDLVGEAIDPSGSNLP 318
D + + + + P + + E + PS V+ T+ A + E I S++
Sbjct: 29 DPVPSFVLEHAPLSYLYSKEQWWPSDVAVHLTHVTPQVNFTNIADAVTFETIPSLASDVF 88
Query: 319 SGGRNDGEFWID-LPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPG 377
++D D + +D G+ + + + + KP G D + F ++ G
Sbjct: 89 LTSKDDVTTSPDWITNDAGKPDANGLSTAPSTIVLVNKP---GGILDAFFFYFYSYDHGG 145
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE-- 435
++++ F G HVGDWEH +R N G Y S HSGG +AY + +
Sbjct: 146 K----VLDIEF---GDHVGDWEHTMVRFTN--GTPTEFYLSAHSGG---SAYTFDAVNKT 193
Query: 436 GNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAA--------RSNLYVDSSIQYEL 487
+ I Y HA++ PG + + + DA+ + D+S Q
Sbjct: 194 NGRPISYIGVGTHANYATPGKHCHDLPGDLLCDQTDASFVWDLAANYRAFWFDNSTQTFS 253
Query: 488 VA--AEYLGEGVVAEPS-WLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKL 544
VA A + + AE + WLQ+ WG D + ++ + L
Sbjct: 254 VAGGAGSGAQSIDAEGAGWLQYEGAWGDEQYPDDEHGQYCLV--------------IGSL 299
Query: 545 PLELYGEEGPTGPKEKN 561
E GPTGP +KN
Sbjct: 300 VDECQFVSGPTGPIKKN 316
>gi|145353925|ref|XP_001421248.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581485|gb|ABO99541.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 4434
Score = 45.1 bits (105), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 7/65 (10%)
Query: 130 PLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLT 189
P + +W SD+ FW P PDGY ++G +V + P D V CVR+DLT
Sbjct: 2832 PSGFENIWTSDKND-------VSFWKPIAPDGYVAVGNVVAASAEMPSTDCVVCVREDLT 2884
Query: 190 DKCEV 194
EV
Sbjct: 2885 KLAEV 2889
>gi|294814724|ref|ZP_06773367.1| DUF946 domain-containing protein [Streptomyces clavuligerus ATCC
27064]
gi|326443104|ref|ZP_08217838.1| hypothetical protein SclaA2_18663 [Streptomyces clavuligerus ATCC
27064]
gi|294327323|gb|EFG08966.1| DUF946 domain-containing protein [Streptomyces clavuligerus ATCC
27064]
Length = 452
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 86/212 (40%), Gaps = 38/212 (17%)
Query: 46 GEIEVCRISRFNFIWSCNLLQSKKKS-ATFYEP------AGIPDGFYSLGHYCQFDSRPL 98
G I + + + IW N S+ S + +EP A I ++LGH + +
Sbjct: 12 GAITIAPTATYWNIW--NDRGSRAHSDVSIFEPRVKPFFADIGQKGFTLGHVARRSHDDI 69
Query: 99 RGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQP 158
G L S H LF P+ + LVW +D G + + A W P P
Sbjct: 70 NGKSATVMPLYVS----VHPPELFA-----PPVRWDLVW-TDRGSGAHSD--ASIWRPVP 117
Query: 159 PDGYKSMGFLVTKTPNKPELDEVRCVRD-DLTDKCEVHHL-----IFDAISKFSSSPFSV 212
P GY ++G + + N P L + RCV++ D+ V I++ S S S+
Sbjct: 118 PPGYVALGDVCVRGYNAPSLPDYRCVKNGDVRGHTYVREATIGGCIWNDRGTGSGSSVSL 177
Query: 213 WSTRPCN-------RGMLGRGVSVGTFFCSSN 237
WS P N R +LG V TF +SN
Sbjct: 178 WSITPPNYPPDNVERLILG----VDTFVANSN 205
>gi|408392104|gb|EKJ71466.1| hypothetical protein FPSE_08399 [Fusarium pseudograminearum CS3096]
Length = 448
Score = 45.1 bits (105), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 93/230 (40%), Gaps = 51/230 (22%)
Query: 269 IRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAID-PSGSNLPSGGRNDGEF 327
+ ++ P V+ HP++ ++PS ++ +N L G+ I PS +L + G
Sbjct: 32 VASHAPLVWLHPEDRFMPSDLAAHISN-----TIPKLDGQPISGPSSLDLENLG------ 80
Query: 328 WIDLPSDGGRQIVKH-------------GNMESAKLYVH--VKPAV-----GGTFTDIVM 367
DL GG ++ G AK +H V AV D
Sbjct: 81 --DLNGYGGEEVALTSKEDPLTYPTWILGEAPDAKGKIHNSVPCAVILVEKSDIDVDAFY 138
Query: 368 WVFCPFN-GPGTLKV---------GIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYF 417
+ F FN GP +V G G HVGDWEH +R N G +Y+
Sbjct: 139 FYFYSFNEGPNITQVLEPINRFIAGDHLSTGIHFGNHVGDWEHNMVRFHN--GTPVGVYY 196
Query: 418 SQHSGG---KWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEIL 464
SQH G KW ++ + +G + +VYS+ HA++P G + + +
Sbjct: 197 SQHVDGVGFKWDSS-KINITDG-RPVVYSAAGSHANYPSKGHQIHNAALF 244
>gi|340519449|gb|EGR49687.1| predicted protein [Trichoderma reesei QM6a]
Length = 419
Score = 45.1 bits (105), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 45/77 (58%), Gaps = 9/77 (11%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY----IEGNKAIVYSSKNG 447
G HVGDWEH +R + G+ I++SQH G AAY ++ ++ ++ +V+S+
Sbjct: 190 GDHVGDWEHNMIRFRD--GKPIGIWYSQHVTG---AAYKWDHKALSMKDDRPLVFSAYGS 244
Query: 448 HASFPHPGTYLQGSEIL 464
HA++ PG ++ S ++
Sbjct: 245 HANYASPGDHIHDSALI 261
>gi|50552408|ref|XP_503614.1| YALI0E06083p [Yarrowia lipolytica]
gi|49649483|emb|CAG79195.1| YALI0E06083p [Yarrowia lipolytica CLIB122]
Length = 405
Score = 45.1 bits (105), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 76/339 (22%), Positives = 124/339 (36%), Gaps = 71/339 (20%)
Query: 261 NCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSG 320
N Q+ + + P V + +E YLP + + TN + +G V + L
Sbjct: 29 NPGQVPDYVLRHCPGVHLYSEEKYLPGDPADYVTNFKIDTASGKTVLNGSESDPVTLARL 88
Query: 321 GR------------------NDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTF 362
G+ N WI +G + G ++ AK V + G
Sbjct: 89 GQYSFEKHSSRTFMTSLTDFNADPDWIT--GEGNIPHLVSGKIDHAKS-VLIVVDKGDGL 145
Query: 363 TDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSG 422
D + F PFN +G + G HVGDWEH +R + ++ ++ S H G
Sbjct: 146 VDAFWFYFYPFN------LGPFVMGGGPYGNHVGDWEHSLVRFQDGVPQI--LWMSAHGG 197
Query: 423 GKWVAAYDLEYIEGN--KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRND-AARSNLY- 478
G E ++ + + +++S++ HA++ G+ Q S L + +D R L+
Sbjct: 198 GNAYYYNAAEKMDNDPERRVLFSARGTHANY---GSVGQHSHDLPFYMLSDFTDRGALWD 254
Query: 479 -VDSSIQYELVAAE---------------YLGEGVVAEPSWLQFMRKWGPTIVYDSKTEL 522
V + Y L E +G PSWL ++ +WG + +S
Sbjct: 255 PVQNYYGYTLTHRERGRTGADRIWFDGRVLTSDGQSPHPSWLNYLGRWGDRQLPNS---- 310
Query: 523 DKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
D K P RY +GP GP KN
Sbjct: 311 DPRQKWHPFNWRYI---------------DGPRGPMAKN 334
>gi|145353997|ref|XP_001421283.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581520|gb|ABO99576.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1558
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 7/65 (10%)
Query: 130 PLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLT 189
P + +W SD+ FW P PDGY ++G +V + P D V CVR+DLT
Sbjct: 465 PSGFENIWTSDKNDVS-------FWKPIAPDGYVAVGNVVAASAEMPSTDCVVCVREDLT 517
Query: 190 DKCEV 194
EV
Sbjct: 518 KLAEV 522
>gi|302839248|ref|XP_002951181.1| hypothetical protein VOLCADRAFT_105023 [Volvox carteri f.
nagariensis]
gi|300263510|gb|EFJ47710.1| hypothetical protein VOLCADRAFT_105023 [Volvox carteri f.
nagariensis]
Length = 5909
Score = 45.1 bits (105), Expect = 0.10, Method: Composition-based stats.
Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 13/139 (9%)
Query: 56 FNFIWSCNLLQ-SKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARD-----LA 109
F +W+ + S +S + + P G P G+ +LG P V + +D A
Sbjct: 5723 FKLVWAATGSRASGGRSMSIWRPVG-PPGYVALGDVAVTGREPPPRPVRLYKDSPALAAA 5781
Query: 110 SSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLV 169
+ A G+ + ++P L P Y+L++ + + W P+PP GY MG +
Sbjct: 5782 AVNASGSGDAPASEAPRLAPPAGYSLLF------RDSARPPLTLWRPEPPQGYVEMGCVA 5835
Query: 170 TKTPNKPELDEVRCVRDDL 188
+P L VRC+R DL
Sbjct: 5836 WPEIEEPPLGLVRCLRKDL 5854
Score = 41.2 bits (95), Expect = 1.7, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 32/68 (47%), Gaps = 2/68 (2%)
Query: 123 KSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLV-TKTPNKPELD-E 180
+ P L +P+++ VW + + G W P PP GY +G L + P +
Sbjct: 3177 RPPPLARPINFIRVWVDEHRERQGDPGALVLWRPVPPAGYVPLGLLAGLGSARVPSVQIP 3236
Query: 181 VRCVRDDL 188
+RCVR DL
Sbjct: 3237 LRCVRADL 3244
>gi|189210944|ref|XP_001941803.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977896|gb|EDU44522.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 687
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 61/141 (43%), Gaps = 16/141 (11%)
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN 437
L + NV F G HVGDWEH +R + G+ +++FS+HS G+ A+D G
Sbjct: 470 NLGNAVFNVRF---GNHVGDWEHTVVRFQH--GKPKAVFFSEHSFGE-AYAWDAVEKSGQ 523
Query: 438 KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGV 497
+ I +S+ HA + G + +L G+ +D D + +Y + +
Sbjct: 524 RPIGFSATGTHAMYATAGLH---PYVLPGGILHDVTDRGPLWDPVMNMYSYKYDYQHDKL 580
Query: 498 -------VAEPSWLQFMRKWG 511
VA SW F WG
Sbjct: 581 VPSNLNPVAPVSWFYFAGHWG 601
>gi|330916319|ref|XP_003297375.1| hypothetical protein PTT_07754 [Pyrenophora teres f. teres 0-1]
gi|311329983|gb|EFQ94531.1| hypothetical protein PTT_07754 [Pyrenophora teres f. teres 0-1]
Length = 536
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 61/141 (43%), Gaps = 16/141 (11%)
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN 437
L + NV F G HVGDWEH +R + G+ +++FS+HS G+ A+D G
Sbjct: 319 NLGNAVFNVRF---GNHVGDWEHTVVRFQH--GKPKAVFFSEHSFGE-AYAWDAVEKSGK 372
Query: 438 KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGV 497
+ I +S+ HA + G + +L G+ +D D + +Y + +
Sbjct: 373 RPIGFSATGTHAMYATAGLH---PYVLPGGILHDVTDRGPLWDPVMNMYSYKYDYQDDKL 429
Query: 498 -------VAEPSWLQFMRKWG 511
VA SW F WG
Sbjct: 430 VPSNLNPVAPVSWFYFAGHWG 450
>gi|242090405|ref|XP_002441035.1| hypothetical protein SORBIDRAFT_09g019165 [Sorghum bicolor]
gi|241946320|gb|EES19465.1| hypothetical protein SORBIDRAFT_09g019165 [Sorghum bicolor]
Length = 105
Score = 45.1 bits (105), Expect = 0.11, Method: Composition-based stats.
Identities = 17/37 (45%), Positives = 26/37 (70%)
Query: 265 IHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYK 301
+HA++ Y V+ H D+ Y PS V+WFF NGAL+++
Sbjct: 68 VHAILAAYSLLVYLHQDKSYPPSLVTWFFMNGALVHR 104
>gi|365765416|gb|EHN06924.1| Vps62p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 419
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 99/267 (37%), Gaps = 64/267 (23%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F P N GP + G G HVGDWEH +
Sbjct: 134 HIKKAPAVLFVVDKGNGWVDAFWFYFYPXNWGPYIMGSG-------PWGNHVGDWEHSLV 186
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 187 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 241
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 242 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 299
Query: 503 ---WLQFMRKWGP-----------------TIVYDSKTELDKIIKLLPLMIRYSVENAVS 542
WL F WG + K L K ++ + L R+ N
Sbjct: 300 GLEWLSFKGAWGDKQLRPRDPRQKWCPFQWKYIDGPKGPLFKNMERVSLCQRFKWWNFWK 359
Query: 543 KLPLELYGEEGPTGPKEKNNWVGDERG 569
P Y + G EKN+ VGD G
Sbjct: 360 GCPARRYIKRGEGLDAEKNDLVGDNCG 386
>gi|326471523|gb|EGD95532.1| hypothetical protein TESG_03008 [Trichophyton tonsurans CBS 112818]
Length = 384
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 89/231 (38%), Gaps = 28/231 (12%)
Query: 239 ISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGAL 298
IS + IA + L P A + + Y P ++ H + Y PS + N
Sbjct: 26 ISAASVYIAAEQCLAPAPSAT-----VPDYVMEYAPLIWIHSQDPYQPSDIGAQLKNSIP 80
Query: 299 LYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIV----------KHGNMESA 348
+ + D + +NL + G + L S G Q + K G E A
Sbjct: 81 MINHVAIPNVPADITLNNLDQLNKLGGNK-VCLTSKEGIQALPAWFRGVKPNKDGKTEGA 139
Query: 349 KLYVHVKPAVG--GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRIC 406
V G G D + F +N T ++ + F G HVGDWEH +R
Sbjct: 140 VSSAIVVREHGDEGKTVDAFYFYFYAYNQGNT----VLGIEF---GDHVGDWEHNMIRFR 192
Query: 407 NFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
+ G+ +I++SQH+ G+ Y G + I +S HA + GT+
Sbjct: 193 D--GKPEAIWYSQHAAGQ-AFQYSATDKRGVRPIGFSGNGTHAVYATAGTH 240
>gi|328353791|emb|CCA40188.1| Putative vacuolar protein sorting-associated protein YPR157W
[Komagataella pastoris CBS 7435]
Length = 626
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 49/210 (23%), Positives = 86/210 (40%), Gaps = 27/210 (12%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV--GEAIDP---SGSNLP 318
Q+ + Y P V + +E Y P ++ + + ++ G V G+ ++ S N
Sbjct: 329 QVPYYVVEYAPLVHLYSEEKYFPCDIARYVDHFHAVFGNGSDVPGGQHLNIGALSKLNHY 388
Query: 319 SGGRNDGEFWIDLPSDGGRQ----IVKHGNMESAKLYVHVKPAV------GGTFTDIVMW 368
S + E ++ +D + +H + + PAV G + D +
Sbjct: 389 SAAKEGSEVYLMANTDFDKNPDYITGRHNKPKYINGEIEEAPAVLIVVDKGNGWVDAFWF 448
Query: 369 VFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVA 427
F FN GP + G G HVGDWEH +R N G ++ S H GG
Sbjct: 449 YFYGFNLGPFVMGTG-------PFGNHVGDWEHSLVRFYN--GSPIIVWMSAHGGGGSYF 499
Query: 428 AYDLE--YIEGNKAIVYSSKNGHASFPHPG 455
+LE Y + + +++S++ HA++ G
Sbjct: 500 YKNLEKHYNDDKRPVIFSARGTHANYASVG 529
>gi|413955171|gb|AFW87820.1| hypothetical protein ZEAMMB73_269056 [Zea mays]
Length = 652
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/65 (29%), Positives = 39/65 (60%), Gaps = 2/65 (3%)
Query: 153 FWLPQPPDGYKSMGFLVTK-TPNKPELDEVRCVRDDLTDKCEV-HHLIFDAISKFSSSPF 210
FW P+ P G+ ++G + +K +P K + +RC+R+D+ + ++D+ + +S PF
Sbjct: 384 FWFPKAPSGFVALGCVASKSSPTKEDFSLLRCIRNDMVAGGQFSEESLWDSSNARTSEPF 443
Query: 211 SVWST 215
S+W+
Sbjct: 444 SLWTV 448
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/139 (23%), Positives = 52/139 (37%), Gaps = 30/139 (21%)
Query: 73 TFYEPAGIPDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPL 131
+F+ P P G+ G Y + P +G + A +N+ + +++PL
Sbjct: 103 SFWRPRA-PSGYAIFGDYLTPMNEPPTKGVL-------------ALNTNIVR---VKRPL 145
Query: 132 DYTLVWCSD------------EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELD 179
Y LVW S E N + WLP P GY +MG + + +P L
Sbjct: 146 SYKLVWQSGSPRTNIFHQKDLEDKISNIDQLCSVWLPVAPAGYVAMGCVASSGTAEPPLS 205
Query: 180 EVRCVRDDLTDKCEVHHLI 198
C+ L C + I
Sbjct: 206 SAFCLTASLVSSCNLRDCI 224
>gi|47497419|dbj|BAD19476.1| vacuolar protein sorting 13C protein-like [Oryza sativa Japonica
Group]
gi|47497534|dbj|BAD19586.1| vacuolar protein sorting 13C protein-like [Oryza sativa Japonica
Group]
Length = 4190
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 10/90 (11%)
Query: 128 QKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
++P + V D G + FFW P PP GY S+G + TKT P D V C +
Sbjct: 2291 ERPTQFKKVAQIDRKGS---DEILFFWYPVPPPGYASLGCVATKTDEMPSNDSVCCPKMG 2347
Query: 188 LTDKCEVHHLIFDAISKFSSSP----FSVW 213
L + +++ D IS+ SSS +S+W
Sbjct: 2348 LVNHA---NILEDPISRSSSSKGPNCWSIW 2374
>gi|323333551|gb|EGA74945.1| Vps62p [Saccharomyces cerevisiae AWRI796]
Length = 395
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 77/192 (40%), Gaps = 47/192 (24%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 177 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 229
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 230 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 284
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVV---AEPS-- 502
HA + G + + + + R L+ D S+ Y +GE + AE +
Sbjct: 285 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNYYAYTVT-VGEKMTPCGAEETKM 342
Query: 503 ---WLQFMRKWG 511
WL F WG
Sbjct: 343 GLEWLSFKGAWG 354
>gi|115402461|ref|XP_001217307.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189153|gb|EAU30853.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 283
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R + G +I++SQH+GG+ Y +G + I YS+ HA
Sbjct: 107 EFGDHVGDWEHNMIRFWD--GVPQAIWYSQHAGGQ-AFTYAATEKKGKRPIAYSANGSHA 163
Query: 450 SFPHPGTY 457
+ G++
Sbjct: 164 VYSISGSH 171
>gi|238480786|ref|NP_001154245.1| pleckstrin homology (PH) domain-containing protein [Arabidopsis
thaliana]
gi|332658452|gb|AEE83852.1| pleckstrin homology (PH) domain-containing protein [Arabidopsis
thaliana]
Length = 4218
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 35/165 (21%), Positives = 59/165 (35%), Gaps = 36/165 (21%)
Query: 81 PDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC- 138
P GF SLG Y D P +G ++V +NL + +++PL + L+W
Sbjct: 1983 PPGFASLGDYLTPLDKPPTKGVLVV-------------NTNLMR---VKRPLSFKLIWSP 2026
Query: 139 --------SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
S + + W P+ P GY ++ +V+ P L C+
Sbjct: 2027 LASGGLGGSSMDDKDERDSSCSIWFPEAPKGYVALSCVVSSGSTPPSLASTFCILASSVS 2086
Query: 191 KCEVHHLIFDAISKFSSSPFSVWST----------RPCNRGMLGR 225
C + + + + S S + W P +LGR
Sbjct: 2087 PCSLRDCVAISSTDISQSSLAFWRVDNSVGSFLPADPSTLNLLGR 2131
>gi|218190714|gb|EEC73141.1| hypothetical protein OsI_07166 [Oryza sativa Indica Group]
Length = 3207
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 10/90 (11%)
Query: 128 QKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
++P + V D G + FFW P PP GY S+G + TKT P D V C +
Sbjct: 1303 ERPTQFKKVAQIDRKGS---DEILFFWYPVPPPGYASLGCVATKTDEMPSNDSVCCPKMG 1359
Query: 188 LTDKCEVHHLIFDAISKFSSSP----FSVW 213
L + +++ D IS+ SSS +S+W
Sbjct: 1360 LVNHA---NILEDPISRSSSSKGPNCWSIW 1386
>gi|297599214|ref|NP_001046831.2| Os02g0470400 [Oryza sativa Japonica Group]
gi|255670894|dbj|BAF08745.2| Os02g0470400 [Oryza sativa Japonica Group]
Length = 2989
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 10/90 (11%)
Query: 128 QKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
++P + V D G + FFW P PP GY S+G + TKT P D V C +
Sbjct: 1233 ERPTQFKKVAQIDRKGS---DEILFFWYPVPPPGYASLGCVATKTDEMPSNDSVCCPKMG 1289
Query: 188 LTDKCEVHHLIFDAISKFSSSP----FSVW 213
L + +++ D IS+ SSS +S+W
Sbjct: 1290 LVNHA---NILEDPISRSSSSKGPNCWSIW 1316
>gi|240255963|ref|NP_193447.5| pleckstrin homology (PH) domain-containing protein [Arabidopsis
thaliana]
gi|332658451|gb|AEE83851.1| pleckstrin homology (PH) domain-containing protein [Arabidopsis
thaliana]
Length = 4216
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 31/143 (21%), Positives = 54/143 (37%), Gaps = 26/143 (18%)
Query: 81 PDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC- 138
P GF SLG Y D P +G ++V +NL + +++PL + L+W
Sbjct: 1993 PPGFASLGDYLTPLDKPPTKGVLVV-------------NTNLMR---VKRPLSFKLIWSP 2036
Query: 139 --------SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
S + + W P+ P GY ++ +V+ P L C+
Sbjct: 2037 LASGGLGGSSMDDKDERDSSCSIWFPEAPKGYVALSCVVSSGSTPPSLASTFCILASSVS 2096
Query: 191 KCEVHHLIFDAISKFSSSPFSVW 213
C + + + + S S + W
Sbjct: 2097 PCSLRDCVAISSTDISQSSLAFW 2119
>gi|260942635|ref|XP_002615616.1| hypothetical protein CLUG_04498 [Clavispora lusitaniae ATCC 42720]
gi|238850906|gb|EEQ40370.1| hypothetical protein CLUG_04498 [Clavispora lusitaniae ATCC 42720]
Length = 431
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 88/210 (41%), Gaps = 30/210 (14%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV----------GEAIDPS 313
+I + +Y P V + +E Y P ++ F + YK G V G+ D
Sbjct: 68 EIPRYVIDYAPLVHLYSEERYWPYDIADFVEHFHATYKNGTRVTNDKLNISTLGDLPDLR 127
Query: 314 GSNLPSGGRNDGEFWID---LPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVF 370
L + D +F +D + + + G ++ A + V G + D + F
Sbjct: 128 SIYLTA----DEDFDMDPDWITGSHNKPSLLSGEIKDAPATLIVVDK-GNGWVDAFWFYF 182
Query: 371 CPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAY 429
FN GP + G G HVGDWEH +R N G ++ S H GG +
Sbjct: 183 YSFNLGPFVMGSG-------PYGNHVGDWEHSLVRFHN--GVPMIVWMSAHGGGGAYYFH 233
Query: 430 DLEY--IEGNKAIVYSSKNGHASFPHPGTY 457
+LE ++ N I++S++ HA++ GT+
Sbjct: 234 NLEKHELDPNHPIIFSARGTHANYVSVGTH 263
>gi|334186631|ref|NP_001190747.1| pleckstrin homology (PH) domain-containing protein [Arabidopsis
thaliana]
gi|332658453|gb|AEE83853.1| pleckstrin homology (PH) domain-containing protein [Arabidopsis
thaliana]
Length = 4219
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/165 (21%), Positives = 59/165 (35%), Gaps = 36/165 (21%)
Query: 81 PDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC- 138
P GF SLG Y D P +G ++V +NL + +++PL + L+W
Sbjct: 2019 PPGFASLGDYLTPLDKPPTKGVLVV-------------NTNLMR---VKRPLSFKLIWSP 2062
Query: 139 --------SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
S + + W P+ P GY ++ +V+ P L C+
Sbjct: 2063 LASGGLGGSSMDDKDERDSSCSIWFPEAPKGYVALSCVVSSGSTPPSLASTFCILASSVS 2122
Query: 191 KCEVHHLIFDAISKFSSSPFSVWST----------RPCNRGMLGR 225
C + + + + S S + W P +LGR
Sbjct: 2123 PCSLRDCVAISSTDISQSSLAFWRVDNSVGSFLPADPSTLNLLGR 2167
>gi|222622839|gb|EEE56971.1| hypothetical protein OsJ_06690 [Oryza sativa Japonica Group]
Length = 3159
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 10/90 (11%)
Query: 128 QKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
++P + V D G + FFW P PP GY S+G + TKT P D V C +
Sbjct: 1235 ERPTQFKKVAQIDRKGS---DEILFFWYPVPPPGYASLGCVATKTDEMPSNDSVCCPKMG 1291
Query: 188 LTDKCEVHHLIFDAISKFSSSP----FSVW 213
L + +++ D IS+ SSS +S+W
Sbjct: 1292 LVNHA---NILEDPISRSSSSKGPNCWSIW 1318
>gi|150951413|ref|XP_001387731.2| Vacuolar protein sorting protein 62 [Scheffersomyces stipitis CBS
6054]
gi|149388573|gb|EAZ63708.2| Vacuolar protein sorting protein 62, partial [Scheffersomyces
stipitis CBS 6054]
Length = 388
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 80/196 (40%), Gaps = 27/196 (13%)
Query: 271 NYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV-----------GEAIDPSGSNLPS 319
++ P V + +E YLP +S F TN + Y+ G + GE L S
Sbjct: 31 DFAPIVHLYSEERYLPYDISKFVTNFHVEYENGTTIPGFESLTLQKMGELPPEREIFLTS 90
Query: 320 GGRNDGE-FWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN-GPG 377
D + WI + I A L V K G + D + F FN GP
Sbjct: 91 ESDFDTDPEWITGSKNKPNLINGEIKDAPATLIVVDK---GNGWVDAYWFYFYSFNLGPF 147
Query: 378 TLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEY--IE 435
+ G G HVGDWEH +R + G+ ++ S H GG ++LE ++
Sbjct: 148 VMGSG-------PFGNHVGDWEHSLVRF--YKGQPVIVWISAHGGGGAYFYHNLEKYALQ 198
Query: 436 GNKAIVYSSKNGHASF 451
I++S++ HA++
Sbjct: 199 PTHPIIFSARGTHANY 214
>gi|158341029|ref|YP_001522196.1| hypothetical protein AM1_E0113 [Acaryochloris marina MBIC11017]
gi|158311270|gb|ABW32882.1| hypothetical protein AM1_E0113 [Acaryochloris marina MBIC11017]
Length = 264
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 47/101 (46%), Gaps = 16/101 (15%)
Query: 357 AVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIY 416
+ GG+ + W+F FNG + H GDWE T+++ N ++ +
Sbjct: 120 SAGGSHRFLQFWIFFGFNG--------------SLLSHQGDWEDITIKV-NGDDQVEGAF 164
Query: 417 FSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
S H + +L IE ++ +V+ S+ HA +P PGT+
Sbjct: 165 LSAHGDRPFFPQSEL-MIENDRVVVFCSRETHAFYPAPGTH 204
>gi|50290511|ref|XP_447687.1| hypothetical protein [Candida glabrata CBS 138]
gi|49526997|emb|CAG60632.1| unnamed protein product [Candida glabrata]
Length = 446
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 103/280 (36%), Gaps = 54/280 (19%)
Query: 274 PTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRND---GEFWID 330
P + + DE Y PS+ S + ++ LL G+ + + ++ S R D + +D
Sbjct: 71 PLIHLYSDEKYWPSNFSEYVSHFKLLDDEGEAILDQLNSIKELKQSYVRYDEDGASYEVD 130
Query: 331 LPS---------DGGRQIVKHGNMESAKLYVHVKPAV------GGTFTDIVMWVFCPFNG 375
D +K + Y PA+ G + D + F PFN
Sbjct: 131 SEDIFMTSIDDFDKDPGWMKGFKPKYGTGYAAQAPALVFVVDKGNGWVDAFWFYFYPFN- 189
Query: 376 PGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE 435
G + + G H+GDWEH +R N + I+ S H GG +AY E IE
Sbjct: 190 -----WGPFIMGYGPWGNHLGDWEHTLVRFYNGVPKY--IWMSAHGGG---SAYTYEAIE 239
Query: 436 GNKA----------------IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYV 479
K +++S++ HA++P G + + + + R L+
Sbjct: 240 KKKRLRRIEGRITNELIEKPLLFSARGTHANYPSVGQHSHDVPFFFMPLSDFTDRGPLW- 298
Query: 480 DSSIQYELVAAEYLGEGVVAE--------PSWLQFMRKWG 511
D S+ Y G+ A SWL F WG
Sbjct: 299 DPSLNYYGYTITTNGKLAPATNQTSEDLGTSWLYFQGHWG 338
>gi|401623184|gb|EJS41291.1| YPR157W [Saccharomyces arboricola H-6]
Length = 467
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 118/325 (36%), Gaps = 83/325 (25%)
Query: 244 LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAG 303
LN A + + D LH +I + + N+ P V + +E Y PS ++ F N ++
Sbjct: 60 LNPADVTDDDRTLHV---PGEIPSYVINHCPLVHLYSEEKYWPSDIAEFVQN----FRIK 112
Query: 304 DLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNM------------------ 345
D G ++ P+ NL E+ +DL +G + +
Sbjct: 113 DKRGNSM-PTKENLTLHDLK-AEYSVDLFENGTETSIPSSEVFLSSIDDFDKDPQWLLGH 170
Query: 346 --ESAKLYVHVKPAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVG 396
E Y PAV G + D + F PFN GP + G G HVG
Sbjct: 171 PPEYGTGYNSKAPAVLIVVDKGNGWVDAFWFFFYPFNHGPFIMGNG-------PWGSHVG 223
Query: 397 DWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE-------------GN------ 437
DWEH +R N G + S HS G Y E +E GN
Sbjct: 224 DWEHSLVRFHN--GVPKYFWMSAHSSG---TGYRYEAVEKFKKLRKTKRHEDGNDQGTIL 278
Query: 438 -KAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEG 496
+ +++S++ HA++ G + + + + R L+ D S+ + Y +G
Sbjct: 279 ERPLIFSARGTHANYASVGQHAHDIPFFFMPLSDFTDRGPLW-DPSLNF----YSYTFDG 333
Query: 497 VVAEPS----------WLQFMRKWG 511
PS WL F WG
Sbjct: 334 KTVTPSTEREKSLGLDWLHFQGGWG 358
>gi|452981564|gb|EME81324.1| carbohydrate-binding module family 18 protein [Pseudocercospora
fijiensis CIRAD86]
Length = 635
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 107/289 (37%), Gaps = 70/289 (24%)
Query: 305 LVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTD 364
L G DP N P G + L + + + G SA + V P G D
Sbjct: 310 LAGTCFDPY--NPPRGPHQELRRRNALDFNDDKHRPQPGGKSSAPAILIVVPKENG-IVD 366
Query: 365 IVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK 424
+ F FN L ++N+ F G HVGDWEH +R N G+ + ++ S+H+ G+
Sbjct: 367 AFWFFFYSFN----LGQTVLNIRF---GNHVGDWEHTCVRFRN--GKPYQVFLSEHNFGQ 417
Query: 425 WVAAYDLEYIE------------------------GNKAIVYSSKNGHASFPHPGTYLQG 460
AY + +E + +VYS+ HA + PG +
Sbjct: 418 ---AYTWKAMEKYIPSHDGSGTMIGSWSNETAGRMAKRPVVYSALGSHAMYGTPGLH--- 471
Query: 461 SEILGIGVRNDAA-RSNLY--VDSSIQYELVAAEYLGEGVVAEPS----WLQFMRKWGPT 513
IL G+ +D R L+ V + Y A + P+ WL + WG
Sbjct: 472 PYILPWGLLHDETDRGPLWDPVQNLQSYTYDAKNRTMRASMLNPTSPVEWLNYAGHWG-- 529
Query: 514 IVYDSKTELDKIIKLL-PLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
DK L P R++ + + GPTGPK KN
Sbjct: 530 ---------DKYYPLSDPRQYRFAG---------QYHYVNGPTGPKFKN 560
>gi|111223391|ref|YP_714185.1| hypothetical protein FRAAL3988 [Frankia alni ACN14a]
gi|111150923|emb|CAJ62630.1| hypothetical protein FRAAL3988 [Frankia alni ACN14a]
Length = 467
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 45/196 (22%), Positives = 77/196 (39%), Gaps = 31/196 (15%)
Query: 34 QGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFY-------- 85
QG+ F + G +++ RF +W + A F++P I D Y
Sbjct: 19 QGE-FMVESVRHGALDIAVTDRFVRLWDDRGSGATLHGA-FFDPL-ITDAMYAQGWRALG 75
Query: 86 SLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQG 145
S+GH D ++ R + +++A ++ P D+ LVW DE
Sbjct: 76 SVGHGGNHDDITGTRATILVRGVNAADA------------MVRPPTDFQLVW-RDEKSGA 122
Query: 146 NYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVH-----HLIFD 200
+G W P P GY ++G + +++ NKP C+R D + V LI++
Sbjct: 123 KQDGS--VWRPVAPSGYVALGHVFSRSWNKPPAHWYACIRRDFAGRRYVRDGVIGDLIWN 180
Query: 201 AISKFSSSPFSVWSTR 216
+ + VW R
Sbjct: 181 DQKSGAKTDVGVWQIR 196
>gi|302815653|ref|XP_002989507.1| hypothetical protein SELMODRAFT_447723 [Selaginella moellendorffii]
gi|300142685|gb|EFJ09383.1| hypothetical protein SELMODRAFT_447723 [Selaginella moellendorffii]
Length = 4140
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 56/137 (40%), Gaps = 21/137 (15%)
Query: 54 SRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGH-YCQFDSRPLRGFVLVARDLASSE 112
+ F+ IW+ + ++S F+ P P GF LG D P +G + + +L
Sbjct: 1956 TEFDRIWA-DEVRSGSSQIAFWRPRA-PPGFTVLGDCLTVVDEPPSKGVIAMNMNLVH-- 2011
Query: 113 AEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKT 172
L+KP+D+ LVW S + + C WLP P+GY +G + +
Sbjct: 2012 --------------LKKPVDFRLVWSSSANEDDSEQCCV--WLPIAPEGYVVLGCVASIG 2055
Query: 173 PNKPELDEVRCVRDDLT 189
+ P CV L
Sbjct: 2056 RSPPSASPTLCVLKQLV 2072
>gi|46115126|ref|XP_383581.1| hypothetical protein FG03405.1 [Gibberella zeae PH-1]
Length = 962
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 7/76 (9%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGG---KWVAAYDLEYIEGNKAIVYSSKNGH 448
G HVGDWEH +R N G +Y+SQH G KW ++ + +G + +VYS+ H
Sbjct: 687 GNHVGDWEHNMVRFHN--GTPVGVYYSQHVDGVGFKWDSS-KINITDG-RPVVYSAAGSH 742
Query: 449 ASFPHPGTYLQGSEIL 464
A++P G + + +
Sbjct: 743 ANYPSRGHQIHNAALF 758
>gi|396464307|ref|XP_003836764.1| similar to DUF946 domain-containing protein [Leptosphaeria maculans
JN3]
gi|312213317|emb|CBX93399.1| similar to DUF946 domain-containing protein [Leptosphaeria maculans
JN3]
Length = 399
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 35/68 (51%), Gaps = 3/68 (4%)
Query: 126 ALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVR 185
A+ P Y+ ++ S G G++ A W P P GY +MG ++ N P LD V CVR
Sbjct: 78 AVASPTGYSWIYDSTGTGSGSF---ACVWRPIAPAGYVAMGDVLWGGRNAPPLDHVWCVR 134
Query: 186 DDLTDKCE 193
DL + E
Sbjct: 135 SDLVTQGE 142
>gi|449299848|gb|EMC95861.1| hypothetical protein BAUCODRAFT_109511 [Baudoinia compniacensis
UAMH 10762]
Length = 546
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 18/107 (16%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN----------KA 439
+ G HVGDWEH +R N T E ++YFS+H G AY + +E +
Sbjct: 337 RFGNHVGDWEHTMIRFRNGTPE--AVYFSEHDFGD---AYTWDAVEKYLLDPNGTGFPRP 391
Query: 440 IVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYE 486
+++S+ HA + PG + IL G+ +D D + ++
Sbjct: 392 VIFSATGSHAMYARPGLH---PYILPFGLLHDQTDRGPLWDPKLNFK 435
>gi|218185030|gb|EEC67457.1| hypothetical protein OsI_34684 [Oryza sativa Indica Group]
Length = 4102
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 82/194 (42%), Gaps = 43/194 (22%)
Query: 53 ISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
++ F IWS N + S KK + + P + +G + G D+A +
Sbjct: 2124 VASFKLIWSNNGMSSPKK-LSIWRPM-LSEGMFYFG------------------DIAVNG 2163
Query: 113 AEGAHTSNLFKSPA----LQKPLDYTLVWCSDEGGQGNYEGC--AFFWLPQPPDGYKSMG 166
E +++ + ++ L+ P Y LV G + G FW PQ P G+ ++G
Sbjct: 2164 YEPPNSAVVLRNSGDDTFLRAPEGYQLV-----GRIKKHRGTEGVSFWFPQAPPGFVALG 2218
Query: 167 FLVTK-TPNKPELDEVRCVRDDLTDKCEV-HHLIFDAISKFSSSPFSVWSTRPCNRGMLG 224
+ +K +P K +L +RC+R D+ + ++D+ +S FS+W+
Sbjct: 2219 CVASKSSPAKEDLHFLRCIRSDMVKGGQFSEESVWDSSGARTSESFSLWTV--------- 2269
Query: 225 RGVSVGTFFCSSNW 238
VGTF S +
Sbjct: 2270 -DNDVGTFLVRSGF 2282
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 12/101 (11%)
Query: 96 RPL--RGFVLVARDLASSEAEGAHTSNLFKSPA--LQKPLDYTLVW--CSDEGGQGNYEG 149
RPL G+V + D+A H + ++K+ PL Y LVW C ++ Y
Sbjct: 3947 RPLCPSGYVSIG-DIAHVGIHPPHVAAVYKNVGGNFALPLGYDLVWRNCGED-----YRN 4000
Query: 150 CAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
W P+PP+GY ++G + +P LD CV + +
Sbjct: 4001 PVSIWFPRPPEGYVALGCVAVSAFEEPPLDSAFCVSERFAE 4041
>gi|222613283|gb|EEE51415.1| hypothetical protein OsJ_32493 [Oryza sativa Japonica Group]
Length = 4105
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 82/194 (42%), Gaps = 43/194 (22%)
Query: 53 ISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
++ F IWS N + S KK + + P + +G + G D+A +
Sbjct: 2127 VASFKLIWSNNGMSSPKK-LSIWRPM-LSEGMFYFG------------------DIAVNG 2166
Query: 113 AEGAHTSNLFKSPA----LQKPLDYTLVWCSDEGGQGNYEGC--AFFWLPQPPDGYKSMG 166
E +++ + ++ L+ P Y LV G + G FW PQ P G+ ++G
Sbjct: 2167 YEPPNSAVVLRNSGDDTFLRAPEGYQLV-----GRIKKHRGTEGVSFWFPQAPPGFVALG 2221
Query: 167 FLVTK-TPNKPELDEVRCVRDDLTDKCEV-HHLIFDAISKFSSSPFSVWSTRPCNRGMLG 224
+ +K +P K +L +RC+R D+ + ++D+ +S FS+W+
Sbjct: 2222 CVASKSSPAKEDLHFLRCIRSDMVKGGQFSEESVWDSSGARTSESFSLWTV--------- 2272
Query: 225 RGVSVGTFFCSSNW 238
VGTF S +
Sbjct: 2273 -DNDVGTFLVRSGF 2285
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 12/101 (11%)
Query: 96 RPL--RGFVLVARDLASSEAEGAHTSNLFKSPA--LQKPLDYTLVW--CSDEGGQGNYEG 149
RPL G+V + D+A H + ++K+ PL Y LVW C ++ Y
Sbjct: 3950 RPLCPSGYVSIG-DIAHVGIHPPHVAAVYKNVGGNFALPLGYDLVWRNCGED-----YRN 4003
Query: 150 CAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
W P+PP+GY ++G + +P LD CV + +
Sbjct: 4004 PVSIWFPRPPEGYVALGCVAVSAFEEPPLDSAFCVSERFAE 4044
>gi|413955170|gb|AFW87819.1| hypothetical protein ZEAMMB73_269056 [Zea mays]
Length = 1856
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 42/69 (60%), Gaps = 3/69 (4%)
Query: 148 EGCAFFWLPQPPDGYKSMGFLVTK-TPNKPELDEVRCVRDDLTDKCEV-HHLIFDAISKF 205
EG +F W P+ P G+ ++G + +K +P K + +RC+R+D+ + ++D+ +
Sbjct: 1584 EGVSF-WFPKAPSGFVALGCVASKSSPTKEDFSLLRCIRNDMVAGGQFSEESLWDSSNAR 1642
Query: 206 SSSPFSVWS 214
+S PFS+W+
Sbjct: 1643 TSEPFSLWT 1651
Score = 39.3 bits (90), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 12/84 (14%)
Query: 127 LQKPLDYTLVWCSD------------EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPN 174
+++PL Y LVW S E N + WLP P GY +MG + +
Sbjct: 1345 VKRPLSYKLVWQSGSPRTNIFHQKDLEDKISNIDQLCSVWLPVAPAGYVAMGCVASSGTA 1404
Query: 175 KPELDEVRCVRDDLTDKCEVHHLI 198
+P L C+ L C + I
Sbjct: 1405 EPPLSSAFCLTASLVSSCNLRDCI 1428
>gi|86159873|ref|YP_466658.1| hypothetical protein Adeh_3454 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85776384|gb|ABC83221.1| hypothetical protein Adeh_3454 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 307
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 69/197 (35%), Gaps = 30/197 (15%)
Query: 267 ALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGE 326
AL+ + P V PDE P+S SW AL G P + G
Sbjct: 42 ALLARFAPVVVLAPDEPVRPASASWLLARAALETAPGP------PPRILQASALGALAAL 95
Query: 327 FWIDLPSDGGRQIVKHGNMESA------KLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLK 380
G R ++ G + Y H A G + W PF
Sbjct: 96 VIPGTEEPGARLRLEAGAQAGSPDPADWTAYGHAYRAADGGIL-LQYWFLYPF------- 147
Query: 381 VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSG---GKWVAAYDLEYIEGN 437
N AF H GDWEH T+R+ + G + ++H+ G W A L EG+
Sbjct: 148 ----NEAFWAF-DHDGDWEHVTVRL-DAAGRPLGAWLARHADAQPGPWFAWSALRR-EGD 200
Query: 438 KAIVYSSKNGHASFPHP 454
V S + HAS+ P
Sbjct: 201 HPAVLSGRGTHASYASP 217
>gi|358385467|gb|EHK23064.1| hypothetical protein TRIVIDRAFT_209309 [Trichoderma virens Gv29-8]
Length = 451
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 44/80 (55%), Gaps = 9/80 (11%)
Query: 391 IGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE----YIEGNKAIVYSSKN 446
G HVGDWEH +R + G+ I++SQH+ G AAYD ++ + +V+S+
Sbjct: 195 FGDHVGDWEHNMVRFRD--GKPTGIFYSQHTSG---AAYDWNDKALSMKSGRPLVFSAYG 249
Query: 447 GHASFPHPGTYLQGSEILGI 466
HA++ G ++ + ++ +
Sbjct: 250 SHANYASSGNHVHDAALVDL 269
>gi|347840033|emb|CCD54605.1| hypothetical protein [Botryotinia fuckeliana]
Length = 350
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 61/141 (43%), Gaps = 23/141 (16%)
Query: 106 RDLAS---SEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGY 162
RD AS E + + + AL +P YT +W + G G ++G W P P GY
Sbjct: 6 RDYASFHTRRTEPESITTVGERSALARPTSYTQIWNDAKSG-GRHDGA--IWRPNAPSGY 62
Query: 163 KSMGFLVTKTPN-KPELDEVRCVRDDLTDKCEVHHLIFDAISKF-------SSSPFSVWS 214
S+G +V + + P + C+R DLT H+ A S F ++ S W
Sbjct: 63 ISLGDVVGGSHSVPPSTSAIWCLRVDLT------HVANYAASDFWDDKKSGAAKNVSFWQ 116
Query: 215 TRPCNRGMLGRG---VSVGTF 232
P + G+ G +S GTF
Sbjct: 117 LVPESTGLNGHEYLPISAGTF 137
>gi|366998691|ref|XP_003684082.1| hypothetical protein TPHA_0A05740 [Tetrapisispora phaffii CBS 4417]
gi|357522377|emb|CCE61648.1| hypothetical protein TPHA_0A05740 [Tetrapisispora phaffii CBS 4417]
Length = 474
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 50/191 (26%), Positives = 73/191 (38%), Gaps = 44/191 (23%)
Query: 351 YVHVKPAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
Y+ PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 190 YIKKGPAVLVVVDKGNGWVDAFWFYFYPFNWGPYVMGGG-------PWGNHVGDWEHSLV 242
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA----------------IVYSSKNG 447
R + GE ++ S HSGG AY IE K ++++++
Sbjct: 243 RF--YKGEPKYLWMSAHSGG---VAYRYSAIEKMKKLRRVNGKLTPEVIERPLIFAARGT 297
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARS-------NLYVDSSIQYELVAAEYLGEGVVAE 500
HA++ G + + + + R NLY I + A + V +
Sbjct: 298 HANYASVGQHAHDVPFFFMPLSDFTDRGLMWDPSLNLYSYIQIGTTITPANDKSKKVGTD 357
Query: 501 PSWLQFMRKWG 511
WL F KWG
Sbjct: 358 --WLYFAGKWG 366
>gi|367039287|ref|XP_003650024.1| hypothetical protein THITE_2109221 [Thielavia terrestris NRRL 8126]
gi|346997285|gb|AEO63688.1| hypothetical protein THITE_2109221 [Thielavia terrestris NRRL 8126]
Length = 543
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 47/105 (44%), Gaps = 16/105 (15%)
Query: 359 GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYF 417
G D + F +N G LK+ + G HVGDWEH +R G +++
Sbjct: 320 GADILDAFWFFFYSYNLGQTVLKI--------RFGNHVGDWEHCMVRFQR--GVPRALFM 369
Query: 418 SQHSGGK---WVAAYDLEYIEGNKA--IVYSSKNGHASFPHPGTY 457
S+H+GGK W A G A ++YS+ HA + PG +
Sbjct: 370 SEHAGGKAYLWKALEKRAQKNGKPARPVIYSAVGSHAMYASPGMH 414
>gi|380024134|ref|XP_003695861.1| PREDICTED: uncharacterized protein LOC100865359 [Apis florea]
Length = 394
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 91/251 (36%), Gaps = 73/251 (29%)
Query: 310 IDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNME--------SAKLYVHVKPAVGGT 361
ID GS + + W +D + + KHG+ E S KL+ HV
Sbjct: 133 IDDFGSTMIFTNDLAQKIW---KTDVKQNLEKHGDFEKKKKKGNRSQKLHFHV------- 182
Query: 362 FTDIVMWVFCPFNGPGTLKV---------------GIMNVAFSKIGQHVGDWEHFTLRIC 406
W+F PF+ + V GI F + G HVGDWEH +L
Sbjct: 183 ----TYWMFYPFSEGKAVCVLDLGFFGSWPIPTVGGICLGMFKEYGNHVGDWEHMSLYFK 238
Query: 407 NFTGELWSIYFSQHSGGKWVAAYDLE--------------------------YIEGNKAI 440
N T ++Y S H G + YDL +G+ I
Sbjct: 239 N-TNHPSTMYVSAHDAGAFY-RYDLRSGTFVYESQETRKGIFQKPIFPEKVFTADGSHPI 296
Query: 441 VYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAE 500
++S++ H + PG + + + + D + + + + EL+ L E
Sbjct: 297 LFSARGSHGLWTAPGKH----KFVRLPRLYDESGFGIAWPTWKKMELL----LKEENTIL 348
Query: 501 PSWLQFMRKWG 511
PSW+ F KWG
Sbjct: 349 PSWMTFRGKWG 359
>gi|242035039|ref|XP_002464914.1| hypothetical protein SORBIDRAFT_01g028690 [Sorghum bicolor]
gi|241918768|gb|EER91912.1| hypothetical protein SORBIDRAFT_01g028690 [Sorghum bicolor]
Length = 577
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 7/71 (9%)
Query: 130 PLDYTLVW--CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
P+ Y LVW C++ +Y W P+PP GY S+G + + +P LD CV +
Sbjct: 449 PVGYDLVWRNCAE-----DYRNPVSIWFPRPPGGYVSLGCVAVPSFEEPPLDCAFCVDER 503
Query: 188 LTDKCEVHHLI 198
LT+ E I
Sbjct: 504 LTEDAEYEEQI 514
>gi|449520199|ref|XP_004167121.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101212417,
partial [Cucumis sativus]
Length = 2101
Score = 43.1 bits (100), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 96/234 (41%), Gaps = 46/234 (19%)
Query: 53 ISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
++ F IW N + KK + + P +P G G ++GF + ++
Sbjct: 328 VANFQLIW-WNRGSNSKKKLSIWRPV-VPQGKIYFGDVA------IKGF-----EPPNTS 374
Query: 113 AEGAHTSN--LFKSPALQKPLDYTLVWCSDEGGQGNYEGCA--FFWLPQPPDGYKSMGFL 168
HT + L+KSP LD+ LV G N G FWLPQ P G+ S+G +
Sbjct: 375 IVLHHTGDEELYKSP-----LDFQLV-----GQIKNQRGMEDISFWLPQAPAGFVSLGCI 424
Query: 169 VTKTPNKPELDE---VRCVRDDLT--DKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGML 223
K +KP+L + + C+R D+ D+ + +K + PFS+W
Sbjct: 425 ACK--HKPKLQDFSALGCMRMDMVTWDQLMEESAWDSSDAKLITEPFSLWIV-------- 474
Query: 224 GRGVSVGTFFCSSNWISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVF 277
G+ +GTF S S + LK D + + + I A +R VF
Sbjct: 475 --GIELGTFVVQSG--SKRPQRSFNLKLADSHVTSGSDNTVIDAEVRTLSIAVF 524
>gi|303282959|ref|XP_003060771.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458242|gb|EEH55540.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 5261
Score = 42.7 bits (99), Expect = 0.48, Method: Composition-based stats.
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 130 PLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLT 189
P+ Y VW G A+ W P PP G+ +G +T + PEL+ +RCV + T
Sbjct: 4897 PVRYRQVW-----GHEWKHAAAYAWAPVPPPGFACVGMALTPSSEPPELESMRCVPEQWT 4951
>gi|336259649|ref|XP_003344625.1| hypothetical protein SMAC_06933 [Sordaria macrospora k-hell]
gi|380088702|emb|CCC13436.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 564
Score = 42.7 bits (99), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 57/133 (42%), Gaps = 15/133 (11%)
Query: 330 DLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFS 389
D+PSDG + + V + G D + F +N L ++ + F
Sbjct: 300 DIPSDGDKNKPNTSGYSNGPA-VLILVDKGAGILDAFWFFFYSYN----LGQTVLGIRF- 353
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK---WVAAYDLEYIEGN--KAIVYSS 444
G HVGDWEH R N G +++ S+H+GGK W A G + ++YS+
Sbjct: 354 --GNHVGDWEHCMTRFDN--GIPKAMFLSEHAGGKAYAWPALEKKPQPNGKPPRPVIYSA 409
Query: 445 KNGHASFPHPGTY 457
HA + PG +
Sbjct: 410 VGSHAMYATPGLH 422
>gi|224099073|ref|XP_002311365.1| predicted protein [Populus trichocarpa]
gi|222851185|gb|EEE88732.1| predicted protein [Populus trichocarpa]
Length = 4264
Score = 42.7 bits (99), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
KP+ +T V ++ G+G E FFW P P GY S+G +VT+T P L+ C R D+
Sbjct: 2315 KPVQFTKV--ANIVGKGFDE--VFFWYPIAPPGYASLGCVVTRTDEAPLLNSFCCPRLDI 2370
Query: 189 TDKCEVHHL-IFDAISKFSSSPFSVW 213
++ + + I + S +S +S+W
Sbjct: 2371 VNQANIIEVPISRSPSTKASQCWSIW 2396
>gi|359688748|ref|ZP_09258749.1| Ricin B lectin [Leptospira licerasiae serovar Varillal str.
MMD0835]
Length = 601
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 33/72 (45%), Gaps = 2/72 (2%)
Query: 384 MNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYS 443
+ F G H DWE ++ + N G++ + +SQHSG + E + G + Y
Sbjct: 305 QSTCFLSFGSHAADWEGMSVLVVN--GQMKRVAWSQHSGWYSKETGNFEVVNGTHPVAYV 362
Query: 444 SKNGHASFPHPG 455
KN H SF G
Sbjct: 363 GKNAHGSFHDDG 374
>gi|418748247|ref|ZP_13304539.1| ricin-type beta-trefoil lectin domain protein [Leptospira
licerasiae str. MMD4847]
gi|418756148|ref|ZP_13312336.1| ricin-type beta-trefoil lectin domain protein [Leptospira
licerasiae serovar Varillal str. VAR 010]
gi|384115819|gb|EIE02076.1| ricin-type beta-trefoil lectin domain protein [Leptospira
licerasiae serovar Varillal str. VAR 010]
gi|404275316|gb|EJZ42630.1| ricin-type beta-trefoil lectin domain protein [Leptospira
licerasiae str. MMD4847]
Length = 564
Score = 42.7 bits (99), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 33/72 (45%), Gaps = 2/72 (2%)
Query: 384 MNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYS 443
+ F G H DWE ++ + N G++ + +SQHSG + E + G + Y
Sbjct: 268 QSTCFLSFGSHAADWEGMSVLVVN--GQMKRVAWSQHSGWYSKETGNFEVVNGTHPVAYV 325
Query: 444 SKNGHASFPHPG 455
KN H SF G
Sbjct: 326 GKNAHGSFHDDG 337
>gi|449461281|ref|XP_004148370.1| PREDICTED: uncharacterized protein LOC101212417 [Cucumis sativus]
Length = 4291
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 69/274 (25%), Positives = 109/274 (39%), Gaps = 59/274 (21%)
Query: 13 NNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSA 72
++ SP++ T L P GQ F + ++ F IW N + KK
Sbjct: 2131 SHASPSQTETSHLEKS-PIVTSGQRFEA------------VANFQLIW-WNRGSNSKKKL 2176
Query: 73 TFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN--LFKSPALQKP 130
+ + P +P G G ++GF + ++ HT + L+KSP
Sbjct: 2177 SIWRPV-VPQGKIYFGDVA------IKGF-----EPPNTSIVLHHTGDEELYKSP----- 2219
Query: 131 LDYTLVWCSDEGGQGNYEGCA--FFWLPQPPDGYKSMGFLVTKTPNKPELDE---VRCVR 185
LD+ LV G N G FWLPQ P G+ S+G + K +KP+L + + C+R
Sbjct: 2220 LDFQLV-----GQIKNQRGMEDISFWLPQAPAGFVSLGCIACK--HKPKLQDFSALGCMR 2272
Query: 186 DDLT--DKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWISGQE 243
D+ D+ + +K + PFS+W G+ +GTF S S +
Sbjct: 2273 MDMVTWDQLMEESAWDSSDAKLITEPFSLWIV----------GIELGTFVVQSG--SKRP 2320
Query: 244 LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVF 277
LK D + + + I A +R VF
Sbjct: 2321 QRSFNLKLADSHVTSGSDNTVIDAEVRTLSIAVF 2354
>gi|396473766|ref|XP_003839413.1| similar to vacuolar sorting-associated protein [Leptosphaeria
maculans JN3]
gi|312215982|emb|CBX95934.1| similar to vacuolar sorting-associated protein [Leptosphaeria
maculans JN3]
Length = 626
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 61/126 (48%), Gaps = 19/126 (15%)
Query: 336 GRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHV 395
G+++V G A + + V P G D F +N L + NV F G H+
Sbjct: 372 GKKVV--GGRSEAPVVLIVVPK-GDGVVDAFWMFFYSYN----LGNAVFNVRF---GNHI 421
Query: 396 GDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGN----KAIVYSSKNGHASF 451
GDWEH T+R + G+ +++FS+HS G+ AY E +E + + I +S+ HA +
Sbjct: 422 GDWEHTTIRFHH--GKPKAVFFSEHSFGE---AYTWEAVEKSGKREQPIGFSASGTHAMY 476
Query: 452 PHPGTY 457
G +
Sbjct: 477 ATAGLH 482
>gi|302830045|ref|XP_002946589.1| hypothetical protein VOLCADRAFT_115992 [Volvox carteri f.
nagariensis]
gi|300268335|gb|EFJ52516.1| hypothetical protein VOLCADRAFT_115992 [Volvox carteri f.
nagariensis]
Length = 1728
Score = 42.4 bits (98), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 15/100 (15%)
Query: 369 VFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGEL----------WSIYFS 418
F FNG + NV + G HVGDWEH T+R+ + EL ++++
Sbjct: 1285 TFYAFNGHYHIPA---NVPLFRTGHHVGDWEHLTVRLDATSLELQVEPGAGPCTQGVWYN 1341
Query: 419 QHSG--GKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGT 456
H G+WV + + + + + NGH +P GT
Sbjct: 1342 AHRNIEGEWVPGVAVPRTPCGRILGFVAINGHGIYPKCGT 1381
>gi|85084837|ref|XP_957383.1| hypothetical protein NCU07158 [Neurospora crassa OR74A]
gi|28918474|gb|EAA28147.1| conserved hypothetical protein [Neurospora crassa OR74A]
gi|28950255|emb|CAD71121.1| conserved hypothetical protein [Neurospora crassa]
Length = 568
Score = 42.4 bits (98), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK---WVAAYDLEYIEGN--KAIVYSS 444
+ G HVGDWEH +R N G +++ S+H+GGK W A G + ++YS+
Sbjct: 356 RFGNHVGDWEHCMVRFDN--GIPKAMFLSEHAGGKAYAWPALEKKAQPNGKPPRPVIYSA 413
Query: 445 KNGHASFPHPGTY 457
HA + PG +
Sbjct: 414 VGSHAMYATPGLH 426
>gi|255552944|ref|XP_002517515.1| vacuolar protein sorting-associated protein, putative [Ricinus
communis]
gi|223543526|gb|EEF45057.1| vacuolar protein sorting-associated protein, putative [Ricinus
communis]
Length = 4226
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 31/165 (18%)
Query: 120 NLFKSPALQKPLDYTLV-WCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTK-TPNKPE 177
+LFK+P LDY LV + G + FW+PQ P G+ S+G + K +P +
Sbjct: 2209 DLFKAP-----LDYQLVGQIKKQRGMDSIS----FWMPQAPPGFVSLGCVACKGSPKLYD 2259
Query: 178 LDEVRCVRDDLT--DKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCS 235
++RC+R D+ D+ + + +K + FS+W+ G +GTF
Sbjct: 2260 FSKLRCMRSDMVAGDQFLEESVWDTSEAKSTREQFSIWTA----------GNELGTFIVR 2309
Query: 236 SNWISGQE---LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVF 277
S + LN+A DP L + + I A I + +F
Sbjct: 2310 SGFKRPPRRFALNLA-----DPSLPSGSDDTVIDAEIGTFSTAIF 2349
Score = 38.9 bits (89), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 7/66 (10%)
Query: 130 PLDYTLVW--CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
P+ Y LVW CSD+ Y+ W P+ P+G+ S G + +PE VRCV +
Sbjct: 4106 PMGYDLVWRNCSDD-----YKAPVSIWHPRAPEGFVSPGCVAVAGFEEPEPSLVRCVAES 4160
Query: 188 LTDKCE 193
++ E
Sbjct: 4161 QVEQTE 4166
>gi|308803418|ref|XP_003079022.1| unnamed protein product [Ostreococcus tauri]
gi|116057476|emb|CAL51903.1| unnamed protein product [Ostreococcus tauri]
Length = 2582
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 31/152 (20%), Positives = 61/152 (40%), Gaps = 18/152 (11%)
Query: 43 INLGEIEVCRISRFNFIWSCNLLQSKKKSATFYEPAG----------IPDGFYSLGHYCQ 92
+N ++ V F+ +W+ + ++ K + + G + G Y+ H+ Q
Sbjct: 2324 LNGFQVTVVHQKSFDKVWTSDGTATRTKGSVWAASLGKSTLKKKVERVSLGHYASPHFSQ 2383
Query: 93 FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAF 152
P FV+ D+ + G+ P+ + +W GQ +
Sbjct: 2384 PKPAP---FVVEITDINAYALTGSAYMPRVLERMFAYPVRFRQIW-----GQEWKATSLY 2435
Query: 153 FWLPQPPDGYKSMGFLVTKTPNKPELDEVRCV 184
W P PP G+ ++G +T+ P P+L +RC+
Sbjct: 2436 AWSPVPPPGFVAVGMALTRKPQPPDLSSIRCL 2467
>gi|323348420|gb|EGA82665.1| Vps62p [Saccharomyces cerevisiae Lalvin QA23]
Length = 342
Score = 42.4 bits (98), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 65/158 (41%), Gaps = 38/158 (24%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTL 403
H+K PAV G + D + F PFN GP + G G HVGDWEH +
Sbjct: 182 HIKKAPAVLFVVDKGNGWVDAFWFYFYPFNWGPYIMGSG-------PWGNHVGDWEHSLV 234
Query: 404 RICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----------------NKAIVYSSKNG 447
R + GE ++ S H GG +AY E IE K +++S++
Sbjct: 235 RF--YKGEPQYLWMSAHGGG---SAYKFEAIEKIKRLRRVDGKLTNEVIKKPLIFSARGT 289
Query: 448 HASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQY 485
HA + G + + + + R L+ D S+ Y
Sbjct: 290 HAHYASVGQHAHDVPFFFMPLSDFTDRGPLW-DPSLNY 326
>gi|413955174|gb|AFW87823.1| hypothetical protein ZEAMMB73_766796 [Zea mays]
Length = 579
Score = 42.4 bits (98), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 7/63 (11%)
Query: 130 PLDYTLVW--CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
PL Y LVW C++ +Y+ WLP+PP GY ++G + + +P LD CV +
Sbjct: 457 PLGYDLVWRNCAE-----DYKSPVSIWLPRPPGGYVALGCVAVPSFKEPTLDCAFCVDER 511
Query: 188 LTD 190
L +
Sbjct: 512 LAE 514
>gi|168050497|ref|XP_001777695.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670915|gb|EDQ57475.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 4890
Score = 42.0 bits (97), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 16/81 (19%)
Query: 144 QGNYEGCAFFWLPQPPDGYKSMGFLVTKT--PNKPELDEVRCVRDDLTDKCEVHHLIFDA 201
Q +++G +FW+P PP GY +G + K P++ + +RCVR+DL
Sbjct: 2764 QKHFDGV-YFWIPVPPPGYSVIGCIAGKNSRPDEDVMQSIRCVRNDL------------- 2809
Query: 202 ISKFSSSPFSVWSTRPCNRGM 222
+S + + S+W+TR G
Sbjct: 2810 VSSANFAESSLWTTRSLKPGQ 2830
>gi|12597877|gb|AAG60185.1|AC084763_5 unknown protein [Oryza sativa Japonica Group]
Length = 660
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 12/101 (11%)
Query: 96 RPL--RGFVLVARDLASSEAEGAHTSNLFKSPA--LQKPLDYTLVW--CSDEGGQGNYEG 149
RPL G+V + D+A H + ++K+ PL Y LVW C ++ Y
Sbjct: 505 RPLCPSGYVSIG-DIAHVGIHPPHVAAVYKNVGGNFALPLGYDLVWRNCGED-----YRN 558
Query: 150 CAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
W P+PP+GY ++G + +P LD CV + +
Sbjct: 559 PVSIWFPRPPEGYVALGCVAVSAFEEPPLDSAFCVSERFAE 599
>gi|350294926|gb|EGZ76011.1| hypothetical protein NEUTE2DRAFT_84996 [Neurospora tetrasperma FGSC
2509]
Length = 526
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK---WVAAYDLEYIEGN--KAIVYSS 444
+ G HVGDWEH +R N G +++ S+H+GGK W A G + ++YS+
Sbjct: 314 RFGNHVGDWEHCMVRFDN--GIPKAMFLSEHAGGKAYAWPALEKKAQPNGKPPRPVIYSA 371
Query: 445 KNGHASFPHPGTY 457
HA + PG +
Sbjct: 372 VGSHAMYATPGLH 384
>gi|452963237|gb|EME68315.1| ATP-dependent Zn protease [Magnetospirillum sp. SO-1]
Length = 674
Score = 42.0 bits (97), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 42/79 (53%), Gaps = 8/79 (10%)
Query: 295 NGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPS-DGGRQIVKH---GNMESAKL 350
G ++ A +L G IDP+ + GR D I LP DG RQI++H G++ A L
Sbjct: 378 EGVVVIGATNLPGR-IDPA---ITRSGRLDRHIVISLPDEDGLRQIMRHHLAGDLPGADL 433
Query: 351 YVHVKPAVGGTFTDIVMWV 369
+ K A+GG+ D+ WV
Sbjct: 434 SIAAKLALGGSGADVARWV 452
>gi|326481822|gb|EGE05832.1| hypothetical protein TEQG_04842 [Trichophyton equinum CBS 127.97]
Length = 384
Score = 42.0 bits (97), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 88/231 (38%), Gaps = 28/231 (12%)
Query: 239 ISGQELNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGAL 298
IS + IA + L P A + + Y P ++ H + Y PS + N
Sbjct: 26 ISAASVYIAAEQCLAPAPSAT-----VPDYVMEYAPLIWIHSQDPYQPSDIGAQLKNSIP 80
Query: 299 LYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIV----------KHGNMESA 348
+ + D + +NL + G + L S G Q + K G E A
Sbjct: 81 MINHVAIPNVPADITLNNLDQLNKLGGNK-VCLTSKEGIQALPAWFRGVKPNKDGKTEGA 139
Query: 349 KLYVHVKPAVG--GTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRIC 406
V G G D + F +N T ++ + F G HVGDW H +R
Sbjct: 140 VSSAIVVREHGDEGKTVDAFYFYFYAYNQGNT----VLGIEF---GDHVGDWYHNMIRFR 192
Query: 407 NFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTY 457
+ G+ +I++SQH+ G+ Y G + I +S HA + GT+
Sbjct: 193 D--GKPEAIWYSQHAAGQ-AFQYSATDKRGVRPIGFSGNGTHAVYATAGTH 240
>gi|336471880|gb|EGO60040.1| hypothetical protein NEUTE1DRAFT_74789 [Neurospora tetrasperma FGSC
2508]
Length = 525
Score = 42.0 bits (97), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK---WVAAYDLEYIEGN--KAIVYSS 444
+ G HVGDWEH +R N G +++ S+H+GGK W A G + ++YS+
Sbjct: 313 RFGNHVGDWEHCMVRFDN--GIPKAMFLSEHAGGKAYAWPALEKKAQPNGKPPRPVIYSA 370
Query: 445 KNGHASFPHPGTY 457
HA + PG +
Sbjct: 371 VGSHAMYATPGLH 383
>gi|403175606|ref|XP_003334382.2| hypothetical protein PGTG_16251 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171677|gb|EFP89963.2| hypothetical protein PGTG_16251 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 411
Score = 42.0 bits (97), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 11/101 (10%)
Query: 358 VGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYF 417
V GT D + F FN L + N+ F G H+ DWEH +R N T + + +
Sbjct: 194 VPGTL-DAFWFFFYSFN----LGPTVANIHF---GNHIADWEHCMIRFENGTPK--AAHL 243
Query: 418 SQHSGGKWVAAYDLEYIEGNK-AIVYSSKNGHASFPHPGTY 457
S H+ G LE +G K +++S+ HA + PGT+
Sbjct: 244 SAHADGSAYTYDCLEKFQGGKRPVIFSALGSHAMYTKPGTH 284
>gi|413955172|gb|AFW87821.1| hypothetical protein ZEAMMB73_766796 [Zea mays]
Length = 933
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 7/63 (11%)
Query: 130 PLDYTLVW--CSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
PL Y LVW C++ +Y+ WLP+PP GY ++G + + +P LD CV +
Sbjct: 811 PLGYDLVWRNCAE-----DYKSPVSIWLPRPPGGYVALGCVAVPSFKEPTLDCAFCVDER 865
Query: 188 LTD 190
L +
Sbjct: 866 LAE 868
>gi|115483498|ref|NP_001065419.1| Os10g0565300 [Oryza sativa Japonica Group]
gi|78709021|gb|ABB47996.1| C2 domain-containing protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113639951|dbj|BAF27256.1| Os10g0565300 [Oryza sativa Japonica Group]
Length = 776
Score = 42.0 bits (97), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 12/101 (11%)
Query: 96 RPL--RGFVLVARDLASSEAEGAHTSNLFKSPA--LQKPLDYTLVW--CSDEGGQGNYEG 149
RPL G+V + D+A H + ++K+ PL Y LVW C ++ Y
Sbjct: 621 RPLCPSGYVSIG-DIAHVGIHPPHVAAVYKNVGGNFALPLGYDLVWRNCGED-----YRN 674
Query: 150 CAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
W P+PP+GY ++G + +P LD CV + +
Sbjct: 675 PVSIWFPRPPEGYVALGCVAVSAFEEPPLDSAFCVSERFAE 715
>gi|448103083|ref|XP_004199941.1| Piso0_002497 [Millerozyma farinosa CBS 7064]
gi|359381363|emb|CCE81822.1| Piso0_002497 [Millerozyma farinosa CBS 7064]
Length = 438
Score = 42.0 bits (97), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 81/197 (41%), Gaps = 28/197 (14%)
Query: 271 NYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDP-SGSNLPSGGR------- 322
+Y P V + +E Y+P ++ F N Y V + D ++LP R
Sbjct: 80 DYAPLVHLYSEERYMPYDIAEFVKNFHATYPNRTKVKDTPDELKITDLPKLPRSPDIYLT 139
Query: 323 NDGEF-----WIDLPSDGGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFN-GP 376
++ +F WI + I + A L V K G + D + F FN GP
Sbjct: 140 SNSDFDKDPEWITGVHNVPSLIDGKIDKAPATLIVVDK---GNGWVDAFWFYFYSFNLGP 196
Query: 377 GTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE--YI 434
+ G G HVGDWEH +R + GE ++ S H GG +LE +
Sbjct: 197 FVMGSG-------PYGNHVGDWEHSLVRF--YKGEPKIVWISAHGGGGAYFYRNLEKYKL 247
Query: 435 EGNKAIVYSSKNGHASF 451
+ I++S++ HA++
Sbjct: 248 DRRHPIIFSARGTHANY 264
>gi|408390928|gb|EKJ70313.1| hypothetical protein FPSE_09530 [Fusarium pseudograminearum CS3096]
Length = 353
Score = 42.0 bits (97), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 9/76 (11%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE----YIEGNKAIVYSSKNG 447
G HVGDWEH +R + G+ IY+SQHS G AAY+ + ++ +V+S+
Sbjct: 113 GDHVGDWEHNLVRFRD--GKPTGIYYSQHSSG---AAYNWNEEGLSLRNDRPLVFSAWGS 167
Query: 448 HASFPHPGTYLQGSEI 463
HA++ G ++ +
Sbjct: 168 HANYASSGDHVHDKAL 183
>gi|406868153|gb|EKD21190.1| vacuolar protein sorting-associated protein 62 [Marssonina brunnea
f. sp. 'multigermtubi' MB_m1]
Length = 410
Score = 42.0 bits (97), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 47/105 (44%), Gaps = 12/105 (11%)
Query: 348 AKLYVHVKPAVGGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRIC 406
AK V + G D F FN GP L ++G H+GDWEH +R
Sbjct: 200 AKSTVIIVADKGNGLVDAFYMYFYTFNDGPSALG--------HQVGNHLGDWEHNMVRFQ 251
Query: 407 NFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASF 451
N G ++++SQH G + Y G + + +S+K HA++
Sbjct: 252 N--GTPTAVWYSQHEFG-FAFTYSAVRKIGKRPVSFSAKGSHANY 293
>gi|359477629|ref|XP_002274451.2| PREDICTED: uncharacterized protein LOC100258011 [Vitis vinifera]
Length = 4275
Score = 41.6 bits (96), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 151 AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEV 194
FFW P P GY S+G +V+KT P++D C R DL + +
Sbjct: 2352 VFFWYPIAPPGYASLGCIVSKTYEAPQMDSFCCPRMDLVNPANI 2395
>gi|297736774|emb|CBI25975.3| unnamed protein product [Vitis vinifera]
Length = 4328
Score = 41.6 bits (96), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 151 AFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCEV 194
FFW P P GY S+G +V+KT P++D C R DL + +
Sbjct: 2403 VFFWYPIAPPGYASLGCIVSKTYEAPQMDSFCCPRMDLVNPANI 2446
>gi|242035035|ref|XP_002464912.1| hypothetical protein SORBIDRAFT_01g028683 [Sorghum bicolor]
gi|241918766|gb|EER91910.1| hypothetical protein SORBIDRAFT_01g028683 [Sorghum bicolor]
Length = 2096
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 41/73 (56%), Gaps = 3/73 (4%)
Query: 144 QGNYEGCAFFWLPQPPDGYKSMGFLVTK-TPNKPELDEVRCVRDDLTDKCEVHHL-IFDA 201
Q EG +F W P+ P G+ ++G + +K +P K E +RC+R D+ + ++D+
Sbjct: 1450 QRGTEGVSF-WFPKAPSGFVALGCVASKSSPTKEEFSLLRCIRSDMVAGGQFSEEGVWDS 1508
Query: 202 ISKFSSSPFSVWS 214
+ +S FS+W+
Sbjct: 1509 SNARTSESFSLWT 1521
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 39/96 (40%), Gaps = 16/96 (16%)
Query: 116 AHTSNLFKSPALQKPLDYTLVWCS-------------DEGGQGNYEGCAFFWLPQPPDGY 162
A +N+ K +++PL Y LVW S E + WLP P GY
Sbjct: 1206 ALNTNIVK---VKRPLSYKLVWQSGSPKTNVFNQNEDSENKISIADQLCSVWLPVAPVGY 1262
Query: 163 KSMGFLVTKTPNKPELDEVRCVRDDLTDKCEVHHLI 198
+MG +V+ +P L V C+ L C + I
Sbjct: 1263 VAMGCVVSSGTAEPPLSSVFCLTASLVSSCNLRDCI 1298
>gi|403163303|ref|XP_003323397.2| hypothetical protein PGTG_04934 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163999|gb|EFP78978.2| hypothetical protein PGTG_04934 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 411
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 70/306 (22%), Positives = 117/306 (38%), Gaps = 55/306 (17%)
Query: 254 PKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSW----------FFTNGALLYKAG 303
P+L + + ++ P V+ H EV+ PS ++ F T + L +
Sbjct: 39 PQLPYFTSKQTLPQYALDFAPLVYLHSQEVFWPSDLNRHLDNVQPQINFTTVKSYLRETQ 98
Query: 304 ---DLVG----EAIDPSGSNLPSGGRNDGEFWIDLPS--DGGRQIVKHGNMESAKLYVHV 354
D +G + I PS + + F +D S G K G + + + V
Sbjct: 99 TTQDFLGSSNLQRISPSQQEITYLSSKNPIFDLDYASWLRGAGPPDKRGRSSACAIIIAV 158
Query: 355 KPA--VGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGEL 412
VG + D+ F +N G G ++ G HVGDWE+ +R N G+
Sbjct: 159 DKTQQVGPGWVDVFYLFFYSYNR-GNYFNG------NRFGDHVGDWENTMIRFKN--GKP 209
Query: 413 WSIYFSQHSG----GKWVAAYD-LEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIG 467
+ H G G Y+ LE++ G + +VY++ H +P PG Q L +
Sbjct: 210 MYVAPEAHGGRVMLGDSAFNYNVLEHLNG-RPVVYAANGTHGMYPQPGQ--QNYTGLPVA 266
Query: 468 VRNDAARSNLYVDSSIQYE-----------LVAAEYLGEGVVAEPSWLQFMRKWGPTIVY 516
+ + + +L+ D ++ Y+ A L WL+F+ WG
Sbjct: 267 IYDVTDKGHLW-DPTLNYKSFFFSNEHGFSYTADTPLKLQSRQRLGWLRFLGYWG----- 320
Query: 517 DSKTEL 522
D K EL
Sbjct: 321 DQKPEL 326
>gi|396924949|gb|AFN89138.1| vacuolar protein sorting 13 [Mesembryanthemum crystallinum]
Length = 3718
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 62/185 (33%), Gaps = 43/185 (23%)
Query: 74 FYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDY 133
F+ P P GF LG Y L D ++A A NL K ++KP +
Sbjct: 2007 FWRPRA-PPGFAVLGDY------------LTPMDKPPTKAVLAVNMNLVK---IKKPESF 2050
Query: 134 TLVW-------CSDEGGQGNY------EGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180
LVW SD + W P P GY ++G +V+ P L
Sbjct: 2051 KLVWPLIASTDVSDSETTSRMPDIVQRDASCSIWFPVAPKGYIALGCVVSSGTAPPALSS 2110
Query: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTF----FCSS 236
C+ L C V + S S+ + W C +GTF S
Sbjct: 2111 SFCILASLVSSCPVRDCVMIGASNEHSAAMAFWRVDNC----------IGTFLPTDLTSK 2160
Query: 237 NWISG 241
N I G
Sbjct: 2161 NLIRG 2165
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 57/140 (40%), Gaps = 25/140 (17%)
Query: 53 ISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSE 112
++ F+ +W N + +K + + P +P G G P V+V D+
Sbjct: 2214 VASFHLVW-WNQSSTSRKKLSIWRPI-VPQGMVYFGDIAVKGYEPPNTCVVV-EDIGD-- 2268
Query: 113 AEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGC--AFFWLPQPPDGYKSMGFLVT 170
LFK P D+ +V G + G FWLPQ P GY +G +
Sbjct: 2269 -------ELFKEPT-----DFQMV-----GKIKKHRGMEPVSFWLPQAPPGYVPLGCIAC 2311
Query: 171 K-TPNKPELDEVRCVRDDLT 189
K +P + E +RC+R D+
Sbjct: 2312 KGSPKENEFRSLRCIRSDMV 2331
>gi|46127803|ref|XP_388455.1| hypothetical protein FG08279.1 [Gibberella zeae PH-1]
Length = 350
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 9/76 (11%)
Query: 392 GQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLE----YIEGNKAIVYSSKNG 447
G HVGDWEH +R + G+ IY+SQHS G AAY+ + ++ +V+S+
Sbjct: 170 GDHVGDWEHNFVRFRD--GKPTGIYYSQHSSG---AAYNWNEEGLSLRNDRPLVFSAWGS 224
Query: 448 HASFPHPGTYLQGSEI 463
HA++ G ++ +
Sbjct: 225 HANYASSGDHVHDKAL 240
>gi|156055926|ref|XP_001593887.1| hypothetical protein SS1G_05315 [Sclerotinia sclerotiorum 1980]
gi|154703099|gb|EDO02838.1| hypothetical protein SS1G_05315 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 537
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 13/103 (12%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F +N L +M + F G HVGDWEH +R + G +++FS
Sbjct: 307 GHGIVDAFWFYFYSYN----LGTTVMKMRF---GNHVGDWEHSLIRFHH--GVPKAVFFS 357
Query: 419 QHSGGKWVAAYDLEYIEG----NKAIVYSSKNGHASFPHPGTY 457
H GG +E +G + ++YS+ HA + PGT+
Sbjct: 358 AHFGGLGYHYKAVEKGQGPGREGRPVIYSAVGSHAMYATPGTH 400
>gi|403214018|emb|CCK68519.1| hypothetical protein KNAG_0B00710 [Kazachstania naganishii CBS
8797]
Length = 456
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 71/178 (39%), Gaps = 40/178 (22%)
Query: 359 GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYF 417
G + D + F PFN GP + G G HVGDWEH +R + G+ ++
Sbjct: 186 GNGWVDAYWFYFYPFNWGPYIMGGG-------PWGNHVGDWEHSLVRF--YMGQPKYLWM 236
Query: 418 SQHSGGKWVAAYDLEYIEGNKA----------------IVYSSKNGHASFPHPGTYLQGS 461
S H GG +AY E IE K +++S++ HA++ G +
Sbjct: 237 SAHGGG---SAYKFEAIEKVKKLRRVDGKVKNEIVMRPLIFSARGTHANYASVGQHSHDV 293
Query: 462 EILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA--------EPSWLQFMRKWG 511
+ + + R ++ D S+ Y + GE V + WL F +WG
Sbjct: 294 PFFFMPLSDFTDRGPMW-DPSLNY--YGYMFDGESVSSIGDREADLGSQWLHFEGRWG 348
>gi|303270889|ref|XP_003054806.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462780|gb|EEH60058.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 5009
Score = 41.2 bits (95), Expect = 1.4, Method: Composition-based stats.
Identities = 27/89 (30%), Positives = 43/89 (48%), Gaps = 15/89 (16%)
Query: 131 LDYTLVWCSDEG---GQGNY-EGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRD 186
+ Y LVW D G G+ + +G FW P P DGY ++G + + + P D V C+R
Sbjct: 2619 IGYHLVW-KDSGFRAGRSRHKQGTISFWRPVPQDGYVAIGDVASPSHASPPKDAVVCLRA 2677
Query: 187 DLTDKCEVHHLIFDAISKFSSSPFSVWST 215
DL A+ +F+ S + W++
Sbjct: 2678 DLA----------RALPQFAFSEGAAWTS 2696
>gi|321259401|ref|XP_003194421.1| protein-vacuolar targeting-related protein [Cryptococcus gattii
WM276]
gi|317460892|gb|ADV22634.1| protein-vacuolar targeting-related protein, putative [Cryptococcus
gattii WM276]
Length = 492
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 13/91 (14%)
Query: 363 TDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSG 422
D+ W F PFN G F +G HV DWEH LR+ G S+ F+ H G
Sbjct: 184 VDLWYWTFYPFN------FGKPAARFGILGNHVADWEH--LRVRTVDGFPVSVDFNTHEG 235
Query: 423 GKWVAAY----DLEYIEGNKAIVYSSKNGHA 449
G+ A D+E ++ ++ + Y + H
Sbjct: 236 GRLSAGTVRWEDVEKLQ-DRPVAYVAMGSHG 265
>gi|425766198|gb|EKV04822.1| hypothetical protein PDIG_86230 [Penicillium digitatum PHI26]
gi|425779120|gb|EKV17209.1| hypothetical protein PDIP_32250 [Penicillium digitatum Pd1]
Length = 532
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 89/231 (38%), Gaps = 62/231 (26%)
Query: 359 GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFS 418
G D + F FN L ++NV F G HVGDWEH +R N G+ +++FS
Sbjct: 299 GNGIVDAFWFYFYSFN----LGNTVVNVRF---GNHVGDWEHCLVRFHN--GKPKALFFS 349
Query: 419 QHSGGKWVAAYDLEYIE--GNKA-------------------IVYSSKNGHASFPHPGTY 457
H GG+ AY E +E G + ++YS++ HA + G +
Sbjct: 350 AHQGGE---AYSYEAVEKIGQRVRTQIKKVARIISLILPWQPVIYSAEGSHAMYATAGVH 406
Query: 458 LQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVA---EPS----WLQFMRKW 510
+L G+ +D D + + ++ E + A PS W + W
Sbjct: 407 ---EYLLPWGLLHDVTDRGPLWDPLLNSQAYTYDFDTENLRASTFSPSAPTEWFHYRGHW 463
Query: 511 GPTIVYDSKTELDKIIKLLPLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
G DK +L ++ + + + GPTGPK K+
Sbjct: 464 G-----------DKFYQL--------GDSRQYRFAGQYHFVNGPTGPKFKH 495
>gi|238488445|ref|XP_002375460.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220697848|gb|EED54188.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 99
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 125 PALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCV 184
P ++ P YT +W +D+G G FW P GY +MG +V P +V C+
Sbjct: 28 PPVKAPTSYTQLW-ADKGSGAKLNGS--FWRPIAASGYIAMGDVVQSGYTTPSTSKVWCL 84
Query: 185 RDDL 188
R DL
Sbjct: 85 RSDL 88
>gi|145345747|ref|XP_001417362.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577589|gb|ABO95655.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 4591
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 50/126 (39%), Gaps = 12/126 (9%)
Query: 59 IWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHT 118
+W NL +S K T I G Y+ H+ Q P F + D+ + G++
Sbjct: 4361 VWQANLGKSTLKKKT----ERISLGHYASPHFSQPKPAP---FAVEVTDINAFALTGSNY 4413
Query: 119 SNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPEL 178
P + VW + G Y W P PP G ++G ++T P P++
Sbjct: 4414 MPRVLDRLFPHPSRFRQVWGQEWKGTAVYA-----WTPVPPPGCVAIGMVLTSIPQAPDV 4468
Query: 179 DEVRCV 184
+RC+
Sbjct: 4469 STMRCI 4474
>gi|45198842|ref|NP_985871.1| AFR324Wp [Ashbya gossypii ATCC 10895]
gi|44984871|gb|AAS53695.1| AFR324Wp [Ashbya gossypii ATCC 10895]
Length = 446
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 73/332 (21%), Positives = 121/332 (36%), Gaps = 68/332 (20%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGR 322
+I + ++ P V + +E+Y+PS V F + + K G+ V +D + S G
Sbjct: 72 EIPQYVLDHCPLVHLYSEEIYMPSDVKEFVQHVRVEDKRGNEVRRGPLDIATSFAGLAGE 131
Query: 323 NDGEFWIDLPSD---------GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPF 373
+ + ++ D G G + + V + G + D + F PF
Sbjct: 132 DTSDLYLTALEDFSQNPNWLLGYAPDYGTGRISNGP-SVLIVVDKGNGWVDAFWFYFYPF 190
Query: 374 N-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK-------- 424
N GP + G G HVGDWEH +R N G+ ++ S H G
Sbjct: 191 NLGPFIMGAG-------PWGNHVGDWEHSLVRFLN--GKPQYLWMSAHGSGSGYVYDAVE 241
Query: 425 ----W-VAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYV 479
W V +L+ + +++SS+ HA++ G + + + R L+
Sbjct: 242 KKDHWRVVDGELQRTVLKRPLIFSSRGTHANYASIGQHAHDVPFFFSPLSDFTDRGPLW- 300
Query: 480 DSSIQYELVAAEYLGEGVVAEP----------SWLQFMRKWGPTIVYDSKTELDKIIKLL 529
D S+ + Y +G P WL + +WG + S DK
Sbjct: 301 DPSMNF----YAYTFDGTTVTPYSKRERELGTDWLYYNGRWGDKQLIWS----DKRQHWC 352
Query: 530 PLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
L RY +GP GP +KN
Sbjct: 353 LLQWRYI---------------DGPYGPLKKN 369
>gi|374109102|gb|AEY98008.1| FAFR324Wp [Ashbya gossypii FDAG1]
Length = 446
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 73/332 (21%), Positives = 121/332 (36%), Gaps = 68/332 (20%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEA-IDPSGSNLPSGGR 322
+I + ++ P V + +E+Y+PS V F + + K G+ V +D + S G
Sbjct: 72 EIPQYVLDHCPLVHLYSEEIYMPSDVKEFVQHVRVEDKRGNEVRRGPLDIATSFAGLAGE 131
Query: 323 NDGEFWIDLPSD---------GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPF 373
+ + ++ D G G + + V + G + D + F PF
Sbjct: 132 DTSDLYLTALEDFSQNPNWLLGYAPDYGTGRISNGP-SVLIVVDKGNGWVDAFWFYFYPF 190
Query: 374 N-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGK-------- 424
N GP + G G HVGDWEH +R N G+ ++ S H G
Sbjct: 191 NLGPFIMGAG-------PWGNHVGDWEHSLVRFLN--GKPQYLWMSAHGSGSGYVYDAVE 241
Query: 425 ----W-VAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYV 479
W V +L+ + +++SS+ HA++ G + + + R L+
Sbjct: 242 KKDHWRVVDGELQRTVLKRPLIFSSRGTHANYASIGQHAHDVPFFFSPLSDFTDRGPLW- 300
Query: 480 DSSIQYELVAAEYLGEGVVAEP----------SWLQFMRKWGPTIVYDSKTELDKIIKLL 529
D S+ + Y +G P WL + +WG + S DK
Sbjct: 301 DPSMNF----YAYTFDGTTVTPYSKRERELGTDWLYYNGRWGDKQLIWS----DKRQHWC 352
Query: 530 PLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
L RY +GP GP +KN
Sbjct: 353 LLQWRYI---------------DGPYGPLKKN 369
>gi|366994872|ref|XP_003677200.1| hypothetical protein NCAS_0F03630 [Naumovozyma castellii CBS 4309]
gi|342303068|emb|CCC70847.1| hypothetical protein NCAS_0F03630 [Naumovozyma castellii CBS 4309]
Length = 428
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 66/305 (21%), Positives = 115/305 (37%), Gaps = 43/305 (14%)
Query: 241 GQELNIACLKNLDPK-----LHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTN 295
G N L LDP+ ++PN I I + P V + +E Y P + F +
Sbjct: 25 GNSHNKRSLPLLDPEKVTNETKSLPNRSTIPNYILDNCPMVHLNSEEKYYPIDIVEFIKH 84
Query: 296 GALLYKAGDLVGEAIDPSG------SNLPSGGRN---DGEFWIDLPSDGGRQIV-KHGNM 345
L + + E + + P+G + E ++ D + GN
Sbjct: 85 FKLTDAHNNTIRENLTIQDLKPIYVTRFPNGTKQYIPSSETFLTCLDDFTKDPSWIFGNK 144
Query: 346 ESAKL-YVHVKPAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGD 397
S Y+ P++ G + D + F FN GP + G G H+GD
Sbjct: 145 PSFDTGYIKNSPSLLIVVDKGNGWVDAFWFYFYAFNQGPFVMSSG-------PWGSHLGD 197
Query: 398 WEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG----NKAIVYSSKNGHASFPH 453
WEH +R N G + S HSGG LE ++ + +++S+ HA++
Sbjct: 198 WEHTLVRFHN--GVPKYVRLSAHSGGDAYRFTSLEKLDTMGGVERRVIFSAYGTHANYAS 255
Query: 454 PGTYLQG--SEILGIGVRNDAARSNLY--VDSSIQYELVAAEYLGEGVVAEP---SWLQF 506
G + SE+ + + + R ++ + + Y + +G + WL +
Sbjct: 256 SGQFSHDLPSELFFMPLSDFTDRGRIWDPLLNFYAYRFDGVSFWPQGEREKELGVEWLYW 315
Query: 507 MRKWG 511
M +WG
Sbjct: 316 MGRWG 320
>gi|224138796|ref|XP_002326692.1| predicted protein [Populus trichocarpa]
gi|222834014|gb|EEE72491.1| predicted protein [Populus trichocarpa]
Length = 2827
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 64/157 (40%), Gaps = 42/157 (26%)
Query: 74 FYEPAGIPDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLD 132
F+ P P G+ LG Y D P +G V V + A +++P+
Sbjct: 1484 FWRPCA-PPGYAILGDYLTPLDKPPTKGVVAVNTNFAR----------------VKRPIS 1526
Query: 133 YTLVW---CSDE-GGQ----------------GNYEGCAFFWLPQPPDGYKSMGFLVTKT 172
+ L+W S+E GQ GNY C+ W P+ P GY ++G +V+
Sbjct: 1527 FKLIWPPLASEEISGQDVANSSFLLDSFLTKEGNY--CSI-WFPEAPKGYVALGCVVSPG 1583
Query: 173 PNKPELDEVRCVRDDLTDKCEVHHLI-FDAISKFSSS 208
+P L C+ L C + I ++++ + S+
Sbjct: 1584 RTQPPLSAAFCISASLVSSCSLRDCITINSVNSYQST 1620
>gi|317136687|ref|XP_001727218.2| hypothetical protein AOR_1_272194 [Aspergillus oryzae RIB40]
Length = 125
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 48/108 (44%), Gaps = 16/108 (14%)
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
P G ++G++ + + + R +L+ SS +P ++ P YT +W +D
Sbjct: 23 PLGSMAVGNFKELNGQ--RAALLIGAKSTSSS-----------NPPVKAPTSYTQLW-AD 68
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
+G G FW P GY +MG +V P +V C+R DL
Sbjct: 69 KGSGAKLNGS--FWRPIAAPGYIAMGDVVQSGYTTPSTSKVWCLRSDL 114
>gi|365757802|gb|EHM99677.1| YPR157W-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 319
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 86/224 (38%), Gaps = 48/224 (21%)
Query: 244 LNIACLKNLDPKLHAMPNCDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAG 303
LN ++N LH +I + + N+ P V + +E Y PS ++ + N + K G
Sbjct: 47 LNPVDVENDQRTLHV---PGKIPSYVINHCPLVHLYSEEKYWPSDIAEYVQNFQIKDKRG 103
Query: 304 DLVGEAIDPSGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKL------------- 350
+ + P+ NL + E+ DL +G ++ G + L
Sbjct: 104 NCM-----PTHENL-TLHDLKAEYSADLFGNGSETLIPSGEVFLTSLDDFDKDPTWLLGH 157
Query: 351 -------YVHVKPAV------GGTFTDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVG 396
Y PAV G + D + F PFN GP + G G HVG
Sbjct: 158 PPEYGTGYNSRAPAVLVVVDKGNGWVDAFWFFFYPFNHGPFIMGQG-------PWGNHVG 210
Query: 397 DWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAI 440
DWEH +R N G ++ S HS G Y E +E K +
Sbjct: 211 DWEHSLVRFYN--GIPKYLWMSAHSSG---TGYRYEAVEKFKKL 249
>gi|302505120|ref|XP_003014781.1| conserved hypothetical protein [Arthroderma benhamiae CBS 112371]
gi|291178087|gb|EFE33878.1| conserved hypothetical protein [Arthroderma benhamiae CBS 112371]
Length = 358
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 77/199 (38%), Gaps = 23/199 (11%)
Query: 273 GPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLVGEAIDPSGSNLPSGGRNDGEFWIDLP 332
P ++ H + Y PS + N + + D + +NL + G + L
Sbjct: 15 APLIWIHSQDPYQPSDIGAQLKNSIPMIDHVAIPNVPADITLNNLDQLNKLGGNK-VCLT 73
Query: 333 SDGGRQIV----------KHGNMESAKLYVHVKPAVG--GTFTDIVMWVFCPFNGPGTLK 380
S G Q + K G E A V G G D + F +N T
Sbjct: 74 SKEGIQALPAWFRGVKPNKDGKTEGAVSSAIVVREHGDEGKTVDAFYFYFYAYNQGNT-- 131
Query: 381 VGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAI 440
++ + F G HVGDWEH +R + G+ +I++SQH+ G+ Y G + I
Sbjct: 132 --VLGIEF---GDHVGDWEHNMIRFRD--GKPEAIWYSQHAAGQ-AFQYSATDKRGVRPI 183
Query: 441 VYSSKNGHASFPHPGTYLQ 459
+S HA + G +Q
Sbjct: 184 GFSGNGTHAVYATAGLLVQ 202
>gi|78186893|ref|YP_374936.1| hypothetical protein Plut_1031 [Chlorobium luteolum DSM 273]
gi|78166795|gb|ABB23893.1| hypothetical protein Plut_1031 [Chlorobium luteolum DSM 273]
Length = 964
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 10/69 (14%)
Query: 127 LQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTP------NKP-ELD 179
+++PL Y W D G G Y G FWLP P+GY +GF+ + N P ++
Sbjct: 1 MKRPLRYDWSW-DDSGSGGRYNGS--FWLPVSPNGYSPVGFVCSGEAIDGNGHNVPTDMT 57
Query: 180 EVRCVRDDL 188
CVR+DL
Sbjct: 58 LAYCVRNDL 66
>gi|365990525|ref|XP_003672092.1| hypothetical protein NDAI_0I02810 [Naumovozyma dairenensis CBS 421]
gi|343770866|emb|CCD26849.1| hypothetical protein NDAI_0I02810 [Naumovozyma dairenensis CBS 421]
Length = 455
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 76/191 (39%), Gaps = 46/191 (24%)
Query: 353 HVK--PAV------GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLR 404
H+K PAV G + D + F PFN G + + G H+GDWEH +R
Sbjct: 171 HIKSGPAVLILVDKGNGWVDAYWFYFYPFNW------GPFIMGYGPWGNHMGDWEHSLVR 224
Query: 405 ICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA----------------IVYSSKNGH 448
+ G ++ S HS G +AY IE K +++S++ H
Sbjct: 225 F--YKGVPKFLWMSAHSSG---SAYQFSAIEKIKKLKRVSGKLTNEVIGRPLIFSARGTH 279
Query: 449 ASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQYELVAAEYLGEGVVAE-------- 500
A++ G + + + + R ++ D ++ + + Y GE V A
Sbjct: 280 ANYASVGQHAHDVPFYFMPLSDFTDRGPMW-DPALNF--YSYRYDGEKVEAMNEKAREIG 336
Query: 501 PSWLQFMRKWG 511
SWL F +WG
Sbjct: 337 ESWLHFAGRWG 347
>gi|322706401|gb|EFY97982.1| hypothetical protein MAA_06765 [Metarhizium anisopliae ARSEF 23]
Length = 580
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 3/64 (4%)
Query: 125 PALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCV 184
P + P+ + +W D N +G FW P P+GYK +G +V + ++P D + C+
Sbjct: 457 PVVASPVGFNRIW-RDLWSGANSDGS--FWRPTAPEGYKCIGDVVQNSWSEPNKDVIWCL 513
Query: 185 RDDL 188
R DL
Sbjct: 514 RADL 517
>gi|322704671|gb|EFY96264.1| hypothetical protein MAA_08375 [Metarhizium anisopliae ARSEF 23]
Length = 209
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 62/147 (42%), Gaps = 13/147 (8%)
Query: 47 EIEVCRISRFNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRG---FVL 103
E+ S F W+ + + F+ P D F +G R + G VL
Sbjct: 9 ELMAKVTSSFTARWN-DAGSKASRDGGFWHPDSERD-FRPVGSVGVGSHRDINGSYSAVL 66
Query: 104 VARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYK 163
+A + A A+ + + +SP L P YT +W +D G + G FW P PP GY
Sbjct: 67 IAPNCA---AQKINAEGMCESP-LASPTGYTQIW-TDAGSKAKKNGS--FWRPIPPIGYV 119
Query: 164 SMGFLVTKT-PNKPELDEVRCVRDDLT 189
++G + P + +V CVR DL
Sbjct: 120 ALGDVAQDGWLPAPSVGDVWCVRRDLA 146
>gi|255556510|ref|XP_002519289.1| vacuolar protein sorting-associated protein, putative [Ricinus
communis]
gi|223541604|gb|EEF43153.1| vacuolar protein sorting-associated protein, putative [Ricinus
communis]
Length = 4423
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 5/86 (5%)
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDL 188
+P+ +T V + G+G E FFW P P GY S+G +VT+ P + + C R DL
Sbjct: 2264 RPVQFTKV--AHIMGKGIDE--VFFWYPIAPPGYASVGCVVTRIDEAPRIASMCCPRMDL 2319
Query: 189 TDKCEVHHL-IFDAISKFSSSPFSVW 213
++ + + I + S +S +S+W
Sbjct: 2320 VNQANIIEVPISRSPSSKTSQCWSIW 2345
Score = 38.5 bits (88), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 68/166 (40%), Gaps = 25/166 (15%)
Query: 56 FNFIWSCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEG 115
++ IW + + TF+ P P + LG + +R + S+A
Sbjct: 1972 YDRIWVSPKENGPRNNLTFWRPQA-PSNYVILGD------------CVTSRPIPPSQAVM 2018
Query: 116 AHTSNLFKSPALQKPLDYTLVWCSDEGGQG-------NYEGCAFFWLPQPPDGYKSMGFL 168
A SN + ++KP+ + L+ S G QG +Y W+P P+GY ++G +
Sbjct: 2019 A-VSNTYGR--VRKPVGFNLI-ASFSGIQGFLCNSHSDYVTDCSLWMPVAPEGYTALGCV 2074
Query: 169 VTKTPNKPELDEVRCVRDDLTDKCEVHHLIFDA-ISKFSSSPFSVW 213
P V C+R DL IF+ + S+S FS+W
Sbjct: 2075 AHIGRESPPNHIVYCLRSDLVSSTTYSECIFNVPPNPLSTSGFSIW 2120
>gi|224138792|ref|XP_002326691.1| predicted protein [Populus trichocarpa]
gi|222834013|gb|EEE72490.1| predicted protein [Populus trichocarpa]
Length = 389
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 47/113 (41%), Gaps = 19/113 (16%)
Query: 81 PDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWCSD 140
PDG+ S+G P + A +T LF AL PL Y LVW
Sbjct: 236 PDGYISIGDIAHVGGHP-----------PNVAAVYRNTDGLF---AL--PLGYDLVW--- 276
Query: 141 EGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDKCE 193
Q +Y+ W P+ P+GY S G + +PE V CV + L ++ E
Sbjct: 277 RNCQDDYKAPVSVWHPRAPEGYVSPGCVAVSNFEEPEPCSVYCVAESLVEETE 329
>gi|258573469|ref|XP_002540916.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237901182|gb|EEP75583.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 348
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 10/120 (8%)
Query: 399 EHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYL 458
EH +R N G+ +I++SQH+GG+ A+D G + + +S+ HA + GT+
Sbjct: 164 EHNMIRFKN--GQPQAIWYSQHAGGR-AFAWDAVNKRGLRPVAFSANGSHALYTTEGTHD 220
Query: 459 QG--SEILGIGVRNDAARSNLYVDS-----SIQYELVAAEYLGEGVVAEPSWLQFMRKWG 511
L IG+ D D S +Y+ + E+ +WL F +WG
Sbjct: 221 HTIPGVSLPIGLLTDKCNEGFLWDPTLSTYSFRYDRASQEFSAYDADTPVNWLNFDGQWG 280
>gi|83773375|dbj|BAE63502.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391863029|gb|EIT72343.1| vacuolar protein sorting-associated protein [Aspergillus oryzae
3.042]
Length = 300
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 51/123 (41%), Gaps = 10/123 (8%)
Query: 335 GGRQIVKHGNMESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQH 394
GG + + G + A + G D + F FN T V + G H
Sbjct: 72 GGVKPDQDGRTQDAISSTIILRDHGDGTLDAFYFYFYAFNQGNT-------VLAMEFGDH 124
Query: 395 VGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHASFPHP 454
+GDWEH +R G +I++SQH+ G+ E I G + I YS HA++
Sbjct: 125 IGDWEHNMIRFSE--GVPQAIWYSQHASGQAFTYGATEKI-GKRPIAYSGNGTHANYAIS 181
Query: 455 GTY 457
G +
Sbjct: 182 GKH 184
>gi|219883083|ref|YP_002478247.1| hypothetical protein Achl_4479 [Arthrobacter chlorophenolicus A6]
gi|219862089|gb|ACL42430.1| hypothetical protein Achl_4479 [Arthrobacter chlorophenolicus A6]
Length = 430
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 51/130 (39%), Gaps = 11/130 (8%)
Query: 73 TFYEPAGIPDGFYSLGHYCQFD-SRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPL 131
+F PA D F LG + + + P R V++ + A + P L+ P
Sbjct: 67 SFPAPADSTDQFLPLGDFATINNADPGRAAVMLVAPIPGQLA-------IDGQPLLKHPT 119
Query: 132 DYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTDK 191
+ W D+ GN A+FW P PDGY+++G PE CV + K
Sbjct: 120 GFE--WILDDKHSGNDNNIAYFW-PTAPDGYEAVGVCFGFNGATPEAANYWCVHKQVLQK 176
Query: 192 CEVHHLIFDA 201
+ DA
Sbjct: 177 ASLSGFWGDA 186
>gi|356504117|ref|XP_003520845.1| PREDICTED: uncharacterized protein LOC100800761 [Glycine max]
Length = 573
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 33/53 (62%), Gaps = 11/53 (20%)
Query: 487 LVAAEYLGEGVVAEPSWLQFMRKWGPTIVYDSKTELDKIIKLLPLMIRYSVEN 539
+VAA+YLG ++ GP I+YD + E+D++I LL + +++SVE+
Sbjct: 1 MVAAKYLGGRIIT-----------GPAIMYDIRFEIDELIDLLSIFVKFSVES 42
>gi|297800398|ref|XP_002868083.1| hypothetical protein ARALYDRAFT_355042 [Arabidopsis lyrata subsp.
lyrata]
gi|297313919|gb|EFH44342.1| hypothetical protein ARALYDRAFT_355042 [Arabidopsis lyrata subsp.
lyrata]
Length = 4274
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 82/214 (38%), Gaps = 56/214 (26%)
Query: 13 NNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIWSCNLLQSKKKSA 72
NN+ PT P P PL + G F + ++ F IW N +K
Sbjct: 2206 NNIQPTRPQ----PQPLNSVNSGHRFEA------------VATFELIW-WNRGSGSQKKV 2248
Query: 73 TFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPA----LQ 128
+ + P + +G G D+A S E ++ +F + L+
Sbjct: 2249 SIWRPI-VSEGMAYFG------------------DIAVSGYEPPNSCVVFHDTSDQEILK 2289
Query: 129 KPLDYTLVWCSDEGGQGNYEGCAF--FWLPQPPDGYKSMGFLVTKTPNKP-ELDEVRCVR 185
+D+ LV G + G FW+PQ P G+ S+G + K KP + ++RC R
Sbjct: 2290 AAVDFQLV-----GRVKKHRGVESISFWMPQAPPGFVSLGCVACKGSPKPYDFTKLRCAR 2344
Query: 186 DDLTDKCEVHHLIFDAISKFSS-----SPFSVWS 214
D+ H +++ S PFS+WS
Sbjct: 2345 SDMVAG---DHFAEESLWDTSDVWQRVEPFSIWS 2375
>gi|284045577|ref|YP_003395917.1| hypothetical protein Cwoe_4127 [Conexibacter woesei DSM 14684]
gi|283949798|gb|ADB52542.1| YD repeat protein [Conexibacter woesei DSM 14684]
Length = 2318
Score = 39.7 bits (91), Expect = 4.0, Method: Composition-based stats.
Identities = 49/211 (23%), Positives = 80/211 (37%), Gaps = 39/211 (18%)
Query: 318 PSGGRNDGEFWIDLPSDGGRQIVK----HGNMESAKLYVHVKPAVGGTFTDIVMWVFCPF 373
PSG R G ++D +I +G+ + +V+ G+ + W+F +
Sbjct: 1991 PSGARVSGRDYLDEVGTEEEKIQDSVDIYGSNPDWQDPAYVRFWTSGSVGWVQYWLFYYY 2050
Query: 374 NGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWS----IYFSQHSGGK----- 424
N T G G H GDWE +R + LWS + +S H
Sbjct: 2051 N---TAPAGFEQ------GHHEGDWEMVQVRYRRESNGLWSDPDRVTYSNHRDAHAISCP 2101
Query: 425 WVAAYDLEYIEGNKAI-VYSSKNGHASFPHPGTYLQGSEILGIGVRNDAAR-SNLYVDSS 482
W +++ G + VY HA+F PG + + G+ D AR NL
Sbjct: 2102 WSTVDEVDPFTGETSFQVYPGAGTHANFHRPGVF----DGAAFGLAKDLARGQNLTGRPR 2157
Query: 483 IQYELVAAEYLGEGVVAEPSWLQFMRKWGPT 513
+++ + A SWL++ +WG T
Sbjct: 2158 LRF-----------LAAGASWLRWPGRWGST 2177
>gi|2245071|emb|CAB10494.1| hypothetical protein [Arabidopsis thaliana]
gi|7268464|emb|CAB80984.1| hypothetical protein [Arabidopsis thaliana]
Length = 747
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 28/128 (21%), Positives = 48/128 (37%), Gaps = 26/128 (20%)
Query: 81 PDGFYSLGHYCQ-FDSRPLRGFVLVARDLASSEAEGAHTSNLFKSPALQKPLDYTLVWC- 138
P GF SLG Y D P +G ++V +NL + +++PL + L+W
Sbjct: 550 PPGFASLGDYLTPLDKPPTKGVLVV-------------NTNLMR---VKRPLSFKLIWSP 593
Query: 139 --------SDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDDLTD 190
S + + W P+ P GY ++ +V+ P L C+
Sbjct: 594 LASGGLGGSSMDDKDERDSSCSIWFPEAPKGYVALSCVVSSGSTPPSLASTFCILASSVS 653
Query: 191 KCEVHHLI 198
C + +
Sbjct: 654 PCSLRDCV 661
>gi|444318225|ref|XP_004179770.1| hypothetical protein TBLA_0C04540 [Tetrapisispora blattae CBS 6284]
gi|387512811|emb|CCH60251.1| hypothetical protein TBLA_0C04540 [Tetrapisispora blattae CBS 6284]
Length = 467
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 53/127 (41%), Gaps = 28/127 (22%)
Query: 351 YVHVKPAV------GGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLR 404
Y+ PAV G + D + F P NG G + + G HVGDWEH +R
Sbjct: 180 YLKDAPAVLFVVDKGNGWVDAFWFYFYPMNG------GPYIMGYGPWGSHVGDWEHSLVR 233
Query: 405 ICNFTGELWSIYFSQHSGGKWVAAYDLEYIEG--------------NKAIVYSSKNGHAS 450
F G+ ++ S HSGG +E +E + +++S++ HA+
Sbjct: 234 F--FKGKPQYLWLSAHSGGTAYQFGAIEKLEKLRHDEDGKLTPEVIERPLIFSARGTHAN 291
Query: 451 FPHPGTY 457
+ G +
Sbjct: 292 YASVGQH 298
>gi|385676535|ref|ZP_10050463.1| hypothetical protein AATC3_11514 [Amycolatopsis sp. ATCC 39116]
Length = 557
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 94/237 (39%), Gaps = 39/237 (16%)
Query: 254 PKLHAMPN---CDQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDL----- 305
PK P D+ AL + P V+ DE + P + + L Y+ D
Sbjct: 22 PKEQTAPTPARVDEPTAL--QFAPRVWLAGDERFTPMDATAYIRQSVLRYEHDDCADPEP 79
Query: 306 VGEAIDP-------------SGSNLPSG-------GRNDGEFWIDLPSDGGRQIVKHGNM 345
V + +DP + +PS R+ G+ + SDGG + H N+
Sbjct: 80 VADPVDPGKLRGTVYKHHKDADPAVPSPDKPLFPCARHSGDE-VSPNSDGGFYLDAHDNV 138
Query: 346 ESAKLYVHVKPAVGGTFTDIVMWVFCPFNGPGTLKVGIMNVAFSKIGQHVGDWEHFTLRI 405
+ + + +V+ F G L+ V + H GDWE +++
Sbjct: 139 QPDDAPPPMYWEHHRADDERSAYVYWFFYGRNELRPAGTPVGNT----HEGDWERVAVQL 194
Query: 406 CNFTGELWSIYFSQHSGGKWVAAY-DLEYIEGNKAIVYSSKNGHASFPHPGTYLQGS 461
N GE ++ F H + DL++ +G+ +VYS+K HAS+P G ++ G+
Sbjct: 195 RN--GEPVAVTFFGHGAEPCSVPWADLDHDDGHP-VVYSAKGSHASYPSKGWHVSGA 248
>gi|402072414|gb|EJT68219.1| hypothetical protein GGTG_14200 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 526
Score = 39.7 bits (91), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 31/65 (47%), Gaps = 7/65 (10%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYSSKNGHA 449
+ G HVGDWEH +R GE +++ S+H GG+ A LE KA NG +
Sbjct: 316 RFGNHVGDWEHCMVRFVR--GEPTAVFLSEHEGGQAYAWRALE-----KAAAEGDNNGSS 368
Query: 450 SFPHP 454
P
Sbjct: 369 QHSRP 373
>gi|357149002|ref|XP_003574966.1| PREDICTED: uncharacterized protein LOC100837501 [Brachypodium
distachyon]
Length = 3940
Score = 39.3 bits (90), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 33/67 (49%), Gaps = 4/67 (5%)
Query: 128 QKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDEVRCVRDD 187
++P+ +T V D G FFW P P GY S+G ++TKT P D + C +
Sbjct: 2226 ERPVQFTKVAQIDRKGFDEI----FFWYPVAPPGYVSLGCVLTKTDEMPSKDSICCPKLG 2281
Query: 188 LTDKCEV 194
L ++ +
Sbjct: 2282 LVNQANI 2288
>gi|108797777|ref|YP_637974.1| hypothetical protein Mmcs_0799 [Mycobacterium sp. MCS]
gi|119866868|ref|YP_936820.1| hypothetical protein Mkms_0814 [Mycobacterium sp. KMS]
gi|108768196|gb|ABG06918.1| hypothetical protein Mmcs_0799 [Mycobacterium sp. MCS]
gi|119692957|gb|ABL90030.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length = 714
Score = 39.3 bits (90), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 32/76 (42%), Gaps = 2/76 (2%)
Query: 263 DQIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAG--DLVGEAIDPSGSNLPSG 320
D I L R + P +FFHPDEV+ PS F + L D G P G+ P
Sbjct: 234 DPIELLARRFEPVLFFHPDEVFFPSDAKRFVESAQLWTATSPFDSPGRWGGPPGTPFPRQ 293
Query: 321 GRNDGEFWIDLPSDGG 336
LP++GG
Sbjct: 294 PEVPAGQLRGLPTEGG 309
>gi|346323706|gb|EGX93304.1| hypothetical protein CCM_04677 [Cordyceps militaris CM01]
Length = 518
Score = 39.3 bits (90), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%), Gaps = 17/80 (21%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIE------------GN 437
+ G HV DWEH +R + G +Y S+H GG+ AY E +E
Sbjct: 306 RFGNHVADWEHCMVRFKH--GVPQGMYLSEHEGGQ---AYAWEALEKRNVTFNGVPDAAE 360
Query: 438 KAIVYSSKNGHASFPHPGTY 457
+ ++YS+ HA + PG +
Sbjct: 361 RPVIYSATGSHAMYATPGDH 380
>gi|320589951|gb|EFX02407.1| vacuolar sorting-associated protein [Grosmannia clavigera kw1407]
Length = 654
Score = 39.3 bits (90), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 16/82 (19%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA---------- 439
+ G H+GDWEH +R N G +++ S+H GG+ A LE + ++
Sbjct: 423 RYGNHIGDWEHCMIRFEN--GVPRALFLSEHEGGQAYAWGALEKFQPKRSADEDANLPPL 480
Query: 440 ----IVYSSKNGHASFPHPGTY 457
++YS+ HA + PG +
Sbjct: 481 PKRPVIYSAVGSHAMYAVPGNH 502
>gi|444918879|ref|ZP_21238934.1| Arabinogalactan endo-1,4-beta-galactosidase [Cystobacter fuscus DSM
2262]
gi|444709309|gb|ELW50331.1| Arabinogalactan endo-1,4-beta-galactosidase [Cystobacter fuscus DSM
2262]
Length = 578
Score = 39.3 bits (90), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 30/72 (41%), Gaps = 2/72 (2%)
Query: 384 MNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKAIVYS 443
+ F G H DWE + + + G L + F QH G A E +EG I Y
Sbjct: 281 QSTCFVSSGSHAADWESVAVLVVD--GRLSRVAFYQHGGWYSREAGSFETVEGTHPIGYV 338
Query: 444 SKNGHASFPHPG 455
KN H ++ G
Sbjct: 339 GKNAHGTYHDSG 350
>gi|406602723|emb|CCH45681.1| Vacuolar protein sorting-associated protein [Wickerhamomyces
ciferrii]
Length = 459
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 121/332 (36%), Gaps = 71/332 (21%)
Query: 264 QIHALIRNYGPTVFFHPDEVYLPSSVSWFFTNGALLYKAGDLV----------GE-AIDP 312
+I I +Y P V + +E YLP + + + L ++ ++ GE A DP
Sbjct: 92 EIPKYILDYAPLVHLYTEEEYLPYDIEDYVKHFHLEFRNKTILTNQTLSLKKLGEYANDP 151
Query: 313 SGSNLPSGGRNDGEFWIDLPSDGGRQIVKHGNMESAKLYVHVKPAV------GGTFTDIV 366
+ + +D EF + G+Q N+ S + P+V G + D
Sbjct: 152 NLNTNDLFLTSDFEFNDNPDWLTGKQ--NKPNLSSG--LIKNAPSVLIVVDKGNGWVDAY 207
Query: 367 MWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKW 425
+ F FN GP + G G H+GDWEH +R N + GG
Sbjct: 208 WFYFYSFNLGPFVMGGG-------PYGNHIGDWEHSLMRFYNGLPIFVWMSAHGGGGGYH 260
Query: 426 VAAYDLEYIEGNKAIVYSSKNGHASFPHPGTYLQGSEILGIGVRNDAARSNLYVDSSIQY 485
A + I+ K I++SS+ HA++ G Q S L G+ +D D S Y
Sbjct: 261 YKALEKSDIDSKKPIIFSSRGTHANYASVG---QHSHDLPFGMLSDFTDRGPLWDPSKNY 317
Query: 486 ELVAAEYLGEGVVAEP----------------SWLQFMRKWGPTIVYDSKTELDKIIKLL 529
Y +G + P WL F WG DK+
Sbjct: 318 ----LGYTWDGEILSPVSNESKPDNFKELNYGDWLLFKGHWGD----------DKLSPED 363
Query: 530 PLMIRYSVENAVSKLPLELYGEEGPTGPKEKN 561
P R+S P E EGPTGP K+
Sbjct: 364 PRQ-RWS--------PFEWKFIEGPTGPLTKH 386
>gi|400597825|gb|EJP65549.1| vacuolar protein sorting-associated protein [Beauveria bassiana
ARSEF 2860]
Length = 513
Score = 38.9 bits (89), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 17/80 (21%)
Query: 390 KIGQHVGDWEHFTLRICNFTGELWSIYFSQHSGGKWVAAYDLEYIEGNKA---------- 439
+ G HV DWEH +R + G +Y S+H GG+ AY E +E A
Sbjct: 300 RFGNHVADWEHCMVRFRH--GVPQGMYLSEHEGGQ---AYAWEAMEKRNATFKGVPNAAE 354
Query: 440 --IVYSSKNGHASFPHPGTY 457
++YS+ HA + PG +
Sbjct: 355 RPVIYSAMGSHAMYATPGDH 374
>gi|451849292|gb|EMD62596.1| hypothetical protein COCSADRAFT_173037 [Cochliobolus sativus
ND90Pr]
Length = 281
Score = 38.5 bits (88), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 10/73 (13%)
Query: 123 KSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELD--- 179
+ PA+ P YT +W + G G Y+G FW P P Y ++G++ N P ++
Sbjct: 48 EKPAVASPTGYTKIW-TGRGSGGKYDGS--FWRPIAPSRYVALGYICISNNNTPSVNYMY 104
Query: 180 ----EVRCVRDDL 188
E V DDL
Sbjct: 105 TEEIEKIAVLDDL 117
>gi|58268068|ref|XP_571190.1| protein-vacuolar targeting-related protein [Cryptococcus neoformans
var. neoformans JEC21]
gi|57227424|gb|AAW43883.1| protein-vacuolar targeting-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 488
Score = 38.5 bits (88), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 42/91 (46%), Gaps = 15/91 (16%)
Query: 363 TDIVMWVFCPFN-GPGTLKVGIMNVAFSKIGQHVGDWEHFTLRICNFTGELWSIYFSQHS 421
D+ W F P N G ++GI+ G HV DWEH +RI + G S +S H
Sbjct: 180 VDLWYWAFYPLNFGKPAGRLGIL-------GNHVADWEHLRVRIVD--GVPVSADYSTHE 230
Query: 422 GGKWVAAY----DLEYIEGNKAIVYSSKNGH 448
GG+ A D+E + ++ + Y + H
Sbjct: 231 GGRLSAGTVRWEDVEKVH-DRPVAYVAMGSH 260
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.139 0.450
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,699,892,250
Number of Sequences: 23463169
Number of extensions: 502229543
Number of successful extensions: 892003
Number of sequences better than 100.0: 452
Number of HSP's better than 100.0 without gapping: 143
Number of HSP's successfully gapped in prelim test: 309
Number of HSP's that attempted gapping in prelim test: 890601
Number of HSP's gapped (non-prelim): 727
length of query: 569
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 421
effective length of database: 8,886,646,355
effective search space: 3741278115455
effective search space used: 3741278115455
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 80 (35.4 bits)