BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 046815
(463 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255569363|ref|XP_002525649.1| conserved hypothetical protein [Ricinus communis]
gi|223535085|gb|EEF36767.1| conserved hypothetical protein [Ricinus communis]
Length = 491
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/460 (67%), Positives = 375/460 (81%), Gaps = 3/460 (0%)
Query: 4 PSRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRT 63
P + AR PSYLFPC++ L ++L Y+VD+FAS+TKT+AGHNL+PTPWH+FP RT
Sbjct: 5 PPKAAARVPSYLFPCLLGLVSLTLLFF---YQVDNFASRTKTVAGHNLDPTPWHIFPPRT 61
Query: 64 FKEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKS 123
F EE+R+++AYKI+ C+YLTC RR +S Q CP+FF+ IH DL+PWA++
Sbjct: 62 FDEETRQARAYKIIQCSYLTCPYTNTTTTRRRSQSSSQANAKCPEFFRFIHHDLQPWART 121
Query: 124 RITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVD 183
IT +HI EAK+FAA R++I G+LY+D YY CVQSR MFT+WG LQLL RYPGMVPDVD
Sbjct: 122 GITKKHIAEAKKFAAFRVVIFEGRLYLDLYYACVQSRMMFTVWGLLQLLNRYPGMVPDVD 181
Query: 184 IMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI 243
IMFDCMD+PVI+K EH SFPLP+FRYCT HFDIPFPDWSFWGW E+N++ WNEEF+DI
Sbjct: 182 IMFDCMDRPVINKTEHISFPLPIFRYCTTQNHFDIPFPDWSFWGWPEINIRSWNEEFRDI 241
Query: 244 KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK 303
K GSQ+KSW +K P AYWKGNPDVLSP+R ELM+CN S+ WGA I+RQ+W EEA+ GF++
Sbjct: 242 KRGSQSKSWSKKWPRAYWKGNPDVLSPIRTELMQCNHSRKWGAHIMRQDWGEEARAGFER 301
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
SKLSNQCN+RYKIYAEG+AWSVSLKYI+SC S+ALIIS QY+DFFSRGL+P N++P+ S
Sbjct: 302 SKLSNQCNYRYKIYAEGFAWSVSLKYIISCGSLALIISPQYEDFFSRGLVPASNYWPVAS 361
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPA 423
+LCRSIK VDWGNANPSEAE IGKAGQDFME+L+M+ VYDYM HLITEYSKL +KP
Sbjct: 362 DELCRSIKFAVDWGNANPSEAESIGKAGQDFMETLSMEGVYDYMFHLITEYSKLQVFKPV 421
Query: 424 PPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
PSSA E C +SLLC ADPKQ+Q LE++AA PSP P C+L
Sbjct: 422 LPSSALEVCADSLLCFADPKQKQFLERSAAFPSPKPACSL 461
>gi|356569533|ref|XP_003552954.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
Length = 464
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 299/459 (65%), Positives = 362/459 (78%), Gaps = 2/459 (0%)
Query: 5 SRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTF 64
S+ RSP+YL PCVI+L++ SL L L YKVDD AS+T T+ GHNLEPTPWH+FP + F
Sbjct: 5 SKHTPRSPTYLIPCVIALALFSLTGLLL-YKVDDVASRTGTVVGHNLEPTPWHVFPHKPF 63
Query: 65 KEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSR 124
EESR+ + YKI+ C+YLTC A + RR + + + CP+FF++IH+DL PW +SR
Sbjct: 64 DEESRQQRTYKILQCSYLTCRYAAGAVGGSRRSFAGGR-EECPEFFRAIHRDLAPWLESR 122
Query: 125 ITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDI 184
I+ H+ A+R+AA R++IV GK++VD YY CVQSRAMFT+WG LQL+RRYPG VPDVD+
Sbjct: 123 ISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTLWGLLQLMRRYPGKVPDVDM 182
Query: 185 MFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIK 244
MFDCMDKP +++ EH + PLPLFRYCT HFDIPFPDWSFWGWSE+N++PW EEF DIK
Sbjct: 183 MFDCMDKPSVNRTEHQAMPLPLFRYCTTKEHFDIPFPDWSFWGWSEINIRPWQEEFPDIK 242
Query: 245 HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS 304
GS+ SWK K P+AYWKGNPDV SP+R EL+ CNDS+ WGAEI+RQ+W E A+ GFK+S
Sbjct: 243 QGSRNVSWKNKFPWAYWKGNPDVASPIRTELINCNDSRKWGAEIMRQDWGEAARSGFKQS 302
Query: 305 KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSA 364
KLSNQCNHRYKIYAEGYAWSVSLKYILSC SVALIIS QY+DFFSRGLIP N + + S
Sbjct: 303 KLSNQCNHRYKIYAEGYAWSVSLKYILSCGSVALIISPQYEDFFSRGLIPNHNFWLVDSL 362
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAP 424
+LC SIK V+WGN +P EAE IGK GQDFM SL MDR+Y+YM HLI+EYSKL D+KP P
Sbjct: 363 NLCPSIKYAVEWGNQHPVEAEAIGKRGQDFMGSLNMDRIYEYMFHLISEYSKLQDFKPTP 422
Query: 425 PSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
P++A E CVES+LC AD KQR L K+ A PS PPC L
Sbjct: 423 PTTALEVCVESVLCFADEKQRMFLNKSTAFPSHKPPCNL 461
>gi|297849014|ref|XP_002892388.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
lyrata]
gi|297338230|gb|EFH68647.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
lyrata]
Length = 508
Score = 651 bits (1679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/466 (65%), Positives = 367/466 (78%), Gaps = 13/466 (2%)
Query: 6 RTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFK 65
++ R+PSYL CV++LS S AL L YKVDDF ++TKTLAGHNLEPTPWH+FP+++F
Sbjct: 13 KSSPRTPSYLLLCVLALSFFSFTALLL-YKVDDFIAQTKTLAGHNLEPTPWHIFPRKSFS 71
Query: 66 EESRRSQAYKIVHCTYLTC--------LSAMNPIPERRRVASPQKVQTCPDFFKSIHKDL 117
E ++ SQAY+I+ C+Y +C S + RR PQ CPDFF+ IH+DL
Sbjct: 72 EGTKHSQAYRILQCSYFSCPYNAVVQPKSLQSESVSGRRTHQPQ----CPDFFRWIHRDL 127
Query: 118 EPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPG 177
EPWAK+ +T H+ AK AA R++I+ GKLYVD YY CVQSR MFTIWG LQLL +YPG
Sbjct: 128 EPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLNKYPG 187
Query: 178 MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWN 237
MVPDVD+MFDCMDKP+I++ E+ SFP+PLFRYCTN+AH DIPFPDWSFWGWSE NL+PW
Sbjct: 188 MVPDVDMMFDCMDKPIINQTEYQSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWE 247
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
EEF DIK GS+ +SW K P AYWKGNPDV+SP+R+ELMKCN S+LWGA+I+RQ+WAEEA
Sbjct: 248 EEFGDIKQGSRRRSWDNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEEA 307
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKN 357
K GF++SKLSNQCNHRYKIYAEGYAWSVSLKYILSC S+ LIIS +Y+DFFSRGL+P +N
Sbjct: 308 KGGFEQSKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKEN 367
Query: 358 HFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKL 417
++PI DLCRSIK VDWGNANPS+AE IGK GQ +MES++MDRVYDYM HLITEYSKL
Sbjct: 368 YWPISPTDLCRSIKYAVDWGNANPSQAETIGKRGQGYMESISMDRVYDYMFHLITEYSKL 427
Query: 418 LDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
+KP P+SA E C SLLC A+ K+R+ LE++ PS PC L
Sbjct: 428 QKFKPEKPASANEVCAGSLLCFAEQKERELLERSRVVPSLDQPCKL 473
>gi|15222341|ref|NP_172202.1| uncharacterized protein [Arabidopsis thaliana]
gi|8954024|gb|AAF82198.1|AC067971_6 Contains similarity to an unknown protein T2J13.180 gi|6522568 from
Arabidopsis thaliana BAC T2J13 gb|AL132967. ESTs
gb|Z29835 and gb|Z29836 come from this gene [Arabidopsis
thaliana]
gi|332189973|gb|AEE28094.1| uncharacterized protein [Arabidopsis thaliana]
Length = 507
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 301/465 (64%), Positives = 366/465 (78%), Gaps = 5/465 (1%)
Query: 1 MVFPSRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFP 60
+ P ++ RSPSYL CV++LS S AL YKVDDF ++TKTLAGHNLEPTPWH+FP
Sbjct: 7 LRLPHKSSPRSPSYLLLCVLALSFFSFTALLF-YKVDDFIAQTKTLAGHNLEPTPWHIFP 65
Query: 61 QRTFKEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQT----CPDFFKSIHKD 116
+++F ++ SQAY+I+ C+Y +C P+ S QT CPDFF+ IH+D
Sbjct: 66 RKSFSAATKHSQAYRILQCSYFSCPYKAVVQPKSLHSESGSGRQTHQPQCPDFFRWIHRD 125
Query: 117 LEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYP 176
LEPWAK+ +T H+ AK AA R++I+ GKLYVD YY CVQSR MFTIWG LQLL +YP
Sbjct: 126 LEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLTKYP 185
Query: 177 GMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPW 236
GMVPDVD+MFDCMDKP+I++ E+ SFP+PLFRYCTN+AH DIPFPDWSFWGWSE NL+PW
Sbjct: 186 GMVPDVDMMFDCMDKPIINQTEYQSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPW 245
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
EEF DIK GS+ +SW K P AYWKGNPDV+SP+R+ELMKCN S+LWGA+I+RQ+WAEE
Sbjct: 246 EEEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEE 305
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
AK GF++SKLSNQCNHRYKIYAEGYAWSVSLKYILSC S+ LIIS +Y+DFFSRGL+P +
Sbjct: 306 AKGGFEQSKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKE 365
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSK 416
N++PI DLCRSIK VDWGN+NPSEAE IGK GQ +MESL+M+RVYDYM HLITEYSK
Sbjct: 366 NYWPISPTDLCRSIKYAVDWGNSNPSEAETIGKRGQGYMESLSMNRVYDYMFHLITEYSK 425
Query: 417 LLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPC 461
L +KP P+SA E C SLLC+A+ K+R+ LE++ PS PC
Sbjct: 426 LQKFKPEKPASANEVCAGSLLCIAEQKERELLERSRVVPSLDQPC 470
>gi|62320292|dbj|BAD94602.1| hypothetical protein [Arabidopsis thaliana]
Length = 507
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/465 (64%), Positives = 365/465 (78%), Gaps = 5/465 (1%)
Query: 1 MVFPSRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFP 60
+ P ++ RSPSYL CV++LS S AL YKVDDF ++TKTLAGHNLEPTPWH+FP
Sbjct: 7 LRLPHKSSPRSPSYLLLCVLALSFFSFTALLF-YKVDDFIAQTKTLAGHNLEPTPWHIFP 65
Query: 61 QRTFKEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQT----CPDFFKSIHKD 116
+++F ++ SQAY+I+ C+Y +C P+ S QT CPDFF+ IH+D
Sbjct: 66 RKSFSAATKHSQAYRILQCSYFSCPYKAVVQPKSLHSESGSGRQTHQPQCPDFFRWIHRD 125
Query: 117 LEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYP 176
LEPWAK+ +T H+ AK AA R++I+ GKLYVD YY CVQSR MFTIWG LQLL +YP
Sbjct: 126 LEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLTKYP 185
Query: 177 GMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPW 236
GMVPDVD+MFDCMDKP+I++ E+ SFP+PLFRYCTN+AH DIPFPDWSFWGWSE NL+PW
Sbjct: 186 GMVPDVDMMFDCMDKPIINQTEYQSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPW 245
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
EF DIK GS+ +SW K P AYWKGNPDV+SP+R+ELMKCN S+LWGA+I+RQ+WAEE
Sbjct: 246 EVEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEE 305
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
AK GF++SKLSNQCNHRYKIYAEGYAWSVSLKYILSC S+ LIIS +Y+DFFSRGL+P +
Sbjct: 306 AKGGFEQSKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKE 365
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSK 416
N++PI DLCRSIK VDWGN+NPSEAE IGK GQ +MESL+M+RVYDYM HLITEYSK
Sbjct: 366 NYWPISPTDLCRSIKYAVDWGNSNPSEAETIGKRGQGYMESLSMNRVYDYMFHLITEYSK 425
Query: 417 LLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPC 461
L +KP P+SA E C SLLC+A+ K+R+ LE++ PS PC
Sbjct: 426 LQKFKPEKPASANEVCAGSLLCIAEQKERELLERSRVVPSLDQPC 470
>gi|356541729|ref|XP_003539326.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
Length = 464
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/459 (64%), Positives = 363/459 (79%), Gaps = 3/459 (0%)
Query: 5 SRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTF 64
+ TP RSP+YL PCVI+L++ SL L L YKVDD AS+T T+ GHNLEPTPWH+FP + F
Sbjct: 6 THTP-RSPTYLIPCVIALALFSLTGLLL-YKVDDVASRTGTVVGHNLEPTPWHVFPHKPF 63
Query: 65 KEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSR 124
EESR+ +AYKI+ C+YLTC A + RR + CP FF++IH+DL PW++SR
Sbjct: 64 DEESRQQRAYKILQCSYLTCRYAAEALGGARRRTG-GGREECPKFFRAIHRDLAPWSESR 122
Query: 125 ITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDI 184
I+ H+ A+R+AA R++IV GK++VD YY CVQSRAMFT+WG LQL+RRYPGMVPDVD+
Sbjct: 123 ISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTLWGLLQLMRRYPGMVPDVDM 182
Query: 185 MFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIK 244
MFDCMDKP ++K EH + PLPLFRYCT HFDIPFPDWSFWGWSE+N++PW EEF DIK
Sbjct: 183 MFDCMDKPSVNKTEHQAMPLPLFRYCTTKEHFDIPFPDWSFWGWSEINIRPWQEEFPDIK 242
Query: 245 HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS 304
GS++ +WK KLP+AYWKGNPDV SP+R EL+ CNDS+ WGAEI+RQ+W E A++GFK+S
Sbjct: 243 RGSRSVTWKNKLPWAYWKGNPDVASPIRTELINCNDSRKWGAEIMRQDWGEAARNGFKQS 302
Query: 305 KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSA 364
KLS+QCNHRYKIYAEGYAWSVSLKYILSC SVALIIS QY+DFFSRGLIP N + +
Sbjct: 303 KLSDQCNHRYKIYAEGYAWSVSLKYILSCGSVALIISPQYEDFFSRGLIPNHNFWLVDPL 362
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAP 424
+LC SIK V+WGN +P EAE IGK GQD MESL M+R+Y+YM HLI++YSKL D+KP P
Sbjct: 363 NLCPSIKYAVEWGNQHPVEAEAIGKRGQDLMESLNMNRIYEYMFHLISDYSKLQDFKPTP 422
Query: 425 PSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
P +A E CVES+LC AD KQR L K+ PS PPC L
Sbjct: 423 PPTALEVCVESVLCFADEKQRMFLNKSFTFPSHKPPCNL 461
>gi|449455154|ref|XP_004145318.1| PREDICTED: uncharacterized protein LOC101204476 [Cucumis sativus]
Length = 472
Score = 639 bits (1649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/457 (65%), Positives = 369/457 (80%), Gaps = 8/457 (1%)
Query: 10 RSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESR 69
R PS+L P V+++ +SL L L YKVDDFA++TKT+AGHNL+PTPWHLFP +TF +E+R
Sbjct: 6 RPPSHLLPSVVAICFLSLTFL-LCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETR 64
Query: 70 RSQAYKIVHCTYLTCLSAMN---PIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRIT 126
++A KI+HC+YLTC A N P V++P+ CP+FF+ IH DL+PWA++RI+
Sbjct: 65 HARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPK----CPEFFRWIHHDLDPWARTRIS 120
Query: 127 MRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMF 186
M + E+++FAA R++IV G+LYVD YY CVQSRA+FTIWG +Q+LRRYPGMVPDVD+MF
Sbjct: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180
Query: 187 DCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHG 246
DCMDKP I++ E+ + PLPLFRYCT +AHFDIPFPDWSFWGW EVNL+ W EEF+DIK G
Sbjct: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKL 306
S+ SW K P AYWKGNPDV SP R EL+KCN S++WGA+I+RQ+WA+EA+DG+++SKL
Sbjct: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKL 300
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADL 366
SNQCNHRYKIYAEG+AWSVSLKYILSC S++LIIS QY+DFFSRGL P KN++PIP ++
Sbjct: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360
Query: 367 CRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPS 426
C SIK VDWGN + EAE IG+ GQ FMESL+MD VY YM HLITEYSKL D+KP PP
Sbjct: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMESLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420
Query: 427 SAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
SA E C +SLLC+AD KQ Q LEK+AAS S PPC+L
Sbjct: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSL 457
>gi|449520138|ref|XP_004167091.1| PREDICTED: uncharacterized protein LOC101228589 [Cucumis sativus]
Length = 472
Score = 639 bits (1648), Expect = e-180, Method: Compositional matrix adjust.
Identities = 298/457 (65%), Positives = 369/457 (80%), Gaps = 8/457 (1%)
Query: 10 RSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESR 69
R PS+L P V+++ +SL L L YKVDDFA++TKT+AGHNL+PTPWHLFP +TF +E+R
Sbjct: 6 RPPSHLLPSVVAICFLSLTFL-LCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETR 64
Query: 70 RSQAYKIVHCTYLTCLSAMN---PIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRIT 126
++A KI+HC+YLTC A N P V++P+ CP+FF+ IH DL+PWA++RI+
Sbjct: 65 HARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPK----CPEFFRWIHHDLDPWARTRIS 120
Query: 127 MRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMF 186
M + E+++FAA R++IV G+LYVD YY CVQSRA+FTIWG +Q+LRRYPGMVPDVD+MF
Sbjct: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180
Query: 187 DCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHG 246
DCMDKP I++ E+ + PLPLFRYCT +AHFDIPFPDWSFWGW EVNL+ W EEF+DIK G
Sbjct: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKL 306
S+ SW K P AYWKGNPDV SP R EL+KCN S++WGA+I+RQ+WA+EAKDG+++SKL
Sbjct: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEAKDGYEQSKL 300
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADL 366
SNQCNHRYKIYAEG+AWSVSLKYILSC S++LIIS QY+DFFSRGL P KN++PIP ++
Sbjct: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360
Query: 367 CRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPS 426
C SIK VDWGN + EAE IG+ GQ FME+L+MD VY YM HLITEYSKL D+KP PP
Sbjct: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420
Query: 427 SAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
SA E C +SLLC+AD KQ Q LEK+AAS S PPC+L
Sbjct: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSL 457
>gi|449471103|ref|XP_004153209.1| PREDICTED: uncharacterized protein LOC101204904 [Cucumis sativus]
Length = 472
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 297/457 (64%), Positives = 369/457 (80%), Gaps = 8/457 (1%)
Query: 10 RSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESR 69
R PS+L P V+++ +SL L L YKVDDFA++TKT+AGHNL+PTPWHLFP +TF +E+R
Sbjct: 6 RPPSHLLPSVVAICFLSLTFL-LCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSDETR 64
Query: 70 RSQAYKIVHCTYLTCLSAMN---PIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRIT 126
++A KI+HC+YLTC A N P V++P+ CP+FF+ IH DL+PWA++RI+
Sbjct: 65 HARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPK----CPEFFRWIHHDLDPWARTRIS 120
Query: 127 MRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMF 186
M + E+++FAA R++IV G+LYVD YY CVQSRA+FTIWG +Q+LRRYPGMVPDVD+MF
Sbjct: 121 MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMF 180
Query: 187 DCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHG 246
DCMDKP I++ E+ + PLPLFRYCT +AHFDIPFPDWSFWGW EVNL+ W EEF+DIK G
Sbjct: 181 DCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKG 240
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKL 306
S+ SW K P AYWKGNPDV SP R EL+KCN S++WGA+I+RQ+WA+EA+DG+++SKL
Sbjct: 241 SKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKL 300
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADL 366
SNQCNHRYKIYAEG+AWSVSLKYILSC S++LIIS QY+DFFSRGL P KN++PIP ++
Sbjct: 301 SNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNM 360
Query: 367 CRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPS 426
C SIK VDWGN + EAE IG+ GQ FME+L+MD VY YM HLITEYSKL D+KP PP
Sbjct: 361 CESIKHAVDWGNTHFPEAETIGRQGQKFMENLSMDTVYSYMFHLITEYSKLQDFKPTPPP 420
Query: 427 SAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
SA E C +SLLC+AD KQ Q LEK+AAS S PPC+L
Sbjct: 421 SALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSL 457
>gi|359478546|ref|XP_003632132.1| PREDICTED: protein O-glucosyltransferase 1-like [Vitis vinifera]
gi|297745896|emb|CBI15952.3| unnamed protein product [Vitis vinifera]
Length = 464
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 289/460 (62%), Positives = 350/460 (76%), Gaps = 6/460 (1%)
Query: 9 ARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEES 68
A S +L PC+I S + + Y+VDDFAS+TKT+ GHNL PTPWHLFP TF E++
Sbjct: 6 APSLPFLRPCII-FSPFFFLIILVIYQVDDFASQTKTVVGHNLNPTPWHLFPPNTFTEKT 64
Query: 69 RRSQAYKIVHCTYLTC-----LSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKS 123
R ++ KI+ C+YLTC IPE S V CP FF I DL+PW +S
Sbjct: 65 RYARVSKIIQCSYLTCRRRSITPTTTKIPEWHTRQSSNTVGKCPMFFTRIDHDLQPWVRS 124
Query: 124 RITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVD 183
I++ +MEA++FAA R++IV GKLYVD +Y CVQSRAMFT+WG LQLLRRYPG VPDVD
Sbjct: 125 GISLSSVMEAQKFAAFRVVIVGGKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVD 184
Query: 184 IMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI 243
+MFDCMDKP I ++EHGS PLPLFRYCT HFDIPFPDWSFWGW E+++ PW+EEF I
Sbjct: 185 LMFDCMDKPTISREEHGSKPLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGI 244
Query: 244 KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK 303
K GSQ +W +KL +AYWKGNPDV SP+RV+L++CN+S + GA+I+RQ+W EEAK+GFK+
Sbjct: 245 KQGSQVLNWTQKLSYAYWKGNPDVQSPVRVDLLQCNNSDIIGAQIMRQDWVEEAKNGFKE 304
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
SKLSNQCNHRYKIYAEGYAWSVSLKYILSC S+ALII+ QY++FF+ GLI N++PI
Sbjct: 305 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSLALIIAPQYEEFFNHGLISMTNYWPISR 364
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPA 423
D+C SIK V WGN + SEA+ IGK+GQD MES++M RVYDYM HLITEYSKLL +KP
Sbjct: 365 LDICPSIKFAVSWGNTHHSEAKAIGKSGQDLMESMSMARVYDYMYHLITEYSKLLRFKPE 424
Query: 424 PPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
PP SA E C ESLLC ADP QRQ LE++ PSP PPCTL
Sbjct: 425 PPPSAHEICEESLLCFADPTQRQCLERSTTYPSPTPPCTL 464
>gi|147785893|emb|CAN70836.1| hypothetical protein VITISV_015872 [Vitis vinifera]
Length = 922
Score = 612 bits (1578), Expect = e-172, Method: Compositional matrix adjust.
Identities = 286/453 (63%), Positives = 347/453 (76%), Gaps = 6/453 (1%)
Query: 17 PCVISLSVISLA-ALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYK 75
P +SL + ++ L D VDDFAS+TKT+ GHNL PTPWHLFP +TF E++R ++ K
Sbjct: 470 PMRVSLRITTVELTLGDDGDVDDFASQTKTVVGHNLNPTPWHLFPPKTFTEKTRYTRVSK 529
Query: 76 IVHCTYLTC-----LSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHI 130
I+ C+YLTC IPE S V CP FF I DL+PW +S I++ +
Sbjct: 530 IIQCSYLTCRRRSITPTTTKIPEWHTRQSSNTVGKCPMFFTRIXHDLQPWVRSGISLSSV 589
Query: 131 MEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMD 190
MEA++FAA R++IV GKLYVD +Y CVQSRAMFT+WG LQLLRRYPG VPDVD+MFDCMD
Sbjct: 590 MEAQKFAAFRVVIVGGKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMD 649
Query: 191 KPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAK 250
KP I ++EHGS PLPLFRYCT HFDIPFPDWSFWGW E+++ PW+EEF IK GSQ
Sbjct: 650 KPTISREEHGSKPLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVL 709
Query: 251 SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQC 310
+W +KL +AYWKGNPDV SP+RV+L++CN+S + GA+I+RQ+W EEAK+GFK+SKLSNQC
Sbjct: 710 NWTQKLSYAYWKGNPDVQSPVRVDLLQCNNSDIIGAQIMRQDWVEEAKNGFKESKLSNQC 769
Query: 311 NHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSI 370
NHRYKIYAEGYAWSVSLKYILSC S+ALII+ QY++FF+ GLI N++PI D+C SI
Sbjct: 770 NHRYKIYAEGYAWSVSLKYILSCGSLALIIAPQYEEFFNHGLISMTNYWPISRLDICPSI 829
Query: 371 KSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
K V WGN + SEA+ IGK+GQD MES++M RVYDYM HLITEYSKLL +KP PP SA E
Sbjct: 830 KFAVSWGNTHHSEAKAIGKSGQDLMESMSMARVYDYMYHLITEYSKLLRFKPEPPPSAHE 889
Query: 431 ACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
C ESLLC ADP QRQ LE++ PSP PPCTL
Sbjct: 890 ICEESLLCFADPTQRQCLERSTTYPSPTPPCTL 922
>gi|395146518|gb|AFN53673.1| DUF821 [Linum usitatissimum]
Length = 474
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 268/465 (57%), Positives = 344/465 (73%), Gaps = 34/465 (7%)
Query: 4 PSRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRT 63
P +P R+PSYL +++L++ + F YKVDDFA+ TKT+AGHNLEPTPWH+FP +
Sbjct: 5 PRLSPPRTPSYLLSSLLALALFTF---FFLYKVDDFATNTKTIAGHNLEPTPWHVFPAKN 61
Query: 64 FKEESRRSQAYKIVHCTYLTCLSAMNPIPE-----RRRVASPQKVQTCPDFFKSIHKDLE 118
F +E+R ++AYKI+ C+YL+C I E + P Q CP+FF IH+DLE
Sbjct: 62 FDDETRHTRAYKIIQCSYLSCPYFNRTITEPPQFVSSSSSKPAAAQQCPEFFSYIHRDLE 121
Query: 119 PWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGM 178
PWAKS IT +MEAK FAA RI+I +GKLY DPYY C QSR M TIWGFLQLL++YPGM
Sbjct: 122 PWAKSGITEDQLMEAKNFAAFRIVIYQGKLYFDPYYACFQSRMMTTIWGFLQLLKKYPGM 181
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNE 238
VPDVD+MFDCMDKP+ ++ EH + P+PLFRYCT HFDIPFPDWSFWGWSE+N++PW+E
Sbjct: 182 VPDVDLMFDCMDKPIFNRTEHQANPVPLFRYCTTREHFDIPFPDWSFWGWSEINIRPWSE 241
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
EF DIK GSQAK W ++ P A+WKGNPDV+SP+R+EL++CNDS+ +GA+I+RQ+W +EAK
Sbjct: 242 EFPDIKKGSQAKRWAKRQPHAFWKGNPDVVSPVRLELLQCNDSRKFGAQIMRQDWVQEAK 301
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNH 358
+GF+ SKLSNQCN+RYKIYAEG+A RGLIP KN+
Sbjct: 302 EGFEASKLSNQCNYRYKIYAEGFA--------------------------CRGLIPKKNY 335
Query: 359 FPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLL 418
+P+ +LC+SIKS VDWGN++P+EA+ I KAGQ++MES++MDR+YDYM HLI+EYSKL
Sbjct: 336 WPVSPFELCKSIKSAVDWGNSHPAEAQAIAKAGQNYMESISMDRIYDYMFHLISEYSKLQ 395
Query: 419 DYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
+KP PP++A C +S+LC AD KQR LEK+ SPS PPC L
Sbjct: 396 KFKPVPPTTALGVCPDSVLCFADEKQRMFLEKSTTSPSSEPPCNL 440
>gi|224059286|ref|XP_002299807.1| predicted protein [Populus trichocarpa]
gi|222847065|gb|EEE84612.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 239/331 (72%), Positives = 282/331 (85%)
Query: 133 AKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKP 192
AK +A+ R++I +G+LY+DPYY CVQSR MFTIWGFLQLL+RYPGMVPDVDIMFDCMDKP
Sbjct: 1 AKNYASFRVVIYKGRLYLDPYYACVQSRMMFTIWGFLQLLKRYPGMVPDVDIMFDCMDKP 60
Query: 193 VIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSW 252
I+K EH SFPLPLFRYCT HFDIPFPDWSFWGW EVN++PW+EEF+DIK G+QA+SW
Sbjct: 61 SINKTEHDSFPLPLFRYCTTKDHFDIPFPDWSFWGWPEVNIRPWDEEFRDIKRGAQARSW 120
Query: 253 KEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNH 312
+K P AYWKGNPDV SP R L++CN +K WGA+I+RQ+W EEAK G+ SKLS+QC++
Sbjct: 121 PKKWPRAYWKGNPDVGSPTRTSLLECNHTKKWGAQIMRQDWEEEAKGGYVSSKLSHQCDY 180
Query: 313 RYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKS 372
RYKIYAEG+AWSVSLKYI+SC S+ALIIS QY+DFFSRGLIP KN++P+ S LC+SIK
Sbjct: 181 RYKIYAEGFAWSVSLKYIISCGSLALIISPQYEDFFSRGLIPEKNYWPVSSDGLCQSIKF 240
Query: 373 VVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEAC 432
VDWGN NP+EA+KIGKAGQD MESL+MDRVYDYM HLI+EYSKL D+KP PPSSA E C
Sbjct: 241 AVDWGNTNPTEAQKIGKAGQDLMESLSMDRVYDYMFHLISEYSKLQDFKPVPPSSALEVC 300
Query: 433 VESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
V+SL C AD KQ++ E+A A PSP PPCTL
Sbjct: 301 VDSLTCFADEKQKRFFERATAFPSPSPPCTL 331
>gi|326495828|dbj|BAJ90536.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 486
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 251/467 (53%), Positives = 328/467 (70%), Gaps = 23/467 (4%)
Query: 16 FPCVISLSVISLAAL-------FLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEES 68
+P + SL V ++AA L DD S+T+TL GHNLEPTPWH FP ++
Sbjct: 19 YPPLASLVVSTIAAFSAVIVIAVLHSAYDDALSRTRTLLGHNLEPTPWHPFPH----DKG 74
Query: 69 RRSQAYKIVHCTYLTCLSAMN-PIPERRRVASPQKVQTCPDFFKSIHKDLEPWAK----S 123
R + +YL+CL ++ P P A+ + + CP +F +I +DL PW +
Sbjct: 75 RPPPRAALRCASYLSCLPPLSQPKPAAAAAANASRPRQCPSYFAAIRRDLAPWRRRDGGG 134
Query: 124 RITMRHIMEAKRF-AALRILIVRG--KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVP 180
R ++E+ RF A++R+ I G +L+VD YY CVQSRA+FT+W LQL+RRYPG VP
Sbjct: 135 GGITRALLESARFRASMRVTITGGGTRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVP 194
Query: 181 DVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPW 236
DVD+MFDCMD+P I++ EH P PLFRYCT H DIPFPDWSFWGW E +++PW
Sbjct: 195 DVDLMFDCMDRPAINRTEHSGDGAPPPPPLFRYCTTRDHLDIPFPDWSFWGWPETHIEPW 254
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
+ EF+ IK GS+ W +++P AYWKGNPDV SPLR+ L+ CND+ LW AEI+RQNW EE
Sbjct: 255 SREFRSIKQGSRRVKWPDRVPTAYWKGNPDVASPLRLALLACNDTNLWRAEIMRQNWEEE 314
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
AK G++ SKLS+QC HRYKIYAEG+AWSVSLKYILSC S+AL+I Y+DFFSRGL P
Sbjct: 315 AKSGYQNSKLSSQCTHRYKIYAEGFAWSVSLKYILSCGSMALLIDPLYQDFFSRGLEPRV 374
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSK 416
NH+P+ +A +C SI+ V+WGNA+P EAE++GK GQ M+ L MD VYDYMLHL+TEY+
Sbjct: 375 NHWPVTAAGMCESIRDAVEWGNAHPEEAERVGKRGQRLMQELGMDTVYDYMLHLLTEYAG 434
Query: 417 LLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
LLD++PAPP +A EAC S+LCLAD +QR+ LE +AASP+ PC++
Sbjct: 435 LLDFRPAPPRAAQEACAGSVLCLADDRQRRFLEASAASPATAEPCSM 481
>gi|194707538|gb|ACF87853.1| unknown [Zea mays]
Length = 480
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 247/461 (53%), Positives = 319/461 (69%), Gaps = 25/461 (5%)
Query: 16 FPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYK 75
F VI L+V+ A DD S+T+TL GHNLEPTPWH FP + R A
Sbjct: 29 FSAVIVLAVVHSA-------YDDAMSRTRTLLGHNLEPTPWHPFPH-----DKGRPPARA 76
Query: 76 IVHCT-YLTCLSAMNPIPERRRVAS--PQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIM 131
+ C + CL ++ + +K + CP +F +IH+DL PW + R R ++
Sbjct: 77 ALRCAPSIACLPPLSSSSSSPQPQGNKKKKQKQCPAYFAAIHRDLAPWRGQGRGVTRALL 136
Query: 132 -EAKRFAALRILIVRG--KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDC 188
EA+R A++R+ I G +L+VD YY CVQSRA+FT+W LQL+RRYPG VPDVD+MFDC
Sbjct: 137 DEARRRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDC 196
Query: 189 MDKPVIDKKEHGSF---PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH 245
MD+P I++ EHG P PLFRYCT HFDIPFPDWSFWGW E N++PWN EFK+I+
Sbjct: 197 MDRPAINRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRA 256
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSK 305
G++A W +++P AYWKGNPDV SPLR L+ CND+ LW AEI+RQNW +EA+ G++ S+
Sbjct: 257 GARATRWADRVPTAYWKGNPDVASPLREALLGCNDTALWRAEIMRQNWDDEARSGYQHSR 316
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
LS QC HRYKIYAEG+AWSVSLKYILSC S+AL+I +Y+DFFSRGL P N++P+ +
Sbjct: 317 LSAQCTHRYKIYAEGFAWSVSLKYILSCGSMALLIEPRYQDFFSRGLEPRVNYWPVAALG 376
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLL---DYKP 422
+C SI+ VDWGNANP EAE++G+ GQ ++ L M VYDYMLHL+TEY++LL D+
Sbjct: 377 MCESIRDAVDWGNANPDEAERVGRRGQRLVQDLRMHAVYDYMLHLLTEYARLLIDQDHGL 436
Query: 423 APPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
+PP A EAC SLLCLAD KQR+ L + A P+P PC L
Sbjct: 437 SPPPHAQEACEASLLCLADDKQRRFLHASKADPAPGDPCVL 477
>gi|413917716|gb|AFW57648.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 500
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 247/461 (53%), Positives = 319/461 (69%), Gaps = 25/461 (5%)
Query: 16 FPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYK 75
F VI L+V+ A DD S+T+TL GHNLEPTPWH FP + R A
Sbjct: 49 FSAVIVLAVVHSA-------YDDAMSRTRTLLGHNLEPTPWHPFPH-----DKGRPPARA 96
Query: 76 IVHCT-YLTCLSAMNPIPERRRVAS--PQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIM 131
+ C + CL ++ + +K + CP +F +IH+DL PW + R R ++
Sbjct: 97 ALRCAPSIACLPPLSSSSSSPQPQGNKKKKQKQCPAYFAAIHRDLAPWRGQGRGVTRALL 156
Query: 132 -EAKRFAALRILIVRG--KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDC 188
EA+R A++R+ I G +L+VD YY CVQSRA+FT+W LQL+RRYPG VPDVD+MFDC
Sbjct: 157 DEARRRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDC 216
Query: 189 MDKPVIDKKEHGSF---PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH 245
MD+P I++ EHG P PLFRYCT HFDIPFPDWSFWGW E N++PWN EFK+I+
Sbjct: 217 MDRPAINRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRA 276
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSK 305
G++A W +++P AYWKGNPDV SPLR L+ CND+ LW AEI+RQNW +EA+ G++ S+
Sbjct: 277 GARATRWADRVPTAYWKGNPDVASPLREALLGCNDTALWRAEIMRQNWDDEARSGYQHSR 336
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
LS QC HRYKIYAEG+AWSVSLKYILSC S+AL+I +Y+DFFSRGL P N++P+ +
Sbjct: 337 LSAQCTHRYKIYAEGFAWSVSLKYILSCGSMALLIEPRYQDFFSRGLEPRVNYWPVAALG 396
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLL---DYKP 422
+C SI+ VDWGNANP EAE++G+ GQ ++ L M VYDYMLHL+TEY++LL D+
Sbjct: 397 MCESIRDAVDWGNANPDEAERVGRRGQRLVQDLRMHAVYDYMLHLLTEYARLLIDQDHGL 456
Query: 423 APPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
+PP A EAC SLLCLAD KQR+ L + A P+P PC L
Sbjct: 457 SPPPHAQEACEASLLCLADDKQRRFLHASKADPAPGDPCVL 497
>gi|226497664|ref|NP_001151574.1| lipopolysaccharide-modifying protein [Zea mays]
gi|195647860|gb|ACG43398.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 480
Score = 500 bits (1288), Expect = e-139, Method: Compositional matrix adjust.
Identities = 243/458 (53%), Positives = 316/458 (68%), Gaps = 19/458 (4%)
Query: 16 FPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYK 75
F VI L+V+ A DD S+T+TL GHNLEPTPWH FP + +R A +
Sbjct: 29 FSAVIVLAVVHSA-------YDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPAR--AALR 79
Query: 76 IVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIM-EA 133
+ + + +K + CP +F +IH+DL PW + R R ++ EA
Sbjct: 80 CAPSIACLPPLSPSSSSSPPQGNKKKKQKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEA 139
Query: 134 KRFAALRILIVRG--KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDK 191
+R A++R+ I G +L+VD YY CVQSRA+FT+W LQL+RRYPG VPDVD+MFDCMD+
Sbjct: 140 RRRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDR 199
Query: 192 PVIDKKEHGSF---PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQ 248
P I++ EHG P PLFRYCT HFDIPFPDWSFWGW + N++PWN EFK+I+ G++
Sbjct: 200 PAINRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPDTNIEPWNHEFKNIRAGAR 259
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSN 308
A W +++P AYWKGNPDV SPLR L+ CND+ LW AEI+RQNW +EA+ G++ S+LS
Sbjct: 260 ATRWADRVPTAYWKGNPDVASPLREALLGCNDTALWRAEIMRQNWDDEARSGYQHSRLSA 319
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
QC HRYKIYAEG+AWSVSLKYILSC S+AL+I +Y+DFFSRGL P N++P+ + +C
Sbjct: 320 QCTHRYKIYAEGFAWSVSLKYILSCGSMALLIEPRYQDFFSRGLEPRVNYWPVAALGMCE 379
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLL---DYKPAPP 425
SI+ VDWGNANP EAE++G+ GQ ++ L M VYDYMLHL+TEY++LL D+ +PP
Sbjct: 380 SIRDAVDWGNANPDEAERVGRRGQRLVQDLRMHAVYDYMLHLLTEYARLLIDQDHGLSPP 439
Query: 426 SSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
A EAC SLLCLAD KQR+ L + A P+P PC L
Sbjct: 440 PHAQEACEASLLCLADDKQRRFLHASKADPAPGDPCVL 477
>gi|194702520|gb|ACF85344.1| unknown [Zea mays]
gi|238009730|gb|ACR35900.1| unknown [Zea mays]
gi|238011028|gb|ACR36549.1| unknown [Zea mays]
Length = 435
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 236/433 (54%), Positives = 307/433 (70%), Gaps = 12/433 (2%)
Query: 41 SKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASP 100
S+T+TL GHNLEPTPWH FP + +R A + + + + +
Sbjct: 2 SRTRTLLGHNLEPTPWHPFPHDKGRPPAR--AALRCAPSIACLPPLSSSSSSPQPQGNKK 59
Query: 101 QKVQTCPDFFKSIHKDLEPW-AKSRITMRHIM-EAKRFAALRILIVRG--KLYVDPYYDC 156
+K + CP +F +IH+DL PW + R R ++ EA+R A++R+ I G +L+VD YY C
Sbjct: 60 KKQKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEARRRASMRVTITGGGRRLHVDLYYAC 119
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF---PLPLFRYCTND 213
VQSRA+FT+W LQL+RRYPG VPDVD+MFDCMD+P I++ EHG P PLFRYCT
Sbjct: 120 VQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRPAINRTEHGDGALPPPPLFRYCTTR 179
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
HFDIPFPDWSFWGW E N++PWN EFK+I+ G++A W +++P AYWKGNPDV SPLR
Sbjct: 180 DHFDIPFPDWSFWGWPETNIEPWNHEFKNIRAGARATRWADRVPTAYWKGNPDVASPLRE 239
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
L+ CND+ LW AEI+RQNW +EA+ G++ S+LS QC HRYKIYAEG+AWSVSLKYILSC
Sbjct: 240 ALLGCNDTALWRAEIMRQNWDDEARSGYQHSRLSAQCTHRYKIYAEGFAWSVSLKYILSC 299
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
S+AL+I +Y+DFFSRGL P N++P+ + +C SI+ VDWGNANP EAE++G+ GQ
Sbjct: 300 GSMALLIEPRYQDFFSRGLEPRVNYWPVAALGMCESIRDAVDWGNANPDEAERVGRRGQR 359
Query: 394 FMESLTMDRVYDYMLHLITEYSKLL---DYKPAPPSSAFEACVESLLCLADPKQRQNLEK 450
++ L M VYDYMLHL+TEY++LL D+ +PP A EAC SLLCLAD KQR+ L
Sbjct: 360 LVQDLRMHAVYDYMLHLLTEYARLLIDQDHGLSPPPHAQEACEASLLCLADDKQRRFLHA 419
Query: 451 AAASPSPYPPCTL 463
+ A P+P PC L
Sbjct: 420 SKADPAPGDPCVL 432
>gi|357129812|ref|XP_003566555.1| PREDICTED: O-glucosyltransferase rumi-like [Brachypodium
distachyon]
Length = 493
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 239/473 (50%), Positives = 318/473 (67%), Gaps = 23/473 (4%)
Query: 9 ARSPSYLFPCVISLSVISLAAL-------FLDYKVDDFASKTKTLAGHNLEPTPWHLFPQ 61
AR+ + +P + SL V ++AA L D+ S+T+TL GHNLEPTPWH FP
Sbjct: 17 ARAGAGRYPPLASLVVSTIAAFSAVIVIAVLHSAYDEAVSRTRTLLGHNLEPTPWHPFPH 76
Query: 62 RTFKEESRRSQAYKIVHCT-YLTCLSAMNPIPERRRVAS-PQKVQTCPDFFKSIHKDLEP 119
+ R + + C L+CL ++ AS ++ + CP +F +IH+DL P
Sbjct: 77 DKGRPPPRAA-----LRCAPLLSCLPPLSHPHPSPPNASRSRRTKQCPAYFAAIHRDLAP 131
Query: 120 WAK---SRITMRHIMEAKRFAALRILIVRG--KLYVDPYYDCVQSRAMFTIWGFLQLLRR 174
W + IT + A+ A++R+ I +L+VD YY CVQSRA+FT+W LQL+RR
Sbjct: 132 WRRHGHGGITRELLDSARSRASMRVTITGNGRRLHVDLYYACVQSRALFTVWSLLQLMRR 191
Query: 175 YPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPL---FRYCTNDAHFDIPFPDWSFWGWSEV 231
YPG VPDVD+MFDCMD+P I++ G P FRYCT H DIPFPDWSFWGW E
Sbjct: 192 YPGRVPDVDLMFDCMDRPAINRTTGGPNPPLPPPLFRYCTTKDHLDIPFPDWSFWGWPET 251
Query: 232 NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQ 291
++ PW +EF+ IK GS+ W +++P A+WKGNPDV SPLR+ L+ CND+ LW A+I+RQ
Sbjct: 252 HINPWAKEFRAIKQGSRRVKWGDRVPLAFWKGNPDVASPLRLALLACNDTNLWHAQIMRQ 311
Query: 292 NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG 351
NW EEAK G++ S LS QC HRYK+YAEG+AWSVSLKYIL+C S+AL+I +Y+DFFSRG
Sbjct: 312 NWEEEAKSGYRHSALSTQCAHRYKVYAEGFAWSVSLKYILACGSMALVIDPRYEDFFSRG 371
Query: 352 LIPTKNHFPIPS-ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHL 410
L NH+P+ + +C SI+ V+WGNA+P EAE +G+ GQ M+ L MD VYDYMLHL
Sbjct: 372 LEAKVNHWPVRADVGMCESIRDAVEWGNAHPEEAELVGRRGQRLMQELGMDAVYDYMLHL 431
Query: 411 ITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYPPCTL 463
+TEY+KLLD+ P+PP +A EACV S+LCLAD QR+ LE + A P+ PC+L
Sbjct: 432 LTEYAKLLDFVPSPPDTAQEACVGSVLCLADEGQRRFLEMSKAEPATGEPCSL 484
>gi|242086909|ref|XP_002439287.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
gi|241944572|gb|EES17717.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
Length = 497
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 244/469 (52%), Positives = 317/469 (67%), Gaps = 33/469 (7%)
Query: 21 SLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYKIVHCT 80
S +VI LA + Y DD S+T+TL GHNLEPTPWH FP + +R + + C
Sbjct: 31 SSAVIVLAVVHSAY--DDALSRTRTLLGHNLEPTPWHPFPHSKGRPPARAA-----LRCA 83
Query: 81 -YLTCLSAMNPIP------ERRRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIME 132
+ CL ++ P A+ ++ + CP +F +IH+DL PW R R +++
Sbjct: 84 PSIACLPPLSHPPGTANASSSAAAAAGEQRRQCPAYFAAIHRDLAPWRGPGRGVTRALLD 143
Query: 133 A-KRFAALRILIVRG--KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
A +R A++R+ I G +L+VD YY CVQSRA+FT+W LQL+RRYPG VPDVD+MFDCM
Sbjct: 144 AARRRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCM 203
Query: 190 DKPVIDKKEHGSFPLP------LFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI 243
D+P I++ +HG LFRYCT HFDIPFPDWSFWGW E N++PWN EFK I
Sbjct: 204 DRPAINRTDHGDGDPSSPPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIKPWNREFKSI 263
Query: 244 KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK 303
K G++A W +++P AYWKGNPDV SPLRV L+ CND+ W AEI+RQNW +EAK G+
Sbjct: 264 KMGARATRWADRVPTAYWKGNPDVASPLRVALLGCNDTAAWRAEIMRQNWDDEAKSGYTH 323
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
SKLS+QC HRYKIYAEG+AWSVSLKYILSC S+AL+I +Y+DFFSRGL P N++P+
Sbjct: 324 SKLSSQCTHRYKIYAEGFAWSVSLKYILSCGSMALLIEPRYEDFFSRGLEPRVNYWPVTE 383
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPA 423
+C SI+ VDWGNANP EAE +G+ GQ ++ L M VYDYMLHL+TEY++L+D++PA
Sbjct: 384 TGMCESIRDAVDWGNANPGEAELVGRRGQRLVQELRMHAVYDYMLHLLTEYARLMDFRPA 443
Query: 424 PPSSAF--------EACVESLLCLADPKQRQNLEKAAASPSPY-PPCTL 463
P + EAC S+LCLAD KQR+ LE + A P+ PC L
Sbjct: 444 APPPSSSSSHDAPQEACEASVLCLADDKQRRFLEASRAEPAVSDEPCVL 492
>gi|449485616|ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 538
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 190/372 (51%), Positives = 254/372 (68%), Gaps = 9/372 (2%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQS 159
P CPD+F+ IH+DL PWA++ IT + +R A R+LI+ GK YV+ Y Q+
Sbjct: 130 PSSSSACPDYFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQT 189
Query: 160 RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE----HGSFPLPLFRYCTNDAH 215
R FT+WG LQLLRRYPG VPD+D+MFDC+D PVI +G P PLFRYC +DA
Sbjct: 190 RDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDAT 249
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
FDI FPDWSFWGW E+N++PW KDIK G++ WK + P+AYWKGNP+V + R +L
Sbjct: 250 FDIVFPDWSFWGWPEINIKPWEPLLKDIKEGNKRIPWKSRQPYAYWKGNPEV-ADTRKDL 308
Query: 276 MKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+KCN D + W A + Q+W +E+++G+K+S LSNQC HRYKIY EG AWSVS KYIL+C
Sbjct: 309 IKCNVSDQQDWNARVFAQDWTKESQEGYKQSNLSNQCLHRYKIYIEGSAWSVSEKYILAC 368
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+SV LI+ Y DFF+RGL+P +++P+ D C+SIK VDWGN++ +A+ IGKA
Sbjct: 369 DSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASS 428
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKA 451
F+ E L MD VYDYM HL++EYSKLL +KP P +A E C E++ C A+ ++ + E
Sbjct: 429 FIQEELKMDYVYDYMFHLLSEYSKLLTFKPTLPPNAIELCSEAMACPAEGLTKKFMTESL 488
Query: 452 AASPSPYPPCTL 463
P+ PCT+
Sbjct: 489 VKRPAESNPCTM 500
>gi|449446159|ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 538
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 190/372 (51%), Positives = 254/372 (68%), Gaps = 9/372 (2%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQS 159
P CPD+F+ IH+DL PWA++ IT + +R A R+LI+ GK YV+ Y Q+
Sbjct: 130 PSSSSACPDYFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQT 189
Query: 160 RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE----HGSFPLPLFRYCTNDAH 215
R FT+WG LQLLRRYPG VPD+D+MFDC+D PVI +G P PLFRYC +DA
Sbjct: 190 RDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDAT 249
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
FDI FPDWSFWGW E+N++PW KDIK G++ WK + P+AYWKGNP+V + R +L
Sbjct: 250 FDIVFPDWSFWGWPEINIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEV-ADTRKDL 308
Query: 276 MKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+KCN D + W A + Q+W +E+++G+K+S LSNQC HRYKIY EG AWSVS KYIL+C
Sbjct: 309 IKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILAC 368
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+SV LI+ Y DFF+RGL+P +++P+ D C+SIK VDWGN++ +A+ IGKA
Sbjct: 369 DSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASS 428
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKA 451
F+ E L MD VYDYM HL++EYSKLL +KP P +A E C E++ C A+ ++ + E
Sbjct: 429 FIQEELKMDYVYDYMFHLLSEYSKLLTFKPTLPPNAIELCSEAMACPAEGLTKKFMTESL 488
Query: 452 AASPSPYPPCTL 463
P+ PCT+
Sbjct: 489 VKRPAESNPCTM 500
>gi|15237842|ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana]
gi|10176852|dbj|BAB10058.1| unnamed protein product [Arabidopsis thaliana]
gi|48310551|gb|AAT41837.1| At5g23850 [Arabidopsis thaliana]
gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis thaliana]
gi|332005839|gb|AED93222.1| uncharacterized protein [Arabidopsis thaliana]
Length = 542
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 183/367 (49%), Positives = 256/367 (69%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCPD+F+ IH+DL PW+++ IT + AK+ A R+ IV GK+YV+ + D Q+R +FT
Sbjct: 138 TCPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFT 197
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTNDAHFDIPF 220
IWGFLQLLR+YPG +PD+++MFDC+D PV+ E P PLFRYC N+ DI F
Sbjct: 198 IWGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNEETLDIVF 257
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW+EVN++PW K+++ G++ W + P+AYWKGNP +++ R +LMKCN
Sbjct: 258 PDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNP-MVAETRQDLMKCNV 316
Query: 281 SKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S+ W A + Q+W +E+K+G+K+S L++QC+HRYKIY EG AWSVS KYIL+C+SV L
Sbjct: 317 SEEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTL 376
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMES- 397
++ Y DFF+RGL+P +++P+ D CRSIK VDWGN++ +A+ IGKA DF++
Sbjct: 377 LVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHIQKAQDIGKAASDFIQQD 436
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPS 456
L MD VYDYM HL+TEYSKLL +KP P +A E C E++ CL +R+ + E P+
Sbjct: 437 LKMDYVYDYMYHLLTEYSKLLQFKPEIPRNAVEICSETMACLRSGNERKFMTESLVKQPA 496
Query: 457 PYPPCTL 463
PC +
Sbjct: 497 DSGPCAM 503
>gi|255539445|ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549902|gb|EEF51389.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 506
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 181/367 (49%), Positives = 252/367 (68%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCP++F+ IH+DL PW ++ IT + AK A R++I+ G Y++ Y Q+R +FT
Sbjct: 103 TCPEYFRWIHEDLRPWVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQTRDVFT 162
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTNDAHFDIPF 220
+WG LQLLR+YPG VPD+++MFDC+D PV+ ++ P PLFRYC ND DI F
Sbjct: 163 LWGILQLLRKYPGRVPDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDETLDIVF 222
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWS+WGW E N++PW + KD+K G+Q WKE+ P+AYWKGNP+V + R++LMKCN
Sbjct: 223 PDWSYWGWVETNIKPWEKIVKDLKEGNQRSKWKEREPYAYWKGNPNV-AETRLDLMKCNV 281
Query: 281 SKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S+ W A + Q+W E++ G+K+S L+NQCNHRYKIY EG AWSVS KYIL+C+SV L
Sbjct: 282 SQEHDWNARLYTQDWVRESQQGYKQSDLANQCNHRYKIYIEGSAWSVSEKYILACDSVTL 341
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
I+ Y DFF+RGL+P +++PI D C+SIK VDWGN++ +A+ IGKA DF+ E
Sbjct: 342 IVKPHYYDFFTRGLMPNHHYWPIKEDDKCKSIKFAVDWGNSHKQKAQAIGKAASDFIQED 401
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQRQNLEKAAASPS 456
L MD VYDYM HL+ EY++LL +KP P +A + C E++ C AD ++ ++ P+
Sbjct: 402 LKMDYVYDYMFHLLNEYARLLTFKPTIPQNATKLCAETMACPADGLAKKLMMDSMVEGPA 461
Query: 457 PYPPCTL 463
PCT+
Sbjct: 462 DTSPCTM 468
>gi|297808383|ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
lyrata]
gi|297317912|gb|EFH48334.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
lyrata]
Length = 543
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 181/367 (49%), Positives = 254/367 (69%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCPD+F+ IH+DL PW+ + IT + AK+ A R+ I+ GK+YV+ + D Q+R +FT
Sbjct: 139 TCPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLAIIDGKIYVEKFQDAFQTRDVFT 198
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE----HGSFPLPLFRYCTNDAHFDIPF 220
IWGFLQLLR+YPG +PD+++MFDC+D PV+ E + P PLFRYC N+ DI F
Sbjct: 199 IWGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGANAPSPPPLFRYCGNEETLDIVF 258
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW+EVN++PW K+++ G+Q W + P+AYWKGNP +++ R +LMKCN
Sbjct: 259 PDWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPYAYWKGNP-MVAETRQDLMKCNV 317
Query: 281 SKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S+ W A + Q+W +E+ +G+K+S L++QC+HRYKIY EG AWSVS KYIL+C+SV L
Sbjct: 318 SEEHEWNARLYVQDWIKESNEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTL 377
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME-S 397
++ Y DFF+RGL+P +++P+ D CRSIK VDWGN++ +A+ IGKA DF++
Sbjct: 378 LVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHIQKAQDIGKAASDFIQHE 437
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPS 456
L MD VYDYM HL+TEYSKLL +KP P +A E C E++ C +R+ + E P+
Sbjct: 438 LKMDYVYDYMYHLLTEYSKLLRFKPEIPQNAAEICSETMACPRSGNERKFMTESFVKHPA 497
Query: 457 PYPPCTL 463
PC +
Sbjct: 498 ESGPCAM 504
>gi|356529208|ref|XP_003533188.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 469
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/389 (47%), Positives = 266/389 (68%), Gaps = 11/389 (2%)
Query: 83 TCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL 142
TC + +PIPE + S TCP++F+ IH+DL PWA++ IT + AK+ A +++
Sbjct: 46 TCPTNQSPIPENDQ--SRPSSATCPEYFRWIHEDLRPWARTGITQEMVERAKQTANFKLV 103
Query: 143 IVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPV--IDKKE-- 198
I++GK Y++ Y Q+R +F+IWG LQLLRRYPG +PD+++MFDC+D PV +D+
Sbjct: 104 ILKGKAYLETYEKAYQTRDVFSIWGILQLLRRYPGKIPDLELMFDCVDWPVLLVDRYNGP 163
Query: 199 HGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPF 258
+ P PLFRYC NDA DI FPDWSFWGW+EVN++PW ++K G++ W + P+
Sbjct: 164 NTEQPPPLFRYCGNDATLDIVFPDWSFWGWAEVNIKPWEILLGELKEGTKRIPWLNREPY 223
Query: 259 AYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKI 316
AYWKGNP V++ R++LMKCN +++ W A + Q+W E+++G+KKS L++QC HRYK+
Sbjct: 224 AYWKGNP-VVAETRLDLMKCNVSENQDWNARLYAQDWGRESQEGYKKSDLASQCTHRYKV 282
Query: 317 YAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDW 376
Y EG AWSVS KYIL+C+S L++ Y DFF+RGLIP +++PI D CRSIK VDW
Sbjct: 283 YIEGSAWSVSEKYILACDSPTLLVKPHYYDFFTRGLIPGHHYWPIKEDDKCRSIKFAVDW 342
Query: 377 GNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVES 435
GN++ A +IGKA DF+ E + MD VYDYM HL+ Y+KL YKP+ ++A E C ES
Sbjct: 343 GNSHKQRAHQIGKAASDFIQEEVKMDYVYDYMFHLLNSYAKLFRYKPSLSANATEICAES 402
Query: 436 LLCLAD-PKQRQNLEKAAASPSPYPPCTL 463
++C A+ P ++ +E P+ PC++
Sbjct: 403 MVCGAEGPVKKFMMESLVKVPANTDPCSM 431
>gi|15229061|ref|NP_190467.1| uncharacterized protein [Arabidopsis thaliana]
gi|6522568|emb|CAB62012.1| putative protein [Arabidopsis thaliana]
gi|332644958|gb|AEE78479.1| uncharacterized protein [Arabidopsis thaliana]
Length = 539
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/367 (47%), Positives = 252/367 (68%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCPD+F+ IH+DL PW K+ IT + A A R+ I+ G++YV+ + + Q+R +FT
Sbjct: 135 TCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFT 194
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTNDAHFDIPF 220
IWGF+QLLRRYPG +PD+++MFDC+D PV+ E P PLFRYC ND DI F
Sbjct: 195 IWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVF 254
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWS+WGW+EVN++PW K+++ G+Q W ++ P+AYWKGNP V + R++LMKCN
Sbjct: 255 PDWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTV-AETRLDLMKCNL 313
Query: 281 SKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S++ W A + +Q+W +E+K+G+K+S L++QC+HRYKIY EG AWSVS KYIL+C+SV L
Sbjct: 314 SEVYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTL 373
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
++ Y DFF+RG+ P +++P+ D CRSIK VDWGN + +A+ IGK +F+ +
Sbjct: 374 MVKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQE 433
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPS 456
L MD VYDYM HL+ +YSKLL +KP P ++ E C E++ C D +R+ +E P+
Sbjct: 434 LKMDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMACPRDGNERKFMMESLVKRPA 493
Query: 457 PYPPCTL 463
PC +
Sbjct: 494 ETGPCAM 500
>gi|297816106|ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
lyrata]
gi|297321774|gb|EFH52195.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
lyrata]
Length = 539
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/394 (45%), Positives = 259/394 (65%), Gaps = 13/394 (3%)
Query: 82 LTCLSAMNPIPERRRVASPQKVQ----TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFA 137
+TC P R V + + TCPD+F+ IH+DL PW K+ IT + A A
Sbjct: 108 VTCPKNNYPTSFRSSVGEGESDRSLSATCPDYFRWIHEDLRPWEKTGITREALERANATA 167
Query: 138 ALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK 197
R+ I+ G++YV+ + + Q+R +FTIWGF+QLLRRYPG +PD+++MFDC+D PV+
Sbjct: 168 NFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAA 227
Query: 198 EHGSF----PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWK 253
E P PLFRYC ND DI FPDWS+WGW+EVN++PW K+++ G+Q W
Sbjct: 228 EFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKPWESLLKELREGNQRTKWI 287
Query: 254 EKLPFAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCN 311
++ P+AYWKGNP V + R++LMKCN S+ W A + +Q+W +E+K+G+K+S L++QC+
Sbjct: 288 DREPYAYWKGNPTV-AETRLDLMKCNLSEEYDWKARLYKQDWVKESKEGYKQSDLASQCH 346
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIK 371
HRYKIY EG AWSVS KYIL+C+SV L++ Y DFF+RG+ P +++P+ D CRSIK
Sbjct: 347 HRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIK 406
Query: 372 SVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
VDWGN + +A+ IGK +F+ + L MD VYDYM HL+ +YSKLL +KP P ++ E
Sbjct: 407 FAVDWGNLHMRKAQDIGKKASEFVQQELKMDYVYDYMFHLLIQYSKLLRFKPEIPQNSTE 466
Query: 431 ACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
C E++ C D +R+ +E P+ PC +
Sbjct: 467 LCSEAMACPRDGNERKFMMESLVKHPAETGPCAM 500
>gi|110743626|dbj|BAE99650.1| hypothetical protein [Arabidopsis thaliana]
Length = 433
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/367 (47%), Positives = 252/367 (68%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCPD+F+ IH+DL PW K+ IT + A A R+ I+ G++YV+ + + Q+R +FT
Sbjct: 29 TCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFT 88
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTNDAHFDIPF 220
IWGF+QLLRRYPG +PD+++MFDC+D PV+ E P PLFRYC ND DI F
Sbjct: 89 IWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVF 148
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWS+WGW+EVN++PW K+++ G+Q W ++ P+AYWKGNP V + R++LMKCN
Sbjct: 149 PDWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTV-AETRLDLMKCNL 207
Query: 281 SKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S++ W A + +Q+W +E+K+G+K+S L++QC+HRYKIY EG AWSVS KYIL+C+SV L
Sbjct: 208 SEVYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTL 267
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
++ Y DFF+RG+ P +++P+ D CRSIK VDWGN + +A+ IGK +F+ +
Sbjct: 268 MVKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQE 327
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPS 456
L MD VYDYM HL+ +YSKLL +KP P ++ E C E++ C D +R+ +E P+
Sbjct: 328 LKMDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMACPRDGNERKFMMESLVKRPA 387
Query: 457 PYPPCTL 463
PC +
Sbjct: 388 ETGPCAM 394
>gi|225470185|ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
gi|302143884|emb|CBI22745.3| unnamed protein product [Vitis vinifera]
Length = 525
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 183/366 (50%), Positives = 247/366 (67%), Gaps = 9/366 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F+ I+ DL PW KS IT + AKR A +++I+ G+ YV+ Y Q+R +FT+
Sbjct: 125 CPHYFRWIYGDLRPWMKSGITREMVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTL 184
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPFP 221
WG LQLLRRYPG VPD+++MFDC+D PVI E+ + P PLFRYC +DA DI FP
Sbjct: 185 WGILQLLRRYPGKVPDLELMFDCVDWPVIQSNEYRGPNATAPPPLFRYCGDDATLDIVFP 244
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW E+N++PW KD+K G++ W E+ P+AYWKGNP V + R++L+KCN
Sbjct: 245 DWSFWGWPEINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAA-TRLDLLKCNVS 303
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
D + W A + Q+W E+++G+K+S L++QC HRYKIY EG AWSVS KYIL+C+SV L+
Sbjct: 304 DKQDWNARVYTQDWILESQEGYKQSDLASQCIHRYKIYIEGSAWSVSQKYILACDSVTLL 363
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+ Y DFF+R L+P +++PI D CRSIK VDWGN + +A+ IGKA DF+ E L
Sbjct: 364 VKPHYYDFFTRSLMPVHHYWPIREDDKCRSIKFAVDWGNRHKQKAQSIGKAASDFIQEDL 423
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQRQNLEKAAASPSP 457
MD VYDYM HL+ EY+KLL +KP P A E C E + C A+ K++ +E P
Sbjct: 424 KMDNVYDYMFHLLNEYAKLLKFKPTVPEKAVELCSERMGCGAEGLKKKFMMESMVKYPMD 483
Query: 458 YPPCTL 463
PCT+
Sbjct: 484 ASPCTM 489
>gi|359474017|ref|XP_002269577.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
Length = 585
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 179/367 (48%), Positives = 248/367 (67%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
CPD+F+ IH+DL+PW + I+ + AKR A R++IV+GK+Y++ Y +Q+R +FT
Sbjct: 182 VCPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVKGKVYIEKYKKSIQTRDVFT 241
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPF 220
IWG LQLLRRYPG + D+++ FDC D+PVI +H + P PLFRYC + D+ F
Sbjct: 242 IWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNSTSPPPLFRYCGDRWTLDVVF 301
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PDWSFWGW E+N++PW KD+K G+ W E+ P+AYWKGNP +++ R +L+ CN
Sbjct: 302 PDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYWKGNP-LVAETRRDLLTCNV 360
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
D + W A + Q+W E++ G+K+S +SNQC HRYKIY EG+AWSVS KYIL+C+SV L
Sbjct: 361 SDVQDWNARLFVQDWMLESQQGYKQSDVSNQCTHRYKIYIEGWAWSVSEKYILACDSVTL 420
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
++ +Y DFF R L P +++PI D CRSIK VDWGN++ +A+ IGKA DF+ E
Sbjct: 421 MVKPRYYDFFMRSLQPVHHYWPIKDNDKCRSIKFAVDWGNSHKQKAQAIGKAASDFIQEE 480
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPS 456
L MD VYDYM HL+ EY+KLL +KP P A E C E++ C A+ +++ +E SPS
Sbjct: 481 LKMDYVYDYMFHLLNEYAKLLRFKPTIPEGAVEVCSETVACSAEGVEKKFMMESLVNSPS 540
Query: 457 PYPPCTL 463
PC L
Sbjct: 541 VTSPCAL 547
>gi|297742541|emb|CBI34690.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 179/367 (48%), Positives = 248/367 (67%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
CPD+F+ IH+DL+PW + I+ + AKR A R++IV+GK+Y++ Y +Q+R +FT
Sbjct: 94 VCPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVKGKVYIEKYKKSIQTRDVFT 153
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPF 220
IWG LQLLRRYPG + D+++ FDC D+PVI +H + P PLFRYC + D+ F
Sbjct: 154 IWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNSTSPPPLFRYCGDRWTLDVVF 213
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PDWSFWGW E+N++PW KD+K G+ W E+ P+AYWKGNP +++ R +L+ CN
Sbjct: 214 PDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYWKGNP-LVAETRRDLLTCNV 272
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
D + W A + Q+W E++ G+K+S +SNQC HRYKIY EG+AWSVS KYIL+C+SV L
Sbjct: 273 SDVQDWNARLFVQDWMLESQQGYKQSDVSNQCTHRYKIYIEGWAWSVSEKYILACDSVTL 332
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
++ +Y DFF R L P +++PI D CRSIK VDWGN++ +A+ IGKA DF+ E
Sbjct: 333 MVKPRYYDFFMRSLQPVHHYWPIKDNDKCRSIKFAVDWGNSHKQKAQAIGKAASDFIQEE 392
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPS 456
L MD VYDYM HL+ EY+KLL +KP P A E C E++ C A+ +++ +E SPS
Sbjct: 393 LKMDYVYDYMFHLLNEYAKLLRFKPTIPEGAVEVCSETVACSAEGVEKKFMMESLVNSPS 452
Query: 457 PYPPCTL 463
PC L
Sbjct: 453 VTSPCAL 459
>gi|302770761|ref|XP_002968799.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
gi|300163304|gb|EFJ29915.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
Length = 423
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/365 (50%), Positives = 245/365 (67%), Gaps = 6/365 (1%)
Query: 105 TCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+CPDFF+ IH DL PW A I+ + EA+ FAA R+ I+ G+LY + YY CVQSRAMF
Sbjct: 1 SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60
Query: 164 TIWGFLQLLRRYP-GMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAHFDIPFP 221
T+WG L LL R+P G VPDV+ MF+CMD+P + + S P PL YC + DI FP
Sbjct: 61 TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRYKSRAPPPLLAYCGSRDTVDIAFP 120
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW+EV + W EE I HGS+ W+ + P A+WKGNP V + +R +L+ CN
Sbjct: 121 DWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLACNKT 180
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
D + +GA+I QNW E++ GFK SKLS+QC HRYK+YAEG+AWSVS KYI++C S LI
Sbjct: 181 DKRDFGADIYVQNWIAESQQGFKNSKLSDQCRHRYKLYAEGHAWSVSFKYIMACGSTTLI 240
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+ Y DFF RGL+P +++PI D+C SI V WGN++P EAE IG Q+F+ + L
Sbjct: 241 VQPDYHDFFMRGLLPLHHYWPIDRQDMCSSIDHAVKWGNSHPKEAEAIGSHAQEFLRKDL 300
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPY 458
+MDRVYDYMLHL+ EY+KL +KP P A C ++ C+A+ +Q + L++ S S
Sbjct: 301 SMDRVYDYMLHLLREYAKLQRFKPRVPEGAQLLCKSAVTCIAEAEQLEFLKRTETSNSQT 360
Query: 459 PPCTL 463
PC++
Sbjct: 361 SPCSM 365
>gi|302784816|ref|XP_002974180.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
gi|300158512|gb|EFJ25135.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
Length = 426
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/365 (49%), Positives = 245/365 (67%), Gaps = 6/365 (1%)
Query: 105 TCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+CPDFF+ IH DL PW A I+ + EA+ FAA R+ I+ G+LY + YY CVQSRAMF
Sbjct: 1 SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60
Query: 164 TIWGFLQLLRRYP-GMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAHFDIPFP 221
T+WG L LL R+P G VPDV+ MF+CMD+P + + S P PL YC + DI FP
Sbjct: 61 TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRYKSRAPPPLLAYCGSRDTVDIAFP 120
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW+EV + W EE I HGS+ W+ + P A+WKGNP V + +R +L+ CN
Sbjct: 121 DWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLACNKT 180
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
D + +GA+I QNW E++ GFK SKLS+QC HRYK+YAEG+AWSVS KYI++C S LI
Sbjct: 181 DKRDFGADIYVQNWIAESQQGFKNSKLSDQCRHRYKLYAEGHAWSVSFKYIMACGSTTLI 240
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+ Y DFF RGL+P +++PI D+C SI V WGN++P EAE IG Q+F+ + L
Sbjct: 241 VQPDYHDFFMRGLLPLHHYWPIDRQDMCSSIDHAVKWGNSHPKEAEAIGSHAQEFLRKDL 300
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASPSPY 458
+MDRVY+YMLHL+ EY+KL +KP P A C ++ C+A+ +Q + L++ S S
Sbjct: 301 SMDRVYEYMLHLLREYAKLQRFKPRVPEGAQLLCKSAVTCIAEAEQLEFLKRTETSNSQT 360
Query: 459 PPCTL 463
PC++
Sbjct: 361 SPCSM 365
>gi|218191256|gb|EEC73683.1| hypothetical protein OsI_08244 [Oryza sativa Indica Group]
Length = 524
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 185/391 (47%), Positives = 261/391 (66%), Gaps = 10/391 (2%)
Query: 82 LTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALR 140
L+ A +PE + +CP +F+ IH+DL PW A IT R + A+ A R
Sbjct: 102 LSSRPACARVPEAHPLPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARDTANFR 161
Query: 141 ILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID----K 196
++++RG+ YV+ Q+R +FTIWG LQLLRRYPG VPD+D+MFDC+D PV+ +
Sbjct: 162 LVVLRGRAYVERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQADRYQ 221
Query: 197 KEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKL 256
E+ + PLFRYC +D D+ FPDWSFWGW E+N++PW+ KD+ G++ W ++
Sbjct: 222 GENATAMPPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDRE 281
Query: 257 PFAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRY 314
P+AYWKGNPDV + R EL+KCN S W A I +Q+W +E+K G+K+S L++QC HRY
Sbjct: 282 PYAYWKGNPDVATK-RKELVKCNVSSKHEWNARIYKQDWIKESKAGYKQSDLASQCTHRY 340
Query: 315 KIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVV 374
KIY EG AWSVS KYIL+CNS+ L+++ +Y DFFSR L+PT++++P+ + C SIK V
Sbjct: 341 KIYIEGSAWSVSEKYILACNSMTLVVTPKYYDFFSRVLMPTQHYWPVRDDNKCSSIKHAV 400
Query: 375 DWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACV 433
DWGN+N +A+KIGK +F++ L+MD +YDYM HL+TEY+KLL +KP P A E C
Sbjct: 401 DWGNSNKKKAQKIGKQASNFIQQELSMDYIYDYMFHLLTEYAKLLRFKPTKPPEAIEICP 460
Query: 434 ESLLCLADPKQRQNLEKAAA-SPSPYPPCTL 463
E L C A ++R+ +E + S + PC L
Sbjct: 461 ELLACQAIGRERKFMEDSMVNSANDAGPCDL 491
>gi|255539447|ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549903|gb|EEF51390.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 528
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 178/369 (48%), Positives = 247/369 (66%), Gaps = 9/369 (2%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
V CP++++ I++DL PWA++ I+ + AK A R++IV GK YV+ Y Q+R +
Sbjct: 123 VSACPEYYRWIYEDLRPWARTGISRDMVERAKTTANFRLVIVNGKAYVEKYRRAFQTRDV 182
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS----FPLPLFRYCTNDAHFDI 218
FT+WG LQLLRRYPG VPD+++MFDC+D PVI + P PLFRYC +D D+
Sbjct: 183 FTLWGILQLLRRYPGKVPDLELMFDCVDWPVIKSSNYSGPNAMAPPPLFRYCGDDDTLDV 242
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
FPDWSFWGWSE+N++PW +++K G++ + W E+ P+AYWKGNP V + R +LMKC
Sbjct: 243 VFPDWSFWGWSEINIKPWERLLRELKEGNEKRRWMEREPYAYWKGNPAV-AETRQDLMKC 301
Query: 279 NDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
N S+ W A + Q+W +E + G+K+S L++QC HRYKIY EG AWSVS KYIL+C+SV
Sbjct: 302 NVSEQQDWNARVYAQDWIKELQQGYKQSNLASQCMHRYKIYIEGSAWSVSEKYILACDSV 361
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L++ Y DFF+R L P +++PI D CRSIK VDWGN + +A+ IGKA +F+
Sbjct: 362 TLLVKPHYYDFFTRSLRPIHHYWPIKDYDKCRSIKFAVDWGNNHKQKAQAIGKAASEFIQ 421
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAAS 454
E L MD VYDYM HL+ EY+KLL +KP P A E C ES+ C A+ +++ +E
Sbjct: 422 EELKMDYVYDYMFHLLNEYAKLLTFKPVIPRKAVELCSESMACPANGIEKEFMMESMVQG 481
Query: 455 PSPYPPCTL 463
P+ PC +
Sbjct: 482 PAETNPCIM 490
>gi|297599669|ref|NP_001047557.2| Os02g0642700 [Oryza sativa Japonica Group]
gi|255671126|dbj|BAF09471.2| Os02g0642700, partial [Oryza sativa Japonica Group]
Length = 474
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 188/396 (47%), Positives = 264/396 (66%), Gaps = 16/396 (4%)
Query: 83 TCLSAMNPIPERRRVASPQKV------QTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKR 135
T A++ P RV PQ + +CP +F+ IH+DL PW A IT R + A+
Sbjct: 47 TLYLALSSRPACARVPEPQPLPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARD 106
Query: 136 FAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID 195
A R++++RG+ YV+ Q+R +FTIWG LQLLRRYPG VPD+D+MFDC+D PV+
Sbjct: 107 TANFRLVVLRGRAYVERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQ 166
Query: 196 ----KKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS 251
+ E+ + PLFRYC +D D+ FPDWSFWGW E+N++PW+ KD+ G++
Sbjct: 167 ADRYQGENATAMPPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVK 226
Query: 252 WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQ 309
W ++ P+AYWKGNPDV + R EL+KCN S W A I +Q+W +E+K G+K+S L++Q
Sbjct: 227 WVDREPYAYWKGNPDVATK-RKELVKCNVSSKHEWNARIYKQDWIKESKAGYKQSDLASQ 285
Query: 310 CNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRS 369
C HRYKIY EG AWSVS KYIL+CNS+ L+++ +Y DFFSR L+PT++++P+ + C S
Sbjct: 286 CTHRYKIYIEGSAWSVSEKYILACNSMTLVVTPKYYDFFSRVLMPTQHYWPVRDDNKCSS 345
Query: 370 IKSVVDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPAPPSSA 428
IK VDWGN+N +A+KIGK +F++ L+MD +YDYM HL+TEY+KLL +KP P A
Sbjct: 346 IKHAVDWGNSNKKKAQKIGKQASNFIQQELSMDYIYDYMFHLLTEYAKLLRFKPTKPPEA 405
Query: 429 FEACVESLLCLADPKQRQNLEKA-AASPSPYPPCTL 463
E C E L C A ++R+ +E + S + PC L
Sbjct: 406 IEICPELLACQAIGRERKFMEDSMVKSANDAGPCDL 441
>gi|356522636|ref|XP_003529952.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 510
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/370 (48%), Positives = 253/370 (68%), Gaps = 10/370 (2%)
Query: 103 VQTCPDFFKSIHKDLEPW-AKSRITMRHIME-AKRFAALRILIVRGKLYVDPYYDCVQSR 160
+TCP +F+ IH+DL PW + R R ++E A+R A R++IV GKLYV+ Y +Q+R
Sbjct: 104 TRTCPSYFRWIHEDLWPWRERDRGITREMLEGARRTAHFRLVIVDGKLYVEKYKKAIQTR 163
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDK---KEHGSFPLPLFRYCTNDAHFD 217
+FT+WG LQLLR YPG VPD++++FDC D+PV+ K K + PLFRYC++ D
Sbjct: 164 DVFTLWGILQLLRMYPGKVPDLELLFDCDDRPVVSKERFKGPNAPTPPLFRYCSDQWSLD 223
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I FPDWSFWGW+E+N++PW K+IK G++ WK+++P+AYWKGNP ++SP R +LMK
Sbjct: 224 IVFPDWSFWGWAEINIKPWKHVLKEIKEGNEKTKWKDRVPYAYWKGNP-LVSPTRKDLMK 282
Query: 278 CN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
CN + W + Q+W +E+ G+KKS L +QC HRYKIY EG+AWSVS KYIL+C+S
Sbjct: 283 CNVTEKDDWNTHLYIQDWDQESSKGYKKSNLGDQCTHRYKIYVEGWAWSVSEKYILACDS 342
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
L + ++ DFF RG++P ++++PI C+S+K V+WGN N +A+ IG+AG F+
Sbjct: 343 TTLYVRSRFHDFFVRGMVPLEHYWPIRDNSKCKSLKFAVEWGNNNTDKAQAIGEAGSKFI 402
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKA-AA 453
E + MD VYDYM HL+ EY+KL +KP P +A E C E++ C D QR+ +E +
Sbjct: 403 HEDMDMDYVYDYMFHLLNEYAKLQRFKPTIPQNAVEYCPETMACGVDGIQRRFMEDSMVK 462
Query: 454 SPSPYPPCTL 463
SPS PCTL
Sbjct: 463 SPSDSNPCTL 472
>gi|356561622|ref|XP_003549080.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 525
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/464 (42%), Positives = 283/464 (60%), Gaps = 20/464 (4%)
Query: 8 PARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEE 67
P + LFP ++ + + +++ AS +K+ TP+ P F
Sbjct: 39 PRSTAVLLFPVMLIIGAFAYTRTLDTHRMFSGASSSKSAQS----TTPYGTSP---FTVS 91
Query: 68 SRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITM 127
R+ A ++CT L+ P + SP TCPD+F+ IH+DL PWA++ IT
Sbjct: 92 IRKPIA--PLNCTAYN-LTGTCPTNLQDHQNSP-ATATCPDYFRWIHEDLRPWARTGITQ 147
Query: 128 RHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFD 187
+ AK+ A R++I++G+ Y++ Y Q+R +F+IWG LQLLRRYPG +PD+++MFD
Sbjct: 148 DMVERAKQTANFRLIILKGRAYLETYSRPYQTRDVFSIWGILQLLRRYPGKIPDLELMFD 207
Query: 188 CMDKPVIDKKEHGS----FPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI 243
C D PV+ + P PLFRYC NDA DI FPDWSFWGW+E+N++PW+ ++
Sbjct: 208 CEDWPVVLADRYNGPNVEQPPPLFRYCGNDATLDIVFPDWSFWGWAEINIKPWHILLGEL 267
Query: 244 KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGF 301
K G+ W + P+AYWKGNP V + R +L+KCN +++ W A + Q+W E+++GF
Sbjct: 268 KEGTTRIPWLNREPYAYWKGNPAV-AETRQDLIKCNVSENQDWNARLFAQDWFRESQEGF 326
Query: 302 KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPI 361
KS L +QC +RYK+Y EG AWSVS KYILSC+S L++ +Y DFF+RGLIP +++PI
Sbjct: 327 NKSDLPSQCTYRYKVYIEGSAWSVSQKYILSCDSTTLLVKPKYYDFFTRGLIPVHHYWPI 386
Query: 362 PSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDY 420
D CRSIK VDWGN + A +IGK DF+ E + MD VYDYM HL+ Y+KL Y
Sbjct: 387 KDDDKCRSIKFAVDWGNNHKQRAHQIGKVASDFIQEEVKMDYVYDYMFHLLNSYAKLFRY 446
Query: 421 KPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
KP+ ++A E CVES++C A+ ++ +E P+ PCT+
Sbjct: 447 KPSKSANATELCVESMVCEAEGSVKKFMMESLVKVPANTDPCTM 490
>gi|357136753|ref|XP_003569968.1| PREDICTED: uncharacterized protein LOC100831246 [Brachypodium
distachyon]
Length = 543
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 179/368 (48%), Positives = 257/368 (69%), Gaps = 10/368 (2%)
Query: 105 TCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+CP +F+ IH+DL PW A IT + A+ A R++++RG+ Y++ Q+R +F
Sbjct: 139 SCPAYFRFIHEDLHPWRAAGGITRAMLDRARATANFRLVVLRGRAYIELIAPAFQTRDLF 198
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID----KKEHGSFPLPLFRYCTNDAHFDIP 219
TIWG LQLLRRYPG VPD+D+MFDC+D PV+ + E+ + PLFRYC ++ D+
Sbjct: 199 TIWGILQLLRRYPGRVPDLDLMFDCVDWPVVRADQYEGENATAMPPLFRYCGDNETLDVV 258
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
FPDWSFWGW+E+N++PW+ KD+ G++ W ++ P+AYWKGNPDV + +R EL+KCN
Sbjct: 259 FPDWSFWGWAEINIKPWDALRKDLDAGNRRVRWVDREPYAYWKGNPDVAA-IRQELVKCN 317
Query: 280 DS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
S + W A I +Q+W +E+K G+KKS L++QC HRYKIY EG AWSVS KYIL+C+S+
Sbjct: 318 VSSKQEWNARIYKQDWIKESKAGYKKSDLASQCTHRYKIYIEGSAWSVSEKYILACDSMT 377
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMES 397
L+I+ +Y DFFSR L+PTK+++P+ + C SIK VDWGN++ +A++IGK +F++
Sbjct: 378 LVITPKYYDFFSRVLLPTKHYWPVRADSKCSSIKYAVDWGNSHKKKAQQIGKQASNFIQQ 437
Query: 398 -LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKA-AASP 455
L+MD +YDYM HL+TEY+KLL +KP P A E C ESL C A ++R+ +E + S
Sbjct: 438 ELSMDYIYDYMFHLLTEYAKLLRFKPTKPPEAVEVCPESLACQAIGRERKFMEDSMVKSA 497
Query: 456 SPYPPCTL 463
+ PC L
Sbjct: 498 NVAGPCDL 505
>gi|359489773|ref|XP_002273091.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 519
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 172/366 (46%), Positives = 251/366 (68%), Gaps = 8/366 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCPD+F+ IH+DL PW ++ IT + A+R A R++I+ GK YV+ + +Q+R MFT
Sbjct: 117 TCPDYFRWIHQDLLPWKQTGITRDMVDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFT 176
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+WG LQLLR YPG +PD+++MFDC D+PV+ D + + P PLFRYC +D DI FP
Sbjct: 177 LWGILQLLRWYPGRLPDLELMFDCDDRPVVRMRDFRGPNAAPPPLFRYCGDDWSLDIVFP 236
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW+E N++PW KDIK G++ WK+++P AYW+GNP V +P R +L+KCN
Sbjct: 237 DWSFWGWAETNIKPWRNVLKDIKEGNRRTKWKDRVPLAYWRGNPHV-APTRGDLLKCNVS 295
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
D W + Q+W +++K G+++S L +QC HRYKIY EG+AWSVS KYIL+C+S+ L+
Sbjct: 296 DKADWNTRLYLQDWGQQSKIGYRQSNLEDQCTHRYKIYIEGWAWSVSEKYILACDSMTLL 355
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
I +Y DFF RGL+P ++++PI + CR ++ V+WGN + +A+ +G+ F+ E L
Sbjct: 356 IRPRYHDFFIRGLVPLQHYWPIRDNNKCRDLRFAVEWGNNHTDKAQTMGETTSKFIQEDL 415
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKA-AASPSP 457
MD VYDYM HL+ EY+KLL +KP P A E C E++ C A+ R+ +E++ +P+
Sbjct: 416 KMDYVYDYMFHLLNEYAKLLKFKPTIPPGAVEVCAETMACPAEGAWRKFMEESLEKNPTD 475
Query: 458 YPPCTL 463
PC+L
Sbjct: 476 TTPCSL 481
>gi|332071136|gb|AED99886.1| glycosyltransferase [Panax notoginseng]
Length = 546
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 176/367 (47%), Positives = 248/367 (67%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CP++F+ I++DL PW ++ IT + A+R A R++I+ G+ YV+ + QSR +FT
Sbjct: 145 SCPEYFRWIYEDLRPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDVFT 204
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK-EHG---SFPLPLFRYCTNDAHFDIPF 220
+WG LQLLR YPG VPD+D+MFDC+D PVI + HG + P PLFRYC +D+ DI F
Sbjct: 205 LWGILQLLRMYPGKVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDIVF 264
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PDW+FWGW E+N++PW KD+K G+ W ++ P+AYWKGNP +++ R++L+KCN
Sbjct: 265 PDWTFWGWPEINIKPWGSLLKDLKEGNTGTQWMDREPYAYWKGNP-IVAKTRMDLLKCNV 323
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
D + W A + +WA E++ G+K+S L++QC HRYKIY EG AWSVS KYIL+C+SV L
Sbjct: 324 SDKQDWNARVYAXDWARESQLGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSVTL 383
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
+ +Y DFF+RGL+P +++PI D CRSIK VDWGN + +A IGK +F+ E
Sbjct: 384 XVKPRYYDFFTRGLMPVHHYWPIRDDDKCRSIKFAVDWGNNHKQKAHSIGKEASNFIQED 443
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPS 456
L MD VYDYM HL+ EY+KLL YKP P A E C E++ C A+ ++ +E P+
Sbjct: 444 LKMDYVYDYMFHLLNEYAKLLRYKPTVPPKAVELCSETMACPAEGFTKKFMMESIVKGPT 503
Query: 457 PYPPCTL 463
PC +
Sbjct: 504 DKSPCVM 510
>gi|326509457|dbj|BAJ91645.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 510
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/380 (47%), Positives = 244/380 (64%), Gaps = 20/380 (5%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA-LRILIVRGKLYVDPYYDC 156
A P TCP +F+ IH+DL PW + IT + A+R+ A R+ ++ G+LYV Y C
Sbjct: 108 AEPPTPTTCPVYFRWIHEDLRPWRATGITREALEGARRYGAKFRVTVLSGRLYVARYGRC 167
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF---PLPLFRYCTND 213
Q+R +FT WG LQLLRRY G VPD+D+MFDC D PV++ +HGS P PLFRYC ++
Sbjct: 168 FQTRDVFTQWGILQLLRRYAGRVPDLDLMFDCQDLPVVNAGDHGSHTPPPPPLFRYCGSE 227
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI FPDWSFWGW E+N++PW ++IK + A W + P+AYWKGNP V + R
Sbjct: 228 PTLDIAFPDWSFWGWPELNIKPWEALRREIKEANAALDWTRRAPYAYWKGNPAVAA-ARR 286
Query: 274 ELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
EL+KCN S + W A I Q+W E +DGF++S L+ QC HRYK+Y EG WSVS KYIL
Sbjct: 287 ELLKCNVSGKRDWNARIYAQDWRTEVRDGFRESDLAKQCTHRYKMYVEGRGWSVSEKYIL 346
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPS-ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+C+SVAL++ ++ DFFSRGL+P ++++P+ CRSIK VDWGN++P +A +IG
Sbjct: 347 ACDSVALVVRPRFHDFFSRGLVPLQHYWPVRGDRGQCRSIKFAVDWGNSHPHKAREIGGN 406
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLE 449
F+ E L MDRVYDYM HL++EY++LL Y P P A E S+ + R+ LE
Sbjct: 407 ASRFVQEELAMDRVYDYMYHLLSEYARLLRYTPTVPGGAVEVTARSMA-----RGRRGLE 461
Query: 450 K------AAASPSPYPPCTL 463
+ A +P PC L
Sbjct: 462 REFMVGTAVDAPGSAEPCEL 481
>gi|242066560|ref|XP_002454569.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
gi|241934400|gb|EES07545.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
Length = 552
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/390 (46%), Positives = 260/390 (66%), Gaps = 13/390 (3%)
Query: 83 TCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRI 141
TC + +P P A+ TCP +F+ IH+DL PW A IT + A+ A R+
Sbjct: 129 TCAAVPDPPP---LPANSNASSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRL 185
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID----KK 197
+++RG+ Y++ Q+R +FTIWG LQLLRRYPG VPD+D+MFDC+D PV+ +
Sbjct: 186 VVIRGRAYIERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYEG 245
Query: 198 EHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLP 257
E+ + PLFRYC N+ D+ FPDWSFWGW E+N++PW+ K++ G++ W + P
Sbjct: 246 ENATVLPPLFRYCGNNETLDVVFPDWSFWGWPEINIKPWDALQKELNRGNKRVKWLNREP 305
Query: 258 FAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYK 315
+AYWKGNPDV + +R EL+KCN S W A I +Q+W +E K G+K+S L+ QC HRYK
Sbjct: 306 YAYWKGNPDV-AVIRQELVKCNVSSEHEWNARIYKQDWLKEIKAGYKQSNLAGQCTHRYK 364
Query: 316 IYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVD 375
IY EG AWSVS KYIL+C+S+ L+++ +Y DF+SR L+P ++++PI + C SIK VD
Sbjct: 365 IYIEGSAWSVSEKYILACDSMTLVVTPKYYDFYSRVLMPMQHYWPIWDDNKCSSIKYAVD 424
Query: 376 WGNANPSEAEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVE 434
WGN++ +A++IGK G +F++ L+M+ VYDYM HL+TEY+KLL +KP P A E C E
Sbjct: 425 WGNSHKQKAQRIGKQGSNFIQKELSMEYVYDYMFHLLTEYAKLLRFKPTKPPEAIEVCPE 484
Query: 435 SLLCLADPKQRQNLEKAAA-SPSPYPPCTL 463
SL C A ++R+ ++ + S S PC L
Sbjct: 485 SLACQAIGRERKFMKDSMVRSASDAGPCDL 514
>gi|449446167|ref|XP_004140843.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 442
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/386 (48%), Positives = 253/386 (65%), Gaps = 11/386 (2%)
Query: 88 MNPIPERRRVASPQK--VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVR 145
M+ P+ +R P V TCP++F+ IH+DL+PWA IT + EA++ A R+++V
Sbjct: 1 MSQEPQGQRKQDPDGPMVATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVE 60
Query: 146 GKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF--- 202
GK YV+ Y QSR T+WG +QLLRRYPG +PD+D+MF C D+P I +K++
Sbjct: 61 GKAYVEAYGKAYQSRDNLTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKP 120
Query: 203 -PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYW 261
P PLFRY +DA +DI FPDWSFWGW E+N++ W KDIK G++ W ++ P+AYW
Sbjct: 121 SPPPLFRYSGDDATWDIVFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYW 180
Query: 262 KGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAE 319
KGNP V + R +L+KCN ++ W A + RQNW +E+K GFK S L+NQC++RYKIY E
Sbjct: 181 KGNPAV-AYTRRDLLKCNVTQKQDWSARLYRQNWDKESKAGFKDSNLANQCDYRYKIYIE 239
Query: 320 GYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNA 379
G AWSVS KYIL+C+SV+LI+ +Y DFF+R LIP K+++PI S C SIK V WGN
Sbjct: 240 GKAWSVSEKYILACDSVSLIVRPRYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVHWGNT 299
Query: 380 NPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC 438
+ EA IGKA + E L M+ +YDYM HL+ +YSKLL +KP P +A E ESL
Sbjct: 300 HSQEAMAIGKAASKLIEEELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELLSESLAS 359
Query: 439 LADPKQRQN-LEKAAASPSPYPPCTL 463
A R++ +E SP+ PC L
Sbjct: 360 AAKGSIRKSMMESVVTSPAESGPCAL 385
>gi|357497409|ref|XP_003618993.1| KTEL motif-containing protein [Medicago truncatula]
gi|355494008|gb|AES75211.1| KTEL motif-containing protein [Medicago truncatula]
Length = 515
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 173/373 (46%), Positives = 248/373 (66%), Gaps = 14/373 (3%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q TCP FF+ IH+DL+PW + IT + AKR A +++IV GK+YV+ Y +Q+R
Sbjct: 109 QNSHTCPSFFRWIHEDLKPWKEKGITREMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTR 168
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF------PLPLFRYCTNDA 214
+FT+WG LQLLR +PG +PD+++MFDC D+PVI K G+F P PLFRYC++
Sbjct: 169 DVFTLWGILQLLRMFPGKLPDLELMFDCEDRPVIHK---GNFQGPNASPPPLFRYCSDQW 225
Query: 215 HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
DI FPDWSFWGW+E N++PW K+IK G++ WK+++P+AYWKGNP+V + R
Sbjct: 226 SLDIVFPDWSFWGWAETNIKPWKNILKEIKEGNKETKWKDRVPYAYWKGNPNV-AATRKN 284
Query: 275 LMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
L++CN + W + Q+W +E+ G+KKS L NQC HRYKIY EG+AWSVS KYI++
Sbjct: 285 LLRCNATSKDDWNTRLYIQDWDKESTQGYKKSSLGNQCTHRYKIYIEGWAWSVSEKYIMA 344
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
C+S+ L + + DFF RG+ P ++++PI C S+K VDWGN + +A+ IG+A
Sbjct: 345 CDSMTLYVRPNFYDFFIRGMDPLQHYWPIRDNSKCTSLKFAVDWGNKHADKAQAIGEAAS 404
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKA 451
F+ E L M+ VY+YM H++ EY+KLL +KP P A E C E++ C + QR+ +E++
Sbjct: 405 KFIQEELDMNNVYNYMFHILNEYAKLLKFKPTIPQGAVEFCSETMACDVNGNQRKFMEES 464
Query: 452 AAS-PSPYPPCTL 463
PS PCT+
Sbjct: 465 MVKVPSDSNPCTI 477
>gi|255541542|ref|XP_002511835.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549015|gb|EEF50504.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 522
Score = 369 bits (947), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 179/388 (46%), Positives = 247/388 (63%), Gaps = 8/388 (2%)
Query: 83 TCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL 142
TC + P + Q CP +FK IH+DL PW ++ IT I A+R A R++
Sbjct: 99 TCPADYYPKTHNSTNSDRQSNVICPSYFKWIHEDLRPWRETGITRDMIERARRTAHFRLV 158
Query: 143 IVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH--- 199
IV G+ YV+ Y +Q+R M T+WG LQLLR YPG VPD+++MFDC D+PV+ ++
Sbjct: 159 IVDGRAYVEKYRQSIQTRDMITLWGILQLLRLYPGKVPDLELMFDCDDRPVVRSEDFPGP 218
Query: 200 GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFA 259
+ P PLFRYC +D DI FPDWSFWGW+EVN++PW K I GS+ K WK+++P+A
Sbjct: 219 TAGPPPLFRYCADDTSLDIVFPDWSFWGWAEVNIKPWKSMLKGITKGSKRKKWKDRVPYA 278
Query: 260 YWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIY 317
YWKGNP V S R +LM CN D W A + Q+W +E + +K SKL +QC HRYKIY
Sbjct: 279 YWKGNPYV-SANRGDLMTCNVSDKHDWNARLYAQDWGKEIRQKYKHSKLEDQCTHRYKIY 337
Query: 318 AEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWG 377
EG AWSVS KYIL+C+S+ L+++ Y DFF R ++P ++++PI + + C+ I+ V+WG
Sbjct: 338 IEGRAWSVSDKYILACDSMTLVVNPAYYDFFMRSMVPIQHYWPIRAKNKCKDIEFAVEWG 397
Query: 378 NANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL 436
N + +AE IGK G F+ E+L M+ +Y YM HL+ EY+KLL +KP P E C ESL
Sbjct: 398 NNHTDKAEAIGKGGSRFIQENLKMEYIYGYMFHLLKEYAKLLKFKPEIPKGGAEVCAESL 457
Query: 437 LCLADPKQRQNL-EKAAASPSPYPPCTL 463
C + R+ + E SPS PC +
Sbjct: 458 ACSENGLVRKFMKESMVMSPSSTLPCAM 485
>gi|226509990|ref|NP_001142048.1| uncharacterized protein LOC100274204 [Zea mays]
gi|194706898|gb|ACF87533.1| unknown [Zea mays]
gi|413919037|gb|AFW58969.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 508
Score = 369 bits (947), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 186/377 (49%), Positives = 250/377 (66%), Gaps = 14/377 (3%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFA-ALRILIVRGKLYVDPY--YDC 156
P++V CP +F+ IH+DL PW + +T + A+RFA LR+ +V G+LYV Y C
Sbjct: 105 PREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAVVAGRLYVARYGRRRC 164
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HGSFPLP----LFRYCT 211
Q+RA FT WG LQLLRRYPG VPD+D+MFDC D PV+ E HG+ P P LFRYC
Sbjct: 165 FQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHGAQPQPPPPPLFRYCG 224
Query: 212 NDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
++A DI FPDWSFWGW E+N++PW ++I + +W ++ P+AYWKGNP V +
Sbjct: 225 SEATLDIAFPDWSFWGWPELNIKPWEALQREINEANAVVNWTDRAPYAYWKGNPKVGAE- 283
Query: 272 RVELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
R+ L++CN S + W A + Q+W EE + GF+ S LS QC HRY+IY EG WSVS KY
Sbjct: 284 RLLLLRCNASGERDWNARVYAQDWGEEVRHGFRGSDLSKQCTHRYRIYIEGRGWSVSEKY 343
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS-ADLCRSIKSVVDWGNANPSEAEKI- 387
IL+C SVAL++ ++ DFFSRGL P ++++P+ +CRSIK VDWGNA+ A+++
Sbjct: 344 ILACESVALMVRPRFHDFFSRGLSPLRHYWPVRGDRGMCRSIKHAVDWGNAHADRAQEMA 403
Query: 388 GKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ- 446
G A + E LTMDRVYDYM HL+TEY++LL Y+PA P A E VES++ +RQ
Sbjct: 404 GNASRFIREELTMDRVYDYMFHLLTEYARLLRYRPAVPRGAAEVTVESMIRGRRGLERQF 463
Query: 447 NLEKAAASPSPYPPCTL 463
++ AAS S PC L
Sbjct: 464 MVDTLAASASGDGPCRL 480
>gi|449526435|ref|XP_004170219.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 426
Score = 368 bits (945), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 181/369 (49%), Positives = 245/369 (66%), Gaps = 9/369 (2%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
V TCP++F+ IH+DL+PWA IT + EA++ A R+++V GK YV+ Y QSR
Sbjct: 2 VATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSRDN 61
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTNDAHFDI 218
T+WG +QLLRRYPG +PD+D+MF C D+P I +K++ P PLFRY +DA +DI
Sbjct: 62 LTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATWDI 121
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
FPDWSFWGW E+N++ W KDIK G++ W ++ P+AYWKGNP V + R +L+KC
Sbjct: 122 VFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAV-AYTRRDLLKC 180
Query: 279 NDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
N ++ W A + RQNW +E+K GFK S L+NQC++RYKIY EG AWSVS KYIL+C+SV
Sbjct: 181 NVTQKQDWSARLYRQNWDKESKAGFKDSNLANQCDYRYKIYIEGKAWSVSEKYILACDSV 240
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
+LI+ +Y DFF+R LIP K+++PI S C SIK V WGN + +A IGKA +
Sbjct: 241 SLIVRPRYYDFFTRSLIPMKHYWPISSNRKCSSIKFAVHWGNTHRQQAMAIGKAASKLIE 300
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQN-LEKAAAS 454
E L M+ +YDYM HL+ +YSKLL +KP P +A E ESL R++ +E S
Sbjct: 301 EELKMEYIYDYMFHLLNQYSKLLTFKPTVPPNATELLSESLASAGKGSIRKSMMESVVTS 360
Query: 455 PSPYPPCTL 463
P+ PC L
Sbjct: 361 PAESGPCAL 369
>gi|224127624|ref|XP_002320120.1| predicted protein [Populus trichocarpa]
gi|222860893|gb|EEE98435.1| predicted protein [Populus trichocarpa]
Length = 411
Score = 367 bits (942), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 174/368 (47%), Positives = 248/368 (67%), Gaps = 8/368 (2%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
TCP +F+ IH DL W ++ IT I A++ A R++IV GK YV+ Y +Q+R M
Sbjct: 9 TSTCPSYFQWIHDDLRHWKETGITQDMIERARKTAHFRLVIVNGKAYVEKYRQSIQTRDM 68
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH---GSFPLPLFRYCTNDAHFDIP 219
FT+WG LQLLR YPG +PD+++MFDC D+PVI K + P PLFRYC++ DI
Sbjct: 69 FTLWGILQLLRLYPGRLPDLELMFDCDDRPVIPSKHFRGPNAAPPPLFRYCSDWQSLDIV 128
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
FPDWSFWGW+E N++PW K+IK G+ WK++ P+AYW+GNP V SP+R +L+KCN
Sbjct: 129 FPDWSFWGWAETNIRPWKNLLKEIKEGNSRTKWKDRTPYAYWRGNPWV-SPIRQDLLKCN 187
Query: 280 DSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
S+ W + Q+W +++K+G+++S L +QC HRYKIY EG+AWSVS KYIL+C+SV
Sbjct: 188 VSEQNDWNTRLYLQDWVKQSKEGYRESNLQDQCTHRYKIYIEGWAWSVSEKYILACDSVT 247
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
L + +Y DFF RG++P ++++PI C S+K V+WGN + EA+ IG+A +F+ E
Sbjct: 248 LYVRPRYHDFFIRGMVPLQHYWPIRDNSKCTSLKFAVEWGNNHTKEAQAIGEAASNFIHE 307
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAA-SP 455
+ +D VYDY+ HL+ EY+KLL +KP P A E C E++ C + R+ +E++ SP
Sbjct: 308 DMKIDYVYDYIFHLLNEYAKLLKFKPKIPPGADELCPETMACPTNGIHRKFMEESMVLSP 367
Query: 456 SPYPPCTL 463
S PCTL
Sbjct: 368 SDAIPCTL 375
>gi|219363409|ref|NP_001136898.1| uncharacterized protein LOC100217055 [Zea mays]
gi|194697518|gb|ACF82843.1| unknown [Zea mays]
gi|413923256|gb|AFW63188.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
Length = 551
Score = 366 bits (940), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 181/390 (46%), Positives = 260/390 (66%), Gaps = 13/390 (3%)
Query: 83 TCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRI 141
TC + +P P A+ TCP +F+ IH+DL PW A IT + A+ A R+
Sbjct: 128 TCAAFPDPPP---LPANSNASSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRL 184
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID----KK 197
+++RG+ Y++ Q+R +FTIWG LQLLRRYPG VPD+D+MFDC+D PV+ +
Sbjct: 185 VVIRGRAYIERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQG 244
Query: 198 EHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLP 257
E+ + PLFRYC ++ D+ FPDWSFWGW E+N++PW+ K++ G++ W + P
Sbjct: 245 ENATLLPPLFRYCGDNETLDVVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREP 304
Query: 258 FAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYK 315
+AYWKGNPDV + R EL+KCN S W A I +Q+W +E K G+K+S L++QC HRYK
Sbjct: 305 YAYWKGNPDV-AVTRQELVKCNVSSKHEWNARIYKQDWLKEIKAGYKQSDLASQCTHRYK 363
Query: 316 IYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVD 375
IY EG AWSVS KYIL+C+S+ L+++ +Y DF+SR L+P ++++PI + C SIK VD
Sbjct: 364 IYIEGSAWSVSEKYILACDSMTLVVTPKYYDFYSRVLMPMQHYWPIWDDNKCSSIKFAVD 423
Query: 376 WGNANPSEAEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVE 434
WGN++ +A++IGK G +F++ L+M+ VYDYM HL+TEY+KLL +KP P A E C E
Sbjct: 424 WGNSHKQKAQRIGKQGSNFIQKELSMEYVYDYMFHLLTEYAKLLRFKPRKPPEAIEVCPE 483
Query: 435 SLLCLADPKQRQNLEKAAA-SPSPYPPCTL 463
SL C A ++++ +E + S S PC L
Sbjct: 484 SLACQAIGREKKFMEDSMVRSASDAGPCDL 513
>gi|357507237|ref|XP_003623907.1| KDEL motif-containing protein [Medicago truncatula]
gi|355498922|gb|AES80125.1| KDEL motif-containing protein [Medicago truncatula]
Length = 454
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 174/390 (44%), Positives = 251/390 (64%), Gaps = 11/390 (2%)
Query: 83 TCLSAMNPIPERRRVASPQK-VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRI 141
TC + +PI + P TCP+ F+ IH+DLEPW + IT + + + LRI
Sbjct: 56 TCPNDHSPIQAFHQDQDPSSSTSTCPEHFRWIHEDLEPWKSTGITREMVDSGENISQLRI 115
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-- 199
+I +GK+YV+ Y D Q+RA FT+WG +QLLR YPG VPD++++F+ D+ V+DKK +
Sbjct: 116 VIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETGDRAVVDKKRYEG 175
Query: 200 --GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLP 257
P P+F YC + DI FPDWS+WGW+E ++PW + KDI+ ++ WK+++P
Sbjct: 176 PEAVTPPPIFSYCGKNEALDIVFPDWSYWGWAETRIKPWEKVLKDIQESNKKIKWKDRIP 235
Query: 258 FAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYK 315
+A+WKGN LS R EL KCN D W A + +W EE GFK +KL +QC HRYK
Sbjct: 236 YAFWKGN--FLSNPRHELRKCNVTDQHDWNARVYSVDWNEEIDQGFKNTKLEDQCAHRYK 293
Query: 316 IYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVD 375
IY EG +WSVS KYI++C+S+ L I +Y DFF+R L+P K+++PI ++C+ IK VD
Sbjct: 294 IYVEGISWSVSEKYIIACDSMTLFIKPRYYDFFTRSLVPYKHYWPINKQNMCQDIKYAVD 353
Query: 376 WGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVE 434
WGN +P +A++IG+ G F+ E++ M VYDYMLHL+TEY+ L+ ++P P+ A E C E
Sbjct: 354 WGNTHPEKAQEIGREGTRFVEENVNMKLVYDYMLHLLTEYANLIRFEPKIPAEAIEVCTE 413
Query: 435 SLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
++ C D R+ +E SPS PPC +
Sbjct: 414 NVACSMDGIWREFMVESMVKSPSDTPPCAM 443
>gi|357168044|ref|XP_003581455.1| PREDICTED: O-glucosyltransferase rumi homolog [Brachypodium
distachyon]
Length = 520
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 176/363 (48%), Positives = 237/363 (65%), Gaps = 16/363 (4%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFA-ALRILIVRGKLYVDPYYDC 156
+S +CP +F+ IH+DL PW + IT + A R+ R+ +V G+LYV Y C
Sbjct: 114 SSATTTGSCPAYFRWIHEDLRPWRATGITRETLEGAHRYMPKFRVTVVAGRLYVRRYGRC 173
Query: 157 VQSRAMFTIWGFLQLLRRYP-----GMVPDVDIMFDCMDKPVIDKKEH-GSFPLPLFRYC 210
Q+R +FT WG LQLLRRY +VPD+D+MFDC D PV+D H G P PLFRYC
Sbjct: 174 FQTRDVFTQWGILQLLRRYNTTGRRAVVPDLDLMFDCQDLPVVDAGNHRGCHPPPLFRYC 233
Query: 211 TNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
++ DI FPDWSFWGW E+N++PW ++I G+ A W + P+AYWKGNP ++
Sbjct: 234 GSEPTLDIAFPDWSFWGWPELNIKPWGTLRREIDEGNAAVDWTRRAPYAYWKGNP-MVGT 292
Query: 271 LRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
R+EL+KCN S+ W A I Q+W +E + GF++S L+ QC HRYKIY EG WSVS K
Sbjct: 293 ARLELLKCNVSRKRDWNARIYAQDWGKEVRGGFRESDLAKQCTHRYKIYIEGRGWSVSEK 352
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
YIL+C+SVAL++ ++ DFFSRGL+P ++++PI CRSIK VDWGN++ +A +IG
Sbjct: 353 YILACDSVALVVRPRFHDFFSRGLMPLQHYWPIRDRGQCRSIKFAVDWGNSHVDKAREIG 412
Query: 389 KAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQN 447
F+ E LTMDRVYDYM HL++EY++LL YKP P A E VES+ R+
Sbjct: 413 GNASKFIQEELTMDRVYDYMFHLLSEYAELLRYKPTIPDGAVEVTVESMT-----HGRRG 467
Query: 448 LEK 450
LE+
Sbjct: 468 LER 470
>gi|255541544|ref|XP_002511836.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549016|gb|EEF50505.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 549
Score = 364 bits (934), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 176/368 (47%), Positives = 237/368 (64%), Gaps = 11/368 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CPD+F+ IH+DL PWA + I+ + AKRFA R++IV GK YV+ YY Q+R +FTI
Sbjct: 113 CPDYFRWIHEDLRPWASTGISRDTVESAKRFATFRLVIVDGKAYVERYYHSFQTRDLFTI 172
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLP-LFRYCTNDAHFDIPF 220
WG +QLLR YPG VPDV++MF C D P I K ++ + P P LF+Y N+ F + F
Sbjct: 173 WGIVQLLRLYPGRVPDVELMFQCGDLPEIQKGDYQGPGATLPPPALFQYSGNETAFAVTF 232
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW EV+++PW + I GSQ K+W +++P+AYW+GN V S R +L+KC
Sbjct: 233 PDWSFWGWVEVDIKPWKSMLEGITKGSQRKNWTDRVPYAYWRGNSHV-SRDRKDLLKCKS 291
Query: 281 SKL---WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
S W A + Q+W +E +GFK S L +QC HRYKIY EG AWSVS KYIL+C+S+
Sbjct: 292 SIFSHDWNARLYSQDWGKETHNGFKHSHLEDQCTHRYKIYIEGRAWSVSDKYILACDSMT 351
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
L+I Y DFF R +IP ++++PI + CR IK VDWGN + +AE IGK G F+ +
Sbjct: 352 LLIKPDYYDFFMRSMIPMQHYWPIRKTNKCRDIKFAVDWGNNHADKAETIGKGGSAFIHD 411
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAA-SP 455
+L M+ VY YMLHL EY+KL+ +KP P E C ES+ C R+ +E + SP
Sbjct: 412 NLKMEYVYGYMLHLFREYAKLMKFKPEIPQGGVEVCAESMACSEGGLIREFMESSMEISP 471
Query: 456 SPYPPCTL 463
S PC +
Sbjct: 472 SSTLPCAM 479
>gi|242073832|ref|XP_002446852.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
gi|241938035|gb|EES11180.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
Length = 505
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 175/365 (47%), Positives = 239/365 (65%), Gaps = 8/365 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA--LRILIVRGKLYVDPYYDCVQSRAMF 163
CP +F+ IH+DL PW + +T + A+R A R+ +V G+LYV Y C Q+RA F
Sbjct: 114 CPSYFRWIHEDLRPWRAAGVTRGALEGARRLFAPKFRVTVVAGRLYVARYGRCFQTRAAF 173
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
T WG LQLLRRYPG VPD+D+MFDC D PV+ + P PLFRYC ++ DI FPDW
Sbjct: 174 TQWGILQLLRRYPGRVPDLDLMFDCDDLPVVGAGDRHQAPPPLFRYCGSETTLDIAFPDW 233
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS-- 281
SFWGW E+N++PW ++I + +W ++ P+AYWKGNP V + R+ L++CN S
Sbjct: 234 SFWGWPELNIKPWEALRREINEENAMVNWMDRAPYAYWKGNPKVGAE-RLLLLRCNASGE 292
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ W A + Q+W +E + GF+ S LS QC HRYKIY EG WSVS KYIL+C+SVAL++
Sbjct: 293 RDWNARVYAQDWGKEVRHGFRGSDLSKQCTHRYKIYIEGRGWSVSEKYILACDSVALMVR 352
Query: 342 QQYKDFFSRGLIPTKNHFPIPS-ADLCRSIKSVVDWGNANPSEAEKI-GKAGQDFMESLT 399
++ DFFSRGL P ++++P+ +CRSIK VDWGNA+ A+++ G A + E LT
Sbjct: 353 PRFHDFFSRGLSPLRHYWPVRGDRGMCRSIKYAVDWGNAHTDRAQEMAGNASRFIQEELT 412
Query: 400 MDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPY 458
MDRVYDYM HL+TEY++LL Y+P P A E VES+ +RQ + + A A+ +
Sbjct: 413 MDRVYDYMFHLLTEYARLLRYRPEVPRGAAEVTVESMTRGKRGLERQFMADTAVAAANGE 472
Query: 459 PPCTL 463
PC L
Sbjct: 473 GPCRL 477
>gi|294460113|gb|ADE75639.1| unknown [Picea sitchensis]
Length = 492
Score = 361 bits (927), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 174/394 (44%), Positives = 254/394 (64%), Gaps = 6/394 (1%)
Query: 75 KIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK 134
K+++C+ T SA + +P ++ + + CP +FK I +D++PWAKS IT+ + AK
Sbjct: 67 KMLNCSPSTVCSASSWLPLVKKSSYKDEAAFCPFYFKWIRQDMKPWAKSGITIDMVEAAK 126
Query: 135 RFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI 194
A+ R+ +V G++Y++ Y C QSR +FTIWG Q L+ YP ++PD+D MF+C D PVI
Sbjct: 127 PEASFRLTVVNGRMYIESYRKCYQSRDLFTIWGIAQFLKFYPRLLPDLDSMFNCDDNPVI 186
Query: 195 DKKEHGSF---PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS 251
+ ++ P PLFRY ++ FDI FPDWSFWGW E+ PW K+I++GSQ
Sbjct: 187 HRGDYNDSTKPPPPLFRYSGSEDTFDIVFPDWSFWGWPEIRTPPWETLAKEIQNGSQKVK 246
Query: 252 WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN 311
W+++ P AYWKGNP + R +LM C + WG + Q+W +E + GF++SKLS+QC+
Sbjct: 247 WEDRDPTAYWKGNP-YMGQGRQDLMNCIHRRHWGGRLYNQDWDKETRQGFRQSKLSDQCH 305
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIK 371
HRYKIY EG AWSVSLK I++C+S L+I+ QY DF+ RGL+P ++++PI + C SI+
Sbjct: 306 HRYKIYIEGNAWSVSLKNIMACDSPTLLITPQYYDFYLRGLVPQRHYWPIRADKKCDSIQ 365
Query: 372 SVVDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
VDWGN +P EA +I K F+++ L M VYDYM H++ EYSKLL YKP+ A E
Sbjct: 366 FAVDWGNKHPKEAMEIAKEAIKFIQNELKMSNVYDYMFHILNEYSKLLKYKPSVSEKAAE 425
Query: 431 ACVESLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
C E++ C A+ + + + + S PPC L
Sbjct: 426 YCSETIFCFANEAEEDYMKDSVVTTASASPPCKL 459
>gi|357507235|ref|XP_003623906.1| O-glucosyltransferase rumi [Medicago truncatula]
gi|355498921|gb|AES80124.1| O-glucosyltransferase rumi [Medicago truncatula]
Length = 442
Score = 361 bits (927), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 174/390 (44%), Positives = 250/390 (64%), Gaps = 10/390 (2%)
Query: 83 TCLSAMNPIPERRRVASPQK-VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRI 141
TC +PI + P TCP+ F+ IH+DLEPW + IT + K + LRI
Sbjct: 49 TCQKDHHPIQAFHQDQDPSSSTSTCPEHFRWIHEDLEPWKSTGITRETVESGKNISQLRI 108
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HG 200
+I +GK YV+ Y D Q+R +FT+WG +QLLR YPG VPD++++F+ D+ V+DKK G
Sbjct: 109 VIKKGKAYVETYADSFQTRDLFTVWGIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQG 168
Query: 201 SFPL---PLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLP 257
S + P+F YC + DI FPDWSFWGW+E ++PW + KDI+ ++ +WK+++P
Sbjct: 169 SQSVTLPPIFSYCGQNDALDIVFPDWSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIP 228
Query: 258 FAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYK 315
+A+WKGN V S R +L +CN D W A I +W +E + GF +KL +QC HRYK
Sbjct: 229 YAFWKGNTHV-SSQRYKLRQCNVTDQHDWNARIYSVHWNKEIEHGFNSTKLEDQCTHRYK 287
Query: 316 IYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVD 375
IY EG +WSVS KYI++C+S+ L I +Y DFF+R L+P K+++PI ++C+ IK VD
Sbjct: 288 IYVEGRSWSVSEKYIIACDSMTLFIKPRYYDFFTRSLVPYKHYWPINKQNMCQDIKYAVD 347
Query: 376 WGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVE 434
WGN +P +AE+IG+ G F+ E++ M VYDYMLHL+TEY+KL+ ++ P+ A E C E
Sbjct: 348 WGNTHPGKAEEIGREGTRFIEENVNMKLVYDYMLHLLTEYAKLMRFEATIPAGAVEVCSE 407
Query: 435 SLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
+L C R+ +E SPS PCT+
Sbjct: 408 NLACPMGGIWREFMVESMVKSPSDTLPCTM 437
>gi|357481655|ref|XP_003611113.1| KTEL motif-containing protein [Medicago truncatula]
gi|355512448|gb|AES94071.1| KTEL motif-containing protein [Medicago truncatula]
Length = 502
Score = 361 bits (926), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 179/412 (43%), Positives = 260/412 (63%), Gaps = 23/412 (5%)
Query: 63 TFKEESRRSQAYK--IVHCTY--LTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLE 118
T E+ + S ++ +++CT TC S+ P + V CP++FK IH+DL+
Sbjct: 68 TKNEKIKHSNKFQPTVLNCTIKNQTCSSSNYPTTKNNMV--------CPEYFKWIHEDLK 119
Query: 119 PWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYY--DCVQSRAMFTIWGFLQLLRRYP 176
PW K IT + +AK+ A R+++ GK Y++ Y + +Q+R +FT+WG LQLLR+YP
Sbjct: 120 PWKKKGITREMVEKAKKTAHFRLVVKNGKGYLEKYKNKEAIQTRDVFTVWGILQLLRKYP 179
Query: 177 GMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPW 236
G +PD+++MFDC DKPV+ G P P+F YC + DI FPDWSFWGW+E+N++PW
Sbjct: 180 GKIPDLELMFDCNDKPVV---PIGLDPPPVFGYCADRWTQDIVFPDWSFWGWAEINIKPW 236
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWA 294
KDIK G++ WK++ P+AYWKGNP + R++ + CN S W + Q+W
Sbjct: 237 EHLLKDIKKGNKRVKWKDREPYAYWKGNP-YTAATRLDFLNCNVSTAQDWNLRLFTQDWI 295
Query: 295 EEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIP 354
+E++ GF S L++QC +RYK+Y EGYAWSVS KYIL+C+S AL++ +Y DFF+R L P
Sbjct: 296 KESEQGFNHSNLADQCTYRYKVYIEGYAWSVSEKYILACDSPALLVKPRYYDFFTRSLQP 355
Query: 355 TKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITE 413
++++PI D C+SIK VDWGN + +A++IGKAG F+ E L M+ +YDYM HL+ E
Sbjct: 356 LQHYWPIRDTDKCKSIKHAVDWGNNHEQKAQEIGKAGSKFIQEELNMNYIYDYMFHLLNE 415
Query: 414 YSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL--EKAAASPSPYPPCTL 463
YSKLL ++P P A E C E++ C + E PS PC+L
Sbjct: 416 YSKLLKFEPRVPEEAVELCSETMACTRSYSMEKEFMGESMVREPSTKDPCSL 467
>gi|449452346|ref|XP_004143920.1| PREDICTED: O-glucosyltransferase rumi-like [Cucumis sativus]
Length = 514
Score = 361 bits (926), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 166/375 (44%), Positives = 248/375 (66%), Gaps = 11/375 (2%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
P CP++F+ IH+DL+PWA IT + + K A R+ +VRG +YV+ Y +Q
Sbjct: 107 GPSGRSVCPEYFRWIHEDLKPWAAGGITREMVEKGKATAHFRLAVVRGIVYVEHYKKSIQ 166
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL------PLFRYCTN 212
+R +FTIWG LQLLRRYPG +PD+++MFDC D+PV+ ++ + + P+FRYC +
Sbjct: 167 TRDLFTIWGILQLLRRYPGQIPDLELMFDCDDRPVVKSADYRNAGVDTVEAPPVFRYCGD 226
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+ DI FPDWSFWGW+E+N++PW K++K G++ + W ++ FAYWKGNP V + R
Sbjct: 227 EETLDIVFPDWSFWGWAEINIRPWENLLKELKKGNEKRKWMKREAFAYWKGNPYV-ADTR 285
Query: 273 VELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYI 330
+L+KCN S W A + Q+W +E++ G+K+SKL+NQC +RYKIY EGY WSVS KYI
Sbjct: 286 QDLLKCNLSLQNDWNARLYIQDWIQESQQGYKQSKLANQCTYRYKIYIEGYGWSVSEKYI 345
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKA 390
L+C+S+ L++ + DFFSR L P +++P+ C+SIK V WGN++ +A+ IGK
Sbjct: 346 LACDSMTLLVKPNFYDFFSRSLEPLHHYWPLSDDHKCKSIKFAVHWGNSHKQKAQDIGKT 405
Query: 391 GQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLAD-PKQRQNL 448
+F++ L M+ VYDYM HL+ Y+KLL ++P P+ A E C E++ C D P+++
Sbjct: 406 ASNFIQQELRMENVYDYMFHLLNHYAKLLRFQPEIPTGAMEVCSETMACPRDGPEKKFMK 465
Query: 449 EKAAASPSPYPPCTL 463
E +PS PC++
Sbjct: 466 ESMVKTPSLTIPCSM 480
>gi|294461229|gb|ADE76177.1| unknown [Picea sitchensis]
Length = 492
Score = 360 bits (925), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 188/452 (41%), Positives = 276/452 (61%), Gaps = 12/452 (2%)
Query: 19 VISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRR--SQAYKI 76
++ L+ + ALFL DD K + LA NL W + K + S K+
Sbjct: 13 LLVLASLLGPALFLKRWTDD-VWKERALAVTNLF---WANQSEIIVKTNVWQGISVTRKM 68
Query: 77 VHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRF 136
++C+ T SA + +P ++ + + CP +FK I +D++PWAKS IT+ + A
Sbjct: 69 LNCSPSTVCSASSWLPLVKKSSYKDEAAFCPFYFKWIRQDMKPWAKSGITIDMVEAANPE 128
Query: 137 AALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDK 196
A+ R+ +V G+++++ Y C QSR +FTIWG QLL+ YP ++PD+D+MF+C D PVI +
Sbjct: 129 ASFRLTVVNGRMFIESYRKCYQSRDLFTIWGIAQLLKFYPRLLPDLDLMFNCDDNPVIHR 188
Query: 197 KEHGSF---PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWK 253
++ P PLFRY ++ FDI FPDWSFWGW E+ PW K+I++GSQ W+
Sbjct: 189 GDYNDSTKPPPPLFRYSGSEDTFDIVFPDWSFWGWPEIRTPPWETLAKEIQNGSQKVKWE 248
Query: 254 EKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
++ P AYWKGNP + R +LM C + WG + Q+W +E + GF+ SKLS+QC+HR
Sbjct: 249 DRDPTAYWKGNP-YMGQGRQDLMNCIHRRHWGGRLYNQDWDKETRQGFRHSKLSDQCHHR 307
Query: 314 YKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSV 373
YKIY EG AWSVSLK I++C+S L+I+ QY DF+ RGL+P ++++PI + C SI+
Sbjct: 308 YKIYIEGNAWSVSLKNIMACDSPTLLITPQYYDFYLRGLVPQRHYWPIRADKKCDSIQFA 367
Query: 374 VDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEAC 432
VDWGN +P EA +I K F+++ L M VYDYM H++ EYSKLL YKP+ A E C
Sbjct: 368 VDWGNKHPKEAMEIAKEAIKFIQNELKMSNVYDYMFHILNEYSKLLKYKPSVSEKAAEYC 427
Query: 433 VESLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
E++ C A+ + + + + S PPC L
Sbjct: 428 SETIFCFANEAEEDYMKDSVVTTASASPPCKL 459
>gi|87241560|gb|ABD33418.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
Length = 397
Score = 360 bits (925), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 174/390 (44%), Positives = 250/390 (64%), Gaps = 10/390 (2%)
Query: 83 TCLSAMNPIPERRRVASPQK-VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRI 141
TC +PI + P TCP+ F+ IH+DLEPW + IT + K + LRI
Sbjct: 4 TCQKDHHPIQAFHQDQDPSSSTSTCPEHFRWIHEDLEPWKSTGITRETVESGKNISQLRI 63
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HG 200
+I +GK YV+ Y D Q+R +FT+WG +QLLR YPG VPD++++F+ D+ V+DKK G
Sbjct: 64 VIKKGKAYVETYADSFQTRDLFTVWGIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQG 123
Query: 201 SFPL---PLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLP 257
S + P+F YC + DI FPDWSFWGW+E ++PW + KDI+ ++ +WK+++P
Sbjct: 124 SQSVTLPPIFSYCGQNDALDIVFPDWSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIP 183
Query: 258 FAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYK 315
+A+WKGN V S R +L +CN D W A I +W +E + GF +KL +QC HRYK
Sbjct: 184 YAFWKGNTHV-SSQRYKLRQCNVTDQHDWNARIYSVHWNKEIEHGFNSTKLEDQCTHRYK 242
Query: 316 IYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVD 375
IY EG +WSVS KYI++C+S+ L I +Y DFF+R L+P K+++PI ++C+ IK VD
Sbjct: 243 IYVEGRSWSVSEKYIIACDSMTLFIKPRYYDFFTRSLVPYKHYWPINKQNMCQDIKYAVD 302
Query: 376 WGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVE 434
WGN +P +AE+IG+ G F+ E++ M VYDYMLHL+TEY+KL+ ++ P+ A E C E
Sbjct: 303 WGNTHPGKAEEIGREGTRFIEENVNMKLVYDYMLHLLTEYAKLMRFEATIPAGAVEVCSE 362
Query: 435 SLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
+L C R+ + E SPS PCT+
Sbjct: 363 NLACPMGGIWREFMVESMVKSPSDTLPCTM 392
>gi|356497228|ref|XP_003517464.1| PREDICTED: KDEL motif-containing protein 1-like [Glycine max]
Length = 522
Score = 360 bits (924), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 169/363 (46%), Positives = 242/363 (66%), Gaps = 6/363 (1%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
CP++F+ IH+D+ W + I+ + AK+ A R+++ RG++YV+ Y +Q+R +FT
Sbjct: 120 VCPEYFRWIHEDVGAWKERGISREMVERAKKSAHFRLVVKRGRVYVERYKKSIQTREVFT 179
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWS 224
+WG +QLLR+YPG V D+++MFDC D PVI + + P PLFRYC + DI FPDWS
Sbjct: 180 MWGIVQLLRKYPGKVADLELMFDCDDLPVI-RGSSLAGPPPLFRYCGDRWTDDIVFPDWS 238
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL- 283
FWGW+E+N++PW K+++ G++ W ++ P+AYWKGNP V + R +L+KCN S
Sbjct: 239 FWGWAEINIRPWEHVLKEMEKGNRRIKWNDREPYAYWKGNPFV-AETRQDLLKCNVSTTQ 297
Query: 284 -WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
W A + Q+W +E++ GF S L++QC HRYKIY EGYAWSVS KYIL+C+SV L++
Sbjct: 298 DWNARLYVQDWIQESQQGFNNSNLASQCTHRYKIYIEGYAWSVSEKYILACDSVTLMVKP 357
Query: 343 QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMD 401
++ DFF R L P ++++PI C+SIK VDWGN + EA+KIGKA F+ E L MD
Sbjct: 358 RFYDFFIRSLQPMQHYWPIRDKGKCKSIKHAVDWGNNHKEEAQKIGKAASKFIQEELKMD 417
Query: 402 RVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPYPP 460
VYDYM HL+ EY+KLL ++P P A E CVE++ C +R+ + E PS P
Sbjct: 418 YVYDYMFHLLNEYAKLLKFEPRVPEGAEELCVEAMACTRSGLERKFMTESMVREPSTKAP 477
Query: 461 CTL 463
C+L
Sbjct: 478 CSL 480
>gi|115459624|ref|NP_001053412.1| Os04g0534000 [Oryza sativa Japonica Group]
gi|38346998|emb|CAE04582.2| OSJNBb0039L24.21 [Oryza sativa Japonica Group]
gi|113564983|dbj|BAF15326.1| Os04g0534000 [Oryza sativa Japonica Group]
gi|116310457|emb|CAH67461.1| OSIGBa0159I10.6 [Oryza sativa Indica Group]
gi|125549144|gb|EAY94966.1| hypothetical protein OsI_16774 [Oryza sativa Indica Group]
gi|125591102|gb|EAZ31452.1| hypothetical protein OsJ_15588 [Oryza sativa Japonica Group]
gi|215694886|dbj|BAG90077.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 508
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 181/373 (48%), Positives = 240/373 (64%), Gaps = 24/373 (6%)
Query: 109 FFKSIHKDLEPWAKSRITMRHI-MEAKRFAA-LRILIVRGKLYVDPYYDCVQSRAMFTIW 166
+F+ IH+DL PW + IT + A+R+ A R+ +V G+L+V Y C Q+R MFT W
Sbjct: 113 YFRWIHEDLRPWRDAGITREAVDGAARRYGAKFRVTVVAGRLHVARYGRCFQTRDMFTQW 172
Query: 167 GFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPFPD 222
G LQLLRRY G VPD+D+MFDC D PV++ + S P PLF YC ++ DI FPD
Sbjct: 173 GVLQLLRRYQGRVPDLDLMFDCQDLPVVNAGDRRGRTSSSPPPLFGYCGSEPTLDIAFPD 232
Query: 223 WSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS- 281
WSFWGW E+N++PW +I G+ A +W + P+AYWKGNP V + R L++CN S
Sbjct: 233 WSFWGWPELNIKPWETLRGEIADGNAAVNWTGRAPYAYWKGNPTVGADRR-NLLRCNASG 291
Query: 282 -KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALII 340
+ W A I Q+W +E +DGF++S L+ QC HRYKIY EG WSVS KYIL+C++VALI+
Sbjct: 292 KRDWNARIYEQDWRKEVRDGFRESDLAKQCTHRYKIYIEGRGWSVSEKYILACDAVALIV 351
Query: 341 SQQYKDFFSRGLIPTKNHFPIPSA--DLCRSIKSVVDWGNANPSEAEKI-GKAGQDFMES 397
+Y DFFSRGL+P ++++PIP +CRSIK VDWGNA+ +A++I G A + E
Sbjct: 352 RPRYHDFFSRGLMPLQHYWPIPGGGRGMCRSIKFAVDWGNAHADKAQEIAGNATRFIQED 411
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEK------- 450
LTMDRVYDYM HL+TEY+KLL YKP P A E VES+ + R+ LE+
Sbjct: 412 LTMDRVYDYMFHLLTEYAKLLKYKPTVPDRAVEVTVESMT-----RGRRGLERQFMVDTM 466
Query: 451 AAASPSPYPPCTL 463
A PC L
Sbjct: 467 VEAGSGTGEPCEL 479
>gi|147810897|emb|CAN69493.1| hypothetical protein VITISV_037876 [Vitis vinifera]
Length = 378
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 173/342 (50%), Positives = 233/342 (68%), Gaps = 9/342 (2%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
+ AKR A +++I+ G+ YV+ Y Q+R +FT+WG LQLLRRYPG VPD+++MFDC+
Sbjct: 2 VERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDCV 61
Query: 190 DKPVIDKKE-HG---SFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH 245
D PVI KE HG + P PLFRYC +DA DI FPDWSFWGW E+ ++PW KD+K
Sbjct: 62 DWPVIKSKEYHGPNTTAPPPLFRYCGDDATLDIVFPDWSFWGWPEIXIKPWESLLKDLKE 121
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKK 303
G++ W E+ P+AYWKGNP V + R++L+KCN D + W A + Q+W E+++G+K+
Sbjct: 122 GNKRSRWMEREPYAYWKGNPAV-AATRLDLLKCNVSDKQDWNARVYTQDWIXESQEGYKQ 180
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
S L++QC HRYKIY EG AWSVS KYIL+C+SV L++ Y DFF+R L+P +++PI
Sbjct: 181 SDLASQCIHRYKIYIEGSAWSVSQKYILACDSVTLLVKPHYYDFFTRSLMPVHHYWPIRE 240
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
D CRSIK VDWGN + +A+ IGKA DF+ E L MD VYDYM HL+ EY+KLL +KP
Sbjct: 241 DDKCRSIKFAVDWGNRHKQKAQSIGKAASDFIQEDLKMDNVYDYMFHLLNEYAKLLKFKP 300
Query: 423 APPSSAFEACVESLLCLADP-KQRQNLEKAAASPSPYPPCTL 463
P A E C E + C A+ K++ +E P PCT+
Sbjct: 301 TVPEKAVELCSERMGCGAEGLKKKFMMESMVKYPMDASPCTM 342
>gi|116310458|emb|CAH67462.1| OSIGBa0159I10.7 [Oryza sativa Indica Group]
gi|218195275|gb|EEC77702.1| hypothetical protein OsI_16775 [Oryza sativa Indica Group]
Length = 537
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 175/364 (48%), Positives = 245/364 (67%), Gaps = 10/364 (2%)
Query: 109 FFKSIHKDLEPWAKSR-ITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWG 167
+F+ IH+DL PW + IT + A+ A+ R+L++ G+ +V + Q+R +FTIWG
Sbjct: 142 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWG 201
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVID----KKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
LQLLRRYPG VPD+D+MFDC D PV+ + +H +F PLF YC +D DI FPDW
Sbjct: 202 VLQLLRRYPGRVPDLDLMFDCADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDW 261
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS-- 281
SFWGW E+N++PW+ +D+K G+ W +++P+AYWKGNP V + R EL+ CN S
Sbjct: 262 SFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAV-AVTRQELVNCNVSTT 320
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
K W A I +Q+W E+K G+K S L +QC HRYKIY EG AWSVS KYIL+C+S+ L+++
Sbjct: 321 KDWNARIYKQDWFRESKAGYKDSNLGSQCTHRYKIYIEGSAWSVSQKYILACDSMTLLVT 380
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMES-LTM 400
+Y DFFSR L+P ++++P+ + + C SIK VDWGN++ A++IGK DF+E + M
Sbjct: 381 PRYYDFFSRSLMPIQHYWPVHNDNKCDSIKYAVDWGNSHKQLAQRIGKQASDFIEEDVNM 440
Query: 401 DRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPYP 459
DRVYDYMLHL+TEY+KLL ++P P A E C +SL C A+ +++ L E S
Sbjct: 441 DRVYDYMLHLLTEYAKLLRFRPIKPPEAVEICPDSLACQAEGLEKKFLMESMVKSARDAG 500
Query: 460 PCTL 463
PC L
Sbjct: 501 PCDL 504
>gi|147800396|emb|CAN66409.1| hypothetical protein VITISV_020976 [Vitis vinifera]
Length = 439
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 173/367 (47%), Positives = 239/367 (65%), Gaps = 19/367 (5%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
CPD+F+ IH+DL+PW + I+ + AKR A R++IV+GK+Y++ Y +Q+R +FT
Sbjct: 46 VCPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVKGKVYIEKYKKSIQTRDVFT 105
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPF 220
IWG LQLLRRYPG + D+++ FDC D+PVI +H + P PLFRYC + D+ F
Sbjct: 106 IWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNSTSPPPLFRYCGDRWTLDVVF 165
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PDWSFWGW E+N++PW KD+K G+ W E+ P+AYWKGNP +++ R +L+ CN
Sbjct: 166 PDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYWKGNP-LVAETRRDLLTCNV 224
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
D + W A + Q+W E++ G+K+S +SNQC HRYKIY EG+AWSVS KYIL+C+SV L
Sbjct: 225 SDVQDWNARLFVQDWMLESQQGYKQSDVSNQCTHRYKIYIEGWAWSVSEKYILACDSVTL 284
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
++ +Y DFF R L P +++PI D CRSIK A+ IGK DF+ E
Sbjct: 285 MVKPRYYDFFMRSLQPVHHYWPIKDNDKCRSIKFA----------AQAIGKXASDFIQEE 334
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPS 456
L MD VYDYM HL+ EY+KLL +KP P A E C E++ C A+ +++ +E SPS
Sbjct: 335 LKMDYVYDYMFHLLNEYAKLLRFKPTIPEGAVEVCSETVACSAEGVEKKFMMESLVNSPS 394
Query: 457 PYPPCTL 463
PC L
Sbjct: 395 VTSPCAL 401
>gi|15233070|ref|NP_191686.1| uncharacterized protein [Arabidopsis thaliana]
gi|6850894|emb|CAB71057.1| putative protein [Arabidopsis thaliana]
gi|27754477|gb|AAO22686.1| unknown protein [Arabidopsis thaliana]
gi|28393951|gb|AAO42383.1| unknown protein [Arabidopsis thaliana]
gi|332646660|gb|AEE80181.1| uncharacterized protein [Arabidopsis thaliana]
Length = 498
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 187/474 (39%), Positives = 273/474 (57%), Gaps = 38/474 (8%)
Query: 1 MVFPSRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFP 60
+ FP ++ ++ ++ IS +++ L +LD+ FA L EP P+
Sbjct: 14 VTFPRKSIVKATVFIVVLFISAAILDLLG-YLDFNA--FAGLK--LTTKTKEPNPYG--- 65
Query: 61 QRTFKEESRRSQAYKIVHCTYLTCLSAMNPIPERRRV----ASPQKVQTCPDFFKSIHKD 116
C ++ S+ PI + R+ + K TCP +F+ IH+D
Sbjct: 66 ------------------CDFVQNQSSQTPISQNRKSRLNPNNSSKSSTCPSYFRWIHED 107
Query: 117 LEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYP 176
L PW ++ IT I EA R A R++I GK YV Y +Q+R FT+WG LQLLR YP
Sbjct: 108 LRPWKQTGITRGMIEEASRTAHFRLVIRNGKAYVKRYKKSIQTRDEFTLWGILQLLRWYP 167
Query: 177 GMVPDVDIMFDCMDKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNL 233
G +PD+++MFD D+PV+ D P P+FRYC++DA DI FPDWSFWGW+EVN+
Sbjct: 168 GKLPDLELMFDADDRPVVRSVDFIGQQKEPPPVFRYCSDDASLDIVFPDWSFWGWAEVNV 227
Query: 234 QPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQ 291
+PW + + IK G+ WK+++ +AYW+GNP V P R +L+KCN + + W + Q
Sbjct: 228 KPWGKSLEAIKEGNSMTQWKDRVAYAYWRGNPYV-DPGRGDLLKCNATEHEEWNTRLYIQ 286
Query: 292 NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG 351
+W +E K+GFK S L NQC HRYKIY EG+AWSVS KYI++C+S+ L + ++ DF+ RG
Sbjct: 287 DWDKETKEGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVKPRFYDFYIRG 346
Query: 352 LIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHL 410
++P ++++PI C S+K V WGN + +A +IG+ G F+ E + M VYDYM HL
Sbjct: 347 MMPLQHYWPIRDDSKCTSLKFAVHWGNTHEDKAREIGEVGSRFIREEVNMQYVYDYMFHL 406
Query: 411 ITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
+ EY+ LL +KP P A E +S+ C A + R E SPS PC +
Sbjct: 407 LKEYATLLKFKPEIPLDAEEITPDSMGCPATERWRDFKAESMIISPSEESPCEM 460
>gi|242073834|ref|XP_002446853.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
gi|241938036|gb|EES11181.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
Length = 555
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 242/366 (66%), Gaps = 15/366 (4%)
Query: 109 FFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWG 167
+F+ IH+DL PW A IT + A+ A R++++ G+ YV Q+R +FTIWG
Sbjct: 160 YFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGRAYVHRLRPAFQTRDLFTIWG 219
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVID----KKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
LQLLRRYPG VPD+D+MFD +D P++ + ++ PLFRYC +D DI FPDW
Sbjct: 220 VLQLLRRYPGRVPDLDLMFDTVDWPIVRAHLYRGKYAEMLPPLFRYCGDDKTLDIVFPDW 279
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL 283
SFWGW E+N++PW+ +D+K G+ W ++ P+AYWKGNP V S R EL+KCN S
Sbjct: 280 SFWGWPEINIKPWDALQEDLKDGNNRVRWMDREPYAYWKGNPSV-SATRKELVKCNVSST 338
Query: 284 --WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
W A I Q+W +E+K G+K S LS+QC HRYKIY EG AWS+S KYIL+C+S+ L+++
Sbjct: 339 HDWNARIYAQDWFKESKAGYKDSDLSSQCAHRYKIYIEGSAWSISEKYILACDSMTLLVT 398
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
+Y DFFSR L+PT++++P+ + C SIK VDWGN++ A+ IGK +F+ E L M
Sbjct: 399 PRYYDFFSRSLMPTQHYWPVRDDNKCASIKYAVDWGNSHKQMAQHIGKQASNFIQEELNM 458
Query: 401 DRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAA------AS 454
D VYDYMLHL+TEY+KLL +KP P A E C ESL+C A+ +++ L ++ A
Sbjct: 459 DHVYDYMLHLLTEYAKLLKFKPTKPPEAVEVCSESLVCQAEGLEKKFLVESMVKFARDAG 518
Query: 455 PSPYPP 460
P PP
Sbjct: 519 PCDLPP 524
>gi|15222413|ref|NP_176531.1| uncharacterized protein [Arabidopsis thaliana]
gi|332195976|gb|AEE34097.1| uncharacterized protein [Arabidopsis thaliana]
Length = 578
Score = 354 bits (909), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 175/376 (46%), Positives = 248/376 (65%), Gaps = 16/376 (4%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
++CPD+FK IH+DL+PW ++ IT + K A R++I+ GK++V+ Y +Q+R F
Sbjct: 168 RSCPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAF 227
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHG-------SFPLPLFRYCTNDAHF 216
T+WG LQLLR+YPG +PDVD+MFDC D+PVI + + P PLFRYC +
Sbjct: 228 TLWGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTV 287
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
DI FPDWSFWGW E+N++ W++ K+++ G + K + E+ +AYWKGNP V SP R +L+
Sbjct: 288 DIVFPDWSFWGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLL 347
Query: 277 KCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
CN S L W A I Q+W E + GF+ S ++NQC +RYKIY EGYAWSVS KYIL+C+
Sbjct: 348 TCNLSSLHDWNARIFIQDWISEGQRGFENSNVANQCTYRYKIYIEGYAWSVSEKYILACD 407
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF 394
SV L++ Y DFFSR L P ++++PI D CRSIK VDW N + +A++IG+ +F
Sbjct: 408 SVTLMVKPYYYDFFSRTLQPLQHYWPIRDKDKCRSIKFAVDWLNNHTQKAQEIGREASEF 467
Query: 395 ME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKA-- 451
M+ L+M+ VYDYM HL+ EYSKLL YKP P ++ E C E+L+C ++ + ++K
Sbjct: 468 MQRDLSMENVYDYMFHLLNEYSKLLKYKPQVPKNSVELCTEALVCPSEGEDVNGVDKKFM 527
Query: 452 ----AASPSPYPPCTL 463
+ P PC+L
Sbjct: 528 IGSLVSRPHASGPCSL 543
>gi|297840159|ref|XP_002887961.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
lyrata]
gi|297333802|gb|EFH64220.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
lyrata]
Length = 571
Score = 353 bits (906), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 175/374 (46%), Positives = 246/374 (65%), Gaps = 14/374 (3%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
++CPD+FK IH+DL+PW ++ IT + K A R++IV GK++V+ Y +Q+R F
Sbjct: 164 RSCPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVIVNGKVFVENYKKSIQTRDAF 223
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI-------DKKEHGSFPLPLFRYCTNDAHF 216
T+WG LQLLR+YPG +PDVD+MFDC D+PVI + P PLFRYC +
Sbjct: 224 TLWGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNRSNRTAEDVPPPLFRYCGDRWTV 283
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
DI FPDWSFWGW E+N++ W++ K+++ G + K + E+ +AYWKGNP V SP R +L+
Sbjct: 284 DIVFPDWSFWGWQEINIKEWSKVLKEMEEGKKKKKFMEREAYAYWKGNPFVASPSREDLL 343
Query: 277 KCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
CN S L W A I Q+W E + GF+ S ++NQC +RYKIY EGYAWSVS KYIL+C+
Sbjct: 344 TCNLSSLHDWNARIFIQDWISEGQRGFENSNVANQCTYRYKIYIEGYAWSVSEKYILACD 403
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF 394
SV L++ Y DFFSR L P ++++PI D CRSIK VDW N + +A++IG+ +F
Sbjct: 404 SVTLMVKPYYYDFFSRTLQPLQHYWPIRDKDKCRSIKFAVDWLNNHTQKAQEIGREASEF 463
Query: 395 ME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP----KQRQNLE 449
M+ L+M+ VYDYM HL+ EYSKLL YKP P ++ E C E+++C ++ +R +
Sbjct: 464 MQRDLSMENVYDYMFHLLNEYSKLLKYKPQVPKNSVELCTEAMVCPSEDVNGVNKRFMMG 523
Query: 450 KAAASPSPYPPCTL 463
+ P PC+L
Sbjct: 524 SLVSRPHVSSPCSL 537
>gi|42569947|ref|NP_182107.3| downstream target of AGL15 2 [Arabidopsis thaliana]
gi|330255513|gb|AEC10607.1| downstream target of AGL15 2 [Arabidopsis thaliana]
Length = 523
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 162/374 (43%), Positives = 248/374 (66%), Gaps = 9/374 (2%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
+S ++ TCP +F+ IH+DL PW ++ +T + +A+R A R++I+ G++YV Y +
Sbjct: 111 SSHSRISTCPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSI 170
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS----FPLPLFRYCTND 213
Q+R +FT+WG +QLLR YPG +PD+++MFD D+P + K+ P PLFRYC++D
Sbjct: 171 QTRDVFTLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDD 230
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
A DI FPDWSFWGW+EVN++PW++ I+ G++ WK+++ +AYW+GNP+V +P R
Sbjct: 231 ASLDIVFPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNV-APTRR 289
Query: 274 ELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
+L++CN S + W + Q+W E+++GFK S L NQC HRYKIY EG+AWSVS KYI+
Sbjct: 290 DLLRCNVSAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIM 349
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+C+S+ L + + DF+ RG++P ++++PI C S+K V WGN + +A KIG+ G
Sbjct: 350 ACDSMTLYVRPMFYDFYVRGMMPLQHYWPIRDTSKCTSLKFAVHWGNTHLDQASKIGEEG 409
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEK 450
F+ E + M+ VYDYM HL+ EY+KLL +KP P A E + + C A + R +E+
Sbjct: 410 SRFIREEVKMEYVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDIMGCSATGRWRDFMEE 469
Query: 451 AAAS-PSPYPPCTL 463
+ PS PC +
Sbjct: 470 SMVMFPSEESPCEM 483
>gi|147791844|emb|CAN70600.1| hypothetical protein VITISV_027960 [Vitis vinifera]
Length = 521
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 170/368 (46%), Positives = 237/368 (64%), Gaps = 9/368 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCP++F+ I++DL PW + IT + +A+ A++RI++V GK+YV+ Y ++R F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIP 219
TIWG LQLLR YPG +PD D+MF+C D+P+I + + P PLF YC +D +DI
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIV 230
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC- 278
FPDWSFWGW E N++PWN KD+K G+ W ++ P+AYWKGN + +R EL KC
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVK-MGVVRKELFKCR 289
Query: 279 -NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
D + W A + +W E + GFK S L++QC HRYKIY EG AWSVS KYIL+C+SV
Sbjct: 290 NTDEQDWNARLYIMDWGREVQSGFKTSDLASQCTHRYKIYTEGIAWSVSEKYILACDSVT 349
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
L++ QY +FF+R L P +++PI D+C+SIK DW N + +A+KIGKAG F+ E
Sbjct: 350 LLVKPQYYEFFTRSLQPLVHYWPIKHKDMCKSIKFATDWCNNHTEKAQKIGKAGSSFVQE 409
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASP 455
+ M VYDYM HL++ Y+KLL YKP P A E C E + C + ++ L+ SP
Sbjct: 410 EIKMKFVYDYMFHLLSMYAKLLKYKPTVPPMAVEFCPEMMACAVEGLEKDYKLQSMVKSP 469
Query: 456 SPYPPCTL 463
S PC +
Sbjct: 470 SDTGPCIM 477
>gi|3386611|gb|AAC28541.1| unknown protein [Arabidopsis thaliana]
Length = 517
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 162/374 (43%), Positives = 248/374 (66%), Gaps = 9/374 (2%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
+S ++ TCP +F+ IH+DL PW ++ +T + +A+R A R++I+ G++YV Y +
Sbjct: 105 SSHSRISTCPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSI 164
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS----FPLPLFRYCTND 213
Q+R +FT+WG +QLLR YPG +PD+++MFD D+P + K+ P PLFRYC++D
Sbjct: 165 QTRDVFTLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDD 224
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
A DI FPDWSFWGW+EVN++PW++ I+ G++ WK+++ +AYW+GNP+V +P R
Sbjct: 225 ASLDIVFPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNV-APTRR 283
Query: 274 ELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
+L++CN S + W + Q+W E+++GFK S L NQC HRYKIY EG+AWSVS KYI+
Sbjct: 284 DLLRCNVSAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIM 343
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+C+S+ L + + DF+ RG++P ++++PI C S+K V WGN + +A KIG+ G
Sbjct: 344 ACDSMTLYVRPMFYDFYVRGMMPLQHYWPIRDTSKCTSLKFAVHWGNTHLDQASKIGEEG 403
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEK 450
F+ E + M+ VYDYM HL+ EY+KLL +KP P A E + + C A + R +E+
Sbjct: 404 SRFIREEVKMEYVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDIMGCSATGRWRDFMEE 463
Query: 451 AAAS-PSPYPPCTL 463
+ PS PC +
Sbjct: 464 SMVMFPSEESPCEM 477
>gi|359489776|ref|XP_002273194.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 521
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 170/368 (46%), Positives = 237/368 (64%), Gaps = 9/368 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCP++F+ I++DL PW + IT + +A+ A++RI++V GK+YV+ Y ++R F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID----KKEHGSFPLPLFRYCTNDAHFDIP 219
TIWG LQLLR YPG +PD D+MF+C D+P+I + + P PLF YC +D +DI
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIV 230
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC- 278
FPDWSFWGW E N++PWN KD+K G+ W ++ P+AYWKGN + +R EL KC
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVK-MGVVRKELFKCR 289
Query: 279 -NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
D + W A + +W E + GFK S L++QC HRYKIY EG AWSVS KYIL+C+SV
Sbjct: 290 NTDEQDWNARLYIMDWGREVQSGFKTSDLASQCTHRYKIYTEGIAWSVSEKYILACDSVT 349
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
L++ QY +FF+R L P +++PI D+C+SIK DW N + +A+KIGKAG F+ E
Sbjct: 350 LLVKPQYYEFFTRSLQPLVHYWPIKHKDMCKSIKFATDWCNNHTEKAQKIGKAGSSFVQE 409
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASP 455
+ M VYDYM HL++ Y+KLL YKP P A E C E + C + ++ L+ SP
Sbjct: 410 EIKMKFVYDYMFHLLSMYAKLLKYKPTVPPMAVEFCPEMMACAVEGLEKDYKLQSMVKSP 469
Query: 456 SPYPPCTL 463
S PC +
Sbjct: 470 SDTGPCIM 477
>gi|413923257|gb|AFW63189.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
Length = 578
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 181/417 (43%), Positives = 260/417 (62%), Gaps = 40/417 (9%)
Query: 83 TCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRI 141
TC + +P P A+ TCP +F+ IH+DL PW A IT + A+ A R+
Sbjct: 128 TCAAFPDPPP---LPANSNASSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRL 184
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID----KK 197
+++RG+ Y++ Q+R +FTIWG LQLLRRYPG VPD+D+MFDC+D PV+ +
Sbjct: 185 VVIRGRAYIERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQG 244
Query: 198 EHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLP 257
E+ + PLFRYC ++ D+ FPDWSFWGW E+N++PW+ K++ G++ W + P
Sbjct: 245 ENATLLPPLFRYCGDNETLDVVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREP 304
Query: 258 FAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHR-- 313
+AYWKGNPDV + R EL+KCN S W A I +Q+W +E K G+K+S L++QC HR
Sbjct: 305 YAYWKGNPDV-AVTRQELVKCNVSSKHEWNARIYKQDWLKEIKAGYKQSDLASQCTHRFL 363
Query: 314 -------------------------YKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF 348
YKIY EG AWSVS KYIL+C+S+ L+++ +Y DF+
Sbjct: 364 FFLLADKLKYTLSVHNTEILSLGDRYKIYIEGSAWSVSEKYILACDSMTLVVTPKYYDFY 423
Query: 349 SRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME-SLTMDRVYDYM 407
SR L+P ++++PI + C SIK VDWGN++ +A++IGK G +F++ L+M+ VYDYM
Sbjct: 424 SRVLMPMQHYWPIWDDNKCSSIKFAVDWGNSHKQKAQRIGKQGSNFIQKELSMEYVYDYM 483
Query: 408 LHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAA-SPSPYPPCTL 463
HL+TEY+KLL +KP P A E C ESL C A ++++ +E + S S PC L
Sbjct: 484 FHLLTEYAKLLRFKPRKPPEAIEVCPESLACQAIGREKKFMEDSMVRSASDAGPCDL 540
>gi|49388103|dbj|BAD25236.1| unknown protein [Oryza sativa Japonica Group]
gi|215766221|dbj|BAG98449.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 375
Score = 351 bits (900), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 168/339 (49%), Positives = 236/339 (69%), Gaps = 9/339 (2%)
Query: 133 AKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKP 192
A+ A R++++RG+ YV+ Q+R +FTIWG LQLLRRYPG VPD+D+MFDC+D P
Sbjct: 5 ARDTANFRLVVLRGRAYVERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWP 64
Query: 193 VID----KKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQ 248
V+ + E+ + PLFRYC +D D+ FPDWSFWGW E+N++PW+ KD+ G++
Sbjct: 65 VVQADRYQGENATAMPPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNK 124
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDGFKKSKL 306
W ++ P+AYWKGNPDV + R EL+KCN S W A I +Q+W +E+K G+K+S L
Sbjct: 125 RVKWVDREPYAYWKGNPDVATK-RKELVKCNVSSKHEWNARIYKQDWIKESKAGYKQSDL 183
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADL 366
++QC HRYKIY EG AWSVS KYIL+CNS+ L+++ +Y DFFSR L+PT++++P+ +
Sbjct: 184 ASQCTHRYKIYIEGSAWSVSEKYILACNSMTLVVTPKYYDFFSRVLMPTQHYWPVRDDNK 243
Query: 367 CRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPP 425
C SIK VDWGN+N +A+KIGK +F+ + L+MD +YDYM HL+TEY+KLL +KP P
Sbjct: 244 CSSIKHAVDWGNSNKKKAQKIGKQASNFIQQELSMDYIYDYMFHLLTEYAKLLRFKPTKP 303
Query: 426 SSAFEACVESLLCLADPKQRQNLEKA-AASPSPYPPCTL 463
A E C E L C A ++R+ +E + S + PC L
Sbjct: 304 PEAIEICPELLACQAIGRERKFMEDSMVKSANDAGPCDL 342
>gi|356506263|ref|XP_003521906.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
Length = 476
Score = 350 bits (899), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 169/367 (46%), Positives = 232/367 (63%), Gaps = 9/367 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CP++FK IH+DL+PW IT + K + R++IV GK YV+ Y Q+R +FT
Sbjct: 79 SCPNYFKWIHEDLKPWKSKGITRDMVERGKNVSHFRLVIVNGKAYVEKYDKVYQTRDVFT 138
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE----HGSFPLPLFRYCTNDAHFDIPF 220
IWG LQLLR YPG +PD+D+MF C DKPV+ KK+ P P+F YC ++ DI F
Sbjct: 139 IWGILQLLRLYPGKIPDLDLMFQCGDKPVVLKKDFQGPQAMSPPPVFHYCGDENAHDIVF 198
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW E+N+ PW I G++ WK++ P+A+WKGN ++ +R EL KCN
Sbjct: 199 PDWSFWGWPEINIGPWETTLHKILEGNKMIKWKDRTPYAFWKGNL-AMADIRRELGKCNP 257
Query: 281 SKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
+K W A I W +E + F+ SKL NQCN RYKIY EG AWSVS KYI+ C+S+ L
Sbjct: 258 TKEHDWNARIHNIQWNKEEANNFESSKLENQCNFRYKIYVEGAAWSVSEKYIIGCDSMTL 317
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
I Y +FF+R ++P ++++PI ++C IK VDWGNA+ A+ IG G F+ E+
Sbjct: 318 FIEPTYYEFFTRSMVPLQHYWPISPKNMCEDIKYAVDWGNAHLDNAQVIGNGGTSFIVEN 377
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKAAASPS 456
L VYDYM +L+ EY+KLL +KP P+ A E C ES+ C + ++R +E SPS
Sbjct: 378 LKTKFVYDYMFYLLNEYAKLLKFKPTIPTGAVEICSESMACSVHGLEKRFMVESMVTSPS 437
Query: 457 PYPPCTL 463
PPCT+
Sbjct: 438 DTPPCTM 444
>gi|110736095|dbj|BAF00020.1| hypothetical protein [Arabidopsis thaliana]
Length = 418
Score = 350 bits (898), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 162/374 (43%), Positives = 248/374 (66%), Gaps = 9/374 (2%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
+S ++ TCP +F+ IH+DL PW ++ +T + +A+R A R++I+ G++YV Y +
Sbjct: 6 SSHSRISTCPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSI 65
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS----FPLPLFRYCTND 213
Q+R +FT+WG +QLLR YPG +PD+++MFD D+P + K+ P PLFRYC++D
Sbjct: 66 QTRDVFTLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDD 125
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
A DI FPDWSFWGW+EVN++PW++ I+ G++ WK+++ +AYW+GNP+V +P R
Sbjct: 126 ASLDIVFPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNV-APTRR 184
Query: 274 ELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
+L++CN S + W + Q+W E+++GFK S L NQC HRYKIY EG+AWSVS KYI+
Sbjct: 185 DLLRCNVSAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIM 244
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+C+S+ L + + DF+ RG++P ++++PI C S+K V WGN + +A KIG+ G
Sbjct: 245 ACDSMTLYVRPMFYDFYVRGMMPLQHYWPIRDTSKCTSLKFAVHWGNTHLDQASKIGEEG 304
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEK 450
F+ E + M+ VYDYM HL+ EY+KLL +KP P A E + + C A + R +E+
Sbjct: 305 SRFIREEVKMEYVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDIMGCSATGRWRDFMEE 364
Query: 451 AAAS-PSPYPPCTL 463
+ PS PC +
Sbjct: 365 SMVMFPSEESPCEM 378
>gi|356522630|ref|XP_003529949.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 497
Score = 350 bits (898), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 163/366 (44%), Positives = 236/366 (64%), Gaps = 8/366 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CP++F+ IH+DL+PW + IT + K + R++IV GK Y++ + Q+R +FT
Sbjct: 101 SCPEYFRWIHEDLKPWESTGITRDMVERGKHISHFRLVIVNGKAYIEKFAKSYQTRDVFT 160
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH---GSFPLPLFRYCTNDAHFDIPFP 221
IWG LQLLR YPG +PD+++MF C D+ V+ KK+ P P+F YC + +DI FP
Sbjct: 161 IWGILQLLRLYPGKIPDLELMFQCGDRTVVFKKDFQVPKMSPPPVFHYCGEENSYDIVFP 220
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
DW+FWGW+E++++PW +I G++ WK+++P+A+WKGNP V S +R EL KCN +
Sbjct: 221 DWTFWGWAELSIRPWETTLHNILEGNKLVKWKDRIPYAFWKGNPTV-SIIRRELGKCNTT 279
Query: 282 KL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
+ W A I W E F+ SKL NQC RYKIYAEG WSVS KYI++C+S+ +
Sbjct: 280 EKHDWNARIYDIQWLRERASNFENSKLENQCTFRYKIYAEGITWSVSEKYIIACDSMTMF 339
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
I +Y DFF+R ++P ++++PI + ++C IK VDWGNA+ A+ IG G +++ E+L
Sbjct: 340 IEPRYYDFFTRSMLPLQHYWPINTKNMCEEIKYAVDWGNAHLDNAQAIGNGGTNYIVENL 399
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKAAASPSP 457
M VYDYM HL+ YSKLL +KP P A E C ES+ C L ++ +E SPS
Sbjct: 400 KMKFVYDYMFHLLNRYSKLLKFKPTIPIGAVEICSESMACSLRGLRKSFMVESMVTSPSD 459
Query: 458 YPPCTL 463
PPCT+
Sbjct: 460 TPPCTM 465
>gi|297817438|ref|XP_002876602.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
lyrata]
gi|297322440|gb|EFH52861.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
lyrata]
Length = 496
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 186/470 (39%), Positives = 274/470 (58%), Gaps = 31/470 (6%)
Query: 1 MVFPSRTPARSPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFP 60
+ FP ++ ++ ++ IS +++ FL Y +D A L EP P+
Sbjct: 13 VTFPRKSIVKATVFIVVLFISAALLD----FLGY-LDFNAFAGLKLITKRKEPNPYGCDF 67
Query: 61 QRTFKEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPW 120
+ ++ SQ S +NPI K TCP +F+ IH+DL PW
Sbjct: 68 VQNQSNQTHFSQ----------NRASRLNPIR--------SKPSTCPSYFRWIHEDLRPW 109
Query: 121 AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVP 180
++ IT I EA R A R++I GK YV Y +Q+R FT+WG +QLLR +PG +P
Sbjct: 110 KQTGITRGMIEEASRTAHFRLVIRNGKAYVKRYRKSIQTRDDFTLWGIVQLLRWFPGKLP 169
Query: 181 DVDIMFDCMDKPVIDKKE---HGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWN 237
D+++MFD D+PV+ + P P+FRYC++DA DI FPDWSFWGW+EVN++PW
Sbjct: 170 DLELMFDADDRPVVRSADFIGQQKKPPPVFRYCSDDASLDIVFPDWSFWGWAEVNIKPWG 229
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAE 295
E + IK G+ WKE++ +AYW+GNP V P R +L+KCN +++ W + Q+W +
Sbjct: 230 ESLEAIKEGNNMTQWKERVAYAYWRGNPHV-DPGRGDLLKCNVSENEEWNTRLYIQDWDK 288
Query: 296 EAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPT 355
E+K+G+K S L NQC HRYKIY EG+AWSVS KYI++C+S+ L + ++ DF+ RG++P
Sbjct: 289 ESKEGYKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVKPRFYDFYIRGMMPL 348
Query: 356 KNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEY 414
++++PI C S+K V WGN + +A +IG+ G F+ E + M VYDYM HL+ EY
Sbjct: 349 QHYWPIRDDSKCTSLKFAVHWGNTHVDKAREIGELGSRFIREEVNMKYVYDYMFHLLKEY 408
Query: 415 SKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
+ LL +KP P A E +S+ C A + R + E SPS PC +
Sbjct: 409 ATLLKFKPEIPLDAEEITPDSMGCPATERWRDFMAESMVLSPSEESPCEM 458
>gi|359489778|ref|XP_002271353.2| PREDICTED: protein O-glucosyltransferase 1 [Vitis vinifera]
Length = 455
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 168/373 (45%), Positives = 240/373 (64%), Gaps = 10/373 (2%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQS 159
P CPD+F+ I++DL PWA++ IT + AK A R++I+ GK+Y++ Y Q+
Sbjct: 46 PLSGNACPDYFRWIYEDLRPWAETGITQEMMEAAKDPAFFRLVILDGKVYMEKYKGAFQT 105
Query: 160 RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL-----PLFRYCTNDA 214
R +FTIWG LQLL+ YPG VPD+++MF+C D+P I ++G PLF YC +D
Sbjct: 106 RDVFTIWGILQLLKLYPGKVPDLELMFECGDRPRIKATDYGGRKGKKKVPPLFHYCASDD 165
Query: 215 HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
DI FPDWSFWGW E+N++PWN K+++ G+ W ++ P+AYWKGN S R
Sbjct: 166 TLDIVFPDWSFWGWPEINIKPWNSLRKELEEGNNRTKWMDREPYAYWKGNIRT-SGNRQA 224
Query: 275 LMKC--NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
L KC +++ W A + +W E+ +GFK S L++QC H+YKIY EG AWSVS KYIL+
Sbjct: 225 LFKCRPSNNHDWNARVYDMDWGRESHEGFKDSNLASQCTHKYKIYMEGIAWSVSEKYILA 284
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
C+S++L+ +Y DFF+R L PT +++PI D+CRS+K VDWGN +P +A+KIGKA
Sbjct: 285 CDSMSLVPRSRYYDFFTRSLQPTIHYWPIQQNDICRSVKYAVDWGNKHPQKAQKIGKAAS 344
Query: 393 DF-MESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCL-ADPKQRQNLEK 450
+F +E L M+ VYDYM HL++EY+KL YKP P A E E++ D ++ E
Sbjct: 345 NFVLEELKMNYVYDYMFHLLSEYAKLFKYKPTVPPGAIEIVPETMANTGGDLEKIYKNES 404
Query: 451 AAASPSPYPPCTL 463
+ P+ PCT+
Sbjct: 405 SVKGPATTSPCTM 417
>gi|297824639|ref|XP_002880202.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
lyrata]
gi|297326041|gb|EFH56461.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 163/374 (43%), Positives = 245/374 (65%), Gaps = 9/374 (2%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
+S + TCP +F+ IH+DL PW ++ IT + +A+R A R++I+ G++YV Y + +
Sbjct: 106 SSHSRPSTCPSYFRWIHEDLRPWKETGITRGMLEKARRTAHFRVVILDGRVYVKKYRNSI 165
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTND 213
Q+R +FT+WG +QLLR YPG +PD+++MFD D+P + K+ P PLFRYC++D
Sbjct: 166 QTRDVFTLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGHQHPAPPPLFRYCSDD 225
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
A DI FPDWSFWGW+EVN++PW + I+ G++ W +++ +AYW+GNP+V +P R
Sbjct: 226 ASLDIVFPDWSFWGWAEVNIKPWAKSLVAIEEGNKMTQWTDRVAYAYWRGNPNV-APTRR 284
Query: 274 ELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
+L++CN S + W + Q+W E+++GFK S L NQC HRYKIY EG+AWSVS KYI+
Sbjct: 285 DLLRCNVSAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIM 344
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+C+S+ L + + DF+ RG++P ++++PI C S+K V WGN + +A KIG+ G
Sbjct: 345 ACDSMTLYVRPMFYDFYIRGMMPLQHYWPIRDNTKCTSLKFAVHWGNTHLDQASKIGEEG 404
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-E 449
F+ E + M+ VYDYM HL+ EY+KLL +KP P A E +S+ C A + R + E
Sbjct: 405 SRFIREEVKMEYVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDSMGCQATGRWRDFMAE 464
Query: 450 KAAASPSPYPPCTL 463
PS PC +
Sbjct: 465 SMVMFPSEESPCEM 478
>gi|147798876|emb|CAN65870.1| hypothetical protein VITISV_012498 [Vitis vinifera]
Length = 504
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 168/373 (45%), Positives = 240/373 (64%), Gaps = 10/373 (2%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQS 159
P CPD+F+ I++DL PWA++ IT + AK A R++I+ GK+Y++ Y Q+
Sbjct: 95 PLSGNACPDYFRWIYEDLRPWAETGITQEMMEAAKDPAFFRLVILDGKVYMEKYKGAFQT 154
Query: 160 RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL-----PLFRYCTNDA 214
R +FTIWG LQLL+ YPG VPD+++MF+C D+P I ++G PLF YC +D
Sbjct: 155 RDVFTIWGILQLLKLYPGKVPDLELMFECGDRPRIKASDYGGRKGKKKVPPLFHYCASDD 214
Query: 215 HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
DI FPDWSFWGW E+N++PWN K+++ G+ W ++ P+AYWKGN S R
Sbjct: 215 TLDIVFPDWSFWGWPEINIKPWNSLRKELEEGNNRTKWMDREPYAYWKGNIRT-SGNRQA 273
Query: 275 LMKC--NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
L KC +++ W A + +W E+ +GFK S L++QC H+YKIY EG AWSVS KYIL+
Sbjct: 274 LFKCRPSNNHDWNARVYDMDWGRESHEGFKDSNLASQCTHKYKIYMEGIAWSVSEKYILA 333
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
C+S++L+ +Y DFF+R L PT +++PI D+CRS+K VDWGN +P +A+KIGKA
Sbjct: 334 CDSMSLVPRSRYYDFFTRSLQPTIHYWPIQQNDICRSVKYAVDWGNKHPQKAQKIGKAAS 393
Query: 393 DF-MESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLL-CLADPKQRQNLEK 450
+F +E L M+ VYDYM HL++EY+KL YKP P A E E++ D ++ E
Sbjct: 394 NFVLEELKMNYVYDYMFHLLSEYAKLFKYKPTVPPGAIEIVPETMANTGGDLEKIYKNES 453
Query: 451 AAASPSPYPPCTL 463
+ P+ PCT+
Sbjct: 454 SVKGPATTSPCTM 466
>gi|242094720|ref|XP_002437850.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
gi|241916073|gb|EER89217.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
Length = 552
Score = 347 bits (890), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 166/369 (44%), Positives = 237/369 (64%), Gaps = 14/369 (3%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CPD+F+ IH DL PW + IT + A+ A R+++V G+ YV+ Y Q+R +FT
Sbjct: 160 SCPDYFRYIHDDLRPWRGAGITREAVERARPHAYFRLVVVGGRAYVETYRRAYQTRDVFT 219
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL---PLFRYCTNDAHFDIPFP 221
WG LQLLRRYPG VPD+DIMF C D + + + P P+FRYC + DI FP
Sbjct: 220 QWGVLQLLRRYPGRVPDLDIMFACDDPGQVRAADFPTTPSDAPPVFRYCKDALTLDIVFP 279
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW EV ++PW + ++++ ++ W + P+A+WKGNP+ +R ELM+CN
Sbjct: 280 DWSFWGWPEVGIRPWPQLLEEVRQENERVRWPVRQPYAFWKGNPEGYR-IRHELMRCNAS 338
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
+ + W A + QNW ++GFK S++ QC +RYK+Y EG AWSVS KYIL+C+S L
Sbjct: 339 NGQEWNARLFTQNWHYAIQNGFKDSRIPKQCIYRYKVYVEGNAWSVSEKYILACDSPVLF 398
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
++ ++D SRGL+ K+++PI +C+SIK VDWGN +P++A+ IG+ G F+ E +
Sbjct: 399 VNTPFQDILSRGLVAGKHYWPINREHICKSIKFAVDWGNEHPAQAQLIGEQGSQFVREEM 458
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQR----QNLEKAAAS 454
+MD +YDYMLHL+TEY+KLL YKP P A E C ES+ C A R ++E+ AS
Sbjct: 459 SMDYIYDYMLHLLTEYAKLLRYKPTVPEKAVEICTESMACPAQGMHRDCMMDSMERQVAS 518
Query: 455 PSPYPPCTL 463
+ PCTL
Sbjct: 519 ---FNPCTL 524
>gi|297745248|emb|CBI40328.3| unnamed protein product [Vitis vinifera]
Length = 838
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 159/341 (46%), Positives = 233/341 (68%), Gaps = 8/341 (2%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
+ A+R A R++I+ GK YV+ + +Q+R MFT+WG LQLLR YPG +PD+++MFDC
Sbjct: 2 VDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDCD 61
Query: 190 DKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHG 246
D+PV+ D + + P PLFRYC +D DI FPDWSFWGW+E N++PW KDIK G
Sbjct: 62 DRPVVRMRDFRGPNAAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEG 121
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKS 304
++ WK+++P AYW+GNP V +P R +L+KCN D W + Q+W +++K G+++S
Sbjct: 122 NRRTKWKDRVPLAYWRGNPHV-APTRGDLLKCNVSDKADWNTRLYLQDWGQQSKIGYRQS 180
Query: 305 KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSA 364
L +QC HRYKIY EG+AWSVS KYIL+C+S+ L+I +Y DFF RGL+P ++++PI
Sbjct: 181 NLEDQCTHRYKIYIEGWAWSVSEKYILACDSMTLLIRPRYHDFFIRGLVPLQHYWPIRDN 240
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPA 423
+ CR ++ V+WGN + +A+ +G+ F+ E L MD VYDYM HL+ EY+KLL +KP
Sbjct: 241 NKCRDLRFAVEWGNNHTDKAQTMGETTSKFIQEDLKMDYVYDYMFHLLNEYAKLLKFKPT 300
Query: 424 PPSSAFEACVESLLCLADPKQRQNLEKA-AASPSPYPPCTL 463
P A E C E++ C A+ R+ +E++ +P+ PC+L
Sbjct: 301 IPPGAVEVCAETMACPAEGAWRKFMEESLEKNPTDTTPCSL 341
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 140/422 (33%), Positives = 218/422 (51%), Gaps = 47/422 (11%)
Query: 76 IVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKR 135
+V+CT + A V V+ CP++F+ IH+DL PW + I+ + A+
Sbjct: 392 LVNCTIGSMAQACPANHPTTSVTGKLSVEACPEYFRWIHEDLRPWKSTGISRFAVESAEG 451
Query: 136 FAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID 195
A R++IV GK YV+ Y + +T+ G + G + ++ + C + I
Sbjct: 452 DADFRLVIVNGKAYVEQYRN-------YTLTGDQNWEKESNGGFKNSNLAYKCTHRYKIY 504
Query: 196 KKEHG-SFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNL------QPWN----------- 237
+ G S C + P+P F+ S V L +P N
Sbjct: 505 VEGWGWSVSEKYVLACDSMTLLIKPYPH-DFFTRSMVPLLHYWPIRPRNKCRDLKFAVEW 563
Query: 238 -----EEFKDI-KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVE---LMKCNDSKLWGA-- 286
E+ ++I K GS + K+ F Y D + L E L+K + L GA
Sbjct: 564 GNTHPEKAQEIGKAGSNFIHEELKMDFVY-----DYMFHLLNEYSKLLKFKPAVLPGAVE 618
Query: 287 ---EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQ 343
E + W +E++ G++ S L++QC HRYKIY EG+ WSVS KY+L+C+S+ L+
Sbjct: 619 LCLETMDCIWEKESRTGYQNSNLADQCTHRYKIYVEGWGWSVSEKYVLACDSMTLLTKPY 678
Query: 344 YKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDR 402
+ DFF+R ++P ++++PI + CR +K V+WGN +P +A++IGKAG +F+ E L MD
Sbjct: 679 HHDFFTRSMVPLQHYWPIRPRNKCRDLKFAVEWGNTHPEKAQEIGKAGSNFIHEDLKMDF 738
Query: 403 VYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPK-QRQNLEKAAASPSPYPPC 461
VYDYM HL+ EYSKLL +KPA P A E C+E++ C AD Q+ +E SP+ PC
Sbjct: 739 VYDYMFHLLNEYSKLLKFKPAVPPGAVELCLETMDCSADAVLQKFVMESTVNSPTDSAPC 798
Query: 462 TL 463
++
Sbjct: 799 SM 800
>gi|359489782|ref|XP_002273411.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 525
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 167/368 (45%), Positives = 234/368 (63%), Gaps = 9/368 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCP++F+ I++DL PW + IT + +A+ A++RI++V GK+Y++ Y +R F
Sbjct: 115 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEF 174
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIP 219
TIWG LQLLR YP +PD D+MF+C D+P+I + + P PLF YC +D +DI
Sbjct: 175 TIWGILQLLRMYPEKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIV 234
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC- 278
FPDWSFWGW E N++PWN KD+K G+ W ++ P+AYWKGN + +R EL KC
Sbjct: 235 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVK-MGVVRKELFKCR 293
Query: 279 -NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
D + W A + +W E + GFK S L++QC HRYKIY EG WSVS KYIL+C+SV
Sbjct: 294 NTDEQDWNARLYIMDWGREVQSGFKTSDLASQCTHRYKIYTEGIGWSVSEKYILACDSVT 353
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
L++ QY +FF+R L P +++PI D+C+SIK DW N + +A+KIGKAG F+ E
Sbjct: 354 LLVKPQYYEFFTRSLQPLVHYWPIKHKDMCKSIKFATDWCNNHTEKAQKIGKAGSGFVQE 413
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASP 455
+ M VYDYM HL++ Y+KLL YKP P A E C E + C + ++ L+ SP
Sbjct: 414 EIKMKFVYDYMFHLLSMYAKLLKYKPTVPPMAVEFCPEMMACAVEGLEKDYKLQSMVKSP 473
Query: 456 SPYPPCTL 463
S PC +
Sbjct: 474 SDTGPCIM 481
>gi|224074911|ref|XP_002304487.1| predicted protein [Populus trichocarpa]
gi|222841919|gb|EEE79466.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 227/339 (66%), Gaps = 9/339 (2%)
Query: 133 AKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKP 192
AK A R++IV+GK Y++ Y +Q+R FTIWG LQLLRRYPG +PD+++MFDC D P
Sbjct: 5 AKTTAHFRLIIVKGKAYLEKYKKSIQTRDAFTIWGILQLLRRYPGKIPDLELMFDCDDLP 64
Query: 193 VIDKKEH----GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQ 248
VI ++ + P PLFRYC + DI FPDWSFWGW+E+N++PW++ D+K G+
Sbjct: 65 VIQSSDYRGPNKTGPPPLFRYCGDKWTEDIVFPDWSFWGWAEINIKPWDKLLIDLKEGNN 124
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKL 306
W ++ P+AYWKGNP ++ R +L+ CN D + W A + Q+W E++ FK+S +
Sbjct: 125 RSRWIDREPYAYWKGNP-FVAETRKDLLTCNVSDQQDWNARLFIQDWILESQQEFKQSNV 183
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADL 366
+NQC HRYKIY EGYAWSVS KYIL+C+SV L++ Y DFF+R L P ++++PI D
Sbjct: 184 ANQCTHRYKIYIEGYAWSVSEKYILACDSVTLLVKPHYYDFFTRSLKPVEHYWPIREDDK 243
Query: 367 CRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPP 425
C+SIK VDWGN + +A+ IGKA DF+ E L MD VYDYM HL+ EY+KLL + P P
Sbjct: 244 CKSIKFAVDWGNKHKQKAQAIGKAASDFIQEGLKMDYVYDYMFHLLNEYAKLLRFTPQVP 303
Query: 426 SSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
A E C E + C AD +R+ +E +PS PCT+
Sbjct: 304 EGAAELCSEIMACSADGFEREFMMESLVKAPSTTSPCTM 342
>gi|359489780|ref|XP_002273350.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
Length = 509
Score = 344 bits (882), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 165/366 (45%), Positives = 235/366 (64%), Gaps = 8/366 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCP++F+ I++DL+PW ++ IT + AK A +R+++V GK+Y + Y Q+R +FT
Sbjct: 108 TCPEYFRWIYEDLKPWTETGITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFT 167
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL---PLFRYCTNDAHFDIPFP 221
IWG LQ+LR YPG +PD D+MF+C DKPVI K ++ PLF YC +D DI FP
Sbjct: 168 IWGILQVLRMYPGKLPDFDLMFECGDKPVIKKHDYQGLNATAPPLFHYCGDDETLDIVFP 227
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW E+ ++PW+ KD++ G+ W ++ P+AYWKGN + R EL KC+
Sbjct: 228 DWSFWGWPEIRIKPWSTLRKDLREGNNKTKWVDREPYAYWKGNFK-MGVTRHELSKCSKT 286
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
+ + W A I +W +E ++GFK + LS QC H+YKIYAEG AWSVS KYIL+C+SV L+
Sbjct: 287 NEQDWNARIYNMDWLQEMQNGFKSADLSTQCTHKYKIYAEGAAWSVSEKYILACDSVTLL 346
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+ QY DFF+R L P +++PI D+C+SIK +W N + +A +I AG F+ E L
Sbjct: 347 VKPQYYDFFTRSLQPLVHYWPIKLKDMCKSIKFATEWCNNHTQKAHEIRNAGSSFVQEEL 406
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSP 457
M VYDYM HL++ Y+KL YKP P A E C E+++C Q++ ++ SPS
Sbjct: 407 RMKFVYDYMFHLLSAYAKLFKYKPTVPPGAVEVCPETMVCPVKGLQKKYKIQSMVKSPSD 466
Query: 458 YPPCTL 463
PC +
Sbjct: 467 TGPCVM 472
>gi|21742339|emb|CAD41531.1| OSJNBb0020O11.18 [Oryza sativa Japonica Group]
gi|38346999|emb|CAE04583.2| OSJNBb0039L24.22 [Oryza sativa Japonica Group]
Length = 375
Score = 343 bits (881), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 166/342 (48%), Positives = 232/342 (67%), Gaps = 9/342 (2%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
+ A+ A+ R+L++ G+ +V + Q+R +FTIWG LQLLRRYPG VPD+D+MFDC
Sbjct: 2 LARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDCA 61
Query: 190 DKPVID----KKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH 245
D PV+ + +H +F PLF YC +D DI FPDWSFWGW E+N++PW+ +D+K
Sbjct: 62 DWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLKD 121
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS--KLWGAEILRQNWAEEAKDGFKK 303
G+ W +++P+AYWKGNP V + R EL+ CN S K W A I +Q+W E+K G+K
Sbjct: 122 GNNRVKWLDRVPYAYWKGNPAV-AVTRQELVNCNVSTTKDWNARIYKQDWFRESKAGYKD 180
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
S L +QC HRYKIY EG AWSVS KYIL+C+S+ L+++ +Y DFFSR L+P ++++P+ +
Sbjct: 181 SNLGSQCTHRYKIYIEGSAWSVSQKYILACDSMTLLVTPRYYDFFSRSLMPIQHYWPVHN 240
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+ C SIK VDWGN++ A++IGK DF+ E + MDRVYDYMLHL+TEY+KLL ++P
Sbjct: 241 DNKCDSIKYAVDWGNSHKQLAQRIGKQASDFIEEDVNMDRVYDYMLHLLTEYAKLLRFRP 300
Query: 423 APPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
P A E C +SL C A+ +++ L E S PC L
Sbjct: 301 IKPPEAVEICPDSLACQAEGLEKKFLMESMVKSARDAGPCDL 342
>gi|449446169|ref|XP_004140844.1| PREDICTED: O-glucosyltransferase rumi homolog [Cucumis sativus]
Length = 454
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 165/369 (44%), Positives = 233/369 (63%), Gaps = 10/369 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
TCP++F+ IH+DL+PWA++ IT + + A R++IV G+ YV+ Y + Q R +F
Sbjct: 55 NTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVF 114
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI------DKKEHGSFPLPLFRYCTNDAHFD 217
T+WG LQLLR YP +PD+D+MF C D+P + + + P PLFRYC +D FD
Sbjct: 115 TLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFD 174
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I FPDWSFWGW E+NL+PW E K++K +Q K W ++ +A+WKGN + P R +L+K
Sbjct: 175 IVFPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMP-RYQLLK 233
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
C+ S + Q+W EE K GFK S L++QC RYK+Y EG WSVSLKYIL+C+S+
Sbjct: 234 CSRSTQSKLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMT 293
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIP-SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME 396
L++ + DFF+R L+P +++PI D+C+SIK V+WG + +A+ IGKA FME
Sbjct: 294 LMVKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQKAQAIGKAASKFME 353
Query: 397 S-LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQRQNLEKAAAS 454
L MD+VYDYM H + EYSKLL +KP P +A E + L C + + ++
Sbjct: 354 EQLNMDKVYDYMFHTLNEYSKLLTFKPTIPPNATEISLNDLACPTEGLAAKSMMDTLIKR 413
Query: 455 PSPYPPCTL 463
PS PC L
Sbjct: 414 PSFSSPCFL 422
>gi|297745254|emb|CBI40334.3| unnamed protein product [Vitis vinifera]
Length = 819
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 165/366 (45%), Positives = 235/366 (64%), Gaps = 8/366 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCP++F+ I++DL+PW ++ IT + AK A +R+++V GK+Y + Y Q+R +FT
Sbjct: 28 TCPEYFRWIYEDLKPWTETGITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFT 87
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL---PLFRYCTNDAHFDIPFP 221
IWG LQ+LR YPG +PD D+MF+C DKPVI K ++ PLF YC +D DI FP
Sbjct: 88 IWGILQVLRMYPGKLPDFDLMFECGDKPVIKKHDYQGLNATAPPLFHYCGDDETLDIVFP 147
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW E+ ++PW+ KD++ G+ W ++ P+AYWKGN + R EL KC+
Sbjct: 148 DWSFWGWPEIRIKPWSTLRKDLREGNNKTKWVDREPYAYWKGNFK-MGVTRHELSKCSKT 206
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
+ + W A I +W +E ++GFK + LS QC H+YKIYAEG AWSVS KYIL+C+SV L+
Sbjct: 207 NEQDWNARIYNMDWLQEMQNGFKSADLSTQCTHKYKIYAEGAAWSVSEKYILACDSVTLL 266
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+ QY DFF+R L P +++PI D+C+SIK +W N + +A +I AG F+ E L
Sbjct: 267 VKPQYYDFFTRSLQPLVHYWPIKLKDMCKSIKFATEWCNNHTQKAHEIRNAGSSFVQEEL 326
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSP 457
M VYDYM HL++ Y+KL YKP P A E C E+++C Q++ ++ SPS
Sbjct: 327 RMKFVYDYMFHLLSAYAKLFKYKPTVPPGAVEVCPETMVCPVKGLQKKYKIQSMVKSPSD 386
Query: 458 YPPCTL 463
PC +
Sbjct: 387 TGPCVM 392
Score = 331 bits (848), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 164/361 (45%), Positives = 226/361 (62%), Gaps = 13/361 (3%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
+ +DL PW + IT + +A+ A++RI++V GK+Y++ Y +R FTIWG LQLL
Sbjct: 418 LEEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEFTIWGILQLL 477
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEH------GSFPLPLFRYCTNDAHFDIPFPDWSFW 226
R YP +PD D+MF+C D+P+I K H + P PLF YC +D +DI FPDWSFW
Sbjct: 478 RMYPEKLPDFDLMFECRDRPMI--KTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFW 535
Query: 227 GWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC--NDSKLW 284
GW E N++PWN KD+K G+ W ++ P+AYWKGN + +R EL KC D + W
Sbjct: 536 GWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVK-MGVVRKELFKCRNTDEQDW 594
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
A + +W E + GFK S L++QC HRYKIY EG WSVS KYIL+C+SV L++ QY
Sbjct: 595 NARLYIMDWGREVQSGFKTSDLASQCTHRYKIYTEGIGWSVSEKYILACDSVTLLVKPQY 654
Query: 345 KDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRV 403
+FF+R L P +++PI D+C+SIK DW N + +A+KIGKAG F+ E + M V
Sbjct: 655 YEFFTRSLQPLVHYWPIKHKDMCKSIKFATDWCNNHTEKAQKIGKAGSGFVQEEIKMKFV 714
Query: 404 YDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCT 462
YDYM HL++ Y+KLL YKP P A E C E + C + ++ L+ SPS PC
Sbjct: 715 YDYMFHLLSMYAKLLKYKPTVPPMAVEFCPEMMACAVEGLEKDYKLQSMVKSPSDTGPCI 774
Query: 463 L 463
+
Sbjct: 775 M 775
>gi|326505488|dbj|BAJ95415.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509389|dbj|BAJ91611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 562
Score = 341 bits (874), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 165/367 (44%), Positives = 233/367 (63%), Gaps = 10/367 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CPD+F+ IH DLEPW + I+ + + AA R+ +V G+ YV+ Y+ Q+R +FT
Sbjct: 168 CPDYFRHIHSDLEPWREKGISREAVERGRPKAAFRLTVVSGRAYVETYHRVFQTRDLFTQ 227
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH--GSFPLPLFRYCTNDAHFDIPFPDW 223
WG QLL RYPG VPD+D+MF+C D P + ++ S PLFRYC + ++ FPDW
Sbjct: 228 WGIAQLLARYPGRVPDLDLMFNCEDMPELRAADYPDTSAAPPLFRYCKDGTSLEVLFPDW 287
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC-NDS- 281
SFWGW EVN++PW K+I + W ++ P+A+WKGNPDV S R +L +C NDS
Sbjct: 288 SFWGWPEVNIRPWGPLMKEIAEENARLPWPDRQPYAFWKGNPDV-SRARRDLFRCSNDSA 346
Query: 282 --KLWGAEILRQNWAEEAKDGFKKS-KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
+ W A + +W ++GFK S L+ QC +RYKIY +G +WSVS KYIL+C+S L
Sbjct: 347 AGRDWNARLFALDWGAANRNGFKGSTNLAEQCRYRYKIYVQGRSWSVSEKYILACDSPML 406
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
I + DFFSRGL+ ++++P+ D CR++K VDWGNA+P+ A+++GK G F E
Sbjct: 407 AIDTPFDDFFSRGLVAGRHYWPVDPGDKCRAVKFAVDWGNAHPALAQRMGKEGSGFARED 466
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPS 456
++MD VYDYMLH++T Y+ LL YKP P A E C ES+ C A + R+ +E +
Sbjct: 467 MSMDYVYDYMLHVLTHYAALLRYKPTVPDKAVELCPESMACSAQGRDREFMMESREMYEA 526
Query: 457 PYPPCTL 463
Y PCT+
Sbjct: 527 GYEPCTM 533
>gi|226510425|ref|NP_001149452.1| LOC100283078 [Zea mays]
gi|195627344|gb|ACG35502.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 600
Score = 340 bits (873), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 172/372 (46%), Positives = 233/372 (62%), Gaps = 19/372 (5%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CPD+F+ IH DL PW ++ IT + + AA R+++V G+ YV+ Y+ Q+R FT
Sbjct: 202 CPDYFRFIHSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQ 261
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK--EHGSFPLPLFRYCTNDAHFDIPFPDW 223
WG QLL RYPG VPD+D+MF+C D P + K E S P PLFRYC +D+ DI FPDW
Sbjct: 262 WGIAQLLARYPGRVPDLDLMFNCEDMPEVKVKPSEESSAP-PLFRYCKDDSTVDIVFPDW 320
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND-SK 282
SFWGW EVN++PW +++ W ++ P+AYWKGNP V S R +L +CND S
Sbjct: 321 SFWGWPEVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTV-SAERADLRRCNDSSS 379
Query: 283 LWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
G + Q+W +DGF+ S L+ QC +RYKIY G +WSVSLKYIL+C+S L+I+
Sbjct: 380 SGGTRVFWQDWGAAIRDGFRDSNLAKQCRYRYKIYVRGRSWSVSLKYILACDSPVLLIAT 439
Query: 343 QYKDFFSRGLIPTKNHFPI-PSADLCRSIK-SVVDWGNANPSEAEKIGKAGQDFM-ESLT 399
+KDFFSRGL+ ++++PI P A C I +V DWGNA+P +A ++ + G F L+
Sbjct: 440 PFKDFFSRGLVAGRHYWPIDPGARKCADINFAVHDWGNAHPEQARRMAEEGSGFARHQLS 499
Query: 400 MDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQR--------QNLEKA 451
MD VYDYMLHL+T+Y+ LL YKP P +A E C E++ C A Q+ E+
Sbjct: 500 MDYVYDYMLHLLTQYAGLLRYKPTVPENAVELCAETVACPAAHHSNRREFDFMMQSRERY 559
Query: 452 AASPSPYPPCTL 463
A Y PCTL
Sbjct: 560 IAD---YQPCTL 568
>gi|413925102|gb|AFW65034.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 599
Score = 340 bits (873), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 171/369 (46%), Positives = 233/369 (63%), Gaps = 13/369 (3%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CPD+F+ IH DL PW ++ IT + + AA R+++V G+ YV+ Y+ Q+R FT
Sbjct: 201 CPDYFRFIHSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQ 260
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK--EHGSFPLPLFRYCTNDAHFDIPFPDW 223
WG QLL RYPG VPD+D+MF+C D P + K E S P PLFRYC +D+ DI FPDW
Sbjct: 261 WGIAQLLARYPGRVPDLDLMFNCEDMPEVKVKPSEESSAP-PLFRYCKDDSTVDIVFPDW 319
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND-SK 282
SFWGW EVN++PW +++ W ++ P+AYWKGNP V S R +L +CND S
Sbjct: 320 SFWGWPEVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTV-SAERADLRRCNDSSS 378
Query: 283 LWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
G + Q+W +DGF+ S L+ QC +RYKIY G +WSVSLKYIL+C+S L+I+
Sbjct: 379 SGGTRVFWQDWGAAIRDGFRDSNLAKQCRYRYKIYVRGRSWSVSLKYILACDSPVLLIAT 438
Query: 343 QYKDFFSRGLIPTKNHFPI-PSADLCRSIK-SVVDWGNANPSEAEKIGKAGQDFM-ESLT 399
+KDFFSRGL+ ++++PI P A C I +V DWGNA+P +A ++ + G F L+
Sbjct: 439 PFKDFFSRGLVAGRHYWPIDPGARKCADINFAVHDWGNAHPEQARRMAEEGSGFARHQLS 498
Query: 400 MDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKAAASP--- 455
MD VYDYMLHL+T+Y+ LL YKP P +A E C E++ C A R+ + S
Sbjct: 499 MDYVYDYMLHLLTQYAGLLRYKPTVPENAVELCAETVACPAAHHSNRREFDFMMQSRERY 558
Query: 456 -SPYPPCTL 463
+ Y PCTL
Sbjct: 559 IADYQPCTL 567
>gi|242077829|ref|XP_002443683.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
gi|241940033|gb|EES13178.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
Length = 594
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 164/366 (44%), Positives = 228/366 (62%), Gaps = 12/366 (3%)
Query: 109 FFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGF 168
+F+ IH DL PW ++ IT + + AA R+++V G+ YV+ Y+ Q+R FT WG
Sbjct: 198 YFRYIHSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGI 257
Query: 169 LQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL---PLFRYCTNDAHFDIPFPDWSF 225
QLL RYPG VPD+D+MF+C D P + + + P PLFRYC +D+ DI FPDWSF
Sbjct: 258 AQLLARYPGRVPDLDLMFNCEDMPEVRAADFAAAPSQAPPLFRYCKDDSTLDIVFPDWSF 317
Query: 226 WGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL-W 284
WGW EVN++PW +++ + W E+ P+AYWKGNP V S R +L++CNDS W
Sbjct: 318 WGWPEVNIRPWAPLLEEMAAETARLPWAEREPYAYWKGNPGV-SGERGDLLRCNDSSGEW 376
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
+ Q+W +DGF+ S L+ QC +RYKI+ G +WSVS KYIL+C+S L+++ +
Sbjct: 377 RTRVFWQDWGAAIRDGFRNSNLAKQCRYRYKIFVRGRSWSVSQKYILACDSPVLLVATPF 436
Query: 345 KDFFSRGLIPTKNHFPI-PSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDR 402
KDFFSRGL+ K+++PI P A C IK VDWGNA+P +A ++ + G F L+MD
Sbjct: 437 KDFFSRGLVAGKHYWPIDPGAGKCAGIKFAVDWGNAHPEQARRMAEEGSGFARHDLSMDY 496
Query: 403 VYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQN-----LEKAAASPSP 457
VYDYMLHL+T+Y+ LL YKP P +A E C E++ C A +E +
Sbjct: 497 VYDYMLHLLTQYAALLRYKPTVPENAVELCAETVACPAAKNNNNREFDFMMESRERFVAD 556
Query: 458 YPPCTL 463
Y PCTL
Sbjct: 557 YQPCTL 562
>gi|449523045|ref|XP_004168535.1| PREDICTED: LOW QUALITY PROTEIN: O-glucosyltransferase rumi homolog
[Cucumis sativus]
Length = 454
Score = 338 bits (867), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 165/369 (44%), Positives = 233/369 (63%), Gaps = 10/369 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
TCP++F+ IH+DL+PWA++ IT + + A R++IV G++YV+ Y + Q R +F
Sbjct: 55 NTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRVYVEKYSEVFQRRDVF 114
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS------FPLPLFRYCTNDAHFD 217
T+WG LQLLR YP +PD+D+MF C D+P + + P PLFRYC +D FD
Sbjct: 115 TLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTPPPPLFRYCGDDDTFD 174
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I FPDWSFWGW E+NL+PW E K++K +Q K W ++ +A+WKGN + P R +L+K
Sbjct: 175 IVFPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMP-RYQLLK 233
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
C+ S + Q+W EE K GFK S L++QC RYK+Y EG WSVSLKYIL+C+S+
Sbjct: 234 CSRSTQSKLRVYMQDWQEEGKKGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMT 293
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIP-SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L++ + DFF+R L+P +++PI D+C+SIK V+WG + +A+ IGKA FM
Sbjct: 294 LMVKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQKAQAIGKAASKFME 353
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQRQNLEKAAAS 454
E L MD+VYDYM H + EYSKLL +KP P +A E + L C + + ++
Sbjct: 354 EQLNMDKVYDYMFHTLNEYSKLLTFKPTIPPNATEISLNDLACPTEGLAAKSMMDTLIKR 413
Query: 455 PSPYPPCTL 463
PS PC L
Sbjct: 414 PSFSSPCFL 422
>gi|115466478|ref|NP_001056838.1| Os06g0152700 [Oryza sativa Japonica Group]
gi|113594878|dbj|BAF18752.1| Os06g0152700 [Oryza sativa Japonica Group]
Length = 547
Score = 338 bits (867), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/372 (44%), Positives = 232/372 (62%), Gaps = 10/372 (2%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
+ +CPD+F+ IH DL PW + IT + +R A R+++V G+ YV+ Y Q+R
Sbjct: 146 EPAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTR 205
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFPLPLFRYCTNDAHFD 217
FT WG QLLRRY G VPDVDIMF C D+ + D + P+FRYC + D
Sbjct: 206 DAFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPADAPPVFRYCRDATTLD 265
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
+ FPDWSFWGW EVN+ W + ++ S W E+ PFA+WKGNP V + +R ELMK
Sbjct: 266 VVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGV-ARIRGELMK 324
Query: 278 CN---DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
CN D K W A + Q+W +GFK S + QC HRYKIY EG AWSVS KYI++C+
Sbjct: 325 CNPASDGKDWNARLFSQDWNHAIHNGFKDSSIPKQCLHRYKIYIEGEAWSVSEKYIMACD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF 394
S L ++ Y+D SRGL+ ++++PI +C SI++ VDWGNA+P+ A +IG+ G F
Sbjct: 385 SPVLFVNTPYQDILSRGLVAGEHYWPINRTRMCESIRAAVDWGNAHPAAARRIGEQGSRF 444
Query: 395 M-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL-LCLADPKQRQNLEKAA 452
+ E + MD VYDYM HLITEY+KLL YKP P++A E C ES+ A ++R+ ++++
Sbjct: 445 VREQMAMDYVYDYMFHLITEYAKLLRYKPTVPANAVEICAESMACAAAAGRERECMDESV 504
Query: 453 AS-PSPYPPCTL 463
+ + PC+L
Sbjct: 505 EGFVAGFDPCSL 516
>gi|55296633|dbj|BAD69335.1| unknown protein [Oryza sativa Japonica Group]
gi|55297286|dbj|BAD69071.1| unknown protein [Oryza sativa Japonica Group]
Length = 542
Score = 338 bits (867), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 167/368 (45%), Positives = 231/368 (62%), Gaps = 10/368 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CPD+F+ IH DL PW + IT + +R A R+++V G+ YV+ Y Q+R FT
Sbjct: 145 SCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFT 204
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFP 221
WG QLLRRY G VPDVDIMF C D+ + D + P+FRYC + D+ FP
Sbjct: 205 QWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPADAPPVFRYCRDATTLDVVFP 264
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW EVN+ W + ++ S W E+ PFA+WKGNP V + +R ELMKCN
Sbjct: 265 DWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGV-ARIRGELMKCNPA 323
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
D K W A + Q+W +GFK S + QC HRYKIY EG AWSVS KYI++C+S L
Sbjct: 324 SDGKDWNARLFSQDWNHAIHNGFKDSSIPKQCLHRYKIYIEGEAWSVSEKYIMACDSPVL 383
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
++ Y+D SRGL+ ++++PI +C SI++ VDWGNA+P+ A +IG+ G F+ E
Sbjct: 384 FVNTPYQDILSRGLVAGEHYWPINRTRMCESIRAAVDWGNAHPAAARRIGEQGSRFVREQ 443
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL-LCLADPKQRQNLEKAAAS-P 455
+ MD VYDYM HLITEY+KLL YKP P++A E C ES+ A ++R+ ++++
Sbjct: 444 MAMDYVYDYMFHLITEYAKLLRYKPTVPANAVEICAESMACAAAAGRERECMDESVEGFV 503
Query: 456 SPYPPCTL 463
+ + PC+L
Sbjct: 504 AGFDPCSL 511
>gi|255537419|ref|XP_002509776.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549675|gb|EEF51163.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 534
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 167/367 (45%), Positives = 234/367 (63%), Gaps = 24/367 (6%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
CP++F+ IH+DL PW + I+ + A+R A ++IV GK Y+ Y + Q+R FT
Sbjct: 138 VCPNYFRWIHEDLRPWIAAGISRDMVERAQRTAHFHLIIVGGKAYIKKYRESTQTRDTFT 197
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDK----KEHGSFPLPLFRYCTNDAHFDIPF 220
IWG LQLLRRYPG +PD+++MFD D+PVI +++ + P PLFRYC +
Sbjct: 198 IWGILQLLRRYPGKIPDLELMFDTDDRPVIRSSDYHEQNTTGPPPLFRYCGDR------- 250
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PD +N++PW+E DIK G+ W ++ P+AYWKGNP ++ R +L+ CN
Sbjct: 251 PD--------INIKPWDELSIDIKEGNNGSKWIDREPYAYWKGNP-FVAETRKDLLACNV 301
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
D + W A + Q+W +E++ G+K+S L+ QC HRYKIY EGYAWSVS KYIL+CNS++L
Sbjct: 302 SDQRDWNARLFIQDWIQESQQGYKQSDLARQCAHRYKIYIEGYAWSVSEKYILACNSLSL 361
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ES 397
++ Y DFF+R L P ++++PI D C+SIK VDWGN N +A++IGKA DF+ E
Sbjct: 362 LVKPYYHDFFTRSLQPLQHYWPIRDTDKCKSIKFAVDWGNKNNQKAQEIGKAASDFIQEE 421
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPS 456
L MD VYDYM HL+ EY+KLL + P P A E C E + C AD +++ + E SP
Sbjct: 422 LKMDYVYDYMFHLLNEYAKLLKFAPRVPEEAVEMCSEIMACPADGLEKKFMTESLVKSPR 481
Query: 457 PYPPCTL 463
PCTL
Sbjct: 482 ITRPCTL 488
>gi|359489784|ref|XP_002273475.2| PREDICTED: O-glucosyltransferase rumi [Vitis vinifera]
Length = 604
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 164/373 (43%), Positives = 237/373 (63%), Gaps = 9/373 (2%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
S +TCP++F+ I++DL PW ++ IT + AK A +R+++V GK+Y++ Y Q
Sbjct: 197 SEPSTETCPEYFRWIYEDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVYMEKYKGVYQ 256
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK----EHGSFPLPLFRYCTNDA 214
+R +FTIWG LQLLR YPG +PD D+MF C DK + + + + P PLF YC +D
Sbjct: 257 TRDVFTIWGILQLLRMYPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPPLFHYCGDDE 316
Query: 215 HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
+DI FPDWSFWGW E++++ WN KD+K G+ W ++ P+AYWKGN + L R E
Sbjct: 317 TYDIVFPDWSFWGWPEIHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNIN-LGLARHE 375
Query: 275 LMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
L KCN + + W A I +W +E GF S L++QC HRYKIY EG WSVS KYIL+
Sbjct: 376 LSKCNKTSEQDWNARIYDIDWRQEIHSGFNSSDLASQCTHRYKIYTEGVTWSVSEKYILA 435
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
C+SV L+ + Y DFF+R L P +++P+ D+C+SIK +W N + +A++I KAG
Sbjct: 436 CDSVTLLANPHYYDFFTRSLQPMVHYWPLKLKDMCKSIKFATEWCNNHTEKAQEIAKAGS 495
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEK 450
F+ E L M VYDYM HL++ Y+KLL +KP+ P A E C E+++C + ++ ++
Sbjct: 496 SFVQEKLKMKFVYDYMFHLLSMYAKLLKFKPSVPPGAVEFCPETMVCPVKGLEKDYKIQS 555
Query: 451 AAASPSPYPPCTL 463
SPS PCT+
Sbjct: 556 MVRSPSDTGPCTM 568
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 40/55 (72%)
Query: 137 AALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDK 191
A +R+++V GK++++ Y Q+R + TIW LQLLR YPG +PD+D++F+C K
Sbjct: 17 AHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 71
>gi|297745255|emb|CBI40335.3| unnamed protein product [Vitis vinifera]
Length = 578
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 163/369 (44%), Positives = 236/369 (63%), Gaps = 9/369 (2%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
+TCP++F+ I++DL PW ++ IT + AK A +R+++V GK+Y++ Y Q+R +
Sbjct: 175 TETCPEYFRWIYEDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVYMEKYKGVYQTRDV 234
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK----EHGSFPLPLFRYCTNDAHFDI 218
FTIWG LQLLR YPG +PD D+MF C DK + + + + P PLF YC +D +DI
Sbjct: 235 FTIWGILQLLRMYPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPPLFHYCGDDETYDI 294
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
FPDWSFWGW E++++ WN KD+K G+ W ++ P+AYWKGN + L R EL KC
Sbjct: 295 VFPDWSFWGWPEIHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNIN-LGLARHELSKC 353
Query: 279 NDS--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
N + + W A I +W +E GF S L++QC HRYKIY EG WSVS KYIL+C+SV
Sbjct: 354 NKTSEQDWNARIYDIDWRQEIHSGFNSSDLASQCTHRYKIYTEGVTWSVSEKYILACDSV 413
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L+ + Y DFF+R L P +++P+ D+C+SIK +W N + +A++I KAG F+
Sbjct: 414 TLLANPHYYDFFTRSLQPMVHYWPLKLKDMCKSIKFATEWCNNHTEKAQEIAKAGSSFVQ 473
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKAAAS 454
E L M VYDYM HL++ Y+KLL +KP+ P A E C E+++C + ++ ++ S
Sbjct: 474 EKLKMKFVYDYMFHLLSMYAKLLKFKPSVPPGAVEFCPETMVCPVKGLEKDYKIQSMVRS 533
Query: 455 PSPYPPCTL 463
PS PCT+
Sbjct: 534 PSDTGPCTM 542
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 40/55 (72%)
Query: 137 AALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDK 191
A +R+++V GK++++ Y Q+R + TIW LQLLR YPG +PD+D++F+C K
Sbjct: 20 AHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 74
>gi|125559830|gb|EAZ05278.1| hypothetical protein OsI_27481 [Oryza sativa Indica Group]
Length = 579
Score = 328 bits (840), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 168/372 (45%), Positives = 230/372 (61%), Gaps = 15/372 (4%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
V CP +F+ I DL W S I+ + +R A R+L+V G+ YV+ Y Q+R +
Sbjct: 177 VDYCPSYFRHIELDLAAWVASGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDV 236
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI--DKKEHGSFPLPLFRYCTNDAHFDIPF 220
FT WG LQLLRRYPG VPD+D+MF+C D P + + PLFRYC + + D+ F
Sbjct: 237 FTQWGILQLLRRYPGRVPDLDLMFNCDDMPEVRAAAYPDRAAAPPLFRYCKDPSTLDVLF 296
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW EVN++PW ++ W + P+AYWKGNPDV SPLR EL++CN
Sbjct: 297 PDWSFWGWPEVNIRPWAPLLAEMAEEKARLPWSRREPYAYWKGNPDV-SPLRQELLRCNH 355
Query: 281 S--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S + RQ+W ++ F+ S L+ QC HRYK+Y +G +WSVS KYIL+C+S L
Sbjct: 356 SLPPDDTVRLYRQDWGFANRNAFRDSNLARQCRHRYKLYVQGRSWSVSRKYILACDSPVL 415
Query: 339 IISQQYKDFFSRGLIPTKNHFPI-PS-ADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-M 395
++ Y+DFFSRGL K+++PI PS + LCR I+ V WGNA+P++A+++G AG F
Sbjct: 416 AVATPYQDFFSRGLAAGKHYWPIDPSRSKLCRDIRFAVRWGNAHPAQAQRMGLAGSAFAT 475
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQR----QNLEKA 451
+ + MD VYDYMLH++T Y+ LL YKP P A E C ES+ C + R Q+ E+
Sbjct: 476 DDMAMDYVYDYMLHVLTRYASLLRYKPTVPDRAVELCPESMACPRRGRDRDFMMQSREQY 535
Query: 452 AASPSPYPPCTL 463
A Y PCT+
Sbjct: 536 VAD---YQPCTI 544
>gi|302794981|ref|XP_002979254.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300153022|gb|EFJ19662.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 474
Score = 327 bits (839), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 168/381 (44%), Positives = 230/381 (60%), Gaps = 17/381 (4%)
Query: 93 ERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDP 152
E +R SP CP +F I DL PW ++ I+ +++ +A+ A R++IV G LY++
Sbjct: 74 ELQRCLSPNSEAHCPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMER 133
Query: 153 YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-----GSFPLP-L 206
Y C ++R FT+WG L LL YPGMVPDVD+MF+C D P++ + EH GS+P P L
Sbjct: 134 YNKCFETRDDFTLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPL 193
Query: 207 FRYCTNDA-HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNP 265
F YCT+ H+DI FPDWS+WGW EVN+ PW+ E + I G++ W + P A+WKGN
Sbjct: 194 FLYCTSRGEHYDIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNY 253
Query: 266 DVLSPLRVELMKC--NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAW 323
D + P R +L+KC N+S+ + Q+W E + F S LS QC HRYKIY +G W
Sbjct: 254 D-MGPARADLVKCTANNSQNYNLVTHHQDWFTEREHNFNNSDLSKQCQHRYKIYVQGGGW 312
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
SVS KYIL+C S L I +++FF+R L P + +P+ ++C S K VDWGNA+P E
Sbjct: 313 SVSFKYILACGSTVLQIEPMFQEFFARSLTPFVHFWPVDRDNICNSTKFAVDWGNAHPKE 372
Query: 384 AEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP 442
A IG + F++ L+MD VY YMLHL+ EY KLL ++P PP A +ES L P
Sbjct: 373 AAAIGDCAKSFLDKELSMDFVYQYMLHLLQEYGKLLKFRPVPPPEAQRMTLESGL----P 428
Query: 443 KQRQNLEKAAASPSPYPPCTL 463
L + P P CTL
Sbjct: 429 GHELTLPRNG--PRPQKVCTL 447
>gi|115474335|ref|NP_001060764.1| Os08g0101800 [Oryza sativa Japonica Group]
gi|38637293|dbj|BAD03556.1| unknown protein [Oryza sativa Japonica Group]
gi|50725686|dbj|BAD33152.1| unknown protein [Oryza sativa Japonica Group]
gi|113622733|dbj|BAF22678.1| Os08g0101800 [Oryza sativa Japonica Group]
gi|125601894|gb|EAZ41219.1| hypothetical protein OsJ_25724 [Oryza sativa Japonica Group]
Length = 579
Score = 327 bits (838), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 168/372 (45%), Positives = 230/372 (61%), Gaps = 15/372 (4%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
V CP +F+ I DL W S I+ + +R A R+L+V G+ YV+ Y Q+R +
Sbjct: 177 VDYCPSYFRHIELDLAAWLASGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDV 236
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI--DKKEHGSFPLPLFRYCTNDAHFDIPF 220
FT WG LQLLRRYPG VPD+D+MF+C D P + + PLFRYC + + D+ F
Sbjct: 237 FTQWGILQLLRRYPGRVPDLDLMFNCDDMPEVRAAAYPDRAAAPPLFRYCKDPSTLDVLF 296
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW EVN++PW ++ W + P+AYWKGNPDV SPLR EL++CN
Sbjct: 297 PDWSFWGWPEVNIRPWAPLLAEMAEEKARLPWSRREPYAYWKGNPDV-SPLRQELLRCNH 355
Query: 281 S--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S + RQ+W ++ F+ S L+ QC HRYK+Y +G +WSVS KYIL+C+S L
Sbjct: 356 SLPPDDTVRLYRQDWGFANRNAFRDSNLARQCRHRYKLYVQGRSWSVSRKYILACDSPVL 415
Query: 339 IISQQYKDFFSRGLIPTKNHFPI-PS-ADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-M 395
++ Y+DFFSRGL K+++PI PS + LCR I+ V WGNA+P++A+++G AG F
Sbjct: 416 AVATPYQDFFSRGLAAGKHYWPIDPSRSKLCRDIRFAVRWGNAHPAQAQRMGLAGSAFAT 475
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQR----QNLEKA 451
+ + MD VYDYMLH++T Y+ LL YKP P A E C ES+ C + R Q+ E+
Sbjct: 476 DDMAMDYVYDYMLHVLTRYASLLRYKPTVPDRAVELCPESMACPRRGRDRDFMMQSREQY 535
Query: 452 AASPSPYPPCTL 463
A Y PCT+
Sbjct: 536 VAD---YQPCTI 544
>gi|356504647|ref|XP_003521107.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
Length = 373
Score = 327 bits (838), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 219/341 (64%), Gaps = 8/341 (2%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
I K + R++IV GK Y++ + Q+R +FTIWG LQLLR YPG VPD+++MF C
Sbjct: 2 IERGKNISHFRLVIVNGKAYIEKFAKSYQTRDVFTIWGILQLLRLYPGKVPDLELMFHCG 61
Query: 190 DKPVIDKKEHGS---FPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHG 246
DK V+ KK+ P P+F YC + +DI FPDW+FWGW+E++++PW +I+ G
Sbjct: 62 DKTVVFKKDFQGPQMSPPPVFHYCGEENSYDIVFPDWTFWGWAELSIRPWETTLHNIQEG 121
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKS 304
++ WK+++P+A+WKGNP V S +R EL KCN + + W A I W +E F+ S
Sbjct: 122 NKMVKWKDRIPYAFWKGNPKV-SIIRRELGKCNVTEKQDWNARIYDIQWLQERASNFENS 180
Query: 305 KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSA 364
KL NQC RYKIYAEG WSVS KYI++C+S+ + I +Y DFF+R ++P ++++PI +
Sbjct: 181 KLENQCTFRYKIYAEGATWSVSEKYIIACDSMTMFIEPKYYDFFTRNMLPLRHYWPISTK 240
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYKPA 423
++C IK VDWGNA+ A+ IG G ++ +E+L M VYDYM HL+ YSKLL +KP
Sbjct: 241 NMCEEIKYAVDWGNAHLDHAQAIGDGGTNYILENLKMKFVYDYMFHLLNNYSKLLKFKPT 300
Query: 424 PPSSAFEACVESLLC-LADPKQRQNLEKAAASPSPYPPCTL 463
P A E C ES+ C L ++ +E SPS PPC L
Sbjct: 301 IPIGAVEICSESMACSLHGQRKHFMVESMVISPSDTPPCIL 341
>gi|224064021|ref|XP_002301352.1| predicted protein [Populus trichocarpa]
gi|222843078|gb|EEE80625.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 327 bits (838), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 159/373 (42%), Positives = 237/373 (63%), Gaps = 19/373 (5%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP++F+ IH+DL W + I+ + AK +A R++I++GK+YV+ Y +R +FTI
Sbjct: 23 CPEYFRWIHEDLRIWKSTGISRAMVERAKDYAHFRLVILKGKIYVEKYKKSFHTRDVFTI 82
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPFP 221
WG LQLLR YPG VPD+++MF C D+PVI KK++ + +F+YC + I FP
Sbjct: 83 WGILQLLRLYPGKVPDLELMFWCDDRPVILKKDYQGTNATSSPSIFQYCGREDALGIVFP 142
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DW+FWGW+E N+ PW KD+K ++ WK+++P+AYW+GNP+V + R +LM CN
Sbjct: 143 DWTFWGWAETNVSPWKTLSKDLKEANKRTKWKDRVPYAYWRGNPNVAASRR-QLMWCNVS 201
Query: 280 DSKLWGAEILRQ----NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
D W A + +Q +W E++ G++ S+L +QC HRYKIY EG WSVS KYIL+C+S
Sbjct: 202 DKYDWNARLYKQASYIDWRTESEQGYEHSRLEDQCTHRYKIYIEGRGWSVSDKYILACDS 261
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ L + +Y DFF R ++P ++++P+ + + CR IK V+WGN + +A+ IGKAG F+
Sbjct: 262 MTLFVKPEYYDFFIRSMVPLQHYWPVSARNKCRDIKFAVEWGNNHTDKAQAIGKAGSKFI 321
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPK----QRQNLEK 450
E+L M+ VYDYM HL+T Y+ LL +KP P A E E+ +A P ++ E
Sbjct: 322 QENLKMEYVYDYMFHLLTNYANLLKFKPRIPEGAVEVYSET---MARPHRGLWKKFMAET 378
Query: 451 AAASPSPYPPCTL 463
PS PCT+
Sbjct: 379 LVNFPSDTLPCTM 391
>gi|224064019|ref|XP_002301351.1| predicted protein [Populus trichocarpa]
gi|222843077|gb|EEE80624.1| predicted protein [Populus trichocarpa]
Length = 413
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 170/382 (44%), Positives = 236/382 (61%), Gaps = 25/382 (6%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
CP FF+ IH+DL PW + I+ + A++ A R++IV GK YV+ Y Q+R +FT
Sbjct: 2 ACPGFFRWIHEDLRPWKDTGISRDMLERARKHAHFRLVIVEGKAYVEQYSKPYQTRDVFT 61
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HGS---FPLPLFRYCTNDAHFDIPF 220
IWG LQLL YPG +PD+++MF C DK VI K + GS P LF+YC + +I F
Sbjct: 62 IWGILQLLNLYPGKIPDLELMFRCGDKTVIQKHDIQGSDAMSPAVLFQYCGHSTALNIVF 121
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PDW+FWGW+E N++PW + + G++ W+++ P+AYW+GNP V SP R +LMKCN
Sbjct: 122 PDWTFWGWAETNIKPWKIVLEGMVEGNKKIKWQDREPYAYWRGNPHV-SPNREDLMKCNV 180
Query: 280 -DSKLWGAEILRQ---------------NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAW 323
D W A + Q NW +E + G+K SKL +QC HRYKIY EG +W
Sbjct: 181 SDKYDWLARLYEQASFLDHSPMNLADWKNWGKEREQGYKHSKLEDQCTHRYKIYIEGNSW 240
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
SVS KYIL+C+S+AL+I +Y DFFSR + P ++++PI +++ C+ IK V+WGN + +
Sbjct: 241 SVSEKYILACDSMALLIKPEYYDFFSRSMEPMQHYWPIRASNKCKDIKFAVEWGNNHSVD 300
Query: 384 AEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLL-CLAD 441
A+ IGKAG F+ E+L M+ VY+YM HL EY+KLL +KP P+ A E ES L
Sbjct: 301 AQAIGKAGSKFIQENLKMEYVYEYMFHLFKEYAKLLRFKPKIPAGAVEVSSESTASSLGG 360
Query: 442 PKQRQNLEKAAASPS-PYPPCT 462
+ LE SP PCT
Sbjct: 361 LWKMFILESVVKSPGDALVPCT 382
>gi|302823162|ref|XP_002993235.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
gi|300138905|gb|EFJ05656.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
Length = 475
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 167/381 (43%), Positives = 228/381 (59%), Gaps = 17/381 (4%)
Query: 93 ERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDP 152
E +R SP CP +F I DL PW ++ I+ +++ +A+ A R++IV G LY++
Sbjct: 74 ELQRCLSPNSEAQCPSYFSWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMER 133
Query: 153 YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-----GSFPLP-L 206
Y C ++R FT+WG L LL YPGMVPDVD+MF+C D P++ + EH GS+P P L
Sbjct: 134 YNKCFETRDDFTLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHKPEKNGSWPPPPL 193
Query: 207 FRYCTNDA-HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNP 265
F YCT+ H+DI FPDWS+WGW EVN+ PW+ E + I G++ W + P A+WKGN
Sbjct: 194 FLYCTSRGEHYDIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNY 253
Query: 266 DVLSPLRVELMKC--NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAW 323
D + P R +L+KC N+S+ + Q+W E + F S LS QC HRYKIY +G W
Sbjct: 254 D-MGPARADLVKCTANNSQNYNLVTHHQDWFTEREHNFNNSDLSKQCQHRYKIYVQGGGW 312
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
SVS KYIL+C S L I +++FF+R L P + +P+ ++C S K VDWGNA+P E
Sbjct: 313 SVSFKYILACGSTVLQIEPMFQEFFARSLTPFVHFWPVDRDNICNSTKFAVDWGNAHPKE 372
Query: 384 AEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP 442
A IG + F++ L+MD VY YMLHL+ EY KLL ++P PP A +ES L
Sbjct: 373 AAAIGDCAKSFLDKDLSMDFVYQYMLHLLQEYGKLLKFRPVPPPEAQRMTLESGLL---- 428
Query: 443 KQRQNLEKAAASPSPYPPCTL 463
L P P CTL
Sbjct: 429 --GHELTLPRNGPRPRKVCTL 447
>gi|449441480|ref|XP_004138510.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
gi|449518613|ref|XP_004166331.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 380
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 157/343 (45%), Positives = 227/343 (66%), Gaps = 12/343 (3%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
+ +R A R++IV G++YV+ Y +Q+R +FT+WG LQL R YP +PD+++MFDC
Sbjct: 2 VERGRRTAHFRVVIVEGRVYVEKYKGSIQTRDVFTMWGILQLARWYPKKLPDLELMFDCD 61
Query: 190 DKPVIDKKEHG-----SFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIK 244
D+PV+ + +G S P PLFRYC++++ DI FPDWSFWGW E+N++PW +DIK
Sbjct: 62 DRPVV--RSNGFMNAISGPPPLFRYCSDESSLDIVFPDWSFWGWGEINIKPWKMVLEDIK 119
Query: 245 HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFK 302
G++ WK+++P AYWKGNP V P R +L+KCN + W + Q+W +EAK+G+K
Sbjct: 120 EGNKRTRWKDRVPLAYWKGNPQV-DPSRRDLLKCNLTQQQNWDTLLYVQDWDKEAKEGYK 178
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIP 362
+S L +QC HRYKIY EG+AWSVS KYI++C+S+ L + ++ DFF RG++P ++ +PI
Sbjct: 179 QSNLEDQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYMKPRFYDFFIRGMVPLQHFWPIN 238
Query: 363 SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYK 421
C S+K V WGN N +AE IG+ G ++ E+L M+ VYDYM HL+ EYSKLL ++
Sbjct: 239 DQSKCSSLKFAVQWGNNNTIQAEAIGEEGSKYLQENLKMELVYDYMYHLLNEYSKLLKFR 298
Query: 422 PAPPSSAFEACVESLLCLADPKQRQNLEKA-AASPSPYPPCTL 463
P P A E E++ A ++ LE + SPS PC L
Sbjct: 299 PTVPPGAVELKPETMTGAALGLHKKFLEDSLEKSPSQTEPCDL 341
>gi|79454960|ref|NP_191687.2| uncharacterized protein [Arabidopsis thaliana]
gi|55978809|gb|AAV68866.1| hypothetical protein AT3G61280 [Arabidopsis thaliana]
gi|61742723|gb|AAX55182.1| hypothetical protein At3g61280 [Arabidopsis thaliana]
gi|110739424|dbj|BAF01622.1| hypothetical protein [Arabidopsis thaliana]
gi|332646661|gb|AEE80182.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 323 bits (829), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 164/372 (44%), Positives = 237/372 (63%), Gaps = 13/372 (3%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCPD+FK IH+DL+ W K+ IT + A+ A RI+I G+LYV Y Q+R +F
Sbjct: 129 ETCPDYFKWIHRDLKVWQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVF 188
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF------PLPLFRYCTNDAHFD 217
TIWG LQLLR YPG +PD++++F C D+P I K++ P PLF YC + +D
Sbjct: 189 TIWGILQLLRMYPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYD 248
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I FPDWSFWGW E+N++ WN+ +K G++ W++++P+AYWKGNP V SP+R +LM+
Sbjct: 249 IVFPDWSFWGWPELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHV-SPIRGDLMR 307
Query: 278 CNDSKLWG--AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
CN S + + Q+W E + GF+ S L +QC HRYKIY EG AWSVS KYILSC+S
Sbjct: 308 CNFSDKYDPMVRLYVQDWRSEIEAGFRGSNLEDQCTHRYKIYIEGNAWSVSEKYILSCDS 367
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF- 394
+ L++ +Y DFF R ++P K+ +PI + C +K V+WGN N +A+ IG+ G ++
Sbjct: 368 MTLLVKPEYYDFFFRSMVPMKHFWPIRQNNKCGDLKFAVEWGNNNTEKAQIIGRQGSEYM 427
Query: 395 MESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQ-RQNLEKA- 451
M++L M VYDYML+++ Y KL+ P +A E C E++ C + D + RQ ++ +
Sbjct: 428 MKNLKMKYVYDYMLYVLQGYGKLMKLDVTVPENATEVCSETMACSITDGGRIRQCMDDSL 487
Query: 452 AASPSPYPPCTL 463
SPS C L
Sbjct: 488 VMSPSVKAACDL 499
>gi|302785297|ref|XP_002974420.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300158018|gb|EFJ24642.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 386
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/368 (44%), Positives = 226/368 (61%), Gaps = 17/368 (4%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F I DL PW ++ I+ +++ +A+ A R++IV G LY++ Y C ++R FT+
Sbjct: 1 CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-----GSFPLP-LFRYCTNDA-HFDI 218
WG L LL+ YPGMVPDVD+MF+C D P++ + EH GS+P P LF YCT+ H+DI
Sbjct: 61 WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRRDHYDI 120
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
FPDWS+WGW EVN+ PW+ E I G++ W + P A+WKGN D + P R +L+KC
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAEKLDWSHRQPIAFWKGNYD-MGPARADLVKC 179
Query: 279 --NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
N++ + Q+W E + F S LS QC HRYKIY +G WSVS KYIL+C S
Sbjct: 180 TANNTHNYNLVTYHQDWFAEREHNFNNSDLSKQCQHRYKIYVQGGGWSVSFKYILACGST 239
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME 396
L I +++FF+R LIP + +P+ ++C S K VDWGNA+P EA IG+ + F++
Sbjct: 240 VLQIEPMFQEFFARSLIPFVHFWPVDRDNICNSTKFAVDWGNAHPKEAAAIGECAKSFLD 299
Query: 397 -SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASP 455
L+MD VY YMLHL+ EY KLL +KP PP A +ES L P L + P
Sbjct: 300 KELSMDFVYQYMLHLLREYGKLLKFKPVPPPEAQRMTLESSL----PGHELTLPRTV--P 353
Query: 456 SPYPPCTL 463
P CTL
Sbjct: 354 RPRKVCTL 361
>gi|110738999|dbj|BAF01419.1| hypothetical protein [Arabidopsis thaliana]
Length = 523
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/371 (42%), Positives = 232/371 (62%), Gaps = 15/371 (4%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCPD+F+ IHKDLE W ++ IT + A A R++I G++YV Y Q+R +F
Sbjct: 117 ETCPDYFRWIHKDLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVF 176
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL--------PLFRYCTNDAH 215
TIWG +QLLR YPG VPD++++F C D P I ++++ P PLF YC +
Sbjct: 177 TIWGIVQLLRMYPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGA 236
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
FDI FPDWSFWGW E+N++ WN++ + I G + W+E+ P+AYWKGNP V + +R +L
Sbjct: 237 FDIVFPDWSFWGWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGV-AMVRRDL 295
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
M C+D + + RQ+W+ E + G++ S L +QC HRYKIY EG AWSVS KYIL+C+S
Sbjct: 296 MHCHDPMV---HLYRQDWSREGRIGYRTSNLEDQCTHRYKIYVEGRAWSVSEKYILACDS 352
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ L++ Y DFF+R L+P ++++PI + C I V WGN N +A+ IG+ G ++
Sbjct: 353 MTLLVKPFYFDFFTRSLVPMEHYWPIRPQEKCSDIVFAVHWGNNNTKKAKAIGRNGSGYV 412
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKA-A 452
++L M VYDYMLHL+ Y KL+ P A E C E++ C + + RQ+++ +
Sbjct: 413 RKNLKMKYVYDYMLHLLQSYGKLMKMNVEVPQGAKEVCPETMACPINGGRMRQSMDDSLV 472
Query: 453 ASPSPYPPCTL 463
SPS C +
Sbjct: 473 MSPSVKATCEM 483
>gi|42569949|ref|NP_182108.2| uncharacterized protein [Arabidopsis thaliana]
gi|330255514|gb|AEC10608.1| uncharacterized protein [Arabidopsis thaliana]
Length = 523
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/371 (42%), Positives = 231/371 (62%), Gaps = 15/371 (4%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCPD+F+ IHKDLE W ++ IT + A A R++I G++YV Y Q+R +F
Sbjct: 117 ETCPDYFRWIHKDLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVF 176
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL--------PLFRYCTNDAH 215
TIWG +QLLR YPG VPD++++F C D P I ++++ P PLF YC +
Sbjct: 177 TIWGIVQLLRMYPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGA 236
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
FDI FPDWSFWGW E+N++ WN++ + I G + W+E+ P+AYWKGNP V + +R +L
Sbjct: 237 FDIVFPDWSFWGWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGV-AMVRRDL 295
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
M C+D + + RQ+W+ E + G++ S L +QC HRYKIY EG AWSVS KYIL+C+S
Sbjct: 296 MHCHDPMV---HLYRQDWSREGRIGYRTSNLEDQCTHRYKIYVEGRAWSVSEKYILACDS 352
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ L++ Y DFF+R L+P ++++PI + C I V WGN N +A IG+ G ++
Sbjct: 353 MTLLVKPFYFDFFTRSLVPMEHYWPIRPQEKCSDIVFAVHWGNNNTKKARAIGRNGSGYV 412
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKA-A 452
++L M VYDYMLHL+ Y KL+ P A E C E++ C + + RQ+++ +
Sbjct: 413 RKNLKMKYVYDYMLHLLQSYGKLMKMNVEVPQGAKEVCPETMACPINGGRMRQSMDDSLV 472
Query: 453 ASPSPYPPCTL 463
SPS C +
Sbjct: 473 MSPSVKATCEM 483
>gi|42571241|ref|NP_973694.1| downstream target of AGL15 2 [Arabidopsis thaliana]
gi|44917445|gb|AAS49047.1| At2g45830 [Arabidopsis thaliana]
gi|46931276|gb|AAT06442.1| At2g45830 [Arabidopsis thaliana]
gi|330255512|gb|AEC10606.1| downstream target of AGL15 2 [Arabidopsis thaliana]
Length = 382
Score = 321 bits (823), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 150/342 (43%), Positives = 227/342 (66%), Gaps = 9/342 (2%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
+ +A+R A R++I+ G++YV Y +Q+R +FT+WG +QLLR YPG +PD+++MFD
Sbjct: 2 LEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLELMFDPD 61
Query: 190 DKPVIDKKEHGS----FPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH 245
D+P + K+ P PLFRYC++DA DI FPDWSFWGW+EVN++PW++ I+
Sbjct: 62 DRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSLVAIEE 121
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS--KLWGAEILRQNWAEEAKDGFKK 303
G++ WK+++ +AYW+GNP+V +P R +L++CN S + W + Q+W E+++GFK
Sbjct: 122 GNKMTQWKDRVAYAYWRGNPNV-APTRRDLLRCNVSAQEDWNTRLYIQDWDRESREGFKN 180
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
S L NQC HRYKIY EG+AWSVS KYI++C+S+ L + + DF+ RG++P ++++PI
Sbjct: 181 SNLENQCTHRYKIYIEGWAWSVSEKYIMACDSMTLYVRPMFYDFYVRGMMPLQHYWPIRD 240
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
C S+K V WGN + +A KIG+ G F+ E + M+ VYDYM HL+ EY+KLL +KP
Sbjct: 241 TSKCTSLKFAVHWGNTHLDQASKIGEEGSRFIREEVKMEYVYDYMFHLMNEYAKLLKFKP 300
Query: 423 APPSSAFEACVESLLCLADPKQRQNLEKAAAS-PSPYPPCTL 463
P A E + + C A + R +E++ PS PC +
Sbjct: 301 EIPWGATEITPDIMGCSATGRWRDFMEESMVMFPSEESPCEM 342
>gi|357127693|ref|XP_003565513.1| PREDICTED: uncharacterized protein LOC100832633 [Brachypodium
distachyon]
Length = 538
Score = 320 bits (820), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 160/369 (43%), Positives = 228/369 (61%), Gaps = 12/369 (3%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F+ IH DL+PW + IT + A A R+ +V G+ YV Q+R +FT
Sbjct: 140 CPVYFRHIHTDLDPWRSTGITRDTLERAMPHAEFRLTVVSGRAYVQNLRPSYQTRDVFTQ 199
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTNDA-HFDIPF 220
WG LQLL R+PG VPDVDIMF D + ++ + P PLFRYC + I F
Sbjct: 200 WGVLQLLARFPGRVPDVDIMFSAGDVAQVLSADYYNTTTHPPPPLFRYCKEEKLEAAIVF 259
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW E++++PW +D ++A W+ + P+A+WKGNP+V S +R +L KCN+
Sbjct: 260 PDWSFWGWPELSIRPWAPLMEDFVRENKALPWRNRQPYAFWKGNPEV-SDVRRDLFKCNN 318
Query: 281 ----SKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
K W A + Q+W +++GF+ S L+ QC++RYKIY +G+AWSVS KYIL+C+S
Sbjct: 319 DSANGKDWNARLFWQDWNAASRNGFRDSNLAKQCDYRYKIYVQGHAWSVSEKYILACDSP 378
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L + Y DFFSRGL+ ++++PI A CR++K V WGN + +A ++G+ G F
Sbjct: 379 MLAVDTSYVDFFSRGLVAGRHYWPIDPARKCRAVKFAVGWGNEHAGQAMRMGQEGSRFAR 438
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAAS 454
E ++MD VY+YM H+ITEY+ LL Y+P P A E CVESL C ++++ L E
Sbjct: 439 EEMSMDYVYEYMFHVITEYAALLRYRPTVPEKAVEVCVESLACGRRGREKEFLMESREEY 498
Query: 455 PSPYPPCTL 463
+ Y PCTL
Sbjct: 499 EARYEPCTL 507
>gi|302808049|ref|XP_002985719.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300146628|gb|EFJ13297.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 386
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/368 (44%), Positives = 225/368 (61%), Gaps = 17/368 (4%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F I DL PW ++ I+ +++ +A+ A R++IV G LY++ Y C ++R FT+
Sbjct: 1 CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-----GSFPLP-LFRYCTNDA-HFDI 218
WG L LL+ YPGMVPDVD+MF+C D P++ + EH GS+P P LF YCT+ H+DI
Sbjct: 61 WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRGDHYDI 120
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
FPDWS+WGW EVN+ PW+ E I G++ W + P A+WKGN D + P R +L+KC
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAKKLDWSHRQPIAFWKGNYD-MGPARADLVKC 179
Query: 279 --NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
N++ + Q+W E + F S LS QC HRYKIY +G WSVS KYIL+C S
Sbjct: 180 TANNTHNYNLVTYHQDWFAERERNFNNSDLSKQCQHRYKIYVQGGGWSVSFKYILACGST 239
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME 396
L I +++FF+R LIP + +P+ ++C S K VDWGNA+P EA IG+ + F++
Sbjct: 240 VLQIEPMFQEFFARSLIPFVHFWPVDRDNICNSTKFAVDWGNAHPKEAAAIGECAKSFLD 299
Query: 397 -SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASP 455
L+MD VY YML L+ EY KLL +KP PP A +ES L P L + P
Sbjct: 300 KDLSMDFVYQYMLQLLREYGKLLKFKPVPPPEAQRMTLESSL----PGHELTLPRNG--P 353
Query: 456 SPYPPCTL 463
P CTL
Sbjct: 354 RPRKVCTL 361
>gi|302815579|ref|XP_002989470.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
gi|300142648|gb|EFJ09346.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
Length = 329
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 147/333 (44%), Positives = 212/333 (63%), Gaps = 8/333 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F IH+ L PW + IT R + R ++R+ I+R +YV Y DC QSR F++
Sbjct: 3 CPRYFARIHEHLAPWRSTGITRRSL---DRLGSMRVAILRSDMYVSAYGDCPQSRRAFSL 59
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
WG L +L R+ VPDV+ + +C D+P++ ++ P P+ YC++ DIPFPD+SF
Sbjct: 60 WGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGSEVPAPVLSYCSHRLSLDIPFPDYSF 118
Query: 226 WGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG 285
WGW EVN++PW EE ++I GSQ W + PFA+WKGN + LR L +CN +K G
Sbjct: 119 WGWPEVNIRPWEEESQEIFQGSQGVEWSNRQPFAFWKGNLR-MGKLRTLLARCNSTKF-G 176
Query: 286 AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYK 345
+L QNW +EA G++ S L QCN RY IYAEG AWS SLKY ++C S L + +Y
Sbjct: 177 TLVLDQNWIDEANIGYENSNLCKQCNQRYNIYAEGAAWSASLKYKMACGSTLLYLDSEYD 236
Query: 346 DFFSRGLIPTKNHFPIPS-ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRV 403
+FF++GL+P + PI S D+C+S+K+ V+WGN++ +A+ IG+ GQDF+ E + +D+V
Sbjct: 237 EFFTKGLLPNIHFLPISSKEDMCQSLKNAVEWGNSHAQQAQNIGRTGQDFIREQVNIDQV 296
Query: 404 YDYMLHLITEYSKLLDYKPAPPSSAFEACVESL 436
Y+YM HL+ EYSKL Y P P C ++
Sbjct: 297 YNYMFHLLLEYSKLQKYTPKIPKGGQVFCKRAI 329
>gi|356561570|ref|XP_003549054.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
2-like [Glycine max]
Length = 426
Score = 317 bits (812), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 160/366 (43%), Positives = 226/366 (61%), Gaps = 39/366 (10%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCPD+F+ IH+DL PWA++ A A LR +I++G+ Y++ Y Q+R +F+
Sbjct: 55 TCPDYFRWIHEDLRPWARTE-------HAPTSANLRFIILKGRAYLETYSRPYQTRDVFS 107
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS----FPLPLFRYCTNDAHFDIPF 220
IWG LQLLRRYPG +PD+++MFDC+D PV+ + P PLFRYC NDA D+ F
Sbjct: 108 IWGILQLLRRYPGKIPDLELMFDCVDWPVVXSDRYNGPNVEQPPPLFRYCGNDATLDVVF 167
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
DWSFWGW+E+N++PW+ ++K G+ W + P+AYWKGNP V + R LMKCN
Sbjct: 168 LDWSFWGWAEINIKPWHILLGELKEGTTRIPWLNREPYAYWKGNPAV-AETRXYLMKCNV 226
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
+++ W A +L Q+W E+++GF KS L +QC +RYK+Y EG AWSVS KYILSC+S L
Sbjct: 227 SENQDWNARLLAQDWLRESQEGFNKSDLPSQCTYRYKVYIEGSAWSVSQKYILSCDSTTL 286
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
++ +Y DFF+RGLIP +++PI D CRSIK VDWGN + A +IGK
Sbjct: 287 LVKPKYYDFFTRGLIPVHHYWPIKDDDKCRSIKFAVDWGNNHKQRAHQIGKVA------- 339
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSP 457
+ K YKP+ ++A E CVES++C A+ ++ +E P+
Sbjct: 340 ---------------FKK--RYKPSISANATELCVESMVCGAEGSVKKFMMESLVKVPAN 382
Query: 458 YPPCTL 463
PCT+
Sbjct: 383 TDPCTM 388
>gi|302783306|ref|XP_002973426.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300159179|gb|EFJ25800.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 449
Score = 317 bits (812), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 160/354 (45%), Positives = 219/354 (61%), Gaps = 10/354 (2%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
S Q CPD+FK I +DL PW + I+ + EAK A R++IV GKLY++ Y C Q
Sbjct: 90 SAQAQGKCPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYCYQ 149
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF----PLPLFRYCTN-D 213
+RA +T+WG LL +PG VPD+++MF C D+P + +K + S P PLF YCT D
Sbjct: 150 TRAQYTLWGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVWPPPPLFSYCTTRD 209
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
H+DI FPDWSFWGW EVN+ PW E + I G++ W ++ P A WKGN + +R
Sbjct: 210 EHYDIVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGN-TWMGKIRP 268
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
L++CN S + QNW +E K+ F S LS QC++RYK+Y EG WSVSLKY++SC
Sbjct: 269 MLVQCNSST--SILVYHQNWDDEIKNNFSSSDLSKQCSYRYKVYVEGIGWSVSLKYVMSC 326
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
S L I+ QY +F+SR LIP + P+ +C+SI+ V+WGN P +A +G+ Q+
Sbjct: 327 GSTMLQINPQYLEFYSRSLIPYLHFIPVRKTKICQSIQEAVEWGNTFPHKALSLGRCAQN 386
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQR 445
F+ E LTMD VY+YML L+ Y+KLL +KP P ++S L P KQR
Sbjct: 387 FLQEQLTMDYVYEYMLLLLQRYAKLLKFKPVPLQGVQAMSLDSNLPWHSPSKQR 440
>gi|302762508|ref|XP_002964676.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168405|gb|EFJ35009.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 330
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 147/334 (44%), Positives = 211/334 (63%), Gaps = 9/334 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F IH+ L PW + IT R + R ++R+ I+R +YV Y DC QSR F++
Sbjct: 3 CPRYFARIHEHLAPWRSTGITRRSL---DRLGSMRVAILRSDMYVSAYGDCPQSRRAFSL 59
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
WG L +L R+ VPDV+ + +C D+P++ ++ P P+ YC++ DIPFPD+SF
Sbjct: 60 WGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGSEVPAPVLSYCSHRLSLDIPFPDYSF 118
Query: 226 WGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG 285
WGW EVN++PW EE ++I GSQ W + PFA+WKGN + LR L +CN +K G
Sbjct: 119 WGWPEVNIRPWEEESQEIFQGSQDVEWSNRQPFAFWKGNLR-MGKLRTLLARCNSTKF-G 176
Query: 286 AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYK 345
+L QNW +EA G++ S L QCN RY IYAEG AWS SLKY ++C S L + +Y
Sbjct: 177 TLVLDQNWIDEANIGYENSNLCKQCNQRYNIYAEGAAWSASLKYKMACGSTLLYLDSEYD 236
Query: 346 DFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDR 402
+FF++GL+P + PI S D+C+S+K+ V WGN++ +A+ IG+ GQDF+ E + +D+
Sbjct: 237 EFFTKGLLPNIHFLPISSKEEDMCQSLKNAVGWGNSHAQQAQNIGRTGQDFIREQVNIDQ 296
Query: 403 VYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL 436
VY+YM HL+ EYSKL Y P P C ++
Sbjct: 297 VYNYMFHLLLEYSKLQKYTPKIPKGGQLFCKRAI 330
>gi|302773059|ref|XP_002969947.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300162458|gb|EFJ29071.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 325
Score = 314 bits (804), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 154/325 (47%), Positives = 216/325 (66%), Gaps = 5/325 (1%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
CP +F IH+DL PW++S IT H+ EA+R R++IV GKLY + +C+Q+R T
Sbjct: 1 NCPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEIT 60
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLP-LFRYCTN--DAHFDIPFP 221
+ G L LL ++PGM+PD++++F C D P + K E+ S P P +F YCT+ HFDI FP
Sbjct: 61 LQGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEYRSKPPPPVFSYCTSRFGGHFDILFP 120
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
DWSFWGW +V ++PW +E +I G+ W E+ AYWKGN V++P+R EL++CN++
Sbjct: 121 DWSFWGWPQVKIRPWEQESVEIFDGANETDWFEREAIAYWKGNLWVMTPVREELLQCNNT 180
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + +WA+E +GFK S LS QC+ RYK+YAEG WS SLKY++SC S L I
Sbjct: 181 Q-YNVVVYYLDWAKEEAEGFKTSSLSKQCHTRYKVYAEGRGWSASLKYLISCGSTILHIQ 239
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
+ +FF+R +P ++PI ++C SIK VDWGNA+P EA IGK GQ F+ E LTM
Sbjct: 240 PDFWEFFARSWLPYVEYWPISRENMCSSIKHAVDWGNAHPFEASAIGKRGQAFLKEQLTM 299
Query: 401 DRVYDYMLHLITEYSKLLDYKPAPP 425
D VY YMLH++ Y+KL +KP P
Sbjct: 300 DHVYSYMLHVMQAYAKLQRFKPEVP 324
>gi|6633846|gb|AAF19705.1|AC008047_12 F2K11.20 [Arabidopsis thaliana]
Length = 605
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 169/404 (41%), Positives = 241/404 (59%), Gaps = 45/404 (11%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
++CPD+FK IH+DL+PW ++ IT + K A R++I+ GK++V+ Y +Q+R F
Sbjct: 168 RSCPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAF 227
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHG-------SFPLPLFRYCTNDAHF 216
T+WG LQLLR+YPG +PDVD+MFDC D+PVI + + P PLFRYC +
Sbjct: 228 TLWGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTV 287
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
DI FPDWSFWGW E+N++ W++ K+++ G + K + E+ +AYWKGNP V SP R +L+
Sbjct: 288 DIVFPDWSFWGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLL 347
Query: 277 KCNDSKL--WGAEILRQN----------------------------WAEEAKDGFKKSKL 306
CN S L W A I Q + + G++K K
Sbjct: 348 TCNLSSLHDWNARIFIQVCFYNQIYLYLSSYAIYIYIYIYIYIFDLYLHANRIGYQKDKE 407
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADL 366
+ YKIY EGYAWSVS KYIL+C+SV L++ Y DFFSR L P ++++PI D
Sbjct: 408 DLRI-RMYKIYIEGYAWSVSEKYILACDSVTLMVKPYYYDFFSRTLQPLQHYWPIRDKDK 466
Query: 367 CRSIKSVVDWGNANPSEAEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLDYKPAPP 425
CRSIK VDW N + +A++IG+ +FM+ L+M+ VYDYM HL+ EYSKLL YKP P
Sbjct: 467 CRSIKFAVDWLNNHTQKAQEIGREASEFMQRDLSMENVYDYMFHLLNEYSKLLKYKPQVP 526
Query: 426 SSAFEACVESLLCLADPKQRQNLEKA------AASPSPYPPCTL 463
++ E C E+L+C ++ + ++K + P PC+L
Sbjct: 527 KNSVELCTEALVCPSEGEDVNGVDKKFMIGSLVSRPHASGPCSL 570
>gi|357119765|ref|XP_003561604.1| PREDICTED: uncharacterized protein LOC100842005 [Brachypodium
distachyon]
Length = 616
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 162/371 (43%), Positives = 230/371 (61%), Gaps = 16/371 (4%)
Query: 108 DFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIW 166
D+F+ IH DL PW A IT+ + A AA R+ +V G+ YV+ Y+ Q+R +FT W
Sbjct: 216 DYFRYIHSDLSPWRASGGITLPTLERAIPHAAFRLTVVSGRAYVETYHRAFQTRDVFTQW 275
Query: 167 GFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS----FPLPLFRYCTNDA-HFDIPFP 221
G LQLL RYPG VPD+D MF+ D P I + + P PLFRYC + +I FP
Sbjct: 276 GILQLLARYPGRVPDLDAMFNLEDMPEIFRNDADGNPAPPPPPLFRYCKDGGDSVEILFP 335
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND- 280
DWSFWGW EVN++PW ++ ++A W+ + PFA+WKGNP V S R +L KCN+
Sbjct: 336 DWSFWGWPEVNIRPWAPLMENFVRENRALPWQNREPFAFWKGNPYV-SNARKDLFKCNND 394
Query: 281 ---SKLWGAEILRQNWAEEAKDGFK---KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
K + A + +W +++GFK + L+ QC +RYKIY +G +WSVS KYIL+C+
Sbjct: 395 SAAGKEFNARLFDVDWRAASRNGFKDDGSTNLAKQCKYRYKIYVQGRSWSVSEKYILACD 454
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF 394
S L I ++DFFSRGL+ ++++PI A+ C ++K VDWGN +P E ++G+ G F
Sbjct: 455 SPMLAIDTSFRDFFSRGLVAGEHYWPIDPAEKCDAVKFAVDWGNKHPRETMRLGEEGSRF 514
Query: 395 M-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAA 452
E + MD VYDYMLH++TEY+ LL YKP P A E C E++ C A+ ++R+ +E
Sbjct: 515 AREEMGMDFVYDYMLHVLTEYAALLRYKPTVPEKAVELCPEAMACGAEGREREFMMESRE 574
Query: 453 ASPSPYPPCTL 463
+ Y PC+L
Sbjct: 575 RHVAGYEPCSL 585
>gi|302761722|ref|XP_002964283.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168012|gb|EFJ34616.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 401
Score = 311 bits (796), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 151/338 (44%), Positives = 211/338 (62%), Gaps = 12/338 (3%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIW 166
P +F SI DL PW IT + A+R ++R++I +GKLY++ Y C QSR++FT W
Sbjct: 58 PKYFDSIDTDLRPWKDGGITKSSLSAARRKGSMRMVISQGKLYIEVYGKCPQSRSIFTAW 117
Query: 167 GFLQLLRRYPGMVPDVDIMFDCMDKPVIDK-----KEHGSFPLPLFRYCTNDAHFDIPFP 221
G L LL R+PG VPDVD + +C D+PVI + + P +F YCT + DIPFP
Sbjct: 118 GLLLLLERFPGKVPDVDFVLNCKDRPVITRFLSFQRFVSGSPPAVFSYCTTNDMLDIPFP 177
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
D+SFWGW EV++ PW E+ + I GS+ W E+ P A+WKGN + LR L++C
Sbjct: 178 DFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSR-MGKLRRHLLQCQSL 236
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ EIL Q+W E++ GF S+LS QC R+ IY EG AWS SLKY ++C S L +
Sbjct: 237 E---TEILDQDWISESRAGFPNSRLSQQCKDRFNIYVEGAAWSASLKYRMACGSTMLNVE 293
Query: 342 QQYKDFFSRGLIPTKNHFPIPSA--DLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+Y++FFS GLIP H I + +C+ I++ V WGN++P EAE IG+ GQDF+ + L
Sbjct: 294 SKYREFFSAGLIPNLTHLAISANPDTMCQEIQAAVKWGNSHPLEAEAIGRHGQDFITKEL 353
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL 436
TMD VY YML LI++Y+KL + P P A C +++
Sbjct: 354 TMDHVYRYMLELISQYAKLQRFTPTIPHGAQILCKDAI 391
>gi|302799316|ref|XP_002981417.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
gi|300150957|gb|EFJ17605.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
Length = 342
Score = 310 bits (795), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 157/341 (46%), Positives = 218/341 (63%), Gaps = 21/341 (6%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F IH+DL PW++S IT H+ EA+R R++IV GKLY + +C+Q+R T+
Sbjct: 1 CPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEITL 60
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLP-LFRYCTN--DAHFDIPFPD 222
G L LL ++PGM+PD++++F C D P + K E+ S P P +F YCT+ HFDI FPD
Sbjct: 61 QGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEYRSKPPPPIFSYCTSRFGGHFDILFPD 120
Query: 223 WSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSK 282
WSFWGW +V ++PW +E +I G+ W E+ AYWKGN V++P+R EL++CN+++
Sbjct: 121 WSFWGWPQVKIRPWEQESVEIFEGANETDWFEREAIAYWKGNLWVMTPVREELLQCNNTQ 180
Query: 283 LW------------GAEIL-----RQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSV 325
G E+ Q+WA+E +GFK S LS QC+ RYK+YAEG WS
Sbjct: 181 YNVVVYYLVRIHANGDEVFTFFFNSQDWAKEEAEGFKTSSLSKQCHTRYKVYAEGRGWSA 240
Query: 326 SLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAE 385
SLKY++SC S L I + +FF+R +P ++PI ++C SIK VDWGNA+P EA
Sbjct: 241 SLKYLISCGSTILHIQPDFWEFFARSWLPYVEYWPISRENMCSSIKHAVDWGNAHPFEAS 300
Query: 386 KIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPP 425
IGK GQ F+ E LTMD VY YMLH++ Y+KL +KP P
Sbjct: 301 AIGKRGQAFLKEQLTMDHVYSYMLHVMQAYAKLQRFKPEVP 341
>gi|302815691|ref|XP_002989526.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
gi|300142704|gb|EFJ09402.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
Length = 401
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 155/357 (43%), Positives = 219/357 (61%), Gaps = 18/357 (5%)
Query: 96 RVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYD 155
RV + + + P +F SI DL PW IT + A++ ++R++I +GKLY++ Y
Sbjct: 45 RVDNHNQGHSRPKYFDSIDTDLRPWKDGGITKSSLSAARKKGSMRMVISQGKLYIEVYGK 104
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL---------PL 206
C QSR++FT WG L LL R+PG VPDVD + +C D+PVI + + SF +
Sbjct: 105 CPQSRSIFTAWGLLLLLERFPGKVPDVDFVLNCKDRPVITR--YSSFHSRDLCQDEAPAV 162
Query: 207 FRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPD 266
F YCT + DIPFPD+SFWGW EV++ PW E+ + I GS+ W E+ P A+WKGN
Sbjct: 163 FSYCTTNDMLDIPFPDFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSR 222
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
+ LR L++C + EIL Q+W E++ GF S+LS QC R+ IY EG AWS S
Sbjct: 223 -MGKLRSHLLQCQSLE---TEILDQDWISESRAGFPNSRLSQQCKDRFNIYVEGAAWSAS 278
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSA--DLCRSIKSVVDWGNANPSEA 384
LKY ++C S L + +Y++FFS GLIP H I + +C+ I++ V WGN++P EA
Sbjct: 279 LKYRMACGSTMLNVESKYREFFSAGLIPNLTHLAISANPDTMCQEIQAAVKWGNSHPLEA 338
Query: 385 EKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLA 440
E IG+ GQDF+ + LTMD VY YML LI++Y+KL + P P A C +++ LA
Sbjct: 339 EAIGRHGQDFITKELTMDHVYRYMLELISQYAKLQRFTPTIPHGAQILCKDAIKHLA 395
>gi|302789424|ref|XP_002976480.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
gi|300155518|gb|EFJ22149.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
Length = 357
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 159/351 (45%), Positives = 216/351 (61%), Gaps = 14/351 (3%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CPD+FK I +DL PW + I+ + EAK A R++IV GKLY++ Y C Q+RA +T+
Sbjct: 1 CPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYCYQTRAQYTL 60
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF-------PLPLFRYCTN-DAHFD 217
WG LL +PG VPD+++MF C D+P + +K + S P PLF YCT D H+D
Sbjct: 61 WGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVKRRWPPPPLFSYCTTRDEHYD 120
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I FPDWSFWGW EVN+ PW E + I G++ W ++ P A WKGN + +R L++
Sbjct: 121 IVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGN-TWMGKIRPMLVQ 179
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR-YKIYAEGYAWSVSLKYILSCNSV 336
CN S + QNW EE K+ F S LS QC++R YK+Y EG WSVSLKY++SC S
Sbjct: 180 CNSST--SILVYHQNWDEEIKNNFSSSDLSKQCSYRWYKVYVEGIGWSVSLKYVMSCGST 237
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L I QY +F+SR LIP + P+ +C+SI+ V+WGN P +A +G+ Q+F+
Sbjct: 238 MLQIDPQYLEFYSRSLIPYLHFIPVRKTKICQSIQEAVEWGNTFPHKALSLGRCAQNFLQ 297
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQR 445
E LTMD VY+YML L+ Y+KLL +KP P ++S L P KQR
Sbjct: 298 EQLTMDYVYEYMLLLLQRYAKLLKFKPVPLQGVQAMSLDSNLPWHSPSKQR 348
>gi|302795177|ref|XP_002979352.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300153120|gb|EFJ19760.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 329
Score = 308 bits (790), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 204/327 (62%), Gaps = 6/327 (1%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F I +DL PW IT+ + EAK+ A R+LI+ G LY++ Y+ C Q+R FT+
Sbjct: 4 CPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDFTL 63
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLP--LFRYCTN--DAHFDIPFP 221
G LL R+PGMVPDVD+MF C D P I + + P P LF YCT ++DI FP
Sbjct: 64 RGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAKGSDDPSPPPLFSYCTTARGENYDIVFP 123
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
DWSFWGW EVN+ PW EE + I G++ + W ++ FAYWKGN R +L+KC +
Sbjct: 124 DWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNY-WTGAARPDLVKCAAN 182
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
K ++ Q+W E GFK S +S QC HRYK+Y EG WS SLKYI+SC S L I+
Sbjct: 183 KDLFVSMISQDWNAEKGQGFKSSDMSKQCTHRYKVYVEGRGWSASLKYIMSCGSTVLFIN 242
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME-SLTM 400
+ +FFSR LIP +++PI D+C SIK+ VDWGNA P EAE +GK Q F++ L M
Sbjct: 243 PDFHEFFSRSLIPFVHYWPINRTDICNSIKAAVDWGNAFPEEAEAMGKCAQTFLDMELDM 302
Query: 401 DRVYDYMLHLITEYSKLLDYKPAPPSS 427
VY YML L+ Y++LL ++P P
Sbjct: 303 KFVYQYMLLLLQHYAQLLKFEPVLPEG 329
>gi|302821372|ref|XP_002992349.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
gi|300139892|gb|EFJ06625.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
Length = 330
Score = 308 bits (789), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/325 (47%), Positives = 203/325 (62%), Gaps = 6/325 (1%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP +F I +DL PW IT+ + EAK+ A R+LI+ G LY++ Y+ C Q+R FT+
Sbjct: 6 CPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDFTL 65
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLP--LFRYCTN--DAHFDIPFP 221
G LL R+PGMVPDVD+MF C D P I + + P P LF YCT ++DI FP
Sbjct: 66 RGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAKGSDDPSPPPLFSYCTTARGENYDIVFP 125
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
DWSFWGW EVN+ PW EE + I G++ + W ++ FAYWKGN R +L+KC +
Sbjct: 126 DWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNY-WTGAARPDLVKCAAN 184
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
K ++ Q+W E GFK S +S QC HRYK+Y EG WS SLKYI+SC S L I
Sbjct: 185 KDLFVSMISQDWNAEKGQGFKSSDMSKQCTHRYKVYVEGRGWSASLKYIMSCGSTVLFIH 244
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME-SLTM 400
+ +FFSR LIP +++PI D+C SIK+ VDWGNA P EAE +GK Q F++ L M
Sbjct: 245 PDFHEFFSRSLIPFVHYWPINRTDICNSIKAAVDWGNAFPEEAEAMGKCAQTFLDMELDM 304
Query: 401 DRVYDYMLHLITEYSKLLDYKPAPP 425
VY YML L+ Y++LL ++P P
Sbjct: 305 KFVYQYMLLLLQHYAQLLKFQPVLP 329
>gi|302761718|ref|XP_002964281.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168010|gb|EFJ34614.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 342
Score = 307 bits (787), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 215/342 (62%), Gaps = 12/342 (3%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CP +F I + L PW + IT R + +R ++R+ I+ G++YV Y DC QSR++F+
Sbjct: 3 SCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSVFS 62
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDK-KEHGSFPLPLFRYCTNDAHFDIPFPDW 223
+WG L +L R+ VPDV+ + +C D+P++ + +G P P+ YC++ DIPFPD+
Sbjct: 63 LWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGGVPSPVLSYCSHRHSLDIPFPDY 122
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL 283
SFWGW EVN++PW +E ++I GSQ W ++ P A+WKGN + LR L +CN ++
Sbjct: 123 SFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLR-MGKLRNLLARCNSTEF 181
Query: 284 WGAEILRQNWAEEAKDGFKKSKLSNQCNHR------YKIYAEGYAWSVSLKYILSCNSVA 337
G + QNW EA G+ SKL QCN R Y IYAEG AWS SLKY ++C S
Sbjct: 182 -GTLVFDQNWIAEANVGYANSKLCKQCNQRHGYSPLYNIYAEGAAWSASLKYKMACGSTL 240
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
L + +Y +FF++GL+P + PI S ++C+S++ V+WGN++ EA+ IG++GQ FM
Sbjct: 241 LYLDSEYDEFFTKGLLPNIHFMPISSKEEEMCQSLRDAVNWGNSHAHEAQNIGRSGQAFM 300
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL 436
E + +D+VY+YM HL++EYS+L + P P C +L
Sbjct: 301 REQVNIDQVYNYMFHLLSEYSRLQRFTPEIPRGGRFFCRRAL 342
>gi|15233074|ref|NP_191688.1| uncharacterized protein [Arabidopsis thaliana]
gi|6850896|emb|CAB71059.1| putative protein [Arabidopsis thaliana]
gi|332646663|gb|AEE80184.1| uncharacterized protein [Arabidopsis thaliana]
Length = 455
Score = 307 bits (787), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 152/346 (43%), Positives = 220/346 (63%), Gaps = 11/346 (3%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
++TCPD+F+ I +DL+ W ++ IT + AK A R++I G+LYV Y +SR +
Sbjct: 41 LETCPDYFRWIQQDLKVWEETGITRETLERAKPKAHFRLVIKSGRLYVHQYDKAYESRDV 100
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE------HGSFPLP-LFRYCTNDAH 215
TIWG LQLLR YPG VPD++++F C D P I K++ + ++P P LF+YC +
Sbjct: 101 LTIWGILQLLRMYPGQVPDLELLFFCHDIPAIWKRDFRQPEPNATWPPPPLFQYCGHREA 160
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
+ I FPDWSFWGW EVN++ W + I+ ++ W +++P+AYWKGN V R L
Sbjct: 161 YGIVFPDWSFWGWPEVNIKEWTKLSVAIREANKRVKWNDRVPYAYWKGNSGVHRE-RGNL 219
Query: 276 MKCNDSKLWG--AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
MKCN S + + Q+W +E + GFK S L +QC HRYKIY EG AWSVS KYIL+C
Sbjct: 220 MKCNFSDKYDPMVRLYEQDWGKEREIGFKSSNLEDQCTHRYKIYIEGRAWSVSKKYILAC 279
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+S+ L+I +Y DFF R L+P ++++PI S + C +K V+WGN N +A+ IG+ G D
Sbjct: 280 DSMTLLIKAEYFDFFGRSLVPLEHYWPIKSHEKCGDLKFAVEWGNNNTKKAQVIGRQGSD 339
Query: 394 F-MESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC 438
+ M++L M VYDYML+++ Y KL+ P +A E C E++ C
Sbjct: 340 YIMKNLEMKYVYDYMLYVLQGYGKLMKLDVTVPENATEVCSETMAC 385
>gi|302815695|ref|XP_002989528.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
gi|300142706|gb|EFJ09404.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
Length = 342
Score = 307 bits (786), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 215/342 (62%), Gaps = 12/342 (3%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CP +F I + L PW + IT R + +R ++R+ I+ G++YV Y DC QSR++F+
Sbjct: 3 SCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSVFS 62
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDK-KEHGSFPLPLFRYCTNDAHFDIPFPDW 223
+WG L +L R+ VPDV+ + +C D+P++ + +G P P+ YC++ DIPFPD+
Sbjct: 63 LWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGGAPAPVLSYCSHRHSLDIPFPDY 122
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL 283
SFWGW EVN++PW +E ++I GSQ W ++ P A+WKGN + LR L +CN ++
Sbjct: 123 SFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLR-MGKLRNLLARCNSTEF 181
Query: 284 WGAEILRQNWAEEAKDGFKKSKLSNQCNHR------YKIYAEGYAWSVSLKYILSCNSVA 337
G + QNW EA G+ SKL QCN R Y IYAEG AWS SLKY ++C S
Sbjct: 182 -GTLVFDQNWIAEANVGYANSKLCKQCNQRHGYSPLYNIYAEGAAWSASLKYKMACGSTL 240
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
L + +Y +FF++GL+P + PI S ++C+S++ V+WGN++ EA+ IG++GQ FM
Sbjct: 241 LYLDSEYDEFFTKGLLPNIHFMPISSKEEEMCQSLRDAVNWGNSHAHEAQNIGRSGQAFM 300
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL 436
E + +D+VY+YM HL++EYS+L + P P C +L
Sbjct: 301 REQVNIDQVYNYMFHLLSEYSRLQRFTPEIPRGGRFFCRRAL 342
>gi|297824641|ref|XP_002880203.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
lyrata]
gi|297326042|gb|EFH56462.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
lyrata]
Length = 522
Score = 304 bits (778), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 154/371 (41%), Positives = 222/371 (59%), Gaps = 16/371 (4%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCPD+F+ I+KDLE W + IT + A A R++I G++YV Y Q+R +F
Sbjct: 117 ETCPDYFRWIYKDLEAWRGTGITRETLERASDKAHFRLIIKGGRVYVQQYKKSFQTRDVF 176
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL--------PLFRYCTNDAH 215
TIWG +QLLR YPG VPD++++F C D P I ++++ P PLF YC +
Sbjct: 177 TIWGIVQLLRMYPGQVPDLELLFMCDDLPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGA 236
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
FDI FPDWSFWGW E+N++ WN+ + I G + W+E+ P+AYWKGN V R +L
Sbjct: 237 FDIVFPDWSFWGWPEINIKEWNKMLEAISEGLKKVKWEEREPYAYWKGNARVGK--RRDL 294
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
M C+D + + Q+W +E G+K S L +QC HRYKIY EG AWSVS KYIL+C+S
Sbjct: 295 MNCHDPMV---HLYSQDWWKEGSIGYKTSNLEDQCTHRYKIYVEGRAWSVSEKYILACDS 351
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ L++ Y DFF+R L+P ++++PI + C I V WGN N A+ I + G ++
Sbjct: 352 MTLLVKPFYFDFFTRSLVPMEHYWPIRPREKCSDIVFAVHWGNNNTKRAKAIWRNGSGYI 411
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKA-A 452
++L M VYDYMLHL+ Y KL+ P A E C E++ C + + R +++ +
Sbjct: 412 RKNLKMKYVYDYMLHLLQSYGKLMTMNVEVPQGAKEVCPETIACPINGGRMRHSMDDSLV 471
Query: 453 ASPSPYPPCTL 463
SPS C +
Sbjct: 472 MSPSVKAACEM 482
>gi|302802289|ref|XP_002982900.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
gi|300149490|gb|EFJ16145.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
Length = 345
Score = 301 bits (771), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 207/349 (59%), Gaps = 8/349 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP ++ + +DL PW + I + A++ +A R ++ G+LYV+ YY C Q+R MFTI
Sbjct: 2 CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
G +QLL+R+PG +PDVDI F+C D+P I K P PLF YC+ HFDIPFPDWSF
Sbjct: 62 VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAFDEAPPPLFGYCSTKNHFDIPFPDWSF 121
Query: 226 WGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG 285
WGW E + PW + K I +Q WK++ W+G+P S +R L+ CN +
Sbjct: 122 WGWPENKILPWRSQLKRI---TQQAEWKDRDSSVQWRGDPRT-SQIRQRLIACNSTGDKT 177
Query: 286 AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYK 345
+ QNW +++ + KL + C+ RYK+YAEGYAWS S KYI+ C S L I Y
Sbjct: 178 LLVHGQNWRDQSD--LQNWKLESHCHSRYKLYAEGYAWSASYKYIMGCGSTVLAIDSDYY 235
Query: 346 DFFSRGLIPTKNHFPIP-SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRV 403
+FF+R L ++ PI +LC+SI + WG ++P EA+ I GQ F+ E L++D+V
Sbjct: 236 EFFTRDLKAGVHYVPISREGNLCQSISNARQWGESHPGEAQAIATRGQRFLVEDLSLDQV 295
Query: 404 YDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAA 452
Y YMLHLI EY KL +KP P A + CLA P+ + LE+ A
Sbjct: 296 YGYMLHLIQEYGKLQKFKPPVPREAHVVHPGLVKCLAHPRDLETLERTA 344
>gi|302800329|ref|XP_002981922.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300150364|gb|EFJ17015.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 345
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 207/349 (59%), Gaps = 8/349 (2%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
CP ++ + +DL PW + I + A++ +A R ++ G+LYV+ YY C Q+R MFTI
Sbjct: 2 CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
G +QLL+R+PG +PDVDI F+C D+P I K P PLF YC+ HFDIPFPDWSF
Sbjct: 62 VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAFDEAPPPLFGYCSTKNHFDIPFPDWSF 121
Query: 226 WGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG 285
WGW E + PW + K I ++ WK++ W+G+P S +R L+ CN +
Sbjct: 122 WGWPENKILPWRSQLKRI---TRQAEWKDRDSSVQWRGDPRT-SQIRQRLIACNSTGDKT 177
Query: 286 AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYK 345
+ QNW +++ + KL + C+ RYK+YAEGYAWS S KYI+ C S L I Y
Sbjct: 178 LLVHGQNWRDQSD--LQNWKLESHCHSRYKLYAEGYAWSASYKYIMGCGSTVLAIDSDYY 235
Query: 346 DFFSRGLIPTKNHFPIP-SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRV 403
+FF+R L ++ PI +LC+SI + WG ++P EA+ I GQ F+ E+L +D+V
Sbjct: 236 EFFTRDLKAGVHYVPISREGNLCQSISNARQWGESHPGEAQAIATRGQRFLVEALNLDQV 295
Query: 404 YDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAA 452
Y YMLHLI EY KL +KP P A + CLA P+ + LE+ A
Sbjct: 296 YGYMLHLIQEYGKLQKFKPPVPREAHVVHPALVKCLAHPRDLETLERTA 344
>gi|302819894|ref|XP_002991616.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
gi|300140649|gb|EFJ07370.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
Length = 324
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 152/323 (47%), Positives = 201/323 (62%), Gaps = 8/323 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
QTCP +FK I DL PW K IT + + K A R++I+ GKLY Y C Q+RA +
Sbjct: 1 QTCPSYFKWIEHDLAPW-KGGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEY 59
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-GSFPLPLFRYCTN-DAHFDIPFP 221
T+ G LL ++PGMVPDVDIMF+C D P++ + + + P P+F YCT + H+DI FP
Sbjct: 60 TLKGLAMLLNKFPGMVPDVDIMFNCQDHPLVPRWRYLFTSPPPVFGYCTTRNRHYDIVFP 119
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
DWS WGW EVN+ PW+ E + I ++ W + P AYW+GN + +R L+KCN +
Sbjct: 120 DWSIWGWPEVNILPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQ-MGLIRSNLVKCNST 178
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ I Q+W E K F S LSNQC RYKIYAEG AWSVSLKYILSC S L I
Sbjct: 179 NI---LIQHQDWITEEKANFTNSDLSNQCFSRYKIYAEGNAWSVSLKYILSCGSTMLRIE 235
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
Y DFFSR L+P + PI ++C SI+ + W N+N +A +GK GQ+F+ E L+
Sbjct: 236 PYYWDFFSRSLLPHVHFLPITRENICDSIQEAIQWSNSNIYKAAMVGKCGQNFLKEQLST 295
Query: 401 DRVYDYMLHLITEYSKLLDYKPA 423
D VY YMLH++ Y+KL +KP
Sbjct: 296 DYVYQYMLHILQRYAKLQKFKPV 318
>gi|302776736|ref|XP_002971516.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300160648|gb|EFJ27265.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 318
Score = 294 bits (753), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 198/322 (61%), Gaps = 8/322 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
QTCP +F I DL PW K IT + + K A R++I+ GKLY Y C Q+RA +
Sbjct: 2 QTCPSYFNWIEHDLAPW-KGGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEY 60
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-GSFPLPLFRYCTN-DAHFDIPFP 221
T+ G LL +PGMVPDVDIMF+C D P++ + + + P P+F YCT + H+DI FP
Sbjct: 61 TLKGLAMLLNEFPGMVPDVDIMFNCQDHPLVPRWRYLFTSPPPVFGYCTTRNRHYDIVFP 120
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
DWS WGW EVN+ PW+ E + I ++ W + P AYW+GN + +R L+KCN +
Sbjct: 121 DWSIWGWPEVNIPPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQ-MGLIRSNLVKCNST 179
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ I Q+W E K F S LSNQC RYKIYAEG AWSVSLKYILSC S L I
Sbjct: 180 NI---LIQHQDWITEEKANFTNSDLSNQCFSRYKIYAEGNAWSVSLKYILSCGSTMLRIE 236
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
Y DFFSR L+P + PI ++C SI+ + W N+N EA +GK Q+F+ E L+
Sbjct: 237 PYYWDFFSRSLLPHVHFLPITRENICDSIQEAIQWSNSNIYEAAMVGKCAQNFLKEQLST 296
Query: 401 DRVYDYMLHLITEYSKLLDYKP 422
D VY YMLH++ Y+KL +KP
Sbjct: 297 DYVYQYMLHILQRYAKLQKFKP 318
>gi|414586114|tpg|DAA36685.1| TPA: hypothetical protein ZEAMMB73_061596 [Zea mays]
Length = 433
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 138/287 (48%), Positives = 193/287 (67%), Gaps = 8/287 (2%)
Query: 105 TCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+CP +F+ IH+DL PW A IT + A+ A R++++ G YV + Q+R +F
Sbjct: 144 SCPSYFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGNAYVQRFRPAFQTRDLF 203
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVID----KKEHGSFPLPLFRYCTNDAHFDIP 219
TIWG LQLLRR+PG VPD+D+MFD +D PV+ + ++ PLFRYC +D DI
Sbjct: 204 TIWGVLQLLRRHPGRVPDLDLMFDTVDWPVVRSHLYRGKNAEMLPPLFRYCGDDKTLDIV 263
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
FPDWSFWGW E+N++PW+ +D+K G+ W ++ P+AYWKGNP V S R EL+KCN
Sbjct: 264 FPDWSFWGWPEINIKPWDALKEDLKAGNNRVRWMDREPYAYWKGNPSV-SGTRKELVKCN 322
Query: 280 DSKL--WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
S W A I Q+W +E+K G+K S LS+QC HRYKIY EG AWS+S KYIL+C+S+
Sbjct: 323 VSSTHDWNARIYAQDWFKESKAGYKDSDLSSQCAHRYKIYIEGSAWSISEKYILACDSMT 382
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEA 384
L+++ +Y DFFSR L+P ++++P+ + C SIK VDWGN++ A
Sbjct: 383 LLVTPRYYDFFSRSLMPIQHYWPVRYDNKCASIKYAVDWGNSHKQLA 429
>gi|222629271|gb|EEE61403.1| hypothetical protein OsJ_15589 [Oryza sativa Japonica Group]
Length = 535
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 153/367 (41%), Positives = 216/367 (58%), Gaps = 16/367 (4%)
Query: 109 FFKSIHKDLEPWAKS-RITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWG 167
+F+ IH+DL PW + IT + A+ A+ R+L++ G+ +V + +
Sbjct: 140 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFARHSRPAISSPSGA 199
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-------FPLPLFRYCTNDAHFDIPF 220
PG P C P S F PLF YC +D DI F
Sbjct: 200 SSSCSAATPGGSP---TSTSCSTAPTGPSSARTSTAGSTLPFMPPLFSYCGDDRTLDIVF 256
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW E+N++PW+ +D+K G+ W +++P+AYWKGNP V + R EL+ CN
Sbjct: 257 PDWSFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAV-AVTRQELVNCNV 315
Query: 281 S--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
S K W A I +Q+W E+K G+K S L +QC HRYKIY EG AWSVS KYIL+C+S+ L
Sbjct: 316 STTKDWNARIYKQDWFRESKAGYKDSNLGSQCTHRYKIYIEGSAWSVSQKYILACDSMTL 375
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMES- 397
+++ +Y DFFSR L+P ++++P+ + + C SIK VDWGN++ A++IGK DF+E
Sbjct: 376 LVTPRYYDFFSRSLMPIQHYWPVHNDNKCDSIKYAVDWGNSHKQLAQRIGKQASDFIEED 435
Query: 398 LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPS 456
+ MDRVYDYMLHL+TEY+KLL ++P P A E C +SL C A+ +++ L E S
Sbjct: 436 VNMDRVYDYMLHLLTEYAKLLRFRPIKPPEAVEICPDSLACQAEGLEKKFLMESMVKSAR 495
Query: 457 PYPPCTL 463
PC L
Sbjct: 496 DAGPCDL 502
>gi|147791845|emb|CAN70601.1| hypothetical protein VITISV_027961 [Vitis vinifera]
Length = 362
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 203/321 (63%), Gaps = 20/321 (6%)
Query: 149 YVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFPLP 205
+V+ + +Q+R MFT+WG LQLLR YPG +PD+++MFDC D+PV+ D + + P P
Sbjct: 18 HVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDCDDRPVVRMRDFRGPSAAPPP 77
Query: 206 LFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNP 265
LFRYC +D DI FPDWSFWGW+E N++PW KDIK G++ WK+++P AYW+GNP
Sbjct: 78 LFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKWKDRVPLAYWRGNP 137
Query: 266 DVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAW 323
V +P R +L+KCN D W + Q+W +++K G+++S L + IY EG+AW
Sbjct: 138 HV-APTRGDLLKCNVSDKADWNTRLYLQDWDQQSKIGYRQSNLRGPMHPH--IYIEGWAW 194
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
SVS KYIL+C+S+ L+ +Y DFF RGL+P ++++PI + CR ++ WG S
Sbjct: 195 SVSEKYILACDSMTLLTRPRYYDFFIRGLVPLQHYWPIRDNNKCRDLRH-RQWGRRRASS 253
Query: 384 AEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPK 443
F E L MD VYDYM HL+ EY+KLL +KP P A E C E++ C A+
Sbjct: 254 ----------FQEDLKMDYVYDYMFHLLNEYAKLLKFKPTIPPGAVEVCAETMACPAEGA 303
Query: 444 QRQNLEKAAAS-PSPYPPCTL 463
R+ +E++ P+ PC+L
Sbjct: 304 WRKFMEESLEKIPTDTTPCSL 324
>gi|297745249|emb|CBI40329.3| unnamed protein product [Vitis vinifera]
Length = 341
Score = 281 bits (720), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 138/297 (46%), Positives = 187/297 (62%), Gaps = 9/297 (3%)
Query: 175 YPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSE 230
YPG +PD D+MF+C D+P+I + + P PLF YC +D +DI FPDWSFWGW E
Sbjct: 2 YPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGWPE 61
Query: 231 VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC--NDSKLWGAEI 288
N++PWN KD+K G+ W ++ P+AYWKGN + +R EL KC D + W A +
Sbjct: 62 TNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVK-MGVVRKELFKCRNTDEQDWNARL 120
Query: 289 LRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF 348
+W E + GFK S L++QC HRYKIY EG AWSVS KYIL+C+SV L++ QY +FF
Sbjct: 121 YIMDWGREVQSGFKTSDLASQCTHRYKIYTEGIAWSVSEKYILACDSVTLLVKPQYYEFF 180
Query: 349 SRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYM 407
+R L P +++PI D+C+SIK DW N + +A+KIGKAG F+ E + M VYDYM
Sbjct: 181 TRSLQPLVHYWPIKHKDMCKSIKFATDWCNNHTEKAQKIGKAGSSFVQEEIKMKFVYDYM 240
Query: 408 LHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
HL++ Y+KLL YKP P A E C E + C + ++ L+ SPS PC +
Sbjct: 241 FHLLSMYAKLLKYKPTVPPMAVEFCPEMMACAVEGLEKDYKLQSMVKSPSDTGPCIM 297
>gi|356530084|ref|XP_003533614.1| PREDICTED: KDEL motif-containing protein 1-like, partial [Glycine
max]
Length = 357
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 145/327 (44%), Positives = 213/327 (65%), Gaps = 15/327 (4%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCP++FK IH+DL+PW ++ IT + + + R++I++GK Y Y D ++R +FT
Sbjct: 35 TCPEYFKWIHEDLKPWKRTGITREMMERGRNVSYFRLVIIQGKAYAKKYADSYETRDVFT 94
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWS 224
+WG LQLLR YPG +PD++++ + DKP++DK++ P P+F YC + +DI FPDW
Sbjct: 95 VWGILQLLRLYPGDIPDLELLLETGDKPMVDKEQSQGPPPPIFHYCGHKNAYDIVFPDWI 154
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSK 282
F GW+++ ++ +E K IK WK++LP+A WKG V S R +L KCN D
Sbjct: 155 FRGWADLAIK-LDESNKKIK-------WKDRLPYAIWKGKTWV-SHKRNDLTKCNASDQH 205
Query: 283 LWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
A+I +W +E GF+ +KL +QC RYKIY EG AWSV YIL+ +S+ L I
Sbjct: 206 DSYAQIHPLHWDKEIAQGFQNTKLDDQCIQRYKIYVEGIAWSVIENYILAYDSMTLFIEP 265
Query: 343 QYKDFFSRGLIPTKNHF-PIPSAD--LCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
Y DFF+R L+P K ++ PI S + +C IK VV+WG+ANP +A+ IGKAG +F+ E+L
Sbjct: 266 IYYDFFTRSLVPRKQYYWPISSKNQSMCNDIKYVVEWGSANPDKAKAIGKAGTNFIKENL 325
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPP 425
M VYDYM +L+TEY++LL ++P P
Sbjct: 326 KMKFVYDYMFYLLTEYARLLTFEPTIP 352
>gi|413919042|gb|AFW58974.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 316
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 141/289 (48%), Positives = 190/289 (65%), Gaps = 11/289 (3%)
Query: 185 MFDCMDKPVIDKKE-HGSFPLP----LFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEE 239
MFDC D PV+ E HG+ P P LFRYC ++A DI FPDWSFWGW E+N++PW
Sbjct: 1 MFDCEDLPVVGAGERHGAQPQPPPPPLFRYCGSEATLDIAFPDWSFWGWPELNIKPWEAL 60
Query: 240 FKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS--KLWGAEILRQNWAEEA 297
++I + +W ++ P+AYWKGNP V + R+ L++CN S + W A + Q+W EE
Sbjct: 61 QREINEANAVVNWTDRAPYAYWKGNPKVGAE-RLLLLRCNASGERDWNARVYAQDWGEEV 119
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKN 357
+ GF+ S LS QC HRY+IY EG WSVS KYIL+C SVAL++ ++ DFFSRGL P ++
Sbjct: 120 RHGFRGSDLSKQCTHRYRIYIEGRGWSVSEKYILACESVALMVRPRFHDFFSRGLSPLRH 179
Query: 358 HFPIPS-ADLCRSIKSVVDWGNANPSEAEKI-GKAGQDFMESLTMDRVYDYMLHLITEYS 415
++P+ +CRSIK VDWGNA+ A+++ G A + E LTMDRVYDYM HL+TEY+
Sbjct: 180 YWPVRGDRGMCRSIKHAVDWGNAHADRAQEMAGNASRFIREELTMDRVYDYMFHLLTEYA 239
Query: 416 KLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
+LL Y+PA P A E VES++ +RQ ++ AAS S PC L
Sbjct: 240 RLLRYRPAVPRGAAEVTVESMIRGRRGLERQFMVDTLAASASGDGPCRL 288
>gi|222634972|gb|EEE65104.1| hypothetical protein OsJ_20160 [Oryza sativa Japonica Group]
Length = 498
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/369 (38%), Positives = 203/369 (55%), Gaps = 53/369 (14%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
+ +CPD+F+ IH DL PW + IT + +R A R+++V G+ YV+ Y Q+R
Sbjct: 146 EPAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTR 205
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
FT + D+ F
Sbjct: 206 DAFT----------------------------------------------QDATTLDVVF 219
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PDWSFWGW EVN+ W + ++ S W E+ PFA+WKGNP V + +R ELMKCN
Sbjct: 220 PDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGV-ARIRGELMKCNP 278
Query: 280 --DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
D K W A + Q+W +GFK S + QC HRYKIY EG AWSVS KYI++C+S
Sbjct: 279 ASDGKDWNARLFSQDWNHAIHNGFKDSSIPKQCLHRYKIYIEGEAWSVSEKYIMACDSPV 338
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
L ++ Y+D SRGL+ ++++PI +C SI++ VDWGNA+P+ A +IG+ G F+ E
Sbjct: 339 LFVNTPYQDILSRGLVAGEHYWPINRTRMCESIRAAVDWGNAHPAAARRIGEQGSRFVRE 398
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL-LCLADPKQRQNLEKAAAS- 454
+ MD VYDYM HLITEY+KLL YKP P++A E C ES+ A ++R+ ++++
Sbjct: 399 QMAMDYVYDYMFHLITEYAKLLRYKPTVPANAVEICAESMACAAAAGRERECMDESVEGF 458
Query: 455 PSPYPPCTL 463
+ + PC+L
Sbjct: 459 VAGFDPCSL 467
>gi|215736906|dbj|BAG95835.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 127/257 (49%), Positives = 180/257 (70%), Gaps = 7/257 (2%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
+ A+ A+ R+L++ G+ +V + Q+R +FTIWG LQLLRRYPG VPD+D+MFDC
Sbjct: 2 LARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDCA 61
Query: 190 DKPVID----KKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH 245
D PV+ + +H +F PLF YC +D DI FPDWSFWGW E+N++PW+ +D+K
Sbjct: 62 DWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLKD 121
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS--KLWGAEILRQNWAEEAKDGFKK 303
G+ W +++P+AYWKGNP V + R EL+ CN S K W A I +Q+W E+K G+K
Sbjct: 122 GNNRVKWLDRVPYAYWKGNPAV-AVTRQELVNCNVSTTKDWNARIYKQDWFRESKAGYKD 180
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
S L +QC HRYKIY EG AWSVS KYIL+C+S+ L+++ +Y DFFSR L+P ++++P+ +
Sbjct: 181 SNLGSQCTHRYKIYIEGSAWSVSQKYILACDSMTLLVTPRYYDFFSRSLMPIQHYWPVHN 240
Query: 364 ADLCRSIKSVVDWGNAN 380
+ C SIK VDWGN++
Sbjct: 241 DNKCDSIKYAVDWGNSH 257
>gi|222630210|gb|EEE62342.1| hypothetical protein OsJ_17131 [Oryza sativa Japonica Group]
Length = 267
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 118/198 (59%), Positives = 154/198 (77%), Gaps = 4/198 (2%)
Query: 230 EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEIL 289
E N++PW++EF+DIK G++A W++++ AYWKGNPDV SPLRV L+ CND+ +W AEI+
Sbjct: 10 ETNIEPWSKEFRDIKEGAKAIKWQDRVATAYWKGNPDVASPLRVALLNCNDTNMWHAEIM 69
Query: 290 RQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFS 349
RQNW EE K G+ SKLS+QC HRYKIYAEG+AWSVSLKYILSC S+AL+I QY+DFFS
Sbjct: 70 RQNWDEEVKSGYHNSKLSSQCTHRYKIYAEGFAWSVSLKYILSCGSMALVIDPQYEDFFS 129
Query: 350 RGLIPTKN----HFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD 405
RGL P N H + + +C SI+ V+WG A+P+EAE +G+ GQ ME L MD VYD
Sbjct: 130 RGLRPEVNFWPVHIDVAAGGMCESIRDAVEWGEAHPAEAEAVGRRGQRLMEELDMDAVYD 189
Query: 406 YMLHLITEYSKLLDYKPA 423
YMLHL+TEY++L+ ++PA
Sbjct: 190 YMLHLLTEYARLMRFRPA 207
>gi|297745250|emb|CBI40330.3| unnamed protein product [Vitis vinifera]
Length = 312
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 123/265 (46%), Positives = 178/265 (67%), Gaps = 5/265 (1%)
Query: 203 PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWK 262
P PLFRYC +D DI FPDWSFWGW+E N++PW KDIK G++ WK+++P AYW+
Sbjct: 11 PPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKWKDRVPLAYWR 70
Query: 263 GNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEG 320
GNP V +P R +L+KCN D W + Q+W +++K G+++S L +QC H YKIY EG
Sbjct: 71 GNPHV-APTRGDLLKCNVSDKADWNTRLYLQDWDQQSKIGYRQSNLEDQCTHIYKIYIEG 129
Query: 321 YAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNAN 380
+AWSVS KYIL+C+S+ L+ +Y DFF RGL+P ++++PI + CR ++ V+WGN +
Sbjct: 130 WAWSVSEKYILACDSMTLLTRPRYYDFFIRGLVPLQHYWPIRDNNKCRDLRFAVEWGNNH 189
Query: 381 PSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCL 439
+A+ +G+ F+ E L MD VYDYM HL+ EY+KLL +KP P A E C E++ C
Sbjct: 190 TDKAQTMGETTSKFIQEDLKMDYVYDYMFHLLNEYAKLLKFKPTIPPGAVEVCAETMACP 249
Query: 440 ADPKQRQNLEKAAAS-PSPYPPCTL 463
A+ R+ +E++ P+ PC+L
Sbjct: 250 AEGAWRKFMEESLEKIPTDTTPCSL 274
>gi|297745253|emb|CBI40333.3| unnamed protein product [Vitis vinifera]
Length = 325
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 129/288 (44%), Positives = 182/288 (63%), Gaps = 10/288 (3%)
Query: 185 MFDCMDKPVIDKKEHGSFPL-----PLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEE 239
MF+C D+P I ++G PLF YC +D DI FPDWSFWGW E+N++PWN
Sbjct: 1 MFECGDRPRIKATDYGGRKGKKKVPPLFHYCASDDTLDIVFPDWSFWGWPEINIKPWNSL 60
Query: 240 FKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC--NDSKLWGAEILRQNWAEEA 297
K+++ G+ W ++ P+AYWKGN S R L KC +++ W A + +W E+
Sbjct: 61 RKELEEGNNRTKWMDREPYAYWKGNIRT-SGNRQALFKCRPSNNHDWNARVYDMDWGRES 119
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKN 357
+GFK S L++QC H+YKIY EG AWSVS KYIL+C+S++L+ +Y DFF+R L PT +
Sbjct: 120 HEGFKDSNLASQCTHKYKIYMEGIAWSVSEKYILACDSMSLVPRSRYYDFFTRSLQPTIH 179
Query: 358 HFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSK 416
++PI D+CRS+K VDWGN +P +A+KIGKA +F +E L M+ VYDYM HL++EY+K
Sbjct: 180 YWPIQQNDICRSVKYAVDWGNKHPQKAQKIGKAASNFVLEELKMNYVYDYMFHLLSEYAK 239
Query: 417 LLDYKPAPPSSAFEACVESLLCL-ADPKQRQNLEKAAASPSPYPPCTL 463
L YKP P A E E++ D ++ E + P+ PCT+
Sbjct: 240 LFKYKPTVPPGAIEIVPETMANTGGDLEKIYKNESSVKGPATTSPCTM 287
>gi|147802992|emb|CAN70737.1| hypothetical protein VITISV_008287 [Vitis vinifera]
Length = 694
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 130/300 (43%), Positives = 183/300 (61%), Gaps = 35/300 (11%)
Query: 171 LLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWG 227
LLR YPG +PD+++MFDC D+PV+ D + P PLFRYC +D DI FPDWSFWG
Sbjct: 385 LLRWYPGRLPDLELMFDCDDRPVVRMRDFRGPNXAPPPLFRYCGDDWSLDIVFPDWSFWG 444
Query: 228 WSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWG 285
W +P AYW+GNP V +P R +L+KCN D W
Sbjct: 445 W---------------------------VPLAYWRGNPHV-APTRGDLLKCNVSDKAEWN 476
Query: 286 AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYK 345
+ Q+W +++K G+++S L +QC HRYKIY EG+AWSVS KYIL+C+S+ L+I +Y
Sbjct: 477 TRLYLQDWGQQSKIGYRQSNLEDQCTHRYKIYIEGWAWSVSEKYILACDSMTLLIRPRYH 536
Query: 346 DFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVY 404
DFF RGL+P ++++PI + CR ++ V+WGN + +A+ +G+ F+ E L MD VY
Sbjct: 537 DFFIRGLVPLQHYWPIRDNNKCRDLRFAVEWGNNHTDKAQTMGETTSKFIQEDLKMDYVY 596
Query: 405 DYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
DYM HL+ EY+KLL +KP P A E C E++ C A+ R+ + E P+ PC+L
Sbjct: 597 DYMFHLLNEYAKLLKFKPTIPPGAVEVCAETMACPAEGAWRKFMXESLEKBPTDTTPCSL 656
>gi|125583038|gb|EAZ23969.1| hypothetical protein OsJ_07696 [Oryza sativa Japonica Group]
Length = 279
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 125/261 (47%), Positives = 175/261 (67%), Gaps = 19/261 (7%)
Query: 205 PLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGN 264
PLFRYC +D D+ FPDWSFWGW E+N++PW+ KD+ G++ W ++ P+AYWKGN
Sbjct: 3 PLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKGN 62
Query: 265 PDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWS 324
PDV + R EL +W +E+K G+K+S L++QC HRYKIY EG AWS
Sbjct: 63 PDVATK-RKEL----------------DWIKESKAGYKQSDLASQCTHRYKIYIEGSAWS 105
Query: 325 VSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEA 384
VS KYIL+CNS+ L+++ +Y DFFSR L+PT++++P+ + C SIK VDWGN+N +A
Sbjct: 106 VSEKYILACNSMTLVVTPKYYDFFSRVLMPTQHYWPVRDDNKCSSIKHAVDWGNSNKKKA 165
Query: 385 EKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPK 443
+KIGK +F+ + L+MD +YDYM HL+TEY+KLL +KP P A E C E L C A +
Sbjct: 166 QKIGKQASNFIQQELSMDYIYDYMFHLLTEYAKLLRFKPTKPPEAIEICPELLACQAIGR 225
Query: 444 QRQNLEKA-AASPSPYPPCTL 463
+R+ +E + S + PC L
Sbjct: 226 ERKFMEDSMVKSANDAGPCDL 246
>gi|67633610|gb|AAY78729.1| hypothetical protein At2g45840 [Arabidopsis thaliana]
Length = 337
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 125/300 (41%), Positives = 185/300 (61%), Gaps = 15/300 (5%)
Query: 175 YPGMVPDVDIMFDCMDKPVIDKKEHGSFPL--------PLFRYCTNDAHFDIPFPDWSFW 226
YPG VPD++++F C D P I ++++ P PLF YC + FDI FPDWSFW
Sbjct: 2 YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61
Query: 227 GWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
GW E+N++ WN++ + I G + W+E+ P+AYWKGNP V + +R +LM C+D +
Sbjct: 62 GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGV-AMVRRDLMHCHDPMV--- 117
Query: 287 EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ RQ+W+ E + G++ S L +QC HRYKIY EG AWSVS KYIL+C+S+ L++ Y D
Sbjct: 118 HLYRQDWSREGRIGYRTSNLEDQCTHRYKIYVEGRAWSVSEKYILACDSMTLLVKPFYFD 177
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYD 405
FF+R L+P ++++PI + C I V WGN N +A IG+ G ++ ++L M VYD
Sbjct: 178 FFTRSLVPMEHYWPIRPQEKCSDIVFAVHWGNNNTKKARAIGRNGSGYVRKNLKMKYVYD 237
Query: 406 YMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKA-AASPSPYPPCTL 463
YMLHL+ Y KL+ P A E C E++ C + + RQ+++ + SPS C +
Sbjct: 238 YMLHLLQSYGKLMKMNVEVPQGAKEVCPETMACPINGGRMRQSMDDSLVMSPSVKATCEM 297
>gi|30267795|gb|AAP21678.1| hypothetical protein [Arabidopsis thaliana]
Length = 337
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 125/300 (41%), Positives = 185/300 (61%), Gaps = 15/300 (5%)
Query: 175 YPGMVPDVDIMFDCMDKPVIDKKEHGSFPL--------PLFRYCTNDAHFDIPFPDWSFW 226
YPG VPD++++F C D P I ++++ P PLF YC + FDI FPDWSFW
Sbjct: 2 YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61
Query: 227 GWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
GW E+N++ WN++ + I G + W+E+ P+AYWKGNP V + +R +LM C+D +
Sbjct: 62 GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGV-AMVRRDLMHCHDPMV--- 117
Query: 287 EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ RQ+W+ E + G++ S L +QC HRYKIY EG AWSVS KYIL+C+S+ L++ Y D
Sbjct: 118 HLYRQDWSREGRIGYRTSNLEDQCTHRYKIYVEGRAWSVSEKYILACDSMTLLVKPFYFD 177
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYD 405
FF+R L+P ++++PI + C I V WGN N +A IG+ G ++ ++L M VYD
Sbjct: 178 FFTRSLVPMEHYWPIRPQEKCSDIVFAVHWGNNNTKKARAIGRNGSGYVRKNLKMKYVYD 237
Query: 406 YMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKA-AASPSPYPPCTL 463
YMLHL+ Y KL+ P A E C E++ C + + RQ+++ + SPS C +
Sbjct: 238 YMLHLLQSYGKLMKMNVEVPQGAKEVCPETMACPINGGRMRQSMDDSLVMSPSVKATCEM 297
>gi|6850895|emb|CAB71058.1| putative protein [Arabidopsis thaliana]
Length = 338
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 128/301 (42%), Positives = 190/301 (63%), Gaps = 13/301 (4%)
Query: 175 YPGMVPDVDIMFDCMDKPVIDKKEHGSF------PLPLFRYCTNDAHFDIPFPDWSFWGW 228
YPG +PD++++F C D+P I K++ P PLF YC + +DI FPDWSFWGW
Sbjct: 2 YPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGW 61
Query: 229 SEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG--A 286
E+N++ WN+ +K G++ W++++P+AYWKGNP V SP+R +LM+CN S +
Sbjct: 62 PELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHV-SPIRGDLMRCNFSDKYDPMV 120
Query: 287 EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ Q+W E + GF+ S L +QC HRYKIY EG AWSVS KYILSC+S+ L++ +Y D
Sbjct: 121 RLYVQDWRSEIEAGFRGSNLEDQCTHRYKIYIEGNAWSVSEKYILSCDSMTLLVKPEYYD 180
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYD 405
FF R ++P K+ +PI + C +K V+WGN N +A+ IG+ G ++ M++L M VYD
Sbjct: 181 FFFRSMVPMKHFWPIRQNNKCGDLKFAVEWGNNNTEKAQIIGRQGSEYMMKNLKMKYVYD 240
Query: 406 YMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQ-RQNLEKA-AASPSPYPPCT 462
YML+++ Y KL+ P +A E C E++ C + D + RQ ++ + SPS C
Sbjct: 241 YMLYVLQGYGKLMKLDVTVPENATEVCSETMACSITDGGRIRQCMDDSLVMSPSVKAACD 300
Query: 463 L 463
L
Sbjct: 301 L 301
>gi|125554122|gb|EAY99727.1| hypothetical protein OsI_21712 [Oryza sativa Indica Group]
Length = 308
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 120/263 (45%), Positives = 162/263 (61%), Gaps = 7/263 (2%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CPD+F+ IH DL PW + IT + +R A R+++V G+ YV+ Y Q+R FT
Sbjct: 40 SCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFT 99
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFP 221
WG QLLRRY G VPDVDIMF C D+ + D + P+FRYC + D+ FP
Sbjct: 100 QWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPADAPPVFRYCRDATTLDVVFP 159
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW EVN+ W + ++ S W E+ PFA+WKGNP V + +R ELMKCN
Sbjct: 160 DWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGV-ARIRGELMKCNPA 218
Query: 280 -DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
D K W A + Q+W +GF+ S + QC HRYKIY EG AWSVS KYI++C+S L
Sbjct: 219 SDGKDWNARLFSQDWNHAIHNGFRDSSIPKQCLHRYKIYIEGEAWSVSEKYIMACDSPVL 278
Query: 339 IISQQYKDFFSRGLIPTKNHFPI 361
++ Y+D SRGL+ ++++PI
Sbjct: 279 FVNTPYQDILSRGLVAGEHYWPI 301
>gi|302143882|emb|CBI22743.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 115/238 (48%), Positives = 159/238 (66%), Gaps = 5/238 (2%)
Query: 230 EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAE 287
E+N++PW KD+K G++ W E+ P+AYWKGNP V + R++L+KCN D + W A
Sbjct: 37 EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAV-AATRLDLLKCNVSDKQDWNAR 95
Query: 288 ILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDF 347
+ Q+W E+++G+K+S L++QC HRYKIY EG AWSVS KYIL+C+SV L++ Y DF
Sbjct: 96 VYTQDWIRESQEGYKQSDLASQCIHRYKIYIEGSAWSVSQKYILACDSVTLLVKPHYYDF 155
Query: 348 FSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDY 406
F+R L+P +++PI D CRSIK VDWGN + +A+ IGKA DF+ E L MD VYDY
Sbjct: 156 FTRSLMPVHHYWPIREDDKCRSIKFAVDWGNRHKQKAQSIGKAASDFIQEDLKMDNVYDY 215
Query: 407 MLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQRQNLEKAAASPSPYPPCTL 463
M HL+ EY+KLL +KP P A E C E + C A+ K++ +E P PCT+
Sbjct: 216 MFHLLNEYAKLLKFKPTVPEKAVELCSERMGCGAEGLKKKFMMESMVKYPMDASPCTM 273
>gi|359490428|ref|XP_003634087.1| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 288
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 115/238 (48%), Positives = 159/238 (66%), Gaps = 5/238 (2%)
Query: 230 EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAE 287
E+N++PW KD+K G++ W E+ P+AYWKGNP V + R++L+KCN D + W A
Sbjct: 16 EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAV-AATRLDLLKCNVSDKQDWNAR 74
Query: 288 ILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDF 347
+ Q+W E+++G+K+S L++QC HRYKIY EG AWSVS KYIL+C+SV L++ Y DF
Sbjct: 75 VYTQDWIRESQEGYKQSDLASQCIHRYKIYIEGSAWSVSQKYILACDSVTLLVKPHYYDF 134
Query: 348 FSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDY 406
F+R L+P +++PI D CRSIK VDWGN + +A+ IGKA DF+ E L MD VYDY
Sbjct: 135 FTRSLMPVHHYWPIREDDKCRSIKFAVDWGNRHKQKAQSIGKAASDFIQEDLKMDNVYDY 194
Query: 407 MLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADP-KQRQNLEKAAASPSPYPPCTL 463
M HL+ EY+KLL +KP P A E C E + C A+ K++ +E P PCT+
Sbjct: 195 MFHLLNEYAKLLKFKPTVPEKAVELCSERMGCGAEGLKKKFMMESMVKYPMDASPCTM 252
>gi|156766639|gb|ABU95040.1| fiber protein [Triticum aestivum]
Length = 176
Score = 237 bits (605), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 107/167 (64%), Positives = 133/167 (79%)
Query: 289 LRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF 348
+RQNW EEAK G++ SKLS+QC HRYKIYAEG+AWSVSLKYILSC S+AL+I Y+DFF
Sbjct: 1 MRQNWEEEAKSGYQNSKLSSQCTHRYKIYAEGFAWSVSLKYILSCGSMALLIDPLYQDFF 60
Query: 349 SRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYML 408
SRGL P NH+P+ + +C SI+ V WGNA+P EAE++GK GQ M+ L MD VYDYML
Sbjct: 61 SRGLEPRVNHWPVSTVGMCESIRDAVKWGNAHPEEAERVGKRGQRLMQELGMDTVYDYML 120
Query: 409 HLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAASP 455
HL+TEY+ LLD++PAPP +A EAC S+LCLAD +QR+ LE +AA P
Sbjct: 121 HLLTEYAALLDFRPAPPHTAQEACAGSVLCLADDRQRRFLESSAAYP 167
>gi|124360768|gb|ABN08742.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
Length = 253
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 108/239 (45%), Positives = 157/239 (65%), Gaps = 6/239 (2%)
Query: 229 SEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGA 286
+E ++PW + KDI+ ++ WK+++P+A+WKGN LS R EL KCN D W A
Sbjct: 6 AETRIKPWEKVLKDIQESNKKIKWKDRIPYAFWKGN--FLSNPRHELRKCNVTDQHDWNA 63
Query: 287 EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ +W EE GFK +KL +QC HRYKIY EG +WSVS KYI++C+S+ L I +Y D
Sbjct: 64 RVYSVDWNEEIDQGFKNTKLEDQCAHRYKIYVEGISWSVSEKYIIACDSMTLFIKPRYYD 123
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYD 405
FF+R L+P K+++PI ++C+ IK VDWGN +P +A++IG+ G F+ E++ M VYD
Sbjct: 124 FFTRSLVPYKHYWPINKQNMCQDIKYAVDWGNTHPEKAQEIGREGTRFVEENVNMKLVYD 183
Query: 406 YMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
YMLHL+TEY+ L+ ++P P+ A E C E++ C D R+ + E SPS PPC +
Sbjct: 184 YMLHLLTEYANLIRFEPKIPAEAIEVCTENVACSMDGIWREFMVESMVKSPSDTPPCAM 242
>gi|357497413|ref|XP_003618995.1| KTEL motif-containing protein [Medicago truncatula]
gi|355494010|gb|AES75213.1| KTEL motif-containing protein [Medicago truncatula]
Length = 303
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 105/215 (48%), Positives = 148/215 (68%), Gaps = 6/215 (2%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
CP +F+ IH+DL+PW + IT + +AK+ A +++IV GK+YV+ Y +Q+R +F
Sbjct: 90 HVCPSYFRWIHEDLKPWREKGITRNMLKKAKKTAHFKLVIVDGKMYVEKYRKSIQTRDVF 149
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI--DKKEH-GSFPLPLFRYCTNDAHFDIPF 220
T+WG LQLLR YPG +PD+++MFDC DKPV+ DK + + P PLFRYC++ DI F
Sbjct: 150 TLWGILQLLRLYPGKLPDLELMFDCEDKPVVPLDKFQGPNASPPPLFRYCSDQWSLDIVF 209
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
PDWSFWGW+E N++PWN KDIK G++ WK+++P+AYWKGNP V +P R L++CN
Sbjct: 210 PDWSFWGWAETNIKPWNNILKDIKEGNKKTKWKDRVPYAYWKGNPYV-APTRQNLLQCNV 268
Query: 281 S--KLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
+ W I Q+W +E+ GF KS L +QC HR
Sbjct: 269 TLENDWNTLIYIQDWIQESNQGFNKSNLGDQCTHR 303
>gi|356561560|ref|XP_003549049.1| PREDICTED: uncharacterized protein LOC100818683 [Glycine max]
Length = 289
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 109/239 (45%), Positives = 155/239 (64%), Gaps = 5/239 (2%)
Query: 229 SEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGA 286
+E+N++PW+ +K G+ W + P+AYWKGNP V + R +L+KCN +++ W A
Sbjct: 14 AEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAV-AETRQDLIKCNVYENQDWNA 72
Query: 287 EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ Q+W E ++GF KS L +QC +RYK+Y EG AWSVS KYILSC+S L++ +Y D
Sbjct: 73 RLFAQDWFRELQEGFNKSDLPSQCTYRYKVYIEGSAWSVSQKYILSCDSTTLLVKPKYYD 132
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYD 405
FF+RGLIP +H+PI + CRSIK VDWGN + A +IGK DF+ E + MD VYD
Sbjct: 133 FFTRGLIPVHHHWPIKDDEKCRSIKFAVDWGNNHKQRAHQIGKVASDFIQEEVKMDYVYD 192
Query: 406 YMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
YM HL+ Y+KL YKP+ +A E CVES++C A+ ++ +E P+ PCT+
Sbjct: 193 YMFHLLNSYAKLFRYKPSISVNATELCVESMVCGAEGSVKKFMMESLVKVPANTDPCTM 251
>gi|297603834|ref|NP_001054658.2| Os05g0149500 [Oryza sativa Japonica Group]
gi|52353465|gb|AAU44032.1| unknown protein [Oryza sativa Japonica Group]
gi|218196097|gb|EEC78524.1| hypothetical protein OsI_18468 [Oryza sativa Indica Group]
gi|255676022|dbj|BAF16572.2| Os05g0149500 [Oryza sativa Japonica Group]
Length = 195
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 106/189 (56%), Positives = 134/189 (70%), Gaps = 8/189 (4%)
Query: 283 LWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
+W AEI+RQNW EE K G+ SKLS+QC HRYKIYAEG+AWSVSLKYILSC S+AL+I
Sbjct: 1 MWHAEIMRQNWDEEVKSGYHNSKLSSQCTHRYKIYAEGFAWSVSLKYILSCGSMALVIDP 60
Query: 343 QYKDFFSRGLIPTKN----HFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
QY+DFFSRGL P N H + + +C SI+ V+WG A+P+EAE +G+ GQ ME L
Sbjct: 61 QYEDFFSRGLRPEVNFWPVHIDVAAGGMCESIRDAVEWGEAHPAEAEAVGRRGQRLMEEL 120
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAPPSSAF----EACVESLLCLADPKQRQNLEKAAAS 454
MD VYDYMLHL+TEY++L+ ++PA E C S+LCLA KQR+ LE +AAS
Sbjct: 121 DMDAVYDYMLHLLTEYARLMRFRPAEAPPPRPPAQEVCEASVLCLAGEKQRRFLEASAAS 180
Query: 455 PSPYPPCTL 463
P+ PC +
Sbjct: 181 PAVSEPCVM 189
>gi|356577934|ref|XP_003557076.1| PREDICTED: LOW QUALITY PROTEIN: protein O-glucosyltransferase
1-like, partial [Glycine max]
Length = 268
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 107/240 (44%), Positives = 154/240 (64%), Gaps = 6/240 (2%)
Query: 229 SEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGA 286
SE+N++PW+ +K G+ W + P+AYWKGNP V + R +L+KCN +++ W A
Sbjct: 1 SEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAV-AETRQDLIKCNVSENQDWNA 59
Query: 287 EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ Q+W E+++GF KS L +QC +RYK+Y EG AWS S KYILSC+S L++ +Y D
Sbjct: 60 RLFAQDWFRESQEGFNKSDLPSQCTYRYKVYIEGSAWSXSQKYILSCDSTTLLVKPKYYD 119
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD--FMESLTMDRVY 404
FF+RGLIP +++PI D CRSIK VDWGN + A++IGK D E + MD VY
Sbjct: 120 FFTRGLIPVHHYWPIKDDDKCRSIKFAVDWGNNHKQRAQQIGKVASDQFIQEEVKMDYVY 179
Query: 405 DYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
DYM HL+ Y+KL YKP+ ++A E CVES++C A+ ++ +E P+ CT+
Sbjct: 180 DYMFHLLNSYAKLFRYKPSISANATELCVESMVCGAEGSVKKFMMESLVKVPANTDLCTM 239
>gi|186511278|ref|NP_001118874.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646662|gb|AEE80183.1| uncharacterized protein [Arabidopsis thaliana]
Length = 378
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 101/239 (42%), Positives = 155/239 (64%), Gaps = 11/239 (4%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCPD+FK IH+DL+ W K+ IT + A+ A RI+I G+LYV Y Q+R +F
Sbjct: 129 ETCPDYFKWIHRDLKVWQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVF 188
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF------PLPLFRYCTNDAHFD 217
TIWG LQLLR YPG +PD++++F C D+P I K++ P PLF YC + +D
Sbjct: 189 TIWGILQLLRMYPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYD 248
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I FPDWSFWGW E+N++ WN+ +K G++ W++++P+AYWKGNP V SP+R +LM+
Sbjct: 249 IVFPDWSFWGWPELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHV-SPIRGDLMR 307
Query: 278 CNDSKLWG--AEILRQNWAEEAKDGFKKSKLSNQCNHRY--KIYAEGYAWSVSLKYILS 332
CN S + + Q+W E + GF+ S L +QC HRY +I++ + + ++++++ +
Sbjct: 308 CNFSDKYDPMVRLYVQDWRSEIEAGFRGSNLEDQCTHRYMCRIHSLDHVYLINIRFVFN 366
>gi|388505300|gb|AFK40716.1| unknown [Medicago truncatula]
Length = 285
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 88/175 (50%), Positives = 123/175 (70%), Gaps = 9/175 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q TCP FF+ IH+DL+PW + IT + AKR A +++IV GK+YV+ Y +Q+R
Sbjct: 109 QNSHTCPSFFRWIHEDLKPWKEKGITREMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTR 168
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF------PLPLFRYCTNDA 214
+FT+WG LQLLR +PG +PD+++MFDC D+PVI K G+F P PLFRYC++
Sbjct: 169 DVFTLWGILQLLRMFPGKLPDLELMFDCEDRPVIHK---GNFQGPNASPPPLFRYCSDQW 225
Query: 215 HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI FPDWSFWGW+E N++PW K+IK G++ WK+++P+AYWKGNP+V +
Sbjct: 226 SLDIVFPDWSFWGWAETNIKPWKNILKEIKEGNKETKWKDRVPYAYWKGNPNVAA 280
>gi|294460307|gb|ADE75735.1| unknown [Picea sitchensis]
Length = 194
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 126/188 (67%), Gaps = 6/188 (3%)
Query: 130 IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
+ A+R A R++I++G++Y+ Y Q+R +FTIWG +QL+ Y M+PD+D+MFDC+
Sbjct: 2 VERARRTATFRLVILKGRMYIRTYTKSFQTRDVFTIWGLIQLMENYGWMLPDLDLMFDCV 61
Query: 190 DKPVIDKKEHGSFPLP----LFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH 245
D PVI K + + LP LFRYC +D DI FPDWSFWGW+EVN +PW+ DI
Sbjct: 62 DWPVIKAKAYANASLPPPPPLFRYCGDDKSLDIAFPDWSFWGWAEVNTRPWDGLLNDILK 121
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSK 305
G++ W+++ P A+WKGNP V + +R +LMKCN S A + Q+W +E+ G+K SK
Sbjct: 122 GAKKLKWEDRDPTAFWKGNPYV-AAVREDLMKCNLSDR-NARLYNQDWIKESGQGYKHSK 179
Query: 306 LSNQCNHR 313
L +QC+HR
Sbjct: 180 LPDQCHHR 187
>gi|147798877|emb|CAN65871.1| hypothetical protein VITISV_012499 [Vitis vinifera]
Length = 424
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 130/202 (64%), Gaps = 9/202 (4%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCP++F+ I++DL+PW ++ IT + AK A +R+++V GK+Y + Y Q+R +FT
Sbjct: 114 TCPEYFRWIYEDLKPWTETGITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFT 173
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH----GSFPLPLFRYCTNDAHFDIPF 220
IWG LQ+LR YPG +PD D+MF+C DKPVI K ++ + P PLF YC +D DI F
Sbjct: 174 IWGILQVLRMYPGKLPDFDLMFECGDKPVIKKHDYQGPNATAPPPLFHYCGDDETLDIVF 233
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN- 279
PDWSFWGW E+ ++PW+ KD++ G+ W ++ P+AYWKGN + R EL KC+
Sbjct: 234 PDWSFWGWPEIRIRPWSTLRKDLREGNNKTKWVBREPYAYWKGNFK-MGVTRHELSKCSK 292
Query: 280 -DSKLWGAEILRQNWAEEAKDG 300
+ + W A I N A E ++
Sbjct: 293 TNEQDWNARIY--NMAHEIRNA 312
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 49/91 (53%), Gaps = 2/91 (2%)
Query: 375 DWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACV 433
DW + A +I AG F+ E L M VYDYM HL++ Y+KL YKP P A E C
Sbjct: 297 DWNARIYNMAHEIRNAGSSFVQEELRMKFVYDYMFHLLSAYAKLFKYKPTVPPGAVEVCP 356
Query: 434 ESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
E+++C Q++ ++ SPS PC +
Sbjct: 357 ETMVCPVKGLQKKYKIQSMVKSPSDTGPCXM 387
>gi|224134845|ref|XP_002321919.1| predicted protein [Populus trichocarpa]
gi|222868915|gb|EEF06046.1| predicted protein [Populus trichocarpa]
Length = 233
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 83/174 (47%), Positives = 121/174 (69%), Gaps = 2/174 (1%)
Query: 292 NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG 351
+W +E++ G+++S L+NQC H+YKIY EG AWSVS KYIL+C+SV L++ Y DFF+R
Sbjct: 22 DWIKESQQGYQQSNLANQCVHKYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRS 81
Query: 352 LIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHL 410
L+P ++++PI D CRSIK V+WGN + EA+ +GKA +F+ E L MD VYDYM HL
Sbjct: 82 LVPNRHYWPIKEDDKCRSIKFAVEWGNNHSEEAQAMGKAASEFIQEDLKMDYVYDYMFHL 141
Query: 411 ITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
+ EY+KLL +KP P A E C E++ C A+ +++ ++ SP+ PCT+
Sbjct: 142 LNEYAKLLTFKPTIPGRAIELCAEAMACPANGLEKKFMMDSMVMSPADTSPCTM 195
>gi|326491107|dbj|BAK05653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 132/210 (62%), Gaps = 14/210 (6%)
Query: 133 AKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKP 192
A+ A R++++RG+ Y++ Q+R +FTIWG LQLLRRYPG VPD+D+MFDC+D P
Sbjct: 5 ARATANFRLVVLRGRAYIERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWP 64
Query: 193 VIDKKEH----GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQ 248
V+ E+ + PLFRYC ++ D+ FPDWSFWGW+E+N++PW+ KD+ G++
Sbjct: 65 VVHADEYQGNNATAMPPLFRYCGDNETLDVVFPDWSFWGWAEINIKPWDALQKDLSVGNK 124
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS--KLWGAEILRQNWAEEAKDGFKKSKL 306
W ++ P+AYWKGNP+V + +R EL+KCN S + W A I +Q DG K+++
Sbjct: 125 RVRWIDREPYAYWKGNPEVAA-IRQELVKCNVSSKQEWNARIYKQ-------DGSKRARQ 176
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
+ H+ K G S ++ LS + +
Sbjct: 177 DTRNPHKKKAQQIGKKASKFIQQELSMDHI 206
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSS 427
S ++ D N + +A++IGK F++ L+MD +YDYM HL+TEY+KLL +KP P
Sbjct: 171 SKRARQDTRNPHKKKAQQIGKKASKFIQQELSMDHIYDYMFHLLTEYAKLLRFKPTKPPE 230
Query: 428 AFEACVESLLCLADPKQRQNLEKA-AASPSPYPPCTL 463
A E C ESL C A ++++ ++ + S + PC L
Sbjct: 231 AVEVCPESLACQAIGREKKFMQDSMVKSANVAGPCNL 267
>gi|218197602|gb|EEC80029.1| hypothetical protein OsI_21713 [Oryza sativa Indica Group]
Length = 515
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/207 (42%), Positives = 120/207 (57%), Gaps = 9/207 (4%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CPD+F+ IH DL PW + IT + +R A R+++V G+ YV+ Y Q+R FT
Sbjct: 149 SCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFT 208
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFP 221
WG QLLRRY G VPDVDIMF C D+ + D + P+FRYC + D+ FP
Sbjct: 209 QWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPADAPPVFRYCRDATTLDVVFP 268
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN-- 279
DWSFWGW EVN+ W + ++ S W E+ PFA+WKGNP V + +R ELMKCN
Sbjct: 269 DWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGV-ARIRGELMKCNPA 327
Query: 280 -DSKLWGAEILRQNWAE--EAKDGFKK 303
D K W A + Q + + +D F +
Sbjct: 328 SDGKDWNARLFSQTYRRSYQTRDAFTQ 354
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 99/173 (57%), Gaps = 7/173 (4%)
Query: 147 KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI---DKKEHGSFP 203
+L+ Y Q+R FT WG QLLRRY G VPDVDIMF C D+ + D +
Sbjct: 336 RLFSQTYRRSYQTRDAFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPADA 395
Query: 204 LPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKG 263
P+FRYC + D+ FPDWSFWGW EVN+ W + ++ S W E+ PFA+WKG
Sbjct: 396 PPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKG 455
Query: 264 NPDVLSPLRVELMKCN---DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
NP V + +R ELMKCN D K W A + Q+W +GFK S + QC HR
Sbjct: 456 NPGV-ARIRGELMKCNPASDGKDWNARLFSQDWNHAIHNGFKDSSIPKQCLHR 507
>gi|3386610|gb|AAC28540.1| hypothetical protein [Arabidopsis thaliana]
Length = 413
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 79/172 (45%), Positives = 111/172 (64%), Gaps = 8/172 (4%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCPD+F+ IHKDLE W ++ IT + A A R++I G++YV Y Q+R +F
Sbjct: 117 ETCPDYFRWIHKDLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVF 176
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL--------PLFRYCTNDAH 215
TIWG +QLLR YPG VPD++++F C D P I ++++ P PLF YC +
Sbjct: 177 TIWGIVQLLRMYPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGA 236
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDV 267
FDI FPDWSFWGW E+N++ WN++ + I G + W+E+ P+AYWKGNP V
Sbjct: 237 FDIVFPDWSFWGWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGV 288
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 3/90 (3%)
Query: 377 GNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVES 435
GN + A IG+ G ++ ++L M VYDYMLHL+ Y KL+ P A E C E+
Sbjct: 284 GNPGVAMARAIGRNGSGYVRKNLKMKYVYDYMLHLLQSYGKLMKMNVEVPQGAKEVCPET 343
Query: 436 LLC-LADPKQRQNLEKA-AASPSPYPPCTL 463
+ C + + RQ+++ + SPS C +
Sbjct: 344 MACPINGGRMRQSMDDSLVMSPSVKATCEM 373
>gi|449495884|ref|XP_004159974.1| PREDICTED: O-glucosyltransferase rumi-like, partial [Cucumis
sativus]
Length = 210
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 118/175 (67%), Gaps = 2/175 (1%)
Query: 291 QNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSR 350
Q+W +E++ G+K+SKL+NQC +RYKIY EGY WSVS KYIL+C+S+ L++ + DFFSR
Sbjct: 2 QDWIQESQQGYKQSKLANQCTYRYKIYIEGYGWSVSEKYILACDSMTLLVKPNFYDFFSR 61
Query: 351 GLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLH 409
L P +++P+ C+SIK V WGN++ +A+ IGK +F++ L M+ VYDYM H
Sbjct: 62 SLEPLHHYWPLSDDHKCKSIKFAVHWGNSHKQKAQDIGKTASNFIQQELRMENVYDYMFH 121
Query: 410 LITEYSKLLDYKPAPPSSAFEACVESLLCLAD-PKQRQNLEKAAASPSPYPPCTL 463
L+ Y+KLL ++P P+ A E C E++ C D P+++ E +PS PC++
Sbjct: 122 LLNHYAKLLRFQPEIPTGAMEVCSETMACPRDGPEKKFMKESMVKTPSLTIPCSM 176
>gi|388494308|gb|AFK35220.1| unknown [Lotus japonicus]
Length = 211
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 114/175 (65%), Gaps = 2/175 (1%)
Query: 291 QNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSR 350
Q+W +E++ GF +S L NQC +RYKIY EG AWSVS KYIL+C+SV L + ++ DFF R
Sbjct: 2 QDWIQESQQGFNQSNLENQCTYRYKIYIEGSAWSVSEKYILACDSVTLYVKPRFYDFFIR 61
Query: 351 GLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLH 409
L P ++++PI + C+SIK V WGN + + ++IG A +F+ E L MD VYDYM H
Sbjct: 62 SLQPLQHYWPISDTNKCKSIKHAVVWGNDHKQKVQEIGNAASNFIQEELKMDYVYDYMFH 121
Query: 410 LITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL-EKAAASPSPYPPCTL 463
L+ EY++LL ++P P A E C ES+ C + +++ + E PS PC+L
Sbjct: 122 LLNEYARLLKFEPKVPEGAVELCAESMACERNGLEKKFMAESMVQEPSTKAPCSL 176
>gi|413919038|gb|AFW58970.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 241
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 74/137 (54%), Positives = 93/137 (67%), Gaps = 8/137 (5%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA-LRILIVRGKLYVDPY--YDC 156
P++V CP +F+ IH+DL PW + +T + A+RFA LR+ +V G+LYV Y C
Sbjct: 105 PREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAVVAGRLYVARYGRRRC 164
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HGS----FPLPLFRYCT 211
Q+RA FT WG LQLLRRYPG VPD+D+MFDC D PV+ E HG+ P PLFRYC
Sbjct: 165 FQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHGAQPQPPPPPLFRYCG 224
Query: 212 NDAHFDIPFPDWSFWGW 228
++A DI FPDWSFWGW
Sbjct: 225 SEATLDIAFPDWSFWGW 241
>gi|227206228|dbj|BAH57169.1| AT1G63420 [Arabidopsis thaliana]
Length = 299
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 65/132 (49%), Positives = 89/132 (67%), Gaps = 7/132 (5%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
++CPD+FK IH+DL+PW ++ IT + K A R++I+ GK++V+ Y +Q+R F
Sbjct: 168 RSCPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAF 227
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHG-------SFPLPLFRYCTNDAHF 216
T+WG LQLLR+YPG +PDVD+MFDC D+PVI + + P PLFRYC +
Sbjct: 228 TLWGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTV 287
Query: 217 DIPFPDWSFWGW 228
DI FPDWSFWGW
Sbjct: 288 DIVFPDWSFWGW 299
>gi|223950283|gb|ACN29225.1| unknown [Zea mays]
Length = 199
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 93/140 (66%), Gaps = 8/140 (5%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA-LRILIVRGKLYVDPY--YDC 156
P++V CP +F+ IH+DL PW + +T + A+RFA LR+ +V G+LYV Y C
Sbjct: 60 PREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAVVAGRLYVARYGRRRC 119
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HGS----FPLPLFRYCT 211
Q+RA FT WG LQLLRRYPG VPD+D+MFDC D PV+ E HG+ P PLFRYC
Sbjct: 120 FQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHGAQPQPPPPPLFRYCG 179
Query: 212 NDAHFDIPFPDWSFWGWSEV 231
++A DI FPDWSFWG S
Sbjct: 180 SEATLDIAFPDWSFWGRSST 199
>gi|357497415|ref|XP_003618996.1| KDEL motif-containing protein [Medicago truncatula]
gi|355494011|gb|AES75214.1| KDEL motif-containing protein [Medicago truncatula]
Length = 202
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 98/156 (62%), Gaps = 4/156 (2%)
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIK 371
HRYKIY EG+AWSVS KYI++C+S+ L + Y DFF RG++P ++++PI + C S+K
Sbjct: 9 HRYKIYIEGWAWSVSEKYIMACDSMTLYVKSNYHDFFIRGMVPLQHYWPIRNNSKCTSLK 68
Query: 372 SVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
V+WGN + +A+ IG+AG F+ E L M VY+YM HL+ EY+KLL +KP P A E
Sbjct: 69 FAVEWGNNHTHKAQSIGEAGSKFIQEDLDMKNVYNYMFHLLNEYAKLLKFKPTIPRGAVE 128
Query: 431 ACVESLLC---LADPKQRQNLEKAAASPSPYPPCTL 463
C E L+ + K R E PS PCT+
Sbjct: 129 ICPEKLMACDVIGGNKMRFMEESMVKVPSDSNPCTI 164
>gi|413919039|gb|AFW58971.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
gi|413919040|gb|AFW58972.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 302
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/160 (49%), Positives = 100/160 (62%), Gaps = 9/160 (5%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA-LRILIVRGKLYVDPY--YDC 156
P++V CP +F+ IH+DL PW + +T + A+RFA LR+ +V G+LYV Y C
Sbjct: 103 PREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAVVAGRLYVARYGRRRC 162
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HGS----FPLPLFRYCT 211
Q+RA FT WG LQLLRRYPG VPD+D+MFDC D PV+ E HG+ P PLFRYC
Sbjct: 163 FQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHGAQPQPPPPPLFRYCG 222
Query: 212 NDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS 251
++A DI FPDWSFW S + L W + GS AK
Sbjct: 223 SEATLDIAFPDWSFWEGSLL-LWIWKAGAQHKAMGSAAKG 261
>gi|357497389|ref|XP_003618983.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
gi|355493998|gb|AES75201.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
Length = 226
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 61/128 (47%), Positives = 86/128 (67%), Gaps = 4/128 (3%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
+CP++FK IH+DL+PW K+ IT + K + R+++V GK Y+D + Q+R +FT
Sbjct: 99 SCPNYFKWIHEDLKPWEKTGITREMVESGKNMSHFRLVVVNGKAYIDKFAKSYQTRDVFT 158
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE----HGSFPLPLFRYCTNDAHFDIPF 220
IWG LQLLR YPG +PD+++MF C DK V+DKK+ P P+F YC ++ DI F
Sbjct: 159 IWGILQLLRLYPGKIPDLELMFQCGDKNVVDKKKFQGPENVSPPPIFHYCGDEDTLDIVF 218
Query: 221 PDWSFWGW 228
PDW+FWGW
Sbjct: 219 PDWTFWGW 226
>gi|124360767|gb|ABN08741.1| Protein of unknown function DUF821, CAP10-like [Medicago
truncatula]
Length = 185
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 94/151 (62%), Gaps = 5/151 (3%)
Query: 83 TCLSAMNPIPERRRVASPQK-VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRI 141
TC + +PI + P TCP+ F+ IH+DLEPW + IT + + + LRI
Sbjct: 35 TCPNDHSPIQAFHQDQDPSSSTSTCPEHFRWIHEDLEPWKSTGITREMVDSGENISQLRI 94
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH-- 199
+I +GK+YV+ Y D Q+RA FT+WG +QLLR YPG VPD++++F+ D+ V+DKK +
Sbjct: 95 VIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETGDRAVVDKKRYEG 154
Query: 200 --GSFPLPLFRYCTNDAHFDIPFPDWSFWGW 228
P P+F YC + DI FPDWS+WGW
Sbjct: 155 PEAVTPPPIFSYCGKNEALDIVFPDWSYWGW 185
>gi|147802993|emb|CAN70738.1| hypothetical protein VITISV_008288 [Vitis vinifera]
Length = 204
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 66/130 (50%), Positives = 93/130 (71%), Gaps = 1/130 (0%)
Query: 313 RYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKS 372
+YKIY EG+ WSVS KY+L+C+S+ L+I DFF+R ++P +++PI + CR +K
Sbjct: 72 KYKIYVEGWGWSVSEKYVLACDSMTLLIKPYPHDFFTRSMVPLLHYWPIRPRNKCRDLKF 131
Query: 373 VVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEA 431
V+WGN +P +A++IGKAG +F+ E L MD VYDYM HL+ EYSKLL +KPA A E
Sbjct: 132 AVEWGNTHPEKAQEIGKAGSNFIHEELKMDFVYDYMFHLLNEYSKLLKFKPAVLPGAVEL 191
Query: 432 CVESLLCLAD 441
C+E++ C AD
Sbjct: 192 CLETMDCSAD 201
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 32/57 (56%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPY 153
V + CP++F+ IH+DL PW + I+ + A A R++IV GK YV+ Y
Sbjct: 14 VTGKLSXEACPEYFRWIHEDLRPWKSTGISRFAVESAXGDADFRLVIVNGKAYVEQY 70
>gi|13937171|gb|AAK50079.1|AF372939_1 At1g63420/F2K11_19 [Arabidopsis thaliana]
gi|21700873|gb|AAM70560.1| At1g63420/F2K11_19 [Arabidopsis thaliana]
Length = 228
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 119/230 (51%), Gaps = 46/230 (20%)
Query: 243 IKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL--WGAEILRQNWAEEAKDG 300
++ G + K + E+ +AYWKGNP V SP R +L+ CN S L W A I Q+W E + G
Sbjct: 1 MEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHDWNARIFIQDWISEGQRG 60
Query: 301 FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFP 360
F+ S ++NQC +R L P ++++P
Sbjct: 61 FENSNVANQCTYR-------------------------------------TLQPLQHYWP 83
Query: 361 IPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLD 419
I D CRSIK VDW N + +A++IG+ +FM+ L+M+ VYDYM HL+ EYSKLL
Sbjct: 84 IRDKDKCRSIKFAVDWLNNHTQKAQEIGREASEFMQRDLSMENVYDYMFHLLNEYSKLLK 143
Query: 420 YKPAPPSSAFEACVESLLCLADPKQRQNLEKA------AASPSPYPPCTL 463
YKP P ++ E C E+L+C ++ + ++K + P PC+L
Sbjct: 144 YKPQVPKNSVELCTEALVCPSEGEDVNGVDKKFMIGSLVSRPHASGPCSL 193
>gi|297817442|ref|XP_002876604.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
lyrata]
gi|297322442|gb|EFH52863.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
lyrata]
Length = 229
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/131 (45%), Positives = 82/131 (62%), Gaps = 7/131 (5%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TCPD+F+ I +DL+ W ++ IT + AK A R++I G+LYV Y +SR +
Sbjct: 94 ETCPDYFRWIQQDLKAWEETGITRETLERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVL 153
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-------FPLPLFRYCTNDAHF 216
TIWG LQLLR YPG VPD++++F C DKP I K++ P PLF+YC + +
Sbjct: 154 TIWGILQLLRMYPGQVPDLELLFFCHDKPAIWKRDFRQPQPNATWPPPPLFQYCGHREAY 213
Query: 217 DIPFPDWSFWG 227
I FPDWSFWG
Sbjct: 214 GIVFPDWSFWG 224
>gi|388513497|gb|AFK44810.1| unknown [Medicago truncatula]
Length = 162
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 83/130 (63%), Gaps = 2/130 (1%)
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF- 394
+ + I +Y DFF+R LIP ++++PI + ++C IK VDWGNA+ +A+KIG+ G ++
Sbjct: 1 MTMFIEPKYYDFFTRSLIPLQHYWPISAKNMCEEIKFAVDWGNAHLDKAQKIGEGGTNYI 60
Query: 395 MESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC-LADPKQRQNLEKAAA 453
+++L M VYDYM HL+ Y KLL +KP P A E C ES+ C L ++ +E
Sbjct: 61 IDNLKMKFVYDYMFHLLNSYVKLLRFKPKIPEGAVEVCSESMACSLRGARKHFMVEGMVI 120
Query: 454 SPSPYPPCTL 463
SPS PPCT+
Sbjct: 121 SPSDTPPCTM 130
>gi|384245503|gb|EIE18997.1| hypothetical protein COCSUDRAFT_54837 [Coccomyxa subellipsoidea
C-169]
Length = 520
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 165/377 (43%), Gaps = 73/377 (19%)
Query: 113 IHKDLEPWAKSRITMRHIME-AKRF-----AALRILIVRGKLYVDPYYD----------- 155
I +D W K+ I M + E A R+ R I+ G L+VD +
Sbjct: 89 IKQDFAQWEKTGIKMSAVTEMALRYRECFGEVFRFQIINGTLWVDHISERHSGWYPSRMG 148
Query: 156 ----CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLP-LFRYC 210
+ + + I + LR +PG +PD+D + D P + +++ G+ P P +F Y
Sbjct: 149 AGSLSAKGKIPYAILALMDTLRHHPGQIPDIDAVIQTSDFPCMLRQQPGNTPPPPVFGYN 208
Query: 211 TNDAHFDIPFPDWSFWGWS--------EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWK 262
++ DIPFPD+++WG + L W ++FK + + K + P W+
Sbjct: 209 SHARFVDIPFPDYTYWGHEYHRLVDEDGLLLFGWEKQFKLLSEKWREKEIASRKPQVIWR 268
Query: 263 GNPD-----VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKD-GFKK--SKLSNQCNHRY 314
G + LR + +C D E+ R+ + EEA+ +K +L + N+RY
Sbjct: 269 GRTEDKEYPKRDELRRQFARCGD------ELRREGFEEEAELFSLRKPEVQLHDLGNYRY 322
Query: 315 KIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLC------- 367
+Y E AW +LK L+C SV + ++ +FF+R L P + + S DLC
Sbjct: 323 LMYIESDAWVTNLKQKLACGSVLMSNQMEFFEFFTRALQPGVHFVEVDSKDLCHDATLKV 382
Query: 368 RSIKSVVDWGNANPSEAEK---------------------IGKAGQDFM-ESLTMDRVYD 405
+ + + ++ G+ S EK IG+AGQ+F+ + + M V
Sbjct: 383 QGMNAAIEKGSQEESMQEKDAESRRFLKETAQNYTGAPWEIGQAGQEFLAQHVQMKDVRL 442
Query: 406 YMLHLITEYSKLLDYKP 422
Y+ + +Y+ L + P
Sbjct: 443 YIRDALRKYASLQKFLP 459
>gi|297745251|emb|CBI40331.3| unnamed protein product [Vitis vinifera]
Length = 128
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 48/104 (46%), Positives = 74/104 (71%)
Query: 294 AEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI 353
+ + GFK S L++QC HRYKIY EG+ WSVS KY+L+C+S+ L+I + DFF+R ++
Sbjct: 13 GKRIQGGFKNSNLADQCTHRYKIYVEGWGWSVSEKYVLACDSMTLLIKPYHHDFFTRSMV 72
Query: 354 PTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMES 397
P +++PI + CR +K V+WGN +P +A++IGKAG +F+
Sbjct: 73 PLPHYWPIRPRNKCRDLKFAVEWGNTHPEKAQEIGKAGSNFIHQ 116
>gi|332019905|gb|EGI60366.1| CAP10 family protein [Acromyrmex echinatior]
Length = 407
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 81/325 (24%), Positives = 150/325 (46%), Gaps = 24/325 (7%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQ 170
I +DL P+ K I+ + ++EA + I++GKLY + DC MF G
Sbjct: 81 IVRDLRPFTKKGIS-KEMIEAAKTRGTFYQIIKGKLYREK--DC-----MFPARCAGIEH 132
Query: 171 LLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWS 229
L + G + D+D++ + D P E+ +P+F + ++DI +P W+FW G
Sbjct: 133 FLLKIIGNLSDMDLVINTRDYP--QSSEYFGNAIPVFSFSKTPQYYDIMYPAWAFWEGGP 190
Query: 230 EVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
++L P W++ K + S SW+EK +++G+ + L+ N L
Sbjct: 191 AISLYPRGLGRWDQHRKTLNKASLEISWEEKESKGFFRGSRTSSERDNLILLSRNKPHLV 250
Query: 285 GAEILRQNWAEEAKDGFKKS-----KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
A+ + + +D + L + C ++Y G A S K++ C S+
Sbjct: 251 DAQYTKNQAWKSNEDTLHATPASEVSLESHCTYKYLFNFRGVAASFRHKHLFLCRSLVFH 310
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+S ++ +F+ + P ++ P+P + +K ++++ N A+KI G+DF+ +L
Sbjct: 311 VSDEWMEFYYHAMKPWIHYIPVPKNADQQELKDLIEFARNNDDLAKKIAHRGRDFIWNNL 370
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPA 423
M + + L+ YSKLL Y P
Sbjct: 371 RMSDIIHFWKQLLKSYSKLLAYNPV 395
>gi|345497421|ref|XP_001601540.2| PREDICTED: O-glucosyltransferase rumi homolog [Nasonia vitripennis]
Length = 400
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 151/333 (45%), Gaps = 22/333 (6%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAM 162
+ C F I +DL+ +A+ I + ++EA R I+ GKLY + +C+ SR
Sbjct: 69 KKCKCFGDMITRDLKTFAERGIDQK-LIEAARPRGTFYQIIGGKLYREK--NCMFPSRCA 125
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
G L + G V DV ++ + D P H PLP+F + ++DI +P
Sbjct: 126 ----GIEHFLLKVIGNVSDVSLVVNTRDYP--QSSRHFGQPLPVFSFSKTPDYYDIMYPA 179
Query: 223 WSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
W FW G ++L P W+ K + + W++K A+++G+ + L+
Sbjct: 180 WVFWEGGPAISLYPRGLGRWDLHRKSLNEAREETPWEKKEEKAFFRGSRTSSERDNLVLL 239
Query: 277 KCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
+L A+ + + KD K+ L + C+++Y G A S K++
Sbjct: 240 SRAKPQLVDAQYTKNQAWKSEKDTLNMPPAKEVSLESHCSYKYLFNYRGVAASFRHKHLF 299
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
C S+ + ++ +F+ + P ++ P+P + ++ ++ + N + A++I G
Sbjct: 300 LCGSLVFHVGDEWTEFYYGAMKPWIHYIPVPKHASQQQLEDLIQFAKDNDAVAKRIADRG 359
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPA 423
++F+ E L M V Y LI YSKL YKP
Sbjct: 360 RNFIWEKLKMSDVTCYWKKLIRRYSKLFKYKPT 392
>gi|383858247|ref|XP_003704613.1| PREDICTED: O-glucosyltransferase rumi homolog [Megachile rotundata]
Length = 391
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 163/366 (44%), Gaps = 24/366 (6%)
Query: 69 RRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMR 128
+ + YK V T +M ER C F I +DL+P+ + I +
Sbjct: 35 KETNVYKKVPNTQYNKYYSMIEEAERNYQLCNNTNNNC--FKHIILRDLKPFKEKGIN-K 91
Query: 129 HIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGFLQLLRRYPGMVPDVDIMFD 187
+++A + IV+GK+Y DC+ SR G L + +PD+D++ +
Sbjct: 92 DLIDAAKARGTFYQIVKGKVYRQK--DCMFPSRCS----GIEHFLLKLAPKLPDMDLVIN 145
Query: 188 CMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFK 241
D P K G PLP+F + ++DI +P W+FW G ++L P W+E
Sbjct: 146 VRDYPQSSKYFGG--PLPVFSFSKTPQYYDITYPAWAFWEGGPAISLYPRGLGRWDEHRI 203
Query: 242 DIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILR-QNWAEEAKDG 300
+ S++ W++K A+++G+ + L+ L A+ + Q W +
Sbjct: 204 SLDKASKSLPWEKKESKAFFRGSRTSSERDNLILLSRKKPNLIDAQYTKNQAWKSDEDTL 263
Query: 301 FK----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
+ + L C ++Y G A S K++ C S+ + ++ +F+ +IP
Sbjct: 264 YAPPAPEVSLETHCKYKYLFNYRGVAASFRHKHLFLCRSLVFHVGDEWTEFYYEAMIPWI 323
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYS 415
++ P+P ++ ++ + N ++KI +G+DF+ ++L M + + L+ YS
Sbjct: 324 HYIPVPKDANQTVLEELIQFAMDNDESSKKIADSGRDFIWDNLKMSDITQFWKKLLERYS 383
Query: 416 KLLDYK 421
KLL YK
Sbjct: 384 KLLMYK 389
>gi|328709644|ref|XP_001945687.2| PREDICTED: o-glucosyltransferase rumi homolog [Acyrthosiphon pisum]
Length = 384
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 154/344 (44%), Gaps = 27/344 (7%)
Query: 93 ERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDP 152
++ + K TC FF +I KDLEP+ K IT I A R +I LY
Sbjct: 43 KKEYTSCESKNGTC--FFPNILKDLEPF-KDGITHEMITAAAD-KGTRYMIFNHDLYR-- 96
Query: 153 YYDCVQSRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
+++ MF G L + PDV+ + + D P I K H P P+F +
Sbjct: 97 -----ETKCMFPARCEGIEHFLSKIQLNTPDVEFILNTRDWPQIIK--HYGDPKPVFSFS 149
Query: 211 TNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGN 264
D + DI +P WSFW G + L P W+ K I S+ WK K+ +++G+
Sbjct: 150 KTDDYADIMYPAWSFWSGGPAIKLHPSGLGRWDSLRKSILKQSEQWPWKRKISKGFFRGS 209
Query: 265 PDVLSPLRVELMKCNDSKLWGAEILR-QNWAEEAKDGF----KKSKLSNQCNHRYKIYAE 319
+ L+ N+ +L A + Q W + F + L + C ++Y
Sbjct: 210 RTSEQRDSLILLSRNEPELVDAAYTKNQAWKSDKDTLFAPPADEISLEDHCQYKYLFNFR 269
Query: 320 GYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNA 379
G A S K++ C S+ + +++K+FF + + P ++ PI IK+++ +
Sbjct: 270 GVAASFRFKHLFLCKSLVFHVGEEWKEFFYQFMKPWYHYVPINPNASENDIKNILVFFKE 329
Query: 380 NPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
+ A++I + G F+ + L M V Y L+ EY+KLL YKP
Sbjct: 330 HDDLAKEISERGYRFIRTHLRMKDVSWYWETLLHEYAKLLKYKP 373
>gi|195572998|ref|XP_002104482.1| GD18425 [Drosophila simulans]
gi|194200409|gb|EDX13985.1| GD18425 [Drosophila simulans]
Length = 404
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 144/326 (44%), Gaps = 23/326 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
C + +I K+LEP+ IT + ++KR I+RG++Y C+ + +
Sbjct: 63 CICYVATIQKNLEPYVDKGITPEMMAQSKRLGTF-YQIIRGRIYRQQ--KCLHPKRCADV 119
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
LL VPD++ + + D P + P+ Y D H DI +P WSF
Sbjct: 120 ---EDLLLDMASGVPDLEFVLNVRDWPQVHFL--SGLSGPVLSYSITDRHLDIMYPAWSF 174
Query: 226 WGWSEVNLQP-------WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
W + LQ W+ K + + W K +++G+ SP R L++
Sbjct: 175 WTNTGPILQHYPHGVGRWDWMRKHLVARASEMPWNAKRAIGFFRGSRS--SPERDSLVRL 232
Query: 279 NDSKLWGAEILRQNWAEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
+ + +++ + A D +K L C ++Y G A S L++IL C S+
Sbjct: 233 SQRR---PDLVDAQYTLLATDADPVEKMPLVEHCQYKYLFNFRGVAASFRLRHILLCRSL 289
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
+ + Q+++FF L P ++ P+ S + ++ + + AE+I + GQ F+
Sbjct: 290 VIHVGDQWQEFFYSQLKPWVHYVPVDSDADVEELAELILYLREHDDLAEEIAERGQQFVW 349
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYK 421
L M+ V Y ++ EY++LL YK
Sbjct: 350 LHLRMEDVQCYWSKMLQEYAQLLTYK 375
>gi|324516300|gb|ADY46488.1| CAP10 family protein CPIJ013394 [Ascaris suum]
Length = 381
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 153/347 (44%), Gaps = 26/347 (7%)
Query: 88 MNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGK 147
+ I E R + C F I DL P+ ++ A +R ++ G+
Sbjct: 37 IREIEEAERNYRSCDINNCTCFQSQIENDLSPFQNG--IDESMLLAASAHGVRYQLIGGR 94
Query: 148 LYVDPYYDC-VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPL 206
L+ P C ++R G L +P+ + + + D P + + PLP+
Sbjct: 95 LFRQP--QCPFEARCE----GVEYFLVHLADSLPNTEFVLNVHDHPQMRSDDS---PLPV 145
Query: 207 FRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAY 260
F + + H DI +P WSFW G ++L P WNE I ++ W + P A+
Sbjct: 146 FSFSKDMNHIDILYPAWSFWSGGPAISLYPTGIGRWNETSVKITKAAKRIEWAHRKPIAF 205
Query: 261 WKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK---SKLS--NQCNHRYK 315
++G+ R+ L+ L A+ + KD + S+LS + C+++Y
Sbjct: 206 FRGSRTNTLRDRLILLSRRLPNLIDAKYTKNQAWRSVKDTLGEQPASELSFEDHCSYKYL 265
Query: 316 IYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVD 375
G A S L+++L C S + Q+ +FF L P HF + A+ +++ ++
Sbjct: 266 FNFAGVAASFRLRHLLLCGSPVFNVGHQWIEFFYGALYPWI-HF-VEVAEEMNNVEELLR 323
Query: 376 WGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYK 421
+ + + +IG G+ F+ES L M+ V Y L+T+YS+L+ Y+
Sbjct: 324 FAHEHDDIMHRIGTRGRAFVESHLRMEDVLCYWRQLLTQYSRLITYR 370
>gi|195331145|ref|XP_002032263.1| GM23616 [Drosophila sechellia]
gi|194121206|gb|EDW43249.1| GM23616 [Drosophila sechellia]
Length = 404
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 141/326 (43%), Gaps = 23/326 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
C + +I K+LEP+ IT + ++KR I+RG++Y C+ + +
Sbjct: 63 CLCYVATIQKNLEPYVDKGITPEMMAQSKRLGTF-YQIIRGRIYRQQ--KCLHPKRCADV 119
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
LL VPD++ + + D P + P+ Y D H DI +P WSF
Sbjct: 120 ---EDLLLDMASGVPDLEFVLNVRDWPQVHFL--SGLSGPVLSYSITDRHLDIMYPAWSF 174
Query: 226 WGWSEVNLQP-------WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
W + LQ W+ K + + W K ++ G+ SP R L++
Sbjct: 175 WTNTGPILQHYPHGVGRWDWMRKHLVARASETPWNAKRAIGFFTGSRS--SPERDNLVRL 232
Query: 279 NDSKLWGAEILRQNWAEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
+ + +++ + A D + L C ++Y G A S L++IL C S+
Sbjct: 233 SQRR---PDLVDAQYTMLATDADPVETMPLIEHCQYKYLFNFRGVAASFRLRHILLCRSL 289
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L + Q+++FF L P ++ P+ S + ++ + + AE+I + GQ F+
Sbjct: 290 VLHVGDQWQEFFYSQLKPWVHYVPVASDADVEELAELILYLREHDDLAEEIAERGQQFVW 349
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYK 421
L M+ V Y ++ EY+ LL YK
Sbjct: 350 LHLRMEDVQCYWSKMLQEYAMLLTYK 375
>gi|25009725|gb|AAN71037.1| AT07872p [Drosophila melanogaster]
Length = 397
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 142/326 (43%), Gaps = 23/326 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
C +I +DLEP+ IT + ++KR I+RG++Y C+ + +
Sbjct: 63 CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTF-YQIIRGRIYRQQ--KCLHPKRCADV 119
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
LL V D++ + + D P + P+F Y + H DI +P WSF
Sbjct: 120 ---EDLLLDMASGVADLEFVLNVRDWPQVHFL--SGLSGPVFSYSITNRHLDIMYPAWSF 174
Query: 226 WGWSEVNLQP-------WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
W + LQ W+ K + + W K +++G+ SP R L++
Sbjct: 175 WTTTGPILQHYPHGVGRWDWMRKHLVARASELPWSAKRAIGFFRGSRS--SPERDSLVRL 232
Query: 279 NDSKLWGAEILRQNWAEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
+ + +++ + A D +K L C +Y G A S L++IL C S+
Sbjct: 233 SQRR---PDLVDAQYTILATDADPVEKMPLVEHCQFKYLFNFRGVAASFRLRHILLCRSL 289
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L + Q+++FF L P ++ P+ S + ++ + + AE+I + GQ F+
Sbjct: 290 VLHVGDQWQEFFYSQLKPWVHYVPVASDADVDELAELILYLREHDDLAEEIAERGQQFIW 349
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYK 421
L M+ V Y ++ EY+KLL YK
Sbjct: 350 LHLRMEDVQCYWSKMLQEYAKLLTYK 375
>gi|24649142|ref|NP_732797.1| CG31139, isoform A [Drosophila melanogaster]
gi|23171999|gb|AAF56061.2| CG31139, isoform A [Drosophila melanogaster]
gi|262051015|gb|ACY07068.1| FI12101p [Drosophila melanogaster]
Length = 397
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 142/326 (43%), Gaps = 23/326 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
C +I +DLEP+ IT + ++KR I+RG++Y C+ + +
Sbjct: 63 CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTF-YQIIRGRIYRQQ--KCLHPKRCADV 119
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
LL V D++ + + D P + P+F Y + H DI +P WSF
Sbjct: 120 ---EDLLLDMASGVADLEFVLNVRDWPQVHFL--SGLSGPVFSYSITNRHLDIMYPAWSF 174
Query: 226 WGWSEVNLQP-------WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
W + LQ W+ K + + W K +++G+ SP R L++
Sbjct: 175 WTTTGPILQHYPHGVGRWDWMRKHLVARASELPWSAKRAIGFFRGSRS--SPERDSLVRL 232
Query: 279 NDSKLWGAEILRQNWAEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
+ + +++ + A D +K L C +Y G A S L++IL C S+
Sbjct: 233 SQRR---PDLVDAQYTILATDADPVEKMPLVEHCQFKYLFNFRGVAASFRLRHILLCRSL 289
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
L + Q+++FF L P ++ P+ S + ++ + + AE+I + GQ F+
Sbjct: 290 VLHVGDQWQEFFYSQLKPWVHYVPVASDADVDELAELILYLREHDDLAEEIAERGQQFIW 349
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYK 421
L M+ V Y ++ EY+KLL YK
Sbjct: 350 LHLRMEDVQCYWSKMLQEYAKLLTYK 375
>gi|148230341|ref|NP_001086860.1| MGC83543 protein precursor [Xenopus laevis]
gi|50416390|gb|AAH77568.1| MGC83543 protein [Xenopus laevis]
Length = 386
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 167/368 (45%), Gaps = 30/368 (8%)
Query: 85 LSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPW--AKSRITMRHIMEAKRFAALRIL 142
L +N I + SP F++ + +DLEP+ SR M++++ K
Sbjct: 27 LHYLNQIHTAVQRHSPPDSHNGTCFYRVLQEDLEPFRSGISRDLMQNVLSRK--LGTHYQ 84
Query: 143 IVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF 202
I+ +LY + +C+ S + FL L P + PD++++ + D P + +
Sbjct: 85 IINHRLYREE--ECMFSARCSGVEHFL--LELLPNL-PDMELVINVRDYPQVPSWMNPV- 138
Query: 203 PLPLFRYCTNDAHFDIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEK 255
+P+F + + DI +P W+FW G V L W+ +D+K + W++K
Sbjct: 139 -IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLKKAADLWPWEKK 197
Query: 256 LPFAYWKGNPDVLSPLR--VELMKCNDSKLWGAEILR-QNWAEE----AKDGFKKSKLSN 308
+P Y++G+ SP R + L+ L AE + Q W E + K+ L +
Sbjct: 198 IPKGYFRGSRT--SPDRDPLILLSRESPDLVDAEYTKNQAWKSERDTLGRPPAKEVPLVD 255
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
C ++Y G A S LK++ C S+ + + +FF L P ++ P+ S DL
Sbjct: 256 HCTYKYLFNFRGVAASFRLKHLFLCGSLVFHVGDNWLEFFYNCLEPWVHYVPV-SPDL-E 313
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPAPPSS 427
++ ++ + N N E +KI + G F+ L M+ V Y L+T+YS+LL Y+
Sbjct: 314 DLRELLQFVNENDEEVKKIAERGHKFIRQFLRMEDVSQYWGSLLTQYSQLLQYRVRKRKD 373
Query: 428 AFEACVES 435
E V S
Sbjct: 374 YREVTVRS 381
>gi|442759745|gb|JAA72031.1| Putative o-glucosyltransferase rumi [Ixodes ricinus]
Length = 410
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 154/333 (46%), Gaps = 28/333 (8%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TC I DL+ W++S IT + +++ + + I+ KLY DC+ F
Sbjct: 77 KTCSCHSAVIDNDLQLWSESGIT-KDLIQRSKSRGIHYQIINHKLYRGE--DCLFP---F 130
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
G L +PD++ + + D P K+ PLP+F + + DI +P W
Sbjct: 131 RCSGIEPFLLEIVDELPDLEFIVNTRDWPQAHKRYD---PLPVFSFSKTPDYADIMYPAW 187
Query: 224 SFW-GWSEVNLQPWNEEFKDIKHGSQAKS------WKEKLPFAYWKGN--PDVLSPLRVE 274
+FW G ++L P D++ G +K+ W +K +++G+ D PL
Sbjct: 188 TFWAGGPAISLYPTGIGRWDLQRGIISKTANQKWPWSKKRNVGFFRGSRTSDERDPLI-- 245
Query: 275 LMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
L+ L A+ + + A+D K+ + + C +Y G A S LK+
Sbjct: 246 LLSRKKPHLVDAQYTKNQAWKSAEDTLGLPAAKEVRFEDHCEFKYLFNFRGVAASFRLKH 305
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGK 389
+ C SV + ++ +FF GL P ++ P+ DL + ++ ++++ N ++ +
Sbjct: 306 LFLCKSVVFHVGHEWLEFFYPGLKPWVHYIPV-GTDLSQ-VEDLLEFARENDEVVXEMAE 363
Query: 390 AGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYK 421
G DF+ + LTM+ V Y L L+ +YS+LL YK
Sbjct: 364 RGYDFIWKHLTMEDVTCYWLQLLKKYSRLLKYK 396
>gi|307105335|gb|EFN53585.1| hypothetical protein CHLNCDRAFT_136791 [Chlorella variabilis]
Length = 592
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 153/375 (40%), Gaps = 43/375 (11%)
Query: 19 VISLSVISLAALFL-DYKVDDFASKTKTLAGHNLEPTPWHLFPQRTFKEESRRSQAYKIV 77
V+ ++I A +FL YK+ S +++ H P + F E S+RS AY
Sbjct: 26 VVPANIIGGALVFLFTYKLFT-TSPFESITVHEAPDDPVQRRYRGAFVEVSKRSHAYLQQ 84
Query: 78 HCTYLTCLSAMNPIPERRRVASPQKVQTC--PDFFKS-IHKDLEPWAKSRITMRHIMEAK 134
S + + R A V+ P F+ I +DL PW S IT + +
Sbjct: 85 ALWDRQPGSYTHNVSARYEPADEAAVRELAQPSFYLDLIRRDLLPWKDSGITHELVERSS 144
Query: 135 RF------AALRILIVRGKLYV----------------DPYYDCVQSRAMFTIWGFLQLL 172
LR ++ G L+V P + R + + ++ L
Sbjct: 145 MMFDDCDGDMLRFQVLNGSLWVHHITERLEGGWYPAPIGPGNAAAKGRVPYAVLALMETL 204
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVN 232
R +PG +PDVD + D P I + G+ P P+ + H DIPF D+++WG
Sbjct: 205 RMFPGQIPDVDAILHFADFPCIPRPRAGAPPAPILGLQGSAHHSDIPFSDYTYWGHEHQY 264
Query: 233 LQ--------PWNEEFKDIKHGSQAKSWKEKLPFAYWKG-NPDVLSP----LRVELMKCN 279
LQ W + + + + S +++P A W+G D P LR + C
Sbjct: 265 LQDPWGKPAHGWGNQAEVLARKYENVSLLDRIPQASWRGRTKDNRYPERDHLRRVFVGCV 324
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
D KL E R A L + C++RY +Y E A++ +LK + C SV +
Sbjct: 325 D-KL--KEAGRGEDAALLNVLSPPLALQDSCDYRYSVYIESQAYASNLKQKMVCGSVLVA 381
Query: 340 ISQQYKDFFSRGLIP 354
+Y DF++R + P
Sbjct: 382 PRMEYWDFYTRAMRP 396
>gi|307172256|gb|EFN63761.1| CAP10 family protein AGAP004267 [Camponotus floridanus]
Length = 406
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/363 (23%), Positives = 160/363 (44%), Gaps = 27/363 (7%)
Query: 75 KIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK 134
K ++ Y L+A+ + + + K +C F I +DL P+ K I+ I+ AK
Sbjct: 45 KAINEGYQKYLTAIQDAEQDYKECNNTK-HSC--FKDVIVRDLSPFKKKGISEEMIVAAK 101
Query: 135 RFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKP 192
I+ GKLY + DC MF G L + G + ++D++ + D P
Sbjct: 102 NRGTF-YQIIGGKLYREK--DC-----MFPARCAGIEHFLLKVIGNLSNMDLIINTRDYP 153
Query: 193 VIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHG 246
E+ +P+F + ++DI +P W+FW G ++L P W+ K +
Sbjct: 154 --QSSEYFGNAMPIFSFSKTSQYYDIMYPAWAFWEGGPAISLYPRGLDRWDRHRKLLNKA 211
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILR-QNWAEEAKDGF---- 301
S W++K +++G+ + L+ N L A+ + Q W E
Sbjct: 212 SSEIPWEKKDNKGFFRGSRTSSERDNLILLSRNKPHLVDAQYTKNQAWKSEEDTLHAPPA 271
Query: 302 KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPI 361
+ L + C ++Y G A S K++ C S+ + ++ +F+ + P ++ P+
Sbjct: 272 SEVSLESHCTYKYLFNFRGVAASFRHKHLFLCRSLVFHVGDEWAEFYYYAMKPWIHYIPV 331
Query: 362 PSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDY 420
+ +++++++ +N +KI G+DF+ +L M V + L+ YSKLL Y
Sbjct: 332 SKNADQKELENLIEFARSNDDIVKKIAYRGRDFIWNNLQMSDVIHFWKQLLKSYSKLLTY 391
Query: 421 KPA 423
KP
Sbjct: 392 KPV 394
>gi|344282265|ref|XP_003412894.1| PREDICTED: protein O-glucosyltransferase 1-like [Loxodonta
africana]
Length = 638
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 154/340 (45%), Gaps = 32/340 (9%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCVQ 158
P Q C + I +DL P+ + I+ + + E R I++ +LY + DC
Sbjct: 294 PCSSQDCSCYHGVIEQDLTPF-RGGISKKMMAEVVGRKLGTHYQIIKNRLYRE--NDC-- 348
Query: 159 SRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
MF G + G +PD++++ + D P + K + +P+F + +
Sbjct: 349 ---MFPSRCGGVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYH 403
Query: 217 DIPFPDWSFWGWSE-------VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW + L W+ +D+ + WK+K AY++G+ S
Sbjct: 404 DIMYPAWTFWEGGPAVWPIYPIGLGRWDLFREDLARSAAQWPWKKKNSTAYFRGS--RTS 461
Query: 270 PLR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKKS-----KLSNQCNHRYKIYAEGYA 322
P R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 462 PERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVA 521
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPS 382
S K++ C S+ + +++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 522 ASFRFKHLFLCGSLVFHVGEEWLEFFYPQLKPWVHYVPVKT-DL-SNVQELLQFVKANDD 579
Query: 383 EAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYK 421
A++I + G F + L MD V Y +L+TEYSK L Y
Sbjct: 580 VAQEIAERGNQFIINHLQMDDVTCYWENLLTEYSKFLSYN 619
>gi|194222810|ref|XP_001500762.2| PREDICTED: protein O-glucosyltransferase 1 [Equus caballus]
Length = 392
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 156/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ ++ I+ + + E KR I++ +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RAGISRKMMAEVVKRKLGTHYQIIKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGHLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLARSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKK-----SKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAQDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDI 334
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A +I + G F M L MD V Y +L+TEYSK L Y
Sbjct: 335 AREIAERGSQFIMNHLQMDDVTCYWENLLTEYSKFLSY 372
>gi|350416880|ref|XP_003491149.1| PREDICTED: O-glucosyltransferase rumi homolog [Bombus impatiens]
Length = 407
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 146/327 (44%), Gaps = 20/327 (6%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
I DL+P+ K I+ I AK + I++GKLY DC+ I FL L
Sbjct: 79 IINDLKPFKKKGISKDLINIAKTRGTV-YQIIQGKLYRQK--DCMFPSRCSGIEHFL--L 133
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+ PG+ D+D++ + D P K H PLP+F + ++DI +P W+FW G +
Sbjct: 134 KLAPGLT-DMDLVINVRDYPQSSK--HFGGPLPIFSFSKTPEYYDITYPAWAFWEGGPAI 190
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
+L P W+E + S+ W+EK A+++G+ + L+ L A
Sbjct: 191 SLYPRGLGRWDEHRVSLDKASKNTLWEEKENKAFFRGSRTSSERDNLILLSRKKPNLVDA 250
Query: 287 EILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + +D + L C ++Y G A S K++ C S+ +
Sbjct: 251 QYTKNQAWKSNEDTLYATPASEVSLEAHCKYKYLFNYRGVAASFRHKHLFLCRSLVFHVG 310
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++ +F+ +IP ++ P+ + ++ + N ++KI G+DF+ +L +
Sbjct: 311 DEWTEFYYNAMIPWIHYIPVSKDANQTVLGELIQFAIDNDETSKKIADRGRDFIWNNLKL 370
Query: 401 DRVYDYMLHLITEYSKLLDYKPAPPSS 427
V +L+ +YSKLL YK S
Sbjct: 371 SDVTQSWKNLLKKYSKLLTYKTTLDKS 397
>gi|395844808|ref|XP_003795143.1| PREDICTED: protein O-glucosyltransferase 1 [Otolemur garnettii]
Length = 392
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 155/338 (45%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R IV+ +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIAEDLTPF-RGGISRKMMAEVVRRKLGTHYQIVKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGHLPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE R + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTRNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F M L MD + Y L+TEYSK L Y
Sbjct: 335 AQEIAERGNQFIMNHLQMDDITCYWESLLTEYSKFLSY 372
>gi|241671163|ref|XP_002400011.1| conserved hypothetical protein [Ixodes scapularis]
gi|215506249|gb|EEC15743.1| conserved hypothetical protein [Ixodes scapularis]
Length = 410
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 154/333 (46%), Gaps = 28/333 (8%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+TC I DL+ W++S IT + +++ + + I+ KLY DC+ F
Sbjct: 77 ETCSCHSAVIDSDLQLWSESGIT-KDLVQRSKSRGIHYQIINHKLYRGE--DCLFP---F 130
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
G L +PD++ + + D P K+ PLP+F + + DI +P W
Sbjct: 131 RCSGIEHFLLEIVDELPDLEFIVNTRDWPQAHKRYD---PLPVFSFSKTPDYADIMYPAW 187
Query: 224 SFW-GWSEVNLQPWNEEFKDIKHGSQAKS------WKEKLPFAYWKGN--PDVLSPLRVE 274
+FW G ++L P D++ G +K+ W +K +++G+ D PL
Sbjct: 188 TFWAGGPAISLYPTGIGRWDLQRGIISKTANQKWPWSKKRDVGFFRGSRTSDERDPLI-- 245
Query: 275 LMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
L+ L A+ + + A+D K+ + + C +Y G A S LK+
Sbjct: 246 LLSRKKPHLVDAQYTKNQAWKSAEDTLGLPAAKEVRFEDHCEFKYLFNFRGVAASFRLKH 305
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGK 389
+ C SV + ++ +FF L P ++ P+ DL + ++ ++++ + N ++ +
Sbjct: 306 LFLCKSVVFHVGHEWLEFFYPRLKPWVHYIPV-GTDLSQ-VEELLEFAHENDEVVREMAE 363
Query: 390 AGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYK 421
G DF+ + LTM+ V Y L L+ +YS+LL YK
Sbjct: 364 RGYDFIWKHLTMEDVTCYWLQLLKKYSQLLKYK 396
>gi|296226200|ref|XP_002758827.1| PREDICTED: protein O-glucosyltransferase 1 [Callithrix jacchus]
Length = 391
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 161/338 (47%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 47 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 103
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++RR +PD++++ + D P + K + +P+F + + D
Sbjct: 104 PSRCSGVEHFILEVIRR----LPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 157
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 158 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SP 215
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + ++ KD K L + C ++Y G A
Sbjct: 216 ERDPLILLSRKNPKLVDAEYTKNQAWKQLKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 275
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 276 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 333
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L+TEYSK L Y
Sbjct: 334 AQEIAERGSQFIRNHLQMDDITCYWENLLTEYSKFLSY 371
>gi|410970561|ref|XP_003991747.1| PREDICTED: protein O-glucosyltransferase 1 [Felis catus]
Length = 447
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 155/338 (45%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 103 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYRE--NDCMF 159
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 160 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 213
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK K AY++G+ SP
Sbjct: 214 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKRKNSTAYFRGS--RTSP 271
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 272 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 331
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 332 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLHFVKANDDM 389
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F M L MD + Y +L+TEYSK L Y
Sbjct: 390 AQEIAERGSQFIMNHLQMDDITCYWENLLTEYSKFLSY 427
>gi|403171742|ref|XP_003330935.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169317|gb|EFP86516.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 499
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 82/342 (23%), Positives = 153/342 (44%), Gaps = 26/342 (7%)
Query: 104 QTCPDFFKSIHKDLEPW-----AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
+ C F +++++L+ ++ I+++H+ A +++ R+LI R ++Y+ +
Sbjct: 158 ERCAALFPNLYQELDRAVLYHSSRGNISIQHLDRAYQYSHARVLIYRNRVYIKGFNGGPG 217
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
R + + + P ++PDV+ + D P EH PL+ + ++
Sbjct: 218 LRTEAILNSIQEAVITSPELLPDVEFVIKTSDAPQGGDDEH-----PLWVLDRTKSQEEV 272
Query: 219 -PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
PD+ F+ W E + E + SW K+ A+W+G +L LR +L++
Sbjct: 273 WLMPDYGFYSWPEPKVGGMVEVRDKTAEREASLSWDSKISKAFWRG--AILVKLREQLLE 330
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
+ K W ++I W + G K + + CN+++ ++ EGYA+S LKY+L C SV
Sbjct: 331 VSKGKSW-SDIKPIVW-QNLNGGLKTPE--DHCNYKFLVHTEGYAYSGRLKYLLMCRSVI 386
Query: 338 LIISQQYKDFF-----SRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
+ QY F SR PT+N I + ++D A+P AE I
Sbjct: 387 VGHEMQYIQHFHHLLDSRPHSPTQN-IAIADGPGIEGLPKLMDRLLADPVWAESIANNSY 445
Query: 393 D-FMESLTMDRVYDYMLHLITEYSKLLDYKP--APPSSAFEA 431
F L+ V+ Y + ++ + D+ P P + FE+
Sbjct: 446 PLFRHYLSPAGVHCYWRQMFRAWANVQDFVPHRTPKDTPFES 487
>gi|291400615|ref|XP_002716870.1| PREDICTED: KTEL (Lys-Tyr-Glu-Leu) containing 1 [Oryctolagus
cuniculus]
Length = 462
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 156/339 (46%), Gaps = 32/339 (9%)
Query: 100 PQKVQTCPDFFKSIHKDLEPW--AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
P + C + I +DL P+ SR M +++ K I++ +LY + DC+
Sbjct: 118 PCSSENCSCYHGIIEEDLTPFRGGISRKMMAEVVQRK--LGTHYQIIKNRLYRE--NDCM 173
Query: 158 -QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
SR G + G +PD++++ + D P + K + +P+F + ++
Sbjct: 174 FPSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYY 227
Query: 217 DIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW G V L W+ +++ + WK+K AY++G+ S
Sbjct: 228 DIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVKSAAQWPWKKKNSTAYFRGS--RTS 285
Query: 270 PLR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKKS-----KLSNQCNHRYKIYAEGYA 322
P R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 286 PERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVA 345
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPS 382
S K++ C S+ + +++ +FF L P ++ P+ DL +++ ++ + AN
Sbjct: 346 ASFRFKHLFLCGSLVFHVGEEWLEFFYPQLKPWVHYIPV-KTDLS-NVQELLQFVKANDD 403
Query: 383 EAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F M L MD + Y +L+TEYSK L Y
Sbjct: 404 IAQEIAERGSQFIMNHLQMDDITCYWENLLTEYSKFLSY 442
>gi|312384008|gb|EFR28848.1| hypothetical protein AND_02694 [Anopheles darlingi]
Length = 406
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 76/336 (22%), Positives = 144/336 (42%), Gaps = 23/336 (6%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
S + C + KDL+P+ IT I AK++ ++ KLY Q
Sbjct: 69 STTAIDGCSCHTGVLKKDLKPFKADGITKEMIERAKQYGT-HYQVIDHKLYR-------Q 120
Query: 159 SRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
+ MF G ++ +PD+D++ +C D P + + + P+F + D +
Sbjct: 121 TECMFPARCSGIEHFVKPLLPKLPDMDLIINCRDWPQVHRHWNKE-KTPVFSFSKTDEYL 179
Query: 217 DIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
DI +P W+FW G ++L P W++ + I ++ WK K P A+++G+
Sbjct: 180 DIMYPAWAFWEGGPAISLYPTGLGRWDQHRESISQAAERVPWKNKKPMAFFRGSRTSDER 239
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSV 325
+ L+ L A+ + + +D + L + C +R+ G A S
Sbjct: 240 DALVLLSREQPSLVDAQYTKNQAWKSPQDTLNAEPASEVSLEDHCRYRFLFNFRGVAASF 299
Query: 326 SLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAE 385
K++ C S+ + ++ +F+ L P ++ P+P +++++ + + EA
Sbjct: 300 RFKHLFLCRSLVFHVGDEWLEFYYPSLKPWVHYVPVPVRSSQAKLEALIRFFRDHDDEAN 359
Query: 386 KIGKAG-QDFMESLTMDRVYDYMLHLITEYSKLLDY 420
I + G + + L M V Y L+ EY KL+ Y
Sbjct: 360 AIAERGFRQVWQHLRMKDVKCYWKKLLHEYGKLIRY 395
>gi|198450765|ref|XP_002137151.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
gi|198131182|gb|EDY67709.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
Length = 420
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 136/322 (42%), Gaps = 28/322 (8%)
Query: 112 SIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYY----DCVQSRAMFTIWG 167
+I +D P+A++ IT R +M R + ++ G++Y P C M
Sbjct: 69 TIRRDFGPYAEAGIT-RSMMAQSRRLGVVYKVIDGRIYRQPEVPHPKRCADVEDM----- 122
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWG 227
L G +P+V+ + + D P + F P+F + + H DI P WSF
Sbjct: 123 ----LLGIAGELPNVEFILNVRDWPQVPFLS--GFTGPVFSHSVSHQHLDIMCPAWSFSS 176
Query: 228 WSEVNLQ-------PWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
S LQ W + + + SW+ K P +++G + +
Sbjct: 177 VSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGTRSSTERDTLVRLSARS 236
Query: 281 SKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALII 340
L A+ AE + S C ++Y G S L++IL C S+ L +
Sbjct: 237 PDLVDAQYTSNVGAETVDE----VPFSGHCQYKYLFNFGGITASFRLRHILLCKSLVLHV 292
Query: 341 SQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLT 399
Q+++FF L P ++ P+PS S++ ++ + + AE+I + G F+ + L
Sbjct: 293 GDQWREFFYSSLKPWVHYVPVPSNATVESLEQLLVYLRLHDDLAEEIAERGFQFVWQHLR 352
Query: 400 MDRVYDYMLHLITEYSKLLDYK 421
+ V Y +L+ EY+KLL Y+
Sbjct: 353 LQDVQCYWRNLLLEYAKLLKYR 374
>gi|327268710|ref|XP_003219139.1| PREDICTED: protein O-glucosyltransferase 1-like [Anolis
carolinensis]
Length = 391
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 157/338 (46%), Gaps = 28/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCVQ 158
P + C + K +DL+P+ + I+ + EA R I+ KLY + +DC
Sbjct: 47 PCVTENCRCYQKVREQDLDPFQEG-ISEELLSEAISRRLGTHYQIIAKKLYRE--HDC-- 101
Query: 159 SRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
MF G + +PD++++ + D P + K +P+F + ++
Sbjct: 102 ---MFPARCSGVEHFILEIIDELPDMEMVINVRDYPQVPK--FMKPKVPIFSFSKTSEYY 156
Query: 217 DIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW G V L W+ +D+K S+ W+ K+ A+++G+
Sbjct: 157 DIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRQDLKSSSEKWPWRAKISKAFFRGSRTSAE 216
Query: 270 PLRVELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWS 324
+ L+ + +L AE + + KD K+ L++ C ++Y G A S
Sbjct: 217 RDPLILLSRENPELVDAEYTKNQAWKSEKDTLGEPPAKEISLTDHCKYKYLFNFRGVAAS 276
Query: 325 VSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEA 384
K++ C S+ + + +++FF L P ++ P+ S DL ++ ++++ N A
Sbjct: 277 FRFKHLFLCGSLVFHVGEDWQEFFYSQLKPWVHYIPVKS-DLS-DVRELLEFVKENDDVA 334
Query: 385 EKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYK 421
E+I + G+ F M+ L M+ + Y L+T+YSK L YK
Sbjct: 335 EEISERGRQFIMDHLRMEDISCYWKKLLTDYSKALTYK 372
>gi|21040486|gb|AAH30614.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
gi|312150392|gb|ADQ31708.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [synthetic construct]
Length = 392
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 157/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ R + E +R I + +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRRMMAEVVRRKLGTHYQITKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ ++KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNTKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|403288562|ref|XP_003935467.1| PREDICTED: protein O-glucosyltransferase 1 [Saimiri boliviensis
boliviensis]
Length = 391
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 159/338 (47%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 47 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 103
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++RR +PD++++ + D P + K + +P+F + + D
Sbjct: 104 PSRCSGVEHFILEVIRR----LPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 157
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 158 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 215
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L C ++Y G A
Sbjct: 216 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVEHCKYKYLFNFRGVAA 275
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 276 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 333
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L+TEYSK L Y
Sbjct: 334 AQEIAERGSQFIRNHLQMDDITCYWENLLTEYSKFLSY 371
>gi|340719860|ref|XP_003398363.1| PREDICTED: o-glucosyltransferase rumi homolog [Bombus terrestris]
Length = 404
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 148/331 (44%), Gaps = 22/331 (6%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
I DL+P+ K I+ I AK + I++GKLY DC+ I FL L
Sbjct: 78 IINDLKPFKKKGISKDLINIAKTRGTV-YQIIQGKLYRQK--DCMFPSRCSGIEHFL--L 132
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+ PG+ D+D++ + D P K H PLP+FR ++DI +P W+FW G +
Sbjct: 133 KLAPGLT-DMDLVINVRDYPQSSK--HFGGPLPIFRLTP--EYYDITYPAWAFWEGGPAI 187
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
+L P W+E + S+ W+EK A+++G+ + L+ L A
Sbjct: 188 SLYPRGLGRWDEHRVSLDKASKNTLWEEKENKAFFRGSRTSSERDNLILLSRKKPNLVDA 247
Query: 287 EILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + +D + L C ++Y G A S K++ C S+ +
Sbjct: 248 QYTKNQAWKSNEDTLYAPPASEVSLEAHCKYKYLFNYRGVAASFRHKHLFLCRSLVFHVG 307
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++ +F+ +IP ++ P+ ++ ++ + N ++KI G+DF+ +L +
Sbjct: 308 DEWTEFYYNAMIPWIHYIPVSKDANQTVLEELIQFAIDNDKTSKKIADRGRDFIWNNLKL 367
Query: 401 DRVYDYMLHLITEYSKLLDYKPAPPSSAFEA 431
V +L+ +YSKLL YK S +
Sbjct: 368 SDVTQSWKNLLKKYSKLLTYKTTLDKSLIKV 398
>gi|322791349|gb|EFZ15836.1| hypothetical protein SINV_05265 [Solenopsis invicta]
Length = 408
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/363 (22%), Positives = 161/363 (44%), Gaps = 27/363 (7%)
Query: 75 KIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK 134
K ++ +Y L AM + + + K + D I +DL P+ I+ +++A
Sbjct: 47 KEINESYKKYLVAMENAEKSYKKCNNTKHECFKDV---ILRDLGPFRTKGIS-EEMIQAA 102
Query: 135 RFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKP 192
R +++GKLY + DC MF G L + G + D+D++ + D P
Sbjct: 103 RTRGTFYQVIKGKLYREK--DC-----MFPARCAGIEHFLLKVIGNLSDMDLVINTRDYP 155
Query: 193 VIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHG 246
E+ +P+F + ++DI +P W+FW G ++L P W++ K +
Sbjct: 156 --QSSEYFGNAMPVFSFSKTPQYYDIMYPAWAFWEGGPAISLYPHGLGRWDQHRKSLNKA 213
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS-- 304
S W++K +++G+ + L+ + L A+ + + +D +
Sbjct: 214 SLEIPWEKKESKGFFRGSRTSSERDNLILLSRSKPHLVDAQYTKNQAWKSNEDTLHATPA 273
Query: 305 ---KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPI 361
L + C ++Y G A S K++ C S+ + ++ +F+ + P ++ P+
Sbjct: 274 SEVSLESHCTYKYLFNFRGVAASFRFKHLFLCRSLVFHVGDEWVEFYYYEMKPWIHYIPV 333
Query: 362 PSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDY 420
P + +++++++ N A+KI G+DF+ +L M + + L+ Y KL+ Y
Sbjct: 334 PKDAHQKELENLIEFVQNNDDIAKKIAYRGRDFIWNNLRMSDITYFWKQLLKSYGKLVAY 393
Query: 421 KPA 423
KP
Sbjct: 394 KPV 396
>gi|432113196|gb|ELK35717.1| Protein O-glucosyltransferase 1, partial [Myotis davidii]
Length = 364
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 156/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 20 PCSSQNCSCYRGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 76
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 77 PSRCS----GVEHFILEVIGHLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 130
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 131 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SP 188
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 189 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 248
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 249 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLHFVKANDDI 306
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F M L MD + Y +L+TEYSK L Y
Sbjct: 307 AQEIAERGSHFIMNHLRMDDITCYWENLLTEYSKFLSY 344
>gi|60302736|ref|NP_001012560.1| KTEL motif-containing protein 1 precursor [Gallus gallus]
gi|60098623|emb|CAH65142.1| hypothetical protein RCJMB04_4b16 [Gallus gallus]
Length = 392
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 155/323 (47%), Gaps = 28/323 (8%)
Query: 115 KDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQL 171
+DL P+ +S I+ I + R I++ KLY + +DC MF G
Sbjct: 63 QDLAPF-RSGISKETISDVVSRKLGTHYQIIKNKLYRE--HDC-----MFPARCSGVEHF 114
Query: 172 LRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSE 230
+R +PD++++ + D P + K +P+F + + DI +P W+FW G
Sbjct: 115 IRGIINRLPDMEMVINVRDYPQVPKWMKPI--IPVFSFSKTAEYNDIMYPAWTFWEGGPA 172
Query: 231 V------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
V L W+ +D++ ++ WK+K+ Y++G+ + L+ + +L
Sbjct: 173 VWPIYPTGLGRWDLMREDLRRSAEKWPWKKKISKGYFRGSRTSSERDPLILLSRENPELV 232
Query: 285 GAEILR-QNWAEE----AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
AE + Q W E K K+ L + C ++Y G A S LK++ C S+
Sbjct: 233 DAEYTKNQAWKSEKDTLGKPPAKEIPLVDHCKYKYLFNFRGVAASFRLKHLFLCGSLVFH 292
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+ +++ +FF + L P ++ P+ S DL ++ ++ + N + A++I + G+ F+ E L
Sbjct: 293 VGEEWLEFFYQQLKPWVHYIPVKS-DLS-DVRELLQFAKENDNIAQEIAERGRQFITEHL 350
Query: 399 TMDRVYDYMLHLITEYSKLLDYK 421
M+ V Y HL++EYS+ L YK
Sbjct: 351 QMEDVSCYWEHLLSEYSQTLTYK 373
>gi|332225464|ref|XP_003261899.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Nomascus
leucogenys]
Length = 392
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 156/339 (46%), Gaps = 32/339 (9%)
Query: 100 PQKVQTCPDFFKSIHKDLEPW--AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
P Q C + I +DL P+ SR TM ++ +R I + +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPFRGGISRKTMAEVV--RRKLGTHYQITKNRLYREN--DCM 103
Query: 158 -QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
SR G + G +PD++++ + D P + K + +P+F + +
Sbjct: 104 FPSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYH 157
Query: 217 DIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW G V L W+ +D+ + WK+K AY++G+ S
Sbjct: 158 DIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TS 215
Query: 270 PLR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYA 322
P R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 216 PERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVA 275
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPS 382
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 276 ASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDD 333
Query: 383 EAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 334 VAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|410906075|ref|XP_003966517.1| PREDICTED: protein O-glucosyltransferase 1-like [Takifugu rubripes]
Length = 367
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 156/336 (46%), Gaps = 32/336 (9%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCV-QSRA 161
C + DL P+ K +I+ +M+A +R I+ KLY + +C+ +R
Sbjct: 28 NCSCHLSVLQDDLRPF-KFKIS-EELMDATTQRGVGTHYQIIGHKLYREQ--NCMFPARC 83
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
LQL+ R +PD++++ + D P + K S LP+F + + DI +P
Sbjct: 84 SGVEHFILQLIDR----LPDMEMVVNVRDYPQVPKWMESS--LPVFSFSKTADYLDIMYP 137
Query: 222 DWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR-- 272
W+FW G V L W+ D+K + WK+K +++G+ SP R
Sbjct: 138 AWTFWEGGPAVWPIYPTGLGRWDLMRTDLKKSADQWPWKKKETKGFFRGSR--TSPERDP 195
Query: 273 VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSL 327
+ L+ D +L AE + + KD K+ L + C ++Y G A S
Sbjct: 196 LILLSREDPELVDAEYTKNQAWKSEKDTLGRPPAKEIPLVDHCKYKYLFNFRGVAASFRF 255
Query: 328 KYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKI 387
K++ C S+ + ++++FF L P ++ P+ DL ++ ++ + N S A++I
Sbjct: 256 KHLFLCGSLVFHVGDEWQEFFYPQLKPWVHYIPV-QQDLS-DVRGLLQFAKENDSLAQEI 313
Query: 388 GKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYKP 422
G++F + L M V Y L+TEYS+LL YKP
Sbjct: 314 ATRGKEFIINHLRMQDVSCYWEKLLTEYSRLLTYKP 349
>gi|60688257|gb|AAH91408.1| Ktelc1 protein [Rattus norvegicus]
Length = 349
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 158/339 (46%), Gaps = 30/339 (8%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV 157
P Q C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 4 EPCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRRLGTHYQIIKHRLFRED--DCM 60
Query: 158 -QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
SR L+++RR +PD++++ + D P + K + +P+F + +
Sbjct: 61 FPSRCSGVEHFILEVIRR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYH 114
Query: 217 DIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW G V L W+ +D+ + W++K AY++G+ S
Sbjct: 115 DIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRT--S 172
Query: 270 PLR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYA 322
P R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 173 PERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYKYLFNFRGVA 232
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPS 382
S K++ C S+ + ++ +FF L P ++ P+ + DL ++ ++ + AN
Sbjct: 233 ASFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-DVQELLQFVKANDD 290
Query: 383 EAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I K G F + L MD + Y +L+TEYSK L Y
Sbjct: 291 LAQEIAKRGSQFIINHLQMDDITCYWENLLTEYSKFLSY 329
>gi|281345604|gb|EFB21188.1| hypothetical protein PANDA_003280 [Ailuropoda melanoleuca]
Length = 364
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 155/338 (45%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 20 PCSSQNCSCYHGVIDEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 76
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 77 PSRCS----GVEHFILEVIGQLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 130
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 131 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SP 188
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 189 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 248
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 249 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLHFVKANDDV 306
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F M L M+ V Y L+TEYSK L Y
Sbjct: 307 AQEIAERGSHFIMNHLQMEDVTCYWESLLTEYSKFLSY 344
>gi|281332155|ref|NP_001094122.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 precursor [Rattus norvegicus]
gi|149060499|gb|EDM11213.1| similar to RIKEN cDNA 9630046K23 [Rattus norvegicus]
Length = 392
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 158/339 (46%), Gaps = 30/339 (8%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV 157
P Q C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 47 EPCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRRLGTHYQIIKHRLFRED--DCM 103
Query: 158 -QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
SR L+++RR +PD++++ + D P + K + +P+F + +
Sbjct: 104 FPSRCSGVEHFILEVIRR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYH 157
Query: 217 DIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW G V L W+ +D+ + W++K AY++G+ S
Sbjct: 158 DIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRT--S 215
Query: 270 PLR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYA 322
P R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 216 PERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYKYLFNFRGVA 275
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPS 382
S K++ C S+ + ++ +FF L P ++ P+ + DL ++ ++ + AN
Sbjct: 276 ASFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-DVQELLQFVKANDD 333
Query: 383 EAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I K G F + L MD + Y +L+TEYSK L Y
Sbjct: 334 LAQEIAKRGSQFIINHLQMDDITCYWENLLTEYSKFLSY 372
>gi|412992075|emb|CCO20801.1| glycosyltransferase CAZy GT90 [Bathycoccus prasinos]
Length = 438
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 150/358 (41%), Gaps = 48/358 (13%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKR--FAALRILIVRGKLYVDPYYDCVQ-SRAMFTIWGFL 169
I DL+PW+ + IT R + A + A RI I+ GK+Y SR + +WG +
Sbjct: 75 IDVDLQPWSMTGITKRMVDLAAQQGMRANRIQIIGGKIYAQISKSSRGPSRIWYWLWGLM 134
Query: 170 QLLRRYPG-MVPDVDIMFDCMD------------KPVIDKKEH-------GSFPLPLFRY 209
+L+ +P VPDVD + + D P++ KK G P P+F
Sbjct: 135 ELIDEFPEEAVPDVDFILNTQDDPQVSIVGKRPKNPILAKKYRDFVPGIKGQAPPPVFSA 194
Query: 210 CTNDAHFDIPFPDWSFWG---------WSEVNLQPWNEEFKDIKHGSQAKSWKEKLP-FA 259
T ++D+ +P W+ WG + PW E + H ++ W E+
Sbjct: 195 VTTSNNYDLLWPLWTIWGEDVEGAGSKTGGFHDPPWKELHPKLIHFAKKNKWSERRSERI 254
Query: 260 YWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAE 319
+W+G+ +P R L++C+ + + A++ + + D + K ++Y IY +
Sbjct: 255 FWRGSVKT-NPARRALIRCSKNTVDAADVQHKLRVGKPIDALDRVK------YKYLIYLD 307
Query: 320 GYAWSVSLKYILSCNSVALIISQQ------YKDFFSRGLIPTKNHFPIPSADLCRSIKSV 373
G ++S ++ +L +V + + + F G H + ++CR++ +
Sbjct: 308 GKSFSSAVLPMLVAGAVVFLPNNSPFQTLCQRAFRENGFHQV-FHVSLSQGEICRTLSEI 366
Query: 374 VDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
+ E K D+ E+ L+M YM+ ++ Y+ LL Y P E
Sbjct: 367 LSGLRNEELRVENRAKDAVDWAETQLSMLAFQKYMIAMLKRYADLLKYTPTKTKDTVE 424
>gi|301758922|ref|XP_002915307.1| PREDICTED: KTEL motif-containing protein 1-like [Ailuropoda
melanoleuca]
Length = 380
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 155/338 (45%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 36 PCSSQNCSCYHGVIDEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 92
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 93 PSRCS----GVEHFILEVIGQLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 146
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 147 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SP 204
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 205 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 264
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 265 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLHFVKANDDV 322
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F M L M+ V Y L+TEYSK L Y
Sbjct: 323 AQEIAERGSHFIMNHLQMEDVTCYWESLLTEYSKFLSY 360
>gi|440897644|gb|ELR49289.1| KTEL motif-containing protein 1 [Bos grunniens mutus]
Length = 392
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 153/333 (45%), Gaps = 30/333 (9%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV-QSRAM 162
C + I +DL P+ + I+ + + E +R I++ +LY + DC+ SR
Sbjct: 53 NCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYRES--DCMFPSRCS 109
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
G + G +PD++++ + D P + K + +P+F + + DI +P
Sbjct: 110 ----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPIFSFSKTSEYHDIMYPA 163
Query: 223 WSFWGWSE-------VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--V 273
W+FW + L W+ +D+ + WK+K AY++G+ SP R +
Sbjct: 164 WTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SPERDPL 221
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+ + KL AE + + KD K L + C ++Y G A S K
Sbjct: 222 ILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFK 281
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A++I
Sbjct: 282 HLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDVAQEIA 339
Query: 389 KAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
+ G F + L MD + Y +L+TEYSK L Y
Sbjct: 340 ERGSQFILNHLKMDDITCYWENLLTEYSKFLSY 372
>gi|402859114|ref|XP_003894014.1| PREDICTED: protein O-glucosyltransferase 1 [Papio anubis]
Length = 392
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 157/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|426217502|ref|XP_004002992.1| PREDICTED: protein O-glucosyltransferase 1 [Ovis aries]
Length = 392
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 153/333 (45%), Gaps = 30/333 (9%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV-QSRAM 162
C + I +DL P+ + I+ + + E +R I++ +LY + DC+ SR
Sbjct: 53 NCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYRES--DCMFPSRCS 109
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
G + G +PD++++ + D P + K + +P+F + + DI +P
Sbjct: 110 ----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPIFSFSKTSEYHDIMYPA 163
Query: 223 WSFWGWSE-------VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--V 273
W+FW + L W+ +D+ + WK+K AY++G+ SP R +
Sbjct: 164 WTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SPERDPL 221
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+ + KL AE + + KD K L + C ++Y G A S K
Sbjct: 222 ILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFK 281
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A++I
Sbjct: 282 HLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDIAQEIA 339
Query: 389 KAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
+ G F + L MD + Y +L+TEYSK L Y
Sbjct: 340 ERGSQFILNHLKMDDIICYWENLLTEYSKFLSY 372
>gi|380786923|gb|AFE65337.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
Length = 392
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 157/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|109033215|ref|XP_001109371.1| PREDICTED: KTEL motif-containing protein 1-like [Macaca mulatta]
gi|355559357|gb|EHH16085.1| hypothetical protein EGK_11322 [Macaca mulatta]
gi|355746435|gb|EHH51049.1| hypothetical protein EGM_10372 [Macaca fascicularis]
Length = 392
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 157/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|255088471|ref|XP_002506158.1| predicted protein [Micromonas sp. RCC299]
gi|226521429|gb|ACO67416.1| predicted protein [Micromonas sp. RCC299]
Length = 658
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/357 (22%), Positives = 151/357 (42%), Gaps = 45/357 (12%)
Query: 121 AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIW--GFLQLLRRYPGM 178
+K+ + ++ + R I+ G+L+V +R W L+ +RR +
Sbjct: 217 SKADVDTAELVARGSVNSFRAQIIGGRLFVKDIRALEFARDYAPSWKITLLETMRRRRDL 276
Query: 179 VPDVDIMFDCMDKPVI---------------DKKEHGSFPLPLFRYCTN-DAHFDIPFPD 222
PD+D +F+ D P++ + +G P PLF TN D+PFPD
Sbjct: 277 -PDIDAVFNEGDYPIVLLPNDGAHAQRLYGREGMSNGQKPPPLFSPTTNVQMTRDVPFPD 335
Query: 223 WSFW-----GWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV-ELM 276
+SF G ++ W+ + +++KLP A + GN R+ E+
Sbjct: 336 FSFSPPGVKGADRLSTTRWSVAHGRLLEAGAKIPFEDKLPLAAFTGNTQAEPRQRLAEVA 395
Query: 277 KCNDSKLWGAEILRQNWAEEA---------KDGFKKSKLS----NQCNHRYKIYAEGYAW 323
+ N ++ ++ +++ E K G + K + C +RY + +
Sbjct: 396 RSNPDSVFVNQVFKKSPTGERSCVQLGLADKGGLQADKCALSFEEMCRYRYLVNVGSNGY 455
Query: 324 SVSLKYILSCNSVALII--SQQYKDFFSRGLIPTKNHFPI-PSADLCRSIKSVVDWGNAN 380
+ LK + C SV + + S K+FF L+P ++ + S+D+ +++ + + N
Sbjct: 456 ANKLKSLFLCGSVVINVESSAPNKEFFEHQLLPGVHYVSVRDSSDVPAAVREMEE----N 511
Query: 381 PSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLL 437
A+ I AG M + D VYDY+ +TEY+ + +KP +FE E L
Sbjct: 512 MRRAKSIAAAGTRRMAAFNADAVYDYVATALTEYASRMTFKPERSPGSFEVSCEDDL 568
>gi|449485840|ref|XP_002191338.2| PREDICTED: protein O-glucosyltransferase 1 [Taeniopygia guttata]
Length = 514
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 159/325 (48%), Gaps = 32/325 (9%)
Query: 115 KDLEPW--AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGFLQL 171
+DL P+ S+ TM ++ K I++ KLY + DC+ +R L +
Sbjct: 185 QDLAPFRGGISKETMSDVVSRK--LGTHYQIIKNKLYREQ--DCMFPARCSGVEHFILGI 240
Query: 172 LRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSE 230
+ R +PD++++ + D P + K +P+F + + DI +P W+FW G
Sbjct: 241 IHR----LPDMEMVINVRDYPQVPKWMKPI--IPVFSFSKTSEYNDIMYPAWTFWEGGPA 294
Query: 231 V------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--VELMKCNDSK 282
V L W+ +D++ ++ WK+K+ Y++G+ SP R + L+ + +
Sbjct: 295 VWPIYPTGLGRWDLMREDLRRSAEKWPWKKKISKGYFRGSRT--SPERDPLILLSRENPE 352
Query: 283 LWGAEILR-QNWAEE----AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
L AE + Q W E K K+ L + C ++Y G A S LK++ C S+
Sbjct: 353 LVDAEYTKNQAWKSEKDTLGKPPAKEIPLVDHCKYKYLFNFRGVAASFRLKHLFLCGSLV 412
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
+ +++ +FF L P ++ P+ S DL ++ ++ + N + A++I + G+ F+ E
Sbjct: 413 FHVGEEWLEFFYPQLKPWVHYIPVRS-DLS-DVRELLQFAKENDAIAQEISERGRQFITE 470
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYK 421
L M+ + Y HL++EYS++L YK
Sbjct: 471 HLEMEDISCYWEHLLSEYSQILTYK 495
>gi|195443900|ref|XP_002069627.1| GK11623 [Drosophila willistoni]
gi|194165712|gb|EDW80613.1| GK11623 [Drosophila willistoni]
Length = 383
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 145/325 (44%), Gaps = 23/325 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ-SRAMFT 164
C + +I ++L P+A + IT I +++RF L ++ +LY +C SR
Sbjct: 50 CECYALAIARNLRPYAGTGITKPMIDQSRRFGTL-YKVIGSRLYRSD--NCAYPSRCASV 106
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWS 224
L ++R +PD++ + + D P I P+F Y + D DI P WS
Sbjct: 107 EELLLNIVRD----LPDLEFVLNVRDWPQIHFL--SGLSGPVFSYSSTDNFLDIMCPAWS 160
Query: 225 FWGWSEVNLQP-------WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
FW + LQ W+ + I ++ W++K+ +++G+ + L+
Sbjct: 161 FWTSAGPLLQQYPRGLGRWDHMRRFIADRARRMPWQKKISIGFFRGSRSSKERDNLVLLT 220
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
L A+ + + K+ L+ C +Y G + S L++IL C S+
Sbjct: 221 KRAPHLVDAQ-----YTQSKNSPVKEMSLAEHCKFKYLFNFRGISASFRLRHILLCKSLV 275
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
L + Q++++FF L P ++ P+ S ++ ++ + + AE+I + G F+ +
Sbjct: 276 LHVGQEWQEFFYSSLKPWIHYVPVGSNASEEDLEGLILYLRQHDDLAEEIAERGFQFVWQ 335
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYK 421
L M + Y L+ EY+KLL Y
Sbjct: 336 QLRMKDILCYWRQLLQEYAKLLSYN 360
>gi|345796207|ref|XP_545115.3| PREDICTED: protein O-glucosyltransferase 1 [Canis lupus familiaris]
Length = 392
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 156/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 48 PCSSQNCSCYHGIISEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P I K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQIPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKK-----SKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLHFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L M+ V Y L+TEYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMEDVTCYWESLLTEYSKFLSY 372
>gi|170055423|ref|XP_001863576.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
gi|193806713|sp|B0X1Q4.1|RUMI_CULQU RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|167875399|gb|EDS38782.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
Length = 403
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 166/380 (43%), Gaps = 46/380 (12%)
Query: 65 KEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSR 124
++E+ + YK Y+T I E P + C + DL P+ +S
Sbjct: 36 QQEAPTNNLYKAADNKYITL------IEEALAAYKPCESSNCSCHLDVLKTDLRPF-RSG 88
Query: 125 ITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQLLRRYPGMVPDV 182
IT + ++E R + I+ +++ Q MF G +R +PD+
Sbjct: 89 IT-QDLIELARSYGTKYQIIGHRMFR-------QRDCMFPARCSGVEHFIRPNLPKLPDM 140
Query: 183 DIMFDCMDKPVIDKKEHGSF-PLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP----- 235
+++ +C D P I + + S PLP+ + + + DI +P W FW G ++L P
Sbjct: 141 ELIINCRDWPQISRHWNASREPLPVLSFSKTNDYLDIMYPTWGFWEGGPAISLYPTGLGR 200
Query: 236 WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPD--------VLSPLRVELMKCNDSKLWGAE 287
W++ ++ ++ W++KL A+++G+ +LS +R EL+ +K
Sbjct: 201 WDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDERDPLVLLSRMRPELVDAQYTK----- 255
Query: 288 ILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
Q W KD ++ +L + C ++Y G A S K++ C S+ + Q
Sbjct: 256 --NQAW-RSPKDTLHAEPAQEVRLEDHCQYKYLFNFRGVAASFRFKHLFLCKSLVFHVGQ 312
Query: 343 QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMD 401
++++FF L P ++ P+P ++ ++ + + A++I G + + L M+
Sbjct: 313 EWQEFFYDSLKPWVHYVPVPVGINEWELEHLIQFFREHDQLAQEIANRGYEHIWNHLRME 372
Query: 402 RVYDYMLHLITEYSKLLDYK 421
V Y L+ Y KL+ Y+
Sbjct: 373 DVECYWKRLLRRYGKLVKYE 392
>gi|31982953|ref|NP_689518.1| protein O-glucosyltransferase 1 precursor [Homo sapiens]
gi|114588661|ref|XP_516666.2| PREDICTED: protein O-glucosyltransferase 1 isoform 2 [Pan
troglodytes]
gi|397509574|ref|XP_003825192.1| PREDICTED: protein O-glucosyltransferase 1 [Pan paniscus]
gi|426341692|ref|XP_004036160.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Gorilla
gorilla gorilla]
gi|74730148|sp|Q8NBL1.1|PGLT1_HUMAN RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; Short=hCLP46; AltName:
Full=KTEL motif-containing protein 1; AltName:
Full=Myelodysplastic syndromes relative protein; Flags:
Precursor
gi|22761537|dbj|BAC11625.1| unnamed protein product [Homo sapiens]
gi|29179612|gb|AAH48810.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
gi|31745813|gb|AAP56253.1| myelodysplastic syndromes relative [Homo sapiens]
gi|37182284|gb|AAQ88944.1| MDS010 [Homo sapiens]
gi|119599976|gb|EAW79570.1| chromosome 3 open reading frame 9 [Homo sapiens]
gi|189067501|dbj|BAG37760.1| unnamed protein product [Homo sapiens]
gi|410228232|gb|JAA11335.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410254852|gb|JAA15393.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410299780|gb|JAA28490.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410328909|gb|JAA33401.1| protein O-glucosyltransferase 1 [Pan troglodytes]
Length = 392
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 156/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I + +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQITKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|383420127|gb|AFH33277.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
Length = 392
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 157/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNCTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|413953162|gb|AFW85811.1| hypothetical protein ZEAMMB73_989593 [Zea mays]
Length = 231
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/106 (45%), Positives = 64/106 (60%), Gaps = 2/106 (1%)
Query: 360 PIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLL 418
PI +C+SI V WGN +P +A+ IG+ G F+ E L+MD VYDYM+HL+TEY+ LL
Sbjct: 98 PINREHMCKSINFAVGWGNEHPVQAQLIGEQGSRFVREELSMDYVYDYMMHLLTEYAGLL 157
Query: 419 DYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAAS-PSPYPPCTL 463
YKPA P A E C ES+ C A R + + S + + CTL
Sbjct: 158 RYKPAVPEKAVEICTESVACPAQSLHRDCMMDSMESHVAGFDLCTL 203
>gi|26334375|dbj|BAC30905.1| unnamed protein product [Mus musculus]
Length = 392
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 159/338 (47%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q+C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 48 PCSSQSCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLFRED--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++ R +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCSGVEHFILEVIHR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + W++K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C +RY G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYRYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDI 334
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I K G F + L MD + Y +L+T+YSK L Y
Sbjct: 335 AQEIAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSY 372
>gi|297670280|ref|XP_002813298.1| PREDICTED: protein O-glucosyltransferase 1 [Pongo abelii]
Length = 392
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 156/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I + +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRRLGTHYQITKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>gi|62897433|dbj|BAD96657.1| x 010 protein variant [Homo sapiens]
Length = 392
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 157/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I + +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQITKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C+S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCDSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLPY 372
>gi|62460496|ref|NP_001014903.1| protein O-glucosyltransferase 1 precursor [Bos taurus]
gi|75057820|sp|Q5E9Q1.1|PGLT1_BOVIN RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
motif-containing protein 1; Flags: Precursor
gi|59858103|gb|AAX08886.1| x 010 protein [Bos taurus]
gi|296491446|tpg|DAA33499.1| TPA: KTEL motif-containing protein 1 precursor [Bos taurus]
Length = 392
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 153/333 (45%), Gaps = 30/333 (9%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV-QSRAM 162
C + I +DL P+ + I+ + + E +R I++ +LY + DC+ SR
Sbjct: 53 NCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYRES--DCMFPSRCS 109
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
G + G +PD++++ + D P + K + +P+F + + DI +P
Sbjct: 110 ----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPIFSFSKTLEYHDIMYPA 163
Query: 223 WSFWGWSE-------VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--V 273
W+FW + L W+ +D+ + WK+K AY++G+ SP R +
Sbjct: 164 WTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SPERDPL 221
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+ + KL AE + + KD K L + C ++Y G A S K
Sbjct: 222 ILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFK 281
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A++I
Sbjct: 282 HLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDVAQEIA 339
Query: 389 KAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
+ G F + L MD + Y +L+TEYSK L Y
Sbjct: 340 ERGSQFILNHLKMDDITCYWENLLTEYSKFLSY 372
>gi|148665570|gb|EDK97986.1| RIKEN cDNA 9630046K23, isoform CRA_b [Mus musculus]
Length = 372
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 158/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 28 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLFRED--DCMF 84
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++ R +PD++++ + D P + K + +P+F + + D
Sbjct: 85 PSRCSGVEHFILEVIHR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHD 138
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + W++K AY++G+ SP
Sbjct: 139 IMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRT--SP 196
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C +RY G A
Sbjct: 197 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYRYLFNFRGVAA 256
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 257 SFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDI 314
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I K G F + L MD + Y +L+T+YSK L Y
Sbjct: 315 AQEIAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSY 352
>gi|195573000|ref|XP_002104483.1| GD18424 [Drosophila simulans]
gi|194200410|gb|EDX13986.1| GD18424 [Drosophila simulans]
Length = 411
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 145/324 (44%), Gaps = 18/324 (5%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
+ +DL P+ + +T + I + R+ + I +LY D +C+ I FL L
Sbjct: 80 MKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDA--NCMFPARCEGIEHFLLPL 136
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+PD+D++ + D P ++ + P+F + + DI +P W+FW G
Sbjct: 137 ---VATLPDMDLVINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPAT 193
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
L P W++ + ++ + A W +K +++G+ + L+ + +L A
Sbjct: 194 KLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGSRTSDERDSLILLSRRNPELVEA 253
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + +KD + + C ++Y G A S LK++ C S+ +
Sbjct: 254 QYTKNQGWKSSKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCKSLVFHVG 313
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++++FF L P ++ P+ S + + ++ + N + A++I + G DF+ E L M
Sbjct: 314 DEWQEFFYDQLKPWVHYVPLKSYPSQQEYEHILSFFKKNDALAQEIAQRGYDFIWEHLRM 373
Query: 401 DRVYDYMLHLITEYSKLLDYKPAP 424
+ Y L+ Y KLL Y+ P
Sbjct: 374 KDIKCYWRKLLKRYVKLLQYEVKP 397
>gi|27369505|ref|NP_759012.1| protein O-glucosyltransferase 1 precursor [Mus musculus]
gi|110832796|sp|Q8BYB9.2|PGLT1_MOUSE RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
motif-containing protein 1; Flags: Precursor
gi|20071120|gb|AAH26809.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Mus musculus]
gi|26327463|dbj|BAC27475.1| unnamed protein product [Mus musculus]
gi|26331050|dbj|BAC29255.1| unnamed protein product [Mus musculus]
gi|26331242|dbj|BAC29351.1| unnamed protein product [Mus musculus]
gi|148665569|gb|EDK97985.1| RIKEN cDNA 9630046K23, isoform CRA_a [Mus musculus]
Length = 392
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 158/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLFRED--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++ R +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCSGVEHFILEVIHR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + W++K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C +RY G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYRYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDI 334
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I K G F + L MD + Y +L+T+YSK L Y
Sbjct: 335 AQEIAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSY 372
>gi|255930987|ref|XP_002557050.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211581669|emb|CAP79782.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 436
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/313 (25%), Positives = 143/313 (45%), Gaps = 33/313 (10%)
Query: 139 LRILIVRGKLYVDPY--YDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVI 194
+R +I RG+LY+ Y SRA T+ + L +P ++P ++ +F D
Sbjct: 115 VRGIIDRGELYIVDYAPMPVTASRARATLNSLHRALTAFPDRHLLPSIEFIFTTEDF--- 171
Query: 195 DKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNE---EFKDIKHGSQAK 250
E + P P++ Y D+H + PD+ +W W EV + P++E I G A
Sbjct: 172 --AEDTTTPSPIWSYSKRDSHTSVWLMPDFGYWAWPEVQIGPYHEVRRRIAAIDDGETAA 229
Query: 251 --------SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFK 302
++EK W+G+ P+R +L+K + W A + +W ++ F
Sbjct: 230 DGTYVPGLQFQEKKKQLVWRGSLATNPPVRSKLLKSALGRSW-ASVRIIDWDDQNDIRFN 288
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHF 359
+ + C + + + EG ++S KY+L+C SV + + ++++ L+ P N+
Sbjct: 289 LLPIEDHCRYMFLAHTEGRSFSGRGKYLLNCRSVVVSHALEWREAHHAALVSSGPDANYI 348
Query: 360 PIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMES-LTMDRVYDYMLHLITEYS 415
+ +DL R I ++D NP AE+I A + F + LT Y LI +YS
Sbjct: 349 EVDRDWSDLSRKIDYLID----NPDVAERIANNAVRTFRDRYLTPAAESCYWRQLIRQYS 404
Query: 416 KLLDYKPAPPSSA 428
D++P S+A
Sbjct: 405 AACDFEPVLFSTA 417
>gi|348566951|ref|XP_003469265.1| PREDICTED: protein O-glucosyltransferase 1-like [Cavia porcellus]
Length = 392
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 154/339 (45%), Gaps = 32/339 (9%)
Query: 100 PQKVQTCPDFFKSIHKDLEPW--AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
P Q C + I +DL P+ SR M ++ K ++R +LY + DC+
Sbjct: 48 PCSSQNCSCYRGIIEEDLTPFRGGISRKMMAEVVSRK--LGTHYQVIRNRLYREN--DCM 103
Query: 158 -QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
SR G + G +PD++++ + D P I K + +P+F + +
Sbjct: 104 FPSRCS----GVEHFILEVIGHLPDMEMVINVRDYPQIPKWMEPA--IPVFSFSKTSEYH 157
Query: 217 DIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW G V L W+ +D+ + W++K AY++G+ S
Sbjct: 158 DIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWEKKNSTAYFRGSRT--S 215
Query: 270 PLR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKK-----SKLSNQCNHRYKIYAEGYA 322
P R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 216 PERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVA 275
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPS 382
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 276 ASFRFKHLFLCGSLVFHVGNEWLEFFYPKLKPWVHYIPVKT-DLS-NVQELLQFVKANDD 333
Query: 383 EAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F + L M+ V Y +L+TEYSK L Y
Sbjct: 334 IAQEIAERGSQFIINHLQMEDVTCYWENLLTEYSKFLSY 372
>gi|195143521|ref|XP_002012746.1| GL23775 [Drosophila persimilis]
gi|194101689|gb|EDW23732.1| GL23775 [Drosophila persimilis]
Length = 419
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/319 (24%), Positives = 134/319 (42%), Gaps = 28/319 (8%)
Query: 112 SIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYY----DCVQSRAMFTIWG 167
+I +DL P+A++ IT R +M R + ++ G++Y P C M
Sbjct: 69 TIRRDLGPYAEAGIT-RSMMAQSRRLGVVYQVIDGRIYRQPEVPHPKRCADVEDM----- 122
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWG 227
L G +P+V+ + + D P + F P+F + + H DI P WSF
Sbjct: 123 ----LLGIAGELPNVEFILNVRDWPQVPFLS--GFTGPVFSHSVSHQHLDIMCPAWSFSS 176
Query: 228 WSEVNLQ-------PWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
S LQ W + + + SW+ K P +++G + +
Sbjct: 177 VSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGTRSSTERDTLVRLSRRS 236
Query: 281 SKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALII 340
L A+ AE + S C ++Y G S L++IL C S+ L +
Sbjct: 237 PDLVDAQYTSNVGAETVDE----VPFSGHCQYKYLFNFGGITASFRLRHILLCKSLVLHV 292
Query: 341 SQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLT 399
Q+++FF L P ++ P PS S++ ++ + + AE+I + G F+ + L
Sbjct: 293 GDQWQEFFYSSLKPWVHYVPAPSNATVESLEQLLVYLRLHDDLAEEIAERGFQFVWQHLR 352
Query: 400 MDRVYDYMLHLITEYSKLL 418
+ V Y +L+ EY+KLL
Sbjct: 353 LQDVQCYWRNLLQEYAKLL 371
>gi|388579121|gb|EIM19449.1| hypothetical protein WALSEDRAFT_34020 [Wallemia sebi CBS 633.66]
Length = 471
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 116/260 (44%), Gaps = 18/260 (6%)
Query: 95 RRVASPQKVQTCPDFFKSIHKDLEPWA-KSRITMRHIMEAKRFAALRILIVRGKLYVDPY 153
++++ Q ++ F + + W+ + IT H+ +A A R++I ++YV Y
Sbjct: 110 KQLSESQCLKEYDGLFYEAERAFDWWSERGGITKEHLEKASDKAHGRVVIHNNRMYVKEY 169
Query: 154 -YDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTN 212
+ V +RAM T + + PG +P+V+ F + D + P+P F
Sbjct: 170 RHGDVNTRAMATFAAIHEAVLTSPGAIPNVEFTFQ-----IQDAGDSYDEPIPTF---VL 221
Query: 213 DAHFDIP----FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVL 268
D D P PD+ FW W E + + E SW EKLP +W+G L
Sbjct: 222 DRTADQPELWLMPDFGFWSWPEPKVGSYVEVRDKAGKWESKHSWSEKLPKVFWRGASLGL 281
Query: 269 SPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
P+R +L+ W +++ NW + + C +Y I+ EG A+S LK
Sbjct: 282 -PIRDQLVDAARGHAW-SDVKIMNWGDIQPGDLL--TMEEHCGFQYLIHVEGVAYSGRLK 337
Query: 329 YILSCNSVALIISQQYKDFF 348
Y+L C+SV+++ ++ F
Sbjct: 338 YLLQCHSVSVMHEMKFIQHF 357
>gi|194744002|ref|XP_001954487.1| GF18287 [Drosophila ananassae]
gi|190627524|gb|EDV43048.1| GF18287 [Drosophila ananassae]
Length = 382
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 143/330 (43%), Gaps = 35/330 (10%)
Query: 112 SIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQL 171
+I KDLEP++ I+ I ++KR L ++R +++ + C ++ +
Sbjct: 47 TITKDLEPYSDKGISQDMISQSKRQGTL-YKVIRRRIFRQEH--CSHPLRCSSV---EDV 100
Query: 172 LRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEV 231
L G +PD++ + + D P + P+F + T H DI P WSFW
Sbjct: 101 LLEIAGDLPDLEFVLNVCDWPQVPFL--SGLSGPVFSHSTTALHLDIMCPAWSFWTVFGP 158
Query: 232 NLQP-------WNEEFKDIKHGSQAKSWKEKLPFAYWKGN---PD-----VLSPLRVELM 276
LQ W+ + I + WK K +++G+ PD +LS L+
Sbjct: 159 KLQQYPHGLGRWDWMRQHIAAAATRIPWKSKKALGFFRGSRSSPDRDNVVILSKRYPNLV 218
Query: 277 KCN----DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + + IL + AEE L++ C +Y G A S L++IL
Sbjct: 219 DAQYTLYVADKFSSHILTSDPAEELP-------LADHCQFKYLFSFRGVAASFRLRHILL 271
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
C S+ L + Q+++FF L P ++ P+ S + ++ + N AE+I + G
Sbjct: 272 CRSLVLHVGDQWQEFFYGQLKPWVHYVPVASNASVEDLAELLRYLRQNDDLAEEIAERGH 331
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYK 421
F+ L M V Y L+ EYSKLL YK
Sbjct: 332 QFVWMHLRMADVLCYWRKLLQEYSKLLMYK 361
>gi|225712798|gb|ACO12245.1| KTEL motif-containing protein 1 precursor [Lepeophtheirus salmonis]
Length = 393
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/330 (25%), Positives = 147/330 (44%), Gaps = 24/330 (7%)
Query: 109 FFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGF 168
F + IH DL + + + + + +I G+LY P +C+ I F
Sbjct: 62 FTQLIHDDLSHFHSIKKSDLDLARETAAHPVTYIISDGELYRSP--ECLFPSRCKGIEHF 119
Query: 169 LQLLRRYPGMVPDVDIMFDCMDKPVIDK---KEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
L +++ +V+ + D P ++K K P P+F + + DI +P W+F
Sbjct: 120 LHRIKK--STTANVEFVVGVHDWPHVNKYTLKSKDPIP-PVFSFSKTSDYLDITYPAWTF 176
Query: 226 -WGWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
G ++L P W++ K I S+ W++K A+++G+ + L+
Sbjct: 177 KEGGPAISLYPKGLGEWDKMRKRIL--SKKVEWEKKETKAFFRGSRTSSERDNLILLSRK 234
Query: 280 DSKLWGAEILR-QNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+L A+ + Q W E KD K+ L N C ++Y G A S K++ C
Sbjct: 235 HPELVDAQYTKNQGWKSE-KDTLGAPPAKEVALENHCKYKYLFNFRGVAASFRFKHLFLC 293
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
S+ + ++ +FF L P ++ P+ S IK ++D+ N N AE I ++G D
Sbjct: 294 ESLVFHVGDEWTEFFYSELKPWVHYVPVSSKASIEEIKELIDFFNDNQEIAEIIAESGHD 353
Query: 394 FME-SLTMDRVYDYMLHLITEYSKLLDYKP 422
F++ LT D+V Y L+ +Y L+ + P
Sbjct: 354 FIKRRLTNDQVQCYWKELLHQYGTLMKFNP 383
>gi|351697696|gb|EHB00615.1| KTEL motif-containing protein 1 [Heterocephalus glaber]
Length = 392
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 153/339 (45%), Gaps = 32/339 (9%)
Query: 100 PQKVQTCPDFFKSIHKDLEPW--AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
P Q C + I +DL P+ SR M ++ K I++ +LY + DC+
Sbjct: 48 PCSSQNCSCYQGVIEEDLTPFRGGISRKMMAEVVSRK--LGTHYQIIKKRLYRED--DCM 103
Query: 158 -QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
SR G + G +PD++++ + D P + + + +P+F + +
Sbjct: 104 FPSRCS----GVEHFILEVIGQLPDMEMVINVRDYPQVPRWMEPA--IPVFSFSKTSEYH 157
Query: 217 DIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
DI +P W+FW G V L W+ +D+ + WK+K AY++G+ S
Sbjct: 158 DIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--S 215
Query: 270 PLR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYA 322
P R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 216 PERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVA 275
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPS 382
S K++ C S+ + + +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 276 ASFRFKHLFLCGSLVFHVGSAWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDD 333
Query: 383 EAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F + L MD V Y +L+TEYSK L Y
Sbjct: 334 IAQEIAERGSQFIINHLQMDDVTCYWENLLTEYSKFLSY 372
>gi|195502713|ref|XP_002098347.1| GE24006 [Drosophila yakuba]
gi|194184448|gb|EDW98059.1| GE24006 [Drosophila yakuba]
Length = 411
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 76/339 (22%), Positives = 149/339 (43%), Gaps = 18/339 (5%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
+S K C + +DL P+ + +T + I + R+ + I +LY D +C+
Sbjct: 65 SSDPKDSDCSCHADVLKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDA--NCM 121
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
I FL L +PD+D++ + D P ++ + P+F + + D
Sbjct: 122 FPARCEGIEHFLLPL---VATLPDMDLVINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRD 178
Query: 218 IPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
I +P W+FW G L P W++ + ++ + A W +K +++G+
Sbjct: 179 IMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRNLGFFRGSRTSDERD 238
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVS 326
+ L+ + +L A+ + + KD + + C ++Y G A S
Sbjct: 239 SLILLSRRNPELVEAQYTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFR 298
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
LK++ C S+ + ++++FF L P ++ P+ S + + ++ + + N + A++
Sbjct: 299 LKHLFLCKSLVFHVGDEWQEFFYDQLKPWVHYVPLKSYPSQQEYEDILSFFSKNDALAQE 358
Query: 387 IGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAP 424
I + G DF+ E L M + Y L+ Y KLL Y+ P
Sbjct: 359 IAQRGYDFIWEHLRMKDIKCYWRKLLKRYVKLLKYEVKP 397
>gi|21355689|ref|NP_651095.1| rumi, isoform A [Drosophila melanogaster]
gi|74866179|sp|Q8T045.1|RUMI_DROME RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
gi|17862464|gb|AAL39709.1| LD29477p [Drosophila melanogaster]
gi|23172000|gb|AAN13920.1| rumi, isoform A [Drosophila melanogaster]
gi|220945854|gb|ACL85470.1| rumi-PA [synthetic construct]
gi|220955678|gb|ACL90382.1| rumi-PA [synthetic construct]
Length = 411
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 144/324 (44%), Gaps = 18/324 (5%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
+ +DL P+ + +T + I + R+ + I +LY D +C+ I FL L
Sbjct: 80 LKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDA--NCMFPARCEGIEHFLLPL 136
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+PD+D++ + D P ++ + P+F + + DI +P W+FW G
Sbjct: 137 ---VATLPDMDLIINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPAT 193
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
L P W++ + ++ + A W +K +++G+ + L+ + +L A
Sbjct: 194 KLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGSRTSDERDSLILLSRRNPELVEA 253
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + KD + + C ++Y G A S LK++ C S+ +
Sbjct: 254 QYTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCKSLVFHVG 313
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++++FF L P ++ P+ S + + ++ + N + A++I + G DF+ E L M
Sbjct: 314 DEWQEFFYDQLKPWVHYVPLKSYPSQQEYEHILSFFKKNDALAQEIAQRGYDFIWEHLRM 373
Query: 401 DRVYDYMLHLITEYSKLLDYKPAP 424
+ Y L+ Y KLL Y+ P
Sbjct: 374 KDIKCYWRKLLKRYVKLLQYEVKP 397
>gi|326913110|ref|XP_003202884.1| PREDICTED: protein O-glucosyltransferase 1-like [Meleagris
gallopavo]
Length = 393
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 142/294 (48%), Gaps = 26/294 (8%)
Query: 143 IVRGKLYVDPYYDCVQSRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHG 200
I++ KLY + +DC MF G + +PD++++ + D P + K
Sbjct: 92 IIKNKLYRE--HDC-----MFPARCSGVEHFIHEIINRLPDMEMVINVRDYPQVPKWMKP 144
Query: 201 SFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWK 253
+P+F + + DI +P W+FW G V L W+ +D++ ++ WK
Sbjct: 145 I--IPVFSFSKTAEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEKWPWK 202
Query: 254 EKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILR-QNWAEE----AKDGFKKSKLSN 308
+K+ Y++G+ + L+ + +L AE + Q W E K K+ L +
Sbjct: 203 KKISKGYFRGSRTSSERDPLILLSRENPELVDAEYTKNQAWKSEKDTLGKPPAKEIPLVD 262
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
C ++Y G A S LK++ C S+ + +++ +FF + L P ++ P+ S DL
Sbjct: 263 HCKYKYLFNFRGVAASFRLKHLFLCGSLVFHVGEEWLEFFYQQLKPWVHYIPVKS-DLS- 320
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYK 421
++ ++ + N + A++I + G+ F+ E L M+ V Y HL++EYS+ L YK
Sbjct: 321 DVRELLQFAKENDNIAQEIAERGRRFITEHLQMEDVSCYWEHLLSEYSQTLTYK 374
>gi|195996689|ref|XP_002108213.1| hypothetical protein TRIADDRAFT_18924 [Trichoplax adhaerens]
gi|190588989|gb|EDV29011.1| hypothetical protein TRIADDRAFT_18924, partial [Trichoplax
adhaerens]
Length = 366
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 146/337 (43%), Gaps = 21/337 (6%)
Query: 102 KVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLY-VDPYYDCVQSR 160
K C + K I DL+PW + + I + +G Y + + R
Sbjct: 25 KNDRCGCYKKVIDNDLKPWRLKKGITKEIFDKAANQGSHRGAEKGSHYQIINHKVYRHER 84
Query: 161 AMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
F G L++ +P+++++ + D P + K + LP+F + DI
Sbjct: 85 CTFPARCKGIEHFLKKIAKKLPNLELIINTHDWPKVPKWDE---LLPVFSFSKTHNENDI 141
Query: 219 PFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+P WSFW G V L W+ K ++ S W +K A+++G+
Sbjct: 142 MYPAWSFWEGGPAVWPIFPNGLGRWDVLRKSLQKASDKWPWDKKKSIAFFRGSRTSAERD 201
Query: 272 RVELMKCNDSKLWGAEILR-QNWAEEA----KDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
+ L+ KL A + Q W +A ++ K+ L + C ++Y G A S
Sbjct: 202 PLILLSRAKPKLVNASYTKNQAWRSKADTLGEEPAKEVTLEDHCKYKYLFNFRGVAASFR 261
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
+++ CNSV L I ++++FF L P ++ P+ R+ + ++ + N E ++
Sbjct: 262 FRHLFLCNSVVLHIGHEWQEFFYPALTPWVHYIPVDPDQ--RNTEEIIRFAIENDEEMKR 319
Query: 387 IGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYKP 422
+ K G+DF + L M V Y L+ +Y+KLL +KP
Sbjct: 320 LAKRGRDFILNHLRMKDVECYWELLLKQYAKLLKWKP 356
>gi|195331147|ref|XP_002032264.1| GM23615 [Drosophila sechellia]
gi|194121207|gb|EDW43250.1| GM23615 [Drosophila sechellia]
Length = 411
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 144/324 (44%), Gaps = 18/324 (5%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
+ +DL P+ + +T + I + R+ + I +LY D +C+ I FL L
Sbjct: 80 LKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDA--NCMFPARCEGIEHFLLPL 136
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+PD+D++ + D P ++ + P+F + + DI +P W+FW G
Sbjct: 137 ---VATLPDMDLVINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPAT 193
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
L P W++ + ++ + A W +K +++G+ + L+ + +L A
Sbjct: 194 KLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGSRTSDERDSLILLSRRNPELVEA 253
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + KD + + C ++Y G A S LK++ C S+ +
Sbjct: 254 QYTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCKSLVFHVG 313
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++++FF L P ++ P+ S + + ++ + N + A++I + G DF+ E L M
Sbjct: 314 DEWQEFFYDQLKPWVHYVPLKSYPSQQEYEHILSFFKKNDALAQEIAQRGYDFIWEHLRM 373
Query: 401 DRVYDYMLHLITEYSKLLDYKPAP 424
+ Y L+ Y KLL Y+ P
Sbjct: 374 KDIKCYWRKLLKRYVKLLQYEVKP 397
>gi|194910742|ref|XP_001982221.1| GG12485 [Drosophila erecta]
gi|190656859|gb|EDV54091.1| GG12485 [Drosophila erecta]
Length = 411
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/339 (22%), Positives = 148/339 (43%), Gaps = 18/339 (5%)
Query: 98 ASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV 157
+S K C + +DL P+ + +T + I + R+ + I +LY D +C+
Sbjct: 65 SSDPKDSDCSCHAAVLKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDA--NCM 121
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
I FL L +PD+D++ + D P ++ + P+F + + D
Sbjct: 122 FPARCEGIEHFLLPL---VTTLPDMDLVINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRD 178
Query: 218 IPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
I +P W+FW G L P W++ + ++ + A W +K +++G+
Sbjct: 179 IMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRNLGFFRGSRTSDERD 238
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVS 326
+ L+ + +L A+ + + KD + + C ++Y G A S
Sbjct: 239 SLILLSRRNPELVEAQYTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFR 298
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
LK++ C S+ + ++++FF L P ++ P+ S + + ++ + N + A++
Sbjct: 299 LKHLFLCKSLVFHVGDEWQEFFYDQLKPWVHYVPLKSYPSQQEYEDILSFFRKNDALAQE 358
Query: 387 IGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAP 424
I + G DF+ E L M + Y L+ Y KLL Y+ P
Sbjct: 359 IAQRGYDFIWEHLRMKDIKCYWRKLLKRYVKLLKYEVKP 397
>gi|354494153|ref|XP_003509203.1| PREDICTED: protein O-glucosyltransferase 1 [Cricetulus griseus]
Length = 401
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 174/387 (44%), Gaps = 55/387 (14%)
Query: 68 SRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKV-----------------QTCPDFF 110
S+RSQ I+ CT LS RRR S KV + C +
Sbjct: 16 SQRSQL-AILWCTCAVGLS-------RRRGGSKWKVFIDQINRALENYEPCSSRNCSCYR 67
Query: 111 KSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGF 168
I +DL P+ + I+ + + E +R I++ +L+ + DC+ SR
Sbjct: 68 GVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKKRLFRED--DCMFPSRCSGVEHFI 124
Query: 169 LQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-G 227
L+++RR +PD++++ + D P + K + +P+F + + DI +P W+FW G
Sbjct: 125 LEVIRR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHDIMYPAWTFWEG 178
Query: 228 WSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--VELMKCN 279
V L W+ +D+ + W++K AY++G+ SP R + L+
Sbjct: 179 GPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRT--SPERDPLILLSRK 236
Query: 280 DSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+ KL AE + + KD K L + C ++Y G A S K++ C
Sbjct: 237 NPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYKYLFNFRGVAASFRFKHLFLCG 296
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF 394
S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A++I K G F
Sbjct: 297 SLVFHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDIAQEIAKRGSQF 354
Query: 395 -MESLTMDRVYDYMLHLITEYSKLLDY 420
+ L M+ + Y L+TEYSK L Y
Sbjct: 355 IINHLHMNDITCYWESLLTEYSKFLSY 381
>gi|449283857|gb|EMC90451.1| KTEL motif-containing protein 1, partial [Columba livia]
Length = 364
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 151/310 (48%), Gaps = 28/310 (9%)
Query: 143 IVRGKLYVDPYYDCV-QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS 201
IV+ KLY + DC+ +R L+++ R +PD++++ + D P + K
Sbjct: 63 IVKNKLYREQ--DCLFPARCSGVEHFLLEIISR----LPDMEMVINVRDYPQVPKWMKPV 116
Query: 202 FPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKE 254
+P+F + + DI +P W+FW G V L W+ +D++ ++ W +
Sbjct: 117 --IPIFSFSKTPEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEKWPWMK 174
Query: 255 KLPFAYWKGNPDVLSPLR--VELMKCNDSKLWGAEILR-QNWAEE----AKDGFKKSKLS 307
K+ Y++G+ SP R + L+ + +L AE + Q W E K K+ L
Sbjct: 175 KISKGYFRGSRT--SPERDPLILLSRENPELVDAEYTKNQAWKSEKDTLGKPPAKEIPLV 232
Query: 308 NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLC 367
+ C ++Y G A S LK++ C S+ + +++ +FF L P ++ P+ S DL
Sbjct: 233 DHCKYKYLFNFRGVAASFRLKHLFLCGSLVFHVGEEWLEFFYPQLKPWVHYIPVRS-DLS 291
Query: 368 RSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPS 426
+++ ++ + N + A++I + G+ F+ E L M+ V Y HL++EYS+ L YK
Sbjct: 292 -NVRELLQFVKENDAIAQEISERGRQFITEHLQMEDVSCYWEHLLSEYSQTLTYKVKRRK 350
Query: 427 SAFEACVESL 436
S E E L
Sbjct: 351 SYSEITSEWL 360
>gi|321467420|gb|EFX78410.1| hypothetical protein DAPPUDRAFT_305151 [Daphnia pulex]
Length = 415
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/336 (22%), Positives = 146/336 (43%), Gaps = 22/336 (6%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-Q 158
P + C F + +DL+ + IT + +++ + + I+ KLY + + C+
Sbjct: 80 PCLFKNCSCFTNLVIEDLKAFKDVGIT-KSMLDKAKDRGTKYQIIGQKLYRESF--CLFP 136
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
SR G + + +PD++++ + D P + + H LP+ + + DI
Sbjct: 137 SRCA----GIEHFILKVIKDLPDMELIINNRDWPQVSR--HFGEVLPILSFSKTKEYLDI 190
Query: 219 PFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+P W+FW G ++L P W++ I + W EK A+++G+
Sbjct: 191 TYPAWTFWEGGPAISLYPRGLGRWDQHRISIDKVAATYPWNEKQSKAFFRGSRTSSERDP 250
Query: 273 VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSL 327
+ L+ + L A+ + + D ++ L + C+++Y G A S
Sbjct: 251 LILLSRGNLNLVDAQYTKNQAWKSDSDTLGAPPAEEVSLESHCSYKYLFNYRGVAASFRF 310
Query: 328 KYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKI 387
K++ C S+ + ++ +FF L P ++ P+ SA R + ++ + N S KI
Sbjct: 311 KHLFLCKSLVFHVGDEWIEFFYPALKPWVHYIPVSSAATQRDLARLIRFAKENDSLVSKI 370
Query: 388 GKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
G + L + V Y L+TEY+KLL +KP
Sbjct: 371 ATRGHQLVWNHLKLSDVECYWKFLLTEYAKLLRFKP 406
>gi|194744000|ref|XP_001954486.1| GF18286 [Drosophila ananassae]
gi|190627523|gb|EDV43047.1| GF18286 [Drosophila ananassae]
Length = 411
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/326 (22%), Positives = 139/326 (42%), Gaps = 22/326 (6%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQ 170
I DL P+ + +T + I +A + + I +LY D + MF G
Sbjct: 80 IKHDLAPYKATGVTRQMIEKAGEYGT-KYKIFNNRLYRD-------ANCMFPSRCQGIEH 131
Query: 171 LLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWS 229
L +P++D++ + D P ++ S P+F + + DI +P W+FW G
Sbjct: 132 FLLPLTASLPNMDLVINTRDYPQLNTAWGSSGRGPIFSFSKTKEYMDIMYPAWTFWAGGP 191
Query: 230 EVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
L P W++ + ++ S A W +K +++G+ + L+ +
Sbjct: 192 ATKLHPRGIGRWDQMREKLEKRSAAIPWSQKRELGFFRGSRTSDERDTLILLSRRSPDIV 251
Query: 285 GAEILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
A+ + + KD + + C ++Y G A S LK++ C S+
Sbjct: 252 EAQYTKNQGWKSPKDTLNAPPADEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCKSLVFH 311
Query: 340 ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESL 398
+ ++++FF L P ++ P+ S + + ++++ N A++I + G DF+ + L
Sbjct: 312 VGDEWQEFFYDQLKPWVHYVPLKSYPSQQDYEQLLEFFRRNDDLAKEIAQRGYDFIWQHL 371
Query: 399 TMDRVYDYMLHLITEYSKLLDYKPAP 424
M V Y L+ Y KLL Y+ P
Sbjct: 372 RMKDVKCYWRKLLKGYVKLLKYEVLP 397
>gi|425781966|gb|EKV19900.1| hypothetical protein PDIG_00670 [Penicillium digitatum PHI26]
gi|425784005|gb|EKV21816.1| hypothetical protein PDIP_02930 [Penicillium digitatum Pd1]
Length = 436
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 141/309 (45%), Gaps = 33/309 (10%)
Query: 137 AALRILIVRGKLYVDPY--YDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKP 192
A +R +I RG+LY+ Y SRA T+ + L +P ++P ++ +F D
Sbjct: 113 AMVRGIIDRGELYIVDYAPMPMTASRARATLSSLHRALTAFPDRHLLPSIEFIFTTEDF- 171
Query: 193 VIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNE---EFKDIKHGSQ 248
E + P P++ Y D+ + PD+ +W W EV++ P++E I G
Sbjct: 172 ----AEDTTAPSPIWAYSKRDSDTSVWLMPDFGYWAWPEVHIGPYHEVRRRIAAIDDGET 227
Query: 249 AK--------SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDG 300
A ++EK W+G+ P+R +L+K + W A + +W ++
Sbjct: 228 AADGTFMPGLQFQEKKKQLVWRGSLATNPPVRSKLLKSALGRSW-ASVRVIDWDDQDDIR 286
Query: 301 FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKN 357
F + + C + + + EG ++S KY+L+C SV + + ++++ L+ P N
Sbjct: 287 FNLLPIEDHCRYMFLAHTEGRSFSGRGKYLLNCRSVVISHALEWREAHHAALVSTGPDAN 346
Query: 358 HFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMES-LTMDRVYDYMLHLITE 413
+ + +DL R I ++D NP AE+I A + F + LT Y L+ +
Sbjct: 347 YIEVDRDWSDLSRKIDYLID----NPEIAERIANNAVRTFRDRYLTPAAESCYWRQLVRQ 402
Query: 414 YSKLLDYKP 422
YS D++P
Sbjct: 403 YSAACDFEP 411
>gi|432849653|ref|XP_004066608.1| PREDICTED: protein O-glucosyltransferase 1-like [Oryzias latipes]
Length = 387
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 149/333 (44%), Gaps = 24/333 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAMF 163
C + +DL+P+ +R I++ KLY + +C+ +R
Sbjct: 48 NCSCHLSVLQQDLQPFRGGISESLMASTVQRGMGTHYQIIQHKLYREQ--NCMFPARCSG 105
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
L+++ R +PD++++ + D P + S LP+ + + DI +P W
Sbjct: 106 VEHFILEVIDR----LPDLEMVVNVRDYPQVPN--WMSPALPVLSFSKTAEYQDIMYPAW 159
Query: 224 SFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
+FW G V L W+ D+K + WK+K P +++G+ + L+
Sbjct: 160 TFWEGGPAVWPIYPTGLGRWDLMRTDLKKSAAQWPWKKKEPKGFFRGSRTSSERDPLILL 219
Query: 277 KCNDSKLWGAEILRQNWAEEAKDGFKKSK-----LSNQCNHRYKIYAEGYAWSVSLKYIL 331
D +L AE + + KD + L + C ++Y G A S LK++
Sbjct: 220 SREDPELVDAEYTKNQAWKSEKDTLGRPPAAEIPLLDHCKYKYLFNFRGVAASFRLKHLF 279
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
C S+ + +++++FF L P ++ P+ DL ++ ++ + N A++I + G
Sbjct: 280 LCGSLVFHVGEEWQEFFYPQLKPWVHYIPV-KQDLS-DVRGLLQFVKENDDVAQEIAERG 337
Query: 392 QDF-MESLTMDRVYDYMLHLITEYSKLLDYKPA 423
Q+F ++ L M+ V Y L+T +S LL Y+P+
Sbjct: 338 QEFILQHLRMEDVSCYWEQLLTNFSHLLTYRPS 370
>gi|125773715|ref|XP_001358116.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
gi|121991819|sp|Q29AU6.1|RUMI_DROPS RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
gi|54637851|gb|EAL27253.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 72/324 (22%), Positives = 141/324 (43%), Gaps = 18/324 (5%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
I DL P+ + ++ + I + R+ R I +LY + +C+ I FL L
Sbjct: 78 IKSDLAPYKATGVSRQMIESSARYGT-RYKIYEKRLYREE--NCMFPARCQGIEHFLLPL 134
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+PD+D++ + D P I+ P+ + H DI +P W+FW G
Sbjct: 135 ---VATLPDMDLVINTRDYPQINMAWGNGAQGPILSFSKTKDHRDIMYPAWTFWAGGPAT 191
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
L P W+ + ++ + A W +K +++G+ + L+ + +L A
Sbjct: 192 KLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGSRTSDERDSLILLSRRNPELVEA 251
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + KD + + C ++Y G A S LK++ C S+ +
Sbjct: 252 QYTKNQGWKSPKDTLDAPPAGEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCQSLVFHVG 311
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++++FF L P ++ P+ + + + ++ + N + A++I + G+DF+ + L M
Sbjct: 312 DEWQEFFYDQLKPWVHYVPLKNYPSQQEYEELLTFFRKNDALAQEIAQRGRDFIWQHLRM 371
Query: 401 DRVYDYMLHLITEYSKLLDYKPAP 424
+ Y L+ Y KLL Y+ P
Sbjct: 372 KDIKCYWRRLLKSYVKLLTYEVQP 395
>gi|195143519|ref|XP_002012745.1| GL23774 [Drosophila persimilis]
gi|194101688|gb|EDW23731.1| GL23774 [Drosophila persimilis]
Length = 409
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 72/324 (22%), Positives = 141/324 (43%), Gaps = 18/324 (5%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
I DL P+ + ++ + I + R+ R I +LY + +C+ I FL L
Sbjct: 78 IKSDLAPYKATGVSRQMIESSARYGT-RYKIYEKRLYREE--NCMFPARCQGIEHFLLPL 134
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+PD+D++ + D P I+ P+ + H DI +P W+FW G
Sbjct: 135 ---VATLPDMDLVINTRDYPQINMAWGNGAQGPILSFSKTKDHRDIMYPAWTFWAGGPAT 191
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
L P W+ + ++ + A W +K +++G+ + L+ + +L A
Sbjct: 192 KLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGSRTSDERDSLILLSRRNPELVEA 251
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + KD + + C ++Y G A S LK++ C S+ +
Sbjct: 252 QYTKNQGWKSPKDTLDAPPAGEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCQSLVFHVG 311
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++++FF L P ++ P+ + + + ++ + N + A++I + G+DF+ + L M
Sbjct: 312 DEWQEFFYDQLKPWVHYVPLKNYPSQQEYEELLTFFRKNDALAQEIAQRGRDFIWQHLRM 371
Query: 401 DRVYDYMLHLITEYSKLLDYKPAP 424
+ Y L+ Y KLL Y+ P
Sbjct: 372 KDIKCYWRRLLKSYVKLLTYEVQP 395
>gi|242025222|ref|XP_002433025.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212518534|gb|EEB20287.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 407
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 99/431 (22%), Positives = 177/431 (41%), Gaps = 44/431 (10%)
Query: 14 YLFPCVISLSVISLAALFLDYKVDDFASKTK-TLAGHNLEPTPWHLFPQRTFKEESRRSQ 72
+ F V + VI L LF+ ++F SKT T +G F + K + +
Sbjct: 5 FQFNFVCAFIVIVLFNLFIFSCSNEFCSKTDGTCSGE---------FDENLGKNKYQDDF 55
Query: 73 AYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIME 132
A KI+ S +N E P C FFK++ DL+P+ K+ IT R +++
Sbjct: 56 ASKII--------SKIN-FAESNYKNCP--YSNCSCFFKTLKNDLKPF-KNGIT-REMID 102
Query: 133 AKRFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDK 191
+ R +I +LY D DC+ SR G +++ + + +++ + D
Sbjct: 103 SIRSRGTTYIIYEKRLYRDK--DCLFPSRCS----GIEYFIKKIISHLKNTELIINTRDW 156
Query: 192 PVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKH 245
P I + H P+F + + DI +P WSFW G + P W+ K +
Sbjct: 157 PQISR--HFKLFGPVFSFSKTQDYLDIMYPAWSFWEGGPAIKTYPTGLGRWDLHRKKLSE 214
Query: 246 GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS- 304
S W +K +++G+ + L+ L A+ + + KD K
Sbjct: 215 ESSKWPWNKKKSIGFFRGSRTSEKRDVLVLLSRKRPDLIDAQYTKNQAWKSLKDTLGKEP 274
Query: 305 ----KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFP 360
L + C ++Y G A S K++ C S+ + + +++ +FF + P ++ P
Sbjct: 275 AEVVSLEHHCQYKYLFNFRGVAASFRFKHLFLCGSLVIHVGEEWLEFFYPAMKPWIHYVP 334
Query: 361 IPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLD 419
+ + I ++ + + +I + GQ F M L + + Y +L+ +Y KLL
Sbjct: 335 LDENASEKDITDLIYFLKEHDDLVREIAEQGQKFIMNHLRFEEIKCYWRNLLRKYQKLLT 394
Query: 420 YKPAPPSSAFE 430
Y S E
Sbjct: 395 YNVTLDESLIE 405
>gi|334329589|ref|XP_001370099.2| PREDICTED: protein O-glucosyltransferase 1 [Monodelphis domestica]
Length = 383
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 151/333 (45%), Gaps = 28/333 (8%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAM 162
+ C + KDL P+ R I++ +LY + +DC+ +R
Sbjct: 43 ENCSCHLGVMEKDLAPFQGGISKEVMTTLVSRKLGTHYQIIKNELYRE--HDCMFPARCS 100
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
LQ++ +PD++++ + D P + K + +P+F + + + DI +P
Sbjct: 101 GVEHFILQVINH----LPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTNEYHDIMYPA 154
Query: 223 WSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--V 273
W+FW G V L W+ +D+ ++ W+ K+ Y++G+ SP R +
Sbjct: 155 WTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWERKISRGYFRGSRT--SPERDPL 212
Query: 274 ELMKCNDSKLWGAEILR-QNWAEE----AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+ + L AE + Q W E K K+ L + C ++Y G A S K
Sbjct: 213 ILLSRENPGLVDAEYTKNQAWKSEKDTLGKPPAKEVPLVDHCKYKYLFNFRGVAASFRFK 272
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
++ C S+ + +++ +FF L P ++ P+ + DL +++ ++ + N A++I
Sbjct: 273 HLFLCGSLVFHVGEEWHEFFYEQLKPWVHYIPV-NTDLS-NVRELLQFVKENDDLAQEIA 330
Query: 389 KAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
+ G+ F + L M+ + Y L+TEYSK L Y
Sbjct: 331 ERGRQFIINHLQMEDISCYWKILLTEYSKALSY 363
>gi|307104334|gb|EFN52588.1| hypothetical protein CHLNCDRAFT_138604 [Chlorella variabilis]
Length = 452
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 116/258 (44%), Gaps = 14/258 (5%)
Query: 109 FFKSIHKDLEPWAKSRITMRHIMEA---KRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
F + +DLEP+ S I +RH +A A++R+ +V G +Y+ +SR M I
Sbjct: 42 LFSQVARDLEPFNHSGIGLRHAEQAYCQGSKASMRVQVVNGSVYIVGESPSYESR-MLGI 100
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVI-----DKKEHGSFPLPLFRYCTNDAHFDIPF 220
L L G+ +D + + D P + D E G P + +N +H +
Sbjct: 101 KRQLLHLWLAGGLPESIDFVVEQEDHPTVRHRSDDCPERGPILAP-AKCPSNKSHSHVLL 159
Query: 221 -PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
PD +F GW E PW E ++H ++ W ++ +++G L + +
Sbjct: 160 APDHTFAGWPEARTLPWAEMLPLLQHSAERHPWADRSALLFFRGAATGDRNLTDSDLSLS 219
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
+L +++ NW + S L++ C HR ++ G +++ LKY+L+C S ++
Sbjct: 220 YPELLDVQLV--NWTSAEERPLFVS-LADHCRHRALLHLPGNSYAARLKYLLACGSAVVM 276
Query: 340 ISQQYKDFFSRGLIPTKN 357
+++F+ L P N
Sbjct: 277 PDSPWQEFWYHLLHPPHN 294
>gi|195054040|ref|XP_001993934.1| GH18344 [Drosophila grimshawi]
gi|193895804|gb|EDV94670.1| GH18344 [Drosophila grimshawi]
Length = 408
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 143/335 (42%), Gaps = 18/335 (5%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
C + I +DL P+ + + + +A ++ R I KL+ + +C
Sbjct: 68 NCTCYAAGIKRDLAPYKSIGFSRQMLEDAAKYGT-RYKIYGQKLFREE--NCFFPARCQG 124
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWS 224
I FL L P +P++D++ + D P + S P+F + + DI +P W+
Sbjct: 125 IEHFL--LELLP-QLPNMDLVINTRDYPQLHSSWSSSRIGPVFSFSKTSEYRDIMYPAWT 181
Query: 225 FW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
FW G L P W+ +K + W+EK +++G+ + L+
Sbjct: 182 FWAGGPATKLHPTGIGRWDLMSGKLKEVTTKIPWQEKEQLGFFRGSRTSDERDSLILLSR 241
Query: 279 NDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+L A+ + + KD ++ N C ++Y G A S LK++ C
Sbjct: 242 QQPQLVEAQYTKNQAWKSPKDTLDAPPAEEVSFENHCKYKYLFNFRGVAASFRLKHLFLC 301
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
NS+ + + +++++FF L P ++ P+ S ++D+ + A++I + G D
Sbjct: 302 NSLVIHVGEEWQEFFYHQLKPWVHYVPLHSYPSQAEYVQLLDYFKNHDVLAQQIAQRGHD 361
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSS 427
F+ + L + Y L+ Y+KL Y+ P S
Sbjct: 362 FIGQHLRFQDIKCYWRKLLKRYAKLFKYEIKPDPS 396
>gi|390342113|ref|XP_779911.3| PREDICTED: protein O-glucosyltransferase 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 391
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 144/327 (44%), Gaps = 30/327 (9%)
Query: 113 IHKDLEPWA-KSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFL 169
+ +DL W + IT +++A+ L ++ +LY + + MF G
Sbjct: 62 MDEDLSVWEERGGITRADVVKAESRGTL-YQVINHRLYR-------EEKCMFPARCNGVE 113
Query: 170 QLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC-TNDAHFDIPFPDWSFW-G 227
+ R +PD++ + + D P K + P+P+ + H+DI +P W+FW G
Sbjct: 114 HFILRIIKKLPDMEFVMNVRDWPQSGKY---TDPIPVLSFSKVQSQHYDIMYPAWTFWEG 170
Query: 228 WSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
V L W+ + I SQ W K A+++G+ + L+ +D
Sbjct: 171 GPAVWPLFPTGLGRWDLFRESIDKESQKLPWDTKEDKAFFRGSRTTAERDPLVLLSRDDP 230
Query: 282 KLWGAEILR-QNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
L A + Q W +A K+ L + C +RY G A S LK++ C S+
Sbjct: 231 DLVDASYTKNQAWKSDADTLHMPPAKEMTLEDHCKYRYLFNFRGVAASFRLKHLFLCRSL 290
Query: 337 ALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
+ ++ +FF L P ++ P+ DL + + ++++ AN A+++ G+DF+
Sbjct: 291 VFHVGDEWLEFFYPALKPWVHYIPV-KQDLSDA-RELIEFAKANQEVAQQVADRGRDFIW 348
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKP 422
L MD V Y L+ Y+KL YKP
Sbjct: 349 NHLRMDDVQCYWKDLLKRYAKLQKYKP 375
>gi|195399534|ref|XP_002058374.1| GJ14378 [Drosophila virilis]
gi|194141934|gb|EDW58342.1| GJ14378 [Drosophila virilis]
Length = 410
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/375 (22%), Positives = 157/375 (41%), Gaps = 37/375 (9%)
Query: 66 EESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRI 125
+ S+ +Q+ + +Y C + +N + C + I +DL + + +
Sbjct: 47 KHSKITQSIQNALASYEPCTTDVNDV-------------NCTCYAAGIKRDLALYKSTGV 93
Query: 126 TMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGFLQLLRRYPGMVPDVDI 184
T + I +A ++ R I +LY D +C+ +R LQLL +P++D+
Sbjct: 94 TRKMINDAAKYGT-RYKIYNKQLYRDD--NCMFPARCQGIEHFLLQLL----AELPNMDL 146
Query: 185 MFDCMDKPVIDKK-EHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WN 237
+ + D P + H P+F + + DI +P W+FW G L P W+
Sbjct: 147 VINTRDYPQLHSAWRHDG---PVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPTGIGRWD 203
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
++ S + SW +K +++G+ + L+ L A+ + +
Sbjct: 204 LMRAKLEKRSSSLSWHDKQELGFFRGSRTSDERDSLILLSRRKPHLVEAQYTKNQAWKSP 263
Query: 298 KDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL 352
KD + + C ++Y G A S LK++ C S+ I ++++FF L
Sbjct: 264 KDTLDAPPASEVSFEDHCKYKYLFNFRGVAASFRLKHLFMCKSLVFHIGDEWQEFFYHQL 323
Query: 353 IPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLI 411
P ++ P+ S + ++ + N S A++I + G DF+ + L M V Y L+
Sbjct: 324 KPWVHYVPLHSYPSQEEYEELLTYFKNNDSLAQEIAQRGYDFISQHLRMQDVKCYWRKLL 383
Query: 412 TEYSKLLDYKPAPPS 426
Y KL+ Y P +
Sbjct: 384 KRYKKLIKYDIEPDA 398
>gi|224170404|ref|XP_002339379.1| predicted protein [Populus trichocarpa]
gi|222874989|gb|EEF12120.1| predicted protein [Populus trichocarpa]
Length = 71
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/55 (63%), Positives = 46/55 (83%)
Query: 259 AYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
AYWKGNPDV SP+R L++CN +K WGA+I+RQ+W EEAK G+ SKLS+QC++R
Sbjct: 5 AYWKGNPDVGSPIRTSLLECNHTKKWGAQIMRQDWEEEAKGGYVSSKLSHQCDYR 59
>gi|26337215|dbj|BAC32292.1| unnamed protein product [Mus musculus]
gi|74198417|dbj|BAE39692.1| unnamed protein product [Mus musculus]
Length = 317
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 142/304 (46%), Gaps = 28/304 (9%)
Query: 133 AKRFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDK 191
+R I++ +L+ + DC+ SR L+++ R +PD++++ + D
Sbjct: 6 VRRKLGTHYQIIKNRLFRED--DCMFPSRCSGVEHFILEVIHR----LPDMEMVINVRDY 59
Query: 192 PVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV------NLQPWNEEFKDIK 244
P + K + +P+F + + DI +P W+FW G V L W+ +D+
Sbjct: 60 PQVPKWMEPT--IPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLL 117
Query: 245 HGSQAKSWKEKLPFAYWKGNPDVLSPLR--VELMKCNDSKLWGAEILRQNWAEEAKDGF- 301
+ W++K AY++G+ SP R + L+ + KL AE + + KD
Sbjct: 118 RSAAQWPWEKKNSTAYFRGSRT--SPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLG 175
Query: 302 ----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKN 357
K L + C +RY G A S K++ C S+ + ++ +FF L P +
Sbjct: 176 KPAAKDVHLIDHCKYRYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVH 235
Query: 358 HFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSK 416
+ P+ + DL +++ ++ + AN A++I K G F + L MD + Y +L+T+YSK
Sbjct: 236 YIPVKT-DLS-NVQELLQFVKANDDIAQEIAKRGSQFIINHLQMDDITCYWENLLTDYSK 293
Query: 417 LLDY 420
L Y
Sbjct: 294 FLSY 297
>gi|195113713|ref|XP_002001412.1| GI10779 [Drosophila mojavensis]
gi|193918006|gb|EDW16873.1| GI10779 [Drosophila mojavensis]
Length = 406
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/337 (21%), Positives = 146/337 (43%), Gaps = 24/337 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAMF 163
C + I +DL P+ + T + I +A ++ R I +L+ + +C+ +R
Sbjct: 69 NCTCYAAGIKRDLAPYKSTGFTRKMIEDAAKYGT-RYKIFGKQLFRED--NCMFPARCQG 125
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK-EHGSFPLPLFRYCTNDAHFDIPFPD 222
LQLL M D++ + D P + +H P+F + + DI +P
Sbjct: 126 IEHFLLQLLPELKNM----DLVINTRDYPQLHSSWQHKG---PVFSFSKTTEYLDIMYPA 178
Query: 223 WSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
W+FW G L P W+ + +K ++A W EK +++G+ + L+
Sbjct: 179 WTFWAGGPATKLHPTGIGRWDLMREKLKKAAKAIPWGEKKEIGFFRGSRTSDERDSLILL 238
Query: 277 KCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
+L A+ + + KD + + C ++Y G A S LK++
Sbjct: 239 SRRKPQLVEAQYTKNQAWKSPKDTLDAPPANEVSFEDHCKYKYLFNFRGVAASFRLKHLF 298
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
C S+ + ++++FF L P ++ P+ S + ++++ ++ A++I + G
Sbjct: 299 LCESLVFHVGDEWQEFFYYQLQPWVHYVPLRSYPSQEEYEDILNYFKSHDELAQQIAQRG 358
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSS 427
+F+ + L M V Y L+ Y KL+ Y+ P ++
Sbjct: 359 HEFIVQHLRMQDVQCYWRKLLKRYGKLMKYEVKPDNT 395
>gi|442620543|ref|NP_001262848.1| CG31139, isoform C [Drosophila melanogaster]
gi|440217766|gb|AGB96228.1| CG31139, isoform C [Drosophila melanogaster]
Length = 312
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 126/289 (43%), Gaps = 22/289 (7%)
Query: 143 IVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF 202
I+RG++Y C+ + + LL V D++ + + D P +
Sbjct: 14 IIRGRIYRQQ--KCLHPKRCADV---EDLLLDMASGVADLEFVLNVRDWPQVHFL--SGL 66
Query: 203 PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQP-------WNEEFKDIKHGSQAKSWKEK 255
P+F Y + H DI +P WSFW + LQ W+ K + + W K
Sbjct: 67 SGPVFSYSITNRHLDIMYPAWSFWTTTGPILQHYPHGVGRWDWMRKHLVARASELPWSAK 126
Query: 256 LPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDG--FKKSKLSNQCNHR 313
+++G+ SP R L++ + + +++ + A D +K L C +
Sbjct: 127 RAIGFFRGSRS--SPERDSLVRLSQRR---PDLVDAQYTILATDADPVEKMPLVEHCQFK 181
Query: 314 YKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSV 373
Y G A S L++IL C S+ L + Q+++FF L P ++ P+ S + +
Sbjct: 182 YLFNFRGVAASFRLRHILLCRSLVLHVGDQWQEFFYSQLKPWVHYVPVASDADVDELAEL 241
Query: 374 VDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYK 421
+ + + AE+I + GQ F+ L M+ V Y ++ EY+KLL YK
Sbjct: 242 ILYLREHDDLAEEIAERGQQFIWLHLRMEDVQCYWSKMLQEYAKLLTYK 290
>gi|195443898|ref|XP_002069626.1| GK11622 [Drosophila willistoni]
gi|194165711|gb|EDW80612.1| GK11622 [Drosophila willistoni]
Length = 377
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/337 (21%), Positives = 143/337 (42%), Gaps = 23/337 (6%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF- 163
C K +++DL + + ++ + I + R+ R I +LY D + MF
Sbjct: 37 NCSCHEKVLNQDLAAYQSTGVSRQMIESSARYGT-RYKIYENQLYRD-------EKCMFP 88
Query: 164 -TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFP-LPLFRYCTNDAHFDIPFP 221
G LR+ ++P++D++ + D P I+ S P+F + + DI +P
Sbjct: 89 ARCQGIEHFLRQLLPVLPNMDLIINTRDYPQINTAWGNSVGNGPVFSFSKTKEYRDIMYP 148
Query: 222 DWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
W+FW G L P W+ + ++ + A W +K +++G+ + L
Sbjct: 149 AWTFWAGGPATRLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGSRTSEERDSLIL 208
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKS-----KLSNQCNHRYKIYAEGYAWSVSLKYI 330
+ +L A + + KD + + C ++Y G A S LK++
Sbjct: 209 LSRRQPQLVEARYTKNQAWKSPKDTLNATPADEVSFEDHCKYKYLFNFRGVAASFRLKHL 268
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKA 390
C S+ + ++++FF L P ++ P+ S + ++ + + A+ I +
Sbjct: 269 FLCKSLVFHVGDEWQEFFYDQLKPWIHYIPLKSYPSQEEYEEILQYFRTHDQLAKNIAEQ 328
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPS 426
G +F+ + L M V Y L+ Y+KLL Y+ P S
Sbjct: 329 GYNFVWQHLRMKDVKCYWRKLLKGYAKLLTYEVRPES 365
>gi|442620545|ref|NP_001262849.1| rumi, isoform B [Drosophila melanogaster]
gi|440217767|gb|AGB96229.1| rumi, isoform B [Drosophila melanogaster]
Length = 316
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/258 (22%), Positives = 117/258 (45%), Gaps = 12/258 (4%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-- 235
+PD+D++ + D P ++ + P+F + + DI +P W+FW G L P
Sbjct: 45 LPDMDLIINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRG 104
Query: 236 ---WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQN 292
W++ + ++ + A W +K +++G+ + L+ + +L A+ +
Sbjct: 105 IGRWDQMREKLEKRAAAIPWSQKRSLGFFRGSRTSDERDSLILLSRRNPELVEAQYTKNQ 164
Query: 293 WAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDF 347
+ KD + + C ++Y G A S LK++ C S+ + ++++F
Sbjct: 165 GWKSPKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCKSLVFHVGDEWQEF 224
Query: 348 FSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDY 406
F L P ++ P+ S + + ++ + N + A++I + G DF+ E L M + Y
Sbjct: 225 FYDQLKPWVHYVPLKSYPSQQEYEHILSFFKKNDALAQEIAQRGYDFIWEHLRMKDIKCY 284
Query: 407 MLHLITEYSKLLDYKPAP 424
L+ Y KLL Y+ P
Sbjct: 285 WRKLLKRYVKLLQYEVKP 302
>gi|443694719|gb|ELT95788.1| hypothetical protein CAPTEDRAFT_221044 [Capitella teleta]
Length = 501
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/371 (23%), Positives = 156/371 (42%), Gaps = 38/371 (10%)
Query: 102 KVQTCPDFFKSIHKDLEPW------AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYD 155
K CP ++ I +DL + A + + + R A ++ ++Y Y +
Sbjct: 145 KRMNCPAGYRQIKQDLSIFGDIDMKAVKQEAVERFNQRGRHALCHYVVKNNQIYRQTYGE 204
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
V + MF L L R+ +PDV+ + D P ++K++ PLP+F +C +D
Sbjct: 205 HVGFK-MFMDAMLLSLTRKV--HLPDVEFFVNLGDWP-LEKRKVSEGPLPIFSWCGSDDT 260
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + +E L+ D+ + W K A+W+G R+
Sbjct: 261 RDIVMPTYDV---TESTLETMGRITLDLLSVQANTGPKWSNKSSVAFWRGRDS--RQERL 315
Query: 274 ELMKCN-------DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
+L+K + D+KL + N +E + K + ++Y++ +G +
Sbjct: 316 DLVKLSRKHPEVIDAKLTNMFFFKHN-VDEVGELVKHISFFDFFKYKYQLNIDGTVAAYR 374
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
Y+L+ +S+ L Y + F + L P ++ P+ DL ++ + W N +AE+
Sbjct: 375 FPYLLAGDSLVLKQDSIYYEHFYKDLKPYVHYVPL-KKDLSDVMQQL-QWAQKNDRQAEQ 432
Query: 387 IGKAGQDFMESLTMDR-VYDYMLHLITEYSKLL--------DYKPAPPSSAFEACVESLL 437
I K GQDF+ M R ++ Y L Y K L D + PP + EA E L
Sbjct: 433 IAKNGQDFVREHLMSRDIFCYHAVLFNAYHKKLSSPVEVDPDMELVPPETGREA--ECLC 490
Query: 438 CLADPKQRQNL 448
+P Q+ L
Sbjct: 491 DRVNPPQKDEL 501
>gi|449669688|ref|XP_002167607.2| PREDICTED: KDEL motif-containing protein 1-like [Hydra
magnipapillata]
Length = 497
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 165/364 (45%), Gaps = 27/364 (7%)
Query: 102 KVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK----RFAALRILIVRGKLYVDPYYDCV 157
K CP ++ I KDL+P+ I + +++E+ A I + K+Y Y +
Sbjct: 143 KNMACPVNYQQIEKDLKPFPN--INLENLIESATKSYNVAFCHYTIKKNKVY-RKCYGTI 199
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
MFT L + R+ +PDV+ + D P+ K+ + P+P+F +C ++ FD
Sbjct: 200 NDFKMFTDAWLLSVARKVK--LPDVEFFTNLGDWPLTTKRFN---PMPIFSWCGSNDTFD 254
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFK-DIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLR-V 273
+ +P + +E L+ + D+ G+ SW K P A+++G L V
Sbjct: 255 LVWPTYDL---TESTLETFGGRVSLDMTSIQGNTGPSWNHKKPVAFFRGRDSRQERLDLV 311
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYIL 331
+ N + G +E K G +++S + ++Y++ +G + L Y+L
Sbjct: 312 NRFRKNANFDVGITHYFFFKHDEEKYGPIANRVSFYDFFKYKYQLNIDGTVAAYRLPYLL 371
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+ +SV L +Y + F LIP K++ P S DL +++ V W N +A+KI G
Sbjct: 372 AGDSVVLKQDSKYYEHFYGDLIPMKHYIPFNS-DLS-NLEEKVLWAIQNDEKAQKIALEG 429
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEK 450
Q + ++L D++Y Y L+ EY+K P + E C +R+N +K
Sbjct: 430 QRYARDNLLSDKLYCYTYLLLKEYAKRQSTPPTVRNGMEEVIQPKENCSC---ERRNSKK 486
Query: 451 AAAS 454
A +
Sbjct: 487 KAGN 490
>gi|380022449|ref|XP_003695058.1| PREDICTED: O-glucosyltransferase rumi homolog [Apis florea]
Length = 407
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 145/333 (43%), Gaps = 22/333 (6%)
Query: 74 YKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA 133
YK+++ Y +A+ + R+ + +C F I DL+P+ + I + +++
Sbjct: 43 YKVLNKQYSKYYNAIEEAEKDYRICNNTN-NSC--FKDVIINDLKPFKEKGIN-KDLIDT 98
Query: 134 KRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPV 193
+ I++GKLY DC+ I FL L+ PG+ D+D++ + D P
Sbjct: 99 AKIRGTFYQIIQGKLYRQK--DCMFPSRCAGIEYFL--LKLAPGLT-DMDLVINVRDYPQ 153
Query: 194 IDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGS 247
K H PLP+F + ++DI +P W+FW G ++L P W+E + S
Sbjct: 154 SSK--HFGDPLPIFSFSKTSQYYDITYPAWAFWEGGPAISLYPRGLGRWDEHCISLDKAS 211
Query: 248 QAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGF-----K 302
W++K +++G+ + L+ L A+ + + +D
Sbjct: 212 NNTLWEKKENKVFFRGSRTSSERDNLVLLSRKKPNLVNAQYTKNQAWKSNEDTLYAPPAS 271
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIP 362
+ L C ++Y G A S K++ C S+ + ++ +F+ +IP ++ P+
Sbjct: 272 EVPLEAHCKYKYLFNYRGVAASFRHKHLFLCRSLVFHVGDEWIEFYYNAMIPWIHYIPVS 331
Query: 363 SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
++ ++ + N ++KI G+DF+
Sbjct: 332 KDANQTVLEEIIQFAIDNDEISKKIANRGRDFI 364
>gi|443727132|gb|ELU14012.1| hypothetical protein CAPTEDRAFT_20245 [Capitella teleta]
Length = 371
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 155/341 (45%), Gaps = 33/341 (9%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
SP C I DL+PW K+ IT + +AK + I+ KLY
Sbjct: 28 SPCNPDDCSCHLGVIESDLKPW-KNGITEQLFQQAKARGSNHYQIINHKLYR-------S 79
Query: 159 SRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC-TNDAH 215
+ MF G + +PD++ + + D P HG+ PLP+F + +
Sbjct: 80 EKCMFPSRCSGIEHFILEVIHKLPDMEFILNERDWP--QASIHGA-PLPIFSFSKVPTDN 136
Query: 216 FDIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVL 268
+DI +P W+FW G V L W+E+ K I ++ W +K A+++G+
Sbjct: 137 WDIMYPAWTFWEGGPAVWPIYPTGLGRWDEQRKIIPEAAKKWPWHKKQSKAFFRGS--RT 194
Query: 269 SPLR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGY 321
SP R + L+ + L A+ + + KD K+ L++ C +Y G
Sbjct: 195 SPDRDPLVLLSRAEPDLADAQYTKNQAWKSEKDTLNMLPAKELTLADHCEWKYLFNFRGV 254
Query: 322 AWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANP 381
A S K++ C+SV + + +FF L P ++ P+ DL + + ++ + N
Sbjct: 255 AASFRYKHLFLCDSVVFHVGDDWLEFFYPALKPWVHYIPV-RRDL-KDARDLIQFAKEND 312
Query: 382 SEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYK 421
+ ++I + G++F+ ++L ++ V Y L+L+ Y+KL+ +K
Sbjct: 313 AIVKQIAQRGREFIWQNLRLEDVSCYWLNLLKRYAKLMTWK 353
>gi|340369006|ref|XP_003383040.1| PREDICTED: protein O-glucosyltransferase 1-like [Amphimedon
queenslandica]
Length = 325
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 139/318 (43%), Gaps = 22/318 (6%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
+ DL W + R + + IV GKLY + DC+ S F G +
Sbjct: 4 LESDLGVWRERGGIKREEFIHAKSKGVHYQIVNGKLYREK--DCLFS---FRCKGVEHFI 58
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+P+++++ + D P K PLP+F + ++DI +P W+FW G V
Sbjct: 59 LNIIEDLPNMELIINVFDYP---KSHKYHSPLPVFSFSKTVHYWDIMYPAWTFWSGGPAV 115
Query: 232 NLQPWNEEFKDIKHGSQAKS-----WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
+++P D+K S KS W +K +++G+ + L+ + L A
Sbjct: 116 SVEPTGLGRWDLKRISITKSAKQWPWDKKKSLLFFRGSRTSSERDSLILLSRDKPHLVDA 175
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ +KD + KL C ++Y + G A S K++ C+SV +
Sbjct: 176 AYTKNQAWRSSKDTLNAPPADEVKLEEHCQYKYLVNFRGVAASFRFKHLFLCHSVVFHVG 235
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
+++ +FF L P ++ P+ + + I+ ++D+ N A+ I G +F+ +L
Sbjct: 236 KEWIEFFYPALKPWIHYVPLTTDTV--DIQDMIDFVKDNDDIAKSIAVRGFEFVWNNLRP 293
Query: 401 DRVYDYMLHLITEYSKLL 418
+ V Y L+ EYSKLL
Sbjct: 294 EDVECYWKRLLIEYSKLL 311
>gi|82658302|ref|NP_001032511.1| KTEL motif-containing protein 1 precursor [Danio rerio]
gi|81097746|gb|AAI09459.1| Zgc:123318 [Danio rerio]
Length = 389
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 125/274 (45%), Gaps = 17/274 (6%)
Query: 167 GFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW 226
G + + +PD++++ + D P + LP+ + + DI +P W+FW
Sbjct: 107 GVEHFILKVIDRLPDLEVVINVRDYPQVPGWIQPV--LPVLSFSKTKDYQDIMYPAWTFW 164
Query: 227 -GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
G V L W+ D+K + WK+K P +++G+ + L+
Sbjct: 165 EGGPAVWPIYPTGLGRWDLMRDDLKRSVEQWPWKKKSPKGFFRGSRTSSERDPLILLSRA 224
Query: 280 DSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
L AE + + KD K+ L + C ++Y G A S LK++ C
Sbjct: 225 APDLVDAEYTKNQAWKSDKDTLGKPPAKEVTLVDHCEYKYLFNFRGVAASFRLKHLFLCG 284
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF 394
S+ + +++ +FF L P ++ P+ DL + ++ + N + AE+I G++F
Sbjct: 285 SLVFHVGEEWIEFFYIQLKPWVHYIPV-KQDLS-DLSELLQFVKENDAVAEEIAIRGRNF 342
Query: 395 -MESLTMDRVYDYMLHLITEYSKLLDYKPAPPSS 427
++ L M+ +Y Y L+T++SKLL YKP S+
Sbjct: 343 ILDHLRMEDLYCYWEMLLTDFSKLLTYKPKRKSN 376
>gi|302834814|ref|XP_002948969.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
nagariensis]
gi|300265714|gb|EFJ49904.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
nagariensis]
Length = 646
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/372 (20%), Positives = 149/372 (40%), Gaps = 66/372 (17%)
Query: 109 FFKSIHKDLEPWAKSRI--------TMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
+ SIH+DL+ + ++ TM R L + + RGK+Y+ + +
Sbjct: 37 LYASIHRDLDIYKRNAGISPNLIYRTMVLHSAGYREKGLAVAVYRGKVYIISNTRSISLK 96
Query: 161 AM---FTIWG-----FLQLLRRYPGMVPDVDIMFDCMDKPV-IDKKEHGSFPLPLFRYCT 211
+W L L +Y +PDV+ ++ +D+P+ + G P+FR+C
Sbjct: 97 RFGHHVALWVAYIKVLLDLEEKYGSYLPDVEFVWHTIDRPIRLVNTTPGGENFPVFRFCK 156
Query: 212 NDAHFDIPFPDWSFWGWSEVNLQPWNEEFKD-IKHGSQAKSWKEKLPFAYW--------- 261
+ H DI P++ F+ ++P+ EF D I H + W ++ P +
Sbjct: 157 SVVHPDILVPNFHFY------MKPYQREFLDRIPHFNAEVPWAQRRPIVFARFSGYVRYV 210
Query: 262 --------------------KGNPDVLSPLRVELMKCNDS---------KLWGAEILRQN 292
KG + P+R L + +++G + +
Sbjct: 211 HPGDPSAQRLGAGGRQLCEVKGTTTSICPVREHLHNWAANYTSPLRSLKEVYGVSLSYSD 270
Query: 293 WAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL 352
+ +KD + + + +RY ++ +G S L+ +L+ S+ + Y F+ L
Sbjct: 271 DLDISKD--QHLPMKDHMEYRYLLHVDGQGLSSKLETLLTLGSLVMKEESGYMAFYHHLL 328
Query: 353 IPTKNHFPIPSADLC-RSIKSVVDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHL 410
P ++ P+ A +I + W + +EA++I AGQ L+ + + L L
Sbjct: 329 KPFEHFVPVWRAGTGPETILDALAWARTHDAEAQRIAAAGQALTAKYLSSEARACFWLKL 388
Query: 411 ITEYSKLLDYKP 422
+ EY L YKP
Sbjct: 389 LEEYGNTLSYKP 400
>gi|348506788|ref|XP_003440939.1| PREDICTED: protein O-glucosyltransferase 1-like [Oreochromis
niloticus]
Length = 366
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 126/265 (47%), Gaps = 23/265 (8%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFP-LPLFRYCTNDAHFDIPFPDWSFW-GWSEV----- 231
+PD++++ + D P + H P LP+F + + DI +P W+FW G V
Sbjct: 96 LPDLEMVVNVRDYPQV---PHWVQPTLPVFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYP 152
Query: 232 -NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--VELMKCNDSKLWGAEI 288
L W+ D+K + WK+K +++G+ SP R + L+ +L AE
Sbjct: 153 TGLGRWDLMRDDLKKSAAQWPWKKKESKGFFRGSRT--SPERDPLILLSREAPELVDAEY 210
Query: 289 LRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQ 343
+ + KD K+ L + C ++Y G A S K++ C S+ + +
Sbjct: 211 TKNQAWKSEKDTLGRPPAKEIPLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDE 270
Query: 344 YKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDR 402
+++FF L P ++ P+ DL ++ ++ + N + A++I G++F + L M+
Sbjct: 271 WQEFFYPQLKPWVHYIPV-RQDLS-DLRELLQFVKENDAIAQEIATRGKEFILNHLRMED 328
Query: 403 VYDYMLHLITEYSKLLDYKPAPPSS 427
V Y L+TE+S+LL YKP +S
Sbjct: 329 VSCYWEKLLTEFSQLLTYKPQRKNS 353
>gi|198433875|ref|XP_002126603.1| PREDICTED: similar to KTEL motif-containing protein 1 [Ciona
intestinalis]
Length = 398
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/326 (22%), Positives = 136/326 (41%), Gaps = 29/326 (8%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQ 170
I DL+ W + M+ ++ I+ KLY Q + MF G
Sbjct: 65 ITSDLKLWRERGGITNEDMKRGLERSVHYQIIDHKLYR-------QDKCMFPSRCSGIEH 117
Query: 171 LLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC-TNDAHFDIPFPDWSFWGWS 229
+ +PD+++ + D P + K H +P+P+ + H DI +P W+FW
Sbjct: 118 FILEIINDLPDMELGINVHDWPQVMK--HSPYPMPILSFSKVAKEHQDIMYPAWTFWAGG 175
Query: 230 EV-------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSK 282
L W+ KD+K W++K +++G+ + L+ +
Sbjct: 176 PAVWPIYRNGLGRWDLMRKDLKKKDNEFPWEKKQNIGFFRGSRTSSERDPLVLLSRENPD 235
Query: 283 LWGAEILRQNWAEEAKDGFKKS-----KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
L A+ + + KD L + C ++Y G A S LK++ C S+
Sbjct: 236 LVDAQYTKNQAWKSKKDTLGAEPAEIVHLLDHCQYKYLFNFRGVAASFRLKHLFLCGSLV 295
Query: 338 LIISQQYKDFFSRGLIPTKNHFPI-PSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
+ + + +FF L P ++ P+ PS + +K ++++ N +KI G+ F+
Sbjct: 296 FHVGEDWLEFFYPALKPWVHYIPVSPSLN---EVKDLINFAKENDEVVKKIANRGKRFIT 352
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYK 421
+ L MD + Y L+ +Y++LL YK
Sbjct: 353 KHLKMDDISCYWKKLLFQYAELLKYK 378
>gi|47216704|emb|CAG00978.1| unnamed protein product [Tetraodon nigroviridis]
Length = 392
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/345 (24%), Positives = 149/345 (43%), Gaps = 36/345 (10%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCV 157
P C I DL P+ K +I+ ++M A +R I+ KLY + +C+
Sbjct: 44 PCSSDNCSCHLSVIQDDLRPF-KGKIS-ENLMAATIQRGVGTHYQIIGHKLYREE--NCM 99
Query: 158 -QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
+R LQL+ G +PD++++ + D P + K LP+F + +
Sbjct: 100 FPARCSGVEHFILQLI----GRLPDMEMVVNVRDYPQVPKWVDSL--LPVFSFSKTADYQ 153
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSW------KEKLPFAYWKGNPDVLSP 270
DI +P W+FW + PW + K+ + + W + +L + +
Sbjct: 154 DIMYPAWTFWEGGPADQWPWKQ--KETRGFFRGSRWVMFDLIQTRLIYPAALSGSSLCFC 211
Query: 271 LRVE-------LMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYA 318
R L+ D +L AE + KD K+ L + C ++Y
Sbjct: 212 ARTSSERDPLVLLSREDPELVDAEYTKNQAWRSEKDTLGRPPAKEIPLVDHCKYKYLFNF 271
Query: 319 EGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGN 378
G A S K++ C S+ + ++ +FF L P ++ P+ DL +++++ +
Sbjct: 272 RGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPV-KQDLS-DVRALLQFAK 329
Query: 379 ANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYKP 422
N + A++I G++F + L M+ V Y L+TEYS+LL YKP
Sbjct: 330 ENDALAQEIATRGKEFILHHLRMEDVSCYWEKLLTEYSRLLTYKP 374
>gi|119481815|ref|XP_001260936.1| hypothetical protein NFIA_089970 [Neosartorya fischeri NRRL 181]
gi|119409090|gb|EAW19039.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 444
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 145/343 (42%), Gaps = 43/343 (12%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYV-----DPYYDCVQSR 160
P F I K + + IT + + + +R I RG+LY+ PY SR
Sbjct: 89 PKLFVEIEKTVSLRKERPITFKEVDDVVVEDGMVRAAIWRGELYILDYAAQPY---TYSR 145
Query: 161 AMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
A T+ + L+ +P +PD++ +F D + P P++ Y D I
Sbjct: 146 AKATLNSLHRALQSFPDRHSLPDIEFVFTADD--------FSNVPGPVWSYSKRDEDESI 197
Query: 219 -PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAK-----------SWKEKLPFAYWKGNPD 266
PD+ +W W EV + P+ + + I + + +KEK W+G+
Sbjct: 198 WLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEGEVHPDGSLVPGMEFKEKKKQLVWRGSVA 257
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
+R +L+K + W A I +W E + + C + + + EG ++S
Sbjct: 258 TNPEVRGKLLKAAQGRSW-ASIRVIDWDNENDVRYNLLPMEEHCRYMFLAHTEGRSFSGR 316
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANP 381
KY+L+C SV + +++ L+ P N+ + +DL R I ++D NP
Sbjct: 317 GKYLLNCRSVVVSHKLVWREAHHAALVASGPEANYVEVERDFSDLDRKISFLID----NP 372
Query: 382 SEAEKIG-KAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
AE+I A + F + LT Y HLI +Y+ D++P
Sbjct: 373 EIAERIADNAVRAFRDRYLTPAAESCYWRHLIRQYAASCDFEP 415
>gi|328780644|ref|XP_003249836.1| PREDICTED: o-glucosyltransferase rumi homolog, partial [Apis
mellifera]
Length = 391
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 66/298 (22%), Positives = 130/298 (43%), Gaps = 19/298 (6%)
Query: 109 FFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGF 168
F I DL+P+ + I + +++ + I++GKLY DC+ I F
Sbjct: 75 FKDVIINDLKPFKEKGIN-KDLIDIAKIRGTFYQIIQGKLYRQK--DCMFPSRCAGIEYF 131
Query: 169 LQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-G 227
L L+ PG+ D+D++ + D P K + PLP+F + ++DI +P W+FW G
Sbjct: 132 L--LKLAPGLT-DMDLVINVRDYPQSSK--YFGDPLPIFSFSKTSQYYDITYPAWAFWEG 186
Query: 228 WSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSK 282
++L P W+E + S W++K +++G+ + L+
Sbjct: 187 GPAISLYPRGLGRWDEHCISLDKASNNTLWEKKENKVFFRGSRTSSERDNLVLLSRKKPN 246
Query: 283 LWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
L A+ + + +D + L C ++Y G A S K++ C S+
Sbjct: 247 LVDAQYTKNQAWKSNEDTLYAPPASEVPLEAHCKYKYLFNYRGVAASFRHKHLFLCRSLV 306
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ ++ +F+ +IP ++ P+ ++ ++ + N ++KI G+DF+
Sbjct: 307 FHVGDEWSEFYYNAMIPWIHYIPVSKDANQTVLEEIIQFAIDNDDISKKIANHGRDFI 364
>gi|56118414|ref|NP_001008114.1| MGC89395 protein precursor [Xenopus (Silurana) tropicalis]
gi|51703984|gb|AAH81318.1| MGC89395 protein [Xenopus (Silurana) tropicalis]
Length = 385
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 110/233 (47%), Gaps = 19/233 (8%)
Query: 204 LPLFRYCTNDAHFDIPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKL 256
+P+F + + DI +P W+FW G V L W+ +++K + W++K+
Sbjct: 138 IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREELKKAADLWPWEKKI 197
Query: 257 PFAYWKGNPDVLSPLR--VELMKCNDSKLWGAEILR-QNWAEE----AKDGFKKSKLSNQ 309
P Y++G+ SP R + L+ L AE + Q W E + K+ L +
Sbjct: 198 PKGYFRGSRT--SPERDPLILLSRESPDLVDAEYTKNQAWKSERDTLGRPPAKEVPLVDH 255
Query: 310 CNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRS 369
C +RY G A S LK++ C S+ + + +FF L P ++ P+ S DL
Sbjct: 256 CAYRYLFNFRGVAASFRLKHLFLCGSLVFHVGDDWLEFFYHRLEPWVHYVPV-SPDLA-D 313
Query: 370 IKSVVDWGNANPSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYK 421
++ ++ + + N E ++I + G F+ L M V Y L+ +YS+LL Y+
Sbjct: 314 LRELLQFVSENDEEVKRIAERGHTFIRQFLRMADVSQYWRSLLAQYSQLLQYR 366
>gi|426245590|ref|XP_004016593.1| PREDICTED: KDEL motif-containing protein 2 [Ovis aries]
Length = 557
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 141/333 (42%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CPD I KD + + I KRF R IV + + Y +
Sbjct: 201 QKTLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNHIYRRSLGK 260
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 261 YTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNETPGPLPIISWCGSLDS 318
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + + L+ + I+ G+ SW K A+++G L++
Sbjct: 319 QDVILPTYDITHSTLEALRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQLVQ 377
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 378 LSKENPQLLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYLML 437
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL +K W N EAEKI K
Sbjct: 438 GDSLVLKQDSPYYEHFYMALKPWKHYIPIKRNLSDLLEKVK----WAKENDEEAEKIAKE 493
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 494 GQLTARDLLQPHRLYCYYYRVLQKYAEHQLSKP 526
>gi|187608427|ref|NP_001120575.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus (Silurana)
tropicalis]
gi|171846847|gb|AAI61552.1| LOC100145729 protein [Xenopus (Silurana) tropicalis]
gi|189442192|gb|AAI67365.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
gi|195540008|gb|AAI68093.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
Length = 509
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 136/331 (41%), Gaps = 15/331 (4%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q+ +CPD I KD +P+ +T KRFA ++ L + Y +
Sbjct: 154 QETLSCPDTEAQISKDFDPFPSIDLTRMLDEVPKRFADRGAIVHYTVLNNNIYRHSMGRY 213
Query: 161 AMFTIWG--FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAHFD 217
F ++ LQ L R +PD + + D PV +K + + PLP+ +C + D
Sbjct: 214 TDFKMFSDEMLQSLAR-KVRLPDFEFYINVGDWPVEHRKANDTPGPLPMISWCGSADSRD 272
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I P + + L+ + I+ G SW K +++G L++ M
Sbjct: 273 IILPTYDITHSTLETLRGVTNDLLSIQ-GHTGPSWSNKTEQGFFRGRDSREERLQLVHMS 331
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLS---NQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A I + E ++ K+ L + N++Y++ +G + Y++ +
Sbjct: 332 RKHPELLDAGITGYFFFRELEEELGKASLIGFFDFFNYKYQVNVDGTVAAYRFPYLMLGD 391
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
S+ L Y + F GL P K++ P DL I+ W + EA++I K GQ
Sbjct: 392 SLVLKQDSPYYEHFYSGLKPWKHYVPFKRNLGDLLEKIQ----WAKDHDEEAKQIAKEGQ 447
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
E L R+Y Y L Y+K KP
Sbjct: 448 TLARELLQPHRLYCYYYKLFENYAKRQTSKP 478
>gi|169617900|ref|XP_001802364.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
gi|111059424|gb|EAT80544.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
Length = 428
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/338 (22%), Positives = 151/338 (44%), Gaps = 32/338 (9%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYV---DPYYDC--VQSRA 161
PD + S+ K + A ++ ++ I + +R++I +G+L+V +C V
Sbjct: 68 PDLYYSLDKAVSRRAGRKVHIQDIEIVEGRCMMRVMIYQGELFVVNAGKPEECYVVNGNE 127
Query: 162 MFTIWGFLQLLRRYPGM-------VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA 214
I G L + R +P+++ F D P + KE G+F F Y D
Sbjct: 128 RERILGTLAQIDRAITTAPTSDPSIPNIEFSFSLDDLP-LRSKEKGAF----FGYTRKDT 182
Query: 215 -HFDIPF--PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+D + P++++W W+ + WN ++I+ G + W +K P W+G ++ L
Sbjct: 183 PEYDNIWMMPNYAYWSWNYTHAPSWNSIRREIEQGEKKTPWHKKDPRVVWRGKIK-MAEL 241
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS--KLSNQCNHRYKIYAEGYAWSVSLKY 329
R EL++ ++ K W ++ A D K L C ++Y + EG ++S LKY
Sbjct: 242 RKELVRVSEGKRWSD--IKPVVINNATDVHTKDVMNLRQFCGYKYTVQTEGTSYSGRLKY 299
Query: 330 ILSCNSVALIISQQYKDFFSRGL---IPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
+ C S + ++++F + L P N+ I ++ +++ +++ + EAE+
Sbjct: 300 LQLCRSALITHPLEWQEFHTHLLRLSGPDINY--IEASKNFGNLEDAMEYYRVHDDEAEE 357
Query: 387 IGKAGQDFMES--LTMDRVYDYMLHLITEYSKLLDYKP 422
I + + LT + Y + T ++ + Y+P
Sbjct: 358 IARNSYETFARRYLTPAAITCYWRRMFTSWASVQGYEP 395
>gi|255955591|ref|XP_002568548.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590259|emb|CAP96435.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 469
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 101/217 (46%), Gaps = 21/217 (9%)
Query: 180 PDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWN 237
PD + +F DK +D + +P+ +F ++ + PD+SFW W N + P++
Sbjct: 191 PDFEFVFSVEDK--VDDVTNSEWPVWVFSRTPSEEGVWL-MPDFSFWAWDNKNNYIGPYD 247
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
+ + IKH W EK P W+G P LR LM+ K WG ++ + +W
Sbjct: 248 QVVERIKH--MDIPWSEKTPQLVWRGKPSFAPKLRRALMEAARGKSWG-DVKQVDW---- 300
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---P 354
G K+ + C + + + EG ++S SLKY +CNSV + Q+ LI P
Sbjct: 301 NTGSNVLKMEDHCRYMFIAHVEGRSYSASLKYRQACNSVIVAHKLQFIQHHHYLLISDGP 360
Query: 355 TKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+N+ + +DL I+ +V +N A++I
Sbjct: 361 NQNYVEVERDFSDLSEKIEPLV----SNTEAAKRIAN 393
>gi|118783733|ref|XP_001230897.1| AGAP004267-PA [Anopheles gambiae str. PEST]
gi|193806712|sp|A0NDG6.1|RUMI_ANOGA RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|116129010|gb|EAU76986.1| AGAP004267-PA [Anopheles gambiae str. PEST]
Length = 399
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 136/330 (41%), Gaps = 25/330 (7%)
Query: 116 DLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQLLR 173
DL+P+ IT I AK++ ++ KLY Q MF G +R
Sbjct: 81 DLKPFKAHGITKEMINRAKQYGT-HYQVIGHKLYR-------QRECMFPARCSGVEHFVR 132
Query: 174 RYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVN 232
++PD+D++ +C D P I + +P+ + + DI +P W+FW G +
Sbjct: 133 PLLPLLPDMDLIVNCRDWPQI-HRHWSKEKIPVLSFSKTAEYLDIMYPAWAFWEGGPAIA 191
Query: 233 LQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAE 287
L P W+ + I S W+ K P A+++G+ + L+ L A+
Sbjct: 192 LYPTGLGRWDLHRQTITKAS--ADWEAKEPKAFFRGSRTSDERDALVLLSRAQPSLVDAQ 249
Query: 288 ILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
+ + +D ++ L C +R+ G A S K++ C S+ +
Sbjct: 250 YTKNQAWKSPQDTLNAEPAREVTLEEHCRYRFLFNFRGVAASFRFKHLFLCRSLVFHVGD 309
Query: 343 QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMD 401
++++FF L P ++ P+P +++++ + + A I + G + + L M
Sbjct: 310 EWQEFFYPSLKPWVHYVPVPVRSTPEELEALITFFQEHDQLARAIAERGYEHIWNHLRMA 369
Query: 402 RVYDYMLHLITEYSKLLDYKPAPPSSAFEA 431
V Y L+ Y KL+ Y S+ E
Sbjct: 370 DVECYWKKLLKRYGKLIRYTVERDSTLIEV 399
>gi|148922833|ref|NP_001092215.1| KDEL motif-containing protein 2 [Danio rerio]
gi|148744719|gb|AAI42840.1| Zgc:165521 protein [Danio rerio]
Length = 518
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 77/343 (22%), Positives = 148/343 (43%), Gaps = 29/343 (8%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIME------AKRFAALRILIVRGKLY- 149
VA+ Q+V CP I KD + I ++ +++ AKR + I+ +++
Sbjct: 156 VAAWQRVMQCPQEDPQIQKDFSSFPS--IDLQQLLQEVPTRFAKRGGLIHYTILNNQVHR 213
Query: 150 --VDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPL 206
+ Y D MF+ L L R+ +PDV+ + D P+ ++K + + P+P+
Sbjct: 214 RSLGRYTDF----KMFSDEILLSLARKV--KLPDVEFYINVGDWPMENRKVNDNPGPVPV 267
Query: 207 FRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPD 266
+C + DI P + S ++ + ++ G+ +W +K+ A+++G
Sbjct: 268 ISWCGSTETRDIILPTYDITHSSLEAMRGVTNDLLSVQ-GNTGPTWSDKMNKAFFRGRDS 326
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAW 323
LR+ M + +L A I + + + K+ L + ++Y++ +G
Sbjct: 327 REERLRLVTMSKENPELLDAGITAYFFFRDREKDLGKAPLVGFFDFFKYKYQVNVDGTVA 386
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANP 381
+ Y++ NS+ L Y + F L P ++ P+ +DL IK W +N
Sbjct: 387 AYRFPYLMLGNSLVLKQDSPYYEHFYTHLKPGVHYIPVKRDLSDLIEKIK----WAKSND 442
Query: 382 SEAEKIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKPA 423
+EAE I + GQ + L R+Y Y + Y+ +PA
Sbjct: 443 TEAEAIARRGQSLVRDLLQPHRLYCYYYKVFQTYADRQSSRPA 485
>gi|405122052|gb|AFR96820.1| hypothetical protein CNAG_06994 [Cryptococcus neoformans var.
grubii H99]
Length = 422
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 79/367 (21%), Positives = 152/367 (41%), Gaps = 53/367 (14%)
Query: 93 ERRRVASPQKVQTCPDFFKSIHKDLEPW--AKSRITMRHIMEAKRFAALRILIVRGKLYV 150
E R + Q + PD + + E W +K I+ + + EA+ R++I+ +L+V
Sbjct: 49 ENRHLTEEQCQERYPDLYLEADRAQE-WFISKGGISKKMVDEAEEEGNARLVILNNQLFV 107
Query: 151 DPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
Y + +R I + +PDVD VI + G P F C
Sbjct: 108 KAYKGGINTRTQAAIAAVYGTVLTATEPLPDVDF--------VIQTSDAGGGNHPHFALC 159
Query: 211 TNDAHFDI-PFPDWSFWGW--------SEVNLQPWNEEFK-----DIKHGSQAKSWKEKL 256
D+ PD+ F+ W SEV + + E D K G W K
Sbjct: 160 RKANQKDLWLMPDFGFFSWPEPGVGSYSEVRTKTLDYELDMGLEVDNKLGVTKSDWFNKT 219
Query: 257 PFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSN-------Q 309
+W+G+P V +R +L++ + + W +++ +W + ++D ++ K +
Sbjct: 220 QQLFWRGSPMV--EVRNDLLRASQDQPW-SDVQPLDWGKVSQDETERLKNNGDLKSPAEH 276
Query: 310 CNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF-----SRGLIPTKNHFPIP-- 362
C + + + EG+A+S LKY+ C SV + +Y + R P +N+ +P
Sbjct: 277 CKYAFLAHVEGWAYSGRLKYLQQCRSVIVAHPLKYIQHYHHLLNGRDGDPNQNYVEVPLP 336
Query: 363 -SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD-----YMLHLITEYSK 416
+L R+++ ++ N EK+ + + +S+ + Y H + +Y+
Sbjct: 337 LEKNLPRAMEDLLKEQN-----EEKVQRIADNNWKSMRQGWISPAANECYYRHALRQYAS 391
Query: 417 LLDYKPA 423
+ +KP+
Sbjct: 392 VQTFKPS 398
>gi|270007100|gb|EFA03548.1| hypothetical protein TcasGA2_TC013552 [Tribolium castaneum]
Length = 399
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 153/360 (42%), Gaps = 25/360 (6%)
Query: 85 LSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIV 144
L ++ I + SP C + I DL+ + K IT + I + K + I+
Sbjct: 48 LKYLDLIKRAKENYSPCDNTKCGCYSSQISDDLKIFKKG-ITPQLIDKVKT-KGTKYQII 105
Query: 145 RGKLYVDPYYDCVQSRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF 202
KLY D MF G L + +PD++++ + D P I K++G F
Sbjct: 106 DHKLYRD-------KNCMFPARCAGIEHFLLKLLPKLPDMELIINTRDWPQI-HKDYGVF 157
Query: 203 PLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKL 256
P+F + + DI +P W+FW G ++L P W+ + W EK+
Sbjct: 158 G-PVFSFSKTSDYSDIMYPAWAFWEGGPAISLYPRGIGRWDTHRDLLGKKGNETLWDEKI 216
Query: 257 PFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILR-QNWAEEAKDGFK----KSKLSNQCN 311
P +++G+ + L+ L A+ + Q W +A + + + C
Sbjct: 217 PKGFFRGSRTSAERDPLVLLSREKPHLVDAQYTKNQAWKSDADTLHQPPAPEVSFEDHCK 276
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIK 371
++Y G A S K+IL C S+ + + +FF L P ++ P+ + I+
Sbjct: 277 YKYLFNFRGVAASFRFKHILLCKSLVFHVGSDWLEFFYPALKPWIHYIPVEANASKEKIE 336
Query: 372 SVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
+V + + + A++I + G + + +L + V Y L+ +Y+KLL YKP + E
Sbjct: 337 ELVQFVLSYDNIAKEIAENGYNMIWNNLKLVDVTCYWRKLLKQYAKLLTYKPEIDNDLIE 396
>gi|91082811|ref|XP_968605.1| PREDICTED: similar to endoplasmic reticulum-resident kdel protein
[Tribolium castaneum]
Length = 362
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 153/360 (42%), Gaps = 25/360 (6%)
Query: 85 LSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIV 144
L ++ I + SP C + I DL+ + K IT + I + K + I+
Sbjct: 11 LKYLDLIKRAKENYSPCDNTKCGCYSSQISDDLKIFKKG-ITPQLIDKVKT-KGTKYQII 68
Query: 145 RGKLYVDPYYDCVQSRAMF--TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF 202
KLY D MF G L + +PD++++ + D P I K++G F
Sbjct: 69 DHKLYRD-------KNCMFPARCAGIEHFLLKLLPKLPDMELIINTRDWPQI-HKDYGVF 120
Query: 203 PLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKL 256
P+F + + DI +P W+FW G ++L P W+ + W EK+
Sbjct: 121 G-PVFSFSKTSDYSDIMYPAWAFWEGGPAISLYPRGIGRWDTHRDLLGKKGNETLWDEKI 179
Query: 257 PFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILR-QNWAEEAKDGFK----KSKLSNQCN 311
P +++G+ + L+ L A+ + Q W +A + + + C
Sbjct: 180 PKGFFRGSRTSAERDPLVLLSREKPHLVDAQYTKNQAWKSDADTLHQPPAPEVSFEDHCK 239
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIK 371
++Y G A S K+IL C S+ + + +FF L P ++ P+ + I+
Sbjct: 240 YKYLFNFRGVAASFRFKHILLCKSLVFHVGSDWLEFFYPALKPWIHYIPVEANASKEKIE 299
Query: 372 SVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
+V + + + A++I + G + + +L + V Y L+ +Y+KLL YKP + E
Sbjct: 300 ELVQFVLSYDNIAKEIAENGYNMIWNNLKLVDVTCYWRKLLKQYAKLLTYKPEIDNDLIE 359
>gi|328860135|gb|EGG09242.1| hypothetical protein MELLADRAFT_115837 [Melampsora larici-populina
98AG31]
Length = 515
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 112/242 (46%), Gaps = 16/242 (6%)
Query: 107 PDFFKSIHKDLEPWAKS--RITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
P+ F + + ++ + +S ++T + EA + R+LI ++Y+ Y R
Sbjct: 180 PNLFYELERVVKFYKESNKKVTSEQLDEAIKVGHARVLIYENRVYIKEYKGGPGKRTEAL 239
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF-PDW 223
+ + + P +P+++ + +D P ++ + LPL+ D+ PD+
Sbjct: 240 LNSIQEAVITSPERLPNIEFVVKTVDAPTGEETK-----LPLWVLDRTIDQEDVWLTPDY 294
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL 283
F+ W E + E + WK+K+P A+W+G +L LR ++++
Sbjct: 295 GFYSWPEPKVGSMIEVRDKCNEIEKKLDWKDKIPKAFWRG--AILVKLREQMIEIAKGHE 352
Query: 284 WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQ 343
W +I W + DG K+ C ++Y ++AEGYA+S LKY+ C SV I+S +
Sbjct: 353 WN-DIKPIVW--QHLDGLLKTP-EEHCQYQYLVHAEGYAYSGRLKYLQMCRSV--IVSHE 406
Query: 344 YK 345
K
Sbjct: 407 MK 408
>gi|212539017|ref|XP_002149664.1| DUF821 domain protein [Talaromyces marneffei ATCC 18224]
gi|210069406|gb|EEA23497.1| DUF821 domain protein [Talaromyces marneffei ATCC 18224]
Length = 468
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 149/351 (42%), Gaps = 36/351 (10%)
Query: 99 SPQKVQTCPDFFKSIHKD--LEPWAKSRITMRHIMEAKRFAA--LRILIVRGKLYVDPYY 154
S Q + P F I + L A + IT + I K+ R +I +G+LY+ Y
Sbjct: 107 SEQCLAAFPKMFIEIDRSVALRKEANNPITFKEIDSRKKLGQGMARAMIYQGELYIIEYG 166
Query: 155 DCV--QSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
D + SRA T+ + L P +P ++ F D D K G P++ Y
Sbjct: 167 DMMYTASRAKSTLHSLHRALVAMPDGESLPSIEFHFSADDFVWDDLKLAGG---PVWSYS 223
Query: 211 TNDAHFDIP-----------FPDWSFWGWSEVNLQPWNEEFKDIKH-GSQAKSWKEKLPF 258
D DI PD+ +W W EV++ P+ E + I ++ +S++ K
Sbjct: 224 KRDTA-DIDEEDVDDSNIWLMPDFGYWAWPEVDIAPYRETRRRIAAVDAEFESFQSKKKQ 282
Query: 259 AYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYA 318
W+G+ + LR L++ +K W A + +W + + + C + + +
Sbjct: 283 LMWRGSLNTAVELRNGLLEATKNKFW-ASVRVVDWGNKTSMEENIVPIEDHCRYMFLAHT 341
Query: 319 EGYAWSVSLKYILSCNSVALI-----ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSV 373
EG ++S KY+L+C+SV + + F S G P N+ + A +++S
Sbjct: 342 EGRSFSGRGKYLLNCHSVFITHPLIWREAHHAAFISSG--PEANYIEV--ARDFSNLESK 397
Query: 374 VDWGNANPSEAEKIGKAGQDFMESLTMDRVYD--YMLHLITEYSKLLDYKP 422
V++ NP AE+I + + Y +L+ +Y ++ D++P
Sbjct: 398 VEYLLDNPQVAERIANNSVATFRDRYLTPAAEACYWRYLVRKYGEVSDFEP 448
>gi|403262867|ref|XP_003923788.1| PREDICTED: KDEL motif-containing protein 2 [Saimiri boliviensis
boliviensis]
Length = 406
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 143/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + RI ++ ++ KRF R +V + + Y
Sbjct: 50 QKTLSCPTKEPQIAKDFASFP--RINLQQMLNEVPKRFGDERGAVVHYTILNNHIYRRSL 107
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 108 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 165
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 166 DSRDIVLPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 224
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 225 VQLSKKNPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 284
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI DL +K W N EA+KI
Sbjct: 285 MLGDSLVLKQDSPYYEHFYTALEPWKHYIPIKRNLGDLLEKVK----WAKENDEEAKKIA 340
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ Y++ KP
Sbjct: 341 KEGQLMARDLLQPHRLYCYYYQVLQNYAERQSGKP 375
>gi|242819900|ref|XP_002487408.1| DUF821 domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713873|gb|EED13297.1| DUF821 domain protein [Talaromyces stipitatus ATCC 10500]
Length = 482
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 149/351 (42%), Gaps = 36/351 (10%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSR--ITMRHIMEAKRFAA--LRILIVRGKLYVDPYY 154
S Q + P F I K +E ++ IT + I K R +I +G+L++ Y
Sbjct: 119 SEQCLAAFPKLFTEIDKSVEQRKETNNPITFKEIDSRKTLGQGMARAMIYKGELFIIEYG 178
Query: 155 DCV--QSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
D + SRA T+ + L P +P ++ F D D K G P++ Y
Sbjct: 179 DMMYTASRAKSTLHSLHRALVATPDRESLPSIEFHFSADDFVWDDLKLAGG---PVWAYS 235
Query: 211 TNDAHFDIP-----------FPDWSFWGWSEVNLQPWNEEFKDIKH-GSQAKSWKEKLPF 258
D DI PD+ +W W EV++ P+ E + I ++ K+++ K
Sbjct: 236 KRDTS-DIDEDAVDDSNIWLMPDFGYWAWPEVDIAPYRETRRRIAAVDAEFKTFQSKKKQ 294
Query: 259 AYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYA 318
W+G+ + + LR L+ +K W A + +W + + + C + + +
Sbjct: 295 LLWRGSLNTAAELRNGLIDATKNKYW-ASVRVVDWGNKKSVEENLLPIEDHCRYMFLAHT 353
Query: 319 EGYAWSVSLKYILSCNSVALI-----ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSV 373
EG ++S KY+L+C+SV + + F S G P N+ + A +++S
Sbjct: 354 EGRSFSGRGKYLLNCHSVFITHPLIWREAHHAAFVSSG--PEANYIEV--ARDFSNLESK 409
Query: 374 VDWGNANPSEAEKIGKAGQDFMESLTMDRVYD--YMLHLITEYSKLLDYKP 422
V++ NP AE+I + + Y +LI +Y ++ D++P
Sbjct: 410 VEYLLDNPQIAERIANNSVTTFRDRYLTPAAEACYWRYLIQKYGEVSDFEP 460
>gi|49119606|gb|AAH73128.1| LOC443629 protein, partial [Xenopus laevis]
Length = 507
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 19/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q+ +CPD I KD EP+ +T KRFA R IV + + Y R
Sbjct: 152 QETLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFAD-RGAIVHYTVLNNQIYRRSMGR 210
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L R+ +PD + + D PV ++K + + PLP+ +C +
Sbjct: 211 YTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPVENRKANDTPGPLPMISWCGSSDS 268
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + L+ + I+ G SW K +++G L++
Sbjct: 269 RDIILPTYDITHSTLETLRGVTNDLLSIQ-GHTGPSWSNKTEQGFFRGRDSREERLQLVQ 327
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
M +L A I + E ++ K+ L + ++Y++ +G + Y++
Sbjct: 328 MSRKHPELLDAGITGYFFFRELENELGKASLIGFFDFFKYKYQVNVDGTVAAYRFPYLML 387
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ P DL I+ W + EA +I K
Sbjct: 388 GDSLVLKQDSPYYEHFYSALKPWKHYIPFKRNLGDLIEKIQ----WAKDHDEEARQIAKE 443
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ + L R+Y Y + Y+K KP
Sbjct: 444 GQTLVRELMQPHRLYCYYYKVFENYAKRQTSKP 476
>gi|147905630|ref|NP_001085283.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus laevis]
gi|114107936|gb|AAI23308.1| LOC443629 protein [Xenopus laevis]
Length = 509
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 19/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q+ +CPD I KD EP+ +T KRFA R IV + + Y R
Sbjct: 154 QETLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFAD-RGAIVHYTVLNNQIYRRSMGR 212
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L R+ +PD + + D PV ++K + + PLP+ +C +
Sbjct: 213 YTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPVENRKANDTPGPLPMISWCGSSDS 270
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + L+ + I+ G SW K +++G L++
Sbjct: 271 RDIILPTYDITHSTLETLRGVTNDLLSIQ-GHTGPSWSNKTEQGFFRGRDSREERLQLVQ 329
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
M +L A I + E ++ K+ L + ++Y++ +G + Y++
Sbjct: 330 MSRKHPELLDAGITGYFFFRELENELGKASLIGFFDFFKYKYQVNVDGTVAAYRFPYLML 389
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ P DL I+ W + EA +I K
Sbjct: 390 GDSLVLKQDSPYYEHFYSALKPWKHYIPFKRNLGDLIEKIQ----WAKDHDEEARQIAKE 445
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ + L R+Y Y + Y+K KP
Sbjct: 446 GQTLVRELMQPHRLYCYYYKVFENYAKRQTSKP 478
>gi|297482669|ref|XP_002693013.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
gi|296480318|tpg|DAA22433.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 2 [Bos taurus]
Length = 508
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 141/333 (42%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CPD I KD + + I KRF R IV + + Y +
Sbjct: 152 QKTLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGK 211
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 212 YTDFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETPGPLPIISWCGSLDS 269
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + + L+ + I+ G+ SW K A+++G L++
Sbjct: 270 QDVILPTYDITHSTLEALRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQLVQ 328
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 329 LSKENPQLLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYLML 388
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL V+W N EA+KI K
Sbjct: 389 GDSLVLKQDSPYYEHFYMALKPWKHYIPIKRNLSDLLEK----VEWAKENDEEAKKIAKE 444
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 445 GQLTARDLLQPHRLYCYYYTVLQKYAERQLSKP 477
>gi|388854493|emb|CCF51880.1| uncharacterized protein [Ustilago hordei]
Length = 459
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/377 (22%), Positives = 155/377 (41%), Gaps = 46/377 (12%)
Query: 94 RRRVASPQKVQTC----PDFFKSIHKDLEPWAK-SRITMRHIMEAK---RFAALRILIVR 145
RRR + + C P F+ + + W K I H+ A R + +LI
Sbjct: 46 RRREDLHMRQEVCRMEFPRFYPQLAANEIAWKKKGGIGYHHVQSAASSCRHGCVHVLIKH 105
Query: 146 GKLYVDPYYDCVQSRAMFTIWGFLQLLR-RYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL 204
G++++ QSR + LQLL Y G + M + ++ VI + F
Sbjct: 106 GQVFIRQQAKDWQSR----VRSVLQLLTDAYKGAGEEERSMMEGIEL-VISTADFDGFTD 160
Query: 205 PLFRYC--------TNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKL 256
P+ N+ FPD+SF W E + + E + + + W+ K
Sbjct: 161 PIGSQGAGWVLDKKVNETDGQYLFPDFSFASWPEAGIASYPEFRRAAEQVNAETPWRSKA 220
Query: 257 PFAYWKGNPDVLSPLR-----VELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN 311
A+W+G+ + S ++ + + ++ W +++ R ++ E+ K C
Sbjct: 221 NKAFWRGDALLNSAIQARNSLLSVATGPGTEEW-SDVKRTSFWEQGPGIDKIVSAPEHCR 279
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI-----PTKNHFPIPSA-- 364
HR+ I++EG A+S K+IL C+S + +++ F LI P +NH +P
Sbjct: 280 HRFLIHSEGVAYSGRSKFILGCHSTVITHELEWEQHFHPALISSPSSPDQNHIQLPGTYF 339
Query: 365 -DLCRSIKSV----VDWGNA----NPSEAEKIGKAGQDFMES--LTMDRVYDYMLHLITE 413
+L ++++++ +D A + EKI K + + LT Y+ +
Sbjct: 340 ENLAQTMQNLIREEIDAEGAILSRMATTGEKIAKNAKRTLTDRYLTPAATACYVRAALMS 399
Query: 414 YSKLLDYKPAPPSSAFE 430
Y +++D + P A E
Sbjct: 400 YGRMMDSRSWPGGQAAE 416
>gi|432964664|ref|XP_004086966.1| PREDICTED: KDEL motif-containing protein 1-like [Oryzias latipes]
Length = 504
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 147/331 (44%), Gaps = 26/331 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEA-----KRFAALRILIVRGKLYVDPYYDCVQSR 160
CP F I +DL + R+ E +R + I K+Y+ + + V R
Sbjct: 156 CPQAFAQIDQDLSVFTTVDPD-RNAREIPPRFRQRHSLCHYTIKDNKVYIKTFGEHVGFR 214
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++K++ P+F +C +++ DI
Sbjct: 215 -IFMDAVLLSLTRKV--HLPDVEFFVNLGDWP-LEKRKPTEEIHPIFSWCGSNSSRDIVM 270
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + +E L+ D+ G+ +W EK A+W+G R+EL++
Sbjct: 271 PTYDL---TESVLETMGRVSLDMMSVQGNTGPAWPEKNATAFWRGRDSRRE--RLELVQL 325
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + + N+ +E+ G K + ++Y+I +G + L Y+L+
Sbjct: 326 SRAHPDLIDAAFTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINIDGTVAAYRLPYLLA 385
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
+SV + Y + F + L +++ P+ ADL ++ + W + SEA+KI AGQ
Sbjct: 386 GDSVVFKVDSAYYEHFYKQLRAWEHYVPV-RADLGDLLEK-IQWARDHDSEAQKIALAGQ 443
Query: 393 DFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
F M D V+ Y L TEYSKL +P
Sbjct: 444 QFARRHLMGDTVFCYYYRLFTEYSKLQVSQP 474
>gi|326913914|ref|XP_003203277.1| PREDICTED: KDEL motif-containing protein 1-like [Meleagris
gallopavo]
Length = 572
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 152/384 (39%), Gaps = 70/384 (18%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP F I +DL + I +RF + L I ++Y+ Y + V R
Sbjct: 223 NCPQVFPQIQRDLANFPVVDPDKIAIEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR 282
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P+ KK + P+F +C + DI
Sbjct: 283 -IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKKPPQNL-HPIFSWCGSSESKDIVM 338
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC-- 278
P + + + + ++ + SW++K A+W+G R+EL+K
Sbjct: 339 PTYDLTDSVLETMGRVSLDMMSVQ-ANTGPSWEDKNTTAFWRGRDSRKE--RLELVKLSR 395
Query: 279 -----------------NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGY 321
+D L+G + ++ D FK ++Y+I +G
Sbjct: 396 KYPEIIDAAFTNFFFFKHDENLYGPIVKHISFF----DFFK---------YKYQINIDGT 442
Query: 322 AWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNA 379
+ L Y+L+ NSV L Y + F L P K++ P S +DL ++ W
Sbjct: 443 VAAYRLPYLLAGNSVVLKQDSIYYEHFYNELQPWKHYIPFKSDLSDLLEKLQ----WAKE 498
Query: 380 NPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC 438
+ EA+KI K+GQ+F +L D ++ Y L EYS L
Sbjct: 499 HDEEAKKIAKSGQEFARNNLMGDHIFCYYFKLFQEYSSLQ-------------------- 538
Query: 439 LADPKQRQNLEKAAASPSPYPPCT 462
+++PK R +EK PCT
Sbjct: 539 VSEPKIRDGMEKVQQPEDDLFPCT 562
>gi|34193987|gb|AAH36526.3| KDEL (Lys-Asp-Glu-Leu) containing 2 [Homo sapiens]
Length = 507
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 145/335 (43%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 151 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 208
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 209 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 266
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G + L++
Sbjct: 267 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSLEERLQL 325
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 326 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 385
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 386 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 441
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 442 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|345561463|gb|EGX44552.1| hypothetical protein AOL_s00188g220 [Arthrobotrys oligospora ATCC
24927]
Length = 437
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/275 (21%), Positives = 118/275 (42%), Gaps = 12/275 (4%)
Query: 84 CLSAMNPIPERRRVASPQKVQTC-PDFFKSIHKDLEPWAKSRITMRHIMEAKR--FAALR 140
++ +P ER + + Q+ + P + I + +E IT+ + F R
Sbjct: 73 TVTPFSPHDERNFIMTEQQCEASFPGLYVEIERAVEDRRPRPITLDELESVTHSAFGYFR 132
Query: 141 ILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHG 200
+I +LYV D SR T+ + + P +P+++ + D ID ++
Sbjct: 133 GIIYNQELYVIKMLDPNFSRGFATLMAMHRAIVTSPEPLPNIEFTLNTAD--YIDFEQSA 190
Query: 201 SFPLPLFRYCT-NDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFA 259
+ + Y ++ + PD+ +W W E + +NE + +A W++K+P
Sbjct: 191 A----TWTYARRSNETVNWLMPDFGYWSWPEPKVGSYNEVRLKARLADEAIPWEKKIPKI 246
Query: 260 YWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAE 319
W+G L P+R+ L+ W A++ +W +S+ C++R+ + E
Sbjct: 247 VWRGATLKL-PVRLALLNQTKGAAW-ADVKALDWQSAESKEKNLLSMSDHCHYRFVAHTE 304
Query: 320 GYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIP 354
G+++S LKY+ C SV + + + LIP
Sbjct: 305 GHSYSGRLKYLQQCRSVVVAHKLDWIQHYHSLLIP 339
>gi|358415519|ref|XP_003583131.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
Length = 460
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 141/333 (42%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CPD I KD + + I KRF R IV + + Y +
Sbjct: 104 QKTLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGK 163
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 164 YTDFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETPGPLPIISWCGSLDS 221
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + + L+ + I+ G+ SW K A+++G L++
Sbjct: 222 QDVILPTYDITHSTLEALRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQLVQ 280
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 281 LSKENPQLLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYLML 340
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL V+W N EA+KI K
Sbjct: 341 GDSLVLKQDSPYYEHFYMALKPWKHYIPIKRNLSDLLEK----VEWAKENDEEAKKIAKE 396
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 397 GQLTARDLLQPHRLYCYYYTVLQKYAERQLSKP 429
>gi|119190301|ref|XP_001245757.1| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
gi|392868638|gb|EAS34429.2| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
Length = 463
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 100/427 (23%), Positives = 166/427 (38%), Gaps = 76/427 (17%)
Query: 3 FPSRTPAR-SPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQ 61
FPSR R S + PC++++ + + L+ D G N+ P L P
Sbjct: 5 FPSRRIYRISCLIITPCLVAIIGMGIFGLWFGMNSD----------GDNVHPLLTQLIPA 54
Query: 62 RTFKEESRRSQAYKIVHCTYLTCLSAMNPIP---------------------ERRRVA-- 98
+A +CT +C+ ++ P + R V
Sbjct: 55 GHCT-----CKASTTFNCT--SCVDSITPTTFETASPSLSPSSPAWTFVYGRDDRNVGLT 107
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDC 156
S Q + P F+ +H+ + WA + ++ R R +I G LYV
Sbjct: 108 SSQCQSSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYV------ 161
Query: 157 VQSRAMFT-----IWGFLQLLRR-YPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
+++RA I G L + R VPD++ +F D+ V D G L R
Sbjct: 162 IETRAAQEDHRQKILGVLSSIHRALQDNVPDIEFIFSIEDR-VDDVSGTGQPLWVLGRKA 220
Query: 211 TNDAHFDIPFPDWSFWGWSE--VNLQPWNEEFKDIKH--GSQAKSWKEKLPFAYWKGNPD 266
+ ++ + IP D+ FW W N+ P+++ + I+ W K W+G
Sbjct: 221 SEESVWLIP--DFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLS 278
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
LR L++ +K WG ++ W + KD F + N C + + + EG A+S S
Sbjct: 279 FSPKLRRGLLEAARNKPWG-DVKELVWKK--KDNF--ISMQNHCKYMFVAHVEGRAFSSS 333
Query: 327 LKYILSCNSVALIISQQY---KDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANP 381
LKY +C SV + QY + + P +N + +DL + ++ ANP
Sbjct: 334 LKYRQACRSVIVAHKLQYIQHHHYLLQSDGPHQNFVEVERDFSDLSEKMSELL----ANP 389
Query: 382 SEAEKIG 388
A KI
Sbjct: 390 EMARKIA 396
>gi|118084661|ref|XP_416963.2| PREDICTED: KDEL motif-containing protein 1 [Gallus gallus]
Length = 500
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 154/373 (41%), Gaps = 48/373 (12%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP F I +DL + + +RF + L I ++Y+ Y + V R
Sbjct: 151 NCPQVFPQIQRDLANFPVVDPDKIAVEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR 210
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ P+F +C + DI
Sbjct: 211 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKPPQKLHPIFSWCGSSESKDIVM 266
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + SW++K A+W+G R+EL+K
Sbjct: 267 PTYDL---TDSVLETMGRVSLDMMSVQANTGPSWEDKNTTAFWRGRDS--RKERLELVKL 321
Query: 279 NDS--KLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ +L A + + ++ + K + ++Y+I +G + L Y+L+
Sbjct: 322 SRKYPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKYKYQINIDGTVAAYRLPYLLA 381
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
NSV L Y + F L P K++ P S +DL ++ W + EA+KI K+
Sbjct: 382 GNSVVLKQDSIYYEHFYNELQPWKHYIPFKSDLSDLLEKLQ----WAKEHDEEAKKIAKS 437
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLE 449
GQ+F +L D ++ Y L EYS L +++PK R +E
Sbjct: 438 GQEFARNNLMGDHIFCYYFKLFQEYSSLQ--------------------VSEPKIRDGME 477
Query: 450 KAAASPSPYPPCT 462
K PCT
Sbjct: 478 KVQQPDDDLFPCT 490
>gi|109108571|ref|XP_001104429.1| PREDICTED: KDEL motif-containing protein 2-like isoform 2 [Macaca
mulatta]
Length = 507
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 140/333 (42%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R IV + + Y +
Sbjct: 151 QKTLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILDNHIYRRSLGK 210
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 211 YTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSLDS 268
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 269 RDVVLPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQLVQ 327
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 328 LSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLML 387
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL +K W N EA+KI K
Sbjct: 388 GDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIAKE 443
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 444 GQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|73955182|ref|XP_546537.2| PREDICTED: KDEL motif-containing protein 2 [Canis lupus familiaris]
Length = 508
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 145/335 (43%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
Q+ +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 152 QRTLSCPTKEPQIAKDFASFPS--INLQQMLDEVPKRFGDERGAIVHYTILNNHIYRRSL 209
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 210 GKYTDFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETPGPLPIISWCGSL 267
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 268 DSRDIILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQL 326
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 327 VQLSKENPELLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYL 386
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ NS+ L +Y + F L P K++ PI +DL IK W N EA+KI
Sbjct: 387 MLGNSLVLKQDSKYYEHFYMALQPWKHYVPIKRNLSDLLEKIK----WAKENDEEAQKIA 442
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ +P
Sbjct: 443 KDGQLAARDLLQPHRLYCYYYRVLQKYAERQSSRP 477
>gi|296216111|ref|XP_002754413.1| PREDICTED: KDEL motif-containing protein 2 [Callithrix jacchus]
Length = 506
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 139/333 (41%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R +V + + Y +
Sbjct: 150 QKTLSCPTKELQIAKDFASFPSINLQQMLNEVPKRFGDERGAVVHYTILNNRIYRRSLGK 209
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 210 YTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSLDS 267
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 268 RDIVLPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQLVQ 326
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 327 LSQKNPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLML 386
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI DL +K W N EA+KI K
Sbjct: 387 GDSLVLKQDSPYYEHFYMALEPWKHYVPIRRNLGDLLEKVK----WAKENDEEAKKIAKE 442
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 443 GQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 475
>gi|317147538|ref|XP_001822208.2| hypothetical protein AOR_1_1442014 [Aspergillus oryzae RIB40]
gi|391873092|gb|EIT82167.1| hypothetical protein Ao3042_00673 [Aspergillus oryzae 3.042]
Length = 438
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 135/309 (43%), Gaps = 36/309 (11%)
Query: 137 AALRILIVRGKLYVDPYYDCVQSRAMFTIW-GFLQLLRRYPGMVPDVDIMFDCMDKPVID 195
+R +I G+LY+ D A FT L L R PD D + + + V+
Sbjct: 118 GMVRGIIDHGELYI---VDFGNMPATFTRGKATLNSLHRALASFPDRDRLPNV--EFVLT 172
Query: 196 KKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKH--------- 245
+++ S P++ Y + + ++ PD+ +W W EV + P+ + + I
Sbjct: 173 TEDYSSGEGPIWSYSKREENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAAIDDGEVTVD 232
Query: 246 -----GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDG 300
G Q + K++L W+GN +R +L+K + W A IL +W +E
Sbjct: 233 GQVIPGMQFQDKKKQL---VWRGNVATNPQVRGKLLKAAQGRSW-ASILAIDWGDENDIR 288
Query: 301 FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKN 357
F + C + + + EG ++S KY+L+C SV + +++ LI P N
Sbjct: 289 FNLLPIEEHCRYMFLAHTEGRSFSGRGKYLLNCRSVVISHKLVWREAHHAALISSGPEAN 348
Query: 358 HFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAG-QDFMES-LTMDRVYDYMLHLITE 413
+ + +DL I+ ++D NP AE+I + F + LT Y HLI +
Sbjct: 349 YVEVERDFSDLDHKIEFLID----NPEAAERIANNSVKTFRDRYLTPAAESCYWRHLIRQ 404
Query: 414 YSKLLDYKP 422
Y+ +++P
Sbjct: 405 YASSSEFEP 413
>gi|196003266|ref|XP_002111500.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
gi|190585399|gb|EDV25467.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
Length = 506
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 135/299 (45%), Gaps = 22/299 (7%)
Query: 133 AKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKP 192
K A ++ K+Y + Y + V + MF L L R+ +PD++ + + D P
Sbjct: 190 GKHHAICHYSVISNKVYRNCYGEHVGFK-MFMDAILLSLARKVK--LPDLEFISNLGDWP 246
Query: 193 VIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI--KHGSQAK 250
+ +++ + +P+F +C ++ DI P + ++ L+ D+ GS
Sbjct: 247 LENQQNN---LIPIFSWCGSETTSDIVMPTYDL---TQSTLEMMGRVSVDVLAVQGSTGP 300
Query: 251 SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW--AEEAKDG--FKKSKL 306
WK+K+ +W+G L + +M N++ L + + +EA G K L
Sbjct: 301 KWKDKIEKGFWRGRDSRQERLNLVIMGRNNTDLMDTALTNFFFFKHDEALYGPIQKHISL 360
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--A 364
+ ++Y+I +G + + YIL+ +S+ L +Y + F + L ++ PI +
Sbjct: 361 FDFFKYKYQITLDGTVAAYRVPYILAGDSLVLKQDSEYYEHFYKDLKAWTHYVPIKRDLS 420
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYKP 422
DL IK W N +A+ I + Q F +++L D ++ Y L EYSK L KP
Sbjct: 421 DLIEKIK----WAKNNDKQAKTIAQNAQQFVLDNLLPDPIFCYHWQLFNEYSKRLTSKP 475
>gi|410971943|ref|XP_003992420.1| PREDICTED: KDEL motif-containing protein 2 [Felis catus]
Length = 445
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 81/335 (24%), Positives = 142/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ ++ KRF R IV + + Y
Sbjct: 89 QKTLSCPTKEPQIAKDFASFPS--INLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSL 146
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 147 GKYTDFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETPGPLPIISWCGSL 204
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 205 DSRDIILPTYDISHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQL 263
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
M + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 264 VQMSKENPELLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYL 323
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL IK W N EA+KI
Sbjct: 324 MLGDSLVLKQDSTYYEHFYMALKPWKHYVPIKRNLSDLLEKIK----WAKENDEEAKKIA 379
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y + EY++ KP
Sbjct: 380 KEGQLSARDLLQPHRLYCYYYRVFREYAERQSSKP 414
>gi|307194254|gb|EFN76650.1| CAP10 family protein AGAP004267 [Harpegnathos saltator]
Length = 342
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 62/266 (23%), Positives = 113/266 (42%), Gaps = 23/266 (8%)
Query: 109 FFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIW 166
F I +DL P+ K I+ I AK I++G LY + DC MF
Sbjct: 77 FKDVILRDLRPFKKKGISEEMINAAKTRGTF-YQIIKGTLYREK--DC-----MFPARCA 128
Query: 167 GFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW 226
G L + G + D+D++ + D P E+ LP+F + ++DI +P W+FW
Sbjct: 129 GIEHFLLKVIGNLSDMDLVINTRDYP--QSSEYFGNLLPIFSFSKTPQYYDIMYPAWAFW 186
Query: 227 -GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
G ++L P W++ K + S W++K +++G+ + L+ N
Sbjct: 187 EGGPAISLYPRGLGRWDQHRKSLNKASLETPWEKKESKGFFRGSRTSSERDNLILLSRNK 246
Query: 281 SKLWGAEILRQNWAEEAKDGFKKS-----KLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
S L A+ + + +D + L + C ++Y G A S K++ C S
Sbjct: 247 SHLVDAQYTKNQAWKSNEDTLHQEPASVVSLESHCGYKYLFNFRGVAASFRHKHLFLCRS 306
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPI 361
+ + + +F+ + P ++ P+
Sbjct: 307 LVFHVGDDWTEFYYDAMKPWIHYVPL 332
>gi|426370348|ref|XP_004052127.1| PREDICTED: KDEL motif-containing protein 2 [Gorilla gorilla
gorilla]
Length = 507
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 151 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 208
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 209 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 266
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 267 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 325
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 326 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 385
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 386 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 441
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 442 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|355567023|gb|EHH23402.1| hypothetical protein EGK_06865, partial [Macaca mulatta]
Length = 440
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 140/333 (42%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R IV + + Y +
Sbjct: 84 QKTLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILDNHIYRRSLGK 143
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 144 YTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSLDS 201
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 202 RDVVLPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQLVQ 260
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 261 LSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLML 320
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL +K W N EA+KI K
Sbjct: 321 GDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIAKE 376
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 377 GQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 409
>gi|114640225|ref|XP_001141284.1| PREDICTED: KDEL motif-containing protein 2 isoform 3 [Pan
troglodytes]
gi|410227956|gb|JAA11197.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410227958|gb|JAA11198.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410227960|gb|JAA11199.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265524|gb|JAA20728.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265526|gb|JAA20729.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265528|gb|JAA20730.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265530|gb|JAA20731.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265532|gb|JAA20732.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265534|gb|JAA20733.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410304276|gb|JAA30738.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410304278|gb|JAA30739.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410304280|gb|JAA30740.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410331311|gb|JAA34602.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
Length = 507
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 151 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 208
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 209 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 266
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 267 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 325
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 326 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 385
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 386 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 441
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 442 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|71018113|ref|XP_759287.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
gi|46099137|gb|EAK84370.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
Length = 473
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 113/264 (42%), Gaps = 33/264 (12%)
Query: 135 RFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL-RRYPGMVPDVDIMFDCMDKPV 193
R + ++I G++++ QSR T LQLL + Y G + + + V
Sbjct: 115 RHGCVHLIIKHGQIFIRAQKKDWQSRVRST----LQLLDKAYSGASEHEKALMEATEL-V 169
Query: 194 IDKKEHGSFPLPLFRYC-------TNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHG 246
I + F P R ND FPD+SF W E + + E + +
Sbjct: 170 ISTADFDGFTDPNSRGAGWVLDKRVNDTQGQYLFPDFSFASWPEAGIPSYQEFRRQAQQV 229
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKL 306
+ WK K A+W+G D L+ ++ + S GA + W++ + F S
Sbjct: 230 NAETPWKSKTNPAFWRG--DALAGQNIKPRESLLSVATGAGT--ETWSDVKRTSFWVSGP 285
Query: 307 S--------NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI----- 353
S C H++ I++EG A+S K+ILSC S ++ + +++ F LI
Sbjct: 286 SIEKIVSPPEHCRHKFLIHSEGVAYSGRSKFILSCQSAVVMHALEWEQHFHPALISDVTS 345
Query: 354 PTKNHFPIPSA---DLCRSIKSVV 374
P +NH +P + L +IKS++
Sbjct: 346 PDQNHLLLPGSYFEALPETIKSLI 369
>gi|153218487|ref|NP_714916.3| KDEL motif-containing protein 2 precursor [Homo sapiens]
gi|110810398|sp|Q7Z4H8.2|KDEL2_HUMAN RecName: Full=KDEL motif-containing protein 2; Flags: Precursor
gi|119587523|gb|EAW67119.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_a [Homo sapiens]
gi|119587524|gb|EAW67120.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_a [Homo sapiens]
Length = 507
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 151 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 208
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 209 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 266
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 267 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 325
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 326 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 385
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 386 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 441
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 442 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|358054730|dbj|GAA99656.1| hypothetical protein E5Q_06359 [Mixia osmundae IAM 14324]
Length = 437
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 111/249 (44%), Gaps = 20/249 (8%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
PD ++ + + W + IT + A A R+LI ++Y+ Y + SR + +
Sbjct: 102 PDLYREAERAAQYWRHRGGITRAALDAADAQAHARVLIKDNQVYLTNYRGGINSRTLAAL 161
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWS 224
+ + +P V+ + D I G+ P +F + + PD+
Sbjct: 162 ASLNEAVLTAVEELPAVEFVIQTDDSAPIA----GAAPRWVFARTDEEDELALWLMPDFG 217
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
+ W E + E A++W K+P +W+G +++PLR EL++ +D
Sbjct: 218 HYAWPEPGVGSMAEVQAKASAFEAAQTWSSKIPKLFWRGA--LVNPLRDELIRLSDEN-- 273
Query: 285 GAEILRQNWAE-EAKD-GFKKSKL---SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
R +W + +A D G + +L + C ++Y +AEG+A+S LKYIL C SV ++
Sbjct: 274 -----RGSWGDAKALDWGRLEGELRSPAQHCAYKYLAHAEGFAYSGRLKYILQCRSVVVM 328
Query: 340 ISQQYKDFF 348
+Y F
Sbjct: 329 HKLRYTQHF 337
>gi|355752611|gb|EHH56731.1| hypothetical protein EGM_06196, partial [Macaca fascicularis]
Length = 440
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 139/333 (41%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R IV + + Y +
Sbjct: 84 QKTLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILDNHIYRRSLGK 143
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 144 YTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSLDS 201
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 202 RDVVLPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQLVQ 260
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 261 LSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLML 320
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL +K W N EA KI K
Sbjct: 321 GDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEANKIAKE 376
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 377 GQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 409
>gi|320035743|gb|EFW17684.1| hypothetical protein CPSG_06127 [Coccidioides posadasii str.
Silveira]
Length = 463
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 100/430 (23%), Positives = 159/430 (36%), Gaps = 82/430 (19%)
Query: 3 FPSRTPAR-SPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQ 61
FPSR R S + PC++++ + + L+ D G N+ P L P
Sbjct: 5 FPSRRIYRISCLIITPCLVAIIGMGIFGLWFGMNSD----------GDNVHPLLTQLIPA 54
Query: 62 RTFKEESRRSQAYKIVHCT-----YLTCLSAMNPIP------------------------ 92
HCT C S ++PI
Sbjct: 55 G---------------HCTCKASTTFNCTSCVDPITPTTFETASPSLSPSPPAWTFVYGR 99
Query: 93 ERRRVA--SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKL 148
+ R V Q + P F+ +H+ + WA + ++ R R +I G L
Sbjct: 100 DDRNVGLTRSQCQSSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNL 159
Query: 149 YVDPYYDCVQSRAMFTIWGFLQLLRR-YPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLF 207
YV Q I G L + R VPD++ +F D+ V D G L
Sbjct: 160 YVIETR-AAQEDHRRKILGVLSSIHRALQDNVPDIEFIFSIEDR-VDDVSGTGQPLWVLG 217
Query: 208 RYCTNDAHFDIPFPDWSFWGWSE--VNLQPWNEEFKDIKH--GSQAKSWKEKLPFAYWKG 263
R + ++ + IP D+ FW W N+ P+++ + I+ W K W+G
Sbjct: 218 RKASEESVWLIP--DFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRG 275
Query: 264 NPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAW 323
LR L++ +K WG ++ W + KD F + N C + + + EG A+
Sbjct: 276 KLSFSPKLRRGLLEAARNKPWG-DVKELVWKK--KDNF--ISMQNHCKYMFVAHVEGRAF 330
Query: 324 SVSLKYILSCNSVALIISQQY---KDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGN 378
S SLKY +C SV + QY + + P +N + +DL + ++
Sbjct: 331 SSSLKYRQACRSVIVAHKLQYIQHHHYLLQSDGPHQNFVEVERDFSDLSEKMSELL---- 386
Query: 379 ANPSEAEKIG 388
ANP A KI
Sbjct: 387 ANPEMARKIA 396
>gi|332208116|ref|XP_003253144.1| PREDICTED: KDEL motif-containing protein 2 [Nomascus leucogenys]
Length = 507
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 143/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ ++ KRF R IV + + Y
Sbjct: 151 QKTLSCPTKEPQIAKDFASFPS--INLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSL 208
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 209 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 266
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 267 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 325
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 326 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 385
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 386 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 441
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 442 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|158297147|ref|XP_317424.4| AGAP008037-PA [Anopheles gambiae str. PEST]
gi|157015054|gb|EAA12302.4| AGAP008037-PA [Anopheles gambiae str. PEST]
Length = 513
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 125/269 (46%), Gaps = 19/269 (7%)
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK--EHGSFPLPLFRYCTNDAHF 216
S+ M TI L L R++ +PD+++ + D P++ K + P P+F +C +D F
Sbjct: 221 SKFMDTI--LLSLARKF--TLPDMEMFVNLGDWPLVKKGGPSRTTGPYPIFSWCGSDDTF 276
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKD-IKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + +E L+ D + + W +K A+W+G L +
Sbjct: 277 DIVMPTYDI---TESTLENMGRVMLDMLSVQRRGLPWADKHAKAFWRGRDARRERLELVA 333
Query: 276 MKCNDSKLWGAEILRQNW--AEEAKDGFKKSKLSNQ--CNHRYKIYAEGYAWSVSLKYIL 331
+ +L A + + EE++ G + + +S ++RY++ +G + L Y+L
Sbjct: 334 LSRRYPELLNASLTNFFFFRDEESEFGPRVAHISMHEFFDYRYQVNVDGTVAAYRLPYLL 393
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+ +SV L Y + F R L+P +++ P ADL ++ ++W N +A +I
Sbjct: 394 AGSSVVLKQDSFYYEHFYRKLVPMRHYIPF-EADLSNLVQQ-IEWARENDEKAREIRDNA 451
Query: 392 QDFMES--LTMDRVYDYMLHLITEYSKLL 418
F+ + L +D +Y Y L EY+K +
Sbjct: 452 NAFINANLLPLD-IYCYHALLFKEYAKYI 479
>gi|121716116|ref|XP_001275667.1| DUF821 domain protein [Aspergillus clavatus NRRL 1]
gi|119403824|gb|EAW14241.1| DUF821 domain protein [Aspergillus clavatus NRRL 1]
Length = 446
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 81/340 (23%), Positives = 140/340 (41%), Gaps = 37/340 (10%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDC--VQSRAMF 163
P F I K + + +T R + +R I RG+LYV Y SRA
Sbjct: 91 PKLFVEIDKSVTMRREKPVTFREVDGVSVDDGMVRAAIYRGELYVIDYAAMPYTYSRAKA 150
Query: 164 TIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PF 220
T+ + L +P +P+V+ +F D + P P++ Y D I
Sbjct: 151 TLNALHRALMAFPDRHSLPNVEFVFTTDD--------FSNTPGPIWSYSKRDEDDSIWLM 202
Query: 221 PDWSFWGWSEVNLQPWNE---EFKDIKHGSQAK--------SWKEKLPFAYWKGNPDVLS 269
PD+ +W W EV + + + + G+ +K+K W+G+
Sbjct: 203 PDFGYWSWPEVKIGAYKDIRRRIATVDSGTTTSDGKIIPGLEFKKKKKQLVWRGSVATNP 262
Query: 270 PLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
+R +L+K + W A I +W E + + + C + + + EG ++S KY
Sbjct: 263 EIRGKLLKAAQGRSW-ASIRVIDWDNENDIRYNLLPMEDHCRYMFLAHTEGRSFSGRGKY 321
Query: 330 ILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEA 384
+L+C SV + +++ L+ P N+ + +DL R I ++D NP A
Sbjct: 322 LLNCRSVVVSHKLVWREAHHAALVASGPDANYVKVERDFSDLDRKISFLID----NPETA 377
Query: 385 EKIG-KAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
E+I A + F + LT Y LI +Y+ D++P
Sbjct: 378 EQIADNAVRTFRDRYLTPAAESCYWRQLIRQYAASCDFEP 417
>gi|397516340|ref|XP_003828388.1| PREDICTED: KDEL motif-containing protein 2 isoform 2 [Pan paniscus]
Length = 499
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 143 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 200
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 201 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 258
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 259 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 317
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 318 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 377
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 378 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 433
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 434 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 468
>gi|452843790|gb|EME45725.1| hypothetical protein DOTSEDRAFT_71425 [Dothistroma septosporum
NZE10]
Length = 442
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 116/260 (44%), Gaps = 33/260 (12%)
Query: 139 LRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPG------MVPDVDIMFDCMDKP 192
LRIL R + D ++D R ++T+ Q+ R G ++P+++ D
Sbjct: 138 LRILETRNTHHFDDHHDS--RRVIYTL---SQINRALLGATARGEIMPNIEFAIAVQDYI 192
Query: 193 VIDKKEHGSFPLPLF--RYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKD-IKHGSQA 249
+ EH + + + + T PD++FW W + + + + I H +
Sbjct: 193 ELPDDEHDTHTIWTYDRKVGTRKDERMWLMPDFNFWAWHKGIHNSYTDATRRLISHDAPL 252
Query: 250 KSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQ 309
S K+P W+G+ S LR +L+ + K W ++ W + +
Sbjct: 253 SS---KIPQLAWRGDAG-FSQLRKDLLATANDKSWAD--VKSTWID----------FDDF 296
Query: 310 CNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPT--KNHFPIPSADLC 367
C + + IY EG+AWS LKY+L+CNS+A++ ++ F+ LIP + +F D
Sbjct: 297 CRYLFTIYTEGHAWSGRLKYMLNCNSIAIVHELEFLTFYHHLLIPDGPEQNFVSVKRDWS 356
Query: 368 RSIKSVVDWGNANPSEAEKI 387
++ V + +P EAE+I
Sbjct: 357 -DLEEKVQYYLGHPEEAERI 375
>gi|443897893|dbj|GAC75232.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
Length = 479
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 102/244 (41%), Gaps = 17/244 (6%)
Query: 126 TMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRR-YPGMVPDVDI 184
+++ E R + ++I G+++V QSR T +QLL Y G D
Sbjct: 104 NVQNAAENCRHGCVHLIIKHGQIFVRAQVKDWQSRVRST----MQLLTAAYQGASEDEKA 159
Query: 185 MFDCMDKPVIDKKEHGSFPLPLFRYC-------TNDAHFDIPFPDWSFWGWSEVNLQPWN 237
D + VI + F R ND FPD+SF W E + +
Sbjct: 160 RIDGTEL-VISTADFDGFTDSASRGAGWVLDKRVNDTEGQYLFPDFSFASWPEAGIASYP 218
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAE----ILRQNW 293
E D + + W +KL A+W+G+ S + V + + G E + R ++
Sbjct: 219 EFRHDAEQVNAETPWHDKLNPAFWRGDALKGSNIAVRASLLDVATGPGTESWSDVKRTSF 278
Query: 294 AEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI 353
EE K + C H++ I++EG A+S K+IL C S ++ + +++ F LI
Sbjct: 279 WEEGPGIGKIVSPAEHCRHKFLIHSEGVAYSGRSKFILGCQSTVVMHALEWEQHFHPALI 338
Query: 354 PTKN 357
T N
Sbjct: 339 ATPN 342
>gi|356561578|ref|XP_003549058.1| PREDICTED: LRR receptor-like serine/threonine-protein kinase
GSO1-like [Glycine max]
Length = 1258
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 10/96 (10%)
Query: 370 IKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSA 428
+KSVV +A +IGK DF+ E + MD VYDY+ HL+ Y+KL YKP+ ++A
Sbjct: 1133 LKSVV--------QAHQIGKVASDFIQEEVKMDYVYDYIFHLLNSYAKLFRYKPSISANA 1184
Query: 429 FEACVESLLCLADPKQRQ-NLEKAAASPSPYPPCTL 463
E CVES++C A+ ++ +E P+ PCT+
Sbjct: 1185 TELCVESMVCGAEGSVKKFMMESLVKVPANTDPCTM 1220
>gi|37182354|gb|AAQ88979.1| VELF1904 [Homo sapiens]
Length = 451
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 95 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 152
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 153 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 210
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 211 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 269
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 270 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 329
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 330 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 385
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 386 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 420
>gi|303315019|ref|XP_003067517.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
delta SOWgp]
gi|240107187|gb|EER25372.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
delta SOWgp]
Length = 463
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 99/427 (23%), Positives = 164/427 (38%), Gaps = 76/427 (17%)
Query: 3 FPSRTPAR-SPSYLFPCVISLSVISLAALFLDYKVDDFASKTKTLAGHNLEPTPWHLFPQ 61
FPSR R S + PC++++ + + L+ D G N+ P L P
Sbjct: 5 FPSRRIYRISCLIITPCLVAIIGMGIFGLWFGMNSD----------GDNVHPLLTQLIPA 54
Query: 62 RTFKEESRRSQAYKIVHCTYLTCLSAMNPIP---------------------ERRRVA-- 98
+A +CT +C+ + P + R V
Sbjct: 55 GHCT-----CKASTTFNCT--SCVDLITPTTFETASPSLSPSPPAWTFVYGRDDRNVGLT 107
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDC 156
Q + P F+ +H+ + WA + ++ R R +I G LYV
Sbjct: 108 RSQCQSSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYV------ 161
Query: 157 VQSRAM-----FTIWGFLQLLRR-YPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
+++RA I G L + R VPD++ +F D+ V D G L R
Sbjct: 162 IETRAAQEDHRRKILGVLSSIHRALQDNVPDIEFIFSIEDR-VDDVSGTGQPLWVLGRKA 220
Query: 211 TNDAHFDIPFPDWSFWGWSE--VNLQPWNEEFKDIKH--GSQAKSWKEKLPFAYWKGNPD 266
+ ++ + IP D+ FW W N+ P+++ + I+ W K W+G
Sbjct: 221 SEESVWLIP--DFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLS 278
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
LR L++ +K WG ++ W + KD F + N C + + + EG A+S S
Sbjct: 279 FSPKLRRGLLEAARNKPWG-DVKELVWKK--KDNF--ISMQNHCKYMFVAHVEGRAFSSS 333
Query: 327 LKYILSCNSVALIISQQY---KDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANP 381
LKY +C SV + QY + + P +N + +DL + ++ ANP
Sbjct: 334 LKYRQACRSVIVAHKLQYIQHHHYLLQSDGPHQNFVEVERDFSDLSEKMSELL----ANP 389
Query: 382 SEAEKIG 388
A KI
Sbjct: 390 EMARKIA 396
>gi|345324378|ref|XP_003430815.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
1-like [Ornithorhynchus anatinus]
Length = 588
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 142/336 (42%), Gaps = 28/336 (8%)
Query: 102 KVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCV 157
+V CP F I +DL + + +RF + L + K+Y+ + + V
Sbjct: 235 EVMNCPASFPQIQRDLAHFPTVDPDKIAVEIPQRFGQRQSLCHYTVKNNKVYIKTHGEHV 294
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
R +F L L R+ +PDV+ + D P+ KK + P+F +C + D
Sbjct: 295 GFR-IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKKSPDNL-HPIFSWCGSTDSKD 350
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
I P + ++ L+ D+ + +W++K A W+G R+EL
Sbjct: 351 IVMPTYDL---TDSVLETMGRVSLDMMSVQANTGPAWEDKNTTAVWRGRDSRKE--RLEL 405
Query: 276 MKCNDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
+K + + N+ +E+ G K + H+Y+I +G + L Y
Sbjct: 406 VKLSRKHPEIIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPY 465
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKI 387
+L+ NSV L Y + F L P +++ P S +DL +K W + EA+ I
Sbjct: 466 LLAGNSVVLKQDSIYYEHFYNELQPWEHYIPFKSNLSDLLEKLK----WAKDHDEEAKNI 521
Query: 388 GKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
KAGQ+F +L D ++ Y L EY+ L +P
Sbjct: 522 AKAGQEFARNNLMGDNIFCYYFKLFQEYASLQVSEP 557
>gi|156366125|ref|XP_001626991.1| predicted protein [Nematostella vectensis]
gi|156213886|gb|EDO34891.1| predicted protein [Nematostella vectensis]
Length = 412
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/331 (22%), Positives = 139/331 (41%), Gaps = 25/331 (7%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
+ C + I DL W + +A R I+ LY + DC+ F
Sbjct: 81 KGCGCYKDVIEDDLRRWKDG--INKSDFDAARSRGTHYQIINHMLYRED--DCMFP---F 133
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
G L +PD++I+ + D P + LP+F + DI +P W
Sbjct: 134 RCKGIEHFLLEVINKLPDMEIIINTRDWP---QAAVWGPALPIFSFSKTKNEMDIMYPAW 190
Query: 224 SFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
+FW G V L W+ + + SQ W++K A+++G+ + L+
Sbjct: 191 TFWEGGPAVWPIYPTGLGRWDLMREALDKKSQEWPWEKKESKAFFRGSRTSAERDPLVLL 250
Query: 277 KCNDSKLWGAEILR-QNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
+L A+ + Q W +A K+ L + C ++Y G A S K++
Sbjct: 251 SRKHPELADAQYTKNQAWKSDADTLHAPPAKEVPLEDHCQYKYLFNFRGVAASFRFKHLF 310
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
C ++ + ++++FF R L P ++ P+ + DL +++ ++++ AN + I + G
Sbjct: 311 VCKALVFHVGDEWQEFFYRALKPWVHYIPVET-DLS-NVRDLIEFAKANDGIVKGIAERG 368
Query: 392 QDF-MESLTMDRVYDYMLHLITEYSKLLDYK 421
F M+ L M + Y ++ +Y+ L +K
Sbjct: 369 YTFIMDHLRMPDIRCYWKKVLKKYASLAKWK 399
>gi|397516338|ref|XP_003828387.1| PREDICTED: KDEL motif-containing protein 2 isoform 1 [Pan paniscus]
Length = 406
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 50 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 107
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 108 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 165
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 166 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 224
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 225 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 284
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 285 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 340
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 341 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 375
>gi|189069279|dbj|BAG36311.1| unnamed protein product [Homo sapiens]
Length = 402
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 46 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 103
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 104 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 161
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 162 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 220
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 221 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 280
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 281 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 336
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 337 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 371
>gi|33327382|gb|AAQ09021.1| unknown protein [Homo sapiens]
Length = 507
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 76/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP+ I KD + I ++ +++ K F R IV + + Y
Sbjct: 151 QKTLSCPNKETQIAKDFASFPS--INLQQMLKEVPKSFGYERGAIVHYTILNNHVYRRSL 208
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 209 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 266
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 267 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 325
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 326 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 385
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 386 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 441
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 442 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|358056081|dbj|GAA97978.1| hypothetical protein E5Q_04658 [Mixia osmundae IAM 14324]
Length = 1130
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 115/284 (40%), Gaps = 28/284 (9%)
Query: 95 RRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPY 153
R + S Q + P F + W A+ I I + RILI R +LY+ +
Sbjct: 782 RHLTSAQCDEFFPGLFHEADRAASWWNARGGIHRSQIDRGTVASHARILIKRNRLYIKHF 841
Query: 154 YDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMD----KPVIDKKEHGSFPLPLFRY 209
+R I + + +PDV+++ D KP + G P +
Sbjct: 842 RPVTTTRVEAAIALIHEAILSSVEPIPDVELLLHLDDTGNSKPGVPMLVLGRRPSEELLW 901
Query: 210 CTNDAHFDIPFPDWSFWGWSE-VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVL 268
PD+ F+GW E + + + ++ SWK K P A+W+G
Sbjct: 902 L---------MPDFGFYGWPEAIRVGSYIHDYDQTLESETHASWKHKKPTAFWRGAS--- 949
Query: 269 SPLRVELMKCNDSKLWG-AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGY-AWSVS 326
LR N+S+ +G A+I +W + K G K C HRY I+ EG A+S
Sbjct: 950 LGLRDRQSLVNNSRPYGWADIAIVDWLKGPK-GILNPK--QTCAHRYLIHTEGLKAYSGR 1006
Query: 327 LKYILSCNSVALIISQQYKDFF-----SRGLIPTKNHFPIPSAD 365
LKY+L C SV+++ + F S P++N I +D
Sbjct: 1007 LKYLLLCRSVSIMRKLDFIQHFHPLLDSSPGSPSQNVIEIEGSD 1050
>gi|149730404|ref|XP_001493278.1| PREDICTED: KDEL motif-containing protein 1 [Equus caballus]
Length = 502
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 141/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
+CP+ I +DL + KRF + L + K+Y+ + + V R
Sbjct: 152 SCPETITQIQRDLAHFPTVDPEKIATEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PD + + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDAEFFVNLGDWP-LEKKKSSSHIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ G+ W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQGNTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYASLQVSEP 471
>gi|403272922|ref|XP_003928283.1| PREDICTED: KDEL motif-containing protein 1 [Saimiri boliviensis
boliviensis]
Length = 502
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 142/336 (42%), Gaps = 34/336 (10%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETISQIQRDLAHFPIVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ GS P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSGSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTALWRGRDS--RKERLELVKL 322
Query: 279 N-------DSKLWGAEILRQNWAEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
+ D+ + N E+ G K + H+Y+I +G + L Y
Sbjct: 323 SRKHPELIDAAFTNFFFFKHN---ESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPY 379
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKI 387
+L +SV L Y + F L P K++ P+ S +DL +K W + EA+KI
Sbjct: 380 LLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKI 435
Query: 388 GKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
KAGQ+F +L + ++ Y L EY+ L +P
Sbjct: 436 AKAGQEFARNNLMGNDIFCYYFKLFQEYASLQVSEP 471
>gi|315044371|ref|XP_003171561.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311343904|gb|EFR03107.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 422
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 127/301 (42%), Gaps = 23/301 (7%)
Query: 97 VASPQKVQTC-PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYY 154
V Q+ Q PD F + + + A + IT I + +R +I +LYV
Sbjct: 72 VFDSQRCQAAFPDLFVEVDRAKQVRAGTPITFAEIDSVTPKNGYIRAMIYDQQLYVIAVN 131
Query: 155 DCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA 214
+ SR T+ + + P +P+++ +F+ DK +D + + Y +
Sbjct: 132 GTIYSREYATLHALHRAIVSSPDPLPNIEFVFNTDDK--VDS-------VAQWGYARREE 182
Query: 215 HFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
++ PD+ +W W E + E + Q +W EK+P +W+G L +R
Sbjct: 183 DTNLWLMPDFGYWSWPETKVGTMQEVRMKAEQEEQTWTWSEKIPKLFWRGATMDLE-VRE 241
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+L++ ++ W A+I W + + C +Y EG ++S LKY+ SC
Sbjct: 242 KLIEVTKNQTW-ADIKPITWRDNESMQNDLKSMPEHCQFKYLAQTEGNSYSGRLKYLQSC 300
Query: 334 NSVALIISQ---QYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
NSV + S Q++ + P +N+ + +DL I+ W + +A +I
Sbjct: 301 NSVVISHSLEWIQHQSPLMKSSGPDQNYVEVRRDWSDLYEKIQ----WLEEHEEDARRIA 356
Query: 389 K 389
+
Sbjct: 357 R 357
>gi|119629474|gb|EAX09069.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_c [Homo sapiens]
Length = 503
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 139/331 (41%), Gaps = 24/331 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV++ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVELFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSR 324
Query: 279 NDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A + + ++ + K + H+Y+I +G + L Y+L +
Sbjct: 325 KHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
SV L Y + F L P K++ P+ S +DL +K W + EA+KI KAGQ
Sbjct: 385 SVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+F +L D ++ Y L EY+ L +P
Sbjct: 441 EFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|12654903|gb|AAH01297.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Homo sapiens]
gi|119629472|gb|EAX09067.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_a [Homo sapiens]
gi|123980608|gb|ABM82133.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [synthetic construct]
gi|123995429|gb|ABM85316.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [synthetic construct]
Length = 502
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 137/326 (42%), Gaps = 24/326 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV++ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVELFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSR 324
Query: 279 NDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A + + ++ + K + H+Y+I +G + L Y+L +
Sbjct: 325 KHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
SV L Y + F L P K++ P+ S +DL +K W + EA+KI KAGQ
Sbjct: 385 SVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKL 417
+F +L D ++ Y L EY+ L
Sbjct: 441 EFARNNLMGDDIFCYYFKLFQEYANL 466
>gi|176866369|ref|NP_076994.2| KDEL motif-containing protein 1 precursor [Homo sapiens]
gi|74749382|sp|Q6UW63.1|KDEL1_HUMAN RecName: Full=KDEL motif-containing protein 1; AltName:
Full=Endoplasmic reticulum resident protein 58; Short=ER
protein 58; Short=ERp58; Flags: Precursor
gi|37183036|gb|AAQ89318.1| BK158_1 [Homo sapiens]
Length = 502
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 139/331 (41%), Gaps = 24/331 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV++ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVELFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSR 324
Query: 279 NDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A + + ++ + K + H+Y+I +G + L Y+L +
Sbjct: 325 KHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
SV L Y + F L P K++ P+ S +DL +K W + EA+KI KAGQ
Sbjct: 385 SVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+F +L D ++ Y L EY+ L +P
Sbjct: 441 EFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|126337303|ref|XP_001365622.1| PREDICTED: KDEL motif-containing protein 1 [Monodelphis domestica]
Length = 500
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 140/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP I +DL P+ + +RF + L + K+Y+ + + V R
Sbjct: 150 NCPQSIPQIQRDLAPFPTVDPDKIAVEIPQRFGQRQSLCHYTLKDNKIYIKTHGEHVGFR 209
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P+ KK + + P+F +C + DI
Sbjct: 210 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWPLEKKKSNANLH-PIFSWCGSTDSKDIVM 265
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 266 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWENKNTTALWRGRDS--RKERLELVKL 320
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 321 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 380
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
NSV L Y + F L P K++ P+ S +DL ++ W + EA+ I KA
Sbjct: 381 GNSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLQ----WAKDHDEEAKAIAKA 436
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F ++L D ++ Y L EY+ L +P
Sbjct: 437 GQEFARDNLMGDNIFCYYFKLFQEYASLQVTEP 469
>gi|355697428|gb|AES00667.1| KDEL containing 1 [Mustela putorius furo]
Length = 502
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 86/335 (25%), Positives = 143/335 (42%), Gaps = 32/335 (9%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRIL----IVRGKLYVDPYYDCVQ 158
CP I +DL + + I E KRF + L + K+Y+ + + V
Sbjct: 152 NCPQTITQIQRDLAYFPA--VDPEKIAEEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVG 209
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
R +F L L R+ +PDV+ + D P ++KK+ GS P+F +C + DI
Sbjct: 210 FR-IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSGSHIHPIFSWCGSTDSKDI 265
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
P + ++ L+ D+ + W+ K A W+G R+EL+
Sbjct: 266 VMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWERKNSTAVWRGRDS--RKERLELV 320
Query: 277 KCNDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYI 330
K + + N+ +E+ G K + H+Y+I +G + L Y+
Sbjct: 321 KLSRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYL 380
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
L +SV L Y + F L P K++ P+ S +DL ++ W + EA+KI
Sbjct: 381 LVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLQ----WAKDHDEEAKKIA 436
Query: 389 KAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
KAGQ+F +L D ++ Y L EY+ L +P
Sbjct: 437 KAGQEFARNNLMGDDIFCYYFKLFQEYASLQVREP 471
>gi|410947658|ref|XP_003980560.1| PREDICTED: KDEL motif-containing protein 1 [Felis catus]
Length = 502
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 140/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETITQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSTSHIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|296188942|ref|XP_002742569.1| PREDICTED: KDEL motif-containing protein 1 [Callithrix jacchus]
Length = 502
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 141/336 (41%), Gaps = 34/336 (10%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPIVDPEKIAVEIPKRFGQRQSLCHYTVKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMLSVQANTGPPWESKNSTALWRGRDS--RKERLELVKL 322
Query: 279 N-------DSKLWGAEILRQNWAEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
+ D+ + N E+ G K + H+Y+I +G + L Y
Sbjct: 323 SRKHPELIDAAFTNFFFFKHN---ESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPY 379
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKI 387
+L +SV L Y + F L P K++ P+ S +DL +K W + EA+KI
Sbjct: 380 LLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKI 435
Query: 388 GKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
KAGQ+F +L D ++ Y L EY+ L +P
Sbjct: 436 AKAGQEFARNNLMGDDIFCYYFKLFQEYASLQVTEP 471
>gi|332375520|gb|AEE62901.1| unknown [Dendroctonus ponderosae]
Length = 508
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/267 (24%), Positives = 119/267 (44%), Gaps = 20/267 (7%)
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
MF+ L L R+ VPD++ + D P+ +H LP+F +C + DI P
Sbjct: 220 MFSDSILLSLTRK--AAVPDLEFFVNLGDWPL--SAQHLEDKLPIFSWCGSADTNDIIMP 275
Query: 222 DWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
+ +E L+ D+ G+ W ++ +W+G L + +
Sbjct: 276 TYEL---TESALENMGRVMLDMLSVQGNGEFPWNQRQSKLFWRGRDSNRHRLDLISLSRK 332
Query: 280 DSKLWGAEILRQNWAEEAKD--GFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+L+ + + + +D G K +S +++Y++ +G ++L S
Sbjct: 333 HPELFNVSLTNFFFYRDEEDVYGPKTDHVSFFRFFDYKYQLAIDGTVAPYRTPFLLGGGS 392
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+ +Y ++F R L P ++ P+ S +DL I+ W AN +EAEKI + GQ+
Sbjct: 393 LIFKPHSKYYEYFYRDLQPNTHYVPVESNLSDLIERIQ----WAKANDAEAEKIARNGQE 448
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKLLD 419
F E+L ++ Y HL+ E SK+L+
Sbjct: 449 FANENLLPQHIFCYYFHLLNELSKVLE 475
>gi|291383938|ref|XP_002708527.1| PREDICTED: KDEL (Lys-Asp-Glu-Leu) containing 2 [Oryctolagus
cuniculus]
Length = 509
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ ++ KRF R IV + + Y
Sbjct: 153 QKTLSCPTKEPQIAKDFASFPS--INLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSL 210
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ ++ + + P+P+ +C +
Sbjct: 211 GKYTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRRVNETPGPIPIISWCGSQ 268
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 269 DSRDIILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQL 327
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 328 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYL 387
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 388 MLGDSLVLKQDSPYYEHFYVALKPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 443
Query: 389 KAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
K GQ E L R+Y Y ++ +Y++ KP
Sbjct: 444 KEGQLMARELLQPHRLYCYYYRVLQKYAERQSSKP 478
>gi|156401513|ref|XP_001639335.1| predicted protein [Nematostella vectensis]
gi|156226463|gb|EDO47272.1| predicted protein [Nematostella vectensis]
Length = 497
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 142/329 (43%), Gaps = 22/329 (6%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSRA 161
CP + I +DL + + + RF L I+ K+Y + + V +
Sbjct: 150 CPQNYSQIDRDLARFPEINLIRLAKEAVDRFGVHHALCHYSIINNKVYRKSHGEHV-GFS 208
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
MF+ L R+ +PD++ + D P+ +K++ P+P+ +C ++ DI P
Sbjct: 209 MFSDAIIHSLARKV--HLPDMEFFVNLGDWPLEKRKDN---PIPILSWCGSEDTADIVMP 263
Query: 222 DWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
+ +E L+ D+ + WK+K+P A+W+G L + +
Sbjct: 264 TYDL---TEAALETMGRVSLDMLSVQANTGPKWKDKIPKAFWRGRDSREERLNLVINGRK 320
Query: 280 DSKLWGAEILRQNW--AEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+L+ + + +E K G KK +S N ++Y++ +G + Y++ ++
Sbjct: 321 KPELYDVALTNFFFFPYDEKKYGPKKQHVSFFNFFKYKYQLNIDGTVAAYRFPYLMGGDA 380
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ L Y + F + L P ++ P DL ++ + W AN +A+KI + Q+F
Sbjct: 381 LVLKQDSPYYEHFYKELKPWVHYVPFKR-DLS-DLEERLKWAIANDDKAQKIARQAQEFA 438
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKPA 423
E+L V+ Y L EY+K KP
Sbjct: 439 RENLQSKDVFCYHWTLFKEYAKRQTTKPV 467
>gi|115496800|ref|NP_001069153.1| KDEL motif-containing protein 1 precursor [Bos taurus]
gi|111304522|gb|AAI19854.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
gi|296481621|tpg|DAA23736.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
Length = 502
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 82/328 (25%), Positives = 138/328 (42%), Gaps = 28/328 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ F IH+DL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHYSLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSSPHIHPIFSWCGSTDSRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRRE--RLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL ++ W + EA+KI K
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLQ----WAKDHDEEAKKIAKT 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKL 417
GQ+F +L D ++ Y L EY+ L
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYASL 466
>gi|83770071|dbj|BAE60206.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 460
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/277 (24%), Positives = 122/277 (44%), Gaps = 32/277 (11%)
Query: 169 LQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWG 227
L L R PD D + + + V+ +++ S P++ Y + + ++ PD+ +W
Sbjct: 170 LNSLHRALASFPDRDRLPNV--EFVLTTEDYSSGEGPIWSYSKREENTNVWLMPDFGYWS 227
Query: 228 WSEVNLQPWNEEFKDIKH--------------GSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
W EV + P+ + + I G Q + K++L W+GN +R
Sbjct: 228 WPEVGVGPYKDARRRIAAIDDGEVTVDGQVIPGMQFQDKKKQL---VWRGNVATNPQVRG 284
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+L+K + W A IL +W +E F + C + + + EG ++S KY+L+C
Sbjct: 285 KLLKAAQGRSW-ASILAIDWGDENDIRFNLLPIEEHCRYMFLAHTEGRSFSGRGKYLLNC 343
Query: 334 NSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
SV + +++ LI P N+ + +DL I+ ++D NP AE+I
Sbjct: 344 RSVVISHKLVWREAHHAALISSGPEANYVEVERDFSDLDHKIEFLID----NPEAAERIA 399
Query: 389 KAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKPA 423
+ F + LT Y HLI +Y+ +++P
Sbjct: 400 NNSVKTFRDRYLTPAAESCYWRHLIRQYASSSEFEPV 436
>gi|405965272|gb|EKC30658.1| KTEL motif-containing protein 1 [Crassostrea gigas]
Length = 400
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 143/310 (46%), Gaps = 28/310 (9%)
Query: 113 IHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGFLQ 170
I +DL PW K+ IT AK IV KLY + C+ +R L+
Sbjct: 77 IEEDLTPW-KNGITEEVFNTAKSNNYGSHYQIVNHKLYREE--GCMFPARCSGNEHFILE 133
Query: 171 LLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC-TNDAHFDIPFPDWSFW-GW 228
++++ +PD++ + + D P I ++ P+P+F + DI +P W+FW G
Sbjct: 134 VIKK----LPDMEFVINTRDWPQISSRQQ---PIPVFSFSKVRGQSLDIMYPAWTFWEGG 186
Query: 229 SEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSK 282
V L W+E+ + I +++ W++K +++G+ + L+ ++ +
Sbjct: 187 PAVWPIYPTGLGRWDEQREIIPRAAKSWPWEKKKNKGFFRGSRTSSERDPLILLSRSNPE 246
Query: 283 LWGAEILR-QNWAEEA----KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
L AE + Q W EA + K+ +L + C+++Y G A S K++ C+SV
Sbjct: 247 LVDAEYTKNQAWKSEADTLHRPPAKEIRLEDHCDYKYLFNFRGVAASFRFKHLFLCDSVV 306
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-E 396
+ + +FF + P ++ P+ DL ++ ++++ N +I K G+ F+ +
Sbjct: 307 FHVGSAWLEFFYPAMKPWVHYIPV-KEDLS-DVRDLLEFAKENDDVVHEIAKRGRQFIWD 364
Query: 397 SLTMDRVYDY 406
L M+ V Y
Sbjct: 365 HLRMEDVTCY 374
>gi|426236645|ref|XP_004012278.1| PREDICTED: KDEL motif-containing protein 1 [Ovis aries]
Length = 502
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 135/326 (41%), Gaps = 24/326 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ F IH+DL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSSPHIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSR 324
Query: 279 NDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A + + ++ + K + H+Y+I +G + L Y+L +
Sbjct: 325 KHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYLLVGD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
SV L Y + F L P K++ PI S +DL ++ W + EA+KI K GQ
Sbjct: 385 SVVLKQDSIYYEHFYNELQPWKHYIPIKSNLSDLLEKLQ----WAKDHDEEAKKIAKTGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKL 417
+F +L D ++ Y L EY+ L
Sbjct: 441 EFARNNLMGDDIFCYYFKLFQEYASL 466
>gi|114650583|ref|XP_522713.2| PREDICTED: KDEL motif-containing protein 1 [Pan troglodytes]
gi|397524243|ref|XP_003832113.1| PREDICTED: KDEL motif-containing protein 1 [Pan paniscus]
gi|410214980|gb|JAA04709.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Pan troglodytes]
gi|410248076|gb|JAA12005.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Pan troglodytes]
gi|410296546|gb|JAA26873.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Pan troglodytes]
gi|410329569|gb|JAA33731.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Pan troglodytes]
Length = 502
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 142/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDS--KLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ +L A + + ++ + K + H+Y+I +G + L Y+L
Sbjct: 323 SRKYPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|426375907|ref|XP_004054757.1| PREDICTED: KDEL motif-containing protein 1 [Gorilla gorilla
gorilla]
Length = 502
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 136/326 (41%), Gaps = 24/326 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSR 324
Query: 279 NDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A + + ++ + K + H+Y+I +G + L Y+L +
Sbjct: 325 KHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
SV L Y + F L P K++ P+ S +DL +K W + EA+KI KAGQ
Sbjct: 385 SVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKL 417
+F +L D ++ Y L EY+ L
Sbjct: 441 EFARNNLMGDDIFCYYFKLFQEYANL 466
>gi|297690135|ref|XP_002822481.1| PREDICTED: KDEL motif-containing protein 2 [Pongo abelii]
Length = 507
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/333 (22%), Positives = 138/333 (41%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R IV + + Y +
Sbjct: 151 QKTLSCPTREPQITKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGK 210
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 211 YTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPVISWCGSLDS 268
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 269 RDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQLVQ 327
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 328 LSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLML 387
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL +K W N A+KI K
Sbjct: 388 GDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEAAKKIAKE 443
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 444 GQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>gi|302660285|ref|XP_003021823.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
gi|291185740|gb|EFE41205.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
Length = 420
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 68/318 (21%), Positives = 136/318 (42%), Gaps = 20/318 (6%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
P F I + + A+ I++ I + +R +I +LYV + SR + T+
Sbjct: 81 PGLFAEIDRAKQERAQRPISLAEIDSVIPKNGYIRAMIYDQQLYVIAVNGTIYSREIATL 140
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWS 224
+ + P +P+++ +F+ DK +D P+ + Y + ++ PD+
Sbjct: 141 HALHRAIVSSPDPLPNIEFVFNTDDK--VD-------PVAQWGYARREQDTNLWLMPDFG 191
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
+W W E + E + +W +K+P +W+G L +R +L++ + W
Sbjct: 192 YWSWPETKVGTMQEVQMGAEREEHTWAWSKKIPQLFWRGATMDLE-VREKLIQVTRGQPW 250
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ-- 342
A++ W + + C +Y EG ++S LKY+ SCNSV + S
Sbjct: 251 -ADVKPITWRDNDSMQNDLKSMPEHCRFKYLAQTEGNSYSGRLKYLQSCNSVVISHSLEW 309
Query: 343 -QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG-QDFMES-LT 399
Q++ + P +N+ + ++ ++W + +A++I + + F E LT
Sbjct: 310 IQHQSPLMKSSGPDQNYVEVRRD--WSDLREKIEWLEEHEQDAKRIARNNIRTFREQYLT 367
Query: 400 MDRVYDYMLHLITEYSKL 417
Y HLI ++K+
Sbjct: 368 PAAEVCYWRHLIRSWAKV 385
>gi|238495869|ref|XP_002379170.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
gi|220694050|gb|EED50394.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
Length = 302
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 122/276 (44%), Gaps = 32/276 (11%)
Query: 169 LQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWG 227
L L R PD D + + + V+ +++ S P++ Y + + ++ PD+ +W
Sbjct: 12 LNSLHRALASFPDRDRLPNV--EFVLTTEDYSSGEGPIWSYSKREENTNVWLMPDFGYWS 69
Query: 228 WSEVNLQPWNEEFKDIKH--------------GSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
W EV + P+ + + I G Q + K++L W+GN +R
Sbjct: 70 WPEVGVGPYKDARRRIAAIDDGEVTVDGQVIPGMQFQDKKKQL---VWRGNVATNPQVRG 126
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+L+K + W A IL +W +E F + C + + + EG ++S KY+L+C
Sbjct: 127 KLLKAAQGRSW-ASILAIDWGDENDIRFNLLPIEEHCRYMFLAHTEGRSFSGRGKYLLNC 185
Query: 334 NSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
SV + +++ LI P N+ + +DL I+ ++D NP AE+I
Sbjct: 186 RSVVISHKLVWREAHHAALISSGPEANYVEVERDFSDLDHKIEFLID----NPEAAERIA 241
Query: 389 KAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
+ F + LT Y HLI +Y+ +++P
Sbjct: 242 NNSVKTFRDRYLTPAAESCYWRHLIRQYASSSEFEP 277
>gi|297694379|ref|XP_002824457.1| PREDICTED: KDEL motif-containing protein 1 [Pongo abelii]
Length = 502
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 142/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
+CP+ I +DL + + +RF + L + K+Y+ + + V R
Sbjct: 152 SCPETIAQIQRDLAHFPAVDPEKIAVEIPQRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEQLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|395527321|ref|XP_003765798.1| PREDICTED: KDEL motif-containing protein 1 [Sarcophilus harrisii]
Length = 502
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 137/331 (41%), Gaps = 24/331 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP I +DL P+ + +RF + L + K+Y+ + + V R
Sbjct: 152 NCPQTIPQIQRDLAPFPTVDPDKIAVEIPQRFGQRQSLCHYTLKDNKIYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P+ +K + + P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWPLEKRKSNANLH-PIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNTTALWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
NSV L Y + F + L P K++ P+ S I + W + EA+ I KAGQ
Sbjct: 383 GNSVVLKQDSIYYEHFYKELQPWKHYIPVKSN--LSDILEKLQWAKDHDEEAKAIAKAGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+F +L D ++ Y L EY+ L +P
Sbjct: 441 EFARNNLMGDNIFCYYFKLFQEYASLQVTEP 471
>gi|145250763|ref|XP_001396895.1| hypothetical protein ANI_1_1376134 [Aspergillus niger CBS 513.88]
gi|134082418|emb|CAK42432.1| unnamed protein product [Aspergillus niger]
Length = 442
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 80/342 (23%), Positives = 143/342 (41%), Gaps = 41/342 (11%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPY--YDCVQSRAMF 163
P F + K RIT + + + +R ++ G+LYV + +R
Sbjct: 91 PKLFVELDKSASLREGHRITFKELDSLQVEDGMVRGIVHHGELYVVDFGAMPATFTRGEA 150
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPD 222
T+ + L +P DI F V+ +++ + P++ Y D ++ PD
Sbjct: 151 TLHALHRALASFPDRSSLPDIEF------VLTTEDYSTNEGPVWSYSKQDEQENVWLMPD 204
Query: 223 WSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKE---------KLPFA------YWKGNPDV 267
+ +W W EV + P +KDI+ A E LPF W+G+
Sbjct: 205 FGYWSWPEVKIGP----YKDIRRRIAAVDDGEVTIDGGSTPGLPFQDKKKQLAWRGSVAT 260
Query: 268 LSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSL 327
+R +L+K + W A I +W E F + + C + + +AEG ++S
Sbjct: 261 NPEIRGKLLKAAQGRSW-ASIRVIDWDNENDVRFNLLPMEDYCQYMFLAHAEGRSFSGRG 319
Query: 328 KYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPS 382
KY+L+C SV + +++ LI P N+ + +DL R ++ ++D NP
Sbjct: 320 KYLLNCRSVVVSHKLVWREAHHAALIASGPEANYVEVERDFSDLDRKMEFLID----NPE 375
Query: 383 EAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
A++I + + F + LT Y LI +Y+ +Y+P
Sbjct: 376 TAQRIAENSVKTFRDRYLTPAAESCYWRQLIRQYAASCEYEP 417
>gi|451855485|gb|EMD68777.1| hypothetical protein COCSADRAFT_187637 [Cochliobolus sativus
ND90Pr]
Length = 432
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 69/303 (22%), Positives = 136/303 (44%), Gaps = 30/303 (9%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYV------DPYYDCVQSR 160
P + S+ + + ++ + I K LR+LI G+L+V + Y +
Sbjct: 72 PKLYHSLDESVRLRQGRKVHAQDIEIEKGRCMLRVLIYEGELFVVDAGMPEKCYVTNGNE 131
Query: 161 AMFTIWGFLQLLRRYPGM------VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA 214
+ Q+ R +P+++ D P +K+ G+F F Y D
Sbjct: 132 RERILGTLAQMDRAVTSASTSDPAIPNIEFSLSLDDLPRRSRKD-GTF----FGYTRKDT 186
Query: 215 --HFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+ DI P++++W W+ + WN K+I+ + W +K P W+G ++ L
Sbjct: 187 PEYRDIWMMPNYAYWAWNYTHAPSWNSIRKEIEQKEKDLPWDKKDPRVVWRGKIK-MAKL 245
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS--KLSNQCNHRYKIYAEGYAWSVSLKY 329
R EL+K ++ K W ++ A D K L + C+++Y + EG ++S LKY
Sbjct: 246 RKELVKVSEGKEWSD--IKPVVINNASDPHTKDVMNLRDFCSYKYTVQTEGTSYSGRLKY 303
Query: 330 ILSCNSVALIISQQYKDFFS---RGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
+ C S + ++++F + R P N+ I +++ +++S +++ + +EAE+
Sbjct: 304 LQLCRSAMITHPLEWQEFHTHLMRLAGPEVNY--IEASENFGNLESAMEYYREHDNEAEQ 361
Query: 387 IGK 389
I K
Sbjct: 362 IAK 364
>gi|344287853|ref|XP_003415666.1| PREDICTED: KDEL motif-containing protein 2 [Loxodonta africana]
Length = 507
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 138/333 (41%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R IV + + Y +
Sbjct: 151 QKTLSCPTKEPQIEKDFLSFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGK 210
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PDV+ + D P+ +K + + P+PL +C +
Sbjct: 211 YTDFKMFSDEILLSLARKV--LLPDVEFYINVGDWPLEHRKVNETPGPIPLISWCGSLDS 268
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 269 RDVILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQLVQ 327
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 328 LSRENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLML 387
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P ++ PI +DL +K W N EA+KI K
Sbjct: 388 GDSLVLKQDSPYYEHFYTALTPWIHYVPIKRNLSDLLEKVK----WAKENDEEAKKIAKE 443
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 444 GQLTARDLLQPHRLYCYYYRVLQKYAERQSSKP 476
>gi|291393202|ref|XP_002713063.1| PREDICTED: KDEL (Lys-Asp-Glu-Leu) containing 1 [Oryctolagus
cuniculus]
Length = 501
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 140/343 (40%), Gaps = 49/343 (14%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPATIPQIQKDLAHFPAVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++K++ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKQKSNSNIHPIFSWCGSMDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKG--------------- 263
P + ++ L+ D+ + W+ K A W+G
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRRERLELVKLSR 324
Query: 264 -NPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYA 322
+PD++ +D L+G + ++ D FK H+Y+I +G
Sbjct: 325 KHPDLIDAAFTNFFFKHDESLYGPIVKHISFF----DFFK---------HKYQINIDGTV 371
Query: 323 WSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNAN 380
+ L Y+L +SV L Y + F L P K++ P+ S +DL ++ W +
Sbjct: 372 AAYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLQ----WAKDH 427
Query: 381 PSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
EA+KI KAGQ+F +L D ++ Y L EY+ LL +P
Sbjct: 428 DEEAKKIAKAGQEFARNNLMGDDIFCYYFKLFQEYANLLVSEP 470
>gi|440892542|gb|ELR45695.1| KDEL motif-containing protein 2, partial [Bos grunniens mutus]
Length = 448
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 140/338 (41%), Gaps = 22/338 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CPD I KD + + I KRF R IV + + Y +
Sbjct: 86 QKTLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGK 145
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 146 YTDFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETPGPLPIISWCGSLDS 203
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQA-----KSWKEKLPFAYWKGNPDVLSP 270
D+ P + + L+ + I+ + SW K A+++G
Sbjct: 204 QDVILPTYDITHSTLEALRGVTNDLLSIQGNTDGLKHLGPSWINKTEKAFFRGRDSREER 263
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSL 327
L++ + + +L A I + +E + K+KL + ++Y++ +G +
Sbjct: 264 LQLVQLSKENPQLLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRY 323
Query: 328 KYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
Y++ +S+ L Y + F L P K++ PI +DL V+W N EA+
Sbjct: 324 PYLMLGDSLVLKQDSPYYEHFYMALKPWKHYIPIKRNLSDLLEK----VEWAKENDEEAK 379
Query: 386 KIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
KI K GQ L R+Y Y ++ +Y++ KP
Sbjct: 380 KIAKEGQLTARDLLQPHRLYCYYYTVLQKYAERQLSKP 417
>gi|71002250|ref|XP_755806.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|66853444|gb|EAL93768.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|159129863|gb|EDP54977.1| DUF821 domain protein [Aspergillus fumigatus A1163]
Length = 435
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/283 (24%), Positives = 120/283 (42%), Gaps = 30/283 (10%)
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
SRA T+ + ++ +P DI F V+ + + P P++ Y D I
Sbjct: 135 SRAKATLNSLHRAMQSFPDRHNLPDIEF------VLTADDFSNVPGPVWSYSKRDEDESI 188
Query: 219 -PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAK-----------SWKEKLPFAYWKGNPD 266
PD+ +W W EV + P+ + + I + + ++EK W+G+
Sbjct: 189 WLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEGEVRPDGSLVPRMEFREKKKQLVWRGSVA 248
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
LR +L+K + W A I +W E + + C + + + EG ++S
Sbjct: 249 TNPELRGKLLKAAQGRSW-ASIRVIDWDNENDVRYNLLPMEEHCRYMFLAHTEGRSFSGR 307
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANP 381
KY+L+C SV + +++ L+ P N+ + +DL R I ++D NP
Sbjct: 308 GKYLLNCRSVVVSHKLVWREAHHAALVASGPEANYVEVERDFSDLDRKISFLID----NP 363
Query: 382 SEAEKIG-KAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
AE+I A + F + LT Y HLI +Y+ D+ P
Sbjct: 364 EIAEQIADNAVRTFRDRYLTPAAESCYWRHLIRQYAASCDFAP 406
>gi|410897261|ref|XP_003962117.1| PREDICTED: KDEL motif-containing protein 1-like [Takifugu rubripes]
Length = 505
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 153/370 (41%), Gaps = 44/370 (11%)
Query: 106 CPDFFKSIHKDLEPWAK---SRITMRHIME-AKRFAALRILIVRGKLYVDPYYDCVQSRA 161
CPD F I +DL + RI +R + + K+YV + + V R
Sbjct: 157 CPDSFPQIDRDLSAFPNVDPDRIAQEIPQRFGQRQSLCHYTVKDNKVYVTTFGEHVGFR- 215
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+FT L L R+ +PDV+ + D P ++K++ P+F +C ++ DI P
Sbjct: 216 IFTDSILLSLTRKV--WLPDVEFFVNLGDWP-LEKRKLTDKIHPIFSWCGSNNTQDIVMP 272
Query: 222 DWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
+ +E L+ D+ + W EK A+W+G R+EL+K +
Sbjct: 273 TYDL---TESVLETMGRVSLDMTSVQANTGPPWAEKNATAFWRGRDSRQE--RLELVKLS 327
Query: 280 DSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+ + N+ +E+ G K + ++Y+I +G + L Y+L+
Sbjct: 328 RAHPDMIDAAFTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINIDGTVAAYRLPYLLAG 387
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+SV L Y + F L P +++ P+ ADL + + W ++ E +KI AGQ
Sbjct: 388 DSVVLKQDSGYYEHFYNELRPWEHYIPV-RADLG-DLLDKIRWARSHDEEVKKIALAGQQ 445
Query: 394 FMESLTM-DRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAA 452
F + M D++ Y L +Y++L + +P+ RQ +E+
Sbjct: 446 FARNHLMGDKILCYYYKLFKQYAQLQ--------------------ITEPQVRQGMERVE 485
Query: 453 ASPSPYPPCT 462
PC+
Sbjct: 486 QPTDDLFPCS 495
>gi|327267957|ref|XP_003218765.1| PREDICTED: KDEL motif-containing protein 1-like [Anolis
carolinensis]
Length = 502
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 93/386 (24%), Positives = 152/386 (39%), Gaps = 74/386 (19%)
Query: 105 TCPDFFKSIHKDLE--PWAKSRITMRHIME--AKRFAALRILIVRGKLYVDPYYDCVQSR 160
+CP IH+DL P + I + +R + I K+Y+ + + V R
Sbjct: 153 SCPQSIPQIHRDLAHFPAVDPEKIAKEIPQRFGQRQSLCHYTIKDNKVYIKTHGEHVGFR 212
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PD++ + D P ++KK+ P+F +C ++ DI
Sbjct: 213 -IFMDAILLSLTRKVK--MPDLEFFVNLGDWP-LEKKKSPQDIHPIFSWCGSNDSKDIVM 268
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+EK A W+G R+EL+K
Sbjct: 269 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWEEKNATAIWRGRDS--RKERLELVKM 323
Query: 279 -------------------NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAE 319
+D L+G + ++ D FK ++Y+I +
Sbjct: 324 SRKYPDIIDAAFTNFFFFKHDESLYGPIVKHISFF----DFFK---------YKYQINID 370
Query: 320 GYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWG 377
G + L Y+L NSV L Y + F L P K++ P + +DL ++ W
Sbjct: 371 GTVAAYRLPYLLVGNSVVLKQDSIYYEHFYNELQPWKHYIPFKNDLSDLLEKLQ----WA 426
Query: 378 NANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESL 436
+ EA+ I KAGQ+F +L D ++ Y + L EYSKL
Sbjct: 427 KDHDEEAKNIAKAGQEFARNNLMGDHMFCYYVKLFQEYSKLQ------------------ 468
Query: 437 LCLADPKQRQNLEKAAASPSPYPPCT 462
L DPK R +EK PC
Sbjct: 469 --LNDPKVRDGMEKVEQPDEDLFPCN 492
>gi|395844016|ref|XP_003794762.1| PREDICTED: KDEL motif-containing protein 2 [Otolemur garnettii]
Length = 508
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 79/339 (23%), Positives = 147/339 (43%), Gaps = 30/339 (8%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRF-----AALRILIVRGKLY---V 150
QK+ +CP I KD + I ++ ++ KRF A + I+ +Y +
Sbjct: 152 QKILSCPTKEPQIAKDFASFPT--INLQQMLNEIPKRFGDENGAVVHYTILNNHIYRRSL 209
Query: 151 DPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRY 209
Y D MF+ L L R+ ++PD++ + D P+ +K + + P+P+ +
Sbjct: 210 GKYTDF----KMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETPSPVPIISW 263
Query: 210 CTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS 269
C + DI P + + ++ + I+ G+ SW K A+++G
Sbjct: 264 CGSLDSRDIILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREE 322
Query: 270 PLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVS 326
L++ + + +L A I + +E + K+KLS + ++Y++ +G +
Sbjct: 323 RLQLVQLSKENPQLLDAGITGYFFFQEKEKELGKAKLSGFFDFFKYKYQVNVDGTVAAYR 382
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEA 384
Y++ +S+ L Y + F L P K++ PI +DL +K W N EA
Sbjct: 383 YPYLMLGDSLVLKQDSPYYEHFYMALKPWKHYVPIKRNLSDLLEKVK----WAKENDKEA 438
Query: 385 EKIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
+KI K GQ L R+Y Y ++ +Y++ KP
Sbjct: 439 QKIAKEGQLAARDLLQPHRLYCYYYGVLQKYAEHQASKP 477
>gi|301764046|ref|XP_002917444.1| PREDICTED: KDEL motif-containing protein 2-like [Ailuropoda
melanoleuca]
Length = 446
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 143/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ ++ KRF R IV + + Y
Sbjct: 90 QKTLSCPTEEPQIAKDFASFPS--INLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSL 147
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 148 GKYTDFKMFSDEILLSLARKV--LLPDMEFYINLGDWPLEHRKVNETPGPLPIISWCGSL 205
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 206 DSRDVILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQL 264
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 265 VQLSKENPELLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYL 324
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA KI
Sbjct: 325 MLGDSLVLKQESTYYEHFYMALKPWKHYVPIKRNLSDLLEKVK----WAKENDEEARKIA 380
Query: 389 KAGQ-DFMESLTMDRVYDYMLHLITEYSKLLDYKP 422
K GQ E L R+Y Y ++ +Y++ +P
Sbjct: 381 KEGQLAARELLQPHRLYCYYYRVLQKYAERQSSRP 415
>gi|350636309|gb|EHA24669.1| hypothetical protein ASPNIDRAFT_182202 [Aspergillus niger ATCC
1015]
Length = 442
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 79/342 (23%), Positives = 144/342 (42%), Gaps = 41/342 (11%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPY--YDCVQSRAMF 163
P F + K RIT + + + +R ++ G+LYV + +R
Sbjct: 91 PKLFVELDKSASLREGHRITFKELDNLQVEDGMVRGIVHHGELYVVDFGAMPATFTRGKA 150
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPD 222
T+ + L +P DI F V+ +++ + P++ Y D ++ PD
Sbjct: 151 TLHALHRALASFPDRSSLPDIEF------VLTTEDYSTNEGPVWSYSKRDEQQNVWLMPD 204
Query: 223 WSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKE---------KLPFA------YWKGNPDV 267
+ +W W EV + P +KDI+ A E LPF W+G+
Sbjct: 205 FGYWSWPEVKIGP----YKDIRRRIAAVDDGEVTIDGGSTPGLPFQDKKKQLAWRGSVAT 260
Query: 268 LSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSL 327
+R +L+K + W A I +W +E F + + C + + +AEG ++S
Sbjct: 261 NPEIRGKLLKAAQGRSW-ASIRVIDWDDENDVRFNLLPMEDYCQYMFLAHAEGRSFSGRG 319
Query: 328 KYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPS 382
KY+L+C SV + +++ LI P N+ + +DL R ++ ++D NP
Sbjct: 320 KYLLNCRSVVVSHKLVWREAHHAALIASGPEANYVEVERDFSDLDRKMEFLID----NPE 375
Query: 383 EAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
A++I + + F + LT Y LI +Y+ +++P
Sbjct: 376 TAQRIAENSVKTFRDRYLTPAAESCYWRQLIQQYAASCEFEP 417
>gi|452004927|gb|EMD97383.1| hypothetical protein COCHEDRAFT_1190251 [Cochliobolus
heterostrophus C5]
Length = 432
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 71/304 (23%), Positives = 134/304 (44%), Gaps = 32/304 (10%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYV------DPYYDCVQSR 160
P + S+ + + ++ + I + LR+LI G+L+V + Y
Sbjct: 72 PKLYHSLDQSVRLREGRKVDAQDIQIEEGRCMLRVLIYEGELFVVDAGIPEKCY-VTNGN 130
Query: 161 AMFTIWGFLQLLRRYPG-------MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTND 213
I G L + R VP+++ D P +KE G+F F Y D
Sbjct: 131 ERERILGTLAQIDRAVTSASTSDPAVPNIEFSLSLDDLPRRSRKE-GTF----FGYTRKD 185
Query: 214 A--HFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
+ DI P++++W W+ + WN K+I+ + W +K P W+G ++
Sbjct: 186 TPEYKDIWMMPNYAYWAWNYTHAPSWNSIRKEIEQKEKELPWHKKDPRVVWRGKIK-MAK 244
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS--KLSNQCNHRYKIYAEGYAWSVSLK 328
LR EL+K ++ K W ++ A D K L + C ++Y + EG ++S LK
Sbjct: 245 LRKELVKVSEGKDWSD--IKPVVINNASDPHTKDVMNLRDFCGYKYTVQTEGTSYSGRLK 302
Query: 329 YILSCNSVALIISQQYKDFFS---RGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAE 385
Y+ C S + ++++F + R P N+ I +++ +++S +++ + +AE
Sbjct: 303 YLQLCRSAMITHPLEWQEFHTHLMRLAGPEVNY--IEASENFANLESAMEYYRDHDHDAE 360
Query: 386 KIGK 389
+I K
Sbjct: 361 QIAK 364
>gi|281345861|gb|EFB21445.1| hypothetical protein PANDA_005665 [Ailuropoda melanoleuca]
Length = 442
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 143/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ ++ KRF R IV + + Y
Sbjct: 86 QKTLSCPTEEPQIAKDFASFPS--INLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSL 143
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 144 GKYTDFKMFSDEILLSLARKV--LLPDMEFYINLGDWPLEHRKVNETPGPLPIISWCGSL 201
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 202 DSRDVILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQL 260
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 261 VQLSKENPELLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYL 320
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA KI
Sbjct: 321 MLGDSLVLKQESTYYEHFYMALKPWKHYVPIKRNLSDLLEKVK----WAKENDEEARKIA 376
Query: 389 KAGQ-DFMESLTMDRVYDYMLHLITEYSKLLDYKP 422
K GQ E L R+Y Y ++ +Y++ +P
Sbjct: 377 KEGQLAARELLQPHRLYCYYYRVLQKYAERQSSRP 411
>gi|355697431|gb|AES00668.1| KDEL containing 2 [Mustela putorius furo]
Length = 476
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 139/333 (41%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R IV + + Y +
Sbjct: 120 QKTLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSLGK 179
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHG-SFPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ ++ + PLP+ +C +
Sbjct: 180 YTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRQVNEIPGPLPIISWCGSLDS 237
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 238 RDIILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQLVQ 296
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL+ + ++Y++ +G + Y++
Sbjct: 297 LSKENPELLDAGITGYFFFQEKEKELGKAKLTGFFDFFKYKYQVNVDGTVAAYRYPYLML 356
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI +DL +K W N EA KI K
Sbjct: 357 GDSLVLKQDSMYYEHFYMALTPWKHYVPIKRNLSDLLEKVK----WAKENDGEARKIAKE 412
Query: 391 GQ-DFMESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ E L R++ Y ++ +Y++ KP
Sbjct: 413 GQLAARELLQPHRLFCYYYGVLQKYAERQSSKP 445
>gi|395833263|ref|XP_003789659.1| PREDICTED: KDEL motif-containing protein 1 [Otolemur garnettii]
Length = 502
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 136/325 (41%), Gaps = 24/325 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSRA 161
CP+ I +DL + KRF + L + K+Y+ + + V R
Sbjct: 153 CPETIPQIQRDLAHFPAVDPETIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI P
Sbjct: 212 IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSSSNIHPIFSWCGSTDSKDIVMP 268
Query: 222 DWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
+ ++ L+ D+ + W+ K A W+G L + +
Sbjct: 269 TYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTALWRGRDSRKERLELVKLSRK 325
Query: 280 DSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+L A + + ++ + K + H+Y+I +G + L Y+L +S
Sbjct: 326 HPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGDS 385
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
V L Y + F L P K++ P+ S +DL +K W + EA+KI KAGQ+
Sbjct: 386 VVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQE 441
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKL 417
F ++L D ++ Y L EY+ L
Sbjct: 442 FARDNLMGDDIFCYYFKLFQEYANL 466
>gi|402902420|ref|XP_003919616.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein 1
[Papio anubis]
Length = 502
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 140/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPTVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PD++ + D P+ KK + + P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDMEFFVNLGDWPLEKKKSNANIH-PIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PXYDL---TDSXLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|432106972|gb|ELK32490.1| KDEL motif-containing protein 2, partial [Myotis davidii]
Length = 442
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 138/333 (41%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q+ +CP I KD + + KRF R IV + + Y +
Sbjct: 86 QETLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGEERGAIVHYTILNNHIYRRPLGK 145
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 146 YTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETPGPLPIISWCGSLDS 203
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 204 RDIILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQLVQ 262
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 263 LAKENPQLLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYLML 322
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI ADL +K W N EA++I K
Sbjct: 323 GDSLVLKQDSPYYEHFYMALQPWKHYVPIKRNLADLLEKVK----WAKENDEEAKRIAKE 378
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ Y++ KP
Sbjct: 379 GQLTARDLLQPHRIYCYYYRVLQNYAERQSSKP 411
>gi|449484611|ref|XP_002197764.2| PREDICTED: KDEL motif-containing protein 2 [Taeniopygia guttata]
Length = 445
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 115/268 (42%), Gaps = 14/268 (5%)
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAHFDIPF 220
MF+ L L R+ +PDV+ + D PV +K + + PLP+ +C + DI
Sbjct: 154 MFSDEMLLSLARKV--RLPDVEFYLNVGDWPVEHRKANDTPGPLPVISWCGSLDSRDIVL 211
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
P + + L+ + I+ G+ W+ K A ++G L + + +
Sbjct: 212 PTYDVTHSTLETLRGVTNDLLSIQ-GNTGPFWENKTERALFRGRDSREERLHLVKLSKEN 270
Query: 281 SKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILSCNSVA 337
+L A I + E + K++L + ++Y++ +G + Y+L +S+
Sbjct: 271 PELLDAGITGYFFFREKEKELGKAQLMGFFDFFKYKYQVNIDGTVAAYRFPYLLLGDSLV 330
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
L QY + F GL P K++ P+ DL IK W N EA KI K GQ
Sbjct: 331 LKQDSQYYEHFYIGLKPWKHYVPVKRNLEDLLEKIK----WAKENDEEARKIAKEGQLMA 386
Query: 396 -ESLTMDRVYDYMLHLITEYSKLLDYKP 422
E L R Y Y ++ +Y++ KP
Sbjct: 387 RELLQPHRFYCYYYKVLQKYAERQASKP 414
>gi|296811324|ref|XP_002846000.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
gi|238843388|gb|EEQ33050.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
Length = 420
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 148/338 (43%), Gaps = 39/338 (11%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK------RFAALRILIVRGKLYVDPYYDC 156
Q C F + ++++ AK T RHI + + +R +I +LY+
Sbjct: 73 TQRCQSAFPGLFEEIDR-AKRERTYRHISRNEIDSITPKNGYIRAMIYNQQLYIIHRNGT 131
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
+ SR T+ + + P +P+++ +F+ DK +D + + Y
Sbjct: 132 IYSREYATLQALNRAIVSSPDPLPNIEFVFNTDDK--VDS-------VAQWGYARRPQDK 182
Query: 217 DI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ PD+ +W W E + E + Q +W +K+P +W+G L +R +L
Sbjct: 183 DMWLMPDFGYWSWPETKVGTMQEVQTKAEQEEQTWTWPKKIPKLFWRGATMGLE-VRDKL 241
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKS--KLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
++ + W A++ W + KD K + C +Y EG ++S LKY+ SC
Sbjct: 242 IEVTHGQPW-ADVKPIIWRD--KDSMKNDLRSMPEHCEFKYLAQTEGNSYSGRLKYLQSC 298
Query: 334 NSVALIISQ-----QYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
NSV +IS Q++ + P +N+ + +DL I+ W ++ ++A++
Sbjct: 299 NSV--VISHTLEWIQHQSPLMKSSGPEQNYVEVRRDWSDLHEKIQ----WLESHENDAKR 352
Query: 387 IGKAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
I + + F E LT Y HLI +S+ + +KP
Sbjct: 353 IAQNNVKTFREHYLTPAAEVCYWRHLIRSWSE-VSFKP 389
>gi|432949735|ref|XP_004084232.1| PREDICTED: KDEL motif-containing protein 2-like [Oryzias latipes]
Length = 518
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/267 (23%), Positives = 115/267 (43%), Gaps = 16/267 (5%)
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
MF+ L L R+ +PDV+ + D P+ + E +P+F +C + DI P
Sbjct: 228 MFSDEMLLSLTRKV--RLPDVEFFINVGDWPLETRTEGA---VPIFSWCGSVETRDIVLP 282
Query: 222 DWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS 281
+ + L+ + ++ G+ W K A+++G L++ LM +
Sbjct: 283 TYEVTHSTLETLRGVTNDLLSVQ-GNTGPVWANKTERAFFRGRDSREERLQLALMSKKNP 341
Query: 282 KLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILSCNSVAL 338
+L A I + E + K+ L + ++Y++ +G + Y++ NS+ L
Sbjct: 342 ELLDAGITAWFFFREREKHVGKAPLVGFFDFFQYKYQVNVDGTVAAYRFPYLMLGNSLVL 401
Query: 339 IISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM- 395
QY + F L ++ P+ +DL I+ W N +EAE+IG+AGQ
Sbjct: 402 KQDSQYYEHFYLHLKAGTHYVPVKRDLSDLLEKIQ----WARDNDAEAEEIGRAGQALAR 457
Query: 396 ESLTMDRVYDYMLHLITEYSKLLDYKP 422
E L R+Y Y + Y++ +P
Sbjct: 458 ELLQPTRLYCYYQSALQAYAERQTGRP 484
>gi|444723570|gb|ELW64221.1| KDEL motif-containing protein 2 [Tupaia chinensis]
Length = 470
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ ++ +RF R IV + + Y
Sbjct: 114 QKTLSCPIKEPQIAKDFASFPS--INLQQMLNEVPERFGDERGAIVHYTVLNNHIYRRSL 171
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K + + P+P+ +C +
Sbjct: 172 GKYTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETPGPIPIISWCGSL 229
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 230 DSRDIILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQL 288
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 289 VQLSKENPELLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNMDGTVAAYRYPYL 348
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F GL P K++ PI +DL IK W N EA+KI
Sbjct: 349 MLGDSLVLKQDSPYYEHFYMGLEPWKHYVPIKRNLSDLLEKIK----WAKENDEEAKKIA 404
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 405 KDGQLTARDLLQPHRLYCYYYRVLQKYAQRQSSKP 439
>gi|83771305|dbj|BAE61437.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391868002|gb|EIT77226.1| hypothetical protein Ao3042_06683 [Aspergillus oryzae 3.042]
Length = 592
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 81/177 (45%), Gaps = 18/177 (10%)
Query: 220 FPDWSFWGWSE--VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
PD+ FW W ++ P+++ + +K W +K P W+G P LR LM
Sbjct: 356 MPDFGFWAWDNPRNSIGPFDQVVERVKRAD--IPWSQKTPQLVWRGKPSFAPKLRRALMD 413
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
K WG ++ + NW E + + C + + + EG ++S SLKY +CNSV
Sbjct: 414 AARDKPWG-DVKQVNWFERTN----IMSMEDHCRYMFIAHVEGRSYSASLKYRQACNSVI 468
Query: 338 LIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+ QY L+ P +N+ + +DL I+ ++D +PS A++I
Sbjct: 469 VAHKLQYIQHHHYLLVPDGPNQNYIEVERDFSDLESKIEPLLD----DPSTAQRIAN 521
>gi|380816446|gb|AFE80097.1| KDEL motif-containing protein 1 precursor [Macaca mulatta]
Length = 502
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 138/328 (42%), Gaps = 28/328 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPTVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PD++ + D P+ KK + + P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDMEFFVNLGDWPLEKKKSNANIH-PIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKL 417
GQ+F +L D ++ Y L EY+ L
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANL 466
>gi|165993289|emb|CAP71956.1| unnamed protein product [Danio rerio]
Length = 500
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 144/338 (42%), Gaps = 30/338 (8%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRIL----IVRGKLYVDPYYD 155
+K CP F I DL + KS R+ E +RF L I ++Y+ + +
Sbjct: 147 EKNMHCPASFSQIESDLSIF-KSVDPDRNAHEVIQRFGKSHSLCHYTIKNNQVYIKTHGE 205
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
V R +F L L R+ +PD++ + D P ++K+ P P+F +C ++
Sbjct: 206 HVGFR-IFMDAFLLSLTRKVK--LPDIEFFVNLGDWP-LEKRRASQNPSPVFSWCGSNDT 261
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + +E L+ D+ G W++K+ +W+G R+
Sbjct: 262 RDIVMPTYDL---TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDS--RKERL 316
Query: 274 ELMKCNDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSL 327
EL+K + + N+ +E+ G K + ++Y+I +G + L
Sbjct: 317 ELVKLARANTAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINVDGTVAAYRL 376
Query: 328 KYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
Y+L+ +SV Y + F L P ++ P S +DL I+ W + EA+
Sbjct: 377 PYLLAGDSVVFKHDSIYYEHFYNELQPWVHYIPFRSDLSDLLEKIQ----WAKDHDEEAK 432
Query: 386 KIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
KI AGQ F + M D V+ Y L +Y++L KP
Sbjct: 433 KIALAGQQFARTHLMGDSVFCYCHKLFQKYAELQVTKP 470
>gi|62896693|dbj|BAD96287.1| BK158_1 (OTTHUMP00000040718) variant [Homo sapiens]
Length = 502
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 138/331 (41%), Gaps = 24/331 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I + L + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRGLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV++ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVELFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSR 324
Query: 279 NDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A + + ++ + K + H+Y+I +G + L Y+L +
Sbjct: 325 KHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
SV L Y + F L P K++ P+ S +DL +K W + EA+KI KAGQ
Sbjct: 385 SVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+F +L D ++ Y L EY+ L +P
Sbjct: 441 EFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|350634920|gb|EHA23282.1| hypothetical protein ASPNIDRAFT_174284 [Aspergillus niger ATCC
1015]
Length = 465
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 101/217 (46%), Gaps = 23/217 (10%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEV--NLQPWN 237
D++ +F DK ++ + P+ L R T +A + +P D+ FW W + ++ P++
Sbjct: 193 DIEFVFSVEDK--VEDVTNSDHPVWTLARSATEEAVWLMP--DFGFWAWENIQNSIGPYD 248
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
+ IK W EK P W+G P LR LM+ + WG ++ + +W +
Sbjct: 249 QVVNRIKRAD--IPWSEKKPQLVWRGKPSFAPKLRRALMEAARDQPWG-DVKQVDWDQRT 305
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---P 354
+ + C + + + EG ++S SLKY +CNSV + QY L+ P
Sbjct: 306 N----IIPMEDHCQYMFIAHVEGRSYSASLKYRQACNSVIVAHKLQYIQHHHYLLVPDGP 361
Query: 355 TKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+N+ + +DL ++ ++ A+PS AE+I
Sbjct: 362 QQNYVEVERDFSDLSDKLEPLL----ADPSLAERIAN 394
>gi|332817563|ref|XP_003309985.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Pan
troglodytes]
gi|426341694|ref|XP_004036161.1| PREDICTED: protein O-glucosyltransferase 1 isoform 2 [Gorilla
gorilla gorilla]
gi|194381610|dbj|BAG58759.1| unnamed protein product [Homo sapiens]
Length = 233
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 103/216 (47%), Gaps = 19/216 (8%)
Query: 220 FPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+P W+FW G V L W+ +D+ + WK+K AY++G+ SP R
Sbjct: 2 YPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SPER 59
Query: 273 --VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSV 325
+ L+ + KL AE + + KD K L + C ++Y G A S
Sbjct: 60 DPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASF 119
Query: 326 SLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAE 385
K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A+
Sbjct: 120 RFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDVAQ 177
Query: 386 KIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
+I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 178 EIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 213
>gi|332225466|ref|XP_003261900.1| PREDICTED: protein O-glucosyltransferase 1 isoform 2 [Nomascus
leucogenys]
Length = 233
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 103/216 (47%), Gaps = 19/216 (8%)
Query: 220 FPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+P W+FW G V L W+ +D+ + WK+K AY++G+ SP R
Sbjct: 2 YPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SPER 59
Query: 273 --VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSV 325
+ L+ + KL AE + + KD K L + C ++Y G A S
Sbjct: 60 DPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASF 119
Query: 326 SLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAE 385
K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A+
Sbjct: 120 RFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDVAQ 177
Query: 386 KIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
+I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 178 EIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 213
>gi|358367934|dbj|GAA84552.1| endoplasmic reticulum-resident kdel protein [Aspergillus kawachii
IFO 4308]
Length = 465
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 101/217 (46%), Gaps = 23/217 (10%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEV--NLQPWN 237
D++ +F DK ++ + P+ L R T +A + +P D+ FW W + ++ P++
Sbjct: 193 DIEFVFSVEDK--VEDVTNSDHPVWTLARSATEEAVWLMP--DFGFWAWENIQNSIGPYD 248
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
+ IK W EK P W+G P LR LM+ + WG ++ + +W +
Sbjct: 249 QVVNRIKRAD--IPWSEKKPQLVWRGKPSFAPKLRRALMEAARDQPWG-DVKQVDWDQRT 305
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---P 354
+ + C + + + EG ++S SLKY +CNSV + QY L+ P
Sbjct: 306 N----IIPMEDHCQYMFIAHVEGRSYSASLKYRQACNSVIVAHKLQYIQHHHYLLVPDGP 361
Query: 355 TKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+N+ + +DL ++ ++ A+PS AE+I
Sbjct: 362 QQNYVEVERDFSDLSDKLEPLL----ADPSLAERIAN 394
>gi|344284530|ref|XP_003414019.1| PREDICTED: KDEL motif-containing protein 1 [Loxodonta africana]
Length = 502
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 140/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIPQIQRDLAHFPTVDLEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL ++ W + EA+ I KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLQ----WAKDHDEEAKNIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFEEYASLQVSEP 471
>gi|281350380|gb|EFB25964.1| hypothetical protein PANDA_002830 [Ailuropoda melanoleuca]
Length = 497
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 138/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRIL----IVRGKLYVDPYYDCVQ 158
CP I +DL + + I E KRF + L + K+Y+ + + V
Sbjct: 152 NCPATITQIQRDLAHFPA--VDPEKISEEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVG 209
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
R +F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 210 FR-IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSSSHIHPIFSWCGSTDSKDI 265
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 266 VMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRRERLELVQL 322
Query: 277 KCNDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+L A + + + + K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYASLQVSEP 471
>gi|351711155|gb|EHB14074.1| KDEL motif-containing protein 2 [Heterocephalus glaber]
Length = 543
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 118/278 (42%), Gaps = 14/278 (5%)
Query: 152 PYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYC 210
P YD S + + G L G D++ + D P+ +K + + P+P+ +C
Sbjct: 242 PTYDITHS-TLEAMRGVTNDLLSVQGNT-DLEFYINLGDWPLEHRKVNETPGPIPIISWC 299
Query: 211 TNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
+ D+ P + + ++ + ++ G+ W K A+++G
Sbjct: 300 GSLDSRDVILPTYDITHSTLEAMRGVTNDLLSVQ-GNTGPPWSNKTEKAFFRGRDSREER 358
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSL 327
L + M + +L A I + +E + K+KL + ++Y++ +G +
Sbjct: 359 LELVQMSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRF 418
Query: 328 KYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
Y+L +S+ L Y + F R LIP K++ PI +DL +K W N EA+
Sbjct: 419 PYLLLGDSLVLKQDSPYYEHFYRALIPWKHYVPIKRNLSDLLEKVK----WARENDEEAK 474
Query: 386 KIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
KI K GQ L R+Y Y ++ +Y+K KP
Sbjct: 475 KIAKEGQLMARDLLQPHRLYCYYYRVLQQYAKRQSSKP 512
>gi|452846827|gb|EME48759.1| hypothetical protein DOTSEDRAFT_67711 [Dothistroma septosporum
NZE10]
Length = 436
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 114/257 (44%), Gaps = 20/257 (7%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPW----AKSRITM-RHIMEAKRFAALRILIVRGKLYVD 151
+ S Q +Q PD ++ I + W +IT + +E R LR +I G+LY+
Sbjct: 67 LTSDQCLQAFPDLYREIDRSRAYWQNRLGDGKITEDNYGLEWSRDGGLRAMIWEGQLYIL 126
Query: 152 P-----YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPL 206
++ + R+ T+ + + +P+++ D + K+ +
Sbjct: 127 ESRGLNHFLHWKERSQATLHNIQRAITGSREPIPNIEFSIKINDNIELTKENPNATVWNF 186
Query: 207 FRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNP 265
R T+ + PD++FW + V + + I+ G S K P W+G
Sbjct: 187 NRNVTDKVMEQVWLIPDFNFWSYPRVASSYGEYQRQAIEIGEDYNS---KTPKLVWRGTT 243
Query: 266 DVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFK---KSKLSNQCNHRYKIYAEGYA 322
D +R++L++ ++ K W +++ R AE+ D + + + C +++ ++ EG
Sbjct: 244 DFNPEIRLKLIEQSEGKSW-SDVHRV--AEDVHDEEATKYRITMPDHCKYKFAVHTEGTT 300
Query: 323 WSVSLKYILSCNSVALI 339
WS LKY+LSC+S LI
Sbjct: 301 WSGRLKYLLSCHSTILI 317
>gi|384944046|gb|AFI35628.1| KDEL motif-containing protein 1 precursor [Macaca mulatta]
Length = 502
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 138/328 (42%), Gaps = 28/328 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPTVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PD++ + D P+ KK + + P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDMEFFVNLGDWPLEKKKSNANIH-PIFSWCGSIDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKL 417
GQ+F +L D ++ Y L EY+ L
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANL 466
>gi|301758088|ref|XP_002914892.1| PREDICTED: KDEL motif-containing protein 1-like [Ailuropoda
melanoleuca]
Length = 502
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 138/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRIL----IVRGKLYVDPYYDCVQ 158
CP I +DL + + I E KRF + L + K+Y+ + + V
Sbjct: 152 NCPATITQIQRDLAHFPA--VDPEKISEEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVG 209
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
R +F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 210 FR-IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSSSHIHPIFSWCGSTDSKDI 265
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 266 VMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRRERLELVQL 322
Query: 277 KCNDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+L A + + + + K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYASLQVSEP 471
>gi|407924438|gb|EKG17488.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 440
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 126/272 (46%), Gaps = 38/272 (13%)
Query: 137 AALRILIVRGKLYVDPYYD--CVQSRAMFTIWGFLQLLRRY-----PGMVPDVDIMFDCM 189
+ + +LI G++Y+ + C + R + G L R P +P+V+ + D
Sbjct: 118 SQVHVLIYNGQMYIINEHKGACDKQRGLA---GLANLYRAIISVPDPTTIPNVEFILDTE 174
Query: 190 DKPVI---DKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHG 246
D P D + + P+ + T A PD+ W + +L + + +K
Sbjct: 175 DTPTQEMPDDRVVWGWTRPMGKLGTWVA------PDFDGWAFPISDLGAYVSFRERLKLD 228
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKL 306
++EK P A W+G P L+ LR LM + K W A++ Q+ +E + +
Sbjct: 229 EM--PFEEKDPRAVWRGTP-ALNKLRNTLMNVTEGKDW-ADV--QHLVKETR-----MLM 277
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ--QYKDFFSRGLI---PTKNHFPI 361
+ CN+++ I+ EG WS L+Y+ +CNS A ++ Q +Y+ + L+ P +N+ +
Sbjct: 278 TEFCNYKFPIHTEGNTWSGRLRYLHNCNS-ATVVHQPLEYQAHYYDLLVADGPEQNYISV 336
Query: 362 PSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
A+ ++ +++ ANP EA +I K D
Sbjct: 337 --ANDWSDLEEKIEYYRANPDEAARIAKNSVD 366
>gi|145233531|ref|XP_001400138.1| endoplasmic reticulum-resident kdel protein [Aspergillus niger CBS
513.88]
gi|134057070|emb|CAK44358.1| unnamed protein product [Aspergillus niger]
Length = 465
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 100/217 (46%), Gaps = 23/217 (10%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEV--NLQPWN 237
D++ +F DK ++ + P+ L R T +A + +P D+ FW W + ++ P++
Sbjct: 193 DIEFVFSVEDK--VEDVTNSDHPVWTLARSATEEAVWLMP--DFGFWAWENIQNSIGPYD 248
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
+ IK W EK P W+G P LR LM+ + WG + + +W +
Sbjct: 249 QVVNRIKRAD--IPWSEKKPQLVWRGKPSFAPKLRRALMEAARDQPWG-NVKQVDWDQRT 305
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---P 354
+ + C + + + EG ++S SLKY +CNSV + QY L+ P
Sbjct: 306 N----IIPMEDHCQYMFIAHVEGRSYSASLKYRQACNSVIVAHKLQYIQHHHYLLVPDGP 361
Query: 355 TKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+N+ + +DL ++ ++ A+PS AE+I
Sbjct: 362 QQNYVEVERDFSDLSDKLEPLL----ADPSLAERIAN 394
>gi|213514888|ref|NP_001133941.1| KDEL motif-containing protein 2 precursor [Salmo salar]
gi|209155896|gb|ACI34180.1| KDEL motif-containing protein 2 precursor [Salmo salar]
Length = 524
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/339 (20%), Positives = 141/339 (41%), Gaps = 29/339 (8%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIME------AKRFAALRILIVRGKLY---VD 151
Q + CP I +D + I ++ +++ + R + ++ ++Y +
Sbjct: 163 QSIMQCPAEEPQIQRDFNAFPS--IDLQRLLQEVPRRFSNRGGLIHYTVINNQVYRRSLG 220
Query: 152 PYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK-EHGSFPLPLFRYC 210
Y D MF+ L L R+ +PDV+ + D P+ +K + P+P+ +C
Sbjct: 221 KYTDF----KMFSDEMLLSLARKV--RLPDVEFYINVGDWPMETRKADDDPGPVPIISWC 274
Query: 211 TNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
+ DI P + + ++ + + ++ G+ W K+ A+++G
Sbjct: 275 GSTDTRDIILPTYDITHSTLETMRGVSNDLLSVQ-GNTGPPWANKMEQAFFRGRDSREER 333
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSL 327
L + + + +L A I + E + K+ L + ++Y++ +G +
Sbjct: 334 LHLVTLSKKNPELLDAGITGWFFFREREKDLGKANLVGFFDFFKYKYQVNVDGTVAAYRF 393
Query: 328 KYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSA--DLCRSIKSVVDWGNANPSEAE 385
Y++ NS+ L Y + F L P ++ P+ + DL + I +W N +EA+
Sbjct: 394 PYLMLGNSLVLKQISPYYEHFYTHLKPGTHYIPVKRSLSDLIQKI----EWAKENDAEAQ 449
Query: 386 KIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKPA 423
I KAGQ + L R+Y Y ++ YS+ +P
Sbjct: 450 AIAKAGQAIVRELVQPSRLYCYYYSVLQMYSERQTSRPT 488
>gi|149046243|gb|EDL99136.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_a [Rattus
norvegicus]
gi|197246465|gb|AAI68987.1| Kdelc1 protein [Rattus norvegicus]
Length = 502
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 140/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETITQIQKDLSHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +P+V+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKV--RMPEVEFFVNLGDWP-LEKKKSTSNIQPIFSWCGSTESRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + +EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WARDHDAEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L Y+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQGYANLQVSEP 471
>gi|157128435|ref|XP_001655120.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
gi|122067566|sp|Q16QY8.1|RUMI_AEDAE RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|108872609|gb|EAT36834.1| AAEL011121-PA [Aedes aegypti]
Length = 402
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 151/349 (43%), Gaps = 28/349 (8%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
P + C + DL P+ K I+ + + A+ + + IV +LY DC+
Sbjct: 65 EPCQQANCSCHADVLKTDLRPF-KGGISEQMVERARSYGT-KYQIVDHRLYRQK--DCMF 120
Query: 159 SRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
+ F++ P + +PD++++ +C D P I++ LP+ + D +
Sbjct: 121 PARCSGVEHFIK-----PNLPHLPDMELIINCRDWPQINRHWKQE-KLPVLSFSKTDDYL 174
Query: 217 DIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGN--PDVL 268
DI +P W FW G ++L P W++ IK + + W++K A+++G+ D
Sbjct: 175 DIMYPTWGFWEGGPAISLYPTGLGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDER 234
Query: 269 SPLRVELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
PL L+ +L A+ + + KD ++ +L + C ++Y G A
Sbjct: 235 DPL--VLLSRRKPELVDAQYTKNQAWKSPKDTLNAKPAQEVRLEDHCQYKYLFNFRGVAA 292
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++++FF L P ++ P+ ++ ++++ +
Sbjct: 293 SFRFKHLFLCRSLVFHVGSEWQEFFYPSLKPWVHYVPVRVGATQEELEELIEFFAEHDDL 352
Query: 384 AEKIGKAG-QDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEA 431
A +I G + + L M V Y L+ Y KL+ Y+ S E
Sbjct: 353 AREIADRGFEHVWKHLRMKDVECYWRKLLRRYGKLVKYEVKRDHSLVEV 401
>gi|378732944|gb|EHY59403.1| hypothetical protein HMPREF1120_07393 [Exophiala dermatitidis
NIH/UT8656]
Length = 412
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/293 (22%), Positives = 119/293 (40%), Gaps = 22/293 (7%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA-LRILIVRGKLYVDPYYDCVQSRAM 162
Q P F+ + + L S IT+ + R +R +I +LYV + SR +
Sbjct: 68 QAFPGLFEEVERPLADRRNSSITLEELDSVPRQNGYVRGMIYDQQLYVIEKTGGIYSREL 127
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDK-PVIDKKEHGSFPLPLFRYCTNDAHFDI-PF 220
T+ + + P +P+++ +F+ D+ P + ++ Y + I
Sbjct: 128 ATLHALHRAIISAPEPLPNIEFVFNSDDRIPSV----------AIWGYARREQDTKIWLI 177
Query: 221 PDWSFWGWSEVNLQPWNE-EFKDIK-HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
PD+ +W W E + E + K ++ SW K+P W+G L LR +K
Sbjct: 178 PDFGYWSWPETKVGTMREVQMKAVETEQDDGWSWSSKVPKLLWRGATMGLE-LRENFLKA 236
Query: 279 NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
+ W A++ W + + C ++Y + EG ++S LKY+ SC SV +
Sbjct: 237 AADQPW-ADVKALEWKNKESMAHDLKSMPEHCQYKYLAHTEGNSYSGRLKYLQSCKSVVV 295
Query: 339 IISQ---QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
Q+ R P KN+ + + + + W A+ +AE+I
Sbjct: 296 AHKMDWIQHHHPLMRSDGPDKNYVQVERS--YEDLPEKMAWLQAHDRDAERIA 346
>gi|238502901|ref|XP_002382684.1| endoplasmic reticulum-resident kdel protein, putative [Aspergillus
flavus NRRL3357]
gi|317148194|ref|XP_001822570.2| endoplasmic reticulum-resident kdel protein [Aspergillus oryzae
RIB40]
gi|220691494|gb|EED47842.1| endoplasmic reticulum-resident kdel protein, putative [Aspergillus
flavus NRRL3357]
Length = 465
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 81/177 (45%), Gaps = 18/177 (10%)
Query: 220 FPDWSFWGWSE--VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
PD+ FW W ++ P+++ + +K W +K P W+G P LR LM
Sbjct: 229 MPDFGFWAWDNPRNSIGPFDQVVERVKRAD--IPWSQKTPQLVWRGKPSFAPKLRRALMD 286
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
K WG ++ + NW E + + C + + + EG ++S SLKY +CNSV
Sbjct: 287 AARDKPWG-DVKQVNWFERTN----IMSMEDHCRYMFIAHVEGRSYSASLKYRQACNSVI 341
Query: 338 LIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+ QY L+ P +N+ + +DL I+ ++D +PS A++I
Sbjct: 342 VAHKLQYIQHHHYLLVPDGPNQNYIEVERDFSDLESKIEPLLD----DPSTAQRIAN 394
>gi|326483787|gb|EGE07797.1| DUF821 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 420
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/318 (21%), Positives = 135/318 (42%), Gaps = 20/318 (6%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
P F I + + A+ I + I + +R +I +LYV + SR + T+
Sbjct: 81 PGLFAEIDRAKQERAQRPIGLAEIDSVIPKNGYIRAMIYDQQLYVIAVNGTIYSREIATL 140
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWS 224
+ + P +P+++ +F+ DK +D P+ + Y + ++ PD+
Sbjct: 141 HALHRAIVSSPDPLPNIEFVFNTDDK--VD-------PVAQWGYARREQDTNLWLMPDFG 191
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
+W W E + E + +W +K+P +W+G L +R +L++ + W
Sbjct: 192 YWSWPETKVGTMQEVQMGAEREEHTWAWPKKIPRLFWRGATMDLE-VREKLIQVTRDQPW 250
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ-- 342
A++ W + + C +Y EG ++S LKY+ SCNSV + S
Sbjct: 251 -ADVKPITWRDNDSMRNDLKSMPEHCRFKYLAQTEGNSYSGRLKYLQSCNSVVISHSLEW 309
Query: 343 -QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG-QDFMES-LT 399
Q++ + P +N+ + ++ ++W + +A++I + + F E LT
Sbjct: 310 IQHQSPLMKSSGPDQNYVEVRRD--WSDLREKIEWLEEHEQDAKRIAQNNIKTFREQYLT 367
Query: 400 MDRVYDYMLHLITEYSKL 417
Y HLI ++++
Sbjct: 368 PAAEVCYWRHLIRSWAEV 385
>gi|449269743|gb|EMC80494.1| KDEL motif-containing protein 2, partial [Columba livia]
Length = 441
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 137/337 (40%), Gaps = 26/337 (7%)
Query: 101 QKVQTCPDFFKSIHKDLE--PWAKSRITMRHI---MEAKRFAALRILIVRGKLY---VDP 152
Q + +CP I KD P + ++ I R A + I+ +Y +
Sbjct: 85 QNIMSCPSQEPQITKDFISFPTIDLQRMLKEIPAKFSQNRGAIVHYTILNNHIYRRSLGK 144
Query: 153 YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCT 211
Y D MF+ L L R+ +PDV+ + D PV +K + + P+P+ +C
Sbjct: 145 YTDF----KMFSDEMLLSLARKV--RLPDVEFYLNVGDWPVEYRKANDTPGPIPVISWCG 198
Query: 212 NDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+ DI P + + L+ + I+ G+ W+ K A ++G L
Sbjct: 199 SVDSRDIILPTYDVTHSTLETLRGVTNDLLSIQ-GNTGPPWENKTEQALFRGRDSREERL 257
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLK 328
+ + + +L A I + E + K L + ++Y++ +G +
Sbjct: 258 HLVKLSKENPELLDAGITGYFFFREKEKELGKVPLMGFFDFFKYKYQVNVDGTVAAYRFP 317
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
Y+L +S+ L QY + F GL P K++ P+ DL IK W N EA K
Sbjct: 318 YLLLGDSLVLKQDSQYYEHFYIGLKPWKHYVPVKRNLEDLLEKIK----WAKENNEEARK 373
Query: 387 IGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
I K GQ E L R+Y Y ++ +Y+K KP
Sbjct: 374 IAKEGQLVARELLQPHRLYCYYYKVLQKYAKRQASKP 410
>gi|332242076|ref|XP_003270210.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein 1
[Nomascus leucogenys]
Length = 502
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 140/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F+ K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFTMSCSXWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|56207656|emb|CAI20990.1| novel protein (zgc:56065) [Danio rerio]
Length = 500
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 139/337 (41%), Gaps = 28/337 (8%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDC 156
+K CP F I DL + +RF L I ++Y+ + +
Sbjct: 147 EKNMHCPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHYTIKNNQVYIKTHGEH 206
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
V R +F L L R+ +PD++ + D P ++K+ P P+F +C ++
Sbjct: 207 VGFR-IFMDAFLLSLTRKVK--LPDIEFFVNLGDWP-LEKRRASQNPSPVFSWCGSNDTR 262
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
DI P + +E L+ D+ G W++K+ +W+G R+E
Sbjct: 263 DIVMPTYDL---TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDS--RKERLE 317
Query: 275 LMKCNDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+K + + N+ +E+ G K + ++Y+I +G + L
Sbjct: 318 LVKLAKANTAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINVDGTVAAYRLP 377
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
Y+L+ +SV Y + F L P ++ P S +DL I+ W + EA+K
Sbjct: 378 YLLAGDSVVFKHDSIYYEHFYNELQPWVHYIPFRSDLSDLLEKIQ----WAKDHDEEAKK 433
Query: 387 IGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
I AGQ F + M D V+ Y L +Y++L KP
Sbjct: 434 IALAGQQFARTHLMGDSVFCYYHKLFQKYAELQVTKP 470
>gi|358373714|dbj|GAA90310.1| DUF821 domain protein [Aspergillus kawachii IFO 4308]
Length = 442
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 70/309 (22%), Positives = 129/309 (41%), Gaps = 43/309 (13%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPY--YDCVQSRAMF 163
P F + K RIT + + + +R +I G+LYV + +R
Sbjct: 91 PKLFVELDKSASLREGHRITFKEVDSLQVEDGMVRGIIHHGELYVVDFGAMPATFTRGKS 150
Query: 164 TIWGFLQLLRRYP--GMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PF 220
T+ + L +P G +PD++ + D + P++ Y D ++
Sbjct: 151 TLHALHRALASFPDRGSLPDIEFVLTTEDYSIHQG--------PVWSYSKRDEQGNVWLM 202
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKE---------KLPF------AYWKGNP 265
PD+ +W W EV + P +KDI+ A E LPF W+G+
Sbjct: 203 PDFGYWSWPEVKIGP----YKDIRRRIAAVDDGEVTLDGKFIPGLPFPDKKKQLAWRGSV 258
Query: 266 DVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSV 325
+R +L+K + W A I +W +E F + + C + + +AEG ++S
Sbjct: 259 ATNPEVRGKLLKAAQGRSW-ASIRVIDWDDENDVRFNLLPMEDYCQYMFLAHAEGRSFSG 317
Query: 326 SLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNAN 380
KY+L+C SV + +++ LI P N+ + +DL + ++ ++D +
Sbjct: 318 RGKYLLNCRSVVVSHKLVWREAHHAALIASGPEANYVEVKRDFSDLDQKMEFLID----H 373
Query: 381 PSEAEKIGK 389
P A++I +
Sbjct: 374 PETAQRIAE 382
>gi|225718528|gb|ACO15110.1| KDEL motif-containing protein 2 precursor [Caligus clemensi]
Length = 507
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 61/275 (22%), Positives = 116/275 (42%), Gaps = 9/275 (3%)
Query: 153 YYDCVQSRAMFTIW--GFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
Y C F ++ L L R ++PDV+ + + D P+ +K + +P+ +C
Sbjct: 199 YRTCFGEYVGFNMFVDNVLHYLTRIT-IIPDVEFIINLGDWPLWEKITNYPQAVPIISWC 257
Query: 211 TNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
ND DI +P + S + ++ S W EK+ W+G S
Sbjct: 258 KNDNFSDILWPTYDLTQASLECMGRQEVHVFSVREESSHIPWHEKINKGIWRGRDSNTSR 317
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKD--GFKKSK-LSNQCNHRYKIYAEGYAWSVSL 327
L++ + + A I R + + ++ G K + ++Y + +G S L
Sbjct: 318 LKLVKLSKESPESLDAGITRYFFFRDMENELGIKDQMPFFDFFKYKYLVTVDGTVASYRL 377
Query: 328 KYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKI 387
Y+LS NSV L Y + + + LIP ++ PI D + + W + ++I
Sbjct: 378 PYLLSGNSVVLKQESNYYEHYYKQLIPYVHYIPI--KDDLSDLNQQIQWARVHEDLVQEI 435
Query: 388 GKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYK 421
+ G+ ++ L + ++V+ Y ++ E+ K L K
Sbjct: 436 SENGRKLVDDLLLPEKVFCYHGQVLREWRKRLAIK 470
>gi|350633295|gb|EHA21660.1| hypothetical protein ASPNIDRAFT_184185 [Aspergillus niger ATCC
1015]
Length = 452
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 119/282 (42%), Gaps = 28/282 (9%)
Query: 124 RITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVD 183
RI + +I A A R+ I +G+LYV + I G L + R P +
Sbjct: 124 RIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRR-KILGILASIHRALVSAPQLA 180
Query: 184 IMFDC-MDKPVIDKKEHGSFP----LPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQ--PW 236
++ + + V DK E + P L R T ++ + PD+ FW W ++ Q P+
Sbjct: 181 VILNTEIIFSVEDKLEDVAGPDHPLWVLARKATEESVW--LMPDFGFWSWGHIDTQIGPY 238
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
+E K ++ W EK W+G LR L++ + W A++ W +
Sbjct: 239 DEVVKHVE--EHEHPWDEKENKLVWRGKLSFAPKLRRALLEIARNYAW-ADVKEVEWKNK 295
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI--- 353
A + N C++R+ + EG ++S SLKY +C SV +I QY L+
Sbjct: 296 AN----YLSMDNHCDYRFIAHVEGRSYSASLKYRQACRSVIVIHELQYIQHHHYLLVSSG 351
Query: 354 PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
P +N + DL I+ ++D NP +A I D
Sbjct: 352 PQQNFVQVERDWTDLPHKIQELLD----NPIKARMIADNNVD 389
>gi|311263851|ref|XP_003129882.1| PREDICTED: KDEL motif-containing protein 2 [Sus scrofa]
Length = 508
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 142/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
Q+ +CP I KD + I ++ ++ +RF R IV + + Y
Sbjct: 152 QRTLSCPTKEPQIAKDFASFPS--INLQQMLNEIPRRFGEERGAIVHYTILNNHIYRRSL 209
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PDV+ + D P+ +K + + PLP+ +C +
Sbjct: 210 GKYTDFKMFSDEILLSLARKV--LLPDVEFYVNLGDWPLEHRKVNETPGPLPIISWCGSL 267
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + + ++ + I+ G SW K A+++G L++
Sbjct: 268 DSRDVILPTYDITHSTLEAMRGVTNDLLSIQ-GHTGPSWINKTEKAFFRGRDSREERLQL 326
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 327 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYL 386
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL I +W + EA+KI
Sbjct: 387 MLGDSLVLKQDSPYYEHFYMALKPWKHYVPIKRNLSDLLEKI----EWAKEHDEEAKKIA 442
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 443 KEGQLTARDLLQPHRLYCYYYRVLQKYAERQTSKP 477
>gi|326475589|gb|EGD99598.1| hypothetical protein TESG_06947 [Trichophyton tonsurans CBS 112818]
Length = 419
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/318 (21%), Positives = 135/318 (42%), Gaps = 20/318 (6%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
P F I + + A+ I + I + +R +I +LYV + SR + T+
Sbjct: 80 PGLFAEIDRAKQERAQRPIGLAEIDSVIPKNGYIRAMIYDQQLYVIAVNGTIYSREIATL 139
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWS 224
+ + P +P+++ +F+ DK +D P+ + Y + ++ PD+
Sbjct: 140 HALHRAIVSSPDPLPNIEFVFNTDDK--VD-------PVAQWGYARREQDTNLWLMPDFG 190
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
+W W E + E + +W +K+P +W+G L +R +L++ + W
Sbjct: 191 YWSWPETKVGTMQEVQMGAEREEHTWAWPKKIPRLFWRGATMDLE-VREKLIQVTRDQPW 249
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ-- 342
A++ W + + C +Y EG ++S LKY+ SCNSV + S
Sbjct: 250 -ADVKPITWRDNDSMRNDLKSMPEHCRFKYLAQTEGNSYSGRLKYLQSCNSVVISHSLEW 308
Query: 343 -QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG-QDFMES-LT 399
Q++ + P +N+ + ++ ++W + +A++I + + F E LT
Sbjct: 309 IQHQSPLMKSSGPDQNYVEVRRD--WSDLREKIEWLEEHEQDAKRIAQNNIKTFREQYLT 366
Query: 400 MDRVYDYMLHLITEYSKL 417
Y HLI ++++
Sbjct: 367 PAAEVCYWRHLIRSWAEV 384
>gi|41055550|ref|NP_957225.1| KDEL motif-containing protein 1 precursor [Danio rerio]
gi|82188635|sp|Q7ZVE6.1|KDEL1_DANRE RecName: Full=KDEL motif-containing protein 1; Flags: Precursor
gi|28277832|gb|AAH45893.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Danio rerio]
Length = 500
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 139/337 (41%), Gaps = 28/337 (8%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDC 156
+K CP F I DL + +RF L I ++Y+ + +
Sbjct: 147 EKNMHCPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHYTIKNNQVYIKTHGEH 206
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
V R +F L L R+ +PD++ + D P ++K+ P P+F +C ++
Sbjct: 207 VGFR-IFMDAFLLSLTRKVK--LPDIEFFVNLGDWP-LEKRRASQNPSPVFSWCGSNDTR 262
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
DI P + +E L+ D+ G W++K+ +W+G R+E
Sbjct: 263 DIVMPTYDL---TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDS--RKERLE 317
Query: 275 LMKCNDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+K + + N+ +E+ G K + ++Y+I +G + L
Sbjct: 318 LVKLARANTAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINVDGTVAAYRLP 377
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
Y+L+ +SV Y + F L P ++ P S +DL I+ W + EA+K
Sbjct: 378 YLLAGDSVVFKHDSIYYEHFYNELQPWVHYIPFRSDLSDLLEKIQ----WAKDHDEEAKK 433
Query: 387 IGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
I AGQ F + M D V+ Y L +Y++L KP
Sbjct: 434 IALAGQQFARTHLMGDSVFCYYHKLFQKYAELQVTKP 470
>gi|119497671|ref|XP_001265593.1| hypothetical protein NFIA_032600 [Neosartorya fischeri NRRL 181]
gi|119413757|gb|EAW23696.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 458
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 123/301 (40%), Gaps = 36/301 (11%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRF--AALRILIVRGKLYVDPYYDCVQSRAMF- 163
P F+ IH+ +E W R + A F R +I G LYV V +RA
Sbjct: 108 PGLFQDIHRGVEYWKSRGGISRDDLNAVPFEDGMARAIISNGDLYV------VATRAKGD 161
Query: 164 ----TIWGFLQLLRRYPGMV------PDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTND 213
I G L + R P ++ +F D+ V D G L R +
Sbjct: 162 DHRRKIVGTLGSIHRALSASSNRTSHPTIEFIFSIEDR-VDDVNAVGHPVWVLSRKAFEE 220
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
+ I PD+ FW W++ N+ P+ + + I + +K W+G LR
Sbjct: 221 SV--ILMPDFGFWSWAKSNIGPYGQVVERIMAAESKLKFADKEQKLVWRGKLSFAPKLRR 278
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
L+ K W +++ +W+++A + + C + + + EG A+S SLKY +C
Sbjct: 279 VLLDIARGKPW-SDLKELDWSKKA----NFLSMEDHCRYMFIGHVEGRAYSASLKYRQAC 333
Query: 334 NSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
SV + QY L+ P +N+ + +DL + + ++ NP +AE+I
Sbjct: 334 RSVVVAHKLQYIQHHHYLLVASGPEQNYVEVERDFSDLPKRMDELLQ----NPDKAERIA 389
Query: 389 K 389
Sbjct: 390 N 390
>gi|145251704|ref|XP_001397365.1| endoplasmic reticulum-resident kdel protein [Aspergillus niger CBS
513.88]
gi|134082901|emb|CAK42731.1| unnamed protein product [Aspergillus niger]
Length = 456
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 119/282 (42%), Gaps = 28/282 (9%)
Query: 124 RITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVD 183
RI + +I A A R+ I +G+LYV + I G L + R P +
Sbjct: 128 RIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRR-KILGILASIHRALVSAPQLA 184
Query: 184 IMFDC-MDKPVIDKKEHGSFP----LPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQ--PW 236
++ + + V DK E + P L R T ++ + PD+ FW W ++ Q P+
Sbjct: 185 VILNTEIIFSVEDKLEDVAGPDHPLWVLARKATEESVW--LMPDFGFWSWGHIDTQIGPY 242
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
+E K ++ W EK W+G LR L++ + W A++ W +
Sbjct: 243 DEVVKHVE--EHEHPWDEKENKLVWRGKLSFAPKLRRALLEIARNYAW-ADVKEVEWKNK 299
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI--- 353
A + N C++R+ + EG ++S SLKY +C SV +I QY L+
Sbjct: 300 AN----YLSMDNHCDYRFIAHVEGRSYSASLKYRQACRSVIVIHELQYIQHHHYLLVSSG 355
Query: 354 PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
P +N + DL I+ ++D NP +A I D
Sbjct: 356 PQQNFVQVERDWTDLPHKIQELLD----NPIKARMIADNNVD 393
>gi|348511563|ref|XP_003443313.1| PREDICTED: KDEL motif-containing protein 1-like [Oreochromis
niloticus]
Length = 538
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 142/330 (43%), Gaps = 24/330 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSRA 161
CP F I +DL + +RF + L + K+YV + + V R
Sbjct: 190 CPQSFAQIERDLSFYPSVDPDHNAQEIPQRFGQRQSLCHYTVKDNKIYVRTFGEHVGFR- 248
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+F L L R+ +PDV+ + D P ++K++ P+F +C ++ DI P
Sbjct: 249 IFMDAILLSLTRKV--RLPDVEFFVNLGDWP-LEKRKPPEKLHPIFSWCGSNNTRDIVMP 305
Query: 222 DWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
+ +E L+ D+ + W EK A+W+G R+EL+K +
Sbjct: 306 TYDL---TESVLETMGRVSLDMMSVQANTGPPWPEKNATAFWRGRDS--RQERLELVKLS 360
Query: 280 DSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+ + N+ +E+ G K + ++Y+I +G + L Y+L+
Sbjct: 361 RAHPHIIDAAFTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINIDGTVAAYRLPYLLAG 420
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+SV L Y + F + L +++ P+ ADL + + W + EA+KI AGQ
Sbjct: 421 DSVVLKQDSGYYEHFYKQLRAWEHYIPV-RADLG-DLLGKIQWARDHDEEAKKIALAGQQ 478
Query: 394 FMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
F + M D ++ Y LI EY+KL +P
Sbjct: 479 FARTHLMGDTIFCYYYILIKEYAKLQVTEP 508
>gi|390332934|ref|XP_783946.3| PREDICTED: KDEL motif-containing protein 1-like [Strongylocentrotus
purpuratus]
Length = 512
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 142/332 (42%), Gaps = 46/332 (13%)
Query: 104 QTCPDF-------FKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDC 156
QT DF ++H+D+E SR AK + I+ K+Y
Sbjct: 170 QTQTDFSIFPSIDLGTLHQDVE----SRF-------AKHHSLCHYSIINNKVYRKTLGSI 218
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
V + MF+ L L R+ VPDV+ + D P ++K++ PLP+ +C +
Sbjct: 219 VGFK-MFSDAFLLSLTRKV--RVPDVEFFINLGDWP-LEKRDPEDEPLPILSWCGSTDTR 274
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKG--------NPD 266
DI P + +E L+ D+ + W+ K A+W+G N
Sbjct: 275 DIVLPTYDI---TESTLETMGRVSLDMMSVQANTGPKWENKTEKAFWRGRDSRRERLNLV 331
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWS 324
LS R EL+ D+ L R EEA+ G K +S + N +Y+I +G +
Sbjct: 332 KLSRRRPELL---DAALTNFFFFRN---EEAEYGPKVKHVSFYDFFNFKYQINVDGTVAA 385
Query: 325 VSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEA 384
L Y+L+ +S Y + F L P ++ P DL ++ ++W N A
Sbjct: 386 YRLPYLLAGDSAVFRHDSIYYEHFYAELEPWVHYIPF-KLDLS-DLEERIEWAMQNDDSA 443
Query: 385 EKIGKAGQDFM-ESLTMDRVYDYMLHLITEYS 415
+ I + G+ ++ E+LT + ++ Y L ++ EY+
Sbjct: 444 QTIAENGKAYVRENLTSNNIFCYYLQVLEEYA 475
>gi|453082862|gb|EMF10909.1| hypothetical protein SEPMUDRAFT_150863 [Mycosphaerella populorum
SO2202]
Length = 445
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 115/295 (38%), Gaps = 42/295 (14%)
Query: 80 TYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRF--- 136
T+ + A NP + Q P + I + W +R H + AK
Sbjct: 71 TFNSTRDARNPA-----LTYEQCTVAFPKLYHEIERSATYWKHTR---NHSISAKDVNIT 122
Query: 137 ----AALRILIVRGKLYVDPYYDCVQSR----AMFTIWGFLQLLRRYPG------MVPDV 182
LR+LI +L + D R A I+ Q+ R G VP++
Sbjct: 123 WTGKGGLRVLIYENELRILETRDIHHFRDGHDARRAIFVLSQIHRALLGATARGEQVPNI 182
Query: 183 DIMFDCMDKPVIDKKEHGSFPLPLF--RYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEF 240
+ D + + S + F R ++ PD++FW W +
Sbjct: 183 EFAIAVNDYAHLPDERGDSHAVWTFDRRIASDKDERMWLMPDFNFWAWKPTGSAYEDARR 242
Query: 241 KDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDG 300
+ + H S K EK+P W+GN + +R L++ K W + W +
Sbjct: 243 RAMTHDSAVK---EKIPKIVWRGNRHINPEVRKALIETGKGKEWAD--VEGGWLD----- 292
Query: 301 FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPT 355
+++ C + + +Y EG++WS LKY+L+C++VA++ ++ F L P+
Sbjct: 293 -----IADFCRYLFAVYTEGHSWSGRLKYLLNCDTVAIVHELEFTTSFYHLLNPS 342
>gi|432119342|gb|ELK38424.1| KDEL motif-containing protein 1 [Myotis davidii]
Length = 504
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 140/333 (42%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRIL----IVRGKLYVDPYYDCVQ 158
CP+ I +DL + + I E KRF + L + K+Y+ + + V
Sbjct: 136 NCPETIAQIQRDLVHFPT--VDPEKIAEEIPKRFGQRQSLCHYTVKDNKVYIKTHGEHVG 193
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
R +F L L R+ +PDV+ + D P ++KK+ P+F +C + DI
Sbjct: 194 FR-IFMDAILLSLTRKV--RMPDVEFFVNLGDWP-LEKKKSSPHIHPIFSWCGSTDSKDI 249
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + + + + ++ + W+ K A W+G R+EL+K
Sbjct: 250 VMPTYDLTDSVLETMGRVSLDMMSVQ-ANTGPPWESKNSTAVWRGRDSRKE--RLELVKL 306
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 307 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINVDGTVAAYRLPYLLV 366
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P +++ P+ S +DL + +W + EA+KI K
Sbjct: 367 GDSVVLKQDSIYYEHFYNELQPWRHYIPVKSNLSDLLEKL----NWAKDHDEEAKKIAKT 422
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 423 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 455
>gi|431907500|gb|ELK11352.1| KDEL motif-containing protein 2 [Pteropus alecto]
Length = 447
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 75/333 (22%), Positives = 137/333 (41%), Gaps = 18/333 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q +CP I KD + + KRF R IV + + Y +
Sbjct: 92 QNTLSCPTNEPQIAKDFASFPSIDLQQMLNQVPKRFGDERGAIVHYTILNNHIYRRSLGK 151
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K + + PLP+ +C +
Sbjct: 152 YTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETPGPLPIISWCGSLDS 209
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 210 QDIILPTYDITHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQLVQ 268
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 269 LSKENPELLDAGITGYFFFQEKEKELGKAKLIGFFDFFKYKYQVNVDGTVAAYRYPYLML 328
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L Y + F L P K++ PI DL +K W N EA++I K
Sbjct: 329 GDSLVLKQDSPYYEHFYVALKPWKHYVPIKRNLGDLLEKVK----WAKENDQEAKRIAKE 384
Query: 391 GQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
GQ L R+Y Y ++ +Y++ KP
Sbjct: 385 GQLTARDLLQPHRLYCYYYRVLQKYAERQSSKP 417
>gi|12841643|dbj|BAB25294.1| unnamed protein product [Mus musculus]
Length = 502
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 139/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
C + I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKV--RMPDVEFFVNLGDWP-LEKKKSNSNIQPIFSWCGSTESRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + +EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKEHDAEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L Y+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQGYANLQVSEP 471
>gi|312380729|gb|EFR26645.1| hypothetical protein AND_07142 [Anopheles darlingi]
Length = 459
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 59/246 (23%), Positives = 113/246 (45%), Gaps = 16/246 (6%)
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK--EHGSFPLPLFRYCTNDAHF 216
S+ M TI L L R++ +PD+++ + D P++ K + P P+F +C +D F
Sbjct: 160 SKFMDTI--LLSLARKFS--LPDMELFVNLGDWPLVKKGGPSRTTGPYPIFSWCGSDDTF 215
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKD-IKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + +E L+ D + + W +K A+W+G L +
Sbjct: 216 DIVMPTYDI---TESTLENMGRVMLDMLSIQKRGIPWPDKHRKAFWRGRDARRERLELVR 272
Query: 276 MKCNDSKLWGAEILRQNW--AEEAKDGFKKSKLSNQ--CNHRYKIYAEGYAWSVSLKYIL 331
+ L A + + EE++ G + + +S ++RY++ +G + L Y+L
Sbjct: 273 LARRHPDLLNASLTNFFFFRDEESEFGPRVAHISMHDFFDYRYQVNVDGTVAAYRLPYLL 332
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+ +SV + Y + F R L+P +++ P ADL ++ V+W N +A++I
Sbjct: 333 AGSSVVMKQDSFYYEHFYRKLVPMRHYIPF-EADLSNLLQQ-VEWARENDEKAQEIRDNA 390
Query: 392 QDFMES 397
F+ +
Sbjct: 391 NAFINA 396
>gi|12963767|ref|NP_076134.1| KDEL motif-containing protein 1 precursor [Mus musculus]
gi|81881896|sp|Q9JHP7.1|KDEL1_MOUSE RecName: Full=KDEL motif-containing protein 1; AltName:
Full=Endoplasmic reticulum resident protein 58; Short=ER
protein 58; Short=ERp58; Flags: Precursor
gi|9621686|emb|CAC00650.1| ER protein 58 [Mus musculus]
gi|26331010|dbj|BAC29235.1| unnamed protein product [Mus musculus]
gi|74213118|dbj|BAE41698.1| unnamed protein product [Mus musculus]
gi|148664475|gb|EDK96891.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_a [Mus musculus]
Length = 502
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 139/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
C + I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKV--RMPDVEFFVNLGDWP-LEKKKSNSNIQPIFSWCGSTESRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + +EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKEHDAEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L Y+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQGYANLQVSEP 471
>gi|391326724|ref|XP_003737862.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
1-like [Metaseiulus occidentalis]
Length = 497
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 138/334 (41%), Gaps = 27/334 (8%)
Query: 103 VQTCPDFFKSIHKDLEPWAKSRI-------TMRHIMEAKRFAALRILIVRGKLYVDPYYD 155
V CP I DL + K RI M + +L +++ Y
Sbjct: 144 VAKCPKISDQILDDLSYFTKHRIRPSYNRDIMNKFFDQPYSISLCNYVIKSN---QIYRK 200
Query: 156 CVQSRAMFTIWG---FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTN 212
C+ F ++ L L+R+ +PDVD + + D P+ K S +P+F +C +
Sbjct: 201 CLGEYTGFKMFMDSILLSLVRKVK--LPDVDFLVNLGDYPLAKKMSVYSPQVPIFSWCGS 258
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+ DI P + S ++ + + ++ + ++ + E+ A+W+G LR
Sbjct: 259 EDSLDIVMPTYELTEASVYMMRRVSVDVFSVQDRA-SQPYSERQTKAFWRGRDSREERLR 317
Query: 273 VELMKCNDSKLWGAEILR----QNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
+ + D +L A I ++ E G + ++Y+I +G + +
Sbjct: 318 LVELSQEDPQLLNASITNFFFFRDRMENYGGGSPHVSFFDFFEYKYQINIDGTVAAYRMP 377
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
++LS S L Y + F L ++ P+ S +DL IK +D N +
Sbjct: 378 FLLSGGSTVLKPDSMYYEHFYSLLKEDVHYVPVRSDLSDLLPKIKFCID----NEDHCAR 433
Query: 387 IGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLD 419
+ + G+ + ++L VY Y + L+ EYS+L++
Sbjct: 434 VAQNGRQIVNDALLPHHVYCYYVQLLQEYSELIE 467
>gi|396482709|ref|XP_003841528.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
gi|312218103|emb|CBX98049.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
Length = 428
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 76/338 (22%), Positives = 148/338 (43%), Gaps = 32/338 (9%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYV----DPYYDCV----- 157
P + S+ + + ++ ++ I + LR++I G+L+V P V
Sbjct: 68 PKLYHSLDEAVSRRQGRKVHVQDIAIREGRCMLRLMIFEGELFVIDAGKPEQCYVTNGNE 127
Query: 158 QSRAMFTIWGFLQLLRRYPG---MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA 214
+ R + T+ + L P VP+++ D P K+HG F F Y +
Sbjct: 128 RERILGTLAQIDRALTTLPTSDPAVPNIEFSLSLDDLPR-RSKDHGVF----FGYTRKEG 182
Query: 215 --HFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+ DI P++++W W+ + WN ++I+ W K W+G ++ L
Sbjct: 183 REYNDIWMMPNYAYWSWNYTHAPSWNSIRREIEQREAEIPWAMKDSRVVWRGKVK-MAKL 241
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS--KLSNQCNHRYKIYAEGYAWSVSLKY 329
R EL++ ++ K W ++ A D K L C ++Y I EG ++S LKY
Sbjct: 242 RSELVRVSEGKAWSD--IKPVVINNASDPHTKDVMNLREFCGYKYTIQTEGTSYSGRLKY 299
Query: 330 ILSCNSVALIISQQYKDFFS---RGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
+ C S + ++++F + R P N+ I +++ +++S +++ +A+ +EAE+
Sbjct: 300 LQLCRSALITHPLEWQEFHTHLMRLAGPDANY--IEASENFGNLESAMEYYHAHDNEAEE 357
Query: 387 IGKAGQDFMES--LTMDRVYDYMLHLITEYSKLLDYKP 422
I + + LT V Y L ++ + Y+P
Sbjct: 358 IARNSHETFARRYLTPAAVTCYWRRLFWSWASVQGYEP 395
>gi|72679937|gb|AAI00618.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
Length = 508
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I +D + I ++ +++ KRF R IV + + Y
Sbjct: 152 QKTLSCPANEPQIEQDFISFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHIYRRSL 209
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ +PD++ + D P+ +K + + P+P+ +C +
Sbjct: 210 GKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTPGPIPIISWCGSL 267
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + ++ G+ SW K A+++G L++
Sbjct: 268 DSRDIILPTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSREERLQL 326
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
L+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 327 VLLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 386
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA++I
Sbjct: 387 MLGDSLVLKQESPYYEHFYVELRPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKRIA 442
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 443 KEGQLTARDLLQPPRLYCYYYRVLQKYAERQVSKP 477
>gi|330920453|ref|XP_003299007.1| hypothetical protein PTT_09918 [Pyrenophora teres f. teres 0-1]
gi|311327422|gb|EFQ92840.1| hypothetical protein PTT_09918 [Pyrenophora teres f. teres 0-1]
Length = 482
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/339 (23%), Positives = 143/339 (42%), Gaps = 38/339 (11%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAK------RFAALRILIVRGKLYVDPYY---DC 156
C F ++D+E K R + EA+ +R ++ +LYV + +
Sbjct: 120 CASAFPGQYEDIERAVKVRDRSGKVTEAELNSFELTKGMVRAMVFDRELYVLETHLVDNI 179
Query: 157 VQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA 214
+ +A+ + + L P +P+++ F D P K P+ D
Sbjct: 180 NRQKALAILSAIYRALLAAPANAHMPNIEFTFSVEDLPAQPAK-----PIWSLTRRVQD- 233
Query: 215 HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
H PD+ FW W +L +E ++ + W +K+ W+G LR
Sbjct: 234 HSLWLMPDFGFWSWDMPSLGTLDEVAREAVQRETREPWNQKIEKLVWRGKITFAPKLRRA 293
Query: 275 LMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILS 332
L+ K W +++ + W + FK+ L +QCN+ + +AEG ++S SLKY
Sbjct: 294 LLDAAKGKSW-SDVGQLRWTDP---NFKEQFLGPVDQCNYMFIAHAEGRSYSGSLKYRQL 349
Query: 333 CNSVALIISQ-----QYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
C SV IIS Q+ + R +N+ + +DL +++ ++D +P +A+
Sbjct: 350 CRSV--IISHKLQWIQHHHYLFRANGSNQNYVEVERDFSDLASAVEDLLD----HPEKAK 403
Query: 386 KIG-KAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
+I + Q F E LT Y L+T + ++ ++P
Sbjct: 404 RIADNSVQVFRERYLTQAAEACYWRRLMTRWKDVMSWEP 442
>gi|327297120|ref|XP_003233254.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
gi|326464560|gb|EGD90013.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
Length = 420
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 62/290 (21%), Positives = 123/290 (42%), Gaps = 22/290 (7%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCVQSRAMFTI 165
P F I + + A+ I++ I + +R +I +LYV + SR + T+
Sbjct: 81 PGLFAEIDRAKQERAQRPISLAEIDSVVPKNGYIRAMIYDQQLYVIAVNGTIYSREIATL 140
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWS 224
+ + P +P+++ +F+ DK +D P+ + Y + ++ PD+
Sbjct: 141 HALHRAIVSSPDPLPNIEFVFNTDDK--VD-------PVAQWGYARREQDTNLWLMPDFG 191
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
+W W E + E + +W +K+P +W+G L +R L++ + W
Sbjct: 192 YWSWPETKVGTMQEVQMGAEREEHTWAWPKKIPQLFWRGATMDLE-VRERLIQVTRDQPW 250
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ-- 342
A++ W + + C +Y EG ++S LKY+ SCNSV + S
Sbjct: 251 -ADVKPITWRDNDSMQNDLRSMPEHCRFKYLAQTEGNSYSGRLKYLQSCNSVVISHSLEW 309
Query: 343 -QYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
Q++ + P +N+ + +DL I +W + +A++I +
Sbjct: 310 IQHQSPLMKSSGPDQNYVEVRRDWSDLPEKI----EWLEGHEQDAKRIAR 355
>gi|359322520|ref|XP_003639859.1| PREDICTED: KDEL motif-containing protein 1-like [Canis lupus
familiaris]
Length = 502
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 140/335 (41%), Gaps = 32/335 (9%)
Query: 105 TCPDFFKSIHKDLE--PWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQ 158
CP+ I +DL P I KRF + L + K+Y+ + + V
Sbjct: 152 NCPENITQIQRDLAHFPIVDPEKIAAEI--PKRFGQRQSLCHYTLKDNKVYIKTHGEHVG 209
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
R +F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 210 FR-IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSQIHPIFSWCGSTDSKDI 265
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM 276
P + ++ L+ D+ + W+ K A W+G R+EL+
Sbjct: 266 VMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELV 320
Query: 277 KCNDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYI 330
K + + N+ +E+ G K + H+Y+I +G + L Y+
Sbjct: 321 KLSRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYL 380
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
L +SV L Y + F L P ++ P+ S +DL +K W + EA+KI
Sbjct: 381 LVGDSVVLKQDSIYYEHFYNELQPWTHYIPVKSNLSDLLEKLK----WAKDHDKEAKKIA 436
Query: 389 KAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
KAGQ+F +L D ++ Y L EY+ L +P
Sbjct: 437 KAGQEFARNNLMGDDIFCYYFKLFQEYASLQVSEP 471
>gi|349604036|gb|AEP99697.1| KTEL motif-containing protein 1-like protein, partial [Equus
caballus]
Length = 200
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 86/177 (48%), Gaps = 12/177 (6%)
Query: 252 WKEKLPFAYWKGNPDVLSPLR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKK-----S 304
WK+K AY++G+ SP R + L+ + KL AE + + KD K
Sbjct: 8 WKKKNSTAYFRGSR--TSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAQDV 65
Query: 305 KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSA 364
L + C ++Y G A S K++ C S+ + ++ +FF L P ++ P+ +
Sbjct: 66 HLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT- 124
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
DL +++ ++ + AN A +I + G F M L MD V Y +L+TEYSK L Y
Sbjct: 125 DLS-NVQELLQFVKANDDIAREIAERGSQFIMNHLQMDDVTCYWENLLTEYSKFLSY 180
>gi|313241455|emb|CBY33708.1| unnamed protein product [Oikopleura dioica]
Length = 478
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 121/260 (46%), Gaps = 22/260 (8%)
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWG 227
FL LLR+ +PD++ +F+ D PV S P+F +C ++ DI P W
Sbjct: 204 FLSLLRKVN--LPDIEFLFNVGDWPV-------SREFPVFSWCGSEESSDIVVPTWDQIK 254
Query: 228 WSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAE 287
+ +++ N + ++ KSW+ K+P +++G +RV + N++ L A
Sbjct: 255 TTLLSMSKINVDILTMQLN--GKSWQSKIPKGFFRGRDSSKERMRVSALSMNNTAL-DAG 311
Query: 288 ILRQNWAEEAKDGFKK--SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYK 345
I + E+ +G K +S+ N+++++ +G Y+ +S L+ Q+ K
Sbjct: 312 ITSFQFHEQG-NGTKVPIVPMSDFGNYKFQLLLDGTVAPYRAPYVFQTSS--LVFKQKSK 368
Query: 346 DFFSRGLIP--TKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTM-DR 402
F+ P K+ + + +I+ ++W N AE I + G + + L +
Sbjct: 369 --FAEWWYPYLRKDIDFVELDEKAENIEEKIEWALENDEIAEWIAQNGFELTKELLKPEN 426
Query: 403 VYDYMLHLITEYSKLLDYKP 422
VY + L +YS+L+DY+P
Sbjct: 427 VYCHYLQAFEQYSELMDYEP 446
>gi|26348175|dbj|BAC37727.1| unnamed protein product [Mus musculus]
Length = 502
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 139/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
C + I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKV--RMPDVEFFVNLGDWP-LEKKKSNSNIQPIFSWCGSTESRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--HKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + +EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKEHDAEAKKITKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L Y+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQGYANLQVSEP 471
>gi|153218522|ref|NP_001020294.3| KDEL motif-containing protein 2 precursor [Rattus norvegicus]
gi|81888044|sp|Q566E5.1|KDEL2_RAT RecName: Full=KDEL motif-containing protein 2; Flags: Precursor
gi|62471552|gb|AAH93594.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
gi|149041673|gb|EDL95514.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
Length = 508
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I +D + I ++ +++ KRF R IV + + Y
Sbjct: 152 QKTLSCPANEPQIEQDFISFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHIYRRSL 209
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ +PD++ + D P+ +K + + P+P+ +C +
Sbjct: 210 GKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTPGPIPIISWCGSL 267
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + ++ G+ SW K A+++G L++
Sbjct: 268 DSRDIILPTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSREERLQL 326
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
L+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 327 VLLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 386
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA++I
Sbjct: 387 MLGDSLVLKQESPYYEHFYVELRPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKRIA 442
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 443 KEGQLTARDLLQPPRLYCYYYRVLQKYAERQVSKP 477
>gi|358058241|dbj|GAA95918.1| hypothetical protein E5Q_02576 [Mixia osmundae IAM 14324]
Length = 558
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 134/317 (42%), Gaps = 39/317 (12%)
Query: 95 RRVASPQKVQTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYV-DP 152
R + Q P F+ + + W + I MR + + A + ++I R +LY+ +P
Sbjct: 210 RHLTDTQCAAIFPGLFEELDRAASYWRERGGIHMRDLDKGMPQANVHVIIKRNRLYLKEP 269
Query: 153 YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDC----MDKPVIDKKEHGSFPLPLFR 208
Y SR T+ + +PDV+ + +DK +D+ S L L R
Sbjct: 270 YRIGPNSRTRATLAAINDAIVTAVEPIPDVEFILTVEDMVLDKGTVDQ----SAMLALGR 325
Query: 209 YCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQA----KSWKEKLPFAYWKGN 264
+ + PD+ F+ W E P F D++ + A ++W++K +W+G
Sbjct: 326 KKSQPNLW--LMPDYGFYAWPE----PAIGAFLDVQDQTLAFESRQTWQDKFGKLFWRGA 379
Query: 265 PDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWS 324
+L+ LR ++ W A I +W K + C ++Y ++ EG A+S
Sbjct: 380 --LLNQLRTDMALDMTDYDWAA-IQAIDW----KHPDNVLSPAEHCKYKYLLHVEGIAYS 432
Query: 325 VSLKYILSCNSVALI-----ISQQYKDFFSRGLIPTKNHF--PIPSADLCRSIKSVVDWG 377
LKY+L C SV +I I + F S P +N P P D ++ S +
Sbjct: 433 GRLKYLLQCRSVTVIHDLEHIQHFHVLFNSNTSSPDQNIIMTPAPKFDHLPAVMSRL--- 489
Query: 378 NANPSEAEKIGKAGQDF 394
AN + AE+I A Q F
Sbjct: 490 VANDAYAERI--ANQSF 504
>gi|322712770|gb|EFZ04343.1| DUF821 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 468
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/305 (22%), Positives = 137/305 (44%), Gaps = 30/305 (9%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAKRF-AALRILIVRGKLYV------DPYYDCVQ 158
P F I + ++ A+ IT + + ++R ALR +IV +LY+ + YD
Sbjct: 104 PGLFAEIERGVDAQKARGNITPKQLNISERGRGALRGMIVDQQLYILQETILENEYDT-- 161
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
SRA+ + + + P +P+++ F D V+ E ++P+ D +
Sbjct: 162 SRAVAVLHAIHRAIVTSPEPLPNIEFAFTVAD--VVPDPEENNYPIWGLTRKAEDEEIWL 219
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
D+ +W W + ++E + K +++K A W+G + R EL+K
Sbjct: 220 -MGDFGYWSWPLDLVGSYDEVRR--KMAEAEVKFEQKTKKAVWRGAV-ATNGHREELIKV 275
Query: 279 NDSKLWGAEILRQNWAE----EAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
K W A++ WA ++D K +S C +++ I+ EG+++S KY+ +CN
Sbjct: 276 TKDKEW-ADVRAIVWAGISDLVSEDQAKALSMSEHCKYQFVIHTEGHSYSGRGKYLQNCN 334
Query: 335 SVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
SV ++ + + + L+ P +N + +DL + ++ A+P A++I +
Sbjct: 335 SVVIMHKRMWIEPHHALLVADGPKQNFVEVAEDFSDLEAKVTELL----AHPERAKRIAQ 390
Query: 390 AGQDF 394
G D
Sbjct: 391 NGADI 395
>gi|109121223|ref|XP_001094880.1| PREDICTED: KDEL motif-containing protein 1-like isoform 1 [Macaca
mulatta]
Length = 502
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 139/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
C + I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCSETIAQIQRDLAHFPTVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PD++ + D P+ KK + + P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDMEFFVNLGDWPLEKKKSNANIH-PIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|355701086|gb|EHH29107.1| Endoplasmic reticulum resident protein 58, partial [Macaca mulatta]
Length = 497
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 139/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
C + I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCSETIAQIQRDLAHFPTVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PD++ + D P+ KK + + P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDMEFFVNLGDWPLEKKKSNANIH-PIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L EY+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>gi|147791843|emb|CAN70599.1| hypothetical protein VITISV_027959 [Vitis vinifera]
Length = 103
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 42/65 (64%), Gaps = 1/65 (1%)
Query: 400 MDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPK-QRQNLEKAAASPSPY 458
MD VYDYM HL+ EYSKLL +KPA P A E C+E++ C AD Q+ +E SP+
Sbjct: 1 MDFVYDYMFHLLNEYSKLLKFKPAVPPGAVELCLETMDCSADAVLQKFVMESTVNSPTDS 60
Query: 459 PPCTL 463
PC++
Sbjct: 61 APCSM 65
>gi|47078285|ref|NP_997610.1| KDEL (Lys-Asp-Glu-Leu) containing 2 protein precursor [Mus
musculus]
gi|148693845|gb|EDL25792.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_b [Mus musculus]
gi|157169796|gb|AAI52824.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [synthetic construct]
Length = 503
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/268 (23%), Positives = 118/268 (44%), Gaps = 14/268 (5%)
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAHFDIPF 220
MF+ L L R+ +PD++ + D P+ +K + + P+P+ +C + DI
Sbjct: 212 MFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTPGPIPIISWCGSLDSRDIIL 269
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
P + + ++ + ++ G+ SW K A+++G L++ L+ +
Sbjct: 270 PTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSREERLQLVLLSKEN 328
Query: 281 SKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILSCNSVA 337
+L A I + +E + K+KL + ++Y++ +G + Y++ +S+
Sbjct: 329 PQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLMLGDSLV 388
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
L Y + F L P K++ PI DL +K W N EA+KI K GQ
Sbjct: 389 LKQESPYYEHFYVALKPWKHYVPIKRNLGDLLEKVK----WAKENDEEAKKIAKEGQLTA 444
Query: 396 ESLTM-DRVYDYMLHLITEYSKLLDYKP 422
L R+Y Y ++ +Y++ KP
Sbjct: 445 RDLLQPPRLYCYYYRVLQKYAERQASKP 472
>gi|326914389|ref|XP_003203508.1| PREDICTED: KDEL motif-containing protein 2-like [Meleagris
gallopavo]
Length = 459
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/327 (23%), Positives = 133/327 (40%), Gaps = 18/327 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q V +CP I KD P+ + +F+ R IV + + Y +
Sbjct: 103 QNVFSCPSQEPQITKDFIPFPTIDLQRMFKEIPSKFSQTRGAIVHYTILNNRIYRRSLGK 162
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ +P+V+ + D PV +K + + P+P+ +C +
Sbjct: 163 YTDFKMFSDEMLLSLARKV--HLPNVEFYLNVGDWPVEYRKANDTPGPIPIISWCGSVDS 220
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + L+ + I+ G+ W K A ++G L +
Sbjct: 221 RDIILPTYDITHSTLETLRGVTNDLLSIQ-GNTGPFWDNKTEQALFRGRDSREERLYLVK 279
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + E + K L + ++Y++ +G + Y+L
Sbjct: 280 LSKENPELLDAGITGYFFFREKEKELGKVPLMGFFDFFKYKYQVNVDGTVAAYRFPYLLL 339
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSA--DLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L QY + F L P K++ P+ + DL IK W N EA+KI K
Sbjct: 340 GDSLVLKQDSQYYEHFYTVLKPWKHYVPVKRSLEDLLEKIK----WAKENDEEAQKIAKE 395
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSK 416
GQ E L R+Y Y ++ +Y++
Sbjct: 396 GQSMARELLQPHRLYCYYYKVLQKYAE 422
>gi|134114475|ref|XP_774166.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256799|gb|EAL19519.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 437
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/380 (20%), Positives = 152/380 (40%), Gaps = 61/380 (16%)
Query: 93 ERRRVASPQKVQTCPDFFKSIHKDLEPW--AKSRITMRHIMEAKRFAALRILIVRGKL-- 148
E R + Q + PD + + + W +K I+ + + EA+ R++I+ +L
Sbjct: 56 ENRHLTEEQCQERYPDLYLEADR-AQQWFISKGGISKKMVDEAEEEGNARLVILNNQLIS 114
Query: 149 ------YVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF 202
+V Y + +R I L +PDVD VI + G
Sbjct: 115 HLGRQLFVKAYKGGINTRTQAAIAAVYGALLTATEPLPDVDF--------VIQTSDAGGG 166
Query: 203 PLPLFRYCTNDAHFDI-PFPDWSFWGW--------SEVNLQPWNEEFK-----DIKHGSQ 248
P F C D+ PD+ F+ W SEV + + E D K
Sbjct: 167 NHPHFALCRKADQKDLWLMPDFGFFSWPEPGVGSYSEVRTKTLDYELDMGLEVDDKLDVT 226
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSN 308
W K +W+G+P V +R +L++ + + W +++ +W + ++D ++ K +
Sbjct: 227 KSDWFNKTQQLFWRGSPMV--EVRNDLLRASQDQPW-SDVQPLDWGKVSQDEAERIKNNG 283
Query: 309 -------QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF-----SRGLIPTK 356
C + + + EG+A+S LKY+ C SV + +Y + R P +
Sbjct: 284 DLKSPAEHCKYAFLAHVEGWAYSGRLKYLQQCRSVIVAHPLKYIQHYHHLLNGRDGDPNQ 343
Query: 357 NHFPIP---SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD-----YML 408
N+ +P +L R+++ ++ N EK+ + + +S+ + Y
Sbjct: 344 NYVEVPLPLEKNLPRAMEDLLKEEN-----EEKVQRIADNNWKSMRQGWISPAANECYYR 398
Query: 409 HLITEYSKLLDYKPAPPSSA 428
H + +Y+ + +KP+ A
Sbjct: 399 HALRQYASVQTFKPSLEGRA 418
>gi|225683240|gb|EEH21524.1| DUF821 domain-containing protein [Paracoccidioides brasiliensis
Pb03]
gi|226288286|gb|EEH43798.1| DUF821 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 451
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/339 (21%), Positives = 138/339 (40%), Gaps = 36/339 (10%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHI-MEAKRFAALRILIVRGKLYVDPYYDC--VQSRAMF 163
P + I K + IT+ + E + + +R +I +G+L + + D +RA
Sbjct: 96 PKLYVEIEKAVVARRGRNITLDELNSEPLKHSMVRAMIYQGELSIINFEDMAFTFTRAKA 155
Query: 164 TIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA-HFDIPF 220
++ + L P +P+++ +F D P P++ Y ++ +
Sbjct: 156 SLHSLNRALNAIPNRQEIPNIEFIFTAED--------FHDDPHPVWVYSKRESDSWAWLM 207
Query: 221 PDWSFWGWSEVNLQPWN---EEFKDIKHGS-------QAKSWKEKLPFAYWKGNPDVLSP 270
PD+ +W W E+ + + I G+ QA +++K W+GN +
Sbjct: 208 PDFGYWSWPEIKAGQYRSVRKRIAAIDEGTTINGKTQQALKFQDKKKQLLWRGNLETAPE 267
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYI 330
LR L+ K W A + +WA E + + C + + + EG ++S KYI
Sbjct: 268 LRQSLVDATKGKSW-ASVRALDWANEQSMREDYIPIEDHCRYMFLAHVEGRSYSGRGKYI 326
Query: 331 LSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
+C SV + +++ LI P N+ + +DL I ++D NP AE
Sbjct: 327 QNCRSVFVAHQLTWREAHHPALIATGPNANYVKVKRDFSDLESKIHYLLD----NPDVAE 382
Query: 386 KIGKAGQDFMESLTMDRVYD--YMLHLITEYSKLLDYKP 422
KI + L + + Y LI Y+ + D++P
Sbjct: 383 KIAENSVRTFRDLYLTPAAEACYWRELIHAYASVCDFEP 421
>gi|348583746|ref|XP_003477633.1| PREDICTED: KDEL motif-containing protein 1-like [Cavia porcellus]
Length = 502
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 139/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP I KDL + +RF + L + K+Y+ + + V R
Sbjct: 152 NCPKSIAQIQKDLSHFPAIDPEKIATEIPRRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSSSNIHPIFSWCGSTDSRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTALWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL ++ W + EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLQ----WAKDHDEEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ++ +L D ++ Y L EY+ L +P
Sbjct: 439 GQEYARNNLMGDDIFCYYFKLFQEYASLQVSEP 471
>gi|26344660|dbj|BAC35979.1| unnamed protein product [Mus musculus]
Length = 465
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/268 (23%), Positives = 118/268 (44%), Gaps = 14/268 (5%)
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAHFDIPF 220
MF+ L L R+ +PD++ + D P+ +K + + P+P+ +C + DI
Sbjct: 174 MFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTPGPIPIISWCGSLDSRDIIL 231
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
P + + ++ + ++ G+ SW K A+++G L++ L+ +
Sbjct: 232 PTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSREERLQLVLLSKEN 290
Query: 281 SKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILSCNSVA 337
+L A I + +E + K+KL + ++Y++ +G + Y++ +S+
Sbjct: 291 PQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLMLGDSLV 350
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
L Y + F L P K++ PI DL +K W N EA+KI K GQ
Sbjct: 351 LKQESPYYEHFYVALKPWKHYVPIKRNLGDLLEKVK----WAKENDEEAKKIAKEGQLTA 406
Query: 396 ESLTM-DRVYDYMLHLITEYSKLLDYKP 422
L R+Y Y ++ +Y++ KP
Sbjct: 407 RDLLQPPRLYCYYYRVLQKYAERQASKP 434
>gi|326431248|gb|EGD76818.1| hypothetical protein PTSG_08166 [Salpingoeca sp. ATCC 50818]
Length = 537
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 118/281 (41%), Gaps = 32/281 (11%)
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPV-IDKKEHGS--FPLPLFRYCTNDAHFDIPFPDWS 224
L L + P +PDV+ + + D P+ HG P P+F +C++ H DI P +
Sbjct: 187 LLSLASKTP--LPDVEFVLNLGDWPLAFHASAHGEKMRPYPVFSWCSSTNHSDIVLPTYK 244
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
+ + D K G A W+ K A+++G P + + LM L
Sbjct: 245 MTTATIFGKNMEQIQVVDGKAGKFA-DWQSKRGVAFFRGRPSNQARVDAMLMSKERPDLV 303
Query: 285 GAEIL-------------RQNWAEEAKDGFKKSKLS---NQCNHRYKIYAEGYAWSVSLK 328
A I R++ A EAK G KK++L ++Y + +G + L
Sbjct: 304 DARITKNQFNYFPNEEARREHRAFEAKYG-KKAELQPIDTFFRNKYLLNIDGTVAAYRLA 362
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSV---VDWGNANPSEAE 385
L+ S Y + F L P ++ P+ R++ + V++ + E +
Sbjct: 363 TTLAGTSTLFKQESDYYEHFYNALEPWVHYVPVE-----RNLSDLFDRVEYAQQHDDEMQ 417
Query: 386 KIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPP 425
I +AG++F + L M +Y Y L + +YS+LL + P P
Sbjct: 418 AIARAGREFTRKHLRMPDIYCYHLRALRKYSRLLTFTPQVP 458
>gi|339265071|ref|XP_003366338.1| conserved hypothetical protein [Trichinella spiralis]
gi|316965151|gb|EFV49954.1| conserved hypothetical protein [Trichinella spiralis]
Length = 451
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/321 (22%), Positives = 134/321 (41%), Gaps = 27/321 (8%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCV-QSRAMFTIWGFLQL 171
I KDL+P+ I M R A + I+ KLY + C +R + + L+L
Sbjct: 133 IEKDLQPF--QYIRKDENMPIHRHAVV-YQIIDHKLYREQ--QCTFPARCLGVEYFLLKL 187
Query: 172 LRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWS-- 229
+ ++P+ + + + D P+I+K S +F + DI +P WSFW
Sbjct: 188 IH----VLPNTEFVVNVCDYPLINKY---SSKQAVFSFSKTADDLDIMYPVWSFWKGGPY 240
Query: 230 ----EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG 285
+ + W+ + + + ++ W K A+++G+ R+ L+ L
Sbjct: 241 IPVYKDGISRWDIQREVLIKAAKQWPWSRKSNKAFFRGSRTSKVRDRLILLSRQKPHLID 300
Query: 286 AEILRQNWAEEAKDGFKKS-----KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALII 340
A+ D K L C ++Y G A S +++ C S+ +
Sbjct: 301 AQYTTNQATRSLDDTLGKEPADFVTLDYHCRYKYLFNFRGVAASFRFRHLFLCRSLVFHV 360
Query: 341 SQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLT 399
++K+FF + P ++ P+ DL ++ ++++ N A+ I + G F + LT
Sbjct: 361 GDEWKEFFYYKMKPWIHYIPVKE-DL-NDVEELLEFVKENDDVAKDIAERGYQFILNHLT 418
Query: 400 MDRVYDYMLHLITEYSKLLDY 420
MD V Y L+ +++ L Y
Sbjct: 419 MDNVTAYWESLLKQFTDRLVY 439
>gi|224043076|ref|XP_002195971.1| PREDICTED: KDEL motif-containing protein 1 [Taeniopygia guttata]
Length = 588
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 86/373 (23%), Positives = 148/373 (39%), Gaps = 48/373 (12%)
Query: 105 TCPDFFKSIHKDLE--PWAKSRITMRHIME--AKRFAALRILIVRGKLYVDPYYDCVQSR 160
CP I +DL P + I + +R + I ++Y+ Y + V R
Sbjct: 239 NCPQTIPQIQRDLANFPIVDPDKIAKEIPQRFGQRQSLCHYTIKNNEVYIKTYGEHVGFR 298
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P+ +K + P+F +C + DI
Sbjct: 299 -IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKRKPPQNL-HPIFSWCGSSESKDIVM 354
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W +K A+W+G R+EL+K
Sbjct: 355 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWDDKNTTAFWRGRDSRKE--RLELVKL 409
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + ++Y+I +G + L Y+L+
Sbjct: 410 SRKYPEIIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKYKYQINIDGTVAAYRLPYLLA 469
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
NSV L Y + F L P K++ P S +DL ++ W + EA+ I K+
Sbjct: 470 GNSVVLKQDSIYYEHFYNELQPWKHYIPFKSDLSDLLEKLQ----WAKDHDEEAKNIAKS 525
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLE 449
GQ+F +L D ++ Y L EY+ L +++PK R +E
Sbjct: 526 GQEFARNNLMGDHIFCYYFKLFQEYAGLQ--------------------VSEPKIRDGME 565
Query: 450 KAAASPSPYPPCT 462
K PCT
Sbjct: 566 KVQQPDDDLFPCT 578
>gi|295672277|ref|XP_002796685.1| DUF821 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226283665|gb|EEH39231.1| DUF821 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 451
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/339 (21%), Positives = 138/339 (40%), Gaps = 36/339 (10%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDCVQS--RAMF 163
P + I K + IT+ + + + +R +I +G+L + + D + RA
Sbjct: 96 PKLYVEIEKAVLARRGRNITLDELNSKPLKHSMVRAMIYQGELSIINFEDMTFTFTRAKA 155
Query: 164 TIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA-HFDIPF 220
++ + L P +P+++ +F D HG P P++ Y ++ +
Sbjct: 156 SLNSLNRALNAIPNRQDIPNIEFIFTAEDF-------HGD-PHPVWVYSKRESDSWAWLM 207
Query: 221 PDWSFWGWSEVNLQPWN---EEFKDIKHGS-------QAKSWKEKLPFAYWKGNPDVLSP 270
PD+ +W W E+ + + I G+ QA +++K W+GN
Sbjct: 208 PDFGYWSWPEIKAGQYRSVRQRIAAIDEGTTINGKAQQALKFRDKKKQLLWRGNLGTAPE 267
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYI 330
LR L+ K W A + +WA E + + C + + + EG ++S KYI
Sbjct: 268 LRQSLVNATKGKSW-ASVRALDWANEQSMREDYIPIEDHCRYMFLAHVEGRSYSGRGKYI 326
Query: 331 LSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
+C SV + +++ L+ P N+ + +DL I ++D NP AE
Sbjct: 327 QNCRSVFVAHQLTWREAHHSALVATGPNANYVKVKRDFSDLESKIHYLLD----NPDVAE 382
Query: 386 KIGKAGQDFMESLTMDRVYD--YMLHLITEYSKLLDYKP 422
+I + L + + Y LI Y+ + D++P
Sbjct: 383 RIAENSVRTFRDLYLTPAAEACYWRELIHAYASVCDFEP 421
>gi|322698054|gb|EFY89827.1| DUF821 domain-containing protein [Metarhizium acridum CQMa 102]
Length = 463
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/304 (23%), Positives = 135/304 (44%), Gaps = 30/304 (9%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAKRF-AALRILIVRGKLYV------DPYYDCVQ 158
P F I + ++ A+ IT + + +R ALR +IV +LY+ + YD
Sbjct: 107 PGLFAEIERGVDAQKARGNITPKQLNITERGRGALRGMIVDQQLYILQETILENEYDT-- 164
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
+RA+ + + + P +P+ + F D V+ E ++P+ D +
Sbjct: 165 TRAVAILHAIHRAIITSPEPLPNTEFAFSVAD--VVPDPEENNYPIWGLTRKAEDEEIWL 222
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
D+ +W W + ++E + I +++K A W+G + R EL+K
Sbjct: 223 -MGDFGYWSWPLDLVGGYDEVRRKIAEAEV--KFEQKTKKAVWRGAV-ATNGHREELIKV 278
Query: 279 NDSKLWGAEILRQNWAEEA----KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
K W A++ WA + +D K +S C +++ I+ EG+++S KY+ +CN
Sbjct: 279 TKDKEW-ADVRAIVWAGISDLISEDQAKALSMSEHCKYQFVIHTEGHSYSGRGKYLQNCN 337
Query: 335 SVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
SV ++ + + + L+ P +N + +DL + ++ A+P A+KI +
Sbjct: 338 SVVIMHKRMWIEPHHALLVADGPKQNFVEVAEDFSDLEAKVTELL----AHPERAKKIAQ 393
Query: 390 AGQD 393
G D
Sbjct: 394 NGAD 397
>gi|356561649|ref|XP_003549092.1| PREDICTED: receptor-like protein 12-like [Glycine max]
Length = 819
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 53/121 (43%), Gaps = 46/121 (38%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
TCPD+F+ IH+DL PWA++ IT + AK+ A
Sbjct: 732 TCPDYFRWIHEDLRPWARTGITQDMVERAKQTANF------------------------- 766
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWS 224
RYPG +PD+++MFDC+D PV+ RY N DI FPD S
Sbjct: 767 ---------RYPGKIPDLELMFDCVDWPVVLSD----------RY--NGPTLDIVFPDSS 805
Query: 225 F 225
+
Sbjct: 806 Y 806
>gi|169775695|ref|XP_001822314.1| endoplasmic reticulum-resident kdel protein [Aspergillus oryzae
RIB40]
gi|83771049|dbj|BAE61181.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 455
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/299 (22%), Positives = 128/299 (42%), Gaps = 30/299 (10%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAAL-RILIVRGKLYVDPYY---DCVQSRA 161
P F+ +H+ W AK I+M + + + R +V G+LYV + D + +
Sbjct: 106 PGLFEDVHRAQAFWTAKHGISMHDLNDIHLVDGMARAAVVNGQLYVISTHARGDDHRRKI 165
Query: 162 MFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIP 219
+ + + L P +P+ + +F DK +D PL + ++ +
Sbjct: 166 LGILGSIHRALASSPNTSALPNTEFIFSVEDK--VDDVASPHHPLWVLARKPDEKAVWL- 222
Query: 220 FPDWSFWGWS----EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
PD+ FW W + + P+ + I+ + W+EK W+G + +R L
Sbjct: 223 MPDFGFWAWEHGKVDGKIGPYTQVVDRIRQ--KDVPWEEKEAKLVWRGKLSFAAKMRRAL 280
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
++ ++ W A++ W KD + C +R+ + EG ++S SLKY +C S
Sbjct: 281 LEAARNQPW-ADVKEIVW----KDKTNFISMEEHCRYRFIAHVEGRSYSASLKYRQACGS 335
Query: 336 VALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
V + QY L+ P +N + +DL +++++D +P+ AE+I
Sbjct: 336 VVVAHKLQYIQHHHYLLVSSGPDQNFVEVERDFSDLPAKMQTLLD----DPASAERIAN 390
>gi|391865301|gb|EIT74591.1| endoplasmic reticulum-resident kdel protein [Aspergillus oryzae
3.042]
Length = 455
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/299 (22%), Positives = 128/299 (42%), Gaps = 30/299 (10%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAAL-RILIVRGKLYVDPYY---DCVQSRA 161
P F+ +H+ W AK I+M + + + R +V G+LYV + D + +
Sbjct: 106 PGLFEDVHRAQAFWTAKHGISMHDLNDIHLVDGMARAAVVNGQLYVISTHARGDDHRRKI 165
Query: 162 MFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIP 219
+ + + L P +P+ + +F DK +D PL + ++ +
Sbjct: 166 LGILGSIHRALASSPNTSALPNTEFIFSVEDK--VDDVASPHHPLWVLARKPDEKAVWL- 222
Query: 220 FPDWSFWGWS----EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
PD+ FW W + + P+ + I+ + W+EK W+G + +R L
Sbjct: 223 MPDFGFWAWEHGKVDGKIGPYTQVVDRIRQ--KDVPWEEKEAKLVWRGKLSFAAKMRRAL 280
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
++ ++ W A++ W KD + C +R+ + EG ++S SLKY +C S
Sbjct: 281 LEAARNQPW-ADVKEIVW----KDKTNFISMEEHCRYRFIAHVEGRSYSASLKYRQACGS 335
Query: 336 VALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
V + QY L+ P +N + +DL +++++D +P+ AE+I
Sbjct: 336 VVVAHKLQYIQHHHYLLVSSGPDQNFVEVERDFSDLPAKMQTLLD----DPASAERIAN 390
>gi|302511259|ref|XP_003017581.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
gi|291181152|gb|EFE36936.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
Length = 303
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/279 (23%), Positives = 121/279 (43%), Gaps = 23/279 (8%)
Query: 147 KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPL 206
KLYV + SR + T+ + + P +P+++ +F+ DK +D P+
Sbjct: 5 KLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--VD-------PVAQ 55
Query: 207 FRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNP 265
+ Y + ++ PD+ +W W E + E + SW +K+P +W+G
Sbjct: 56 WGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREEHTWSWSKKIPQLFWRGAT 115
Query: 266 DVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSV 325
L +R +L++ + W A++ W + + C +Y EG ++S
Sbjct: 116 MDLE-VREKLIQVTRGQPW-ADVKPITWRDNESMQNDLKSMPEHCRFKYLAQTEGNSYSG 173
Query: 326 SLKYILSCNSVALIISQ---QYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNAN 380
LKY+ SCNSV + S Q++ + P +N+ + +DL IK W +
Sbjct: 174 RLKYLQSCNSVVISHSLEWIQHQSPLMKSSGPGQNYVEVRRDWSDLREKIK----WLEEH 229
Query: 381 PSEAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSKL 417
+A++I + + F E LT Y HLI ++++
Sbjct: 230 EQDAKRIAQNNIKTFREQYLTPAAEVCYWRHLIRSWAEV 268
>gi|319411533|emb|CBQ73577.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 452
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/264 (24%), Positives = 110/264 (41%), Gaps = 27/264 (10%)
Query: 121 AKSRITMRHIMEAK---RFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRR-YP 176
AK + + A R + ++I G++++ QSR T LQLL+ Y
Sbjct: 77 AKGGVRFEDVQNAAQHCRHGCVHLVIRDGQIFIRAQEKDWQSRVRST----LQLLQSAYL 132
Query: 177 GMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC-------TNDAHFDIPFPDWSFWGWS 229
G + + + ++ VI + F R ND FPD+SF W
Sbjct: 133 GASEEEREVMEGVEL-VISTADFDGFTDAASRGAGWVLDKRVNDTQGQYLFPDFSFASWP 191
Query: 230 EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS--PLRVELMKCND---SKLW 284
E + + E + + A W K A+W+G+ S P R L+ + W
Sbjct: 192 EAGIPSYPEFRRAAARVNAAVPWASKANRAFWRGDALAGSSIPARESLLAVATGPATASW 251
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
+++ R ++ E A D + C H++ I++EG A+S K++LSC S ++ + Q+
Sbjct: 252 -SDVKRTSFWESAPDIGSIVAPHDHCRHKFLIHSEGVAYSGRSKFVLSCASAVVLHALQW 310
Query: 345 KDFFSRGLI-----PTKNHFPIPS 363
+ F L+ P NH +P
Sbjct: 311 QQHFHPALVADPLSPDHNHIRLPG 334
>gi|119629473|gb|EAX09068.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_b [Homo sapiens]
Length = 283
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 108/248 (43%), Gaps = 17/248 (6%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNE 238
+PDV++ + D P ++KK+ S P+F +C + DI P + ++ L+
Sbjct: 8 MPDVELFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVMPTYDL---TDSVLETMGR 63
Query: 239 EFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
D+ + W+ K A W+G L + + +L A + +
Sbjct: 64 VSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSRKHPELIDAAFTNFFFFKH 123
Query: 297 AKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL 352
++ + K + H+Y+I +G + L Y+L +SV L Y + F L
Sbjct: 124 DENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGDSVVLKQDSIYYEHFYNEL 183
Query: 353 IPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLH 409
P K++ P+ S +DL +K W + EA+KI KAGQ+F +L D ++ Y
Sbjct: 184 QPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQEFARNNLMGDDIFCYYFK 239
Query: 410 LITEYSKL 417
L EY+ L
Sbjct: 240 LFQEYANL 247
>gi|67901122|ref|XP_680817.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
gi|40742938|gb|EAA62128.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
gi|259483867|tpe|CBF79610.1| TPA: DUF821 domain protein (AFU_orthologue; AFUA_2G14740)
[Aspergillus nidulans FGSC A4]
Length = 462
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 131/320 (40%), Gaps = 53/320 (16%)
Query: 107 PDFFKSIHKDLEPWA-KSRITMRHIMEAKRFAA--------LRILIVRGKLYVDPY---- 153
P F + + E WA I+ + + R +R + G+LY+ Y
Sbjct: 101 PKLFTELERTAEFWASNGGISYGDVDDIARGGGIDGNGNGLVRAAVKDGELYIIDYGPQP 160
Query: 154 YDCVQSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCT 211
Y + +A T+ + L YP +PD++ + D + P P++ Y
Sbjct: 161 YTFTRGKA--TLHSLHRALSSYPDRHSLPDIEFVLTTDDFST----RTSTNPSPIWAYTK 214
Query: 212 NDAHFDIPF---PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFA------YWK 262
D PD+ +W W EV+ E+KD++ AK +E LPF+ W+
Sbjct: 215 RQEDEDAAIWLMPDFGYWSWPEVDTVG---EYKDVRRRIFAK--EEGLPFSDKKKQLLWR 269
Query: 263 GNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSK--------LSNQCNHRY 314
G+ +R L+ K W A + +WA+ KS + + C + +
Sbjct: 270 GSVSANPEIRKALLDTVGGKSW-ANVKEISWADSRFQPNSKSNPSDNEVLPIEDHCTYAF 328
Query: 315 KIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRS 369
++ EG ++S KY+L+C SV + + + L+ P N+ + +DL R
Sbjct: 329 LVHTEGRSFSGRGKYLLNCKSVFITHKLTWLEAHHSALVSSGPDANYVEVDRDWSDLERK 388
Query: 370 IKSVVDWGNANPSEAEKIGK 389
++ ++D NP AE+I +
Sbjct: 389 VEFLLD----NPQSAERIAE 404
>gi|380029849|ref|XP_003698577.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
1-like [Apis florea]
Length = 497
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 116/251 (46%), Gaps = 15/251 (5%)
Query: 178 MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWN 237
++PD++ + D P++ K+ P+F +C + DI P + +E +L+
Sbjct: 224 LLPDIEFFVNLGDWPLVPKEGKN---YPIFSWCGSFDTKDIVMPTYDI---TESSLEAME 277
Query: 238 EEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAE 295
D+ GS WKEK+ +W+G L + + L+ I + +
Sbjct: 278 RVMLDMLSVQGSTDTPWKEKIEKVFWRGRDSRRERLDLIDISRKYPDLFNVSITNFFFFK 337
Query: 296 EAKD--GFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG 351
+ KD G +S +S N ++Y++ +G + Y+L+ +S+ L +Y +FF
Sbjct: 338 DEKDKYGPGQSHVSFFNFFKYKYQLNIDGTVAAYRFPYLLAGDSLVLKQDSKYYEFFYND 397
Query: 352 LIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHL 410
LIP K++ + S DL I+ ++ W + EA +I K+ + F ++L V Y + L
Sbjct: 398 LIPGKHYISVKS-DLSNLIERIM-WAKEHDQEALQIVKSARKFARDNLLPHNVLCYHVAL 455
Query: 411 ITEYSKLLDYK 421
E+SK L K
Sbjct: 456 FHEWSKRLKSK 466
>gi|431913230|gb|ELK14912.1| KDEL motif-containing protein 1 [Pteropus alecto]
Length = 1012
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 138/332 (41%), Gaps = 28/332 (8%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSRA 161
CP+ I +DL + KRF + L + K+Y+ + + V R
Sbjct: 153 CPETIAQIQRDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+F L L R+ +PD++ + D P ++KK+ P+F +C + DI P
Sbjct: 212 IFMDAILLSLTRKVK--MPDMEFFVNLGDWP-LEKKKSNLHIHPIFSWCGSTDSKDIVMP 268
Query: 222 DWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
+ ++ L+ D+ + W+ K A W+G R+EL+K +
Sbjct: 269 TYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKLS 323
Query: 280 DSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+ N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 324 RKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVG 383
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+SV L Y + F L P K++ P+ S +DL +K W + EA+KI K G
Sbjct: 384 DSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKTG 439
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
Q+F +L D ++ Y L EY+ L +P
Sbjct: 440 QEFARNNLMGDDIFCYYFKLFQEYASLQVSEP 471
>gi|351703067|gb|EHB05986.1| KDEL motif-containing protein 1, partial [Heterocephalus glaber]
Length = 497
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 140/346 (40%), Gaps = 54/346 (15%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPLSIPQIQKDLAHFPAIDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVEFFVNLGDWP-LEKKKSNSDIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNATALWRGRDS--RKERLELVKL 322
Query: 279 -------------------NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAE 319
+D +L+G + ++ D FK H+Y+I +
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDERLYGPIVKHISFF----DFFK---------HKYQINID 369
Query: 320 GYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWG 377
G + L Y+L +SV L Y + F L P K++ P+ S +DL ++ W
Sbjct: 370 GTVAAYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLQ----WA 425
Query: 378 NANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+ EA+KI KAGQ++ +L D ++ Y L EY L +P
Sbjct: 426 KDHDEEAKKIAKAGQEYARNNLMGDDIFCYYFKLFQEYGNLQVSEP 471
>gi|238492927|ref|XP_002377700.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
gi|220696194|gb|EED52536.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
Length = 321
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 112/263 (42%), Gaps = 30/263 (11%)
Query: 140 RILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVP------DVDIMFDCMDKPV 193
R+ I +G+LYV + I G L + R P D +I+F DK
Sbjct: 7 RVAIYQGRLYVLRALAKGEDHRR-KILGILASIHRALVSAPQLAAILDTEIIFSVEDK-- 63
Query: 194 IDKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAK 250
++ PL L R T ++ + PD+ FW W ++ + P++E K ++ Q
Sbjct: 64 LEDVAGPDHPLWVLARKATEESVW--LMPDFGFWSWGHIDNRIGPYDEVVKHVE--EQEL 119
Query: 251 SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQC 310
W +K W+G LR L++ + WG ++ W +A + C
Sbjct: 120 PWDKKEDKLVWRGKLSFAPKLRRTLLEVARNYAWG-DVKEVEWKNKAN----YLSMDKHC 174
Query: 311 NHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--AD 365
++R+ + EG ++S SLKY +C SV +I QY L+ P +N + AD
Sbjct: 175 DYRFIAHVEGRSYSASLKYRQACRSVVVIHKLQYIQHHHYLLVSSGPQQNFVQVERDWAD 234
Query: 366 LCRSIKSVVDWGNANPSEAEKIG 388
L I+ +++ NP+ A I
Sbjct: 235 LPHKIQQLLE----NPTRARMIA 253
>gi|354501932|ref|XP_003513042.1| PREDICTED: KDEL motif-containing protein 1 [Cricetulus griseus]
gi|344244133|gb|EGW00237.1| KDEL motif-containing protein 1 [Cricetulus griseus]
Length = 502
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 137/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I KDL + KRF + L + K+Y+ + V R
Sbjct: 152 NCPENITQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTLGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P+ KK + + P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKKSNPNIQ-PIFSWCGSTDSRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWERKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + +EA+KI K
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDAEAKKIAKT 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L Y+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQGYANLQVSEP 471
>gi|452840902|gb|EME42839.1| hypothetical protein DOTSEDRAFT_44943 [Dothistroma septosporum
NZE10]
Length = 467
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 69/306 (22%), Positives = 127/306 (41%), Gaps = 41/306 (13%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRF--AALRILIVRGKLYVDPYYDCVQS--RAM 162
P F+ +H+ + W K+ + ++ + R LI G LY+ + R M
Sbjct: 108 PGLFEDVHQGVAFWKKNDLIPALALDTIKLQDGMTRALIYEGNLYILATKSRAEDHRRKM 167
Query: 163 FTIWGFLQLLRRYPGM------VPDVDIMFDCMDKPVIDKKEHGSFPLPLF---RYCTND 213
+ L + R VP+++ +F DK GS PL+ R T
Sbjct: 168 IAV---LSAMHRALSAAQDKHSVPNIEFVFSIEDKA---SDVTGSNIQPLWVLARKATEH 221
Query: 214 AHFDIPFPDWSFWGWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDV 267
+ F IP D+ W W + + P++E + ++ + +K+ W+G
Sbjct: 222 SFFLIP--DFGLWAWDNIIKGGNHEIGPYDEVVEKALEVEKSNPFADKISQLVWRGKLSF 279
Query: 268 LSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSL 327
LR L+ + K W A + NW +A+ + + + C +++ + EG ++S SL
Sbjct: 280 APKLRRGLLNASRGKEWSA-VKELNW--DARQNYLA--MEDHCKYKFIAHVEGRSYSASL 334
Query: 328 KYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPS 382
KY +C SV + QY L+ P +N+ + +DL ++ ++D P
Sbjct: 335 KYRQACKSVIVAHKLQYIQHHHYLLLSDGPQQNYVEVERDFSDLEVKMQDLLD----RPE 390
Query: 383 EAEKIG 388
+AE+I
Sbjct: 391 KAERIA 396
>gi|154284864|ref|XP_001543227.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150406868|gb|EDN02409.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 453
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 142/349 (40%), Gaps = 36/349 (10%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYD 155
++ Q Q P + I K + IT + + + +R +I +G LYV + D
Sbjct: 88 LSRAQCQQAFPKLYIEIEKAVAARRGRNITFDELNSKPLKNSMVRAMIYQGALYVINFED 147
Query: 156 C--VQSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCT 211
SRA ++ + L P +P+++ +F D + P P++ Y
Sbjct: 148 MRYTFSRAKASLNALNRALNAVPNRYEIPNIEFIFTTED--------YHDDPHPIWVYSK 199
Query: 212 NDAH-FDIPFPDWSFWGWSEVNLQPWN---EEFKDIKHGS-------QAKSWKEKLPFAY 260
+ + + PD+ +W W E+ + + I G+ A ++ K
Sbjct: 200 RETNGWAWLMPDFGYWSWPEIKAGQYRSIRQRIAAIDEGAIINGKPRTALRFQRKKKQLL 259
Query: 261 WKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEG 320
W+G LR +L+ K W A + NWA+E + + C + + + EG
Sbjct: 260 WRGAIATAPVLRQKLLDVTKGKSW-ASVRALNWADETSMRDDYIPIEDHCKYMFLAHVEG 318
Query: 321 YAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVD 375
++S KY+ +C SV + +++ L+ P N+ + +DL + ++D
Sbjct: 319 RSYSGRGKYLQNCRSVMVAHQLVWREAHHGALVATGPEANYVKVRRDFSDLEAKMNYLLD 378
Query: 376 WGNANPSEAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
NP AEKI + G + F + LT Y LI Y+ + D++P
Sbjct: 379 ----NPEVAEKIAENGVRTFRDRYLTPAAEACYWRELIHAYASMCDFEP 423
>gi|452981840|gb|EME81599.1| hypothetical protein MYCFIDRAFT_32648 [Pseudocercospora fijiensis
CIRAD86]
Length = 459
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 69/303 (22%), Positives = 125/303 (41%), Gaps = 35/303 (11%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAAL-RILIVRGKLYVDPYYDCVQSRAMFT 164
P F+ I+ + W + +I+ + + + R +I G LYV + T
Sbjct: 99 PGLFEDINLATKHWTSNGKISTQRLDRVHLENGMTRAMIFEGNLYVVQTRSKAEDHRRKT 158
Query: 165 IWGFLQLLRRYPGM-----VPDVDIMFDCMDKPVIDKKEHGSFPLPLF---RYCTNDAHF 216
I + R +P++D +F DK GS LPL+ R + ++F
Sbjct: 159 IATLNSIHRALSAAPDRRSMPNIDFIFSIEDKAT---DVTGSKTLPLWVLARKASEQSYF 215
Query: 217 DIPFPDWSFWGWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
FPD+ +W W + + P++E + ++++K+P W+G
Sbjct: 216 --LFPDFGYWAWDNMIGKMNNEIGPYDEVVDKALARERDITFRDKVPELVWRGKLSFAPK 273
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYI 330
LR L+ K W + NW KD + L C +++ + EG ++S SLKY
Sbjct: 274 LRRALLDAARRKPWN-NVKELNWM--VKDNYLA--LDEHCKYQFIAHVEGRSYSASLKYR 328
Query: 331 LSCNSVALIISQQYKDFFSRGLIPT---KNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
+C SV + QY L P+ +N+ + +DL + +++ A+P+ A+
Sbjct: 329 QACRSVIVAHKLQYIQHHHYLLNPSGPLQNYVEVERDFSDLDEKMNALL----ADPARAQ 384
Query: 386 KIG 388
I
Sbjct: 385 TIA 387
>gi|348553234|ref|XP_003462432.1| PREDICTED: KDEL motif-containing protein 2-like [Cavia porcellus]
Length = 692
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 73/335 (21%), Positives = 140/335 (41%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
Q+ +CP + I +D + I ++ ++ KRF R IV + + Y
Sbjct: 337 QEALSCPTWEPQIEEDFASFPT--INLQQMLSEVPKRFGDERGAIVHYTILNNHIYRRSL 394
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK-EHGSFPLPLFRYCTND 213
+ MF+ L L R+ +PD++ + D P+ +K P+P+ +C +
Sbjct: 395 GKYTDFKMFSDEILLSLARKV--RLPDLEFYINLGDWPLEHRKINETPSPVPVISWCGSL 452
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + + ++ + I+ G+ SW K A+++G L++
Sbjct: 453 DSRDVILPTYDVTHSTLEAMRGVTNDLLSIQ-GNTGPSWINKTEKAFFRGRDSREERLQL 511
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 512 VQLSQENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRFPYL 571
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ P+ +DL +K W N EA+KI
Sbjct: 572 MLGDSLVLKQDSPYYEHFYTTLRPWKHYVPVNRNLSDLLEKVK----WAKENDEEAKKIA 627
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y + +Y++ KP
Sbjct: 628 KEGQLAARDLLQPHRLYCYYYRALQQYAERQSSKP 662
>gi|115437590|ref|XP_001217850.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114188665|gb|EAU30365.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 500
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 98/216 (45%), Gaps = 20/216 (9%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWS--EVNLQPWNE 238
D++ +F DK ++ G+ P+ + + + PD+ FW W + ++ P+++
Sbjct: 193 DIEFVFSVEDK--VEDVTDGANPVWVLARTVPETGVWL-MPDFGFWAWENPQNSIGPYDQ 249
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
I+ A ++EK P W+G P LR LM K WG ++ + +W
Sbjct: 250 VVDRIRR-LDAGPFEEKTPQLVWRGKPSFAPKLRRALMDAARDKPWG-DVKQVDWT---- 303
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PT 355
D ++ + C + + + EG ++S SLKY +CNSV + QY L+ P
Sbjct: 304 DHTNILRMEDHCRYMFIAHVEGRSYSASLKYRQACNSVIVAHKLQYIQHHHYLLVSDGPN 363
Query: 356 KNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+N+ + +DL I+ ++ +P A +I +
Sbjct: 364 QNYVEVERDFSDLADKIEPLL----RDPGRAARIAE 395
>gi|40882561|gb|AAR96192.1| AT24752p [Drosophila melanogaster]
Length = 192
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 1/124 (0%)
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNH 358
D +K L C +Y G A S L++IL C S+ L + Q+++FF L P ++
Sbjct: 47 DPVEKMPLVEHCQFKYLFNFRGVAASFRLRHILLCRSLVLHVGDQWQEFFYSQLKPWVHY 106
Query: 359 FPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKL 417
P+ S + ++ + + AE+I + GQ F+ L M+ V Y ++ EY+KL
Sbjct: 107 VPVASDADVDELAELILYLREHDDLAEEIAERGQQFIWLHLRMEDVQCYWSKMLQEYAKL 166
Query: 418 LDYK 421
L YK
Sbjct: 167 LTYK 170
>gi|443893818|dbj|GAC71274.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
Length = 1239
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 113/291 (38%), Gaps = 52/291 (17%)
Query: 92 PERRRVASPQKVQTC----PDFFKSIHKDLEPW-AKSRITMRHIMEAKR-----FAALRI 141
PERRR + TC P F I +L W AK I+ + + EA R + RI
Sbjct: 805 PERRRDRT-MDAATCKAEFPLLFPQIDDNLAAWRAKGGISFQDLDEAARTCDPRWGMARI 863
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMD--KPVIDKKEH 199
+I G+L++ + +SR I L L+ P + + V+ + +
Sbjct: 864 VIRDGQLFLRQVREGDESR----ISALLHLIHTAVTTDPSSTLQAGNATGIELVLSEADR 919
Query: 200 GSFPLPLFRYCTNDA------HFDIP-------FPDWSFWGWSEVNLQPWNEEFKDIKHG 246
+ P NDA D P PD+ F GW E + ++E +
Sbjct: 920 DASP--------NDAVWAISKRVDEPKSKGTWLLPDFGFAGWPETGIASFDEFLHLAQLQ 971
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND-SKLWGAEILRQNWAEEAKDGFKKSK 305
Q W K W+G + P RV+L+ D K+ GAE WA+ + F
Sbjct: 972 DQLAPWAHKGDKILWRGLANGYPP-RVDLLDRTDPRKVAGAEA----WADVKQTSFHDVG 1026
Query: 306 --------LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF 348
+ C HRY + EG ++S K++ SC SV + Q+ F
Sbjct: 1027 AEFHPLIPMHEHCRHRYLVQTEGNSYSGRGKFLWSCRSVTVAHPMQWTQHF 1077
>gi|225554472|gb|EEH02770.1| DUF821 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 453
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 142/349 (40%), Gaps = 36/349 (10%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYD 155
++ Q Q P + I K + IT + + + +R +I +G LYV + D
Sbjct: 88 LSRAQCQQAFPKLYIEIEKAVAARRGRNITFDELNSKPLKNSMVRAMIYQGALYVINFED 147
Query: 156 C--VQSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCT 211
SRA ++ + L P +P+++ +F D + P P++ Y
Sbjct: 148 MRYTFSRAKASLNALNRALNAIPNRYEIPNIEFIFTTED--------YHDDPHPIWVYSK 199
Query: 212 NDAH-FDIPFPDWSFWGWSEVNLQPWN---EEFKDIKHGS-------QAKSWKEKLPFAY 260
+ + + PD+ +W W E+ + + I G+ A ++ K
Sbjct: 200 RETNGWAWLMPDFGYWSWPEIKAGQYRSIRQRIAAIDEGAIINGKPRTALRFQRKKKQLL 259
Query: 261 WKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEG 320
W+G LR +L+ K W A + NWA+E + + C + + + EG
Sbjct: 260 WRGAIATAPVLRQKLLDVTKGKSW-ASVRALNWADETSMRDDYIPIEDHCKYMFLAHVEG 318
Query: 321 YAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVD 375
++S KY+ +C SV + +++ L+ P N+ + +DL + ++D
Sbjct: 319 RSYSGRGKYLQNCRSVIVAHQLVWREAHHGALVATGPEANYVKVRRDFSDLEAKMNYLLD 378
Query: 376 WGNANPSEAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
NP AEKI + G + F + LT Y LI Y+ + D++P
Sbjct: 379 ----NPEVAEKIAENGVRTFRDRYLTPAAEACYWRELIHAYASICDFEP 423
>gi|398410491|ref|XP_003856595.1| hypothetical protein MYCGRDRAFT_24656, partial [Zymoseptoria
tritici IPO323]
gi|339476480|gb|EGP91571.1| hypothetical protein MYCGRDRAFT_24656 [Zymoseptoria tritici IPO323]
Length = 387
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 61/276 (22%), Positives = 118/276 (42%), Gaps = 26/276 (9%)
Query: 83 TCLSAMNPIPERRR----VASPQKVQTCPDFFKSIHKDLEPW----AKSRITMRHI-MEA 133
T + +P+R R + S Q PD ++ I + W + ++T + +E
Sbjct: 8 TTKTGWTFVPDRDRNDHSLTSEQCSAAFPDLYREIDRSRAYWKARQGEEKLTDKQWGLEW 67
Query: 134 KRFAALRILIVRGKLYVDP-----YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDC 188
LR++I +LY+ ++ + R+ T+ + + VP+++
Sbjct: 68 SSDGGLRVMIWEQQLYIIESRGLNHFLHWKERSHATLHNLQRAILASKEPVPNIEFSIKI 127
Query: 189 MDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF--PDWSFWGWSEVNLQPWNEEFKDIKHG 246
D + K H + + F ND + + PD++FW + V + + + + G
Sbjct: 128 NDNIALTDK-HPNATVWNFNRDVNDKAMEQVWLIPDFNFWAYPRVAGSYGHYQRQAMDMG 186
Query: 247 SQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFK---K 303
S S K P W+G D +R++L++ K W A AE+ KD +
Sbjct: 187 SDYNS---KKPQLVWRGTTDFNPEIRLKLLEAAKDKPWSA---VHKVAEDVKDEENMKYR 240
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
+ + C +++ ++ EG WS LKY+LSC+S ++
Sbjct: 241 ITMPDHCKYKFAVHTEGTTWSGRLKYLLSCHSTIVV 276
>gi|330919750|ref|XP_003298742.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
gi|311327917|gb|EFQ93162.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
Length = 410
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 73/348 (20%), Positives = 146/348 (41%), Gaps = 32/348 (9%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYD- 155
++S Q P+ + S+ + + ++ + I LR++I +G+++V
Sbjct: 40 LSSEQCNVAFPELYHSLDEAVNSRQGRKVHPKDIEILSGRCMLRVMIYQGEVFVTDAGKP 99
Query: 156 ----CVQSRAMFTIWGFLQLLRRYPGM-------VPDVDIMFDCMDKPVIDKKEHGSFPL 204
V I G L + R VPD++ D P K+ G+F
Sbjct: 100 EQCYVVNGNERERILGTLAQIDRAVVTASTSDTPVPDIEFALSLDDLPR-RSKDKGTF-- 156
Query: 205 PLFRYCTNDA--HFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYW 261
F Y + + DI P++++W W+ + WN K+I+ W +K P W
Sbjct: 157 --FGYTRKEGPEYRDIWMMPNYAYWSWNYTHAPSWNTIRKEIQQKETEVPWSKKDPRVVW 214
Query: 262 KGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS--KLSNQCNHRYKIYAE 319
+G ++ LR EL++ ++ + W ++ A D K L + C +++ + E
Sbjct: 215 RGKVK-MAKLRQELIRVSEGQSWSD--IKPVVINNASDPHNKDVMNLRDFCGYKFTVQTE 271
Query: 320 GYAWSVSLKYILSCNSVALIISQQYKDFFS---RGLIPTKNHFPIPSADLCRSIKSVVDW 376
G ++S LKY+ C S + ++++F + R P N+ I +++ +++S +++
Sbjct: 272 GTSYSGRLKYLQLCRSALITHPLEWQEFHTHLMRLAGPDVNY--IEASENFGNLESAMEY 329
Query: 377 GNANPSEAEKIGKAGQDFMES--LTMDRVYDYMLHLITEYSKLLDYKP 422
+ +AE I + + LT V Y L + + Y+P
Sbjct: 330 YRDHDEDAELIARNSYETFTRRYLTPAAVTCYWRRLFVSWKSVQGYEP 377
>gi|242007535|ref|XP_002424595.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
humanus corporis]
gi|212508038|gb|EEB11857.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
humanus corporis]
Length = 501
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 85/374 (22%), Positives = 151/374 (40%), Gaps = 45/374 (12%)
Query: 102 KVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALR------ILIVRGKLYVDPYYD 155
K C + I DLE + K + + + +++++ R +I ++Y Y +
Sbjct: 150 KNMKCKKTYNQIVSDLESFPKVKFSEVLLKVLEKYSSQRSISLCHYVIKDNEIYRKCYGE 209
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
V + +F L L R+ +PD + + + D P+ D PLP+F +C ++
Sbjct: 210 YVDFK-IFVDALLLSLTRKIE--LPDFEFIVNLGDWPLEDNSPS---PLPIFSWCGSNFT 263
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + +E L+ +D+ G+ SW +K+ +W+G + L +
Sbjct: 264 SDIIMPTYDL---TEATLECMGRVSRDMLSVQGNTGASWNDKINKGFWRGRDSSVERLNL 320
Query: 274 ELMKCNDSKLWGAEILR----QNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
+ A + Q+ +E + ++Y+I +G + +
Sbjct: 321 IKLSKKFPDYVNASLTNFFFFQDKEKEYGPKTDPIPFYDFFQYKYQINVDGTVAAYRFPF 380
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGK 389
+L+ +SV L + QY + F R LIP ++ PI DL I V+ W N A+ I
Sbjct: 381 LLAGDSVVLKQNSQYFEHFYRDLIPHVHYVPI-KKDLSNLI-DVIKWLRDNDELAQNISI 438
Query: 390 AGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNL 448
Q F E+L V Y + L ++S L +ES PK Q++
Sbjct: 439 QAQKFANENLMPKDVLCYYVVLFKDWSSRL--------------IES------PKILQDM 478
Query: 449 EKAAASPSPYPPCT 462
E + P PCT
Sbjct: 479 EHVVQN-KPEFPCT 491
>gi|325087411|gb|EGC40721.1| DUF821 domain-containing protein [Ajellomyces capsulatus H88]
Length = 453
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 142/349 (40%), Gaps = 36/349 (10%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYD 155
++ Q Q P + I K + IT + + + +R +I +G LYV + D
Sbjct: 88 LSRAQCQQAFPKLYIEIEKAVAARRGRNITFDELNSKPLKNSMVRAMIYQGALYVLNFED 147
Query: 156 C--VQSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCT 211
SRA ++ + L P +P+++ +F D + P P++ Y
Sbjct: 148 MRYTFSRAKASLNALNRALNAIPNRYEIPNIEFIFTTED--------YHDDPHPIWVYSK 199
Query: 212 NDAH-FDIPFPDWSFWGWSEVNLQPWN---EEFKDIKHGS-------QAKSWKEKLPFAY 260
+ + + PD+ +W W E+ + + I G+ A ++ K
Sbjct: 200 RETNGWAWLMPDFGYWSWPEIKAGQYRSIRQRIAAIDEGAIINGKPRTALRFQRKKKQLL 259
Query: 261 WKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEG 320
W+G LR +L+ K W A + NWA+E + + C + + + EG
Sbjct: 260 WRGAIATAPVLRQKLLDVTKGKSW-ASVRALNWADETSMRDDYIPIEDHCKYMFLAHVEG 318
Query: 321 YAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVD 375
++S KY+ +C SV + +++ L+ P N+ + +DL + ++D
Sbjct: 319 RSYSGRGKYLQNCRSVIVAHQLVWREAHHGALVATGPEANYVKVRRDFSDLEAKMNYLLD 378
Query: 376 WGNANPSEAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSKLLDYKP 422
NP AEKI + G + F + LT Y LI Y+ + D++P
Sbjct: 379 ----NPEVAEKIAENGVRTFRDRYLTPAAEACYWRELIHAYASICDFEP 423
>gi|345493843|ref|XP_003427163.1| PREDICTED: KDEL motif-containing protein 1-like [Nasonia
vitripennis]
Length = 507
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 61/274 (22%), Positives = 118/274 (43%), Gaps = 20/274 (7%)
Query: 153 YYDCVQSRAMFTIWG---FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRY 209
Y +C F I+ L L R+ +PDV+ + D P++ K + P+F +
Sbjct: 209 YRNCYGQHVGFKIFSDAILLSLARKIN--LPDVEFFMNLGDWPLVPKNKEIH---PIFSW 263
Query: 210 CTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDV 267
C +D +DI P + ++ +++ D+ GS W +K+ +W+G
Sbjct: 264 CGSDDSYDIVLPTYDI---TQSSMEAMGRVMLDMLSVQGSTTDPWNKKIEKMFWRGRDAR 320
Query: 268 LSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAW 323
L + + +L+ A I + + + + K ++Y++ +G
Sbjct: 321 RERLDLIDIARKHPELFNASITNFFFFRDEIEKYGPEQKHVSFFEFFKYKYQLNIDGVVA 380
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
+ Y+L +SV +Y +FF + L K+ PI S DL ++ ++W N +
Sbjct: 381 AYRFPYLLVGDSVVFKQDSKYYEFFYKDLEAGKHFIPIKS-DLSDLVQK-LEWARENDDQ 438
Query: 384 AEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSK 416
KI K + + ++L V+ Y ++LI E+SK
Sbjct: 439 VYKISKEARQYARDNLMPQDVFCYHVNLINEWSK 472
>gi|115401698|ref|XP_001216437.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190378|gb|EAU32078.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 437
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 134/311 (43%), Gaps = 44/311 (14%)
Query: 139 LRILIVRGKLYVDPYYDC--VQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVI 194
+R +I G+LY+ Y SRA T+ + L +P ++P V+ +F D
Sbjct: 118 VRGIIENGELYIVDYGAMPFTFSRAKATLHSLQRALAAFPDRHLLPSVEFVFTTDD---- 173
Query: 195 DKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKH------GS 247
+ + P++ Y + + PD+ +W W EV + P +KDI+
Sbjct: 174 ----YSTGVGPIWSYSKREEDESVWLMPDFGYWSWPEVKIGP----YKDIRRRIAAVDNG 225
Query: 248 QAKSWKEKLP---FA------YWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
+ E +P FA W+G+ +R +L+K + W A + +W +E
Sbjct: 226 ETTVTGEVIPGTRFADKKKQLVWRGSVAPNPGIRGKLLKQTQGRSW-ASVRVLDWDDEND 284
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PT 355
F + C + + + EG ++S KY+L+C SV + +++ LI P
Sbjct: 285 LRFNLLPMEEHCRYMFLAHTEGRSFSGRGKYLLNCRSVVVSHRLIWREAHHAALIATGPE 344
Query: 356 KNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMES-LTMDRVYDYMLHLI 411
+N+ + +DL R I+ ++D P AE+I A + F + LT Y +LI
Sbjct: 345 QNYVEVERDFSDLQRKIEFLID----RPETAERIADNAVRTFRDRYLTPAAESCYWRYLI 400
Query: 412 TEYSKLLDYKP 422
Y+ +++P
Sbjct: 401 RAYAAACEFQP 411
>gi|260810659|ref|XP_002600074.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
gi|229285359|gb|EEN56086.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
Length = 351
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 52/229 (22%), Positives = 104/229 (45%), Gaps = 15/229 (6%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV------ 231
+PD++++ + D P + P+ F N+ H+DI +P W+FW G V
Sbjct: 84 LPDMEMIINVRDWPQSPMYQREPRPVMSFSKTFNE-HWDIMYPAWTFWEGGPAVWPLFPT 142
Query: 232 NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQ 291
L W+ I ++ W++K A+++G+ + L+ + +L A+ +
Sbjct: 143 GLGRWDLMRDSITKANEDWPWEKKEDKAFFRGSRTSAERDPLVLLSRAEPELVDAQYTKN 202
Query: 292 NWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ KD + KL + C ++Y G A S K++ C S + + + +
Sbjct: 203 QAWKSDKDTLGMPAATEIKLEDHCQYKYLFNFRGVAASFRFKHLFLCRSTVFHVGEGWLE 262
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
FF L P ++ P+ S DL + ++ ++++ AN A++I + G F+
Sbjct: 263 FFYPALKPWVHYIPV-SQDL-KDVRELIEFAKANQGVAQEIAERGFQFI 309
>gi|449303255|gb|EMC99263.1| hypothetical protein BAUCODRAFT_65196 [Baudoinia compniacensis UAMH
10762]
Length = 394
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 120/290 (41%), Gaps = 19/290 (6%)
Query: 88 MNPIPERRRVASPQKVQTC-PDFFKSIHKDLE-PWAKSRITMRHIMEAKRF-AALRILIV 144
M R S Q Q PD +K I + +E A +IT+ + + R LR +I
Sbjct: 1 MQATQSRNYGLSEQDCQAYFPDLYKEIDRAVEYRNAVGKITLDELDVSWRADGVLRGMIH 60
Query: 145 RGKLYV-DPYYDCVQS---RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHG 200
+LY+ D + C + R M T+ + + +PD++ F D ++D ++
Sbjct: 61 DNQLYIIDAHGVCDHNHRPRTMATLHAINRAVTASVEPLPDIEFTFVDHDTALLDDDDNH 120
Query: 201 SFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAY 260
+ R ++ + PD+ FW W E ++ ++E ++ + +K P
Sbjct: 121 T-TWAYSRLANQESLW--LMPDFGFWAWPEYGMRSYSELQANLD--ETEEHLLDKAPQIV 175
Query: 261 WKGNPDVL--SPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYA 318
W+G R L+K + + W A + +WA K ++ C + + +
Sbjct: 176 WRGGTKAGFGGHAREGLLKHSTGQSWSA-VEMIDWANRTDVSHKLITMAEHCGYMFAAHT 234
Query: 319 EGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
EG +S LKY+L+C+SV L + + F L P P P + R
Sbjct: 235 EGDTYSGRLKYLLNCHSVLLSHELHWIEHFHHLLDPG----PGPGQNYVR 280
>gi|91081993|ref|XP_969039.1| PREDICTED: similar to AGAP008037-PA [Tribolium castaneum]
gi|270007375|gb|EFA03823.1| hypothetical protein TcasGA2_TC013938 [Tribolium castaneum]
Length = 498
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/327 (21%), Positives = 138/327 (42%), Gaps = 24/327 (7%)
Query: 106 CPDFFKSIHKDLEP-----WAKSR-ITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQS 159
C + + I DL+P W K R ++ + + R +I ++Y Y V
Sbjct: 152 CGNTPRQILADLKPFQTVNWDKLRDKVIKKFDQPHSISLCRYVIKNNEIYRTCYGKYVGF 211
Query: 160 RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIP 219
+ MF L L R+ +PD++ + D P++ +K P+F +C + DI
Sbjct: 212 K-MFMDAILLSLSRKVN--LPDLEFFINLGDWPLVTEKIE---TFPIFSWCGSTTSLDIV 265
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
P + +E L+ D+ G+ +SW+ + A+W+G L + +
Sbjct: 266 MPTYDI---TESTLENMGRVMLDMLSVQGNVKESWENRTGQAFWRGRDSNQHRLDLIDIA 322
Query: 278 CNDSKLWGAEILRQNWAEEAKD--GFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSC 333
L+ + + + +D G K +S + +++Y++ +G + Y+L+
Sbjct: 323 RKHPDLFNVSLTNFFFFRDKQDVYGPKSEHVSFFSFFDYKYQLALDGTVAAYRFPYLLAG 382
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
S+ + QY + F LIP H+ + DL + + + W N EA+ I GQ
Sbjct: 383 GSLVIKQESQYYEHFYNDLIPN-THYILMKRDLS-DLVAKLQWSIQNDKEAQIIASNGQK 440
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKLLD 419
F E+L ++ Y HL+ ++ ++
Sbjct: 441 FANENLLPQHIFCYHAHLLHQFGTRIE 467
>gi|255938965|ref|XP_002560252.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584874|emb|CAP82911.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 458
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/301 (22%), Positives = 124/301 (41%), Gaps = 36/301 (11%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDCVQSRAMF- 163
P + I + +E W ++ RI+ + + R +I G LYV V +RA
Sbjct: 108 PGLSQDIQRGVEYWKSRGRISREDLRTIPFKDGMARAIISNGDLYV------VATRAKGD 161
Query: 164 ----TIWGFLQLLRRYPGMV------PDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTND 213
I G L + R P ++ +F D+ +D + S P+ + ++
Sbjct: 162 DHRRKILGTLGSIHRALSASSNRTSHPTIEFIFSIEDR--VDDVDAVSHPVWVLSRKASE 219
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
I PD+ +W W + N+ P+ + + I + +K W+G LR
Sbjct: 220 ESV-ILMPDFGYWSWDKSNIGPYGQVVQSIMAAESNLKFADKEQKLVWRGKLSFAPKLRR 278
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
L+ K W +++ +W++ A + + C + + + EG A+S SLKY +C
Sbjct: 279 ALLDIARGKPW-SDVKELDWSKRA----NFLSMEDHCRYMFIGHVEGRAYSASLKYRQAC 333
Query: 334 NSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
SV + QY L+ P +N+ + +DL + + ++ NP +AE+I
Sbjct: 334 RSVVVAHQLQYIQHHHYLLVSSGPEQNYVEVERDFSDLPKRMDELL----KNPDKAERIA 389
Query: 389 K 389
Sbjct: 390 N 390
>gi|189202768|ref|XP_001937720.1| hypothetical protein PTRG_07388 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984819|gb|EDU50307.1| hypothetical protein PTRG_07388 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 486
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 76/340 (22%), Positives = 140/340 (41%), Gaps = 38/340 (11%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAK------RFAALRILIVRGKLYVDPYY---DC 156
C F ++D+E K R + EA+ +R ++ +LY+ + +
Sbjct: 125 CTSAFPGQYEDIERAVKVRDRWGKVTEAELNSFELTKGMVRAMVFDRELYILETHMVDNV 184
Query: 157 VQSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA 214
+ +A+ + + L P +P+++ F D P K P+ D
Sbjct: 185 NRQKALAILSAIYRALLAAPANADMPNIEFAFSVEDLPAQPAK-----PIWSLTRRVQD- 238
Query: 215 HFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
H PD+ FW W L +E ++ + W +K+ W+G LR
Sbjct: 239 HSLWLMPDFGFWSWDMPALGTLDEVAREAVQRETREPWNQKIEKLVWRGKITFAPKLRRA 298
Query: 275 LMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILS 332
L+ K W +++ + W + FK L +QCN+ + +AEG ++S SLKY
Sbjct: 299 LLDAAKGKPW-SDVGQLKWTDP---NFKDQFLGPVDQCNYMFIAHAEGRSYSGSLKYRQL 354
Query: 333 CNSVALIISQ-----QYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
C SV I+S Q+ + R +N+ + +DL +++ ++D +P +A
Sbjct: 355 CRSV--IVSHKLQWIQHHHYLFRANGSNQNYVEVERDFSDLASAVEDLLD----HPEKAR 408
Query: 386 KIG-KAGQDFMES-LTMDRVYDYMLHLITEYSKLLDYKPA 423
+I + Q F E LT Y L+T + ++ ++P
Sbjct: 409 RIADNSVQVFRERYLTQAAEACYWRRLMTRWKDVMSWEPV 448
>gi|302851936|ref|XP_002957490.1| hypothetical protein VOLCADRAFT_98596 [Volvox carteri f.
nagariensis]
gi|300257132|gb|EFJ41384.1| hypothetical protein VOLCADRAFT_98596 [Volvox carteri f.
nagariensis]
Length = 413
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 86/193 (44%), Gaps = 10/193 (5%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD--WSFWGWSEVNLQPWNE 238
++ +F D P++D + LP F+ C + D+P PD W + ++ W E
Sbjct: 134 ELKFIFGTEDWPIVDSWK--KLRLPSFQMCHSPDTADVPVPDFTWEQYSQAQYTNNSWWE 191
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDV-----LSPLRVEL-MKCNDSKLWGAEILRQN 292
+ + S W+ + + +G+ V L PL E+ + +D L+G ++ ++
Sbjct: 192 VRRLLLLKSAMLPWRLRERDLFMRGDAGVGYRKVLMPLMHEVQVNRSDIALFGIKVNFRS 251
Query: 293 WAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL 352
D S L N C HRY ++ G +S SLKY L+C +V + +++F+ L
Sbjct: 252 TGFYVSDLKHFSWLDNWCQHRYLVHTSGLTYSASLKYKLACGAVVVNFKGDFQEFYYPAL 311
Query: 353 IPTKNHFPIPSAD 365
+ P AD
Sbjct: 312 QHGVHLLSFPEAD 324
>gi|452984518|gb|EME84275.1| hypothetical protein MYCFIDRAFT_118382, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 387
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 98/205 (47%), Gaps = 11/205 (5%)
Query: 139 LRILIVRGKLYV-DPYYDCVQS---RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI 194
+R +I +LYV DP+ + R + T+ + + G +PD++ F D +
Sbjct: 68 VRAMIHDNQLYVIDPHAVTDHNHRPRTLATLHAIHRAVSASSGKLPDIEFSFTVHDFALH 127
Query: 195 DKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKE 254
D+ +G+ + + + PD+ WGW +V L+ + EF+ + + + +
Sbjct: 128 DR--YGNHTTWAYTRLPHQEKLWL-MPDFGLWGWPDVGLRSY-AEFQTVL-DYEEDEFVD 182
Query: 255 KLPFAYWKGNPDVLS-PLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
K+P W+G+ V S +R L+K ++ + W A++L +W+ + + + C +
Sbjct: 183 KIPKLVWRGSLAVGSHDVRAGLVKHSEKQPW-ADVLELDWSNRTNIEERLLSMQDHCAYM 241
Query: 314 YKIYAEGYAWSVSLKYILSCNSVAL 338
+ EG +S LKY+L+C SV L
Sbjct: 242 FVAQTEGNTYSGRLKYLLNCRSVVL 266
>gi|391340384|ref|XP_003744522.1| PREDICTED: O-glucosyltransferase rumi homolog [Metaseiulus
occidentalis]
Length = 418
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/341 (19%), Positives = 143/341 (41%), Gaps = 33/341 (9%)
Query: 110 FKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIW--G 167
++ I +DL+P+++ I+ + A +F +R I++G+++ Q MF G
Sbjct: 68 YRQIRRDLKPFSEG-ISREDLNRAAKFG-VRYQIIQGRVFR-------QKECMFPFRCSG 118
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC-TNDAHFDIPFPDWSFW 226
L R +PD++ + + D P + G LP+F + ++DI +P W+FW
Sbjct: 119 IEHFLLRLAPKLPDLEFIVNVRDYP---QSPVGREKLPVFSFSKVPKNYYDILYPAWTFW 175
Query: 227 -GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGN-----PDVLSPLRVEL 275
G +++ P W+ K + S+ W K+ +++G+ D L L L
Sbjct: 176 EGGPAISIYPRGIGRWDIRSKSLIAESERVPWDAKVDRGFFRGSRTSAERDPLIKLSRRL 235
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+ D+ + + + + + C ++ G A S K++ C S
Sbjct: 236 PELVDAMYTKNQAWKSDRDTLGAPPAAEVSFEDHCKFKFLFNFRGVAASFRFKHLFLCKS 295
Query: 336 VALIIS-----QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKA 390
+ + +FF L P K++ PI S + +++V+++ + ++ I +
Sbjct: 296 LVFHFGPTQSENDWIEFFYPMLKPWKHYIPI-SQNEVDQLETVIEYFKEHQEVSQSIAER 354
Query: 391 G-QDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFE 430
G + L M + Y L+ Y+ L+ ++ + +E
Sbjct: 355 GFEAIQRHLRMKDIQCYWRKLLKSYAALMKFEVQRDRTLYE 395
>gi|363729201|ref|XP_003640613.1| PREDICTED: KDEL motif-containing protein 2 [Gallus gallus]
Length = 499
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 132/327 (40%), Gaps = 18/327 (5%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
Q V +CP I KD + + +F+ R IV + + Y +
Sbjct: 143 QNVFSCPSQEPQITKDFISFPTIDLQRMFKEIPTKFSQTRGAIVHYTIVNNHIYRRSLGK 202
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ +P+V+ + D PV +K + + P+P+ +C +
Sbjct: 203 YTDFKMFSDEMLLSLARKV--HLPNVEFYLNVGDWPVEYRKVNDTPGPIPIISWCGSVDS 260
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + L+ + I+ G+ W K A ++G L +
Sbjct: 261 RDIVLPTYDVTHSTLETLRGVTNDLLSIQ-GNTGPFWDNKTEQALFRGRDSREERLYLVK 319
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + E + K L + ++Y++ +G + Y+L
Sbjct: 320 LSKENPELLDAGITGYFFFREKEKELGKVPLMGFFDFFKYKYQVNVDGTVAAYRFPYLLL 379
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPSA--DLCRSIKSVVDWGNANPSEAEKIGKA 390
+S+ L QY + F L P K++ P+ + DL IK W N EA+KI K
Sbjct: 380 GDSLVLKQDSQYYEHFYTVLKPWKHYVPVKRSLEDLLEKIK----WAKENDEEAQKIAKE 435
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSK 416
GQ E L R+Y Y ++ +Y++
Sbjct: 436 GQSVARELLQPHRLYCYYYKVLQKYAE 462
>gi|260810693|ref|XP_002600088.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
gi|229285373|gb|EEN56100.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
Length = 351
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/229 (22%), Positives = 104/229 (45%), Gaps = 15/229 (6%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV------ 231
+PD++++ + D P + P+ F N+ H+DI +P W+FW G V
Sbjct: 84 LPDMEMIINVRDWPQSPMYQREPRPVMSFSKTFNE-HWDIMYPAWTFWEGGPAVWPLFPT 142
Query: 232 NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQ 291
L W+ I ++ W++K A+++G+ + L+ + +L A+ +
Sbjct: 143 GLGRWDLMRDSITKANEDWPWEKKEDKAFFRGSRTSAERDPLVLLSRAEPELVDAQYTKN 202
Query: 292 NWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ KD + KL + C ++Y G A S K++ C S + + + +
Sbjct: 203 QAWKSDKDTLGMPAATEIKLEDHCQYKYLFNFRGVAASFRYKHLFLCRSTVFHVGEGWLE 262
Query: 347 FFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
FF L P ++ P+ S DL + ++ ++++ AN A++I + G F+
Sbjct: 263 FFYPALKPWVHYIPV-SQDL-KDVRELIEFAKANQGVAKEIAERGFQFI 309
>gi|154310746|ref|XP_001554704.1| hypothetical protein BC1G_06847 [Botryotinia fuckeliana B05.10]
Length = 438
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/244 (20%), Positives = 104/244 (42%), Gaps = 13/244 (5%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA-LRILIVRGKLYVDPYYDCVQSRAMFTI 165
P F+ + + ++ ++T + + + +R +I +LY+ + SR + T+
Sbjct: 95 PGLFEEVERPVKLRKNKKVTKKELDDTPALNGFIRAMIFDQQLYIIDTSGKIYSRGIATL 154
Query: 166 WGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
+ + P +P+++ + +D K G R N + +P ++ F
Sbjct: 155 HALHRAMLTSPEPLPNIEFTMN------VDDKMEGHPQWLYARVAKNQETWLMP--EYGF 206
Query: 226 WGWSEVNLQPWNE-EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
W W E + + E + K I S+ W K+ W+G L +R + + + K W
Sbjct: 207 WSWPETKIGSYGEMQMKAILTESEW-PWSRKIDKLLWRGATMNLE-VRKKFVNVTEGKAW 264
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
A++ +W E + C +++ + EG ++S LKY+ +C SV + ++
Sbjct: 265 -ADVKTLDWHNEGSMKNDLKSMDEHCQYKFLAHTEGNSYSARLKYLRNCRSVIVAHKLEW 323
Query: 345 KDFF 348
+FF
Sbjct: 324 MEFF 327
>gi|313232413|emb|CBY24081.1| unnamed protein product [Oikopleura dioica]
Length = 536
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 130/307 (42%), Gaps = 41/307 (13%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIME---AKRFAA-------LRILIVRGKLYVDPYYD 155
CP F+ + +DL + I + + E K++AA ++I KL+V+ +
Sbjct: 172 CPANFEQLDEDLAQF--DEIDLDRLYEDGKKKKWAAGGHGEATAHVVIKDQKLFVEDFGT 229
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
+ R + FL LLR+ +PD + +F+ D P+ +E+ + P P+ +C +
Sbjct: 230 IMGFRGFMSSM-FLSLLRKVK--LPDAEFIFNLGDWPL---EENLTDPQPILTWCGSSNT 283
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAK----SWKEKLPFAYWKGNPDVLSPL 271
DI P W ++ E KDI++ Q SW EK+ Y++G +P
Sbjct: 284 SDIAVPTWD---QTKNTRHALFRERKDIQYVEQISGEVVSWNEKIERGYFRGRDS--NPS 338
Query: 272 RVELMKCNDSKLWGAEI-LRQNWAEEAKDGFKKSKLSNQCNH---------RYKIYAEGY 321
R++L C S +I R W K G K Q H +Y++ +G
Sbjct: 339 RLKL--CELSMAHPEDIDARLTWNLHNKKGQDPKKYGEQVKHVSYPEMGKFKYQVLVDGT 396
Query: 322 AWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANP 381
++ +SV L Y +++ R + P ++ PI DL ++ ++W N
Sbjct: 397 VAPYRTALLMQMDSVILKQKSMYYEWWYRYMKPWQHFIPI-EEDLS-DLREKIEWARNND 454
Query: 382 SEAEKIG 388
+A +I
Sbjct: 455 EKARQIA 461
>gi|354481242|ref|XP_003502811.1| PREDICTED: KDEL motif-containing protein 2-like [Cricetulus
griseus]
Length = 472
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/335 (21%), Positives = 142/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I +D + I ++ +++ KRF R +V + + +
Sbjct: 116 QKTLSCPATEPQIEQDFTSFPS--IDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSL 173
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ +PD++ + D P+ +K + + P+P+ +C +
Sbjct: 174 GKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTPGPIPIISWCGSL 231
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + ++ G+ SW K A+++G L++
Sbjct: 232 DSRDIILPTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSREERLQL 290
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 291 VQLSQENPQLLDAGITGYFFFQEKERELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 350
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 351 MLGDSLVLKQDSPYYEHFYVALRPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 406
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R++ Y ++ +Y+ KP
Sbjct: 407 KEGQLTARDLLQPPRLFCYYYKVLQKYAARQASKP 441
>gi|110776820|ref|XP_624980.2| PREDICTED: KDEL motif-containing protein 1-like [Apis mellifera]
Length = 497
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 116/251 (46%), Gaps = 15/251 (5%)
Query: 178 MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWN 237
++PD++ + D P++ K+ P+F +C + DI P + +E +L+
Sbjct: 224 LLPDIEFFVNLGDWPLVPKEGKN---YPIFSWCGSFDTKDIVMPTYDI---TESSLEAME 277
Query: 238 EEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAE 295
D+ GS WKEK+ +W+G L + + L+ I + +
Sbjct: 278 RVMLDMLSVQGSTDTPWKEKIEKVFWRGRDSRRERLDLIDISRKYPDLFNVSITNFFFFK 337
Query: 296 EAKD--GFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG 351
+ KD G +S +S N ++Y++ +G + Y+L+ +S+ L +Y +FF
Sbjct: 338 DEKDKYGPGQSHVSFFNFFKYKYQLNIDGTVAAYRFPYLLAGDSLVLKQDSRYYEFFYND 397
Query: 352 LIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHL 410
LIP +++ + + DL I+ ++ W + EA +I K+ + F ++L + Y + L
Sbjct: 398 LIPGEHYISVKN-DLSNLIERIM-WAKEHDQEALQIVKSARKFARDNLLPHNILCYHVAL 455
Query: 411 ITEYSKLLDYK 421
E+SK L K
Sbjct: 456 FHEWSKRLKSK 466
>gi|452843550|gb|EME45485.1| hypothetical protein DOTSEDRAFT_71260 [Dothistroma septosporum
NZE10]
Length = 456
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 109/241 (45%), Gaps = 20/241 (8%)
Query: 107 PDFFKSIHK----DLEPWAKSRITMRHIMEAKRFAAL-RILIVRGKLYVDPYYDCVQS-- 159
PD +K I + L+ W IT + A R + R L+ +LY+ + V
Sbjct: 83 PDLYKEIDRAVQYRLDTWGN--ITPDELDVAWRGDGIVRALVHDNQLYIIDAHGVVDHNH 140
Query: 160 --RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
R + T+ + + Y G +P+++ F D + D + + + Y
Sbjct: 141 RPRTVATLHSLHRAISAYQGKLPNIEFTFTVHDAALRDPAGNET----TWAYTRRKHQEK 196
Query: 218 I-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP-LRVEL 275
+ PD+ WGW +V L+ + E + ++H + + +K+P W+G+ V S +R L
Sbjct: 197 LWLMPDFGLWGWPDVGLRSFAELQEVLEH--EEDEFVDKVPKLVWRGSVAVGSKDVRHGL 254
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+ ++ K W +++ +W + + + + C++ + EG +S LKY+L+C+S
Sbjct: 255 VDHSEGKSW-SDVRALDWGNKTNIDERLLSMQDHCSYMFVAQTEGNTYSGRLKYLLNCHS 313
Query: 336 V 336
V
Sbjct: 314 V 314
>gi|344243629|gb|EGV99732.1| KDEL motif-containing protein 2 [Cricetulus griseus]
Length = 452
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/335 (21%), Positives = 142/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I +D + I ++ +++ KRF R +V + + +
Sbjct: 96 QKTLSCPATEPQIEQDFTSFPS--IDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSL 153
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ +PD++ + D P+ +K + + P+P+ +C +
Sbjct: 154 GKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTPGPIPIISWCGSL 211
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + ++ G+ SW K A+++G L++
Sbjct: 212 DSRDIILPTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSREERLQL 270
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 271 VQLSQENPQLLDAGITGYFFFQEKERELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 330
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 331 MLGDSLVLKQDSPYYEHFYVALRPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 386
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R++ Y ++ +Y+ KP
Sbjct: 387 KEGQLTARDLLQPPRLFCYYYKVLQKYAARQASKP 421
>gi|407921029|gb|EKG14198.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 441
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 130/275 (47%), Gaps = 24/275 (8%)
Query: 120 WAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMV 179
W K IT I + + L +L +G + Y + R + + + + Y G +
Sbjct: 115 WRKEGITRCMIYDHQ----LYVLETKGTTHRRDY----RERTLAVLHNMHRAITAYNGPL 166
Query: 180 PDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNE 238
P+++ F +D V D++ + P+ ++ + + ++ PD+ +W W + + +
Sbjct: 167 PNIEFTF-SVDDWVYDEENLNTDPV-VWGFTRQQSWENVWLMPDFGYWAWPTEPVGAYQD 224
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
+ +A+++ EK P W+G L+ R L+K K W ++I +W++
Sbjct: 225 VRNQMGVREKAQAFSEKKPKVVWRGA--ALTEQRQALIKQWHGKPW-SDIEPFDWSDPEI 281
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PT 355
+ K + + C +Y ++ EG ++S LKY+ +C+SV +I + + + LI P
Sbjct: 282 EK-KFLIMPDHCQWQYLLHTEGRSYSGRLKYLQNCHSVPIIPELNWVEPHHQLLIAQGPA 340
Query: 356 KNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+N+ P+ +DL ++ ++D +P EAE+I
Sbjct: 341 RNYVPVKYDFSDLGEKMQFLLD----HPDEAERIA 371
>gi|358058341|dbj|GAA95860.1| hypothetical protein E5Q_02517 [Mixia osmundae IAM 14324]
Length = 492
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/244 (23%), Positives = 99/244 (40%), Gaps = 12/244 (4%)
Query: 107 PDFFKSIHKDLEPWA-KSRITMRHIMEAKRFAALRILIVRGKLYV-DPYYDCVQSRAMFT 164
P F + + W K I R++ + R++I ++Y+ P+ +SR
Sbjct: 160 PHLFDQLIESRRFWERKGGIQERYLDTTEHGMNARVIIKSNRVYLRKPFNAGAKSRTQAL 219
Query: 165 IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWS 224
+ + + +PDV+ + + D+ +D + L + R + IP D+
Sbjct: 220 LAAIEEAVLSSIEPIPDVEFVINSEDR--VDAATAHTPILGMSRKKQQGHVWLIP--DFR 275
Query: 225 FWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
F+ W E ++ + + I + W K +W+GNP L PLR ELM W
Sbjct: 276 FYAWPEPHVGTYPDVQDQIYALEATQQWHHKRAKLFWRGNPP-LHPLRQELMTKFAKSAW 334
Query: 285 GAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
AE+ +W K L+ C+ RY + EG + L YI C SV + +Y
Sbjct: 335 -AEVSPIDW----KHTTNLLALAEHCHWRYLLNVEGVSTGGRLPYISQCKSVVITHQLEY 389
Query: 345 KDFF 348
F
Sbjct: 390 AQHF 393
>gi|442757169|gb|JAA70743.1| Hypothetical protein [Ixodes ricinus]
Length = 507
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/302 (21%), Positives = 126/302 (41%), Gaps = 20/302 (6%)
Query: 96 RVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYD 155
R PQ + DF K K + A R + +A +V+ ++Y Y
Sbjct: 160 RAVDPQILADLADFEKVDMKKVLQDALQRFDRPGSVSFCHYA-----VVKNQVYRKCYGQ 214
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
V MF L L R+ ++PDV+++ + D P +++K++ P+P F +C +++
Sbjct: 215 HV-GFNMFMDQILLSLARKV--VLPDVEMLVNLGDWP-LERKDYWGKPVPFFSWCGSNST 270
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + +E +L+ D+ G +WK+K P +W+G L +
Sbjct: 271 RDIVMPTYDL---TESSLEMMGRVTLDMLSVQGHGGPAWKDKEPSGFWRGRDSRQERLDL 327
Query: 274 ELMKCNDSKLWGAEILR----QNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
+ +L A + ++ EE + H+Y+I +G + L Y
Sbjct: 328 VALSRRYPELLNASLTNFFFFRDKMEEYGPQASHISFFDFFRHKYQINVDGTVAAYRLPY 387
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGK 389
+L+ + + L +Y + F L+ +++ P DL ++ +DW N ++ +
Sbjct: 388 LLAGSGLVLKQDSEYYEHFYPRLVAMEHYVPF-RRDLSDLVEK-LDWARKNDERVQRHCE 445
Query: 390 AG 391
G
Sbjct: 446 VG 447
>gi|407916819|gb|EKG10149.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 538
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 137/311 (44%), Gaps = 46/311 (14%)
Query: 133 AKRFAALRILIVRGKLYV--DPYYDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDC 188
K A ++I G+L V + + +SR + + + + +P +P+V+ + D
Sbjct: 136 GKTRAEFHLMIYNGELIVIDEAKGEPDRSRGLAAMASMYRAINAHPNPRDIPNVEFILDL 195
Query: 189 MD--KPVIDKKEHGSFPLPLFRYCTNDAHFDIPF----PDWSFWGWSEVNLQPWNEEFKD 242
D +P D K R+ T H D P+ PD+ W + + + + + D
Sbjct: 196 HDNSQPGPDGK---------IRF-TWARHKDNPYMWVVPDFDGWTYPDDAVGSYVQFRND 245
Query: 243 IKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFK 302
+ K +++K+P W+G+ V LR LM ++ K W +++ +W +
Sbjct: 246 V--AEIEKPFEDKIPQLSWRGSLGVNHGLRQALMDASEGKGW-SDVKAIDWRTRSN---- 298
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI-----ISQQYKDFFSRG----LI 353
+ + CN++Y + EG WS L+Y+ +CNSV +I ++ Y G +
Sbjct: 299 VLAMKDFCNYQYVAHTEGNTWSGRLRYLHNCNSVPVIHELDWVAHYYPLLQDSGKYQNYV 358
Query: 354 PTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKI-GKAGQDFMES-LTMDRVYDYMLHLI 411
K F +DL ++ +VD +P A++I ++ Q F + LT Y +
Sbjct: 359 KVKRDF----SDLDEKMQYLVD----HPHVAKRIAAQSAQTFRDRYLTPAAEACYWRRMF 410
Query: 412 TEYSKLLDYKP 422
Y+ +LD++P
Sbjct: 411 AHYASVLDFEP 421
>gi|327273952|ref|XP_003221743.1| PREDICTED: KDEL motif-containing protein 2-like [Anolis
carolinensis]
Length = 489
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 63/268 (23%), Positives = 110/268 (41%), Gaps = 14/268 (5%)
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAHFDIPF 220
MF+ L L R+ +PDV+ + D P+ +K + + P+P+ +C + DI
Sbjct: 198 MFSDEMLLSLSRKV--RLPDVEFYVNVADWPIEHRKANDTPGPVPILSWCGSVDSADIVL 255
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
P + + L+ + ++ G+ W+ K +++G L + + +
Sbjct: 256 PTYDVTHSTLETLRGVTNDLLSVQ-GNTGPVWENKTEQGFFRGRDSREERLLLVKLSKEN 314
Query: 281 SKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILSCNSVA 337
+L A I + E + K L + ++Y++ +G + Y+L +SV
Sbjct: 315 PELLDAGITGYFFFREKEKELGKVPLMGFFDFFKYKYQVSVDGTVAAYRFPYLLLGDSVV 374
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ-DF 394
L S Y ++F + L P ++ PI DL IK W N KI K GQ
Sbjct: 375 LKQSSPYYEYFYKELSPWSHYIPIKRNLEDLLEKIK----WVKENDEVVRKIAKEGQMTA 430
Query: 395 MESLTMDRVYDYMLHLITEYSKLLDYKP 422
E L R Y Y + EY++ KP
Sbjct: 431 RELLQPHRFYCYYFKVFQEYAERQTGKP 458
>gi|70999634|ref|XP_754534.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|66852171|gb|EAL92496.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|159127547|gb|EDP52662.1| DUF821 domain protein [Aspergillus fumigatus A1163]
Length = 445
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 101/220 (45%), Gaps = 21/220 (9%)
Query: 177 GMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQ 234
++PD + +F DK ++ PL + ++ + PD+ FW W ++ ++
Sbjct: 178 AVIPDTEFIFSVEDK--VEDIAGPGHPLWVLARKPHEESVWL-MPDFGFWSWGHLDSQIR 234
Query: 235 PWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWA 294
P+++ + ++ + W +K W+G LR L++ WG ++ W
Sbjct: 235 PYDQVVEHVRQ--REVPWDQKRDKLVWRGKLSFAPKLRRTLLEVARGYPWG-DVREVEWR 291
Query: 295 EEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI- 353
+A + C++++ + EG ++S SLKY +C SV +I QY L+
Sbjct: 292 NKAN----FLSMEEHCDYKFIAHVEGRSYSASLKYRQACQSVVVIHKLQYIQHHHYLLVS 347
Query: 354 --PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
P +N + +DL + ++ ++D NP++A++I +
Sbjct: 348 SGPQQNFVQVERDFSDLPQKMQELLD----NPAKAQRIAR 383
>gi|357608730|gb|EHJ66122.1| KDEL motif-containing protein 1 [Danaus plexippus]
Length = 234
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 87/193 (45%), Gaps = 12/193 (6%)
Query: 167 GFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW 226
G L+ +PD+D++ + D P +K + G P+F + +++DI +P WSFW
Sbjct: 33 GVEHYLKILSPTLPDMDLVINTRDWPQFNK-DWGHKKAPVFSFSKTRSYYDIMYPTWSFW 91
Query: 227 -GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND 280
G + L P W++ I ++ W +K A+++G+ + L+ +
Sbjct: 92 EGGPAIALYPTGIGRWDKHRTSISTAAEKWPWNKKEEKAFFRGSRTSEERDALILLSRSH 151
Query: 281 SKLWGAEILR-QNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+L A+ + Q W +A + + + C ++Y G A S LK++ C S
Sbjct: 152 PELVDAKYTKNQAWKSDADTLYAPPASEVSFEDHCKYKYLFNYRGVAASFRLKHLFLCKS 211
Query: 336 VALIISQQYKDFF 348
+ + ++ +FF
Sbjct: 212 LVFHVGDEWLEFF 224
>gi|358060127|dbj|GAA94186.1| hypothetical protein E5Q_00834 [Mixia osmundae IAM 14324]
Length = 395
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 83/176 (47%), Gaps = 16/176 (9%)
Query: 221 PDWSFWGW-SEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
PD+ F W +V+ W K +W+ K+P +W+G D +P R +L++
Sbjct: 129 PDFGFHSWHGDVDHGLWTSFRSSAKQIDDGLTWRAKIPKLFWRG--DDFTPARKQLVEQA 186
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA-- 337
+ W +++ WAE +++ K + + C + + EG ++S LKYIL+C SV
Sbjct: 187 RGREW-SDVESLLWAEPSRN--KAISMPDHCRYAFLAQTEGASYSGRLKYILNCRSVVIS 243
Query: 338 --LIISQQYKDFFSRGLIPTKNHFPIPSA--DLCRSIKSVVDWGNANPSEAEKIGK 389
L Q + + P +N +P + DL ++++++ A+ S AE I
Sbjct: 244 HPLHYQQHFHSLLNATQGPEQNIVILPHSFDDLSVTMQTLL----ADSSRAEAIAN 295
>gi|121719042|ref|XP_001276271.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119404469|gb|EAW14845.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 474
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 106/258 (41%), Gaps = 31/258 (12%)
Query: 107 PDFFKSIHKDLEPW-AKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
P ++ + + + W + + M + K ++ R LI RG+LYV +
Sbjct: 112 PGLYEDVLRGVAYWRSHGALAMEDLDNIKLKYGMARALIHRGELYVAAARSVGEDHRRKI 171
Query: 165 IWGFLQLLRRYPGMVP------DVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFD 217
I L + R P D + +F DK ++ P+ L R T +A +
Sbjct: 172 I-AVLSSIHRALVADPHRASRVDFEFVFSVEDK--VEDVTSADHPVWTLARTATEEAVW- 227
Query: 218 IPFPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
PD+ FW W + + P+++ I+ W+EK W+G P LR L
Sbjct: 228 -LMPDFGFWAWDNPDNAIGPYDQVVGYIRQAD--VPWEEKKRQLVWRGKPSFAPKLRRAL 284
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAE---------GYAWSVS 326
M+ K WG ++ + NW ++ + K+ + C + + + E G ++S S
Sbjct: 285 MEAARGKPWG-DVRQVNWQQDTSNAI---KMEDHCKYMFIAHVEVLNPMRCVVGRSYSAS 340
Query: 327 LKYILSCNSVALIISQQY 344
LKY +C SV + QY
Sbjct: 341 LKYRQACTSVIVAHKLQY 358
>gi|348504138|ref|XP_003439619.1| PREDICTED: KDEL motif-containing protein 2 [Oreochromis niloticus]
Length = 539
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 73/338 (21%), Positives = 139/338 (41%), Gaps = 31/338 (9%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIME------AKRFAALRILIVRGKLY---VD 151
Q V CP + I D + + I ++H+ + + R + I+ ++Y +
Sbjct: 183 QSVLQCPADDQQIVADFKSFPT--IDLQHLRQEVPHRFSNRGGLIHYAIINNQVYRRTLG 240
Query: 152 PYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCT 211
Y D MF+ L L R+ VPDV+ + D P+ K S +P+ +C
Sbjct: 241 KYTDF----KMFSDEMLLSLTRKV--RVPDVEFYINVGDWPLETKT---SAAVPILSWCG 291
Query: 212 NDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+ DI P + + L+ + ++ G+ W K A+++G L
Sbjct: 292 STDTRDIVLPTYEVTHSTLETLRGVTNDLLSVQ-GNTGPPWVNKTERAFFRGRDSREERL 350
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLK 328
++ + + +L A I + + + K+ L + ++Y++ +G +
Sbjct: 351 QLVSLSKKNPELLDAGITAWFFFRDQEKHVGKASLVGFFDFFKYKYQVNIDGTVAAYRFP 410
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
Y++ NS+ L QY + F L ++ P+ +DL I+ W N +EA++
Sbjct: 411 YLMLGNSLVLKQDSQYYEHFYSHLKAGTHYVPVKRNLSDLLEKIR----WAKENDAEAQE 466
Query: 387 IGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPA 423
I +AGQ E L R+Y Y ++ YS+ +P
Sbjct: 467 IARAGQAAARELLQPSRLYCYYYKVLHMYSERQTGQPT 504
>gi|189194465|ref|XP_001933571.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979135|gb|EDU45761.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 434
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 71/339 (20%), Positives = 141/339 (41%), Gaps = 32/339 (9%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYD-----CVQSRA 161
P+ + S+ + + ++ + I LR++I +G+++V V
Sbjct: 74 PELYHSLDESVNSRQGRKVHPKDIEILSGRCMLRVMIYQGEVFVTDAGKPEQCYVVNGNE 133
Query: 162 MFTIWGFLQLLRRYPGM-------VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA 214
I G L + R +P+++ D P K+ G+F F Y +
Sbjct: 134 RERILGTLAQIDRAVVTASTSDTPIPNIEFALSLDDLPR-RSKDKGTF----FGYTRKEG 188
Query: 215 --HFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+ DI P++++W W+ + WN K+I+ W +K P W+G ++ L
Sbjct: 189 PEYRDIWMMPNYAYWSWNYTHAPSWNAIRKEIQQKETEVPWSKKDPRVVWRGKVK-MAKL 247
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKS--KLSNQCNHRYKIYAEGYAWSVSLKY 329
R EL+K + + W ++ A D K L + C +++ I EG ++S LKY
Sbjct: 248 RQELIKVSKGQSWSD--IKPVVINNASDPHNKDVMNLRDFCGYKFTIQTEGTSYSGRLKY 305
Query: 330 ILSCNSVALIISQQYKDFFS---RGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
+ C S + ++++F + R P N+ I +++ +++S +++ + +AE
Sbjct: 306 LQLCRSALITHPLKWQEFHTHLMRLAGPDVNY--IEASENFGNLESAMEYYRDHDEDAEL 363
Query: 387 IGKAGQDFMES--LTMDRVYDYMLHLITEYSKLLDYKPA 423
I + + LT V Y L + + Y+P
Sbjct: 364 IARNSYETFTRRYLTPAAVTCYWRRLFVSWKSVQGYEPV 402
>gi|357622883|gb|EHJ74243.1| hypothetical protein KGM_01635 [Danaus plexippus]
Length = 460
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 129/305 (42%), Gaps = 31/305 (10%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMR----------HIMEAKRFAALRILIVRGKLYVDPYYD 155
C +K I KDL P+ R+ M+ H E+ F +I K+Y D Y
Sbjct: 150 CSQTYKQIEKDLIPF--QRVEMKEQIKKIVDKYHRPESTSFC--HYVIKENKIYRDCYGK 205
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
V MF L L R+ ++PD++++ + D P+I K L +F +C +D
Sbjct: 206 HV-GFNMFADNILLSLSRK--TVLPDMEMVINLGDWPLIHKNGE---KLAMFSWCGSDDT 259
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
DI P + + NL + ++ G+ + W ++ A W+G L++
Sbjct: 260 LDIVMPTYDITESTLENLGRVTLDTLSVQ-GNVERKWSDRETRAIWRGRDSRAERLKLID 318
Query: 276 MKCNDSKLWGAEILRQNW--AEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYIL 331
+ + L A + + +EAK G K +S +++Y+I +G + Y+L
Sbjct: 319 IARENPDLINASLTNFFFFREKEAKYGPKVPHISFFKFFDYKYQINVDGTVAAYRFPYLL 378
Query: 332 SCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+ + Y + F L +++ P+ S +DL IK W + +EA I +
Sbjct: 379 AGGGLVFKQDSSYYEHFYSKLTQWEHYVPVKSDLSDLVDKIK----WAKNHDTEAVDIAR 434
Query: 390 AGQDF 394
+ +DF
Sbjct: 435 SARDF 439
>gi|413923258|gb|AFW63190.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
Length = 103
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 41/65 (63%), Gaps = 1/65 (1%)
Query: 400 MDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLADPKQRQNLEKAAA-SPSPY 458
M+ VYDYM HL+TEY+KLL +KP P A E C ESL C A ++++ +E + S S
Sbjct: 1 MEYVYDYMFHLLTEYAKLLRFKPRKPPEAIEVCPESLACQAIGREKKFMEDSMVRSASDA 60
Query: 459 PPCTL 463
PC L
Sbjct: 61 GPCDL 65
>gi|149913517|ref|ZP_01902050.1| 3-hydroxybutyryl-CoA dehydrogenase [Roseobacter sp. AzwK-3b]
gi|149812637|gb|EDM72466.1| 3-hydroxybutyryl-CoA dehydrogenase [Roseobacter sp. AzwK-3b]
Length = 327
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 70/147 (47%), Gaps = 3/147 (2%)
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSN 308
++ W +K P A W+G + +P + DS +I + + AE ++
Sbjct: 146 SRCWADKRPSAVWRGMLNGQAPRIAAVRLYADSP--AHDIGQVHEAEGLAPPKGWLSIAQ 203
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
Q HRY + EG + +LK+I++ NSVAL +Y+ ++ G + HF ADL
Sbjct: 204 QLEHRYVLSLEGNDVATNLKWIMASNSVALAPKLEYETWYMEGRLEPGVHFVQIRADLS- 262
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ + W ANP E E+I + Q ++
Sbjct: 263 DLDERIAWCEANPQEMERIVRNAQTWV 289
>gi|240273509|gb|EER37029.1| DUF821 domain-containing protein [Ajellomyces capsulatus H143]
Length = 410
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 68/316 (21%), Positives = 127/316 (40%), Gaps = 34/316 (10%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPYYD 155
++ Q Q P + I K + IT + + + +R +I +G LYV + D
Sbjct: 88 LSRAQCQQAFPKLYIEIEKAVAARRGRNITFDELNSKPLKNSMVRAMIYQGALYVLNFED 147
Query: 156 C--VQSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCT 211
SRA ++ + L P +P+++ +F D + P P++ Y
Sbjct: 148 MRYTFSRAKASLNALNRALNAIPNRYEIPNIEFIFTTED--------YHDDPHPIWVYSK 199
Query: 212 NDAH-FDIPFPDWSFWGWSEVNLQPWN---EEFKDIKHGS-------QAKSWKEKLPFAY 260
+ + + PD+ +W W E+ + + I G+ A ++ K
Sbjct: 200 RETNGWAWLMPDFGYWSWPEIKAGQYRSIRQRIAAIDEGAIINGKPCTALRFQRKKKQLL 259
Query: 261 WKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEG 320
W+G LR +L+ K W A + NWA+E + + C + + + EG
Sbjct: 260 WRGAIATAPALRQKLLDVTKGKSW-ASVRALNWADETSMRDDYIPIEDHCKYMFLAHVEG 318
Query: 321 YAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVD 375
++S KY+ +C SV + +++ L+ P N+ + +DL + ++D
Sbjct: 319 RSYSGRGKYLQNCRSVIVAHQLVWREAHHGALVATGPEANYVKVRRDFSDLEAKMNYLLD 378
Query: 376 WGNANPSEAEKIGKAG 391
NP AEKI + G
Sbjct: 379 ----NPEVAEKIAENG 390
>gi|315050978|ref|XP_003174863.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311340178|gb|EFQ99380.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 464
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 74/171 (43%), Gaps = 12/171 (7%)
Query: 177 GMVPDVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEVN--L 233
G +P ++ +F D+ +D PL L R + + + IP D+ FW W +
Sbjct: 189 GTLPSIEFIFSIEDR--VDDVSGVGHPLWALARKASEQSIWLIP--DFGFWAWDNAANAI 244
Query: 234 QPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW 293
P+N+ I+ W KL W+G LR L+ WG ++ W
Sbjct: 245 GPYNQVVDRIQRQEVTVPWSSKLDKLVWRGKLSFAPKLRRNLLDIARGTTWG-DVKEIVW 303
Query: 294 AEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
+E K F + + C +++ + EG A+S SLKY +C SV + Q+
Sbjct: 304 SE--KQNF--ISMDDHCKYKFIAHVEGRAYSSSLKYRQACRSVVVAHKLQF 350
>gi|307170880|gb|EFN62991.1| KDEL motif-containing protein 1 [Camponotus floridanus]
Length = 458
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 72/329 (21%), Positives = 131/329 (39%), Gaps = 24/329 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRI-TMRHIMEAKRFAALRILIVRGKLYVDPYYD-CVQSRAMF 163
C + IH DL P+ +RH + + + I L + Y C F
Sbjct: 111 CRQNYTQIHNDLSPFLNIDFDKIRHSIIKRYDQPTSVSICHYVLRSNRIYRRCYGQYVGF 170
Query: 164 TIWG---FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIP 219
I+ L L R+ ++PD++ + D P++ + PL P+F +C +D DI
Sbjct: 171 KIFMDAILLSLARKV--VLPDIEFFVNLGDWPLVPD----TGPLYPIFSWCGSDNTKDIV 224
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
P + +E +L+ D G+ SWK K +W+G L + +
Sbjct: 225 MPTYDI---TESSLEAMGRVMLDTLSVQGNTGLSWKNKTEQLFWRGRDSRRERLNLIDIS 281
Query: 278 CNDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+L+ I + + D + N ++Y++ +G + Y+L+
Sbjct: 282 RKHPELFNVSITNFFFFRDEMDKYGPVQNHVSFFNFFKYKYQLNIDGTVAAYRFPYLLAG 341
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+S+ Y +FF + L P ++ P+ S DL + ++ W + + KI K+ +
Sbjct: 342 DSLVFKQESNYYEFFYKDLTPGLHYVPVKS-DLSDLVDKII-WAKEHDEDGLKIVKSARQ 399
Query: 394 FMESLTMDR-VYDYMLHLITEYSKLLDYK 421
F + R + Y L E+SK L K
Sbjct: 400 FARDNLLPRDILCYYTVLFHEWSKHLKSK 428
>gi|301606930|ref|XP_002933070.1| PREDICTED: KDEL motif-containing protein 1-like [Xenopus (Silurana)
tropicalis]
Length = 585
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 143/344 (41%), Gaps = 51/344 (14%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
+C + I +DL+P++ + +RF + L I K+Y+ Y + V R
Sbjct: 237 SCNESIPQIQRDLKPFSAIDPDKISVEVPERFGQRQSLCHYTIKNNKIYIKTYGEHVGFR 296
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L + VPD++ + D P ++KK+ G P+ +C + DI
Sbjct: 297 -IFMDSLLLSLTSKVK--VPDIEFFVNLGDWP-LEKKKTGDIH-PILSWCGSSDSKDIVM 351
Query: 221 PDWSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ DI + W+EK A+W+G R+EL+K
Sbjct: 352 PTYDL---TDSILETMGRVSLDILSVQANCGPKWEEKNSTAFWRGRDSCKE--RLELVKL 406
Query: 279 -------------------NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAE 319
+D L+G + + D FK ++Y+I +
Sbjct: 407 SRKHPDLIDAAFTHFFFFKHDESLYGPIVQPIPFF----DFFK---------YKYQILID 453
Query: 320 GYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNA 379
G + + Y+L+ NSV L Y + F + L P K++ P DL ++ + W
Sbjct: 454 GTVAAYRMPYLLAGNSVILKQDSVYYEHFYKDLQPWKHYVPF-KRDLSDLLEK-IHWVKD 511
Query: 380 NPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+ ++A+ I +AG++F +L D ++ Y L Y+ L KP
Sbjct: 512 HDADAKLIAEAGREFARNNLMGDHIFCYYFKLFQAYASLQISKP 555
>gi|225556827|gb|EEH05114.1| DUF821 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 458
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 105/239 (43%), Gaps = 15/239 (6%)
Query: 107 PDFFKSIHKDLEPWAK-SRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
P F+ +++ ++ W +T + + R R +I G+LYV +
Sbjct: 104 PGLFEDVNRAVKYWRHLGGLTEKQLNVINLRNGMARGMINGGELYVVETRAAQEDHRRKI 163
Query: 165 IWGFLQLLRRYPG---MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+ F + R P +PD++ +F D+ + D K G L R + ++ + P
Sbjct: 164 LGIFSSIYRALPADRRGLPDIEFIFSIEDR-LDDIKGSGQPIWVLGRKASEESVW--LMP 220
Query: 222 DWSFWGWSE--VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
D+ FW W V + ++E + I+ A W K W+G+ + +R L++
Sbjct: 221 DFGFWAWHNPSVVIGTYDEVVRKIEQREDAIPWALKQRKLVWRGSLNYAPKMRRRLLEVA 280
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
K WG ++ W+ +K+ + + CN+ + + EG ++S SLKY +C SV +
Sbjct: 281 RGKTWG-DVKEIMWS--SKNNL--ISMEDHCNYMFIAHVEGRSFSSSLKYRQTCRSVVV 334
>gi|410907365|ref|XP_003967162.1| PREDICTED: KDEL motif-containing protein 2-like [Takifugu rubripes]
Length = 496
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 73/330 (22%), Positives = 131/330 (39%), Gaps = 31/330 (9%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIME------AKRFAALRILIVRGKLY---VD 151
+ CP I KD E + I ++H+ + A R + I+ LY +
Sbjct: 140 ESTMRCPAEEPQILKDFESFPA--IDLQHLRQEVPRRFANRGGLVHYAIINNHLYRRTLG 197
Query: 152 PYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCT 211
Y D MF+ L L R+ VPDV+ + D P+ + LP+ +C
Sbjct: 198 KYTDF----KMFSDEMLLSLTRKVK--VPDVEFYINVGDWPLETRTVD---VLPILSWCG 248
Query: 212 NDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPL 271
+ DI P + + L+ + ++ G+ W K A+++G L
Sbjct: 249 STDTRDIVLPTYDVTHSTLETLRGVTNDLLSVQ-GNTGPPWVNKTARAFFRGRDSREERL 307
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLK 328
+ + + +L A I + + + K+ L + ++Y++ +G +
Sbjct: 308 HLVSLSKKNPELLDAGITAWFFFRDKEKHVGKAALVGFFDFFKYKYQVNMDGTVAAYRFP 367
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
Y++ +S+ L QY +FF L ++ P+ +DL IK W N + A+K
Sbjct: 368 YLMLGDSLVLKQDSQYYEFFYSHLKAGTHYVPVKRNLSDLLDKIK----WAQENDARAQK 423
Query: 387 IGKAGQDFM-ESLTMDRVYDYMLHLITEYS 415
+ AGQ E L R+Y Y ++ YS
Sbjct: 424 MAAAGQMLARELLQPSRLYCYYYRVLHTYS 453
>gi|407921427|gb|EKG14575.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 490
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 99/221 (44%), Gaps = 16/221 (7%)
Query: 139 LRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKP--VIDK 196
L IL RG D + R + + + + YPG +P+++ F D +
Sbjct: 128 LFILNARGARRRD-----YRQRTLAVLQSIQRAITAYPGDIPNIEFSFVVDDGAYFAVYN 182
Query: 197 KEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKL 256
E + L R +D + +P D+ F+ W E +N + I+ +++K
Sbjct: 183 NETSATTWALTREPQDDNLWLMP--DFGFYSW-EGPAGEYNALLRAIEQDEM--PFEQKD 237
Query: 257 PFAYWKGN--PDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRY 314
P A W+G P +R +L+K + K W A+I W E + +S C + +
Sbjct: 238 PRAIWRGAKAPAGHVQVRSDLLKVSKGKEW-ADIEEIIWGGEGEPK-NLIPMSRHCKYMF 295
Query: 315 KIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPT 355
++ EG+ +S LKY+L+C+S+ +I + + F LIP+
Sbjct: 296 PVHTEGHTYSGRLKYLLNCHSLTVIHKLHWLENFHNVLIPS 336
>gi|313219907|emb|CBY30822.1| unnamed protein product [Oikopleura dioica]
Length = 536
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 71/306 (23%), Positives = 129/306 (42%), Gaps = 41/306 (13%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIME---AKRFAA-------LRILIVRGKLYVDPYYD 155
CP F+ + +DL + I + + E K++AA ++I KL+V+ +
Sbjct: 172 CPANFEQLDEDLAQF--DEIDLDRLYEDGKKKKWAAGGHGEATAHVVIKDQKLFVEDFGT 229
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
+ R + FL LLR+ +PD + +F+ D P+ +E+ + P P+ +C +
Sbjct: 230 IMGFRGFMSSM-FLSLLRKVK--LPDAEFIFNLGDWPL---EENLTDPQPILTWCGSSNT 283
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS----WKEKLPFAYWKGNPDVLSPL 271
DI P W ++ E KDI++ Q W EK+ Y++G +P
Sbjct: 284 SDIAVPTWD---QTKNTRHALFRERKDIQYVEQISGEVVPWNEKIERGYFRGRDS--NPS 338
Query: 272 RVELMKCNDSKLWGAEI-LRQNWAEEAKDGFKKSKLSNQCNH---------RYKIYAEGY 321
R++L C S +I R W K G K Q H +Y++ +G
Sbjct: 339 RLKL--CELSMAHPEDIDARLTWNLHNKKGQDPKKYGEQVKHVSYPEMGKFKYQVLVDGT 396
Query: 322 AWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANP 381
++ +SV L Y +++ R + P ++ PI DL ++ ++W N
Sbjct: 397 VAPYRTALLMQMDSVILKQKSMYYEWWYRYMKPWQHFIPI-EEDLS-DLREKIEWARNND 454
Query: 382 SEAEKI 387
++ +I
Sbjct: 455 EKSRQI 460
>gi|452988436|gb|EME88191.1| hypothetical protein MYCFIDRAFT_21609, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 346
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/322 (22%), Positives = 134/322 (41%), Gaps = 50/322 (15%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAK-----SRITMRHI-MEAKRFAALRILIVRGKLYV 150
+ S Q Q PD + I++ W ++T + ++ LR LI GKLY+
Sbjct: 14 LTSQQCHQAFPDLYSEIYRSQAYWRDRLGPAGKLTEENWGLKWSNDGGLRALIWEGKLYI 73
Query: 151 DPYYDCVQSRAM--FTIW------GFLQLLRRYPGM---VPDVDIMFDCMDKPVIDKKEH 199
++SR + F W + R G +P+++ DK + +
Sbjct: 74 ------IESRGLNHFLHWKERSHATLHNIQRAIIGAREPIPNIEFSIKINDKIELTNEHP 127
Query: 200 GSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEV-----NLQPWNEEFKDIKHGSQAKSWK 253
+ R ++A + PD++FW + V + Q +F D +
Sbjct: 128 NNTVWNFNRDVHDEAMKQVWLIPDFNFWAYPRVAGAYGDFQRQAIDFYD--------DYN 179
Query: 254 EKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG--AEILRQNWAEEAKDGFKKSKLSNQCN 311
K P W+G + +RV+L++ +D K W ++ EEA + + + C
Sbjct: 180 WKTPKLIWRGTTEFNPEIRVKLIEQSDGKSWSDVHKVAEDVHDEEATKW--RINMPDHCK 237
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL---IPTKNHFPIPS--ADL 366
+++ ++ EG WS LKY+LSC+S +I +Y L P +N+ +DL
Sbjct: 238 YKFAVHTEGTTWSGRLKYLLSCHSTIIIHPLRYTTHLYHLLEAEGPNQNYVRCEKDWSDL 297
Query: 367 CRSIKSVVDWGNANPSEAEKIG 388
++ ++ ANPS+A++I
Sbjct: 298 PSTMDDLL----ANPSKAKRIA 315
>gi|355698829|gb|AES00928.1| KTEL containing 1 [Mustela putorius furo]
Length = 180
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 72/152 (47%), Gaps = 8/152 (5%)
Query: 275 LMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKY 329
L+ + KL AE + + KD K L + C ++Y G A S K+
Sbjct: 11 LLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKH 70
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGK 389
+ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A++I +
Sbjct: 71 LFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLHFVKANDDVAQQIAE 128
Query: 390 AGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
G F M L M+ V Y L+TEYSK L Y
Sbjct: 129 RGSQFIMNHLQMEDVTCYWESLLTEYSKFLSY 160
>gi|325093445|gb|EGC46755.1| DUF821 domain-containing protein [Ajellomyces capsulatus H88]
Length = 458
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 106/245 (43%), Gaps = 15/245 (6%)
Query: 107 PDFFKSIHKDLEPWAK-SRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
P F+ +++ ++ W +T + + R R +I G+LYV
Sbjct: 104 PGLFEDVNRAVKYWKHLGGLTEKQLNVINLRNGMARGMINGGELYVVETRAAQDDHRRKI 163
Query: 165 IWGFLQLLRRYPG---MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+ F + R P +PD++ +F D+ + D K G L R + ++ + P
Sbjct: 164 LGIFSSIYRALPADRRGLPDIEFIFSIEDR-LDDIKGSGQPIWVLGRKASEESVW--LMP 220
Query: 222 DWSFWGWSE--VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
D+ FW W V + ++E + I+ A W K W+G+ + +R L++
Sbjct: 221 DFGFWAWHNPSVVIGTYDEVVRKIEQREDAIPWALKQRKLVWRGSLNYAPKMRRRLLEVA 280
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
K WG ++ W+ +K+ + + CN+ + + EG ++S SLKY +C SV +
Sbjct: 281 RGKTWG-DVKEIMWS--SKNNL--ISMEDHCNYMFIAHVEGRSFSSSLKYRQTCRSVVVS 335
Query: 340 ISQQY 344
Q+
Sbjct: 336 HKLQF 340
>gi|453085534|gb|EMF13577.1| hypothetical protein SEPMUDRAFT_148826 [Mycosphaerella populorum
SO2202]
Length = 475
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/180 (22%), Positives = 82/180 (45%), Gaps = 6/180 (3%)
Query: 160 RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIP 219
RA+ T+ + + +PD++ F+ D ++D+ R + +
Sbjct: 153 RALATLNAIYRAVSASSTKLPDIEFSFNVHDAALVDQDNGNQTTWAYTRLAHQETLW--L 210
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS-PLRVELMKC 278
PD+ W W +V L+ + E ++H + + +KL W+G+ DV S +R L+
Sbjct: 211 MPDFGVWAWPDVGLRSYPELQNLLEHTEE--HFHDKLSKLVWRGSLDVGSKEVRQGLVDH 268
Query: 279 NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
+ W A++ +W + + + C++++ EG +S LKY+L+C+S+ L
Sbjct: 269 SQGHDW-ADVQVLHWDNRTSIEERLLTMQDHCSYKFVAQTEGNTYSGRLKYLLNCHSILL 327
>gi|240276088|gb|EER39600.1| DUF821 domain-containing protein [Ajellomyces capsulatus H143]
Length = 458
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 104/239 (43%), Gaps = 15/239 (6%)
Query: 107 PDFFKSIHKDLEPWAK-SRITMRHIMEAK-RFAALRILIVRGKLYVDPYYDCVQSRAMFT 164
P F+ +++ ++ W +T + + R R +I G+LYV
Sbjct: 104 PGLFEDVNRAVKYWKHLGGLTEKQLNVINLRNGMARGMINGGELYVVETRAAQDDHRRKI 163
Query: 165 IWGFLQLLRRYPG---MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+ F + R P +PD++ +F D+ + D K G L R + ++ + P
Sbjct: 164 LGIFSSIYRALPADRRGLPDIEFIFSIEDR-LDDIKGSGQPIWVLGRKASEESVW--LMP 220
Query: 222 DWSFWGWSE--VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
D+ FW W V + ++E + I+ A W K W+G+ + +R L++
Sbjct: 221 DFGFWAWHNPSVVIGTYDEVVRKIEQREDAIPWALKQRKLVWRGSLNYAPKMRRRLLEVA 280
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
K WG ++ W+ +K+ + + CN+ + + EG ++S SLKY +C SV +
Sbjct: 281 RGKTWG-DVKEIMWS--SKNNL--ISMEDHCNYMFIAHVEGRSFSSSLKYRQTCRSVVV 334
>gi|157125824|ref|XP_001660800.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
gi|108882653|gb|EAT46878.1| AAEL001982-PA [Aedes aegypti]
Length = 505
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 95/213 (44%), Gaps = 14/213 (6%)
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH--GSFPLPLFRYCTNDAHFDIPFPDWSF 225
L L R+ ++PD+++ + D P++ K H + P P+F +C ++ FDI P +
Sbjct: 219 LLSLARK--AILPDMELFVNLGDWPLVTKGGHRRTTGPYPIFSWCGSEDTFDIVMPTYDI 276
Query: 226 WGWSEVNLQPWNEEFKD-IKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLW 284
E L+ N D + + WK+K P A+W+G L + + L
Sbjct: 277 ---VEATLEAMNRVTLDMLSVQRKGIPWKDKEPKAFWRGRDACRERLDLVGISQKHPDLV 333
Query: 285 GAEILRQNW--AEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALII 340
A + + EE K G K + +S + N++Y+I +G + L Y+L +SV
Sbjct: 334 NASLTNFFFFRDEEKKYGPKVAYISFFDFFNYKYQINVDGTVAAYRLPYLLGGSSVVFKQ 393
Query: 341 SQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIK 371
+Y + F L K PI +DL SI+
Sbjct: 394 DSKYYEHFYSKLEKWKEFVPIKKDLSDLVGSIE 426
>gi|340711779|ref|XP_003394446.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
1-like [Bombus terrestris]
Length = 497
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 120/261 (45%), Gaps = 17/261 (6%)
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWG 227
L L R+ ++PD++ + D P++ K+ P+F +C + DI P +
Sbjct: 217 LLSLTRKV--LLPDIEFFVNLGDWPLVPKEGKN---YPIFSWCGSFDTKDIVIPTYDI-- 269
Query: 228 WSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG 285
+E +L+ D+ G+ WKEK+ +W+G L + + L+
Sbjct: 270 -TESSLEEMGRVMLDMLSIQGNTDTPWKEKIEKVFWRGRDSRRERLDLIDISRKYPDLFN 328
Query: 286 AEILRQNWAEEAKD--GFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
I + ++ D G ++S +S + ++Y++ +G + L Y+L+ +++ L
Sbjct: 329 VAITNFXFFKDEMDKYGPEQSHVSFFHFFKYKYQLCIDGTVAAYRLPYLLAGDALLLKQE 388
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
+Y +FF L+P K++ + DL ++ ++ W + + +I K+ + F ++L
Sbjct: 389 SKYYEFFYNNLVPGKHYISV-KRDLSDLVEKIM-WAKEHDQKVLQIAKSARQFARDNLLP 446
Query: 401 DRVYDYMLHLITEYSKLLDYK 421
D V Y + L E+SK L K
Sbjct: 447 DNVLCYHVVLFHEWSKRLKSK 467
>gi|407920524|gb|EKG13715.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 381
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 77/336 (22%), Positives = 132/336 (39%), Gaps = 57/336 (16%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRF--AALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQ 170
I L P + +R +R + K F ILI G++Y+ + RA + G
Sbjct: 37 IRDHLGPNSLTRDRIRINRQTKPFPHGQFHILIFNGQIYIIDEFKGACDRAR-GLAGLSN 95
Query: 171 LLRRYPGM-----VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSF 225
L R M +PDV+ + D D P D + R N + IP +F
Sbjct: 96 LYRAITAMPDPTTIPDVEFIMDVEDAPTEDMPDD-RIVWTWNRPIDNLNTWVIP----NF 150
Query: 226 WGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFA-----------YWKGNPDVLSPLRVE 274
+GW+ S++E+LP W+G+ + + +R
Sbjct: 151 YGWASPR-----------SFIGSYVSFRERLPLVERPFKDKDRRIVWRGSMN--NEVRFA 197
Query: 275 LMKCNDSKLWG--AEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
L+ K W E QN +S C +++ + EG WS L+Y+++
Sbjct: 198 LINATTGKEWADVQETTAQN----------SMHVSELCKYQFLAHTEGNTWSGRLRYLVN 247
Query: 333 CNSVALIIS-QQYKDFFSRGLI---PTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
CNS+++I +Y+ F L+ P +N+ + A+ + +++ + NPS AE+I
Sbjct: 248 CNSISVIHQPLKYQAHFYDMLVSQGPDQNYISV--ANDWSDLAEKMEFYSRNPSVAERIA 305
Query: 389 KAGQDFMESLTMDRVYD--YMLHLITEYSKLLDYKP 422
+ M + Y LI Y+ L +KP
Sbjct: 306 NNSVNTFRDRYMTPAAEACYWRRLIRNYADTLAFKP 341
>gi|344257139|gb|EGW13243.1| KTEL motif-containing protein 1 [Cricetulus griseus]
Length = 333
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/325 (21%), Positives = 129/325 (39%), Gaps = 75/325 (23%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P + C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 62 PCSSRNCSCYRGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKKRLFRED--DCMF 118
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++RR +PD++++ + D P + K + +P+F + + D
Sbjct: 119 PSRCSGVEHFILEVIRR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHD 172
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I +P W+F W+G P V PL
Sbjct: 173 IMYPAWTF-----------------------------------WEGGPAVW-PL------ 190
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
G + L + RY G A S K++ C S+
Sbjct: 191 -------------------YPTGLGRWDLFREDLLRYLFNFRGVAASFRFKHLFLCGSLV 231
Query: 338 LIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-ME 396
+ ++ +FF L P ++ P+ + DL +++ ++ + AN A++I K G F +
Sbjct: 232 FHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDIAQEIAKRGSQFIIN 289
Query: 397 SLTMDRVYDYMLHLITEYSKLLDYK 421
L M+ + Y L+TEYSK L Y
Sbjct: 290 HLHMNDITCYWESLLTEYSKFLSYN 314
>gi|350591945|ref|XP_003132703.2| PREDICTED: protein O-glucosyltransferase 1-like [Sus scrofa]
Length = 289
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 63/125 (50%), Gaps = 3/125 (2%)
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
K K L + C ++Y G A S K++ C S+ + ++ +FF L P
Sbjct: 147 GKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGNEWLEFFYPQLKPWV 206
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYS 415
++ P+ + DL +++ ++ + AN A++I + G F + L MD + Y +L+TEYS
Sbjct: 207 HYIPVKT-DL-SNVQELLQFVKANDDIAQEIAERGSQFILNHLQMDDITCYWENLLTEYS 264
Query: 416 KLLDY 420
K L Y
Sbjct: 265 KFLSY 269
>gi|406860379|gb|EKD13438.1| DUF821 domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 447
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/304 (20%), Positives = 130/304 (42%), Gaps = 24/304 (7%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA-LRILIVRGKLYVDPYYD 155
++ Q Q P+F+ I + +E + +T++ E +R +I +LYV
Sbjct: 91 LSEAQCKQAFPNFYHEIDRAVEDRRSNHVTLKETDEMSMGEGYIRAMIYDQQLYVINAAG 150
Query: 156 CVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH 215
V +R + ++ + L P +P+++ V D E S P + Y
Sbjct: 151 DVNTRGLASLHALHRALLTSPEPLPNIEFTM-----LVADIAESAS---PRWAYSREKFM 202
Query: 216 FDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKG-NPDVLSPLRV 273
+ PD+ +W W E + + E + W K+ W+G + D+L +R
Sbjct: 203 TSLWLMPDFGYWSWPEPKIGSYGEVQMRAEQMDAKVPWSRKIDKLIWRGASMDLL--VRQ 260
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKS-KLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+L+ ++ K W A++ W ++A+ + K+ C +++ + EG ++S L+ + +
Sbjct: 261 QLVNASEGKDW-ADVKIMVWDDDAQGKTHDALKMDEHCQYKFVAHTEGVSYSARLQNLQN 319
Query: 333 CNSVALIISQ-----QYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAE 385
C SV I++ Q+ + P +N + S ++L ++ ++ + AE
Sbjct: 320 CRSV--IVAHKLKWLQHHHHLMKSSGPEQNFVEVASDFSNLDSVMQGLLGKKESGELNAE 377
Query: 386 KIGK 389
KI +
Sbjct: 378 KIAE 381
>gi|452988373|gb|EME88128.1| hypothetical protein MYCFIDRAFT_148747 [Pseudocercospora fijiensis
CIRAD86]
Length = 471
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/307 (22%), Positives = 129/307 (42%), Gaps = 29/307 (9%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHI-MEAKRFAALRILIVRGKLYV----DPYYDCVQSRA 161
P+ + I + E W + +I ++ I + A +R+L+ +L V + D + R
Sbjct: 100 PNLWTQIERSEEYWKEKKIGVQDIELFAGNDGGVRVLLSNSQLRVVRTKGLHRDDFRHRI 159
Query: 162 MFTIWGFLQLLRRYPGM---VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI 218
+ + L+ + + +PD++ D+ V ++G++ R N H +
Sbjct: 160 IAVLQQLLRAITAAEAIDEPIPDIEFTVVVDDQAVTHPDKNGAY-FAFARQYGNHRHDSL 218
Query: 219 PF-PDWSFWGW--SEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
PD+ F+G + + E+F+ KH S +K+ W+G +R L
Sbjct: 219 WITPDFHFFGAPPEAGSFREMREKFR--KHDS---PLAKKIQKVVWRGATWTNPEIREAL 273
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+ W A++ NW +D + C ++Y + EG AWS + ++L+CNS
Sbjct: 274 INETKGHAW-ADVEAMNW----EDPSSIMPMEAFCRYKYVVNTEGRAWSARMTHLLNCNS 328
Query: 336 VALIISQQYKDFFSRGLIPTKN--HFPIPSADLCRSIKSVVDWGNANPSEAEKIG-KAGQ 392
+ L+ ++ + L N H +DL I+ + + +P EA+KI A
Sbjct: 329 LLLVHDVEWIAHYYHLLDTETNCVHVARNFSDLGEKIR----YYDTHPEEAQKIADNART 384
Query: 393 DFMESLT 399
F E T
Sbjct: 385 TFRERYT 391
>gi|121712106|ref|XP_001273668.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119401820|gb|EAW12242.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 321
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 109/263 (41%), Gaps = 25/263 (9%)
Query: 140 RILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRY------PGMVPDVDIMFDCMDKPV 193
R+ I RG+LY+ C I G L + R P +PD + +F DK
Sbjct: 3 RVAIRRGQLYIIKTL-CKGEDHRRKILGILSSIHRALISAPDPAAIPDTEFIFSVEDK-- 59
Query: 194 IDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI-KHGSQAK-S 251
++ PL + + + + PD+ FW W+ N+ F I H Q +
Sbjct: 60 LEGVAGLGHPLWVLTRKATEKYVWL-MPDFGFWSWNNGNVDNMIGPFDGIVDHIRQNEIP 118
Query: 252 WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN 311
W EK+ W+G LR L++ ++ WG ++ W +A + S +
Sbjct: 119 WNEKIDKLVWRGKLSFAPKLRRTLLEAARAQPWG-DVNELEWKNKANFLSIEITASTGSS 177
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQY---KDFFSRGLIPTKNHFPIPS--ADL 366
K+ ++S SLKY +C SV +I QY + P +N+ + +DL
Sbjct: 178 RMSKL---SRSYSASLKYRQACQSVVVIHKLQYIQHHHYLLASSGPHQNYVEVERDFSDL 234
Query: 367 CRSIKSVVDWGNANPSEAEKIGK 389
++ ++D +P +A++I K
Sbjct: 235 QSKLQDLLD----HPDKAQRIAK 253
>gi|339241191|ref|XP_003376521.1| KDEL motif protein-containing protein 1 [Trichinella spiralis]
gi|316974758|gb|EFV58234.1| KDEL motif protein-containing protein 1 [Trichinella spiralis]
Length = 489
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 67/331 (20%), Positives = 136/331 (41%), Gaps = 22/331 (6%)
Query: 102 KVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--------KRFAALRILIVRGKLYVDPY 153
K + D S+++ W K I + ++E R + +R +
Sbjct: 131 KCEEFADLLSSLNESFLQWNKGAIEITKVLEEAYSMWGVHNRSHSYVHYKIRDNIIFRQT 190
Query: 154 YDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF-PLPLFRYCTN 212
+ MF+ + + R++ ++PD+D + + D P++ PLP+ +C +
Sbjct: 191 FGEYCDFKMFSDEMLVSISRKF--ILPDMDFLLNLGDWPLMTMNHLKVVSPLPILSWCGS 248
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+ DI P + + + + F + G ++ SW+EK A+W+G L
Sbjct: 249 NNSLDIVLPTYEMM--HSILRKGADNIF--VAQGWRSISWEEKENKAFWRGRDSSKERLL 304
Query: 273 VELMKCNDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLK 328
+ + L A++ + + D + + +Y+I +G + L
Sbjct: 305 LVNISRKYPDLLDAKLTHFFFFTDKVDVYGPPVHNIAMPKFFEFKYQISVDGTVAAYRLM 364
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
Y+L+ NS+ L Y + F L P ++ P+ DL I ++ W +P + + I
Sbjct: 365 YLLAGNSIILKQDSIYYEHFYPLLKPWVHYVPV-KRDLSDLIDQIL-WSMNHPDQVKTII 422
Query: 389 KAGQDFMES-LTMDRVYDYMLHLITEYSKLL 418
K Q+F+ S LT Y Y+ + +Y+++L
Sbjct: 423 KNAQNFVNSYLTPRATYCYLADVFKKYAEIL 453
>gi|258564314|ref|XP_002582902.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908409|gb|EEP82810.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 485
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 98/221 (44%), Gaps = 23/221 (10%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSE--VNLQP 235
P+++ F DK +D PL L R + ++ + +P D+ FW W N+ P
Sbjct: 107 APNIEFAFSIEDK--VDDVSGAGHPLWVLARKASEESVWLMP--DFGFWAWGNPASNIGP 162
Query: 236 WNEEFKDIKHGSQAKS--WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW 293
+++ + I+ ++ W K P W+G LR L+ SK WG ++ W
Sbjct: 163 YDQVVQHIERFDSEETLPWTSKNPKLIWRGKLSFAPKLRRGLLDAARSKPWG-DVKELIW 221
Query: 294 AEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFS---R 350
+ K F + + C + + + EG ++S SLKY +C SV + QY F +
Sbjct: 222 GK--KHHF--VSMEDHCRYMFIAHVEGRSFSSSLKYRQACRSVVVAHKLQYIQHFHYLLQ 277
Query: 351 GLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
P +N + +DL ++ ++ A+P +A +I +
Sbjct: 278 SGGPHQNFVEVERDFSDLSEKMEELL----ADPEKARRIAE 314
>gi|452843611|gb|EME45546.1| hypothetical protein DOTSEDRAFT_71301 [Dothistroma septosporum
NZE10]
Length = 466
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 83/185 (44%), Gaps = 19/185 (10%)
Query: 254 EKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
+K+P W+GN + +R L++ K W A++ +W E + + + CN+
Sbjct: 260 KKIPKIVWRGNTG-FNSVRPALLELTAGKEW-ADMKNVDWTAETQ--VNRIHQDSYCNYS 315
Query: 314 YKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPT---KNHFPIPS--ADLCR 368
+ EG +S LKY+L+CNS+ + ++ ++ L P +N+ P+ DL
Sbjct: 316 MTVNTEGITYSGRLKYLLNCNSLTFVHDMEWTTYWYHLLEPQGADQNYVPVSRDWHDLEE 375
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD--YMLHLITEYSKLL----DYKP 422
++ + +P EAEKI + S + R + Y LI Y+ + YKP
Sbjct: 376 KVRHYL----GHPDEAEKIIRNSISTFRSKYLTRAAESCYTRQLIQSYASVTYTPEVYKP 431
Query: 423 APPSS 427
P S
Sbjct: 432 DKPGS 436
>gi|260827316|ref|XP_002608611.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
gi|229293962|gb|EEN64621.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
Length = 513
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 73/325 (22%), Positives = 128/325 (39%), Gaps = 28/325 (8%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL---IVRGKLYVDPYYDCVQSRAM 162
CP F I +DLE + K + RF L V+ + M
Sbjct: 162 CPSAFPQIERDLEIFPKINLNRLSKEAVDRFGTHHSLCHYTVKDNKIHRKCHGQHTGFKM 221
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
F + R+ +PD++ + D P ++K++ PLP+ +C ++ DI P
Sbjct: 222 FMDATLHSITRKV--RIPDIEFFVNLGDWP-LEKRQVKDGPLPILSWCGSEETRDIVMPT 278
Query: 223 WSFWGWSEVNLQPWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLR-VELMK-- 277
+ +E L+ D+ G+ W K A W+G L V+L +
Sbjct: 279 YDL---TESTLETMGRVSLDMLSVQGNTGPRWVNKTEQALWRGRDSRRERLNLVDLGRKY 335
Query: 278 --CNDSKLWGAEILRQNWAEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSC 333
D+ L R EEAK G K +S + ++Y++ +G + L Y+L+
Sbjct: 336 PDLIDAALTNFFFFRD---EEAKYGPKVQHISFFDFFKYKYQLNIDGTVAAYRLPYLLAG 392
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAG 391
+S Y + F L P ++ P DL I+ W N +A +I + G
Sbjct: 393 DSAVFKHESVYYEHFYSDLEPYVHYIPFRKDLTDLVPKIR----WAKRNDDDARQIAENG 448
Query: 392 QDFM-ESLTMDRVYDYMLHLITEYS 415
+++ ++L + ++ Y L EY+
Sbjct: 449 REYARKNLLANSIFCYYERLFREYA 473
>gi|361123878|gb|EHK96023.1| putative protein O-glucosyltransferase 1 [Glarea lozoyensis 74030]
Length = 425
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 69/310 (22%), Positives = 131/310 (42%), Gaps = 40/310 (12%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHI-MEAKRFAALRILIVRGKLYVDPYY----DCVQ 158
+ C F+ + ++E + R T+ + +E ++ + + G LY+ + D +
Sbjct: 59 EQCTTAFRGLFTEIERAVRYRKTIGQVRVEEIDISSEKDVATGGMLYIIEAHISLGDHPR 118
Query: 159 SRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMD------KPVIDKKEHGSFPLPLFRYCTN 212
RA+ T+ + L P +P+ + F D +P+ L R T
Sbjct: 119 MRALATLNAIHRALLTSPVPIPNSEFSFSVNDILAPSPRPI----------WALTRLATE 168
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+ D+ +W WS + + E I K ++EK+P W+G+ + R
Sbjct: 169 PEKW--VMSDFGYWAWSRALMGGYEEMRAQI--AELEKPFEEKIPKVVWRGDRKN-NVNR 223
Query: 273 VELMKCNDSKLWGAEILRQNWAE----EAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
V+L+ + K W A++ W + D + C +++ I EG+++S K
Sbjct: 224 VKLIAEAEGKEW-ADVRDIQWRDARHVTGYDADSTITVPEHCLYQFVIQTEGWSYSGRGK 282
Query: 329 YILSCNSVALI-----ISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
Y+ +CNSV +I I Q+ S G P +N I + +++V++ ANP +
Sbjct: 283 YLQNCNSVVIIPKRIWIEPQHALIVSSG--PLQNFVEIE--EDYSDLETVMEELLANPEK 338
Query: 384 AEKIGKAGQD 393
A+ I + G D
Sbjct: 339 AKMIAQNGVD 348
>gi|170033685|ref|XP_001844707.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
gi|167874675|gb|EDS38058.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
Length = 495
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/207 (23%), Positives = 95/207 (45%), Gaps = 20/207 (9%)
Query: 178 MVPDVDIMFDCMDKPVIDKKEH--GSFPLPLFRYCTNDAHFDIPFPDW-----SFWGWSE 230
++PD+++ + D P++ K H + P P+F +C ++ FDI P + S S
Sbjct: 217 LLPDMELFVNLGDWPLVTKGGHRRTTGPYPIFSWCGSEDTFDIVMPTYDLVEASLEAMSR 276
Query: 231 VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILR 290
V+L + + K + W+EK+P A+W+G L + + L A +
Sbjct: 277 VSLDMLSVQRKGV-------PWEEKVPKAFWRGRDACRERLDLVGLSQQHPDLVNASLTN 329
Query: 291 QNW--AEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKD 346
+ EE K G K + +S + +++Y++ +G + Y+L +SV + +Y +
Sbjct: 330 FFFFRDEEKKYGPKVAHISFFDFFDYKYQVNVDGTVAAYRFPYLLGGSSVVFKQASKYYE 389
Query: 347 FFSRGLIPTKNHFPIPS--ADLCRSIK 371
F L + + P+ +DL +I+
Sbjct: 390 HFYSKLEQGREYLPLKRDLSDLIENIQ 416
>gi|221486215|gb|EEE24485.1| thioredoxin, putative [Toxoplasma gondii GT1]
Length = 1239
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 110/249 (44%), Gaps = 21/249 (8%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNE 238
+PDV+ + + D P ++K+ LPLF + +D DI P W S +
Sbjct: 285 LPDVEFLMNLGDWP-LEKRGADEGALPLFSWSGSDDTLDIILPQWDVVKTSTA-FGKSDP 342
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW----- 293
+ ++ GS K ++P A ++G +P+RV+L + + ++ +W
Sbjct: 343 DLLTVQAGSLVPLAK-RIPKALFRGRDS--NPVRVKLAELARAHSDLLDVAITSWENDTH 399
Query: 294 AEEAK---DGFK-KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFS 349
AE+ K G+K + L +RY++ +G + Y+L S+ L +Y ++F
Sbjct: 400 AEQEKKLGGGYKARIPLEKFGEYRYQLLVDGTVAAFRTPYLLMTGSLPLKHESRYYEWFY 459
Query: 350 RGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMESLTMDRVYDY 406
L ++ P S +DL +K W +P EA+ I +A Q E L ++++ Y
Sbjct: 460 ADLEAGVHYLPFKSDLSDLVDQLK----WAEQHPVEAQAIADRARQYAQEHLAPNKIFCY 515
Query: 407 MLHLITEYS 415
+ Y+
Sbjct: 516 YFQALEAYA 524
>gi|383856972|ref|XP_003703980.1| PREDICTED: KDEL motif-containing protein 1-like [Megachile
rotundata]
Length = 496
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 113/248 (45%), Gaps = 15/248 (6%)
Query: 178 MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWN 237
++PD++ + D P++ K P+F +C + DI P + +E +L+
Sbjct: 224 ILPDIEFFVNLGDWPLVAKNGKN---YPIFSWCGSYDTKDIVMPTYDI---TESSLEAMG 277
Query: 238 EEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW-- 293
DI G+ W++K+ +W+G L + + L+ I +
Sbjct: 278 RVMLDILSVQGNTDTPWEKKIDKLFWRGRDSCRERLDLIDISRKYPDLFNVSITNFFFFK 337
Query: 294 AEEAKDGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG 351
E K G +S +S N ++Y++ +G + Y+L+ +++ L +Y +FF
Sbjct: 338 GEMDKYGPGQSHVSFFNFFKYKYQLNIDGTVAAYRFPYLLAGDALVLKQESKYYEFFYND 397
Query: 352 LIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHL 410
L+ K++ P+ S DL ++ +V W + EA ++ K+ + F+ ++L + Y + L
Sbjct: 398 LVSGKHYIPVKS-DLSDLVEQIV-WAKNHDEEALQVIKSARQFVRDNLLPHNILCYHVAL 455
Query: 411 ITEYSKLL 418
E+SK L
Sbjct: 456 FYEWSKRL 463
>gi|401404668|ref|XP_003881784.1| hypothetical protein NCLIV_015430 [Neospora caninum Liverpool]
gi|325116198|emb|CBZ51751.1| hypothetical protein NCLIV_015430 [Neospora caninum Liverpool]
Length = 1915
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 72/304 (23%), Positives = 124/304 (40%), Gaps = 39/304 (12%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNE 238
+PDV+ + D P ++K+ LPLF + +D FDI P W S V L
Sbjct: 284 LPDVEFFMNLGDWP-LEKRNADEGGLPLFSWSGSDDTFDIILPQWDVAKTSTVGLGKSQP 342
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK-CNDSKLWGAEIL---RQNWA 294
+ I+ S + +++P A ++G + LRV+L + + I +A
Sbjct: 343 DLLTIQARS-GEPLAKRIPKALFRGRDS--NSLRVKLAELAQKHDILDVAITSWENDTYA 399
Query: 295 EEAK--DGFKKSK--LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSR 350
E+ K G KS+ L ++Y++ +G Y+L S+ L +Y ++F
Sbjct: 400 EQEKKLGGGYKSRIPLEKFGEYKYQLLVDGSVAPFRTPYLLMTGSLPLKHESRYYEWFYG 459
Query: 351 GLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMESLTMDRVYDYM 407
L ++ P + +DL +K W +P EA+ I +A Q E L ++++ Y
Sbjct: 460 DLKAGVHYLPFKNDLSDLVDQLK----WAEEHPVEAQAIADRARQYAQEHLVPNKIFCYY 515
Query: 408 LHLITEYSKLLDYKPA--------PPSSAFEACVESLLCLADPKQRQNLEKAAASPSPYP 459
+ Y+ P P++A C C A+P++ N PYP
Sbjct: 516 FQALEVYASRQKGTPTVADDMVKVDPTTAPPRCA----CEAEPRKDAN--------RPYP 563
Query: 460 PCTL 463
L
Sbjct: 564 LVQL 567
>gi|388856815|emb|CCF49602.1| uncharacterized protein [Ustilago hordei]
Length = 528
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 76/335 (22%), Positives = 126/335 (37%), Gaps = 54/335 (16%)
Query: 82 LTCLSAMNPIPERRRVASPQKVQTC----PDFFKSIHKDLEPW-AKSRITMRHIMEAKR- 135
+T + P R R SP TC P F + ++ W AK I+ + EA R
Sbjct: 87 VTTPQDLTPAQRRDRTLSPT---TCRREFPLLFPQLDANVAAWKAKGSISYTDLDEAART 143
Query: 136 ----FAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPD-VDIMFDCMD 190
+ R++I G+LY+ + +SR I L LL P D
Sbjct: 144 CVDRWGMARVVIRDGQLYLRQVREGGESR----ISALLHLLHTAISTDPSSFSSGSDTGI 199
Query: 191 KPVIDKKEHGSFPLPLFRYCTNDAHFDIP-------FPDWSFWGWSEVNLQPWNEEFKDI 243
+ V+ + + + P + + D P PD+ F GW E + F +
Sbjct: 200 ELVLSEADKEASPSSNLIWVLSK-RVDEPASKGTWLLPDFGFIGWPEAGIA----SFDEF 254
Query: 244 KHGSQAKS----WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKD 299
H Q + W K A W+G + +P L + + K+ GAE WA+ +
Sbjct: 255 THLGQLQDHLVPWHAKGDKALWRGLANGYAPRMDLLARTDPRKVKGAE----EWADVKQT 310
Query: 300 GFKKSK--------LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG 351
F + C H+Y I EG ++S K++ C SV + ++ F
Sbjct: 311 SFHDVGEEFHPLIPMHEHCRHKYLIQTEGNSYSGRGKFLWICRSVTISHPLEWTQHFHPA 370
Query: 352 L-----IPTKNHFPIPS---ADLCRSIKSVVDWGN 378
L P +N +P + L +I+ + +W +
Sbjct: 371 LNSNPQSPRQNFVELPGPLFSGLEETIRELQEWAS 405
>gi|237834397|ref|XP_002366496.1| thioredoxin, putative [Toxoplasma gondii ME49]
gi|211964160|gb|EEA99355.1| thioredoxin, putative [Toxoplasma gondii ME49]
Length = 1378
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 110/249 (44%), Gaps = 21/249 (8%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNE 238
+PDV+ + + D P ++K+ LPLF + +D DI P W S +
Sbjct: 285 LPDVEFLMNLGDWP-LEKRGADEGALPLFSWSGSDDTLDIILPQWDVVKTSTA-FGKSDP 342
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW----- 293
+ ++ GS K ++P A ++G +P+RV+L + + ++ +W
Sbjct: 343 DLLTVQAGSLVPLAK-RIPKALFRGRDS--NPVRVKLAELARAHSDLLDVAITSWENDTH 399
Query: 294 AEEAK---DGFK-KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFS 349
AE+ K G+K + L +RY++ +G + Y+L S+ L +Y ++F
Sbjct: 400 AEQEKKLGGGYKARIPLEKFGEYRYQLLVDGTVAAFRTPYLLMTGSLPLKHESRYYEWFY 459
Query: 350 RGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMESLTMDRVYDY 406
L ++ P S +DL +K W +P EA+ I +A Q E L ++++ Y
Sbjct: 460 ADLEAGVHYLPFKSDLSDLVDQLK----WAEQHPVEAQAIADRARQYAQEHLAPNKIFCY 515
Query: 407 MLHLITEYS 415
+ Y+
Sbjct: 516 YFQALEAYA 524
>gi|340939398|gb|EGS20020.1| hypothetical protein CTHT_0045170 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 490
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 91/232 (39%), Gaps = 20/232 (8%)
Query: 130 IMEAKRFAALRILIVRGKLYV-------DPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDV 182
+ + F L+ I GKLY+ D + V SR T+ + L P + D
Sbjct: 134 VKDTGEFGPLQGRIKDGKLYILHTQRRKDLSQEMVNSRTA-TLHSLHRALLTSPSPIKDT 192
Query: 183 DIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKD 242
+ +D+P + P +A + P +SFW W + + +
Sbjct: 193 IFTLNFLDQPFGTCLTYSRPADPSLHSPDPNARLFL-MPHFSFWAWPLPFIGSMSRAMEA 251
Query: 243 IKH------GSQAKSWKEKLPFAYWKGNP---DVLSP-LRVELMKCNDSKLWGAEILRQN 292
I W+ K+ A W+G V SP +R +L++ K W A++ +
Sbjct: 252 ITRLEEGPEFGGEGGWQRKVSKAVWRGTTWFSSVTSPGMRAKLVQTTKDKPW-ADVEALD 310
Query: 293 WAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
W + K K+ + C ++Y I+ EG +S +Y+ C SV L Q+
Sbjct: 311 WDGKGKSARNGLKIEDFCRYKYVIHTEGVTYSGRFQYLQQCASVVLTPPIQW 362
>gi|221501494|gb|EEE27268.1| thioredoxin, putative [Toxoplasma gondii VEG]
Length = 1378
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 110/249 (44%), Gaps = 21/249 (8%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNE 238
+PDV+ + + D P ++K+ LPLF + +D DI P W S +
Sbjct: 285 LPDVEFLMNLGDWP-LEKRGADEGALPLFSWSGSDDTLDIILPQWDVVKTSTA-FGKSDP 342
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW----- 293
+ ++ GS K ++P A ++G +P+RV+L + + ++ +W
Sbjct: 343 DLLTVQAGSLVPLAK-RIPKALFRGRDS--NPVRVKLAELARAHSDLLDVAITSWENDTH 399
Query: 294 AEEAK---DGFK-KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFS 349
AE+ K G+K + L +RY++ +G + Y+L S+ L +Y ++F
Sbjct: 400 AEQEKKLGGGYKARIPLEKFGEYRYQLLVDGTVAAFRTPYLLMTGSLPLKHESRYYEWFY 459
Query: 350 RGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMESLTMDRVYDY 406
L ++ P S +DL +K W +P EA+ I +A Q E L ++++ Y
Sbjct: 460 ADLEAGVHYLPFKSDLSDLVDQLK----WAEQHPVEAQAIADRARQYAQEHLAPNKIFCY 515
Query: 407 MLHLITEYS 415
+ Y+
Sbjct: 516 YFQALEAYA 524
>gi|398405632|ref|XP_003854282.1| hypothetical protein MYCGRDRAFT_24531, partial [Zymoseptoria
tritici IPO323]
gi|339474165|gb|EGP89258.1| hypothetical protein MYCGRDRAFT_24531 [Zymoseptoria tritici IPO323]
Length = 376
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 51/245 (20%), Positives = 114/245 (46%), Gaps = 16/245 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAK--SRITMRHIMEAKRF-AALRILIVRGKLYVDPYYDCV 157
Q + PDF+K I + ++ K +I + RF +R +I +LY+ +
Sbjct: 18 QCLAAFPDFYKEIDRAIQYRKKVAGKIKPEELEVDWRFDGMMRAMIHDNQLYIIDAHGLA 77
Query: 158 ----QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE-HGSFPLPLFRYCTN 212
++RA+ T+ + L +PD++ F D + + H ++ + T+
Sbjct: 78 PPNHRARAIATLNAIQRALTSSAIPLPDIEFTFSVHDDAHTSEDDTHTTWAYSRKAHQTS 137
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLS-PL 271
PD+ W W +VN++ ++E + + +K+P W+G+ V S +
Sbjct: 138 L----WLMPDFGLWAWPDVNIRSYSELRTQL--ALSESHFLDKIPKLVWRGSLAVGSHDV 191
Query: 272 RVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYIL 331
R L++ ++ W ++++ +W++++ + +S+ C + + EG +S LK++L
Sbjct: 192 RAGLVEHAANQPW-SDVMELDWSDKSNINSRLLSMSDHCEYMFVAQTEGNTYSGRLKFLL 250
Query: 332 SCNSV 336
+C+S+
Sbjct: 251 NCHSI 255
>gi|47210984|emb|CAF95403.1| unnamed protein product [Tetraodon nigroviridis]
Length = 505
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/331 (21%), Positives = 125/331 (37%), Gaps = 54/331 (16%)
Query: 133 AKRFAALRILIVRGKLY---VDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
A R + +V +LY + Y D MF+ L L R+ VPDV+ +
Sbjct: 162 ANRGGLVHYAVVANRLYRRTLGKYTDF----KMFSDEMLLSLTRKV--RVPDVEFYINVG 215
Query: 190 DKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQA 249
D P+ + + LP+ +C + DI P + + ++ + ++ G+
Sbjct: 216 DWPL---ETRATDALPILSWCGSTDTRDIVLPTYDVTHSTLETMRGVTNDLLSVQ-GNTG 271
Query: 250 KSWKEKLPFAYWKG----------------NPDVLSP-------LRVELMKCNDSKLWGA 286
W K A+++G NP++L R E + + L G
Sbjct: 272 PPWMNKTARAFFRGRDSREERLHLVSISKKNPELLDAGITAWFFFRDEEKRVGKAPLVGF 331
Query: 287 ----EILRQNWAEEAKDGFKKS-------KLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
++ +Q E + S K + H+Y++ +G + Y+L NS
Sbjct: 332 FDFFKVRQQERTSEEPLSLQPSLVFVVFLKFAFAFQHKYQVNVDGTVAAYRFPYLLLGNS 391
Query: 336 VALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+ L Y +FF L ++ P+ +DL IK W N AEK+ AGQ
Sbjct: 392 LVLKQDSPYYEFFYGHLEAGTHYLPVKRDLSDLLDQIK----WAKENDGRAEKMAAAGQA 447
Query: 394 FM-ESLTMDRVYDYMLHLITEYSKLLDYKPA 423
E L R+Y Y ++ Y++ +P
Sbjct: 448 LARELLRPGRLYCYYYRVLRAYAERQTGRPT 478
>gi|449513299|ref|XP_002199044.2| PREDICTED: KDEL motif-containing protein 1-like, partial
[Taeniopygia guttata]
Length = 214
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 67/154 (43%), Gaps = 27/154 (17%)
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRS 369
++Y+I +G + L Y+L+ NSV L Y + F L P K++ P S +DL
Sbjct: 80 YKYQINIDGTVAAYRLPYLLAGNSVVLKQDSIYYEHFYNELQPWKHYIPFKSDLSDLLEK 139
Query: 370 IKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSA 428
++ W + EA+ I K+GQ+F +L D ++ Y L EY+
Sbjct: 140 LQ----WAKDHDEEAKNIAKSGQEFARNNLMGDHIFCYYFKLFQEYAG------------ 183
Query: 429 FEACVESLLCLADPKQRQNLEKAAASPSPYPPCT 462
L +++PK R +EK PCT
Sbjct: 184 --------LQVSEPKIRDGMEKVQQPDDDLFPCT 209
>gi|159488937|ref|XP_001702457.1| predicted protein [Chlamydomonas reinhardtii]
gi|158271125|gb|EDO96952.1| predicted protein [Chlamydomonas reinhardtii]
Length = 367
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 67/316 (21%), Positives = 125/316 (39%), Gaps = 45/316 (14%)
Query: 108 DFFKSIHKDLEPW-------AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
D + LE W A +R +E A + I G+++V + QS+
Sbjct: 59 DLLGPLAVRLEAWRAAGGVNASEGKRLRQEIEGGGEGAF-VSIRNGQVHVLVKREGFQSK 117
Query: 161 AMFTIWGFLQLLRRYP-GMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIP 219
+ +L+ R+ +PD++ G P P C ++P
Sbjct: 118 TYGALMLLHRLVSRFGRKHLPDMEFGI-----------HRGDVPKPGAWMCFCGRRGELP 166
Query: 220 ----FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFA------YWKGNPD--V 267
+PD+ ++ W E+ + PW + I+ +Q E+ PFA +W+G +
Sbjct: 167 GTWLYPDFGYYAWPEIMMPPW----EAIRQRTQEVV--ERWPFAARSNKMFWRGGAGKHI 220
Query: 268 LSPLRVELMKCNDSKLWGAEILR----QNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAW 323
+ +R +L++ +++ A++ + GF + L + C H+Y +Y EG ++
Sbjct: 221 NTDVRGKLLRALENRTDIADVQAIPPFDVLLQRGLQGF--TPLWDFCKHKYILYTEGNSY 278
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S LKY + C SV + ++Y S L+ H + + + NP
Sbjct: 279 SGRLKYHVLCGSVIISHPRKYDTMLS-ALMREGRHIVTTVDNEWSDVAQIHSRLETNPQL 337
Query: 384 AEKIGKAGQDFMESLT 399
AE IG++ LT
Sbjct: 338 AESIGRSTALLANQLT 353
>gi|302508970|ref|XP_003016445.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
gi|291180015|gb|EFE35800.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
Length = 429
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 109/258 (42%), Gaps = 22/258 (8%)
Query: 99 SPQKVQTC-PDFFKSIHKDLEPW-AKSRITMRHI--MEAKRFAALRILIVRGKLYV---D 151
SP + +T P F+ ++ W ++ I+ H+ +E K A R +I RG+LY+
Sbjct: 68 SPLQCKTAFPGLFEDPYRAAAFWDSQFGISAEHLDKIELKNGMA-RAIIHRGELYIVATK 126
Query: 152 PYYDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVIDKKEHGSFPL-PLFR 208
+ + + + + L G P ++ +F D+ ID P+ L R
Sbjct: 127 AAQEDHRKKILAVLSSMHHALSATAGNRTQPSIEFVFSIEDR--IDDISGSGHPIWALSR 184
Query: 209 YCTNDAHFDIPFPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPD 266
+ ++ + IP D+ FW W + P+N+ I+ W K W+G
Sbjct: 185 KASEESIWLIP--DFGFWAWDNAANAIGPYNQVVDRIQRQEVTVPWSSKENKLVWRGKLS 242
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
LR L+ WG ++ W+E K F + + C +++ + EG A+S S
Sbjct: 243 FAPKLRRNLLDIARDTSWG-DVKEIVWSE--KQNF--ISMDDHCKYKFIAHVEGRAYSSS 297
Query: 327 LKYILSCNSVALIISQQY 344
LKY +C SV + Q+
Sbjct: 298 LKYRQACRSVVIAHKLQF 315
>gi|198431555|ref|XP_002127452.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 495
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 60/304 (19%), Positives = 123/304 (40%), Gaps = 17/304 (5%)
Query: 102 KVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRF----AALRILIVRGKLYVDPYYDCV 157
K CP ++ I +DL+P ++ +RF + +V ++Y + +
Sbjct: 142 KALQCPPTYRQIEEDLKPHKVIELSTFAKRIVERFTPHHSVCHYAVVNNQVYRQCH-GSI 200
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
MF L L R+ +PD + + D P+ +K H PLP+ +C +D D
Sbjct: 201 TDFKMFMDAPLLSLTRKVK--LPDFEFFINLGDWPL--EKSHDD-PLPIISWCGSDGTHD 255
Query: 218 IPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
I P + L + + ++ + W +K+ +++G L + M
Sbjct: 256 IILPTYDITNSVLEMLGRVSLDMFSVQ-ANTGPRWGKKIAKGFFRGRDSRQERLDLASMS 314
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKS----KLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+ L A I + ++ + + KS + H+Y++ +G + Y+L
Sbjct: 315 VKNPDLIDAAITNYFFFKKDETKYGKSVKPISFFDFFKHKYQLNIDGTVAAYRFPYLLVG 374
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+++ +Y + F + L P K++ P+ DL + V W N +A +I +A +
Sbjct: 375 DALVFKQESEYYEHFYKDLEPWKHYVPLKH-DLS-DVMEQVKWARKNEKKAREIQRAATE 432
Query: 394 FMES 397
+ +
Sbjct: 433 YART 436
>gi|453088954|gb|EMF16994.1| hypothetical protein SEPMUDRAFT_129834 [Mycosphaerella populorum
SO2202]
Length = 440
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/277 (22%), Positives = 117/277 (42%), Gaps = 43/277 (15%)
Query: 91 IPERRRVASPQKVQTC----PDFFKSIHKDLEPWAK-----SRITMRHI-MEAKRFAALR 140
+ ER R + C PD ++ I + W ++T + ++ LR
Sbjct: 56 VAERDRNDHSLTLAQCNRAFPDLYREIDRSRAFWENRLGPGGKLTQDNWGLKWSNDGGLR 115
Query: 141 ILIVRGKLYVDPYYDCVQSRAM--FTIW------GFLQLLRRYPGM---VPDVDIMFDCM 189
+I GKLYV ++SR + F W + R G VP+++
Sbjct: 116 AMIWEGKLYV------IESRGLNHFLHWKERSHATLHAIQRAIIGAREPVPNIEFSIKIN 169
Query: 190 DKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEV-----NLQPWNEEFKDI 243
D+ + ++ + R T+ A + PD++FW + V + Q +F D
Sbjct: 170 DRIDLTEEHPNTTVWNFNRDITDKAMEQVWLIPDFNFWAYPRVAGAYGDYQRQAIDFYD- 228
Query: 244 KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG-AEILRQNWAEEAKDGFK 302
+ KLP W+G + +R +L++ ++ K W + ++ +E ++
Sbjct: 229 -------DYNNKLPKLVWRGTTEFNPEIRQKLIEQSEGKAWSDVHKVAEDVHDEEATKWR 281
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
S + + C +++ ++ EG +WS LKY+LSC+S +I
Sbjct: 282 IS-MPDHCKYKFAVHTEGTSWSGRLKYLLSCHSTIII 317
>gi|408376217|ref|ZP_11173822.1| lipopolysaccharide-modifying protein [Agrobacterium albertimagni
AOL15]
gi|407749684|gb|EKF61195.1| lipopolysaccharide-modifying protein [Agrobacterium albertimagni
AOL15]
Length = 335
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 84/181 (46%), Gaps = 12/181 (6%)
Query: 252 WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFK-KSKLS--N 308
W +K P A W+G + +PLRVEL++ + +E EAKDG + K LS
Sbjct: 150 WAKKKPMAVWRGALN--NPLRVELLRRHQQ----SEFADVGHIGEAKDGLQAKGFLSPQE 203
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
Q +RY + EG+ + +LK+I++ SV ++ +Y+ +F G + +HF D
Sbjct: 204 QMGYRYILSIEGFDVATNLKWIMASRSVCVMPKGRYETWFMEGGLIPDHHFVEVRPDFS- 262
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSA 428
++ + NP A+ I ++ S DR + + L+ K + P SA
Sbjct: 263 DLEDKIAELERNPQWAQDIVANANAYVASFC-DRKTERKISLLV-LQKYFEATGQLPPSA 320
Query: 429 F 429
F
Sbjct: 321 F 321
>gi|119491757|ref|XP_001263373.1| hypothetical protein NFIA_066400 [Neosartorya fischeri NRRL 181]
gi|119411533|gb|EAW21476.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 248
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/177 (22%), Positives = 84/177 (47%), Gaps = 18/177 (10%)
Query: 220 FPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
PD+ FW W ++ + P+++ + ++ + W +K W+G LR L++
Sbjct: 22 MPDFGFWSWGHLDSKIGPYDQVVEHVRQ--RDVPWDQKRDKLVWRGKLSFALKLRRTLLE 79
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
WG ++ W+ +A + C++++ + EG ++S SLKY +C SV
Sbjct: 80 VARGYPWG-DVREVEWSNKAN----FLSMEEHCDYKFIAHVEGRSYSASLKYHQACQSVV 134
Query: 338 LIISQQYKDFFSRGLIPT---KNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
+I QY L+ + +N + +DL + ++ ++D NP++A++I +
Sbjct: 135 VIHKLQYIQHHHYLLVSSGSQQNFVQVERDFSDLPQKMQELLD----NPAKAQRIAR 187
>gi|238502335|ref|XP_002382401.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
gi|220691211|gb|EED47559.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
Length = 413
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/179 (24%), Positives = 80/179 (44%), Gaps = 20/179 (11%)
Query: 220 FPDWSFWGWS----EVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
PD+ FW W + + P+ + I+ + W+EK W+G + +R L
Sbjct: 181 MPDFGFWAWEHGKVDSKIGPYTQVVDRIRQ--KDVPWEEKEAKLVWRGKLSFAAKMRRAL 238
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
++ ++ W A++ W KD + C +R+ + EG ++S SLKY +C S
Sbjct: 239 LEAARNQPW-ADVKEVVW----KDKTNFISMEEHCRYRFIAHVEGRSYSASLKYRQACGS 293
Query: 336 VALIISQQYKDFFSRGLI---PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGK 389
V + QY L+ P +N + +DL +++++D +P+ AE+I
Sbjct: 294 VVVAHKLQYIQHHHYLLVSSGPDQNFVEVERDFSDLPAKMQTLLD----DPASAERIAN 348
>gi|71022133|ref|XP_761297.1| hypothetical protein UM05150.1 [Ustilago maydis 521]
gi|46097791|gb|EAK83024.1| hypothetical protein UM05150.1 [Ustilago maydis 521]
Length = 1871
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 117/300 (39%), Gaps = 51/300 (17%)
Query: 93 ERRRVASPQKVQTC----PDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAA-----LRIL 142
ERRR + + TC P F I ++ W AK I+ + +A R A RI+
Sbjct: 1429 ERRRDRTMNEA-TCKAEFPLLFPQIQDNVAAWQAKGGISYTDLDQAARTCAGNWGMARIV 1487
Query: 143 IVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKP-------VID 195
I G+L++ + +SR I L L+ P F D P V+
Sbjct: 1488 IRDGQLFLRQVREGGESR----ISALLHLVHTAVTTDPS---SFTSADDPSNTGVELVLS 1540
Query: 196 KKEHGSFPLPLFRYCTNDAHFDIP-------------FPDWSFWGWSEVNLQPWNEEFKD 242
+ + + P T+DA + + PD+ F GW E + + E
Sbjct: 1541 EADKDASP-------TSDAVWVLSKRTSEPKSKGTWLLPDFGFAGWPETGIASFAEFLHL 1593
Query: 243 IKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCND-SKLWGAE----ILRQNWAEEA 297
W K W+G + +P RV+L+ D K+ GAE +L+ ++ +
Sbjct: 1594 ASLQDHLVPWSHKADRVLWRGLANGYAP-RVDLISRTDPRKVPGAEKWADVLQTSFHDVG 1652
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKN 357
D + + C H++ + EG ++S K++ SC SV + ++ F L N
Sbjct: 1653 DDFHPIIPMHHHCRHKFLVQTEGNSYSGRGKFLWSCRSVTVAHPMEWTQHFHAALNSNAN 1712
>gi|34531642|dbj|BAC86191.1| unnamed protein product [Homo sapiens]
Length = 409
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 116/281 (41%), Gaps = 13/281 (4%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QK +CP I KD + + KRF R IV + + Y +
Sbjct: 95 QKTLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGK 154
Query: 161 ----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTNDAH 215
MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 155 YTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSLDS 212
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 213 RDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQLVQ 271
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYILS 332
+ + +L A I + +E + K+KL + ++Y++ +G + Y++
Sbjct: 272 LSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLML 331
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIK 371
+S+ L Y + F L P K++ PI +DL +K
Sbjct: 332 GDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK 372
>gi|302661485|ref|XP_003022410.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
gi|291186353|gb|EFE41792.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
Length = 437
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 109/258 (42%), Gaps = 22/258 (8%)
Query: 99 SPQKVQTC-PDFFKSIHKDLEPW-AKSRITMRHI--MEAKRFAALRILIVRGKLYV---D 151
SP + +T P F+ ++ W ++ I+ H+ +E K A R +I RG+LY+
Sbjct: 76 SPLQCKTAFPGLFEDPYRAAAFWDSQPGISTEHLDKIELKNGMA-RAIIHRGELYIVATQ 134
Query: 152 PYYDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVIDKKEHGSFPL-PLFR 208
+ + + + + L G P ++ +F D+ +D P+ L R
Sbjct: 135 AAQEDHRKKILAVLSSMHHALSATAGNRTQPSIEFVFSIEDR--VDDISGTGHPIWALSR 192
Query: 209 YCTNDAHFDIPFPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPD 266
+ ++ + IP D+ FW W + P+N+ I+ W K W+G
Sbjct: 193 KASEESIWLIP--DFGFWAWDNAANAIGPYNQVVDRIQRQEVTVPWSSKENKLVWRGKLS 250
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
LR L+ WG ++ W+E K F + + C +++ + EG A+S S
Sbjct: 251 FAPKLRRNLLDIARDTSWG-DVKEIVWSE--KQNF--ISMDSHCKYKFIAHVEGRAYSSS 305
Query: 327 LKYILSCNSVALIISQQY 344
LKY +C SV + Q+
Sbjct: 306 LKYRQACRSVVVAHKLQF 323
>gi|328707896|ref|XP_001945090.2| PREDICTED: KDEL motif-containing protein 1-like [Acyrthosiphon
pisum]
Length = 497
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 110/282 (39%), Gaps = 16/282 (5%)
Query: 143 IVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF 202
+V+ L Y MF L L R+ +PD++ + D P+ KE
Sbjct: 185 VVKNNLIFRKCYGEYTGFKMFMDNLLLSLNRKV--FLPDLEFFVNLGDWPLSSPKEQ--- 239
Query: 203 PLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWK 262
PLF +C ++ DI P + + N+ + ++ G+ K W +K+ +W
Sbjct: 240 -FPLFSWCGSNYSVDIVMPTYDITESALENMGRVTLDMLSVQ-GNIEKPWSQKIEKGFWM 297
Query: 263 GNPDVLSPLR-VELMKCNDSKLWGAEILRQNWAEEAKDGFKKSK----LSNQCNHRYKIY 317
G L VEL K N L A I + +E KD + K +++Y++
Sbjct: 298 GRDSSKHRLNLVELSKINPDIL-NASITNFFFYKELKDKYGPGKKPISFFKFFDYKYQLN 356
Query: 318 AEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWG 377
+G + Y+L +S+ +Y + F LIP ++ PI +
Sbjct: 357 IDGTVAAYRFPYLLVGDSLVFKQESEYYEHFYNELIPWVHYVPIKRHLDDLLDLIDIMM- 415
Query: 378 NANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLL 418
++ A KI GQ + E L + Y L L YSK L
Sbjct: 416 -SDDKTARKISLNGQKYAREHLAPHNILGYYLLLFQNYSKFL 456
>gi|388491132|gb|AFK33632.1| unknown [Medicago truncatula]
Length = 105
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 400 MDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLC---LADPKQRQNLEKAAASPS 456
M VY+YM HL+ EY+KLL +KP P A E C E L+ + K R E PS
Sbjct: 1 MKNVYNYMFHLLNEYAKLLKFKPTIPRGAVEVCPEKLMACDVIGGNKMRFMEESMVKVPS 60
Query: 457 PYPPCTL 463
PCT+
Sbjct: 61 DSNPCTI 67
>gi|70985550|ref|XP_748281.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66845909|gb|EAL86243.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 423
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 71/167 (42%), Gaps = 10/167 (5%)
Query: 176 PGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQP 235
P +PD + D+P+ + S+ P T F +P ++FW W +
Sbjct: 126 PEPLPDTVFALNIQDRPM---AQSWSYARPADPTATKAGSFLMP--HFAFWAWPLKYIGS 180
Query: 236 WNEEFKDIKHGSQAKSWKEKLPFAYWKGNP---DVLS-PLRVELMKCNDSKLWGAEILRQ 291
+ I ++++ K+P A W+G P DV + LR L++ K W A++
Sbjct: 181 MHRALTAITEIETKQTFQAKIPQAVWRGTPWFNDVQNHDLRKRLIRVTTGKPW-ADVQSL 239
Query: 292 NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
W + + + C ++Y IY EG +S L + +C S+ L
Sbjct: 240 KWETNGQTASNGLAIEDFCRYKYIIYTEGVTYSGRLLFHQACRSIIL 286
>gi|149916135|ref|ZP_01904657.1| putative lipopolysaccharide A protein [Roseobacter sp. AzwK-3b]
gi|149809990|gb|EDM69839.1| putative lipopolysaccharide A protein [Roseobacter sp. AzwK-3b]
Length = 327
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 68/147 (46%), Gaps = 3/147 (2%)
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSN 308
++ W +K P A W+G + +P + DS +I + + AE ++
Sbjct: 146 SRCWADKRPSAVWRGMLNGQAPRIAAVRLYADSP--AHDIGQVHEAEGLAPPKGWLSIAQ 203
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
Q HRY + EG + +LK+I++ NSVAL +Y+ ++ G + HF ADL
Sbjct: 204 QLEHRYVLSLEGNDVATNLKWIMASNSVALAPKLEYETWYMEGRLEPGVHFVQIRADLS- 262
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ + W ANP +I + Q ++
Sbjct: 263 DLDERIAWCEANPRAVAEIVQNAQTWV 289
>gi|307103854|gb|EFN52111.1| hypothetical protein CHLNCDRAFT_139417 [Chlorella variabilis]
Length = 393
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 106/277 (38%), Gaps = 41/277 (14%)
Query: 153 YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTN 212
Y +Q +A+ WG +P+++ + D + K+ G PLF YC +
Sbjct: 120 YGQVMQIQAVAKRWG-----------LPNMEFVVGWGD--IFGVKDDGDRRCPLFTYCRS 166
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSW--KEKLPFAYWKGN----PD 266
+DI PD F + W E+ K G W KE+ YW G P
Sbjct: 167 PESYDILIPDGHFL---QFKYDEWVEKMK----GGGDVPWEQKEERAVGYWHGYCAYLPQ 219
Query: 267 VLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVS 326
R E KC + L N A ++ LS Q +Y + +G S
Sbjct: 220 ADRFGRSE--KCIRAAYVNQSALGTNLT--AFSTGEREALSLQARFKYTLSTDGLGCSGR 275
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
+ +L+ V +FF L P ++ P + I+ +V + AN A+
Sbjct: 276 FQKLLATGQV---------EFFYPALRPWVHYVP-SGYNGIEEIERIVQFLRANDDMAKA 325
Query: 387 IGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
IG+ Q F E L + + Y+ L+ E KL+ Y P
Sbjct: 326 IGQNSQRFAREHLNEEARHCYLKVLMEEMHKLITYSP 362
>gi|350402459|ref|XP_003486492.1| PREDICTED: KDEL motif-containing protein 1-like [Bombus impatiens]
Length = 497
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 57/261 (21%), Positives = 118/261 (45%), Gaps = 17/261 (6%)
Query: 168 FLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWG 227
L L R+ ++PD++ + D P++ K+ P+F +C + DI P +
Sbjct: 217 LLSLTRKV--LLPDIEFFVNLGDWPLVPKEGKN---YPIFSWCGSFDTKDIVIPTYDI-- 269
Query: 228 WSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWG 285
+E +L+ D+ G+ W EK+ +W+G L + + L+
Sbjct: 270 -TESSLEEMGRVMLDMLSIQGNTDTPWNEKIEKVFWRGRDSSRERLNLIDISRKYPDLFN 328
Query: 286 AEILRQNWAEEAKD--GFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
I + ++ D G ++S +S + ++Y++ +G + L Y+L+ +++ L
Sbjct: 329 VAITNFFFFKDEMDKYGPEQSHVSFFHFFKYKYQLCIDGTVAAYRLPYLLAGDALLLKQE 388
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
+Y +FF L+ K++ + DL ++ ++ W + + +I K+ + F ++L
Sbjct: 389 SKYYEFFYNNLVAGKHYISV-KRDLSDLVEKIM-WAKEHDQKVLQIAKSARQFARDNLLP 446
Query: 401 DRVYDYMLHLITEYSKLLDYK 421
D V Y + L E+SK L K
Sbjct: 447 DNVLCYHVVLFHEWSKRLKSK 467
>gi|119499505|ref|XP_001266510.1| hypothetical protein NFIA_041910 [Neosartorya fischeri NRRL 181]
gi|119414674|gb|EAW24613.1| predicted protein [Neosartorya fischeri NRRL 181]
Length = 423
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 71/167 (42%), Gaps = 10/167 (5%)
Query: 176 PGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQP 235
P +PD + D+P+ + S+ P T F +P ++FW W +
Sbjct: 126 PEPLPDTVFALNIQDRPM---AQSWSYARPADPTATKAGDFLMP--HFAFWAWPLKYIGS 180
Query: 236 WNEEFKDIKHGSQAKSWKEKLPFAYWKGNP---DVLS-PLRVELMKCNDSKLWGAEILRQ 291
+ I ++++ K+P A W+G P DV + LR L++ K W A++
Sbjct: 181 MDRALTAITEIETDQTFQAKIPQAVWRGTPWFNDVQNHDLRKRLIQVTTGKPW-ADVQSL 239
Query: 292 NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
W + + + C ++Y IY +G +S L + +C S+ L
Sbjct: 240 KWETNGQTASNGLAIEDFCRYKYIIYTDGVTYSGRLLFHQACRSIIL 286
>gi|326485254|gb|EGE09264.1| DUF821 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 466
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 88/213 (41%), Gaps = 17/213 (7%)
Query: 140 RILIVRGKLYV---DPYYDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVI 194
R +I RG+LY+ + + + + + L G P ++ +F D+ +
Sbjct: 149 RAIIHRGELYIVATKAAQEDHRKKILAILSSMHHALSATAGNRTQPSIEFIFSIEDR--V 206
Query: 195 DKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAKS 251
D P+ L R + ++ + IP D+ FW W + P+N+ I+
Sbjct: 207 DDISGTGHPIWALSRKASEESIWLIP--DFGFWAWDNAANAIGPYNQVVDRIQRQEVTVP 264
Query: 252 WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN 311
W K W+G LR L+ WG ++ W+E K F + + C
Sbjct: 265 WSSKENKLVWRGKLSFAPKLRRNLLDIARDTSWG-DVKEIVWSE--KQNFLS--MDDHCK 319
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
+++ + EG A+S SLKY +C SV + Q+
Sbjct: 320 YKFIAHVEGRAYSSSLKYRQACRSVVIAHKLQF 352
>gi|326469789|gb|EGD93798.1| hypothetical protein TESG_01331 [Trichophyton tonsurans CBS 112818]
Length = 466
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 88/213 (41%), Gaps = 17/213 (7%)
Query: 140 RILIVRGKLYV---DPYYDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVI 194
R +I RG+LY+ + + + + + L G P ++ +F D+ +
Sbjct: 149 RAIIHRGELYIVATKAAQEDHRKKILAILSSMHHALSATAGNRTQPSIEFIFSIEDR--V 206
Query: 195 DKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAKS 251
D P+ L R + ++ + IP D+ FW W + P+N+ I+
Sbjct: 207 DDISGTGHPIWALSRKASEESIWLIP--DFGFWAWDNAANAIGPYNQVVDRIQRQEVTVP 264
Query: 252 WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN 311
W K W+G LR L+ WG ++ W+E K F + + C
Sbjct: 265 WSSKENKLVWRGKLSFAPKLRRNLLDIARDTSWG-DVKEIVWSE--KQNFLS--MDDHCK 319
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
+++ + EG A+S SLKY +C SV + Q+
Sbjct: 320 YKFIAHVEGRAYSSSLKYRQACRSVVIAHKLQF 352
>gi|159125788|gb|EDP50904.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 423
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 70/167 (41%), Gaps = 10/167 (5%)
Query: 176 PGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQP 235
P +PD + D+P+ + S+ P F +P ++FW W +
Sbjct: 126 PEPLPDTVFALNIQDRPM---AQSWSYARPADPTAAKAGSFLMP--HFAFWAWPLKYIGS 180
Query: 236 WNEEFKDIKHGSQAKSWKEKLPFAYWKGNP---DVLS-PLRVELMKCNDSKLWGAEILRQ 291
+ I ++++ K+P A W+G P DV + LR L++ K W A++
Sbjct: 181 MHRALTAITEIETKQTFQAKIPQAVWRGTPWFNDVQNHDLRKRLIRVTTGKPW-ADVQSL 239
Query: 292 NWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
W + + + C ++Y IY EG +S L + +C S+ L
Sbjct: 240 KWETNGQTASNGLAIEDFCRYKYIIYTEGVTYSGRLLFHQACRSIIL 286
>gi|332025008|gb|EGI65195.1| KDEL motif-containing protein 1 [Acromyrmex echinatior]
Length = 495
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 68/329 (20%), Positives = 137/329 (41%), Gaps = 25/329 (7%)
Query: 106 CPDFFKSIHKDLEPWAK---SRITMRHIMEAKRFAALRI---LIVRGKLYVDPYYDCVQS 159
C + IH DL P++ +I I R ++ I ++ ++Y Y V
Sbjct: 149 CQQNYTQIHHDLSPFSNIDFDKIRQSIIKRYDRPTSVSICHYVLKSNRIYRQCYGQYVGF 208
Query: 160 RAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDI 218
+ +F L L R+ M+PD++ + D P++ S PL P+F +C +++ DI
Sbjct: 209 K-IFMDAILLSLTRKV--MLPDIEFFVNLGDWPLVPD----SGPLYPIFSWCGSESTKDI 261
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKD-IKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
P + +E +L+ D + W+ K +W+G L + +
Sbjct: 262 VMPTYDI---TESSLEAMGRVMLDTLSVQGNGLPWEHKTEQLFWRGRDARRERLDLIDIS 318
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSK----LSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+L+ I + + D + ++ N ++Y++ +G + Y+L+
Sbjct: 319 RKHPELFNVSITNFFFFRDEIDKYGPAQNHVSFFNFFKYKYQLNIDGTVAAYRFPYLLAG 378
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+S+ +Y +FF + L ++ P+ S DL ++ ++ W + + +I K+ +
Sbjct: 379 DSLVFKQESKYYEFFYKDLTSGLHYVPVKS-DLSDLVEKIM-WAKEHDEDGLRIVKSARQ 436
Query: 394 FMESLTMDR-VYDYMLHLITEYSKLLDYK 421
F + R + Y L E+SK L K
Sbjct: 437 FTRDNLLPRDILCYYTVLFHEWSKRLKSK 465
>gi|452977579|gb|EME77345.1| hypothetical protein MYCFIDRAFT_83283 [Pseudocercospora fijiensis
CIRAD86]
Length = 430
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 91/221 (41%), Gaps = 28/221 (12%)
Query: 137 AALRILIVRGKLYVDPYYDCV-----QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDK 191
A+R+LI +L V + + Q R++ G L LL R ++
Sbjct: 113 GAIRVLIHDNRLRVLESINALRLIGYQERSL----GILNLLHRALEAATAGGERLPTIEA 168
Query: 192 PVIDK------KEHGSFPLPLFRYCTNDAHFDIPF--PDWSFWGWSEVNLQPWNEEFKDI 243
++ + EHG+ + ++ FD + P++ FW + ++D
Sbjct: 169 AIVLQDLSLPPTEHGTETFWTWTRELGNSTFDRVWLIPNFDFWAAKGLG------SYRDA 222
Query: 244 KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK 303
+ + A+ EK+P A W+G+ LR L K W A+IL N G
Sbjct: 223 RLEAFARDNVEKIPKAVWRGSRLNNEKLRSGLFNTTRGKEW-ADILYYN----GTTGENA 277
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
+ + CN+ ++ EG+ +S LK++ +CNS+ + Y
Sbjct: 278 IPIRDMCNYTMTVHTEGWTYSGRLKFLFNCNSLPFVHELNY 318
>gi|198461786|ref|XP_002135782.1| GA28684 [Drosophila pseudoobscura pseudoobscura]
gi|198139919|gb|EDY70866.1| GA28684 [Drosophila pseudoobscura pseudoobscura]
Length = 270
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 95/228 (41%), Gaps = 11/228 (4%)
Query: 197 KEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAK-SWKEK 255
++ S P P+F +C +D +DI P + +E L+ D+ Q W K
Sbjct: 18 QQRTSGPYPIFSWCGSDDSYDITLPTYDI---TESTLENMGRVMLDMLSVQQTDIPWDNK 74
Query: 256 LPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW--AEEAKDGFKKSKLSNQCNHR 313
+++G L++ + L A I + EE K G + +S R
Sbjct: 75 EEIGFFRGRDSRRERLKLIDLARKFPDLINASITNFFFFRNEEQKYGPRVPHISFMEFFR 134
Query: 314 YK--IYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIK 371
YK + +G + L Y+L+ S+ Y + F L+P K++ PI DL I+
Sbjct: 135 YKYQLNIDGAVAAYRLPYLLAGGSLVFKQESPYYEHFYSKLLPYKHYVPI-KRDLSDIIE 193
Query: 372 SVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLL 418
+ W N A++I + F+ E+L +Y Y + L E+S L
Sbjct: 194 K-IRWAKDNDIRAKEIALTARKFVEENLLPQHIYCYHMTLFKEWSNRL 240
>gi|321261487|ref|XP_003195463.1| hypothetical protein CGB_G6600C [Cryptococcus gattii WM276]
gi|317461936|gb|ADV23676.1| Hypothetical protein CGB_G6600C [Cryptococcus gattii WM276]
Length = 299
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 59/283 (20%), Positives = 118/283 (41%), Gaps = 46/283 (16%)
Query: 193 VIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNE------------- 238
VI + G P F C D+ PD+ F+ W E + ++E
Sbjct: 19 VIQTSDAGGGHHPHFALCRKANQKDLWLMPDFGFFSWPEPGVGSYSEVRTKTLDYELDMG 78
Query: 239 -EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
E D +++ W K +W+G+P V +R +L++ + + W +++ +W +
Sbjct: 79 LEVNDKLEVTES-DWFNKTQQLFWRGSPMV--EVRNDLLRASQDQPW-SDVQPLDWGKVG 134
Query: 298 KDGFKKSKLSN-------QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF-- 348
+D ++ K + C + + + EG+A+S LKY+ C SV + +Y +
Sbjct: 135 QDEAERLKNNGDLKSPAEHCKYAFLAHVEGWAYSGRLKYLQQCRSVIVAHPLEYIQHYHH 194
Query: 349 ---SRGLIPTKNHFPIP---SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDR 402
SR P +N+ +P +L + +K+++ N EK+ + + +S+
Sbjct: 195 LLNSRDGDPNQNYVEVPLPLEKNLPKVMKNLL-----NEKNEEKVQRIADNNWKSMRQGW 249
Query: 403 VYD-----YMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLA 440
+ Y H + +Y+ + +KP+ A ES L +
Sbjct: 250 ISPAANECYYRHALRQYASVQTFKPSLEGRA--VPYESFLLMG 290
>gi|402895159|ref|XP_003910701.1| PREDICTED: KDEL motif-containing protein 2 [Papio anubis]
Length = 483
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/178 (24%), Positives = 79/178 (44%), Gaps = 10/178 (5%)
Query: 251 SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQC 310
SW K A+++G L++ + + +L A I + +E + K+KL
Sbjct: 253 SWINKTERAFFRGRDSREERLQLVQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFF 312
Query: 311 N---HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--AD 365
+ ++Y++ +G + Y++ +S+ L Y + F L P K++ PI +D
Sbjct: 313 DFFKYKYQVNVDGTVAAYRYPYLMLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSD 372
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
L +K W N EA+KI K GQ L R+Y Y ++ +Y++ KP
Sbjct: 373 LLEKVK----WAKENDEEAKKIAKEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 426
>gi|67902032|ref|XP_681272.1| hypothetical protein AN8003.2 [Aspergillus nidulans FGSC A4]
gi|40739616|gb|EAA58806.1| hypothetical protein AN8003.2 [Aspergillus nidulans FGSC A4]
gi|259480754|tpe|CBF73686.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 484
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/242 (22%), Positives = 97/242 (40%), Gaps = 44/242 (18%)
Query: 180 PDVDIMFDCMDKPV-IDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWS--EVNLQPW 236
P+++ +F DK V + EH + L T D PD+ +W W + ++ P+
Sbjct: 189 PEIEFVFSIEDKLVDVTSSEHPVWVLAR----TADEEAAWLMPDFGYWAWDHLQASIGPY 244
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
++ + W++K W+G P LR LM + W A++ +W E+
Sbjct: 245 DQVVEQAAEYDNIP-WEDKKHQLVWRGKPSFAPKLRRALMDATRDQPW-ADVQAVDWQEQ 302
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGY------------------------AWSVSLKYILS 332
K K + + C + + + EG ++S SLKY +
Sbjct: 303 DKSNVLK--MEDHCKYMFIAHVEGMPPVHLPYFINREAKLTLDLPYAGRSYSASLKYRQA 360
Query: 333 CNSVALIISQQYKDFFSRGLIPT---KNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKI 387
C SV ++ QY LIP+ +N+ + +DL ++ ++D +P AE I
Sbjct: 361 CRSVIVVHQLQYIQHHHYLLIPSGPQQNYVEVARDFSDLADALMPLLD----DPVRAETI 416
Query: 388 GK 389
Sbjct: 417 AN 418
>gi|453085417|gb|EMF13460.1| hypothetical protein SEPMUDRAFT_148744 [Mycosphaerella populorum
SO2202]
Length = 432
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 92/191 (48%), Gaps = 13/191 (6%)
Query: 254 EKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
+K+ W+G+ V +R L+K + K W A++LR + + A+ +K K + C +
Sbjct: 224 DKIQKLVWRGSEWVQRDVRGSLLKVTEKKPW-ADVLRTDLDDTAEKKRRKMKPAEMCKYA 282
Query: 314 YKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPT---KNHFPIPS--ADLCR 368
+ EG ++S L++IL CNS+ ++ + ++S L P +N+ P+ ++L
Sbjct: 283 ALAHTEGISYSGRLEFILLCNSLPVVHDLNWIVWYSHLLQPDGPRQNYIPVHRNFSNLET 342
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD--YMLHLITEYSKLLDYKPAPPS 426
+ ++ +P +AE I + + + R Y+ LI YS+ + ++P P
Sbjct: 343 QMTHYLE----HPDQAETIIANSLNTFRNKYLTRAAQSCYIRRLIQGYSQ-VSFEPDPYR 397
Query: 427 SAFEACVESLL 437
S+ E + L
Sbjct: 398 SSKEGGSTTAL 408
>gi|239608365|gb|EEQ85352.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327349565|gb|EGE78422.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 449
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/306 (20%), Positives = 123/306 (40%), Gaps = 35/306 (11%)
Query: 139 LRILIVRGKLYVDPYYDC--VQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVI 194
+R ++ +G LYV + D SRA ++ + + P +P+++ +F D
Sbjct: 127 VRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAIPNPNEIPNIEFIFTTED---- 182
Query: 195 DKKEHGSFPLPLFRYCT-NDAHFDIPFPDWSFWGWSEVNLQPWN---EEFKDIKHGSQAK 250
P P++ Y D + PD+ +W W E+ + + I G+
Sbjct: 183 ----FHDDPHPVWVYSKREDDGWAWLMPDFGYWSWPEIKAGQYRSVRQRIAAIDEGTVIN 238
Query: 251 S-------WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK 303
+++K W+G LR L+ K W A + WA+E
Sbjct: 239 GKAHPPLKFQDKKKQLLWRGAIATAPELRQGLVDATKGKSW-ASVRPLTWADEKSMQENF 297
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFP 360
+ + C + + + EG ++S KY+ +C SV + +++ L+ P N+
Sbjct: 298 IPIEDHCRYMFVAHVEGRSYSGRGKYLQNCRSVMVAHQLNWREAHHGALVAAGPDANYVK 357
Query: 361 IPS--ADLCRSIKSVVDWGNANPSEAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSK 416
+ +DL + ++D NP AE+I + + F + LT Y L+ Y+
Sbjct: 358 VKRDFSDLESKMNYLLD----NPDVAERIAENSVRTFRDRYLTPAAEACYWRELMHAYAS 413
Query: 417 LLDYKP 422
+ D++P
Sbjct: 414 MCDFEP 419
>gi|396467668|ref|XP_003837998.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214563|emb|CBX94554.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
Length = 496
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 83/381 (21%), Positives = 154/381 (40%), Gaps = 60/381 (15%)
Query: 85 LSAMNPIPERRRVASPQKVQ-TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILI 143
L+A P P +RR+ P + T ++ S + + W K + + +R L+
Sbjct: 94 LAATKP-PRKRRICGPSTMHATRTNYGLSEAQVRKRWGKVTEADLSSFDLSK-GMVRGLL 151
Query: 144 VRGKLYVDPYY---DCVQSRAMFTIWGFLQLLRRYP--GMVPDVDIMFDCMDKPVIDKKE 198
+LY+ Y + + +A+ T+ + + P +P+++ +F D P K
Sbjct: 152 FERELYIIETYLVDNVNRQKALATLSALYRSVLSAPPSTDIPNIEFVFSVEDLPAQPNKP 211
Query: 199 HGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPF 258
S L R + + IP D+ FW W L +E + + W +K+
Sbjct: 212 MWS----LARRVQDHNLWLIP--DFGFWSWDMPALGTLDEVANEAVEREAVEPWDQKMEK 265
Query: 259 AYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLS--NQCNHRYKI 316
W+G LR L+ K W +++ + W + FK+ L +QCN+ +
Sbjct: 266 LVWRGKITFAPKLRRALLDAAKGKPW-SDVGQLKWTDP---NFKEQFLGPVDQCNYMFIA 321
Query: 317 YAEGY-------------------------AWSVSLKYILSCNSVALIISQ-----QYKD 346
+AEG+ ++S SLKY C SV I+S Q+
Sbjct: 322 HAEGWLHPRTIGLTTSGNPMVQDANDLLGRSYSGSLKYRQLCRSV--IVSHKLQWIQHYH 379
Query: 347 FFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMES-LTMDR 402
+ R +N + +DL +++ ++D +P +A++I + Q F E LT
Sbjct: 380 YLFRSNGSNQNFVEVERDFSDLSSAMEDLLD----HPEKAKRIADNSVQVFRERYLTQAA 435
Query: 403 VYDYMLHLITEYSKLLDYKPA 423
Y HLI + +++ ++P
Sbjct: 436 ETCYWRHLIKRWKEVMAWEPV 456
>gi|417409544|gb|JAA51271.1| Putative protein o-glucosyltransferase 1, partial [Desmodus
rotundus]
Length = 306
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 113/255 (44%), Gaps = 29/255 (11%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + ++E +R I++ +LY + DC+
Sbjct: 64 PCSSQNCSCYHGVIEEDLTPF-RGGISRKVMVEVVRRKLGTHYQIIKHRLYRE--NDCMF 120
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 121 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 174
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +++ + WK+K AY++G+ SP
Sbjct: 175 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVRSAVQWPWKKKNSTAYFRGS--RTSP 232
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSN--QCNHRYK----IYAEGYA 322
R + L+ + KL AE + + KD +K N NH + + E +
Sbjct: 233 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDNHRKPAEKNLKHANHSHSKNIATWVE-FQ 291
Query: 323 WSVSLKYILSCNSVA 337
V + Y+++ SVA
Sbjct: 292 SRVPVSYVMNVESVA 306
>gi|261203203|ref|XP_002628815.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239586600|gb|EEQ69243.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 449
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/306 (20%), Positives = 123/306 (40%), Gaps = 35/306 (11%)
Query: 139 LRILIVRGKLYVDPYYDC--VQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVI 194
+R ++ +G LYV + D SRA ++ + + P +P+++ +F D
Sbjct: 127 VRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAIPNPNEIPNIEFIFTTED---- 182
Query: 195 DKKEHGSFPLPLFRYCT-NDAHFDIPFPDWSFWGWSEVNLQPWN---EEFKDIKHGSQAK 250
P P++ Y D + PD+ +W W E+ + + I G+
Sbjct: 183 ----FHDDPHPVWVYSKREDDGWAWLMPDFGYWSWPEIKAGQYRSVRQRIAAIDEGTVIN 238
Query: 251 S-------WKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK 303
+++K W+G LR L+ K W A + WA+E
Sbjct: 239 GKAHPPLKFQDKKKQLLWRGAIATAPELRQGLVDATKGKSW-ASVRPLTWADEKSMQENF 297
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---PTKNHFP 360
+ + C + + + EG ++S KY+ +C SV + +++ L+ P N+
Sbjct: 298 IPIEDHCRYMFVAHVEGRSYSGRGKYLQNCRSVMVAHQLNWREAHHGALVAAGPDANYVK 357
Query: 361 IPS--ADLCRSIKSVVDWGNANPSEAEKIGKAG-QDFMES-LTMDRVYDYMLHLITEYSK 416
+ +DL + ++D NP AE+I + + F + LT Y L+ Y+
Sbjct: 358 VKRDFSDLESKMNYLLD----NPDVAERIAENSVRTFRDRYLTPAAEACYWRELMHAYAS 413
Query: 417 LLDYKP 422
+ D++P
Sbjct: 414 MCDFEP 419
>gi|321474111|gb|EFX85077.1| hypothetical protein DAPPUDRAFT_314403 [Daphnia pulex]
Length = 502
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/276 (21%), Positives = 115/276 (41%), Gaps = 18/276 (6%)
Query: 126 TMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIM 185
M+ A ++ ++ K+Y Y V MF L L R+ ++PD++ +
Sbjct: 176 AMKRFSHAGSYSFCHYAVLDNKVYRRCYGQHV-GFNMFMDNILLSLSRK--AVLPDMEFL 232
Query: 186 FDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI- 243
+ D P++ K P+ P+F +C + DI P + +E +L+ D+
Sbjct: 233 INLGDWPLVKK---NILPIIPIFSWCGSTQTADIVMPTYDI---TEASLECMGRVTLDML 286
Query: 244 -KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW--AEEAKDG 300
+ W+ K A+W+G L + + +L A + + EE G
Sbjct: 287 SVQSNPDTKWENKQEKAFWRGRDSRRERLNLVKLSRQRPELINASLTNFFFFRDEEKTYG 346
Query: 301 FKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNH 358
K+ +S +++Y++ +G + Y+L+ ++V +Y + F L P ++
Sbjct: 347 PKEDHISFFKFFDYKYQLNIDGTVAAYRFPYLLAGDAVVFKQDSEYYEHFYSDLKPGVHY 406
Query: 359 FPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDF 394
PI ADL +K + W + E KIG G+ +
Sbjct: 407 VPI-KADLSDLVKK-IQWAKTHDEEVRKIGINGRQY 440
>gi|343428802|emb|CBQ72347.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 522
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 114/294 (38%), Gaps = 45/294 (15%)
Query: 92 PERRRVASPQKVQTC----PDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAAL-----RI 141
PERRR + TC P + I ++ W AK I + EA R A RI
Sbjct: 80 PERRRDRT-MDAATCQAEFPLLYPQIQHNVAAWKAKGGIKYSDLDEAARTCAANWGMARI 138
Query: 142 LIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDV--------DIMFDCMDKPV 193
+I G+L++ + +SR I L LL + +V D D + + V
Sbjct: 139 VIRDGQLFLRQVREGGESR----ISALLHLL--HTAVVTDPSSFAPETDDDATNTGVELV 192
Query: 194 IDKKEHGSFPLPLF------RYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGS 247
+ + + + P R ++ PD+ F GW E + ++E +
Sbjct: 193 LSEADKDASPTSDAVWVLSKRVSEAESKGTWLLPDFGFAGWPEAGIASFDEFLHLAQLQD 252
Query: 248 QAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDS-KLWGAEILRQNWAEEAKDGFKKSK- 305
W +K W+G + P RV+L+ D K+ GA+ WA+ + F
Sbjct: 253 HLVPWSQKADRVLWRGLANGYPP-RVDLLSRTDPLKVPGAD----KWADVKQTSFHDVGA 307
Query: 306 -------LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL 352
+ C H++ + EG ++S K++ SC SV + ++ F L
Sbjct: 308 DFHPIIPMHEHCRHKFLVQTEGNSYSGRGKFLWSCRSVTVAHPMEWTQHFHPAL 361
>gi|71020735|ref|XP_760598.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
gi|46100486|gb|EAK85719.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
Length = 488
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/236 (18%), Positives = 100/236 (42%), Gaps = 19/236 (8%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRH---IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
P +K I + +E + + +T+ + + + ++ R++I ++Y+ + +R
Sbjct: 134 PGLWKEIDRSVEYYTEHPLTLEYLDRVCDDGIWSHARVVIHNNRVYLKYFQQSAFTRINS 193
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDI-PFPD 222
+ Q + +PD + D + K F N + +D+ PD
Sbjct: 194 ALALLFQSVVASREKLPDAEFCLSANDWGSMGK----------FSLDRNPSLYDVWLMPD 243
Query: 223 WSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSK 282
+ F+ W E + + E + Q+ W K+ +W+G+ V + R L+ ++
Sbjct: 244 YGFYSWPEPGIGSYTEHREKTLAIEQSTPWDSKIAKLFWRGSMGVGTADRKALLAAAENH 303
Query: 283 LWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
W ++ +W + + GF + + C ++ + EG +S L+Y+ +C SV +
Sbjct: 304 EWN-DVRPLDW--DNRQGF--VSMEDHCKWKFHAFPEGMTYSGRLRYLQNCRSVIV 354
>gi|398411945|ref|XP_003857305.1| hypothetical protein MYCGRDRAFT_24570, partial [Zymoseptoria
tritici IPO323]
gi|339477190|gb|EGP92281.1| hypothetical protein MYCGRDRAFT_24570 [Zymoseptoria tritici IPO323]
Length = 362
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/319 (21%), Positives = 123/319 (38%), Gaps = 28/319 (8%)
Query: 92 PERRRVASPQKVQTC----PDFFKSIHKDLEPWAKSRITMRHI-MEAKRFAALRILIVRG 146
P R R + C P+ + I L+ W ++I +I + +R LI
Sbjct: 7 PARDRDNHGLTTEQCDAAFPELYHQIDISLDHWKGTKIESDNIKLLGGNDGGVRFLIHDQ 66
Query: 147 KLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGM-------VPDVDIMFDCMDKPVIDKKEH 199
+L + + I Q + R +PD++ DKP + K
Sbjct: 67 QLRIIETRGLHREDFRHRIIAIAQQILRAIAAAEAANEPLPDIEFTIIVDDKPYLHKDPS 126
Query: 200 GSFPLPLFRYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS--WKEKL 256
+ + N H I PD+ F+G P E F+ ++ ++ + K+
Sbjct: 127 AAL-WSFAKQQANKIHDSIWLVPDFHFFG-----APPEAESFRLMQTRARQHDAPLERKI 180
Query: 257 PFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKI 316
P W+G LR L+K + W A++ W KD + C +RY +
Sbjct: 181 PKVVWRGVEWTNQNLRKPLLKVTAGRPW-ADVEAMKW----KDPNSVIPMDEFCKYRYVV 235
Query: 317 YAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDW 376
EG +WS L ++L+C+S+ L+ ++ + L T N+ D ++S + +
Sbjct: 236 NTEGRSWSARLTHLLNCDSLVLVHDVEWVAHYYH-LFDTLNNCVRVERDWS-DLESTIAY 293
Query: 377 GNANPSEAEKIGKAGQDFM 395
+ N EA+KI ++
Sbjct: 294 YDKNLDEAQKIANRTKEMF 312
>gi|327303802|ref|XP_003236593.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
gi|326461935|gb|EGD87388.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
Length = 431
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 106/262 (40%), Gaps = 24/262 (9%)
Query: 140 RILIVRGKLYV---DPYYDCVQSRAMFTIWGFLQLLRRYPG--MVPDVDIMFDCMDKPVI 194
R +I RG+LY+ + + + + + L G P ++ +F D+ V
Sbjct: 112 RAIIHRGELYIVATKAAQEDHRKKILAVLSSMHHALSATAGNRTQPSIEFVFSIEDR-VD 170
Query: 195 DKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWNEEFKDIKHGSQAKSW 252
D G L R + ++ + IP D+ FW W + P+N+ I+ W
Sbjct: 171 DISGPGHPIWALSRKASEESIWLIP--DFGFWAWDNAANAIGPYNQVVDRIQRQEVTVPW 228
Query: 253 KEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNH 312
K W+G LR L+ WG ++ W+E K F + + C +
Sbjct: 229 SSKENKLVWRGKLSFAPKLRRNLLDIARDTSWG-DVKEIVWSE--KQNF--ISMDDHCKY 283
Query: 313 RYKIYAEGYAWSVSLKYILSCNSVALIISQQY---KDFFSRGLIPTKNHFPIPS--ADLC 367
++ + EG A+S SLKY +C V + Q+ + + P +N + +DL
Sbjct: 284 KFIAHVEGRAYSSSLKYRQACRFVVVAHKLQFIQHHHYLLQSTGPYQNFVEVERDFSDLP 343
Query: 368 RSIKSVVDWGNANPSEAEKIGK 389
++ ++ AN AE+I K
Sbjct: 344 AKMEHLL----ANQDLAERIAK 361
>gi|453085602|gb|EMF13645.1| hypothetical protein SEPMUDRAFT_148877 [Mycosphaerella populorum
SO2202]
Length = 465
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 91/206 (44%), Gaps = 18/206 (8%)
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKH--GSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
P+ F+ + NL + D +H S ++K+P A W+G V LR L+
Sbjct: 216 MPNVEFYASMDRNLGA----YHDARHRAAQHDSSLEDKIPEAAWRGTAWVNPELRGGLVN 271
Query: 278 CNDSKLWGA-EILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
K W + + N + A + ++ C++R+ I+ EG A+S L+Y+L+C+++
Sbjct: 272 KTKGKPWASVHTIDANDPKHAH----QLRMDEFCSYRFAIHTEGIAYSGRLQYLLNCDNL 327
Query: 337 ALIISQQYKDFFSRGLI---PTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD 393
+I + ++ LI P +N+ P+ ++S V + NP A I K
Sbjct: 328 PIIHKLAWTTHWNHLLIPEGPQQNYIPVKRD--WSDLESQVKFYTENPFNANMIVKNHLK 385
Query: 394 FMESLTMDRVYD--YMLHLITEYSKL 417
+ R Y+ L+ EY+ +
Sbjct: 386 TFRDRYLTRAATSCYIRRLMHEYASV 411
>gi|452848338|gb|EME50270.1| hypothetical protein DOTSEDRAFT_68967 [Dothistroma septosporum
NZE10]
Length = 454
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 125/276 (45%), Gaps = 46/276 (16%)
Query: 137 AALRILIVRGKLYVDPYYDCVQSRAMFTIWG-------FLQLLRR--YPGM-----VPDV 182
A+RILI +L + ++S M WG FL LL+R Y +P +
Sbjct: 126 GAMRILIHDNELRI------LESEGMIGHWGYRGRGMGFLYLLQRSLYSAQAAGEKLPTI 179
Query: 183 D---IMFDCMDKPVIDKKEHGSFP-LPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNE 238
+ I D ++ P K H + T+D H+ IP ++ W +
Sbjct: 180 EAAIIAEDIVNPPNGTKGTHSFWTWTNHIGDETHDRHWMIP--NYDLWYGYPMG------ 231
Query: 239 EFKDIKHGSQAK--SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEE 296
F++ ++ + + ++K+KL A W+G+ + +R +L + + K W ++I ++
Sbjct: 232 SFREARYRAMERDCAFKDKLSKAVWRGSGFLNPGIREKLFEVSKDKPW-SDIWFADYGGM 290
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL---I 353
D F + + C + + I+ EG ++S L YI +C+++ +I ++ + + L
Sbjct: 291 DSDHF--LTVEDMCKYMFAIHTEGVSYSGRLSYIQNCDNLPIIHDLEWNTYTTHLLQDSG 348
Query: 354 PTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKI 387
P +N+ + DL +IK + A+P EAEKI
Sbjct: 349 PGQNYVRVKRDWTDLEENIKHYI----AHPEEAEKI 380
>gi|388855380|emb|CCF51044.1| uncharacterized protein [Ustilago hordei]
Length = 493
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/236 (19%), Positives = 95/236 (40%), Gaps = 17/236 (7%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRH---IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
P +K I + ++ + + +T + + + ++ R+LI ++Y+ + +R
Sbjct: 139 PGLWKEIDRSVDYYTEHPLTAEYLDKVCDDGIWSHARVLIHNNRVYLKYFQQSAFTRVNS 198
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
+ Q + +PD + D + G F L Y + PD+
Sbjct: 199 ALALLFQSVVASKEKLPDAEFCLSANDWGSM-----GKFSLDRAPYLEDL----WLMPDY 249
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL 283
F+ W E + + E + +A W+ K+ +W+G V + R L+
Sbjct: 250 GFYSWPEPGIGSYTEHREKTLEVEKATPWENKISKLFWRGAMGVGTADRKALLAAASDHP 309
Query: 284 WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
W ++ +W +DGF + + C ++ + EG +S L+Y+ +C SV +
Sbjct: 310 WN-DVKPLDWGN--RDGF--VTMEDHCKWKFHAFPEGMTYSGRLRYLQNCRSVIVT 360
>gi|255630857|gb|ACU15791.1| unknown [Glycine max]
Length = 194
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 28/38 (73%)
Query: 403 VYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLA 440
VYDYM HL+ Y+KL YKP+ ++A E CVES++C A
Sbjct: 157 VYDYMFHLLNSYAKLFRYKPSISANATELCVESMVCGA 194
Score = 44.7 bits (104), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 229 SEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMK 277
+E+N++PW+ +K G+ W + P+AYWKGNP V + R +LM
Sbjct: 46 AEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAV-AQTRQDLMN 93
>gi|58269136|ref|XP_571724.1| hypothetical protein CNG00070 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57227960|gb|AAW44417.1| hypothetical protein CNG00070 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 299
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/279 (21%), Positives = 114/279 (40%), Gaps = 38/279 (13%)
Query: 193 VIDKKEHGSFPLPLFRYCTNDAHFDI-PFPDWSFWGW--------SEVNLQPWNEEFK-- 241
VI + G P F C D+ PD+ F+ W SEV + + E
Sbjct: 19 VIQTSDAGGGNHPHFALCRKADQKDLWLMPDFGFFSWPEPGVGSYSEVRTKTLDYELDMG 78
Query: 242 ---DIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
D K W K +W+G+P V +R +L++ + + W +++ +W + ++
Sbjct: 79 LEVDDKLDVTKSDWFNKTQQLFWRGSPMV--EVRNDLLRASQDQPW-SDVQPLDWGKVSQ 135
Query: 299 DGFKKSKLSN-------QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFF--- 348
D ++ K + C + + + EG+A+S LKY+ C SV + +Y +
Sbjct: 136 DEAERIKNNGDLKSPAEHCKYAFLAHVEGWAYSGRLKYLQQCRSVIVAHPLKYIQHYHHL 195
Query: 349 --SRGLIPTKNHFPIP---SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRV 403
R P +N+ +P +L R+++ ++ N + ++I M +
Sbjct: 196 LNGRDGDPNQNYVEVPLPLEKNLPRAMEDLLK--EENEEKVQRIADNNWKSMRQGWISPA 253
Query: 404 YD--YMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLA 440
+ Y H + +Y+ + +KP+ A ES L +
Sbjct: 254 ANECYYRHALRQYASVQTFKPSLEGRA--VPYESFLLMG 290
>gi|159125995|gb|EDP51111.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 464
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/166 (22%), Positives = 72/166 (43%), Gaps = 12/166 (7%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWNE 238
D++ +F DK ++ P+ + N+ + PD+ +W W + + P+++
Sbjct: 193 DIEFVFSVEDK--VEDVTSAEHPVWVLARAANEEAVWL-MPDFGYWAWDNPDNAIGPFDQ 249
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
I+ W+ K W+G P LR L++ + WG ++ + +W
Sbjct: 250 VVARIQRAD--IPWEAKKQQLVWRGKPSFAPKLRRALIEATRDQPWG-DVKQVSW----P 302
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
D + + C + + + EG ++S SLKY +C SV + QY
Sbjct: 303 DRTNVLSMEDHCQYMFIAHVEGRSYSASLKYRQACKSVIVAHKLQY 348
>gi|453088358|gb|EMF16398.1| hypothetical protein SEPMUDRAFT_145659 [Mycosphaerella populorum
SO2202]
Length = 470
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 99/228 (43%), Gaps = 23/228 (10%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDA---HFDI-PFPDWSFWGWSEVNLQ 234
+P+++ D V+D+K+ G+ L+ + DA H ++ PD+ F+
Sbjct: 189 IPNIEFSVVVDDIAVLDQKKPGA----LWSFTRRDADAVHDNLWLIPDFHFFA-----AP 239
Query: 235 PWNEEFKDI--KHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQN 292
P E F+ + K + S ++KLP W+G P+R L+ K W A + +
Sbjct: 240 PEAEGFRTMQAKFRTHDSSLEQKLPQVVWRGATWTNKPVRQSLLDVTAGKEW-ANVSEMS 298
Query: 293 WAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL 352
W A + C +RY + EG AWS + ++ +C+S+ + ++ + L
Sbjct: 299 WENLAT----VIPMDEFCKYRYTVNTEGRAWSARMTHLFNCDSLMFVHDVEWVAHYYHLL 354
Query: 353 IPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG-KAGQDFMESLT 399
H I A +++ + + + EA++I +A + F + T
Sbjct: 355 --DTGHNCISVARDFHDLEAKIQYYEHHLDEAQQIADRAKETFRQRYT 400
>gi|47211427|emb|CAF96437.1| unnamed protein product [Tetraodon nigroviridis]
Length = 456
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 121/290 (41%), Gaps = 23/290 (7%)
Query: 106 CPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSRA 161
CPD F I DL ++ +RF + L + K+YV + + V R
Sbjct: 126 CPDSFPQIDTDLSVFSSVDPDRNAQEVPQRFGQRQSLCHYTVKDNKVYVKTFGEHVGFR- 184
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFP 221
+F L L R+ +PDV+ + D P ++K++ P+F +C ++ DI P
Sbjct: 185 IFMDSILLSLTRKV--RLPDVEFFVNLGDWP-LEKRKPTDQIHPIFSWCGSNNTLDIVMP 241
Query: 222 DWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
+ +E L+ D+ S W EK A+W+G R+EL+K +
Sbjct: 242 TYDL---TESVLETMGRVSLDMMSVQASTGPPWPEKNATAFWRGRDSRQE--RLELVKLS 296
Query: 280 DSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+ + N+ +E+ G K + ++Y+I +G + L Y+L+
Sbjct: 297 RAHPDMVDAAFTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINIDGTVAAYRLPYLLAG 356
Query: 334 NSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
+SV Y + F L P +++ PI ADL ++ + W ++ E
Sbjct: 357 DSVVFKQDSGYYEHFYNELRPWEHYVPI-RADLGDLLEK-IRWARSHDEE 404
>gi|70985152|ref|XP_748082.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66845710|gb|EAL86044.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 464
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/166 (22%), Positives = 72/166 (43%), Gaps = 12/166 (7%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWNE 238
D++ +F DK ++ P+ + N+ + PD+ +W W + + P+++
Sbjct: 193 DIEFVFSVEDK--VEDVTSAEHPVWVLARAANEEAVWL-MPDFGYWAWDNPDNAIGPFDQ 249
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
I+ W+ K W+G P LR L++ + WG ++ + +W
Sbjct: 250 VVARIQRAD--IPWEAKKQQLVWRGKPSFAPKLRRALIEATRDQPWG-DVKQVSW----P 302
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
D + + C + + + EG ++S SLKY +C SV + QY
Sbjct: 303 DRTNVLSMEDHCQYMFIAHVEGRSYSASLKYRQACKSVIVAHKLQY 348
>gi|452988178|gb|EME87933.1| hypothetical protein MYCFIDRAFT_75765 [Pseudocercospora fijiensis
CIRAD86]
Length = 432
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 142/341 (41%), Gaps = 48/341 (14%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMR---HIMEAKRFAALRILIVRGKLYVDPYYDCVQSR 160
QT P ++ I + + W R T+ + + ALRILIV +L + ++++
Sbjct: 76 QTFPHLYRDIERAVRYWKDRRHTLTPQDVDISWRNDGALRILIVENELRI------LETK 129
Query: 161 AMFTIWGF-------LQLLRR--YPGMVPD-------VDIMFDCMDKPVIDKKEHGSFP- 203
F G+ L L++R + + D I+ D M K H +
Sbjct: 130 GTFDNTGYRVRIQSILHLIQRALWSATIADERLPTVEAAIVVDDMSSIPNGKDTHSVWTW 189
Query: 204 LPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKH-GSQAKS-WKEKLPFAYW 261
L R ++ IP D+ + P F D + +Q S + EK+P A+W
Sbjct: 190 TSLLRGDAGQKNWLIP--DFGMFS------APATGSFLDTRRRAAQHDSPFTEKIPKAFW 241
Query: 262 KGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGY 321
+G +R L++ + W A+ NWA + C + + I+ EG
Sbjct: 242 RGVRWTNEAVRGALLEKTKGQEW-ADAAVINWASKTN----IISADEMCKYAFLIHTEGR 296
Query: 322 AWSVSLKYILSCNSVALIISQQYKDFFSRGLIP---TKNHFPIPSADLCRSIKSVVDWGN 378
++S L+++L+C+S+ ++ ++ F L P +N+ + D ++S V++
Sbjct: 297 SYSGRLQFLLNCDSLPILHELEWNAHFYHLLKPDGDDQNYISVKR-DFS-DLQSKVEYYL 354
Query: 379 ANPSEAEK-IGKAGQDFMESLTMDRVYD-YMLHLITEYSKL 417
+P EA++ I + F + T Y+ LI EYS +
Sbjct: 355 KHPDEAQRIIANSINTFRNNYTSPAATSCYLRRLIREYSTV 395
>gi|195122642|ref|XP_002005820.1| GI18870 [Drosophila mojavensis]
gi|193910888|gb|EDW09755.1| GI18870 [Drosophila mojavensis]
Length = 270
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/228 (23%), Positives = 100/228 (43%), Gaps = 11/228 (4%)
Query: 197 KEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAK-SWKEK 255
++ S P P+F +C +D +DI P + +E ++ D+ + + SW++K
Sbjct: 18 QQRTSGPYPIFSWCGSDDTYDITLPTYDI---TESTIENMGRVMLDMLSVQKDEYSWEDK 74
Query: 256 LPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNW--AEEAKDGFKKSKLS--NQCN 311
A+++G L + + A I + EE K G K +S +
Sbjct: 75 EEKAFFRGRDSRRERLDLIDLARKYPDAINASITNFFFFRNEEHKYGPKVPHISFMDFFK 134
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIK 371
++Y++ +G + L Y+L+ S+ Y + F L+P K+ PI DL I+
Sbjct: 135 YKYQLNIDGAVAAYRLPYLLAGGSLVFKQDSHYYEHFYSKLVPYKHFVPI-KRDLSDVIE 193
Query: 372 SVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLL 418
++W AN ++I + F+ E+L +Y Y + L E+S L
Sbjct: 194 K-IEWARANDLRVKEIVANARAFVEENLLPQHIYCYHIVLFKEWSSRL 240
>gi|313238371|emb|CBY13451.1| unnamed protein product [Oikopleura dioica]
Length = 488
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/253 (20%), Positives = 104/253 (41%), Gaps = 10/253 (3%)
Query: 153 YYDCVQSRAMFTIW--GFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYC 210
Y +C F ++ L L R+ + PDV+ + D P KK + + + +
Sbjct: 189 YRECFGQHVGFNMFSDATLVALSRFVNL-PDVEFWMNLGDWPH-SKKTNENHHFQMISWG 246
Query: 211 TNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
++ D+ P + + + +++ ++ ++ +W +K+ +++G
Sbjct: 247 SHADFADLVVPTYDLMDSTLGMMHRLSKDQFSVQSSARQVNWSDKIEKGFFRGRDSRQGR 306
Query: 271 LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSK----LSNQCNHRYKIYAEGYAWSVS 326
L + M + +L A I R + +E ++ + K + N +Y+I +G +
Sbjct: 307 LDLAEMSQKNPELIDAAITRYFFFKEDEEKYGKRSEHVPFGDHFNFKYQINVDGTVAAYR 366
Query: 327 LKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEK 386
L Y+L NS L Y + F L P ++ P+ DL I+ ++W N +
Sbjct: 367 LPYLLLGNSAVLKQESAYFEHFYSSLEPFQHFIPLKR-DLSDLIQK-INWAKENDDKVSD 424
Query: 387 IGKAGQDFMESLT 399
I KA +E T
Sbjct: 425 IVKAANKVVEEET 437
>gi|169596258|ref|XP_001791553.1| hypothetical protein SNOG_00886 [Phaeosphaeria nodorum SN15]
gi|160701266|gb|EAT92381.2| hypothetical protein SNOG_00886 [Phaeosphaeria nodorum SN15]
Length = 501
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 75/367 (20%), Positives = 140/367 (38%), Gaps = 64/367 (17%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFA-ALRILIVRGKLYVDPYY-------- 154
+ C F ++D+E K R + E + L +VRG +Y Y
Sbjct: 120 EQCQSAFPGQYEDIERAVKLRDRWGKVTEEDLSSFDLTKGMVRGMIYDRELYILETYLVD 179
Query: 155 DCVQSRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTN 212
+ + +A+ ++ + +R P +P+++ +F D P K P+
Sbjct: 180 NVNRQKAIASLSAMYRSVRSAPATTEIPNIEFVFSVEDLPAQPTK-----PMWSLARRVQ 234
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
D H + PD+ FW W +L +E + + W +K W+G LR
Sbjct: 235 DHHLWL-IPDFGFWSWDMPSLGTLDEVAIEAVERESVEPWDQKQEKLVWRGKITFAPKLR 293
Query: 273 VELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLS--NQCNHRYKIYAEGY--------- 321
L+ K W +++ + W + FK+ L +QCN+ + +AEG
Sbjct: 294 RALLDAAKGKPW-SDVGQLKWTDP---NFKEQFLGPVDQCNYMFIAHAEGMTDDTELPPS 349
Query: 322 -----------------AWSVSLKYILSCNSVALIISQ-----QYKDFFSRGLIPTKNHF 359
++S SLKY C SV I+S Q+ + R +N
Sbjct: 350 RPQKQATMIKTDDPAGRSYSGSLKYRQLCRSV--IVSHKLQWIQHYHYLFRNNGSNQNFV 407
Query: 360 PIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD--YMLHLITEYS 415
+ +DL +++ ++D +P +A++I + + + Y HLI +
Sbjct: 408 EVERDFSDLVPAMEDLLD----HPEKAKRIADNSVAVFRERYLTQAAESCYWRHLIKRWK 463
Query: 416 KLLDYKP 422
+L ++P
Sbjct: 464 DVLGFEP 470
>gi|429849172|gb|ELA24585.1| duf821 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 458
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 86/217 (39%), Gaps = 38/217 (17%)
Query: 140 RILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH 199
++LI+R VD D + R + + + P +PD D PV H
Sbjct: 108 KLLIIRAPRPVD-MSDQWRERQRAALHQIHRAILTSPTRLPDTVFNLYIQDTPVSKSWSH 166
Query: 200 -----GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKE 254
S P +F P P ++FW W+ QP+ + I H + A + E
Sbjct: 167 SRPAIASSPRHIF-----------PIPHFAFWAWN----QPF---IRSITHAAAATTDIE 208
Query: 255 K-LPF------AYWKG-----NPDVLSPL-RVELMKCNDSKLWGAEILRQNWAEEAKDGF 301
LPF A W+G N +P R EL K W A++ W + +D
Sbjct: 209 AALPFHKKDQPAVWRGTAWFNNGASANPRSRQELWKIAKDARW-ADVQALEWTNQGEDAT 267
Query: 302 KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
+ + C ++Y I+ EG ++S L++ C SV L
Sbjct: 268 NALNIEDFCRYKYIIHTEGMSYSGRLQFHQLCESVIL 304
>gi|452847702|gb|EME49634.1| hypothetical protein DOTSEDRAFT_92812, partial [Dothistroma
septosporum NZE10]
Length = 367
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/211 (25%), Positives = 88/211 (41%), Gaps = 31/211 (14%)
Query: 139 LRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKE 198
LRI+ RG D + + + I L + +PD + DKP++ K
Sbjct: 69 LRIIQTRGLGRTDFRHRIIA--VLQQIQTALNAAQSAGQPLPDAEFTVIVDDKPIVGDKA 126
Query: 199 HGSFPLPLF-RYCTNDAHFDI-PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKL 256
PL F R N H +I PD+ F+G + + W+ + K K+
Sbjct: 127 R---PLWGFTRAFANPRHDNIWVIPDFHFFG-APPEAEGWS--LQQTKSREHDGPLDRKI 180
Query: 257 PFAYWKG----NPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK----DGFKKSKLSN 308
P W+G NP+V PL + + K W A+++R +W D F
Sbjct: 181 PKLAWRGVEWTNPEVRKPL----LNVTEGKPW-ADVVRMSWDNRESVIPMDSF------- 228
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
C R+ + EG AWS + ++L+C+S+ ++
Sbjct: 229 -CKFRFVVNTEGRAWSARMTHLLNCDSLLIV 258
>gi|449295662|gb|EMC91683.1| hypothetical protein BAUCODRAFT_126678 [Baudoinia compniacensis
UAMH 10762]
Length = 460
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 86/185 (46%), Gaps = 18/185 (9%)
Query: 221 PDWSFWGWSEV--NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
PD++FW + V + + ++ +IK +++K W+G LR L+
Sbjct: 223 PDFNFWDYPRVAGSFSDYQQQAIEIKQ----DRFEDKKDLLVWRGTVGFKPELRWPLIMQ 278
Query: 279 NDSKLWGAEILRQNWAEEAKDGFK-KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
+ W +++ R + D + K + + C ++Y ++ EG +WS LKY+LSC+ V
Sbjct: 279 TAGQPW-SDVHRLDTEMTTPDQLQHKISMPDHCRYKYSVHTEGTSWSGRLKYLLSCHQVV 337
Query: 338 LIISQQYKDFFSRGLIPT---KNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG-KAG 391
+I Y L P+ +N+ + + DL ++ ++ +NP +A+ I A
Sbjct: 338 IIHHLSYFTHLYHLLTPSGPGQNYVQVQNDWLDLPDKMEDLL----SNPDKAKMIADSAA 393
Query: 392 QDFME 396
DF +
Sbjct: 394 ADFRD 398
>gi|358058116|dbj|GAA96095.1| hypothetical protein E5Q_02756 [Mixia osmundae IAM 14324]
Length = 486
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 52/225 (23%), Positives = 91/225 (40%), Gaps = 23/225 (10%)
Query: 120 WAKS--RITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPG 177
W KS +T+ + + R++I G+L+V + QSR + +
Sbjct: 166 WTKSLGGVTLSDVEHSGEIWHARLMIKDGRLFVKRFRPAYQSRCQAVMAAIHDAIITSLE 225
Query: 178 MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIP---FPDWSFWGWSEVNLQ 234
VPD++ + D + G L R D+P D+ F W E ++
Sbjct: 226 PVPDIEFVIGLDDTAPFEPATWG-----LAR------RLDLPAWLVIDYGFNAWPEPMIK 274
Query: 235 PWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWA 294
W+ D+K + +W++K +W+G LS R +L ++ W A+I +W
Sbjct: 275 GWHTFLADVKSVNAQMTWRQKAKKLFWRGV--YLSEYREQLRDHTVNESW-ADIAAVDWG 331
Query: 295 EEAKDGFKKSKLSNQCNHRYKIYAEGY-AWSVSLKYILSCNSVAL 338
+ + +S+ Y EG A+S LKY+LS S +
Sbjct: 332 RPEE---TRVSMSHHSRFAYVAATEGGDAYSGRLKYLLSTQSTVV 373
>gi|119499023|ref|XP_001266269.1| hypothetical protein NFIA_039480 [Neosartorya fischeri NRRL 181]
gi|119414433|gb|EAW24372.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 464
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/166 (22%), Positives = 72/166 (43%), Gaps = 12/166 (7%)
Query: 181 DVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVN--LQPWNE 238
D++ +F DK ++ P+ + N+ + PD+ +W W + + P+++
Sbjct: 193 DIEFVFSVEDK--VEDVTSAEHPVWVLARAANEEAVWL-MPDFGYWAWDNPDNAIGPFDQ 249
Query: 239 EFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAK 298
I+ W+ K W+G P LR L++ + WG ++ + +W
Sbjct: 250 VVARIQRAD--IPWEAKKQQLVWRGKPSFAPKLRRALIEAVRDQPWG-DVKQVSW----P 302
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQY 344
D + + C + + + EG ++S SLKY +C SV + QY
Sbjct: 303 DRTNVLSMEDHCQYMFIAHVEGRSYSASLKYRQACKSVIVAHKLQY 348
>gi|443897123|dbj|GAC74465.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
Length = 495
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/236 (18%), Positives = 96/236 (40%), Gaps = 17/236 (7%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRH---IMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF 163
P +K I + ++ + + +T + + + ++ R+LI ++Y+ + +R
Sbjct: 141 PGLWKEIDRSVDYYTEHPLTAEYLDKVCDDGIWSHARVLIHNNRVYLKYFQQSPFTRVNS 200
Query: 164 TIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
+ Q + +PD + D + G F L Y + PD+
Sbjct: 201 ALALLFQSVTGAREKLPDAEFCLSANDWGSM-----GKFSLDRAPYLVDL----WLMPDY 251
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKL 283
F+ W E + + E + A W K+ +W+G V + R L+ ++
Sbjct: 252 GFYSWPEPGIGSYTEHREKTLELENATPWSAKISKLFWRGAMSVGTADRKALLAAAENHA 311
Query: 284 WGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
W ++ +W ++GF +++ C ++ + EG +S L+Y+ +C SV +
Sbjct: 312 WN-DVKPLDWGN--REGF--VSMADHCKWKFHAFPEGMTYSGRLRYLQNCRSVIVT 362
>gi|115399170|ref|XP_001215174.1| predicted protein [Aspergillus terreus NIH2624]
gi|114192057|gb|EAU33757.1| predicted protein [Aspergillus terreus NIH2624]
Length = 407
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 54/249 (21%), Positives = 109/249 (43%), Gaps = 37/249 (14%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAA--LRILIVRGKLYV---DPYYDCVQ 158
++ P F+ I + ++ W + ++A F + R +I G+LY+ +
Sbjct: 72 ESFPGLFQDISRAVQYWKPRGGISKDDLDAIPFESGMARAIISNGELYIVATKAKGKDHR 131
Query: 159 SRAMFTIWGFLQLLRRYPGMV--PDVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAH 215
++ + T+ + L P + P ++ +F D+ +D + P+ + R + ++
Sbjct: 132 TKILATLGSIYRALSSSPTRLSLPTIEFVFSIEDR--VDDIDATGHPVWVVSRKVSEESV 189
Query: 216 FDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVEL 275
F +P D+ +W W++ ++ P ++A+ + KL FA LR L
Sbjct: 190 FLMP--DFGYWSWAKSHIGP-----------AKARVERGKLSFA---------PKLRRAL 227
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
+ K WG ++ W+ K+ F + + C + + + EG A+S SLKY +C S
Sbjct: 228 LDVARGKPWG-DVKELEWSN--KENFLT--MEDHCRYMFIGHVEGRAYSASLKYRHACRS 282
Query: 336 VALIISQQY 344
V + QY
Sbjct: 283 VIVAHKLQY 291
>gi|258569104|ref|XP_002585296.1| predicted protein [Uncinocarpus reesii 1704]
gi|237906742|gb|EEP81143.1| predicted protein [Uncinocarpus reesii 1704]
Length = 483
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 67/311 (21%), Positives = 122/311 (39%), Gaps = 36/311 (11%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAKRF--AALRILIVRGKLYVDPYYDCVQSRAMFT 164
P ++ + + + W ++ + + R +I G L++ Q
Sbjct: 116 PGLYEDVDRAVRFWISRGGILKKSLNTVKLQNGMARAMIYNGNLFIIEA-RAAQEDHRRK 174
Query: 165 IWGFLQLLRRYPG-MVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDW 223
I L + R G P+++ +F DK V D G L R T + + IP D+
Sbjct: 175 ILAVLSSIHRALGNRAPNIEFIFSVEDK-VEDVSGQGHPLWVLSRKATEKSVWLIP--DF 231
Query: 224 SFWGWSE--VNLQPWNEEFKDIKHGSQAKS--WKEKLPFAYWKGNPDVLSPLRVELMKCN 279
FW W N+ P+++ K I+ + W K P W+G LR L++
Sbjct: 232 GFWAWGNPASNIGPYDQVVKRIEKFDLEDTMPWSSKTPRLVWRGKLSFAPKLRRRLLEVT 291
Query: 280 DSKLWG--AEIL--RQNWAEEAKDGFKKSKLSNQCNHRY------------KIYAEGYAW 323
+K WG EI+ R++ +D K +++ R Y G ++
Sbjct: 292 RNKPWGDVKEIVWSRKSHFISMEDHCKYMFIAHVEGMRVTSPAPEDQAMALNTYHPGRSF 351
Query: 324 SVSLKYILSCNSVAL---IISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGN 378
S S KY +C SV + + Q+ + + P +N + +DL ++ ++
Sbjct: 352 SSSFKYRQACRSVIIAHKLQHIQHHHYLLQSSGPNQNFVEVERDFSDLSDKVEELL---- 407
Query: 379 ANPSEAEKIGK 389
ANP +A++I
Sbjct: 408 ANPEKAKRIAN 418
>gi|224113737|ref|XP_002186646.1| PREDICTED: KDEL motif-containing protein 2-like [Taeniopygia
guttata]
Length = 149
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 7/115 (6%)
Query: 311 NHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCR 368
++Y++ +G + Y+L +S+ L QY + F GL P K++ P+ DL
Sbjct: 8 KYKYQVNIDGTVAAYRFPYLLLGDSLVLKQDSQYYEHFYIGLKPWKHYVPVKRNLEDLLE 67
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
IK W N EA KI K GQ L R Y Y ++ +Y++ KP
Sbjct: 68 KIK----WAKENDEEARKIAKEGQLMARELLQPHRFYCYYYKVLQKYAERQASKP 118
>gi|372281805|ref|ZP_09517841.1| hypothetical protein OS124_19274 [Oceanicola sp. S124]
Length = 327
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 50/108 (46%), Gaps = 1/108 (0%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
+ Q +RY + EG + SLK+I++ NS+AL +++ +F G + HF D
Sbjct: 198 IPEQLQYRYILAPEGNDVATSLKWIMNSNSLALSPPLEFETWFMEGRLVPGEHFIGLRPD 257
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLITE 413
+ +DW ++P E +I + + + R D + L+ +
Sbjct: 258 YA-DLDEKIDWAESHPEEVARINRNARAWCAQFKDRRTEDLVAALVLQ 304
>gi|387823701|ref|YP_005823172.1| LpsA protein [Francisella cf. novicida 3523]
gi|328675300|gb|AEB27975.1| LpsA protein [Francisella cf. novicida 3523]
Length = 321
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 23/93 (24%), Positives = 50/93 (53%), Gaps = 1/93 (1%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
+ +Q ++Y I EGY + +LK+I++ NS+ + +Y+ +F G + +H+ + D
Sbjct: 206 IQDQLKYKYIISIEGYDVATNLKWIMNSNSLCFMTKPKYETWFMEGTLIPNHHYVLLKED 265
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
++ +D+ N +P +A KI K +++
Sbjct: 266 YS-DLQEKIDYYNNHPEKALKIIKNANEYINQF 297
>gi|9437337|gb|AAF87313.1|AF168711_1 x 010 protein [Homo sapiens]
Length = 273
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 94/215 (43%), Gaps = 22/215 (10%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I + +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQITKNRLYRE--NDCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGS--RTSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGFKK 303
R + L+ + KL AE + + KD K
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGK 251
>gi|208779982|ref|ZP_03247325.1| hypothetical protein FTG_0985 [Francisella novicida FTG]
gi|208743986|gb|EDZ90287.1| hypothetical protein FTG_0985 [Francisella novicida FTG]
Length = 322
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMTKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|444731113|gb|ELW71476.1| KDEL motif-containing protein 1 [Tupaia chinensis]
Length = 618
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 115/281 (40%), Gaps = 23/281 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CPD I +DL + KRF + L + K+Y+ + + V R
Sbjct: 160 NCPDTIAQIQRDLAHFPTIDPEKIATEIPKRFGQRQSLCHYTVKDNKVYIKTHGEHVGFR 219
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 220 -IFMDAILLSLTRKV--RMPDVEFFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 275
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 276 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNATAVWRGRDS--RKERLELVKL 330
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L+
Sbjct: 331 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLA 390
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIK 371
+SV L Y + F L P K++ P+ S +DL +K
Sbjct: 391 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLDKLK 431
>gi|367030463|ref|XP_003664515.1| hypothetical protein MYCTH_2307437 [Myceliophthora thermophila ATCC
42464]
gi|347011785|gb|AEO59270.1| hypothetical protein MYCTH_2307437 [Myceliophthora thermophila ATCC
42464]
Length = 450
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 60/291 (20%), Positives = 110/291 (37%), Gaps = 39/291 (13%)
Query: 140 RILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH 199
+I I+ + D + + SR ++ + L P +PD + D+P +
Sbjct: 112 QISIIHAQRKSDLSKEMMNSRTA-SLHQLYRALLTSPSPLPDTIFTLNFQDQPFGTAWAY 170
Query: 200 GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFK---DIKHGSQAKSWKEKL 256
P FR ++A + P +SFW W + K D++ W +K+
Sbjct: 171 SRAADPQFRSKGSNARTFL-MPHFSFWAWKLPFIGSMGRAAKAISDVEAEFANGRWHQKI 229
Query: 257 PFAYWKGNP---DVLSP-LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNH 312
A W+G V +P +R L+ + W A++ W + + C +
Sbjct: 230 GKAVWRGTTWFNSVYNPRMRQNLVATARGEPW-ADVEPLEWNGSTGNASNALPVEEFCRY 288
Query: 313 RYKIYAEGYAWSVSLKYILSCNSVALI--------ISQQYKDFF--------SRGLIPTK 356
+Y ++ EG +S +++ C SV L ++ K F S+G PT+
Sbjct: 289 KYIVHTEGVTYSGRFQFLQMCASVVLTPPIQWMQHVTHLVKPLFSSDLNLKGSKGWTPTE 348
Query: 357 N---HFPI----PSADLC------RSIKSVVDWGNANPSEAEKIGKAGQDF 394
N +P+ A++ + + V W NP AE I + +D
Sbjct: 349 NVRRAWPVRYKPQEANIVFVAPDWSDLGATVAWLEENPEIAEGIARRQRDL 399
>gi|333907159|ref|YP_004480745.1| lipopolysaccharide-modifying protein [Marinomonas posidonica
IVIA-Po-181]
gi|333477165|gb|AEF53826.1| lipopolysaccharide-modifying protein [Marinomonas posidonica
IVIA-Po-181]
Length = 309
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 31/127 (24%), Positives = 67/127 (52%), Gaps = 3/127 (2%)
Query: 295 EEAKDGFKKS-KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI 353
+ K+GFK + +Q ++Y + EG + +LK+ ++ NS+ ++ +++ +F GL+
Sbjct: 183 DHTKEGFKGFLSIKDQLRYKYIVSIEGKDVATNLKWAMNSNSLVMMRKPRFETWFMEGLL 242
Query: 354 PTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLI-T 412
H+ + D +K +D+ N NP+EA+ I K + +++ D + L+ +
Sbjct: 243 KPDFHY-VKLKDDFSDLKEKIDYYNENPNEAKDIIKNAKQYVKQFLNKDNEDLISVLVAS 301
Query: 413 EYSKLLD 419
+Y L+D
Sbjct: 302 KYFDLMD 308
>gi|154293302|ref|XP_001547190.1| hypothetical protein BC1G_14446 [Botryotinia fuckeliana B05.10]
Length = 442
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 46/205 (22%), Positives = 81/205 (39%), Gaps = 16/205 (7%)
Query: 143 IVRGKLYV-----DPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK 197
I GK+++ +P + +Q R + + + P +P+ F+ +D P+
Sbjct: 98 IKDGKIFIISADPNPSQNMLQERTS-VLHQIHRAIATSPSPLPNTIFAFNILDTPM---N 153
Query: 198 EHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLP 257
+F P N ++ +P +SFW W + ++ I K W EK+
Sbjct: 154 NSWTFSRPNDPNIENGNYWVMPH--FSFWSWPVSFIGTVDQAISKIDRIEMDKHWTEKID 211
Query: 258 FAYWKG----NPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHR 313
A W+G N R L+ K W A+I W + + + C ++
Sbjct: 212 KAVWRGTGWFNTVGNKDSRPSLVLKGKDKEW-ADIEALKWTTNGESAENAIGIEDFCKYK 270
Query: 314 YKIYAEGYAWSVSLKYILSCNSVAL 338
Y +Y EG +S L + +C SV L
Sbjct: 271 YIVYTEGITYSGRLLFHQACASVIL 295
>gi|254368572|ref|ZP_04984588.1| lipopolysaccharide protein [Francisella tularensis subsp.
holarctica FSC022]
gi|157121475|gb|EDO65666.1| lipopolysaccharide protein [Francisella tularensis subsp.
holarctica FSC022]
Length = 322
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y I EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIISIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|367041029|ref|XP_003650895.1| hypothetical protein THITE_2094595 [Thielavia terrestris NRRL 8126]
gi|346998156|gb|AEO64559.1| hypothetical protein THITE_2094595 [Thielavia terrestris NRRL 8126]
Length = 468
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 54/249 (21%), Positives = 100/249 (40%), Gaps = 28/249 (11%)
Query: 140 RILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEH 199
+I I+ + D + + SR ++ + L P +PD + D+P +
Sbjct: 125 QIYIIHAQRKSDLSQEMLNSRTA-SLHQLHRALLTSPTPMPDTIFTLNFQDQPFGTAWTY 183
Query: 200 GSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDI-----KHGSQAKSWKE 254
P F DA + P +SFW W+ + + I + + A W +
Sbjct: 184 SRHADPTFGSRDPDARSFL-MPHFSFWAWNLPFVGSMSRAAAAIAQLESGYTAPAGDWHD 242
Query: 255 KLPFAYWKGNP---DVLSP-LRVELMKCNDSKLWGAEILRQNW----AEEAKDGFKKSKL 306
K+P A W+G V +P LR +L+ + W A+I W ++ +
Sbjct: 243 KIPKAVWRGTTWFNSVHNPRLRQDLLAAARGQPW-ADIQALEWRSVPGASERNATNALPI 301
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL---IISQQYKDFFSRGLIPTKNHFPIPS 363
C ++Y ++ EG ++S +++ C SV L I+ Q+ +R P+ S
Sbjct: 302 EEFCRYKYVVHTEGVSYSGRFQFLQMCASVVLTPPIMWMQHVTHLAR---------PLFS 352
Query: 364 ADLCRSIKS 372
+DL + K+
Sbjct: 353 SDLKKGGKT 361
>gi|47197241|emb|CAF87883.1| unnamed protein product [Tetraodon nigroviridis]
Length = 238
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 51/115 (44%), Gaps = 7/115 (6%)
Query: 305 KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS- 363
K + H+Y++ +G + Y+L NS+ L Y +FF L ++ P+
Sbjct: 90 KFAFAFQHKYQVNVDGTVAAYRFPYLLLGNSLVLKQDSPYYEFFYGHLEAGTHYLPVKRD 149
Query: 364 -ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSK 416
+DL IK W N AEK+ AGQ E L R+Y Y ++ Y++
Sbjct: 150 LSDLLDQIK----WAKENDGRAEKMAAAGQALARELLRPGRLYCYYYRVLRAYAE 200
>gi|385792057|ref|YP_005825033.1| hypothetical protein [Francisella cf. novicida Fx1]
gi|328676203|gb|AEB27073.1| LpsA protein [Francisella cf. novicida Fx1]
Length = 322
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|343425735|emb|CBQ69269.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 488
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 27/120 (22%), Positives = 52/120 (43%), Gaps = 5/120 (4%)
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCN 279
PD+ F+ W E + + E + W+ K+P +W+G V + R L+
Sbjct: 241 MPDYGFYSWPEPGIGSYTEHREKTLQVEHDTPWERKVPKLFWRGAMGVGTADRKALLAAA 300
Query: 280 DSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
W ++ +W + GF + + C ++ + EG +S L+Y+ +C SV +
Sbjct: 301 QDHDWN-DVKPLDWGN--RQGF--VSMEDHCKWKFHAFPEGMTYSGRLRYLQNCRSVIVT 355
>gi|56707412|ref|YP_169308.1| lipopolysaccharide protein [Francisella tularensis subsp.
tularensis SCHU S4]
gi|110669883|ref|YP_666440.1| lipopolysaccharide protein [Francisella tularensis subsp.
tularensis FSC198]
gi|134302634|ref|YP_001122603.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis WY96-3418]
gi|254370891|ref|ZP_04986896.1| lipopolysaccharide protein [Francisella tularensis subsp.
tularensis FSC033]
gi|254874251|ref|ZP_05246961.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis MA00-2987]
gi|379716605|ref|YP_005304941.1| LpsA protein [Francisella tularensis subsp. tularensis TIGB03]
gi|379725288|ref|YP_005317474.1| LpsA protein [Francisella tularensis subsp. tularensis TI0902]
gi|385794020|ref|YP_005830426.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis NE061598]
gi|421752528|ref|ZP_16189552.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis AS_713]
gi|421754395|ref|ZP_16191368.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 831]
gi|421754879|ref|ZP_16191839.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 80700075]
gi|421758126|ref|ZP_16194984.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 80700103]
gi|421759953|ref|ZP_16196777.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 70102010]
gi|424675279|ref|ZP_18112186.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 70001275]
gi|54114411|gb|AAV29839.1| NT02FT0456 [synthetic construct]
gi|56603904|emb|CAG44889.1| Lipopolysaccharide protein [Francisella tularensis subsp.
tularensis SCHU S4]
gi|110320216|emb|CAL08272.1| Lipopolysaccharide protein [Francisella tularensis subsp.
tularensis FSC198]
gi|134050411|gb|ABO47482.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis WY96-3418]
gi|151569134|gb|EDN34788.1| lipopolysaccharide protein [Francisella tularensis subsp.
tularensis FSC033]
gi|254840250|gb|EET18686.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis MA00-2987]
gi|282158555|gb|ADA77946.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis NE061598]
gi|377826737|gb|AFB79985.1| LpsA protein [Francisella tularensis subsp. tularensis TI0902]
gi|377828282|gb|AFB78361.1| LpsA protein [Francisella tularensis subsp. tularensis TIGB03]
gi|409084681|gb|EKM84848.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 831]
gi|409084956|gb|EKM85113.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis AS_713]
gi|409089752|gb|EKM89786.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 80700075]
gi|409089870|gb|EKM89902.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 70102010]
gi|409090383|gb|EKM90402.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 80700103]
gi|417434034|gb|EKT89006.1| lipopolysaccharide core biosynthesis protein [Francisella
tularensis subsp. tularensis 70001275]
Length = 322
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|118496747|ref|YP_897797.1| hypothetical protein FTN_0132 [Francisella novicida U112]
gi|194324054|ref|ZP_03057829.1| hypothetical protein FTE_0422 [Francisella novicida FTE]
gi|118422653|gb|ABK89043.1| protein of unknown function [Francisella novicida U112]
gi|194321951|gb|EDX19434.1| hypothetical protein FTE_0422 [Francisella tularensis subsp.
novicida FTE]
Length = 322
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|187932189|ref|YP_001892174.1| hypothetical protein FTM_1578 [Francisella tularensis subsp.
mediasiatica FSC147]
gi|187713098|gb|ACD31395.1| conserved hypothetical protein [Francisella tularensis subsp.
mediasiatica FSC147]
Length = 322
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|254372112|ref|ZP_04987605.1| hypothetical protein FTCG_01254 [Francisella tularensis subsp.
novicida GA99-3549]
gi|151569843|gb|EDN35497.1| hypothetical protein FTCG_01254 [Francisella novicida GA99-3549]
Length = 322
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|89255574|ref|YP_512935.1| lipopolysaccharide protein [Francisella tularensis subsp.
holarctica LVS]
gi|115314080|ref|YP_762803.1| lipopolysaccharide biosynthesis protein [Francisella tularensis
subsp. holarctica OSU18]
gi|254366976|ref|ZP_04983012.1| lipopolysaccharide protein [Francisella tularensis subsp.
holarctica 257]
gi|422938082|ref|YP_007011229.1| hypothetical protein FTS_0129 [Francisella tularensis subsp.
holarctica FSC200]
gi|89143405|emb|CAJ78578.1| Lipopolysaccharide protein [Francisella tularensis subsp.
holarctica LVS]
gi|115128979|gb|ABI82166.1| possible lipopolysaccharide biosynthesis protein [Francisella
tularensis subsp. holarctica OSU18]
gi|134252802|gb|EBA51896.1| lipopolysaccharide protein [Francisella tularensis subsp.
holarctica 257]
gi|407293233|gb|AFT92139.1| hypothetical protein FTS_0129 [Francisella tularensis subsp.
holarctica FSC200]
Length = 322
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|156501517|ref|YP_001427582.1| hypothetical protein FTA_0149 [Francisella tularensis subsp.
holarctica FTNF002-00]
gi|290954407|ref|ZP_06559028.1| hypothetical protein FtulhU_09434 [Francisella tularensis subsp.
holarctica URFT1]
gi|423049878|ref|YP_007008312.1| hypothetical protein F92_00765 [Francisella tularensis subsp.
holarctica F92]
gi|156252120|gb|ABU60626.1| hypothetical protein FTA_0149 [Francisella tularensis subsp.
holarctica FTNF002-00]
gi|421950600|gb|AFX69849.1| hypothetical protein F92_00765 [Francisella tularensis subsp.
holarctica F92]
Length = 322
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPSA 364
+ +Q ++Y + EGY + +LK+I++ NS+ + +Y+ +F G LIP NH +
Sbjct: 206 IQDQLKYKYIVSIEGYDVATNLKWIMNSNSLCFMNKPKYETWFMEGTLIP--NHHYVLLK 263
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
D ++ +D+ N +P +A KI K +++
Sbjct: 264 DDYSDLQEKIDYYNNHPEKALKIIKNANEYVNQF 297
>gi|453083425|gb|EMF11471.1| hypothetical protein SEPMUDRAFT_164966 [Mycosphaerella populorum
SO2202]
Length = 459
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 43/197 (21%), Positives = 86/197 (43%), Gaps = 14/197 (7%)
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA--------EIL 289
EE + + S +K+P W+G V +R +L+ + K W +
Sbjct: 227 EESRRLAREFDMPSIADKIPKVVWRGTEWVNPEIRDKLVNVSRGKSWADVKYSNFTPTTI 286
Query: 290 RQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFS 349
N EE + +S+ C + I+ EG+++S L ++L+CNS+ LI + Y +
Sbjct: 287 NNNKDEEEQILNNHLPISHLCTYALTIHTEGFSYSGRLSHLLNCNSLPLIHNLTYTTHYY 346
Query: 350 RGLIPT---KNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQD-FMESLTMDRVYD 405
L P+ +N+ + + ++ V + ++P EA+ I + + F + + D
Sbjct: 347 HLLQPSGPQQNYISVRND--FSDLEDTVQYFLSHPEEADVIVRNSVNTFRDKYLTPQAGD 404
Query: 406 YMLHLITEYSKLLDYKP 422
L L+ K + ++P
Sbjct: 405 CYLRLLVRGYKDVAFEP 421
>gi|397610919|gb|EJK61081.1| hypothetical protein THAOC_18482 [Thalassiosira oceanica]
Length = 639
Score = 45.4 bits (106), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 54/244 (22%), Positives = 98/244 (40%), Gaps = 31/244 (12%)
Query: 218 IPFPDWSFWGW----SEVNLQPWNEEFK--DIKHGSQAKSWKEKLPFAYWKG--NPDVLS 269
+P P + W W + + W++ F D K+ W+ K+ A W+G DV
Sbjct: 385 VPIPTFEIWSWIKRANHGDPSGWDKAFDLADAKY-----PWEGKMRKAVWRGASTADVAQ 439
Query: 270 ----PL------RVELMKCNDSKLWGAEILR-----QNWAEEAKDGFKKSKLSNQCNHRY 314
PL RV M + L A ++ W + ++ +Q N+
Sbjct: 440 YGGKPLGETPRGRVVQMGMDRPDLIDAAFVKITPQYSGWENTTRLVDERMPFDDQMNYTA 499
Query: 315 KIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVV 374
I +G WS +L NSV + + Y ++F R L P +++ P +L + ++ V+
Sbjct: 500 IIDVDGNNWSSRFPKLLCLNSVTVKVEPSYIEYFHRDLTPGRHYVPASFDNLTQVVEYVI 559
Query: 375 DWGNANPSEAEKIGKAGQDFME-SLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACV 433
N SE + + + + ++ ++ V + I EY L SAF+ V
Sbjct: 560 S--PENDSEMKLVVEEANGWCRGAMGVETVTRSAMEKIGEYFSDLADVMEDDDSAFDEHV 617
Query: 434 ESLL 437
+ L+
Sbjct: 618 DDLV 621
>gi|339256434|ref|XP_003370406.1| KDEL motif-containing protein 1 [Trichinella spiralis]
gi|316964985|gb|EFV49851.1| KDEL motif-containing protein 1 [Trichinella spiralis]
Length = 138
Score = 45.4 bits (106), Expect = 0.053, Method: Composition-based stats.
Identities = 29/116 (25%), Positives = 56/116 (48%), Gaps = 3/116 (2%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
L C ++Y G A S +++ C S+ + ++K+FF + P ++ P+ D
Sbjct: 13 LDYHCRYKYLFNFRGVAASFRFRHLFLCRSLVFHVGDEWKEFFYYQMKPWIHYIPV-KED 71
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
L ++ ++++ N A+ I + G F + LTMD V Y L+ +++ L Y
Sbjct: 72 L-NDVEELLEFVKENDDVAKDIAERGYQFILNHLTMDNVTAYWESLLKQFTDRLVY 126
>gi|224053931|ref|XP_002298044.1| predicted protein [Populus trichocarpa]
gi|222845302|gb|EEE82849.1| predicted protein [Populus trichocarpa]
Length = 104
Score = 45.4 bits (106), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 384 AEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEACVESLLCLAD 441
A+ IGK F+ E L Y+YM L EY+KLL +KP P A C E L C A+
Sbjct: 2 AQAIGKIASAFIQEELKTGSAYEYMFLLPNEYAKLLKFKPQVPDGAAGMCSEILACSAE 60
>gi|169851826|ref|XP_001832602.1| Cap3p [Coprinopsis cinerea okayama7#130]
gi|116506456|gb|EAU89351.1| Cap3p [Coprinopsis cinerea okayama7#130]
Length = 720
Score = 45.1 bits (105), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 65/122 (53%), Gaps = 11/122 (9%)
Query: 305 KLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIP-- 362
+ Q ++Y I +G AWS K +++ NS+ + S Y+++F+ + P ++ PI
Sbjct: 548 NIPTQGKYKYIIDVDGNAWSSRFKRLITSNSL-IFKSTIYQEWFADRIEPWLHYVPIQID 606
Query: 363 SADLCRSI---KSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD---YMLHLITEYSK 416
+DL ++ + A+P A+KI +AG+++ SLT R D YM L EY++
Sbjct: 607 YSDLLDALYFFRGDPGGRGAHPELAKKIAEAGREW--SLTHWRRADLTAYMFRLFLEYTR 664
Query: 417 LL 418
++
Sbjct: 665 IM 666
>gi|397568477|gb|EJK46156.1| hypothetical protein THAOC_35189 [Thalassiosira oceanica]
Length = 677
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 30/133 (22%), Positives = 60/133 (45%), Gaps = 3/133 (2%)
Query: 304 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
+++ Q +H + +G WS +L NSV + I + ++F LIP +++ P
Sbjct: 320 TRIIEQLHHDTDSWNQGNGWSSRFPKLLCMNSVTIKIEPDFIEYFHHDLIPGRHYIPASL 379
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYKP 422
+L + + VV AN E + + + + +++ ++ V + I+EY L
Sbjct: 380 GNLTQVVDYVV--SPANDDEMKNVVREANAWCQQAMVVESVARSAMEQISEYYDELTATI 437
Query: 423 APPSSAFEACVES 435
S A E V++
Sbjct: 438 DESSIAIEDVVQN 450
>gi|337269590|ref|YP_004613645.1| lipopolysaccharide-modifying protein [Mesorhizobium opportunistum
WSM2075]
gi|336029900|gb|AEH89551.1| lipopolysaccharide-modifying protein [Mesorhizobium opportunistum
WSM2075]
Length = 323
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 39/188 (20%), Positives = 82/188 (43%), Gaps = 9/188 (4%)
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
D + ++ PD+ F GW + ++E+ + + S+ +L +W G + R
Sbjct: 102 DGYGEVAAPDFVFDGWPDAKFDDYDEKTRAMAVASEEPPRDNRL---FWAGRCMNEARKR 158
Query: 273 VELMKCNDSKLWGAEILRQNWAEEAKDGFK---KSKLSNQCNHRYKIYAEGYAWSVSLKY 329
V + L A N+ + A + + ++ +RY I EG +S LK
Sbjct: 159 VVDLAKARPDLLEAYDTEPNY-DVAVNRYSTRFRTMEEQVATYRYMIDIEGAGYSGRLKM 217
Query: 330 ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGK 389
+L V L+ + ++++F + P +++ P+ + D+ + ++W ANP +I
Sbjct: 218 LLHTKRVVLLQDRPWREWFFDDIEPFRHYVPV-ARDMS-DLAERIEWLRANPKREAEIAM 275
Query: 390 AGQDFMES 397
Q F ++
Sbjct: 276 EAQHFAQT 283
>gi|350286821|gb|EGZ68068.1| hypothetical protein NEUTE2DRAFT_160511 [Neurospora tetrasperma
FGSC 2509]
Length = 945
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 104/268 (38%), Gaps = 36/268 (13%)
Query: 103 VQTCPDFFKSIHKDLE-PWAKSRITMRHIMEAKRFAALRILIVRGKLYV-------DPYY 154
+TC F + K+++ AK ++ + +++ I GK+YV D
Sbjct: 217 AETCRTTFPGLMKEIDDTVAKGPFKVK---RSSDLGPMQVRIRDGKMYVLHAQRKRDLSR 273
Query: 155 DCVQSRAMFTIWGFLQLLRRYPG---MVPDVDIMF--DCMDKPVIDKKEHGSFPLPLFRY 209
+ V SR LL P ++ D+D + + +D P ++ P
Sbjct: 274 EMVNSRTAALHQVHRALLTLPPSDRSLISDLDTILTINILDTPFGTALQYTRNADPAHAP 333
Query: 210 CTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGS----QAKSW-KEKLPFAYWKGN 264
DA + P +SFW W + + I + Q W K P A W+G
Sbjct: 334 SDPDARTFL-IPHFSFWAWDLPFIGSISRAASAITNLEITQFQGNRWHSHKDPRAVWRGT 392
Query: 265 P---DVLSP-LRVELMKCNDSKLWGAEILRQNWAE---------EAKDGFKKSKLSNQCN 311
+ +P LR +L+ K W A++ WA E+K+ + + C
Sbjct: 393 TWFNSIHNPQLRYKLVSTTKGKPW-ADVQSLEWATATTTTMGNGESKNATNSLAIEDFCK 451
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALI 339
++Y I+ EG ++S +++ C SV L
Sbjct: 452 YKYVIHTEGISYSGRFQFLQMCTSVTLT 479
>gi|159477427|ref|XP_001696812.1| hypothetical protein CHLREDRAFT_150044 [Chlamydomonas reinhardtii]
gi|158275141|gb|EDP00920.1| predicted protein [Chlamydomonas reinhardtii]
Length = 354
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 54/117 (46%), Gaps = 2/117 (1%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
L Q ++Y I +G+A S + S+ L F+ + P +++ PI
Sbjct: 166 LHEQNKYKYLISTDGWAVSSKFDKYMLLGSLLLKAEGLTYAFYYPAIKPFEHYVPIMKKH 225
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYK 421
I +++W ++ +EA++I + Q F M L YM HLI+E SK + Y+
Sbjct: 226 K-DDILDMLEWAKSHDAEAQRIAQNAQSFAMRHLNRQSRLCYMFHLISELSKQMRYQ 281
>gi|387813097|ref|YP_005428578.1| Protein lpsA [Marinobacter hydrocarbonoclasticus ATCC 49840]
gi|381338108|emb|CCG94155.1| Putative Protein lpsA [Marinobacter hydrocarbonoclasticus ATCC
49840]
Length = 324
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 70/140 (50%), Gaps = 8/140 (5%)
Query: 251 SWKEKLPFAYWKGNPDVLSPLRVELM-KCNDSKLWGAEILRQNWAEEAKDGFKKSKLS-- 307
++++K P A W+G + P R+ K DS+L +R E A + K ++
Sbjct: 155 TFEDKHPTAVWRGKSN--HPERIAFARKWADSQLCDIGCVRHK--EPALQAYHKPFMTIP 210
Query: 308 NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLC 367
Q H++ + EG + +LK+I++ NS+ L+ +++ +F G + H+ + D
Sbjct: 211 EQLEHQFIVSVEGIDVATNLKWIMASNSLCLMRQPRFETWFMEGTLQPDIHY-VQLKDDF 269
Query: 368 RSIKSVVDWGNANPSEAEKI 387
++ V + +PSEA++I
Sbjct: 270 SDLEQKVTYYREHPSEAKRI 289
>gi|21758173|dbj|BAC05260.1| unnamed protein product [Homo sapiens]
Length = 149
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 7/115 (6%)
Query: 311 NHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCR 368
++Y++ +G + Y++ +S+ L Y + F L P K++ PI +DL
Sbjct: 8 KYKYQVNVDGTVAAYRYPYLMLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLE 67
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
+K W N EA+KI K GQ L R+Y Y ++ +Y++ KP
Sbjct: 68 KVK----WAKENDEEAKKIAKEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 118
>gi|336466316|gb|EGO54481.1| hypothetical protein NEUTE1DRAFT_124720 [Neurospora tetrasperma
FGSC 2508]
Length = 842
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 59/267 (22%), Positives = 104/267 (38%), Gaps = 36/267 (13%)
Query: 103 VQTCPDFFKSIHKDLE-PWAKSRITMRHIMEAKRFAALRILIVRGKLYV-------DPYY 154
+TC F + K+++ AK ++ + +++ I GK+YV D
Sbjct: 118 AETCRTTFPGLMKEIDDTVAKGPFKVK---RSSDLGPMQVRIRDGKMYVLHAQRKRDLSR 174
Query: 155 DCVQSRAMFTIWGFLQLLRRYPG---MVPDVDIMF--DCMDKPVIDKKEHGSFPLPLFRY 209
+ V SR LL P ++ D+D + + +D P ++ P
Sbjct: 175 EMVNSRTAALHQVHRALLTLPPSDRSLISDLDTILTINILDTPFGTALQYTRNADPAHAP 234
Query: 210 CTNDAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGS----QAKSW-KEKLPFAYWKGN 264
DA + P +SFW W + + I + Q W K P A W+G
Sbjct: 235 SDPDARTFL-IPHFSFWAWDLPFIGSISRAASAITNLEITQFQGNRWHSHKDPRAVWRGT 293
Query: 265 P---DVLSP-LRVELMKCNDSKLWGAEILRQNWAE---------EAKDGFKKSKLSNQCN 311
+ +P LR +L+ K W A++ WA E+K+ + + C
Sbjct: 294 TWFNSIHNPQLRYKLVSTTKGKPW-ADVQSLEWATATTTTMGNGESKNATNSLAIEDFCK 352
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVAL 338
++Y I+ EG ++S +++ C SV L
Sbjct: 353 YKYVIHTEGISYSGRFQFLQMCTSVTL 379
>gi|310793613|gb|EFQ29074.1| hypothetical protein GLRG_04218 [Glomerella graminicola M1.001]
Length = 461
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 44/181 (24%), Positives = 71/181 (39%), Gaps = 19/181 (10%)
Query: 169 LQLLRRY----PGMVPDVDIMFDCMDKPVIDKKEHGSFPL-PLFRYCTNDAHFDIPFPDW 223
LQ L R P ++PD D P H + P R+ HF
Sbjct: 135 LQQLNRALLTSPTLLPDTFFNLHVQDTPATLSWSHSRPAMSPSPRHIFTMPHF------- 187
Query: 224 SFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKG-----NPDVLSPL-RVELMK 277
SFW W++ ++ I + S+ K A W+G N +P R EL++
Sbjct: 188 SFWAWNQPFIRSIPHAAAAITDIEASLSFDMKDRRAVWRGTAWFNNGASANPRSRQELLR 247
Query: 278 CNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVA 337
W A++ W + ++ + + C H+Y I+ EG ++S L++ C SV
Sbjct: 248 ITKDASW-ADVQALEWVDSGENATNALMIEDFCRHKYIIHTEGVSYSGRLQFHQLCESVL 306
Query: 338 L 338
L
Sbjct: 307 L 307
>gi|302835016|ref|XP_002949070.1| hypothetical protein VOLCADRAFT_89377 [Volvox carteri f.
nagariensis]
gi|300265815|gb|EFJ50005.1| hypothetical protein VOLCADRAFT_89377 [Volvox carteri f.
nagariensis]
Length = 414
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 58/141 (41%), Gaps = 22/141 (15%)
Query: 104 QTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRG--KLYVDPYYDCV---- 157
+T + + I DL W S R EA A R RG K +YD V
Sbjct: 71 ETLGELYDGIGADLRLWNNSEGISR---EALNLAISRY-TTRGQQKGMALAFYDGVPYVV 126
Query: 158 ----------QSRAMFT-IWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPL-P 205
+FT + L L R+Y +PD++ + D+P++ P+ P
Sbjct: 127 DEPKLTGLGHHVNILFTYMLVMLDLARQYGNQIPDIEFVIASSDRPLVLTAAQQPGPIPP 186
Query: 206 LFRYCTNDAHFDIPFPDWSFW 226
+ R+C++D H +I P + F+
Sbjct: 187 VMRFCSSDEHAEIKIPIFHFY 207
>gi|159473835|ref|XP_001695039.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276418|gb|EDP02191.1| predicted protein [Chlamydomonas reinhardtii]
Length = 285
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 77/193 (39%), Gaps = 22/193 (11%)
Query: 54 TPWHLFPQRTFKEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSI 113
T W L R R S V + L + E + A K + +++I
Sbjct: 27 TQWTLHRSRDHYGGIRHSGPLSRVATVGNSFLRGLRSSKEGKEAARYCKTGDFTELYETI 86
Query: 114 HKDLEPWAKSRIT---MRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWG--- 167
+DL PW ++ T M ++ LR V G +VD + A+F G
Sbjct: 87 EQDLRPWKETGFTVSLMDWVLSGFTKYPLRSKGV-GVAFVDGTPHLITPPALFNTVGHHK 145
Query: 168 ---------FLQLLRRYPGMVPDVDIMFDCMDKP--VIDKKEHGSFP--LP-LFRYCTND 213
LQL + + +PDV+ + D+P ++ +GS P LP + R+C +D
Sbjct: 146 RLITGYLELMLQLSKTFGDQIPDVEFIVTTGDEPSTLLHHYANGSDPERLPAVLRFCKSD 205
Query: 214 -AHFDIPFPDWSF 225
+H DI PD F
Sbjct: 206 RSHADILVPDVHF 218
>gi|354603897|ref|ZP_09021890.1| hypothetical protein HMPREF9450_00805 [Alistipes indistinctus YIT
12060]
gi|353348329|gb|EHB92601.1| hypothetical protein HMPREF9450_00805 [Alistipes indistinctus YIT
12060]
Length = 323
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 7/99 (7%)
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPI 361
K L +++ + EG S SLK+++S NSVA++ Y+ +F G LIP ++ I
Sbjct: 204 KITLFEHLVYKFILTLEGIDVSTSLKWVMSTNSVAVMPRPTYETWFMEGTLIPNYHYIEI 263
Query: 362 PS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
S +DL + ++ +D +P EAE I + +++
Sbjct: 264 KSDYSDLPQRLQYYID----HPEEAEAIARHAHEYISQF 298
>gi|349605470|gb|AEQ00696.1| KDEL motif-containing protein 2-like protein, partial [Equus
caballus]
Length = 177
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 54/115 (46%), Gaps = 7/115 (6%)
Query: 311 NHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCR 368
++Y++ +G + Y++ +S+ L Y + F L P K++ PI +DL
Sbjct: 36 KYKYQVNVDGTVAAYRYPYLMLGDSLVLKQDSPYYEHFYLALKPWKHYVPIKRNLSDLLE 95
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQ-DFMESLTMDRVYDYMLHLITEYSKLLDYKP 422
+K W N EA+KI K GQ E L R+Y Y ++ +Y++ KP
Sbjct: 96 KVK----WAKENDGEAKKIAKEGQLTARELLQPHRLYCYYYRVLQKYAERQSSKP 146
>gi|26348271|dbj|BAC37775.1| unnamed protein product [Mus musculus]
Length = 254
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 41/174 (23%), Positives = 81/174 (46%), Gaps = 18/174 (10%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLFRED--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++ R +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCSGVEHFILEVIHR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGN 264
I +P W+FW G V L W+ +D+ + W++K AY++G+
Sbjct: 159 IMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGS 212
>gi|299749731|ref|XP_001836296.2| Cap3p [Coprinopsis cinerea okayama7#130]
gi|298408574|gb|EAU85480.2| Cap3p [Coprinopsis cinerea okayama7#130]
Length = 685
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 64/124 (51%), Gaps = 11/124 (8%)
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIP 362
K Q ++Y I +G AWS K +++ NS+ + S Y+++F+ + P ++ PI
Sbjct: 515 KHDFKTQGKYKYIIDVDGNAWSSRFKRLITSNSL-IFKSTIYEEWFADRVEPWLHYVPIQ 573
Query: 363 S--ADLCRSI---KSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD---YMLHLITEY 414
+DL ++ + +P+ A+KI +AG+ + SLT R D YM L EY
Sbjct: 574 VDYSDLLDTLYFFQGDPSGFGGHPALAKKIAEAGRQW--SLTHWRKVDLTAYMFRLFLEY 631
Query: 415 SKLL 418
++++
Sbjct: 632 ARVM 635
>gi|13476922|ref|NP_108491.1| lipopolysaccharide core biosynthesis protein lpsA [Mesorhizobium
loti MAFF303099]
gi|14027683|dbj|BAB53952.1| lipopolysaccharide core biosynthesis protein; LpsA [Mesorhizobium
loti MAFF303099]
Length = 323
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 35/166 (21%), Positives = 72/166 (43%), Gaps = 6/166 (3%)
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC-NDSKLWGAEILRQNWAEE 296
++F+ + + + KLP W+G D+ +P+R ++ D A + N E
Sbjct: 131 DKFRHFQMPADTIPFANKLPLVVWRG--DLNNPIRTRFLQAVRDLPFCDAGSHKPNAPAE 188
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
F +S +RY + EG + +LK+IL+ S+ L+ Y+ +F+ +
Sbjct: 189 YAKPF--LSISQHQRYRYIVSLEGNDVATNLKWILNSKSLCLMPPPTYETWFAEQQLEAN 246
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDR 402
H+ +P + V + +P++AE+I + + +R
Sbjct: 247 VHY-VPLDPDFADLGEKVRYFERHPAQAERIIAGANAYCRKFSNER 291
>gi|412992055|emb|CCO20781.1| unknown protein [Bathycoccus prasinos]
Length = 386
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 65/157 (41%), Gaps = 12/157 (7%)
Query: 304 SKLSNQCNHRYKI-YAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHF--- 359
S + N ++Y I +A WS + +L + L + F+ L P ++
Sbjct: 228 SFIHNLSRYKYVIDFAGAGNWSRRMSLLLRSGGLILQSEKSGYQFYEYSLEPGLHYITFD 287
Query: 360 PIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLH-LITEYSKLL 418
P + ++ S ++W N AE I + Q F ++ + DY + LIT+YSK L
Sbjct: 288 PQIGKEGLGNLLSRLEWAQKNDEIAELIARRSQSFGQNCLTEASIDYFVSTLITKYSKYL 347
Query: 419 DYKPAP-PSSAFEACVESLLCLADPKQRQNLEKAAAS 454
P P P +CV +D K R L K +
Sbjct: 348 QGNPIPFPIVDLSSCV------SDRKGRSKLNKICGA 378
>gi|222824171|ref|YP_002575745.1| lipopolysaccharide core biosynthesis protein [Campylobacter lari
RM2100]
gi|222539393|gb|ACM64494.1| conserved hypothetical protein, putative lipopolysaccharide core
biosynthesis protein [Campylobacter lari RM2100]
Length = 325
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 42/194 (21%), Positives = 93/194 (47%), Gaps = 20/194 (10%)
Query: 237 NEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELM-------KCNDSKLWGAEIL 289
N F I+ +Q +++K +++G + P R++ +C+ + + G +I
Sbjct: 131 NRHFDFIQDKNQ---FEDKKDLLFFRG--AIYQPHRIKFFEKYFDNPRCDIAHVGGRKIQ 185
Query: 290 RQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFS 349
+ W + FK S+ + Q ++ + EG + +LK+ + NS+ L +Y+ +F
Sbjct: 186 AEKWIKNL--NFKISR-AYQMQFKFLLSLEGNDVASNLKWAMKTNSLVLAPKMRYETWFM 242
Query: 350 RGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLH 409
G + HF + + D ++++++D+ ANP A++I + ++E +++ Y+
Sbjct: 243 EGKLVPNEHFALINDD-YENVEALLDYYLANPHRAKEIIQNIHAYIEQFLDEKIEFYIGI 301
Query: 410 LITE----YSKLLD 419
L+ YS LD
Sbjct: 302 LVLAKYFYYSNQLD 315
>gi|452983310|gb|EME83068.1| hypothetical protein MYCFIDRAFT_137488 [Pseudocercospora fijiensis
CIRAD86]
Length = 407
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 60/305 (19%), Positives = 117/305 (38%), Gaps = 71/305 (23%)
Query: 97 VASPQKVQTCPDFFKSIHKDLEPWAKSR---------ITMRHIMEAKRFAALRILIVRGK 147
+ Q PD ++ I + + W + IT R+ A+R+L+ +
Sbjct: 45 LTEAQCTSAFPDLYREIERAVAYWRNRQHTIAAEDVDITWRN-QSNNGGGAMRVLVHNQE 103
Query: 148 LYVDPYYDCV-----QSRAMFTIWGFLQ--LLRRYPG--MVPDVD--IMFDCMDKPVIDK 196
L + D + + RA F I G LQ + G ++P ++ ++F+ + P D+
Sbjct: 104 LRILESMDTISHVGYRGRA-FGILGLLQRAVESAIAGGEILPTIEAALLFEDISNPPTDE 162
Query: 197 KEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEV-------------NLQPWNEEFKDI 243
H SFW WS + N W +
Sbjct: 163 GTH------------------------SFWTWSSLKDHAPHERLWLIPNFDFWYSSPQGS 198
Query: 244 KHGSQAKSWKEKLPFAY------WKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEA 297
++ + + PFA+ W+G LR L+ K W A++ +W+ +
Sbjct: 199 YEAARMHAMRNDAPFAHKIPKVVWRGTEWTNKELRDGLVNIGADKDW-ADLKFIDWS--S 255
Query: 298 KDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLI---P 354
+ K + + C + ++ EG ++S LKY+L+C+S+ ++ ++ + LI P
Sbjct: 256 SEAGNKIPVEDLCKYALTVHTEGVSYSGRLKYLLNCDSLPIVHDLKWNAHYYHLLIKDGP 315
Query: 355 TKNHF 359
+N+
Sbjct: 316 MQNYI 320
>gi|380487674|emb|CCF37887.1| hypothetical protein CH063_09117 [Colletotrichum higginsianum]
Length = 486
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 55/125 (44%), Gaps = 7/125 (5%)
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKG-----NPDVLSPL-RV 273
P +SFW W++ ++ I + + + K P A W+G N +P R
Sbjct: 210 IPHFSFWAWNQPFIRSIPHAAAAIANIEASLPFDLKNPRAVWRGTAWFNNGASANPRSRQ 269
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSC 333
EL++ W A++ +W + + + C H+Y I+ EG ++S L++ C
Sbjct: 270 ELLRVTKDAEW-ADVQALDWVNSGGNATNALMIEHFCQHKYIIHTEGVSYSGRLQFHQLC 328
Query: 334 NSVAL 338
SV L
Sbjct: 329 ESVLL 333
>gi|171677945|ref|XP_001903923.1| hypothetical protein [Podospora anserina S mat+]
gi|170937041|emb|CAP61700.1| unnamed protein product [Podospora anserina S mat+]
Length = 481
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 29/123 (23%), Positives = 52/123 (42%), Gaps = 4/123 (3%)
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNP---DVLSP-LRVEL 275
P +SFW W ++ I + K+P A W+G V SP LR L
Sbjct: 225 IPHFSFWSWPLPFIRSLPHAASLITSLESTLPFPSKIPKAVWRGTTWFNSVRSPHLRQNL 284
Query: 276 MKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNS 335
++ ++ + W + ++ + + C ++Y I+ EG A+S +++ C S
Sbjct: 285 LQTTRPHPEIFDVQKLEWTGKNRNATNALPIQDFCRYKYVIHTEGIAYSGRFQFLQMCQS 344
Query: 336 VAL 338
V L
Sbjct: 345 VVL 347
>gi|390603958|gb|EIN13349.1| hypothetical protein PUNSTDRAFT_94372 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 580
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 27/119 (22%), Positives = 59/119 (49%), Gaps = 7/119 (5%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS-- 363
L ++Y + +G WS K +++ N++ L S Y ++F+ ++P ++ P+ +
Sbjct: 450 LKEASRYKYVMDVDGNGWSSRFKRLITSNAMVL-KSSVYPEWFTERIVPWVHYVPVQNDY 508
Query: 364 ---ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL-TMDRVYDYMLHLITEYSKLL 418
D+ + V+ + A KI AG+ + +++ + V YM L+ EY++L+
Sbjct: 509 SDLLDIMAFFRGGVNGDAGHDELARKIADAGKVWSQTMWRREDVTAYMFRLMLEYARLM 567
>gi|390597549|gb|EIN06948.1| hypothetical protein PUNSTDRAFT_104368 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 533
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 27/119 (22%), Positives = 59/119 (49%), Gaps = 7/119 (5%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS-- 363
L ++Y + +G WS K +++ N++ L S Y ++F+ ++P ++ P+ +
Sbjct: 403 LKEASRYKYVMDVDGNGWSSRFKRLITSNAMVL-KSSVYPEWFTERILPWVHYVPVQNDY 461
Query: 364 ---ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL-TMDRVYDYMLHLITEYSKLL 418
D+ + V+ + A KI AG+ + +++ + V YM L+ EY++L+
Sbjct: 462 SGLLDIMAFFRGGVNGDAGHDELARKIADAGKVWSQTMWRREDVTAYMFRLMLEYARLM 520
>gi|418940798|ref|ZP_13494151.1| lipopolysaccharide-modifying protein [Rhizobium sp. PDO1-076]
gi|375052511|gb|EHS48925.1| lipopolysaccharide-modifying protein [Rhizobium sp. PDO1-076]
Length = 352
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 29/205 (14%)
Query: 237 NEEFK-DIKHGSQAKSWKEKLPFAYWKG----------------NPDVLSPLR----VEL 275
N+ F D + G A +W ++ W+G NP V+ LR ++
Sbjct: 131 NDGFSADRERGRAAPAWDDRSGDIVWRGGMNGCGWGSFWPEDTDNPAVVQRLRMVRRLKD 190
Query: 276 MKCNDSKLWGAEILRQNWAEEA-KDGFKKSKLSNQC--NHRYKIYAEGYA--WSVSLKYI 330
+ D +L G +A EA + G +++ ++ I +GY+ WS L +
Sbjct: 191 LAGTDVRLVGVRWSETEFAREAERQGLMAEPMASASWLGRKFAIDIDGYSNTWSNLLVRM 250
Query: 331 LSCNSVALIISQ-QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGK 389
L V + SQ + ++ L P +++ P+ AD+ +DW ++ SEA+ I +
Sbjct: 251 LYGCCVLKVTSQFGFSQWYYGDLTPWEHYVPV-RADMA-DFAEKIDWVRSHDSEAKAIAE 308
Query: 390 AGQDFMESLTMDRVYDYMLHLITEY 414
G+ ++LT + LI E+
Sbjct: 309 RGRALAQTLTFESQAQRATKLIEEH 333
>gi|337269405|ref|YP_004613460.1| lipopolysaccharide-modifying protein [Mesorhizobium opportunistum
WSM2075]
gi|336029715|gb|AEH89366.1| lipopolysaccharide-modifying protein [Mesorhizobium opportunistum
WSM2075]
Length = 323
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 35/162 (21%), Positives = 72/162 (44%), Gaps = 6/162 (3%)
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC-NDSKLWGAEILRQNWAEE 296
++F+ + + + +KLP W+G D+ +P+R ++ D A + N E
Sbjct: 131 DKFRHFQMPADTIPFADKLPTVVWRG--DLNNPIRTRFLEAARDLPFCDAGSHKANAPAE 188
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
F + +RY + EG + +LK+IL+ S+ L+ Y+ +F+ +
Sbjct: 189 YAKPF--LTIRQHQRYRYIVSLEGNDVATNLKWILNSKSLCLMPPPTYETWFAERQLEAN 246
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
H+ +P A + V + +P++AE+I A + +
Sbjct: 247 VHY-VPLAADFSDLAEHVAFFERHPAKAERIVAAANAYCRTF 287
>gi|29150125|emb|CAD79685.1| hypothetical protein [Neurospora crassa]
Length = 906
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 94/237 (39%), Gaps = 30/237 (12%)
Query: 130 IMEAKRFAALRILIVRGKLYV-------DPYYDCVQSRAMFTIWGFLQLLRRYPG---MV 179
+ + +++ I GK+YV D + V SR LL P +V
Sbjct: 144 VKRSSDLGPMQVRIKDGKMYVLHAQRKRDLSREMVNSRTAALHQLHRALLTLPPSDRSLV 203
Query: 180 PDVDIMF--DCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWN 237
D+D + + +D P ++ P+ DA + P +SFW W + +
Sbjct: 204 SDLDTILTINILDTPFGTALQYTRNADPVHAPSDPDARTFL-IPHFSFWAWDLPFIGSIS 262
Query: 238 EEFKDIKHGS----QAKSW-KEKLPFAYWKGNP---DVLSP-LRVELMKCNDSKLWGAEI 288
I + Q W K P A W+G + +P LR +L+ +K W A++
Sbjct: 263 RAASAITNLETTQFQGNRWHSHKDPRAVWRGTTWFNSIHNPQLRYKLVSTAKAKPW-ADV 321
Query: 289 LRQNWAEEAKDGFKKSK-------LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
W + G ++K + + C ++Y ++ EG ++S +++ C SV L
Sbjct: 322 QSLEWTTTSTTGNGENKNATNSLAIEDFCKYKYVLHTEGISYSGRFQFLQMCASVTL 378
>gi|407005721|gb|EKE21773.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [uncultured bacterium]
Length = 426
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 80/343 (23%), Positives = 130/343 (37%), Gaps = 57/343 (16%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAA----------LRILIVRGKLYV-DPYYDCVQSRA 161
I +DL P+ +++IT + ++E A +R I+ KLY+ P D R
Sbjct: 62 ILEDLNPFYENKITQKALLETFNTIASRREKDKTNFIRFRIINNKLYMYIPEKDFFPKRQ 121
Query: 162 MFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSF---------PLPLFRYCTN 212
FT L+ L + M P+VDI++ D + + F PL L R
Sbjct: 122 -FTFEKALRTLCKMIKM-PNVDIIYSDEDGTPLFFNQKDFFITTDPKLQAPL-LSRGKHK 178
Query: 213 DAHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
+ + PD+ + + E I + W+EK A+W+G +
Sbjct: 179 NLKYIALIPDYHDLSYKNM------EMINKITALNGKYPWEEKQNLAFWRGVNRKKARYL 232
Query: 273 VELMKCNDSKLWGAEILRQNWAEEAKD---------GFKKSKLSNQCNHRYKIYAE---- 319
+ ++ + L A N EE +D KK S+ + +YK
Sbjct: 233 LSIISYQNPSLIDAG-FPDNIHEENQDIDTPNADISHLKKEFASHDDHLKYKYLPVLDGF 291
Query: 320 -----GYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVV 374
GY W L NS+ +F +GL P +++ PI D I +
Sbjct: 292 FCTYPGYQWR------LFSNSLCFKQESLEIQWFYKGLKPYEHYIPI--KDDMSDILEKI 343
Query: 375 DWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSK 416
DW N +KI + F +L ++ Y Y+ L+TEY K
Sbjct: 344 DWARKNDGLCKKITENAMKFASNNLFIENTYAYLFLLLTEYEK 386
>gi|224013146|ref|XP_002295225.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969187|gb|EED87529.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 818
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 74/162 (45%), Gaps = 18/162 (11%)
Query: 252 WKEKLPFAYWKG------------NPDVLSPLRVELM-KCNDSKLWGAEILRQNWAEEAK 298
W+EK+P A W+G + D ++ L+ K +S L A+ R A+ A
Sbjct: 283 WEEKMPKAVWRGQYGKTDKSHNGSDIDNTHDIKYALVSKHLNSSLVDAKFSRH--ADSAP 340
Query: 299 DGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
S L +Q ++Y I EG S LK++L NS+ L S ++ + GL+
Sbjct: 341 PLMAGSYLDMKDQLRYKYIISIEGNDVSSGLKWMLFSNSIVLAPSFTWEGWAMEGLLEPH 400
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL 398
H+ +P + +++ ++ W +P+E + I + F+ L
Sbjct: 401 VHY-LPLKEDMSNVEEMIAWAEDHPNEVQLIRERSTVFIHDL 441
>gi|159486392|ref|XP_001701225.1| hypothetical protein CHLREDRAFT_194259 [Chlamydomonas reinhardtii]
gi|158271925|gb|EDO97735.1| predicted protein [Chlamydomonas reinhardtii]
Length = 476
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 97/233 (41%), Gaps = 23/233 (9%)
Query: 133 AKRFAALRILIVRGKLYVDPYYDC---VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCM 189
AKR AL +++ RG+ YV + +++R + G ++ RR +PD ++
Sbjct: 95 AKRNGAL-VVLRRGRAYVASHRGMPGYIRTRLTVNLEGLVRGARRLGLQLPDTLFAYNAQ 153
Query: 190 DKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGW--SEVNLQPWNEEFKDIKHGS 247
D+PV E PLF + I DW S+V + F H
Sbjct: 154 DEPVCRLLEGACSDAPLFSH--------IKRYDWEQGRSIDSDVLIPHMLHVFNHTIHFP 205
Query: 248 QAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFK----- 302
A + A + + D S +RV L + + S GA +L + E ++
Sbjct: 206 WAAKDPRAVLRARMQSSMDHRSCMRVVLAQLSASPA-GARLLDAGFVENRHRTYRPTAEQ 264
Query: 303 -KSKLSNQCNHRYKIY--AEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGL 352
KS L+ + RY++ A+G+ S L Y+++ NS L + +++ R L
Sbjct: 265 MKSYLTIAEHARYRLLLNADGHTASSRLGYLMTINSPVLTEQSPWIEYYYRSL 317
>gi|402085319|gb|EJT80217.1| hypothetical protein GGTG_00220 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 485
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 60/157 (38%), Gaps = 27/157 (17%)
Query: 220 FPDWSFWGW--SEVNLQPWNEEFKDIKHGSQAK---------SWKEKLPFAYWKGNP--- 265
P +SFW W + ++E I G +A+ WK+K+P A W+G
Sbjct: 212 MPHFSFWAWPLRHITGGTFDEAAAAID-GLEARYRGGGGDGGGWKDKVPTAVWRGTAHFQ 270
Query: 266 DVLSP-LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWS 324
L P LR L+K +G + + C HRY I+ EG A+S
Sbjct: 271 SALQPGLRRGLLKTAGHD---GGGGAGWADVLPLNGTTALPIHDFCRHRYVIHTEGVAYS 327
Query: 325 VSLKYILSCNSVALI--------ISQQYKDFFSRGLI 353
+ + C SV L S + FSR L+
Sbjct: 328 GRFQLLQMCRSVVLTPPLMWVQHTSHLLRPVFSRSLL 364
>gi|385324668|ref|YP_005879107.1| putative CAP10-like protein [Neisseria meningitidis 8013]
gi|261393055|emb|CAX50650.1| putative CAP10-like protein [Neisseria meningitidis 8013]
Length = 327
Score = 42.0 bits (97), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
++ QC +RY + EG + +LK+I + NSV + +Y+ +F GL+ H+ + D
Sbjct: 212 IAQQCAYRYILSIEGNDVATNLKWISASNSVCFMTHPKYETWFCEGLMIPDLHY-VSLED 270
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFME 396
+ + + ++P A KI +A +++++
Sbjct: 271 DYSDLNEKLAFYRSHPDAARKIVEASKEYIK 301
>gi|378443249|ref|YP_005230959.1| LpsA protein [Aliivibrio fischeri]
gi|373881649|gb|AEY78252.1| LpsA protein [Aliivibrio fischeri]
Length = 302
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 54/106 (50%), Gaps = 3/106 (2%)
Query: 302 KKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHF 359
+K KLS Q +++ + EG + SLK+ LS NS+ L++ +++ +F GL+ H+
Sbjct: 181 QKDKLSIQEQLQYKFILSIEGNDVATSLKWTLSSNSLCLMVKPKFETWFMEGLLIAGVHY 240
Query: 360 PIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYD 405
++D + +++ +P EAE+I F++ + D
Sbjct: 241 VELNSDYS-DLDEKLEYYIEHPEEAERIINNAHKFIQQFQNPNIED 285
>gi|407778883|ref|ZP_11126144.1| lipopolysaccharide core biosynthesis protein lpsA [Nitratireductor
pacificus pht-3B]
gi|407299419|gb|EKF18550.1| lipopolysaccharide core biosynthesis protein lpsA [Nitratireductor
pacificus pht-3B]
Length = 322
Score = 41.6 bits (96), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 67/140 (47%), Gaps = 12/140 (8%)
Query: 252 WKEKLPFAYWKGNPDVLSPLR-VELMKCNDSKLWGAEILRQNWAEEAKDGFK-KSKLS-- 307
+++K P A W+G + + R VEL + + G + +G K++LS
Sbjct: 147 FEQKAPRAVWRGVLNTEARRRLVELYGGHTAFDIGH-------VQTGVEGVAPKARLSIP 199
Query: 308 NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLC 367
Q RY I EG + +LK+I++ NS+ L+ +Y+ +F G + H+ ADL
Sbjct: 200 QQLASRYIISLEGNDVATNLKWIMTSNSLCLMPRPRYETWFMEGALVPGRHYAELRADLS 259
Query: 368 RSIKSVVDWGNANPSEAEKI 387
++V + +P EA +I
Sbjct: 260 DLEETVAHY-ERHPEEARRI 278
>gi|357025464|ref|ZP_09087588.1| lipopolysaccharide core biosynthesis protein lpsA [Mesorhizobium
amorphae CCNWGS0123]
gi|355542708|gb|EHH11860.1| lipopolysaccharide core biosynthesis protein lpsA [Mesorhizobium
amorphae CCNWGS0123]
Length = 323
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 36/166 (21%), Positives = 74/166 (44%), Gaps = 6/166 (3%)
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR-VELMKCNDSKLWGAEILRQNWAEE 296
++F+ + + +K P W+G ++ +P+R V L + ++ AE+
Sbjct: 131 DKFRHFHMPADRMHFADKRPAVVWRG--ELNNPIRTVFLEATRNLSFCDVGTPKRTAAEQ 188
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
+ + + Q ++RY + EG + +LK+I+S NS+ L+ Y+ +F+ I
Sbjct: 189 YRKPY--LTIEQQRHYRYIVSLEGNDVATNLKWIMSSNSLCLMPPPTYETWFAERQIEAN 246
Query: 357 NHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDR 402
H+ +P + V +P+EAE+I A + +R
Sbjct: 247 VHY-VPLEPDFADLADKVRHFERHPAEAERIIGAANAYCRKFCNER 291
>gi|390603949|gb|EIN13340.1| hypothetical protein PUNSTDRAFT_58013 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 649
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 61/114 (53%), Gaps = 10/114 (8%)
Query: 312 HRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRS 369
++Y + +G WS K +++ ++V L + Y ++FSR + P + P+ + +D+
Sbjct: 523 YKYVMDVDGNGWSSRFKRLITSHAVVLKAT-VYPEWFSRRIQPWVHFVPVKNDYSDVMDI 581
Query: 370 IKSVVDWGNANPSE--AEKIGKAGQDFMESLTMDRVYD---YMLHLITEYSKLL 418
+ +G ++ A KI +AG+++ S TM R D YM ++ EY++L+
Sbjct: 582 MAFFTGYGGGEDNDHLARKIAEAGREW--SRTMWRKEDLTAYMFRMLLEYARLM 633
>gi|258575361|ref|XP_002541862.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
gi|237902128|gb|EEP76529.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
Length = 2000
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/204 (21%), Positives = 80/204 (39%), Gaps = 25/204 (12%)
Query: 107 PDFFKSIHKDLEPWAKSRITMRHIMEAK-RFAALRILIVRGKLYVDPY--YDCVQSRAMF 163
P F I K + +I + +R +I G+L + + ++ +RA
Sbjct: 1726 PKLFAEIEKSVSARRGKKIEYEELASRTLEDGMVRAIIYNGELRIVNFEGHEFTFTRAKA 1785
Query: 164 TIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAH-FDIPF 220
T+ + L P +P+++ +F D HG P P++ Y D +
Sbjct: 1786 TLSSLNRALTAIPDRRSLPNIEFIFSA------DDFTHG--PGPIWTYSKRDEDSWAWLM 1837
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS----------WKEKLPFAYWKGNPDVLSP 270
PD+ +W W E N+ P+ + + I S ++ K +W+GN
Sbjct: 1838 PDFGYWSWPEANIGPYRQIRRRIAAIDDGDSVGGRVRPGLKFQNKHQKLFWRGNIATAPV 1897
Query: 271 LRVELMKCNDSKLWGAEILRQNWA 294
LR + ++ K W A +L +W+
Sbjct: 1898 LRNKFLQVTHDKAW-ASVLPMDWS 1920
>gi|315453182|ref|YP_004073452.1| lipopolysaccharide biosynthesis protein [Helicobacter felis ATCC
49179]
gi|315132234|emb|CBY82862.1| lipopolysaccharide biosynthesis protein [Helicobacter felis ATCC
49179]
Length = 144
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/97 (22%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIP 362
K+ ++ +++ + EGY + +LK+ILS NS+AL+ +++ FF + H+ IP
Sbjct: 16 KTSIATHLEYKFILSLEGYDVASNLKWILSSNSIALMPPPKFESFFLESQLLPNVHY-IP 74
Query: 363 SADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLT 399
D +++ +++ +A P + I ++ + +
Sbjct: 75 IKDDYSDVEAQLEFFSARPKDCLDIISNANAYVRAFS 111
>gi|313231418|emb|CBY08532.1| unnamed protein product [Oikopleura dioica]
Length = 219
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/179 (22%), Positives = 84/179 (46%), Gaps = 11/179 (6%)
Query: 249 AKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKK--SKL 306
KSW+ K+P +++G +++ + N++ L A I + ++ +G K +
Sbjct: 15 GKSWQSKIPKGFFRGRDSSKERMKISALSMNNTAL-DAGITSFQFHDQG-NGTKVPIVPM 72
Query: 307 SNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIP--TKNHFPIPSA 364
S+ N+++++ +G Y+ +S L+ Q+ K F+ P K+ +
Sbjct: 73 SDFGNYKFQLLLDGTVAPYRAPYVFQTSS--LVFKQKSK--FAEWWYPYLRKDIDFVELD 128
Query: 365 DLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+ +I+ ++W N AE I + G + E L + VY + L +YS+L+DY+P
Sbjct: 129 EKAENIEEKIEWALENDEIAEWIAQNGFELTKELLKPENVYCHYLQAFEQYSELMDYEP 187
>gi|51870124|ref|YP_073677.1| putative lipopolysaccharide-modifying enzyme [Lymphocystis disease
virus - isolate China]
gi|51858332|gb|AAU11016.1| putative lipopolysaccharide-modifying enzyme [Lymphocystis disease
virus - isolate China]
Length = 933
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 61/284 (21%), Positives = 109/284 (38%), Gaps = 54/284 (19%)
Query: 179 VPDVDIMFDCMDKPVIDKKEHGSF---------PL---------PLFRYCTNDAHFDIPF 220
VPD+D F+ D P++ + E + PL P+ T + D+P
Sbjct: 171 VPDIDFFFNQRDFPLLKEDETEPYQHIFNTSKQPLLSHNYSSYCPILSMVTARHYADVPV 230
Query: 221 PDWSFW--GWSEVNLQPWNEEFKDIKHGSQAK---SWKEKLPFAYWKGNP-------DVL 268
P + W WS N++ D+ + K W KL A ++G+ +
Sbjct: 231 PTYDCWIRAWS-----AENDKTDDLTYLESIKINTDWNSKLSKAIFRGSSTGAGVTVETN 285
Query: 269 SPLRVELMKCNDS-KLWGAEILRQNWAEEAKDGFKKSKL--------------SNQCNH- 312
L+V L + L A I + N K + Q +H
Sbjct: 286 QRLKVCLKLAKERPDLIDAGITKWNLRPRKHKSSKYLETIELEEYPLADFISPQEQADHY 345
Query: 313 RYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKS 372
+Y + EG+ + + ++ L+ YK +F L P ++ P+ A C ++
Sbjct: 346 KYILCLEGHVAAFRISREMTYGCTLLLTETPYKMWFVPYLKPWIHYVPV--AYDCSNLIE 403
Query: 373 VVDWGNANPSEAEKIGKAGQDFM-ESLTMDRVYDYMLHLITEYS 415
++W + E + I + F+ E LT+D+ DY +++TE S
Sbjct: 404 RIEWCKTHDKECKLIAEMALKFVTEELTIDKTLDYFKYILTELS 447
>gi|412987631|emb|CCO20466.1| DUF821 domain-containing protein [Bathycoccus prasinos]
Length = 449
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/115 (24%), Positives = 50/115 (43%), Gaps = 10/115 (8%)
Query: 292 NWAEEA---KDGF------KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
+WAE +DGF K L H+Y ++ +G S + L NSV + +
Sbjct: 275 DWAENRTSLEDGFLDVKMYSKISLREHMEHKYILHLDGQGHSFQFEEKLGLNSVVVSEKK 334
Query: 343 QYKDFFSRGLIPTKNHFPIPSADLC-RSIKSVVDWGNANPSEAEKIGKAGQDFME 396
++ +FS+ L P ++ D + V+ + + E ++I K GQ F +
Sbjct: 335 LFQTYFSKFLKPKTHYLEFWENDEKPEDVLEVLHYARTHDEEMQQIAKNGQKFAQ 389
>gi|291276759|ref|YP_003516531.1| lipopolysaccharide core biosynthesis protein [Helicobacter mustelae
12198]
gi|290963953|emb|CBG39792.1| putative lipopolysaccharide core biosynthesis protein [Helicobacter
mustelae 12198]
Length = 324
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/85 (27%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 303 KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIP 362
++ L+ +++ + EG + +LK+IL NS+AL+ +++ +F G + H+
Sbjct: 195 RANLATHLQYKFILSLEGNDVATNLKWILHSNSLALMPKPKFETWFMEGQLQAGVHYAEI 254
Query: 363 SADLCRSIKSVVDWGNANPSEAEKI 387
S D ++SVV++ A+P A++I
Sbjct: 255 SEDY-EDLESVVEYYLAHPHHAKEI 278
>gi|407977159|ref|ZP_11158048.1| lipopolysaccharide-modifying protein [Nitratireductor indicus C115]
gi|407427392|gb|EKF40087.1| lipopolysaccharide-modifying protein [Nitratireductor indicus C115]
Length = 323
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/163 (20%), Positives = 73/163 (44%), Gaps = 11/163 (6%)
Query: 253 KEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNH 312
+EK P A W+G + + R+ + +D + +I + E +
Sbjct: 148 EEKKPVAIWRGTMNNPARERIVGLYASDRRF---DIGQTGEGFEGIPAKPHVSIREHMRC 204
Query: 313 RYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKS 372
RY + EG + +LK+I++ NS+ L+ + + +F GL+ H+ A+LC + +
Sbjct: 205 RYIVSLEGRDVATNLKWIMASNSLCLMPRPRCETWFMEGLLEPGVHY----AELCDDLSN 260
Query: 373 ---VVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYMLHLIT 412
+V+ +P EA I +A + +R ++++ ++
Sbjct: 261 LPDLVEHFERHPEEARSIIRAANAHVARFA-NRRREHLISMLV 302
>gi|302843655|ref|XP_002953369.1| hypothetical protein VOLCADRAFT_94091 [Volvox carteri f.
nagariensis]
gi|300261466|gb|EFJ45679.1| hypothetical protein VOLCADRAFT_94091 [Volvox carteri f.
nagariensis]
Length = 460
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 68/164 (41%), Gaps = 21/164 (12%)
Query: 276 MKCNDSKLWGAEILRQN-WAEEAKDGFKKSKLS--------------NQCNHRYKIYAEG 320
M+ N ++LW E+ R N D + LS + +RY + A+G
Sbjct: 252 MRPNCTRLWIIELQRSNPEGRRLLDAGITNNLSKRKDIKLVDFVPIPDHARYRYLLSADG 311
Query: 321 YAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNAN 380
+ S L +L NSV L + + +++ R L P + P ++ +K + A+
Sbjct: 312 FTASCRLGKLLGTNSVVLKETTPWIEYYYRSLKPEVHFVPFNKDNVLEVVKDL----EAD 367
Query: 381 PSEAEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLL-DYKP 422
P ++I Q F + L+ Y+ + Y+ LL D +P
Sbjct: 368 PGRCQRISAEAQQFAYTFLSQHSKAMYVKRALVYYNSLLPDMEP 411
>gi|339906002|ref|YP_004732799.1| hypothetical protein WIV_gp016 [Wiseana iridescent virus]
gi|308051872|gb|ADO00359.1| hypothetical protein [Wiseana iridescent virus]
Length = 1080
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 112/271 (41%), Gaps = 38/271 (14%)
Query: 204 LPLFRYCTNDAHFDIPFPDWSFWG-----------------WSEVNLQPWNEEF-KDIKH 245
+P+ CT+D DI P W E N Q W+++ K +
Sbjct: 254 MPILSMCTSDKFADIAIPTHEDWARIKSNEGIFFPQKCRNYTFEFNTQNWDKKVNKAVFR 313
Query: 246 GSQA-----KSWKEKLPFAYWKGNPDVLSPLRVELMKCN--DSKLWGAEILRQNWAEEAK 298
GS +L A NP++ + L V + N K +E L+ EE +
Sbjct: 314 GSNTGCGYNTKGNTRLKLAKLGTNPELKNYLDVGITNWNLRIRKNKDSEYLQIPDVEEIQ 373
Query: 299 DGFKKSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS--QQYKDFFSRGLIP 354
SKL+ Q N++Y I+ +G+ + L LS LI+ +++K +FS L P
Sbjct: 374 ---LVSKLTPEQQSNYKYLIHVDGHVSAFRLSLELSMGCCILIVESCEKWKMWFSGLLEP 430
Query: 355 TKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAG-QDFMESLTMDRVYDYMLH-LIT 412
++ PI S DL I +V W N + ++I + + + + LT + D + L T
Sbjct: 431 YVHYVPIKS-DLSDLIDQIV-WCRKNDDKCKQIAQNSLKLYNKYLTKKGILDNLQKTLFT 488
Query: 413 EYSKLLDYKPAPPSS--AFEACVESLLCLAD 441
+S++ PP F+ +E L +D
Sbjct: 489 LHSEMGVEGTTPPQDPLLFQTDIEHRLLTSD 519
>gi|88800546|ref|ZP_01116108.1| putative lipopolysaccharide A protein [Reinekea blandensis MED297]
gi|88776691|gb|EAR07904.1| putative lipopolysaccharide A protein [Reinekea sp. MED297]
Length = 317
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/146 (19%), Positives = 68/146 (46%), Gaps = 3/146 (2%)
Query: 251 SWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQC 310
SW++K A W+G+ + R+ ++ K+ A+ R K F + Q
Sbjct: 151 SWEQKQDSAIWRGHAHNDNRKRLITSNLHNPKINAAQTNRNYDGLPPKAPFMP--IPQQL 208
Query: 311 NHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSI 370
H++ + EG + +LK+ + S+ + + Y+ +F G++ H+ D +
Sbjct: 209 KHKFLLSIEGVDVASNLKWAMGSQSLVISPTLHYETWFMEGMLQPGVHYVEVKNDFS-DL 267
Query: 371 KSVVDWGNANPSEAEKIGKAGQDFME 396
++ +D+ ++P+EA+ I + ++ +
Sbjct: 268 EAKIDYYLSHPAEAKAIVRNANNYTQ 293
>gi|159491300|ref|XP_001703609.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270628|gb|EDO96467.1| predicted protein [Chlamydomonas reinhardtii]
Length = 375
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 51/118 (43%), Gaps = 2/118 (1%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
+S+Q +RY + +G A S ++ S+ + + +F L P + HF
Sbjct: 174 ISHQVKYRYLVSTDGVATSRKMEVYFLFGSLVIKSASDRMGYFYDALRPDE-HFVTCLNS 232
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYKP 422
R I VV W ++ +EA +I + Q F +E L + +I E + + Y P
Sbjct: 233 SARDILDVVRWARSHDAEARRIAETAQRFAVEHLRRSARLCQIRTVIEELGRRMRYTP 290
>gi|358401551|gb|EHK50852.1| hypothetical protein TRIATDRAFT_94120 [Trichoderma atroviride IMI
206040]
Length = 298
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 54/119 (45%), Gaps = 9/119 (7%)
Query: 59 FPQRTFKEESRRSQAYKIVHCTYLTCLSA---MNPIPERRRVASPQKVQTCPDFFKS--I 113
FP + F +E R+Q ++ H L+C+S+ P RR+V ++ T PDF +
Sbjct: 41 FPVQQFSDEESRAQGRQVTH-PILSCISSTAVFTADPSRRQVQIRTQIPTLPDFLAADQF 99
Query: 114 HKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
+K LE K R + A L + +L + + + ++ R + + GF LL
Sbjct: 100 NKHLE---KIRSGIAQANLASELTPLLPRHIATRLIQNSFAEIMEDRQLLDLAGFTALL 155
>gi|159482524|ref|XP_001699319.1| hypothetical protein CHLREDRAFT_152544 [Chlamydomonas reinhardtii]
gi|158272955|gb|EDO98749.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 178 MVPDVDIMFDCMDKPVIDKKEH--GSFPLPLFRYCTNDAHFDIPFPDWSFWG 227
++PDV+ + DKP++ + +H G P P+ R+C++D H DI P W+ G
Sbjct: 71 LIPDVEFVLASSDKPLVLRADHPPGRVP-PVMRFCSSDEHADIKIP-WAARG 120
>gi|345568242|gb|EGX51139.1| hypothetical protein AOL_s00054g515 [Arthrobotrys oligospora ATCC
24927]
Length = 454
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 55/126 (43%), Gaps = 10/126 (7%)
Query: 219 PFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKG----NPDVLSPLRVE 274
P P + W W + P E + ++EK+ +W+G NP LR
Sbjct: 193 PVPHFGHWSWPVKYVGPLIEVLSQVSEIESTLPFEEKVDKLFWRGTPSFNPIQNQNLRGN 252
Query: 275 LMKCNDSKLWGAEILRQNWA--EEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
L+ + K W A+ + W E+AK+ + ++ C ++Y Y EG +S L + +
Sbjct: 253 LIGATEGKDW-ADTGQLEWTALEKAKNIVQIPEI---CRYKYIAYTEGITYSGRLPFHML 308
Query: 333 CNSVAL 338
C SV +
Sbjct: 309 CESVII 314
>gi|302832622|ref|XP_002947875.1| hypothetical protein VOLCADRAFT_103644 [Volvox carteri f.
nagariensis]
gi|300266677|gb|EFJ50863.1| hypothetical protein VOLCADRAFT_103644 [Volvox carteri f.
nagariensis]
Length = 939
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 58/119 (48%), Gaps = 6/119 (5%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKY--ILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS 363
L +Q ++Y + +G WS+S K+ L S+ L F+ + P +++ PI
Sbjct: 752 LHDQNKYKYIVSTDG--WSISSKFDKYLLLGSLVLKAEGLTYGFYYPAIKPFEHYVPIMY 809
Query: 364 ADLCRSIKSVVDWGNANPSEAEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDYK 421
I +++W ++ +EAE+I Q F M +L + Y+ LITE SK + Y+
Sbjct: 810 KHE-NDIIDMLEWAKSHDAEAEQIAMNAQRFAMRNLNRNARLCYIFRLITELSKQMKYE 867
>gi|300769995|ref|ZP_07079874.1| lipopolysaccharide biosynthesis protein [Sphingobacterium
spiritivorum ATCC 33861]
gi|300762471|gb|EFK59288.1| lipopolysaccharide biosynthesis protein [Sphingobacterium
spiritivorum ATCC 33861]
Length = 321
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 60/115 (52%), Gaps = 8/115 (6%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPS- 363
+ ++++ + EG + +LK+I+S NSVA++ +Y+ +F G LIP ++ I
Sbjct: 205 IQQHLDYKFILSLEGNDVATNLKWIMSSNSVAVMPLPKYETWFMEGKLIPDFHYIQIKDD 264
Query: 364 -ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYM-LHLITEYSK 416
+DL + +D +P EAEKI +++ ++ D + L ++ +Y K
Sbjct: 265 YSDLEEKLHYYID----HPQEAEKIVDNAHQYIQQFNNRKLEDLIALRVLEKYFK 315
>gi|347842163|emb|CCD56735.1| hypothetical protein [Botryotinia fuckeliana]
Length = 291
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 49/122 (40%), Gaps = 5/122 (4%)
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKG----NPDVLSPLRVELM 276
P +SFW W + ++ I K W EK+ A W+G N R L+
Sbjct: 24 PHFSFWSWPVSFIGTVDQAISKIDRIEMDKHWTEKIDKAVWRGTGWFNTVGNKDSRPSLV 83
Query: 277 KCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSV 336
K W A+I W + + + C ++Y +Y EG +S L + +C SV
Sbjct: 84 LKGKDKEW-ADIEALKWTTNGESAENAIGIEDFCKYKYIVYTEGITYSGRLLFHQACASV 142
Query: 337 AL 338
L
Sbjct: 143 IL 144
>gi|381401864|ref|ZP_09926755.1| hypothetical protein KKB_08214 [Kingella kingae PYKK081]
gi|380833161|gb|EIC13038.1| hypothetical protein KKB_08214 [Kingella kingae PYKK081]
Length = 319
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 308 NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLC 367
Q ++Y + EG + +LK+I++ SV L+ +Y+ + G + + H+ I D
Sbjct: 207 GQLRYKYILSIEGNDVATNLKWIMASQSVCLMTRPKYETWLMEGRLQPEVHY-IGLEDDY 265
Query: 368 RSIKSVVDWGNANPSEAEKIGKAGQDFME 396
+ + + ANP AEKI + Q +M+
Sbjct: 266 SDLDEKLRFYKANPQAAEKIVQQAQQWMQ 294
>gi|333376583|ref|ZP_08468356.1| lipopolysaccharide biosynthesis protein LpsA [Kingella kingae ATCC
23330]
gi|332967737|gb|EGK06844.1| lipopolysaccharide biosynthesis protein LpsA [Kingella kingae ATCC
23330]
Length = 319
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
Q ++Y + EG + +LK+I++ SV L+ +Y+ + G + + H+ I D
Sbjct: 208 QLRYKYILSIEGNDVATNLKWIMASQSVCLMTHPKYETWLMEGRLQPEVHY-IGLEDDYS 266
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFME 396
+ + + ANP AEKI + Q +M+
Sbjct: 267 DLDEKLRFYKANPQAAEKIVQQAQQWMQ 294
>gi|261250528|ref|ZP_05943103.1| putative lipopolysaccharide A protein [Vibrio orientalis CIP 102891
= ATCC 33934]
gi|260939097|gb|EEX95084.1| putative lipopolysaccharide A protein [Vibrio orientalis CIP 102891
= ATCC 33934]
Length = 190
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 1/90 (1%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
+ Q +++ I EG + +LK+ +S NSV + +++ +F G + H+ I D
Sbjct: 72 VEEQLQYKFLICLEGNDVASNLKWAMSSNSVVVTPKMKFETWFMEGTLQAGVHY-IEVKD 130
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ + + NP EAEKI K D++
Sbjct: 131 DWSDFEEKIQYYIDNPLEAEKIVKNAHDYI 160
>gi|116201527|ref|XP_001226575.1| hypothetical protein CHGG_08648 [Chaetomium globosum CBS 148.51]
gi|88177166|gb|EAQ84634.1| hypothetical protein CHGG_08648 [Chaetomium globosum CBS 148.51]
Length = 462
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 49/217 (22%), Positives = 79/217 (36%), Gaps = 14/217 (6%)
Query: 135 RFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVI 194
R +I I+ + D + + SR ++ + L P +PD + D+P
Sbjct: 110 RIKDGQIHIIHAQRKSDLSQEMLNSRTA-SLHQLHRALLTSPTPLPDTIFTLNFQDQPFG 168
Query: 195 DKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSE--VNLQPWNEEFKDIKHGSQAKSW 252
+ P F + P +SFW W+ + E A +W
Sbjct: 169 TAWAYSRHADPQFVSKKDGHARTFLMPHFSFWAWNLPFIGSMGRAAEAIAELEAEYAGAW 228
Query: 253 KEKLPFAYWKGN---PDVLSP-LRVELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSN 308
K+ A W+G V SP +R L+K K W A++ W SN
Sbjct: 229 HGKIAKAVWRGTMWFNSVHSPRMRQNLVKEARGKPW-ADVEPLEWTSNGGGKGAAKTASN 287
Query: 309 Q------CNHRYKIYAEGYAWSVSLKYILSCNSVALI 339
C ++Y I+ EG A+S +++ C SV L
Sbjct: 288 ALPIEGFCRYKYIIHTEGIAYSGRFQFLQMCASVVLT 324
>gi|433775789|ref|YP_007306256.1| hypothetical protein Mesau_04548 [Mesorhizobium australicum
WSM2073]
gi|433667804|gb|AGB46880.1| hypothetical protein Mesau_04548 [Mesorhizobium australicum
WSM2073]
Length = 323
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 33/145 (22%), Positives = 67/145 (46%), Gaps = 6/145 (4%)
Query: 238 EEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC-NDSKLWGAEILRQNWAEE 296
++F+ + + A S+ +KLP W+G D+ +P+R + D A + N E
Sbjct: 131 DKFRHFQMPADAMSFADKLPSVVWRG--DLNNPIRTRFLNAVRDLPFCDAGSHKPNAPPE 188
Query: 297 AKDGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTK 356
F +S +RY + EG + +LK+I++ S+ ++ Y+ +F+ +
Sbjct: 189 YARPF--LSISQHQRYRYIVSLEGNDVATNLKWIMNSKSLCMMPPPTYETWFAERQLEAN 246
Query: 357 NHFPIPSADLCRSIKSVVDWGNANP 381
H+ +P A +++ V + NP
Sbjct: 247 VHY-VPLAADFSNLEDHVAYFERNP 270
>gi|352105539|ref|ZP_08960854.1| hypothetical protein HAL1_16251 [Halomonas sp. HAL1]
gi|350598412|gb|EHA14532.1| hypothetical protein HAL1_16251 [Halomonas sp. HAL1]
Length = 323
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/88 (25%), Positives = 48/88 (54%), Gaps = 1/88 (1%)
Query: 309 QCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCR 368
Q N+++ + EG + +LK+I++ NS+ + +Y+ +F G + H+ + AD
Sbjct: 209 QLNYKFILSIEGNDVATNLKWIMASNSLCFMTRPKYETWFMEGTLQPSFHY-VELADDYS 267
Query: 369 SIKSVVDWGNANPSEAEKIGKAGQDFME 396
++ + + P+EA+KI K +++E
Sbjct: 268 DLEEKLIYYQQYPNEAKKIIKNANNYVE 295
>gi|417953387|ref|ZP_12596433.1| putative lipopolysaccharide A protein [Vibrio orientalis CIP 102891
= ATCC 33934]
gi|342817261|gb|EGU52145.1| putative lipopolysaccharide A protein [Vibrio orientalis CIP 102891
= ATCC 33934]
Length = 308
Score = 39.3 bits (90), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 1/90 (1%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
+ Q +++ I EG + +LK+ +S NSV + +++ +F G + H+ I D
Sbjct: 190 VEEQLQYKFLICLEGNDVASNLKWAMSSNSVVVTPKMKFETWFMEGTLQAGVHY-IEVKD 248
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFM 395
+ + + NP EAEKI K D++
Sbjct: 249 DWSDFEEKIQYYIDNPLEAEKIVKNAHDYI 278
>gi|158424572|ref|YP_001525864.1| lipopolysaccharide-modifying enzyme [Azorhizobium caulinodans ORS
571]
gi|158331461|dbj|BAF88946.1| putative lipopolysaccharide-modifying enzyme [Azorhizobium
caulinodans ORS 571]
Length = 338
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 43/179 (24%), Positives = 76/179 (42%), Gaps = 24/179 (13%)
Query: 235 PWNEEFKDIK-------HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAE 287
PW E ++ G A SW + GNP + +R+ L ++ G +
Sbjct: 140 PWAEREDVVRWRGAPSGQGEIAHSWMDA-------GNPMLRQRVRMCLALRDEP---GVD 189
Query: 288 ILRQNWAEEAKDGFKKSKLS--NQCNHRYKIYAEGY--AW-SVSLKYILSCNSVALIISQ 342
+ A+ A G K +L + RY I +G+ AW + + +L C + +
Sbjct: 190 VAFAGRADLAPYGISKERLDPLSWAKVRYAIDIDGFSAAWMNFFSRLLLGCCVIKVASPY 249
Query: 343 QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMD 401
++ +F L+P ++ P+ ADL +K + W ANP +I + GQ S T++
Sbjct: 250 GFRQWFYDALVPWIHYVPV-KADLS-DLKEKIAWCRANPDACRQIAEEGQRLALSRTVE 306
>gi|242065946|ref|XP_002454262.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
gi|241934093|gb|EES07238.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
Length = 173
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 104 QTCPDFFKSIHKDLEPW-AKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRA 161
+CP +F IH+DL PW A IT + A A R +++ G+ YV Q+R+
Sbjct: 88 NSCPSYFCFIHEDLRPWRAAGGITRAMLDRAHLTATFRFVVLEGRAYVHRLRPAFQNRS 146
>gi|452836796|gb|EME38739.1| hypothetical protein DOTSEDRAFT_75476 [Dothistroma septosporum
NZE10]
Length = 462
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 46/224 (20%), Positives = 93/224 (41%), Gaps = 34/224 (15%)
Query: 220 FPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKS--WKEKLPFAYWKGNPDVLSPLRVELMK 277
P++ WG N ++D K ++A ++ K+P W+G+ V +R +++
Sbjct: 235 LPNFDLWGQG-------NSSYQDSKLSARAHDAPFRSKIPKIVWRGSAWVNMDIRQKMIN 287
Query: 278 CNDSKLWGAEILRQNWAEEAK-DGFKKSK---LSNQCNHRYKIYAEGYAWSVSLKYILSC 333
+WA+ K DG + C + ++ EG ++S +K++ +C
Sbjct: 288 IGGDN---------DWADMKKLDGDHNENSLPVGEWCRYAMTVHTEGVSYSGRMKWLTNC 338
Query: 334 NSVALIISQQYKDFFSRGL---IPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
NS+ +++ ++ L P +N+ + +DL + +D +P EAE+I
Sbjct: 339 NSLIFFHEREWDMYYYHLLEASGPNQNYVAVKRDWSDLEEKVLYYLD----HPEEAERII 394
Query: 389 KAGQDFMESLTMDRVYD--YMLHLITEYSKLLDYKPAPPSSAFE 430
+ R Y LI Y++ + ++PA A E
Sbjct: 395 ANNMATFRDRYLTRAATSCYTRKLIQGYAE-VSFQPATQRPASE 437
>gi|307205675|gb|EFN83937.1| KDEL motif-containing protein 1 [Harpegnathos saltator]
Length = 337
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 40/184 (21%), Positives = 79/184 (42%), Gaps = 11/184 (5%)
Query: 245 HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAEILRQNWAEEAKD--GFK 302
G+ W+ K+ +W+G L + + KL+ I + ++ D G
Sbjct: 128 QGNTDLPWENKVEQLFWRGRDSRKERLDLIDISRKHPKLFNVSITNFFFFKDKIDKYGPG 187
Query: 303 KSKLS--NQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFP 360
+S +S + ++Y++ +G + Y+L+ +S+ +Y +FF + L ++ P
Sbjct: 188 QSHVSFFDFFKYKYQLNIDGTVATYRFPYLLAGDSLVFKQESKYYEFFYKDLTSGLHYVP 247
Query: 361 IPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDR-VYDYMLHLITEYSKL 417
+ S +DL I+ W + + KI K+ + F + R + Y L E+SK
Sbjct: 248 VKSDLSDLVEKIQ----WAKEHDEDGLKIAKSARQFARDNLLPRDILCYYTTLFHEWSKR 303
Query: 418 LDYK 421
L K
Sbjct: 304 LKSK 307
>gi|342320014|gb|EGU11958.1| Glycosyltransferase family 90 protein [Rhodotorula glutinis ATCC
204091]
Length = 696
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 63/128 (49%), Gaps = 10/128 (7%)
Query: 299 DGFKKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNH 358
DGF + NQ ++Y I +G WS +++ NS+ L S + +++S + P ++
Sbjct: 561 DGFMPVEEQNQ--YKYVIDVDGNGWSGRFHRLMASNSLVL-KSTIFPEWYSDRIQPWVHY 617
Query: 359 FPIPS--ADLCRSI----KSVVDWGNANPSEAEKIGKAGQDFME-SLTMDRVYDYMLHLI 411
P+ + DL + S D + AEKI AG+ + E + + Y+L L+
Sbjct: 618 VPVKTDYTDLLPILAFFKGSPYDGSGGHDELAEKIASAGKQWAEQNWRWVDMQAYLLRLL 677
Query: 412 TEYSKLLD 419
EY+++++
Sbjct: 678 LEYARVMN 685
>gi|343506873|ref|ZP_08744335.1| hypothetical protein VII00023_01065 [Vibrio ichthyoenteri ATCC
700023]
gi|342801221|gb|EGU36699.1| hypothetical protein VII00023_01065 [Vibrio ichthyoenteri ATCC
700023]
Length = 313
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 45/91 (49%), Gaps = 1/91 (1%)
Query: 302 KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPI 361
K ++ Q +++ + EG + +LK+I+S NS+ +Y+ +F G + HF +
Sbjct: 192 KTMTIAEQLEYKFILSLEGMDVATNLKWIMSSNSLCFTPKLRYETWFMEGRLQAGTHF-V 250
Query: 362 PSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
D + +D+ ++P EAE I + Q
Sbjct: 251 EIKDDFSDLDEKMDYYLSHPEEAEAIIQNAQ 281
>gi|15222368|ref|NP_174436.1| F-box associated ubiquitination effector family protein
[Arabidopsis thaliana]
gi|12597852|gb|AAG60162.1|AC074360_27 hypothetical protein [Arabidopsis thaliana]
gi|332193245|gb|AEE31366.1| F-box associated ubiquitination effector family protein
[Arabidopsis thaliana]
Length = 309
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 42/95 (44%), Gaps = 10/95 (10%)
Query: 183 DIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWNEE--- 239
D DC V+DK HG F LPL C + I ++ +G + +++E
Sbjct: 135 DFKSDCWKVVVVDKTFHGFFRLPLSSVCIGGTPYWI---GYNIYGTVSIQSFDFSKERFE 191
Query: 240 --FKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR 272
F+ + H S WK+ L ++G D LS LR
Sbjct: 192 TLFESLHHSSIGIQWKDSLSLGIFRG--DHLSLLR 224
>gi|227538446|ref|ZP_03968495.1| lipopolysaccharide biosynthesis protein [Sphingobacterium
spiritivorum ATCC 33300]
gi|227241728|gb|EEI91743.1| lipopolysaccharide biosynthesis protein [Sphingobacterium
spiritivorum ATCC 33300]
Length = 321
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 59/113 (52%), Gaps = 8/113 (7%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRG-LIPTKNHFPIPS- 363
+ ++++ + EG + +LK+I+S NSVA++ +Y+ +F G LIP ++ I
Sbjct: 205 IQQHLDYKFILSLEGNDVATNLKWIMSSNSVAVMPLPKYETWFMEGKLIPDFHYIQIKDD 264
Query: 364 -ADLCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLTMDRVYDYM-LHLITEY 414
+DL + +D +P EAEKI +++ ++ D + L ++ +Y
Sbjct: 265 YSDLEEKLHYYID----HPHEAEKIIDNAHQYIQQFNNRKLEDLIALRVLEKY 313
>gi|164426694|ref|XP_957627.2| hypothetical protein NCU11268 [Neurospora crassa OR74A]
gi|157071438|gb|EAA28391.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 409
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 94/237 (39%), Gaps = 30/237 (12%)
Query: 130 IMEAKRFAALRILIVRGKLYV-------DPYYDCVQSRAMFTIWGFLQLLRRYPG---MV 179
+ + +++ I GK+YV D + V SR LL P +V
Sbjct: 15 VKRSSDLGPMQVRIKDGKMYVLHAQRKRDLSREMVNSRTAALHQLHRALLTLPPSDRSLV 74
Query: 180 PDVDIMF--DCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFWGWSEVNLQPWN 237
D+D + + +D P ++ P+ DA + P +SFW W + +
Sbjct: 75 SDLDTILTINILDTPFGTALQYTRNADPVHAPSDPDARTFL-IPHFSFWAWDLPFIGSIS 133
Query: 238 EEFKDIKHGS----QAKSW-KEKLPFAYWKGNP---DVLSP-LRVELMKCNDSKLWGAEI 288
I + Q W K P A W+G + +P LR +L+ +K W A++
Sbjct: 134 RAASAITNLETTQFQGNRWHSHKDPRAVWRGTTWFNSIHNPQLRYKLVSTAKAKPW-ADV 192
Query: 289 LRQNWAEEAKDGFKKSK-------LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVAL 338
W + G ++K + + C ++Y ++ EG ++S +++ C SV L
Sbjct: 193 QSLEWTTTSTTGNGENKNATNSLAIEDFCKYKYVLHTEGISYSGRFQFLQMCASVTL 249
>gi|337269404|ref|YP_004613459.1| lipopolysaccharide-modifying protein [Mesorhizobium opportunistum
WSM2075]
gi|336029714|gb|AEH89365.1| lipopolysaccharide-modifying protein [Mesorhizobium opportunistum
WSM2075]
Length = 323
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 33/143 (23%), Positives = 61/143 (42%), Gaps = 6/143 (4%)
Query: 251 SWKEKLPFAYWKGNPDVLSPLRVELMKC-NDSKLWGAEILRQNWAEEAKDGFKKSKLSNQ 309
++ +KLP W+G D+ +PLR + D A + N + F +
Sbjct: 144 AFADKLPAVVWRG--DLNNPLRTRFLDAVRDLPFCDAGSHKPNAPAKYSKPF--LSIGQH 199
Query: 310 CNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRS 369
+RY + EG + +LK+I++ NS+ L+ Y+ +F + H+ D
Sbjct: 200 LRYRYIVSLEGNDVATNLKWIMNSNSLCLMPPPTYETWFRESQLEANVHYVPLDPDFA-D 258
Query: 370 IKSVVDWGNANPSEAEKIGKAGQ 392
+ V + +P+ AE+I A
Sbjct: 259 LAEKVSYFERHPAAAERIVLAAN 281
>gi|395519040|ref|XP_003763661.1| PREDICTED: protein O-glucosyltransferase 1 [Sarcophilus harrisii]
Length = 335
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 37/169 (21%), Positives = 76/169 (44%), Gaps = 18/169 (10%)
Query: 105 TCPDFFKSIHKDLEPWAK--SRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAM 162
C + KDL P+ S+ M +++ K I++ +LY + +DC+
Sbjct: 52 NCSCHLGVMEKDLAPFQGGISKEMMANVVNRK--LGTHYQIIKNELYRE--HDCMFPARC 107
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
+ F+ + + +PD++++ + D P I K +P+F + + DI +P
Sbjct: 108 SGVEHFILEIINH---LPDMEMVINVRDYPQIPKWMEPI--IPVFSFSKTSEYHDIMYPA 162
Query: 223 WSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGN 264
W+FW G V L W+ +D+ ++ W++K+ Y++G+
Sbjct: 163 WTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWEKKVSKGYFRGS 211
>gi|343515593|ref|ZP_08752645.1| hypothetical protein VIBRN418_00976 [Vibrio sp. N418]
gi|342798024|gb|EGU33657.1| hypothetical protein VIBRN418_00976 [Vibrio sp. N418]
Length = 312
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 44/91 (48%), Gaps = 1/91 (1%)
Query: 302 KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPI 361
K ++ Q +++ + EG + +LK+I+S NS+ +Y+ +F G + HF
Sbjct: 192 KSMTIAEQLEYKFILSLEGMDVATNLKWIMSSNSLCFTPKLRYETWFMEGRLQAGKHFVE 251
Query: 362 PSADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
D + +D+ ++P EAE I + Q
Sbjct: 252 VKNDFS-DLDEKMDYYLSHPEEAEAIIQNAQ 281
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.136 0.441
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,703,747,565
Number of Sequences: 23463169
Number of extensions: 327375631
Number of successful extensions: 611065
Number of sequences better than 100.0: 722
Number of HSP's better than 100.0 without gapping: 263
Number of HSP's successfully gapped in prelim test: 459
Number of HSP's that attempted gapping in prelim test: 609461
Number of HSP's gapped (non-prelim): 784
length of query: 463
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 317
effective length of database: 8,933,572,693
effective search space: 2831942543681
effective search space used: 2831942543681
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)