BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022616
(294 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359474017|ref|XP_002269577.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
Length = 585
Score = 395 bits (1016), Expect = e-108, Method: Compositional matrix adjust.
Identities = 190/294 (64%), Positives = 227/294 (77%), Gaps = 13/294 (4%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNY 59
I F FS + F V+N R I + + RP I PLNC +N TQTCP NY
Sbjct: 115 FISFFCFSS--RFIFLVNNSERKAVPISENHRKTPRP---IVVPLNCSARNLTQTCPGNY 169
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T T D + P CPDYFRWIHEDL PWK TGI+RDM+ERA ++AHFRL++V
Sbjct: 170 PTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVK 222
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVIRS D+ GPN+
Sbjct: 223 GKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNST 282
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN W++REPYAYW
Sbjct: 283 SPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYW 342
Query: 240 KGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
KGNP VAETRRDLLTCN+SD DWNARL+VQDW+LES++G++QS++++QC HRY
Sbjct: 343 KGNPLVAETRRDLLTCNVSDVQDWNARLFVQDWMLESQQGYKQSDVSNQCTHRY 396
>gi|297742541|emb|CBI34690.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 190/294 (64%), Positives = 227/294 (77%), Gaps = 13/294 (4%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNY 59
I F FS + F V+N R I + + RP I PLNC +N TQTCP NY
Sbjct: 27 FISFFCFSS--RFIFLVNNSERKAVPISENHRKTPRP---IVVPLNCSARNLTQTCPGNY 81
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T T D + P CPDYFRWIHEDL PWK TGI+RDM+ERA ++AHFRL++V
Sbjct: 82 PTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVK 134
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVIRS D+ GPN+
Sbjct: 135 GKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNST 194
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN W++REPYAYW
Sbjct: 195 SPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYW 254
Query: 240 KGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
KGNP VAETRRDLLTCN+SD DWNARL+VQDW+LES++G++QS++++QC HRY
Sbjct: 255 KGNPLVAETRRDLLTCNVSDVQDWNARLFVQDWMLESQQGYKQSDVSNQCTHRY 308
>gi|147800396|emb|CAN66409.1| hypothetical protein VITISV_020976 [Vitis vinifera]
Length = 439
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 175/245 (71%), Positives = 207/245 (84%), Gaps = 7/245 (2%)
Query: 49 KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN 108
KN TQTCP NYP T T D + P CPDYFRWIHEDL PWK TGI+RDM+ERA
Sbjct: 23 KNLTQTCPGNYPTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAK 75
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
++AHFRL++V KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVI
Sbjct: 76 RSAHFRLVIVKGKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVI 135
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
RS D+ GPN+ PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN
Sbjct: 136 RSGDHRGPNSTSPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRT 195
Query: 229 NWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQ 288
W++REPYAYWKGNP VAETRRDLLTCN+SD DWNARL+VQDW+LES++G++QS++++Q
Sbjct: 196 KWMEREPYAYWKGNPLVAETRRDLLTCNVSDVQDWNARLFVQDWMLESQQGYKQSDVSNQ 255
Query: 289 CAHRY 293
C HRY
Sbjct: 256 CTHRY 260
>gi|255539447|ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549903|gb|EEF51390.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 528
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 176/300 (58%), Positives = 218/300 (72%), Gaps = 12/300 (4%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQN------RPPKRITTPLNCVK-NQTQ 53
+ F + L S+ + S K +I P N +I PLNC N T+
Sbjct: 45 LASAFLTTRFLDSSSAFTGSSAQKPLITTKSAPTNPTLISKNALNKINIPLNCAAFNLTR 104
Query: 54 TCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHF 113
TCP+NYP T + P S CP+Y+RWI+EDL PW TGI+RDM+ERA TA+F
Sbjct: 105 TCPSNYPTTF-----TENPDRPSVSACPEYYRWIYEDLRPWARTGISRDMVERAKTTANF 159
Query: 114 RLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY 173
RL++VN K Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC D PVI+S +Y
Sbjct: 160 RLVIVNGKAYVEKYRRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIKSSNY 219
Query: 174 SGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDR 233
SGPN PPPLFRY GD T+D+VFPDWSFWGW+EINIKPWE LLRELKEGN R W++R
Sbjct: 220 SGPNAMAPPPLFRYCGDDDTLDVVFPDWSFWGWSEINIKPWERLLRELKEGNEKRRWMER 279
Query: 234 EPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
EPYAYWKGNP VAETR+DL+ CN+S++ DWNAR+Y QDWI E ++G++QSNLASQC HRY
Sbjct: 280 EPYAYWKGNPAVAETRQDLMKCNVSEQQDWNARVYAQDWIKELQQGYKQSNLASQCMHRY 339
>gi|15237842|ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana]
gi|10176852|dbj|BAB10058.1| unnamed protein product [Arabidopsis thaliana]
gi|48310551|gb|AAT41837.1| At5g23850 [Arabidopsis thaliana]
gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis thaliana]
gi|332005839|gb|AED93222.1| uncharacterized protein [Arabidopsis thaliana]
Length = 542
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 173/274 (63%), Positives = 213/274 (77%), Gaps = 12/274 (4%)
Query: 22 RNKTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTN-YPKTSQTQESISDYSIPPTST 79
R TVI S KP+ L+C N+T +CP+N YP T+ ++ D + PPT+T
Sbjct: 89 RPTTVITQSPKPE--------FTLHCSANETTASCPSNKYPTTTSFEDD--DTNHPPTAT 138
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYFRWIHEDL PW TGITR+ LERA +TA FRL +V K+Y+ K++ + QTRDVFTI
Sbjct: 139 CPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFTI 198
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG LQLLRKYPG++PDLELMFDC D PV+R+ +++G N PPPLFRY G+ T+DIVFP
Sbjct: 199 WGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNEETLDIVFP 258
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
DWSFWGWAE+NIKPWESLL+EL+EGN WI+REPYAYWKGNP VAETR+DL+ CN+S+
Sbjct: 259 DWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQDLMKCNVSE 318
Query: 260 KHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+H+WNARLY QDWI ESK G++QS+LASQC HRY
Sbjct: 319 EHEWNARLYAQDWIKESKEGYKQSDLASQCHHRY 352
>gi|297808383|ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
lyrata]
gi|297317912|gb|EFH48334.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
lyrata]
Length = 543
Score = 368 bits (945), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 171/274 (62%), Positives = 213/274 (77%), Gaps = 11/274 (4%)
Query: 22 RNKTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTN-YPKTSQTQESISDYSIPPTST 79
R TVI S KP+ L+C N+T +CP+N YP T+ E D + PP +T
Sbjct: 89 RPTTVITQSPKPE--------FTLHCSANETTASCPSNKYPTTASFGED-DDTNHPPNAT 139
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYFRWIHEDL PW TGITR+ LERA +TA+FRL +++ K+Y+ K++ + QTRDVFTI
Sbjct: 140 CPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLAIIDGKIYVEKFQDAFQTRDVFTI 199
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG LQLLRKYPG++PDLELMFDC D PV+++ +++G N PPPLFRY G+ T+DIVFP
Sbjct: 200 WGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGANAPSPPPLFRYCGNEETLDIVFP 259
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
DWSFWGWAE+NIKPWESLL+EL+EGN WI+REPYAYWKGNP VAETR+DL+ CN+S+
Sbjct: 260 DWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPYAYWKGNPMVAETRQDLMKCNVSE 319
Query: 260 KHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+H+WNARLYVQDWI ES G++QS+LASQC HRY
Sbjct: 320 EHEWNARLYVQDWIKESNEGYKQSDLASQCHHRY 353
>gi|332071136|gb|AED99886.1| glycosyltransferase [Panax notoginseng]
Length = 546
Score = 360 bits (925), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 166/262 (63%), Positives = 206/262 (78%), Gaps = 4/262 (1%)
Query: 34 QNRPPKRITTPLNC-VKNQTQTCPTNY-PKTSQTQESISDYSIPPTSTCPDYFRWIHEDL 91
+ +PP+++ PLNC N +TCP NY P+T Q+ SIPP S CP+YFRWI+EDL
Sbjct: 100 RKKPPRKVEIPLNCSTGNLIRTCPANYYPRTFNIQDQ-DHSSIPPVS-CPEYFRWIYEDL 157
Query: 92 SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPG 151
PW+ TGITR+M+ERA +TA+FRL+++N + Y+ +++S Q+RDVFT+WGILQLLR YPG
Sbjct: 158 RPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDVFTLWGILQLLRMYPG 217
Query: 152 RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
++PDL+LMFDC D PVI SR Y GPN PPPLFRY D T+DIVFPDW+FWGW EINI
Sbjct: 218 KVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDIVFPDWTFWGWPEINI 277
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQD 271
KPW SLL++LKEGN G W+DREPYAYWKGNP VA+TR DLL CN+SDK DWNAR+Y D
Sbjct: 278 KPWGSLLKDLKEGNTGTQWMDREPYAYWKGNPIVAKTRMDLLKCNVSDKQDWNARVYAXD 337
Query: 272 WILESKRGFQQSNLASQCAHRY 293
W ES+ G++QS+LASQC HRY
Sbjct: 338 WARESQLGYKQSDLASQCIHRY 359
>gi|255537419|ref|XP_002509776.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549675|gb|EEF51163.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 534
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 179/286 (62%), Positives = 209/286 (73%), Gaps = 24/286 (8%)
Query: 11 LQSNFSVHNISRNKTVIIVSRKPQNRPPKR-ITTPLNCV--KNQTQTCPTNYPKTSQTQE 67
LQS SV S N TV+I + N P + I PLNC NQTQTCPTNYPK + ++
Sbjct: 73 LQSKVSVER-STNTTVVISEK--HNIPQRETIEFPLNCTINNNQTQTCPTNYPKLTTSRA 129
Query: 68 SISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKY 127
D PP + CP+YFRWIHEDL PW GI+RDM+ERA +TAHF LI+V K YI KY
Sbjct: 130 DDQD---PPRTVCPNYFRWIHEDLRPWIAAGISRDMVERAQRTAHFHLIIVGGKAYIKKY 186
Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY 187
++S QTRD FTIWGILQLLR+YPG++PDLELMFD DDRPVIRS DY N GPPPLFRY
Sbjct: 187 RESTQTRDTFTIWGILQLLRRYPGKIPDLELMFDTDDRPVIRSSDYHEQNTTGPPPLFRY 246
Query: 188 SGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
GDR PD INIKPW+ L ++KEGNNG WIDREPYAYWKGNPFVAE
Sbjct: 247 CGDR-------PD--------INIKPWDELSIDIKEGNNGSKWIDREPYAYWKGNPFVAE 291
Query: 248 TRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
TR+DLL CN+SD+ DWNARL++QDWI ES++G++QS+LA QCAHRY
Sbjct: 292 TRKDLLACNVSDQRDWNARLFIQDWIQESQQGYKQSDLARQCAHRY 337
>gi|255539445|ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549902|gb|EEF51389.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 506
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 165/265 (62%), Positives = 204/265 (76%), Gaps = 6/265 (2%)
Query: 30 SRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
S P +P R+ PLNC N T+TCPT+YP TS + S PPT CP+YFRWIH
Sbjct: 58 STVPLEKPDNRLVIPLNCHALNLTRTCPTDYPSTSSQD---PNRSSPPT--CPEYFRWIH 112
Query: 89 EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
EDL PW TGITR+ +ERA TA+FRL+++N Y+ Y++S QTRDVFT+WGILQLLRK
Sbjct: 113 EDLRPWVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQTRDVFTLWGILQLLRK 172
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
YPGR+PDLE+MFDC D PV++S DYSG + PPPLFRY G+ T+DIVFPDWS+WGW E
Sbjct: 173 YPGRVPDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDETLDIVFPDWSYWGWVE 232
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
NIKPWE ++++LKEGN W +REPYAYWKGNP VAETR DL+ CN+S +HDWNARLY
Sbjct: 233 TNIKPWEKIVKDLKEGNQRSKWKEREPYAYWKGNPNVAETRLDLMKCNVSQEHDWNARLY 292
Query: 269 VQDWILESKRGFQQSNLASQCAHRY 293
QDW+ ES++G++QS+LA+QC HRY
Sbjct: 293 TQDWVRESQQGYKQSDLANQCNHRY 317
>gi|225470185|ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
gi|302143884|emb|CBI22745.3| unnamed protein product [Vitis vinifera]
Length = 525
Score = 356 bits (914), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 165/251 (65%), Positives = 197/251 (78%), Gaps = 5/251 (1%)
Query: 44 PLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
PLNC N T+TCP NYP T S D P CP YFRWI+ DL PW +GITR+
Sbjct: 92 PLNCSAGNLTRTCPRNYP----TAFSPEDPDRPSPPECPHYFRWIYGDLRPWMKSGITRE 147
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ERA +TA F+L+++N + Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC
Sbjct: 148 MVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDC 207
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PVI+S +Y GPN PPPLFRY GD T+DIVFPDWSFWGW EINIKPWESLL++LK
Sbjct: 208 VDWPVIQSNEYRGPNATAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWESLLKDLK 267
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+Y QDWILES+ G++Q
Sbjct: 268 EGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARVYTQDWILESQEGYKQ 327
Query: 283 SNLASQCAHRY 293
S+LASQC HRY
Sbjct: 328 SDLASQCIHRY 338
>gi|359489773|ref|XP_002273091.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 519
Score = 356 bits (914), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 168/260 (64%), Positives = 203/260 (78%), Gaps = 7/260 (2%)
Query: 36 RPPKRITTPLNC-VKNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
+ P R+ PLNC N T+TCP NYP K + T S ++ TCPDYFRWIH+DL P
Sbjct: 76 KAPIRLEYPLNCSAGNLTKTCPGNYPTKHNPTNPDHSSTNM----TCPDYFRWIHQDLLP 131
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
WK TGITRDM++RA +TAHFRL++++ K Y+ K+++SIQTRD+FT+WGILQLLR YPGRL
Sbjct: 132 WKQTGITRDMVDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFTLWGILQLLRWYPGRL 191
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
PDLELMFDCDDRPV+R RD+ GPN PPPLFRY GD W++DIVFPDWSFWGWAE NIKP
Sbjct: 192 PDLELMFDCDDRPVVRMRDFRGPN-AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKP 250
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWI 273
W ++L+++KEGN W DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+QDW
Sbjct: 251 WRNVLKDIKEGNRRTKWKDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQDWG 310
Query: 274 LESKRGFQQSNLASQCAHRY 293
+SK G++QSNL QC HRY
Sbjct: 311 QQSKIGYRQSNLEDQCTHRY 330
>gi|15229061|ref|NP_190467.1| uncharacterized protein [Arabidopsis thaliana]
gi|6522568|emb|CAB62012.1| putative protein [Arabidopsis thaliana]
gi|332644958|gb|AEE78479.1| uncharacterized protein [Arabidopsis thaliana]
Length = 539
Score = 355 bits (911), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 161/260 (61%), Positives = 202/260 (77%), Gaps = 6/260 (2%)
Query: 38 PKRITTPLNCVK---NQTQTCPT-NYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
PK T LNC N T TCP NYP + ++ + P++TCPDYFRWIHEDL P
Sbjct: 92 PKEFT--LNCAAFSGNDTGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDLRP 149
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ TGITR+ LERAN TA FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG++
Sbjct: 150 WEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKI 209
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
PDLELMFDC D PV+++ +++G + PPPLFRY + T+DIVFPDWS+WGWAE+NIKP
Sbjct: 210 PDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKP 269
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWI 273
WESLL+EL+EGN WIDREPYAYWKGNP VAETR DL+ CNLS+ +DW ARLY QDW+
Sbjct: 270 WESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQDWV 329
Query: 274 LESKRGFQQSNLASQCAHRY 293
ESK G++QS+LASQC HRY
Sbjct: 330 KESKEGYKQSDLASQCHHRY 349
>gi|297816106|ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
lyrata]
gi|297321774|gb|EFH52195.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
lyrata]
Length = 539
Score = 354 bits (908), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 165/262 (62%), Positives = 206/262 (78%), Gaps = 10/262 (3%)
Query: 38 PKRITTPLNCVK---NQTQTCP-TNYPKT--SQTQESISDYSIPPTSTCPDYFRWIHEDL 91
PK T LNC N T TCP NYP + S E SD S+ ++TCPDYFRWIHEDL
Sbjct: 92 PKEFT--LNCAGFAGNDTVTCPKNNYPTSFRSSVGEGESDRSL--SATCPDYFRWIHEDL 147
Query: 92 SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPG 151
PW+ TGITR+ LERAN TA+FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG
Sbjct: 148 RPWEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPG 207
Query: 152 RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
++PDLELMFDC D PV+++ +++G + PPPLFRY + T+DIVFPDWS+WGWAE+NI
Sbjct: 208 KIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNI 267
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQD 271
KPWESLL+EL+EGN WIDREPYAYWKGNP VAETR DL+ CNLS+++DW ARLY QD
Sbjct: 268 KPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEEYDWKARLYKQD 327
Query: 272 WILESKRGFQQSNLASQCAHRY 293
W+ ESK G++QS+LASQC HRY
Sbjct: 328 WVKESKEGYKQSDLASQCHHRY 349
>gi|110743626|dbj|BAE99650.1| hypothetical protein [Arabidopsis thaliana]
Length = 433
Score = 348 bits (892), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 154/243 (63%), Positives = 195/243 (80%), Gaps = 1/243 (0%)
Query: 52 TQTCPT-NYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
T TCP NYP + ++ + P++TCPDYFRWIHEDL PW+ TGITR+ LERAN T
Sbjct: 1 TGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDLRPWEKTGITREALERANAT 60
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
A FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG++PDLELMFDC D PV+++
Sbjct: 61 AIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKA 120
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+++G + PPPLFRY + T+DIVFPDWS+WGWAE+NIKPWESLL+EL+EGN W
Sbjct: 121 AEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKPWESLLKELREGNQRTKW 180
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCA 290
IDREPYAYWKGNP VAETR DL+ CNLS+ +DW ARLY QDW+ ESK G++QS+LASQC
Sbjct: 181 IDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQDWVKESKEGYKQSDLASQCH 240
Query: 291 HRY 293
HRY
Sbjct: 241 HRY 243
>gi|356497228|ref|XP_003517464.1| PREDICTED: KDEL motif-containing protein 1-like [Glycine max]
Length = 522
Score = 347 bits (890), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 159/254 (62%), Positives = 193/254 (75%), Gaps = 13/254 (5%)
Query: 40 RITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGI 99
++ LNC N Q C T+YP + + CP+YFRWIHED+ WK GI
Sbjct: 89 QLEATLNCSNNGKQRC-TSYPTSGV-------FEREEGGVCPEYFRWIHEDVGAWKERGI 140
Query: 100 TRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
+R+M+ERA ++AHFRL++ +VY+ +YK+SIQTR+VFT+WGI+QLLRKYPG++ DLELM
Sbjct: 141 SREMVERAKKSAHFRLVVKRGRVYVERYKKSIQTREVFTMWGIVQLLRKYPGKVADLELM 200
Query: 160 FDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLR 219
FDCDD PVIR G + GPPPLFRY GDRWT DIVFPDWSFWGWAEINI+PWE +L+
Sbjct: 201 FDCDDLPVIR-----GSSLAGPPPLFRYCGDRWTDDIVFPDWSFWGWAEINIRPWEHVLK 255
Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRG 279
E+++GN W DREPYAYWKGNPFVAETR+DLL CN+S DWNARLYVQDWI ES++G
Sbjct: 256 EMEKGNRRIKWNDREPYAYWKGNPFVAETRQDLLKCNVSTTQDWNARLYVQDWIQESQQG 315
Query: 280 FQQSNLASQCAHRY 293
F SNLASQC HRY
Sbjct: 316 FNNSNLASQCTHRY 329
>gi|356529208|ref|XP_003533188.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 469
Score = 346 bits (888), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 166/280 (59%), Positives = 206/280 (73%), Gaps = 6/280 (2%)
Query: 15 FSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYS 73
FS S T + +R + K I PLNC N T TCPTN Q+ +D S
Sbjct: 6 FSGSTKSALSTTLYDTRPFRVSIQKPIEIPLNCSAYNLTGTCPTN-----QSPIPENDQS 60
Query: 74 IPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
P ++TCP+YFRWIHEDL PW TGIT++M+ERA QTA+F+L+++ K Y+ Y+++ QT
Sbjct: 61 RPSSATCPEYFRWIHEDLRPWARTGITQEMVERAKQTANFKLVILKGKAYLETYEKAYQT 120
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
RDVF+IWGILQLLR+YPG++PDLELMFDC D PV+ Y+GPN + PPPLFRY G+ T
Sbjct: 121 RDVFSIWGILQLLRRYPGKIPDLELMFDCVDWPVLLVDRYNGPNTEQPPPLFRYCGNDAT 180
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+DIVFPDWSFWGWAE+NIKPWE LL ELKEG W++REPYAYWKGNP VAETR DL+
Sbjct: 181 LDIVFPDWSFWGWAEVNIKPWEILLGELKEGTKRIPWLNREPYAYWKGNPVVAETRLDLM 240
Query: 254 TCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
CN+S+ DWNARLY QDW ES+ G+++S+LASQC HRY
Sbjct: 241 KCNVSENQDWNARLYAQDWGRESQEGYKKSDLASQCTHRY 280
>gi|255541542|ref|XP_002511835.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549015|gb|EEF50504.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 522
Score = 346 bits (887), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 158/266 (59%), Positives = 199/266 (74%), Gaps = 8/266 (3%)
Query: 31 RKPQNRPPKRITTPLNCVK--NQTQTCPTNY-PKTSQTQESISDYSIPPTSTCPDYFRWI 87
+K P ++ PL C N TQTCP +Y PKT + S ++ CP YF+WI
Sbjct: 74 QKASPTPQEKAEFPLRCTATPNITQTCPADYYPKTHNSTNSDRQSNV----ICPSYFKWI 129
Query: 88 HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
HEDL PW+ TGITRDM+ERA +TAHFRL++V+ + Y+ KY+QSIQTRD+ T+WGILQLLR
Sbjct: 130 HEDLRPWRETGITRDMIERARRTAHFRLVIVDGRAYVEKYRQSIQTRDMITLWGILQLLR 189
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
YPG++PDLELMFDCDDRPV+RS D+ GP GPPPLFRY D ++DIVFPDWSFWGWA
Sbjct: 190 LYPGKVPDLELMFDCDDRPVVRSEDFPGPT-AGPPPLFRYCADDTSLDIVFPDWSFWGWA 248
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
E+NIKPW+S+L+ + +G+ + W DR PYAYWKGNP+V+ R DL+TCN+SDKHDWNARL
Sbjct: 249 EVNIKPWKSMLKGITKGSKRKKWKDRVPYAYWKGNPYVSANRGDLMTCNVSDKHDWNARL 308
Query: 268 YVQDWILESKRGFQQSNLASQCAHRY 293
Y QDW E ++ ++ S L QC HRY
Sbjct: 309 YAQDWGKEIRQKYKHSKLEDQCTHRY 334
>gi|449452346|ref|XP_004143920.1| PREDICTED: O-glucosyltransferase rumi-like [Cucumis sativus]
Length = 514
Score = 346 bits (887), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 163/260 (62%), Positives = 202/260 (77%), Gaps = 16/260 (6%)
Query: 44 PLNC----VKNQTQ--TCPTNYPKTSQTQESISDYSIPPT--STCPDYFRWIHEDLSPWK 95
PLNC NQTQ TC +YP + + SI P+ S CP+YFRWIHEDL PW
Sbjct: 76 PLNCSSSSTTNQTQHFTCRKDYPTLYEPE------SIGPSGRSVCPEYFRWIHEDLKPWA 129
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
GITR+M+E+ TAHFRL +V VY+ YK+SIQTRD+FTIWGILQLLR+YPG++PD
Sbjct: 130 AGGITREMVEKGKATAHFRLAVVRGIVYVEHYKKSIQTRDLFTIWGILQLLRRYPGQIPD 189
Query: 156 LELMFDCDDRPVIRSRDY--SGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
LELMFDCDDRPV++S DY +G + PP+FRY GD T+DIVFPDWSFWGWAEINI+P
Sbjct: 190 LELMFDCDDRPVVKSADYRNAGVDTVEAPPVFRYCGDEETLDIVFPDWSFWGWAEINIRP 249
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWI 273
WE+LL+ELK+GN R W+ RE +AYWKGNP+VA+TR+DLL CNLS ++DWNARLY+QDWI
Sbjct: 250 WENLLKELKKGNEKRKWMKREAFAYWKGNPYVADTRQDLLKCNLSLQNDWNARLYIQDWI 309
Query: 274 LESKRGFQQSNLASQCAHRY 293
ES++G++QS LA+QC +RY
Sbjct: 310 QESQQGYKQSKLANQCTYRY 329
>gi|356561622|ref|XP_003549080.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 525
Score = 343 bits (880), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 167/299 (55%), Positives = 208/299 (69%), Gaps = 17/299 (5%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRIT-----TPLNCVK-NQTQT 54
+I FA++ L ++ S +K+ S P P ++ PLNC N T T
Sbjct: 52 IIGAFAYTRTLDTHRMFSGASSSKSA--QSTTPYGTSPFTVSIRKPIAPLNCTAYNLTGT 109
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
CPTN + P T+TCPDYFRWIHEDL PW TGIT+DM+ERA QTA+FR
Sbjct: 110 CPTNLQDHQNS---------PATATCPDYFRWIHEDLRPWARTGITQDMVERAKQTANFR 160
Query: 115 LILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
LI++ + Y+ Y + QTRDVF+IWGILQLLR+YPG++PDLELMFDC+D PV+ + Y+
Sbjct: 161 LIILKGRAYLETYSRPYQTRDVFSIWGILQLLRRYPGKIPDLELMFDCEDWPVVLADRYN 220
Query: 175 GPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
GPN + PPPLFRY G+ T+DIVFPDWSFWGWAEINIKPW LL ELKEG W++RE
Sbjct: 221 GPNVEQPPPLFRYCGNDATLDIVFPDWSFWGWAEINIKPWHILLGELKEGTTRIPWLNRE 280
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
PYAYWKGNP VAETR+DL+ CN+S+ DWNARL+ QDW ES+ GF +S+L SQC +RY
Sbjct: 281 PYAYWKGNPAVAETRQDLIKCNVSENQDWNARLFAQDWFRESQEGFNKSDLPSQCTYRY 339
>gi|356522636|ref|XP_003529952.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 510
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 159/269 (59%), Positives = 198/269 (73%), Gaps = 16/269 (5%)
Query: 27 IIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
II + KPQ + PL+C +N TQTC +YP PT TCP YFRW
Sbjct: 67 IIPTTKPQEQ-----EFPLSCTQNVTQTCSRDYPTIHTPTN--------PTRTCPSYFRW 113
Query: 87 IHEDLSPWKVT--GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQ 144
IHEDL PW+ GITR+MLE A +TAHFRL++V+ K+Y+ KYK++IQTRDVFT+WGILQ
Sbjct: 114 IHEDLWPWRERDRGITREMLEGARRTAHFRLVIVDGKLYVEKYKKAIQTRDVFTLWGILQ 173
Query: 145 LLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
LLR YPG++PDLEL+FDCDDRPV+ + GPN PP LFRY D+W++DIVFPDWSFW
Sbjct: 174 LLRMYPGKVPDLELLFDCDDRPVVSKERFKGPNAPTPP-LFRYCSDQWSLDIVFPDWSFW 232
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
GWAEINIKPW+ +L+E+KEGN W DR PYAYWKGNP V+ TR+DL+ CN+++K DWN
Sbjct: 233 GWAEINIKPWKHVLKEIKEGNEKTKWKDRVPYAYWKGNPLVSPTRKDLMKCNVTEKDDWN 292
Query: 265 ARLYVQDWILESKRGFQQSNLASQCAHRY 293
LY+QDW ES +G+++SNL QC HRY
Sbjct: 293 THLYIQDWDQESSKGYKKSNLGDQCTHRY 321
>gi|357497409|ref|XP_003618993.1| KTEL motif-containing protein [Medicago truncatula]
gi|355494008|gb|AES75211.1| KTEL motif-containing protein [Medicago truncatula]
Length = 515
Score = 336 bits (861), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 153/253 (60%), Positives = 197/253 (77%), Gaps = 12/253 (4%)
Query: 44 PLNCV--KNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGIT 100
PL C + +TQTCP +YP K + T ++ + TCP +FRWIHEDL PWK GIT
Sbjct: 83 PLRCTNGEKETQTCPRDYPTKHNPTNQN--------SHTCPSFFRWIHEDLKPWKEKGIT 134
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMF 160
R+MLE A +TA+F++++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR +PG+LPDLELMF
Sbjct: 135 REMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRMFPGKLPDLELMF 194
Query: 161 DCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
DC+DRPVI ++ GPN PPPLFRY D+W++DIVFPDWSFWGWAE NIKPW+++L+E
Sbjct: 195 DCEDRPVIHKGNFQGPN-ASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWKNILKE 253
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGF 280
+KEGN W DR PYAYWKGNP VA TR++LL CN + K DWN RLY+QDW ES +G+
Sbjct: 254 IKEGNKETKWKDRVPYAYWKGNPNVAATRKNLLRCNATSKDDWNTRLYIQDWDKESTQGY 313
Query: 281 QQSNLASQCAHRY 293
++S+L +QC HRY
Sbjct: 314 KKSSLGNQCTHRY 326
>gi|297824639|ref|XP_002880202.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
lyrata]
gi|297326041|gb|EFH56461.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 155/255 (60%), Positives = 190/255 (74%), Gaps = 2/255 (0%)
Query: 39 KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
+R + VKNQTQ P + + ++ S +S P STCP YFRWIHEDL PWK TG
Sbjct: 75 QRFPNQCDVVKNQTQLFPEDGSSRNNNKKPRSSHSRP--STCPSYFRWIHEDLRPWKETG 132
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
ITR MLE+A +TAHFR+++++ +VY+ KY+ SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 133 ITRGMLEKARRTAHFRVVILDGRVYVKKYRNSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 192
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
MFD DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW L
Sbjct: 193 MFDPDDRPTVRSKDFQGHQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWAKSL 252
Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKR 278
++EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+QDW ES+
Sbjct: 253 VAIEEGNKMTQWTDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQDWDRESRE 312
Query: 279 GFQQSNLASQCAHRY 293
GF+ SNL +QC HRY
Sbjct: 313 GFKNSNLENQCTHRY 327
>gi|224074911|ref|XP_002304487.1| predicted protein [Populus trichocarpa]
gi|222841919|gb|EEE79466.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 333 bits (855), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 149/191 (78%), Positives = 169/191 (88%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
MLERA TAHFRLI+V K Y+ KYK+SIQTRD FTIWGILQLLR+YPG++PDLELMFDC
Sbjct: 1 MLERAKTTAHFRLIIVKGKAYLEKYKKSIQTRDAFTIWGILQLLRRYPGKIPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DD PVI+S DY GPN GPPPLFRY GD+WT DIVFPDWSFWGWAEINIKPW+ LL +LK
Sbjct: 61 DDLPVIQSSDYRGPNKTGPPPLFRYCGDKWTEDIVFPDWSFWGWAEINIKPWDKLLIDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGNN WIDREPYAYWKGNPFVAETR+DLLTCN+SD+ DWNARL++QDWILES++ F+Q
Sbjct: 121 EGNNRSRWIDREPYAYWKGNPFVAETRKDLLTCNVSDQQDWNARLFIQDWILESQQEFKQ 180
Query: 283 SNLASQCAHRY 293
SN+A+QC HRY
Sbjct: 181 SNVANQCTHRY 191
>gi|449485616|ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 538
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 157/271 (57%), Positives = 197/271 (72%), Gaps = 10/271 (3%)
Query: 29 VSRKPQNRPPK-RITTPLNCVKNQTQT---CPTNYPKTSQTQESISDYSIPPTST--CPD 82
V P ++P + ++ L+C T CP +YP T E + PP+S+ CPD
Sbjct: 83 VPHNPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQN----PPSSSSACPD 138
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFRWIHEDL PW TGITR LE +TA+FRL+++N K Y+ YK+S QTRD FT+WGI
Sbjct: 139 YFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGI 198
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
LQLLR+YPG++PDL+LMFDC D PVI + +SGPN PPPLFRY GD T DIVFPDWS
Sbjct: 199 LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWS 258
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGW EINIKPWE LL+++KEGN W R+PYAYWKGNP VA+TR+DL+ CN+SD+ D
Sbjct: 259 FWGWPEINIKPWEPLLKDIKEGNKRIPWKSRQPYAYWKGNPEVADTRKDLIKCNVSDQQD 318
Query: 263 WNARLYVQDWILESKRGFQQSNLASQCAHRY 293
WNAR++ QDW ES+ G++QSNL++QC HRY
Sbjct: 319 WNARVFAQDWTKESQEGYKQSNLSNQCLHRY 349
>gi|449446159|ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 538
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 157/271 (57%), Positives = 197/271 (72%), Gaps = 10/271 (3%)
Query: 29 VSRKPQNRPPK-RITTPLNCVKNQTQT---CPTNYPKTSQTQESISDYSIPPTST--CPD 82
V P ++P + ++ L+C T CP +YP T E + PP+S+ CPD
Sbjct: 83 VPHNPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQN----PPSSSSACPD 138
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFRWIHEDL PW TGITR LE +TA+FRL+++N K Y+ YK+S QTRD FT+WGI
Sbjct: 139 YFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGI 198
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
LQLLR+YPG++PDL+LMFDC D PVI + +SGPN PPPLFRY GD T DIVFPDWS
Sbjct: 199 LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWS 258
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGW EINIKPWE LL+++KEGN W REPYAYWKGNP VA+TR+DL+ CN+SD+ D
Sbjct: 259 FWGWPEINIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEVADTRKDLIKCNVSDQQD 318
Query: 263 WNARLYVQDWILESKRGFQQSNLASQCAHRY 293
WNAR++ QDW ES+ G++QS+L++QC HRY
Sbjct: 319 WNARVFAQDWTKESQEGYKQSDLSNQCLHRY 349
>gi|224127624|ref|XP_002320120.1| predicted protein [Populus trichocarpa]
gi|222860893|gb|EEE98435.1| predicted protein [Populus trichocarpa]
Length = 411
Score = 332 bits (850), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 148/217 (68%), Positives = 183/217 (84%), Gaps = 1/217 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
TSTCP YF+WIH+DL WK TGIT+DM+ERA +TAHFRL++VN K Y+ KY+QSIQTRD+
Sbjct: 9 TSTCPSYFQWIHDDLRHWKETGITQDMIERARKTAHFRLVIVNGKAYVEKYRQSIQTRDM 68
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FT+WGILQLLR YPGRLPDLELMFDCDDRPVI S+ + GPN PPPLFRY D ++DI
Sbjct: 69 FTLWGILQLLRLYPGRLPDLELMFDCDDRPVIPSKHFRGPN-AAPPPLFRYCSDWQSLDI 127
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGWAE NI+PW++LL+E+KEGN+ W DR PYAYW+GNP+V+ R+DLL CN
Sbjct: 128 VFPDWSFWGWAETNIRPWKNLLKEIKEGNSRTKWKDRTPYAYWRGNPWVSPIRQDLLKCN 187
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+S+++DWN RLY+QDW+ +SK G+++SNL QC HRY
Sbjct: 188 VSEQNDWNTRLYLQDWVKQSKEGYRESNLQDQCTHRY 224
>gi|42569947|ref|NP_182107.3| downstream target of AGL15 2 [Arabidopsis thaliana]
gi|330255513|gb|AEC10607.1| downstream target of AGL15 2 [Arabidopsis thaliana]
Length = 523
Score = 330 bits (845), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 153/255 (60%), Positives = 188/255 (73%), Gaps = 3/255 (1%)
Query: 39 KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
+R V+NQTQ P N + + S I STCP YFRWIHEDL PWK TG
Sbjct: 81 QRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRI---STCPSYFRWIHEDLRPWKETG 137
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 138 VTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 197
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
MFD DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW+ L
Sbjct: 198 MFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSL 257
Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKR 278
++EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+QDW ES+
Sbjct: 258 VAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQDWDRESRE 317
Query: 279 GFQQSNLASQCAHRY 293
GF+ SNL +QC HRY
Sbjct: 318 GFKNSNLENQCTHRY 332
>gi|3386611|gb|AAC28541.1| unknown protein [Arabidopsis thaliana]
Length = 517
Score = 329 bits (844), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 153/255 (60%), Positives = 188/255 (73%), Gaps = 3/255 (1%)
Query: 39 KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
+R V+NQTQ P N + + S I STCP YFRWIHEDL PWK TG
Sbjct: 75 QRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRI---STCPSYFRWIHEDLRPWKETG 131
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 132 VTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 191
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
MFD DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW+ L
Sbjct: 192 MFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSL 251
Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKR 278
++EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+QDW ES+
Sbjct: 252 VAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQDWDRESRE 311
Query: 279 GFQQSNLASQCAHRY 293
GF+ SNL +QC HRY
Sbjct: 312 GFKNSNLENQCTHRY 326
>gi|357481655|ref|XP_003611113.1| KTEL motif-containing protein [Medicago truncatula]
gi|355512448|gb|AES94071.1| KTEL motif-containing protein [Medicago truncatula]
Length = 502
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 154/254 (60%), Positives = 185/254 (72%), Gaps = 23/254 (9%)
Query: 43 TPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
T LNC +KNQT + +NYP T CP+YF+WIHEDL PWK GITR
Sbjct: 82 TVLNCTIKNQTCSS-SNYPTTKNNM------------VCPEYFKWIHEDLKPWKKKGITR 128
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYK--QSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
+M+E+A +TAHFRL++ N K Y+ KYK ++IQTRDVFT+WGILQLLRKYPG++PDLELM
Sbjct: 129 EMVEKAKKTAHFRLVVKNGKGYLEKYKNKEAIQTRDVFTVWGILQLLRKYPGKIPDLELM 188
Query: 160 FDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLR 219
FDC+D+PV+ P PPP+F Y DRWT DIVFPDWSFWGWAEINIKPWE LL+
Sbjct: 189 FDCNDKPVV-------PIGLDPPPVFGYCADRWTQDIVFPDWSFWGWAEINIKPWEHLLK 241
Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRG 279
++K+GN W DREPYAYWKGNP+ A TR D L CN+S DWN RL+ QDWI ES++G
Sbjct: 242 DIKKGNKRVKWKDREPYAYWKGNPYTAATRLDFLNCNVSTAQDWNLRLFTQDWIKESEQG 301
Query: 280 FQQSNLASQCAHRY 293
F SNLA QC +RY
Sbjct: 302 FNHSNLADQCTYRY 315
>gi|357497413|ref|XP_003618995.1| KTEL motif-containing protein [Medicago truncatula]
gi|355494010|gb|AES75213.1| KTEL motif-containing protein [Medicago truncatula]
Length = 303
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 145/250 (58%), Positives = 194/250 (77%), Gaps = 10/250 (4%)
Query: 45 LNCVKNQT--QTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
L C++ ++ Q CP + P T+ +++++ CP YFRWIHEDL PW+ GITR+
Sbjct: 62 LKCIQGKSFKQKCPRDNP-TTHNPTNLNNH------VCPSYFRWIHEDLKPWREKGITRN 114
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML++A +TAHF+L++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR YPG+LPDLELMFDC
Sbjct: 115 MLKKAKKTAHFKLVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRLYPGKLPDLELMFDC 174
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
+D+PV+ + GPN PPPLFRY D+W++DIVFPDWSFWGWAE NIKPW ++L+++K
Sbjct: 175 EDKPVVPLDKFQGPN-ASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWNNILKDIK 233
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGN W DR PYAYWKGNP+VA TR++LL CN++ ++DWN +Y+QDWI ES +GF +
Sbjct: 234 EGNKKTKWKDRVPYAYWKGNPYVAPTRQNLLQCNVTLENDWNTLIYIQDWIQESNQGFNK 293
Query: 283 SNLASQCAHR 292
SNL QC HR
Sbjct: 294 SNLGDQCTHR 303
>gi|297840159|ref|XP_002887961.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
lyrata]
gi|297333802|gb|EFH64220.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
lyrata]
Length = 571
Score = 322 bits (824), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 157/272 (57%), Positives = 200/272 (73%), Gaps = 16/272 (5%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCP-TNYPKTSQTQESISDYSIPPTSTCPDYFR 85
R+P+ + + L+C +N + +C T P +Q Q +CPDYF+
Sbjct: 119 REPEKKSQQNTGISLDCTSFLNQNGSGSCSRTPQPNNNQNQTE-------SNRSCPDYFK 171
Query: 86 WIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQL 145
WIHEDL PW+ TGIT++M+ER TAHFRL++VN KV++ YK+SIQTRD FT+WGILQL
Sbjct: 172 WIHEDLKPWRETGITKEMVERGKTTAHFRLVIVNGKVFVENYKKSIQTRDAFTLWGILQL 231
Query: 146 LRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWS 202
LRKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWS
Sbjct: 232 LRKYPGKLPDVDLMFDCDDRPVIRSDGYNRSNRTAEDVPPPLFRYCGDRWTVDIVFPDWS 291
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKH 261
FWGW EINIK W +L+E++EG + +++RE YAYWKGNPFVA +R DLLTCNLS H
Sbjct: 292 FWGWQEINIKEWSKVLKEMEEGKKKKKFMEREAYAYWKGNPFVASPSREDLLTCNLSSLH 351
Query: 262 DWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
DWNAR+++QDWI E +RGF+ SN+A+QC +RY
Sbjct: 352 DWNARIFIQDWISEGQRGFENSNVANQCTYRY 383
>gi|110736095|dbj|BAF00020.1| hypothetical protein [Arabidopsis thaliana]
Length = 418
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 143/216 (66%), Positives = 174/216 (80%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
STCP YFRWIHEDL PWK TG+TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVF
Sbjct: 12 STCPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVF 71
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
T+WGI+QLLR YPGRLPDLELMFD DDRP +RS+D+ G + PPPLFRY D ++DIV
Sbjct: 72 TLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIV 131
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGWAE+NIKPW+ L ++EGN W DR YAYW+GNP VA TRRDLL CN+
Sbjct: 132 FPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNV 191
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
S + DWN RLY+QDW ES+ GF+ SNL +QC HRY
Sbjct: 192 SAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRY 227
>gi|449446167|ref|XP_004140843.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 442
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 139/219 (63%), Positives = 170/219 (77%)
Query: 75 PPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
P +TCP+YFRWIHEDL PW GIT+ MLE A + AHFR+++V K Y+ Y ++ Q+R
Sbjct: 16 PMVATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSR 75
Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
D T+WG++QLLR+YPG+LPDL+LMF CDDRP I +DYSG PPPLFRYSGD T
Sbjct: 76 DNLTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATW 135
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW EINIK WES+L+++KEGN W+ R+PYAYWKGNP VA TRRDLL
Sbjct: 136 DIVFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLK 195
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
CN++ K DW+ARLY Q+W ESK GF+ SNLA+QC +RY
Sbjct: 196 CNVTQKQDWSARLYRQNWDKESKAGFKDSNLANQCDYRY 234
>gi|449526435|ref|XP_004170219.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 426
Score = 318 bits (814), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 138/217 (63%), Positives = 169/217 (77%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
+TCP+YFRWIHEDL PW GIT+ MLE A + AHFR+++V K Y+ Y ++ Q+RD
Sbjct: 2 VATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSRDN 61
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
T+WG++QLLR+YPG+LPDL+LMF CDDRP I +DYSG PPPLFRYSGD T DI
Sbjct: 62 LTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATWDI 121
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EINIK WES+L+++KEGN W+ R+PYAYWKGNP VA TRRDLL CN
Sbjct: 122 VFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLKCN 181
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
++ K DW+ARLY Q+W ESK GF+ SNLA+QC +RY
Sbjct: 182 VTQKQDWSARLYRQNWDKESKAGFKDSNLANQCDYRY 218
>gi|15222413|ref|NP_176531.1| uncharacterized protein [Arabidopsis thaliana]
gi|332195976|gb|AEE34097.1| uncharacterized protein [Arabidopsis thaliana]
Length = 578
Score = 317 bits (812), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 151/271 (55%), Positives = 200/271 (73%), Gaps = 14/271 (5%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
R + + P+ + ++C +N++ +C ++ S+ S CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
IHEDL PW+ TGIT++M+ER TAHFRL+++N KV++ YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
RKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296
Query: 204 WGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHD 262
WGW EINI+ W +L+E++EG + +++R+ YAYWKGNPFVA +R DLLTCNLS HD
Sbjct: 297 WGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHD 356
Query: 263 WNARLYVQDWILESKRGFQQSNLASQCAHRY 293
WNAR+++QDWI E +RGF+ SN+A+QC +RY
Sbjct: 357 WNARIFIQDWISEGQRGFENSNVANQCTYRY 387
>gi|15233070|ref|NP_191686.1| uncharacterized protein [Arabidopsis thaliana]
gi|6850894|emb|CAB71057.1| putative protein [Arabidopsis thaliana]
gi|27754477|gb|AAO22686.1| unknown protein [Arabidopsis thaliana]
gi|28393951|gb|AAO42383.1| unknown protein [Arabidopsis thaliana]
gi|332646660|gb|AEE80181.1| uncharacterized protein [Arabidopsis thaliana]
Length = 498
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 149/252 (59%), Positives = 183/252 (72%), Gaps = 7/252 (2%)
Query: 44 PLNC--VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
P C V+NQ+ P + + S+ + S S STCP YFRWIHEDL PWK TGITR
Sbjct: 63 PYGCDFVQNQSSQTPISQNRKSRLNPNNSSKS----STCPSYFRWIHEDLRPWKQTGITR 118
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
M+E A++TAHFRL++ N K Y+ +YK+SIQTRD FT+WGILQLLR YPG+LPDLELMFD
Sbjct: 119 GMIEEASRTAHFRLVIRNGKAYVKRYKKSIQTRDEFTLWGILQLLRWYPGKLPDLELMFD 178
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
DDRPV+RS D+ G K PPP+FRY D ++DIVFPDWSFWGWAE+N+KPW L +
Sbjct: 179 ADDRPVVRSVDFIG-QQKEPPPVFRYCSDDASLDIVFPDWSFWGWAEVNVKPWGKSLEAI 237
Query: 222 KEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQ 281
KEGN+ W DR YAYW+GNP+V R DLL CN ++ +WN RLY+QDW E+K GF+
Sbjct: 238 KEGNSMTQWKDRVAYAYWRGNPYVDPGRGDLLKCNATEHEEWNTRLYIQDWDKETKEGFK 297
Query: 282 QSNLASQCAHRY 293
SNL +QC HRY
Sbjct: 298 NSNLENQCTHRY 309
>gi|218191256|gb|EEC73683.1| hypothetical protein OsI_08244 [Oryza sativa Indica Group]
Length = 524
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 176/226 (77%), Gaps = 4/226 (1%)
Query: 72 YSIPPTST---CPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKY 127
+ +PPT+T CP YFR+IHEDL PW+ G ITR MLERA TA+FRL+++ + Y+ +
Sbjct: 115 HPLPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARDTANFRLVVLRGRAYVERI 174
Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY 187
+ QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+++ Y G N PPLFRY
Sbjct: 175 APAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQADRYQGENATAMPPLFRY 234
Query: 188 SGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
GD T+D+VFPDWSFWGW EINIKPW++L ++L GN W+DREPYAYWKGNP VA
Sbjct: 235 CGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKGNPDVAT 294
Query: 248 TRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
R++L+ CN+S KH+WNAR+Y QDWI ESK G++QS+LASQC HRY
Sbjct: 295 KRKELVKCNVSSKHEWNARIYKQDWIKESKAGYKQSDLASQCTHRY 340
>gi|297599669|ref|NP_001047557.2| Os02g0642700 [Oryza sativa Japonica Group]
gi|255671126|dbj|BAF09471.2| Os02g0642700, partial [Oryza sativa Japonica Group]
Length = 474
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 140/224 (62%), Positives = 175/224 (78%), Gaps = 4/224 (1%)
Query: 74 IPPTST---CPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
+PPT+T CP YFR+IHEDL PW+ G ITR MLERA TA+FRL+++ + Y+ +
Sbjct: 67 LPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARDTANFRLVVLRGRAYVERIAP 126
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
+ QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+++ Y G N PPLFRY G
Sbjct: 127 AFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQADRYQGENATAMPPLFRYCG 186
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
D T+D+VFPDWSFWGW EINIKPW++L ++L GN W+DREPYAYWKGNP VA R
Sbjct: 187 DDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKGNPDVATKR 246
Query: 250 RDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
++L+ CN+S KH+WNAR+Y QDWI ESK G++QS+LASQC HRY
Sbjct: 247 KELVKCNVSSKHEWNARIYKQDWIKESKAGYKQSDLASQCTHRY 290
>gi|359489784|ref|XP_002273475.2| PREDICTED: O-glucosyltransferase rumi [Vitis vinifera]
Length = 604
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 151/265 (56%), Positives = 181/265 (68%), Gaps = 11/265 (4%)
Query: 30 SRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
S KP N K+I LNC + N QTCP T + E P T TCP+YFRWI+
Sbjct: 163 SIKPCN---KKIEYSLNCSEGNMAQTCPVTPLATFEPSE-------PSTETCPEYFRWIY 212
Query: 89 EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
EDL PW+ TGITRDM+ERA A+ R+++V+ KVY+ KYK QTRDVFTIWGILQLLR
Sbjct: 213 EDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVYMEKYKGVYQTRDVFTIWGILQLLRM 272
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
YPG+LPD +LMF C D+ +++R + GPN PPPLF Y GD T DIVFPDWSFWGW E
Sbjct: 273 YPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPPLFHYCGDDETYDIVFPDWSFWGWPE 332
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
I+IK W +L ++LKEGNN WIDREPYAYWKGN + R +L CN + + DWNAR+Y
Sbjct: 333 IHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNINLGLARHELSKCNKTSEQDWNARIY 392
Query: 269 VQDWILESKRGFQQSNLASQCAHRY 293
DW E GF S+LASQC HRY
Sbjct: 393 DIDWRQEIHSGFNSSDLASQCTHRY 417
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 45/60 (75%)
Query: 106 RANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
++ AH R+++V+ KV++ KY+ QTR V TIW ILQLLR YPG+LPDL+L+F+C +
Sbjct: 12 KSGNPAHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 71
>gi|297817438|ref|XP_002876602.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
lyrata]
gi|297322440|gb|EFH52861.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
lyrata]
Length = 496
Score = 310 bits (794), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 147/250 (58%), Positives = 181/250 (72%), Gaps = 4/250 (1%)
Query: 44 PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
P C Q Q+ T++ SQ + S + STCP YFRWIHEDL PWK TGITR M
Sbjct: 62 PYGCDFVQNQSNQTHF---SQNRASRLNPIRSKPSTCPSYFRWIHEDLRPWKQTGITRGM 118
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
+E A++TAHFRL++ N K Y+ +Y++SIQTRD FT+WGI+QLLR +PG+LPDLELMFD D
Sbjct: 119 IEEASRTAHFRLVIRNGKAYVKRYRKSIQTRDDFTLWGIVQLLRWFPGKLPDLELMFDAD 178
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DRPV+RS D+ G K PPP+FRY D ++DIVFPDWSFWGWAE+NIKPW L +KE
Sbjct: 179 DRPVVRSADFIG-QQKKPPPVFRYCSDDASLDIVFPDWSFWGWAEVNIKPWGESLEAIKE 237
Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
GNN W +R YAYW+GNP V R DLL CN+S+ +WN RLY+QDW ESK G++ S
Sbjct: 238 GNNMTQWKERVAYAYWRGNPHVDPGRGDLLKCNVSENEEWNTRLYIQDWDKESKEGYKNS 297
Query: 284 NLASQCAHRY 293
NL +QC HRY
Sbjct: 298 NLENQCTHRY 307
>gi|297745255|emb|CBI40335.3| unnamed protein product [Vitis vinifera]
Length = 578
Score = 310 bits (793), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 156/290 (53%), Positives = 189/290 (65%), Gaps = 12/290 (4%)
Query: 5 FAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTS 63
F + QS F+ + R S KP N K+I LNC + N QTCP T
Sbjct: 113 FGVVLFFQSVFTGDSF-RKVWSHSSSIKPCN---KKIEYSLNCSEGNMAQTCPVTPLATF 168
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVY 123
+ E P T TCP+YFRWI+EDL PW+ TGITRDM+ERA A+ R+++V+ KVY
Sbjct: 169 EPSE-------PSTETCPEYFRWIYEDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVY 221
Query: 124 IHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPP 183
+ KYK QTRDVFTIWGILQLLR YPG+LPD +LMF C D+ +++R + GPN PPP
Sbjct: 222 MEKYKGVYQTRDVFTIWGILQLLRMYPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPP 281
Query: 184 LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP 243
LF Y GD T DIVFPDWSFWGW EI+IK W +L ++LKEGNN WIDREPYAYWKGN
Sbjct: 282 LFHYCGDDETYDIVFPDWSFWGWPEIHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNI 341
Query: 244 FVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+ R +L CN + + DWNAR+Y DW E GF S+LASQC HRY
Sbjct: 342 NLGLARHELSKCNKTSEQDWNARIYDIDWRQEIHSGFNSSDLASQCTHRY 391
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 45/60 (75%)
Query: 106 RANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
++ AH R+++V+ KV++ KY+ QTR V TIW ILQLLR YPG+LPDL+L+F+C +
Sbjct: 15 KSGNPAHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 74
>gi|359489776|ref|XP_002273194.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 521
Score = 310 bits (793), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 146/276 (52%), Positives = 183/276 (66%), Gaps = 8/276 (2%)
Query: 19 NISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPT 77
N+S + + P ++ LNC + N +QTCP P + E PP+
Sbjct: 58 NVSILTGDFLKNTSPTKLHSQKFEYSLNCREGNMSQTCPVTGPVAFEPGE-------PPS 110
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
TCP+YFRWI+EDL PW TGITR M+E+A A R+++V+ KVY+ KYK+ + RD F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
TIWGILQLLR YPG+LPD +LMF+C DRP+IR+ Y GP+ PPPLF Y GD T DIV
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIV 230
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW E NIKPW ++LKEGN WIDREPYAYWKGN + R++L C
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRN 290
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+D+ DWNARLY+ DW E + GF+ S+LASQC HRY
Sbjct: 291 TDEQDWNARLYIMDWGREVQSGFKTSDLASQCTHRY 326
>gi|357136753|ref|XP_003569968.1| PREDICTED: uncharacterized protein LOC100831246 [Brachypodium
distachyon]
Length = 543
Score = 309 bits (792), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 137/217 (63%), Positives = 170/217 (78%), Gaps = 1/217 (0%)
Query: 78 STCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
S+CP YFR+IHEDL PW+ G ITR ML+RA TA+FRL+++ + YI + QTRD+
Sbjct: 138 SSCPAYFRFIHEDLHPWRAAGGITRAMLDRARATANFRLVVLRGRAYIELIAPAFQTRDL 197
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGILQLLR+YPGR+PDL+LMFDC D PV+R+ Y G N PPLFRY GD T+D+
Sbjct: 198 FTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVRADQYEGENATAMPPLFRYCGDNETLDV 257
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGWAEINIKPW++L ++L GN W+DREPYAYWKGNP VA R++L+ CN
Sbjct: 258 VFPDWSFWGWAEINIKPWDALRKDLDAGNRRVRWVDREPYAYWKGNPDVAAIRQELVKCN 317
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+S K +WNAR+Y QDWI ESK G+++S+LASQC HRY
Sbjct: 318 VSSKQEWNARIYKQDWIKESKAGYKKSDLASQCTHRY 354
>gi|147791844|emb|CAN70600.1| hypothetical protein VITISV_027960 [Vitis vinifera]
Length = 521
Score = 308 bits (790), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 145/276 (52%), Positives = 183/276 (66%), Gaps = 8/276 (2%)
Query: 19 NISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPT 77
N+S + + P ++ LNC + N +QTCP P + E PP+
Sbjct: 58 NVSILTGDFLKNTSPTKLHSQKFEYSLNCREGNMSQTCPVTGPVAFEPGE-------PPS 110
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
TCP+YFRWI+EDL PW TGITR M+E+A A R+++V+ KVY+ KYK+ + RD F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
TIWGILQLLR YPG+LPD +LMF+C DRP+I++ Y GP+ PPPLF Y GD T DIV
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIV 230
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW E NIKPW ++LKEGN WIDREPYAYWKGN + R++L C
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRN 290
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+D+ DWNARLY+ DW E + GF+ S+LASQC HRY
Sbjct: 291 TDEQDWNARLYIMDWGREVQSGFKTSDLASQCTHRY 326
>gi|359489780|ref|XP_002273350.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
Length = 509
Score = 306 bits (785), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 144/256 (56%), Positives = 182/256 (71%), Gaps = 9/256 (3%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
KR +NC + N T+TCP T T S+ S T TCP+YFRWI+EDL PW T
Sbjct: 74 KRFEYSMNCSEGNMTKTCPV----TLLTTFEPSNLS---TGTCPEYFRWIYEDLKPWTET 126
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GITRDM+ERA AH R+++V+ KVY KYK QTRDVFTIWGILQ+LR YPG+LPD +
Sbjct: 127 GITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFTIWGILQVLRMYPGKLPDFD 186
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
LMF+C D+PVI+ DY G N PP LF Y GD T+DIVFPDWSFWGW EI IKPW +L
Sbjct: 187 LMFECGDKPVIKKHDYQGLNATAPP-LFHYCGDDETLDIVFPDWSFWGWPEIRIKPWSTL 245
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESK 277
++L+EGNN W+DREPYAYWKGN + TR +L C+ +++ DWNAR+Y DW+ E +
Sbjct: 246 RKDLREGNNKTKWVDREPYAYWKGNFKMGVTRHELSKCSKTNEQDWNARIYNMDWLQEMQ 305
Query: 278 RGFQQSNLASQCAHRY 293
GF+ ++L++QC H+Y
Sbjct: 306 NGFKSADLSTQCTHKY 321
>gi|219363409|ref|NP_001136898.1| uncharacterized protein LOC100217055 [Zea mays]
gi|194697518|gb|ACF82843.1| unknown [Zea mays]
gi|413923256|gb|AFW63188.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
Length = 551
Score = 305 bits (780), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 136/218 (62%), Positives = 170/218 (77%), Gaps = 1/218 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
+STCP YFR+IHEDL PW+ G ITR M++RA TA+FRL+++ + YI + + QTRD
Sbjct: 145 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 204
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ + Y G N PPLFRY GD T+D
Sbjct: 205 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQGENATLLPPLFRYCGDNETLD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW EINIKPW++L +EL GN W+ REPYAYWKGNP VA TR++L+ C
Sbjct: 265 VVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREPYAYWKGNPDVAVTRQELVKC 324
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N+S KH+WNAR+Y QDW+ E K G++QS+LASQC HRY
Sbjct: 325 NVSSKHEWNARIYKQDWLKEIKAGYKQSDLASQCTHRY 362
>gi|359489782|ref|XP_002273411.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 525
Score = 304 bits (778), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 141/250 (56%), Positives = 173/250 (69%), Gaps = 8/250 (3%)
Query: 45 LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
LNC + N +QTCP P + E PP+ TCP+YFRWI+EDL PW TGITR M
Sbjct: 88 LNCREGNVSQTCPVTGPVAFEPSE-------PPSETCPEYFRWIYEDLRPWMDTGITRAM 140
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
+E+A A R+++V+ KVY+ KYK+ RD FTIWGILQLLR YP +LPD +LMF+C
Sbjct: 141 VEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEFTIWGILQLLRMYPEKLPDFDLMFECR 200
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DRP+I++ Y GP+ PPPLF Y GD T DIVFPDWSFWGW E NIKPW ++LKE
Sbjct: 201 DRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGWPETNIKPWNGFKKDLKE 260
Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
GN WIDREPYAYWKGN + R++L C +D+ DWNARLY+ DW E + GF+ S
Sbjct: 261 GNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNARLYIMDWGREVQSGFKTS 320
Query: 284 NLASQCAHRY 293
+LASQC HRY
Sbjct: 321 DLASQCTHRY 330
>gi|414586114|tpg|DAA36685.1| TPA: hypothetical protein ZEAMMB73_061596 [Zea mays]
Length = 433
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 135/217 (62%), Positives = 171/217 (78%), Gaps = 1/217 (0%)
Query: 78 STCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
++CP YFR+IHEDL PW+ G ITR ML+RA TA FRL+++ Y+ +++ + QTRD+
Sbjct: 143 TSCPSYFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGNAYVQRFRPAFQTRDL 202
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWG+LQLLR++PGR+PDL+LMFD D PV+RS Y G N + PPLFRY GD T+DI
Sbjct: 203 FTIWGVLQLLRRHPGRVPDLDLMFDTVDWPVVRSHLYRGKNAEMLPPLFRYCGDDKTLDI 262
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EINIKPW++L +LK GNN W+DREPYAYWKGNP V+ TR++L+ CN
Sbjct: 263 VFPDWSFWGWPEINIKPWDALKEDLKAGNNRVRWMDREPYAYWKGNPSVSGTRKELVKCN 322
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+S HDWNAR+Y QDW ESK G++ S+L+SQCAHRY
Sbjct: 323 VSSTHDWNARIYAQDWFKESKAGYKDSDLSSQCAHRY 359
>gi|413923257|gb|AFW63189.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
Length = 578
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 135/218 (61%), Positives = 170/218 (77%), Gaps = 1/218 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
+STCP YFR+IHEDL PW+ G ITR M++RA TA+FRL+++ + YI + + QTRD
Sbjct: 145 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 204
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ + Y G N PPLFRY GD T+D
Sbjct: 205 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQGENATLLPPLFRYCGDNETLD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW EINIKPW++L +EL GN W+ REPYAYWKGNP VA TR++L+ C
Sbjct: 265 VVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREPYAYWKGNPDVAVTRQELVKC 324
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N+S KH+WNAR+Y QDW+ E K G++QS+LASQC HR+
Sbjct: 325 NVSSKHEWNARIYKQDWLKEIKAGYKQSDLASQCTHRF 362
>gi|356561570|ref|XP_003549054.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
2-like [Glycine max]
Length = 426
Score = 303 bits (775), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 146/256 (57%), Positives = 178/256 (69%), Gaps = 17/256 (6%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
++ PLNC N T TCPTN ++ P T+TCPDYFRWIHEDL PW T
Sbjct: 23 RKPIAPLNCTAYNLTGTCPTNLQDHQRS---------PATATCPDYFRWIHEDLRPWART 73
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
E A +A+ R I++ + Y+ Y + QTRDVF+IWGILQLLR+YPG++PDLE
Sbjct: 74 -------EHAPTSANLRFIILKGRAYLETYSRPYQTRDVFSIWGILQLLRRYPGKIPDLE 126
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
LMFDC D PV+ S Y+GPN + PPPLFRY G+ T+D+VF DWSFWGWAEINIKPW L
Sbjct: 127 LMFDCVDWPVVXSDRYNGPNVEQPPPLFRYCGNDATLDVVFLDWSFWGWAEINIKPWHIL 186
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESK 277
L ELKEG W++REPYAYWKGNP VAETR L+ CN+S+ DWNARL QDW+ ES+
Sbjct: 187 LGELKEGTTRIPWLNREPYAYWKGNPAVAETRXYLMKCNVSENQDWNARLLAQDWLRESQ 246
Query: 278 RGFQQSNLASQCAHRY 293
GF +S+L SQC +RY
Sbjct: 247 EGFNKSDLPSQCTYRY 262
>gi|242066560|ref|XP_002454569.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
gi|241934400|gb|EES07545.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
Length = 552
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 133/218 (61%), Positives = 169/218 (77%), Gaps = 1/218 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
+STCP YFR+IHEDL PW+ G ITR M++RA TA+FRL+++ + YI + + QTRD
Sbjct: 146 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 205
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ + Y G N PPLFRY G+ T+D
Sbjct: 206 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYEGENATVLPPLFRYCGNNETLD 265
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW EINIKPW++L +EL GN W++REPYAYWKGNP VA R++L+ C
Sbjct: 266 VVFPDWSFWGWPEINIKPWDALQKELNRGNKRVKWLNREPYAYWKGNPDVAVIRQELVKC 325
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N+S +H+WNAR+Y QDW+ E K G++QSNLA QC HRY
Sbjct: 326 NVSSEHEWNARIYKQDWLKEIKAGYKQSNLAGQCTHRY 363
>gi|356506263|ref|XP_003521906.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
Length = 476
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 146/291 (50%), Positives = 189/291 (64%), Gaps = 17/291 (5%)
Query: 6 AFSIILQSNFS-VHNISRNKTVIIVSRK-PQNRPPKRITTPLNCVK-NQTQTCPTNYPKT 62
A + +L+ N S + + KT+II +++ PQ PLNC N +TC + YP T
Sbjct: 17 ASTFLLEINLSRITRTTIFKTIIIFNKQQPQ--------FPLNCTDGNLAKTCSSYYPTT 68
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+ S T++CP+YF+WIHEDL PWK GITRDM+ER +HFRL++VN K
Sbjct: 69 FDLDDDSS------TTSCPNYFKWIHEDLKPWKSKGITRDMVERGKNVSHFRLVIVNGKA 122
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y+ KY + QTRDVFTIWGILQLLR YPG++PDL+LMF C D+PV+ +D+ GP PP
Sbjct: 123 YVEKYDKVYQTRDVFTIWGILQLLRLYPGKIPDLDLMFQCGDKPVVLKKDFQGPQAMSPP 182
Query: 183 PLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN 242
P+F Y GD DIVFPDWSFWGW EINI PWE+ L ++ EGN W DR PYA+WKGN
Sbjct: 183 PVFHYCGDENAHDIVFPDWSFWGWPEINIGPWETTLHKILEGNKMIKWKDRTPYAFWKGN 242
Query: 243 PFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+A+ RR+L CN + +HDWNAR++ W E F+ S L +QC RY
Sbjct: 243 LAMADIRRELGKCNPTKEHDWNARIHNIQWNKEEANNFESSKLENQCNFRY 293
>gi|147810897|emb|CAN69493.1| hypothetical protein VITISV_037876 [Vitis vinifera]
Length = 378
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 132/191 (69%), Positives = 161/191 (84%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ERA +TA F+L+++N + Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC
Sbjct: 1 MVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PVI+S++Y GPN PPPLFRY GD T+DIVFPDWSFWGW EI IKPWESLL++LK
Sbjct: 61 VDWPVIKSKEYHGPNTTAPPPLFRYCGDDATLDIVFPDWSFWGWPEIXIKPWESLLKDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+Y QDWI ES+ G++Q
Sbjct: 121 EGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARVYTQDWIXESQEGYKQ 180
Query: 283 SNLASQCAHRY 293
S+LASQC HRY
Sbjct: 181 SDLASQCIHRY 191
>gi|359489778|ref|XP_002271353.2| PREDICTED: protein O-glucosyltransferase 1 [Vitis vinifera]
Length = 455
Score = 300 bits (768), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 146/253 (57%), Positives = 174/253 (68%), Gaps = 15/253 (5%)
Query: 42 TTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
TT NC K T TC T Q +S + CPDYFRWI+EDL PW TGIT+
Sbjct: 28 TTEENCTKENT-TCSTK-------QHPLSG------NACPDYFRWIYEDLRPWAETGITQ 73
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
+M+E A A FRL++++ KVY+ KYK + QTRDVFTIWGILQLL+ YPG++PDLELMF+
Sbjct: 74 EMMEAAKDPAFFRLVILDGKVYMEKYKGAFQTRDVFTIWGILQLLKLYPGKVPDLELMFE 133
Query: 162 CDDRPVIRSRDYSG-PNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
C DRP I++ DY G K PPLF Y T+DIVFPDWSFWGW EINIKPW SL +E
Sbjct: 134 CGDRPRIKATDYGGRKGKKKVPPLFHYCASDDTLDIVFPDWSFWGWPEINIKPWNSLRKE 193
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGF 280
L+EGNN W+DREPYAYWKGN + R+ L C S+ HDWNAR+Y DW ES GF
Sbjct: 194 LEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCRPSNNHDWNARVYDMDWGRESHEGF 253
Query: 281 QQSNLASQCAHRY 293
+ SNLASQC H+Y
Sbjct: 254 KDSNLASQCTHKY 266
>gi|356522630|ref|XP_003529949.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 497
Score = 299 bits (766), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 141/271 (52%), Positives = 185/271 (68%), Gaps = 15/271 (5%)
Query: 24 KTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTNYPKTSQTQESISDYSIPPTSTCPD 82
KT+I+ + N+ P+ PLNC ++ TCP+ YP + + S+ ++CP+
Sbjct: 58 KTIIVFN----NQQPQ---FPLNCTNGKSASTCPSYYPTKLEFDDDSSN------TSCPE 104
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFRWIHEDL PW+ TGITRDM+ER +HFRL++VN K YI K+ +S QTRDVFTIWGI
Sbjct: 105 YFRWIHEDLKPWESTGITRDMVERGKHISHFRLVIVNGKAYIEKFAKSYQTRDVFTIWGI 164
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
LQLLR YPG++PDLELMF C DR V+ +D+ P PPP+F Y G+ + DIVFPDW+
Sbjct: 165 LQLLRLYPGKIPDLELMFQCGDRTVVFKKDFQVP-KMSPPPVFHYCGEENSYDIVFPDWT 223
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGWAE++I+PWE+ L + EGN W DR PYA+WKGNP V+ RR+L CN ++KHD
Sbjct: 224 FWGWAELSIRPWETTLHNILEGNKLVKWKDRIPYAFWKGNPTVSIIRRELGKCNTTEKHD 283
Query: 263 WNARLYVQDWILESKRGFQQSNLASQCAHRY 293
WNAR+Y W+ E F+ S L +QC RY
Sbjct: 284 WNARIYDIQWLRERASNFENSKLENQCTFRY 314
>gi|242073834|ref|XP_002446853.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
gi|241938036|gb|EES11181.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
Length = 555
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 132/212 (62%), Positives = 168/212 (79%), Gaps = 1/212 (0%)
Query: 83 YFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
YFR+IHEDL PW+ G ITR ML+RA TA FRL+++ + Y+H+ + + QTRD+FTIWG
Sbjct: 160 YFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGRAYVHRLRPAFQTRDLFTIWG 219
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
+LQLLR+YPGR+PDL+LMFD D P++R+ Y G + PPLFRY GD T+DIVFPDW
Sbjct: 220 VLQLLRRYPGRVPDLDLMFDTVDWPIVRAHLYRGKYAEMLPPLFRYCGDDKTLDIVFPDW 279
Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
SFWGW EINIKPW++L +LK+GNN W+DREPYAYWKGNP V+ TR++L+ CN+S H
Sbjct: 280 SFWGWPEINIKPWDALQEDLKDGNNRVRWMDREPYAYWKGNPSVSATRKELVKCNVSSTH 339
Query: 262 DWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
DWNAR+Y QDW ESK G++ S+L+SQCAHRY
Sbjct: 340 DWNARIYAQDWFKESKAGYKDSDLSSQCAHRY 371
>gi|297745254|emb|CBI40334.3| unnamed protein product [Vitis vinifera]
Length = 819
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 131/217 (60%), Positives = 165/217 (76%), Gaps = 1/217 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
T TCP+YFRWI+EDL PW TGITRDM+ERA AH R+++V+ KVY KYK QTRDV
Sbjct: 26 TGTCPEYFRWIYEDLKPWTETGITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDV 85
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGILQ+LR YPG+LPD +LMF+C D+PVI+ DY G N PP LF Y GD T+DI
Sbjct: 86 FTIWGILQVLRMYPGKLPDFDLMFECGDKPVIKKHDYQGLNATAPP-LFHYCGDDETLDI 144
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EI IKPW +L ++L+EGNN W+DREPYAYWKGN + TR +L C+
Sbjct: 145 VFPDWSFWGWPEIRIKPWSTLRKDLREGNNKTKWVDREPYAYWKGNFKMGVTRHELSKCS 204
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+++ DWNAR+Y DW+ E + GF+ ++L++QC H+Y
Sbjct: 205 KTNEQDWNARIYNMDWLQEMQNGFKSADLSTQCTHKY 241
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 121/207 (58%), Positives = 148/207 (71%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
+ EDL PW TGITR M+E+A A R+++V+ KVY+ KYK+ RD FTIWGILQLL
Sbjct: 418 LEEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEFTIWGILQLL 477
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
R YP +LPD +LMF+C DRP+I++ Y GP+ PPPLF Y GD T DIVFPDWSFWGW
Sbjct: 478 RMYPEKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGW 537
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
E NIKPW ++LKEGN WIDREPYAYWKGN + R++L C +D+ DWNAR
Sbjct: 538 PETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNAR 597
Query: 267 LYVQDWILESKRGFQQSNLASQCAHRY 293
LY+ DW E + GF+ S+LASQC HRY
Sbjct: 598 LYIMDWGREVQSGFKTSDLASQCTHRY 624
>gi|224064021|ref|XP_002301352.1| predicted protein [Populus trichocarpa]
gi|222843078|gb|EEE80625.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 133/221 (60%), Positives = 165/221 (74%), Gaps = 4/221 (1%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
++ CP+YFRWIHEDL WK TGI+R M+ERA AHFRL+++ K+Y+ KYK+S TRDV
Sbjct: 20 STACPEYFRWIHEDLRIWKSTGISRAMVERAKDYAHFRLVILKGKIYVEKYKKSFHTRDV 79
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGILQLLR YPG++PDLELMF CDDRPVI +DY G N P +F+Y G + I
Sbjct: 80 FTIWGILQLLRLYPGKVPDLELMFWCDDRPVILKKDYQGTNATSSPSIFQYCGREDALGI 139
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDW+FWGWAE N+ PW++L ++LKE N W DR PYAYW+GNP VA +RR L+ CN
Sbjct: 140 VFPDWTFWGWAETNVSPWKTLSKDLKEANKRTKWKDRVPYAYWRGNPNVAASRRQLMWCN 199
Query: 257 LSDKHDWNARLYVQ----DWILESKRGFQQSNLASQCAHRY 293
+SDK+DWNARLY Q DW ES++G++ S L QC HRY
Sbjct: 200 VSDKYDWNARLYKQASYIDWRTESEQGYEHSRLEDQCTHRY 240
>gi|357168046|ref|XP_003581456.1| PREDICTED: protein O-glucosyltransferase 1-like [Brachypodium
distachyon]
Length = 543
Score = 296 bits (759), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 130/215 (60%), Positives = 167/215 (77%), Gaps = 1/215 (0%)
Query: 80 CPDYFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
CP YFR+IHEDL PW+ G+TR ML RA TA FRL+++ +V++ +++ + QTRD+FT
Sbjct: 145 CPSYFRFIHEDLRPWREAGGVTRAMLARARVTASFRLVVLRGRVFVQRFRPAFQTRDLFT 204
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
IWGILQL+R+YPGR+PDL+LMFDC D PV+R+ Y G + PPLFRY GD T+DIVF
Sbjct: 205 IWGILQLIRRYPGRVPDLDLMFDCVDWPVVRTHLYRGKHAPFMPPLFRYCGDDRTLDIVF 264
Query: 199 PDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLS 258
PDWSFWGW EINIKPW +L +ELK+GNN W+DREPYAYWKGN VA +RR+L+ CN+S
Sbjct: 265 PDWSFWGWPEINIKPWGALQKELKDGNNKVRWLDREPYAYWKGNAAVAVSRRELVQCNVS 324
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
DWNAR+Y QDW E + G++ S+L+SQC +RY
Sbjct: 325 STQDWNARIYTQDWFKEGRTGYKSSDLSSQCTYRY 359
>gi|326509457|dbj|BAJ91645.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 510
Score = 296 bits (759), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 135/221 (61%), Positives = 167/221 (75%), Gaps = 3/221 (1%)
Query: 75 PPT-STCPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRLILVNNKVYIHKYKQSIQ 132
PPT +TCP YFRWIHEDL PW+ TGITR+ LE A + A FR+ +++ ++Y+ +Y + Q
Sbjct: 110 PPTPTTCPVYFRWIHEDLRPWRATGITREALEGARRYGAKFRVTVLSGRLYVARYGRCFQ 169
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
TRDVFT WGILQLLR+Y GR+PDL+LMFDC D PV+ + D+ G + PPPLFRY G
Sbjct: 170 TRDVFTQWGILQLLRRYAGRVPDLDLMFDCQDLPVVNAGDH-GSHTPPPPPLFRYCGSEP 228
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
T+DI FPDWSFWGW E+NIKPWE+L RE+KE N +W R PYAYWKGNP VA RR+L
Sbjct: 229 TLDIAFPDWSFWGWPELNIKPWEALRREIKEANAALDWTRRAPYAYWKGNPAVAAARREL 288
Query: 253 LTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
L CN+S K DWNAR+Y QDW E + GF++S+LA QC HRY
Sbjct: 289 LKCNVSGKRDWNARIYAQDWRTEVRDGFRESDLAKQCTHRY 329
>gi|147798876|emb|CAN65870.1| hypothetical protein VITISV_012498 [Vitis vinifera]
Length = 504
Score = 296 bits (757), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 135/217 (62%), Positives = 162/217 (74%), Gaps = 1/217 (0%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CPDYFRWI+EDL PW TGIT++M+E A A FRL++++ KVY+ KYK + QTRDVF
Sbjct: 99 NACPDYFRWIYEDLRPWAETGITQEMMEAAKDPAFFRLVILDGKVYMEKYKGAFQTRDVF 158
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG-PNNKGPPPLFRYSGDRWTMDI 196
TIWGILQLL+ YPG++PDLELMF+C DRP I++ DY G K PPLF Y T+DI
Sbjct: 159 TIWGILQLLKLYPGKVPDLELMFECGDRPRIKASDYGGRKGKKKVPPLFHYCASDDTLDI 218
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EINIKPW SL +EL+EGNN W+DREPYAYWKGN + R+ L C
Sbjct: 219 VFPDWSFWGWPEINIKPWNSLRKELEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCR 278
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
S+ HDWNAR+Y DW ES GF+ SNLASQC H+Y
Sbjct: 279 PSNNHDWNARVYDMDWGRESHEGFKDSNLASQCTHKY 315
>gi|79454960|ref|NP_191687.2| uncharacterized protein [Arabidopsis thaliana]
gi|55978809|gb|AAV68866.1| hypothetical protein AT3G61280 [Arabidopsis thaliana]
gi|61742723|gb|AAX55182.1| hypothetical protein At3g61280 [Arabidopsis thaliana]
gi|110739424|dbj|BAF01622.1| hypothetical protein [Arabidopsis thaliana]
gi|332646661|gb|AEE80182.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 147/262 (56%), Positives = 174/262 (66%), Gaps = 12/262 (4%)
Query: 37 PPKRITTPLNCVK---NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
P I PLNC N TQTCP+NYP + +I + TCPDYF+WIH DL
Sbjct: 92 PKIAIKIPLNCTSLNSNTTQTCPSNYPTKFEP-------AISSSETCPDYFKWIHRDLKV 144
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ TGITR+ LERA AHFR+++ + ++Y+H+Y+++ QTRDVFTIWGILQLLR YPG++
Sbjct: 145 WQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQI 204
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
PDLEL+F C DRP I RD PPPLF Y G R DIVFPDWSFWGW E+NI
Sbjct: 205 PDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNI 264
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQD 271
K W L LKEGN W DR PYAYWKGNP V+ R DL+ CN SDK+D RLYVQD
Sbjct: 265 KEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQD 324
Query: 272 WILESKRGFQQSNLASQCAHRY 293
W E + GF+ SNL QC HRY
Sbjct: 325 WRSEIEAGFRGSNLEDQCTHRY 346
>gi|186511278|ref|NP_001118874.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646662|gb|AEE80183.1| uncharacterized protein [Arabidopsis thaliana]
Length = 378
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/262 (56%), Positives = 174/262 (66%), Gaps = 12/262 (4%)
Query: 37 PPKRITTPLNCVK---NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
P I PLNC N TQTCP+NYP + +I + TCPDYF+WIH DL
Sbjct: 92 PKIAIKIPLNCTSLNSNTTQTCPSNYPTKFEP-------AISSSETCPDYFKWIHRDLKV 144
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ TGITR+ LERA AHFR+++ + ++Y+H+Y+++ QTRDVFTIWGILQLLR YPG++
Sbjct: 145 WQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQI 204
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
PDLEL+F C DRP I RD PPPLF Y G R DIVFPDWSFWGW E+NI
Sbjct: 205 PDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNI 264
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQD 271
K W L LKEGN W DR PYAYWKGNP V+ R DL+ CN SDK+D RLYVQD
Sbjct: 265 KEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQD 324
Query: 272 WILESKRGFQQSNLASQCAHRY 293
W E + GF+ SNL QC HRY
Sbjct: 325 WRSEIEAGFRGSNLEDQCTHRY 346
>gi|297745248|emb|CBI40328.3| unnamed protein product [Vitis vinifera]
Length = 838
Score = 293 bits (750), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 130/191 (68%), Positives = 158/191 (82%), Gaps = 1/191 (0%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M++RA +TAHFRL++++ K Y+ K+++SIQTRD+FT+WGILQLLR YPGRLPDLELMFDC
Sbjct: 1 MVDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DDRPV+R RD+ GPN PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++K
Sbjct: 61 DDRPVVRMRDFRGPN-AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIK 119
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGN W DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+QDW +SK G++Q
Sbjct: 120 EGNRRTKWKDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQDWGQQSKIGYRQ 179
Query: 283 SNLASQCAHRY 293
SNL QC HRY
Sbjct: 180 SNLEDQCTHRY 190
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 68/126 (53%), Gaps = 18/126 (14%)
Query: 11 LQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESI 69
+ ++FS N + N ++KP+ P + NC + + Q CP N+P TS T +
Sbjct: 369 MMTSFSYQNSAPN------TKKPRIFPKILV----NCTIGSMAQACPANHPTTSVTGKL- 417
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
CP+YFRWIHEDL PWK TGI+R +E A A FRL++VN K Y+ +Y+
Sbjct: 418 ------SVEACPEYFRWIHEDLRPWKSTGISRFAVESAEGDADFRLVIVNGKAYVEQYRN 471
Query: 130 SIQTRD 135
T D
Sbjct: 472 YTLTGD 477
>gi|116310458|emb|CAH67462.1| OSIGBa0159I10.7 [Oryza sativa Indica Group]
gi|218195275|gb|EEC77702.1| hypothetical protein OsI_16775 [Oryza sativa Indica Group]
Length = 537
Score = 293 bits (750), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 131/212 (61%), Positives = 165/212 (77%), Gaps = 1/212 (0%)
Query: 83 YFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
YFR+IHEDL PW+ GITR ML RA TA FRL+++ + ++H+++ + QTRD+FTIWG
Sbjct: 142 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWG 201
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
+LQLLR+YPGR+PDL+LMFDC D PV+R+ Y G + PPLF Y GD T+DIVFPDW
Sbjct: 202 VLQLLRRYPGRVPDLDLMFDCADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDW 261
Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
SFWGW EINIKPW++L ++LK+GNN W+DR PYAYWKGNP VA TR++L+ CN+S
Sbjct: 262 SFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTK 321
Query: 262 DWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
DWNAR+Y QDW ESK G++ SNL SQC HRY
Sbjct: 322 DWNARIYKQDWFRESKAGYKDSNLGSQCTHRY 353
>gi|147798877|emb|CAN65871.1| hypothetical protein VITISV_012499 [Vitis vinifera]
Length = 424
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 139/257 (54%), Positives = 175/257 (68%), Gaps = 9/257 (3%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
KR +NC + N T+TCP T T S+ S T TCP+YFRWI+EDL PW T
Sbjct: 80 KRFEYSMNCSEGNMTKTCPV----TLLTTFEPSNLS---TGTCPEYFRWIYEDLKPWTET 132
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GITRDM+ERA AH R+++V+ KVY KYK QTRDVFTIWGILQ+LR YPG+LPD +
Sbjct: 133 GITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFTIWGILQVLRMYPGKLPDFD 192
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
LMF+C D+PVI+ DY GPN PPPLF Y GD T+DIVFPDWSFWGW EI I+PW +L
Sbjct: 193 LMFECGDKPVIKKHDYQGPNATAPPPLFHYCGDDETLDIVFPDWSFWGWPEIRIRPWSTL 252
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY-VQDWILES 276
++L+EGNN W+BREPYAYWKGN + TR +L C+ +++ DWNAR+Y + I +
Sbjct: 253 RKDLREGNNKTKWVBREPYAYWKGNFKMGVTRHELSKCSKTNEQDWNARIYNMAHEIRNA 312
Query: 277 KRGFQQSNLASQCAHRY 293
F Q L + + Y
Sbjct: 313 GSSFVQEELRMKFVYDY 329
>gi|255541544|ref|XP_002511836.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549016|gb|EEF50505.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 549
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 132/261 (50%), Positives = 167/261 (63%), Gaps = 8/261 (3%)
Query: 35 NRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPW 94
N +P+ C T + +SD + CPDYFRWIHEDL PW
Sbjct: 74 NESSAHNASPVKCADQSTCKVNDTVRAFDGPEYQLSD------AACPDYFRWIHEDLRPW 127
Query: 95 KVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLP 154
TGI+RD +E A + A FRL++V+ K Y+ +Y S QTRD+FTIWGI+QLLR YPGR+P
Sbjct: 128 ASTGISRDTVESAKRFATFRLVIVDGKAYVERYYHSFQTRDLFTIWGIVQLLRLYPGRVP 187
Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
D+ELMF C D P I+ DY GP PPP LF+YSG+ + FPDWSFWGW E++IKP
Sbjct: 188 DVELMFQCGDLPEIQKGDYQGPGATLPPPALFQYSGNETAFAVTFPDWSFWGWVEVDIKP 247
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD-KHDWNARLYVQDW 272
W+S+L + +G+ +NW DR PYAYW+GN V+ R+DLL C S HDWNARLY QDW
Sbjct: 248 WKSMLEGITKGSQRKNWTDRVPYAYWRGNSHVSRDRKDLLKCKSSIFSHDWNARLYSQDW 307
Query: 273 ILESKRGFQQSNLASQCAHRY 293
E+ GF+ S+L QC HRY
Sbjct: 308 GKETHNGFKHSHLEDQCTHRY 328
>gi|6633846|gb|AAF19705.1|AC008047_12 F2K11.20 [Arabidopsis thaliana]
Length = 605
Score = 284 bits (726), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 137/248 (55%), Positives = 181/248 (72%), Gaps = 14/248 (5%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
R + + P+ + ++C +N++ +C ++ S+ S CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
IHEDL PW+ TGIT++M+ER TAHFRL+++N KV++ YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
RKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296
Query: 204 WGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHD 262
WGW EINI+ W +L+E++EG + +++R+ YAYWKGNPFVA +R DLLTCNLS HD
Sbjct: 297 WGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHD 356
Query: 263 WNARLYVQ 270
WNAR+++Q
Sbjct: 357 WNARIFIQ 364
>gi|449446169|ref|XP_004140844.1| PREDICTED: O-glucosyltransferase rumi homolog [Cucumis sativus]
Length = 454
Score = 283 bits (724), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 140/276 (50%), Positives = 181/276 (65%), Gaps = 23/276 (8%)
Query: 30 SRKPQNRPPKRIT-TPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---------ST 79
R+ QN P K + +P+NC T Y ++ + I +I +T
Sbjct: 6 ERELQNYPQKEVEFSPINC---------TAYSRSEKWDSGIGPTTIEEEEEDGDGKNENT 56
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP+YFRWIHEDL PW TGITR+M+ER + A FRL++V + Y+ KY + Q RDVFT+
Sbjct: 57 CPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTL 116
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS--GPNNKGPPPLFRYSGDRWTMDIV 197
WGILQLLR YP ++PDL+LMF C+D+P + +YS GPN+ PPPLFRY GD T DIV
Sbjct: 117 WGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIV 176
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW EIN+KPWE+ ++ELKE N + WIDRE YA+WKGN F++ R LL C+
Sbjct: 177 FPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSR 236
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
S + R+Y+QDW E K+GF+ SNLA QC RY
Sbjct: 237 STQS--KLRVYMQDWQEEGKQGFKNSNLADQCFSRY 270
>gi|42569949|ref|NP_182108.2| uncharacterized protein [Arabidopsis thaliana]
gi|330255514|gb|AEC10608.1| uncharacterized protein [Arabidopsis thaliana]
Length = 523
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 146/268 (54%), Positives = 173/268 (64%), Gaps = 23/268 (8%)
Query: 37 PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
P +++TTPLN C +Q TQTCP + P+ SQ + + TCPDYFRWIH+
Sbjct: 76 PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
DL W+ TGITR+ LERA+ AHFRLI+ +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
PG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248
Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
W EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPMV 303
Query: 266 RLYVQDWILESKRGFQQSNLASQCAHRY 293
LY QDW E + G++ SNL QC HRY
Sbjct: 304 HLYRQDWSREGRIGYRTSNLEDQCTHRY 331
>gi|110738999|dbj|BAF01419.1| hypothetical protein [Arabidopsis thaliana]
Length = 523
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 146/268 (54%), Positives = 173/268 (64%), Gaps = 23/268 (8%)
Query: 37 PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
P +++TTPLN C +Q TQTCP + P+ SQ + + TCPDYFRWIH+
Sbjct: 76 PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
DL W+ TGITR+ LERA+ AHFRLI+ +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
PG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248
Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
W EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPMV 303
Query: 266 RLYVQDWILESKRGFQQSNLASQCAHRY 293
LY QDW E + G++ SNL QC HRY
Sbjct: 304 HLYRQDWSREGRIGYRTSNLEDQCTHRY 331
>gi|357168044|ref|XP_003581455.1| PREDICTED: O-glucosyltransferase rumi homolog [Brachypodium
distachyon]
Length = 520
Score = 280 bits (716), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 192/317 (60%), Gaps = 29/317 (9%)
Query: 2 IELFAFSIILQSNFSVHNISRNKTV-----IIVSRKPQN------RPPKRITTPLNCVKN 50
+ +F +I+ + S + IS + V I V QN R P +TTP ++
Sbjct: 26 VFVFFVAIVAGAFVSSYWISASARVAPIIPIAVPAIAQNAATSPLRAPG-LTTPFTSSQD 84
Query: 51 QTQTCPTNYPK--------TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
P+ P ++ + S T +CP YFRWIHEDL PW+ TGITR+
Sbjct: 85 AANRTPSELPAGLTQQPSPAPASEAATESSSATTTGSCPAYFRWIHEDLRPWRATGITRE 144
Query: 103 MLERANQ-TAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY--PGR---LPDL 156
LE A++ FR+ +V ++Y+ +Y + QTRDVFT WGILQLLR+Y GR +PDL
Sbjct: 145 TLEGAHRYMPKFRVTVVAGRLYVRRYGRCFQTRDVFTQWGILQLLRRYNTTGRRAVVPDL 204
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
+LMFDC D PV+ + ++ G + PPPLFRY G T+DI FPDWSFWGW E+NIKPW +
Sbjct: 205 DLMFDCQDLPVVDAGNHRGCH---PPPLFRYCGSEPTLDIAFPDWSFWGWPELNIKPWGT 261
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILES 276
L RE+ EGN +W R PYAYWKGNP V R +LL CN+S K DWNAR+Y QDW E
Sbjct: 262 LRREIDEGNAAVDWTRRAPYAYWKGNPMVGTARLELLKCNVSRKRDWNARIYAQDWGKEV 321
Query: 277 KRGFQQSNLASQCAHRY 293
+ GF++S+LA QC HRY
Sbjct: 322 RGGFRESDLAKQCTHRY 338
>gi|388505300|gb|AFK40716.1| unknown [Medicago truncatula]
Length = 285
Score = 279 bits (714), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 129/212 (60%), Positives = 164/212 (77%), Gaps = 12/212 (5%)
Query: 44 PLNCV--KNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGIT 100
PL C + +TQTCP +YP K + T ++ + TCP +FRWIHEDL PWK GIT
Sbjct: 83 PLRCTNGEKETQTCPRDYPTKHNPTNQN--------SHTCPSFFRWIHEDLKPWKEKGIT 134
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMF 160
R+MLE A +TA+F++++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR +PG+LPDLELMF
Sbjct: 135 REMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRMFPGKLPDLELMF 194
Query: 161 DCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
DC+DRPVI ++ GP N PPPLFRY D+W++DIVFPDWSFWGWAE NIKPW+++L+E
Sbjct: 195 DCEDRPVIHKGNFQGP-NASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWKNILKE 253
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
+KEGN W DR PYAYWKGNP VA TR+
Sbjct: 254 IKEGNKETKWKDRVPYAYWKGNPNVAATRKTF 285
>gi|357507237|ref|XP_003623907.1| KDEL motif-containing protein [Medicago truncatula]
gi|355498922|gb|AES80125.1| KDEL motif-containing protein [Medicago truncatula]
Length = 454
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 127/250 (50%), Positives = 166/250 (66%), Gaps = 4/250 (1%)
Query: 45 LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
L C N TQTCP ++ + S TSTCP++FRWIHEDL PWK TGITR+M
Sbjct: 46 LKCFNGNPTQTCPNDHSPIQAFHQDQDPSS--STSTCPEHFRWIHEDLEPWKSTGITREM 103
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
++ + R+++ KVY+ Y S QTR FT+WGI+QLLR YPGR+PDLEL+F+
Sbjct: 104 VDSGENISQLRIVIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETG 163
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DR V+ + Y GP PPP+F Y G +DIVFPDWS+WGWAE IKPWE +L++++E
Sbjct: 164 DRAVVDKKRYEGPEAVTPPPIFSYCGKNEALDIVFPDWSYWGWAETRIKPWEKVLKDIQE 223
Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
N W DR PYA+WKGN F++ R +L CN++D+HDWNAR+Y DW E +GF+ +
Sbjct: 224 SNKKIKWKDRIPYAFWKGN-FLSNPRHELRKCNVTDQHDWNARVYSVDWNEEIDQGFKNT 282
Query: 284 NLASQCAHRY 293
L QCAHRY
Sbjct: 283 KLEDQCAHRY 292
>gi|242073832|ref|XP_002446852.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
gi|241938035|gb|EES11180.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
Length = 505
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 125/216 (57%), Positives = 153/216 (70%), Gaps = 6/216 (2%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQ--TAHFRLILVNNKVYIHKYKQSIQTRDVF 137
CP YFRWIHEDL PW+ G+TR LE A + FR+ +V ++Y+ +Y + QTR F
Sbjct: 114 CPSYFRWIHEDLRPWRAAGVTRGALEGARRLFAPKFRVTVVAGRLYVARYGRCFQTRAAF 173
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
T WGILQLLR+YPGR+PDL+LMFDCDD PV+ + D ++ PPPLFRY G T+DI
Sbjct: 174 TQWGILQLLRRYPGRVPDLDLMFDCDDLPVVGAGD----RHQAPPPLFRYCGSETTLDIA 229
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW E+NIKPWE+L RE+ E N NW+DR PYAYWKGNP V R LL CN
Sbjct: 230 FPDWSFWGWPELNIKPWEALRREINEENAMVNWMDRAPYAYWKGNPKVGAERLLLLRCNA 289
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
S + DWNAR+Y QDW E + GF+ S+L+ QC HRY
Sbjct: 290 SGERDWNARVYAQDWGKEVRHGFRGSDLSKQCTHRY 325
>gi|224064019|ref|XP_002301351.1| predicted protein [Populus trichocarpa]
gi|222843077|gb|EEE80624.1| predicted protein [Populus trichocarpa]
Length = 413
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 127/230 (55%), Positives = 157/230 (68%), Gaps = 15/230 (6%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
CP +FRWIHEDL PWK TGI+RDMLERA + AHFRL++V K Y+ +Y + QTRDVFT
Sbjct: 2 ACPGFFRWIHEDLRPWKDTGISRDMLERARKHAHFRLVIVEGKAYVEQYSKPYQTRDVFT 61
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
IWGILQLL YPG++PDLELMF C D+ VI+ D G + P LF+Y G ++IVF
Sbjct: 62 IWGILQLLNLYPGKIPDLELMFRCGDKTVIQKHDIQGSDAMSPAVLFQYCGHSTALNIVF 121
Query: 199 PDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLS 258
PDW+FWGWAE NIKPW+ +L + EGN W DREPYAYW+GNP V+ R DL+ CN+S
Sbjct: 122 PDWTFWGWAETNIKPWKIVLEGMVEGNKKIKWQDREPYAYWRGNPHVSPNREDLMKCNVS 181
Query: 259 DKHDWNARLYVQ---------------DWILESKRGFQQSNLASQCAHRY 293
DK+DW ARLY Q +W E ++G++ S L QC HRY
Sbjct: 182 DKYDWLARLYEQASFLDHSPMNLADWKNWGKEREQGYKHSKLEDQCTHRY 231
>gi|42571241|ref|NP_973694.1| downstream target of AGL15 2 [Arabidopsis thaliana]
gi|44917445|gb|AAS49047.1| At2g45830 [Arabidopsis thaliana]
gi|46931276|gb|AAT06442.1| At2g45830 [Arabidopsis thaliana]
gi|330255512|gb|AEC10606.1| downstream target of AGL15 2 [Arabidopsis thaliana]
Length = 382
Score = 276 bits (706), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 123/191 (64%), Positives = 153/191 (80%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLELMFD
Sbjct: 1 MLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLELMFDP 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW+ L ++
Sbjct: 61 DDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSLVAIE 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+QDW ES+ GF+
Sbjct: 121 EGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQDWDRESREGFKN 180
Query: 283 SNLASQCAHRY 293
SNL +QC HRY
Sbjct: 181 SNLENQCTHRY 191
>gi|15233074|ref|NP_191688.1| uncharacterized protein [Arabidopsis thaliana]
gi|6850896|emb|CAB71059.1| putative protein [Arabidopsis thaliana]
gi|332646663|gb|AEE80184.1| uncharacterized protein [Arabidopsis thaliana]
Length = 455
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 136/253 (53%), Positives = 166/253 (65%), Gaps = 12/253 (4%)
Query: 44 PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
PL C N TQTCP+ YP S+ + IS TCPDYFRWI +DL W+ TGITR+
Sbjct: 17 PLKC--NATQTCPSTYP--SRLEPMISS-----LETCPDYFRWIQQDLKVWEETGITRET 67
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
LERA AHFRL++ + ++Y+H+Y ++ ++RDV TIWGILQLLR YPG++PDLEL+F C
Sbjct: 68 LERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVLTIWGILQLLRMYPGQVPDLELLFFCH 127
Query: 164 DRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
D P I RD+ P PPPLF+Y G R IVFPDWSFWGW E+NIK W L
Sbjct: 128 DIPAIWKRDFRQPEPNATWPPPPLFQYCGHREAYGIVFPDWSFWGWPEVNIKEWTKLSVA 187
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGF 280
++E N W DR PYAYWKGN V R +L+ CN SDK+D RLY QDW E + GF
Sbjct: 188 IREANKRVKWNDRVPYAYWKGNSGVHRERGNLMKCNFSDKYDPMVRLYEQDWGKEREIGF 247
Query: 281 QQSNLASQCAHRY 293
+ SNL QC HRY
Sbjct: 248 KSSNLEDQCTHRY 260
>gi|49388103|dbj|BAD25236.1| unknown protein [Oryza sativa Japonica Group]
gi|215766221|dbj|BAG98449.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 375
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 120/191 (62%), Positives = 151/191 (79%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
MLERA TA+FRL+++ + Y+ + + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLERARDTANFRLVVLRGRAYVERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+++ Y G N PPLFRY GD T+D+VFPDWSFWGW EINIKPW++L ++L
Sbjct: 61 VDWPVVQADRYQGENATAMPPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLD 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
GN W+DREPYAYWKGNP VA R++L+ CN+S KH+WNAR+Y QDWI ESK G++Q
Sbjct: 121 IGNKRVKWVDREPYAYWKGNPDVATKRKELVKCNVSSKHEWNARIYKQDWIKESKAGYKQ 180
Query: 283 SNLASQCAHRY 293
S+LASQC HRY
Sbjct: 181 SDLASQCTHRY 191
>gi|115459624|ref|NP_001053412.1| Os04g0534000 [Oryza sativa Japonica Group]
gi|38346998|emb|CAE04582.2| OSJNBb0039L24.21 [Oryza sativa Japonica Group]
gi|113564983|dbj|BAF15326.1| Os04g0534000 [Oryza sativa Japonica Group]
gi|116310457|emb|CAH67461.1| OSIGBa0159I10.6 [Oryza sativa Indica Group]
gi|125549144|gb|EAY94966.1| hypothetical protein OsI_16774 [Oryza sativa Indica Group]
gi|125591102|gb|EAZ31452.1| hypothetical protein OsJ_15588 [Oryza sativa Japonica Group]
gi|215694886|dbj|BAG90077.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 508
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 121/213 (56%), Positives = 153/213 (71%), Gaps = 2/213 (0%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQ--TAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
YFRWIHEDL PW+ GITR+ ++ A + A FR+ +V ++++ +Y + QTRD+FT W
Sbjct: 113 YFRWIHEDLRPWRDAGITREAVDGAARRYGAKFRVTVVAGRLHVARYGRCFQTRDMFTQW 172
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
G+LQLLR+Y GR+PDL+LMFDC D PV+ + D G + PPPLF Y G T+DI FPD
Sbjct: 173 GVLQLLRRYQGRVPDLDLMFDCQDLPVVNAGDRRGRTSSSPPPLFGYCGSEPTLDIAFPD 232
Query: 201 WSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
WSFWGW E+NIKPWE+L E+ +GN NW R PYAYWKGNP V RR+LL CN S K
Sbjct: 233 WSFWGWPELNIKPWETLRGEIADGNAAVNWTGRAPYAYWKGNPTVGADRRNLLRCNASGK 292
Query: 261 HDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
DWNAR+Y QDW E + GF++S+LA QC HRY
Sbjct: 293 RDWNARIYEQDWRKEVRDGFRESDLAKQCTHRY 325
>gi|297824641|ref|XP_002880203.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
lyrata]
gi|297326042|gb|EFH56462.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
lyrata]
Length = 522
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/301 (49%), Positives = 177/301 (58%), Gaps = 24/301 (7%)
Query: 4 LFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLN----CVKNQ---TQTCP 56
L S+ FS+ T I P K+I TP+N C +Q T CP
Sbjct: 43 LLGASLSWMDMFSIGGGRIKVTPIFTRNTNATIPKKKIYTPVNFTLQCSLDQNIATHVCP 102
Query: 57 TNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLI 116
+ P SQ + + TCPDYFRWI++DL W+ TGITR+ LERA+ AHFRLI
Sbjct: 103 ASNPDKSQPSKDDPE-------TCPDYFRWIYKDLEAWRGTGITRETLERASDKAHFRLI 155
Query: 117 LVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS-- 174
+ +VY+ +YK+S QTRDVFTIWGI+QLLR YPG++PDLEL+F CDD P I RDY
Sbjct: 156 IKGGRVYVQQYKKSFQTRDVFTIWGIVQLLRMYPGQVPDLELLFMCDDLPEIWRRDYRPR 215
Query: 175 -GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWID 232
G N PPPLF Y G DIVFPDWSFWGW EINIK W +L + EG W +
Sbjct: 216 PGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWGWPEINIKEWNKMLEAISEGLKKVKWEE 275
Query: 233 REPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHR 292
REPYAYWKGN V + RRDL+ C HD LY QDW E G++ SNL QC HR
Sbjct: 276 REPYAYWKGNARVGK-RRDLMNC-----HDPMVHLYSQDWWKEGSIGYKTSNLEDQCTHR 329
Query: 293 Y 293
Y
Sbjct: 330 Y 330
>gi|449523045|ref|XP_004168535.1| PREDICTED: LOW QUALITY PROTEIN: O-glucosyltransferase rumi homolog
[Cucumis sativus]
Length = 454
Score = 269 bits (688), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 141/276 (51%), Positives = 182/276 (65%), Gaps = 23/276 (8%)
Query: 30 SRKPQNRPPKRIT-TPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---------ST 79
R+ QN P K + +P+NC T Y ++ + I +I +T
Sbjct: 6 ERELQNYPQKEVEFSPINC---------TAYSRSEKWDSGIGPTTIEEEEEDGDGKNENT 56
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP+YFRWIHEDL PW TGITR+M+ER + A FRL++V +VY+ KY + Q RDVFT+
Sbjct: 57 CPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRVYVEKYSEVFQRRDVFTL 116
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG--PNNKGPPPLFRYSGDRWTMDIV 197
WGILQLLR YP ++PDL+LMF C+D+P + +YSG PN+ PPPLFRY GD T DIV
Sbjct: 117 WGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTPPPPLFRYCGDDDTFDIV 176
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW EIN+KPWE+ ++ELKE N + WIDRE YA+WKGN F++ R LL C+
Sbjct: 177 FPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSR 236
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
S + R+Y+QDW E K+GF+ SNLA QC RY
Sbjct: 237 STQS--KLRVYMQDWQEEGKKGFKNSNLADQCFSRY 270
>gi|357507235|ref|XP_003623906.1| O-glucosyltransferase rumi [Medicago truncatula]
gi|355498921|gb|AES80124.1| O-glucosyltransferase rumi [Medicago truncatula]
Length = 442
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 126/273 (46%), Positives = 168/273 (61%), Gaps = 11/273 (4%)
Query: 30 SRKPQNRPPKRITT--------PLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTC 80
S K ++ P +IT LNC + T+TC ++ + S TSTC
Sbjct: 16 SFKTKSTSPSKITIIKRQQTQFTLNCFNGSMTRTCQKDHHPIQAFHQDQDPSS--STSTC 73
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
P++FRWIHEDL PWK TGITR+ +E + R+++ K Y+ Y S QTRD+FT+W
Sbjct: 74 PEHFRWIHEDLEPWKSTGITRETVESGKNISQLRIVIKKGKAYVETYADSFQTRDLFTVW 133
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
GI+QLLR YPGR+PDLEL+F+ DR V+ + + G + PP+F Y G +DIVFPD
Sbjct: 134 GIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQGSQSVTLPPIFSYCGQNDALDIVFPD 193
Query: 201 WSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
WSFWGWAE IKPWE +L++++E N W DR PYA+WKGN V+ R L CN++D+
Sbjct: 194 WSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIPYAFWKGNTHVSSQRYKLRQCNVTDQ 253
Query: 261 HDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
HDWNAR+Y W E + GF + L QC HRY
Sbjct: 254 HDWNARIYSVHWNKEIEHGFNSTKLEDQCTHRY 286
>gi|449441480|ref|XP_004138510.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
gi|449518613|ref|XP_004166331.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 380
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 121/191 (63%), Positives = 147/191 (76%), Gaps = 1/191 (0%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ER +TAHFR+++V +VY+ KYK SIQTRDVFT+WGILQL R YP +LPDLELMFDC
Sbjct: 1 MVERGRRTAHFRVVIVEGRVYVEKYKGSIQTRDVFTMWGILQLARWYPKKLPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DDRPV+RS + + GPPPLFRY D ++DIVFPDWSFWGW EINIKPW+ +L ++K
Sbjct: 61 DDRPVVRSNGFMNAIS-GPPPLFRYCSDESSLDIVFPDWSFWGWGEINIKPWKMVLEDIK 119
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGN W DR P AYWKGNP V +RRDLL CNL+ + +W+ LYVQDW E+K G++Q
Sbjct: 120 EGNKRTRWKDRVPLAYWKGNPQVDPSRRDLLKCNLTQQQNWDTLLYVQDWDKEAKEGYKQ 179
Query: 283 SNLASQCAHRY 293
SNL QC HRY
Sbjct: 180 SNLEDQCTHRY 190
>gi|226509990|ref|NP_001142048.1| uncharacterized protein LOC100274204 [Zea mays]
gi|194706898|gb|ACF87533.1| unknown [Zea mays]
gi|413919037|gb|AFW58969.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 508
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 130/239 (54%), Positives = 163/239 (68%), Gaps = 7/239 (2%)
Query: 62 TSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLIL 117
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+ +
Sbjct: 90 TNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAV 149
Query: 118 VNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + + G
Sbjct: 150 VAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHG 209
Query: 176 PNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
+ PPP LFRY G T+DI FPDWSFWGW E+NIKPWE+L RE+ E N NW DR
Sbjct: 210 AQPQPPPPPLFRYCGSEATLDIAFPDWSFWGWPELNIKPWEALQREINEANAVVNWTDRA 269
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
PYAYWKGNP V R LL CN S + DWNAR+Y QDW E + GF+ S+L+ QC HRY
Sbjct: 270 PYAYWKGNPKVGAERLLLLRCNASGERDWNARVYAQDWGEEVRHGFRGSDLSKQCTHRY 328
>gi|21742339|emb|CAD41531.1| OSJNBb0020O11.18 [Oryza sativa Japonica Group]
gi|38346999|emb|CAE04583.2| OSJNBb0039L24.22 [Oryza sativa Japonica Group]
Length = 375
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 117/191 (61%), Positives = 149/191 (78%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML RA TA FRL+++ + ++H+++ + QTRD+FTIWG+LQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+R+ Y G + PPLF Y GD T+DIVFPDWSFWGW EINIKPW++L ++LK
Sbjct: 61 ADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
+GNN W+DR PYAYWKGNP VA TR++L+ CN+S DWNAR+Y QDW ESK G++
Sbjct: 121 DGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTKDWNARIYKQDWFRESKAGYKD 180
Query: 283 SNLASQCAHRY 293
SNL SQC HRY
Sbjct: 181 SNLGSQCTHRY 191
>gi|215736906|dbj|BAG95835.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 263 bits (673), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 117/191 (61%), Positives = 149/191 (78%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML RA TA FRL+++ + ++H+++ + QTRD+FTIWG+LQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+R+ Y G + PPLF Y GD T+DIVFPDWSFWGW EINIKPW++L ++LK
Sbjct: 61 ADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
+GNN W+DR PYAYWKGNP VA TR++L+ CN+S DWNAR+Y QDW ESK G++
Sbjct: 121 DGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTKDWNARIYKQDWFRESKAGYKD 180
Query: 283 SNLASQCAHRY 293
SNL SQC HRY
Sbjct: 181 SNLGSQCTHRY 191
>gi|87241560|gb|ABD33418.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
Length = 397
Score = 263 bits (673), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 114/217 (52%), Positives = 148/217 (68%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
TSTCP++FRWIHEDL PWK TGITR+ +E + R+++ K Y+ Y S QTRD+
Sbjct: 25 TSTCPEHFRWIHEDLEPWKSTGITRETVESGKNISQLRIVIKKGKAYVETYADSFQTRDL 84
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FT+WGI+QLLR YPGR+PDLEL+F+ DR V+ + + G + PP+F Y G +DI
Sbjct: 85 FTVWGIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQGSQSVTLPPIFSYCGQNDALDI 144
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGWAE IKPWE +L++++E N W DR PYA+WKGN V+ R L CN
Sbjct: 145 VFPDWSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIPYAFWKGNTHVSSQRYKLRQCN 204
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
++D+HDWNAR+Y W E + GF + L QC HRY
Sbjct: 205 VTDQHDWNARIYSVHWNKEIEHGFNSTKLEDQCTHRY 241
>gi|294461229|gb|ADE76177.1| unknown [Picea sitchensis]
Length = 492
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 118/217 (54%), Positives = 155/217 (71%), Gaps = 3/217 (1%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
+ CP YF+WI +D+ PW +GIT DM+E AN A FRL +VN +++I Y++ Q+RD+
Sbjct: 95 AAFCPFYFKWIRQDMKPWAKSGITIDMVEAANPEASFRLTVVNGRMFIESYRKCYQSRDL 154
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGI QLL+ YP LPDL+LMF+CDD PVI DY+ + K PPPLFRYSG T DI
Sbjct: 155 FTIWGIAQLLKFYPRLLPDLDLMFNCDDNPVIHRGDYND-STKPPPPLFRYSGSEDTFDI 213
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EI PWE+L +E++ G+ W DR+P AYWKGNP++ + R+DL+ C
Sbjct: 214 VFPDWSFWGWPEIRTPPWETLAKEIQNGSQKVKWEDRDPTAYWKGNPYMGQGRQDLMNC- 272
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+ +H W RLY QDW E+++GF+ S L+ QC HRY
Sbjct: 273 IHRRH-WGGRLYNQDWDKETRQGFRHSKLSDQCHHRY 308
>gi|242094720|ref|XP_002437850.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
gi|241916073|gb|EER89217.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
Length = 552
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 112/219 (51%), Positives = 156/219 (71%), Gaps = 3/219 (1%)
Query: 76 PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
P +CPDYFR+IH+DL PW+ GITR+ +ERA A+FRL++V + Y+ Y+++ QTRD
Sbjct: 157 PAPSCPDYFRYIHDDLRPWRGAGITREAVERARPHAYFRLVVVGGRAYVETYRRAYQTRD 216
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY-SGPNNKGPPPLFRYSGDRWTM 194
VFT WG+LQLLR+YPGR+PDL++MF CDD +R+ D+ + P++ PP+FRY D T+
Sbjct: 217 VFTQWGVLQLLRRYPGRVPDLDIMFACDDPGQVRAADFPTTPSDA--PPVFRYCKDALTL 274
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW E+ I+PW LL E+++ N W R+PYA+WKGNP R +L+
Sbjct: 275 DIVFPDWSFWGWPEVGIRPWPQLLEEVRQENERVRWPVRQPYAFWKGNPEGYRIRHELMR 334
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
CN S+ +WNARL+ Q+W + GF+ S + QC +RY
Sbjct: 335 CNASNGQEWNARLFTQNWHYAIQNGFKDSRIPKQCIYRY 373
>gi|294460113|gb|ADE75639.1| unknown [Picea sitchensis]
Length = 492
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 117/217 (53%), Positives = 153/217 (70%), Gaps = 3/217 (1%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
+ CP YF+WI +D+ PW +GIT DM+E A A FRL +VN ++YI Y++ Q+RD+
Sbjct: 95 AAFCPFYFKWIRQDMKPWAKSGITIDMVEAAKPEASFRLTVVNGRMYIESYRKCYQSRDL 154
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGI Q L+ YP LPDL+ MF+CDD PVI DY+ + K PPPLFRYSG T DI
Sbjct: 155 FTIWGIAQFLKFYPRLLPDLDSMFNCDDNPVIHRGDYND-STKPPPPLFRYSGSEDTFDI 213
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EI PWE+L +E++ G+ W DR+P AYWKGNP++ + R+DL+ C
Sbjct: 214 VFPDWSFWGWPEIRTPPWETLAKEIQNGSQKVKWEDRDPTAYWKGNPYMGQGRQDLMNC- 272
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+ +H W RLY QDW E+++GF+QS L+ QC HRY
Sbjct: 273 IHRRH-WGGRLYNQDWDKETRQGFRQSKLSDQCHHRY 308
>gi|326505488|dbj|BAJ95415.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509389|dbj|BAJ91611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 562
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 113/217 (52%), Positives = 151/217 (69%), Gaps = 5/217 (2%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYFR IH DL PW+ GI+R+ +ER A FRL +V+ + Y+ Y + QTRD+FT
Sbjct: 168 CPDYFRHIHSDLEPWREKGISREAVERGRPKAAFRLTVVSGRAYVETYHRVFQTRDLFTQ 227
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WGI QLL +YPGR+PDL+LMF+C+D P +R+ DY P+ PPLFRY D +++++FP
Sbjct: 228 WGIAQLLARYPGRVPDLDLMFNCEDMPELRAADY--PDTSAAPPLFRYCKDGTSLEVLFP 285
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC--NL 257
DWSFWGW E+NI+PW L++E+ E N W DR+PYA+WKGNP V+ RRDL C +
Sbjct: 286 DWSFWGWPEVNIRPWGPLMKEIAEENARLPWPDRQPYAFWKGNPDVSRARRDLFRCSNDS 345
Query: 258 SDKHDWNARLYVQDWILESKRGFQQS-NLASQCAHRY 293
+ DWNARL+ DW ++ GF+ S NLA QC +RY
Sbjct: 346 AAGRDWNARLFALDWGAANRNGFKGSTNLAEQCRYRY 382
>gi|147791845|emb|CAN70601.1| hypothetical protein VITISV_027961 [Vitis vinifera]
Length = 362
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 114/167 (68%), Positives = 139/167 (83%), Gaps = 1/167 (0%)
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N++ ++ K+++SIQTRD+FT+WGILQLLR YPGRLPDLELMFDCDDRPV+R RD+ GP+
Sbjct: 14 NHQGHVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDCDDRPVVRMRDFRGPS- 72
Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAY 238
PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++KEGN W DR P AY
Sbjct: 73 AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKWKDRVPLAY 132
Query: 239 WKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNL 285
W+GNP VA TR DLL CN+SDK DWN RLY+QDW +SK G++QSNL
Sbjct: 133 WRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQDWDQQSKIGYRQSNL 179
>gi|242077829|ref|XP_002443683.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
gi|241940033|gb|EES13178.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
Length = 594
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 114/211 (54%), Positives = 147/211 (69%), Gaps = 2/211 (0%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFR+IH DLSPW+ TGITR+ +ER A FRL++V+ + Y+ Y + QTRD FT WGI
Sbjct: 198 YFRYIHSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGI 257
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
QLL +YPGR+PDL+LMF+C+D P +R+ D++ ++ PPLFRY D T+DIVFPDWS
Sbjct: 258 AQLLARYPGRVPDLDLMFNCEDMPEVRAADFAAAPSQA-PPLFRYCKDDSTLDIVFPDWS 316
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGW E+NI+PW LL E+ W +REPYAYWKGNP V+ R DLL CN S +
Sbjct: 317 FWGWPEVNIRPWAPLLEEMAAETARLPWAEREPYAYWKGNPGVSGERGDLLRCNDSSG-E 375
Query: 263 WNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W R++ QDW + GF+ SNLA QC +RY
Sbjct: 376 WRTRVFWQDWGAAIRDGFRNSNLAKQCRYRY 406
>gi|226510425|ref|NP_001149452.1| LOC100283078 [Zea mays]
gi|195627344|gb|ACG35502.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 600
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 125/266 (46%), Positives = 158/266 (59%), Gaps = 19/266 (7%)
Query: 41 ITTPLNCVKNQTQTCPTNYPKT-------------SQTQESISDYSIPPTSTCPDYFRWI 87
+ P +C N + CP T + T+ S S PP CPDYFR+I
Sbjct: 152 VPIPFSCGGNGS-ACPRRQTATSPPSPSPSSSSPPAATKSSSSKNKQPPPQ-CPDYFRFI 209
Query: 88 HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
H DLSPW+ TGITR+ +ER A FRL++V+ + Y+ Y + QTRD FT WGI QLL
Sbjct: 210 HSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGIAQLLA 269
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
+YPGR+PDL+LMF+C+D P ++ + PPLFRY D T+DIVFPDWSFWGW
Sbjct: 270 RYPGRVPDLDLMFNCEDMPEVKVKPS---EESSAPPLFRYCKDDSTVDIVFPDWSFWGWP 326
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
E+NI+PW LL E+ W DREPYAYWKGNP V+ R DL CN S R+
Sbjct: 327 EVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTVSAERADLRRCNDSSSSG-GTRV 385
Query: 268 YVQDWILESKRGFQQSNLASQCAHRY 293
+ QDW + GF+ SNLA QC +RY
Sbjct: 386 FWQDWGAAIRDGFRDSNLAKQCRYRY 411
>gi|413925102|gb|AFW65034.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 599
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 125/266 (46%), Positives = 158/266 (59%), Gaps = 19/266 (7%)
Query: 41 ITTPLNCVKNQTQTCPTNYPKT-------------SQTQESISDYSIPPTSTCPDYFRWI 87
+ P +C N + CP T + T+ S S PP CPDYFR+I
Sbjct: 151 VPIPFSCGGNGS-ACPRRQTATSPPSPSPSSSSPPAGTKSSSSKNKQPPPQ-CPDYFRFI 208
Query: 88 HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
H DLSPW+ TGITR+ +ER A FRL++V+ + Y+ Y + QTRD FT WGI QLL
Sbjct: 209 HSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGIAQLLA 268
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
+YPGR+PDL+LMF+C+D P ++ + PPLFRY D T+DIVFPDWSFWGW
Sbjct: 269 RYPGRVPDLDLMFNCEDMPEVKVKPS---EESSAPPLFRYCKDDSTVDIVFPDWSFWGWP 325
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
E+NI+PW LL E+ W DREPYAYWKGNP V+ R DL CN S R+
Sbjct: 326 EVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTVSAERADLRRCNDSSSSG-GTRV 384
Query: 268 YVQDWILESKRGFQQSNLASQCAHRY 293
+ QDW + GF+ SNLA QC +RY
Sbjct: 385 FWQDWGAAIRDGFRDSNLAKQCRYRY 410
>gi|357127693|ref|XP_003565513.1| PREDICTED: uncharacterized protein LOC100832633 [Brachypodium
distachyon]
Length = 538
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 116/226 (51%), Positives = 149/226 (65%), Gaps = 3/226 (1%)
Query: 71 DYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQS 130
D S+ P CP YFR IH DL PW+ TGITRD LERA A FRL +V+ + Y+ + S
Sbjct: 131 DPSVAPPPWCPVYFRHIHTDLDPWRSTGITRDTLERAMPHAEFRLTVVSGRAYVQNLRPS 190
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG- 189
QTRDVFT WG+LQLL ++PGR+PD+++MF D + S DY PPPLFRY
Sbjct: 191 YQTRDVFTQWGVLQLLARFPGRVPDVDIMFSAGDVAQVLSADYYNTTTHPPPPLFRYCKE 250
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
++ IVFPDWSFWGW E++I+PW L+ + N W +R+PYA+WKGNP V++ R
Sbjct: 251 EKLEAAIVFPDWSFWGWPELSIRPWAPLMEDFVRENKALPWRNRQPYAFWKGNPEVSDVR 310
Query: 250 RDLLTCN--LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
RDL CN ++ DWNARL+ QDW S+ GF+ SNLA QC +RY
Sbjct: 311 RDLFKCNNDSANGKDWNARLFWQDWNAASRNGFRDSNLAKQCDYRY 356
>gi|125559830|gb|EAZ05278.1| hypothetical protein OsI_27481 [Oryza sativa Indica Group]
Length = 579
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 127/257 (49%), Positives = 159/257 (61%), Gaps = 6/257 (2%)
Query: 41 ITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTST---CPDYFRWIHEDLSPWKV 96
I P C N T TCP + P TST CP YFR I DL+ W
Sbjct: 137 IPIPFTCGSGNDTGTCPRYAASAPAPAPASMSPPPPQTSTVDYCPSYFRHIELDLAAWVA 196
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
+GI+R+ +ER + AHFRL++V + Y+ Y+++ QTRDVFT WGILQLLR+YPGR+PDL
Sbjct: 197 SGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDVFTQWGILQLLRRYPGRVPDL 256
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
+LMF+CDD P +R+ Y P LFRY D T+D++FPDWSFWGW E+NI+PW
Sbjct: 257 DLMFNCDDMPEVRAAAYPDRAAAPP--LFRYCKDPSTLDVLFPDWSFWGWPEVNIRPWAP 314
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILES 276
LL E+ E W REPYAYWKGNP V+ R++LL CN S D RLY QDW +
Sbjct: 315 LLAEMAEEKARLPWSRREPYAYWKGNPDVSPLRQELLRCNHSLPPDDTVRLYRQDWGFAN 374
Query: 277 KRGFQQSNLASQCAHRY 293
+ F+ SNLA QC HRY
Sbjct: 375 RNAFRDSNLARQCRHRY 391
>gi|115474335|ref|NP_001060764.1| Os08g0101800 [Oryza sativa Japonica Group]
gi|38637293|dbj|BAD03556.1| unknown protein [Oryza sativa Japonica Group]
gi|50725686|dbj|BAD33152.1| unknown protein [Oryza sativa Japonica Group]
gi|113622733|dbj|BAF22678.1| Os08g0101800 [Oryza sativa Japonica Group]
gi|125601894|gb|EAZ41219.1| hypothetical protein OsJ_25724 [Oryza sativa Japonica Group]
Length = 579
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 127/257 (49%), Positives = 159/257 (61%), Gaps = 6/257 (2%)
Query: 41 ITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTST---CPDYFRWIHEDLSPWKV 96
I P C N T TCP + P TST CP YFR I DL+ W
Sbjct: 137 IPIPFTCGSGNDTGTCPRYAASAPAPAPASMSPPPPQTSTVDYCPSYFRHIELDLAAWLA 196
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
+GI+R+ +ER + AHFRL++V + Y+ Y+++ QTRDVFT WGILQLLR+YPGR+PDL
Sbjct: 197 SGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDVFTQWGILQLLRRYPGRVPDL 256
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
+LMF+CDD P +R+ Y P LFRY D T+D++FPDWSFWGW E+NI+PW
Sbjct: 257 DLMFNCDDMPEVRAAAYPDRAAAPP--LFRYCKDPSTLDVLFPDWSFWGWPEVNIRPWAP 314
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILES 276
LL E+ E W REPYAYWKGNP V+ R++LL CN S D RLY QDW +
Sbjct: 315 LLAEMAEEKARLPWSRREPYAYWKGNPDVSPLRQELLRCNHSLPPDDTVRLYRQDWGFAN 374
Query: 277 KRGFQQSNLASQCAHRY 293
+ F+ SNLA QC HRY
Sbjct: 375 RNAFRDSNLARQCRHRY 391
>gi|3386610|gb|AAC28540.1| hypothetical protein [Arabidopsis thaliana]
Length = 413
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 125/224 (55%), Positives = 148/224 (66%), Gaps = 18/224 (8%)
Query: 37 PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
P +++TTPLN C +Q TQTCP + P+ SQ + + TCPDYFRWIH+
Sbjct: 76 PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
DL W+ TGITR+ LERA+ AHFRLI+ +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
PG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248
Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
W EINIK W + EG W +REPYAYWKGNP VA R
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMAR 292
>gi|356504647|ref|XP_003521107.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
Length = 373
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 106/191 (55%), Positives = 138/191 (72%), Gaps = 1/191 (0%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ER +HFRL++VN K YI K+ +S QTRDVFTIWGILQLLR YPG++PDLELMF C
Sbjct: 1 MIERGKNISHFRLVIVNGKAYIEKFAKSYQTRDVFTIWGILQLLRLYPGKVPDLELMFHC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D+ V+ +D+ GP PPP+F Y G+ + DIVFPDW+FWGWAE++I+PWE+ L ++
Sbjct: 61 GDKTVVFKKDFQGPQ-MSPPPVFHYCGEENSYDIVFPDWTFWGWAELSIRPWETTLHNIQ 119
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
EGN W DR PYA+WKGNP V+ RR+L CN+++K DWNAR+Y W+ E F+
Sbjct: 120 EGNKMVKWKDRIPYAFWKGNPKVSIIRRELGKCNVTEKQDWNARIYDIQWLQERASNFEN 179
Query: 283 SNLASQCAHRY 293
S L +QC RY
Sbjct: 180 SKLENQCTFRY 190
>gi|356530084|ref|XP_003533614.1| PREDICTED: KDEL motif-containing protein 1-like, partial [Glycine
max]
Length = 357
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/251 (48%), Positives = 152/251 (60%), Gaps = 17/251 (6%)
Query: 44 PLNCVKN-QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
PLNC T TCP +Y T Q+ S STCP+YF+WIHEDL PWK TGITR+
Sbjct: 3 PLNCFNGGMTYTCPKDYYPTKFAQDQDSS----TASTCPEYFKWIHEDLKPWKRTGITRE 58
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ER ++FRL+++ K Y KY S +TRDVFT+WGILQLLR YPG +PDLEL+ +
Sbjct: 59 MMERGRNVSYFRLVIIQGKAYAKKYADSYETRDVFTVWGILQLLRLYPGDIPDLELLLET 118
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D+P++ GP PPP+F Y G + DIVFPDW F GWA++ IK L
Sbjct: 119 GDKPMVDKEQSQGP----PPPIFHYCGHKNAYDIVFPDWIFRGWADLAIK--------LD 166
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
E N W DR PYA WKG +V+ R DL CN SD+HD A+++ W E +GFQ
Sbjct: 167 ESNKKIKWKDRLPYAIWKGKTWVSHKRNDLTKCNASDQHDSYAQIHPLHWDKEIAQGFQN 226
Query: 283 SNLASQCAHRY 293
+ L QC RY
Sbjct: 227 TKLDDQCIQRY 237
>gi|297849014|ref|XP_002892388.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
lyrata]
gi|297338230|gb|EFH68647.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
lyrata]
Length = 508
Score = 241 bits (614), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 163/244 (66%), Gaps = 12/244 (4%)
Query: 54 TCPTNY---PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
+CP N PK+ Q++ + P CPD+FRWIH DL PW TG+T++ ++RA
Sbjct: 89 SCPYNAVVQPKSLQSESVSGRRTHQPQ--CPDFFRWIHRDLEPWAKTGVTKEHVKRAKAN 146
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
A FR+++++ K+Y+ Y +Q+R +FTIWGILQLL KYPG +PD+++MFDC D+P+I
Sbjct: 147 AAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLNKYPGMVPDVDMMFDCMDKPIINQ 206
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+Y P PLFRY + +DI FPDWSFWGW+E N++PWE ++K+G+ R+W
Sbjct: 207 TEY----QSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDIKQGSRRRSW 262
Query: 231 IDREPYAYWKGNP-FVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQC 289
+++P AYWKGNP V+ R +L+ CN S W A++ QDW E+K GF+QS L++QC
Sbjct: 263 DNKQPRAYWKGNPDVVSPIRLELMKCNHS--RLWGAQIMRQDWAEEAKGGFEQSKLSNQC 320
Query: 290 AHRY 293
HRY
Sbjct: 321 NHRY 324
>gi|55296633|dbj|BAD69335.1| unknown protein [Oryza sativa Japonica Group]
gi|55297286|dbj|BAD69071.1| unknown protein [Oryza sativa Japonica Group]
Length = 542
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 116/258 (44%), Positives = 160/258 (62%), Gaps = 2/258 (0%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
PP P +C S S P ++CPDYFR+IH+DL PW+
Sbjct: 103 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 162
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD FT WG+ QLLR+Y GR+PD+
Sbjct: 163 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFTQWGVAQLLRRYAGRVPDV 222
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
++MF CDDR +R+ D++ PP+FRY D T+D+VFPDWSFWGW E+NI W +
Sbjct: 223 DIMFACDDRGRVRAADFAA-APADAPPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPA 281
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQDWILE 275
L ++ + W +REP+A+WKGNP VA R +L+ CN SD DWNARL+ QDW
Sbjct: 282 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQDWNHA 341
Query: 276 SKRGFQQSNLASQCAHRY 293
GF+ S++ QC HRY
Sbjct: 342 IHNGFKDSSIPKQCLHRY 359
>gi|115466478|ref|NP_001056838.1| Os06g0152700 [Oryza sativa Japonica Group]
gi|113594878|dbj|BAF18752.1| Os06g0152700 [Oryza sativa Japonica Group]
Length = 547
Score = 240 bits (613), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 116/258 (44%), Positives = 160/258 (62%), Gaps = 2/258 (0%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
PP P +C S S P ++CPDYFR+IH+DL PW+
Sbjct: 108 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 167
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD FT WG+ QLLR+Y GR+PD+
Sbjct: 168 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFTQWGVAQLLRRYAGRVPDV 227
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
++MF CDDR +R+ D++ PP+FRY D T+D+VFPDWSFWGW E+NI W +
Sbjct: 228 DIMFACDDRGRVRAADFAA-APADAPPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPA 286
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQDWILE 275
L ++ + W +REP+A+WKGNP VA R +L+ CN SD DWNARL+ QDW
Sbjct: 287 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQDWNHA 346
Query: 276 SKRGFQQSNLASQCAHRY 293
GF+ S++ QC HRY
Sbjct: 347 IHNGFKDSSIPKQCLHRY 364
>gi|326491107|dbj|BAK05653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 109/181 (60%), Positives = 137/181 (75%), Gaps = 3/181 (1%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML+RA TA+FRL+++ + YI + + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLDRARATANFRLVVLRGRAYIERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+ + +Y G N PPLFRY GD T+D+VFPDWSFWGWAEINIKPW++L ++L
Sbjct: 61 VDWPVVHADEYQGNNATAMPPLFRYCGDNETLDVVFPDWSFWGWAEINIKPWDALQKDLS 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
GN WIDREPYAYWKGNP VA R++L+ CN+S K +WNAR+Y QD SKR Q
Sbjct: 121 VGNKRVRWIDREPYAYWKGNPEVAAIRQELVKCNVSSKQEWNARIYKQDG---SKRARQD 177
Query: 283 S 283
+
Sbjct: 178 T 178
>gi|15222341|ref|NP_172202.1| uncharacterized protein [Arabidopsis thaliana]
gi|8954024|gb|AAF82198.1|AC067971_6 Contains similarity to an unknown protein T2J13.180 gi|6522568 from
Arabidopsis thaliana BAC T2J13 gb|AL132967. ESTs
gb|Z29835 and gb|Z29836 come from this gene [Arabidopsis
thaliana]
gi|332189973|gb|AEE28094.1| uncharacterized protein [Arabidopsis thaliana]
Length = 507
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 106/215 (49%), Positives = 151/215 (70%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPD+FRWIH DL PW TG+T++ ++RA A FR+++++ K+Y+ Y +Q+R +FTI
Sbjct: 115 CPDFFRWIHRDLEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTI 174
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WGILQLL KYPG +PD+++MFDC D+P+I +Y P PLFRY + +DI FP
Sbjct: 175 WGILQLLTKYPGMVPDVDMMFDCMDKPIINQTEY----QSFPVPLFRYCTNEAHLDIPFP 230
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCNLS 258
DWSFWGW+E N++PWE ++K+G+ R+W +++P AYWKGNP V+ R +L+ CN S
Sbjct: 231 DWSFWGWSETNLRPWEEEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHS 290
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W A++ QDW E+K GF+QS L++QC HRY
Sbjct: 291 --RLWGAQIMRQDWAEEAKGGFEQSKLSNQCNHRY 323
>gi|62320292|dbj|BAD94602.1| hypothetical protein [Arabidopsis thaliana]
Length = 507
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 106/215 (49%), Positives = 151/215 (70%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPD+FRWIH DL PW TG+T++ ++RA A FR+++++ K+Y+ Y +Q+R +FTI
Sbjct: 115 CPDFFRWIHRDLEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTI 174
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WGILQLL KYPG +PD+++MFDC D+P+I +Y P PLFRY + +DI FP
Sbjct: 175 WGILQLLTKYPGMVPDVDMMFDCMDKPIINQTEY----QSFPVPLFRYCTNEAHLDIPFP 230
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCNLS 258
DWSFWGW+E N++PWE ++K+G+ R+W +++P AYWKGNP V+ R +L+ CN S
Sbjct: 231 DWSFWGWSETNLRPWEVEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHS 290
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W A++ QDW E+K GF+QS L++QC HRY
Sbjct: 291 --RLWGAQIMRQDWAEEAKGGFEQSKLSNQCNHRY 323
>gi|125554122|gb|EAY99727.1| hypothetical protein OsI_21712 [Oryza sativa Indica Group]
Length = 308
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 110/219 (50%), Positives = 153/219 (69%), Gaps = 2/219 (0%)
Query: 76 PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
P ++CPDYFR+IH+DL PW+ GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD
Sbjct: 37 PAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRD 96
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
FT WG+ QLLR+Y GR+PD+++MF CDDR +R+ D++ PP+FRY D T+D
Sbjct: 97 AFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFA-AAPADAPPVFRYCRDATTLD 155
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW E+NI W + L ++ + W +REP+A+WKGNP VA R +L+ C
Sbjct: 156 VVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKC 215
Query: 256 N-LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N SD DWNARL+ QDW GF+ S++ QC HRY
Sbjct: 216 NPASDGKDWNARLFSQDWNHAIHNGFRDSSIPKQCLHRY 254
>gi|294460307|gb|ADE75735.1| unknown [Picea sitchensis]
Length = 194
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 106/190 (55%), Positives = 139/190 (73%), Gaps = 3/190 (1%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ERA +TA FRL+++ ++YI Y +S QTRDVFTIWG++QL+ Y LPDL+LMFDC
Sbjct: 1 MVERARRTATFRLVILKGRMYIRTYTKSFQTRDVFTIWGLIQLMENYGWMLPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PVI+++ Y+ + PPPLFRY GD ++DI FPDWSFWGWAE+N +PW+ LL ++
Sbjct: 61 VDWPVIKAKAYANASLPPPPPLFRYCGDDKSLDIAFPDWSFWGWAEVNTRPWDGLLNDIL 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
+G W DR+P A+WKGNP+VA R DL+ CNLSD+ NARLY QDWI ES +G++
Sbjct: 121 KGAKKLKWEDRDPTAFWKGNPYVAAVREDLMKCNLSDR---NARLYNQDWIKESGQGYKH 177
Query: 283 SNLASQCAHR 292
S L QC HR
Sbjct: 178 SKLPDQCHHR 187
>gi|302784816|ref|XP_002974180.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
gi|300158512|gb|EFJ25135.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
Length = 426
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 146/218 (66%), Gaps = 6/218 (2%)
Query: 79 TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+CPD+F+WIH DL+PW+ +G I+R LE A + A FR+ ++ ++Y Y Q +Q+R +F
Sbjct: 1 SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60
Query: 138 TIWGILQLLRKYP-GRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
T+WG+L LL ++P G +PD+E MF+C DRP R Y ++ PPPL Y G R T+DI
Sbjct: 61 TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRY---KSRAPPPLLAYCGSRDTVDI 117
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTC 255
FPDWSFWGWAE+ I W + G+ W +R P A+WKGNP V A+ R DLL C
Sbjct: 118 AFPDWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLAC 177
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N +DK D+ A +YVQ+WI ES++GF+ S L+ QC HRY
Sbjct: 178 NKTDKRDFGADIYVQNWIAESQQGFKNSKLSDQCRHRY 215
>gi|302770761|ref|XP_002968799.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
gi|300163304|gb|EFJ29915.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
Length = 423
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 146/218 (66%), Gaps = 6/218 (2%)
Query: 79 TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+CPD+F+WIH DL+PW+ +G I+R LE A + A FR+ ++ ++Y Y Q +Q+R +F
Sbjct: 1 SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60
Query: 138 TIWGILQLLRKYP-GRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
T+WG+L LL ++P G +PD+E MF+C DRP R Y ++ PPPL Y G R T+DI
Sbjct: 61 TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRY---KSRAPPPLLAYCGSRDTVDI 117
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTC 255
FPDWSFWGWAE+ I W + G+ W +R P A+WKGNP V A+ R DLL C
Sbjct: 118 AFPDWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLAC 177
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N +DK D+ A +YVQ+WI ES++GF+ S L+ QC HRY
Sbjct: 178 NKTDKRDFGADIYVQNWIAESQQGFKNSKLSDQCRHRY 215
>gi|395146518|gb|AFN53673.1| DUF821 [Linum usitatissimum]
Length = 474
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 100/215 (46%), Positives = 139/215 (64%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++F +IH DL PW +GIT D L A A FR+++ K+Y Y Q+R + TI
Sbjct: 109 CPEFFSYIHRDLEPWAKSGITEDQLMEAKNFAAFRIVIYQGKLYFDPYYACFQSRMMTTI 168
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG LQLL+KYPG +PD++LMFDC D+P+ ++ P PLFRY R DI FP
Sbjct: 169 WGFLQLLKKYPGMVPDVDLMFDCMDKPIFNRTEHQA----NPVPLFRYCTTREHFDIPFP 224
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCNLS 258
DWSFWGW+EINI+PW ++K+G+ + W R+P+A+WKGNP V+ R +LL CN
Sbjct: 225 DWSFWGWSEINIRPWSEEFPDIKKGSQAKRWAKRQPHAFWKGNPDVVSPVRLELLQCN-- 282
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
D + A++ QDW+ E+K GF+ S L++QC +RY
Sbjct: 283 DSRKFGAQIMRQDWVQEAKEGFEASKLSNQCNYRY 317
>gi|449520138|ref|XP_004167091.1| PREDICTED: uncharacterized protein LOC101228589 [Cucumis sativus]
Length = 472
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 98/215 (45%), Positives = 141/215 (65%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FRWIH DL PW T I+ LE + + A FR+++V ++Y+ Y +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG++Q+LR+YPG +PD+++MFDC D+P I + N P PLFRY DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
DWSFWGW E+N++ W ++K+G+ +W ++ P AYWKGNP V + R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W A++ QDW E+K G++QS L++QC HRY
Sbjct: 276 RM--WGAQIMRQDWAQEAKDGYEQSKLSNQCNHRY 308
>gi|326495828|dbj|BAJ90536.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 486
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 171/311 (54%), Gaps = 27/311 (8%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I++ S ++ +SR +T++ + +P + RPP R L C
Sbjct: 30 IAAFSAVIVIAVLHSAYDDALSRTRTLLGHNLEPTPWHPFPHDKGRPPPRAA--LRCASY 87
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK-----VTGITRDMLE 105
+ P + PK + + + CP YF I DL+PW+ GITR +LE
Sbjct: 88 LSCLPPLSQPKPAAAAAANASRP----RQCPSYFAAIRRDLAPWRRRDGGGGGITRALLE 143
Query: 106 RANQTAHFRLILVNNKVYIHK--YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
A A R+ + +H Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC
Sbjct: 144 SARFRASMRVTITGGGTRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCM 203
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DRP I ++SG PPPLFRY R +DI FPDWSFWGW E +I+PW R +K+
Sbjct: 204 DRPAINRTEHSGDGAPPPPPLFRYCTTRDHLDIPFPDWSFWGWPETHIEPWSREFRSIKQ 263
Query: 224 GNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
G+ W DR P AYWKGNP VA R LL CN D + W A + Q+W E+K G+Q
Sbjct: 264 GSRRVKWPDRVPTAYWKGNPDVASPLRLALLACN--DTNLWRAEIMRQNWEEEAKSGYQN 321
Query: 283 SNLASQCAHRY 293
S L+SQC HRY
Sbjct: 322 SKLSSQCTHRY 332
>gi|218197602|gb|EEC80029.1| hypothetical protein OsI_21713 [Oryza sativa Indica Group]
Length = 515
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 106/222 (47%), Positives = 152/222 (68%), Gaps = 5/222 (2%)
Query: 76 PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
P ++CPDYFR+IH+DL PW+ GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD
Sbjct: 146 PAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRD 205
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
FT WG+ QLLR+Y GR+PD+++MF CDDR +R+ D++ PP+FRY D T+D
Sbjct: 206 AFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFA-AAPADAPPVFRYCRDATTLD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW E+NI W + L ++ + W +REP+A+WKGNP VA R +L+ C
Sbjct: 265 VVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKC 324
Query: 256 N-LSDKHDWNARLYVQDW--ILESKRGFQQSNLASQCAHRYS 294
N SD DWNARL+ Q + +++ F Q +A Q RY+
Sbjct: 325 NPASDGKDWNARLFSQTYRRSYQTRDAFTQWGVA-QLLRRYA 365
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 85/175 (48%), Positives = 117/175 (66%), Gaps = 2/175 (1%)
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N +++ Y++S QTRD FT WG+ QLLR+Y GR+PD+++MF CDDR +R+ D++
Sbjct: 334 NARLFSQTYRRSYQTRDAFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPA 393
Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAY 238
PP+FRY D T+D+VFPDWSFWGW E+NI W + L ++ + W +REP+A+
Sbjct: 394 D-APPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAF 452
Query: 239 WKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHR 292
WKGNP VA R +L+ CN SD DWNARL+ QDW GF+ S++ QC HR
Sbjct: 453 WKGNPGVARIRGELMKCNPASDGKDWNARLFSQDWNHAIHNGFKDSSIPKQCLHR 507
>gi|357119765|ref|XP_003561604.1| PREDICTED: uncharacterized protein LOC100842005 [Brachypodium
distachyon]
Length = 616
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 110/219 (50%), Positives = 145/219 (66%), Gaps = 7/219 (3%)
Query: 82 DYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
DYFR+IH DLSPW+ +G IT LERA A FRL +V+ + Y+ Y ++ QTRDVFT W
Sbjct: 216 DYFRYIHSDLSPWRASGGITLPTLERAIPHAAFRLTVVSGRAYVETYHRAFQTRDVFTQW 275
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW-TMDIVFP 199
GILQLL +YPGR+PDL+ MF+ +D P I D G PPPLFRY D +++I+FP
Sbjct: 276 GILQLLARYPGRVPDLDAMFNLEDMPEIFRNDADGNPAPPPPPLFRYCKDGGDSVEILFP 335
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN--L 257
DWSFWGW E+NI+PW L+ N W +REP+A+WKGNP+V+ R+DL CN
Sbjct: 336 DWSFWGWPEVNIRPWAPLMENFVRENRALPWQNREPFAFWKGNPYVSNARKDLFKCNNDS 395
Query: 258 SDKHDWNARLYVQDWILESKRGFQ---QSNLASQCAHRY 293
+ ++NARL+ DW S+ GF+ +NLA QC +RY
Sbjct: 396 AAGKEFNARLFDVDWRAASRNGFKDDGSTNLAKQCKYRY 434
>gi|449455154|ref|XP_004145318.1| PREDICTED: uncharacterized protein LOC101204476 [Cucumis sativus]
Length = 472
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 141/215 (65%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FRWIH DL PW T I+ LE + + A FR+++V ++Y+ Y +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG++Q+LR+YPG +PD+++MFDC D+P I + N P PLFRY DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
DWSFWGW E+N++ W ++K+G+ +W ++ P AYWKGNP V + R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W A++ QDW E++ G++QS L++QC HRY
Sbjct: 276 RM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRY 308
>gi|449471103|ref|XP_004153209.1| PREDICTED: uncharacterized protein LOC101204904 [Cucumis sativus]
Length = 472
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 141/215 (65%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FRWIH DL PW T I+ LE + + A FR+++V ++Y+ Y +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG++Q+LR+YPG +PD+++MFDC D+P I + N P PLFRY DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
DWSFWGW E+N++ W ++K+G+ +W ++ P AYWKGNP V + R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W A++ QDW E++ G++QS L++QC HRY
Sbjct: 276 RM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRY 308
>gi|227206228|dbj|BAH57169.1| AT1G63420 [Arabidopsis thaliana]
Length = 299
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 100/183 (54%), Positives = 130/183 (71%), Gaps = 13/183 (7%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
R + + P+ + ++C +N++ +C ++ S+ S CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
IHEDL PW+ TGIT++M+ER TAHFRL+++N KV++ YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
RKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296
Query: 204 WGW 206
WGW
Sbjct: 297 WGW 299
>gi|222629271|gb|EEE61403.1| hypothetical protein OsJ_15589 [Oryza sativa Japonica Group]
Length = 535
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 131/212 (61%), Gaps = 1/212 (0%)
Query: 83 YFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
YFR+IHEDL PW+ GITR ML RA TA FRL+++ + ++H++ + +
Sbjct: 140 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFARHSRPAISSPSGA 199
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
PG P P +G PPLF Y GD T+DIVFPDW
Sbjct: 200 SSSCSAATPGGSPTSTSCSTAPTGPSSARTSTAGSTLPFMPPLFSYCGDDRTLDIVFPDW 259
Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
SFWGW EINIKPW++L ++LK+GNN W+DR PYAYWKGNP VA TR++L+ CN+S
Sbjct: 260 SFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTK 319
Query: 262 DWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
DWNAR+Y QDW ESK G++ SNL SQC HRY
Sbjct: 320 DWNARIYKQDWFRESKAGYKDSNLGSQCTHRY 351
>gi|356569533|ref|XP_003552954.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
Length = 464
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 145/215 (67%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FR IH DL+PW + I++ + A + A FR+++V KV++ Y +Q+R +FT+
Sbjct: 104 CPEFFRAIHRDLAPWLESRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTL 163
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG+LQL+R+YPG++PD+++MFDC D+P + ++ P PLFRY + DI FP
Sbjct: 164 WGLLQLMRRYPGKVPDVDMMFDCMDKPSVNRTEHQA----MPLPLFRYCTTKEHFDIPFP 219
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLS 258
DWSFWGW+EINI+PW+ ++K+G+ +W ++ P+AYWKGNP VA R +L+ CN
Sbjct: 220 DWSFWGWSEINIRPWQEEFPDIKQGSRNVSWKNKFPWAYWKGNPDVASPIRTELINCN-- 277
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
D W A + QDW ++ GF+QS L++QC HRY
Sbjct: 278 DSRKWGAEIMRQDWGEAARSGFKQSKLSNQCNHRY 312
>gi|255569363|ref|XP_002525649.1| conserved hypothetical protein [Ricinus communis]
gi|223535085|gb|EEF36767.1| conserved hypothetical protein [Ricinus communis]
Length = 491
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 143/217 (65%), Gaps = 7/217 (3%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP++FR+IH DL PW TGIT+ + A + A FR+++ ++Y+ Y +Q+R +F
Sbjct: 102 AKCPEFFRFIHHDLQPWARTGITKKHIAEAKKFAAFRVVIFEGRLYLDLYYACVQSRMMF 161
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
T+WG+LQLL +YPG +PD+++MFDC DRPVI ++ P P+FRY + DI
Sbjct: 162 TVWGLLQLLNRYPGMVPDVDIMFDCMDRPVINKTEHIS----FPLPIFRYCTTQNHFDIP 217
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCN 256
FPDWSFWGW EINI+ W R++K G+ ++W + P AYWKGNP ++ R +L+ CN
Sbjct: 218 FPDWSFWGWPEINIRSWNEEFRDIKRGSQSKSWSKKWPRAYWKGNPDVLSPIRTELMQCN 277
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
S K W A + QDW E++ GF++S L++QC +RY
Sbjct: 278 HSRK--WGAHIMRQDWGEEARAGFERSKLSNQCNYRY 312
>gi|356541729|ref|XP_003539326.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
Length = 464
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 140/215 (65%), Gaps = 7/215 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP +FR IH DL+PW + I++ + A + A FR+++V KV++ Y +Q+R +FT+
Sbjct: 104 CPKFFRAIHRDLAPWSESRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTL 163
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG+LQL+R+YPG +PD+++MFDC D+P + ++ P PLFRY + DI FP
Sbjct: 164 WGLLQLMRRYPGMVPDVDMMFDCMDKPSVNKTEHQA----MPLPLFRYCTTKEHFDIPFP 219
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLS 258
DWSFWGW+EINI+PW+ ++K G+ W ++ P+AYWKGNP VA R +L+ CN
Sbjct: 220 DWSFWGWSEINIRPWQEEFPDIKRGSRSVTWKNKLPWAYWKGNPDVASPIRTELINCN-- 277
Query: 259 DKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
D W A + QDW ++ GF+QS L+ QC HRY
Sbjct: 278 DSRKWGAEIMRQDWGEAARNGFKQSKLSDQCNHRY 312
>gi|302808049|ref|XP_002985719.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300146628|gb|EFJ13297.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 386
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 137/217 (63%), Gaps = 3/217 (1%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD FT+
Sbjct: 1 CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-MDI 196
WG+L LL++YPG +PD++LMF+C D P++ ++ N PPPLF Y R DI
Sbjct: 61 WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRGDHYDI 120
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+ C
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVKCT 180
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
++ H++N Y QDW E +R F S+L+ QC HRY
Sbjct: 181 ANNTHNYNLVTYHQDWFAERERNFNNSDLSKQCQHRY 217
>gi|413917716|gb|AFW57648.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 500
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 119/308 (38%), Positives = 170/308 (55%), Gaps = 24/308 (7%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I+L S ++ +SR +T++ + +P + RPP R L C
Sbjct: 46 IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 102
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
+ C +S + + + CP YF IH DL+PW+ G +TR +L+ A
Sbjct: 103 -SIACLPPLSSSSSSPQPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 160
Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
+ A R+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 161 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 220
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I +R G PPPLFRY R DI FPDWSFWGW E NI+PW + ++ G
Sbjct: 221 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRAGAR 279
Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQDWILESKRGFQQSNL 285
W DR P AYWKGNP VA R+ LL CN D W A + Q+W E++ G+Q S L
Sbjct: 280 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQNWDDEARSGYQHSRL 337
Query: 286 ASQCAHRY 293
++QC HRY
Sbjct: 338 SAQCTHRY 345
>gi|194707538|gb|ACF87853.1| unknown [Zea mays]
Length = 480
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 119/308 (38%), Positives = 170/308 (55%), Gaps = 24/308 (7%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I+L S ++ +SR +T++ + +P + RPP R L C
Sbjct: 26 IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 82
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
+ C +S + + + CP YF IH DL+PW+ G +TR +L+ A
Sbjct: 83 -SIACLPPLSSSSSSPQPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 140
Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
+ A R+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 141 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 200
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I +R G PPPLFRY R DI FPDWSFWGW E NI+PW + ++ G
Sbjct: 201 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRAGAR 259
Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQDWILESKRGFQQSNL 285
W DR P AYWKGNP VA R+ LL CN D W A + Q+W E++ G+Q S L
Sbjct: 260 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQNWDDEARSGYQHSRL 317
Query: 286 ASQCAHRY 293
++QC HRY
Sbjct: 318 SAQCTHRY 325
>gi|357497389|ref|XP_003618983.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
gi|355493998|gb|AES75201.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
Length = 226
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 102/197 (51%), Positives = 132/197 (67%), Gaps = 12/197 (6%)
Query: 15 FSVHNISRNK---TVIIVSRKPQNRPPKRITTPLNCVKNQTQ--TCPTNYPKTSQTQESI 69
F +H I +K T I + + N ++ PLNC N+T TCP +YP T +
Sbjct: 37 FIIHWIDLSKVTSTTIFKTIEVLNNIKQQPQFPLNC-NNETSNSTCPFSYPTTFHLNDD- 94
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
P+S+CP+YF+WIHEDL PW+ TGITR+M+E +HFRL++VN K YI K+ +
Sbjct: 95 -----SPSSSCPNYFKWIHEDLKPWEKTGITREMVESGKNMSHFRLVVVNGKAYIDKFAK 149
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
S QTRDVFTIWGILQLLR YPG++PDLELMF C D+ V+ + + GP N PPP+F Y G
Sbjct: 150 SYQTRDVFTIWGILQLLRLYPGKIPDLELMFQCGDKNVVDKKKFQGPENVSPPPIFHYCG 209
Query: 190 DRWTMDIVFPDWSFWGW 206
D T+DIVFPDW+FWGW
Sbjct: 210 DEDTLDIVFPDWTFWGW 226
>gi|302785297|ref|XP_002974420.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300158018|gb|EFJ24642.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 386
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 136/217 (62%), Gaps = 3/217 (1%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD FT+
Sbjct: 1 CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-MDI 196
WG+L LL++YPG +PD++LMF+C D P++ ++ N PPPLF Y R DI
Sbjct: 61 WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRRDHYDI 120
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+ C
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAEKLDWSHRQPIAFWKGNYDMGPARADLVKCT 180
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
++ H++N Y QDW E + F S+L+ QC HRY
Sbjct: 181 ANNTHNYNLVTYHQDWFAEREHNFNNSDLSKQCQHRY 217
>gi|226497664|ref|NP_001151574.1| lipopolysaccharide-modifying protein [Zea mays]
gi|195647860|gb|ACG43398.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 480
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/308 (38%), Positives = 169/308 (54%), Gaps = 24/308 (7%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I+L S ++ +SR +T++ + +P + RPP R L C
Sbjct: 26 IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 82
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
+ C +S + + CP YF IH DL+PW+ G +TR +L+ A
Sbjct: 83 -SIACLPPLSPSSSSSPPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 140
Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
+ A R+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 141 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 200
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I +R G PPPLFRY R DI FPDWSFWGW + NI+PW + ++ G
Sbjct: 201 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPDTNIEPWNHEFKNIRAGAR 259
Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQDWILESKRGFQQSNL 285
W DR P AYWKGNP VA R+ LL CN D W A + Q+W E++ G+Q S L
Sbjct: 260 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQNWDDEARSGYQHSRL 317
Query: 286 ASQCAHRY 293
++QC HRY
Sbjct: 318 SAQCTHRY 325
>gi|302783306|ref|XP_002973426.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300159179|gb|EFJ25800.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 449
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 140/224 (62%), Gaps = 5/224 (2%)
Query: 71 DYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQS 130
++S CPDYF+WI DLSPWK TGI++ L+ A A FR+++V+ K+Y+ +Y+
Sbjct: 88 NFSAQAQGKCPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYC 147
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
QTR +T+WGI LL ++PG++PDLELMF C DRP + ++YS + PPPLF Y
Sbjct: 148 YQTRAQYTLWGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVWPPPPLFSYCTT 207
Query: 191 R-WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
R DIVFPDWSFWGW E+NI PW ++ G W+ REP A WKGN ++ + R
Sbjct: 208 RDEHYDIVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGNTWMGKIR 267
Query: 250 RDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
L+ CN S + +Y Q+W E K F S+L+ QC++RY
Sbjct: 268 PMLVQCNSST----SILVYHQNWDDEIKNNFSSSDLSKQCSYRY 307
>gi|147785893|emb|CAN70836.1| hypothetical protein VITISV_015872 [Vitis vinifera]
Length = 922
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 106/235 (45%), Positives = 146/235 (62%), Gaps = 7/235 (2%)
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T++ E + S CP +F I DL PW +GI+ + A + A FR+++V
Sbjct: 545 PTTTKIPEWHTRQSSNTVGKCPMFFTRIXHDLQPWVRSGISLSSVMEAQKFAAFRVVIVG 604
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y+ + +Q+R +FT+WG+LQLLR+YPG +PD++LMFDC D+P I SR+ G
Sbjct: 605 GKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKPTI-SREEHGSK-- 661
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
P PLFRY DI FPDWSFWGW EI+I PW+ +K+G+ NW + YAYW
Sbjct: 662 -PLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVLNWTQKLSYAYW 720
Query: 240 KGNPFV-AETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
KGNP V + R DLL CN SD A++ QDW+ E+K GF++S L++QC HRY
Sbjct: 721 KGNPDVQSPVRVDLLQCNNSDI--IGAQIMRQDWVEEAKNGFKESKLSNQCNHRY 773
>gi|194702520|gb|ACF85344.1| unknown [Zea mays]
gi|238009730|gb|ACR35900.1| unknown [Zea mays]
gi|238011028|gb|ACR36549.1| unknown [Zea mays]
Length = 435
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 102/219 (46%), Positives = 137/219 (62%), Gaps = 8/219 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTG--ITRDMLERANQTAHFRLILVNN--KVYIHKYKQSIQTRD 135
CP YF IH DL+PW+ G +TR +L+ A + A R+ + ++++ Y +Q+R
Sbjct: 65 CPAYFAAIHRDLAPWRGQGRGVTRALLDEARRRASMRVTITGGGRRLHVDLYYACVQSRA 124
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FT+W +LQL+R+YPGR+PD++LMFDC DRP I +R G PPPLFRY R D
Sbjct: 125 LFTVWSLLQLMRRYPGRVPDVDLMFDCMDRPAI-NRTEHGDGALPPPPLFRYCTTRDHFD 183
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD-LLT 254
I FPDWSFWGW E NI+PW + ++ G W DR P AYWKGNP VA R+ LL
Sbjct: 184 IPFPDWSFWGWPETNIEPWNHEFKNIRAGARATRWADRVPTAYWKGNPDVASPLREALLG 243
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
CN D W A + Q+W E++ G+Q S L++QC HRY
Sbjct: 244 CN--DTALWRAEIMRQNWDDEARSGYQHSRLSAQCTHRY 280
>gi|359478546|ref|XP_003632132.1| PREDICTED: protein O-glucosyltransferase 1-like [Vitis vinifera]
gi|297745896|emb|CBI15952.3| unnamed protein product [Vitis vinifera]
Length = 464
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 106/235 (45%), Positives = 146/235 (62%), Gaps = 7/235 (2%)
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T++ E + S CP +F I DL PW +GI+ + A + A FR+++V
Sbjct: 87 PTTTKIPEWHTRQSSNTVGKCPMFFTRIDHDLQPWVRSGISLSSVMEAQKFAAFRVVIVG 146
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y+ + +Q+R +FT+WG+LQLLR+YPG +PD++LMFDC D+P I SR+ G
Sbjct: 147 GKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKPTI-SREEHG---S 202
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
P PLFRY DI FPDWSFWGW EI+I PW+ +K+G+ NW + YAYW
Sbjct: 203 KPLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVLNWTQKLSYAYW 262
Query: 240 KGNPFV-AETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
KGNP V + R DLL CN SD A++ QDW+ E+K GF++S L++QC HRY
Sbjct: 263 KGNPDVQSPVRVDLLQCNNSDI--IGAQIMRQDWVEEAKNGFKESKLSNQCNHRY 315
>gi|302823162|ref|XP_002993235.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
gi|300138905|gb|EFJ05656.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
Length = 475
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 135/219 (61%), Gaps = 3/219 (1%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD F
Sbjct: 85 AQCPSYFSWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDF 144
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-M 194
T+WG+L LL +YPG +PD++LMF+C D P++ ++ N PPPLF Y R
Sbjct: 145 TLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHKPEKNGSWPPPPLFLYCTSRGEHY 204
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+
Sbjct: 205 DIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVK 264
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
C ++ ++N + QDW E + F S+L+ QC HRY
Sbjct: 265 CTANNSQNYNLVTHHQDWFTEREHNFNNSDLSKQCQHRY 303
>gi|302794981|ref|XP_002979254.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300153022|gb|EFJ19662.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 474
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 135/219 (61%), Gaps = 3/219 (1%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD F
Sbjct: 85 AHCPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDF 144
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-M 194
T+WG+L LL +YPG +PD++LMF+C D P++ ++ N PPPLF Y R
Sbjct: 145 TLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRGEHY 204
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+
Sbjct: 205 DIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVK 264
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
C ++ ++N + QDW E + F S+L+ QC HRY
Sbjct: 265 CTANNSQNYNLVTHHQDWFTEREHNFNNSDLSKQCQHRY 303
>gi|357129812|ref|XP_003566555.1| PREDICTED: O-glucosyltransferase rumi-like [Brachypodium
distachyon]
Length = 493
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 178/312 (57%), Gaps = 32/312 (10%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRIT---TPLNC 47
I F+ I++ S ++ +SR +T++ + +P + RPP R PL
Sbjct: 35 IAAFSAVIVIAVLHSAYDEAVSRTRTLLGHNLEPTPWHPFPHDKGRPPPRAALRCAPLLS 94
Query: 48 VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV---TGITRDML 104
+ P S+++ T CP YF IH DL+PW+ GITR++L
Sbjct: 95 CLPPLSHPHPSPPNASRSRR---------TKQCPAYFAAIHRDLAPWRRHGHGGITRELL 145
Query: 105 ERANQTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
+ A A R+ + N ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC
Sbjct: 146 DSARSRASMRVTITGNGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDC 205
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DRP I +R GPN PPPLFRY + +DI FPDWSFWGW E +I PW R +K
Sbjct: 206 MDRPAI-NRTTGGPNPPLPPPLFRYCTTKDHLDIPFPDWSFWGWPETHINPWAKEFRAIK 264
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQ 281
+G+ W DR P A+WKGNP VA R LL CN D + W+A++ Q+W E+K G++
Sbjct: 265 QGSRRVKWGDRVPLAFWKGNPDVASPLRLALLACN--DTNLWHAQIMRQNWEEEAKSGYR 322
Query: 282 QSNLASQCAHRY 293
S L++QCAHRY
Sbjct: 323 HSALSTQCAHRY 334
>gi|242086909|ref|XP_002439287.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
gi|241944572|gb|EES17717.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
Length = 497
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 121/304 (39%), Positives = 168/304 (55%), Gaps = 23/304 (7%)
Query: 9 IILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKNQTQTCPT 57
I+L S ++ +SR +T++ + +P + RPP R L C + P
Sbjct: 35 IVLAVVHSAYDDALSRTRTLLGHNLEPTPWHPFPHSKGRPPARAA--LRCAPSIACLPPL 92
Query: 58 NYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV--TGITRDMLERANQTAHFR 114
++P T+ S + + CP YF IH DL+PW+ G+TR +L+ A + A R
Sbjct: 93 SHPPGTANASSSAAAAAGEQRRQCPAYFAAIHRDLAPWRGPGRGVTRALLDAARRRASMR 152
Query: 115 LILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRD 172
+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP I D
Sbjct: 153 VTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRPAINRTD 212
Query: 173 YSGPNNKGPPP--LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+ + PPP LFRY R DI FPDWSFWGW E NIKPW + +K G W
Sbjct: 213 HGDGDPSSPPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIKPWNREFKSIKMGARATRW 272
Query: 231 IDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQC 289
DR P AYWKGNP VA R LL CN D W A + Q+W E+K G+ S L+SQC
Sbjct: 273 ADRVPTAYWKGNPDVASPLRVALLGCN--DTAAWRAEIMRQNWDDEAKSGYTHSKLSSQC 330
Query: 290 AHRY 293
HRY
Sbjct: 331 THRY 334
>gi|302789424|ref|XP_002976480.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
gi|300155518|gb|EFJ22149.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
Length = 357
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 8/218 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYF+WI DLSPWK TGI++ L+ A A FR+++V+ K+Y+ +Y+ QTR +T+
Sbjct: 1 CPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYCYQTRAQYTL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK---GPPPLFRYSGDR-WTMD 195
WGI LL ++PG++PDLELMF C DRP + ++YS + K PPPLF Y R D
Sbjct: 61 WGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVKRRWPPPPLFSYCTTRDEHYD 120
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
IVFPDWSFWGW E+NI PW ++ G W+ REP A WKGN ++ + R L+ C
Sbjct: 121 IVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGNTWMGKIRPMLVQC 180
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N S + +Y Q+W E K F S+L+ QC++R+
Sbjct: 181 NSST----SILVYHQNWDEEIKNNFSSSDLSKQCSYRW 214
>gi|297745249|emb|CBI40329.3| unnamed protein product [Vitis vinifera]
Length = 341
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 88/145 (60%), Positives = 105/145 (72%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
YPG+LPD +LMF+C DRP+IR+ Y GP+ PPPLF Y GD T DIVFPDWSFWGW E
Sbjct: 2 YPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGWPE 61
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
NIKPW ++LKEGN WIDREPYAYWKGN + R++L C +D+ DWNARLY
Sbjct: 62 TNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNARLY 121
Query: 269 VQDWILESKRGFQQSNLASQCAHRY 293
+ DW E + GF+ S+LASQC HRY
Sbjct: 122 IMDWGREVQSGFKTSDLASQCTHRY 146
>gi|302773059|ref|XP_002969947.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300162458|gb|EFJ29071.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 325
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 139/217 (64%), Gaps = 9/217 (4%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WIHEDL+PW +GITR+ L+ A + FRL++V+ K+Y + ++ IQTRD T+
Sbjct: 2 CPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEITL 61
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT--MDIV 197
G+L LL K+PG +PD+E++F C+D P + +Y +K PPP+F Y R+ DI+
Sbjct: 62 QGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEY---RSKPPPPVFSYCTSRFGGHFDIL 118
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCN 256
FPDWSFWGW ++ I+PWE E+ +G N +W +RE AYWKGN +V R +LL CN
Sbjct: 119 FPDWSFWGWPQVKIRPWEQESVEIFDGANETDWFEREAIAYWKGNLWVMTPVREELLQCN 178
Query: 257 LSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+ +N +Y DW E GF+ S+L+ QC RY
Sbjct: 179 NT---QYNVVVYYLDWAKEEAEGFKTSSLSKQCHTRY 212
>gi|302821372|ref|XP_002992349.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
gi|300139892|gb|EFJ06625.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
Length = 330
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 133/219 (60%), Gaps = 8/219 (3%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
S CP YF WI DL+PWK GIT L+ A + A FR++++N +Y+ +Y + QTRD F
Sbjct: 4 SQCPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDF 63
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRY--SGDRWTM 194
T+ G+ LL ++PG +PD++LMF C D P I R++ G ++ PPPLF Y +
Sbjct: 64 TLRGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAK---GSDDPSPPPLFSYCTTARGENY 120
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW E+NI PWE +++ G W R+ +AYWKGN + R DL+
Sbjct: 121 DIVFPDWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNYWTGAARPDLVK 180
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
C + D + QDW E +GF+ S+++ QC HRY
Sbjct: 181 C--AANKDLFVSMISQDWNAEKGQGFKSSDMSKQCTHRY 217
>gi|302795177|ref|XP_002979352.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300153120|gb|EFJ19760.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 329
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 133/219 (60%), Gaps = 8/219 (3%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
S CP YF WI DL+PWK GIT L+ A + A FR++++N +Y+ +Y + QTRD F
Sbjct: 2 SQCPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDF 61
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRY--SGDRWTM 194
T+ G+ LL ++PG +PD++LMF C D P I R++ G ++ PPPLF Y +
Sbjct: 62 TLRGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAK---GSDDPSPPPLFSYCTTARGENY 118
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW E+NI PWE +++ G W R+ +AYWKGN + R DL+
Sbjct: 119 DIVFPDWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNYWTGAARPDLVK 178
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
C + D + QDW E +GF+ S+++ QC HRY
Sbjct: 179 C--AANKDLFVSMISQDWNAEKGQGFKSSDMSKQCTHRY 215
>gi|302799316|ref|XP_002981417.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
gi|300150957|gb|EFJ17605.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
Length = 342
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/232 (43%), Positives = 139/232 (59%), Gaps = 21/232 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WIHEDL+PW +GITR+ L+ A + FRL++V+ K+Y + ++ IQTRD T+
Sbjct: 1 CPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEITL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT--MDIV 197
G+L LL K+PG +PD+E++F C+D P + +Y +K PPP+F Y R+ DI+
Sbjct: 61 QGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEY---RSKPPPPIFSYCTSRFGGHFDIL 117
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCN 256
FPDWSFWGW ++ I+PWE E+ EG N +W +RE AYWKGN +V R +LL CN
Sbjct: 118 FPDWSFWGWPQVKIRPWEQESVEIFEGANETDWFEREAIAYWKGNLWVMTPVREELLQCN 177
Query: 257 ----------LSDKHDWNARLYV-----QDWILESKRGFQQSNLASQCAHRY 293
L H ++ QDW E GF+ S+L+ QC RY
Sbjct: 178 NTQYNVVVYYLVRIHANGDEVFTFFFNSQDWAKEEAEGFKTSSLSKQCHTRY 229
>gi|297817442|ref|XP_002876604.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
lyrata]
gi|297322442|gb|EFH52863.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
lyrata]
Length = 229
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 91/165 (55%), Positives = 116/165 (70%), Gaps = 12/165 (7%)
Query: 44 PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
PLNC N QTCP+NYP S+ + +IS + TCPDYFRWI +DL W+ TGITR+
Sbjct: 69 PLNC--NAKQTCPSNYP--SRFEPAISS-----SETCPDYFRWIQQDLKAWEETGITRET 119
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
LERA AHFRL++ + ++Y+H+Y ++ ++RDV TIWGILQLLR YPG++PDLEL+F C
Sbjct: 120 LERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVLTIWGILQLLRMYPGQVPDLELLFFCH 179
Query: 164 DRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWSFWG 205
D+P I RD+ P PPPLF+Y G R IVFPDWSFWG
Sbjct: 180 DKPAIWKRDFRQPQPNATWPPPPLFQYCGHREAYGIVFPDWSFWG 224
>gi|147802992|emb|CAN70737.1| hypothetical protein VITISV_008287 [Vitis vinifera]
Length = 694
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 93/158 (58%), Positives = 105/158 (66%), Gaps = 29/158 (18%)
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
V+T W I LLR YPGRLPDLELMFDCDDRPV+R RD+ GPN PPPLFRY GD W++D
Sbjct: 377 VYTGW-IDALLRWYPGRLPDLELMFDCDDRPVVRMRDFRGPN-XAPPPLFRYCGDDWSLD 434
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
IVFPDWSFWGW P AYW+GNP VA TR DLL C
Sbjct: 435 IVFPDWSFWGWV---------------------------PLAYWRGNPHVAPTRGDLLKC 467
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
N+SDK +WN RLY+QDW +SK G++QSNL QC HRY
Sbjct: 468 NVSDKAEWNTRLYLQDWGQQSKIGYRQSNLEDQCTHRY 505
>gi|302819894|ref|XP_002991616.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
gi|300140649|gb|EFJ07370.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
Length = 324
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 93/216 (43%), Positives = 127/216 (58%), Gaps = 10/216 (4%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
TCP YF+WI DL+PWK GITR LE + A+FR+I++ K+Y YKQ QTR +T
Sbjct: 2 TCPSYFKWIEHDLAPWK-GGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEYT 60
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIV 197
+ G+ LL K+PG +PD+++MF+C D P++ Y + PPP+F Y R DIV
Sbjct: 61 LKGLAMLLNKFPGMVPDVDIMFNCQDHPLVPRWRYLFTS---PPPVFGYCTTRNRHYDIV 117
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWS WGW E+NI PW + +W R+P AYW+GN + R +L+ CN
Sbjct: 118 FPDWSIWGWPEVNILPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQMGLIRSNLVKCNS 177
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+ N + QDWI E K F S+L++QC RY
Sbjct: 178 T-----NILIQHQDWITEEKANFTNSDLSNQCFSRY 208
>gi|302776736|ref|XP_002971516.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300160648|gb|EFJ27265.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 318
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 125/216 (57%), Gaps = 10/216 (4%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
TCP YF WI DL+PWK GITR LE + A+FR+I++ K+Y YKQ QTR +T
Sbjct: 3 TCPSYFNWIEHDLAPWK-GGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEYT 61
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIV 197
+ G+ LL ++PG +PD+++MF+C D P++ Y PPP+F Y R DIV
Sbjct: 62 LKGLAMLLNEFPGMVPDVDIMFNCQDHPLVPRWRYLF---TSPPPVFGYCTTRNRHYDIV 118
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWS WGW E+NI PW + +W R+P AYW+GN + R +L+ CN
Sbjct: 119 FPDWSIWGWPEVNIPPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQMGLIRSNLVKCNS 178
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+ N + QDWI E K F S+L++QC RY
Sbjct: 179 T-----NILIQHQDWITEEKANFTNSDLSNQCFSRY 209
>gi|224059286|ref|XP_002299807.1| predicted protein [Populus trichocarpa]
gi|222847065|gb|EEE84612.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 120/188 (63%), Gaps = 7/188 (3%)
Query: 107 ANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
A A FR+++ ++Y+ Y +Q+R +FTIWG LQLL++YPG +PD+++MFDC D+P
Sbjct: 1 AKNYASFRVVIYKGRLYLDPYYACVQSRMMFTIWGFLQLLKRYPGMVPDVDIMFDCMDKP 60
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I ++ + P PLFRY + DI FPDWSFWGW E+NI+PW+ R++K G
Sbjct: 61 SINKTEH----DSFPLPLFRYCTTKDHFDIPFPDWSFWGWPEVNIRPWDEEFRDIKRGAQ 116
Query: 227 GRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNL 285
R+W + P AYWKGNP V TR LL CN + K W A++ QDW E+K G+ S L
Sbjct: 117 ARSWPKKWPRAYWKGNPDVGSPTRTSLLECNHTKK--WGAQIMRQDWEEEAKGGYVSSKL 174
Query: 286 ASQCAHRY 293
+ QC +RY
Sbjct: 175 SHQCDYRY 182
>gi|302815695|ref|XP_002989528.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
gi|300142706|gb|EFJ09404.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
Length = 342
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 82/217 (37%), Positives = 129/217 (59%), Gaps = 8/217 (3%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
+CP YF I E L+PW+ TGITR L+R + R+ ++ ++Y+ Y Q+R VF+
Sbjct: 3 SCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSVFS 62
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
+WG+L +L ++ R+PD+E + +C DRP++ R Y G P P+ Y R ++DI
Sbjct: 63 LWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGG----APAPVLSYCSHRHSLDIP 118
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPD+SFWGW E+NI+PWE +E+ +G+ W R+P A+WKGN + + R L CN
Sbjct: 119 FPDYSFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLRMGKLRNLLARCNS 178
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRYS 294
+ ++ ++ Q+WI E+ G+ S L QC R+
Sbjct: 179 T---EFGTLVFDQNWIAEANVGYANSKLCKQCNQRHG 212
>gi|124360767|gb|ABN08741.1| Protein of unknown function DUF821, CAP10-like [Medicago
truncatula]
Length = 185
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 82/163 (50%), Positives = 105/163 (64%), Gaps = 3/163 (1%)
Query: 45 LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
L C N TQTCP ++ + S TSTCP++FRWIHEDL PWK TGITR+M
Sbjct: 25 LKCFNGNPTQTCPNDHSPIQAFHQDQDPSS--STSTCPEHFRWIHEDLEPWKSTGITREM 82
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
++ + R+++ KVY+ Y S QTR FT+WGI+QLLR YPGR+PDLEL+F+
Sbjct: 83 VDSGENISQLRIVIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETG 142
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
DR V+ + Y GP PPP+F Y G +DIVFPDWS+WGW
Sbjct: 143 DRAVVDKKRYEGPEAVTPPPIFSYCGKNEALDIVFPDWSYWGW 185
>gi|302761718|ref|XP_002964281.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168010|gb|EFJ34614.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 342
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 82/219 (37%), Positives = 130/219 (59%), Gaps = 8/219 (3%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
++CP YF I E L+PW+ TGITR L+R + R+ ++ ++Y+ Y Q+R V
Sbjct: 1 AASCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSV 60
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMD 195
F++WG+L +L ++ R+PD+E + +C DRP++ R Y G P P+ Y R ++D
Sbjct: 61 FSLWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGGV----PSPVLSYCSHRHSLD 116
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
I FPD+SFWGW E+NI+PWE +E+ +G+ W R+P A+WKGN + + R L C
Sbjct: 117 IPFPDYSFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLRMGKLRNLLARC 176
Query: 256 NLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRYS 294
N + ++ ++ Q+WI E+ G+ S L QC R+
Sbjct: 177 NST---EFGTLVFDQNWIAEANVGYANSKLCKQCNQRHG 212
>gi|302815579|ref|XP_002989470.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
gi|300142648|gb|EFJ09346.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
Length = 329
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 85/217 (39%), Positives = 133/217 (61%), Gaps = 11/217 (5%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF IHE L+PW+ TGITR L+R R+ ++ + +Y+ Y Q+R F
Sbjct: 1 ALCPRYFARIHEHLAPWRSTGITRRSLDR---LGSMRVAILRSDMYVSAYGDCPQSRRAF 57
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
++WG+L +L ++ R+PD+E + +C DRP++ RD S + P P+ Y R ++DI
Sbjct: 58 SLWGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGS----EVPAPVLSYCSHRLSLDIP 112
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPD+SFWGW E+NI+PWE +E+ +G+ G W +R+P+A+WKGN + + R L CN
Sbjct: 113 FPDYSFWGWPEVNIRPWEEESQEIFQGSQGVEWSNRQPFAFWKGNLRMGKLRTLLARCNS 172
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRYS 294
+ + + Q+WI E+ G++ SNL QC RY+
Sbjct: 173 T---KFGTLVLDQNWIDEANIGYENSNLCKQCNQRYN 206
>gi|302761722|ref|XP_002964283.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168012|gb|EFJ34616.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 401
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 122/214 (57%), Gaps = 6/214 (2%)
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
P YF I DL PWK GIT+ L A + R+++ K+YI Y + Q+R +FT W
Sbjct: 58 PKYFDSIDTDLRPWKDGGITKSSLSAARRKGSMRMVISQGKLYIEVYGKCPQSRSIFTAW 117
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
G+L LL ++PG++PD++ + +C DRPVI R + + PP +F Y +DI FP
Sbjct: 118 GLLLLLERFPGKVPDVDFVLNCKDRPVITRFLSFQRFVSGSPPAVFSYCTTNDMLDIPFP 177
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
D+SFWGW E++I PWE +++ G+ W +R P A+WKGN + + RR LL C
Sbjct: 178 DFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSRMGKLRRHLLQC---- 233
Query: 260 KHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+ QDWI ES+ GF S L+ QC R+
Sbjct: 234 -QSLETEILDQDWISESRAGFPNSRLSQQCKDRF 266
>gi|297745253|emb|CBI40333.3| unnamed protein product [Vitis vinifera]
Length = 325
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 82/136 (60%), Positives = 95/136 (69%), Gaps = 1/136 (0%)
Query: 159 MFDCDDRPVIRSRDYSG-PNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
MF+C DRP I++ DY G K PPLF Y T+DIVFPDWSFWGW EINIKPW SL
Sbjct: 1 MFECGDRPRIKATDYGGRKGKKKVPPLFHYCASDDTLDIVFPDWSFWGWPEINIKPWNSL 60
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESK 277
+EL+EGNN W+DREPYAYWKGN + R+ L C S+ HDWNAR+Y DW ES
Sbjct: 61 RKELEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCRPSNNHDWNARVYDMDWGRESH 120
Query: 278 RGFQQSNLASQCAHRY 293
GF+ SNLASQC H+Y
Sbjct: 121 EGFKDSNLASQCTHKY 136
>gi|297745250|emb|CBI40330.3| unnamed protein product [Vitis vinifera]
Length = 312
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 80/123 (65%), Positives = 95/123 (77%), Gaps = 1/123 (0%)
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
RD+ GP+ PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++KEGN W
Sbjct: 2 RDFRGPS-AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKW 60
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCA 290
DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+QDW +SK G++QSNL QC
Sbjct: 61 KDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQDWDQQSKIGYRQSNLEDQCT 120
Query: 291 HRY 293
H Y
Sbjct: 121 HIY 123
>gi|222634972|gb|EEE65104.1| hypothetical protein OsJ_20160 [Oryza sativa Japonica Group]
Length = 498
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 125/258 (48%), Gaps = 51/258 (19%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
PP P +C S S P ++CPDYFR+IH+DL PW+
Sbjct: 108 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 167
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD F
Sbjct: 168 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAF------------------- 208
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
+ D T+D+VFPDWSFWGW E+NI W +
Sbjct: 209 -------------------------------TQDATTLDVVFPDWSFWGWPEVNIGAWPA 237
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQDWILE 275
L ++ + W +REP+A+WKGNP VA R +L+ CN SD DWNARL+ QDW
Sbjct: 238 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQDWNHA 297
Query: 276 SKRGFQQSNLASQCAHRY 293
GF+ S++ QC HRY
Sbjct: 298 IHNGFKDSSIPKQCLHRY 315
>gi|302762508|ref|XP_002964676.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168405|gb|EFJ35009.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 330
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 84/217 (38%), Positives = 132/217 (60%), Gaps = 11/217 (5%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF IHE L+PW+ TGITR L+R R+ ++ + +Y+ Y Q+R F
Sbjct: 1 ALCPRYFARIHEHLAPWRSTGITRRSLDR---LGSMRVAILRSDMYVSAYGDCPQSRRAF 57
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
++WG+L +L ++ R+PD+E + +C DRP++ RD S + P P+ Y R ++DI
Sbjct: 58 SLWGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGS----EVPAPVLSYCSHRLSLDIP 112
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPD+SFWGW E+NI+PWE +E+ +G+ W +R+P+A+WKGN + + R L CN
Sbjct: 113 FPDYSFWGWPEVNIRPWEEESQEIFQGSQDVEWSNRQPFAFWKGNLRMGKLRTLLARCNS 172
Query: 258 SDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRYS 294
+ + + Q+WI E+ G++ SNL QC RY+
Sbjct: 173 T---KFGTLVLDQNWIDEANIGYENSNLCKQCNQRYN 206
>gi|302815691|ref|XP_002989526.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
gi|300142704|gb|EFJ09402.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
Length = 401
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 85/219 (38%), Positives = 120/219 (54%), Gaps = 14/219 (6%)
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
P YF I DL PWK GIT+ L A + R+++ K+YI Y + Q+R +FT W
Sbjct: 56 PKYFDSIDTDLRPWKDGGITKSSLSAARKKGSMRMVISQGKLYIEVYGKCPQSRSIFTAW 115
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVI------RSRDYSGPNNKGPPPLFRYSGDRWTM 194
G+L LL ++PG++PD++ + +C DRPVI SRD P +F Y +
Sbjct: 116 GLLLLLERFPGKVPDVDFVLNCKDRPVITRYSSFHSRDLC---QDEAPAVFSYCTTNDML 172
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DI FPD+SFWGW E++I PWE +++ G+ W +R P A+WKGN + + R LL
Sbjct: 173 DIPFPDFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSRMGKLRSHLLQ 232
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
C + QDWI ES+ GF S L+ QC R+
Sbjct: 233 C-----QSLETEILDQDWISESRAGFPNSRLSQQCKDRF 266
>gi|6850895|emb|CAB71058.1| putative protein [Arabidopsis thaliana]
Length = 338
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 87/147 (59%), Positives = 96/147 (65%), Gaps = 2/147 (1%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGW 206
YPG++PDLEL+F C DRP I RD PPPLF Y G R DIVFPDWSFWGW
Sbjct: 2 YPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGW 61
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
E+NIK W L LKEGN W DR PYAYWKGNP V+ R DL+ CN SDK+D R
Sbjct: 62 PELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVR 121
Query: 267 LYVQDWILESKRGFQQSNLASQCAHRY 293
LYVQDW E + GF+ SNL QC HRY
Sbjct: 122 LYVQDWRSEIEAGFRGSNLEDQCTHRY 148
>gi|413919042|gb|AFW58974.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 316
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 79/136 (58%), Positives = 94/136 (69%), Gaps = 1/136 (0%)
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
MFDC+D PV+ + + G + PPP LFRY G T+DI FPDWSFWGW E+NIKPWE+L
Sbjct: 1 MFDCEDLPVVGAGERHGAQPQPPPPPLFRYCGSEATLDIAFPDWSFWGWPELNIKPWEAL 60
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESK 277
RE+ E N NW DR PYAYWKGNP V R LL CN S + DWNAR+Y QDW E +
Sbjct: 61 QREINEANAVVNWTDRAPYAYWKGNPKVGAERLLLLRCNASGERDWNARVYAQDWGEEVR 120
Query: 278 RGFQQSNLASQCAHRY 293
GF+ S+L+ QC HRY
Sbjct: 121 HGFRGSDLSKQCTHRY 136
>gi|302802289|ref|XP_002982900.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
gi|300149490|gb|EFJ16145.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
Length = 345
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 119/204 (58%), Gaps = 14/204 (6%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP Y+ + EDL+PWK GI LE A + + FR I+ + ++Y+ Y + QTR +FTI
Sbjct: 2 CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
GI+QLL+++PG++PD+++ F+C DRP I + ++ PPPLF Y + DI FP
Sbjct: 62 VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAF----DEAPPPLFGYCSTKNHFDIPFP 117
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN--- 256
DWSFWGW E I PW S +LK W DR+ W+G+P ++ R+ L+ CN
Sbjct: 118 DWSFWGWPENKILPWRS---QLKRITQQAEWKDRDSSVQWRGDPRTSQIRQRLIACNSTG 174
Query: 257 ----LSDKHDWNARLYVQDWILES 276
L +W + +Q+W LES
Sbjct: 175 DKTLLVHGQNWRDQSDLQNWKLES 198
>gi|302800329|ref|XP_002981922.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300150364|gb|EFJ17015.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 345
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 119/204 (58%), Gaps = 14/204 (6%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP Y+ + EDL+PWK GI LE A + + FR I+ + ++Y+ Y + QTR +FTI
Sbjct: 2 CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
GI+QLL+++PG++PD+++ F+C DRP I + ++ PPPLF Y + DI FP
Sbjct: 62 VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAF----DEAPPPLFGYCSTKNHFDIPFP 117
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN--- 256
DWSFWGW E I PW S +LK W DR+ W+G+P ++ R+ L+ CN
Sbjct: 118 DWSFWGWPENKILPWRS---QLKRITRQAEWKDRDSSVQWRGDPRTSQIRQRLIACNSTG 174
Query: 257 ----LSDKHDWNARLYVQDWILES 276
L +W + +Q+W LES
Sbjct: 175 DKTLLVHGQNWRDQSDLQNWKLES 198
>gi|413919038|gb|AFW58970.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 241
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/153 (51%), Positives = 102/153 (66%), Gaps = 7/153 (4%)
Query: 61 KTSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLI 116
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+
Sbjct: 89 STNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVA 148
Query: 117 LVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
+V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +
Sbjct: 149 VVAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERH 208
Query: 175 GP-NNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
G PPPLFRY G T+DI FPDWSFWGW
Sbjct: 209 GAQPQPPPPPLFRYCGSEATLDIAFPDWSFWGW 241
>gi|67633610|gb|AAY78729.1| hypothetical protein At2g45840 [Arabidopsis thaliana]
Length = 337
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/149 (54%), Positives = 90/149 (60%), Gaps = 9/149 (6%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFW 204
YPG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFW
Sbjct: 2 YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
GW EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 62 GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPM 116
Query: 265 ARLYVQDWILESKRGFQQSNLASQCAHRY 293
LY QDW E + G++ SNL QC HRY
Sbjct: 117 VHLYRQDWSREGRIGYRTSNLEDQCTHRY 145
>gi|30267795|gb|AAP21678.1| hypothetical protein [Arabidopsis thaliana]
Length = 337
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/149 (54%), Positives = 90/149 (60%), Gaps = 9/149 (6%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFW 204
YPG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFW
Sbjct: 2 YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
GW EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 62 GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPM 116
Query: 265 ARLYVQDWILESKRGFQQSNLASQCAHRY 293
LY QDW E + G++ SNL QC HRY
Sbjct: 117 VHLYRQDWSREGRIGYRTSNLEDQCTHRY 145
>gi|223950283|gb|ACN29225.1| unknown [Zea mays]
Length = 199
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/152 (51%), Positives = 101/152 (66%), Gaps = 7/152 (4%)
Query: 61 KTSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLI 116
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+
Sbjct: 44 STNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVA 103
Query: 117 LVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
+V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +
Sbjct: 104 VVAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERH 163
Query: 175 GP-NNKGPPPLFRYSGDRWTMDIVFPDWSFWG 205
G PPPLFRY G T+DI FPDWSFWG
Sbjct: 164 GAQPQPPPPPLFRYCGSEATLDIAFPDWSFWG 195
>gi|413919039|gb|AFW58971.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
gi|413919040|gb|AFW58972.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 302
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/150 (51%), Positives = 100/150 (66%), Gaps = 7/150 (4%)
Query: 62 TSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLIL 117
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+ +
Sbjct: 88 TNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAV 147
Query: 118 VNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + + G
Sbjct: 148 VAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHG 207
Query: 176 P-NNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
PPPLFRY G T+DI FPDWSFW
Sbjct: 208 AQPQPPPPPLFRYCGSEATLDIAFPDWSFW 237
>gi|302143882|emb|CBI22743.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 63/86 (73%), Positives = 73/86 (84%)
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
EINIKPWESLL++LKEGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+
Sbjct: 37 EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARV 96
Query: 268 YVQDWILESKRGFQQSNLASQCAHRY 293
Y QDWI ES+ G++QS+LASQC HRY
Sbjct: 97 YTQDWIRESQEGYKQSDLASQCIHRY 122
>gi|359490428|ref|XP_003634087.1| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 288
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 63/86 (73%), Positives = 73/86 (84%)
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
EINIKPWESLL++LKEGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+
Sbjct: 16 EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARV 75
Query: 268 YVQDWILESKRGFQQSNLASQCAHRY 293
Y QDWI ES+ G++QS+LASQC HRY
Sbjct: 76 YTQDWIRESQEGYKQSDLASQCIHRY 101
>gi|125583038|gb|EAZ23969.1| hypothetical protein OsJ_07696 [Oryza sativa Japonica Group]
Length = 279
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/112 (58%), Positives = 77/112 (68%), Gaps = 18/112 (16%)
Query: 182 PPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
PPLFRY GD T+D+VFPDWSFWGW EINIKPW++L ++L GN W+DREPYAYWKG
Sbjct: 2 PPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKG 61
Query: 242 NPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
NP VA R++L DWI ESK G++QS+LASQC HRY
Sbjct: 62 NPDVATKRKEL------------------DWIKESKAGYKQSDLASQCTHRY 95
>gi|356577934|ref|XP_003557076.1| PREDICTED: LOW QUALITY PROTEIN: protein O-glucosyltransferase
1-like, partial [Glycine max]
Length = 268
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 55/87 (63%), Positives = 67/87 (77%)
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
+EINIKPW LL LKEG W++REPYAYWKGNP VAETR+DL+ CN+S+ DWNAR
Sbjct: 1 SEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAVAETRQDLIKCNVSENQDWNAR 60
Query: 267 LYVQDWILESKRGFQQSNLASQCAHRY 293
L+ QDW ES+ GF +S+L SQC +RY
Sbjct: 61 LFAQDWFRESQEGFNKSDLPSQCTYRY 87
>gi|356561560|ref|XP_003549049.1| PREDICTED: uncharacterized protein LOC100818683 [Glycine max]
Length = 289
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 54/87 (62%), Positives = 65/87 (74%)
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
AEINIKPW LL LKEG W++REPYAYWKGNP VAETR+DL+ CN+ + DWNAR
Sbjct: 14 AEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAVAETRQDLIKCNVYENQDWNAR 73
Query: 267 LYVQDWILESKRGFQQSNLASQCAHRY 293
L+ QDW E + GF +S+L SQC +RY
Sbjct: 74 LFAQDWFRELQEGFNKSDLPSQCTYRY 100
>gi|356561649|ref|XP_003549092.1| PREDICTED: receptor-like protein 12-like [Glycine max]
Length = 819
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/174 (41%), Positives = 85/174 (48%), Gaps = 60/174 (34%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
++ PLNC N T TCPTN T P T+TCPDYFRWIHEDL PW T
Sbjct: 700 RKPIAPLNCTAYNLTGTCPTNLQDHQST---------PATATCPDYFRWIHEDLRPWART 750
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GIT+DM+ERA QTA+FR YPG++PDLE
Sbjct: 751 GITQDMVERAKQTANFR----------------------------------YPGKIPDLE 776
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
LMFDC D PV+ S Y+GP T+DIVFPD S+ I+I
Sbjct: 777 LMFDCVDWPVVLSDRYNGP----------------TLDIVFPDSSYTSIIYISI 814
>gi|13937171|gb|AAK50079.1|AF372939_1 At1g63420/F2K11_19 [Arabidopsis thaliana]
gi|21700873|gb|AAM70560.1| At1g63420/F2K11_19 [Arabidopsis thaliana]
Length = 228
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 43/73 (58%), Positives = 59/73 (80%), Gaps = 1/73 (1%)
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQDWILESKRG 279
++EG + +++R+ YAYWKGNPFVA +R DLLTCNLS HDWNAR+++QDWI E +RG
Sbjct: 1 MEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHDWNARIFIQDWISEGQRG 60
Query: 280 FQQSNLASQCAHR 292
F+ SN+A+QC +R
Sbjct: 61 FENSNVANQCTYR 73
>gi|124360768|gb|ABN08742.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
Length = 253
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 61/87 (70%), Gaps = 1/87 (1%)
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
AE IKPWE +L++++E N W DR PYA+WKGN F++ R +L CN++D+HDWNAR
Sbjct: 6 AETRIKPWEKVLKDIQESNKKIKWKDRIPYAFWKGN-FLSNPRHELRKCNVTDQHDWNAR 64
Query: 267 LYVQDWILESKRGFQQSNLASQCAHRY 293
+Y DW E +GF+ + L QCAHRY
Sbjct: 65 VYSVDWNEEIDQGFKNTKLEDQCAHRY 91
>gi|147802993|emb|CAN70738.1| hypothetical protein VITISV_008288 [Vitis vinifera]
Length = 204
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 17/110 (15%)
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
Q CP N+P TS T + + CP+YFRWIHEDL PWK TGI+R +E A
Sbjct: 1 MAQACPANHPTTSVTGKLSXE-------ACPEYFRWIHEDLRPWKSTGISRFAVESAXGD 53
Query: 111 AHFRLILVNNKVYIHKYKQ----------SIQTRDVFTIWGILQLLRKYP 150
A FRL++VN K Y+ +Y++ S+ + V + L++ YP
Sbjct: 54 ADFRLVIVNGKAYVEQYRKYKIYVEGWGWSVSEKYVLACDSMTLLIKPYP 103
>gi|222630210|gb|EEE62342.1| hypothetical protein OsJ_17131 [Oryza sativa Japonica Group]
Length = 267
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 51/87 (58%), Gaps = 3/87 (3%)
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNAR 266
E NI+PW R++KEG W DR AYWKGNP VA R LL CN D + W+A
Sbjct: 10 ETNIEPWSKEFRDIKEGAKAIKWQDRVATAYWKGNPDVASPLRVALLNCN--DTNMWHAE 67
Query: 267 LYVQDWILESKRGFQQSNLASQCAHRY 293
+ Q+W E K G+ S L+SQC HRY
Sbjct: 68 IMRQNWDEEVKSGYHNSKLSSQCTHRY 94
>gi|255630857|gb|ACU15791.1| unknown [Glycine max]
Length = 194
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/69 (52%), Positives = 45/69 (65%), Gaps = 7/69 (10%)
Query: 192 WTMDIVFP-------DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
+T ++ FP D + AEINIKPW LL LKEG W++REPYAYWKGNP
Sbjct: 24 FTQNLCFPQCYKSCFDLAAGEGAEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPA 83
Query: 245 VAETRRDLL 253
VA+TR+DL+
Sbjct: 84 VAQTRQDLM 92
>gi|412992075|emb|CCO20801.1| glycosyltransferase CAZy GT90 [Bathycoccus prasinos]
Length = 438
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 96/215 (44%), Gaps = 31/215 (14%)
Query: 86 WIHEDLSPWKVTGITRDMLERANQTAHF--RLILVNNKVYIHKYKQSIQ-TRDVFTIWGI 142
WI DL PW +TGIT+ M++ A Q R+ ++ K+Y K S +R + +WG+
Sbjct: 74 WIDVDLQPWSMTGITKRMVDLAAQQGMRANRIQIIGGKIYAQISKSSRGPSRIWYWLWGL 133
Query: 143 LQLLRKYPGR-LPDLELMFDCDDRPVI--------------RSRDYS-GPNNKGPPPLFR 186
++L+ ++P +PD++ + + D P + + RD+ G + PPP+F
Sbjct: 134 MELIDEFPEEAVPDVDFILNTQDDPQVSIVGKRPKNPILAKKYRDFVPGIKGQAPPPVFS 193
Query: 187 YSGDRWTMDIVFPDWSFWG---------WAEINIKPWESLLRELKEGNNGRNWID-REPY 236
D+++P W+ WG + PW+ L +L W + R
Sbjct: 194 AVTTSNNYDLLWPLWTIWGEDVEGAGSKTGGFHDPPWKELHPKLIHFAKKNKWSERRSER 253
Query: 237 AYWKGNPFVAETRRDLLTC--NLSDKHDWNARLYV 269
+W+G+ RR L+ C N D D +L V
Sbjct: 254 IFWRGSVKTNPARRALIRCSKNTVDAADVQHKLRV 288
>gi|312384008|gb|EFR28848.1| hypothetical protein AND_02694 [Anopheles darlingi]
Length = 406
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 103/220 (46%), Gaps = 29/220 (13%)
Query: 38 PKRITTPLNCVKNQTQTCPTNYPKT------SQTQESISDYSIPPTSTCPDYFRWIHEDL 91
P P++ + ++ PK S ++ S C + + +DL
Sbjct: 28 PASAAVPVSVTTSAKRSSAIRGPKKFTPLPDSVLSRNLGSESTTAIDGCSCHTGVLKKDL 87
Query: 92 SPWKVTGITRDMLERANQTA-HFRLILVNNKVYIHKYKQSIQTRDVF--TIWGILQLLRK 148
P+K GIT++M+ERA Q H+++I ++K+Y QT +F GI ++
Sbjct: 88 KPFKADGITKEMIERAKQYGTHYQVI--DHKLY-------RQTECMFPARCSGIEHFVKP 138
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWA 207
+LPD++L+ +C D P + R + NK P+F +S +DI++P W+FW G
Sbjct: 139 LLPKLPDMDLIINCRDWPQVH-RHW----NKEKTPVFSFSKTDEYLDIMYPAWAFWEGGP 193
Query: 208 EINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
I++ P W+ + + W +++P A+++G+
Sbjct: 194 AISLYPTGLGRWDQHRESISQAAERVPWKNKKPMAFFRGS 233
>gi|388579121|gb|EIM19449.1| hypothetical protein WALSEDRAFT_34020 [Wallemia sebi CBS 633.66]
Length = 471
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 92/199 (46%), Gaps = 31/199 (15%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQS-IQTRDVFTIWGILQLLRKYPGRLPDL 156
GIT++ LE+A+ AH R+++ NN++Y+ +Y+ + TR + T I + + PG +P++
Sbjct: 140 GITKEHLEKASDKAHGRVVIHNNRMYVKEYRHGDVNTRAMATFAAIHEAVLTSPGAIPNV 199
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY--SGDR---WTMDIVFPDWSFWGWAEINI 211
E F D +G + P P F + D+ W M PD+ FW W E +
Sbjct: 200 EFTFQIQD---------AGDSYDEPIPTFVLDRTADQPELWLM----PDFGFWSWPEPKV 246
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQD 271
+ + + + + +W ++ P +W+G R L+ + + H W + + + +
Sbjct: 247 GSYVEVRDKAGKWESKHSWSEKLPKVFWRGASLGLPIRDQLV--DAARGHAW-SDVKIMN 303
Query: 272 W---------ILESKRGFQ 281
W +E GFQ
Sbjct: 304 WGDIQPGDLLTMEEHCGFQ 322
>gi|194744000|ref|XP_001954486.1| GF18286 [Drosophila ananassae]
gi|190627523|gb|EDV43047.1| GF18286 [Drosophila ananassae]
Length = 411
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 101/217 (46%), Gaps = 43/217 (19%)
Query: 50 NQTQTCPTNYPKTS-----QTQESISDYSIPPT----STCPDYFRWIHEDLSPWKVTGIT 100
Q+++ N+ + S Q +++++DY T S C + I DL+P+K TG+T
Sbjct: 34 GQSESLDGNHDEFSFKIRRQIKKAVADYKPCSTDDEDSKCACHAAVIKHDLAPYKATGVT 93
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLLRKYPGRLPD 155
R M+E+A + + + NN++Y RD ++ GI L LP+
Sbjct: 94 RQMIEKAGEYGT-KYKIFNNRLY----------RDANCMFPSRCQGIEHFLLPLTASLPN 142
Query: 156 LELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEIN 210
++L VI +RDY N + G P+F +S + MDI++P W+FW G
Sbjct: 143 MDL--------VINTRDYPQLNTAWGSSGRGPIFSFSKTKEYMDIMYPAWTFWAGGPATK 194
Query: 211 IKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
+ P W+ + +L++ + W + +++G+
Sbjct: 195 LHPRGIGRWDQMREKLEKRSAAIPWSQKRELGFFRGS 231
>gi|358056081|dbj|GAA97978.1| hypothetical protein E5Q_04658 [Mixia osmundae IAM 14324]
Length = 1130
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 92/199 (46%), Gaps = 20/199 (10%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GI R ++R +H R+++ N++YI ++ TR I I + + +PD+E
Sbjct: 812 GIHRSQIDRGTVASHARILIKRNRLYIKHFRPVTTTRVEAAIALIHEAILSSVEPIPDVE 871
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI--VFPDWSFWGWAE-INIKPW 214
L+ DD +G + G P L G R + ++ + PD+ F+GW E I + +
Sbjct: 872 LLLHLDD---------TGNSKPGVPMLVL--GRRPSEELLWLMPDFGFYGWPEAIRVGSY 920
Query: 215 ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWIL 274
+ E +W ++P A+W+G R+ L+ N S + W A + + DW L
Sbjct: 921 IHDYDQTLESETHASWKHKKPTAFWRGASLGLRDRQSLV--NNSRPYGW-ADIAIVDW-L 976
Query: 275 ESKRGFQQSNLASQCAHRY 293
+ +G N CAHRY
Sbjct: 977 KGPKGIL--NPKQTCAHRY 993
>gi|242065946|ref|XP_002454262.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
gi|241934093|gb|EES07238.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
Length = 173
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 40/57 (70%), Gaps = 1/57 (1%)
Query: 79 TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
+CP YF +IHEDL PW+ G ITR ML+RA+ TA FR +++ + Y+H+ + + Q R
Sbjct: 89 SCPSYFCFIHEDLRPWRAAGGITRAMLDRAHLTATFRFVVLEGRAYVHRLRPAFQNR 145
>gi|384245503|gb|EIE18997.1| hypothetical protein COCSUDRAFT_54837 [Coccomyxa subellipsoidea
C-169]
Length = 520
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 85/206 (41%), Gaps = 38/206 (18%)
Query: 86 WIHEDLSPWKVTGI----TRDMLERANQTAH--FRLILVNNKVYIHKYKQ---------- 129
WI +D + W+ TGI +M R + FR ++N +++ +
Sbjct: 88 WIKQDFAQWEKTGIKMSAVTEMALRYRECFGEVFRFQIINGTLWVDHISERHSGWYPSRM 147
Query: 130 -----SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPL 184
S + + + I ++ LR +PG++PD++ + D P + + P N PPP+
Sbjct: 148 GAGSLSAKGKIPYAILALMDTLRHHPGQIPDIDAVIQTSDFPCMLRQQ---PGNTPPPPV 204
Query: 185 FRYSGDRWTMDIVFPDWSFWGWA--------EINIKPWESLLRELKEGNNGRNWIDREPY 236
F Y+ +DI FPD+++WG + + WE + L E + R+P
Sbjct: 205 FGYNSHARFVDIPFPDYTYWGHEYHRLVDEDGLLLFGWEKQFKLLSEKWREKEIASRKPQ 264
Query: 237 AYWKGN------PFVAETRRDLLTCN 256
W+G P E RR C
Sbjct: 265 VIWRGRTEDKEYPKRDELRRQFARCG 290
>gi|307104334|gb|EFN52588.1| hypothetical protein CHLNCDRAFT_138604 [Chlorella variabilis]
Length = 452
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/230 (26%), Positives = 97/230 (42%), Gaps = 13/230 (5%)
Query: 69 ISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNNKVYIH 125
+S S+ S F + DL P+ +GI E+A A R+ +VN VYI
Sbjct: 28 LSKASLNGASYVGPLFSQVARDLEPFNHSGIGLRHAEQAYCQGSKASMRVQVVNGSVYIV 87
Query: 126 KYKQSIQTRDVFTIWGILQLLRKYPGRLPD-LELMFDCDDRPVIRSRDYSGPNNKGP--P 182
S ++R + +L L G LP+ ++ + + +D P +R R P +GP
Sbjct: 88 GESPSYESRMLGIKRQLLHLWLA--GGLPESIDFVVEQEDHPTVRHRSDDCPE-RGPILA 144
Query: 183 PLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN 242
P S + ++ PD +F GW E PW +L L+ W DR +++G
Sbjct: 145 PAKCPSNKSHSHVLLAPDHTFAGWPEARTLPWAEMLPLLQHSAERHPWADRSALLFFRG- 203
Query: 243 PFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHR 292
A R+L +LS + + + +W +R S LA C HR
Sbjct: 204 --AATGDRNLTDSDLSLSYPELLDVQLVNWTSAEERPLFVS-LADHCRHR 250
>gi|195331147|ref|XP_002032264.1| GM23615 [Drosophila sechellia]
gi|194121207|gb|EDW43250.1| GM23615 [Drosophila sechellia]
Length = 411
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 93/214 (43%), Gaps = 31/214 (14%)
Query: 46 NCVKNQTQTCPTN---YPKTSQTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTG 98
+C +N+++ + + Q +++ +DY S P S C + + DL+P+K TG
Sbjct: 32 SCAQNESEQINEDEFSFKIRRQIEKANADYKPCSSDPQDSDCSCHADVLKRDLAPYKSTG 91
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+TR M+E + + K+Y H+ + GI L LPD++L
Sbjct: 92 VTRQMIESSARYG------TKYKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL 145
Query: 159 MFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
VI +RDY N N P+F +S + DI++P W+FW G + P
Sbjct: 146 --------VINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHP 197
Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
W+ + +L++ W + +++G+
Sbjct: 198 RGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGS 231
>gi|307105335|gb|EFN53585.1| hypothetical protein CHLNCDRAFT_136791 [Chlorella variabilis]
Length = 592
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 79/189 (41%), Gaps = 34/189 (17%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQT------AHFRLILVNNKVYIHKYKQSIQ---- 132
Y I DL PWK +GIT +++ER++ R ++N +++H + ++
Sbjct: 119 YLDLIRRDLLPWKDSGITHELVERSSMMFDDCDGDMLRFQVLNGSLWVHHITERLEGGWY 178
Query: 133 ------------TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
R + + +++ LR +PG++PD++ + D P I P
Sbjct: 179 PAPIGPGNAAAKGRVPYAVLALMETLRMFPGQIPDVDAILHFADFPCIPRPRAGAP---- 234
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI-----KP---WESLLRELKEGNNGRNWID 232
P P+ G DI F D+++WG + KP W + L + +D
Sbjct: 235 PAPILGLQGSAHHSDIPFSDYTYWGHEHQYLQDPWGKPAHGWGNQAEVLARKYENVSLLD 294
Query: 233 REPYAYWKG 241
R P A W+G
Sbjct: 295 RIPQASWRG 303
>gi|195573000|ref|XP_002104483.1| GD18424 [Drosophila simulans]
gi|194200410|gb|EDX13986.1| GD18424 [Drosophila simulans]
Length = 411
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKAKADYKPCSSDPQDSDCSCHADVMKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
K+Y H+ + GI L LPD++L VI +RDY N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL--------VINTRDYPQLNAA 158
Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
N P+F +S + DI++P W+FW G + P W+ + +L++
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218
Query: 230 WIDREPYAYWKGN 242
W + +++G+
Sbjct: 219 WSQKRSLGFFRGS 231
>gi|383858247|ref|XP_003704613.1| PREDICTED: O-glucosyltransferase rumi homolog [Megachile rotundata]
Length = 391
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 85/186 (45%), Gaps = 18/186 (9%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S +E+ +Y + + + I DL P+K GI +D+++ A F I V KV
Sbjct: 53 SMIEEAERNYQLCNNTNNNCFKHIILRDLKPFKEKGINKDLIDAAKARGTFYQI-VKGKV 111
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y K +R GI L K +LPD++L+ + D P +S Y G GP
Sbjct: 112 YRQK-DCMFPSR----CSGIEHFLLKLAPKLPDMDLVINVRDYP--QSSKYFG----GPL 160
Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
P+F +S DI +P W+FW G I++ P W+ L + + W +E
Sbjct: 161 PVFSFSKTPQYYDITYPAWAFWEGGPAISLYPRGLGRWDEHRISLDKASKSLPWEKKESK 220
Query: 237 AYWKGN 242
A+++G+
Sbjct: 221 AFFRGS 226
>gi|195996689|ref|XP_002108213.1| hypothetical protein TRIADDRAFT_18924 [Trichoplax adhaerens]
gi|190588989|gb|EDV29011.1| hypothetical protein TRIADDRAFT_18924, partial [Trichoplax
adhaerens]
Length = 366
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 96/196 (48%), Gaps = 27/196 (13%)
Query: 67 ESISDYSIPPTSTCPDYFRWIHEDLSPWKVT-GITRDMLERA-NQTAH------FRLILV 118
+++S + C Y + I DL PW++ GIT+++ ++A NQ +H ++
Sbjct: 16 KAVSSFKDCKNDRCGCYKKVIDNDLKPWRLKKGITKEIFDKAANQGSHRGAEKGSHYQII 75
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N+KVY H+ + + R GI L+K +LP+LEL+ + D P + P
Sbjct: 76 NHKVYRHE-RCTFPAR----CKGIEHFLKKIAKKLPNLELIINTHDWPKV-------PKW 123
Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P+F +S DI++P WSFW G A I P W+ L + L++ ++ W
Sbjct: 124 DELLPVFSFSKTHNENDIMYPAWSFWEGGPAVWPIFPNGLGRWDVLRKSLQKASDKWPWD 183
Query: 232 DREPYAYWKGNPFVAE 247
++ A+++G+ AE
Sbjct: 184 KKKSIAFFRGSRTSAE 199
>gi|21355689|ref|NP_651095.1| rumi, isoform A [Drosophila melanogaster]
gi|74866179|sp|Q8T045.1|RUMI_DROME RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
gi|17862464|gb|AAL39709.1| LD29477p [Drosophila melanogaster]
gi|23172000|gb|AAN13920.1| rumi, isoform A [Drosophila melanogaster]
gi|220945854|gb|ACL85470.1| rumi-PA [synthetic construct]
gi|220955678|gb|ACL90382.1| rumi-PA [synthetic construct]
Length = 411
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 83/189 (43%), Gaps = 20/189 (10%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKANADYKPCSSDPQDSDCSCHANVLKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y H+ + GI L LPD++L+ + D P + + N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDLIINTRDYPQLN----AAWGNA 162
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDR 233
P+F +S + DI++P W+FW G + P W+ + +L++ W +
Sbjct: 163 AGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQK 222
Query: 234 EPYAYWKGN 242
+++G+
Sbjct: 223 RSLGFFRGS 231
>gi|195502713|ref|XP_002098347.1| GE24006 [Drosophila yakuba]
gi|194184448|gb|EDW98059.1| GE24006 [Drosophila yakuba]
Length = 411
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKANADYKPCSSDPKDSDCSCHADVLKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
K+Y H+ + GI L LPD++L VI +RDY N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL--------VINTRDYPQLNAA 158
Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
N P+F +S + DI++P W+FW G + P W+ + +L++
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218
Query: 230 WIDREPYAYWKGN 242
W + +++G+
Sbjct: 219 WSQKRNLGFFRGS 231
>gi|405122052|gb|AFR96820.1| hypothetical protein CNAG_06994 [Cryptococcus neoformans var.
grubii H99]
Length = 422
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 79/170 (46%), Gaps = 27/170 (15%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GI++ M++ A + + RL+++NN++++ YK I TR I + + LPD++
Sbjct: 81 GISKKMVDEAEEEGNARLVILNNQLFVKAYKGGINTRTQAAIAAVYGTVLTATEPLPDVD 140
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPW-- 214
VI++ D G N+ P F D+ + PD+ F+ W E + +
Sbjct: 141 F--------VIQTSDAGGGNH----PHFALCRKANQKDLWLMPDFGFFSWPEPGVGSYSE 188
Query: 215 ---ESLLREL--------KEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
++L EL K G +W ++ +W+G+P V E R DLL
Sbjct: 189 VRTKTLDYELDMGLEVDNKLGVTKSDWFNKTQQLFWRGSPMV-EVRNDLL 237
>gi|194910742|ref|XP_001982221.1| GG12485 [Drosophila erecta]
gi|190656859|gb|EDV54091.1| GG12485 [Drosophila erecta]
Length = 411
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKANADYKPCSSDPKDSDCSCHAAVLKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
K+Y H+ + GI L LPD++L VI +RDY N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVTTLPDMDL--------VINTRDYPQLNAA 158
Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
N P+F +S + DI++P W+FW G + P W+ + +L++
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218
Query: 230 WIDREPYAYWKGN 242
W + +++G+
Sbjct: 219 WSQKRNLGFFRGS 231
>gi|328860135|gb|EGG09242.1| hypothetical protein MELLADRAFT_115837 [Melampsora larici-populina
98AG31]
Length = 515
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 78/167 (46%), Gaps = 13/167 (7%)
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+T + L+ A + H R+++ N+VYI +YK R + I + + P RLP++E
Sbjct: 200 VTSEQLDEAIKVGHARVLIYENRVYIKEYKGGPGKRTEALLNSIQEAVITSPERLPNIEF 259
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESL 217
+ D P +G K P + + D+ D+ + PD+ F+ W E + +
Sbjct: 260 VVKTVDAP-------TGEETKLPLWVLDRTIDQ--EDVWLTPDYGFYSWPEPKVGSMIEV 310
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ E +W D+ P A+W+G + + R ++ ++ H+WN
Sbjct: 311 RDKCNEIEKKLDWKDKIPKAFWRG-AILVKLREQMI--EIAKGHEWN 354
>gi|443694719|gb|ELT95788.1| hypothetical protein CAPTEDRAFT_221044 [Capitella teleta]
Length = 501
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 90/190 (47%), Gaps = 22/190 (11%)
Query: 80 CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
CP +R I +DLS + + + ++ +ER NQ L ++ NN++Y Y + +
Sbjct: 149 CPAGYRQIKQDLSIFGDIDMKAVKQEAVERFNQRGRHALCHYVVKNNQIYRQTYGEHVGF 208
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD+E + D P+ + + ++GP P+F + G T
Sbjct: 209 K-MFMDAMLLSLTRKV--HLPDVEFFVNLGDWPLEKRK-----VSEGPLPIFSWCGSDDT 260
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
DIV P + E ++ + +L + N G W ++ A+W+G E R D
Sbjct: 261 RDIVMPTYDV---TESTLETMGRITLDLLSVQANTGPKWSNKSSVAFWRGRDSRQE-RLD 316
Query: 252 LLTCNLSDKH 261
L+ LS KH
Sbjct: 317 LV--KLSRKH 324
>gi|195399534|ref|XP_002058374.1| GJ14378 [Drosophila virilis]
gi|194141934|gb|EDW58342.1| GJ14378 [Drosophila virilis]
Length = 410
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 101/227 (44%), Gaps = 43/227 (18%)
Query: 42 TTPLNCVKNQTQTC-------PTNYPKTSQTQESISDY--SIPPTST------CPDYFRW 86
T + C + TC P K S+ +SI + S P +T C Y
Sbjct: 21 TQKIQCDGAEHTTCRETDGIDPNQVAKHSKITQSIQNALASYEPCTTDVNDVNCTCYAAG 80
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----G 141
I DL+ +K TG+TR M+ A + R + N ++Y RD ++ G
Sbjct: 81 IKRDLALYKSTGVTRKMINDAAKYG-TRYKIYNKQLY----------RDDNCMFPARCQG 129
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
I L + LP+++L+ + D P + +S + GP +F +S + DI++P W
Sbjct: 130 IEHFLLQLLAELPNMDLVINTRDYPQL----HSAWRHDGP--VFSFSKTKEYRDIMYPAW 183
Query: 202 SFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
+FW G + P W+ + +L++ ++ +W D++ +++G+
Sbjct: 184 TFWAGGPATKLHPTGIGRWDLMRAKLEKRSSSLSWHDKQELGFFRGS 230
>gi|324516300|gb|ADY46488.1| CAP10 family protein CPIJ013394 [Ascaris suum]
Length = 381
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 85/186 (45%), Gaps = 22/186 (11%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAH-FRLILVNNKV 122
+ +E+ +Y + C + I DLSP++ GI ML A +AH R L+ ++
Sbjct: 39 EIEEAERNYRSCDINNCTCFQSQIENDLSPFQ-NGIDESMLLAA--SAHGVRYQLIGGRL 95
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
+ + + + R G+ L LP+ E + + D P +RS D P
Sbjct: 96 F-RQPQCPFEAR----CEGVEYFLVHLADSLPNTEFVLNVHDHPQMRSDD-------SPL 143
Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
P+F +S D +DI++P WSFW G I++ P W ++ + W R+P
Sbjct: 144 PVFSFSKDMNHIDILYPAWSFWSGGPAISLYPTGIGRWNETSVKITKAAKRIEWAHRKPI 203
Query: 237 AYWKGN 242
A+++G+
Sbjct: 204 AFFRGS 209
>gi|118783733|ref|XP_001230897.1| AGAP004267-PA [Anopheles gambiae str. PEST]
gi|193806712|sp|A0NDG6.1|RUMI_ANOGA RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|116129010|gb|EAU76986.1| AGAP004267-PA [Anopheles gambiae str. PEST]
Length = 399
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 89/184 (48%), Gaps = 21/184 (11%)
Query: 66 QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLILVNNKVYI 124
+ +++ Y ++ C + + DL P+K GIT++M+ RA Q H+++I
Sbjct: 57 ETALAGYVACNSTNCNCHADVLKADLKPFKAHGITKEMINRAKQYGTHYQVIG------- 109
Query: 125 HKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPL 184
HK + + G+ +R LPD++L+ +C D P I R +S K P+
Sbjct: 110 HKLYRQRECMFPARCSGVEHFVRPLLPLLPDMDLIVNCRDWPQIH-RHWS----KEKIPV 164
Query: 185 FRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAY 238
+S +DI++P W+FW G I + P W+ + + + + +W +EP A+
Sbjct: 165 LSFSKTAEYLDIMYPAWAFWEGGPAIALYPTGLGRWDLHRQTITKASA--DWEAKEPKAF 222
Query: 239 WKGN 242
++G+
Sbjct: 223 FRGS 226
>gi|170055423|ref|XP_001863576.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
gi|193806713|sp|B0X1Q4.1|RUMI_CULQU RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|167875399|gb|EDS38782.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
Length = 403
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/198 (23%), Positives = 91/198 (45%), Gaps = 23/198 (11%)
Query: 26 VIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQT------QESISDYSIPPTST 79
++I ++ Q C + Q + N K + +E+++ Y +S
Sbjct: 11 LVIFAQNGQTDDGGMCMAKETCTETQQEAPTNNLYKAADNKYITLIEEALAAYKPCESSN 70
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF-- 137
C + + DL P++ +GIT+D++E A + ++ ++++ + RD
Sbjct: 71 CSCHLDVLKTDLRPFR-SGITQDLIELARSYGT-KYQIIGHRMF--------RQRDCMFP 120
Query: 138 -TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
G+ +R +LPD+EL+ +C D P I SR ++ ++ P P+ +S +DI
Sbjct: 121 ARCSGVEHFIRPNLPKLPDMELIINCRDWPQI-SRHWNA--SREPLPVLSFSKTNDYLDI 177
Query: 197 VFPDWSFW-GWAEINIKP 213
++P W FW G I++ P
Sbjct: 178 MYPTWGFWEGGPAISLYP 195
>gi|321467420|gb|EFX78410.1| hypothetical protein DAPPUDRAFT_305151 [Daphnia pulex]
Length = 415
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 106/249 (42%), Gaps = 38/249 (15%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+S+ DY C + + EDL +K GIT+ ML++A + ++ K+
Sbjct: 69 GHISQSLLDYKPCLFKNCSCFTNLVIEDLKAFKDVGITKSMLDKAKDRGT-KYQIIGQKL 127
Query: 123 YIHKYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
Y R+ F ++ GI + K LPD+EL+ + D P + SR +
Sbjct: 128 Y----------RESFCLFPSRCAGIEHFILKVIKDLPDMELIINNRDWPQV-SRHFG--- 173
Query: 178 NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P+ +S + +DI +P W+FW G I++ P W+ + + W
Sbjct: 174 --EVLPILSFSKTKEYLDITYPAWTFWEGGPAISLYPRGLGRWDQHRISIDKVAATYPWN 231
Query: 232 DREPYAYWKG-------NPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSN 284
+++ A+++G +P + +R +L NL D + + D ++ +
Sbjct: 232 EKQSKAFFRGSRTSSERDPLILLSRGNL---NLVDAQYTKNQAWKSDSDTLGAPPAEEVS 288
Query: 285 LASQCAHRY 293
L S C+++Y
Sbjct: 289 LESHCSYKY 297
>gi|405965272|gb|EKC30658.1| KTEL motif-containing protein 1 [Crassostrea gigas]
Length = 400
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 77/148 (52%), Gaps = 25/148 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNN 120
+ ++++ +Y C + I EDL+PWK GIT ++ A N +H+++ VN+
Sbjct: 54 KIEKAVKEYKDCTQDECSCHRSVIEEDLTPWK-NGITEEVFNTAKSNNYGSHYQI--VNH 110
Query: 121 KVYIHK---YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
K+Y + + + F IL++++K LPD+E + + D P I SR
Sbjct: 111 KLYREEGCMFPARCSGNEHF----ILEVIKK----LPDMEFVINTRDWPQISSR------ 156
Query: 178 NKGPPPLFRYSGDRW-TMDIVFPDWSFW 204
+ P P+F +S R ++DI++P W+FW
Sbjct: 157 -QQPIPVFSFSKVRGQSLDIMYPAWTFW 183
>gi|332019905|gb|EGI60366.1| CAP10 family protein [Acromyrmex echinatior]
Length = 407
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 102/222 (45%), Gaps = 32/222 (14%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGIL 143
I DL P+ GI+++M+E A F I + K+Y + +D GI
Sbjct: 81 IVRDLRPFTKKGISKEMIEAAKTRGTFYQI-IKGKLY--------REKDCMFPARCAGIE 131
Query: 144 QLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSF 203
L K G L D++L+ + D P +S +Y G P+F +S DI++P W+F
Sbjct: 132 HFLLKIIGNLSDMDLVINTRDYP--QSSEYFG----NAIPVFSFSKTPQYYDIMYPAWAF 185
Query: 204 W-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
W G I++ P W+ + L + + +W ++E +++G+ +E R +L+ +
Sbjct: 186 WEGGPAISLYPRGLGRWDQHRKTLNKASLEISWEEKESKGFFRGSRTSSE-RDNLILLSR 244
Query: 258 SDKHDWNARLYVQDWILESKRG------FQQSNLASQCAHRY 293
+ H +A+ Y ++ +S + +L S C ++Y
Sbjct: 245 NKPHLVDAQ-YTKNQAWKSNEDTLHATPASEVSLESHCTYKY 285
>gi|291400615|ref|XP_002716870.1| PREDICTED: KTEL (Lys-Tyr-Glu-Leu) containing 1 [Oryctolagus
cuniculus]
Length = 462
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 108/250 (43%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ---TAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ Q H+++I N
Sbjct: 108 QINRSLENYEPCSSENCSCYHGIIEEDLTPFR-GGISRKMMAEVVQRKLGTHYQII--KN 164
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 165 RLY-RENDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 212
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ EL + W
Sbjct: 213 PAIPVFSFSKTSEYYDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVKSAAQWPWKK 272
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 273 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 327
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 328 HLVDHCKYKY 337
>gi|195113713|ref|XP_002001412.1| GI10779 [Drosophila mojavensis]
gi|193918006|gb|EDW16873.1| GI10779 [Drosophila mojavensis]
Length = 406
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/214 (23%), Positives = 94/214 (43%), Gaps = 25/214 (11%)
Query: 42 TTPLNCVKNQTQTCPTN--YPK-TSQTQESISDYSIPPTST----CPDYFRWIHEDLSPW 94
T C+KN+ N + K T ++++++Y T C Y I DL+P+
Sbjct: 25 TGQSTCIKNEEIEVKQNAKFDKFTRNIEKALANYQPCTTDATDVNCTCYAAGIKRDLAPY 84
Query: 95 KVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLP 154
K TG TR M+E A + K++ + + GI L + L
Sbjct: 85 KSTGFTRKMIEDAAKYG------TRYKIFGKQLFREDNCMFPARCQGIEHFLLQLLPELK 138
Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
+++L+ + D P + +S +KG P+F +S +DI++P W+FW G + P
Sbjct: 139 NMDLVINTRDYPQL----HSSWQHKG--PVFSFSKTTEYLDIMYPAWTFWAGGPATKLHP 192
Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
W+ + +LK+ W +++ +++G+
Sbjct: 193 TGIGRWDLMREKLKKAAKAIPWGEKKEIGFFRGS 226
>gi|195054040|ref|XP_001993934.1| GH18344 [Drosophila grimshawi]
gi|193895804|gb|EDV94670.1| GH18344 [Drosophila grimshawi]
Length = 408
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 75/169 (44%), Gaps = 16/169 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C Y I DL+P+K G +R MLE A + K+Y K +
Sbjct: 69 CTCYAAGIKRDLAPYKSIGFSRQMLEDAAKYG------TRYKIYGQKLFREENCFFPARC 122
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
GI L + +LP+++L+ + D P + +S ++ P+F +S DI++P
Sbjct: 123 QGIEHFLLELLPQLPNMDLVINTRDYPQL----HSSWSSSRIGPVFSFSKTSEYRDIMYP 178
Query: 200 DWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
W+FW G + P W+ + +LKE W ++E +++G+
Sbjct: 179 AWTFWAGGPATKLHPTGIGRWDLMSGKLKEVTTKIPWQEKEQLGFFRGS 227
>gi|242025222|ref|XP_002433025.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212518534|gb|EEB20287.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 407
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 92/201 (45%), Gaps = 27/201 (13%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S+ + S+Y P S C +F+ + DL P+K GITR+M++ I
Sbjct: 61 SKINFAESNYKNCPYSNCSCFFKTLKNDLKPFK-NGITREMIDS---------IRSRGTT 110
Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
YI K+ + +D GI ++K L + EL+ + D P I SR + K
Sbjct: 111 YIIYEKRLYRDKDCLFPSRCSGIEYFIKKIISHLKNTELIINTRDWPQI-SRHF-----K 164
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDR 233
P+F +S + +DI++P WSFW G I P W+ ++L E ++ W +
Sbjct: 165 LFGPVFSFSKTQDYLDIMYPAWSFWEGGPAIKTYPTGLGRWDLHRKKLSEESSKWPWNKK 224
Query: 234 EPYAYWKGNPFVAETRRDLLT 254
+ +++G+ +RD+L
Sbjct: 225 KSIGFFRGSR--TSEKRDVLV 243
>gi|380022449|ref|XP_003695058.1| PREDICTED: O-glucosyltransferase rumi homolog [Apis florea]
Length = 407
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 25/218 (11%)
Query: 41 ITTPLNCV--KNQTQTCPTNYPKT-SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
I + +CV KN + Y K + +E+ DY I + + I DL P+K
Sbjct: 30 IGSTEDCVEEKNIYKVLNKQYSKYYNAIEEAEKDYRICNNTNNSCFKDVIINDLKPFKEK 89
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GI +D+++ A F I + K+Y K +R GI L K L D++
Sbjct: 90 GINKDLIDTAKIRGTFYQI-IQGKLYRQK-DCMFPSRCA----GIEYFLLKLAPGLTDMD 143
Query: 158 LMFDCDDRPVIRSRDY--SGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP- 213
L VI RDY S + P P+F +S DI +P W+FW G I++ P
Sbjct: 144 L--------VINVRDYPQSSKHFGDPLPIFSFSKTSQYYDITYPAWAFWEGGPAISLYPR 195
Query: 214 ----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
W+ L + +N W +E +++G+ +E
Sbjct: 196 GLGRWDEHCISLDKASNNTLWEKKENKVFFRGSRTSSE 233
>gi|322791349|gb|EFZ15836.1| hypothetical protein SINV_05265 [Solenopsis invicta]
Length = 408
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 100/222 (45%), Gaps = 32/222 (14%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGIL 143
I DL P++ GI+ +M++ A F + + K+Y + +D GI
Sbjct: 82 ILRDLGPFRTKGISEEMIQAARTRGTFYQV-IKGKLY--------REKDCMFPARCAGIE 132
Query: 144 QLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSF 203
L K G L D++L+ + D P +S +Y G P+F +S DI++P W+F
Sbjct: 133 HFLLKVIGNLSDMDLVINTRDYP--QSSEYFG----NAMPVFSFSKTPQYYDIMYPAWAF 186
Query: 204 W-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
W G I++ P W+ + L + + W +E +++G+ +E R +L+ +
Sbjct: 187 WEGGPAISLYPHGLGRWDQHRKSLNKASLEIPWEKKESKGFFRGSRTSSE-RDNLILLSR 245
Query: 258 SDKHDWNARLYVQDWILESKRG------FQQSNLASQCAHRY 293
S H +A+ Y ++ +S + +L S C ++Y
Sbjct: 246 SKPHLVDAQ-YTKNQAWKSNEDTLHATPASEVSLESHCTYKY 286
>gi|340719860|ref|XP_003398363.1| PREDICTED: o-glucosyltransferase rumi homolog [Bombus terrestris]
Length = 404
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 96/218 (44%), Gaps = 25/218 (11%)
Query: 41 ITTPLNCVKNQTQTCPTNYPKT-----SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK 95
I P NC++ + + K + +E+ +Y I + Y I DL P+K
Sbjct: 27 IDNPENCIEKKDNIYKKDLNKQYIKYYNAIEEAQKNYKICNNTNNGCYKDIIINDLKPFK 86
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
GI++D++ A I + K+Y K +R GI L K L D
Sbjct: 87 KKGISKDLINIAKTRGTVYQI-IQGKLYRQK-DCMFPSR----CSGIEHFLLKLAPGLTD 140
Query: 156 LELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP- 213
++L+ + D P +S + G GP P+FR + + + DI +P W+FW G I++ P
Sbjct: 141 MDLVINVRDYP--QSSKHFG----GPLPIFRLTPEYY--DITYPAWAFWEGGPAISLYPR 192
Query: 214 ----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
W+ L + + W ++E A+++G+ +E
Sbjct: 193 GLGRWDEHRVSLDKASKNTLWEEKENKAFFRGSRTSSE 230
>gi|195443898|ref|XP_002069626.1| GK11622 [Drosophila willistoni]
gi|194165711|gb|EDW80612.1| GK11622 [Drosophila willistoni]
Length = 377
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 46/214 (21%), Positives = 95/214 (44%), Gaps = 35/214 (16%)
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
Q + NY S + ++ C + + +++DL+ ++ TG++R M+E + +
Sbjct: 18 QIENAMANYEPCSSDESDVN---------CSCHEKVLNQDLAAYQSTGVSRQMIESSARY 68
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDDR 165
R + N++Y RD ++ GI LR+ LP+++L+ + D
Sbjct: 69 GT-RYKIYENQLY----------RDEKCMFPARCQGIEHFLRQLLPVLPNMDLIINTRDY 117
Query: 166 PVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI------NIKPWESLLR 219
P I + + N+ G P+F +S + DI++P W+FW I W+ +
Sbjct: 118 PQINT---AWGNSVGNGPVFSFSKTKEYRDIMYPAWTFWAGGPATRLHPRGIGRWDLMRE 174
Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+L++ W + +++G+ +E R L+
Sbjct: 175 KLEKRAAAIPWSQKRELGFFRGSR-TSEERDSLI 207
>gi|148230341|ref|NP_001086860.1| MGC83543 protein precursor [Xenopus laevis]
gi|50416390|gb|AAH77568.1| MGC83543 protein [Xenopus laevis]
Length = 386
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 98/220 (44%), Gaps = 29/220 (13%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+Q ++ +S P + ++R + EDL P++ +GI+RD++ Q R + + ++
Sbjct: 31 NQIHTAVQRHSPPDSHNGTCFYRVLQEDLEPFR-SGISRDLM----QNVLSRKLGTHYQI 85
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
H+ + + G+ L + LPD+EL+ + D P + P+ P
Sbjct: 86 INHRLYREEECMFSARCSGVEHFLLELLPNLPDMELVINVRDYPQV-------PSWMNPV 138
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P W+FW G A I P W+ + +LK+ + W +
Sbjct: 139 IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLKKAADLWPWEKKI 198
Query: 235 PYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNA 265
P Y++G +P + +R DL+ + W +
Sbjct: 199 PKGYFRGSRTSPDRDPLILLSRESPDLVDAEYTKNQAWKS 238
>gi|156366125|ref|XP_001626991.1| predicted protein [Nematostella vectensis]
gi|156213886|gb|EDO34891.1| predicted protein [Nematostella vectensis]
Length = 412
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 95/207 (45%), Gaps = 28/207 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKV 122
+ +++S Y C Y I +DL WK GI + + A ++ H+++I N+ +
Sbjct: 67 KINQAVSTYRGCSNKGCGCYKDVIEDDLRRWK-DGINKSDFDAARSRGTHYQII--NHML 123
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y + F GI L + +LPD+E+ +I +RD+ GP
Sbjct: 124 Y-----REDDCMFPFRCKGIEHFLLEVINKLPDMEI--------IINTRDWPQAAVWGPA 170
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S + MDI++P W+FW G A I P W+ + L + + W +E
Sbjct: 171 LPIFSFSKTKNEMDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREALDKKSQEWPWEKKE 230
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKH 261
A+++G+ AE RD L LS KH
Sbjct: 231 SKAFFRGSRTSAE--RDPLVL-LSRKH 254
>gi|134114475|ref|XP_774166.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256799|gb|EAL19519.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 437
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 79/181 (43%), Gaps = 35/181 (19%)
Query: 98 GITRDMLERANQTAHFRLILVNN--------KVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
GI++ M++ A + + RL+++NN ++++ YK I TR I + L
Sbjct: 88 GISKKMVDEAEEEGNARLVILNNQLISHLGRQLFVKAYKGGINTRTQAAIAAVYGALLTA 147
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAE 208
LPD++ VI++ D G N+ P F D+ + PD+ F+ W E
Sbjct: 148 TEPLPDVDF--------VIQTSDAGGGNH----PHFALCRKADQKDLWLMPDFGFFSWPE 195
Query: 209 INIKPW-----ESLLRELKEG--------NNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+ + ++L EL G +W ++ +W+G+P V E R DLL
Sbjct: 196 PGVGSYSEVRTKTLDYELDMGLEVDDKLDVTKSDWFNKTQQLFWRGSPMV-EVRNDLLRA 254
Query: 256 N 256
+
Sbjct: 255 S 255
>gi|241671163|ref|XP_002400011.1| conserved hypothetical protein [Ixodes scapularis]
gi|215506249|gb|EEC15743.1| conserved hypothetical protein [Ixodes scapularis]
Length = 410
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 66/137 (48%), Gaps = 16/137 (11%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
TC + I DL W +GIT+D+++R+ ++ H+++I N+K+Y + F
Sbjct: 78 TCSCHSAVIDSDLQLWSESGITKDLVQRSKSRGIHYQII--NHKLY-----RGEDCLFPF 130
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
GI L + LPDLE + + D P R P P+F +S DI+
Sbjct: 131 RCSGIEHFLLEIVDELPDLEFIVNTRDWPQAHKR-------YDPLPVFSFSKTPDYADIM 183
Query: 198 FPDWSFW-GWAEINIKP 213
+P W+FW G I++ P
Sbjct: 184 YPAWTFWAGGPAISLYP 200
>gi|442757169|gb|JAA70743.1| Hypothetical protein [Ixodes ricinus]
Length = 507
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 21/182 (11%)
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLI---LVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
D + + +D L+R ++ +V N+VY Y Q + ++F +L L
Sbjct: 172 DFEKVDMKKVLQDALQRFDRPGSVSFCHYAVVKNQVYRKCYGQHVGF-NMFMDQILLSLA 230
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
RK LPD+E++ + D P+ R +DY G P P F + G T DIV P +
Sbjct: 231 RKVV--LPDVEMLVNLGDWPLER-KDYWGK----PVPFFSWCGSNSTRDIVMPTYDL-TE 282
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-------PFVAETRR--DLLTCNL 257
+ + + +L +G+ G W D+EP +W+G VA +RR +LL +L
Sbjct: 283 SSLEMMGRVTLDMLSVQGHGGPAWKDKEPSGFWRGRDSRQERLDLVALSRRYPELLNASL 342
Query: 258 SD 259
++
Sbjct: 343 TN 344
>gi|345796207|ref|XP_545115.3| PREDICTED: protein O-glucosyltransferase 1 [Canis lupus familiaris]
Length = 392
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 106/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGIISEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P I P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQI-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|158297147|ref|XP_317424.4| AGAP008037-PA [Anopheles gambiae str. PEST]
gi|157015054|gb|EAA12302.4| AGAP008037-PA [Anopheles gambiae str. PEST]
Length = 513
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 84/199 (42%), Gaps = 24/199 (12%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQS 130
T CP I DL P++ + ++ +H +++ N+V+ Y Q
Sbjct: 158 TVGCPAGDPQIDMDLIPFRAINFSSLRTRMIQQYDKPGSISHCNYVILRNQVHRRCYGQH 217
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-NKGPPPLFRYSG 189
F +L L RK+ LPD+E+ + D P+++ GP+ GP P+F + G
Sbjct: 218 TGFSK-FMDTILLSLARKFT--LPDMEMFVNLGDWPLVKK---GGPSRTTGPYPIFSWCG 271
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN------- 242
T DIV P + N+ +L L G W D+ A+W+G
Sbjct: 272 SDDTFDIVMPTYDITESTLENMG--RVMLDMLSVQRRGLPWADKHAKAFWRGRDARRERL 329
Query: 243 PFVAETRR--DLLTCNLSD 259
VA +RR +LL +L++
Sbjct: 330 ELVALSRRYPELLNASLTN 348
>gi|328780644|ref|XP_003249836.1| PREDICTED: o-glucosyltransferase rumi homolog, partial [Apis
mellifera]
Length = 391
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 91/213 (42%), Gaps = 23/213 (10%)
Query: 41 ITTPLNCVK----NQTQTCPTNYPKT-SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK 95
I +CVK N + +Y K + +E+ DY + + I DL P+K
Sbjct: 28 IDNTEDCVKEKKVNIYEGLNKHYSKYYNAIEEAEKDYRVCNNINNNCFKDVIINDLKPFK 87
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
GI +D+++ A F I + K+Y K +R GI L K L D
Sbjct: 88 EKGINKDLIDIAKIRGTFYQI-IQGKLYRQK-DCMFPSRCA----GIEYFLLKLAPGLTD 141
Query: 156 LELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP- 213
++L+ + D P +S Y G P P+F +S DI +P W+FW G I++ P
Sbjct: 142 MDLVINVRDYP--QSSKYFG----DPLPIFSFSKTSQYYDITYPAWAFWEGGPAISLYPR 195
Query: 214 ----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
W+ L + +N W +E +++G+
Sbjct: 196 GLGRWDEHCISLDKASNNTLWEKKENKVFFRGS 228
>gi|350416880|ref|XP_003491149.1| PREDICTED: O-glucosyltransferase rumi homolog [Bombus impatiens]
Length = 407
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 84/188 (44%), Gaps = 18/188 (9%)
Query: 66 QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIH 125
+E+ +Y I + Y I DL P+K GI++D++ A I + K+Y
Sbjct: 58 EEAQKNYKICNNTNNGCYKDIIINDLKPFKKKGISKDLINIAKTRGTVYQI-IQGKLYRQ 116
Query: 126 KYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLF 185
K +R GI L K L D++L+ + D P +S + G GP P+F
Sbjct: 117 K-DCMFPSR----CSGIEHFLLKLAPGLTDMDLVINVRDYP--QSSKHFG----GPLPIF 165
Query: 186 RYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYW 239
+S DI +P W+FW G I++ P W+ L + + W ++E A++
Sbjct: 166 SFSKTPEYYDITYPAWAFWEGGPAISLYPRGLGRWDEHRVSLDKASKNTLWEEKENKAFF 225
Query: 240 KGNPFVAE 247
+G+ +E
Sbjct: 226 RGSRTSSE 233
>gi|403288562|ref|XP_003935467.1| PREDICTED: protein O-glucosyltransferase 1 [Saimiri boliviensis
boliviensis]
Length = 391
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 108/250 (43%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ DY + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 37 QINRSLEDYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 93
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R IL+++R RLPD+E++ + D P + P
Sbjct: 94 RLY-RENDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 141
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 201
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 202 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 256
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 257 HLVEHCKYKY 266
>gi|410970561|ref|XP_003991747.1| PREDICTED: protein O-glucosyltransferase 1 [Felis catus]
Length = 447
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 107/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 93 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 149
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 150 RLY-RENDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 197
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 198 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKR 257
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 258 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 312
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 313 HLVDHCKYKY 322
>gi|442759745|gb|JAA72031.1| Putative o-glucosyltransferase rumi [Ixodes ricinus]
Length = 410
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 66/137 (48%), Gaps = 16/137 (11%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
TC + I DL W +GIT+D+++R+ ++ H+++I N+K+Y + F
Sbjct: 78 TCSCHSAVIDNDLQLWSESGITKDLIQRSKSRGIHYQII--NHKLY-----RGEDCLFPF 130
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
GI L + LPDLE + + D P R P P+F +S DI+
Sbjct: 131 RCSGIEPFLLEIVDELPDLEFIVNTRDWPQAHKR-------YDPLPVFSFSKTPDYADIM 183
Query: 198 FPDWSFW-GWAEINIKP 213
+P W+FW G I++ P
Sbjct: 184 YPAWTFWAGGPAISLYP 200
>gi|307172256|gb|EFN63761.1| CAP10 family protein AGAP004267 [Camponotus floridanus]
Length = 406
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/271 (22%), Positives = 117/271 (43%), Gaps = 41/271 (15%)
Query: 47 CVKNQTQTCPTNYPKT---------SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
C +N+ +Y K + Q++ DY + + I DLSP+K
Sbjct: 31 CKENEIDHQKLSYAKAINEGYQKYLTAIQDAEQDYKECNNTKHSCFKDVIVRDLSPFKKK 90
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLP 154
GI+ +M+ A F I + K+Y + +D GI L K G L
Sbjct: 91 GISEEMIVAAKNRGTFYQI-IGGKLY--------REKDCMFPARCAGIEHFLLKVIGNLS 141
Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
+++L+ + D P +S +Y G P+F +S DI++P W+FW G I++ P
Sbjct: 142 NMDLIINTRDYP--QSSEYFG----NAMPIFSFSKTSQYYDIMYPAWAFWEGGPAISLYP 195
Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
W+ + L + ++ W ++ +++G+ +E R +L+ + + H +A+ Y
Sbjct: 196 RGLDRWDRHRKLLNKASSEIPWEKKDNKGFFRGSRTSSE-RDNLILLSRNKPHLVDAQ-Y 253
Query: 269 VQDWILESKRG------FQQSNLASQCAHRY 293
++ +S+ + +L S C ++Y
Sbjct: 254 TKNQAWKSEEDTLHAPPASEVSLESHCTYKY 284
>gi|125773715|ref|XP_001358116.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
gi|121991819|sp|Q29AU6.1|RUMI_DROPS RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
gi|54637851|gb|EAL27253.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 24/175 (13%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ C + I DL+P+K TG++R M+E + + K+Y + +
Sbjct: 69 ANCSCHAAVIKSDLAPYKATGVSRQMIESSARYG------TRYKIYEKRLYREENCMFPA 122
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWT 193
GI L LPD++L VI +RDY N N P+ +S +
Sbjct: 123 RCQGIEHFLLPLVATLPDMDL--------VINTRDYPQINMAWGNGAQGPILSFSKTKDH 174
Query: 194 MDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
DI++P W+FW G + P W+ + +L++ W + +++G+
Sbjct: 175 RDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229
>gi|402859114|ref|XP_003894014.1| PREDICTED: protein O-glucosyltransferase 1 [Papio anubis]
Length = 392
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 106/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|307194254|gb|EFN76650.1| CAP10 family protein AGAP004267 [Harpegnathos saltator]
Length = 342
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 97/218 (44%), Gaps = 24/218 (11%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
I DL P+K GI+ +M+ A F I+ K +++ K + GI L
Sbjct: 81 ILRDLRPFKKKGISEEMINAAKTRGTFYQII---KGTLYREKDCMFPA---RCAGIEHFL 134
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-G 205
K G L D++L+ + D P +S +Y G P+F +S DI++P W+FW G
Sbjct: 135 LKVIGNLSDMDLVINTRDYP--QSSEYFG----NLLPIFSFSKTPQYYDIMYPAWAFWEG 188
Query: 206 WAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
I++ P W+ + L + + W +E +++G+ +E R +L+ + +
Sbjct: 189 GPAISLYPRGLGRWDQHRKSLNKASLETPWEKKESKGFFRGSRTSSE-RDNLILLSRNKS 247
Query: 261 HDWNARLYV-QDWILESKRGFQQ----SNLASQCAHRY 293
H +A+ Q W Q+ +L S C ++Y
Sbjct: 248 HLVDAQYTKNQAWKSNEDTLHQEPASVVSLESHCGYKY 285
>gi|440897644|gb|ELR49289.1| KTEL motif-containing protein 1 [Bos grunniens mutus]
Length = 392
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 107/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PAIPIFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 203 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 257
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 258 HLVDHCKYKY 267
>gi|195143519|ref|XP_002012745.1| GL23774 [Drosophila persimilis]
gi|194101688|gb|EDW23731.1| GL23774 [Drosophila persimilis]
Length = 409
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 24/175 (13%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ C + I DL+P+K TG++R M+E + + K+Y + +
Sbjct: 69 ANCSCHAAVIKSDLAPYKATGVSRQMIESSARYG------TRYKIYEKRLYREENCMFPA 122
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWT 193
GI L LPD++L VI +RDY N N P+ +S +
Sbjct: 123 RCQGIEHFLLPLVATLPDMDL--------VINTRDYPQINMAWGNGAQGPILSFSKTKDH 174
Query: 194 MDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
DI++P W+FW G + P W+ + +L++ W + +++G+
Sbjct: 175 RDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229
>gi|426217502|ref|XP_004002992.1| PREDICTED: protein O-glucosyltransferase 1 [Ovis aries]
Length = 392
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 107/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PAIPIFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 203 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 257
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 258 HLVDHCKYKY 267
>gi|194222810|ref|XP_001500762.2| PREDICTED: protein O-glucosyltransferase 1 [Equus caballus]
Length = 392
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 106/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ---TAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ + H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-AGISRKMMAEVVKRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + G LPD+E++ + D P + P
Sbjct: 95 RLY-RENDCMFPSR----CSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLARSAAQWPWKK 202
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K Q
Sbjct: 203 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAQDV 257
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 258 HLVDHCKYKY 267
>gi|109033215|ref|XP_001109371.1| PREDICTED: KTEL motif-containing protein 1-like [Macaca mulatta]
gi|355559357|gb|EHH16085.1| hypothetical protein EGK_11322 [Macaca mulatta]
gi|355746435|gb|EHH51049.1| hypothetical protein EGM_10372 [Macaca fascicularis]
Length = 392
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 106/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|380786923|gb|AFE65337.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
Length = 392
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 106/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|443727132|gb|ELU14012.1| hypothetical protein CAPTEDRAFT_20245 [Capitella teleta]
Length = 371
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 85/191 (44%), Gaps = 27/191 (14%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S + ++++YS C + I DL PWK GIT + ++A ++N+K+
Sbjct: 18 SSIENALAEYSPCNPDDCSCHLGVIESDLKPWK-NGITEQLFQQAKARGSNHYQIINHKL 76
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y +S + GI + + +LPD+E ++ RD+ + G P
Sbjct: 77 Y-----RSEKCMFPSRCSGIEHFILEVIHKLPDMEF--------ILNERDWPQASIHGAP 123
Query: 183 -PLFRYSG---DRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P+F +S D W DI++P W+FW G A I P W+ + + E W
Sbjct: 124 LPIFSFSKVPTDNW--DIMYPAWTFWEGGPAVWPIYPTGLGRWDEQRKIIPEAAKKWPWH 181
Query: 232 DREPYAYWKGN 242
++ A+++G+
Sbjct: 182 KKQSKAFFRGS 192
>gi|345497421|ref|XP_001601540.2| PREDICTED: O-glucosyltransferase rumi homolog [Nasonia vitripennis]
Length = 400
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 104/245 (42%), Gaps = 30/245 (12%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S +E+ ++Y+ + C + I DL + GI + ++E A F I + K+
Sbjct: 54 SAIEEAEANYAECDSKKCKCFGDMITRDLKTFAERGIDQKLIEAARPRGTFYQI-IGGKL 112
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY--SGPNNKG 180
Y K +R GI L K G + D+ L V+ +RDY S +
Sbjct: 113 YREK-NCMFPSRCA----GIEHFLLKVIGNVSDVSL--------VVNTRDYPQSSRHFGQ 159
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P P+F +S DI++P W FW G I++ P W+ + L E W +E
Sbjct: 160 PLPVFSFSKTPDYYDIMYPAWVFWEGGPAISLYPRGLGRWDLHRKSLNEAREETPWEKKE 219
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRG------FQQSNLASQ 288
A+++G+ +E RD L K Y ++ +S++ ++ +L S
Sbjct: 220 EKAFFRGSRTSSE--RDNLVLLSRAKPQLVDAQYTKNQAWKSEKDTLNMPPAKEVSLESH 277
Query: 289 CAHRY 293
C+++Y
Sbjct: 278 CSYKY 282
>gi|62460496|ref|NP_001014903.1| protein O-glucosyltransferase 1 precursor [Bos taurus]
gi|75057820|sp|Q5E9Q1.1|PGLT1_BOVIN RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
motif-containing protein 1; Flags: Precursor
gi|59858103|gb|AAX08886.1| x 010 protein [Bos taurus]
gi|296491446|tpg|DAA33499.1| TPA: KTEL motif-containing protein 1 precursor [Bos taurus]
Length = 392
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 107/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PAIPIFSFSKTLEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 203 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 257
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 258 HLVDHCKYKY 267
>gi|383420127|gb|AFH33277.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
Length = 392
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 106/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNCTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|119497671|ref|XP_001265593.1| hypothetical protein NFIA_032600 [Neosartorya fischeri NRRL 181]
gi|119413757|gb|EAW23696.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 458
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 74/181 (40%), Gaps = 15/181 (8%)
Query: 81 PDYFRWIHEDLSPWKV-TGITRDMLERA-NQTAHFRLILVNNKVYI--HKYKQSIQTRDV 136
P F+ IH + WK GI+RD L + R I+ N +Y+ + K R +
Sbjct: 108 PGLFQDIHRGVEYWKSRGGISRDDLNAVPFEDGMARAIISNGDLYVVATRAKGDDHRRKI 167
Query: 137 F-TIWGILQLLRKYPGRL--PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
T+ I + L R P +E +F +DR N G P
Sbjct: 168 VGTLGSIHRALSASSNRTSHPTIEFIFSIEDR-------VDDVNAVGHPVWVLSRKAFEE 220
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDL 252
I+ PD+ FW WA+ NI P+ ++ + + + D+E W+G F + RR L
Sbjct: 221 SVILMPDFGFWSWAKSNIGPYGQVVERIMAAESKLKFADKEQKLVWRGKLSFAPKLRRVL 280
Query: 253 L 253
L
Sbjct: 281 L 281
>gi|378732944|gb|EHY59403.1| hypothetical protein HMPREF1120_07393 [Exophiala dermatitidis
NIH/UT8656]
Length = 412
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 46/221 (20%), Positives = 94/221 (42%), Gaps = 28/221 (12%)
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
P F + L+ + + IT + L+ Q + R ++ + ++Y+ + I +R++ T+
Sbjct: 71 PGLFEEVERPLADRRNSSITLEELDSVPRQNGYVRGMIYDQQLYVIEKTGGIYSRELATL 130
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDR-PVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
+ + + P LP++E +F+ DDR P + Y+ R D T +
Sbjct: 131 HALHRAIISAPEPLPNIEFVFNSDDRIPSVAIWGYA-----------RREQD--TKIWLI 177
Query: 199 PDWSFWGWAEINIKPWESLLRELK------EGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
PD+ +W W E + +RE++ E ++G +W + P W+G E R +
Sbjct: 178 PDFGYWSWPETKV----GTMREVQMKAVETEQDDGWSWSSKVPKLLWRGATMGLELRENF 233
Query: 253 LTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
L + W A + +W + ++ C ++Y
Sbjct: 234 LKA--AADQPW-ADVKALEWKNKESMAHDLKSMPEHCQYKY 271
>gi|71018113|ref|XP_759287.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
gi|46099137|gb|EAK84370.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
Length = 473
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 20/192 (10%)
Query: 78 STCPDYFRWIHEDLSP----WKVTG--ITRDMLERANQTAH--FRLILVNNKVYIHKYKQ 129
STC F ++ L WK G + D+ A+ H LI+ + +++I K+
Sbjct: 76 STCKAEFSRLYPQLIANQIAWKSKGGILYADVKNAADNCRHGCVHLIIKHGQIFIRAQKK 135
Query: 130 SIQTRDVFTIWGILQLLRK-YPGRLPDLELMFDCDDRPVIRSRDYSG---PNNKGPPPLF 185
Q+R + LQLL K Y G + + + + VI + D+ G PN++G +
Sbjct: 136 DWQSR----VRSTLQLLDKAYSGASEHEKALMEATEL-VISTADFDGFTDPNSRGAGWVL 190
Query: 186 RYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV 245
+ +FPD+SF W E I ++ R+ ++ N W + A+W+G+
Sbjct: 191 DKRVNDTQGQYLFPDFSFASWPEAGIPSYQEFRRQAQQVNAETPWKSKTNPAFWRGDALA 250
Query: 246 AET---RRDLLT 254
+ R LL+
Sbjct: 251 GQNIKPRESLLS 262
>gi|165993289|emb|CAP71956.1| unnamed protein product [Danio rerio]
Length = 500
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 75/168 (44%), Gaps = 17/168 (10%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I DLS +K R+ E +++ H+ + NN+VYI + + +
Sbjct: 152 CPASFSQIESDLSIFKSVDPDRNAHEVIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK +LPD+E + D P+ + R ++ P P+F + G T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
DIV P + + + SL +G+ G W + +W+G
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRG 308
>gi|343425735|emb|CBQ69269.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 488
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 62/154 (40%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D
Sbjct: 167 SHARVVIHNNRVYLKYFQQSAFTRVNSALALLFQSVVGSREKLPDAEFCLSANDW----- 221
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
G K Y D W M PD+ F+ W E I + + + + W
Sbjct: 222 ----GSMGKFSLDRAPYLEDLWLM----PDYGFYSWPEPGIGSYTEHREKTLQVEHDTPW 273
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ P +W+G V R L D HDWN
Sbjct: 274 ERKVPKLFWRGAMGVGTADRKALLAAAQD-HDWN 306
>gi|31982953|ref|NP_689518.1| protein O-glucosyltransferase 1 precursor [Homo sapiens]
gi|114588661|ref|XP_516666.2| PREDICTED: protein O-glucosyltransferase 1 isoform 2 [Pan
troglodytes]
gi|397509574|ref|XP_003825192.1| PREDICTED: protein O-glucosyltransferase 1 [Pan paniscus]
gi|426341692|ref|XP_004036160.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Gorilla
gorilla gorilla]
gi|74730148|sp|Q8NBL1.1|PGLT1_HUMAN RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; Short=hCLP46; AltName:
Full=KTEL motif-containing protein 1; AltName:
Full=Myelodysplastic syndromes relative protein; Flags:
Precursor
gi|22761537|dbj|BAC11625.1| unnamed protein product [Homo sapiens]
gi|29179612|gb|AAH48810.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
gi|31745813|gb|AAP56253.1| myelodysplastic syndromes relative [Homo sapiens]
gi|37182284|gb|AAQ88944.1| MDS010 [Homo sapiens]
gi|119599976|gb|EAW79570.1| chromosome 3 open reading frame 9 [Homo sapiens]
gi|189067501|dbj|BAG37760.1| unnamed protein product [Homo sapiens]
gi|410228232|gb|JAA11335.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410254852|gb|JAA15393.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410299780|gb|JAA28490.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410328909|gb|JAA33401.1| protein O-glucosyltransferase 1 [Pan troglodytes]
Length = 392
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 105/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|301758922|ref|XP_002915307.1| PREDICTED: KTEL motif-containing protein 1-like [Ailuropoda
melanoleuca]
Length = 380
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 106/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 26 QINRSLENYEPCSSQNCSCYHGVIDEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 82
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + G+LPD+E++ + D P + P
Sbjct: 83 RLY--------RENDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------PK 127
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 128 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 187
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 188 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 242
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 243 KDVHLVDHCKYKY 255
>gi|62897433|dbj|BAD96657.1| x 010 protein variant [Homo sapiens]
Length = 392
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 105/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|281345604|gb|EFB21188.1| hypothetical protein PANDA_003280 [Ailuropoda melanoleuca]
Length = 364
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 107/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 10 QINRSLENYEPCSSQNCSCYHGVIDEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 66
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + G+LPD+E++ + D P + P
Sbjct: 67 RLY-RENDCMFPSR----CSGVEHFILEVIGQLPDMEMVINVRDYPQV-------PKWME 114
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 115 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 174
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 175 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 229
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 230 HLVDHCKYKY 239
>gi|224170404|ref|XP_002339379.1| predicted protein [Populus trichocarpa]
gi|222874989|gb|EEF12120.1| predicted protein [Populus trichocarpa]
Length = 71
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 34/57 (59%), Gaps = 3/57 (5%)
Query: 237 AYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHR 292
AYWKGNP V R LL CN + K W A++ QDW E+K G+ S L+ QC +R
Sbjct: 5 AYWKGNPDVGSPIRTSLLECNHTKK--WGAQIMRQDWEEEAKGGYVSSKLSHQCDYR 59
>gi|297670280|ref|XP_002813298.1| PREDICTED: protein O-glucosyltransferase 1 [Pongo abelii]
Length = 392
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 105/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRRLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|344282265|ref|XP_003412894.1| PREDICTED: protein O-glucosyltransferase 1-like [Loxodonta
africana]
Length = 638
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 53/250 (21%), Positives = 104/250 (41%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNN 120
Q S+ +Y + C Y I +DL+P++ GI++ M+ H+++I N
Sbjct: 284 QINRSLENYEPCSSQDCSCYHGVIEQDLTPFR-GGISKKMMAEVVGRKLGTHYQII--KN 340
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 341 RLY-RENDCMFPSR----CGGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 388
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFWGWA-------EINIKPWESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW I + W+ +L W
Sbjct: 389 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPIGLGRWDLFREDLARSAAQWPWKK 448
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 449 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 503
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 504 HLVDHCKYKY 513
>gi|169617900|ref|XP_001802364.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
gi|111059424|gb|EAT80544.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
Length = 428
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 53/104 (50%), Gaps = 12/104 (11%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E F DD P +RS++ + G K P + W M P++++W W
Sbjct: 152 IPNIEFSFSLDDLP-LRSKEKGAFFGYTRKDTPEY----DNIWMM----PNYAYWSWNYT 202
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W S+ RE+++G W ++P W+G +AE R++L+
Sbjct: 203 HAPSWNSIRREIEQGEKKTPWHKKDPRVVWRGKIKMAELRKELV 246
>gi|157125824|ref|XP_001660800.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
gi|108882653|gb|EAT46878.1| AAEL001982-PA [Aedes aegypti]
Length = 505
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 80/189 (42%), Gaps = 18/189 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP+ I DL P++ + ++ + ++ N++Y Y +
Sbjct: 152 CPNSDPQIESDLIPFRAVNFSSLRPRIIQQYDKPGSVSLCNYVIKENQIYRTCYGRYTGF 211
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ ++ +L L RK LPD+EL + D P++ + GP P+F + G T
Sbjct: 212 K-MYMDALLLSLARK--AILPDMELFVNLGDWPLVTKGGHR--RTTGPYPIFSWCGSEDT 266
Query: 194 MDIVFPDWSFWGWAEINIKPWESL-LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
DIV P + E ++ + L L G W D+EP A+W+G E R DL
Sbjct: 267 FDIVMPTYDI---VEATLEAMNRVTLDMLSVQRKGIPWKDKEPKAFWRGRDACRE-RLDL 322
Query: 253 LTCNLSDKH 261
+ +S KH
Sbjct: 323 V--GISQKH 329
>gi|449669688|ref|XP_002167607.2| PREDICTED: KDEL motif-containing protein 1-like [Hydra
magnipapillata]
Length = 497
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 75/166 (45%), Gaps = 14/166 (8%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQT---AHFRLILVNNKVYIHKYKQSIQTRD 135
CP ++ I +DL P+ + +++E A ++ A + NKVY Y +I
Sbjct: 146 ACPVNYQQIEKDLKPFPNINL-ENLIESATKSYNVAFCHYTIKKNKVYRKCY-GTINDFK 203
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FT +L + RK +LPD+E + D P+ R P P+F + G T D
Sbjct: 204 MFTDAWLLSVARKV--KLPDVEFFTNLGDWPLTTKR-------FNPMPIFSWCGSNDTFD 254
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+V+P + SL +GN G +W ++P A+++G
Sbjct: 255 LVWPTYDLTESTLETFGGRVSLDMTSIQGNTGPSWNHKKPVAFFRG 300
>gi|21040486|gb|AAH30614.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
gi|312150392|gb|ADQ31708.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [synthetic construct]
Length = 392
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 105/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRRMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 200 WKKKNSTAYFRGSRTSPERDPLILLSRKNTKLVDAEYTKNQAWKS---MKDTL--GKPAA 254
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 255 KDVHLVDHCKYKY 267
>gi|296226200|ref|XP_002758827.1| PREDICTED: protein O-glucosyltransferase 1 [Callithrix jacchus]
Length = 391
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 107/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 37 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 93
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R IL+++R RLPD+E++ + D P + P
Sbjct: 94 RLY-RENDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 141
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 201
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W ++D + K +
Sbjct: 202 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKQ---LKDTL--GKPAAKDV 256
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 257 HLVDHCKYKY 266
>gi|307170880|gb|EFN62991.1| KDEL motif-containing protein 1 [Camponotus floridanus]
Length = 458
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 88/194 (45%), Gaps = 32/194 (16%)
Query: 80 CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
C + IH DLSP+ I +++R +Q + +L +N++Y Y Q +
Sbjct: 111 CRQNYTQIHNDLSPFLNIDFDKIRHSIIKRYDQPTSVSICHYVLRSNRIYRRCYGQYVGF 170
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD+E + D P++ P+ P+F + G T
Sbjct: 171 K-IFMDAILLSLARKVV--LPDIEFFVNLGDWPLV-------PDTGPLYPIFSWCGSDNT 220
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL-----KEGNNGRNWIDREPYAYWKGNPFVAET 248
DIV P + +I E++ R + +GN G +W ++ +W+G ++
Sbjct: 221 KDIVMPTY------DITESSLEAMGRVMLDTLSVQGNTGLSWKNKTEQLFWRGR----DS 270
Query: 249 RRDLLTC-NLSDKH 261
RR+ L ++S KH
Sbjct: 271 RRERLNLIDISRKH 284
>gi|198431555|ref|XP_002127452.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 495
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 75/168 (44%), Gaps = 19/168 (11%)
Query: 80 CPDYFRWIHEDLSPWKVTGIT---RDMLER---ANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP +R I EDL P KV ++ + ++ER + H+ +VNN+VY + SI
Sbjct: 146 CPPTYRQIEEDLKPHKVIELSTFAKRIVERFTPHHSVCHY--AVVNNQVY-RQCHGSITD 202
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+F +L L RK +LPD E + D P+ +S D P P+ + G T
Sbjct: 203 FKMFMDAPLLSLTRKV--KLPDFEFFINLGDWPLEKSHD-------DPLPIISWCGSDGT 253
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
DI+ P + + + + SL + N G W + +++G
Sbjct: 254 HDIILPTYDITN-SVLEMLGRVSLDMFSVQANTGPRWGKKIAKGFFRG 300
>gi|432113196|gb|ELK35717.1| Protein O-glucosyltransferase 1, partial [Myotis davidii]
Length = 364
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 105/253 (41%), Gaps = 46/253 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 10 QINRSLENYEPCSSQNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 66
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + G LPD+E++ + D P + P
Sbjct: 67 RLY--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PK 111
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 112 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 171
Query: 230 WIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGF 280
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 172 WKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAA 226
Query: 281 QQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 227 KDVHLVDHCKYKY 239
>gi|388855380|emb|CCF51044.1| uncharacterized protein [Ustilago hordei]
Length = 493
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D
Sbjct: 172 SHARVLIHNNRVYLKYFQQSAFTRVNSALALLFQSVVASKEKLPDAEFCLSANDW----- 226
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
G K Y D W M PD+ F+ W E I + + E W
Sbjct: 227 ----GSMGKFSLDRAPYLEDLWLM----PDYGFYSWPEPGIGSYTEHREKTLEVEKATPW 278
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
++ +W+G V R L SD H WN
Sbjct: 279 ENKISKLFWRGAMGVGTADRKALLAAASD-HPWN 311
>gi|326913914|ref|XP_003203277.1| PREDICTED: KDEL motif-containing protein 1-like [Meleagris
gallopavo]
Length = 572
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 73/170 (42%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DL+ + V I ++ +R Q + +N+VYI Y + + R
Sbjct: 224 CPQVFPQIQRDLANFPVVDPDKIAIEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR- 282
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
+F +L L RK ++PD+E + D P+ K PP P+F + G
Sbjct: 283 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPL---------EKKKPPQNLHPIFSWCGSS 331
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DIV P + + + SL + N G +W D+ A+W+G
Sbjct: 332 ESKDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPSWEDKNTTAFWRG 380
>gi|156401513|ref|XP_001639335.1| predicted protein [Nematostella vectensis]
gi|156226463|gb|EDO47272.1| predicted protein [Nematostella vectensis]
Length = 497
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 71/166 (42%), Gaps = 15/166 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRLI---LVNNKVYIHKYKQSIQTRD 135
CP + I DL+ + + R E ++ H L ++NNKVY + + +
Sbjct: 150 CPQNYSQIDRDLARFPEINLIRLAKEAVDRFGVHHALCHYSIINNKVYRKSHGEHVGF-S 208
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F+ I L RK LPD+E + D P+ + +D P P+ + G T D
Sbjct: 209 MFSDAIIHSLARKV--HLPDMEFFVNLGDWPLEKRKD-------NPIPILSWCGSEDTAD 259
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + A + SL + N G W D+ P A+W+G
Sbjct: 260 IVMPTYDL-TEAALETMGRVSLDMLSVQANTGPKWKDKIPKAFWRG 304
>gi|41055550|ref|NP_957225.1| KDEL motif-containing protein 1 precursor [Danio rerio]
gi|82188635|sp|Q7ZVE6.1|KDEL1_DANRE RecName: Full=KDEL motif-containing protein 1; Flags: Precursor
gi|28277832|gb|AAH45893.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Danio rerio]
Length = 500
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 75/168 (44%), Gaps = 17/168 (10%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I DLS ++ R+ E +++ H+ + NN+VYI + + +
Sbjct: 152 CPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK +LPD+E + D P+ + R ++ P P+F + G T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
DIV P + + + SL +G+ G W + +W+G
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRG 308
>gi|56207656|emb|CAI20990.1| novel protein (zgc:56065) [Danio rerio]
Length = 500
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 75/168 (44%), Gaps = 17/168 (10%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I DLS ++ R+ E +++ H+ + NN+VYI + + +
Sbjct: 152 CPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK +LPD+E + D P+ + R ++ P P+F + G T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
DIV P + + + SL +G+ G W + +W+G
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRG 308
>gi|417409544|gb|JAA51271.1| Putative protein o-glucosyltransferase 1, partial [Desmodus
rotundus]
Length = 306
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 80/187 (42%), Gaps = 20/187 (10%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVY 123
Q S+ +Y + C Y I EDL+P++ GI+R ++ + R + + ++
Sbjct: 54 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKVMVEVVR----RKLGTHYQII 108
Query: 124 IHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP- 182
H+ + G+ + + GRLPD+E++ + D P + P P
Sbjct: 109 KHRLYRENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWMEPAI 161
Query: 183 PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREP 235
P+F +S DI++P W+FW G A I P W+ EL W +
Sbjct: 162 PVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVRSAVQWPWKKKNS 221
Query: 236 YAYWKGN 242
AY++G+
Sbjct: 222 TAYFRGS 228
>gi|395844808|ref|XP_003795143.1| PREDICTED: protein O-glucosyltransferase 1 [Otolemur garnettii]
Length = 392
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 106/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ V N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIAEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--VKN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + G LPD+E++ + D P + P
Sbjct: 95 RLY-RENDCMFPSR----CSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 202
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 203 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTRNQAWKS---MKDTL--GKPAAKDV 257
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 258 HLVDHCKYKY 267
>gi|432849653|ref|XP_004066608.1| PREDICTED: protein O-glucosyltransferase 1-like [Oryzias latipes]
Length = 387
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 84/199 (42%), Gaps = 27/199 (13%)
Query: 68 SISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKY 127
++ Y + C + + +DL P++ GI+ ++ Q R + + ++ HK
Sbjct: 37 AVGSYVSCSSVNCSCHLSVLQQDLQPFR-GGISESLMASTVQ----RGMGTHYQIIQHKL 91
Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLFR 186
+ G+ + + RLPDLE++ + D P + PN P P+
Sbjct: 92 YREQNCMFPARCSGVEHFILEVIDRLPDLEMVVNVRDYPQV-------PNWMSPALPVLS 144
Query: 187 YSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYW 239
+S DI++P W+FW G A I P W+ + +LK+ W +EP ++
Sbjct: 145 FSKTAEYQDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRTDLKKSAAQWPWKKKEPKGFF 204
Query: 240 KG-------NPFVAETRRD 251
+G +P + +R D
Sbjct: 205 RGSRTSSERDPLILLSRED 223
>gi|357622883|gb|EHJ74243.1| hypothetical protein KGM_01635 [Danaus plexippus]
Length = 460
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 76/179 (42%), Gaps = 27/179 (15%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C ++ I +DL P++ + D R T+ ++ NK+Y Y + +
Sbjct: 150 CSQTYKQIEKDLIPFQRVEMKEQIKKIVDKYHRPESTSFCHYVIKENKIYRDCYGKHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
++F +L L RK LPD+E++ + D P+I N +F + G T
Sbjct: 210 -NMFADNILLSLSRK--TVLPDMEMVINLGDWPLIHK-------NGEKLAMFSWCGSDDT 259
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL-----KEGNNGRNWIDREPYAYWKGNPFVAE 247
+DIV P + +I E+L R +GN R W DRE A W+G AE
Sbjct: 260 LDIVMPTY------DITESTLENLGRVTLDTLSVQGNVERKWSDRETRAIWRGRDSRAE 312
>gi|312380729|gb|EFR26645.1| hypothetical protein AND_07142 [Anopheles darlingi]
Length = 459
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 70/169 (41%), Gaps = 15/169 (8%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP I DL P++ + ++ +H +++ N V+ Y Q
Sbjct: 100 CPPGDPQIDMDLIPFRAINFSSLRTRMIQQYDKPGSISHCNYVILRNDVHRRCYGQHTGF 159
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-NKGPPPLFRYSGDRW 192
F +L L RK+ LPD+EL + D P+++ GP+ GP P+F + G
Sbjct: 160 SK-FMDTILLSLARKF--SLPDMELFVNLGDWPLVKK---GGPSRTTGPYPIFSWCGSDD 213
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
T DIV P + N+ +L L G W D+ A+W+G
Sbjct: 214 TFDIVMPTYDITESTLENMG--RVMLDMLSIQKRGIPWPDKHRKAFWRG 260
>gi|118084661|ref|XP_416963.2| PREDICTED: KDEL motif-containing protein 1 [Gallus gallus]
Length = 500
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 73/170 (42%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DL+ + V I ++ +R Q + +N+VYI Y + + R
Sbjct: 152 CPQVFPQIQRDLANFPVVDPDKIAVEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR- 210
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
+F +L L RK ++PD+E + D P+ K PP P+F + G
Sbjct: 211 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPL---------EKKKPPQKLHPIFSWCGSS 259
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DIV P + + + SL + N G +W D+ A+W+G
Sbjct: 260 ESKDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPSWEDKNTTAFWRG 308
>gi|328709644|ref|XP_001945687.2| PREDICTED: o-glucosyltransferase rumi homolog [Acyrthosiphon pisum]
Length = 384
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 24/199 (12%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S+ ++ +Y+ + +F I +DL P+K GIT +M+ A R ++ N+ +
Sbjct: 37 SKINKAKKEYTSCESKNGTCFFPNILKDLEPFK-DGITHEMITAAADKGT-RYMIFNHDL 94
Query: 123 YIHKYKQSIQTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
Y +T+ +F GI L K PD+E + + D P I + Y P
Sbjct: 95 YR-------ETKCMFPARCEGIEHFLSKIQLNTPDVEFILNTRDWPQI-IKHYGDPK--- 143
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P WSFW G I + P W+SL + + + + W +
Sbjct: 144 --PVFSFSKTDDYADIMYPAWSFWSGGPAIKLHPSGLGRWDSLRKSILKQSEQWPWKRKI 201
Query: 235 PYAYWKGNPFVAETRRDLL 253
+++G+ +E R L+
Sbjct: 202 SKGFFRGSR-TSEQRDSLI 219
>gi|9437337|gb|AAF87313.1|AF168711_1 x 010 protein [Homo sapiens]
Length = 273
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|187608427|ref|NP_001120575.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus (Silurana)
tropicalis]
gi|171846847|gb|AAI61552.1| LOC100145729 protein [Xenopus (Silurana) tropicalis]
gi|189442192|gb|AAI67365.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
gi|195540008|gb|AAI68093.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
Length = 509
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 86/191 (45%), Gaps = 18/191 (9%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLER-----ANQTAHFRLILVNNKVYIHKYKQSI 131
T +CPD I +D P+ +TR ML+ A++ A ++NN +Y H +
Sbjct: 156 TLSCPDTEAQISKDFDPFPSIDLTR-MLDEVPKRFADRGAIVHYTVLNNNIYRHSMGRYT 214
Query: 132 QTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR 191
+ +F+ + L RK RLPD E + D PV + P GP P+ + G
Sbjct: 215 DFK-MFSDEMLQSLARKV--RLPDFEFYINVGDWPVEHRKANDTP---GPLPMISWCGSA 268
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
+ DI+ P + ++ + L + +G+ G +W ++ +++G ++R +
Sbjct: 269 DSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTGPSWSNKTEQGFFRGR----DSREE 323
Query: 252 -LLTCNLSDKH 261
L ++S KH
Sbjct: 324 RLQLVHMSRKH 334
>gi|258575361|ref|XP_002541862.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
gi|237902128|gb|EEP76529.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
Length = 2000
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/217 (23%), Positives = 86/217 (39%), Gaps = 35/217 (16%)
Query: 65 TQESISDYSIPPTS---TCPDYFRWIHEDLSPWKVTGIT-RDMLERANQTAHFRLILVNN 120
T E DY + P F I + +S + I ++ R + R I+ N
Sbjct: 1707 TSEHGDDYGLGRVQCRIAFPKLFAEIEKSVSARRGKKIEYEELASRTLEDGMVRAIIYNG 1766
Query: 121 KVYIHKYK--QSIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGP 176
++ I ++ + TR T+ + + L P R LP++E +F DD
Sbjct: 1767 ELRIVNFEGHEFTFTRAKATLSSLNRALTAIPDRRSLPNIEFIFSADD------------ 1814
Query: 177 NNKGPPPLFRYSG---DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRN---- 229
GP P++ YS D W + PD+ +W W E NI P+ + R + ++G +
Sbjct: 1815 FTHGPGPIWTYSKRDEDSWAW--LMPDFGYWSWPEANIGPYRQIRRRIAAIDDGDSVGGR 1872
Query: 230 ------WIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
+ ++ +W+GN A R+ DK
Sbjct: 1873 VRPGLKFQNKHQKLFWRGNIATAPVLRNKFLQVTHDK 1909
>gi|332225464|ref|XP_003261899.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Nomascus
leucogenys]
Length = 392
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 102/251 (40%), Gaps = 42/251 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD-MLERANQTAHFRLILVNNKV 122
Q S+ +Y + C Y I EDL+P++ GI+R M E + + N++
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKTMAEVVRRKLGTHYQITKNRL 96
Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 97 Y--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWM 141
Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWK 201
Query: 232 DREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 202 KKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKD 256
Query: 283 SNLASQCAHRY 293
+L C ++Y
Sbjct: 257 VHLVDHCKYKY 267
>gi|432964664|ref|XP_004086966.1| PREDICTED: KDEL motif-containing protein 1-like [Oryzias latipes]
Length = 504
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 83/189 (43%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I +DLS + R+ E + + H+ + +NKVYI + + +
Sbjct: 156 CPQAFAQIDQDLSVFTTVDPDRNAREIPPRFRQRHSLCHY--TIKDNKVYIKTFGEHVGF 213
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK LPD+E + D P+ + + P + P+F + G +
Sbjct: 214 R-IFMDAVLLSLTRKV--HLPDVEFFVNLGDWPLEKRK----PTEE-IHPIFSWCGSNSS 265
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL +GN G W ++ A+W+G ++RR+ L
Sbjct: 266 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGNTGPAWPEKNATAFWRGR----DSRRERL 320
Query: 254 T-CNLSDKH 261
LS H
Sbjct: 321 ELVQLSRAH 329
>gi|115496800|ref|NP_001069153.1| KDEL motif-containing protein 1 precursor [Bos taurus]
gi|111304522|gb|AAI19854.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
gi|296481621|tpg|DAA23736.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
Length = 502
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 81/189 (42%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDL------SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP+ F IH DL P K+ + H+ L +NKVYI + + +
Sbjct: 153 CPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHYS--LKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK ++PD+E + D P+ + + S P+ P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK--SSPHIH---PIFSWCGSTDS 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++RR+ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRRERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|348566951|ref|XP_003469265.1| PREDICTED: protein O-glucosyltransferase 1-like [Cavia porcellus]
Length = 392
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 56/251 (22%), Positives = 104/251 (41%), Gaps = 42/251 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDML-ERANQTAHFRLILVNNKV 122
Q ++ +Y + C Y I EDL+P++ GI+R M+ E ++ ++ N++
Sbjct: 38 QINRALENYEPCSSQNCSCYRGIIEEDLTPFR-GGISRKMMAEVVSRKLGTHYQVIRNRL 96
Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
Y + D G+ + + G LPD+E++ + D P I P
Sbjct: 97 Y--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQI-------PKWM 141
Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWE 201
Query: 232 DREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQ 282
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 202 KKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKD 256
Query: 283 SNLASQCAHRY 293
+L C ++Y
Sbjct: 257 VHLVDHCKYKY 267
>gi|296811324|ref|XP_002846000.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
gi|238843388|gb|EEQ33050.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
Length = 420
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/147 (21%), Positives = 64/147 (43%), Gaps = 16/147 (10%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR--P 166
+ + R ++ N ++YI +I +R+ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYNQQLYIIHRNGTIYSREYATLQALNRAIVSSPDPLPNIEFVFNTDDKVDS 169
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
V + P +K D W M PD+ +W W E + + + + ++
Sbjct: 170 VAQWGYARRPQDK----------DMWLM----PDFGYWSWPETKVGTMQEVQTKAEQEEQ 215
Query: 227 GRNWIDREPYAYWKGNPFVAETRRDLL 253
W + P +W+G E R L+
Sbjct: 216 TWTWPKKIPKLFWRGATMGLEVRDKLI 242
>gi|391340384|ref|XP_003744522.1| PREDICTED: O-glucosyltransferase rumi homolog [Metaseiulus
occidentalis]
Length = 418
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 62/133 (46%), Gaps = 16/133 (12%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
++R I DL P+ GI+R+ L RA + R ++ +V+ K + F GI
Sbjct: 67 HYRQIRRDLKPFS-EGISREDLNRAAKFG-VRYQIIQGRVFRQK-----ECMFPFRCSGI 119
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDW 201
L + +LPDLE + + D P P + P+F +S + DI++P W
Sbjct: 120 EHFLLRLAPKLPDLEFIVNVRDYP-------QSPVGREKLPVFSFSKVPKNYYDILYPAW 172
Query: 202 SFW-GWAEINIKP 213
+FW G I+I P
Sbjct: 173 TFWEGGPAISIYP 185
>gi|332025008|gb|EGI65195.1| KDEL motif-containing protein 1 [Acromyrmex echinatior]
Length = 495
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 85/189 (44%), Gaps = 23/189 (12%)
Query: 80 CPDYFRWIHEDLSPWK---VTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
C + IH DLSP+ I + +++R ++ + +L +N++Y Y Q +
Sbjct: 149 CQQNYTQIHHDLSPFSNIDFDKIRQSIIKRYDRPTSVSICHYVLKSNRIYRQCYGQYVGF 208
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD+E + D P++ P++ P+F + G T
Sbjct: 209 K-IFMDAILLSLTRKV--MLPDIEFFVNLGDWPLV-------PDSGPLYPIFSWCGSEST 258
Query: 194 MDIVFPDWSFWGWAEINIKPW-ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
DIV P + E +++ +L L NG W + +W+G E R DL
Sbjct: 259 KDIVMPTYDI---TESSLEAMGRVMLDTLSVQGNGLPWEHKTEQLFWRGRDARRE-RLDL 314
Query: 253 LTCNLSDKH 261
+ ++S KH
Sbjct: 315 I--DISRKH 321
>gi|242007535|ref|XP_002424595.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
humanus corporis]
gi|212508038|gb|EEB11857.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
humanus corporis]
Length = 501
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/170 (23%), Positives = 76/170 (44%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
C + I DL + K + + +LE+ + L ++ +N++Y Y + +
Sbjct: 154 CKKTYNQIVSDLESFPKVKFSEVLLKVLEKYSSQRSISLCHYVIKDNEIYRKCYGEYVDF 213
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD E + + D P+ N+ P P+F + G +T
Sbjct: 214 K-IFVDALLLSLTRKI--ELPDFEFIVNLGDWPL-------EDNSPSPLPIFSWCGSNFT 263
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
DI+ P + E ++ + R++ +GN G +W D+ +W+G
Sbjct: 264 SDIIMPTYDL---TEATLECMGRVSRDMLSVQGNTGASWNDKINKGFWRG 310
>gi|224043076|ref|XP_002195971.1| PREDICTED: KDEL motif-containing protein 1 [Taeniopygia guttata]
Length = 588
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 13/166 (7%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP I DL+ + + I +++ +R Q + NN+VYI Y + + R
Sbjct: 240 CPQTIPQIQRDLANFPIVDPDKIAKEIPQRFGQRQSLCHYTIKNNEVYIKTYGEHVGFR- 298
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK ++PD+E + D P+ + + P N P +F + G + D
Sbjct: 299 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKRKP---PQNLHP--IFSWCGSSESKD 351
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + + + SL + N G W D+ A+W+G
Sbjct: 352 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWDDKNTTAFWRG 396
>gi|345324378|ref|XP_003430815.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
1-like [Ornithorhynchus anatinus]
Length = 588
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 82/187 (43%), Gaps = 18/187 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DL+ + I ++ +R Q + NNKVYI + + + R
Sbjct: 239 CPASFPQIQRDLAHFPTVDPDKIAVEIPQRFGQRQSLCHYTVKNNKVYIKTHGEHVGFR- 297
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK ++PD+E + D P+ + + P+N P +F + G + D
Sbjct: 298 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK---SPDNLHP--IFSWCGSTDSKD 350
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT- 254
IV P + + + SL + N G W D+ A W+G ++R++ L
Sbjct: 351 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPAWEDKNTTAVWRGR----DSRKERLEL 405
Query: 255 CNLSDKH 261
LS KH
Sbjct: 406 VKLSRKH 412
>gi|443897123|dbj|GAC74465.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
Length = 495
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D
Sbjct: 174 SHARVLIHNNRVYLKYFQQSPFTRVNSALALLFQSVTGAREKLPDAEFCLSANDW----- 228
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
G K Y D W M PD+ F+ W E I + + E N W
Sbjct: 229 ----GSMGKFSLDRAPYLVDLWLM----PDYGFYSWPEPGIGSYTEHREKTLELENATPW 280
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ +W+G V R L ++ H WN
Sbjct: 281 SAKISKLFWRGAMSVGTADRKALLA-AAENHAWN 313
>gi|403171742|ref|XP_003330935.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169317|gb|EFP86516.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 499
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 67/166 (40%), Gaps = 25/166 (15%)
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
I+ L+RA Q +H R+++ N+VYI + R + I + + P LPD+E
Sbjct: 184 ISIQHLDRAYQYSHARVLIYRNRVYIKGFNGGPGLRTEAILNSIQEAVITSPELLPDVEF 243
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI--------VFPDWSFWGWAEIN 210
+ D P G +++ P W +D + PD+ F+ W E
Sbjct: 244 VIKTSDAP-------QGGDDEHP---------LWVLDRTKSQEEVWLMPDYGFYSWPEPK 287
Query: 211 IKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
+ + + E +W + A+W+G V + R LL +
Sbjct: 288 VGGMVEVRDKTAEREASLSWDSKISKAFWRGAILV-KLREQLLEVS 332
>gi|26334375|dbj|BAC30905.1| unnamed protein product [Mus musculus]
Length = 392
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 56/250 (22%), Positives = 107/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + +C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSQSCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
+++ + +R G+ + + RLPD+E++ + D P + P
Sbjct: 95 RLF-REDDCMFPSR----CSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A + P W+ +L W
Sbjct: 143 PTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEK 202
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 203 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 257
Query: 284 NLASQCAHRY 293
+L C +RY
Sbjct: 258 HLIDHCKYRY 267
>gi|426236645|ref|XP_004012278.1| PREDICTED: KDEL motif-containing protein 1 [Ovis aries]
Length = 502
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDL------SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP+ F IH DL P K+ + H+ L +NKVYI + + +
Sbjct: 153 CPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK ++PD+E + D P+ + + S P+ P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK--SSPHIH---PIFSWCGSTDS 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 KDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|358054730|dbj|GAA99656.1| hypothetical protein E5Q_06359 [Mixia osmundae IAM 14324]
Length = 437
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 48/222 (21%), Positives = 93/222 (41%), Gaps = 19/222 (8%)
Query: 78 STC----PDYFRWIHEDLSPWKV-TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
STC PD +R W+ GITR L+ A+ AH R+++ +N+VY+ Y+ I
Sbjct: 95 STCHGEFPDLYREAERAAQYWRHRGGITRAALDAADAQAHARVLIKDNQVYLTNYRGGIN 154
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
+R + + + + + LP +E + DD I P +F + +
Sbjct: 155 SRTLAALASLNEAVLTAVEELPAVEFVIQTDDSAPI--------AGAAPRWVFARTDEED 206
Query: 193 TMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
+ + + PD+ + W E + + + + W + P +W+G V R +
Sbjct: 207 ELALWLMPDFGHYAWPEPGVGSMAEVQAKASAFEAAQTWSSKIPKLFWRG-ALVNPLRDE 265
Query: 252 LLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
L+ + ++ W + DW + + + A CA++Y
Sbjct: 266 LIRLSDENRGSWGDAKAL-DW---GRLEGELRSPAQHCAYKY 303
>gi|49119606|gb|AAH73128.1| LOC443629 protein, partial [Xenopus laevis]
Length = 507
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 94/215 (43%), Gaps = 33/215 (15%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLER----ANQT 110
CP P+T Q T +CPD I +D P+ +TR + E A++
Sbjct: 143 CPEEDPQTWQE-----------TLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFADRG 191
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWG--ILQ-LLRKYPGRLPDLELMFDCDDRPV 167
A ++NN++ Y++S+ F ++ ILQ L RK RLPD E + D PV
Sbjct: 192 AIVHYTVLNNQI----YRRSMGRYTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPV 245
Query: 168 IRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
+ P GP P+ + G + DI+ P + ++ + L + +G+ G
Sbjct: 246 ENRKANDTP---GPLPMISWCGSSDSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTG 301
Query: 228 RNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKH 261
+W ++ +++G ++R + L +S KH
Sbjct: 302 PSWSNKTEQGFFRGR----DSREERLQLVQMSRKH 332
>gi|147905630|ref|NP_001085283.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus laevis]
gi|114107936|gb|AAI23308.1| LOC443629 protein [Xenopus laevis]
Length = 509
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 94/215 (43%), Gaps = 33/215 (15%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLER----ANQT 110
CP P+T Q T +CPD I +D P+ +TR + E A++
Sbjct: 145 CPEEDPQTWQE-----------TLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFADRG 193
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWG--ILQ-LLRKYPGRLPDLELMFDCDDRPV 167
A ++NN++ Y++S+ F ++ ILQ L RK RLPD E + D PV
Sbjct: 194 AIVHYTVLNNQI----YRRSMGRYTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPV 247
Query: 168 IRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
+ P GP P+ + G + DI+ P + ++ + L + +G+ G
Sbjct: 248 ENRKANDTP---GPLPMISWCGSSDSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTG 303
Query: 228 RNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKH 261
+W ++ +++G ++R + L +S KH
Sbjct: 304 PSWSNKTEQGFFRGR----DSREERLQLVQMSRKH 334
>gi|47211427|emb|CAF96437.1| unnamed protein product [Tetraodon nigroviridis]
Length = 456
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 75/166 (45%), Gaps = 13/166 (7%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLI----LVNNKVYIHKYKQSIQTRD 135
CPD F I DLS + R+ E + + + + +NKVY+ + + + R
Sbjct: 126 CPDSFPQIDTDLSVFSSVDPDRNAQEVPQRFGQRQSLCHYTVKDNKVYVKTFGEHVGFR- 184
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK RLPD+E + D P+ + + P ++ P+F + G T+D
Sbjct: 185 IFMDSILLSLTRKV--RLPDVEFFVNLGDWPLEKRK----PTDQ-IHPIFSWCGSNNTLD 237
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + + + SL + + G W ++ A+W+G
Sbjct: 238 IVMPTYDL-TESVLETMGRVSLDMMSVQASTGPPWPEKNATAFWRG 282
>gi|313241455|emb|CBY33708.1| unnamed protein product [Oikopleura dioica]
Length = 478
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 96/211 (45%), Gaps = 41/211 (19%)
Query: 49 KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN 108
K Q+++C N PK+ T +S C + F+ + + + I + + R++
Sbjct: 119 KLQSESC--NCPKSRATFKS--------AYQCTE-FKHLTSSFNKLQNKKINKKSIHRSS 167
Query: 109 QT----AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDD 164
+T + I+ +NK+Y + + + + F+ L LLRK LPD+E +F+ D
Sbjct: 168 ETLKGDSVIHYIIKDNKLYSKELSERLDFKR-FSDGIFLSLLRKV--NLPDIEFLFNVGD 224
Query: 165 RPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW-----SFWGWAEINIKPWESLLR 219
PV SR++ P+F + G + DIV P W + ++IN+ + L
Sbjct: 225 WPV--SREF---------PVFSWCGSEESSDIVVPTWDQIKTTLLSMSKINV---DILTM 270
Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRR 250
+L NG++W + P +++G E R
Sbjct: 271 QL----NGKSWQSKIPKGFFRGRDSSKERMR 297
>gi|157128435|ref|XP_001655120.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
gi|122067566|sp|Q16QY8.1|RUMI_AEDAE RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|108872609|gb|EAT36834.1| AAEL011121-PA [Aedes aegypti]
Length = 402
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 61/288 (21%), Positives = 128/288 (44%), Gaps = 39/288 (13%)
Query: 21 SRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTC 80
S + + + ++++P + P++ T ++ K T+ + QE+++ Y + C
Sbjct: 20 SDDDSGMCMAKEPCSEAPQQ-ETKVDLYK------ATDNKYVALIQEALASYEPCQQANC 72
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
+ + DL P+K GI+ M+ERA + +V++++Y K
Sbjct: 73 SCHADVLKTDLRPFK-GGISEQMVERARSYG-TKYQIVDHRLYRQK-----DCMFPARCS 125
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
G+ ++ LPD+EL+ +C D P I +R + + P+ +S +DI++P
Sbjct: 126 GVEHFIKPNLPHLPDMELIINCRDWPQI-NRHW----KQEKLPVLSFSKTDDYLDIMYPT 180
Query: 201 WSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG-------NPFVAE 247
W FW G I++ P W+ +K+ + W ++ A+++G +P V
Sbjct: 181 WGFWEGGPAISLYPTGLGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDERDPLVLL 240
Query: 248 TRR--DLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+RR +L+ + W + +D + + + Q+ L C ++Y
Sbjct: 241 SRRKPELVDAQYTKNQAWKS---PKDTL--NAKPAQEVRLEDHCQYKY 283
>gi|410906075|ref|XP_003966517.1| PREDICTED: protein O-glucosyltransferase 1-like [Takifugu rubripes]
Length = 367
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 47/199 (23%), Positives = 89/199 (44%), Gaps = 27/199 (13%)
Query: 68 SISDYSIPPTST-CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK 126
++S Y IP TS C + + +DL P+K I+ ++++ Q R + + ++ HK
Sbjct: 17 AVSGY-IPCTSANCSCHLSVLQDDLRPFKFK-ISEELMDATTQ----RGVGTHYQIIGHK 70
Query: 127 YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFR 186
+ G+ + + RLPD+E++ + D P + S P+F
Sbjct: 71 LYREQNCMFPARCSGVEHFILQLIDRLPDMEMVVNVRDYPQVPKWMESSL------PVFS 124
Query: 187 YSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYW 239
+S +DI++P W+FW G A I P W+ + +LK+ + W +E ++
Sbjct: 125 FSKTADYLDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRTDLKKSADQWPWKKKETKGFF 184
Query: 240 KG-------NPFVAETRRD 251
+G +P + +R D
Sbjct: 185 RGSRTSPERDPLILLSRED 203
>gi|302660285|ref|XP_003021823.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
gi|291185740|gb|EFE41205.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
Length = 420
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDKV-- 167
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
P+ ++ R D + PD+ +W W E + + + +
Sbjct: 168 -------------DPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214
Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W + P +W+G E R L+
Sbjct: 215 HTWAWSKKIPQLFWRGATMDLEVREKLI 242
>gi|148665570|gb|EDK97986.1| RIKEN cDNA 9630046K23, isoform CRA_b [Mus musculus]
Length = 372
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 56/250 (22%), Positives = 106/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 18 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 74
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
+++ + +R G+ + + RLPD+E++ + D P + P
Sbjct: 75 RLF-REDDCMFPSR----CSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PKWME 122
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A + P W+ +L W
Sbjct: 123 PTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEK 182
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 183 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 237
Query: 284 NLASQCAHRY 293
+L C +RY
Sbjct: 238 HLIDHCKYRY 247
>gi|351697696|gb|EHB00615.1| KTEL motif-containing protein 1 [Heterocephalus glaber]
Length = 392
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 55/254 (21%), Positives = 103/254 (40%), Gaps = 48/254 (18%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ H+++I
Sbjct: 38 QINRALENYEPCSSQNCSCYQGVIEEDLTPFR-GGISRKMMAEVVSRKLGTHYQII---- 92
Query: 121 KVYIHKYKQSIQTRDVFTI----WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
K+ + D G+ + + G+LPD+E++ + D P + P
Sbjct: 93 -------KKRLYREDDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------P 138
Query: 177 NNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGR 228
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 139 RWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQW 198
Query: 229 NWIDREPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRG 279
W + AY++G +P + +R++ L+ + W + ++D + K
Sbjct: 199 PWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPA 253
Query: 280 FQQSNLASQCAHRY 293
+ +L C ++Y
Sbjct: 254 AKDVHLVDHCKYKY 267
>gi|27369505|ref|NP_759012.1| protein O-glucosyltransferase 1 precursor [Mus musculus]
gi|110832796|sp|Q8BYB9.2|PGLT1_MOUSE RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
motif-containing protein 1; Flags: Precursor
gi|20071120|gb|AAH26809.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Mus musculus]
gi|26327463|dbj|BAC27475.1| unnamed protein product [Mus musculus]
gi|26331050|dbj|BAC29255.1| unnamed protein product [Mus musculus]
gi|26331242|dbj|BAC29351.1| unnamed protein product [Mus musculus]
gi|148665569|gb|EDK97985.1| RIKEN cDNA 9630046K23, isoform CRA_a [Mus musculus]
Length = 392
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 56/250 (22%), Positives = 106/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
+++ + +R G+ + + RLPD+E++ + D P + P
Sbjct: 95 RLF-REDDCMFPSR----CSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A + P W+ +L W
Sbjct: 143 PTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEK 202
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 203 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 257
Query: 284 NLASQCAHRY 293
+L C +RY
Sbjct: 258 HLIDHCKYRY 267
>gi|410897261|ref|XP_003962117.1| PREDICTED: KDEL motif-containing protein 1-like [Takifugu rubripes]
Length = 505
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 73/166 (43%), Gaps = 13/166 (7%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CPD F I DLS + I +++ +R Q + +NKVY+ + + + R
Sbjct: 157 CPDSFPQIDRDLSAFPNVDPDRIAQEIPQRFGQRQSLCHYTVKDNKVYVTTFGEHVGFR- 215
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FT +L L RK LPD+E + D P+ + + + P+F + G T D
Sbjct: 216 IFTDSILLSLTRKV--WLPDVEFFVNLGDWPLEKRKLTDKIH-----PIFSWCGSNNTQD 268
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + + + SL + N G W ++ A+W+G
Sbjct: 269 IVMPTYDL-TESVLETMGRVSLDMTSVQANTGPPWAEKNATAFWRG 313
>gi|354494153|ref|XP_003509203.1| PREDICTED: protein O-glucosyltransferase 1 [Cricetulus griseus]
Length = 401
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 105/250 (42%), Gaps = 40/250 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I
Sbjct: 47 QINRALENYEPCSSRNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII---K 102
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K + +R IL+++R RLPD+E++ + D P + P
Sbjct: 103 KRLFREDDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 151
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A + P W+ +L W
Sbjct: 152 PTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEK 211
Query: 233 REPYAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQS 283
+ AY++G +P + +R++ L+ + W + ++D + K +
Sbjct: 212 KNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDV 266
Query: 284 NLASQCAHRY 293
+L C ++Y
Sbjct: 267 HLIDHCKYKY 276
>gi|326483787|gb|EGE07797.1| DUF821 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 420
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--- 166
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
P+ ++ R D + PD+ +W W E + + + +
Sbjct: 167 ------------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214
Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W + P +W+G E R L+
Sbjct: 215 HTWAWPKKIPRLFWRGATMDLEVREKLI 242
>gi|326475589|gb|EGD99598.1| hypothetical protein TESG_06947 [Trichophyton tonsurans CBS 112818]
Length = 419
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 109 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--- 165
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
P+ ++ R D + PD+ +W W E + + + +
Sbjct: 166 ------------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 213
Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W + P +W+G E R L+
Sbjct: 214 HTWAWPKKIPRLFWRGATMDLEVREKLI 241
>gi|452843550|gb|EME45485.1| hypothetical protein DOTSEDRAFT_71260 [Dothistroma septosporum
NZE10]
Length = 456
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 31/137 (22%), Positives = 62/137 (45%), Gaps = 13/137 (9%)
Query: 114 RLILVNNKVYIHKYKQSI----QTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIR 169
R ++ +N++YI + + R V T+ + + + Y G+LP++E F D +
Sbjct: 119 RALVHDNQLYIIDAHGVVDHNHRPRTVATLHSLHRAISAYQGKLPNIEFTFTVHDAAL-- 176
Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRN 229
RD +G ++ W M PD+ WGW ++ ++ + L L+ +
Sbjct: 177 -RDPAGNETTWAYTRRKHQEKLWLM----PDFGLWGWPDVGLRSFAELQEVLEHEED--E 229
Query: 230 WIDREPYAYWKGNPFVA 246
++D+ P W+G+ V
Sbjct: 230 FVDKVPKLVWRGSVAVG 246
>gi|425781966|gb|EKV19900.1| hypothetical protein PDIG_00670 [Penicillium digitatum PHI26]
gi|425784005|gb|EKV21816.1| hypothetical protein PDIP_02930 [Penicillium digitatum Pd1]
Length = 436
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 60/137 (43%), Gaps = 14/137 (10%)
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYPGR--LPDL 156
+++ R A R I+ ++YI Y T R T+ + + L +P R LP +
Sbjct: 103 KELDSRTVDDAMVRGIIDRGELYIVDYAPMPMTASRARATLSSLHRALTAFPDRHLLPSI 162
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEINIKPWE 215
E +F +D + P P++ YS D T + PD+ +W W E++I P+
Sbjct: 163 EFIFTTED---------FAEDTTAPSPIWAYSKRDSDTSVWLMPDFGYWAWPEVHIGPYH 213
Query: 216 SLLRELKEGNNGRNWID 232
+ R + ++G D
Sbjct: 214 EVRRRIAAIDDGETAAD 230
>gi|170033685|ref|XP_001844707.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
gi|167874675|gb|EDS38058.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
Length = 495
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 81/189 (42%), Gaps = 18/189 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP I+ DL P++ + ++ + ++ +N++Y Y +
Sbjct: 142 CPSEDVQINSDLIPFRAVNFSSLRPRIIQQYDKPGSVSLCNYVVKDNQIYRTCYGRYTGF 201
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ ++ +L L RK LPD+EL + D P++ + GP P+F + G T
Sbjct: 202 K-MYMDAILLSLARK--TLLPDMELFVNLGDWPLVTKGGHR--RTTGPYPIFSWCGSEDT 256
Query: 194 MDIVFPDWSFWGWAEINIKPWESL-LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
DIV P + E +++ + L L G W ++ P A+W+G E R DL
Sbjct: 257 FDIVMPTYDL---VEASLEAMSRVSLDMLSVQRKGVPWEEKVPKAFWRGRDACRE-RLDL 312
Query: 253 LTCNLSDKH 261
+ LS +H
Sbjct: 313 V--GLSQQH 319
>gi|71020735|ref|XP_760598.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
gi|46100486|gb|EAK85719.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
Length = 488
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 65/154 (42%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D +
Sbjct: 167 SHARVVIHNNRVYLKYFQQSAFTRINSALALLFQSVVASREKLPDAEFCLSANDWGSMGK 226
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+S N P L+ D W M PD+ F+ W E I + + W
Sbjct: 227 --FSLDRN---PSLY----DVWLM----PDYGFYSWPEPGIGSYTEHREKTLAIEQSTPW 273
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ +W+G+ V R L ++ H+WN
Sbjct: 274 DSKIAKLFWRGSMGVGTADRKALLA-AAENHEWN 306
>gi|281332155|ref|NP_001094122.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 precursor [Rattus norvegicus]
gi|149060499|gb|EDM11213.1| similar to RIKEN cDNA 9630046K23 [Rattus norvegicus]
Length = 392
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 52/247 (21%), Positives = 102/247 (41%), Gaps = 34/247 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVY 123
Q ++ +Y + C Y I EDL+P++ GI+R M+ + R + + ++
Sbjct: 38 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVR----RRLGTHYQII 92
Query: 124 IHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP- 182
H+ + G+ + + RLPD+E++ + D P + P P
Sbjct: 93 KHRLFREDDCMFPSRCSGVEHFILEVIRRLPDMEMVINVRDYPQV-------PKWMEPTI 145
Query: 183 PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREP 235
P+F +S DI++P W+FW G A + P W+ +L W +
Sbjct: 146 PVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNS 205
Query: 236 YAYWKG-------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLA 286
AY++G +P + +R++ L+ + W + ++D + K + +L
Sbjct: 206 TAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDVHLI 260
Query: 287 SQCAHRY 293
C ++Y
Sbjct: 261 DHCKYKY 267
>gi|327297120|ref|XP_003233254.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
gi|326464560|gb|EGD90013.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
Length = 420
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--- 166
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
P+ ++ R D + PD+ +W W E + + + +
Sbjct: 167 ------------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214
Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W + P +W+G E R L+
Sbjct: 215 HTWAWPKKIPQLFWRGATMDLEVRERLI 242
>gi|315044371|ref|XP_003171561.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311343904|gb|EFR03107.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 422
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 28/145 (19%), Positives = 64/145 (44%), Gaps = 12/145 (8%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R+ T+ + + + P LP++E +F+ DD+ +
Sbjct: 112 KNGYIRAMIYDQQLYVIAVNGTIYSREYATLHALHRAIVSSPDPLPNIEFVFNTDDK--V 169
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
S G + + W M PD+ +W W E + + + + ++
Sbjct: 170 DSVAQWGYARR------EEDTNLWLM----PDFGYWSWPETKVGTMQEVRMKAEQEEQTW 219
Query: 229 NWIDREPYAYWKGNPFVAETRRDLL 253
W ++ P +W+G E R L+
Sbjct: 220 TWSEKIPKLFWRGATMDLEVREKLI 244
>gi|358058341|dbj|GAA95860.1| hypothetical protein E5Q_02517 [Mixia osmundae IAM 14324]
Length = 492
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 32/159 (20%), Positives = 72/159 (45%), Gaps = 11/159 (6%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GI L+ + R+I+ +N+VY+ K + ++R + I + + +PD+
Sbjct: 178 GIQERYLDTTEHGMNARVIIKSNRVYLRKPFNAGAKSRTQALLAAIEEAVLSSIEPIPDV 237
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWE 215
E + + +DR D + + P+ S + + + PD+ F+ W E ++ +
Sbjct: 238 EFVINSEDRV-----DAATAHT----PILGMSRKKQQGHVWLIPDFRFYAWPEPHVGTYP 288
Query: 216 SLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
+ ++ + W + +W+GNP + R++L+T
Sbjct: 289 DVQDQIYALEATQQWHHKRAKLFWRGNPPLHPLRQELMT 327
>gi|115401698|ref|XP_001216437.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190378|gb|EAU32078.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 437
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 17/139 (12%)
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYPGR--LPDL 156
+D+ A R I+ N ++YI Y T R T+ + + L +P R LP +
Sbjct: 106 QDLDGLAVDDGMVRGIIENGELYIVDYGAMPFTFSRAKATLHSLQRALAAFPDRHLLPSV 165
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWE 215
E +F DD YS G P++ YS + + PD+ +W W E+ I P++
Sbjct: 166 EFVFTTDD--------YS----TGVGPIWSYSKREEDESVWLMPDFGYWSWPEVKIGPYK 213
Query: 216 SLLRELKEGNNGRNWIDRE 234
+ R + +NG + E
Sbjct: 214 DIRRRIAAVDNGETTVTGE 232
>gi|426245590|ref|XP_004016593.1| PREDICTED: KDEL motif-containing protein 2 [Ovis aries]
Length = 557
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CPD I +D + + + + ++E + R +V+ + H Y++S+
Sbjct: 203 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 262
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 263 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 317
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 318 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 365
>gi|358058116|dbj|GAA96095.1| hypothetical protein E5Q_02756 [Mixia osmundae IAM 14324]
Length = 486
Score = 45.4 bits (106), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 34/160 (21%), Positives = 70/160 (43%), Gaps = 19/160 (11%)
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
+ G+T +E + + H RL++ + ++++ +++ + Q+R + I + +PD
Sbjct: 170 LGGVTLSDVEHSGEIWHARLMIKDGRLFVKRFRPAYQSRCQAVMAAIHDAIITSLEPVPD 229
Query: 156 LELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIK 212
+E + DD + P L R +D+ + D+ F W E IK
Sbjct: 230 IEFVIGLDD---------TAPFEPATWGLAR------RLDLPAWLVIDYGFNAWPEPMIK 274
Query: 213 PWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
W + L ++K N W + +W+G +++E R L
Sbjct: 275 GWHTFLADVKSVNAQMTWRQKAKKLFWRG-VYLSEYREQL 313
>gi|148664476|gb|EDK96892.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_b [Mus musculus]
Length = 432
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|327267957|ref|XP_003218765.1| PREDICTED: KDEL motif-containing protein 1-like [Anolis
carolinensis]
Length = 502
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 73/167 (43%), Gaps = 13/167 (7%)
Query: 79 TCPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTR 134
+CP IH DL+ + I +++ +R Q + +NKVYI + + + R
Sbjct: 153 SCPQSIPQIHRDLAHFPAVDPEKIAKEIPQRFGQRQSLCHYTIKDNKVYIKTHGEHVGFR 212
Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
+F +L L RK ++PDLE + D P+ + + + + P+F + G +
Sbjct: 213 -IFMDAILLSLTRKV--KMPDLEFFVNLGDWPLEKKK-----SPQDIHPIFSWCGSNDSK 264
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
DIV P + + + SL + N G W ++ A W+G
Sbjct: 265 DIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWEEKNATAIWRG 310
>gi|149730404|ref|XP_001493278.1| PREDICTED: KDEL motif-containing protein 1 [Equus caballus]
Length = 502
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 18/188 (9%)
Query: 79 TCPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTR 134
+CP+ I DL+ + I ++ +R Q L +NKVYI + + + R
Sbjct: 152 SCPETITQIQRDLAHFPTVDPEKIATEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
+F +L L RK ++PD E + D P+ + + S + P+F + G +
Sbjct: 212 -IFMDAILLSLTRKV--KMPDAEFFVNLGDWPLEKKKSSSHIH-----PIFSWCGSTDSK 263
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIV P + + + SL +GN G W + A W+G ++R++ L
Sbjct: 264 DIVMPTYDLTD-SVLETMGRVSLDMMSVQGNTGPPWESKNSTAVWRGR----DSRKERLE 318
Query: 255 -CNLSDKH 261
LS KH
Sbjct: 319 LVKLSRKH 326
>gi|330919750|ref|XP_003298742.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
gi|311327917|gb|EFQ93162.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
Length = 410
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 57/115 (49%), Gaps = 14/115 (12%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+PD+E DD P RS+D + G K P +R D W M P++++W W
Sbjct: 134 VPDIEFALSLDDLPR-RSKDKGTFFGYTRKEGPE-YR---DIWMM----PNYAYWSWNYT 184
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ W ++ +E+++ W ++P W+G +A+ R++L+ +S+ W+
Sbjct: 185 HAPSWNTIRKEIQQKETEVPWSKKDPRVVWRGKVKMAKLRQELI--RVSEGQSWS 237
>gi|313232413|emb|CBY24081.1| unnamed protein product [Oikopleura dioica]
Length = 536
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 92/232 (39%), Gaps = 45/232 (19%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE----------RANQTAHFRLILVNNKVYIHKYKQ 129
CP F + EDL+ + + R + TAH +++ + K+++ +
Sbjct: 172 CPANFEQLDEDLAQFDEIDLDRLYEDGKKKKWAAGGHGEATAH--VVIKDQKLFVEDFGT 229
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
+ R F L LLRK +LPD E +F+ D P+ N P P+ + G
Sbjct: 230 IMGFRG-FMSSMFLSLLRKV--KLPDAEFIFNLGDWPL-------EENLTDPQPILTWCG 279
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELK-----EGNNGR--NWIDREPYAYWKG- 241
T DI P W + +L RE K E +G +W ++ Y++G
Sbjct: 280 SSNTSDIAVPTW------DQTKNTRHALFRERKDIQYVEQISGEVVSWNEKIERGYFRGR 333
Query: 242 --NPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAH 291
NP +R L +++ D +ARL W L +K+G Q H
Sbjct: 334 DSNP----SRLKLCELSMAHPEDIDARL---TWNLHNKKGQDPKKYGEQVKH 378
>gi|327268710|ref|XP_003219139.1| PREDICTED: protein O-glucosyltransferase 1-like [Anolis
carolinensis]
Length = 391
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 87/196 (44%), Gaps = 26/196 (13%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVN 119
Q + ++ +Y T C Y + +DL P++ GI+ ++L A H+++I
Sbjct: 36 GQIERAVENYRPCVTENCRCYQKVREQDLDPFQ-EGISEELLSEAISRRLGTHYQII--A 92
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y ++ R G+ + + LPD+E++ + D P + P
Sbjct: 93 KKLY-REHDCMFPAR----CSGVEHFILEIIDELPDMEMVINVRDYPQV-------PKFM 140
Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P P+F +S DI++P W+FW G A I P W+ + ++LK + W
Sbjct: 141 KPKVPIFSFSKTSEYYDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRQDLKSSSEKWPWR 200
Query: 232 DREPYAYWKGNPFVAE 247
+ A+++G+ AE
Sbjct: 201 AKISKAFFRGSRTSAE 216
>gi|452004927|gb|EMD97383.1| hypothetical protein COCHEDRAFT_1190251 [Cochliobolus
heterostrophus C5]
Length = 432
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 55/115 (47%), Gaps = 14/115 (12%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E DD P RSR + G K P D W M P++++W W
Sbjct: 156 VPNIEFSLSLDDLPR-RSRKEGTFFGYTRKDTPEY----KDIWMM----PNYAYWAWNYT 206
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ W S+ +E+++ W ++P W+G +A+ R++L+ +S+ DW+
Sbjct: 207 HAPSWNSIRKEIEQKEKELPWHKKDPRVVWRGKIKMAKLRKELV--KVSEGKDWS 259
>gi|12841643|dbj|BAB25294.1| unnamed protein product [Mus musculus]
Length = 502
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|56118414|ref|NP_001008114.1| MGC89395 protein precursor [Xenopus (Silurana) tropicalis]
gi|51703984|gb|AAH81318.1| MGC89395 protein [Xenopus (Silurana) tropicalis]
Length = 385
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 108/245 (44%), Gaps = 28/245 (11%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+ + + +S P + Y+R + EDL+P+ +GI+RD++++ R + + ++
Sbjct: 30 NHIRRAAQSHSTPDSQNGTFYYRVLQEDLAPFG-SGISRDLMQKVLS----RKLGTHYQI 84
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
H+ + + G+ L + LPD++L+ + D P + P P
Sbjct: 85 INHRLYREEECMFPARCSGVEHFLLELLPDLPDMDLVVNVRDYPQV-------PRWMDPV 137
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P W+FW G A I P W+ + ELK+ + W +
Sbjct: 138 IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREELKKAADLWPWEKKI 197
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRG------FQQSNLASQ 288
P Y++G+ E RD L + D Y ++ +S+R ++ L
Sbjct: 198 PKGYFRGSRTSPE--RDPLILLSRESPDLVDAEYTKNQAWKSERDTLGRPPAKEVPLVDH 255
Query: 289 CAHRY 293
CA+RY
Sbjct: 256 CAYRY 260
>gi|194744002|ref|XP_001954487.1| GF18287 [Drosophila ananassae]
gi|190627524|gb|EDV43048.1| GF18287 [Drosophila ananassae]
Length = 382
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 69/161 (42%), Gaps = 23/161 (14%)
Query: 55 CPTNY--PKTSQTQ-------ESISDY--SIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
CP N P+++ T E+++DY P C + I +DL P+ GI++DM
Sbjct: 5 CPANLLSPRSNSTLDLVPNILEALADYRPCEPGDPLCLCHAATITKDLEPYSDKGISQDM 64
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
+ ++ + ++ ++ V + L + G LPDLE + +
Sbjct: 65 ISQSKRQGTLYKVIRRRIFRQEHCSHPLRCSSVEDV------LLEIAGDLPDLEFVLNVC 118
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
D P + SG + P+F +S +DI+ P WSFW
Sbjct: 119 DWPQVPF--LSGLSG----PVFSHSTTALHLDIMCPAWSFW 153
>gi|12963767|ref|NP_076134.1| KDEL motif-containing protein 1 precursor [Mus musculus]
gi|81881896|sp|Q9JHP7.1|KDEL1_MOUSE RecName: Full=KDEL motif-containing protein 1; AltName:
Full=Endoplasmic reticulum resident protein 58; Short=ER
protein 58; Short=ERp58; Flags: Precursor
gi|9621686|emb|CAC00650.1| ER protein 58 [Mus musculus]
gi|26331010|dbj|BAC29235.1| unnamed protein product [Mus musculus]
gi|74213118|dbj|BAE41698.1| unnamed protein product [Mus musculus]
gi|148664475|gb|EDK96891.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_a [Mus musculus]
Length = 502
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|82658302|ref|NP_001032511.1| KTEL motif-containing protein 1 precursor [Danio rerio]
gi|81097746|gb|AAI09459.1| Zgc:123318 [Danio rerio]
Length = 389
Score = 44.7 bits (104), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 62/271 (22%), Positives = 110/271 (40%), Gaps = 45/271 (16%)
Query: 48 VKNQTQTCPTNYPKTSQT-----QESISDYSIPPTSTCPDYFRWIHEDLSPWK---VTGI 99
++ + C ++ K QT E+ +Y C Y + +DL P+K G+
Sbjct: 14 LQTGVEFCVSDNAKQWQTFIRKITEATKNYQPCSQDNCTCYQSVLKDDLRPFKNGISEGL 73
Query: 100 TRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
D + R T H+++I + K+Y +QS + G+ + K RLPDLE++
Sbjct: 74 MADTVNRGVGT-HYQII--SKKLY---REQSCMFPARCS--GVEHFILKVIDRLPDLEVV 125
Query: 160 FDCDDRPVIRSRDYSGPNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP--- 213
+ D P + P P P+ +S + DI++P W+FW G A I P
Sbjct: 126 INVRDYPQV-------PGWIQPVLPVLSFSKTKDYQDIMYPAWTFWEGGPAVWPIYPTGL 178
Query: 214 --WESLLRELKEGNNGRNWIDREPYAYWKG-------NPFVAETRR--DLLTCNLSDKHD 262
W+ + +LK W + P +++G +P + +R DL+ +
Sbjct: 179 GRWDLMRDDLKRSVEQWPWKKKSPKGFFRGSRTSSERDPLILLSRAAPDLVDAEYTKNQA 238
Query: 263 WNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W + D K ++ L C ++Y
Sbjct: 239 WKS-----DKDTLGKPPAKEVTLVDHCEYKY 264
>gi|344257139|gb|EGW13243.1| KTEL motif-containing protein 1 [Cricetulus griseus]
Length = 333
Score = 44.3 bits (103), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 66/145 (45%), Gaps = 19/145 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I
Sbjct: 52 QINRALENYEPCSSRNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII---K 107
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K + +R IL+++R RLPD+E++ + D P + P
Sbjct: 108 KRLFREDDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 156
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW 204
P P+F +S DI++P W+FW
Sbjct: 157 PTIPVFSFSKTSEYHDIMYPAWTFW 181
>gi|145250763|ref|XP_001396895.1| hypothetical protein ANI_1_1376134 [Aspergillus niger CBS 513.88]
gi|134082418|emb|CAK42432.1| unnamed protein product [Aspergillus niger]
Length = 442
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 62/124 (50%), Gaps = 17/124 (13%)
Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
R I+ + ++Y+ + + TR T+ + + L +P R LPD+E V+
Sbjct: 125 RGIVHHGELYVVDFGAMPATFTRGEATLHALHRALASFPDRSSLPDIEF--------VLT 176
Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
+ DYS N+GP ++ YS ++ + PD+ +W W E+ I P++ + R + ++G
Sbjct: 177 TEDYS--TNEGP--VWSYSKQDEQENVWLMPDFGYWSWPEVKIGPYKDIRRRIAAVDDGE 232
Query: 229 NWID 232
ID
Sbjct: 233 VTID 236
>gi|390342113|ref|XP_779911.3| PREDICTED: protein O-glucosyltransferase 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 391
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 104/249 (41%), Gaps = 30/249 (12%)
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV-TGITRDMLERANQTAHFRLILV 118
P S +E+ S Y T C + + EDLS W+ GITR + +A + +
Sbjct: 35 PYLSLIEEATSSYEACVTDDCSCHAGVMDEDLSVWEERGGITRADVVKAESRGTLYQV-I 93
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N+++Y + K R IL++++K LPD+E + + D P +S Y+
Sbjct: 94 NHRLY-REEKCMFPARCNGVEHFILRIIKK----LPDMEFVMNVRDWP--QSGKYT---- 142
Query: 179 KGPPPLFRYSG-DRWTMDIVFPDWSFW-GWAEI------NIKPWESLLRELKEGNNGRNW 230
P P+ +S DI++P W+FW G + + W+ + + + W
Sbjct: 143 -DPIPVLSFSKVQSQHYDIMYPAWTFWEGGPAVWPLFPTGLGRWDLFRESIDKESQKLPW 201
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRG------FQQSN 284
+E A+++G+ AE RD L D D Y ++ +S ++
Sbjct: 202 DTKEDKAFFRGSRTTAE--RDPLVLLSRDDPDLVDASYTKNQAWKSDADTLHMPPAKEMT 259
Query: 285 LASQCAHRY 293
L C +RY
Sbjct: 260 LEDHCKYRY 268
>gi|350636309|gb|EHA24669.1| hypothetical protein ASPNIDRAFT_182202 [Aspergillus niger ATCC
1015]
Length = 442
Score = 44.3 bits (103), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 62/124 (50%), Gaps = 17/124 (13%)
Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
R I+ + ++Y+ + + TR T+ + + L +P R LPD+E V+
Sbjct: 125 RGIVHHGELYVVDFGAMPATFTRGKATLHALHRALASFPDRSSLPDIEF--------VLT 176
Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
+ DYS N+GP ++ YS ++ + PD+ +W W E+ I P++ + R + ++G
Sbjct: 177 TEDYS--TNEGP--VWSYSKRDEQQNVWLMPDFGYWSWPEVKIGPYKDIRRRIAAVDDGE 232
Query: 229 NWID 232
ID
Sbjct: 233 VTID 236
>gi|26343369|dbj|BAC35341.1| unnamed protein product [Mus musculus]
Length = 432
Score = 44.3 bits (103), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 80/187 (42%), Gaps = 18/187 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
C + I +DL+ + I ++ +R Q L +NKVYI + + + R
Sbjct: 153 CSETISQIQKDLAHFPTVDPEEIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK R+PD+E + D P+ + + +N P+F + G + D
Sbjct: 212 IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTESRD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT- 254
IV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 265 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERLEL 319
Query: 255 CNLSDKH 261
LS KH
Sbjct: 320 VKLSRKH 326
>gi|213514888|ref|NP_001133941.1| KDEL motif-containing protein 2 precursor [Salmo salar]
gi|209155896|gb|ACI34180.1| KDEL motif-containing protein 2 precursor [Salmo salar]
Length = 524
Score = 44.3 bits (103), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 48/213 (22%), Positives = 91/213 (42%), Gaps = 14/213 (6%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLER----ANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
CP I D + + + R + E +N+ ++NN+VY + +
Sbjct: 168 CPAEEPQIQRDFNAFPSIDLQRLLQEVPRRFSNRGGLIHYTVINNQVYRRSLGKYTDFK- 226
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPV-IRSRDYSGPNNKGPPPLFRYSGDRWTM 194
+F+ +L L RK RLPD+E + D P+ R D ++ GP P+ + G T
Sbjct: 227 MFSDEMLLSLARKV--RLPDVEFYINVGDWPMETRKAD----DDPGPVPIISWCGSTDTR 280
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DI+ P + ++ + L + +GN G W ++ A+++G E R L+T
Sbjct: 281 DIILPTYDITHSTLETMRGVSNDLLSV-QGNTGPPWANKMEQAFFRGRD-SREERLHLVT 338
Query: 255 CNLSDKHDWNARLYVQDWILESKRGFQQSNLAS 287
+ + +A + + E ++ ++NL
Sbjct: 339 LSKKNPELLDAGITGWFFFREREKDLGKANLVG 371
>gi|26348271|dbj|BAC37775.1| unnamed protein product [Mus musculus]
Length = 254
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 36/145 (24%), Positives = 65/145 (44%), Gaps = 19/145 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
+++ + G+ + + RLPD+E++ + D P + P
Sbjct: 95 RLF-----REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW 204
P P+F +S DI++P W+FW
Sbjct: 143 PTIPVFSFSKTSEYHDIMYPAWTFW 167
>gi|345561463|gb|EGX44552.1| hypothetical protein AOL_s00188g220 [Arthrobotrys oligospora ATCC
24927]
Length = 437
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 43/205 (20%), Positives = 79/205 (38%), Gaps = 30/205 (14%)
Query: 99 ITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
IT D LE +A +FR I+ N ++Y+ K +R T+ + + + P LP++
Sbjct: 115 ITLDELESVTHSAFGYFRGIIYNQELYVIKMLDPNFSRGFATLMAMHRAIVTSPEPLPNI 174
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT--------MDIVFPDWSFWGWAE 208
E + + DY F S WT ++ + PD+ +W W E
Sbjct: 175 EF--------TLNTADYID---------FEQSAATWTYARRSNETVNWLMPDFGYWSWPE 217
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
+ + + + + + W + P W+G R LL N + W A +
Sbjct: 218 PKVGSYNEVRLKARLADEAIPWEKKIPKIVWRGATLKLPVRLALL--NQTKGAAW-ADVK 274
Query: 269 VQDWILESKRGFQQSNLASQCAHRY 293
DW + +++ C +R+
Sbjct: 275 ALDWQSAESKEKNLLSMSDHCHYRF 299
>gi|451855485|gb|EMD68777.1| hypothetical protein COCSADRAFT_187637 [Cochliobolus sativus
ND90Pr]
Length = 432
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 12/114 (10%)
Query: 153 LPDLELMFDCDDRPVIRSRD--YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEIN 210
+P++E DD P +D + G K P +R D W M P++++W W +
Sbjct: 156 IPNIEFSLSLDDLPRRSRKDGTFFGYTRKDTPE-YR---DIWMM----PNYAYWAWNYTH 207
Query: 211 IKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
W S+ +E+++ W ++P W+G +A+ R++L+ +S+ +W+
Sbjct: 208 APSWNSIRKEIEQKEKDLPWDKKDPRVVWRGKIKMAKLRKELV--KVSEGKEWS 259
>gi|26348175|dbj|BAC37727.1| unnamed protein product [Mus musculus]
Length = 502
Score = 43.9 bits (102), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 75/188 (39%), Gaps = 20/188 (10%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G + L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRD---SHKERLE 318
Query: 254 TCNLSDKH 261
LS KH
Sbjct: 319 LVKLSRKH 326
>gi|195443900|ref|XP_002069627.1| GK11623 [Drosophila willistoni]
gi|194165712|gb|EDW80613.1| GK11623 [Drosophila willistoni]
Length = 383
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 59/130 (45%), Gaps = 12/130 (9%)
Query: 75 PPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
P S C Y I +L P+ TGIT+ M++++ + + + +++Y +S
Sbjct: 45 PCESECECYALAIARNLRPYAGTGITKPMIDQSRRFGTLYKV-IGSRLY-----RSDNCA 98
Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
+ +LL LPDLE + + D P I SG + P+F YS +
Sbjct: 99 YPSRCASVEELLLNIVRDLPDLEFVLNVRDWPQIHF--LSGLSG----PVFSYSSTDNFL 152
Query: 195 DIVFPDWSFW 204
DI+ P WSFW
Sbjct: 153 DIMCPAWSFW 162
>gi|255088471|ref|XP_002506158.1| predicted protein [Micromonas sp. RCC299]
gi|226521429|gb|ACO67416.1| predicted protein [Micromonas sp. RCC299]
Length = 658
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 101/253 (39%), Gaps = 43/253 (16%)
Query: 81 PDYFRW---IHEDLSPW-KVTGITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQTR 134
P+ F W ++ + P+ K++ D E + + FR ++ ++++ + R
Sbjct: 197 PEIFDWSPFVYRAMFPYNKISKADVDTAELVARGSVNSFRAQIIGGRLFVKDIRALEFAR 256
Query: 135 DVFTIWGI--LQLLRKYPGRLPDLELMFDCDDRPVI----------RSRDYSG-PNNKGP 181
D W I L+ +R+ LPD++ +F+ D P++ R G N + P
Sbjct: 257 DYAPSWKITLLETMRRR-RDLPDIDAVFNEGDYPIVLLPNDGAHAQRLYGREGMSNGQKP 315
Query: 182 PPLFRYSGD-RWTMDIVFPDWSFW-----GWAEINIKPWESLLRELKEGNNGRNWIDREP 235
PPLF + + + T D+ FPD+SF G ++ W L E + D+ P
Sbjct: 316 PPLFSPTTNVQMTRDVPFPDFSFSPPGVKGADRLSTTRWSVAHGRLLEAGAKIPFEDKLP 375
Query: 236 YAYWKGNPFVAETRRDLLTCNLSD-----------KHDWNARLYVQDWILESKRGFQQSN 284
A + GN AE R+ L S+ K R VQ L K G Q
Sbjct: 376 LAAFTGN-TQAEPRQRLAEVARSNPDSVFVNQVFKKSPTGERSCVQ-LGLADKGGLQADK 433
Query: 285 LA----SQCAHRY 293
A C +RY
Sbjct: 434 CALSFEEMCRYRY 446
>gi|47216704|emb|CAG00978.1| unnamed protein product [Tetraodon nigroviridis]
Length = 392
Score = 43.5 bits (101), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 79/181 (43%), Gaps = 19/181 (10%)
Query: 56 PTNYPKTSQTQESISDY---SIPPTS-TCPDYFRWIHEDLSPWKVTGITRDMLERANQTA 111
P + + + +++ISD IP +S C + I +DL P+K I+ +++ Q
Sbjct: 22 PASGKRWKKIKDNISDAVKGHIPCSSDNCSCHLSVIQDDLRPFK-GKISENLMAATIQ-- 78
Query: 112 HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSR 171
R + + ++ HK + G+ + + GRLPD+E++ + D P +
Sbjct: 79 --RGVGTHYQIIGHKLYREENCMFPARCSGVEHFILQLIGRLPDMEMVVNVRDYPQV--- 133
Query: 172 DYSGPN-NKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
P P+F +S DI++P W+FW + PW+ +E + G W
Sbjct: 134 ----PKWVDSLLPVFSFSKTADYQDIMYPAWTFWEGGPADQWPWKQ--KETRGFFRGSRW 187
Query: 231 I 231
+
Sbjct: 188 V 188
>gi|390332934|ref|XP_783946.3| PREDICTED: KDEL motif-containing protein 1-like [Strongylocentrotus
purpuratus]
Length = 512
Score = 43.5 bits (101), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 92/211 (43%), Gaps = 31/211 (14%)
Query: 46 NC---VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
NC V +Q Q Y ++ ++ +D+SI P+ +H+D+
Sbjct: 146 NCDCPVDDQDQWRSDMYCRSGVHPQTQTDFSIFPSID----LGTLHQDV---------ES 192
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
+ + H+ +I NNKVY K SI +F+ +L L RK R+PD+E +
Sbjct: 193 RFAKHHSLCHYSII--NNKVY-RKTLGSIVGFKMFSDAFLLSLTRKV--RVPDVEFFINL 247
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D P + RD P ++ P P+ + G T DIV P + + + SL
Sbjct: 248 GDWP-LEKRD---PEDE-PLPILSWCGSTDTRDIVLPTYDI-TESTLETMGRVSLDMMSV 301
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ N G W ++ A+W+G ++RR+ L
Sbjct: 302 QANTGPKWENKTEKAFWRGR----DSRRERL 328
>gi|313219907|emb|CBY30822.1| unnamed protein product [Oikopleura dioica]
Length = 536
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 70/307 (22%), Positives = 118/307 (38%), Gaps = 60/307 (19%)
Query: 5 FAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQ 64
F F + + + + N +I V K + K ++ P + +++ CP S
Sbjct: 112 FKFDGSIFFRYRMRGSAPNGIIINVKYKKKQVFRKILSEP---IYDESCYCP------SS 162
Query: 65 TQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE----------RANQTAHFR 114
E S++ CP F + EDL+ + + R + TAH
Sbjct: 163 LSEFYSEFD------CPANFEQLDEDLAQFDEIDLDRLYEDGKKKKWAAGGHGEATAH-- 214
Query: 115 LILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
+++ + K+++ + + R F L LLRK +LPD E +F+ D P+
Sbjct: 215 VVIKDQKLFVEDFGTIMGFRG-FMSSMFLSLLRKV--KLPDAEFIFNLGDWPL------- 264
Query: 175 GPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK-----EGNNGRN 229
N P P+ + G T DI P W + +L RE K E +G
Sbjct: 265 EENLTDPQPILTWCGSSNTSDIAVPTW------DQTKNTRHALFRERKDIQYVEQISGEV 318
Query: 230 --WIDREPYAYWKG---NPFVAETRRDLLTCNLSDKHDWNARLYVQDWILESKRGFQQSN 284
W ++ Y++G NP +R L +++ D +ARL W L +K+G
Sbjct: 319 VPWNEKIERGYFRGRDSNP----SRLKLCELSMAHPEDIDARL---TWNLHNKKGQDPKK 371
Query: 285 LASQCAH 291
Q H
Sbjct: 372 YGEQVKH 378
>gi|270007100|gb|EFA03548.1| hypothetical protein TcasGA2_TC013552 [Tribolium castaneum]
Length = 399
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 56/225 (24%), Positives = 101/225 (44%), Gaps = 42/225 (18%)
Query: 47 CVKNQTQTCPTN----YPKTSQTQ---------ESISDYSIPPTSTCPDYFRWIHEDLSP 93
C+K ++ C + Y K + T+ + +YS + C Y I +DL
Sbjct: 23 CLKENSEQCNASKVNMYSKEANTKYLKYLDLIKRAKENYSPCDNTKCGCYSSQISDDLKI 82
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLLRK 148
+K GIT ++++ +T + ++++K+Y RD ++ GI L K
Sbjct: 83 FK-KGITPQLIDKV-KTKGTKYQIIDHKLY----------RDKNCMFPARCAGIEHFLLK 130
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWA 207
+LPD+EL+ + D P I +DY P+F +S DI++P W+FW G
Sbjct: 131 LLPKLPDMELIINTRDWPQIH-KDYGVFG-----PVFSFSKTSDYSDIMYPAWAFWEGGP 184
Query: 208 EINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
I++ P W++ L + N W ++ P +++G+ AE
Sbjct: 185 AISLYPRGIGRWDTHRDLLGKKGNETLWDEKIPKGFFRGSRTSAE 229
>gi|60688257|gb|AAH91408.1| Ktelc1 protein [Rattus norvegicus]
Length = 349
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 51/241 (21%), Positives = 99/241 (41%), Gaps = 34/241 (14%)
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
+Y + C Y I EDL+P++ GI+R M+ + R + + ++ H+ +
Sbjct: 1 ENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVR----RRLGTHYQIIKHRLFR 55
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLFRYS 188
G+ + + RLPD+E++ + D P + P P P+F +S
Sbjct: 56 EDDCMFPSRCSGVEHFILEVIRRLPDMEMVINVRDYPQV-------PKWMEPTIPVFSFS 108
Query: 189 GDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG 241
DI++P W+FW G A + P W+ +L W + AY++G
Sbjct: 109 KTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRG 168
Query: 242 -------NPFVAETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHR 292
+P + +R++ L+ + W + ++D + K + +L C ++
Sbjct: 169 SRTSPERDPLILLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDVHLIDHCKYK 223
Query: 293 Y 293
Y
Sbjct: 224 Y 224
>gi|302834814|ref|XP_002948969.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
nagariensis]
gi|300265714|gb|EFJ49904.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
nagariensis]
Length = 646
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 80/187 (42%), Gaps = 29/187 (15%)
Query: 79 TCPDY---FRWIHEDLSPWKV-TGITRDMLERAN--QTAHFR-----LILVNNKVYIHKY 127
C +Y + IH DL +K GI+ +++ R +A +R + + KVYI
Sbjct: 30 VCDNYEPLYASIHRDLDIYKRNAGISPNLIYRTMVLHSAGYREKGLAVAVYRGKVYIISN 89
Query: 128 KQSIQTRD--------VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
+SI + V I +L L KY LPD+E ++ DRP+ G N
Sbjct: 90 TRSISLKRFGHHVALWVAYIKVLLDLEEKYGSYLPDVEFVWHTIDRPIRLVNTTPGGENF 149
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWE-SLLRELKEGNNGRNWIDREPYAY 238
P+FR+ DI+ P++ F+ +KP++ L + N W R P +
Sbjct: 150 ---PVFRFCKSVVHPDILVPNFHFY------MKPYQREFLDRIPHFNAEVPWAQRRPIVF 200
Query: 239 WKGNPFV 245
+ + +V
Sbjct: 201 ARFSGYV 207
>gi|255930987|ref|XP_002557050.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211581669|emb|CAP79782.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 436
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 58/137 (42%), Gaps = 14/137 (10%)
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQS--IQTRDVFTIWGILQLLRKYPGR--LPDL 156
+++ R R I+ ++YI Y +R T+ + + L +P R LP +
Sbjct: 103 KELDSRTVDDGMVRGIIDRGELYIVDYAPMPVTASRARATLNSLHRALTAFPDRHLLPSI 162
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEINIKPWE 215
E +F +D + P P++ YS D T + PD+ +W W E+ I P+
Sbjct: 163 EFIFTTED---------FAEDTTTPSPIWSYSKRDSHTSVWLMPDFGYWAWPEVQIGPYH 213
Query: 216 SLLRELKEGNNGRNWID 232
+ R + ++G D
Sbjct: 214 EVRRRIAAIDDGETAAD 230
>gi|348511563|ref|XP_003443313.1| PREDICTED: KDEL motif-containing protein 1-like [Oreochromis
niloticus]
Length = 538
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTGI---TRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DLS + +++ +R Q + +NK+Y+ + + + R
Sbjct: 190 CPQSFAQIERDLSFYPSVDPDHNAQEIPQRFGQRQSLCHYTVKDNKIYVRTFGEHVGFR- 248
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
+F +L L RK RLPD+E + D P+ + PP P+F + G
Sbjct: 249 IFMDAILLSLTRKV--RLPDVEFFVNLGDWPL---------EKRKPPEKLHPIFSWCGSN 297
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
T DIV P + + + SL + N G W ++ A+W+G
Sbjct: 298 NTRDIVMPTYDL-TESVLETMGRVSLDMMSVQANTGPPWPEKNATAFWRG 346
>gi|189194465|ref|XP_001933571.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979135|gb|EDU45761.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 434
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 12/107 (11%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E DD P RS+D + G K P +R D W M P++++W W
Sbjct: 158 IPNIEFALSLDDLPR-RSKDKGTFFGYTRKEGPE-YR---DIWMM----PNYAYWSWNYT 208
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
+ W ++ +E+++ W ++P W+G +A+ R++L+ +
Sbjct: 209 HAPSWNAIRKEIQQKETEVPWSKKDPRVVWRGKVKMAKLRQELIKVS 255
>gi|358373714|dbj|GAA90310.1| DUF821 domain protein [Aspergillus kawachii IFO 4308]
Length = 442
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 65/140 (46%), Gaps = 28/140 (20%)
Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYP--GRLPDLELMFDCDDRPVIR 169
R I+ + ++Y+ + + TR T+ + + L +P G LPD+E V+
Sbjct: 125 RGIIHHGELYVVDFGAMPATFTRGKSTLHALHRALASFPDRGSLPDIEF--------VLT 176
Query: 170 SRDYSGPNNKGPPPLFRYS-----GDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEG 224
+ DYS P++ YS G+ W M PD+ +W W E+ I P++ + R +
Sbjct: 177 TEDYSIHQG----PVWSYSKRDEQGNVWLM----PDFGYWSWPEVKIGPYKDIRRRIAAV 228
Query: 225 NNGRNWIDREPYAYWKGNPF 244
++G +D + + G PF
Sbjct: 229 DDGEVTLDGK---FIPGLPF 245
>gi|302511259|ref|XP_003017581.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
gi|291181152|gb|EFE36936.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
Length = 303
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/136 (19%), Positives = 58/136 (42%), Gaps = 18/136 (13%)
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K+Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 5 KLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDKV-------------- 50
Query: 181 PPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYA 237
P+ ++ R D + PD+ +W W E + + + + + +W + P
Sbjct: 51 -DPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREEHTWSWSKKIPQL 109
Query: 238 YWKGNPFVAETRRDLL 253
+W+G E R L+
Sbjct: 110 FWRGATMDLEVREKLI 125
>gi|258569104|ref|XP_002585296.1| predicted protein [Uncinocarpus reesii 1704]
gi|237906742|gb|EEP81143.1| predicted protein [Uncinocarpus reesii 1704]
Length = 483
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 40/189 (21%), Positives = 81/189 (42%), Gaps = 20/189 (10%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERAN------QTAHFRLILVNNKVYIHKYKQS 130
T C F ++ED+ I+R + + + Q R ++ N ++I + + +
Sbjct: 108 TGQCQTAFPGLYEDVDRAVRFWISRGGILKKSLNTVKLQNGMARAMIYNGNLFIIEARAA 167
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
+ + + + R R P++E +F +D+ D SG + PL+ S
Sbjct: 168 QEDHRRKILAVLSSIHRALGNRAPNIEFIFSVEDK----VEDVSGQGH----PLWVLSRK 219
Query: 191 RWTMDI-VFPDWSFWGWA--EINIKPWESLLRELKEGN--NGRNWIDREPYAYWKGN-PF 244
+ + PD+ FW W NI P++ +++ +++ + + W + P W+G F
Sbjct: 220 ATEKSVWLIPDFGFWAWGNPASNIGPYDQVVKRIEKFDLEDTMPWSSKTPRLVWRGKLSF 279
Query: 245 VAETRRDLL 253
+ RR LL
Sbjct: 280 APKLRRRLL 288
>gi|67901122|ref|XP_680817.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
gi|40742938|gb|EAA62128.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
gi|259483867|tpe|CBF79610.1| TPA: DUF821 domain protein (AFU_orthologue; AFUA_2G14740)
[Aspergillus nidulans FGSC A4]
Length = 462
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 69/152 (45%), Gaps = 23/152 (15%)
Query: 114 RLILVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
R + + ++YI Y + TR T+ + + L YP R LPD+E + DD
Sbjct: 143 RAAVKDGELYIIDYGPQPYTFTRGKATLHSLHRALSSYPDRHSLPDIEFVLTTDD----- 197
Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEIN-IKPWESLLRELKEGN 225
+S + P P++ Y+ + D + PD+ +W W E++ + ++ + R +
Sbjct: 198 ---FSTRTSTNPSPIWAYTKRQEDEDAAIWLMPDFGYWSWPEVDTVGEYKDVRRRIFAKE 254
Query: 226 NGRNWIDREPYAYWKG----NPFVAETRRDLL 253
G + D++ W+G NP E R+ LL
Sbjct: 255 EGLPFSDKKKQLLWRGSVSANP---EIRKALL 283
>gi|358415519|ref|XP_003583131.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
Length = 460
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
T +CPD I +D + + + + ++E + R +V+ + + Y++S+
Sbjct: 106 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGKYT 165
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 166 DFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 220
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 221 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 268
>gi|297482669|ref|XP_002693013.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
gi|296480318|tpg|DAA22433.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 2 [Bos taurus]
Length = 508
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
T +CPD I +D + + + + ++E + R +V+ + + Y++S+
Sbjct: 154 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGKYT 213
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 214 DFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 268
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 269 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 316
>gi|198433875|ref|XP_002126603.1| PREDICTED: similar to KTEL motif-containing protein 1 [Ciona
intestinalis]
Length = 398
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 51/225 (22%), Positives = 100/225 (44%), Gaps = 41/225 (18%)
Query: 67 ESISDYSIPPTS---TCPDYFRWIHEDLSPWKV-TGITRDMLERA-NQTAHFRLILVNNK 121
++IS+ + TS C + I DL W+ GIT + ++R ++ H+++I ++K
Sbjct: 42 DAISEAAANYTSCGGNCTCHSDVITSDLKLWRERGGITNEDMKRGLERSVHYQII--DHK 99
Query: 122 VYIHKYKQSIQTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRP-VIRSRDYSGPNN 178
+Y Q + +F GI + + LPD+EL + D P V++ Y
Sbjct: 100 LYR-------QDKCMFPSRCSGIEHFILEIINDLPDMELGINVHDWPQVMKHSPY----- 147
Query: 179 KGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNW 230
P P+ +S + DI++P W+FW + W+ + ++LK+ +N W
Sbjct: 148 --PMPILSFSKVAKEHQDIMYPAWTFWAGGPAVWPIYRNGLGRWDLMRKDLKKKDNEFPW 205
Query: 231 IDREPYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNAR 266
++ +++G +P V +R DL+ + W ++
Sbjct: 206 EKKQNIGFFRGSRTSSERDPLVLLSRENPDLVDAQYTKNQAWKSK 250
>gi|119481815|ref|XP_001260936.1| hypothetical protein NFIA_089970 [Neosartorya fischeri NRRL 181]
gi|119409090|gb|EAW19039.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 444
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 67/153 (43%), Gaps = 18/153 (11%)
Query: 81 PDYFRWIHEDLSPWKVTGIT-RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVF 137
P F I + +S K IT +++ + + R + ++YI Y T R
Sbjct: 89 PKLFVEIEKTVSLRKERPITFKEVDDVVVEDGMVRAAIWRGELYILDYAAQPYTYSRAKA 148
Query: 138 TIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
T+ + + L+ +P R LPD+E +F DD + P P++ YS
Sbjct: 149 TLNSLHRALQSFPDRHSLPDIEFVFTADDFSNV------------PGPVWSYSKRDEDES 196
Query: 196 I-VFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
I + PD+ +W W E+ + P++ + R + + G
Sbjct: 197 IWLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEG 229
>gi|395519040|ref|XP_003763661.1| PREDICTED: protein O-glucosyltransferase 1 [Sarcophilus harrisii]
Length = 335
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 22/188 (11%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKV 122
Q +S+ +Y C + + +DL+P++ GI+++M+ N+ ++ N++
Sbjct: 37 QINKSVENYQSCSKGNCSCHLGVMEKDLAPFQ-GGISKEMMANVVNRKLGTHYQIIKNEL 95
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y ++ R G+ + + LPD+E++ + D P I P P
Sbjct: 96 Y-REHDCMFPAR----CSGVEHFILEIINHLPDMEMVINVRDYPQI-------PKWMEPI 143
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P W+FW G A I P W+ L +L W +
Sbjct: 144 IPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWEKKV 203
Query: 235 PYAYWKGN 242
Y++G+
Sbjct: 204 SKGYFRGS 211
>gi|195572998|ref|XP_002104482.1| GD18425 [Drosophila simulans]
gi|194200409|gb|EDX13985.1| GD18425 [Drosophila simulans]
Length = 404
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 57/125 (45%), Gaps = 12/125 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C Y I ++L P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CICYVATIQKNLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL +PDLE + + D P + SG + P+ YS +DI++P
Sbjct: 120 E---DLLLDMASGVPDLEFVLNVRDWPQVHF--LSGLSG----PVLSYSITDRHLDIMYP 170
Query: 200 DWSFW 204
WSFW
Sbjct: 171 AWSFW 175
>gi|195331145|ref|XP_002032263.1| GM23616 [Drosophila sechellia]
gi|194121206|gb|EDW43249.1| GM23616 [Drosophila sechellia]
Length = 404
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 57/125 (45%), Gaps = 12/125 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C Y I ++L P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CLCYVATIQKNLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL +PDLE + + D P + SG + P+ YS +DI++P
Sbjct: 120 E---DLLLDMASGVPDLEFVLNVRDWPQVHF--LSGLSG----PVLSYSITDRHLDIMYP 170
Query: 200 DWSFW 204
WSFW
Sbjct: 171 AWSFW 175
>gi|91081993|ref|XP_969039.1| PREDICTED: similar to AGAP008037-PA [Tribolium castaneum]
gi|270007375|gb|EFA03823.1| hypothetical protein TcasGA2_TC013938 [Tribolium castaneum]
Length = 498
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 76/170 (44%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTG--ITRDMLERANQTAH----FRLILVNNKVYIHKYKQSIQT 133
C + R I DL P++ RD + + H R ++ NN++Y Y + +
Sbjct: 152 CGNTPRQILADLKPFQTVNWDKLRDKVIKKFDQPHSISLCRYVIKNNEIYRTCYGKYVGF 211
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPDLE + D P++ + + P +F + G +
Sbjct: 212 K-MFMDAILLSLSRKV--NLPDLEFFINLGDWPLVTEKIETFP-------IFSWCGSTTS 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+DIV P + E ++ ++ ++ +GN +W +R A+W+G
Sbjct: 262 LDIVMPTYDI---TESTLENMGRVMLDMLSVQGNVKESWENRTGQAFWRG 308
>gi|224134845|ref|XP_002321919.1| predicted protein [Populus trichocarpa]
gi|222868915|gb|EEF06046.1| predicted protein [Populus trichocarpa]
Length = 233
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 16/23 (69%), Positives = 21/23 (91%)
Query: 271 DWILESKRGFQQSNLASQCAHRY 293
DWI ES++G+QQSNLA+QC H+Y
Sbjct: 22 DWIKESQQGYQQSNLANQCVHKY 44
>gi|348553234|ref|XP_003462432.1| PREDICTED: KDEL motif-containing protein 2-like [Cavia porcellus]
Length = 692
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 76/169 (44%), Gaps = 14/169 (8%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRDVF 137
+CP + I ED + + + + + E + R +V+ + H Y++S+ F
Sbjct: 341 SCPTWEPQIEEDFASFPTINLQQMLSEVPKRFGDERGAIVHYTILNNHIYRRSLGKYTDF 400
Query: 138 TIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
++ +L L RK RLPDLE + D P+ + P+ P P+ + G +
Sbjct: 401 KMFSDEILLSLARKV--RLPDLEFYINLGDWPLEHRKINETPS---PVPVISWCGSLDSR 455
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
D++ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 456 DVILPTYDV---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 501
>gi|350633295|gb|EHA21660.1| hypothetical protein ASPNIDRAFT_184185 [Aspergillus niger ATCC
1015]
Length = 452
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 97/232 (41%), Gaps = 52/232 (22%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTST-------------CPDYFRWIHED----LSPWKV- 96
CP P +++S S PPT T C F + ED S W
Sbjct: 64 CPAQLP----FHDTVSSSSAPPTRTFFDPLEYSYNETQCTTLFPGLFEDPIRAQSFWTAR 119
Query: 97 TGITR---DMLERANQTAHFRLILVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPG 151
GI R D ++ A+ A R+ + ++Y+ + K R + GIL + +
Sbjct: 120 HGIKRIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRRKIL---GILASIHRALV 174
Query: 152 RLPDL------ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFW 204
P L E++F +D+ D +GP++ PL+ + + + PD+ FW
Sbjct: 175 SAPQLAVILNTEIIFSVEDK----LEDVAGPDH----PLWVLARKATEESVWLMPDFGFW 226
Query: 205 GWAEIN--IKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
W I+ I P++ +++ ++E + W ++E W+G F + RR LL
Sbjct: 227 SWGHIDTQIGPYDEVVKHVEEHEHP--WDEKENKLVWRGKLSFAPKLRRALL 276
>gi|145251704|ref|XP_001397365.1| endoplasmic reticulum-resident kdel protein [Aspergillus niger CBS
513.88]
gi|134082901|emb|CAK42731.1| unnamed protein product [Aspergillus niger]
Length = 456
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 97/232 (41%), Gaps = 52/232 (22%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTST-------------CPDYFRWIHED----LSPWKV- 96
CP P +++S S PPT T C F + ED S W
Sbjct: 68 CPAQLP----FHDTVSSSSAPPTRTFFDPLEYSYNETQCTTLFPGLFEDPIRAQSFWTAR 123
Query: 97 TGITR---DMLERANQTAHFRLILVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPG 151
GI R D ++ A+ A R+ + ++Y+ + K R + GIL + +
Sbjct: 124 HGIKRIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRRKIL---GILASIHRALV 178
Query: 152 RLPDL------ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFW 204
P L E++F +D+ D +GP++ PL+ + + + PD+ FW
Sbjct: 179 SAPQLAVILNTEIIFSVEDK----LEDVAGPDH----PLWVLARKATEESVWLMPDFGFW 230
Query: 205 GWAEIN--IKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
W I+ I P++ +++ ++E + W ++E W+G F + RR LL
Sbjct: 231 SWGHIDTQIGPYDEVVKHVEEHEHP--WDEKENKLVWRGKLSFAPKLRRALL 280
>gi|334329589|ref|XP_001370099.2| PREDICTED: protein O-glucosyltransferase 1 [Monodelphis domestica]
Length = 383
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 84/190 (44%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM---LERANQTAHFRLILVNN 120
Q +S+ +Y P C + + +DL+P++ GI++++ L H+++I N
Sbjct: 29 QINKSVENYQPCPRENCSCHLGVMEKDLAPFQ-GGISKEVMTTLVSRKLGTHYQII--KN 85
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y ++ R ILQ++ LPD+E++ + D P + P
Sbjct: 86 ELY-REHDCMFPARCSGVEHFILQVI----NHLPDMEMVINVRDYPQV-------PKWME 133
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ L +L W
Sbjct: 134 PAIPVFSFSKTNEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWER 193
Query: 233 REPYAYWKGN 242
+ Y++G+
Sbjct: 194 KISRGYFRGS 203
>gi|317147538|ref|XP_001822208.2| hypothetical protein AOR_1_1442014 [Aspergillus oryzae RIB40]
gi|391873092|gb|EIT82167.1| hypothetical protein Ao3042_00673 [Aspergillus oryzae 3.042]
Length = 438
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 31/137 (22%), Positives = 63/137 (45%), Gaps = 17/137 (12%)
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDL 156
+D+ + R I+ + ++YI + + TR T+ + + L +P R LP++
Sbjct: 108 KDVDSLTVENGMVRGIIDHGELYIVDFGNMPATFTRGKATLNSLHRALASFPDRDRLPNV 167
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWE 215
E V+ + DYS G P++ YS ++ + PD+ +W W E+ + P++
Sbjct: 168 EF--------VLTTEDYS----SGEGPIWSYSKREENTNVWLMPDFGYWSWPEVGVGPYK 215
Query: 216 SLLRELKEGNNGRNWID 232
R + ++G +D
Sbjct: 216 DARRRIAAIDDGEVTVD 232
>gi|24649142|ref|NP_732797.1| CG31139, isoform A [Drosophila melanogaster]
gi|23171999|gb|AAF56061.2| CG31139, isoform A [Drosophila melanogaster]
gi|262051015|gb|ACY07068.1| FI12101p [Drosophila melanogaster]
Length = 397
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 12/125 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C + I DL P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL + DLE + + D P + SG + P+F YS +DI++P
Sbjct: 120 E---DLLLDMASGVADLEFVLNVRDWPQVHF--LSGLSG----PVFSYSITNRHLDIMYP 170
Query: 200 DWSFW 204
WSFW
Sbjct: 171 AWSFW 175
>gi|25009725|gb|AAN71037.1| AT07872p [Drosophila melanogaster]
Length = 397
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 12/125 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C + I DL P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL + DLE + + D P + SG + P+F YS +DI++P
Sbjct: 120 E---DLLLDMASGVADLEFVLNVRDWPQVHF--LSGLSG----PVFSYSITNRHLDIMYP 170
Query: 200 DWSFW 204
WSFW
Sbjct: 171 AWSFW 175
>gi|348506788|ref|XP_003440939.1| PREDICTED: protein O-glucosyltransferase 1-like [Oreochromis
niloticus]
Length = 366
Score = 41.2 bits (95), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 41/184 (22%), Positives = 80/184 (43%), Gaps = 20/184 (10%)
Query: 67 ESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK 126
+++ Y+ C + + DL +K GI++D++ Q R + + ++ HK
Sbjct: 15 DAVKGYTPCEPVNCSCHLSVLQHDLQTFK-GGISQDVMAATIQ----RGVGTHYQIIGHK 69
Query: 127 YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLF 185
+ G+ + + +LPDLE++ + D P + P+ P P+F
Sbjct: 70 LYREQNCMFPARCSGVEHFILEVIDKLPDLEMVVNVRDYPQV-------PHWVQPTLPVF 122
Query: 186 RYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAY 238
+S DI++P W+FW G A I P W+ + +LK+ W +E +
Sbjct: 123 SFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRDDLKKSAAQWPWKKKESKGF 182
Query: 239 WKGN 242
++G+
Sbjct: 183 FRGS 186
>gi|260827316|ref|XP_002608611.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
gi|229293962|gb|EEN64621.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
Length = 513
Score = 41.2 bits (95), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 78/184 (42%), Gaps = 19/184 (10%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRL---ILVNNKVYIHKYKQSIQTRD 135
CP F I DL + + R E ++ H L + +NK++ + Q +
Sbjct: 162 CPSAFPQIERDLEIFPKINLNRLSKEAVDRFGTHHSLCHYTVKDNKIHRKCHGQHTGFK- 220
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F + + RK R+PD+E + D P+ + + GP P+ + G T D
Sbjct: 221 MFMDATLHSITRKV--RIPDIEFFVNLGDWPLEKRQ-----VKDGPLPILSWCGSEETRD 273
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
IV P + + + SL +GN G W+++ A W+G ++RR+ L
Sbjct: 274 IVMPTYDL-TESTLETMGRVSLDMLSVQGNTGPRWVNKTEQALWRGR----DSRRERL-- 326
Query: 256 NLSD 259
NL D
Sbjct: 327 NLVD 330
>gi|91082811|ref|XP_968605.1| PREDICTED: similar to endoplasmic reticulum-resident kdel protein
[Tribolium castaneum]
Length = 362
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 88/189 (46%), Gaps = 29/189 (15%)
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
+YS + C Y I +DL +K GIT ++++ +T + ++++K+Y
Sbjct: 22 ENYSPCDNTKCGCYSSQISDDLKIFK-KGITPQLIDKV-KTKGTKYQIIDHKLY------ 73
Query: 130 SIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPL 184
RD ++ GI L K +LPD+EL+ + D P I +DY P+
Sbjct: 74 ----RDKNCMFPARCAGIEHFLLKLLPKLPDMELIINTRDWPQIH-KDYGVFG-----PV 123
Query: 185 FRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAY 238
F +S DI++P W+FW G I++ P W++ L + N W ++ P +
Sbjct: 124 FSFSKTSDYSDIMYPAWAFWEGGPAISLYPRGIGRWDTHRDLLGKKGNETLWDEKIPKGF 183
Query: 239 WKGNPFVAE 247
++G+ AE
Sbjct: 184 FRGSRTSAE 192
>gi|119190301|ref|XP_001245757.1| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
gi|392868638|gb|EAS34429.2| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
Length = 463
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 54/264 (20%), Positives = 101/264 (38%), Gaps = 35/264 (13%)
Query: 15 FSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKT------------ 62
F +++ N ++ P + +T NC PT +
Sbjct: 35 FGMNSDGDNVHPLLTQLIPAGHCTCKASTTFNCTSCVDSITPTTFETASPSLSPSSPAWT 94
Query: 63 ---SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLIL 117
+ ++ S S+ P F +H + W GI ++ L+ + R ++
Sbjct: 95 FVYGRDDRNVGLTSSQCQSSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMI 154
Query: 118 VNNKVYIHKYKQSIQTRDVFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
N +Y+ + + + Q I G+L + R +PD+E +F +DR D SG
Sbjct: 155 FNGNLYVIETRAA-QEDHRQKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGT 209
Query: 177 NNKGPPPLFRYSGDRWTMDIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNW 230
PL+ G + + + V PD+ FW W NI P++ ++ ++ + + W
Sbjct: 210 GQ----PLWVL-GRKASEESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPW 264
Query: 231 IDREPYAYWKGN-PFVAETRRDLL 253
+ W+G F + RR LL
Sbjct: 265 SSKTQKLLWRGKLSFSPKLRRGLL 288
>gi|195143521|ref|XP_002012746.1| GL23775 [Drosophila persimilis]
gi|194101689|gb|EDW23732.1| GL23775 [Drosophila persimilis]
Length = 419
Score = 40.8 bits (94), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 76/190 (40%), Gaps = 23/190 (12%)
Query: 73 SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
S P C + I DL P+ GITR M+ ++ RL +V + Y+Q +
Sbjct: 56 SQPSDPDCECHALTIRRDLGPYAEAGITRSMMAQSR-----RLGVVYQVIDGRIYRQP-E 109
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP-VIRSRDYSGPNNKGPPPLFRYSGDR 191
+ +L G LP++E + + D P V ++G P+F +S
Sbjct: 110 VPHPKRCADVEDMLLGIAGELPNVEFILNVRDWPQVPFLSGFTG-------PVFSHSVSH 162
Query: 192 WTMDIVFPDWSFWGWAE-------INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
+DI+ P WSF + I W + R + +W ++P +++G
Sbjct: 163 QHLDIMCPAWSFSSVSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGT-- 220
Query: 245 VAETRRDLLT 254
+ T RD L
Sbjct: 221 RSSTERDTLV 230
>gi|442620545|ref|NP_001262849.1| rumi, isoform B [Drosophila melanogaster]
gi|440217767|gb|AGB96229.1| rumi, isoform B [Drosophila melanogaster]
Length = 316
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 28/128 (21%), Positives = 54/128 (42%), Gaps = 10/128 (7%)
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K+Y H+ + GI L LPD++L+ + D P + + N
Sbjct: 13 KIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDLIINTRDYPQLNA----AWGNAA 68
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S + DI++P W+FW G + P W+ + +L++ W +
Sbjct: 69 GGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKR 128
Query: 235 PYAYWKGN 242
+++G+
Sbjct: 129 SLGFFRGS 136
>gi|321474111|gb|EFX85077.1| hypothetical protein DAPPUDRAFT_314403 [Daphnia pulex]
Length = 502
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 87/212 (41%), Gaps = 31/212 (14%)
Query: 48 VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA 107
V ++ CP + E I+DY C I DL+ ++ + + E
Sbjct: 129 VYAESCDCPLD-----SIDEMINDYQ------CSVNVAQIDHDLNQFQNVDFNQVLTEAM 177
Query: 108 NQTAH------FRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
+ +H +++NKVY Y Q + ++F +L L RK LPD+E + +
Sbjct: 178 KRFSHAGSYSFCHYAVLDNKVYRRCYGQHVGF-NMFMDNILLSLSRK--AVLPDMEFLIN 234
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
D P+++ N P+F + G T DIV P + A + +L
Sbjct: 235 LGDWPLVKK------NILPIIPIFSWCGSTQTADIVMPTYDI-TEASLECMGRVTLDMLS 287
Query: 222 KEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ N W +++ A+W+G ++RR+ L
Sbjct: 288 VQSNPDTKWENKQEKAFWRGR----DSRRERL 315
>gi|18606235|gb|AAH23141.1| Kdelc1 protein [Mus musculus]
gi|148664477|gb|EDK96893.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_c [Mus musculus]
Length = 282
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 56/130 (43%), Gaps = 16/130 (12%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSF 203
DIV P +
Sbjct: 263 RDIVMPTYDL 272
>gi|291383938|ref|XP_002708527.1| PREDICTED: KDEL (Lys-Asp-Glu-Leu) containing 2 [Oryctolagus
cuniculus]
Length = 509
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 77/171 (45%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + ++ Y++S+
Sbjct: 155 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSLGKYT 214
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ R P GP P+ + G +
Sbjct: 215 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRRVNETP---GPIPIISWCGSQD 269
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 270 SRDIILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 317
>gi|449283857|gb|EMC90451.1| KTEL motif-containing protein 1, partial [Columba livia]
Length = 364
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 85/195 (43%), Gaps = 32/195 (16%)
Query: 62 TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILV 118
T Q ++++ Y C + +DL+P++ GI+++++ H+++ V
Sbjct: 8 TDQIKKAVEVYKPCVKENCSCHQSVWKQDLAPFR-GGISKEIISDVVSRKLGTHYQI--V 64
Query: 119 NNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
NK+Y + +D G+ L + RLPD+E++ + D P +
Sbjct: 65 KNKLY--------REQDCLFPARCSGVEHFLLEIISRLPDMEMVINVRDYPQV------- 109
Query: 176 PNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNG 227
P P P+F +S DI++P W+FW G A I P W+ + +L+
Sbjct: 110 PKWMKPVIPIFSFSKTPEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEK 169
Query: 228 RNWIDREPYAYWKGN 242
W+ + Y++G+
Sbjct: 170 WPWMKKISKGYFRGS 184
>gi|320035743|gb|EFW17684.1| hypothetical protein CPSG_06127 [Coccidioides posadasii str.
Silveira]
Length = 463
Score = 40.4 bits (93), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 20/186 (10%)
Query: 78 STCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYIHKYKQSIQTRD 135
S+ P F +H + W GI ++ L+ + R ++ N +Y+ + + + Q
Sbjct: 113 SSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYVIETRAA-QEDH 171
Query: 136 VFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
I G+L + R +PD+E +F +DR D SG PL+ G + +
Sbjct: 172 RRKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGTGQ----PLWVL-GRKASE 222
Query: 195 DIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNWIDREPYAYWKGN-PFVAE 247
+ V PD+ FW W NI P++ ++ ++ + + W + W+G F +
Sbjct: 223 ESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLSFSPK 282
Query: 248 TRRDLL 253
RR LL
Sbjct: 283 LRRGLL 288
>gi|148922833|ref|NP_001092215.1| KDEL motif-containing protein 2 [Danio rerio]
gi|148744719|gb|AAI42840.1| Zgc:165521 protein [Danio rerio]
Length = 518
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 75/166 (45%), Gaps = 11/166 (6%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLER----ANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
CP I +D S + + + + E A + ++NN+V+ + +
Sbjct: 165 CPQEDPQIQKDFSSFPSIDLQQLLQEVPTRFAKRGGLIHYTILNNQVHRRSLGRYTDFK- 223
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F+ +L L RK +LPD+E + D P + +R + +N GP P+ + G T D
Sbjct: 224 MFSDEILLSLARKV--KLPDVEFYINVGDWP-MENRKVN--DNPGPVPVISWCGSTETRD 278
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
I+ P + + ++ + L + +GN G W D+ A+++G
Sbjct: 279 IILPTYDITHSSLEAMRGVTNDLLSV-QGNTGPTWSDKMNKAFFRG 323
>gi|303315019|ref|XP_003067517.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
delta SOWgp]
gi|240107187|gb|EER25372.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
delta SOWgp]
Length = 463
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 20/186 (10%)
Query: 78 STCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYIHKYKQSIQTRD 135
S+ P F +H + W GI ++ L+ + R ++ N +Y+ + + + Q
Sbjct: 113 SSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYVIETRAA-QEDH 171
Query: 136 VFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
I G+L + R +PD+E +F +DR D SG PL+ G + +
Sbjct: 172 RRKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGTGQ----PLWVL-GRKASE 222
Query: 195 DIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNWIDREPYAYWKGN-PFVAE 247
+ V PD+ FW W NI P++ ++ ++ + + W + W+G F +
Sbjct: 223 ESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLSFSPK 282
Query: 248 TRRDLL 253
RR LL
Sbjct: 283 LRRGLL 288
>gi|388494308|gb|AFK35220.1| unknown [Lotus japonicus]
Length = 211
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 269 VQDWILESKRGFQQSNLASQCAHRY 293
+QDWI ES++GF QSNL +QC +RY
Sbjct: 1 MQDWIQESQQGFNQSNLENQCTYRY 25
>gi|33327382|gb|AAQ09021.1| unknown protein [Homo sapiens]
Length = 507
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 76/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP+ I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPNKETQIAKDFASFPSINLQQMLKEVPKSFGYERGAIVHYTILNNHVYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 315
>gi|345493843|ref|XP_003427163.1| PREDICTED: KDEL motif-containing protein 1-like [Nasonia
vitripennis]
Length = 507
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 74/165 (44%), Gaps = 24/165 (14%)
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
D + N + ++ +N++Y + Y Q + + +F+ +L L RK LPD+E +
Sbjct: 188 DRYNKPNSVSICHYVIKSNQIYRNCYGQHVGFK-IFSDAILLSLARKI--NLPDVEFFMN 244
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
D P++ P NK P+F + G + DIV P + +I E++ R +
Sbjct: 245 LGDWPLV-------PKNKEIHPIFSWCGSDDSYDIVLPTY------DITQSSMEAMGRVM 291
Query: 222 -----KEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
+G+ W + +W+G E R DL+ +++ KH
Sbjct: 292 LDMLSVQGSTTDPWNKKIEKMFWRGRDARRE-RLDLI--DIARKH 333
>gi|196003266|ref|XP_002111500.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
gi|190585399|gb|EDV25467.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
Length = 506
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 13/140 (9%)
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
D + + H+ +I +NKVY + Y + + + +F +L L RK +LPDLE + +
Sbjct: 187 DRFGKHHAICHYSVI--SNKVYRNCYGEHVGFK-MFMDAILLSLARKV--KLPDLEFISN 241
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
D P+ ++ P+F + G T DIV P + + + + S+
Sbjct: 242 LGDWPLENQQN-------NLIPIFSWCGSETTSDIVMPTYDL-TQSTLEMMGRVSVDVLA 293
Query: 222 KEGNNGRNWIDREPYAYWKG 241
+G+ G W D+ +W+G
Sbjct: 294 VQGSTGPKWKDKIEKGFWRG 313
>gi|60302736|ref|NP_001012560.1| KTEL motif-containing protein 1 precursor [Gallus gallus]
gi|60098623|emb|CAH65142.1| hypothetical protein RCJMB04_4b16 [Gallus gallus]
Length = 392
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 85/193 (44%), Gaps = 28/193 (14%)
Query: 62 TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR----DMLERANQTAHFRLIL 117
T Q ++ Y C + +DL+P++ +GI++ D++ R T H+++I
Sbjct: 36 TDQINRAVEVYKPCVKENCSCHQSVWKQDLAPFR-SGISKETISDVVSRKLGT-HYQII- 92
Query: 118 VNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
NK+Y ++ R G+ +R RLPD+E++ + D P + P
Sbjct: 93 -KNKLY-REHDCMFPAR----CSGVEHFIRGIINRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ + +L+
Sbjct: 140 WMKPIIPVFSFSKTAEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEKWP 199
Query: 230 WIDREPYAYWKGN 242
W + Y++G+
Sbjct: 200 WKKKISKGYFRGS 212
>gi|340369006|ref|XP_003383040.1| PREDICTED: protein O-glucosyltransferase 1-like [Amphimedon
queenslandica]
Length = 325
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 59/130 (45%), Gaps = 17/130 (13%)
Query: 87 IHEDLSPWKVTGITR--DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQ 144
+ DL W+ G + + + ++ H+++ VN K+Y K F G+
Sbjct: 4 LESDLGVWRERGGIKREEFIHAKSKGVHYQI--VNGKLYREK-----DCLFSFRCKGVEH 56
Query: 145 LLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
+ LP++EL+ + D P +S Y P P+F +S DI++P W+FW
Sbjct: 57 FILNIIEDLPNMELIINVFDYP--KSHKYHSP-----LPVFSFSKTVHYWDIMYPAWTFW 109
Query: 205 -GWAEINIKP 213
G ++++P
Sbjct: 110 SGGPAVSVEP 119
>gi|449303255|gb|EMC99263.1| hypothetical protein BAUCODRAFT_65196 [Baudoinia compniacensis UAMH
10762]
Length = 394
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 79/206 (38%), Gaps = 28/206 (13%)
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ G+ R M+ NQ I+ + V H ++ R + T+ I + + L
Sbjct: 49 WRADGVLRGMIHD-NQL----YIIDAHGVCDHNHR----PRTMATLHAINRAVTASVEPL 99
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR--WTMDIVFPDWSFWGWAEINI 211
PD+E F D ++ D N R + W M PD+ FW W E +
Sbjct: 100 PDIEFTFVDHDTALLDDDD-----NHTTWAYSRLANQESLWLM----PDFGFWAWPEYGM 150
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNP---FVAETRRDLLTCNLSDKHDWNARLY 268
+ + L L E + +D+ P W+G F R LL S W+A +
Sbjct: 151 RSYSELQANLDETEE--HLLDKAPQIVWRGGTKAGFGGHAREGLL--KHSTGQSWSA-VE 205
Query: 269 VQDWILESKRGFQQSNLASQCAHRYS 294
+ DW + + +A C + ++
Sbjct: 206 MIDWANRTDVSHKLITMAEHCGYMFA 231
>gi|47078285|ref|NP_997610.1| KDEL (Lys-Asp-Glu-Leu) containing 2 protein precursor [Mus
musculus]
gi|148693845|gb|EDL25792.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_b [Mus musculus]
gi|157169796|gb|AAI52824.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [synthetic construct]
Length = 503
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + E + R +V+ + H Y++S+
Sbjct: 149 TLSCPASEPQIEQDFVSFPSINLQQMLKEVPTRFGDERGAVVHYTILNNHIYRRSLGKYT 208
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 209 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 263
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 264 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 311
>gi|255955591|ref|XP_002568548.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590259|emb|CAP96435.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 469
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 113/283 (39%), Gaps = 46/283 (16%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCP---- 56
+I F+FS I+ F +++ I+ P +T C + TCP
Sbjct: 18 VITAFSFSAIIVIWFGMNSDREYIHPILTQLIPAGHCACESSTTFQC--STCLTCPEPGL 75
Query: 57 -TNYPKTSQTQESISDYS------IPPTSTCPDYFRWIHEDL--------SPWKVTGITR 101
TN +T+ + Y+ S C F + ED+ S +++ +
Sbjct: 76 QTNSLQTNSSPAWTFQYTRDGRNETLNKSQCKAAFPGLFEDVARGGKFWSSHGRLSSVEL 135
Query: 102 DMLERANQTAHFRLILVNNKVYI--HKYKQSIQTRDVF-TIWGILQLLRKYPGRL--PDL 156
D + Q R + N ++Y+ + K R + T+ I + L P R PD
Sbjct: 136 DAI--PIQLGMVRAFISNGQLYVVTARSKGEDHRRKILGTLSSIHRALAADPERASRPDF 193
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR---WTMDIVFPDWSFWGWAEIN--I 211
E +F +D+ D + N++ P +F + W M PD+SFW W N I
Sbjct: 194 EFVFSVEDK----VDDVT--NSEWPVWVFSRTPSEEGVWLM----PDFSFWAWDNKNNYI 243
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLL 253
P++ ++ +K + W ++ P W+G P F + RR L+
Sbjct: 244 GPYDQVVERIKHMDI--PWSEKTPQLVWRGKPSFAPKLRRALM 284
>gi|26344660|dbj|BAC35979.1| unnamed protein product [Mus musculus]
Length = 465
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + E + R +V+ + H Y++S+
Sbjct: 111 TLSCPASEPQIEQDFVSFPSINLQQMLKEVPTRFGDERGAVVHYTILNNHIYRRSLGKYT 170
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 171 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 225
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 226 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 273
>gi|444723570|gb|ELW64221.1| KDEL motif-containing protein 2 [Tupaia chinensis]
Length = 470
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/171 (23%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ V H Y++S+
Sbjct: 116 TLSCPIKEPQIAKDFASFPSINLQQMLNEVPERFGDERGAIVHYTVLNNHIYRRSLGKYT 175
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 176 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GPIPIISWCGSLD 230
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 231 SRDIILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 278
>gi|115399170|ref|XP_001215174.1| predicted protein [Aspergillus terreus NIH2624]
gi|114192057|gb|EAU33757.1| predicted protein [Aspergillus terreus NIH2624]
Length = 407
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 80/173 (46%), Gaps = 30/173 (17%)
Query: 81 PDYFRWIHEDLSPWK-VTGITRDMLERAN-QTAHFRLILVNNKVYIHKYK---QSIQTRD 135
P F+ I + WK GI++D L+ ++ R I+ N ++YI K + +T+
Sbjct: 75 PGLFQDISRAVQYWKPRGGISKDDLDAIPFESGMARAIISNGELYIVATKAKGKDHRTKI 134
Query: 136 VFTIWGILQLLRKYPGRL--PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ T+ I + L P RL P +E +F +DR + D +G P++ S + +
Sbjct: 135 LATLGSIYRALSSSPTRLSLPTIEFVFSIEDR--VDDIDATGH------PVWVVS-RKVS 185
Query: 194 MDIVF--PDWSFWGWAEINIKPWES------------LLRELKEGNNGRNWID 232
+ VF PD+ +W WA+ +I P ++ L R L + G+ W D
Sbjct: 186 EESVFLMPDFGYWSWAKSHIGPAKARVERGKLSFAPKLRRALLDVARGKPWGD 238
>gi|119499505|ref|XP_001266510.1| hypothetical protein NFIA_041910 [Neosartorya fischeri NRRL 181]
gi|119414674|gb|EAW24613.1| predicted protein [Neosartorya fischeri NRRL 181]
Length = 423
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 9/95 (9%)
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
P LPD + DRP+ +S Y+ P + P +GD + P ++FW W
Sbjct: 126 PEPLPDTVFALNIQDRPMAQSWSYARPAD----PTATKAGD-----FLMPHFAFWAWPLK 176
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
I + L + E + + + P A W+G P+
Sbjct: 177 YIGSMDRALTAITEIETDQTFQAKIPQAVWRGTPW 211
>gi|255938965|ref|XP_002560252.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584874|emb|CAP82911.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 458
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
I+ PD+ +W W + NI P+ +++ + + + D+E W+G F + RR LL
Sbjct: 223 ILMPDFGYWSWDKSNIGPYGQVVQSIMAAESNLKFADKEQKLVWRGKLSFAPKLRRALL 281
>gi|260810659|ref|XP_002600074.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
gi|229285359|gb|EEN56086.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
Length = 351
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 33/188 (17%)
Query: 76 PTSTCPDYFRWIHEDLSPWK-VTGITRDMLERA---NQTAHFRLILVNNKVYIHKYKQSI 131
P C + I DL WK GIT++ +R ++ H+++I ++K+Y
Sbjct: 10 PDKKCSCHSSVITHDLKLWKDRGGITKEEFDRTLEQDKGTHYQII--DHKLYR------- 60
Query: 132 QTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS- 188
Q R +F G+ + + LPD+E++ + D P S + P P+ +S
Sbjct: 61 QDRCMFPARCSGVEHFILEIIDDLPDMEMIINVRDWP------QSPMYQREPRPVMSFSK 114
Query: 189 --GDRWTMDIVFPDWSFWGWA-------EINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
+ W DI++P W+FW + W+ + + + N W +E A++
Sbjct: 115 TFNEHW--DIMYPAWTFWEGGPAVWPLFPTGLGRWDLMRDSITKANEDWPWEKKEDKAFF 172
Query: 240 KGNPFVAE 247
+G+ AE
Sbjct: 173 RGSRTSAE 180
>gi|260810693|ref|XP_002600088.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
gi|229285373|gb|EEN56100.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
Length = 351
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 33/188 (17%)
Query: 76 PTSTCPDYFRWIHEDLSPWK-VTGITRDMLERA---NQTAHFRLILVNNKVYIHKYKQSI 131
P C + I DL WK GIT++ +R ++ H+++I ++K+Y
Sbjct: 10 PDKKCSCHSSVITHDLKLWKDRGGITKEEFDRTLEQDKGTHYQII--DHKLYR------- 60
Query: 132 QTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS- 188
Q R +F G+ + + LPD+E++ + D P S + P P+ +S
Sbjct: 61 QDRCMFPARCSGVEHFILEIIDDLPDMEMIINVRDWP------QSPMYQREPRPVMSFSK 114
Query: 189 --GDRWTMDIVFPDWSFWGWA-------EINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
+ W DI++P W+FW + W+ + + + N W +E A++
Sbjct: 115 TFNEHW--DIMYPAWTFWEGGPAVWPLFPTGLGRWDLMRDSITKANEDWPWEKKEDKAFF 172
Query: 240 KGNPFVAE 247
+G+ AE
Sbjct: 173 RGSRTSAE 180
>gi|443897893|dbj|GAC75232.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
Length = 479
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/160 (22%), Positives = 65/160 (40%), Gaps = 24/160 (15%)
Query: 98 GITRDMLERANQTAH---FRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL-RKYPGRL 153
GI D ++ A + LI+ + ++++ + Q+R + +QLL Y G
Sbjct: 99 GIKYDNVQNAAENCRHGCVHLIIKHGQIFVRAQVKDWQSR----VRSTMQLLTAAYQGAS 154
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD---------IVFPDWSFW 204
D + D + VI + D+ G + G W +D +FPD+SF
Sbjct: 155 EDEKARIDGTEL-VISTADFDGFTDSAS------RGAGWVLDKRVNDTEGQYLFPDFSFA 207
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
W E I + + ++ N W D+ A+W+G+
Sbjct: 208 SWPEAGIASYPEFRHDAEQVNAETPWHDKLNPAFWRGDAL 247
>gi|83770071|dbj|BAE60206.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 460
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
TR T+ + + L +P R LP++E V+ + DYS G P++ YS
Sbjct: 164 TRGKATLNSLHRALASFPDRDRLPNVEF--------VLTTEDYS----SGEGPIWSYSKR 211
Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWID 232
++ + PD+ +W W E+ + P++ R + ++G +D
Sbjct: 212 EENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAAIDDGEVTVD 254
>gi|72679937|gb|AAI00618.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
Length = 508
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + E + R +V+ + H Y++S+
Sbjct: 154 TLSCPANEPQIEQDFISFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 213
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 214 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 268
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 269 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 316
>gi|452984518|gb|EME84275.1| hypothetical protein MYCFIDRAFT_118382, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 387
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 42/200 (21%), Positives = 78/200 (39%), Gaps = 19/200 (9%)
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ GI R M+ NQ ++ + V H ++ R + T+ I + + G+L
Sbjct: 62 WRGDGIVRAMIH-DNQL----YVIDPHAVTDHNHR----PRTLATLHAIHRAVSASSGKL 112
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
PD+E F D + D G + + W M PD+ WGW ++ ++
Sbjct: 113 PDIEFSFTVHDFAL---HDRYGNHTTWAYTRLPHQEKLWLM----PDFGLWGWPDVGLRS 165
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQDWI 273
+ L + ++D+ P W+G+ V S+K W A + DW
Sbjct: 166 YAEFQTVLDYEED--EFVDKIPKLVWRGSLAVGSHDVRAGLVKHSEKQPW-ADVLELDWS 222
Query: 274 LESKRGFQQSNLASQCAHRY 293
+ + ++ CA+ +
Sbjct: 223 NRTNIEERLLSMQDHCAYMF 242
>gi|198450765|ref|XP_002137151.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
gi|198131182|gb|EDY67709.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
Length = 420
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 44/190 (23%), Positives = 75/190 (39%), Gaps = 23/190 (12%)
Query: 73 SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
S P C + I D P+ GITR M+ ++ RL +V + Y+Q +
Sbjct: 56 SQPSDPDCECHALTIRRDFGPYAEAGITRSMMAQSR-----RLGVVYKVIDGRIYRQP-E 109
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP-VIRSRDYSGPNNKGPPPLFRYSGDR 191
+ +L G LP++E + + D P V ++GP +F +S
Sbjct: 110 VPHPKRCADVEDMLLGIAGELPNVEFILNVRDWPQVPFLSGFTGP-------VFSHSVSH 162
Query: 192 WTMDIVFPDWSFWGWAE-------INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
+DI+ P WSF + I W + R + +W ++P +++G
Sbjct: 163 QHLDIMCPAWSFSSVSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGTR- 221
Query: 245 VAETRRDLLT 254
+ T RD L
Sbjct: 222 -SSTERDTLV 230
>gi|153218522|ref|NP_001020294.3| KDEL motif-containing protein 2 precursor [Rattus norvegicus]
gi|81888044|sp|Q566E5.1|KDEL2_RAT RecName: Full=KDEL motif-containing protein 2; Flags: Precursor
gi|62471552|gb|AAH93594.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
gi|149041673|gb|EDL95514.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
Length = 508
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + E + R +V+ + H Y++S+
Sbjct: 154 TLSCPANEPQIEQDFISFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 213
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 214 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 268
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 269 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 316
>gi|431907500|gb|ELK11352.1| KDEL motif-containing protein 2 [Pteropus alecto]
Length = 447
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 81/193 (41%), Gaps = 33/193 (17%)
Query: 52 TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA 111
T +CPTN P+ ++ D+ + DL + + + +
Sbjct: 94 TLSCPTNEPQIAK-----------------DFASFPSIDLQQM-LNQVPKRFGDERGAIV 135
Query: 112 HFRLILVNNKVYIHKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVI 168
H+ ++ NN H Y++S+ F ++ +L L RK LPDLE + D P+
Sbjct: 136 HYTIL--NN----HIYRRSLGKYTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLE 187
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
+ P GP P+ + G + DI+ P + ++ + L + +GN G
Sbjct: 188 HRKVNETP---GPLPIISWCGSLDSQDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGP 243
Query: 229 NWIDREPYAYWKG 241
+WI++ A+++G
Sbjct: 244 SWINKTEKAFFRG 256
>gi|410971943|ref|XP_003992420.1| PREDICTED: KDEL motif-containing protein 2 [Felis catus]
Length = 445
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 76/171 (44%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 91 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 150
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPD+E + D P+ + P GP P+ + G
Sbjct: 151 DFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 205
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ DI+ P + + ++ + +L +GN G +WI++ A+++G
Sbjct: 206 SRDIILPTYDI---SHSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 253
>gi|354481242|ref|XP_003502811.1| PREDICTED: KDEL motif-containing protein 2-like [Cricetulus
griseus]
Length = 472
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN-NKVYIHKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ V H +++S+
Sbjct: 118 TLSCPATEPQIEQDFTSFPSIDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSLGKYT 177
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 178 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 232
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 233 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 280
>gi|219886273|gb|ACL53511.1| unknown [Zea mays]
Length = 527
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 65 TQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
T+ S S PP CPDYFR+IH DLSPW G
Sbjct: 187 TKSSSSKNKQPPPQ-CPDYFRFIHSDLSPWMKRG 219
>gi|449495884|ref|XP_004159974.1| PREDICTED: O-glucosyltransferase rumi-like, partial [Cucumis
sativus]
Length = 210
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 21/24 (87%)
Query: 270 QDWILESKRGFQQSNLASQCAHRY 293
QDWI ES++G++QS LA+QC +RY
Sbjct: 2 QDWIQESQQGYKQSKLANQCTYRY 25
>gi|407921029|gb|EKG14198.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 441
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 35/167 (20%), Positives = 73/167 (43%), Gaps = 17/167 (10%)
Query: 89 EDLS-PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
EDL W+ GITR M+ +L ++ K H+ + + R + + + + +
Sbjct: 109 EDLDMSWRKEGITRCMI------YDHQLYVLETKGTTHR--RDYRERTLAVLHNMHRAIT 160
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIVFPDWSFWGW 206
Y G LP++E F DD Y N P ++ ++ + W + PD+ +W W
Sbjct: 161 AYNGPLPNIEFTFSVDDWV------YDEENLNTDPVVWGFTRQQSWENVWLMPDFGYWAW 214
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ ++ + ++ + + +++P W+G + E R+ L+
Sbjct: 215 PTEPVGAYQDVRNQMGVREKAQAFSEKKPKVVWRGAA-LTEQRQALI 260
>gi|73955182|ref|XP_546537.2| PREDICTED: KDEL motif-containing protein 2 [Canis lupus familiaris]
Length = 508
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 154 TLSCPTKEPQIAKDFASFPSINLQQMLDEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 213
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPD+E + D P+ + P GP P+ + G
Sbjct: 214 DFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 268
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 269 SRDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 316
>gi|238495869|ref|XP_002379170.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
gi|220694050|gb|EED50394.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
Length = 302
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
TR T+ + + L +P R LP++E V+ + DYS G P++ YS
Sbjct: 6 TRGKATLNSLHRALASFPDRDRLPNVEF--------VLTTEDYS----SGEGPIWSYSKR 53
Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWID 232
++ + PD+ +W W E+ + P++ R + ++G +D
Sbjct: 54 EENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAAIDDGEVTVD 96
>gi|344243629|gb|EGV99732.1| KDEL motif-containing protein 2 [Cricetulus griseus]
Length = 452
Score = 38.5 bits (88), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H +++S+
Sbjct: 98 TLSCPATEPQIEQDFTSFPSIDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSLGKYT 157
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 158 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 212
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 213 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 260
>gi|357608730|gb|EHJ66122.1| KDEL motif-containing protein 1 [Danaus plexippus]
Length = 234
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 56/124 (45%), Gaps = 12/124 (9%)
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
G+ L+ LPD++L+ + D P ++D+ P+F +S R DI++P
Sbjct: 33 GVEHYLKILSPTLPDMDLVINTRDWPQF-NKDWGHKK----APVFSFSKTRSYYDIMYPT 87
Query: 201 WSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
WSFW G I + P W+ + W +E A+++G+ +E R L+
Sbjct: 88 WSFWEGGPAIALYPTGIGRWDKHRTSISTAAEKWPWNKKEEKAFFRGSR-TSEERDALIL 146
Query: 255 CNLS 258
+ S
Sbjct: 147 LSRS 150
>gi|322691155|ref|YP_004220725.1| hypothetical protein BLLJ_0966 [Bifidobacterium longum subsp.
longum JCM 1217]
gi|320456011|dbj|BAJ66633.1| hypothetical protein BLLJ_0966 [Bifidobacterium longum subsp.
longum JCM 1217]
Length = 1012
Score = 38.1 bits (87), Expect = 4.2, Method: Composition-based stats.
Identities = 32/117 (27%), Positives = 49/117 (41%), Gaps = 21/117 (17%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
+PGR+ D+ ++ D RP +RS FR G D++ DW G A
Sbjct: 267 FPGRVTDVRIVASRDGRPAVRS--------------FRIDGSNADWDMISRDW-LKGAAP 311
Query: 209 INIKPW---ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
W E++L G G W+ + A W + ET R L++ L+ +HD
Sbjct: 312 GAFGDWMNTENILVTADPG-GGHGWVSVD--ADWDSYEWDVETMRPLMSDTLTRRHD 365
>gi|344287853|ref|XP_003415666.1| PREDICTED: KDEL motif-containing protein 2 [Loxodonta africana]
Length = 507
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIEKDFLSFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPD+E + D P+ + P GP PL + G
Sbjct: 213 DFKMFSDEILLSLARKV--LLPDVEFYINVGDWPLEHRKVNETP---GPIPLISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 268 SRDVILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 315
>gi|449485840|ref|XP_002191338.2| PREDICTED: protein O-glucosyltransferase 1 [Taeniopygia guttata]
Length = 514
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 86/196 (43%), Gaps = 34/196 (17%)
Query: 62 TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR----DMLERANQTAHFRLIL 117
T Q + ++ Y C + +DL+P++ GI++ D++ R T H+++I
Sbjct: 158 TGQIKRAVEAYEPCVKENCSCHQSVWKQDLAPFR-GGISKETMSDVVSRKLGT-HYQII- 214
Query: 118 VNNKVYIHK---YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
NK+Y + + + F I GI+ RLPD+E++ + D P +
Sbjct: 215 -KNKLYREQDCMFPARCSGVEHF-ILGIIH-------RLPDMEMVINVRDYPQV------ 259
Query: 175 GPNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNN 226
P P P+F +S DI++P W+FW G A I P W+ + +L+
Sbjct: 260 -PKWMKPIIPVFSFSKTSEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAE 318
Query: 227 GRNWIDREPYAYWKGN 242
W + Y++G+
Sbjct: 319 KWPWKKKISKGYFRGS 334
>gi|396482709|ref|XP_003841528.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
gi|312218103|emb|CBX98049.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
Length = 428
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 48/104 (46%), Gaps = 12/104 (11%)
Query: 153 LPDLELMFDCDDRPVIRSRDYS---GPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E DD P RS+D+ G K R D W M P++++W W
Sbjct: 152 VPNIEFSLSLDDLPR-RSKDHGVFFGYTRKEG----REYNDIWMM----PNYAYWSWNYT 202
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W S+ RE+++ W ++ W+G +A+ R +L+
Sbjct: 203 HAPSWNSIRREIEQREAEIPWAMKDSRVVWRGKVKMAKLRSELV 246
>gi|432106972|gb|ELK32490.1| KDEL motif-containing protein 2, partial [Myotis davidii]
Length = 442
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 74/169 (43%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++ +
Sbjct: 88 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGEERGAIVHYTILNNHIYRRPLGKYT 147
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 148 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 202
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 203 SRDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 250
>gi|154310746|ref|XP_001554704.1| hypothetical protein BC1G_06847 [Botryotinia fuckeliana B05.10]
Length = 438
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 38/214 (17%), Positives = 80/214 (37%), Gaps = 16/214 (7%)
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRDVFTI 139
P F + + K +T+ L+ F R ++ + ++YI I +R + T+
Sbjct: 95 PGLFEEVERPVKLRKNKKVTKKELDDTPALNGFIRAMIFDQQLYIIDTSGKIYSRGIATL 154
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
+ + + P LP++E + DD+ +G P + + P
Sbjct: 155 HALHRAMLTSPEPLPNIEFTMNVDDK------------MEGHPQWLYARVAKNQETWLMP 202
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
++ FW W E I + + + + W + W+G E R+ + N+++
Sbjct: 203 EYGFWSWPETKIGSYGEMQMKAILTESEWPWSRKIDKLLWRGATMNLEVRKKFV--NVTE 260
Query: 260 KHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W A + DW E ++ C +++
Sbjct: 261 GKAW-ADVKTLDWHNEGSMKNDLKSMDEHCQYKF 293
>gi|261203203|ref|XP_002628815.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239586600|gb|EEQ69243.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 449
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 50/239 (20%), Positives = 95/239 (39%), Gaps = 36/239 (15%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---STCPDYFRWIHEDLSP 93
PP R P N CP + P +T+ +Y + P + I + +
Sbjct: 47 PPCRPVIPAAGSNNNGGACPGDAPWKFETERDGDNYGLSRAQCQKAFPKLYVEIEKAVVA 106
Query: 94 WKVTGITRDML-ERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYP 150
+ IT D L + +T+ R ++ +Y+ ++ T R ++ + + + P
Sbjct: 107 RRGRNITFDELNSKPLKTSMVRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAIP 166
Query: 151 G--RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS---GDRWTMDIVFPDWSFWG 205
+P++E +F +D + P P++ YS D W + PD+ +W
Sbjct: 167 NPNEIPNIEFIFTTEDF------------HDDPHPVWVYSKREDDGWAW--LMPDFGYWS 212
Query: 206 WAEINIKPWESL---LRELKEGN--NGR-----NWIDREPYAYWKGNPFVA-ETRRDLL 253
W EI + S+ + + EG NG+ + D++ W+G A E R+ L+
Sbjct: 213 WPEIKAGQYRSVRQRIAAIDEGTVINGKAHPPLKFQDKKKQLLWRGAIATAPELRQGLV 271
>gi|34531642|dbj|BAC86191.1| unnamed protein product [Homo sapiens]
Length = 409
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 97 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 156
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 157 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 211
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 212 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 259
>gi|426370348|ref|XP_004052127.1| PREDICTED: KDEL motif-containing protein 2 [Gorilla gorilla
gorilla]
Length = 507
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 315
>gi|239608365|gb|EEQ85352.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327349565|gb|EGE78422.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 449
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 50/239 (20%), Positives = 95/239 (39%), Gaps = 36/239 (15%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---STCPDYFRWIHEDLSP 93
PP R P N CP + P +T+ +Y + P + I + +
Sbjct: 47 PPCRPVIPAAGSNNNGGACPGDAPWKFETERDGDNYGLSRAQCQKAFPKLYVEIEKAVVA 106
Query: 94 WKVTGITRDML-ERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYP 150
+ IT D L + +T+ R ++ +Y+ ++ T R ++ + + + P
Sbjct: 107 RRGRNITFDELNSKPLKTSMVRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAIP 166
Query: 151 G--RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS---GDRWTMDIVFPDWSFWG 205
+P++E +F +D + P P++ YS D W + PD+ +W
Sbjct: 167 NPNEIPNIEFIFTTEDF------------HDDPHPVWVYSKREDDGWAW--LMPDFGYWS 212
Query: 206 WAEINIKPWESL---LRELKEGN--NGR-----NWIDREPYAYWKGNPFVA-ETRRDLL 253
W EI + S+ + + EG NG+ + D++ W+G A E R+ L+
Sbjct: 213 WPEIKAGQYRSVRQRIAAIDEGTVINGKAHPPLKFQDKKKQLLWRGAIATAPELRQGLV 271
>gi|71002250|ref|XP_755806.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|66853444|gb|EAL93768.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|159129863|gb|EDP54977.1| DUF821 domain protein [Aspergillus fumigatus A1163]
Length = 435
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 23/98 (23%), Positives = 45/98 (45%), Gaps = 15/98 (15%)
Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
+R T+ + + ++ +P R LPD+E + DD + P P++ YS
Sbjct: 135 SRAKATLNSLHRAMQSFPDRHNLPDIEFVLTADDFSNV------------PGPVWSYSKR 182
Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
I + PD+ +W W E+ + P++ + R + + G
Sbjct: 183 DEDESIWLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEG 220
>gi|34193987|gb|AAH36526.3| KDEL (Lys-Asp-Glu-Leu) containing 2 [Homo sapiens]
Length = 507
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 39/175 (22%), Positives = 77/175 (44%), Gaps = 10/175 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
+ D+V P + ++ + L + +GN G +WI++ A+++G + E
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRGRDSLEE 321
>gi|114640225|ref|XP_001141284.1| PREDICTED: KDEL motif-containing protein 2 isoform 3 [Pan
troglodytes]
gi|410227956|gb|JAA11197.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410227958|gb|JAA11198.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410227960|gb|JAA11199.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265524|gb|JAA20728.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265526|gb|JAA20729.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265528|gb|JAA20730.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265530|gb|JAA20731.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265532|gb|JAA20732.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410265534|gb|JAA20733.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410304276|gb|JAA30738.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410304278|gb|JAA30739.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410304280|gb|JAA30740.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
gi|410331311|gb|JAA34602.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Pan troglodytes]
Length = 507
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 315
>gi|326913110|ref|XP_003202884.1| PREDICTED: protein O-glucosyltransferase 1-like [Meleagris
gallopavo]
Length = 393
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 85/193 (44%), Gaps = 28/193 (14%)
Query: 62 TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR----DMLERANQTAHFRLIL 117
T Q ++ Y C + +DL+P++ +GI++ D++ R T H+++I
Sbjct: 37 TDQIIRAVEVYKPCVKENCSCHQSVWKQDLAPFR-SGISKETISDVVSRKLGT-HYQII- 93
Query: 118 VNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
NK+Y ++ R G+ + + RLPD+E++ + D P + P
Sbjct: 94 -KNKLY-REHDCMFPAR----CSGVEHFIHEIINRLPDMEMVINVRDYPQV-------PK 140
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ + +L+
Sbjct: 141 WMKPIIPVFSFSKTAEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEKWP 200
Query: 230 WIDREPYAYWKGN 242
W + Y++G+
Sbjct: 201 WKKKISKGYFRGS 213
>gi|153218487|ref|NP_714916.3| KDEL motif-containing protein 2 precursor [Homo sapiens]
gi|110810398|sp|Q7Z4H8.2|KDEL2_HUMAN RecName: Full=KDEL motif-containing protein 2; Flags: Precursor
gi|119587523|gb|EAW67119.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_a [Homo sapiens]
gi|119587524|gb|EAW67120.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_a [Homo sapiens]
Length = 507
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 315
>gi|397516340|ref|XP_003828388.1| PREDICTED: KDEL motif-containing protein 2 isoform 2 [Pan paniscus]
Length = 499
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 145 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 204
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 205 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 259
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 260 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 307
>gi|407921427|gb|EKG14575.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 490
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 73/168 (43%), Gaps = 23/168 (13%)
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
WK+ GI R M+ +L ++N + ++ + R + + I + + YPG +
Sbjct: 113 WKIDGIIRAMIYDR------QLFILNAR---GARRRDYRQRTLAVLQSIQRAITAYPGDI 163
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR--WTMDIVFPDWSFWGWAEINI 211
P++E F DD Y+ + L R D W M PD+ F+ W E
Sbjct: 164 PNIEFSFVVDDGAYFAV--YNNETSATTWALTREPQDDNLWLM----PDFGFYSW-EGPA 216
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVA---ETRRDLLTCN 256
+ +LLR +++ + + ++P A W+G A + R DLL +
Sbjct: 217 GEYNALLRAIEQ--DEMPFEQKDPRAIWRGAKAPAGHVQVRSDLLKVS 262
>gi|109108571|ref|XP_001104429.1| PREDICTED: KDEL motif-containing protein 2-like isoform 2 [Macaca
mulatta]
Length = 507
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILDNHIYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + +L +GN G +WI++ A+++G
Sbjct: 268 SRDVVLPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTERAFFRG 315
>gi|355752611|gb|EHH56731.1| hypothetical protein EGM_06196, partial [Macaca fascicularis]
Length = 440
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 86 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILDNHIYRRSLGKYT 145
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 146 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 200
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 201 SRDVVLPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 248
>gi|327303802|ref|XP_003236593.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
gi|326461935|gb|EGD87388.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
Length = 431
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 69/166 (41%), Gaps = 18/166 (10%)
Query: 98 GITRDMLERAN-QTAHFRLILVNNKVYI--HKYKQSIQTRDVFTIWGILQ-LLRKYPGR- 152
GI+ + L++ + R I+ ++YI K Q + + + + L G
Sbjct: 95 GISNEHLDKIELKDGMARAIIHRGELYIVATKAAQEDHRKKILAVLSSMHHALSATAGNR 154
Query: 153 -LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGW--AE 208
P +E +F +DR D SGP + P++ S I + PD+ FW W A
Sbjct: 155 TQPSIEFVFSIEDR----VDDISGPGH----PIWALSRKASEESIWLIPDFGFWAWDNAA 206
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
I P+ ++ ++ W +E W+G F + RR+LL
Sbjct: 207 NAIGPYNQVVDRIQRQEVTVPWSSKENKLVWRGKLSFAPKLRRNLL 252
>gi|189069279|dbj|BAG36311.1| unnamed protein product [Homo sapiens]
Length = 402
Score = 37.4 bits (85), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 48 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 107
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 108 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 162
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 163 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 210
>gi|37182354|gb|AAQ88979.1| VELF1904 [Homo sapiens]
Length = 451
Score = 37.4 bits (85), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 97 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 156
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 157 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 211
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 212 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 259
>gi|26337215|dbj|BAC32292.1| unnamed protein product [Mus musculus]
gi|74198417|dbj|BAE39692.1| unnamed protein product [Mus musculus]
Length = 317
Score = 37.4 bits (85), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 38/170 (22%), Positives = 71/170 (41%), Gaps = 29/170 (17%)
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLFRYSGDRWTMDIVFP 199
G+ + + RLPD+E++ + D P + P P P+F +S DI++P
Sbjct: 35 GVEHFILEVIHRLPDMEMVINVRDYPQV-------PKWMEPTIPVFSFSKTSEYHDIMYP 87
Query: 200 DWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG-------NPFV 245
W+FW G A + P W+ +L W + AY++G +P +
Sbjct: 88 AWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRTSPERDPLI 147
Query: 246 AETRRD--LLTCNLSDKHDWNARLYVQDWILESKRGFQQSNLASQCAHRY 293
+R++ L+ + W + ++D + K + +L C +RY
Sbjct: 148 LLSRKNPKLVDAEYTKNQAWKS---MKDTL--GKPAAKDVHLIDHCKYRY 192
>gi|355567023|gb|EHH23402.1| hypothetical protein EGK_06865, partial [Macaca mulatta]
Length = 440
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 86 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILDNHIYRRSLGKYT 145
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 146 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 200
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 201 SRDVVLPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 248
>gi|397516338|ref|XP_003828387.1| PREDICTED: KDEL motif-containing protein 2 isoform 1 [Pan paniscus]
Length = 406
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 52 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 111
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 112 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 166
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 167 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 214
>gi|332208116|ref|XP_003253144.1| PREDICTED: KDEL motif-containing protein 2 [Nomascus leucogenys]
Length = 507
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 315
>gi|355697431|gb|AES00668.1| KDEL containing 2 [Mustela putorius furo]
Length = 476
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 76/171 (44%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + ++ Y++S+
Sbjct: 122 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSLGKYT 181
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 182 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRQVNEIP---GPLPIISWCGSLD 236
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 237 SRDIILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 284
>gi|297603834|ref|NP_001054658.2| Os05g0149500 [Oryza sativa Japonica Group]
gi|52353465|gb|AAU44032.1| unknown protein [Oryza sativa Japonica Group]
gi|218196097|gb|EEC78524.1| hypothetical protein OsI_18468 [Oryza sativa Indica Group]
gi|255676022|dbj|BAF16572.2| Os05g0149500 [Oryza sativa Japonica Group]
Length = 195
Score = 37.0 bits (84), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 263 WNARLYVQDWILESKRGFQQSNLASQCAHRY 293
W+A + Q+W E K G+ S L+SQC HRY
Sbjct: 2 WHAEIMRQNWDEEVKSGYHNSKLSSQCTHRY 32
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.136 0.450
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,134,448,927
Number of Sequences: 23463169
Number of extensions: 226728506
Number of successful extensions: 425316
Number of sequences better than 100.0: 467
Number of HSP's better than 100.0 without gapping: 197
Number of HSP's successfully gapped in prelim test: 270
Number of HSP's that attempted gapping in prelim test: 424603
Number of HSP's gapped (non-prelim): 487
length of query: 294
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 153
effective length of database: 9,050,888,538
effective search space: 1384785946314
effective search space used: 1384785946314
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)