BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022102
(302 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359474017|ref|XP_002269577.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
Length = 585
Score = 361 bits (926), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 177/271 (65%), Positives = 205/271 (75%), Gaps = 13/271 (4%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNY 59
I F FS + F V+N R I + + RP I PLNC +N TQTCP NY
Sbjct: 115 FISFFCFSS--RFIFLVNNSERKAVPISENHRKTPRP---IVVPLNCSARNLTQTCPGNY 169
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T T D + P CPDYFRWIHEDL PWK TGI+RDM+ERA ++AHFRL++V
Sbjct: 170 PTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVK 222
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVIRS D+ GPN+
Sbjct: 223 GKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNST 282
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN W++REPYAYW
Sbjct: 283 SPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYW 342
Query: 240 KGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
KGNP VAETRRDLLTCN+SD DWNARL+VQ
Sbjct: 343 KGNPLVAETRRDLLTCNVSDVQDWNARLFVQ 373
>gi|297742541|emb|CBI34690.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 360 bits (923), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 177/271 (65%), Positives = 205/271 (75%), Gaps = 13/271 (4%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNY 59
I F FS + F V+N R I + + RP I PLNC +N TQTCP NY
Sbjct: 27 FISFFCFSS--RFIFLVNNSERKAVPISENHRKTPRP---IVVPLNCSARNLTQTCPGNY 81
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T T D + P CPDYFRWIHEDL PWK TGI+RDM+ERA ++AHFRL++V
Sbjct: 82 PTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVK 134
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVIRS D+ GPN+
Sbjct: 135 GKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNST 194
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN W++REPYAYW
Sbjct: 195 SPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYW 254
Query: 240 KGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
KGNP VAETRRDLLTCN+SD DWNARL+VQ
Sbjct: 255 KGNPLVAETRRDLLTCNVSDVQDWNARLFVQ 285
>gi|147800396|emb|CAN66409.1| hypothetical protein VITISV_020976 [Vitis vinifera]
Length = 439
Score = 348 bits (894), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 162/222 (72%), Positives = 185/222 (83%), Gaps = 7/222 (3%)
Query: 49 KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN 108
KN TQTCP NYP T T D + P CPDYFRWIHEDL PWK TGI+RDM+ERA
Sbjct: 23 KNLTQTCPGNYPTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAK 75
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
++AHFRL++V KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVI
Sbjct: 76 RSAHFRLVIVKGKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVI 135
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
RS D+ GPN+ PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN
Sbjct: 136 RSGDHRGPNSTSPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRT 195
Query: 229 NWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
W++REPYAYWKGNP VAETRRDLLTCN+SD DWNARL+VQ
Sbjct: 196 KWMEREPYAYWKGNPLVAETRRDLLTCNVSDVQDWNARLFVQ 237
>gi|255539447|ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549903|gb|EEF51390.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 528
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 160/277 (57%), Positives = 198/277 (71%), Gaps = 12/277 (4%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQN------RPPKRITTPLNCVK-NQTQ 53
+ F + L S+ + S K +I P N +I PLNC N T+
Sbjct: 45 LASAFLTTRFLDSSSAFTGSSAQKPLITTKSAPTNPTLISKNALNKINIPLNCAAFNLTR 104
Query: 54 TCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHF 113
TCP+NYP T + P S CP+Y+RWI+EDL PW TGI+RDM+ERA TA+F
Sbjct: 105 TCPSNYPTTF-----TENPDRPSVSACPEYYRWIYEDLRPWARTGISRDMVERAKTTANF 159
Query: 114 RLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY 173
RL++VN K Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC D PVI+S +Y
Sbjct: 160 RLVIVNGKAYVEKYRRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIKSSNY 219
Query: 174 SGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDR 233
SGPN PPPLFRY GD T+D+VFPDWSFWGW+EINIKPWE LLRELKEGN R W++R
Sbjct: 220 SGPNAMAPPPLFRYCGDDDTLDVVFPDWSFWGWSEINIKPWERLLRELKEGNEKRRWMER 279
Query: 234 EPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EPYAYWKGNP VAETR+DL+ CN+S++ DWNAR+Y Q
Sbjct: 280 EPYAYWKGNPAVAETRQDLMKCNVSEQQDWNARVYAQ 316
>gi|297808383|ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
lyrata]
gi|297317912|gb|EFH48334.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
lyrata]
Length = 543
Score = 333 bits (854), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 155/251 (61%), Positives = 194/251 (77%), Gaps = 11/251 (4%)
Query: 22 RNKTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTN-YPKTSQTQESISDYSIPPTST 79
R TVI S KP+ L+C N+T +CP+N YP T+ E D + PP +T
Sbjct: 89 RPTTVITQSPKPE--------FTLHCSANETTASCPSNKYPTTASFGED-DDTNHPPNAT 139
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYFRWIHEDL PW TGITR+ LERA +TA+FRL +++ K+Y+ K++ + QTRDVFTI
Sbjct: 140 CPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLAIIDGKIYVEKFQDAFQTRDVFTI 199
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG LQLLRKYPG++PDLELMFDC D PV+++ +++G N PPPLFRY G+ T+DIVFP
Sbjct: 200 WGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGANAPSPPPLFRYCGNEETLDIVFP 259
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
DWSFWGWAE+NIKPWESLL+EL+EGN WI+REPYAYWKGNP VAETR+DL+ CN+S+
Sbjct: 260 DWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPYAYWKGNPMVAETRQDLMKCNVSE 319
Query: 260 KHDWNARLYVQ 270
+H+WNARLYVQ
Sbjct: 320 EHEWNARLYVQ 330
>gi|15237842|ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana]
gi|10176852|dbj|BAB10058.1| unnamed protein product [Arabidopsis thaliana]
gi|48310551|gb|AAT41837.1| At5g23850 [Arabidopsis thaliana]
gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis thaliana]
gi|332005839|gb|AED93222.1| uncharacterized protein [Arabidopsis thaliana]
Length = 542
Score = 332 bits (852), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 156/251 (62%), Positives = 193/251 (76%), Gaps = 12/251 (4%)
Query: 22 RNKTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTN-YPKTSQTQESISDYSIPPTST 79
R TVI S KP+ L+C N+T +CP+N YP T+ ++ D + PPT+T
Sbjct: 89 RPTTVITQSPKPE--------FTLHCSANETTASCPSNKYPTTTSFEDD--DTNHPPTAT 138
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYFRWIHEDL PW TGITR+ LERA +TA FRL +V K+Y+ K++ + QTRDVFTI
Sbjct: 139 CPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFTI 198
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG LQLLRKYPG++PDLELMFDC D PV+R+ +++G N PPPLFRY G+ T+DIVFP
Sbjct: 199 WGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNEETLDIVFP 258
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
DWSFWGWAE+NIKPWESLL+EL+EGN WI+REPYAYWKGNP VAETR+DL+ CN+S+
Sbjct: 259 DWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQDLMKCNVSE 318
Query: 260 KHDWNARLYVQ 270
+H+WNARLY Q
Sbjct: 319 EHEWNARLYAQ 329
>gi|332071136|gb|AED99886.1| glycosyltransferase [Panax notoginseng]
Length = 546
Score = 327 bits (838), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 151/238 (63%), Positives = 187/238 (78%), Gaps = 4/238 (1%)
Query: 34 QNRPPKRITTPLNC-VKNQTQTCPTNY-PKTSQTQESISDYSIPPTSTCPDYFRWIHEDL 91
+ +PP+++ PLNC N +TCP NY P+T Q+ SIPP S CP+YFRWI+EDL
Sbjct: 100 RKKPPRKVEIPLNCSTGNLIRTCPANYYPRTFNIQDQ-DHSSIPPVS-CPEYFRWIYEDL 157
Query: 92 SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPG 151
PW+ TGITR+M+ERA +TA+FRL+++N + Y+ +++S Q+RDVFT+WGILQLLR YPG
Sbjct: 158 RPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDVFTLWGILQLLRMYPG 217
Query: 152 RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
++PDL+LMFDC D PVI SR Y GPN PPPLFRY D T+DIVFPDW+FWGW EINI
Sbjct: 218 KVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDIVFPDWTFWGWPEINI 277
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYV 269
KPW SLL++LKEGN G W+DREPYAYWKGNP VA+TR DLL CN+SDK DWNAR+Y
Sbjct: 278 KPWGSLLKDLKEGNTGTQWMDREPYAYWKGNPIVAKTRMDLLKCNVSDKQDWNARVYA 335
>gi|359489773|ref|XP_002273091.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 519
Score = 326 bits (835), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 154/237 (64%), Positives = 186/237 (78%), Gaps = 7/237 (2%)
Query: 36 RPPKRITTPLNC-VKNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
+ P R+ PLNC N T+TCP NYP K + T S ++ TCPDYFRWIH+DL P
Sbjct: 76 KAPIRLEYPLNCSAGNLTKTCPGNYPTKHNPTNPDHSSTNM----TCPDYFRWIHQDLLP 131
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
WK TGITRDM++RA +TAHFRL++++ K Y+ K+++SIQTRD+FT+WGILQLLR YPGRL
Sbjct: 132 WKQTGITRDMVDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFTLWGILQLLRWYPGRL 191
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
PDLELMFDCDDRPV+R RD+ GP N PPPLFRY GD W++DIVFPDWSFWGWAE NIKP
Sbjct: 192 PDLELMFDCDDRPVVRMRDFRGP-NAAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKP 250
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
W ++L+++KEGN W DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 251 WRNVLKDIKEGNRRTKWKDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 307
>gi|255541542|ref|XP_002511835.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549015|gb|EEF50504.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 522
Score = 323 bits (828), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 148/243 (60%), Positives = 185/243 (76%), Gaps = 8/243 (3%)
Query: 31 RKPQNRPPKRITTPLNCVK--NQTQTCPTNY-PKTSQTQESISDYSIPPTSTCPDYFRWI 87
+K P ++ PL C N TQTCP +Y PKT + S ++ CP YF+WI
Sbjct: 74 QKASPTPQEKAEFPLRCTATPNITQTCPADYYPKTHNSTNSDRQSNV----ICPSYFKWI 129
Query: 88 HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
HEDL PW+ TGITRDM+ERA +TAHFRL++V+ + Y+ KY+QSIQTRD+ T+WGILQLLR
Sbjct: 130 HEDLRPWRETGITRDMIERARRTAHFRLVIVDGRAYVEKYRQSIQTRDMITLWGILQLLR 189
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
YPG++PDLELMFDCDDRPV+RS D+ GP GPPPLFRY D ++DIVFPDWSFWGWA
Sbjct: 190 LYPGKVPDLELMFDCDDRPVVRSEDFPGP-TAGPPPLFRYCADDTSLDIVFPDWSFWGWA 248
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
E+NIKPW+S+L+ + +G+ + W DR PYAYWKGNP+V+ R DL+TCN+SDKHDWNARL
Sbjct: 249 EVNIKPWKSMLKGITKGSKRKKWKDRVPYAYWKGNPYVSANRGDLMTCNVSDKHDWNARL 308
Query: 268 YVQ 270
Y Q
Sbjct: 309 YAQ 311
>gi|255539445|ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549902|gb|EEF51389.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 506
Score = 323 bits (828), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 151/242 (62%), Positives = 183/242 (75%), Gaps = 6/242 (2%)
Query: 30 SRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
S P +P R+ PLNC N T+TCPT+YP TS + S PPT CP+YFRWIH
Sbjct: 58 STVPLEKPDNRLVIPLNCHALNLTRTCPTDYPSTSSQD---PNRSSPPT--CPEYFRWIH 112
Query: 89 EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
EDL PW TGITR+ +ERA TA+FRL+++N Y+ Y++S QTRDVFT+WGILQLLRK
Sbjct: 113 EDLRPWVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQTRDVFTLWGILQLLRK 172
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
YPGR+PDLE+MFDC D PV++S DYSG + PPPLFRY G+ T+DIVFPDWS+WGW E
Sbjct: 173 YPGRVPDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDETLDIVFPDWSYWGWVE 232
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
NIKPWE ++++LKEGN W +REPYAYWKGNP VAETR DL+ CN+S +HDWNARLY
Sbjct: 233 TNIKPWEKIVKDLKEGNQRSKWKEREPYAYWKGNPNVAETRLDLMKCNVSQEHDWNARLY 292
Query: 269 VQ 270
Q
Sbjct: 293 TQ 294
>gi|255537419|ref|XP_002509776.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549675|gb|EEF51163.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 534
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 163/263 (61%), Positives = 188/263 (71%), Gaps = 24/263 (9%)
Query: 11 LQSNFSVHNISRNKTVIIVSRKPQNRPPKR-ITTPLNCV--KNQTQTCPTNYPKTSQTQE 67
LQS SV S N TV+I + N P + I PLNC NQTQTCPTNYPK + ++
Sbjct: 73 LQSKVSVER-STNTTVVISEK--HNIPQRETIEFPLNCTINNNQTQTCPTNYPKLTTSRA 129
Query: 68 SISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKY 127
D PP + CP+YFRWIHEDL PW GI+RDM+ERA +TAHF LI+V K YI KY
Sbjct: 130 DDQD---PPRTVCPNYFRWIHEDLRPWIAAGISRDMVERAQRTAHFHLIIVGGKAYIKKY 186
Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY 187
++S QTRD FTIWGILQLLR+YPG++PDLELMFD DDRPVIRS DY N GPPPLFRY
Sbjct: 187 RESTQTRDTFTIWGILQLLRRYPGKIPDLELMFDTDDRPVIRSSDYHEQNTTGPPPLFRY 246
Query: 188 SGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
GDR PD INIKPW+ L ++KEGNNG WIDREPYAYWKGNPFVAE
Sbjct: 247 CGDR-------PD--------INIKPWDELSIDIKEGNNGSKWIDREPYAYWKGNPFVAE 291
Query: 248 TRRDLLTCNLSDKHDWNARLYVQ 270
TR+DLL CN+SD+ DWNARL++Q
Sbjct: 292 TRKDLLACNVSDQRDWNARLFIQ 314
>gi|225470185|ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
gi|302143884|emb|CBI22745.3| unnamed protein product [Vitis vinifera]
Length = 525
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 148/228 (64%), Positives = 176/228 (77%), Gaps = 5/228 (2%)
Query: 44 PLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
PLNC N T+TCP NYP T S D P CP YFRWI+ DL PW +GITR+
Sbjct: 92 PLNCSAGNLTRTCPRNYP----TAFSPEDPDRPSPPECPHYFRWIYGDLRPWMKSGITRE 147
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ERA +TA F+L+++N + Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC
Sbjct: 148 MVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDC 207
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PVI+S +Y GPN PPPLFRY GD T+DIVFPDWSFWGW EINIKPWESLL++LK
Sbjct: 208 VDWPVIQSNEYRGPNATAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWESLLKDLK 267
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+Y Q
Sbjct: 268 EGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARVYTQ 315
>gi|15229061|ref|NP_190467.1| uncharacterized protein [Arabidopsis thaliana]
gi|6522568|emb|CAB62012.1| putative protein [Arabidopsis thaliana]
gi|332644958|gb|AEE78479.1| uncharacterized protein [Arabidopsis thaliana]
Length = 539
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 145/237 (61%), Positives = 182/237 (76%), Gaps = 6/237 (2%)
Query: 38 PKRITTPLNCVK---NQTQTCPT-NYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
PK T LNC N T TCP NYP + ++ + P++TCPDYFRWIHEDL P
Sbjct: 92 PKEFT--LNCAAFSGNDTGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDLRP 149
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ TGITR+ LERAN TA FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG++
Sbjct: 150 WEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKI 209
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
PDLELMFDC D PV+++ +++G + PPPLFRY + T+DIVFPDWS+WGWAE+NIKP
Sbjct: 210 PDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKP 269
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
WESLL+EL+EGN WIDREPYAYWKGNP VAETR DL+ CNLS+ +DW ARLY Q
Sbjct: 270 WESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQ 326
>gi|297816106|ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
lyrata]
gi|297321774|gb|EFH52195.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
lyrata]
Length = 539
Score = 317 bits (811), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 149/239 (62%), Positives = 186/239 (77%), Gaps = 10/239 (4%)
Query: 38 PKRITTPLNCVK---NQTQTCP-TNYPKT--SQTQESISDYSIPPTSTCPDYFRWIHEDL 91
PK T LNC N T TCP NYP + S E SD S+ ++TCPDYFRWIHEDL
Sbjct: 92 PKEFT--LNCAGFAGNDTVTCPKNNYPTSFRSSVGEGESDRSL--SATCPDYFRWIHEDL 147
Query: 92 SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPG 151
PW+ TGITR+ LERAN TA+FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG
Sbjct: 148 RPWEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPG 207
Query: 152 RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
++PDLELMFDC D PV+++ +++G + PPPLFRY + T+DIVFPDWS+WGWAE+NI
Sbjct: 208 KIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNI 267
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
KPWESLL+EL+EGN WIDREPYAYWKGNP VAETR DL+ CNLS+++DW ARLY Q
Sbjct: 268 KPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEEYDWKARLYKQ 326
>gi|356529208|ref|XP_003533188.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 469
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 152/257 (59%), Positives = 187/257 (72%), Gaps = 6/257 (2%)
Query: 15 FSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYS 73
FS S T + +R + K I PLNC N T TCPTN Q+ +D S
Sbjct: 6 FSGSTKSALSTTLYDTRPFRVSIQKPIEIPLNCSAYNLTGTCPTN-----QSPIPENDQS 60
Query: 74 IPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
P ++TCP+YFRWIHEDL PW TGIT++M+ERA QTA+F+L+++ K Y+ Y+++ QT
Sbjct: 61 RPSSATCPEYFRWIHEDLRPWARTGITQEMVERAKQTANFKLVILKGKAYLETYEKAYQT 120
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
RDVF+IWGILQLLR+YPG++PDLELMFDC D PV+ Y+GPN + PPPLFRY G+ T
Sbjct: 121 RDVFSIWGILQLLRRYPGKIPDLELMFDCVDWPVLLVDRYNGPNTEQPPPLFRYCGNDAT 180
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+DIVFPDWSFWGWAE+NIKPWE LL ELKEG W++REPYAYWKGNP VAETR DL+
Sbjct: 181 LDIVFPDWSFWGWAEVNIKPWEILLGELKEGTKRIPWLNREPYAYWKGNPVVAETRLDLM 240
Query: 254 TCNLSDKHDWNARLYVQ 270
CN+S+ DWNARLY Q
Sbjct: 241 KCNVSENQDWNARLYAQ 257
>gi|356561622|ref|XP_003549080.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 525
Score = 313 bits (803), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 154/276 (55%), Positives = 191/276 (69%), Gaps = 17/276 (6%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPP-----KRITTPLNCVK-NQTQT 54
+I FA++ L ++ S +K+ S P P ++ PLNC N T T
Sbjct: 52 IIGAFAYTRTLDTHRMFSGASSSKSA--QSTTPYGTSPFTVSIRKPIAPLNCTAYNLTGT 109
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
CPTN + P T+TCPDYFRWIHEDL PW TGIT+DM+ERA QTA+FR
Sbjct: 110 CPTNLQDHQNS---------PATATCPDYFRWIHEDLRPWARTGITQDMVERAKQTANFR 160
Query: 115 LILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
LI++ + Y+ Y + QTRDVF+IWGILQLLR+YPG++PDLELMFDC+D PV+ + Y+
Sbjct: 161 LIILKGRAYLETYSRPYQTRDVFSIWGILQLLRRYPGKIPDLELMFDCEDWPVVLADRYN 220
Query: 175 GPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
GPN + PPPLFRY G+ T+DIVFPDWSFWGWAEINIKPW LL ELKEG W++RE
Sbjct: 221 GPNVEQPPPLFRYCGNDATLDIVFPDWSFWGWAEINIKPWHILLGELKEGTTRIPWLNRE 280
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
PYAYWKGNP VAETR+DL+ CN+S+ DWNARL+ Q
Sbjct: 281 PYAYWKGNPAVAETRQDLIKCNVSENQDWNARLFAQ 316
>gi|449452346|ref|XP_004143920.1| PREDICTED: O-glucosyltransferase rumi-like [Cucumis sativus]
Length = 514
Score = 313 bits (802), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 149/237 (62%), Positives = 182/237 (76%), Gaps = 16/237 (6%)
Query: 44 PLNC----VKNQTQ--TCPTNYPKTSQTQESISDYSIPPT--STCPDYFRWIHEDLSPWK 95
PLNC NQTQ TC +YP + + SI P+ S CP+YFRWIHEDL PW
Sbjct: 76 PLNCSSSSTTNQTQHFTCRKDYPTLYEPE------SIGPSGRSVCPEYFRWIHEDLKPWA 129
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
GITR+M+E+ TAHFRL +V VY+ YK+SIQTRD+FTIWGILQLLR+YPG++PD
Sbjct: 130 AGGITREMVEKGKATAHFRLAVVRGIVYVEHYKKSIQTRDLFTIWGILQLLRRYPGQIPD 189
Query: 156 LELMFDCDDRPVIRSRDY--SGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
LELMFDCDDRPV++S DY +G + PP+FRY GD T+DIVFPDWSFWGWAEINI+P
Sbjct: 190 LELMFDCDDRPVVKSADYRNAGVDTVEAPPVFRYCGDEETLDIVFPDWSFWGWAEINIRP 249
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
WE+LL+ELK+GN R W+ RE +AYWKGNP+VA+TR+DLL CNLS ++DWNARLY+Q
Sbjct: 250 WENLLKELKKGNEKRKWMKREAFAYWKGNPYVADTRQDLLKCNLSLQNDWNARLYIQ 306
>gi|110743626|dbj|BAE99650.1| hypothetical protein [Arabidopsis thaliana]
Length = 433
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 138/220 (62%), Positives = 175/220 (79%), Gaps = 1/220 (0%)
Query: 52 TQTCPT-NYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
T TCP NYP + ++ + P++TCPDYFRWIHEDL PW+ TGITR+ LERAN T
Sbjct: 1 TGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDLRPWEKTGITREALERANAT 60
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
A FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG++PDLELMFDC D PV+++
Sbjct: 61 AIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKA 120
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+++G + PPPLFRY + T+DIVFPDWS+WGWAE+NIKPWESLL+EL+EGN W
Sbjct: 121 AEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKPWESLLKELREGNQRTKW 180
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
IDREPYAYWKGNP VAETR DL+ CNLS+ +DW ARLY Q
Sbjct: 181 IDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQ 220
>gi|356522636|ref|XP_003529952.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 510
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 146/246 (59%), Positives = 181/246 (73%), Gaps = 16/246 (6%)
Query: 27 IIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
II + KPQ + PL+C +N TQTC +YP PT TCP YFRW
Sbjct: 67 IIPTTKPQEQ-----EFPLSCTQNVTQTCSRDYPTIHTPTN--------PTRTCPSYFRW 113
Query: 87 IHEDLSPWKVT--GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQ 144
IHEDL PW+ GITR+MLE A +TAHFRL++V+ K+Y+ KYK++IQTRDVFT+WGILQ
Sbjct: 114 IHEDLWPWRERDRGITREMLEGARRTAHFRLVIVDGKLYVEKYKKAIQTRDVFTLWGILQ 173
Query: 145 LLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
LLR YPG++PDLEL+FDCDDRPV+ + GPN PP LFRY D+W++DIVFPDWSFW
Sbjct: 174 LLRMYPGKVPDLELLFDCDDRPVVSKERFKGPNAPTPP-LFRYCSDQWSLDIVFPDWSFW 232
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
GWAEINIKPW+ +L+E+KEGN W DR PYAYWKGNP V+ TR+DL+ CN+++K DWN
Sbjct: 233 GWAEINIKPWKHVLKEIKEGNEKTKWKDRVPYAYWKGNPLVSPTRKDLMKCNVTEKDDWN 292
Query: 265 ARLYVQ 270
LY+Q
Sbjct: 293 THLYIQ 298
>gi|356497228|ref|XP_003517464.1| PREDICTED: KDEL motif-containing protein 1-like [Glycine max]
Length = 522
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 142/231 (61%), Positives = 174/231 (75%), Gaps = 13/231 (5%)
Query: 40 RITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGI 99
++ LNC N Q C T+YP + + CP+YFRWIHED+ WK GI
Sbjct: 89 QLEATLNCSNNGKQRC-TSYPTSGV-------FEREEGGVCPEYFRWIHEDVGAWKERGI 140
Query: 100 TRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
+R+M+ERA ++AHFRL++ +VY+ +YK+SIQTR+VFT+WGI+QLLRKYPG++ DLELM
Sbjct: 141 SREMVERAKKSAHFRLVVKRGRVYVERYKKSIQTREVFTMWGIVQLLRKYPGKVADLELM 200
Query: 160 FDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLR 219
FDCDD PVIR G + GPPPLFRY GDRWT DIVFPDWSFWGWAEINI+PWE +L+
Sbjct: 201 FDCDDLPVIR-----GSSLAGPPPLFRYCGDRWTDDIVFPDWSFWGWAEINIRPWEHVLK 255
Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
E+++GN W DREPYAYWKGNPFVAETR+DLL CN+S DWNARLYVQ
Sbjct: 256 EMEKGNRRIKWNDREPYAYWKGNPFVAETRQDLLKCNVSTTQDWNARLYVQ 306
>gi|357497409|ref|XP_003618993.1| KTEL motif-containing protein [Medicago truncatula]
gi|355494008|gb|AES75211.1| KTEL motif-containing protein [Medicago truncatula]
Length = 515
Score = 307 bits (786), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 141/230 (61%), Positives = 179/230 (77%), Gaps = 12/230 (5%)
Query: 44 PLNCV--KNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGIT 100
PL C + +TQTCP +YP K + T ++ + TCP +FRWIHEDL PWK GIT
Sbjct: 83 PLRCTNGEKETQTCPRDYPTKHNPTNQN--------SHTCPSFFRWIHEDLKPWKEKGIT 134
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMF 160
R+MLE A +TA+F++++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR +PG+LPDLELMF
Sbjct: 135 REMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRMFPGKLPDLELMF 194
Query: 161 DCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
DC+DRPVI ++ GP N PPPLFRY D+W++DIVFPDWSFWGWAE NIKPW+++L+E
Sbjct: 195 DCEDRPVIHKGNFQGP-NASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWKNILKE 253
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
+KEGN W DR PYAYWKGNP VA TR++LL CN + K DWN RLY+Q
Sbjct: 254 IKEGNKETKWKDRVPYAYWKGNPNVAATRKNLLRCNATSKDDWNTRLYIQ 303
>gi|297824639|ref|XP_002880202.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
lyrata]
gi|297326041|gb|EFH56461.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 141/232 (60%), Positives = 173/232 (74%), Gaps = 2/232 (0%)
Query: 39 KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
+R + VKNQTQ P + + ++ S +S P STCP YFRWIHEDL PWK TG
Sbjct: 75 QRFPNQCDVVKNQTQLFPEDGSSRNNNKKPRSSHSRP--STCPSYFRWIHEDLRPWKETG 132
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
ITR MLE+A +TAHFR+++++ +VY+ KY+ SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 133 ITRGMLEKARRTAHFRVVILDGRVYVKKYRNSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 192
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
MFD DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW L
Sbjct: 193 MFDPDDRPTVRSKDFQGHQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWAKSL 252
Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
++EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 253 VAIEEGNKMTQWTDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 304
>gi|449446159|ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 538
Score = 300 bits (768), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 144/248 (58%), Positives = 178/248 (71%), Gaps = 10/248 (4%)
Query: 29 VSRKPQNRPPK-RITTPLNCVKNQTQT---CPTNYPKTSQTQESISDYSIPPTST--CPD 82
V P ++P + ++ L+C T CP +YP T E + PP+S+ CPD
Sbjct: 83 VPHNPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQN----PPSSSSACPD 138
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFRWIHEDL PW TGITR LE +TA+FRL+++N K Y+ YK+S QTRD FT+WGI
Sbjct: 139 YFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGI 198
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
LQLLR+YPG++PDL+LMFDC D PVI + +SGPN PPPLFRY GD T DIVFPDWS
Sbjct: 199 LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWS 258
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGW EINIKPWE LL+++KEGN W REPYAYWKGNP VA+TR+DL+ CN+SD+ D
Sbjct: 259 FWGWPEINIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEVADTRKDLIKCNVSDQQD 318
Query: 263 WNARLYVQ 270
WNAR++ Q
Sbjct: 319 WNARVFAQ 326
>gi|224127624|ref|XP_002320120.1| predicted protein [Populus trichocarpa]
gi|222860893|gb|EEE98435.1| predicted protein [Populus trichocarpa]
Length = 411
Score = 300 bits (767), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 135/194 (69%), Positives = 165/194 (85%), Gaps = 1/194 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
TSTCP YF+WIH+DL WK TGIT+DM+ERA +TAHFRL++VN K Y+ KY+QSIQTRD+
Sbjct: 9 TSTCPSYFQWIHDDLRHWKETGITQDMIERARKTAHFRLVIVNGKAYVEKYRQSIQTRDM 68
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FT+WGILQLLR YPGRLPDLELMFDCDDRPVI S+ + GP N PPPLFRY D ++DI
Sbjct: 69 FTLWGILQLLRLYPGRLPDLELMFDCDDRPVIPSKHFRGP-NAAPPPLFRYCSDWQSLDI 127
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGWAE NI+PW++LL+E+KEGN+ W DR PYAYW+GNP+V+ R+DLL CN
Sbjct: 128 VFPDWSFWGWAETNIRPWKNLLKEIKEGNSRTKWKDRTPYAYWRGNPWVSPIRQDLLKCN 187
Query: 257 LSDKHDWNARLYVQ 270
+S+++DWN RLY+Q
Sbjct: 188 VSEQNDWNTRLYLQ 201
>gi|42569947|ref|NP_182107.3| downstream target of AGL15 2 [Arabidopsis thaliana]
gi|330255513|gb|AEC10607.1| downstream target of AGL15 2 [Arabidopsis thaliana]
Length = 523
Score = 299 bits (766), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 139/232 (59%), Positives = 171/232 (73%), Gaps = 3/232 (1%)
Query: 39 KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
+R V+NQTQ P N + + S I STCP YFRWIHEDL PWK TG
Sbjct: 81 QRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRI---STCPSYFRWIHEDLRPWKETG 137
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 138 VTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 197
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
MFD DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW+ L
Sbjct: 198 MFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSL 257
Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
++EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 258 VAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 309
>gi|449485616|ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 538
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 143/248 (57%), Positives = 178/248 (71%), Gaps = 10/248 (4%)
Query: 29 VSRKPQNRPPK-RITTPLNCVKNQTQT---CPTNYPKTSQTQESISDYSIPPTST--CPD 82
V P ++P + ++ L+C T CP +YP T E + PP+S+ CPD
Sbjct: 83 VPHNPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQN----PPSSSSACPD 138
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFRWIHEDL PW TGITR LE +TA+FRL+++N K Y+ YK+S QTRD FT+WGI
Sbjct: 139 YFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGI 198
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
LQLLR+YPG++PDL+LMFDC D PVI + +SGPN PPPLFRY GD T DIVFPDWS
Sbjct: 199 LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWS 258
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGW EINIKPWE LL+++KEGN W R+PYAYWKGNP VA+TR+DL+ CN+SD+ D
Sbjct: 259 FWGWPEINIKPWEPLLKDIKEGNKRIPWKSRQPYAYWKGNPEVADTRKDLIKCNVSDQQD 318
Query: 263 WNARLYVQ 270
WNAR++ Q
Sbjct: 319 WNARVFAQ 326
>gi|3386611|gb|AAC28541.1| unknown protein [Arabidopsis thaliana]
Length = 517
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 139/232 (59%), Positives = 171/232 (73%), Gaps = 3/232 (1%)
Query: 39 KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
+R V+NQTQ P N + + S I STCP YFRWIHEDL PWK TG
Sbjct: 75 QRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRI---STCPSYFRWIHEDLRPWKETG 131
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 132 VTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 191
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
MFD DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW+ L
Sbjct: 192 MFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSL 251
Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
++EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 252 VAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 303
>gi|224074911|ref|XP_002304487.1| predicted protein [Populus trichocarpa]
gi|222841919|gb|EEE79466.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 133/168 (79%), Positives = 148/168 (88%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
MLERA TAHFRLI+V K Y+ KYK+SIQTRD FTIWGILQLLR+YPG++PDLELMFDC
Sbjct: 1 MLERAKTTAHFRLIIVKGKAYLEKYKKSIQTRDAFTIWGILQLLRRYPGKIPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DD PVI+S DY GPN GPPPLFRY GD+WT DIVFPDWSFWGWAEINIKPW+ LL +LK
Sbjct: 61 DDLPVIQSSDYRGPNKTGPPPLFRYCGDKWTEDIVFPDWSFWGWAEINIKPWDKLLIDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EGNN WIDREPYAYWKGNPFVAETR+DLLTCN+SD+ DWNARL++Q
Sbjct: 121 EGNNRSRWIDREPYAYWKGNPFVAETRKDLLTCNVSDQQDWNARLFIQ 168
>gi|357497413|ref|XP_003618995.1| KTEL motif-containing protein [Medicago truncatula]
gi|355494010|gb|AES75213.1| KTEL motif-containing protein [Medicago truncatula]
Length = 303
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 131/228 (57%), Positives = 178/228 (78%), Gaps = 10/228 (4%)
Query: 45 LNCVKNQT--QTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
L C++ ++ Q CP + P T+ +++++ CP YFRWIHEDL PW+ GITR+
Sbjct: 62 LKCIQGKSFKQKCPRDNP-TTHNPTNLNNH------VCPSYFRWIHEDLKPWREKGITRN 114
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML++A +TAHF+L++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR YPG+LPDLELMFDC
Sbjct: 115 MLKKAKKTAHFKLVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRLYPGKLPDLELMFDC 174
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
+D+PV+ + GP N PPPLFRY D+W++DIVFPDWSFWGWAE NIKPW ++L+++K
Sbjct: 175 EDKPVVPLDKFQGP-NASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWNNILKDIK 233
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EGN W DR PYAYWKGNP+VA TR++LL CN++ ++DWN +Y+Q
Sbjct: 234 EGNKKTKWKDRVPYAYWKGNPYVAPTRQNLLQCNVTLENDWNTLIYIQ 281
>gi|357481655|ref|XP_003611113.1| KTEL motif-containing protein [Medicago truncatula]
gi|355512448|gb|AES94071.1| KTEL motif-containing protein [Medicago truncatula]
Length = 502
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 139/231 (60%), Positives = 167/231 (72%), Gaps = 23/231 (9%)
Query: 43 TPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
T LNC +KNQT + +NYP T CP+YF+WIHEDL PWK GITR
Sbjct: 82 TVLNCTIKNQTCSS-SNYPTTKNNM------------VCPEYFKWIHEDLKPWKKKGITR 128
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYK--QSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
+M+E+A +TAHFRL++ N K Y+ KYK ++IQTRDVFT+WGILQLLRKYPG++PDLELM
Sbjct: 129 EMVEKAKKTAHFRLVVKNGKGYLEKYKNKEAIQTRDVFTVWGILQLLRKYPGKIPDLELM 188
Query: 160 FDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLR 219
FDC+D+PV+ P PPP+F Y DRWT DIVFPDWSFWGWAEINIKPWE LL+
Sbjct: 189 FDCNDKPVV-------PIGLDPPPVFGYCADRWTQDIVFPDWSFWGWAEINIKPWEHLLK 241
Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
++K+GN W DREPYAYWKGNP+ A TR D L CN+S DWN RL+ Q
Sbjct: 242 DIKKGNKRVKWKDREPYAYWKGNPYTAATRLDFLNCNVSTAQDWNLRLFTQ 292
>gi|110736095|dbj|BAF00020.1| hypothetical protein [Arabidopsis thaliana]
Length = 418
Score = 290 bits (742), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 129/193 (66%), Positives = 157/193 (81%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
STCP YFRWIHEDL PWK TG+TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVF
Sbjct: 12 STCPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVF 71
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
T+WGI+QLLR YPGRLPDLELMFD DDRP +RS+D+ G + PPPLFRY D ++DIV
Sbjct: 72 TLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIV 131
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGWAE+NIKPW+ L ++EGN W DR YAYW+GNP VA TRRDLL CN+
Sbjct: 132 FPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNV 191
Query: 258 SDKHDWNARLYVQ 270
S + DWN RLY+Q
Sbjct: 192 SAQEDWNTRLYIQ 204
>gi|449446167|ref|XP_004140843.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 442
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 125/196 (63%), Positives = 152/196 (77%)
Query: 75 PPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
P +TCP+YFRWIHEDL PW GIT+ MLE A + AHFR+++V K Y+ Y ++ Q+R
Sbjct: 16 PMVATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSR 75
Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
D T+WG++QLLR+YPG+LPDL+LMF CDDRP I +DYSG PPPLFRYSGD T
Sbjct: 76 DNLTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATW 135
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW EINIK WES+L+++KEGN W+ R+PYAYWKGNP VA TRRDLL
Sbjct: 136 DIVFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLK 195
Query: 255 CNLSDKHDWNARLYVQ 270
CN++ K DW+ARLY Q
Sbjct: 196 CNVTQKQDWSARLYRQ 211
>gi|449526435|ref|XP_004170219.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 426
Score = 290 bits (741), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 124/194 (63%), Positives = 151/194 (77%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
+TCP+YFRWIHEDL PW GIT+ MLE A + AHFR+++V K Y+ Y ++ Q+RD
Sbjct: 2 VATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSRDN 61
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
T+WG++QLLR+YPG+LPDL+LMF CDDRP I +DYSG PPPLFRYSGD T DI
Sbjct: 62 LTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATWDI 121
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EINIK WES+L+++KEGN W+ R+PYAYWKGNP VA TRRDLL CN
Sbjct: 122 VFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLKCN 181
Query: 257 LSDKHDWNARLYVQ 270
++ K DW+ARLY Q
Sbjct: 182 VTQKQDWSARLYRQ 195
>gi|297840159|ref|XP_002887961.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
lyrata]
gi|297333802|gb|EFH64220.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
lyrata]
Length = 571
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 143/249 (57%), Positives = 181/249 (72%), Gaps = 16/249 (6%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCP-TNYPKTSQTQESISDYSIPPTSTCPDYFR 85
R+P+ + + L+C +N + +C T P +Q Q +CPDYF+
Sbjct: 119 REPEKKSQQNTGISLDCTSFLNQNGSGSCSRTPQPNNNQNQTE-------SNRSCPDYFK 171
Query: 86 WIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQL 145
WIHEDL PW+ TGIT++M+ER TAHFRL++VN KV++ YK+SIQTRD FT+WGILQL
Sbjct: 172 WIHEDLKPWRETGITKEMVERGKTTAHFRLVIVNGKVFVENYKKSIQTRDAFTLWGILQL 231
Query: 146 LRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWS 202
LRKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWS
Sbjct: 232 LRKYPGKLPDVDLMFDCDDRPVIRSDGYNRSNRTAEDVPPPLFRYCGDRWTVDIVFPDWS 291
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKH 261
FWGW EINIK W +L+E++EG + +++RE YAYWKGNPFVA +R DLLTCNLS H
Sbjct: 292 FWGWQEINIKEWSKVLKEMEEGKKKKKFMEREAYAYWKGNPFVASPSREDLLTCNLSSLH 351
Query: 262 DWNARLYVQ 270
DWNAR+++Q
Sbjct: 352 DWNARIFIQ 360
>gi|147798877|emb|CAN65871.1| hypothetical protein VITISV_012499 [Vitis vinifera]
Length = 424
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 134/231 (58%), Positives = 166/231 (71%), Gaps = 8/231 (3%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
KR +NC + N T+TCP T T S+ S T TCP+YFRWI+EDL PW T
Sbjct: 80 KRFEYSMNCSEGNMTKTCPV----TLLTTFEPSNLS---TGTCPEYFRWIYEDLKPWTET 132
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GITRDM+ERA AH R+++V+ KVY KYK QTRDVFTIWGILQ+LR YPG+LPD +
Sbjct: 133 GITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFTIWGILQVLRMYPGKLPDFD 192
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
LMF+C D+PVI+ DY GPN PPPLF Y GD T+DIVFPDWSFWGW EI I+PW +L
Sbjct: 193 LMFECGDKPVIKKHDYQGPNATAPPPLFHYCGDDETLDIVFPDWSFWGWPEIRIRPWSTL 252
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
++L+EGNN W+BREPYAYWKGN + TR +L C+ +++ DWNAR+Y
Sbjct: 253 RKDLREGNNKTKWVBREPYAYWKGNFKMGVTRHELSKCSKTNEQDWNARIY 303
>gi|6633846|gb|AAF19705.1|AC008047_12 F2K11.20 [Arabidopsis thaliana]
Length = 605
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 138/249 (55%), Positives = 182/249 (73%), Gaps = 14/249 (5%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
R + + P+ + ++C +N++ +C ++ S+ S CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
IHEDL PW+ TGIT++M+ER TAHFRL+++N KV++ YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
RKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296
Query: 204 WGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHD 262
WGW EINI+ W +L+E++EG + +++R+ YAYWKGNPFVA +R DLLTCNLS HD
Sbjct: 297 WGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHD 356
Query: 263 WNARLYVQV 271
WNAR+++QV
Sbjct: 357 WNARIFIQV 365
>gi|15233070|ref|NP_191686.1| uncharacterized protein [Arabidopsis thaliana]
gi|6850894|emb|CAB71057.1| putative protein [Arabidopsis thaliana]
gi|27754477|gb|AAO22686.1| unknown protein [Arabidopsis thaliana]
gi|28393951|gb|AAO42383.1| unknown protein [Arabidopsis thaliana]
gi|332646660|gb|AEE80181.1| uncharacterized protein [Arabidopsis thaliana]
Length = 498
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 135/229 (58%), Positives = 166/229 (72%), Gaps = 7/229 (3%)
Query: 44 PLNC--VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
P C V+NQ+ P + + S+ + S S STCP YFRWIHEDL PWK TGITR
Sbjct: 63 PYGCDFVQNQSSQTPISQNRKSRLNPNNSSKS----STCPSYFRWIHEDLRPWKQTGITR 118
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
M+E A++TAHFRL++ N K Y+ +YK+SIQTRD FT+WGILQLLR YPG+LPDLELMFD
Sbjct: 119 GMIEEASRTAHFRLVIRNGKAYVKRYKKSIQTRDEFTLWGILQLLRWYPGKLPDLELMFD 178
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
DDRPV+RS D+ G K PPP+FRY D ++DIVFPDWSFWGWAE+N+KPW L +
Sbjct: 179 ADDRPVVRSVDFIG-QQKEPPPVFRYCSDDASLDIVFPDWSFWGWAEVNVKPWGKSLEAI 237
Query: 222 KEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
KEGN+ W DR YAYW+GNP+V R DLL CN ++ +WN RLY+Q
Sbjct: 238 KEGNSMTQWKDRVAYAYWRGNPYVDPGRGDLLKCNATEHEEWNTRLYIQ 286
>gi|356506263|ref|XP_003521906.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
Length = 476
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 137/266 (51%), Positives = 178/266 (66%), Gaps = 17/266 (6%)
Query: 6 AFSIILQSNFS-VHNISRNKTVIIVSRK-PQNRPPKRITTPLNCVK-NQTQTCPTNYPKT 62
A + +L+ N S + + KT+II +++ PQ PLNC N +TC + YP T
Sbjct: 17 ASTFLLEINLSRITRTTIFKTIIIFNKQQPQ--------FPLNCTDGNLAKTCSSYYPTT 68
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+ S T++CP+YF+WIHEDL PWK GITRDM+ER +HFRL++VN K
Sbjct: 69 FDLDDDSS------TTSCPNYFKWIHEDLKPWKSKGITRDMVERGKNVSHFRLVIVNGKA 122
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y+ KY + QTRDVFTIWGILQLLR YPG++PDL+LMF C D+PV+ +D+ GP PP
Sbjct: 123 YVEKYDKVYQTRDVFTIWGILQLLRLYPGKIPDLDLMFQCGDKPVVLKKDFQGPQAMSPP 182
Query: 183 PLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN 242
P+F Y GD DIVFPDWSFWGW EINI PWE+ L ++ EGN W DR PYA+WKGN
Sbjct: 183 PVFHYCGDENAHDIVFPDWSFWGWPEINIGPWETTLHKILEGNKMIKWKDRTPYAFWKGN 242
Query: 243 PFVAETRRDLLTCNLSDKHDWNARLY 268
+A+ RR+L CN + +HDWNAR++
Sbjct: 243 LAMADIRRELGKCNPTKEHDWNARIH 268
>gi|15222413|ref|NP_176531.1| uncharacterized protein [Arabidopsis thaliana]
gi|332195976|gb|AEE34097.1| uncharacterized protein [Arabidopsis thaliana]
Length = 578
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 137/248 (55%), Positives = 181/248 (72%), Gaps = 14/248 (5%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
R + + P+ + ++C +N++ +C ++ S+ S CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
IHEDL PW+ TGIT++M+ER TAHFRL+++N KV++ YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
RKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296
Query: 204 WGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHD 262
WGW EINI+ W +L+E++EG + +++R+ YAYWKGNPFVA +R DLLTCNLS HD
Sbjct: 297 WGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHD 356
Query: 263 WNARLYVQ 270
WNAR+++Q
Sbjct: 357 WNARIFIQ 364
>gi|359489784|ref|XP_002273475.2| PREDICTED: O-glucosyltransferase rumi [Vitis vinifera]
Length = 604
Score = 283 bits (725), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 137/240 (57%), Positives = 166/240 (69%), Gaps = 11/240 (4%)
Query: 30 SRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
S KP N K+I LNC + N QTCP T + E P T TCP+YFRWI+
Sbjct: 163 SIKPCN---KKIEYSLNCSEGNMAQTCPVTPLATFEPSE-------PSTETCPEYFRWIY 212
Query: 89 EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
EDL PW+ TGITRDM+ERA A+ R+++V+ KVY+ KYK QTRDVFTIWGILQLLR
Sbjct: 213 EDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVYMEKYKGVYQTRDVFTIWGILQLLRM 272
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
YPG+LPD +LMF C D+ +++R + GPN PPPLF Y GD T DIVFPDWSFWGW E
Sbjct: 273 YPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPPLFHYCGDDETYDIVFPDWSFWGWPE 332
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
I+IK W +L ++LKEGNN WIDREPYAYWKGN + R +L CN + + DWNAR+Y
Sbjct: 333 IHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNINLGLARHELSKCNKTSEQDWNARIY 392
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 45/60 (75%)
Query: 106 RANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
++ AH R+++V+ KV++ KY+ QTR V TIW ILQLLR YPG+LPDL+L+F+C +
Sbjct: 12 KSGNPAHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 71
>gi|297745255|emb|CBI40335.3| unnamed protein product [Vitis vinifera]
Length = 578
Score = 283 bits (724), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 137/240 (57%), Positives = 166/240 (69%), Gaps = 11/240 (4%)
Query: 30 SRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
S KP N K+I LNC + N QTCP T + E P T TCP+YFRWI+
Sbjct: 137 SIKPCN---KKIEYSLNCSEGNMAQTCPVTPLATFEPSE-------PSTETCPEYFRWIY 186
Query: 89 EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
EDL PW+ TGITRDM+ERA A+ R+++V+ KVY+ KYK QTRDVFTIWGILQLLR
Sbjct: 187 EDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVYMEKYKGVYQTRDVFTIWGILQLLRM 246
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
YPG+LPD +LMF C D+ +++R + GPN PPPLF Y GD T DIVFPDWSFWGW E
Sbjct: 247 YPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPPLFHYCGDDETYDIVFPDWSFWGWPE 306
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
I+IK W +L ++LKEGNN WIDREPYAYWKGN + R +L CN + + DWNAR+Y
Sbjct: 307 IHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNINLGLARHELSKCNKTSEQDWNARIY 366
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 45/60 (75%)
Query: 106 RANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
++ AH R+++V+ KV++ KY+ QTR V TIW ILQLLR YPG+LPDL+L+F+C +
Sbjct: 15 KSGNPAHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 74
>gi|356522630|ref|XP_003529949.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
Length = 497
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 132/246 (53%), Positives = 173/246 (70%), Gaps = 15/246 (6%)
Query: 24 KTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTNYPKTSQTQESISDYSIPPTSTCPD 82
KT+I+ + N+ P+ PLNC ++ TCP+ YP + + S+ ++CP+
Sbjct: 58 KTIIVFN----NQQPQ---FPLNCTNGKSASTCPSYYPTKLEFDDDSSN------TSCPE 104
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFRWIHEDL PW+ TGITRDM+ER +HFRL++VN K YI K+ +S QTRDVFTIWGI
Sbjct: 105 YFRWIHEDLKPWESTGITRDMVERGKHISHFRLVIVNGKAYIEKFAKSYQTRDVFTIWGI 164
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
LQLLR YPG++PDLELMF C DR V+ +D+ P PPP+F Y G+ + DIVFPDW+
Sbjct: 165 LQLLRLYPGKIPDLELMFQCGDRTVVFKKDFQVP-KMSPPPVFHYCGEENSYDIVFPDWT 223
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGWAE++I+PWE+ L + EGN W DR PYA+WKGNP V+ RR+L CN ++KHD
Sbjct: 224 FWGWAELSIRPWETTLHNILEGNKLVKWKDRIPYAFWKGNPTVSIIRRELGKCNTTEKHD 283
Query: 263 WNARLY 268
WNAR+Y
Sbjct: 284 WNARIY 289
>gi|297817438|ref|XP_002876602.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
lyrata]
gi|297322440|gb|EFH52861.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
lyrata]
Length = 496
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 133/227 (58%), Positives = 164/227 (72%), Gaps = 4/227 (1%)
Query: 44 PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
P C Q Q+ T++ SQ + S + STCP YFRWIHEDL PWK TGITR M
Sbjct: 62 PYGCDFVQNQSNQTHF---SQNRASRLNPIRSKPSTCPSYFRWIHEDLRPWKQTGITRGM 118
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
+E A++TAHFRL++ N K Y+ +Y++SIQTRD FT+WGI+QLLR +PG+LPDLELMFD D
Sbjct: 119 IEEASRTAHFRLVIRNGKAYVKRYRKSIQTRDDFTLWGIVQLLRWFPGKLPDLELMFDAD 178
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DRPV+RS D+ G K PPP+FRY D ++DIVFPDWSFWGWAE+NIKPW L +KE
Sbjct: 179 DRPVVRSADFIG-QQKKPPPVFRYCSDDASLDIVFPDWSFWGWAEVNIKPWGESLEAIKE 237
Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
GNN W +R YAYW+GNP V R DLL CN+S+ +WN RLY+Q
Sbjct: 238 GNNMTQWKERVAYAYWRGNPHVDPGRGDLLKCNVSENEEWNTRLYIQ 284
>gi|388505300|gb|AFK40716.1| unknown [Medicago truncatula]
Length = 285
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 129/210 (61%), Positives = 164/210 (78%), Gaps = 12/210 (5%)
Query: 44 PLNCV--KNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGIT 100
PL C + +TQTCP +YP K + T ++ + TCP +FRWIHEDL PWK GIT
Sbjct: 83 PLRCTNGEKETQTCPRDYPTKHNPTNQN--------SHTCPSFFRWIHEDLKPWKEKGIT 134
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMF 160
R+MLE A +TA+F++++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR +PG+LPDLELMF
Sbjct: 135 REMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRMFPGKLPDLELMF 194
Query: 161 DCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
DC+DRPVI ++ GP N PPPLFRY D+W++DIVFPDWSFWGWAE NIKPW+++L+E
Sbjct: 195 DCEDRPVIHKGNFQGP-NASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWKNILKE 253
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRR 250
+KEGN W DR PYAYWKGNP VA TR+
Sbjct: 254 IKEGNKETKWKDRVPYAYWKGNPNVAATRK 283
>gi|359489780|ref|XP_002273350.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
Length = 509
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 134/231 (58%), Positives = 164/231 (70%), Gaps = 9/231 (3%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
KR +NC + N T+TCP T T S+ S T TCP+YFRWI+EDL PW T
Sbjct: 74 KRFEYSMNCSEGNMTKTCPV----TLLTTFEPSNLS---TGTCPEYFRWIYEDLKPWTET 126
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GITRDM+ERA AH R+++V+ KVY KYK QTRDVFTIWGILQ+LR YPG+LPD +
Sbjct: 127 GITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFTIWGILQVLRMYPGKLPDFD 186
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
LMF+C D+PVI+ DY G N PPLF Y GD T+DIVFPDWSFWGW EI IKPW +L
Sbjct: 187 LMFECGDKPVIKKHDYQGLNATA-PPLFHYCGDDETLDIVFPDWSFWGWPEIRIKPWSTL 245
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
++L+EGNN W+DREPYAYWKGN + TR +L C+ +++ DWNAR+Y
Sbjct: 246 RKDLREGNNKTKWVDREPYAYWKGNFKMGVTRHELSKCSKTNEQDWNARIY 296
>gi|359489776|ref|XP_002273194.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 521
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 132/252 (52%), Positives = 166/252 (65%), Gaps = 8/252 (3%)
Query: 19 NISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPT 77
N+S + + P ++ LNC + N +QTCP P + E PP+
Sbjct: 58 NVSILTGDFLKNTSPTKLHSQKFEYSLNCREGNMSQTCPVTGPVAFEPGE-------PPS 110
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
TCP+YFRWI+EDL PW TGITR M+E+A A R+++V+ KVY+ KYK+ + RD F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
TIWGILQLLR YPG+LPD +LMF+C DRP+IR+ Y GP+ PPPLF Y GD T DIV
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIV 230
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW E NIKPW ++LKEGN WIDREPYAYWKGN + R++L C
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRN 290
Query: 258 SDKHDWNARLYV 269
+D+ DWNARLY+
Sbjct: 291 TDEQDWNARLYI 302
>gi|147791844|emb|CAN70600.1| hypothetical protein VITISV_027960 [Vitis vinifera]
Length = 521
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 166/252 (65%), Gaps = 8/252 (3%)
Query: 19 NISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPT 77
N+S + + P ++ LNC + N +QTCP P + E PP+
Sbjct: 58 NVSILTGDFLKNTSPTKLHSQKFEYSLNCREGNMSQTCPVTGPVAFEPGE-------PPS 110
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
TCP+YFRWI+EDL PW TGITR M+E+A A R+++V+ KVY+ KYK+ + RD F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
TIWGILQLLR YPG+LPD +LMF+C DRP+I++ Y GP+ PPPLF Y GD T DIV
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIV 230
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW E NIKPW ++LKEGN WIDREPYAYWKGN + R++L C
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRN 290
Query: 258 SDKHDWNARLYV 269
+D+ DWNARLY+
Sbjct: 291 TDEQDWNARLYI 302
>gi|224064021|ref|XP_002301352.1| predicted protein [Populus trichocarpa]
gi|222843078|gb|EEE80625.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 121/195 (62%), Positives = 149/195 (76%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
++ CP+YFRWIHEDL WK TGI+R M+ERA AHFRL+++ K+Y+ KYK+S TRDV
Sbjct: 20 STACPEYFRWIHEDLRIWKSTGISRAMVERAKDYAHFRLVILKGKIYVEKYKKSFHTRDV 79
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGILQLLR YPG++PDLELMF CDDRPVI +DY G N P +F+Y G + I
Sbjct: 80 FTIWGILQLLRLYPGKVPDLELMFWCDDRPVILKKDYQGTNATSSPSIFQYCGREDALGI 139
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDW+FWGWAE N+ PW++L ++LKE N W DR PYAYW+GNP VA +RR L+ CN
Sbjct: 140 VFPDWTFWGWAETNVSPWKTLSKDLKEANKRTKWKDRVPYAYWRGNPNVAASRRQLMWCN 199
Query: 257 LSDKHDWNARLYVQV 271
+SDK+DWNARLY Q
Sbjct: 200 VSDKYDWNARLYKQA 214
>gi|218191256|gb|EEC73683.1| hypothetical protein OsI_08244 [Oryza sativa Indica Group]
Length = 524
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 123/203 (60%), Positives = 156/203 (76%), Gaps = 4/203 (1%)
Query: 72 YSIPPTST---CPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKY 127
+ +PPT+T CP YFR+IHEDL PW+ G ITR MLERA TA+FRL+++ + Y+ +
Sbjct: 115 HPLPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARDTANFRLVVLRGRAYVERI 174
Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY 187
+ QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+++ Y G N PPLFRY
Sbjct: 175 APAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQADRYQGENATAMPPLFRY 234
Query: 188 SGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
GD T+D+VFPDWSFWGW EINIKPW++L ++L GN W+DREPYAYWKGNP VA
Sbjct: 235 CGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKGNPDVAT 294
Query: 248 TRRDLLTCNLSDKHDWNARLYVQ 270
R++L+ CN+S KH+WNAR+Y Q
Sbjct: 295 KRKELVKCNVSSKHEWNARIYKQ 317
>gi|297599669|ref|NP_001047557.2| Os02g0642700 [Oryza sativa Japonica Group]
gi|255671126|dbj|BAF09471.2| Os02g0642700, partial [Oryza sativa Japonica Group]
Length = 474
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 123/201 (61%), Positives = 155/201 (77%), Gaps = 4/201 (1%)
Query: 74 IPPTST---CPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
+PPT+T CP YFR+IHEDL PW+ G ITR MLERA TA+FRL+++ + Y+ +
Sbjct: 67 LPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARDTANFRLVVLRGRAYVERIAP 126
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
+ QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+++ Y G N PPLFRY G
Sbjct: 127 AFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQADRYQGENATAMPPLFRYCG 186
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
D T+D+VFPDWSFWGW EINIKPW++L ++L GN W+DREPYAYWKGNP VA R
Sbjct: 187 DDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKGNPDVATKR 246
Query: 250 RDLLTCNLSDKHDWNARLYVQ 270
++L+ CN+S KH+WNAR+Y Q
Sbjct: 247 KELVKCNVSSKHEWNARIYKQ 267
>gi|359489782|ref|XP_002273411.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 525
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 127/226 (56%), Positives = 156/226 (69%), Gaps = 8/226 (3%)
Query: 45 LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
LNC + N +QTCP P + E PP+ TCP+YFRWI+EDL PW TGITR M
Sbjct: 88 LNCREGNVSQTCPVTGPVAFEPSE-------PPSETCPEYFRWIYEDLRPWMDTGITRAM 140
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
+E+A A R+++V+ KVY+ KYK+ RD FTIWGILQLLR YP +LPD +LMF+C
Sbjct: 141 VEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEFTIWGILQLLRMYPEKLPDFDLMFECR 200
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DRP+I++ Y GP+ PPPLF Y GD T DIVFPDWSFWGW E NIKPW ++LKE
Sbjct: 201 DRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGWPETNIKPWNGFKKDLKE 260
Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYV 269
GN WIDREPYAYWKGN + R++L C +D+ DWNARLY+
Sbjct: 261 GNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNARLYI 306
>gi|356561570|ref|XP_003549054.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
2-like [Glycine max]
Length = 426
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/233 (57%), Positives = 160/233 (68%), Gaps = 17/233 (7%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
++ PLNC N T TCPTN ++ P T+TCPDYFRWIHEDL PW T
Sbjct: 23 RKPIAPLNCTAYNLTGTCPTNLQDHQRS---------PATATCPDYFRWIHEDLRPWART 73
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
E A +A+ R I++ + Y+ Y + QTRDVF+IWGILQLLR+YPG++PDLE
Sbjct: 74 -------EHAPTSANLRFIILKGRAYLETYSRPYQTRDVFSIWGILQLLRRYPGKIPDLE 126
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
LMFDC D PV+ S Y+GPN + PPPLFRY G+ T+D+VF DWSFWGWAEINIKPW L
Sbjct: 127 LMFDCVDWPVVXSDRYNGPNVEQPPPLFRYCGNDATLDVVFLDWSFWGWAEINIKPWHIL 186
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
L ELKEG W++REPYAYWKGNP VAETR L+ CN+S+ DWNARL Q
Sbjct: 187 LGELKEGTTRIPWLNREPYAYWKGNPAVAETRXYLMKCNVSENQDWNARLLAQ 239
>gi|357136753|ref|XP_003569968.1| PREDICTED: uncharacterized protein LOC100831246 [Brachypodium
distachyon]
Length = 543
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 121/194 (62%), Positives = 150/194 (77%), Gaps = 1/194 (0%)
Query: 78 STCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
S+CP YFR+IHEDL PW+ G ITR ML+RA TA+FRL+++ + YI + QTRD+
Sbjct: 138 SSCPAYFRFIHEDLHPWRAAGGITRAMLDRARATANFRLVVLRGRAYIELIAPAFQTRDL 197
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGILQLLR+YPGR+PDL+LMFDC D PV+R+ Y G N PPLFRY GD T+D+
Sbjct: 198 FTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVRADQYEGENATAMPPLFRYCGDNETLDV 257
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGWAEINIKPW++L ++L GN W+DREPYAYWKGNP VA R++L+ CN
Sbjct: 258 VFPDWSFWGWAEINIKPWDALRKDLDAGNRRVRWVDREPYAYWKGNPDVAAIRQELVKCN 317
Query: 257 LSDKHDWNARLYVQ 270
+S K +WNAR+Y Q
Sbjct: 318 VSSKQEWNARIYKQ 331
>gi|413923257|gb|AFW63189.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
Length = 578
Score = 270 bits (691), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 121/195 (62%), Positives = 151/195 (77%), Gaps = 1/195 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
+STCP YFR+IHEDL PW+ G ITR M++RA TA+FRL+++ + YI + + QTRD
Sbjct: 145 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 204
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ + Y G N PPLFRY GD T+D
Sbjct: 205 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQGENATLLPPLFRYCGDNETLD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW EINIKPW++L +EL GN W+ REPYAYWKGNP VA TR++L+ C
Sbjct: 265 VVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREPYAYWKGNPDVAVTRQELVKC 324
Query: 256 NLSDKHDWNARLYVQ 270
N+S KH+WNAR+Y Q
Sbjct: 325 NVSSKHEWNARIYKQ 339
>gi|297745254|emb|CBI40334.3| unnamed protein product [Vitis vinifera]
Length = 819
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 121/192 (63%), Positives = 147/192 (76%), Gaps = 1/192 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
T TCP+YFRWI+EDL PW TGITRDM+ERA AH R+++V+ KVY KYK QTRDV
Sbjct: 26 TGTCPEYFRWIYEDLKPWTETGITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDV 85
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGILQ+LR YPG+LPD +LMF+C D+PVI+ DY G N PPLF Y GD T+DI
Sbjct: 86 FTIWGILQVLRMYPGKLPDFDLMFECGDKPVIKKHDYQGLNATA-PPLFHYCGDDETLDI 144
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EI IKPW +L ++L+EGNN W+DREPYAYWKGN + TR +L C+
Sbjct: 145 VFPDWSFWGWPEIRIKPWSTLRKDLREGNNKTKWVDREPYAYWKGNFKMGVTRHELSKCS 204
Query: 257 LSDKHDWNARLY 268
+++ DWNAR+Y
Sbjct: 205 KTNEQDWNARIY 216
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 107/183 (58%), Positives = 131/183 (71%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
+ EDL PW TGITR M+E+A A R+++V+ KVY+ KYK+ RD FTIWGILQLL
Sbjct: 418 LEEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEFTIWGILQLL 477
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
R YP +LPD +LMF+C DRP+I++ Y GP+ PPPLF Y GD T DIVFPDWSFWGW
Sbjct: 478 RMYPEKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGW 537
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
E NIKPW ++LKEGN WIDREPYAYWKGN + R++L C +D+ DWNAR
Sbjct: 538 PETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNAR 597
Query: 267 LYV 269
LY+
Sbjct: 598 LYI 600
>gi|219363409|ref|NP_001136898.1| uncharacterized protein LOC100217055 [Zea mays]
gi|194697518|gb|ACF82843.1| unknown [Zea mays]
gi|413923256|gb|AFW63188.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
Length = 551
Score = 270 bits (690), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 121/195 (62%), Positives = 151/195 (77%), Gaps = 1/195 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
+STCP YFR+IHEDL PW+ G ITR M++RA TA+FRL+++ + YI + + QTRD
Sbjct: 145 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 204
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ + Y G N PPLFRY GD T+D
Sbjct: 205 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQGENATLLPPLFRYCGDNETLD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW EINIKPW++L +EL GN W+ REPYAYWKGNP VA TR++L+ C
Sbjct: 265 VVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREPYAYWKGNPDVAVTRQELVKC 324
Query: 256 NLSDKHDWNARLYVQ 270
N+S KH+WNAR+Y Q
Sbjct: 325 NVSSKHEWNARIYKQ 339
>gi|414586114|tpg|DAA36685.1| TPA: hypothetical protein ZEAMMB73_061596 [Zea mays]
Length = 433
Score = 270 bits (689), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 120/194 (61%), Positives = 152/194 (78%), Gaps = 1/194 (0%)
Query: 78 STCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
++CP YFR+IHEDL PW+ G ITR ML+RA TA FRL+++ Y+ +++ + QTRD+
Sbjct: 143 TSCPSYFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGNAYVQRFRPAFQTRDL 202
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWG+LQLLR++PGR+PDL+LMFD D PV+RS Y G N + PPLFRY GD T+DI
Sbjct: 203 FTIWGVLQLLRRHPGRVPDLDLMFDTVDWPVVRSHLYRGKNAEMLPPLFRYCGDDKTLDI 262
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EINIKPW++L +LK GNN W+DREPYAYWKGNP V+ TR++L+ CN
Sbjct: 263 VFPDWSFWGWPEINIKPWDALKEDLKAGNNRVRWMDREPYAYWKGNPSVSGTRKELVKCN 322
Query: 257 LSDKHDWNARLYVQ 270
+S HDWNAR+Y Q
Sbjct: 323 VSSTHDWNARIYAQ 336
>gi|357168046|ref|XP_003581456.1| PREDICTED: protein O-glucosyltransferase 1-like [Brachypodium
distachyon]
Length = 543
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 119/192 (61%), Positives = 150/192 (78%), Gaps = 1/192 (0%)
Query: 80 CPDYFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
CP YFR+IHEDL PW+ G+TR ML RA TA FRL+++ +V++ +++ + QTRD+FT
Sbjct: 145 CPSYFRFIHEDLRPWREAGGVTRAMLARARVTASFRLVVLRGRVFVQRFRPAFQTRDLFT 204
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
IWGILQL+R+YPGR+PDL+LMFDC D PV+R+ Y G + PPLFRY GD T+DIVF
Sbjct: 205 IWGILQLIRRYPGRVPDLDLMFDCVDWPVVRTHLYRGKHAPFMPPLFRYCGDDRTLDIVF 264
Query: 199 PDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLS 258
PDWSFWGW EINIKPW +L +ELK+GNN W+DREPYAYWKGN VA +RR+L+ CN+S
Sbjct: 265 PDWSFWGWPEINIKPWGALQKELKDGNNKVRWLDREPYAYWKGNAAVAVSRRELVQCNVS 324
Query: 259 DKHDWNARLYVQ 270
DWNAR+Y Q
Sbjct: 325 STQDWNARIYTQ 336
>gi|359489778|ref|XP_002271353.2| PREDICTED: protein O-glucosyltransferase 1 [Vitis vinifera]
Length = 455
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 131/228 (57%), Positives = 157/228 (68%), Gaps = 15/228 (6%)
Query: 42 TTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
TT NC K T TC T Q +S + CPDYFRWI+EDL PW TGIT+
Sbjct: 28 TTEENCTKENT-TCSTK-------QHPLSG------NACPDYFRWIYEDLRPWAETGITQ 73
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
+M+E A A FRL++++ KVY+ KYK + QTRDVFTIWGILQLL+ YPG++PDLELMF+
Sbjct: 74 EMMEAAKDPAFFRLVILDGKVYMEKYKGAFQTRDVFTIWGILQLLKLYPGKVPDLELMFE 133
Query: 162 CDDRPVIRSRDYSG-PNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
C DRP I++ DY G K PPLF Y T+DIVFPDWSFWGW EINIKPW SL +E
Sbjct: 134 CGDRPRIKATDYGGRKGKKKVPPLFHYCASDDTLDIVFPDWSFWGWPEINIKPWNSLRKE 193
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
L+EGNN W+DREPYAYWKGN + R+ L C S+ HDWNAR+Y
Sbjct: 194 LEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCRPSNNHDWNARVY 241
>gi|79454960|ref|NP_191687.2| uncharacterized protein [Arabidopsis thaliana]
gi|55978809|gb|AAV68866.1| hypothetical protein AT3G61280 [Arabidopsis thaliana]
gi|61742723|gb|AAX55182.1| hypothetical protein At3g61280 [Arabidopsis thaliana]
gi|110739424|dbj|BAF01622.1| hypothetical protein [Arabidopsis thaliana]
gi|332646661|gb|AEE80182.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 134/239 (56%), Positives = 159/239 (66%), Gaps = 12/239 (5%)
Query: 37 PPKRITTPLNCVK---NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
P I PLNC N TQTCP+NYP + +I + TCPDYF+WIH DL
Sbjct: 92 PKIAIKIPLNCTSLNSNTTQTCPSNYPTKFEP-------AISSSETCPDYFKWIHRDLKV 144
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ TGITR+ LERA AHFR+++ + ++Y+H+Y+++ QTRDVFTIWGILQLLR YPG++
Sbjct: 145 WQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQI 204
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
PDLEL+F C DRP I RD PPPLF Y G R DIVFPDWSFWGW E+NI
Sbjct: 205 PDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNI 264
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
K W L LKEGN W DR PYAYWKGNP V+ R DL+ CN SDK+D RLYVQ
Sbjct: 265 KEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQ 323
>gi|186511278|ref|NP_001118874.1| uncharacterized protein [Arabidopsis thaliana]
gi|332646662|gb|AEE80183.1| uncharacterized protein [Arabidopsis thaliana]
Length = 378
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 134/239 (56%), Positives = 159/239 (66%), Gaps = 12/239 (5%)
Query: 37 PPKRITTPLNCVK---NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
P I PLNC N TQTCP+NYP + +I + TCPDYF+WIH DL
Sbjct: 92 PKIAIKIPLNCTSLNSNTTQTCPSNYPTKFEP-------AISSSETCPDYFKWIHRDLKV 144
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ TGITR+ LERA AHFR+++ + ++Y+H+Y+++ QTRDVFTIWGILQLLR YPG++
Sbjct: 145 WQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQI 204
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
PDLEL+F C DRP I RD PPPLF Y G R DIVFPDWSFWGW E+NI
Sbjct: 205 PDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNI 264
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
K W L LKEGN W DR PYAYWKGNP V+ R DL+ CN SDK+D RLYVQ
Sbjct: 265 KEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQ 323
>gi|326509457|dbj|BAJ91645.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 510
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 122/198 (61%), Positives = 150/198 (75%), Gaps = 3/198 (1%)
Query: 75 PPT-STCPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRLILVNNKVYIHKYKQSIQ 132
PPT +TCP YFRWIHEDL PW+ TGITR+ LE A + A FR+ +++ ++Y+ +Y + Q
Sbjct: 110 PPTPTTCPVYFRWIHEDLRPWRATGITREALEGARRYGAKFRVTVLSGRLYVARYGRCFQ 169
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
TRDVFT WGILQLLR+Y GR+PDL+LMFDC D PV+ + D+ G + PPPLFRY G
Sbjct: 170 TRDVFTQWGILQLLRRYAGRVPDLDLMFDCQDLPVVNAGDH-GSHTPPPPPLFRYCGSEP 228
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
T+DI FPDWSFWGW E+NIKPWE+L RE+KE N +W R PYAYWKGNP VA RR+L
Sbjct: 229 TLDIAFPDWSFWGWPELNIKPWEALRREIKEANAALDWTRRAPYAYWKGNPAVAAARREL 288
Query: 253 LTCNLSDKHDWNARLYVQ 270
L CN+S K DWNAR+Y Q
Sbjct: 289 LKCNVSGKRDWNARIYAQ 306
>gi|242066560|ref|XP_002454569.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
gi|241934400|gb|EES07545.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
Length = 552
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 118/195 (60%), Positives = 151/195 (77%), Gaps = 1/195 (0%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
+STCP YFR+IHEDL PW+ G ITR M++RA TA+FRL+++ + YI + + QTRD
Sbjct: 146 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 205
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ + Y G N PPLFRY G+ T+D
Sbjct: 206 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYEGENATVLPPLFRYCGNNETLD 265
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW EINIKPW++L +EL GN W++REPYAYWKGNP VA R++L+ C
Sbjct: 266 VVFPDWSFWGWPEINIKPWDALQKELNRGNKRVKWLNREPYAYWKGNPDVAVIRQELVKC 325
Query: 256 NLSDKHDWNARLYVQ 270
N+S +H+WNAR+Y Q
Sbjct: 326 NVSSEHEWNARIYKQ 340
>gi|147810897|emb|CAN69493.1| hypothetical protein VITISV_037876 [Vitis vinifera]
Length = 378
Score = 265 bits (676), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 116/168 (69%), Positives = 141/168 (83%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ERA +TA F+L+++N + Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC
Sbjct: 1 MVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PVI+S++Y GPN PPPLFRY GD T+DIVFPDWSFWGW EI IKPWESLL++LK
Sbjct: 61 VDWPVIKSKEYHGPNTTAPPPLFRYCGDDATLDIVFPDWSFWGWPEIXIKPWESLLKDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+Y Q
Sbjct: 121 EGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARVYTQ 168
>gi|147798876|emb|CAN65870.1| hypothetical protein VITISV_012498 [Vitis vinifera]
Length = 504
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 120/192 (62%), Positives = 145/192 (75%), Gaps = 1/192 (0%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CPDYFRWI+EDL PW TGIT++M+E A A FRL++++ KVY+ KYK + QTRDVF
Sbjct: 99 NACPDYFRWIYEDLRPWAETGITQEMMEAAKDPAFFRLVILDGKVYMEKYKGAFQTRDVF 158
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG-PNNKGPPPLFRYSGDRWTMDI 196
TIWGILQLL+ YPG++PDLELMF+C DRP I++ DY G K PPLF Y T+DI
Sbjct: 159 TIWGILQLLKLYPGKVPDLELMFECGDRPRIKASDYGGRKGKKKVPPLFHYCASDDTLDI 218
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EINIKPW SL +EL+EGNN W+DREPYAYWKGN + R+ L C
Sbjct: 219 VFPDWSFWGWPEINIKPWNSLRKELEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCR 278
Query: 257 LSDKHDWNARLY 268
S+ HDWNAR+Y
Sbjct: 279 PSNNHDWNARVY 290
>gi|242073834|ref|XP_002446853.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
gi|241938036|gb|EES11181.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
Length = 555
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 117/189 (61%), Positives = 149/189 (78%), Gaps = 1/189 (0%)
Query: 83 YFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
YFR+IHEDL PW+ G ITR ML+RA TA FRL+++ + Y+H+ + + QTRD+FTIWG
Sbjct: 160 YFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGRAYVHRLRPAFQTRDLFTIWG 219
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
+LQLLR+YPGR+PDL+LMFD D P++R+ Y G + PPLFRY GD T+DIVFPDW
Sbjct: 220 VLQLLRRYPGRVPDLDLMFDTVDWPIVRAHLYRGKYAEMLPPLFRYCGDDKTLDIVFPDW 279
Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
SFWGW EINIKPW++L +LK+GNN W+DREPYAYWKGNP V+ TR++L+ CN+S H
Sbjct: 280 SFWGWPEINIKPWDALQEDLKDGNNRVRWMDREPYAYWKGNPSVSATRKELVKCNVSSTH 339
Query: 262 DWNARLYVQ 270
DWNAR+Y Q
Sbjct: 340 DWNARIYAQ 348
>gi|224064019|ref|XP_002301351.1| predicted protein [Populus trichocarpa]
gi|222843077|gb|EEE80624.1| predicted protein [Populus trichocarpa]
Length = 413
Score = 263 bits (673), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 119/197 (60%), Positives = 145/197 (73%), Gaps = 1/197 (0%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
CP +FRWIHEDL PWK TGI+RDMLERA + AHFRL++V K Y+ +Y + QTRDVFT
Sbjct: 2 ACPGFFRWIHEDLRPWKDTGISRDMLERARKHAHFRLVIVEGKAYVEQYSKPYQTRDVFT 61
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
IWGILQLL YPG++PDLELMF C D+ VI+ D G + P LF+Y G ++IVF
Sbjct: 62 IWGILQLLNLYPGKIPDLELMFRCGDKTVIQKHDIQGSDAMSPAVLFQYCGHSTALNIVF 121
Query: 199 PDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLS 258
PDW+FWGWAE NIKPW+ +L + EGN W DREPYAYW+GNP V+ R DL+ CN+S
Sbjct: 122 PDWTFWGWAETNIKPWKIVLEGMVEGNKKIKWQDREPYAYWRGNPHVSPNREDLMKCNVS 181
Query: 259 DKHDWNARLYVQVISFI 275
DK+DW ARLY Q SF+
Sbjct: 182 DKYDWLARLYEQA-SFL 197
>gi|297745248|emb|CBI40328.3| unnamed protein product [Vitis vinifera]
Length = 838
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 116/168 (69%), Positives = 141/168 (83%), Gaps = 1/168 (0%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M++RA +TAHFRL++++ K Y+ K+++SIQTRD+FT+WGILQLLR YPGRLPDLELMFDC
Sbjct: 1 MVDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DDRPV+R RD+ GPN PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++K
Sbjct: 61 DDRPVVRMRDFRGPN-AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIK 119
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EGN W DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 120 EGNRRTKWKDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 167
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 78/156 (50%), Gaps = 25/156 (16%)
Query: 11 LQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESI 69
+ ++FS N + N ++KP+ P + NC + + Q CP N+P TS T +
Sbjct: 369 MMTSFSYQNSAPN------TKKPRIFPKILV----NCTIGSMAQACPANHPTTSVTGKL- 417
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
CP+YFRWIHEDL PWK TGI+R +E A A FRL++VN K Y+ +Y+
Sbjct: 418 ------SVEACPEYFRWIHEDLRPWKSTGISRFAVESAEGDADFRLVIVNGKAYVEQYRN 471
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
+T+ G ++ G + L + C R
Sbjct: 472 -------YTLTGDQNWEKESNGGFKNSNLAYKCTHR 500
>gi|116310458|emb|CAH67462.1| OSIGBa0159I10.7 [Oryza sativa Indica Group]
gi|218195275|gb|EEC77702.1| hypothetical protein OsI_16775 [Oryza sativa Indica Group]
Length = 537
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 116/189 (61%), Positives = 148/189 (78%), Gaps = 1/189 (0%)
Query: 83 YFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
YFR+IHEDL PW+ GITR ML RA TA FRL+++ + ++H+++ + QTRD+FTIWG
Sbjct: 142 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWG 201
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
+LQLLR+YPGR+PDL+LMFDC D PV+R+ Y G + PPLF Y GD T+DIVFPDW
Sbjct: 202 VLQLLRRYPGRVPDLDLMFDCADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDW 261
Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
SFWGW EINIKPW++L ++LK+GNN W+DR PYAYWKGNP VA TR++L+ CN+S
Sbjct: 262 SFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTK 321
Query: 262 DWNARLYVQ 270
DWNAR+Y Q
Sbjct: 322 DWNARIYKQ 330
>gi|255541544|ref|XP_002511836.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
gi|223549016|gb|EEF50505.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
communis]
Length = 549
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 120/238 (50%), Positives = 152/238 (63%), Gaps = 8/238 (3%)
Query: 35 NRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPW 94
N +P+ C T + +SD + CPDYFRWIHEDL PW
Sbjct: 74 NESSAHNASPVKCADQSTCKVNDTVRAFDGPEYQLSD------AACPDYFRWIHEDLRPW 127
Query: 95 KVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLP 154
TGI+RD +E A + A FRL++V+ K Y+ +Y S QTRD+FTIWGI+QLLR YPGR+P
Sbjct: 128 ASTGISRDTVESAKRFATFRLVIVDGKAYVERYYHSFQTRDLFTIWGIVQLLRLYPGRVP 187
Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
D+ELMF C D P I+ DY GP PPP LF+YSG+ + FPDWSFWGW E++IKP
Sbjct: 188 DVELMFQCGDLPEIQKGDYQGPGATLPPPALFQYSGNETAFAVTFPDWSFWGWVEVDIKP 247
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD-KHDWNARLYVQ 270
W+S+L + +G+ +NW DR PYAYW+GN V+ R+DLL C S HDWNARLY Q
Sbjct: 248 WKSMLEGITKGSQRKNWTDRVPYAYWRGNSHVSRDRKDLLKCKSSIFSHDWNARLYSQ 305
>gi|449446169|ref|XP_004140844.1| PREDICTED: O-glucosyltransferase rumi homolog [Cucumis sativus]
Length = 454
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 165/253 (65%), Gaps = 23/253 (9%)
Query: 30 SRKPQNRPPKRIT-TPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---------ST 79
R+ QN P K + +P+NC T Y ++ + I +I +T
Sbjct: 6 ERELQNYPQKEVEFSPINC---------TAYSRSEKWDSGIGPTTIEEEEEDGDGKNENT 56
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP+YFRWIHEDL PW TGITR+M+ER + A FRL++V + Y+ KY + Q RDVFT+
Sbjct: 57 CPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTL 116
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS--GPNNKGPPPLFRYSGDRWTMDIV 197
WGILQLLR YP ++PDL+LMF C+D+P + +YS GPN+ PPPLFRY GD T DIV
Sbjct: 117 WGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIV 176
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW EIN+KPWE+ ++ELKE N + WIDRE YA+WKGN F++ R LL C+
Sbjct: 177 FPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSR 236
Query: 258 SDKHDWNARLYVQ 270
S + R+Y+Q
Sbjct: 237 STQS--KLRVYMQ 247
>gi|357507237|ref|XP_003623907.1| KDEL motif-containing protein [Medicago truncatula]
gi|355498922|gb|AES80125.1| KDEL motif-containing protein [Medicago truncatula]
Length = 454
Score = 253 bits (646), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 115/225 (51%), Positives = 151/225 (67%), Gaps = 4/225 (1%)
Query: 45 LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
L C N TQTCP ++ + S TSTCP++FRWIHEDL PWK TGITR+M
Sbjct: 46 LKCFNGNPTQTCPNDHSPIQAFHQDQDPSS--STSTCPEHFRWIHEDLEPWKSTGITREM 103
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
++ + R+++ KVY+ Y S QTR FT+WGI+QLLR YPGR+PDLEL+F+
Sbjct: 104 VDSGENISQLRIVIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETG 163
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DR V+ + Y GP PPP+F Y G +DIVFPDWS+WGWAE IKPWE +L++++E
Sbjct: 164 DRAVVDKKRYEGPEAVTPPPIFSYCGKNEALDIVFPDWSYWGWAETRIKPWEKVLKDIQE 223
Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
N W DR PYA+WKGN F++ R +L CN++D+HDWNAR+Y
Sbjct: 224 SNKKIKWKDRIPYAFWKGN-FLSNPRHELRKCNVTDQHDWNARVY 267
>gi|42569949|ref|NP_182108.2| uncharacterized protein [Arabidopsis thaliana]
gi|330255514|gb|AEC10608.1| uncharacterized protein [Arabidopsis thaliana]
Length = 523
Score = 253 bits (646), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 134/245 (54%), Positives = 158/245 (64%), Gaps = 23/245 (9%)
Query: 37 PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
P +++TTPLN C +Q TQTCP + P+ SQ + + TCPDYFRWIH+
Sbjct: 76 PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
DL W+ TGITR+ LERA+ AHFRLI+ +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
PG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248
Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
W EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPMV 303
Query: 266 RLYVQ 270
LY Q
Sbjct: 304 HLYRQ 308
>gi|110738999|dbj|BAF01419.1| hypothetical protein [Arabidopsis thaliana]
Length = 523
Score = 253 bits (646), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 134/245 (54%), Positives = 158/245 (64%), Gaps = 23/245 (9%)
Query: 37 PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
P +++TTPLN C +Q TQTCP + P+ SQ + + TCPDYFRWIH+
Sbjct: 76 PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
DL W+ TGITR+ LERA+ AHFRLI+ +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
PG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248
Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
W EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPMV 303
Query: 266 RLYVQ 270
LY Q
Sbjct: 304 HLYRQ 308
>gi|357168044|ref|XP_003581455.1| PREDICTED: O-glucosyltransferase rumi homolog [Brachypodium
distachyon]
Length = 520
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 121/244 (49%), Positives = 156/244 (63%), Gaps = 17/244 (6%)
Query: 41 ITTPLNCVKNQTQTCPTNYPK--------TSQTQESISDYSIPPTSTCPDYFRWIHEDLS 92
+TTP ++ P+ P ++ + S T +CP YFRWIHEDL
Sbjct: 75 LTTPFTSSQDAANRTPSELPAGLTQQPSPAPASEAATESSSATTTGSCPAYFRWIHEDLR 134
Query: 93 PWKVTGITRDMLERANQ-TAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY-- 149
PW+ TGITR+ LE A++ FR+ +V ++Y+ +Y + QTRDVFT WGILQLLR+Y
Sbjct: 135 PWRATGITRETLEGAHRYMPKFRVTVVAGRLYVRRYGRCFQTRDVFTQWGILQLLRRYNT 194
Query: 150 PGR---LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
GR +PDL+LMFDC D PV+ + ++ G + PPPLFRY G T+DI FPDWSFWGW
Sbjct: 195 TGRRAVVPDLDLMFDCQDLPVVDAGNHRGCH---PPPLFRYCGSEPTLDIAFPDWSFWGW 251
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
E+NIKPW +L RE+ EGN +W R PYAYWKGNP V R +LL CN+S K DWNAR
Sbjct: 252 PELNIKPWGTLRREIDEGNAAVDWTRRAPYAYWKGNPMVGTARLELLKCNVSRKRDWNAR 311
Query: 267 LYVQ 270
+Y Q
Sbjct: 312 IYAQ 315
>gi|242073832|ref|XP_002446852.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
gi|241938035|gb|EES11180.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
Length = 505
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 113/193 (58%), Positives = 137/193 (70%), Gaps = 6/193 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQ--TAHFRLILVNNKVYIHKYKQSIQTRDVF 137
CP YFRWIHEDL PW+ G+TR LE A + FR+ +V ++Y+ +Y + QTR F
Sbjct: 114 CPSYFRWIHEDLRPWRAAGVTRGALEGARRLFAPKFRVTVVAGRLYVARYGRCFQTRAAF 173
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
T WGILQLLR+YPGR+PDL+LMFDCDD PV+ + G ++ PPPLFRY G T+DI
Sbjct: 174 TQWGILQLLRRYPGRVPDLDLMFDCDDLPVVGA----GDRHQAPPPLFRYCGSETTLDIA 229
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW E+NIKPWE+L RE+ E N NW+DR PYAYWKGNP V R LL CN
Sbjct: 230 FPDWSFWGWPELNIKPWEALRREINEENAMVNWMDRAPYAYWKGNPKVGAERLLLLRCNA 289
Query: 258 SDKHDWNARLYVQ 270
S + DWNAR+Y Q
Sbjct: 290 SGERDWNARVYAQ 302
>gi|15233074|ref|NP_191688.1| uncharacterized protein [Arabidopsis thaliana]
gi|6850896|emb|CAB71059.1| putative protein [Arabidopsis thaliana]
gi|332646663|gb|AEE80184.1| uncharacterized protein [Arabidopsis thaliana]
Length = 455
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 123/230 (53%), Positives = 151/230 (65%), Gaps = 12/230 (5%)
Query: 44 PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
PL C N TQTCP+ YP S+ + IS TCPDYFRWI +DL W+ TGITR+
Sbjct: 17 PLKC--NATQTCPSTYP--SRLEPMISS-----LETCPDYFRWIQQDLKVWEETGITRET 67
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
LERA AHFRL++ + ++Y+H+Y ++ ++RDV TIWGILQLLR YPG++PDLEL+F C
Sbjct: 68 LERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVLTIWGILQLLRMYPGQVPDLELLFFCH 127
Query: 164 DRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
D P I RD+ P PPPLF+Y G R IVFPDWSFWGW E+NIK W L
Sbjct: 128 DIPAIWKRDFRQPEPNATWPPPPLFQYCGHREAYGIVFPDWSFWGWPEVNIKEWTKLSVA 187
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
++E N W DR PYAYWKGN V R +L+ CN SDK+D RLY Q
Sbjct: 188 IREANKRVKWNDRVPYAYWKGNSGVHRERGNLMKCNFSDKYDPMVRLYEQ 237
>gi|357507235|ref|XP_003623906.1| O-glucosyltransferase rumi [Medicago truncatula]
gi|355498921|gb|AES80124.1| O-glucosyltransferase rumi [Medicago truncatula]
Length = 442
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 117/256 (45%), Positives = 159/256 (62%), Gaps = 11/256 (4%)
Query: 14 NFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDY 72
+F + S +K II ++ Q LNC + T+TC ++ +
Sbjct: 16 SFKTKSTSPSKITIIKRQQTQ--------FTLNCFNGSMTRTCQKDHHPIQAFHQDQDPS 67
Query: 73 SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
S TSTCP++FRWIHEDL PWK TGITR+ +E + R+++ K Y+ Y S Q
Sbjct: 68 S--STSTCPEHFRWIHEDLEPWKSTGITRETVESGKNISQLRIVIKKGKAYVETYADSFQ 125
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
TRD+FT+WGI+QLLR YPGR+PDLEL+F+ DR V+ + + G + PP+F Y G
Sbjct: 126 TRDLFTVWGIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQGSQSVTLPPIFSYCGQND 185
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
+DIVFPDWSFWGWAE IKPWE +L++++E N W DR PYA+WKGN V+ R L
Sbjct: 186 ALDIVFPDWSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIPYAFWKGNTHVSSQRYKL 245
Query: 253 LTCNLSDKHDWNARLY 268
CN++D+HDWNAR+Y
Sbjct: 246 RQCNVTDQHDWNARIY 261
>gi|42571241|ref|NP_973694.1| downstream target of AGL15 2 [Arabidopsis thaliana]
gi|44917445|gb|AAS49047.1| At2g45830 [Arabidopsis thaliana]
gi|46931276|gb|AAT06442.1| At2g45830 [Arabidopsis thaliana]
gi|330255512|gb|AEC10606.1| downstream target of AGL15 2 [Arabidopsis thaliana]
Length = 382
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 109/168 (64%), Positives = 136/168 (80%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLELMFD
Sbjct: 1 MLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLELMFDP 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DDRP +RS+D+ G + PPPLFRY D ++DIVFPDWSFWGWAE+NIKPW+ L ++
Sbjct: 61 DDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSLVAIE 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EGN W DR YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 121 EGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 168
>gi|297824641|ref|XP_002880203.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
lyrata]
gi|297326042|gb|EFH56462.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
lyrata]
Length = 522
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 136/278 (48%), Positives = 163/278 (58%), Gaps = 24/278 (8%)
Query: 4 LFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLN----CVKNQ---TQTCP 56
L S+ FS+ T I P K+I TP+N C +Q T CP
Sbjct: 43 LLGASLSWMDMFSIGGGRIKVTPIFTRNTNATIPKKKIYTPVNFTLQCSLDQNIATHVCP 102
Query: 57 TNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLI 116
+ P SQ + + TCPDYFRWI++DL W+ TGITR+ LERA+ AHFRLI
Sbjct: 103 ASNPDKSQPSKDDPE-------TCPDYFRWIYKDLEAWRGTGITRETLERASDKAHFRLI 155
Query: 117 LVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS-- 174
+ +VY+ +YK+S QTRDVFTIWGI+QLLR YPG++PDLEL+F CDD P I RDY
Sbjct: 156 IKGGRVYVQQYKKSFQTRDVFTIWGIVQLLRMYPGQVPDLELLFMCDDLPEIWRRDYRPR 215
Query: 175 -GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWID 232
G N PPPLF Y G DIVFPDWSFWGW EINIK W +L + EG W +
Sbjct: 216 PGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWGWPEINIKEWNKMLEAISEGLKKVKWEE 275
Query: 233 REPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
REPYAYWKGN V + RRDL+ C HD LY Q
Sbjct: 276 REPYAYWKGNARVGK-RRDLMNC-----HDPMVHLYSQ 307
>gi|87241560|gb|ABD33418.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
Length = 397
Score = 243 bits (620), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 104/192 (54%), Positives = 136/192 (70%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
TSTCP++FRWIHEDL PWK TGITR+ +E + R+++ K Y+ Y S QTRD+
Sbjct: 25 TSTCPEHFRWIHEDLEPWKSTGITRETVESGKNISQLRIVIKKGKAYVETYADSFQTRDL 84
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FT+WGI+QLLR YPGR+PDLEL+F+ DR V+ + + G + PP+F Y G +DI
Sbjct: 85 FTVWGIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQGSQSVTLPPIFSYCGQNDALDI 144
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGWAE IKPWE +L++++E N W DR PYA+WKGN V+ R L CN
Sbjct: 145 VFPDWSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIPYAFWKGNTHVSSQRYKLRQCN 204
Query: 257 LSDKHDWNARLY 268
++D+HDWNAR+Y
Sbjct: 205 VTDQHDWNARIY 216
>gi|115459624|ref|NP_001053412.1| Os04g0534000 [Oryza sativa Japonica Group]
gi|38346998|emb|CAE04582.2| OSJNBb0039L24.21 [Oryza sativa Japonica Group]
gi|113564983|dbj|BAF15326.1| Os04g0534000 [Oryza sativa Japonica Group]
gi|116310457|emb|CAH67461.1| OSIGBa0159I10.6 [Oryza sativa Indica Group]
gi|125549144|gb|EAY94966.1| hypothetical protein OsI_16774 [Oryza sativa Indica Group]
gi|125591102|gb|EAZ31452.1| hypothetical protein OsJ_15588 [Oryza sativa Japonica Group]
gi|215694886|dbj|BAG90077.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 508
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 108/190 (56%), Positives = 136/190 (71%), Gaps = 2/190 (1%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQ--TAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
YFRWIHEDL PW+ GITR+ ++ A + A FR+ +V ++++ +Y + QTRD+FT W
Sbjct: 113 YFRWIHEDLRPWRDAGITREAVDGAARRYGAKFRVTVVAGRLHVARYGRCFQTRDMFTQW 172
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
G+LQLLR+Y GR+PDL+LMFDC D PV+ + D G + PPPLF Y G T+DI FPD
Sbjct: 173 GVLQLLRRYQGRVPDLDLMFDCQDLPVVNAGDRRGRTSSSPPPLFGYCGSEPTLDIAFPD 232
Query: 201 WSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
WSFWGW E+NIKPWE+L E+ +GN NW R PYAYWKGNP V RR+LL CN S K
Sbjct: 233 WSFWGWPELNIKPWETLRGEIADGNAAVNWTGRAPYAYWKGNPTVGADRRNLLRCNASGK 292
Query: 261 HDWNARLYVQ 270
DWNAR+Y Q
Sbjct: 293 RDWNARIYEQ 302
>gi|3386610|gb|AAC28540.1| hypothetical protein [Arabidopsis thaliana]
Length = 413
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 125/224 (55%), Positives = 148/224 (66%), Gaps = 18/224 (8%)
Query: 37 PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
P +++TTPLN C +Q TQTCP + P+ SQ + + TCPDYFRWIH+
Sbjct: 76 PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
DL W+ TGITR+ LERA+ AHFRLI+ +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
PG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248
Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
W EINIK W + EG W +REPYAYWKGNP VA R
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMAR 292
>gi|449523045|ref|XP_004168535.1| PREDICTED: LOW QUALITY PROTEIN: O-glucosyltransferase rumi homolog
[Cucumis sativus]
Length = 454
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 166/253 (65%), Gaps = 23/253 (9%)
Query: 30 SRKPQNRPPKRIT-TPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---------ST 79
R+ QN P K + +P+NC T Y ++ + I +I +T
Sbjct: 6 ERELQNYPQKEVEFSPINC---------TAYSRSEKWDSGIGPTTIEEEEEDGDGKNENT 56
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP+YFRWIHEDL PW TGITR+M+ER + A FRL++V +VY+ KY + Q RDVFT+
Sbjct: 57 CPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRVYVEKYSEVFQRRDVFTL 116
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG--PNNKGPPPLFRYSGDRWTMDIV 197
WGILQLLR YP ++PDL+LMF C+D+P + +YSG PN+ PPPLFRY GD T DIV
Sbjct: 117 WGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTPPPPLFRYCGDDDTFDIV 176
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPDWSFWGW EIN+KPWE+ ++ELKE N + WIDRE YA+WKGN F++ R LL C+
Sbjct: 177 FPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSR 236
Query: 258 SDKHDWNARLYVQ 270
S + R+Y+Q
Sbjct: 237 STQS--KLRVYMQ 247
>gi|242094720|ref|XP_002437850.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
gi|241916073|gb|EER89217.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
Length = 552
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 104/196 (53%), Positives = 143/196 (72%), Gaps = 3/196 (1%)
Query: 76 PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
P +CPDYFR+IH+DL PW+ GITR+ +ERA A+FRL++V + Y+ Y+++ QTRD
Sbjct: 157 PAPSCPDYFRYIHDDLRPWRGAGITREAVERARPHAYFRLVVVGGRAYVETYRRAYQTRD 216
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY-SGPNNKGPPPLFRYSGDRWTM 194
VFT WG+LQLLR+YPGR+PDL++MF CDD +R+ D+ + P++ PP+FRY D T+
Sbjct: 217 VFTQWGVLQLLRRYPGRVPDLDIMFACDDPGQVRAADFPTTPSDA--PPVFRYCKDALTL 274
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW E+ I+PW LL E+++ N W R+PYA+WKGNP R +L+
Sbjct: 275 DIVFPDWSFWGWPEVGIRPWPQLLEEVRQENERVRWPVRQPYAFWKGNPEGYRIRHELMR 334
Query: 255 CNLSDKHDWNARLYVQ 270
CN S+ +WNARL+ Q
Sbjct: 335 CNASNGQEWNARLFTQ 350
>gi|226509990|ref|NP_001142048.1| uncharacterized protein LOC100274204 [Zea mays]
gi|194706898|gb|ACF87533.1| unknown [Zea mays]
gi|413919037|gb|AFW58969.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 508
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 118/216 (54%), Positives = 147/216 (68%), Gaps = 7/216 (3%)
Query: 62 TSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLIL 117
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+ +
Sbjct: 90 TNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAV 149
Query: 118 VNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + + G
Sbjct: 150 VAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHG 209
Query: 176 PNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
+ PPP LFRY G T+DI FPDWSFWGW E+NIKPWE+L RE+ E N NW DR
Sbjct: 210 AQPQPPPPPLFRYCGSEATLDIAFPDWSFWGWPELNIKPWEALQREINEANAVVNWTDRA 269
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
PYAYWKGNP V R LL CN S + DWNAR+Y Q
Sbjct: 270 PYAYWKGNPKVGAERLLLLRCNASGERDWNARVYAQ 305
>gi|326491107|dbj|BAK05653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 104/168 (61%), Positives = 131/168 (77%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML+RA TA+FRL+++ + YI + + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLDRARATANFRLVVLRGRAYIERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+ + +Y G N PPLFRY GD T+D+VFPDWSFWGWAEINIKPW++L ++L
Sbjct: 61 VDWPVVHADEYQGNNATAMPPLFRYCGDNETLDVVFPDWSFWGWAEINIKPWDALQKDLS 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
GN WIDREPYAYWKGNP VA R++L+ CN+S K +WNAR+Y Q
Sbjct: 121 VGNKRVRWIDREPYAYWKGNPEVAAIRQELVKCNVSSKQEWNARIYKQ 168
>gi|49388103|dbj|BAD25236.1| unknown protein [Oryza sativa Japonica Group]
gi|215766221|dbj|BAG98449.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 375
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 103/168 (61%), Positives = 131/168 (77%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
MLERA TA+FRL+++ + Y+ + + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLERARDTANFRLVVLRGRAYVERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+++ Y G N PPLFRY GD T+D+VFPDWSFWGW EINIKPW++L ++L
Sbjct: 61 VDWPVVQADRYQGENATAMPPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLD 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
GN W+DREPYAYWKGNP VA R++L+ CN+S KH+WNAR+Y Q
Sbjct: 121 IGNKRVKWVDREPYAYWKGNPDVATKRKELVKCNVSSKHEWNARIYKQ 168
>gi|449441480|ref|XP_004138510.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
gi|449518613|ref|XP_004166331.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
Length = 380
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 107/168 (63%), Positives = 130/168 (77%), Gaps = 1/168 (0%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ER +TAHFR+++V +VY+ KYK SIQTRDVFT+WGILQL R YP +LPDLELMFDC
Sbjct: 1 MVERGRRTAHFRVVIVEGRVYVEKYKGSIQTRDVFTMWGILQLARWYPKKLPDLELMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DDRPV+RS + + GPPPLFRY D ++DIVFPDWSFWGW EINIKPW+ +L ++K
Sbjct: 61 DDRPVVRSNGFMNAIS-GPPPLFRYCSDESSLDIVFPDWSFWGWGEINIKPWKMVLEDIK 119
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
EGN W DR P AYWKGNP V +RRDLL CNL+ + +W+ LYVQ
Sbjct: 120 EGNKRTRWKDRVPLAYWKGNPQVDPSRRDLLKCNLTQQQNWDTLLYVQ 167
>gi|147791845|emb|CAN70601.1| hypothetical protein VITISV_027961 [Vitis vinifera]
Length = 362
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 105/152 (69%), Positives = 127/152 (83%), Gaps = 1/152 (0%)
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N++ ++ K+++SIQTRD+FT+WGILQLLR YPGRLPDLELMFDCDDRPV+R RD+ GP+
Sbjct: 14 NHQGHVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDCDDRPVVRMRDFRGPS- 72
Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAY 238
PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++KEGN W DR P AY
Sbjct: 73 AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKWKDRVPLAY 132
Query: 239 WKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
W+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 133 WRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 164
>gi|21742339|emb|CAD41531.1| OSJNBb0020O11.18 [Oryza sativa Japonica Group]
gi|38346999|emb|CAE04583.2| OSJNBb0039L24.22 [Oryza sativa Japonica Group]
Length = 375
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 102/168 (60%), Positives = 132/168 (78%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML RA TA FRL+++ + ++H+++ + QTRD+FTIWG+LQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+R+ Y G + PPLF Y GD T+DIVFPDWSFWGW EINIKPW++L ++LK
Sbjct: 61 ADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
+GNN W+DR PYAYWKGNP VA TR++L+ CN+S DWNAR+Y Q
Sbjct: 121 DGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTKDWNARIYKQ 168
>gi|215736906|dbj|BAG95835.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 102/168 (60%), Positives = 132/168 (78%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
ML RA TA FRL+++ + ++H+++ + QTRD+FTIWG+LQLLR+YPGR+PDL+LMFDC
Sbjct: 1 MLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PV+R+ Y G + PPLF Y GD T+DIVFPDWSFWGW EINIKPW++L ++LK
Sbjct: 61 ADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLK 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
+GNN W+DR PYAYWKGNP VA TR++L+ CN+S DWNAR+Y Q
Sbjct: 121 DGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTKDWNARIYKQ 168
>gi|326505488|dbj|BAJ95415.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509389|dbj|BAJ91611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 562
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 101/192 (52%), Positives = 135/192 (70%), Gaps = 4/192 (2%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYFR IH DL PW+ GI+R+ +ER A FRL +V+ + Y+ Y + QTRD+FT
Sbjct: 168 CPDYFRHIHSDLEPWREKGISREAVERGRPKAAFRLTVVSGRAYVETYHRVFQTRDLFTQ 227
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WGI QLL +YPGR+PDL+LMF+C+D P +R+ DY P+ PPLFRY D +++++FP
Sbjct: 228 WGIAQLLARYPGRVPDLDLMFNCEDMPELRAADY--PDTSAAPPLFRYCKDGTSLEVLFP 285
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC--NL 257
DWSFWGW E+NI+PW L++E+ E N W DR+PYA+WKGNP V+ RRDL C +
Sbjct: 286 DWSFWGWPEVNIRPWGPLMKEIAEENARLPWPDRQPYAFWKGNPDVSRARRDLFRCSNDS 345
Query: 258 SDKHDWNARLYV 269
+ DWNARL+
Sbjct: 346 AAGRDWNARLFA 357
>gi|294461229|gb|ADE76177.1| unknown [Picea sitchensis]
Length = 492
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 106/194 (54%), Positives = 138/194 (71%), Gaps = 3/194 (1%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
+ CP YF+WI +D+ PW +GIT DM+E AN A FRL +VN +++I Y++ Q+RD+
Sbjct: 95 AAFCPFYFKWIRQDMKPWAKSGITIDMVEAANPEASFRLTVVNGRMFIESYRKCYQSRDL 154
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
FTIWGI QLL+ YP LPDL+LMF+CDD PVI DY+ + K PPPLFRYSG T DI
Sbjct: 155 FTIWGIAQLLKFYPRLLPDLDLMFNCDDNPVIHRGDYND-STKPPPPLFRYSGSEDTFDI 213
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWSFWGW EI PWE+L +E++ G+ W DR+P AYWKGNP++ + R+DL+ C
Sbjct: 214 VFPDWSFWGWPEIRTPPWETLAKEIQNGSQKVKWEDRDPTAYWKGNPYMGQGRQDLMNC- 272
Query: 257 LSDKHDWNARLYVQ 270
+ +H W RLY Q
Sbjct: 273 IHRRH-WGGRLYNQ 285
>gi|413925102|gb|AFW65034.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 599
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 143/248 (57%), Gaps = 26/248 (10%)
Query: 41 ITTPLNCVKNQTQTCPTNYPKT-------------SQTQESISDYSIPPTSTCPDYFRWI 87
+ P +C N + CP T + T+ S S PP CPDYFR+I
Sbjct: 151 VPIPFSCGGNGS-ACPRRQTATSPPSPSPSSSSPPAGTKSSSSKNKQPPPQ-CPDYFRFI 208
Query: 88 HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
H DLSPW+ TGITR+ +ER A FRL++V+ + Y+ Y + QTRD FT WGI QLL
Sbjct: 209 HSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGIAQLLA 268
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
+YPGR+PDL+LMF+C+D P ++ + PPLFRY D T+DIVFPDWSFWGW
Sbjct: 269 RYPGRVPDLDLMFNCEDMPEVKVKPS---EESSAPPLFRYCKDDSTVDIVFPDWSFWGWP 325
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK------- 260
E+NI+PW LL E+ W DREPYAYWKGNP V+ R DL CN S
Sbjct: 326 EVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTVSAERADLRRCNDSSSSGGTRVF 385
Query: 261 -HDWNARL 267
DW A +
Sbjct: 386 WQDWGAAI 393
>gi|226510425|ref|NP_001149452.1| LOC100283078 [Zea mays]
gi|195627344|gb|ACG35502.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 600
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 143/248 (57%), Gaps = 26/248 (10%)
Query: 41 ITTPLNCVKNQTQTCPTNYPKT-------------SQTQESISDYSIPPTSTCPDYFRWI 87
+ P +C N + CP T + T+ S S PP CPDYFR+I
Sbjct: 152 VPIPFSCGGNGS-ACPRRQTATSPPSPSPSSSSPPAATKSSSSKNKQPPPQ-CPDYFRFI 209
Query: 88 HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
H DLSPW+ TGITR+ +ER A FRL++V+ + Y+ Y + QTRD FT WGI QLL
Sbjct: 210 HSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGIAQLLA 269
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
+YPGR+PDL+LMF+C+D P ++ + PPLFRY D T+DIVFPDWSFWGW
Sbjct: 270 RYPGRVPDLDLMFNCEDMPEVKVKPS---EESSAPPLFRYCKDDSTVDIVFPDWSFWGWP 326
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK------- 260
E+NI+PW LL E+ W DREPYAYWKGNP V+ R DL CN S
Sbjct: 327 EVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTVSAERADLRRCNDSSSSGGTRVF 386
Query: 261 -HDWNARL 267
DW A +
Sbjct: 387 WQDWGAAI 394
>gi|294460113|gb|ADE75639.1| unknown [Picea sitchensis]
Length = 492
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 104/191 (54%), Positives = 134/191 (70%), Gaps = 3/191 (1%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF+WI +D+ PW +GIT DM+E A A FRL +VN ++YI Y++ Q+RD+FTI
Sbjct: 98 CPFYFKWIRQDMKPWAKSGITIDMVEAAKPEASFRLTVVNGRMYIESYRKCYQSRDLFTI 157
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WGI Q L+ YP LPDL+ MF+CDD PVI DY+ + K PPPLFRYSG T DIVFP
Sbjct: 158 WGIAQFLKFYPRLLPDLDSMFNCDDNPVIHRGDYND-STKPPPPLFRYSGSEDTFDIVFP 216
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
DWSFWGW EI PWE+L +E++ G+ W DR+P AYWKGNP++ + R+DL+ C +
Sbjct: 217 DWSFWGWPEIRTPPWETLAKEIQNGSQKVKWEDRDPTAYWKGNPYMGQGRQDLMNC-IHR 275
Query: 260 KHDWNARLYVQ 270
+H W RLY Q
Sbjct: 276 RH-WGGRLYNQ 285
>gi|242077829|ref|XP_002443683.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
gi|241940033|gb|EES13178.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
Length = 594
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 102/188 (54%), Positives = 132/188 (70%), Gaps = 2/188 (1%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
YFR+IH DLSPW+ TGITR+ +ER A FRL++V+ + Y+ Y + QTRD FT WGI
Sbjct: 198 YFRYIHSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGI 257
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
QLL +YPGR+PDL+LMF+C+D P +R+ D++ ++ PPLFRY D T+DIVFPDWS
Sbjct: 258 AQLLARYPGRVPDLDLMFNCEDMPEVRAADFAAAPSQA-PPLFRYCKDDSTLDIVFPDWS 316
Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
FWGW E+NI+PW LL E+ W +REPYAYWKGNP V+ R DLL CN S +
Sbjct: 317 FWGWPEVNIRPWAPLLEEMAAETARLPWAEREPYAYWKGNPGVSGERGDLLRCNDSSG-E 375
Query: 263 WNARLYVQ 270
W R++ Q
Sbjct: 376 WRTRVFWQ 383
>gi|356530084|ref|XP_003533614.1| PREDICTED: KDEL motif-containing protein 1-like, partial [Glycine
max]
Length = 357
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 111/226 (49%), Positives = 140/226 (61%), Gaps = 17/226 (7%)
Query: 44 PLNCVKN-QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
PLNC T TCP +Y T Q+ S STCP+YF+WIHEDL PWK TGITR+
Sbjct: 3 PLNCFNGGMTYTCPKDYYPTKFAQDQDSS----TASTCPEYFKWIHEDLKPWKRTGITRE 58
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ER ++FRL+++ K Y KY S +TRDVFT+WGILQLLR YPG +PDLEL+ +
Sbjct: 59 MMERGRNVSYFRLVIIQGKAYAKKYADSYETRDVFTVWGILQLLRLYPGDIPDLELLLET 118
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D+P++ GP PPP+F Y G + DIVFPDW F GWA++ IK L
Sbjct: 119 GDKPMVDKEQSQGP----PPPIFHYCGHKNAYDIVFPDWIFRGWADLAIK--------LD 166
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
E N W DR PYA WKG +V+ R DL CN SD+HD A+++
Sbjct: 167 ESNKKIKWKDRLPYAIWKGKTWVSHKRNDLTKCNASDQHDSYAQIH 212
>gi|356504647|ref|XP_003521107.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
Length = 373
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 97/166 (58%), Positives = 126/166 (75%), Gaps = 1/166 (0%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ER +HFRL++VN K YI K+ +S QTRDVFTIWGILQLLR YPG++PDLELMF C
Sbjct: 1 MIERGKNISHFRLVIVNGKAYIEKFAKSYQTRDVFTIWGILQLLRLYPGKVPDLELMFHC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D+ V+ +D+ GP PPP+F Y G+ + DIVFPDW+FWGWAE++I+PWE+ L ++
Sbjct: 61 GDKTVVFKKDFQGPQ-MSPPPVFHYCGEENSYDIVFPDWTFWGWAELSIRPWETTLHNIQ 119
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
EGN W DR PYA+WKGNP V+ RR+L CN+++K DWNAR+Y
Sbjct: 120 EGNKMVKWKDRIPYAFWKGNPKVSIIRRELGKCNVTEKQDWNARIY 165
>gi|55296633|dbj|BAD69335.1| unknown protein [Oryza sativa Japonica Group]
gi|55297286|dbj|BAD69071.1| unknown protein [Oryza sativa Japonica Group]
Length = 542
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 106/235 (45%), Positives = 147/235 (62%), Gaps = 2/235 (0%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
PP P +C S S P ++CPDYFR+IH+DL PW+
Sbjct: 103 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 162
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD FT WG+ QLLR+Y GR+PD+
Sbjct: 163 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFTQWGVAQLLRRYAGRVPDV 222
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
++MF CDDR +R+ D++ PP+FRY D T+D+VFPDWSFWGW E+NI W +
Sbjct: 223 DIMFACDDRGRVRAADFAA-APADAPPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPA 281
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
L ++ + W +REP+A+WKGNP VA R +L+ CN SD DWNARL+ Q
Sbjct: 282 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 336
>gi|115466478|ref|NP_001056838.1| Os06g0152700 [Oryza sativa Japonica Group]
gi|113594878|dbj|BAF18752.1| Os06g0152700 [Oryza sativa Japonica Group]
Length = 547
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 106/235 (45%), Positives = 147/235 (62%), Gaps = 2/235 (0%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
PP P +C S S P ++CPDYFR+IH+DL PW+
Sbjct: 108 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 167
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD FT WG+ QLLR+Y GR+PD+
Sbjct: 168 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFTQWGVAQLLRRYAGRVPDV 227
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
++MF CDDR +R+ D++ PP+FRY D T+D+VFPDWSFWGW E+NI W +
Sbjct: 228 DIMFACDDRGRVRAADFAA-APADAPPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPA 286
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
L ++ + W +REP+A+WKGNP VA R +L+ CN SD DWNARL+ Q
Sbjct: 287 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 341
>gi|218197602|gb|EEC80029.1| hypothetical protein OsI_21713 [Oryza sativa Indica Group]
Length = 515
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 142/204 (69%), Gaps = 2/204 (0%)
Query: 76 PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
P ++CPDYFR+IH+DL PW+ GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD
Sbjct: 146 PAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRD 205
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
FT WG+ QLLR+Y GR+PD+++MF CDDR +R+ D++ PP+FRY D T+D
Sbjct: 206 AFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFA-AAPADAPPVFRYCRDATTLD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW E+NI W + L ++ + W +REP+A+WKGNP VA R +L+ C
Sbjct: 265 VVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKC 324
Query: 256 N-LSDKHDWNARLYVQVISFIYIT 278
N SD DWNARL+ Q Y T
Sbjct: 325 NPASDGKDWNARLFSQTYRRSYQT 348
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 105/153 (68%), Gaps = 2/153 (1%)
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N +++ Y++S QTRD FT WG+ QLLR+Y GR+PD+++MF CDDR +R+ D++
Sbjct: 334 NARLFSQTYRRSYQTRDAFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPA 393
Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAY 238
PP+FRY D T+D+VFPDWSFWGW E+NI W + L ++ + W +REP+A+
Sbjct: 394 D-APPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAF 452
Query: 239 WKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
WKGNP VA R +L+ CN SD DWNARL+ Q
Sbjct: 453 WKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 485
>gi|227206228|dbj|BAH57169.1| AT1G63420 [Arabidopsis thaliana]
Length = 299
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 100/183 (54%), Positives = 130/183 (71%), Gaps = 13/183 (7%)
Query: 31 RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
R + + P+ + ++C +N++ +C ++ S+ S CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
IHEDL PW+ TGIT++M+ER TAHFRL+++N KV++ YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
RKYPG+LPD++LMFDCDDRPVIRS Y+ N PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296
Query: 204 WGW 206
WGW
Sbjct: 297 WGW 299
>gi|125559830|gb|EAZ05278.1| hypothetical protein OsI_27481 [Oryza sativa Indica Group]
Length = 579
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 116/238 (48%), Positives = 145/238 (60%), Gaps = 6/238 (2%)
Query: 41 ITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTST---CPDYFRWIHEDLSPWKV 96
I P C N T TCP + P TST CP YFR I DL+ W
Sbjct: 137 IPIPFTCGSGNDTGTCPRYAASAPAPAPASMSPPPPQTSTVDYCPSYFRHIELDLAAWVA 196
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
+GI+R+ +ER + AHFRL++V + Y+ Y+++ QTRDVFT WGILQLLR+YPGR+PDL
Sbjct: 197 SGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDVFTQWGILQLLRRYPGRVPDL 256
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
+LMF+CDD P +R+ Y PLFRY D T+D++FPDWSFWGW E+NI+PW
Sbjct: 257 DLMFNCDDMPEVRAAAYPDRAAAP--PLFRYCKDPSTLDVLFPDWSFWGWPEVNIRPWAP 314
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQVISF 274
LL E+ E W REPYAYWKGNP V+ R++LL CN S D RLY Q F
Sbjct: 315 LLAEMAEEKARLPWSRREPYAYWKGNPDVSPLRQELLRCNHSLPPDDTVRLYRQDWGF 372
>gi|115474335|ref|NP_001060764.1| Os08g0101800 [Oryza sativa Japonica Group]
gi|38637293|dbj|BAD03556.1| unknown protein [Oryza sativa Japonica Group]
gi|50725686|dbj|BAD33152.1| unknown protein [Oryza sativa Japonica Group]
gi|113622733|dbj|BAF22678.1| Os08g0101800 [Oryza sativa Japonica Group]
gi|125601894|gb|EAZ41219.1| hypothetical protein OsJ_25724 [Oryza sativa Japonica Group]
Length = 579
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 116/238 (48%), Positives = 145/238 (60%), Gaps = 6/238 (2%)
Query: 41 ITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTST---CPDYFRWIHEDLSPWKV 96
I P C N T TCP + P TST CP YFR I DL+ W
Sbjct: 137 IPIPFTCGSGNDTGTCPRYAASAPAPAPASMSPPPPQTSTVDYCPSYFRHIELDLAAWLA 196
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
+GI+R+ +ER + AHFRL++V + Y+ Y+++ QTRDVFT WGILQLLR+YPGR+PDL
Sbjct: 197 SGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDVFTQWGILQLLRRYPGRVPDL 256
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
+LMF+CDD P +R+ Y PLFRY D T+D++FPDWSFWGW E+NI+PW
Sbjct: 257 DLMFNCDDMPEVRAAAYPDRAAAP--PLFRYCKDPSTLDVLFPDWSFWGWPEVNIRPWAP 314
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQVISF 274
LL E+ E W REPYAYWKGNP V+ R++LL CN S D RLY Q F
Sbjct: 315 LLAEMAEEKARLPWSRREPYAYWKGNPDVSPLRQELLRCNHSLPPDDTVRLYRQDWGF 372
>gi|357127693|ref|XP_003565513.1| PREDICTED: uncharacterized protein LOC100832633 [Brachypodium
distachyon]
Length = 538
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 103/203 (50%), Positives = 133/203 (65%), Gaps = 3/203 (1%)
Query: 71 DYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQS 130
D S+ P CP YFR IH DL PW+ TGITRD LERA A FRL +V+ + Y+ + S
Sbjct: 131 DPSVAPPPWCPVYFRHIHTDLDPWRSTGITRDTLERAMPHAEFRLTVVSGRAYVQNLRPS 190
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG- 189
QTRDVFT WG+LQLL ++PGR+PD+++MF D + S DY PPPLFRY
Sbjct: 191 YQTRDVFTQWGVLQLLARFPGRVPDVDIMFSAGDVAQVLSADYYNTTTHPPPPLFRYCKE 250
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
++ IVFPDWSFWGW E++I+PW L+ + N W +R+PYA+WKGNP V++ R
Sbjct: 251 EKLEAAIVFPDWSFWGWPELSIRPWAPLMEDFVRENKALPWRNRQPYAFWKGNPEVSDVR 310
Query: 250 RDLLTCN--LSDKHDWNARLYVQ 270
RDL CN ++ DWNARL+ Q
Sbjct: 311 RDLFKCNNDSANGKDWNARLFWQ 333
>gi|125554122|gb|EAY99727.1| hypothetical protein OsI_21712 [Oryza sativa Indica Group]
Length = 308
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 100/196 (51%), Positives = 140/196 (71%), Gaps = 2/196 (1%)
Query: 76 PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
P ++CPDYFR+IH+DL PW+ GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD
Sbjct: 37 PAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRD 96
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
FT WG+ QLLR+Y GR+PD+++MF CDDR +R+ D++ PP+FRY D T+D
Sbjct: 97 AFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFA-AAPADAPPVFRYCRDATTLD 155
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+VFPDWSFWGW E+NI W + L ++ + W +REP+A+WKGNP VA R +L+ C
Sbjct: 156 VVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKC 215
Query: 256 N-LSDKHDWNARLYVQ 270
N SD DWNARL+ Q
Sbjct: 216 NPASDGKDWNARLFSQ 231
>gi|297849014|ref|XP_002892388.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
lyrata]
gi|297338230|gb|EFH68647.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
lyrata]
Length = 508
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 145/221 (65%), Gaps = 12/221 (5%)
Query: 54 TCPTNY---PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
+CP N PK+ Q++ + P CPD+FRWIH DL PW TG+T++ ++RA
Sbjct: 89 SCPYNAVVQPKSLQSESVSGRRTHQPQ--CPDFFRWIHRDLEPWAKTGVTKEHVKRAKAN 146
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
A FR+++++ K+Y+ Y +Q+R +FTIWGILQLL KYPG +PD+++MFDC D+P+I
Sbjct: 147 AAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLNKYPGMVPDVDMMFDCMDKPIINQ 206
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+Y P PLFRY + +DI FPDWSFWGW+E N++PWE ++K+G+ R+W
Sbjct: 207 TEY----QSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDIKQGSRRRSW 262
Query: 231 IDREPYAYWKGNP-FVAETRRDLLTCNLSDKHDWNARLYVQ 270
+++P AYWKGNP V+ R +L+ CN S W A++ Q
Sbjct: 263 DNKQPRAYWKGNPDVVSPIRLELMKCNHS--RLWGAQIMRQ 301
>gi|357497389|ref|XP_003618983.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
gi|355493998|gb|AES75201.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
Length = 226
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 102/197 (51%), Positives = 132/197 (67%), Gaps = 12/197 (6%)
Query: 15 FSVHNISRNK---TVIIVSRKPQNRPPKRITTPLNCVKNQTQ--TCPTNYPKTSQTQESI 69
F +H I +K T I + + N ++ PLNC N+T TCP +YP T +
Sbjct: 37 FIIHWIDLSKVTSTTIFKTIEVLNNIKQQPQFPLNC-NNETSNSTCPFSYPTTFHLNDD- 94
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
P+S+CP+YF+WIHEDL PW+ TGITR+M+E +HFRL++VN K YI K+ +
Sbjct: 95 -----SPSSSCPNYFKWIHEDLKPWEKTGITREMVESGKNMSHFRLVVVNGKAYIDKFAK 149
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
S QTRDVFTIWGILQLLR YPG++PDLELMF C D+ V+ + + GP N PPP+F Y G
Sbjct: 150 SYQTRDVFTIWGILQLLRLYPGKIPDLELMFQCGDKNVVDKKKFQGPENVSPPPIFHYCG 209
Query: 190 DRWTMDIVFPDWSFWGW 206
D T+DIVFPDW+FWGW
Sbjct: 210 DEDTLDIVFPDWTFWGW 226
>gi|15222341|ref|NP_172202.1| uncharacterized protein [Arabidopsis thaliana]
gi|8954024|gb|AAF82198.1|AC067971_6 Contains similarity to an unknown protein T2J13.180 gi|6522568 from
Arabidopsis thaliana BAC T2J13 gb|AL132967. ESTs
gb|Z29835 and gb|Z29836 come from this gene [Arabidopsis
thaliana]
gi|332189973|gb|AEE28094.1| uncharacterized protein [Arabidopsis thaliana]
Length = 507
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 92/192 (47%), Positives = 133/192 (69%), Gaps = 7/192 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPD+FRWIH DL PW TG+T++ ++RA A FR+++++ K+Y+ Y +Q+R +FTI
Sbjct: 115 CPDFFRWIHRDLEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTI 174
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WGILQLL KYPG +PD+++MFDC D+P+I +Y P PLFRY + +DI FP
Sbjct: 175 WGILQLLTKYPGMVPDVDMMFDCMDKPIINQTEY----QSFPVPLFRYCTNEAHLDIPFP 230
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCNLS 258
DWSFWGW+E N++PWE ++K+G+ R+W +++P AYWKGNP V+ R +L+ CN S
Sbjct: 231 DWSFWGWSETNLRPWEEEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHS 290
Query: 259 DKHDWNARLYVQ 270
W A++ Q
Sbjct: 291 --RLWGAQIMRQ 300
>gi|62320292|dbj|BAD94602.1| hypothetical protein [Arabidopsis thaliana]
Length = 507
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 92/192 (47%), Positives = 133/192 (69%), Gaps = 7/192 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPD+FRWIH DL PW TG+T++ ++RA A FR+++++ K+Y+ Y +Q+R +FTI
Sbjct: 115 CPDFFRWIHRDLEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTI 174
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WGILQLL KYPG +PD+++MFDC D+P+I +Y P PLFRY + +DI FP
Sbjct: 175 WGILQLLTKYPGMVPDVDMMFDCMDKPIINQTEY----QSFPVPLFRYCTNEAHLDIPFP 230
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCNLS 258
DWSFWGW+E N++PWE ++K+G+ R+W +++P AYWKGNP V+ R +L+ CN S
Sbjct: 231 DWSFWGWSETNLRPWEVEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHS 290
Query: 259 DKHDWNARLYVQ 270
W A++ Q
Sbjct: 291 --RLWGAQIMRQ 300
>gi|294460307|gb|ADE75735.1| unknown [Picea sitchensis]
Length = 194
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 94/168 (55%), Positives = 124/168 (73%), Gaps = 3/168 (1%)
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
M+ERA +TA FRL+++ ++YI Y +S QTRDVFTIWG++QL+ Y LPDL+LMFDC
Sbjct: 1 MVERARRTATFRLVILKGRMYIRTYTKSFQTRDVFTIWGLIQLMENYGWMLPDLDLMFDC 60
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D PVI+++ Y+ + PPPLFRY GD ++DI FPDWSFWGWAE+N +PW+ LL ++
Sbjct: 61 VDWPVIKAKAYANASLPPPPPLFRYCGDDKSLDIAFPDWSFWGWAEVNTRPWDGLLNDIL 120
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
+G W DR+P A+WKGNP+VA R DL+ CNLSD+ NARLY Q
Sbjct: 121 KGAKKLKWEDRDPTAFWKGNPYVAAVREDLMKCNLSDR---NARLYNQ 165
>gi|357119765|ref|XP_003561604.1| PREDICTED: uncharacterized protein LOC100842005 [Brachypodium
distachyon]
Length = 616
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 98/191 (51%), Positives = 129/191 (67%), Gaps = 4/191 (2%)
Query: 82 DYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
DYFR+IH DLSPW+ +G IT LERA A FRL +V+ + Y+ Y ++ QTRDVFT W
Sbjct: 216 DYFRYIHSDLSPWRASGGITLPTLERAIPHAAFRLTVVSGRAYVETYHRAFQTRDVFTQW 275
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW-TMDIVFP 199
GILQLL +YPGR+PDL+ MF+ +D P I D G PPPLFRY D +++I+FP
Sbjct: 276 GILQLLARYPGRVPDLDAMFNLEDMPEIFRNDADGNPAPPPPPLFRYCKDGGDSVEILFP 335
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN--L 257
DWSFWGW E+NI+PW L+ N W +REP+A+WKGNP+V+ R+DL CN
Sbjct: 336 DWSFWGWPEVNIRPWAPLMENFVRENRALPWQNREPFAFWKGNPYVSNARKDLFKCNNDS 395
Query: 258 SDKHDWNARLY 268
+ ++NARL+
Sbjct: 396 AAGKEFNARLF 406
>gi|302784816|ref|XP_002974180.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
gi|300158512|gb|EFJ25135.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
Length = 426
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 93/195 (47%), Positives = 128/195 (65%), Gaps = 6/195 (3%)
Query: 79 TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+CPD+F+WIH DL+PW+ +G I+R LE A + A FR+ ++ ++Y Y Q +Q+R +F
Sbjct: 1 SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60
Query: 138 TIWGILQLLRKYP-GRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
T+WG+L LL ++P G +PD+E MF+C DRP R Y ++ PPPL Y G R T+DI
Sbjct: 61 TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRY---KSRAPPPLLAYCGSRDTVDI 117
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTC 255
FPDWSFWGWAE+ I W + G+ W +R P A+WKGNP V A+ R DLL C
Sbjct: 118 AFPDWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLAC 177
Query: 256 NLSDKHDWNARLYVQ 270
N +DK D+ A +YVQ
Sbjct: 178 NKTDKRDFGADIYVQ 192
>gi|302770761|ref|XP_002968799.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
gi|300163304|gb|EFJ29915.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
Length = 423
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 93/195 (47%), Positives = 128/195 (65%), Gaps = 6/195 (3%)
Query: 79 TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+CPD+F+WIH DL+PW+ +G I+R LE A + A FR+ ++ ++Y Y Q +Q+R +F
Sbjct: 1 SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60
Query: 138 TIWGILQLLRKYP-GRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
T+WG+L LL ++P G +PD+E MF+C DRP R Y ++ PPPL Y G R T+DI
Sbjct: 61 TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRY---KSRAPPPLLAYCGSRDTVDI 117
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTC 255
FPDWSFWGWAE+ I W + G+ W +R P A+WKGNP V A+ R DLL C
Sbjct: 118 AFPDWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLAC 177
Query: 256 NLSDKHDWNARLYVQ 270
N +DK D+ A +YVQ
Sbjct: 178 NKTDKRDFGADIYVQ 192
>gi|302783306|ref|XP_002973426.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300159179|gb|EFJ25800.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 449
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 89/189 (47%), Positives = 121/189 (64%), Gaps = 1/189 (0%)
Query: 71 DYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQS 130
++S CPDYF+WI DLSPWK TGI++ L+ A A FR+++V+ K+Y+ +Y+
Sbjct: 88 NFSAQAQGKCPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYC 147
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
QTR +T+WGI LL ++PG++PDLELMF C DRP + ++YS + PPPLF Y
Sbjct: 148 YQTRAQYTLWGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVWPPPPLFSYCTT 207
Query: 191 R-WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
R DIVFPDWSFWGW E+NI PW ++ G W+ REP A WKGN ++ + R
Sbjct: 208 RDEHYDIVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGNTWMGKIR 267
Query: 250 RDLLTCNLS 258
L+ CN S
Sbjct: 268 PMLVQCNSS 276
>gi|395146518|gb|AFN53673.1| DUF821 [Linum usitatissimum]
Length = 474
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 87/185 (47%), Positives = 117/185 (63%), Gaps = 5/185 (2%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
CP++F +IH DL PW +GIT D L A A FR+++ K+Y Y Q+R +
Sbjct: 106 AQQCPEFFSYIHRDLEPWAKSGITEDQLMEAKNFAAFRIVIYQGKLYFDPYYACFQSRMM 165
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
TIWG LQLL+KYPG +PD++LMFDC D+P+ ++ P PLFRY R DI
Sbjct: 166 TTIWGFLQLLKKYPGMVPDVDLMFDCMDKPIFNRTEHQA----NPVPLFRYCTTREHFDI 221
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTC 255
FPDWSFWGW+EINI+PW ++K+G+ + W R+P+A+WKGNP V+ R +LL C
Sbjct: 222 PFPDWSFWGWSEINIRPWSEEFPDIKKGSQAKRWAKRQPHAFWKGNPDVVSPVRLELLQC 281
Query: 256 NLSDK 260
N S K
Sbjct: 282 NDSRK 286
>gi|326495828|dbj|BAJ90536.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 486
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/288 (38%), Positives = 154/288 (53%), Gaps = 27/288 (9%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I++ S ++ +SR +T++ + +P + RPP R L C
Sbjct: 30 IAAFSAVIVIAVLHSAYDDALSRTRTLLGHNLEPTPWHPFPHDKGRPPPRAA--LRCASY 87
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK-----VTGITRDMLE 105
+ P + PK + + + CP YF I DL+PW+ GITR +LE
Sbjct: 88 LSCLPPLSQPKPAAAAAANASR----PRQCPSYFAAIRRDLAPWRRRDGGGGGITRALLE 143
Query: 106 RANQTAHFRLILVNNKVYIHK--YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
A A R+ + +H Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC
Sbjct: 144 SARFRASMRVTITGGGTRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCM 203
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
DRP I ++SG PPPLFRY R +DI FPDWSFWGW E +I+PW R +K+
Sbjct: 204 DRPAINRTEHSGDGAPPPPPLFRYCTTRDHLDIPFPDWSFWGWPETHIEPWSREFRSIKQ 263
Query: 224 GNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
G+ W DR P AYWKGNP VA R LL CN D + W A + Q
Sbjct: 264 GSRRVKWPDRVPTAYWKGNPDVASPLRLALLACN--DTNLWRAEIMRQ 309
>gi|297817442|ref|XP_002876604.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
lyrata]
gi|297322442|gb|EFH52863.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
lyrata]
Length = 229
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 91/165 (55%), Positives = 116/165 (70%), Gaps = 12/165 (7%)
Query: 44 PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
PLNC N QTCP+NYP S+ + +IS + TCPDYFRWI +DL W+ TGITR+
Sbjct: 69 PLNC--NAKQTCPSNYP--SRFEPAISS-----SETCPDYFRWIQQDLKAWEETGITRET 119
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
LERA AHFRL++ + ++Y+H+Y ++ ++RDV TIWGILQLLR YPG++PDLEL+F C
Sbjct: 120 LERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVLTIWGILQLLRMYPGQVPDLELLFFCH 179
Query: 164 DRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWSFWG 205
D+P I RD+ P PPPLF+Y G R IVFPDWSFWG
Sbjct: 180 DKPAIWKRDFRQPQPNATWPPPPLFQYCGHREAYGIVFPDWSFWG 224
>gi|449455154|ref|XP_004145318.1| PREDICTED: uncharacterized protein LOC101204476 [Cucumis sativus]
Length = 472
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 123/192 (64%), Gaps = 7/192 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FRWIH DL PW T I+ LE + + A FR+++V ++Y+ Y +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG++Q+LR+YPG +PD+++MFDC D+P I + N P PLFRY DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
DWSFWGW E+N++ W ++K+G+ +W ++ P AYWKGNP V + R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275
Query: 259 DKHDWNARLYVQ 270
W A++ Q
Sbjct: 276 RM--WGAQIMRQ 285
>gi|449520138|ref|XP_004167091.1| PREDICTED: uncharacterized protein LOC101228589 [Cucumis sativus]
Length = 472
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 123/192 (64%), Gaps = 7/192 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FRWIH DL PW T I+ LE + + A FR+++V ++Y+ Y +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG++Q+LR+YPG +PD+++MFDC D+P I + N P PLFRY DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
DWSFWGW E+N++ W ++K+G+ +W ++ P AYWKGNP V + R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275
Query: 259 DKHDWNARLYVQ 270
W A++ Q
Sbjct: 276 RM--WGAQIMRQ 285
>gi|449471103|ref|XP_004153209.1| PREDICTED: uncharacterized protein LOC101204904 [Cucumis sativus]
Length = 472
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 123/192 (64%), Gaps = 7/192 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FRWIH DL PW T I+ LE + + A FR+++V ++Y+ Y +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG++Q+LR+YPG +PD+++MFDC D+P I + N P PLFRY DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
DWSFWGW E+N++ W ++K+G+ +W ++ P AYWKGNP V + R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275
Query: 259 DKHDWNARLYVQ 270
W A++ Q
Sbjct: 276 RM--WGAQIMRQ 285
>gi|356569533|ref|XP_003552954.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
Length = 464
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 128/192 (66%), Gaps = 7/192 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP++FR IH DL+PW + I++ + A + A FR+++V KV++ Y +Q+R +FT+
Sbjct: 104 CPEFFRAIHRDLAPWLESRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTL 163
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG+LQL+R+YPG++PD+++MFDC D+P + ++ P PLFRY + DI FP
Sbjct: 164 WGLLQLMRRYPGKVPDVDMMFDCMDKPSVNRTEHQA----MPLPLFRYCTTKEHFDIPFP 219
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLS 258
DWSFWGW+EINI+PW+ ++K+G+ +W ++ P+AYWKGNP VA R +L+ CN
Sbjct: 220 DWSFWGWSEINIRPWQEEFPDIKQGSRNVSWKNKFPWAYWKGNPDVASPIRTELINCN-- 277
Query: 259 DKHDWNARLYVQ 270
D W A + Q
Sbjct: 278 DSRKWGAEIMRQ 289
>gi|302789424|ref|XP_002976480.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
gi|300155518|gb|EFJ22149.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
Length = 357
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 89/183 (48%), Positives = 119/183 (65%), Gaps = 4/183 (2%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CPDYF+WI DLSPWK TGI++ L+ A A FR+++V+ K+Y+ +Y+ QTR +T+
Sbjct: 1 CPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYCYQTRAQYTL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG---PPPLFRYSGDR-WTMD 195
WGI LL ++PG++PDLELMF C DRP + ++YS + K PPPLF Y R D
Sbjct: 61 WGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVKRRWPPPPLFSYCTTRDEHYD 120
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
IVFPDWSFWGW E+NI PW ++ G W+ REP A WKGN ++ + R L+ C
Sbjct: 121 IVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGNTWMGKIRPMLVQC 180
Query: 256 NLS 258
N S
Sbjct: 181 NSS 183
>gi|255569363|ref|XP_002525649.1| conserved hypothetical protein [Ricinus communis]
gi|223535085|gb|EEF36767.1| conserved hypothetical protein [Ricinus communis]
Length = 491
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 87/194 (44%), Positives = 125/194 (64%), Gaps = 7/194 (3%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP++FR+IH DL PW TGIT+ + A + A FR+++ ++Y+ Y +Q+R +F
Sbjct: 102 AKCPEFFRFIHHDLQPWARTGITKKHIAEAKKFAAFRVVIFEGRLYLDLYYACVQSRMMF 161
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
T+WG+LQLL +YPG +PD+++MFDC DRPVI ++ P P+FRY + DI
Sbjct: 162 TVWGLLQLLNRYPGMVPDVDIMFDCMDRPVINKTEHIS----FPLPIFRYCTTQNHFDIP 217
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCN 256
FPDWSFWGW EINI+ W R++K G+ ++W + P AYWKGNP V + R +L+ CN
Sbjct: 218 FPDWSFWGWPEINIRSWNEEFRDIKRGSQSKSWSKKWPRAYWKGNPDVLSPIRTELMQCN 277
Query: 257 LSDKHDWNARLYVQ 270
S K W A + Q
Sbjct: 278 HSRK--WGAHIMRQ 289
>gi|356541729|ref|XP_003539326.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
Length = 464
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 124/192 (64%), Gaps = 7/192 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP +FR IH DL+PW + I++ + A + A FR+++V KV++ Y +Q+R +FT+
Sbjct: 104 CPKFFRAIHRDLAPWSESRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTL 163
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
WG+LQL+R+YPG +PD+++MFDC D+P + ++ P PLFRY + DI FP
Sbjct: 164 WGLLQLMRRYPGMVPDVDMMFDCMDKPSVNKTEHQA----MPLPLFRYCTTKEHFDIPFP 219
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLS 258
DWSFWGW+EINI+PW+ ++K G+ W ++ P+AYWKGNP VA R +L+ CN
Sbjct: 220 DWSFWGWSEINIRPWQEEFPDIKRGSRSVTWKNKLPWAYWKGNPDVASPIRTELINCN-- 277
Query: 259 DKHDWNARLYVQ 270
D W A + Q
Sbjct: 278 DSRKWGAEIMRQ 289
>gi|302808049|ref|XP_002985719.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300146628|gb|EFJ13297.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 386
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 85/194 (43%), Positives = 122/194 (62%), Gaps = 3/194 (1%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD FT+
Sbjct: 1 CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-MDI 196
WG+L LL++YPG +PD++LMF+C D P++ ++ N PPPLF Y R DI
Sbjct: 61 WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRGDHYDI 120
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+ C
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVKCT 180
Query: 257 LSDKHDWNARLYVQ 270
++ H++N Y Q
Sbjct: 181 ANNTHNYNLVTYHQ 194
>gi|413917716|gb|AFW57648.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 500
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 24/285 (8%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I+L S ++ +SR +T++ + +P + RPP R L C
Sbjct: 46 IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 102
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
+ C +S + + + CP YF IH DL+PW+ G +TR +L+ A
Sbjct: 103 -SIACLPPLSSSSSSPQPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 160
Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
+ A R+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 161 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 220
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I +R G PPPLFRY R DI FPDWSFWGW E NI+PW + ++ G
Sbjct: 221 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRAGAR 279
Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQ 270
W DR P AYWKGNP VA R+ LL CN D W A + Q
Sbjct: 280 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQ 322
>gi|194707538|gb|ACF87853.1| unknown [Zea mays]
Length = 480
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 24/285 (8%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I+L S ++ +SR +T++ + +P + RPP R L C
Sbjct: 26 IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 82
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
+ C +S + + + CP YF IH DL+PW+ G +TR +L+ A
Sbjct: 83 -SIACLPPLSSSSSSPQPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 140
Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
+ A R+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 141 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 200
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I +R G PPPLFRY R DI FPDWSFWGW E NI+PW + ++ G
Sbjct: 201 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRAGAR 259
Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQ 270
W DR P AYWKGNP VA R+ LL CN D W A + Q
Sbjct: 260 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQ 302
>gi|302785297|ref|XP_002974420.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300158018|gb|EFJ24642.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 386
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 85/194 (43%), Positives = 122/194 (62%), Gaps = 3/194 (1%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD FT+
Sbjct: 1 CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-MDI 196
WG+L LL++YPG +PD++LMF+C D P++ ++ N PPPLF Y R DI
Sbjct: 61 WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRRDHYDI 120
Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
VFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+ C
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAEKLDWSHRQPIAFWKGNYDMGPARADLVKCT 180
Query: 257 LSDKHDWNARLYVQ 270
++ H++N Y Q
Sbjct: 181 ANNTHNYNLVTYHQ 194
>gi|226497664|ref|NP_001151574.1| lipopolysaccharide-modifying protein [Zea mays]
gi|195647860|gb|ACG43398.1| lipopolysaccharide-modifying protein [Zea mays]
Length = 480
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 107/285 (37%), Positives = 152/285 (53%), Gaps = 24/285 (8%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
I F+ I+L S ++ +SR +T++ + +P + RPP R L C
Sbjct: 26 IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 82
Query: 51 QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
+ C +S + + CP YF IH DL+PW+ G +TR +L+ A
Sbjct: 83 -SIACLPPLSPSSSSSPPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 140
Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
+ A R+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 141 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 200
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I +R G PPPLFRY R DI FPDWSFWGW + NI+PW + ++ G
Sbjct: 201 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPDTNIEPWNHEFKNIRAGAR 259
Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQ 270
W DR P AYWKGNP VA R+ LL CN D W A + Q
Sbjct: 260 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQ 302
>gi|222629271|gb|EEE61403.1| hypothetical protein OsJ_15589 [Oryza sativa Japonica Group]
Length = 535
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 89/189 (47%), Positives = 114/189 (60%), Gaps = 1/189 (0%)
Query: 83 YFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
YFR+IHEDL PW+ GITR ML RA TA FRL+++ + ++H++ + +
Sbjct: 140 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFARHSRPAISSPSGA 199
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
PG P P +G PPLF Y GD T+DIVFPDW
Sbjct: 200 SSSCSAATPGGSPTSTSCSTAPTGPSSARTSTAGSTLPFMPPLFSYCGDDRTLDIVFPDW 259
Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
SFWGW EINIKPW++L ++LK+GNN W+DR PYAYWKGNP VA TR++L+ CN+S
Sbjct: 260 SFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTK 319
Query: 262 DWNARLYVQ 270
DWNAR+Y Q
Sbjct: 320 DWNARIYKQ 328
>gi|194702520|gb|ACF85344.1| unknown [Zea mays]
gi|238009730|gb|ACR35900.1| unknown [Zea mays]
gi|238011028|gb|ACR36549.1| unknown [Zea mays]
Length = 435
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 91/196 (46%), Positives = 120/196 (61%), Gaps = 8/196 (4%)
Query: 80 CPDYFRWIHEDLSPWKVTG--ITRDMLERANQTAHFRLILVNN--KVYIHKYKQSIQTRD 135
CP YF IH DL+PW+ G +TR +L+ A + A R+ + ++++ Y +Q+R
Sbjct: 65 CPAYFAAIHRDLAPWRGQGRGVTRALLDEARRRASMRVTITGGGRRLHVDLYYACVQSRA 124
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FT+W +LQL+R+YPGR+PD++LMFDC DRP I +R G PPPLFRY R D
Sbjct: 125 LFTVWSLLQLMRRYPGRVPDVDLMFDCMDRPAI-NRTEHGDGALPPPPLFRYCTTRDHFD 183
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD-LLT 254
I FPDWSFWGW E NI+PW + ++ G W DR P AYWKGNP VA R+ LL
Sbjct: 184 IPFPDWSFWGWPETNIEPWNHEFKNIRAGARATRWADRVPTAYWKGNPDVASPLREALLG 243
Query: 255 CNLSDKHDWNARLYVQ 270
CN D W A + Q
Sbjct: 244 CN--DTALWRAEIMRQ 257
>gi|302823162|ref|XP_002993235.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
gi|300138905|gb|EFJ05656.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
Length = 475
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 83/196 (42%), Positives = 121/196 (61%), Gaps = 3/196 (1%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD F
Sbjct: 85 AQCPSYFSWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDF 144
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-M 194
T+WG+L LL +YPG +PD++LMF+C D P++ ++ N PPPLF Y R
Sbjct: 145 TLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHKPEKNGSWPPPPLFLYCTSRGEHY 204
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+
Sbjct: 205 DIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVK 264
Query: 255 CNLSDKHDWNARLYVQ 270
C ++ ++N + Q
Sbjct: 265 CTANNSQNYNLVTHHQ 280
>gi|302794981|ref|XP_002979254.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300153022|gb|EFJ19662.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 474
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 83/196 (42%), Positives = 121/196 (61%), Gaps = 3/196 (1%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF WI DL+PWK TGI++ L+ A A FR+++VN +Y+ +Y + +TRD F
Sbjct: 85 AHCPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDF 144
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-M 194
T+WG+L LL +YPG +PD++LMF+C D P++ ++ N PPPLF Y R
Sbjct: 145 TLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRGEHY 204
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWS+WGW E+NI PW ++ G +W R+P A+WKGN + R DL+
Sbjct: 205 DIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVK 264
Query: 255 CNLSDKHDWNARLYVQ 270
C ++ ++N + Q
Sbjct: 265 CTANNSQNYNLVTHHQ 280
>gi|124360767|gb|ABN08741.1| Protein of unknown function DUF821, CAP10-like [Medicago
truncatula]
Length = 185
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 82/163 (50%), Positives = 105/163 (64%), Gaps = 3/163 (1%)
Query: 45 LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
L C N TQTCP ++ + S TSTCP++FRWIHEDL PWK TGITR+M
Sbjct: 25 LKCFNGNPTQTCPNDHSPIQAFHQDQDPSS--STSTCPEHFRWIHEDLEPWKSTGITREM 82
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
++ + R+++ KVY+ Y S QTR FT+WGI+QLLR YPGR+PDLEL+F+
Sbjct: 83 VDSGENISQLRIVIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETG 142
Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
DR V+ + Y GP PPP+F Y G +DIVFPDWS+WGW
Sbjct: 143 DRAVVDKKRYEGPEAVTPPPIFSYCGKNEALDIVFPDWSYWGW 185
>gi|359478546|ref|XP_003632132.1| PREDICTED: protein O-glucosyltransferase 1-like [Vitis vinifera]
gi|297745896|emb|CBI15952.3| unnamed protein product [Vitis vinifera]
Length = 464
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 91/201 (45%), Positives = 123/201 (61%), Gaps = 5/201 (2%)
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T++ E + S CP +F I DL PW +GI+ + A + A FR+++V
Sbjct: 87 PTTTKIPEWHTRQSSNTVGKCPMFFTRIDHDLQPWVRSGISLSSVMEAQKFAAFRVVIVG 146
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y+ + +Q+R +FT+WG+LQLLR+YPG +PD++LMFDC D+P I SR+ G
Sbjct: 147 GKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKPTI-SREEHG---S 202
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
P PLFRY DI FPDWSFWGW EI+I PW+ +K+G+ NW + YAYW
Sbjct: 203 KPLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVLNWTQKLSYAYW 262
Query: 240 KGNPFV-AETRRDLLTCNLSD 259
KGNP V + R DLL CN SD
Sbjct: 263 KGNPDVQSPVRVDLLQCNNSD 283
>gi|302799316|ref|XP_002981417.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
gi|300150957|gb|EFJ17605.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
Length = 342
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/180 (47%), Positives = 120/180 (66%), Gaps = 6/180 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WIHEDL+PW +GITR+ L+ A + FRL++V+ K+Y + ++ IQTRD T+
Sbjct: 1 CPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEITL 60
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT--MDIV 197
G+L LL K+PG +PD+E++F C+D P + +Y +K PPP+F Y R+ DI+
Sbjct: 61 QGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEY---RSKPPPPIFSYCTSRFGGHFDIL 117
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCN 256
FPDWSFWGW ++ I+PWE E+ EG N +W +RE AYWKGN +V R +LL CN
Sbjct: 118 FPDWSFWGWPQVKIRPWEQESVEIFEGANETDWFEREAIAYWKGNLWVMTPVREELLQCN 177
>gi|147785893|emb|CAN70836.1| hypothetical protein VITISV_015872 [Vitis vinifera]
Length = 922
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/201 (45%), Positives = 123/201 (61%), Gaps = 5/201 (2%)
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
P T++ E + S CP +F I DL PW +GI+ + A + A FR+++V
Sbjct: 545 PTTTKIPEWHTRQSSNTVGKCPMFFTRIXHDLQPWVRSGISLSSVMEAQKFAAFRVVIVG 604
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y+ + +Q+R +FT+WG+LQLLR+YPG +PD++LMFDC D+P I SR+ G
Sbjct: 605 GKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKPTI-SREEHGSK-- 661
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
P PLFRY DI FPDWSFWGW EI+I PW+ +K+G+ NW + YAYW
Sbjct: 662 -PLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVLNWTQKLSYAYW 720
Query: 240 KGNPFV-AETRRDLLTCNLSD 259
KGNP V + R DLL CN SD
Sbjct: 721 KGNPDVQSPVRVDLLQCNNSD 741
>gi|357129812|ref|XP_003566555.1| PREDICTED: O-glucosyltransferase rumi-like [Brachypodium
distachyon]
Length = 493
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 160/289 (55%), Gaps = 32/289 (11%)
Query: 2 IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRIT---TPLNC 47
I F+ I++ S ++ +SR +T++ + +P + RPP R PL
Sbjct: 35 IAAFSAVIVIAVLHSAYDEAVSRTRTLLGHNLEPTPWHPFPHDKGRPPPRAALRCAPLLS 94
Query: 48 VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV---TGITRDML 104
+ P S+++ T CP YF IH DL+PW+ GITR++L
Sbjct: 95 CLPPLSHPHPSPPNASRSRR---------TKQCPAYFAAIHRDLAPWRRHGHGGITRELL 145
Query: 105 ERANQTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
+ A A R+ + N ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC
Sbjct: 146 DSARSRASMRVTITGNGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDC 205
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
DRP I +R GPN PPPLFRY + +DI FPDWSFWGW E +I PW R +K
Sbjct: 206 MDRPAI-NRTTGGPNPPLPPPLFRYCTTKDHLDIPFPDWSFWGWPETHINPWAKEFRAIK 264
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
+G+ W DR P A+WKGNP VA R LL CN D + W+A++ Q
Sbjct: 265 QGSRRVKWGDRVPLAFWKGNPDVASPLRLALLACN--DTNLWHAQIMRQ 311
>gi|302773059|ref|XP_002969947.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300162458|gb|EFJ29071.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 325
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 85/180 (47%), Positives = 120/180 (66%), Gaps = 6/180 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP YF WIHEDL+PW +GITR+ L+ A + FRL++V+ K+Y + ++ IQTRD T+
Sbjct: 2 CPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEITL 61
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT--MDIV 197
G+L LL K+PG +PD+E++F C+D P + +Y +K PPP+F Y R+ DI+
Sbjct: 62 QGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEY---RSKPPPPVFSYCTSRFGGHFDIL 118
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCN 256
FPDWSFWGW ++ I+PWE E+ +G N +W +RE AYWKGN +V R +LL CN
Sbjct: 119 FPDWSFWGWPQVKIRPWEQESVEIFDGANETDWFEREAIAYWKGNLWVMTPVREELLQCN 178
>gi|242086909|ref|XP_002439287.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
gi|241944572|gb|EES17717.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
Length = 497
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 152/281 (54%), Gaps = 23/281 (8%)
Query: 9 IILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKNQTQTCPT 57
I+L S ++ +SR +T++ + +P + RPP R L C + P
Sbjct: 35 IVLAVVHSAYDDALSRTRTLLGHNLEPTPWHPFPHSKGRPPARAA--LRCAPSIACLPPL 92
Query: 58 NYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV--TGITRDMLERANQTAHFR 114
++P T+ S + + CP YF IH DL+PW+ G+TR +L+ A + A R
Sbjct: 93 SHPPGTANASSSAAAAAGEQRRQCPAYFAAIHRDLAPWRGPGRGVTRALLDAARRRASMR 152
Query: 115 LILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRD 172
+ + ++++ Y +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP I D
Sbjct: 153 VTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRPAINRTD 212
Query: 173 YSGPNNKGPPP--LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+ + PPP LFRY R DI FPDWSFWGW E NIKPW + +K G W
Sbjct: 213 HGDGDPSSPPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIKPWNREFKSIKMGARATRW 272
Query: 231 IDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
DR P AYWKGNP VA R LL CN D W A + Q
Sbjct: 273 ADRVPTAYWKGNPDVASPLRVALLGCN--DTAAWRAEIMRQ 311
>gi|302821372|ref|XP_002992349.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
gi|300139892|gb|EFJ06625.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
Length = 330
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 12/199 (6%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
S CP YF WI DL+PWK GIT L+ A + A FR++++N +Y+ +Y + QTRD F
Sbjct: 4 SQCPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDF 63
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRY--SGDRWTM 194
T+ G+ LL ++PG +PD++LMF C D P I R++ G ++ PPPLF Y +
Sbjct: 64 TLRGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAK---GSDDPSPPPLFSYCTTARGENY 120
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW E+NI PWE +++ G W R+ +AYWKGN + R DL+
Sbjct: 121 DIVFPDWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNYWTGAARPDLVK 180
Query: 255 CNLSDKHDWNARLYVQVIS 273
C N L+V +IS
Sbjct: 181 CAA------NKDLFVSMIS 193
>gi|302795177|ref|XP_002979352.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300153120|gb|EFJ19760.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 329
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 12/199 (6%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
S CP YF WI DL+PWK GIT L+ A + A FR++++N +Y+ +Y + QTRD F
Sbjct: 2 SQCPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDF 61
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRY--SGDRWTM 194
T+ G+ LL ++PG +PD++LMF C D P I R++ G ++ PPPLF Y +
Sbjct: 62 TLRGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAK---GSDDPSPPPLFSYCTTARGENY 118
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DIVFPDWSFWGW E+NI PWE +++ G W R+ +AYWKGN + R DL+
Sbjct: 119 DIVFPDWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNYWTGAARPDLVK 178
Query: 255 CNLSDKHDWNARLYVQVIS 273
C N L+V +IS
Sbjct: 179 CAA------NKDLFVSMIS 191
>gi|297745249|emb|CBI40329.3| unnamed protein product [Vitis vinifera]
Length = 341
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 74/121 (61%), Positives = 88/121 (72%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
YPG+LPD +LMF+C DRP+IR+ Y GP+ PPPLF Y GD T DIVFPDWSFWGW E
Sbjct: 2 YPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGWPE 61
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
NIKPW ++LKEGN WIDREPYAYWKGN + R++L C +D+ DWNARLY
Sbjct: 62 TNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNARLY 121
Query: 269 V 269
+
Sbjct: 122 I 122
>gi|302819894|ref|XP_002991616.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
gi|300140649|gb|EFJ07370.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
Length = 324
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 82/190 (43%), Positives = 111/190 (58%), Gaps = 9/190 (4%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
TCP YF+WI DL+PWK GITR LE + A+FR+I++ K+Y YKQ QTR +T
Sbjct: 2 TCPSYFKWIEHDLAPWK-GGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEYT 60
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIV 197
+ G+ LL K+PG +PD+++MF+C D P++ Y + PPP+F Y R DIV
Sbjct: 61 LKGLAMLLNKFPGMVPDVDIMFNCQDHPLVPRWRYLFTS---PPPVFGYCTTRNRHYDIV 117
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN- 256
FPDWS WGW E+NI PW + +W R+P AYW+GN + R +L+ CN
Sbjct: 118 FPDWSIWGWPEVNILPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQMGLIRSNLVKCNS 177
Query: 257 ---LSDKHDW 263
L DW
Sbjct: 178 TNILIQHQDW 187
>gi|302815695|ref|XP_002989528.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
gi|300142706|gb|EFJ09404.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
Length = 342
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 72/182 (39%), Positives = 112/182 (61%), Gaps = 5/182 (2%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
+CP YF I E L+PW+ TGITR L+R + R+ ++ ++Y+ Y Q+R VF+
Sbjct: 3 SCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSVFS 62
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
+WG+L +L ++ R+PD+E + +C DRP++ R Y G P P+ Y R ++DI
Sbjct: 63 LWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGG----APAPVLSYCSHRHSLDIP 118
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
FPD+SFWGW E+NI+PWE +E+ +G+ W R+P A+WKGN + + R L CN
Sbjct: 119 FPDYSFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLRMGKLRNLLARCNS 178
Query: 258 SD 259
++
Sbjct: 179 TE 180
>gi|302761718|ref|XP_002964281.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168010|gb|EFJ34614.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 342
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 72/184 (39%), Positives = 113/184 (61%), Gaps = 5/184 (2%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
++CP YF I E L+PW+ TGITR L+R + R+ ++ ++Y+ Y Q+R V
Sbjct: 1 AASCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSV 60
Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMD 195
F++WG+L +L ++ R+PD+E + +C DRP++ R Y G P P+ Y R ++D
Sbjct: 61 FSLWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGGV----PSPVLSYCSHRHSLD 116
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
I FPD+SFWGW E+NI+PWE +E+ +G+ W R+P A+WKGN + + R L C
Sbjct: 117 IPFPDYSFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLRMGKLRNLLARC 176
Query: 256 NLSD 259
N ++
Sbjct: 177 NSTE 180
>gi|302776736|ref|XP_002971516.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300160648|gb|EFJ27265.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 318
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 81/190 (42%), Positives = 109/190 (57%), Gaps = 9/190 (4%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
TCP YF WI DL+PWK GITR LE + A+FR+I++ K+Y YKQ QTR +T
Sbjct: 3 TCPSYFNWIEHDLAPWK-GGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEYT 61
Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIV 197
+ G+ LL ++PG +PD+++MF+C D P++ Y PPP+F Y R DIV
Sbjct: 62 LKGLAMLLNEFPGMVPDVDIMFNCQDHPLVPRWRYLF---TSPPPVFGYCTTRNRHYDIV 118
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN- 256
FPDWS WGW E+NI PW + +W R+P AYW+GN + R +L+ CN
Sbjct: 119 FPDWSIWGWPEVNIPPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQMGLIRSNLVKCNS 178
Query: 257 ---LSDKHDW 263
L DW
Sbjct: 179 TNILIQHQDW 188
>gi|302815579|ref|XP_002989470.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
gi|300142648|gb|EFJ09346.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
Length = 329
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 113/179 (63%), Gaps = 8/179 (4%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF IHE L+PW+ TGITR L+R R+ ++ + +Y+ Y Q+R F
Sbjct: 1 ALCPRYFARIHEHLAPWRSTGITRRSLDR---LGSMRVAILRSDMYVSAYGDCPQSRRAF 57
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
++WG+L +L ++ R+PD+E + +C DRP++ RD S + P P+ Y R ++DI
Sbjct: 58 SLWGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGS----EVPAPVLSYCSHRLSLDIP 112
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
FPD+SFWGW E+NI+PWE +E+ +G+ G W +R+P+A+WKGN + + R L CN
Sbjct: 113 FPDYSFWGWPEVNIRPWEEESQEIFQGSQGVEWSNRQPFAFWKGNLRMGKLRTLLARCN 171
>gi|302761722|ref|XP_002964283.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168012|gb|EFJ34616.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 401
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 72/177 (40%), Positives = 105/177 (59%), Gaps = 1/177 (0%)
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
P YF I DL PWK GIT+ L A + R+++ K+YI Y + Q+R +FT W
Sbjct: 58 PKYFDSIDTDLRPWKDGGITKSSLSAARRKGSMRMVISQGKLYIEVYGKCPQSRSIFTAW 117
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
G+L LL ++PG++PD++ + +C DRPVI R + + PP +F Y +DI FP
Sbjct: 118 GLLLLLERFPGKVPDVDFVLNCKDRPVITRFLSFQRFVSGSPPAVFSYCTTNDMLDIPFP 177
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
D+SFWGW E++I PWE +++ G+ W +R P A+WKGN + + RR LL C
Sbjct: 178 DFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSRMGKLRRHLLQCQ 234
>gi|224059286|ref|XP_002299807.1| predicted protein [Populus trichocarpa]
gi|222847065|gb|EEE84612.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 75/165 (45%), Positives = 105/165 (63%), Gaps = 7/165 (4%)
Query: 107 ANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
A A FR+++ ++Y+ Y +Q+R +FTIWG LQLL++YPG +PD+++MFDC D+P
Sbjct: 1 AKNYASFRVVIYKGRLYLDPYYACVQSRMMFTIWGFLQLLKRYPGMVPDVDIMFDCMDKP 60
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
I ++ + P PLFRY + DI FPDWSFWGW E+NI+PW+ R++K G
Sbjct: 61 SINKTEH----DSFPLPLFRYCTTKDHFDIPFPDWSFWGWPEVNIRPWDEEFRDIKRGAQ 116
Query: 227 GRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
R+W + P AYWKGNP V TR LL CN + K W A++ Q
Sbjct: 117 ARSWPKKWPRAYWKGNPDVGSPTRTSLLECNHTKK--WGAQIMRQ 159
>gi|147802992|emb|CAN70737.1| hypothetical protein VITISV_008287 [Vitis vinifera]
Length = 694
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 79/135 (58%), Positives = 88/135 (65%), Gaps = 29/135 (21%)
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
V+T W I LLR YPGRLPDLELMFDCDDRPV+R RD+ GP N PPPLFRY GD W++D
Sbjct: 377 VYTGW-IDALLRWYPGRLPDLELMFDCDDRPVVRMRDFRGP-NXAPPPLFRYCGDDWSLD 434
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
IVFPDWSFWGW P AYW+GNP VA TR DLL C
Sbjct: 435 IVFPDWSFWGWV---------------------------PLAYWRGNPHVAPTRGDLLKC 467
Query: 256 NLSDKHDWNARLYVQ 270
N+SDK +WN RLY+Q
Sbjct: 468 NVSDKAEWNTRLYLQ 482
>gi|413919038|gb|AFW58970.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 241
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/153 (51%), Positives = 102/153 (66%), Gaps = 7/153 (4%)
Query: 61 KTSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLI 116
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+
Sbjct: 89 STNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVA 148
Query: 117 LVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
+V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +
Sbjct: 149 VVAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERH 208
Query: 175 GP-NNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
G PPPLFRY G T+DI FPDWSFWGW
Sbjct: 209 GAQPQPPPPPLFRYCGSEATLDIAFPDWSFWGW 241
>gi|302762508|ref|XP_002964676.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300168405|gb|EFJ35009.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 330
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 112/179 (62%), Gaps = 8/179 (4%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ CP YF IHE L+PW+ TGITR L+R R+ ++ + +Y+ Y Q+R F
Sbjct: 1 ALCPRYFARIHEHLAPWRSTGITRRSLDR---LGSMRVAILRSDMYVSAYGDCPQSRRAF 57
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
++WG+L +L ++ R+PD+E + +C DRP++ RD S + P P+ Y R ++DI
Sbjct: 58 SLWGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGS----EVPAPVLSYCSHRLSLDIP 112
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
FPD+SFWGW E+NI+PWE +E+ +G+ W +R+P+A+WKGN + + R L CN
Sbjct: 113 FPDYSFWGWPEVNIRPWEEESQEIFQGSQDVEWSNRQPFAFWKGNLRMGKLRTLLARCN 171
>gi|302802289|ref|XP_002982900.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
gi|300149490|gb|EFJ16145.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
Length = 345
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 73/177 (41%), Positives = 109/177 (61%), Gaps = 7/177 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP Y+ + EDL+PWK GI LE A + + FR I+ + ++Y+ Y + QTR +FTI
Sbjct: 2 CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
GI+QLL+++PG++PD+++ F+C DRP I + ++ PPPLF Y + DI FP
Sbjct: 62 VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAF----DEAPPPLFGYCSTKNHFDIPFP 117
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
DWSFWGW E I PW S L+ + + W DR+ W+G+P ++ R+ L+ CN
Sbjct: 118 DWSFWGWPENKILPWRSQLKRITQQA---EWKDRDSSVQWRGDPRTSQIRQRLIACN 171
>gi|302800329|ref|XP_002981922.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
gi|300150364|gb|EFJ17015.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
Length = 345
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 108/177 (61%), Gaps = 7/177 (3%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
CP Y+ + EDL+PWK GI LE A + + FR I+ + ++Y+ Y + QTR +FTI
Sbjct: 2 CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
GI+QLL+++PG++PD+++ F+C DRP I + ++ PPPLF Y + DI FP
Sbjct: 62 VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAF----DEAPPPLFGYCSTKNHFDIPFP 117
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
DWSFWGW E I PW S +LK W DR+ W+G+P ++ R+ L+ CN
Sbjct: 118 DWSFWGWPENKILPWRS---QLKRITRQAEWKDRDSSVQWRGDPRTSQIRQRLIACN 171
>gi|222634972|gb|EEE65104.1| hypothetical protein OsJ_20160 [Oryza sativa Japonica Group]
Length = 498
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 81/235 (34%), Positives = 112/235 (47%), Gaps = 51/235 (21%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
PP P +C S S P ++CPDYFR+IH+DL PW+
Sbjct: 108 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 167
Query: 97 TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GITR+ +ER + A+FRL++V+ + Y+ Y++S QTRD F
Sbjct: 168 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAF------------------- 208
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
+ D T+D+VFPDWSFWGW E+NI W +
Sbjct: 209 -------------------------------TQDATTLDVVFPDWSFWGWPEVNIGAWPA 237
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
L ++ + W +REP+A+WKGNP VA R +L+ CN SD DWNARL+ Q
Sbjct: 238 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 292
>gi|223950283|gb|ACN29225.1| unknown [Zea mays]
Length = 199
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/152 (51%), Positives = 101/152 (66%), Gaps = 7/152 (4%)
Query: 61 KTSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLI 116
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+
Sbjct: 44 STNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVA 103
Query: 117 LVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
+V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +
Sbjct: 104 VVAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERH 163
Query: 175 GP-NNKGPPPLFRYSGDRWTMDIVFPDWSFWG 205
G PPPLFRY G T+DI FPDWSFWG
Sbjct: 164 GAQPQPPPPPLFRYCGSEATLDIAFPDWSFWG 195
>gi|302815691|ref|XP_002989526.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
gi|300142704|gb|EFJ09402.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
Length = 401
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 72/182 (39%), Positives = 103/182 (56%), Gaps = 9/182 (4%)
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
P YF I DL PWK GIT+ L A + R+++ K+YI Y + Q+R +FT W
Sbjct: 56 PKYFDSIDTDLRPWKDGGITKSSLSAARKKGSMRMVISQGKLYIEVYGKCPQSRSIFTAW 115
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVI------RSRDYSGPNNKGPPPLFRYSGDRWTM 194
G+L LL ++PG++PD++ + +C DRPVI SRD P +F Y +
Sbjct: 116 GLLLLLERFPGKVPDVDFVLNCKDRPVITRYSSFHSRDLC---QDEAPAVFSYCTTNDML 172
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
DI FPD+SFWGW E++I PWE +++ G+ W +R P A+WKGN + + R LL
Sbjct: 173 DIPFPDFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSRMGKLRSHLLQ 232
Query: 255 CN 256
C
Sbjct: 233 CQ 234
>gi|413919039|gb|AFW58971.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
gi|413919040|gb|AFW58972.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 302
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/150 (51%), Positives = 100/150 (66%), Gaps = 7/150 (4%)
Query: 62 TSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLIL 117
T+QT + + S PP CP YFRWIHEDL PW+ TG+TR +E A + A R+ +
Sbjct: 88 TNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAV 147
Query: 118 VNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
V ++Y+ +Y ++ QTR FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + + G
Sbjct: 148 VAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHG 207
Query: 176 P-NNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
PPPLFRY G T+DI FPDWSFW
Sbjct: 208 AQPQPPPPPLFRYCGSEATLDIAFPDWSFW 237
>gi|297745250|emb|CBI40330.3| unnamed protein product [Vitis vinifera]
Length = 312
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 67/100 (67%), Positives = 79/100 (79%), Gaps = 1/100 (1%)
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
RD+ GP+ PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++KEGN W
Sbjct: 2 RDFRGPS-AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKW 60
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 61 KDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 100
>gi|297745253|emb|CBI40333.3| unnamed protein product [Vitis vinifera]
Length = 325
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 67/111 (60%), Positives = 78/111 (70%), Gaps = 1/111 (0%)
Query: 159 MFDCDDRPVIRSRDYSGPNNKGP-PPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
MF+C DRP I++ DY G K PPLF Y T+DIVFPDWSFWGW EINIKPW SL
Sbjct: 1 MFECGDRPRIKATDYGGRKGKKKVPPLFHYCASDDTLDIVFPDWSFWGWPEINIKPWNSL 60
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
+EL+EGNN W+DREPYAYWKGN + R+ L C S+ HDWNAR+Y
Sbjct: 61 RKELEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCRPSNNHDWNARVY 111
>gi|6850895|emb|CAB71058.1| putative protein [Arabidopsis thaliana]
Length = 338
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 74/124 (59%), Positives = 81/124 (65%), Gaps = 2/124 (1%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGW 206
YPG++PDLEL+F C DRP I RD PPPLF Y G R DIVFPDWSFWGW
Sbjct: 2 YPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGW 61
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
E+NIK W L LKEGN W DR PYAYWKGNP V+ R DL+ CN SDK+D R
Sbjct: 62 PELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVR 121
Query: 267 LYVQ 270
LYVQ
Sbjct: 122 LYVQ 125
>gi|413919042|gb|AFW58974.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
Length = 316
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 67/113 (59%), Positives = 78/113 (69%), Gaps = 1/113 (0%)
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
MFDC+D PV+ + + G + PPP LFRY G T+DI FPDWSFWGW E+NIKPWE+L
Sbjct: 1 MFDCEDLPVVGAGERHGAQPQPPPPPLFRYCGSEATLDIAFPDWSFWGWPELNIKPWEAL 60
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
RE+ E N NW DR PYAYWKGNP V R LL CN S + DWNAR+Y Q
Sbjct: 61 QREINEANAVVNWTDRAPYAYWKGNPKVGAERLLLLRCNASGERDWNARVYAQ 113
>gi|67633610|gb|AAY78729.1| hypothetical protein At2g45840 [Arabidopsis thaliana]
Length = 337
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/126 (54%), Positives = 75/126 (59%), Gaps = 9/126 (7%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFW 204
YPG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFW
Sbjct: 2 YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
GW EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 62 GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPM 116
Query: 265 ARLYVQ 270
LY Q
Sbjct: 117 VHLYRQ 122
>gi|30267795|gb|AAP21678.1| hypothetical protein [Arabidopsis thaliana]
Length = 337
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/126 (54%), Positives = 75/126 (59%), Gaps = 9/126 (7%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFW 204
YPG++PDLEL+F C D P I RDY G N PPPLF Y G DIVFPDWSFW
Sbjct: 2 YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
GW EINIK W + EG W +REPYAYWKGNP VA RRDL+ C HD
Sbjct: 62 GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPM 116
Query: 265 ARLYVQ 270
LY Q
Sbjct: 117 VHLYRQ 122
>gi|356561649|ref|XP_003549092.1| PREDICTED: receptor-like protein 12-like [Glycine max]
Length = 819
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/174 (41%), Positives = 85/174 (48%), Gaps = 60/174 (34%)
Query: 39 KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
++ PLNC N T TCPTN T P T+TCPDYFRWIHEDL PW T
Sbjct: 700 RKPIAPLNCTAYNLTGTCPTNLQDHQST---------PATATCPDYFRWIHEDLRPWART 750
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GIT+DM+ERA QTA+FR YPG++PDLE
Sbjct: 751 GITQDMVERAKQTANFR----------------------------------YPGKIPDLE 776
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
LMFDC D PV+ S Y+GP T+DIVFPD S+ I+I
Sbjct: 777 LMFDCVDWPVVLSDRYNGP----------------TLDIVFPDSSYTSIIYISI 814
>gi|125583038|gb|EAZ23969.1| hypothetical protein OsJ_07696 [Oryza sativa Japonica Group]
Length = 279
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/71 (67%), Positives = 57/71 (80%)
Query: 182 PPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
PPLFRY GD T+D+VFPDWSFWGW EINIKPW++L ++L GN W+DREPYAYWKG
Sbjct: 2 PPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKG 61
Query: 242 NPFVAETRRDL 252
NP VA R++L
Sbjct: 62 NPDVATKRKEL 72
>gi|302143882|emb|CBI22743.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/63 (74%), Positives = 53/63 (84%)
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
EINIKPWESLL++LKEGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+
Sbjct: 37 EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARV 96
Query: 268 YVQ 270
Y Q
Sbjct: 97 YTQ 99
>gi|359490428|ref|XP_003634087.1| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
Length = 288
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/63 (74%), Positives = 53/63 (84%)
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
EINIKPWESLL++LKEGN W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+
Sbjct: 16 EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARV 75
Query: 268 YVQ 270
Y Q
Sbjct: 76 YTQ 78
>gi|356577934|ref|XP_003557076.1| PREDICTED: LOW QUALITY PROTEIN: protein O-glucosyltransferase
1-like, partial [Glycine max]
Length = 268
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/64 (65%), Positives = 50/64 (78%)
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
+EINIKPW LL LKEG W++REPYAYWKGNP VAETR+DL+ CN+S+ DWNAR
Sbjct: 1 SEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAVAETRQDLIKCNVSENQDWNAR 60
Query: 267 LYVQ 270
L+ Q
Sbjct: 61 LFAQ 64
>gi|356561560|ref|XP_003549049.1| PREDICTED: uncharacterized protein LOC100818683 [Glycine max]
Length = 289
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 42/64 (65%), Positives = 49/64 (76%)
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
AEINIKPW LL LKEG W++REPYAYWKGNP VAETR+DL+ CN+ + DWNAR
Sbjct: 14 AEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAVAETRQDLIKCNVYENQDWNAR 73
Query: 267 LYVQ 270
L+ Q
Sbjct: 74 LFAQ 77
>gi|147802993|emb|CAN70738.1| hypothetical protein VITISV_008288 [Vitis vinifera]
Length = 204
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 59/109 (54%), Gaps = 17/109 (15%)
Query: 52 TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA 111
Q CP N+P TS T + + CP+YFRWIHEDL PWK TGI+R +E A A
Sbjct: 2 AQACPANHPTTSVTGKLSXE-------ACPEYFRWIHEDLRPWKSTGISRFAVESAXGDA 54
Query: 112 HFRLILVNNKVYIHKYKQ----------SIQTRDVFTIWGILQLLRKYP 150
FRL++VN K Y+ +Y++ S+ + V + L++ YP
Sbjct: 55 DFRLVIVNGKAYVEQYRKYKIYVEGWGWSVSEKYVLACDSMTLLIKPYP 103
>gi|124360768|gb|ABN08742.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
Length = 253
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 46/62 (74%), Gaps = 1/62 (1%)
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
AE IKPWE +L++++E N W DR PYA+WKGN F++ R +L CN++D+HDWNAR
Sbjct: 6 AETRIKPWEKVLKDIQESNKKIKWKDRIPYAFWKGN-FLSNPRHELRKCNVTDQHDWNAR 64
Query: 267 LY 268
+Y
Sbjct: 65 VY 66
>gi|412992075|emb|CCO20801.1| glycosyltransferase CAZy GT90 [Bathycoccus prasinos]
Length = 438
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 96/215 (44%), Gaps = 31/215 (14%)
Query: 86 WIHEDLSPWKVTGITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQ-TRDVFTIWGI 142
WI DL PW +TGIT+ M++ A Q R+ ++ K+Y K S +R + +WG+
Sbjct: 74 WIDVDLQPWSMTGITKRMVDLAAQQGMRANRIQIIGGKIYAQISKSSRGPSRIWYWLWGL 133
Query: 143 LQLLRKYPGR-LPDLELMFDCDDRPVI--------------RSRDYS-GPNNKGPPPLFR 186
++L+ ++P +PD++ + + D P + + RD+ G + PPP+F
Sbjct: 134 MELIDEFPEEAVPDVDFILNTQDDPQVSIVGKRPKNPILAKKYRDFVPGIKGQAPPPVFS 193
Query: 187 YSGDRWTMDIVFPDWSFWG---------WAEINIKPWESLLRELKEGNNGRNWIDREPY- 236
D+++P W+ WG + PW+ L +L W +R
Sbjct: 194 AVTTSNNYDLLWPLWTIWGEDVEGAGSKTGGFHDPPWKELHPKLIHFAKKNKWSERRSER 253
Query: 237 AYWKGNPFVAETRRDLLTC--NLSDKHDWNARLYV 269
+W+G+ RR L+ C N D D +L V
Sbjct: 254 IFWRGSVKTNPARRALIRCSKNTVDAADVQHKLRV 288
>gi|255630857|gb|ACU15791.1| unknown [Glycine max]
Length = 194
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/69 (52%), Positives = 45/69 (65%), Gaps = 7/69 (10%)
Query: 192 WTMDIVFP-------DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
+T ++ FP D + AEINIKPW LL LKEG W++REPYAYWKGNP
Sbjct: 24 FTQNLCFPQCYKSCFDLAAGEGAEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPA 83
Query: 245 VAETRRDLL 253
VA+TR+DL+
Sbjct: 84 VAQTRQDLM 92
>gi|13937171|gb|AAK50079.1|AF372939_1 At1g63420/F2K11_19 [Arabidopsis thaliana]
gi|21700873|gb|AAM70560.1| At1g63420/F2K11_19 [Arabidopsis thaliana]
Length = 228
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%), Gaps = 1/51 (1%)
Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
++EG + +++R+ YAYWKGNPFVA +R DLLTCNLS HDWNAR+++Q
Sbjct: 1 MEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHDWNARIFIQ 51
>gi|312384008|gb|EFR28848.1| hypothetical protein AND_02694 [Anopheles darlingi]
Length = 406
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 103/220 (46%), Gaps = 29/220 (13%)
Query: 38 PKRITTPLNCVKNQTQTCPTNYPKT------SQTQESISDYSIPPTSTCPDYFRWIHEDL 91
P P++ + ++ PK S ++ S C + + +DL
Sbjct: 28 PASAAVPVSVTTSAKRSSAIRGPKKFTPLPDSVLSRNLGSESTTAIDGCSCHTGVLKKDL 87
Query: 92 SPWKVTGITRDMLERANQTA-HFRLILVNNKVYIHKYKQSIQTRDVF--TIWGILQLLRK 148
P+K GIT++M+ERA Q H+++I ++K+Y QT +F GI ++
Sbjct: 88 KPFKADGITKEMIERAKQYGTHYQVI--DHKLY-------RQTECMFPARCSGIEHFVKP 138
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWA 207
+LPD++L+ +C D P + R + NK P+F +S +DI++P W+FW G
Sbjct: 139 LLPKLPDMDLIINCRDWPQVH-RHW----NKEKTPVFSFSKTDEYLDIMYPAWAFWEGGP 193
Query: 208 EINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
I++ P W+ + + W +++P A+++G+
Sbjct: 194 AISLYPTGLGRWDQHRESISQAAERVPWKNKKPMAFFRGS 233
>gi|388579121|gb|EIM19449.1| hypothetical protein WALSEDRAFT_34020 [Wallemia sebi CBS 633.66]
Length = 471
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 83/173 (47%), Gaps = 21/173 (12%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQS-IQTRDVFTIWGILQLLRKYPGRLPDL 156
GIT++ LE+A+ AH R+++ NN++Y+ +Y+ + TR + T I + + PG +P++
Sbjct: 140 GITKEHLEKASDKAHGRVVIHNNRMYVKEYRHGDVNTRAMATFAAIHEAVLTSPGAIPNV 199
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY--SGDR---WTMDIVFPDWSFWGWAEINI 211
E F D +G + P P F + D+ W M PD+ FW W E +
Sbjct: 200 EFTFQIQD---------AGDSYDEPIPTFVLDRTADQPELWLM----PDFGFWSWPEPKV 246
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ + + + + +W ++ P +W+G R L+ + + H W+
Sbjct: 247 GSYVEVRDKAGKWESKHSWSEKLPKVFWRGASLGLPIRDQLV--DAARGHAWS 297
>gi|194744000|ref|XP_001954486.1| GF18286 [Drosophila ananassae]
gi|190627523|gb|EDV43047.1| GF18286 [Drosophila ananassae]
Length = 411
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 104/226 (46%), Gaps = 43/226 (19%)
Query: 41 ITTPLNCVKNQTQTCPTNYPKTS-----QTQESISDYSIPPT----STCPDYFRWIHEDL 91
+ +P Q+++ N+ + S Q +++++DY T S C + I DL
Sbjct: 25 LCSPDEKSCGQSESLDGNHDEFSFKIRRQIKKAVADYKPCSTDDEDSKCACHAAVIKHDL 84
Query: 92 SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLL 146
+P+K TG+TR M+E+A + + + NN++Y RD ++ GI L
Sbjct: 85 APYKATGVTRQMIEKAGEYGT-KYKIFNNRLY----------RDANCMFPSRCQGIEHFL 133
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWTMDIVFPDWS 202
LP+++L VI +RDY N + G P+F +S + MDI++P W+
Sbjct: 134 LPLTASLPNMDL--------VINTRDYPQLNTAWGSSGRGPIFSFSKTKEYMDIMYPAWT 185
Query: 203 FW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
FW G + P W+ + +L++ + W + +++G+
Sbjct: 186 FWAGGPATKLHPRGIGRWDQMREKLEKRSAAIPWSQKRELGFFRGS 231
>gi|242065946|ref|XP_002454262.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
gi|241934093|gb|EES07238.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
Length = 173
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 40/57 (70%), Gaps = 1/57 (1%)
Query: 79 TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
+CP YF +IHEDL PW+ G ITR ML+RA+ TA FR +++ + Y+H+ + + Q R
Sbjct: 89 SCPSYFCFIHEDLRPWRAAGGITRAMLDRAHLTATFRFVVLEGRAYVHRLRPAFQNR 145
>gi|384245503|gb|EIE18997.1| hypothetical protein COCSUDRAFT_54837 [Coccomyxa subellipsoidea
C-169]
Length = 520
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 85/206 (41%), Gaps = 38/206 (18%)
Query: 86 WIHEDLSPWKVTGI----TRDMLERANQT--AHFRLILVNNKVYIHKYKQ---------- 129
WI +D + W+ TGI +M R + FR ++N +++ +
Sbjct: 88 WIKQDFAQWEKTGIKMSAVTEMALRYRECFGEVFRFQIINGTLWVDHISERHSGWYPSRM 147
Query: 130 -----SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPL 184
S + + + I ++ LR +PG++PD++ + D P + + P N PPP+
Sbjct: 148 GAGSLSAKGKIPYAILALMDTLRHHPGQIPDIDAVIQTSDFPCMLRQQ---PGNTPPPPV 204
Query: 185 FRYSGDRWTMDIVFPDWSFWGWA--------EINIKPWESLLRELKEGNNGRNWIDREPY 236
F Y+ +DI FPD+++WG + + WE + L E + R+P
Sbjct: 205 FGYNSHARFVDIPFPDYTYWGHEYHRLVDEDGLLLFGWEKQFKLLSEKWREKEIASRKPQ 264
Query: 237 AYWKGN------PFVAETRRDLLTCN 256
W+G P E RR C
Sbjct: 265 VIWRGRTEDKEYPKRDELRRQFARCG 290
>gi|195331147|ref|XP_002032264.1| GM23615 [Drosophila sechellia]
gi|194121207|gb|EDW43250.1| GM23615 [Drosophila sechellia]
Length = 411
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 93/214 (43%), Gaps = 31/214 (14%)
Query: 46 NCVKNQTQTCPTN---YPKTSQTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTG 98
+C +N+++ + + Q +++ +DY S P S C + + DL+P+K TG
Sbjct: 32 SCAQNESEQINEDEFSFKIRRQIEKANADYKPCSSDPQDSDCSCHADVLKRDLAPYKSTG 91
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+TR M+E + + K+Y H+ + GI L LPD++L
Sbjct: 92 VTRQMIESSARYG------TKYKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL 145
Query: 159 MFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
VI +RDY N N P+F +S + DI++P W+FW G + P
Sbjct: 146 --------VINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHP 197
Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
W+ + +L++ W + +++G+
Sbjct: 198 RGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGS 231
>gi|195996689|ref|XP_002108213.1| hypothetical protein TRIADDRAFT_18924 [Trichoplax adhaerens]
gi|190588989|gb|EDV29011.1| hypothetical protein TRIADDRAFT_18924, partial [Trichoplax
adhaerens]
Length = 366
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 96/196 (48%), Gaps = 27/196 (13%)
Query: 67 ESISDYSIPPTSTCPDYFRWIHEDLSPWKVT-GITRDMLERA-NQTAH------FRLILV 118
+++S + C Y + I DL PW++ GIT+++ ++A NQ +H ++
Sbjct: 16 KAVSSFKDCKNDRCGCYKKVIDNDLKPWRLKKGITKEIFDKAANQGSHRGAEKGSHYQII 75
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N+KVY H+ + + R GI L+K +LP+LEL+ + D P + P
Sbjct: 76 NHKVYRHE-RCTFPAR----CKGIEHFLKKIAKKLPNLELIINTHDWPKV-------PKW 123
Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P+F +S DI++P WSFW G A I P W+ L + L++ ++ W
Sbjct: 124 DELLPVFSFSKTHNENDIMYPAWSFWEGGPAVWPIFPNGLGRWDVLRKSLQKASDKWPWD 183
Query: 232 DREPYAYWKGNPFVAE 247
++ A+++G+ AE
Sbjct: 184 KKKSIAFFRGSRTSAE 199
>gi|195573000|ref|XP_002104483.1| GD18424 [Drosophila simulans]
gi|194200410|gb|EDX13986.1| GD18424 [Drosophila simulans]
Length = 411
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKAKADYKPCSSDPQDSDCSCHADVMKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
K+Y H+ + GI L LPD++L VI +RDY N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL--------VINTRDYPQLNAA 158
Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
N P+F +S + DI++P W+FW G + P W+ + +L++
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218
Query: 230 WIDREPYAYWKGN 242
W + +++G+
Sbjct: 219 WSQKRSLGFFRGS 231
>gi|21355689|ref|NP_651095.1| rumi, isoform A [Drosophila melanogaster]
gi|74866179|sp|Q8T045.1|RUMI_DROME RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
gi|17862464|gb|AAL39709.1| LD29477p [Drosophila melanogaster]
gi|23172000|gb|AAN13920.1| rumi, isoform A [Drosophila melanogaster]
gi|220945854|gb|ACL85470.1| rumi-PA [synthetic construct]
gi|220955678|gb|ACL90382.1| rumi-PA [synthetic construct]
Length = 411
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 83/189 (43%), Gaps = 20/189 (10%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKANADYKPCSSDPQDSDCSCHANVLKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y H+ + GI L LPD++L+ + D P + + N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDLIINTRDYPQLN----AAWGNA 162
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDR 233
P+F +S + DI++P W+FW G + P W+ + +L++ W +
Sbjct: 163 AGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQK 222
Query: 234 EPYAYWKGN 242
+++G+
Sbjct: 223 RSLGFFRGS 231
>gi|307105335|gb|EFN53585.1| hypothetical protein CHLNCDRAFT_136791 [Chlorella variabilis]
Length = 592
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 79/189 (41%), Gaps = 34/189 (17%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERAN------QTAHFRLILVNNKVYIHKYKQSIQ---- 132
Y I DL PWK +GIT +++ER++ R ++N +++H + ++
Sbjct: 119 YLDLIRRDLLPWKDSGITHELVERSSMMFDDCDGDMLRFQVLNGSLWVHHITERLEGGWY 178
Query: 133 ------------TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
R + + +++ LR +PG++PD++ + D P I P
Sbjct: 179 PAPIGPGNAAAKGRVPYAVLALMETLRMFPGQIPDVDAILHFADFPCIPRPRAGAP---- 234
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI-----KP---WESLLRELKEGNNGRNWID 232
P P+ G DI F D+++WG + KP W + L + +D
Sbjct: 235 PAPILGLQGSAHHSDIPFSDYTYWGHEHQYLQDPWGKPAHGWGNQAEVLARKYENVSLLD 294
Query: 233 REPYAYWKG 241
R P A W+G
Sbjct: 295 RIPQASWRG 303
>gi|383858247|ref|XP_003704613.1| PREDICTED: O-glucosyltransferase rumi homolog [Megachile rotundata]
Length = 391
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 87/191 (45%), Gaps = 18/191 (9%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S +E+ +Y + + + I DL P+K GI +D+++ A F I V KV
Sbjct: 53 SMIEEAERNYQLCNNTNNNCFKHIILRDLKPFKEKGINKDLIDAAKARGTFYQI-VKGKV 111
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y K +R GI L K +LPD++L+ + D P +S Y G GP
Sbjct: 112 YRQK-DCMFPSR----CSGIEHFLLKLAPKLPDMDLVINVRDYP--QSSKYFG----GPL 160
Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
P+F +S DI +P W+FW G I++ P W+ L + + W +E
Sbjct: 161 PVFSFSKTPQYYDITYPAWAFWEGGPAISLYPRGLGRWDEHRISLDKASKSLPWEKKESK 220
Query: 237 AYWKGNPFVAE 247
A+++G+ +E
Sbjct: 221 AFFRGSRTSSE 231
>gi|195502713|ref|XP_002098347.1| GE24006 [Drosophila yakuba]
gi|194184448|gb|EDW98059.1| GE24006 [Drosophila yakuba]
Length = 411
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKANADYKPCSSDPKDSDCSCHADVLKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
K+Y H+ + GI L LPD++L VI +RDY N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL--------VINTRDYPQLNAA 158
Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
N P+F +S + DI++P W+FW G + P W+ + +L++
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218
Query: 230 WIDREPYAYWKGN 242
W + +++G+
Sbjct: 219 WSQKRNLGFFRGS 231
>gi|194910742|ref|XP_001982221.1| GG12485 [Drosophila erecta]
gi|190656859|gb|EDV54091.1| GG12485 [Drosophila erecta]
Length = 411
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)
Query: 64 QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
Q +++ +DY S P S C + + DL+P+K TG+TR M+E + +
Sbjct: 53 QIEKANADYKPCSSDPKDSDCSCHAAVLKRDLAPYKSTGVTRQMIESSARYG------TK 106
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
K+Y H+ + GI L LPD++L VI +RDY N
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVTTLPDMDL--------VINTRDYPQLNAA 158
Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
N P+F +S + DI++P W+FW G + P W+ + +L++
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218
Query: 230 WIDREPYAYWKGN 242
W + +++G+
Sbjct: 219 WSQKRNLGFFRGS 231
>gi|405122052|gb|AFR96820.1| hypothetical protein CNAG_06994 [Cryptococcus neoformans var.
grubii H99]
Length = 422
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 79/170 (46%), Gaps = 27/170 (15%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GI++ M++ A + + RL+++NN++++ YK I TR I + + LPD++
Sbjct: 81 GISKKMVDEAEEEGNARLVILNNQLFVKAYKGGINTRTQAAIAAVYGTVLTATEPLPDVD 140
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPW-- 214
VI++ D G N+ P F D+ + PD+ F+ W E + +
Sbjct: 141 F--------VIQTSDAGGGNH----PHFALCRKANQKDLWLMPDFGFFSWPEPGVGSYSE 188
Query: 215 ---ESLLREL--------KEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
++L EL K G +W ++ +W+G+P V E R DLL
Sbjct: 189 VRTKTLDYELDMGLEVDNKLGVTKSDWFNKTQQLFWRGSPMV-EVRNDLL 237
>gi|307104334|gb|EFN52588.1| hypothetical protein CHLNCDRAFT_138604 [Chlorella variabilis]
Length = 452
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 84/196 (42%), Gaps = 12/196 (6%)
Query: 69 ISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNNKVYIH 125
+S S+ S F + DL P+ +GI E+A A R+ +VN VYI
Sbjct: 28 LSKASLNGASYVGPLFSQVARDLEPFNHSGIGLRHAEQAYCQGSKASMRVQVVNGSVYIV 87
Query: 126 KYKQSIQTRDVFTIWGILQLLRKYPGRLPD-LELMFDCDDRPVIRSRDYSGPNNKGP--P 182
S ++R + +L L G LP+ ++ + + +D P +R R P +GP
Sbjct: 88 GESPSYESRMLGIKRQLLHLW--LAGGLPESIDFVVEQEDHPTVRHRSDDCPE-RGPILA 144
Query: 183 PLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN 242
P S + ++ PD +F GW E PW +L L+ W DR +++G
Sbjct: 145 PAKCPSNKSHSHVLLAPDHTFAGWPEARTLPWAEMLPLLQHSAERHPWADRSALLFFRG- 203
Query: 243 PFVAETRRDLLTCNLS 258
A R+L +LS
Sbjct: 204 --AATGDRNLTDSDLS 217
>gi|443694719|gb|ELT95788.1| hypothetical protein CAPTEDRAFT_221044 [Capitella teleta]
Length = 501
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 90/190 (47%), Gaps = 22/190 (11%)
Query: 80 CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
CP +R I +DLS + + + ++ +ER NQ L ++ NN++Y Y + +
Sbjct: 149 CPAGYRQIKQDLSIFGDIDMKAVKQEAVERFNQRGRHALCHYVVKNNQIYRQTYGEHVGF 208
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD+E + D P+ + + ++GP P+F + G T
Sbjct: 209 K-MFMDAMLLSLTRKV--HLPDVEFFVNLGDWPLEKRK-----VSEGPLPIFSWCGSDDT 260
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
DIV P + E ++ + +L + N G W ++ A+W+G E R D
Sbjct: 261 RDIVMPTYDV---TESTLETMGRITLDLLSVQANTGPKWSNKSSVAFWRGRDSRQE-RLD 316
Query: 252 LLTCNLSDKH 261
L+ LS KH
Sbjct: 317 LV--KLSRKH 324
>gi|328860135|gb|EGG09242.1| hypothetical protein MELLADRAFT_115837 [Melampsora larici-populina
98AG31]
Length = 515
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 78/167 (46%), Gaps = 13/167 (7%)
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
+T + L+ A + H R+++ N+VYI +YK R + I + + P RLP++E
Sbjct: 200 VTSEQLDEAIKVGHARVLIYENRVYIKEYKGGPGKRTEALLNSIQEAVITSPERLPNIEF 259
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESL 217
+ D P +G K P + + D+ D+ + PD+ F+ W E + +
Sbjct: 260 VVKTVDAP-------TGEETKLPLWVLDRTIDQ--EDVWLTPDYGFYSWPEPKVGSMIEV 310
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ E +W D+ P A+W+G + + R ++ ++ H+WN
Sbjct: 311 RDKCNEIEKKLDWKDKIPKAFWRG-AILVKLREQMI--EIAKGHEWN 354
>gi|195399534|ref|XP_002058374.1| GJ14378 [Drosophila virilis]
gi|194141934|gb|EDW58342.1| GJ14378 [Drosophila virilis]
Length = 410
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 101/227 (44%), Gaps = 43/227 (18%)
Query: 42 TTPLNCVKNQTQTC-------PTNYPKTSQTQESISDY--SIPPTST------CPDYFRW 86
T + C + TC P K S+ +SI + S P +T C Y
Sbjct: 21 TQKIQCDGAEHTTCRETDGIDPNQVAKHSKITQSIQNALASYEPCTTDVNDVNCTCYAAG 80
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----G 141
I DL+ +K TG+TR M+ A + R + N ++Y RD ++ G
Sbjct: 81 IKRDLALYKSTGVTRKMINDAAKYG-TRYKIYNKQLY----------RDDNCMFPARCQG 129
Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
I L + LP+++L+ + D P + +S + GP +F +S + DI++P W
Sbjct: 130 IEHFLLQLLAELPNMDLVINTRDYPQL----HSAWRHDGP--VFSFSKTKEYRDIMYPAW 183
Query: 202 SFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
+FW G + P W+ + +L++ ++ +W D++ +++G+
Sbjct: 184 TFWAGGPATKLHPTGIGRWDLMRAKLEKRSSSLSWHDKQELGFFRGS 230
>gi|324516300|gb|ADY46488.1| CAP10 family protein CPIJ013394 [Ascaris suum]
Length = 381
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 83/186 (44%), Gaps = 22/186 (11%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAH-FRLILVNNKV 122
+ +E+ +Y + C + I DLSP++ GI ML A +AH R L+ ++
Sbjct: 39 EIEEAERNYRSCDINNCTCFQSQIENDLSPFQ-NGIDESMLLAA--SAHGVRYQLIGGRL 95
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
+ + Q G+ L LP+ E + + D P +RS D P
Sbjct: 96 F-----RQPQCPFEARCEGVEYFLVHLADSLPNTEFVLNVHDHPQMRSDD-------SPL 143
Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
P+F +S D +DI++P WSFW G I++ P W ++ + W R+P
Sbjct: 144 PVFSFSKDMNHIDILYPAWSFWSGGPAISLYPTGIGRWNETSVKITKAAKRIEWAHRKPI 203
Query: 237 AYWKGN 242
A+++G+
Sbjct: 204 AFFRGS 209
>gi|118783733|ref|XP_001230897.1| AGAP004267-PA [Anopheles gambiae str. PEST]
gi|193806712|sp|A0NDG6.1|RUMI_ANOGA RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|116129010|gb|EAU76986.1| AGAP004267-PA [Anopheles gambiae str. PEST]
Length = 399
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 89/184 (48%), Gaps = 21/184 (11%)
Query: 66 QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLILVNNKVYI 124
+ +++ Y ++ C + + DL P+K GIT++M+ RA Q H+++I
Sbjct: 57 ETALAGYVACNSTNCNCHADVLKADLKPFKAHGITKEMINRAKQYGTHYQVIG------- 109
Query: 125 HKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPL 184
HK + + G+ +R LPD++L+ +C D P I R +S K P+
Sbjct: 110 HKLYRQRECMFPARCSGVEHFVRPLLPLLPDMDLIVNCRDWPQIH-RHWS----KEKIPV 164
Query: 185 FRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAY 238
+S +DI++P W+FW G I + P W+ + + + + +W +EP A+
Sbjct: 165 LSFSKTAEYLDIMYPAWAFWEGGPAIALYPTGLGRWDLHRQTITKASA--DWEAKEPKAF 222
Query: 239 WKGN 242
++G+
Sbjct: 223 FRGS 226
>gi|170055423|ref|XP_001863576.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
gi|193806713|sp|B0X1Q4.1|RUMI_CULQU RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|167875399|gb|EDS38782.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
Length = 403
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 60/291 (20%), Positives = 123/291 (42%), Gaps = 40/291 (13%)
Query: 26 VIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQT------QESISDYSIPPTST 79
++I ++ Q C + Q + N K + +E+++ Y +S
Sbjct: 11 LVIFAQNGQTDDGGMCMAKETCTETQQEAPTNNLYKAADNKYITLIEEALAAYKPCESSN 70
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF-- 137
C + + DL P++ +GIT+D++E A + ++ ++++ + RD
Sbjct: 71 CSCHLDVLKTDLRPFR-SGITQDLIELARSYGT-KYQIIGHRMF--------RQRDCMFP 120
Query: 138 -TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
G+ +R +LPD+EL+ +C D P I SR ++ ++ P P+ +S +DI
Sbjct: 121 ARCSGVEHFIRPNLPKLPDMELIINCRDWPQI-SRHWNA--SREPLPVLSFSKTNDYLDI 177
Query: 197 VFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG-------NP 243
++P W FW G I++ P W+ +++ W + A+++G +P
Sbjct: 178 MYPTWGFWEGGPAISLYPTGLGRWDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDERDP 237
Query: 244 FV--AETRRDLLTCNLSDKHDWNA---RLYVQVISFIYITHLCSKIPGFNF 289
V + R +L+ + W + L+ + + + C FNF
Sbjct: 238 LVLLSRMRPELVDAQYTKNQAWRSPKDTLHAEPAQEVRLEDHCQYKYLFNF 288
>gi|358056081|dbj|GAA97978.1| hypothetical protein E5Q_04658 [Mixia osmundae IAM 14324]
Length = 1130
Score = 57.0 bits (136), Expect = 1e-05, Method: Composition-based stats.
Identities = 43/169 (25%), Positives = 78/169 (46%), Gaps = 16/169 (9%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GI R ++R +H R+++ N++YI ++ TR I I + + +PD+E
Sbjct: 812 GIHRSQIDRGTVASHARILIKRNRLYIKHFRPVTTTRVEAAIALIHEAILSSVEPIPDVE 871
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF--PDWSFWGWAE-INIKPW 214
L+ DD +G + G P L G R + ++++ PD+ F+GW E I + +
Sbjct: 872 LLLHLDD---------TGNSKPGVPMLVL--GRRPSEELLWLMPDFGFYGWPEAIRVGSY 920
Query: 215 ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDW 263
+ E +W ++P A+W+G R+ L+ N S + W
Sbjct: 921 IHDYDQTLESETHASWKHKKPTAFWRGASLGLRDRQSLV--NNSRPYGW 967
>gi|242025222|ref|XP_002433025.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212518534|gb|EEB20287.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 407
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/241 (24%), Positives = 102/241 (42%), Gaps = 37/241 (15%)
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
S+Y P S C +F+ + DL P+K GITR+M++ I YI K+
Sbjct: 68 SNYKNCPYSNCSCFFKTLKNDLKPFK-NGITREMIDS---------IRSRGTTYIIYEKR 117
Query: 130 SIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFR 186
+ +D GI ++K L + EL+ + D P I SR + K P+F
Sbjct: 118 LYRDKDCLFPSRCSGIEYFIKKIISHLKNTELIINTRDWPQI-SRHF-----KLFGPVFS 171
Query: 187 YSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWK 240
+S + +DI++P WSFW G I P W+ ++L E ++ W ++ +++
Sbjct: 172 FSKTQDYLDIMYPAWSFWEGGPAIKTYPTGLGRWDLHRKKLSEESSKWPWNKKKSIGFFR 231
Query: 241 GN---------PFVAETRRDLLTCNLSDKHDWNA---RLYVQVISFIYITHLCSKIPGFN 288
G+ ++ R DL+ + W + L + + + H C FN
Sbjct: 232 GSRTSEKRDVLVLLSRKRPDLIDAQYTKNQAWKSLKDTLGKEPAEVVSLEHHCQYKYLFN 291
Query: 289 F 289
F
Sbjct: 292 F 292
>gi|405965272|gb|EKC30658.1| KTEL motif-containing protein 1 [Crassostrea gigas]
Length = 400
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 77/148 (52%), Gaps = 25/148 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNN 120
+ ++++ +Y C + I EDL+PWK GIT ++ A N +H+++ VN+
Sbjct: 54 KIEKAVKEYKDCTQDECSCHRSVIEEDLTPWK-NGITEEVFNTAKSNNYGSHYQI--VNH 110
Query: 121 KVYIHK---YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
K+Y + + + F IL++++K LPD+E + + D P I SR
Sbjct: 111 KLYREEGCMFPARCSGNEHF----ILEVIKK----LPDMEFVINTRDWPQISSR------ 156
Query: 178 NKGPPPLFRYSGDRW-TMDIVFPDWSFW 204
+ P P+F +S R ++DI++P W+FW
Sbjct: 157 -QQPIPVFSFSKVRGQSLDIMYPAWTFW 183
>gi|195113713|ref|XP_002001412.1| GI10779 [Drosophila mojavensis]
gi|193918006|gb|EDW16873.1| GI10779 [Drosophila mojavensis]
Length = 406
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/214 (23%), Positives = 94/214 (43%), Gaps = 25/214 (11%)
Query: 42 TTPLNCVKNQTQTCPTN--YPK-TSQTQESISDYSIPPTST----CPDYFRWIHEDLSPW 94
T C+KN+ N + K T ++++++Y T C Y I DL+P+
Sbjct: 25 TGQSTCIKNEEIEVKQNAKFDKFTRNIEKALANYQPCTTDATDVNCTCYAAGIKRDLAPY 84
Query: 95 KVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLP 154
K TG TR M+E A + K++ + + GI L + L
Sbjct: 85 KSTGFTRKMIEDAAKYG------TRYKIFGKQLFREDNCMFPARCQGIEHFLLQLLPELK 138
Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
+++L+ + D P + +S +KG P+F +S +DI++P W+FW G + P
Sbjct: 139 NMDLVINTRDYPQL----HSSWQHKG--PVFSFSKTTEYLDIMYPAWTFWAGGPATKLHP 192
Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
W+ + +LK+ W +++ +++G+
Sbjct: 193 TGIGRWDLMREKLKKAAKAIPWGEKKEIGFFRGS 226
>gi|195054040|ref|XP_001993934.1| GH18344 [Drosophila grimshawi]
gi|193895804|gb|EDV94670.1| GH18344 [Drosophila grimshawi]
Length = 408
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 75/169 (44%), Gaps = 16/169 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C Y I DL+P+K G +R MLE A + K+Y K +
Sbjct: 69 CTCYAAGIKRDLAPYKSIGFSRQMLEDAAKYG------TRYKIYGQKLFREENCFFPARC 122
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
GI L + +LP+++L+ + D P + +S ++ P+F +S DI++P
Sbjct: 123 QGIEHFLLELLPQLPNMDLVINTRDYPQL----HSSWSSSRIGPVFSFSKTSEYRDIMYP 178
Query: 200 DWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
W+FW G + P W+ + +LKE W ++E +++G+
Sbjct: 179 AWTFWAGGPATKLHPTGIGRWDLMSGKLKEVTTKIPWQEKEQLGFFRGS 227
>gi|332019905|gb|EGI60366.1| CAP10 family protein [Acromyrmex echinatior]
Length = 407
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 79/170 (46%), Gaps = 24/170 (14%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGIL 143
I DL P+ GI+++M+E A F I + K+Y + +D GI
Sbjct: 81 IVRDLRPFTKKGISKEMIEAAKTRGTFYQI-IKGKLY--------REKDCMFPARCAGIE 131
Query: 144 QLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSF 203
L K G L D++L+ + D P +S +Y G P+F +S DI++P W+F
Sbjct: 132 HFLLKIIGNLSDMDLVINTRDYP--QSSEYFG----NAIPVFSFSKTPQYYDIMYPAWAF 185
Query: 204 W-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
W G I++ P W+ + L + + +W ++E +++G+ +E
Sbjct: 186 WEGGPAISLYPRGLGRWDQHRKTLNKASLEISWEEKESKGFFRGSRTSSE 235
>gi|449669688|ref|XP_002167607.2| PREDICTED: KDEL motif-containing protein 1-like [Hydra
magnipapillata]
Length = 497
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 100/222 (45%), Gaps = 27/222 (12%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQT---AHFRLILVNNKVYIHKYKQSIQTRD 135
CP ++ I +DL P+ + +++E A ++ A + NKVY Y +I
Sbjct: 146 ACPVNYQQIEKDLKPFPNINL-ENLIESATKSYNVAFCHYTIKKNKVYRKCY-GTINDFK 203
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FT +L + RK +LPD+E + D P+ R P P+F + G T D
Sbjct: 204 MFTDAWLLSVARKV--KLPDVEFFTNLGDWPLTTKR-------FNPMPIFSWCGSNDTFD 254
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+V+P + SL +GN G +W ++P A+++G ++R++ L
Sbjct: 255 LVWPTYDLTESTLETFGGRVSLDMTSIQGNTGPSWNHKKPVAFFRGR----DSRQERL-- 308
Query: 256 NLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
+L ++ NA V + + + H + +++ ++FF
Sbjct: 309 DLVNRFRKNANFDVGITHYFFFKHDEEKYGPIANRVSFYDFF 350
>gi|380022449|ref|XP_003695058.1| PREDICTED: O-glucosyltransferase rumi homolog [Apis florea]
Length = 407
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 94/216 (43%), Gaps = 21/216 (9%)
Query: 41 ITTPLNCV--KNQTQTCPTNYPKT-SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
I + +CV KN + Y K + +E+ DY I + + I DL P+K
Sbjct: 30 IGSTEDCVEEKNIYKVLNKQYSKYYNAIEEAEKDYRICNNTNNSCFKDVIINDLKPFKEK 89
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
GI +D+++ A F I + K+Y K +R GI L K L D++
Sbjct: 90 GINKDLIDTAKIRGTFYQI-IQGKLYRQK-DCMFPSRCA----GIEYFLLKLAPGLTDMD 143
Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP--- 213
L+ + D P +S + G P P+F +S DI +P W+FW G I++ P
Sbjct: 144 LVINVRDYP--QSSKHFGD----PLPIFSFSKTSQYYDITYPAWAFWEGGPAISLYPRGL 197
Query: 214 --WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
W+ L + +N W +E +++G+ +E
Sbjct: 198 GRWDEHCISLDKASNNTLWEKKENKVFFRGSRTSSE 233
>gi|340719860|ref|XP_003398363.1| PREDICTED: o-glucosyltransferase rumi homolog [Bombus terrestris]
Length = 404
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 98/221 (44%), Gaps = 31/221 (14%)
Query: 41 ITTPLNCVKNQTQTCPTNYPKT-----SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK 95
I P NC++ + + K + +E+ +Y I + Y I DL P+K
Sbjct: 27 IDNPENCIEKKDNIYKKDLNKQYIKYYNAIEEAQKNYKICNNTNNGCYKDIIINDLKPFK 86
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGR 152
GI++D++ A +T ++ K+Y + +D GI L K
Sbjct: 87 KKGISKDLINIA-KTRGTVYQIIQGKLY--------RQKDCMFPSRCSGIEHFLLKLAPG 137
Query: 153 LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINI 211
L D++L+ + D P +S + G GP P+FR + + + DI +P W+FW G I++
Sbjct: 138 LTDMDLVINVRDYP--QSSKHFG----GPLPIFRLTPEYY--DITYPAWAFWEGGPAISL 189
Query: 212 KP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
P W+ L + + W ++E A+++G+ +E
Sbjct: 190 YPRGLGRWDEHRVSLDKASKNTLWEEKENKAFFRGSRTSSE 230
>gi|148230341|ref|NP_001086860.1| MGC83543 protein precursor [Xenopus laevis]
gi|50416390|gb|AAH77568.1| MGC83543 protein [Xenopus laevis]
Length = 386
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 98/220 (44%), Gaps = 29/220 (13%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+Q ++ +S P + ++R + EDL P++ +GI+RD++ Q R + + ++
Sbjct: 31 NQIHTAVQRHSPPDSHNGTCFYRVLQEDLEPFR-SGISRDLM----QNVLSRKLGTHYQI 85
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
H+ + + G+ L + LPD+EL+ + D P + P+ P
Sbjct: 86 INHRLYREEECMFSARCSGVEHFLLELLPNLPDMELVINVRDYPQV-------PSWMNPV 138
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P W+FW G A I P W+ + +LK+ + W +
Sbjct: 139 IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLKKAADLWPWEKKI 198
Query: 235 PYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNA 265
P Y++G +P + +R DL+ + W +
Sbjct: 199 PKGYFRGSRTSPDRDPLILLSRESPDLVDAEYTKNQAWKS 238
>gi|222630210|gb|EEE62342.1| hypothetical protein OsJ_17131 [Oryza sativa Japonica Group]
Length = 267
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNAR 266
E NI+PW R++KEG W DR AYWKGNP VA R LL CN D + W+A
Sbjct: 10 ETNIEPWSKEFRDIKEGAKAIKWQDRVATAYWKGNPDVASPLRVALLNCN--DTNMWHAE 67
Query: 267 LYVQ 270
+ Q
Sbjct: 68 IMRQ 71
>gi|322791349|gb|EFZ15836.1| hypothetical protein SINV_05265 [Solenopsis invicta]
Length = 408
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 87/189 (46%), Gaps = 25/189 (13%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGIL 143
I DL P++ GI+ +M++ A F + + K+Y + +D GI
Sbjct: 82 ILRDLGPFRTKGISEEMIQAARTRGTFYQV-IKGKLY--------REKDCMFPARCAGIE 132
Query: 144 QLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSF 203
L K G L D++L+ + D P +S +Y G P+F +S DI++P W+F
Sbjct: 133 HFLLKVIGNLSDMDLVINTRDYP--QSSEYFG----NAMPVFSFSKTPQYYDIMYPAWAF 186
Query: 204 W-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
W G I++ P W+ + L + + W +E +++G+ +E R +L+ +
Sbjct: 187 WEGGPAISLYPHGLGRWDQHRKSLNKASLEIPWEKKESKGFFRGSRTSSE-RDNLILLSR 245
Query: 258 SDKHDWNAR 266
S H +A+
Sbjct: 246 SKPHLVDAQ 254
>gi|195443898|ref|XP_002069626.1| GK11622 [Drosophila willistoni]
gi|194165711|gb|EDW80612.1| GK11622 [Drosophila willistoni]
Length = 377
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 95/215 (44%), Gaps = 35/215 (16%)
Query: 50 NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ 109
Q + NY S + ++ C + + +++DL+ ++ TG++R M+E + +
Sbjct: 17 QQIENAMANYEPCSSDESDVN---------CSCHEKVLNQDLAAYQSTGVSRQMIESSAR 67
Query: 110 TAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDD 164
R + N++Y RD ++ GI LR+ LP+++L+ + D
Sbjct: 68 YGT-RYKIYENQLY----------RDEKCMFPARCQGIEHFLRQLLPVLPNMDLIINTRD 116
Query: 165 RPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI------NIKPWESLL 218
P I + + N+ G P+F +S + DI++P W+FW I W+ +
Sbjct: 117 YPQINT---AWGNSVGNGPVFSFSKTKEYRDIMYPAWTFWAGGPATRLHPRGIGRWDLMR 173
Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+L++ W + +++G+ +E R L+
Sbjct: 174 EKLEKRAAAIPWSQKRELGFFRGSR-TSEERDSLI 207
>gi|156366125|ref|XP_001626991.1| predicted protein [Nematostella vectensis]
gi|156213886|gb|EDO34891.1| predicted protein [Nematostella vectensis]
Length = 412
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 95/207 (45%), Gaps = 28/207 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKV 122
+ +++S Y C Y I +DL WK GI + + A ++ H+++I N+ +
Sbjct: 67 KINQAVSTYRGCSNKGCGCYKDVIEDDLRRWK-DGINKSDFDAARSRGTHYQII--NHML 123
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y + F GI L + +LPD+E+ +I +RD+ GP
Sbjct: 124 Y-----REDDCMFPFRCKGIEHFLLEVINKLPDMEI--------IINTRDWPQAAVWGPA 170
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S + MDI++P W+FW G A I P W+ + L + + W +E
Sbjct: 171 LPIFSFSKTKNEMDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREALDKKSQEWPWEKKE 230
Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKH 261
A+++G+ AE RD L LS KH
Sbjct: 231 SKAFFRGSRTSAE--RDPLVL-LSRKH 254
>gi|442757169|gb|JAA70743.1| Hypothetical protein [Ixodes ricinus]
Length = 507
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 21/182 (11%)
Query: 90 DLSPWKVTGITRDMLERANQTAHFRLI---LVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
D + + +D L+R ++ +V N+VY Y Q + ++F +L L
Sbjct: 172 DFEKVDMKKVLQDALQRFDRPGSVSFCHYAVVKNQVYRKCYGQHVGF-NMFMDQILLSLA 230
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
RK LPD+E++ + D P+ R +DY G P P F + G T DIV P +
Sbjct: 231 RKVV--LPDVEMLVNLGDWPLER-KDYWGK----PVPFFSWCGSNSTRDIVMPTYDL-TE 282
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-------PFVAETRR--DLLTCNL 257
+ + + +L +G+ G W D+EP +W+G VA +RR +LL +L
Sbjct: 283 SSLEMMGRVTLDMLSVQGHGGPAWKDKEPSGFWRGRDSRQERLDLVALSRRYPELLNASL 342
Query: 258 SD 259
++
Sbjct: 343 TN 344
>gi|291400615|ref|XP_002716870.1| PREDICTED: KTEL (Lys-Tyr-Glu-Leu) containing 1 [Oryctolagus
cuniculus]
Length = 462
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 84/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ---TAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ Q H+++I N
Sbjct: 108 QINRSLENYEPCSSENCSCYHGIIEEDLTPFR-GGISRKMMAEVVQRKLGTHYQII--KN 164
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 165 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 209
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ EL +
Sbjct: 210 WMEPAIPVFSFSKTSEYYDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVKSAAQWP 269
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 270 WKKKNSTAYFRGS 282
>gi|241671163|ref|XP_002400011.1| conserved hypothetical protein [Ixodes scapularis]
gi|215506249|gb|EEC15743.1| conserved hypothetical protein [Ixodes scapularis]
Length = 410
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 65/140 (46%), Gaps = 22/140 (15%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLILVNNKVYIHKYKQSIQTRDV- 136
TC + I DL W +GIT+D+++R+ H+++I N+K+Y + D
Sbjct: 78 TCSCHSAVIDSDLQLWSESGITKDLVQRSKSRGIHYQII--NHKLY--------RGEDCL 127
Query: 137 --FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
F GI L + LPDLE + + D P R P P+F +S
Sbjct: 128 FPFRCSGIEHFLLEIVDELPDLEFIVNTRDWPQAHKR-------YDPLPVFSFSKTPDYA 180
Query: 195 DIVFPDWSFW-GWAEINIKP 213
DI++P W+FW G I++ P
Sbjct: 181 DIMYPAWTFWAGGPAISLYP 200
>gi|134114475|ref|XP_774166.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256799|gb|EAL19519.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 437
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 79/181 (43%), Gaps = 35/181 (19%)
Query: 98 GITRDMLERANQTAHFRLILVNN--------KVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
GI++ M++ A + + RL+++NN ++++ YK I TR I + L
Sbjct: 88 GISKKMVDEAEEEGNARLVILNNQLISHLGRQLFVKAYKGGINTRTQAAIAAVYGALLTA 147
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAE 208
LPD++ VI++ D G N+ P F D+ + PD+ F+ W E
Sbjct: 148 TEPLPDVDF--------VIQTSDAGGGNH----PHFALCRKADQKDLWLMPDFGFFSWPE 195
Query: 209 INIKPW-----ESLLRELKEG--------NNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
+ + ++L EL G +W ++ +W+G+P V E R DLL
Sbjct: 196 PGVGSYSEVRTKTLDYELDMGLEVDDKLDVTKSDWFNKTQQLFWRGSPMV-EVRNDLLRA 254
Query: 256 N 256
+
Sbjct: 255 S 255
>gi|350416880|ref|XP_003491149.1| PREDICTED: O-glucosyltransferase rumi homolog [Bombus impatiens]
Length = 407
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 86/191 (45%), Gaps = 24/191 (12%)
Query: 66 QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIH 125
+E+ +Y I + Y I DL P+K GI++D++ A +T ++ K+Y
Sbjct: 58 EEAQKNYKICNNTNNGCYKDIIINDLKPFKKKGISKDLINIA-KTRGTVYQIIQGKLY-- 114
Query: 126 KYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
+ +D GI L K L D++L+ + D P +S + G GP
Sbjct: 115 ------RQKDCMFPSRCSGIEHFLLKLAPGLTDMDLVINVRDYP--QSSKHFG----GPL 162
Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
P+F +S DI +P W+FW G I++ P W+ L + + W ++E
Sbjct: 163 PIFSFSKTPEYYDITYPAWAFWEGGPAISLYPRGLGRWDEHRVSLDKASKNTLWEEKENK 222
Query: 237 AYWKGNPFVAE 247
A+++G+ +E
Sbjct: 223 AFFRGSRTSSE 233
>gi|321467420|gb|EFX78410.1| hypothetical protein DAPPUDRAFT_305151 [Daphnia pulex]
Length = 415
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 86/196 (43%), Gaps = 28/196 (14%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+S+ DY C + + EDL +K GIT+ ML++A + ++ K+
Sbjct: 69 GHISQSLLDYKPCLFKNCSCFTNLVIEDLKAFKDVGITKSMLDKAKDRGT-KYQIIGQKL 127
Query: 123 YIHKYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
Y R+ F ++ GI + K LPD+EL+ + D P + SR +
Sbjct: 128 Y----------RESFCLFPSRCAGIEHFILKVIKDLPDMELIINNRDWPQV-SRHFG--- 173
Query: 178 NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P+ +S + +DI +P W+FW G I++ P W+ + + W
Sbjct: 174 --EVLPILSFSKTKEYLDITYPAWTFWEGGPAISLYPRGLGRWDQHRISIDKVAATYPWN 231
Query: 232 DREPYAYWKGNPFVAE 247
+++ A+++G+ +E
Sbjct: 232 EKQSKAFFRGSRTSSE 247
>gi|165993289|emb|CAP71956.1| unnamed protein product [Danio rerio]
Length = 500
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 95/224 (42%), Gaps = 29/224 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I DLS +K R+ E +++ H+ + NN+VYI + + +
Sbjct: 152 CPASFSQIESDLSIFKSVDPDRNAHEVIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK +LPD+E + D P+ + R ++ P P+F + G T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL +G+ G W + +W+G E R +L+
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDSRKE-RLELV 319
Query: 254 TCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
++ A L + +F + H L + F+FF
Sbjct: 320 KLARAN----TAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFF 359
>gi|158297147|ref|XP_317424.4| AGAP008037-PA [Anopheles gambiae str. PEST]
gi|157015054|gb|EAA12302.4| AGAP008037-PA [Anopheles gambiae str. PEST]
Length = 513
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 84/199 (42%), Gaps = 24/199 (12%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQS 130
T CP I DL P++ + ++ +H +++ N+V+ Y Q
Sbjct: 158 TVGCPAGDPQIDMDLIPFRAINFSSLRTRMIQQYDKPGSISHCNYVILRNQVHRRCYGQH 217
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-NKGPPPLFRYSG 189
F +L L RK+ LPD+E+ + D P+++ GP+ GP P+F + G
Sbjct: 218 TGFSK-FMDTILLSLARKFT--LPDMEMFVNLGDWPLVKK---GGPSRTTGPYPIFSWCG 271
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN------- 242
T DIV P + N+ +L L G W D+ A+W+G
Sbjct: 272 SDDTFDIVMPTYDITESTLENMG--RVMLDMLSVQRRGLPWADKHAKAFWRGRDARRERL 329
Query: 243 PFVAETRR--DLLTCNLSD 259
VA +RR +LL +L++
Sbjct: 330 ELVALSRRYPELLNASLTN 348
>gi|328780644|ref|XP_003249836.1| PREDICTED: o-glucosyltransferase rumi homolog, partial [Apis
mellifera]
Length = 391
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 93/218 (42%), Gaps = 23/218 (10%)
Query: 41 ITTPLNCVK----NQTQTCPTNYPKT-SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK 95
I +CVK N + +Y K + +E+ DY + + I DL P+K
Sbjct: 28 IDNTEDCVKEKKVNIYEGLNKHYSKYYNAIEEAEKDYRVCNNINNNCFKDVIINDLKPFK 87
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
GI +D+++ A F I + K+Y K +R GI L K L D
Sbjct: 88 EKGINKDLIDIAKIRGTFYQI-IQGKLYRQK-DCMFPSRCA----GIEYFLLKLAPGLTD 141
Query: 156 LELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP- 213
++L+ + D P +S Y G P P+F +S DI +P W+FW G I++ P
Sbjct: 142 MDLVINVRDYP--QSSKYFG----DPLPIFSFSKTSQYYDITYPAWAFWEGGPAISLYPR 195
Query: 214 ----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
W+ L + +N W +E +++G+ +E
Sbjct: 196 GLGRWDEHCISLDKASNNTLWEKKENKVFFRGSRTSSE 233
>gi|71018113|ref|XP_759287.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
gi|46099137|gb|EAK84370.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
Length = 473
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 20/192 (10%)
Query: 78 STCPDYFRWIHEDLSP----WKVTG--ITRDMLERANQTAH--FRLILVNNKVYIHKYKQ 129
STC F ++ L WK G + D+ A+ H LI+ + +++I K+
Sbjct: 76 STCKAEFSRLYPQLIANQIAWKSKGGILYADVKNAADNCRHGCVHLIIKHGQIFIRAQKK 135
Query: 130 SIQTRDVFTIWGILQLLRK-YPGRLPDLELMFDCDDRPVIRSRDYSG---PNNKGPPPLF 185
Q+R + LQLL K Y G + + + + VI + D+ G PN++G +
Sbjct: 136 DWQSR----VRSTLQLLDKAYSGASEHEKALMEATEL-VISTADFDGFTDPNSRGAGWVL 190
Query: 186 RYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV 245
+ +FPD+SF W E I ++ R+ ++ N W + A+W+G+
Sbjct: 191 DKRVNDTQGQYLFPDFSFASWPEAGIPSYQEFRRQAQQVNAETPWKSKTNPAFWRGDALA 250
Query: 246 AET---RRDLLT 254
+ R LL+
Sbjct: 251 GQNIKPRESLLS 262
>gi|125773715|ref|XP_001358116.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
gi|121991819|sp|Q29AU6.1|RUMI_DROPS RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
gi|54637851|gb|EAL27253.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
Length = 409
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 24/175 (13%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ C + I DL+P+K TG++R M+E + + K+Y + +
Sbjct: 69 ANCSCHAAVIKSDLAPYKATGVSRQMIESSARYG------TRYKIYEKRLYREENCMFPA 122
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWT 193
GI L LPD++L VI +RDY N N P+ +S +
Sbjct: 123 RCQGIEHFLLPLVATLPDMDL--------VINTRDYPQINMAWGNGAQGPILSFSKTKDH 174
Query: 194 MDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
DI++P W+FW G + P W+ + +L++ W + +++G+
Sbjct: 175 RDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229
>gi|56207656|emb|CAI20990.1| novel protein (zgc:56065) [Danio rerio]
Length = 500
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 96/224 (42%), Gaps = 29/224 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I DLS ++ R+ E +++ H+ + NN+VYI + + +
Sbjct: 152 CPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK +LPD+E + D P+ + R ++ P P+F + G T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL +G+ G W + +W+G ++R++ L
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGR----DSRKERL 316
Query: 254 TCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
K + A L + +F + H L + F+FF
Sbjct: 317 ELVKLAKAN-TAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFF 359
>gi|443727132|gb|ELU14012.1| hypothetical protein CAPTEDRAFT_20245 [Capitella teleta]
Length = 371
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 85/191 (44%), Gaps = 27/191 (14%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S + ++++YS C + I DL PWK GIT + ++A ++N+K+
Sbjct: 18 SSIENALAEYSPCNPDDCSCHLGVIESDLKPWK-NGITEQLFQQAKARGSNHYQIINHKL 76
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y +S + GI + + +LPD+E ++ RD+ + G P
Sbjct: 77 Y-----RSEKCMFPSRCSGIEHFILEVIHKLPDMEF--------ILNERDWPQASIHGAP 123
Query: 183 -PLFRYSG---DRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P+F +S D W DI++P W+FW G A I P W+ + + E W
Sbjct: 124 LPIFSFSKVPTDNW--DIMYPAWTFWEGGPAVWPIYPTGLGRWDEQRKIIPEAAKKWPWH 181
Query: 232 DREPYAYWKGN 242
++ A+++G+
Sbjct: 182 KKQSKAFFRGS 192
>gi|195143519|ref|XP_002012745.1| GL23774 [Drosophila persimilis]
gi|194101688|gb|EDW23731.1| GL23774 [Drosophila persimilis]
Length = 409
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 24/175 (13%)
Query: 78 STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
+ C + I DL+P+K TG++R M+E + + K+Y + +
Sbjct: 69 ANCSCHAAVIKSDLAPYKATGVSRQMIESSARYG------TRYKIYEKRLYREENCMFPA 122
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWT 193
GI L LPD++L VI +RDY N N P+ +S +
Sbjct: 123 RCQGIEHFLLPLVATLPDMDL--------VINTRDYPQINMAWGNGAQGPILSFSKTKDH 174
Query: 194 MDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
DI++P W+FW G + P W+ + +L++ W + +++G+
Sbjct: 175 RDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229
>gi|442759745|gb|JAA72031.1| Putative o-glucosyltransferase rumi [Ixodes ricinus]
Length = 410
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 64/137 (46%), Gaps = 16/137 (11%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLILVNNKVYIHKYKQSIQTRDVF 137
TC + I DL W +GIT+D+++R+ H+++I N+K+Y + F
Sbjct: 78 TCSCHSAVIDNDLQLWSESGITKDLIQRSKSRGIHYQII--NHKLY-----RGEDCLFPF 130
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
GI L + LPDLE + + D P R P P+F +S DI+
Sbjct: 131 RCSGIEPFLLEIVDELPDLEFIVNTRDWPQAHKR-------YDPLPVFSFSKTPDYADIM 183
Query: 198 FPDWSFW-GWAEINIKP 213
+P W+FW G I++ P
Sbjct: 184 YPAWTFWAGGPAISLYP 200
>gi|345796207|ref|XP_545115.3| PREDICTED: protein O-glucosyltransferase 1 [Canis lupus familiaris]
Length = 392
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 83/194 (42%), Gaps = 32/194 (16%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVN 119
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I
Sbjct: 37 EQINRSLENYEPCSSQNCSCYHGIISEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--K 93
Query: 120 NKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
N++Y + D G+ + + GRLPD+E++ + D P I P
Sbjct: 94 NRLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQI-------P 138
Query: 177 NNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGR 228
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 139 KWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQW 198
Query: 229 NWIDREPYAYWKGN 242
W + AY++G+
Sbjct: 199 PWKKKNSTAYFRGS 212
>gi|41055550|ref|NP_957225.1| KDEL motif-containing protein 1 precursor [Danio rerio]
gi|82188635|sp|Q7ZVE6.1|KDEL1_DANRE RecName: Full=KDEL motif-containing protein 1; Flags: Precursor
gi|28277832|gb|AAH45893.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Danio rerio]
Length = 500
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 95/224 (42%), Gaps = 29/224 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I DLS ++ R+ E +++ H+ + NN+VYI + + +
Sbjct: 152 CPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK +LPD+E + D P+ + R ++ P P+F + G T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL +G+ G W + +W+G E R +L+
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDSRKE-RLELV 319
Query: 254 TCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
++ A L + +F + H L + F+FF
Sbjct: 320 KLARAN----TAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFF 359
>gi|307172256|gb|EFN63761.1| CAP10 family protein AGAP004267 [Camponotus floridanus]
Length = 406
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/219 (23%), Positives = 93/219 (42%), Gaps = 33/219 (15%)
Query: 47 CVKNQTQTCPTNYPKT---------SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
C +N+ +Y K + Q++ DY + + I DLSP+K
Sbjct: 31 CKENEIDHQKLSYAKAINEGYQKYLTAIQDAEQDYKECNNTKHSCFKDVIVRDLSPFKKK 90
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLP 154
GI+ +M+ A F I + K+Y + +D GI L K G L
Sbjct: 91 GISEEMIVAAKNRGTFYQI-IGGKLY--------REKDCMFPARCAGIEHFLLKVIGNLS 141
Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
+++L+ + D P +S +Y G P+F +S DI++P W+FW G I++ P
Sbjct: 142 NMDLIINTRDYP--QSSEYFG----NAMPIFSFSKTSQYYDIMYPAWAFWEGGPAISLYP 195
Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
W+ + L + ++ W ++ +++G+ +E
Sbjct: 196 RGLDRWDRHRKLLNKASSEIPWEKKDNKGFFRGSRTSSE 234
>gi|403288562|ref|XP_003935467.1| PREDICTED: protein O-glucosyltransferase 1 [Saimiri boliviensis
boliviensis]
Length = 391
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 85/190 (44%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ DY + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 37 QINRSLEDYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 93
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R IL+++R RLPD+E++ + D P + P
Sbjct: 94 RLY-RENDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 141
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 201
Query: 233 REPYAYWKGN 242
+ AY++G+
Sbjct: 202 KNSTAYFRGS 211
>gi|119497671|ref|XP_001265593.1| hypothetical protein NFIA_032600 [Neosartorya fischeri NRRL 181]
gi|119413757|gb|EAW23696.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 458
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 74/181 (40%), Gaps = 15/181 (8%)
Query: 81 PDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYI--HKYKQSIQTRDV 136
P F+ IH + WK GI+RD L + R I+ N +Y+ + K R +
Sbjct: 108 PGLFQDIHRGVEYWKSRGGISRDDLNAVPFEDGMARAIISNGDLYVVATRAKGDDHRRKI 167
Query: 137 F-TIWGILQLLRKYPGRL--PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
T+ I + L R P +E +F +DR N G P
Sbjct: 168 VGTLGSIHRALSASSNRTSHPTIEFIFSIEDR-------VDDVNAVGHPVWVLSRKAFEE 220
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDL 252
I+ PD+ FW WA+ NI P+ ++ + + + D+E W+G F + RR L
Sbjct: 221 SVILMPDFGFWSWAKSNIGPYGQVVERIMAAESKLKFADKEQKLVWRGKLSFAPKLRRVL 280
Query: 253 L 253
L
Sbjct: 281 L 281
>gi|402859114|ref|XP_003894014.1| PREDICTED: protein O-glucosyltransferase 1 [Papio anubis]
Length = 392
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|380786923|gb|AFE65337.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
Length = 392
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|109033215|ref|XP_001109371.1| PREDICTED: KTEL motif-containing protein 1-like [Macaca mulatta]
gi|355559357|gb|EHH16085.1| hypothetical protein EGK_11322 [Macaca mulatta]
gi|355746435|gb|EHH51049.1| hypothetical protein EGM_10372 [Macaca fascicularis]
Length = 392
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|157125824|ref|XP_001660800.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
gi|108882653|gb|EAT46878.1| AAEL001982-PA [Aedes aegypti]
Length = 505
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 80/189 (42%), Gaps = 18/189 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP+ I DL P++ + ++ + ++ N++Y Y +
Sbjct: 152 CPNSDPQIESDLIPFRAVNFSSLRPRIIQQYDKPGSVSLCNYVIKENQIYRTCYGRYTGF 211
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ ++ +L L RK LPD+EL + D P++ + GP P+F + G T
Sbjct: 212 K-MYMDALLLSLARK--AILPDMELFVNLGDWPLVTKGGHR--RTTGPYPIFSWCGSEDT 266
Query: 194 MDIVFPDWSFWGWAEINIKPWESL-LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
DIV P + E ++ + L L G W D+EP A+W+G E R DL
Sbjct: 267 FDIVMPTYDI---VEATLEAMNRVTLDMLSVQRKGIPWKDKEPKAFWRGRDACRE-RLDL 322
Query: 253 LTCNLSDKH 261
+ +S KH
Sbjct: 323 V--GISQKH 329
>gi|345497421|ref|XP_001601540.2| PREDICTED: O-glucosyltransferase rumi homolog [Nasonia vitripennis]
Length = 400
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 22/193 (11%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S +E+ ++Y+ + C + I DL + GI + ++E A F I + K+
Sbjct: 54 SAIEEAEANYAECDSKKCKCFGDMITRDLKTFAERGIDQKLIEAARPRGTFYQI-IGGKL 112
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY--SGPNNKG 180
Y K +R GI L K G + D+ L V+ +RDY S +
Sbjct: 113 YREK-NCMFPSRCA----GIEHFLLKVIGNVSDVSL--------VVNTRDYPQSSRHFGQ 159
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P P+F +S DI++P W FW G I++ P W+ + L E W +E
Sbjct: 160 PLPVFSFSKTPDYYDIMYPAWVFWEGGPAISLYPRGLGRWDLHRKSLNEAREETPWEKKE 219
Query: 235 PYAYWKGNPFVAE 247
A+++G+ +E
Sbjct: 220 EKAFFRGSRTSSE 232
>gi|307194254|gb|EFN76650.1| CAP10 family protein AGAP004267 [Harpegnathos saltator]
Length = 342
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 86/186 (46%), Gaps = 19/186 (10%)
Query: 87 IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
I DL P+K GI+ +M+ A F I+ K +++ K + GI L
Sbjct: 81 ILRDLRPFKKKGISEEMINAAKTRGTFYQII---KGTLYREKDCMFPA---RCAGIEHFL 134
Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-G 205
K G L D++L+ + D P +S +Y G P+F +S DI++P W+FW G
Sbjct: 135 LKVIGNLSDMDLVINTRDYP--QSSEYFG----NLLPIFSFSKTPQYYDIMYPAWAFWEG 188
Query: 206 WAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
I++ P W+ + L + + W +E +++G+ +E R +L+ + +
Sbjct: 189 GPAISLYPRGLGRWDQHRKSLNKASLETPWEKKESKGFFRGSRTSSE-RDNLILLSRNKS 247
Query: 261 HDWNAR 266
H +A+
Sbjct: 248 HLVDAQ 253
>gi|343425735|emb|CBQ69269.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 488
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 62/154 (40%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D
Sbjct: 167 SHARVVIHNNRVYLKYFQQSAFTRVNSALALLFQSVVGSREKLPDAEFCLSANDW----- 221
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
G K Y D W M PD+ F+ W E I + + + + W
Sbjct: 222 ----GSMGKFSLDRAPYLEDLWLM----PDYGFYSWPEPGIGSYTEHREKTLQVEHDTPW 273
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ P +W+G V R L D HDWN
Sbjct: 274 ERKVPKLFWRGAMGVGTADRKALLAAAQD-HDWN 306
>gi|410970561|ref|XP_003991747.1| PREDICTED: protein O-glucosyltransferase 1 [Felis catus]
Length = 447
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 93 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 149
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 150 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 194
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 195 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 254
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 255 WKRKNSTAYFRGS 267
>gi|307170880|gb|EFN62991.1| KDEL motif-containing protein 1 [Camponotus floridanus]
Length = 458
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 88/194 (45%), Gaps = 32/194 (16%)
Query: 80 CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
C + IH DLSP+ I +++R +Q + +L +N++Y Y Q +
Sbjct: 111 CRQNYTQIHNDLSPFLNIDFDKIRHSIIKRYDQPTSVSICHYVLRSNRIYRRCYGQYVGF 170
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD+E + D P++ P+ P+F + G T
Sbjct: 171 K-IFMDAILLSLARKVV--LPDIEFFVNLGDWPLV-------PDTGPLYPIFSWCGSDNT 220
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL-----KEGNNGRNWIDREPYAYWKGNPFVAET 248
DIV P + +I E++ R + +GN G +W ++ +W+G ++
Sbjct: 221 KDIVMPTY------DITESSLEAMGRVMLDTLSVQGNTGLSWKNKTEQLFWRGR----DS 270
Query: 249 RRDLLTC-NLSDKH 261
RR+ L ++S KH
Sbjct: 271 RRERLNLIDISRKH 284
>gi|357622883|gb|EHJ74243.1| hypothetical protein KGM_01635 [Danaus plexippus]
Length = 460
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 76/179 (42%), Gaps = 27/179 (15%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C ++ I +DL P++ + D R T+ ++ NK+Y Y + +
Sbjct: 150 CSQTYKQIEKDLIPFQRVEMKEQIKKIVDKYHRPESTSFCHYVIKENKIYRDCYGKHVGF 209
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
++F +L L RK LPD+E++ + D P+I N +F + G T
Sbjct: 210 -NMFADNILLSLSRK--TVLPDMEMVINLGDWPLIHK-------NGEKLAMFSWCGSDDT 259
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL-----KEGNNGRNWIDREPYAYWKGNPFVAE 247
+DIV P + +I E+L R +GN R W DRE A W+G AE
Sbjct: 260 LDIVMPTY------DITESTLENLGRVTLDTLSVQGNVERKWSDRETRAIWRGRDSRAE 312
>gi|440897644|gb|ELR49289.1| KTEL motif-containing protein 1 [Bos grunniens mutus]
Length = 392
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 84/190 (44%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PAIPIFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202
Query: 233 REPYAYWKGN 242
+ AY++G+
Sbjct: 203 KNSTAYFRGS 212
>gi|383420127|gb|AFH33277.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
Length = 392
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNCTAYFRGS 212
>gi|426217502|ref|XP_004002992.1| PREDICTED: protein O-glucosyltransferase 1 [Ovis aries]
Length = 392
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 84/190 (44%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PAIPIFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202
Query: 233 REPYAYWKGN 242
+ AY++G+
Sbjct: 203 KNSTAYFRGS 212
>gi|378732944|gb|EHY59403.1| hypothetical protein HMPREF1120_07393 [Exophiala dermatitidis
NIH/UT8656]
Length = 412
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/183 (22%), Positives = 81/183 (44%), Gaps = 25/183 (13%)
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
P F + L+ + + IT + L+ Q + R ++ + ++Y+ + I +R++ T+
Sbjct: 71 PGLFEEVERPLADRRNSSITLEELDSVPRQNGYVRGMIYDQQLYVIEKTGGIYSRELATL 130
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDR-PVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
+ + + P LP++E +F+ DDR P + Y+ R D T +
Sbjct: 131 HALHRAIISAPEPLPNIEFVFNSDDRIPSVAIWGYA-----------RREQD--TKIWLI 177
Query: 199 PDWSFWGWAEINIKPWESLLRELK------EGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
PD+ +W W E + +RE++ E ++G +W + P W+G E R +
Sbjct: 178 PDFGYWSWPETKV----GTMREVQMKAVETEQDDGWSWSSKVPKLLWRGATMGLELRENF 233
Query: 253 LTC 255
L
Sbjct: 234 LKA 236
>gi|169617900|ref|XP_001802364.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
gi|111059424|gb|EAT80544.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
Length = 428
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 53/104 (50%), Gaps = 12/104 (11%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E F DD P +RS++ + G K P + W M P++++W W
Sbjct: 152 IPNIEFSFSLDDLP-LRSKEKGAFFGYTRKDTPEY----DNIWMM----PNYAYWSWNYT 202
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W S+ RE+++G W ++P W+G +AE R++L+
Sbjct: 203 HAPSWNSIRREIEQGEKKTPWHKKDPRVVWRGKIKMAELRKELV 246
>gi|156401513|ref|XP_001639335.1| predicted protein [Nematostella vectensis]
gi|156226463|gb|EDO47272.1| predicted protein [Nematostella vectensis]
Length = 497
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 71/166 (42%), Gaps = 15/166 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRLI---LVNNKVYIHKYKQSIQTRD 135
CP + I DL+ + + R E ++ H L ++NNKVY + + +
Sbjct: 150 CPQNYSQIDRDLARFPEINLIRLAKEAVDRFGVHHALCHYSIINNKVYRKSHGEHVGF-S 208
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F+ I L RK LPD+E + D P+ + +D P P+ + G T D
Sbjct: 209 MFSDAIIHSLARKV--HLPDMEFFVNLGDWPLEKRKD-------NPIPILSWCGSEDTAD 259
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + A + SL + N G W D+ P A+W+G
Sbjct: 260 IVMPTYDL-TEAALETMGRVSLDMLSVQANTGPKWKDKIPKAFWRG 304
>gi|62460496|ref|NP_001014903.1| protein O-glucosyltransferase 1 precursor [Bos taurus]
gi|75057820|sp|Q5E9Q1.1|PGLT1_BOVIN RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
motif-containing protein 1; Flags: Precursor
gi|59858103|gb|AAX08886.1| x 010 protein [Bos taurus]
gi|296491446|tpg|DAA33499.1| TPA: KTEL motif-containing protein 1 precursor [Bos taurus]
Length = 392
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 84/190 (44%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 143 PAIPIFSFSKTLEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202
Query: 233 REPYAYWKGN 242
+ AY++G+
Sbjct: 203 KNSTAYFRGS 212
>gi|198431555|ref|XP_002127452.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 495
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 83/186 (44%), Gaps = 20/186 (10%)
Query: 80 CPDYFRWIHEDLSPWKVTGIT---RDMLER---ANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP +R I EDL P KV ++ + ++ER + H+ +VNN+VY + SI
Sbjct: 146 CPPTYRQIEEDLKPHKVIELSTFAKRIVERFTPHHSVCHY--AVVNNQVY-RQCHGSITD 202
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+F +L L RK +LPD E + D P+ +S D P P+ + G T
Sbjct: 203 FKMFMDAPLLSLTRKV--KLPDFEFFINLGDWPLEKSHD-------DPLPIISWCGSDGT 253
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DI+ P + + + + SL + N G W + +++G E R DL
Sbjct: 254 HDIILPTYDITN-SVLEMLGRVSLDMFSVQANTGPRWGKKIAKGFFRGRDSRQE-RLDLA 311
Query: 254 TCNLSD 259
+ ++ +
Sbjct: 312 SMSVKN 317
>gi|258575361|ref|XP_002541862.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
gi|237902128|gb|EEP76529.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
Length = 2000
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/249 (22%), Positives = 99/249 (39%), Gaps = 49/249 (19%)
Query: 65 TQESISDYSIPPTS---TCPDYFRWIHEDLSPWKVTGIT-RDMLERANQTAHFRLILVNN 120
T E DY + P F I + +S + I ++ R + R I+ N
Sbjct: 1707 TSEHGDDYGLGRVQCRIAFPKLFAEIEKSVSARRGKKIEYEELASRTLEDGMVRAIIYNG 1766
Query: 121 KVYIHKYK--QSIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGP 176
++ I ++ + TR T+ + + L P R LP++E +F DD
Sbjct: 1767 ELRIVNFEGHEFTFTRAKATLSSLNRALTAIPDRRSLPNIEFIFSADD------------ 1814
Query: 177 NNKGPPPLFRYSG---DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRN---- 229
GP P++ YS D W + PD+ +W W E NI P+ + R + ++G +
Sbjct: 1815 FTHGPGPIWTYSKRDEDSWAW--LMPDFGYWSWPEANIGPYRQIRRRIAAIDDGDSVGGR 1872
Query: 230 ------WIDREPYAYWKGNPFVAETRRD-------------LLTCNLSDKHDWNARLYVQ 270
+ ++ +W+GN A R+ +L + S+ D A+ YV
Sbjct: 1873 VRPGLKFQNKHQKLFWRGNIATAPVLRNKFLQVTHDKAWASVLPMDWSNSTDIEAK-YVP 1931
Query: 271 VISFIYITH 279
+ +ITH
Sbjct: 1932 IEEHFFITH 1940
>gi|388855380|emb|CCF51044.1| uncharacterized protein [Ustilago hordei]
Length = 493
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D
Sbjct: 172 SHARVLIHNNRVYLKYFQQSAFTRVNSALALLFQSVVASKEKLPDAEFCLSANDW----- 226
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
G K Y D W M PD+ F+ W E I + + E W
Sbjct: 227 ----GSMGKFSLDRAPYLEDLWLM----PDYGFYSWPEPGIGSYTEHREKTLEVEKATPW 278
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
++ +W+G V R L SD H WN
Sbjct: 279 ENKISKLFWRGAMGVGTADRKALLAAASD-HPWN 311
>gi|301758922|ref|XP_002915307.1| PREDICTED: KTEL motif-containing protein 1-like [Ailuropoda
melanoleuca]
Length = 380
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 26 QINRSLENYEPCSSQNCSCYHGVIDEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 82
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + G+LPD+E++ + D P + P
Sbjct: 83 RLY--------RENDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------PK 127
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 128 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 187
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 188 WKKKNSTAYFRGS 200
>gi|281345604|gb|EFB21188.1| hypothetical protein PANDA_003280 [Ailuropoda melanoleuca]
Length = 364
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 10 QINRSLENYEPCSSQNCSCYHGVIDEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 66
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + G+LPD+E++ + D P + P
Sbjct: 67 RLY--------RENDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------PK 111
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 112 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 171
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 172 WKKKNSTAYFRGS 184
>gi|296226200|ref|XP_002758827.1| PREDICTED: protein O-glucosyltransferase 1 [Callithrix jacchus]
Length = 391
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 37 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 93
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y + +R IL+++R RLPD+E++ + D P + P
Sbjct: 94 RLY-RENDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 141
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 201
Query: 233 REPYAYWKGN 242
+ AY++G+
Sbjct: 202 KNSTAYFRGS 211
>gi|194222810|ref|XP_001500762.2| PREDICTED: protein O-glucosyltransferase 1 [Equus caballus]
Length = 392
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ---TAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ + H+++I N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-AGISRKMMAEVVKRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + G LPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLARSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|31982953|ref|NP_689518.1| protein O-glucosyltransferase 1 precursor [Homo sapiens]
gi|114588661|ref|XP_516666.2| PREDICTED: protein O-glucosyltransferase 1 isoform 2 [Pan
troglodytes]
gi|397509574|ref|XP_003825192.1| PREDICTED: protein O-glucosyltransferase 1 [Pan paniscus]
gi|426341692|ref|XP_004036160.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Gorilla
gorilla gorilla]
gi|74730148|sp|Q8NBL1.1|PGLT1_HUMAN RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; Short=hCLP46; AltName:
Full=KTEL motif-containing protein 1; AltName:
Full=Myelodysplastic syndromes relative protein; Flags:
Precursor
gi|22761537|dbj|BAC11625.1| unnamed protein product [Homo sapiens]
gi|29179612|gb|AAH48810.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
gi|31745813|gb|AAP56253.1| myelodysplastic syndromes relative [Homo sapiens]
gi|37182284|gb|AAQ88944.1| MDS010 [Homo sapiens]
gi|119599976|gb|EAW79570.1| chromosome 3 open reading frame 9 [Homo sapiens]
gi|189067501|dbj|BAG37760.1| unnamed protein product [Homo sapiens]
gi|410228232|gb|JAA11335.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410254852|gb|JAA15393.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410299780|gb|JAA28490.1| protein O-glucosyltransferase 1 [Pan troglodytes]
gi|410328909|gb|JAA33401.1| protein O-glucosyltransferase 1 [Pan troglodytes]
Length = 392
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|344282265|ref|XP_003412894.1| PREDICTED: protein O-glucosyltransferase 1-like [Loxodonta
africana]
Length = 638
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/191 (22%), Positives = 79/191 (41%), Gaps = 28/191 (14%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDML-ERANQTAHFRLILVNNKV 122
Q S+ +Y + C Y I +DL+P++ GI++ M+ E + ++ N++
Sbjct: 284 QINRSLENYEPCSSQDCSCYHGVIEQDLTPFR-GGISKKMMAEVVGRKLGTHYQIIKNRL 342
Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 343 Y--------RENDCMFPSRCGGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWM 387
Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFWGWA-------EINIKPWESLLRELKEGNNGRNWI 231
P P+F +S DI++P W+FW I + W+ +L W
Sbjct: 388 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPIGLGRWDLFREDLARSAAQWPWK 447
Query: 232 DREPYAYWKGN 242
+ AY++G+
Sbjct: 448 KKNSTAYFRGS 458
>gi|297670280|ref|XP_002813298.1| PREDICTED: protein O-glucosyltransferase 1 [Pongo abelii]
Length = 392
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRRLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|432964664|ref|XP_004086966.1| PREDICTED: KDEL motif-containing protein 1-like [Oryzias latipes]
Length = 504
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 83/189 (43%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP F I +DLS + R+ E + + H+ + +NKVYI + + +
Sbjct: 156 CPQAFAQIDQDLSVFTTVDPDRNAREIPPRFRQRHSLCHY--TIKDNKVYIKTFGEHVGF 213
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK LPD+E + D P+ + + P + P+F + G +
Sbjct: 214 R-IFMDAVLLSLTRKV--HLPDVEFFVNLGDWPLEKRK----PTEE-IHPIFSWCGSNSS 265
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL +GN G W ++ A+W+G ++RR+ L
Sbjct: 266 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGNTGPAWPEKNATAFWRGR----DSRRERL 320
Query: 254 T-CNLSDKH 261
LS H
Sbjct: 321 ELVQLSRAH 329
>gi|62897433|dbj|BAD96657.1| x 010 protein variant [Homo sapiens]
Length = 392
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|328709644|ref|XP_001945687.2| PREDICTED: o-glucosyltransferase rumi homolog [Acyrthosiphon pisum]
Length = 384
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 24/199 (12%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
S+ ++ +Y+ + +F I +DL P+K GIT +M+ A R ++ N+ +
Sbjct: 37 SKINKAKKEYTSCESKNGTCFFPNILKDLEPFK-DGITHEMITAAADKGT-RYMIFNHDL 94
Query: 123 YIHKYKQSIQTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
Y +T+ +F GI L K PD+E + + D P I + Y P
Sbjct: 95 YR-------ETKCMFPARCEGIEHFLSKIQLNTPDVEFILNTRDWPQI-IKHYGDPK--- 143
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P WSFW G I + P W+SL + + + + W +
Sbjct: 144 --PVFSFSKTDDYADIMYPAWSFWSGGPAIKLHPSGLGRWDSLRKSILKQSEQWPWKRKI 201
Query: 235 PYAYWKGNPFVAETRRDLL 253
+++G+ +E R L+
Sbjct: 202 SKGFFRGSR-TSEQRDSLI 219
>gi|432849653|ref|XP_004066608.1| PREDICTED: protein O-glucosyltransferase 1-like [Oryzias latipes]
Length = 387
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/215 (21%), Positives = 89/215 (41%), Gaps = 29/215 (13%)
Query: 68 SISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKY 127
++ Y + C + + +DL P++ GI+ ++ Q R + + ++ HK
Sbjct: 37 AVGSYVSCSSVNCSCHLSVLQQDLQPFR-GGISESLMASTVQ----RGMGTHYQIIQHKL 91
Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLFR 186
+ G+ + + RLPDLE++ + D P + PN P P+
Sbjct: 92 YREQNCMFPARCSGVEHFILEVIDRLPDLEMVVNVRDYPQV-------PNWMSPALPVLS 144
Query: 187 YSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYW 239
+S DI++P W+FW G A I P W+ + +LK+ W +EP ++
Sbjct: 145 FSKTAEYQDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRTDLKKSAAQWPWKKKEPKGFF 204
Query: 240 KG-------NPFVAETRRD--LLTCNLSDKHDWNA 265
+G +P + +R D L+ + W +
Sbjct: 205 RGSRTSSERDPLILLSREDPELVDAEYTKNQAWKS 239
>gi|115496800|ref|NP_001069153.1| KDEL motif-containing protein 1 precursor [Bos taurus]
gi|111304522|gb|AAI19854.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
gi|296481621|tpg|DAA23736.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
Length = 502
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 81/189 (42%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDL------SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP+ F IH DL P K+ + H+ L +NKVYI + + +
Sbjct: 153 CPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHYS--LKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK ++PD+E + D P+ + + S P+ P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK--SSPHIH---PIFSWCGSTDS 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++RR+ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRRERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|417409544|gb|JAA51271.1| Putative protein o-glucosyltransferase 1, partial [Desmodus
rotundus]
Length = 306
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 80/187 (42%), Gaps = 20/187 (10%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVY 123
Q S+ +Y + C Y I EDL+P++ GI+R ++ + R + + ++
Sbjct: 54 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKVMVEVVR----RKLGTHYQII 108
Query: 124 IHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP- 182
H+ + G+ + + GRLPD+E++ + D P + P P
Sbjct: 109 KHRLYRENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWMEPAI 161
Query: 183 PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREP 235
P+F +S DI++P W+FW G A I P W+ EL W +
Sbjct: 162 PVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVRSAVQWPWKKKNS 221
Query: 236 YAYWKGN 242
AY++G+
Sbjct: 222 TAYFRGS 228
>gi|21040486|gb|AAH30614.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
gi|312150392|gb|ADQ31708.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [synthetic construct]
Length = 392
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRRMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|326913914|ref|XP_003203277.1| PREDICTED: KDEL motif-containing protein 1-like [Meleagris
gallopavo]
Length = 572
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 73/170 (42%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DL+ + V I ++ +R Q + +N+VYI Y + + R
Sbjct: 224 CPQVFPQIQRDLANFPVVDPDKIAIEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR- 282
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
+F +L L RK ++PD+E + D P+ K PP P+F + G
Sbjct: 283 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPL---------EKKKPPQNLHPIFSWCGSS 331
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DIV P + + + SL + N G +W D+ A+W+G
Sbjct: 332 ESKDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPSWEDKNTTAFWRG 380
>gi|187608427|ref|NP_001120575.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus (Silurana)
tropicalis]
gi|171846847|gb|AAI61552.1| LOC100145729 protein [Xenopus (Silurana) tropicalis]
gi|189442192|gb|AAI67365.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
gi|195540008|gb|AAI68093.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
Length = 509
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 86/191 (45%), Gaps = 18/191 (9%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLER-----ANQTAHFRLILVNNKVYIHKYKQSI 131
T +CPD I +D P+ +TR ML+ A++ A ++NN +Y H +
Sbjct: 156 TLSCPDTEAQISKDFDPFPSIDLTR-MLDEVPKRFADRGAIVHYTVLNNNIYRHSMGRYT 214
Query: 132 QTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR 191
+ +F+ + L RK RLPD E + D PV + P GP P+ + G
Sbjct: 215 DFK-MFSDEMLQSLARKV--RLPDFEFYINVGDWPVEHRKANDTP---GPLPMISWCGSA 268
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
+ DI+ P + ++ + L + +G+ G +W ++ +++G ++R +
Sbjct: 269 DSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTGPSWSNKTEQGFFRGR----DSREE 323
Query: 252 LLT-CNLSDKH 261
L ++S KH
Sbjct: 324 RLQLVHMSRKH 334
>gi|432113196|gb|ELK35717.1| Protein O-glucosyltransferase 1, partial [Myotis davidii]
Length = 364
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 10 QINRSLENYEPCSSQNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 66
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + G LPD+E++ + D P + P
Sbjct: 67 RLY--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PK 111
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 112 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 171
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 172 WKKKNSTAYFRGS 184
>gi|312380729|gb|EFR26645.1| hypothetical protein AND_07142 [Anopheles darlingi]
Length = 459
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 79/190 (41%), Gaps = 20/190 (10%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP I DL P++ + ++ +H +++ N V+ Y Q
Sbjct: 100 CPPGDPQIDMDLIPFRAINFSSLRTRMIQQYDKPGSISHCNYVILRNDVHRRCYGQHTGF 159
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-NKGPPPLFRYSGDRW 192
F +L L RK+ LPD+EL + D P+++ GP+ GP P+F + G
Sbjct: 160 SK-FMDTILLSLARKF--SLPDMELFVNLGDWPLVKK---GGPSRTTGPYPIFSWCGSDD 213
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
T DIV P + N+ +L L G W D+ A+W+G + RR+
Sbjct: 214 TFDIVMPTYDITESTLENMG--RVMLDMLSIQKRGIPWPDKHRKAFWRGR----DARRER 267
Query: 253 LT-CNLSDKH 261
L L+ +H
Sbjct: 268 LELVRLARRH 277
>gi|332025008|gb|EGI65195.1| KDEL motif-containing protein 1 [Acromyrmex echinatior]
Length = 495
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 85/189 (44%), Gaps = 23/189 (12%)
Query: 80 CPDYFRWIHEDLSPWK---VTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
C + IH DLSP+ I + +++R ++ + +L +N++Y Y Q +
Sbjct: 149 CQQNYTQIHHDLSPFSNIDFDKIRQSIIKRYDRPTSVSICHYVLKSNRIYRQCYGQYVGF 208
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD+E + D P++ P++ P+F + G T
Sbjct: 209 K-IFMDAILLSLTRKV--MLPDIEFFVNLGDWPLV-------PDSGPLYPIFSWCGSEST 258
Query: 194 MDIVFPDWSFWGWAEINIKPW-ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
DIV P + E +++ +L L NG W + +W+G E R DL
Sbjct: 259 KDIVMPTYDI---TESSLEAMGRVMLDTLSVQGNGLPWEHKTEQLFWRGRDARRE-RLDL 314
Query: 253 LTCNLSDKH 261
+ ++S KH
Sbjct: 315 I--DISRKH 321
>gi|118084661|ref|XP_416963.2| PREDICTED: KDEL motif-containing protein 1 [Gallus gallus]
Length = 500
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 73/170 (42%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DL+ + V I ++ +R Q + +N+VYI Y + + R
Sbjct: 152 CPQVFPQIQRDLANFPVVDPDKIAVEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR- 210
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
+F +L L RK ++PD+E + D P+ K PP P+F + G
Sbjct: 211 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPL---------EKKKPPQKLHPIFSWCGSS 259
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DIV P + + + SL + N G +W D+ A+W+G
Sbjct: 260 ESKDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPSWEDKNTTAFWRG 308
>gi|9437337|gb|AAF87313.1|AF168711_1 x 010 protein [Homo sapiens]
Length = 273
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|242007535|ref|XP_002424595.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
humanus corporis]
gi|212508038|gb|EEB11857.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
humanus corporis]
Length = 501
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/170 (23%), Positives = 76/170 (44%), Gaps = 21/170 (12%)
Query: 80 CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
C + I DL + K + + +LE+ + L ++ +N++Y Y + +
Sbjct: 154 CKKTYNQIVSDLESFPKVKFSEVLLKVLEKYSSQRSISLCHYVIKDNEIYRKCYGEYVDF 213
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPD E + + D P+ N+ P P+F + G +T
Sbjct: 214 K-IFVDALLLSLTRKI--ELPDFEFIVNLGDWPL-------EDNSPSPLPIFSWCGSNFT 263
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
DI+ P + E ++ + R++ +GN G +W D+ +W+G
Sbjct: 264 SDIIMPTYDL---TEATLECMGRVSRDMLSVQGNTGASWNDKINKGFWRG 310
>gi|395844808|ref|XP_003795143.1| PREDICTED: protein O-glucosyltransferase 1 [Otolemur garnettii]
Length = 392
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q S+ +Y + C Y I EDL+P++ GI+R M+ R H+++ V N
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIAEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--VKN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
++Y + D G+ + + G LPD+E++ + D P + P
Sbjct: 95 RLY--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212
>gi|391340384|ref|XP_003744522.1| PREDICTED: O-glucosyltransferase rumi homolog [Metaseiulus
occidentalis]
Length = 418
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 76/172 (44%), Gaps = 21/172 (12%)
Query: 83 YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
++R I DL P+ GI+R+ L RA + R ++ +V+ K + F GI
Sbjct: 67 HYRQIRRDLKPFS-EGISREDLNRAAKFG-VRYQIIQGRVFRQK-----ECMFPFRCSGI 119
Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDW 201
L + +LPDLE + + D P P + P+F +S + DI++P W
Sbjct: 120 EHFLLRLAPKLPDLEFIVNVRDYP-------QSPVGREKLPVFSFSKVPKNYYDILYPAW 172
Query: 202 SFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
+FW G I+I P W+ + L + W + +++G+ AE
Sbjct: 173 TFWEGGPAISIYPRGIGRWDIRSKSLIAESERVPWDAKVDRGFFRGSRTSAE 224
>gi|426236645|ref|XP_004012278.1| PREDICTED: KDEL motif-containing protein 1 [Ovis aries]
Length = 502
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDL------SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP+ F IH DL P K+ + H+ L +NKVYI + + +
Sbjct: 153 CPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK ++PD+E + D P+ + + S P+ P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK--SSPHIH---PIFSWCGSTDS 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 KDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|296811324|ref|XP_002846000.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
gi|238843388|gb|EEQ33050.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
Length = 420
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/147 (21%), Positives = 64/147 (43%), Gaps = 16/147 (10%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR--P 166
+ + R ++ N ++YI +I +R+ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYNQQLYIIHRNGTIYSREYATLQALNRAIVSSPDPLPNIEFVFNTDDKVDS 169
Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
V + P +K D W M PD+ +W W E + + + + ++
Sbjct: 170 VAQWGYARRPQDK----------DMWLM----PDFGYWSWPETKVGTMQEVQTKAEQEEQ 215
Query: 227 GRNWIDREPYAYWKGNPFVAETRRDLL 253
W + P +W+G E R L+
Sbjct: 216 TWTWPKKIPKLFWRGATMGLEVRDKLI 242
>gi|403171742|ref|XP_003330935.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169317|gb|EFP86516.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 499
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/159 (22%), Positives = 67/159 (42%), Gaps = 11/159 (6%)
Query: 99 ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
I+ L+RA Q +H R+++ N+VYI + R + I + + P LPD+E
Sbjct: 184 ISIQHLDRAYQYSHARVLIYRNRVYIKGFNGGPGLRTEAILNSIQEAVITSPELLPDVEF 243
Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESL 217
+ D P ++ PL+ + ++ + PD+ F+ W E + +
Sbjct: 244 VIKTSDAPQGGDDEH---------PLWVLDRTKSQEEVWLMPDYGFYSWPEPKVGGMVEV 294
Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
+ E +W + A+W+G V + R LL +
Sbjct: 295 RDKTAEREASLSWDSKISKAFWRGAILV-KLREQLLEVS 332
>gi|49119606|gb|AAH73128.1| LOC443629 protein, partial [Xenopus laevis]
Length = 507
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 91/214 (42%), Gaps = 31/214 (14%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLER----ANQT 110
CP P+T Q T +CPD I +D P+ +TR + E A++
Sbjct: 143 CPEEDPQTWQE-----------TLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFADRG 191
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWG--ILQ-LLRKYPGRLPDLELMFDCDDRPV 167
A ++NN++ Y++S+ F ++ ILQ L RK RLPD E + D PV
Sbjct: 192 AIVHYTVLNNQI----YRRSMGRYTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPV 245
Query: 168 IRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
+ P GP P+ + G + DI+ P + ++ + L + +G+ G
Sbjct: 246 ENRKANDTP---GPLPMISWCGSSDSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTG 301
Query: 228 RNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
+W ++ +++G E L +S KH
Sbjct: 302 PSWSNKTEQGFFRGRDSREER---LQLVQMSRKH 332
>gi|147905630|ref|NP_001085283.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus laevis]
gi|114107936|gb|AAI23308.1| LOC443629 protein [Xenopus laevis]
Length = 509
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 91/214 (42%), Gaps = 31/214 (14%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLER----ANQT 110
CP P+T Q T +CPD I +D P+ +TR + E A++
Sbjct: 145 CPEEDPQTWQE-----------TLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFADRG 193
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWG--ILQ-LLRKYPGRLPDLELMFDCDDRPV 167
A ++NN++ Y++S+ F ++ ILQ L RK RLPD E + D PV
Sbjct: 194 AIVHYTVLNNQI----YRRSMGRYTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPV 247
Query: 168 IRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
+ P GP P+ + G + DI+ P + ++ + L + +G+ G
Sbjct: 248 ENRKANDTP---GPLPMISWCGSSDSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTG 303
Query: 228 RNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
+W ++ +++G E L +S KH
Sbjct: 304 PSWSNKTEQGFFRGRDSREER---LQLVQMSRKH 334
>gi|443897123|dbj|GAC74465.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
Length = 495
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D
Sbjct: 174 SHARVLIHNNRVYLKYFQQSPFTRVNSALALLFQSVTGAREKLPDAEFCLSANDW----- 228
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
G K Y D W M PD+ F+ W E I + + E N W
Sbjct: 229 ----GSMGKFSLDRAPYLVDLWLM----PDYGFYSWPEPGIGSYTEHREKTLELENATPW 280
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ +W+G V R L ++ H WN
Sbjct: 281 SAKISKLFWRGAMSVGTADRKALLA-AAENHAWN 313
>gi|345324378|ref|XP_003430815.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
1-like [Ornithorhynchus anatinus]
Length = 588
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 82/187 (43%), Gaps = 18/187 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DL+ + I ++ +R Q + NNKVYI + + + R
Sbjct: 239 CPASFPQIQRDLAHFPTVDPDKIAVEIPQRFGQRQSLCHYTVKNNKVYIKTHGEHVGFR- 297
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK ++PD+E + D P+ + + P+N P +F + G + D
Sbjct: 298 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK---SPDNLHP--IFSWCGSTDSKD 350
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT- 254
IV P + + + SL + N G W D+ A W+G ++R++ L
Sbjct: 351 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPAWEDKNTTAVWRGR----DSRKERLEL 405
Query: 255 CNLSDKH 261
LS KH
Sbjct: 406 VKLSRKH 412
>gi|332225464|ref|XP_003261899.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Nomascus
leucogenys]
Length = 392
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 79/191 (41%), Gaps = 28/191 (14%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD-MLERANQTAHFRLILVNNKV 122
Q S+ +Y + C Y I EDL+P++ GI+R M E + + N++
Sbjct: 38 QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKTMAEVVRRKLGTHYQITKNRL 96
Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
Y + D G+ + + GRLPD+E++ + D P + P
Sbjct: 97 Y--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWM 141
Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWK 201
Query: 232 DREPYAYWKGN 242
+ AY++G+
Sbjct: 202 KKNSTAYFRGS 212
>gi|224043076|ref|XP_002195971.1| PREDICTED: KDEL motif-containing protein 1 [Taeniopygia guttata]
Length = 588
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 13/166 (7%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP I DL+ + + I +++ +R Q + NN+VYI Y + + R
Sbjct: 240 CPQTIPQIQRDLANFPIVDPDKIAKEIPQRFGQRQSLCHYTIKNNEVYIKTYGEHVGFR- 298
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK ++PD+E + D P+ + + P N P +F + G + D
Sbjct: 299 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKRKP---PQNLHP--IFSWCGSSESKD 351
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + + + SL + N G W D+ A+W+G
Sbjct: 352 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWDDKNTTAFWRG 396
>gi|157128435|ref|XP_001655120.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
gi|122067566|sp|Q16QY8.1|RUMI_AEDAE RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
gi|108872609|gb|EAT36834.1| AAEL011121-PA [Aedes aegypti]
Length = 402
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 56/263 (21%), Positives = 118/263 (44%), Gaps = 40/263 (15%)
Query: 21 SRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTC 80
S + + + ++++P + P++ T ++ K T+ + QE+++ Y + C
Sbjct: 20 SDDDSGMCMAKEPCSEAPQQ-ETKVDLYK------ATDNKYVALIQEALASYEPCQQANC 72
Query: 81 PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF--- 137
+ + DL P+K GI+ M+ERA + +V++++Y + +D
Sbjct: 73 SCHADVLKTDLRPFK-GGISEQMVERARSYG-TKYQIVDHRLY--------RQKDCMFPA 122
Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
G+ ++ LPD+EL+ +C D P I +R + + P+ +S +DI+
Sbjct: 123 RCSGVEHFIKPNLPHLPDMELIINCRDWPQI-NRHW----KQEKLPVLSFSKTDDYLDIM 177
Query: 198 FPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG-------NPF 244
+P W FW G I++ P W+ +K+ + W ++ A+++G +P
Sbjct: 178 YPTWGFWEGGPAISLYPTGLGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDERDPL 237
Query: 245 VAETRR--DLLTCNLSDKHDWNA 265
V +RR +L+ + W +
Sbjct: 238 VLLSRRKPELVDAQYTKNQAWKS 260
>gi|313241455|emb|CBY33708.1| unnamed protein product [Oikopleura dioica]
Length = 478
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 96/211 (45%), Gaps = 41/211 (19%)
Query: 49 KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN 108
K Q+++C N PK+ T +S C + F+ + + + I + + R++
Sbjct: 119 KLQSESC--NCPKSRATFKS--------AYQCTE-FKHLTSSFNKLQNKKINKKSIHRSS 167
Query: 109 QT----AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDD 164
+T + I+ +NK+Y + + + + F+ L LLRK LPD+E +F+ D
Sbjct: 168 ETLKGDSVIHYIIKDNKLYSKELSERLDFKR-FSDGIFLSLLRKV--NLPDIEFLFNVGD 224
Query: 165 RPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW-----SFWGWAEINIKPWESLLR 219
PV SR++ P+F + G + DIV P W + ++IN+ + L
Sbjct: 225 WPV--SREF---------PVFSWCGSEESSDIVVPTWDQIKTTLLSMSKINV---DILTM 270
Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRR 250
+L NG++W + P +++G E R
Sbjct: 271 QL----NGKSWQSKIPKGFFRGRDSSKERMR 297
>gi|47211427|emb|CAF96437.1| unnamed protein product [Tetraodon nigroviridis]
Length = 456
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 75/166 (45%), Gaps = 13/166 (7%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLI----LVNNKVYIHKYKQSIQTRD 135
CPD F I DLS + R+ E + + + + +NKVY+ + + + R
Sbjct: 126 CPDSFPQIDTDLSVFSSVDPDRNAQEVPQRFGQRQSLCHYTVKDNKVYVKTFGEHVGFR- 184
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK RLPD+E + D P+ + + P ++ P+F + G T+D
Sbjct: 185 IFMDSILLSLTRKV--RLPDVEFFVNLGDWPLEKRK----PTDQ-IHPIFSWCGSNNTLD 237
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + + + SL + + G W ++ A+W+G
Sbjct: 238 IVMPTYDL-TESVLETMGRVSLDMMSVQASTGPPWPEKNATAFWRG 282
>gi|348566951|ref|XP_003469265.1| PREDICTED: protein O-glucosyltransferase 1-like [Cavia porcellus]
Length = 392
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 81/191 (42%), Gaps = 28/191 (14%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDML-ERANQTAHFRLILVNNKV 122
Q ++ +Y + C Y I EDL+P++ GI+R M+ E ++ ++ N++
Sbjct: 38 QINRALENYEPCSSQNCSCYRGIIEEDLTPFR-GGISRKMMAEVVSRKLGTHYQVIRNRL 96
Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
Y + D G+ + + G LPD+E++ + D P I P
Sbjct: 97 Y--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQI-------PKWM 141
Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P P+F +S DI++P W+FW G A I P W+ +L W
Sbjct: 142 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWE 201
Query: 232 DREPYAYWKGN 242
+ AY++G+
Sbjct: 202 KKNSTAYFRGS 212
>gi|410906075|ref|XP_003966517.1| PREDICTED: protein O-glucosyltransferase 1-like [Takifugu rubripes]
Length = 367
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 49/216 (22%), Positives = 94/216 (43%), Gaps = 29/216 (13%)
Query: 68 SISDYSIPPTST-CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK 126
++S Y IP TS C + + +DL P+K I+ ++++ Q R + + ++ HK
Sbjct: 17 AVSGY-IPCTSANCSCHLSVLQDDLRPFKFK-ISEELMDATTQ----RGVGTHYQIIGHK 70
Query: 127 YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFR 186
+ G+ + + RLPD+E++ + D P + S P+F
Sbjct: 71 LYREQNCMFPARCSGVEHFILQLIDRLPDMEMVVNVRDYPQVPKWMESSL------PVFS 124
Query: 187 YSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYW 239
+S +DI++P W+FW G A I P W+ + +LK+ + W +E ++
Sbjct: 125 FSKTADYLDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRTDLKKSADQWPWKKKETKGFF 184
Query: 240 KG-------NPFVAETRRD--LLTCNLSDKHDWNAR 266
+G +P + +R D L+ + W +
Sbjct: 185 RGSRTSPERDPLILLSREDPELVDAEYTKNQAWKSE 220
>gi|358054730|dbj|GAA99656.1| hypothetical protein E5Q_06359 [Mixia osmundae IAM 14324]
Length = 437
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 81/193 (41%), Gaps = 15/193 (7%)
Query: 78 STC----PDYFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
STC PD +R W+ GITR L+ A+ AH R+++ +N+VY+ Y+ I
Sbjct: 95 STCHGEFPDLYREAERAAQYWRHRGGITRAALDAADAQAHARVLIKDNQVYLTNYRGGIN 154
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
+R + + + + + LP +E + DD I P +F + +
Sbjct: 155 SRTLAALASLNEAVLTAVEELPAVEFVIQTDDSAPI--------AGAAPRWVFARTDEED 206
Query: 193 TMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
+ + + PD+ + W E + + + + W + P +W+G V R +
Sbjct: 207 ELALWLMPDFGHYAWPEPGVGSMAEVQAKASAFEAAQTWSSKIPKLFWRG-ALVNPLRDE 265
Query: 252 LLTCNLSDKHDWN 264
L+ + ++ W
Sbjct: 266 LIRLSDENRGSWG 278
>gi|410897261|ref|XP_003962117.1| PREDICTED: KDEL motif-containing protein 1-like [Takifugu rubripes]
Length = 505
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 73/166 (43%), Gaps = 13/166 (7%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CPD F I DLS + I +++ +R Q + +NKVY+ + + + R
Sbjct: 157 CPDSFPQIDRDLSAFPNVDPDRIAQEIPQRFGQRQSLCHYTVKDNKVYVTTFGEHVGFR- 215
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+FT +L L RK LPD+E + D P+ + + + P+F + G T D
Sbjct: 216 IFTDSILLSLTRKV--WLPDVEFFVNLGDWPLEKRKLTDKIH-----PIFSWCGSNNTQD 268
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
IV P + + + SL + N G W ++ A+W+G
Sbjct: 269 IVMPTYDL-TESVLETMGRVSLDMTSVQANTGPPWAEKNATAFWRG 313
>gi|170033685|ref|XP_001844707.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
gi|167874675|gb|EDS38058.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
Length = 495
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 81/189 (42%), Gaps = 18/189 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
CP I+ DL P++ + ++ + ++ +N++Y Y +
Sbjct: 142 CPSEDVQINSDLIPFRAVNFSSLRPRIIQQYDKPGSVSLCNYVVKDNQIYRTCYGRYTGF 201
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ ++ +L L RK LPD+EL + D P++ + GP P+F + G T
Sbjct: 202 K-MYMDAILLSLARK--TLLPDMELFVNLGDWPLVTKGGHR--RTTGPYPIFSWCGSEDT 256
Query: 194 MDIVFPDWSFWGWAEINIKPWESL-LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
DIV P + E +++ + L L G W ++ P A+W+G E R DL
Sbjct: 257 FDIVMPTYDL---VEASLEAMSRVSLDMLSVQRKGVPWEEKVPKAFWRGRDACRE-RLDL 312
Query: 253 LTCNLSDKH 261
+ LS +H
Sbjct: 313 V--GLSQQH 319
>gi|326475589|gb|EGD99598.1| hypothetical protein TESG_06947 [Trichophyton tonsurans CBS 112818]
Length = 419
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 47/265 (17%), Positives = 95/265 (35%), Gaps = 53/265 (20%)
Query: 5 FAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQ 64
F FS + + N SR +++ P IT P +C + P
Sbjct: 14 FTFSTLQAQKLAAEN-SRLHRAVLMDANP-------ITAPASCQHH------AGNPGCQW 59
Query: 65 TQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN-------------QTA 111
+ + ++ + C F P I R ERA +
Sbjct: 60 SHKRDANTLVFGAERCQSAF--------PGLFAEIDRAKQERAQRPIGLAEIDSVIPKNG 111
Query: 112 HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSR 171
+ R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 112 YIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK------ 165
Query: 172 DYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
P+ ++ R D + PD+ +W W E + + + + +
Sbjct: 166 ---------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREEHTW 216
Query: 229 NWIDREPYAYWKGNPFVAETRRDLL 253
W + P +W+G E R L+
Sbjct: 217 AWPKKIPRLFWRGATMDLEVREKLI 241
>gi|148664476|gb|EDK96892.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_b [Mus musculus]
Length = 432
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|425781966|gb|EKV19900.1| hypothetical protein PDIG_00670 [Penicillium digitatum PHI26]
gi|425784005|gb|EKV21816.1| hypothetical protein PDIP_02930 [Penicillium digitatum Pd1]
Length = 436
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 60/137 (43%), Gaps = 14/137 (10%)
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYPGR--LPDL 156
+++ R A R I+ ++YI Y T R T+ + + L +P R LP +
Sbjct: 103 KELDSRTVDDAMVRGIIDRGELYIVDYAPMPMTASRARATLSSLHRALTAFPDRHLLPSI 162
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEINIKPWE 215
E +F +D + P P++ YS D T + PD+ +W W E++I P+
Sbjct: 163 EFIFTTED---------FAEDTTAPSPIWAYSKRDSDTSVWLMPDFGYWAWPEVHIGPYH 213
Query: 216 SLLRELKEGNNGRNWID 232
+ R + ++G D
Sbjct: 214 EVRRRIAAIDDGETAAD 230
>gi|302660285|ref|XP_003021823.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
gi|291185740|gb|EFE41205.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
Length = 420
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDKV-- 167
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
P+ ++ R D + PD+ +W W E + + + +
Sbjct: 168 -------------DPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214
Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W + P +W+G E R L+
Sbjct: 215 HTWAWSKKIPQLFWRGATMDLEVREKLI 242
>gi|149730404|ref|XP_001493278.1| PREDICTED: KDEL motif-containing protein 1 [Equus caballus]
Length = 502
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 80/190 (42%), Gaps = 22/190 (11%)
Query: 79 TCPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
+CP+ I DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 152 SCPETITQIQRDLAHFPTVDPEKIATEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVG 209
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
R +F +L L RK ++PD E + D P+ + + S + P+F + G
Sbjct: 210 FR-IFMDAILLSLTRKV--KMPDAEFFVNLGDWPLEKKKSSSHIH-----PIFSWCGSTD 261
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
+ DIV P + + + SL +GN G W + A W+G ++R++
Sbjct: 262 SKDIVMPTYDLTD-SVLETMGRVSLDMMSVQGNTGPPWESKNSTAVWRGR----DSRKER 316
Query: 253 LT-CNLSDKH 261
L LS KH
Sbjct: 317 LELVKLSRKH 326
>gi|452843550|gb|EME45485.1| hypothetical protein DOTSEDRAFT_71260 [Dothistroma septosporum
NZE10]
Length = 456
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 36/154 (23%), Positives = 67/154 (43%), Gaps = 18/154 (11%)
Query: 93 PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGR 152
W+ GI R ++ NQ I+ + V H ++ R V T+ + + + Y G+
Sbjct: 111 AWRGDGIVRALVHD-NQL----YIIDAHGVVDHNHR----PRTVATLHSLHRAISAYQGK 161
Query: 153 LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIK 212
LP++E F D + RD +G ++ W M PD+ WGW ++ ++
Sbjct: 162 LPNIEFTFTVHDAAL---RDPAGNETTWAYTRRKHQEKLWLM----PDFGLWGWPDVGLR 214
Query: 213 PWESLLRELKEGNNGRNWIDREPYAYWKGNPFVA 246
+ L L+ + ++D+ P W+G+ V
Sbjct: 215 SFAELQEVLEHEED--EFVDKVPKLVWRGSVAVG 246
>gi|358058341|dbj|GAA95860.1| hypothetical protein E5Q_02517 [Mixia osmundae IAM 14324]
Length = 492
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/159 (20%), Positives = 72/159 (45%), Gaps = 11/159 (6%)
Query: 98 GITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
GI L+ + R+I+ +N+VY+ K + ++R + I + + +PD+
Sbjct: 178 GIQERYLDTTEHGMNARVIIKSNRVYLRKPFNAGAKSRTQALLAAIEEAVLSSIEPIPDV 237
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWE 215
E + + +DR D + + P+ S + + + PD+ F+ W E ++ +
Sbjct: 238 EFVINSEDRV-----DAATAHT----PILGMSRKKQQGHVWLIPDFRFYAWPEPHVGTYP 288
Query: 216 SLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
+ ++ + W + +W+GNP + R++L+T
Sbjct: 289 DVQDQIYALEATQQWHHKRAKLFWRGNPPLHPLRQELMT 327
>gi|326483787|gb|EGE07797.1| DUF821 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 420
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--- 166
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
P+ ++ R D + PD+ +W W E + + + +
Sbjct: 167 ------------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214
Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W + P +W+G E R L+
Sbjct: 215 HTWAWPKKIPRLFWRGATMDLEVREKLI 242
>gi|71020735|ref|XP_760598.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
gi|46100486|gb|EAK85719.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
Length = 488
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 65/154 (42%), Gaps = 14/154 (9%)
Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+H R+++ NN+VY+ ++QS TR + + Q + +LPD E +D +
Sbjct: 167 SHARVVIHNNRVYLKYFQQSAFTRINSALALLFQSVVASREKLPDAEFCLSANDWGSMGK 226
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
+S N P L+ D W M PD+ F+ W E I + + W
Sbjct: 227 --FSLDRN---PSLY----DVWLM----PDYGFYSWPEPGIGSYTEHREKTLAIEQSTPW 273
Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ +W+G+ V R L ++ H+WN
Sbjct: 274 DSKIAKLFWRGSMGVGTADRKALLA-AAENHEWN 306
>gi|315044371|ref|XP_003171561.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311343904|gb|EFR03107.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 422
Score = 45.4 bits (106), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 28/145 (19%), Positives = 64/145 (44%), Gaps = 12/145 (8%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R+ T+ + + + P LP++E +F+ DD+ +
Sbjct: 112 KNGYIRAMIYDQQLYVIAVNGTIYSREYATLHALHRAIVSSPDPLPNIEFVFNTDDK--V 169
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
S G + + W M PD+ +W W E + + + + ++
Sbjct: 170 DSVAQWGYARR------EEDTNLWLM----PDFGYWSWPETKVGTMQEVRMKAEQEEQTW 219
Query: 229 NWIDREPYAYWKGNPFVAETRRDLL 253
W ++ P +W+G E R L+
Sbjct: 220 TWSEKIPKLFWRGATMDLEVREKLI 244
>gi|12963767|ref|NP_076134.1| KDEL motif-containing protein 1 precursor [Mus musculus]
gi|81881896|sp|Q9JHP7.1|KDEL1_MOUSE RecName: Full=KDEL motif-containing protein 1; AltName:
Full=Endoplasmic reticulum resident protein 58; Short=ER
protein 58; Short=ERp58; Flags: Precursor
gi|9621686|emb|CAC00650.1| ER protein 58 [Mus musculus]
gi|26331010|dbj|BAC29235.1| unnamed protein product [Mus musculus]
gi|74213118|dbj|BAE41698.1| unnamed protein product [Mus musculus]
gi|148664475|gb|EDK96891.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_a [Mus musculus]
Length = 502
Score = 45.4 bits (106), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|12841643|dbj|BAB25294.1| unnamed protein product [Mus musculus]
Length = 502
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317
Query: 254 T-CNLSDKH 261
LS KH
Sbjct: 318 ELVKLSRKH 326
>gi|354494153|ref|XP_003509203.1| PREDICTED: protein O-glucosyltransferase 1 [Cricetulus griseus]
Length = 401
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 82/190 (43%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I
Sbjct: 47 QINRALENYEPCSSRNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII---K 102
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K + +R IL+++R RLPD+E++ + D P + P
Sbjct: 103 KRLFREDDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 151
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A + P W+ +L W
Sbjct: 152 PTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEK 211
Query: 233 REPYAYWKGN 242
+ AY++G+
Sbjct: 212 KNSTAYFRGS 221
>gi|327267957|ref|XP_003218765.1| PREDICTED: KDEL motif-containing protein 1-like [Anolis
carolinensis]
Length = 502
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 73/167 (43%), Gaps = 13/167 (7%)
Query: 79 TCPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTR 134
+CP IH DL+ + I +++ +R Q + +NKVYI + + + R
Sbjct: 153 SCPQSIPQIHRDLAHFPAVDPEKIAKEIPQRFGQRQSLCHYTIKDNKVYIKTHGEHVGFR 212
Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
+F +L L RK ++PDLE + D P+ + + + + P+F + G +
Sbjct: 213 -IFMDAILLSLTRKV--KMPDLEFFVNLGDWPLEKKK-----SPQDIHPIFSWCGSNDSK 264
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
DIV P + + + SL + N G W ++ A W+G
Sbjct: 265 DIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWEEKNATAIWRG 310
>gi|330919750|ref|XP_003298742.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
gi|311327917|gb|EFQ93162.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
Length = 410
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 57/115 (49%), Gaps = 14/115 (12%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+PD+E DD P RS+D + G K P +R D W M P++++W W
Sbjct: 134 VPDIEFALSLDDLPR-RSKDKGTFFGYTRKEGPE-YR---DIWMM----PNYAYWSWNYT 184
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ W ++ +E+++ W ++P W+G +A+ R++L+ +S+ W+
Sbjct: 185 HAPSWNTIRKEIQQKETEVPWSKKDPRVVWRGKVKMAKLRQELI--RVSEGQSWS 237
>gi|115401698|ref|XP_001216437.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190378|gb|EAU32078.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 437
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 17/139 (12%)
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYPGR--LPDL 156
+D+ A R I+ N ++YI Y T R T+ + + L +P R LP +
Sbjct: 106 QDLDGLAVDDGMVRGIIENGELYIVDYGAMPFTFSRAKATLHSLQRALAAFPDRHLLPSV 165
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWE 215
E +F DD YS G P++ YS + + PD+ +W W E+ I P++
Sbjct: 166 EFVFTTDD--------YS----TGVGPIWSYSKREEDESVWLMPDFGYWSWPEVKIGPYK 213
Query: 216 SLLRELKEGNNGRNWIDRE 234
+ R + +NG + E
Sbjct: 214 DIRRRIAAVDNGETTVTGE 232
>gi|281332155|ref|NP_001094122.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 precursor [Rattus norvegicus]
gi|149060499|gb|EDM11213.1| similar to RIKEN cDNA 9630046K23 [Rattus norvegicus]
Length = 392
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 43/187 (22%), Positives = 79/187 (42%), Gaps = 20/187 (10%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVY 123
Q ++ +Y + C Y I EDL+P++ GI+R M+ + R + + ++
Sbjct: 38 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVR----RRLGTHYQII 92
Query: 124 IHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP- 182
H+ + G+ + + RLPD+E++ + D P + P P
Sbjct: 93 KHRLFREDDCMFPSRCSGVEHFILEVIRRLPDMEMVINVRDYPQV-------PKWMEPTI 145
Query: 183 PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREP 235
P+F +S DI++P W+FW G A + P W+ +L W +
Sbjct: 146 PVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNS 205
Query: 236 YAYWKGN 242
AY++G+
Sbjct: 206 TAYFRGS 212
>gi|351697696|gb|EHB00615.1| KTEL motif-containing protein 1 [Heterocephalus glaber]
Length = 392
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 80/194 (41%), Gaps = 34/194 (17%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ H+++I
Sbjct: 38 QINRALENYEPCSSQNCSCYQGVIEEDLTPFR-GGISRKMMAEVVSRKLGTHYQII---- 92
Query: 121 KVYIHKYKQSIQTRDVFTI----WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
K+ + D G+ + + G+LPD+E++ + D P + P
Sbjct: 93 -------KKRLYREDDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------P 138
Query: 177 NNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGR 228
P P+F +S DI++P W+FW G A I P W+ +L
Sbjct: 139 RWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQW 198
Query: 229 NWIDREPYAYWKGN 242
W + AY++G+
Sbjct: 199 PWKKKNSTAYFRGS 212
>gi|327297120|ref|XP_003233254.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
gi|326464560|gb|EGD90013.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
Length = 420
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
+ + R ++ + ++Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--- 166
Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
P+ ++ R D + PD+ +W W E + + + +
Sbjct: 167 ------------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214
Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W + P +W+G E R L+
Sbjct: 215 HTWAWPKKIPQLFWRGATMDLEVRERLI 242
>gi|452004927|gb|EMD97383.1| hypothetical protein COCHEDRAFT_1190251 [Cochliobolus
heterostrophus C5]
Length = 432
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 55/115 (47%), Gaps = 14/115 (12%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E DD P RSR + G K P D W M P++++W W
Sbjct: 156 VPNIEFSLSLDDLPR-RSRKEGTFFGYTRKDTPEY----KDIWMM----PNYAYWAWNYT 206
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
+ W S+ +E+++ W ++P W+G +A+ R++L+ +S+ DW+
Sbjct: 207 HAPSWNSIRKEIEQKEKELPWHKKDPRVVWRGKIKMAKLRKELV--KVSEGKDWS 259
>gi|198433875|ref|XP_002126603.1| PREDICTED: similar to KTEL motif-containing protein 1 [Ciona
intestinalis]
Length = 398
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 56/251 (22%), Positives = 109/251 (43%), Gaps = 44/251 (17%)
Query: 67 ESISDYSIPPTS---TCPDYFRWIHEDLSPWKV-TGITRDMLERA-NQTAHFRLILVNNK 121
++IS+ + TS C + I DL W+ GIT + ++R ++ H+++I ++K
Sbjct: 42 DAISEAAANYTSCGGNCTCHSDVITSDLKLWRERGGITNEDMKRGLERSVHYQII--DHK 99
Query: 122 VYIHKYKQSIQTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRP-VIRSRDYSGPNN 178
+Y Q + +F GI + + LPD+EL + D P V++ Y
Sbjct: 100 LYR-------QDKCMFPSRCSGIEHFILEIINDLPDMELGINVHDWPQVMKHSPY----- 147
Query: 179 KGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNW 230
P P+ +S + DI++P W+FW + W+ + ++LK+ +N W
Sbjct: 148 --PMPILSFSKVAKEHQDIMYPAWTFWAGGPAVWPIYRNGLGRWDLMRKDLKKKDNEFPW 205
Query: 231 IDREPYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNAR---LYVQVISFIYIT 278
++ +++G +P V +R DL+ + W ++ L + +++
Sbjct: 206 EKKQNIGFFRGSRTSSERDPLVLLSRENPDLVDAQYTKNQAWKSKKDTLGAEPAEIVHLL 265
Query: 279 HLCSKIPGFNF 289
C FNF
Sbjct: 266 DHCQYKYLFNF 276
>gi|194744002|ref|XP_001954487.1| GF18287 [Drosophila ananassae]
gi|190627524|gb|EDV43048.1| GF18287 [Drosophila ananassae]
Length = 382
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 68/162 (41%), Gaps = 25/162 (15%)
Query: 55 CPTNY--PKTSQTQ-------ESISDY--SIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
CP N P+++ T E+++DY P C + I +DL P+ GI++DM
Sbjct: 5 CPANLLSPRSNSTLDLVPNILEALADYRPCEPGDPLCLCHAATITKDLEPYSDKGISQDM 64
Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
+ ++ + ++ ++ V + L + G LPDLE + +
Sbjct: 65 ISQSKRQGTLYKVIRRRIFRQEHCSHPLRCSSVEDV------LLEIAGDLPDLEFVLNVC 118
Query: 164 DRPVIRSRDYSGPNNKG-PPPLFRYSGDRWTMDIVFPDWSFW 204
D P + P G P+F +S +DI+ P WSFW
Sbjct: 119 DWPQV-------PFLSGLSGPVFSHSTTALHLDIMCPAWSFW 153
>gi|348511563|ref|XP_003443313.1| PREDICTED: KDEL motif-containing protein 1-like [Oreochromis
niloticus]
Length = 538
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 91/226 (40%), Gaps = 33/226 (14%)
Query: 80 CPDYFRWIHEDLSPWKVTGI---TRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
CP F I DLS + +++ +R Q + +NK+Y+ + + + R
Sbjct: 190 CPQSFAQIERDLSFYPSVDPDHNAQEIPQRFGQRQSLCHYTVKDNKIYVRTFGEHVGFR- 248
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
+F +L L RK RLPD+E + D P+ + PP P+F + G
Sbjct: 249 IFMDAILLSLTRKV--RLPDVEFFVNLGDWPL---------EKRKPPEKLHPIFSWCGSN 297
Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
T DIV P + + + SL + N G W ++ A+W+G E R +
Sbjct: 298 NTRDIVMPTYDL-TESVLETMGRVSLDMMSVQANTGPPWPEKNATAFWRGRDSRQE-RLE 355
Query: 252 LLTCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
L+ + + H +A +F + H L + F+FF
Sbjct: 356 LVKLSRAHPHIIDAAF----TNFFFFKHDESLYGPLVKHVSFFDFF 397
>gi|26334375|dbj|BAC30905.1| unnamed protein product [Mus musculus]
Length = 392
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + +C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSQSCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
+++ + D G+ + + RLPD+E++ + D P + P
Sbjct: 95 RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A + P W+ +L
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WEKKNSTAYFRGS 212
>gi|26343369|dbj|BAC35341.1| unnamed protein product [Mus musculus]
Length = 432
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 80/187 (42%), Gaps = 18/187 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
C + I +DL+ + I ++ +R Q L +NKVYI + + + R
Sbjct: 153 CSETISQIQKDLAHFPTVDPEEIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F +L L RK R+PD+E + D P+ + + +N P+F + G + D
Sbjct: 212 IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTESRD 264
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT- 254
IV P + + + SL + N G W + A W+G ++R++ L
Sbjct: 265 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERLEL 319
Query: 255 CNLSDKH 261
LS KH
Sbjct: 320 VKLSRKH 326
>gi|327268710|ref|XP_003219139.1| PREDICTED: protein O-glucosyltransferase 1-like [Anolis
carolinensis]
Length = 391
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 87/196 (44%), Gaps = 26/196 (13%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVN 119
Q + ++ +Y T C Y + +DL P++ GI+ ++L A H+++I
Sbjct: 36 GQIERAVENYRPCVTENCRCYQKVREQDLDPFQ-EGISEELLSEAISRRLGTHYQII--A 92
Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
K+Y ++ R G+ + + LPD+E++ + D P + P
Sbjct: 93 KKLY-REHDCMFPAR----CSGVEHFILEIIDELPDMEMVINVRDYPQV-------PKFM 140
Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
P P+F +S DI++P W+FW G A I P W+ + ++LK + W
Sbjct: 141 KPKVPIFSFSKTSEYYDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRQDLKSSSEKWPWR 200
Query: 232 DREPYAYWKGNPFVAE 247
+ A+++G+ AE
Sbjct: 201 AKISKAFFRGSRTSAE 216
>gi|344257139|gb|EGW13243.1| KTEL motif-containing protein 1 [Cricetulus griseus]
Length = 333
Score = 44.7 bits (104), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 66/145 (45%), Gaps = 19/145 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I
Sbjct: 52 QINRALENYEPCSSRNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII---K 107
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K + +R IL+++R RLPD+E++ + D P + P
Sbjct: 108 KRLFREDDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 156
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW 204
P P+F +S DI++P W+FW
Sbjct: 157 PTIPVFSFSKTSEYHDIMYPAWTFW 181
>gi|426245590|ref|XP_004016593.1| PREDICTED: KDEL motif-containing protein 2 [Ovis aries]
Length = 557
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CPD I +D + + + + ++E + R +V+ + H Y++S+
Sbjct: 203 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 262
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 263 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 317
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 318 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 365
>gi|148665570|gb|EDK97986.1| RIKEN cDNA 9630046K23, isoform CRA_b [Mus musculus]
Length = 372
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 18 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 74
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
+++ + D G+ + + RLPD+E++ + D P + P
Sbjct: 75 RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 119
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A + P W+ +L
Sbjct: 120 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWP 179
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 180 WEKKNSTAYFRGS 192
>gi|26348175|dbj|BAC37727.1| unnamed protein product [Mus musculus]
Length = 502
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 75/188 (39%), Gaps = 20/188 (10%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
DIV P + + + SL + N G W + A W+G + L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRD---SHKERLE 318
Query: 254 TCNLSDKH 261
LS KH
Sbjct: 319 LVKLSRKH 326
>gi|451855485|gb|EMD68777.1| hypothetical protein COCSADRAFT_187637 [Cochliobolus sativus
ND90Pr]
Length = 432
Score = 44.3 bits (103), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 12/114 (10%)
Query: 153 LPDLELMFDCDDRPVIRSRD--YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEIN 210
+P++E DD P +D + G K P +R D W M P++++W W +
Sbjct: 156 IPNIEFSLSLDDLPRRSRKDGTFFGYTRKDTPE-YR---DIWMM----PNYAYWAWNYTH 207
Query: 211 IKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
W S+ +E+++ W ++P W+G +A+ R++L+ +S+ +W+
Sbjct: 208 APSWNSIRKEIEQKEKDLPWDKKDPRVVWRGKIKMAKLRKELV--KVSEGKEWS 259
>gi|270007100|gb|EFA03548.1| hypothetical protein TcasGA2_TC013552 [Tribolium castaneum]
Length = 399
Score = 44.3 bits (103), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 56/225 (24%), Positives = 101/225 (44%), Gaps = 42/225 (18%)
Query: 47 CVKNQTQTCPTN----YPKTSQTQ---------ESISDYSIPPTSTCPDYFRWIHEDLSP 93
C+K ++ C + Y K + T+ + +YS + C Y I +DL
Sbjct: 23 CLKENSEQCNASKVNMYSKEANTKYLKYLDLIKRAKENYSPCDNTKCGCYSSQISDDLKI 82
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLLRK 148
+K GIT ++++ +T + ++++K+Y RD ++ GI L K
Sbjct: 83 FK-KGITPQLIDKV-KTKGTKYQIIDHKLY----------RDKNCMFPARCAGIEHFLLK 130
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWA 207
+LPD+EL+ + D P I +DY P+F +S DI++P W+FW G
Sbjct: 131 LLPKLPDMELIINTRDWPQIH-KDYGVFG-----PVFSFSKTSDYSDIMYPAWAFWEGGP 184
Query: 208 EINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
I++ P W++ L + N W ++ P +++G+ AE
Sbjct: 185 AISLYPRGIGRWDTHRDLLGKKGNETLWDEKIPKGFFRGSRTSAE 229
>gi|27369505|ref|NP_759012.1| protein O-glucosyltransferase 1 precursor [Mus musculus]
gi|110832796|sp|Q8BYB9.2|PGLT1_MOUSE RecName: Full=Protein O-glucosyltransferase 1; AltName:
Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
motif-containing protein 1; Flags: Precursor
gi|20071120|gb|AAH26809.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Mus musculus]
gi|26327463|dbj|BAC27475.1| unnamed protein product [Mus musculus]
gi|26331050|dbj|BAC29255.1| unnamed protein product [Mus musculus]
gi|26331242|dbj|BAC29351.1| unnamed protein product [Mus musculus]
gi|148665569|gb|EDK97985.1| RIKEN cDNA 9630046K23, isoform CRA_a [Mus musculus]
Length = 392
Score = 44.3 bits (103), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 82/193 (42%), Gaps = 32/193 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
+++ + D G+ + + RLPD+E++ + D P + P
Sbjct: 95 RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A + P W+ +L
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWP 199
Query: 230 WIDREPYAYWKGN 242
W + AY++G+
Sbjct: 200 WEKKNSTAYFRGS 212
>gi|358058116|dbj|GAA96095.1| hypothetical protein E5Q_02756 [Mixia osmundae IAM 14324]
Length = 486
Score = 44.3 bits (103), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 34/160 (21%), Positives = 70/160 (43%), Gaps = 19/160 (11%)
Query: 96 VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
+ G+T +E + + H RL++ + ++++ +++ + Q+R + I + +PD
Sbjct: 170 LGGVTLSDVEHSGEIWHARLMIKDGRLFVKRFRPAYQSRCQAVMAAIHDAIITSLEPVPD 229
Query: 156 LELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIK 212
+E + DD + P L R +D+ + D+ F W E IK
Sbjct: 230 IEFVIGLDD---------TAPFEPATWGLAR------RLDLPAWLVIDYGFNAWPEPMIK 274
Query: 213 PWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
W + L ++K N W + +W+G +++E R L
Sbjct: 275 GWHTFLADVKSVNAQMTWRQKAKKLFWRG-VYLSEYREQL 313
>gi|82658302|ref|NP_001032511.1| KTEL motif-containing protein 1 precursor [Danio rerio]
gi|81097746|gb|AAI09459.1| Zgc:123318 [Danio rerio]
Length = 389
Score = 44.3 bits (103), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 101/243 (41%), Gaps = 40/243 (16%)
Query: 48 VKNQTQTCPTNYPKTSQT-----QESISDYSIPPTSTCPDYFRWIHEDLSPWK---VTGI 99
++ + C ++ K QT E+ +Y C Y + +DL P+K G+
Sbjct: 14 LQTGVEFCVSDNAKQWQTFIRKITEATKNYQPCSQDNCTCYQSVLKDDLRPFKNGISEGL 73
Query: 100 TRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
D + R T H+++I + K+Y +QS + G+ + K RLPDLE++
Sbjct: 74 MADTVNRGVGT-HYQII--SKKLY---REQSCMFPARCS--GVEHFILKVIDRLPDLEVV 125
Query: 160 FDCDDRPVIRSRDYSGPNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP--- 213
+ D P + P P P+ +S + DI++P W+FW G A I P
Sbjct: 126 INVRDYPQV-------PGWIQPVLPVLSFSKTKDYQDIMYPAWTFWEGGPAVWPIYPTGL 178
Query: 214 --WESLLRELKEGNNGRNWIDREPYAYWKG-------NPFVAETRR--DLLTCNLSDKHD 262
W+ + +LK W + P +++G +P + +R DL+ +
Sbjct: 179 GRWDLMRDDLKRSVEQWPWKKKSPKGFFRGSRTSSERDPLILLSRAAPDLVDAEYTKNQA 238
Query: 263 WNA 265
W +
Sbjct: 239 WKS 241
>gi|145250763|ref|XP_001396895.1| hypothetical protein ANI_1_1376134 [Aspergillus niger CBS 513.88]
gi|134082418|emb|CAK42432.1| unnamed protein product [Aspergillus niger]
Length = 442
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 62/124 (50%), Gaps = 17/124 (13%)
Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
R I+ + ++Y+ + + TR T+ + + L +P R LPD+E V+
Sbjct: 125 RGIVHHGELYVVDFGAMPATFTRGEATLHALHRALASFPDRSSLPDIEF--------VLT 176
Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
+ DYS N+GP ++ YS ++ + PD+ +W W E+ I P++ + R + ++G
Sbjct: 177 TEDYS--TNEGP--VWSYSKQDEQENVWLMPDFGYWSWPEVKIGPYKDIRRRIAAVDDGE 232
Query: 229 NWID 232
ID
Sbjct: 233 VTID 236
>gi|350636309|gb|EHA24669.1| hypothetical protein ASPNIDRAFT_182202 [Aspergillus niger ATCC
1015]
Length = 442
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 62/124 (50%), Gaps = 17/124 (13%)
Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
R I+ + ++Y+ + + TR T+ + + L +P R LPD+E V+
Sbjct: 125 RGIVHHGELYVVDFGAMPATFTRGKATLHALHRALASFPDRSSLPDIEF--------VLT 176
Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
+ DYS N+GP ++ YS ++ + PD+ +W W E+ I P++ + R + ++G
Sbjct: 177 TEDYS--TNEGP--VWSYSKRDEQQNVWLMPDFGYWSWPEVKIGPYKDIRRRIAAVDDGE 232
Query: 229 NWID 232
ID
Sbjct: 233 VTID 236
>gi|47216704|emb|CAG00978.1| unnamed protein product [Tetraodon nigroviridis]
Length = 392
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 79/181 (43%), Gaps = 19/181 (10%)
Query: 56 PTNYPKTSQTQESISDY---SIPPTS-TCPDYFRWIHEDLSPWKVTGITRDMLERANQTA 111
P + + + +++ISD IP +S C + I +DL P+K I+ +++ Q
Sbjct: 22 PASGKRWKKIKDNISDAVKGHIPCSSDNCSCHLSVIQDDLRPFK-GKISENLMAATIQ-- 78
Query: 112 HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSR 171
R + + ++ HK + G+ + + GRLPD+E++ + D P +
Sbjct: 79 --RGVGTHYQIIGHKLYREENCMFPARCSGVEHFILQLIGRLPDMEMVVNVRDYPQV--- 133
Query: 172 DYSGPN-NKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
P P+F +S DI++P W+FW + PW+ +E + G W
Sbjct: 134 ----PKWVDSLLPVFSFSKTADYQDIMYPAWTFWEGGPADQWPWKQ--KETRGFFRGSRW 187
Query: 231 I 231
+
Sbjct: 188 V 188
>gi|390332934|ref|XP_783946.3| PREDICTED: KDEL motif-containing protein 1-like [Strongylocentrotus
purpuratus]
Length = 512
Score = 43.9 bits (102), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 92/211 (43%), Gaps = 31/211 (14%)
Query: 46 NC---VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
NC V +Q Q Y ++ ++ +D+SI P+ +H+D+
Sbjct: 146 NCDCPVDDQDQWRSDMYCRSGVHPQTQTDFSIFPSID----LGTLHQDVES--------- 192
Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
+ + H+ +I NNKVY K SI +F+ +L L RK R+PD+E +
Sbjct: 193 RFAKHHSLCHYSII--NNKVY-RKTLGSIVGFKMFSDAFLLSLTRKV--RVPDVEFFINL 247
Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
D P + RD P ++ P P+ + G T DIV P + + + SL
Sbjct: 248 GDWP-LEKRD---PEDE-PLPILSWCGSTDTRDIVLPTYDI-TESTLETMGRVSLDMMSV 301
Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ N G W ++ A+W+G ++RR+ L
Sbjct: 302 QANTGPKWENKTEKAFWRGR----DSRRERL 328
>gi|195443900|ref|XP_002069627.1| GK11623 [Drosophila willistoni]
gi|194165712|gb|EDW80613.1| GK11623 [Drosophila willistoni]
Length = 383
Score = 43.9 bits (102), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 60/133 (45%), Gaps = 12/133 (9%)
Query: 75 PPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
P S C Y I +L P+ TGIT+ M++++ + + + +++Y +S
Sbjct: 45 PCESECECYALAIARNLRPYAGTGITKPMIDQSRRFGTLYKV-IGSRLY-----RSDNCA 98
Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
+ +LL LPDLE + + D P I SG + P+F YS +
Sbjct: 99 YPSRCASVEELLLNIVRDLPDLEFVLNVRDWPQIHF--LSGLSG----PVFSYSSTDNFL 152
Query: 195 DIVFPDWSFWGWA 207
DI+ P WSFW A
Sbjct: 153 DIMCPAWSFWTSA 165
>gi|390342113|ref|XP_779911.3| PREDICTED: protein O-glucosyltransferase 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 391
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 85/197 (43%), Gaps = 22/197 (11%)
Query: 60 PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV-TGITRDMLERANQTAHFRLILV 118
P S +E+ S Y T C + + EDLS W+ GITR + +A + +
Sbjct: 35 PYLSLIEEATSSYEACVTDDCSCHAGVMDEDLSVWEERGGITRADVVKAESRGTLYQV-I 93
Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
N+++Y + K R IL++++K LPD+E + + D P +S Y+
Sbjct: 94 NHRLY-REEKCMFPARCNGVEHFILRIIKK----LPDMEFVMNVRDWP--QSGKYT---- 142
Query: 179 KGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNW 230
P P+ +S DI++P W+FW + W+ + + + W
Sbjct: 143 -DPIPVLSFSKVQSQHYDIMYPAWTFWEGGPAVWPLFPTGLGRWDLFRESIDKESQKLPW 201
Query: 231 IDREPYAYWKGNPFVAE 247
+E A+++G+ AE
Sbjct: 202 DTKEDKAFFRGSRTTAE 218
>gi|255930987|ref|XP_002557050.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211581669|emb|CAP79782.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 436
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 58/137 (42%), Gaps = 14/137 (10%)
Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQS--IQTRDVFTIWGILQLLRKYPGR--LPDL 156
+++ R R I+ ++YI Y +R T+ + + L +P R LP +
Sbjct: 103 KELDSRTVDDGMVRGIIDRGELYIVDYAPMPVTASRARATLNSLHRALTAFPDRHLLPSI 162
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEINIKPWE 215
E +F +D + P P++ YS D T + PD+ +W W E+ I P+
Sbjct: 163 EFIFTTED---------FAEDTTTPSPIWSYSKRDSHTSVWLMPDFGYWAWPEVQIGPYH 213
Query: 216 SLLRELKEGNNGRNWID 232
+ R + ++G D
Sbjct: 214 EVRRRIAAIDDGETAAD 230
>gi|26348271|dbj|BAC37775.1| unnamed protein product [Mus musculus]
Length = 254
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 66/148 (44%), Gaps = 25/148 (16%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
Q ++ +Y + C Y I EDL+P++ GI+R M+ R H+++I N
Sbjct: 38 QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94
Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
+++ + D G+ + + RLPD+E++ + D P + P
Sbjct: 95 RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW 204
P P+F +S DI++P W+FW
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFW 167
>gi|189194465|ref|XP_001933571.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979135|gb|EDU45761.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 434
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 12/107 (11%)
Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E DD P RS+D + G K P +R D W M P++++W W
Sbjct: 158 IPNIEFALSLDDLPR-RSKDKGTFFGYTRKEGPE-YR---DIWMM----PNYAYWSWNYT 208
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
+ W ++ +E+++ W ++P W+G +A+ R++L+ +
Sbjct: 209 HAPSWNAIRKEIQQKETEVPWSKKDPRVVWRGKVKMAKLRQELIKVS 255
>gi|213514888|ref|NP_001133941.1| KDEL motif-containing protein 2 precursor [Salmo salar]
gi|209155896|gb|ACI34180.1| KDEL motif-containing protein 2 precursor [Salmo salar]
Length = 524
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 73/167 (43%), Gaps = 13/167 (7%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLER----ANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
CP I D + + + R + E +N+ ++NN+VY + +
Sbjct: 168 CPAEEPQIQRDFNAFPSIDLQRLLQEVPRRFSNRGGLIHYTVINNQVYRRSLGKYTDFK- 226
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPV-IRSRDYSGPNNKGPPPLFRYSGDRWTM 194
+F+ +L L RK RLPD+E + D P+ R D ++ GP P+ + G T
Sbjct: 227 MFSDEMLLSLARKV--RLPDVEFYINVGDWPMETRKAD----DDPGPVPIISWCGSTDTR 280
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
DI+ P + ++ + L + +GN G W ++ A+++G
Sbjct: 281 DIILPTYDITHSTLETMRGVSNDLLSV-QGNTGPPWANKMEQAFFRG 326
>gi|258569104|ref|XP_002585296.1| predicted protein [Uncinocarpus reesii 1704]
gi|237906742|gb|EEP81143.1| predicted protein [Uncinocarpus reesii 1704]
Length = 483
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 40/189 (21%), Positives = 81/189 (42%), Gaps = 20/189 (10%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERAN------QTAHFRLILVNNKVYIHKYKQS 130
T C F ++ED+ I+R + + + Q R ++ N ++I + + +
Sbjct: 108 TGQCQTAFPGLYEDVDRAVRFWISRGGILKKSLNTVKLQNGMARAMIYNGNLFIIEARAA 167
Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
+ + + + R R P++E +F +D+ D SG + PL+ S
Sbjct: 168 QEDHRRKILAVLSSIHRALGNRAPNIEFIFSVEDK----VEDVSGQGH----PLWVLSRK 219
Query: 191 RWTMDI-VFPDWSFWGWA--EINIKPWESLLRELKEGN--NGRNWIDREPYAYWKGN-PF 244
+ + PD+ FW W NI P++ +++ +++ + + W + P W+G F
Sbjct: 220 ATEKSVWLIPDFGFWAWGNPASNIGPYDQVVKRIEKFDLEDTMPWSSKTPRLVWRGKLSF 279
Query: 245 VAETRRDLL 253
+ RR LL
Sbjct: 280 APKLRRRLL 288
>gi|56118414|ref|NP_001008114.1| MGC89395 protein precursor [Xenopus (Silurana) tropicalis]
gi|51703984|gb|AAH81318.1| MGC89395 protein [Xenopus (Silurana) tropicalis]
Length = 385
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 50/220 (22%), Positives = 97/220 (44%), Gaps = 29/220 (13%)
Query: 63 SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
+ + + +S P + Y+R + EDL+P+ +GI+RD++++ R + + ++
Sbjct: 30 NHIRRAAQSHSTPDSQNGTFYYRVLQEDLAPFG-SGISRDLMQKVLS----RKLGTHYQI 84
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
H+ + + G+ L + LPD++L+ + D P + P P
Sbjct: 85 INHRLYREEECMFPARCSGVEHFLLELLPDLPDMDLVVNVRDYPQV-------PRWMDPV 137
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P W+FW G A I P W+ + ELK+ + W +
Sbjct: 138 IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREELKKAADLWPWEKKI 197
Query: 235 PYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNA 265
P Y++G +P + +R DL+ + W +
Sbjct: 198 PKGYFRGSRTSPERDPLILLSRESPDLVDAEYTKNQAWKS 237
>gi|67901122|ref|XP_680817.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
gi|40742938|gb|EAA62128.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
gi|259483867|tpe|CBF79610.1| TPA: DUF821 domain protein (AFU_orthologue; AFUA_2G14740)
[Aspergillus nidulans FGSC A4]
Length = 462
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 68/152 (44%), Gaps = 23/152 (15%)
Query: 114 RLILVNNKVYIHKYKQS--IQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
R + + ++YI Y TR T+ + + L YP R LPD+E + DD
Sbjct: 143 RAAVKDGELYIIDYGPQPYTFTRGKATLHSLHRALSSYPDRHSLPDIEFVLTTDD----- 197
Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEIN-IKPWESLLRELKEGN 225
+S + P P++ Y+ + D + PD+ +W W E++ + ++ + R +
Sbjct: 198 ---FSTRTSTNPSPIWAYTKRQEDEDAAIWLMPDFGYWSWPEVDTVGEYKDVRRRIFAKE 254
Query: 226 NGRNWIDREPYAYWKG----NPFVAETRRDLL 253
G + D++ W+G NP E R+ LL
Sbjct: 255 EGLPFSDKKKQLLWRGSVSANP---EIRKALL 283
>gi|60688257|gb|AAH91408.1| Ktelc1 protein [Rattus norvegicus]
Length = 349
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 76/181 (41%), Gaps = 20/181 (11%)
Query: 70 SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
+Y + C Y I EDL+P++ GI+R M+ + R + + ++ H+ +
Sbjct: 1 ENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVR----RRLGTHYQIIKHRLFR 55
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLFRYS 188
G+ + + RLPD+E++ + D P + P P P+F +S
Sbjct: 56 EDDCMFPSRCSGVEHFILEVIRRLPDMEMVINVRDYPQV-------PKWMEPTIPVFSFS 108
Query: 189 GDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG 241
DI++P W+FW G A + P W+ +L W + AY++G
Sbjct: 109 KTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRG 168
Query: 242 N 242
+
Sbjct: 169 S 169
>gi|302511259|ref|XP_003017581.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
gi|291181152|gb|EFE36936.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
Length = 303
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/136 (19%), Positives = 58/136 (42%), Gaps = 18/136 (13%)
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K+Y+ +I +R++ T+ + + + P LP++E +F+ DD+
Sbjct: 5 KLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDKV-------------- 50
Query: 181 PPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYA 237
P+ ++ R D + PD+ +W W E + + + + + +W + P
Sbjct: 51 -DPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREEHTWSWSKKIPQL 109
Query: 238 YWKGNPFVAETRRDLL 253
+W+G E R L+
Sbjct: 110 FWRGATMDLEVREKLI 125
>gi|358415519|ref|XP_003583131.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
Length = 460
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
T +CPD I +D + + + + ++E + R +V+ + + Y++S+
Sbjct: 106 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGKYT 165
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 166 DFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 220
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 221 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 268
>gi|91081993|ref|XP_969039.1| PREDICTED: similar to AGAP008037-PA [Tribolium castaneum]
gi|270007375|gb|EFA03823.1| hypothetical protein TcasGA2_TC013938 [Tribolium castaneum]
Length = 498
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 24/190 (12%)
Query: 80 CPDYFRWIHEDLSPWKVTG--ITRDMLERANQTAH----FRLILVNNKVYIHKYKQSIQT 133
C + R I DL P++ RD + + H R ++ NN++Y Y + +
Sbjct: 152 CGNTPRQILADLKPFQTVNWDKLRDKVIKKFDQPHSISLCRYVIKNNEIYRTCYGKYVGF 211
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ +F +L L RK LPDLE + D P++ + + P +F + G +
Sbjct: 212 K-MFMDAILLSLSRKV--NLPDLEFFINLGDWPLVTEKIETFP-------IFSWCGSTTS 261
Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
+DIV P + E ++ ++ ++ +GN +W +R A+W+G + R D
Sbjct: 262 LDIVMPTYDI---TESTLENMGRVMLDMLSVQGNVKESWENRTGQAFWRGRD-SNQHRLD 317
Query: 252 LLTCNLSDKH 261
L+ +++ KH
Sbjct: 318 LI--DIARKH 325
>gi|358373714|dbj|GAA90310.1| DUF821 domain protein [Aspergillus kawachii IFO 4308]
Length = 442
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 60/128 (46%), Gaps = 25/128 (19%)
Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYP--GRLPDLELMFDCDDRPVIR 169
R I+ + ++Y+ + + TR T+ + + L +P G LPD+E V+
Sbjct: 125 RGIIHHGELYVVDFGAMPATFTRGKSTLHALHRALASFPDRGSLPDIEF--------VLT 176
Query: 170 SRDYSGPNNKGPPPLFRYS-----GDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEG 224
+ DYS P++ YS G+ W M PD+ +W W E+ I P++ + R +
Sbjct: 177 TEDYSIHQG----PVWSYSKRDEQGNVWLM----PDFGYWSWPEVKIGPYKDIRRRIAAV 228
Query: 225 NNGRNWID 232
++G +D
Sbjct: 229 DDGEVTLD 236
>gi|297482669|ref|XP_002693013.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
gi|296480318|tpg|DAA22433.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 2 [Bos taurus]
Length = 508
Score = 42.0 bits (97), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
T +CPD I +D + + + + ++E + R +V+ + + Y++S+
Sbjct: 154 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGKYT 213
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 214 DFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 268
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 269 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 316
>gi|119481815|ref|XP_001260936.1| hypothetical protein NFIA_089970 [Neosartorya fischeri NRRL 181]
gi|119409090|gb|EAW19039.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 444
Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 67/153 (43%), Gaps = 18/153 (11%)
Query: 81 PDYFRWIHEDLSPWKVTGIT-RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVF 137
P F I + +S K IT +++ + + R + ++YI Y T R
Sbjct: 89 PKLFVEIEKTVSLRKERPITFKEVDDVVVEDGMVRAAIWRGELYILDYAAQPYTYSRAKA 148
Query: 138 TIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
T+ + + L+ +P R LPD+E +F DD + P P++ YS
Sbjct: 149 TLNSLHRALQSFPDRHSLPDIEFVFTADDFSNV------------PGPVWSYSKRDEDES 196
Query: 196 I-VFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
I + PD+ +W W E+ + P++ + R + + G
Sbjct: 197 IWLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEG 229
>gi|395519040|ref|XP_003763661.1| PREDICTED: protein O-glucosyltransferase 1 [Sarcophilus harrisii]
Length = 335
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 22/188 (11%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKV 122
Q +S+ +Y C + + +DL+P++ GI+++M+ N+ ++ N++
Sbjct: 37 QINKSVENYQSCSKGNCSCHLGVMEKDLAPFQ-GGISKEMMANVVNRKLGTHYQIIKNEL 95
Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
Y ++ R G+ + + LPD+E++ + D P I P P
Sbjct: 96 Y-REHDCMFPAR----CSGVEHFILEIINHLPDMEMVINVRDYPQI-------PKWMEPI 143
Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P W+FW G A I P W+ L +L W +
Sbjct: 144 IPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWEKKV 203
Query: 235 PYAYWKGN 242
Y++G+
Sbjct: 204 SKGYFRGS 211
>gi|195331145|ref|XP_002032263.1| GM23616 [Drosophila sechellia]
gi|194121206|gb|EDW43249.1| GM23616 [Drosophila sechellia]
Length = 404
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 40/168 (23%), Positives = 72/168 (42%), Gaps = 19/168 (11%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C Y I ++L P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CLCYVATIQKNLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL +PDLE + + D P + SG + P+ YS +DI++P
Sbjct: 120 E---DLLLDMASGVPDLEFVLNVRDWPQVHF--LSGLSG----PVLSYSITDRHLDIMYP 170
Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
WSFW N P +L+ G +W+ + A P+ A+
Sbjct: 171 AWSFW----TNTGP---ILQHYPHGVGRWDWMRKHLVARASETPWNAK 211
>gi|334329589|ref|XP_001370099.2| PREDICTED: protein O-glucosyltransferase 1 [Monodelphis domestica]
Length = 383
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 84/190 (44%), Gaps = 26/190 (13%)
Query: 64 QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM---LERANQTAHFRLILVNN 120
Q +S+ +Y P C + + +DL+P++ GI++++ L H+++I N
Sbjct: 29 QINKSVENYQPCPRENCSCHLGVMEKDLAPFQ-GGISKEVMTTLVSRKLGTHYQII--KN 85
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
++Y ++ R ILQ++ LPD+E++ + D P + P
Sbjct: 86 ELY-REHDCMFPARCSGVEHFILQVI----NHLPDMEMVINVRDYPQV-------PKWME 133
Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
P P+F +S DI++P W+FW G A I P W+ L +L W
Sbjct: 134 PAIPVFSFSKTNEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWER 193
Query: 233 REPYAYWKGN 242
+ Y++G+
Sbjct: 194 KISRGYFRGS 203
>gi|195572998|ref|XP_002104482.1| GD18425 [Drosophila simulans]
gi|194200409|gb|EDX13985.1| GD18425 [Drosophila simulans]
Length = 404
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 57/125 (45%), Gaps = 12/125 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C Y I ++L P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CICYVATIQKNLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL +PDLE + + D P + SG + P+ YS +DI++P
Sbjct: 120 E---DLLLDMASGVPDLEFVLNVRDWPQVHF--LSGLSG----PVLSYSITDRHLDIMYP 170
Query: 200 DWSFW 204
WSFW
Sbjct: 171 AWSFW 175
>gi|350633295|gb|EHA21660.1| hypothetical protein ASPNIDRAFT_184185 [Aspergillus niger ATCC
1015]
Length = 452
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 97/232 (41%), Gaps = 52/232 (22%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTST-------------CPDYFRWIHED----LSPWKV- 96
CP P +++S S PPT T C F + ED S W
Sbjct: 64 CPAQLP----FHDTVSSSSAPPTRTFFDPLEYSYNETQCTTLFPGLFEDPIRAQSFWTAR 119
Query: 97 TGITR---DMLERANQTAHFRLILVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPG 151
GI R D ++ A+ A R+ + ++Y+ + K R + GIL + +
Sbjct: 120 HGIKRIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRRKIL---GILASIHRALV 174
Query: 152 RLPDL------ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFW 204
P L E++F +D+ D +GP++ PL+ + + + PD+ FW
Sbjct: 175 SAPQLAVILNTEIIFSVEDK----LEDVAGPDH----PLWVLARKATEESVWLMPDFGFW 226
Query: 205 GWAEIN--IKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
W I+ I P++ +++ ++E + W ++E W+G F + RR LL
Sbjct: 227 SWGHIDTQIGPYDEVVKHVEEHEHP--WDEKENKLVWRGKLSFAPKLRRALL 276
>gi|302834814|ref|XP_002948969.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
nagariensis]
gi|300265714|gb|EFJ49904.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
nagariensis]
Length = 646
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 80/187 (42%), Gaps = 29/187 (15%)
Query: 79 TCPDY---FRWIHEDLSPWKV-TGITRDMLERAN--QTAHFR-----LILVNNKVYIHKY 127
C +Y + IH DL +K GI+ +++ R +A +R + + KVYI
Sbjct: 30 VCDNYEPLYASIHRDLDIYKRNAGISPNLIYRTMVLHSAGYREKGLAVAVYRGKVYIISN 89
Query: 128 KQSIQTRD--------VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
+SI + V I +L L KY LPD+E ++ DRP+ G N
Sbjct: 90 TRSISLKRFGHHVALWVAYIKVLLDLEEKYGSYLPDVEFVWHTIDRPIRLVNTTPGGENF 149
Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWE-SLLRELKEGNNGRNWIDREPYAY 238
P+FR+ DI+ P++ F+ +KP++ L + N W R P +
Sbjct: 150 ---PVFRFCKSVVHPDILVPNFHFY------MKPYQREFLDRIPHFNAEVPWAQRRPIVF 200
Query: 239 WKGNPFV 245
+ + +V
Sbjct: 201 ARFSGYV 207
>gi|255088471|ref|XP_002506158.1| predicted protein [Micromonas sp. RCC299]
gi|226521429|gb|ACO67416.1| predicted protein [Micromonas sp. RCC299]
Length = 658
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 86/197 (43%), Gaps = 27/197 (13%)
Query: 81 PDYFRW---IHEDLSPW-KVTGITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQTR 134
P+ F W ++ + P+ K++ D E + + FR ++ ++++ + R
Sbjct: 197 PEIFDWSPFVYRAMFPYNKISKADVDTAELVARGSVNSFRAQIIGGRLFVKDIRALEFAR 256
Query: 135 DVFTIWGI--LQLLRKYPGRLPDLELMFDCDDRPVI----------RSRDYSG-PNNKGP 181
D W I L+ +R+ LPD++ +F+ D P++ R G N + P
Sbjct: 257 DYAPSWKITLLETMRRR-RDLPDIDAVFNEGDYPIVLLPNDGAHAQRLYGREGMSNGQKP 315
Query: 182 PPLFRYSGD-RWTMDIVFPDWSFW-----GWAEINIKPWESLLRELKEGNNGRNWIDREP 235
PPLF + + + T D+ FPD+SF G ++ W L E + D+ P
Sbjct: 316 PPLFSPTTNVQMTRDVPFPDFSFSPPGVKGADRLSTTRWSVAHGRLLEAGAKIPFEDKLP 375
Query: 236 YAYWKGNPFVAETRRDL 252
A + GN AE R+ L
Sbjct: 376 LAAFTGN-TQAEPRQRL 391
>gi|260827316|ref|XP_002608611.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
gi|229293962|gb|EEN64621.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
Length = 513
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 78/184 (42%), Gaps = 19/184 (10%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRL---ILVNNKVYIHKYKQSIQTRD 135
CP F I DL + + R E ++ H L + +NK++ + Q +
Sbjct: 162 CPSAFPQIERDLEIFPKINLNRLSKEAVDRFGTHHSLCHYTVKDNKIHRKCHGQHTGFK- 220
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F + + RK R+PD+E + D P+ + + GP P+ + G T D
Sbjct: 221 MFMDATLHSITRKV--RIPDIEFFVNLGDWPLEKRQ-----VKDGPLPILSWCGSEETRD 273
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
IV P + + + SL +GN G W+++ A W+G ++RR+ L
Sbjct: 274 IVMPTYDL-TESTLETMGRVSLDMLSVQGNTGPRWVNKTEQALWRGR----DSRRERL-- 326
Query: 256 NLSD 259
NL D
Sbjct: 327 NLVD 330
>gi|145251704|ref|XP_001397365.1| endoplasmic reticulum-resident kdel protein [Aspergillus niger CBS
513.88]
gi|134082901|emb|CAK42731.1| unnamed protein product [Aspergillus niger]
Length = 456
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 97/232 (41%), Gaps = 52/232 (22%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTST-------------CPDYFRWIHED----LSPWKV- 96
CP P +++S S PPT T C F + ED S W
Sbjct: 68 CPAQLP----FHDTVSSSSAPPTRTFFDPLEYSYNETQCTTLFPGLFEDPIRAQSFWTAR 123
Query: 97 TGITR---DMLERANQTAHFRLILVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPG 151
GI R D ++ A+ A R+ + ++Y+ + K R + GIL + +
Sbjct: 124 HGIKRIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRRKIL---GILASIHRALV 178
Query: 152 RLPDL------ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFW 204
P L E++F +D+ D +GP++ PL+ + + + PD+ FW
Sbjct: 179 SAPQLAVILNTEIIFSVEDK----LEDVAGPDH----PLWVLARKATEESVWLMPDFGFW 230
Query: 205 GWAEIN--IKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
W I+ I P++ +++ ++E + W ++E W+G F + RR LL
Sbjct: 231 SWGHIDTQIGPYDEVVKHVEEHEHP--WDEKENKLVWRGKLSFAPKLRRALL 280
>gi|91082811|ref|XP_968605.1| PREDICTED: similar to endoplasmic reticulum-resident kdel protein
[Tribolium castaneum]
Length = 362
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 90/193 (46%), Gaps = 29/193 (15%)
Query: 66 QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIH 125
+ + +YS + C Y I +DL +K GIT ++++ +T + ++++K+Y
Sbjct: 18 KRAKENYSPCDNTKCGCYSSQISDDLKIFK-KGITPQLIDKV-KTKGTKYQIIDHKLY-- 73
Query: 126 KYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
RD ++ GI L K +LPD+EL+ + D P I +DY
Sbjct: 74 --------RDKNCMFPARCAGIEHFLLKLLPKLPDMELIINTRDWPQIH-KDYGVFG--- 121
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S DI++P W+FW G I++ P W++ L + N W ++
Sbjct: 122 --PVFSFSKTSDYSDIMYPAWAFWEGGPAISLYPRGIGRWDTHRDLLGKKGNETLWDEKI 179
Query: 235 PYAYWKGNPFVAE 247
P +++G+ AE
Sbjct: 180 PKGFFRGSRTSAE 192
>gi|317147538|ref|XP_001822208.2| hypothetical protein AOR_1_1442014 [Aspergillus oryzae RIB40]
gi|391873092|gb|EIT82167.1| hypothetical protein Ao3042_00673 [Aspergillus oryzae 3.042]
Length = 438
Score = 41.2 bits (95), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 17/129 (13%)
Query: 109 QTAHFRLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDD 164
+ R I+ + ++YI + + TR T+ + + L +P R LP++E
Sbjct: 116 ENGMVRGIIDHGELYIVDFGNMPATFTRGKATLNSLHRALASFPDRDRLPNVEF------ 169
Query: 165 RPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKE 223
V+ + DYS G P++ YS ++ + PD+ +W W E+ + P++ R +
Sbjct: 170 --VLTTEDYS----SGEGPIWSYSKREENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAA 223
Query: 224 GNNGRNWID 232
++G +D
Sbjct: 224 IDDGEVTVD 232
>gi|25009725|gb|AAN71037.1| AT07872p [Drosophila melanogaster]
Length = 397
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 12/125 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C + I DL P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL + DLE + + D P + SG + P+F YS +DI++P
Sbjct: 120 E---DLLLDMASGVADLEFVLNVRDWPQVHF--LSGLSG----PVFSYSITNRHLDIMYP 170
Query: 200 DWSFW 204
WSFW
Sbjct: 171 AWSFW 175
>gi|24649142|ref|NP_732797.1| CG31139, isoform A [Drosophila melanogaster]
gi|23171999|gb|AAF56061.2| CG31139, isoform A [Drosophila melanogaster]
gi|262051015|gb|ACY07068.1| FI12101p [Drosophila melanogaster]
Length = 397
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 12/125 (9%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
C + I DL P+ GIT +M+ ++ + F I+ I++ ++ + + +
Sbjct: 63 CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119
Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
LL + DLE + + D P + SG + P+F YS +DI++P
Sbjct: 120 E---DLLLDMASGVADLEFVLNVRDWPQVHF--LSGLSG----PVFSYSITNRHLDIMYP 170
Query: 200 DWSFW 204
WSFW
Sbjct: 171 AWSFW 175
>gi|119190301|ref|XP_001245757.1| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
gi|392868638|gb|EAS34429.2| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
Length = 463
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 54/264 (20%), Positives = 101/264 (38%), Gaps = 35/264 (13%)
Query: 15 FSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKT------------ 62
F +++ N ++ P + +T NC PT +
Sbjct: 35 FGMNSDGDNVHPLLTQLIPAGHCTCKASTTFNCTSCVDSITPTTFETASPSLSPSSPAWT 94
Query: 63 ---SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLIL 117
+ ++ S S+ P F +H + W GI ++ L+ + R ++
Sbjct: 95 FVYGRDDRNVGLTSSQCQSSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMI 154
Query: 118 VNNKVYIHKYKQSIQTRDVFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
N +Y+ + + + Q I G+L + R +PD+E +F +DR D SG
Sbjct: 155 FNGNLYVIETRAA-QEDHRQKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGT 209
Query: 177 NNKGPPPLFRYSGDRWTMDIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNW 230
PL+ G + + + V PD+ FW W NI P++ ++ ++ + + W
Sbjct: 210 GQ----PLWVL-GRKASEESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPW 264
Query: 231 IDREPYAYWKGN-PFVAETRRDLL 253
+ W+G F + RR LL
Sbjct: 265 SSKTQKLLWRGKLSFSPKLRRGLL 288
>gi|195143521|ref|XP_002012746.1| GL23775 [Drosophila persimilis]
gi|194101689|gb|EDW23732.1| GL23775 [Drosophila persimilis]
Length = 419
Score = 40.8 bits (94), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 44/190 (23%), Positives = 76/190 (40%), Gaps = 23/190 (12%)
Query: 73 SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
S P C + I DL P+ GITR M+ ++ RL +V + Y+Q +
Sbjct: 56 SQPSDPDCECHALTIRRDLGPYAEAGITRSMMAQSR-----RLGVVYQVIDGRIYRQP-E 109
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIR-SRDYSGPNNKGPPPLFRYSGDR 191
+ +L G LP++E + + D P + ++G P+F +S
Sbjct: 110 VPHPKRCADVEDMLLGIAGELPNVEFILNVRDWPQVPFLSGFTG-------PVFSHSVSH 162
Query: 192 WTMDIVFPDWSFWGWAE-------INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
+DI+ P WSF + I W + R + +W ++P +++G
Sbjct: 163 QHLDIMCPAWSFSSVSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGT-- 220
Query: 245 VAETRRDLLT 254
+ T RD L
Sbjct: 221 RSSTERDTLV 230
>gi|18606235|gb|AAH23141.1| Kdelc1 protein [Mus musculus]
gi|148664477|gb|EDK96893.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_c [Mus musculus]
Length = 282
Score = 40.8 bits (94), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 56/130 (43%), Gaps = 16/130 (12%)
Query: 80 CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
C + I +DL+ P K+ + H+ L +NKVYI + + +
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210
Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
R +F +L L RK R+PD+E + D P+ + + +N P+F + G +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262
Query: 194 MDIVFPDWSF 203
DIV P +
Sbjct: 263 RDIVMPTYDL 272
>gi|348506788|ref|XP_003440939.1| PREDICTED: protein O-glucosyltransferase 1-like [Oreochromis
niloticus]
Length = 366
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 41/184 (22%), Positives = 80/184 (43%), Gaps = 20/184 (10%)
Query: 67 ESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK 126
+++ Y+ C + + DL +K GI++D++ Q R + + ++ HK
Sbjct: 15 DAVKGYTPCEPVNCSCHLSVLQHDLQTFK-GGISQDVMAATIQ----RGVGTHYQIIGHK 69
Query: 127 YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLF 185
+ G+ + + +LPDLE++ + D P + P+ P P+F
Sbjct: 70 LYREQNCMFPARCSGVEHFILEVIDKLPDLEMVVNVRDYPQV-------PHWVQPTLPVF 122
Query: 186 RYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAY 238
+S DI++P W+FW G A I P W+ + +LK+ W +E +
Sbjct: 123 SFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRDDLKKSAAQWPWKKKESKGF 182
Query: 239 WKGN 242
++G+
Sbjct: 183 FRGS 186
>gi|148922833|ref|NP_001092215.1| KDEL motif-containing protein 2 [Danio rerio]
gi|148744719|gb|AAI42840.1| Zgc:165521 protein [Danio rerio]
Length = 518
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 75/166 (45%), Gaps = 11/166 (6%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLER----ANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
CP I +D S + + + + E A + ++NN+V+ + +
Sbjct: 165 CPQEDPQIQKDFSSFPSIDLQQLLQEVPTRFAKRGGLIHYTILNNQVHRRSLGRYTDFK- 223
Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
+F+ +L L RK +LPD+E + D P + +R + +N GP P+ + G T D
Sbjct: 224 MFSDEILLSLARKV--KLPDVEFYINVGDWP-MENRKVN--DNPGPVPVISWCGSTETRD 278
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
I+ P + + ++ + L + +GN G W D+ A+++G
Sbjct: 279 IILPTYDITHSSLEAMRGVTNDLLSV-QGNTGPTWSDKMNKAFFRG 323
>gi|321474111|gb|EFX85077.1| hypothetical protein DAPPUDRAFT_314403 [Daphnia pulex]
Length = 502
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 87/212 (41%), Gaps = 31/212 (14%)
Query: 48 VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA 107
V ++ CP + E I+DY C I DL+ ++ + + E
Sbjct: 129 VYAESCDCPLD-----SIDEMINDYQ------CSVNVAQIDHDLNQFQNVDFNQVLTEAM 177
Query: 108 NQTAHF------RLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
+ +H +++NKVY Y Q + ++F +L L RK LPD+E + +
Sbjct: 178 KRFSHAGSYSFCHYAVLDNKVYRRCYGQHVGF-NMFMDNILLSLSRK--AVLPDMEFLIN 234
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
D P+++ N P+F + G T DIV P + A + +L
Sbjct: 235 LGDWPLVKK------NILPIIPIFSWCGSTQTADIVMPTYDI-TEASLECMGRVTLDMLS 287
Query: 222 KEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ N W +++ A+W+G ++RR+ L
Sbjct: 288 VQSNPDTKWENKQEKAFWRGR----DSRRERL 315
>gi|442620545|ref|NP_001262849.1| rumi, isoform B [Drosophila melanogaster]
gi|440217767|gb|AGB96229.1| rumi, isoform B [Drosophila melanogaster]
Length = 316
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/128 (21%), Positives = 54/128 (42%), Gaps = 10/128 (7%)
Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
K+Y H+ + GI L LPD++L+ + D P + + N
Sbjct: 13 KIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDLIINTRDYPQLNA----AWGNAA 68
Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
P+F +S + DI++P W+FW G + P W+ + +L++ W +
Sbjct: 69 GGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKR 128
Query: 235 PYAYWKGN 242
+++G+
Sbjct: 129 SLGFFRGS 136
>gi|303315019|ref|XP_003067517.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
delta SOWgp]
gi|240107187|gb|EER25372.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
delta SOWgp]
Length = 463
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 20/186 (10%)
Query: 78 STCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYIHKYKQSIQTRD 135
S+ P F +H + W GI ++ L+ + R ++ N +Y+ + + + Q
Sbjct: 113 SSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYVIETRAA-QEDH 171
Query: 136 VFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
I G+L + R +PD+E +F +DR D SG PL+ G + +
Sbjct: 172 RRKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGTGQ----PLWVL-GRKASE 222
Query: 195 DIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNWIDREPYAYWKGN-PFVAE 247
+ V PD+ FW W NI P++ ++ ++ + + W + W+G F +
Sbjct: 223 ESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLSFSPK 282
Query: 248 TRRDLL 253
RR LL
Sbjct: 283 LRRGLL 288
>gi|320035743|gb|EFW17684.1| hypothetical protein CPSG_06127 [Coccidioides posadasii str.
Silveira]
Length = 463
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 20/186 (10%)
Query: 78 STCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYIHKYKQSIQTRD 135
S+ P F +H + W GI ++ L+ + R ++ N +Y+ + + + Q
Sbjct: 113 SSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYVIETRAA-QEDH 171
Query: 136 VFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
I G+L + R +PD+E +F +DR D SG PL+ G + +
Sbjct: 172 RRKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGTGQ----PLWVL-GRKASE 222
Query: 195 DIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNWIDREPYAYWKGN-PFVAE 247
+ V PD+ FW W NI P++ ++ ++ + + W + W+G F +
Sbjct: 223 ESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLSFSPK 282
Query: 248 TRRDLL 253
RR LL
Sbjct: 283 LRRGLL 288
>gi|291383938|ref|XP_002708527.1| PREDICTED: KDEL (Lys-Asp-Glu-Leu) containing 2 [Oryctolagus
cuniculus]
Length = 509
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 77/171 (45%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + ++ Y++S+
Sbjct: 155 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSLGKYT 214
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ R P GP P+ + G +
Sbjct: 215 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRRVNETP---GPIPIISWCGSQD 269
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 270 SRDIILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 317
>gi|345493843|ref|XP_003427163.1| PREDICTED: KDEL motif-containing protein 1-like [Nasonia
vitripennis]
Length = 507
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 74/165 (44%), Gaps = 24/165 (14%)
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
D + N + ++ +N++Y + Y Q + + +F+ +L L RK LPD+E +
Sbjct: 188 DRYNKPNSVSICHYVIKSNQIYRNCYGQHVGFK-IFSDAILLSLARKI--NLPDVEFFMN 244
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
D P++ P NK P+F + G + DIV P + +I E++ R +
Sbjct: 245 LGDWPLV-------PKNKEIHPIFSWCGSDDSYDIVLPTY------DITQSSMEAMGRVM 291
Query: 222 -----KEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
+G+ W + +W+G E R DL+ +++ KH
Sbjct: 292 LDMLSVQGSTTDPWNKKIEKMFWRGRDARRE-RLDLI--DIARKH 333
>gi|449283857|gb|EMC90451.1| KTEL motif-containing protein 1, partial [Columba livia]
Length = 364
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 85/195 (43%), Gaps = 32/195 (16%)
Query: 62 TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILV 118
T Q ++++ Y C + +DL+P++ GI+++++ H+++ V
Sbjct: 8 TDQIKKAVEVYKPCVKENCSCHQSVWKQDLAPFR-GGISKEIISDVVSRKLGTHYQI--V 64
Query: 119 NNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
NK+Y + +D G+ L + RLPD+E++ + D P +
Sbjct: 65 KNKLY--------REQDCLFPARCSGVEHFLLEIISRLPDMEMVINVRDYPQV------- 109
Query: 176 PNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNG 227
P P P+F +S DI++P W+FW G A I P W+ + +L+
Sbjct: 110 PKWMKPVIPIFSFSKTPEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEK 169
Query: 228 RNWIDREPYAYWKGN 242
W+ + Y++G+
Sbjct: 170 WPWMKKISKGYFRGS 184
>gi|348553234|ref|XP_003462432.1| PREDICTED: KDEL motif-containing protein 2-like [Cavia porcellus]
Length = 692
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 76/169 (44%), Gaps = 14/169 (8%)
Query: 79 TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRDVF 137
+CP + I ED + + + + + E + R +V+ + H Y++S+ F
Sbjct: 341 SCPTWEPQIEEDFASFPTINLQQMLSEVPKRFGDERGAIVHYTILNNHIYRRSLGKYTDF 400
Query: 138 TIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
++ +L L RK RLPDLE + D P+ + P+ P P+ + G +
Sbjct: 401 KMFSDEILLSLARKV--RLPDLEFYINLGDWPLEHRKINETPS---PVPVISWCGSLDSR 455
Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
D++ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 456 DVILPTYDV---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 501
>gi|345561463|gb|EGX44552.1| hypothetical protein AOL_s00188g220 [Arthrobotrys oligospora ATCC
24927]
Length = 437
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/157 (21%), Positives = 63/157 (40%), Gaps = 11/157 (7%)
Query: 99 ITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
IT D LE +A +FR I+ N ++Y+ K +R T+ + + + P LP++
Sbjct: 115 ITLDELESVTHSAFGYFRGIIYNQELYVIKMLDPNFSRGFATLMAMHRAIVTSPEPLPNI 174
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
E + + DY + + T++ + PD+ +W W E + +
Sbjct: 175 EFTLN--------TADYIDFEQSAATWTYARRSNE-TVNWLMPDFGYWSWPEPKVGSYNE 225
Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ + + + W + P W+G R LL
Sbjct: 226 VRLKARLADEAIPWEKKIPKIVWRGATLKLPVRLALL 262
>gi|260810659|ref|XP_002600074.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
gi|229285359|gb|EEN56086.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
Length = 351
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 33/188 (17%)
Query: 76 PTSTCPDYFRWIHEDLSPWK-VTGITRDMLERA---NQTAHFRLILVNNKVYIHKYKQSI 131
P C + I DL WK GIT++ +R ++ H+++I ++K+Y
Sbjct: 10 PDKKCSCHSSVITHDLKLWKDRGGITKEEFDRTLEQDKGTHYQII--DHKLYR------- 60
Query: 132 QTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS- 188
Q R +F G+ + + LPD+E++ + D P S + P P+ +S
Sbjct: 61 QDRCMFPARCSGVEHFILEIIDDLPDMEMIINVRDWP------QSPMYQREPRPVMSFSK 114
Query: 189 --GDRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNWIDREPYAYW 239
+ W DI++P W+FW + W+ + + + N W +E A++
Sbjct: 115 TFNEHW--DIMYPAWTFWEGGPAVWPLFPTGLGRWDLMRDSITKANEDWPWEKKEDKAFF 172
Query: 240 KGNPFVAE 247
+G+ AE
Sbjct: 173 RGSRTSAE 180
>gi|260810693|ref|XP_002600088.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
gi|229285373|gb|EEN56100.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
Length = 351
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 33/188 (17%)
Query: 76 PTSTCPDYFRWIHEDLSPWK-VTGITRDMLERA---NQTAHFRLILVNNKVYIHKYKQSI 131
P C + I DL WK GIT++ +R ++ H+++I ++K+Y
Sbjct: 10 PDKKCSCHSSVITHDLKLWKDRGGITKEEFDRTLEQDKGTHYQII--DHKLYR------- 60
Query: 132 QTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS- 188
Q R +F G+ + + LPD+E++ + D P S + P P+ +S
Sbjct: 61 QDRCMFPARCSGVEHFILEIIDDLPDMEMIINVRDWP------QSPMYQREPRPVMSFSK 114
Query: 189 --GDRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNWIDREPYAYW 239
+ W DI++P W+FW + W+ + + + N W +E A++
Sbjct: 115 TFNEHW--DIMYPAWTFWEGGPAVWPLFPTGLGRWDLMRDSITKANEDWPWEKKEDKAFF 172
Query: 240 KGNPFVAE 247
+G+ AE
Sbjct: 173 RGSRTSAE 180
>gi|322691155|ref|YP_004220725.1| hypothetical protein BLLJ_0966 [Bifidobacterium longum subsp.
longum JCM 1217]
gi|320456011|dbj|BAJ66633.1| hypothetical protein BLLJ_0966 [Bifidobacterium longum subsp.
longum JCM 1217]
Length = 1012
Score = 39.3 bits (90), Expect = 1.9, Method: Composition-based stats.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 22/127 (17%)
Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
+PGR+ D+ ++ D RP +RS FR G D++ DW G A
Sbjct: 267 FPGRVTDVRIVASRDGRPAVRS--------------FRIDGSNADWDMISRDW-LKGAAP 311
Query: 209 INIKPW---ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
W E++L G G W+ + A W + ET R L++ L+ +HD
Sbjct: 312 GAFGDWMNTENILVTADPG-GGHGWVSVD--ADWDSYEWDVETMRPLMSDTLTRRHDDPV 368
Query: 266 RLYVQVI 272
R Y+++I
Sbjct: 369 R-YMRLI 374
>gi|196003266|ref|XP_002111500.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
gi|190585399|gb|EDV25467.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
Length = 506
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 13/140 (9%)
Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
D + + H+ +I +NKVY + Y + + + +F +L L RK +LPDLE + +
Sbjct: 187 DRFGKHHAICHYSVI--SNKVYRNCYGEHVGFK-MFMDAILLSLARKV--KLPDLEFISN 241
Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
D P+ ++ P+F + G T DIV P + + + + S+
Sbjct: 242 LGDWPLENQQN-------NLIPIFSWCGSETTSDIVMPTYDL-TQSTLEMMGRVSVDVLA 293
Query: 222 KEGNNGRNWIDREPYAYWKG 241
+G+ G W D+ +W+G
Sbjct: 294 VQGSTGPKWKDKIEKGFWRG 313
>gi|83770071|dbj|BAE60206.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 460
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 50/105 (47%), Gaps = 15/105 (14%)
Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
TR T+ + + L +P R LP++E V+ + DYS G P++ YS
Sbjct: 164 TRGKATLNSLHRALASFPDRDRLPNVEF--------VLTTEDYS----SGEGPIWSYSKR 211
Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
++ + PD+ +W W E+ + P++ R + ++G +D +
Sbjct: 212 EENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAAIDDGEVTVDGQ 256
>gi|60302736|ref|NP_001012560.1| KTEL motif-containing protein 1 precursor [Gallus gallus]
gi|60098623|emb|CAH65142.1| hypothetical protein RCJMB04_4b16 [Gallus gallus]
Length = 392
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 87/198 (43%), Gaps = 28/198 (14%)
Query: 62 TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR----DMLERANQTAHFRLIL 117
T Q ++ Y C + +DL+P++ +GI++ D++ R T H+++I
Sbjct: 36 TDQINRAVEVYKPCVKENCSCHQSVWKQDLAPFR-SGISKETISDVVSRKLGT-HYQII- 92
Query: 118 VNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
NK+Y ++ R G+ +R RLPD+E++ + D P + P
Sbjct: 93 -KNKLY-REHDCMFPAR----CSGVEHFIRGIINRLPDMEMVINVRDYPQV-------PK 139
Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
P P+F +S DI++P W+FW G A I P W+ + +L+
Sbjct: 140 WMKPIIPVFSFSKTAEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEKWP 199
Query: 230 WIDREPYAYWKGNPFVAE 247
W + Y++G+ +E
Sbjct: 200 WKKKISKGYFRGSRTSSE 217
>gi|198450765|ref|XP_002137151.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
gi|198131182|gb|EDY67709.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
Length = 420
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 75/190 (39%), Gaps = 23/190 (12%)
Query: 73 SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
S P C + I D P+ GITR M+ ++ RL +V + Y+Q +
Sbjct: 56 SQPSDPDCECHALTIRRDFGPYAEAGITRSMMAQSR-----RLGVVYKVIDGRIYRQP-E 109
Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIR-SRDYSGPNNKGPPPLFRYSGDR 191
+ +L G LP++E + + D P + ++GP +F +S
Sbjct: 110 VPHPKRCADVEDMLLGIAGELPNVEFILNVRDWPQVPFLSGFTGP-------VFSHSVSH 162
Query: 192 WTMDIVFPDWSFWGWAE-------INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
+DI+ P WSF + I W + R + +W ++P +++G
Sbjct: 163 QHLDIMCPAWSFSSVSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGT-- 220
Query: 245 VAETRRDLLT 254
+ T RD L
Sbjct: 221 RSSTERDTLV 230
>gi|443897893|dbj|GAC75232.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
Length = 479
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 36/160 (22%), Positives = 65/160 (40%), Gaps = 24/160 (15%)
Query: 98 GITRDMLERANQTAH---FRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL-RKYPGRL 153
GI D ++ A + LI+ + ++++ + Q+R + +QLL Y G
Sbjct: 99 GIKYDNVQNAAENCRHGCVHLIIKHGQIFVRAQVKDWQSR----VRSTMQLLTAAYQGAS 154
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD---------IVFPDWSFW 204
D + D + VI + D+ G + G W +D +FPD+SF
Sbjct: 155 EDEKARIDGTEL-VISTADFDGFTDSAS------RGAGWVLDKRVNDTEGQYLFPDFSFA 207
Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
W E I + + ++ N W D+ A+W+G+
Sbjct: 208 SWPEAGIASYPEFRHDAEQVNAETPWHDKLNPAFWRGDAL 247
>gi|115399170|ref|XP_001215174.1| predicted protein [Aspergillus terreus NIH2624]
gi|114192057|gb|EAU33757.1| predicted protein [Aspergillus terreus NIH2624]
Length = 407
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 80/173 (46%), Gaps = 30/173 (17%)
Query: 81 PDYFRWIHEDLSPWK-VTGITRDMLERAN-QTAHFRLILVNNKVYIHKYK---QSIQTRD 135
P F+ I + WK GI++D L+ ++ R I+ N ++YI K + +T+
Sbjct: 75 PGLFQDISRAVQYWKPRGGISKDDLDAIPFESGMARAIISNGELYIVATKAKGKDHRTKI 134
Query: 136 VFTIWGILQLLRKYPGRL--PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
+ T+ I + L P RL P +E +F +DR + D +G P++ S + +
Sbjct: 135 LATLGSIYRALSSSPTRLSLPTIEFVFSIEDR--VDDIDATGH------PVWVVS-RKVS 185
Query: 194 MDIVF--PDWSFWGWAEINIKPWES------------LLRELKEGNNGRNWID 232
+ VF PD+ +W WA+ +I P ++ L R L + G+ W D
Sbjct: 186 EESVFLMPDFGYWSWAKSHIGPAKARVERGKLSFAPKLRRALLDVARGKPWGD 238
>gi|119499505|ref|XP_001266510.1| hypothetical protein NFIA_041910 [Neosartorya fischeri NRRL 181]
gi|119414674|gb|EAW24613.1| predicted protein [Neosartorya fischeri NRRL 181]
Length = 423
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 9/95 (9%)
Query: 150 PGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
P LPD + DRP+ +S Y+ P + P +GD + P ++FW W
Sbjct: 126 PEPLPDTVFALNIQDRPMAQSWSYARPAD----PTATKAGD-----FLMPHFAFWAWPLK 176
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
I + L + E + + + P A W+G P+
Sbjct: 177 YIGSMDRALTAITEIETDQTFQAKIPQAVWRGTPW 211
>gi|407921029|gb|EKG14198.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 441
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/167 (20%), Positives = 73/167 (43%), Gaps = 17/167 (10%)
Query: 89 EDLS-PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
EDL W+ GITR M+ +L ++ K H+ + + R + + + + +
Sbjct: 109 EDLDMSWRKEGITRCMI------YDHQLYVLETKGTTHR--RDYRERTLAVLHNMHRAIT 160
Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIVFPDWSFWGW 206
Y G LP++E F DD Y N P ++ ++ + W + PD+ +W W
Sbjct: 161 AYNGPLPNIEFTFSVDDWV------YDEENLNTDPVVWGFTRQQSWENVWLMPDFGYWAW 214
Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ ++ + ++ + + +++P W+G + E R+ L+
Sbjct: 215 PTEPVGAYQDVRNQMGVREKAQAFSEKKPKVVWRGAA-LTEQRQALI 260
>gi|47078285|ref|NP_997610.1| KDEL (Lys-Asp-Glu-Leu) containing 2 protein precursor [Mus
musculus]
gi|148693845|gb|EDL25792.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_b [Mus musculus]
gi|157169796|gb|AAI52824.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [synthetic construct]
Length = 503
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 80/193 (41%), Gaps = 25/193 (12%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
CP + P+ Q T +CP I +D + + + + E + R
Sbjct: 138 CPEDDPQVWQE-----------TLSCPASEPQIEQDFVSFPSINLQQMLKEVPTRFGDER 186
Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+V+ + H Y++S+ F ++ +L L RK LPDLE + D P+
Sbjct: 187 GAVVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 244
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
+ P GP P+ + G + DI+ P + ++ + +L +GN G
Sbjct: 245 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 298
Query: 229 NWIDREPYAYWKG 241
+WI++ A+++G
Sbjct: 299 SWINKTEKAFFRG 311
>gi|33327382|gb|AAQ09021.1| unknown protein [Homo sapiens]
Length = 507
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 76/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP+ I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPNKETQIAKDFASFPSINLQQMLKEVPKSFGYERGAIVHYTILNNHVYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 315
>gi|340369006|ref|XP_003383040.1| PREDICTED: protein O-glucosyltransferase 1-like [Amphimedon
queenslandica]
Length = 325
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 59/130 (45%), Gaps = 17/130 (13%)
Query: 87 IHEDLSPWKVTGITR--DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQ 144
+ DL W+ G + + + ++ H+++ VN K+Y K F G+
Sbjct: 4 LESDLGVWRERGGIKREEFIHAKSKGVHYQI--VNGKLYREK-----DCLFSFRCKGVEH 56
Query: 145 LLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
+ LP++EL+ + D P +S Y P P+F +S DI++P W+FW
Sbjct: 57 FILNIIEDLPNMELIINVFDYP--KSHKYHSP-----LPVFSFSKTVHYWDIMYPAWTFW 109
Query: 205 -GWAEINIKP 213
G ++++P
Sbjct: 110 SGGPAVSVEP 119
>gi|255938965|ref|XP_002560252.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584874|emb|CAP82911.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 458
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
I+ PD+ +W W + NI P+ +++ + + + D+E W+G F + RR LL
Sbjct: 223 ILMPDFGYWSWDKSNIGPYGQVVQSIMAAESNLKFADKEQKLVWRGKLSFAPKLRRALL 281
>gi|26344660|dbj|BAC35979.1| unnamed protein product [Mus musculus]
Length = 465
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 80/193 (41%), Gaps = 25/193 (12%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
CP + P+ Q T +CP I +D + + + + E + R
Sbjct: 100 CPEDDPQVWQE-----------TLSCPASEPQIEQDFVSFPSINLQQMLKEVPTRFGDER 148
Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+V+ + H Y++S+ F ++ +L L RK LPDLE + D P+
Sbjct: 149 GAVVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 206
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
+ P GP P+ + G + DI+ P + ++ + +L +GN G
Sbjct: 207 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 260
Query: 229 NWIDREPYAYWKG 241
+WI++ A+++G
Sbjct: 261 SWINKTEKAFFRG 273
>gi|313232413|emb|CBY24081.1| unnamed protein product [Oikopleura dioica]
Length = 536
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 84/208 (40%), Gaps = 42/208 (20%)
Query: 80 CPDYFRWIHEDLSPWKVTGITRDMLE----------RANQTAHFRLILVNNKVYIHKYKQ 129
CP F + EDL+ + + R + TAH +++ + K+++ +
Sbjct: 172 CPANFEQLDEDLAQFDEIDLDRLYEDGKKKKWAAGGHGEATAH--VVIKDQKLFVEDFGT 229
Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
+ R F L LLRK +LPD E +F+ D P+ N P P+ + G
Sbjct: 230 IMGFRG-FMSSMFLSLLRKV--KLPDAEFIFNLGDWPL-------EENLTDPQPILTWCG 279
Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELK-----EGNNGR--NWIDREPYAYWKG- 241
T DI P W + +L RE K E +G +W ++ Y++G
Sbjct: 280 SSNTSDIAVPTW------DQTKNTRHALFRERKDIQYVEQISGEVVSWNEKIERGYFRGR 333
Query: 242 --NPFVAETRRDLLTCNLSDKHDWNARL 267
NP +R L +++ D +ARL
Sbjct: 334 DSNP----SRLKLCELSMAHPEDIDARL 357
>gi|444723570|gb|ELW64221.1| KDEL motif-containing protein 2 [Tupaia chinensis]
Length = 470
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 40/171 (23%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ V H Y++S+
Sbjct: 116 TLSCPIKEPQIAKDFASFPSINLQQMLNEVPERFGDERGAIVHYTVLNNHIYRRSLGKYT 175
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 176 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GPIPIISWCGSLD 230
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 231 SRDIILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 278
>gi|72679937|gb|AAI00618.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
Length = 508
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 80/193 (41%), Gaps = 25/193 (12%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
CP + P+ Q T +CP I +D + + + + E + R
Sbjct: 143 CPEDDPQAWQK-----------TLSCPANEPQIEQDFISFPSINLQQMLKEVPKRFGDER 191
Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+V+ + H Y++S+ F ++ +L L RK LPDLE + D P+
Sbjct: 192 GAIVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 249
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
+ P GP P+ + G + DI+ P + ++ + +L +GN G
Sbjct: 250 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 303
Query: 229 NWIDREPYAYWKG 241
+WI++ A+++G
Sbjct: 304 SWINKTEKAFFRG 316
>gi|255955591|ref|XP_002568548.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590259|emb|CAP96435.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 469
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 113/283 (39%), Gaps = 46/283 (16%)
Query: 1 MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCP---- 56
+I F+FS I+ F +++ I+ P +T C + TCP
Sbjct: 18 VITAFSFSAIIVIWFGMNSDREYIHPILTQLIPAGHCACESSTTFQC--STCLTCPEPGL 75
Query: 57 -TNYPKTSQTQESISDYS------IPPTSTCPDYFRWIHEDL--------SPWKVTGITR 101
TN +T+ + Y+ S C F + ED+ S +++ +
Sbjct: 76 QTNSLQTNSSPAWTFQYTRDGRNETLNKSQCKAAFPGLFEDVARGGKFWSSHGRLSSVEL 135
Query: 102 DMLERANQTAHFRLILVNNKVYI--HKYKQSIQTRDVF-TIWGILQLLRKYPGRL--PDL 156
D + Q R + N ++Y+ + K R + T+ I + L P R PD
Sbjct: 136 DAI--PIQLGMVRAFISNGQLYVVTARSKGEDHRRKILGTLSSIHRALAADPERASRPDF 193
Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR---WTMDIVFPDWSFWGWAEIN--I 211
E +F +D+ D + N++ P +F + W M PD+SFW W N I
Sbjct: 194 EFVFSVEDK----VDDVT--NSEWPVWVFSRTPSEEGVWLM----PDFSFWAWDNKNNYI 243
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLL 253
P++ ++ +K + W ++ P W+G P F + RR L+
Sbjct: 244 GPYDQVVERIKHMDI--PWSEKTPQLVWRGKPSFAPKLRRALM 284
>gi|357608730|gb|EHJ66122.1| KDEL motif-containing protein 1 [Danaus plexippus]
Length = 234
Score = 38.5 bits (88), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 56/124 (45%), Gaps = 12/124 (9%)
Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
G+ L+ LPD++L+ + D P ++D+ P+F +S R DI++P
Sbjct: 33 GVEHYLKILSPTLPDMDLVINTRDWPQF-NKDWGHKK----APVFSFSKTRSYYDIMYPT 87
Query: 201 WSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
WSFW G I + P W+ + W +E A+++G+ +E R L+
Sbjct: 88 WSFWEGGPAIALYPTGIGRWDKHRTSISTAAEKWPWNKKEEKAFFRGSR-TSEERDALIL 146
Query: 255 CNLS 258
+ S
Sbjct: 147 LSRS 150
>gi|219886273|gb|ACL53511.1| unknown [Zea mays]
Length = 527
Score = 38.5 bits (88), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 65 TQESISDYSIPPTSTCPDYFRWIHEDLSPW 94
T+ S S PP CPDYFR+IH DLSPW
Sbjct: 187 TKSSSSKNKQPPPQ-CPDYFRFIHSDLSPW 215
>gi|238495869|ref|XP_002379170.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
gi|220694050|gb|EED50394.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
Length = 302
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
TR T+ + + L +P R LP++E V+ + DYS G P++ YS
Sbjct: 6 TRGKATLNSLHRALASFPDRDRLPNVEF--------VLTTEDYS----SGEGPIWSYSKR 53
Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWID 232
++ + PD+ +W W E+ + P++ R + ++G +D
Sbjct: 54 EENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAAIDDGEVTVD 96
>gi|153218522|ref|NP_001020294.3| KDEL motif-containing protein 2 precursor [Rattus norvegicus]
gi|81888044|sp|Q566E5.1|KDEL2_RAT RecName: Full=KDEL motif-containing protein 2; Flags: Precursor
gi|62471552|gb|AAH93594.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
gi|149041673|gb|EDL95514.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
Length = 508
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 100/250 (40%), Gaps = 43/250 (17%)
Query: 55 CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
CP + P+ Q T +CP I +D + + + + E + R
Sbjct: 143 CPEDDPQAWQK-----------TLSCPANEPQIEQDFISFPSINLQQMLKEVPKRFGDER 191
Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
+V+ + H Y++S+ F ++ +L L RK LPDLE + D P+
Sbjct: 192 GAIVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 249
Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
+ P GP P+ + G + DI+ P + ++ + +L +GN G
Sbjct: 250 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 303
Query: 229 NWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQVISFIYITHLCSKIPGFN 288
+WI++ A+++G ++R + L L K N +L L + I G+
Sbjct: 304 SWINKTEKAFFRGR----DSREERLQLVLLSKE--NPQL------------LDAGITGYF 345
Query: 289 FFLIFFKMLG 298
FF K LG
Sbjct: 346 FFQEKEKELG 355
>gi|396482709|ref|XP_003841528.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
gi|312218103|emb|CBX98049.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
JN3]
Length = 428
Score = 38.5 bits (88), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 48/104 (46%), Gaps = 12/104 (11%)
Query: 153 LPDLELMFDCDDRPVIRSRDYS---GPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
+P++E DD P RS+D+ G K R D W M P++++W W
Sbjct: 152 VPNIEFSLSLDDLPR-RSKDHGVFFGYTRKEG----REYNDIWMM----PNYAYWSWNYT 202
Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
+ W S+ RE+++ W ++ W+G +A+ R +L+
Sbjct: 203 HAPSWNSIRREIEQREAEIPWAMKDSRVVWRGKVKMAKLRSELV 246
>gi|410971943|ref|XP_003992420.1| PREDICTED: KDEL motif-containing protein 2 [Felis catus]
Length = 445
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 76/171 (44%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 91 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 150
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPD+E + D P+ + P GP P+ + G
Sbjct: 151 DFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 205
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ DI+ P + + ++ + +L +GN G +WI++ A+++G
Sbjct: 206 SRDIILPTYDI---SHSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 253
>gi|344243629|gb|EGV99732.1| KDEL motif-containing protein 2 [Cricetulus griseus]
Length = 452
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN-NKVYIHKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ V H +++S+
Sbjct: 98 TLSCPATEPQIEQDFTSFPSIDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSLGKYT 157
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 158 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 212
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 213 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 260
>gi|354481242|ref|XP_003502811.1| PREDICTED: KDEL motif-containing protein 2-like [Cricetulus
griseus]
Length = 472
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 75/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN-NKVYIHKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ V H +++S+
Sbjct: 118 TLSCPATEPQIEQDFTSFPSIDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSLGKYT 177
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 178 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 232
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 233 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 280
>gi|239608365|gb|EEQ85352.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327349565|gb|EGE78422.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 449
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 51/240 (21%), Positives = 97/240 (40%), Gaps = 38/240 (15%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTC----PDYFRWIHEDLS 92
PP R P N CP + P +T+ +Y + + C P + I + +
Sbjct: 47 PPCRPVIPAAGSNNNGGACPGDAPWKFETERDGDNYGLS-RAQCQKAFPKLYVEIEKAVV 105
Query: 93 PWKVTGITRDML-ERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKY 149
+ IT D L + +T+ R ++ +Y+ ++ T R ++ + + +
Sbjct: 106 ARRGRNITFDELNSKPLKTSMVRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAI 165
Query: 150 PG--RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS---GDRWTMDIVFPDWSFW 204
P +P++E +F +D + P P++ YS D W + PD+ +W
Sbjct: 166 PNPNEIPNIEFIFTTEDF------------HDDPHPVWVYSKREDDGWAW--LMPDFGYW 211
Query: 205 GWAEINIKPWESL---LRELKEGN--NGR-----NWIDREPYAYWKGNPFVA-ETRRDLL 253
W EI + S+ + + EG NG+ + D++ W+G A E R+ L+
Sbjct: 212 SWPEIKAGQYRSVRQRIAAIDEGTVINGKAHPPLKFQDKKKQLLWRGAIATAPELRQGLV 271
>gi|431907500|gb|ELK11352.1| KDEL motif-containing protein 2 [Pteropus alecto]
Length = 447
Score = 38.1 bits (87), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 56/120 (46%), Gaps = 9/120 (7%)
Query: 125 HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGP 181
H Y++S+ F ++ +L L RK LPDLE + D P+ + P GP
Sbjct: 143 HIYRRSLGKYTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GP 197
Query: 182 PPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
P+ + G + DI+ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 198 LPIISWCGSLDSQDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 256
>gi|261203203|ref|XP_002628815.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239586600|gb|EEQ69243.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 449
Score = 38.1 bits (87), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 51/240 (21%), Positives = 97/240 (40%), Gaps = 38/240 (15%)
Query: 37 PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTC----PDYFRWIHEDLS 92
PP R P N CP + P +T+ +Y + + C P + I + +
Sbjct: 47 PPCRPVIPAAGSNNNGGACPGDAPWKFETERDGDNYGLS-RAQCQKAFPKLYVEIEKAVV 105
Query: 93 PWKVTGITRDML-ERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKY 149
+ IT D L + +T+ R ++ +Y+ ++ T R ++ + + +
Sbjct: 106 ARRGRNITFDELNSKPLKTSMVRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAI 165
Query: 150 PG--RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS---GDRWTMDIVFPDWSFW 204
P +P++E +F +D + P P++ YS D W + PD+ +W
Sbjct: 166 PNPNEIPNIEFIFTTEDF------------HDDPHPVWVYSKREDDGWAW--LMPDFGYW 211
Query: 205 GWAEINIKPWESL---LRELKEGN--NGR-----NWIDREPYAYWKGNPFVA-ETRRDLL 253
W EI + S+ + + EG NG+ + D++ W+G A E R+ L+
Sbjct: 212 SWPEIKAGQYRSVRQRIAAIDEGTVINGKAHPPLKFQDKKKQLLWRGAIATAPELRQGLV 271
>gi|34531642|dbj|BAC86191.1| unnamed protein product [Homo sapiens]
Length = 409
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 97 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 156
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 157 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 211
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 212 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 259
>gi|73955182|ref|XP_546537.2| PREDICTED: KDEL motif-containing protein 2 [Canis lupus familiaris]
Length = 508
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 154 TLSCPTKEPQIAKDFASFPSINLQQMLDEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 213
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPD+E + D P+ + P GP P+ + G
Sbjct: 214 DFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 268
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 269 SRDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 316
>gi|344287853|ref|XP_003415666.1| PREDICTED: KDEL motif-containing protein 2 [Loxodonta africana]
Length = 507
Score = 37.7 bits (86), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIEKDFLSFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPD+E + D P+ + P GP PL + G
Sbjct: 213 DFKMFSDEILLSLARKV--LLPDVEFYINVGDWPLEHRKVNETP---GPIPLISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
+ D++ P + ++ + +L +GN G +WI++ A+++G
Sbjct: 268 SRDVILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 315
>gi|71002250|ref|XP_755806.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|66853444|gb|EAL93768.1| DUF821 domain protein [Aspergillus fumigatus Af293]
gi|159129863|gb|EDP54977.1| DUF821 domain protein [Aspergillus fumigatus A1163]
Length = 435
Score = 37.7 bits (86), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 23/98 (23%), Positives = 45/98 (45%), Gaps = 15/98 (15%)
Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
+R T+ + + ++ +P R LPD+E + DD + P P++ YS
Sbjct: 135 SRAKATLNSLHRAMQSFPDRHNLPDIEFVLTADDFSNV------------PGPVWSYSKR 182
Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
I + PD+ +W W E+ + P++ + R + + G
Sbjct: 183 DEDESIWLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEG 220
>gi|449485840|ref|XP_002191338.2| PREDICTED: protein O-glucosyltransferase 1 [Taeniopygia guttata]
Length = 514
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 86/196 (43%), Gaps = 34/196 (17%)
Query: 62 TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR----DMLERANQTAHFRLIL 117
T Q + ++ Y C + +DL+P++ GI++ D++ R T H+++I
Sbjct: 158 TGQIKRAVEAYEPCVKENCSCHQSVWKQDLAPFR-GGISKETMSDVVSRKLGT-HYQII- 214
Query: 118 VNNKVYIHK---YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
NK+Y + + + F I GI+ RLPD+E++ + D P +
Sbjct: 215 -KNKLYREQDCMFPARCSGVEHF-ILGIIH-------RLPDMEMVINVRDYPQV------ 259
Query: 175 GPNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNN 226
P P P+F +S DI++P W+FW G A I P W+ + +L+
Sbjct: 260 -PKWMKPIIPVFSFSKTSEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAE 318
Query: 227 GRNWIDREPYAYWKGN 242
W + Y++G+
Sbjct: 319 KWPWKKKISKGYFRGS 334
>gi|432106972|gb|ELK32490.1| KDEL motif-containing protein 2, partial [Myotis davidii]
Length = 442
Score = 37.4 bits (85), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 74/169 (43%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++ +
Sbjct: 88 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGEERGAIVHYTILNNHIYRRPLGKYT 147
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P GP P+ + G
Sbjct: 148 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 202
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ DI+ P + ++ + L + +GN G +WI++ A+++G
Sbjct: 203 SRDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 250
>gi|407921427|gb|EKG14575.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
Length = 490
Score = 37.4 bits (85), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 73/168 (43%), Gaps = 23/168 (13%)
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
WK+ GI R M+ +L ++N + ++ + R + + I + + YPG +
Sbjct: 113 WKIDGIIRAMIYDR------QLFILNAR---GARRRDYRQRTLAVLQSIQRAITAYPGDI 163
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR--WTMDIVFPDWSFWGWAEINI 211
P++E F DD Y+ + L R D W M PD+ F+ W E
Sbjct: 164 PNIEFSFVVDDGAYFAV--YNNETSATTWALTREPQDDNLWLM----PDFGFYSW-EGPA 216
Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVA---ETRRDLLTCN 256
+ +LLR +++ + + ++P A W+G A + R DLL +
Sbjct: 217 GEYNALLRAIEQ--DEMPFEQKDPRAIWRGAKAPAGHVQVRSDLLKVS 262
>gi|34193987|gb|AAH36526.3| KDEL (Lys-Asp-Glu-Leu) containing 2 [Homo sapiens]
Length = 507
Score = 37.4 bits (85), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 39/175 (22%), Positives = 77/175 (44%), Gaps = 10/175 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 212
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
+ D+V P + ++ + L + +GN G +WI++ A+++G + E
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRGRDSLEE 321
>gi|397516340|ref|XP_003828388.1| PREDICTED: KDEL motif-containing protein 2 isoform 2 [Pan paniscus]
Length = 499
Score = 37.4 bits (85), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 77 TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
T +CP I +D + + + + + E + R +V+ + H Y++S+
Sbjct: 145 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 204
Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
F ++ +L L RK LPDLE + D P+ + P+ P P+ + G
Sbjct: 205 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 259
Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
+ D+V P + ++ + L + +GN G +WI++ A+++G
Sbjct: 260 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 307
>gi|319411533|emb|CBQ73577.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 452
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 41/180 (22%), Positives = 70/180 (38%), Gaps = 28/180 (15%)
Query: 92 SPWKVTGITR--DMLERANQTAH--FRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
+ WK G R D+ A H L++ + +++I ++ Q+R + LQLL+
Sbjct: 73 AAWKAKGGVRFEDVQNAAQHCRHGCVHLVIRDGQIFIRAQEKDWQSR----VRSTLQLLQ 128
Query: 148 K-YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD---------IV 197
Y G + + + + VI + D+ G + G W +D +
Sbjct: 129 SAYLGASEEEREVMEGVEL-VISTADFDGFTDAAS------RGAGWVLDKRVNDTQGQYL 181
Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAET---RRDLLT 254
FPD+SF W E I + R N W + A+W+G+ + R LL
Sbjct: 182 FPDFSFASWPEAGIPSYPEFRRAAARVNAAVPWASKANRAFWRGDALAGSSIPARESLLA 241
>gi|327303802|ref|XP_003236593.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
gi|326461935|gb|EGD87388.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
Length = 431
Score = 37.4 bits (85), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 69/166 (41%), Gaps = 18/166 (10%)
Query: 98 GITRDMLERAN-QTAHFRLILVNNKVYI--HKYKQSIQTRDVFTIWGILQ-LLRKYPGR- 152
GI+ + L++ + R I+ ++YI K Q + + + + L G
Sbjct: 95 GISNEHLDKIELKDGMARAIIHRGELYIVATKAAQEDHRKKILAVLSSMHHALSATAGNR 154
Query: 153 -LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGW--AE 208
P +E +F +DR D SGP + P++ S I + PD+ FW W A
Sbjct: 155 TQPSIEFVFSIEDR----VDDISGPGH----PIWALSRKASEESIWLIPDFGFWAWDNAA 206
Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
I P+ ++ ++ W +E W+G F + RR+LL
Sbjct: 207 NAIGPYNQVVDRIQRQEVTVPWSSKENKLVWRGKLSFAPKLRRNLL 252
>gi|452984518|gb|EME84275.1| hypothetical protein MYCFIDRAFT_118382, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 387
Score = 37.4 bits (85), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 37/170 (21%), Positives = 66/170 (38%), Gaps = 18/170 (10%)
Query: 94 WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
W+ GI R M+ NQ ++ + V H ++ R + T+ I + + G+L
Sbjct: 62 WRGDGIVRAMIH-DNQL----YVIDPHAVTDHNHR----PRTLATLHAIHRAVSASSGKL 112
Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
PD+E F D + D G + + W M PD+ WGW ++ ++
Sbjct: 113 PDIEFSFTVHDFAL---HDRYGNHTTWAYTRLPHQEKLWLM----PDFGLWGWPDVGLRS 165
Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDW 263
+ L + ++D+ P W+G+ V S+K W
Sbjct: 166 YAEFQTVLDYEED--EFVDKIPKLVWRGSLAVGSHDVRAGLVKHSEKQPW 213
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.139 0.460
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,261,866,332
Number of Sequences: 23463169
Number of extensions: 232841045
Number of successful extensions: 431041
Number of sequences better than 100.0: 450
Number of HSP's better than 100.0 without gapping: 193
Number of HSP's successfully gapped in prelim test: 257
Number of HSP's that attempted gapping in prelim test: 430396
Number of HSP's gapped (non-prelim): 464
length of query: 302
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 161
effective length of database: 9,050,888,538
effective search space: 1457193054618
effective search space used: 1457193054618
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 76 (33.9 bits)