BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022102
         (302 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359474017|ref|XP_002269577.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
          Length = 585

 Score =  361 bits (926), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 177/271 (65%), Positives = 205/271 (75%), Gaps = 13/271 (4%)

Query: 1   MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNY 59
            I  F FS   +  F V+N  R    I  + +   RP   I  PLNC  +N TQTCP NY
Sbjct: 115 FISFFCFSS--RFIFLVNNSERKAVPISENHRKTPRP---IVVPLNCSARNLTQTCPGNY 169

Query: 60  PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           P T  T     D +  P   CPDYFRWIHEDL PWK TGI+RDM+ERA ++AHFRL++V 
Sbjct: 170 PTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVK 222

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
            KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVIRS D+ GPN+ 
Sbjct: 223 GKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNST 282

Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
            PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN   W++REPYAYW
Sbjct: 283 SPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYW 342

Query: 240 KGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           KGNP VAETRRDLLTCN+SD  DWNARL+VQ
Sbjct: 343 KGNPLVAETRRDLLTCNVSDVQDWNARLFVQ 373


>gi|297742541|emb|CBI34690.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score =  360 bits (923), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 177/271 (65%), Positives = 205/271 (75%), Gaps = 13/271 (4%)

Query: 1   MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNY 59
            I  F FS   +  F V+N  R    I  + +   RP   I  PLNC  +N TQTCP NY
Sbjct: 27  FISFFCFSS--RFIFLVNNSERKAVPISENHRKTPRP---IVVPLNCSARNLTQTCPGNY 81

Query: 60  PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           P T  T     D +  P   CPDYFRWIHEDL PWK TGI+RDM+ERA ++AHFRL++V 
Sbjct: 82  PTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAKRSAHFRLVIVK 134

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
            KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVIRS D+ GPN+ 
Sbjct: 135 GKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVIRSGDHRGPNST 194

Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
            PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN   W++REPYAYW
Sbjct: 195 SPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRTKWMEREPYAYW 254

Query: 240 KGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           KGNP VAETRRDLLTCN+SD  DWNARL+VQ
Sbjct: 255 KGNPLVAETRRDLLTCNVSDVQDWNARLFVQ 285


>gi|147800396|emb|CAN66409.1| hypothetical protein VITISV_020976 [Vitis vinifera]
          Length = 439

 Score =  348 bits (894), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 162/222 (72%), Positives = 185/222 (83%), Gaps = 7/222 (3%)

Query: 49  KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN 108
           KN TQTCP NYP T  T     D +  P   CPDYFRWIHEDL PWK TGI+RDM+ERA 
Sbjct: 23  KNLTQTCPGNYPTTFDT-----DLAWKPV--CPDYFRWIHEDLKPWKTTGISRDMVERAK 75

Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
           ++AHFRL++V  KVYI KYK+SIQTRDVFTIWGILQLLR+YPG+L DLEL FDC+DRPVI
Sbjct: 76  RSAHFRLVIVKGKVYIEKYKKSIQTRDVFTIWGILQLLRRYPGKLLDLELTFDCNDRPVI 135

Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
           RS D+ GPN+  PPPLFRY GDRWT+D+VFPDWSFWGW EIN+KPW +LL++LKEGNN  
Sbjct: 136 RSGDHRGPNSTSPPPLFRYCGDRWTLDVVFPDWSFWGWPEINMKPWGNLLKDLKEGNNRT 195

Query: 229 NWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
            W++REPYAYWKGNP VAETRRDLLTCN+SD  DWNARL+VQ
Sbjct: 196 KWMEREPYAYWKGNPLVAETRRDLLTCNVSDVQDWNARLFVQ 237


>gi|255539447|ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
 gi|223549903|gb|EEF51390.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
          Length = 528

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 160/277 (57%), Positives = 198/277 (71%), Gaps = 12/277 (4%)

Query: 1   MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQN------RPPKRITTPLNCVK-NQTQ 53
           +   F  +  L S+ +    S  K +I     P N          +I  PLNC   N T+
Sbjct: 45  LASAFLTTRFLDSSSAFTGSSAQKPLITTKSAPTNPTLISKNALNKINIPLNCAAFNLTR 104

Query: 54  TCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHF 113
           TCP+NYP T        +   P  S CP+Y+RWI+EDL PW  TGI+RDM+ERA  TA+F
Sbjct: 105 TCPSNYPTTF-----TENPDRPSVSACPEYYRWIYEDLRPWARTGISRDMVERAKTTANF 159

Query: 114 RLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY 173
           RL++VN K Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC D PVI+S +Y
Sbjct: 160 RLVIVNGKAYVEKYRRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIKSSNY 219

Query: 174 SGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDR 233
           SGPN   PPPLFRY GD  T+D+VFPDWSFWGW+EINIKPWE LLRELKEGN  R W++R
Sbjct: 220 SGPNAMAPPPLFRYCGDDDTLDVVFPDWSFWGWSEINIKPWERLLRELKEGNEKRRWMER 279

Query: 234 EPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EPYAYWKGNP VAETR+DL+ CN+S++ DWNAR+Y Q
Sbjct: 280 EPYAYWKGNPAVAETRQDLMKCNVSEQQDWNARVYAQ 316


>gi|297808383|ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317912|gb|EFH48334.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 543

 Score =  333 bits (854), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 155/251 (61%), Positives = 194/251 (77%), Gaps = 11/251 (4%)

Query: 22  RNKTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTN-YPKTSQTQESISDYSIPPTST 79
           R  TVI  S KP+          L+C  N+T  +CP+N YP T+   E   D + PP +T
Sbjct: 89  RPTTVITQSPKPE--------FTLHCSANETTASCPSNKYPTTASFGED-DDTNHPPNAT 139

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CPDYFRWIHEDL PW  TGITR+ LERA +TA+FRL +++ K+Y+ K++ + QTRDVFTI
Sbjct: 140 CPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLAIIDGKIYVEKFQDAFQTRDVFTI 199

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WG LQLLRKYPG++PDLELMFDC D PV+++ +++G N   PPPLFRY G+  T+DIVFP
Sbjct: 200 WGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGANAPSPPPLFRYCGNEETLDIVFP 259

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
           DWSFWGWAE+NIKPWESLL+EL+EGN    WI+REPYAYWKGNP VAETR+DL+ CN+S+
Sbjct: 260 DWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPYAYWKGNPMVAETRQDLMKCNVSE 319

Query: 260 KHDWNARLYVQ 270
           +H+WNARLYVQ
Sbjct: 320 EHEWNARLYVQ 330


>gi|15237842|ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana]
 gi|10176852|dbj|BAB10058.1| unnamed protein product [Arabidopsis thaliana]
 gi|48310551|gb|AAT41837.1| At5g23850 [Arabidopsis thaliana]
 gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis thaliana]
 gi|332005839|gb|AED93222.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 542

 Score =  332 bits (852), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 156/251 (62%), Positives = 193/251 (76%), Gaps = 12/251 (4%)

Query: 22  RNKTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTN-YPKTSQTQESISDYSIPPTST 79
           R  TVI  S KP+          L+C  N+T  +CP+N YP T+  ++   D + PPT+T
Sbjct: 89  RPTTVITQSPKPE--------FTLHCSANETTASCPSNKYPTTTSFEDD--DTNHPPTAT 138

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CPDYFRWIHEDL PW  TGITR+ LERA +TA FRL +V  K+Y+ K++ + QTRDVFTI
Sbjct: 139 CPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFTI 198

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WG LQLLRKYPG++PDLELMFDC D PV+R+ +++G N   PPPLFRY G+  T+DIVFP
Sbjct: 199 WGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNEETLDIVFP 258

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
           DWSFWGWAE+NIKPWESLL+EL+EGN    WI+REPYAYWKGNP VAETR+DL+ CN+S+
Sbjct: 259 DWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQDLMKCNVSE 318

Query: 260 KHDWNARLYVQ 270
           +H+WNARLY Q
Sbjct: 319 EHEWNARLYAQ 329


>gi|332071136|gb|AED99886.1| glycosyltransferase [Panax notoginseng]
          Length = 546

 Score =  327 bits (838), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 151/238 (63%), Positives = 187/238 (78%), Gaps = 4/238 (1%)

Query: 34  QNRPPKRITTPLNC-VKNQTQTCPTNY-PKTSQTQESISDYSIPPTSTCPDYFRWIHEDL 91
           + +PP+++  PLNC   N  +TCP NY P+T   Q+     SIPP S CP+YFRWI+EDL
Sbjct: 100 RKKPPRKVEIPLNCSTGNLIRTCPANYYPRTFNIQDQ-DHSSIPPVS-CPEYFRWIYEDL 157

Query: 92  SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPG 151
            PW+ TGITR+M+ERA +TA+FRL+++N + Y+  +++S Q+RDVFT+WGILQLLR YPG
Sbjct: 158 RPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDVFTLWGILQLLRMYPG 217

Query: 152 RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
           ++PDL+LMFDC D PVI SR Y GPN   PPPLFRY  D  T+DIVFPDW+FWGW EINI
Sbjct: 218 KVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDIVFPDWTFWGWPEINI 277

Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYV 269
           KPW SLL++LKEGN G  W+DREPYAYWKGNP VA+TR DLL CN+SDK DWNAR+Y 
Sbjct: 278 KPWGSLLKDLKEGNTGTQWMDREPYAYWKGNPIVAKTRMDLLKCNVSDKQDWNARVYA 335


>gi|359489773|ref|XP_002273091.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
          Length = 519

 Score =  326 bits (835), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 154/237 (64%), Positives = 186/237 (78%), Gaps = 7/237 (2%)

Query: 36  RPPKRITTPLNC-VKNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
           + P R+  PLNC   N T+TCP NYP K + T    S  ++    TCPDYFRWIH+DL P
Sbjct: 76  KAPIRLEYPLNCSAGNLTKTCPGNYPTKHNPTNPDHSSTNM----TCPDYFRWIHQDLLP 131

Query: 94  WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
           WK TGITRDM++RA +TAHFRL++++ K Y+ K+++SIQTRD+FT+WGILQLLR YPGRL
Sbjct: 132 WKQTGITRDMVDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFTLWGILQLLRWYPGRL 191

Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
           PDLELMFDCDDRPV+R RD+ GP N  PPPLFRY GD W++DIVFPDWSFWGWAE NIKP
Sbjct: 192 PDLELMFDCDDRPVVRMRDFRGP-NAAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKP 250

Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           W ++L+++KEGN    W DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 251 WRNVLKDIKEGNRRTKWKDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 307


>gi|255541542|ref|XP_002511835.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
 gi|223549015|gb|EEF50504.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
          Length = 522

 Score =  323 bits (828), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 148/243 (60%), Positives = 185/243 (76%), Gaps = 8/243 (3%)

Query: 31  RKPQNRPPKRITTPLNCVK--NQTQTCPTNY-PKTSQTQESISDYSIPPTSTCPDYFRWI 87
           +K    P ++   PL C    N TQTCP +Y PKT  +  S    ++     CP YF+WI
Sbjct: 74  QKASPTPQEKAEFPLRCTATPNITQTCPADYYPKTHNSTNSDRQSNV----ICPSYFKWI 129

Query: 88  HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
           HEDL PW+ TGITRDM+ERA +TAHFRL++V+ + Y+ KY+QSIQTRD+ T+WGILQLLR
Sbjct: 130 HEDLRPWRETGITRDMIERARRTAHFRLVIVDGRAYVEKYRQSIQTRDMITLWGILQLLR 189

Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
            YPG++PDLELMFDCDDRPV+RS D+ GP   GPPPLFRY  D  ++DIVFPDWSFWGWA
Sbjct: 190 LYPGKVPDLELMFDCDDRPVVRSEDFPGP-TAGPPPLFRYCADDTSLDIVFPDWSFWGWA 248

Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
           E+NIKPW+S+L+ + +G+  + W DR PYAYWKGNP+V+  R DL+TCN+SDKHDWNARL
Sbjct: 249 EVNIKPWKSMLKGITKGSKRKKWKDRVPYAYWKGNPYVSANRGDLMTCNVSDKHDWNARL 308

Query: 268 YVQ 270
           Y Q
Sbjct: 309 YAQ 311


>gi|255539445|ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
 gi|223549902|gb|EEF51389.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
          Length = 506

 Score =  323 bits (828), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 151/242 (62%), Positives = 183/242 (75%), Gaps = 6/242 (2%)

Query: 30  SRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
           S  P  +P  R+  PLNC   N T+TCPT+YP TS       + S PPT  CP+YFRWIH
Sbjct: 58  STVPLEKPDNRLVIPLNCHALNLTRTCPTDYPSTSSQD---PNRSSPPT--CPEYFRWIH 112

Query: 89  EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
           EDL PW  TGITR+ +ERA  TA+FRL+++N   Y+  Y++S QTRDVFT+WGILQLLRK
Sbjct: 113 EDLRPWVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQTRDVFTLWGILQLLRK 172

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
           YPGR+PDLE+MFDC D PV++S DYSG +   PPPLFRY G+  T+DIVFPDWS+WGW E
Sbjct: 173 YPGRVPDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDETLDIVFPDWSYWGWVE 232

Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
            NIKPWE ++++LKEGN    W +REPYAYWKGNP VAETR DL+ CN+S +HDWNARLY
Sbjct: 233 TNIKPWEKIVKDLKEGNQRSKWKEREPYAYWKGNPNVAETRLDLMKCNVSQEHDWNARLY 292

Query: 269 VQ 270
            Q
Sbjct: 293 TQ 294


>gi|255537419|ref|XP_002509776.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
 gi|223549675|gb|EEF51163.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
          Length = 534

 Score =  321 bits (822), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 163/263 (61%), Positives = 188/263 (71%), Gaps = 24/263 (9%)

Query: 11  LQSNFSVHNISRNKTVIIVSRKPQNRPPKR-ITTPLNCV--KNQTQTCPTNYPKTSQTQE 67
           LQS  SV   S N TV+I  +   N P +  I  PLNC    NQTQTCPTNYPK + ++ 
Sbjct: 73  LQSKVSVER-STNTTVVISEK--HNIPQRETIEFPLNCTINNNQTQTCPTNYPKLTTSRA 129

Query: 68  SISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKY 127
              D   PP + CP+YFRWIHEDL PW   GI+RDM+ERA +TAHF LI+V  K YI KY
Sbjct: 130 DDQD---PPRTVCPNYFRWIHEDLRPWIAAGISRDMVERAQRTAHFHLIIVGGKAYIKKY 186

Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY 187
           ++S QTRD FTIWGILQLLR+YPG++PDLELMFD DDRPVIRS DY   N  GPPPLFRY
Sbjct: 187 RESTQTRDTFTIWGILQLLRRYPGKIPDLELMFDTDDRPVIRSSDYHEQNTTGPPPLFRY 246

Query: 188 SGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
            GDR       PD        INIKPW+ L  ++KEGNNG  WIDREPYAYWKGNPFVAE
Sbjct: 247 CGDR-------PD--------INIKPWDELSIDIKEGNNGSKWIDREPYAYWKGNPFVAE 291

Query: 248 TRRDLLTCNLSDKHDWNARLYVQ 270
           TR+DLL CN+SD+ DWNARL++Q
Sbjct: 292 TRKDLLACNVSDQRDWNARLFIQ 314


>gi|225470185|ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
 gi|302143884|emb|CBI22745.3| unnamed protein product [Vitis vinifera]
          Length = 525

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 148/228 (64%), Positives = 176/228 (77%), Gaps = 5/228 (2%)

Query: 44  PLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
           PLNC   N T+TCP NYP    T  S  D   P    CP YFRWI+ DL PW  +GITR+
Sbjct: 92  PLNCSAGNLTRTCPRNYP----TAFSPEDPDRPSPPECPHYFRWIYGDLRPWMKSGITRE 147

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           M+ERA +TA F+L+++N + Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC
Sbjct: 148 MVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDC 207

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D PVI+S +Y GPN   PPPLFRY GD  T+DIVFPDWSFWGW EINIKPWESLL++LK
Sbjct: 208 VDWPVIQSNEYRGPNATAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWESLLKDLK 267

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EGN    W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+Y Q
Sbjct: 268 EGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARVYTQ 315


>gi|15229061|ref|NP_190467.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6522568|emb|CAB62012.1| putative protein [Arabidopsis thaliana]
 gi|332644958|gb|AEE78479.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 539

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 145/237 (61%), Positives = 182/237 (76%), Gaps = 6/237 (2%)

Query: 38  PKRITTPLNCVK---NQTQTCPT-NYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
           PK  T  LNC     N T TCP  NYP + ++     +    P++TCPDYFRWIHEDL P
Sbjct: 92  PKEFT--LNCAAFSGNDTGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDLRP 149

Query: 94  WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
           W+ TGITR+ LERAN TA FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG++
Sbjct: 150 WEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKI 209

Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
           PDLELMFDC D PV+++ +++G +   PPPLFRY  +  T+DIVFPDWS+WGWAE+NIKP
Sbjct: 210 PDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKP 269

Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           WESLL+EL+EGN    WIDREPYAYWKGNP VAETR DL+ CNLS+ +DW ARLY Q
Sbjct: 270 WESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQ 326


>gi|297816106|ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321774|gb|EFH52195.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 539

 Score =  317 bits (811), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 149/239 (62%), Positives = 186/239 (77%), Gaps = 10/239 (4%)

Query: 38  PKRITTPLNCVK---NQTQTCP-TNYPKT--SQTQESISDYSIPPTSTCPDYFRWIHEDL 91
           PK  T  LNC     N T TCP  NYP +  S   E  SD S+  ++TCPDYFRWIHEDL
Sbjct: 92  PKEFT--LNCAGFAGNDTVTCPKNNYPTSFRSSVGEGESDRSL--SATCPDYFRWIHEDL 147

Query: 92  SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPG 151
            PW+ TGITR+ LERAN TA+FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG
Sbjct: 148 RPWEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPG 207

Query: 152 RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
           ++PDLELMFDC D PV+++ +++G +   PPPLFRY  +  T+DIVFPDWS+WGWAE+NI
Sbjct: 208 KIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNI 267

Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           KPWESLL+EL+EGN    WIDREPYAYWKGNP VAETR DL+ CNLS+++DW ARLY Q
Sbjct: 268 KPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEEYDWKARLYKQ 326


>gi|356529208|ref|XP_003533188.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
          Length = 469

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 152/257 (59%), Positives = 187/257 (72%), Gaps = 6/257 (2%)

Query: 15  FSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYS 73
           FS    S   T +  +R  +    K I  PLNC   N T TCPTN     Q+    +D S
Sbjct: 6   FSGSTKSALSTTLYDTRPFRVSIQKPIEIPLNCSAYNLTGTCPTN-----QSPIPENDQS 60

Query: 74  IPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
            P ++TCP+YFRWIHEDL PW  TGIT++M+ERA QTA+F+L+++  K Y+  Y+++ QT
Sbjct: 61  RPSSATCPEYFRWIHEDLRPWARTGITQEMVERAKQTANFKLVILKGKAYLETYEKAYQT 120

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           RDVF+IWGILQLLR+YPG++PDLELMFDC D PV+    Y+GPN + PPPLFRY G+  T
Sbjct: 121 RDVFSIWGILQLLRRYPGKIPDLELMFDCVDWPVLLVDRYNGPNTEQPPPLFRYCGNDAT 180

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
           +DIVFPDWSFWGWAE+NIKPWE LL ELKEG     W++REPYAYWKGNP VAETR DL+
Sbjct: 181 LDIVFPDWSFWGWAEVNIKPWEILLGELKEGTKRIPWLNREPYAYWKGNPVVAETRLDLM 240

Query: 254 TCNLSDKHDWNARLYVQ 270
            CN+S+  DWNARLY Q
Sbjct: 241 KCNVSENQDWNARLYAQ 257


>gi|356561622|ref|XP_003549080.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
          Length = 525

 Score =  313 bits (803), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 154/276 (55%), Positives = 191/276 (69%), Gaps = 17/276 (6%)

Query: 1   MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPP-----KRITTPLNCVK-NQTQT 54
           +I  FA++  L ++      S +K+    S  P    P     ++   PLNC   N T T
Sbjct: 52  IIGAFAYTRTLDTHRMFSGASSSKSA--QSTTPYGTSPFTVSIRKPIAPLNCTAYNLTGT 109

Query: 55  CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
           CPTN      +         P T+TCPDYFRWIHEDL PW  TGIT+DM+ERA QTA+FR
Sbjct: 110 CPTNLQDHQNS---------PATATCPDYFRWIHEDLRPWARTGITQDMVERAKQTANFR 160

Query: 115 LILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
           LI++  + Y+  Y +  QTRDVF+IWGILQLLR+YPG++PDLELMFDC+D PV+ +  Y+
Sbjct: 161 LIILKGRAYLETYSRPYQTRDVFSIWGILQLLRRYPGKIPDLELMFDCEDWPVVLADRYN 220

Query: 175 GPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
           GPN + PPPLFRY G+  T+DIVFPDWSFWGWAEINIKPW  LL ELKEG     W++RE
Sbjct: 221 GPNVEQPPPLFRYCGNDATLDIVFPDWSFWGWAEINIKPWHILLGELKEGTTRIPWLNRE 280

Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           PYAYWKGNP VAETR+DL+ CN+S+  DWNARL+ Q
Sbjct: 281 PYAYWKGNPAVAETRQDLIKCNVSENQDWNARLFAQ 316


>gi|449452346|ref|XP_004143920.1| PREDICTED: O-glucosyltransferase rumi-like [Cucumis sativus]
          Length = 514

 Score =  313 bits (802), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 149/237 (62%), Positives = 182/237 (76%), Gaps = 16/237 (6%)

Query: 44  PLNC----VKNQTQ--TCPTNYPKTSQTQESISDYSIPPT--STCPDYFRWIHEDLSPWK 95
           PLNC      NQTQ  TC  +YP   + +      SI P+  S CP+YFRWIHEDL PW 
Sbjct: 76  PLNCSSSSTTNQTQHFTCRKDYPTLYEPE------SIGPSGRSVCPEYFRWIHEDLKPWA 129

Query: 96  VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
             GITR+M+E+   TAHFRL +V   VY+  YK+SIQTRD+FTIWGILQLLR+YPG++PD
Sbjct: 130 AGGITREMVEKGKATAHFRLAVVRGIVYVEHYKKSIQTRDLFTIWGILQLLRRYPGQIPD 189

Query: 156 LELMFDCDDRPVIRSRDY--SGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
           LELMFDCDDRPV++S DY  +G +    PP+FRY GD  T+DIVFPDWSFWGWAEINI+P
Sbjct: 190 LELMFDCDDRPVVKSADYRNAGVDTVEAPPVFRYCGDEETLDIVFPDWSFWGWAEINIRP 249

Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           WE+LL+ELK+GN  R W+ RE +AYWKGNP+VA+TR+DLL CNLS ++DWNARLY+Q
Sbjct: 250 WENLLKELKKGNEKRKWMKREAFAYWKGNPYVADTRQDLLKCNLSLQNDWNARLYIQ 306


>gi|110743626|dbj|BAE99650.1| hypothetical protein [Arabidopsis thaliana]
          Length = 433

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 138/220 (62%), Positives = 175/220 (79%), Gaps = 1/220 (0%)

Query: 52  TQTCPT-NYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
           T TCP  NYP + ++     +    P++TCPDYFRWIHEDL PW+ TGITR+ LERAN T
Sbjct: 1   TGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDLRPWEKTGITREALERANAT 60

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
           A FRL ++N ++Y+ K++++ QTRDVFTIWG +QLLR+YPG++PDLELMFDC D PV+++
Sbjct: 61  AIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKA 120

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
            +++G +   PPPLFRY  +  T+DIVFPDWS+WGWAE+NIKPWESLL+EL+EGN    W
Sbjct: 121 AEFAGVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKPWESLLKELREGNQRTKW 180

Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           IDREPYAYWKGNP VAETR DL+ CNLS+ +DW ARLY Q
Sbjct: 181 IDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQ 220


>gi|356522636|ref|XP_003529952.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
          Length = 510

 Score =  310 bits (795), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 146/246 (59%), Positives = 181/246 (73%), Gaps = 16/246 (6%)

Query: 27  IIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
           II + KPQ +       PL+C +N TQTC  +YP               PT TCP YFRW
Sbjct: 67  IIPTTKPQEQ-----EFPLSCTQNVTQTCSRDYPTIHTPTN--------PTRTCPSYFRW 113

Query: 87  IHEDLSPWKVT--GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQ 144
           IHEDL PW+    GITR+MLE A +TAHFRL++V+ K+Y+ KYK++IQTRDVFT+WGILQ
Sbjct: 114 IHEDLWPWRERDRGITREMLEGARRTAHFRLVIVDGKLYVEKYKKAIQTRDVFTLWGILQ 173

Query: 145 LLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
           LLR YPG++PDLEL+FDCDDRPV+    + GPN   PP LFRY  D+W++DIVFPDWSFW
Sbjct: 174 LLRMYPGKVPDLELLFDCDDRPVVSKERFKGPNAPTPP-LFRYCSDQWSLDIVFPDWSFW 232

Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
           GWAEINIKPW+ +L+E+KEGN    W DR PYAYWKGNP V+ TR+DL+ CN+++K DWN
Sbjct: 233 GWAEINIKPWKHVLKEIKEGNEKTKWKDRVPYAYWKGNPLVSPTRKDLMKCNVTEKDDWN 292

Query: 265 ARLYVQ 270
             LY+Q
Sbjct: 293 THLYIQ 298


>gi|356497228|ref|XP_003517464.1| PREDICTED: KDEL motif-containing protein 1-like [Glycine max]
          Length = 522

 Score =  310 bits (795), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 142/231 (61%), Positives = 174/231 (75%), Gaps = 13/231 (5%)

Query: 40  RITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGI 99
           ++   LNC  N  Q C T+YP +         +       CP+YFRWIHED+  WK  GI
Sbjct: 89  QLEATLNCSNNGKQRC-TSYPTSGV-------FEREEGGVCPEYFRWIHEDVGAWKERGI 140

Query: 100 TRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
           +R+M+ERA ++AHFRL++   +VY+ +YK+SIQTR+VFT+WGI+QLLRKYPG++ DLELM
Sbjct: 141 SREMVERAKKSAHFRLVVKRGRVYVERYKKSIQTREVFTMWGIVQLLRKYPGKVADLELM 200

Query: 160 FDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLR 219
           FDCDD PVIR     G +  GPPPLFRY GDRWT DIVFPDWSFWGWAEINI+PWE +L+
Sbjct: 201 FDCDDLPVIR-----GSSLAGPPPLFRYCGDRWTDDIVFPDWSFWGWAEINIRPWEHVLK 255

Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           E+++GN    W DREPYAYWKGNPFVAETR+DLL CN+S   DWNARLYVQ
Sbjct: 256 EMEKGNRRIKWNDREPYAYWKGNPFVAETRQDLLKCNVSTTQDWNARLYVQ 306


>gi|357497409|ref|XP_003618993.1| KTEL motif-containing protein [Medicago truncatula]
 gi|355494008|gb|AES75211.1| KTEL motif-containing protein [Medicago truncatula]
          Length = 515

 Score =  307 bits (786), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 141/230 (61%), Positives = 179/230 (77%), Gaps = 12/230 (5%)

Query: 44  PLNCV--KNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGIT 100
           PL C   + +TQTCP +YP K + T ++        + TCP +FRWIHEDL PWK  GIT
Sbjct: 83  PLRCTNGEKETQTCPRDYPTKHNPTNQN--------SHTCPSFFRWIHEDLKPWKEKGIT 134

Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMF 160
           R+MLE A +TA+F++++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR +PG+LPDLELMF
Sbjct: 135 REMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRMFPGKLPDLELMF 194

Query: 161 DCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
           DC+DRPVI   ++ GP N  PPPLFRY  D+W++DIVFPDWSFWGWAE NIKPW+++L+E
Sbjct: 195 DCEDRPVIHKGNFQGP-NASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWKNILKE 253

Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           +KEGN    W DR PYAYWKGNP VA TR++LL CN + K DWN RLY+Q
Sbjct: 254 IKEGNKETKWKDRVPYAYWKGNPNVAATRKNLLRCNATSKDDWNTRLYIQ 303


>gi|297824639|ref|XP_002880202.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326041|gb|EFH56461.1| hypothetical protein ARALYDRAFT_483721 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  303 bits (777), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 141/232 (60%), Positives = 173/232 (74%), Gaps = 2/232 (0%)

Query: 39  KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
           +R     + VKNQTQ  P +    +  ++  S +S P  STCP YFRWIHEDL PWK TG
Sbjct: 75  QRFPNQCDVVKNQTQLFPEDGSSRNNNKKPRSSHSRP--STCPSYFRWIHEDLRPWKETG 132

Query: 99  ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
           ITR MLE+A +TAHFR+++++ +VY+ KY+ SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 133 ITRGMLEKARRTAHFRVVILDGRVYVKKYRNSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 192

Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
           MFD DDRP +RS+D+ G  +  PPPLFRY  D  ++DIVFPDWSFWGWAE+NIKPW   L
Sbjct: 193 MFDPDDRPTVRSKDFQGHQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWAKSL 252

Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
             ++EGN    W DR  YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 253 VAIEEGNKMTQWTDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 304


>gi|449446159|ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  300 bits (768), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 144/248 (58%), Positives = 178/248 (71%), Gaps = 10/248 (4%)

Query: 29  VSRKPQNRPPK-RITTPLNCVKNQTQT---CPTNYPKTSQTQESISDYSIPPTST--CPD 82
           V   P ++P + ++   L+C      T   CP +YP    T E  +    PP+S+  CPD
Sbjct: 83  VPHNPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQN----PPSSSSACPD 138

Query: 83  YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
           YFRWIHEDL PW  TGITR  LE   +TA+FRL+++N K Y+  YK+S QTRD FT+WGI
Sbjct: 139 YFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGI 198

Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
           LQLLR+YPG++PDL+LMFDC D PVI +  +SGPN   PPPLFRY GD  T DIVFPDWS
Sbjct: 199 LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWS 258

Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
           FWGW EINIKPWE LL+++KEGN    W  REPYAYWKGNP VA+TR+DL+ CN+SD+ D
Sbjct: 259 FWGWPEINIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEVADTRKDLIKCNVSDQQD 318

Query: 263 WNARLYVQ 270
           WNAR++ Q
Sbjct: 319 WNARVFAQ 326


>gi|224127624|ref|XP_002320120.1| predicted protein [Populus trichocarpa]
 gi|222860893|gb|EEE98435.1| predicted protein [Populus trichocarpa]
          Length = 411

 Score =  300 bits (767), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 135/194 (69%), Positives = 165/194 (85%), Gaps = 1/194 (0%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
           TSTCP YF+WIH+DL  WK TGIT+DM+ERA +TAHFRL++VN K Y+ KY+QSIQTRD+
Sbjct: 9   TSTCPSYFQWIHDDLRHWKETGITQDMIERARKTAHFRLVIVNGKAYVEKYRQSIQTRDM 68

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           FT+WGILQLLR YPGRLPDLELMFDCDDRPVI S+ + GP N  PPPLFRY  D  ++DI
Sbjct: 69  FTLWGILQLLRLYPGRLPDLELMFDCDDRPVIPSKHFRGP-NAAPPPLFRYCSDWQSLDI 127

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGWAE NI+PW++LL+E+KEGN+   W DR PYAYW+GNP+V+  R+DLL CN
Sbjct: 128 VFPDWSFWGWAETNIRPWKNLLKEIKEGNSRTKWKDRTPYAYWRGNPWVSPIRQDLLKCN 187

Query: 257 LSDKHDWNARLYVQ 270
           +S+++DWN RLY+Q
Sbjct: 188 VSEQNDWNTRLYLQ 201


>gi|42569947|ref|NP_182107.3| downstream target of AGL15 2 [Arabidopsis thaliana]
 gi|330255513|gb|AEC10607.1| downstream target of AGL15 2 [Arabidopsis thaliana]
          Length = 523

 Score =  299 bits (766), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 139/232 (59%), Positives = 171/232 (73%), Gaps = 3/232 (1%)

Query: 39  KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
           +R       V+NQTQ  P N    +  +   S   I   STCP YFRWIHEDL PWK TG
Sbjct: 81  QRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRI---STCPSYFRWIHEDLRPWKETG 137

Query: 99  ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
           +TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 138 VTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 197

Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
           MFD DDRP +RS+D+ G  +  PPPLFRY  D  ++DIVFPDWSFWGWAE+NIKPW+  L
Sbjct: 198 MFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSL 257

Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
             ++EGN    W DR  YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 258 VAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 309


>gi|449485616|ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 143/248 (57%), Positives = 178/248 (71%), Gaps = 10/248 (4%)

Query: 29  VSRKPQNRPPK-RITTPLNCVKNQTQT---CPTNYPKTSQTQESISDYSIPPTST--CPD 82
           V   P ++P + ++   L+C      T   CP +YP    T E  +    PP+S+  CPD
Sbjct: 83  VPHNPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQN----PPSSSSACPD 138

Query: 83  YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
           YFRWIHEDL PW  TGITR  LE   +TA+FRL+++N K Y+  YK+S QTRD FT+WGI
Sbjct: 139 YFRWIHEDLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGI 198

Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
           LQLLR+YPG++PDL+LMFDC D PVI +  +SGPN   PPPLFRY GD  T DIVFPDWS
Sbjct: 199 LQLLRRYPGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWS 258

Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
           FWGW EINIKPWE LL+++KEGN    W  R+PYAYWKGNP VA+TR+DL+ CN+SD+ D
Sbjct: 259 FWGWPEINIKPWEPLLKDIKEGNKRIPWKSRQPYAYWKGNPEVADTRKDLIKCNVSDQQD 318

Query: 263 WNARLYVQ 270
           WNAR++ Q
Sbjct: 319 WNARVFAQ 326


>gi|3386611|gb|AAC28541.1| unknown protein [Arabidopsis thaliana]
          Length = 517

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 139/232 (59%), Positives = 171/232 (73%), Gaps = 3/232 (1%)

Query: 39  KRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG 98
           +R       V+NQTQ  P N    +  +   S   I   STCP YFRWIHEDL PWK TG
Sbjct: 75  QRFPNQCGVVQNQTQLFPQNGSSRNNDKPRSSHSRI---STCPSYFRWIHEDLRPWKETG 131

Query: 99  ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
           +TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLEL
Sbjct: 132 VTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLEL 191

Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLL 218
           MFD DDRP +RS+D+ G  +  PPPLFRY  D  ++DIVFPDWSFWGWAE+NIKPW+  L
Sbjct: 192 MFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSL 251

Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
             ++EGN    W DR  YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 252 VAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 303


>gi|224074911|ref|XP_002304487.1| predicted protein [Populus trichocarpa]
 gi|222841919|gb|EEE79466.1| predicted protein [Populus trichocarpa]
          Length = 380

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 133/168 (79%), Positives = 148/168 (88%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           MLERA  TAHFRLI+V  K Y+ KYK+SIQTRD FTIWGILQLLR+YPG++PDLELMFDC
Sbjct: 1   MLERAKTTAHFRLIIVKGKAYLEKYKKSIQTRDAFTIWGILQLLRRYPGKIPDLELMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
           DD PVI+S DY GPN  GPPPLFRY GD+WT DIVFPDWSFWGWAEINIKPW+ LL +LK
Sbjct: 61  DDLPVIQSSDYRGPNKTGPPPLFRYCGDKWTEDIVFPDWSFWGWAEINIKPWDKLLIDLK 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EGNN   WIDREPYAYWKGNPFVAETR+DLLTCN+SD+ DWNARL++Q
Sbjct: 121 EGNNRSRWIDREPYAYWKGNPFVAETRKDLLTCNVSDQQDWNARLFIQ 168


>gi|357497413|ref|XP_003618995.1| KTEL motif-containing protein [Medicago truncatula]
 gi|355494010|gb|AES75213.1| KTEL motif-containing protein [Medicago truncatula]
          Length = 303

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 131/228 (57%), Positives = 178/228 (78%), Gaps = 10/228 (4%)

Query: 45  LNCVKNQT--QTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
           L C++ ++  Q CP + P T+    +++++       CP YFRWIHEDL PW+  GITR+
Sbjct: 62  LKCIQGKSFKQKCPRDNP-TTHNPTNLNNH------VCPSYFRWIHEDLKPWREKGITRN 114

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           ML++A +TAHF+L++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR YPG+LPDLELMFDC
Sbjct: 115 MLKKAKKTAHFKLVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRLYPGKLPDLELMFDC 174

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
           +D+PV+    + GP N  PPPLFRY  D+W++DIVFPDWSFWGWAE NIKPW ++L+++K
Sbjct: 175 EDKPVVPLDKFQGP-NASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWNNILKDIK 233

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EGN    W DR PYAYWKGNP+VA TR++LL CN++ ++DWN  +Y+Q
Sbjct: 234 EGNKKTKWKDRVPYAYWKGNPYVAPTRQNLLQCNVTLENDWNTLIYIQ 281


>gi|357481655|ref|XP_003611113.1| KTEL motif-containing protein [Medicago truncatula]
 gi|355512448|gb|AES94071.1| KTEL motif-containing protein [Medicago truncatula]
          Length = 502

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 139/231 (60%), Positives = 167/231 (72%), Gaps = 23/231 (9%)

Query: 43  TPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
           T LNC +KNQT +  +NYP T                 CP+YF+WIHEDL PWK  GITR
Sbjct: 82  TVLNCTIKNQTCSS-SNYPTTKNNM------------VCPEYFKWIHEDLKPWKKKGITR 128

Query: 102 DMLERANQTAHFRLILVNNKVYIHKYK--QSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
           +M+E+A +TAHFRL++ N K Y+ KYK  ++IQTRDVFT+WGILQLLRKYPG++PDLELM
Sbjct: 129 EMVEKAKKTAHFRLVVKNGKGYLEKYKNKEAIQTRDVFTVWGILQLLRKYPGKIPDLELM 188

Query: 160 FDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLR 219
           FDC+D+PV+       P    PPP+F Y  DRWT DIVFPDWSFWGWAEINIKPWE LL+
Sbjct: 189 FDCNDKPVV-------PIGLDPPPVFGYCADRWTQDIVFPDWSFWGWAEINIKPWEHLLK 241

Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           ++K+GN    W DREPYAYWKGNP+ A TR D L CN+S   DWN RL+ Q
Sbjct: 242 DIKKGNKRVKWKDREPYAYWKGNPYTAATRLDFLNCNVSTAQDWNLRLFTQ 292


>gi|110736095|dbj|BAF00020.1| hypothetical protein [Arabidopsis thaliana]
          Length = 418

 Score =  290 bits (742), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 129/193 (66%), Positives = 157/193 (81%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           STCP YFRWIHEDL PWK TG+TR MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVF
Sbjct: 12  STCPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVF 71

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           T+WGI+QLLR YPGRLPDLELMFD DDRP +RS+D+ G  +  PPPLFRY  D  ++DIV
Sbjct: 72  TLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIV 131

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           FPDWSFWGWAE+NIKPW+  L  ++EGN    W DR  YAYW+GNP VA TRRDLL CN+
Sbjct: 132 FPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNV 191

Query: 258 SDKHDWNARLYVQ 270
           S + DWN RLY+Q
Sbjct: 192 SAQEDWNTRLYIQ 204


>gi|449446167|ref|XP_004140843.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 442

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 125/196 (63%), Positives = 152/196 (77%)

Query: 75  PPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
           P  +TCP+YFRWIHEDL PW   GIT+ MLE A + AHFR+++V  K Y+  Y ++ Q+R
Sbjct: 16  PMVATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSR 75

Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
           D  T+WG++QLLR+YPG+LPDL+LMF CDDRP I  +DYSG     PPPLFRYSGD  T 
Sbjct: 76  DNLTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATW 135

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           DIVFPDWSFWGW EINIK WES+L+++KEGN    W+ R+PYAYWKGNP VA TRRDLL 
Sbjct: 136 DIVFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLK 195

Query: 255 CNLSDKHDWNARLYVQ 270
           CN++ K DW+ARLY Q
Sbjct: 196 CNVTQKQDWSARLYRQ 211


>gi|449526435|ref|XP_004170219.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 426

 Score =  290 bits (741), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 124/194 (63%), Positives = 151/194 (77%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
            +TCP+YFRWIHEDL PW   GIT+ MLE A + AHFR+++V  K Y+  Y ++ Q+RD 
Sbjct: 2   VATCPEYFRWIHEDLKPWAGRGITKSMLEEAQKKAHFRVVVVEGKAYVEAYGKAYQSRDN 61

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
            T+WG++QLLR+YPG+LPDL+LMF CDDRP I  +DYSG     PPPLFRYSGD  T DI
Sbjct: 62  LTVWGVVQLLRRYPGKLPDLDLMFSCDDRPEIYQKDYSGAEKPSPPPLFRYSGDDATWDI 121

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGW EINIK WES+L+++KEGN    W+ R+PYAYWKGNP VA TRRDLL CN
Sbjct: 122 VFPDWSFWGWPEINIKAWESMLKDIKEGNKKMGWMKRQPYAYWKGNPAVAYTRRDLLKCN 181

Query: 257 LSDKHDWNARLYVQ 270
           ++ K DW+ARLY Q
Sbjct: 182 VTQKQDWSARLYRQ 195


>gi|297840159|ref|XP_002887961.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333802|gb|EFH64220.1| hypothetical protein ARALYDRAFT_475010 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 571

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 143/249 (57%), Positives = 181/249 (72%), Gaps = 16/249 (6%)

Query: 31  RKPQNRPPKRITTPLNCV----KNQTQTCP-TNYPKTSQTQESISDYSIPPTSTCPDYFR 85
           R+P+ +  +     L+C     +N + +C  T  P  +Q Q            +CPDYF+
Sbjct: 119 REPEKKSQQNTGISLDCTSFLNQNGSGSCSRTPQPNNNQNQTE-------SNRSCPDYFK 171

Query: 86  WIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQL 145
           WIHEDL PW+ TGIT++M+ER   TAHFRL++VN KV++  YK+SIQTRD FT+WGILQL
Sbjct: 172 WIHEDLKPWRETGITKEMVERGKTTAHFRLVIVNGKVFVENYKKSIQTRDAFTLWGILQL 231

Query: 146 LRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWS 202
           LRKYPG+LPD++LMFDCDDRPVIRS  Y+  N      PPPLFRY GDRWT+DIVFPDWS
Sbjct: 232 LRKYPGKLPDVDLMFDCDDRPVIRSDGYNRSNRTAEDVPPPLFRYCGDRWTVDIVFPDWS 291

Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKH 261
           FWGW EINIK W  +L+E++EG   + +++RE YAYWKGNPFVA  +R DLLTCNLS  H
Sbjct: 292 FWGWQEINIKEWSKVLKEMEEGKKKKKFMEREAYAYWKGNPFVASPSREDLLTCNLSSLH 351

Query: 262 DWNARLYVQ 270
           DWNAR+++Q
Sbjct: 352 DWNARIFIQ 360


>gi|147798877|emb|CAN65871.1| hypothetical protein VITISV_012499 [Vitis vinifera]
          Length = 424

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 134/231 (58%), Positives = 166/231 (71%), Gaps = 8/231 (3%)

Query: 39  KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
           KR    +NC + N T+TCP     T  T    S+ S   T TCP+YFRWI+EDL PW  T
Sbjct: 80  KRFEYSMNCSEGNMTKTCPV----TLLTTFEPSNLS---TGTCPEYFRWIYEDLKPWTET 132

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
           GITRDM+ERA   AH R+++V+ KVY  KYK   QTRDVFTIWGILQ+LR YPG+LPD +
Sbjct: 133 GITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFTIWGILQVLRMYPGKLPDFD 192

Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
           LMF+C D+PVI+  DY GPN   PPPLF Y GD  T+DIVFPDWSFWGW EI I+PW +L
Sbjct: 193 LMFECGDKPVIKKHDYQGPNATAPPPLFHYCGDDETLDIVFPDWSFWGWPEIRIRPWSTL 252

Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
            ++L+EGNN   W+BREPYAYWKGN  +  TR +L  C+ +++ DWNAR+Y
Sbjct: 253 RKDLREGNNKTKWVBREPYAYWKGNFKMGVTRHELSKCSKTNEQDWNARIY 303


>gi|6633846|gb|AAF19705.1|AC008047_12 F2K11.20 [Arabidopsis thaliana]
          Length = 605

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 138/249 (55%), Positives = 182/249 (73%), Gaps = 14/249 (5%)

Query: 31  RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
           R  + + P+   + ++C     +N++ +C           ++ S+ S      CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
           IHEDL PW+ TGIT++M+ER   TAHFRL+++N KV++  YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236

Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
           RKYPG+LPD++LMFDCDDRPVIRS  Y+  N      PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296

Query: 204 WGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHD 262
           WGW EINI+ W  +L+E++EG   + +++R+ YAYWKGNPFVA  +R DLLTCNLS  HD
Sbjct: 297 WGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHD 356

Query: 263 WNARLYVQV 271
           WNAR+++QV
Sbjct: 357 WNARIFIQV 365


>gi|15233070|ref|NP_191686.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6850894|emb|CAB71057.1| putative protein [Arabidopsis thaliana]
 gi|27754477|gb|AAO22686.1| unknown protein [Arabidopsis thaliana]
 gi|28393951|gb|AAO42383.1| unknown protein [Arabidopsis thaliana]
 gi|332646660|gb|AEE80181.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 498

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 135/229 (58%), Positives = 166/229 (72%), Gaps = 7/229 (3%)

Query: 44  PLNC--VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
           P  C  V+NQ+   P +  + S+   + S  S    STCP YFRWIHEDL PWK TGITR
Sbjct: 63  PYGCDFVQNQSSQTPISQNRKSRLNPNNSSKS----STCPSYFRWIHEDLRPWKQTGITR 118

Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
            M+E A++TAHFRL++ N K Y+ +YK+SIQTRD FT+WGILQLLR YPG+LPDLELMFD
Sbjct: 119 GMIEEASRTAHFRLVIRNGKAYVKRYKKSIQTRDEFTLWGILQLLRWYPGKLPDLELMFD 178

Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
            DDRPV+RS D+ G   K PPP+FRY  D  ++DIVFPDWSFWGWAE+N+KPW   L  +
Sbjct: 179 ADDRPVVRSVDFIG-QQKEPPPVFRYCSDDASLDIVFPDWSFWGWAEVNVKPWGKSLEAI 237

Query: 222 KEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           KEGN+   W DR  YAYW+GNP+V   R DLL CN ++  +WN RLY+Q
Sbjct: 238 KEGNSMTQWKDRVAYAYWRGNPYVDPGRGDLLKCNATEHEEWNTRLYIQ 286


>gi|356506263|ref|XP_003521906.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
          Length = 476

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 137/266 (51%), Positives = 178/266 (66%), Gaps = 17/266 (6%)

Query: 6   AFSIILQSNFS-VHNISRNKTVIIVSRK-PQNRPPKRITTPLNCVK-NQTQTCPTNYPKT 62
           A + +L+ N S +   +  KT+II +++ PQ         PLNC   N  +TC + YP T
Sbjct: 17  ASTFLLEINLSRITRTTIFKTIIIFNKQQPQ--------FPLNCTDGNLAKTCSSYYPTT 68

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
               +  S      T++CP+YF+WIHEDL PWK  GITRDM+ER    +HFRL++VN K 
Sbjct: 69  FDLDDDSS------TTSCPNYFKWIHEDLKPWKSKGITRDMVERGKNVSHFRLVIVNGKA 122

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
           Y+ KY +  QTRDVFTIWGILQLLR YPG++PDL+LMF C D+PV+  +D+ GP    PP
Sbjct: 123 YVEKYDKVYQTRDVFTIWGILQLLRLYPGKIPDLDLMFQCGDKPVVLKKDFQGPQAMSPP 182

Query: 183 PLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN 242
           P+F Y GD    DIVFPDWSFWGW EINI PWE+ L ++ EGN    W DR PYA+WKGN
Sbjct: 183 PVFHYCGDENAHDIVFPDWSFWGWPEINIGPWETTLHKILEGNKMIKWKDRTPYAFWKGN 242

Query: 243 PFVAETRRDLLTCNLSDKHDWNARLY 268
             +A+ RR+L  CN + +HDWNAR++
Sbjct: 243 LAMADIRRELGKCNPTKEHDWNARIH 268


>gi|15222413|ref|NP_176531.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332195976|gb|AEE34097.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 578

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 137/248 (55%), Positives = 181/248 (72%), Gaps = 14/248 (5%)

Query: 31  RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
           R  + + P+   + ++C     +N++ +C           ++ S+ S      CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
           IHEDL PW+ TGIT++M+ER   TAHFRL+++N KV++  YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236

Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
           RKYPG+LPD++LMFDCDDRPVIRS  Y+  N      PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296

Query: 204 WGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHD 262
           WGW EINI+ W  +L+E++EG   + +++R+ YAYWKGNPFVA  +R DLLTCNLS  HD
Sbjct: 297 WGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHD 356

Query: 263 WNARLYVQ 270
           WNAR+++Q
Sbjct: 357 WNARIFIQ 364


>gi|359489784|ref|XP_002273475.2| PREDICTED: O-glucosyltransferase rumi [Vitis vinifera]
          Length = 604

 Score =  283 bits (725), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 137/240 (57%), Positives = 166/240 (69%), Gaps = 11/240 (4%)

Query: 30  SRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
           S KP N   K+I   LNC + N  QTCP     T +  E       P T TCP+YFRWI+
Sbjct: 163 SIKPCN---KKIEYSLNCSEGNMAQTCPVTPLATFEPSE-------PSTETCPEYFRWIY 212

Query: 89  EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
           EDL PW+ TGITRDM+ERA   A+ R+++V+ KVY+ KYK   QTRDVFTIWGILQLLR 
Sbjct: 213 EDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVYMEKYKGVYQTRDVFTIWGILQLLRM 272

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
           YPG+LPD +LMF C D+  +++R + GPN   PPPLF Y GD  T DIVFPDWSFWGW E
Sbjct: 273 YPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPPLFHYCGDDETYDIVFPDWSFWGWPE 332

Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
           I+IK W +L ++LKEGNN   WIDREPYAYWKGN  +   R +L  CN + + DWNAR+Y
Sbjct: 333 IHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNINLGLARHELSKCNKTSEQDWNARIY 392



 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 31/60 (51%), Positives = 45/60 (75%)

Query: 106 RANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
           ++   AH R+++V+ KV++ KY+   QTR V TIW ILQLLR YPG+LPDL+L+F+C  +
Sbjct: 12  KSGNPAHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 71


>gi|297745255|emb|CBI40335.3| unnamed protein product [Vitis vinifera]
          Length = 578

 Score =  283 bits (724), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 137/240 (57%), Positives = 166/240 (69%), Gaps = 11/240 (4%)

Query: 30  SRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIH 88
           S KP N   K+I   LNC + N  QTCP     T +  E       P T TCP+YFRWI+
Sbjct: 137 SIKPCN---KKIEYSLNCSEGNMAQTCPVTPLATFEPSE-------PSTETCPEYFRWIY 186

Query: 89  EDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRK 148
           EDL PW+ TGITRDM+ERA   A+ R+++V+ KVY+ KYK   QTRDVFTIWGILQLLR 
Sbjct: 187 EDLRPWRETGITRDMVERAKPAAYIRVVVVDGKVYMEKYKGVYQTRDVFTIWGILQLLRM 246

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
           YPG+LPD +LMF C D+  +++R + GPN   PPPLF Y GD  T DIVFPDWSFWGW E
Sbjct: 247 YPGKLPDFDLMFSCGDKLAMKTRYFQGPNATTPPPLFHYCGDDETYDIVFPDWSFWGWPE 306

Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
           I+IK W +L ++LKEGNN   WIDREPYAYWKGN  +   R +L  CN + + DWNAR+Y
Sbjct: 307 IHIKQWNTLKKDLKEGNNRTEWIDREPYAYWKGNINLGLARHELSKCNKTSEQDWNARIY 366



 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 31/60 (51%), Positives = 45/60 (75%)

Query: 106 RANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
           ++   AH R+++V+ KV++ KY+   QTR V TIW ILQLLR YPG+LPDL+L+F+C  +
Sbjct: 15  KSGNPAHIRVVVVDGKVFMEKYEWVFQTRHVLTIWCILQLLRMYPGKLPDLDLIFECGAK 74


>gi|356522630|ref|XP_003529949.1| PREDICTED: O-glucosyltransferase rumi homolog [Glycine max]
          Length = 497

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 132/246 (53%), Positives = 173/246 (70%), Gaps = 15/246 (6%)

Query: 24  KTVIIVSRKPQNRPPKRITTPLNCVKNQT-QTCPTNYPKTSQTQESISDYSIPPTSTCPD 82
           KT+I+ +    N+ P+    PLNC   ++  TCP+ YP   +  +  S+      ++CP+
Sbjct: 58  KTIIVFN----NQQPQ---FPLNCTNGKSASTCPSYYPTKLEFDDDSSN------TSCPE 104

Query: 83  YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
           YFRWIHEDL PW+ TGITRDM+ER    +HFRL++VN K YI K+ +S QTRDVFTIWGI
Sbjct: 105 YFRWIHEDLKPWESTGITRDMVERGKHISHFRLVIVNGKAYIEKFAKSYQTRDVFTIWGI 164

Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
           LQLLR YPG++PDLELMF C DR V+  +D+  P    PPP+F Y G+  + DIVFPDW+
Sbjct: 165 LQLLRLYPGKIPDLELMFQCGDRTVVFKKDFQVP-KMSPPPVFHYCGEENSYDIVFPDWT 223

Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
           FWGWAE++I+PWE+ L  + EGN    W DR PYA+WKGNP V+  RR+L  CN ++KHD
Sbjct: 224 FWGWAELSIRPWETTLHNILEGNKLVKWKDRIPYAFWKGNPTVSIIRRELGKCNTTEKHD 283

Query: 263 WNARLY 268
           WNAR+Y
Sbjct: 284 WNARIY 289


>gi|297817438|ref|XP_002876602.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322440|gb|EFH52861.1| hypothetical protein ARALYDRAFT_486604 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 496

 Score =  280 bits (716), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 133/227 (58%), Positives = 164/227 (72%), Gaps = 4/227 (1%)

Query: 44  PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
           P  C   Q Q+  T++   SQ + S  +      STCP YFRWIHEDL PWK TGITR M
Sbjct: 62  PYGCDFVQNQSNQTHF---SQNRASRLNPIRSKPSTCPSYFRWIHEDLRPWKQTGITRGM 118

Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
           +E A++TAHFRL++ N K Y+ +Y++SIQTRD FT+WGI+QLLR +PG+LPDLELMFD D
Sbjct: 119 IEEASRTAHFRLVIRNGKAYVKRYRKSIQTRDDFTLWGIVQLLRWFPGKLPDLELMFDAD 178

Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
           DRPV+RS D+ G   K PPP+FRY  D  ++DIVFPDWSFWGWAE+NIKPW   L  +KE
Sbjct: 179 DRPVVRSADFIG-QQKKPPPVFRYCSDDASLDIVFPDWSFWGWAEVNIKPWGESLEAIKE 237

Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           GNN   W +R  YAYW+GNP V   R DLL CN+S+  +WN RLY+Q
Sbjct: 238 GNNMTQWKERVAYAYWRGNPHVDPGRGDLLKCNVSENEEWNTRLYIQ 284


>gi|388505300|gb|AFK40716.1| unknown [Medicago truncatula]
          Length = 285

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 129/210 (61%), Positives = 164/210 (78%), Gaps = 12/210 (5%)

Query: 44  PLNCV--KNQTQTCPTNYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGIT 100
           PL C   + +TQTCP +YP K + T ++        + TCP +FRWIHEDL PWK  GIT
Sbjct: 83  PLRCTNGEKETQTCPRDYPTKHNPTNQN--------SHTCPSFFRWIHEDLKPWKEKGIT 134

Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMF 160
           R+MLE A +TA+F++++V+ K+Y+ KY++SIQTRDVFT+WGILQLLR +PG+LPDLELMF
Sbjct: 135 REMLEGAKRTANFKVVIVDGKMYVEKYRKSIQTRDVFTLWGILQLLRMFPGKLPDLELMF 194

Query: 161 DCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
           DC+DRPVI   ++ GP N  PPPLFRY  D+W++DIVFPDWSFWGWAE NIKPW+++L+E
Sbjct: 195 DCEDRPVIHKGNFQGP-NASPPPLFRYCSDQWSLDIVFPDWSFWGWAETNIKPWKNILKE 253

Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRR 250
           +KEGN    W DR PYAYWKGNP VA TR+
Sbjct: 254 IKEGNKETKWKDRVPYAYWKGNPNVAATRK 283


>gi|359489780|ref|XP_002273350.2| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
          Length = 509

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 134/231 (58%), Positives = 164/231 (70%), Gaps = 9/231 (3%)

Query: 39  KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
           KR    +NC + N T+TCP     T  T    S+ S   T TCP+YFRWI+EDL PW  T
Sbjct: 74  KRFEYSMNCSEGNMTKTCPV----TLLTTFEPSNLS---TGTCPEYFRWIYEDLKPWTET 126

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
           GITRDM+ERA   AH R+++V+ KVY  KYK   QTRDVFTIWGILQ+LR YPG+LPD +
Sbjct: 127 GITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDVFTIWGILQVLRMYPGKLPDFD 186

Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
           LMF+C D+PVI+  DY G N    PPLF Y GD  T+DIVFPDWSFWGW EI IKPW +L
Sbjct: 187 LMFECGDKPVIKKHDYQGLNATA-PPLFHYCGDDETLDIVFPDWSFWGWPEIRIKPWSTL 245

Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
            ++L+EGNN   W+DREPYAYWKGN  +  TR +L  C+ +++ DWNAR+Y
Sbjct: 246 RKDLREGNNKTKWVDREPYAYWKGNFKMGVTRHELSKCSKTNEQDWNARIY 296


>gi|359489776|ref|XP_002273194.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
          Length = 521

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 132/252 (52%), Positives = 166/252 (65%), Gaps = 8/252 (3%)

Query: 19  NISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPT 77
           N+S      + +  P     ++    LNC + N +QTCP   P   +  E       PP+
Sbjct: 58  NVSILTGDFLKNTSPTKLHSQKFEYSLNCREGNMSQTCPVTGPVAFEPGE-------PPS 110

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
            TCP+YFRWI+EDL PW  TGITR M+E+A   A  R+++V+ KVY+ KYK+  + RD F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           TIWGILQLLR YPG+LPD +LMF+C DRP+IR+  Y GP+   PPPLF Y GD  T DIV
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIV 230

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           FPDWSFWGW E NIKPW    ++LKEGN    WIDREPYAYWKGN  +   R++L  C  
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRN 290

Query: 258 SDKHDWNARLYV 269
           +D+ DWNARLY+
Sbjct: 291 TDEQDWNARLYI 302


>gi|147791844|emb|CAN70600.1| hypothetical protein VITISV_027960 [Vitis vinifera]
          Length = 521

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 131/252 (51%), Positives = 166/252 (65%), Gaps = 8/252 (3%)

Query: 19  NISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPT 77
           N+S      + +  P     ++    LNC + N +QTCP   P   +  E       PP+
Sbjct: 58  NVSILTGDFLKNTSPTKLHSQKFEYSLNCREGNMSQTCPVTGPVAFEPGE-------PPS 110

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
            TCP+YFRWI+EDL PW  TGITR M+E+A   A  R+++V+ KVY+ KYK+  + RD F
Sbjct: 111 ETCPEYFRWIYEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYVEKYKRVNRNRDEF 170

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           TIWGILQLLR YPG+LPD +LMF+C DRP+I++  Y GP+   PPPLF Y GD  T DIV
Sbjct: 171 TIWGILQLLRMYPGKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIV 230

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           FPDWSFWGW E NIKPW    ++LKEGN    WIDREPYAYWKGN  +   R++L  C  
Sbjct: 231 FPDWSFWGWPETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRN 290

Query: 258 SDKHDWNARLYV 269
           +D+ DWNARLY+
Sbjct: 291 TDEQDWNARLYI 302


>gi|224064021|ref|XP_002301352.1| predicted protein [Populus trichocarpa]
 gi|222843078|gb|EEE80625.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 121/195 (62%), Positives = 149/195 (76%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
           ++ CP+YFRWIHEDL  WK TGI+R M+ERA   AHFRL+++  K+Y+ KYK+S  TRDV
Sbjct: 20  STACPEYFRWIHEDLRIWKSTGISRAMVERAKDYAHFRLVILKGKIYVEKYKKSFHTRDV 79

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           FTIWGILQLLR YPG++PDLELMF CDDRPVI  +DY G N    P +F+Y G    + I
Sbjct: 80  FTIWGILQLLRLYPGKVPDLELMFWCDDRPVILKKDYQGTNATSSPSIFQYCGREDALGI 139

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDW+FWGWAE N+ PW++L ++LKE N    W DR PYAYW+GNP VA +RR L+ CN
Sbjct: 140 VFPDWTFWGWAETNVSPWKTLSKDLKEANKRTKWKDRVPYAYWRGNPNVAASRRQLMWCN 199

Query: 257 LSDKHDWNARLYVQV 271
           +SDK+DWNARLY Q 
Sbjct: 200 VSDKYDWNARLYKQA 214


>gi|218191256|gb|EEC73683.1| hypothetical protein OsI_08244 [Oryza sativa Indica Group]
          Length = 524

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 123/203 (60%), Positives = 156/203 (76%), Gaps = 4/203 (1%)

Query: 72  YSIPPTST---CPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKY 127
           + +PPT+T   CP YFR+IHEDL PW+  G ITR MLERA  TA+FRL+++  + Y+ + 
Sbjct: 115 HPLPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARDTANFRLVVLRGRAYVERI 174

Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY 187
             + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+++  Y G N    PPLFRY
Sbjct: 175 APAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQADRYQGENATAMPPLFRY 234

Query: 188 SGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
            GD  T+D+VFPDWSFWGW EINIKPW++L ++L  GN    W+DREPYAYWKGNP VA 
Sbjct: 235 CGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKGNPDVAT 294

Query: 248 TRRDLLTCNLSDKHDWNARLYVQ 270
            R++L+ CN+S KH+WNAR+Y Q
Sbjct: 295 KRKELVKCNVSSKHEWNARIYKQ 317


>gi|297599669|ref|NP_001047557.2| Os02g0642700 [Oryza sativa Japonica Group]
 gi|255671126|dbj|BAF09471.2| Os02g0642700, partial [Oryza sativa Japonica Group]
          Length = 474

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 123/201 (61%), Positives = 155/201 (77%), Gaps = 4/201 (1%)

Query: 74  IPPTST---CPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
           +PPT+T   CP YFR+IHEDL PW+  G ITR MLERA  TA+FRL+++  + Y+ +   
Sbjct: 67  LPPTTTNSSCPAYFRFIHEDLHPWRAAGGITRRMLERARDTANFRLVVLRGRAYVERIAP 126

Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
           + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC D PV+++  Y G N    PPLFRY G
Sbjct: 127 AFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVQADRYQGENATAMPPLFRYCG 186

Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
           D  T+D+VFPDWSFWGW EINIKPW++L ++L  GN    W+DREPYAYWKGNP VA  R
Sbjct: 187 DDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKGNPDVATKR 246

Query: 250 RDLLTCNLSDKHDWNARLYVQ 270
           ++L+ CN+S KH+WNAR+Y Q
Sbjct: 247 KELVKCNVSSKHEWNARIYKQ 267


>gi|359489782|ref|XP_002273411.2| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
          Length = 525

 Score =  273 bits (698), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 127/226 (56%), Positives = 156/226 (69%), Gaps = 8/226 (3%)

Query: 45  LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
           LNC + N +QTCP   P   +  E       PP+ TCP+YFRWI+EDL PW  TGITR M
Sbjct: 88  LNCREGNVSQTCPVTGPVAFEPSE-------PPSETCPEYFRWIYEDLRPWMDTGITRAM 140

Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
           +E+A   A  R+++V+ KVY+ KYK+    RD FTIWGILQLLR YP +LPD +LMF+C 
Sbjct: 141 VEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEFTIWGILQLLRMYPEKLPDFDLMFECR 200

Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
           DRP+I++  Y GP+   PPPLF Y GD  T DIVFPDWSFWGW E NIKPW    ++LKE
Sbjct: 201 DRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGWPETNIKPWNGFKKDLKE 260

Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYV 269
           GN    WIDREPYAYWKGN  +   R++L  C  +D+ DWNARLY+
Sbjct: 261 GNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNARLYI 306


>gi|356561570|ref|XP_003549054.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
           2-like [Glycine max]
          Length = 426

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 133/233 (57%), Positives = 160/233 (68%), Gaps = 17/233 (7%)

Query: 39  KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
           ++   PLNC   N T TCPTN     ++         P T+TCPDYFRWIHEDL PW  T
Sbjct: 23  RKPIAPLNCTAYNLTGTCPTNLQDHQRS---------PATATCPDYFRWIHEDLRPWART 73

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
                  E A  +A+ R I++  + Y+  Y +  QTRDVF+IWGILQLLR+YPG++PDLE
Sbjct: 74  -------EHAPTSANLRFIILKGRAYLETYSRPYQTRDVFSIWGILQLLRRYPGKIPDLE 126

Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
           LMFDC D PV+ S  Y+GPN + PPPLFRY G+  T+D+VF DWSFWGWAEINIKPW  L
Sbjct: 127 LMFDCVDWPVVXSDRYNGPNVEQPPPLFRYCGNDATLDVVFLDWSFWGWAEINIKPWHIL 186

Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           L ELKEG     W++REPYAYWKGNP VAETR  L+ CN+S+  DWNARL  Q
Sbjct: 187 LGELKEGTTRIPWLNREPYAYWKGNPAVAETRXYLMKCNVSENQDWNARLLAQ 239


>gi|357136753|ref|XP_003569968.1| PREDICTED: uncharacterized protein LOC100831246 [Brachypodium
           distachyon]
          Length = 543

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 121/194 (62%), Positives = 150/194 (77%), Gaps = 1/194 (0%)

Query: 78  STCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
           S+CP YFR+IHEDL PW+  G ITR ML+RA  TA+FRL+++  + YI     + QTRD+
Sbjct: 138 SSCPAYFRFIHEDLHPWRAAGGITRAMLDRARATANFRLVVLRGRAYIELIAPAFQTRDL 197

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           FTIWGILQLLR+YPGR+PDL+LMFDC D PV+R+  Y G N    PPLFRY GD  T+D+
Sbjct: 198 FTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVRADQYEGENATAMPPLFRYCGDNETLDV 257

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGWAEINIKPW++L ++L  GN    W+DREPYAYWKGNP VA  R++L+ CN
Sbjct: 258 VFPDWSFWGWAEINIKPWDALRKDLDAGNRRVRWVDREPYAYWKGNPDVAAIRQELVKCN 317

Query: 257 LSDKHDWNARLYVQ 270
           +S K +WNAR+Y Q
Sbjct: 318 VSSKQEWNARIYKQ 331


>gi|413923257|gb|AFW63189.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
          Length = 578

 Score =  270 bits (691), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 121/195 (62%), Positives = 151/195 (77%), Gaps = 1/195 (0%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           +STCP YFR+IHEDL PW+  G ITR M++RA  TA+FRL+++  + YI +   + QTRD
Sbjct: 145 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 204

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ +  Y G N    PPLFRY GD  T+D
Sbjct: 205 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQGENATLLPPLFRYCGDNETLD 264

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           +VFPDWSFWGW EINIKPW++L +EL  GN    W+ REPYAYWKGNP VA TR++L+ C
Sbjct: 265 VVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREPYAYWKGNPDVAVTRQELVKC 324

Query: 256 NLSDKHDWNARLYVQ 270
           N+S KH+WNAR+Y Q
Sbjct: 325 NVSSKHEWNARIYKQ 339


>gi|297745254|emb|CBI40334.3| unnamed protein product [Vitis vinifera]
          Length = 819

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 121/192 (63%), Positives = 147/192 (76%), Gaps = 1/192 (0%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
           T TCP+YFRWI+EDL PW  TGITRDM+ERA   AH R+++V+ KVY  KYK   QTRDV
Sbjct: 26  TGTCPEYFRWIYEDLKPWTETGITRDMVERAKTPAHIRVVVVDGKVYTEKYKWVFQTRDV 85

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           FTIWGILQ+LR YPG+LPD +LMF+C D+PVI+  DY G N    PPLF Y GD  T+DI
Sbjct: 86  FTIWGILQVLRMYPGKLPDFDLMFECGDKPVIKKHDYQGLNATA-PPLFHYCGDDETLDI 144

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGW EI IKPW +L ++L+EGNN   W+DREPYAYWKGN  +  TR +L  C+
Sbjct: 145 VFPDWSFWGWPEIRIKPWSTLRKDLREGNNKTKWVDREPYAYWKGNFKMGVTRHELSKCS 204

Query: 257 LSDKHDWNARLY 268
            +++ DWNAR+Y
Sbjct: 205 KTNEQDWNARIY 216



 Score =  237 bits (604), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 107/183 (58%), Positives = 131/183 (71%)

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
           + EDL PW  TGITR M+E+A   A  R+++V+ KVY+ KYK+    RD FTIWGILQLL
Sbjct: 418 LEEDLRPWMDTGITRAMVEKARPAASIRIVVVDGKVYMEKYKRVNHNRDEFTIWGILQLL 477

Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
           R YP +LPD +LMF+C DRP+I++  Y GP+   PPPLF Y GD  T DIVFPDWSFWGW
Sbjct: 478 RMYPEKLPDFDLMFECRDRPMIKTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGW 537

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
            E NIKPW    ++LKEGN    WIDREPYAYWKGN  +   R++L  C  +D+ DWNAR
Sbjct: 538 PETNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNAR 597

Query: 267 LYV 269
           LY+
Sbjct: 598 LYI 600


>gi|219363409|ref|NP_001136898.1| uncharacterized protein LOC100217055 [Zea mays]
 gi|194697518|gb|ACF82843.1| unknown [Zea mays]
 gi|413923256|gb|AFW63188.1| hypothetical protein ZEAMMB73_601289 [Zea mays]
          Length = 551

 Score =  270 bits (690), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 121/195 (62%), Positives = 151/195 (77%), Gaps = 1/195 (0%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           +STCP YFR+IHEDL PW+  G ITR M++RA  TA+FRL+++  + YI +   + QTRD
Sbjct: 145 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 204

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ +  Y G N    PPLFRY GD  T+D
Sbjct: 205 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYQGENATLLPPLFRYCGDNETLD 264

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           +VFPDWSFWGW EINIKPW++L +EL  GN    W+ REPYAYWKGNP VA TR++L+ C
Sbjct: 265 VVFPDWSFWGWPEINIKPWDALQKELNGGNKRVKWLAREPYAYWKGNPDVAVTRQELVKC 324

Query: 256 NLSDKHDWNARLYVQ 270
           N+S KH+WNAR+Y Q
Sbjct: 325 NVSSKHEWNARIYKQ 339


>gi|414586114|tpg|DAA36685.1| TPA: hypothetical protein ZEAMMB73_061596 [Zea mays]
          Length = 433

 Score =  270 bits (689), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 120/194 (61%), Positives = 152/194 (78%), Gaps = 1/194 (0%)

Query: 78  STCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
           ++CP YFR+IHEDL PW+  G ITR ML+RA  TA FRL+++    Y+ +++ + QTRD+
Sbjct: 143 TSCPSYFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGNAYVQRFRPAFQTRDL 202

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           FTIWG+LQLLR++PGR+PDL+LMFD  D PV+RS  Y G N +  PPLFRY GD  T+DI
Sbjct: 203 FTIWGVLQLLRRHPGRVPDLDLMFDTVDWPVVRSHLYRGKNAEMLPPLFRYCGDDKTLDI 262

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGW EINIKPW++L  +LK GNN   W+DREPYAYWKGNP V+ TR++L+ CN
Sbjct: 263 VFPDWSFWGWPEINIKPWDALKEDLKAGNNRVRWMDREPYAYWKGNPSVSGTRKELVKCN 322

Query: 257 LSDKHDWNARLYVQ 270
           +S  HDWNAR+Y Q
Sbjct: 323 VSSTHDWNARIYAQ 336


>gi|357168046|ref|XP_003581456.1| PREDICTED: protein O-glucosyltransferase 1-like [Brachypodium
           distachyon]
          Length = 543

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 119/192 (61%), Positives = 150/192 (78%), Gaps = 1/192 (0%)

Query: 80  CPDYFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
           CP YFR+IHEDL PW+   G+TR ML RA  TA FRL+++  +V++ +++ + QTRD+FT
Sbjct: 145 CPSYFRFIHEDLRPWREAGGVTRAMLARARVTASFRLVVLRGRVFVQRFRPAFQTRDLFT 204

Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
           IWGILQL+R+YPGR+PDL+LMFDC D PV+R+  Y G +    PPLFRY GD  T+DIVF
Sbjct: 205 IWGILQLIRRYPGRVPDLDLMFDCVDWPVVRTHLYRGKHAPFMPPLFRYCGDDRTLDIVF 264

Query: 199 PDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLS 258
           PDWSFWGW EINIKPW +L +ELK+GNN   W+DREPYAYWKGN  VA +RR+L+ CN+S
Sbjct: 265 PDWSFWGWPEINIKPWGALQKELKDGNNKVRWLDREPYAYWKGNAAVAVSRRELVQCNVS 324

Query: 259 DKHDWNARLYVQ 270
              DWNAR+Y Q
Sbjct: 325 STQDWNARIYTQ 336


>gi|359489778|ref|XP_002271353.2| PREDICTED: protein O-glucosyltransferase 1 [Vitis vinifera]
          Length = 455

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 131/228 (57%), Positives = 157/228 (68%), Gaps = 15/228 (6%)

Query: 42  TTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR 101
           TT  NC K  T TC T        Q  +S       + CPDYFRWI+EDL PW  TGIT+
Sbjct: 28  TTEENCTKENT-TCSTK-------QHPLSG------NACPDYFRWIYEDLRPWAETGITQ 73

Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
           +M+E A   A FRL++++ KVY+ KYK + QTRDVFTIWGILQLL+ YPG++PDLELMF+
Sbjct: 74  EMMEAAKDPAFFRLVILDGKVYMEKYKGAFQTRDVFTIWGILQLLKLYPGKVPDLELMFE 133

Query: 162 CDDRPVIRSRDYSG-PNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
           C DRP I++ DY G    K  PPLF Y     T+DIVFPDWSFWGW EINIKPW SL +E
Sbjct: 134 CGDRPRIKATDYGGRKGKKKVPPLFHYCASDDTLDIVFPDWSFWGWPEINIKPWNSLRKE 193

Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
           L+EGNN   W+DREPYAYWKGN   +  R+ L  C  S+ HDWNAR+Y
Sbjct: 194 LEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCRPSNNHDWNARVY 241


>gi|79454960|ref|NP_191687.2| uncharacterized protein [Arabidopsis thaliana]
 gi|55978809|gb|AAV68866.1| hypothetical protein AT3G61280 [Arabidopsis thaliana]
 gi|61742723|gb|AAX55182.1| hypothetical protein At3g61280 [Arabidopsis thaliana]
 gi|110739424|dbj|BAF01622.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646661|gb|AEE80182.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 536

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 134/239 (56%), Positives = 159/239 (66%), Gaps = 12/239 (5%)

Query: 37  PPKRITTPLNCVK---NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
           P   I  PLNC     N TQTCP+NYP   +        +I  + TCPDYF+WIH DL  
Sbjct: 92  PKIAIKIPLNCTSLNSNTTQTCPSNYPTKFEP-------AISSSETCPDYFKWIHRDLKV 144

Query: 94  WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
           W+ TGITR+ LERA   AHFR+++ + ++Y+H+Y+++ QTRDVFTIWGILQLLR YPG++
Sbjct: 145 WQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQI 204

Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
           PDLEL+F C DRP I  RD          PPPLF Y G R   DIVFPDWSFWGW E+NI
Sbjct: 205 PDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNI 264

Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           K W  L   LKEGN    W DR PYAYWKGNP V+  R DL+ CN SDK+D   RLYVQ
Sbjct: 265 KEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQ 323


>gi|186511278|ref|NP_001118874.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332646662|gb|AEE80183.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 378

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 134/239 (56%), Positives = 159/239 (66%), Gaps = 12/239 (5%)

Query: 37  PPKRITTPLNCVK---NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSP 93
           P   I  PLNC     N TQTCP+NYP   +        +I  + TCPDYF+WIH DL  
Sbjct: 92  PKIAIKIPLNCTSLNSNTTQTCPSNYPTKFEP-------AISSSETCPDYFKWIHRDLKV 144

Query: 94  WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
           W+ TGITR+ LERA   AHFR+++ + ++Y+H+Y+++ QTRDVFTIWGILQLLR YPG++
Sbjct: 145 WQKTGITRETLERARPNAHFRIVIKSGRLYVHQYEKAFQTRDVFTIWGILQLLRMYPGQI 204

Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
           PDLEL+F C DRP I  RD          PPPLF Y G R   DIVFPDWSFWGW E+NI
Sbjct: 205 PDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGWPELNI 264

Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           K W  L   LKEGN    W DR PYAYWKGNP V+  R DL+ CN SDK+D   RLYVQ
Sbjct: 265 KEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVRLYVQ 323


>gi|326509457|dbj|BAJ91645.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 510

 Score =  267 bits (682), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 122/198 (61%), Positives = 150/198 (75%), Gaps = 3/198 (1%)

Query: 75  PPT-STCPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRLILVNNKVYIHKYKQSIQ 132
           PPT +TCP YFRWIHEDL PW+ TGITR+ LE A +  A FR+ +++ ++Y+ +Y +  Q
Sbjct: 110 PPTPTTCPVYFRWIHEDLRPWRATGITREALEGARRYGAKFRVTVLSGRLYVARYGRCFQ 169

Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
           TRDVFT WGILQLLR+Y GR+PDL+LMFDC D PV+ + D+ G +   PPPLFRY G   
Sbjct: 170 TRDVFTQWGILQLLRRYAGRVPDLDLMFDCQDLPVVNAGDH-GSHTPPPPPLFRYCGSEP 228

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
           T+DI FPDWSFWGW E+NIKPWE+L RE+KE N   +W  R PYAYWKGNP VA  RR+L
Sbjct: 229 TLDIAFPDWSFWGWPELNIKPWEALRREIKEANAALDWTRRAPYAYWKGNPAVAAARREL 288

Query: 253 LTCNLSDKHDWNARLYVQ 270
           L CN+S K DWNAR+Y Q
Sbjct: 289 LKCNVSGKRDWNARIYAQ 306


>gi|242066560|ref|XP_002454569.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
 gi|241934400|gb|EES07545.1| hypothetical protein SORBIDRAFT_04g033620 [Sorghum bicolor]
          Length = 552

 Score =  267 bits (682), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 118/195 (60%), Positives = 151/195 (77%), Gaps = 1/195 (0%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           +STCP YFR+IHEDL PW+  G ITR M++RA  TA+FRL+++  + YI +   + QTRD
Sbjct: 146 SSTCPAYFRFIHEDLHPWRAAGGITRAMVDRARATANFRLVVIRGRAYIERIAPAFQTRD 205

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +FTIWGILQLLR+YPGR+PDL+LMFDC D PV+ +  Y G N    PPLFRY G+  T+D
Sbjct: 206 LFTIWGILQLLRRYPGRVPDLDLMFDCVDWPVVHADQYEGENATVLPPLFRYCGNNETLD 265

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           +VFPDWSFWGW EINIKPW++L +EL  GN    W++REPYAYWKGNP VA  R++L+ C
Sbjct: 266 VVFPDWSFWGWPEINIKPWDALQKELNRGNKRVKWLNREPYAYWKGNPDVAVIRQELVKC 325

Query: 256 NLSDKHDWNARLYVQ 270
           N+S +H+WNAR+Y Q
Sbjct: 326 NVSSEHEWNARIYKQ 340


>gi|147810897|emb|CAN69493.1| hypothetical protein VITISV_037876 [Vitis vinifera]
          Length = 378

 Score =  265 bits (676), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 116/168 (69%), Positives = 141/168 (83%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           M+ERA +TA F+L+++N + Y+ KY+++ QTRDVFT+WGILQLLR+YPG++PDLELMFDC
Sbjct: 1   MVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D PVI+S++Y GPN   PPPLFRY GD  T+DIVFPDWSFWGW EI IKPWESLL++LK
Sbjct: 61  VDWPVIKSKEYHGPNTTAPPPLFRYCGDDATLDIVFPDWSFWGWPEIXIKPWESLLKDLK 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EGN    W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+Y Q
Sbjct: 121 EGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARVYTQ 168


>gi|147798876|emb|CAN65870.1| hypothetical protein VITISV_012498 [Vitis vinifera]
          Length = 504

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 120/192 (62%), Positives = 145/192 (75%), Gaps = 1/192 (0%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + CPDYFRWI+EDL PW  TGIT++M+E A   A FRL++++ KVY+ KYK + QTRDVF
Sbjct: 99  NACPDYFRWIYEDLRPWAETGITQEMMEAAKDPAFFRLVILDGKVYMEKYKGAFQTRDVF 158

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG-PNNKGPPPLFRYSGDRWTMDI 196
           TIWGILQLL+ YPG++PDLELMF+C DRP I++ DY G    K  PPLF Y     T+DI
Sbjct: 159 TIWGILQLLKLYPGKVPDLELMFECGDRPRIKASDYGGRKGKKKVPPLFHYCASDDTLDI 218

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGW EINIKPW SL +EL+EGNN   W+DREPYAYWKGN   +  R+ L  C 
Sbjct: 219 VFPDWSFWGWPEINIKPWNSLRKELEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCR 278

Query: 257 LSDKHDWNARLY 268
            S+ HDWNAR+Y
Sbjct: 279 PSNNHDWNARVY 290


>gi|242073834|ref|XP_002446853.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
 gi|241938036|gb|EES11181.1| hypothetical protein SORBIDRAFT_06g023680 [Sorghum bicolor]
          Length = 555

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 117/189 (61%), Positives = 149/189 (78%), Gaps = 1/189 (0%)

Query: 83  YFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
           YFR+IHEDL PW+  G ITR ML+RA  TA FRL+++  + Y+H+ + + QTRD+FTIWG
Sbjct: 160 YFRFIHEDLRPWRAAGGITRAMLDRARLTATFRLVVLGGRAYVHRLRPAFQTRDLFTIWG 219

Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
           +LQLLR+YPGR+PDL+LMFD  D P++R+  Y G   +  PPLFRY GD  T+DIVFPDW
Sbjct: 220 VLQLLRRYPGRVPDLDLMFDTVDWPIVRAHLYRGKYAEMLPPLFRYCGDDKTLDIVFPDW 279

Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
           SFWGW EINIKPW++L  +LK+GNN   W+DREPYAYWKGNP V+ TR++L+ CN+S  H
Sbjct: 280 SFWGWPEINIKPWDALQEDLKDGNNRVRWMDREPYAYWKGNPSVSATRKELVKCNVSSTH 339

Query: 262 DWNARLYVQ 270
           DWNAR+Y Q
Sbjct: 340 DWNARIYAQ 348


>gi|224064019|ref|XP_002301351.1| predicted protein [Populus trichocarpa]
 gi|222843077|gb|EEE80624.1| predicted protein [Populus trichocarpa]
          Length = 413

 Score =  263 bits (673), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 119/197 (60%), Positives = 145/197 (73%), Gaps = 1/197 (0%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
            CP +FRWIHEDL PWK TGI+RDMLERA + AHFRL++V  K Y+ +Y +  QTRDVFT
Sbjct: 2   ACPGFFRWIHEDLRPWKDTGISRDMLERARKHAHFRLVIVEGKAYVEQYSKPYQTRDVFT 61

Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
           IWGILQLL  YPG++PDLELMF C D+ VI+  D  G +   P  LF+Y G    ++IVF
Sbjct: 62  IWGILQLLNLYPGKIPDLELMFRCGDKTVIQKHDIQGSDAMSPAVLFQYCGHSTALNIVF 121

Query: 199 PDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLS 258
           PDW+FWGWAE NIKPW+ +L  + EGN    W DREPYAYW+GNP V+  R DL+ CN+S
Sbjct: 122 PDWTFWGWAETNIKPWKIVLEGMVEGNKKIKWQDREPYAYWRGNPHVSPNREDLMKCNVS 181

Query: 259 DKHDWNARLYVQVISFI 275
           DK+DW ARLY Q  SF+
Sbjct: 182 DKYDWLARLYEQA-SFL 197


>gi|297745248|emb|CBI40328.3| unnamed protein product [Vitis vinifera]
          Length = 838

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 116/168 (69%), Positives = 141/168 (83%), Gaps = 1/168 (0%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           M++RA +TAHFRL++++ K Y+ K+++SIQTRD+FT+WGILQLLR YPGRLPDLELMFDC
Sbjct: 1   MVDRAQRTAHFRLVIIDGKAYVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
           DDRPV+R RD+ GPN   PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++K
Sbjct: 61  DDRPVVRMRDFRGPN-AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIK 119

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EGN    W DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 120 EGNRRTKWKDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 167



 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 78/156 (50%), Gaps = 25/156 (16%)

Query: 11  LQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNC-VKNQTQTCPTNYPKTSQTQESI 69
           + ++FS  N + N      ++KP+  P   +    NC + +  Q CP N+P TS T +  
Sbjct: 369 MMTSFSYQNSAPN------TKKPRIFPKILV----NCTIGSMAQACPANHPTTSVTGKL- 417

Query: 70  SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
                     CP+YFRWIHEDL PWK TGI+R  +E A   A FRL++VN K Y+ +Y+ 
Sbjct: 418 ------SVEACPEYFRWIHEDLRPWKSTGISRFAVESAEGDADFRLVIVNGKAYVEQYRN 471

Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR 165
                  +T+ G     ++  G   +  L + C  R
Sbjct: 472 -------YTLTGDQNWEKESNGGFKNSNLAYKCTHR 500


>gi|116310458|emb|CAH67462.1| OSIGBa0159I10.7 [Oryza sativa Indica Group]
 gi|218195275|gb|EEC77702.1| hypothetical protein OsI_16775 [Oryza sativa Indica Group]
          Length = 537

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 116/189 (61%), Positives = 148/189 (78%), Gaps = 1/189 (0%)

Query: 83  YFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
           YFR+IHEDL PW+   GITR ML RA  TA FRL+++  + ++H+++ + QTRD+FTIWG
Sbjct: 142 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWG 201

Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
           +LQLLR+YPGR+PDL+LMFDC D PV+R+  Y G +    PPLF Y GD  T+DIVFPDW
Sbjct: 202 VLQLLRRYPGRVPDLDLMFDCADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDW 261

Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
           SFWGW EINIKPW++L ++LK+GNN   W+DR PYAYWKGNP VA TR++L+ CN+S   
Sbjct: 262 SFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTK 321

Query: 262 DWNARLYVQ 270
           DWNAR+Y Q
Sbjct: 322 DWNARIYKQ 330


>gi|255541544|ref|XP_002511836.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
 gi|223549016|gb|EEF50505.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
           communis]
          Length = 549

 Score =  258 bits (659), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 120/238 (50%), Positives = 152/238 (63%), Gaps = 8/238 (3%)

Query: 35  NRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPW 94
           N       +P+ C    T             +  +SD      + CPDYFRWIHEDL PW
Sbjct: 74  NESSAHNASPVKCADQSTCKVNDTVRAFDGPEYQLSD------AACPDYFRWIHEDLRPW 127

Query: 95  KVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLP 154
             TGI+RD +E A + A FRL++V+ K Y+ +Y  S QTRD+FTIWGI+QLLR YPGR+P
Sbjct: 128 ASTGISRDTVESAKRFATFRLVIVDGKAYVERYYHSFQTRDLFTIWGIVQLLRLYPGRVP 187

Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
           D+ELMF C D P I+  DY GP    PPP LF+YSG+     + FPDWSFWGW E++IKP
Sbjct: 188 DVELMFQCGDLPEIQKGDYQGPGATLPPPALFQYSGNETAFAVTFPDWSFWGWVEVDIKP 247

Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD-KHDWNARLYVQ 270
           W+S+L  + +G+  +NW DR PYAYW+GN  V+  R+DLL C  S   HDWNARLY Q
Sbjct: 248 WKSMLEGITKGSQRKNWTDRVPYAYWRGNSHVSRDRKDLLKCKSSIFSHDWNARLYSQ 305


>gi|449446169|ref|XP_004140844.1| PREDICTED: O-glucosyltransferase rumi homolog [Cucumis sativus]
          Length = 454

 Score =  254 bits (649), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 126/253 (49%), Positives = 165/253 (65%), Gaps = 23/253 (9%)

Query: 30  SRKPQNRPPKRIT-TPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---------ST 79
            R+ QN P K +  +P+NC         T Y ++ +    I   +I            +T
Sbjct: 6   ERELQNYPQKEVEFSPINC---------TAYSRSEKWDSGIGPTTIEEEEEDGDGKNENT 56

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP+YFRWIHEDL PW  TGITR+M+ER  + A FRL++V  + Y+ KY +  Q RDVFT+
Sbjct: 57  CPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTL 116

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS--GPNNKGPPPLFRYSGDRWTMDIV 197
           WGILQLLR YP ++PDL+LMF C+D+P +   +YS  GPN+  PPPLFRY GD  T DIV
Sbjct: 117 WGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIV 176

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           FPDWSFWGW EIN+KPWE+ ++ELKE N  + WIDRE YA+WKGN F++  R  LL C+ 
Sbjct: 177 FPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSR 236

Query: 258 SDKHDWNARLYVQ 270
           S +     R+Y+Q
Sbjct: 237 STQS--KLRVYMQ 247


>gi|357507237|ref|XP_003623907.1| KDEL motif-containing protein [Medicago truncatula]
 gi|355498922|gb|AES80125.1| KDEL motif-containing protein [Medicago truncatula]
          Length = 454

 Score =  253 bits (646), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 115/225 (51%), Positives = 151/225 (67%), Gaps = 4/225 (1%)

Query: 45  LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
           L C   N TQTCP ++       +     S   TSTCP++FRWIHEDL PWK TGITR+M
Sbjct: 46  LKCFNGNPTQTCPNDHSPIQAFHQDQDPSS--STSTCPEHFRWIHEDLEPWKSTGITREM 103

Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
           ++     +  R+++   KVY+  Y  S QTR  FT+WGI+QLLR YPGR+PDLEL+F+  
Sbjct: 104 VDSGENISQLRIVIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETG 163

Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
           DR V+  + Y GP    PPP+F Y G    +DIVFPDWS+WGWAE  IKPWE +L++++E
Sbjct: 164 DRAVVDKKRYEGPEAVTPPPIFSYCGKNEALDIVFPDWSYWGWAETRIKPWEKVLKDIQE 223

Query: 224 GNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
            N    W DR PYA+WKGN F++  R +L  CN++D+HDWNAR+Y
Sbjct: 224 SNKKIKWKDRIPYAFWKGN-FLSNPRHELRKCNVTDQHDWNARVY 267


>gi|42569949|ref|NP_182108.2| uncharacterized protein [Arabidopsis thaliana]
 gi|330255514|gb|AEC10608.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 523

 Score =  253 bits (646), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 134/245 (54%), Positives = 158/245 (64%), Gaps = 23/245 (9%)

Query: 37  PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
           P +++TTPLN    C  +Q   TQTCP + P+ SQ  +   +       TCPDYFRWIH+
Sbjct: 76  PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128

Query: 90  DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
           DL  W+ TGITR+ LERA+  AHFRLI+   +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188

Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
           PG++PDLEL+F C D P I  RDY    G N    PPPLF Y G     DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248

Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
           W EINIK W      + EG     W +REPYAYWKGNP VA  RRDL+ C     HD   
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPMV 303

Query: 266 RLYVQ 270
            LY Q
Sbjct: 304 HLYRQ 308


>gi|110738999|dbj|BAF01419.1| hypothetical protein [Arabidopsis thaliana]
          Length = 523

 Score =  253 bits (646), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 134/245 (54%), Positives = 158/245 (64%), Gaps = 23/245 (9%)

Query: 37  PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
           P +++TTPLN    C  +Q   TQTCP + P+ SQ  +   +       TCPDYFRWIH+
Sbjct: 76  PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128

Query: 90  DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
           DL  W+ TGITR+ LERA+  AHFRLI+   +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188

Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
           PG++PDLEL+F C D P I  RDY    G N    PPPLF Y G     DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248

Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
           W EINIK W      + EG     W +REPYAYWKGNP VA  RRDL+ C     HD   
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPMV 303

Query: 266 RLYVQ 270
            LY Q
Sbjct: 304 HLYRQ 308


>gi|357168044|ref|XP_003581455.1| PREDICTED: O-glucosyltransferase rumi homolog [Brachypodium
           distachyon]
          Length = 520

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 121/244 (49%), Positives = 156/244 (63%), Gaps = 17/244 (6%)

Query: 41  ITTPLNCVKNQTQTCPTNYPK--------TSQTQESISDYSIPPTSTCPDYFRWIHEDLS 92
           +TTP    ++     P+  P            ++ +    S   T +CP YFRWIHEDL 
Sbjct: 75  LTTPFTSSQDAANRTPSELPAGLTQQPSPAPASEAATESSSATTTGSCPAYFRWIHEDLR 134

Query: 93  PWKVTGITRDMLERANQ-TAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY-- 149
           PW+ TGITR+ LE A++    FR+ +V  ++Y+ +Y +  QTRDVFT WGILQLLR+Y  
Sbjct: 135 PWRATGITRETLEGAHRYMPKFRVTVVAGRLYVRRYGRCFQTRDVFTQWGILQLLRRYNT 194

Query: 150 PGR---LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
            GR   +PDL+LMFDC D PV+ + ++ G +   PPPLFRY G   T+DI FPDWSFWGW
Sbjct: 195 TGRRAVVPDLDLMFDCQDLPVVDAGNHRGCH---PPPLFRYCGSEPTLDIAFPDWSFWGW 251

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
            E+NIKPW +L RE+ EGN   +W  R PYAYWKGNP V   R +LL CN+S K DWNAR
Sbjct: 252 PELNIKPWGTLRREIDEGNAAVDWTRRAPYAYWKGNPMVGTARLELLKCNVSRKRDWNAR 311

Query: 267 LYVQ 270
           +Y Q
Sbjct: 312 IYAQ 315


>gi|242073832|ref|XP_002446852.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
 gi|241938035|gb|EES11180.1| hypothetical protein SORBIDRAFT_06g023670 [Sorghum bicolor]
          Length = 505

 Score =  250 bits (638), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 113/193 (58%), Positives = 137/193 (70%), Gaps = 6/193 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQ--TAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           CP YFRWIHEDL PW+  G+TR  LE A +     FR+ +V  ++Y+ +Y +  QTR  F
Sbjct: 114 CPSYFRWIHEDLRPWRAAGVTRGALEGARRLFAPKFRVTVVAGRLYVARYGRCFQTRAAF 173

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           T WGILQLLR+YPGR+PDL+LMFDCDD PV+ +    G  ++ PPPLFRY G   T+DI 
Sbjct: 174 TQWGILQLLRRYPGRVPDLDLMFDCDDLPVVGA----GDRHQAPPPLFRYCGSETTLDIA 229

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           FPDWSFWGW E+NIKPWE+L RE+ E N   NW+DR PYAYWKGNP V   R  LL CN 
Sbjct: 230 FPDWSFWGWPELNIKPWEALRREINEENAMVNWMDRAPYAYWKGNPKVGAERLLLLRCNA 289

Query: 258 SDKHDWNARLYVQ 270
           S + DWNAR+Y Q
Sbjct: 290 SGERDWNARVYAQ 302


>gi|15233074|ref|NP_191688.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6850896|emb|CAB71059.1| putative protein [Arabidopsis thaliana]
 gi|332646663|gb|AEE80184.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 455

 Score =  248 bits (632), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 123/230 (53%), Positives = 151/230 (65%), Gaps = 12/230 (5%)

Query: 44  PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
           PL C  N TQTCP+ YP  S+ +  IS        TCPDYFRWI +DL  W+ TGITR+ 
Sbjct: 17  PLKC--NATQTCPSTYP--SRLEPMISS-----LETCPDYFRWIQQDLKVWEETGITRET 67

Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
           LERA   AHFRL++ + ++Y+H+Y ++ ++RDV TIWGILQLLR YPG++PDLEL+F C 
Sbjct: 68  LERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVLTIWGILQLLRMYPGQVPDLELLFFCH 127

Query: 164 DRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRE 220
           D P I  RD+  P       PPPLF+Y G R    IVFPDWSFWGW E+NIK W  L   
Sbjct: 128 DIPAIWKRDFRQPEPNATWPPPPLFQYCGHREAYGIVFPDWSFWGWPEVNIKEWTKLSVA 187

Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           ++E N    W DR PYAYWKGN  V   R +L+ CN SDK+D   RLY Q
Sbjct: 188 IREANKRVKWNDRVPYAYWKGNSGVHRERGNLMKCNFSDKYDPMVRLYEQ 237


>gi|357507235|ref|XP_003623906.1| O-glucosyltransferase rumi [Medicago truncatula]
 gi|355498921|gb|AES80124.1| O-glucosyltransferase rumi [Medicago truncatula]
          Length = 442

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 117/256 (45%), Positives = 159/256 (62%), Gaps = 11/256 (4%)

Query: 14  NFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDY 72
           +F   + S +K  II  ++ Q          LNC   + T+TC  ++       +     
Sbjct: 16  SFKTKSTSPSKITIIKRQQTQ--------FTLNCFNGSMTRTCQKDHHPIQAFHQDQDPS 67

Query: 73  SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
           S   TSTCP++FRWIHEDL PWK TGITR+ +E     +  R+++   K Y+  Y  S Q
Sbjct: 68  S--STSTCPEHFRWIHEDLEPWKSTGITRETVESGKNISQLRIVIKKGKAYVETYADSFQ 125

Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
           TRD+FT+WGI+QLLR YPGR+PDLEL+F+  DR V+  + + G  +   PP+F Y G   
Sbjct: 126 TRDLFTVWGIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQGSQSVTLPPIFSYCGQND 185

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
            +DIVFPDWSFWGWAE  IKPWE +L++++E N    W DR PYA+WKGN  V+  R  L
Sbjct: 186 ALDIVFPDWSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIPYAFWKGNTHVSSQRYKL 245

Query: 253 LTCNLSDKHDWNARLY 268
             CN++D+HDWNAR+Y
Sbjct: 246 RQCNVTDQHDWNARIY 261


>gi|42571241|ref|NP_973694.1| downstream target of AGL15 2 [Arabidopsis thaliana]
 gi|44917445|gb|AAS49047.1| At2g45830 [Arabidopsis thaliana]
 gi|46931276|gb|AAT06442.1| At2g45830 [Arabidopsis thaliana]
 gi|330255512|gb|AEC10606.1| downstream target of AGL15 2 [Arabidopsis thaliana]
          Length = 382

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 136/168 (80%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           MLE+A +TAHFR+++++ +VY+ KY++SIQTRDVFT+WGI+QLLR YPGRLPDLELMFD 
Sbjct: 1   MLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVFTLWGIVQLLRWYPGRLPDLELMFDP 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
           DDRP +RS+D+ G  +  PPPLFRY  D  ++DIVFPDWSFWGWAE+NIKPW+  L  ++
Sbjct: 61  DDRPTVRSKDFQGQQHPAPPPLFRYCSDDASLDIVFPDWSFWGWAEVNIKPWDKSLVAIE 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EGN    W DR  YAYW+GNP VA TRRDLL CN+S + DWN RLY+Q
Sbjct: 121 EGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRCNVSAQEDWNTRLYIQ 168


>gi|297824641|ref|XP_002880203.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326042|gb|EFH56462.1| hypothetical protein ARALYDRAFT_483723 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 522

 Score =  244 bits (622), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 136/278 (48%), Positives = 163/278 (58%), Gaps = 24/278 (8%)

Query: 4   LFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLN----CVKNQ---TQTCP 56
           L   S+     FS+       T I         P K+I TP+N    C  +Q   T  CP
Sbjct: 43  LLGASLSWMDMFSIGGGRIKVTPIFTRNTNATIPKKKIYTPVNFTLQCSLDQNIATHVCP 102

Query: 57  TNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLI 116
            + P  SQ  +   +       TCPDYFRWI++DL  W+ TGITR+ LERA+  AHFRLI
Sbjct: 103 ASNPDKSQPSKDDPE-------TCPDYFRWIYKDLEAWRGTGITRETLERASDKAHFRLI 155

Query: 117 LVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS-- 174
           +   +VY+ +YK+S QTRDVFTIWGI+QLLR YPG++PDLEL+F CDD P I  RDY   
Sbjct: 156 IKGGRVYVQQYKKSFQTRDVFTIWGIVQLLRMYPGQVPDLELLFMCDDLPEIWRRDYRPR 215

Query: 175 -GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWID 232
            G N    PPPLF Y G     DIVFPDWSFWGW EINIK W  +L  + EG     W +
Sbjct: 216 PGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWGWPEINIKEWNKMLEAISEGLKKVKWEE 275

Query: 233 REPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           REPYAYWKGN  V + RRDL+ C     HD    LY Q
Sbjct: 276 REPYAYWKGNARVGK-RRDLMNC-----HDPMVHLYSQ 307


>gi|87241560|gb|ABD33418.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
          Length = 397

 Score =  243 bits (620), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 104/192 (54%), Positives = 136/192 (70%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
           TSTCP++FRWIHEDL PWK TGITR+ +E     +  R+++   K Y+  Y  S QTRD+
Sbjct: 25  TSTCPEHFRWIHEDLEPWKSTGITRETVESGKNISQLRIVIKKGKAYVETYADSFQTRDL 84

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           FT+WGI+QLLR YPGR+PDLEL+F+  DR V+  + + G  +   PP+F Y G    +DI
Sbjct: 85  FTVWGIVQLLRLYPGRVPDLELLFETGDRTVLDKKRFQGSQSVTLPPIFSYCGQNDALDI 144

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGWAE  IKPWE +L++++E N    W DR PYA+WKGN  V+  R  L  CN
Sbjct: 145 VFPDWSFWGWAETGIKPWEKVLKDIQESNKKITWKDRIPYAFWKGNTHVSSQRYKLRQCN 204

Query: 257 LSDKHDWNARLY 268
           ++D+HDWNAR+Y
Sbjct: 205 VTDQHDWNARIY 216


>gi|115459624|ref|NP_001053412.1| Os04g0534000 [Oryza sativa Japonica Group]
 gi|38346998|emb|CAE04582.2| OSJNBb0039L24.21 [Oryza sativa Japonica Group]
 gi|113564983|dbj|BAF15326.1| Os04g0534000 [Oryza sativa Japonica Group]
 gi|116310457|emb|CAH67461.1| OSIGBa0159I10.6 [Oryza sativa Indica Group]
 gi|125549144|gb|EAY94966.1| hypothetical protein OsI_16774 [Oryza sativa Indica Group]
 gi|125591102|gb|EAZ31452.1| hypothetical protein OsJ_15588 [Oryza sativa Japonica Group]
 gi|215694886|dbj|BAG90077.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 508

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 108/190 (56%), Positives = 136/190 (71%), Gaps = 2/190 (1%)

Query: 83  YFRWIHEDLSPWKVTGITRDMLERANQ--TAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
           YFRWIHEDL PW+  GITR+ ++ A +   A FR+ +V  ++++ +Y +  QTRD+FT W
Sbjct: 113 YFRWIHEDLRPWRDAGITREAVDGAARRYGAKFRVTVVAGRLHVARYGRCFQTRDMFTQW 172

Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
           G+LQLLR+Y GR+PDL+LMFDC D PV+ + D  G  +  PPPLF Y G   T+DI FPD
Sbjct: 173 GVLQLLRRYQGRVPDLDLMFDCQDLPVVNAGDRRGRTSSSPPPLFGYCGSEPTLDIAFPD 232

Query: 201 WSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
           WSFWGW E+NIKPWE+L  E+ +GN   NW  R PYAYWKGNP V   RR+LL CN S K
Sbjct: 233 WSFWGWPELNIKPWETLRGEIADGNAAVNWTGRAPYAYWKGNPTVGADRRNLLRCNASGK 292

Query: 261 HDWNARLYVQ 270
            DWNAR+Y Q
Sbjct: 293 RDWNARIYEQ 302


>gi|3386610|gb|AAC28540.1| hypothetical protein [Arabidopsis thaliana]
          Length = 413

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 125/224 (55%), Positives = 148/224 (66%), Gaps = 18/224 (8%)

Query: 37  PPKRITTPLN----CVKNQ---TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHE 89
           P +++TTPLN    C  +Q   TQTCP + P+ SQ  +   +       TCPDYFRWIH+
Sbjct: 76  PKEKLTTPLNFTLQCSLDQNIATQTCPASNPEKSQPSKDEPE-------TCPDYFRWIHK 128

Query: 90  DLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
           DL  W+ TGITR+ LERA+  AHFRLI+   +VY+H+YK+S QTRDVFTIWGI+QLLR Y
Sbjct: 129 DLEAWRETGITRETLERASDKAHFRLIIKGGRVYVHQYKKSFQTRDVFTIWGIVQLLRMY 188

Query: 150 PGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFWG 205
           PG++PDLEL+F C D P I  RDY    G N    PPPLF Y G     DIVFPDWSFWG
Sbjct: 189 PGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFWG 248

Query: 206 WAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
           W EINIK W      + EG     W +REPYAYWKGNP VA  R
Sbjct: 249 WPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMAR 292


>gi|449523045|ref|XP_004168535.1| PREDICTED: LOW QUALITY PROTEIN: O-glucosyltransferase rumi homolog
           [Cucumis sativus]
          Length = 454

 Score =  239 bits (611), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 127/253 (50%), Positives = 166/253 (65%), Gaps = 23/253 (9%)

Query: 30  SRKPQNRPPKRIT-TPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPT---------ST 79
            R+ QN P K +  +P+NC         T Y ++ +    I   +I            +T
Sbjct: 6   ERELQNYPQKEVEFSPINC---------TAYSRSEKWDSGIGPTTIEEEEEDGDGKNENT 56

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP+YFRWIHEDL PW  TGITR+M+ER  + A FRL++V  +VY+ KY +  Q RDVFT+
Sbjct: 57  CPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRVYVEKYSEVFQRRDVFTL 116

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG--PNNKGPPPLFRYSGDRWTMDIV 197
           WGILQLLR YP ++PDL+LMF C+D+P +   +YSG  PN+  PPPLFRY GD  T DIV
Sbjct: 117 WGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTPPPPLFRYCGDDDTFDIV 176

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           FPDWSFWGW EIN+KPWE+ ++ELKE N  + WIDRE YA+WKGN F++  R  LL C+ 
Sbjct: 177 FPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSR 236

Query: 258 SDKHDWNARLYVQ 270
           S +     R+Y+Q
Sbjct: 237 STQS--KLRVYMQ 247


>gi|242094720|ref|XP_002437850.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
 gi|241916073|gb|EER89217.1| hypothetical protein SORBIDRAFT_10g003730 [Sorghum bicolor]
          Length = 552

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 104/196 (53%), Positives = 143/196 (72%), Gaps = 3/196 (1%)

Query: 76  PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           P  +CPDYFR+IH+DL PW+  GITR+ +ERA   A+FRL++V  + Y+  Y+++ QTRD
Sbjct: 157 PAPSCPDYFRYIHDDLRPWRGAGITREAVERARPHAYFRLVVVGGRAYVETYRRAYQTRD 216

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY-SGPNNKGPPPLFRYSGDRWTM 194
           VFT WG+LQLLR+YPGR+PDL++MF CDD   +R+ D+ + P++   PP+FRY  D  T+
Sbjct: 217 VFTQWGVLQLLRRYPGRVPDLDIMFACDDPGQVRAADFPTTPSDA--PPVFRYCKDALTL 274

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           DIVFPDWSFWGW E+ I+PW  LL E+++ N    W  R+PYA+WKGNP     R +L+ 
Sbjct: 275 DIVFPDWSFWGWPEVGIRPWPQLLEEVRQENERVRWPVRQPYAFWKGNPEGYRIRHELMR 334

Query: 255 CNLSDKHDWNARLYVQ 270
           CN S+  +WNARL+ Q
Sbjct: 335 CNASNGQEWNARLFTQ 350


>gi|226509990|ref|NP_001142048.1| uncharacterized protein LOC100274204 [Zea mays]
 gi|194706898|gb|ACF87533.1| unknown [Zea mays]
 gi|413919037|gb|AFW58969.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
          Length = 508

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 118/216 (54%), Positives = 147/216 (68%), Gaps = 7/216 (3%)

Query: 62  TSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLIL 117
           T+QT  + +  S PP      CP YFRWIHEDL PW+ TG+TR  +E A + A   R+ +
Sbjct: 90  TNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAV 149

Query: 118 VNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
           V  ++Y+ +Y  ++  QTR  FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +  G
Sbjct: 150 VAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHG 209

Query: 176 PNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
              + PPP LFRY G   T+DI FPDWSFWGW E+NIKPWE+L RE+ E N   NW DR 
Sbjct: 210 AQPQPPPPPLFRYCGSEATLDIAFPDWSFWGWPELNIKPWEALQREINEANAVVNWTDRA 269

Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           PYAYWKGNP V   R  LL CN S + DWNAR+Y Q
Sbjct: 270 PYAYWKGNPKVGAERLLLLRCNASGERDWNARVYAQ 305


>gi|326491107|dbj|BAK05653.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 373

 Score =  236 bits (602), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 104/168 (61%), Positives = 131/168 (77%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           ML+RA  TA+FRL+++  + YI +   + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC
Sbjct: 1   MLDRARATANFRLVVLRGRAYIERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D PV+ + +Y G N    PPLFRY GD  T+D+VFPDWSFWGWAEINIKPW++L ++L 
Sbjct: 61  VDWPVVHADEYQGNNATAMPPLFRYCGDNETLDVVFPDWSFWGWAEINIKPWDALQKDLS 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
            GN    WIDREPYAYWKGNP VA  R++L+ CN+S K +WNAR+Y Q
Sbjct: 121 VGNKRVRWIDREPYAYWKGNPEVAAIRQELVKCNVSSKQEWNARIYKQ 168


>gi|49388103|dbj|BAD25236.1| unknown protein [Oryza sativa Japonica Group]
 gi|215766221|dbj|BAG98449.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 375

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 103/168 (61%), Positives = 131/168 (77%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           MLERA  TA+FRL+++  + Y+ +   + QTRD+FTIWGILQLLR+YPGR+PDL+LMFDC
Sbjct: 1   MLERARDTANFRLVVLRGRAYVERIAPAFQTRDLFTIWGILQLLRRYPGRVPDLDLMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D PV+++  Y G N    PPLFRY GD  T+D+VFPDWSFWGW EINIKPW++L ++L 
Sbjct: 61  VDWPVVQADRYQGENATAMPPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLD 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
            GN    W+DREPYAYWKGNP VA  R++L+ CN+S KH+WNAR+Y Q
Sbjct: 121 IGNKRVKWVDREPYAYWKGNPDVATKRKELVKCNVSSKHEWNARIYKQ 168


>gi|449441480|ref|XP_004138510.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
 gi|449518613|ref|XP_004166331.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 380

 Score =  233 bits (595), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 107/168 (63%), Positives = 130/168 (77%), Gaps = 1/168 (0%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           M+ER  +TAHFR+++V  +VY+ KYK SIQTRDVFT+WGILQL R YP +LPDLELMFDC
Sbjct: 1   MVERGRRTAHFRVVIVEGRVYVEKYKGSIQTRDVFTMWGILQLARWYPKKLPDLELMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
           DDRPV+RS  +    + GPPPLFRY  D  ++DIVFPDWSFWGW EINIKPW+ +L ++K
Sbjct: 61  DDRPVVRSNGFMNAIS-GPPPLFRYCSDESSLDIVFPDWSFWGWGEINIKPWKMVLEDIK 119

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           EGN    W DR P AYWKGNP V  +RRDLL CNL+ + +W+  LYVQ
Sbjct: 120 EGNKRTRWKDRVPLAYWKGNPQVDPSRRDLLKCNLTQQQNWDTLLYVQ 167


>gi|147791845|emb|CAN70601.1| hypothetical protein VITISV_027961 [Vitis vinifera]
          Length = 362

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 105/152 (69%), Positives = 127/152 (83%), Gaps = 1/152 (0%)

Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
           N++ ++ K+++SIQTRD+FT+WGILQLLR YPGRLPDLELMFDCDDRPV+R RD+ GP+ 
Sbjct: 14  NHQGHVEKFRKSIQTRDMFTLWGILQLLRWYPGRLPDLELMFDCDDRPVVRMRDFRGPS- 72

Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAY 238
             PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++KEGN    W DR P AY
Sbjct: 73  AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKWKDRVPLAY 132

Query: 239 WKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           W+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 133 WRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 164


>gi|21742339|emb|CAD41531.1| OSJNBb0020O11.18 [Oryza sativa Japonica Group]
 gi|38346999|emb|CAE04583.2| OSJNBb0039L24.22 [Oryza sativa Japonica Group]
          Length = 375

 Score =  231 bits (588), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 102/168 (60%), Positives = 132/168 (78%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           ML RA  TA FRL+++  + ++H+++ + QTRD+FTIWG+LQLLR+YPGR+PDL+LMFDC
Sbjct: 1   MLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D PV+R+  Y G +    PPLF Y GD  T+DIVFPDWSFWGW EINIKPW++L ++LK
Sbjct: 61  ADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLK 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           +GNN   W+DR PYAYWKGNP VA TR++L+ CN+S   DWNAR+Y Q
Sbjct: 121 DGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTKDWNARIYKQ 168


>gi|215736906|dbj|BAG95835.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 275

 Score =  230 bits (586), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 102/168 (60%), Positives = 132/168 (78%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           ML RA  TA FRL+++  + ++H+++ + QTRD+FTIWG+LQLLR+YPGR+PDL+LMFDC
Sbjct: 1   MLARARVTASFRLLVLGGRAFVHRFRPAFQTRDLFTIWGVLQLLRRYPGRVPDLDLMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D PV+R+  Y G +    PPLF Y GD  T+DIVFPDWSFWGW EINIKPW++L ++LK
Sbjct: 61  ADWPVVRTHLYRGKHAAFMPPLFSYCGDDRTLDIVFPDWSFWGWPEINIKPWDALRQDLK 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           +GNN   W+DR PYAYWKGNP VA TR++L+ CN+S   DWNAR+Y Q
Sbjct: 121 DGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTKDWNARIYKQ 168


>gi|326505488|dbj|BAJ95415.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326509389|dbj|BAJ91611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 562

 Score =  230 bits (586), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 101/192 (52%), Positives = 135/192 (70%), Gaps = 4/192 (2%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CPDYFR IH DL PW+  GI+R+ +ER    A FRL +V+ + Y+  Y +  QTRD+FT 
Sbjct: 168 CPDYFRHIHSDLEPWREKGISREAVERGRPKAAFRLTVVSGRAYVETYHRVFQTRDLFTQ 227

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WGI QLL +YPGR+PDL+LMF+C+D P +R+ DY  P+    PPLFRY  D  +++++FP
Sbjct: 228 WGIAQLLARYPGRVPDLDLMFNCEDMPELRAADY--PDTSAAPPLFRYCKDGTSLEVLFP 285

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC--NL 257
           DWSFWGW E+NI+PW  L++E+ E N    W DR+PYA+WKGNP V+  RRDL  C  + 
Sbjct: 286 DWSFWGWPEVNIRPWGPLMKEIAEENARLPWPDRQPYAFWKGNPDVSRARRDLFRCSNDS 345

Query: 258 SDKHDWNARLYV 269
           +   DWNARL+ 
Sbjct: 346 AAGRDWNARLFA 357


>gi|294461229|gb|ADE76177.1| unknown [Picea sitchensis]
          Length = 492

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 106/194 (54%), Positives = 138/194 (71%), Gaps = 3/194 (1%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
            + CP YF+WI +D+ PW  +GIT DM+E AN  A FRL +VN +++I  Y++  Q+RD+
Sbjct: 95  AAFCPFYFKWIRQDMKPWAKSGITIDMVEAANPEASFRLTVVNGRMFIESYRKCYQSRDL 154

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           FTIWGI QLL+ YP  LPDL+LMF+CDD PVI   DY+  + K PPPLFRYSG   T DI
Sbjct: 155 FTIWGIAQLLKFYPRLLPDLDLMFNCDDNPVIHRGDYND-STKPPPPLFRYSGSEDTFDI 213

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWSFWGW EI   PWE+L +E++ G+    W DR+P AYWKGNP++ + R+DL+ C 
Sbjct: 214 VFPDWSFWGWPEIRTPPWETLAKEIQNGSQKVKWEDRDPTAYWKGNPYMGQGRQDLMNC- 272

Query: 257 LSDKHDWNARLYVQ 270
           +  +H W  RLY Q
Sbjct: 273 IHRRH-WGGRLYNQ 285


>gi|413925102|gb|AFW65034.1| lipopolysaccharide-modifying protein [Zea mays]
          Length = 599

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 114/248 (45%), Positives = 143/248 (57%), Gaps = 26/248 (10%)

Query: 41  ITTPLNCVKNQTQTCPTNYPKT-------------SQTQESISDYSIPPTSTCPDYFRWI 87
           +  P +C  N +  CP     T             + T+ S S    PP   CPDYFR+I
Sbjct: 151 VPIPFSCGGNGS-ACPRRQTATSPPSPSPSSSSPPAGTKSSSSKNKQPPPQ-CPDYFRFI 208

Query: 88  HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
           H DLSPW+ TGITR+ +ER    A FRL++V+ + Y+  Y +  QTRD FT WGI QLL 
Sbjct: 209 HSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGIAQLLA 268

Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
           +YPGR+PDL+LMF+C+D P ++ +          PPLFRY  D  T+DIVFPDWSFWGW 
Sbjct: 269 RYPGRVPDLDLMFNCEDMPEVKVKPS---EESSAPPLFRYCKDDSTVDIVFPDWSFWGWP 325

Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK------- 260
           E+NI+PW  LL E+        W DREPYAYWKGNP V+  R DL  CN S         
Sbjct: 326 EVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTVSAERADLRRCNDSSSSGGTRVF 385

Query: 261 -HDWNARL 267
             DW A +
Sbjct: 386 WQDWGAAI 393


>gi|226510425|ref|NP_001149452.1| LOC100283078 [Zea mays]
 gi|195627344|gb|ACG35502.1| lipopolysaccharide-modifying protein [Zea mays]
          Length = 600

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 114/248 (45%), Positives = 143/248 (57%), Gaps = 26/248 (10%)

Query: 41  ITTPLNCVKNQTQTCPTNYPKT-------------SQTQESISDYSIPPTSTCPDYFRWI 87
           +  P +C  N +  CP     T             + T+ S S    PP   CPDYFR+I
Sbjct: 152 VPIPFSCGGNGS-ACPRRQTATSPPSPSPSSSSPPAATKSSSSKNKQPPPQ-CPDYFRFI 209

Query: 88  HEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
           H DLSPW+ TGITR+ +ER    A FRL++V+ + Y+  Y +  QTRD FT WGI QLL 
Sbjct: 210 HSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGIAQLLA 269

Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWA 207
           +YPGR+PDL+LMF+C+D P ++ +          PPLFRY  D  T+DIVFPDWSFWGW 
Sbjct: 270 RYPGRVPDLDLMFNCEDMPEVKVKPS---EESSAPPLFRYCKDDSTVDIVFPDWSFWGWP 326

Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK------- 260
           E+NI+PW  LL E+        W DREPYAYWKGNP V+  R DL  CN S         
Sbjct: 327 EVNIRPWAPLLEEMAAEMGRLPWADREPYAYWKGNPTVSAERADLRRCNDSSSSGGTRVF 386

Query: 261 -HDWNARL 267
             DW A +
Sbjct: 387 WQDWGAAI 394


>gi|294460113|gb|ADE75639.1| unknown [Picea sitchensis]
          Length = 492

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 104/191 (54%), Positives = 134/191 (70%), Gaps = 3/191 (1%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP YF+WI +D+ PW  +GIT DM+E A   A FRL +VN ++YI  Y++  Q+RD+FTI
Sbjct: 98  CPFYFKWIRQDMKPWAKSGITIDMVEAAKPEASFRLTVVNGRMYIESYRKCYQSRDLFTI 157

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WGI Q L+ YP  LPDL+ MF+CDD PVI   DY+  + K PPPLFRYSG   T DIVFP
Sbjct: 158 WGIAQFLKFYPRLLPDLDSMFNCDDNPVIHRGDYND-STKPPPPLFRYSGSEDTFDIVFP 216

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSD 259
           DWSFWGW EI   PWE+L +E++ G+    W DR+P AYWKGNP++ + R+DL+ C +  
Sbjct: 217 DWSFWGWPEIRTPPWETLAKEIQNGSQKVKWEDRDPTAYWKGNPYMGQGRQDLMNC-IHR 275

Query: 260 KHDWNARLYVQ 270
           +H W  RLY Q
Sbjct: 276 RH-WGGRLYNQ 285


>gi|242077829|ref|XP_002443683.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
 gi|241940033|gb|EES13178.1| hypothetical protein SORBIDRAFT_07g000280 [Sorghum bicolor]
          Length = 594

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 102/188 (54%), Positives = 132/188 (70%), Gaps = 2/188 (1%)

Query: 83  YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
           YFR+IH DLSPW+ TGITR+ +ER    A FRL++V+ + Y+  Y +  QTRD FT WGI
Sbjct: 198 YFRYIHSDLSPWRETGITREAVERGRHRAAFRLVVVDGRAYVETYHRVFQTRDTFTQWGI 257

Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWS 202
            QLL +YPGR+PDL+LMF+C+D P +R+ D++   ++  PPLFRY  D  T+DIVFPDWS
Sbjct: 258 AQLLARYPGRVPDLDLMFNCEDMPEVRAADFAAAPSQA-PPLFRYCKDDSTLDIVFPDWS 316

Query: 203 FWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHD 262
           FWGW E+NI+PW  LL E+        W +REPYAYWKGNP V+  R DLL CN S   +
Sbjct: 317 FWGWPEVNIRPWAPLLEEMAAETARLPWAEREPYAYWKGNPGVSGERGDLLRCNDSSG-E 375

Query: 263 WNARLYVQ 270
           W  R++ Q
Sbjct: 376 WRTRVFWQ 383


>gi|356530084|ref|XP_003533614.1| PREDICTED: KDEL motif-containing protein 1-like, partial [Glycine
           max]
          Length = 357

 Score =  223 bits (569), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 111/226 (49%), Positives = 140/226 (61%), Gaps = 17/226 (7%)

Query: 44  PLNCVKN-QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
           PLNC     T TCP +Y  T   Q+  S       STCP+YF+WIHEDL PWK TGITR+
Sbjct: 3   PLNCFNGGMTYTCPKDYYPTKFAQDQDSS----TASTCPEYFKWIHEDLKPWKRTGITRE 58

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           M+ER    ++FRL+++  K Y  KY  S +TRDVFT+WGILQLLR YPG +PDLEL+ + 
Sbjct: 59  MMERGRNVSYFRLVIIQGKAYAKKYADSYETRDVFTVWGILQLLRLYPGDIPDLELLLET 118

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D+P++      GP    PPP+F Y G +   DIVFPDW F GWA++ IK        L 
Sbjct: 119 GDKPMVDKEQSQGP----PPPIFHYCGHKNAYDIVFPDWIFRGWADLAIK--------LD 166

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
           E N    W DR PYA WKG  +V+  R DL  CN SD+HD  A+++
Sbjct: 167 ESNKKIKWKDRLPYAIWKGKTWVSHKRNDLTKCNASDQHDSYAQIH 212


>gi|356504647|ref|XP_003521107.1| PREDICTED: protein O-glucosyltransferase 1-like [Glycine max]
          Length = 373

 Score =  223 bits (569), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 97/166 (58%), Positives = 126/166 (75%), Gaps = 1/166 (0%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           M+ER    +HFRL++VN K YI K+ +S QTRDVFTIWGILQLLR YPG++PDLELMF C
Sbjct: 1   MIERGKNISHFRLVIVNGKAYIEKFAKSYQTRDVFTIWGILQLLRLYPGKVPDLELMFHC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D+ V+  +D+ GP    PPP+F Y G+  + DIVFPDW+FWGWAE++I+PWE+ L  ++
Sbjct: 61  GDKTVVFKKDFQGPQ-MSPPPVFHYCGEENSYDIVFPDWTFWGWAELSIRPWETTLHNIQ 119

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
           EGN    W DR PYA+WKGNP V+  RR+L  CN+++K DWNAR+Y
Sbjct: 120 EGNKMVKWKDRIPYAFWKGNPKVSIIRRELGKCNVTEKQDWNARIY 165


>gi|55296633|dbj|BAD69335.1| unknown protein [Oryza sativa Japonica Group]
 gi|55297286|dbj|BAD69071.1| unknown protein [Oryza sativa Japonica Group]
          Length = 542

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 106/235 (45%), Positives = 147/235 (62%), Gaps = 2/235 (0%)

Query: 37  PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
           PP     P +C               S    S       P ++CPDYFR+IH+DL PW+ 
Sbjct: 103 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 162

Query: 97  TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
            GITR+ +ER  + A+FRL++V+ + Y+  Y++S QTRD FT WG+ QLLR+Y GR+PD+
Sbjct: 163 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFTQWGVAQLLRRYAGRVPDV 222

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
           ++MF CDDR  +R+ D++       PP+FRY  D  T+D+VFPDWSFWGW E+NI  W +
Sbjct: 223 DIMFACDDRGRVRAADFAA-APADAPPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPA 281

Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
            L  ++  +    W +REP+A+WKGNP VA  R +L+ CN  SD  DWNARL+ Q
Sbjct: 282 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 336


>gi|115466478|ref|NP_001056838.1| Os06g0152700 [Oryza sativa Japonica Group]
 gi|113594878|dbj|BAF18752.1| Os06g0152700 [Oryza sativa Japonica Group]
          Length = 547

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 106/235 (45%), Positives = 147/235 (62%), Gaps = 2/235 (0%)

Query: 37  PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
           PP     P +C               S    S       P ++CPDYFR+IH+DL PW+ 
Sbjct: 108 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 167

Query: 97  TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
            GITR+ +ER  + A+FRL++V+ + Y+  Y++S QTRD FT WG+ QLLR+Y GR+PD+
Sbjct: 168 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAFTQWGVAQLLRRYAGRVPDV 227

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
           ++MF CDDR  +R+ D++       PP+FRY  D  T+D+VFPDWSFWGW E+NI  W +
Sbjct: 228 DIMFACDDRGRVRAADFAA-APADAPPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPA 286

Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
            L  ++  +    W +REP+A+WKGNP VA  R +L+ CN  SD  DWNARL+ Q
Sbjct: 287 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 341


>gi|218197602|gb|EEC80029.1| hypothetical protein OsI_21713 [Oryza sativa Indica Group]
          Length = 515

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 102/204 (50%), Positives = 142/204 (69%), Gaps = 2/204 (0%)

Query: 76  PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           P ++CPDYFR+IH+DL PW+  GITR+ +ER  + A+FRL++V+ + Y+  Y++S QTRD
Sbjct: 146 PAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRD 205

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
            FT WG+ QLLR+Y GR+PD+++MF CDDR  +R+ D++       PP+FRY  D  T+D
Sbjct: 206 AFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFA-AAPADAPPVFRYCRDATTLD 264

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           +VFPDWSFWGW E+NI  W + L  ++  +    W +REP+A+WKGNP VA  R +L+ C
Sbjct: 265 VVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKC 324

Query: 256 N-LSDKHDWNARLYVQVISFIYIT 278
           N  SD  DWNARL+ Q     Y T
Sbjct: 325 NPASDGKDWNARLFSQTYRRSYQT 348



 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 105/153 (68%), Gaps = 2/153 (1%)

Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
           N +++   Y++S QTRD FT WG+ QLLR+Y GR+PD+++MF CDDR  +R+ D++    
Sbjct: 334 NARLFSQTYRRSYQTRDAFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFAAAPA 393

Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAY 238
              PP+FRY  D  T+D+VFPDWSFWGW E+NI  W + L  ++  +    W +REP+A+
Sbjct: 394 D-APPVFRYCRDATTLDVVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAF 452

Query: 239 WKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
           WKGNP VA  R +L+ CN  SD  DWNARL+ Q
Sbjct: 453 WKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 485


>gi|227206228|dbj|BAH57169.1| AT1G63420 [Arabidopsis thaliana]
          Length = 299

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 100/183 (54%), Positives = 130/183 (71%), Gaps = 13/183 (7%)

Query: 31  RKPQNRPPKRITTPLNCV----KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRW 86
           R  + + P+   + ++C     +N++ +C           ++ S+ S      CPDYF+W
Sbjct: 123 RVSEKKSPEETGSSVDCSSFLNQNRSGSCSRTLQSGYNQNQTESNRS------CPDYFKW 176

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
           IHEDL PW+ TGIT++M+ER   TAHFRL+++N KV++  YK+SIQTRD FT+WGILQLL
Sbjct: 177 IHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLL 236

Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN---NKGPPPLFRYSGDRWTMDIVFPDWSF 203
           RKYPG+LPD++LMFDCDDRPVIRS  Y+  N      PPPLFRY GDRWT+DIVFPDWSF
Sbjct: 237 RKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSF 296

Query: 204 WGW 206
           WGW
Sbjct: 297 WGW 299


>gi|125559830|gb|EAZ05278.1| hypothetical protein OsI_27481 [Oryza sativa Indica Group]
          Length = 579

 Score =  216 bits (551), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 116/238 (48%), Positives = 145/238 (60%), Gaps = 6/238 (2%)

Query: 41  ITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTST---CPDYFRWIHEDLSPWKV 96
           I  P  C   N T TCP           +      P TST   CP YFR I  DL+ W  
Sbjct: 137 IPIPFTCGSGNDTGTCPRYAASAPAPAPASMSPPPPQTSTVDYCPSYFRHIELDLAAWVA 196

Query: 97  TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
           +GI+R+ +ER  + AHFRL++V  + Y+  Y+++ QTRDVFT WGILQLLR+YPGR+PDL
Sbjct: 197 SGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDVFTQWGILQLLRRYPGRVPDL 256

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
           +LMF+CDD P +R+  Y         PLFRY  D  T+D++FPDWSFWGW E+NI+PW  
Sbjct: 257 DLMFNCDDMPEVRAAAYPDRAAAP--PLFRYCKDPSTLDVLFPDWSFWGWPEVNIRPWAP 314

Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQVISF 274
           LL E+ E      W  REPYAYWKGNP V+  R++LL CN S   D   RLY Q   F
Sbjct: 315 LLAEMAEEKARLPWSRREPYAYWKGNPDVSPLRQELLRCNHSLPPDDTVRLYRQDWGF 372


>gi|115474335|ref|NP_001060764.1| Os08g0101800 [Oryza sativa Japonica Group]
 gi|38637293|dbj|BAD03556.1| unknown protein [Oryza sativa Japonica Group]
 gi|50725686|dbj|BAD33152.1| unknown protein [Oryza sativa Japonica Group]
 gi|113622733|dbj|BAF22678.1| Os08g0101800 [Oryza sativa Japonica Group]
 gi|125601894|gb|EAZ41219.1| hypothetical protein OsJ_25724 [Oryza sativa Japonica Group]
          Length = 579

 Score =  216 bits (551), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 116/238 (48%), Positives = 145/238 (60%), Gaps = 6/238 (2%)

Query: 41  ITTPLNC-VKNQTQTCPTNYPKTSQTQESISDYSIPPTST---CPDYFRWIHEDLSPWKV 96
           I  P  C   N T TCP           +      P TST   CP YFR I  DL+ W  
Sbjct: 137 IPIPFTCGSGNDTGTCPRYAASAPAPAPASMSPPPPQTSTVDYCPSYFRHIELDLAAWLA 196

Query: 97  TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
           +GI+R+ +ER  + AHFRL++V  + Y+  Y+++ QTRDVFT WGILQLLR+YPGR+PDL
Sbjct: 197 SGISREAVERGRRQAHFRLLVVGGRAYVETYRRAFQTRDVFTQWGILQLLRRYPGRVPDL 256

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
           +LMF+CDD P +R+  Y         PLFRY  D  T+D++FPDWSFWGW E+NI+PW  
Sbjct: 257 DLMFNCDDMPEVRAAAYPDRAAAP--PLFRYCKDPSTLDVLFPDWSFWGWPEVNIRPWAP 314

Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQVISF 274
           LL E+ E      W  REPYAYWKGNP V+  R++LL CN S   D   RLY Q   F
Sbjct: 315 LLAEMAEEKARLPWSRREPYAYWKGNPDVSPLRQELLRCNHSLPPDDTVRLYRQDWGF 372


>gi|357127693|ref|XP_003565513.1| PREDICTED: uncharacterized protein LOC100832633 [Brachypodium
           distachyon]
          Length = 538

 Score =  216 bits (551), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 103/203 (50%), Positives = 133/203 (65%), Gaps = 3/203 (1%)

Query: 71  DYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQS 130
           D S+ P   CP YFR IH DL PW+ TGITRD LERA   A FRL +V+ + Y+   + S
Sbjct: 131 DPSVAPPPWCPVYFRHIHTDLDPWRSTGITRDTLERAMPHAEFRLTVVSGRAYVQNLRPS 190

Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG- 189
            QTRDVFT WG+LQLL ++PGR+PD+++MF   D   + S DY       PPPLFRY   
Sbjct: 191 YQTRDVFTQWGVLQLLARFPGRVPDVDIMFSAGDVAQVLSADYYNTTTHPPPPLFRYCKE 250

Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
           ++    IVFPDWSFWGW E++I+PW  L+ +    N    W +R+PYA+WKGNP V++ R
Sbjct: 251 EKLEAAIVFPDWSFWGWPELSIRPWAPLMEDFVRENKALPWRNRQPYAFWKGNPEVSDVR 310

Query: 250 RDLLTCN--LSDKHDWNARLYVQ 270
           RDL  CN   ++  DWNARL+ Q
Sbjct: 311 RDLFKCNNDSANGKDWNARLFWQ 333


>gi|125554122|gb|EAY99727.1| hypothetical protein OsI_21712 [Oryza sativa Indica Group]
          Length = 308

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 100/196 (51%), Positives = 140/196 (71%), Gaps = 2/196 (1%)

Query: 76  PTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           P ++CPDYFR+IH+DL PW+  GITR+ +ER  + A+FRL++V+ + Y+  Y++S QTRD
Sbjct: 37  PAASCPDYFRYIHDDLRPWRGAGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRD 96

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
            FT WG+ QLLR+Y GR+PD+++MF CDDR  +R+ D++       PP+FRY  D  T+D
Sbjct: 97  AFTQWGVAQLLRRYAGRVPDVDIMFACDDRGRVRAADFA-AAPADAPPVFRYCRDATTLD 155

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           +VFPDWSFWGW E+NI  W + L  ++  +    W +REP+A+WKGNP VA  R +L+ C
Sbjct: 156 VVFPDWSFWGWPEVNIGAWPATLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKC 215

Query: 256 N-LSDKHDWNARLYVQ 270
           N  SD  DWNARL+ Q
Sbjct: 216 NPASDGKDWNARLFSQ 231


>gi|297849014|ref|XP_002892388.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338230|gb|EFH68647.1| hypothetical protein ARALYDRAFT_887933 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 508

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 145/221 (65%), Gaps = 12/221 (5%)

Query: 54  TCPTNY---PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQT 110
           +CP N    PK+ Q++      +  P   CPD+FRWIH DL PW  TG+T++ ++RA   
Sbjct: 89  SCPYNAVVQPKSLQSESVSGRRTHQPQ--CPDFFRWIHRDLEPWAKTGVTKEHVKRAKAN 146

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
           A FR+++++ K+Y+  Y   +Q+R +FTIWGILQLL KYPG +PD+++MFDC D+P+I  
Sbjct: 147 AAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLNKYPGMVPDVDMMFDCMDKPIINQ 206

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
            +Y       P PLFRY  +   +DI FPDWSFWGW+E N++PWE    ++K+G+  R+W
Sbjct: 207 TEY----QSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDIKQGSRRRSW 262

Query: 231 IDREPYAYWKGNP-FVAETRRDLLTCNLSDKHDWNARLYVQ 270
            +++P AYWKGNP  V+  R +L+ CN S    W A++  Q
Sbjct: 263 DNKQPRAYWKGNPDVVSPIRLELMKCNHS--RLWGAQIMRQ 301


>gi|357497389|ref|XP_003618983.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
 gi|355493998|gb|AES75201.1| hypothetical protein MTR_6g031080, partial [Medicago truncatula]
          Length = 226

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 102/197 (51%), Positives = 132/197 (67%), Gaps = 12/197 (6%)

Query: 15  FSVHNISRNK---TVIIVSRKPQNRPPKRITTPLNCVKNQTQ--TCPTNYPKTSQTQESI 69
           F +H I  +K   T I  + +  N   ++   PLNC  N+T   TCP +YP T    +  
Sbjct: 37  FIIHWIDLSKVTSTTIFKTIEVLNNIKQQPQFPLNC-NNETSNSTCPFSYPTTFHLNDD- 94

Query: 70  SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
                 P+S+CP+YF+WIHEDL PW+ TGITR+M+E     +HFRL++VN K YI K+ +
Sbjct: 95  -----SPSSSCPNYFKWIHEDLKPWEKTGITREMVESGKNMSHFRLVVVNGKAYIDKFAK 149

Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
           S QTRDVFTIWGILQLLR YPG++PDLELMF C D+ V+  + + GP N  PPP+F Y G
Sbjct: 150 SYQTRDVFTIWGILQLLRLYPGKIPDLELMFQCGDKNVVDKKKFQGPENVSPPPIFHYCG 209

Query: 190 DRWTMDIVFPDWSFWGW 206
           D  T+DIVFPDW+FWGW
Sbjct: 210 DEDTLDIVFPDWTFWGW 226


>gi|15222341|ref|NP_172202.1| uncharacterized protein [Arabidopsis thaliana]
 gi|8954024|gb|AAF82198.1|AC067971_6 Contains similarity to an unknown protein T2J13.180 gi|6522568 from
           Arabidopsis thaliana BAC T2J13 gb|AL132967. ESTs
           gb|Z29835 and gb|Z29836 come from this gene [Arabidopsis
           thaliana]
 gi|332189973|gb|AEE28094.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 507

 Score =  207 bits (526), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 92/192 (47%), Positives = 133/192 (69%), Gaps = 7/192 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CPD+FRWIH DL PW  TG+T++ ++RA   A FR+++++ K+Y+  Y   +Q+R +FTI
Sbjct: 115 CPDFFRWIHRDLEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTI 174

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WGILQLL KYPG +PD+++MFDC D+P+I   +Y       P PLFRY  +   +DI FP
Sbjct: 175 WGILQLLTKYPGMVPDVDMMFDCMDKPIINQTEY----QSFPVPLFRYCTNEAHLDIPFP 230

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCNLS 258
           DWSFWGW+E N++PWE    ++K+G+  R+W +++P AYWKGNP  V+  R +L+ CN S
Sbjct: 231 DWSFWGWSETNLRPWEEEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHS 290

Query: 259 DKHDWNARLYVQ 270
               W A++  Q
Sbjct: 291 --RLWGAQIMRQ 300


>gi|62320292|dbj|BAD94602.1| hypothetical protein [Arabidopsis thaliana]
          Length = 507

 Score =  206 bits (524), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 92/192 (47%), Positives = 133/192 (69%), Gaps = 7/192 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CPD+FRWIH DL PW  TG+T++ ++RA   A FR+++++ K+Y+  Y   +Q+R +FTI
Sbjct: 115 CPDFFRWIHRDLEPWAKTGVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTI 174

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WGILQLL KYPG +PD+++MFDC D+P+I   +Y       P PLFRY  +   +DI FP
Sbjct: 175 WGILQLLTKYPGMVPDVDMMFDCMDKPIINQTEY----QSFPVPLFRYCTNEAHLDIPFP 230

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTCNLS 258
           DWSFWGW+E N++PWE    ++K+G+  R+W +++P AYWKGNP  V+  R +L+ CN S
Sbjct: 231 DWSFWGWSETNLRPWEVEFGDIKQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHS 290

Query: 259 DKHDWNARLYVQ 270
               W A++  Q
Sbjct: 291 --RLWGAQIMRQ 300


>gi|294460307|gb|ADE75735.1| unknown [Picea sitchensis]
          Length = 194

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 94/168 (55%), Positives = 124/168 (73%), Gaps = 3/168 (1%)

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           M+ERA +TA FRL+++  ++YI  Y +S QTRDVFTIWG++QL+  Y   LPDL+LMFDC
Sbjct: 1   MVERARRTATFRLVILKGRMYIRTYTKSFQTRDVFTIWGLIQLMENYGWMLPDLDLMFDC 60

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D PVI+++ Y+  +   PPPLFRY GD  ++DI FPDWSFWGWAE+N +PW+ LL ++ 
Sbjct: 61  VDWPVIKAKAYANASLPPPPPLFRYCGDDKSLDIAFPDWSFWGWAEVNTRPWDGLLNDIL 120

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
           +G     W DR+P A+WKGNP+VA  R DL+ CNLSD+   NARLY Q
Sbjct: 121 KGAKKLKWEDRDPTAFWKGNPYVAAVREDLMKCNLSDR---NARLYNQ 165


>gi|357119765|ref|XP_003561604.1| PREDICTED: uncharacterized protein LOC100842005 [Brachypodium
           distachyon]
          Length = 616

 Score =  196 bits (499), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 98/191 (51%), Positives = 129/191 (67%), Gaps = 4/191 (2%)

Query: 82  DYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
           DYFR+IH DLSPW+ +G IT   LERA   A FRL +V+ + Y+  Y ++ QTRDVFT W
Sbjct: 216 DYFRYIHSDLSPWRASGGITLPTLERAIPHAAFRLTVVSGRAYVETYHRAFQTRDVFTQW 275

Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW-TMDIVFP 199
           GILQLL +YPGR+PDL+ MF+ +D P I   D  G     PPPLFRY  D   +++I+FP
Sbjct: 276 GILQLLARYPGRVPDLDAMFNLEDMPEIFRNDADGNPAPPPPPLFRYCKDGGDSVEILFP 335

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN--L 257
           DWSFWGW E+NI+PW  L+      N    W +REP+A+WKGNP+V+  R+DL  CN   
Sbjct: 336 DWSFWGWPEVNIRPWAPLMENFVRENRALPWQNREPFAFWKGNPYVSNARKDLFKCNNDS 395

Query: 258 SDKHDWNARLY 268
           +   ++NARL+
Sbjct: 396 AAGKEFNARLF 406


>gi|302784816|ref|XP_002974180.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
 gi|300158512|gb|EFJ25135.1| hypothetical protein SELMODRAFT_100126 [Selaginella moellendorffii]
          Length = 426

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 93/195 (47%), Positives = 128/195 (65%), Gaps = 6/195 (3%)

Query: 79  TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           +CPD+F+WIH DL+PW+ +G I+R  LE A + A FR+ ++  ++Y   Y Q +Q+R +F
Sbjct: 1   SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60

Query: 138 TIWGILQLLRKYP-GRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           T+WG+L LL ++P G +PD+E MF+C DRP  R   Y    ++ PPPL  Y G R T+DI
Sbjct: 61  TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRY---KSRAPPPLLAYCGSRDTVDI 117

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTC 255
            FPDWSFWGWAE+ I  W      +  G+    W +R P A+WKGNP V A+ R DLL C
Sbjct: 118 AFPDWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLAC 177

Query: 256 NLSDKHDWNARLYVQ 270
           N +DK D+ A +YVQ
Sbjct: 178 NKTDKRDFGADIYVQ 192


>gi|302770761|ref|XP_002968799.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
 gi|300163304|gb|EFJ29915.1| hypothetical protein SELMODRAFT_450427 [Selaginella moellendorffii]
          Length = 423

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 93/195 (47%), Positives = 128/195 (65%), Gaps = 6/195 (3%)

Query: 79  TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           +CPD+F+WIH DL+PW+ +G I+R  LE A + A FR+ ++  ++Y   Y Q +Q+R +F
Sbjct: 1   SCPDFFQWIHHDLAPWRASGGISRAALEEAREFAAFRVAIIGGQLYAELYYQCVQSRAMF 60

Query: 138 TIWGILQLLRKYP-GRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
           T+WG+L LL ++P G +PD+E MF+C DRP  R   Y    ++ PPPL  Y G R T+DI
Sbjct: 61  TLWGLLLLLERFPAGAVPDVEFMFNCMDRPHFRRSRY---KSRAPPPLLAYCGSRDTVDI 117

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTC 255
            FPDWSFWGWAE+ I  W      +  G+    W +R P A+WKGNP V A+ R DLL C
Sbjct: 118 AFPDWSFWGWAEVRIGAWGEEASSIFHGSEETRWENRRPRAHWKGNPHVGAQVRTDLLAC 177

Query: 256 NLSDKHDWNARLYVQ 270
           N +DK D+ A +YVQ
Sbjct: 178 NKTDKRDFGADIYVQ 192


>gi|302783306|ref|XP_002973426.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300159179|gb|EFJ25800.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 449

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 89/189 (47%), Positives = 121/189 (64%), Gaps = 1/189 (0%)

Query: 71  DYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQS 130
           ++S      CPDYF+WI  DLSPWK TGI++  L+ A   A FR+++V+ K+Y+ +Y+  
Sbjct: 88  NFSAQAQGKCPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYC 147

Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
            QTR  +T+WGI  LL ++PG++PDLELMF C DRP +  ++YS  +   PPPLF Y   
Sbjct: 148 YQTRAQYTLWGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVWPPPPLFSYCTT 207

Query: 191 R-WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETR 249
           R    DIVFPDWSFWGW E+NI PW     ++  G     W+ REP A WKGN ++ + R
Sbjct: 208 RDEHYDIVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGNTWMGKIR 267

Query: 250 RDLLTCNLS 258
             L+ CN S
Sbjct: 268 PMLVQCNSS 276


>gi|395146518|gb|AFN53673.1| DUF821 [Linum usitatissimum]
          Length = 474

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 87/185 (47%), Positives = 117/185 (63%), Gaps = 5/185 (2%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
              CP++F +IH DL PW  +GIT D L  A   A FR+++   K+Y   Y    Q+R +
Sbjct: 106 AQQCPEFFSYIHRDLEPWAKSGITEDQLMEAKNFAAFRIVIYQGKLYFDPYYACFQSRMM 165

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
            TIWG LQLL+KYPG +PD++LMFDC D+P+    ++       P PLFRY   R   DI
Sbjct: 166 TTIWGFLQLLKKYPGMVPDVDLMFDCMDKPIFNRTEHQA----NPVPLFRYCTTREHFDI 221

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLLTC 255
            FPDWSFWGW+EINI+PW     ++K+G+  + W  R+P+A+WKGNP  V+  R +LL C
Sbjct: 222 PFPDWSFWGWSEINIRPWSEEFPDIKKGSQAKRWAKRQPHAFWKGNPDVVSPVRLELLQC 281

Query: 256 NLSDK 260
           N S K
Sbjct: 282 NDSRK 286


>gi|326495828|dbj|BAJ90536.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 486

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 111/288 (38%), Positives = 154/288 (53%), Gaps = 27/288 (9%)

Query: 2   IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
           I  F+  I++    S ++  +SR +T++  + +P         + RPP R    L C   
Sbjct: 30  IAAFSAVIVIAVLHSAYDDALSRTRTLLGHNLEPTPWHPFPHDKGRPPPRAA--LRCASY 87

Query: 51  QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK-----VTGITRDMLE 105
            +   P + PK +    + +         CP YF  I  DL+PW+       GITR +LE
Sbjct: 88  LSCLPPLSQPKPAAAAAANASR----PRQCPSYFAAIRRDLAPWRRRDGGGGGITRALLE 143

Query: 106 RANQTAHFRLILVNNKVYIHK--YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
            A   A  R+ +      +H   Y   +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC 
Sbjct: 144 SARFRASMRVTITGGGTRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCM 203

Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKE 223
           DRP I   ++SG     PPPLFRY   R  +DI FPDWSFWGW E +I+PW    R +K+
Sbjct: 204 DRPAINRTEHSGDGAPPPPPLFRYCTTRDHLDIPFPDWSFWGWPETHIEPWSREFRSIKQ 263

Query: 224 GNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
           G+    W DR P AYWKGNP VA   R  LL CN  D + W A +  Q
Sbjct: 264 GSRRVKWPDRVPTAYWKGNPDVASPLRLALLACN--DTNLWRAEIMRQ 309


>gi|297817442|ref|XP_002876604.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322442|gb|EFH52863.1| hypothetical protein ARALYDRAFT_907657 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 229

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 91/165 (55%), Positives = 116/165 (70%), Gaps = 12/165 (7%)

Query: 44  PLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
           PLNC  N  QTCP+NYP  S+ + +IS      + TCPDYFRWI +DL  W+ TGITR+ 
Sbjct: 69  PLNC--NAKQTCPSNYP--SRFEPAISS-----SETCPDYFRWIQQDLKAWEETGITRET 119

Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
           LERA   AHFRL++ + ++Y+H+Y ++ ++RDV TIWGILQLLR YPG++PDLEL+F C 
Sbjct: 120 LERAKPKAHFRLVIKSGRLYVHQYDKAYESRDVLTIWGILQLLRMYPGQVPDLELLFFCH 179

Query: 164 DRPVIRSRDYSGPNNKG---PPPLFRYSGDRWTMDIVFPDWSFWG 205
           D+P I  RD+  P       PPPLF+Y G R    IVFPDWSFWG
Sbjct: 180 DKPAIWKRDFRQPQPNATWPPPPLFQYCGHREAYGIVFPDWSFWG 224


>gi|449455154|ref|XP_004145318.1| PREDICTED: uncharacterized protein LOC101204476 [Cucumis sativus]
          Length = 472

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 123/192 (64%), Gaps = 7/192 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP++FRWIH DL PW  T I+   LE + + A FR+++V  ++Y+  Y   +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WG++Q+LR+YPG +PD+++MFDC D+P I   +    N   P PLFRY       DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
           DWSFWGW E+N++ W     ++K+G+   +W ++ P AYWKGNP V +  R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275

Query: 259 DKHDWNARLYVQ 270
               W A++  Q
Sbjct: 276 RM--WGAQIMRQ 285


>gi|449520138|ref|XP_004167091.1| PREDICTED: uncharacterized protein LOC101228589 [Cucumis sativus]
          Length = 472

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 123/192 (64%), Gaps = 7/192 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP++FRWIH DL PW  T I+   LE + + A FR+++V  ++Y+  Y   +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WG++Q+LR+YPG +PD+++MFDC D+P I   +    N   P PLFRY       DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
           DWSFWGW E+N++ W     ++K+G+   +W ++ P AYWKGNP V +  R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275

Query: 259 DKHDWNARLYVQ 270
               W A++  Q
Sbjct: 276 RM--WGAQIMRQ 285


>gi|449471103|ref|XP_004153209.1| PREDICTED: uncharacterized protein LOC101204904 [Cucumis sativus]
          Length = 472

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 123/192 (64%), Gaps = 7/192 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP++FRWIH DL PW  T I+   LE + + A FR+++V  ++Y+  Y   +Q+R +FTI
Sbjct: 100 CPEFFRWIHHDLDPWARTRISMTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 159

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WG++Q+LR+YPG +PD+++MFDC D+P I   +    N   P PLFRY       DI FP
Sbjct: 160 WGLVQMLRRYPGMVPDVDMMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFP 215

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCNLS 258
           DWSFWGW E+N++ W     ++K+G+   +W ++ P AYWKGNP V +  R +LL CN S
Sbjct: 216 DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHS 275

Query: 259 DKHDWNARLYVQ 270
               W A++  Q
Sbjct: 276 RM--WGAQIMRQ 285


>gi|356569533|ref|XP_003552954.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
          Length = 464

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 84/192 (43%), Positives = 128/192 (66%), Gaps = 7/192 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP++FR IH DL+PW  + I++  +  A + A FR+++V  KV++  Y   +Q+R +FT+
Sbjct: 104 CPEFFRAIHRDLAPWLESRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTL 163

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WG+LQL+R+YPG++PD+++MFDC D+P +   ++       P PLFRY   +   DI FP
Sbjct: 164 WGLLQLMRRYPGKVPDVDMMFDCMDKPSVNRTEHQA----MPLPLFRYCTTKEHFDIPFP 219

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLS 258
           DWSFWGW+EINI+PW+    ++K+G+   +W ++ P+AYWKGNP VA   R +L+ CN  
Sbjct: 220 DWSFWGWSEINIRPWQEEFPDIKQGSRNVSWKNKFPWAYWKGNPDVASPIRTELINCN-- 277

Query: 259 DKHDWNARLYVQ 270
           D   W A +  Q
Sbjct: 278 DSRKWGAEIMRQ 289


>gi|302789424|ref|XP_002976480.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
 gi|300155518|gb|EFJ22149.1| hypothetical protein SELMODRAFT_105631 [Selaginella moellendorffii]
          Length = 357

 Score =  187 bits (474), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 89/183 (48%), Positives = 119/183 (65%), Gaps = 4/183 (2%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CPDYF+WI  DLSPWK TGI++  L+ A   A FR+++V+ K+Y+ +Y+   QTR  +T+
Sbjct: 1   CPDYFKWIERDLSPWKNTGISQGDLQEAKSKADFRVVIVDGKLYMERYRYCYQTRAQYTL 60

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG---PPPLFRYSGDR-WTMD 195
           WGI  LL ++PG++PDLELMF C DRP +  ++YS  + K    PPPLF Y   R    D
Sbjct: 61  WGIRMLLEEFPGQVPDLELMFMCGDRPQVLRKNYSSNSVKRRWPPPPLFSYCTTRDEHYD 120

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           IVFPDWSFWGW E+NI PW     ++  G     W+ REP A WKGN ++ + R  L+ C
Sbjct: 121 IVFPDWSFWGWPEVNIAPWTVEREKIFSGAEKIKWLQREPIAQWKGNTWMGKIRPMLVQC 180

Query: 256 NLS 258
           N S
Sbjct: 181 NSS 183


>gi|255569363|ref|XP_002525649.1| conserved hypothetical protein [Ricinus communis]
 gi|223535085|gb|EEF36767.1| conserved hypothetical protein [Ricinus communis]
          Length = 491

 Score =  186 bits (473), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 87/194 (44%), Positives = 125/194 (64%), Gaps = 7/194 (3%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + CP++FR+IH DL PW  TGIT+  +  A + A FR+++   ++Y+  Y   +Q+R +F
Sbjct: 102 AKCPEFFRFIHHDLQPWARTGITKKHIAEAKKFAAFRVVIFEGRLYLDLYYACVQSRMMF 161

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           T+WG+LQLL +YPG +PD+++MFDC DRPVI   ++       P P+FRY   +   DI 
Sbjct: 162 TVWGLLQLLNRYPGMVPDVDIMFDCMDRPVINKTEHIS----FPLPIFRYCTTQNHFDIP 217

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV-AETRRDLLTCN 256
           FPDWSFWGW EINI+ W    R++K G+  ++W  + P AYWKGNP V +  R +L+ CN
Sbjct: 218 FPDWSFWGWPEINIRSWNEEFRDIKRGSQSKSWSKKWPRAYWKGNPDVLSPIRTELMQCN 277

Query: 257 LSDKHDWNARLYVQ 270
            S K  W A +  Q
Sbjct: 278 HSRK--WGAHIMRQ 289


>gi|356541729|ref|XP_003539326.1| PREDICTED: O-glucosyltransferase rumi-like [Glycine max]
          Length = 464

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 84/192 (43%), Positives = 124/192 (64%), Gaps = 7/192 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP +FR IH DL+PW  + I++  +  A + A FR+++V  KV++  Y   +Q+R +FT+
Sbjct: 104 CPKFFRAIHRDLAPWSESRISKAHVAAAQRYAAFRVVIVEGKVFVDWYYACVQSRAMFTL 163

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           WG+LQL+R+YPG +PD+++MFDC D+P +   ++       P PLFRY   +   DI FP
Sbjct: 164 WGLLQLMRRYPGMVPDVDMMFDCMDKPSVNKTEHQA----MPLPLFRYCTTKEHFDIPFP 219

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLS 258
           DWSFWGW+EINI+PW+    ++K G+    W ++ P+AYWKGNP VA   R +L+ CN  
Sbjct: 220 DWSFWGWSEINIRPWQEEFPDIKRGSRSVTWKNKLPWAYWKGNPDVASPIRTELINCN-- 277

Query: 259 DKHDWNARLYVQ 270
           D   W A +  Q
Sbjct: 278 DSRKWGAEIMRQ 289


>gi|302808049|ref|XP_002985719.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300146628|gb|EFJ13297.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 386

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 85/194 (43%), Positives = 122/194 (62%), Gaps = 3/194 (1%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP YF WI  DL+PWK TGI++  L+ A   A FR+++VN  +Y+ +Y +  +TRD FT+
Sbjct: 1   CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-MDI 196
           WG+L LL++YPG +PD++LMF+C D P++   ++    N    PPPLF Y   R    DI
Sbjct: 61  WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRGDHYDI 120

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWS+WGW E+NI PW     ++  G    +W  R+P A+WKGN  +   R DL+ C 
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVKCT 180

Query: 257 LSDKHDWNARLYVQ 270
            ++ H++N   Y Q
Sbjct: 181 ANNTHNYNLVTYHQ 194


>gi|413917716|gb|AFW57648.1| lipopolysaccharide-modifying protein [Zea mays]
          Length = 500

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 24/285 (8%)

Query: 2   IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
           I  F+  I+L    S ++  +SR +T++  + +P         + RPP R    L C   
Sbjct: 46  IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 102

Query: 51  QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
            +  C      +S + +   +        CP YF  IH DL+PW+  G  +TR +L+ A 
Sbjct: 103 -SIACLPPLSSSSSSPQPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 160

Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
           + A  R+ +     ++++  Y   +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 161 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 220

Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
            I +R   G     PPPLFRY   R   DI FPDWSFWGW E NI+PW    + ++ G  
Sbjct: 221 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRAGAR 279

Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQ 270
              W DR P AYWKGNP VA   R+ LL CN  D   W A +  Q
Sbjct: 280 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQ 322


>gi|194707538|gb|ACF87853.1| unknown [Zea mays]
          Length = 480

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 24/285 (8%)

Query: 2   IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
           I  F+  I+L    S ++  +SR +T++  + +P         + RPP R    L C   
Sbjct: 26  IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 82

Query: 51  QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
            +  C      +S + +   +        CP YF  IH DL+PW+  G  +TR +L+ A 
Sbjct: 83  -SIACLPPLSSSSSSPQPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 140

Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
           + A  R+ +     ++++  Y   +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 141 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 200

Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
            I +R   G     PPPLFRY   R   DI FPDWSFWGW E NI+PW    + ++ G  
Sbjct: 201 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIEPWNHEFKNIRAGAR 259

Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQ 270
              W DR P AYWKGNP VA   R+ LL CN  D   W A +  Q
Sbjct: 260 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQ 302


>gi|302785297|ref|XP_002974420.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300158018|gb|EFJ24642.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 386

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 85/194 (43%), Positives = 122/194 (62%), Gaps = 3/194 (1%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP YF WI  DL+PWK TGI++  L+ A   A FR+++VN  +Y+ +Y +  +TRD FT+
Sbjct: 1   CPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDFTL 60

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-MDI 196
           WG+L LL++YPG +PD++LMF+C D P++   ++    N    PPPLF Y   R    DI
Sbjct: 61  WGLLMLLKEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRRDHYDI 120

Query: 197 VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           VFPDWS+WGW E+NI PW     ++  G    +W  R+P A+WKGN  +   R DL+ C 
Sbjct: 121 VFPDWSYWGWPEVNILPWSLEKGKIFSGAEKLDWSHRQPIAFWKGNYDMGPARADLVKCT 180

Query: 257 LSDKHDWNARLYVQ 270
            ++ H++N   Y Q
Sbjct: 181 ANNTHNYNLVTYHQ 194


>gi|226497664|ref|NP_001151574.1| lipopolysaccharide-modifying protein [Zea mays]
 gi|195647860|gb|ACG43398.1| lipopolysaccharide-modifying protein [Zea mays]
          Length = 480

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 107/285 (37%), Positives = 152/285 (53%), Gaps = 24/285 (8%)

Query: 2   IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKN 50
           I  F+  I+L    S ++  +SR +T++  + +P         + RPP R    L C   
Sbjct: 26  IAAFSAVIVLAVVHSAYDDAMSRTRTLLGHNLEPTPWHPFPHDKGRPPARAA--LRCAP- 82

Query: 51  QTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTG--ITRDMLERAN 108
            +  C      +S +     +        CP YF  IH DL+PW+  G  +TR +L+ A 
Sbjct: 83  -SIACLPPLSPSSSSSPPQGNKKKK-QKQCPAYFAAIHRDLAPWRGQGRGVTRALLDEAR 140

Query: 109 QTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
           + A  R+ +     ++++  Y   +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP
Sbjct: 141 RRASMRVTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRP 200

Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
            I +R   G     PPPLFRY   R   DI FPDWSFWGW + NI+PW    + ++ G  
Sbjct: 201 AI-NRTEHGDGALPPPPLFRYCTTRDHFDIPFPDWSFWGWPDTNIEPWNHEFKNIRAGAR 259

Query: 227 GRNWIDREPYAYWKGNPFVAETRRD-LLTCNLSDKHDWNARLYVQ 270
              W DR P AYWKGNP VA   R+ LL CN  D   W A +  Q
Sbjct: 260 ATRWADRVPTAYWKGNPDVASPLREALLGCN--DTALWRAEIMRQ 302


>gi|222629271|gb|EEE61403.1| hypothetical protein OsJ_15589 [Oryza sativa Japonica Group]
          Length = 535

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 89/189 (47%), Positives = 114/189 (60%), Gaps = 1/189 (0%)

Query: 83  YFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWG 141
           YFR+IHEDL PW+   GITR ML RA  TA FRL+++  + ++H++ +  +         
Sbjct: 140 YFRFIHEDLRPWRDAGGITRAMLARARVTASFRLLVLGGRAFVHRFARHSRPAISSPSGA 199

Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
                   PG  P           P       +G      PPLF Y GD  T+DIVFPDW
Sbjct: 200 SSSCSAATPGGSPTSTSCSTAPTGPSSARTSTAGSTLPFMPPLFSYCGDDRTLDIVFPDW 259

Query: 202 SFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
           SFWGW EINIKPW++L ++LK+GNN   W+DR PYAYWKGNP VA TR++L+ CN+S   
Sbjct: 260 SFWGWPEINIKPWDALRQDLKDGNNRVKWLDRVPYAYWKGNPAVAVTRQELVNCNVSTTK 319

Query: 262 DWNARLYVQ 270
           DWNAR+Y Q
Sbjct: 320 DWNARIYKQ 328


>gi|194702520|gb|ACF85344.1| unknown [Zea mays]
 gi|238009730|gb|ACR35900.1| unknown [Zea mays]
 gi|238011028|gb|ACR36549.1| unknown [Zea mays]
          Length = 435

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 91/196 (46%), Positives = 120/196 (61%), Gaps = 8/196 (4%)

Query: 80  CPDYFRWIHEDLSPWKVTG--ITRDMLERANQTAHFRLILVNN--KVYIHKYKQSIQTRD 135
           CP YF  IH DL+PW+  G  +TR +L+ A + A  R+ +     ++++  Y   +Q+R 
Sbjct: 65  CPAYFAAIHRDLAPWRGQGRGVTRALLDEARRRASMRVTITGGGRRLHVDLYYACVQSRA 124

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +FT+W +LQL+R+YPGR+PD++LMFDC DRP I +R   G     PPPLFRY   R   D
Sbjct: 125 LFTVWSLLQLMRRYPGRVPDVDLMFDCMDRPAI-NRTEHGDGALPPPPLFRYCTTRDHFD 183

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD-LLT 254
           I FPDWSFWGW E NI+PW    + ++ G     W DR P AYWKGNP VA   R+ LL 
Sbjct: 184 IPFPDWSFWGWPETNIEPWNHEFKNIRAGARATRWADRVPTAYWKGNPDVASPLREALLG 243

Query: 255 CNLSDKHDWNARLYVQ 270
           CN  D   W A +  Q
Sbjct: 244 CN--DTALWRAEIMRQ 257


>gi|302823162|ref|XP_002993235.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
 gi|300138905|gb|EFJ05656.1| hypothetical protein SELMODRAFT_431353 [Selaginella moellendorffii]
          Length = 475

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 83/196 (42%), Positives = 121/196 (61%), Gaps = 3/196 (1%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + CP YF WI  DL+PWK TGI++  L+ A   A FR+++VN  +Y+ +Y +  +TRD F
Sbjct: 85  AQCPSYFSWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDF 144

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-M 194
           T+WG+L LL +YPG +PD++LMF+C D P++   ++    N    PPPLF Y   R    
Sbjct: 145 TLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHKPEKNGSWPPPPLFLYCTSRGEHY 204

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           DIVFPDWS+WGW E+NI PW     ++  G    +W  R+P A+WKGN  +   R DL+ 
Sbjct: 205 DIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVK 264

Query: 255 CNLSDKHDWNARLYVQ 270
           C  ++  ++N   + Q
Sbjct: 265 CTANNSQNYNLVTHHQ 280


>gi|302794981|ref|XP_002979254.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300153022|gb|EFJ19662.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 474

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 83/196 (42%), Positives = 121/196 (61%), Gaps = 3/196 (1%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + CP YF WI  DL+PWK TGI++  L+ A   A FR+++VN  +Y+ +Y +  +TRD F
Sbjct: 85  AHCPSYFAWIENDLAPWKETGISQQNLQDARSKADFRVVIVNGTLYMERYNKCFETRDDF 144

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWT-M 194
           T+WG+L LL +YPG +PD++LMF+C D P++   ++    N    PPPLF Y   R    
Sbjct: 145 TLWGLLMLLEEYPGMVPDVDLMFNCGDWPLVFRAEHQPEKNGSWPPPPLFLYCTSRGEHY 204

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           DIVFPDWS+WGW E+NI PW     ++  G    +W  R+P A+WKGN  +   R DL+ 
Sbjct: 205 DIVFPDWSYWGWPEVNILPWSLEKEKIFSGAKKLDWSHRQPIAFWKGNYDMGPARADLVK 264

Query: 255 CNLSDKHDWNARLYVQ 270
           C  ++  ++N   + Q
Sbjct: 265 CTANNSQNYNLVTHHQ 280


>gi|124360767|gb|ABN08741.1| Protein of unknown function DUF821, CAP10-like [Medicago
           truncatula]
          Length = 185

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 82/163 (50%), Positives = 105/163 (64%), Gaps = 3/163 (1%)

Query: 45  LNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
           L C   N TQTCP ++       +     S   TSTCP++FRWIHEDL PWK TGITR+M
Sbjct: 25  LKCFNGNPTQTCPNDHSPIQAFHQDQDPSS--STSTCPEHFRWIHEDLEPWKSTGITREM 82

Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
           ++     +  R+++   KVY+  Y  S QTR  FT+WGI+QLLR YPGR+PDLEL+F+  
Sbjct: 83  VDSGENISQLRIVIKQGKVYVETYGDSFQTRATFTVWGIVQLLRLYPGRVPDLELLFETG 142

Query: 164 DRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
           DR V+  + Y GP    PPP+F Y G    +DIVFPDWS+WGW
Sbjct: 143 DRAVVDKKRYEGPEAVTPPPIFSYCGKNEALDIVFPDWSYWGW 185


>gi|359478546|ref|XP_003632132.1| PREDICTED: protein O-glucosyltransferase 1-like [Vitis vinifera]
 gi|297745896|emb|CBI15952.3| unnamed protein product [Vitis vinifera]
          Length = 464

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 91/201 (45%), Positives = 123/201 (61%), Gaps = 5/201 (2%)

Query: 60  PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           P T++  E  +  S      CP +F  I  DL PW  +GI+   +  A + A FR+++V 
Sbjct: 87  PTTTKIPEWHTRQSSNTVGKCPMFFTRIDHDLQPWVRSGISLSSVMEAQKFAAFRVVIVG 146

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
            K+Y+  +   +Q+R +FT+WG+LQLLR+YPG +PD++LMFDC D+P I SR+  G    
Sbjct: 147 GKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKPTI-SREEHG---S 202

Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
            P PLFRY       DI FPDWSFWGW EI+I PW+     +K+G+   NW  +  YAYW
Sbjct: 203 KPLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVLNWTQKLSYAYW 262

Query: 240 KGNPFV-AETRRDLLTCNLSD 259
           KGNP V +  R DLL CN SD
Sbjct: 263 KGNPDVQSPVRVDLLQCNNSD 283


>gi|302799316|ref|XP_002981417.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
 gi|300150957|gb|EFJ17605.1| hypothetical protein SELMODRAFT_53169 [Selaginella moellendorffii]
          Length = 342

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 86/180 (47%), Positives = 120/180 (66%), Gaps = 6/180 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP YF WIHEDL+PW  +GITR+ L+ A +   FRL++V+ K+Y  + ++ IQTRD  T+
Sbjct: 1   CPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEITL 60

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT--MDIV 197
            G+L LL K+PG +PD+E++F C+D P +   +Y    +K PPP+F Y   R+    DI+
Sbjct: 61  QGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEY---RSKPPPPIFSYCTSRFGGHFDIL 117

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCN 256
           FPDWSFWGW ++ I+PWE    E+ EG N  +W +RE  AYWKGN +V    R +LL CN
Sbjct: 118 FPDWSFWGWPQVKIRPWEQESVEIFEGANETDWFEREAIAYWKGNLWVMTPVREELLQCN 177


>gi|147785893|emb|CAN70836.1| hypothetical protein VITISV_015872 [Vitis vinifera]
          Length = 922

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 91/201 (45%), Positives = 123/201 (61%), Gaps = 5/201 (2%)

Query: 60  PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           P T++  E  +  S      CP +F  I  DL PW  +GI+   +  A + A FR+++V 
Sbjct: 545 PTTTKIPEWHTRQSSNTVGKCPMFFTRIXHDLQPWVRSGISLSSVMEAQKFAAFRVVIVG 604

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
            K+Y+  +   +Q+R +FT+WG+LQLLR+YPG +PD++LMFDC D+P I SR+  G    
Sbjct: 605 GKLYVDFFYACVQSRAMFTVWGLLQLLRRYPGTVPDVDLMFDCMDKPTI-SREEHGSK-- 661

Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYW 239
            P PLFRY       DI FPDWSFWGW EI+I PW+     +K+G+   NW  +  YAYW
Sbjct: 662 -PLPLFRYCTTMDHFDIPFPDWSFWGWPEIDIGPWDEEFIGIKQGSQVLNWTQKLSYAYW 720

Query: 240 KGNPFV-AETRRDLLTCNLSD 259
           KGNP V +  R DLL CN SD
Sbjct: 721 KGNPDVQSPVRVDLLQCNNSD 741


>gi|357129812|ref|XP_003566555.1| PREDICTED: O-glucosyltransferase rumi-like [Brachypodium
           distachyon]
          Length = 493

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 111/289 (38%), Positives = 160/289 (55%), Gaps = 32/289 (11%)

Query: 2   IELFAFSIILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRIT---TPLNC 47
           I  F+  I++    S ++  +SR +T++  + +P         + RPP R      PL  
Sbjct: 35  IAAFSAVIVIAVLHSAYDEAVSRTRTLLGHNLEPTPWHPFPHDKGRPPPRAALRCAPLLS 94

Query: 48  VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV---TGITRDML 104
                     + P  S+++          T  CP YF  IH DL+PW+     GITR++L
Sbjct: 95  CLPPLSHPHPSPPNASRSRR---------TKQCPAYFAAIHRDLAPWRRHGHGGITRELL 145

Query: 105 ERANQTAHFRLILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
           + A   A  R+ +  N  ++++  Y   +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC
Sbjct: 146 DSARSRASMRVTITGNGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDC 205

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            DRP I +R   GPN   PPPLFRY   +  +DI FPDWSFWGW E +I PW    R +K
Sbjct: 206 MDRPAI-NRTTGGPNPPLPPPLFRYCTTKDHLDIPFPDWSFWGWPETHINPWAKEFRAIK 264

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
           +G+    W DR P A+WKGNP VA   R  LL CN  D + W+A++  Q
Sbjct: 265 QGSRRVKWGDRVPLAFWKGNPDVASPLRLALLACN--DTNLWHAQIMRQ 311


>gi|302773059|ref|XP_002969947.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300162458|gb|EFJ29071.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 325

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 85/180 (47%), Positives = 120/180 (66%), Gaps = 6/180 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP YF WIHEDL+PW  +GITR+ L+ A +   FRL++V+ K+Y  + ++ IQTRD  T+
Sbjct: 2   CPSYFAWIHEDLAPWSESGITREHLDEARRKGDFRLVIVDGKLYFERIQECIQTRDEITL 61

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT--MDIV 197
            G+L LL K+PG +PD+E++F C+D P +   +Y    +K PPP+F Y   R+    DI+
Sbjct: 62  QGLLLLLEKFPGMIPDMEMVFVCNDFPQVPKDEY---RSKPPPPVFSYCTSRFGGHFDIL 118

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCN 256
           FPDWSFWGW ++ I+PWE    E+ +G N  +W +RE  AYWKGN +V    R +LL CN
Sbjct: 119 FPDWSFWGWPQVKIRPWEQESVEIFDGANETDWFEREAIAYWKGNLWVMTPVREELLQCN 178


>gi|242086909|ref|XP_002439287.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
 gi|241944572|gb|EES17717.1| hypothetical protein SORBIDRAFT_09g003810 [Sorghum bicolor]
          Length = 497

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 109/281 (38%), Positives = 152/281 (54%), Gaps = 23/281 (8%)

Query: 9   IILQSNFSVHN--ISRNKTVIIVSRKP---------QNRPPKRITTPLNCVKNQTQTCPT 57
           I+L    S ++  +SR +T++  + +P         + RPP R    L C  +     P 
Sbjct: 35  IVLAVVHSAYDDALSRTRTLLGHNLEPTPWHPFPHSKGRPPARAA--LRCAPSIACLPPL 92

Query: 58  NYP-KTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV--TGITRDMLERANQTAHFR 114
           ++P  T+    S +  +      CP YF  IH DL+PW+    G+TR +L+ A + A  R
Sbjct: 93  SHPPGTANASSSAAAAAGEQRRQCPAYFAAIHRDLAPWRGPGRGVTRALLDAARRRASMR 152

Query: 115 LILVNN--KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRD 172
           + +     ++++  Y   +Q+R +FT+W +LQL+R+YPGR+PD++LMFDC DRP I   D
Sbjct: 153 VTITGGGRRLHVDLYYACVQSRALFTVWSLLQLMRRYPGRVPDVDLMFDCMDRPAINRTD 212

Query: 173 YSGPNNKGPPP--LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
           +   +   PPP  LFRY   R   DI FPDWSFWGW E NIKPW    + +K G     W
Sbjct: 213 HGDGDPSSPPPPPLFRYCTTRDHFDIPFPDWSFWGWPETNIKPWNREFKSIKMGARATRW 272

Query: 231 IDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
            DR P AYWKGNP VA   R  LL CN  D   W A +  Q
Sbjct: 273 ADRVPTAYWKGNPDVASPLRVALLGCN--DTAAWRAEIMRQ 311


>gi|302821372|ref|XP_002992349.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
 gi|300139892|gb|EFJ06625.1| hypothetical protein SELMODRAFT_44687 [Selaginella moellendorffii]
          Length = 330

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 12/199 (6%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           S CP YF WI  DL+PWK  GIT   L+ A + A FR++++N  +Y+ +Y +  QTRD F
Sbjct: 4   SQCPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDF 63

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRY--SGDRWTM 194
           T+ G+  LL ++PG +PD++LMF C D P I R++   G ++  PPPLF Y  +      
Sbjct: 64  TLRGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAK---GSDDPSPPPLFSYCTTARGENY 120

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           DIVFPDWSFWGW E+NI PWE   +++  G     W  R+ +AYWKGN +    R DL+ 
Sbjct: 121 DIVFPDWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNYWTGAARPDLVK 180

Query: 255 CNLSDKHDWNARLYVQVIS 273
           C        N  L+V +IS
Sbjct: 181 CAA------NKDLFVSMIS 193


>gi|302795177|ref|XP_002979352.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300153120|gb|EFJ19760.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 329

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 12/199 (6%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           S CP YF WI  DL+PWK  GIT   L+ A + A FR++++N  +Y+ +Y +  QTRD F
Sbjct: 2   SQCPAYFSWIDRDLAPWKDHGITLHSLQEAKKRADFRVLILNGTLYMERYHKCFQTRDDF 61

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRY--SGDRWTM 194
           T+ G+  LL ++PG +PD++LMF C D P I R++   G ++  PPPLF Y  +      
Sbjct: 62  TLRGLQLLLDRFPGMVPDVDLMFGCGDFPAIPRAK---GSDDPSPPPLFSYCTTARGENY 118

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           DIVFPDWSFWGW E+NI PWE   +++  G     W  R+ +AYWKGN +    R DL+ 
Sbjct: 119 DIVFPDWSFWGWPEVNILPWEEEKQKIYSGAREEKWSKRQRFAYWKGNYWTGAARPDLVK 178

Query: 255 CNLSDKHDWNARLYVQVIS 273
           C        N  L+V +IS
Sbjct: 179 CAA------NKDLFVSMIS 191


>gi|297745249|emb|CBI40329.3| unnamed protein product [Vitis vinifera]
          Length = 341

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 74/121 (61%), Positives = 88/121 (72%)

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
           YPG+LPD +LMF+C DRP+IR+  Y GP+   PPPLF Y GD  T DIVFPDWSFWGW E
Sbjct: 2   YPGKLPDFDLMFECRDRPMIRTHLYQGPDATVPPPLFHYCGDDETYDIVFPDWSFWGWPE 61

Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
            NIKPW    ++LKEGN    WIDREPYAYWKGN  +   R++L  C  +D+ DWNARLY
Sbjct: 62  TNIKPWNGFKKDLKEGNYRTKWIDREPYAYWKGNVKMGVVRKELFKCRNTDEQDWNARLY 121

Query: 269 V 269
           +
Sbjct: 122 I 122


>gi|302819894|ref|XP_002991616.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
 gi|300140649|gb|EFJ07370.1| hypothetical protein SELMODRAFT_43466 [Selaginella moellendorffii]
          Length = 324

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 82/190 (43%), Positives = 111/190 (58%), Gaps = 9/190 (4%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
           TCP YF+WI  DL+PWK  GITR  LE   + A+FR+I++  K+Y   YKQ  QTR  +T
Sbjct: 2   TCPSYFKWIEHDLAPWK-GGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEYT 60

Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIV 197
           + G+  LL K+PG +PD+++MF+C D P++    Y   +   PPP+F Y   R    DIV
Sbjct: 61  LKGLAMLLNKFPGMVPDVDIMFNCQDHPLVPRWRYLFTS---PPPVFGYCTTRNRHYDIV 117

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN- 256
           FPDWS WGW E+NI PW      +       +W  R+P AYW+GN  +   R +L+ CN 
Sbjct: 118 FPDWSIWGWPEVNILPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQMGLIRSNLVKCNS 177

Query: 257 ---LSDKHDW 263
              L    DW
Sbjct: 178 TNILIQHQDW 187


>gi|302815695|ref|XP_002989528.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
 gi|300142706|gb|EFJ09404.1| hypothetical protein SELMODRAFT_43161 [Selaginella moellendorffii]
          Length = 342

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 72/182 (39%), Positives = 112/182 (61%), Gaps = 5/182 (2%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
           +CP YF  I E L+PW+ TGITR  L+R  +    R+ ++  ++Y+  Y    Q+R VF+
Sbjct: 3   SCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSVFS 62

Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           +WG+L +L ++  R+PD+E + +C DRP++ R   Y G     P P+  Y   R ++DI 
Sbjct: 63  LWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGG----APAPVLSYCSHRHSLDIP 118

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           FPD+SFWGW E+NI+PWE   +E+ +G+    W  R+P A+WKGN  + + R  L  CN 
Sbjct: 119 FPDYSFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLRMGKLRNLLARCNS 178

Query: 258 SD 259
           ++
Sbjct: 179 TE 180


>gi|302761718|ref|XP_002964281.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300168010|gb|EFJ34614.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 342

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 72/184 (39%), Positives = 113/184 (61%), Gaps = 5/184 (2%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDV 136
            ++CP YF  I E L+PW+ TGITR  L+R  +    R+ ++  ++Y+  Y    Q+R V
Sbjct: 1   AASCPRYFARIREHLAPWRSTGITRRSLDRRQRLGSMRVAILGGRMYVRAYGDCPQSRSV 60

Query: 137 FTIWGILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           F++WG+L +L ++  R+PD+E + +C DRP++ R   Y G     P P+  Y   R ++D
Sbjct: 61  FSLWGLLLMLERFGDRVPDVEFVLNCKDRPIVPRDGSYGGV----PSPVLSYCSHRHSLD 116

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           I FPD+SFWGW E+NI+PWE   +E+ +G+    W  R+P A+WKGN  + + R  L  C
Sbjct: 117 IPFPDYSFWGWPEVNIRPWEQESQEIFQGSQDVEWNKRQPLAFWKGNLRMGKLRNLLARC 176

Query: 256 NLSD 259
           N ++
Sbjct: 177 NSTE 180


>gi|302776736|ref|XP_002971516.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300160648|gb|EFJ27265.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 318

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 81/190 (42%), Positives = 109/190 (57%), Gaps = 9/190 (4%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFT 138
           TCP YF WI  DL+PWK  GITR  LE   + A+FR+I++  K+Y   YKQ  QTR  +T
Sbjct: 3   TCPSYFNWIEHDLAPWK-GGITRKALEAGKEKAYFRVIILGGKLYTQTYKQCFQTRAEYT 61

Query: 139 IWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIV 197
           + G+  LL ++PG +PD+++MF+C D P++    Y       PPP+F Y   R    DIV
Sbjct: 62  LKGLAMLLNEFPGMVPDVDIMFNCQDHPLVPRWRYLF---TSPPPVFGYCTTRNRHYDIV 118

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN- 256
           FPDWS WGW E+NI PW      +       +W  R+P AYW+GN  +   R +L+ CN 
Sbjct: 119 FPDWSIWGWPEVNIPPWSIESERIFTEAEKIDWFRRKPIAYWRGNTQMGLIRSNLVKCNS 178

Query: 257 ---LSDKHDW 263
              L    DW
Sbjct: 179 TNILIQHQDW 188


>gi|302815579|ref|XP_002989470.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
 gi|300142648|gb|EFJ09346.1| hypothetical protein SELMODRAFT_44697 [Selaginella moellendorffii]
          Length = 329

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 113/179 (63%), Gaps = 8/179 (4%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + CP YF  IHE L+PW+ TGITR  L+R       R+ ++ + +Y+  Y    Q+R  F
Sbjct: 1   ALCPRYFARIHEHLAPWRSTGITRRSLDR---LGSMRVAILRSDMYVSAYGDCPQSRRAF 57

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           ++WG+L +L ++  R+PD+E + +C DRP++  RD S    + P P+  Y   R ++DI 
Sbjct: 58  SLWGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGS----EVPAPVLSYCSHRLSLDIP 112

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           FPD+SFWGW E+NI+PWE   +E+ +G+ G  W +R+P+A+WKGN  + + R  L  CN
Sbjct: 113 FPDYSFWGWPEVNIRPWEEESQEIFQGSQGVEWSNRQPFAFWKGNLRMGKLRTLLARCN 171


>gi|302761722|ref|XP_002964283.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300168012|gb|EFJ34616.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 401

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 72/177 (40%), Positives = 105/177 (59%), Gaps = 1/177 (0%)

Query: 81  PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
           P YF  I  DL PWK  GIT+  L  A +    R+++   K+YI  Y +  Q+R +FT W
Sbjct: 58  PKYFDSIDTDLRPWKDGGITKSSLSAARRKGSMRMVISQGKLYIEVYGKCPQSRSIFTAW 117

Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVI-RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
           G+L LL ++PG++PD++ + +C DRPVI R   +    +  PP +F Y      +DI FP
Sbjct: 118 GLLLLLERFPGKVPDVDFVLNCKDRPVITRFLSFQRFVSGSPPAVFSYCTTNDMLDIPFP 177

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           D+SFWGW E++I PWE   +++  G+    W +R P A+WKGN  + + RR LL C 
Sbjct: 178 DFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSRMGKLRRHLLQCQ 234


>gi|224059286|ref|XP_002299807.1| predicted protein [Populus trichocarpa]
 gi|222847065|gb|EEE84612.1| predicted protein [Populus trichocarpa]
          Length = 333

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 75/165 (45%), Positives = 105/165 (63%), Gaps = 7/165 (4%)

Query: 107 ANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRP 166
           A   A FR+++   ++Y+  Y   +Q+R +FTIWG LQLL++YPG +PD+++MFDC D+P
Sbjct: 1   AKNYASFRVVIYKGRLYLDPYYACVQSRMMFTIWGFLQLLKRYPGMVPDVDIMFDCMDKP 60

Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
            I   ++    +  P PLFRY   +   DI FPDWSFWGW E+NI+PW+   R++K G  
Sbjct: 61  SINKTEH----DSFPLPLFRYCTTKDHFDIPFPDWSFWGWPEVNIRPWDEEFRDIKRGAQ 116

Query: 227 GRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
            R+W  + P AYWKGNP V   TR  LL CN + K  W A++  Q
Sbjct: 117 ARSWPKKWPRAYWKGNPDVGSPTRTSLLECNHTKK--WGAQIMRQ 159


>gi|147802992|emb|CAN70737.1| hypothetical protein VITISV_008287 [Vitis vinifera]
          Length = 694

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 79/135 (58%), Positives = 88/135 (65%), Gaps = 29/135 (21%)

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           V+T W I  LLR YPGRLPDLELMFDCDDRPV+R RD+ GP N  PPPLFRY GD W++D
Sbjct: 377 VYTGW-IDALLRWYPGRLPDLELMFDCDDRPVVRMRDFRGP-NXAPPPLFRYCGDDWSLD 434

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           IVFPDWSFWGW                            P AYW+GNP VA TR DLL C
Sbjct: 435 IVFPDWSFWGWV---------------------------PLAYWRGNPHVAPTRGDLLKC 467

Query: 256 NLSDKHDWNARLYVQ 270
           N+SDK +WN RLY+Q
Sbjct: 468 NVSDKAEWNTRLYLQ 482


>gi|413919038|gb|AFW58970.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
          Length = 241

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 79/153 (51%), Positives = 102/153 (66%), Gaps = 7/153 (4%)

Query: 61  KTSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLI 116
            T+QT  + +  S PP      CP YFRWIHEDL PW+ TG+TR  +E A + A   R+ 
Sbjct: 89  STNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVA 148

Query: 117 LVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
           +V  ++Y+ +Y  ++  QTR  FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +  
Sbjct: 149 VVAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERH 208

Query: 175 GP-NNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
           G      PPPLFRY G   T+DI FPDWSFWGW
Sbjct: 209 GAQPQPPPPPLFRYCGSEATLDIAFPDWSFWGW 241


>gi|302762508|ref|XP_002964676.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300168405|gb|EFJ35009.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 330

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 72/179 (40%), Positives = 112/179 (62%), Gaps = 8/179 (4%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + CP YF  IHE L+PW+ TGITR  L+R       R+ ++ + +Y+  Y    Q+R  F
Sbjct: 1   ALCPRYFARIHEHLAPWRSTGITRRSLDR---LGSMRVAILRSDMYVSAYGDCPQSRRAF 57

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
           ++WG+L +L ++  R+PD+E + +C DRP++  RD S    + P P+  Y   R ++DI 
Sbjct: 58  SLWGLLLMLERFGDRVPDVEFVLNCKDRPIV-PRDGS----EVPAPVLSYCSHRLSLDIP 112

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           FPD+SFWGW E+NI+PWE   +E+ +G+    W +R+P+A+WKGN  + + R  L  CN
Sbjct: 113 FPDYSFWGWPEVNIRPWEEESQEIFQGSQDVEWSNRQPFAFWKGNLRMGKLRTLLARCN 171


>gi|302802289|ref|XP_002982900.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
 gi|300149490|gb|EFJ16145.1| hypothetical protein SELMODRAFT_53724 [Selaginella moellendorffii]
          Length = 345

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 73/177 (41%), Positives = 109/177 (61%), Gaps = 7/177 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP Y+  + EDL+PWK  GI    LE A + + FR I+ + ++Y+  Y +  QTR +FTI
Sbjct: 2   CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
            GI+QLL+++PG++PD+++ F+C DRP I    +    ++ PPPLF Y   +   DI FP
Sbjct: 62  VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAF----DEAPPPLFGYCSTKNHFDIPFP 117

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           DWSFWGW E  I PW S L+ + +      W DR+    W+G+P  ++ R+ L+ CN
Sbjct: 118 DWSFWGWPENKILPWRSQLKRITQQA---EWKDRDSSVQWRGDPRTSQIRQRLIACN 171


>gi|302800329|ref|XP_002981922.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
 gi|300150364|gb|EFJ17015.1| glycosyltransferase, CAZy family GT90 [Selaginella moellendorffii]
          Length = 345

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 74/177 (41%), Positives = 108/177 (61%), Gaps = 7/177 (3%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           CP Y+  + EDL+PWK  GI    LE A + + FR I+ + ++Y+  Y +  QTR +FTI
Sbjct: 2   CPAYYSRVFEDLAPWKEKGIQEHDLETARKHSAFRAIVRDGRLYVELYYRCFQTRMMFTI 61

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
            GI+QLL+++PG++PD+++ F+C DRP I    +    ++ PPPLF Y   +   DI FP
Sbjct: 62  VGIMQLLQRFPGQIPDVDIFFNCQDRPQITKSAF----DEAPPPLFGYCSTKNHFDIPFP 117

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           DWSFWGW E  I PW S   +LK       W DR+    W+G+P  ++ R+ L+ CN
Sbjct: 118 DWSFWGWPENKILPWRS---QLKRITRQAEWKDRDSSVQWRGDPRTSQIRQRLIACN 171


>gi|222634972|gb|EEE65104.1| hypothetical protein OsJ_20160 [Oryza sativa Japonica Group]
          Length = 498

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 81/235 (34%), Positives = 112/235 (47%), Gaps = 51/235 (21%)

Query: 37  PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV 96
           PP     P +C               S    S       P ++CPDYFR+IH+DL PW+ 
Sbjct: 108 PPLMTLPPFSCGNGTAAPATCPATPPSPPPTSKPATGGEPAASCPDYFRYIHDDLRPWRG 167

Query: 97  TGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
            GITR+ +ER  + A+FRL++V+ + Y+  Y++S QTRD F                   
Sbjct: 168 AGITREAVERGRRHAYFRLVVVSGRAYVETYRRSYQTRDAF------------------- 208

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
                                          + D  T+D+VFPDWSFWGW E+NI  W +
Sbjct: 209 -------------------------------TQDATTLDVVFPDWSFWGWPEVNIGAWPA 237

Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN-LSDKHDWNARLYVQ 270
            L  ++  +    W +REP+A+WKGNP VA  R +L+ CN  SD  DWNARL+ Q
Sbjct: 238 TLEAVRRESARVRWPEREPFAFWKGNPGVARIRGELMKCNPASDGKDWNARLFSQ 292


>gi|223950283|gb|ACN29225.1| unknown [Zea mays]
          Length = 199

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 78/152 (51%), Positives = 101/152 (66%), Gaps = 7/152 (4%)

Query: 61  KTSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLI 116
            T+QT  + +  S PP      CP YFRWIHEDL PW+ TG+TR  +E A + A   R+ 
Sbjct: 44  STNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVA 103

Query: 117 LVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
           +V  ++Y+ +Y  ++  QTR  FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +  
Sbjct: 104 VVAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERH 163

Query: 175 GP-NNKGPPPLFRYSGDRWTMDIVFPDWSFWG 205
           G      PPPLFRY G   T+DI FPDWSFWG
Sbjct: 164 GAQPQPPPPPLFRYCGSEATLDIAFPDWSFWG 195


>gi|302815691|ref|XP_002989526.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
 gi|300142704|gb|EFJ09402.1| hypothetical protein SELMODRAFT_130002 [Selaginella moellendorffii]
          Length = 401

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 72/182 (39%), Positives = 103/182 (56%), Gaps = 9/182 (4%)

Query: 81  PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW 140
           P YF  I  DL PWK  GIT+  L  A +    R+++   K+YI  Y +  Q+R +FT W
Sbjct: 56  PKYFDSIDTDLRPWKDGGITKSSLSAARKKGSMRMVISQGKLYIEVYGKCPQSRSIFTAW 115

Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVI------RSRDYSGPNNKGPPPLFRYSGDRWTM 194
           G+L LL ++PG++PD++ + +C DRPVI       SRD         P +F Y      +
Sbjct: 116 GLLLLLERFPGKVPDVDFVLNCKDRPVITRYSSFHSRDLC---QDEAPAVFSYCTTNDML 172

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           DI FPD+SFWGW E++I PWE   +++  G+    W +R P A+WKGN  + + R  LL 
Sbjct: 173 DIPFPDFSFWGWPEVDIPPWEEQSQQITAGSREVKWSERRPAAFWKGNSRMGKLRSHLLQ 232

Query: 255 CN 256
           C 
Sbjct: 233 CQ 234


>gi|413919039|gb|AFW58971.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
 gi|413919040|gb|AFW58972.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
          Length = 302

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 77/150 (51%), Positives = 100/150 (66%), Gaps = 7/150 (4%)

Query: 62  TSQTQESISDYSIPP---TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLIL 117
           T+QT  + +  S PP      CP YFRWIHEDL PW+ TG+TR  +E A + A   R+ +
Sbjct: 88  TNQTPSAGAASSAPPPREVPDCPAYFRWIHEDLRPWRGTGVTRGAVEGARRFAPRLRVAV 147

Query: 118 VNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
           V  ++Y+ +Y  ++  QTR  FT WGILQLLR+YPGR+PDL+LMFDC+D PV+ + +  G
Sbjct: 148 VAGRLYVARYGRRRCFQTRAAFTQWGILQLLRRYPGRVPDLDLMFDCEDLPVVGAGERHG 207

Query: 176 P-NNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
                 PPPLFRY G   T+DI FPDWSFW
Sbjct: 208 AQPQPPPPPLFRYCGSEATLDIAFPDWSFW 237


>gi|297745250|emb|CBI40330.3| unnamed protein product [Vitis vinifera]
          Length = 312

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 67/100 (67%), Positives = 79/100 (79%), Gaps = 1/100 (1%)

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
           RD+ GP+   PPPLFRY GD W++DIVFPDWSFWGWAE NIKPW ++L+++KEGN    W
Sbjct: 2   RDFRGPS-AAPPPLFRYCGDDWSLDIVFPDWSFWGWAETNIKPWRNVLKDIKEGNRRTKW 60

Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
            DR P AYW+GNP VA TR DLL CN+SDK DWN RLY+Q
Sbjct: 61  KDRVPLAYWRGNPHVAPTRGDLLKCNVSDKADWNTRLYLQ 100


>gi|297745253|emb|CBI40333.3| unnamed protein product [Vitis vinifera]
          Length = 325

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 67/111 (60%), Positives = 78/111 (70%), Gaps = 1/111 (0%)

Query: 159 MFDCDDRPVIRSRDYSGPNNKGP-PPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
           MF+C DRP I++ DY G   K   PPLF Y     T+DIVFPDWSFWGW EINIKPW SL
Sbjct: 1   MFECGDRPRIKATDYGGRKGKKKVPPLFHYCASDDTLDIVFPDWSFWGWPEINIKPWNSL 60

Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLY 268
            +EL+EGNN   W+DREPYAYWKGN   +  R+ L  C  S+ HDWNAR+Y
Sbjct: 61  RKELEEGNNRTKWMDREPYAYWKGNIRTSGNRQALFKCRPSNNHDWNARVY 111


>gi|6850895|emb|CAB71058.1| putative protein [Arabidopsis thaliana]
          Length = 338

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 74/124 (59%), Positives = 81/124 (65%), Gaps = 2/124 (1%)

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG--PPPLFRYSGDRWTMDIVFPDWSFWGW 206
           YPG++PDLEL+F C DRP I  RD          PPPLF Y G R   DIVFPDWSFWGW
Sbjct: 2   YPGQIPDLELLFLCHDRPAIWKRDLKKKRKDTWPPPPLFHYCGHRDAYDIVFPDWSFWGW 61

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
            E+NIK W  L   LKEGN    W DR PYAYWKGNP V+  R DL+ CN SDK+D   R
Sbjct: 62  PELNIKEWNKLSVALKEGNKKVKWEDRVPYAYWKGNPHVSPIRGDLMRCNFSDKYDPMVR 121

Query: 267 LYVQ 270
           LYVQ
Sbjct: 122 LYVQ 125


>gi|413919042|gb|AFW58974.1| hypothetical protein ZEAMMB73_087617 [Zea mays]
          Length = 316

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 67/113 (59%), Positives = 78/113 (69%), Gaps = 1/113 (0%)

Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPP-LFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESL 217
           MFDC+D PV+ + +  G   + PPP LFRY G   T+DI FPDWSFWGW E+NIKPWE+L
Sbjct: 1   MFDCEDLPVVGAGERHGAQPQPPPPPLFRYCGSEATLDIAFPDWSFWGWPELNIKPWEAL 60

Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQ 270
            RE+ E N   NW DR PYAYWKGNP V   R  LL CN S + DWNAR+Y Q
Sbjct: 61  QREINEANAVVNWTDRAPYAYWKGNPKVGAERLLLLRCNASGERDWNARVYAQ 113


>gi|67633610|gb|AAY78729.1| hypothetical protein At2g45840 [Arabidopsis thaliana]
          Length = 337

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 69/126 (54%), Positives = 75/126 (59%), Gaps = 9/126 (7%)

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFW 204
           YPG++PDLEL+F C D P I  RDY    G N    PPPLF Y G     DIVFPDWSFW
Sbjct: 2   YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61

Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
           GW EINIK W      + EG     W +REPYAYWKGNP VA  RRDL+ C     HD  
Sbjct: 62  GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPM 116

Query: 265 ARLYVQ 270
             LY Q
Sbjct: 117 VHLYRQ 122


>gi|30267795|gb|AAP21678.1| hypothetical protein [Arabidopsis thaliana]
          Length = 337

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 69/126 (54%), Positives = 75/126 (59%), Gaps = 9/126 (7%)

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYS---GPNNK-GPPPLFRYSGDRWTMDIVFPDWSFW 204
           YPG++PDLEL+F C D P I  RDY    G N    PPPLF Y G     DIVFPDWSFW
Sbjct: 2   YPGQVPDLELLFMCHDSPEIWRRDYRPRPGVNVTWPPPPLFHYCGHSGAFDIVFPDWSFW 61

Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
           GW EINIK W      + EG     W +REPYAYWKGNP VA  RRDL+ C     HD  
Sbjct: 62  GWPEINIKEWNKQSELISEGIKKVKWEEREPYAYWKGNPGVAMVRRDLMHC-----HDPM 116

Query: 265 ARLYVQ 270
             LY Q
Sbjct: 117 VHLYRQ 122


>gi|356561649|ref|XP_003549092.1| PREDICTED: receptor-like protein 12-like [Glycine max]
          Length = 819

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/174 (41%), Positives = 85/174 (48%), Gaps = 60/174 (34%)

Query: 39  KRITTPLNCVK-NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
           ++   PLNC   N T TCPTN      T         P T+TCPDYFRWIHEDL PW  T
Sbjct: 700 RKPIAPLNCTAYNLTGTCPTNLQDHQST---------PATATCPDYFRWIHEDLRPWART 750

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
           GIT+DM+ERA QTA+FR                                  YPG++PDLE
Sbjct: 751 GITQDMVERAKQTANFR----------------------------------YPGKIPDLE 776

Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINI 211
           LMFDC D PV+ S  Y+GP                T+DIVFPD S+     I+I
Sbjct: 777 LMFDCVDWPVVLSDRYNGP----------------TLDIVFPDSSYTSIIYISI 814


>gi|125583038|gb|EAZ23969.1| hypothetical protein OsJ_07696 [Oryza sativa Japonica Group]
          Length = 279

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 48/71 (67%), Positives = 57/71 (80%)

Query: 182 PPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           PPLFRY GD  T+D+VFPDWSFWGW EINIKPW++L ++L  GN    W+DREPYAYWKG
Sbjct: 2   PPLFRYCGDDETLDVVFPDWSFWGWPEINIKPWDALQKDLDIGNKRVKWVDREPYAYWKG 61

Query: 242 NPFVAETRRDL 252
           NP VA  R++L
Sbjct: 62  NPDVATKRKEL 72


>gi|302143882|emb|CBI22743.3| unnamed protein product [Vitis vinifera]
          Length = 309

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/63 (74%), Positives = 53/63 (84%)

Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
           EINIKPWESLL++LKEGN    W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+
Sbjct: 37  EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARV 96

Query: 268 YVQ 270
           Y Q
Sbjct: 97  YTQ 99


>gi|359490428|ref|XP_003634087.1| PREDICTED: O-glucosyltransferase rumi-like [Vitis vinifera]
          Length = 288

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/63 (74%), Positives = 53/63 (84%)

Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARL 267
           EINIKPWESLL++LKEGN    W++REPYAYWKGNP VA TR DLL CN+SDK DWNAR+
Sbjct: 16  EINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVSDKQDWNARV 75

Query: 268 YVQ 270
           Y Q
Sbjct: 76  YTQ 78


>gi|356577934|ref|XP_003557076.1| PREDICTED: LOW QUALITY PROTEIN: protein O-glucosyltransferase
           1-like, partial [Glycine max]
          Length = 268

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 42/64 (65%), Positives = 50/64 (78%)

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
           +EINIKPW  LL  LKEG     W++REPYAYWKGNP VAETR+DL+ CN+S+  DWNAR
Sbjct: 1   SEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAVAETRQDLIKCNVSENQDWNAR 60

Query: 267 LYVQ 270
           L+ Q
Sbjct: 61  LFAQ 64


>gi|356561560|ref|XP_003549049.1| PREDICTED: uncharacterized protein LOC100818683 [Glycine max]
          Length = 289

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 42/64 (65%), Positives = 49/64 (76%)

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
           AEINIKPW  LL  LKEG     W++REPYAYWKGNP VAETR+DL+ CN+ +  DWNAR
Sbjct: 14  AEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPAVAETRQDLIKCNVYENQDWNAR 73

Query: 267 LYVQ 270
           L+ Q
Sbjct: 74  LFAQ 77


>gi|147802993|emb|CAN70738.1| hypothetical protein VITISV_008288 [Vitis vinifera]
          Length = 204

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 59/109 (54%), Gaps = 17/109 (15%)

Query: 52  TQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA 111
            Q CP N+P TS T +   +        CP+YFRWIHEDL PWK TGI+R  +E A   A
Sbjct: 2   AQACPANHPTTSVTGKLSXE-------ACPEYFRWIHEDLRPWKSTGISRFAVESAXGDA 54

Query: 112 HFRLILVNNKVYIHKYKQ----------SIQTRDVFTIWGILQLLRKYP 150
            FRL++VN K Y+ +Y++          S+  + V     +  L++ YP
Sbjct: 55  DFRLVIVNGKAYVEQYRKYKIYVEGWGWSVSEKYVLACDSMTLLIKPYP 103


>gi|124360768|gb|ABN08742.1| Lipopolysaccharide-modifying protein [Medicago truncatula]
          Length = 253

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 33/62 (53%), Positives = 46/62 (74%), Gaps = 1/62 (1%)

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNAR 266
           AE  IKPWE +L++++E N    W DR PYA+WKGN F++  R +L  CN++D+HDWNAR
Sbjct: 6   AETRIKPWEKVLKDIQESNKKIKWKDRIPYAFWKGN-FLSNPRHELRKCNVTDQHDWNAR 64

Query: 267 LY 268
           +Y
Sbjct: 65  VY 66


>gi|412992075|emb|CCO20801.1| glycosyltransferase CAZy GT90 [Bathycoccus prasinos]
          Length = 438

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 96/215 (44%), Gaps = 31/215 (14%)

Query: 86  WIHEDLSPWKVTGITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQ-TRDVFTIWGI 142
           WI  DL PW +TGIT+ M++ A Q      R+ ++  K+Y    K S   +R  + +WG+
Sbjct: 74  WIDVDLQPWSMTGITKRMVDLAAQQGMRANRIQIIGGKIYAQISKSSRGPSRIWYWLWGL 133

Query: 143 LQLLRKYPGR-LPDLELMFDCDDRPVI--------------RSRDYS-GPNNKGPPPLFR 186
           ++L+ ++P   +PD++ + +  D P +              + RD+  G   + PPP+F 
Sbjct: 134 MELIDEFPEEAVPDVDFILNTQDDPQVSIVGKRPKNPILAKKYRDFVPGIKGQAPPPVFS 193

Query: 187 YSGDRWTMDIVFPDWSFWG---------WAEINIKPWESLLRELKEGNNGRNWIDREPY- 236
                   D+++P W+ WG             +  PW+ L  +L        W +R    
Sbjct: 194 AVTTSNNYDLLWPLWTIWGEDVEGAGSKTGGFHDPPWKELHPKLIHFAKKNKWSERRSER 253

Query: 237 AYWKGNPFVAETRRDLLTC--NLSDKHDWNARLYV 269
            +W+G+      RR L+ C  N  D  D   +L V
Sbjct: 254 IFWRGSVKTNPARRALIRCSKNTVDAADVQHKLRV 288


>gi|255630857|gb|ACU15791.1| unknown [Glycine max]
          Length = 194

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/69 (52%), Positives = 45/69 (65%), Gaps = 7/69 (10%)

Query: 192 WTMDIVFP-------DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
           +T ++ FP       D +    AEINIKPW  LL  LKEG     W++REPYAYWKGNP 
Sbjct: 24  FTQNLCFPQCYKSCFDLAAGEGAEINIKPWHILLGGLKEGTTRIPWLNREPYAYWKGNPA 83

Query: 245 VAETRRDLL 253
           VA+TR+DL+
Sbjct: 84  VAQTRQDLM 92


>gi|13937171|gb|AAK50079.1|AF372939_1 At1g63420/F2K11_19 [Arabidopsis thaliana]
 gi|21700873|gb|AAM70560.1| At1g63420/F2K11_19 [Arabidopsis thaliana]
          Length = 228

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 30/51 (58%), Positives = 41/51 (80%), Gaps = 1/51 (1%)

Query: 221 LKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNARLYVQ 270
           ++EG   + +++R+ YAYWKGNPFVA  +R DLLTCNLS  HDWNAR+++Q
Sbjct: 1   MEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHDWNARIFIQ 51


>gi|312384008|gb|EFR28848.1| hypothetical protein AND_02694 [Anopheles darlingi]
          Length = 406

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 103/220 (46%), Gaps = 29/220 (13%)

Query: 38  PKRITTPLNCVKNQTQTCPTNYPKT------SQTQESISDYSIPPTSTCPDYFRWIHEDL 91
           P     P++   +  ++     PK       S    ++   S      C  +   + +DL
Sbjct: 28  PASAAVPVSVTTSAKRSSAIRGPKKFTPLPDSVLSRNLGSESTTAIDGCSCHTGVLKKDL 87

Query: 92  SPWKVTGITRDMLERANQTA-HFRLILVNNKVYIHKYKQSIQTRDVF--TIWGILQLLRK 148
            P+K  GIT++M+ERA Q   H+++I  ++K+Y        QT  +F     GI   ++ 
Sbjct: 88  KPFKADGITKEMIERAKQYGTHYQVI--DHKLY-------RQTECMFPARCSGIEHFVKP 138

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWA 207
              +LPD++L+ +C D P +  R +    NK   P+F +S     +DI++P W+FW G  
Sbjct: 139 LLPKLPDMDLIINCRDWPQVH-RHW----NKEKTPVFSFSKTDEYLDIMYPAWAFWEGGP 193

Query: 208 EINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
            I++ P     W+     + +      W +++P A+++G+
Sbjct: 194 AISLYPTGLGRWDQHRESISQAAERVPWKNKKPMAFFRGS 233


>gi|388579121|gb|EIM19449.1| hypothetical protein WALSEDRAFT_34020 [Wallemia sebi CBS 633.66]
          Length = 471

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 83/173 (47%), Gaps = 21/173 (12%)

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQS-IQTRDVFTIWGILQLLRKYPGRLPDL 156
           GIT++ LE+A+  AH R+++ NN++Y+ +Y+   + TR + T   I + +   PG +P++
Sbjct: 140 GITKEHLEKASDKAHGRVVIHNNRMYVKEYRHGDVNTRAMATFAAIHEAVLTSPGAIPNV 199

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRY--SGDR---WTMDIVFPDWSFWGWAEINI 211
           E  F   D         +G +   P P F    + D+   W M    PD+ FW W E  +
Sbjct: 200 EFTFQIQD---------AGDSYDEPIPTFVLDRTADQPELWLM----PDFGFWSWPEPKV 246

Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
             +  +  +  +  +  +W ++ P  +W+G       R  L+  + +  H W+
Sbjct: 247 GSYVEVRDKAGKWESKHSWSEKLPKVFWRGASLGLPIRDQLV--DAARGHAWS 297


>gi|194744000|ref|XP_001954486.1| GF18286 [Drosophila ananassae]
 gi|190627523|gb|EDV43047.1| GF18286 [Drosophila ananassae]
          Length = 411

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 104/226 (46%), Gaps = 43/226 (19%)

Query: 41  ITTPLNCVKNQTQTCPTNYPKTS-----QTQESISDYSIPPT----STCPDYFRWIHEDL 91
           + +P      Q+++   N+ + S     Q +++++DY    T    S C  +   I  DL
Sbjct: 25  LCSPDEKSCGQSESLDGNHDEFSFKIRRQIKKAVADYKPCSTDDEDSKCACHAAVIKHDL 84

Query: 92  SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLL 146
           +P+K TG+TR M+E+A +    +  + NN++Y          RD   ++     GI   L
Sbjct: 85  APYKATGVTRQMIEKAGEYGT-KYKIFNNRLY----------RDANCMFPSRCQGIEHFL 133

Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWTMDIVFPDWS 202
                 LP+++L        VI +RDY   N    + G  P+F +S  +  MDI++P W+
Sbjct: 134 LPLTASLPNMDL--------VINTRDYPQLNTAWGSSGRGPIFSFSKTKEYMDIMYPAWT 185

Query: 203 FW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
           FW G     + P     W+ +  +L++ +    W  +    +++G+
Sbjct: 186 FWAGGPATKLHPRGIGRWDQMREKLEKRSAAIPWSQKRELGFFRGS 231


>gi|242065946|ref|XP_002454262.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
 gi|241934093|gb|EES07238.1| hypothetical protein SORBIDRAFT_04g027705 [Sorghum bicolor]
          Length = 173

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 40/57 (70%), Gaps = 1/57 (1%)

Query: 79  TCPDYFRWIHEDLSPWKVTG-ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
           +CP YF +IHEDL PW+  G ITR ML+RA+ TA FR +++  + Y+H+ + + Q R
Sbjct: 89  SCPSYFCFIHEDLRPWRAAGGITRAMLDRAHLTATFRFVVLEGRAYVHRLRPAFQNR 145


>gi|384245503|gb|EIE18997.1| hypothetical protein COCSUDRAFT_54837 [Coccomyxa subellipsoidea
           C-169]
          Length = 520

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/206 (23%), Positives = 85/206 (41%), Gaps = 38/206 (18%)

Query: 86  WIHEDLSPWKVTGI----TRDMLERANQT--AHFRLILVNNKVYIHKYKQ---------- 129
           WI +D + W+ TGI      +M  R  +     FR  ++N  +++    +          
Sbjct: 88  WIKQDFAQWEKTGIKMSAVTEMALRYRECFGEVFRFQIINGTLWVDHISERHSGWYPSRM 147

Query: 130 -----SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPL 184
                S + +  + I  ++  LR +PG++PD++ +    D P +  +    P N  PPP+
Sbjct: 148 GAGSLSAKGKIPYAILALMDTLRHHPGQIPDIDAVIQTSDFPCMLRQQ---PGNTPPPPV 204

Query: 185 FRYSGDRWTMDIVFPDWSFWGWA--------EINIKPWESLLRELKEGNNGRNWIDREPY 236
           F Y+     +DI FPD+++WG           + +  WE   + L E    +    R+P 
Sbjct: 205 FGYNSHARFVDIPFPDYTYWGHEYHRLVDEDGLLLFGWEKQFKLLSEKWREKEIASRKPQ 264

Query: 237 AYWKGN------PFVAETRRDLLTCN 256
             W+G       P   E RR    C 
Sbjct: 265 VIWRGRTEDKEYPKRDELRRQFARCG 290


>gi|195331147|ref|XP_002032264.1| GM23615 [Drosophila sechellia]
 gi|194121207|gb|EDW43250.1| GM23615 [Drosophila sechellia]
          Length = 411

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 51/214 (23%), Positives = 93/214 (43%), Gaps = 31/214 (14%)

Query: 46  NCVKNQTQTCPTN---YPKTSQTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTG 98
           +C +N+++    +   +    Q +++ +DY    S P  S C  +   +  DL+P+K TG
Sbjct: 32  SCAQNESEQINEDEFSFKIRRQIEKANADYKPCSSDPQDSDCSCHADVLKRDLAPYKSTG 91

Query: 99  ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
           +TR M+E + +           K+Y H+  +           GI   L      LPD++L
Sbjct: 92  VTRQMIESSARYG------TKYKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL 145

Query: 159 MFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
                   VI +RDY   N    N    P+F +S  +   DI++P W+FW G     + P
Sbjct: 146 --------VINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHP 197

Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
                W+ +  +L++      W  +    +++G+
Sbjct: 198 RGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGS 231


>gi|195996689|ref|XP_002108213.1| hypothetical protein TRIADDRAFT_18924 [Trichoplax adhaerens]
 gi|190588989|gb|EDV29011.1| hypothetical protein TRIADDRAFT_18924, partial [Trichoplax
           adhaerens]
          Length = 366

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 96/196 (48%), Gaps = 27/196 (13%)

Query: 67  ESISDYSIPPTSTCPDYFRWIHEDLSPWKVT-GITRDMLERA-NQTAH------FRLILV 118
           +++S +       C  Y + I  DL PW++  GIT+++ ++A NQ +H          ++
Sbjct: 16  KAVSSFKDCKNDRCGCYKKVIDNDLKPWRLKKGITKEIFDKAANQGSHRGAEKGSHYQII 75

Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
           N+KVY H+ + +   R      GI   L+K   +LP+LEL+ +  D P +       P  
Sbjct: 76  NHKVYRHE-RCTFPAR----CKGIEHFLKKIAKKLPNLELIINTHDWPKV-------PKW 123

Query: 179 KGPPPLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
               P+F +S      DI++P WSFW  G A   I P     W+ L + L++ ++   W 
Sbjct: 124 DELLPVFSFSKTHNENDIMYPAWSFWEGGPAVWPIFPNGLGRWDVLRKSLQKASDKWPWD 183

Query: 232 DREPYAYWKGNPFVAE 247
            ++  A+++G+   AE
Sbjct: 184 KKKSIAFFRGSRTSAE 199


>gi|195573000|ref|XP_002104483.1| GD18424 [Drosophila simulans]
 gi|194200410|gb|EDX13986.1| GD18424 [Drosophila simulans]
          Length = 411

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)

Query: 64  QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           Q +++ +DY    S P  S C  +   +  DL+P+K TG+TR M+E + +          
Sbjct: 53  QIEKAKADYKPCSSDPQDSDCSCHADVMKRDLAPYKSTGVTRQMIESSARYG------TK 106

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
            K+Y H+  +           GI   L      LPD++L        VI +RDY   N  
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL--------VINTRDYPQLNAA 158

Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
             N    P+F +S  +   DI++P W+FW G     + P     W+ +  +L++      
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218

Query: 230 WIDREPYAYWKGN 242
           W  +    +++G+
Sbjct: 219 WSQKRSLGFFRGS 231


>gi|21355689|ref|NP_651095.1| rumi, isoform A [Drosophila melanogaster]
 gi|74866179|sp|Q8T045.1|RUMI_DROME RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
 gi|17862464|gb|AAL39709.1| LD29477p [Drosophila melanogaster]
 gi|23172000|gb|AAN13920.1| rumi, isoform A [Drosophila melanogaster]
 gi|220945854|gb|ACL85470.1| rumi-PA [synthetic construct]
 gi|220955678|gb|ACL90382.1| rumi-PA [synthetic construct]
          Length = 411

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/189 (23%), Positives = 83/189 (43%), Gaps = 20/189 (10%)

Query: 64  QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           Q +++ +DY    S P  S C  +   +  DL+P+K TG+TR M+E + +          
Sbjct: 53  QIEKANADYKPCSSDPQDSDCSCHANVLKRDLAPYKSTGVTRQMIESSARYG------TK 106

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
            K+Y H+  +           GI   L      LPD++L+ +  D P +     +   N 
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDLIINTRDYPQLN----AAWGNA 162

Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDR 233
              P+F +S  +   DI++P W+FW G     + P     W+ +  +L++      W  +
Sbjct: 163 AGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQK 222

Query: 234 EPYAYWKGN 242
               +++G+
Sbjct: 223 RSLGFFRGS 231


>gi|307105335|gb|EFN53585.1| hypothetical protein CHLNCDRAFT_136791 [Chlorella variabilis]
          Length = 592

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/189 (23%), Positives = 79/189 (41%), Gaps = 34/189 (17%)

Query: 83  YFRWIHEDLSPWKVTGITRDMLERAN------QTAHFRLILVNNKVYIHKYKQSIQ---- 132
           Y   I  DL PWK +GIT +++ER++           R  ++N  +++H   + ++    
Sbjct: 119 YLDLIRRDLLPWKDSGITHELVERSSMMFDDCDGDMLRFQVLNGSLWVHHITERLEGGWY 178

Query: 133 ------------TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
                        R  + +  +++ LR +PG++PD++ +    D P I       P    
Sbjct: 179 PAPIGPGNAAAKGRVPYAVLALMETLRMFPGQIPDVDAILHFADFPCIPRPRAGAP---- 234

Query: 181 PPPLFRYSGDRWTMDIVFPDWSFWGWAEINI-----KP---WESLLRELKEGNNGRNWID 232
           P P+    G     DI F D+++WG     +     KP   W +    L       + +D
Sbjct: 235 PAPILGLQGSAHHSDIPFSDYTYWGHEHQYLQDPWGKPAHGWGNQAEVLARKYENVSLLD 294

Query: 233 REPYAYWKG 241
           R P A W+G
Sbjct: 295 RIPQASWRG 303


>gi|383858247|ref|XP_003704613.1| PREDICTED: O-glucosyltransferase rumi homolog [Megachile rotundata]
          Length = 391

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 87/191 (45%), Gaps = 18/191 (9%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
           S  +E+  +Y +   +    +   I  DL P+K  GI +D+++ A     F  I V  KV
Sbjct: 53  SMIEEAERNYQLCNNTNNNCFKHIILRDLKPFKEKGINKDLIDAAKARGTFYQI-VKGKV 111

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
           Y  K      +R      GI   L K   +LPD++L+ +  D P  +S  Y G    GP 
Sbjct: 112 YRQK-DCMFPSR----CSGIEHFLLKLAPKLPDMDLVINVRDYP--QSSKYFG----GPL 160

Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
           P+F +S      DI +P W+FW G   I++ P     W+     L + +    W  +E  
Sbjct: 161 PVFSFSKTPQYYDITYPAWAFWEGGPAISLYPRGLGRWDEHRISLDKASKSLPWEKKESK 220

Query: 237 AYWKGNPFVAE 247
           A+++G+   +E
Sbjct: 221 AFFRGSRTSSE 231


>gi|195502713|ref|XP_002098347.1| GE24006 [Drosophila yakuba]
 gi|194184448|gb|EDW98059.1| GE24006 [Drosophila yakuba]
          Length = 411

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)

Query: 64  QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           Q +++ +DY    S P  S C  +   +  DL+P+K TG+TR M+E + +          
Sbjct: 53  QIEKANADYKPCSSDPKDSDCSCHADVLKRDLAPYKSTGVTRQMIESSARYG------TK 106

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
            K+Y H+  +           GI   L      LPD++L        VI +RDY   N  
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDL--------VINTRDYPQLNAA 158

Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
             N    P+F +S  +   DI++P W+FW G     + P     W+ +  +L++      
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218

Query: 230 WIDREPYAYWKGN 242
           W  +    +++G+
Sbjct: 219 WSQKRNLGFFRGS 231


>gi|194910742|ref|XP_001982221.1| GG12485 [Drosophila erecta]
 gi|190656859|gb|EDV54091.1| GG12485 [Drosophila erecta]
          Length = 411

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 28/193 (14%)

Query: 64  QTQESISDY----SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN 119
           Q +++ +DY    S P  S C  +   +  DL+P+K TG+TR M+E + +          
Sbjct: 53  QIEKANADYKPCSSDPKDSDCSCHAAVLKRDLAPYKSTGVTRQMIESSARYG------TK 106

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-- 177
            K+Y H+  +           GI   L      LPD++L        VI +RDY   N  
Sbjct: 107 YKIYGHRLYRDANCMFPARCEGIEHFLLPLVTTLPDMDL--------VINTRDYPQLNAA 158

Query: 178 --NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRN 229
             N    P+F +S  +   DI++P W+FW G     + P     W+ +  +L++      
Sbjct: 159 WGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIP 218

Query: 230 WIDREPYAYWKGN 242
           W  +    +++G+
Sbjct: 219 WSQKRNLGFFRGS 231


>gi|405122052|gb|AFR96820.1| hypothetical protein CNAG_06994 [Cryptococcus neoformans var.
           grubii H99]
          Length = 422

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 79/170 (46%), Gaps = 27/170 (15%)

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
           GI++ M++ A +  + RL+++NN++++  YK  I TR    I  +   +      LPD++
Sbjct: 81  GISKKMVDEAEEEGNARLVILNNQLFVKAYKGGINTRTQAAIAAVYGTVLTATEPLPDVD 140

Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPW-- 214
                    VI++ D  G N+    P F         D+ + PD+ F+ W E  +  +  
Sbjct: 141 F--------VIQTSDAGGGNH----PHFALCRKANQKDLWLMPDFGFFSWPEPGVGSYSE 188

Query: 215 ---ESLLREL--------KEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
              ++L  EL        K G    +W ++    +W+G+P V E R DLL
Sbjct: 189 VRTKTLDYELDMGLEVDNKLGVTKSDWFNKTQQLFWRGSPMV-EVRNDLL 237


>gi|307104334|gb|EFN52588.1| hypothetical protein CHLNCDRAFT_138604 [Chlorella variabilis]
          Length = 452

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 53/196 (27%), Positives = 84/196 (42%), Gaps = 12/196 (6%)

Query: 69  ISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNNKVYIH 125
           +S  S+   S     F  +  DL P+  +GI     E+A      A  R+ +VN  VYI 
Sbjct: 28  LSKASLNGASYVGPLFSQVARDLEPFNHSGIGLRHAEQAYCQGSKASMRVQVVNGSVYIV 87

Query: 126 KYKQSIQTRDVFTIWGILQLLRKYPGRLPD-LELMFDCDDRPVIRSRDYSGPNNKGP--P 182
               S ++R +     +L L     G LP+ ++ + + +D P +R R    P  +GP   
Sbjct: 88  GESPSYESRMLGIKRQLLHLW--LAGGLPESIDFVVEQEDHPTVRHRSDDCPE-RGPILA 144

Query: 183 PLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN 242
           P    S    +  ++ PD +F GW E    PW  +L  L+       W DR    +++G 
Sbjct: 145 PAKCPSNKSHSHVLLAPDHTFAGWPEARTLPWAEMLPLLQHSAERHPWADRSALLFFRG- 203

Query: 243 PFVAETRRDLLTCNLS 258
              A   R+L   +LS
Sbjct: 204 --AATGDRNLTDSDLS 217


>gi|443694719|gb|ELT95788.1| hypothetical protein CAPTEDRAFT_221044 [Capitella teleta]
          Length = 501

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 53/190 (27%), Positives = 90/190 (47%), Gaps = 22/190 (11%)

Query: 80  CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
           CP  +R I +DLS +    +  + ++ +ER NQ     L   ++ NN++Y   Y + +  
Sbjct: 149 CPAGYRQIKQDLSIFGDIDMKAVKQEAVERFNQRGRHALCHYVVKNNQIYRQTYGEHVGF 208

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + +F    +L L RK    LPD+E   +  D P+ + +      ++GP P+F + G   T
Sbjct: 209 K-MFMDAMLLSLTRKV--HLPDVEFFVNLGDWPLEKRK-----VSEGPLPIFSWCGSDDT 260

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
            DIV P +      E  ++    +  +L   + N G  W ++   A+W+G     E R D
Sbjct: 261 RDIVMPTYDV---TESTLETMGRITLDLLSVQANTGPKWSNKSSVAFWRGRDSRQE-RLD 316

Query: 252 LLTCNLSDKH 261
           L+   LS KH
Sbjct: 317 LV--KLSRKH 324


>gi|328860135|gb|EGG09242.1| hypothetical protein MELLADRAFT_115837 [Melampsora larici-populina
           98AG31]
          Length = 515

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 78/167 (46%), Gaps = 13/167 (7%)

Query: 99  ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
           +T + L+ A +  H R+++  N+VYI +YK     R    +  I + +   P RLP++E 
Sbjct: 200 VTSEQLDEAIKVGHARVLIYENRVYIKEYKGGPGKRTEALLNSIQEAVITSPERLPNIEF 259

Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESL 217
           +    D P       +G   K P  +   + D+   D+ + PD+ F+ W E  +     +
Sbjct: 260 VVKTVDAP-------TGEETKLPLWVLDRTIDQ--EDVWLTPDYGFYSWPEPKVGSMIEV 310

Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
             +  E     +W D+ P A+W+G   + + R  ++   ++  H+WN
Sbjct: 311 RDKCNEIEKKLDWKDKIPKAFWRG-AILVKLREQMI--EIAKGHEWN 354


>gi|195399534|ref|XP_002058374.1| GJ14378 [Drosophila virilis]
 gi|194141934|gb|EDW58342.1| GJ14378 [Drosophila virilis]
          Length = 410

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 101/227 (44%), Gaps = 43/227 (18%)

Query: 42  TTPLNCVKNQTQTC-------PTNYPKTSQTQESISDY--SIPPTST------CPDYFRW 86
           T  + C   +  TC       P    K S+  +SI +   S  P +T      C  Y   
Sbjct: 21  TQKIQCDGAEHTTCRETDGIDPNQVAKHSKITQSIQNALASYEPCTTDVNDVNCTCYAAG 80

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----G 141
           I  DL+ +K TG+TR M+  A +    R  + N ++Y          RD   ++     G
Sbjct: 81  IKRDLALYKSTGVTRKMINDAAKYG-TRYKIYNKQLY----------RDDNCMFPARCQG 129

Query: 142 ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW 201
           I   L +    LP+++L+ +  D P +    +S   + GP  +F +S  +   DI++P W
Sbjct: 130 IEHFLLQLLAELPNMDLVINTRDYPQL----HSAWRHDGP--VFSFSKTKEYRDIMYPAW 183

Query: 202 SFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
           +FW G     + P     W+ +  +L++ ++  +W D++   +++G+
Sbjct: 184 TFWAGGPATKLHPTGIGRWDLMRAKLEKRSSSLSWHDKQELGFFRGS 230


>gi|324516300|gb|ADY46488.1| CAP10 family protein CPIJ013394 [Ascaris suum]
          Length = 381

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 49/186 (26%), Positives = 83/186 (44%), Gaps = 22/186 (11%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAH-FRLILVNNKV 122
           + +E+  +Y     + C  +   I  DLSP++  GI   ML  A  +AH  R  L+  ++
Sbjct: 39  EIEEAERNYRSCDINNCTCFQSQIENDLSPFQ-NGIDESMLLAA--SAHGVRYQLIGGRL 95

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
           +     +  Q        G+   L      LP+ E + +  D P +RS D        P 
Sbjct: 96  F-----RQPQCPFEARCEGVEYFLVHLADSLPNTEFVLNVHDHPQMRSDD-------SPL 143

Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
           P+F +S D   +DI++P WSFW G   I++ P     W     ++ +      W  R+P 
Sbjct: 144 PVFSFSKDMNHIDILYPAWSFWSGGPAISLYPTGIGRWNETSVKITKAAKRIEWAHRKPI 203

Query: 237 AYWKGN 242
           A+++G+
Sbjct: 204 AFFRGS 209


>gi|118783733|ref|XP_001230897.1| AGAP004267-PA [Anopheles gambiae str. PEST]
 gi|193806712|sp|A0NDG6.1|RUMI_ANOGA RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
 gi|116129010|gb|EAU76986.1| AGAP004267-PA [Anopheles gambiae str. PEST]
          Length = 399

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 89/184 (48%), Gaps = 21/184 (11%)

Query: 66  QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLILVNNKVYI 124
           + +++ Y    ++ C  +   +  DL P+K  GIT++M+ RA Q   H+++I        
Sbjct: 57  ETALAGYVACNSTNCNCHADVLKADLKPFKAHGITKEMINRAKQYGTHYQVIG------- 109

Query: 125 HKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPL 184
           HK  +  +        G+   +R     LPD++L+ +C D P I  R +S    K   P+
Sbjct: 110 HKLYRQRECMFPARCSGVEHFVRPLLPLLPDMDLIVNCRDWPQIH-RHWS----KEKIPV 164

Query: 185 FRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAY 238
             +S     +DI++P W+FW G   I + P     W+   + + + +   +W  +EP A+
Sbjct: 165 LSFSKTAEYLDIMYPAWAFWEGGPAIALYPTGLGRWDLHRQTITKASA--DWEAKEPKAF 222

Query: 239 WKGN 242
           ++G+
Sbjct: 223 FRGS 226


>gi|170055423|ref|XP_001863576.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
 gi|193806713|sp|B0X1Q4.1|RUMI_CULQU RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
 gi|167875399|gb|EDS38782.1| KDEL motif-containing protein 1 [Culex quinquefasciatus]
          Length = 403

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 60/291 (20%), Positives = 123/291 (42%), Gaps = 40/291 (13%)

Query: 26  VIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQT------QESISDYSIPPTST 79
           ++I ++  Q            C + Q +    N  K +        +E+++ Y    +S 
Sbjct: 11  LVIFAQNGQTDDGGMCMAKETCTETQQEAPTNNLYKAADNKYITLIEEALAAYKPCESSN 70

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF-- 137
           C  +   +  DL P++ +GIT+D++E A      +  ++ ++++        + RD    
Sbjct: 71  CSCHLDVLKTDLRPFR-SGITQDLIELARSYGT-KYQIIGHRMF--------RQRDCMFP 120

Query: 138 -TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI 196
               G+   +R    +LPD+EL+ +C D P I SR ++   ++ P P+  +S     +DI
Sbjct: 121 ARCSGVEHFIRPNLPKLPDMELIINCRDWPQI-SRHWNA--SREPLPVLSFSKTNDYLDI 177

Query: 197 VFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG-------NP 243
           ++P W FW G   I++ P     W+     +++      W  +   A+++G       +P
Sbjct: 178 MYPTWGFWEGGPAISLYPTGLGRWDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDERDP 237

Query: 244 FV--AETRRDLLTCNLSDKHDWNA---RLYVQVISFIYITHLCSKIPGFNF 289
            V  +  R +L+    +    W +    L+ +    + +   C     FNF
Sbjct: 238 LVLLSRMRPELVDAQYTKNQAWRSPKDTLHAEPAQEVRLEDHCQYKYLFNF 288


>gi|358056081|dbj|GAA97978.1| hypothetical protein E5Q_04658 [Mixia osmundae IAM 14324]
          Length = 1130

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 43/169 (25%), Positives = 78/169 (46%), Gaps = 16/169 (9%)

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
           GI R  ++R    +H R+++  N++YI  ++    TR    I  I + +      +PD+E
Sbjct: 812 GIHRSQIDRGTVASHARILIKRNRLYIKHFRPVTTTRVEAAIALIHEAILSSVEPIPDVE 871

Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF--PDWSFWGWAE-INIKPW 214
           L+   DD         +G +  G P L    G R + ++++  PD+ F+GW E I +  +
Sbjct: 872 LLLHLDD---------TGNSKPGVPMLVL--GRRPSEELLWLMPDFGFYGWPEAIRVGSY 920

Query: 215 ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDW 263
                +  E     +W  ++P A+W+G       R+ L+  N S  + W
Sbjct: 921 IHDYDQTLESETHASWKHKKPTAFWRGASLGLRDRQSLV--NNSRPYGW 967


>gi|242025222|ref|XP_002433025.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212518534|gb|EEB20287.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 407

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/241 (24%), Positives = 102/241 (42%), Gaps = 37/241 (15%)

Query: 70  SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
           S+Y   P S C  +F+ +  DL P+K  GITR+M++          I      YI   K+
Sbjct: 68  SNYKNCPYSNCSCFFKTLKNDLKPFK-NGITREMIDS---------IRSRGTTYIIYEKR 117

Query: 130 SIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFR 186
             + +D        GI   ++K    L + EL+ +  D P I SR +     K   P+F 
Sbjct: 118 LYRDKDCLFPSRCSGIEYFIKKIISHLKNTELIINTRDWPQI-SRHF-----KLFGPVFS 171

Query: 187 YSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWK 240
           +S  +  +DI++P WSFW G   I   P     W+   ++L E ++   W  ++   +++
Sbjct: 172 FSKTQDYLDIMYPAWSFWEGGPAIKTYPTGLGRWDLHRKKLSEESSKWPWNKKKSIGFFR 231

Query: 241 GN---------PFVAETRRDLLTCNLSDKHDWNA---RLYVQVISFIYITHLCSKIPGFN 288
           G+           ++  R DL+    +    W +    L  +    + + H C     FN
Sbjct: 232 GSRTSEKRDVLVLLSRKRPDLIDAQYTKNQAWKSLKDTLGKEPAEVVSLEHHCQYKYLFN 291

Query: 289 F 289
           F
Sbjct: 292 F 292


>gi|405965272|gb|EKC30658.1| KTEL motif-containing protein 1 [Crassostrea gigas]
          Length = 400

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 77/148 (52%), Gaps = 25/148 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNN 120
           + ++++ +Y       C  +   I EDL+PWK  GIT ++   A   N  +H+++  VN+
Sbjct: 54  KIEKAVKEYKDCTQDECSCHRSVIEEDLTPWK-NGITEEVFNTAKSNNYGSHYQI--VNH 110

Query: 121 KVYIHK---YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           K+Y  +   +       + F    IL++++K    LPD+E + +  D P I SR      
Sbjct: 111 KLYREEGCMFPARCSGNEHF----ILEVIKK----LPDMEFVINTRDWPQISSR------ 156

Query: 178 NKGPPPLFRYSGDRW-TMDIVFPDWSFW 204
            + P P+F +S  R  ++DI++P W+FW
Sbjct: 157 -QQPIPVFSFSKVRGQSLDIMYPAWTFW 183


>gi|195113713|ref|XP_002001412.1| GI10779 [Drosophila mojavensis]
 gi|193918006|gb|EDW16873.1| GI10779 [Drosophila mojavensis]
          Length = 406

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/214 (23%), Positives = 94/214 (43%), Gaps = 25/214 (11%)

Query: 42  TTPLNCVKNQTQTCPTN--YPK-TSQTQESISDYSIPPTST----CPDYFRWIHEDLSPW 94
           T    C+KN+      N  + K T   ++++++Y    T      C  Y   I  DL+P+
Sbjct: 25  TGQSTCIKNEEIEVKQNAKFDKFTRNIEKALANYQPCTTDATDVNCTCYAAGIKRDLAPY 84

Query: 95  KVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLP 154
           K TG TR M+E A +           K++  +  +           GI   L +    L 
Sbjct: 85  KSTGFTRKMIEDAAKYG------TRYKIFGKQLFREDNCMFPARCQGIEHFLLQLLPELK 138

Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
           +++L+ +  D P +    +S   +KG  P+F +S     +DI++P W+FW G     + P
Sbjct: 139 NMDLVINTRDYPQL----HSSWQHKG--PVFSFSKTTEYLDIMYPAWTFWAGGPATKLHP 192

Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
                W+ +  +LK+      W +++   +++G+
Sbjct: 193 TGIGRWDLMREKLKKAAKAIPWGEKKEIGFFRGS 226


>gi|195054040|ref|XP_001993934.1| GH18344 [Drosophila grimshawi]
 gi|193895804|gb|EDV94670.1| GH18344 [Drosophila grimshawi]
          Length = 408

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 43/169 (25%), Positives = 75/169 (44%), Gaps = 16/169 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           C  Y   I  DL+P+K  G +R MLE A +           K+Y  K  +          
Sbjct: 69  CTCYAAGIKRDLAPYKSIGFSRQMLEDAAKYG------TRYKIYGQKLFREENCFFPARC 122

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
            GI   L +   +LP+++L+ +  D P +    +S  ++    P+F +S      DI++P
Sbjct: 123 QGIEHFLLELLPQLPNMDLVINTRDYPQL----HSSWSSSRIGPVFSFSKTSEYRDIMYP 178

Query: 200 DWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
            W+FW G     + P     W+ +  +LKE      W ++E   +++G+
Sbjct: 179 AWTFWAGGPATKLHPTGIGRWDLMSGKLKEVTTKIPWQEKEQLGFFRGS 227


>gi|332019905|gb|EGI60366.1| CAP10 family protein [Acromyrmex echinatior]
          Length = 407

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 79/170 (46%), Gaps = 24/170 (14%)

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGIL 143
           I  DL P+   GI+++M+E A     F  I +  K+Y        + +D        GI 
Sbjct: 81  IVRDLRPFTKKGISKEMIEAAKTRGTFYQI-IKGKLY--------REKDCMFPARCAGIE 131

Query: 144 QLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSF 203
             L K  G L D++L+ +  D P  +S +Y G       P+F +S      DI++P W+F
Sbjct: 132 HFLLKIIGNLSDMDLVINTRDYP--QSSEYFG----NAIPVFSFSKTPQYYDIMYPAWAF 185

Query: 204 W-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
           W G   I++ P     W+   + L + +   +W ++E   +++G+   +E
Sbjct: 186 WEGGPAISLYPRGLGRWDQHRKTLNKASLEISWEEKESKGFFRGSRTSSE 235


>gi|449669688|ref|XP_002167607.2| PREDICTED: KDEL motif-containing protein 1-like [Hydra
           magnipapillata]
          Length = 497

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 100/222 (45%), Gaps = 27/222 (12%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQT---AHFRLILVNNKVYIHKYKQSIQTRD 135
            CP  ++ I +DL P+    +  +++E A ++   A     +  NKVY   Y  +I    
Sbjct: 146 ACPVNYQQIEKDLKPFPNINL-ENLIESATKSYNVAFCHYTIKKNKVYRKCY-GTINDFK 203

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +FT   +L + RK   +LPD+E   +  D P+   R         P P+F + G   T D
Sbjct: 204 MFTDAWLLSVARKV--KLPDVEFFTNLGDWPLTTKR-------FNPMPIFSWCGSNDTFD 254

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           +V+P +              SL     +GN G +W  ++P A+++G     ++R++ L  
Sbjct: 255 LVWPTYDLTESTLETFGGRVSLDMTSIQGNTGPSWNHKKPVAFFRGR----DSRQERL-- 308

Query: 256 NLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
           +L ++   NA   V +  + +  H       + +++  ++FF
Sbjct: 309 DLVNRFRKNANFDVGITHYFFFKHDEEKYGPIANRVSFYDFF 350


>gi|380022449|ref|XP_003695058.1| PREDICTED: O-glucosyltransferase rumi homolog [Apis florea]
          Length = 407

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 94/216 (43%), Gaps = 21/216 (9%)

Query: 41  ITTPLNCV--KNQTQTCPTNYPKT-SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
           I +  +CV  KN  +     Y K  +  +E+  DY I   +    +   I  DL P+K  
Sbjct: 30  IGSTEDCVEEKNIYKVLNKQYSKYYNAIEEAEKDYRICNNTNNSCFKDVIINDLKPFKEK 89

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLE 157
           GI +D+++ A     F  I +  K+Y  K      +R      GI   L K    L D++
Sbjct: 90  GINKDLIDTAKIRGTFYQI-IQGKLYRQK-DCMFPSRCA----GIEYFLLKLAPGLTDMD 143

Query: 158 LMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP--- 213
           L+ +  D P  +S  + G     P P+F +S      DI +P W+FW G   I++ P   
Sbjct: 144 LVINVRDYP--QSSKHFGD----PLPIFSFSKTSQYYDITYPAWAFWEGGPAISLYPRGL 197

Query: 214 --WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
             W+     L + +N   W  +E   +++G+   +E
Sbjct: 198 GRWDEHCISLDKASNNTLWEKKENKVFFRGSRTSSE 233


>gi|340719860|ref|XP_003398363.1| PREDICTED: o-glucosyltransferase rumi homolog [Bombus terrestris]
          Length = 404

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 98/221 (44%), Gaps = 31/221 (14%)

Query: 41  ITTPLNCVKNQTQTCPTNYPKT-----SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK 95
           I  P NC++ +      +  K      +  +E+  +Y I   +    Y   I  DL P+K
Sbjct: 27  IDNPENCIEKKDNIYKKDLNKQYIKYYNAIEEAQKNYKICNNTNNGCYKDIIINDLKPFK 86

Query: 96  VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGR 152
             GI++D++  A +T      ++  K+Y        + +D        GI   L K    
Sbjct: 87  KKGISKDLINIA-KTRGTVYQIIQGKLY--------RQKDCMFPSRCSGIEHFLLKLAPG 137

Query: 153 LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINI 211
           L D++L+ +  D P  +S  + G    GP P+FR + + +  DI +P W+FW G   I++
Sbjct: 138 LTDMDLVINVRDYP--QSSKHFG----GPLPIFRLTPEYY--DITYPAWAFWEGGPAISL 189

Query: 212 KP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
            P     W+     L + +    W ++E  A+++G+   +E
Sbjct: 190 YPRGLGRWDEHRVSLDKASKNTLWEEKENKAFFRGSRTSSE 230


>gi|148230341|ref|NP_001086860.1| MGC83543 protein precursor [Xenopus laevis]
 gi|50416390|gb|AAH77568.1| MGC83543 protein [Xenopus laevis]
          Length = 386

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 98/220 (44%), Gaps = 29/220 (13%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
           +Q   ++  +S P +     ++R + EDL P++ +GI+RD++    Q    R +  + ++
Sbjct: 31  NQIHTAVQRHSPPDSHNGTCFYRVLQEDLEPFR-SGISRDLM----QNVLSRKLGTHYQI 85

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
             H+  +  +        G+   L +    LPD+EL+ +  D P +       P+   P 
Sbjct: 86  INHRLYREEECMFSARCSGVEHFLLELLPNLPDMELVINVRDYPQV-------PSWMNPV 138

Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
            P+F +S      DI++P W+FW  G A   I P     W+ +  +LK+  +   W  + 
Sbjct: 139 IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLKKAADLWPWEKKI 198

Query: 235 PYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNA 265
           P  Y++G       +P +  +R   DL+    +    W +
Sbjct: 199 PKGYFRGSRTSPDRDPLILLSRESPDLVDAEYTKNQAWKS 238


>gi|222630210|gb|EEE62342.1| hypothetical protein OsJ_17131 [Oryza sativa Japonica Group]
          Length = 267

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/64 (46%), Positives = 36/64 (56%), Gaps = 3/64 (4%)

Query: 208 EINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE-TRRDLLTCNLSDKHDWNAR 266
           E NI+PW    R++KEG     W DR   AYWKGNP VA   R  LL CN  D + W+A 
Sbjct: 10  ETNIEPWSKEFRDIKEGAKAIKWQDRVATAYWKGNPDVASPLRVALLNCN--DTNMWHAE 67

Query: 267 LYVQ 270
           +  Q
Sbjct: 68  IMRQ 71


>gi|322791349|gb|EFZ15836.1| hypothetical protein SINV_05265 [Solenopsis invicta]
          Length = 408

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 87/189 (46%), Gaps = 25/189 (13%)

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGIL 143
           I  DL P++  GI+ +M++ A     F  + +  K+Y        + +D        GI 
Sbjct: 82  ILRDLGPFRTKGISEEMIQAARTRGTFYQV-IKGKLY--------REKDCMFPARCAGIE 132

Query: 144 QLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSF 203
             L K  G L D++L+ +  D P  +S +Y G       P+F +S      DI++P W+F
Sbjct: 133 HFLLKVIGNLSDMDLVINTRDYP--QSSEYFG----NAMPVFSFSKTPQYYDIMYPAWAF 186

Query: 204 W-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNL 257
           W G   I++ P     W+   + L + +    W  +E   +++G+   +E R +L+  + 
Sbjct: 187 WEGGPAISLYPHGLGRWDQHRKSLNKASLEIPWEKKESKGFFRGSRTSSE-RDNLILLSR 245

Query: 258 SDKHDWNAR 266
           S  H  +A+
Sbjct: 246 SKPHLVDAQ 254


>gi|195443898|ref|XP_002069626.1| GK11622 [Drosophila willistoni]
 gi|194165711|gb|EDW80612.1| GK11622 [Drosophila willistoni]
          Length = 377

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 95/215 (44%), Gaps = 35/215 (16%)

Query: 50  NQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ 109
            Q +    NY   S  +  ++         C  + + +++DL+ ++ TG++R M+E + +
Sbjct: 17  QQIENAMANYEPCSSDESDVN---------CSCHEKVLNQDLAAYQSTGVSRQMIESSAR 67

Query: 110 TAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDD 164
               R  +  N++Y          RD   ++     GI   LR+    LP+++L+ +  D
Sbjct: 68  YGT-RYKIYENQLY----------RDEKCMFPARCQGIEHFLRQLLPVLPNMDLIINTRD 116

Query: 165 RPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI------NIKPWESLL 218
            P I +   +  N+ G  P+F +S  +   DI++P W+FW            I  W+ + 
Sbjct: 117 YPQINT---AWGNSVGNGPVFSFSKTKEYRDIMYPAWTFWAGGPATRLHPRGIGRWDLMR 173

Query: 219 RELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            +L++      W  +    +++G+   +E R  L+
Sbjct: 174 EKLEKRAAAIPWSQKRELGFFRGSR-TSEERDSLI 207


>gi|156366125|ref|XP_001626991.1| predicted protein [Nematostella vectensis]
 gi|156213886|gb|EDO34891.1| predicted protein [Nematostella vectensis]
          Length = 412

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 95/207 (45%), Gaps = 28/207 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKV 122
           +  +++S Y       C  Y   I +DL  WK  GI +   + A ++  H+++I  N+ +
Sbjct: 67  KINQAVSTYRGCSNKGCGCYKDVIEDDLRRWK-DGINKSDFDAARSRGTHYQII--NHML 123

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
           Y     +       F   GI   L +   +LPD+E+        +I +RD+      GP 
Sbjct: 124 Y-----REDDCMFPFRCKGIEHFLLEVINKLPDMEI--------IINTRDWPQAAVWGPA 170

Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
            P+F +S  +  MDI++P W+FW  G A   I P     W+ +   L + +    W  +E
Sbjct: 171 LPIFSFSKTKNEMDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREALDKKSQEWPWEKKE 230

Query: 235 PYAYWKGNPFVAETRRDLLTCNLSDKH 261
             A+++G+   AE  RD L   LS KH
Sbjct: 231 SKAFFRGSRTSAE--RDPLVL-LSRKH 254


>gi|442757169|gb|JAA70743.1| Hypothetical protein [Ixodes ricinus]
          Length = 507

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 21/182 (11%)

Query: 90  DLSPWKVTGITRDMLERANQTAHFRLI---LVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
           D     +  + +D L+R ++          +V N+VY   Y Q +   ++F    +L L 
Sbjct: 172 DFEKVDMKKVLQDALQRFDRPGSVSFCHYAVVKNQVYRKCYGQHVGF-NMFMDQILLSLA 230

Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGW 206
           RK    LPD+E++ +  D P+ R +DY G     P P F + G   T DIV P +     
Sbjct: 231 RKVV--LPDVEMLVNLGDWPLER-KDYWGK----PVPFFSWCGSNSTRDIVMPTYDL-TE 282

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-------PFVAETRR--DLLTCNL 257
           + + +    +L     +G+ G  W D+EP  +W+G          VA +RR  +LL  +L
Sbjct: 283 SSLEMMGRVTLDMLSVQGHGGPAWKDKEPSGFWRGRDSRQERLDLVALSRRYPELLNASL 342

Query: 258 SD 259
           ++
Sbjct: 343 TN 344


>gi|291400615|ref|XP_002716870.1| PREDICTED: KTEL (Lys-Tyr-Glu-Leu) containing 1 [Oryctolagus
           cuniculus]
          Length = 462

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/193 (26%), Positives = 84/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ---TAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    Q     H+++I   N
Sbjct: 108 QINRSLENYEPCSSENCSCYHGIIEEDLTPFR-GGISRKMMAEVVQRKLGTHYQII--KN 164

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 165 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 209

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    EL +      
Sbjct: 210 WMEPAIPVFSFSKTSEYYDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVKSAAQWP 269

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 270 WKKKNSTAYFRGS 282


>gi|241671163|ref|XP_002400011.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215506249|gb|EEC15743.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 410

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 65/140 (46%), Gaps = 22/140 (15%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLILVNNKVYIHKYKQSIQTRDV- 136
           TC  +   I  DL  W  +GIT+D+++R+     H+++I  N+K+Y        +  D  
Sbjct: 78  TCSCHSAVIDSDLQLWSESGITKDLVQRSKSRGIHYQII--NHKLY--------RGEDCL 127

Query: 137 --FTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
             F   GI   L +    LPDLE + +  D P    R         P P+F +S      
Sbjct: 128 FPFRCSGIEHFLLEIVDELPDLEFIVNTRDWPQAHKR-------YDPLPVFSFSKTPDYA 180

Query: 195 DIVFPDWSFW-GWAEINIKP 213
           DI++P W+FW G   I++ P
Sbjct: 181 DIMYPAWTFWAGGPAISLYP 200


>gi|134114475|ref|XP_774166.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256799|gb|EAL19519.1| hypothetical protein CNBG4660 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 437

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/181 (24%), Positives = 79/181 (43%), Gaps = 35/181 (19%)

Query: 98  GITRDMLERANQTAHFRLILVNN--------KVYIHKYKQSIQTRDVFTIWGILQLLRKY 149
           GI++ M++ A +  + RL+++NN        ++++  YK  I TR    I  +   L   
Sbjct: 88  GISKKMVDEAEEEGNARLVILNNQLISHLGRQLFVKAYKGGINTRTQAAIAAVYGALLTA 147

Query: 150 PGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAE 208
              LPD++         VI++ D  G N+    P F         D+ + PD+ F+ W E
Sbjct: 148 TEPLPDVDF--------VIQTSDAGGGNH----PHFALCRKADQKDLWLMPDFGFFSWPE 195

Query: 209 INIKPW-----ESLLRELKEG--------NNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
             +  +     ++L  EL  G            +W ++    +W+G+P V E R DLL  
Sbjct: 196 PGVGSYSEVRTKTLDYELDMGLEVDDKLDVTKSDWFNKTQQLFWRGSPMV-EVRNDLLRA 254

Query: 256 N 256
           +
Sbjct: 255 S 255


>gi|350416880|ref|XP_003491149.1| PREDICTED: O-glucosyltransferase rumi homolog [Bombus impatiens]
          Length = 407

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 49/191 (25%), Positives = 86/191 (45%), Gaps = 24/191 (12%)

Query: 66  QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIH 125
           +E+  +Y I   +    Y   I  DL P+K  GI++D++  A +T      ++  K+Y  
Sbjct: 58  EEAQKNYKICNNTNNGCYKDIIINDLKPFKKKGISKDLINIA-KTRGTVYQIIQGKLY-- 114

Query: 126 KYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
                 + +D        GI   L K    L D++L+ +  D P  +S  + G    GP 
Sbjct: 115 ------RQKDCMFPSRCSGIEHFLLKLAPGLTDMDLVINVRDYP--QSSKHFG----GPL 162

Query: 183 PLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPY 236
           P+F +S      DI +P W+FW G   I++ P     W+     L + +    W ++E  
Sbjct: 163 PIFSFSKTPEYYDITYPAWAFWEGGPAISLYPRGLGRWDEHRVSLDKASKNTLWEEKENK 222

Query: 237 AYWKGNPFVAE 247
           A+++G+   +E
Sbjct: 223 AFFRGSRTSSE 233


>gi|321467420|gb|EFX78410.1| hypothetical protein DAPPUDRAFT_305151 [Daphnia pulex]
          Length = 415

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 86/196 (43%), Gaps = 28/196 (14%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
               +S+ DY       C  +   + EDL  +K  GIT+ ML++A      +  ++  K+
Sbjct: 69  GHISQSLLDYKPCLFKNCSCFTNLVIEDLKAFKDVGITKSMLDKAKDRGT-KYQIIGQKL 127

Query: 123 YIHKYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           Y          R+ F ++     GI   + K    LPD+EL+ +  D P + SR +    
Sbjct: 128 Y----------RESFCLFPSRCAGIEHFILKVIKDLPDMELIINNRDWPQV-SRHFG--- 173

Query: 178 NKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWI 231
                P+  +S  +  +DI +P W+FW G   I++ P     W+     + +      W 
Sbjct: 174 --EVLPILSFSKTKEYLDITYPAWTFWEGGPAISLYPRGLGRWDQHRISIDKVAATYPWN 231

Query: 232 DREPYAYWKGNPFVAE 247
           +++  A+++G+   +E
Sbjct: 232 EKQSKAFFRGSRTSSE 247


>gi|165993289|emb|CAP71956.1| unnamed protein product [Danio rerio]
          Length = 500

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/224 (25%), Positives = 95/224 (42%), Gaps = 29/224 (12%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP  F  I  DLS +K     R+  E      +++   H+   + NN+VYI  + + +  
Sbjct: 152 CPASFSQIESDLSIFKSVDPDRNAHEVIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   +LPD+E   +  D P+ + R      ++ P P+F + G   T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     +G+ G  W  +    +W+G     E R +L+
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDSRKE-RLELV 319

Query: 254 TCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
               ++     A L   + +F +  H       L   +  F+FF
Sbjct: 320 KLARAN----TAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFF 359


>gi|158297147|ref|XP_317424.4| AGAP008037-PA [Anopheles gambiae str. PEST]
 gi|157015054|gb|EAA12302.4| AGAP008037-PA [Anopheles gambiae str. PEST]
          Length = 513

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 84/199 (42%), Gaps = 24/199 (12%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQS 130
           T  CP     I  DL P++    +          ++    +H   +++ N+V+   Y Q 
Sbjct: 158 TVGCPAGDPQIDMDLIPFRAINFSSLRTRMIQQYDKPGSISHCNYVILRNQVHRRCYGQH 217

Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-NKGPPPLFRYSG 189
                 F    +L L RK+   LPD+E+  +  D P+++     GP+   GP P+F + G
Sbjct: 218 TGFSK-FMDTILLSLARKFT--LPDMEMFVNLGDWPLVKK---GGPSRTTGPYPIFSWCG 271

Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN------- 242
              T DIV P +        N+     +L  L     G  W D+   A+W+G        
Sbjct: 272 SDDTFDIVMPTYDITESTLENMG--RVMLDMLSVQRRGLPWADKHAKAFWRGRDARRERL 329

Query: 243 PFVAETRR--DLLTCNLSD 259
             VA +RR  +LL  +L++
Sbjct: 330 ELVALSRRYPELLNASLTN 348


>gi|328780644|ref|XP_003249836.1| PREDICTED: o-glucosyltransferase rumi homolog, partial [Apis
           mellifera]
          Length = 391

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/218 (26%), Positives = 93/218 (42%), Gaps = 23/218 (10%)

Query: 41  ITTPLNCVK----NQTQTCPTNYPKT-SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWK 95
           I    +CVK    N  +    +Y K  +  +E+  DY +        +   I  DL P+K
Sbjct: 28  IDNTEDCVKEKKVNIYEGLNKHYSKYYNAIEEAEKDYRVCNNINNNCFKDVIINDLKPFK 87

Query: 96  VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
             GI +D+++ A     F  I +  K+Y  K      +R      GI   L K    L D
Sbjct: 88  EKGINKDLIDIAKIRGTFYQI-IQGKLYRQK-DCMFPSRCA----GIEYFLLKLAPGLTD 141

Query: 156 LELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP- 213
           ++L+ +  D P  +S  Y G     P P+F +S      DI +P W+FW G   I++ P 
Sbjct: 142 MDLVINVRDYP--QSSKYFG----DPLPIFSFSKTSQYYDITYPAWAFWEGGPAISLYPR 195

Query: 214 ----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
               W+     L + +N   W  +E   +++G+   +E
Sbjct: 196 GLGRWDEHCISLDKASNNTLWEKKENKVFFRGSRTSSE 233


>gi|71018113|ref|XP_759287.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
 gi|46099137|gb|EAK84370.1| hypothetical protein UM03140.1 [Ustilago maydis 521]
          Length = 473

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 20/192 (10%)

Query: 78  STCPDYFRWIHEDLSP----WKVTG--ITRDMLERANQTAH--FRLILVNNKVYIHKYKQ 129
           STC   F  ++  L      WK  G  +  D+   A+   H    LI+ + +++I   K+
Sbjct: 76  STCKAEFSRLYPQLIANQIAWKSKGGILYADVKNAADNCRHGCVHLIIKHGQIFIRAQKK 135

Query: 130 SIQTRDVFTIWGILQLLRK-YPGRLPDLELMFDCDDRPVIRSRDYSG---PNNKGPPPLF 185
             Q+R    +   LQLL K Y G     + + +  +  VI + D+ G   PN++G   + 
Sbjct: 136 DWQSR----VRSTLQLLDKAYSGASEHEKALMEATEL-VISTADFDGFTDPNSRGAGWVL 190

Query: 186 RYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFV 245
               +      +FPD+SF  W E  I  ++   R+ ++ N    W  +   A+W+G+   
Sbjct: 191 DKRVNDTQGQYLFPDFSFASWPEAGIPSYQEFRRQAQQVNAETPWKSKTNPAFWRGDALA 250

Query: 246 AET---RRDLLT 254
            +    R  LL+
Sbjct: 251 GQNIKPRESLLS 262


>gi|125773715|ref|XP_001358116.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
 gi|121991819|sp|Q29AU6.1|RUMI_DROPS RecName: Full=O-glucosyltransferase rumi; Flags: Precursor
 gi|54637851|gb|EAL27253.1| GA16050 [Drosophila pseudoobscura pseudoobscura]
          Length = 409

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 24/175 (13%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + C  +   I  DL+P+K TG++R M+E + +           K+Y  +  +        
Sbjct: 69  ANCSCHAAVIKSDLAPYKATGVSRQMIESSARYG------TRYKIYEKRLYREENCMFPA 122

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWT 193
              GI   L      LPD++L        VI +RDY   N    N    P+  +S  +  
Sbjct: 123 RCQGIEHFLLPLVATLPDMDL--------VINTRDYPQINMAWGNGAQGPILSFSKTKDH 174

Query: 194 MDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
            DI++P W+FW G     + P     W+ +  +L++      W  +    +++G+
Sbjct: 175 RDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229


>gi|56207656|emb|CAI20990.1| novel protein (zgc:56065) [Danio rerio]
          Length = 500

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/224 (24%), Positives = 96/224 (42%), Gaps = 29/224 (12%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP  F  I  DLS ++     R+  E      +++   H+   + NN+VYI  + + +  
Sbjct: 152 CPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   +LPD+E   +  D P+ + R      ++ P P+F + G   T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     +G+ G  W  +    +W+G     ++R++ L
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGR----DSRKERL 316

Query: 254 TCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
                 K +  A L   + +F +  H       L   +  F+FF
Sbjct: 317 ELVKLAKAN-TAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFF 359


>gi|443727132|gb|ELU14012.1| hypothetical protein CAPTEDRAFT_20245 [Capitella teleta]
          Length = 371

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 85/191 (44%), Gaps = 27/191 (14%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
           S  + ++++YS      C  +   I  DL PWK  GIT  + ++A         ++N+K+
Sbjct: 18  SSIENALAEYSPCNPDDCSCHLGVIESDLKPWK-NGITEQLFQQAKARGSNHYQIINHKL 76

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
           Y     +S +        GI   + +   +LPD+E         ++  RD+   +  G P
Sbjct: 77  Y-----RSEKCMFPSRCSGIEHFILEVIHKLPDMEF--------ILNERDWPQASIHGAP 123

Query: 183 -PLFRYSG---DRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
            P+F +S    D W  DI++P W+FW  G A   I P     W+   + + E      W 
Sbjct: 124 LPIFSFSKVPTDNW--DIMYPAWTFWEGGPAVWPIYPTGLGRWDEQRKIIPEAAKKWPWH 181

Query: 232 DREPYAYWKGN 242
            ++  A+++G+
Sbjct: 182 KKQSKAFFRGS 192


>gi|195143519|ref|XP_002012745.1| GL23774 [Drosophila persimilis]
 gi|194101688|gb|EDW23731.1| GL23774 [Drosophila persimilis]
          Length = 409

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 73/175 (41%), Gaps = 24/175 (13%)

Query: 78  STCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF 137
           + C  +   I  DL+P+K TG++R M+E + +           K+Y  +  +        
Sbjct: 69  ANCSCHAAVIKSDLAPYKATGVSRQMIESSARYG------TRYKIYEKRLYREENCMFPA 122

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN----NKGPPPLFRYSGDRWT 193
              GI   L      LPD++L        VI +RDY   N    N    P+  +S  +  
Sbjct: 123 RCQGIEHFLLPLVATLPDMDL--------VINTRDYPQINMAWGNGAQGPILSFSKTKDH 174

Query: 194 MDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGN 242
            DI++P W+FW G     + P     W+ +  +L++      W  +    +++G+
Sbjct: 175 RDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229


>gi|442759745|gb|JAA72031.1| Putative o-glucosyltransferase rumi [Ixodes ricinus]
          Length = 410

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 64/137 (46%), Gaps = 16/137 (11%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQTA-HFRLILVNNKVYIHKYKQSIQTRDVF 137
           TC  +   I  DL  W  +GIT+D+++R+     H+++I  N+K+Y     +       F
Sbjct: 78  TCSCHSAVIDNDLQLWSESGITKDLIQRSKSRGIHYQII--NHKLY-----RGEDCLFPF 130

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
              GI   L +    LPDLE + +  D P    R         P P+F +S      DI+
Sbjct: 131 RCSGIEPFLLEIVDELPDLEFIVNTRDWPQAHKR-------YDPLPVFSFSKTPDYADIM 183

Query: 198 FPDWSFW-GWAEINIKP 213
           +P W+FW G   I++ P
Sbjct: 184 YPAWTFWAGGPAISLYP 200


>gi|345796207|ref|XP_545115.3| PREDICTED: protein O-glucosyltransferase 1 [Canis lupus familiaris]
          Length = 392

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 83/194 (42%), Gaps = 32/194 (16%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVN 119
            Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   
Sbjct: 37  EQINRSLENYEPCSSQNCSCYHGIISEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--K 93

Query: 120 NKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
           N++Y        +  D        G+   + +  GRLPD+E++ +  D P I       P
Sbjct: 94  NRLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQI-------P 138

Query: 177 NNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGR 228
               P  P+F +S      DI++P W+FW  G A   I P     W+    +L       
Sbjct: 139 KWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQW 198

Query: 229 NWIDREPYAYWKGN 242
            W  +   AY++G+
Sbjct: 199 PWKKKNSTAYFRGS 212


>gi|41055550|ref|NP_957225.1| KDEL motif-containing protein 1 precursor [Danio rerio]
 gi|82188635|sp|Q7ZVE6.1|KDEL1_DANRE RecName: Full=KDEL motif-containing protein 1; Flags: Precursor
 gi|28277832|gb|AAH45893.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Danio rerio]
          Length = 500

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 55/224 (24%), Positives = 95/224 (42%), Gaps = 29/224 (12%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP  F  I  DLS ++     R+  E      +++   H+   + NN+VYI  + + +  
Sbjct: 152 CPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHY--TIKNNQVYIKTHGEHVGF 209

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   +LPD+E   +  D P+ + R      ++ P P+F + G   T
Sbjct: 210 R-IFMDAFLLSLTRKV--KLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDT 261

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     +G+ G  W  +    +W+G     E R +L+
Sbjct: 262 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDSRKE-RLELV 319

Query: 254 TCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
               ++     A L   + +F +  H       L   +  F+FF
Sbjct: 320 KLARAN----TAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFF 359


>gi|307172256|gb|EFN63761.1| CAP10 family protein AGAP004267 [Camponotus floridanus]
          Length = 406

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 51/219 (23%), Positives = 93/219 (42%), Gaps = 33/219 (15%)

Query: 47  CVKNQTQTCPTNYPKT---------SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVT 97
           C +N+      +Y K          +  Q++  DY     +    +   I  DLSP+K  
Sbjct: 31  CKENEIDHQKLSYAKAINEGYQKYLTAIQDAEQDYKECNNTKHSCFKDVIVRDLSPFKKK 90

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLP 154
           GI+ +M+  A     F  I +  K+Y        + +D        GI   L K  G L 
Sbjct: 91  GISEEMIVAAKNRGTFYQI-IGGKLY--------REKDCMFPARCAGIEHFLLKVIGNLS 141

Query: 155 DLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP 213
           +++L+ +  D P  +S +Y G       P+F +S      DI++P W+FW G   I++ P
Sbjct: 142 NMDLIINTRDYP--QSSEYFG----NAMPIFSFSKTSQYYDIMYPAWAFWEGGPAISLYP 195

Query: 214 -----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
                W+   + L + ++   W  ++   +++G+   +E
Sbjct: 196 RGLDRWDRHRKLLNKASSEIPWEKKDNKGFFRGSRTSSE 234


>gi|403288562|ref|XP_003935467.1| PREDICTED: protein O-glucosyltransferase 1 [Saimiri boliviensis
           boliviensis]
          Length = 391

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 85/190 (44%), Gaps = 26/190 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ DY    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 37  QINRSLEDYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 93

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           ++Y  +      +R       IL+++R    RLPD+E++ +  D P +       P    
Sbjct: 94  RLY-RENDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 141

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
           P  P+F +S      DI++P W+FW  G A   I P     W+    +L        W  
Sbjct: 142 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 201

Query: 233 REPYAYWKGN 242
           +   AY++G+
Sbjct: 202 KNSTAYFRGS 211


>gi|119497671|ref|XP_001265593.1| hypothetical protein NFIA_032600 [Neosartorya fischeri NRRL 181]
 gi|119413757|gb|EAW23696.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 458

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/181 (26%), Positives = 74/181 (40%), Gaps = 15/181 (8%)

Query: 81  PDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYI--HKYKQSIQTRDV 136
           P  F+ IH  +  WK   GI+RD L     +    R I+ N  +Y+   + K     R +
Sbjct: 108 PGLFQDIHRGVEYWKSRGGISRDDLNAVPFEDGMARAIISNGDLYVVATRAKGDDHRRKI 167

Query: 137 F-TIWGILQLLRKYPGRL--PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
             T+  I + L     R   P +E +F  +DR           N  G P           
Sbjct: 168 VGTLGSIHRALSASSNRTSHPTIEFIFSIEDR-------VDDVNAVGHPVWVLSRKAFEE 220

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDL 252
             I+ PD+ FW WA+ NI P+  ++  +    +   + D+E    W+G   F  + RR L
Sbjct: 221 SVILMPDFGFWSWAKSNIGPYGQVVERIMAAESKLKFADKEQKLVWRGKLSFAPKLRRVL 280

Query: 253 L 253
           L
Sbjct: 281 L 281


>gi|402859114|ref|XP_003894014.1| PREDICTED: protein O-glucosyltransferase 1 [Papio anubis]
          Length = 392

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|380786923|gb|AFE65337.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
          Length = 392

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|109033215|ref|XP_001109371.1| PREDICTED: KTEL motif-containing protein 1-like [Macaca mulatta]
 gi|355559357|gb|EHH16085.1| hypothetical protein EGK_11322 [Macaca mulatta]
 gi|355746435|gb|EHH51049.1| hypothetical protein EGM_10372 [Macaca fascicularis]
          Length = 392

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|157125824|ref|XP_001660800.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
 gi|108882653|gb|EAT46878.1| AAEL001982-PA [Aedes aegypti]
          Length = 505

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 80/189 (42%), Gaps = 18/189 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP+    I  DL P++    +          ++    +    ++  N++Y   Y +    
Sbjct: 152 CPNSDPQIESDLIPFRAVNFSSLRPRIIQQYDKPGSVSLCNYVIKENQIYRTCYGRYTGF 211

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + ++    +L L RK    LPD+EL  +  D P++    +      GP P+F + G   T
Sbjct: 212 K-MYMDALLLSLARK--AILPDMELFVNLGDWPLVTKGGHR--RTTGPYPIFSWCGSEDT 266

Query: 194 MDIVFPDWSFWGWAEINIKPWESL-LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
            DIV P +      E  ++    + L  L     G  W D+EP A+W+G     E R DL
Sbjct: 267 FDIVMPTYDI---VEATLEAMNRVTLDMLSVQRKGIPWKDKEPKAFWRGRDACRE-RLDL 322

Query: 253 LTCNLSDKH 261
           +   +S KH
Sbjct: 323 V--GISQKH 329


>gi|345497421|ref|XP_001601540.2| PREDICTED: O-glucosyltransferase rumi homolog [Nasonia vitripennis]
          Length = 400

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 22/193 (11%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
           S  +E+ ++Y+   +  C  +   I  DL  +   GI + ++E A     F  I +  K+
Sbjct: 54  SAIEEAEANYAECDSKKCKCFGDMITRDLKTFAERGIDQKLIEAARPRGTFYQI-IGGKL 112

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDY--SGPNNKG 180
           Y  K      +R      GI   L K  G + D+ L        V+ +RDY  S  +   
Sbjct: 113 YREK-NCMFPSRCA----GIEHFLLKVIGNVSDVSL--------VVNTRDYPQSSRHFGQ 159

Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
           P P+F +S      DI++P W FW G   I++ P     W+   + L E      W  +E
Sbjct: 160 PLPVFSFSKTPDYYDIMYPAWVFWEGGPAISLYPRGLGRWDLHRKSLNEAREETPWEKKE 219

Query: 235 PYAYWKGNPFVAE 247
             A+++G+   +E
Sbjct: 220 EKAFFRGSRTSSE 232


>gi|307194254|gb|EFN76650.1| CAP10 family protein AGAP004267 [Harpegnathos saltator]
          Length = 342

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 48/186 (25%), Positives = 86/186 (46%), Gaps = 19/186 (10%)

Query: 87  IHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL 146
           I  DL P+K  GI+ +M+  A     F  I+   K  +++ K  +         GI   L
Sbjct: 81  ILRDLRPFKKKGISEEMINAAKTRGTFYQII---KGTLYREKDCMFPA---RCAGIEHFL 134

Query: 147 RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-G 205
            K  G L D++L+ +  D P  +S +Y G       P+F +S      DI++P W+FW G
Sbjct: 135 LKVIGNLSDMDLVINTRDYP--QSSEYFG----NLLPIFSFSKTPQYYDIMYPAWAFWEG 188

Query: 206 WAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDK 260
              I++ P     W+   + L + +    W  +E   +++G+   +E R +L+  + +  
Sbjct: 189 GPAISLYPRGLGRWDQHRKSLNKASLETPWEKKESKGFFRGSRTSSE-RDNLILLSRNKS 247

Query: 261 HDWNAR 266
           H  +A+
Sbjct: 248 HLVDAQ 253


>gi|343425735|emb|CBQ69269.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 488

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 62/154 (40%), Gaps = 14/154 (9%)

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
           +H R+++ NN+VY+  ++QS  TR    +  + Q +     +LPD E     +D      
Sbjct: 167 SHARVVIHNNRVYLKYFQQSAFTRVNSALALLFQSVVGSREKLPDAEFCLSANDW----- 221

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
               G   K       Y  D W M    PD+ F+ W E  I  +     +  +  +   W
Sbjct: 222 ----GSMGKFSLDRAPYLEDLWLM----PDYGFYSWPEPGIGSYTEHREKTLQVEHDTPW 273

Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
             + P  +W+G   V    R  L     D HDWN
Sbjct: 274 ERKVPKLFWRGAMGVGTADRKALLAAAQD-HDWN 306


>gi|410970561|ref|XP_003991747.1| PREDICTED: protein O-glucosyltransferase 1 [Felis catus]
          Length = 447

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 93  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 149

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 150 RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 194

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 195 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 254

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 255 WKRKNSTAYFRGS 267


>gi|307170880|gb|EFN62991.1| KDEL motif-containing protein 1 [Camponotus floridanus]
          Length = 458

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 88/194 (45%), Gaps = 32/194 (16%)

Query: 80  CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
           C   +  IH DLSP+       I   +++R +Q     +   +L +N++Y   Y Q +  
Sbjct: 111 CRQNYTQIHNDLSPFLNIDFDKIRHSIIKRYDQPTSVSICHYVLRSNRIYRRCYGQYVGF 170

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + +F    +L L RK    LPD+E   +  D P++       P+     P+F + G   T
Sbjct: 171 K-IFMDAILLSLARKVV--LPDIEFFVNLGDWPLV-------PDTGPLYPIFSWCGSDNT 220

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL-----KEGNNGRNWIDREPYAYWKGNPFVAET 248
            DIV P +      +I     E++ R +      +GN G +W ++    +W+G     ++
Sbjct: 221 KDIVMPTY------DITESSLEAMGRVMLDTLSVQGNTGLSWKNKTEQLFWRGR----DS 270

Query: 249 RRDLLTC-NLSDKH 261
           RR+ L   ++S KH
Sbjct: 271 RRERLNLIDISRKH 284


>gi|357622883|gb|EHJ74243.1| hypothetical protein KGM_01635 [Danaus plexippus]
          Length = 460

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 76/179 (42%), Gaps = 27/179 (15%)

Query: 80  CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           C   ++ I +DL P++   +        D   R   T+    ++  NK+Y   Y + +  
Sbjct: 150 CSQTYKQIEKDLIPFQRVEMKEQIKKIVDKYHRPESTSFCHYVIKENKIYRDCYGKHVGF 209

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
            ++F    +L L RK    LPD+E++ +  D P+I         N     +F + G   T
Sbjct: 210 -NMFADNILLSLSRK--TVLPDMEMVINLGDWPLIHK-------NGEKLAMFSWCGSDDT 259

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL-----KEGNNGRNWIDREPYAYWKGNPFVAE 247
           +DIV P +      +I     E+L R        +GN  R W DRE  A W+G    AE
Sbjct: 260 LDIVMPTY------DITESTLENLGRVTLDTLSVQGNVERKWSDRETRAIWRGRDSRAE 312


>gi|440897644|gb|ELR49289.1| KTEL motif-containing protein 1 [Bos grunniens mutus]
          Length = 392

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/190 (25%), Positives = 84/190 (44%), Gaps = 26/190 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           ++Y  +      +R      G+   + +  GRLPD+E++ +  D P +       P    
Sbjct: 95  RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
           P  P+F +S      DI++P W+FW  G A   I P     W+    +L        W  
Sbjct: 143 PAIPIFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202

Query: 233 REPYAYWKGN 242
           +   AY++G+
Sbjct: 203 KNSTAYFRGS 212


>gi|383420127|gb|AFH33277.1| protein O-glucosyltransferase 1 precursor [Macaca mulatta]
          Length = 392

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNCTAYFRGS 212


>gi|426217502|ref|XP_004002992.1| PREDICTED: protein O-glucosyltransferase 1 [Ovis aries]
          Length = 392

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/190 (25%), Positives = 84/190 (44%), Gaps = 26/190 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           ++Y  +      +R      G+   + +  GRLPD+E++ +  D P +       P    
Sbjct: 95  RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
           P  P+F +S      DI++P W+FW  G A   I P     W+    +L        W  
Sbjct: 143 PAIPIFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202

Query: 233 REPYAYWKGN 242
           +   AY++G+
Sbjct: 203 KNSTAYFRGS 212


>gi|378732944|gb|EHY59403.1| hypothetical protein HMPREF1120_07393 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 412

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/183 (22%), Positives = 81/183 (44%), Gaps = 25/183 (13%)

Query: 81  PDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           P  F  +   L+  + + IT + L+    Q  + R ++ + ++Y+ +    I +R++ T+
Sbjct: 71  PGLFEEVERPLADRRNSSITLEELDSVPRQNGYVRGMIYDQQLYVIEKTGGIYSRELATL 130

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDR-PVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVF 198
             + + +   P  LP++E +F+ DDR P +    Y+           R   D  T   + 
Sbjct: 131 HALHRAIISAPEPLPNIEFVFNSDDRIPSVAIWGYA-----------RREQD--TKIWLI 177

Query: 199 PDWSFWGWAEINIKPWESLLRELK------EGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
           PD+ +W W E  +      +RE++      E ++G +W  + P   W+G     E R + 
Sbjct: 178 PDFGYWSWPETKV----GTMREVQMKAVETEQDDGWSWSSKVPKLLWRGATMGLELRENF 233

Query: 253 LTC 255
           L  
Sbjct: 234 LKA 236


>gi|169617900|ref|XP_001802364.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
 gi|111059424|gb|EAT80544.1| hypothetical protein SNOG_12132 [Phaeosphaeria nodorum SN15]
          Length = 428

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 53/104 (50%), Gaps = 12/104 (11%)

Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
           +P++E  F  DD P +RS++   + G   K  P       + W M    P++++W W   
Sbjct: 152 IPNIEFSFSLDDLP-LRSKEKGAFFGYTRKDTPEY----DNIWMM----PNYAYWSWNYT 202

Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
           +   W S+ RE+++G     W  ++P   W+G   +AE R++L+
Sbjct: 203 HAPSWNSIRREIEQGEKKTPWHKKDPRVVWRGKIKMAELRKELV 246


>gi|156401513|ref|XP_001639335.1| predicted protein [Nematostella vectensis]
 gi|156226463|gb|EDO47272.1| predicted protein [Nematostella vectensis]
          Length = 497

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 71/166 (42%), Gaps = 15/166 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRLI---LVNNKVYIHKYKQSIQTRD 135
           CP  +  I  DL+ +    + R   E  ++   H  L    ++NNKVY   + + +    
Sbjct: 150 CPQNYSQIDRDLARFPEINLIRLAKEAVDRFGVHHALCHYSIINNKVYRKSHGEHVGF-S 208

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +F+   I  L RK    LPD+E   +  D P+ + +D        P P+  + G   T D
Sbjct: 209 MFSDAIIHSLARKV--HLPDMEFFVNLGDWPLEKRKD-------NPIPILSWCGSEDTAD 259

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           IV P +     A +      SL     + N G  W D+ P A+W+G
Sbjct: 260 IVMPTYDL-TEAALETMGRVSLDMLSVQANTGPKWKDKIPKAFWRG 304


>gi|62460496|ref|NP_001014903.1| protein O-glucosyltransferase 1 precursor [Bos taurus]
 gi|75057820|sp|Q5E9Q1.1|PGLT1_BOVIN RecName: Full=Protein O-glucosyltransferase 1; AltName:
           Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
           motif-containing protein 1; Flags: Precursor
 gi|59858103|gb|AAX08886.1| x 010 protein [Bos taurus]
 gi|296491446|tpg|DAA33499.1| TPA: KTEL motif-containing protein 1 precursor [Bos taurus]
          Length = 392

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/190 (25%), Positives = 84/190 (44%), Gaps = 26/190 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRALENYEPCSSPNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           ++Y  +      +R      G+   + +  GRLPD+E++ +  D P +       P    
Sbjct: 95  RLY-RESDCMFPSR----CSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWME 142

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
           P  P+F +S      DI++P W+FW  G A   I P     W+    +L        W  
Sbjct: 143 PAIPIFSFSKTLEYHDIMYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKK 202

Query: 233 REPYAYWKGN 242
           +   AY++G+
Sbjct: 203 KNSTAYFRGS 212


>gi|198431555|ref|XP_002127452.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 495

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 83/186 (44%), Gaps = 20/186 (10%)

Query: 80  CPDYFRWIHEDLSPWKVTGIT---RDMLER---ANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP  +R I EDL P KV  ++   + ++ER    +   H+   +VNN+VY  +   SI  
Sbjct: 146 CPPTYRQIEEDLKPHKVIELSTFAKRIVERFTPHHSVCHY--AVVNNQVY-RQCHGSITD 202

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
             +F    +L L RK   +LPD E   +  D P+ +S D        P P+  + G   T
Sbjct: 203 FKMFMDAPLLSLTRKV--KLPDFEFFINLGDWPLEKSHD-------DPLPIISWCGSDGT 253

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DI+ P +     + + +    SL     + N G  W  +    +++G     E R DL 
Sbjct: 254 HDIILPTYDITN-SVLEMLGRVSLDMFSVQANTGPRWGKKIAKGFFRGRDSRQE-RLDLA 311

Query: 254 TCNLSD 259
           + ++ +
Sbjct: 312 SMSVKN 317


>gi|258575361|ref|XP_002541862.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
 gi|237902128|gb|EEP76529.1| hypothetical protein UREG_01378 [Uncinocarpus reesii 1704]
          Length = 2000

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 99/249 (39%), Gaps = 49/249 (19%)

Query: 65   TQESISDYSIPPTS---TCPDYFRWIHEDLSPWKVTGIT-RDMLERANQTAHFRLILVNN 120
            T E   DY +         P  F  I + +S  +   I   ++  R  +    R I+ N 
Sbjct: 1707 TSEHGDDYGLGRVQCRIAFPKLFAEIEKSVSARRGKKIEYEELASRTLEDGMVRAIIYNG 1766

Query: 121  KVYIHKYK--QSIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGP 176
            ++ I  ++  +   TR   T+  + + L   P R  LP++E +F  DD            
Sbjct: 1767 ELRIVNFEGHEFTFTRAKATLSSLNRALTAIPDRRSLPNIEFIFSADD------------ 1814

Query: 177  NNKGPPPLFRYSG---DRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRN---- 229
               GP P++ YS    D W    + PD+ +W W E NI P+  + R +   ++G +    
Sbjct: 1815 FTHGPGPIWTYSKRDEDSWAW--LMPDFGYWSWPEANIGPYRQIRRRIAAIDDGDSVGGR 1872

Query: 230  ------WIDREPYAYWKGNPFVAETRRD-------------LLTCNLSDKHDWNARLYVQ 270
                  + ++    +W+GN   A   R+             +L  + S+  D  A+ YV 
Sbjct: 1873 VRPGLKFQNKHQKLFWRGNIATAPVLRNKFLQVTHDKAWASVLPMDWSNSTDIEAK-YVP 1931

Query: 271  VISFIYITH 279
            +    +ITH
Sbjct: 1932 IEEHFFITH 1940


>gi|388855380|emb|CCF51044.1| uncharacterized protein [Ustilago hordei]
          Length = 493

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
           +H R+++ NN+VY+  ++QS  TR    +  + Q +     +LPD E     +D      
Sbjct: 172 SHARVLIHNNRVYLKYFQQSAFTRVNSALALLFQSVVASKEKLPDAEFCLSANDW----- 226

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
               G   K       Y  D W M    PD+ F+ W E  I  +     +  E      W
Sbjct: 227 ----GSMGKFSLDRAPYLEDLWLM----PDYGFYSWPEPGIGSYTEHREKTLEVEKATPW 278

Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
            ++    +W+G   V    R  L    SD H WN
Sbjct: 279 ENKISKLFWRGAMGVGTADRKALLAAASD-HPWN 311


>gi|301758922|ref|XP_002915307.1| PREDICTED: KTEL motif-containing protein 1-like [Ailuropoda
           melanoleuca]
          Length = 380

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 26  QINRSLENYEPCSSQNCSCYHGVIDEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 82

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  G+LPD+E++ +  D P +       P 
Sbjct: 83  RLY--------RENDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------PK 127

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 128 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 187

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 188 WKKKNSTAYFRGS 200


>gi|281345604|gb|EFB21188.1| hypothetical protein PANDA_003280 [Ailuropoda melanoleuca]
          Length = 364

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 10  QINRSLENYEPCSSQNCSCYHGVIDEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 66

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  G+LPD+E++ +  D P +       P 
Sbjct: 67  RLY--------RENDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------PK 111

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 112 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 171

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 172 WKKKNSTAYFRGS 184


>gi|296226200|ref|XP_002758827.1| PREDICTED: protein O-glucosyltransferase 1 [Callithrix jacchus]
          Length = 391

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 26/190 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 37  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 93

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           ++Y  +      +R       IL+++R    RLPD+E++ +  D P +       P    
Sbjct: 94  RLY-RENDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 141

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
           P  P+F +S      DI++P W+FW  G A   I P     W+    +L        W  
Sbjct: 142 PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKK 201

Query: 233 REPYAYWKGN 242
           +   AY++G+
Sbjct: 202 KNSTAYFRGS 211


>gi|194222810|ref|XP_001500762.2| PREDICTED: protein O-glucosyltransferase 1 [Equus caballus]
          Length = 392

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQ---TAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    +     H+++I   N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-AGISRKMMAEVVKRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  G LPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLARSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|31982953|ref|NP_689518.1| protein O-glucosyltransferase 1 precursor [Homo sapiens]
 gi|114588661|ref|XP_516666.2| PREDICTED: protein O-glucosyltransferase 1 isoform 2 [Pan
           troglodytes]
 gi|397509574|ref|XP_003825192.1| PREDICTED: protein O-glucosyltransferase 1 [Pan paniscus]
 gi|426341692|ref|XP_004036160.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Gorilla
           gorilla gorilla]
 gi|74730148|sp|Q8NBL1.1|PGLT1_HUMAN RecName: Full=Protein O-glucosyltransferase 1; AltName:
           Full=CAP10-like 46 kDa protein; Short=hCLP46; AltName:
           Full=KTEL motif-containing protein 1; AltName:
           Full=Myelodysplastic syndromes relative protein; Flags:
           Precursor
 gi|22761537|dbj|BAC11625.1| unnamed protein product [Homo sapiens]
 gi|29179612|gb|AAH48810.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
 gi|31745813|gb|AAP56253.1| myelodysplastic syndromes relative [Homo sapiens]
 gi|37182284|gb|AAQ88944.1| MDS010 [Homo sapiens]
 gi|119599976|gb|EAW79570.1| chromosome 3 open reading frame 9 [Homo sapiens]
 gi|189067501|dbj|BAG37760.1| unnamed protein product [Homo sapiens]
 gi|410228232|gb|JAA11335.1| protein O-glucosyltransferase 1 [Pan troglodytes]
 gi|410254852|gb|JAA15393.1| protein O-glucosyltransferase 1 [Pan troglodytes]
 gi|410299780|gb|JAA28490.1| protein O-glucosyltransferase 1 [Pan troglodytes]
 gi|410328909|gb|JAA33401.1| protein O-glucosyltransferase 1 [Pan troglodytes]
          Length = 392

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++    N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|344282265|ref|XP_003412894.1| PREDICTED: protein O-glucosyltransferase 1-like [Loxodonta
           africana]
          Length = 638

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/191 (22%), Positives = 79/191 (41%), Gaps = 28/191 (14%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDML-ERANQTAHFRLILVNNKV 122
           Q   S+ +Y    +  C  Y   I +DL+P++  GI++ M+ E   +       ++ N++
Sbjct: 284 QINRSLENYEPCSSQDCSCYHGVIEQDLTPFR-GGISKKMMAEVVGRKLGTHYQIIKNRL 342

Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
           Y        +  D        G+   + +  GRLPD+E++ +  D P +       P   
Sbjct: 343 Y--------RENDCMFPSRCGGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWM 387

Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFWGWA-------EINIKPWESLLRELKEGNNGRNWI 231
            P  P+F +S      DI++P W+FW           I +  W+    +L        W 
Sbjct: 388 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPIGLGRWDLFREDLARSAAQWPWK 447

Query: 232 DREPYAYWKGN 242
            +   AY++G+
Sbjct: 448 KKNSTAYFRGS 458


>gi|297670280|ref|XP_002813298.1| PREDICTED: protein O-glucosyltransferase 1 [Pongo abelii]
          Length = 392

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++    N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRRLGTHYQI--TKN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|432964664|ref|XP_004086966.1| PREDICTED: KDEL motif-containing protein 1-like [Oryzias latipes]
          Length = 504

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 83/189 (43%), Gaps = 22/189 (11%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLE------RANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP  F  I +DLS +      R+  E      + +   H+   + +NKVYI  + + +  
Sbjct: 156 CPQAFAQIDQDLSVFTTVDPDRNAREIPPRFRQRHSLCHY--TIKDNKVYIKTFGEHVGF 213

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK    LPD+E   +  D P+ + +    P  +   P+F + G   +
Sbjct: 214 R-IFMDAVLLSLTRKV--HLPDVEFFVNLGDWPLEKRK----PTEE-IHPIFSWCGSNSS 265

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     +GN G  W ++   A+W+G     ++RR+ L
Sbjct: 266 RDIVMPTYDL-TESVLETMGRVSLDMMSVQGNTGPAWPEKNATAFWRGR----DSRRERL 320

Query: 254 T-CNLSDKH 261
               LS  H
Sbjct: 321 ELVQLSRAH 329


>gi|62897433|dbj|BAD96657.1| x 010 protein variant [Homo sapiens]
          Length = 392

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++    N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|328709644|ref|XP_001945687.2| PREDICTED: o-glucosyltransferase rumi homolog [Acyrthosiphon pisum]
          Length = 384

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 24/199 (12%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
           S+  ++  +Y+   +     +F  I +DL P+K  GIT +M+  A      R ++ N+ +
Sbjct: 37  SKINKAKKEYTSCESKNGTCFFPNILKDLEPFK-DGITHEMITAAADKGT-RYMIFNHDL 94

Query: 123 YIHKYKQSIQTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           Y        +T+ +F     GI   L K     PD+E + +  D P I  + Y  P    
Sbjct: 95  YR-------ETKCMFPARCEGIEHFLSKIQLNTPDVEFILNTRDWPQI-IKHYGDPK--- 143

Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
             P+F +S      DI++P WSFW G   I + P     W+SL + + + +    W  + 
Sbjct: 144 --PVFSFSKTDDYADIMYPAWSFWSGGPAIKLHPSGLGRWDSLRKSILKQSEQWPWKRKI 201

Query: 235 PYAYWKGNPFVAETRRDLL 253
              +++G+   +E R  L+
Sbjct: 202 SKGFFRGSR-TSEQRDSLI 219


>gi|432849653|ref|XP_004066608.1| PREDICTED: protein O-glucosyltransferase 1-like [Oryzias latipes]
          Length = 387

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 47/215 (21%), Positives = 89/215 (41%), Gaps = 29/215 (13%)

Query: 68  SISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKY 127
           ++  Y    +  C  +   + +DL P++  GI+  ++    Q    R +  + ++  HK 
Sbjct: 37  AVGSYVSCSSVNCSCHLSVLQQDLQPFR-GGISESLMASTVQ----RGMGTHYQIIQHKL 91

Query: 128 KQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLFR 186
            +           G+   + +   RLPDLE++ +  D P +       PN   P  P+  
Sbjct: 92  YREQNCMFPARCSGVEHFILEVIDRLPDLEMVVNVRDYPQV-------PNWMSPALPVLS 144

Query: 187 YSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYW 239
           +S      DI++P W+FW  G A   I P     W+ +  +LK+      W  +EP  ++
Sbjct: 145 FSKTAEYQDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRTDLKKSAAQWPWKKKEPKGFF 204

Query: 240 KG-------NPFVAETRRD--LLTCNLSDKHDWNA 265
           +G       +P +  +R D  L+    +    W +
Sbjct: 205 RGSRTSSERDPLILLSREDPELVDAEYTKNQAWKS 239


>gi|115496800|ref|NP_001069153.1| KDEL motif-containing protein 1 precursor [Bos taurus]
 gi|111304522|gb|AAI19854.1| KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
 gi|296481621|tpg|DAA23736.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 1 [Bos taurus]
          Length = 502

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 81/189 (42%), Gaps = 22/189 (11%)

Query: 80  CPDYFRWIHEDL------SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP+ F  IH DL       P K+         +     H+   L +NKVYI  + + +  
Sbjct: 153 CPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHYS--LKDNKVYIKTHGEHVGF 210

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   ++PD+E   +  D P+ + +  S P+     P+F + G   +
Sbjct: 211 R-IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK--SSPHIH---PIFSWCGSTDS 262

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     + N G  W  +   A W+G     ++RR+ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRRERL 317

Query: 254 T-CNLSDKH 261
               LS KH
Sbjct: 318 ELVKLSRKH 326


>gi|417409544|gb|JAA51271.1| Putative protein o-glucosyltransferase 1, partial [Desmodus
           rotundus]
          Length = 306

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/187 (24%), Positives = 80/187 (42%), Gaps = 20/187 (10%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVY 123
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R ++    +    R +  + ++ 
Sbjct: 54  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKVMVEVVR----RKLGTHYQII 108

Query: 124 IHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP- 182
            H+  +           G+   + +  GRLPD+E++ +  D P +       P    P  
Sbjct: 109 KHRLYRENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWMEPAI 161

Query: 183 PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREP 235
           P+F +S      DI++P W+FW  G A   I P     W+    EL        W  +  
Sbjct: 162 PVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREELVRSAVQWPWKKKNS 221

Query: 236 YAYWKGN 242
            AY++G+
Sbjct: 222 TAYFRGS 228


>gi|21040486|gb|AAH30614.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Homo sapiens]
 gi|312150392|gb|ADQ31708.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [synthetic construct]
          Length = 392

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++    N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRRMMAEVVRRKLGTHYQI--TKN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|326913914|ref|XP_003203277.1| PREDICTED: KDEL motif-containing protein 1-like [Meleagris
           gallopavo]
          Length = 572

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 73/170 (42%), Gaps = 21/170 (12%)

Query: 80  CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
           CP  F  I  DL+ + V     I  ++ +R  Q        + +N+VYI  Y + +  R 
Sbjct: 224 CPQVFPQIQRDLANFPVVDPDKIAIEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR- 282

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
           +F    +L L RK   ++PD+E   +  D P+           K PP    P+F + G  
Sbjct: 283 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPL---------EKKKPPQNLHPIFSWCGSS 331

Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
            + DIV P +     + +      SL     + N G +W D+   A+W+G
Sbjct: 332 ESKDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPSWEDKNTTAFWRG 380


>gi|187608427|ref|NP_001120575.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus (Silurana)
           tropicalis]
 gi|171846847|gb|AAI61552.1| LOC100145729 protein [Xenopus (Silurana) tropicalis]
 gi|189442192|gb|AAI67365.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
 gi|195540008|gb|AAI68093.1| hypothetical protein LOC100145729 [Xenopus (Silurana) tropicalis]
          Length = 509

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 86/191 (45%), Gaps = 18/191 (9%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLER-----ANQTAHFRLILVNNKVYIHKYKQSI 131
           T +CPD    I +D  P+    +TR ML+      A++ A     ++NN +Y H   +  
Sbjct: 156 TLSCPDTEAQISKDFDPFPSIDLTR-MLDEVPKRFADRGAIVHYTVLNNNIYRHSMGRYT 214

Query: 132 QTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR 191
             + +F+   +  L RK   RLPD E   +  D PV   +    P   GP P+  + G  
Sbjct: 215 DFK-MFSDEMLQSLARKV--RLPDFEFYINVGDWPVEHRKANDTP---GPLPMISWCGSA 268

Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
            + DI+ P +         ++   + L  + +G+ G +W ++    +++G     ++R +
Sbjct: 269 DSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTGPSWSNKTEQGFFRGR----DSREE 323

Query: 252 LLT-CNLSDKH 261
            L   ++S KH
Sbjct: 324 RLQLVHMSRKH 334


>gi|432113196|gb|ELK35717.1| Protein O-glucosyltransferase 1, partial [Myotis davidii]
          Length = 364

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 10  QINRSLENYEPCSSQNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 66

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  G LPD+E++ +  D P +       P 
Sbjct: 67  RLY--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PK 111

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 112 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 171

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 172 WKKKNSTAYFRGS 184


>gi|312380729|gb|EFR26645.1| hypothetical protein AND_07142 [Anopheles darlingi]
          Length = 459

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/190 (25%), Positives = 79/190 (41%), Gaps = 20/190 (10%)

Query: 80  CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP     I  DL P++    +          ++    +H   +++ N V+   Y Q    
Sbjct: 100 CPPGDPQIDMDLIPFRAINFSSLRTRMIQQYDKPGSISHCNYVILRNDVHRRCYGQHTGF 159

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN-NKGPPPLFRYSGDRW 192
              F    +L L RK+   LPD+EL  +  D P+++     GP+   GP P+F + G   
Sbjct: 160 SK-FMDTILLSLARKF--SLPDMELFVNLGDWPLVKK---GGPSRTTGPYPIFSWCGSDD 213

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
           T DIV P +        N+     +L  L     G  W D+   A+W+G     + RR+ 
Sbjct: 214 TFDIVMPTYDITESTLENMG--RVMLDMLSIQKRGIPWPDKHRKAFWRGR----DARRER 267

Query: 253 LT-CNLSDKH 261
           L    L+ +H
Sbjct: 268 LELVRLARRH 277


>gi|332025008|gb|EGI65195.1| KDEL motif-containing protein 1 [Acromyrmex echinatior]
          Length = 495

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/189 (25%), Positives = 85/189 (44%), Gaps = 23/189 (12%)

Query: 80  CPDYFRWIHEDLSPWK---VTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
           C   +  IH DLSP+       I + +++R ++     +   +L +N++Y   Y Q +  
Sbjct: 149 CQQNYTQIHHDLSPFSNIDFDKIRQSIIKRYDRPTSVSICHYVLKSNRIYRQCYGQYVGF 208

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + +F    +L L RK    LPD+E   +  D P++       P++    P+F + G   T
Sbjct: 209 K-IFMDAILLSLTRKV--MLPDIEFFVNLGDWPLV-------PDSGPLYPIFSWCGSEST 258

Query: 194 MDIVFPDWSFWGWAEINIKPW-ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
            DIV P +      E +++     +L  L    NG  W  +    +W+G     E R DL
Sbjct: 259 KDIVMPTYDI---TESSLEAMGRVMLDTLSVQGNGLPWEHKTEQLFWRGRDARRE-RLDL 314

Query: 253 LTCNLSDKH 261
           +  ++S KH
Sbjct: 315 I--DISRKH 321


>gi|118084661|ref|XP_416963.2| PREDICTED: KDEL motif-containing protein 1 [Gallus gallus]
          Length = 500

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 73/170 (42%), Gaps = 21/170 (12%)

Query: 80  CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
           CP  F  I  DL+ + V     I  ++ +R  Q        + +N+VYI  Y + +  R 
Sbjct: 152 CPQVFPQIQRDLANFPVVDPDKIAVEIPQRFGQRQSLCHYTIKDNEVYIKTYGEHVGFR- 210

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
           +F    +L L RK   ++PD+E   +  D P+           K PP    P+F + G  
Sbjct: 211 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPL---------EKKKPPQKLHPIFSWCGSS 259

Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
            + DIV P +     + +      SL     + N G +W D+   A+W+G
Sbjct: 260 ESKDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPSWEDKNTTAFWRG 308


>gi|9437337|gb|AAF87313.1|AF168711_1 x 010 protein [Homo sapiens]
          Length = 273

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++    N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--TKN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  GRLPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|242007535|ref|XP_002424595.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
           humanus corporis]
 gi|212508038|gb|EEB11857.1| KDEL motif-containing protein 1 precursor, putative [Pediculus
           humanus corporis]
          Length = 501

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/170 (23%), Positives = 76/170 (44%), Gaps = 21/170 (12%)

Query: 80  CPDYFRWIHEDLSPW---KVTGITRDMLERANQTAHFRL---ILVNNKVYIHKYKQSIQT 133
           C   +  I  DL  +   K + +   +LE+ +      L   ++ +N++Y   Y + +  
Sbjct: 154 CKKTYNQIVSDLESFPKVKFSEVLLKVLEKYSSQRSISLCHYVIKDNEIYRKCYGEYVDF 213

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + +F    +L L RK    LPD E + +  D P+         N+  P P+F + G  +T
Sbjct: 214 K-IFVDALLLSLTRKI--ELPDFEFIVNLGDWPL-------EDNSPSPLPIFSWCGSNFT 263

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
            DI+ P +      E  ++    + R++   +GN G +W D+    +W+G
Sbjct: 264 SDIIMPTYDL---TEATLECMGRVSRDMLSVQGNTGASWNDKINKGFWRG 310


>gi|395844808|ref|XP_003795143.1| PREDICTED: protein O-glucosyltransferase 1 [Otolemur garnettii]
          Length = 392

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++  V N
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIAEDLTPFR-GGISRKMMAEVVRRKLGTHYQI--VKN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           ++Y        +  D        G+   + +  G LPD+E++ +  D P +       P 
Sbjct: 95  RLY--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+    +L        
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WKKKNSTAYFRGS 212


>gi|391340384|ref|XP_003744522.1| PREDICTED: O-glucosyltransferase rumi homolog [Metaseiulus
           occidentalis]
          Length = 418

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 76/172 (44%), Gaps = 21/172 (12%)

Query: 83  YFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGI 142
           ++R I  DL P+   GI+R+ L RA +    R  ++  +V+  K     +    F   GI
Sbjct: 67  HYRQIRRDLKPFS-EGISREDLNRAAKFG-VRYQIIQGRVFRQK-----ECMFPFRCSGI 119

Query: 143 LQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDW 201
              L +   +LPDLE + +  D P         P  +   P+F +S   +   DI++P W
Sbjct: 120 EHFLLRLAPKLPDLEFIVNVRDYP-------QSPVGREKLPVFSFSKVPKNYYDILYPAW 172

Query: 202 SFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
           +FW G   I+I P     W+   + L   +    W  +    +++G+   AE
Sbjct: 173 TFWEGGPAISIYPRGIGRWDIRSKSLIAESERVPWDAKVDRGFFRGSRTSAE 224


>gi|426236645|ref|XP_004012278.1| PREDICTED: KDEL motif-containing protein 1 [Ovis aries]
          Length = 502

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 22/189 (11%)

Query: 80  CPDYFRWIHEDL------SPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP+ F  IH DL       P K+         +     H+   L +NKVYI  + + +  
Sbjct: 153 CPETFAQIHRDLVHFPTVDPEKIATEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   ++PD+E   +  D P+ + +  S P+     P+F + G   +
Sbjct: 211 R-IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK--SSPHIH---PIFSWCGSTDS 262

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     + N G  W  +   A W+G     ++R++ L
Sbjct: 263 KDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317

Query: 254 T-CNLSDKH 261
               LS KH
Sbjct: 318 ELVKLSRKH 326


>gi|296811324|ref|XP_002846000.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238843388|gb|EEQ33050.1| DUF821 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 420

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 32/147 (21%), Positives = 64/147 (43%), Gaps = 16/147 (10%)

Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDR--P 166
           +  + R ++ N ++YI     +I +R+  T+  + + +   P  LP++E +F+ DD+   
Sbjct: 110 KNGYIRAMIYNQQLYIIHRNGTIYSREYATLQALNRAIVSSPDPLPNIEFVFNTDDKVDS 169

Query: 167 VIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNN 226
           V +      P +K          D W M    PD+ +W W E  +   + +  + ++   
Sbjct: 170 VAQWGYARRPQDK----------DMWLM----PDFGYWSWPETKVGTMQEVQTKAEQEEQ 215

Query: 227 GRNWIDREPYAYWKGNPFVAETRRDLL 253
              W  + P  +W+G     E R  L+
Sbjct: 216 TWTWPKKIPKLFWRGATMGLEVRDKLI 242


>gi|403171742|ref|XP_003330935.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375169317|gb|EFP86516.2| hypothetical protein PGTG_12472 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 499

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/159 (22%), Positives = 67/159 (42%), Gaps = 11/159 (6%)

Query: 99  ITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLEL 158
           I+   L+RA Q +H R+++  N+VYI  +      R    +  I + +   P  LPD+E 
Sbjct: 184 ISIQHLDRAYQYSHARVLIYRNRVYIKGFNGGPGLRTEAILNSIQEAVITSPELLPDVEF 243

Query: 159 MFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESL 217
           +    D P     ++         PL+     +   ++ + PD+ F+ W E  +     +
Sbjct: 244 VIKTSDAPQGGDDEH---------PLWVLDRTKSQEEVWLMPDYGFYSWPEPKVGGMVEV 294

Query: 218 LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
             +  E     +W  +   A+W+G   V + R  LL  +
Sbjct: 295 RDKTAEREASLSWDSKISKAFWRGAILV-KLREQLLEVS 332


>gi|49119606|gb|AAH73128.1| LOC443629 protein, partial [Xenopus laevis]
          Length = 507

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 54/214 (25%), Positives = 91/214 (42%), Gaps = 31/214 (14%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLER----ANQT 110
           CP   P+T Q            T +CPD    I +D  P+    +TR + E     A++ 
Sbjct: 143 CPEEDPQTWQE-----------TLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFADRG 191

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWG--ILQ-LLRKYPGRLPDLELMFDCDDRPV 167
           A     ++NN++    Y++S+     F ++   ILQ L RK   RLPD E   +  D PV
Sbjct: 192 AIVHYTVLNNQI----YRRSMGRYTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPV 245

Query: 168 IRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
              +    P   GP P+  + G   + DI+ P +         ++   + L  + +G+ G
Sbjct: 246 ENRKANDTP---GPLPMISWCGSSDSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTG 301

Query: 228 RNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
            +W ++    +++G     E    L    +S KH
Sbjct: 302 PSWSNKTEQGFFRGRDSREER---LQLVQMSRKH 332


>gi|147905630|ref|NP_001085283.1| KDEL (Lys-Asp-Glu-Leu) containing 2 precursor [Xenopus laevis]
 gi|114107936|gb|AAI23308.1| LOC443629 protein [Xenopus laevis]
          Length = 509

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 54/214 (25%), Positives = 91/214 (42%), Gaps = 31/214 (14%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLER----ANQT 110
           CP   P+T Q            T +CPD    I +D  P+    +TR + E     A++ 
Sbjct: 145 CPEEDPQTWQE-----------TLSCPDTEAQISKDFEPFPSIDLTRLLNEVPKRFADRG 193

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWG--ILQ-LLRKYPGRLPDLELMFDCDDRPV 167
           A     ++NN++    Y++S+     F ++   ILQ L RK   RLPD E   +  D PV
Sbjct: 194 AIVHYTVLNNQI----YRRSMGRYTDFKMFSDEILQSLARKV--RLPDFEFYINVGDWPV 247

Query: 168 IRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
              +    P   GP P+  + G   + DI+ P +         ++   + L  + +G+ G
Sbjct: 248 ENRKANDTP---GPLPMISWCGSSDSRDIILPTYDITHSTLETLRGVTNDLLSI-QGHTG 303

Query: 228 RNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
            +W ++    +++G     E    L    +S KH
Sbjct: 304 PSWSNKTEQGFFRGRDSREER---LQLVQMSRKH 334


>gi|443897123|dbj|GAC74465.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
          Length = 495

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
           +H R+++ NN+VY+  ++QS  TR    +  + Q +     +LPD E     +D      
Sbjct: 174 SHARVLIHNNRVYLKYFQQSPFTRVNSALALLFQSVTGAREKLPDAEFCLSANDW----- 228

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
               G   K       Y  D W M    PD+ F+ W E  I  +     +  E  N   W
Sbjct: 229 ----GSMGKFSLDRAPYLVDLWLM----PDYGFYSWPEPGIGSYTEHREKTLELENATPW 280

Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
             +    +W+G   V    R  L    ++ H WN
Sbjct: 281 SAKISKLFWRGAMSVGTADRKALLA-AAENHAWN 313


>gi|345324378|ref|XP_003430815.1| PREDICTED: LOW QUALITY PROTEIN: KDEL motif-containing protein
           1-like [Ornithorhynchus anatinus]
          Length = 588

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 82/187 (43%), Gaps = 18/187 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
           CP  F  I  DL+ +       I  ++ +R  Q        + NNKVYI  + + +  R 
Sbjct: 239 CPASFPQIQRDLAHFPTVDPDKIAVEIPQRFGQRQSLCHYTVKNNKVYIKTHGEHVGFR- 297

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +F    +L L RK   ++PD+E   +  D P+ + +    P+N  P  +F + G   + D
Sbjct: 298 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKKK---SPDNLHP--IFSWCGSTDSKD 350

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT- 254
           IV P +     + +      SL     + N G  W D+   A W+G     ++R++ L  
Sbjct: 351 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPAWEDKNTTAVWRGR----DSRKERLEL 405

Query: 255 CNLSDKH 261
             LS KH
Sbjct: 406 VKLSRKH 412


>gi|332225464|ref|XP_003261899.1| PREDICTED: protein O-glucosyltransferase 1 isoform 1 [Nomascus
           leucogenys]
          Length = 392

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 79/191 (41%), Gaps = 28/191 (14%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD-MLERANQTAHFRLILVNNKV 122
           Q   S+ +Y    +  C  Y   I EDL+P++  GI+R  M E   +       +  N++
Sbjct: 38  QINRSLENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKTMAEVVRRKLGTHYQITKNRL 96

Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
           Y        +  D        G+   + +  GRLPD+E++ +  D P +       P   
Sbjct: 97  Y--------RENDCMFPSRCSGVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWM 141

Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
            P  P+F +S      DI++P W+FW  G A   I P     W+    +L        W 
Sbjct: 142 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWK 201

Query: 232 DREPYAYWKGN 242
            +   AY++G+
Sbjct: 202 KKNSTAYFRGS 212


>gi|224043076|ref|XP_002195971.1| PREDICTED: KDEL motif-containing protein 1 [Taeniopygia guttata]
          Length = 588

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 13/166 (7%)

Query: 80  CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
           CP     I  DL+ + +     I +++ +R  Q        + NN+VYI  Y + +  R 
Sbjct: 240 CPQTIPQIQRDLANFPIVDPDKIAKEIPQRFGQRQSLCHYTIKNNEVYIKTYGEHVGFR- 298

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +F    +L L RK   ++PD+E   +  D P+ + +    P N  P  +F + G   + D
Sbjct: 299 IFMDAILLSLTRKV--KMPDVEFFVNLGDWPLEKRKP---PQNLHP--IFSWCGSSESKD 351

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           IV P +     + +      SL     + N G  W D+   A+W+G
Sbjct: 352 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWDDKNTTAFWRG 396


>gi|157128435|ref|XP_001655120.1| endoplasmic reticulum-resident kdel protein [Aedes aegypti]
 gi|122067566|sp|Q16QY8.1|RUMI_AEDAE RecName: Full=O-glucosyltransferase rumi homolog; Flags: Precursor
 gi|108872609|gb|EAT36834.1| AAEL011121-PA [Aedes aegypti]
          Length = 402

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 56/263 (21%), Positives = 118/263 (44%), Gaps = 40/263 (15%)

Query: 21  SRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTC 80
           S + + + ++++P +  P++  T ++  K       T+    +  QE+++ Y     + C
Sbjct: 20  SDDDSGMCMAKEPCSEAPQQ-ETKVDLYK------ATDNKYVALIQEALASYEPCQQANC 72

Query: 81  PDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVF--- 137
             +   +  DL P+K  GI+  M+ERA      +  +V++++Y        + +D     
Sbjct: 73  SCHADVLKTDLRPFK-GGISEQMVERARSYG-TKYQIVDHRLY--------RQKDCMFPA 122

Query: 138 TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIV 197
              G+   ++     LPD+EL+ +C D P I +R +     +   P+  +S     +DI+
Sbjct: 123 RCSGVEHFIKPNLPHLPDMELIINCRDWPQI-NRHW----KQEKLPVLSFSKTDDYLDIM 177

Query: 198 FPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG-------NPF 244
           +P W FW G   I++ P     W+     +K+  +   W  ++  A+++G       +P 
Sbjct: 178 YPTWGFWEGGPAISLYPTGLGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDERDPL 237

Query: 245 VAETRR--DLLTCNLSDKHDWNA 265
           V  +RR  +L+    +    W +
Sbjct: 238 VLLSRRKPELVDAQYTKNQAWKS 260


>gi|313241455|emb|CBY33708.1| unnamed protein product [Oikopleura dioica]
          Length = 478

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 96/211 (45%), Gaps = 41/211 (19%)

Query: 49  KNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN 108
           K Q+++C  N PK+  T +S           C + F+ +    +  +   I +  + R++
Sbjct: 119 KLQSESC--NCPKSRATFKS--------AYQCTE-FKHLTSSFNKLQNKKINKKSIHRSS 167

Query: 109 QT----AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDD 164
           +T    +    I+ +NK+Y  +  + +  +  F+    L LLRK    LPD+E +F+  D
Sbjct: 168 ETLKGDSVIHYIIKDNKLYSKELSERLDFKR-FSDGIFLSLLRKV--NLPDIEFLFNVGD 224

Query: 165 RPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDW-----SFWGWAEINIKPWESLLR 219
            PV  SR++         P+F + G   + DIV P W     +    ++IN+   + L  
Sbjct: 225 WPV--SREF---------PVFSWCGSEESSDIVVPTWDQIKTTLLSMSKINV---DILTM 270

Query: 220 ELKEGNNGRNWIDREPYAYWKGNPFVAETRR 250
           +L    NG++W  + P  +++G     E  R
Sbjct: 271 QL----NGKSWQSKIPKGFFRGRDSSKERMR 297


>gi|47211427|emb|CAF96437.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 456

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 75/166 (45%), Gaps = 13/166 (7%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLI----LVNNKVYIHKYKQSIQTRD 135
           CPD F  I  DLS +      R+  E   +    + +    + +NKVY+  + + +  R 
Sbjct: 126 CPDSFPQIDTDLSVFSSVDPDRNAQEVPQRFGQRQSLCHYTVKDNKVYVKTFGEHVGFR- 184

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +F    +L L RK   RLPD+E   +  D P+ + +    P ++   P+F + G   T+D
Sbjct: 185 IFMDSILLSLTRKV--RLPDVEFFVNLGDWPLEKRK----PTDQ-IHPIFSWCGSNNTLD 237

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           IV P +     + +      SL     + + G  W ++   A+W+G
Sbjct: 238 IVMPTYDL-TESVLETMGRVSLDMMSVQASTGPPWPEKNATAFWRG 282


>gi|348566951|ref|XP_003469265.1| PREDICTED: protein O-glucosyltransferase 1-like [Cavia porcellus]
          Length = 392

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 81/191 (42%), Gaps = 28/191 (14%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDML-ERANQTAHFRLILVNNKV 122
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+ E  ++       ++ N++
Sbjct: 38  QINRALENYEPCSSQNCSCYRGIIEEDLTPFR-GGISRKMMAEVVSRKLGTHYQVIRNRL 96

Query: 123 YIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
           Y        +  D        G+   + +  G LPD+E++ +  D P I       P   
Sbjct: 97  Y--------RENDCMFPSRCSGVEHFILEVIGHLPDMEMVINVRDYPQI-------PKWM 141

Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
            P  P+F +S      DI++P W+FW  G A   I P     W+    +L        W 
Sbjct: 142 EPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWE 201

Query: 232 DREPYAYWKGN 242
            +   AY++G+
Sbjct: 202 KKNSTAYFRGS 212


>gi|410906075|ref|XP_003966517.1| PREDICTED: protein O-glucosyltransferase 1-like [Takifugu rubripes]
          Length = 367

 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 49/216 (22%), Positives = 94/216 (43%), Gaps = 29/216 (13%)

Query: 68  SISDYSIPPTST-CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK 126
           ++S Y IP TS  C  +   + +DL P+K   I+ ++++   Q    R +  + ++  HK
Sbjct: 17  AVSGY-IPCTSANCSCHLSVLQDDLRPFKFK-ISEELMDATTQ----RGVGTHYQIIGHK 70

Query: 127 YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFR 186
             +           G+   + +   RLPD+E++ +  D P +     S        P+F 
Sbjct: 71  LYREQNCMFPARCSGVEHFILQLIDRLPDMEMVVNVRDYPQVPKWMESSL------PVFS 124

Query: 187 YSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYW 239
           +S     +DI++P W+FW  G A   I P     W+ +  +LK+  +   W  +E   ++
Sbjct: 125 FSKTADYLDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRTDLKKSADQWPWKKKETKGFF 184

Query: 240 KG-------NPFVAETRRD--LLTCNLSDKHDWNAR 266
           +G       +P +  +R D  L+    +    W + 
Sbjct: 185 RGSRTSPERDPLILLSREDPELVDAEYTKNQAWKSE 220


>gi|358054730|dbj|GAA99656.1| hypothetical protein E5Q_06359 [Mixia osmundae IAM 14324]
          Length = 437

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 42/193 (21%), Positives = 81/193 (41%), Gaps = 15/193 (7%)

Query: 78  STC----PDYFRWIHEDLSPWK-VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
           STC    PD +R        W+   GITR  L+ A+  AH R+++ +N+VY+  Y+  I 
Sbjct: 95  STCHGEFPDLYREAERAAQYWRHRGGITRAALDAADAQAHARVLIKDNQVYLTNYRGGIN 154

Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
           +R +  +  + + +      LP +E +   DD   I            P  +F  + +  
Sbjct: 155 SRTLAALASLNEAVLTAVEELPAVEFVIQTDDSAPI--------AGAAPRWVFARTDEED 206

Query: 193 TMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
            + + + PD+  + W E  +     +  +       + W  + P  +W+G   V   R +
Sbjct: 207 ELALWLMPDFGHYAWPEPGVGSMAEVQAKASAFEAAQTWSSKIPKLFWRG-ALVNPLRDE 265

Query: 252 LLTCNLSDKHDWN 264
           L+  +  ++  W 
Sbjct: 266 LIRLSDENRGSWG 278


>gi|410897261|ref|XP_003962117.1| PREDICTED: KDEL motif-containing protein 1-like [Takifugu rubripes]
          Length = 505

 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 73/166 (43%), Gaps = 13/166 (7%)

Query: 80  CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
           CPD F  I  DLS +       I +++ +R  Q        + +NKVY+  + + +  R 
Sbjct: 157 CPDSFPQIDRDLSAFPNVDPDRIAQEIPQRFGQRQSLCHYTVKDNKVYVTTFGEHVGFR- 215

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +FT   +L L RK    LPD+E   +  D P+ + +     +     P+F + G   T D
Sbjct: 216 IFTDSILLSLTRKV--WLPDVEFFVNLGDWPLEKRKLTDKIH-----PIFSWCGSNNTQD 268

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           IV P +     + +      SL     + N G  W ++   A+W+G
Sbjct: 269 IVMPTYDL-TESVLETMGRVSLDMTSVQANTGPPWAEKNATAFWRG 313


>gi|170033685|ref|XP_001844707.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
 gi|167874675|gb|EDS38058.1| KDEL motif-containing protein 2 [Culex quinquefasciatus]
          Length = 495

 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 81/189 (42%), Gaps = 18/189 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTGITR------DMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           CP     I+ DL P++    +          ++    +    ++ +N++Y   Y +    
Sbjct: 142 CPSEDVQINSDLIPFRAVNFSSLRPRIIQQYDKPGSVSLCNYVVKDNQIYRTCYGRYTGF 201

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + ++    +L L RK    LPD+EL  +  D P++    +      GP P+F + G   T
Sbjct: 202 K-MYMDAILLSLARK--TLLPDMELFVNLGDWPLVTKGGHR--RTTGPYPIFSWCGSEDT 256

Query: 194 MDIVFPDWSFWGWAEINIKPWESL-LRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
            DIV P +      E +++    + L  L     G  W ++ P A+W+G     E R DL
Sbjct: 257 FDIVMPTYDL---VEASLEAMSRVSLDMLSVQRKGVPWEEKVPKAFWRGRDACRE-RLDL 312

Query: 253 LTCNLSDKH 261
           +   LS +H
Sbjct: 313 V--GLSQQH 319


>gi|326475589|gb|EGD99598.1| hypothetical protein TESG_06947 [Trichophyton tonsurans CBS 112818]
          Length = 419

 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 47/265 (17%), Positives = 95/265 (35%), Gaps = 53/265 (20%)

Query: 5   FAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKTSQ 64
           F FS +     +  N SR    +++   P       IT P +C  +         P    
Sbjct: 14  FTFSTLQAQKLAAEN-SRLHRAVLMDANP-------ITAPASCQHH------AGNPGCQW 59

Query: 65  TQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERAN-------------QTA 111
           + +  ++  +     C   F        P     I R   ERA              +  
Sbjct: 60  SHKRDANTLVFGAERCQSAF--------PGLFAEIDRAKQERAQRPIGLAEIDSVIPKNG 111

Query: 112 HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSR 171
           + R ++ + ++Y+     +I +R++ T+  + + +   P  LP++E +F+ DD+      
Sbjct: 112 YIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK------ 165

Query: 172 DYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
                      P+ ++   R   D    + PD+ +W W E  +   + +    +   +  
Sbjct: 166 ---------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREEHTW 216

Query: 229 NWIDREPYAYWKGNPFVAETRRDLL 253
            W  + P  +W+G     E R  L+
Sbjct: 217 AWPKKIPRLFWRGATMDLEVREKLI 241


>gi|148664476|gb|EDK96892.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_b [Mus musculus]
          Length = 432

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)

Query: 80  CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           C +    I +DL+      P K+         +     H+   L +NKVYI  + + +  
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   R+PD+E   +  D P+ + +     +N    P+F + G   +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     + N G  W  +   A W+G     ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317

Query: 254 T-CNLSDKH 261
               LS KH
Sbjct: 318 ELVKLSRKH 326


>gi|425781966|gb|EKV19900.1| hypothetical protein PDIG_00670 [Penicillium digitatum PHI26]
 gi|425784005|gb|EKV21816.1| hypothetical protein PDIP_02930 [Penicillium digitatum Pd1]
          Length = 436

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 60/137 (43%), Gaps = 14/137 (10%)

Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYPGR--LPDL 156
           +++  R    A  R I+   ++YI  Y     T  R   T+  + + L  +P R  LP +
Sbjct: 103 KELDSRTVDDAMVRGIIDRGELYIVDYAPMPMTASRARATLSSLHRALTAFPDRHLLPSI 162

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEINIKPWE 215
           E +F  +D            +   P P++ YS  D  T   + PD+ +W W E++I P+ 
Sbjct: 163 EFIFTTED---------FAEDTTAPSPIWAYSKRDSDTSVWLMPDFGYWAWPEVHIGPYH 213

Query: 216 SLLRELKEGNNGRNWID 232
            + R +   ++G    D
Sbjct: 214 EVRRRIAAIDDGETAAD 230


>gi|302660285|ref|XP_003021823.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
 gi|291185740|gb|EFE41205.1| DUF821 domain protein [Trichophyton verrucosum HKI 0517]
          Length = 420

 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)

Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
           +  + R ++ + ++Y+     +I +R++ T+  + + +   P  LP++E +F+ DD+   
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDKV-- 167

Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
                         P+ ++   R   D    + PD+ +W W E  +   + +    +   
Sbjct: 168 -------------DPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214

Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
           +   W  + P  +W+G     E R  L+
Sbjct: 215 HTWAWSKKIPQLFWRGATMDLEVREKLI 242


>gi|149730404|ref|XP_001493278.1| PREDICTED: KDEL motif-containing protein 1 [Equus caballus]
          Length = 502

 Score = 45.8 bits (107), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 80/190 (42%), Gaps = 22/190 (11%)

Query: 79  TCPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
           +CP+    I  DL+      P K+         +     H+   L +NKVYI  + + + 
Sbjct: 152 SCPETITQIQRDLAHFPTVDPEKIATEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVG 209

Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            R +F    +L L RK   ++PD E   +  D P+ + +  S  +     P+F + G   
Sbjct: 210 FR-IFMDAILLSLTRKV--KMPDAEFFVNLGDWPLEKKKSSSHIH-----PIFSWCGSTD 261

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
           + DIV P +     + +      SL     +GN G  W  +   A W+G     ++R++ 
Sbjct: 262 SKDIVMPTYDLTD-SVLETMGRVSLDMMSVQGNTGPPWESKNSTAVWRGR----DSRKER 316

Query: 253 LT-CNLSDKH 261
           L    LS KH
Sbjct: 317 LELVKLSRKH 326


>gi|452843550|gb|EME45485.1| hypothetical protein DOTSEDRAFT_71260 [Dothistroma septosporum
           NZE10]
          Length = 456

 Score = 45.8 bits (107), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 36/154 (23%), Positives = 67/154 (43%), Gaps = 18/154 (11%)

Query: 93  PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGR 152
            W+  GI R ++   NQ      I+  + V  H ++     R V T+  + + +  Y G+
Sbjct: 111 AWRGDGIVRALVHD-NQL----YIIDAHGVVDHNHR----PRTVATLHSLHRAISAYQGK 161

Query: 153 LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIK 212
           LP++E  F   D  +   RD +G          ++    W M    PD+  WGW ++ ++
Sbjct: 162 LPNIEFTFTVHDAAL---RDPAGNETTWAYTRRKHQEKLWLM----PDFGLWGWPDVGLR 214

Query: 213 PWESLLRELKEGNNGRNWIDREPYAYWKGNPFVA 246
            +  L   L+   +   ++D+ P   W+G+  V 
Sbjct: 215 SFAELQEVLEHEED--EFVDKVPKLVWRGSVAVG 246


>gi|358058341|dbj|GAA95860.1| hypothetical protein E5Q_02517 [Mixia osmundae IAM 14324]
          Length = 492

 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 32/159 (20%), Positives = 72/159 (45%), Gaps = 11/159 (6%)

Query: 98  GITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
           GI    L+      + R+I+ +N+VY+ K +    ++R    +  I + +      +PD+
Sbjct: 178 GIQERYLDTTEHGMNARVIIKSNRVYLRKPFNAGAKSRTQALLAAIEEAVLSSIEPIPDV 237

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWE 215
           E + + +DR      D +  +     P+   S  +    + + PD+ F+ W E ++  + 
Sbjct: 238 EFVINSEDRV-----DAATAHT----PILGMSRKKQQGHVWLIPDFRFYAWPEPHVGTYP 288

Query: 216 SLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
            +  ++      + W  +    +W+GNP +   R++L+T
Sbjct: 289 DVQDQIYALEATQQWHHKRAKLFWRGNPPLHPLRQELMT 327


>gi|326483787|gb|EGE07797.1| DUF821 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 420

 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)

Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
           +  + R ++ + ++Y+     +I +R++ T+  + + +   P  LP++E +F+ DD+   
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--- 166

Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
                         P+ ++   R   D    + PD+ +W W E  +   + +    +   
Sbjct: 167 ------------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214

Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
           +   W  + P  +W+G     E R  L+
Sbjct: 215 HTWAWPKKIPRLFWRGATMDLEVREKLI 242


>gi|71020735|ref|XP_760598.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
 gi|46100486|gb|EAK85719.1| hypothetical protein UM04451.1 [Ustilago maydis 521]
          Length = 488

 Score = 45.8 bits (107), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 65/154 (42%), Gaps = 14/154 (9%)

Query: 111 AHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
           +H R+++ NN+VY+  ++QS  TR    +  + Q +     +LPD E     +D   +  
Sbjct: 167 SHARVVIHNNRVYLKYFQQSAFTRINSALALLFQSVVASREKLPDAEFCLSANDWGSMGK 226

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
             +S   N   P L+    D W M    PD+ F+ W E  I  +     +         W
Sbjct: 227 --FSLDRN---PSLY----DVWLM----PDYGFYSWPEPGIGSYTEHREKTLAIEQSTPW 273

Query: 231 IDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
             +    +W+G+  V    R  L    ++ H+WN
Sbjct: 274 DSKIAKLFWRGSMGVGTADRKALLA-AAENHEWN 306


>gi|315044371|ref|XP_003171561.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311343904|gb|EFR03107.1| DUF821 domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 422

 Score = 45.4 bits (106), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 28/145 (19%), Positives = 64/145 (44%), Gaps = 12/145 (8%)

Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
           +  + R ++ + ++Y+     +I +R+  T+  + + +   P  LP++E +F+ DD+  +
Sbjct: 112 KNGYIRAMIYDQQLYVIAVNGTIYSREYATLHALHRAIVSSPDPLPNIEFVFNTDDK--V 169

Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
            S    G   +          + W M    PD+ +W W E  +   + +  + ++     
Sbjct: 170 DSVAQWGYARR------EEDTNLWLM----PDFGYWSWPETKVGTMQEVRMKAEQEEQTW 219

Query: 229 NWIDREPYAYWKGNPFVAETRRDLL 253
            W ++ P  +W+G     E R  L+
Sbjct: 220 TWSEKIPKLFWRGATMDLEVREKLI 244


>gi|12963767|ref|NP_076134.1| KDEL motif-containing protein 1 precursor [Mus musculus]
 gi|81881896|sp|Q9JHP7.1|KDEL1_MOUSE RecName: Full=KDEL motif-containing protein 1; AltName:
           Full=Endoplasmic reticulum resident protein 58; Short=ER
           protein 58; Short=ERp58; Flags: Precursor
 gi|9621686|emb|CAC00650.1| ER protein 58 [Mus musculus]
 gi|26331010|dbj|BAC29235.1| unnamed protein product [Mus musculus]
 gi|74213118|dbj|BAE41698.1| unnamed protein product [Mus musculus]
 gi|148664475|gb|EDK96891.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_a [Mus musculus]
          Length = 502

 Score = 45.4 bits (106), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)

Query: 80  CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           C +    I +DL+      P K+         +     H+   L +NKVYI  + + +  
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   R+PD+E   +  D P+ + +     +N    P+F + G   +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     + N G  W  +   A W+G     ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317

Query: 254 T-CNLSDKH 261
               LS KH
Sbjct: 318 ELVKLSRKH 326


>gi|12841643|dbj|BAB25294.1| unnamed protein product [Mus musculus]
          Length = 502

 Score = 45.4 bits (106), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 79/189 (41%), Gaps = 22/189 (11%)

Query: 80  CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           C +    I +DL+      P K+         +     H+   L +NKVYI  + + +  
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   R+PD+E   +  D P+ + +     +N    P+F + G   +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     + N G  W  +   A W+G     ++R++ L
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERL 317

Query: 254 T-CNLSDKH 261
               LS KH
Sbjct: 318 ELVKLSRKH 326


>gi|354494153|ref|XP_003509203.1| PREDICTED: protein O-glucosyltransferase 1 [Cricetulus griseus]
          Length = 401

 Score = 45.4 bits (106), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 82/190 (43%), Gaps = 26/190 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I    
Sbjct: 47  QINRALENYEPCSSRNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII---K 102

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           K    +      +R       IL+++R    RLPD+E++ +  D P +       P    
Sbjct: 103 KRLFREDDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 151

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
           P  P+F +S      DI++P W+FW  G A   + P     W+    +L        W  
Sbjct: 152 PTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEK 211

Query: 233 REPYAYWKGN 242
           +   AY++G+
Sbjct: 212 KNSTAYFRGS 221


>gi|327267957|ref|XP_003218765.1| PREDICTED: KDEL motif-containing protein 1-like [Anolis
           carolinensis]
          Length = 502

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 73/167 (43%), Gaps = 13/167 (7%)

Query: 79  TCPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTR 134
           +CP     IH DL+ +       I +++ +R  Q        + +NKVYI  + + +  R
Sbjct: 153 SCPQSIPQIHRDLAHFPAVDPEKIAKEIPQRFGQRQSLCHYTIKDNKVYIKTHGEHVGFR 212

Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
            +F    +L L RK   ++PDLE   +  D P+ + +     + +   P+F + G   + 
Sbjct: 213 -IFMDAILLSLTRKV--KMPDLEFFVNLGDWPLEKKK-----SPQDIHPIFSWCGSNDSK 264

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           DIV P +     + +      SL     + N G  W ++   A W+G
Sbjct: 265 DIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWEEKNATAIWRG 310


>gi|330919750|ref|XP_003298742.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
 gi|311327917|gb|EFQ93162.1| hypothetical protein PTT_09542 [Pyrenophora teres f. teres 0-1]
          Length = 410

 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 57/115 (49%), Gaps = 14/115 (12%)

Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
           +PD+E     DD P  RS+D   + G   K  P  +R   D W M    P++++W W   
Sbjct: 134 VPDIEFALSLDDLPR-RSKDKGTFFGYTRKEGPE-YR---DIWMM----PNYAYWSWNYT 184

Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
           +   W ++ +E+++      W  ++P   W+G   +A+ R++L+   +S+   W+
Sbjct: 185 HAPSWNTIRKEIQQKETEVPWSKKDPRVVWRGKVKMAKLRQELI--RVSEGQSWS 237


>gi|115401698|ref|XP_001216437.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114190378|gb|EAU32078.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 437

 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 17/139 (12%)

Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKYPGR--LPDL 156
           +D+   A      R I+ N ++YI  Y     T  R   T+  + + L  +P R  LP +
Sbjct: 106 QDLDGLAVDDGMVRGIIENGELYIVDYGAMPFTFSRAKATLHSLQRALAAFPDRHLLPSV 165

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWE 215
           E +F  DD        YS     G  P++ YS       + + PD+ +W W E+ I P++
Sbjct: 166 EFVFTTDD--------YS----TGVGPIWSYSKREEDESVWLMPDFGYWSWPEVKIGPYK 213

Query: 216 SLLRELKEGNNGRNWIDRE 234
            + R +   +NG   +  E
Sbjct: 214 DIRRRIAAVDNGETTVTGE 232


>gi|281332155|ref|NP_001094122.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 precursor [Rattus norvegicus]
 gi|149060499|gb|EDM11213.1| similar to RIKEN cDNA 9630046K23 [Rattus norvegicus]
          Length = 392

 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 43/187 (22%), Positives = 79/187 (42%), Gaps = 20/187 (10%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVY 123
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    +    R +  + ++ 
Sbjct: 38  QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVR----RRLGTHYQII 92

Query: 124 IHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP- 182
            H+  +           G+   + +   RLPD+E++ +  D P +       P    P  
Sbjct: 93  KHRLFREDDCMFPSRCSGVEHFILEVIRRLPDMEMVINVRDYPQV-------PKWMEPTI 145

Query: 183 PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREP 235
           P+F +S      DI++P W+FW  G A   + P     W+    +L        W  +  
Sbjct: 146 PVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNS 205

Query: 236 YAYWKGN 242
            AY++G+
Sbjct: 206 TAYFRGS 212


>gi|351697696|gb|EHB00615.1| KTEL motif-containing protein 1 [Heterocephalus glaber]
          Length = 392

 Score = 45.1 bits (105), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 46/194 (23%), Positives = 80/194 (41%), Gaps = 34/194 (17%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+          H+++I    
Sbjct: 38  QINRALENYEPCSSQNCSCYQGVIEEDLTPFR-GGISRKMMAEVVSRKLGTHYQII---- 92

Query: 121 KVYIHKYKQSIQTRDVFTI----WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
                  K+ +   D         G+   + +  G+LPD+E++ +  D P +       P
Sbjct: 93  -------KKRLYREDDCMFPSRCSGVEHFILEVIGQLPDMEMVINVRDYPQV-------P 138

Query: 177 NNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGR 228
               P  P+F +S      DI++P W+FW  G A   I P     W+    +L       
Sbjct: 139 RWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQW 198

Query: 229 NWIDREPYAYWKGN 242
            W  +   AY++G+
Sbjct: 199 PWKKKNSTAYFRGS 212


>gi|327297120|ref|XP_003233254.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
 gi|326464560|gb|EGD90013.1| hypothetical protein TERG_06247 [Trichophyton rubrum CBS 118892]
          Length = 420

 Score = 45.1 bits (105), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 27/148 (18%), Positives = 63/148 (42%), Gaps = 18/148 (12%)

Query: 109 QTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVI 168
           +  + R ++ + ++Y+     +I +R++ T+  + + +   P  LP++E +F+ DD+   
Sbjct: 110 KNGYIRAMIYDQQLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDK--- 166

Query: 169 RSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGN 225
                         P+ ++   R   D    + PD+ +W W E  +   + +    +   
Sbjct: 167 ------------VDPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREE 214

Query: 226 NGRNWIDREPYAYWKGNPFVAETRRDLL 253
           +   W  + P  +W+G     E R  L+
Sbjct: 215 HTWAWPKKIPQLFWRGATMDLEVRERLI 242


>gi|452004927|gb|EMD97383.1| hypothetical protein COCHEDRAFT_1190251 [Cochliobolus
           heterostrophus C5]
          Length = 432

 Score = 45.1 bits (105), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 55/115 (47%), Gaps = 14/115 (12%)

Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
           +P++E     DD P  RSR    + G   K  P       D W M    P++++W W   
Sbjct: 156 VPNIEFSLSLDDLPR-RSRKEGTFFGYTRKDTPEY----KDIWMM----PNYAYWAWNYT 206

Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
           +   W S+ +E+++      W  ++P   W+G   +A+ R++L+   +S+  DW+
Sbjct: 207 HAPSWNSIRKEIEQKEKELPWHKKDPRVVWRGKIKMAKLRKELV--KVSEGKDWS 259


>gi|198433875|ref|XP_002126603.1| PREDICTED: similar to KTEL motif-containing protein 1 [Ciona
           intestinalis]
          Length = 398

 Score = 45.1 bits (105), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 56/251 (22%), Positives = 109/251 (43%), Gaps = 44/251 (17%)

Query: 67  ESISDYSIPPTS---TCPDYFRWIHEDLSPWKV-TGITRDMLERA-NQTAHFRLILVNNK 121
           ++IS+ +   TS    C  +   I  DL  W+   GIT + ++R   ++ H+++I  ++K
Sbjct: 42  DAISEAAANYTSCGGNCTCHSDVITSDLKLWRERGGITNEDMKRGLERSVHYQII--DHK 99

Query: 122 VYIHKYKQSIQTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRP-VIRSRDYSGPNN 178
           +Y        Q + +F     GI   + +    LPD+EL  +  D P V++   Y     
Sbjct: 100 LYR-------QDKCMFPSRCSGIEHFILEIINDLPDMELGINVHDWPQVMKHSPY----- 147

Query: 179 KGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNW 230
             P P+  +S   +   DI++P W+FW             +  W+ + ++LK+ +N   W
Sbjct: 148 --PMPILSFSKVAKEHQDIMYPAWTFWAGGPAVWPIYRNGLGRWDLMRKDLKKKDNEFPW 205

Query: 231 IDREPYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNAR---LYVQVISFIYIT 278
             ++   +++G       +P V  +R   DL+    +    W ++   L  +    +++ 
Sbjct: 206 EKKQNIGFFRGSRTSSERDPLVLLSRENPDLVDAQYTKNQAWKSKKDTLGAEPAEIVHLL 265

Query: 279 HLCSKIPGFNF 289
             C     FNF
Sbjct: 266 DHCQYKYLFNF 276


>gi|194744002|ref|XP_001954487.1| GF18287 [Drosophila ananassae]
 gi|190627524|gb|EDV43048.1| GF18287 [Drosophila ananassae]
          Length = 382

 Score = 45.1 bits (105), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 68/162 (41%), Gaps = 25/162 (15%)

Query: 55  CPTNY--PKTSQTQ-------ESISDY--SIPPTSTCPDYFRWIHEDLSPWKVTGITRDM 103
           CP N   P+++ T        E+++DY    P    C  +   I +DL P+   GI++DM
Sbjct: 5   CPANLLSPRSNSTLDLVPNILEALADYRPCEPGDPLCLCHAATITKDLEPYSDKGISQDM 64

Query: 104 LERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCD 163
           + ++ +      ++             ++   V  +      L +  G LPDLE + +  
Sbjct: 65  ISQSKRQGTLYKVIRRRIFRQEHCSHPLRCSSVEDV------LLEIAGDLPDLEFVLNVC 118

Query: 164 DRPVIRSRDYSGPNNKG-PPPLFRYSGDRWTMDIVFPDWSFW 204
           D P +       P   G   P+F +S     +DI+ P WSFW
Sbjct: 119 DWPQV-------PFLSGLSGPVFSHSTTALHLDIMCPAWSFW 153


>gi|348511563|ref|XP_003443313.1| PREDICTED: KDEL motif-containing protein 1-like [Oreochromis
           niloticus]
          Length = 538

 Score = 44.7 bits (104), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 54/226 (23%), Positives = 91/226 (40%), Gaps = 33/226 (14%)

Query: 80  CPDYFRWIHEDLSPWKVTGI---TRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
           CP  F  I  DLS +         +++ +R  Q        + +NK+Y+  + + +  R 
Sbjct: 190 CPQSFAQIERDLSFYPSVDPDHNAQEIPQRFGQRQSLCHYTVKDNKIYVRTFGEHVGFR- 248

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP----PLFRYSGDR 191
           +F    +L L RK   RLPD+E   +  D P+           + PP    P+F + G  
Sbjct: 249 IFMDAILLSLTRKV--RLPDVEFFVNLGDWPL---------EKRKPPEKLHPIFSWCGSN 297

Query: 192 WTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
            T DIV P +     + +      SL     + N G  W ++   A+W+G     E R +
Sbjct: 298 NTRDIVMPTYDL-TESVLETMGRVSLDMMSVQANTGPPWPEKNATAFWRGRDSRQE-RLE 355

Query: 252 LLTCNLSDKHDWNARLYVQVISFIYITH-------LCSKIPGFNFF 290
           L+  + +  H  +A       +F +  H       L   +  F+FF
Sbjct: 356 LVKLSRAHPHIIDAAF----TNFFFFKHDESLYGPLVKHVSFFDFF 397


>gi|26334375|dbj|BAC30905.1| unnamed protein product [Mus musculus]
          Length = 392

 Score = 44.7 bits (104), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    + +C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRALENYEPCSSQSCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           +++        +  D        G+   + +   RLPD+E++ +  D P +       P 
Sbjct: 95  RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   + P     W+    +L        
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WEKKNSTAYFRGS 212


>gi|26343369|dbj|BAC35341.1| unnamed protein product [Mus musculus]
          Length = 432

 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 80/187 (42%), Gaps = 18/187 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTG---ITRDMLERANQTAHF-RLILVNNKVYIHKYKQSIQTRD 135
           C +    I +DL+ +       I  ++ +R  Q        L +NKVYI  + + +  R 
Sbjct: 153 CSETISQIQKDLAHFPTVDPEEIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +F    +L L RK   R+PD+E   +  D P+ + +     +N    P+F + G   + D
Sbjct: 212 IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTESRD 264

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT- 254
           IV P +     + +      SL     + N G  W  +   A W+G     ++R++ L  
Sbjct: 265 IVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGR----DSRKERLEL 319

Query: 255 CNLSDKH 261
             LS KH
Sbjct: 320 VKLSRKH 326


>gi|327268710|ref|XP_003219139.1| PREDICTED: protein O-glucosyltransferase 1-like [Anolis
           carolinensis]
          Length = 391

 Score = 44.7 bits (104), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 47/196 (23%), Positives = 87/196 (44%), Gaps = 26/196 (13%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILVN 119
            Q + ++ +Y    T  C  Y +   +DL P++  GI+ ++L  A       H+++I   
Sbjct: 36  GQIERAVENYRPCVTENCRCYQKVREQDLDPFQ-EGISEELLSEAISRRLGTHYQII--A 92

Query: 120 NKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
            K+Y  ++      R      G+   + +    LPD+E++ +  D P +       P   
Sbjct: 93  KKLY-REHDCMFPAR----CSGVEHFILEIIDELPDMEMVINVRDYPQV-------PKFM 140

Query: 180 GPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWI 231
            P  P+F +S      DI++P W+FW  G A   I P     W+ + ++LK  +    W 
Sbjct: 141 KPKVPIFSFSKTSEYYDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRQDLKSSSEKWPWR 200

Query: 232 DREPYAYWKGNPFVAE 247
            +   A+++G+   AE
Sbjct: 201 AKISKAFFRGSRTSAE 216


>gi|344257139|gb|EGW13243.1| KTEL motif-containing protein 1 [Cricetulus griseus]
          Length = 333

 Score = 44.7 bits (104), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 66/145 (45%), Gaps = 19/145 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I    
Sbjct: 52  QINRALENYEPCSSRNCSCYRGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII---K 107

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           K    +      +R       IL+++R    RLPD+E++ +  D P +       P    
Sbjct: 108 KRLFREDDCMFPSRCSGVEHFILEVIR----RLPDMEMVINVRDYPQV-------PKWME 156

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW 204
           P  P+F +S      DI++P W+FW
Sbjct: 157 PTIPVFSFSKTSEYHDIMYPAWTFW 181


>gi|426245590|ref|XP_004016593.1| PREDICTED: KDEL motif-containing protein 2 [Ovis aries]
          Length = 557

 Score = 44.3 bits (103), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 39/169 (23%), Positives = 77/169 (45%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CPD    I +D + +    + + ++E   +    R  +V+  +   H Y++S+    
Sbjct: 203 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 262

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P   GP P+  + G   
Sbjct: 263 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 317

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + D++ P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 318 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 365


>gi|148665570|gb|EDK97986.1| RIKEN cDNA 9630046K23, isoform CRA_b [Mus musculus]
          Length = 372

 Score = 44.3 bits (103), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 18  QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 74

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           +++        +  D        G+   + +   RLPD+E++ +  D P +       P 
Sbjct: 75  RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 119

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   + P     W+    +L        
Sbjct: 120 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWP 179

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 180 WEKKNSTAYFRGS 192


>gi|26348175|dbj|BAC37727.1| unnamed protein product [Mus musculus]
          Length = 502

 Score = 44.3 bits (103), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 75/188 (39%), Gaps = 20/188 (10%)

Query: 80  CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           C +    I +DL+      P K+         +     H+   L +NKVYI  + + +  
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   R+PD+E   +  D P+ + +     +N    P+F + G   +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            DIV P +     + +      SL     + N G  W  +   A W+G       +  L 
Sbjct: 263 RDIVMPTYDLTD-SVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRD---SHKERLE 318

Query: 254 TCNLSDKH 261
              LS KH
Sbjct: 319 LVKLSRKH 326


>gi|451855485|gb|EMD68777.1| hypothetical protein COCSADRAFT_187637 [Cochliobolus sativus
           ND90Pr]
          Length = 432

 Score = 44.3 bits (103), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 12/114 (10%)

Query: 153 LPDLELMFDCDDRPVIRSRD--YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEIN 210
           +P++E     DD P    +D  + G   K  P  +R   D W M    P++++W W   +
Sbjct: 156 IPNIEFSLSLDDLPRRSRKDGTFFGYTRKDTPE-YR---DIWMM----PNYAYWAWNYTH 207

Query: 211 IKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWN 264
              W S+ +E+++      W  ++P   W+G   +A+ R++L+   +S+  +W+
Sbjct: 208 APSWNSIRKEIEQKEKDLPWDKKDPRVVWRGKIKMAKLRKELV--KVSEGKEWS 259


>gi|270007100|gb|EFA03548.1| hypothetical protein TcasGA2_TC013552 [Tribolium castaneum]
          Length = 399

 Score = 44.3 bits (103), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 56/225 (24%), Positives = 101/225 (44%), Gaps = 42/225 (18%)

Query: 47  CVKNQTQTCPTN----YPKTSQTQ---------ESISDYSIPPTSTCPDYFRWIHEDLSP 93
           C+K  ++ C  +    Y K + T+          +  +YS    + C  Y   I +DL  
Sbjct: 23  CLKENSEQCNASKVNMYSKEANTKYLKYLDLIKRAKENYSPCDNTKCGCYSSQISDDLKI 82

Query: 94  WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIW-----GILQLLRK 148
           +K  GIT  ++++  +T   +  ++++K+Y          RD   ++     GI   L K
Sbjct: 83  FK-KGITPQLIDKV-KTKGTKYQIIDHKLY----------RDKNCMFPARCAGIEHFLLK 130

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW-GWA 207
              +LPD+EL+ +  D P I  +DY         P+F +S      DI++P W+FW G  
Sbjct: 131 LLPKLPDMELIINTRDWPQIH-KDYGVFG-----PVFSFSKTSDYSDIMYPAWAFWEGGP 184

Query: 208 EINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
            I++ P     W++    L +  N   W ++ P  +++G+   AE
Sbjct: 185 AISLYPRGIGRWDTHRDLLGKKGNETLWDEKIPKGFFRGSRTSAE 229


>gi|27369505|ref|NP_759012.1| protein O-glucosyltransferase 1 precursor [Mus musculus]
 gi|110832796|sp|Q8BYB9.2|PGLT1_MOUSE RecName: Full=Protein O-glucosyltransferase 1; AltName:
           Full=CAP10-like 46 kDa protein; AltName: Full=KTEL
           motif-containing protein 1; Flags: Precursor
 gi|20071120|gb|AAH26809.1| KTEL (Lys-Tyr-Glu-Leu) containing 1 [Mus musculus]
 gi|26327463|dbj|BAC27475.1| unnamed protein product [Mus musculus]
 gi|26331050|dbj|BAC29255.1| unnamed protein product [Mus musculus]
 gi|26331242|dbj|BAC29351.1| unnamed protein product [Mus musculus]
 gi|148665569|gb|EDK97985.1| RIKEN cDNA 9630046K23, isoform CRA_a [Mus musculus]
          Length = 392

 Score = 44.3 bits (103), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 82/193 (42%), Gaps = 32/193 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           +++        +  D        G+   + +   RLPD+E++ +  D P +       P 
Sbjct: 95  RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   + P     W+    +L        
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWP 199

Query: 230 WIDREPYAYWKGN 242
           W  +   AY++G+
Sbjct: 200 WEKKNSTAYFRGS 212


>gi|358058116|dbj|GAA96095.1| hypothetical protein E5Q_02756 [Mixia osmundae IAM 14324]
          Length = 486

 Score = 44.3 bits (103), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 34/160 (21%), Positives = 70/160 (43%), Gaps = 19/160 (11%)

Query: 96  VTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPD 155
           + G+T   +E + +  H RL++ + ++++ +++ + Q+R    +  I   +      +PD
Sbjct: 170 LGGVTLSDVEHSGEIWHARLMIKDGRLFVKRFRPAYQSRCQAVMAAIHDAIITSLEPVPD 229

Query: 156 LELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIK 212
           +E +   DD         + P       L R       +D+   +  D+ F  W E  IK
Sbjct: 230 IEFVIGLDD---------TAPFEPATWGLAR------RLDLPAWLVIDYGFNAWPEPMIK 274

Query: 213 PWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDL 252
            W + L ++K  N    W  +    +W+G  +++E R  L
Sbjct: 275 GWHTFLADVKSVNAQMTWRQKAKKLFWRG-VYLSEYREQL 313


>gi|82658302|ref|NP_001032511.1| KTEL motif-containing protein 1 precursor [Danio rerio]
 gi|81097746|gb|AAI09459.1| Zgc:123318 [Danio rerio]
          Length = 389

 Score = 44.3 bits (103), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 57/243 (23%), Positives = 101/243 (41%), Gaps = 40/243 (16%)

Query: 48  VKNQTQTCPTNYPKTSQT-----QESISDYSIPPTSTCPDYFRWIHEDLSPWK---VTGI 99
           ++   + C ++  K  QT      E+  +Y       C  Y   + +DL P+K     G+
Sbjct: 14  LQTGVEFCVSDNAKQWQTFIRKITEATKNYQPCSQDNCTCYQSVLKDDLRPFKNGISEGL 73

Query: 100 TRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELM 159
             D + R   T H+++I  + K+Y    +QS       +  G+   + K   RLPDLE++
Sbjct: 74  MADTVNRGVGT-HYQII--SKKLY---REQSCMFPARCS--GVEHFILKVIDRLPDLEVV 125

Query: 160 FDCDDRPVIRSRDYSGPNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP--- 213
            +  D P +       P    P  P+  +S  +   DI++P W+FW  G A   I P   
Sbjct: 126 INVRDYPQV-------PGWIQPVLPVLSFSKTKDYQDIMYPAWTFWEGGPAVWPIYPTGL 178

Query: 214 --WESLLRELKEGNNGRNWIDREPYAYWKG-------NPFVAETRR--DLLTCNLSDKHD 262
             W+ +  +LK       W  + P  +++G       +P +  +R   DL+    +    
Sbjct: 179 GRWDLMRDDLKRSVEQWPWKKKSPKGFFRGSRTSSERDPLILLSRAAPDLVDAEYTKNQA 238

Query: 263 WNA 265
           W +
Sbjct: 239 WKS 241


>gi|145250763|ref|XP_001396895.1| hypothetical protein ANI_1_1376134 [Aspergillus niger CBS 513.88]
 gi|134082418|emb|CAK42432.1| unnamed protein product [Aspergillus niger]
          Length = 442

 Score = 43.9 bits (102), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 62/124 (50%), Gaps = 17/124 (13%)

Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
           R I+ + ++Y+  +    +  TR   T+  + + L  +P R  LPD+E         V+ 
Sbjct: 125 RGIVHHGELYVVDFGAMPATFTRGEATLHALHRALASFPDRSSLPDIEF--------VLT 176

Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
           + DYS   N+GP  ++ YS      ++ + PD+ +W W E+ I P++ + R +   ++G 
Sbjct: 177 TEDYS--TNEGP--VWSYSKQDEQENVWLMPDFGYWSWPEVKIGPYKDIRRRIAAVDDGE 232

Query: 229 NWID 232
             ID
Sbjct: 233 VTID 236


>gi|350636309|gb|EHA24669.1| hypothetical protein ASPNIDRAFT_182202 [Aspergillus niger ATCC
           1015]
          Length = 442

 Score = 43.9 bits (102), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 62/124 (50%), Gaps = 17/124 (13%)

Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
           R I+ + ++Y+  +    +  TR   T+  + + L  +P R  LPD+E         V+ 
Sbjct: 125 RGIVHHGELYVVDFGAMPATFTRGKATLHALHRALASFPDRSSLPDIEF--------VLT 176

Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGR 228
           + DYS   N+GP  ++ YS      ++ + PD+ +W W E+ I P++ + R +   ++G 
Sbjct: 177 TEDYS--TNEGP--VWSYSKRDEQQNVWLMPDFGYWSWPEVKIGPYKDIRRRIAAVDDGE 232

Query: 229 NWID 232
             ID
Sbjct: 233 VTID 236


>gi|47216704|emb|CAG00978.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 392

 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 42/181 (23%), Positives = 79/181 (43%), Gaps = 19/181 (10%)

Query: 56  PTNYPKTSQTQESISDY---SIPPTS-TCPDYFRWIHEDLSPWKVTGITRDMLERANQTA 111
           P +  +  + +++ISD     IP +S  C  +   I +DL P+K   I+ +++    Q  
Sbjct: 22  PASGKRWKKIKDNISDAVKGHIPCSSDNCSCHLSVIQDDLRPFK-GKISENLMAATIQ-- 78

Query: 112 HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSR 171
             R +  + ++  HK  +           G+   + +  GRLPD+E++ +  D P +   
Sbjct: 79  --RGVGTHYQIIGHKLYREENCMFPARCSGVEHFILQLIGRLPDMEMVVNVRDYPQV--- 133

Query: 172 DYSGPN-NKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNW 230
               P       P+F +S      DI++P W+FW     +  PW+   +E +    G  W
Sbjct: 134 ----PKWVDSLLPVFSFSKTADYQDIMYPAWTFWEGGPADQWPWKQ--KETRGFFRGSRW 187

Query: 231 I 231
           +
Sbjct: 188 V 188


>gi|390332934|ref|XP_783946.3| PREDICTED: KDEL motif-containing protein 1-like [Strongylocentrotus
           purpuratus]
          Length = 512

 Score = 43.9 bits (102), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 92/211 (43%), Gaps = 31/211 (14%)

Query: 46  NC---VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRD 102
           NC   V +Q Q     Y ++    ++ +D+SI P+         +H+D+           
Sbjct: 146 NCDCPVDDQDQWRSDMYCRSGVHPQTQTDFSIFPSID----LGTLHQDVES--------- 192

Query: 103 MLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDC 162
              + +   H+ +I  NNKVY  K   SI    +F+   +L L RK   R+PD+E   + 
Sbjct: 193 RFAKHHSLCHYSII--NNKVY-RKTLGSIVGFKMFSDAFLLSLTRKV--RVPDVEFFINL 247

Query: 163 DDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELK 222
            D P +  RD   P ++ P P+  + G   T DIV P +     + +      SL     
Sbjct: 248 GDWP-LEKRD---PEDE-PLPILSWCGSTDTRDIVLPTYDI-TESTLETMGRVSLDMMSV 301

Query: 223 EGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
           + N G  W ++   A+W+G     ++RR+ L
Sbjct: 302 QANTGPKWENKTEKAFWRGR----DSRRERL 328


>gi|195443900|ref|XP_002069627.1| GK11623 [Drosophila willistoni]
 gi|194165712|gb|EDW80613.1| GK11623 [Drosophila willistoni]
          Length = 383

 Score = 43.9 bits (102), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 60/133 (45%), Gaps = 12/133 (9%)

Query: 75  PPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTR 134
           P  S C  Y   I  +L P+  TGIT+ M++++ +      + + +++Y     +S    
Sbjct: 45  PCESECECYALAIARNLRPYAGTGITKPMIDQSRRFGTLYKV-IGSRLY-----RSDNCA 98

Query: 135 DVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
                  + +LL      LPDLE + +  D P I     SG +     P+F YS     +
Sbjct: 99  YPSRCASVEELLLNIVRDLPDLEFVLNVRDWPQIHF--LSGLSG----PVFSYSSTDNFL 152

Query: 195 DIVFPDWSFWGWA 207
           DI+ P WSFW  A
Sbjct: 153 DIMCPAWSFWTSA 165


>gi|390342113|ref|XP_779911.3| PREDICTED: protein O-glucosyltransferase 1-like isoform 1
           [Strongylocentrotus purpuratus]
          Length = 391

 Score = 43.5 bits (101), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 48/197 (24%), Positives = 85/197 (43%), Gaps = 22/197 (11%)

Query: 60  PKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV-TGITRDMLERANQTAHFRLILV 118
           P  S  +E+ S Y    T  C  +   + EDLS W+   GITR  + +A        + +
Sbjct: 35  PYLSLIEEATSSYEACVTDDCSCHAGVMDEDLSVWEERGGITRADVVKAESRGTLYQV-I 93

Query: 119 NNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNN 178
           N+++Y  + K     R       IL++++K    LPD+E + +  D P  +S  Y+    
Sbjct: 94  NHRLY-REEKCMFPARCNGVEHFILRIIKK----LPDMEFVMNVRDWP--QSGKYT---- 142

Query: 179 KGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNW 230
             P P+  +S       DI++P W+FW             +  W+     + + +    W
Sbjct: 143 -DPIPVLSFSKVQSQHYDIMYPAWTFWEGGPAVWPLFPTGLGRWDLFRESIDKESQKLPW 201

Query: 231 IDREPYAYWKGNPFVAE 247
             +E  A+++G+   AE
Sbjct: 202 DTKEDKAFFRGSRTTAE 218


>gi|255930987|ref|XP_002557050.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211581669|emb|CAP79782.1| Pc12g01550 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 436

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 58/137 (42%), Gaps = 14/137 (10%)

Query: 101 RDMLERANQTAHFRLILVNNKVYIHKYKQS--IQTRDVFTIWGILQLLRKYPGR--LPDL 156
           +++  R       R I+   ++YI  Y       +R   T+  + + L  +P R  LP +
Sbjct: 103 KELDSRTVDDGMVRGIIDRGELYIVDYAPMPVTASRARATLNSLHRALTAFPDRHLLPSI 162

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG-DRWTMDIVFPDWSFWGWAEINIKPWE 215
           E +F  +D            +   P P++ YS  D  T   + PD+ +W W E+ I P+ 
Sbjct: 163 EFIFTTED---------FAEDTTTPSPIWSYSKRDSHTSVWLMPDFGYWAWPEVQIGPYH 213

Query: 216 SLLRELKEGNNGRNWID 232
            + R +   ++G    D
Sbjct: 214 EVRRRIAAIDDGETAAD 230


>gi|26348271|dbj|BAC37775.1| unnamed protein product [Mus musculus]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 66/148 (44%), Gaps = 25/148 (16%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLE---RANQTAHFRLILVNN 120
           Q   ++ +Y    +  C  Y   I EDL+P++  GI+R M+    R     H+++I   N
Sbjct: 38  QINRALENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQII--KN 94

Query: 121 KVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
           +++        +  D        G+   + +   RLPD+E++ +  D P +       P 
Sbjct: 95  RLF--------REDDCMFPSRCSGVEHFILEVIHRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW 204
              P  P+F +S      DI++P W+FW
Sbjct: 140 WMEPTIPVFSFSKTSEYHDIMYPAWTFW 167


>gi|189194465|ref|XP_001933571.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187979135|gb|EDU45761.1| hypothetical protein PTRG_03238 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 434

 Score = 43.1 bits (100), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 12/107 (11%)

Query: 153 LPDLELMFDCDDRPVIRSRD---YSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
           +P++E     DD P  RS+D   + G   K  P  +R   D W M    P++++W W   
Sbjct: 158 IPNIEFALSLDDLPR-RSKDKGTFFGYTRKEGPE-YR---DIWMM----PNYAYWSWNYT 208

Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCN 256
           +   W ++ +E+++      W  ++P   W+G   +A+ R++L+  +
Sbjct: 209 HAPSWNAIRKEIQQKETEVPWSKKDPRVVWRGKVKMAKLRQELIKVS 255


>gi|213514888|ref|NP_001133941.1| KDEL motif-containing protein 2 precursor [Salmo salar]
 gi|209155896|gb|ACI34180.1| KDEL motif-containing protein 2 precursor [Salmo salar]
          Length = 524

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 40/167 (23%), Positives = 73/167 (43%), Gaps = 13/167 (7%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLER----ANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           CP     I  D + +    + R + E     +N+       ++NN+VY     +    + 
Sbjct: 168 CPAEEPQIQRDFNAFPSIDLQRLLQEVPRRFSNRGGLIHYTVINNQVYRRSLGKYTDFK- 226

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPV-IRSRDYSGPNNKGPPPLFRYSGDRWTM 194
           +F+   +L L RK   RLPD+E   +  D P+  R  D    ++ GP P+  + G   T 
Sbjct: 227 MFSDEMLLSLARKV--RLPDVEFYINVGDWPMETRKAD----DDPGPVPIISWCGSTDTR 280

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           DI+ P +         ++   + L  + +GN G  W ++   A+++G
Sbjct: 281 DIILPTYDITHSTLETMRGVSNDLLSV-QGNTGPPWANKMEQAFFRG 326


>gi|258569104|ref|XP_002585296.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237906742|gb|EEP81143.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 483

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 40/189 (21%), Positives = 81/189 (42%), Gaps = 20/189 (10%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERAN------QTAHFRLILVNNKVYIHKYKQS 130
           T  C   F  ++ED+       I+R  + + +      Q    R ++ N  ++I + + +
Sbjct: 108 TGQCQTAFPGLYEDVDRAVRFWISRGGILKKSLNTVKLQNGMARAMIYNGNLFIIEARAA 167

Query: 131 IQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
            +      +  +  + R    R P++E +F  +D+      D SG  +    PL+  S  
Sbjct: 168 QEDHRRKILAVLSSIHRALGNRAPNIEFIFSVEDK----VEDVSGQGH----PLWVLSRK 219

Query: 191 RWTMDI-VFPDWSFWGWA--EINIKPWESLLRELKEGN--NGRNWIDREPYAYWKGN-PF 244
                + + PD+ FW W     NI P++ +++ +++ +  +   W  + P   W+G   F
Sbjct: 220 ATEKSVWLIPDFGFWAWGNPASNIGPYDQVVKRIEKFDLEDTMPWSSKTPRLVWRGKLSF 279

Query: 245 VAETRRDLL 253
             + RR LL
Sbjct: 280 APKLRRRLL 288


>gi|56118414|ref|NP_001008114.1| MGC89395 protein precursor [Xenopus (Silurana) tropicalis]
 gi|51703984|gb|AAH81318.1| MGC89395 protein [Xenopus (Silurana) tropicalis]
          Length = 385

 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 50/220 (22%), Positives = 97/220 (44%), Gaps = 29/220 (13%)

Query: 63  SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKV 122
           +  + +   +S P +     Y+R + EDL+P+  +GI+RD++++       R +  + ++
Sbjct: 30  NHIRRAAQSHSTPDSQNGTFYYRVLQEDLAPFG-SGISRDLMQKVLS----RKLGTHYQI 84

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
             H+  +  +        G+   L +    LPD++L+ +  D P +       P    P 
Sbjct: 85  INHRLYREEECMFPARCSGVEHFLLELLPDLPDMDLVVNVRDYPQV-------PRWMDPV 137

Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
            P+F +S      DI++P W+FW  G A   I P     W+ +  ELK+  +   W  + 
Sbjct: 138 IPIFSFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREELKKAADLWPWEKKI 197

Query: 235 PYAYWKG-------NPFVAETRR--DLLTCNLSDKHDWNA 265
           P  Y++G       +P +  +R   DL+    +    W +
Sbjct: 198 PKGYFRGSRTSPERDPLILLSRESPDLVDAEYTKNQAWKS 237


>gi|67901122|ref|XP_680817.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
 gi|40742938|gb|EAA62128.1| hypothetical protein AN7548.2 [Aspergillus nidulans FGSC A4]
 gi|259483867|tpe|CBF79610.1| TPA: DUF821 domain protein (AFU_orthologue; AFUA_2G14740)
           [Aspergillus nidulans FGSC A4]
          Length = 462

 Score = 42.7 bits (99), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 38/152 (25%), Positives = 68/152 (44%), Gaps = 23/152 (15%)

Query: 114 RLILVNNKVYIHKYKQS--IQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIR 169
           R  + + ++YI  Y       TR   T+  + + L  YP R  LPD+E +   DD     
Sbjct: 143 RAAVKDGELYIIDYGPQPYTFTRGKATLHSLHRALSSYPDRHSLPDIEFVLTTDD----- 197

Query: 170 SRDYSGPNNKGPPPLFRYSGDRWTMDI---VFPDWSFWGWAEIN-IKPWESLLRELKEGN 225
              +S   +  P P++ Y+  +   D    + PD+ +W W E++ +  ++ + R +    
Sbjct: 198 ---FSTRTSTNPSPIWAYTKRQEDEDAAIWLMPDFGYWSWPEVDTVGEYKDVRRRIFAKE 254

Query: 226 NGRNWIDREPYAYWKG----NPFVAETRRDLL 253
            G  + D++    W+G    NP   E R+ LL
Sbjct: 255 EGLPFSDKKKQLLWRGSVSANP---EIRKALL 283


>gi|60688257|gb|AAH91408.1| Ktelc1 protein [Rattus norvegicus]
          Length = 349

 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 42/181 (23%), Positives = 76/181 (41%), Gaps = 20/181 (11%)

Query: 70  SDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQ 129
            +Y    +  C  Y   I EDL+P++  GI+R M+    +    R +  + ++  H+  +
Sbjct: 1   ENYEPCSSQNCSCYHGVIEEDLTPFR-GGISRKMMAEVVR----RRLGTHYQIIKHRLFR 55

Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLFRYS 188
                      G+   + +   RLPD+E++ +  D P +       P    P  P+F +S
Sbjct: 56  EDDCMFPSRCSGVEHFILEVIRRLPDMEMVINVRDYPQV-------PKWMEPTIPVFSFS 108

Query: 189 GDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKG 241
                 DI++P W+FW  G A   + P     W+    +L        W  +   AY++G
Sbjct: 109 KTSEYHDIMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRG 168

Query: 242 N 242
           +
Sbjct: 169 S 169


>gi|302511259|ref|XP_003017581.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
 gi|291181152|gb|EFE36936.1| DUF821 domain protein [Arthroderma benhamiae CBS 112371]
          Length = 303

 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 27/136 (19%), Positives = 58/136 (42%), Gaps = 18/136 (13%)

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           K+Y+     +I +R++ T+  + + +   P  LP++E +F+ DD+               
Sbjct: 5   KLYVIAVNGTIYSREIATLHALHRAIVSSPDPLPNIEFVFNTDDKV-------------- 50

Query: 181 PPPLFRYSGDRWTMDI---VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYA 237
             P+ ++   R   D    + PD+ +W W E  +   + +    +   +  +W  + P  
Sbjct: 51  -DPVAQWGYARREQDTNLWLMPDFGYWSWPETKVGTMQEVQMGAEREEHTWSWSKKIPQL 109

Query: 238 YWKGNPFVAETRRDLL 253
           +W+G     E R  L+
Sbjct: 110 FWRGATMDLEVREKLI 125


>gi|358415519|ref|XP_003583131.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
          Length = 460

 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
           T +CPD    I +D + +    + + ++E   +    R  +V+  +  +  Y++S+    
Sbjct: 106 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGKYT 165

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P   GP P+  + G   
Sbjct: 166 DFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 220

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + D++ P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 221 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 268


>gi|91081993|ref|XP_969039.1| PREDICTED: similar to AGAP008037-PA [Tribolium castaneum]
 gi|270007375|gb|EFA03823.1| hypothetical protein TcasGA2_TC013938 [Tribolium castaneum]
          Length = 498

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 24/190 (12%)

Query: 80  CPDYFRWIHEDLSPWKVTG--ITRDMLERANQTAH----FRLILVNNKVYIHKYKQSIQT 133
           C +  R I  DL P++       RD + +     H     R ++ NN++Y   Y + +  
Sbjct: 152 CGNTPRQILADLKPFQTVNWDKLRDKVIKKFDQPHSISLCRYVIKNNEIYRTCYGKYVGF 211

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + +F    +L L RK    LPDLE   +  D P++  +  + P       +F + G   +
Sbjct: 212 K-MFMDAILLSLSRKV--NLPDLEFFINLGDWPLVTEKIETFP-------IFSWCGSTTS 261

Query: 194 MDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKGNPFVAETRRD 251
           +DIV P +      E  ++    ++ ++   +GN   +W +R   A+W+G     + R D
Sbjct: 262 LDIVMPTYDI---TESTLENMGRVMLDMLSVQGNVKESWENRTGQAFWRGRD-SNQHRLD 317

Query: 252 LLTCNLSDKH 261
           L+  +++ KH
Sbjct: 318 LI--DIARKH 325


>gi|358373714|dbj|GAA90310.1| DUF821 domain protein [Aspergillus kawachii IFO 4308]
          Length = 442

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 60/128 (46%), Gaps = 25/128 (19%)

Query: 114 RLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYP--GRLPDLELMFDCDDRPVIR 169
           R I+ + ++Y+  +    +  TR   T+  + + L  +P  G LPD+E         V+ 
Sbjct: 125 RGIIHHGELYVVDFGAMPATFTRGKSTLHALHRALASFPDRGSLPDIEF--------VLT 176

Query: 170 SRDYSGPNNKGPPPLFRYS-----GDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEG 224
           + DYS        P++ YS     G+ W M    PD+ +W W E+ I P++ + R +   
Sbjct: 177 TEDYSIHQG----PVWSYSKRDEQGNVWLM----PDFGYWSWPEVKIGPYKDIRRRIAAV 228

Query: 225 NNGRNWID 232
           ++G   +D
Sbjct: 229 DDGEVTLD 236


>gi|297482669|ref|XP_002693013.1| PREDICTED: KDEL motif-containing protein 2 [Bos taurus]
 gi|296480318|tpg|DAA22433.1| TPA: KDEL (Lys-Asp-Glu-Leu) containing 2 [Bos taurus]
          Length = 508

 Score = 42.0 bits (97), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
           T +CPD    I +D + +    + + ++E   +    R  +V+  +  +  Y++S+    
Sbjct: 154 TLSCPDKEPQIEKDFASFPSINLQQMLIEVPKRFGDERGAIVHYTILNNNIYRRSLGKYT 213

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P   GP P+  + G   
Sbjct: 214 DFKMFSDEILLSLARKV--LLPDLEFYVNLGDWPLEHRKVNETP---GPLPIISWCGSLD 268

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + D++ P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 269 SQDVILPTYDITHSTLEALRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 316


>gi|119481815|ref|XP_001260936.1| hypothetical protein NFIA_089970 [Neosartorya fischeri NRRL 181]
 gi|119409090|gb|EAW19039.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 444

 Score = 42.0 bits (97), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 37/153 (24%), Positives = 67/153 (43%), Gaps = 18/153 (11%)

Query: 81  PDYFRWIHEDLSPWKVTGIT-RDMLERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVF 137
           P  F  I + +S  K   IT +++ +   +    R  +   ++YI  Y     T  R   
Sbjct: 89  PKLFVEIEKTVSLRKERPITFKEVDDVVVEDGMVRAAIWRGELYILDYAAQPYTYSRAKA 148

Query: 138 TIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           T+  + + L+ +P R  LPD+E +F  DD   +            P P++ YS       
Sbjct: 149 TLNSLHRALQSFPDRHSLPDIEFVFTADDFSNV------------PGPVWSYSKRDEDES 196

Query: 196 I-VFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
           I + PD+ +W W E+ + P++ + R +   + G
Sbjct: 197 IWLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEG 229


>gi|395519040|ref|XP_003763661.1| PREDICTED: protein O-glucosyltransferase 1 [Sarcophilus harrisii]
          Length = 335

 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 22/188 (11%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA-NQTAHFRLILVNNKV 122
           Q  +S+ +Y       C  +   + +DL+P++  GI+++M+    N+       ++ N++
Sbjct: 37  QINKSVENYQSCSKGNCSCHLGVMEKDLAPFQ-GGISKEMMANVVNRKLGTHYQIIKNEL 95

Query: 123 YIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP 182
           Y  ++      R      G+   + +    LPD+E++ +  D P I       P    P 
Sbjct: 96  Y-REHDCMFPAR----CSGVEHFILEIINHLPDMEMVINVRDYPQI-------PKWMEPI 143

Query: 183 -PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
            P+F +S      DI++P W+FW  G A   I P     W+ L  +L        W  + 
Sbjct: 144 IPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWEKKV 203

Query: 235 PYAYWKGN 242
              Y++G+
Sbjct: 204 SKGYFRGS 211


>gi|195331145|ref|XP_002032263.1| GM23616 [Drosophila sechellia]
 gi|194121206|gb|EDW43249.1| GM23616 [Drosophila sechellia]
          Length = 404

 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 40/168 (23%), Positives = 72/168 (42%), Gaps = 19/168 (11%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           C  Y   I ++L P+   GIT +M+ ++ +   F  I+      I++ ++ +  +    +
Sbjct: 63  CLCYVATIQKNLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
                LL      +PDLE + +  D P +     SG +     P+  YS     +DI++P
Sbjct: 120 E---DLLLDMASGVPDLEFVLNVRDWPQVHF--LSGLSG----PVLSYSITDRHLDIMYP 170

Query: 200 DWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
            WSFW     N  P   +L+    G    +W+ +   A     P+ A+
Sbjct: 171 AWSFW----TNTGP---ILQHYPHGVGRWDWMRKHLVARASETPWNAK 211


>gi|334329589|ref|XP_001370099.2| PREDICTED: protein O-glucosyltransferase 1 [Monodelphis domestica]
          Length = 383

 Score = 41.6 bits (96), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 84/190 (44%), Gaps = 26/190 (13%)

Query: 64  QTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDM---LERANQTAHFRLILVNN 120
           Q  +S+ +Y   P   C  +   + +DL+P++  GI++++   L       H+++I   N
Sbjct: 29  QINKSVENYQPCPRENCSCHLGVMEKDLAPFQ-GGISKEVMTTLVSRKLGTHYQII--KN 85

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           ++Y  ++      R       ILQ++      LPD+E++ +  D P +       P    
Sbjct: 86  ELY-REHDCMFPARCSGVEHFILQVI----NHLPDMEMVINVRDYPQV-------PKWME 133

Query: 181 PP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWID 232
           P  P+F +S      DI++P W+FW  G A   I P     W+ L  +L        W  
Sbjct: 134 PAIPVFSFSKTNEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDLLREDLARSAEKWPWER 193

Query: 233 REPYAYWKGN 242
           +    Y++G+
Sbjct: 194 KISRGYFRGS 203


>gi|195572998|ref|XP_002104482.1| GD18425 [Drosophila simulans]
 gi|194200409|gb|EDX13985.1| GD18425 [Drosophila simulans]
          Length = 404

 Score = 41.6 bits (96), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 57/125 (45%), Gaps = 12/125 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           C  Y   I ++L P+   GIT +M+ ++ +   F  I+      I++ ++ +  +    +
Sbjct: 63  CICYVATIQKNLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
                LL      +PDLE + +  D P +     SG +     P+  YS     +DI++P
Sbjct: 120 E---DLLLDMASGVPDLEFVLNVRDWPQVHF--LSGLSG----PVLSYSITDRHLDIMYP 170

Query: 200 DWSFW 204
            WSFW
Sbjct: 171 AWSFW 175


>gi|350633295|gb|EHA21660.1| hypothetical protein ASPNIDRAFT_184185 [Aspergillus niger ATCC
           1015]
          Length = 452

 Score = 41.6 bits (96), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 97/232 (41%), Gaps = 52/232 (22%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTST-------------CPDYFRWIHED----LSPWKV- 96
           CP   P      +++S  S PPT T             C   F  + ED     S W   
Sbjct: 64  CPAQLP----FHDTVSSSSAPPTRTFFDPLEYSYNETQCTTLFPGLFEDPIRAQSFWTAR 119

Query: 97  TGITR---DMLERANQTAHFRLILVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPG 151
            GI R   D ++ A+  A  R+ +   ++Y+ +   K     R +    GIL  + +   
Sbjct: 120 HGIKRIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRRKIL---GILASIHRALV 174

Query: 152 RLPDL------ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFW 204
             P L      E++F  +D+      D +GP++    PL+  +       + + PD+ FW
Sbjct: 175 SAPQLAVILNTEIIFSVEDK----LEDVAGPDH----PLWVLARKATEESVWLMPDFGFW 226

Query: 205 GWAEIN--IKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
            W  I+  I P++ +++ ++E  +   W ++E    W+G   F  + RR LL
Sbjct: 227 SWGHIDTQIGPYDEVVKHVEEHEHP--WDEKENKLVWRGKLSFAPKLRRALL 276


>gi|302834814|ref|XP_002948969.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
           nagariensis]
 gi|300265714|gb|EFJ49904.1| hypothetical protein VOLCADRAFT_89378 [Volvox carteri f.
           nagariensis]
          Length = 646

 Score = 41.6 bits (96), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 48/187 (25%), Positives = 80/187 (42%), Gaps = 29/187 (15%)

Query: 79  TCPDY---FRWIHEDLSPWKV-TGITRDMLERAN--QTAHFR-----LILVNNKVYIHKY 127
            C +Y   +  IH DL  +K   GI+ +++ R     +A +R     + +   KVYI   
Sbjct: 30  VCDNYEPLYASIHRDLDIYKRNAGISPNLIYRTMVLHSAGYREKGLAVAVYRGKVYIISN 89

Query: 128 KQSIQTRD--------VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNK 179
            +SI  +         V  I  +L L  KY   LPD+E ++   DRP+       G  N 
Sbjct: 90  TRSISLKRFGHHVALWVAYIKVLLDLEEKYGSYLPDVEFVWHTIDRPIRLVNTTPGGENF 149

Query: 180 GPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWE-SLLRELKEGNNGRNWIDREPYAY 238
              P+FR+       DI+ P++ F+      +KP++   L  +   N    W  R P  +
Sbjct: 150 ---PVFRFCKSVVHPDILVPNFHFY------MKPYQREFLDRIPHFNAEVPWAQRRPIVF 200

Query: 239 WKGNPFV 245
            + + +V
Sbjct: 201 ARFSGYV 207


>gi|255088471|ref|XP_002506158.1| predicted protein [Micromonas sp. RCC299]
 gi|226521429|gb|ACO67416.1| predicted protein [Micromonas sp. RCC299]
          Length = 658

 Score = 41.6 bits (96), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 49/197 (24%), Positives = 86/197 (43%), Gaps = 27/197 (13%)

Query: 81  PDYFRW---IHEDLSPW-KVTGITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQTR 134
           P+ F W   ++  + P+ K++    D  E   + +   FR  ++  ++++   +     R
Sbjct: 197 PEIFDWSPFVYRAMFPYNKISKADVDTAELVARGSVNSFRAQIIGGRLFVKDIRALEFAR 256

Query: 135 DVFTIWGI--LQLLRKYPGRLPDLELMFDCDDRPVI----------RSRDYSG-PNNKGP 181
           D    W I  L+ +R+    LPD++ +F+  D P++          R     G  N + P
Sbjct: 257 DYAPSWKITLLETMRRR-RDLPDIDAVFNEGDYPIVLLPNDGAHAQRLYGREGMSNGQKP 315

Query: 182 PPLFRYSGD-RWTMDIVFPDWSFW-----GWAEINIKPWESLLRELKEGNNGRNWIDREP 235
           PPLF  + + + T D+ FPD+SF      G   ++   W      L E      + D+ P
Sbjct: 316 PPLFSPTTNVQMTRDVPFPDFSFSPPGVKGADRLSTTRWSVAHGRLLEAGAKIPFEDKLP 375

Query: 236 YAYWKGNPFVAETRRDL 252
            A + GN   AE R+ L
Sbjct: 376 LAAFTGN-TQAEPRQRL 391


>gi|260827316|ref|XP_002608611.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
 gi|229293962|gb|EEN64621.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae]
          Length = 513

 Score = 41.6 bits (96), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 78/184 (42%), Gaps = 19/184 (10%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQ-TAHFRL---ILVNNKVYIHKYKQSIQTRD 135
           CP  F  I  DL  +    + R   E  ++   H  L    + +NK++   + Q    + 
Sbjct: 162 CPSAFPQIERDLEIFPKINLNRLSKEAVDRFGTHHSLCHYTVKDNKIHRKCHGQHTGFK- 220

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +F    +  + RK   R+PD+E   +  D P+ + +        GP P+  + G   T D
Sbjct: 221 MFMDATLHSITRKV--RIPDIEFFVNLGDWPLEKRQ-----VKDGPLPILSWCGSEETRD 273

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTC 255
           IV P +     + +      SL     +GN G  W+++   A W+G     ++RR+ L  
Sbjct: 274 IVMPTYDL-TESTLETMGRVSLDMLSVQGNTGPRWVNKTEQALWRGR----DSRRERL-- 326

Query: 256 NLSD 259
           NL D
Sbjct: 327 NLVD 330


>gi|145251704|ref|XP_001397365.1| endoplasmic reticulum-resident kdel protein [Aspergillus niger CBS
           513.88]
 gi|134082901|emb|CAK42731.1| unnamed protein product [Aspergillus niger]
          Length = 456

 Score = 41.6 bits (96), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 97/232 (41%), Gaps = 52/232 (22%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTST-------------CPDYFRWIHED----LSPWKV- 96
           CP   P      +++S  S PPT T             C   F  + ED     S W   
Sbjct: 68  CPAQLP----FHDTVSSSSAPPTRTFFDPLEYSYNETQCTTLFPGLFEDPIRAQSFWTAR 123

Query: 97  TGITR---DMLERANQTAHFRLILVNNKVYIHKY--KQSIQTRDVFTIWGILQLLRKYPG 151
            GI R   D ++ A+  A  R+ +   ++Y+ +   K     R +    GIL  + +   
Sbjct: 124 HGIKRIDIDNIKMADGMA--RVAIYQGRLYVLRALAKGEDHRRKIL---GILASIHRALV 178

Query: 152 RLPDL------ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFW 204
             P L      E++F  +D+      D +GP++    PL+  +       + + PD+ FW
Sbjct: 179 SAPQLAVILNTEIIFSVEDK----LEDVAGPDH----PLWVLARKATEESVWLMPDFGFW 230

Query: 205 GWAEIN--IKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
            W  I+  I P++ +++ ++E  +   W ++E    W+G   F  + RR LL
Sbjct: 231 SWGHIDTQIGPYDEVVKHVEEHEHP--WDEKENKLVWRGKLSFAPKLRRALL 280


>gi|91082811|ref|XP_968605.1| PREDICTED: similar to endoplasmic reticulum-resident kdel protein
           [Tribolium castaneum]
          Length = 362

 Score = 41.2 bits (95), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 90/193 (46%), Gaps = 29/193 (15%)

Query: 66  QESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIH 125
           + +  +YS    + C  Y   I +DL  +K  GIT  ++++  +T   +  ++++K+Y  
Sbjct: 18  KRAKENYSPCDNTKCGCYSSQISDDLKIFK-KGITPQLIDKV-KTKGTKYQIIDHKLY-- 73

Query: 126 KYKQSIQTRDVFTIW-----GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
                   RD   ++     GI   L K   +LPD+EL+ +  D P I  +DY       
Sbjct: 74  --------RDKNCMFPARCAGIEHFLLKLLPKLPDMELIINTRDWPQIH-KDYGVFG--- 121

Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
             P+F +S      DI++P W+FW G   I++ P     W++    L +  N   W ++ 
Sbjct: 122 --PVFSFSKTSDYSDIMYPAWAFWEGGPAISLYPRGIGRWDTHRDLLGKKGNETLWDEKI 179

Query: 235 PYAYWKGNPFVAE 247
           P  +++G+   AE
Sbjct: 180 PKGFFRGSRTSAE 192


>gi|317147538|ref|XP_001822208.2| hypothetical protein AOR_1_1442014 [Aspergillus oryzae RIB40]
 gi|391873092|gb|EIT82167.1| hypothetical protein Ao3042_00673 [Aspergillus oryzae 3.042]
          Length = 438

 Score = 41.2 bits (95), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 17/129 (13%)

Query: 109 QTAHFRLILVNNKVYIHKYKQ--SIQTRDVFTIWGILQLLRKYPGR--LPDLELMFDCDD 164
           +    R I+ + ++YI  +    +  TR   T+  + + L  +P R  LP++E       
Sbjct: 116 ENGMVRGIIDHGELYIVDFGNMPATFTRGKATLNSLHRALASFPDRDRLPNVEF------ 169

Query: 165 RPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGWAEINIKPWESLLRELKE 223
             V+ + DYS     G  P++ YS      ++ + PD+ +W W E+ + P++   R +  
Sbjct: 170 --VLTTEDYS----SGEGPIWSYSKREENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAA 223

Query: 224 GNNGRNWID 232
            ++G   +D
Sbjct: 224 IDDGEVTVD 232


>gi|25009725|gb|AAN71037.1| AT07872p [Drosophila melanogaster]
          Length = 397

 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 12/125 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           C  +   I  DL P+   GIT +M+ ++ +   F  I+      I++ ++ +  +    +
Sbjct: 63  CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
                LL      + DLE + +  D P +     SG +     P+F YS     +DI++P
Sbjct: 120 E---DLLLDMASGVADLEFVLNVRDWPQVHF--LSGLSG----PVFSYSITNRHLDIMYP 170

Query: 200 DWSFW 204
            WSFW
Sbjct: 171 AWSFW 175


>gi|24649142|ref|NP_732797.1| CG31139, isoform A [Drosophila melanogaster]
 gi|23171999|gb|AAF56061.2| CG31139, isoform A [Drosophila melanogaster]
 gi|262051015|gb|ACY07068.1| FI12101p [Drosophila melanogaster]
          Length = 397

 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 12/125 (9%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTI 139
           C  +   I  DL P+   GIT +M+ ++ +   F  I+      I++ ++ +  +    +
Sbjct: 63  CLCHVATIQRDLEPYVDKGITPEMMAQSKRLGTFYQIIRGR---IYRQQKCLHPKRCADV 119

Query: 140 WGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFP 199
                LL      + DLE + +  D P +     SG +     P+F YS     +DI++P
Sbjct: 120 E---DLLLDMASGVADLEFVLNVRDWPQVHF--LSGLSG----PVFSYSITNRHLDIMYP 170

Query: 200 DWSFW 204
            WSFW
Sbjct: 171 AWSFW 175


>gi|119190301|ref|XP_001245757.1| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
 gi|392868638|gb|EAS34429.2| hypothetical protein CIMG_05198 [Coccidioides immitis RS]
          Length = 463

 Score = 40.8 bits (94), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 54/264 (20%), Positives = 101/264 (38%), Gaps = 35/264 (13%)

Query: 15  FSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCPTNYPKT------------ 62
           F +++   N   ++    P      + +T  NC        PT +               
Sbjct: 35  FGMNSDGDNVHPLLTQLIPAGHCTCKASTTFNCTSCVDSITPTTFETASPSLSPSSPAWT 94

Query: 63  ---SQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLIL 117
               +   ++   S    S+ P  F  +H  +  W    GI ++ L+    +    R ++
Sbjct: 95  FVYGRDDRNVGLTSSQCQSSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMI 154

Query: 118 VNNKVYIHKYKQSIQTRDVFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGP 176
            N  +Y+ + + + Q      I G+L  + R     +PD+E +F  +DR      D SG 
Sbjct: 155 FNGNLYVIETRAA-QEDHRQKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGT 209

Query: 177 NNKGPPPLFRYSGDRWTMDIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNW 230
                 PL+   G + + + V   PD+ FW W     NI P++ ++  ++  +  +   W
Sbjct: 210 GQ----PLWVL-GRKASEESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPW 264

Query: 231 IDREPYAYWKGN-PFVAETRRDLL 253
             +     W+G   F  + RR LL
Sbjct: 265 SSKTQKLLWRGKLSFSPKLRRGLL 288


>gi|195143521|ref|XP_002012746.1| GL23775 [Drosophila persimilis]
 gi|194101689|gb|EDW23732.1| GL23775 [Drosophila persimilis]
          Length = 419

 Score = 40.8 bits (94), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 44/190 (23%), Positives = 76/190 (40%), Gaps = 23/190 (12%)

Query: 73  SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
           S P    C  +   I  DL P+   GITR M+ ++      RL +V   +    Y+Q  +
Sbjct: 56  SQPSDPDCECHALTIRRDLGPYAEAGITRSMMAQSR-----RLGVVYQVIDGRIYRQP-E 109

Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIR-SRDYSGPNNKGPPPLFRYSGDR 191
                    +  +L    G LP++E + +  D P +     ++G       P+F +S   
Sbjct: 110 VPHPKRCADVEDMLLGIAGELPNVEFILNVRDWPQVPFLSGFTG-------PVFSHSVSH 162

Query: 192 WTMDIVFPDWSFWGWAE-------INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
             +DI+ P WSF   +          I  W  + R +       +W  ++P  +++G   
Sbjct: 163 QHLDIMCPAWSFSSVSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGT-- 220

Query: 245 VAETRRDLLT 254
            + T RD L 
Sbjct: 221 RSSTERDTLV 230


>gi|18606235|gb|AAH23141.1| Kdelc1 protein [Mus musculus]
 gi|148664477|gb|EDK96893.1| KDEL (Lys-Asp-Glu-Leu) containing 1, isoform CRA_c [Mus musculus]
          Length = 282

 Score = 40.8 bits (94), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 33/130 (25%), Positives = 56/130 (43%), Gaps = 16/130 (12%)

Query: 80  CPDYFRWIHEDLS------PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQT 133
           C +    I +DL+      P K+         +     H+   L +NKVYI  + + +  
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHY--TLKDNKVYIKTHGEHVGF 210

Query: 134 RDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           R +F    +L L RK   R+PD+E   +  D P+ + +     +N    P+F + G   +
Sbjct: 211 R-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTES 262

Query: 194 MDIVFPDWSF 203
            DIV P +  
Sbjct: 263 RDIVMPTYDL 272


>gi|348506788|ref|XP_003440939.1| PREDICTED: protein O-glucosyltransferase 1-like [Oreochromis
           niloticus]
          Length = 366

 Score = 40.8 bits (94), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 41/184 (22%), Positives = 80/184 (43%), Gaps = 20/184 (10%)

Query: 67  ESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK 126
           +++  Y+      C  +   +  DL  +K  GI++D++    Q    R +  + ++  HK
Sbjct: 15  DAVKGYTPCEPVNCSCHLSVLQHDLQTFK-GGISQDVMAATIQ----RGVGTHYQIIGHK 69

Query: 127 YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPP-PLF 185
             +           G+   + +   +LPDLE++ +  D P +       P+   P  P+F
Sbjct: 70  LYREQNCMFPARCSGVEHFILEVIDKLPDLEMVVNVRDYPQV-------PHWVQPTLPVF 122

Query: 186 RYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAY 238
            +S      DI++P W+FW  G A   I P     W+ +  +LK+      W  +E   +
Sbjct: 123 SFSKTSDYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMRDDLKKSAAQWPWKKKESKGF 182

Query: 239 WKGN 242
           ++G+
Sbjct: 183 FRGS 186


>gi|148922833|ref|NP_001092215.1| KDEL motif-containing protein 2 [Danio rerio]
 gi|148744719|gb|AAI42840.1| Zgc:165521 protein [Danio rerio]
          Length = 518

 Score = 40.4 bits (93), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 75/166 (45%), Gaps = 11/166 (6%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLER----ANQTAHFRLILVNNKVYIHKYKQSIQTRD 135
           CP     I +D S +    + + + E     A +       ++NN+V+     +    + 
Sbjct: 165 CPQEDPQIQKDFSSFPSIDLQQLLQEVPTRFAKRGGLIHYTILNNQVHRRSLGRYTDFK- 223

Query: 136 VFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD 195
           +F+   +L L RK   +LPD+E   +  D P + +R  +  +N GP P+  + G   T D
Sbjct: 224 MFSDEILLSLARKV--KLPDVEFYINVGDWP-MENRKVN--DNPGPVPVISWCGSTETRD 278

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           I+ P +     +   ++   + L  + +GN G  W D+   A+++G
Sbjct: 279 IILPTYDITHSSLEAMRGVTNDLLSV-QGNTGPTWSDKMNKAFFRG 323


>gi|321474111|gb|EFX85077.1| hypothetical protein DAPPUDRAFT_314403 [Daphnia pulex]
          Length = 502

 Score = 40.4 bits (93), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 49/212 (23%), Positives = 87/212 (41%), Gaps = 31/212 (14%)

Query: 48  VKNQTQTCPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA 107
           V  ++  CP +        E I+DY       C      I  DL+ ++     + + E  
Sbjct: 129 VYAESCDCPLD-----SIDEMINDYQ------CSVNVAQIDHDLNQFQNVDFNQVLTEAM 177

Query: 108 NQTAHF------RLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
            + +H          +++NKVY   Y Q +   ++F    +L L RK    LPD+E + +
Sbjct: 178 KRFSHAGSYSFCHYAVLDNKVYRRCYGQHVGF-NMFMDNILLSLSRK--AVLPDMEFLIN 234

Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
             D P+++       N     P+F + G   T DIV P +     A +      +L    
Sbjct: 235 LGDWPLVKK------NILPIIPIFSWCGSTQTADIVMPTYDI-TEASLECMGRVTLDMLS 287

Query: 222 KEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
            + N    W +++  A+W+G     ++RR+ L
Sbjct: 288 VQSNPDTKWENKQEKAFWRGR----DSRRERL 315


>gi|442620545|ref|NP_001262849.1| rumi, isoform B [Drosophila melanogaster]
 gi|440217767|gb|AGB96229.1| rumi, isoform B [Drosophila melanogaster]
          Length = 316

 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/128 (21%), Positives = 54/128 (42%), Gaps = 10/128 (7%)

Query: 121 KVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKG 180
           K+Y H+  +           GI   L      LPD++L+ +  D P + +       N  
Sbjct: 13  KIYGHRLYRDANCMFPARCEGIEHFLLPLVATLPDMDLIINTRDYPQLNA----AWGNAA 68

Query: 181 PPPLFRYSGDRWTMDIVFPDWSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDRE 234
             P+F +S  +   DI++P W+FW G     + P     W+ +  +L++      W  + 
Sbjct: 69  GGPVFSFSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKR 128

Query: 235 PYAYWKGN 242
              +++G+
Sbjct: 129 SLGFFRGS 136


>gi|303315019|ref|XP_003067517.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240107187|gb|EER25372.1| hypothetical protein CPC735_064720 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 463

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 20/186 (10%)

Query: 78  STCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYIHKYKQSIQTRD 135
           S+ P  F  +H  +  W    GI ++ L+    +    R ++ N  +Y+ + + + Q   
Sbjct: 113 SSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYVIETRAA-QEDH 171

Query: 136 VFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
              I G+L  + R     +PD+E +F  +DR      D SG       PL+   G + + 
Sbjct: 172 RRKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGTGQ----PLWVL-GRKASE 222

Query: 195 DIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNWIDREPYAYWKGN-PFVAE 247
           + V   PD+ FW W     NI P++ ++  ++  +  +   W  +     W+G   F  +
Sbjct: 223 ESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLSFSPK 282

Query: 248 TRRDLL 253
            RR LL
Sbjct: 283 LRRGLL 288


>gi|320035743|gb|EFW17684.1| hypothetical protein CPSG_06127 [Coccidioides posadasii str.
           Silveira]
          Length = 463

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 20/186 (10%)

Query: 78  STCPDYFRWIHEDLSPWKV-TGITRDMLERAN-QTAHFRLILVNNKVYIHKYKQSIQTRD 135
           S+ P  F  +H  +  W    GI ++ L+    +    R ++ N  +Y+ + + + Q   
Sbjct: 113 SSFPGLFEDVHRAVRFWASRDGIQKESLDGIKLRGGMARAMIFNGNLYVIETRAA-QEDH 171

Query: 136 VFTIWGILQLL-RKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
              I G+L  + R     +PD+E +F  +DR      D SG       PL+   G + + 
Sbjct: 172 RRKILGVLSSIHRALQDNVPDIEFIFSIEDR----VDDVSGTGQ----PLWVL-GRKASE 222

Query: 195 DIV--FPDWSFWGWA--EINIKPWESLLRELK--EGNNGRNWIDREPYAYWKGN-PFVAE 247
           + V   PD+ FW W     NI P++ ++  ++  +  +   W  +     W+G   F  +
Sbjct: 223 ESVWLIPDFGFWAWGNPSSNIGPYDQVVETIERIDSEDRMPWSSKTQKLLWRGKLSFSPK 282

Query: 248 TRRDLL 253
            RR LL
Sbjct: 283 LRRGLL 288


>gi|291383938|ref|XP_002708527.1| PREDICTED: KDEL (Lys-Asp-Glu-Leu) containing 2 [Oryctolagus
           cuniculus]
          Length = 509

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 77/171 (45%), Gaps = 14/171 (8%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHK-YKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  +  ++ Y++S+    
Sbjct: 155 TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNRIYRRSLGKYT 214

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   R    P   GP P+  + G + 
Sbjct: 215 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRRVNETP---GPIPIISWCGSQD 269

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
           + DI+ P +         ++    +  +L   +GN G +WI++   A+++G
Sbjct: 270 SRDIILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 317


>gi|345493843|ref|XP_003427163.1| PREDICTED: KDEL motif-containing protein 1-like [Nasonia
           vitripennis]
          Length = 507

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 74/165 (44%), Gaps = 24/165 (14%)

Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
           D   + N  +    ++ +N++Y + Y Q +  + +F+   +L L RK    LPD+E   +
Sbjct: 188 DRYNKPNSVSICHYVIKSNQIYRNCYGQHVGFK-IFSDAILLSLARKI--NLPDVEFFMN 244

Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
             D P++       P NK   P+F + G   + DIV P +      +I     E++ R +
Sbjct: 245 LGDWPLV-------PKNKEIHPIFSWCGSDDSYDIVLPTY------DITQSSMEAMGRVM 291

Query: 222 -----KEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKH 261
                 +G+    W  +    +W+G     E R DL+  +++ KH
Sbjct: 292 LDMLSVQGSTTDPWNKKIEKMFWRGRDARRE-RLDLI--DIARKH 333


>gi|449283857|gb|EMC90451.1| KTEL motif-containing protein 1, partial [Columba livia]
          Length = 364

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 44/195 (22%), Positives = 85/195 (43%), Gaps = 32/195 (16%)

Query: 62  TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERA---NQTAHFRLILV 118
           T Q ++++  Y       C  +     +DL+P++  GI+++++          H+++  V
Sbjct: 8   TDQIKKAVEVYKPCVKENCSCHQSVWKQDLAPFR-GGISKEIISDVVSRKLGTHYQI--V 64

Query: 119 NNKVYIHKYKQSIQTRDVF---TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSG 175
            NK+Y        + +D        G+   L +   RLPD+E++ +  D P +       
Sbjct: 65  KNKLY--------REQDCLFPARCSGVEHFLLEIISRLPDMEMVINVRDYPQV------- 109

Query: 176 PNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNG 227
           P    P  P+F +S      DI++P W+FW  G A   I P     W+ +  +L+     
Sbjct: 110 PKWMKPVIPIFSFSKTPEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEK 169

Query: 228 RNWIDREPYAYWKGN 242
             W+ +    Y++G+
Sbjct: 170 WPWMKKISKGYFRGS 184


>gi|348553234|ref|XP_003462432.1| PREDICTED: KDEL motif-containing protein 2-like [Cavia porcellus]
          Length = 692

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 76/169 (44%), Gaps = 14/169 (8%)

Query: 79  TCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRDVF 137
           +CP +   I ED + +    + + + E   +    R  +V+  +   H Y++S+     F
Sbjct: 341 SCPTWEPQIEEDFASFPTINLQQMLSEVPKRFGDERGAIVHYTILNNHIYRRSLGKYTDF 400

Query: 138 TIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTM 194
            ++    +L L RK   RLPDLE   +  D P+   +    P+   P P+  + G   + 
Sbjct: 401 KMFSDEILLSLARKV--RLPDLEFYINLGDWPLEHRKINETPS---PVPVISWCGSLDSR 455

Query: 195 DIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
           D++ P +         ++    +  +L   +GN G +WI++   A+++G
Sbjct: 456 DVILPTYDV---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 501


>gi|345561463|gb|EGX44552.1| hypothetical protein AOL_s00188g220 [Arthrobotrys oligospora ATCC
           24927]
          Length = 437

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 33/157 (21%), Positives = 63/157 (40%), Gaps = 11/157 (7%)

Query: 99  ITRDMLERANQTA--HFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDL 156
           IT D LE    +A  +FR I+ N ++Y+ K      +R   T+  + + +   P  LP++
Sbjct: 115 ITLDELESVTHSAFGYFRGIIYNQELYVIKMLDPNFSRGFATLMAMHRAIVTSPEPLPNI 174

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWES 216
           E   +        + DY           +    +  T++ + PD+ +W W E  +  +  
Sbjct: 175 EFTLN--------TADYIDFEQSAATWTYARRSNE-TVNWLMPDFGYWSWPEPKVGSYNE 225

Query: 217 LLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
           +  + +  +    W  + P   W+G       R  LL
Sbjct: 226 VRLKARLADEAIPWEKKIPKIVWRGATLKLPVRLALL 262


>gi|260810659|ref|XP_002600074.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
 gi|229285359|gb|EEN56086.1| hypothetical protein BRAFLDRAFT_280689 [Branchiostoma floridae]
          Length = 351

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 33/188 (17%)

Query: 76  PTSTCPDYFRWIHEDLSPWK-VTGITRDMLERA---NQTAHFRLILVNNKVYIHKYKQSI 131
           P   C  +   I  DL  WK   GIT++  +R    ++  H+++I  ++K+Y        
Sbjct: 10  PDKKCSCHSSVITHDLKLWKDRGGITKEEFDRTLEQDKGTHYQII--DHKLYR------- 60

Query: 132 QTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS- 188
           Q R +F     G+   + +    LPD+E++ +  D P       S    + P P+  +S 
Sbjct: 61  QDRCMFPARCSGVEHFILEIIDDLPDMEMIINVRDWP------QSPMYQREPRPVMSFSK 114

Query: 189 --GDRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNWIDREPYAYW 239
              + W  DI++P W+FW             +  W+ +   + + N    W  +E  A++
Sbjct: 115 TFNEHW--DIMYPAWTFWEGGPAVWPLFPTGLGRWDLMRDSITKANEDWPWEKKEDKAFF 172

Query: 240 KGNPFVAE 247
           +G+   AE
Sbjct: 173 RGSRTSAE 180


>gi|260810693|ref|XP_002600088.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
 gi|229285373|gb|EEN56100.1| hypothetical protein BRAFLDRAFT_280682 [Branchiostoma floridae]
          Length = 351

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 44/188 (23%), Positives = 81/188 (43%), Gaps = 33/188 (17%)

Query: 76  PTSTCPDYFRWIHEDLSPWK-VTGITRDMLERA---NQTAHFRLILVNNKVYIHKYKQSI 131
           P   C  +   I  DL  WK   GIT++  +R    ++  H+++I  ++K+Y        
Sbjct: 10  PDKKCSCHSSVITHDLKLWKDRGGITKEEFDRTLEQDKGTHYQII--DHKLYR------- 60

Query: 132 QTRDVF--TIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS- 188
           Q R +F     G+   + +    LPD+E++ +  D P       S    + P P+  +S 
Sbjct: 61  QDRCMFPARCSGVEHFILEIIDDLPDMEMIINVRDWP------QSPMYQREPRPVMSFSK 114

Query: 189 --GDRWTMDIVFPDWSFWGWAEI-------NIKPWESLLRELKEGNNGRNWIDREPYAYW 239
              + W  DI++P W+FW             +  W+ +   + + N    W  +E  A++
Sbjct: 115 TFNEHW--DIMYPAWTFWEGGPAVWPLFPTGLGRWDLMRDSITKANEDWPWEKKEDKAFF 172

Query: 240 KGNPFVAE 247
           +G+   AE
Sbjct: 173 RGSRTSAE 180


>gi|322691155|ref|YP_004220725.1| hypothetical protein BLLJ_0966 [Bifidobacterium longum subsp.
           longum JCM 1217]
 gi|320456011|dbj|BAJ66633.1| hypothetical protein BLLJ_0966 [Bifidobacterium longum subsp.
           longum JCM 1217]
          Length = 1012

 Score = 39.3 bits (90), Expect = 1.9,   Method: Composition-based stats.
 Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 22/127 (17%)

Query: 149 YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAE 208
           +PGR+ D+ ++   D RP +RS              FR  G     D++  DW   G A 
Sbjct: 267 FPGRVTDVRIVASRDGRPAVRS--------------FRIDGSNADWDMISRDW-LKGAAP 311

Query: 209 INIKPW---ESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNA 265
                W   E++L     G  G  W+  +  A W    +  ET R L++  L+ +HD   
Sbjct: 312 GAFGDWMNTENILVTADPG-GGHGWVSVD--ADWDSYEWDVETMRPLMSDTLTRRHDDPV 368

Query: 266 RLYVQVI 272
           R Y+++I
Sbjct: 369 R-YMRLI 374


>gi|196003266|ref|XP_002111500.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
 gi|190585399|gb|EDV25467.1| hypothetical protein TRIADDRAFT_55584 [Trichoplax adhaerens]
          Length = 506

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 13/140 (9%)

Query: 102 DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFD 161
           D   + +   H+ +I  +NKVY + Y + +  + +F    +L L RK   +LPDLE + +
Sbjct: 187 DRFGKHHAICHYSVI--SNKVYRNCYGEHVGFK-MFMDAILLSLARKV--KLPDLEFISN 241

Query: 162 CDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL 221
             D P+   ++          P+F + G   T DIV P +     + + +    S+    
Sbjct: 242 LGDWPLENQQN-------NLIPIFSWCGSETTSDIVMPTYDL-TQSTLEMMGRVSVDVLA 293

Query: 222 KEGNNGRNWIDREPYAYWKG 241
            +G+ G  W D+    +W+G
Sbjct: 294 VQGSTGPKWKDKIEKGFWRG 313


>gi|83770071|dbj|BAE60206.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 460

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 26/105 (24%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
           TR   T+  + + L  +P R  LP++E         V+ + DYS     G  P++ YS  
Sbjct: 164 TRGKATLNSLHRALASFPDRDRLPNVEF--------VLTTEDYS----SGEGPIWSYSKR 211

Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDRE 234
               ++ + PD+ +W W E+ + P++   R +   ++G   +D +
Sbjct: 212 EENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAAIDDGEVTVDGQ 256


>gi|60302736|ref|NP_001012560.1| KTEL motif-containing protein 1 precursor [Gallus gallus]
 gi|60098623|emb|CAH65142.1| hypothetical protein RCJMB04_4b16 [Gallus gallus]
          Length = 392

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 87/198 (43%), Gaps = 28/198 (14%)

Query: 62  TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR----DMLERANQTAHFRLIL 117
           T Q   ++  Y       C  +     +DL+P++ +GI++    D++ R   T H+++I 
Sbjct: 36  TDQINRAVEVYKPCVKENCSCHQSVWKQDLAPFR-SGISKETISDVVSRKLGT-HYQII- 92

Query: 118 VNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPN 177
             NK+Y  ++      R      G+   +R    RLPD+E++ +  D P +       P 
Sbjct: 93  -KNKLY-REHDCMFPAR----CSGVEHFIRGIINRLPDMEMVINVRDYPQV-------PK 139

Query: 178 NKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNNGRN 229
              P  P+F +S      DI++P W+FW  G A   I P     W+ +  +L+       
Sbjct: 140 WMKPIIPVFSFSKTAEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAEKWP 199

Query: 230 WIDREPYAYWKGNPFVAE 247
           W  +    Y++G+   +E
Sbjct: 200 WKKKISKGYFRGSRTSSE 217


>gi|198450765|ref|XP_002137151.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
 gi|198131182|gb|EDY67709.1| GA27052 [Drosophila pseudoobscura pseudoobscura]
          Length = 420

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 43/190 (22%), Positives = 75/190 (39%), Gaps = 23/190 (12%)

Query: 73  SIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQ 132
           S P    C  +   I  D  P+   GITR M+ ++      RL +V   +    Y+Q  +
Sbjct: 56  SQPSDPDCECHALTIRRDFGPYAEAGITRSMMAQSR-----RLGVVYKVIDGRIYRQP-E 109

Query: 133 TRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIR-SRDYSGPNNKGPPPLFRYSGDR 191
                    +  +L    G LP++E + +  D P +     ++GP       +F +S   
Sbjct: 110 VPHPKRCADVEDMLLGIAGELPNVEFILNVRDWPQVPFLSGFTGP-------VFSHSVSH 162

Query: 192 WTMDIVFPDWSFWGWAE-------INIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
             +DI+ P WSF   +          I  W  + R +       +W  ++P  +++G   
Sbjct: 163 QHLDIMCPAWSFSSVSGPPLQRFPHGIGQWGHMRRHMAAAAAQVSWEHKQPIGFFRGT-- 220

Query: 245 VAETRRDLLT 254
            + T RD L 
Sbjct: 221 RSSTERDTLV 230


>gi|443897893|dbj|GAC75232.1| endoplasmic reticulum protein EP58 [Pseudozyma antarctica T-34]
          Length = 479

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 36/160 (22%), Positives = 65/160 (40%), Gaps = 24/160 (15%)

Query: 98  GITRDMLERANQTAH---FRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLL-RKYPGRL 153
           GI  D ++ A +        LI+ + ++++    +  Q+R    +   +QLL   Y G  
Sbjct: 99  GIKYDNVQNAAENCRHGCVHLIIKHGQIFVRAQVKDWQSR----VRSTMQLLTAAYQGAS 154

Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD---------IVFPDWSFW 204
            D +   D  +  VI + D+ G  +          G  W +D          +FPD+SF 
Sbjct: 155 EDEKARIDGTEL-VISTADFDGFTDSAS------RGAGWVLDKRVNDTEGQYLFPDFSFA 207

Query: 205 GWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
            W E  I  +     + ++ N    W D+   A+W+G+  
Sbjct: 208 SWPEAGIASYPEFRHDAEQVNAETPWHDKLNPAFWRGDAL 247


>gi|115399170|ref|XP_001215174.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114192057|gb|EAU33757.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 407

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 80/173 (46%), Gaps = 30/173 (17%)

Query: 81  PDYFRWIHEDLSPWK-VTGITRDMLERAN-QTAHFRLILVNNKVYIHKYK---QSIQTRD 135
           P  F+ I   +  WK   GI++D L+    ++   R I+ N ++YI   K   +  +T+ 
Sbjct: 75  PGLFQDISRAVQYWKPRGGISKDDLDAIPFESGMARAIISNGELYIVATKAKGKDHRTKI 134

Query: 136 VFTIWGILQLLRKYPGRL--PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWT 193
           + T+  I + L   P RL  P +E +F  +DR  +   D +G       P++  S  + +
Sbjct: 135 LATLGSIYRALSSSPTRLSLPTIEFVFSIEDR--VDDIDATGH------PVWVVS-RKVS 185

Query: 194 MDIVF--PDWSFWGWAEINIKPWES------------LLRELKEGNNGRNWID 232
            + VF  PD+ +W WA+ +I P ++            L R L +   G+ W D
Sbjct: 186 EESVFLMPDFGYWSWAKSHIGPAKARVERGKLSFAPKLRRALLDVARGKPWGD 238


>gi|119499505|ref|XP_001266510.1| hypothetical protein NFIA_041910 [Neosartorya fischeri NRRL 181]
 gi|119414674|gb|EAW24613.1| predicted protein [Neosartorya fischeri NRRL 181]
          Length = 423

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 9/95 (9%)

Query: 150 PGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
           P  LPD     +  DRP+ +S  Y+ P +    P    +GD      + P ++FW W   
Sbjct: 126 PEPLPDTVFALNIQDRPMAQSWSYARPAD----PTATKAGD-----FLMPHFAFWAWPLK 176

Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPF 244
            I   +  L  + E    + +  + P A W+G P+
Sbjct: 177 YIGSMDRALTAITEIETDQTFQAKIPQAVWRGTPW 211


>gi|407921029|gb|EKG14198.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
          Length = 441

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 35/167 (20%), Positives = 73/167 (43%), Gaps = 17/167 (10%)

Query: 89  EDLS-PWKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
           EDL   W+  GITR M+         +L ++  K   H+  +  + R +  +  + + + 
Sbjct: 109 EDLDMSWRKEGITRCMI------YDHQLYVLETKGTTHR--RDYRERTLAVLHNMHRAIT 160

Query: 148 KYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR-WTMDIVFPDWSFWGW 206
            Y G LP++E  F  DD        Y   N    P ++ ++  + W    + PD+ +W W
Sbjct: 161 AYNGPLPNIEFTFSVDDWV------YDEENLNTDPVVWGFTRQQSWENVWLMPDFGYWAW 214

Query: 207 AEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
               +  ++ +  ++      + + +++P   W+G   + E R+ L+
Sbjct: 215 PTEPVGAYQDVRNQMGVREKAQAFSEKKPKVVWRGAA-LTEQRQALI 260


>gi|47078285|ref|NP_997610.1| KDEL (Lys-Asp-Glu-Leu) containing 2 protein precursor [Mus
           musculus]
 gi|148693845|gb|EDL25792.1| KDEL (Lys-Asp-Glu-Leu) containing 2, isoform CRA_b [Mus musculus]
 gi|157169796|gb|AAI52824.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [synthetic construct]
          Length = 503

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 80/193 (41%), Gaps = 25/193 (12%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
           CP + P+  Q            T +CP     I +D   +    + + + E   +    R
Sbjct: 138 CPEDDPQVWQE-----------TLSCPASEPQIEQDFVSFPSINLQQMLKEVPTRFGDER 186

Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
             +V+  +   H Y++S+     F ++    +L L RK    LPDLE   +  D P+   
Sbjct: 187 GAVVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 244

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
           +    P   GP P+  + G   + DI+ P +         ++    +  +L   +GN G 
Sbjct: 245 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 298

Query: 229 NWIDREPYAYWKG 241
           +WI++   A+++G
Sbjct: 299 SWINKTEKAFFRG 311


>gi|33327382|gb|AAQ09021.1| unknown protein [Homo sapiens]
          Length = 507

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 76/169 (44%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP+    I +D + +    + + + E      + R  +V+  +   H Y++S+    
Sbjct: 153 TLSCPNKETQIAKDFASFPSINLQQMLKEVPKSFGYERGAIVHYTILNNHVYRRSLGKYT 212

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P+   P P+  + G   
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + D+V P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 315


>gi|340369006|ref|XP_003383040.1| PREDICTED: protein O-glucosyltransferase 1-like [Amphimedon
           queenslandica]
          Length = 325

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 59/130 (45%), Gaps = 17/130 (13%)

Query: 87  IHEDLSPWKVTGITR--DMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQ 144
           +  DL  W+  G  +  + +   ++  H+++  VN K+Y  K          F   G+  
Sbjct: 4   LESDLGVWRERGGIKREEFIHAKSKGVHYQI--VNGKLYREK-----DCLFSFRCKGVEH 56

Query: 145 LLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFW 204
            +      LP++EL+ +  D P  +S  Y  P      P+F +S      DI++P W+FW
Sbjct: 57  FILNIIEDLPNMELIINVFDYP--KSHKYHSP-----LPVFSFSKTVHYWDIMYPAWTFW 109

Query: 205 -GWAEINIKP 213
            G   ++++P
Sbjct: 110 SGGPAVSVEP 119


>gi|255938965|ref|XP_002560252.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584874|emb|CAP82911.1| Pc15g00250 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 458

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 1/59 (1%)

Query: 196 IVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
           I+ PD+ +W W + NI P+  +++ +    +   + D+E    W+G   F  + RR LL
Sbjct: 223 ILMPDFGYWSWDKSNIGPYGQVVQSIMAAESNLKFADKEQKLVWRGKLSFAPKLRRALL 281


>gi|26344660|dbj|BAC35979.1| unnamed protein product [Mus musculus]
          Length = 465

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 80/193 (41%), Gaps = 25/193 (12%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
           CP + P+  Q            T +CP     I +D   +    + + + E   +    R
Sbjct: 100 CPEDDPQVWQE-----------TLSCPASEPQIEQDFVSFPSINLQQMLKEVPTRFGDER 148

Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
             +V+  +   H Y++S+     F ++    +L L RK    LPDLE   +  D P+   
Sbjct: 149 GAVVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 206

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
           +    P   GP P+  + G   + DI+ P +         ++    +  +L   +GN G 
Sbjct: 207 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 260

Query: 229 NWIDREPYAYWKG 241
           +WI++   A+++G
Sbjct: 261 SWINKTEKAFFRG 273


>gi|313232413|emb|CBY24081.1| unnamed protein product [Oikopleura dioica]
          Length = 536

 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 51/208 (24%), Positives = 84/208 (40%), Gaps = 42/208 (20%)

Query: 80  CPDYFRWIHEDLSPWKVTGITRDMLE----------RANQTAHFRLILVNNKVYIHKYKQ 129
           CP  F  + EDL+ +    + R   +              TAH  +++ + K+++  +  
Sbjct: 172 CPANFEQLDEDLAQFDEIDLDRLYEDGKKKKWAAGGHGEATAH--VVIKDQKLFVEDFGT 229

Query: 130 SIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSG 189
            +  R  F     L LLRK   +LPD E +F+  D P+         N   P P+  + G
Sbjct: 230 IMGFRG-FMSSMFLSLLRKV--KLPDAEFIFNLGDWPL-------EENLTDPQPILTWCG 279

Query: 190 DRWTMDIVFPDWSFWGWAEINIKPWESLLRELK-----EGNNGR--NWIDREPYAYWKG- 241
              T DI  P W      +       +L RE K     E  +G   +W ++    Y++G 
Sbjct: 280 SSNTSDIAVPTW------DQTKNTRHALFRERKDIQYVEQISGEVVSWNEKIERGYFRGR 333

Query: 242 --NPFVAETRRDLLTCNLSDKHDWNARL 267
             NP    +R  L   +++   D +ARL
Sbjct: 334 DSNP----SRLKLCELSMAHPEDIDARL 357


>gi|444723570|gb|ELW64221.1| KDEL motif-containing protein 2 [Tupaia chinensis]
          Length = 470

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 40/171 (23%), Positives = 75/171 (43%), Gaps = 14/171 (8%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  V   H Y++S+    
Sbjct: 116 TLSCPIKEPQIAKDFASFPSINLQQMLNEVPERFGDERGAIVHYTVLNNHIYRRSLGKYT 175

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P   GP P+  + G   
Sbjct: 176 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GPIPIISWCGSLD 230

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
           + DI+ P +         ++    +  +L   +GN G +WI++   A+++G
Sbjct: 231 SRDIILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 278


>gi|72679937|gb|AAI00618.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
          Length = 508

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 80/193 (41%), Gaps = 25/193 (12%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
           CP + P+  Q            T +CP     I +D   +    + + + E   +    R
Sbjct: 143 CPEDDPQAWQK-----------TLSCPANEPQIEQDFISFPSINLQQMLKEVPKRFGDER 191

Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
             +V+  +   H Y++S+     F ++    +L L RK    LPDLE   +  D P+   
Sbjct: 192 GAIVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 249

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
           +    P   GP P+  + G   + DI+ P +         ++    +  +L   +GN G 
Sbjct: 250 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 303

Query: 229 NWIDREPYAYWKG 241
           +WI++   A+++G
Sbjct: 304 SWINKTEKAFFRG 316


>gi|255955591|ref|XP_002568548.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211590259|emb|CAP96435.1| Pc21g15380 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 469

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 113/283 (39%), Gaps = 46/283 (16%)

Query: 1   MIELFAFSIILQSNFSVHNISRNKTVIIVSRKPQNRPPKRITTPLNCVKNQTQTCP---- 56
           +I  F+FS I+   F +++       I+    P        +T   C  +   TCP    
Sbjct: 18  VITAFSFSAIIVIWFGMNSDREYIHPILTQLIPAGHCACESSTTFQC--STCLTCPEPGL 75

Query: 57  -TNYPKTSQTQESISDYS------IPPTSTCPDYFRWIHEDL--------SPWKVTGITR 101
            TN  +T+ +      Y+          S C   F  + ED+        S  +++ +  
Sbjct: 76  QTNSLQTNSSPAWTFQYTRDGRNETLNKSQCKAAFPGLFEDVARGGKFWSSHGRLSSVEL 135

Query: 102 DMLERANQTAHFRLILVNNKVYI--HKYKQSIQTRDVF-TIWGILQLLRKYPGRL--PDL 156
           D +    Q    R  + N ++Y+   + K     R +  T+  I + L   P R   PD 
Sbjct: 136 DAI--PIQLGMVRAFISNGQLYVVTARSKGEDHRRKILGTLSSIHRALAADPERASRPDF 193

Query: 157 ELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR---WTMDIVFPDWSFWGWAEIN--I 211
           E +F  +D+      D +  N++ P  +F  +      W M    PD+SFW W   N  I
Sbjct: 194 EFVFSVEDK----VDDVT--NSEWPVWVFSRTPSEEGVWLM----PDFSFWAWDNKNNYI 243

Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNP-FVAETRRDLL 253
            P++ ++  +K  +    W ++ P   W+G P F  + RR L+
Sbjct: 244 GPYDQVVERIKHMDI--PWSEKTPQLVWRGKPSFAPKLRRALM 284


>gi|357608730|gb|EHJ66122.1| KDEL motif-containing protein 1 [Danaus plexippus]
          Length = 234

 Score = 38.5 bits (88), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 56/124 (45%), Gaps = 12/124 (9%)

Query: 141 GILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPD 200
           G+   L+     LPD++L+ +  D P   ++D+         P+F +S  R   DI++P 
Sbjct: 33  GVEHYLKILSPTLPDMDLVINTRDWPQF-NKDWGHKK----APVFSFSKTRSYYDIMYPT 87

Query: 201 WSFW-GWAEINIKP-----WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLT 254
           WSFW G   I + P     W+     +        W  +E  A+++G+   +E R  L+ 
Sbjct: 88  WSFWEGGPAIALYPTGIGRWDKHRTSISTAAEKWPWNKKEEKAFFRGSR-TSEERDALIL 146

Query: 255 CNLS 258
            + S
Sbjct: 147 LSRS 150


>gi|219886273|gb|ACL53511.1| unknown [Zea mays]
          Length = 527

 Score = 38.5 bits (88), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 65  TQESISDYSIPPTSTCPDYFRWIHEDLSPW 94
           T+ S S    PP   CPDYFR+IH DLSPW
Sbjct: 187 TKSSSSKNKQPPPQ-CPDYFRFIHSDLSPW 215


>gi|238495869|ref|XP_002379170.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
 gi|220694050|gb|EED50394.1| DUF821 domain protein [Aspergillus flavus NRRL3357]
          Length = 302

 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
           TR   T+  + + L  +P R  LP++E         V+ + DYS     G  P++ YS  
Sbjct: 6   TRGKATLNSLHRALASFPDRDRLPNVEF--------VLTTEDYS----SGEGPIWSYSKR 53

Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWID 232
               ++ + PD+ +W W E+ + P++   R +   ++G   +D
Sbjct: 54  EENTNVWLMPDFGYWSWPEVGVGPYKDARRRIAAIDDGEVTVD 96


>gi|153218522|ref|NP_001020294.3| KDEL motif-containing protein 2 precursor [Rattus norvegicus]
 gi|81888044|sp|Q566E5.1|KDEL2_RAT RecName: Full=KDEL motif-containing protein 2; Flags: Precursor
 gi|62471552|gb|AAH93594.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
 gi|149041673|gb|EDL95514.1| KDEL (Lys-Asp-Glu-Leu) containing 2 [Rattus norvegicus]
          Length = 508

 Score = 38.5 bits (88), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 57/250 (22%), Positives = 100/250 (40%), Gaps = 43/250 (17%)

Query: 55  CPTNYPKTSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFR 114
           CP + P+  Q            T +CP     I +D   +    + + + E   +    R
Sbjct: 143 CPEDDPQAWQK-----------TLSCPANEPQIEQDFISFPSINLQQMLKEVPKRFGDER 191

Query: 115 LILVNNKVYI-HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRS 170
             +V+  +   H Y++S+     F ++    +L L RK    LPDLE   +  D P+   
Sbjct: 192 GAIVHYTILNNHIYRRSLGKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHR 249

Query: 171 RDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGR 228
           +    P   GP P+  + G   + DI+ P +         ++    +  +L   +GN G 
Sbjct: 250 KVNDTP---GPIPIISWCGSLDSRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGP 303

Query: 229 NWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDWNARLYVQVISFIYITHLCSKIPGFN 288
           +WI++   A+++G     ++R + L   L  K   N +L            L + I G+ 
Sbjct: 304 SWINKTEKAFFRGR----DSREERLQLVLLSKE--NPQL------------LDAGITGYF 345

Query: 289 FFLIFFKMLG 298
           FF    K LG
Sbjct: 346 FFQEKEKELG 355


>gi|396482709|ref|XP_003841528.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312218103|emb|CBX98049.1| similar to DUF821 domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 428

 Score = 38.5 bits (88), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 48/104 (46%), Gaps = 12/104 (11%)

Query: 153 LPDLELMFDCDDRPVIRSRDYS---GPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEI 209
           +P++E     DD P  RS+D+    G   K      R   D W M    P++++W W   
Sbjct: 152 VPNIEFSLSLDDLPR-RSKDHGVFFGYTRKEG----REYNDIWMM----PNYAYWSWNYT 202

Query: 210 NIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLL 253
           +   W S+ RE+++      W  ++    W+G   +A+ R +L+
Sbjct: 203 HAPSWNSIRREIEQREAEIPWAMKDSRVVWRGKVKMAKLRSELV 246


>gi|410971943|ref|XP_003992420.1| PREDICTED: KDEL motif-containing protein 2 [Felis catus]
          Length = 445

 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 38/171 (22%), Positives = 76/171 (44%), Gaps = 14/171 (8%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  +   H Y++S+    
Sbjct: 91  TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 150

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPD+E   +  D P+   +    P   GP P+  + G   
Sbjct: 151 DFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 205

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
           + DI+ P +     +   ++    +  +L   +GN G +WI++   A+++G
Sbjct: 206 SRDIILPTYDI---SHSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 253


>gi|344243629|gb|EGV99732.1| KDEL motif-containing protein 2 [Cricetulus griseus]
          Length = 452

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 75/171 (43%), Gaps = 14/171 (8%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN-NKVYIHKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+   V  H +++S+    
Sbjct: 98  TLSCPATEPQIEQDFTSFPSIDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSLGKYT 157

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P   GP P+  + G   
Sbjct: 158 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 212

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
           + DI+ P +         ++    +  +L   +GN G +WI++   A+++G
Sbjct: 213 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 260


>gi|354481242|ref|XP_003502811.1| PREDICTED: KDEL motif-containing protein 2-like [Cricetulus
           griseus]
          Length = 472

 Score = 38.1 bits (87), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 75/171 (43%), Gaps = 14/171 (8%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVN-NKVYIHKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+   V  H +++S+    
Sbjct: 118 TLSCPATEPQIEQDFTSFPSIDLQQMLKEVPKRFGDERGAVVHYTIVNNHIHRRSLGKYT 177

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P   GP P+  + G   
Sbjct: 178 DFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTP---GPIPIISWCGSLD 232

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLREL--KEGNNGRNWIDREPYAYWKG 241
           + DI+ P +         ++    +  +L   +GN G +WI++   A+++G
Sbjct: 233 SRDIILPTYDV---THSTLEAMRGVTNDLLSVQGNTGPSWINKTEKAFFRG 280


>gi|239608365|gb|EEQ85352.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ER-3]
 gi|327349565|gb|EGE78422.1| DUF821 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 449

 Score = 38.1 bits (87), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 51/240 (21%), Positives = 97/240 (40%), Gaps = 38/240 (15%)

Query: 37  PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTC----PDYFRWIHEDLS 92
           PP R   P     N    CP + P   +T+    +Y +   + C    P  +  I + + 
Sbjct: 47  PPCRPVIPAAGSNNNGGACPGDAPWKFETERDGDNYGLS-RAQCQKAFPKLYVEIEKAVV 105

Query: 93  PWKVTGITRDML-ERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKY 149
             +   IT D L  +  +T+  R ++    +Y+  ++    T  R   ++  + + +   
Sbjct: 106 ARRGRNITFDELNSKPLKTSMVRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAI 165

Query: 150 PG--RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS---GDRWTMDIVFPDWSFW 204
           P    +P++E +F  +D             +  P P++ YS    D W    + PD+ +W
Sbjct: 166 PNPNEIPNIEFIFTTEDF------------HDDPHPVWVYSKREDDGWAW--LMPDFGYW 211

Query: 205 GWAEINIKPWESL---LRELKEGN--NGR-----NWIDREPYAYWKGNPFVA-ETRRDLL 253
            W EI    + S+   +  + EG   NG+      + D++    W+G    A E R+ L+
Sbjct: 212 SWPEIKAGQYRSVRQRIAAIDEGTVINGKAHPPLKFQDKKKQLLWRGAIATAPELRQGLV 271


>gi|431907500|gb|ELK11352.1| KDEL motif-containing protein 2 [Pteropus alecto]
          Length = 447

 Score = 38.1 bits (87), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 56/120 (46%), Gaps = 9/120 (7%)

Query: 125 HKYKQSIQTRDVFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGP 181
           H Y++S+     F ++    +L L RK    LPDLE   +  D P+   +    P   GP
Sbjct: 143 HIYRRSLGKYTDFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GP 197

Query: 182 PPLFRYSGDRWTMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
            P+  + G   + DI+ P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 198 LPIISWCGSLDSQDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 256


>gi|261203203|ref|XP_002628815.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239586600|gb|EEQ69243.1| DUF821 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 449

 Score = 38.1 bits (87), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 51/240 (21%), Positives = 97/240 (40%), Gaps = 38/240 (15%)

Query: 37  PPKRITTPLNCVKNQTQTCPTNYPKTSQTQESISDYSIPPTSTC----PDYFRWIHEDLS 92
           PP R   P     N    CP + P   +T+    +Y +   + C    P  +  I + + 
Sbjct: 47  PPCRPVIPAAGSNNNGGACPGDAPWKFETERDGDNYGLS-RAQCQKAFPKLYVEIEKAVV 105

Query: 93  PWKVTGITRDML-ERANQTAHFRLILVNNKVYIHKYKQSIQT--RDVFTIWGILQLLRKY 149
             +   IT D L  +  +T+  R ++    +Y+  ++    T  R   ++  + + +   
Sbjct: 106 ARRGRNITFDELNSKPLKTSMVRAMVYQGALYVINFEDMRYTFSRAKASLNALNRAISAI 165

Query: 150 PG--RLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYS---GDRWTMDIVFPDWSFW 204
           P    +P++E +F  +D             +  P P++ YS    D W    + PD+ +W
Sbjct: 166 PNPNEIPNIEFIFTTEDF------------HDDPHPVWVYSKREDDGWAW--LMPDFGYW 211

Query: 205 GWAEINIKPWESL---LRELKEGN--NGR-----NWIDREPYAYWKGNPFVA-ETRRDLL 253
            W EI    + S+   +  + EG   NG+      + D++    W+G    A E R+ L+
Sbjct: 212 SWPEIKAGQYRSVRQRIAAIDEGTVINGKAHPPLKFQDKKKQLLWRGAIATAPELRQGLV 271


>gi|34531642|dbj|BAC86191.1| unnamed protein product [Homo sapiens]
          Length = 409

 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  +   H Y++S+    
Sbjct: 97  TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 156

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P+   P P+  + G   
Sbjct: 157 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 211

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + D+V P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 212 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 259


>gi|73955182|ref|XP_546537.2| PREDICTED: KDEL motif-containing protein 2 [Canis lupus familiaris]
          Length = 508

 Score = 37.7 bits (86), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  +   H Y++S+    
Sbjct: 154 TLSCPTKEPQIAKDFASFPSINLQQMLDEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 213

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPD+E   +  D P+   +    P   GP P+  + G   
Sbjct: 214 DFKMFSDEILLSLARKV--LLPDIEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 268

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + DI+ P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 269 SRDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 316


>gi|344287853|ref|XP_003415666.1| PREDICTED: KDEL motif-containing protein 2 [Loxodonta africana]
          Length = 507

 Score = 37.7 bits (86), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 38/171 (22%), Positives = 74/171 (43%), Gaps = 14/171 (8%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D   +    + + + E   +    R  +V+  +   H Y++S+    
Sbjct: 153 TLSCPTKEPQIEKDFLSFPSINLQQMLNEVPKRFGDERGAIVHYTILNNHIYRRSLGKYT 212

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPD+E   +  D P+   +    P   GP PL  + G   
Sbjct: 213 DFKMFSDEILLSLARKV--LLPDVEFYINVGDWPLEHRKVNETP---GPIPLISWCGSLD 267

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELK--EGNNGRNWIDREPYAYWKG 241
           + D++ P +         ++    +  +L   +GN G +WI++   A+++G
Sbjct: 268 SRDVILPTYDI---THSTLEAMRGVTNDLLSIQGNTGPSWINKTEKAFFRG 315


>gi|71002250|ref|XP_755806.1| DUF821 domain protein [Aspergillus fumigatus Af293]
 gi|66853444|gb|EAL93768.1| DUF821 domain protein [Aspergillus fumigatus Af293]
 gi|159129863|gb|EDP54977.1| DUF821 domain protein [Aspergillus fumigatus A1163]
          Length = 435

 Score = 37.7 bits (86), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 23/98 (23%), Positives = 45/98 (45%), Gaps = 15/98 (15%)

Query: 133 TRDVFTIWGILQLLRKYPGR--LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGD 190
           +R   T+  + + ++ +P R  LPD+E +   DD   +            P P++ YS  
Sbjct: 135 SRAKATLNSLHRAMQSFPDRHNLPDIEFVLTADDFSNV------------PGPVWSYSKR 182

Query: 191 RWTMDI-VFPDWSFWGWAEINIKPWESLLRELKEGNNG 227
                I + PD+ +W W E+ + P++ + R +   + G
Sbjct: 183 DEDESIWLMPDFGYWAWPEVKVGPYKDIRRRIAAVDEG 220


>gi|449485840|ref|XP_002191338.2| PREDICTED: protein O-glucosyltransferase 1 [Taeniopygia guttata]
          Length = 514

 Score = 37.4 bits (85), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 48/196 (24%), Positives = 86/196 (43%), Gaps = 34/196 (17%)

Query: 62  TSQTQESISDYSIPPTSTCPDYFRWIHEDLSPWKVTGITR----DMLERANQTAHFRLIL 117
           T Q + ++  Y       C  +     +DL+P++  GI++    D++ R   T H+++I 
Sbjct: 158 TGQIKRAVEAYEPCVKENCSCHQSVWKQDLAPFR-GGISKETMSDVVSRKLGT-HYQII- 214

Query: 118 VNNKVYIHK---YKQSIQTRDVFTIWGILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYS 174
             NK+Y  +   +       + F I GI+        RLPD+E++ +  D P +      
Sbjct: 215 -KNKLYREQDCMFPARCSGVEHF-ILGIIH-------RLPDMEMVINVRDYPQV------ 259

Query: 175 GPNNKGPP-PLFRYSGDRWTMDIVFPDWSFW--GWAEINIKP-----WESLLRELKEGNN 226
            P    P  P+F +S      DI++P W+FW  G A   I P     W+ +  +L+    
Sbjct: 260 -PKWMKPIIPVFSFSKTSEYNDIMYPAWTFWEGGPAVWPIYPTGLGRWDLMREDLRRSAE 318

Query: 227 GRNWIDREPYAYWKGN 242
              W  +    Y++G+
Sbjct: 319 KWPWKKKISKGYFRGS 334


>gi|432106972|gb|ELK32490.1| KDEL motif-containing protein 2, partial [Myotis davidii]
          Length = 442

 Score = 37.4 bits (85), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 74/169 (43%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  +   H Y++ +    
Sbjct: 88  TLSCPTKEPQIAKDFASFPSINLQQMLNEVPKRFGEERGAIVHYTILNNHIYRRPLGKYT 147

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P   GP P+  + G   
Sbjct: 148 DFKMFSDEILLSLARKV--LLPDLEFYINLGDWPLEHRKVNETP---GPLPIISWCGSLD 202

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + DI+ P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 203 SRDIILPTYDITHSTLEAMRGVTNDLLSI-QGNTGPSWINKTEKAFFRG 250


>gi|407921427|gb|EKG14575.1| Lipopolysaccharide-modifying protein [Macrophomina phaseolina MS6]
          Length = 490

 Score = 37.4 bits (85), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 41/168 (24%), Positives = 73/168 (43%), Gaps = 23/168 (13%)

Query: 94  WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
           WK+ GI R M+         +L ++N +      ++  + R +  +  I + +  YPG +
Sbjct: 113 WKIDGIIRAMIYDR------QLFILNAR---GARRRDYRQRTLAVLQSIQRAITAYPGDI 163

Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDR--WTMDIVFPDWSFWGWAEINI 211
           P++E  F  DD        Y+   +     L R   D   W M    PD+ F+ W E   
Sbjct: 164 PNIEFSFVVDDGAYFAV--YNNETSATTWALTREPQDDNLWLM----PDFGFYSW-EGPA 216

Query: 212 KPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVA---ETRRDLLTCN 256
             + +LLR +++  +   +  ++P A W+G    A   + R DLL  +
Sbjct: 217 GEYNALLRAIEQ--DEMPFEQKDPRAIWRGAKAPAGHVQVRSDLLKVS 262


>gi|34193987|gb|AAH36526.3| KDEL (Lys-Asp-Glu-Leu) containing 2 [Homo sapiens]
          Length = 507

 Score = 37.4 bits (85), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 39/175 (22%), Positives = 77/175 (44%), Gaps = 10/175 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  +   H Y++S+    
Sbjct: 153 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 212

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P+   P P+  + G   
Sbjct: 213 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 267

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAE 247
           + D+V P +         ++   + L  + +GN G +WI++   A+++G   + E
Sbjct: 268 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRGRDSLEE 321


>gi|397516340|ref|XP_003828388.1| PREDICTED: KDEL motif-containing protein 2 isoform 2 [Pan paniscus]
          Length = 499

 Score = 37.4 bits (85), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 10/169 (5%)

Query: 77  TSTCPDYFRWIHEDLSPWKVTGITRDMLERANQTAHFRLILVNNKVYI-HKYKQSIQTRD 135
           T +CP     I +D + +    + + + E   +    R  +V+  +   H Y++S+    
Sbjct: 145 TLSCPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSLGKYT 204

Query: 136 VFTIWG---ILQLLRKYPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRW 192
            F ++    +L L RK    LPDLE   +  D P+   +    P+   P P+  + G   
Sbjct: 205 DFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPS---PIPIISWCGSLD 259

Query: 193 TMDIVFPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKG 241
           + D+V P +         ++   + L  + +GN G +WI++   A+++G
Sbjct: 260 SRDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRG 307


>gi|319411533|emb|CBQ73577.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 452

 Score = 37.4 bits (85), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 41/180 (22%), Positives = 70/180 (38%), Gaps = 28/180 (15%)

Query: 92  SPWKVTGITR--DMLERANQTAH--FRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLR 147
           + WK  G  R  D+   A    H    L++ + +++I   ++  Q+R    +   LQLL+
Sbjct: 73  AAWKAKGGVRFEDVQNAAQHCRHGCVHLVIRDGQIFIRAQEKDWQSR----VRSTLQLLQ 128

Query: 148 K-YPGRLPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMD---------IV 197
             Y G   +   + +  +  VI + D+ G  +          G  W +D          +
Sbjct: 129 SAYLGASEEEREVMEGVEL-VISTADFDGFTDAAS------RGAGWVLDKRVNDTQGQYL 181

Query: 198 FPDWSFWGWAEINIKPWESLLRELKEGNNGRNWIDREPYAYWKGNPFVAET---RRDLLT 254
           FPD+SF  W E  I  +    R     N    W  +   A+W+G+     +   R  LL 
Sbjct: 182 FPDFSFASWPEAGIPSYPEFRRAAARVNAAVPWASKANRAFWRGDALAGSSIPARESLLA 241


>gi|327303802|ref|XP_003236593.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
 gi|326461935|gb|EGD87388.1| hypothetical protein TERG_03638 [Trichophyton rubrum CBS 118892]
          Length = 431

 Score = 37.4 bits (85), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 40/166 (24%), Positives = 69/166 (41%), Gaps = 18/166 (10%)

Query: 98  GITRDMLERAN-QTAHFRLILVNNKVYI--HKYKQSIQTRDVFTIWGILQ-LLRKYPGR- 152
           GI+ + L++   +    R I+   ++YI   K  Q    + +  +   +   L    G  
Sbjct: 95  GISNEHLDKIELKDGMARAIIHRGELYIVATKAAQEDHRKKILAVLSSMHHALSATAGNR 154

Query: 153 -LPDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDI-VFPDWSFWGW--AE 208
             P +E +F  +DR      D SGP +    P++  S       I + PD+ FW W  A 
Sbjct: 155 TQPSIEFVFSIEDR----VDDISGPGH----PIWALSRKASEESIWLIPDFGFWAWDNAA 206

Query: 209 INIKPWESLLRELKEGNNGRNWIDREPYAYWKGN-PFVAETRRDLL 253
             I P+  ++  ++       W  +E    W+G   F  + RR+LL
Sbjct: 207 NAIGPYNQVVDRIQRQEVTVPWSSKENKLVWRGKLSFAPKLRRNLL 252


>gi|452984518|gb|EME84275.1| hypothetical protein MYCFIDRAFT_118382, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 387

 Score = 37.4 bits (85), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 37/170 (21%), Positives = 66/170 (38%), Gaps = 18/170 (10%)

Query: 94  WKVTGITRDMLERANQTAHFRLILVNNKVYIHKYKQSIQTRDVFTIWGILQLLRKYPGRL 153
           W+  GI R M+   NQ      ++  + V  H ++     R + T+  I + +    G+L
Sbjct: 62  WRGDGIVRAMIH-DNQL----YVIDPHAVTDHNHR----PRTLATLHAIHRAVSASSGKL 112

Query: 154 PDLELMFDCDDRPVIRSRDYSGPNNKGPPPLFRYSGDRWTMDIVFPDWSFWGWAEINIKP 213
           PD+E  F   D  +    D  G +         +    W M    PD+  WGW ++ ++ 
Sbjct: 113 PDIEFSFTVHDFAL---HDRYGNHTTWAYTRLPHQEKLWLM----PDFGLWGWPDVGLRS 165

Query: 214 WESLLRELKEGNNGRNWIDREPYAYWKGNPFVAETRRDLLTCNLSDKHDW 263
           +      L    +   ++D+ P   W+G+  V            S+K  W
Sbjct: 166 YAEFQTVLDYEED--EFVDKIPKLVWRGSLAVGSHDVRAGLVKHSEKQPW 213


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.139    0.460 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,261,866,332
Number of Sequences: 23463169
Number of extensions: 232841045
Number of successful extensions: 431041
Number of sequences better than 100.0: 450
Number of HSP's better than 100.0 without gapping: 193
Number of HSP's successfully gapped in prelim test: 257
Number of HSP's that attempted gapping in prelim test: 430396
Number of HSP's gapped (non-prelim): 464
length of query: 302
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 161
effective length of database: 9,050,888,538
effective search space: 1457193054618
effective search space used: 1457193054618
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 76 (33.9 bits)