BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008526
(563 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255585251|ref|XP_002533327.1| conserved hypothetical protein [Ricinus communis]
gi|223526849|gb|EEF29063.1| conserved hypothetical protein [Ricinus communis]
Length = 565
Score = 759 bits (1960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/580 (67%), Positives = 465/580 (80%), Gaps = 32/580 (5%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHR--LPTSDNNEDEEHNRRHSTFHIDDFPNAPPIR 58
MERDSSD++DDRE LI QND KH NH+ +PTS H R STFHI+++ IR
Sbjct: 1 MERDSSDEEDDRENLIEQNDRKHHNHQQTVPTSS-----PHRRSFSTFHIEEYGGV--IR 53
Query: 59 RRFTFDFKKLNNKRY---LFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRES 115
RR L NKRY L A+ LPLLII++YFS +LRSLFS N + F+S +DRMRE+
Sbjct: 54 RR-------LFNKRYYYYLLAIFLPLLIIIVYFSADLRSLFSANISSLNFNSASDRMREA 106
Query: 116 ELRALSLLKQQQSHLLSLWNQSFVNNS--------YGNNTNNPFFQEAKSVLLNQISLNR 167
EL+AL LL+QQQ LLS++NQSF + + + N+ +N + +S LL Q++ N+
Sbjct: 107 ELQALYLLEQQQLSLLSIFNQSFPSRNKNFSSNSSFINSFDNVKIENFRSALLKQMTFNK 166
Query: 168 QIEQILLSPHKVSN----FTPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLS 223
QI+QILLSPHK N + + + +G + C+K++S +++T+EWKP+SDKFLF ICLS
Sbjct: 167 QIQQILLSPHKSGNENVSGSFSGSGFGFDRCKKVESRFLDRKTIEWKPRSDKFLFPICLS 226
Query: 224 GQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFM 283
GQMSNHLICLEKHMF AALLNRVLV+PSSKFDYQY+RVLDIEHIN C+GRKVVV+FE F+
Sbjct: 227 GQMSNHLICLEKHMFFAALLNRVLVMPSSKFDYQYNRVLDIEHINLCVGRKVVVTFEEFV 286
Query: 284 EMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQ 343
+M KNH HIDRF+CYF P C+VD+EH+KKLK LGI MGK E+ WK ED +KPS++TVQ
Sbjct: 287 QMRKNHVHIDRFICYFSSPTACYVDEEHVKKLKGLGILMGKPESPWK-EDVKKPSQKTVQ 345
Query: 344 DIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTF 403
D+ KF ++DDVIA+GD+FYAD+E+DWVMQPGGP+ H+CKTLIEPSRLI+VTAQRF+QTF
Sbjct: 346 DVLAKFTSNDDVIAIGDVFYADMEQDWVMQPGGPLAHKCKTLIEPSRLILVTAQRFIQTF 405
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG NFIALHFRRHGFLKFCNAK PSCFYPIPQAADCI R+AERA APVIYLSTDAAESET
Sbjct: 406 LGKNFIALHFRRHGFLKFCNAKNPSCFYPIPQAADCIARVAERANAPVIYLSTDAAESET 465
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
LLQSL+++NGKT+ LVKRP S EKWDSLL RH +EDDSQVEAMLDKTI AMSNVFIG
Sbjct: 466 DLLQSLIIVNGKTVPLVKRPSHTSVEKWDSLLSRHGIEDDSQVEAMLDKTISAMSNVFIG 525
Query: 524 ASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
ASGSTFTEDI+RLRKDW S SLCDEYLCQGE PNFIAEDE
Sbjct: 526 ASGSTFTEDILRLRKDWESASLCDEYLCQGELPNFIAEDE 565
>gi|297795837|ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp.
lyrata]
gi|297311638|gb|EFH42062.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp.
lyrata]
Length = 566
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/574 (65%), Positives = 456/574 (79%), Gaps = 19/574 (3%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR 60
MER+SSDD++D + LI QNDT+ + P S N+R S F I+D ++RR
Sbjct: 1 MERNSSDDEEDHQHLIPQNDTRIRHREDPISSTATTTGGNQR-SAFQIEDILQR--VQRR 57
Query: 61 FTFDFKKLNNKRYLFA-LSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRA 119
+ NKRY+ +SL + I LL+ + R LFS N+ +F+ D L++R++ESELRA
Sbjct: 58 WKISL----NKRYVIVFVSLIISIGLLFLLTDPRELFSANFSSFKLDPLSNRVKESELRA 113
Query: 120 LSLLKQQQSHLLSLWNQSFVNNSYGNNTNN----PFFQEAKSVLLNQISLNRQIEQILLS 175
L LL+QQQ LLSLWN + VN S + N+ F++ KS + QISLN++I+ +LLS
Sbjct: 114 LYLLRQQQLALLSLWNGTLVNPSLNQSENDLRSSVLFEDVKSAVSKQISLNKEIQNVLLS 173
Query: 176 PHKVSNFTPNDAV----WGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLI 231
PH+ SN++ V + + CRK+D + +++TVEWKP+SDKFLFAICLSGQMSNHLI
Sbjct: 174 PHRSSNYSGGTEVDSVNFSYDRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQMSNHLI 233
Query: 232 CLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFME-MEKNHA 290
CLEKHMF AALL+RVLVIPSSKFDYQY RV+DIE IN CLGR VVVSF+ F E +KNH
Sbjct: 234 CLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIEGINTCLGRNVVVSFDQFKEKAKKNHF 293
Query: 291 HIDRFLCYFGLPQPCFVDDEHIKKLKQLGISM-GKTETVWKNEDTRKPSKRTVQDIEGKF 349
IDRF+CYF PQ C+VD+EHIKKLK LGIS+ GK E W +ED +KPSKRTVQD++ KF
Sbjct: 294 RIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPW-SEDIKKPSKRTVQDVQTKF 352
Query: 350 KTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFI 409
K+DDDVIA+GD+FYAD+E+DWVMQPGGPINH+CKTLIEPS+LI++TAQRF+QTFLG NFI
Sbjct: 353 KSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFLGKNFI 412
Query: 410 ALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSL 469
ALHFRRHGFLKFCNAK PSCFYPIPQAA+CI R+ ER+ VIYLSTDAAESETSLLQSL
Sbjct: 413 ALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETSLLQSL 472
Query: 470 VVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTF 529
VV++GK + LVKRPPRNSAEKWD+LLYRH +EDDSQV+AMLDKTICAMS+VFIGASGSTF
Sbjct: 473 VVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGASGSTF 532
Query: 530 TEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
TEDI+RLRKDWG++S CDEYLC+GEEPNFIAEDE
Sbjct: 533 TEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566
>gi|225441959|ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254979 isoform 1 [Vitis
vinifera]
Length = 559
Score = 738 bits (1906), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/584 (62%), Positives = 442/584 (75%), Gaps = 46/584 (7%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR 60
MER+SSDD++DR+ LI +N+ K LP S F I+DF + R
Sbjct: 1 MERESSDDEEDRQNLIDENERK-----LP------------HRSGFQIEDFKSRLSAHR- 42
Query: 61 FTFDFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRAL 120
F+F NKRYLFA+ PL I+L+YF+ ++R+LF+ + + DS DRMRESELRAL
Sbjct: 43 FSF------NKRYLFAIFPPLFILLIYFTTDVRNLFTTSISIVKADSPTDRMRESELRAL 96
Query: 121 SLLKQQQSHLLSLWNQSFVNNSY---GNNTNNPF----------FQEAKSVLLNQISLNR 167
LL+QQQ L SLWN + +S N++N+ + KS LL QISLN+
Sbjct: 97 YLLRQQQLSLFSLWNHTAFADSAPIPSNSSNSTLDFSTRQVLLSSADFKSALLKQISLNK 156
Query: 168 QIEQILLSPHKVSNFTP--------NDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFA 219
+I+Q+LLS H N + N + C K++ + + T+EWKP+SDK+LFA
Sbjct: 157 EIQQVLLSSHPSGNLSELVDDNGDLNFGAYSFNRCPKVNQNMSQRPTIEWKPRSDKYLFA 216
Query: 220 ICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSF 279
ICLSGQMSNHLICLEKHMF AALLNR+LVIPSSKFDYQY+RVLDIEHIN+CLGRKVVV+F
Sbjct: 217 ICLSGQMSNHLICLEKHMFFAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTF 276
Query: 280 ENFMEMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSK 339
E F E +KNH HIDR +CYF LP PC+VDD+H+KKLK LGISMGK E W ED +KP K
Sbjct: 277 EEFTESKKNHLHIDRVICYFSLPLPCYVDDDHVKKLKSLGISMGKLEPAWA-EDIKKPKK 335
Query: 340 RTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRF 399
RT QD++ KF ++DDVIA+GD+FYA+VE +WVMQPGGP+ H+C+TLIEPSRLIM+TAQRF
Sbjct: 336 RTAQDVQAKFSSNDDVIAIGDVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRF 395
Query: 400 VQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAA 459
VQTFLG +F ALHFRRHGFLKFCNAK+PSCF+PIPQAADCI+R+ ERA PVIYLSTDAA
Sbjct: 396 VQTFLGKSFTALHFRRHGFLKFCNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAA 455
Query: 460 ESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSN 519
ESET LLQSLVVLNGK + L+KRP RNSAEKWD+LLYRH L+ DSQVEAMLDKTICAM++
Sbjct: 456 ESETGLLQSLVVLNGKLVPLIKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMAS 515
Query: 520 VFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
VFIGA GSTFTEDI+RLR+ WGS S CDEYLCQGE+PNFIA++E
Sbjct: 516 VFIGAPGSTFTEDILRLRRGWGSASHCDEYLCQGEQPNFIADNE 559
>gi|15240670|ref|NP_199853.1| peptide-O-fucosyltransferase [Arabidopsis thaliana]
gi|9758924|dbj|BAB09461.1| unnamed protein product [Arabidopsis thaliana]
gi|133778858|gb|ABO38769.1| At5g50420 [Arabidopsis thaliana]
gi|332008558|gb|AED95941.1| peptide-O-fucosyltransferase [Arabidopsis thaliana]
Length = 566
Score = 736 bits (1899), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/574 (64%), Positives = 455/574 (79%), Gaps = 19/574 (3%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR 60
MER+SSDD++D + LI QNDT+ HR + +N + S F IDD + R +
Sbjct: 1 MERNSSDDEEDHQHLIPQNDTRI-RHREDSVSSNATTIGGNQRSAFQIDDILHRVQHRGK 59
Query: 61 FTFDFKKLNNKRYLFA-LSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRA 119
+ NKRY+ +SL + I LL+ + R LF+ N+ +F+ D L++R++ESELRA
Sbjct: 60 ISL------NKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPLSNRVKESELRA 113
Query: 120 LSLLKQQQSHLLSLWNQSFVNNSYGNNTN----NPFFQEAKSVLLNQISLNRQIEQILLS 175
L LL+QQQ LLSLWN + VN S + N + F++ KS + QISLN++I+++LLS
Sbjct: 114 LYLLRQQQLALLSLWNGTLVNPSLNQSENALGSSVLFEDVKSAVSKQISLNKEIQEVLLS 173
Query: 176 PHKVSNFTPNDAV----WGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLI 231
PH+ SN++ V + CRK+D + +++TVEWKP+SDKFLFAICLSGQMSNHLI
Sbjct: 174 PHRSSNYSGGTDVDSVNFSYNRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQMSNHLI 233
Query: 232 CLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFME-MEKNHA 290
CLEKHMF AALL+RVLVIPSSKFDYQY RV+DIE IN CLGR VVV+F+ F E +KNH
Sbjct: 234 CLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVVAFDQFKEKAKKNHF 293
Query: 291 HIDRFLCYFGLPQPCFVDDEHIKKLKQLGISM-GKTETVWKNEDTRKPSKRTVQDIEGKF 349
IDRF+CYF PQ C+VD+EHIKKLK LGIS+ GK E W +ED +KPSKRTVQD++ KF
Sbjct: 294 RIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPW-SEDIKKPSKRTVQDVQMKF 352
Query: 350 KTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFI 409
K+DDDVIA+GD+FYAD+E+DWVMQPGGPINH+CKTLIEPS+LI++TAQRF+QTFLG NFI
Sbjct: 353 KSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFLGKNFI 412
Query: 410 ALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSL 469
ALHFRRHGFLKFCNAK PSCFYPIPQAA+CI R+ ER+ VIYLSTDAAESETSLLQSL
Sbjct: 413 ALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETSLLQSL 472
Query: 470 VVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTF 529
VV++GK + LVKRPPRNSAEKWD+LLYRH +EDDSQV+AMLDKTICAMS+VFIGASGSTF
Sbjct: 473 VVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGASGSTF 532
Query: 530 TEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
TEDI+RLRKDWG++S CDEYLC+GEEPNFIAEDE
Sbjct: 533 TEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566
>gi|21595347|gb|AAM66093.1| unknown [Arabidopsis thaliana]
Length = 566
Score = 735 bits (1897), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/574 (64%), Positives = 455/574 (79%), Gaps = 19/574 (3%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR 60
MER+SSDD++D + LI QNDT+ HR + +N + S F IDD + R +
Sbjct: 1 MERNSSDDEEDHQHLIPQNDTRI-RHREDSVSSNATTIGGNQRSAFQIDDILHRVQHRGK 59
Query: 61 FTFDFKKLNNKRYLFA-LSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRA 119
+ NKRY+ +SL + I LL+ + R LF+ N+ +F+ D L++R++ESELRA
Sbjct: 60 ISL------NKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPLSNRVKESELRA 113
Query: 120 LSLLKQQQSHLLSLWNQSFVNNSYGNNTN----NPFFQEAKSVLLNQISLNRQIEQILLS 175
L LL+QQQ LLSLWN + VN S + N + F++ KS + QISLN++I+++LLS
Sbjct: 114 LYLLRQQQLALLSLWNGTLVNPSLNQSENALGSSVLFEDVKSAVSKQISLNKEIQEVLLS 173
Query: 176 PHKVSNFTPNDAV----WGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLI 231
PH+ SN++ V + CRK+D + +++TVEWKP+SDKFLFAICLSGQMSNHL+
Sbjct: 174 PHRSSNYSGGTDVDSVNFSYNRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQMSNHLL 233
Query: 232 CLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFME-MEKNHA 290
CLEKHMF AALL+RVLVIPSSKFDYQY RV+DIE IN CLGR VVV+F+ F E +KNH
Sbjct: 234 CLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVVAFDQFKEKAKKNHF 293
Query: 291 HIDRFLCYFGLPQPCFVDDEHIKKLKQLGISM-GKTETVWKNEDTRKPSKRTVQDIEGKF 349
IDRF+CYF PQ C+VD+EHIKKLK LGIS+ GK E W +ED +KPSKRTVQD++ KF
Sbjct: 294 RIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPW-SEDIKKPSKRTVQDVQMKF 352
Query: 350 KTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFI 409
K+DDDVIA+GD+FYAD+E+DWVMQPGGPINH+CKTLIEPS+LI++TAQRF+QTFLG NFI
Sbjct: 353 KSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFLGKNFI 412
Query: 410 ALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSL 469
ALHFRRHGFLKFCNAK PSCFYPIPQAA+CI R+ ER+ VIYLSTDAAESETSLLQSL
Sbjct: 413 ALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETSLLQSL 472
Query: 470 VVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTF 529
VV++GK + LVKRPPRNSAEKWD+LLYRH +EDDSQV+AMLDKTICAMS+VFIGASGSTF
Sbjct: 473 VVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGASGSTF 532
Query: 530 TEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
TEDI+RLRKDWG++S CDEYLC+GEEPNFIAEDE
Sbjct: 533 TEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566
>gi|224071029|ref|XP_002303337.1| predicted protein [Populus trichocarpa]
gi|222840769|gb|EEE78316.1| predicted protein [Populus trichocarpa]
Length = 527
Score = 735 bits (1897), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/570 (63%), Positives = 437/570 (76%), Gaps = 50/570 (8%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR 60
MERDSSD++DDRE LI QND KH N R+S F A I
Sbjct: 1 MERDSSDEEDDREHLIEQNDRKH--------------HQNGRYSLF-------AAAI--- 36
Query: 61 FTFDFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRAL 120
+ LPL I+ L FS ++R+LFS + DSL+ RMRESELRAL
Sbjct: 37 ----------------IFLPLFILFLSFSTDIRNLFSTHLK--VGDSLSIRMRESELRAL 78
Query: 121 SLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVS 180
LLK+QQ L SLWN + + + N+ F++ KS LL QISLN++I+Q+LL+PH+
Sbjct: 79 YLLKKQQLSLFSLWNSTGNSTLLEKDLNSVSFEDLKSALLKQISLNKEIQQVLLAPHESG 138
Query: 181 NFTP-------NDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICL 233
N + ++A ++ C K+D +++T+EWKPK +KFLFA+CLSGQMSNHLICL
Sbjct: 139 NVSSSSSDLDFSNAGGFVQRCEKVDQRFADRKTIEWKPKPNKFLFALCLSGQMSNHLICL 198
Query: 234 EKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHID 293
EKHMF AALLNRVLVIPSS+FDYQY+RVLDIEH+NDCLGRKVVV+FE F+E+ KN HID
Sbjct: 199 EKHMFFAALLNRVLVIPSSRFDYQYNRVLDIEHVNDCLGRKVVVTFEEFVEIMKNKPHID 258
Query: 294 RFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDIEGKFKTDD 353
RF CYF P PC+VD+EH+KKLK LG+SMGK E+ WK ED +KPSK TV+D+EGKF +DD
Sbjct: 259 RFFCYFSDPTPCYVDEEHVKKLKGLGVSMGKLESPWK-EDIKKPSKLTVKDVEGKFVSDD 317
Query: 354 DVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHF 413
+VIAVGD+F+ADVE +W+MQPGGPI H+CKTLIEP+R+IM+TAQRF+QTFLGSNFIALHF
Sbjct: 318 NVIAVGDVFFADVEEEWIMQPGGPIAHKCKTLIEPTRIIMLTAQRFIQTFLGSNFIALHF 377
Query: 414 RRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLN 473
RRHGFLKFCNAKKPSCFYP+PQAADCI R+ ERA APV+YLSTDAAESET LLQSLVV+N
Sbjct: 378 RRHGFLKFCNAKKPSCFYPVPQAADCIARVVERANAPVVYLSTDAAESETGLLQSLVVVN 437
Query: 474 GKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDI 533
G+T+ LV RP RN+AEKWD+LLYRH L++D+QVEAMLDKTICAMS+VFIGASGSTFTEDI
Sbjct: 438 GRTVPLVTRPSRNAAEKWDALLYRHGLQEDAQVEAMLDKTICAMSSVFIGASGSTFTEDI 497
Query: 534 MRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
RLRK W S S CDEYLCQGE PN+IAE+E
Sbjct: 498 FRLRKGWESASSCDEYLCQGELPNYIAENE 527
>gi|297850124|ref|XP_002892943.1| hypothetical protein ARALYDRAFT_889130 [Arabidopsis lyrata subsp.
lyrata]
gi|297338785|gb|EFH69202.1| hypothetical protein ARALYDRAFT_889130 [Arabidopsis lyrata subsp.
lyrata]
Length = 583
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/588 (61%), Positives = 453/588 (77%), Gaps = 37/588 (6%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEH--------NRRH-------ST 45
M R+SSDD++D LI QNDT+ + L EDE H NR + S
Sbjct: 1 MVRNSSDDEEDHRNLIPQNDTRDNDLDL----RREDELHSVTTARAINRANGGGRSPRSA 56
Query: 46 FHIDDFPNAPPIRRRFTFDFKKLNNKRYLFAL-SLPLLIILLYFSVNLRSLFSGNYVNFR 104
F ID+ + R + + NKRY+ A+ SL L + L+ + R FS + +F+
Sbjct: 57 FQIDEIVSRARNRWKISV------NKRYVVAVVSLTLFVGFLFLFTDTRRFFSVDLSSFK 110
Query: 105 FDSLADRMRESELRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQIS 164
D ++ R++ESEL+AL+LL+QQQ L+SL N++ N+S ++ K+ LL QIS
Sbjct: 111 LDPMSSRVKESELQALNLLRQQQLALVSLLNRAIFNSSNAIGSS-VLIDNVKAALLKQIS 169
Query: 165 LNRQIEQILLSPHKVSNFT----PNDAVWGL---ESCRKIDSIIPNKRTVEWKPKSDKFL 217
+N++IE++LLSPHK N++ +D++ G + C+K+D + ++T+EWKP+ +KFL
Sbjct: 170 VNKEIEEVLLSPHKTGNYSVTGSGSDSITGSYYDDRCKKVDQKLLERKTIEWKPRPEKFL 229
Query: 218 FAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVV 277
FAICLSGQMSNHLICLEKHMF AALL+RVLVIPSSKFDYQY RV+DIE IN CLGR VV+
Sbjct: 230 FAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRTVVI 289
Query: 278 SFENFMEMEK-NHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMG-KTETVWKNEDTR 335
SF+ F E++K N+AHIDRF+CYF PQPC+VD++HIKKLK LG+S+G K E W ED +
Sbjct: 290 SFDQFKEIDKKNNAHIDRFICYFSSPQPCYVDEDHIKKLKGLGVSIGGKLEAPWI-EDIK 348
Query: 336 KPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVT 395
KP+KRT +++ KFK+DD VIA+GD+FYAD+E+D VMQPGGPINH+CKTLIEPSRLI+VT
Sbjct: 349 KPTKRTSKEVVEKFKSDDGVIAIGDVFYADMEQDLVMQPGGPINHKCKTLIEPSRLILVT 408
Query: 396 AQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLS 455
AQRF+QTFLG NFI+LH RRHGFLKFCNAK PSCFYPIPQAADCI+R+ ERA APVIYLS
Sbjct: 409 AQRFIQTFLGKNFISLHLRRHGFLKFCNAKSPSCFYPIPQAADCISRMVERANAPVIYLS 468
Query: 456 TDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTIC 515
TDAAESET LLQSLVV+NGK + LVKRPPRNSAEKWDSLLYRH +EDDSQV+AMLDKTIC
Sbjct: 469 TDAAESETGLLQSLVVVNGKVVPLVKRPPRNSAEKWDSLLYRHGIEDDSQVDAMLDKTIC 528
Query: 516 AMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
AMS+VFIGASGSTFTEDI+RLRKDWG++S+CDEYLC+GEEPNFIAE+E
Sbjct: 529 AMSSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAENE 576
>gi|449453409|ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208722 [Cucumis sativus]
gi|449517914|ref|XP_004165989.1| PREDICTED: uncharacterized protein LOC101230373 [Cucumis sativus]
Length = 573
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/541 (65%), Positives = 427/541 (78%), Gaps = 22/541 (4%)
Query: 44 STFHIDDFPNAPPIRRRFTFDFKKLN-NKRY--LFALSLPLLIILLYFSVNLRSLFSGNY 100
+TF IDD P+ P RF F K +KRY L A +LPL I++L+FSV++ SLFS
Sbjct: 34 TTFDIDDDPHFRPPIPRFPFSIPKFAFDKRYYYLLAAALPLCILVLFFSVDITSLFSTTL 93
Query: 101 VNF--RFDSLADRMRESELRALSLLKQQQSHLLSLWNQSFV-------NNSYGNN--TNN 149
+ DSL DRMRESEL AL LL+QQQ LWN S N++ NN +N+
Sbjct: 94 SSTLKTSDSLTDRMRESELTALYLLRQQQLGFFHLWNHSLFLQSNSSFNSTPSNNLSSNS 153
Query: 150 PFFQEAKSVLLNQISLNRQIEQILLSPHKVSNFTP--NDAV----WGLESCRKIDSIIPN 203
+ KS LL QI+LN++I+ +LLSPH+ N + DA+ + L+ CRK+D + +
Sbjct: 154 ALTEYIKSALLKQITLNKEIQNVLLSPHRSGNLSEEVGDALPMDTFALDRCRKMDQKLSD 213
Query: 204 KRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLD 263
+RT+EWKPKS+KFLFAIC SGQMSNHLICLEKHMF AA+LNRVLVIPS K DYQ+SRV+D
Sbjct: 214 RRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMFFAAILNRVLVIPSHKVDYQFSRVID 273
Query: 264 IEHINDCLGRKVVVSFENFMEMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMG 323
I+ +N CLGRKVV+SFE F E++K+H HIDRF+CYF P PC+VDDEHI KLK LGISMG
Sbjct: 274 IDRMNMCLGRKVVISFEEFSEIKKHHLHIDRFICYFSKPNPCYVDDEHISKLKNLGISMG 333
Query: 324 KTETVWKNEDTRKPSKRTVQDIEGKFKTD-DDVIAVGDLFYADVERDWVMQPGGPINHRC 382
K E+ W NEDT+ P+++TV D+E KF ++ DDVIAVGD+F+A+VE++WV QPGGPI H+C
Sbjct: 334 KLESAW-NEDTKHPNRKTVSDVESKFSSNNDDVIAVGDIFFANVEQEWVNQPGGPIAHKC 392
Query: 383 KTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITR 442
+TLIEPS LI +TAQRF+QTFLG N+IALHFRRHGFLKFCNAK+PSCFYPIPQAADCI R
Sbjct: 393 QTLIEPSHLIKLTAQRFIQTFLGKNYIALHFRRHGFLKFCNAKQPSCFYPIPQAADCIIR 452
Query: 443 LAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLED 502
+ ERA PVIYLSTDAAESE LLQSL+VLNGK I LVKRPPRNSAEKWD+LLYRH LE+
Sbjct: 453 MVERANVPVIYLSTDAAESEHGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGLEE 512
Query: 503 DSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAED 562
DSQVEAMLDKTICAMS+ FIGA GSTFTEDI+RLRKDWG+ S+CDEYLCQGEEPNFI+E+
Sbjct: 513 DSQVEAMLDKTICAMSSTFIGAPGSTFTEDILRLRKDWGTASMCDEYLCQGEEPNFISEN 572
Query: 563 E 563
E
Sbjct: 573 E 573
>gi|30685232|ref|NP_173170.2| peptide-O-fucosyltransferase [Arabidopsis thaliana]
gi|27754290|gb|AAO22598.1| unknown protein [Arabidopsis thaliana]
gi|332191445|gb|AEE29566.1| peptide-O-fucosyltransferase [Arabidopsis thaliana]
Length = 564
Score = 715 bits (1846), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/576 (61%), Positives = 449/576 (77%), Gaps = 25/576 (4%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRL---PTSDNNEDEEHNRRHSTFHIDDFPNAPPI 57
M R+SSD+++D LI QNDT+ + L + N + S ID+ I
Sbjct: 1 MVRNSSDEEEDHRNLIPQNDTRDNDLNLRPDARTVNMANGGGRSPRSALQIDE------I 54
Query: 58 RRRFTFDFKKLNNKRYLFA-LSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESE 116
R +K NKRY+ A +SL L + LL+ + R+ FS +F+ D ++ R++ESE
Sbjct: 55 LSRARNRWKISVNKRYVVAAVSLTLFVGLLFLFTDTRTFFS----SFKLDPMSSRVKESE 110
Query: 117 LRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSP 176
L+AL+LL+QQQ L+SL N++ N+S +++ K+ LL QIS+N++IE++LLSP
Sbjct: 111 LQALNLLRQQQLALVSLLNRTNFNSSNAISSS-VVIDNVKAALLKQISVNKEIEEVLLSP 169
Query: 177 HKVSNFT----PNDAVWG---LESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNH 229
H+ N++ +D+ G + CRK+D + +++T+EWKP+ DKFLFAICLSGQMSNH
Sbjct: 170 HRTGNYSITASGSDSFTGSYNADICRKVDQKLLDRKTIEWKPRPDKFLFAICLSGQMSNH 229
Query: 230 LICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEK-N 288
LICLEKHMF AALL+RVLVIPSSKFDYQY +V+DIE IN CLGR VV+SF+ F E++K N
Sbjct: 230 LICLEKHMFFAALLDRVLVIPSSKFDYQYDKVIDIERINTCLGRTVVISFDQFKEIDKKN 289
Query: 289 HAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMG-KTETVWKNEDTRKPSKRTVQDIEG 347
+AHIDRF+CY PQPC+VD++HIKKLK LG+S+G K E W +ED +KP+KRT Q++
Sbjct: 290 NAHIDRFICYVSSPQPCYVDEDHIKKLKGLGVSIGGKLEAPW-SEDIKKPTKRTSQEVVE 348
Query: 348 KFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSN 407
KFK+DD VIA+GD+FYAD+E+D VMQPGGPINH+CKTLIEPSRLI+VTAQRF+QTFLG N
Sbjct: 349 KFKSDDGVIAIGDVFYADMEQDLVMQPGGPINHKCKTLIEPSRLILVTAQRFIQTFLGKN 408
Query: 408 FIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQ 467
FI+LH RRHGFLKFCNAK PSCFYPIPQAADCI+R+ ERA APVIYLSTDAAESET LLQ
Sbjct: 409 FISLHLRRHGFLKFCNAKSPSCFYPIPQAADCISRMVERANAPVIYLSTDAAESETGLLQ 468
Query: 468 SLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGS 527
SLVV++GK + LVKRPP+NSAEKWDSLLYRH +EDDSQV AMLDKTICAMS+VFIGASGS
Sbjct: 469 SLVVVDGKVVPLVKRPPQNSAEKWDSLLYRHGIEDDSQVYAMLDKTICAMSSVFIGASGS 528
Query: 528 TFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
TFTEDI+RLRKDWG++S+CDEYLC+GEEPNFIAE+E
Sbjct: 529 TFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAENE 564
>gi|359482537|ref|XP_003632783.1| PREDICTED: uncharacterized protein LOC100254979 isoform 2 [Vitis
vinifera]
Length = 603
Score = 712 bits (1839), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/628 (57%), Positives = 441/628 (70%), Gaps = 90/628 (14%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR 60
MER+SSDD++DR+ LI +N+ K LP S F I+DF + R
Sbjct: 1 MERESSDDEEDRQNLIDENERK-----LP------------HRSGFQIEDFKSRLSAHR- 42
Query: 61 FTFDFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRAL 120
F+F NKRYLFA+ PL I+L+YF+ ++R+LF+ + + DS DRMRESELRAL
Sbjct: 43 FSF------NKRYLFAIFPPLFILLIYFTTDVRNLFTTSISIVKADSPTDRMRESELRAL 96
Query: 121 SLLKQQQSHLLSLWNQSFVNNSY---GNNTNNPF----------FQEAKSVLLNQISLNR 167
LL+QQQ L SLWN + +S N++N+ + KS LL QISLN+
Sbjct: 97 YLLRQQQLSLFSLWNHTAFADSAPIPSNSSNSTLDFSTRQVLLSSADFKSALLKQISLNK 156
Query: 168 QIEQILLSPHKVSNFTP--------NDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFA 219
+I+Q+LLS H N + N + C K++ + + T+EWKP+SDK+LFA
Sbjct: 157 EIQQVLLSSHPSGNLSELVDDNGDLNFGAYSFNRCPKVNQNMSQRPTIEWKPRSDKYLFA 216
Query: 220 ICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSF 279
ICLSGQMSNHLICLEKHMF AALLNR+LVIPSSKFDYQY+RVLDIEHIN+CLGRKVVV+F
Sbjct: 217 ICLSGQMSNHLICLEKHMFFAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTF 276
Query: 280 ENFMEMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSK 339
E F E +KNH HIDR +CYF LP PC+VDD+H+KKLK LGISMGK E W ED +KP K
Sbjct: 277 EEFTESKKNHLHIDRVICYFSLPLPCYVDDDHVKKLKSLGISMGKLEPAWA-EDIKKPKK 335
Query: 340 RTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRF 399
RT QD++ KF ++DDVIA+GD+FYA+VE +WVMQPGGP+ H+C+TLIEPSRLIM+TAQRF
Sbjct: 336 RTAQDVQAKFSSNDDVIAIGDVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRF 395
Query: 400 VQTFLGSNFIALHFRRHGFLKF-------------------------------------- 421
VQTFLG +F ALHFRRHGFLKF
Sbjct: 396 VQTFLGKSFTALHFRRHGFLKFWYAFRWLYFLALISEEHVFIILFGELLNRTLPSKFISN 455
Query: 422 ------CNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGK 475
NAK+PSCF+PIPQAADCI+R+ ERA PVIYLSTDAAESET LLQSLVVLNGK
Sbjct: 456 ILAYFVSNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAAESETGLLQSLVVLNGK 515
Query: 476 TIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMR 535
+ L+KRP RNSAEKWD+LLYRH L+ DSQVEAMLDKTICAM++VFIGA GSTFTEDI+R
Sbjct: 516 LVPLIKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMASVFIGAPGSTFTEDILR 575
Query: 536 LRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
LR+ WGS S CDEYLCQGE+PNFIA++E
Sbjct: 576 LRRGWGSASHCDEYLCQGEQPNFIADNE 603
>gi|5734743|gb|AAD50008.1|AC007651_3 Hypothetical Protein [Arabidopsis thaliana]
Length = 594
Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/606 (58%), Positives = 449/606 (74%), Gaps = 55/606 (9%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRL---PTSDNNEDEEHNRRHSTFHIDDFPNAPPI 57
M R+SSD+++D LI QNDT+ + L + N + S ID+ I
Sbjct: 1 MVRNSSDEEEDHRNLIPQNDTRDNDLNLRPDARTVNMANGGGRSPRSALQIDE------I 54
Query: 58 RRRFTFDFKKLNNKRYLFA-LSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESE 116
R +K NKRY+ A +SL L + LL+ + R+ FS +F+ D ++ R++ESE
Sbjct: 55 LSRARNRWKISVNKRYVVAAVSLTLFVGLLFLFTDTRTFFS----SFKLDPMSSRVKESE 110
Query: 117 LRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSP 176
L+AL+LL+QQQ L+SL N++ N+S +++ K+ LL QIS+N++IE++LLSP
Sbjct: 111 LQALNLLRQQQLALVSLLNRTNFNSSNAISSS-VVIDNVKAALLKQISVNKEIEEVLLSP 169
Query: 177 HKVSNFT----PNDAVWG---LESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNH 229
H+ N++ +D+ G + CRK+D + +++T+EWKP+ DKFLFAICLSGQMSNH
Sbjct: 170 HRTGNYSITASGSDSFTGSYNADICRKVDQKLLDRKTIEWKPRPDKFLFAICLSGQMSNH 229
Query: 230 LICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEK-N 288
LICLEKHMF AALL+RVLVIPSSKFDYQY +V+DIE IN CLGR VV+SF+ F E++K N
Sbjct: 230 LICLEKHMFFAALLDRVLVIPSSKFDYQYDKVIDIERINTCLGRTVVISFDQFKEIDKKN 289
Query: 289 HAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMG-KTETVWKNEDTRKPSKRTVQDIEG 347
+AHIDRF+CY PQPC+VD++HIKKLK LG+S+G K E W +ED +KP+KRT Q++
Sbjct: 290 NAHIDRFICYVSSPQPCYVDEDHIKKLKGLGVSIGGKLEAPW-SEDIKKPTKRTSQEVVE 348
Query: 348 KFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSN 407
KFK+DD VIA+GD+FYAD+E+D VMQPGGPINH+CKTLIEPSRLI+VTAQRF+QTFLG N
Sbjct: 349 KFKSDDGVIAIGDVFYADMEQDLVMQPGGPINHKCKTLIEPSRLILVTAQRFIQTFLGKN 408
Query: 408 FIALHFRRHGFLKF------------------------------CNAKKPSCFYPIPQAA 437
FI+LH RRHGFLKF CNAK PSCFYPIPQAA
Sbjct: 409 FISLHLRRHGFLKFWLQNLAIVAIMYLDSVAHTQLIGIPEHLSTCNAKSPSCFYPIPQAA 468
Query: 438 DCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYR 497
DCI+R+ ERA APVIYLSTDAAESET LLQSLVV++GK + LVKRPP+NSAEKWDSLLYR
Sbjct: 469 DCISRMVERANAPVIYLSTDAAESETGLLQSLVVVDGKVVPLVKRPPQNSAEKWDSLLYR 528
Query: 498 HHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPN 557
H +EDDSQV AMLDKTICAMS+VFIGASGSTFTEDI+RLRKDWG++S+CDEYLC+GEEPN
Sbjct: 529 HGIEDDSQVYAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPN 588
Query: 558 FIAEDE 563
FIAE+E
Sbjct: 589 FIAENE 594
>gi|357514371|ref|XP_003627474.1| GDP-fucose protein-O-fucosyltransferase [Medicago truncatula]
gi|355521496|gb|AET01950.1| GDP-fucose protein-O-fucosyltransferase [Medicago truncatula]
Length = 542
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/560 (62%), Positives = 422/560 (75%), Gaps = 38/560 (6%)
Query: 13 ETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFDFKKLNNKR 72
LIHQ TK R P STF+++D + RR F F+K K
Sbjct: 12 HNLIHQPSTK---PRTPP-------------STFNVEDLNSR---FRRVNFTFQK---KH 49
Query: 73 YLFALSLPLLIILLYFSV-NLRSLFSGNYVNFRFDSLADRMRESELRALSLLKQQQSHLL 131
Y+ A+ L LL++ L+FSV NLR F+ ++ + DS+ DRM+ESELRA+ LL+QQQ L
Sbjct: 50 YILAILLLLLLLFLFFSVPNLRRYFTTSFTS---DSITDRMKESELRAIYLLRQQQQRLS 106
Query: 132 SLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVSNF--------T 183
+++N S N N N ++ KS L QIS+N +I+QILL+PH+ N
Sbjct: 107 TVFNSSDQN----QNPNPKLIEDLKSALFKQISINNEIQQILLNPHRTGNVIDPEFNFGN 162
Query: 184 PNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAALL 243
N V + CR +D + ++T+EW PK DKFL AIC+SGQMSNHLICLEKHMF AA+L
Sbjct: 163 SNFNVGNYDRCRTVDQSLSKRKTIEWNPKKDKFLVAICVSGQMSNHLICLEKHMFFAAIL 222
Query: 244 NRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHIDRFLCYFGLPQ 303
NRVLVIPSSK DYQY RV+DI+HIN CLG+KVV+SF+ F ++K H HID+FLCYF LPQ
Sbjct: 223 NRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVMSFDEFSNVKKGHLHIDKFLCYFALPQ 282
Query: 304 PCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFY 363
PC++DDE +KKL LG+ M K + VW++EDTR P K+TVQD+ KF DDDV+A+GD+FY
Sbjct: 283 PCYLDDERLKKLDGLGLGMSKPKAVWEDEDTRNPKKKTVQDVMDKFSYDDDVMAIGDVFY 342
Query: 364 ADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCN 423
A VE +WVMQPGGPI H+CKTLIEP+RLI++TAQRF+QTFLG NFIALHFRRHGFLKFCN
Sbjct: 343 AKVEHEWVMQPGGPIAHQCKTLIEPNRLILLTAQRFIQTFLGRNFIALHFRRHGFLKFCN 402
Query: 424 AKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRP 483
AKKPSCF+PIPQAADCI R+ ERA AP+IYLSTDAAESET LLQSL+VLNGK++ LV RP
Sbjct: 403 AKKPSCFFPIPQAADCILRVIERADAPIIYLSTDAAESETGLLQSLIVLNGKSVPLVIRP 462
Query: 484 PRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGST 543
RNSAEKWD+LLYRHH+E DSQVEAMLDKTICAMS+VFIGA GSTFTEDI+RLRKDWGS
Sbjct: 463 ARNSAEKWDALLYRHHIEGDSQVEAMLDKTICAMSSVFIGAPGSTFTEDILRLRKDWGSA 522
Query: 544 SLCDEYLCQGEEPNFIAEDE 563
SLCDEYLC GEEPN +AE+E
Sbjct: 523 SLCDEYLCHGEEPNIVAENE 542
>gi|356522454|ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776069 [Glycine max]
Length = 543
Score = 677 bits (1746), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/570 (60%), Positives = 424/570 (74%), Gaps = 38/570 (6%)
Query: 4 DSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTF 63
DSS D++D +HR +N+ + FH++D + RR +F
Sbjct: 2 DSSSDEED-------------DHRNLVDNNHRKPPSPPPSAAFHVEDLSSR---FRRVSF 45
Query: 64 DFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRALSLL 123
+K +Y+ A+ L ++L + + LFS +F+FDS+ DRM+ESELRA++LL
Sbjct: 46 ALQK----KYIIAILALLFLLLFFSITDFHQLFS-TPSSFKFDSITDRMKESELRAINLL 100
Query: 124 KQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVS--- 180
QQQ LL+ WN + N+ N ++ KS L QISLNR+I+QILL+PH
Sbjct: 101 YQQQQSLLTAWNHTLRTNASDPN----LLEDLKSSLFKQISLNREIQQILLNPHSTGGNA 156
Query: 181 -------NFTPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICL 233
N T N V+ + CR +D + ++T+EW P+ KFL AIC+SGQMSNHLICL
Sbjct: 157 IEPELDLNATLNGVVY--DRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQMSNHLICL 214
Query: 234 EKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHID 293
EKHMF AALLNRVLVIPSSK DYQY RV+DI+HIN CLG+KVVVSFE F ++K H HID
Sbjct: 215 EKHMFFAALLNRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEEFSNLKKGHLHID 274
Query: 294 RFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDIEGKFKTDD 353
+FLCYF PQPC++DDE +KKL LG++M K E VW +EDTRKP K+TVQD+ GKF DD
Sbjct: 275 KFLCYFSHPQPCYLDDERLKKLGALGLTMSKPEAVW-DEDTRKPKKKTVQDVLGKFSFDD 333
Query: 354 DVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHF 413
DV+A+GD+FYA+VER+WVMQPGGPI H+CKTLIEP+RLI++TAQRF+QTFLG NFIALHF
Sbjct: 334 DVMAIGDVFYAEVEREWVMQPGGPIAHKCKTLIEPNRLILLTAQRFIQTFLGRNFIALHF 393
Query: 414 RRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLN 473
RRHGFLKFCNAKKPSCFYPIPQAADCI R+ E A AP+IYLSTDAAESET LLQSLVVLN
Sbjct: 394 RRHGFLKFCNAKKPSCFYPIPQAADCILRVVEMADAPIIYLSTDAAESETGLLQSLVVLN 453
Query: 474 GKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDI 533
G+ + LV RP RNSAEKWD+LLYRH+++ DSQVEAMLDKTICAMS+VFIGA GSTFTEDI
Sbjct: 454 GRPVPLVIRPARNSAEKWDALLYRHNMDGDSQVEAMLDKTICAMSSVFIGAPGSTFTEDI 513
Query: 534 MRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
+RLRKDWGS S+CDEYLCQGEEPN IAE+E
Sbjct: 514 LRLRKDWGSASMCDEYLCQGEEPNIIAENE 543
>gi|15220932|ref|NP_175780.1| peptide-O-fucosyltransferase [Arabidopsis thaliana]
gi|63003828|gb|AAY25443.1| At1g53770 [Arabidopsis thaliana]
gi|332194874|gb|AEE32995.1| peptide-O-fucosyltransferase [Arabidopsis thaliana]
Length = 563
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/568 (58%), Positives = 425/568 (74%), Gaps = 33/568 (5%)
Query: 7 DDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFDFK 66
D++ D + L+ ++D++ R+ DE +R TF ++ + +RR F +
Sbjct: 18 DEESDLQNLLEESDSQIDQFRI------SDEAAEQR-PTFDVESLRSR--LRRSFKLNLT 68
Query: 67 KLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVN--FRFDSLADRMRESELRALSLLK 124
K K+ +F LP++IIL+Y S + + FS N FR ++L R+ ES+L+AL LL+
Sbjct: 69 K---KQSIFIF-LPIVIILIYLSTDFSNYFSVKVPNSAFRSNTLTGRVHESDLQALYLLR 124
Query: 125 QQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVSNF-- 182
+Q+S L S+WN + N S + KS + QISLNRQI+ LLSPHK N
Sbjct: 125 KQESDLFSIWNHTVSNLST--------IDDVKSAVFRQISLNRQIQNALLSPHKTGNVDI 176
Query: 183 -TPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAA 241
+D + SCRK+D + ++T++WKP+ DKFLFAICLSGQMSNHLICLEKHMF AA
Sbjct: 177 GGSSDGYFAGGSCRKVDQKLNGRKTIQWKPRPDKFLFAICLSGQMSNHLICLEKHMFFAA 236
Query: 242 LLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKN-----HAHIDRFL 296
LL RVLVIPS +FDY YSR++DI+ IN CLGR VVVSFE F + +KN H HI+RF+
Sbjct: 237 LLKRVLVIPSHRFDYHYSRIIDIDRINTCLGRTVVVSFEEFWKKDKNRKKHHHVHINRFI 296
Query: 297 CYFGLPQPCFVDDEHIKKLKQLGISMG-KTETVWKNEDTRKPSKRTVQDIEGKFKTDDDV 355
CYF P+PC+VD EHI KLK LGI++G K +T W+ ED +PS +T +++E F++DDDV
Sbjct: 297 CYFSKPEPCYVDKEHITKLKALGITVGGKLDTPWE-EDIARPSNKTAEEVEANFRSDDDV 355
Query: 356 IAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRR 415
IA+GD+FYA+VER+WVMQPGGP+ H+C+TLIEP+RLI++TAQRF+QTFLG N+IALHFRR
Sbjct: 356 IAIGDVFYANVEREWVMQPGGPVAHKCRTLIEPNRLILLTAQRFIQTFLGKNYIALHFRR 415
Query: 416 HGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGK 475
HGFLKFCNAK PSCF+PIPQAA CITRL E+ +APV+YLSTDAAESET LLQSL++LNGK
Sbjct: 416 HGFLKFCNAKNPSCFFPIPQAASCITRLIEKVEAPVLYLSTDAAESETGLLQSLLILNGK 475
Query: 476 TIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMR 535
T+ LVKRP R+SAEKWD+LLYRH LE DSQVEAMLDKTICA+S+VFIGASGSTFTEDI+R
Sbjct: 476 TVPLVKRPARDSAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGASGSTFTEDILR 535
Query: 536 LRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
LRKDWG+ S CDEYLC E+PNFIA+ E
Sbjct: 536 LRKDWGTASECDEYLCANEQPNFIADHE 563
>gi|110737695|dbj|BAF00786.1| hypothetical protein [Arabidopsis thaliana]
Length = 563
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/568 (58%), Positives = 424/568 (74%), Gaps = 33/568 (5%)
Query: 7 DDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFDFK 66
D++ D + L+ ++D++ R+ DE +R TF ++ + +RR F +
Sbjct: 18 DEESDLQNLLEESDSQIDQFRI------SDEAAEQR-PTFDVESLRSR--LRRSFKLNLT 68
Query: 67 KLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVN--FRFDSLADRMRESELRALSLLK 124
K K+ +F LP++IIL+Y S + + FS N FR ++L R+ ES+L+AL LL+
Sbjct: 69 K---KQSIFIF-LPIVIILIYLSTDFSNYFSVKVPNSAFRSNTLTGRVHESDLQALYLLR 124
Query: 125 QQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVSNF-- 182
+Q+S L S+WN + N S + KS + QISLNRQI+ LLSPHK N
Sbjct: 125 KQESDLFSIWNHTVSNLST--------IDDVKSAVFRQISLNRQIQNALLSPHKTGNVDI 176
Query: 183 -TPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAA 241
+D + SCRK+D + ++T++WKP+ DKFLFAICLSGQMSNHLICLEKHMF AA
Sbjct: 177 GGSSDGYFAGGSCRKVDQKLNGRKTIQWKPRPDKFLFAICLSGQMSNHLICLEKHMFFAA 236
Query: 242 LLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKN-----HAHIDRFL 296
LL RVLVIPS +FDY YSR++DI+ IN LGR VVVSFE F + +KN H HI+RF+
Sbjct: 237 LLKRVLVIPSHRFDYHYSRIIDIDRINTYLGRTVVVSFEEFWKKDKNRKKHHHVHINRFI 296
Query: 297 CYFGLPQPCFVDDEHIKKLKQLGISMG-KTETVWKNEDTRKPSKRTVQDIEGKFKTDDDV 355
CYF P+PC+VD EHI KLK LGI++G K +T W+ ED +PS +T +++E F++DDDV
Sbjct: 297 CYFSKPEPCYVDKEHITKLKALGITVGGKLDTPWE-EDIARPSNKTAEEVEANFRSDDDV 355
Query: 356 IAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRR 415
IA+GD+FYA+VER+WVMQPGGP+ H+C+TLIEP+RLI++TAQRF+QTFLG N+IALHFRR
Sbjct: 356 IAIGDVFYANVEREWVMQPGGPVAHKCRTLIEPNRLILLTAQRFIQTFLGKNYIALHFRR 415
Query: 416 HGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGK 475
HGFLKFCNAK PSCF+PIPQAA CITRL E+ +APV+YLSTDAAESET LLQSL++LNGK
Sbjct: 416 HGFLKFCNAKNPSCFFPIPQAASCITRLIEKVEAPVLYLSTDAAESETGLLQSLLILNGK 475
Query: 476 TIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMR 535
T+ LVKRP R+SAEKWD+LLYRH LE DSQVEAMLDKTICA+S+VFIGASGSTFTEDI+R
Sbjct: 476 TVPLVKRPARDSAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGASGSTFTEDILR 535
Query: 536 LRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
LRKDWG+ S CDEYLC E+PNFIA+ E
Sbjct: 536 LRKDWGTASECDEYLCANEQPNFIADHE 563
>gi|356559323|ref|XP_003547949.1| PREDICTED: uncharacterized protein LOC548046 [Glycine max]
Length = 543
Score = 670 bits (1729), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/567 (61%), Positives = 422/567 (74%), Gaps = 35/567 (6%)
Query: 5 SSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFD 64
SSD++DD L+ NHR P S + FH++D P RR F
Sbjct: 4 SSDEEDDHRNLVD------NNHRKPPS--------SPAAVAFHVED---PSPRFRRANFT 46
Query: 65 FKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRALSLLK 124
+K +Y+FA+ L ++L + +L LFS +FRFDSL DRM+ESELRA++LL
Sbjct: 47 LQK----KYIFAILAILFLLLFFSITDLHKLFSTTS-SFRFDSLTDRMKESELRAINLLN 101
Query: 125 QQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVSN--- 181
QQQ LL+ WN + N+ N ++ KS + QISLNR+I+QILL+PH N
Sbjct: 102 QQQQALLTAWNHTLRTNASDPN----LLEDLKSSIFKQISLNREIQQILLNPHSTGNNAI 157
Query: 182 ---FTPNDAVWGL--ESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKH 236
F N + G+ + CR +D + ++T+EW P+ KFL AIC+SGQMSNHLICLEKH
Sbjct: 158 EPEFDLNATLNGVVYDRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQMSNHLICLEKH 217
Query: 237 MFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHIDRFL 296
+F AALLNRVLVIPSSK DYQY RV+DI+HIN CLG+KVVVSFE F ++K H HID+FL
Sbjct: 218 IFFAALLNRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEVFSNLKKGHLHIDKFL 277
Query: 297 CYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDIEGKFKTDDDVI 356
CYF PQPC++DDE +KKL LG++M K VW +EDTR P K+TVQD+ GKF DDDV+
Sbjct: 278 CYFSQPQPCYLDDERLKKLGALGLTMSKPVAVW-DEDTRNPKKKTVQDVLGKFSFDDDVM 336
Query: 357 AVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRH 416
A+GD+FYA+VER+WVMQPGGPI H+C TLIEP+RLI++TAQRF+QTFLG NF+ALHFRRH
Sbjct: 337 AIGDVFYAEVEREWVMQPGGPIAHKCTTLIEPNRLILLTAQRFIQTFLGRNFVALHFRRH 396
Query: 417 GFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKT 476
GFLKFCNAKKPSCFY I QAADCI R+ ERA AP+IYLSTDAAESET LLQSLVVLNG+
Sbjct: 397 GFLKFCNAKKPSCFYSITQAADCILRVVERADAPIIYLSTDAAESETGLLQSLVVLNGRP 456
Query: 477 IALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRL 536
+ LV RP RNSAEKWD+LLYRH ++ DSQVEAMLDK+ICAMS+VFIGA GSTFTEDI+RL
Sbjct: 457 VPLVIRPARNSAEKWDALLYRHRMDGDSQVEAMLDKSICAMSSVFIGAPGSTFTEDILRL 516
Query: 537 RKDWGSTSLCDEYLCQGEEPNFIAEDE 563
RKDWGS S+CDEYLCQGEEPN +AE+E
Sbjct: 517 RKDWGSASMCDEYLCQGEEPNIVAENE 543
>gi|297853160|ref|XP_002894461.1| hypothetical protein ARALYDRAFT_474511 [Arabidopsis lyrata subsp.
lyrata]
gi|297340303|gb|EFH70720.1| hypothetical protein ARALYDRAFT_474511 [Arabidopsis lyrata subsp.
lyrata]
Length = 563
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/568 (57%), Positives = 419/568 (73%), Gaps = 33/568 (5%)
Query: 7 DDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFDFK 66
D++ D L+ ++D++ R+ SD ++ TF ++ +R R + K
Sbjct: 18 DEESDLHNLLEESDSQIDQFRV--SDAAAEQR-----PTFDVES------LRSRLRRNLK 64
Query: 67 KLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVN--FRFDSLADRMRESELRALSLLK 124
K+ + LP++IIL+Y S + + FS + FR ++L R ES+L+AL LL+
Sbjct: 65 LNLTKKQSIVIFLPIVIILIYLSTDFSNYFSVRVPDSVFRSNTLTGRAHESDLQALYLLR 124
Query: 125 QQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVSNF-- 182
+Q+S L S+WN + N S ++ KS + QISLNRQI+ LLSPHK N
Sbjct: 125 KQESDLFSIWNHTLSNLST--------IEDVKSAVFRQISLNRQIQNALLSPHKTGNVEI 176
Query: 183 -TPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAA 241
+D + SCRK+D + ++T++WKP+ DKFLFAICLSGQMSNHLICLEKHMF AA
Sbjct: 177 GGSSDVYFAGGSCRKVDQKLNGRKTIQWKPRPDKFLFAICLSGQMSNHLICLEKHMFFAA 236
Query: 242 LLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKN-----HAHIDRFL 296
LL RVLVIPS +FDY YSR++DI+ IN CLGR VVVSFE F + +KN H HI+RF+
Sbjct: 237 LLKRVLVIPSHRFDYHYSRIIDIDRINTCLGRNVVVSFEEFWKKDKNRKKHHHVHINRFI 296
Query: 297 CYFGLPQPCFVDDEHIKKLKQLGISMG-KTETVWKNEDTRKPSKRTVQDIEGKFKTDDDV 355
CYF P+PC+VD EHI KLK LGI++G K +T W+ ED PS +T +++E F++DDD+
Sbjct: 297 CYFSKPEPCYVDKEHITKLKALGITIGGKLDTPWE-EDIASPSNKTAEEVETNFRSDDDL 355
Query: 356 IAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRR 415
IA+GD+FYA+VER+WVMQPGGP+ H+C+TLIEP+RLI++TAQRF+QTFLG N+IALHFRR
Sbjct: 356 IAIGDVFYANVEREWVMQPGGPVAHKCRTLIEPNRLILLTAQRFIQTFLGKNYIALHFRR 415
Query: 416 HGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGK 475
HGFLKFCNAK PSCFYPIPQAA+CITRL E+ +APVIYLSTDAAESET LLQSL++LNGK
Sbjct: 416 HGFLKFCNAKNPSCFYPIPQAANCITRLIEKVEAPVIYLSTDAAESETGLLQSLLILNGK 475
Query: 476 TIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMR 535
T+ LVKRP R+SAEKWD+LLYRH LE D QVEAMLDKTICAMS+VF+GASGSTFTEDI+R
Sbjct: 476 TVPLVKRPARDSAEKWDALLYRHGLEGDPQVEAMLDKTICAMSSVFVGASGSTFTEDILR 535
Query: 536 LRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
LRKDWG+ S CDEYLC E PNF+A+ E
Sbjct: 536 LRKDWGTASECDEYLCANEHPNFVADPE 563
>gi|449463571|ref|XP_004149507.1| PREDICTED: uncharacterized protein LOC101202868 [Cucumis sativus]
Length = 495
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/466 (67%), Positives = 378/466 (81%), Gaps = 14/466 (3%)
Query: 105 FDSLADRMRESELRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQIS 164
FD L RM ESEL AL LLKQQ+ LL++WN SF S+ N+T+ ++ K LL QIS
Sbjct: 37 FDLLRYRMTESELHALFLLKQQRLGLLTMWNHSFT--SHSNSTS----EDLKLSLLRQIS 90
Query: 165 LNRQIEQILLSPHKVSNFTPNDAVWGLE-------SCRKIDSIIPNKRTVEWKPKSDKFL 217
LN++I+ +LLS H+ N + GL+ CRK+D + N+RT+EW PKS+KFL
Sbjct: 91 LNKEIQNVLLSSHQTGNSSNEVYDDGLDLVGSDFDRCRKVDQEVLNRRTIEWNPKSNKFL 150
Query: 218 FAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVV 277
FAIC SGQMSNHLICLEKHMF AA+LNR++VIPSSK DYQYSRVLD++H+N+CLGRKVV+
Sbjct: 151 FAICASGQMSNHLICLEKHMFFAAILNRIVVIPSSKVDYQYSRVLDVDHVNNCLGRKVVI 210
Query: 278 SFENFMEMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKP 337
SFE F E++KN ID+F CYF P C++DDEH+K+L +G+S K E+ W +ED + P
Sbjct: 211 SFEEFSEIQKNQMKIDKFFCYFSKPDHCYLDDEHVKQLNNIGVSTAKLESPW-DEDIKNP 269
Query: 338 SKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQ 397
SKRTVQDIE KF +D DVIA+GD+FYADVE W +QPGGPI H+CKTLIEPS LI VTAQ
Sbjct: 270 SKRTVQDIESKFSSDADVIAIGDVFYADVENGWFVQPGGPIAHKCKTLIEPSHLIKVTAQ 329
Query: 398 RFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTD 457
RF+QTFLGS+FIA+HFRRHGFLKFCNAK+PSCFYPIPQAA+CI + ER+ +PVIYLSTD
Sbjct: 330 RFIQTFLGSDFIAIHFRRHGFLKFCNAKRPSCFYPIPQAANCIASVVERSHSPVIYLSTD 389
Query: 458 AAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAM 517
AA+SET LLQSLV++NGK I LVKRPPRNSAEKWD+LLYRH +EDDSQVEAMLDK ICAM
Sbjct: 390 AADSETGLLQSLVMVNGKPIPLVKRPPRNSAEKWDALLYRHEIEDDSQVEAMLDKAICAM 449
Query: 518 SNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
S VFIG SGSTFTEDI+RLRKDW S SLCDEYLCQG++PNFIAE+E
Sbjct: 450 SRVFIGTSGSTFTEDILRLRKDWRSASLCDEYLCQGQDPNFIAENE 495
>gi|449522504|ref|XP_004168266.1| PREDICTED: uncharacterized LOC101202868 [Cucumis sativus]
Length = 512
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/466 (67%), Positives = 378/466 (81%), Gaps = 14/466 (3%)
Query: 105 FDSLADRMRESELRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQIS 164
FD L RM ESEL AL LLKQQ+ LL++WN SF S+ N+T+ ++ K LL QIS
Sbjct: 54 FDLLRYRMTESELHALFLLKQQRLGLLTMWNHSFT--SHSNSTS----EDLKLSLLRQIS 107
Query: 165 LNRQIEQILLSPHKVSNFTPNDAVWGLE-------SCRKIDSIIPNKRTVEWKPKSDKFL 217
LN++I+ +LLS H+ N + GL+ CRK+D + N+RT+EW PKS+KFL
Sbjct: 108 LNKEIQNVLLSSHQTGNSSNEVYDDGLDLVGSDFDRCRKVDQEVLNRRTIEWNPKSNKFL 167
Query: 218 FAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVV 277
FAIC SGQMSNHLICLEKHMF AA+LNR++VIPSSK DYQYS+VLD++H+N+CLGRKVV+
Sbjct: 168 FAICASGQMSNHLICLEKHMFFAAILNRIVVIPSSKVDYQYSKVLDVDHVNNCLGRKVVI 227
Query: 278 SFENFMEMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKP 337
SFE F E++KN ID+F CYF P C++DDEH+K+L +G+S K E+ W +ED + P
Sbjct: 228 SFEEFSEIQKNQMKIDKFFCYFSKPDHCYLDDEHVKQLNNIGVSTAKLESPW-DEDIKNP 286
Query: 338 SKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQ 397
SKRTVQDIE KF +D DVIA+GD+FYADVE W +QPGGPI H+CKTLIEPS LI VTAQ
Sbjct: 287 SKRTVQDIESKFSSDADVIAIGDVFYADVENGWFVQPGGPIAHKCKTLIEPSHLIKVTAQ 346
Query: 398 RFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTD 457
RF+QTFLGS+FIA+HFRRHGFLKFCNAK+PSCFYPIPQAA+CI + ER+ +PVIYLSTD
Sbjct: 347 RFIQTFLGSDFIAIHFRRHGFLKFCNAKRPSCFYPIPQAANCIASVVERSHSPVIYLSTD 406
Query: 458 AAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAM 517
AA+SET LLQSLV++NGK I LVKRPPRNSAEKWD+LLYRH +EDDSQVEAMLDK ICAM
Sbjct: 407 AADSETGLLQSLVMVNGKPIPLVKRPPRNSAEKWDALLYRHEIEDDSQVEAMLDKAICAM 466
Query: 518 SNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
S VFIG SGSTFTEDI+RLRKDW S SLCDEYLCQG++PNFIAE+E
Sbjct: 467 SRVFIGTSGSTFTEDILRLRKDWRSASLCDEYLCQGQDPNFIAENE 512
>gi|334183283|ref|NP_001185215.1| peptide-O-fucosyltransferase [Arabidopsis thaliana]
gi|332194875|gb|AEE32996.1| peptide-O-fucosyltransferase [Arabidopsis thaliana]
Length = 601
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/606 (54%), Positives = 424/606 (69%), Gaps = 71/606 (11%)
Query: 7 DDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFDFK 66
D++ D + L+ ++D++ R+ DE +R TF ++ + +RR F +
Sbjct: 18 DEESDLQNLLEESDSQIDQFRI------SDEAAEQR-PTFDVESLRSR--LRRSFKLNLT 68
Query: 67 KLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVN--FRFDSLADRMRESELRALSLLK 124
K K+ +F LP++IIL+Y S + + FS N FR ++L R+ ES+L+AL LL+
Sbjct: 69 K---KQSIFIF-LPIVIILIYLSTDFSNYFSVKVPNSAFRSNTLTGRVHESDLQALYLLR 124
Query: 125 QQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVSNF-- 182
+Q+S L S+WN + N S + KS + QISLNRQI+ LLSPHK N
Sbjct: 125 KQESDLFSIWNHTVSNLST--------IDDVKSAVFRQISLNRQIQNALLSPHKTGNVDI 176
Query: 183 -TPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAA 241
+D + SCRK+D + ++T++WKP+ DKFLFAICLSGQMSNHLICLEKHMF AA
Sbjct: 177 GGSSDGYFAGGSCRKVDQKLNGRKTIQWKPRPDKFLFAICLSGQMSNHLICLEKHMFFAA 236
Query: 242 LLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKN-----HAHIDRFL 296
LL RVLVIPS +FDY YSR++DI+ IN CLGR VVVSFE F + +KN H HI+RF+
Sbjct: 237 LLKRVLVIPSHRFDYHYSRIIDIDRINTCLGRTVVVSFEEFWKKDKNRKKHHHVHINRFI 296
Query: 297 CYFGLPQPCFVDDEHIKKLKQLGISMG-KTETVWKNEDTRKPSKRTVQDIEGKFKTDDDV 355
CYF P+PC+VD EHI KLK LGI++G K +T W+ ED +PS +T +++E F++DDDV
Sbjct: 297 CYFSKPEPCYVDKEHITKLKALGITVGGKLDTPWE-EDIARPSNKTAEEVEANFRSDDDV 355
Query: 356 IAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRR 415
IA+GD+FYA+VER+WVMQPGGP+ H+C+TLIEP+RLI++TAQRF+QTFLG N+IALHFRR
Sbjct: 356 IAIGDVFYANVEREWVMQPGGPVAHKCRTLIEPNRLILLTAQRFIQTFLGKNYIALHFRR 415
Query: 416 HGFLKF--------------------------------------CNAKKPSCFYPIPQAA 437
HGFLKF NAK PSCF+PIPQAA
Sbjct: 416 HGFLKFWYDLFSTSLLHSTFDDNVVHFITKIKSGTLAKKFSWCASNAKNPSCFFPIPQAA 475
Query: 438 DCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYR 497
CITRL E+ +APV+YLSTDAAESET LLQSL++LNGKT+ LVKRP R+SAEKWD+LLYR
Sbjct: 476 SCITRLIEKVEAPVLYLSTDAAESETGLLQSLLILNGKTVPLVKRPARDSAEKWDALLYR 535
Query: 498 HHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPN 557
H LE DSQVEAMLDKTICA+S+VFIGASGSTFTEDI+RLRKDWG+ S CDEYLC E+PN
Sbjct: 536 HGLEGDSQVEAMLDKTICALSSVFIGASGSTFTEDILRLRKDWGTASECDEYLCANEQPN 595
Query: 558 FIAEDE 563
FIA+ E
Sbjct: 596 FIADHE 601
>gi|115472923|ref|NP_001060060.1| Os07g0572600 [Oryza sativa Japonica Group]
gi|34393544|dbj|BAC83142.1| putative GDP-fucose protein-O-fucosyltransferase 2 [Oryza sativa
Japonica Group]
gi|50508964|dbj|BAD31871.1| putative GDP-fucose protein-O-fucosyltransferase 2 [Oryza sativa
Japonica Group]
gi|113611596|dbj|BAF21974.1| Os07g0572600 [Oryza sativa Japonica Group]
gi|125558880|gb|EAZ04416.1| hypothetical protein OsI_26561 [Oryza sativa Indica Group]
gi|125600798|gb|EAZ40374.1| hypothetical protein OsJ_24820 [Oryza sativa Japonica Group]
Length = 557
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 326/580 (56%), Positives = 414/580 (71%), Gaps = 44/580 (7%)
Query: 1 MERDSS--DDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNA-PPI 57
M+RD S D+DDD ETL+ QN K P S S+FH+ A PP
Sbjct: 1 MDRDPSVSDEDDDLETLVPQNHVK------PPS------------SSFHVAGLRAAVPPS 42
Query: 58 RRRFTFDFKKLNNKRYLFA-LSLPLLIILLYFSV----NLR---SLFSGNYVNFRFDSLA 109
+ L ++RYL ++LP+L ++L+ S+ +LR S+ + D A
Sbjct: 43 AASLG---RVLWSRRYLLLFVALPVLFLVLFVSLGGASSLRLPASIRLPSAAAPAADPAA 99
Query: 110 DRMRESELRALSLLKQQQSHLLSLWNQSFV-NNSYGNNTNNPFFQEAKSVLLNQISLNRQ 168
RMR++ELRAL LL+ Q+S LL+L+N++ N + +N + + LL QI +NR+
Sbjct: 100 SRMRDAELRALYLLRSQRSGLLALFNRTAAPTNGSSSGSNAVSYADLHDALLGQIKINRE 159
Query: 169 IEQILLSPHKVSNFTPNDAVWGLE------SCRKIDSIIPNKRTVEWKPKSDKFLFAICL 222
I+ LLS H+ + N GL+ CR+ + + N+RT+EW PK D+FLFAIC+
Sbjct: 160 IQAALLSAHR-TGAAGNATEDGLDLDLPADGCRRRE-LPSNRRTIEWNPKKDRFLFAICV 217
Query: 223 SGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENF 282
SGQMSNHLICLEKHMF AALL R+LV+PS K DYQY RVLDI HINDC+GRKVV+++E F
Sbjct: 218 SGQMSNHLICLEKHMFFAALLGRILVVPSQKLDYQYDRVLDINHINDCIGRKVVITYEEF 277
Query: 283 MEMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVW-KNEDTRKPSKRT 341
E K ID+F+CY P PCF+D++H+KKLK LGIS+GK + W ++ ++P KR
Sbjct: 278 TEKRKK-VSIDQFICYAASP-PCFIDEDHVKKLKGLGISLGKIQAAWPEDAKLKEPKKRF 335
Query: 342 VQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQ 401
V+DI KF TD +V+A+GD+FYADVE +WVMQPGGP+ H+C TLI+PSRLIM+TAQRFVQ
Sbjct: 336 VEDIMPKFTTDAEVLAIGDMFYADVEEEWVMQPGGPLAHKCNTLIQPSRLIMLTAQRFVQ 395
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
TFLG N+IALHFRRHGFLKFCN KK SCF+PIPQAA+CI R+ E+A APVIYLSTDAAES
Sbjct: 396 TFLGGNYIALHFRRHGFLKFCNVKKESCFFPIPQAAECILRIVEKANAPVIYLSTDAAES 455
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
ETSLLQSLVV N + + LVKRP +++EKWD+LLYR+H+ D+QVEAMLDKTICA+SNVF
Sbjct: 456 ETSLLQSLVVFNDRQVPLVKRPEHHNSEKWDALLYRNHMGGDNQVEAMLDKTICALSNVF 515
Query: 522 IGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAE 561
IGASGSTFTEDI+RLR+ WGS S CDEYLCQGE PNFIAE
Sbjct: 516 IGASGSTFTEDILRLRRGWGSASHCDEYLCQGELPNFIAE 555
>gi|242046046|ref|XP_002460894.1| hypothetical protein SORBIDRAFT_02g037060 [Sorghum bicolor]
gi|241924271|gb|EER97415.1| hypothetical protein SORBIDRAFT_02g037060 [Sorghum bicolor]
Length = 562
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 314/570 (55%), Positives = 399/570 (70%), Gaps = 31/570 (5%)
Query: 6 SDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFDF 65
SDD+DD ETL+ QN K + + H P++ R
Sbjct: 8 SDDEDDLETLVPQNHAKPPSPSARSRSPPSSFSVAALHPAL-----PSSAASLGRI---- 58
Query: 66 KKLNNKRYLFA-LSLPLLIILLYFS------VNLRSLFSGNYVNFRFDSLADRMRESELR 118
L ++RYL +SLPLL ++L+ S + L + D A RMRE+EL
Sbjct: 59 --LWSRRYLVLFVSLPLLFLILFVSFGGASSIRLPASIRLPSAAPAADPAASRMREAELH 116
Query: 119 ALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHK 178
AL LL+ Q+S LLSL+N++ + G+ + + +S L +QI +NR+I+ LLS H+
Sbjct: 117 ALYLLRSQRSGLLSLFNRTAAPTN-GSASAPISLSDLQSALESQIKINREIQAALLSSHR 175
Query: 179 VSNFTPNDAVWGLE------SCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLIC 232
T N GL+ CRK + + N+RT+EW PK D+FL AIC+SGQMSNHLIC
Sbjct: 176 SG--TGNATEDGLDLDLPFSGCRKRE-LPANRRTIEWNPKKDRFLLAICISGQMSNHLIC 232
Query: 233 LEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHI 292
LEKHMF+AALL R LV+PS K DYQY RVLDI HINDC+GRKVV+++E F+E K I
Sbjct: 233 LEKHMFMAALLGRTLVVPSQKVDYQYDRVLDINHINDCIGRKVVITYEEFVEKRKKVV-I 291
Query: 293 DRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVW-KNEDTRKPSKRTVQDIEGKFKT 351
D+F+CY P PCF+D++HIKKLK LGIS+GK E W ++ ++P KR V DI KF T
Sbjct: 292 DQFICYAASP-PCFLDEDHIKKLKGLGISLGKIEAAWPEDAKLKEPKKRFVGDITPKFST 350
Query: 352 DDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIAL 411
+ +V+A+GD+FYAD+E +W+ QPGGP+ H+CKTLI+PSRLIM+TAQRFVQTFLG N+IAL
Sbjct: 351 EAEVLAIGDMFYADMEDEWLNQPGGPLAHKCKTLIQPSRLIMLTAQRFVQTFLGGNYIAL 410
Query: 412 HFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVV 471
HFRRHGFLKFCN KK SCF+PIPQAA+CI R+ E+A APVIYLSTDAAESET+LLQSLVV
Sbjct: 411 HFRRHGFLKFCNVKKESCFFPIPQAAECILRIVEKANAPVIYLSTDAAESETNLLQSLVV 470
Query: 472 LNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTE 531
N + + LVKRP +S+EKWD+LL+R+H+ D QV+AMLDKTICA+SNVFIG+SGSTFTE
Sbjct: 471 FNDRQVPLVKRPEHDSSEKWDALLHRNHIGGDVQVDAMLDKTICALSNVFIGSSGSTFTE 530
Query: 532 DIMRLRKDWGSTSLCDEYLCQGEEPNFIAE 561
DI RLR+ WGS S CDEYLCQGE PN+IAE
Sbjct: 531 DIFRLRRGWGSASHCDEYLCQGELPNYIAE 560
>gi|357122313|ref|XP_003562860.1| PREDICTED: uncharacterized protein LOC100825796 [Brachypodium
distachyon]
Length = 560
Score = 595 bits (1533), Expect = e-167, Method: Compositional matrix adjust.
Identities = 312/576 (54%), Positives = 402/576 (69%), Gaps = 33/576 (5%)
Query: 2 ERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRF 61
E SD+DDD ETL+ QN TK + + + P +
Sbjct: 4 EPSVSDEDDDLETLVPQNHTKPPSPTSRSRSSFSVSALR--------------PALPSSA 49
Query: 62 TFDFKKLNNKRYLFA-LSLPLLIILLYFSVNLRSLFSGNY----VNFRFDSLADRMRESE 116
L ++RYL +SLP+L I+L+FS+ S + + D A RMRE+E
Sbjct: 50 ASLGHALWSRRYLLLFVSLPVLFIVLFFSLGGASSLPASIRLPSADPAADPAASRMREAE 109
Query: 117 LRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSP 176
L AL LL+ Q+S LLSL+N++ + + N + + ++ LL+QI +NR+I+ LLS
Sbjct: 110 LHALYLLRSQRSGLLSLFNRT-ASPAPANASTPISLSDLQAALLSQIKINREIQAALLST 168
Query: 177 H--KVSNFT---PNDAVWGLESCRKIDSIIP-NKRTVEWKPKSDKFLFAICLSGQMSNHL 230
H N T P D CR+ + +P N+RT++W PK D+FLFAIC+SGQMSNHL
Sbjct: 169 HHSGAGNATEEDPLDLDLPGPGCRRKE--LPYNRRTIQWNPKKDRFLFAICVSGQMSNHL 226
Query: 231 ICLEKHMFLAALLNRVLVIPSS--KFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKN 288
ICLEKH+F AALL R+LV+PSS K DYQY RVLDI +INDC+GRKVV+++ F+E K
Sbjct: 227 ICLEKHIFFAALLGRILVLPSSIDKVDYQYDRVLDISNINDCIGRKVVITYREFLEKRKK 286
Query: 289 HAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVW-KNEDTRKPSKRTVQDIEG 347
ID+F+CY P PCF+D++++KKLK LGISMGK E W ++ ++P KR V DI
Sbjct: 287 -VSIDQFICYAASP-PCFMDEDYVKKLKGLGISMGKIEAAWPEDAKLKEPKKRFVGDITP 344
Query: 348 KFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSN 407
KF T+ +V+A+GD+FYAD E +W+MQPGGP+ H+CKTLI+PSRLIM+TAQRFVQTFLG N
Sbjct: 345 KFTTEAEVLAIGDMFYADAEEEWLMQPGGPLAHKCKTLIQPSRLIMLTAQRFVQTFLGGN 404
Query: 408 FIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQ 467
+IALHFRRHGFLKFCN KK SCF+PIPQAA+CI R+ E+A APVIYLSTDA ESET+LLQ
Sbjct: 405 YIALHFRRHGFLKFCNVKKESCFFPIPQAAECILRIVEKANAPVIYLSTDALESETNLLQ 464
Query: 468 SLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGS 527
SLVV N + + LVKRP ++EKWD+LLYR+H+ D+QVEAMLDKTICA+SNVFIG+SGS
Sbjct: 465 SLVVFNDRQVPLVKRPVHQNSEKWDALLYRNHMGGDAQVEAMLDKTICALSNVFIGSSGS 524
Query: 528 TFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
TFTEDI RLR+ WGS S CDEYLCQG+ PN+IAE +
Sbjct: 525 TFTEDIFRLRRSWGSMSYCDEYLCQGQLPNYIAEQD 560
>gi|147853716|emb|CAN80219.1| hypothetical protein VITISV_027891 [Vitis vinifera]
Length = 337
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 264/338 (78%), Positives = 302/338 (89%), Gaps = 1/338 (0%)
Query: 226 MSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEM 285
MSNHLICLEKHMF AALLNR+LVIPSSKFDYQY+RVLDIEHIN+CLGRKVVV+FE F E
Sbjct: 1 MSNHLICLEKHMFFAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTFEEFTES 60
Query: 286 EKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDI 345
+KNH HIDR +CYF LP PC+VDD+H+KKLK LGISMGK E W ED +KP KRT QD+
Sbjct: 61 KKNHLHIDRVICYFSLPLPCYVDDDHVKKLKSLGISMGKLEPAWA-EDIKKPKKRTAQDV 119
Query: 346 EGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLG 405
+ KF ++DDVIA+GD+FYA+VE +WVMQPGGP+ H+C+TLIEPSRLIM+TAQRFVQTFLG
Sbjct: 120 QAKFSSNDDVIAIGDVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFVQTFLG 179
Query: 406 SNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSL 465
+F ALHFRRHGFLKFCNAK+PSCF+PIPQAADCI+R+ ERA PVIYLSTDAAESET L
Sbjct: 180 KSFTALHFRRHGFLKFCNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAAESETGL 239
Query: 466 LQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGAS 525
LQSLVVLNGK + L+KRP RNSAEKWD+LLYRH L+ DSQVEAMLDKTICAM++VFIGA
Sbjct: 240 LQSLVVLNGKLVPLIKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMASVFIGAP 299
Query: 526 GSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
GSTFTEDI+RLR+ WGS S CDEYLCQGE+PNFIA++E
Sbjct: 300 GSTFTEDILRLRRGWGSASHCDEYLCQGEQPNFIADNE 337
>gi|414590599|tpg|DAA41170.1| TPA: hypothetical protein ZEAMMB73_622134 [Zea mays]
Length = 453
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 279/455 (61%), Positives = 351/455 (77%), Gaps = 9/455 (1%)
Query: 112 MRESELRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQ 171
MRE+EL AL LL+ Q+S LLSL+N++ + G+ + +S L +QI +NR+I+
Sbjct: 1 MREAELHALYLLRSQRSGLLSLFNRTAAPTN-GSAPAPISLSDLQSALESQIKINREIQA 59
Query: 172 ILLSPHKVSNFTPNDAVWGLE----SCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMS 227
LLS H+ + + L+ CRK + + ++RT+EW PK D+FL IC+SGQMS
Sbjct: 60 ALLSSHRSGSGNATEDGLDLDLPFSGCRKRE-LPASRRTIEWNPKKDRFLLVICISGQMS 118
Query: 228 NHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEK 287
NHLICLEKHMF+AALL R LV+PS K DYQY RVLDI HINDC+GRKVV+++E F+E K
Sbjct: 119 NHLICLEKHMFMAALLGRTLVVPSQKADYQYDRVLDINHINDCIGRKVVITYEEFVEKRK 178
Query: 288 NHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVW-KNEDTRKPSKRTVQDIE 346
A ID+F+CY P PCF+D+EHIKKLK LGIS+ K E W ++ ++P KR+V DI
Sbjct: 179 KVA-IDQFICYAASP-PCFLDEEHIKKLKGLGISLSKIEAAWPEDAKLKEPKKRSVGDIA 236
Query: 347 GKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGS 406
KF T+ +V+A+GD+FYAD+E +W+ QPGGP+ H+CK LI+PSRLI++TAQRFVQTFLG
Sbjct: 237 PKFSTEAEVLAIGDMFYADMEDEWLNQPGGPLAHKCKALIQPSRLIVLTAQRFVQTFLGG 296
Query: 407 NFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLL 466
N+IALHFRRHGFLKFCN KK SCF+PIPQAA+CI R+ E+A APVIYLSTDAAESET+LL
Sbjct: 297 NYIALHFRRHGFLKFCNVKKESCFFPIPQAAECILRIVEKANAPVIYLSTDAAESETNLL 356
Query: 467 QSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASG 526
QSLVV N + + LVKRP +S+EKWD+LL R+ + D QV+AMLDKTICA+SNVFIG+SG
Sbjct: 357 QSLVVFNDRQVPLVKRPEHHSSEKWDALLNRNRMGGDGQVDAMLDKTICALSNVFIGSSG 416
Query: 527 STFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAE 561
STFTEDI RLR+ WGS S CDEYLCQGE PNFIAE
Sbjct: 417 STFTEDIFRLRRGWGSASHCDEYLCQGELPNFIAE 451
>gi|33457136|emb|CAE30293.1| GDP-fucose protein-O-fucosyltransferase 2 [Solanum tuberosum]
Length = 337
Score = 556 bits (1433), Expect = e-156, Method: Compositional matrix adjust.
Identities = 255/338 (75%), Positives = 292/338 (86%), Gaps = 1/338 (0%)
Query: 226 MSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEM 285
MSNHLICLEKHMF AALLNR+LVIPSSK DY++ RV+D++HIN CLGR+V+V+++ F E
Sbjct: 1 MSNHLICLEKHMFFAALLNRILVIPSSKVDYEFRRVIDVDHINKCLGREVIVTYDEFAER 60
Query: 286 EKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDI 345
K+H HID+FLCYF PQPCF+D+E +KKLK LGISM K E W NED + P KR VQDI
Sbjct: 61 RKSHLHIDKFLCYFSQPQPCFLDEERVKKLKSLGISMNKLEAAW-NEDVKNPKKRAVQDI 119
Query: 346 EGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLG 405
KF DDDV+A+GD F+ +VE+DWVMQ G PI H+ KTLIEPS LIM+TAQRFVQTFLG
Sbjct: 120 MAKFSADDDVLAIGDAFFPNVEKDWVMQHGSPIAHKYKTLIEPSHLIMLTAQRFVQTFLG 179
Query: 406 SNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSL 465
NFIALHFRRHGFLKFCNAKKPSCFYP+PQAADCI R+ ERA +PVIYLSTDAAESET L
Sbjct: 180 DNFIALHFRRHGFLKFCNAKKPSCFYPVPQAADCINRVLERANSPVIYLSTDAAESETGL 239
Query: 466 LQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGAS 525
LQSLVV NGKT+ LV+RP RNSAEKWD+LLYRH LE D QV+AMLDKTICAMS+VFIG+S
Sbjct: 240 LQSLVVFNGKTVPLVQRPARNSAEKWDALLYRHGLEGDPQVDAMLDKTICAMSSVFIGSS 299
Query: 526 GSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
GSTFT+DI+RLRKDWGS SLCDEYLCQGE PNF+A+DE
Sbjct: 300 GSTFTDDILRLRKDWGSASLCDEYLCQGELPNFVADDE 337
>gi|302774541|ref|XP_002970687.1| protein-O-fucosyltransferase-like protein [Selaginella
moellendorffii]
gi|300161398|gb|EFJ28013.1| protein-O-fucosyltransferase-like protein [Selaginella
moellendorffii]
Length = 464
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 219/449 (48%), Positives = 290/449 (64%), Gaps = 32/449 (7%)
Query: 118 RALSLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPH 177
R LL++Q+ L +WN + ++ L +QI LN QI+Q++
Sbjct: 45 RLADLLRRQEQGLFEIWNATRDSSQ-------------ADALRSQIELNHQIQQLVSGAR 91
Query: 178 KVSNFTPNDAVWGLESCRKID-SIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKH 236
+ P G+ C+ + S + + +W+P+ D+FL IC SGQ+SNHLICLEKH
Sbjct: 92 H--GYDP-----GMGGCQGQNFSSRRHGKLFQWQPREDRFLLVICTSGQISNHLICLEKH 144
Query: 237 MFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHIDRFL 296
MF AALL+R LV+P DY Y + LDI HI CLG K +++F F+ KN I R L
Sbjct: 145 MFFAALLDRTLVLPDPSVDYSYEKTLDIAHIRSCLGEKTLLTFHEFLA--KNKTDI-RLL 201
Query: 297 CYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRK--PSKRTVQDIEGKFKTDDD 354
CY C++D +H KK+K LG+S W E + K PS + KF +D+
Sbjct: 202 CYM---SGCYLDKDHEKKIKALGLSWSSKTDAWPEEASLKGYPSHPRASEFVAKFSCEDE 258
Query: 355 VIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFR 414
V+ +GD+FY+DVE + V Q GGP+ H CKT+I P+RLI++TAQRFVQTFLGSNF+ALHFR
Sbjct: 259 VLGIGDVFYSDVESELVAQIGGPLQHSCKTIIRPNRLILLTAQRFVQTFLGSNFMALHFR 318
Query: 415 RHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNG 474
RHGFLKFCN K SCF+PIPQAA CI R E AK VIY+STDAAESE +LLQ L+ G
Sbjct: 319 RHGFLKFCNVKNESCFFPIPQAASCILRKIEEAKPRVIYISTDAAESEITLLQQLL---G 375
Query: 475 KTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIM 534
+ +++RP +S EKWD++L R L D++V AMLDKTICAM+ FIG GSTFT+DI+
Sbjct: 376 PDVPMIQRPQHDSREKWDAILQRKGLLADNEVVAMLDKTICAMAESFIGTPGSTFTDDIV 435
Query: 535 RLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
RLRK+WGS++ CD LCQG+ PNFIA+ +
Sbjct: 436 RLRKEWGSSNSCDGVLCQGQHPNFIADTD 464
>gi|217074406|gb|ACJ85563.1| unknown [Medicago truncatula]
Length = 242
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 190/242 (78%), Positives = 216/242 (89%)
Query: 322 MGKTETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHR 381
M K + VW++ DTR P K+TVQD+ KF DDDV+A+GD+FYA VE +WVMQPGGPI H+
Sbjct: 1 MSKPKAVWEDGDTRNPKKKTVQDVMDKFSYDDDVMAIGDVFYAKVEHEWVMQPGGPIAHQ 60
Query: 382 CKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCIT 441
CKTLIEP+RLI++TAQRF+QTFLG NFIALHFRRHGFLKFCNAKKPSCF+PIPQAADCI
Sbjct: 61 CKTLIEPNRLILLTAQRFIQTFLGRNFIALHFRRHGFLKFCNAKKPSCFFPIPQAADCIL 120
Query: 442 RLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLE 501
R+ ERA AP+IYLSTDAAESET LLQSL+VLNGK++ LV RP RNSAEKWD+LLYRHH+E
Sbjct: 121 RVIERADAPIIYLSTDAAESETGLLQSLIVLNGKSVPLVIRPARNSAEKWDALLYRHHIE 180
Query: 502 DDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAE 561
DSQVEAMLDKTICAMS+VFIGA GSTFTEDI+RLRKDWGS SLCDEYLC GEEPN +AE
Sbjct: 181 GDSQVEAMLDKTICAMSSVFIGAPGSTFTEDILRLRKDWGSASLCDEYLCHGEEPNIVAE 240
Query: 562 DE 563
+E
Sbjct: 241 NE 242
>gi|33457138|emb|CAE30294.1| GDP-fucose protein-O-fucosyltransferase 2 [Helianthus annuus]
Length = 190
Score = 327 bits (839), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 148/190 (77%), Positives = 168/190 (88%)
Query: 374 PGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPI 433
PGGPI H+C+TLIEPSR+I++TAQRF+QTFLG F+ALHFRRHGFLKFCNAK PSCF+PI
Sbjct: 1 PGGPIAHKCQTLIEPSRVILLTAQRFIQTFLGERFVALHFRRHGFLKFCNAKNPSCFFPI 60
Query: 434 PQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDS 493
PQAA+CI+RL ERA PVIYLSTDAAESET LLQSLV L GKT+ LV RP RNSAEKWD+
Sbjct: 61 PQAAECISRLVERANVPVIYLSTDAAESETGLLQSLVTLYGKTVPLVTRPARNSAEKWDA 120
Query: 494 LLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQG 553
LLYR L+ D QVEAMLDKTICA+S+VFIGA GSTFTEDI+RLR+ WG+ S+CDEYLC+G
Sbjct: 121 LLYRKKLDGDPQVEAMLDKTICALSSVFIGAPGSTFTEDILRLRRGWGTASVCDEYLCEG 180
Query: 554 EEPNFIAEDE 563
E PNFIA DE
Sbjct: 181 EVPNFIAGDE 190
>gi|168039767|ref|XP_001772368.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676355|gb|EDQ62839.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 335
Score = 314 bits (804), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 162/345 (46%), Positives = 220/345 (63%), Gaps = 16/345 (4%)
Query: 220 ICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSF 279
IC +GQ++NHLIC KHM+ AALLNR LV+P K DY+Y +LD+ H+ CLG V++F
Sbjct: 2 ICQTGQVTNHLICFRKHMWFAALLNRTLVLPHGKLDYKYEHLLDMAHLKKCLGNSSVITF 61
Query: 280 ENFMEMEKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSK 339
E F E+ H+DR +CY C D +H + + LG + G+ + W + + P
Sbjct: 62 EEFAELRNETLHVDRLICYM---HSCHFDKDHPRAWEALGYTFGERQDAWSS--SSNPPA 116
Query: 340 RTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCK-TLIEPSRLIMVTAQR 398
DI +F + V+AVGDLFYA VE G + H C + + P+R I+VTA+R
Sbjct: 117 AHKDDILSRFACNAKVLAVGDLFYAAVED----SHGFAVAHDCTVSPVLPNRNIVVTAER 172
Query: 399 FVQTFLGSNFIALHFRRHGFLKFCNAKK-PSCFYPIPQAADCITRLAERAKAPVIYLSTD 457
FVQT+LG NF++LHFRRHGFL FCN +K PSCFYPIPQAA CI R A+A V+YLSTD
Sbjct: 173 FVQTYLGPNFLSLHFRRHGFLVFCNKQKSPSCFYPIPQAARCIRRKVWEARADVVYLSTD 232
Query: 458 AAESETSLLQSLVVLNGKTIALVKRPPRNSAE-KWDSLLYRHHLEDDSQVEAMLDKTICA 516
A SE +L+ L G T+ V+R + AE KWDSLL+R+ LE + V AM+DKTIC
Sbjct: 233 APASEVDILKQ--ELAGVTV--VRRTKVHEAEAKWDSLLWRNGLESELAVMAMVDKTICV 288
Query: 517 MSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAE 561
+ F+G STF+ DI+R+R + + S CD ++C GE P+F+A+
Sbjct: 289 LGVGFMGTRESTFSYDILRMRNEMRTASSCDGFICDGETPSFLAQ 333
>gi|297742920|emb|CBI35787.3| unnamed protein product [Vitis vinifera]
Length = 171
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 139/171 (81%), Positives = 156/171 (91%)
Query: 393 MVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVI 452
M+TAQRFVQTFLG +F ALHFRRHGFLKFCNAK+PSCF+PIPQAADCI+R+ ERA PVI
Sbjct: 1 MLTAQRFVQTFLGKSFTALHFRRHGFLKFCNAKEPSCFFPIPQAADCISRVVERADTPVI 60
Query: 453 YLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDK 512
YLSTDAAESET LLQSLVVLNGK + L+KRP RNSAEKWD+LLYRH L+ DSQVEAMLDK
Sbjct: 61 YLSTDAAESETGLLQSLVVLNGKLVPLIKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDK 120
Query: 513 TICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
TICAM++VFIGA GSTFTEDI+RLR+ WGS S CDEYLCQGE+PNFIA++E
Sbjct: 121 TICAMASVFIGAPGSTFTEDILRLRRGWGSASHCDEYLCQGEQPNFIADNE 171
>gi|52208037|emb|CAH41978.1| protein-O-fucosyltransferase 2 [Glycine max]
Length = 164
Score = 291 bits (744), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 134/164 (81%), Positives = 148/164 (90%)
Query: 400 VQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAA 459
VQTFL SNF+ALHFRRHGFLKFCNAKKPSCFY I QAADCI R+ ERA AP+IYLSTDAA
Sbjct: 1 VQTFLESNFVALHFRRHGFLKFCNAKKPSCFYSITQAADCILRVVERADAPIIYLSTDAA 60
Query: 460 ESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSN 519
ESET LLQSLVVLNG+ + LV RP RNSAEKWD+LLYRH ++ DSQVEAMLDK+ICAMS+
Sbjct: 61 ESETGLLQSLVVLNGRPVPLVIRPARNSAEKWDALLYRHRMDGDSQVEAMLDKSICAMSS 120
Query: 520 VFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563
VFIGA GSTFTEDI+RLRKDWGS S+CDEYLCQGEEPN +AE+E
Sbjct: 121 VFIGAPGSTFTEDILRLRKDWGSASMCDEYLCQGEEPNIVAENE 164
>gi|12324014|gb|AAG51963.1|AC024260_1 hypothetical protein, 5' partial; 217-725 [Arabidopsis thaliana]
Length = 141
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 112/141 (79%), Positives = 127/141 (90%)
Query: 423 NAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKR 482
NAK PSCF+PIPQAA CITRL E+ +APV+YLSTDAAESET LLQSL++LNGKT+ LVKR
Sbjct: 1 NAKNPSCFFPIPQAASCITRLIEKVEAPVLYLSTDAAESETGLLQSLLILNGKTVPLVKR 60
Query: 483 PPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGS 542
P R+SAEKWD+LLYRH LE DSQVEAMLDKTICA+S+VFIGASGSTFTEDI+RLRKDWG+
Sbjct: 61 PARDSAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGASGSTFTEDILRLRKDWGT 120
Query: 543 TSLCDEYLCQGEEPNFIAEDE 563
S CDEYLC E+PNFIA+ E
Sbjct: 121 ASECDEYLCANEQPNFIADHE 141
>gi|302772334|ref|XP_002969585.1| hypothetical protein SELMODRAFT_410382 [Selaginella moellendorffii]
gi|300163061|gb|EFJ29673.1| hypothetical protein SELMODRAFT_410382 [Selaginella moellendorffii]
Length = 399
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 110/259 (42%), Positives = 144/259 (55%), Gaps = 58/259 (22%)
Query: 305 CFVDDEHIKKLKQLGISMGKTETVWKNEDTRK--PSKRTVQDIEGKFKTDDDVIAVGDLF 362
C++D++H KK+K LG+S W E + K PS + KF +D+V+ +GD+F
Sbjct: 4 CYLDEDHEKKIKALGLSWSSKNDAWPEEASLKGYPSHPRASEFVAKFSCEDEVLGIGDVF 63
Query: 363 YADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFC 422
Y+DVE + V Q GGP+ H CKT+I +RLI++T Q
Sbjct: 64 YSDVESELVAQIGGPLQHSCKTIIRLNRLILLTVQS------------------------ 99
Query: 423 NAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKR 482
N K SCF+PIPQAA CI+R + AK VIY+STDAAESE +LLQ L+
Sbjct: 100 NVKNESCFFPIPQAASCISRKIQEAKPRVIYISTDAAESEITLLQQLL------------ 147
Query: 483 PPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGS 542
V AMLDKTICAM+ FIG GSTFT+DI+RLRK+WGS
Sbjct: 148 --------------------GPDVVAMLDKTICAMAESFIGTPGSTFTDDIVRLRKEWGS 187
Query: 543 TSLCDEYLCQGEEPNFIAE 561
++ CD LCQG+ PNFIA+
Sbjct: 188 SNSCDGVLCQGQYPNFIAD 206
>gi|399604589|gb|AFP49325.1| GDP-fucose protein-O-fucosyltransferase 2, partial [Olea europaea]
Length = 87
Score = 157 bits (398), Expect = 1e-35, Method: Composition-based stats.
Identities = 68/87 (78%), Positives = 78/87 (89%)
Query: 477 IALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRL 536
+ LV+RP RNSAEKWD+LLYRH LE D+QVEAMLDK+ICA+S VFIG+ GSTFT+DI+RL
Sbjct: 1 VPLVQRPARNSAEKWDALLYRHGLEGDAQVEAMLDKSICALSTVFIGSGGSTFTDDILRL 60
Query: 537 RKDWGSTSLCDEYLCQGEEPNFIAEDE 563
RKDWGS S CDEYLCQGE PNF+AEDE
Sbjct: 61 RKDWGSASACDEYLCQGELPNFVAEDE 87
>gi|302771912|ref|XP_002969374.1| protein-O-fucosyltransferase-like protein [Selaginella
moellendorffii]
gi|300162850|gb|EFJ29462.1| protein-O-fucosyltransferase-like protein [Selaginella
moellendorffii]
Length = 245
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/291 (36%), Positives = 142/291 (48%), Gaps = 62/291 (21%)
Query: 237 MFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHIDRFL 296
MF AALL+R LV P DY Y + LDI HI CLG K +++ F + KN I R L
Sbjct: 1 MFFAALLDRTLVFPDPCVDYSYEKTLDIAHICSCLGEKTLLTLHEF--LGKNKTDI-RLL 57
Query: 297 CYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDIEGKFKTDDDVI 356
CY CF+D++H KK+K LG+ +K + + + + +
Sbjct: 58 CYM---SGCFLDEDHEKKIKALGLPR-------PMPGLKKLLSKAIHLVLARANSWRR-- 105
Query: 357 AVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRH 416
D+FY+DVE + V Q GGP H CKT+I P+R I++TAQ SN LH
Sbjct: 106 ---DVFYSDVESELVAQIGGPFQHSCKTIIRPNRPILLTAQ--------SN---LHLEED 151
Query: 417 GFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKT 476
P QA + A ESE +LLQ L+ G
Sbjct: 152 ---------------PGGQALSDLHL---------------AVESEITLLQQLL---GPD 178
Query: 477 IALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGS 527
+ +++RP +S EKWD++L R L D+QV AMLDKTICAM+ FIG GS
Sbjct: 179 VPMIQRPQHDSREKWDAILQRKGLLADNQVVAMLDKTICAMAQSFIGTRGS 229
>gi|302774881|ref|XP_002970857.1| hypothetical protein SELMODRAFT_411497 [Selaginella moellendorffii]
gi|300161568|gb|EFJ28183.1| hypothetical protein SELMODRAFT_411497 [Selaginella moellendorffii]
Length = 952
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 143/318 (44%), Gaps = 88/318 (27%)
Query: 252 SKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHIDRFLCYFGLPQPCFVDDEH 311
S DY Y + LDI I CLG K +++F F+ KN I R LCY C++D++H
Sbjct: 496 SSVDYSYEKTLDI--ICSCLGEKTLLTFHEFLG--KNKTDI-RLLCYM---SGCYLDEDH 547
Query: 312 IKKLKQLGISMGKTETVWKNEDTRK--PSKRTVQDIEGKFKTDDDVIAVGDLFYADVERD 369
KK+K LG+S W E + K PS + F + +V+ +GD+FY+DVE +
Sbjct: 548 EKKIKALGLSWSSKTDAWPEEASLKGYPSHPRASEFVATFSCEVEVLGIGDVFYSDVESE 607
Query: 370 WVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSC 429
V Q I + R +Q KP
Sbjct: 608 LVAQ------------------IAICISRKIQE----------------------AKPRV 627
Query: 430 FYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAE 489
Y +STDAAESE +LLQ L+ G + +++RP +S E
Sbjct: 628 IY----------------------ISTDAAESEITLLQQLL---GPDVPMIQRPQHDSRE 662
Query: 490 KWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWG-----STS 544
KWD++L R L D++V A D A N G GSTFT+DI+RLRK+WG S
Sbjct: 663 KWDAILQRKGLLADNEVVAGSD----AAQN---GTPGSTFTDDILRLRKEWGKYSFQGLS 715
Query: 545 LCDEYLCQGEEPNFIAED 562
L E LC+G F+ ++
Sbjct: 716 LVHE-LCRGGAQYFVGKN 732
>gi|357445471|ref|XP_003593013.1| GDP-fucose protein-O-fucosyltransferase [Medicago truncatula]
gi|355482061|gb|AES63264.1| GDP-fucose protein-O-fucosyltransferase [Medicago truncatula]
Length = 182
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/128 (50%), Positives = 83/128 (64%), Gaps = 20/128 (15%)
Query: 285 MEKNHAHIDRFLCYFGLPQPCFVDD-EHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQ 343
+ + + HID+FLCYF LPQPC++DD E +KKL LG+ M K + VW++ED R P K V
Sbjct: 2 LRRVYLHIDKFLCYFALPQPCYLDDDERLKKLGGLGLGMSKPKAVWEDEDARNPKKTVVM 61
Query: 344 DIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTF 403
D KF DDDV+A+GD+FY +VE LIEP+ LI++TAQRF+QTF
Sbjct: 62 D---KFSYDDDVMAIGDVFYVEVE----------------LLIEPNHLILLTAQRFIQTF 102
Query: 404 LGSNFIAL 411
L NFIA
Sbjct: 103 LERNFIAF 110
>gi|238007178|gb|ACR34624.1| unknown [Zea mays]
Length = 64
Score = 112 bits (279), Expect = 7e-22, Method: Composition-based stats.
Identities = 49/59 (83%), Positives = 53/59 (89%)
Query: 503 DSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAE 561
D QV+AMLDKTICA+SNVFIG+SGSTFTEDI RLR+ WGS S CDEYLCQGE PNFIAE
Sbjct: 4 DGQVDAMLDKTICALSNVFIGSSGSTFTEDIFRLRRGWGSASHCDEYLCQGELPNFIAE 62
>gi|357445473|ref|XP_003593014.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
gi|355482062|gb|AES63265.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
Length = 628
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 39/65 (60%), Positives = 49/65 (75%)
Query: 197 IDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDY 256
I+ + ++T+EW PK+DKFL AIC+SGQMSNHLIC E HMF +LNRVLVIP+ K Y
Sbjct: 564 IEQSLSKRKTIEWNPKNDKFLVAICVSGQMSNHLICFENHMFFPEILNRVLVIPNFKVSY 623
Query: 257 QYSRV 261
QY +
Sbjct: 624 QYDMI 628
>gi|297846002|ref|XP_002890882.1| hypothetical protein ARALYDRAFT_890615 [Arabidopsis lyrata subsp.
lyrata]
gi|297336724|gb|EFH67141.1| hypothetical protein ARALYDRAFT_890615 [Arabidopsis lyrata subsp.
lyrata]
Length = 99
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 55/85 (64%), Gaps = 8/85 (9%)
Query: 92 LRSLFSGNYVNFRFDSLADRMRESELRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNN-- 149
L++L +G F+ D L++R++ESELRA LL+QQ LLSLWN + VN S + N
Sbjct: 10 LQTLAAG----FKLDPLSNRVKESELRAFYLLRQQHLSLLSLWNGTLVNPSLNQSENVLG 65
Query: 150 --PFFQEAKSVLLNQISLNRQIEQI 172
F++ KS + QISLN++I+++
Sbjct: 66 SFVLFEDVKSAVSKQISLNKEIQEV 90
>gi|297788918|ref|XP_002862488.1| hypothetical protein ARALYDRAFT_920653 [Arabidopsis lyrata subsp.
lyrata]
gi|297792671|ref|XP_002864220.1| hypothetical protein ARALYDRAFT_918378 [Arabidopsis lyrata subsp.
lyrata]
gi|297308032|gb|EFH38746.1| hypothetical protein ARALYDRAFT_920653 [Arabidopsis lyrata subsp.
lyrata]
gi|297310055|gb|EFH40479.1| hypothetical protein ARALYDRAFT_918378 [Arabidopsis lyrata subsp.
lyrata]
Length = 99
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 54/85 (63%), Gaps = 8/85 (9%)
Query: 92 LRSLFSGNYVNFRFDSLADRMRESELRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNN-- 149
L++L +G F+ D L++R++ESEL A LL+QQ LLSLWN + VN S + N
Sbjct: 10 LQTLAAG----FKLDPLSNRVKESELCAFYLLRQQHLSLLSLWNGTLVNPSLNQSENALG 65
Query: 150 --PFFQEAKSVLLNQISLNRQIEQI 172
F++ KS + QISLN++I+++
Sbjct: 66 SFVLFEDVKSAVSKQISLNKEIQEV 90
>gi|147802507|emb|CAN73120.1| hypothetical protein VITISV_031173 [Vitis vinifera]
Length = 135
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 68/143 (47%), Gaps = 52/143 (36%)
Query: 1 MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR 60
MER SSDD++DR+ LI +++ K LP S F I+DF +
Sbjct: 1 MERXSSDDEEDRQNLIDEHERK-----LP------------HRSGFQIEDFKS------- 36
Query: 61 FTFDFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRAL 120
+L+ R+ F S+ ++ + DS DRMRESELRAL
Sbjct: 37 ------RLSTHRFSFNKSISIVKV---------------------DSPTDRMRESELRAL 69
Query: 121 SLLKQQQSHLLSLWNQS-FVNNS 142
LL+QQQ L SLWN + FV+++
Sbjct: 70 YLLRQQQLSLFSLWNHTAFVDSA 92
>gi|50079998|emb|CAH03732.1| protein-O-fucosyltransferase 2 [Caenorhabditis briggsae]
Length = 424
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 82/166 (49%), Gaps = 27/166 (16%)
Query: 397 QRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLST 456
+R +T G ++ +H+RR FL + + PS I A+ + +L E+ K IYL+T
Sbjct: 272 ERAKRTAKGGPYLGIHWRRRDFLYARHDQLPS----IKGTAEILEKLCEKLKLNRIYLAT 327
Query: 457 DAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICA 516
DA E E + L+S GK + + + D+ Y +D Q+ A++D+ +CA
Sbjct: 328 DAPEEEVNELKSYF---GKELQVFRFT--------DTQKY-----NDGQI-AIIDQYLCA 370
Query: 517 MSNVFIGASGSTFTEDIMRLRKDWG----ST--SLCDEYLCQGEEP 556
+ FIG+ STFT I R+ G ST LC + + E+P
Sbjct: 371 HAAHFIGSYESTFTFRIQEDREILGFPISSTFNRLCPDNIPDCEQP 416
>gi|392896418|ref|NP_001255070.1| Protein PAD-2, isoform a [Caenorhabditis elegans]
gi|26006707|sp|Q8WR51.1|OFUT2_CAEEL RecName: Full=GDP-fucose protein O-fucosyltransferase 2; AltName:
Full=Patterning defective protein 2; AltName:
Full=Peptide-O-fucosyltransferase 2; Short=O-FucT-2;
Flags: Precursor
gi|18182600|gb|AAL65193.1|AF455271_1 C21ORF80 [Caenorhabditis elegans]
gi|22859077|emb|CAD45602.1| Protein PAD-2, isoform a [Caenorhabditis elegans]
Length = 424
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 77/167 (46%), Gaps = 28/167 (16%)
Query: 395 TAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYL 454
T ++ +T +G ++ +H+RR FL A+ P+ IP A + L ++ IYL
Sbjct: 271 TKEKPRRTAIGGPYLGIHWRRRDFLYARRAQLPT----IPGTAKILQDLCKKLDLQKIYL 326
Query: 455 STDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRH---HLEDDSQVEAMLD 511
+TDA + E L++L LNG+ +YR +D Q+ A++D
Sbjct: 327 ATDAPDQEVDELKAL--LNGEL-----------------EVYRFTDTQKLNDGQI-AIID 366
Query: 512 KTICAMSNVFIGASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEEPN 557
+ +CA + FIG+ STFT I R+ G S LC EP
Sbjct: 367 QYLCAHAAYFIGSYESTFTFRIQEDREIIGFPISTTFNRLCPDTEPT 413
>gi|392896416|ref|NP_001255069.1| Protein PAD-2, isoform b [Caenorhabditis elegans]
gi|339730623|emb|CCC42186.1| Protein PAD-2, isoform b [Caenorhabditis elegans]
Length = 426
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 77/167 (46%), Gaps = 28/167 (16%)
Query: 395 TAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYL 454
T ++ +T +G ++ +H+RR FL A+ P+ IP A + L ++ IYL
Sbjct: 273 TKEKPRRTAIGGPYLGIHWRRRDFLYARRAQLPT----IPGTAKILQDLCKKLDLQKIYL 328
Query: 455 STDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRH---HLEDDSQVEAMLD 511
+TDA + E L++L LNG+ +YR +D Q+ A++D
Sbjct: 329 ATDAPDQEVDELKAL--LNGEL-----------------EVYRFTDTQKLNDGQI-AIID 368
Query: 512 KTICAMSNVFIGASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEEPN 557
+ +CA + FIG+ STFT I R+ G S LC EP
Sbjct: 369 QYLCAHAAYFIGSYESTFTFRIQEDREIIGFPISTTFNRLCPDTEPT 415
>gi|297810473|ref|XP_002873120.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297318957|gb|EFH49379.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 96
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 42/64 (65%), Gaps = 4/64 (6%)
Query: 113 RESELRALSLLKQQQSHLLSLWNQSFVNNSYGNNTNN----PFFQEAKSVLLNQISLNRQ 168
+ESELRAL LL+QQ LLSLWN + VN S + N F++ KS + QISLN++
Sbjct: 24 KESELRALYLLRQQHLSLLSLWNGTLVNPSLNQSENALGSFVLFEDVKSAVSKQISLNKE 83
Query: 169 IEQI 172
I+++
Sbjct: 84 IQEV 87
>gi|339239761|ref|XP_003375806.1| GDP-fucose protein O-fucosyltransferase 2 [Trichinella spiralis]
gi|316975516|gb|EFV58949.1| GDP-fucose protein O-fucosyltransferase 2 [Trichinella spiralis]
Length = 579
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 78/159 (49%), Gaps = 23/159 (14%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
+G ++A+H RR F+ + + P+ + AA+ I++L + K V+Y+STDA E+E
Sbjct: 275 IGGPYLAVHLRRRDFVTSRSKQIPT----VKGAAEQISKLLKMLKLEVVYVSTDAPETEV 330
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
L++ + A++KR A+ L D V A +++ ICA + FIG
Sbjct: 331 DELKAFL----NETAVIKRFKPTDAQLQKFL--------DGGV-ATIEQWICAHAKYFIG 377
Query: 524 ASGSTFTEDIMRLRKDWGST------SLCDEYLCQGEEP 556
+ STF+ I R+ G + LC ++ E+P
Sbjct: 378 TAESTFSFRIQEDREILGFSHNTTFNCLCPDHNLNCEQP 416
>gi|341878872|gb|EGT34807.1| CBN-PAD-2 protein [Caenorhabditis brenneri]
Length = 465
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 71/162 (43%), Gaps = 22/162 (13%)
Query: 397 QRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLST 456
+R ++ LG ++ +H+RR FL + PS I A + +L E+ K IYL+T
Sbjct: 270 ERARRSALGGPYLGIHWRRRDFLYARQEQLPS----IKGTALILEKLCEQLKLNRIYLAT 325
Query: 457 DAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICA 516
DA E E L+S + K N D Q+ A++D+ +CA
Sbjct: 326 DAPEEEVEELKSYLKEGLKVYRFTDTQKFN----------------DGQI-AIIDQYLCA 368
Query: 517 MSNVFIGASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEEPN 557
+ FIG+ STFT I R+ G S LC + P+
Sbjct: 369 HARHFIGSHESTFTFRIQEDREILGFPVSSTFNRLCPDKNPD 410
>gi|32487281|emb|CAE02610.1| putative protein-O-fucosyltransferase [Gallus gallus]
Length = 380
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 64/137 (46%), Gaps = 20/137 (14%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T LG ++ +H RR F+ PS + AA I L + K ++++TDA E
Sbjct: 233 TALGGPYLGVHLRRKDFIWGHREDVPS----LHGAAKKIHSLLKTHKLEKVFIATDAVED 288
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
E LL+ LV +V+ P W+ L L D + A++D+ ICA + F
Sbjct: 289 EIELLKKLVP------EMVRFEP-----TWEEL----ELYKDGGM-AVIDQWICAHARYF 332
Query: 522 IGASGSTFTEDIMRLRK 538
IG S STF+ I R+
Sbjct: 333 IGTSVSTFSFRIHEERE 349
>gi|308497178|ref|XP_003110776.1| CRE-PAD-2 protein [Caenorhabditis remanei]
gi|308242656|gb|EFO86608.1| CRE-PAD-2 protein [Caenorhabditis remanei]
Length = 441
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 24/135 (17%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +I +H+RR FL + PS I AD + L+E+ K IYL+TDA E E +
Sbjct: 298 GGPYIGIHWRRRDFLYARKEQLPS----IKGTADILENLSEKLKLNKIYLATDAPEEEIN 353
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ + + N D Q+ A++D+ +CA + FIG+
Sbjct: 354 ELKYYLREELEVFRFTDSQKFN----------------DGQI-AIIDQYLCAHARHFIGS 396
Query: 525 SGSTFTEDIMRLRKD 539
STF+ R+++D
Sbjct: 397 HESTFS---FRIQED 408
>gi|118093363|ref|XP_421892.2| PREDICTED: GDP-fucose protein O-fucosyltransferase 2 [Gallus
gallus]
Length = 438
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 64/137 (46%), Gaps = 20/137 (14%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T LG ++ +H RR F+ PS + AA I L + K ++++TDA E
Sbjct: 291 TALGGPYLGVHLRRKDFIWGHREDVPS----LHGAAKKIHSLLKTHKLEKVFIATDAVED 346
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
E LL+ LV +V+ P W+ L L D + A++D+ ICA + F
Sbjct: 347 EIELLKKLVP------EMVRFEP-----TWEEL----ELYKDGGM-AVIDQWICAHARYF 390
Query: 522 IGASGSTFTEDIMRLRK 538
IG S STF+ I R+
Sbjct: 391 IGTSVSTFSFRIHEERE 407
>gi|170589127|ref|XP_001899325.1| GDP-fucose protein O-fucosyltransferase 2 precursor [Brugia malayi]
gi|158593538|gb|EDP32133.1| GDP-fucose protein O-fucosyltransferase 2 precursor, putative
[Brugia malayi]
Length = 455
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 60/134 (44%), Gaps = 21/134 (15%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++I +H+RR F++ +A PS I A+ I + + +YL TDA+ +E +
Sbjct: 305 GGDYICVHWRRRDFVRSHSADIPS----ISGTAEQIVVILRKENLTTLYLCTDASTAEVA 360
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L SL+ + + + N E +++D+ ICA + FIG
Sbjct: 361 TLTSLLPYDVQVERFLNDSSLNDGEV-----------------SIVDQWICAHARYFIGT 403
Query: 525 SGSTFTEDIMRLRK 538
STF+ I R+
Sbjct: 404 HTSTFSYRIQEDRE 417
>gi|291227449|ref|XP_002733708.1| PREDICTED: protein O-fucosyltransferase 2-like [Saccoglossus
kowalevskii]
Length = 442
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 62/127 (48%), Gaps = 17/127 (13%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
+G ++A+H RR F+ PS + +AA I ++ ++ K ++++TDA + E
Sbjct: 292 IGGPYMAVHLRRKDFVHSHKKDVPS----LKEAAKQIKKVLKKEKLDKLFVATDAPKVEL 347
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
L+ L + GK P ++ EK+ D V A++D+ IC + F+G
Sbjct: 348 DELKKL--MKGKATVYNFEPTKDXXEKYK----------DGGV-AIIDQWICTHARYFMG 394
Query: 524 ASGSTFT 530
S STF+
Sbjct: 395 TSVSTFS 401
>gi|167537602|ref|XP_001750469.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771009|gb|EDQ84683.1| predicted protein [Monosiga brevicollis MX1]
Length = 147
Score = 50.1 bits (118), Expect = 0.003, Method: Composition-based stats.
Identities = 42/150 (28%), Positives = 67/150 (44%), Gaps = 18/150 (12%)
Query: 392 IMVTAQRFVQTFLGSN-FIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAP 450
++ TA+ F+ T GS ++ALH R+ F + + P+ A + LA A
Sbjct: 7 LIRTAEEFIHTNFGSRAYLALHLRQGDFK--AHGRMPAIEL---VARQLLHALAADANLT 61
Query: 451 VIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAML 510
I+L++DA S+ + L + + LV+ P + D L A+L
Sbjct: 62 SIFLASDADTSDPDMAWLTAQLAAQNVPLVRLTPPAGSSLGDGQL------------AIL 109
Query: 511 DKTICAMSNVFIGASGSTFTEDIMRLRKDW 540
D+ +CA ++ FIG STFT I R W
Sbjct: 110 DQIVCARAHKFIGTPESTFTFRIQEERPRW 139
>gi|402583728|gb|EJW77671.1| O-fucosyltransferase 2 [Wuchereria bancrofti]
Length = 220
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 60/134 (44%), Gaps = 21/134 (15%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++I +H+RR F++ +A PS I A+ I + + +YL TDA+ +E +
Sbjct: 70 GGDYICVHWRRRDFVRSHSADIPS----INGTAEQIVVILRKENLTTLYLCTDASTAEVA 125
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L SL+ + + + N E +++D+ ICA + FIG
Sbjct: 126 TLTSLLPYDVQVERFLNDSSLNDGEV-----------------SIVDQWICAHARYFIGT 168
Query: 525 SGSTFTEDIMRLRK 538
STF+ I R+
Sbjct: 169 HTSTFSYRIQEDRE 182
>gi|449281897|gb|EMC88856.1| GDP-fucose protein O-fucosyltransferase 2, partial [Columba livia]
Length = 386
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 59/129 (45%), Gaps = 20/129 (15%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T G ++ +H RR F+ PS + A I L E K ++++TDA E
Sbjct: 239 TARGGPYLGVHLRRKDFIWGHREDVPS----LQGAVKKIRSLLETHKLQKVFVATDAVEE 294
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
E LL+ L+ +V+ P W+ L L D + A++D+ ICA + F
Sbjct: 295 EVELLKKLLP------EMVRFEP-----SWEEL----ELYKDGGL-AVIDQWICAHARYF 338
Query: 522 IGASGSTFT 530
IG S STF+
Sbjct: 339 IGTSVSTFS 347
>gi|324512707|gb|ADY45254.1| GDP-fucose protein O-fucosyltransferase 2 [Ascaris suum]
Length = 432
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 21/135 (15%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG NFI +H+RR F++ PS + AA I + R K ++LSTDA E
Sbjct: 280 LGGNFICVHWRRRDFVRSHRDDIPS----VEGAARQIVNILNREKLKKVFLSTDAPTDEV 335
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
+ L+ + T +V++ H D ++ +++D+ ICA + FIG
Sbjct: 336 ARLRKSL----PTDVIVEQ------------FLDHGSLSDGEI-SIVDQWICAHARYFIG 378
Query: 524 ASGSTFTEDIMRLRK 538
STF+ I R+
Sbjct: 379 THVSTFSYRIQEDRE 393
>gi|390331858|ref|XP_783445.3| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like
[Strongylocentrotus purpuratus]
Length = 435
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 68/154 (44%), Gaps = 19/154 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +IA+H RR F + N +K +P+AA I + + K ++++TDA SE
Sbjct: 288 GGPYIAVHLRRQDFAR--NNRKEVA--SLPKAATHIKKKLKELKLKKVFVATDAPASEID 343
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ V L G + PPR H D V A++D+ ICA + FIG
Sbjct: 344 QLK--VHLKGYEVYNYI-PPRTV----------HDRFMDGGV-AIIDQWICAHARYFIGT 389
Query: 525 SGSTFTEDIMRLRKDWG-STSLCDEYLCQGEEPN 557
STFT I R+ G T + C EP
Sbjct: 390 GQSTFTFRIFEERQLLGFDTKMTYNRFCGDGEPT 423
>gi|326936552|ref|XP_003214317.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like
[Meleagris gallopavo]
Length = 466
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 64/140 (45%), Gaps = 20/140 (14%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T LG ++ +H RR F+ PS + AA I L + ++++TDA E
Sbjct: 319 TALGGPYLGVHLRRKDFIWGHREDVPS----LHGAAKKIHSLLKTHNLDKVFIATDAVED 374
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
E LL+ LV +V+ P W+ L L D + A++D+ ICA + F
Sbjct: 375 EIELLKKLVP------EMVRFEP-----TWEEL----ELYKDGGM-AVIDQWICAHARYF 418
Query: 522 IGASGSTFTEDIMRLRKDWG 541
IG S STF+ I R+ G
Sbjct: 419 IGTSVSTFSFRIHEEREILG 438
>gi|393910987|gb|EFO25685.2| GDP-fucose protein O-fucosyltransferase 2 [Loa loa]
Length = 371
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 63/136 (46%), Gaps = 25/136 (18%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++I H+RR F++ +A PS I A I + +YL TDA+ +E +
Sbjct: 221 GGDYICAHWRRRDFVQSHSADIPS----INGTAKQIAVILRTENLTTLYLCTDASTAEVT 276
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKW--DSLLYRHHLEDDSQVEAMLDKTICAMSNVFI 522
L L+ P E++ DSLL +D +V +++D+ ICA + FI
Sbjct: 277 TLIDLL------------PSEVQVERFLNDSLL------NDGEV-SIVDQWICAHARYFI 317
Query: 523 GASGSTFTEDIMRLRK 538
G STF+ I R+
Sbjct: 318 GTHTSTFSYRIQEDRE 333
>gi|126344424|ref|XP_001370885.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like
[Monodelphis domestica]
Length = 443
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 20/129 (15%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T LG ++ +H RR F+ PS + A I L E K ++++TDA
Sbjct: 296 TSLGGPYLGVHLRRKDFIWGHREDVPS----LQGAVKTIRSLMETHKLTRVFVATDAVRK 351
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
E L+ L+ +V+ P W+ L L D V A++D+ IC + VF
Sbjct: 352 EAEELKRLLP------EMVRFEP-----TWEEL----ELYKDGGV-AIIDQWICTHARVF 395
Query: 522 IGASGSTFT 530
IG S STF+
Sbjct: 396 IGTSVSTFS 404
>gi|260829181|ref|XP_002609540.1| hypothetical protein BRAFLDRAFT_129871 [Branchiostoma floridae]
gi|229294902|gb|EEN65550.1| hypothetical protein BRAFLDRAFT_129871 [Branchiostoma floridae]
Length = 397
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 62/131 (47%), Gaps = 20/131 (15%)
Query: 400 VQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAA 459
V T G ++A+H RR F + + PS + AA I ++ + K +YL+TDA
Sbjct: 246 VGTAKGGPYLAVHLRRGDFARSHSKNVPS----LQGAAKQIRQIKKEQKLKKVYLATDAN 301
Query: 460 ESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSN 519
+ E LQ +V K I K P + K ++D A++D+ IC+ +
Sbjct: 302 DQEVKELQKMV----KGIVRFK-PSKEELNK---------MKDGGI--AIIDQWICSHAR 345
Query: 520 VFIGASGSTFT 530
F+G + STF+
Sbjct: 346 YFVGTALSTFS 356
>gi|312070953|ref|XP_003138384.1| GDP-fucose protein O-fucosyltransferase 2 [Loa loa]
Length = 474
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 64/139 (46%), Gaps = 25/139 (17%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++I H+RR F++ +A PS I A I + +YL TDA+ +E +
Sbjct: 324 GGDYICAHWRRRDFVQSHSADIPS----INGTAKQIAVILRTENLTTLYLCTDASTAEVT 379
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKW--DSLLYRHHLEDDSQVEAMLDKTICAMSNVFI 522
L L+ P E++ DSLL +D +V +++D+ ICA + FI
Sbjct: 380 TLIDLL------------PSEVQVERFLNDSLL------NDGEV-SIVDQWICAHARYFI 420
Query: 523 GASGSTFTEDIMRLRKDWG 541
G STF+ I R+ G
Sbjct: 421 GTHTSTFSYRIQEDREILG 439
>gi|395537025|ref|XP_003770508.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2 [Sarcophilus
harrisii]
Length = 441
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 63/137 (45%), Gaps = 20/137 (14%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T LG ++A+H RR F+ PS + A I L E K ++++TDA
Sbjct: 294 TSLGGPYLAVHLRRKDFVWGHREDVPS----LQGAVKTIRSLMETHKLDRVFVATDAVRK 349
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
E L+ L+ +V+ P W+ L L D + A++D+ ICA + VF
Sbjct: 350 ECEELKKLLP------EMVRFEP-----TWEEL----ELYKDGGI-AIIDQWICAHARVF 393
Query: 522 IGASGSTFTEDIMRLRK 538
IG S STF+ I R+
Sbjct: 394 IGTSVSTFSFRIHEERE 410
>gi|50080000|emb|CAH03733.1| protein-O-fucosyltransferase 2 [Drosophila pseudoobscura]
Length = 421
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +FI H RR F++ A P+ + AA I +L A ++LSTDA +
Sbjct: 264 GGDFICAHLRRGDFVRSREATTPN----LKSAAQQIKQLLRAFNATTVFLSTDATPYD-- 317
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+Q + G V P +S L R L+D A++D+ +C+ + FIG
Sbjct: 318 LVQMKNMFYG--FRFVHFQP-------ESNLQRQKLKDGGV--AIVDQLVCSYARYFIGT 366
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 367 YESTFTYRIYEEREILG 383
>gi|198470920|ref|XP_001355438.2| GA13247 [Drosophila pseudoobscura pseudoobscura]
gi|198145682|gb|EAL32496.2| GA13247 [Drosophila pseudoobscura pseudoobscura]
Length = 492
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +FI H RR F++ A P+ + AA I +L A ++LSTDA +
Sbjct: 335 GGDFICAHLRRGDFVRSREATTPN----LKSAAQQIKQLLRAFNATTVFLSTDATPYD-- 388
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+Q + G V P +S L R L+D A++D+ +C+ + FIG
Sbjct: 389 LVQMKNMFYG--FRFVHFQP-------ESNLQRQKLKDGGV--AIVDQLVCSYARYFIGT 437
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 438 YESTFTYRIYEEREILG 454
>gi|195162191|ref|XP_002021939.1| GL14377 [Drosophila persimilis]
gi|194103837|gb|EDW25880.1| GL14377 [Drosophila persimilis]
Length = 492
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +FI H RR F++ A P+ + AA I +L A ++LSTDA +
Sbjct: 335 GGDFICAHLRRGDFVRSREATTPN----LKSAAQQIKQLLRAFNATTVFLSTDATPYD-- 388
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+Q + G V P +S L R L+D A++D+ +C+ + FIG
Sbjct: 389 LVQMKNMFYG--FRFVHFQP-------ESNLQRQKLKDGGV--AIVDQLVCSYARYFIGT 437
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 438 YESTFTYRIYEEREILG 454
>gi|75911322|gb|ABA29473.1| protein O-fucosyltransferase 2 [Drosophila pseudoobscura]
Length = 495
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +FI H RR F++ A P+ + AA I +L A ++LSTDA +
Sbjct: 338 GGDFICAHLRRGDFVRSREATTPN----LKSAAQQIKQLLRAFNATTVFLSTDATPYD-- 391
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+Q + G V P +S L R L+D A++D+ +C+ + FIG
Sbjct: 392 LVQMKNMFYG--FRFVHFQP-------ESNLQRQKLKDGGV--AIVDQLVCSYARYFIGT 440
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 441 YESTFTYRIYEEREILG 457
>gi|221114309|ref|XP_002160070.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like [Hydra
magnipapillata]
Length = 437
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 58/134 (43%), Gaps = 19/134 (14%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG +IA+H RR F P+ I Q A + ++ K IYLSTDA + E
Sbjct: 288 LGGPYIAVHMRRGDFSYARKETVPT----IEQYAQNAENMLKKYKLKNIYLSTDATDEEI 343
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
+S V GK + PR EK+ D V A++D+ I A + FIG
Sbjct: 344 EEFKSNVF--GKVFIYPR--PREFIEKY----------KDGGV-AIIDQWIAAHARYFIG 388
Query: 524 ASGSTFTEDIMRLR 537
STF+ I R
Sbjct: 389 TCSSTFSFRIYEER 402
>gi|348554782|ref|XP_003463204.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2 [Cavia
porcellus]
Length = 429
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 62/135 (45%), Gaps = 20/135 (14%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG ++A+H RR F+ PS + AA I L + P ++++TDA E
Sbjct: 284 LGGPYLAVHLRRKDFIWGHREDVPS----LEGAARKIRSLMTAHQLPRVFVATDAIRKEQ 339
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
L+ L+ +V+ P W+ L L D + A++D+ ICA + FIG
Sbjct: 340 EELKRLLP------EMVRFEP-----TWEEL----ELYKDGGI-AIIDQWICAHARFFIG 383
Query: 524 ASGSTFTEDIMRLRK 538
S STF+ I R+
Sbjct: 384 TSVSTFSFRIHEERE 398
>gi|5817016|emb|CAB53644.1| EG:BACN32G11.6 [Drosophila melanogaster]
Length = 423
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 17/139 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ +A P+ + AA + +L ++L+TDA E
Sbjct: 262 GGDYLCAHLRRGDFVRSRDATTPT----LKAAAQQVKQLLRGFNMTTVFLATDATPYELM 317
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L LV P ++ + R L+D A++D+ +CA + F+G
Sbjct: 318 ELKELFY----RFRLVHFAPESNVQ-------RRELKDGGV--AVVDQLVCAYARYFVGT 364
Query: 525 SGSTFTEDIMRLRKDWGST 543
STFT I R+ G T
Sbjct: 365 YESTFTYRIYEEREILGFT 383
>gi|449506401|ref|XP_002191151.2| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like
[Taeniopygia guttata]
Length = 445
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 57/129 (44%), Gaps = 20/129 (15%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T LG ++ +H RR F+ PS + A I L E ++++TDA E
Sbjct: 298 TALGGPYLGVHLRRRDFIWGHREDVPS----LQGAVKKIYSLLETLNLEKVFVATDAVEE 353
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
E LL+ L+ +V+ P L L D + A++D+ ICA + F
Sbjct: 354 EVELLKKLLP------EMVRFEPS---------LEELELYKDGGL-AVIDQWICAHARYF 397
Query: 522 IGASGSTFT 530
IG S STF+
Sbjct: 398 IGTSVSTFS 406
>gi|307195631|gb|EFN77473.1| GDP-fucose protein O-fucosyltransferase 2 [Harpegnathos saltator]
Length = 403
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 68/152 (44%), Gaps = 18/152 (11%)
Query: 390 RLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKA 449
R I T ++ Q G ++A+H RR F+ P+ I AA +T ++
Sbjct: 241 RPIDWTKEKSQQRVKGGPYLAVHLRRRDFIVGHKDSIPT----IENAASQLTEKMDKLGL 296
Query: 450 PVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAM 509
V++++TDAA E L+ + K +V+ P SL + +D A+
Sbjct: 297 HVLFVATDAAPEEFEELKRYL----KRHKVVRYVP--------SLFELNKFKDGGV--AI 342
Query: 510 LDKTICAMSNVFIGASGSTFTEDIMRLRKDWG 541
+D+ IC+ + FIG STFT I R+ G
Sbjct: 343 IDQIICSHARYFIGTHESTFTFRIQEDREIMG 374
>gi|195469737|ref|XP_002099793.1| O-fut2 [Drosophila yakuba]
gi|194187317|gb|EDX00901.1| O-fut2 [Drosophila yakuba]
Length = 490
Score = 45.4 bits (106), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 61/137 (44%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ A P+ + AA I +L ++L+TDA E
Sbjct: 329 GGDYLCAHLRRGDFVRAREATTPT----LKAAAQQIKQLLRAFNMTTVFLATDATPYELM 384
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L LV P ++A+ R L+D A++D+ +CA + F+G
Sbjct: 385 ELKELFY----RFRLVHFAPESNAQ-------RRALKDGGV--AVVDQLVCAYARYFVGT 431
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 432 YESTFTYRIYEEREILG 448
>gi|52208033|emb|CAH41976.1| protein-O-fucosyltransferase 2 [Drosophila yakuba]
Length = 490
Score = 45.4 bits (106), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 61/137 (44%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ A P+ + AA I +L ++L+TDA E
Sbjct: 329 GGDYLCAHLRRGDFVRAREATTPT----LKAAAQQIKQLLRAFNMTTVFLATDATPYELM 384
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L LV P ++A+ R L+D A++D+ +CA + F+G
Sbjct: 385 ELKELFY----RFRLVHFAPESNAQ-------RRALKDGGV--AVVDQLVCAYARYFVGT 431
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 432 YESTFTYRIYEEREILG 448
>gi|196013390|ref|XP_002116556.1| hypothetical protein TRIADDRAFT_50901 [Trichoplax adhaerens]
gi|190580832|gb|EDV20912.1| hypothetical protein TRIADDRAFT_50901 [Trichoplax adhaerens]
Length = 427
Score = 45.4 bits (106), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 68/159 (42%), Gaps = 25/159 (15%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G N++A+H RR FL+ PS + +A I ++ ++L+TDA ES +
Sbjct: 282 GGNYLAVHLRRRDFLQGRKNDIPS----LEEAVLQIKSAMKKTNLSKVFLATDAEESGLN 337
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
++ + + + V P + D + A++D+ ICA + FIG+
Sbjct: 338 SIRDQMEI---MLCYVTTTPDRIQQFGDGGI------------AIIDQWICAHAKYFIGS 382
Query: 525 SGSTFT------EDIMRLRKDWGSTSLCDEYLCQGEEPN 557
STF+ ++M + LC+ + E P+
Sbjct: 383 KESTFSFRIYEERELMGFKSKMTYNQLCNHHQSNCEGPS 421
>gi|18921103|ref|NP_569916.1| O-fucosyltransferase 2 [Drosophila melanogaster]
gi|74872809|sp|Q9W589.1|OFUT2_DROME RecName: Full=GDP-fucose protein O-fucosyltransferase 2; AltName:
Full=Peptide-O-fucosyltransferase 2; Short=O-FucT-2;
Flags: Precursor
gi|7290166|gb|AAF45629.1| O-fucosyltransferase 2 [Drosophila melanogaster]
gi|15010504|gb|AAK77300.1| GH07929p [Drosophila melanogaster]
gi|220945090|gb|ACL85088.1| O-fut2-PA [synthetic construct]
gi|220954914|gb|ACL90000.1| O-fut2-PA [synthetic construct]
Length = 490
Score = 45.1 bits (105), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 17/139 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ +A P+ + AA + +L ++L+TDA E
Sbjct: 329 GGDYLCAHLRRGDFVRSRDATTPT----LKAAAQQVKQLLRGFNMTTVFLATDATPYELM 384
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L LV P ++ + R L+D A++D+ +CA + F+G
Sbjct: 385 ELKELFY----RFRLVHFAPESNVQ-------RRELKDGGV--AVVDQLVCAYARYFVGT 431
Query: 525 SGSTFTEDIMRLRKDWGST 543
STFT I R+ G T
Sbjct: 432 YESTFTYRIYEEREILGFT 450
>gi|350592216|ref|XP_003483417.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like [Sus
scrofa]
Length = 410
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 61/135 (45%), Gaps = 20/135 (14%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG ++ +H RR F+ PS + A I L E + P ++++TDA E
Sbjct: 265 LGGPYLGVHLRRKDFIWGHRQDVPS----LDGAVRRIRSLMEAHQLPKVFVATDAVRKEL 320
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
L+ L+ +++ P W+ L L D V A++D+ ICA + FIG
Sbjct: 321 EELKRLLP------EMLRFEP-----TWEEL----ELYKDGGV-AIIDQWICAHARFFIG 364
Query: 524 ASGSTFTEDIMRLRK 538
S STF+ I R+
Sbjct: 365 TSVSTFSFRIQEERE 379
>gi|340724690|ref|XP_003400714.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like [Bombus
terrestris]
Length = 443
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 73/162 (45%), Gaps = 19/162 (11%)
Query: 395 TAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYL 454
T ++ ++ +G ++++H RR FL +PS I AA + + + +++
Sbjct: 288 TKEKGIRNAVGGPYLSVHLRRRDFL----IGRPSTVPTIQSAASQLKNKLDELQLKSLFV 343
Query: 455 STDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTI 514
+TDA + E S L+ + N + + D +L + D V A++D+ I
Sbjct: 344 ATDATQEEFSELKQYLS-NYTVLRFISS---------DYVLSKF---KDGGV-AIIDQII 389
Query: 515 CAMSNVFIGASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEE 555
+ + FIG ST+T I R+ G T YLC+G E
Sbjct: 390 SSHARYFIGTYESTYTFRIQEDREIIGFPTRTTFNYLCKGTE 431
>gi|431893772|gb|ELK03590.1| GDP-fucose protein O-fucosyltransferase 2 [Pteropus alecto]
Length = 364
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 20/134 (14%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++ +H RR F+ PS + A I L E+ ++++TDAA +E
Sbjct: 220 GGPYLGVHLRRKDFIWGHREDVPS----LEGAVRKIRSLMEQHGLQQVFVATDAAPAEHE 275
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
LL+ L+ +V+ P W L L D V A++D+ +CA + FIG
Sbjct: 276 LLRRLLP------EMVRFEP-----TWGEL----ELYKDGGV-AIIDQWVCAHARFFIGT 319
Query: 525 SGSTFTEDIMRLRK 538
S STF+ I R+
Sbjct: 320 SVSTFSFRIHEERE 333
>gi|195347632|ref|XP_002040356.1| GM18972 [Drosophila sechellia]
gi|194121784|gb|EDW43827.1| GM18972 [Drosophila sechellia]
Length = 490
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 17/139 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ A P+ + AA + +L ++L+TDA E
Sbjct: 329 GGDYLCAHLRRGDFVRSREATTPT----LKTAAQQVKQLLRAFNVTTVFLATDATPYELM 384
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L LV P ++ + R L+D A++D+ +CA + F+G
Sbjct: 385 ELKELFY----RFRLVHFAPESNVQ-------RRELKDGGV--AVVDQLVCAYARYFVGT 431
Query: 525 SGSTFTEDIMRLRKDWGST 543
STFT I R+ G T
Sbjct: 432 YESTFTYRIYEEREILGFT 450
>gi|195553969|ref|XP_002076798.1| GD24636 [Drosophila simulans]
gi|194202802|gb|EDX16378.1| GD24636 [Drosophila simulans]
Length = 490
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 61/139 (43%), Gaps = 17/139 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ A P+ + AA + +L ++L+TDA E
Sbjct: 329 GGDYLCAHLRRGDFVRSREATTPT----LKTAAQQVKQLLRAFNVTTVFLATDATPYELM 384
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L LV P ++ + R L+D A++D+ +CA + F+G
Sbjct: 385 ELKELFY----RFRLVHFAPESNVQ-------RRELKDGGV--AVVDQLVCAYARYFVGT 431
Query: 525 SGSTFTEDIMRLRKDWGST 543
STFT I R+ G T
Sbjct: 432 YESTFTYRIYEEREILGFT 450
>gi|194912464|ref|XP_001982511.1| GG12697 [Drosophila erecta]
gi|190648187|gb|EDV45480.1| GG12697 [Drosophila erecta]
Length = 490
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 66/155 (42%), Gaps = 17/155 (10%)
Query: 387 EPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAER 446
E R M +R + G +++ H RR F++ A PS + AA + +L
Sbjct: 311 EVQRPAMWELERPKRNAKGGDYLCAHLRRGDFVRSREATTPS----LKAAAQQVKQLLRA 366
Query: 447 AKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQV 506
++L+TDA E L+ L LV P ++ + R L+D
Sbjct: 367 FNMTTVFLATDATPYELMELKELFY----RFRLVHFAPESNVQ-------RRELKDGGV- 414
Query: 507 EAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWG 541
A++D+ +CA + F+G STFT I R+ G
Sbjct: 415 -AVVDQLVCAYARYFVGTYESTFTYRIYEEREILG 448
>gi|440741885|ref|ZP_20921217.1| VWA containing CoxE family protein [Pseudomonas syringae BRIP39023]
gi|440378409|gb|ELQ15031.1| VWA containing CoxE family protein [Pseudomonas syringae BRIP39023]
Length = 392
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ESVWKNNQRRNSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ F
Sbjct: 331 -EPGYVWM---QRFMAKF 344
>gi|422640229|ref|ZP_16703656.1| VWA containing CoxE family protein [Pseudomonas syringae Cit 7]
gi|330952620|gb|EGH52880.1| VWA containing CoxE family protein [Pseudomonas syringae Cit 7]
Length = 392
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ESVWKNNQRRNSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ F
Sbjct: 331 -EPGYVWM---QRFMAKF 344
>gi|350398340|ref|XP_003485164.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like [Bombus
impatiens]
Length = 481
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 73/162 (45%), Gaps = 19/162 (11%)
Query: 395 TAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYL 454
T ++ ++ +G ++++H RR FL +PS I AA + + + +++
Sbjct: 326 TKEKGIRNAVGGPYLSVHLRRRDFL----IGRPSTVPTIQSAASQLKNKLDELQLKSLFV 381
Query: 455 STDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTI 514
+TDA + E S L+ + N + + D +L + D V A++D+ I
Sbjct: 382 ATDATQEEFSELKQYLS-NYTVLRFISS---------DYVLSKF---KDGGV-AIIDQII 427
Query: 515 CAMSNVFIGASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEE 555
+ + FIG ST+T I R+ G T YLC+G E
Sbjct: 428 SSHARYFIGTYESTYTFRIQEDREIIGFPTRTTFNYLCKGTE 469
>gi|162287104|ref|NP_001072179.1| protein O-fucosyltransferase 2 precursor [Xenopus (Silurana)
tropicalis]
gi|75911318|gb|ABA29471.1| protein O-fucosyltransferase 2 [Xenopus (Silurana) tropicalis]
Length = 435
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 20/135 (14%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG ++A+H RR F+ PS + +AAD I L ++ K ++++TDA +
Sbjct: 290 LGGPYLAVHLRRKDFIWGHREDVPS----LRRAADEIHSLMKKLKLKKVFVATDADREDL 345
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
L+ L+ + A W+ L L D V A++D+ ICA + FIG
Sbjct: 346 EALRKLIPEMVRFEA-----------TWEEL----ELYKDGGV-AIIDQWICAHARYFIG 389
Query: 524 ASGSTFTEDIMRLRK 538
S STF+ I R+
Sbjct: 390 TSVSTFSFRIHEERE 404
>gi|237800284|ref|ZP_04588745.1| VWA containing CoxE family protein [Pseudomonas syringae pv. oryzae
str. 1_6]
gi|331023141|gb|EGI03198.1| VWA containing CoxE family protein [Pseudomonas syringae pv. oryzae
str. 1_6]
Length = 392
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ESVWKNNLRRGSERTSTQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ F
Sbjct: 331 -EPGYVWM---QRFMAKF 344
>gi|402222473|gb|EJU02539.1| hypothetical protein DACRYDRAFT_99657 [Dacryopinax sp. DJM-731 SS1]
Length = 572
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 66/159 (41%), Gaps = 26/159 (16%)
Query: 408 FIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLA-------ER-AKAPVIYLSTDAA 459
FI +H RR F K C ++P C +P AD + RL+ ER AP YL T
Sbjct: 430 FIGIHIRRGDFGKLCPQEQPDC---LPTVADHVDRLSLIRGNLTERLGHAPEYYLMTTDE 486
Query: 460 ESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSN 519
+ L ++ V G T LV + E++ + Y +LD + +MS
Sbjct: 487 TDKGFLYEA--VRAGWT--LVDHEKERTVERF-GMWY----------PTLLDSVMLSMST 531
Query: 520 VFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNF 558
F+G S ST + R +DW + + G P F
Sbjct: 532 GFLGTSTSTLSIMARRRVEDWNNGFGLFQPFPGGMGPQF 570
>gi|327270672|ref|XP_003220113.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like, partial
[Anolis carolinensis]
Length = 334
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 60/129 (46%), Gaps = 20/129 (15%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T +G ++A+H RR F+ + PS I A I L + K ++++TDA
Sbjct: 187 TAIGGPYLAVHLRRKDFIWGHRSDVPS----IQGAVKKIHSLLKLHKLERVFVATDAVVE 242
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVF 521
ET L+ L+ +V+ P W+ + D + A++D+ ICA + F
Sbjct: 243 ETEELKRLLP------QMVRFEP-----TWEEV----EQYKDGGI-AVIDQWICAHARYF 286
Query: 522 IGASGSTFT 530
IG S STF+
Sbjct: 287 IGTSVSTFS 295
>gi|52208035|emb|CAH41977.1| protein-O-fucosyltransferase 2 [Bombyx mori]
Length = 362
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 65/144 (45%), Gaps = 16/144 (11%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T L N++ +H+RR F K+ + P+ + Q + + + ++++TDA+
Sbjct: 210 TELCKNYLGVHWRRQDFAKYRSKDVPTTSGTVKQIQNIMQNKLPHVQT--VFIATDASAK 267
Query: 462 ETSLLQSLVVLNGKTIAL-VKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNV 520
E +LQ+ N +T+ V N AE E D A+L++ IC+ +
Sbjct: 268 EREILQN----NLETLGYNVHYYVPNIAEV---------TEFDDGTIAILEQIICSHAAF 314
Query: 521 FIGASGSTFTEDIMRLRKDWGSTS 544
F+G STFT I R+ G S
Sbjct: 315 FVGTHESTFTFRIQEERELLGFPS 338
>gi|432098536|gb|ELK28245.1| GDP-fucose protein O-fucosyltransferase 2 [Myotis davidii]
Length = 364
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 20/127 (15%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG ++A+H RR F+ PS + A I L + + ++++TDA E
Sbjct: 219 LGGPYLAVHLRRKDFIWGHREDVPS----LEGAVKKIRSLMKTHRLDKVFVATDAVRKEY 274
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
L+ L+ +V+ P W+ L L D V A++D+ +CA + FIG
Sbjct: 275 EELKRLLP------EMVRFEP-----TWEEL----ELYKDGGV-AIIDQWVCAHARFFIG 318
Query: 524 ASGSTFT 530
S STF+
Sbjct: 319 TSVSTFS 325
>gi|281341189|gb|EFB16773.1| hypothetical protein PANDA_007701 [Ailuropoda melanoleuca]
Length = 387
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 22/128 (17%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG ++A+H RR F+ PS + A I L + + ++++TDA +E
Sbjct: 242 LGGPYLAVHLRRKDFIWGHREDVPS----LDGAVKKIRSLMKTHQLDKVFVATDAIRTEH 297
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSL-LYRHHLEDDSQVEAMLDKTICAMSNVFI 522
L+ L+ +++ P W+ L LYR D V A++D+ ICA + FI
Sbjct: 298 ETLKKLLP------EMLRFEP-----TWEELELYR-----DGGV-AIIDQWICAHARFFI 340
Query: 523 GASGSTFT 530
G S STF+
Sbjct: 341 GTSVSTFS 348
>gi|345327633|ref|XP_001508267.2| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like
[Ornithorhynchus anatinus]
Length = 539
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 64/142 (45%), Gaps = 20/142 (14%)
Query: 400 VQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAA 459
+ T LG ++ +H RR F+ PS + AA I L + K ++++TDA
Sbjct: 389 LGTALGGPYLGVHLRRRDFVWGHREDVPS----LKGAAKKIRSLMKIHKLERVFVATDAV 444
Query: 460 ESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSN 519
E L+ L+ +V+ P W+ L L D + A++D+ ICA +
Sbjct: 445 VEEYEELKRLLP------EMVRFEP-----TWEDL----ELYKDGGI-AIIDQWICAHAR 488
Query: 520 VFIGASGSTFTEDIMRLRKDWG 541
FIG S STF+ I R+ G
Sbjct: 489 YFIGTSVSTFSFRIHEEREILG 510
>gi|195448459|ref|XP_002071667.1| GK25021 [Drosophila willistoni]
gi|194167752|gb|EDW82653.1| GK25021 [Drosophila willistoni]
Length = 511
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 58/137 (42%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++ H RR F+K P+ + AA I +L +++STDA E
Sbjct: 343 GGEYLCAHLRRGDFVKSRETTTPT----LKSAAQQIKQLLRAFNLTTVFVSTDATPYELL 398
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L + P +SA+ R L+D A++D+ IC+ + F+G
Sbjct: 399 ELKELFY----RFRFIHFTPESSAQ-------RLQLKDGGV--AIVDQLICSYARYFVGT 445
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 446 YESTFTYRIYEEREILG 462
>gi|195044300|ref|XP_001991794.1| GH11874 [Drosophila grimshawi]
gi|193901552|gb|EDW00419.1| GH11874 [Drosophila grimshawi]
Length = 499
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 61/137 (44%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ + P+ + AA I +L +++++DA E
Sbjct: 344 GGDYLCAHLRRGDFVRSRESTTPT----LKAAAQQIKQLLRAFNLSSVFVASDATPYELL 399
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L V P ++A+ RH L+D A++D+ ICA + F+G
Sbjct: 400 ELKELFY----RFRFVHFTPESNAQ-------RHQLKDGGV--AIVDQLICAYARHFVGT 446
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 447 YESTFTYRIYEEREILG 463
>gi|148298691|ref|NP_001091743.1| protein O-fucosyltransferase 2 precursor [Bombyx mori]
gi|75911324|gb|ABA29474.1| protein O-fucosyltransferase 2 [Bombyx mori]
Length = 405
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 65/144 (45%), Gaps = 16/144 (11%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T L N++ +H+RR F K+ + P+ + Q + + + ++++TDA+
Sbjct: 253 TELCKNYLGVHWRRQDFAKYRSKDVPTTSGTVKQIQNIMQNKLPHVQT--VFIATDASAK 310
Query: 462 ETSLLQSLVVLNGKTIAL-VKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNV 520
E +LQ+ N +T+ V N AE E D A+L++ IC+ +
Sbjct: 311 EREILQN----NLETLGYNVHYYVPNIAEV---------TEFDDGTIAILEQIICSHAAF 357
Query: 521 FIGASGSTFTEDIMRLRKDWGSTS 544
F+G STFT I R+ G S
Sbjct: 358 FVGTHESTFTFRIQEERELLGFPS 381
>gi|443693029|gb|ELT94489.1| hypothetical protein CAPTEDRAFT_128307, partial [Capitella teleta]
Length = 390
Score = 42.7 bits (99), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 68/162 (41%), Gaps = 28/162 (17%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
+G ++A+H RR FL PS + AA I + +Y++TDA + E
Sbjct: 242 IGGPYMAVHLRRKDFLSSHTKDVPS----LAGAAKQIREKLKEFDLNAVYVATDAPKEEF 297
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLED--DSQVEAMLDKTICAMSNVF 521
LQS + +VK P + L+D D V A++D+ ICA + F
Sbjct: 298 LELQSHL----SEFKVVKFEPT-----------QEQLDDYKDGGV-AIIDQWICAHARFF 341
Query: 522 IGASGSTFTEDIMRLRKDWG------STSLCDEYLCQGEEPN 557
+G STFT I R+ G C + + + E+P
Sbjct: 342 MGTGVSTFTFRIHEEREILGFDQKMTFNRFCGDNIAECEQPT 383
>gi|359323522|ref|XP_003640116.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like [Canis
lupus familiaris]
Length = 391
Score = 42.4 bits (98), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 22/128 (17%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG ++A+H RR F+ PS + A I L + + ++++TDA +E
Sbjct: 246 LGGPYLAVHLRRKDFIWGHREDVPS----LDGAVRKIRSLMKTHRLDKVFVATDAVRTEH 301
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSL-LYRHHLEDDSQVEAMLDKTICAMSNVFI 522
L+ L+ +++ P W+ L LYR D V A++D+ IC+ + FI
Sbjct: 302 EELKKLLP------EMLRFEP-----TWEELELYR-----DGGV-AIIDQWICSHARFFI 344
Query: 523 GASGSTFT 530
G S STF+
Sbjct: 345 GTSVSTFS 352
>gi|301767386|ref|XP_002919115.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like
[Ailuropoda melanoleuca]
Length = 560
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 22/128 (17%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
LG ++A+H RR F+ PS + A I L + + ++++TDA +E
Sbjct: 415 LGGPYLAVHLRRKDFIWGHREDVPS----LDGAVKKIRSLMKTHQLDKVFVATDAIRTEH 470
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSL-LYRHHLEDDSQVEAMLDKTICAMSNVFI 522
L+ L+ +++ P W+ L LYR D V A++D+ ICA + FI
Sbjct: 471 ETLKKLLP------EMLRFEP-----TWEELELYR-----DGGV-AIIDQWICAHARFFI 513
Query: 523 GASGSTFT 530
G S STF+
Sbjct: 514 GTSVSTFS 521
>gi|268572421|ref|XP_002641317.1| C. briggsae CBR-PAD-2 protein [Caenorhabditis briggsae]
Length = 490
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 397 QRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLST 456
+R +T G ++ +H+RR FL + + PS I A+ + +L E+ K IYL+T
Sbjct: 316 ERAKRTAKGGPYLGIHWRRRDFLYARHDQLPS----IKGTAEILEKLCEKLKLNRIYLAT 371
Query: 457 DAAESETSLLQS 468
DA E E + L+S
Sbjct: 372 DAPEEEVNELKS 383
>gi|422655022|ref|ZP_16717741.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
actinidiae str. M302091]
gi|330968098|gb|EGH68358.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
actinidiae str. M302091]
Length = 392
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
ETVWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ETVWKNNLRRGSERTATQDVLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
E + M QRF+ F
Sbjct: 331 -EAGYVWM---QRFMAKF 344
>gi|404401944|ref|ZP_10993528.1| VWA containing CoxE family protein [Pseudomonas fuscovaginae
UPB0736]
Length = 392
Score = 42.4 bits (98), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNLRRGSERTSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP L M QRF++ +
Sbjct: 331 -EPGYLWM---QRFMEKY 344
>gi|156389496|ref|XP_001635027.1| predicted protein [Nematostella vectensis]
gi|156222116|gb|EDO42964.1| predicted protein [Nematostella vectensis]
Length = 388
Score = 42.0 bits (97), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 70/160 (43%), Gaps = 25/160 (15%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +I +H RR FL K PS + A + + ++ K ++LSTDA +S
Sbjct: 241 GGPYIGVHLRRADFLYAHPDKVPS----LDNVAKQLKKFLKKEKLEKVFLSTDADQSGKC 296
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
++ L K + PP+ EK+ D V A++D+ ICA + FIG
Sbjct: 297 ---NVDYLKSKLPIVKYDPPQRILEKFG----------DGGV-AIIDQWICAHARFFIGT 342
Query: 525 SGSTFT------EDIMRLRKDWGSTSLC-DEYLCQGEEPN 557
STF+ DI+ D LC D+ + + E+P
Sbjct: 343 CESTFSFRIYEERDILGFHGDQTFNCLCGDKEIGKCEQPT 382
>gi|380021096|ref|XP_003694410.1| PREDICTED: GDP-fucose protein O-fucosyltransferase 2-like, partial
[Apis florea]
Length = 395
Score = 42.0 bits (97), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 19/153 (12%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
+G ++++H RR FL ++ P+ I AA + + ++++TDA + E
Sbjct: 249 IGGPYLSVHLRRRDFLIGRSSTVPT----IKSAAFQLKNKLHELQLKFLFVATDATQKEF 304
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIG 523
+ L+ + +++ P N Y D + A++D+ IC+ + FIG
Sbjct: 305 NELREYL----SNYTILRFIPSN---------YVLDKFKDGGI-AIIDQIICSYARYFIG 350
Query: 524 ASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEE 555
STFT I R+ G T YLC+ +E
Sbjct: 351 TYESTFTFRIQEDREIIGFPTKTTFNYLCRTKE 383
>gi|422591861|ref|ZP_16666497.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
morsprunorum str. M302280]
gi|330879576|gb|EGH13725.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
morsprunorum str. M302280]
Length = 392
Score = 42.0 bits (97), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
ETVWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ETVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
E + M QRF+ F
Sbjct: 331 -EAGYVWM---QRFMAKF 344
>gi|28871058|ref|NP_793677.1| hypothetical protein PSPTO_3906 [Pseudomonas syringae pv. tomato
str. DC3000]
gi|213969709|ref|ZP_03397844.1| conserved hypothetical protein [Pseudomonas syringae pv. tomato T1]
gi|301384324|ref|ZP_07232742.1| VWA containing CoxE family protein [Pseudomonas syringae pv. tomato
Max13]
gi|302062126|ref|ZP_07253667.1| VWA containing CoxE family protein [Pseudomonas syringae pv. tomato
K40]
gi|302130116|ref|ZP_07256106.1| VWA containing CoxE family protein [Pseudomonas syringae pv. tomato
NCPPB 1108]
gi|28854308|gb|AAO57372.1| conserved protein of unknown function [Pseudomonas syringae pv.
tomato str. DC3000]
gi|213925517|gb|EEB59077.1| conserved hypothetical protein [Pseudomonas syringae pv. tomato T1]
Length = 392
Score = 42.0 bits (97), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
ETVWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ETVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
E + M QRF+ F
Sbjct: 331 -EAGYVWM---QRFMAKF 344
>gi|241732387|ref|XP_002413854.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507676|gb|EEC17162.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 42.0 bits (97), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 64/141 (45%), Gaps = 22/141 (15%)
Query: 402 TFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAES 461
T +G F+A+H RR +L + +P + AA+ I L +R K ++++TD
Sbjct: 81 TAVGGPFLAVHLRRQDYL----SSRPKDVPDLRWAAEQIQALLDRLKLTKVFVATDGTAD 136
Query: 462 ETSLLQSLVVLNGKTIALVKRPPRNSAEKWD-SLLYRHHLEDDSQVEAMLDKTICAMSNV 520
E L+S + A +++ SL + L+D A++D+ I A +
Sbjct: 137 EFQTLKSYI---------------PGAVRYETSLETKKKLKDGGL--AIIDQWIAAHARY 179
Query: 521 FIGASGSTFTEDIMRLRKDWG 541
F+G STF+ I R+ G
Sbjct: 180 FVGTYESTFSFRIQEEREILG 200
>gi|388543364|ref|ZP_10146655.1| VWA containing CoxE family protein [Pseudomonas sp. M47T1]
gi|388278676|gb|EIK98247.1| VWA containing CoxE family protein [Pseudomonas sp. M47T1]
Length = 392
Score = 42.0 bits (97), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + QD+ K+ D VI VGD A E + QPGG + H
Sbjct: 276 ESVWKNNMRRTSERTSTQDLLHKYGADWKVIFVGDAAMAPYE---ITQPGGSVEH 327
>gi|422299626|ref|ZP_16387187.1| hypothetical protein Pav631_3750 [Pseudomonas avellanae BPIC 631]
gi|407988401|gb|EKG30949.1| hypothetical protein Pav631_3750 [Pseudomonas avellanae BPIC 631]
Length = 392
Score = 42.0 bits (97), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
ETVWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ETVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
E + M QRF+ F
Sbjct: 331 -EAGYVWM---QRFMAKF 344
>gi|289676559|ref|ZP_06497449.1| VWA containing CoxE family protein, partial [Pseudomonas syringae
pv. syringae FF5]
Length = 390
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNQRRSSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|422619737|ref|ZP_16688424.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
japonica str. M301072]
gi|422632002|ref|ZP_16697179.1| VWA containing CoxE family protein [Pseudomonas syringae pv. pisi
str. 1704B]
gi|422664857|ref|ZP_16724730.1| VWA containing CoxE family protein [Pseudomonas syringae pv. aptata
str. DSM 50252]
gi|440719642|ref|ZP_20900067.1| VWA containing CoxE family protein [Pseudomonas syringae BRIP34876]
gi|440728855|ref|ZP_20909057.1| VWA containing CoxE family protein [Pseudomonas syringae BRIP34881]
gi|443644883|ref|ZP_21128733.1| hypothetical protein PssB64_4272 [Pseudomonas syringae pv. syringae
B64]
gi|330900104|gb|EGH31523.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
japonica str. M301072]
gi|330941919|gb|EGH44636.1| VWA containing CoxE family protein [Pseudomonas syringae pv. pisi
str. 1704B]
gi|330975276|gb|EGH75342.1| VWA containing CoxE family protein [Pseudomonas syringae pv. aptata
str. DSM 50252]
gi|440360638|gb|ELP97901.1| VWA containing CoxE family protein [Pseudomonas syringae BRIP34881]
gi|440367496|gb|ELQ04558.1| VWA containing CoxE family protein [Pseudomonas syringae BRIP34876]
gi|443284900|gb|ELS43905.1| hypothetical protein PssB64_4272 [Pseudomonas syringae pv. syringae
B64]
Length = 392
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNQRRSSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|398971945|ref|ZP_10683865.1| hypothetical protein PMI25_05633 [Pseudomonas sp. GM30]
gi|398137868|gb|EJM26904.1| hypothetical protein PMI25_05633 [Pseudomonas sp. GM30]
Length = 392
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNLRRTSERFSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP L M QRF++ +
Sbjct: 331 -EPGYLWM---QRFMEKY 344
>gi|408483062|ref|ZP_11189281.1| VWA containing CoxE family protein [Pseudomonas sp. R81]
Length = 392
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNQRRTSERTSTQDVLHKYGADYKVIFIGDASMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ +
Sbjct: 331 -EPGYVWM---QRFMAKY 344
>gi|424066656|ref|ZP_17804120.1| hypothetical protein Pav013_1445 [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
gi|408002113|gb|EKG42377.1| hypothetical protein Pav013_1445 [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
Length = 392
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNQRRSSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|424071303|ref|ZP_17808729.1| hypothetical protein Pav037_1418 [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
gi|407999236|gb|EKG39623.1| hypothetical protein Pav037_1418 [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
Length = 392
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNQRRSSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|189238778|ref|XP_001814855.1| PREDICTED: similar to protein O-fucosyltransferase 2 [Tribolium
castaneum]
Length = 403
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 57/137 (41%), Gaps = 19/137 (13%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++ +H RR FL+ + PS I IT L E+ ++++TDA E E
Sbjct: 259 GGPYLGVHLRRRDFLRGRAHQVPS----INSITQQITNLLEKLNLSQVFIATDADEDEYR 314
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
++K P + R ++D A++D+ IC+ + FIG
Sbjct: 315 -------------DIIKNIPFKTQRYTPDSETREKIKDGGV--AIIDQIICSHARYFIGT 359
Query: 525 SGSTFTEDIMRLRKDWG 541
STF+ I R+ G
Sbjct: 360 YESTFSFRIQEEREILG 376
>gi|302187100|ref|ZP_07263773.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
syringae 642]
Length = 392
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNQRRSSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|298159153|gb|EFI00211.1| Thioredoxin reductase [Pseudomonas savastanoi pv. savastanoi NCPPB
3335]
Length = 392
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ESVWKNNLRRSSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
E + M QRF+ F
Sbjct: 331 -EAGYVWM---QRFIAKF 344
>gi|388469470|ref|ZP_10143679.1| hypothetical protein PseBG33_1642 [Pseudomonas synxantha BG33R]
gi|388006167|gb|EIK67433.1| hypothetical protein PseBG33_1642 [Pseudomonas synxantha BG33R]
Length = 392
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNQRRTSERTSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ +
Sbjct: 331 -EPGYVWM---QRFMAKY 344
>gi|387892761|ref|YP_006323058.1| hypothetical protein PflA506_1551 [Pseudomonas fluorescens A506]
gi|423690614|ref|ZP_17665134.1| hypothetical protein PflSS101_1557 [Pseudomonas fluorescens SS101]
gi|387161659|gb|AFJ56858.1| hypothetical protein PflA506_1551 [Pseudomonas fluorescens A506]
gi|387999562|gb|EIK60891.1| hypothetical protein PflSS101_1557 [Pseudomonas fluorescens SS101]
Length = 392
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNQRRTSERTSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ +
Sbjct: 331 -EPGYVWM---QRFMAKY 344
>gi|332376087|gb|AEE63184.1| unknown [Dendroctonus ponderosae]
Length = 420
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 57/139 (41%), Gaps = 21/139 (15%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G ++A+H RR FL+ K P A I++ K I+++TDA E
Sbjct: 274 GGPYLAVHMRRRDFLRGRADKVPDLL----NVAKQISKKLNELKLAKIFIATDAPRIEFE 329
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVE--AMLDKTICAMSNVFI 522
L + P+ K+ + H+E A++D+ IC+ + FI
Sbjct: 330 ELNKAL------------SPKYEVYKYQA---SKHVESKYSAGGIAIIDQIICSYARYFI 374
Query: 523 GASGSTFTEDIMRLRKDWG 541
G S STF+ I R+ G
Sbjct: 375 GTSDSTFSFRIQEEREILG 393
>gi|398984578|ref|ZP_10690662.1| hypothetical protein PMI23_01070 [Pseudomonas sp. GM24]
gi|399012514|ref|ZP_10714834.1| hypothetical protein PMI19_01613 [Pseudomonas sp. GM16]
gi|398115347|gb|EJM05131.1| hypothetical protein PMI19_01613 [Pseudomonas sp. GM16]
gi|398155390|gb|EJM43833.1| hypothetical protein PMI23_01070 [Pseudomonas sp. GM24]
Length = 392
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNMRRTSERTSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF++ +
Sbjct: 331 -EPGYVWM---QRFMEKY 344
>gi|398854178|ref|ZP_10610755.1| hypothetical protein PMI37_04924 [Pseudomonas sp. GM80]
gi|398236946|gb|EJN22712.1| hypothetical protein PMI37_04924 [Pseudomonas sp. GM80]
Length = 392
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNMRRTSERTSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF++ +
Sbjct: 331 -EPGYVWM---QRFMEKY 344
>gi|77457635|ref|YP_347140.1| hypothetical protein Pfl01_1408 [Pseudomonas fluorescens Pf0-1]
gi|398978171|ref|ZP_10687594.1| hypothetical protein PMI24_03735 [Pseudomonas sp. GM25]
gi|77381638|gb|ABA73151.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1]
gi|398137282|gb|EJM26344.1| hypothetical protein PMI24_03735 [Pseudomonas sp. GM25]
Length = 392
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNMRRTSERTSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF++ +
Sbjct: 331 -EPGYVWM---QRFMEKY 344
>gi|195392940|ref|XP_002055112.1| GJ19195 [Drosophila virilis]
gi|194149622|gb|EDW65313.1| GJ19195 [Drosophila virilis]
Length = 499
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 60/137 (43%), Gaps = 17/137 (12%)
Query: 405 GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETS 464
G +++ H RR F++ + P+ + AA I +L +++++DA E
Sbjct: 340 GGDYLCAHLRRGDFVRSRESTTPT----LKAAAQQIKQLLRAFNLTTVFVASDATPYELL 395
Query: 465 LLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGA 524
L+ L V P ++A+ R L+D A++D+ ICA + F+G
Sbjct: 396 ELKELFY----RFRFVHFTPESNAQ-------RQQLKDGGV--AIVDQLICAYARHFVGT 442
Query: 525 SGSTFTEDIMRLRKDWG 541
STFT I R+ G
Sbjct: 443 YESTFTYRIYEEREILG 459
>gi|422645761|ref|ZP_16708896.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
maculicola str. ES4326]
gi|330959310|gb|EGH59570.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
maculicola str. ES4326]
Length = 392
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 ESVWKNNLRRGSERIATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
E L M QRF+ F
Sbjct: 331 -EAGYLWM---QRFMAKF 344
>gi|395500096|ref|ZP_10431675.1| VWA containing CoxE family protein [Pseudomonas sp. PAMC 25886]
Length = 392
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNMRRTSERTSTQDLLHKYGADYKVIFIGDAAMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF++ +
Sbjct: 331 -EPGYVWM---QRFMEKY 344
>gi|229589060|ref|YP_002871179.1| hypothetical protein PFLU1537 [Pseudomonas fluorescens SBW25]
gi|229360926|emb|CAY47786.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25]
Length = 392
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNQRRTSERTSTQDLLHKYGADYKVIFIGDASMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ +
Sbjct: 331 -EPGYVWM---QRFMAKY 344
>gi|312959607|ref|ZP_07774124.1| hypothetical protein PFWH6_1509 [Pseudomonas fluorescens WH6]
gi|311286324|gb|EFQ64888.1| hypothetical protein PFWH6_1509 [Pseudomonas fluorescens WH6]
Length = 392
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNQRRTSERTSTQDLLHKYGADYKVIFIGDASMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ +
Sbjct: 331 -EPGYVWM---QRFMAKY 344
>gi|410089305|ref|ZP_11285931.1| VWA containing CoxE family protein [Pseudomonas viridiflava
UASWS0038]
gi|409763592|gb|EKN48552.1| VWA containing CoxE family protein [Pseudomonas viridiflava
UASWS0038]
Length = 392
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNLRRGSERTSTQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|242007242|ref|XP_002424451.1| GDP-fucose protein O-fucosyltransferase 2 precursor, putative
[Pediculus humanus corporis]
gi|212507851|gb|EEB11713.1| GDP-fucose protein O-fucosyltransferase 2 precursor, putative
[Pediculus humanus corporis]
Length = 426
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 28/143 (19%)
Query: 404 LGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESET 463
+G N++ +H RR FL PS I AA+ I + + ++++TDA++ E
Sbjct: 279 IGGNYMCVHLRRKDFLYGRKKDVPS----IEGAAEQIGKALQELGLNTVFVATDASKDE- 333
Query: 464 SLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLE-----DDSQVEAMLDKTICAMS 518
KR +N +K+ L Y E D V A++++ IC+ +
Sbjct: 334 -----------------KRKLKNLLDKYKILYYNPSSEIIQTYKDGGV-AIIEQIICSHA 375
Query: 519 NVFIGASGSTFTEDIMRLRKDWG 541
F G STF+ I R+ G
Sbjct: 376 RKFYGTHESTFSFRIQEEREIMG 398
>gi|422397595|ref|ZP_16477327.1| VWA containing CoxE family protein, partial [Pseudomonas syringae
pv. glycinea str. race 4]
gi|330883212|gb|EGH17361.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
glycinea str. race 4]
Length = 142
Score = 40.8 bits (94), Expect = 1.7, Method: Composition-based stats.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 26 ESVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEH 77
>gi|424865740|ref|ZP_18289596.1| von Willebrand factor type A domain protein [SAR86 cluster
bacterium SAR86B]
gi|400758313|gb|EJP72520.1| von Willebrand factor type A domain protein [SAR86 cluster
bacterium SAR86B]
Length = 391
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWK+ R + VQDI K+ D VI VGD A E + PGG I H
Sbjct: 275 ESVWKDNRRRSQERFLVQDIINKYSNDYKVIIVGDATMAPYE---ITNPGGSIEH 326
>gi|447915844|ref|YP_007396412.1| VWA containing CoxE family protein [Pseudomonas poae RE*1-1-14]
gi|445199707|gb|AGE24916.1| VWA containing CoxE family protein [Pseudomonas poae RE*1-1-14]
Length = 392
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNQRRTSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ +
Sbjct: 331 -EPGYVWM---QRFMAKY 344
>gi|302833692|ref|XP_002948409.1| hypothetical protein VOLCADRAFT_88724 [Volvox carteri f.
nagariensis]
gi|300266096|gb|EFJ50284.1| hypothetical protein VOLCADRAFT_88724 [Volvox carteri f.
nagariensis]
Length = 860
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 41/79 (51%), Gaps = 12/79 (15%)
Query: 212 KSDKFLFAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHIN--- 268
+S KFL C GQ SN L CL+ + LA L RVLV+P ++ V+ + ++
Sbjct: 52 ESPKFLLVQCHRGQFSNRLKCLKLGILLARTLQRVLVMP------MFTDVVPMVDVSKYV 105
Query: 269 --DCLGRKV-VVSFENFME 284
+CL R V+F+ F E
Sbjct: 106 SLECLRRNSQAVTFDEFRE 124
>gi|395648356|ref|ZP_10436206.1| VWA containing CoxE family protein [Pseudomonas extremaustralis
14-3 substr. 14-3b]
Length = 392
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 37/78 (47%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E+VWKN R + QD+ K+ D VI +GD A E + Q GG + H +
Sbjct: 276 ESVWKNNQRRTSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQAGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
EP + M QRF+ +
Sbjct: 331 -EPGYVWM---QRFMAKY 344
>gi|289628929|ref|ZP_06461883.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
aesculi str. NCPPB 3681]
gi|289651089|ref|ZP_06482432.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
aesculi str. 2250]
gi|422583239|ref|ZP_16658366.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
aesculi str. 0893_23]
gi|330868073|gb|EGH02782.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
aesculi str. 0893_23]
Length = 392
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNLRRSSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|71737044|ref|YP_273813.1| hypothetical protein PSPPH_1562 [Pseudomonas syringae pv.
phaseolicola 1448A]
gi|71557597|gb|AAZ36808.1| conserved hypothetical protein [Pseudomonas syringae pv.
phaseolicola 1448A]
Length = 392
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|257487638|ref|ZP_05641679.1| hypothetical protein PsyrptA_30483 [Pseudomonas syringae pv. tabaci
str. ATCC 11528]
gi|416015253|ref|ZP_11562866.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
glycinea str. B076]
gi|416028334|ref|ZP_11571390.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
glycinea str. race 4]
gi|422592321|ref|ZP_16666929.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
lachrymans str. M301315]
gi|422683788|ref|ZP_16742045.1| VWA containing CoxE family protein [Pseudomonas syringae pv. tabaci
str. ATCC 11528]
gi|320325231|gb|EFW81298.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
glycinea str. B076]
gi|320327776|gb|EFW83784.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
glycinea str. race 4]
gi|330989373|gb|EGH87476.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
lachrymans str. M301315]
gi|331013119|gb|EGH93175.1| VWA containing CoxE family protein [Pseudomonas syringae pv. tabaci
str. ATCC 11528]
Length = 392
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|422609108|ref|ZP_16681062.1| VWA containing CoxE family protein [Pseudomonas syringae pv. mori
str. 301020]
gi|330894735|gb|EGH27396.1| VWA containing CoxE family protein [Pseudomonas syringae pv. mori
str. 301020]
Length = 392
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|424865064|ref|ZP_18288947.1| CoxE family protein [SAR86 cluster bacterium SAR86B]
gi|400759051|gb|EJP73243.1| CoxE family protein [SAR86 cluster bacterium SAR86B]
Length = 206
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWK+ R + VQD+ K+ +D VI VGD A E + PGG I H
Sbjct: 90 ESVWKDNLRRSQERFLVQDMINKYSSDYKVIVVGDATMAPYE---ITNPGGSIEH 141
>gi|422676401|ref|ZP_16735731.1| VWA containing CoxE family protein [Pseudomonas syringae pv. aceris
str. M302273]
gi|330974105|gb|EGH74171.1| VWA containing CoxE family protein [Pseudomonas syringae pv. aceris
str. M302273]
Length = 392
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNLRRSSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|66044824|ref|YP_234665.1| hypothetical protein Psyr_1579 [Pseudomonas syringae pv. syringae
B728a]
gi|63255531|gb|AAY36627.1| conserved hypothetical protein [Pseudomonas syringae pv. syringae
B728a]
Length = 392
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + QD+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNLRRSSERTATQDLLHKYGEDYKVIFIGDASMAPYE---ITQPGGSVEH 327
>gi|392987618|ref|YP_006486211.1| hypothetical protein EHR_01650 [Enterococcus hirae ATCC 9790]
gi|392335038|gb|AFM69320.1| hypothetical protein EHR_01650 [Enterococcus hirae ATCC 9790]
Length = 201
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 13/125 (10%)
Query: 86 LYFSVNLRSLFSGNYVNFRF-DSLADRMRESELRALSLLKQQQSHLLSLWNQSFVNNSYG 144
Y S++ + LF+G +NF DS+ ++++ R+L + ++QSHL + + S
Sbjct: 80 FYLSLDQKDLFNGYPINFLVEDSIFSDIQKTPFRSLIKINKEQSHLYDA--EKIITRS-- 135
Query: 145 NNTNNPFFQEAKSVLLNQISLNRQ-IEQILLSPHKVSNFTPNDAVWGLESCRKIDSIIPN 203
N T FF E + N+Q I Q LL+ + + F P W + +I + IP
Sbjct: 136 NGTYVSFFIEDSNY-------NKQEINQKLLNYFQKNKFIPERFFWKILWQDEITTTIPE 188
Query: 204 KRTVE 208
KR E
Sbjct: 189 KRVFE 193
>gi|350534918|ref|NP_001232976.1| uncharacterized protein LOC100163639 precursor [Acyrthosiphon
pisum]
gi|239792132|dbj|BAH72442.1| ACYPI004711 [Acyrthosiphon pisum]
Length = 401
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 71/167 (42%), Gaps = 18/167 (10%)
Query: 375 GGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIP 434
GG + ++ S+ + A+ F + L +++ H RR FL PS I
Sbjct: 226 GGQEYWMIRKSMQFSKNLHEIAKEFKKRNLKQDYLCAHLRRRDFL----YGHPSDVPNIN 281
Query: 435 QAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSL 494
A I +Y++TD +++E L++++ RP + + D
Sbjct: 282 NTARQIKEKLILLNIKTVYVATDTSKTEVEELKNVL-----------RPYQMFKFEADKK 330
Query: 495 LYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWG 541
+ L+ A++D+ IC+ +N FIG S STF+ I R+ G
Sbjct: 331 TLNNFLDGGV---AIIDQIICSQANYFIGTSKSTFSFRIQEEREILG 374
>gi|88705069|ref|ZP_01102781.1| conserved hypothetical protein [Congregibacter litoralis KT71]
gi|88700764|gb|EAQ97871.1| conserved hypothetical protein [Congregibacter litoralis KT71]
Length = 391
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + DI K+ D VI +GD + E +MQPGG + H
Sbjct: 275 ESVWKNNVRRHAERTNMLDILHKYAHDYKVIIIGDASMSPYE---IMQPGGSVEH 326
>gi|422660428|ref|ZP_16722840.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
lachrymans str. M302278]
gi|331019033|gb|EGH99089.1| VWA containing CoxE family protein [Pseudomonas syringae pv.
lachrymans str. M302278]
Length = 392
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 9/78 (11%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTL 385
E VWKN R + QD+ K+ D VI +GD A E + QPGG + H +
Sbjct: 276 EIVWKNNLRRGSERTATQDLLHKYGADYKVIFIGDASMAPYE---ITQPGGSVEHWNE-- 330
Query: 386 IEPSRLIMVTAQRFVQTF 403
E + M QRF+ F
Sbjct: 331 -EAGYVWM---QRFMAKF 344
>gi|291241561|ref|XP_002740679.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 446
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 32/132 (24%), Positives = 57/132 (43%), Gaps = 15/132 (11%)
Query: 422 CNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVK 481
C AKK + A+ I + ++ K +Y++T L S ++ L K
Sbjct: 311 CEAKKSEVLLMADKVAENIADIMQKKKIECVYVAT--------LTWSQTIVEH----LAK 358
Query: 482 RPPRNSAEKWDSLLYRHHLE---DDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRK 538
R PRN D++ L DD +++++ IC + +FI + S +++ +M R
Sbjct: 359 RIPRNKIYISDNITAGDDLSVLRDDLYTMSLVEQEICFRAEIFIASCASHWSQFVMADRD 418
Query: 539 DWGSTSLCDEYL 550
G T+ C L
Sbjct: 419 GVGKTTDCTSEL 430
>gi|398888257|ref|ZP_10642688.1| hypothetical protein PMI31_00479 [Pseudomonas sp. GM55]
gi|398191086|gb|EJM78287.1| hypothetical protein PMI31_00479 [Pseudomonas sp. GM55]
Length = 392
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNMRRTSERTSTMDLLHKYGADYKVIFIGDAAMAPYE---ITQPGGSVEH 327
>gi|398909738|ref|ZP_10654701.1| hypothetical protein PMI29_00512 [Pseudomonas sp. GM49]
gi|398925933|ref|ZP_10662172.1| hypothetical protein PMI28_01783 [Pseudomonas sp. GM48]
gi|398171527|gb|EJM59429.1| hypothetical protein PMI28_01783 [Pseudomonas sp. GM48]
gi|398187437|gb|EJM74775.1| hypothetical protein PMI29_00512 [Pseudomonas sp. GM49]
Length = 392
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNMRRTSERTSTMDLLHKYGADYKVIFIGDAAMAPYE---ITQPGGSVEH 327
>gi|398951174|ref|ZP_10673872.1| hypothetical protein PMI26_01614 [Pseudomonas sp. GM33]
gi|398156956|gb|EJM45366.1| hypothetical protein PMI26_01614 [Pseudomonas sp. GM33]
Length = 392
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D VI +GD A E + QPGG + H
Sbjct: 276 ESVWKNNMRRTSERTSTMDLLHKYGADYKVIFIGDAAMAPYE---ITQPGGSVEH 327
>gi|254514485|ref|ZP_05126546.1| von Willebrand factor type A(vWA) domain protein [gamma
proteobacterium NOR5-3]
gi|219676728|gb|EED33093.1| von Willebrand factor type A(vWA) domain protein [gamma
proteobacterium NOR5-3]
Length = 391
Score = 39.3 bits (90), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + DI K+ +D VI +GD + E ++QPGG + H
Sbjct: 275 ESVWKNNVRRHAERTNMLDILHKYASDYKVIIIGDASMSPYE---IVQPGGSVEH 326
>gi|431926993|ref|YP_007240027.1| hypothetical protein Psest_1851 [Pseudomonas stutzeri RCH2]
gi|431825280|gb|AGA86397.1| hypothetical protein Psest_1851 [Pseudomonas stutzeri RCH2]
Length = 392
Score = 38.9 bits (89), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D V+ VGD A E V QPGG + H
Sbjct: 276 ESVWKNNFRRTSERTSTLDVLHKYGADYKVVFVGDAAMAPYE---VTQPGGSVEH 327
>gi|392421685|ref|YP_006458289.1| VWA containing CoxE family protein [Pseudomonas stutzeri CCUG
29243]
gi|390983873|gb|AFM33866.1| VWA containing CoxE family protein [Pseudomonas stutzeri CCUG
29243]
Length = 392
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D V+ VGD A E V QPGG + H
Sbjct: 276 ESVWKNNFRRTSERTSTLDVLHKYGADYKVVFVGDAAMAPYE---VTQPGGSVEH 327
>gi|326435479|gb|EGD81049.1| hypothetical protein PTSG_13129 [Salpingoeca sp. ATCC 50818]
Length = 660
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 63/153 (41%), Gaps = 25/153 (16%)
Query: 395 TAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYL 454
TA + + ++++H RR FL+ +A P P+ I LA +++
Sbjct: 513 TATTIIASHRQHRYLSVHLRRADFLRVRSANIP----PLASTRKQILALARAHDIRALFI 568
Query: 455 STDAAESETSLLQSLVVLNGKTIALVKRPPR---NSAEKWDSLLYRHHLEDDSQVEAMLD 511
+TDA + + A PP ++ + S L+ L A++
Sbjct: 569 ATDARRD-----------DEEYDAFASNPPLPLVDTPQHIKSTLHPGQL-------AVVQ 610
Query: 512 KTICAMSNVFIGASGSTFTEDIMRLRKDWGSTS 544
+ +CA + +FIG GSTFT I R+ G S
Sbjct: 611 QIVCARAFIFIGTEGSTFTRQIQEERQLRGHRS 643
>gi|424863243|ref|ZP_18287156.1| von Willebrand factor type A domain protein [SAR86 cluster
bacterium SAR86A]
gi|400757864|gb|EJP72075.1| von Willebrand factor type A domain protein [SAR86 cluster
bacterium SAR86A]
Length = 391
Score = 38.9 bits (89), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWK+ R + VQDI KF +D +I VGD A E + GG I H
Sbjct: 275 ESVWKDNRRRTQERFLVQDIINKFSSDYKIIVVGDATMAPYE---ITNAGGSIEH 326
>gi|452747796|ref|ZP_21947588.1| VWA containing CoxE family protein [Pseudomonas stutzeri NF13]
gi|452008359|gb|EME00600.1| VWA containing CoxE family protein [Pseudomonas stutzeri NF13]
Length = 392
Score = 38.5 bits (88), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D V+ VGD A E V QPGG + H
Sbjct: 276 ESVWKNNFRRTSERTSTLDLLHKYGADYKVVFVGDAAMAPYE---VTQPGGSVEH 327
>gi|421617442|ref|ZP_16058431.1| VWA containing CoxE family protein [Pseudomonas stutzeri KOS6]
gi|409780597|gb|EKN60224.1| VWA containing CoxE family protein [Pseudomonas stutzeri KOS6]
Length = 392
Score = 38.5 bits (88), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D V+ VGD A E V QPGG + H
Sbjct: 276 ESVWKNNFRRTSERTSTLDLLHKYGADYKVVFVGDAAMAPYE---VTQPGGSVEH 327
>gi|418292178|ref|ZP_12904128.1| VWA containing CoxE family protein [Pseudomonas stutzeri ATCC 14405
= CCUG 16156]
gi|379063611|gb|EHY76354.1| VWA containing CoxE family protein [Pseudomonas stutzeri ATCC 14405
= CCUG 16156]
Length = 392
Score = 38.5 bits (88), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 326 ETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINH 380
E+VWKN R + + D+ K+ D V+ VGD A E V QPGG + H
Sbjct: 276 ESVWKNNFRRTSERTSTLDLLHKYGADYKVVFVGDAAMAPYE---VTQPGGSVEH 327
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.136 0.412
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,924,388,782
Number of Sequences: 23463169
Number of extensions: 372510389
Number of successful extensions: 959719
Number of sequences better than 100.0: 179
Number of HSP's better than 100.0 without gapping: 45
Number of HSP's successfully gapped in prelim test: 134
Number of HSP's that attempted gapping in prelim test: 959490
Number of HSP's gapped (non-prelim): 195
length of query: 563
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 415
effective length of database: 8,886,646,355
effective search space: 3687958237325
effective search space used: 3687958237325
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 80 (35.4 bits)