BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003228
(837 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255545258|ref|XP_002513690.1| conserved hypothetical protein [Ricinus communis]
gi|223547598|gb|EEF49093.1| conserved hypothetical protein [Ricinus communis]
Length = 976
Score = 886 bits (2289), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 508/868 (58%), Positives = 626/868 (72%), Gaps = 56/868 (6%)
Query: 1 MEPLTAAQDVSIVPDHKIDKFEEYGYA--GNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
MEPLT Q+VS+V D + DK E+ A N+K++ SLE KT TD L SS + E
Sbjct: 134 MEPLTVQQEVSLVSDDEEDKIEKNTSAESSANLKEEYISLEHKTNTDVDLPSSPQIEETH 193
Query: 59 SDNKIDSENETPSTGD-VSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLP 117
++NK+ + + + D ++ S +++++ Q+DLQ +SA D +T S LP
Sbjct: 194 NENKLSGDTDQLLSADNGNYIISSNDTVDNAPVQEDLQYDSAFDSKLGVLETTPNSTNLP 253
Query: 118 EPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFSR 177
E ++ +D NL + GE + + + + S
Sbjct: 254 ESKIAK--------------IDKNL------VNGEPAYSLNIINTITEHTEAKENTIPSS 293
Query: 178 IFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLL 237
S S + SS + ++ I+++SDT E L K ++ AS T + + S +
Sbjct: 294 DSSISPVLKSSEPVV--VSTSITLTSDTVSEVGNLFKDGMDSEASVPTKEELNTSTNQV- 350
Query: 238 SGEDSSSSMEVHDLNKNGSSGTSVSPS-IFPFSNEKETC---DLNESNSSSFTESPPTGS 293
S + +SSS+E++ L ++GSSG + +PF+N+++ D+N S +SS ESPP
Sbjct: 351 STDRNSSSLEMNYLTESGSSGVTSVSEWAYPFANKQDIVANDDMNLSKTSS--ESPPFSG 408
Query: 294 SSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRR 353
S S AG+PAPS V +LQV PGK+LVPAVVDQ GQAL+ALQVLKVIEADV+P DLC RR
Sbjct: 409 SFSSAGVPAPSAVPESLQVSPGKILVPAVVDQTHGQALAALQVLKVIEADVQPSDLCTRR 468
Query: 354 EYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSK 413
EYARWLV+ASS L+RST+SKVYPAMYIEN T+ AFDDITP+DPDFSSIQGLAEAGLISS+
Sbjct: 469 EYARWLVAASSALSRSTLSKVYPAMYIENATEPAFDDITPDDPDFSSIQGLAEAGLISSR 528
Query: 414 LSHRDLLN--EEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDK 471
LS+ DLL+ E+ GP+ F PESPLSRQDLVSWKMALEKRQLPEAN+KILYQLSGF D+DK
Sbjct: 529 LSNHDLLSPVEDQGPLNFSPESPLSRQDLVSWKMALEKRQLPEANRKILYQLSGFRDVDK 588
Query: 472 INPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQ 531
I+PDAWPAL+ADL+AG+QGII+LAFGCTRLFQP+KPVT AQAAVALAIGEASD VNEEL
Sbjct: 589 IHPDAWPALIADLSAGDQGIISLAFGCTRLFQPNKPVTKAQAAVALAIGEASDIVNEELA 648
Query: 532 RIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRA 591
RIEAES AENAVS H+ALVA+VE++IN SFEKEL MEREKI+ VEKMAEEAR ELERLRA
Sbjct: 649 RIEAESMAENAVSAHNALVAQVEQDINASFEKELLMEREKINAVEKMAEEARLELERLRA 708
Query: 592 EREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAEN 651
ERE D ALMKERA+IE+EME+LS+L+ EVEEQL++L+S+KVEISYEKERIN L+KEAEN
Sbjct: 709 EREADNFALMKERASIEAEMEVLSRLKGEVEEQLQTLLSSKVEISYEKERINKLQKEAEN 768
Query: 652 ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDL 711
E QEI+RLQYELEVERKALS+ARAWAEDEAKRARE AK +E ARDRWERQGIKVVVD DL
Sbjct: 769 EKQEISRLQYELEVERKALSIARAWAEDEAKRAREHAKVIEEARDRWERQGIKVVVDNDL 828
Query: 712 REESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFIS 771
REE+ A WV +QFSV+ TVSRA+ LV +LK +A++ GKSKE+INTII KIL+ IS
Sbjct: 829 REETSAGGTWVATARQFSVEGTVSRAEKLVGELKLLADNARGKSKEVINTIIQKILVIIS 888
Query: 772 NLKKWASKASMRAAELKDATILKAKGSVQE----------------------LQQSTAEF 809
LK+W S+A +A ELKDA +LKAK SV+E LQQSTAEF
Sbjct: 889 RLKEWISEARTQAGELKDAAVLKAKESVEELQKNTSEFSSTIKERARGSIYGLQQSTAEF 948
Query: 810 RSNLTEGAKRVAGDCREGVEKLTQRFKT 837
+ EGAKRVAGDCREGVE+LTQRFK+
Sbjct: 949 SFAMKEGAKRVAGDCREGVERLTQRFKS 976
>gi|225464485|ref|XP_002271744.1| PREDICTED: uncharacterized protein LOC100264485 [Vitis vinifera]
Length = 985
Score = 828 bits (2139), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/602 (71%), Positives = 511/602 (84%), Gaps = 12/602 (1%)
Query: 244 SSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESN----SSSFTESPPTGSSSSPAG 299
S +++HDLN +GS+ ++ + +PF ++ D+N N + SF ESP +S S AG
Sbjct: 388 SYVKLHDLNASGSTSSTSALP-YPFDYDQ---DVNLQNKIQRNRSFLESPIAENSFSSAG 443
Query: 300 IPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWL 359
IPAPS VS +L+VLPG+V+VPAVVDQVQGQAL+ALQVLKVIE DV+P DLC RRE+ARWL
Sbjct: 444 IPAPSAVSESLKVLPGQVVVPAVVDQVQGQALAALQVLKVIEPDVQPSDLCTRREFARWL 503
Query: 360 VSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDL 419
VSASS L+R+T+SKVYPAMYI N+T+LAFDDITPEDPDFSSIQGLAEAGLISSKLS RDL
Sbjct: 504 VSASSVLSRNTVSKVYPAMYIGNITELAFDDITPEDPDFSSIQGLAEAGLISSKLSRRDL 563
Query: 420 LN----EEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPD 475
L+ E+ P +F P+SPLSRQDLVSWKMALEKRQLPE +KK+LYQ+SGFIDID INPD
Sbjct: 564 LSFSDEEDQSPFYFSPDSPLSRQDLVSWKMALEKRQLPETDKKVLYQVSGFIDIDSINPD 623
Query: 476 AWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEA 535
AWPAL+AD +AGEQGIIALAFG TRLFQP+KPVT AQAA+ALA GE+SD V+EEL RIEA
Sbjct: 624 AWPALVADASAGEQGIIALAFGYTRLFQPNKPVTKAQAAIALATGESSDIVSEELARIEA 683
Query: 536 ESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREV 595
E+ AE AV+EHSALV +VEKE+N SFEKELS+ER+KID +EK+AEEARQELE+LRAER+
Sbjct: 684 EAMAEKAVAEHSALVDQVEKELNASFEKELSLERKKIDAMEKLAEEARQELEKLRAERDE 743
Query: 596 DKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQE 655
D I+L+KERAAIESEME+LS+LR EVEEQL+S MSNKVEISYEKERI+ LRKEAE+ENQE
Sbjct: 744 DNISLIKERAAIESEMEVLSRLRSEVEEQLQSFMSNKVEISYEKERISKLRKEAESENQE 803
Query: 656 IARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREES 715
IARLQYELEVERKALSMARAWAEDEAKRAREQAKALE ARDRWE+ GIKVVVD +LREE+
Sbjct: 804 IARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDNELREEA 863
Query: 716 DAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKK 775
A V W++ KQFSVD TVSRA++LVDKL AM +D+ GKSK++I+ I+ KI+ IS L++
Sbjct: 864 SAEVTWLDTAKQFSVDGTVSRAENLVDKLNAMGSDLRGKSKDVIDNIVQKIIHLISILRE 923
Query: 776 WASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRF 835
ASK + ELKDA ++KA GS+QELQQ+TAEF + EG KRV GDCR GVEKLTQ+F
Sbjct: 924 LASKVGTQVRELKDAAVVKAGGSIQELQQNTAEFSLAIKEGTKRVVGDCRGGVEKLTQKF 983
Query: 836 KT 837
KT
Sbjct: 984 KT 985
>gi|302143846|emb|CBI22707.3| unnamed protein product [Vitis vinifera]
Length = 1040
Score = 827 bits (2136), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/602 (71%), Positives = 511/602 (84%), Gaps = 12/602 (1%)
Query: 244 SSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESN----SSSFTESPPTGSSSSPAG 299
S +++HDLN +GS+ ++ + +PF ++ D+N N + SF ESP +S S AG
Sbjct: 443 SYVKLHDLNASGSTSSTSALP-YPFDYDQ---DVNLQNKIQRNRSFLESPIAENSFSSAG 498
Query: 300 IPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWL 359
IPAPS VS +L+VLPG+V+VPAVVDQVQGQAL+ALQVLKVIE DV+P DLC RRE+ARWL
Sbjct: 499 IPAPSAVSESLKVLPGQVVVPAVVDQVQGQALAALQVLKVIEPDVQPSDLCTRREFARWL 558
Query: 360 VSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDL 419
VSASS L+R+T+SKVYPAMYI N+T+LAFDDITPEDPDFSSIQGLAEAGLISSKLS RDL
Sbjct: 559 VSASSVLSRNTVSKVYPAMYIGNITELAFDDITPEDPDFSSIQGLAEAGLISSKLSRRDL 618
Query: 420 LN----EEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPD 475
L+ E+ P +F P+SPLSRQDLVSWKMALEKRQLPE +KK+LYQ+SGFIDID INPD
Sbjct: 619 LSFSDEEDQSPFYFSPDSPLSRQDLVSWKMALEKRQLPETDKKVLYQVSGFIDIDSINPD 678
Query: 476 AWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEA 535
AWPAL+AD +AGEQGIIALAFG TRLFQP+KPVT AQAA+ALA GE+SD V+EEL RIEA
Sbjct: 679 AWPALVADASAGEQGIIALAFGYTRLFQPNKPVTKAQAAIALATGESSDIVSEELARIEA 738
Query: 536 ESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREV 595
E+ AE AV+EHSALV +VEKE+N SFEKELS+ER+KID +EK+AEEARQELE+LRAER+
Sbjct: 739 EAMAEKAVAEHSALVDQVEKELNASFEKELSLERKKIDAMEKLAEEARQELEKLRAERDE 798
Query: 596 DKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQE 655
D I+L+KERAAIESEME+LS+LR EVEEQL+S MSNKVEISYEKERI+ LRKEAE+ENQE
Sbjct: 799 DNISLIKERAAIESEMEVLSRLRSEVEEQLQSFMSNKVEISYEKERISKLRKEAESENQE 858
Query: 656 IARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREES 715
IARLQYELEVERKALSMARAWAEDEAKRAREQAKALE ARDRWE+ GIKVVVD +LREE+
Sbjct: 859 IARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDNELREEA 918
Query: 716 DAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKK 775
A V W++ KQFSVD TVSRA++LVDKL AM +D+ GKSK++I+ I+ KI+ IS L++
Sbjct: 919 SAEVTWLDTAKQFSVDGTVSRAENLVDKLNAMGSDLRGKSKDVIDNIVQKIIHLISILRE 978
Query: 776 WASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRF 835
ASK + ELKDA ++KA GS+QELQQ+TAEF + EG KRV GDCR GVEKLTQ+F
Sbjct: 979 LASKVGTQVRELKDAAVVKAGGSIQELQQNTAEFSLAIKEGTKRVVGDCRGGVEKLTQKF 1038
Query: 836 KT 837
KT
Sbjct: 1039 KT 1040
>gi|224134861|ref|XP_002321923.1| predicted protein [Populus trichocarpa]
gi|222868919|gb|EEF06050.1| predicted protein [Populus trichocarpa]
Length = 592
Score = 826 bits (2134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/589 (71%), Positives = 483/589 (82%), Gaps = 33/589 (5%)
Query: 282 SSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLK--- 338
S F E P S S AGIPAPS VSAALQVLPGKVLVPAVVDQ+QGQ +ALQVLK
Sbjct: 4 SEPFFELPTPEISFSSAGIPAPSAVSAALQVLPGKVLVPAVVDQLQGQTFAALQVLKKNV 63
Query: 339 ------------------------VIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKV 374
VIEADV+P DLC RREYARWLV+ASS L+RST+SKV
Sbjct: 64 DYQFKIFLVLVLFFIFYFFINLFQVIEADVQPSDLCTRREYARWLVAASSVLSRSTVSKV 123
Query: 375 YPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLN---EEPGPIFFLP 431
YPAMYIENVT+LAFDDITP+DPDFSSIQGLAEAG ISSKLS+ DLL+ E GP +F
Sbjct: 124 YPAMYIENVTELAFDDITPDDPDFSSIQGLAEAGFISSKLSNHDLLSSSVENQGPFYFAA 183
Query: 432 ESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGI 491
ESPLSRQDLVSWKMAL+KRQLPEA+KK+LY+LSGF DIDKINPDAWPAL+ADL+AG+QGI
Sbjct: 184 ESPLSRQDLVSWKMALDKRQLPEADKKMLYKLSGFRDIDKINPDAWPALVADLSAGDQGI 243
Query: 492 IALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVA 551
I+LAFGCTRLFQPDKPVT AQAAVALA GEASD V+EEL RIEAES AENAVS H+ALVA
Sbjct: 244 ISLAFGCTRLFQPDKPVTKAQAAVALATGEASDTVSEELARIEAESVAENAVSAHNALVA 303
Query: 552 EVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEM 611
+ E++IN SFEKELSMEREKI+ VEKMAEEAR ELERLRAERE D +ALMKER AIESEM
Sbjct: 304 QAEQDINASFEKELSMEREKINAVEKMAEEARCELERLRAEREKDGVALMKERIAIESEM 363
Query: 612 EILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALS 671
E+LSKLRREVEEQL+SL+SNK+EISYEKERI+ L+KEAE+E QEI+RLQY+LEVERKALS
Sbjct: 364 EVLSKLRREVEEQLQSLLSNKLEISYEKERISKLQKEAESEKQEISRLQYDLEVERKALS 423
Query: 672 MARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQF-SV 730
MARAWAEDEAKRAREQAKALE AR RWE+ GIKVVVD L EES V W+ AGKQ SV
Sbjct: 424 MARAWAEDEAKRAREQAKALEEARYRWEKHGIKVVVDSSLDEESSTGVTWLTAGKQVSSV 483
Query: 731 DQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDA 790
+ TV+RA++LVDKLK MA++V GKS+E+I+ II K+ + IS L++W +KA + ELK+A
Sbjct: 484 EGTVNRAENLVDKLKLMADNVKGKSREVIDKIIQKVQVLISILREWVAKAYAQTKELKEA 543
Query: 791 TILKAKGSVQELQQSTAEFRSNLT--EGAKRVAGDCREGVEKLTQRFKT 837
TI K +GS+QELQQ+T EF +L E KRVA DCREGVEKLTQ+FK+
Sbjct: 544 TISKTRGSIQELQQNTTEFNFSLAVKESTKRVAEDCREGVEKLTQKFKS 592
>gi|356529194|ref|XP_003533181.1| PREDICTED: uncharacterized protein LOC100780360 [Glycine max]
Length = 975
Score = 818 bits (2113), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/858 (55%), Positives = 595/858 (69%), Gaps = 31/858 (3%)
Query: 1 MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASD 60
M+ LT Q+ + D D+ E G + V+Q + +E + SS+ E+ SD
Sbjct: 120 MKTLTTQQEELLSSDDHNDEITEQGNVDSMVEQGNGKMEGQIDISGDYSSA-ESSNFYSD 178
Query: 61 NKI--DSENETPSTGDVSHSSSGIN-SINDVAKQDDLQRESASDDMSVAPDTALTSPKLP 117
N I DS+ + D + S G++ + ++ Q+DLQ E A + V A SP
Sbjct: 179 NSIVDDSDIGSQLIYDSKNPSDGVDDATKHISVQEDLQDELAFGNKLV---FASESPVPL 235
Query: 118 EPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTD---LGNDGSK 174
E E + NA DS + + ES + + EN +V+P N D L + +
Sbjct: 236 ESENTIDSFNAYGFRDFDSNPNVDTAESTANLK-ENLFNVDPGDAPNYDDAKPLHLNTEQ 294
Query: 175 FSRIFSDSS--------SISSSHAPIEPLAAVISV-----SSDTTVEPQILPKGDTETVA 221
I S S + SSS + E ++SV S++ +P+ + E +
Sbjct: 295 HDEITSSSGSVSFGFSETYSSSGSDNE--TGIVSVLVNPESNNMISDPKFFNEAGQENIL 352
Query: 222 SPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNG-SSGTSVSPSIFPFSNEKETCDLNES 280
S S +N++ ++ P +S E + S E + N +S+S S+ +E+ T D E
Sbjct: 353 SASKNENLDLNKIPQVSAEGNEPSFEERSVPGNDLFEESSISSSVNTLVDEQVTNDNYEV 412
Query: 281 NSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVI 340
+ ++SP +GS S GIPAPSVVSA++QVLPGKVLVPA VDQVQGQAL+ALQVLKVI
Sbjct: 413 DEVK-SKSPNSGSFFSVPGIPAPSVVSASVQVLPGKVLVPAAVDQVQGQALAALQVLKVI 471
Query: 341 EADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSS 400
E DV+P DLC RREYARWLVSASS L+RST+SKVYPAMYI+NVT+LAFDD+ PEDPDFSS
Sbjct: 472 EPDVQPSDLCTRREYARWLVSASSALSRSTVSKVYPAMYIDNVTELAFDDVIPEDPDFSS 531
Query: 401 IQGLAEAGLISSKLSHRDL---LNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANK 457
IQGLAEAGLI S+LS RD+ E+ P +F PESPLSRQDLVSWKMALEKRQLPEAN+
Sbjct: 532 IQGLAEAGLIESRLSRRDIQLSAEEDDSPFYFSPESPLSRQDLVSWKMALEKRQLPEANR 591
Query: 458 KILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
K+LYQ+SGFID DKI+P+A PAL+ADL++GEQGIIALAFG TRLFQPDKPVT AQAA+AL
Sbjct: 592 KVLYQVSGFIDTDKIHPNACPALVADLSSGEQGIIALAFGYTRLFQPDKPVTKAQAAMAL 651
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEK 577
A G+AS+ V+EEL RIEAES AENAV+ HSALVA+VEK+IN SFE+EL +EREKI VE+
Sbjct: 652 ATGDASEIVSEELARIEAESVAENAVAAHSALVAQVEKDINASFEQELFIEREKISAVER 711
Query: 578 MAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
MAEEAR ELERLRAERE D +AL KERAAI+SEME+ SKLR EVE+QL+SLM+++VEI++
Sbjct: 712 MAEEARLELERLRAEREEDNLALTKERAAIDSEMEVFSKLRHEVEDQLQSLMNDRVEIAH 771
Query: 638 EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDR 697
EKERI+ LR++AE EN+EI RLQYELEVERKALSMARAWAEDEAKR REQA ALE ARDR
Sbjct: 772 EKERISKLREQAEVENKEICRLQYELEVERKALSMARAWAEDEAKRVREQAIALEEARDR 831
Query: 698 WERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKE 757
WER GIKVVVD DLR+E+ A V W+NA +Q SV TV RA+SL+DKLK MA D+ GKS++
Sbjct: 832 WERHGIKVVVDDDLRKEASAGVTWLNASEQVSVQGTVDRAESLLDKLKQMAADIRGKSRD 891
Query: 758 IINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGA 817
++ IIH + IS L++WA K +A E +A I K S ELQ S E S + EGA
Sbjct: 892 TLDKIIHMVSQLISKLREWACKTGKQAEEFGEAAISKVGKSASELQLSALEVGSGIKEGA 951
Query: 818 KRVAGDCREGVEKLTQRF 835
KRVAGDCREGVEK+TQ+F
Sbjct: 952 KRVAGDCREGVEKITQKF 969
>gi|356561542|ref|XP_003549040.1| PREDICTED: uncharacterized protein LOC100810148 [Glycine max]
Length = 1002
Score = 815 bits (2104), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/856 (55%), Positives = 591/856 (69%), Gaps = 27/856 (3%)
Query: 1 MEPLTAAQDVSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVASD 60
M+PLT+ Q+ + D ++ E G N V+Q + +E + SS+ E+ SD
Sbjct: 147 MKPLTSQQEELLSSDDHNNEITEQGNVDNTVEQGNGKMEGQIHISGDYSSA-ESSNFYSD 205
Query: 61 NKI--DSENETPSTGDVSHSSSGIN-SINDVAKQDDLQRESASDDMSVAPDTALTSPKLP 117
N I DS+ + D + S G++ + ++ Q+DLQ SA D+ V A SP
Sbjct: 206 NSIVDDSDIGSQLIYDSKNPSDGVDDATKHISVQEDLQDVSAFDNKLV---FASESPVPL 262
Query: 118 EPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSNPTD---LGNDGSK 174
E E + NA DS + + ES + EN +V+P N D L + +
Sbjct: 263 ESENTVDSFNAYGFRDFDSNPNVDTVESTPNLK-ENLFNVDPGDVPNYDDAKPLHLNTEQ 321
Query: 175 FSRIFSDSS--------SISSSHAPIEP-LAAVISVS--SDTTVEPQILPKGDTETVASP 223
I S S + SSS A E + +V+ +S ++ +P+ + E + S
Sbjct: 322 HDEITSSSGSVSFGFPETYSSSGADNETGIVSVVVISELNNMISDPKFFNEAGQENILSA 381
Query: 224 STIKNVEQSEKPLLSGEDSSSSMEVHDLNKNG-SSGTSVSPSIFPFSNEKETCDLNESNS 282
+N++ ++ P +S E + S E + N +S+S S +E+ D E +
Sbjct: 382 LKNENLDLNKIPQVSAEGNEPSFEERSIPGNDLFEKSSISTSANTLVDEQVRNDNYEVDE 441
Query: 283 SSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEA 342
+ES +GS S GIPAP VVS A++VLPGK+LVPA VDQ QGQAL+ALQVLKVIE
Sbjct: 442 VK-SESSNSGSFFSVPGIPAPLVVSTAVKVLPGKILVPAAVDQAQGQALAALQVLKVIEP 500
Query: 343 DVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQ 402
DV+P DLC RREYARWLVSASS L+RST+SKVYPAMYI+N T+LAFDD+TPEDPDFSSIQ
Sbjct: 501 DVQPSDLCTRREYARWLVSASSALSRSTVSKVYPAMYIDNATELAFDDVTPEDPDFSSIQ 560
Query: 403 GLAEAGLISSKLSHRDLL---NEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKI 459
GLAEAGLI S+LS RD+ + + P +F PESPLSRQDLVSWKMAL+KRQLPEA+ K+
Sbjct: 561 GLAEAGLIESRLSRRDIQLFGDGDDSPFYFSPESPLSRQDLVSWKMALQKRQLPEADSKV 620
Query: 460 LYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAI 519
LYQLSGFID DKI+P+A PAL+ADL+AGEQGIIALAFG TRLFQPDKPVT AQAA+ALA
Sbjct: 621 LYQLSGFIDTDKIHPNACPALVADLSAGEQGIIALAFGYTRLFQPDKPVTKAQAAMALAT 680
Query: 520 GEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMA 579
G+AS+ V+EEL RIEAES AENAV+ HSALVA+VEK+IN SFE+EL +EREKI VE+MA
Sbjct: 681 GDASEIVSEELARIEAESIAENAVAAHSALVAQVEKDINASFEQELFIEREKISAVERMA 740
Query: 580 EEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEK 639
EEAR ELERLRAERE D +AL KERAAIESEME+ SKLR EVE+QL+SLMS+KVEI++EK
Sbjct: 741 EEARLELERLRAEREEDNLALTKERAAIESEMEVFSKLRHEVEDQLQSLMSDKVEIAHEK 800
Query: 640 ERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWE 699
ERI+ LR++AE EN EI RLQYELEVERKALSMARAWAEDEAKR REQA ALE ARDRWE
Sbjct: 801 ERISKLREKAEVENNEIGRLQYELEVERKALSMARAWAEDEAKRVREQAIALEEARDRWE 860
Query: 700 RQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEII 759
R GIKVVVD DLR+E+ A V W+NA +Q SV TV RA+SL+DKLK MA D+ GKS++ +
Sbjct: 861 RHGIKVVVDDDLRKEASAGVTWLNASEQVSVQGTVDRAESLLDKLKQMAADIRGKSRDTL 920
Query: 760 NTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKR 819
+ IIH + FIS L++WA K +A E +A I K SV ELQQ+ E + EGAKR
Sbjct: 921 HKIIHVVSQFISKLREWACKTGKQAEEFGEAAISKVGKSVSELQQNALEVGIGIKEGAKR 980
Query: 820 VAGDCREGVEKLTQRF 835
VAGDCREGVEK+TQ+F
Sbjct: 981 VAGDCREGVEKITQKF 996
>gi|224122346|ref|XP_002318812.1| predicted protein [Populus trichocarpa]
gi|222859485|gb|EEE97032.1| predicted protein [Populus trichocarpa]
Length = 793
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/593 (70%), Positives = 486/593 (81%), Gaps = 34/593 (5%)
Query: 279 ESNSSSFTESPPTGSSS--------SPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQA 330
ESN + +P T +SS S GIPAPS VSAALQVLPGKVLVPAVVDQVQGQ
Sbjct: 201 ESNFDDKSVTPETTTSSENLPSSDISATGIPAPSAVSAALQVLPGKVLVPAVVDQVQGQV 260
Query: 331 LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDD 390
L+ALQVLKVIEAD++ DLC RRE+ARWLV+ASS L+RST+SKVYPAMYIEN T+LAFDD
Sbjct: 261 LAALQVLKVIEADIQSSDLCTRREFARWLVTASSALSRSTVSKVYPAMYIENFTELAFDD 320
Query: 391 ITPEDPDFSSIQGLAEAGLISSKLSHRDLLN---EEPGPIFFLPESPLSRQDLVSWKMAL 447
ITP+DPDFSSIQGLAEAGLISSKLS LL+ E GP +F ESPLSRQDLVSWKMAL
Sbjct: 321 ITPDDPDFSSIQGLAEAGLISSKLSSGGLLSSSVENQGPFYFAAESPLSRQDLVSWKMAL 380
Query: 448 EKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
EKRQ PEA+KK+LY++SGF DIDK+NPDAWPAL+ADL+AG+QGII+LAFGCTRLFQPDKP
Sbjct: 381 EKRQFPEADKKMLYKVSGFRDIDKLNPDAWPALVADLSAGDQGIISLAFGCTRLFQPDKP 440
Query: 508 VTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSM 567
VT AQAAVALA GEASD V+EEL RIEAE+ AEN VS H+ALVA+VE+++N SFEKELS+
Sbjct: 441 VTKAQAAVALATGEASDIVSEELARIEAEAVAENVVSAHNALVAQVEQDVNASFEKELSI 500
Query: 568 EREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLES 627
EREKI+ +EKMAEEAR ELE LRAERE D IALMKERAAIESEME+LSKLRRE+EEQL+S
Sbjct: 501 EREKINAIEKMAEEARCELETLRAEREKDDIALMKERAAIESEMEVLSKLRRELEEQLQS 560
Query: 628 LMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQ 687
L+SNKVEISYEKERI+ L+KEAE+E QEI+RLQY+LEVERKALSMARAWAEDEAKRAREQ
Sbjct: 561 LLSNKVEISYEKERISKLQKEAESEKQEISRLQYDLEVERKALSMARAWAEDEAKRAREQ 620
Query: 688 AKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQF-SVDQTVSRAQSLVDKLKA 746
AKALE AR RWE+ GIKVVVD DL EES V W+ AGKQ SV+ TV+RA++LVD+LK
Sbjct: 621 AKALEEARYRWEKHGIKVVVDSDLNEESSTGVTWLTAGKQVSSVEGTVNRAENLVDRLKL 680
Query: 747 MANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATI-------------- 792
MA+D+ GKS+ +++ II KIL+ IS LK+W ++A R ELK+ATI
Sbjct: 681 MADDIRGKSRVVLDKIIQKILVLISVLKEWIAEACARTKELKEATISKTWASIHELQQNT 740
Query: 793 ------LKAK--GSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
+K K GS+QEL+Q TAEF S + EG KRV DCREGVEKLTQ+FK+
Sbjct: 741 TEFSSAIKEKTIGSMQELKQHTAEFGSAVKEGTKRVTEDCREGVEKLTQKFKS 793
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 45/79 (56%), Gaps = 1/79 (1%)
Query: 43 GTDNALSSSSEAIEVASDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDD 102
G + LSSS E E S+NK+ ET S V +++ +++++ Q++LQ ES DD
Sbjct: 148 GIETDLSSSPELNEAPSENKLGDNKET-SVDSVDYATRVSDTVDNEPVQENLQYESNFDD 206
Query: 103 MSVAPDTALTSPKLPEPEV 121
SV P+T +S LP ++
Sbjct: 207 KSVTPETTTSSENLPSSDI 225
>gi|449446025|ref|XP_004140772.1| PREDICTED: uncharacterized protein LOC101215442 [Cucumis sativus]
gi|449518413|ref|XP_004166236.1| PREDICTED: uncharacterized LOC101215442 [Cucumis sativus]
Length = 722
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/543 (73%), Positives = 472/543 (86%), Gaps = 4/543 (0%)
Query: 298 AGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYAR 357
AG+PAP +VSAA++ PGKVL+PAVVDQVQGQAL+ALQVLKVIE DV+P DLC RREYAR
Sbjct: 178 AGVPAP-LVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVEPSDLCTRREYAR 236
Query: 358 WLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHR 417
WLVSASS L+R+T SKVYPAMYIENVT+LAFDDITP+DPDF+SIQGLAEAG+ISSKLS
Sbjct: 237 WLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRH 296
Query: 418 DL---LNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINP 474
D+ L+E+ GP++F PES LSRQDLVSWKMALEKRQLPEA++K+L+Q+SGFID DKI+P
Sbjct: 297 DISSSLDEDQGPLYFSPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHP 356
Query: 475 DAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIE 534
DA PAL+ADL+ GEQGIIALAFG TRLFQPDKPVT AQAA+ALA GEASD V+EEL RIE
Sbjct: 357 DACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIE 416
Query: 535 AESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAERE 594
AES AENAV+ HSALVA+VEK+IN SFEKELS+EREK++ VEKMAEEA+QELERLR+ERE
Sbjct: 417 AESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERE 476
Query: 595 VDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQ 654
+ +ALM ERA+IESEME+LS+LR E+EEQL+ LMSNKVE+SYEKERIN LRKEAE ENQ
Sbjct: 477 REGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQ 536
Query: 655 EIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREE 714
EI+RLQYELEVERKALSMARAWAEDEAK+AREQAKALE ARDRWE++GIKVVVD DLRE+
Sbjct: 537 EISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQ 596
Query: 715 SDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLK 774
A W+++ KQF+V++T RA++L++KLK MA +V G+S+++I II KI L +SNL+
Sbjct: 597 ESAGDTWLDSSKQFTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLR 656
Query: 775 KWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQR 834
+W SK +A ELK+ I +A S +ELQQSTAE + EGAKRV GDCREGVEK TQ+
Sbjct: 657 QWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQK 716
Query: 835 FKT 837
F+T
Sbjct: 717 FRT 719
>gi|222424656|dbj|BAH20282.1| AT5G23890 [Arabidopsis thaliana]
Length = 805
Score = 745 bits (1924), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/748 (55%), Positives = 548/748 (73%), Gaps = 38/748 (5%)
Query: 110 ALTSPKLPEPEVVSGTENASPLEGSDS--ILDANLPESASEITGENPIDVEPSSFSN--P 165
A T P+ E E + +E+ S L+ S +LDA ES++ + EN +P S N P
Sbjct: 76 AETDPETAESEKII-SESKSLLDSSTEPILLDA---ESSNLVGVENTNSEDPESLLNTEP 131
Query: 166 TDLGNDGSKFSRIFSDS-SSISS--SHAPIEPLAAVISVSS--DTTVEPQILPKGDTET- 219
T++ + + + DS SS+S ++A + + VSS D+T +PQI+P DTET
Sbjct: 132 TNVSDLENHVNSQKEDSLSSLSGIDAYAASGTVTELPEVSSQLDSTSKPQIVPLNDTETA 191
Query: 220 VASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLN- 278
A+ + V + + + + SS + D++ +S SP P S + +LN
Sbjct: 192 FATAEELSEVNGTPEYFETSDWSS----ISDIDTTKELESSKSP--VPESTDGSKDELNI 245
Query: 279 -----ESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSA 333
++ E P GS+ S AGIPAP + ++ V PGK+LVP DQ+Q QA +A
Sbjct: 246 YSQDELDDNRMLLEIPSGGSAFSSAGIPAPFM---SVIVNPGKILVPVAADQIQCQAFAA 302
Query: 334 LQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITP 393
LQVLKVIE D +P DLC RREYARWL+SASS L+R+T SKVYPAMYIENVT+LAFDDITP
Sbjct: 303 LQVLKVIETDTQPSDLCTRREYARWLISASSALSRNTTSKVYPAMYIENVTELAFDDITP 362
Query: 394 EDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLP 453
EDPDFSSIQGLAEAGLI+SKLS+RDLL++ G F PES LSRQDL+SWKMALEKRQLP
Sbjct: 363 EDPDFSSIQGLAEAGLIASKLSNRDLLDDVEGTFLFSPESLLSRQDLISWKMALEKRQLP 422
Query: 454 EANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQA 513
EA+KK+LY+LSGFIDIDKINPDAWP+++ADL+ GEQGI ALAFGCTRLFQP KPVT QA
Sbjct: 423 EADKKMLYKLSGFIDIDKINPDAWPSIIADLSTGEQGIAALAFGCTRLFQPHKPVTKGQA 482
Query: 514 AVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKID 573
A+AL+ GEASD V+EEL RIEAES AE AVS H+ALVAEVEK++N SFEKELSMEREKI+
Sbjct: 483 AIALSSGEASDIVSEELARIEAESMAEKAVSAHNALVAEVEKDVNASFEKELSMEREKIE 542
Query: 574 VVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKV 633
VEKMAE A+ ELE+LR +RE + +AL+KERAA+ESEME+LS+LRR+ EE+LE LMSNK
Sbjct: 543 AVEKMAELAKVELEQLREKREEENLALVKERAAVESEMEVLSRLRRDAEEKLEDLMSNKA 602
Query: 634 EISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEG 693
EI++EKER+ LRKEAE E+Q I++LQYELEVERKALSMAR+WAE+EAK+AREQ +ALE
Sbjct: 603 EITFEKERVFNLRKEAEEESQRISKLQYELEVERKALSMARSWAEEEAKKAREQGRALEE 662
Query: 694 ARDRWERQGIKVVVDKDLRE----ESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMAN 749
AR RWE G++VVVDKDL+E E++ +++ +N ++ SV++T RA++L+DKLK MA
Sbjct: 663 ARKRWETNGLRVVVDKDLQETSSRETEQSIV-LNEMERSSVEETERRAKTLMDKLKEMAG 721
Query: 750 DVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEF 809
VSGKS+E+I T++ KI L+I+ LK++A RA E++DA I++AKG+ +++Q T +
Sbjct: 722 TVSGKSREVIFTVMEKIRLWITVLKEYAVNLGKRAGEMRDAAIVRAKGAAADVEQGTVQ- 780
Query: 810 RSNLTEGAKRVAGDCREGVEKLTQRFKT 837
+++ K++A +CR+GV K++QRFKT
Sbjct: 781 ---VSDKVKKMAEECRDGVGKISQRFKT 805
>gi|15237846|ref|NP_197777.1| uncharacterized protein [Arabidopsis thaliana]
gi|10176856|dbj|BAB10062.1| unnamed protein product [Arabidopsis thaliana]
gi|332005846|gb|AED93229.1| uncharacterized protein [Arabidopsis thaliana]
Length = 946
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/859 (51%), Positives = 596/859 (69%), Gaps = 59/859 (6%)
Query: 1 MEPLTAAQDVSIVPDHKI--DKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
M LT+ Q+ I +I D+ + +N+K +D+S+ES N ++ S+ E +
Sbjct: 125 MHSLTSQQESMIQSSDEISSDEIKVANSEESNLKDEDKSIES-----NDVAQKSD--EGS 177
Query: 59 SDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPE 118
++K+ + + G ++ + SI + DL + +D P+TA + + E
Sbjct: 178 GEDKLLGKETSSFDGVMTDEADATESIPQNTPEADLMVNAETD-----PETAESEKIISE 232
Query: 119 PEVV--SGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSN--PTDLGNDGSK 174
+ + S TE P+ +LDA ES++ + EN +P S N PT++ + +
Sbjct: 233 SKSLLDSSTE---PI-----LLDA---ESSNLVGVENTNSEDPESLLNTEPTNVSDLENH 281
Query: 175 FSRIFSDS-SSISS--SHAPIEPLAAVISVSS--DTTVEPQILPKGDTET-VASPSTIKN 228
+ DS SS+S ++A + + VSS D+T +PQI+P DTET A+ +
Sbjct: 282 VNSQKEDSLSSLSGIDAYAASGTVTELPEVSSQLDSTSKPQIVPLNDTETAFATAEELSE 341
Query: 229 VEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLN------ESNS 282
V + + + + SS + D++ +S SP P S + +LN ++
Sbjct: 342 VNGTPEYFETSDWSS----ISDIDTTKELESSKSP--VPESTDGSKDELNIYSQDELDDN 395
Query: 283 SSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEA 342
E P GS+ S AGIPAP + ++ V PGK+LVP DQ+Q QA +ALQVLKVIE
Sbjct: 396 RMLLEIPSGGSAFSSAGIPAPFM---SVIVNPGKILVPVAADQIQCQAFAALQVLKVIET 452
Query: 343 DVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQ 402
D +P DLC RREYARWL+SASS L+R+T SKVYPAMYIENVT+LAFDDITPEDPDFSSIQ
Sbjct: 453 DTQPSDLCTRREYARWLISASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFSSIQ 512
Query: 403 GLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQ 462
GLAEAGLI+SKLS+RDLL++ G F PES LSRQDL+SWKMALEKRQLPEA+KK+LY+
Sbjct: 513 GLAEAGLIASKLSNRDLLDDVEGTFLFSPESLLSRQDLISWKMALEKRQLPEADKKMLYK 572
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEA 522
LSGFIDIDKINPDAWP+++ADL+ GEQGI ALAFGCTRLFQP KPVT QAA+AL+ GEA
Sbjct: 573 LSGFIDIDKINPDAWPSIIADLSTGEQGIAALAFGCTRLFQPHKPVTKGQAAIALSSGEA 632
Query: 523 SDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEA 582
SD V+EEL RIEAES AE AVS H+ALVAEVEK++N SFEKELSMEREKI+ VEKMAE A
Sbjct: 633 SDIVSEELARIEAESMAEKAVSAHNALVAEVEKDVNASFEKELSMEREKIEAVEKMAELA 692
Query: 583 RQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
+ ELE+LR +RE + +AL+KERAA+ESEME+LS+LRR+ EE+LE LMSNK EI++EKER+
Sbjct: 693 KVELEQLREKREEENLALVKERAAVESEMEVLSRLRRDAEEKLEDLMSNKAEITFEKERV 752
Query: 643 NMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQG 702
LRKEAE E+Q I++LQYELEVERKALSMAR+WAE+EAK+AREQ +ALE AR RWE G
Sbjct: 753 FNLRKEAEEESQRISKLQYELEVERKALSMARSWAEEEAKKAREQGRALEEARKRWETNG 812
Query: 703 IKVVVDKDLRE----ESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEI 758
++VVVDKDL+E E++ +++ +N ++ SV++T RA++L+DKLK MA VSGKS+E+
Sbjct: 813 LRVVVDKDLQETSSRETEQSIV-LNEMERSSVEETERRAKTLMDKLKEMAGTVSGKSREV 871
Query: 759 INTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAK 818
I T++ KI L+I+ LK++A RA E++DA I++AKG+ +++Q T + +++ K
Sbjct: 872 IFTVMEKIRLWITVLKEYAVNLGKRAGEMRDAAIVRAKGAAADVEQGTVQ----VSDKVK 927
Query: 819 RVAGDCREGVEKLTQRFKT 837
++A +CR+GV K++QRFKT
Sbjct: 928 KMAEECRDGVGKISQRFKT 946
>gi|23397269|gb|AAN31916.1| unknown protein [Arabidopsis thaliana]
Length = 946
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/859 (50%), Positives = 595/859 (69%), Gaps = 59/859 (6%)
Query: 1 MEPLTAAQDVSIVPDHKI--DKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
M LT+ Q+ I +I D+ + +N+K +D+S+ES N ++ S+ E +
Sbjct: 125 MHSLTSQQESMIQSSDEISSDEIKVANSEESNLKDEDKSIES-----NDVAQKSD--EGS 177
Query: 59 SDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPE 118
++K+ + + G ++ + SI + DL + +D P+TA + + E
Sbjct: 178 GEDKLLGKETSSFDGVMTDEADATESIPQNTPEADLMVNAETD-----PETAESEKIISE 232
Query: 119 PEVV--SGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSN--PTDLGNDGSK 174
+ + S TE P+ +LDA ES++ + EN +P S N PT++ + +
Sbjct: 233 SKSLLDSSTE---PI-----LLDA---ESSNLVGVENTNSEDPESLLNTEPTNVSDLENH 281
Query: 175 FSRIFSDS-SSISS--SHAPIEPLAAVISVSS--DTTVEPQILPKGDTET-VASPSTIKN 228
+ DS SS+S ++A + + VSS D+T +PQI+P DTET A+ +
Sbjct: 282 VNSQKEDSLSSLSGIDAYAASGTVTELPEVSSQLDSTSKPQIVPLNDTETAFATAEELSE 341
Query: 229 VEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLN------ESNS 282
V + + + + SS + D++ +S SP P S + +LN ++
Sbjct: 342 VNGTPEYFETSDWSS----ISDIDTTKELESSKSP--VPESTDGSKDELNIYSQDELDDN 395
Query: 283 SSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEA 342
E P GS+ S AGIPAP + ++ V PGK+LVP DQ+Q QA +ALQVLKVIE
Sbjct: 396 RMLLEIPSGGSAFSSAGIPAPFM---SVIVNPGKILVPVAADQIQCQAFAALQVLKVIET 452
Query: 343 DVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQ 402
D +P DLC RREYARWL+SASS L+R+T SKVYPAMYIENVT+LAFDDITPEDPDFSSIQ
Sbjct: 453 DTQPSDLCTRREYARWLISASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFSSIQ 512
Query: 403 GLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQ 462
GLAEAGLI+SKLS+RDLL++ G F PES LSRQDL+SWKMALEKRQLPEA+KK+LY+
Sbjct: 513 GLAEAGLIASKLSNRDLLDDVEGTFLFSPESLLSRQDLISWKMALEKRQLPEADKKMLYK 572
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEA 522
LSGFIDIDKINPDAWP+++ADL+ GEQGI ALAFGCTRLFQP KPVT QAA+AL+ GEA
Sbjct: 573 LSGFIDIDKINPDAWPSIIADLSTGEQGIAALAFGCTRLFQPHKPVTKGQAAIALSSGEA 632
Query: 523 SDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEA 582
SD V+EEL RIEAES AE AVS H+ALVAEVEK++N SFEKELSMEREKI+ VEKMAE A
Sbjct: 633 SDIVSEELARIEAESMAEKAVSAHNALVAEVEKDVNASFEKELSMEREKIEAVEKMAELA 692
Query: 583 RQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
+ ELE+LR +RE + +AL+KERAA+ESEME+LS+LRR+ EE+LE LMSNK EI++EKER+
Sbjct: 693 KVELEQLREKREEENLALVKERAAVESEMEVLSRLRRDAEEKLEDLMSNKAEITFEKERV 752
Query: 643 NMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQG 702
LRKEAE E+Q I++LQYELE ERKALSMAR+WAE+EAK+AREQ +ALE AR RWE G
Sbjct: 753 FNLRKEAEEESQRISKLQYELEAERKALSMARSWAEEEAKKAREQGRALEEARKRWETNG 812
Query: 703 IKVVVDKDLRE----ESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEI 758
++VVVDKDL+E E++ +++ +N ++ SV++T RA++L+DKLK MA VSGKS+E+
Sbjct: 813 LRVVVDKDLQETSIRETEQSIV-LNEMERSSVEETERRAKTLMDKLKEMAGTVSGKSREV 871
Query: 759 INTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAK 818
I T++ KI L+I+ LK++A RA E++DA I++AKG+ +++Q T + +++ K
Sbjct: 872 IFTVMEKIRLWITVLKEYAVNLGKRAGEMRDAAIVRAKGAAADVEQGTVQ----VSDKVK 927
Query: 819 RVAGDCREGVEKLTQRFKT 837
++A +CR+GV K++QRFKT
Sbjct: 928 KMAEECRDGVGKISQRFKT 946
>gi|297808395|ref|XP_002872081.1| hypothetical protein ARALYDRAFT_489250 [Arabidopsis lyrata subsp. lyrata]
gi|297317918|gb|EFH48340.1| hypothetical protein ARALYDRAFT_489250 [Arabidopsis lyrata subsp.
lyrata]
Length = 947
Score = 732 bits (1889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/859 (51%), Positives = 590/859 (68%), Gaps = 58/859 (6%)
Query: 1 MEPLTAAQD--VSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
M LT+ Q+ V + + D+ + NN+K +D+S+ES N ++ S+ E +
Sbjct: 125 MHSLTSQQESMVQLSDETSSDEIKVANSEENNLKDEDKSIES-----NDVAQKSD--EGS 177
Query: 59 SDNKIDSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKLPE 118
++K+ G + S G+ +++ + + + + D+ ++ + T P+ E
Sbjct: 178 GEDKL--------LGTKTLSVDGV-MLDEADATESIPQNTPEADLIISVE---TDPETAE 225
Query: 119 PEVVSGTENASPLEGSDS--ILDANLPESASEITGENPIDVEPSSFSNPTDLGNDGSKFS 176
E + +E+ S L+ S +LDA ES++ + EN +P S N T+ N +
Sbjct: 226 SEKII-SESKSLLDSSTEPILLDA---ESSNLVGVENTNSEDPGSLPN-TEPTNVSDLEN 280
Query: 177 RIFSDSSSISSSHAPIEPLAAVISVSSD---------TTVEPQILPKGDTETVASPS-TI 226
R+ S SS + I+ AA +V+ + +T PQI+P DTET S +
Sbjct: 281 RVNSQKEDSLSSLSDIDAFAASGTVTEELPEVSSQSDSTSSPQIVPLNDTETAFSTGEDL 340
Query: 227 KNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNES----NS 282
V + + L +G SS + D++ + +S SP K+ ++ ++
Sbjct: 341 SEVNGTPEYLAAGSMSS----ISDIDTTKETESSNSPEPESIDGSKDELNIYSQDKLDDN 396
Query: 283 SSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEA 342
+ E P GS+ S AGIPAP + ++ V PGK+LVPA VDQVQ QA +ALQVLKVIE
Sbjct: 397 GTLLEIPSGGSAFSSAGIPAPFM---SVIVNPGKILVPAAVDQVQCQAFAALQVLKVIET 453
Query: 343 DVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQ 402
D++P DLC RREYARWLVSASS L+R+T SKVYPAMYIENVT+LAFDDITPEDPDFSSIQ
Sbjct: 454 DIQPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFSSIQ 513
Query: 403 GLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQ 462
GLAEAGLI+SKLS+RDLL++ G F PES LSRQDL+SWKMALEKRQLPEA+KK+LY+
Sbjct: 514 GLAEAGLIASKLSNRDLLDDVKGTFLFSPESLLSRQDLISWKMALEKRQLPEADKKMLYK 573
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEA 522
LSGFIDIDKINPDAWPA++ADL+ GEQGI ALAFGCTRLFQP KPVT QAA+AL+ GEA
Sbjct: 574 LSGFIDIDKINPDAWPAIIADLSTGEQGIAALAFGCTRLFQPHKPVTKGQAAIALSSGEA 633
Query: 523 SDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEA 582
SD V+EEL RIEAES AE AVS H+ALVAEVEK++N SFEKELSMEREKI+ VEKMAE A
Sbjct: 634 SDIVSEELARIEAESMAEKAVSAHNALVAEVEKDVNASFEKELSMEREKIEAVEKMAELA 693
Query: 583 RQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
+ ELE+LR +RE + +AL+KERAA+ESEME+LS+LRR+ EE+LE LMSNK EIS+EKER
Sbjct: 694 KVELEQLREKREEENLALVKERAAVESEMEVLSRLRRDAEEKLEDLMSNKAEISFEKERA 753
Query: 643 NMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQG 702
LRKEAE E+Q I++LQYELEVERKALSMAR+WAE+EAKRAREQ KALE AR RWE G
Sbjct: 754 LNLRKEAEEESQRISKLQYELEVERKALSMARSWAEEEAKRAREQGKALEDARKRWETNG 813
Query: 703 IKVVVDKDLRE----ESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEI 758
++VVVDKD +E E++ +++ +N ++ SV++T RA++L+DKLK MA V GKS+E+
Sbjct: 814 LRVVVDKDFQETISGETEQSIL-LNDVERSSVEETEERAKTLMDKLKEMAGTVIGKSREV 872
Query: 759 INTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGAK 818
I ++ KI L+I+ LK++A RA E++DA I+KAK + E+++ T + L++ K
Sbjct: 873 IFLVMEKIRLWITILKEYAVNLGKRAGEMRDAAIVKAKVAATEVEKGTVQ----LSDKVK 928
Query: 819 RVAGDCREGVEKLTQRFKT 837
++ +CR+GV K++QRFKT
Sbjct: 929 KMVDECRDGVGKISQRFKT 947
>gi|343173169|gb|AEL99287.1| hypothetical protein, partial [Silene latifolia]
Length = 672
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/551 (67%), Positives = 444/551 (80%), Gaps = 3/551 (0%)
Query: 281 NSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVI 340
S + ES P + + GIPAPS++SAALQV PGKVLVPAV DQ Q QAL+ALQVLKVI
Sbjct: 122 GSETVIESQPLQDTFTSRGIPAPSLLSAALQVPPGKVLVPAVTDQTQAQALAALQVLKVI 181
Query: 341 EADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSS 400
E+DV+ DLC RREYARWLVS+SS L+R+ + KVYPAMYIENVT+LAFDDITPEDPDF+S
Sbjct: 182 ESDVQASDLCTRREYARWLVSSSSALSRNLILKVYPAMYIENVTELAFDDITPEDPDFTS 241
Query: 401 IQGLAEAGLISSKLSHRDLL---NEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANK 457
IQGLAEAGLISSKLS RD+L +E+ G +F P+SPLSRQDLV+WKMALEKRQLPEA+K
Sbjct: 242 IQGLAEAGLISSKLSRRDMLSSPDEDIGSFYFHPDSPLSRQDLVTWKMALEKRQLPEADK 301
Query: 458 KILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
K L+QLSGFIDID+I+P+A+PAL+AD++A +QGI+A AFG TRLFQPDKPVT QAA+AL
Sbjct: 302 KELHQLSGFIDIDRIDPNAFPALVADISAKDQGIVASAFGYTRLFQPDKPVTKGQAAIAL 361
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEK 577
A GE+++ V+EEL RIEAES A+ AVS H ALVAEVEK+IN +FEKEL MEREKID V+K
Sbjct: 362 ATGESAEIVSEELARIEAESVADKAVSAHIALVAEVEKDINANFEKELIMEREKIDAVQK 421
Query: 578 MAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
MAEEA QE+ERLRAERE + ALMK+R A+ESEME+LSKLR E+EEQLE LMSNKV+ISY
Sbjct: 422 MAEEAMQEVERLRAEREEENSALMKQRVAVESEMEVLSKLRHEMEEQLEGLMSNKVKISY 481
Query: 638 EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDR 697
EK+ + LRKE E ENQ I RLQYELEVERKALSMARAWAEDEA+R +E AK LE ARDR
Sbjct: 482 EKDMVEKLRKETEEENQAIVRLQYELEVERKALSMARAWAEDEARRVQEHAKVLEEARDR 541
Query: 698 WERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKE 757
WERQGIKVVV++DLREE+ A V W N GK+ ++++TV RA +L D+LK MA V+GKSKE
Sbjct: 542 WERQGIKVVVNEDLREEAVADVTWSNVGKKLALEETVDRADTLTDRLKLMAGQVTGKSKE 601
Query: 758 IINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGA 817
IIN +I KI IS +++W S R E KDA K S+Q +Q+ + EGA
Sbjct: 602 IINNVISKIQELISAIREWISNIGKRTIEFKDAAFAKTAESIQGIQERAVGVSVTVKEGA 661
Query: 818 KRVAGDCREGV 828
KRVA DCR GV
Sbjct: 662 KRVADDCRGGV 672
>gi|343173167|gb|AEL99286.1| hypothetical protein, partial [Silene latifolia]
Length = 672
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/551 (67%), Positives = 444/551 (80%), Gaps = 3/551 (0%)
Query: 281 NSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVI 340
S + ES P + + GIPAPS++SAALQV PGKVLVPAV DQ Q QAL+ALQVLKVI
Sbjct: 122 GSETVIESQPLQDTFTSRGIPAPSLLSAALQVPPGKVLVPAVTDQSQAQALAALQVLKVI 181
Query: 341 EADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSS 400
E+DV+ DLC RREYARWLVS+SS L+R+ + KVYPAMYIENVT+LAFDDITPEDPDF+S
Sbjct: 182 ESDVQASDLCTRREYARWLVSSSSALSRNLILKVYPAMYIENVTELAFDDITPEDPDFTS 241
Query: 401 IQGLAEAGLISSKLSHRDLL---NEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANK 457
IQGLAEAGLISSKLS RD+L +E+ G +F P+SPLSRQDLV+WKMALEKRQLPEA+K
Sbjct: 242 IQGLAEAGLISSKLSRRDMLSSPDEDIGSFYFHPDSPLSRQDLVTWKMALEKRQLPEADK 301
Query: 458 KILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
K L+QLSGFIDID+I+P+A+PAL+AD++A +QGI+A AFG TRLFQPDKPVT QAA+AL
Sbjct: 302 KELHQLSGFIDIDRIDPNAFPALVADISAKDQGIVASAFGYTRLFQPDKPVTKGQAAIAL 361
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEK 577
A GE+++ V+EEL RIEAES A+ AVS H ALVAEVEK+IN +FEKEL MEREKID V+K
Sbjct: 362 ATGESAEIVSEELARIEAESVADKAVSAHIALVAEVEKDINANFEKELIMEREKIDAVQK 421
Query: 578 MAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
MAEEA QE+ERLRAERE + ALMK+R A+ESEME+LSKLR E+EEQLE LMSNKV+ISY
Sbjct: 422 MAEEAMQEVERLRAEREEENSALMKQRVAVESEMEVLSKLRHEMEEQLEGLMSNKVKISY 481
Query: 638 EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDR 697
EK+ + LRKE E ENQ I RLQYELEVERKALSMARAWAEDEA+R +E AK LE ARDR
Sbjct: 482 EKDMVEKLRKETEEENQAIVRLQYELEVERKALSMARAWAEDEARRVQEHAKVLEEARDR 541
Query: 698 WERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKE 757
WERQGIKVVV++DLREE+ A V W N GK+ ++++TV RA +L D+LK MA V+GKSKE
Sbjct: 542 WERQGIKVVVNEDLREEAVADVTWSNVGKKLALEETVDRADTLTDRLKLMAGQVTGKSKE 601
Query: 758 IINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGA 817
IIN +I KI IS +++W S R E KDA K S+Q +Q+ + EGA
Sbjct: 602 IINNVISKIQELISAIREWISNIGKRTIEFKDAAFAKTAESIQGIQERAVGVSVTVKEGA 661
Query: 818 KRVAGDCREGV 828
KRVA DCR GV
Sbjct: 662 KRVADDCRGGV 672
>gi|356534127|ref|XP_003535609.1| PREDICTED: uncharacterized protein LOC100801281 [Glycine max]
Length = 941
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/859 (51%), Positives = 586/859 (68%), Gaps = 50/859 (5%)
Query: 1 MEPLTAAQD--VSIVPDHKIDKFEEYGYAGNNVKQDDRSLESKTGTDNALSSSSEAIEVA 58
M+PLT Q+ V + D DK E+ +G +Q + ++E + SS+ E ++
Sbjct: 109 MKPLTTHQEQEVLLSSDDCNDKIEQVN-SGTMEEQGNGNVEGRIDVSRDCSST-EYDKIP 166
Query: 59 SDNKI--DSENETPSTGDVSHSSSGINSINDVAKQDDLQRESASDDMSVAPDTALTSPKL 116
+ ++I DS + D+ + + +++ ++ Q++LQ ESA+D+ SV P+ A+
Sbjct: 167 NSHRIIDDSNAGSQLVYDIHNKDNDSDAMKHISVQEELQIESAADEESVLPEGAM----- 221
Query: 117 PEPEVVSGTENASPLEGSDSILDANLPESASEITGENPIDVEPSSFSN------------ 164
V++G+E+ +P++ DS + S +E+ ENP VEP SN
Sbjct: 222 ----VLNGSESENPVDSFDSSTAVDSQNSITELK-ENPSFVEPKKVSNFDAEPLPVISEE 276
Query: 165 ---PTDLGNDGSKFSRIFSDSSSISSSHAPIEPLAAVISVSSDTTVEPQILPKGDTETVA 221
TD + G++ S I +D+ ++ + AV + S+ TT P ++P+ E+
Sbjct: 277 QDEITD--SSGNRSSGIVADNETVLVN-------IAVSTQSNKTTSFPAVIPEDWEESAQ 327
Query: 222 SPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESN 281
S ST +N++ + P + + SS+ ++N S SI F +E+ D NE +
Sbjct: 328 SVSTKENLDLNNMPQVLHQ---SSLAEQSFSENDLFTKSFVSSIDAFLDEQVKNDNNEVD 384
Query: 282 SSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIE 341
+E+ G+ S GIPAPS VS+ +QVLPGKVLVPA VDQVQGQAL+ALQ LKVIE
Sbjct: 385 ICR-SETSNFGAFYSAPGIPAPSAVSSVVQVLPGKVLVPAAVDQVQGQALAALQTLKVIE 443
Query: 342 ADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSI 401
DV+P DLC RREYARWLVSASS L+R T+SKVYPAM++++VT+LAFDDITPEDPDFS I
Sbjct: 444 PDVQPSDLCTRREYARWLVSASSALSRKTISKVYPAMFVDSVTELAFDDITPEDPDFSFI 503
Query: 402 QGLAEAGLISSKLSH---RDL-LNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANK 457
QGLAEAGLI S+LS R L NE+ GP +F PESPLSRQDLV+WK+ LEKRQLPEA++
Sbjct: 504 QGLAEAGLIESRLSRCYDRPLSTNEDYGPFYFSPESPLSRQDLVTWKIDLEKRQLPEADR 563
Query: 458 KILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
K+L QLSGFID DKI+ DA P L+AD++AGE GIIALAFG TRLFQP KPVT AQAA+AL
Sbjct: 564 KMLCQLSGFIDTDKIHSDACPELVADVSAGEHGIIALAFGYTRLFQPHKPVTKAQAAIAL 623
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEK 577
A G+A D VNEEL E+ES ENAV+ HSALVA+VEK+IN S E++LS+EREKI+ VE+
Sbjct: 624 AAGDAFDIVNEELACFESESMDENAVASHSALVAQVEKDINASLEQKLSIEREKINAVER 683
Query: 578 MAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
MAEEAR ELERLRAERE ++I+L++ERAAIESE + S+L+ EVE+QL++L+S+KVEI+Y
Sbjct: 684 MAEEARCELERLRAEREEERISLIEERAAIESERNVFSRLKHEVEDQLQNLISDKVEIAY 743
Query: 638 EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDR 697
EK+RI+ LR+ AE +N+EI +LQYELEVERKALSMARAWAEDEAKR E ALE ARD
Sbjct: 744 EKDRISKLRELAEVQNKEITQLQYELEVERKALSMARAWAEDEAKRVSEHTLALERARDS 803
Query: 698 WERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKE 757
WER K VD D E+ A V +N +Q SV TV RA++L+DKLK MA +V G++++
Sbjct: 804 WERNESKAAVD-DFHEDL-AGVTLLNTEEQLSVQDTVDRAENLLDKLKKMAVEVGGRARD 861
Query: 758 IINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTEGA 817
+I+ IIH I F+S L++WA K +A ELK + I KA S E+QQS EF + E A
Sbjct: 862 MIDKIIHIISQFVSRLREWACKTGKQAEELKQSAISKAGKSAHEVQQSALEFGFTIKEEA 921
Query: 818 KRVAGDCREGVEKLTQRFK 836
KRVAGDCREGVEKLTQ+FK
Sbjct: 922 KRVAGDCREGVEKLTQKFK 940
>gi|115456771|ref|NP_001051986.1| Os03g0862100 [Oryza sativa Japonica Group]
gi|108712242|gb|ABG00037.1| expressed protein [Oryza sativa Japonica Group]
gi|113550457|dbj|BAF13900.1| Os03g0862100 [Oryza sativa Japonica Group]
Length = 918
Score = 633 bits (1633), Expect = e-178, Method: Compositional matrix adjust.
Identities = 336/572 (58%), Positives = 433/572 (75%), Gaps = 4/572 (0%)
Query: 269 SNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQG 328
SN+ E D E+ +S + + P S +S +GIPAP+++SAAL+V G+++VPA VD Q
Sbjct: 347 SNQNEGADELENQNSLYESTTPDKSFAS-SGIPAPTLLSAALRVRTGQIMVPAAVDPAQA 405
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
AL+ALQVLKVIE D + GDLC RREYARWLV AS+ L+R+T SKVYPAMYIENVT+LAF
Sbjct: 406 SALAALQVLKVIEPDAQAGDLCTRREYARWLVVASNCLSRNTSSKVYPAMYIENVTELAF 465
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDL---LNEEPGPIFFLPESPLSRQDLVSWKM 445
DDITPED DF IQGLAEAGLISSKLS D+ L+ + F PE P+SRQDLVSWKM
Sbjct: 466 DDITPEDFDFPFIQGLAEAGLISSKLSRSDMNVPLDVDNLHNLFSPECPVSRQDLVSWKM 525
Query: 446 ALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
AL+KRQLPE +K +Y+ SG++D+DKIN AWPAL+ADL AG+Q I ALAFG TRLFQPD
Sbjct: 526 ALDKRQLPEVDKTSMYKASGYMDVDKINAAAWPALVADLDAGDQSITALAFGFTRLFQPD 585
Query: 506 KPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKEL 565
KPVT Q A+AL+ G+++D V EEL RIEAE AE+AV+ H LVA+VEK++N +FE+EL
Sbjct: 586 KPVTKGQVALALSTGDSADVVMEELARIEAEKIAEDAVNAHGELVAQVEKDLNATFEREL 645
Query: 566 SMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQL 625
+ EREKI+ +EK+AEEAR EL++LRAER + AL++ RA++ESEME+LSKLR EVEEQL
Sbjct: 646 TKEREKIERLEKLAEEARVELDKLRAERVEENNALIRGRASVESEMEVLSKLRSEVEEQL 705
Query: 626 ESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAR 685
+S++S KVEIS+EK RI L+ E EN+ Q + +LQYELEVERKALSMARAWAEDEAK+AR
Sbjct: 706 QSVLSKKVEISFEKNRIEKLQTEIENDRQAVVQLQYELEVERKALSMARAWAEDEAKKAR 765
Query: 686 EQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLK 745
E A+ALE AR++WER GIKVVV+ L +++ A V W NAGK+ VD+ ++RA SL++KLK
Sbjct: 766 EHARALEEARNQWERHGIKVVVEGGLEDDASAGVTWANAGKEHQVDEAINRAGSLLEKLK 825
Query: 746 AMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQS 805
+M+ ++ +S + +I + FIS LK+ A +A+ R +L A LKAK E Q +
Sbjct: 826 SMSAEIKVRSCHSLERVIQHVRSFISILKQGAEEATQRFTDLGAAAALKAKKLSSEAQDN 885
Query: 806 TAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
F S + + +KRV DC+EG+EK RFKT
Sbjct: 886 VYVFGSTIGDKSKRVVEDCKEGLEKFVHRFKT 917
>gi|218194169|gb|EEC76596.1| hypothetical protein OsI_14447 [Oryza sativa Indica Group]
Length = 608
Score = 633 bits (1633), Expect = e-178, Method: Compositional matrix adjust.
Identities = 337/572 (58%), Positives = 433/572 (75%), Gaps = 4/572 (0%)
Query: 269 SNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQG 328
SN+ E D E+ +S + + P S +S +GIPAP+++SAALQV G+++VPA VD Q
Sbjct: 37 SNQNEGADELENQNSLYESTTPDKSFAS-SGIPAPTLLSAALQVRTGQIMVPAAVDPAQA 95
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
AL+ALQVLKVIE D + GDLC RREYARWLV AS+ L+R+T SKVYPAMYIENVT+LAF
Sbjct: 96 SALAALQVLKVIEPDAQAGDLCTRREYARWLVVASNCLSRNTSSKVYPAMYIENVTELAF 155
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDL---LNEEPGPIFFLPESPLSRQDLVSWKM 445
DDITPED DF IQGLAEAGLISSKLS D+ L+ + F PE P+SRQDLVSWKM
Sbjct: 156 DDITPEDFDFPFIQGLAEAGLISSKLSRSDMNVPLDVDNLHNLFSPECPVSRQDLVSWKM 215
Query: 446 ALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
AL+KRQLPE +K +Y+ SG++D+DKIN AWPAL+ADL AG+Q I ALAFG TRLFQPD
Sbjct: 216 ALDKRQLPEVDKTSMYKASGYMDVDKINAAAWPALVADLDAGDQSITALAFGFTRLFQPD 275
Query: 506 KPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKEL 565
KPVT Q A+AL+ G+++D V EEL RIEAE AE+AV+ H LVA+VEK++N +FE+EL
Sbjct: 276 KPVTKGQVALALSTGDSADVVMEELARIEAEKIAEDAVNAHGELVAQVEKDLNATFEREL 335
Query: 566 SMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQL 625
+ EREKI+ +EK+AEEAR EL++LRAER + AL++ RA++ESEME+LSKLR EVEEQL
Sbjct: 336 TKEREKIETLEKLAEEARVELDKLRAERVEENNALIRGRASVESEMEVLSKLRSEVEEQL 395
Query: 626 ESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAR 685
+S++S KVEIS+EK RI L+ E EN+ Q + +LQYELEVERKALSMARAWAEDEAK+AR
Sbjct: 396 QSVLSKKVEISFEKNRIEKLQTEIENDRQAVVQLQYELEVERKALSMARAWAEDEAKKAR 455
Query: 686 EQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLK 745
E A+ALE AR++WER GIKVVV+ L +++ A V W NAGK+ VD+ ++RA SL++KLK
Sbjct: 456 EHARALEEARNQWERHGIKVVVEGGLEDDASAGVTWANAGKEHQVDEAINRAGSLLEKLK 515
Query: 746 AMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQS 805
+M+ ++ +S + +I + FIS LK+ A +A+ R +L A LKAK E Q +
Sbjct: 516 SMSAEIKVRSCHSLERVIQHVRSFISILKQGAEEATQRFTDLGAAAALKAKKLSSEAQDN 575
Query: 806 TAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
F S + + +KRV DC+EG+EK RFKT
Sbjct: 576 VYVFGSTIGDKSKRVVEDCKEGLEKFVHRFKT 607
>gi|108712243|gb|ABG00038.1| expressed protein [Oryza sativa Japonica Group]
Length = 608
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 336/572 (58%), Positives = 433/572 (75%), Gaps = 4/572 (0%)
Query: 269 SNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQG 328
SN+ E D E+ +S + + P S +S +GIPAP+++SAAL+V G+++VPA VD Q
Sbjct: 37 SNQNEGADELENQNSLYESTTPDKSFAS-SGIPAPTLLSAALRVRTGQIMVPAAVDPAQA 95
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
AL+ALQVLKVIE D + GDLC RREYARWLV AS+ L+R+T SKVYPAMYIENVT+LAF
Sbjct: 96 SALAALQVLKVIEPDAQAGDLCTRREYARWLVVASNCLSRNTSSKVYPAMYIENVTELAF 155
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDL---LNEEPGPIFFLPESPLSRQDLVSWKM 445
DDITPED DF IQGLAEAGLISSKLS D+ L+ + F PE P+SRQDLVSWKM
Sbjct: 156 DDITPEDFDFPFIQGLAEAGLISSKLSRSDMNVPLDVDNLHNLFSPECPVSRQDLVSWKM 215
Query: 446 ALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
AL+KRQLPE +K +Y+ SG++D+DKIN AWPAL+ADL AG+Q I ALAFG TRLFQPD
Sbjct: 216 ALDKRQLPEVDKTSMYKASGYMDVDKINAAAWPALVADLDAGDQSITALAFGFTRLFQPD 275
Query: 506 KPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKEL 565
KPVT Q A+AL+ G+++D V EEL RIEAE AE+AV+ H LVA+VEK++N +FE+EL
Sbjct: 276 KPVTKGQVALALSTGDSADVVMEELARIEAEKIAEDAVNAHGELVAQVEKDLNATFEREL 335
Query: 566 SMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQL 625
+ EREKI+ +EK+AEEAR EL++LRAER + AL++ RA++ESEME+LSKLR EVEEQL
Sbjct: 336 TKEREKIERLEKLAEEARVELDKLRAERVEENNALIRGRASVESEMEVLSKLRSEVEEQL 395
Query: 626 ESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAR 685
+S++S KVEIS+EK RI L+ E EN+ Q + +LQYELEVERKALSMARAWAEDEAK+AR
Sbjct: 396 QSVLSKKVEISFEKNRIEKLQTEIENDRQAVVQLQYELEVERKALSMARAWAEDEAKKAR 455
Query: 686 EQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLK 745
E A+ALE AR++WER GIKVVV+ L +++ A V W NAGK+ VD+ ++RA SL++KLK
Sbjct: 456 EHARALEEARNQWERHGIKVVVEGGLEDDASAGVTWANAGKEHQVDEAINRAGSLLEKLK 515
Query: 746 AMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQS 805
+M+ ++ +S + +I + FIS LK+ A +A+ R +L A LKAK E Q +
Sbjct: 516 SMSAEIKVRSCHSLERVIQHVRSFISILKQGAEEATQRFTDLGAAAALKAKKLSSEAQDN 575
Query: 806 TAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
F S + + +KRV DC+EG+EK RFKT
Sbjct: 576 VYVFGSTIGDKSKRVVEDCKEGLEKFVHRFKT 607
>gi|357124511|ref|XP_003563943.1| PREDICTED: uncharacterized protein LOC100825490 [Brachypodium
distachyon]
Length = 911
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 368/710 (51%), Positives = 490/710 (69%), Gaps = 42/710 (5%)
Query: 144 ESASEITGENPIDVEPSSFSNPTDLGND-GSKFSRIFSDSSSISSSHAPIEPLAAVISVS 202
+S+ +++G +P + P LG++ GS +R D +S+S A + V+ ++
Sbjct: 227 DSSDKLSGADPFEGTPKLQET---LGSEAGSPENRYMDD---MSTSDAIVLDSGHVVPIT 280
Query: 203 --SDTTVEPQILPKGDTETVASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSS--- 257
SDT+VE AS + EQ+ + LS ED S + D ++GS+
Sbjct: 281 KFSDTSVE-----------AASHLNENDTEQNHQ--LSNEDEISPPRLPDYIEHGSADQM 327
Query: 258 ---GTSVSPS-----IFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAA 309
G++ P+ P +++++ + N + +S G + S AG PAPS++SAA
Sbjct: 328 LPFGSNDLPAEPGKVHQPLASDQDVGESQLENQNELVKSTEPGKAFSSAGFPAPSLLSAA 387
Query: 310 LQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRS 369
LQV G+++VPA VD QG AL+ALQVLKVIE + GDLC RREYARWLV AS+ L+R+
Sbjct: 388 LQVPAGQIVVPAAVDPTQGNALAALQVLKVIEPGAQAGDLCTRREYARWLVVASNCLSRN 447
Query: 370 TMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLL--NEEPGPI 427
T SKVYPAMY+ENV++LAFDD+T EDPDF IQGLAEAGLISSKLS D N +
Sbjct: 448 TYSKVYPAMYVENVSELAFDDVTTEDPDFPFIQGLAEAGLISSKLSRSDTNPENFQNNHY 507
Query: 428 FFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAG 487
+F PESPLSRQDLVSWKMAL+KR+LPE +K LY+ SG+IDIDKI+ AWPAL ADL AG
Sbjct: 508 WFYPESPLSRQDLVSWKMALDKRRLPEVDKNSLYKTSGYIDIDKIDAAAWPALAADLGAG 567
Query: 488 EQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHS 547
+Q I ALAFG TRLFQPDKPVT QAA+AL+ G++++ V EEL RIEAE AE AV+ H
Sbjct: 568 DQSITALAFGFTRLFQPDKPVTKGQAALALSTGDSAEVVMEELARIEAEKMAEAAVNAHG 627
Query: 548 ALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAI 607
ALVA+VEK+IN SFE+EL+ EREKI+ +EK+AEEAR ELE+LRAERE +K AL++ RAA+
Sbjct: 628 ALVAQVEKDINASFERELAREREKIETLEKLAEEARFELEKLRAEREEEKNALIRGRAAV 687
Query: 608 ESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVER 667
ESE+E+LSKLR EVEEQL+S++S KVEIS+EK RI+ L+KE ENENQ +LQYELEVER
Sbjct: 688 ESEIEVLSKLRSEVEEQLQSVLSKKVEISFEKNRIDKLQKEIENENQAAVQLQYELEVER 747
Query: 668 KALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQ 727
KALSMARAWAEDEAK+ARE A+ALE AR++WERQGIKVVV+ L +++ A V W NAGK+
Sbjct: 748 KALSMARAWAEDEAKKAREHARALEEARNQWERQGIKVVVEGGLEDDASAGVTWANAGKE 807
Query: 728 FSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAEL 787
VD+ ++RA+SL++KLK+M+ D+ ++ + ++ + FIS+LK+ A++A +
Sbjct: 808 HPVDEAINRAESLLEKLKSMSADMKVRACHALQRVMQHVRSFISSLKERAAEARQGCIDF 867
Query: 788 KDATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
A KA +L F S + + +K+V DC+ EK RFKT
Sbjct: 868 GAAAASKA----NKLSSEARAFGSTVGDKSKKVVEDCK---EKYAHRFKT 910
>gi|31193924|gb|AAP44759.1| unknown protein [Oryza sativa Japonica Group]
Length = 935
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 336/589 (57%), Positives = 433/589 (73%), Gaps = 21/589 (3%)
Query: 269 SNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQG 328
SN+ E D E+ +S + + P S +S +GIPAP+++SAAL+V G+++VPA VD Q
Sbjct: 347 SNQNEGADELENQNSLYESTTPDKSFAS-SGIPAPTLLSAALRVRTGQIMVPAAVDPAQA 405
Query: 329 QALSALQVLK-----------------VIEADVKPGDLCIRREYARWLVSASSTLTRSTM 371
AL+ALQVLK VIE D + GDLC RREYARWLV AS+ L+R+T
Sbjct: 406 SALAALQVLKMLKTIGCTICIWEYPFKVIEPDAQAGDLCTRREYARWLVVASNCLSRNTS 465
Query: 372 SKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDL---LNEEPGPIF 428
SKVYPAMYIENVT+LAFDDITPED DF IQGLAEAGLISSKLS D+ L+ +
Sbjct: 466 SKVYPAMYIENVTELAFDDITPEDFDFPFIQGLAEAGLISSKLSRSDMNVPLDVDNLHNL 525
Query: 429 FLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGE 488
F PE P+SRQDLVSWKMAL+KRQLPE +K +Y+ SG++D+DKIN AWPAL+ADL AG+
Sbjct: 526 FSPECPVSRQDLVSWKMALDKRQLPEVDKTSMYKASGYMDVDKINAAAWPALVADLDAGD 585
Query: 489 QGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSA 548
Q I ALAFG TRLFQPDKPVT Q A+AL+ G+++D V EEL RIEAE AE+AV+ H
Sbjct: 586 QSITALAFGFTRLFQPDKPVTKGQVALALSTGDSADVVMEELARIEAEKIAEDAVNAHGE 645
Query: 549 LVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIE 608
LVA+VEK++N +FE+EL+ EREKI+ +EK+AEEAR EL++LRAER + AL++ RA++E
Sbjct: 646 LVAQVEKDLNATFERELTKEREKIERLEKLAEEARVELDKLRAERVEENNALIRGRASVE 705
Query: 609 SEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERK 668
SEME+LSKLR EVEEQL+S++S KVEIS+EK RI L+ E EN+ Q + +LQYELEVERK
Sbjct: 706 SEMEVLSKLRSEVEEQLQSVLSKKVEISFEKNRIEKLQTEIENDRQAVVQLQYELEVERK 765
Query: 669 ALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQF 728
ALSMARAWAEDEAK+ARE A+ALE AR++WER GIKVVV+ L +++ A V W NAGK+
Sbjct: 766 ALSMARAWAEDEAKKAREHARALEEARNQWERHGIKVVVEGGLEDDASAGVTWANAGKEH 825
Query: 729 SVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELK 788
VD+ ++RA SL++KLK+M+ ++ +S + +I + FIS LK+ A +A+ R +L
Sbjct: 826 QVDEAINRAGSLLEKLKSMSAEIKVRSCHSLERVIQHVRSFISILKQGAEEATQRFTDLG 885
Query: 789 DATILKAKGSVQELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
A LKAK E Q + F S + + +KRV DC+EG+EK RFKT
Sbjct: 886 AAAALKAKKLSSEAQDNVYVFGSTIGDKSKRVVEDCKEGLEKFVHRFKT 934
>gi|42573662|ref|NP_974927.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008828|gb|AED96211.1| uncharacterized protein [Arabidopsis thaliana]
Length = 761
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 327/544 (60%), Positives = 407/544 (74%), Gaps = 12/544 (2%)
Query: 299 GIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARW 358
GIPAPS V QV K + P VVD VQ Q +ALQ LKVIE+D P DLC RRE+ARW
Sbjct: 224 GIPAPSTVP---QVDSLKPIFPTVVDPVQSQMFAALQALKVIESDALPYDLCTRREFARW 280
Query: 359 LVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRD 418
+VSAS+TL+R++ SKVYPAMYIENVT+LAFDDITPEDPDF IQGLAEAGLISSKLS+ +
Sbjct: 281 VVSASNTLSRNSASKVYPAMYIENVTELAFDDITPEDPDFPFIQGLAEAGLISSKLSNNN 340
Query: 419 LLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWP 478
+ + E + F PESPL+RQDL+SWKMALE RQLPEA+ K LYQLSGF+DIDKINP+AWP
Sbjct: 341 MPSSESSRVTFSPESPLTRQDLLSWKMALEFRQLPEADSKKLYQLSGFLDIDKINPEAWP 400
Query: 479 ALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESA 538
AL+ADL+AGE GI AL+FG TRLFQP K VT AQ AV+LAIG+A + V EEL RIEAE+
Sbjct: 401 ALIADLSAGEHGITALSFGRTRLFQPSKAVTKAQTAVSLAIGDAFEVVGEELARIEAEAM 460
Query: 539 AENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKI 598
AEN V H+ LVA+VEK+IN SFEKEL E+E +D VEK+AEEA+ EL RLR E+E + +
Sbjct: 461 AENVVCAHNELVAQVEKDINASFEKELLREKEIVDAVEKLAEEAKSELARLRVEKEEETL 520
Query: 599 ALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIAR 658
AL +ER +IE+EME L+++R E+EEQL+SL SNK E+SYEKER + L+K+ E+ENQEI R
Sbjct: 521 ALERERTSIETEMEALARIRNELEEQLQSLASNKAEMSYEKERFDRLQKQVEDENQEILR 580
Query: 659 LQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAA 718
LQ ELEVER ALS+AR WA+DEA+RAREQAK LE AR RWE+ G+KV+VD DL E++
Sbjct: 581 LQNELEVERNALSIARDWAKDEARRAREQAKVLEEARGRWEKYGLKVIVDSDLHEQTTKT 640
Query: 719 -VMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWA 777
W+NAGKQ V+ T+ RA +L+ KLK MA DV KS+E+I II KI L IS LK+
Sbjct: 641 ESTWLNAGKQNHVEGTMKRAGNLIAKLKKMAKDVGEKSREVIYLIIEKISLLISALKQQV 700
Query: 778 SKASMRAAELKDATILKAKGSVQELQQSTA----EFRSNLTEGAKRVAGDCREGVEKLTQ 833
+A +LK +K K +E+ + T+ E R+ AK + ++ V KL +
Sbjct: 701 HGMENKAKDLK----IKTKSKAEEVWRQTSLRADEIRNISIVKAKETVEEFKDRVGKLGE 756
Query: 834 RFKT 837
+FK+
Sbjct: 757 KFKS 760
>gi|297792573|ref|XP_002864171.1| oxidoreductase/ transition metal ion binding protein [Arabidopsis
lyrata subsp. lyrata]
gi|297310006|gb|EFH40430.1| oxidoreductase/ transition metal ion binding protein [Arabidopsis
lyrata subsp. lyrata]
Length = 749
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 340/612 (55%), Positives = 427/612 (69%), Gaps = 23/612 (3%)
Query: 235 PLLSGEDS---SSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPT 291
P+LS +D S S +N G+ + S + S E + D + T P
Sbjct: 151 PVLSLDDKDLVSKSASTSKVNDEGNKASESSAERYTLSKELDGVD-------THTSLIPY 203
Query: 292 --GSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDL 349
+ S GIPAPS V QV P + + P VVD VQ Q SALQ LKVIE+D P DL
Sbjct: 204 EKQKTRSYTGIPAPSTVP---QVNPVEPIFPTVVDPVQSQIFSALQALKVIESDALPYDL 260
Query: 350 CIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGL 409
C RRE+ARW+VSAS+TL+R++ SKVYPAMYIENVT+LAF+DITPEDPDF IQGLAEAGL
Sbjct: 261 CTRREFARWVVSASNTLSRNSASKVYPAMYIENVTELAFEDITPEDPDFPFIQGLAEAGL 320
Query: 410 ISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDI 469
ISSKLS+ ++ E F PESPL+RQDL+SWKMALE RQLPEA+ K LYQLSGF+DI
Sbjct: 321 ISSKLSNHNMPCSESSRFTFSPESPLTRQDLLSWKMALEFRQLPEADSKKLYQLSGFLDI 380
Query: 470 DKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEE 529
D+INP+AWPAL+ADL+AGE GI ALAFG TRLFQP K VT AQ AV+LAIG+A + V EE
Sbjct: 381 DRINPEAWPALIADLSAGEHGITALAFGRTRLFQPAKAVTKAQTAVSLAIGDAFEVVGEE 440
Query: 530 LQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERL 589
L RIEAE+ AEN VS H+ALV +VEK+IN SFEKE E+E +D VEK+AEEA+ EL RL
Sbjct: 441 LARIEAEAMAENVVSAHNALVTQVEKDINASFEKEFLREKEIVDAVEKLAEEAKSELARL 500
Query: 590 RAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEA 649
R E+E + AL +ER +IE+EME L+++R E+EEQL+SL SNK E+SYEKER + L+K+
Sbjct: 501 RVEKEEETFALERERTSIETEMEALARIRNELEEQLQSLASNKAEMSYEKERFDRLQKQV 560
Query: 650 ENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDK 709
E+ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LE AR RWE+ G+KV+VD
Sbjct: 561 EDENQEILRLQNELEVERNALSIARDWAKDEARRAREQAKVLEEARGRWEKYGLKVIVDS 620
Query: 710 DLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLF 769
DL E++ W+ A KQ V+ T+ RA +L+ KLK M DV K +E+IN II KI L
Sbjct: 621 DLHEQTTTESTWLIARKQNPVEGTMKRAGNLIAKLKKMTKDVGEKCREVINLIIEKISLL 680
Query: 770 ISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTA----EFRSNLTEGAKRVAGDCR 825
IS LK+ +A +LK +K K V+E+ + T+ E R+ AK + +
Sbjct: 681 ISALKQQVHGMENKAKDLK----MKTKSKVEEVCRQTSLRVDEIRNISIVKAKETVEELK 736
Query: 826 EGVEKLTQRFKT 837
+ V KL ++FK+
Sbjct: 737 DRVGKLGEKFKS 748
>gi|242037313|ref|XP_002466051.1| hypothetical protein SORBIDRAFT_01g000240 [Sorghum bicolor]
gi|241919905|gb|EER93049.1| hypothetical protein SORBIDRAFT_01g000240 [Sorghum bicolor]
Length = 945
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 321/562 (57%), Positives = 427/562 (75%), Gaps = 4/562 (0%)
Query: 279 ESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLK 338
E+ + F +PP SSP GIPAPS+VS A QV G+++VPA VD Q A++ALQ+LK
Sbjct: 383 ENQNKPFKSTPPDQYFSSP-GIPAPSIVSTASQVPVGQIVVPASVDPTQENAIAALQILK 441
Query: 339 VIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDF 398
VIE + GDLC RREYARWLV AS+ L+R+T SKVYPAMYI+NVT+LAFDD+TPEDPDF
Sbjct: 442 VIEPSARAGDLCTRREYARWLVVASNCLSRNTFSKVYPAMYIDNVTELAFDDVTPEDPDF 501
Query: 399 SSIQGLAEAGLISSKLSHRDLLNEEP---GPIFFLPESPLSRQDLVSWKMALEKRQLPEA 455
IQGLAEAGLISSKLS D+ E I F PESPLSRQDLVSWKM L++RQLPE
Sbjct: 502 PFIQGLAEAGLISSKLSRSDMNIPEDVHDNHILFSPESPLSRQDLVSWKMVLDRRQLPEV 561
Query: 456 NKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAV 515
++ L+++SG+IDIDKIN AWPAL+ADL AG+Q I AL+FG TRLFQP+KPVT QAA+
Sbjct: 562 DRNCLFKVSGYIDIDKINTAAWPALVADLGAGDQSITALSFGFTRLFQPNKPVTKGQAAL 621
Query: 516 ALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVV 575
A++ G++ + V EE+ RIEAE AE AV+ H ALVA+VEK++N FE+EL EREK++ +
Sbjct: 622 AISTGDSGEVVLEEVARIEAEKIAEAAVNAHGALVAQVEKDLNARFERELKEEREKVETL 681
Query: 576 EKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEI 635
EK+AEEAR EL+RLR ERE +K L++ RAA+ESEME+L KLR EVEEQL++++S KVE+
Sbjct: 682 EKLAEEARMELDRLREEREEEKNILLRGRAAVESEMEVLLKLRSEVEEQLQNVLSKKVEV 741
Query: 636 SYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGAR 695
S+EK RI L+KE EN+N + +LQYELEVERKALS+ARAWAE+EAK+ARE A+ALE AR
Sbjct: 742 SFEKSRIEKLQKEIENDNLAVVQLQYELEVERKALSLARAWAEEEAKKAREHARALEDAR 801
Query: 696 DRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKS 755
++WERQGIKVVV+ L++++ A V W NAGK+ VD+ ++RA+SL++KLK+M+ ++ +S
Sbjct: 802 NQWERQGIKVVVEGGLQDDASAGVTWANAGKEHPVDEVINRAESLLEKLKSMSAEMKVRS 861
Query: 756 KEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTE 815
+ + ++ + FI++LK+ A+ A +E + KA E+Q S + F + L +
Sbjct: 862 RGALERVMQHVRSFIASLKQQAADARQWCSEFGASAASKALMVSAEVQGSVSAFGATLGD 921
Query: 816 GAKRVAGDCREGVEKLTQRFKT 837
+KRV +C++G+EK + RFKT
Sbjct: 922 KSKRVMEECKDGLEKFSHRFKT 943
>gi|10177407|dbj|BAB10538.1| unnamed protein product [Arabidopsis thaliana]
Length = 790
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 327/575 (56%), Positives = 407/575 (70%), Gaps = 43/575 (7%)
Query: 299 GIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARW 358
GIPAPS V QV K + P VVD VQ Q +ALQ LKVIE+D P DLC RRE+ARW
Sbjct: 222 GIPAPSTVP---QVDSLKPIFPTVVDPVQSQMFAALQALKVIESDALPYDLCTRREFARW 278
Query: 359 LVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQ---------------- 402
+VSAS+TL+R++ SKVYPAMYIENVT+LAFDDITPEDPDF IQ
Sbjct: 279 VVSASNTLSRNSASKVYPAMYIENVTELAFDDITPEDPDFPFIQGDHRIFCTFTLKSESV 338
Query: 403 ---------------GLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMAL 447
GLAEAGLISSKLS+ ++ + E + F PESPL+RQDL+SWKMAL
Sbjct: 339 KLCLICFLHFSLDSIGLAEAGLISSKLSNNNMPSSESSRVTFSPESPLTRQDLLSWKMAL 398
Query: 448 EKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
E RQLPEA+ K LYQLSGF+DIDKINP+AWPAL+ADL+AGE GI AL+FG TRLFQP K
Sbjct: 399 EFRQLPEADSKKLYQLSGFLDIDKINPEAWPALIADLSAGEHGITALSFGRTRLFQPSKA 458
Query: 508 VTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSM 567
VT AQ AV+LAIG+A + V EEL RIEAE+ AEN V H+ LVA+VEK+IN SFEKEL
Sbjct: 459 VTKAQTAVSLAIGDAFEVVGEELARIEAEAMAENVVCAHNELVAQVEKDINASFEKELLR 518
Query: 568 EREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLES 627
E+E +D VEK+AEEA+ EL RLR E+E + +AL +ER +IE+EME L+++R E+EEQL+S
Sbjct: 519 EKEIVDAVEKLAEEAKSELARLRVEKEEETLALERERTSIETEMEALARIRNELEEQLQS 578
Query: 628 LMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQ 687
L SNK E+SYEKER + L+K+ E+ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQ
Sbjct: 579 LASNKAEMSYEKERFDRLQKQVEDENQEILRLQNELEVERNALSIARDWAKDEARRAREQ 638
Query: 688 AKALEGARDRWERQGIKVVVDKDLREESDAA-VMWVNAGKQFSVDQTVSRAQSLVDKLKA 746
AK LE AR RWE+ G+KV+VD DL E++ W+NAGKQ V+ T+ RA +L+ KLK
Sbjct: 639 AKVLEEARGRWEKYGLKVIVDSDLHEQTTKTESTWLNAGKQNHVEGTMKRAGNLIAKLKK 698
Query: 747 MANDVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQST 806
MA DV KS+E+I II KI L IS LK+ +A +LK +K K +E+ + T
Sbjct: 699 MAKDVGEKSREVIYLIIEKISLLISALKQQVHGMENKAKDLK----IKTKSKAEEVWRQT 754
Query: 807 A----EFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
+ E R+ AK + ++ V KL ++FK+
Sbjct: 755 SLRADEIRNISIVKAKETVEEFKDRVGKLGEKFKS 789
>gi|414874060|tpg|DAA52617.1| TPA: hypothetical protein ZEAMMB73_607077 [Zea mays]
Length = 919
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 318/562 (56%), Positives = 429/562 (76%), Gaps = 4/562 (0%)
Query: 279 ESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLK 338
E+ + +PP SSP GIPAPSVVS ALQV G ++VPA VD Q A++ALQ+LK
Sbjct: 358 ENQNKQLESTPPDQYFSSP-GIPAPSVVSTALQVPAGPIVVPASVDPTQENAIAALQILK 416
Query: 339 VIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDF 398
VIE+ + G+LC RREYARWLV+AS+ L+R+T SKVYPAMYI+NVT+LAFDD+TPEDPDF
Sbjct: 417 VIESSAQAGELCTRREYARWLVAASNCLSRNTFSKVYPAMYIDNVTELAFDDVTPEDPDF 476
Query: 399 SSIQGLAEAGLISSKLSHRDLLNEEP---GPIFFLPESPLSRQDLVSWKMALEKRQLPEA 455
IQGLAEAGLISSKLS D+ E I F PESPLSRQDLVSWKMAL+KRQLPE
Sbjct: 477 PFIQGLAEAGLISSKLSRSDMNIPEDVHDNHILFSPESPLSRQDLVSWKMALDKRQLPEV 536
Query: 456 NKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAV 515
++ L++LSG+IDIDKIN AWPAL ADL AG+Q I ALAFG TRLFQP+KPVT QAA+
Sbjct: 537 DRNCLFKLSGYIDIDKINTAAWPALAADLDAGDQSITALAFGFTRLFQPNKPVTKGQAAL 596
Query: 516 ALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVV 575
A + G++ + V EE+ RIEAE AE AV+ H+ALVA+VEK++N SFE+EL ERE+++ +
Sbjct: 597 AFSAGDSGEVVLEEVARIEAEKIAEAAVNAHAALVAQVEKDLNASFERELKEERERVETL 656
Query: 576 EKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEI 635
EK+AEEAR EL+RLRAERE +K L++ RAA+ESEME+L KLR EVEEQL++++S KVE+
Sbjct: 657 EKVAEEARVELDRLRAEREEEKNILVRGRAAVESEMEVLLKLRSEVEEQLQNVLSKKVEV 716
Query: 636 SYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGAR 695
S+EK RI L+KE E++N + +LQYELEVERKALSMARAWAE+EAK+ARE A+ALE AR
Sbjct: 717 SFEKSRIEKLQKEIESDNSAVVQLQYELEVERKALSMARAWAEEEAKKAREHARALEEAR 776
Query: 696 DRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRAQSLVDKLKAMANDVSGKS 755
++WERQGI+VVV+ +L++++ A V W NAGK+ +VD+++++A++L++KLK M+ ++ +S
Sbjct: 777 NQWERQGIRVVVEGELKDDASAGVTWANAGKENAVDESINQAEALLEKLKTMSGEMEVRS 836
Query: 756 KEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTAEFRSNLTE 815
+ + ++ + FI+ LK+ A+ A E +A +++ S + F + L +
Sbjct: 837 RGAVERVMQHVRSFIAILKQQAADARQWCTEFGACAASRANEVSAQVKGSVSAFGATLGD 896
Query: 816 GAKRVAGDCREGVEKLTQRFKT 837
+KR +C++G+E+++ RFKT
Sbjct: 897 KSKRAMEECKDGLERISHRFKT 918
>gi|22327782|ref|NP_200054.2| uncharacterized protein [Arabidopsis thaliana]
gi|332008827|gb|AED96210.1| uncharacterized protein [Arabidopsis thaliana]
Length = 510
Score = 593 bits (1529), Expect = e-166, Method: Compositional matrix adjust.
Identities = 310/512 (60%), Positives = 389/512 (75%), Gaps = 9/512 (1%)
Query: 331 LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDD 390
+ALQ LKVIE+D P DLC RRE+ARW+VSAS+TL+R++ SKVYPAMYIENVT+LAFDD
Sbjct: 2 FAALQALKVIESDALPYDLCTRREFARWVVSASNTLSRNSASKVYPAMYIENVTELAFDD 61
Query: 391 ITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKR 450
ITPEDPDF IQGLAEAGLISSKLS+ ++ + E + F PESPL+RQDL+SWKMALE R
Sbjct: 62 ITPEDPDFPFIQGLAEAGLISSKLSNNNMPSSESSRVTFSPESPLTRQDLLSWKMALEFR 121
Query: 451 QLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTN 510
QLPEA+ K LYQLSGF+DIDKINP+AWPAL+ADL+AGE GI AL+FG TRLFQP K VT
Sbjct: 122 QLPEADSKKLYQLSGFLDIDKINPEAWPALIADLSAGEHGITALSFGRTRLFQPSKAVTK 181
Query: 511 AQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMERE 570
AQ AV+LAIG+A + V EEL RIEAE+ AEN V H+ LVA+VEK+IN SFEKEL E+E
Sbjct: 182 AQTAVSLAIGDAFEVVGEELARIEAEAMAENVVCAHNELVAQVEKDINASFEKELLREKE 241
Query: 571 KIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMS 630
+D VEK+AEEA+ EL RLR E+E + +AL +ER +IE+EME L+++R E+EEQL+SL S
Sbjct: 242 IVDAVEKLAEEAKSELARLRVEKEEETLALERERTSIETEMEALARIRNELEEQLQSLAS 301
Query: 631 NKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKA 690
NK E+SYEKER + L+K+ E+ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK
Sbjct: 302 NKAEMSYEKERFDRLQKQVEDENQEILRLQNELEVERNALSIARDWAKDEARRAREQAKV 361
Query: 691 LEGARDRWERQGIKVVVDKDLREESDAA-VMWVNAGKQFSVDQTVSRAQSLVDKLKAMAN 749
LE AR RWE+ G+KV+VD DL E++ W+NAGKQ V+ T+ RA +L+ KLK MA
Sbjct: 362 LEEARGRWEKYGLKVIVDSDLHEQTTKTESTWLNAGKQNHVEGTMKRAGNLIAKLKKMAK 421
Query: 750 DVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTA-- 807
DV KS+E+I II KI L IS LK+ +A +LK +K K +E+ + T+
Sbjct: 422 DVGEKSREVIYLIIEKISLLISALKQQVHGMENKAKDLK----IKTKSKAEEVWRQTSLR 477
Query: 808 --EFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
E R+ AK + ++ V KL ++FK+
Sbjct: 478 ADEIRNISIVKAKETVEEFKDRVGKLGEKFKS 509
>gi|18491169|gb|AAL69487.1| unknown protein [Arabidopsis thaliana]
Length = 510
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 309/512 (60%), Positives = 388/512 (75%), Gaps = 9/512 (1%)
Query: 331 LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDD 390
+ALQ LKVIE+D P DLC RRE+ARW+VSAS+TL+R++ SKVYPAMYIENVT+LAFDD
Sbjct: 2 FAALQALKVIESDALPYDLCTRREFARWVVSASNTLSRNSASKVYPAMYIENVTELAFDD 61
Query: 391 ITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKR 450
ITPEDPDF IQGLAEAGLISSKLS+ ++ + E + F PESPL+RQDL+SWKMALE R
Sbjct: 62 ITPEDPDFPFIQGLAEAGLISSKLSNNNMPSSESSRVTFSPESPLTRQDLLSWKMALEFR 121
Query: 451 QLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTN 510
QLPEA+ K LYQLSGF+DIDKINP+AWPAL+ADL+AGE I AL+FG TRLFQP K VT
Sbjct: 122 QLPEADSKKLYQLSGFLDIDKINPEAWPALIADLSAGEHEITALSFGRTRLFQPSKAVTK 181
Query: 511 AQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMERE 570
AQ AV+LAIG+A + V EEL RIEAE+ AEN V H+ LVA+VEK+IN SFEKEL E+E
Sbjct: 182 AQTAVSLAIGDAFEVVGEELARIEAEAMAENVVCAHNELVAQVEKDINASFEKELLREKE 241
Query: 571 KIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMS 630
+D VEK+AEEA+ EL RLR E+E + +AL +ER +IE+EME L+++R E+EEQL+SL S
Sbjct: 242 IVDAVEKLAEEAKSELARLRVEKEEETLALERERTSIETEMEALARIRNELEEQLQSLAS 301
Query: 631 NKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKA 690
NK E+SYEKER + L+K+ E+ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK
Sbjct: 302 NKAEMSYEKERFDRLQKQVEDENQEILRLQNELEVERNALSIARDWAKDEARRAREQAKV 361
Query: 691 LEGARDRWERQGIKVVVDKDLREESDAA-VMWVNAGKQFSVDQTVSRAQSLVDKLKAMAN 749
LE AR RWE+ G+KV+VD DL E++ W+NAGKQ V+ T+ RA +L+ KLK MA
Sbjct: 362 LEEARGRWEKYGLKVIVDSDLHEQTTKTESTWLNAGKQNHVEGTMKRAGNLIAKLKKMAK 421
Query: 750 DVSGKSKEIINTIIHKILLFISNLKKWASKASMRAAELKDATILKAKGSVQELQQSTA-- 807
DV KS+E+I II KI L IS LK+ +A +LK +K K +E+ + T+
Sbjct: 422 DVGEKSREVIYLIIEKISLLISALKQQVHGMENKAKDLK----IKTKSKAEEVWRQTSLR 477
Query: 808 --EFRSNLTEGAKRVAGDCREGVEKLTQRFKT 837
E R+ AK + ++ V KL ++FK+
Sbjct: 478 ADEIRNISIVKAKETVEEFKDRVGKLGEKFKS 509
>gi|222422976|dbj|BAH19472.1| AT5G23890 [Arabidopsis thaliana]
Length = 755
Score = 544 bits (1402), Expect = e-152, Method: Compositional matrix adjust.
Identities = 284/451 (62%), Positives = 347/451 (76%), Gaps = 16/451 (3%)
Query: 199 ISVSSDTTVEPQILPKGDTET-VASPSTIKNVEQSEKPLLSGEDSSSSMEVHDLNKNGSS 257
+S D+T +PQI+P DTET A+ + V + + + + SS + D++
Sbjct: 311 VSSQLDSTSKPQIVPLNDTETAFATAEELSEVNGTPEYFETSDWSS----ISDIDTTKEL 366
Query: 258 GTSVSPSIFPFSNEKETCDLN------ESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQ 311
+S SP P S + +LN ++ E P GS+ S AGIPAP + ++
Sbjct: 367 ESSKSP--VPESTDGSKDELNIYSQDELDDNRMLLEIPSGGSAFSSAGIPAPFM---SVI 421
Query: 312 VLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTM 371
V PGK+LVP DQ+Q QA +ALQVLKVIE D +P DLC RREYARWL+SASS L+R+T
Sbjct: 422 VNPGKILVPVAADQIQCQAFAALQVLKVIETDTQPSDLCTRREYARWLISASSALSRNTT 481
Query: 372 SKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLP 431
SKVYPAMYIENVT+LAFDDITPEDPDFSSIQGLAEAGLI+SKLS+RDLL++ G F P
Sbjct: 482 SKVYPAMYIENVTELAFDDITPEDPDFSSIQGLAEAGLIASKLSNRDLLDDVEGTFLFSP 541
Query: 432 ESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGI 491
ES LSRQDL+SWKMALEKRQLPEA+KK+LY+LSGFIDIDKINPDAWP+++ADL+ GEQGI
Sbjct: 542 ESLLSRQDLISWKMALEKRQLPEADKKMLYKLSGFIDIDKINPDAWPSIIADLSTGEQGI 601
Query: 492 IALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVA 551
ALAFGCTRLFQP KPVT QAA+AL+ GEASD V+EEL RIEAES AE AVS H+ALVA
Sbjct: 602 AALAFGCTRLFQPHKPVTKGQAAIALSSGEASDIVSEELARIEAESMAEKAVSAHNALVA 661
Query: 552 EVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEM 611
EVEK++N SFEKELSMEREKI+ VEKMAE A+ ELE+LR +RE + +AL+KERAA+ESEM
Sbjct: 662 EVEKDVNASFEKELSMEREKIEAVEKMAELAKVELEQLREKREEENLALVKERAAVESEM 721
Query: 612 EILSKLRREVEEQLESLMSNKVEISYEKERI 642
E+LS+LRR+ EE+LE LMSNK EI++EKER+
Sbjct: 722 EVLSRLRRDAEEKLEDLMSNKAEITFEKERV 752
>gi|108712244|gb|ABG00039.1| expressed protein [Oryza sativa Japonica Group]
Length = 444
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 259/409 (63%), Positives = 324/409 (79%), Gaps = 4/409 (0%)
Query: 269 SNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQG 328
SN+ E D E+ +S + + P S +S +GIPAP+++SAAL+V G+++VPA VD Q
Sbjct: 37 SNQNEGADELENQNSLYESTTPDKSFAS-SGIPAPTLLSAALRVRTGQIMVPAAVDPAQA 95
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
AL+ALQVLKVIE D + GDLC RREYARWLV AS+ L+R+T SKVYPAMYIENVT+LAF
Sbjct: 96 SALAALQVLKVIEPDAQAGDLCTRREYARWLVVASNCLSRNTSSKVYPAMYIENVTELAF 155
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDL---LNEEPGPIFFLPESPLSRQDLVSWKM 445
DDITPED DF IQGLAEAGLISSKLS D+ L+ + F PE P+SRQDLVSWKM
Sbjct: 156 DDITPEDFDFPFIQGLAEAGLISSKLSRSDMNVPLDVDNLHNLFSPECPVSRQDLVSWKM 215
Query: 446 ALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
AL+KRQLPE +K +Y+ SG++D+DKIN AWPAL+ADL AG+Q I ALAFG TRLFQPD
Sbjct: 216 ALDKRQLPEVDKTSMYKASGYMDVDKINAAAWPALVADLDAGDQSITALAFGFTRLFQPD 275
Query: 506 KPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKEL 565
KPVT Q A+AL+ G+++D V EEL RIEAE AE+AV+ H LVA+VEK++N +FE+EL
Sbjct: 276 KPVTKGQVALALSTGDSADVVMEELARIEAEKIAEDAVNAHGELVAQVEKDLNATFEREL 335
Query: 566 SMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQL 625
+ EREKI+ +EK+AEEAR EL++LRAER + AL++ RA++ESEME+LSKLR EVEEQL
Sbjct: 336 TKEREKIERLEKLAEEARVELDKLRAERVEENNALIRGRASVESEMEVLSKLRSEVEEQL 395
Query: 626 ESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMAR 674
+S++S KVEIS+EK RI L+ E EN+ Q + +LQYELEVERKALSMAR
Sbjct: 396 QSVLSKKVEISFEKNRIEKLQTEIENDRQAVVQLQYELEVERKALSMAR 444
>gi|168049981|ref|XP_001777439.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671170|gb|EDQ57726.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 517
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 216/516 (41%), Positives = 332/516 (64%), Gaps = 17/516 (3%)
Query: 337 LKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDP 396
++V+E +V P +C RR YARWL++ SS LTRS +K+ PAMYIE T+LAFDDITP DP
Sbjct: 1 MQVVEDEVDPAAVCTRRNYARWLLATSSKLTRSAANKILPAMYIEEETELAFDDITPGDP 60
Query: 397 DFSSIQGLAEAGLISSKLSHRDLLNEE--PGPIFFLPESPLSRQDLVSWKMALEKRQLPE 454
DFS+IQGLAEAGLI SKLS D + E G + F P+SPLSRQDLVSWK++L++R LP
Sbjct: 61 DFSAIQGLAEAGLIPSKLSSMDTDSGEGETGGVLFSPDSPLSRQDLVSWKISLDRRSLPV 120
Query: 455 ANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAA 514
+K+ SGF+D+D+I WPA++ DL +GE IIA AFG TR+FQP KP T Q A
Sbjct: 121 ISKEDFQAQSGFMDVDRIESKVWPAIVTDLYSGESSIIATAFGFTRMFQPQKPATIGQVA 180
Query: 515 VALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDV 574
+ALA G+ SD + EE+ R+EAE A+ AV+ +A+ A +KE+ F++E+ ER+ +
Sbjct: 181 IALATGDTSDQLGEEVARLEAERMADEAVAADAAMEARTQKEVKALFDEEIETERKLREE 240
Query: 575 VEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVE 634
EK+ EA+ LE++ ER+ ++ +L+K +A +E+E ++L + +V+EQL++L + +VE
Sbjct: 241 AEKLLAEAKTNLEKITTERDAERDSLLKGQADVEAEKDLLYDTQYKVDEQLQALATLRVE 300
Query: 635 ISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGA 694
ISYEKER+ L + E + + +RL+ E++ E+K+L +AR AE+EA++ARE A+ LE A
Sbjct: 301 ISYEKERLQKLSSKIEQDQESASRLRTEIDSEKKSLVLARLEAEEEAQKARELARVLEEA 360
Query: 695 RDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQTVSRA--QSLVDK---LKAMAN 749
R W +GI++ VDK + + W G +++ + RA Q ++DK LK N
Sbjct: 361 RQHWAGRGIEIHVDKSFDDNNIPGPSWRYTGGNTDLEKVLHRAPLQDVIDKGENLKTRIN 420
Query: 750 DVSGKSK----EIINTIIHKILLFISNLKKWASKASMRA-----AELKDATILKAKGSVQ 800
+ + ++++ +KIL + +++ +S+ S+ ++D + A G ++
Sbjct: 421 NGVLRYWHLLLQVVSRFYYKILELLGQIRRKSSQLSLDTFSHVNHRMEDTRSVVA-GKIR 479
Query: 801 ELQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFK 836
+Q + + + EG+KR A CR GV K++QRFK
Sbjct: 480 GVQDAVLDASAGAMEGSKRFADGCRSGVGKISQRFK 515
>gi|168061250|ref|XP_001782603.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665923|gb|EDQ52592.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 538
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 226/540 (41%), Positives = 341/540 (63%), Gaps = 24/540 (4%)
Query: 317 VLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
++V VVD++Q A+SALQ LKV+E +V PG +CIRR YARWL++ S+TL+RS+ +KV P
Sbjct: 1 MVVLTVVDRMQEMAVSALQALKVVEDEVDPGAICIRRNYARWLIATSNTLSRSSATKVLP 60
Query: 377 AMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGP--IFFLPESP 434
AMYIE T+LAFDDITPEDPDF +IQGLAEAGLI SKLS D+ + E + F P+SP
Sbjct: 61 AMYIEGETELAFDDITPEDPDFPAIQGLAEAGLIPSKLSSVDIQSAEKASSGVRFSPDSP 120
Query: 435 LSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIAL 494
L+RQDL+SWK+AL++R L +K+ SGF+D D I WPA+++DL +G+ IIA
Sbjct: 121 LTRQDLLSWKIALDRRSLAAISKEDFQAQSGFMDADYIESKLWPAIVSDLYSGDSSIIAT 180
Query: 495 AFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVE 554
AFG TR+FQP KP TN QAA+ALA G+ SD EEL R++AE A++AV+ +A+ A +
Sbjct: 181 AFGFTRMFQPQKPATNGQAAIALASGDTSDLFGEELARLQAERMADDAVAADAAMEARAQ 240
Query: 555 KEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEIL 614
+E+ + E+ ER++ + VEK EE + LER+ AER+ +K +MK +AA+ +E ++L
Sbjct: 241 EEVKALYSGEIESERKRREEVEKSFEEVKSNLERVEAERQSEKETMMKSQAAVVAEKKLL 300
Query: 615 SKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMA- 673
L ++V++QL++L + + E++ EKER++ L + E + + +RL+ +LE E+KAL +A
Sbjct: 301 HDLYQKVDDQLQTLSTLRAEVANEKERLHKLTSKVEVDQESASRLKADLESEKKALVLAS 360
Query: 674 RAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQT 733
R WAE+EA++AREQA+ L AR RW +GI V VDK E++ W G +++
Sbjct: 361 RTWAEEEAEKAREQARVLGEARQRWAGRGIDVNVDKSFDEDNVPGPSWRFGGANTETNKS 420
Query: 734 VSRA--QSLVDKLKAMANDVSGKSKEIINTIIHKIL----LFISNLKKWASKASMRAAEL 787
+ RA Q ++DK D+ K I T H L F+ +++ R +++
Sbjct: 421 IQRAPLQDVMDK----GQDLKTKVNNSIVTYWHAFLDVLSRFVQRIRELLELMRSRVSQV 476
Query: 788 KDATILKAKGSVQE-----------LQQSTAEFRSNLTEGAKRVAGDCREGVEKLTQRFK 836
+ + + VQ+ + + ++ + +G KR A C+ V ++ QRFK
Sbjct: 477 IQSVFVSTRDRVQDSGSVVSGKLRGAKSAVSDMSAVAIDGTKRFADGCKTEVGRIAQRFK 536
>gi|302767100|ref|XP_002966970.1| hypothetical protein SELMODRAFT_63515 [Selaginella moellendorffii]
gi|300164961|gb|EFJ31569.1| hypothetical protein SELMODRAFT_63515 [Selaginella moellendorffii]
Length = 377
Score = 362 bits (930), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 203/382 (53%), Positives = 266/382 (69%), Gaps = 6/382 (1%)
Query: 331 LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDD 390
+ ++V+E V +C RREYARWLV+A+ TL R+T +KV PAMYIE VT+ AFDD
Sbjct: 1 FYSFSFVQVVEPGVGASTICTRREYARWLVAANRTLARNTGAKVSPAMYIEKVTEAAFDD 60
Query: 391 ITPEDPDFSSIQ-GLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEK 449
++PEDPDF IQ GLAEAGLI SKLS + GPI FLP+ PLSRQDL+SWK A+E
Sbjct: 61 VSPEDPDFPFIQAGLAEAGLIFSKLSRGP---DSDGPIHFLPDRPLSRQDLISWKFAVEN 117
Query: 450 RQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVT 509
LP AN+ L + FIDID I+ D WPA+ AD+ AG++ II+ AFG TRLFQP KPVT
Sbjct: 118 HSLPVANRNKLQE--RFIDIDNIHTDVWPAIAADVAAGDRSIISSAFGYTRLFQPHKPVT 175
Query: 510 NAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMER 569
QAAVAL+ GEAS+ + EEL+R+EAE AE AV+ AL A +KE N F +EL +R
Sbjct: 176 TGQAAVALSSGEASEHIGEELERLEAERHAEKAVAAEIALEARAQKEANAVFREELDRQR 235
Query: 570 EKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLM 629
+ E +AE R+ELE+L++ERE +K +MKERA++++ E LS+ R EV+E L+ L
Sbjct: 236 QLTVEAEAVAERLREELEKLKSEREEEKYGVMKERASLDAAKEALSRARLEVDELLQGLS 295
Query: 630 SNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAK 689
S KV++ +E++R+ L E E E + ++ E +VE+KAL +AR WAE+EAK+A AK
Sbjct: 296 SEKVKVVFERDRMEKLLAEIEEERDTLENVKSETQVEKKALVLARTWAEEEAKKAMAHAK 355
Query: 690 ALEGARDRWERQGIKVVVDKDL 711
LE AR RWE QGI+V VDKDL
Sbjct: 356 VLEEARKRWESQGIEVHVDKDL 377
>gi|302755236|ref|XP_002961042.1| hypothetical protein SELMODRAFT_74805 [Selaginella moellendorffii]
gi|300171981|gb|EFJ38581.1| hypothetical protein SELMODRAFT_74805 [Selaginella moellendorffii]
Length = 399
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 202/377 (53%), Positives = 262/377 (69%), Gaps = 11/377 (2%)
Query: 300 IPAPSVVSAALQVLP-GKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARW 358
IPAPS SA LP G+VL+ VVD Q QALSALQ LKV+E V +C RREYARW
Sbjct: 32 IPAPSAPSA----LPSGRVLIAPVVDHGQEQALSALQSLKVVEPGVGASTICTRREYARW 87
Query: 359 LVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQ-GLAEAGLISSKLSHR 417
LV+A+ TL R+T +KV PAMYIE VT+ AFDD++PEDPDF IQ GLAEAGLI SKLS
Sbjct: 88 LVAANRTLARNTGAKVSPAMYIEKVTEAAFDDVSPEDPDFPFIQAGLAEAGLIFSKLSRG 147
Query: 418 DLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAW 477
+ GPI FLP+ PLSRQDL+SWK A+E LP AN+ L + FIDID I+ D W
Sbjct: 148 P---DSDGPIHFLPDRPLSRQDLISWKFAVENHSLPVANRNKLQE--RFIDIDNIHTDVW 202
Query: 478 PALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAES 537
PA+ AD+ AG++ II+ AFG TRLFQP KPVT QAAVAL+ GEAS+ + EEL+R+EAE
Sbjct: 203 PAIAADVAAGDRSIISSAFGYTRLFQPHKPVTTGQAAVALSSGEASEHIGEELERLEAER 262
Query: 538 AAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDK 597
AE AV+ AL A +KE N F +EL +R+ E +AE R+ELE+L++ERE +K
Sbjct: 263 HAEKAVAAEIALEARAQKEANAVFREELDRQRQLTVEAEAVAERLREELEKLKSEREEEK 322
Query: 598 IALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIA 657
+MKERA++++ E LS+ R EV++ L+ L S KV++ +E++R+ L E E E +
Sbjct: 323 YGVMKERASLDAAKEALSRARLEVDDLLQGLSSEKVKVVFERDRMEKLLAEIEEERDTLE 382
Query: 658 RLQYELEVERKALSMAR 674
++ E +VE+KAL +AR
Sbjct: 383 NVKSETQVEKKALVLAR 399
>gi|168029069|ref|XP_001767049.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681791|gb|EDQ68215.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 360
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 176/362 (48%), Positives = 249/362 (68%), Gaps = 10/362 (2%)
Query: 338 KVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPD 397
+V+E P +C RR+YARW++S SSTL+RS +KV PAMYIE VT AF DI +DPD
Sbjct: 1 QVVEEGADPRAICNRRDYARWIISFSSTLSRSPANKVLPAMYIEGVTKQAFADIASDDPD 60
Query: 398 FSSIQGLAEAGLISSKLSHRDLLNEEPGP-------IFFLPESPLSRQDLVSWKMALEKR 450
F IQGLAEAGLI S LS L+NE+ G ++F P+SP++RQDLVSWK+AL +R
Sbjct: 61 FPYIQGLAEAGLIPSNLS---LINEDRGTYDSDSDVMYFFPDSPVTRQDLVSWKVALGRR 117
Query: 451 QLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTN 510
LP +K+ L SGF+D+D+I+ WP L DL +GE II AFG TR+FQP+KP T
Sbjct: 118 SLPTIDKETLKAKSGFLDVDRIDNTLWPLLSDDLDSGENSIILSAFGFTRIFQPEKPATV 177
Query: 511 AQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMERE 570
QAA+ALA G S+ EEL R +AE A A+ A+ +KE++E F+ +LS ER
Sbjct: 178 GQAAIALACGNTSEKFGEELARYQAEWTAHEVAIADDAMKAQKQKELDELFDGQLSAERR 237
Query: 571 KIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMS 630
+ ++ ++ EE R E ER+++ER+ +K L+K++AA+ESE E+L L+ +V+EQL++L +
Sbjct: 238 QKELAQQRFEELRAEFERMKSERDAEKGVLLKDKAAVESEKELLGHLKEQVDEQLQALTT 297
Query: 631 NKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKA 690
++++S E++R+ LR E +E++R+ +ELEVE+KAL AR WAEDEA+ AR A+A
Sbjct: 298 REMQVSIEQDRLENLRSTCEGHEEELSRVTFELEVEKKALMQARFWAEDEARNARAHAEA 357
Query: 691 LE 692
LE
Sbjct: 358 LE 359
>gi|413944417|gb|AFW77066.1| hypothetical protein ZEAMMB73_947659 [Zea mays]
Length = 561
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 148/397 (37%), Positives = 222/397 (55%), Gaps = 18/397 (4%)
Query: 313 LPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMS 372
+ L D V +A S L+ L++IE DV D C RRE+ARW V SS R M
Sbjct: 167 IASHFLFRVHTDPVHEEAFSILKKLQIIEKDVSSSDFCTRREFARWFVKLSSKFERKRMC 226
Query: 373 KVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH--RDLLNEEP---GPI 427
++ P + AFDD+ +DPDF IQ L E+G++SSKLS+ L + P G
Sbjct: 227 RIVPNKLTSDTVQCAFDDVNIDDPDFLYIQSLGESGIVSSKLSNSLEMLTSGSPCSKGNT 286
Query: 428 FFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAG 487
FLP+S LSR DLV+WK+ +E + ++KI Q +D+ + PD P++L +L AG
Sbjct: 287 LFLPDSYLSRFDLVNWKVLVEHPRALRIDEKIPSQNVCILDL-RACPDVSPSMLIELMAG 345
Query: 488 EQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHS 547
E II+ FG TR QP KPVT AQAA AL G +AV+EEL ++EAE+ A H
Sbjct: 346 ENNIISRVFGNTRRLQPGKPVTKAQAAAALTSGRMKEAVHEELNKLEAENQA------HL 399
Query: 548 ALVAEVEKE------INESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALM 601
+L+AE+ +E I + +E+++ E+E+ V+ + E R +RE + L+
Sbjct: 400 SLIAEIMEELISRGDIQQQWEQKMKKEQERALEVDNNLQHVLHEHANERTDREEELADLL 459
Query: 602 KERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQY 661
KERAA+E + + L LR E++ + L + E+ +++ + L + +++Q + +
Sbjct: 460 KERAALECQNQELINLRSEIDGMYDRLATENEEVMADQQTLENLMSDMTSKHQAVNEAKS 519
Query: 662 ELEVERKALSMARAWAEDEAKRAREQAKALEGARDRW 698
LE E++AL+M R W EDEA R E+A+ LE A RW
Sbjct: 520 YLEAEKEALTMLRTWVEDEAGRVHERAETLEKALRRW 556
>gi|326526257|dbj|BAJ97145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 531
Score = 239 bits (609), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 122/180 (67%), Positives = 140/180 (77%), Gaps = 4/180 (2%)
Query: 281 NSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVI 340
N + +S G S SP GIPAPS++SAALQV G+++VPA VD QG AL+ALQVLKVI
Sbjct: 342 NQNDLFKSAAHGKSFSP-GIPAPSLLSAALQVPAGQIVVPAAVDPTQGNALAALQVLKVI 400
Query: 341 EADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSS 400
E PGDLC RREYARWLV AS++L+R+T SKVYPAMYIENV++LAFDD+T EDPDF
Sbjct: 401 EPGALPGDLCTRREYARWLVVASNSLSRNTYSKVYPAMYIENVSELAFDDVTTEDPDFPF 460
Query: 401 IQGLAEAGLISSKLSHRDL---LNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANK 457
IQGLAEAGLISSKLS D+ N + FF PESPLSRQDLVSWKMAL+KRQLPE K
Sbjct: 461 IQGLAEAGLISSKLSRSDMDIDENVQNNHYFFSPESPLSRQDLVSWKMALDKRQLPEVEK 520
>gi|356498348|ref|XP_003518015.1| PREDICTED: uncharacterized protein LOC100803220 [Glycine max]
Length = 535
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 148/401 (36%), Positives = 239/401 (59%), Gaps = 21/401 (5%)
Query: 314 PGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSK 373
P +V++P VD Q +ALS L+ LK+IE DV+ +LC RRE+ARWLV ++S+L RS
Sbjct: 144 PERVVIPVSVDSTQEEALSVLKSLKIIEDDVEANELCTRREFARWLVKSNSSLERSPKHM 203
Query: 374 VYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLN----EEPGPIFF 429
+ P + + AFDD+ +DPDF SIQ LAEAG+I SKLS + N + I F
Sbjct: 204 IAPIVSLSGSVVTAFDDVGIDDPDFRSIQVLAEAGVIPSKLSWNNSFNYGGSDSQENINF 263
Query: 430 LPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQ 489
P+ +SRQDL+ W+ LE +I + +G++D+ +I PA+ D+ AG+
Sbjct: 264 YPDRFISRQDLIDWRAQLEYDFFSGVVDEISIKKAGYMDVKEITS---PAVYVDMLAGDT 320
Query: 490 GIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSAL 549
I+ FG ++ FQP+KP T AQAAVAL G +A++ EL RIEAE++A ++E +
Sbjct: 321 SILRKVFGQSKRFQPNKPSTKAQAAVALTSGRMKEAISAELSRIEAENSAR--LAEAGEI 378
Query: 550 VAEV--EKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREV--DKIA--LMKE 603
+E+ EI ++++L E+ + VE++ + LE E E+ DKI+ +KE
Sbjct: 379 WSELLSRGEIQRFWDEKLIEEKNRGFDVERLYHVEVKNLE----EEEINQDKISAEYLKE 434
Query: 604 RAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYEL 663
+A ++ + ++L L++EV+E E + S +V E++ + L ++ E +++E+ + L
Sbjct: 435 KATMDCQKQLLLNLKKEVDEISEKVASERVTYVDERDVVQKLHEDLEFKHEELLNTKSTL 494
Query: 664 EVERKALSMARAWAEDEAKRAREQAKALE--GARDRWERQG 702
E E++AL + R+W EDEA+R++ +A LE G R +W+ Q
Sbjct: 495 EAEKEALQILRSWVEDEARRSQARAAVLEEVGRRWKWDDQA 535
>gi|449435256|ref|XP_004135411.1| PREDICTED: uncharacterized protein LOC101214855 [Cucumis sativus]
Length = 595
Score = 236 bits (602), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 145/392 (36%), Positives = 230/392 (58%), Gaps = 6/392 (1%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVY 375
+V++ VD Q +ALS L+ LKVIE D+ G+LC RREYARWLV S+L R+ +
Sbjct: 205 RVIIAIPVDSTQDEALSILKKLKVIEEDINAGELCSRREYARWLVHMYSSLERNPKHHII 264
Query: 376 PAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLS---HRDLLNEEPGPIFFLPE 432
P++ + T AFDDI+ EDPDF SIQ LAEAG++ SKLS D L ++ FF PE
Sbjct: 265 PSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDQERTYFF-PE 323
Query: 433 SPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGII 492
+SRQ L+ WK+ L+ +P ++I F+D+ +I+ +A P L D+ AGE+ I+
Sbjct: 324 RFVSRQTLIDWKVQLDYEFVPGMLERISSAKVDFMDLKEISSEASPQLFMDILAGERSIL 383
Query: 493 ALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAE 552
FG + FQP+KP T AQ AV LA G ++A+ EL R+E+ES+A A E L
Sbjct: 384 RKVFGQIKRFQPNKPATKAQVAVTLASGRMAEAIAAELSRLESESSARKAEIEDIKLELV 443
Query: 553 VEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEME 612
+I ++K+L+ E++++ VE++ A L + +E +KE+A+I+ + +
Sbjct: 444 ERGDIQRYWDKKLTEEKKRLLDVEELYLAAISNLGEEKMVQEKIFSEYLKEKASIDCQRQ 503
Query: 613 ILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSM 672
+L L EV+ E ++S + E+ ++ + + +N+ + + + LE E++AL +
Sbjct: 504 LLLSLNEEVDGIAEKILSERSVCETEQNELHNMHTDLQNQLEGMLDTKSVLEAEKEALRI 563
Query: 673 ARAWAEDEAKRAREQAKALE--GARDRWERQG 702
R W EDEA++++ +AK LE G R +W+ Q
Sbjct: 564 LRTWVEDEARKSQARAKVLEEVGRRWKWDDQA 595
>gi|449493510|ref|XP_004159324.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101228994
[Cucumis sativus]
Length = 1097
Score = 236 bits (602), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 144/391 (36%), Positives = 229/391 (58%), Gaps = 7/391 (1%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVY 375
+V++ VD Q +ALS L+ LKVIE D+ G+LC RREYARWLV S+L R+ +
Sbjct: 708 RVIIAIPVDSTQDEALSILKKLKVIEEDINAGELCSRREYARWLVHMYSSLERNPKHHII 767
Query: 376 PAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLS---HRDLLNEEPGPIFFLPE 432
P++ + T AFDDI+ EDPDF SIQ LAEAG++ SKLS D L ++ F PE
Sbjct: 768 PSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDQERT--FFPE 825
Query: 433 SPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGII 492
+SRQ L+ WK+ L+ +P ++I F+D+ +I+ +A P L D+ AGE+ I+
Sbjct: 826 RFVSRQTLIDWKVQLDYEFVPGMLERISSAKVDFMDLKEISSEASPQLFMDILAGERSIL 885
Query: 493 ALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAE 552
FG + FQP+KP T AQ AV LA G ++A+ EL R+E+ES+A A E L
Sbjct: 886 RKVFGQIKRFQPNKPATKAQVAVTLASGRMAEAIAAELSRLESESSARKAEIEDIKLELV 945
Query: 553 VEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEME 612
+I ++K+L+ E++++ VE++ A L + +E +KE+A+I+ + +
Sbjct: 946 ERGDIQRYWDKKLTEEKKRLLDVEELYLAAISNLGEEKMVQEKIFSEYLKEKASIDCQRQ 1005
Query: 613 ILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSM 672
+L L EV+ E ++S + E+ ++ + + +N+ + + + LE E++AL +
Sbjct: 1006 LLLSLNEEVDGIAEKILSERSVCETEQNELHNMHTDLQNQLEGMLDTKSVLEAEKEALRI 1065
Query: 673 ARAWAEDEAKRAREQAKALE--GARDRWERQ 701
R W EDEA++++ +AK LE G R +W+ Q
Sbjct: 1066 LRTWVEDEARKSQARAKVLEEVGRRWKWDDQ 1096
>gi|223974903|gb|ACN31639.1| unknown [Zea mays]
Length = 560
Score = 236 bits (601), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 146/385 (37%), Positives = 216/385 (56%), Gaps = 17/385 (4%)
Query: 324 DQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENV 383
D V +ALS L+ L++IE DV D C R+E+ARW V S R M ++ P
Sbjct: 178 DPVHEEALSVLKKLQIIEKDVSSSDFCTRKEFARWFVKLCSKFERKKMQRIVPNKLTSGT 237
Query: 384 TDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH--RDLLNEEP--GPIFFLPESPLSRQD 439
AFDD+ + PDF IQ L E+G+ISSKLS+ L P G FLP+S LSR D
Sbjct: 238 VQCAFDDVNIDHPDFLYIQSLGESGIISSKLSNSLETLTTGSPSQGNSLFLPDSYLSRFD 297
Query: 440 LVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCT 499
LV+WK+ +E E ++K+L Q +D+ + PD P++L +L AGE II+ FG T
Sbjct: 298 LVNWKVLVEHPCALEIDQKMLSQNVCILDL-RACPDVSPSMLIELMAGENSIISRVFGNT 356
Query: 500 RLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKE--- 556
R QP KPVT AQAA AL G +A+ EEL R+EA++ A +V +AE+ +E
Sbjct: 357 RRLQPHKPVTKAQAAAALTSGRMKEAIQEELNRLEADNQARLSV------IAEITEELIN 410
Query: 557 ---INESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEI 613
I + +E+++ E+E+ V+ + EL RA+RE + L+KERAA+E + +
Sbjct: 411 RGDIQQQWEEKMKTEQERALEVDNNLQHVLDELANERADREEELAVLLKERAALERKNQE 470
Query: 614 LSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMA 673
L LR EV+ + L + E+ +++ + + +++Q + + LE E++AL+M
Sbjct: 471 LINLRLEVDGMYDRLATENEEVMADQQTLENQLSDMTSKHQAVNEAKSYLEAEKEALTML 530
Query: 674 RAWAEDEAKRAREQAKALEGARDRW 698
R W EDEA E+A+ LE A RW
Sbjct: 531 RTWVEDEAAHVHERAETLEKALRRW 555
>gi|297814772|ref|XP_002875269.1| hypothetical protein ARALYDRAFT_904734 [Arabidopsis lyrata subsp.
lyrata]
gi|297321107|gb|EFH51528.1| hypothetical protein ARALYDRAFT_904734 [Arabidopsis lyrata subsp.
lyrata]
Length = 558
Score = 234 bits (598), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 147/385 (38%), Positives = 226/385 (58%), Gaps = 4/385 (1%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVY 375
+V +P VD Q +A++ L+ LK+IE DV +LC RREYARWLV ++ L R+ M ++
Sbjct: 172 RVTIPVAVDAAQQEAIAVLKKLKIIEDDVVADELCTRREYARWLVRSNLLLERNPMHRIV 231
Query: 376 PAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPL 435
PA+ + + AFDDI DPDF IQ LAEAG+ SSKLS +D N+ G F PES +
Sbjct: 232 PAVALAGSSIPAFDDINTADPDFEYIQALAEAGITSSKLSGKDSQNDS-GNNNFYPESFV 290
Query: 436 SRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALA 495
SR DLV+WK LE PE ++I +ID INPD D G++ I
Sbjct: 291 SRLDLVNWKAQLECGFHPEIMEEISRTKVDYIDTKNINPDMALGFFLDFLTGDKSTIRNV 350
Query: 496 FGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEK 555
FG + FQP++PVT AQAAVAL G+ A++EEL R+EAES ++ A +E + +E+ K
Sbjct: 351 FGRIKRFQPNRPVTKAQAAVALTSGKMVKAISEELSRLEAESLSQKAETEE--IRSELLK 408
Query: 556 -EINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEIL 614
EI + +++++ ER + +E++ ELE + +E +KE+AA + + ++L
Sbjct: 409 GEIRQFWDEKIQAERSRGVEMEELYLSRVSELEEEKNTQEKWFAERLKEKAATDCQKQLL 468
Query: 615 SKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMAR 674
L E++E + L+S+K E ++ + + +++ + + + LE E +AL + R
Sbjct: 469 HSLSEEIDEMSQRLISDKSVYLTEHSKLQEMLSDIQSKLESLVDKRSILEAEIEALRILR 528
Query: 675 AWAEDEAKRAREQAKALEGARDRWE 699
+W EDEAK ++ +AK LE A RW+
Sbjct: 529 SWIEDEAKASQARAKVLEEAGRRWK 553
>gi|356502489|ref|XP_003520051.1| PREDICTED: uncharacterized protein LOC100813930 [Glycine max]
Length = 536
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 146/401 (36%), Positives = 238/401 (59%), Gaps = 21/401 (5%)
Query: 314 PGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSK 373
P +V++P VD Q +ALS L+ LK+IE DV+ +LC RRE+ARWLV +S+L R+ +
Sbjct: 145 PKRVVIPVCVDSTQEEALSVLKSLKIIEDDVEANELCTRREFARWLVKLNSSLERNPKHR 204
Query: 374 VYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLN----EEPGPIFF 429
+ P + + AFDDI+ +DPDF SIQ LAEAG+I SKLS + + + I F
Sbjct: 205 IAPIVSLSGSVFTAFDDISIDDPDFRSIQVLAEAGVIPSKLSWNNSFDYGGFDTQQNINF 264
Query: 430 LPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQ 489
P+ +SRQDL+ W+ LE +I + +G++D+ +I A + D+ AG++
Sbjct: 265 FPDRFISRQDLIDWRAQLEYDFFSGVVDQISIKKAGYMDVKEIISSA---VYVDMLAGDK 321
Query: 490 GIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSE--HS 547
I+ FG ++ FQP+KP T AQA VAL G +A++ EL RIEAE++A A +E S
Sbjct: 322 SILRKVFGQSKRFQPNKPSTKAQAVVALTGGRMKEAISAELLRIEAENSARLAEAEEIRS 381
Query: 548 ALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREV--DKIA--LMKE 603
L++ +I ++++L+ E+ + VE++ + LE E E+ DKI+ +KE
Sbjct: 382 ELLSR--GDIQRFWDEKLNEEKNRGFDVERLYHMEVKNLE----EEEINQDKISAEYLKE 435
Query: 604 RAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYEL 663
+AA++ + ++L L++EV+E E + +V E+ + L + E +++E+ + L
Sbjct: 436 KAAMDCQKQLLLNLKKEVDEISEKVALERVTYVDERHVVQKLLGDLELKHEELLNTKSTL 495
Query: 664 EVERKALSMARAWAEDEAKRAREQAKALE--GARDRWERQG 702
E E++AL + R+W EDEA+R++ +A LE G R +W+ Q
Sbjct: 496 EAEKEALQILRSWVEDEARRSQARAAVLEEVGRRWKWDDQA 536
>gi|11994759|dbj|BAB03088.1| unnamed protein product [Arabidopsis thaliana]
Length = 567
Score = 226 bits (575), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 141/384 (36%), Positives = 218/384 (56%), Gaps = 1/384 (0%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVY 375
+V P VD Q +A++ L+ LK+ E D+ +LC +REYARWLV ++S L R+ M +
Sbjct: 180 RVATPVAVDAAQQEAIAVLKKLKIYEDDIVADELCTKREYARWLVRSNSLLERNPMHMIV 239
Query: 376 PAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPL 435
PA+ + + AFDDI DPDF IQ LAEAG+ SSKLS D N+ G F PES +
Sbjct: 240 PAVALAGSSIPAFDDINTSDPDFEYIQALAEAGITSSKLSGEDSRND-LGNSNFNPESFV 298
Query: 436 SRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALA 495
SR DLV+WK LE PE ++I +ID INPD D G++ I
Sbjct: 299 SRLDLVNWKAQLECGFHPEIMEEISRTKVDYIDTKNINPDMALGFFLDFLMGDKSTIRNV 358
Query: 496 FGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEK 555
FG + FQP++PVT AQAAVAL G+ A+ EL R+EAES ++ A +E +
Sbjct: 359 FGRIKRFQPNRPVTKAQAAVALTSGKMVKAITAELSRLEAESLSQKAETEEIRSELLEKG 418
Query: 556 EINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILS 615
EI + +++++ ER + +E++ E+E + +E +KE+AAI+ + ++L+
Sbjct: 419 EIRQFWDEKIQAERSRGFEMEELYLSRVNEVEEEKTTQEKWSAERLKEKAAIDCQKQLLN 478
Query: 616 KLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARA 675
L E++E + L+S+K E ++ + + +++ + + + LE E +AL + R+
Sbjct: 479 SLTEEIDEMSQRLISDKSVYLTEHSKLQEMLSDLQSKLESLIDKRSILEAEVEALRILRS 538
Query: 676 WAEDEAKRAREQAKALEGARDRWE 699
W EDE K ++ +AK LE A RW+
Sbjct: 539 WIEDEGKASQARAKVLEEAGRRWK 562
>gi|42565187|ref|NP_566775.2| uncharacterized protein [Arabidopsis thaliana]
gi|332643529|gb|AEE77050.1| uncharacterized protein [Arabidopsis thaliana]
Length = 558
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 141/384 (36%), Positives = 218/384 (56%), Gaps = 1/384 (0%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVY 375
+V P VD Q +A++ L+ LK+ E D+ +LC +REYARWLV ++S L R+ M +
Sbjct: 171 RVATPVAVDAAQQEAIAVLKKLKIYEDDIVADELCTKREYARWLVRSNSLLERNPMHMIV 230
Query: 376 PAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPL 435
PA+ + + AFDDI DPDF IQ LAEAG+ SSKLS D N+ G F PES +
Sbjct: 231 PAVALAGSSIPAFDDINTSDPDFEYIQALAEAGITSSKLSGEDSRND-LGNSNFNPESFV 289
Query: 436 SRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALA 495
SR DLV+WK LE PE ++I +ID INPD D G++ I
Sbjct: 290 SRLDLVNWKAQLECGFHPEIMEEISRTKVDYIDTKNINPDMALGFFLDFLMGDKSTIRNV 349
Query: 496 FGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEK 555
FG + FQP++PVT AQAAVAL G+ A+ EL R+EAES ++ A +E +
Sbjct: 350 FGRIKRFQPNRPVTKAQAAVALTSGKMVKAITAELSRLEAESLSQKAETEEIRSELLEKG 409
Query: 556 EINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILS 615
EI + +++++ ER + +E++ E+E + +E +KE+AAI+ + ++L+
Sbjct: 410 EIRQFWDEKIQAERSRGFEMEELYLSRVNEVEEEKTTQEKWSAERLKEKAAIDCQKQLLN 469
Query: 616 KLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARA 675
L E++E + L+S+K E ++ + + +++ + + + LE E +AL + R+
Sbjct: 470 SLTEEIDEMSQRLISDKSVYLTEHSKLQEMLSDLQSKLESLIDKRSILEAEVEALRILRS 529
Query: 676 WAEDEAKRAREQAKALEGARDRWE 699
W EDE K ++ +AK LE A RW+
Sbjct: 530 WIEDEGKASQARAKVLEEAGRRWK 553
>gi|357488241|ref|XP_003614408.1| hypothetical protein MTR_5g053260 [Medicago truncatula]
gi|355515743|gb|AES97366.1| hypothetical protein MTR_5g053260 [Medicago truncatula]
gi|388504036|gb|AFK40084.1| unknown [Medicago truncatula]
Length = 545
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 139/397 (35%), Positives = 229/397 (57%), Gaps = 13/397 (3%)
Query: 314 PGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSK 373
P +V +P D Q +ALS L+ LK++E DV+ +LC RR++ARWL+ +S+L R+ +
Sbjct: 154 PARVTIPVSADSTQEEALSVLKKLKIVEDDVEANELCTRRQFARWLIKLNSSLERNPKHR 213
Query: 374 VYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLN----EEPGPIFF 429
+ P + + D AFDDI+ +DPDF SIQ LAEAG++ SKLS ++ N E I F
Sbjct: 214 IAPIVSLSGSVDNAFDDISVDDPDFQSIQVLAEAGVVPSKLSWKNSSNGCRAEYKEDIIF 273
Query: 430 LPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQ 489
P+ +SRQDL+ W+ LE ++ + +G++D+ +I + D+ AG+
Sbjct: 274 FPDRFISRQDLMEWRTQLEYGFFFGIIDQVSIKKAGYMDVKEITSQG---VYLDMLAGDG 330
Query: 490 GIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSE--HS 547
I+ FG ++ FQP+KP T AQAAVAL G +A++ E+ R+EAE++A +E S
Sbjct: 331 SILRKVFGQSKRFQPNKPSTIAQAAVALTSGRMKEAISAEMSRLEAENSARQDETEEIRS 390
Query: 548 ALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAI 607
L++ +I + ++ ++S E+ VE++ EA L + +E +KE+AA+
Sbjct: 391 ELLSR--GDIQKFWDAKISEEKSHGSDVERLYLEAVNNLVEEKINQEKINADFLKEQAAM 448
Query: 608 ESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVER 667
+ ++L L+ EV+E E L +V EK+ + L ++ E ++++I + LE E+
Sbjct: 449 ACQKQMLLSLKEEVDEISEKLALERVIYVDEKQTVQKLLRDLEFKHEKILDTKSTLEAEK 508
Query: 668 KALSMARAWAEDEAKRAREQAKALE--GARDRWERQG 702
+AL M R W EDEA+R++ +A L G R +W+ Q
Sbjct: 509 EALQMLRTWVEDEARRSQARAAVLAEVGRRWKWDDQA 545
>gi|224111752|ref|XP_002315964.1| predicted protein [Populus trichocarpa]
gi|222865004|gb|EEF02135.1| predicted protein [Populus trichocarpa]
Length = 566
Score = 225 bits (573), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 146/397 (36%), Positives = 228/397 (57%), Gaps = 16/397 (4%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVY 375
+V V VD Q + L AL+ LK+IE DV +LC RREYARWL+ +S L R+ ++
Sbjct: 176 RVKVSVYVDSNQLETLLALKKLKIIEDDVAADELCTRREYARWLLRLNSMLERNQKHRIV 235
Query: 376 PAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRD-LLNEEPGPIF-FLPES 433
P++ + AFDD+ EDPDF SIQ LAE+G+I SKLS + + G F F PE
Sbjct: 236 PSISLSGSVIAAFDDLGVEDPDFESIQALAESGIIPSKLSGTNSCADSSDGRSFCFYPER 295
Query: 434 PLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIA 493
+SRQDL++WK LE LP +++ ++D+ +I+ DA P LL D+ AG++ II
Sbjct: 296 FISRQDLINWKAQLEYGFLPGITEQMSKTKVYYMDVKEISSDATPELLTDMLAGDKSIIR 355
Query: 494 LAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEV 553
FG +R FQP+KP+T AQAAVAL G S+AV E+ R+EAE + A V E+
Sbjct: 356 KVFGQSRRFQPNKPLTKAQAAVALISGRMSEAVYNEILRLEAEKSLRQAT------VKEI 409
Query: 554 EKE------INESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAI 607
E I ++++++ E+ + VEK+ A +LE + + +KE+AA+
Sbjct: 410 RNEFLERGDIKRFWDEKMNEEKIRGFEVEKLYIAALHDLEEEKIVQVKTYEEYLKEKAAM 469
Query: 608 ESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVER 667
+ + ++L L+ EV+E E L S + + E+ + L + + + + + + LE E
Sbjct: 470 DCQRQLLLHLKEEVDEMSERLASERSVYAAEQCNLQELLSKLQFKQEVMLDTKSILEAEI 529
Query: 668 KALSMARAWAEDEAKRAREQAKALE--GARDRWERQG 702
+AL + R+W EDEA++++ +A+ LE G R +W+ Q
Sbjct: 530 EALRILRSWVEDEARKSQARARVLEEVGRRWKWDNQA 566
>gi|359472711|ref|XP_003631189.1| PREDICTED: uncharacterized protein LOC100259365 [Vitis vinifera]
Length = 547
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 172/540 (31%), Positives = 271/540 (50%), Gaps = 45/540 (8%)
Query: 178 IFSDSSSISSSHAPIEPLAAVISVSSDTTVE-PQILPK-------GDTETVASPSTIKNV 229
IFS SS +SH P + S S+T+ E + P+ G V SP K
Sbjct: 27 IFSSSSPFINSHRFRNPRLCISSSVSETSFEVTWVSPERNASDDYGGWAVVESPCRKKKK 86
Query: 230 E---QSEKPLLSGEDSSSSMEVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFT 286
Q KP+ S +H++ + S +++ SN+S
Sbjct: 87 GFRLQFNKPMHS---------IHEIFVRTKTEAGQSNTVY-------------SNASDVD 124
Query: 287 ESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKP 346
+ + S + V SA+ +V VL+P D Q +AL L+ LK+IE DV
Sbjct: 125 TNIVEAGTESASNEIDEDVASASKKV--KHVLIPVAADSTQQEALLVLKKLKIIEDDVSA 182
Query: 347 GDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAE 406
+LC +REYARWLV A+ L R +++ + AFDD+ ED D+ SIQ LAE
Sbjct: 183 DELCTKREYARWLVRANLLLERDPRHRIFSSSLPSGSIISAFDDVNAEDRDYGSIQALAE 242
Query: 407 AGLISSKLS--HRDLLNEEP--GPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQ 462
AG+I SKLS L+ G ++F P+ +SRQDL++WK LE + +P +KI
Sbjct: 243 AGIIPSKLSGNSNSALDSSKVQGEVYFSPDRFISRQDLINWKAQLEYKFMPGIKEKISRT 302
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEA 522
F+D+ +I+ DA P D+ AG++ I+ FG ++ FQP+KP T AQ+AVAL G
Sbjct: 303 KVDFMDMKEISSDASPEFFIDMLAGDRSIVRKVFGQSKRFQPNKPSTKAQSAVALTSGRM 362
Query: 523 SDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEA 582
++ ++ EL R+EAE + A +E +I + +++ E+ + VEK A
Sbjct: 363 TEVIHTELLRLEAEKLSREAEAEEIRSQLLNRGDIQSFWSEKIKDEKIRGFEVEKDYLAA 422
Query: 583 RQELERLRAEREVDKIAL---MKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEK 639
+LE ER V L +KE+AA+E + ++L +L+ EV+E E L + E+
Sbjct: 423 VSDLEE---ERIVHVNCLTENLKEKAAMECQSQLLFRLKDEVDEMSERLACERTGYMAEQ 479
Query: 640 ERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWE 699
+ + E +N+ + + ++ LE E++AL + R+W EDEA++ + +AK LE RW+
Sbjct: 480 RNLQDMLNELQNKQEGVLDVKSILEAEKEALRILRSWVEDEARKNQARAKVLEEVGRRWK 539
>gi|297737877|emb|CBI27078.3| unnamed protein product [Vitis vinifera]
Length = 576
Score = 222 bits (566), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 140/390 (35%), Positives = 219/390 (56%), Gaps = 10/390 (2%)
Query: 317 VLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
VL+P D Q +AL L+ LK+IE DV +LC +REYARWLV A+ L R +++
Sbjct: 182 VLIPVAADSTQQEALLVLKKLKIIEDDVSADELCTKREYARWLVRANLLLERDPRHRIFS 241
Query: 377 AMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLS--HRDLLNEEP--GPIFFLPE 432
+ AFDD+ ED D+ SIQ LAEAG+I SKLS L+ G ++F P+
Sbjct: 242 SSLPSGSIISAFDDVNAEDRDYGSIQALAEAGIIPSKLSGNSNSALDSSKVQGEVYFSPD 301
Query: 433 SPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGII 492
+SRQDL++WK LE + +P +KI F+D+ +I+ DA P D+ AG++ I+
Sbjct: 302 RFISRQDLINWKAQLEYKFMPGIKEKISRTKVDFMDMKEISSDASPEFFIDMLAGDRSIV 361
Query: 493 ALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAE 552
FG ++ FQP+KP T AQ+AVAL G ++ ++ EL R+EAE + A +E
Sbjct: 362 RKVFGQSKRFQPNKPSTKAQSAVALTSGRMTEVIHTELLRLEAEKLSREAEAEEIRSQLL 421
Query: 553 VEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIAL---MKERAAIES 609
+I + +++ E+ + VEK A +LE ER V L +KE+AA+E
Sbjct: 422 NRGDIQSFWSEKIKDEKIRGFEVEKDYLAAVSDLEE---ERIVHVNCLTENLKEKAAMEC 478
Query: 610 EMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKA 669
+ ++L +L+ EV+E E L + E+ + + E +N+ + + ++ LE E++A
Sbjct: 479 QSQLLFRLKDEVDEMSERLACERTGYMAEQRNLQDMLNELQNKQEGVLDVKSILEAEKEA 538
Query: 670 LSMARAWAEDEAKRAREQAKALEGARDRWE 699
L + R+W EDEA++ + +AK LE RW+
Sbjct: 539 LRILRSWVEDEARKNQARAKVLEEVGRRWK 568
>gi|326510461|dbj|BAJ87447.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 548
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 147/437 (33%), Positives = 231/437 (52%), Gaps = 39/437 (8%)
Query: 275 CDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSAL 334
D++E N +S P SS + IPA V ++ VD + +ALS L
Sbjct: 133 VDVDERNDTS-----PNDSSQN--HIPAGGV----------RISFTVPVDPMHEEALSIL 175
Query: 335 QVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPE 394
+ L++IE D GD C RRE+ARW V S L R M ++ P + + AFDD+ +
Sbjct: 176 KKLQIIENDASSGDFCTRREFARWFVKLCSKLERKRMHRIIPNLITSGSVESAFDDVNFD 235
Query: 395 DPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPI-------FFLPESPLSRQDLVSWKMAL 447
DPDF IQ L E+G++ SKLS FLPES LSR DLV+WK+ +
Sbjct: 236 DPDFLYIQSLGESGIVPSKLS--SFFGTSTNGYQSANRNSNFLPESYLSRFDLVNWKLLV 293
Query: 448 EKRQLPEANKKILYQLSGFIDIDKINPDAWP----ALLADLTAGEQGIIALAFGCTRLFQ 503
E E ++K+L + ++ ++ AWP ++L DL G+ I++ FG TR Q
Sbjct: 294 EYPFASELDQKMLSK-----NVHTLDLSAWPDVTASVLTDLFDGDHNIVSKVFGNTRRLQ 348
Query: 504 PDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEV--EKEINESF 561
KPVT AQAA AL G + V +EL R+EAE+ ++ +S ++ E+ +I +
Sbjct: 349 HHKPVTKAQAAAALTSGRMEEVVRDELNRLEAEN--QSRLSVMGEMMEELINRGDIKHYW 406
Query: 562 EKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREV 621
E ++ E++ VEK ++ EL R ++E + L+KE++A+E + + L LR EV
Sbjct: 407 EDKMKKEQDHGFEVEKHLQDVLHELANERTDQEKEIADLLKEKSALERQNQELVCLRSEV 466
Query: 622 EEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEA 681
+ + L + +E+ ++E + L + +++Q + + LE E++AL+M R+W E EA
Sbjct: 467 DGMYDRLATQSLEVMADEENLEKLSSDMSSKHQAVTEAKSYLEAEKEALTMLRSWVEQEA 526
Query: 682 KRAREQAKALEGARDRW 698
R E+A+ LE A RW
Sbjct: 527 ARVHERAEVLERAVRRW 543
>gi|255566819|ref|XP_002524393.1| conserved hypothetical protein [Ricinus communis]
gi|223536354|gb|EEF38004.1| conserved hypothetical protein [Ricinus communis]
Length = 564
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 152/419 (36%), Positives = 235/419 (56%), Gaps = 26/419 (6%)
Query: 305 VVSAALQVLPGKVLVPA------------VVDQVQGQALSALQVLKVIEADVKPGDLCIR 352
VVS + PG+ L PA +VD Q +ALS L+ LK+IE DV+ +LC R
Sbjct: 151 VVSEFIPEAPGEALAPASSQRLERDKVAVLVDSNQLEALSVLKKLKIIEDDVRADELCTR 210
Query: 353 REYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISS 412
REYARWLV +S L R+ ++ + + AFDD++ EDPDF SIQ LAEAG I S
Sbjct: 211 REYARWLVRLNSLLERNPKHRI-ACLSLCGSILAAFDDVSVEDPDFDSIQALAEAGFIPS 269
Query: 413 KLSHRDLLNEEPG---PIFFLPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDI 469
K+S ++ F PE +SRQD+++WK LE + LP +++ ++D+
Sbjct: 270 KISGSHCCSDTSKGDESFCFHPERFISRQDMINWKAQLEYQFLPRITEQMSRIRVDYMDM 329
Query: 470 DKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEE 529
I+ +A L DL A ++ II FG +R FQP+KP+T AQAAVAL G S+AV E
Sbjct: 330 KDISSEASSEFLIDLLAADKSIIRKVFGQSRRFQPNKPLTKAQAAVALISGRMSEAVYNE 389
Query: 530 LQRIEAE-SAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELER 588
+ R+EA+ S+ + A+ E + + E + +I + ++ S ER + V+K+ +LE+
Sbjct: 390 ILRVEADNSSRQAALKEIRSELLE-KGDIERFWREKNSEERTRGLEVQKLYVTVLHDLEQ 448
Query: 589 LRAEREVDKIAL---MKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINML 645
E+ V AL +KERAA++ + ++L L+ EV+E E L S + E+ + L
Sbjct: 449 ---EKTVQLKALAEYLKERAAMDCQRQLLLHLKEEVDEMSERLTSERAMYVAEQGNLQEL 505
Query: 646 RKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALE--GARDRWERQG 702
E + + + + LE E +A+ + R+W EDEA++++ +AK LE G R +W+ Q
Sbjct: 506 LGELQARQEGMLDKKCVLEAEIEAIRILRSWVEDEARKSQARAKVLEEVGRRWKWDNQA 564
>gi|147860148|emb|CAN78724.1| hypothetical protein VITISV_020007 [Vitis vinifera]
Length = 836
Score = 216 bits (549), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 143/400 (35%), Positives = 223/400 (55%), Gaps = 20/400 (5%)
Query: 317 VLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
VL+P D Q +AL L+ LK+IE DV +LC +REYARWLV A+ L R +++
Sbjct: 408 VLIPVAADSTQQEALLVLKKLKIIEDDVSADELCTKREYARWLVRANLLLERDPRHRIFS 467
Query: 377 AMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLS--HRDLLNEEP--GPIFFLPE 432
+ AFDD+ ED D+ SIQ LAEAG+I SKLS L+ G ++F P+
Sbjct: 468 SSLPSGSIISAFDDVNAEDRDYGSIQALAEAGIIPSKLSGNSNSALDSSKVQGEVYFSPD 527
Query: 433 SPLSRQDLVSWKMALEKRQLPEANKKIL------YQLS----GFIDIDKINPDAWPALLA 482
+SRQDL++WK LE + +P +KIL Q+S F+D+ +I+ DA P
Sbjct: 528 RFISRQDLINWKAQLEYKFMPGIKEKILKPDLFTVQISRTKVDFMDMKEISSDASPEFFI 587
Query: 483 DLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENA 542
D+ AG++ I+ FG ++ FQP+KP T AQ+AVAL G ++ ++ EL R+EAE + A
Sbjct: 588 DMLAGDRSIVRKVFGQSKRFQPNKPSTKAQSAVALTSGRMTEVIHTELLRLEAEKLSREA 647
Query: 543 VSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIAL-- 600
+E +I + +++ E+ + VEK A +LE ER V L
Sbjct: 648 EAEEIRSQLLNRGDIQSFWSEKIKDEKIRGFEVEKDYLAAVSDLEE---ERIVHVNCLTE 704
Query: 601 -MKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARL 659
+KE+AA+E + ++L +L+ EV+E E L + E+ + + E +N+ + + +
Sbjct: 705 NLKEKAAMECQSQLLFRLKDEVDEMSERLACERTGYMAEQRNLQDMLNELQNKQEGVLDV 764
Query: 660 QYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWE 699
+ LE E++AL + R+W EDEA++ + +AK LE RW+
Sbjct: 765 KSILEAEKEALRILRSWVEDEARKNQARAKVLEEVGRRWK 804
>gi|357124400|ref|XP_003563888.1| PREDICTED: uncharacterized protein LOC100834778 [Brachypodium distachyon]
Length = 561
Score = 206 bits (525), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 134/392 (34%), Positives = 215/392 (54%), Gaps = 22/392 (5%)
Query: 319 VPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAM 378
+PA D + ++LS L+ L++IE D + C RRE+ARW V S L R ++ P +
Sbjct: 171 IPA--DPMHEESLSILKKLQIIENDAGSSEFCTRREFARWFVKLCSRLERKRRHRIIPNL 228
Query: 379 YIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH------RDLLNEEPGPIFFLPE 432
I + AFDD+ +D DF IQ L E+G++ SKLS D L+ F P+
Sbjct: 229 LICGSVESAFDDVNLDDSDFLYIQSLGESGIVPSKLSSFCGTFTSDSLSANRNA-NFQPD 287
Query: 433 SPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWP----ALLADLTAGE 488
S LSR DLV+WK+ +E E ++K+ + ++ ++ AWP ++L DL G+
Sbjct: 288 SYLSRLDLVNWKVLVEHPFASELDQKMPSK-----NVHTLDLSAWPDVSASILTDLIGGD 342
Query: 489 QGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSA 548
II+ FG TR Q KPVT AQAA AL G + + +EL+R+E E+ E+ +S
Sbjct: 343 HSIISKVFGNTRRLQHHKPVTKAQAAAALTSGRMEEVIRDELKRLEVEN--ESRLSVMGE 400
Query: 549 LVAEV--EKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAA 606
++ E+ +I + ++ ++ E++ VEK ++ EL R +RE + L+KER A
Sbjct: 401 MMEELIERGDIRQYWDCKMKREQDCGLEVEKHLQDVFHELANERTDREKELAVLLKERTA 460
Query: 607 IESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVE 666
+E + + L LR EV+ + L + +EI +++ + L + +++Q + + LE E
Sbjct: 461 LEHQNQELVSLRSEVDSMYDRLANESIEIMADEQNLEKLSSDMSSKHQAVTEAKSYLEAE 520
Query: 667 RKALSMARAWAEDEAKRAREQAKALEGARDRW 698
++AL+M R+W E EA R E+AK LE A RW
Sbjct: 521 KEALTMLRSWVETEAARVHERAKVLEKAVRRW 552
>gi|218198247|gb|EEC80674.1| hypothetical protein OsI_23089 [Oryza sativa Indica Group]
Length = 579
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 142/393 (36%), Positives = 216/393 (54%), Gaps = 18/393 (4%)
Query: 317 VLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
VL A VD + +A S L+ L++IE D D C RRE+ARW + S L R M ++ P
Sbjct: 189 VLFRAPVDPMHEEAFSILKKLQIIEKDASSSDFCSRREFARWFIKLHSKLERKKMHRIIP 248
Query: 377 AMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH-----RDLLNEEPGPIFFLP 431
AFDDI +DPDF IQ L E+G++SSKLS+ + + G FLP
Sbjct: 249 NRLTFGSVRSAFDDIDADDPDFLYIQSLGESGIVSSKLSNFLGTSTSGSSSDSGNSNFLP 308
Query: 432 ESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGI 491
S LSR DLV+WK +E E ++K+L + +D+ + PD ++L DL GEQ I
Sbjct: 309 NSYLSRFDLVNWKALVEHPFATELDQKMLSKNVRILDL-RAWPDVPSSILIDLMGGEQSI 367
Query: 492 IALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVA 551
I+ FG TR QP KPVT AQAA AL G + + +EL R+EAE+ ++ +V +
Sbjct: 368 ISKVFGNTRCLQPHKPVTKAQAAAALTSGRMEEVIRDELNRLEAENQSQLSV------MG 421
Query: 552 EVEKE------INESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERA 605
E+ +E I +E ++ +E + V+K + QEL + +RE + L+KER
Sbjct: 422 EIMEELINRGDIKRYWEDKMKVEEIREVAVDKQLQHVLQELANEKTDREKELAVLLKERT 481
Query: 606 AIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEV 665
A+E + + L LR E++ + L +E+ E++ + L + ++Q ++ + LE
Sbjct: 482 ALEHQNQELMNLRSEIDGMYDRLAMESLEVMTEEQNLEKLSLDVNRKHQAVSESKSYLEA 541
Query: 666 ERKALSMARAWAEDEAKRAREQAKALEGARDRW 698
E++AL+M R+W E+EA R E+A+ LE A RW
Sbjct: 542 EKEALTMLRSWVEEEAARVHERAEVLERAVRRW 574
>gi|52076485|dbj|BAD45364.1| unknown protein [Oryza sativa Japonica Group]
Length = 564
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 130/395 (32%), Positives = 203/395 (51%), Gaps = 37/395 (9%)
Query: 317 VLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
VL A VD + +A S L+ L++IE D D C RRE+ARW M ++ P
Sbjct: 189 VLFRAPVDPMHEEAFSILKKLQIIEKDASSSDFCSRREFARW----------KKMHRIIP 238
Query: 377 AMYIENVTDLAFDDI--TPEDPDFSSIQGLAEAGLISSKLSH-----RDLLNEEPGPIFF 429
L F + +D D L ++SSKLS+ + + G F
Sbjct: 239 -------NRLTFGSVRSAFDDIDADDPDFLYIQCIVSSKLSNFLGTSTSGSSSDSGNSNF 291
Query: 430 LPESPLSRQDLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQ 489
LP S LSR DLV+WK +E E ++K+L + +D+ + PD ++L DL GEQ
Sbjct: 292 LPNSYLSRFDLVNWKALVEHPFATELDQKMLSKNVRILDL-RAWPDVPSSILVDLMGGEQ 350
Query: 490 GIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSAL 549
II+ FG TR QP KPVT AQAA AL G + + +EL R+EAE+ ++ +V
Sbjct: 351 SIISKVFGNTRCLQPHKPVTKAQAAAALTSGRMEEVIRDELNRLEAENQSQLSV------ 404
Query: 550 VAEVEKE------INESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKE 603
+ E+ +E I +E ++ +E + V+K + QEL + +RE + L+KE
Sbjct: 405 MGEIMEELINRGDIKRYWEDKMKVEEIREVAVDKQLQHVLQELANEKTDREKELAVLLKE 464
Query: 604 RAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYEL 663
R A+E + + L LR E++ + L +E+ E++ + L + ++Q ++ + L
Sbjct: 465 RTALEHQNQELMNLRSEIDGMYDRLAMESLEVMTEEQNLEKLSLDVNRKHQAVSESKSYL 524
Query: 664 EVERKALSMARAWAEDEAKRAREQAKALEGARDRW 698
E E++AL+M R+W E+EA R E+A+ LE A RW
Sbjct: 525 EAEKEALTMLRSWVEEEAARVHERAEVLERAVRRW 559
>gi|414874061|tpg|DAA52618.1| TPA: hypothetical protein ZEAMMB73_607077 [Zea mays]
Length = 486
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 79/125 (63%), Positives = 97/125 (77%), Gaps = 1/125 (0%)
Query: 279 ESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLK 338
E+ + +PP SSP GIPAPSVVS ALQV G ++VPA VD Q A++ALQ+LK
Sbjct: 358 ENQNKQLESTPPDQYFSSP-GIPAPSVVSTALQVPAGPIVVPASVDPTQENAIAALQILK 416
Query: 339 VIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDF 398
VIE+ + G+LC RREYARWLV+AS+ L+R+T SKVYPAMYI+NVT+LAFDD+TPEDPDF
Sbjct: 417 VIESSAQAGELCTRREYARWLVAASNCLSRNTFSKVYPAMYIDNVTELAFDDVTPEDPDF 476
Query: 399 SSIQG 403
IQG
Sbjct: 477 PFIQG 481
>gi|222635638|gb|EEE65770.1| hypothetical protein OsJ_21451 [Oryza sativa Japonica Group]
Length = 532
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 172/309 (55%), Gaps = 18/309 (5%)
Query: 401 IQGLAEAGLISSKLSH-----RDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEA 455
IQ L E+G++SSKLS+ + + G FLP S LSR DLV+WK +E E
Sbjct: 226 IQSLGESGIVSSKLSNFLGTSTSGSSSDSGNSNFLPNSYLSRFDLVNWKALVEHPFATEL 285
Query: 456 NKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAV 515
++K+L + +D+ + PD ++L DL GEQ II+ FG TR QP KPVT AQAA
Sbjct: 286 DQKMLSKNVRILDL-RAWPDVPSSILVDLMGGEQSIISKVFGNTRCLQPHKPVTKAQAAA 344
Query: 516 ALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKE------INESFEKELSMER 569
AL G + + +EL R+EAE+ ++ +V + E+ +E I +E ++ +E
Sbjct: 345 ALTSGRMEEVIRDELNRLEAENQSQLSV------MGEIMEELINRGDIKRYWEDKMKVEE 398
Query: 570 EKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLM 629
+ V+K + QEL + +RE + L+KER A+E + + L LR E++ + L
Sbjct: 399 IREVAVDKQLQHVLQELANEKTDREKELAVLLKERTALEHQNQELMNLRSEIDGMYDRLA 458
Query: 630 SNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAK 689
+E+ E++ + L + ++Q ++ + LE E++AL+M R+W E+EA R E+A+
Sbjct: 459 MESLEVMTEEQNLEKLSLDVNRKHQAVSESKSYLEAEKEALTMLRSWVEEEAARVHERAE 518
Query: 690 ALEGARDRW 698
LE A RW
Sbjct: 519 VLERAVRRW 527
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 14/106 (13%)
Query: 317 VLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
VL A VD + +A S L+ L++IE D D C RRE+ARW + S L R M ++ P
Sbjct: 72 VLFRAPVDPMHEEAFSILKKLQIIEKDASSSDFCSRREFARWFIKLHSKLERKKMHRIIP 131
Query: 377 AMYIENVTDLAFDDITPEDPDFSS-------IQGLAEAGLISSKLS 415
L F + D + IQ L E+G++SSKLS
Sbjct: 132 -------NRLTFGSVRSAFDDIDADDPDFLYIQSLGESGIVSSKLS 170
>gi|427709384|ref|YP_007051761.1| S-layer protein [Nostoc sp. PCC 7107]
gi|427361889|gb|AFY44611.1| S-layer domain-containing protein [Nostoc sp. PCC 7107]
Length = 458
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 82/185 (44%), Positives = 106/185 (57%), Gaps = 10/185 (5%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+PG + RR YARWLV+A++ + + +K T AF D+ DPDF +IQG
Sbjct: 269 FEPGKIITRRMYARWLVAANNAMYPNNSAKQI--RLAAETTQPAFSDVAKTDPDFPAIQG 326
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI S LS + F P++PL+R+ L+ WK+ L+ RQ LP AN + Q
Sbjct: 327 LAEAGLIPSSLSGDSTT------VLFRPDAPLTREQLILWKVPLDTRQALPTANLDAVKQ 380
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGE 521
GF D+ KI+P A A+LAD + EQ I FG T LFQP KPVT A+AA AL G
Sbjct: 381 TWGFQDVGKIDPKALRAILADFPSAEQSNIRRVFGYTTLFQPKKPVTRAEAAAALWYFGT 440
Query: 522 ASDAV 526
D V
Sbjct: 441 VGDGV 445
>gi|186685810|ref|YP_001869006.1| S-layer protein [Nostoc punctiforme PCC 73102]
gi|186468262|gb|ACC84063.1| S-layer domain protein [Nostoc punctiforme PCC 73102]
Length = 464
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 89/211 (42%), Positives = 118/211 (55%), Gaps = 21/211 (9%)
Query: 329 QALSALQVL----KVIEAD-------VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPA 377
Q L+AL VL KV +++ +PG + REYARWL++A++ + S +K
Sbjct: 251 QDLAALGVLSLEPKVTKSNSTTTNNQFEPGKIVTHREYARWLIAANNAMYASNPAKQI-- 308
Query: 378 MYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSR 437
T F D+T +DPDF +IQGLAEAGLI S LS + F P++PL+R
Sbjct: 309 RLASESTQPIFSDVTAKDPDFPAIQGLAEAGLIPSPLSGDSTA------VLFRPDAPLTR 362
Query: 438 QDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAF 496
+ L+ WK L+ RQ LP AN + Q GF D +I+P A A+LAD GEQ I F
Sbjct: 363 EQLLLWKSPLDTRQALPSANLDTVKQTWGFQDAARIDPKALRAVLADYQNGEQSNIRRVF 422
Query: 497 GCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
G T LFQP KPVT A+AA+ L G S+ V
Sbjct: 423 GYTTLFQPKKPVTRAEAAMTLWYFGSQSEGV 453
>gi|257060199|ref|YP_003138087.1| S-layer protein [Cyanothece sp. PCC 8802]
gi|256590365|gb|ACV01252.1| S-layer domain protein [Cyanothece sp. PCC 8802]
Length = 406
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 87/216 (40%), Positives = 120/216 (55%), Gaps = 19/216 (8%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEAD---VKPGDLCIRREYARWLVSASSTLTRSTMS 372
+VL P + D L+ L +L +D KP RREYARWLV+A + +
Sbjct: 188 EVLRPYIED------LARLGILTANNSDNNQFKPNQTITRREYARWLVNAKNKFYEKSPE 241
Query: 373 KVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPE 432
K + + N + AF D++ DPDF IQGLAEAGLI S+L+ + F P+
Sbjct: 242 KQI-RLGVNN-SQPAFSDVSSSDPDFGVIQGLAEAGLIPSRLTGNSSAS------LFRPD 293
Query: 433 SPLSRQDLVSWKMALEK-RQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGI 491
+PL+R DL++WK+ L+ + LP+A+ + + GF D +I+P A AL AD +GEQG
Sbjct: 294 APLTRSDLIAWKVPLDTGKGLPQASIDAIKETWGFQDTTQIDPQALRALYADFQSGEQGN 353
Query: 492 IALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
+ FG T LFQP KPVT AQAA AL G D +
Sbjct: 354 VRRVFGYTTLFQPKKPVTRAQAAAALWYFGYQGDGL 389
>gi|218247128|ref|YP_002372499.1| S-layer protein [Cyanothece sp. PCC 8801]
gi|218167606|gb|ACK66343.1| S-layer domain protein [Cyanothece sp. PCC 8801]
Length = 406
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 87/216 (40%), Positives = 119/216 (55%), Gaps = 19/216 (8%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEAD---VKPGDLCIRREYARWLVSASSTLTRSTMS 372
+VL P + D L+ L +L +D KP RREYARWLV+A + +
Sbjct: 188 EVLRPYIED------LARLGILTANNSDNNQFKPNQTITRREYARWLVNAKNKFYEKSPE 241
Query: 373 KVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPE 432
K + + N + AF D++ DPDF IQGLAEAGLI S+L+ + F P
Sbjct: 242 KQI-RLGVNN-SQPAFSDVSSSDPDFGVIQGLAEAGLIPSRLTGNSSAS------LFRPN 293
Query: 433 SPLSRQDLVSWKMALEK-RQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGI 491
+PL+R DL++WK+ L+ + LP+A+ + + GF D +I+P A AL AD +GEQG
Sbjct: 294 APLTRSDLIAWKVPLDTGKGLPQASIDAIKETWGFQDTTQIDPQALRALYADFQSGEQGN 353
Query: 492 IALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
+ FG T LFQP KPVT AQAA AL G D +
Sbjct: 354 VRRVFGYTTLFQPKKPVTRAQAAAALWYFGYQGDGL 389
>gi|298490038|ref|YP_003720215.1| S-layer protein ['Nostoc azollae' 0708]
gi|298231956|gb|ADI63092.1| S-layer domain-containing protein ['Nostoc azollae' 0708]
Length = 461
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 77/177 (43%), Positives = 108/177 (61%), Gaps = 11/177 (6%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDL-AFDDITPEDPDFSSIQ 402
+P + RREYARWLV+A++T+ + K + + + D AF D+ P+DPDF ++Q
Sbjct: 275 FQPNKIITRREYARWLVAANNTMYANNPGK---QIRLASGNDQPAFRDVLPKDPDFLTVQ 331
Query: 403 GLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILY 461
GLAEAGLI S LS + + F P++PL+R+ L+ WK+ L+ RQ LP AN + +
Sbjct: 332 GLAEAGLIPSSLSG------DTTAVLFRPDAPLTREQLLLWKVPLDTRQALPAANLEAVK 385
Query: 462 QLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALA 518
Q GF D +KI+P A A+LAD +Q I FG T LFQP K VT A+A AL+
Sbjct: 386 QTWGFQDTEKIDPKALRAILADFQGAQQSNIRRVFGYTTLFQPKKAVTRAEAGAALS 442
>gi|254416435|ref|ZP_05030188.1| S-layer domain protein [Coleofasciculus chthonoplastes PCC 7420]
gi|196176873|gb|EDX71884.1| S-layer domain protein [Coleofasciculus chthonoplastes PCC 7420]
Length = 445
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 77/170 (45%), Positives = 101/170 (59%), Gaps = 9/170 (5%)
Query: 349 LCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAG 408
+ RR+YARWL+ ++ + ++ K T AF D+ DPDF +IQGLAEAG
Sbjct: 264 IITRRQYARWLIETNNRIYENSPGKQI--RLASETTQPAFQDVPASDPDFGAIQGLAEAG 321
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFI 467
LI S LS + F P++PL+R++LV WK+ L+ RQ LP+A+ + Q GF
Sbjct: 322 LIPSPLSGNST------EVLFRPDAPLTRENLVLWKVPLDIRQGLPQASLDAVQQTWGFQ 375
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
D KI+P A A+LAD GEQ II FG T LFQP KPVT A+AA AL
Sbjct: 376 DAGKIDPKALRAVLADFQNGEQSIIRRVFGYTTLFQPQKPVTQAEAAAAL 425
>gi|434406901|ref|YP_007149786.1| putative S-layer protein [Cylindrospermum stagnale PCC 7417]
gi|428261156|gb|AFZ27106.1| putative S-layer protein [Cylindrospermum stagnale PCC 7417]
Length = 452
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 82/194 (42%), Positives = 112/194 (57%), Gaps = 13/194 (6%)
Query: 329 QALSALQVL----KVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVT 384
Q L+AL + K + + +PG + REYA WLV+A++ + + +K
Sbjct: 247 QDLAALGIFSQDSKATKNNFEPGKIITHREYAHWLVAANNAMNANNPAKQI--RLASETA 304
Query: 385 DLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWK 444
AF D++ +DPDF++IQGLAEAGLI S LS + F P++PL+R+ L+ WK
Sbjct: 305 QPAFSDVSAKDPDFAAIQGLAEAGLIPSALSGDSTA------VLFRPDAPLTREQLLLWK 358
Query: 445 MALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQ 503
+ L+ RQ LP AN + Q GF D+ KI+P A A+LAD GEQ I FG T LFQ
Sbjct: 359 IPLDTRQALPAANLDAVKQTWGFQDVGKIDPKALRAVLADFQNGEQSNIRRVFGYTTLFQ 418
Query: 504 PDKPVTNAQAAVAL 517
P KPVT +AA AL
Sbjct: 419 PKKPVTRGEAAAAL 432
>gi|428223591|ref|YP_007107688.1| S-layer protein [Geitlerinema sp. PCC 7407]
gi|427983492|gb|AFY64636.1| S-layer domain-containing protein [Geitlerinema sp. PCC 7407]
Length = 479
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 109/189 (57%), Gaps = 12/189 (6%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSK-VYPAMYIENVTDLAFDDITPEDPDFSSIQ 402
KP RRE+ARWL A++ L + ++ + PA+ + AF DI P+DPDF +IQ
Sbjct: 263 FKPNATLTRREFARWLFRANNALYANQPARQIRPAV---GTSTPAFQDIRPQDPDFGAIQ 319
Query: 403 GLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILY 461
GLAEAG+I S L+ + G F PE PLSR+ L+ WK+ L+ RQ LP N + L
Sbjct: 320 GLAEAGIIPSPLAG------DAGATTFRPEVPLSRETLLLWKVPLDSRQGLPAPNLETLK 373
Query: 462 QLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALA-IG 520
Q GF D KI+P AL+AD +G+Q + AFG T LFQP K VT A+AA L G
Sbjct: 374 QTWGFQDAAKIDPRVQRALIADYQSGDQSNVRRAFGYTTLFQPKKLVTRAEAAAVLGYFG 433
Query: 521 EASDAVNEE 529
D ++ +
Sbjct: 434 TQGDGLSAQ 442
>gi|119511050|ref|ZP_01630170.1| S-layer region-like protein [Nodularia spumigena CCY9414]
gi|119464301|gb|EAW45218.1| S-layer region-like protein [Nodularia spumigena CCY9414]
Length = 458
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/213 (41%), Positives = 112/213 (52%), Gaps = 41/213 (19%)
Query: 329 QALSALQVLKVIEADVK------------PGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
Q L+AL VL + +VK P RREYARWLV+A++
Sbjct: 242 QDLAALGVLSLKSEEVKSSSNDAISKSFEPSKNITRREYARWLVAANN------------ 289
Query: 377 AMYIENV----------TDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGP 426
AMYI N T AF D++ D DF IQGLAEAGLI S LS +
Sbjct: 290 AMYINNPAKQIRLASESTQSAFSDVSKTDVDFPVIQGLAEAGLIPSPLSG------DSTA 343
Query: 427 IFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLT 485
+ F P++PL+R+ L+ WK+ L+ RQ LP AN + + Q GF D KI+P A A+LAD
Sbjct: 344 VLFRPDAPLTREQLILWKIPLDTRQALPAANLEAVNQTWGFQDTGKIDPKALRAVLADFQ 403
Query: 486 AGEQGIIALAFGCTRLFQPDKPVTNAQAAVALA 518
EQ I FG T LFQP KPV+ A+AA AL
Sbjct: 404 NSEQSNIRRVFGYTTLFQPKKPVSRAEAAAALG 436
>gi|427718126|ref|YP_007066120.1| S-layer protein [Calothrix sp. PCC 7507]
gi|427350562|gb|AFY33286.1| S-layer domain-containing protein [Calothrix sp. PCC 7507]
Length = 456
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 107/187 (57%), Gaps = 13/187 (6%)
Query: 346 PGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLA 405
P + RREYA WLV+A++ + + +K T AF D+ +DP+F +IQGLA
Sbjct: 269 PAKIITRREYASWLVNANNAMYANNPAKQI--RLASTSTQPAFSDVPAKDPNFPAIQGLA 326
Query: 406 EAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLS 464
EAGLI S LS + F P++PL+R+ LV WK+ L+ RQ LP AN + Q
Sbjct: 327 EAGLIPSSLSGDSTA------VLFRPDAPLTREQLVLWKLPLDTRQALPTANLDAVKQTW 380
Query: 465 GFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAI----G 520
GF D+ KI+P A A+LAD GEQ I FG T LFQP KPVT A+AA AL G
Sbjct: 381 GFQDVGKIDPKALRAVLADFQNGEQANIRRVFGYTTLFQPKKPVTRAEAAAALWYFGTQG 440
Query: 521 EASDAVN 527
E AV+
Sbjct: 441 EGISAVD 447
>gi|428781166|ref|YP_007172952.1| S-layer protein [Dactylococcopsis salina PCC 8305]
gi|428695445|gb|AFZ51595.1| putative S-layer protein [Dactylococcopsis salina PCC 8305]
Length = 403
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 81/207 (39%), Positives = 115/207 (55%), Gaps = 25/207 (12%)
Query: 331 LSALQVLKVIEADVK---PGDLCIRREYARWLVSASSTLTRSTMSKVYP------AMYIE 381
++ L VL IE + K P RR +ARWL A + + YP +E
Sbjct: 203 VAKLGVLSAIEPESKQFSPNAEITRRTFARWLFQAHN--------RFYPDRPSQQIRSVE 254
Query: 382 NVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLV 441
AF DI+P DPDF IQGLAEAG+I S+L+ + F P++PL R+ L+
Sbjct: 255 QAETPAFQDISPNDPDFKIIQGLAEAGIIPSRLTDDATITR------FRPDAPLKRETLL 308
Query: 442 SWKMALE-KRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTR 500
WK+ L+ +R LPEA+ K + + GF D +I+ A+ A++AD + GE+ II +G T+
Sbjct: 309 LWKIPLDTRRNLPEADVKTVQETWGFQDASEIDSKAFGAVVADFSQGERSIIRRLYGYTQ 368
Query: 501 LFQPDKPVTNAQAAVAL-AIGEASDAV 526
L QP+KPVT AQAA AL + G D +
Sbjct: 369 LLQPNKPVTRAQAAAALWSFGTQGDII 395
>gi|428320136|ref|YP_007118018.1| S-layer domain-containing protein [Oscillatoria nigro-viridis PCC
7112]
gi|428243816|gb|AFZ09602.1| S-layer domain-containing protein [Oscillatoria nigro-viridis PCC
7112]
Length = 466
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 98/251 (39%), Positives = 135/251 (53%), Gaps = 21/251 (8%)
Query: 278 NESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVL 337
N S SS + PT S+S P PS S + V + +P + Q L+ L+VL
Sbjct: 202 NNSESSPSPTASPTPSNSE-KTTPTPSASSKSANVSEPESQIPQQLRQYVAD-LTQLEVL 259
Query: 338 KVIEAD----------VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLA 387
KV A KP + RREYARWLV+A++ + S +K + ++ A
Sbjct: 260 KVRSAQSANLETGSTLPKPNKIITRREYARWLVAANNQIYASRQAKQI--RLAVDSSEPA 317
Query: 388 FDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMAL 447
F D+ DPDFS+IQGLAEAG+I S LS E + F P++PL+R+ ++ WK+ L
Sbjct: 318 FSDVPKTDPDFSAIQGLAEAGVIPSSLSG------ETKDVKFRPDAPLTRETMILWKVPL 371
Query: 448 EKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDK 506
+ RQ LP AN + + + GF D KI+ A A+LAD G+ I FG T LFQP K
Sbjct: 372 DSRQVLPTANIEGVKEKWGFQDASKIDSQASRAVLADFNNGDLANIRRVFGFTTLFQPKK 431
Query: 507 PVTNAQAAVAL 517
PVT A+AA +L
Sbjct: 432 PVTRAEAAASL 442
>gi|75909391|ref|YP_323687.1| S-layer region-like protein [Anabaena variabilis ATCC 29413]
gi|75703116|gb|ABA22792.1| S-layer region-like protein [Anabaena variabilis ATCC 29413]
Length = 445
Score = 132 bits (333), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 111/189 (58%), Gaps = 13/189 (6%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+PG + RREYARWLV+A++ + + +K T AF D++ +D DF +IQG
Sbjct: 255 FEPGRIITRREYARWLVNANNAMYANNSAKQI--RLAGESTQAAFTDVSSQDADFPAIQG 312
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI S LS + + F P++PL+R+ L+ WK+ L+ RQ LP AN + + +
Sbjct: 313 LAEAGLIPSPLSG------DATAVLFRPDAPLTREQLILWKVPLDTRQALPNANLEAVKE 366
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAI--- 519
GF D KI+P A A+LAD GEQ I FG T LFQP KPVT A+AA AL
Sbjct: 367 TWGFQDAGKIDPKALRAVLADFQNGEQANIRRIFGYTTLFQPKKPVTRAEAAGALWYFGS 426
Query: 520 -GEASDAVN 527
GE AV+
Sbjct: 427 QGEGISAVD 435
>gi|307154449|ref|YP_003889833.1| S-layer protein [Cyanothece sp. PCC 7822]
gi|306984677|gb|ADN16558.1| S-layer domain protein [Cyanothece sp. PCC 7822]
Length = 404
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 117/205 (57%), Gaps = 13/205 (6%)
Query: 329 QALSALQVLKVIEAD-VKPGDLCIRREYARWLVSASSTLTRSTMSK-VYPAMYIENVTDL 386
Q L++L VL + + P RR++ARWL +A++ + + K + P +
Sbjct: 197 QDLASLGVLSADKGNQFNPNAPITRRDFARWLYNANNKIFANAAGKQIRPG---STSSQS 253
Query: 387 AFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMA 446
AF D+ P DPDF IQGLAEAGLI S L+ + + F P +PL+R++LV WK+
Sbjct: 254 AFTDVKPNDPDFPIIQGLAEAGLIPSPLTG------DSNALLFRPNAPLTREELVQWKVP 307
Query: 447 LEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
L+ R+ LP A+ + + + GF D++KINP A +L AD GEQ I FG T LFQP
Sbjct: 308 LDSRKGLPTASMESVKETWGFQDLNKINPLALRSLYADYQNGEQANIKRVFGYTTLFQPK 367
Query: 506 KPVTNAQAAVAL-AIGEASDAVNEE 529
KPVT A+AA AL G D ++ +
Sbjct: 368 KPVTRAEAAAALWYFGYQGDGMSAQ 392
>gi|443326660|ref|ZP_21055306.1| putative S-layer protein [Xenococcus sp. PCC 7305]
gi|442793716|gb|ELS03157.1| putative S-layer protein [Xenococcus sp. PCC 7305]
Length = 479
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/296 (34%), Positives = 150/296 (50%), Gaps = 25/296 (8%)
Query: 247 EVHDLNKNGSSGTSVSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSP---AGIPAP 303
E+ D N++ SS T + + PF + DL E N S E P + P A IP
Sbjct: 188 ELLDTNEDNSSQTKIDIAYKPFIPNLSSTDL-EPNIES--EVLPNSAEDYPEATAVIPEL 244
Query: 304 SVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVK-------PGDLCIRREYA 356
+ S A+Q A V + QA+ + L ++ K P +L RR+Y
Sbjct: 245 ATGSDAIQTTTTNFADLAEVREQLQQAVQDVAALGILTPQTKDSPPQLAPNELITRRDYV 304
Query: 357 RWLVSASSTLT-RSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLS 415
RWLVSA++ S +K++ + + AF DI DPDF IQGLAEAGLI S
Sbjct: 305 RWLVSANNKFHENSPGNKIH---LTKKTSQTAFKDIDINDPDFGEIQGLAEAGLIPS--- 358
Query: 416 HRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINP 474
+L + + F P++ L+R+DL+SWK+ L+ R LP+A+ + + + GF D+ KI+
Sbjct: 359 ---ILTSDSNNVLFRPDAALTREDLISWKVPLDLRAALPKASIETIEETWGFQDVAKIDS 415
Query: 475 DAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAVNEE 529
A AL +D G++ + FG T LFQP K VT A+AA +L G D ++ E
Sbjct: 416 QAIRALFSDYQNGDRSNVRRVFGFTTLFQPKKGVTRAEAAASLWYFGYQDDGISAE 471
>gi|218441526|ref|YP_002379855.1| S-layer protein [Cyanothece sp. PCC 7424]
gi|218174254|gb|ACK72987.1| S-layer domain protein [Cyanothece sp. PCC 7424]
Length = 408
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/279 (34%), Positives = 141/279 (50%), Gaps = 34/279 (12%)
Query: 261 VSPSIFPFSNEKETCDLNESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVP 320
V+PS SN D ++N++S P S++ P P PS S L
Sbjct: 146 VTPSSGNSSNTDFVIDYEKNNNTS-----PQSSTNIPNSTPTPSQTSTNFSDL------- 193
Query: 321 AVVDQV---QGQALSALQVLKVIEAD----VKPGDLCIRREYARWLVSASSTLTRSTMSK 373
DQV + L L V+ AD P RR +ARWL +A++ + ++ K
Sbjct: 194 ---DQVPEPWRNNIKDLGTLGVLSADKGDQFNPNAAVTRRVFARWLYNANNKIFANSAGK 250
Query: 374 -VYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPE 432
+ P + AF DI P+DPDF +IQGLAEAG+I S+L+ + + F P+
Sbjct: 251 QIRPG---STNSQSAFQDINPKDPDFEAIQGLAEAGIIPSRLTG------DSSALLFRPD 301
Query: 433 SPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGI 491
+PL+R+DL+ WK+ L+ R+ LP A+ + + GF D KI P++ +L AD +Q
Sbjct: 302 APLTREDLLQWKVPLDTRKGLPTASIDSVKETWGFQDTSKIKPNSLRSLYADFQNADQAN 361
Query: 492 IALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAVNEE 529
+ FG T LFQP KPVT A+AA AL G D ++ +
Sbjct: 362 VRRVFGYTTLFQPTKPVTRAEAATALWYFGYQGDGMSAQ 400
>gi|428203600|ref|YP_007082189.1| putative S-layer protein [Pleurocapsa sp. PCC 7327]
gi|427981032|gb|AFY78632.1| putative S-layer protein [Pleurocapsa sp. PCC 7327]
Length = 412
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 114/203 (56%), Gaps = 13/203 (6%)
Query: 329 QALSALQVL--KVIEAD-VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTD 385
Q L+AL +L + D P + RREYARWLV+A++ L + K N ++
Sbjct: 198 QDLAALGILVSNQNQGDRFNPDTIITRREYARWLVAANNKLFANQPGKQL--RLATNTSE 255
Query: 386 LAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKM 445
F D+ DPDF++IQGLAEAG ISS L+ + + F P+SPL+R+DL++WK+
Sbjct: 256 PVFKDVPKNDPDFAAIQGLAEAGFISSTLTGDN------SAMLFRPDSPLTREDLIAWKV 309
Query: 446 ALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQP 504
AL+ R+ LP A+ + Q GF D KI+P A +L AD Q I FG T LFQP
Sbjct: 310 ALDHRKALPSASIDQIKQTWGFQDATKIDPKALRSLYADYQNSGQANIGRVFGSTTLFQP 369
Query: 505 DKPVTNAQAAVAL-AIGEASDAV 526
K VT A+AA AL G D +
Sbjct: 370 KKNVTRAEAAAALWYFGFQGDGI 392
>gi|425472481|ref|ZP_18851322.1| S-layer region-like [Microcystis aeruginosa PCC 9701]
gi|389881430|emb|CCI38014.1| S-layer region-like [Microcystis aeruginosa PCC 9701]
Length = 387
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 107/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P D RRE+ARWL+ A++ + + K + N + AF
Sbjct: 179 QDLARLGVLTGNNNQFNPNDTITRREFARWLLQANNAIYANVAGKQI-RLATPNSSP-AF 236
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 237 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSTLLFRPNAPLTREDLIAWKVPLD 290
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 291 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 350
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 351 VTRAEAAATLWYFGFQGDGV 370
>gi|334119580|ref|ZP_08493665.1| S-layer domain-containing protein [Microcoleus vaginatus FGP-2]
gi|333457742|gb|EGK86363.1| S-layer domain-containing protein [Microcoleus vaginatus FGP-2]
Length = 466
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 132/251 (52%), Gaps = 21/251 (8%)
Query: 278 NESNSSSFTESPPTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVL 337
N S SS+ + PT S+S P PS + QV +P + Q LS L+ L
Sbjct: 202 NNSESSASPTASPTPSNSE-GTTPTPSASPKSAQVSQLDTQIPQQLRQYVAD-LSQLEAL 259
Query: 338 KVIEADV----------KPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLA 387
KV + KP + RREYARWLV+A++ + S +K + ++ A
Sbjct: 260 KVRSTESANLETASTLPKPNKIVTRREYARWLVAANNQIYASRQAKQI--RLAVDSSEPA 317
Query: 388 FDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMAL 447
F DI DPDFS+IQGLAEAG+I S LS E + F P++PL+R+ ++ WK+ L
Sbjct: 318 FSDIPKSDPDFSAIQGLAEAGVIPSSLSG------ETKDVKFRPDAPLTRETMILWKVPL 371
Query: 448 EKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDK 506
+ RQ LP AN + + GF D KI+ A A+LAD G+ I FG T LFQP K
Sbjct: 372 DTRQVLPTANIDGVKEKWGFQDASKIDSQASRAVLADFNNGDLANIRRVFGFTTLFQPKK 431
Query: 507 PVTNAQAAVAL 517
VT A+AA +L
Sbjct: 432 SVTRAEAAASL 442
>gi|425441873|ref|ZP_18822140.1| S-layer region-like [Microcystis aeruginosa PCC 9717]
gi|389717284|emb|CCH98606.1| S-layer region-like [Microcystis aeruginosa PCC 9717]
Length = 400
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 107/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P D RRE+ARWL+ A++ + + K + N + AF
Sbjct: 192 QDLARLGVLTGNNNQFNPNDTITRREFARWLLQANNAIYANVAGKQI-RLATPNSSP-AF 249
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 250 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSALLFRPNAPLTREDLIAWKVPLD 303
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 304 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 363
Query: 508 VTNAQAAVAL-AIGEASDAV 526
V A+AA AL G D V
Sbjct: 364 VNRAEAAAALWYFGFQGDGV 383
>gi|17231127|ref|NP_487675.1| hypothetical protein all3635 [Nostoc sp. PCC 7120]
gi|17132768|dbj|BAB75334.1| all3635 [Nostoc sp. PCC 7120]
Length = 400
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 110/189 (58%), Gaps = 13/189 (6%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+PG + RREYARWLV+A++ + + +K T AF D++ +D DF +IQG
Sbjct: 210 FEPGKIITRREYARWLVNANNAMYANNSAKQI--RLAGESTQAAFTDVSSQDADFPAIQG 267
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI S LS + + F P++PL+R+ L+ WK+ L+ RQ LP AN + + +
Sbjct: 268 LAEAGLIPSPLSG------DATAVLFRPDAPLTREQLILWKVPLDTRQALPNANLEAVKE 321
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAI--- 519
GF D KI+P A A+LAD EQ I FG T LFQP KPVT A+AA AL
Sbjct: 322 TWGFQDAGKIDPKALRAVLADFQNSEQANIRRIFGYTTLFQPKKPVTRAEAAGALWYFGS 381
Query: 520 -GEASDAVN 527
GE AV+
Sbjct: 382 QGEGISAVD 390
>gi|422304813|ref|ZP_16392152.1| S-layer region-like [Microcystis aeruginosa PCC 9806]
gi|389789968|emb|CCI14091.1| S-layer region-like [Microcystis aeruginosa PCC 9806]
Length = 396
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 107/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P D RRE+ARWL+ A++ + + K + N + AF
Sbjct: 188 QDLARLGVLTGNNNQFNPNDTITRREFARWLLQANNAIYANVAGKQI-RLATPNSSP-AF 245
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 246 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSALLFRPNAPLTREDLIAWKVPLD 299
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 300 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 359
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 360 VTRAEAAATLWYFGFQGDGV 379
>gi|16330480|ref|NP_441208.1| hypothetical protein slr2000 [Synechocystis sp. PCC 6803]
gi|383322221|ref|YP_005383074.1| hypothetical protein SYNGTI_1312 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325390|ref|YP_005386243.1| hypothetical protein SYNPCCP_1311 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491274|ref|YP_005408950.1| hypothetical protein SYNPCCN_1311 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436541|ref|YP_005651265.1| hypothetical protein SYNGTS_1312 [Synechocystis sp. PCC 6803]
gi|451814638|ref|YP_007451090.1| hypothetical protein MYO_113240 [Synechocystis sp. PCC 6803]
gi|1652971|dbj|BAA17888.1| slr2000 [Synechocystis sp. PCC 6803]
gi|339273573|dbj|BAK50060.1| hypothetical protein SYNGTS_1312 [Synechocystis sp. PCC 6803]
gi|359271540|dbj|BAL29059.1| hypothetical protein SYNGTI_1312 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274710|dbj|BAL32228.1| hypothetical protein SYNPCCN_1311 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277880|dbj|BAL35397.1| hypothetical protein SYNPCCP_1311 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958401|dbj|BAM51641.1| hypothetical protein BEST7613_2710 [Synechocystis sp. PCC 6803]
gi|451780607|gb|AGF51576.1| hypothetical protein MYO_113240 [Synechocystis sp. PCC 6803]
Length = 321
Score = 129 bits (323), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 108/178 (60%), Gaps = 12/178 (6%)
Query: 352 RREYARWLVSASSTLTRSTMSK-VYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLI 410
R E+ARWL A++ + SK + PA N T L F D+ P PDF+ IQGLA+AGLI
Sbjct: 118 RGEFARWLFQANNVFFANQPSKQIRPAP--SNATPL-FTDVPPTHPDFAQIQGLADAGLI 174
Query: 411 SSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDI 469
S L++ +P F P++PL+R+DLV WK+ L++R+ LP A+ + + + GF D
Sbjct: 175 PSSLTN------DPTASQFRPDAPLTREDLVRWKVPLDQRRALPNASLENIKETWGFQDA 228
Query: 470 DKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
KINP+ WPAL AD G+Q + FG +FQP +PVT +AA AL +G +D +
Sbjct: 229 AKINPNIWPALAADFQNGDQANLKRVFGYITIFQPQRPVTRGEAASALWFLGVQTDGL 286
>gi|440682095|ref|YP_007156890.1| S-layer domain-containing protein [Anabaena cylindrica PCC 7122]
gi|428679214|gb|AFZ57980.1| S-layer domain-containing protein [Anabaena cylindrica PCC 7122]
Length = 458
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 76/175 (43%), Positives = 100/175 (57%), Gaps = 9/175 (5%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+P + RREYARWLV+A++ + + +K AF D+ +DP F SIQG
Sbjct: 272 FEPNKIITRREYARWLVAANNAMYSNNPAKK--VRLASESNQPAFRDVLAKDPYFPSIQG 329
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI S LS + + F P++PL+R+ L+ WK+ L+ RQ LP AN + + Q
Sbjct: 330 LAEAGLIPSSLSG------DATAVLFRPDAPLTREQLLLWKVPLDTRQALPSANLEAVKQ 383
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
GF D +KI P A A+LAD EQ I FG T LFQP K VT A+A AL
Sbjct: 384 TWGFQDTEKIEPKALKAVLADFQNAEQSNIRRVFGYTTLFQPKKAVTRAEAGAAL 438
>gi|166365868|ref|YP_001658141.1| S-layer protein [Microcystis aeruginosa NIES-843]
gi|425466482|ref|ZP_18845780.1| S-layer region-like [Microcystis aeruginosa PCC 9809]
gi|166088241|dbj|BAG02949.1| S-layer region-like precursor [Microcystis aeruginosa NIES-843]
gi|389830979|emb|CCI26645.1| S-layer region-like [Microcystis aeruginosa PCC 9809]
Length = 400
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 106/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P D RRE+ARWL+ A++ + + K + N + AF
Sbjct: 192 QDLARLGVLTGNNNQFNPNDTITRREFARWLLQANNAIYANVAGKQI-RLATPNSSP-AF 249
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 250 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSALLFRPNAPLTREDLIAWKVPLD 303
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 304 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 363
Query: 508 VTNAQAAVAL-AIGEASDAV 526
V A+AA L G D V
Sbjct: 364 VNRAEAAATLWYFGFQGDGV 383
>gi|434398236|ref|YP_007132240.1| S-layer domain-containing protein [Stanieria cyanosphaera PCC 7437]
gi|428269333|gb|AFZ35274.1| S-layer domain-containing protein [Stanieria cyanosphaera PCC 7437]
Length = 447
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 85/233 (36%), Positives = 128/233 (54%), Gaps = 14/233 (6%)
Query: 318 LVPAVVDQVQGQALSALQVLKVIEAD-VKPGDLCIRREYARWLVSASSTLTRSTMSKVYP 376
L P + D + L+A +EA+ P + RR++ARWLV A++ + +
Sbjct: 224 LRPYIEDLAKLGILTAYSKEGKVEANKFAPNEPITRRDFARWLVEANNQFHGNAAGE--- 280
Query: 377 AMYIENVTDL-AFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPL 435
+++ +D AF DI+ DPDF IQGLAEAGLI S +L + + F P++PL
Sbjct: 281 KIHLATKSDRPAFQDISVNDPDFEIIQGLAEAGLIPS------MLTDNSSKLLFQPDAPL 334
Query: 436 SRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIAL 494
+R+DL++WK+ L+ R+ LP A+ + Q GF D I+P A AL AD G+ +
Sbjct: 335 TREDLLTWKVPLDLRKNLPTASIDAIKQSWGFQDTANISPQALQALFADFQNGDNSNMKR 394
Query: 495 AFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAVNEELQRIEAESAAENAVSEH 546
FG T LFQP KPVT A+AA +L G D + L+ ++ ES ++AV+
Sbjct: 395 VFGYTTLFQPKKPVTRAEAAASLWYFGFQGDGIT-ALEVVKGESINQSAVNSQ 446
>gi|434395295|ref|YP_007130242.1| S-layer domain-containing protein [Gloeocapsa sp. PCC 7428]
gi|428267136|gb|AFZ33082.1| S-layer domain-containing protein [Gloeocapsa sp. PCC 7428]
Length = 460
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/215 (40%), Positives = 114/215 (53%), Gaps = 26/215 (12%)
Query: 329 QALSALQVLKVIEADVK---------PGDLCIRREYARWLVSASSTLTRSTMSKVYPAMY 379
Q LSAL VL ++A K P RREYARWLV+A++ PA
Sbjct: 236 QDLSALGVLP-LQATSKSNSATNQFEPNKTITRREYARWLVAANNRFYTDN-----PAKQ 289
Query: 380 IENV---TDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLS 436
I T AF D+ DPDF+ IQGLAEAGLI S LS + F P++PL+
Sbjct: 290 IREASASTQPAFQDVPASDPDFAVIQGLAEAGLIPSPLSGSSTT------VLFRPDAPLT 343
Query: 437 RQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALA 495
R+ ++ WK+ ++ R LP A+ + Q GF D KI P A A+LAD ++ I
Sbjct: 344 REQMILWKVPIDTRSSLPNASVDAVQQTWGFQDAAKIEPRALQAVLADFQNSDRANIRRV 403
Query: 496 FGCTRLFQPDKPVTNAQAAVAL-AIGEASDAVNEE 529
FG T LFQP K VT A+AA AL IG AS+ V+ +
Sbjct: 404 FGYTTLFQPKKTVTRAEAAAALWYIGTASEGVSAQ 438
>gi|390439590|ref|ZP_10227976.1| S-layer region-like [Microcystis sp. T1-4]
gi|389836986|emb|CCI32100.1| S-layer region-like [Microcystis sp. T1-4]
Length = 387
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 106/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P D RRE+ARWL+ A++ + + K + N + AF
Sbjct: 179 QDLARLGVLTGNNNQFNPNDTITRREFARWLLQANNAIYANVAGKQI-RLATPNSSP-AF 236
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ D D+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 237 SDVKNNDTDYIYIQGLAEAGLIPSPLTG------DSSSLLFRPNAPLTREDLIAWKVPLD 290
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 291 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 350
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 351 VTRAEAAATLWYFGFQGDGV 370
>gi|428310556|ref|YP_007121533.1| S-layer protein [Microcoleus sp. PCC 7113]
gi|428252168|gb|AFZ18127.1| putative S-layer protein [Microcoleus sp. PCC 7113]
Length = 479
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 77/170 (45%), Positives = 98/170 (57%), Gaps = 15/170 (8%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIENVTDL---AFDDITPEDPDFSSIQGLAEAG 408
RRE+ARWLV+A++ + + P I ++ AF D+ D DFSSIQ LAEAG
Sbjct: 279 RREFARWLVAANNQIFAN-----RPGQQIRLASETSQPAFGDVPRSDRDFSSIQALAEAG 333
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFI 467
LI S LS F P++PL+R+ LV WK+ L+ RQ LP A+ L Q GF
Sbjct: 334 LIPSSLSGDSTA------ALFRPDAPLTRETLVVWKVPLDTRQTLPTASLDTLKQTWGFQ 387
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
D+ KI+P W AL+AD GEQ I FG T LFQP K VT A+AA A+
Sbjct: 388 DVAKIDPKTWRALVADFQNGEQSNIRRVFGYTTLFQPKKTVTRAEAATAV 437
>gi|425446334|ref|ZP_18826342.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9443]
gi|389733490|emb|CCI02772.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9443]
Length = 404
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 82/200 (41%), Positives = 108/200 (54%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + N + AF
Sbjct: 196 QDLARLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQI-RLATANSSP-AF 253
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + I F P +PL+R+DL++WK+ L+
Sbjct: 254 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSAILFRPNAPLTREDLIAWKVPLD 307
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 308 VRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 367
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA AL G D V
Sbjct: 368 VTRAEAAAALWYFGFQGDGV 387
>gi|425461054|ref|ZP_18840534.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9808]
gi|389826143|emb|CCI23562.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9808]
Length = 404
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 106/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + AF
Sbjct: 196 QDLAKLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQIRVATAN--SSPAF 253
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + I F P +PL+R+DL++WK+ L+
Sbjct: 254 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSAILFRPNAPLTREDLIAWKVPLD 307
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 308 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 367
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA AL G D V
Sbjct: 368 VTRAEAAAALWYFGFQGDGV 387
>gi|354565006|ref|ZP_08984182.1| S-layer domain-containing protein [Fischerella sp. JSC-11]
gi|353550132|gb|EHC19571.1| S-layer domain-containing protein [Fischerella sp. JSC-11]
Length = 474
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/175 (44%), Positives = 103/175 (58%), Gaps = 9/175 (5%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+P RREYARWLV+A++ + + SK + N AF D+ DPDFS+IQG
Sbjct: 283 FEPNKNITRREYARWLVAANNAMYANIPSKRI-RLASANAQP-AFSDVPKTDPDFSAIQG 340
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAE+GL S LL+ + + F P++PL+R+ L+ K+ L+ RQ LP A+ + Q
Sbjct: 341 LAESGLFPS------LLSGDSTQVLFRPDAPLTREQLLMSKVPLDTRQALPTASVSAVSQ 394
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
GF D KI+P A A+LAD GEQ I +G T LFQP KPVT A+AA AL
Sbjct: 395 TWGFQDAAKIDPKALRAILADFQNGEQSNIRRVYGYTTLFQPKKPVTRAEAAAAL 449
>gi|414075582|ref|YP_006994900.1| S-layer protein [Anabaena sp. 90]
gi|413968998|gb|AFW93087.1| S-layer domain-containing protein [Anabaena sp. 90]
Length = 442
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 81/192 (42%), Positives = 109/192 (56%), Gaps = 11/192 (5%)
Query: 329 QALSALQVLKV--IEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDL 386
Q L+ L +L + + P + RREYARWLV+A++ + + +K + T
Sbjct: 239 QDLATLGILSIEPKTTEFLPDKIITRREYARWLVAANNAMYANNPAKQI--RLASSSTQP 296
Query: 387 AFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMA 446
AF DI +DPDF IQGLAEAGLI S LS + + F P++PL+R+ L+ WK+
Sbjct: 297 AFRDILTKDPDFPVIQGLAEAGLIPSALSG------DATAVLFRPDAPLTREQLILWKVP 350
Query: 447 LEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
L+ RQ LP A+ + + Q GF D I P A A+LAD EQ I FG T LFQP
Sbjct: 351 LDTRQALPAASLEAVKQTWGFQDAGGIEPKALKAVLADFQNAEQSNIRRVFGYTTLFQPK 410
Query: 506 KPVTNAQAAVAL 517
KPVT A+A+ AL
Sbjct: 411 KPVTRAEASAAL 422
>gi|428774912|ref|YP_007166699.1| S-layer protein [Halothece sp. PCC 7418]
gi|428689191|gb|AFZ42485.1| S-layer domain-containing protein [Halothece sp. PCC 7418]
Length = 419
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 110/201 (54%), Gaps = 9/201 (4%)
Query: 318 LVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPA 377
L P V D Q LS ++ P ++ RR +ARWL +A++ R S+
Sbjct: 209 LQPYVQDMAQLGLLSPVESDNAQGKQFAPNEVITRRTFARWLFNANNRFYRDRASQQI-- 266
Query: 378 MYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSR 437
++ AF DI+P DPDF IQGLAEAGLI S+L+ + F P++PL+R
Sbjct: 267 RRVQQAPTPAFTDISPSDPDFGIIQGLAEAGLIPSRLTGDSTVTR------FRPDAPLTR 320
Query: 438 QDLVSWKMALEKR-QLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAF 496
+ L+ WK+ L+ R LP A + + GF D KI+P A A++AD + GEQ + F
Sbjct: 321 ETLLLWKVPLDTRSNLPSATVNTVKETWGFQDAGKIDPKALGAIVADFSNGEQSTLRRVF 380
Query: 497 GCTRLFQPDKPVTNAQAAVAL 517
G T+L QP+K VT A+AA AL
Sbjct: 381 GYTQLLQPEKAVTRAEAAAAL 401
>gi|428299810|ref|YP_007138116.1| S-layer protein [Calothrix sp. PCC 6303]
gi|428236354|gb|AFZ02144.1| S-layer domain-containing protein [Calothrix sp. PCC 6303]
Length = 435
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/192 (41%), Positives = 108/192 (56%), Gaps = 17/192 (8%)
Query: 334 LQVLKVIEA------DVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIEN-VTDL 386
L L VI A ++P + R EYARWLV+ ++T S SK + + N +
Sbjct: 231 LATLGVIPAYSIQTNKLEPNKIITRGEYARWLVTVNNTFYASNPSK---QIRLGNESSQP 287
Query: 387 AFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMA 446
A+ D+ +P+F IQGLAEAGLI S+LS + + F SPL+R+ ++ WK+
Sbjct: 288 AYSDVAKNNPNFPYIQGLAEAGLIPSQLSG------DSTEVLFRAGSPLTREQMLLWKLP 341
Query: 447 LEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
L+ RQ LP AN + + Q GF D KI P A A+LAD +GEQ I FG T LFQP
Sbjct: 342 LDSRQGLPTANLEAVKQTWGFQDAAKIEPKALKAVLADHQSGEQSNIRRVFGYTTLFQPK 401
Query: 506 KPVTNAQAAVAL 517
KPVT ++AA L
Sbjct: 402 KPVTRSEAAAVL 413
>gi|427728254|ref|YP_007074491.1| putative S-layer protein [Nostoc sp. PCC 7524]
gi|427364173|gb|AFY46894.1| putative S-layer protein [Nostoc sp. PCC 7524]
Length = 457
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 111/195 (56%), Gaps = 14/195 (7%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+PG + RREYARWLV+A++ + + +K T F D++ +D DF +IQG
Sbjct: 268 FEPGKIITRREYARWLVAANNAMYANNPAKQI--RLASESTQPTFSDVSRQDLDFPAIQG 325
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI S LS + F P++PL+R+ ++ WK+ L+ RQ LP AN + + +
Sbjct: 326 LAEAGLIPSALSGDSTA------VLFRPDAPLTREQMLLWKVPLDTRQGLPAANLETVKE 379
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAA-VALAIGE 521
GF D KI+P A A+LAD GEQ I FG T L QP KPVT A+AA V G
Sbjct: 380 TWGFQDTGKIDPKALRAVLADFQNGEQSNIRRVFGYTTLLQPKKPVTRAEAAGVLWYFGT 439
Query: 522 ASDAVN----EELQR 532
+ ++ ++LQR
Sbjct: 440 QGEGISATEAQKLQR 454
>gi|434384486|ref|YP_007095097.1| putative S-layer protein [Chamaesiphon minutus PCC 6605]
gi|428015476|gb|AFY91570.1| putative S-layer protein [Chamaesiphon minutus PCC 6605]
Length = 413
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 104/184 (56%), Gaps = 9/184 (4%)
Query: 335 QVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPE 394
Q+ + A+ P RREYARWLV+A + +T S ++ + T AF D+
Sbjct: 218 QIGVLTSAESAPNRTVTRREYARWLVTAHNRITGSKPTQQVKLATTD--TKPAFQDVPST 275
Query: 395 DPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LP 453
+PDF SIQGLAEAGLI S LS + + F P++PL+R+ ++ WK+ L+ RQ LP
Sbjct: 276 NPDFPSIQGLAEAGLIPSPLSG------DATSVLFRPDTPLTREQMILWKVPLDTRQPLP 329
Query: 454 EANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQA 513
A+ + Q GF D KI+P A A+LAD GEQ I AFG T LFQP K V +
Sbjct: 330 TASLDAVKQTWGFQDAGKIDPKALRAVLADFQNGEQSNIRRAFGYTTLFQPKKTVNLGEV 389
Query: 514 AVAL 517
A +L
Sbjct: 390 ATSL 393
>gi|425435981|ref|ZP_18816423.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9432]
gi|389679397|emb|CCH91817.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9432]
Length = 404
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 108/200 (54%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + N + AF
Sbjct: 196 QDLARLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQI-RLATTNSSP-AF 253
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 254 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSALLFRPNAPLTREDLIAWKVPLD 307
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 308 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 367
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA AL G D V
Sbjct: 368 VTRAEAAAALWYFGFQGDGV 387
>gi|282898950|ref|ZP_06306932.1| S-layer region protein-like protein [Cylindrospermopsis raciborskii
CS-505]
gi|281196090|gb|EFA71005.1| S-layer region protein-like protein [Cylindrospermopsis raciborskii
CS-505]
Length = 453
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/185 (40%), Positives = 107/185 (57%), Gaps = 10/185 (5%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
++P + RRE+ARWLV+ ++ + + +K V+ F D+ P D DF IQG
Sbjct: 266 LEPNKIITRREFARWLVTGNNVMYANKQAKK--IRLPSPVSQPIFKDVPPTDVDFPFIQG 323
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI+S LS + + F P++PL+R++L+ WK+ L+ RQ LP A+ + + Q
Sbjct: 324 LAEAGLIASPLSG------DATELLFRPDAPLTRENLLMWKVPLDTRQSLPNASTEAVKQ 377
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALA-IGE 521
GF D KI+P A A+LAD GEQ I FG T LFQP K VT +AA+ ++ G
Sbjct: 378 TWGFQDTAKIDPKALRAVLADFQNGEQSNIRRVFGYTILFQPKKAVTRGEAALTISYFGS 437
Query: 522 ASDAV 526
+ V
Sbjct: 438 QGEGV 442
>gi|425456521|ref|ZP_18836229.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9807]
gi|389802360|emb|CCI18581.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 9807]
Length = 404
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 105/200 (52%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + AF
Sbjct: 196 QDLARLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQIRVATAN--SSPAF 253
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + I F P +PL+R+DL++WK+ L+
Sbjct: 254 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSAILFRPNAPLTREDLIAWKVPLD 307
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 308 VRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 367
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 368 VTRAEAAATLWYFGFQGDGV 387
>gi|425451147|ref|ZP_18830969.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 7941]
gi|389767720|emb|CCI06975.1| Similar to Q3M894_ANAVT S-layer region-like [Microcystis aeruginosa
PCC 7941]
Length = 404
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 107/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + N + AF
Sbjct: 196 QDLARLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQI-RLATPNSSP-AF 253
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 254 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSTLLFRPNAPLTREDLIAWKVPLD 307
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 308 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 367
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 368 VTRAEAAATLWYFGFQGDGV 387
>gi|443656045|ref|ZP_21131639.1| S-layer domain protein [Microcystis aeruginosa DIANCHI905]
gi|443333468|gb|ELS48025.1| S-layer domain protein [Microcystis aeruginosa DIANCHI905]
Length = 400
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 104/200 (52%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + F
Sbjct: 192 QDLAKLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQIRVATAN--SSPTF 249
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + I F P +PL+R+DL++WK+ L+
Sbjct: 250 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSAILFRPNAPLTREDLIAWKVPLD 303
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 304 VRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 363
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 364 VTRAEAAATLWYFGFQGDGV 383
>gi|159030618|emb|CAO88286.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 404
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 104/200 (52%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + F
Sbjct: 196 QDLAKLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQIRVATAN--SSPTF 253
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + I F P +PL+R+DL++WK+ L+
Sbjct: 254 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSAILFRPNAPLTREDLIAWKVPLD 307
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL AD EQ I FG T LFQP +P
Sbjct: 308 VRKALPSANLDTIKNTWGFQDTNKINPQLVRALYADFQNAEQANIRRVFGFTTLFQPKRP 367
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 368 VTRAEAAATLWYFGFQGDGV 387
>gi|428208318|ref|YP_007092671.1| S-layer protein [Chroococcidiopsis thermalis PCC 7203]
gi|428010239|gb|AFY88802.1| S-layer domain-containing protein [Chroococcidiopsis thermalis PCC
7203]
Length = 481
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/202 (39%), Positives = 109/202 (53%), Gaps = 26/202 (12%)
Query: 331 LSALQVLKVIEADVK-----------PGDLCIRREYARWLVSASSTLTRSTMSKVYPAMY 379
L+AL VL + + K P + RRE+ARWL A++ + PA+
Sbjct: 248 LAALGVLPLEPTNAKSSQADPVRQFNPSKIVSRREFARWLFEANNRI-----QATRPALQ 302
Query: 380 IENVTDLA---FDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLS 436
I + A F D+ DPDFS IQGLAEAGLI S LS + F P++PL+
Sbjct: 303 IRAASAAAQPAFRDVPRNDPDFSVIQGLAEAGLIPSSLSGDGTA------VLFRPDAPLT 356
Query: 437 RQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALA 495
R+ L+ WK+ L+ RQ LP A+ + + + GF D +I+P A A++AD GEQ I
Sbjct: 357 REQLILWKVPLDTRQALPNASLEAVKESWGFQDAARIDPKALRAVIADFQNGEQSNIRRV 416
Query: 496 FGCTRLFQPDKPVTNAQAAVAL 517
FG T L QP KPVT A+AA A+
Sbjct: 417 FGYTTLLQPKKPVTRAEAASAI 438
>gi|332711549|ref|ZP_08431480.1| hypothetical protein LYNGBM3L_64220 [Moorea producens 3L]
gi|332349527|gb|EGJ29136.1| hypothetical protein LYNGBM3L_64220 [Moorea producens 3L]
Length = 511
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 102/179 (56%), Gaps = 21/179 (11%)
Query: 346 PGDLCIRREYARWLVSASSTLTRSTMSKVY---PAMYIE---NVTDLAFDDITPEDPDFS 399
P RR+YARWL+ A++ ++Y P + I N T AF D+ DPDF
Sbjct: 312 PNQTINRRQYARWLMDANN--------RIYANRPGLQIRLASNTTQPAFQDVPRTDPDFP 363
Query: 400 SIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKK 458
IQGLA+AGLI S LS + F P++PL+R++L+ WK+ L+ R+ LP+A+ +
Sbjct: 364 YIQGLADAGLIPSPLSGDSTA------VLFRPDAPLTRENLMLWKVPLDTRKALPKASIE 417
Query: 459 ILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
+ + GF D +I+P A A+LAD G+ I FG T LFQP KPVT A+A AL
Sbjct: 418 AVKETWGFKDTAEIDPKALRAVLADFRNGDLANIRRVFGYTILFQPKKPVTRAEAGAAL 476
>gi|440756978|ref|ZP_20936178.1| S-layer domain protein [Microcystis aeruginosa TAIHU98]
gi|440173007|gb|ELP52491.1| S-layer domain protein [Microcystis aeruginosa TAIHU98]
Length = 400
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 106/200 (53%), Gaps = 10/200 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L VL P + RRE+ARWL+ A++ + + K + N + AF
Sbjct: 192 QDLAKLGVLTGNNNQFNPNNTITRREFARWLLQANNAIYANVAGKQI-RLATPNSSP-AF 249
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPD+ IQGLAEAGLI S L+ + + F P +PL+R+DL++WK+ L+
Sbjct: 250 SDVKNNDPDYIYIQGLAEAGLIPSPLTG------DSSALLFRPNAPLTREDLIAWKVPLD 303
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP AN + GF D +KINP AL D EQ I FG T LFQP +P
Sbjct: 304 IRKALPSANLDTIKNTWGFQDTNKINPQLVRALYVDFQNAEQANIRRVFGFTTLFQPKRP 363
Query: 508 VTNAQAAVAL-AIGEASDAV 526
VT A+AA L G D V
Sbjct: 364 VTRAEAAATLWYFGFQGDGV 383
>gi|15450984|gb|AAK96763.1| Unknown protein [Arabidopsis thaliana]
gi|20148715|gb|AAM10248.1| unknown protein [Arabidopsis thaliana]
Length = 322
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/147 (46%), Positives = 90/147 (61%), Gaps = 1/147 (0%)
Query: 316 KVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVY 375
+V P VD Q +A++ L+ LK+ E D+ +LC +REYARWLV ++S L R+ M +
Sbjct: 171 RVATPVAVDAAQQEAIAVLKKLKIYEDDIVADELCTKREYARWLVRSNSLLERNPMHMIV 230
Query: 376 PAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPL 435
PA+ + + AFDDI DPDF IQ LAEAG+ SSKLS D N+ G F PES +
Sbjct: 231 PAVALAGSSIPAFDDINTSDPDFEYIQALAEAGITSSKLSGEDSRNDL-GNSNFNPESFV 289
Query: 436 SRQDLVSWKMALEKRQLPEANKKILYQ 462
SR DLV+WK LE PE ++ YQ
Sbjct: 290 SRLDLVNWKAQLECGFHPEIMEESRYQ 316
>gi|119489552|ref|ZP_01622313.1| S-layer region-like protein [Lyngbya sp. PCC 8106]
gi|119454631|gb|EAW35778.1| S-layer region-like protein [Lyngbya sp. PCC 8106]
Length = 496
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/238 (34%), Positives = 124/238 (52%), Gaps = 29/238 (12%)
Query: 308 AALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLT 367
LQ PG+ P+ DQ+ P R E+ARWLV+A++
Sbjct: 280 GVLQSSPGRTTNPSNSDQII----------------TNPNSTITRGEFARWLVNANNNFY 323
Query: 368 RSTMSKVYPAMYIENVTDL-AFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGP 426
+T K + + +D F D++ P F IQGLAEAGLISS LS +
Sbjct: 324 SNTPPK---QIRLGVPSDQPVFTDVSTTHPYFPEIQGLAEAGLISSSLSG------DSAA 374
Query: 427 IFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLT 485
+ F P++PL+R+DL+ WK+ L+ RQ LP+A + + GF D KI+ ++ A+LAD
Sbjct: 375 VQFRPDAPLTREDLILWKVPLDTRQALPKATVDAVQERWGFQDTAKIDSNSLRAILADFD 434
Query: 486 AGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAVNEELQRIEAESAAENA 542
G+ I FG T LFQP KPVT A+AA AL G D ++ + + ++A++ +N+
Sbjct: 435 NGDNANIRRVFGFTTLFQPKKPVTRAEAASALWYFGFQGDGISAQ-EVLQAQNQTQNS 491
>gi|254421186|ref|ZP_05034904.1| hypothetical protein S7335_1336 [Synechococcus sp. PCC 7335]
gi|196188675|gb|EDX83639.1| hypothetical protein S7335_1336 [Synechococcus sp. PCC 7335]
Length = 374
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 98/177 (55%), Gaps = 11/177 (6%)
Query: 343 DVKPGDLCIRREYARWLVSASSTLTRSTMSK-VYPAMYIENVTDLAFDDITPEDPDFSSI 401
+ +P RREYARWL++A++ + T ++ + P + + F D+ D DF++I
Sbjct: 176 EFRPNQATTRREYARWLLAANNRFYQGTPNRRIRPGV---TSSQPVFQDVPVSDADFAAI 232
Query: 402 QGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKIL 460
QGLAEAG+I S L+ I F P++PL+R+DL+ WK+ L+ RQ LPEA +
Sbjct: 233 QGLAEAGIIPSSLTGSSTT------ITFRPDAPLTRKDLLLWKVPLDTRQPLPEATATAV 286
Query: 461 YQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
Q F D D I P AL+AD G+ I FG T LFQPDK T A+ A L
Sbjct: 287 RQAWSFQDTDTIEPRVAQALIADHQLGDFSNIRRTFGYTTLFQPDKAATRAETAAVL 343
>gi|428304474|ref|YP_007141299.1| S-layer protein [Crinalium epipsammum PCC 9333]
gi|428246009|gb|AFZ11789.1| S-layer domain-containing protein [Crinalium epipsammum PCC 9333]
Length = 466
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 69/166 (41%), Positives = 94/166 (56%), Gaps = 9/166 (5%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+P + RREY RWLV+A++ + ++ K T AF D+ DPDF +IQG
Sbjct: 262 FEPNKIISRREYVRWLVNANNQIYTNSPGKQI--RLASRDTQPAFQDVAKTDPDFPAIQG 319
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI S+LS + + F P +PL+R++L+ WK+ L+ R+ LP A+ + Q
Sbjct: 320 LAEAGLIPSRLSG------DSTAVLFRPNAPLTRENLILWKVPLDTREALPSASIDAVKQ 373
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPV 508
GF D KI P A A+LAD +Q I FG T LFQP KPV
Sbjct: 374 TWGFQDATKIEPKALKAVLADFQNSDQSNIRRVFGYTALFQPKKPV 419
>gi|300863982|ref|ZP_07108892.1| S-layer domain protein [Oscillatoria sp. PCC 6506]
gi|300338021|emb|CBN54038.1| S-layer domain protein [Oscillatoria sp. PCC 6506]
Length = 453
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 102/189 (53%), Gaps = 22/189 (11%)
Query: 346 PGDLCIRREYARWLVSASSTLTRSTMSKVY---PAMYIE---NVTDLAFDDITPEDPDFS 399
P RREYARWLV+A++ K+Y PA I T AF D+ DPDF
Sbjct: 268 PNKTITRREYARWLVAANN--------KIYANRPAKQIRLAITSTTAAFTDVPKTDPDFP 319
Query: 400 SIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKK 458
+IQGLAEAGLI S LS D N + F P PL+R ++ WK+ L+ RQ LP AN
Sbjct: 320 AIQGLAEAGLIPSSLSG-DTKN-----VKFRPNEPLTRAAMMLWKVPLDTRQALPSANID 373
Query: 459 ILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL- 517
+ + GF D KI+ A A+LAD G+ I FG T LFQP+K V+ A+AA +L
Sbjct: 374 AVKERWGFQDTSKIDSGAARAVLADFNNGDLANIRRVFGFTTLFQPNKSVSRAEAAASLW 433
Query: 518 AIGEASDAV 526
G D V
Sbjct: 434 YFGVEGDGV 442
>gi|443323742|ref|ZP_21052745.1| putative S-layer protein [Gloeocapsa sp. PCC 73106]
gi|442786528|gb|ELR96258.1| putative S-layer protein [Gloeocapsa sp. PCC 73106]
Length = 380
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 105/173 (60%), Gaps = 9/173 (5%)
Query: 346 PGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLA 405
P + RREYARWL+SA + L T S+ + + T AF+DI D DFS IQGLA
Sbjct: 197 PNRVITRREYARWLLSAHNLLYSDTPSQQISS--VSQATQPAFEDIPLTDSDFSIIQGLA 254
Query: 406 EAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLS 464
EAG+I S+L+ + + FLP++PL+R+DL++WK+ L+ R+ LP + + + +
Sbjct: 255 EAGIIPSRLTG------DSNALKFLPDTPLTREDLITWKVPLDYRKALPPVSIEDIRETW 308
Query: 465 GFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
GF D++ I+P AL D ++ + AFG T LFQP+KPVT ++AA L
Sbjct: 309 GFQDVNIIDPRVRQALYIDEQNSDRSNVRRAFGYTTLFQPNKPVTRSEAAAVL 361
>gi|354552013|ref|ZP_08971321.1| S-layer domain-containing protein [Cyanothece sp. ATCC 51472]
gi|353555335|gb|EHC24723.1| S-layer domain-containing protein [Cyanothece sp. ATCC 51472]
Length = 385
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 100/183 (54%), Gaps = 16/183 (8%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIE---NVTDLAFDDITPEDPDFSSIQGLAEAG 408
RR YARWLV + +T PA I + AF D++ DPDF+ IQGLAEAG
Sbjct: 207 RRTYARWLVETYNKFYENT-----PAKQIRLGVETSQPAFSDVSSNDPDFAVIQGLAEAG 261
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE-KRQLPEANKKILYQLSGFI 467
+I S L+ F P +PL+R+DLV+WK+ L+ + LP+A+ + + GF
Sbjct: 262 IIPSPLTGNS------SASLFRPNNPLTREDLVTWKVPLDMGKGLPQASIDNIKETWGFQ 315
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
D KI+ A AL AD GEQ + FG T LFQPDK VT A+AA +L G D +
Sbjct: 316 DTTKIDTKAIQALYADFQNGEQSNVRRVFGYTTLFQPDKGVTLAEAAASLWYFGYQGDGL 375
Query: 527 NEE 529
+ E
Sbjct: 376 SAE 378
>gi|427733822|ref|YP_007053366.1| putative S-layer protein [Rivularia sp. PCC 7116]
gi|427368863|gb|AFY52819.1| putative S-layer protein [Rivularia sp. PCC 7116]
Length = 442
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/167 (43%), Positives = 96/167 (57%), Gaps = 9/167 (5%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
+P RREYA WLV+A++ + ++ SK N + AF D+ DPDF++IQG
Sbjct: 256 FQPNKNITRREYAGWLVAANNAMYANSPSKQIRLAGEANKS--AFSDVKQTDPDFAAIQG 313
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQ 462
LAEAGLI S LS + F P++PL+R+ ++ WK+ L+ RQ LP A+ + Q
Sbjct: 314 LAEAGLIPSILSGDSTQ------VLFRPDAPLTREQMLLWKIPLDTRQGLPTASLDAVKQ 367
Query: 463 LSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVT 509
GF D KI+P A ALLAD GEQ I FG T LFQP K VT
Sbjct: 368 TWGFQDAGKIDPKALRALLADHQNGEQSNIRRVFGYTTLFQPKKAVT 414
>gi|172034973|ref|YP_001801474.1| hypothetical protein cce_0056 [Cyanothece sp. ATCC 51142]
gi|171696427|gb|ACB49408.1| hypothetical protein cce_0056 [Cyanothece sp. ATCC 51142]
Length = 391
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 100/183 (54%), Gaps = 16/183 (8%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIE---NVTDLAFDDITPEDPDFSSIQGLAEAG 408
RR YARWLV + +T PA I + AF D++ DPDF+ IQGLAEAG
Sbjct: 213 RRTYARWLVETYNKFYENT-----PAKQIRLGVETSQPAFSDVSSNDPDFAVIQGLAEAG 267
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEK-RQLPEANKKILYQLSGFI 467
+I S L+ F P +PL+R+DLV+WK+ L+ + LP+A+ + + GF
Sbjct: 268 IIPSPLTGNS------SASLFRPNNPLTREDLVTWKVPLDMGKGLPQASIDNIKETWGFQ 321
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
D KI+ A AL AD GEQ + FG T LFQPDK VT A+AA +L G D +
Sbjct: 322 DTTKIDTKAIQALYADFQNGEQSNVRRVFGYTTLFQPDKGVTLAEAAASLWYFGYQGDGL 381
Query: 527 NEE 529
+ E
Sbjct: 382 SAE 384
>gi|282896819|ref|ZP_06304825.1| S-layer region protein-like protein [Raphidiopsis brookii D9]
gi|281198228|gb|EFA73118.1| S-layer region protein-like protein [Raphidiopsis brookii D9]
Length = 457
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 107/178 (60%), Gaps = 13/178 (7%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKV--YPAMYIENVTDLAFDDITPEDPDFSSI 401
++P + RR++ARWL++ ++ + + +K P+ + + F D+ P D DF I
Sbjct: 270 LQPNKIITRRDFARWLLTGNNVMYANKQAKKIRLPSPGSQPI----FKDVPPTDVDFPFI 325
Query: 402 QGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKIL 460
QGLAEAGLI+S LS D+ + F P++PL+R++L+ WK+ L+ RQ LP A+ + +
Sbjct: 326 QGLAEAGLIASPLSG-DITE-----VLFRPDAPLTRENLLMWKVPLDTRQSLPNASMEAV 379
Query: 461 YQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALA 518
Q GF D +KI+ A A+LAD GEQ I FG T LFQP K VT +AA+ ++
Sbjct: 380 KQTWGFQDTEKIDLKALRAVLADFQNGEQSNIRRVFGYTTLFQPKKAVTRGEAALTIS 437
>gi|443309783|ref|ZP_21039470.1| putative S-layer protein [Synechocystis sp. PCC 7509]
gi|442780176|gb|ELR90382.1| putative S-layer protein [Synechocystis sp. PCC 7509]
Length = 426
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/212 (37%), Positives = 113/212 (53%), Gaps = 24/212 (11%)
Query: 331 LSALQVLKVIEADVK--------PGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIEN 382
L+AL VL + K P RREYAR L +A++ + S PA+ I
Sbjct: 209 LAALGVLSLPTTGNKTSSSSLFYPAKTITRREYARLLFAANNQINSSR-----PALQIRE 263
Query: 383 V---TDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQD 439
+ F D+ P D DF++IQGLA+AGLI S LS + + F P +PL+R+
Sbjct: 264 AAKDSQKTFQDVPPSDRDFAAIQGLADAGLIPSALSG------DSSAVLFRPNAPLTREQ 317
Query: 440 LVSWKMALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGC 498
L+ WK+ L+ RQ LP A+ + + GF D+ KI P A A+ AD ++ I AFG
Sbjct: 318 LIVWKVPLDSRQALPNASVESIKDSWGFQDVAKIEPKALRAVFADFQNSDRSNIRRAFGY 377
Query: 499 TRLFQPDKPVTNAQAAVAL-AIGEASDAVNEE 529
T LFQP KPVT A+AA L G A++ ++ +
Sbjct: 378 TTLFQPKKPVTRAEAAAVLWYFGNATEGLSAQ 409
>gi|126658439|ref|ZP_01729588.1| S-layer region-like protein [Cyanothece sp. CCY0110]
gi|126620371|gb|EAZ91091.1| S-layer region-like protein [Cyanothece sp. CCY0110]
Length = 386
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 101/183 (55%), Gaps = 16/183 (8%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIE---NVTDLAFDDITPEDPDFSSIQGLAEAG 408
RR YARWLV + +T PA I + AF D++ DPDF+ IQGLAEAG
Sbjct: 207 RRTYARWLVETYNKFYENT-----PAKQIRLGVETSKPAFSDVSSNDPDFAVIQGLAEAG 261
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE-KRQLPEANKKILYQLSGFI 467
+I S L+ F P++PL+R+DLV+WK+ L+ + LP+A+ + + GF
Sbjct: 262 IIPSPLTGNS------SASLFRPDNPLTREDLVTWKVPLDMGKGLPQASIDNIKETWGFQ 315
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
D KI+ A AL AD GEQ + FG T LFQP+K VT A+AA +L G D +
Sbjct: 316 DTTKIDTKAIQALYADFQNGEQSNVRRIFGYTTLFQPNKGVTLAEAAASLWYFGYQGDGL 375
Query: 527 NEE 529
+ E
Sbjct: 376 SAE 378
>gi|411118890|ref|ZP_11391270.1| S-layer domain containing protein [Oscillatoriales cyanobacterium
JSC-12]
gi|410710753|gb|EKQ68260.1| S-layer domain containing protein [Oscillatoriales cyanobacterium
JSC-12]
Length = 457
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/191 (39%), Positives = 106/191 (55%), Gaps = 12/191 (6%)
Query: 342 ADVKPGDLCIRREYARWLVSASSTLTRS-TMSKVYPAMYIENVTDLAFDDITPEDPDFSS 400
A+ +P RREYARWL A++ L R ++ PA + + F D+ DPDF++
Sbjct: 270 ANFQPNKPVSRREYARWLFEANNRLFRDRPAHQIRPA---NSESQPVFKDVPRTDPDFAA 326
Query: 401 IQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKI 459
IQGLAEAG+I S LS DL + F P++ LSR+ ++ WK+ ++ RQ LP
Sbjct: 327 IQGLAEAGIIPSPLSG-DLAT-----VTFRPDAQLSREMMLLWKVPIDTRQPLPTVTPDS 380
Query: 460 LYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-A 518
+ Q GF D +I+P A A+LAD + I G T LFQP KPVT A+AAV L
Sbjct: 381 IKQTWGFQDSSQIDPKAQRAVLADYQNADLSNIRRVLGYTTLFQPKKPVTRAEAAVTLWY 440
Query: 519 IGEASDAVNEE 529
IG D ++ +
Sbjct: 441 IGYQGDGISAQ 451
>gi|428769872|ref|YP_007161662.1| S-layer protein [Cyanobacterium aponinum PCC 10605]
gi|428684151|gb|AFZ53618.1| S-layer region-like precursor [Cyanobacterium aponinum PCC 10605]
Length = 382
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 104/198 (52%), Gaps = 11/198 (5%)
Query: 329 QALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAF 388
Q L+ L ++ E + P + RREYARWLV ++ L + SK+ + F
Sbjct: 185 QELARLGIVSNSEV-INPYNTISRREYARWLVQTNNLLFQDVNSKLIREANPN--SKPIF 241
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ DPDF+ IQGLAEAGLI S L + F P+ PL+R++L++WK+ L+
Sbjct: 242 TDVPVSDPDFAVIQGLAEAGLIPSPLLKQGEFTS------FNPDKPLTRENLITWKVPLD 295
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP L + GF D+ KI+P AW AL D GE AFG LFQP K
Sbjct: 296 FREKLPNVTLDALKETWGFQDLSKIDPGAWSALYLDWQNGESANTRKAFGYIILFQPQKE 355
Query: 508 VTNAQAAVAL-AIGEASD 524
VT +AA L + G +D
Sbjct: 356 VTYDEAARVLSSFGTNTD 373
>gi|428220554|ref|YP_007104724.1| putative S-layer protein [Synechococcus sp. PCC 7502]
gi|427993894|gb|AFY72589.1| putative S-layer protein [Synechococcus sp. PCC 7502]
Length = 450
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/205 (39%), Positives = 111/205 (54%), Gaps = 14/205 (6%)
Query: 331 LSALQVLKVIEADV----KPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDL 386
++ LQ L I A V KP DL +REYARWLV+ ++ L S ++ + N T +
Sbjct: 226 ITDLQKLGTITAKVDNQFKPNDLIQKREYARWLVNTNNRLYASRPTRQI-RLADANSTPV 284
Query: 387 AFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMA 446
F DI P DPDF IQ LA GLI + S + F P P+SR+ L+ WK+
Sbjct: 285 -FVDIPPSDPDFPVIQALANVGLIPTDPSVANTSRR------FRPNDPISREMLLQWKIP 337
Query: 447 LEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPD 505
L+ R +P AN + + Q GF D D+I+P + +LAD G+ I +FG T LFQP
Sbjct: 338 LDIRAAIPAANLESVKQAWGFQDSDRISPISLKFVLADFKLGDLSNIRRSFGYTTLFQPQ 397
Query: 506 KPVTNAQAAVAL-AIGEASDAVNEE 529
K VT A+AAVAL G D ++ +
Sbjct: 398 KNVTRAEAAVALWYFGTPEDGLSAQ 422
>gi|67921420|ref|ZP_00514938.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
gi|67856532|gb|EAM51773.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
Length = 385
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/180 (40%), Positives = 99/180 (55%), Gaps = 16/180 (8%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIE---NVTDLAFDDITPEDPDFSSIQGLAEAG 408
RR YARWLV T + + PA + N + AF D+ DPDF+ IQGLAEAG
Sbjct: 208 RRTYARWLVE-----TYNKFHENNPAKQLRLGVNTSQPAFSDVGSNDPDFAIIQGLAEAG 262
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEK-RQLPEANKKILYQLSGFI 467
+I S+L+ F P++PL+R+DL++WK+ L+ + LP+A+ + + GF
Sbjct: 263 IIPSRLTGNS------SASLFRPDTPLNREDLLTWKVPLDTGKGLPKASLDNIKETWGFQ 316
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
D KI+ A AL AD GEQ + FG T LFQP K VT A+AA +L G D +
Sbjct: 317 DTTKIDTKAIQALYADFQNGEQSNVRRVFGYTTLFQPKKGVTLAEAAASLWYFGYQGDGL 376
>gi|416384000|ref|ZP_11684553.1| hypothetical protein CWATWH0003_1383 [Crocosphaera watsonii WH
0003]
gi|357265132|gb|EHJ13935.1| hypothetical protein CWATWH0003_1383 [Crocosphaera watsonii WH
0003]
Length = 385
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 72/180 (40%), Positives = 99/180 (55%), Gaps = 16/180 (8%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIE---NVTDLAFDDITPEDPDFSSIQGLAEAG 408
RR YARWLV T + + PA + N + AF D+ DPDF+ IQGLAEAG
Sbjct: 208 RRTYARWLVE-----TYNKFHENNPAKQLRLGVNTSQPAFSDVGSNDPDFAIIQGLAEAG 262
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEK-RQLPEANKKILYQLSGFI 467
+I S+L+ F P++PL+R+DL++WK+ L+ + LP+A+ + + GF
Sbjct: 263 IIPSRLTGNS------SASLFRPDTPLNREDLLTWKVPLDTGKGLPKASLDNIKETWGFQ 316
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
D KI+ A AL AD GEQ + FG T LFQP K VT A+AA +L G D +
Sbjct: 317 DTTKIDTKAIQALYADFQNGEQSNVRRVFGYTTLFQPKKGVTLAEAAASLWYFGYQRDGL 376
>gi|374922023|gb|AFA26189.1| hypothetical protein, partial [Lolium perenne]
Length = 84
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/84 (69%), Positives = 66/84 (78%), Gaps = 3/84 (3%)
Query: 380 IENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLL---NEEPGPIFFLPESPLS 436
IENV++LAFDD+T EDPDF IQGLAEAGLISSKLS D+ N + FF P+SPLS
Sbjct: 1 IENVSELAFDDVTTEDPDFPFIQGLAEAGLISSKLSRSDMNISENVQNSHYFFSPDSPLS 60
Query: 437 RQDLVSWKMALEKRQLPEANKKIL 460
RQDLVSWKMAL+KRQLPE +K L
Sbjct: 61 RQDLVSWKMALDKRQLPEVDKNSL 84
>gi|170079104|ref|YP_001735742.1| S layer domain-containing protein [Synechococcus sp. PCC 7002]
gi|169886773|gb|ACB00487.1| S layer domain protein [Synechococcus sp. PCC 7002]
Length = 401
Score = 112 bits (281), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 104/194 (53%), Gaps = 14/194 (7%)
Query: 326 VQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTD 385
Q Q L ALQV AD + + RR++ARWL A + + ++ + N +
Sbjct: 179 TQVQDLVALQVFTA--ADFQADTVITRRQFARWLFKAHNAIYGDRQNQ---QIRRANASS 233
Query: 386 LA-FDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWK 444
F D+ DPDF IQGLAEAG+I S L++ + F P++PL+R+ L++WK
Sbjct: 234 KPIFTDVPASDPDFPFIQGLAEAGIIPSSLTNDTITT-------FRPDAPLTRESLIAWK 286
Query: 445 MALEKRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQ 503
+ L++RQ LP+ + + + GF D KI+ A AL D G+Q + FG T LFQ
Sbjct: 287 VPLDRRQALPQTSLDNIAETWGFQDAAKIDTRALQALYVDFQNGDQANVRRVFGYTTLFQ 346
Query: 504 PDKPVTNAQAAVAL 517
P K VT + A+AL
Sbjct: 347 PQKTVTQQEVAIAL 360
>gi|428773305|ref|YP_007165093.1| S-layer protein [Cyanobacterium stanieri PCC 7202]
gi|428687584|gb|AFZ47444.1| S-layer region-like precursor [Cyanobacterium stanieri PCC 7202]
Length = 369
Score = 112 bits (281), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 63/168 (37%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIENV-TDLAFDDITPEDPDFSSIQGLAEAGLI 410
RR+YARWLV ++ + S K ++ + + ++ F D+ +DPDF IQGLA AGLI
Sbjct: 196 RRQYARWLVKTNNVIFGSNDGK---SIRLASANSEAVFRDVANDDPDFPYIQGLANAGLI 252
Query: 411 SSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDI 469
S+L++ P I F P+ PL+R+DL+ WK+ + RQ P + + GF D
Sbjct: 253 PSRLTNN------PDAIAFNPDQPLTREDLILWKVPFDFRQSFPTTTLDNIRETWGFQDA 306
Query: 470 DKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
++++P+ W L D G+ I FG T LFQP K VT +AA+ L
Sbjct: 307 NQMSPELWQKLYIDWQNGDNANIRRTFGFTTLFQPQKTVTMEEAAITL 354
>gi|427722679|ref|YP_007069956.1| S layer domain-containing protein [Leptolyngbya sp. PCC 7376]
gi|427354399|gb|AFY37122.1| S layer domain-containing protein [Leptolyngbya sp. PCC 7376]
Length = 381
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 104/200 (52%), Gaps = 17/200 (8%)
Query: 334 LQVLKVIEADVKPGDLCI-RREYARWLVSASSTLTRSTMSKVYPAMYIENVTDL---AFD 389
L L V E G L I RR++ARWL A + + P I TD F
Sbjct: 187 LIALDVFEDSEFQGSLTISRRQFARWLFKAHNAIYGDR-----PNQQIRLATDTNQAVFQ 241
Query: 390 DITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEK 449
D+ DPDF IQGLAEAGL+ S L+ + P F P++PL+R+ L+SWK+ L++
Sbjct: 242 DLPSSDPDFGMIQGLAEAGLLPSTLTS------DANPTTFRPDAPLTRETLISWKVPLDR 295
Query: 450 RQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPV 508
R+ LPE + + + GF D +I+ A AL D G+Q FG T LFQP K V
Sbjct: 296 RKALPETTLEQIAETWGFQDATEIDSRALQALYIDFQNGDQANTRRVFGYTTLFQPKKTV 355
Query: 509 TNAQAAVAL-AIGEASDAVN 527
T +AA+AL G SD ++
Sbjct: 356 TQTEAAIALWYFGFQSDGLS 375
>gi|443475092|ref|ZP_21065052.1| S-layer domain-containing protein [Pseudanabaena biceps PCC 7429]
gi|443020094|gb|ELS34093.1| S-layer domain-containing protein [Pseudanabaena biceps PCC 7429]
Length = 351
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 102/190 (53%), Gaps = 18/190 (9%)
Query: 340 IEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLA-FDDITPEDPDF 398
++ + +P L RREYARWL+ ++ L ++ S+ + +D+A F DI PDF
Sbjct: 167 VDREFRPNTLISRREYARWLMQTNNRLYKNQPSR---QIRFAQSSDMASFPDIPSSHPDF 223
Query: 399 SSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANK 457
+ IQGLA AGLI F P PL R++LV WK+ L+ RQ LP A
Sbjct: 224 TIIQGLANAGLIGGTGDR------------FRPNDPLLREELVQWKIPLDLRQPLPNATL 271
Query: 458 KILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
+ + Q GF D D+I+ +A A+ AD + I +FG T L QP KPVT A+AA +L
Sbjct: 272 ENVSQAWGFKDSDRISENALSAIFADAQLRDISNIRRSFGFTTLLQPQKPVTRAEAAASL 331
Query: 518 -AIGEASDAV 526
G A+D +
Sbjct: 332 WYFGTATDGI 341
>gi|443313986|ref|ZP_21043588.1| putative S-layer protein [Leptolyngbya sp. PCC 6406]
gi|442786420|gb|ELR96158.1| putative S-layer protein [Leptolyngbya sp. PCC 6406]
Length = 341
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/181 (38%), Positives = 100/181 (55%), Gaps = 12/181 (6%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIENVTD-LAFDDITPEDPDFSSIQGLAEAGLI 410
RREYARWL + ++ + +K + N +D AF D+ PEDPDF++IQGLA AG+I
Sbjct: 164 RREYARWLFALNNQFHQDAAAK---RIRGGNRSDKPAFQDVPPEDPDFAAIQGLAAAGII 220
Query: 411 SSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKR-QLPEANKKILYQLSGFIDI 469
S L+ + F P++PL+R+ LV WK+ L+ R LP + + Q F D+
Sbjct: 221 PSALTGNSTA------VTFRPDAPLTRETLVLWKVPLDTRATLPTTTPEAVTQTWAFQDV 274
Query: 470 DKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALA-IGEASDAVNE 528
+ I P A A+ AD G+ I AFG T L QP K VT ++AA AL G +D +
Sbjct: 275 NTIEPLALRAIAADFQLGDFANIRRAFGYTTLLQPQKAVTRSEAAAALWRFGTQTDGITA 334
Query: 529 E 529
+
Sbjct: 335 Q 335
>gi|113474253|ref|YP_720314.1| hypothetical protein Tery_0372 [Trichodesmium erythraeum IMS101]
gi|110165301|gb|ABG49841.1| S-layer region-like [Trichodesmium erythraeum IMS101]
Length = 379
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 77/190 (40%), Positives = 110/190 (57%), Gaps = 13/190 (6%)
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
VKP +L RR YARWLVSA++ + + +K + + N + F D+ DPDF+ IQG
Sbjct: 192 VKPQNLVSRRVYARWLVSANNKMFTNNSAK-HIRLARANEKQI-FQDVPKTDPDFAVIQG 249
Query: 404 LAEAGLISSKL-SHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILY 461
LAEAGLI S L +LL F P+ L+R+DL+ WK+ L+ R+ LPEA + +
Sbjct: 250 LAEAGLIPSPLFGDVNLLK-------FRPDDFLTREDLILWKVPLDFRKPLPEATIEKIK 302
Query: 462 QLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIG 520
+ F D KINP A A+LAD + I FG T+LF+PDK V+ A+AA L G
Sbjct: 303 AVWDFQDASKINPIALKAVLAD-GGNKFSNIRRVFGYTKLFRPDKTVSRAEAAAVLWYFG 361
Query: 521 EASDAVNEEL 530
+ D ++ ++
Sbjct: 362 DQKDGISAQM 371
>gi|428216489|ref|YP_007100954.1| S-layer protein [Pseudanabaena sp. PCC 7367]
gi|427988271|gb|AFY68526.1| S-layer domain-containing protein [Pseudanabaena sp. PCC 7367]
Length = 533
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/235 (34%), Positives = 116/235 (49%), Gaps = 28/235 (11%)
Query: 296 SPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREY 355
SP + + V AALQ + +D QG L +P RR+Y
Sbjct: 308 SPTKLSDLNTVPAALQTFVKDMAKLGAIDPAQGDRL-------------QPNGEIKRRDY 354
Query: 356 ARWLVSASSTLTRSTMSKVYPAMYIENVTDL-AFDDITPEDPDFSSIQGLAEAGLISSKL 414
ARWLV A++ + +S S+ + + +D AF D+ DPDF IQ LA AGLI +
Sbjct: 355 ARWLVLANNRIHQSNPSR---QIRLALTSDKPAFADVKSGDPDFLYIQALANAGLIGNAS 411
Query: 415 SHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPE-ANKKILYQLSGFIDIDKI 472
L F P++PLSR+++++WK+ L+ R LP A + Q F D D+I
Sbjct: 412 GGNQNL--------FRPDAPLSREEMIAWKVPLDLRDGLPNSATVATVQQAWNFQDSDRI 463
Query: 473 NPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAV 526
P A A+L D G+ I +FG T + QP KPVT A+AA AL G A+D +
Sbjct: 464 APGALVAILGDDQLGDLSNIRRSFGYTTILQPKKPVTRAEAAAALWHFGTATDGI 518
>gi|220908035|ref|YP_002483346.1| S-layer protein [Cyanothece sp. PCC 7425]
gi|219864646|gb|ACL44985.1| S-layer region-like protein, putative [Cyanothece sp. PCC 7425]
Length = 460
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/282 (34%), Positives = 131/282 (46%), Gaps = 37/282 (13%)
Query: 293 SSSSPAGIPAPSVVSAAL-QVLPGKV-----LVPAVVD--------QVQGQALSALQVLK 338
SS++P+ APS S + VLPG P D QV Q ++ L +L
Sbjct: 176 SSANPSVSAAPSPTSDLIGPVLPGNASPSPQTSPTFTDLNQAPLALQVNLQDMAKLGILT 235
Query: 339 VIEAD-----VKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVT---DLAFDD 390
+A P RR +ARWLV LT + + PA I + F D
Sbjct: 236 PADASRNPNLFYPNQAINRRTFARWLV-----LTNNRIYSDRPARQIRLASPSDSPLFRD 290
Query: 391 ITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKR 450
+ DPDF IQGLA AG + S L+ L F P PLSR+ L+ WK+ ++ R
Sbjct: 291 VPSSDPDFPYIQGLAAAGYLPSSLTDSTSLQ-------FRPNDPLSREALLQWKVPVDIR 343
Query: 451 Q-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVT 509
Q LP A + Q GF D ++I P+A A+LAD G+ I FG T L QP KPVT
Sbjct: 344 QNLPTATMDGVKQAWGFKDANRIAPEALQAVLADYQNGDLSNIRRLFGSTLLLQPKKPVT 403
Query: 510 NAQAAVAL-AIGEASDAVN-EELQRIEAESAAENAVSEHSAL 549
A+A VAL +G D + ++ + E A+ N SAL
Sbjct: 404 RAEAGVALWYVGVQGDGFSAQDALKSEQVQASSNPTPNDSAL 445
>gi|242095712|ref|XP_002438346.1| hypothetical protein SORBIDRAFT_10g013060 [Sorghum bicolor]
gi|241916569|gb|EER89713.1| hypothetical protein SORBIDRAFT_10g013060 [Sorghum bicolor]
Length = 319
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/129 (43%), Positives = 76/129 (58%), Gaps = 4/129 (3%)
Query: 324 DQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENV 383
D V +A S L+ L++IE DV D C RRE+ARW V S R M ++ P+
Sbjct: 177 DPVHEEAFSILKKLQIIEKDVSSSDFCTRREFARWFVKLCSKFERKRMQRIVPSKLTSGA 236
Query: 384 TDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHR-DLL---NEEPGPIFFLPESPLSRQD 439
AFDD+ +DPDF IQ L E+G++SSKLS+ + L + G FLP+S LSR D
Sbjct: 237 VQCAFDDVNIDDPDFLYIQSLGESGIVSSKLSNSLETLTSGSHSKGNSLFLPDSYLSRFD 296
Query: 440 LVSWKMALE 448
LV+WK+ +E
Sbjct: 297 LVNWKVLVE 305
>gi|428210890|ref|YP_007084034.1| putative S-layer protein [Oscillatoria acuminata PCC 6304]
gi|427999271|gb|AFY80114.1| putative S-layer protein [Oscillatoria acuminata PCC 6304]
Length = 451
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 94/170 (55%), Gaps = 15/170 (8%)
Query: 352 RREYARWLVSASSTLTRSTMSK---VYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAG 408
R+++ARWL+ ++ + + K + PA + AF D+ PDFS IQGLAEAG
Sbjct: 269 RKQFARWLLEVNNKIYANQAGKQIRLAPA-----TANPAFSDVNANHPDFSVIQGLAEAG 323
Query: 409 LISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKR-QLPEANKKILYQLSGFI 467
+I S LS F P++ ++R+ L+ WK+ L+ R LP A + Q GF
Sbjct: 324 IIPSALSGDSSATT------FRPDAIITREQLLVWKVPLDLRSNLPSATVDAIQQSWGFQ 377
Query: 468 DIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
D KI+P+A A+ AD GE+ I AFG T LFQP K VT A+AA AL
Sbjct: 378 DAGKIDPNALRAVYADYQNGERSNIRRAFGYTTLFQPKKEVTRAEAAAAL 427
>gi|326507018|dbj|BAJ95586.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 307
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 79/144 (54%), Gaps = 11/144 (7%)
Query: 323 VDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIEN 382
VD + +ALS L+ L++IE D GD C RRE+ARW V S L R M ++ P +
Sbjct: 164 VDPMHEEALSILKKLQIIENDASSGDFCTRREFARWFVKLCSKLERKRMHRIIPNLITSG 223
Query: 383 VTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH--------RDLLNEEPGPIFFLPESP 434
+ AFDD+ +DPDF IQ L E+G++ SKLS N FLPES
Sbjct: 224 SVESAFDDVNFDDPDFLYIQSLGESGIVPSKLSSFFGTSTNGYQSANRNSN---FLPESY 280
Query: 435 LSRQDLVSWKMALEKRQLPEANKK 458
LSR DLV+WK+ +E E ++K
Sbjct: 281 LSRFDLVNWKLLVEYPFASELDQK 304
>gi|427419413|ref|ZP_18909596.1| S-layer domain containing protein [Leptolyngbya sp. PCC 7375]
gi|425762126|gb|EKV02979.1| S-layer domain containing protein [Leptolyngbya sp. PCC 7375]
Length = 341
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/202 (38%), Positives = 103/202 (50%), Gaps = 26/202 (12%)
Query: 331 LSALQVLKVIEADVK-------------PGDLCIRREYARWLVSASSTLTRSTMSK-VYP 376
L AL VL +I DVK P + RREYARWL++ ++ +K + P
Sbjct: 117 LVALDVLMLI--DVKADDNIERDPNEFLPNQVITRREYARWLLAVNNKFYSDQRAKKIRP 174
Query: 377 AMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLS 436
A+ AF D+ + DF++IQGLAEAG+I S L N + F P++PL+
Sbjct: 175 AV---ESAQPAFQDVGKTNIDFAAIQGLAEAGIIPSTL------NGSTTVVKFRPDAPLT 225
Query: 437 RQDLVSWKMALEKRQLPEANKKILYQLS-GFIDIDKINPDAWPALLADLTAGEQGIIALA 495
R+DL+ WK+ L+ R A + GF D KI P A+L D + GE I A
Sbjct: 226 RKDLILWKVPLDTRAALPAATATAVTEAWGFQDAGKIEPTVLKAVLTDHSNGEFANIRRA 285
Query: 496 FGCTRLFQPDKPVTNAQAAVAL 517
G T LFQPDK VT A+AA L
Sbjct: 286 LGYTTLFQPDKAVTRAEAAAVL 307
>gi|409991339|ref|ZP_11274609.1| S-layer protein [Arthrospira platensis str. Paraca]
gi|409937793|gb|EKN79187.1| S-layer protein [Arthrospira platensis str. Paraca]
Length = 430
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 94/179 (52%), Gaps = 21/179 (11%)
Query: 346 PGDLCIRREYARWLVSASSTLTRSTMSKVY---PAMYIE---NVTDLAFDDITPEDPDFS 399
P R E+AR LV A++ K+Y P I + + F D+ P FS
Sbjct: 242 PNQAITRGEFARALVQANN--------KIYADIPGRQIRLAISSSQPVFTDVPANHPYFS 293
Query: 400 SIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKK 458
IQGLAEAGLI S LS + + F P++PL+R+ L+ WK+ L+ RQ LP+
Sbjct: 294 EIQGLAEAGLIPSSLSG------DSTAVQFRPDAPLTREYLILWKVPLDTRQGLPQGTVD 347
Query: 459 ILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
+ Q GF D KIN +A A+LAD G+ I FG T LFQP K VT A+AA AL
Sbjct: 348 AVEQRWGFQDASKINSNAIRAILADFDNGDNANIRRVFGFTTLFQPQKTVTRAEAATAL 406
>gi|291565728|dbj|BAI88000.1| S-layer domain protein [Arthrospira platensis NIES-39]
Length = 430
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 94/179 (52%), Gaps = 21/179 (11%)
Query: 346 PGDLCIRREYARWLVSASSTLTRSTMSKVY---PAMYIE---NVTDLAFDDITPEDPDFS 399
P R E+AR LV A++ K+Y P I + + F D+ P FS
Sbjct: 242 PNQAITRGEFARALVQANN--------KIYADIPGRQIRLAISSSQPVFTDVPANHPYFS 293
Query: 400 SIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKK 458
IQGLAEAGLI S LS + + F P++PL+R+ L+ WK+ L+ RQ LP+
Sbjct: 294 EIQGLAEAGLIPSSLSG------DSTAVQFRPDAPLTREYLILWKVPLDTRQGLPQGTVD 347
Query: 459 ILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
+ Q GF D KIN +A A+LAD G+ I FG T LFQP K VT A+AA AL
Sbjct: 348 AVEQRWGFQDASKINSNAIRAILADFDNGDNANIRRVFGFTTLFQPQKTVTRAEAATAL 406
>gi|376007071|ref|ZP_09784276.1| S-layer region-like protein [Arthrospira sp. PCC 8005]
gi|375324551|emb|CCE20029.1| S-layer region-like protein [Arthrospira sp. PCC 8005]
Length = 430
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 98/189 (51%), Gaps = 22/189 (11%)
Query: 337 LKVIEADV-KPGDLCIRREYARWLVSASSTLTRSTMSKVY---PAMYIE---NVTDLAFD 389
L V E + P R E+AR LV A++ K+Y P I + + F
Sbjct: 232 LGVFEQNFPNPNQAITRGEFARALVQANN--------KIYADIPGRQIRLSISSSQPVFT 283
Query: 390 DITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEK 449
D+ P F+ IQGLAEAGLI S LS + + F P++PL+R+ L+ WK+ L+
Sbjct: 284 DVPANHPYFAEIQGLAEAGLIPSSLSG------DTTAVQFRPDAPLTREYLILWKVPLDI 337
Query: 450 RQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPV 508
RQ LP+ + Q GF D KIN +A A+LAD G+ I FG T LFQP K V
Sbjct: 338 RQGLPQGTVDAVEQRWGFQDASKINSNAIRAILADFDNGDNANIRRVFGFTTLFQPQKTV 397
Query: 509 TNAQAAVAL 517
T A+AA AL
Sbjct: 398 TRAEAATAL 406
>gi|312509|emb|CAA47922.1| unnamed protein product [Synechococcus elongatus PCC 7942]
Length = 294
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 92/167 (55%), Gaps = 9/167 (5%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLIS 411
R +ARWL++ ++ + + I + + + D+ +PDF +IQ LAEAG +
Sbjct: 83 RGTFARWLLAVNNRFFEDDPGRQI-RLAIADSPPI-YTDVPTSNPDFIAIQSLAEAGTLP 140
Query: 412 SKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDID 470
S+LS F PE+PL+R DL+ WK+ L+ RQ LP A + L GF D +
Sbjct: 141 SRLSGDTAATR------FQPEAPLTRADLLLWKVPLDHRQTLPTATPEKLAASWGFQDTN 194
Query: 471 KINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
++P ALLAD G Q II FG T+LFQP KPVT A+AA AL
Sbjct: 195 GLDPRLQRALLADDDNGTQSIIRRVFGFTQLFQPRKPVTRAEAAAAL 241
>gi|56750147|ref|YP_170848.1| hypothetical protein syc0138_c [Synechococcus elongatus PCC 6301]
gi|81300226|ref|YP_400434.1| hypothetical protein Synpcc7942_1417 [Synechococcus elongatus PCC
7942]
gi|56685106|dbj|BAD78328.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169107|gb|ABB57447.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 327
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 92/167 (55%), Gaps = 9/167 (5%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLIS 411
R +ARWL++ ++ + + I + + + D+ +PDF +IQ LAEAG +
Sbjct: 116 RGTFARWLLAVNNRFFEDDPGRQI-RLAIADSPPI-YTDVPTSNPDFIAIQSLAEAGTLP 173
Query: 412 SKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ-LPEANKKILYQLSGFIDID 470
S+LS F PE+PL+R DL+ WK+ L+ RQ LP A + L GF D +
Sbjct: 174 SRLSGDTAATR------FQPEAPLTRADLLLWKVPLDHRQTLPTATPEKLAASWGFQDTN 227
Query: 471 KINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
++P ALLAD G Q II FG T+LFQP KPVT A+AA AL
Sbjct: 228 GLDPRLQRALLADDDNGTQSIIRRVFGFTQLFQPRKPVTRAEAAAAL 274
>gi|209526645|ref|ZP_03275169.1| S-layer domain protein [Arthrospira maxima CS-328]
gi|423064060|ref|ZP_17052850.1| S-layer domain protein [Arthrospira platensis C1]
gi|209492881|gb|EDZ93212.1| S-layer domain protein [Arthrospira maxima CS-328]
gi|406714477|gb|EKD09642.1| S-layer domain protein [Arthrospira platensis C1]
Length = 430
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 98/189 (51%), Gaps = 22/189 (11%)
Query: 337 LKVIEADV-KPGDLCIRREYARWLVSASSTLTRSTMSKVY---PAMYIE---NVTDLAFD 389
L V E + P R E+AR LV A++ K+Y P I + + F
Sbjct: 232 LGVFEQNFPNPNQAITRGEFARALVQANN--------KIYADIPGRQIRLSISSSQPVFT 283
Query: 390 DITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEK 449
D+ P F+ IQGLAEAGLI S LS + + F P++PL+R+ L+ WK+ L+
Sbjct: 284 DVPANHPYFAEIQGLAEAGLIPSSLSG------DTTAVQFRPDAPLTREYLILWKVPLDI 337
Query: 450 RQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPV 508
RQ LP+ + Q GF D KIN +A A+LAD G+ I FG T LFQP + V
Sbjct: 338 RQGLPQGTVDAVEQRWGFQDASKINSNAIRAILADFDNGDNANIRRVFGFTTLFQPQRTV 397
Query: 509 TNAQAAVAL 517
T A+AA AL
Sbjct: 398 TRAEAATAL 406
>gi|297605871|ref|NP_001057703.2| Os06g0499000 [Oryza sativa Japonica Group]
gi|255677073|dbj|BAF19617.2| Os06g0499000, partial [Oryza sativa Japonica Group]
Length = 206
Score = 92.8 bits (229), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 114/206 (55%), Gaps = 12/206 (5%)
Query: 499 TRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKE-- 556
TR QP KPVT AQAA AL G + + +EL R+EAE+ ++ +V + E+ +E
Sbjct: 2 TRCLQPHKPVTKAQAAAALTSGRMEEVIRDELNRLEAENQSQLSV------MGEIMEELI 55
Query: 557 ----INESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEME 612
I +E ++ +E + V+K + QEL + +RE + L+KER A+E + +
Sbjct: 56 NRGDIKRYWEDKMKVEEIREVAVDKQLQHVLQELANEKTDREKELAVLLKERTALEHQNQ 115
Query: 613 ILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSM 672
L LR E++ + L +E+ E++ + L + ++Q ++ + LE E++AL+M
Sbjct: 116 ELMNLRSEIDGMYDRLAMESLEVMTEEQNLEKLSLDVNRKHQAVSESKSYLEAEKEALTM 175
Query: 673 ARAWAEDEAKRAREQAKALEGARDRW 698
R+W E+EA R E+A+ LE A RW
Sbjct: 176 LRSWVEEEAARVHERAEVLERAVRRW 201
>gi|22299348|ref|NP_682595.1| hypothetical protein tll1805 [Thermosynechococcus elongatus BP-1]
gi|22295531|dbj|BAC09357.1| tll1805 [Thermosynechococcus elongatus BP-1]
Length = 398
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 115/223 (51%), Gaps = 33/223 (14%)
Query: 318 LVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRR-EYARWLVSASSTLTRSTMSKVY- 375
L PA+ D L+ L VL ++P ++ IRR ++ RWLV+ T ++ Y
Sbjct: 190 LQPAIRD------LAELGVLSTTGDRLQP-NIPIRRGQFVRWLVT--------TYNRFYA 234
Query: 376 --PAMYIE----NVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFF 429
PA I N T + F D+ + PDF IQGLA AG + S L+ F
Sbjct: 235 DRPARQIRLGSRNDTPI-FQDVPRDHPDFPYIQGLAMAGFLPSPLTGDTS-------ALF 286
Query: 430 LPESPLSRQDLVSWKMALEKR-QLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGE 488
PE+PL+R+ L+ WK+ L+++ +L + + Q GF D +I P A A+ AD AG+
Sbjct: 287 RPEAPLTRETLLQWKVPLDQQGRLSPSTIDRIQQTWGFKDSQRIAPPAINAVAADYLAGD 346
Query: 489 QGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGEASDAVNEEL 530
I +G T L QP KPVT+A+AA AL IG ++ ++ +
Sbjct: 347 LSNIRRVWGETLLLQPQKPVTHAEAAAALWYIGNGTEGLSAAM 389
>gi|427712706|ref|YP_007061330.1| putative S-layer protein [Synechococcus sp. PCC 6312]
gi|427376835|gb|AFY60787.1| putative S-layer protein [Synechococcus sp. PCC 6312]
Length = 399
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 83/242 (34%), Positives = 108/242 (44%), Gaps = 34/242 (14%)
Query: 290 PTGSSSSPAGIPAPSVVSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEA------D 343
PT S + PA P PS Q PA + AL L L VI
Sbjct: 152 PTNSPTIPAPTPQPSPADLPPQSFTDLAQAPATLQP----ALQNLAELGVITGSPQNPQQ 207
Query: 344 VKPGDLCIRREYARWLVSASSTLTRSTMSKVY---PAMYIE----NVTDLAFDDITPEDP 396
P R YARWLV+A + + Y PA I N L F+D+ +P
Sbjct: 208 FAPNQPISRGTYARWLVTAHN--------RFYADRPARQIRLGSPNDKPL-FNDVPKTNP 258
Query: 397 DFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE-KRQLPEA 455
DF IQ LA AG + S L+ + P F P +PL+R+ L+ WK+ L+ +R L
Sbjct: 259 DFPYIQALAAAGYLPSPLTG----SVTP---LFRPSAPLTRETLLQWKVPLDVQRNLTTT 311
Query: 456 NKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAV 515
+ Q GF D ++I P+A A AD G+ I FG T L QP KPVT A+AA
Sbjct: 312 AIDRIEQTWGFKDSNRITPEALSATAADFQNGDLSNIRRIFGATLLLQPQKPVTRAEAAA 371
Query: 516 AL 517
+L
Sbjct: 372 SL 373
>gi|284928820|ref|YP_003421342.1| hypothetical protein UCYN_02370 [cyanobacterium UCYN-A]
gi|284809279|gb|ADB94984.1| hypothetical protein UCYN_02370 [cyanobacterium UCYN-A]
Length = 383
Score = 89.4 bits (220), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 87/178 (48%), Gaps = 31/178 (17%)
Query: 352 RREYARWLVSASSTLT--------RSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQG 403
RR YA+WL + R S + P F D++ DPD+ IQG
Sbjct: 207 RRTYAKWLFKTYNKFYQDVPEKQIRLASSNLKPV----------FSDVSSNDPDYLYIQG 256
Query: 404 LAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQ----LPEANKKI 459
LAEAG+I S LS ++ + F P++ L+R++L+ WK+ L+ + +P + +
Sbjct: 257 LAEAGIIPSSLS------KDNNSLLFYPDAYLTRENLIIWKVPLDFGKGLSIIPTIDIE- 309
Query: 460 LYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL 517
+ GF DI IN A AL D E+ I FG T LFQP KPVT A+A +L
Sbjct: 310 --KNWGFQDIKTINLKALQALYIDFHNKEKSNIRRIFGYTILFQPQKPVTLAEAITSL 365
>gi|226498314|ref|NP_001144830.1| uncharacterized protein LOC100277914 [Zea mays]
gi|195647636|gb|ACG43286.1| hypothetical protein [Zea mays]
Length = 270
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 54/93 (58%)
Query: 324 DQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENV 383
D V +ALS L+ L++IE DV D C R+E+ARW V S R M ++ P
Sbjct: 178 DPVHEEALSVLKKLQIIEKDVSSSDFCTRKEFARWFVKLCSKFERKKMQRIVPNKLTSGT 237
Query: 384 TDLAFDDITPEDPDFSSIQGLAEAGLISSKLSH 416
AFDD+ + PDF IQ L E+G+ISSKLS+
Sbjct: 238 VQCAFDDVNIDHPDFLYIQSLGESGIISSKLSN 270
>gi|158338818|ref|YP_001519995.1| S-layer protein [Acaryochloris marina MBIC11017]
gi|158309059|gb|ABW30676.1| S-layer region-like protein, putative [Acaryochloris marina
MBIC11017]
Length = 414
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 105/218 (48%), Gaps = 16/218 (7%)
Query: 331 LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLT--RSTMSKVYPAMYIENVTDLAF 388
+S +Q L+VI+ P R +ARWLV ++ L R T A V F
Sbjct: 208 VSDIQRLEVIDLG-NPNQSIQRGTFARWLVKTNNRLYQDRPTQQIRLAATSQPPV----F 262
Query: 389 DDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALE 448
D+ +P+F IQGLAEAG I S LS + F P PL+R+ L+SWK+ ++
Sbjct: 263 KDVPSSNPNFPYIQGLAEAGFIPSPLSG------DADQATFQPNQPLTREVLLSWKVPID 316
Query: 449 KRQ-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKP 507
R+ LP A + ++ GF D +I+ A+ AD GE I G L QP K
Sbjct: 317 FRKILPSATTAKVQEVWGFKDTKQISTPTLSAIFADHNNGELANIRRLLGSALLLQPKKS 376
Query: 508 VTNAQAAVAL-AIGEASDAVNEELQRIEAESAAENAVS 544
VT A+AA L IG A + + + + AE AE A S
Sbjct: 377 VTRAEAAATLWFIGVAGEGYSAK-DVLRAEQQAEAASS 413
>gi|359460719|ref|ZP_09249282.1| S-layer region-like protein [Acaryochloris sp. CCMEE 5410]
Length = 425
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/214 (33%), Positives = 106/214 (49%), Gaps = 12/214 (5%)
Query: 331 LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLAFDD 390
+S +Q L+VI+ P R +ARWLV ++ L + ++ + N T + F D
Sbjct: 219 VSDVQRLEVIDLG-NPNQPIQRGTFARWLVKTNNRLYQDRPTQQI-RLAATNQTPI-FKD 275
Query: 391 ITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKR 450
+ +P+F IQGLAEAG I S LS + F P PL+R+ L+SWK+ ++ R
Sbjct: 276 VPSSNPNFPYIQGLAEAGFIPSPLSG------DADQATFQPNQPLTREVLLSWKVPIDFR 329
Query: 451 Q-LPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVT 509
+ LP A + ++ GF D +I+ A+ D GE I G L QP K VT
Sbjct: 330 KILPSATTAKVQEVWGFKDTKQISTPTLSAIFTDHNNGELANIRRLLGSALLLQPKKSVT 389
Query: 510 NAQAAVAL-AIGEASDAVNEELQRIEAESAAENA 542
A+AA L IG A + + + + AE AE A
Sbjct: 390 RAEAAATLWFIGVAGEGYSAK-DVLRAEQQAEAA 422
>gi|428309133|ref|YP_007120110.1| S-layer protein [Microcoleus sp. PCC 7113]
gi|428250745|gb|AFZ16704.1| putative S-layer protein [Microcoleus sp. PCC 7113]
Length = 312
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 90/204 (44%), Gaps = 17/204 (8%)
Query: 319 VPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAM 378
+ V+ + Q L L V + P R E+ARWLV ++ + + K +
Sbjct: 104 IKGVLGEKQIIQLGQLGVFDSTSGNFDPKAPITRAEFARWLVRTNNAIFPGSSDKT---I 160
Query: 379 YIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQ 438
+ F D+ P PDF IQGLA AG S F PE L+R+
Sbjct: 161 RLSEAGKATFSDVPPTHPDFPYIQGLANAGYSISDDEKT-----------FKPEQILTRE 209
Query: 439 DLVSWKMALEKRQLPEANKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGC 498
+++ K+AL+ R E+ K G+ D +KI+ WPA+ + I+ FG
Sbjct: 210 QMLAMKVALDHRVPLESYKG--GAPGGWTDSNKISKKYWPAIYVESVFQNNANISRTFGA 267
Query: 499 TRLFQPDKPVTNAQAAVAL-AIGE 521
+ P PVT ++A +++ AIG+
Sbjct: 268 LKTLNPQAPVTRSEAVLSISAIGD 291
>gi|86608930|ref|YP_477692.1| S-layer protein [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86557472|gb|ABD02429.1| S-layer domain protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 364
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 94/185 (50%), Gaps = 25/185 (13%)
Query: 352 RREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLA-FDDITPEDPDFSSIQGLAEAGLI 410
R E+ARWLV A++ + S+ + + + ++ F D+ EDP+F IQ L AG+I
Sbjct: 129 RGEFARWLVLANNAIHADEPSR---QIRLGSPSERPLFLDVPEEDPNFRYIQALGAAGII 185
Query: 411 SSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLPEANKKI----LYQLSGF 466
+ N E F P S LSR +L+ K+ L+ LP K L + GF
Sbjct: 186 AGDA------NRE-----FRPNSLLSRAELIRMKVPLD---LPPGQIKGSRAELEERWGF 231
Query: 467 IDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIGE--AS 523
D +I P+A PAL+AD + + FG R F P +PV+ +AA+AL A GE A
Sbjct: 232 TDAAQIPPEAIPALVADRSLENASTVLRTFGPIRTFNPFEPVSRGEAAIALSAFGERTAQ 291
Query: 524 DAVNE 528
DA+ +
Sbjct: 292 DALPQ 296
>gi|86606153|ref|YP_474916.1| S-layer protein [Synechococcus sp. JA-3-3Ab]
gi|86554695|gb|ABC99653.1| S-layer domain protein [Synechococcus sp. JA-3-3Ab]
Length = 373
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/241 (31%), Positives = 114/241 (47%), Gaps = 32/241 (13%)
Query: 290 PTGSSSSPAGI-PAPSVVSAALQVL-PGKVLVPAVVDQVQGQALSALQVLK-VIEADVKP 346
P+ ++ PA P PS VS+ L PG ++ AV D L L V ++ ++ +P
Sbjct: 85 PSAPATPPASFTPPPSQVSSRFTDLEPGSLVARAVSD------LDRLGVFADIVGSEFQP 138
Query: 347 GDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDLA-FDDITPEDPDFSSIQGLA 405
R E+ARWLV A++ + S+ + + + + F D+ EDP+F IQ L
Sbjct: 139 QRSVRRGEFARWLVLANNVIHADQPSR---QIRLGSAGERPLFLDVPEEDPNFRYIQALG 195
Query: 406 EAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMALEKRQLP----EANKKILY 461
AGL+ N E F P S LSR +L+ K L+ LP + ++ L
Sbjct: 196 AAGLVVGDA------NRE-----FRPNSLLSRAELIRMKAPLD---LPPGQIKGSRAELE 241
Query: 462 QLSGFIDIDKINPDAWPALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVAL-AIG 520
+ GF D +I +A L+AD + + FG R F P +PV+ +AA+AL A G
Sbjct: 242 ERWGFTDAAQIPDEAVAPLVADRSLENASTVLRTFGPIRTFNPFEPVSRGEAAIALSAFG 301
Query: 521 E 521
E
Sbjct: 302 E 302
>gi|427728031|ref|YP_007074268.1| putative S-layer protein [Nostoc sp. PCC 7524]
gi|427363950|gb|AFY46671.1| putative S-layer protein [Nostoc sp. PCC 7524]
Length = 284
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 71/274 (25%), Positives = 119/274 (43%), Gaps = 40/274 (14%)
Query: 281 NSSSFTESPPTGS---SSSPAGIPAPSVVSAALQVLPGKVLVP-----------AVVDQV 326
N SS +SP G+ +++ + + VV++ Q+L ++L+ V +
Sbjct: 26 NYSSLAQSPQLGNCIQTTNSLSLSSQKVVTSCQQILSYQLLIAESTITNFTDISGVYGEK 85
Query: 327 QGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAMYIENVTDL 386
+ + L+ L VLK ++ +P R ++ WLV + L R P +N +
Sbjct: 86 EIKQLAELGVLKNTSSEFQPQAPVTRGQFVAWLVKTYNELHRE------PIRLPQNNSS- 138
Query: 387 AFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQDLVSWKMA 446
AF D++ P F+ IQ AG L+ D N F P+ L+R+ ++ K
Sbjct: 139 AFPDVSSSHPHFTFIQAAHNAGF----LAGFDDGN-------FRPDDILTREQMIVLKTN 187
Query: 447 LEK----RQLPEA---NKKILYQLSGFIDIDKINPDAWPALLADLTAGEQGI-IALAFGC 498
+ R P A + + + GF D D+I+ P + DL G A +G
Sbjct: 188 FDSNPRLRNYPNALRDYRNFIGKTRGFTDTDQISDRYVPFIAFDLGNAASGRNFARVYGR 247
Query: 499 TRLFQPDKPVTNAQAAVALAIGEASDAVNEELQR 532
TR++ P K VT A+AAV L+ V + L+R
Sbjct: 248 TRIYAPKKAVTRAEAAVILSRFRKGGTVEQALKR 281
>gi|354552897|ref|ZP_08972204.1| S-layer domain-containing protein [Cyanothece sp. ATCC 51472]
gi|353554727|gb|EHC24116.1| S-layer domain-containing protein [Cyanothece sp. ATCC 51472]
Length = 273
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 29/206 (14%)
Query: 323 VDQVQGQA----LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAM 378
+D V+GQ L L V++ ++ P D R ++ WLV A + L +
Sbjct: 72 IDGVKGQTEIQQLVQLGVIETNSSNFNPLDPITRGQFVTWLVKAYNQLHDVPIP------ 125
Query: 379 YIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQ 438
+ + +LAF D++ PDF+ IQ +AG I + E G F P +PL+R+
Sbjct: 126 -VGSNRELAFSDLSVSHPDFNYIQAAYDAGYI---------VGFEDGT--FQPNNPLTRE 173
Query: 439 DLVSWKMALEKRQLPEANKKILYQLS----GFIDIDKINPDAWPALLADL--TAGEQGII 492
+++ K L+ N L + G+ D+++++ L+ D AG + +
Sbjct: 174 QMIALKSQLDSSGSDSRNADRLRHFASRTMGYTDVEQMSDQYLKYLVFDAWNAAGSKNFV 233
Query: 493 ALAFGCTRLFQPDKPVTNAQAAVALA 518
+ +G TR++ P +PVT A+AA+ L
Sbjct: 234 RV-YGQTRIYSPKRPVTRAEAAILLT 258
>gi|172036094|ref|YP_001802595.1| hypothetical protein cce_1179 [Cyanothece sp. ATCC 51142]
gi|171697548|gb|ACB50529.1| hypothetical protein cce_1179 [Cyanothece sp. ATCC 51142]
Length = 278
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 29/206 (14%)
Query: 323 VDQVQGQA----LSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRSTMSKVYPAM 378
+D V+GQ L L V++ ++ P D R ++ WLV A + L +
Sbjct: 77 IDGVKGQTEIQQLVQLGVIETNSSNFNPLDPITRGQFVTWLVKAYNQLHDVPIP------ 130
Query: 379 YIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFFLPESPLSRQ 438
+ + +LAF D++ PDF+ IQ +AG I + E G F P +PL+R+
Sbjct: 131 -VGSNRELAFSDLSVSHPDFNYIQAAYDAGYI---------VGFEDGT--FQPNNPLTRE 178
Query: 439 DLVSWKMALEKRQLPEANKKILYQLS----GFIDIDKINPDAWPALLADL--TAGEQGII 492
+++ K L+ N L + G+ D+++++ L+ D AG + +
Sbjct: 179 QMIALKSQLDSSGSDSRNADRLRHFASRTMGYTDVEQMSDQYLKYLVFDAWNAAGSKNFV 238
Query: 493 ALAFGCTRLFQPDKPVTNAQAAVALA 518
+ +G TR++ P +PVT A+AA+ L
Sbjct: 239 RV-YGQTRIYSPKRPVTRAEAAILLT 263
>gi|218248080|ref|YP_002373451.1| S-layer protein [Cyanothece sp. PCC 8801]
gi|218168558|gb|ACK67295.1| S-layer domain protein [Cyanothece sp. PCC 8801]
Length = 262
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/214 (24%), Positives = 96/214 (44%), Gaps = 23/214 (10%)
Query: 310 LQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRS 369
L G + + Q Q L+ L V++ +P R ++ WL+ A + L R+
Sbjct: 52 LLAFSGFTDISGIRGSTQIQQLARLGVIETNSNTFRPSQSITRGQFVAWLIKAYNQLHRT 111
Query: 370 TMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFF 429
P + + ++ AF D++P P F IQ EAG L+ E G F
Sbjct: 112 ------PIL-LTSMAVSAFPDVSPSHPYFRYIQSAHEAGF---------LVGFEDGT--F 153
Query: 430 LPESPLSRQDLVSWKMALEK----RQLPEANKKILYQLSGFIDIDKINPDAWPALLADLT 485
P+ PL+R+ +++ K L+ R+ ++ ++ + + GF D +++ + DL
Sbjct: 154 RPDIPLTREQMIALKSPLDSKGSSRRDADSLRQFVTKTMGFTDAEEMGDQYLQYIAFDLG 213
Query: 486 AGEQGI-IALAFGCTRLFQPDKPVTNAQAAVALA 518
G +G TR++ P +PVT +AA+ ++
Sbjct: 214 NAAGGKNFQRVYGNTRIYAPKRPVTREEAAILVS 247
>gi|257060593|ref|YP_003138481.1| S-layer protein [Cyanothece sp. PCC 8802]
gi|256590759|gb|ACV01646.1| S-layer domain protein [Cyanothece sp. PCC 8802]
Length = 262
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/214 (24%), Positives = 93/214 (43%), Gaps = 23/214 (10%)
Query: 310 LQVLPGKVLVPAVVDQVQGQALSALQVLKVIEADVKPGDLCIRREYARWLVSASSTLTRS 369
L G + + Q Q L+ L V++ +P R ++ WL+ A + L R+
Sbjct: 52 LLAFSGFTDISGIRGSTQIQQLARLGVIETNSNTFRPSQSITRGQFVAWLIKAYNQLHRT 111
Query: 370 TMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLNEEPGPIFF 429
P + AF D++P P F IQ EAG L+ E G F
Sbjct: 112 ------PILLTSTAVS-AFPDVSPSHPYFRYIQSAHEAGF---------LVGFEDGT--F 153
Query: 430 LPESPLSRQDLVSWKMALEK----RQLPEANKKILYQLSGFIDIDKINPDAWPALLADLT 485
P+ PL+R+ +++ K L+ R+ ++ ++ + + GF D +++ + DL
Sbjct: 154 RPDIPLTREQMIALKSPLDSKGSSRRDADSLRQFVTKTMGFADAEEMGDQYLQYIAFDLG 213
Query: 486 AGEQGI-IALAFGCTRLFQPDKPVTNAQAAVALA 518
G +G TR++ P +PVT +AA+ ++
Sbjct: 214 NAAGGKNFQRVYGNTRIYAPKRPVTREEAAILVS 247
>gi|359461083|ref|ZP_09249646.1| S-layer region-like protein [Acaryochloris sp. CCMEE 5410]
Length = 318
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 94/219 (42%), Gaps = 23/219 (10%)
Query: 306 VSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEA----DVKPGDLCIRREYARWLVS 361
VS A P V + D ++ L L+++EA + +P + R EY WL
Sbjct: 96 VSEAETTKPAAVAFKDIADLPTQPLIADLIKLEILEAADDQNFQPYEPISRGEYMLWLFK 155
Query: 362 ASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLN 421
A + ++R + + D F DI + P F +Q LA AG + + D
Sbjct: 156 AHNAISRPAQK-----IRLAPTFDPEFTDIDAKHPAFKVVQALANAGY---SVGYDDKT- 206
Query: 422 EEPGPIFFLPESPLSRQDLVSWKMALEK-RQLPEANKKILYQLSGFIDIDKINPDAWPAL 480
F P+ P++R++++S K+ ++K + + + L F DI +I+ +
Sbjct: 207 -------FKPDQPITREEMISIKVGIDKGKSIKPVSTSSLRAAWKFSDIAEIDKRHSGYI 259
Query: 481 LADL-TAGEQGI-IALAFGCTRLFQPDKPVTNAQAAVAL 517
DL T G QG I AFG F+P + +AA L
Sbjct: 260 YNDLFTKGPQGSNIERAFGKIGTFKPKQAAKRHEAAATL 298
>gi|158336909|ref|YP_001518084.1| S-layer protein [Acaryochloris marina MBIC11017]
gi|158307150|gb|ABW28767.1| S-layer region-like protein, putative [Acaryochloris marina
MBIC11017]
Length = 318
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 94/219 (42%), Gaps = 23/219 (10%)
Query: 306 VSAALQVLPGKVLVPAVVDQVQGQALSALQVLKVIEA----DVKPGDLCIRREYARWLVS 361
VS A P V + D ++ L L+++EA + +P + R EY WL
Sbjct: 96 VSEAETTKPAAVAFKDIADLPTQPLIADLIKLEILEATDDQNFQPYESISRGEYMLWLFK 155
Query: 362 ASSTLTRSTMSKVYPAMYIENVTDLAFDDITPEDPDFSSIQGLAEAGLISSKLSHRDLLN 421
A + ++R + + D F DI + P F +Q LA AG + + D
Sbjct: 156 AHNAISRPAQK-----IRLAPTFDPEFTDIDAKHPAFKVVQALANAGY---SVGYDDKT- 206
Query: 422 EEPGPIFFLPESPLSRQDLVSWKMALEK-RQLPEANKKILYQLSGFIDIDKINPDAWPAL 480
F P+ P++R++++S K+ ++K + + + L F DI +I+ +
Sbjct: 207 -------FKPDQPITREEMISIKVGIDKGKSIKPVSASSLRAAWKFSDIAEIDKRHSGYI 259
Query: 481 LADL-TAGEQGI-IALAFGCTRLFQPDKPVTNAQAAVAL 517
DL T G QG I AFG F+P + +AA L
Sbjct: 260 YNDLFTKGPQGSNIERAFGKIGTFKPKQAAKRHEAAATL 298
>gi|38566922|emb|CAE76225.1| related to putative cytoplasmic structural protein [Neurospora
crassa]
Length = 2556
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 89/179 (49%), Gaps = 15/179 (8%)
Query: 529 ELQRIEAESAAENAVSEHSALVAEVEKEINESFEKE---LSMEREKIDVVEKMAEEA-RQ 584
E +R+E E A + V+ A AE EK E E+E L REK + + E+A R+
Sbjct: 1557 ETERLEHEKAEQERVAREKAERAEREKAEREKAEREQVALEKAREKAEQEKAEREKAERE 1616
Query: 585 ELERLRAEREVDKIALMKERAAIE-SEMEILSKLRREVEEQLESLMSNKVEISY-EKERI 642
+ ER R ERE + L +ER A E +E+E + R EE + K E+ E+ERI
Sbjct: 1617 KAERERVEREKAREKLEQERIAREKAELEKAERERIAAEEARKKAELEKAELEKAERERI 1676
Query: 643 --NMLRKEAENENQEIARLQYE-LEVERKALSMARAWAEDE------AKRAREQAKALE 692
RK+AE E E+ + + E E ER A AR AE E +R + Q KAL+
Sbjct: 1677 AAEKARKKAELEKAELEKAELEKAERERVAAEKARKKAEQEKAEQERVEREKAQEKALQ 1735
>gi|164427657|ref|XP_963992.2| hypothetical protein NCU02858 [Neurospora crassa OR74A]
gi|157071832|gb|EAA34756.2| predicted protein [Neurospora crassa OR74A]
Length = 2524
Score = 46.6 bits (109), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 89/179 (49%), Gaps = 15/179 (8%)
Query: 529 ELQRIEAESAAENAVSEHSALVAEVEKEINESFEKE---LSMEREKIDVVEKMAEEA-RQ 584
E +R+E E A + V+ A AE EK E E+E L REK + + E+A R+
Sbjct: 1525 ETERLEHEKAEQERVAREKAERAEREKAEREKAEREQVALEKAREKAEQEKAEREKAERE 1584
Query: 585 ELERLRAEREVDKIALMKERAAIE-SEMEILSKLRREVEEQLESLMSNKVEI-SYEKERI 642
+ ER R ERE + L +ER A E +E+E + R EE + K E+ E+ERI
Sbjct: 1585 KAERERVEREKAREKLEQERIAREKAELEKAERERIAAEEARKKAELEKAELEKAERERI 1644
Query: 643 --NMLRKEAENENQEIARLQYE-LEVERKALSMARAWAEDE------AKRAREQAKALE 692
RK+AE E E+ + + E E ER A AR AE E +R + Q KAL+
Sbjct: 1645 AAEKARKKAELEKAELEKAELEKAERERVAAEKARKKAEQEKAEQERVEREKAQEKALQ 1703
>gi|221052008|ref|XP_002257580.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193807410|emb|CAQ37916.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 2047
Score = 46.2 bits (108), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 71/131 (54%)
Query: 554 EKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEI 613
E+E E EL+ ER+ + ++ EE R + L+ E E ++ + + +E E +
Sbjct: 1104 ERERCSLLEAELTEERDNVTALKTELEEERDNVTTLKTELEGERDNVTALKIELEEERDN 1163
Query: 614 LSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMA 673
++ L+ E+EE+ +++++ K E+ E++ + L+ E E E + L+ ELE ER ++
Sbjct: 1164 VTTLKTELEEERDNVIALKTELEGERDNVTTLKTELEEERDNVIALKTELEGERDNVTTL 1223
Query: 674 RAWAEDEAKRA 684
+ E+E R+
Sbjct: 1224 KTELEEEKGRS 1234
>gi|354481025|ref|XP_003502703.1| PREDICTED: WD repeat-containing protein 65 [Cricetulus griseus]
gi|344252024|gb|EGW08128.1| WD repeat-containing protein 65 [Cricetulus griseus]
Length = 1250
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 80/152 (52%), Gaps = 18/152 (11%)
Query: 554 EKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEI 613
EKE N + E + R+K ++K EE ++E L+AE+ MK + I+S +
Sbjct: 885 EKESNLRLKGETGIMRKKFSSLQKEIEERTNDIENLKAEQ-------MKLQGVIKSLEKD 937
Query: 614 LSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMA 673
+ L+RE++E+ E++ +++RI L+K+ NQE+ + ++ L+ + K L
Sbjct: 938 IQGLKREIQERDETIQD-------KEKRIYDLKKK----NQELEKFKFVLDYKIKELKKQ 986
Query: 674 RAWAEDEAKRAREQAKALEGARDRWERQGIKV 705
E+E K +EQ + +E +R+ +Q ++
Sbjct: 987 IEPRENEIKVMKEQIQEMEAELERFHKQNTQL 1018
>gi|350589140|ref|XP_003130443.3| PREDICTED: centromere protein F [Sus scrofa]
Length = 3070
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 111/220 (50%), Gaps = 32/220 (14%)
Query: 481 LADLTAGEQGIIALAFGCTRL---FQPDKP----VTNAQAAVALAIGEASDAVNEELQRI 533
L + Q ++ + G T L DKP + + A V GE +N ELQRI
Sbjct: 1844 LESFSCDNQRVVERSGGLTSLDLEMGTDKPSCEVIEDDVAKVTDNWGERYFDMNNELQRI 1903
Query: 534 EAESAAENAVSEHSALVAEVEKEI-----------NESFEKELSMEREKIDVVEKMAEEA 582
++E + +EH AL AE + E+ NE+ +K ++ E++ VV + +
Sbjct: 1904 KSEKGS----TEHHALSAEADLEVVQTEKLYLEKDNENKQKVITCLEEELSVVTRERDRL 1959
Query: 583 RQELERLRAE-REVDKIA-LMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEK- 639
R++L+ L E +E+D+++ MKE+ E+E L R ++EQL+ L + +S +
Sbjct: 1960 REDLDTLSKENKELDQLSEKMKEKIG---ELESLQGERLHLQEQLQRLEEDSQALSLVRS 2016
Query: 640 ---ERINMLRKEAENENQEIARLQYEL-EVERKALSMARA 675
+I L KE ++ +E LQ +L E+ER+ L++A+A
Sbjct: 2017 ELENQIGQLNKEKDSLIRESESLQGKLSELEREKLTIAKA 2056
>gi|328863809|gb|EGG12908.1| hypothetical protein MELLADRAFT_58819 [Melampsora larici-populina
98AG31]
Length = 1803
Score = 42.7 bits (99), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 75/145 (51%), Gaps = 8/145 (5%)
Query: 541 NAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIAL 600
N +SE + +E+ +E+ ES E E+ R + +E +AR + R+R+ER V+K +
Sbjct: 1306 NEISE--LIKSEIRREVLESKENEIQELRNVKNELELGLLKARADHGRVRSERAVEKEKI 1363
Query: 601 MKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEA-ENENQEIARL 659
M+E++ +E E+E L K +R E ++E K E+ + R+ + KE+ E E + L
Sbjct: 1364 MEEKSKVERELEELRKKKRMDELEIERFRKEKEEMEGSR-RVQGVEKESVERERDRLKEL 1422
Query: 660 QYELEVERKALSM-ARAWAEDEAKR 683
LE + + R W E KR
Sbjct: 1423 VARLESRCEGFEVKGREW---ETKR 1444
>gi|338721748|ref|XP_001498751.3| PREDICTED: WD repeat-containing protein 65 [Equus caballus]
Length = 1250
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 121/262 (46%), Gaps = 40/262 (15%)
Query: 554 EKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIES-EME 612
EKE N + E + R+K ++K EE ++E L+ E+ +K + I+S E +
Sbjct: 885 EKESNLRLKGETGIMRKKFSSLQKEIEERTNDIESLKGEQ-------VKLQGVIKSLEKD 937
Query: 613 ILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSM 672
ILS L+RE++E+ E++ +++RI L+K+ NQE+ + ++ L+ + K L
Sbjct: 938 ILS-LKREIQERDETIQD-------KEKRIYDLKKK----NQELEKFKFVLDYKIKELKK 985
Query: 673 ARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQ 732
E+E K +EQ + +E +R+ +Q ++ E A +W K + DQ
Sbjct: 986 QIEPRENEIKVMKEQIQEMEAELERFHKQNTQL--------ELHIAELW---QKLKATDQ 1034
Query: 733 TVSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKA-SMRAAELKDAT 791
+ R + L+A+ T +H + +I + K + ++ A
Sbjct: 1035 EMRRERQKERDLEALVRR--------FKTDLHNCVAYIQEPRLLKEKVRGLFEKYVQRAD 1086
Query: 792 ILKAKGSVQELQQSTAEFRSNL 813
+++ G +LQQ A R +L
Sbjct: 1087 MVEIAGLNMDLQQEYARQREHL 1108
>gi|110737251|dbj|BAF00573.1| putative nuclear matrix constituent protein [Arabidopsis thaliana]
Length = 743
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 101/198 (51%), Gaps = 24/198 (12%)
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEK 577
A+ + D VNE+ +EA+ + E EK I ++ EK LS+E++++ ++
Sbjct: 46 AMNKKFDRVNEKEMDLEAKLKT----------IKEREK-IIQAEEKRLSLEKQQLLSDKE 94
Query: 578 MAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
E+ +QE+E++RAE + K+ IE E + L + E EE L + ++++
Sbjct: 95 SLEDLQQEIEKIRAE-------MTKKEEMIEEECKSLEIKKEEREEYLR--LQSELKSQI 145
Query: 638 EKERIN--MLRKEAENENQEIARLQYELEV--ERKALSMARAWAEDEAKRAREQAKALEG 693
EK R++ L KE EN QE R + E E+ E++A+ E K E+ + LEG
Sbjct: 146 EKSRVHEEFLSKEVENLKQEKERFEKEWEILDEKQAVYNKERIRISEEKEKFERFQLLEG 205
Query: 694 ARDRWERQGIKVVVDKDL 711
R + E ++V + ++L
Sbjct: 206 ERLKKEESALRVQIMQEL 223
>gi|426218665|ref|XP_004003562.1| PREDICTED: WD repeat-containing protein 65 [Ovis aries]
Length = 1248
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 120/261 (45%), Gaps = 38/261 (14%)
Query: 554 EKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEI 613
EKE N + E + R+K ++K EE ++E L+ E+ +K + I+S +
Sbjct: 885 EKESNLRLKGETGIMRKKFSSLQKEIEERTNDIELLKGEQ-------VKLQGVIKSLEKD 937
Query: 614 LSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMA 673
+ L+RE++E+ E++ +++RI L+K+ NQE+ + ++ L+ + K L
Sbjct: 938 IMGLKREIQERDETIQD-------KEKRIYDLKKK----NQELEKFKFVLDYKIKELKKQ 986
Query: 674 RAWAEDEAKRAREQAKALEGARDRWERQGIKVVVDKDLREESDAAVMWVNAGKQFSVDQT 733
E+E K +EQ + +E +R+ +Q ++ E + +W K + DQ
Sbjct: 987 IEPRENEIKVMKEQIQEMEAELERFHKQNTQL--------ELNITELW---QKLRATDQE 1035
Query: 734 VSRAQSLVDKLKAMANDVSGKSKEIINTIIHKILLFISNLKKWASKA-SMRAAELKDATI 792
+ R + L+A+ T +H + +I ++ K ++ ++ A +
Sbjct: 1036 MRRERQKERDLEALVKR--------FKTDLHNCVAYIQEPRQLKEKVRALFEKYVQRADM 1087
Query: 793 LKAKGSVQELQQSTAEFRSNL 813
++ G +LQQ A R +L
Sbjct: 1088 VEIAGLNSDLQQEYARQREHL 1108
>gi|124511764|ref|XP_001349015.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
3D7]
gi|23498783|emb|CAD50853.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
3D7]
Length = 2910
Score = 41.6 bits (96), Expect = 1.8, Method: Composition-based stats.
Identities = 57/216 (26%), Positives = 106/216 (49%), Gaps = 25/216 (11%)
Query: 521 EASDAVNEELQRIEAESAAENAVSEHSALVAEVEKE--INESFEKELSMEREKIDVVEKM 578
E +NEE Q I+ E E +++V E+EKE IN +++L E+++ D +
Sbjct: 1941 EEQKKINEE-QYIQLEKDKEII----NSMVVEMEKEKIINNEIKQKLEKEKKQNDQLVIH 1995
Query: 579 AEEARQELERLRAEREVDKIALMKERAAIESEM----EILSKLRREVEE--QLESLMSNK 632
E +Q ++L + +K + +E E EI+ +L++E EE ++ SL+ +
Sbjct: 1996 LENEKQANKKLNILLDQNKKINEELNIQVEQEKLINNEIIVQLKKENEENNKINSLLEEQ 2055
Query: 633 --------VEISYEKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRA 684
+++ EKE L+ + ENE QE L++ELE E+K ++ ++E
Sbjct: 2056 NGLNKKVTLQLEKEKEENGKLKLQLENEKQENGNLRFELENEKKDIANLILQLQEE---- 2111
Query: 685 REQAKALEGARDRWERQGIKVVVDKDLREESDAAVM 720
+E K + D+ + + V+V+ D +E+ VM
Sbjct: 2112 KENTKNVMVQMDKEKEKTKNVMVEMDKEKENTKNVM 2147
>gi|18391490|ref|NP_563924.1| nuclear matrix constituent protein-like protein [Arabidopsis
thaliana]
gi|4850405|gb|AAD31075.1|AC007357_24 Similar to gb|D64087 nuclear matrix constituent protein 1 (NMCP1)
from Daucus carota [Arabidopsis thaliana]
gi|332190866|gb|AEE28987.1| nuclear matrix constituent protein-like protein [Arabidopsis
thaliana]
Length = 1128
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 101/198 (51%), Gaps = 24/198 (12%)
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEK 577
A+ + D VNE+ +EA+ + E EK I ++ EK LS+E++++ ++
Sbjct: 431 AMNKKFDRVNEKEMDLEAKLKT----------IKEREK-IIQAEEKRLSLEKQQLLSDKE 479
Query: 578 MAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISY 637
E+ +QE+E++RAE + K+ IE E + L + E EE L + ++++
Sbjct: 480 SLEDLQQEIEKIRAE-------MTKKEEMIEEECKSLEIKKEEREEYLR--LQSELKSQI 530
Query: 638 EKERIN--MLRKEAENENQEIARLQYELEV--ERKALSMARAWAEDEAKRAREQAKALEG 693
EK R++ L KE EN QE R + E E+ E++A+ E K E+ + LEG
Sbjct: 531 EKSRVHEEFLSKEVENLKQEKERFEKEWEILDEKQAVYNKERIRISEEKEKFERFQLLEG 590
Query: 694 ARDRWERQGIKVVVDKDL 711
R + E ++V + ++L
Sbjct: 591 ERLKKEESALRVQIMQEL 608
>gi|386764405|ref|NP_727769.3| mushroom body defect, isoform H [Drosophila melanogaster]
gi|383293386|gb|AAN09583.3| mushroom body defect, isoform H [Drosophila melanogaster]
Length = 2567
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 131/269 (48%), Gaps = 52/269 (19%)
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKE------INESFEKELSMEREK 571
++ EA ++++LQR E ESA + L E++KE +N +FE + +
Sbjct: 1244 SVIEAQTKLSDDLQR-EKESAQQLV----DNLKVELDKERKELAQVNSAFEAQTKLS--- 1295
Query: 572 IDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSN 631
D +++ E A+Q ++ L+ E + ++ L + +A E++ ++ L+RE +E + L+ N
Sbjct: 1296 -DDLQRQKESAQQLVDNLKVELDKERKELAQVNSAFEAQTKLSDDLQRE-KESAQQLVDN 1353
Query: 632 -KVEISYEKERI--------------NMLRKEAENENQEIARLQYELEVERKALSMARAW 676
KVE+ E++ + + L+++ E+ Q + L+ EL+ ERK L+ ++
Sbjct: 1354 LKVELDKERKELAQVKSVIEAQTKLSDDLQRQKESAQQLVDNLKVELDKERKELAKVKSV 1413
Query: 677 AE------DEAKRAREQAKALEGAR---DRWERQ---------GIKVVVDKDLREESDAA 718
E D+ +R +E A+ LE D +RQ +KV +DK+ R+E
Sbjct: 1414 IEAQTKLSDDLQRQKESAQQLEAQTKLSDDLQRQKESAQQLVDNLKVELDKE-RKELAQV 1472
Query: 719 VMWVNAGKQFSVDQTVSR--AQSLVDKLK 745
+ A + S D + AQ LVD LK
Sbjct: 1473 KSVIEAQTKLSDDLQRQKESAQQLVDNLK 1501
>gi|331231080|ref|XP_003328204.1| hypothetical protein PGTG_09498 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309307194|gb|EFP83785.1| hypothetical protein PGTG_09498 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1275
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 103/215 (47%), Gaps = 35/215 (16%)
Query: 479 ALLADLTAGEQGIIALAFGCTRLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESA 538
AL ADL + + A RL D + N Q A+ + + + ++AV EL+ +
Sbjct: 808 ALQADLKKQNETVAAELNN--RLRDQDVEI-NQQRALRVDLNKENEAVVAELKNRLGDQH 864
Query: 539 AENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKI 598
E ++ H AL A+++KE NE+ ++L +RLR +++V+ I
Sbjct: 865 ME--ITRHRALQADLKKE-NEALAEKLK--------------------DRLR-DQDVEII 900
Query: 599 ALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIAR 658
A A +E E E ++KL+ + E+ E + + ++ EKE + L++ N++ EI
Sbjct: 901 ARWALAADLEKEQERVAKLKNRLRERDEEITGRQADLEKEKEAVAELKQRLRNQDVEITG 960
Query: 659 -----LQYELEVER---KALSMARAWAEDEAKRAR 685
+ Y++EV+R K S+ A +E + AR
Sbjct: 961 RRVELMNYQVEVDRLQTKVKSLESLLATEEKQPAR 995
>gi|84998638|ref|XP_954040.1| hypothetical protein [Theileria annulata]
gi|65305038|emb|CAI73363.1| hypothetical protein, conserved [Theileria annulata]
Length = 602
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 80/182 (43%), Gaps = 11/182 (6%)
Query: 524 DAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKIDVVEKMAEEAR 583
D V +E + +E + NA + AE KE E EL E++ +D + E +
Sbjct: 214 DQVKQEQKNLEEKVNEANAAEQALKATAEDLKEGQE----ELKQEQDNLDQAQDKLESTQ 269
Query: 584 QELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERIN 643
+E+E E AL E +E E E L + + E+E Q L K E+ EK+ ++
Sbjct: 270 KEVEAKEHNLEQTADALKSEANKLEEEKESLDEQKEELENQQNDLNKQKNELESEKKNLD 329
Query: 644 MLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEGARDRWERQGI 703
E +++ Q L+ E+++L + E + K +Q LE +D+ Q
Sbjct: 330 K-------EKEDLTTGQKSLDTEKESLDNEKKDLEQQQKSLDDQQSKLEDQQDKLNDQQE 382
Query: 704 KV 705
K+
Sbjct: 383 KL 384
>gi|386764401|ref|NP_727770.2| mushroom body defect, isoform F [Drosophila melanogaster]
gi|383293384|gb|AAF48362.3| mushroom body defect, isoform F [Drosophila melanogaster]
Length = 2394
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 131/269 (48%), Gaps = 52/269 (19%)
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKE------INESFEKELSMEREK 571
++ EA ++++LQR E ESA + L E++KE +N +FE + +
Sbjct: 1244 SVIEAQTKLSDDLQR-EKESAQQLV----DNLKVELDKERKELAQVNSAFEAQTKLS--- 1295
Query: 572 IDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSN 631
D +++ E A+Q ++ L+ E + ++ L + +A E++ ++ L+RE +E + L+ N
Sbjct: 1296 -DDLQRQKESAQQLVDNLKVELDKERKELAQVNSAFEAQTKLSDDLQRE-KESAQQLVDN 1353
Query: 632 -KVEISYEKERI--------------NMLRKEAENENQEIARLQYELEVERKALSMARAW 676
KVE+ E++ + + L+++ E+ Q + L+ EL+ ERK L+ ++
Sbjct: 1354 LKVELDKERKELAQVKSVIEAQTKLSDDLQRQKESAQQLVDNLKVELDKERKELAKVKSV 1413
Query: 677 AE------DEAKRAREQAKALEGAR---DRWERQ---------GIKVVVDKDLREESDAA 718
E D+ +R +E A+ LE D +RQ +KV +DK+ R+E
Sbjct: 1414 IEAQTKLSDDLQRQKESAQQLEAQTKLSDDLQRQKESAQQLVDNLKVELDKE-RKELAQV 1472
Query: 719 VMWVNAGKQFSVDQTVSR--AQSLVDKLK 745
+ A + S D + AQ LVD LK
Sbjct: 1473 KSVIEAQTKLSDDLQRQKESAQQLVDNLK 1501
>gi|47228073|emb|CAF97702.1| unnamed protein product [Tetraodon nigroviridis]
Length = 750
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 77/163 (47%), Gaps = 24/163 (14%)
Query: 519 IGEASDAVNEE------LQRIEAESAAENAVSEHSALVAEVEKEINESFEKELSMEREKI 572
I E A+ EE LQR E E A E E + + E+E E+EL++ERE
Sbjct: 147 ITEHQQAMEEEREKSLALQRAEMERALERERIEKQQAIEKEEQEKARQRERELALERE-- 204
Query: 573 DVVEKMAEEARQELE-RLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSN 631
RQE E L ERE ++AL +ERA E+E +L R+ E+ L
Sbjct: 205 ----------RQERELALVKEREEQELALARERAL---ELERKRELERQELERQRELERQ 251
Query: 632 KVEISYEKERINMLRKEAENENQEIAR--LQYELEVERKALSM 672
++E E ER + R+ +E+ R L+ + E+ER+ L M
Sbjct: 252 ELERQRELERQELERQRELERQRELERQELERQRELERQKLEM 294
>gi|449710630|gb|EMD49671.1| GRIP domain containing protein RUD3 [Entamoeba histolytica KU27]
Length = 695
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 76/156 (48%), Gaps = 18/156 (11%)
Query: 533 IEAESAAENAVSEHSALVAEVEKEINESFE--KELSMEREKIDVVEKMAE--------EA 582
IE A++ ++ L E++K +NE E K+ S++ E++ ++ +E E
Sbjct: 364 IECRKQCATAINTNAGLNDEIKK-LNEQLEEEKKKSVDYEQLKQKQEDSEKQYSQSLTEK 422
Query: 583 RQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
+E+ER +AE E K + ++A IE + R E+E Q + S K EI +K I
Sbjct: 423 EKEIERQKAEIESQKAEIESQKAEIERQ-------RNEIESQKAEIESQKAEIESQKAEI 475
Query: 643 NMLRKEAENENQEIARLQYELEVERKALSMARAWAE 678
+ E E + EI R + E+E +R + +A E
Sbjct: 476 ERQKAEIERQKAEIERQRNEIESQRNEIERQKAEIE 511
>gi|386764409|ref|NP_001245666.1| mushroom body defect, isoform J [Drosophila melanogaster]
gi|383293388|gb|AFH07380.1| mushroom body defect, isoform J [Drosophila melanogaster]
Length = 2165
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 131/269 (48%), Gaps = 52/269 (19%)
Query: 518 AIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKE------INESFEKELSMEREK 571
++ EA ++++LQR E ESA + L E++KE +N +FE + +
Sbjct: 1244 SVIEAQTKLSDDLQR-EKESAQQLV----DNLKVELDKERKELAQVNSAFEAQTKLS--- 1295
Query: 572 IDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSN 631
D +++ E A+Q ++ L+ E + ++ L + +A E++ ++ L+RE +E + L+ N
Sbjct: 1296 -DDLQRQKESAQQLVDNLKVELDKERKELAQVNSAFEAQTKLSDDLQRE-KESAQQLVDN 1353
Query: 632 -KVEISYEKERI--------------NMLRKEAENENQEIARLQYELEVERKALSMARAW 676
KVE+ E++ + + L+++ E+ Q + L+ EL+ ERK L+ ++
Sbjct: 1354 LKVELDKERKELAQVKSVIEAQTKLSDDLQRQKESAQQLVDNLKVELDKERKELAKVKSV 1413
Query: 677 AE------DEAKRAREQAKALEGAR---DRWERQ---------GIKVVVDKDLREESDAA 718
E D+ +R +E A+ LE D +RQ +KV +DK+ R+E
Sbjct: 1414 IEAQTKLSDDLQRQKESAQQLEAQTKLSDDLQRQKESAQQLVDNLKVELDKE-RKELAQV 1472
Query: 719 VMWVNAGKQFSVDQTVSR--AQSLVDKLK 745
+ A + S D + AQ LVD LK
Sbjct: 1473 KSVIEAQTKLSDDLQRQKESAQQLVDNLK 1501
>gi|426384023|ref|XP_004058576.1| PREDICTED: centrobin isoform 2 [Gorilla gorilla gorilla]
Length = 925
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 100/207 (48%), Gaps = 16/207 (7%)
Query: 512 QAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEK-ELSMERE 570
Q AVA+A D + E+L + A + H A EV + + E + EL+ ++
Sbjct: 217 QLAVAVAADRKKDTMIEQLDKTLARVV--EGWNRHEAERTEVLRGLQEEHQAAELTRSKQ 274
Query: 571 KIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMS 630
+ + V ++ + + +E L E+E ++ +ER +E E + L+ LR E E+Q ++
Sbjct: 275 Q-ETVTRLEQSLSEAMEALNREQESARLQ-QQERETLEEERQTLT-LRLEAEQQRCCVLQ 331
Query: 631 NKVEISY-----EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAR 685
+ ++++ E + LR E E Q A+ +++L+ +AL E +A+ R
Sbjct: 332 EERDVAWAGQLSEHRELETLRAALEEERQTWAQQEHQLKEHYQALQ-----EESQAQLER 386
Query: 686 EQAKALEGARDRWERQGIKVVVDKDLR 712
E+ K+ A+ WE Q +V ++R
Sbjct: 387 EKEKSQREAQAAWEAQHQLALVQSEVR 413
>gi|67477833|ref|XP_654352.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56471392|gb|EAL48964.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 695
Score = 39.7 bits (91), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 76/156 (48%), Gaps = 18/156 (11%)
Query: 533 IEAESAAENAVSEHSALVAEVEKEINESFE--KELSMEREKIDVVEKMAE--------EA 582
IE A++ ++ L E++K +NE E K+ S++ E++ ++ +E E
Sbjct: 364 IECRKQCATAINTNAGLNDEIKK-LNEQLEEEKKKSVDYEQLKQKQEDSEKQYSQSLTEK 422
Query: 583 RQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
+E+ER +AE E K + ++A IE + R E+E Q + S K EI +K I
Sbjct: 423 EKEIERQKAEIESQKAEIESQKAEIERQ-------RNEIESQKAEIESQKAEIESQKAEI 475
Query: 643 NMLRKEAENENQEIARLQYELEVERKALSMARAWAE 678
+ E E + EI R + E+E +R + +A E
Sbjct: 476 ESQKAEIERQKAEIERQRNEIESQRNEIERQKAEIE 511
>gi|328872521|gb|EGG20888.1| SNF2-related domain-containing protein [Dictyostelium fasciculatum]
Length = 2077
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 81/152 (53%), Gaps = 9/152 (5%)
Query: 554 EKEINESFEKELSMEREKIDVVEK--MAEEARQELERLRAERE-VDKIALMKERAAIESE 610
EKE E+ E E +E+EK D +EK + +E ++LE+ + E+E ++K L KER E
Sbjct: 596 EKEEQEARENEAKLEKEKHDQLEKERLEKERLEQLEKEKLEQERLEKERLEKERLEKE-R 654
Query: 611 MEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAEN-ENQEIARLQYEL--EVER 667
+E L K R E +E+LE L ++E EKERI R E E E E RL+ E ++E+
Sbjct: 655 LEQLEKERLE-KERLEQLEKERLE-QLEKERIENERLEKEKLERLEKERLEKERLEQLEK 712
Query: 668 KALSMARAWAEDEAKRAREQAKALEGARDRWE 699
+ L R E +A+ R E R R E
Sbjct: 713 ERLENERIANEKKAEEERIVKGREEKERKRLE 744
>gi|426384021|ref|XP_004058575.1| PREDICTED: centrobin isoform 1 [Gorilla gorilla gorilla]
Length = 903
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 100/207 (48%), Gaps = 16/207 (7%)
Query: 512 QAAVALAIGEASDAVNEELQRIEAESAAENAVSEHSALVAEVEKEINESFEK-ELSMERE 570
Q AVA+A D + E+L + A + H A EV + + E + EL+ ++
Sbjct: 217 QLAVAVAADRKKDTMIEQLDKTLARVV--EGWNRHEAERTEVLRGLQEEHQAAELTRSKQ 274
Query: 571 KIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIESEMEILSKLRREVEEQLESLMS 630
+ + V ++ + + +E L E+E ++ +ER +E E + L+ LR E E+Q ++
Sbjct: 275 Q-ETVTRLEQSLSEAMEALNREQESARLQ-QQERETLEEERQTLT-LRLEAEQQRCCVLQ 331
Query: 631 NKVEISY-----EKERINMLRKEAENENQEIARLQYELEVERKALSMARAWAEDEAKRAR 685
+ ++++ E + LR E E Q A+ +++L+ +AL E +A+ R
Sbjct: 332 EERDVAWAGQLSEHRELETLRAALEEERQTWAQQEHQLKEHYQALQ-----EESQAQLER 386
Query: 686 EQAKALEGARDRWERQGIKVVVDKDLR 712
E+ K+ A+ WE Q +V ++R
Sbjct: 387 EKEKSQREAQAAWEAQHQLALVQSEVR 413
>gi|406862699|gb|EKD15748.1| Autophagy-related protein 11 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1379
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 82/171 (47%), Gaps = 16/171 (9%)
Query: 549 LVAEVEKEINESFEKELSMEREKIDVVEKMAEEARQELERLRAEREVDKIALMKERAAIE 608
L+AE E+ FEKE+S + D ++ + EEA L + + ++ER ++E
Sbjct: 646 LIAERERAAG--FEKEVSARKTAADAMKSLVEEANSTKTDLMENFDAQQREFIEERKSLE 703
Query: 609 SEMEILSKLRREVEEQLESLMSNKVEISYEKERINMLRKEAENENQEIARLQYELEVERK 668
SE++ L E+E++++ + S E ER ++ + + +LQ ELE RK
Sbjct: 704 SEIKRLKAKLEELEDEMDRYLG-----SRENERTSI--------DDRVKQLQEELEQVRK 750
Query: 669 ALSMARAWAEDEAKRAREQAKALEGARDRWERQGIKVVVD-KDLREESDAA 718
+ A+ + + R+QAK + E Q ++ D KDL ++AA
Sbjct: 751 EATAESQKAQGQVEYLRDQAKMQRETNEALEAQMHRLRQDNKDLSTRAEAA 801
>gi|401826058|ref|XP_003887123.1| myosin heavy chain [Encephalitozoon hellem ATCC 50504]
gi|392998281|gb|AFM98142.1| myosin heavy chain [Encephalitozoon hellem ATCC 50504]
Length = 1678
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 85/156 (54%), Gaps = 17/156 (10%)
Query: 500 RLFQPDKPVTNAQAAVALAIGEASDAVNEELQRIEAESA----AENAVSEHSALVAEVEK 555
RL+Q KP+ + + + E + E ++ ++AE AE + E + +EK
Sbjct: 807 RLYQKIKPLLDVRKRDN-EMKEKEAMIQEYIRMLDAEKGRREEAEEMLKEVNLKKEALEK 865
Query: 556 EINESFEKELSMEREKIDVVEKM-AEEARQELERLRAER----EVDKIA--LMKERAAI- 607
+ + EK SME++++ + + A+E QELER+R E+ E K+A +KE A +
Sbjct: 866 CVKD--EKRFSMEKDELLMALRYKADEMGQELERIRKEKGSIYEEKKVAEARLKESACVL 923
Query: 608 -ESEMEILSKLRREVEEQLESLMSNKVEISYEKERI 642
E E EI SKL++EVEEQ ++ ++ EIS +E I
Sbjct: 924 EERESEI-SKLKKEVEEQGNVILLHEGEISSLREEI 958
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.307 0.124 0.330
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,059,898,552
Number of Sequences: 23463169
Number of extensions: 508313861
Number of successful extensions: 3164357
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3591
Number of HSP's successfully gapped in prelim test: 63597
Number of HSP's that attempted gapping in prelim test: 2597900
Number of HSP's gapped (non-prelim): 338821
length of query: 837
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 686
effective length of database: 8,816,256,848
effective search space: 6047952197728
effective search space used: 6047952197728
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 82 (36.2 bits)