BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015921
(398 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225454022|ref|XP_002281030.1| PREDICTED: uncharacterized protein LOC100259142 [Vitis vinifera]
gi|296089202|emb|CBI38905.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 298/388 (76%), Positives = 332/388 (85%), Gaps = 4/388 (1%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLP 74
MR C G RR L+ LP V F+PY LSVLELH+ S +E +K+ +KFDHL+LGPAAGQ L
Sbjct: 1 MRVCSGWRRRLYCLPFVLFIPYFLSVLELHQSSTIEGSQKKHSKKFDHLVLGPAAGQGLH 60
Query: 75 NRLQCQGLKALNKTSFQASS----IGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNAS 130
+RLQCQG KALNKT SS G SI+ +TVFTIYN+SL +H D R+S++VTVGNAS
Sbjct: 61 DRLQCQGTKALNKTHIATSSHESNFGESIALITVFTIYNSSLALHADGRSSDLVTVGNAS 120
Query: 131 YSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLML 190
YSK ERSMAILNVFINFIQ TMP+S+V ILTDPAS+ S+ R VTIYPI GEYSRDKLML
Sbjct: 121 YSKMERSMAILNVFINFIQATMPQSNVIILTDPASEFSLHRDRVTIYPIQGEYSRDKLML 180
Query: 191 QRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNK 250
QRIRSYI FLE ++ EHSQG GHINHY+FTDSDIAVVDDLG IF + NFH+ALTFRNNK
Sbjct: 181 QRIRSYIVFLETKLEEHSQGHGHINHYIFTDSDIAVVDDLGQIFQSHPNFHVALTFRNNK 240
Query: 251 DQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFD 310
+QPLNSGFIAVRGTPDGI RAK+FL+EVL+VYSS++MNASRMLGDQLALAWVVKSHP FD
Sbjct: 241 EQPLNSGFIAVRGTPDGILRAKLFLQEVLKVYSSRFMNASRMLGDQLALAWVVKSHPYFD 300
Query: 311 ARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE 370
+RF+K Q F+EDI G SVLFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE
Sbjct: 301 TKRFSKPQAFLEDIGGTSVLFLPCAIYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE 360
Query: 371 SWNFFSSSSDISDMLCLILMSGRTKYDF 398
SWNFF SSSDISDMLCLILMSGRTKYDF
Sbjct: 361 SWNFFISSSDISDMLCLILMSGRTKYDF 388
>gi|224129974|ref|XP_002320717.1| predicted protein [Populus trichocarpa]
gi|222861490|gb|EEE99032.1| predicted protein [Populus trichocarpa]
Length = 369
Score = 590 bits (1521), Expect = e-166, Method: Compositional matrix adjust.
Identities = 285/367 (77%), Positives = 320/367 (87%), Gaps = 5/367 (1%)
Query: 34 LPYL-LSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQGLKALNKTSFQA 92
+P+L SVLELH+ + P+K KFDHL+LGPAAGQ LPNRLQCQG KALNKT ++
Sbjct: 6 IPFLSFSVLELHQNPAAQPPPKKMNTKFDHLVLGPAAGQGLPNRLQCQGTKALNKTHTRS 65
Query: 93 SS-IGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVT 151
SS G S+SFVTVFT+YNTSL DSR SN VTVGNASY+K ERSMA+LNVF+NFI+VT
Sbjct: 66 SSNAGESVSFVTVFTVYNTSL---ADSRLSNFVTVGNASYTKMERSMAVLNVFVNFIKVT 122
Query: 152 MPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQ 211
MP+S+V ILTDPASDLS+ VT+YPI G+YSRDKLMLQRIRSYITFLE R+ E +Q
Sbjct: 123 MPRSNVVILTDPASDLSLFGNSVTVYPIQGDYSRDKLMLQRIRSYITFLETRLEELAQNP 182
Query: 212 GHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRA 271
GHINHY+FTDSDIAVVDDLGH+F+D+ NFHLALTFRNNK+QPLNSGFIAVRGT D I RA
Sbjct: 183 GHINHYIFTDSDIAVVDDLGHLFNDHPNFHLALTFRNNKEQPLNSGFIAVRGTTDAILRA 242
Query: 272 KIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLF 331
KIFL+EVL+VYSSK+M+ASRMLGDQLALAW +KSHP FD RRFTKAQ F+E+I GASVLF
Sbjct: 243 KIFLQEVLKVYSSKFMSASRMLGDQLALAWAIKSHPGFDLRRFTKAQAFLENIGGASVLF 302
Query: 332 LPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMS 391
LPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF SSSSDI DMLCL+L+S
Sbjct: 303 LPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFLSSSSDIFDMLCLVLLS 362
Query: 392 GRTKYDF 398
GRTKYDF
Sbjct: 363 GRTKYDF 369
>gi|255562808|ref|XP_002522409.1| conserved hypothetical protein [Ricinus communis]
gi|223538294|gb|EEF39901.1| conserved hypothetical protein [Ricinus communis]
Length = 388
Score = 587 bits (1514), Expect = e-165, Method: Compositional matrix adjust.
Identities = 290/391 (74%), Positives = 333/391 (85%), Gaps = 10/391 (2%)
Query: 15 MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNR-QKFDHLILGPAAGQ 71
MR G RRF+ FFL LV F ++ SVLELH SV E P+KNR +K DHL+LGPAAGQ
Sbjct: 1 MRTWSGWRRFILCFFLLLVIF--HIFSVLELHSNSVTE-APQKNRNKKSDHLVLGPAAGQ 57
Query: 72 RLPNRLQCQGLKALNKT----SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVG 127
LP+RLQC+G KALNKT S S++G++++FVTVFTIYNTSLD D R+SN+VTVG
Sbjct: 58 GLPDRLQCEGSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSIPDDRSSNLVTVG 117
Query: 128 NASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDK 187
N SYSK ERSMAILNVFINFIQVTMP+S+V ILTDPASDLS+ R VT+YPI GEYSR+K
Sbjct: 118 NVSYSKMERSMAILNVFINFIQVTMPRSNVIILTDPASDLSLQRYKVTLYPIQGEYSREK 177
Query: 188 LMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFR 247
LMLQRI+SYI FL+ +++E ++ H +HY+FTDSDIAVVDDLG IFH+Y NFH+ALTFR
Sbjct: 178 LMLQRIKSYINFLDMKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYPNFHIALTFR 237
Query: 248 NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHP 307
NNK+QPLNSGFIAVRGT + I RAKIFL+ VL VY+SKYMNASRMLGDQLALAWV++SHP
Sbjct: 238 NNKEQPLNSGFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASRMLGDQLALAWVIRSHP 297
Query: 308 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 367
FD RRF KAQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL
Sbjct: 298 GFDLRRFRKAQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 357
Query: 368 MLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
MLESWNFF S+SDISDMLCLILMSGRTKYDF
Sbjct: 358 MLESWNFFRSASDISDMLCLILMSGRTKYDF 388
>gi|255541282|ref|XP_002511705.1| conserved hypothetical protein [Ricinus communis]
gi|223548885|gb|EEF50374.1| conserved hypothetical protein [Ricinus communis]
Length = 388
Score = 584 bits (1506), Expect = e-164, Method: Compositional matrix adjust.
Identities = 288/391 (73%), Positives = 333/391 (85%), Gaps = 10/391 (2%)
Query: 15 MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNR-QKFDHLILGPAAGQ 71
MR G RRF+ FFL LV F ++ SVLELH SV E P+KNR +K DHL+LGPAAGQ
Sbjct: 1 MRTWSGWRRFILSFFLLLVIF--HIFSVLELHSNSVTE-APQKNRNKKSDHLVLGPAAGQ 57
Query: 72 RLPNRLQCQGLKALNKT----SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVG 127
LP+RLQC+G KALNKT S S++G++++FVTVFTIYNTSLD + R+SN+VTVG
Sbjct: 58 GLPDRLQCEGSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSLPNDRSSNLVTVG 117
Query: 128 NASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDK 187
N SYSKTERSMAILNVFINFIQVTMP+S+V ILTDPASDL + R VT+YPI GEYSR+K
Sbjct: 118 NVSYSKTERSMAILNVFINFIQVTMPQSNVIILTDPASDLLLQRDKVTLYPIQGEYSREK 177
Query: 188 LMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFR 247
LMLQRIRSYI FL+ +++E ++ H +HY+FTDSDIAVVDDLG IFH+Y+NFH+ALTFR
Sbjct: 178 LMLQRIRSYINFLDTKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYRNFHIALTFR 237
Query: 248 NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHP 307
NNK+QPLNSGFIAVRGT + I RAKIFL+ VL VY+SKYMNAS+MLGDQLALAWV++SHP
Sbjct: 238 NNKEQPLNSGFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASQMLGDQLALAWVIRSHP 297
Query: 308 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 367
FD RF KAQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL
Sbjct: 298 GFDLWRFRKAQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 357
Query: 368 MLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
MLESWNFF S+SDISDMLCLILMSGRTKYDF
Sbjct: 358 MLESWNFFRSASDISDMLCLILMSGRTKYDF 388
>gi|356545145|ref|XP_003541005.1| PREDICTED: uncharacterized protein LOC100785469 [Glycine max]
Length = 432
Score = 553 bits (1424), Expect = e-155, Method: Compositional matrix adjust.
Identities = 273/386 (70%), Positives = 316/386 (81%), Gaps = 4/386 (1%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLP 74
M+ G RF+F LPL+F L +L SV ELH S +E+ ++ +K DHL+LGPAAGQ L
Sbjct: 49 MKIFSGWHRFVFGLPLIFLLTHLFSVRELHTNSKMEEPRKQLNKKLDHLVLGPAAGQGLS 108
Query: 75 NRLQCQGLKALNK--TSFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYS 132
NRLQCQG K+LN+ +S S + SI+FVTVFTIYN+SL+ VD ++ N + VGNASY+
Sbjct: 109 NRLQCQGTKSLNRIHSSNSRSGVDGSITFVTVFTIYNSSLN-DVDDKSLNTI-VGNASYN 166
Query: 133 KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR 192
K RSMA+LNVFINFIQV M +S V ILTDP SDLS+ R GV++YPI GEYSRDKLMLQR
Sbjct: 167 KFGRSMALLNVFINFIQVAMRQSKVIILTDPVSDLSVQRNGVSLYPIEGEYSRDKLMLQR 226
Query: 193 IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ 252
IRSYITFLE R++ SQ +I HY+FTDSD+AVVDDLG IFHD+ NFH+ALTFRNNK Q
Sbjct: 227 IRSYITFLETRLQNLSQKPKNITHYIFTDSDMAVVDDLGQIFHDHPNFHVALTFRNNKAQ 286
Query: 253 PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDAR 312
PLNSGFIAVRGTP+ I RAK+FL+EVL+VY++KY NASRMLGDQLALAWVVKS P FDA
Sbjct: 287 PLNSGFIAVRGTPEAILRAKLFLQEVLKVYTTKYKNASRMLGDQLALAWVVKSKPHFDAS 346
Query: 313 RFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESW 372
RF KA F EDI G SVLFLPC+ YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESW
Sbjct: 347 RFAKAPAFSEDIGGTSVLFLPCSLYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESW 406
Query: 373 NFFSSSSDISDMLCLILMSGRTKYDF 398
NF+SSS ++SDMLCLIL SGRTKYDF
Sbjct: 407 NFYSSSLEVSDMLCLILGSGRTKYDF 432
>gi|449432261|ref|XP_004133918.1| PREDICTED: uncharacterized protein LOC101215082 [Cucumis sativus]
gi|449480062|ref|XP_004155788.1| PREDICTED: uncharacterized protein LOC101230110 [Cucumis sativus]
Length = 387
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 256/350 (73%), Positives = 292/350 (83%), Gaps = 4/350 (1%)
Query: 53 PRKNRQKFDHLILGPAAGQRLPNRLQCQGLKALNKTSF----QASSIGNSISFVTVFTIY 108
P K +KFDHLILGPA GQ L +RLQC G KALN T ++ G+SI FVTVFTIY
Sbjct: 38 PDKRSKKFDHLILGPATGQGLSDRLQCSGTKALNNTHLPDTSNSADSGDSIHFVTVFTIY 97
Query: 109 NTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLS 168
N S D V R++++V VG+ASY+K ERSMA+LNVFINFIQV+MP+S+V ILTDPASDL
Sbjct: 98 NASQDSKVIGRSTDVVKVGDASYNKVERSMAVLNVFINFIQVSMPQSNVVILTDPASDLP 157
Query: 169 MPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVD 228
+ R V ++PI GEYSRD LMLQRIRSYI+FL+ ++ E QG HINHY+FTDSD+AVV
Sbjct: 158 VRRNRVAVFPIQGEYSRDTLMLQRIRSYISFLDAKLDEQRQGTTHINHYIFTDSDMAVVG 217
Query: 229 DLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMN 288
DLG IFH + FHLALTFRNNK QPLNSGFIAVRGT DGI RAK FLEEVL++YSS++M
Sbjct: 218 DLGEIFHKHPKFHLALTFRNNKAQPLNSGFIAVRGTEDGIRRAKTFLEEVLKIYSSRFMK 277
Query: 289 ASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQF 348
ASRMLGDQLALAWVV+S+PSFDAR+F+K + FVE+I GASVLFLPCA YNWTPPEGAGQF
Sbjct: 278 ASRMLGDQLALAWVVRSNPSFDARKFSKPETFVEEINGASVLFLPCALYNWTPPEGAGQF 337
Query: 349 HGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
HGMPL+VKVVHFKGSRKRLMLESWNFF SSS ISDMLCLIL SGRTKYDF
Sbjct: 338 HGMPLNVKVVHFKGSRKRLMLESWNFFQSSSSISDMLCLILSSGRTKYDF 387
>gi|357471691|ref|XP_003606130.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
gi|355507185|gb|AES88327.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
Length = 350
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 258/343 (75%), Positives = 290/343 (84%), Gaps = 4/343 (1%)
Query: 58 QKFDHLILGPAAGQRLPNRLQCQGLKALNKTSFQASSIG--NSISFVTVFTIYNTSLDVH 115
+KFDHL+LGPAAGQ L NRLQCQG KALN+T G SI+FVTVFTIYN+SL+
Sbjct: 10 KKFDHLVLGPAAGQGLSNRLQCQGSKALNRTHSSNGRFGVDGSITFVTVFTIYNSSLN-R 68
Query: 116 VDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVT 175
VD ++SN VGNASY+K ERSMA+LNVFI+FIQV MP+S+V ILTDP SDLS+ R V+
Sbjct: 69 VDDKSSNTF-VGNASYNKVERSMAVLNVFIDFIQVVMPQSEVIILTDPVSDLSVHRNRVS 127
Query: 176 IYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFH 235
+YPI GEYSRDKLMLQRIRSYITFLE R+++ SQ I HY+FTDSDIAVVDDLG IF
Sbjct: 128 LYPIQGEYSRDKLMLQRIRSYITFLETRLQKLSQNPKDITHYIFTDSDIAVVDDLGQIFR 187
Query: 236 DYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGD 295
D+ NFH+ALTFRNNK QPLNSGFIAV+GTPDGI RAK+FL+EVL+VY SKYM+ASRMLGD
Sbjct: 188 DHPNFHMALTFRNNKAQPLNSGFIAVKGTPDGILRAKLFLQEVLKVYVSKYMSASRMLGD 247
Query: 296 QLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDV 355
QLALAWVVKS P FDA RF K F +DI G S+LFLPCA YNWTPPEGAGQFHGMPLDV
Sbjct: 248 QLALAWVVKSKPQFDASRFAKTVAFSDDIGGTSILFLPCALYNWTPPEGAGQFHGMPLDV 307
Query: 356 KVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
KVVHFKGSRKRLMLESWNF+SS+ DI+DMLCLIL SGRTKYDF
Sbjct: 308 KVVHFKGSRKRLMLESWNFYSSTPDIADMLCLILGSGRTKYDF 350
>gi|356517294|ref|XP_003527323.1| PREDICTED: uncharacterized protein LOC100794487 [Glycine max]
Length = 352
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 260/354 (73%), Positives = 296/354 (83%), Gaps = 6/354 (1%)
Query: 49 VEDLPRKN-RQKFDHLILGPAAGQRLPNRLQCQGLKALNK--TSFQASSIGNSISFVTVF 105
+E+ PRK +K +HL+LGPAAGQ L NRLQCQG KALN+ +S S + SI+FVTVF
Sbjct: 1 MEEPPRKQLNKKLNHLVLGPAAGQGLSNRLQCQGTKALNRIHSSNSRSGVDGSITFVTVF 60
Query: 106 TIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPAS 165
TIYN+SL+ VD ++SN V VGNASY+K RS A+LNVFINFIQV MP+S V ILTDP S
Sbjct: 61 TIYNSSLN-DVDDKSSNTV-VGNASYNKFGRSTALLNVFINFIQVAMPQSKVIILTDPVS 118
Query: 166 DLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQ-GHINHYVFTDSDI 224
DLS+ R GV++YPI GEYSRDKLMLQRIRSYITFLE R++ SQ + +I HY+FTDSDI
Sbjct: 119 DLSVLRNGVSLYPIEGEYSRDKLMLQRIRSYITFLETRLQNLSQKKPKNITHYIFTDSDI 178
Query: 225 AVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSS 284
AVVDDLG IF D+ NFH+ALTFRNNK QPLNSGFIAVRGTP+ I RAK+FL+EVL+VYS+
Sbjct: 179 AVVDDLGQIFRDHPNFHVALTFRNNKAQPLNSGFIAVRGTPEAILRAKLFLQEVLKVYST 238
Query: 285 KYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEG 344
KY NASRMLGDQLALAWVVKS P FDA RF KA F EDI G SV+FLPC+ YNWTPPEG
Sbjct: 239 KYRNASRMLGDQLALAWVVKSKPHFDASRFGKALAFSEDIGGTSVVFLPCSLYNWTPPEG 298
Query: 345 AGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
AGQFHGMPLD KVVHFKGSRKRLMLESWNF+SSS ++SDMLCLIL SGRTKYDF
Sbjct: 299 AGQFHGMPLDAKVVHFKGSRKRLMLESWNFYSSSLEVSDMLCLILGSGRTKYDF 352
>gi|297828243|ref|XP_002882004.1| hypothetical protein ARALYDRAFT_903971 [Arabidopsis lyrata subsp.
lyrata]
gi|297327843|gb|EFH58263.1| hypothetical protein ARALYDRAFT_903971 [Arabidopsis lyrata subsp.
lyrata]
Length = 391
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 260/388 (67%), Positives = 309/388 (79%), Gaps = 10/388 (2%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+L + S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLLGISSDSAKRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQGLKALNKT--SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTV-GNAS 130
+RL C+G KALNKT S S GN +SFVTVFT+YNTSL ++++SNMV+V GN +
Sbjct: 70 SDRLHCRGTKALNKTHGSSHVSGAGNGVSFVTVFTVYNTSLG---NAKSSNMVSVVGNVT 126
Query: 131 YSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLML 190
YSK ERSMA+LN F FIQVTMPKS+V ILTDPASDLS+ + V + P+ G+YSR LML
Sbjct: 127 YSKPERSMAVLNAFAYFIQVTMPKSNVVILTDPASDLSIQQSNVMVQPVQGDYSRGNLML 186
Query: 191 QRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNK 250
QRIRSYITFLE ++ ++ +G INHY+FTDSDIAVVDD+ IF + +FHLALTFRNNK
Sbjct: 187 QRIRSYITFLEMKLEKN---EGGINHYIFTDSDIAVVDDIRAIFDKHPSFHLALTFRNNK 243
Query: 251 DQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFD 310
DQPLNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL WVVKSHPSFD
Sbjct: 244 DQPLNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVWVVKSHPSFD 303
Query: 311 ARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE 370
A+RFTK Q F ++I GASVLFLPC YNWTPPEGAGQFHGMPLDVK+VHFKGSRKRLMLE
Sbjct: 304 AKRFTKPQAFTQEIAGASVLFLPCVLYNWTPPEGAGQFHGMPLDVKIVHFKGSRKRLMLE 363
Query: 371 SWNFFSSSSDISDMLCLILMSGRTKYDF 398
+WNF+ S+S+I DMLCL+L SGRTKYDF
Sbjct: 364 AWNFYKSTSNIPDMLCLVLGSGRTKYDF 391
>gi|30689992|ref|NP_850432.1| uncharacterized protein [Arabidopsis thaliana]
gi|330255444|gb|AEC10538.1| uncharacterized protein [Arabidopsis thaliana]
Length = 392
Score = 509 bits (1311), Expect = e-142, Method: Compositional matrix adjust.
Identities = 258/389 (66%), Positives = 307/389 (78%), Gaps = 11/389 (2%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+++ S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQGLKALNKT---SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTV-GNA 129
+R C+G KALNKT + S GN +SFVTVFT+YNTSL + ++SN V+V GN
Sbjct: 70 SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLG---NVKSSNPVSVVGNV 126
Query: 130 SYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLM 189
+YSK ERSMA+LN F NFIQVTMPKS+V ILTDPASDLS+ + V + P+ G+YSR LM
Sbjct: 127 TYSKPERSMAVLNAFANFIQVTMPKSNVVILTDPASDLSIQQSNVILQPVQGDYSRGNLM 186
Query: 190 LQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN 249
LQRIRSYITFLE ++ ++ +G INHY+FTDSDIAVVDD+G IF + +FHLALTFRNN
Sbjct: 187 LQRIRSYITFLEMKLEKN---EGGINHYIFTDSDIAVVDDVGTIFDKHSSFHLALTFRNN 243
Query: 250 KDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSF 309
KDQPLNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL VVKSH SF
Sbjct: 244 KDQPLNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVSVVKSHASF 303
Query: 310 DARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLML 369
DA+RFTK Q F E+I GASVLFLPCA YNWTPPEGAGQFHGMPLDVK+VHFKGSRKRLML
Sbjct: 304 DAKRFTKPQAFTEEIAGASVLFLPCALYNWTPPEGAGQFHGMPLDVKIVHFKGSRKRLML 363
Query: 370 ESWNFFSSSSDISDMLCLILMSGRTKYDF 398
E+WNF+ S+S+I DMLCL+L SGRTKYDF
Sbjct: 364 EAWNFYKSTSNIPDMLCLVLGSGRTKYDF 392
>gi|147854152|emb|CAN83830.1| hypothetical protein VITISV_003973 [Vitis vinifera]
Length = 321
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 238/323 (73%), Positives = 264/323 (81%), Gaps = 22/323 (6%)
Query: 81 GLKALNKTSFQASS----IGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTER 136
G KALNKT SS G SI+ +TVFTIYN+SL +H D R+S++VTVGNASYSK ER
Sbjct: 16 GTKALNKTHIATSSHESNFGESIALITVFTIYNSSLALHXDGRSSDLVTVGNASYSKMER 75
Query: 137 SMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSY 196
SMAILNVFINFIQ TMP+S+V ILTDPAS+ S+ R VTIYPI GEYSRDKLMLQRIRSY
Sbjct: 76 SMAILNVFINFIQATMPQSNVIILTDPASEFSLHRDRVTIYPIQGEYSRDKLMLQRIRSY 135
Query: 197 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNS 256
I FLE ++ EHSQG GHINHY+FTDSDIAVVDDLG IF + NFH+ALTFRNNK+QPL
Sbjct: 136 IVFLETKLEEHSQGHGHINHYIFTDSDIAVVDDLGQIFQSHPNFHVALTFRNNKEQPL-- 193
Query: 257 GFIAVRGTPDGISRAKIFL-EEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFT 315
K ++ +VL+VYSS++MNASRMLGDQLALAWVVKSHP FD +RF+
Sbjct: 194 ---------------KFWIYSKVLKVYSSRFMNASRMLGDQLALAWVVKSHPYFDTKRFS 238
Query: 316 KAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFF 375
K Q F+EDI G SVLFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFF
Sbjct: 239 KPQAFLEDIGGTSVLFLPCAIYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFF 298
Query: 376 SSSSDISDMLCLILMSGRTKYDF 398
SSSDISDMLCLILMSGRTKYDF
Sbjct: 299 ISSSDISDMLCLILMSGRTKYDF 321
>gi|294462546|gb|ADE76819.1| unknown [Picea sitchensis]
Length = 391
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 234/390 (60%), Positives = 290/390 (74%), Gaps = 8/390 (2%)
Query: 17 ACGGCRRFLFFLPLVFFLPYLLSVLEL----HEKSVVEDLPRKNRQKFDHLILGPAAGQR 72
A G RF+ FLP + LP++ S +L + K + R+KFD+++LGPAAGQ
Sbjct: 2 ASSGKWRFIRFLPFILILPFIFSGFQLSRLQNSKPKGDGSVGVGRKKFDYIVLGPAAGQG 61
Query: 73 LPNRLQCQGLKALNK---TSFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNA 129
LPNR+QCQGLKA+ + SF S + ISFVTVFTIYN SL + D + S V+VGN+
Sbjct: 62 LPNRIQCQGLKAVKRRPLPSFHLSLVKEKISFVTVFTIYNQSLQISFDQKVSTNVSVGNS 121
Query: 130 SYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLM 189
+Y KT+RSMAILNVF NFI+V MP+S++FILTDPAS+ + + I G+YSR+ LM
Sbjct: 122 TYDKTQRSMAILNVFANFIKVAMPRSNIFILTDPASNFPVVPSNAVVMHIPGDYSRNNLM 181
Query: 190 LQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN 249
LQRI+SYI FLE R+ H Q ++H++FTDSDIAVVDDLG + +Y +FH+ LTFRNN
Sbjct: 182 LQRIKSYIDFLEARLSGHIGKQNQVDHFIFTDSDIAVVDDLGDVVENYPDFHIGLTFRNN 241
Query: 250 KDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSF 309
KDQPLNSGFI VRGT + +S+AK FLEEVL +Y S +M A+RMLGDQLALAW+VK+ P F
Sbjct: 242 KDQPLNSGFILVRGTDEAVSKAKAFLEEVLEIYKSMFMKAARMLGDQLALAWIVKNQPLF 301
Query: 310 DARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLML 369
DA+RF + FV ++ A VLFLPCA YNWTPPEGAGQFHGMP DVKV+HFKGSRKRLM+
Sbjct: 302 DAQRFRNPKAFVAEVHRAQVLFLPCAIYNWTPPEGAGQFHGMPEDVKVIHFKGSRKRLMM 361
Query: 370 ESWNFFSSSS-DISDMLCLILMSGRTKYDF 398
ESWNFF+S D SDM+CLIL SGR KYDF
Sbjct: 362 ESWNFFNSHPVDFSDMMCLILKSGRVKYDF 391
>gi|255541260|ref|XP_002511694.1| conserved hypothetical protein [Ricinus communis]
gi|223548874|gb|EEF50363.1| conserved hypothetical protein [Ricinus communis]
Length = 554
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 243/382 (63%), Positives = 281/382 (73%), Gaps = 61/382 (15%)
Query: 29 PLVFFLPYLL-------SVLELHEKSVVEDLPRKN-RQKFDHLILGPAAGQRLPNRLQCQ 80
PL+F + YL SVLELH SV E P+KN +K DHL++GPAAGQ LP+RLQC+
Sbjct: 222 PLLFCVMYLACYTMHVASVLELHWNSVTE-APQKNWNKKSDHLVIGPAAGQGLPDRLQCE 280
Query: 81 GLKALNKT----SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTER 136
G KALNKT S S++G++++FVTVFTIYNTSLD D R+SN+VTVGN SYSKTER
Sbjct: 281 GSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSLPDDRSSNLVTVGNVSYSKTER 340
Query: 137 SMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSY 196
SMAILNVFINFIQ
Sbjct: 341 SMAILNVFINFIQ----------------------------------------------- 353
Query: 197 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNS 256
FL+ +++E ++ H +HY+FTDSDIAVVDDLG IFH+Y NFH+ALTFRNNK+QPLNS
Sbjct: 354 -NFLDTKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYPNFHIALTFRNNKEQPLNS 412
Query: 257 GFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTK 316
GFIAVRGT + I RAKIFL+ VL VY+SKYMNASRMLGDQLALAWV++SHP FD +RF K
Sbjct: 413 GFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASRMLGDQLALAWVIRSHPGFDLQRFRK 472
Query: 317 AQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFS 376
AQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRK LMLESWNFF
Sbjct: 473 AQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKHLMLESWNFFR 532
Query: 377 SSSDISDMLCLILMSGRTKYDF 398
S+SDISDMLCLILMSGRTKYDF
Sbjct: 533 SASDISDMLCLILMSGRTKYDF 554
>gi|293336758|ref|NP_001169994.1| uncharacterized protein LOC100383899 precursor [Zea mays]
gi|224032791|gb|ACN35471.1| unknown [Zea mays]
gi|413924636|gb|AFW64568.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
Length = 388
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 232/375 (61%), Positives = 284/375 (75%), Gaps = 9/375 (2%)
Query: 30 LVFFLPYLLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQGLKAL 85
L+F +P + SV L EK V P ++ DHL+LGPAAGQ P+RLQC+GL+AL
Sbjct: 17 LLFLVPLIYSVSRLQPWAPEKGVCLPPPTAPKRP-DHLVLGPAAGQDRPDRLQCRGLRAL 75
Query: 86 NKT--SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNV 143
NK S + + G +SF TVFT YN S+ D+ S+ VTVGN+SYSK ERSMAILN
Sbjct: 76 NKIGISSEENYSGEHVSFATVFTTYN-SVSAGDDNVPSDSVTVGNSSYSKIERSMAILNT 134
Query: 144 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERR 203
FI+FI+V+MP+SD+ ILTDP S +S+ + T+ P+ G YSR LMLQRI++YI FLE++
Sbjct: 135 FISFIKVSMPRSDLIILTDPGSKISVNQGTATLLPVEGNYSRGNLMLQRIKTYIAFLEQK 194
Query: 204 IREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRG 263
+ E + + +NH+V TDSDIAVV DLGHIF Y +FHLA+TFRNNK QPLNSGF+AVRG
Sbjct: 195 LVEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRNNKGQPLNSGFVAVRG 253
Query: 264 TPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVED 323
T DGI+ A FL++VL YS +YM ASRMLGDQLALAWVVKSH +F+K + F +
Sbjct: 254 TRDGITNAVEFLKQVLGTYSLRYMKASRMLGDQLALAWVVKSHLPSAFGKFSKNEAFTGE 313
Query: 324 IIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISD 383
+ G SVLFLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF+SS+S +SD
Sbjct: 314 VNGTSVLFLPCAVYNWTPPEGAGQFHGVPLDVKVVHFKGSRKRLMLEAWNFYSSTSKLSD 373
Query: 384 MLCLILMSGRTKYDF 398
MLCLIL SGRTKYDF
Sbjct: 374 MLCLILRSGRTKYDF 388
>gi|413954355|gb|AFW87004.1| hypothetical protein ZEAMMB73_846695 [Zea mays]
Length = 414
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 224/356 (62%), Positives = 275/356 (77%), Gaps = 5/356 (1%)
Query: 45 EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQGLKALNKT--SFQASSIGNSISFV 102
EK V P ++ DHL+LGPAAGQ P+RLQC+GL+ALNK S + + G +SFV
Sbjct: 62 EKGVCLPPPTAPKRP-DHLVLGPAAGQGRPDRLQCRGLRALNKIGLSSEENYSGEHVSFV 120
Query: 103 TVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 162
TVFT YN S+ + + VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTD
Sbjct: 121 TVFTTYN-SVSAGDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTD 179
Query: 163 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 222
P S S+ + T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDS
Sbjct: 180 PGSKFSVNQGSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDS 238
Query: 223 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 282
DIAVVDDLGHIF Y +FHLA+TFRNNK QPLNSGF+AVRGT DGI++A FL++VL+ Y
Sbjct: 239 DIAVVDDLGHIFEKYPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITKAAEFLKQVLKAY 298
Query: 283 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342
S +Y+ A+RMLGDQLALAWVVKSH +F+K + F ++ G SVLFLPCA YNWTPP
Sbjct: 299 SLRYIKAARMLGDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTPP 358
Query: 343 EGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
EGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 359 EGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFYNSTSKMSDMLCLILRSGRTKYDF 414
>gi|413954354|gb|AFW87003.1| hypothetical protein ZEAMMB73_846695 [Zea mays]
Length = 389
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 224/356 (62%), Positives = 275/356 (77%), Gaps = 5/356 (1%)
Query: 45 EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQGLKALNKT--SFQASSIGNSISFV 102
EK V P ++ DHL+LGPAAGQ P+RLQC+GL+ALNK S + + G +SFV
Sbjct: 37 EKGVCLPPPTAPKRP-DHLVLGPAAGQGRPDRLQCRGLRALNKIGLSSEENYSGEHVSFV 95
Query: 103 TVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 162
TVFT YN S+ + + VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTD
Sbjct: 96 TVFTTYN-SVSAGDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTD 154
Query: 163 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 222
P S S+ + T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDS
Sbjct: 155 PGSKFSVNQGSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDS 213
Query: 223 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 282
DIAVVDDLGHIF Y +FHLA+TFRNNK QPLNSGF+AVRGT DGI++A FL++VL+ Y
Sbjct: 214 DIAVVDDLGHIFEKYPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITKAAEFLKQVLKAY 273
Query: 283 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342
S +Y+ A+RMLGDQLALAWVVKSH +F+K + F ++ G SVLFLPCA YNWTPP
Sbjct: 274 SLRYIKAARMLGDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTPP 333
Query: 343 EGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
EGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 334 EGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFYNSTSKMSDMLCLILRSGRTKYDF 389
>gi|357124057|ref|XP_003563723.1| PREDICTED: uncharacterized protein LOC100833864 [Brachypodium
distachyon]
Length = 391
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 222/340 (65%), Positives = 264/340 (77%), Gaps = 4/340 (1%)
Query: 61 DHLILGPAAGQRLPNRLQCQGLKALNKT--SFQASSIGNSISFVTVFTIYNTSLDVHVDS 118
D L+LGPAAGQ P+RLQCQGLKA+NK S + + G +SFVTVFT YN+ D
Sbjct: 54 DRLVLGPAAGQGRPDRLQCQGLKAVNKIILSSETTHYGERVSFVTVFTTYNSDPD-KASK 112
Query: 119 RASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYP 178
+S +VTVGN SYSK ERS+A+LN FI+FIQV+MP+S+V ILTDP S+LS+ + I P
Sbjct: 113 MSSGLVTVGNHSYSKVERSIAVLNTFISFIQVSMPRSNVIILTDPKSNLSIDQGNAVILP 172
Query: 179 IHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQ 238
I G YSR LMLQRI+SYI FLE + E Q H+VFTDSDIAVV+ LGHIF Y
Sbjct: 173 IEGNYSRGNLMLQRIKSYIAFLELKFVEL-QRVDRFTHFVFTDSDIAVVEGLGHIFKRYP 231
Query: 239 NFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLA 298
+ HLALTFRNN QPLNSGF+AVRGT DGIS+A F +EVL+ Y+SKYM ASRMLGDQLA
Sbjct: 232 HCHLALTFRNNNGQPLNSGFVAVRGTSDGISKATEFFKEVLKAYNSKYMKASRMLGDQLA 291
Query: 299 LAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVV 358
LAWVVKS+ +F++ + F ++ GAS+LFLPCA YNWTPPEGAGQFHGMPLDVKV+
Sbjct: 292 LAWVVKSYLPSAFGKFSRHEEFTGEVNGASILFLPCAVYNWTPPEGAGQFHGMPLDVKVI 351
Query: 359 HFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
HFKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 352 HFKGSRKRLMLEAWNFYNSTSHLSDMLCLILKSGRTKYDF 391
>gi|242093388|ref|XP_002437184.1| hypothetical protein SORBIDRAFT_10g022560 [Sorghum bicolor]
gi|241915407|gb|EER88551.1| hypothetical protein SORBIDRAFT_10g022560 [Sorghum bicolor]
Length = 377
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 227/368 (61%), Positives = 269/368 (73%), Gaps = 21/368 (5%)
Query: 37 LLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQGLKALNKTSFQA 92
+ SV LH EK V P ++ D L+LGPAAGQ P+RLQCQGL+ALNK +
Sbjct: 25 IYSVSRLHPWVPEKGVCLPPPTAPKRP-DRLVLGPAAGQGRPDRLQCQGLRALNKIGLSS 83
Query: 93 SSI--GNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQV 150
I G ISFVTVFT YN S+ + S+ VTVGN SYSK ERSMAILN FI+FI+V
Sbjct: 84 EEIYSGEHISFVTVFTTYN-SVSAGDGNVPSDSVTVGNHSYSKIERSMAILNTFISFIKV 142
Query: 151 TMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQG 210
+MP+S+V ILTDP S +S+ + T+ PI G YSR LMLQRI++YI
Sbjct: 143 SMPRSNVIILTDPGSKISVNQGSATLLPIEGNYSRGNLMLQRIQTYI------------- 189
Query: 211 QGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISR 270
G + + TDSDIAVVDDLGHIF Y + HLALTFRNNK QPLNSGF+AVRGT DGI++
Sbjct: 190 DGGVESFFLTDSDIAVVDDLGHIFKKYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGITK 249
Query: 271 AKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVL 330
A FL++VL YSS+Y+ ASRMLGDQLALAWVVKSH +F+K + F ++ GASVL
Sbjct: 250 AVEFLKQVLGAYSSRYIKASRMLGDQLALAWVVKSHLPSAFGKFSKHEAFTGEVNGASVL 309
Query: 331 FLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILM 390
FLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL
Sbjct: 310 FLPCAVYNWTPPEGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFYNSTSKLSDMLCLILR 369
Query: 391 SGRTKYDF 398
SGRTKYDF
Sbjct: 370 SGRTKYDF 377
>gi|242060516|ref|XP_002451547.1| hypothetical protein SORBIDRAFT_04g003580 [Sorghum bicolor]
gi|241931378|gb|EES04523.1| hypothetical protein SORBIDRAFT_04g003580 [Sorghum bicolor]
Length = 346
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 210/335 (62%), Positives = 255/335 (76%), Gaps = 4/335 (1%)
Query: 66 GPAAGQRLPNRLQCQGLKALNKT--SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNM 123
GP L + GL+ALNK S + + G ISFVTVFT YN S+ S+
Sbjct: 14 GPVLLFLLAPLIYSAGLRALNKIGLSSEENYPGEHISFVTVFTTYN-SVSAGDGKVPSDS 72
Query: 124 VTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEY 183
VTVGN SYSKTERSMAIL+ FI+FI+V+MP+S+V ILTDP S +S+ + T+ PI G Y
Sbjct: 73 VTVGNHSYSKTERSMAILSTFISFIRVSMPRSNVIILTDPGSKISVNQGSATLLPIEGNY 132
Query: 184 SRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLA 243
SR LMLQRI++YI FLE+++ E +G +NH+V TDSDIA+VDDLGHIF Y + HLA
Sbjct: 133 SRGNLMLQRIKTYIAFLEQKLVEFDSMEG-LNHFVLTDSDIALVDDLGHIFKKYPHCHLA 191
Query: 244 LTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVV 303
LTFRNNK QPLNSGF+AVRGT DGI++A FL++VL Y +Y+ ASRMLGDQLALAWVV
Sbjct: 192 LTFRNNKGQPLNSGFVAVRGTRDGITKAVEFLKQVLEAYCLRYIKASRMLGDQLALAWVV 251
Query: 304 KSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGS 363
KSH +F+K + F ++ GASVLFLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGS
Sbjct: 252 KSHLPSAFGKFSKHEAFTGEVNGASVLFLPCAVYNWTPPEGAGQFHGIPLDVKVVHFKGS 311
Query: 364 RKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 398
RKRLMLE+WNF++S+S +SDMLC+IL SGRTKYDF
Sbjct: 312 RKRLMLEAWNFYNSTSKLSDMLCIILRSGRTKYDF 346
>gi|357471693|ref|XP_003606131.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
gi|355507186|gb|AES88328.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
Length = 251
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 196/251 (78%), Positives = 218/251 (86%)
Query: 148 IQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREH 207
+QV MP+S+V ILTDP SDLS+ R V++YPI GEYSRDKLMLQRIRSYITFLE R+++
Sbjct: 1 MQVVMPQSEVIILTDPVSDLSVHRNRVSLYPIQGEYSRDKLMLQRIRSYITFLETRLQKL 60
Query: 208 SQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDG 267
SQ I HY+FTDSDIAVVDDLG IF D+ NFH+ALTFRNNK QPLNSGFIAV+GTPDG
Sbjct: 61 SQNPKDITHYIFTDSDIAVVDDLGQIFRDHPNFHMALTFRNNKAQPLNSGFIAVKGTPDG 120
Query: 268 ISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGA 327
I RAK+FL+EVL+VY SKYM+ASRMLGDQLALAWVVKS P FDA RF K F +DI G
Sbjct: 121 ILRAKLFLQEVLKVYVSKYMSASRMLGDQLALAWVVKSKPQFDASRFAKTVAFSDDIGGT 180
Query: 328 SVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCL 387
S+LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF+SS+ DI+DMLCL
Sbjct: 181 SILFLPCALYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFYSSTPDIADMLCL 240
Query: 388 ILMSGRTKYDF 398
IL SGRTKYDF
Sbjct: 241 ILGSGRTKYDF 251
>gi|218198407|gb|EEC80834.1| hypothetical protein OsI_23435 [Oryza sativa Indica Group]
Length = 344
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 208/321 (64%), Positives = 240/321 (74%), Gaps = 8/321 (2%)
Query: 81 GLKALNKTSFQAS---SIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERS 137
GLKA+NK + S G+ ++FVTVFT YN+ SN+VTVG SYSK RS
Sbjct: 29 GLKAVNKIGLSSERNYSRGH-VTFVTVFTTYNSD-PAEASKLPSNVVTVGKHSYSKVGRS 86
Query: 138 MAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYI 197
MAILN FI FIQV+MP+S+V ILTDP S L+ I PI G YSR LMLQRIRSYI
Sbjct: 87 MAILNTFIGFIQVSMPRSNVIILTDPNSKLT--HGSAVILPIEGNYSRGNLMLQRIRSYI 144
Query: 198 TFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSG 257
FLE+R+ E + INH +FTDSDIAVV DLGHIF Y + HLALTFRNNK QPLNSG
Sbjct: 145 AFLEQRLEELETVED-INHLIFTDSDIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSG 203
Query: 258 FIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKA 317
F+AVRGT DGI +A F +EVL Y KYM ASRMLGDQLALAWVVKS+ +F+K
Sbjct: 204 FVAVRGTRDGIFKAIEFFKEVLEAYHLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKH 263
Query: 318 QPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSS 377
+ F ++ G S+LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE+WNF++S
Sbjct: 264 EAFTGEVNGTSILFLPCAVYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLEAWNFYNS 323
Query: 378 SSDISDMLCLILMSGRTKYDF 398
+S++SDMLCLIL SGRTKYDF
Sbjct: 324 TSELSDMLCLILRSGRTKYDF 344
>gi|222635777|gb|EEE65909.1| hypothetical protein OsJ_21755 [Oryza sativa Japonica Group]
Length = 344
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/321 (64%), Positives = 239/321 (74%), Gaps = 8/321 (2%)
Query: 81 GLKALNKTSFQAS---SIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERS 137
GLKA+NK + S G+ ++FVTVFT YN+ SN+VTVG SYSK RS
Sbjct: 29 GLKAVNKIGLSSERNYSRGH-VTFVTVFTTYNSD-PAEASKLPSNVVTVGKHSYSKVGRS 86
Query: 138 MAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYI 197
MAILN FI FIQV+MP+S+V ILTDP S L+ I PI G YSR LM QRIRSYI
Sbjct: 87 MAILNTFIGFIQVSMPRSNVIILTDPNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYI 144
Query: 198 TFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSG 257
FLE+R+ E + INH +FTDSDIAVV DLGHIF Y + HLALTFRNNK QPLNSG
Sbjct: 145 AFLEQRLEELETVE-DINHLIFTDSDIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSG 203
Query: 258 FIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKA 317
F+AVRGT DGI +A F +EVL Y KYM ASRMLGDQLALAWVVKS+ +F+K
Sbjct: 204 FVAVRGTRDGIFKAIEFFKEVLEAYYLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKH 263
Query: 318 QPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSS 377
+ F ++ G S+LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE+WNF++S
Sbjct: 264 EAFTGEVNGTSILFLPCAVYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLEAWNFYNS 323
Query: 378 SSDISDMLCLILMSGRTKYDF 398
+S++SDMLCLIL SGRTKYDF
Sbjct: 324 TSELSDMLCLILRSGRTKYDF 344
>gi|224101407|ref|XP_002334278.1| predicted protein [Populus trichocarpa]
gi|222870580|gb|EEF07711.1| predicted protein [Populus trichocarpa]
Length = 274
Score = 353 bits (906), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 194/366 (53%), Positives = 220/366 (60%), Gaps = 98/366 (26%)
Query: 34 LPYL-LSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQGLKALNKTSFQA 92
+P+L SVLELH+ + P+K KFDHL+LGPAAGQ LPNRLQC Q
Sbjct: 6 IPFLSFSVLELHQNPAAQPPPKKMNTKFDHLVLGPAAGQGLPNRLQC-----------QG 54
Query: 93 SSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTM 152
S+ S+V QVTM
Sbjct: 55 DSVQIHFSYVC--------------------------------------------FQVTM 70
Query: 153 PKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQG 212
P+S+V ILTDPASDLS+ R VT+YPI G+YSRDKLMLQRIRSYITFLE R+ + +Q G
Sbjct: 71 PQSNVVILTDPASDLSLHRNSVTVYPIQGDYSRDKLMLQRIRSYITFLETRLEKLAQNPG 130
Query: 213 HINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAK 272
I+HY+ TDSDIAVVDDLGH+F+D+ F TFR+NK+QPLNSGFIAV GT D I R
Sbjct: 131 PISHYILTDSDIAVVDDLGHLFNDHPTFTRLFTFRDNKEQPLNSGFIAVWGTADAILR-- 188
Query: 273 IFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFL 332
RFTKAQ F+E+I G SVLFL
Sbjct: 189 ----------------------------------------RFTKAQAFLENIGGTSVLFL 208
Query: 333 PCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSG 392
PCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF SSSSDI MLCL+L SG
Sbjct: 209 PCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFLSSSSDIFGMLCLVLSSG 268
Query: 393 RTKYDF 398
RTKYDF
Sbjct: 269 RTKYDF 274
>gi|2583122|gb|AAB82631.1| hypothetical protein [Arabidopsis thaliana]
Length = 304
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 193/349 (55%), Positives = 231/349 (66%), Gaps = 59/349 (16%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+++ S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQGLKALNKT---SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTV-GNA 129
+R C+G KALNKT + S GN +SFVTVFT+YNTSL + ++SN V+V GN
Sbjct: 70 SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLG---NVKSSNPVSVVGNV 126
Query: 130 SYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLM 189
+YSK ERSMA+LN F NFIQ
Sbjct: 127 TYSKPERSMAVLNAFANFIQ---------------------------------------- 146
Query: 190 LQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN 249
TFLE ++ ++ +G INHY+FTDSDIAVVDD+G IF + +FHLALTFRNN
Sbjct: 147 --------TFLEMKLEKN---EGGINHYIFTDSDIAVVDDVGTIFDKHSSFHLALTFRNN 195
Query: 250 KDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSF 309
KDQPLNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL VVKSH SF
Sbjct: 196 KDQPLNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVSVVKSHASF 255
Query: 310 DARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVV 358
DA+RFTK Q F E+I GASVLFLPCA YNWTPPEGAGQFHGMPLDVKV+
Sbjct: 256 DAKRFTKPQAFTEEIAGASVLFLPCALYNWTPPEGAGQFHGMPLDVKVL 304
>gi|223949095|gb|ACN28631.1| unknown [Zea mays]
gi|413924639|gb|AFW64571.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
Length = 209
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 148/210 (70%), Positives = 174/210 (82%), Gaps = 1/210 (0%)
Query: 189 MLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRN 248
MLQRI++YI FLE+++ E + + +NH+V TDSDIAVV DLGHIF Y +FHLA+TFRN
Sbjct: 1 MLQRIKTYIAFLEQKLVEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRN 59
Query: 249 NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPS 308
NK QPLNSGF+AVRGT DGI+ A FL++VL YS +YM ASRMLGDQLALAWVVKSH
Sbjct: 60 NKGQPLNSGFVAVRGTRDGITNAVEFLKQVLGTYSLRYMKASRMLGDQLALAWVVKSHLP 119
Query: 309 FDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM 368
+F+K + F ++ G SVLFLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLM
Sbjct: 120 SAFGKFSKNEAFTGEVNGTSVLFLPCAVYNWTPPEGAGQFHGVPLDVKVVHFKGSRKRLM 179
Query: 369 LESWNFFSSSSDISDMLCLILMSGRTKYDF 398
LE+WNF+SS+S +SDMLCLIL SGRTKYDF
Sbjct: 180 LEAWNFYSSTSKLSDMLCLILRSGRTKYDF 209
>gi|54291155|dbj|BAD61827.1| hypothetical protein [Oryza sativa Japonica Group]
gi|54291236|dbj|BAD61931.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 288
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 164/279 (58%), Positives = 187/279 (67%), Gaps = 22/279 (7%)
Query: 81 GLKALNKTSFQAS---SIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERS 137
GLKA+NK + S G+ ++FVTVFT YN+ SN+VTVG SYSK RS
Sbjct: 29 GLKAVNKIGLSSERNYSRGH-VTFVTVFTTYNSD-PAEASKLPSNVVTVGKHSYSKVGRS 86
Query: 138 MAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYI 197
MAILN FI FIQV+MP+S+V ILTDP S L+ I PI G YSR LM QRIRSYI
Sbjct: 87 MAILNTFIGFIQVSMPRSNVIILTDPNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYI 144
Query: 198 TFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSG 257
FLE+R+ E + INH +FTDSDIAVV DLGHIF Y + HLALTFRNNK QPLNSG
Sbjct: 145 AFLEQRLEELETVED-INHLIFTDSDIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSG 203
Query: 258 FIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKA 317
F+AVRGT DGI +A F +EVL Y KYM ASRMLGDQLALAWVVKS+ +F+K
Sbjct: 204 FVAVRGTRDGIFKAIEFFKEVLEAYYLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKH 263
Query: 318 QPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVK 356
+ F YNWTPPEGAGQFHGMPLDVK
Sbjct: 264 EAF--------------TVYNWTPPEGAGQFHGMPLDVK 288
>gi|302774282|ref|XP_002970558.1| hypothetical protein SELMODRAFT_65658 [Selaginella moellendorffii]
gi|300162074|gb|EFJ28688.1| hypothetical protein SELMODRAFT_65658 [Selaginella moellendorffii]
Length = 331
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/339 (43%), Positives = 213/339 (62%), Gaps = 18/339 (5%)
Query: 69 AGQRLPNRLQCQGLKALNKTSFQASSIG-NSISFVTVFTIYNTSLDVHVDSRASNMVTVG 127
AG + R+QC+ ++ +SS G S+ V++F D H S + VG
Sbjct: 2 AGLGISGRIQCRSSSGKPGATWISSSCGTESVGLVSLFV----PPDRH--SGECDGSVVG 55
Query: 128 NASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR--KGVTIYPIHGEYSR 185
E+ ++L VF+ ++ MP S +LTDPA+ +S R G++ + G YSR
Sbjct: 56 GRVLRGLEKGYSVLRVFVESARLAMPNSQQLVLTDPAAVISTERLPAGISFQRVPGNYSR 115
Query: 186 DKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALT 245
LMLQR+ SYI FL+ +I++ + + H++F DSD+ VV DLG +F ++ +F +ALT
Sbjct: 116 GNLMLQRLDSYIAFLDDQIKQVGKAD-SLQHFIFADSDMIVVGDLGCVFLEFPSFDVALT 174
Query: 246 FRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVK- 304
FRNNK+QP+NSG I VRG+ DG+++ K+ L+ V+ Y + ASRM+GDQLA AWVV+
Sbjct: 175 FRNNKEQPINSGMIFVRGSKDGLAKGKLLLQSVVDSYRRDFFRASRMMGDQLAFAWVVRH 234
Query: 305 -SHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGS 363
S P D+ F + + F + G VLFLPC++YNWTP EGAGQFHGMPLDVK +HFKGS
Sbjct: 235 FSDPLEDS--FKQGKVFKSQVKGVEVLFLPCSSYNWTPAEGAGQFHGMPLDVKAIHFKGS 292
Query: 364 RKRLMLESWNF----FSSSSDISDMLCLILMSGRTKYDF 398
RKRLMLE+W+ +++ D+ + C +L SGR+KYDF
Sbjct: 293 RKRLMLEAWDSHKHQVAATKDLLPLQCFVLKSGRSKYDF 331
>gi|302769952|ref|XP_002968395.1| hypothetical protein SELMODRAFT_65657 [Selaginella moellendorffii]
gi|300164039|gb|EFJ30649.1| hypothetical protein SELMODRAFT_65657 [Selaginella moellendorffii]
Length = 280
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 131/280 (46%), Positives = 185/280 (66%), Gaps = 7/280 (2%)
Query: 125 TVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR--KGVTIYPIHGE 182
VG E+ ++L VF+ ++ MP S +LTDPA+ +S R G++ + G
Sbjct: 2 VVGGRVLRGLEKGYSVLRVFVESARLAMPNSQQLVLTDPAAAISTERLPAGISFQRVPGN 61
Query: 183 YSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHL 242
YSR LMLQR+ SYI FL+ +I++ + + H++F DSD+ VV DLG +F ++ +F +
Sbjct: 62 YSRGNLMLQRLDSYIAFLDDQIKQVGKADS-LQHFIFADSDMIVVGDLGCVFLEFPSFDV 120
Query: 243 ALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWV 302
ALTFRNNK+QP+NSG I VRG+ DG+++ K+ L+ V+ Y + ASRM+GDQLA AWV
Sbjct: 121 ALTFRNNKEQPINSGMIFVRGSKDGLAKGKLLLQSVVDSYRRDFFRASRMMGDQLAFAWV 180
Query: 303 VKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKG 362
V+ F + + F + G VLFLPC++YNWTP EGAGQFHGMPLDVK +HFKG
Sbjct: 181 VRHFADPLEDSFKQGKVFKSQVKGVEVLFLPCSSYNWTPAEGAGQFHGMPLDVKAIHFKG 240
Query: 363 SRKRLMLESWNF----FSSSSDISDMLCLILMSGRTKYDF 398
SRKRLMLE+W+ +++ D+ + C +L SGR+KYDF
Sbjct: 241 SRKRLMLEAWDSHKHQVAATKDLLPLQCFVLKSGRSKYDF 280
>gi|413924637|gb|AFW64569.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
gi|413924638|gb|AFW64570.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
Length = 260
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 138/247 (55%), Positives = 178/247 (72%), Gaps = 9/247 (3%)
Query: 30 LVFFLPYLLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQGLKAL 85
L+F +P + SV L EK V P ++ DHL+LGPAAGQ P+RLQC+GL+AL
Sbjct: 17 LLFLVPLIYSVSRLQPWAPEKGVCLPPPTAPKRP-DHLVLGPAAGQDRPDRLQCRGLRAL 75
Query: 86 NKT--SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNV 143
NK S + + G +SF TVFT YN S+ D+ S+ VTVGN+SYSK ERSMAILN
Sbjct: 76 NKIGISSEENYSGEHVSFATVFTTYN-SVSAGDDNVPSDSVTVGNSSYSKIERSMAILNT 134
Query: 144 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERR 203
FI+FI+V+MP+SD+ ILTDP S +S+ + T+ P+ G YSR LMLQRI++YI FLE++
Sbjct: 135 FISFIKVSMPRSDLIILTDPGSKISVNQGTATLLPVEGNYSRGNLMLQRIKTYIAFLEQK 194
Query: 204 IREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRG 263
+ E + + +NH+V TDSDIAVV DLGHIF Y +FHLA+TFRNNK QPLNSGF+AVRG
Sbjct: 195 LVEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRNNKGQPLNSGFVAVRG 253
Query: 264 TPDGISR 270
T DGI++
Sbjct: 254 TRDGITK 260
>gi|413938025|gb|AFW72576.1| hypothetical protein ZEAMMB73_448315 [Zea mays]
Length = 450
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 122/184 (66%), Positives = 148/184 (80%), Gaps = 1/184 (0%)
Query: 196 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 255
+ FLE+++ E + + +NH+V TDSDIAVVDDLGHIF Y +FHLA+TFRNNK QPLN
Sbjct: 262 FTAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIFEKYPHFHLAVTFRNNKGQPLN 320
Query: 256 SGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFT 315
SGF+AVRGT DGI++A FL++VL YS +Y+ ASRMLGDQLALAWVVK H +F+
Sbjct: 321 SGFVAVRGTSDGITKAVEFLKQVLGTYSLRYIKASRMLGDQLALAWVVKFHLPSALGKFS 380
Query: 316 KAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFF 375
K + F ++ G SVLFLPCA YNWT PEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF+
Sbjct: 381 KHEAFTGEVNGTSVLFLPCAVYNWTQPEGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFY 440
Query: 376 SSSS 379
+S S
Sbjct: 441 NSFS 444
>gi|414866083|tpg|DAA44640.1| TPA: putative serine/threonine protein phosphatase superfamily
protein [Zea mays]
Length = 470
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 118/183 (64%), Positives = 143/183 (78%), Gaps = 1/183 (0%)
Query: 174 VTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHI 233
T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDSDIAVVDDLGHI
Sbjct: 210 ATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHI 268
Query: 234 FHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRML 293
F Y +FHLA+TF NNK QPLNSGF+AVRGT DGI++A FL++VL YS +Y+ ASRML
Sbjct: 269 FEKYPHFHLAVTFCNNKGQPLNSGFVAVRGTRDGITKAVEFLKQVLGTYSLRYIKASRML 328
Query: 294 GDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPL 353
GDQLALAWVVKSH +F+K + F ++ G SVLFLPC YNWTPPEGAGQFHG+PL
Sbjct: 329 GDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCVVYNWTPPEGAGQFHGIPL 388
Query: 354 DVK 356
DVK
Sbjct: 389 DVK 391
>gi|168008832|ref|XP_001757110.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691608|gb|EDQ77969.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/307 (41%), Positives = 184/307 (59%), Gaps = 9/307 (2%)
Query: 97 NSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQ-VTMP-K 154
++SFVT+F + ++ D + + K R A+L F+ IQ V+MP
Sbjct: 6 ETVSFVTLFVM--PEVETASDRFSEAVEKKFEVRKVKKSRQDAVLRAFLESIQQVSMPGT 63
Query: 155 SDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGH- 213
+ V I+T+ + + P +SR LM+QR++SYI L+ I +
Sbjct: 64 TRVTIITNHNKLRGELPQDIDWKPTSRHFSRRNLMIQRLQSYIELLDSMIEDRKNNSSSP 123
Query: 214 INHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKI 273
++H +F+D D+ VVDDLG +F ++ +F +A TFRNN+ QP+NSG I VRGT +SRA
Sbjct: 124 VSHAIFSDFDMIVVDDLGCVFKEFPHFDIAFTFRNNQRQPINSGVIMVRGTFGSLSRATQ 183
Query: 274 FLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLP 333
L+EV+++Y +K+ +A +LGDQLALA +VK + AR F + P ++ LFLP
Sbjct: 184 LLKEVVKIYLAKFRHAFGVLGDQLALADIVKG--TLQARAFQEGVPVEATVMTTKTLFLP 241
Query: 334 CATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSS--DISDMLCLILMS 391
C YNWTPPEGAGQF GMP +VKV+HFKG RKRLM+++W F+ D M CL+L S
Sbjct: 242 CVIYNWTPPEGAGQFQGMPTEVKVLHFKGRRKRLMIQAWYFYKKQGVLDFYKMKCLVLKS 301
Query: 392 GRTKYDF 398
GR+KYD+
Sbjct: 302 GRSKYDY 308
>gi|414587497|tpg|DAA38068.1| TPA: hypothetical protein ZEAMMB73_303828 [Zea mays]
Length = 258
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 124/220 (56%), Positives = 158/220 (71%), Gaps = 7/220 (3%)
Query: 53 PRKNRQKFDHLILGPAAGQRLPNRLQCQGLKALNKT--SFQASSIGNSISFVTVFTIYNT 110
P ++ DHL+LGPAAGQ P+R + L+ALNK S + + G + FVTVFT YN
Sbjct: 44 PPTAPKRPDHLVLGPAAGQGRPDRRR---LRALNKIGLSSEENYSGEHVPFVTVFTTYN- 99
Query: 111 SLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMP 170
S+ + + VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S S+
Sbjct: 100 SVSAGDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSVN 159
Query: 171 RKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDL 230
+ T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDSDIAVVDDL
Sbjct: 160 QGSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDL 218
Query: 231 GHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISR 270
GHIF +FHLA+TFRNNK QPLNSGF+AVRGT DGI++
Sbjct: 219 GHIFEKNPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITK 258
>gi|255562810|ref|XP_002522410.1| hypothetical protein RCOM_0835860 [Ricinus communis]
gi|223538295|gb|EEF39902.1| hypothetical protein RCOM_0835860 [Ricinus communis]
Length = 202
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 103/130 (79%), Positives = 112/130 (86%), Gaps = 6/130 (4%)
Query: 269 SRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGAS 328
RAKIFL+ VL VY+SKYMNASR ALAWV++SHP FD RRF KAQ F++++ GAS
Sbjct: 79 CRAKIFLQHVLEVYTSKYMNASR------ALAWVIRSHPGFDLRRFHKAQAFMDEMGGAS 132
Query: 329 VLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLI 388
LFLPCA YNWTPPEGAGQFH MPLDVKVVHFKGSRKRLMLESWNFF S+SDISDMLCLI
Sbjct: 133 ALFLPCAIYNWTPPEGAGQFHRMPLDVKVVHFKGSRKRLMLESWNFFRSASDISDMLCLI 192
Query: 389 LMSGRTKYDF 398
LMSGRTKYDF
Sbjct: 193 LMSGRTKYDF 202
>gi|51968566|dbj|BAD42975.1| hypothetical protein [Arabidopsis thaliana]
Length = 200
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/188 (55%), Positives = 132/188 (70%), Gaps = 8/188 (4%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+++ S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQGLKALNKT---SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTV-GNA 129
+R C+G KALNKT + S GN +SFVTVFT+YNTSL + ++SN V+V GN
Sbjct: 70 SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLG---NVKSSNPVSVVGNV 126
Query: 130 SYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLM 189
+YSK ERSMA+LN F NFIQVTMPKS+V ILTDPASDLS+ + V + P+ G+YSR LM
Sbjct: 127 TYSKPERSMAVLNAFANFIQVTMPKSNVVILTDPASDLSIQQSNVILQPVQGDYSRGNLM 186
Query: 190 LQRIRSYI 197
LQRIRSYI
Sbjct: 187 LQRIRSYI 194
>gi|357520759|ref|XP_003630668.1| hypothetical protein MTR_8g102040 [Medicago truncatula]
gi|355524690|gb|AET05144.1| hypothetical protein MTR_8g102040 [Medicago truncatula]
Length = 105
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 82/129 (63%), Positives = 94/129 (72%), Gaps = 25/129 (19%)
Query: 270 RAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASV 329
+ K+FL+EVL+VY SKYM+ ++ L PF +DI G S+
Sbjct: 2 QGKLFLQEVLKVYVSKYMSVAKTL-------------------------PFSDDIGGTSI 36
Query: 330 LFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLIL 389
LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF+SS+ DI+DMLCLIL
Sbjct: 37 LFLPCALYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFYSSTPDIADMLCLIL 96
Query: 390 MSGRTKYDF 398
SGRTKYDF
Sbjct: 97 GSGRTKYDF 105
>gi|413942906|gb|AFW75555.1| hypothetical protein ZEAMMB73_119492 [Zea mays]
Length = 177
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 61/84 (72%), Positives = 67/84 (79%), Gaps = 1/84 (1%)
Query: 292 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGM 351
MLGDQLALAWVVK H +F+K + F ++ G SVLFLPCA YNWT PEGAGQFHG+
Sbjct: 1 MLGDQLALAWVVKFHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTSPEGAGQFHGI 60
Query: 352 PLDVKVVHFKGSRKRLMLES-WNF 374
PLDVKVVHFKGSRKRLMLE WNF
Sbjct: 61 PLDVKVVHFKGSRKRLMLERLWNF 84
>gi|414886764|tpg|DAA62778.1| TPA: putative homeodomain-like transcription factor superfamily
protein [Zea mays]
Length = 2379
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/119 (55%), Positives = 84/119 (70%), Gaps = 3/119 (2%)
Query: 81 GLKALNKT--SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGNASYSKTERSM 138
GL+ALNK S + + G +SFVTVFT YN S+ + + + VTVGN+SYSK ERSM
Sbjct: 72 GLRALNKIGLSSEENYSGEHVSFVTVFTTYN-SVSAGDGNVSPDSVTVGNSSYSKIERSM 130
Query: 139 AILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYI 197
ILN FI+FI+V+MP+SDV ILTDP S S+ + T+ PI G YSR LMLQRI++YI
Sbjct: 131 TILNTFISFIKVSMPRSDVIILTDPGSKFSVNQGSATLLPIEGNYSRGNLMLQRIKTYI 189
>gi|255562818|ref|XP_002522414.1| conserved hypothetical protein [Ricinus communis]
gi|223538299|gb|EEF39906.1| conserved hypothetical protein [Ricinus communis]
Length = 205
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/124 (54%), Positives = 85/124 (68%), Gaps = 8/124 (6%)
Query: 15 MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQR 72
MR G RRF+ FFL LV F ++ SVLEL+ SV E L + +K HL+LGPAA Q
Sbjct: 1 MRTWSGRRRFILCFFLLLVIF--HIFSVLELYSNSVTEALQKNRNKKSYHLVLGPAASQG 58
Query: 73 LPNRLQCQGLKALNKT----SFQASSIGNSISFVTVFTIYNTSLDVHVDSRASNMVTVGN 128
LPNRLQC+G KALNKT S S++ ++++FVTVFTIYNTSLD D R+SN+V VGN
Sbjct: 59 LPNRLQCEGSKALNKTHLLDSSSDSNVRDNVAFVTVFTIYNTSLDSFPDDRSSNLVIVGN 118
Query: 129 ASYS 132
Y+
Sbjct: 119 VFYT 122
>gi|356575160|ref|XP_003555710.1| PREDICTED: uncharacterized protein LOC100806135 [Glycine max]
Length = 146
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/75 (72%), Positives = 60/75 (80%)
Query: 283 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342
S+KY NASRMLGDQLALA V S P FD +F KA F EDI G+S+LFLPC+ YNWT P
Sbjct: 23 STKYRNASRMLGDQLALASVEMSKPHFDTSKFAKALAFSEDIGGSSILFLPCSMYNWTLP 82
Query: 343 EGAGQFHGMPLDVKV 357
EGAGQFHGMPLDVK+
Sbjct: 83 EGAGQFHGMPLDVKI 97
>gi|414867606|tpg|DAA46163.1| TPA: hypothetical protein ZEAMMB73_544883 [Zea mays]
Length = 379
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/75 (64%), Positives = 62/75 (82%), Gaps = 1/75 (1%)
Query: 196 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 255
+ FLE+++ E + + +NH+V TDSDIAVVDDLGHIF Y +FHLA+TFRNNK+QPLN
Sbjct: 306 FTAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIFEKYPHFHLAVTFRNNKEQPLN 364
Query: 256 SGFIAVRGTPDGISR 270
SGF+AVRGT DGI++
Sbjct: 365 SGFVAVRGTRDGITK 379
>gi|67920764|ref|ZP_00514283.1| hypothetical protein CwatDRAFT_5283 [Crocosphaera watsonii WH 8501]
gi|67856881|gb|EAM52121.1| hypothetical protein CwatDRAFT_5283 [Crocosphaera watsonii WH 8501]
Length = 316
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 114/265 (43%), Gaps = 27/265 (10%)
Query: 115 HVDSR---ASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR 171
HVDS A + N + + ++N+ + + P +LTD + L+
Sbjct: 8 HVDSSKETAKKIYNQDNKDFRNDYNYILLINLLFRSVSIFHPNCRKVVLTDMNTRLAGLE 67
Query: 172 KGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLG 231
+ +Y + +M R+ + ++ + Q + + DSD+ V +L
Sbjct: 68 DDIEVY--RTSLDPESIMFSRLVAQFNYV--------KTQQIDSDIILIDSDMLVNANLE 117
Query: 232 HIFHDYQNFHLALTFR---NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYM- 287
H+F ++F +ALT+R KD P+N G I + + D A FLE+V ++Y KY+
Sbjct: 118 HLFE--EDFSVALTYRYLEAVKDMPINGGIIFL--SRDRKQEAIKFLEKVYQIYQEKYLK 173
Query: 288 NASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQ 347
+ GDQ AL + FD F Q V + + L C YN++P
Sbjct: 174 DYQSWSGDQYALIDAI----GFD--NFNSRQSDVMLVDEQKIKLLDCEIYNFSPDRNPNS 227
Query: 348 FHGMPLDVKVVHFKGSRKRLMLESW 372
D ++HFKGSRK++M W
Sbjct: 228 IVREHKDKVILHFKGSRKKIMPLYW 252
>gi|416379625|ref|ZP_11683920.1| hypothetical protein CWATWH0003_0757 [Crocosphaera watsonii WH
0003]
gi|357265857|gb|EHJ14567.1| hypothetical protein CWATWH0003_0757 [Crocosphaera watsonii WH
0003]
Length = 316
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 117/270 (43%), Gaps = 27/270 (10%)
Query: 110 TSLDVHVDSR---ASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASD 166
T + HVDS A + N + + ++N+ + + P +LTD +
Sbjct: 3 TFVTFHVDSSKETAKKIYNQDNKDFRNDYNYILLINLLFRSVSIFHPNCRKVVLTDMNTR 62
Query: 167 LSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAV 226
L+ + +Y + +M R+ + +++ + Q I + DSD+ V
Sbjct: 63 LAGLEDDIEVY--RTSLDPESIMFSRLVAQFNYVKTQ-----QIDSDI---ILIDSDMLV 112
Query: 227 VDDLGHIFHDYQNFHLALTFR---NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYS 283
+L H+F ++F +ALT+R KD P+N G I + + D A FLE+V ++Y
Sbjct: 113 NANLEHLFE--EDFSVALTYRYLEAVKDMPINGGIIFL--SRDRKQEAIKFLEKVYQIYQ 168
Query: 284 SKYM-NASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 342
KY+ + GDQ AL + FD F Q V + + L C YN++P
Sbjct: 169 EKYLKDYQSWWGDQYALIDAI----GFD--NFHSRQSDVMLVDEQKIKLLDCEIYNFSPG 222
Query: 343 EGAGQFHGMPLDVKVVHFKGSRKRLMLESW 372
D ++HFKGSRK++M W
Sbjct: 223 RNPNSIVREHKDKVILHFKGSRKKIMPLYW 252
>gi|297620616|ref|YP_003708753.1| hypothetical protein wcw_0375 [Waddlia chondrophila WSU 86-1044]
gi|297375917|gb|ADI37747.1| hypothetical protein wcw_0375 [Waddlia chondrophila WSU 86-1044]
gi|337292759|emb|CCB90764.1| putative uncharacterized protein [Waddlia chondrophila 2032/99]
Length = 259
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 22/164 (13%)
Query: 218 VFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD-----QPLNSGFIAVRGTPDGISRAK 272
VF D D+ + + +F +NF LAL +R + + P+N GFI + P+G ++A
Sbjct: 106 VFCDYDLLFQESIELLFK--ENFDLALIYRKSFEGGLHPAPINGGFIGIH--PEGFTKAI 161
Query: 273 IFLEEVLRVYSSKYMNASRMLGDQLALAWVV---KSHPSFDARRFTKAQPFVEDIIGASV 329
FLE V Y Y G Q +L ++ K H +F + GA +
Sbjct: 162 NFLETVHSCYLENYSEYKEWGGFQSSLNKLLVPKKVHNAFPNHLIYE---------GAEI 212
Query: 330 LFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWN 373
LP + YN+ E G++ D K++HFKG RK +M WN
Sbjct: 213 ALLPSSEYNYAI-EAQGEWVDFKPDKKILHFKGPRKEVMANYWN 255
>gi|384252921|gb|EIE26396.1| hypothetical protein COCSUDRAFT_39504 [Coccomyxa subellipsoidea
C-169]
Length = 333
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 111/263 (42%), Gaps = 60/263 (22%)
Query: 144 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR--------IRS 195
FI+ ++ + P + +LTD + + +P V +Y Y+ D+ L R +
Sbjct: 50 FISALRRSNPGCTIVVLTDQGTQIELP-PDVRLY----RYAIDRSKLGRNPYANYYQYLA 104
Query: 196 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 255
I+FL+ + ++G VF D DI V+D L +F + F A+T + D P+N
Sbjct: 105 QISFLQHLM---AKGLAQSMDVVFLDMDILVIDSLAEVFKEGPGFDYAVTLSDAVDMPVN 161
Query: 256 SG--FIAVRGTPDGISRAKIFLEEVLRVY--SSKYMNASRMLGDQLALAWVVKSHPSFDA 311
G F+ P ++ FLE+VL VY + +++ LG+ + L + D
Sbjct: 162 IGMQFVHHGRYPGAVA----FLEDVLAVYPFNETFVSGQVALGNLIGLRYN-------DE 210
Query: 312 RRFTKAQPFVEDIIGA----------SVLFLPCATYNWTPPEGAGQFHGMPLD------- 354
+ T + V D A SV FLPC YN+ AGQ D
Sbjct: 211 QLLTHYKSAVRDRSSAKQVRGRHSVHSVRFLPCMRYNYC---HAGQSCCTDPDRLPVSVT 267
Query: 355 ---------VKVVHFKGSRKRLM 368
VKV+HF G RK+ +
Sbjct: 268 TEEELTDTRVKVLHFVGHRKKAL 290
>gi|297606040|ref|NP_001057915.2| Os06g0571300 [Oryza sativa Japonica Group]
gi|255677159|dbj|BAF19829.2| Os06g0571300, partial [Oryza sativa Japonica Group]
Length = 73
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 38/58 (65%), Gaps = 2/58 (3%)
Query: 143 VFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFL 200
+FI +QV+MP+S+V ILTDP S L+ I PI G YSR LM QRIRSYI L
Sbjct: 4 LFIEPLQVSMPRSNVIILTDPNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYIVSL 59
>gi|412992455|emb|CCO18435.1| unknown protein [Bathycoccus prasinos]
Length = 401
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 109/238 (45%), Gaps = 36/238 (15%)
Query: 153 PKSDVFILTDPASDLSMPRKGVTIYPIH---GEYSRDK-----LMLQRIRSYITFLERRI 204
P + V ++TD +++ M + G+ +H G R K LML+R++ Y F+ +R
Sbjct: 156 PGTCVALITDEETEIDMSKPGMDKVQLHRFEGILDRTKIGTGALMLERMKLYNAFI-KRA 214
Query: 205 REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGT 264
R++ + D+DI V D+ +F +NF +T R+NK P+ G V+
Sbjct: 215 RDNDWNA----DLLMVDTDIVFVGDVSDLFQ-TRNFDYGVTIRDNKAYPVQGG---VQFV 266
Query: 265 PDG--ISRAKIFLEEVLRVYSSKYMNASR---MLGDQLALAWVVKSHPSFDARRFTKAQ- 318
P G + AK F + L ++ S + + GDQ A + P+ + K +
Sbjct: 267 PKGKYVGAAK-FSDHTLDLWKSDLEKSGKEAGFTGDQAAYQRGLNV-PASKVQSLAKGKK 324
Query: 319 ----PFVEDIIGASVL---FLPCATYNWTPPEGAGQFHGM-PLDVKVVHFKGSRKRLM 368
P V GA V+ +P YN+ P +G G+ D++++H+KG +K M
Sbjct: 325 VIDLPVVCGSSGAEVVTVRMIPGDQYNFVP---SGNGQGLKKKDIRILHYKGGKKEGM 379
>gi|384252787|gb|EIE26262.1| hypothetical protein COCSUDRAFT_64411 [Coccomyxa subellipsoidea
C-169]
Length = 477
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 97/247 (39%), Gaps = 32/247 (12%)
Query: 144 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERR 203
FI+ ++ + P V +LTD A+ + +P + R KL +Y +L +
Sbjct: 190 FISTLRRSHPGCTVAVLTDQATQIDLP---ADVRLFRFTIDRSKLGRNPYANYYQYLAQI 246
Query: 204 I---REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIA 260
+ ++G H VF D D VVD + +F F LT + D P+N I
Sbjct: 247 AFMKQLAAEGLEHSTDVVFLDMDALVVDSIAEVFGQGAQFDYGLTLSDATDMPVN---IG 303
Query: 261 VRGTPDG-ISRAKIFLEEVLRVY--SSKYMNASRMLGDQLALAWVVKSHPSFDARRF--- 314
++ P G A FL++V+ +Y +S + L D L K P R
Sbjct: 304 IQFVPRGRYGSAIAFLQDVIAIYPFNSTFTAGQEALTDLLGF----KDDPEEVLSRVNIS 359
Query: 315 TKAQPFVEDIIGASVLFLPCATYNWT---------P---PEGAGQFHGMPLD-VKVVHFK 361
+ + + G +V C YN+ P P F + VKV+HF
Sbjct: 360 VQEGRTCQQVGGRTVCLFTCMRYNYCHVDQSCCTDPARLPVSLTSFDDLAAARVKVLHFV 419
Query: 362 GSRKRLM 368
G RK+ +
Sbjct: 420 GHRKKAL 426
>gi|381204432|ref|ZP_09911503.1| hypothetical protein SclubJA_02255, partial [SAR324 cluster
bacterium JCVI-SC AAA005]
Length = 176
Score = 45.8 bits (107), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 69/158 (43%), Gaps = 22/158 (13%)
Query: 105 FTIYNTSLDVHVDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPA 164
F ++ L+ V A N+ N + LN+ + ++ P++D+++LTD
Sbjct: 9 FVAFHVDLESKVLKEAQNVAKAVNDDNHEEN-----LNLMFSSVKRIYPEADLYVLTDTK 63
Query: 165 SDLSMPRKGVTI-YPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSD 223
S S I Y + Y +L R +++ FLE+ + +F DSD
Sbjct: 64 SKFSENTISKLIRYDLDSRYP----ILARNKAWYKFLEKTDKST----------IFLDSD 109
Query: 224 IAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAV 261
I + D+ + +F +A TFR+ K P+N G I V
Sbjct: 110 ILINDNFDELMS--VDFDIAFTFRDWKKWPINLGIIYV 145
>gi|182419850|ref|ZP_02951090.1| conserved hypothetical protein [Clostridium butyricum 5521]
gi|237666660|ref|ZP_04526645.1| YfnD [Clostridium butyricum E4 str. BoNT E BL5262]
gi|182376398|gb|EDT73980.1| conserved hypothetical protein [Clostridium butyricum 5521]
gi|237657859|gb|EEP55414.1| YfnD [Clostridium butyricum E4 str. BoNT E BL5262]
Length = 313
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/166 (21%), Positives = 70/166 (42%), Gaps = 25/166 (15%)
Query: 198 TFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ----- 252
FLE I ++S +Y D+D+ ++ IF++ N + LT N ++
Sbjct: 89 VFLEYIINKYSDAV----YYAHVDADLFFFSNIDSIFNENSNASIFLTDHRNSEEFMHYY 144
Query: 253 ----PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA-WVVKSHP 307
N+GF+ + T +G + K++ + L+ +++Y ++ GDQ + W+
Sbjct: 145 ELSGQFNTGFVGFKNTDEGKAAIKLWGDRCLKRCTAEYDTINKTFGDQRYVEDWI----D 200
Query: 308 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPL 353
F K+ IGA+V F Y ++ + + PL
Sbjct: 201 IFKDVHVVKS-------IGANVAFWNVKNYEFSKVDDLIYVNNKPL 239
>gi|440753032|ref|ZP_20932235.1| hypothetical protein O53_1407 [Microcystis aeruginosa TAIHU98]
gi|440177525|gb|ELP56798.1| hypothetical protein O53_1407 [Microcystis aeruginosa TAIHU98]
Length = 311
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 13/109 (11%)
Query: 192 RIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD 251
++R R R S G + H++F DSDI V+D L +F Y N L +
Sbjct: 75 KVRGIDDIHNHRFRIFSIFWGPLEHFIFLDSDIIVLDSLQELFRTYINSELEFMY----- 129
Query: 252 QPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA 300
RG D + + F ++++R Y + NA + + A +
Sbjct: 130 --------YYRGIFDQVYKEGEFRDKMIREYRANGFNAGSFISSRGAFS 170
>gi|425436184|ref|ZP_18816622.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389679139|emb|CCH92045.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 311
Score = 37.7 bits (86), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 13/109 (11%)
Query: 192 RIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD 251
++R R R S G + H++F DSDI V+D L +F Y N L +
Sbjct: 75 KVRGIDDIHNHRFRIFSIFWGPLEHFIFLDSDIIVLDSLQELFRTYINSELEFMY----- 129
Query: 252 QPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA 300
RG D + + F ++++R + + NA L + A +
Sbjct: 130 --------YYRGIFDQVYKEGEFRDKMIREHRANGFNAGSFLSSRGAFS 170
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.139 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,043,957,719
Number of Sequences: 23463169
Number of extensions: 246076438
Number of successful extensions: 635444
Number of sequences better than 100.0: 55
Number of HSP's better than 100.0 without gapping: 41
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 635302
Number of HSP's gapped (non-prelim): 60
length of query: 398
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 253
effective length of database: 8,957,035,862
effective search space: 2266130073086
effective search space used: 2266130073086
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 78 (34.7 bits)