BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017999
(362 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225454022|ref|XP_002281030.1| PREDICTED: uncharacterized protein LOC100259142 [Vitis vinifera]
gi|296089202|emb|CBI38905.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 276/388 (71%), Positives = 306/388 (78%), Gaps = 40/388 (10%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLP 74
MR C G RR L+ LP V F+PY LSVLELH+ S +E +K+ +KFDHL+LGPAAGQ L
Sbjct: 1 MRVCSGWRRRLYCLPFVLFIPYFLSVLELHQSSTIEGSQKKHSKKFDHLVLGPAAGQGLH 60
Query: 75 NRLQCQ----------------------------------------DSRASNMVTVGNAS 94
+RLQCQ D R+S++VTVGNAS
Sbjct: 61 DRLQCQGTKALNKTHIATSSHESNFGESIALITVFTIYNSSLALHADGRSSDLVTVGNAS 120
Query: 95 YSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLML 154
YSK ERSMAILNVFINFIQ TMP+S+V ILTDPAS+ S+ R VTIYPI GEYSRDKLML
Sbjct: 121 YSKMERSMAILNVFINFIQATMPQSNVIILTDPASEFSLHRDRVTIYPIQGEYSRDKLML 180
Query: 155 QRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNK 214
QRIRSYI FLE ++ EHSQG GHINHY+FTDSDIAVVDDLG IF + NFH+ALTFRNNK
Sbjct: 181 QRIRSYIVFLETKLEEHSQGHGHINHYIFTDSDIAVVDDLGQIFQSHPNFHVALTFRNNK 240
Query: 215 DQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFD 274
+QPLNSGFIAVRGTPDGI RAK+FL+EVL+VYSS++MNASRMLGDQLALAWVVKSHP FD
Sbjct: 241 EQPLNSGFIAVRGTPDGILRAKLFLQEVLKVYSSRFMNASRMLGDQLALAWVVKSHPYFD 300
Query: 275 ARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE 334
+RF+K Q F+EDI G SVLFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE
Sbjct: 301 TKRFSKPQAFLEDIGGTSVLFLPCAIYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLE 360
Query: 335 SWNFFSSSSDISDMLCLILMSGRTKYDF 362
SWNFF SSSDISDMLCLILMSGRTKYDF
Sbjct: 361 SWNFFISSSDISDMLCLILMSGRTKYDF 388
>gi|224129974|ref|XP_002320717.1| predicted protein [Populus trichocarpa]
gi|222861490|gb|EEE99032.1| predicted protein [Populus trichocarpa]
Length = 369
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 262/364 (71%), Positives = 293/364 (80%), Gaps = 35/364 (9%)
Query: 34 LPYL-LSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQ------------ 80
+P+L SVLELH+ + P+K KFDHL+LGPAAGQ LPNRLQCQ
Sbjct: 6 IPFLSFSVLELHQNPAAQPPPKKMNTKFDHLVLGPAAGQGLPNRLQCQGTKALNKTHTRS 65
Query: 81 ----------------------DSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPK 118
DSR SN VTVGNASY+K ERSMA+LNVF+NFI+VTMP+
Sbjct: 66 SSNAGESVSFVTVFTVYNTSLADSRLSNFVTVGNASYTKMERSMAVLNVFVNFIKVTMPR 125
Query: 119 SDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHI 178
S+V ILTDPASDLS+ VT+YPI G+YSRDKLMLQRIRSYITFLE R+ E +Q GHI
Sbjct: 126 SNVVILTDPASDLSLFGNSVTVYPIQGDYSRDKLMLQRIRSYITFLETRLEELAQNPGHI 185
Query: 179 NHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIF 238
NHY+FTDSDIAVVDDLGH+F+D+ NFHLALTFRNNK+QPLNSGFIAVRGT D I RAKIF
Sbjct: 186 NHYIFTDSDIAVVDDLGHLFNDHPNFHLALTFRNNKEQPLNSGFIAVRGTTDAILRAKIF 245
Query: 239 LEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPC 298
L+EVL+VYSSK+M+ASRMLGDQLALAW +KSHP FD RRFTKAQ F+E+I GASVLFLPC
Sbjct: 246 LQEVLKVYSSKFMSASRMLGDQLALAWAIKSHPGFDLRRFTKAQAFLENIGGASVLFLPC 305
Query: 299 ATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRT 358
ATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF SSSSDI DMLCL+L+SGRT
Sbjct: 306 ATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFLSSSSDIFDMLCLVLLSGRT 365
Query: 359 KYDF 362
KYDF
Sbjct: 366 KYDF 369
>gi|255562808|ref|XP_002522409.1| conserved hypothetical protein [Ricinus communis]
gi|223538294|gb|EEF39901.1| conserved hypothetical protein [Ricinus communis]
Length = 388
Score = 529 bits (1362), Expect = e-148, Method: Compositional matrix adjust.
Identities = 267/391 (68%), Positives = 304/391 (77%), Gaps = 46/391 (11%)
Query: 15 MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNR-QKFDHLILGPAAGQ 71
MR G RRF+ FFL LV F ++ SVLELH SV E P+KNR +K DHL+LGPAAGQ
Sbjct: 1 MRTWSGWRRFILCFFLLLVIF--HIFSVLELHSNSVTE-APQKNRNKKSDHLVLGPAAGQ 57
Query: 72 RLPNRLQCQ----------------------------------------DSRASNMVTVG 91
LP+RLQC+ D R+SN+VTVG
Sbjct: 58 GLPDRLQCEGSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSIPDDRSSNLVTVG 117
Query: 92 NASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDK 151
N SYSK ERSMAILNVFINFIQVTMP+S+V ILTDPASDLS+ R VT+YPI GEYSR+K
Sbjct: 118 NVSYSKMERSMAILNVFINFIQVTMPRSNVIILTDPASDLSLQRYKVTLYPIQGEYSREK 177
Query: 152 LMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFR 211
LMLQRI+SYI FL+ +++E ++ H +HY+FTDSDIAVVDDLG IFH+Y NFH+ALTFR
Sbjct: 178 LMLQRIKSYINFLDMKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYPNFHIALTFR 237
Query: 212 NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHP 271
NNK+QPLNSGFIAVRGT + I RAKIFL+ VL VY+SKYMNASRMLGDQLALAWV++SHP
Sbjct: 238 NNKEQPLNSGFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASRMLGDQLALAWVIRSHP 297
Query: 272 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 331
FD RRF KAQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL
Sbjct: 298 GFDLRRFRKAQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 357
Query: 332 MLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
MLESWNFF S+SDISDMLCLILMSGRTKYDF
Sbjct: 358 MLESWNFFRSASDISDMLCLILMSGRTKYDF 388
>gi|255541282|ref|XP_002511705.1| conserved hypothetical protein [Ricinus communis]
gi|223548885|gb|EEF50374.1| conserved hypothetical protein [Ricinus communis]
Length = 388
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 266/391 (68%), Positives = 304/391 (77%), Gaps = 46/391 (11%)
Query: 15 MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNR-QKFDHLILGPAAGQ 71
MR G RRF+ FFL LV F ++ SVLELH SV E P+KNR +K DHL+LGPAAGQ
Sbjct: 1 MRTWSGWRRFILSFFLLLVIF--HIFSVLELHSNSVTE-APQKNRNKKSDHLVLGPAAGQ 57
Query: 72 RLPNRLQCQDSRA----------------------------------------SNMVTVG 91
LP+RLQC+ S+A SN+VTVG
Sbjct: 58 GLPDRLQCEGSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSLPNDRSSNLVTVG 117
Query: 92 NASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDK 151
N SYSKTERSMAILNVFINFIQVTMP+S+V ILTDPASDL + R VT+YPI GEYSR+K
Sbjct: 118 NVSYSKTERSMAILNVFINFIQVTMPQSNVIILTDPASDLLLQRDKVTLYPIQGEYSREK 177
Query: 152 LMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFR 211
LMLQRIRSYI FL+ +++E ++ H +HY+FTDSDIAVVDDLG IFH+Y+NFH+ALTFR
Sbjct: 178 LMLQRIRSYINFLDTKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYRNFHIALTFR 237
Query: 212 NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHP 271
NNK+QPLNSGFIAVRGT + I RAKIFL+ VL VY+SKYMNAS+MLGDQLALAWV++SHP
Sbjct: 238 NNKEQPLNSGFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASQMLGDQLALAWVIRSHP 297
Query: 272 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 331
FD RF KAQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL
Sbjct: 298 GFDLWRFRKAQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRL 357
Query: 332 MLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
MLESWNFF S+SDISDMLCLILMSGRTKYDF
Sbjct: 358 MLESWNFFRSASDISDMLCLILMSGRTKYDF 388
>gi|356545145|ref|XP_003541005.1| PREDICTED: uncharacterized protein LOC100785469 [Glycine max]
Length = 432
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 252/384 (65%), Positives = 289/384 (75%), Gaps = 36/384 (9%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLP 74
M+ G RF+F LPL+F L +L SV ELH S +E+ ++ +K DHL+LGPAAGQ L
Sbjct: 49 MKIFSGWHRFVFGLPLIFLLTHLFSVRELHTNSKMEEPRKQLNKKLDHLVLGPAAGQGLS 108
Query: 75 NRLQCQDSRASNMV------------------------------------TVGNASYSKT 98
NRLQCQ +++ N + VGNASY+K
Sbjct: 109 NRLQCQGTKSLNRIHSSNSRSGVDGSITFVTVFTIYNSSLNDVDDKSLNTIVGNASYNKF 168
Query: 99 ERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIR 158
RSMA+LNVFINFIQV M +S V ILTDP SDLS+ R GV++YPI GEYSRDKLMLQRIR
Sbjct: 169 GRSMALLNVFINFIQVAMRQSKVIILTDPVSDLSVQRNGVSLYPIEGEYSRDKLMLQRIR 228
Query: 159 SYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPL 218
SYITFLE R++ SQ +I HY+FTDSD+AVVDDLG IFHD+ NFH+ALTFRNNK QPL
Sbjct: 229 SYITFLETRLQNLSQKPKNITHYIFTDSDMAVVDDLGQIFHDHPNFHVALTFRNNKAQPL 288
Query: 219 NSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRF 278
NSGFIAVRGTP+ I RAK+FL+EVL+VY++KY NASRMLGDQLALAWVVKS P FDA RF
Sbjct: 289 NSGFIAVRGTPEAILRAKLFLQEVLKVYTTKYKNASRMLGDQLALAWVVKSKPHFDASRF 348
Query: 279 TKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF 338
KA F EDI G SVLFLPC+ YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF
Sbjct: 349 AKAPAFSEDIGGTSVLFLPCSLYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF 408
Query: 339 FSSSSDISDMLCLILMSGRTKYDF 362
+SSS ++SDMLCLIL SGRTKYDF
Sbjct: 409 YSSSLEVSDMLCLILGSGRTKYDF 432
>gi|449432261|ref|XP_004133918.1| PREDICTED: uncharacterized protein LOC101215082 [Cucumis sativus]
gi|449480062|ref|XP_004155788.1| PREDICTED: uncharacterized protein LOC101230110 [Cucumis sativus]
Length = 387
Score = 483 bits (1242), Expect = e-134, Method: Compositional matrix adjust.
Identities = 238/350 (68%), Positives = 271/350 (77%), Gaps = 40/350 (11%)
Query: 53 PRKNRQKFDHLILGPAAGQRLPNRLQC--------------------------------- 79
P K +KFDHLILGPA GQ L +RLQC
Sbjct: 38 PDKRSKKFDHLILGPATGQGLSDRLQCSGTKALNNTHLPDTSNSADSGDSIHFVTVFTIY 97
Query: 80 ---QDS----RASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLS 132
QDS R++++V VG+ASY+K ERSMA+LNVFINFIQV+MP+S+V ILTDPASDL
Sbjct: 98 NASQDSKVIGRSTDVVKVGDASYNKVERSMAVLNVFINFIQVSMPQSNVVILTDPASDLP 157
Query: 133 MPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVD 192
+ R V ++PI GEYSRD LMLQRIRSYI+FL+ ++ E QG HINHY+FTDSD+AVV
Sbjct: 158 VRRNRVAVFPIQGEYSRDTLMLQRIRSYISFLDAKLDEQRQGTTHINHYIFTDSDMAVVG 217
Query: 193 DLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMN 252
DLG IFH + FHLALTFRNNK QPLNSGFIAVRGT DGI RAK FLEEVL++YSS++M
Sbjct: 218 DLGEIFHKHPKFHLALTFRNNKAQPLNSGFIAVRGTEDGIRRAKTFLEEVLKIYSSRFMK 277
Query: 253 ASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQF 312
ASRMLGDQLALAWVV+S+PSFDAR+F+K + FVE+I GASVLFLPCA YNWTPPEGAGQF
Sbjct: 278 ASRMLGDQLALAWVVRSNPSFDARKFSKPETFVEEINGASVLFLPCALYNWTPPEGAGQF 337
Query: 313 HGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
HGMPL+VKVVHFKGSRKRLMLESWNFF SSS ISDMLCLIL SGRTKYDF
Sbjct: 338 HGMPLNVKVVHFKGSRKRLMLESWNFFQSSSSISDMLCLILSSGRTKYDF 387
>gi|357471691|ref|XP_003606130.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
gi|355507185|gb|AES88327.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
Length = 350
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 239/341 (70%), Positives = 266/341 (78%), Gaps = 36/341 (10%)
Query: 58 QKFDHLILGPAAGQRLPNRLQCQDSRASN----------------MVTV----------- 90
+KFDHL+LGPAAGQ L NRLQCQ S+A N VTV
Sbjct: 10 KKFDHLVLGPAAGQGLSNRLQCQGSKALNRTHSSNGRFGVDGSITFVTVFTIYNSSLNRV 69
Query: 91 ---------GNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIY 141
GNASY+K ERSMA+LNVFI+FIQV MP+S+V ILTDP SDLS+ R V++Y
Sbjct: 70 DDKSSNTFVGNASYNKVERSMAVLNVFIDFIQVVMPQSEVIILTDPVSDLSVHRNRVSLY 129
Query: 142 PIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDY 201
PI GEYSRDKLMLQRIRSYITFLE R+++ SQ I HY+FTDSDIAVVDDLG IF D+
Sbjct: 130 PIQGEYSRDKLMLQRIRSYITFLETRLQKLSQNPKDITHYIFTDSDIAVVDDLGQIFRDH 189
Query: 202 QNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQL 261
NFH+ALTFRNNK QPLNSGFIAV+GTPDGI RAK+FL+EVL+VY SKYM+ASRMLGDQL
Sbjct: 190 PNFHMALTFRNNKAQPLNSGFIAVKGTPDGILRAKLFLQEVLKVYVSKYMSASRMLGDQL 249
Query: 262 ALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKV 321
ALAWVVKS P FDA RF K F +DI G S+LFLPCA YNWTPPEGAGQFHGMPLDVKV
Sbjct: 250 ALAWVVKSKPQFDASRFAKTVAFSDDIGGTSILFLPCALYNWTPPEGAGQFHGMPLDVKV 309
Query: 322 VHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
VHFKGSRKRLMLESWNF+SS+ DI+DMLCLIL SGRTKYDF
Sbjct: 310 VHFKGSRKRLMLESWNFYSSTPDIADMLCLILGSGRTKYDF 350
>gi|356517294|ref|XP_003527323.1| PREDICTED: uncharacterized protein LOC100794487 [Glycine max]
Length = 352
Score = 472 bits (1215), Expect = e-131, Method: Compositional matrix adjust.
Identities = 237/352 (67%), Positives = 268/352 (76%), Gaps = 38/352 (10%)
Query: 49 VEDLPRKN-RQKFDHLILGPAAGQRLPNRLQCQDSRASNMV------------------- 88
+E+ PRK +K +HL+LGPAAGQ L NRLQCQ ++A N +
Sbjct: 1 MEEPPRKQLNKKLNHLVLGPAAGQGLSNRLQCQGTKALNRIHSSNSRSGVDGSITFVTVF 60
Query: 89 -----------------TVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDL 131
VGNASY+K RS A+LNVFINFIQV MP+S V ILTDP SDL
Sbjct: 61 TIYNSSLNDVDDKSSNTVVGNASYNKFGRSTALLNVFINFIQVAMPQSKVIILTDPVSDL 120
Query: 132 SMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQ-GHINHYVFTDSDIAV 190
S+ R GV++YPI GEYSRDKLMLQRIRSYITFLE R++ SQ + +I HY+FTDSDIAV
Sbjct: 121 SVLRNGVSLYPIEGEYSRDKLMLQRIRSYITFLETRLQNLSQKKPKNITHYIFTDSDIAV 180
Query: 191 VDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKY 250
VDDLG IF D+ NFH+ALTFRNNK QPLNSGFIAVRGTP+ I RAK+FL+EVL+VYS+KY
Sbjct: 181 VDDLGQIFRDHPNFHVALTFRNNKAQPLNSGFIAVRGTPEAILRAKLFLQEVLKVYSTKY 240
Query: 251 MNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAG 310
NASRMLGDQLALAWVVKS P FDA RF KA F EDI G SV+FLPC+ YNWTPPEGAG
Sbjct: 241 RNASRMLGDQLALAWVVKSKPHFDASRFGKALAFSEDIGGTSVVFLPCSLYNWTPPEGAG 300
Query: 311 QFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
QFHGMPLD KVVHFKGSRKRLMLESWNF+SSS ++SDMLCLIL SGRTKYDF
Sbjct: 301 QFHGMPLDAKVVHFKGSRKRLMLESWNFYSSSLEVSDMLCLILGSGRTKYDF 352
>gi|297828243|ref|XP_002882004.1| hypothetical protein ARALYDRAFT_903971 [Arabidopsis lyrata subsp.
lyrata]
gi|297327843|gb|EFH58263.1| hypothetical protein ARALYDRAFT_903971 [Arabidopsis lyrata subsp.
lyrata]
Length = 391
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 238/385 (61%), Positives = 283/385 (73%), Gaps = 40/385 (10%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+L + S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLLGISSDSAKRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQDSRA-----------------------------------SNMVT-VGNASYSK 97
+RL C+ ++A SNMV+ VGN +YSK
Sbjct: 70 SDRLHCRGTKALNKTHGSSHVSGAGNGVSFVTVFTVYNTSLGNAKSSNMVSVVGNVTYSK 129
Query: 98 TERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRI 157
ERSMA+LN F FIQVTMPKS+V ILTDPASDLS+ + V + P+ G+YSR LMLQRI
Sbjct: 130 PERSMAVLNAFAYFIQVTMPKSNVVILTDPASDLSIQQSNVMVQPVQGDYSRGNLMLQRI 189
Query: 158 RSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQP 217
RSYITFLE ++ ++ +G INHY+FTDSDIAVVDD+ IF + +FHLALTFRNNKDQP
Sbjct: 190 RSYITFLEMKLEKN---EGGINHYIFTDSDIAVVDDIRAIFDKHPSFHLALTFRNNKDQP 246
Query: 218 LNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARR 277
LNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL WVVKSHPSFDA+R
Sbjct: 247 LNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVWVVKSHPSFDAKR 306
Query: 278 FTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWN 337
FTK Q F ++I GASVLFLPC YNWTPPEGAGQFHGMPLDVK+VHFKGSRKRLMLE+WN
Sbjct: 307 FTKPQAFTQEIAGASVLFLPCVLYNWTPPEGAGQFHGMPLDVKIVHFKGSRKRLMLEAWN 366
Query: 338 FFSSSSDISDMLCLILMSGRTKYDF 362
F+ S+S+I DMLCL+L SGRTKYDF
Sbjct: 367 FYKSTSNIPDMLCLVLGSGRTKYDF 391
>gi|30689992|ref|NP_850432.1| uncharacterized protein [Arabidopsis thaliana]
gi|330255444|gb|AEC10538.1| uncharacterized protein [Arabidopsis thaliana]
Length = 392
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 235/386 (60%), Positives = 280/386 (72%), Gaps = 41/386 (10%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+++ S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQDSRASN-------------------------------------MVTVGNASYS 96
+R C+ ++A N + VGN +YS
Sbjct: 70 SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLGNVKSSNPVSVVGNVTYS 129
Query: 97 KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR 156
K ERSMA+LN F NFIQVTMPKS+V ILTDPASDLS+ + V + P+ G+YSR LMLQR
Sbjct: 130 KPERSMAVLNAFANFIQVTMPKSNVVILTDPASDLSIQQSNVILQPVQGDYSRGNLMLQR 189
Query: 157 IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ 216
IRSYITFLE ++ ++ +G INHY+FTDSDIAVVDD+G IF + +FHLALTFRNNKDQ
Sbjct: 190 IRSYITFLEMKLEKN---EGGINHYIFTDSDIAVVDDVGTIFDKHSSFHLALTFRNNKDQ 246
Query: 217 PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDAR 276
PLNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL VVKSH SFDA+
Sbjct: 247 PLNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVSVVKSHASFDAK 306
Query: 277 RFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESW 336
RFTK Q F E+I GASVLFLPCA YNWTPPEGAGQFHGMPLDVK+VHFKGSRKRLMLE+W
Sbjct: 307 RFTKPQAFTEEIAGASVLFLPCALYNWTPPEGAGQFHGMPLDVKIVHFKGSRKRLMLEAW 366
Query: 337 NFFSSSSDISDMLCLILMSGRTKYDF 362
NF+ S+S+I DMLCL+L SGRTKYDF
Sbjct: 367 NFYKSTSNIPDMLCLVLGSGRTKYDF 392
>gi|147854152|emb|CAN83830.1| hypothetical protein VITISV_003973 [Vitis vinifera]
Length = 321
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 216/283 (76%), Positives = 238/283 (84%), Gaps = 18/283 (6%)
Query: 81 DSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTI 140
D R+S++VTVGNASYSK ERSMAILNVFINFIQ TMP+S+V ILTDPAS+ S+ R VTI
Sbjct: 56 DGRSSDLVTVGNASYSKMERSMAILNVFINFIQATMPQSNVIILTDPASEFSLHRDRVTI 115
Query: 141 YPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHD 200
YPI GEYSRDKLMLQRIRSYI FLE ++ EHSQG GHINHY+FTDSDIAVVDDLG IF
Sbjct: 116 YPIQGEYSRDKLMLQRIRSYIVFLETKLEEHSQGHGHINHYIFTDSDIAVVDDLGQIFQS 175
Query: 201 YQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFL-EEVLRVYSSKYMNASRMLGD 259
+ NFH+ALTFRNNK+QPL K ++ +VL+VYSS++MNASRMLGD
Sbjct: 176 HPNFHVALTFRNNKEQPL-----------------KFWIYSKVLKVYSSRFMNASRMLGD 218
Query: 260 QLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDV 319
QLALAWVVKSHP FD +RF+K Q F+EDI G SVLFLPCA YNWTPPEGAGQFHGMPLDV
Sbjct: 219 QLALAWVVKSHPYFDTKRFSKPQAFLEDIGGTSVLFLPCAIYNWTPPEGAGQFHGMPLDV 278
Query: 320 KVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
KVVHFKGSRKRLMLESWNFF SSSDISDMLCLILMSGRTKYDF
Sbjct: 279 KVVHFKGSRKRLMLESWNFFISSSDISDMLCLILMSGRTKYDF 321
>gi|294462546|gb|ADE76819.1| unknown [Picea sitchensis]
Length = 391
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 214/390 (54%), Positives = 266/390 (68%), Gaps = 44/390 (11%)
Query: 17 ACGGCRRFLFFLPLVFFLPYLLSVLEL----HEKSVVEDLPRKNRQKFDHLILGPAAGQR 72
A G RF+ FLP + LP++ S +L + K + R+KFD+++LGPAAGQ
Sbjct: 2 ASSGKWRFIRFLPFILILPFIFSGFQLSRLQNSKPKGDGSVGVGRKKFDYIVLGPAAGQG 61
Query: 73 LPNRLQCQ---------------------------------------DSRASNMVTVGNA 93
LPNR+QCQ D + S V+VGN+
Sbjct: 62 LPNRIQCQGLKAVKRRPLPSFHLSLVKEKISFVTVFTIYNQSLQISFDQKVSTNVSVGNS 121
Query: 94 SYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLM 153
+Y KT+RSMAILNVF NFI+V MP+S++FILTDPAS+ + + I G+YSR+ LM
Sbjct: 122 TYDKTQRSMAILNVFANFIKVAMPRSNIFILTDPASNFPVVPSNAVVMHIPGDYSRNNLM 181
Query: 154 LQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN 213
LQRI+SYI FLE R+ H Q ++H++FTDSDIAVVDDLG + +Y +FH+ LTFRNN
Sbjct: 182 LQRIKSYIDFLEARLSGHIGKQNQVDHFIFTDSDIAVVDDLGDVVENYPDFHIGLTFRNN 241
Query: 214 KDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSF 273
KDQPLNSGFI VRGT + +S+AK FLEEVL +Y S +M A+RMLGDQLALAW+VK+ P F
Sbjct: 242 KDQPLNSGFILVRGTDEAVSKAKAFLEEVLEIYKSMFMKAARMLGDQLALAWIVKNQPLF 301
Query: 274 DARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLML 333
DA+RF + FV ++ A VLFLPCA YNWTPPEGAGQFHGMP DVKV+HFKGSRKRLM+
Sbjct: 302 DAQRFRNPKAFVAEVHRAQVLFLPCAIYNWTPPEGAGQFHGMPEDVKVIHFKGSRKRLMM 361
Query: 334 ESWNFFSSSS-DISDMLCLILMSGRTKYDF 362
ESWNFF+S D SDM+CLIL SGR KYDF
Sbjct: 362 ESWNFFNSHPVDFSDMMCLILKSGRVKYDF 391
>gi|293336758|ref|NP_001169994.1| uncharacterized protein LOC100383899 precursor [Zea mays]
gi|224032791|gb|ACN35471.1| unknown [Zea mays]
gi|413924636|gb|AFW64568.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
Length = 388
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 216/374 (57%), Positives = 262/374 (70%), Gaps = 43/374 (11%)
Query: 30 LVFFLPYLLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRA- 84
L+F +P + SV L EK V P ++ DHL+LGPAAGQ P+RLQC+ RA
Sbjct: 17 LLFLVPLIYSVSRLQPWAPEKGVCLPPPTAPKRP-DHLVLGPAAGQDRPDRLQCRGLRAL 75
Query: 85 ------------------------------------SNMVTVGNASYSKTERSMAILNVF 108
S+ VTVGN+SYSK ERSMAILN F
Sbjct: 76 NKIGISSEENYSGEHVSFATVFTTYNSVSAGDDNVPSDSVTVGNSSYSKIERSMAILNTF 135
Query: 109 INFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRI 168
I+FI+V+MP+SD+ ILTDP S +S+ + T+ P+ G YSR LMLQRI++YI FLE+++
Sbjct: 136 ISFIKVSMPRSDLIILTDPGSKISVNQGTATLLPVEGNYSRGNLMLQRIKTYIAFLEQKL 195
Query: 169 REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGT 228
E + + +NH+V TDSDIAVV DLGHIF Y +FHLA+TFRNNK QPLNSGF+AVRGT
Sbjct: 196 VEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRNNKGQPLNSGFVAVRGT 254
Query: 229 PDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDI 288
DGI+ A FL++VL YS +YM ASRMLGDQLALAWVVKSH +F+K + F ++
Sbjct: 255 RDGITNAVEFLKQVLGTYSLRYMKASRMLGDQLALAWVVKSHLPSAFGKFSKNEAFTGEV 314
Query: 289 IGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDM 348
G SVLFLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF+SS+S +SDM
Sbjct: 315 NGTSVLFLPCAVYNWTPPEGAGQFHGVPLDVKVVHFKGSRKRLMLEAWNFYSSTSKLSDM 374
Query: 349 LCLILMSGRTKYDF 362
LCLIL SGRTKYDF
Sbjct: 375 LCLILRSGRTKYDF 388
>gi|413954355|gb|AFW87004.1| hypothetical protein ZEAMMB73_846695 [Zea mays]
Length = 414
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 206/347 (59%), Positives = 250/347 (72%), Gaps = 38/347 (10%)
Query: 53 PRKNRQKFDHLILGPAAGQRLPNRLQCQDSRASNM------------------------- 87
P ++ DHL+LGPAAGQ P+RLQC+ RA N
Sbjct: 69 PPTAPKRPDHLVLGPAAGQGRPDRLQCRGLRALNKIGLSSEENYSGEHVSFVTVFTTYNS 128
Query: 88 ------------VTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR 135
VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S S+ +
Sbjct: 129 VSAGDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSVNQ 188
Query: 136 KGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLG 195
T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDSDIAVVDDLG
Sbjct: 189 GSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLG 247
Query: 196 HIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASR 255
HIF Y +FHLA+TFRNNK QPLNSGF+AVRGT DGI++A FL++VL+ YS +Y+ A+R
Sbjct: 248 HIFEKYPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITKAAEFLKQVLKAYSLRYIKAAR 307
Query: 256 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGM 315
MLGDQLALAWVVKSH +F+K + F ++ G SVLFLPCA YNWTPPEGAGQFHG+
Sbjct: 308 MLGDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTPPEGAGQFHGI 367
Query: 316 PLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 368 PLDVKVVHFKGSRKRLMLEAWNFYNSTSKMSDMLCLILRSGRTKYDF 414
>gi|413954354|gb|AFW87003.1| hypothetical protein ZEAMMB73_846695 [Zea mays]
Length = 389
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 206/347 (59%), Positives = 250/347 (72%), Gaps = 38/347 (10%)
Query: 53 PRKNRQKFDHLILGPAAGQRLPNRLQCQDSRASNM------------------------- 87
P ++ DHL+LGPAAGQ P+RLQC+ RA N
Sbjct: 44 PPTAPKRPDHLVLGPAAGQGRPDRLQCRGLRALNKIGLSSEENYSGEHVSFVTVFTTYNS 103
Query: 88 ------------VTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR 135
VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S S+ +
Sbjct: 104 VSAGDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSVNQ 163
Query: 136 KGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLG 195
T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDSDIAVVDDLG
Sbjct: 164 GSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLG 222
Query: 196 HIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASR 255
HIF Y +FHLA+TFRNNK QPLNSGF+AVRGT DGI++A FL++VL+ YS +Y+ A+R
Sbjct: 223 HIFEKYPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITKAAEFLKQVLKAYSLRYIKAAR 282
Query: 256 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGM 315
MLGDQLALAWVVKSH +F+K + F ++ G SVLFLPCA YNWTPPEGAGQFHG+
Sbjct: 283 MLGDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTPPEGAGQFHGI 342
Query: 316 PLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 343 PLDVKVVHFKGSRKRLMLEAWNFYNSTSKMSDMLCLILRSGRTKYDF 389
>gi|357471693|ref|XP_003606131.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
gi|355507186|gb|AES88328.1| hypothetical protein MTR_4g053430 [Medicago truncatula]
Length = 251
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 196/251 (78%), Positives = 218/251 (86%)
Query: 112 IQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREH 171
+QV MP+S+V ILTDP SDLS+ R V++YPI GEYSRDKLMLQRIRSYITFLE R+++
Sbjct: 1 MQVVMPQSEVIILTDPVSDLSVHRNRVSLYPIQGEYSRDKLMLQRIRSYITFLETRLQKL 60
Query: 172 SQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDG 231
SQ I HY+FTDSDIAVVDDLG IF D+ NFH+ALTFRNNK QPLNSGFIAV+GTPDG
Sbjct: 61 SQNPKDITHYIFTDSDIAVVDDLGQIFRDHPNFHMALTFRNNKAQPLNSGFIAVKGTPDG 120
Query: 232 ISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGA 291
I RAK+FL+EVL+VY SKYM+ASRMLGDQLALAWVVKS P FDA RF K F +DI G
Sbjct: 121 ILRAKLFLQEVLKVYVSKYMSASRMLGDQLALAWVVKSKPQFDASRFAKTVAFSDDIGGT 180
Query: 292 SVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCL 351
S+LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF+SS+ DI+DMLCL
Sbjct: 181 SILFLPCALYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFYSSTPDIADMLCL 240
Query: 352 ILMSGRTKYDF 362
IL SGRTKYDF
Sbjct: 241 ILGSGRTKYDF 251
>gi|255541260|ref|XP_002511694.1| conserved hypothetical protein [Ricinus communis]
gi|223548874|gb|EEF50363.1| conserved hypothetical protein [Ricinus communis]
Length = 554
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 220/382 (57%), Positives = 252/382 (65%), Gaps = 97/382 (25%)
Query: 29 PLVFFLPYLL-------SVLELHEKSVVEDLPRKN-RQKFDHLILGPAAGQRLPNRLQCQ 80
PL+F + YL SVLELH SV E P+KN +K DHL++GPAAGQ LP+RLQC+
Sbjct: 222 PLLFCVMYLACYTMHVASVLELHWNSVTE-APQKNWNKKSDHLVIGPAAGQGLPDRLQCE 280
Query: 81 ----------------------------------------DSRASNMVTVGNASYSKTER 100
D R+SN+VTVGN SYSKTER
Sbjct: 281 GSKALNKTHLLDSSSGSNVGDNVAFVTVFTIYNTSLDSLPDDRSSNLVTVGNVSYSKTER 340
Query: 101 SMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSY 160
SMAILNVFINFIQ
Sbjct: 341 SMAILNVFINFIQ----------------------------------------------- 353
Query: 161 ITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNS 220
FL+ +++E ++ H +HY+FTDSDIAVVDDLG IFH+Y NFH+ALTFRNNK+QPLNS
Sbjct: 354 -NFLDTKLKELAKNPVHKSHYIFTDSDIAVVDDLGRIFHEYPNFHIALTFRNNKEQPLNS 412
Query: 221 GFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTK 280
GFIAVRGT + I RAKIFL+ VL VY+SKYMNASRMLGDQLALAWV++SHP FD +RF K
Sbjct: 413 GFIAVRGTAESILRAKIFLQHVLEVYTSKYMNASRMLGDQLALAWVIRSHPGFDLQRFRK 472
Query: 281 AQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFS 340
AQ F++++ GASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRK LMLESWNFF
Sbjct: 473 AQAFMDEMGGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKHLMLESWNFFR 532
Query: 341 SSSDISDMLCLILMSGRTKYDF 362
S+SDISDMLCLILMSGRTKYDF
Sbjct: 533 SASDISDMLCLILMSGRTKYDF 554
>gi|357124057|ref|XP_003563723.1| PREDICTED: uncharacterized protein LOC100833864 [Brachypodium
distachyon]
Length = 391
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/339 (60%), Positives = 242/339 (71%), Gaps = 38/339 (11%)
Query: 61 DHLILGPAAGQRLPNRLQCQDSRASN---------------------------------- 86
D L+LGPAAGQ P+RLQCQ +A N
Sbjct: 54 DRLVLGPAAGQGRPDRLQCQGLKAVNKIILSSETTHYGERVSFVTVFTTYNSDPDKASKM 113
Query: 87 ---MVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPI 143
+VTVGN SYSK ERS+A+LN FI+FIQV+MP+S+V ILTDP S+LS+ + I PI
Sbjct: 114 SSGLVTVGNHSYSKVERSIAVLNTFISFIQVSMPRSNVIILTDPKSNLSIDQGNAVILPI 173
Query: 144 HGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQN 203
G YSR LMLQRI+SYI FLE + E Q H+VFTDSDIAVV+ LGHIF Y +
Sbjct: 174 EGNYSRGNLMLQRIKSYIAFLELKFVEL-QRVDRFTHFVFTDSDIAVVEGLGHIFKRYPH 232
Query: 204 FHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLAL 263
HLALTFRNN QPLNSGF+AVRGT DGIS+A F +EVL+ Y+SKYM ASRMLGDQLAL
Sbjct: 233 CHLALTFRNNNGQPLNSGFVAVRGTSDGISKATEFFKEVLKAYNSKYMKASRMLGDQLAL 292
Query: 264 AWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVH 323
AWVVKS+ +F++ + F ++ GAS+LFLPCA YNWTPPEGAGQFHGMPLDVKV+H
Sbjct: 293 AWVVKSYLPSAFGKFSRHEEFTGEVNGASILFLPCAVYNWTPPEGAGQFHGMPLDVKVIH 352
Query: 324 FKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
FKGSRKRLMLE+WNF++S+S +SDMLCLIL SGRTKYDF
Sbjct: 353 FKGSRKRLMLEAWNFYNSTSHLSDMLCLILKSGRTKYDF 391
>gi|242093388|ref|XP_002437184.1| hypothetical protein SORBIDRAFT_10g022560 [Sorghum bicolor]
gi|241915407|gb|EER88551.1| hypothetical protein SORBIDRAFT_10g022560 [Sorghum bicolor]
Length = 377
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/367 (57%), Positives = 248/367 (67%), Gaps = 55/367 (14%)
Query: 37 LLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRA-------- 84
+ SV LH EK V P ++ D L+LGPAAGQ P+RLQCQ RA
Sbjct: 25 IYSVSRLHPWVPEKGVCLPPPTAPKRP-DRLVLGPAAGQGRPDRLQCQGLRALNKIGLSS 83
Query: 85 -----------------------------SNMVTVGNASYSKTERSMAILNVFINFIQVT 115
S+ VTVGN SYSK ERSMAILN FI+FI+V+
Sbjct: 84 EEIYSGEHISFVTVFTTYNSVSAGDGNVPSDSVTVGNHSYSKIERSMAILNTFISFIKVS 143
Query: 116 MPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQ 175
MP+S+V ILTDP S +S+ + T+ PI G YSR LMLQRI++YI
Sbjct: 144 MPRSNVIILTDPGSKISVNQGSATLLPIEGNYSRGNLMLQRIQTYI-------------D 190
Query: 176 GHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRA 235
G + + TDSDIAVVDDLGHIF Y + HLALTFRNNK QPLNSGF+AVRGT DGI++A
Sbjct: 191 GGVESFFLTDSDIAVVDDLGHIFKKYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGITKA 250
Query: 236 KIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLF 295
FL++VL YSS+Y+ ASRMLGDQLALAWVVKSH +F+K + F ++ GASVLF
Sbjct: 251 VEFLKQVLGAYSSRYIKASRMLGDQLALAWVVKSHLPSAFGKFSKHEAFTGEVNGASVLF 310
Query: 296 LPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMS 355
LPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLCLIL S
Sbjct: 311 LPCAVYNWTPPEGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFYNSTSKLSDMLCLILRS 370
Query: 356 GRTKYDF 362
GRTKYDF
Sbjct: 371 GRTKYDF 377
>gi|242060516|ref|XP_002451547.1| hypothetical protein SORBIDRAFT_04g003580 [Sorghum bicolor]
gi|241931378|gb|EES04523.1| hypothetical protein SORBIDRAFT_04g003580 [Sorghum bicolor]
Length = 346
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 190/289 (65%), Positives = 232/289 (80%), Gaps = 2/289 (0%)
Query: 75 NRLQCQDSRA-SNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSM 133
N + D + S+ VTVGN SYSKTERSMAIL+ FI+FI+V+MP+S+V ILTDP S +S+
Sbjct: 59 NSVSAGDGKVPSDSVTVGNHSYSKTERSMAILSTFISFIRVSMPRSNVIILTDPGSKISV 118
Query: 134 PRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDD 193
+ T+ PI G YSR LMLQRI++YI FLE+++ E +G +NH+V TDSDIA+VDD
Sbjct: 119 NQGSATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDSMEG-LNHFVLTDSDIALVDD 177
Query: 194 LGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNA 253
LGHIF Y + HLALTFRNNK QPLNSGF+AVRGT DGI++A FL++VL Y +Y+ A
Sbjct: 178 LGHIFKKYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGITKAVEFLKQVLEAYCLRYIKA 237
Query: 254 SRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFH 313
SRMLGDQLALAWVVKSH +F+K + F ++ GASVLFLPCA YNWTPPEGAGQFH
Sbjct: 238 SRMLGDQLALAWVVKSHLPSAFGKFSKHEAFTGEVNGASVLFLPCAVYNWTPPEGAGQFH 297
Query: 314 GMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
G+PLDVKVVHFKGSRKRLMLE+WNF++S+S +SDMLC+IL SGRTKYDF
Sbjct: 298 GIPLDVKVVHFKGSRKRLMLEAWNFYNSTSKLSDMLCIILRSGRTKYDF 346
>gi|218198407|gb|EEC80834.1| hypothetical protein OsI_23435 [Oryza sativa Indica Group]
Length = 344
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 196/296 (66%), Positives = 223/296 (75%), Gaps = 13/296 (4%)
Query: 67 PAAGQRLPNRLQCQDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 126
PA +LP SN+VTVG SYSK RSMAILN FI FIQV+MP+S+V ILTD
Sbjct: 62 PAEASKLP----------SNVVTVGKHSYSKVGRSMAILNTFIGFIQVSMPRSNVIILTD 111
Query: 127 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 186
P S L+ I PI G YSR LMLQRIRSYI FLE+R+ E + INH +FTDS
Sbjct: 112 PNSKLT--HGSAVILPIEGNYSRGNLMLQRIRSYIAFLEQRLEELETVED-INHLIFTDS 168
Query: 187 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 246
DIAVV DLGHIF Y + HLALTFRNNK QPLNSGF+AVRGT DGI +A F +EVL Y
Sbjct: 169 DIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGIFKAIEFFKEVLEAY 228
Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
KYM ASRMLGDQLALAWVVKS+ +F+K + F ++ G S+LFLPCA YNWTPP
Sbjct: 229 HLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKHEAFTGEVNGTSILFLPCAVYNWTPP 288
Query: 307 EGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
EGAGQFHGMPLDVKVVHFKGSRKRLMLE+WNF++S+S++SDMLCLIL SGRTKYDF
Sbjct: 289 EGAGQFHGMPLDVKVVHFKGSRKRLMLEAWNFYNSTSELSDMLCLILRSGRTKYDF 344
>gi|222635777|gb|EEE65909.1| hypothetical protein OsJ_21755 [Oryza sativa Japonica Group]
Length = 344
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 195/296 (65%), Positives = 222/296 (75%), Gaps = 13/296 (4%)
Query: 67 PAAGQRLPNRLQCQDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 126
PA +LP SN+VTVG SYSK RSMAILN FI FIQV+MP+S+V ILTD
Sbjct: 62 PAEASKLP----------SNVVTVGKHSYSKVGRSMAILNTFIGFIQVSMPRSNVIILTD 111
Query: 127 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 186
P S L+ I PI G YSR LM QRIRSYI FLE+R+ E + INH +FTDS
Sbjct: 112 PNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYIAFLEQRLEELETVE-DINHLIFTDS 168
Query: 187 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 246
DIAVV DLGHIF Y + HLALTFRNNK QPLNSGF+AVRGT DGI +A F +EVL Y
Sbjct: 169 DIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGIFKAIEFFKEVLEAY 228
Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
KYM ASRMLGDQLALAWVVKS+ +F+K + F ++ G S+LFLPCA YNWTPP
Sbjct: 229 YLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKHEAFTGEVNGTSILFLPCAVYNWTPP 288
Query: 307 EGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLILMSGRTKYDF 362
EGAGQFHGMPLDVKVVHFKGSRKRLMLE+WNF++S+S++SDMLCLIL SGRTKYDF
Sbjct: 289 EGAGQFHGMPLDVKVVHFKGSRKRLMLEAWNFYNSTSELSDMLCLILRSGRTKYDF 344
>gi|224101407|ref|XP_002334278.1| predicted protein [Populus trichocarpa]
gi|222870580|gb|EEF07711.1| predicted protein [Populus trichocarpa]
Length = 274
Score = 365 bits (936), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 194/330 (58%), Positives = 221/330 (66%), Gaps = 62/330 (18%)
Query: 34 LPYL-LSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRASNMVTVGN 92
+P+L SVLELH+ + P+K KFDHL+LGPAAGQ LPNRLQCQ
Sbjct: 6 IPFLSFSVLELHQNPAAQPPPKKMNTKFDHLVLGPAAGQGLPNRLQCQGD---------- 55
Query: 93 ASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKL 152
S+ I ++ F QVTMP+S+V ILTDPASDLS+ R VT+YPI G+YSRDKL
Sbjct: 56 --------SVQIHFSYVCF-QVTMPQSNVVILTDPASDLSLHRNSVTVYPIQGDYSRDKL 106
Query: 153 MLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRN 212
MLQRIRSYITFLE R+ + +Q G I+HY+ TDSDIAVVDDLGH+F+D+ F TFR+
Sbjct: 107 MLQRIRSYITFLETRLEKLAQNPGPISHYILTDSDIAVVDDLGHLFNDHPTFTRLFTFRD 166
Query: 213 NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPS 272
NK+QPLNSGFIAV GT D I R
Sbjct: 167 NKEQPLNSGFIAVWGTADAILR-------------------------------------- 188
Query: 273 FDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM 332
RFTKAQ F+E+I G SVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM
Sbjct: 189 ----RFTKAQAFLENIGGTSVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM 244
Query: 333 LESWNFFSSSSDISDMLCLILMSGRTKYDF 362
LESWNF SSSSDI MLCL+L SGRTKYDF
Sbjct: 245 LESWNFLSSSSDIFGMLCLVLSSGRTKYDF 274
>gi|223949095|gb|ACN28631.1| unknown [Zea mays]
gi|413924639|gb|AFW64571.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
Length = 209
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 148/210 (70%), Positives = 174/210 (82%), Gaps = 1/210 (0%)
Query: 153 MLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRN 212
MLQRI++YI FLE+++ E + + +NH+V TDSDIAVV DLGHIF Y +FHLA+TFRN
Sbjct: 1 MLQRIKTYIAFLEQKLVEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRN 59
Query: 213 NKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPS 272
NK QPLNSGF+AVRGT DGI+ A FL++VL YS +YM ASRMLGDQLALAWVVKSH
Sbjct: 60 NKGQPLNSGFVAVRGTRDGITNAVEFLKQVLGTYSLRYMKASRMLGDQLALAWVVKSHLP 119
Query: 273 FDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLM 332
+F+K + F ++ G SVLFLPCA YNWTPPEGAGQFHG+PLDVKVVHFKGSRKRLM
Sbjct: 120 SAFGKFSKNEAFTGEVNGTSVLFLPCAVYNWTPPEGAGQFHGVPLDVKVVHFKGSRKRLM 179
Query: 333 LESWNFFSSSSDISDMLCLILMSGRTKYDF 362
LE+WNF+SS+S +SDMLCLIL SGRTKYDF
Sbjct: 180 LEAWNFYSSTSKLSDMLCLILRSGRTKYDF 209
>gi|2583122|gb|AAB82631.1| hypothetical protein [Arabidopsis thaliana]
Length = 304
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 170/346 (49%), Positives = 204/346 (58%), Gaps = 89/346 (25%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+++ S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQDSRASN-------------------------------------MVTVGNASYS 96
+R C+ ++A N + VGN +YS
Sbjct: 70 SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLGNVKSSNPVSVVGNVTYS 129
Query: 97 KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR 156
K ERSMA+LN F NFIQ
Sbjct: 130 KPERSMAVLNAFANFIQ------------------------------------------- 146
Query: 157 IRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ 216
TFLE ++ ++ +G INHY+FTDSDIAVVDD+G IF + +FHLALTFRNNKDQ
Sbjct: 147 -----TFLEMKLEKN---EGGINHYIFTDSDIAVVDDVGTIFDKHSSFHLALTFRNNKDQ 198
Query: 217 PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDAR 276
PLNSGFIAVRGT +GI RAK+FLEEVL+ Y +KYM ASRMLGDQLAL VVKSH SFDA+
Sbjct: 199 PLNSGFIAVRGTREGILRAKVFLEEVLKAYKTKYMKASRMLGDQLALVSVVKSHASFDAK 258
Query: 277 RFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVV 322
RFTK Q F E+I GASVLFLPCA YNWTPPEGAGQFHGMPLDVKV+
Sbjct: 259 RFTKPQAFTEEIAGASVLFLPCALYNWTPPEGAGQFHGMPLDVKVL 304
>gi|54291155|dbj|BAD61827.1| hypothetical protein [Oryza sativa Japonica Group]
gi|54291236|dbj|BAD61931.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 288
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 152/254 (59%), Positives = 170/254 (66%), Gaps = 27/254 (10%)
Query: 67 PAAGQRLPNRLQCQDSRASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTD 126
PA +LP SN+VTVG SYSK RSMAILN FI FIQV+MP+S+V ILTD
Sbjct: 62 PAEASKLP----------SNVVTVGKHSYSKVGRSMAILNTFIGFIQVSMPRSNVIILTD 111
Query: 127 PASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDS 186
P S L+ I PI G YSR LM QRIRSYI FLE+R+ E + INH +FTDS
Sbjct: 112 PNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYIAFLEQRLEELETVED-INHLIFTDS 168
Query: 187 DIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVY 246
DIAVV DLGHIF Y + HLALTFRNNK QPLNSGF+AVRGT DGI +A F +EVL Y
Sbjct: 169 DIAVVTDLGHIFEMYPHCHLALTFRNNKGQPLNSGFVAVRGTRDGIFKAIEFFKEVLEAY 228
Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
KYM ASRMLGDQLALAWVVKS+ +F+K + F YNWTPP
Sbjct: 229 YLKYMEASRMLGDQLALAWVVKSYLPSAFSKFSKHEAF--------------TVYNWTPP 274
Query: 307 EGAGQFHGMPLDVK 320
EGAGQFHGMPLDVK
Sbjct: 275 EGAGQFHGMPLDVK 288
>gi|302769952|ref|XP_002968395.1| hypothetical protein SELMODRAFT_65657 [Selaginella moellendorffii]
gi|300164039|gb|EFJ30649.1| hypothetical protein SELMODRAFT_65657 [Selaginella moellendorffii]
Length = 280
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 131/280 (46%), Positives = 185/280 (66%), Gaps = 7/280 (2%)
Query: 89 TVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR--KGVTIYPIHGE 146
VG E+ ++L VF+ ++ MP S +LTDPA+ +S R G++ + G
Sbjct: 2 VVGGRVLRGLEKGYSVLRVFVESARLAMPNSQQLVLTDPAAAISTERLPAGISFQRVPGN 61
Query: 147 YSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHL 206
YSR LMLQR+ SYI FL+ +I++ + + H++F DSD+ VV DLG +F ++ +F +
Sbjct: 62 YSRGNLMLQRLDSYIAFLDDQIKQVGKADS-LQHFIFADSDMIVVGDLGCVFLEFPSFDV 120
Query: 207 ALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWV 266
ALTFRNNK+QP+NSG I VRG+ DG+++ K+ L+ V+ Y + ASRM+GDQLA AWV
Sbjct: 121 ALTFRNNKEQPINSGMIFVRGSKDGLAKGKLLLQSVVDSYRRDFFRASRMMGDQLAFAWV 180
Query: 267 VKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKG 326
V+ F + + F + G VLFLPC++YNWTP EGAGQFHGMPLDVK +HFKG
Sbjct: 181 VRHFADPLEDSFKQGKVFKSQVKGVEVLFLPCSSYNWTPAEGAGQFHGMPLDVKAIHFKG 240
Query: 327 SRKRLMLESWNF----FSSSSDISDMLCLILMSGRTKYDF 362
SRKRLMLE+W+ +++ D+ + C +L SGR+KYDF
Sbjct: 241 SRKRLMLEAWDSHKHQVAATKDLLPLQCFVLKSGRSKYDF 280
>gi|302774282|ref|XP_002970558.1| hypothetical protein SELMODRAFT_65658 [Selaginella moellendorffii]
gi|300162074|gb|EFJ28688.1| hypothetical protein SELMODRAFT_65658 [Selaginella moellendorffii]
Length = 331
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 134/282 (47%), Positives = 189/282 (67%), Gaps = 11/282 (3%)
Query: 89 TVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPR--KGVTIYPIHGE 146
VG E+ ++L VF+ ++ MP S +LTDPA+ +S R G++ + G
Sbjct: 53 VVGGRVLRGLEKGYSVLRVFVESARLAMPNSQQLVLTDPAAVISTERLPAGISFQRVPGN 112
Query: 147 YSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHL 206
YSR LMLQR+ SYI FL+ +I++ + + H++F DSD+ VV DLG +F ++ +F +
Sbjct: 113 YSRGNLMLQRLDSYIAFLDDQIKQVGKAD-SLQHFIFADSDMIVVGDLGCVFLEFPSFDV 171
Query: 207 ALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWV 266
ALTFRNNK+QP+NSG I VRG+ DG+++ K+ L+ V+ Y + ASRM+GDQLA AWV
Sbjct: 172 ALTFRNNKEQPINSGMIFVRGSKDGLAKGKLLLQSVVDSYRRDFFRASRMMGDQLAFAWV 231
Query: 267 VK--SHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHF 324
V+ S P D+ F + + F + G VLFLPC++YNWTP EGAGQFHGMPLDVK +HF
Sbjct: 232 VRHFSDPLEDS--FKQGKVFKSQVKGVEVLFLPCSSYNWTPAEGAGQFHGMPLDVKAIHF 289
Query: 325 KGSRKRLMLESWNF----FSSSSDISDMLCLILMSGRTKYDF 362
KGSRKRLMLE+W+ +++ D+ + C +L SGR+KYDF
Sbjct: 290 KGSRKRLMLEAWDSHKHQVAATKDLLPLQCFVLKSGRSKYDF 331
>gi|413938025|gb|AFW72576.1| hypothetical protein ZEAMMB73_448315 [Zea mays]
Length = 450
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 122/184 (66%), Positives = 148/184 (80%), Gaps = 1/184 (0%)
Query: 160 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 219
+ FLE+++ E + + +NH+V TDSDIAVVDDLGHIF Y +FHLA+TFRNNK QPLN
Sbjct: 262 FTAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIFEKYPHFHLAVTFRNNKGQPLN 320
Query: 220 SGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFT 279
SGF+AVRGT DGI++A FL++VL YS +Y+ ASRMLGDQLALAWVVK H +F+
Sbjct: 321 SGFVAVRGTSDGITKAVEFLKQVLGTYSLRYIKASRMLGDQLALAWVVKFHLPSALGKFS 380
Query: 280 KAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFF 339
K + F ++ G SVLFLPCA YNWT PEGAGQFHG+PLDVKVVHFKGSRKRLMLE+WNF+
Sbjct: 381 KHEAFTGEVNGTSVLFLPCAVYNWTQPEGAGQFHGIPLDVKVVHFKGSRKRLMLEAWNFY 440
Query: 340 SSSS 343
+S S
Sbjct: 441 NSFS 444
>gi|414866083|tpg|DAA44640.1| TPA: putative serine/threonine protein phosphatase superfamily
protein [Zea mays]
Length = 470
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 118/183 (64%), Positives = 143/183 (78%), Gaps = 1/183 (0%)
Query: 138 VTIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHI 197
T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDSDIAVVDDLGHI
Sbjct: 210 ATLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHI 268
Query: 198 FHDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRML 257
F Y +FHLA+TF NNK QPLNSGF+AVRGT DGI++A FL++VL YS +Y+ ASRML
Sbjct: 269 FEKYPHFHLAVTFCNNKGQPLNSGFVAVRGTRDGITKAVEFLKQVLGTYSLRYIKASRML 328
Query: 258 GDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPL 317
GDQLALAWVVKSH +F+K + F ++ G SVLFLPC YNWTPPEGAGQFHG+PL
Sbjct: 329 GDQLALAWVVKSHLPSALGKFSKHEAFTGEVNGTSVLFLPCVVYNWTPPEGAGQFHGIPL 388
Query: 318 DVK 320
DVK
Sbjct: 389 DVK 391
>gi|168008832|ref|XP_001757110.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691608|gb|EDQ77969.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 121/271 (44%), Positives = 170/271 (62%), Gaps = 7/271 (2%)
Query: 97 KTERSMAILNVFINFIQ-VTMP-KSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLML 154
K R A+L F+ IQ V+MP + V I+T+ + + P +SR LM+
Sbjct: 40 KKSRQDAVLRAFLESIQQVSMPGTTRVTIITNHNKLRGELPQDIDWKPTSRHFSRRNLMI 99
Query: 155 QRIRSYITFLERRIREHSQGQGH-INHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNN 213
QR++SYI L+ I + ++H +F+D D+ VVDDLG +F ++ +F +A TFRNN
Sbjct: 100 QRLQSYIELLDSMIEDRKNNSSSPVSHAIFSDFDMIVVDDLGCVFKEFPHFDIAFTFRNN 159
Query: 214 KDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSF 273
+ QP+NSG I VRGT +SRA L+EV+++Y +K+ +A +LGDQLALA +VK +
Sbjct: 160 QRQPINSGVIMVRGTFGSLSRATQLLKEVVKIYLAKFRHAFGVLGDQLALADIVKG--TL 217
Query: 274 DARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLML 333
AR F + P ++ LFLPC YNWTPPEGAGQF GMP +VKV+HFKG RKRLM+
Sbjct: 218 QARAFQEGVPVEATVMTTKTLFLPCVIYNWTPPEGAGQFQGMPTEVKVLHFKGRRKRLMI 277
Query: 334 ESWNFFSSSS--DISDMLCLILMSGRTKYDF 362
++W F+ D M CL+L SGR+KYD+
Sbjct: 278 QAWYFYKKQGVLDFYKMKCLVLKSGRSKYDY 308
>gi|413924637|gb|AFW64569.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
gi|413924638|gb|AFW64570.1| hypothetical protein ZEAMMB73_896032 [Zea mays]
Length = 260
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/246 (49%), Positives = 156/246 (63%), Gaps = 43/246 (17%)
Query: 30 LVFFLPYLLSVLELH----EKSVVEDLPRKNRQKFDHLILGPAAGQRLPNRLQCQDSRA- 84
L+F +P + SV L EK V P ++ DHL+LGPAAGQ P+RLQC+ RA
Sbjct: 17 LLFLVPLIYSVSRLQPWAPEKGVCLPPPTAPKRP-DHLVLGPAAGQDRPDRLQCRGLRAL 75
Query: 85 ------------------------------------SNMVTVGNASYSKTERSMAILNVF 108
S+ VTVGN+SYSK ERSMAILN F
Sbjct: 76 NKIGISSEENYSGEHVSFATVFTTYNSVSAGDDNVPSDSVTVGNSSYSKIERSMAILNTF 135
Query: 109 INFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERRI 168
I+FI+V+MP+SD+ ILTDP S +S+ + T+ P+ G YSR LMLQRI++YI FLE+++
Sbjct: 136 ISFIKVSMPRSDLIILTDPGSKISVNQGTATLLPVEGNYSRGNLMLQRIKTYIAFLEQKL 195
Query: 169 REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGT 228
E + + +NH+V TDSDIAVV DLGHIF Y +FHLA+TFRNNK QPLNSGF+AVRGT
Sbjct: 196 VEFDRME-RLNHFVLTDSDIAVVGDLGHIFKKYPHFHLAVTFRNNKGQPLNSGFVAVRGT 254
Query: 229 PDGISR 234
DGI++
Sbjct: 255 RDGITK 260
>gi|255562810|ref|XP_002522410.1| hypothetical protein RCOM_0835860 [Ricinus communis]
gi|223538295|gb|EEF39902.1| hypothetical protein RCOM_0835860 [Ricinus communis]
Length = 202
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 103/130 (79%), Positives = 112/130 (86%), Gaps = 6/130 (4%)
Query: 233 SRAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGAS 292
RAKIFL+ VL VY+SKYMNASR ALAWV++SHP FD RRF KAQ F++++ GAS
Sbjct: 79 CRAKIFLQHVLEVYTSKYMNASR------ALAWVIRSHPGFDLRRFHKAQAFMDEMGGAS 132
Query: 293 VLFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLI 352
LFLPCA YNWTPPEGAGQFH MPLDVKVVHFKGSRKRLMLESWNFF S+SDISDMLCLI
Sbjct: 133 ALFLPCAIYNWTPPEGAGQFHRMPLDVKVVHFKGSRKRLMLESWNFFRSASDISDMLCLI 192
Query: 353 LMSGRTKYDF 362
LMSGRTKYDF
Sbjct: 193 LMSGRTKYDF 202
>gi|414587497|tpg|DAA38068.1| TPA: hypothetical protein ZEAMMB73_303828 [Zea mays]
Length = 258
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 140/216 (64%), Gaps = 35/216 (16%)
Query: 53 PRKNRQKFDHLILGPAAGQRLPNR----------LQCQDSRAS----------------- 85
P ++ DHL+LGPAAGQ P+R L +++ +
Sbjct: 44 PPTAPKRPDHLVLGPAAGQGRPDRRRLRALNKIGLSSEENYSGEHVPFVTVFTTYNSVSA 103
Query: 86 -------NMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGV 138
+ VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S S+ +
Sbjct: 104 GDGNVPPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSVNQGSA 163
Query: 139 TIYPIHGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIF 198
T+ PI G YSR LMLQRI++YI FLE+++ E + + +NH+V TDSDIAVVDDLGHIF
Sbjct: 164 TLLPIEGNYSRGNLMLQRIKTYIAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIF 222
Query: 199 HDYQNFHLALTFRNNKDQPLNSGFIAVRGTPDGISR 234
+FHLA+TFRNNK QPLNSGF+AVRGT DGI++
Sbjct: 223 EKNPHFHLAVTFRNNKGQPLNSGFVAVRGTRDGITK 258
>gi|357520759|ref|XP_003630668.1| hypothetical protein MTR_8g102040 [Medicago truncatula]
gi|355524690|gb|AET05144.1| hypothetical protein MTR_8g102040 [Medicago truncatula]
Length = 105
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 82/129 (63%), Positives = 94/129 (72%), Gaps = 25/129 (19%)
Query: 234 RAKIFLEEVLRVYSSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASV 293
+ K+FL+EVL+VY SKYM+ ++ L PF +DI G S+
Sbjct: 2 QGKLFLQEVLKVYVSKYMSVAKTL-------------------------PFSDDIGGTSI 36
Query: 294 LFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFFSSSSDISDMLCLIL 353
LFLPCA YNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNF+SS+ DI+DMLCLIL
Sbjct: 37 LFLPCALYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWNFYSSTPDIADMLCLIL 96
Query: 354 MSGRTKYDF 362
SGRTKYDF
Sbjct: 97 GSGRTKYDF 105
>gi|51968566|dbj|BAD42975.1| hypothetical protein [Arabidopsis thaliana]
Length = 200
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 81/185 (43%), Positives = 105/185 (56%), Gaps = 38/185 (20%)
Query: 15 MRACGGCRRFLFFLPLVFFLPYLLSVLELHEKSVVEDLPRK-NRQKFDHLILGPAAGQRL 73
MR+C G RR L +P++F LP+L S+++ S + R +K DHL+LGP AGQ L
Sbjct: 10 MRSCSGWRRILLLIPVLFLLPHLSSLVDFSSDSATRNDARTIPNKKLDHLVLGPVAGQGL 69
Query: 74 PNRLQCQDSRASN-------------------------------------MVTVGNASYS 96
+R C+ ++A N + VGN +YS
Sbjct: 70 SDRFHCRGTKALNKTHGSTSHVSGAGNGVSFVTVFTVYNTSLGNVKSSNPVSVVGNVTYS 129
Query: 97 KTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR 156
K ERSMA+LN F NFIQVTMPKS+V ILTDPASDLS+ + V + P+ G+YSR LMLQR
Sbjct: 130 KPERSMAVLNAFANFIQVTMPKSNVVILTDPASDLSIQQSNVILQPVQGDYSRGNLMLQR 189
Query: 157 IRSYI 161
IRSYI
Sbjct: 190 IRSYI 194
>gi|413942906|gb|AFW75555.1| hypothetical protein ZEAMMB73_119492 [Zea mays]
Length = 177
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 61/84 (72%), Positives = 67/84 (79%), Gaps = 1/84 (1%)
Query: 256 MLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGM 315
MLGDQLALAWVVK H +F+K + F ++ G SVLFLPCA YNWT PEGAGQFHG+
Sbjct: 1 MLGDQLALAWVVKFHLPSALGKFSKHEAFTGEVNGTSVLFLPCAVYNWTSPEGAGQFHGI 60
Query: 316 PLDVKVVHFKGSRKRLMLES-WNF 338
PLDVKVVHFKGSRKRLMLE WNF
Sbjct: 61 PLDVKVVHFKGSRKRLMLERLWNF 84
>gi|356575160|ref|XP_003555710.1| PREDICTED: uncharacterized protein LOC100806135 [Glycine max]
Length = 146
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/75 (72%), Positives = 60/75 (80%)
Query: 247 SSKYMNASRMLGDQLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPP 306
S+KY NASRMLGDQLALA V S P FD +F KA F EDI G+S+LFLPC+ YNWT P
Sbjct: 23 STKYRNASRMLGDQLALASVEMSKPHFDTSKFAKALAFSEDIGGSSILFLPCSMYNWTLP 82
Query: 307 EGAGQFHGMPLDVKV 321
EGAGQFHGMPLDVK+
Sbjct: 83 EGAGQFHGMPLDVKI 97
>gi|414867606|tpg|DAA46163.1| TPA: hypothetical protein ZEAMMB73_544883 [Zea mays]
Length = 379
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/75 (64%), Positives = 62/75 (82%), Gaps = 1/75 (1%)
Query: 160 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 219
+ FLE+++ E + + +NH+V TDSDIAVVDDLGHIF Y +FHLA+TFRNNK+QPLN
Sbjct: 306 FTAFLEQKLVEFDRTE-RLNHFVLTDSDIAVVDDLGHIFEKYPHFHLAVTFRNNKEQPLN 364
Query: 220 SGFIAVRGTPDGISR 234
SGF+AVRGT DGI++
Sbjct: 365 SGFVAVRGTRDGITK 379
>gi|414886764|tpg|DAA62778.1| TPA: putative homeodomain-like transcription factor superfamily
protein [Zea mays]
Length = 2379
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/88 (57%), Positives = 63/88 (71%), Gaps = 1/88 (1%)
Query: 75 NRLQCQDSRAS-NMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSM 133
N + D S + VTVGN+SYSK ERSM ILN FI+FI+V+MP+SDV ILTDP S S+
Sbjct: 102 NSVSAGDGNVSPDSVTVGNSSYSKIERSMTILNTFISFIKVSMPRSDVIILTDPGSKFSV 161
Query: 134 PRKGVTIYPIHGEYSRDKLMLQRIRSYI 161
+ T+ PI G YSR LMLQRI++YI
Sbjct: 162 NQGSATLLPIEGNYSRGNLMLQRIKTYI 189
>gi|255562818|ref|XP_002522414.1| conserved hypothetical protein [Ricinus communis]
gi|223538299|gb|EEF39906.1| conserved hypothetical protein [Ricinus communis]
Length = 205
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 60/97 (61%), Gaps = 5/97 (5%)
Query: 15 MRACGGCRRFL--FFLPLVFFLPYLLSVLELHEKSVVEDLPRKNRQKFDHLILGPAAGQR 72
MR G RRF+ FFL LV F ++ SVLEL+ SV E L + +K HL+LGPAA Q
Sbjct: 1 MRTWSGRRRFILCFFLLLVIF--HIFSVLELYSNSVTEALQKNRNKKSYHLVLGPAASQG 58
Query: 73 LPNRLQCQDSRASNMV-TVGNASYSKTERSMAILNVF 108
LPNRLQC+ S+A N + ++S S ++A + VF
Sbjct: 59 LPNRLQCEGSKALNKTHLLDSSSDSNVRDNVAFVTVF 95
>gi|67920764|ref|ZP_00514283.1| hypothetical protein CwatDRAFT_5283 [Crocosphaera watsonii WH 8501]
gi|67856881|gb|EAM52121.1| hypothetical protein CwatDRAFT_5283 [Crocosphaera watsonii WH 8501]
Length = 316
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 65/257 (25%), Positives = 111/257 (43%), Gaps = 24/257 (9%)
Query: 84 ASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPI 143
A + N + + ++N+ + + P +LTD + L+ + +Y
Sbjct: 16 AKKIYNQDNKDFRNDYNYILLINLLFRSVSIFHPNCRKVVLTDMNTRLAGLEDDIEVY-- 73
Query: 144 HGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQN 203
+ +M R+ + +++ + Q I + DSD+ V +L H+F ++
Sbjct: 74 RTSLDPESIMFSRLVAQFNYVKTQ-----QIDSDI---ILIDSDMLVNANLEHLFE--ED 123
Query: 204 FHLALTFR---NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYM-NASRMLGD 259
F +ALT+R KD P+N G I + + D A FLE+V ++Y KY+ + GD
Sbjct: 124 FSVALTYRYLEAVKDMPINGGIIFL--SRDRKQEAIKFLEKVYQIYQEKYLKDYQSWSGD 181
Query: 260 QLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDV 319
Q AL + FD F Q V + + L C YN++P D
Sbjct: 182 QYALIDAI----GFD--NFNSRQSDVMLVDEQKIKLLDCEIYNFSPDRNPNSIVREHKDK 235
Query: 320 KVVHFKGSRKRLMLESW 336
++HFKGSRK++M W
Sbjct: 236 VILHFKGSRKKIMPLYW 252
>gi|416379625|ref|ZP_11683920.1| hypothetical protein CWATWH0003_0757 [Crocosphaera watsonii WH
0003]
gi|357265857|gb|EHJ14567.1| hypothetical protein CWATWH0003_0757 [Crocosphaera watsonii WH
0003]
Length = 316
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/257 (25%), Positives = 111/257 (43%), Gaps = 24/257 (9%)
Query: 84 ASNMVTVGNASYSKTERSMAILNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPI 143
A + N + + ++N+ + + P +LTD + L+ + +Y
Sbjct: 16 AKKIYNQDNKDFRNDYNYILLINLLFRSVSIFHPNCRKVVLTDMNTRLAGLEDDIEVY-- 73
Query: 144 HGEYSRDKLMLQRIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQN 203
+ +M R+ + +++ + Q I + DSD+ V +L H+F ++
Sbjct: 74 RTSLDPESIMFSRLVAQFNYVKTQ-----QIDSDI---ILIDSDMLVNANLEHLFE--ED 123
Query: 204 FHLALTFR---NNKDQPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYM-NASRMLGD 259
F +ALT+R KD P+N G I + + D A FLE+V ++Y KY+ + GD
Sbjct: 124 FSVALTYRYLEAVKDMPINGGIIFL--SRDRKQEAIKFLEKVYQIYQEKYLKDYQSWWGD 181
Query: 260 QLALAWVVKSHPSFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPLDV 319
Q AL + FD F Q V + + L C YN++P D
Sbjct: 182 QYALIDAI----GFD--NFHSRQSDVMLVDEQKIKLLDCEIYNFSPGRNPNSIVREHKDK 235
Query: 320 KVVHFKGSRKRLMLESW 336
++HFKGSRK++M W
Sbjct: 236 VILHFKGSRKKIMPLYW 252
>gi|297620616|ref|YP_003708753.1| hypothetical protein wcw_0375 [Waddlia chondrophila WSU 86-1044]
gi|297375917|gb|ADI37747.1| hypothetical protein wcw_0375 [Waddlia chondrophila WSU 86-1044]
gi|337292759|emb|CCB90764.1| putative uncharacterized protein [Waddlia chondrophila 2032/99]
Length = 259
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 22/164 (13%)
Query: 182 VFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD-----QPLNSGFIAVRGTPDGISRAK 236
VF D D+ + + +F +NF LAL +R + + P+N GFI + P+G ++A
Sbjct: 106 VFCDYDLLFQESIELLFK--ENFDLALIYRKSFEGGLHPAPINGGFIGIH--PEGFTKAI 161
Query: 237 IFLEEVLRVYSSKYMNASRMLGDQLALAWVV---KSHPSFDARRFTKAQPFVEDIIGASV 293
FLE V Y Y G Q +L ++ K H +F + GA +
Sbjct: 162 NFLETVHSCYLENYSEYKEWGGFQSSLNKLLVPKKVHNAFPNHLIYE---------GAEI 212
Query: 294 LFLPCATYNWTPPEGAGQFHGMPLDVKVVHFKGSRKRLMLESWN 337
LP + YN+ E G++ D K++HFKG RK +M WN
Sbjct: 213 ALLPSSEYNYAI-EAQGEWVDFKPDKKILHFKGPRKEVMANYWN 255
>gi|384252921|gb|EIE26396.1| hypothetical protein COCSUDRAFT_39504 [Coccomyxa subellipsoidea
C-169]
Length = 333
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 111/263 (42%), Gaps = 60/263 (22%)
Query: 108 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQR--------IRS 159
FI+ ++ + P + +LTD + + +P V +Y Y+ D+ L R +
Sbjct: 50 FISALRRSNPGCTIVVLTDQGTQIELP-PDVRLY----RYAIDRSKLGRNPYANYYQYLA 104
Query: 160 YITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLN 219
I+FL+ + ++G VF D DI V+D L +F + F A+T + D P+N
Sbjct: 105 QISFLQHLM---AKGLAQSMDVVFLDMDILVIDSLAEVFKEGPGFDYAVTLSDAVDMPVN 161
Query: 220 SG--FIAVRGTPDGISRAKIFLEEVLRVY--SSKYMNASRMLGDQLALAWVVKSHPSFDA 275
G F+ P ++ FLE+VL VY + +++ LG+ + L + D
Sbjct: 162 IGMQFVHHGRYPGAVA----FLEDVLAVYPFNETFVSGQVALGNLIGLRY-------NDE 210
Query: 276 RRFTKAQPFVEDIIGA----------SVLFLPCATYNWTPPEGAGQFHGMPLD------- 318
+ T + V D A SV FLPC YN+ AGQ D
Sbjct: 211 QLLTHYKSAVRDRSSAKQVRGRHSVHSVRFLPCMRYNYC---HAGQSCCTDPDRLPVSVT 267
Query: 319 ---------VKVVHFKGSRKRLM 332
VKV+HF G RK+ +
Sbjct: 268 TEEELTDTRVKVLHFVGHRKKAL 290
>gi|412992455|emb|CCO18435.1| unknown protein [Bathycoccus prasinos]
Length = 401
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 109/238 (45%), Gaps = 36/238 (15%)
Query: 117 PKSDVFILTDPASDLSMPRKGVTIYPIH---GEYSRDK-----LMLQRIRSYITFLERRI 168
P + V ++TD +++ M + G+ +H G R K LML+R++ Y F+ +R
Sbjct: 156 PGTCVALITDEETEIDMSKPGMDKVQLHRFEGILDRTKIGTGALMLERMKLYNAFI-KRA 214
Query: 169 REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIAVRGT 228
R++ + D+DI V D+ +F +NF +T R+NK P+ G V+
Sbjct: 215 RDNDWNA----DLLMVDTDIVFVGDVSDLFQ-TRNFDYGVTIRDNKAYPVQGG---VQFV 266
Query: 229 PDG--ISRAKIFLEEVLRVYSSKYMNASR---MLGDQLALAWVVKSHPSFDARRFTKAQ- 282
P G + AK F + L ++ S + + GDQ A + P+ + K +
Sbjct: 267 PKGKYVGAAK-FSDHTLDLWKSDLEKSGKEAGFTGDQAAYQRGLNV-PASKVQSLAKGKK 324
Query: 283 ----PFVEDIIGASVL---FLPCATYNWTPPEGAGQFHGM-PLDVKVVHFKGSRKRLM 332
P V GA V+ +P YN+ P +G G+ D++++H+KG +K M
Sbjct: 325 VIDLPVVCGSSGAEVVTVRMIPGDQYNFVP---SGNGQGLKKKDIRILHYKGGKKEGM 379
>gi|297606040|ref|NP_001057915.2| Os06g0571300 [Oryza sativa Japonica Group]
gi|255677159|dbj|BAF19829.2| Os06g0571300, partial [Oryza sativa Japonica Group]
Length = 73
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 38/58 (65%), Gaps = 2/58 (3%)
Query: 107 VFINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFL 164
+FI +QV+MP+S+V ILTDP S L+ I PI G YSR LM QRIRSYI L
Sbjct: 4 LFIEPLQVSMPRSNVIILTDPNSKLT--HGSAVILPIEGNYSRGNLMFQRIRSYIVSL 59
>gi|384252787|gb|EIE26262.1| hypothetical protein COCSUDRAFT_64411 [Coccomyxa subellipsoidea
C-169]
Length = 477
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 96/247 (38%), Gaps = 32/247 (12%)
Query: 108 FINFIQVTMPKSDVFILTDPASDLSMPRKGVTIYPIHGEYSRDKLMLQRIRSYITFLERR 167
FI+ ++ + P V +LTD A+ + +P + R KL +Y +L +
Sbjct: 190 FISTLRRSHPGCTVAVLTDQATQIDLP---ADVRLFRFTIDRSKLGRNPYANYYQYLAQI 246
Query: 168 I---REHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFIA 224
+ ++G H VF D D VVD + +F F LT + D P+N I
Sbjct: 247 AFMKQLAAEGLEHSTDVVFLDMDALVVDSIAEVFGQGAQFDYGLTLSDATDMPVN---IG 303
Query: 225 VRGTPDG-ISRAKIFLEEVLRVY--SSKYMNASRMLGDQLALAWVVKSHPSFDARRFT-- 279
++ P G A FL++V+ +Y +S + L D L K P R
Sbjct: 304 IQFVPRGRYGSAIAFLQDVIAIYPFNSTFTAGQEALTDLLGF----KDDPEEVLSRVNIS 359
Query: 280 -KAQPFVEDIIGASVLFLPCATYNWTP------------PEGAGQFHGMPLD-VKVVHFK 325
+ + + G +V C YN+ P F + VKV+HF
Sbjct: 360 VQEGRTCQQVGGRTVCLFTCMRYNYCHVDQSCCTDPARLPVSLTSFDDLAAARVKVLHFV 419
Query: 326 GSRKRLM 332
G RK+ +
Sbjct: 420 GHRKKAL 426
>gi|381204432|ref|ZP_09911503.1| hypothetical protein SclubJA_02255, partial [SAR324 cluster
bacterium JCVI-SC AAA005]
Length = 176
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 17/122 (13%)
Query: 105 LNVFINFIQVTMPKSDVFILTDPASDLSMPRKGVTI-YPIHGEYSRDKLMLQRIRSYITF 163
LN+ + ++ P++D+++LTD S S I Y + Y +L R +++ F
Sbjct: 40 LNLMFSSVKRIYPEADLYVLTDTKSKFSENTISKLIRYDLDSRYP----ILARNKAWYKF 95
Query: 164 LERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQPLNSGFI 223
LE+ + +F DSDI + D+ + +F +A TFR+ K P+N G I
Sbjct: 96 LEKTDKST----------IFLDSDILINDNFDELMS--VDFDIAFTFRDWKKWPINLGII 143
Query: 224 AV 225
V
Sbjct: 144 YV 145
>gi|182419850|ref|ZP_02951090.1| conserved hypothetical protein [Clostridium butyricum 5521]
gi|237666660|ref|ZP_04526645.1| YfnD [Clostridium butyricum E4 str. BoNT E BL5262]
gi|182376398|gb|EDT73980.1| conserved hypothetical protein [Clostridium butyricum 5521]
gi|237657859|gb|EEP55414.1| YfnD [Clostridium butyricum E4 str. BoNT E BL5262]
Length = 313
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 36/166 (21%), Positives = 70/166 (42%), Gaps = 25/166 (15%)
Query: 162 TFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKDQ----- 216
FLE I ++S +Y D+D+ ++ IF++ N + LT N ++
Sbjct: 89 VFLEYIINKYSDAV----YYAHVDADLFFFSNIDSIFNENSNASIFLTDHRNSEEFMHYY 144
Query: 217 ----PLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA-WVVKSHP 271
N+GF+ + T +G + K++ + L+ +++Y ++ GDQ + W+
Sbjct: 145 ELSGQFNTGFVGFKNTDEGKAAIKLWGDRCLKRCTAEYDTINKTFGDQRYVEDWI----D 200
Query: 272 SFDARRFTKAQPFVEDIIGASVLFLPCATYNWTPPEGAGQFHGMPL 317
F K+ IGA+V F Y ++ + + PL
Sbjct: 201 IFKDVHVVKS-------IGANVAFWNVKNYEFSKVDDLIYVNNKPL 239
>gi|440753032|ref|ZP_20932235.1| hypothetical protein O53_1407 [Microcystis aeruginosa TAIHU98]
gi|440177525|gb|ELP56798.1| hypothetical protein O53_1407 [Microcystis aeruginosa TAIHU98]
Length = 311
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 13/109 (11%)
Query: 156 RIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD 215
++R R R S G + H++F DSDI V+D L +F Y N L +
Sbjct: 75 KVRGIDDIHNHRFRIFSIFWGPLEHFIFLDSDIIVLDSLQELFRTYINSELEFMY----- 129
Query: 216 QPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA 264
RG D + + F ++++R Y + NA + + A +
Sbjct: 130 --------YYRGIFDQVYKEGEFRDKMIREYRANGFNAGSFISSRGAFS 170
>gi|425436184|ref|ZP_18816622.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389679139|emb|CCH92045.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 311
Score = 37.7 bits (86), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 45/109 (41%), Gaps = 13/109 (11%)
Query: 156 RIRSYITFLERRIREHSQGQGHINHYVFTDSDIAVVDDLGHIFHDYQNFHLALTFRNNKD 215
++R R R S G + H++F DSDI V+D L +F Y N L +
Sbjct: 75 KVRGIDDIHNHRFRIFSIFWGPLEHFIFLDSDIIVLDSLQELFRTYINSELEFMY----- 129
Query: 216 QPLNSGFIAVRGTPDGISRAKIFLEEVLRVYSSKYMNASRMLGDQLALA 264
RG D + + F ++++R + + NA L + A +
Sbjct: 130 --------YYRGIFDQVYKEGEFRDKMIREHRANGFNAGSFLSSRGAFS 170
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.140 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,632,447,918
Number of Sequences: 23463169
Number of extensions: 230695629
Number of successful extensions: 577097
Number of sequences better than 100.0: 55
Number of HSP's better than 100.0 without gapping: 41
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 576977
Number of HSP's gapped (non-prelim): 75
length of query: 362
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 218
effective length of database: 8,980,499,031
effective search space: 1957748788758
effective search space used: 1957748788758
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)