BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020825
(321 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225429560|ref|XP_002279873.1| PREDICTED: uncharacterized protein LOC100255893 [Vitis vinifera]
gi|296081666|emb|CBI20671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 436 bits (1122), Expect = e-120, Method: Compositional matrix adjust.
Identities = 207/321 (64%), Positives = 256/321 (79%), Gaps = 1/321 (0%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS+ LVLYGDGLARF+ PSH HLH LAS+ CGFLSL +PPSE+E+ERIVREFA
Sbjct: 1 MADKPSRGLVLYGDGLARFINPSHTHLHDLASRGICGFLSLSNSPPSETEDERIVREFAQ 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+D+ + L T L+ ++KS TIS+RFMG++AA++ NS L SFG LGF +
Sbjct: 61 LLDSCEASLGVNVGGATNLECQQKSLMPTISERFMGMRAAIVANNSSLTSFGGTLGFTVM 120
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
Q +ELI+ + LS P+DVVASE LKLLG Q GK E SQFDL+ +HIGAGEK N + +
Sbjct: 121 QTDELIKNNHPLSEPPVDVVASESLKLLGFQEGKTLETSQFDLIFMHIGAGEKANG-QTE 179
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
+A+D+EYIN LV I+Q A+P +E+GSRLHLS+V+SYG V + D+ +LSVL++ +E +S
Sbjct: 180 IIANDVEYINGLVGRIMQTAEPGSEIGSRLHLSLVMSYGAVSKDDDPSLSVLVTKNENNS 239
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
DLS+L PRQSYTMKG PR+++RHHCPMLI+QWQYAVTRKDMA+TFSF+DFKEHGG LTI
Sbjct: 240 DLSSLSPRQSYTMKGGNPRDNIRHHCPMLIAQWQYAVTRKDMAKTFSFEDFKEHGGNLTI 299
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRFLHEVAFKLWKAPKYGA
Sbjct: 300 PADRFLHEVAFKLWKAPKYGA 320
>gi|224092198|ref|XP_002309505.1| predicted protein [Populus trichocarpa]
gi|222855481|gb|EEE93028.1| predicted protein [Populus trichocarpa]
Length = 309
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 212/321 (66%), Positives = 241/321 (75%), Gaps = 12/321 (3%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADK S+ LV+YGDGLA F+ PSHAHLHSLASKA CGFL+LP APPSESE+ERIVREFA
Sbjct: 1 MADKASRGLVIYGDGLASFINPSHAHLHSLASKAFCGFLTLPNAPPSESEDERIVREFAY 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+DA + Y + TIS+RFMG+KAA+IT N LKSFG KLG
Sbjct: 61 LLDACEAY------------QNVSTSIPTISERFMGMKAAIITNNPGLKSFGGKLGLTVF 108
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
N+L SLSGS D V ELLKLLG Q GK E SQFDLV VH+GAGE+ N + K
Sbjct: 109 PFNDLKGNEFSLSGSSTDFVTFELLKLLGFQEGKTLETSQFDLVFVHVGAGERVNAEGHK 168
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
+A D+EYI+ALV I+++AQP +E+GSRLHLS+V+SYG V E D +LS+L S DE
Sbjct: 169 TIAIDVEYIDALVDGIMRIAQPGSEIGSRLHLSLVMSYGYVTEGDGRDLSILTSKDEMDP 228
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
LS LFP QSYTMKGE PRND+R+HCPMLISQWQYAVTR DMAETFSFKDFKEHGG L I
Sbjct: 229 ALSKLFPLQSYTMKGEKPRNDIRYHCPMLISQWQYAVTRVDMAETFSFKDFKEHGGNLVI 288
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRFLHEVAFKLWKAPKYGA
Sbjct: 289 PADRFLHEVAFKLWKAPKYGA 309
>gi|255550804|ref|XP_002516450.1| conserved hypothetical protein [Ricinus communis]
gi|223544270|gb|EEF45791.1| conserved hypothetical protein [Ricinus communis]
Length = 313
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/321 (61%), Positives = 237/321 (73%), Gaps = 8/321 (2%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS+ALVLYGDGLA+ ++ SH H+HSLASKA+CGFLSLP AP SESE ERIVREFA
Sbjct: 1 MADKPSRALVLYGDGLAQLIDESHTHIHSLASKASCGFLSLPNAPSSESENERIVREFAH 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+DA Y E+ E+ + TIS RFM +KAA+IT N L+SFG KLGF L
Sbjct: 61 LVDAYDAY--------QEMSNEKCTLIPTISQRFMEMKAAIITDNPSLQSFGSKLGFTVL 112
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
L T S S + + SELLKLLG Q GK E SQFDLV++H GA E N D +
Sbjct: 113 SFKCLNGTHKSHYESSFEDLTSELLKLLGFQEGKTIEASQFDLVILHTGASELLNSDSSE 172
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
A D Y+N++V+ + A+P +E+GSRLHLS+V+SYG + D++NLSVL+S DE +S
Sbjct: 173 ATECDAHYLNSMVKETMYNAKPGSEIGSRLHLSLVMSYGNASKIDSTNLSVLVSKDEMNS 232
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
DLS L PRQSYTMKGE R++VRHHCPMLI+QWQ+AVTRKD AET+S+K+FKEHG L I
Sbjct: 233 DLSVLVPRQSYTMKGEKLRDNVRHHCPMLIAQWQHAVTRKDTAETYSYKEFKEHGVNLVI 292
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRF+HEVAFKLWKAPKYGA
Sbjct: 293 PADRFIHEVAFKLWKAPKYGA 313
>gi|356564061|ref|XP_003550275.1| PREDICTED: uncharacterized protein LOC100780200 [Glycine max]
Length = 308
Score = 392 bits (1008), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/321 (61%), Positives = 242/321 (75%), Gaps = 13/321 (4%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS++LVL+GDGLAR ++ SH+H HSLAS ++CGFLSLP +PPSESE+ER VREFAV
Sbjct: 1 MADKPSRSLVLFGDGLARSIDSSHSHFHSLASLSSCGFLSLPNSPPSESEDERTVREFAV 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+DA Y S + ++ +E P QT+ DRF+G+KAA++T NS LKSF KLGF+ L
Sbjct: 61 LLDACHTY-SNMKGQISDRDNQEDPPKQTLPDRFIGMKAAILTNNSSLKSFSAKLGFSVL 119
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
+L+EL+ G +VVA ELLK LG + GK+ + FDLV HIGAGE+ K
Sbjct: 120 KLDELV------GGGSAEVVALELLKFLGFKEGKVLDSDHFDLVFFHIGAGEQ------K 167
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
VA D+ Y++ALV ++ AQP +++ SRLHLSVV+SYG VLEAD+S SVL +DEK+S
Sbjct: 168 VVAADVGYMDALVGGVMSQAQPGSDISSRLHLSVVMSYGNVLEADDSKFSVLKRVDEKNS 227
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
LS L+P QSYTMKG PR DVRH+ PMLI+QWQ AVTRKD AE FSF+DF EHGG LTI
Sbjct: 228 HLSMLYPLQSYTMKGGIPRKDVRHYSPMLIAQWQSAVTRKDNAERFSFEDFMEHGGNLTI 287
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRFLHE+AFKLWKAPKYGA
Sbjct: 288 PADRFLHEIAFKLWKAPKYGA 308
>gi|255647370|gb|ACU24151.1| unknown [Glycine max]
Length = 308
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 195/321 (60%), Positives = 242/321 (75%), Gaps = 13/321 (4%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS++LVL+GDGLAR ++ SH+H HSLAS ++CGFLSLP +PPSESE+ER VREFAV
Sbjct: 1 MADKPSRSLVLFGDGLARSIDSSHSHFHSLASLSSCGFLSLPNSPPSESEDERTVREFAV 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+DA Y S + ++ +E P QT+ DRF+G+KAA++T NS LKSF KLGF+ L
Sbjct: 61 LLDACHTY-SNMKGQISDRDNQEDPPKQTLPDRFIGMKAAILTNNSSLKSFSAKLGFSVL 119
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
+L+EL+ G +VVA ELLK LG + GK+ + FDLV HIGAGE+ K
Sbjct: 120 KLDELV------GGGSAEVVALELLKFLGFKEGKVLDSDHFDLVFFHIGAGEQ------K 167
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
VA D+ Y++ALV ++ AQP +++ SRLHLSVV+SYG VLEAD+ SVL +DEK+S
Sbjct: 168 VVAADVGYMDALVGGVMSQAQPGSDISSRLHLSVVMSYGNVLEADDFKFSVLKRVDEKNS 227
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
LS L+P QSYTMKG PR DVRH+ PMLI+QWQ+AVTRKD AE FSF+DF EHGG LTI
Sbjct: 228 HLSMLYPLQSYTMKGGIPRKDVRHYSPMLIAQWQFAVTRKDNAERFSFEDFMEHGGNLTI 287
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRFLHE+AFKLWKAPKYGA
Sbjct: 288 PADRFLHEIAFKLWKAPKYGA 308
>gi|449450351|ref|XP_004142926.1| PREDICTED: uncharacterized protein LOC101217985 [Cucumis sativus]
Length = 315
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 190/321 (59%), Positives = 247/321 (76%), Gaps = 6/321 (1%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS+AL ++GDGLARF++ SH++LH+LAS A+CGFLSLP APPSESEE+R++RE +
Sbjct: 1 MADKPSRALAVFGDGLARFLDHSHSNLHALASLASCGFLSLPNAPPSESEEKRMIRELEL 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L DA+ +S G E S ++I +RFMG++AAL+T NS ++S G +LG + L
Sbjct: 61 LFDASDSDVSKNGGGC-----EGSSQKKSIPERFMGMRAALLTNNSVVQSLGSELGISLL 115
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
+L+EL++ + L P+D +ASELLK LG Q GK+++ S+FDL+ VHIG G+K N +K +
Sbjct: 116 RLDELMQM-NHLGLPPVDFLASELLKRLGFQDGKIQDASEFDLLFVHIGVGDKVNGEKDR 174
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
+ +++YI+ALV ILQ QP +E+GSRLHLS+++SYG V E D ++LSVL S K+S
Sbjct: 175 TASDEMKYIDALVGDILQKTQPGSEIGSRLHLSLLMSYGDVFEDDENSLSVLTSNCGKNS 234
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
DLS LFPRQSYTM GE RND+RHH PML +QWQY VTRKD AETFSFK+FKEHG L I
Sbjct: 235 DLSVLFPRQSYTMMGEVQRNDIRHHSPMLAAQWQYGVTRKDKAETFSFKEFKEHGSNLVI 294
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRF+HEVAFKLWKAPKYGA
Sbjct: 295 PADRFIHEVAFKLWKAPKYGA 315
>gi|449494427|ref|XP_004159543.1| PREDICTED: uncharacterized LOC101217985 [Cucumis sativus]
Length = 315
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 189/321 (58%), Positives = 247/321 (76%), Gaps = 6/321 (1%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS+AL ++GDGLARF++ SH++LH+LAS A+CGFLSLP APPSESEE+R++RE +
Sbjct: 1 MADKPSRALAVFGDGLARFLDQSHSNLHALASLASCGFLSLPNAPPSESEEKRMIRELEL 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L DA+ ++ G E S ++I +RFMG++AAL+T NS ++S G +LG + L
Sbjct: 61 LFDASDSDVNKNGGGC-----EGSSQKKSIPERFMGMRAALLTNNSVVQSLGSELGISLL 115
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
+L+EL++ + L P+D +ASELLK LG Q GK+++ S+FDL+ VHIG G+K N +K +
Sbjct: 116 RLDELMQM-NHLGLPPVDFLASELLKRLGFQDGKIQDASEFDLLFVHIGVGDKVNGEKDR 174
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
+ +++YI+ALV ILQ QP +E+GSRLHLS+++SYG V E D ++LSVL S K+S
Sbjct: 175 TASDEMKYIDALVGDILQKTQPGSEIGSRLHLSLLMSYGDVFEDDENSLSVLTSNCGKNS 234
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
DLS LFPRQSYTM GE RND+RHH PML +QWQY VTRKD AETFSFK+FKEHG L I
Sbjct: 235 DLSVLFPRQSYTMMGEVQRNDIRHHSPMLAAQWQYGVTRKDKAETFSFKEFKEHGSNLVI 294
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRF+HEVAFKLWKAPKYGA
Sbjct: 295 PADRFIHEVAFKLWKAPKYGA 315
>gi|356552344|ref|XP_003544528.1| PREDICTED: uncharacterized protein LOC100817041 [Glycine max]
Length = 307
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 195/321 (60%), Positives = 242/321 (75%), Gaps = 14/321 (4%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS++LVL+GDGLARF++ SH+H HSLAS ++CGFLSLP +PPSESE+ER VRE AV
Sbjct: 1 MADKPSRSLVLFGDGLARFIDSSHSHFHSLASLSSCGFLSLPNSPPSESEDERTVREVAV 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+DA YL+ +G+ ++ +E SP QT+ DRFMG+KAA++T NS LKSF KLGF+ L
Sbjct: 61 LLDACPIYLN-MGK-ISDSDNQEDSPKQTLPDRFMGMKAAILTNNSSLKSFSAKLGFSVL 118
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
+L+EL+ G +VVA ELLKLLG Q GK+ + FDLV H GAGE+ K
Sbjct: 119 KLDELV------VGGSAEVVALELLKLLGFQEGKVLDSDHFDLVFFHNGAGEQ------K 166
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
VA D+ Y++ALV ++ QP +++ SRLHLSVV+SYG +LE D+S SV +DEK+S
Sbjct: 167 VVAADMGYMDALVGGVMSQMQPGSDISSRLHLSVVVSYGNILEGDDSKFSVSKRVDEKNS 226
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
LS L+P QSY MKG PR DVRH+ PMLI+QWQ AVTRKD AE FSF+DF EHGG LTI
Sbjct: 227 HLSVLYPLQSYAMKGGIPRKDVRHYSPMLIAQWQSAVTRKDNAERFSFEDFMEHGGNLTI 286
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
PADRFLHE+AFKLWKAPKYGA
Sbjct: 287 PADRFLHEIAFKLWKAPKYGA 307
>gi|388507454|gb|AFK41793.1| unknown [Lotus japonicus]
Length = 304
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 190/322 (59%), Positives = 234/322 (72%), Gaps = 19/322 (5%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS++LVL+GDGLAR ++PSH+HLHSLAS ++CGFLSLP +PPSESE++R VREFAV
Sbjct: 1 MADKPSRSLVLFGDGLARSIDPSHSHLHSLASLSSCGFLSLPNSPPSESEDQRTVREFAV 60
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSP-FQTISDRFMGLKAALITTNSRLKSFGDKLGFAT 119
L+DA + ++ ++ SP QT+ DRFMG+KAA++T +S LKSF KLGF
Sbjct: 61 LLDA-----PNVNVQVSDDDSQKDSPEKQTLPDRFMGMKAAILTNHSGLKSFSSKLGFNV 115
Query: 120 LQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKG 179
LQL+ L + D+ DVVA ELLKLLG Q GK+++ + FDLV HIGAGEK
Sbjct: 116 LQLDALAKDYDA-GLHENDVVALELLKLLGFQEGKVQDSNNFDLVFFHIGAGEK------ 168
Query: 180 KAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKS 239
++EYINALV ++ AQP +++ SRLHLS+V+SYG+V E D S SV DEKS
Sbjct: 169 -----NVEYINALVGGLMNQAQPGSDISSRLHLSLVMSYGKVSEDDESKFSVSKRADEKS 223
Query: 240 SDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLT 299
LS L+P QSY MKG PR DVR H P+LI+QWQ AVT KD AE F F+DF EHGG LT
Sbjct: 224 Y-LSLLYPLQSYAMKGGFPRKDVRLHSPLLIAQWQDAVTHKDNAERFYFQDFMEHGGNLT 282
Query: 300 IPADRFLHEVAFKLWKAPKYGA 321
IPADRFLHE+AFKLWKAPKYGA
Sbjct: 283 IPADRFLHEIAFKLWKAPKYGA 304
>gi|357437483|ref|XP_003589017.1| hypothetical protein MTR_1g016410 [Medicago truncatula]
gi|355478065|gb|AES59268.1| hypothetical protein MTR_1g016410 [Medicago truncatula]
gi|388492878|gb|AFK34505.1| unknown [Medicago truncatula]
Length = 306
Score = 347 bits (889), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 180/327 (55%), Positives = 237/327 (72%), Gaps = 27/327 (8%)
Query: 1 MADKPSKALVLYGDGLARFVEPS--HAHLHSLASKAACGFLSLPIAPPS----ESEEERI 54
MADKPS++L+LYGDGLARF++PS H +LHSLAS ++C FL+L + + E+E++RI
Sbjct: 1 MADKPSRSLILYGDGLARFIDPSSSHTNLHSLASLSSCAFLTLSNSNSNFNPSETEDDRI 60
Query: 55 VREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDK 114
VREF +L+DA+Q + +E QT+ DRFMG+KAA++T +S LKSF K
Sbjct: 61 VREFTLLLDASQIH-------------DE----QTLPDRFMGMKAAIMTNHSGLKSFSTK 103
Query: 115 LGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKT 174
LGF+ +QL+ L++ S S D+VA E+LKLLG Q GK+ + SQFDLV +H+GAGEK
Sbjct: 104 LGFSVVQLDGLVKQHASESQDS-DIVAVEILKLLGFQEGKVLDNSQFDLVFIHVGAGEKD 162
Query: 175 NDDKGKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLIS 234
N + A D+E+++ALV I++ AQP ++VGSRLHLSVV+SYG+VLE D S SV
Sbjct: 163 NGSEQNA---DVEFVDALVGAIMRQAQPGSDVGSRLHLSVVMSYGKVLEDDESKFSVSKK 219
Query: 235 IDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEH 294
+DEK S L+P QSY MKG PR DVR H PMLI+Q Q+AVTRKD + F+F+DF ++
Sbjct: 220 VDEKDSCFLTLYPLQSYAMKGGVPRKDVRLHSPMLIAQLQHAVTRKDNVQRFTFQDFMKN 279
Query: 295 GGYLTIPADRFLHEVAFKLWKAPKYGA 321
GG LTIPADRFLHE+AFK+WKAPKYGA
Sbjct: 280 GGNLTIPADRFLHEIAFKMWKAPKYGA 306
>gi|297807235|ref|XP_002871501.1| hypothetical protein ARALYDRAFT_488018 [Arabidopsis lyrata subsp.
lyrata]
gi|297317338|gb|EFH47760.1| hypothetical protein ARALYDRAFT_488018 [Arabidopsis lyrata subsp.
lyrata]
Length = 307
Score = 330 bits (847), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 172/322 (53%), Positives = 226/322 (70%), Gaps = 16/322 (4%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS++LVLYGDGLARFV+PS+ H+HSLAS +CGFLSLP APP E+E ER VREFA
Sbjct: 1 MADKPSRSLVLYGDGLARFVDPSNTHIHSLASVGSCGFLSLPNAPP-ETENERTVREFAH 59
Query: 61 LIDANQDYLSGIGEHTTELKWE-EKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFAT 119
L+DA++ Y G LK + + T+++RFMGLKAAL+T +S L SFG +G
Sbjct: 60 LMDASEAYSIASG-----LKPKGNGNDISTLAERFMGLKAALVTDSSTLTSFGKLIGLDV 114
Query: 120 LQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKG 179
LQL+E+ + SDS P D +S+LLKLLG + GK +VS +D V VHIG D+
Sbjct: 115 LQLSEICQESDSF---PSDATSSKLLKLLGFEGGKCLDVSPYDSVFVHIGV------DQY 165
Query: 180 KAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKS 239
+++ I++L+ I++MAQP +EV SRLHLS+VLSYG V + D S + ++ +
Sbjct: 166 NNGNNNMGIIDSLIGSIMRMAQPGSEVVSRLHLSLVLSYGSVTDKDVSVFPIKTPQEDIN 225
Query: 240 SDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLT 299
+ L PRQSYTM+GE R+DVRH+CPML++QWQ+AVTRKD+ +T SF+ K+ G L
Sbjct: 226 PAFAGLVPRQSYTMRGEKTRDDVRHYCPMLVAQWQHAVTRKDLVDTLSFETLKKLCGNLV 285
Query: 300 IPADRFLHEVAFKLWKAPKYGA 321
IP DRF+HEV FKLWKAPKYGA
Sbjct: 286 IPTDRFIHEVTFKLWKAPKYGA 307
>gi|15239788|ref|NP_196742.1| uncharacterized protein [Arabidopsis thaliana]
gi|7573395|emb|CAB87699.1| putative protein [Arabidopsis thaliana]
gi|26452379|dbj|BAC43275.1| unknown protein [Arabidopsis thaliana]
gi|108385324|gb|ABF85772.1| At5g11810 [Arabidopsis thaliana]
gi|332004340|gb|AED91723.1| uncharacterized protein [Arabidopsis thaliana]
Length = 306
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 170/322 (52%), Positives = 225/322 (69%), Gaps = 17/322 (5%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADK S++L+LYGDGLARFV+PS+ ++HSLAS A CGFLSLP APP E+E ERIVREF+
Sbjct: 1 MADKSSRSLILYGDGLARFVDPSNTNIHSLASVATCGFLSLPNAPP-ETENERIVREFSH 59
Query: 61 LIDANQDYLSGIGEHTTELKWE-EKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFAT 119
L+DA++ Y G LK + + T+++RFMGLKAAL+T +S L SFG +G
Sbjct: 60 LLDASEAYSIASG-----LKPKGNGNDISTLAERFMGLKAALVTDSSTLTSFGKLIGLDV 114
Query: 120 LQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKG 179
LQL+E+ + SDS P D +S+LLKLLG + GK +V+ +DLV VH G E N
Sbjct: 115 LQLSEICQESDSF---PSDATSSKLLKLLGFEGGKCLDVNLYDLVFVHFGVDEYNN---- 167
Query: 180 KAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKS 239
+++ +++L+ I+ MAQP +E+ SRLHLSVVLSYG V + D S + ++ +
Sbjct: 168 ---GNNMGILDSLIGSIMGMAQPGSEILSRLHLSVVLSYGSVTDKDVSVFPIKTPQEDIN 224
Query: 240 SDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLT 299
L PRQSYTM+GE R+DVRH+CPML++QWQ+ VTRKD+ +T SF+ K+ G L
Sbjct: 225 PAFIGLVPRQSYTMRGEKTRDDVRHYCPMLVAQWQHGVTRKDLVDTLSFEALKKLCGNLV 284
Query: 300 IPADRFLHEVAFKLWKAPKYGA 321
IPADRF+HEVAFKLWKAPKYGA
Sbjct: 285 IPADRFIHEVAFKLWKAPKYGA 306
>gi|397787620|gb|AFO66525.1| putative Myo28B1 [Brassica napus]
Length = 303
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 170/324 (52%), Positives = 223/324 (68%), Gaps = 24/324 (7%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS+ALVLYGDG AR V+PS+ H+HSLAS CGFLSLP APP+E+E+ERIVREFA
Sbjct: 1 MADKPSRALVLYGDGSARSVDPSNTHIHSLASVGTCGFLSLPHAPPAETEDERIVREFAH 60
Query: 61 LIDANQDY--LSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFA 118
L+DA++ Y SG+ +E + T+S+RFMG+KAAL+T +S L SFG +G
Sbjct: 61 LLDASEAYSIASGLKAKGSE------NEISTLSERFMGIKAALVTDSSSLTSFGKLIGLD 114
Query: 119 TLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDK 178
LQLNE+ + SD P D A+ELL+LLG + GK ++S +D V VHIG
Sbjct: 115 VLQLNEIHQKSDD--SFPSDATATELLRLLGFEGGKCLDMSLYDSVFVHIGG-------- 164
Query: 179 GKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEK 238
++ IN+L+ I++M+QP +E+ RLHLS+VLSYG V + D S+ +V + E
Sbjct: 165 -----DNVRTINSLIGSIMRMSQPGSEIAPRLHLSLVLSYGSVTDKDVSDFNVKKTPQEG 219
Query: 239 -SSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGY 297
+ L PRQSYTM+GE R++VR +CPML++QWQ AVTRKD+ +T SF+ K+ G
Sbjct: 220 INPAFQGLVPRQSYTMRGEKTRDNVRDYCPMLVAQWQDAVTRKDLVDTLSFEALKKLSGN 279
Query: 298 LTIPADRFLHEVAFKLWKAPKYGA 321
L IPADRF+HEVAFKLWKAPKYGA
Sbjct: 280 LVIPADRFIHEVAFKLWKAPKYGA 303
>gi|397787592|gb|AFO66498.1| putative rhomboid family protein [Brassica napus]
Length = 271
Score = 260 bits (664), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 197/324 (60%), Gaps = 56/324 (17%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKPS+ALVLYGD +E+E+ERIVREFA
Sbjct: 1 MADKPSRALVLYGD--------------------------------AETEDERIVREFAH 28
Query: 61 LIDANQDYL--SGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFA 118
L+DA++ Y SG+ + +E + T+S+RFMG+ AAL+T NS L SFG +G
Sbjct: 29 LLDASEAYTLASGLKANGSE------NEISTLSERFMGIMAALVTDNSSLTSFGKMIGLD 82
Query: 119 TLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDK 178
LQLNE+ + SD S D A+ELL+LLG + GK ++S +D V VHIG D+
Sbjct: 83 VLQLNEIHQKSDDSFSS--DATATELLRLLGFEGGKCLDMSLYDSVFVHIGG-----DNV 135
Query: 179 GKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEK 238
G +++L+ I++M+QP +E SRLHLSVVLSYG V + D S V + E
Sbjct: 136 GT--------VDSLIGSIMRMSQPGSETASRLHLSVVLSYGSVTDKDVSVFPVKNTPQEG 187
Query: 239 -SSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGY 297
+ L PRQSYTM+GE R++VR +CPML++QWQ AVTRKD+ +T SF+ K+ G
Sbjct: 188 INPAFQGLVPRQSYTMRGEKTRDNVRDYCPMLVAQWQDAVTRKDLVDTLSFEALKKLSGS 247
Query: 298 LTIPADRFLHEVAFKLWKAPKYGA 321
L IPADRF+HEVAFKLWKAPKYGA
Sbjct: 248 LVIPADRFIHEVAFKLWKAPKYGA 271
>gi|115444077|ref|NP_001045818.1| Os02g0135600 [Oryza sativa Japonica Group]
gi|42408840|dbj|BAD10100.1| unknown protein [Oryza sativa Japonica Group]
gi|42409110|dbj|BAD10360.1| unknown protein [Oryza sativa Japonica Group]
gi|113535349|dbj|BAF07732.1| Os02g0135600 [Oryza sativa Japonica Group]
gi|125537989|gb|EAY84384.1| hypothetical protein OsI_05760 [Oryza sativa Indica Group]
gi|125580726|gb|EAZ21657.1| hypothetical protein OsJ_05290 [Oryza sativa Japonica Group]
gi|215697639|dbj|BAG91633.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 337
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 197/340 (57%), Gaps = 22/340 (6%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHA------------HLHSLASKAACGFLSLPIAPPSE 48
MADKPS+ LVLY G A + P+ HL + AS A+CGFLSL P +
Sbjct: 1 MADKPSRGLVLYAAGHAALLPPASGGGGGAGGGGGGSHLDAFASLASCGFLSLRSPPLAS 60
Query: 49 SEEER--IVREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQ----TISDRFMGLKAALI 102
EE+R + E A L+D D E T + + + P Q +S+RFMG++AA++
Sbjct: 61 GEEKRDSTILELAQLLDVYDDLFPAKTEKTGQ-ETAQVDPLQLAVPKLSERFMGIRAAMV 119
Query: 103 TTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFD 162
T + SF LGF +L+ S S + S + S+ L LLG G ++E S+FD
Sbjct: 120 TNCPLVSSFAANLGFHVSGTEDLVAQSGSSAASKEAGIISQALSLLGFSEGNVQETSEFD 179
Query: 163 LVLVHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQV 221
LV +H+ A E TN GK + DL ++ LV I++ A ++ + SR+ +SV+LSYG
Sbjct: 180 LVFLHV-AMENTNSKLGKLGMKTDLNRLDKLVGAIMEAAPISSAIASRILVSVMLSYGSA 238
Query: 222 LEADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKD 281
E + S+ S E SDL+ L PRQSYTMK +DVR+H P+L++QWQ VTR D
Sbjct: 239 AE-NKDEFSISKSSTEIDSDLNLLRPRQSYTMKAGNTLDDVRNHHPILLAQWQEGVTRSD 297
Query: 282 MAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
+ E FSF++F + GG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 LTEGFSFEEFIKRGGNLAMLAERFLHEVAFKLWKAPKYGA 337
>gi|242063964|ref|XP_002453271.1| hypothetical protein SORBIDRAFT_04g002870 [Sorghum bicolor]
gi|241933102|gb|EES06247.1| hypothetical protein SORBIDRAFT_04g002870 [Sorghum bicolor]
Length = 334
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 200/338 (59%), Gaps = 21/338 (6%)
Query: 1 MADKPSKALVLYGDGLARFVEP------SHAHLHSLASKAACGFLSL---PIAPPSESEE 51
MADKPS+ALVLY G A + P + +HL + AS+A+CG L+L P + P+ E
Sbjct: 1 MADKPSRALVLYAAGHATVLTPPAGSAAAGSHLDAFASRASCGLLTLRSPPASHPTTGAE 60
Query: 52 ER--IVREFAVLIDANQDYLSG----IGEHTTELKWEEKSPFQTISDRFMGLKAALITTN 105
+ + E A L+D G G+ ++ +E +S+RFMGL+AAL+T+
Sbjct: 61 DNSSTILELAQLLDVYDHLYPGKNADSGQEVAQVDPQELV-VPKLSERFMGLRAALVTSC 119
Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDV-VASELLKLLGLQRGKMEEVSQFDLV 164
R+ SF LGF Q N+ S S SG +V + + LLG G ++E S+FDLV
Sbjct: 120 PRVSSFAANLGFQVFQTNDFAPQSAS-SGVTKEVGLINRAFDLLGFSDGNVQETSEFDLV 178
Query: 165 LVHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLE 223
+H+ A E T+ GK + DL + LV +++ A + + SR+H+SV+LSYG
Sbjct: 179 FMHV-AMENTSSKLGKLGMKTDLNRLEKLVGAVMESAPVGSAIASRIHVSVILSYGSA-S 236
Query: 224 ADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMA 283
+ S+L S E SDL+ L PRQSYTMK +DVR H P+L++QWQ VTR D+A
Sbjct: 237 GNKDEFSLLTSSTEADSDLNLLRPRQSYTMKAGHTLDDVRLHHPILLAQWQEGVTRVDLA 296
Query: 284 ETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
+ FSF++F +HGG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 297 KGFSFEEFIKHGGNLAMLAERFLHEVAFKLWKAPKYGA 334
>gi|212275488|ref|NP_001130063.1| uncharacterized protein LOC100191155 [Zea mays]
gi|194688200|gb|ACF78184.1| unknown [Zea mays]
gi|413935502|gb|AFW70053.1| hypothetical protein ZEAMMB73_997782 [Zea mays]
Length = 334
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 137/337 (40%), Positives = 196/337 (58%), Gaps = 19/337 (5%)
Query: 1 MADKPSKALVLYGDGLARFVEP------SHAHLHSLASKAACGFLSL---PIAPPSESEE 51
MADKPS+ALV+Y G A + P + +HL + AS+A CG L+L P + P+ E
Sbjct: 1 MADKPSRALVVYAAGHAAALIPPPGSAAAGSHLDAFASRACCGLLTLRSPPTSHPTTGTE 60
Query: 52 ER--IVREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTIS----DRFMGLKAALITTN 105
+ + E A L+D G E + + + + P + + +RFMGL+AAL+T+
Sbjct: 61 DNSSTILELAQLLDVYDHLYPGKNEESGQ-EVAQVDPQELVVPKLFERFMGLRAALVTSC 119
Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVL 165
+ SF LGF + N+ S S S + D + LLG G ++E S+FDLV
Sbjct: 120 PHVSSFAANLGFHVFKTNDFAPQSGSSSITKEDGLIKRAFDLLGFSDGDVQETSEFDLVF 179
Query: 166 VHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEA 224
+H+ A E T+ GK + D+ ++ LV +L+ A + + SR+H+SV+LSYG
Sbjct: 180 MHV-AMENTSSKLGKLGMKTDINRLDKLVGAVLESAPVGSAIASRIHVSVILSYGSA-SG 237
Query: 225 DNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAE 284
+ S+L S E SDL+ L PRQSYTMK +DVR H P+L++QWQ VTR D+A+
Sbjct: 238 NKDEFSLLTSSTETDSDLNLLRPRQSYTMKTGRTLDDVRLHHPILLAQWQDGVTRVDLAK 297
Query: 285 TFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
FSF++F + GG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 GFSFEEFMKRGGNLAMLAERFLHEVAFKLWKAPKYGA 334
>gi|195636240|gb|ACG37588.1| hypothetical protein [Zea mays]
Length = 334
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 137/337 (40%), Positives = 195/337 (57%), Gaps = 19/337 (5%)
Query: 1 MADKPSKALVLYGDGLARFVEP------SHAHLHSLASKAACGFLSL---PIAPPSESEE 51
MADKPS+ALV+Y G A + P + +HL + AS+A CG L+L P + P+ E
Sbjct: 1 MADKPSRALVVYAAGHAAALIPPPGSAAAGSHLDAFASRACCGLLTLRSPPTSHPTTGTE 60
Query: 52 ER--IVREFAVLIDANQDYLSG----IGEHTTELKWEEKSPFQTISDRFMGLKAALITTN 105
+ + E A L+D G G+ ++ +E + +RFMGL+AAL+T+
Sbjct: 61 DNSSTILELAQLLDVYDHLYPGKNAESGQEVAQVDPQELV-VPKLFERFMGLRAALVTSC 119
Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVL 165
+ SF LGF + N+ S S S + D + LLG G ++E S+FDLV
Sbjct: 120 PHVSSFAANLGFHVFKTNDFAPQSGSSSITKEDGLIKRAFDLLGFSDGDVQETSEFDLVF 179
Query: 166 VHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEA 224
+H+ A E T+ GK + D+ ++ LV +L+ A + + SR+H+SV+LSYG
Sbjct: 180 MHV-AMENTSSKLGKLGMKTDINRLDKLVGAVLESAPVGSAIASRIHVSVILSYGSA-SG 237
Query: 225 DNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAE 284
+ S+L S E SDL+ L PRQSYTMK +DVR H P+L++QWQ VTR D+A+
Sbjct: 238 NKDEFSLLTSSTETDSDLNLLRPRQSYTMKTGRTLDDVRLHHPILLAQWQDRVTRVDLAK 297
Query: 285 TFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
FSF++F + GG L + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 GFSFEEFMKRGGNLAMLAERFLHEVAFKLWKAPKYGA 334
>gi|357147081|ref|XP_003574214.1| PREDICTED: uncharacterized protein LOC100836573 [Brachypodium
distachyon]
Length = 335
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 192/338 (56%), Gaps = 20/338 (5%)
Query: 1 MADKPSKALVLYGDGLARFVEPS---------HAHLHSLASKAACGFLSL--PIAPPSES 49
MADKPS+ALVLY G A + + +HL + AS A+CGFLS+ P
Sbjct: 1 MADKPSRALVLYAAGHAALLPAAAAAAGGGGGKSHLDAFASLASCGFLSVRTPAVNDEGD 60
Query: 50 EEERIVREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTI----SDRFMGLKAALITTN 105
+ + E A L+D D L + T L+ P + + S+RFMG++AA++T
Sbjct: 61 KNSATIVELAQLLDV-YDALYPAKDAQTGLETARVDPQELVVPKLSERFMGMRAAMVTNC 119
Query: 106 SRLKSFGDKLGFATLQLNELIETSDSLSGSPIDV-VASELLKLLGLQRGKMEEVSQFDLV 164
+ SF LGF + + + S L GS D+ + + LLG G +++ S+FDLV
Sbjct: 120 PDVSSFTANLGFHVFRTEDFVAHS-GLGGSSKDIGIINRAFGLLGFSEGSVQDASEFDLV 178
Query: 165 LVHIGAGEKTNDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLE 223
VH+ A E T GK + DL ++ LV +++ A + + +R+H+SV+LSYG E
Sbjct: 179 FVHV-AMENTASKLGKLGMKTDLNRLDKLVAAVMEAAPVKSAIAARVHVSVILSYGSATE 237
Query: 224 ADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMA 283
+ +L S E SDL L PRQSYTMK +DVR+H PML++QWQ VTR D+A
Sbjct: 238 NKEESCLILNSSTETDSDLKLLHPRQSYTMKAGRTLDDVRNHHPMLLAQWQEGVTRSDLA 297
Query: 284 ETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
+ FSF++F + GG + A+RFLHEVAFKLWKAPKYGA
Sbjct: 298 KGFSFEEFIKRGGNFAMLAERFLHEVAFKLWKAPKYGA 335
>gi|326488863|dbj|BAJ98043.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 326
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 132/328 (40%), Positives = 180/328 (54%), Gaps = 9/328 (2%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHL---HSLASKAACGFLSLPIAPPSES---EEERI 54
MADKPS+ALVLY G A + + AS A+CGFLS+ +E ++
Sbjct: 1 MADKPSRALVLYAAGHAALLAGGGGGKGHLDAFASIASCGFLSVRTPAVNEDGGDKDSDT 60
Query: 55 VREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDK 114
+ E A L+D D L + T ++ +S+RFMGL+AA++T + SF
Sbjct: 61 ILELAQLLDV-YDGLCPAKDGETSRVDPQQLVVPKLSERFMGLRAAMVTNCPGVSSFAAN 119
Query: 115 LGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKT 174
LGF + S S S S + LLG G +++ S+FDLV VH A E T
Sbjct: 120 LGFHVFGTEDFAAQSGSGSSSKDSRIIDRAFTLLGFVEGNVQDASEFDLVFVH-AAMENT 178
Query: 175 NDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLI 233
GK + DL ++ LV +++ A + V +R+H+SVVLSYG E ++
Sbjct: 179 TSKLGKLGMKTDLNRLDKLVASVMEAAPVGSAVAARIHVSVVLSYGSATENKEEACLIVN 238
Query: 234 SIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKE 293
S E SDL L PRQSYTMK +DVR+H PML++QWQ VTR D+A+ FSF++F +
Sbjct: 239 SSTETDSDLKLLRPRQSYTMKAGKTLDDVRNHHPMLLAQWQQGVTRSDLAKEFSFEEFIK 298
Query: 294 HGGYLTIPADRFLHEVAFKLWKAPKYGA 321
G + A+RFLHEVAFKLWKAPKYGA
Sbjct: 299 RAGNFAMLAERFLHEVAFKLWKAPKYGA 326
>gi|168006913|ref|XP_001756153.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692663|gb|EDQ79019.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 314
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 132/331 (39%), Positives = 181/331 (54%), Gaps = 27/331 (8%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MADKP++ALV YGDGL V+P HAHLH LA +CGFL+L P ES R V E A
Sbjct: 1 MADKPARALVCYGDGLMPAVKPHHAHLHQLAVSGSCGFLALRSLPDDES---RPVLELAQ 57
Query: 61 LID---------ANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSF 111
L+D +D SG G EL+ ++ P ++++RFMG KA I + +
Sbjct: 58 LLDVYDVYSETACKEDLASGNGSAGAELRQDDLLP--SMAERFMGTKATFIANSKSAVAL 115
Query: 112 GDKLGFATLQLNELIETSDSLSGSPIDVVASELLKLLGL-QRGKMEEVSQFDLVLVHIGA 170
G + GFAT L E + S SL + AS+ L LLG + +E+++ LV +H+ A
Sbjct: 116 GRRCGFATDCLAE--DGSSSLPDA--SATASKALGLLGFDESSDAKEMNE--LVFLHLDA 169
Query: 171 GEKTNDDKGKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLS 230
D G + +E++++LV + ++ T RL VVL YG +
Sbjct: 170 ------DNGTSAECGIEFLDSLVGSVQDASKEGTSAYGRLFFVVVLGYGDAKANLGDAVP 223
Query: 231 VLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKD 290
+ S ++ S +L++L PRQSYTMK P VR P+L Q AVTR+D E F F+D
Sbjct: 224 SIKSKEKLSVELASLRPRQSYTMKTGKPVEGVREEYPLLAVYNQVAVTRRDEVERFMFED 283
Query: 291 FKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
F+ G L + DRFL+EVAFKLWKAPKYGA
Sbjct: 284 FQNRAGNLAMLVDRFLYEVAFKLWKAPKYGA 314
>gi|168012392|ref|XP_001758886.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690023|gb|EDQ76392.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 314
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 175/336 (52%), Gaps = 37/336 (11%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSL-----PIAPPSESEEERIV 55
MADKPS+ALV+YGDGLA VE H+HLH LA+ +CGFL+L P S ER V
Sbjct: 1 MADKPSRALVIYGDGLADAVESHHSHLHQLAASGSCGFLALRTLSTPHPGSSADGSERTV 60
Query: 56 REFAVLID--------ANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSR 107
E A L+D A ++ SGI + + EE S ++++RFMG+K+ IT +
Sbjct: 61 LELAQLLDVYDIYTEKARKEDASGITDAGVD---EESSKLPSMAERFMGMKSTFITNSKP 117
Query: 108 LKSFGDKLGFATLQLNELIETSDSLSGSP-IDVVASELLKLLGL-QRGKMEEVSQFDLVL 165
+ + GF+ L E +G P + AS +L LLG + +E ++ LV
Sbjct: 118 ALALSKRAGFSVKPLTED-------NGPPHANATASTVLSLLGFCESSDAKETNE--LVF 168
Query: 166 VHIGAGEKTNDDKGKAVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEAD 225
+H+ G + +EY+N+L+ + ++ + L L VVL YG
Sbjct: 169 LHL---------DGSSSGTGIEYLNSLIGCVQDASKEGSLAYGHLFLVVVLGYGDAALGA 219
Query: 226 NSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAET 285
N S I S+L+AL P QSY MK +R P+L Q AVTR+D +
Sbjct: 220 NGGRSFKEEI-ALPSELAALRPVQSYKMKTGKAVEGIREDYPLLAVYNQVAVTRRDEVQK 278
Query: 286 FSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
FSF DFK+ G L + DRFL+EV+FKLWKAPKYGA
Sbjct: 279 FSFDDFKKRAGNLVMLVDRFLYEVSFKLWKAPKYGA 314
>gi|302771948|ref|XP_002969392.1| hypothetical protein SELMODRAFT_410426 [Selaginella moellendorffii]
gi|300162868|gb|EFJ29480.1| hypothetical protein SELMODRAFT_410426 [Selaginella moellendorffii]
Length = 281
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 119/321 (37%), Positives = 170/321 (52%), Gaps = 40/321 (12%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MAD PS+++V++GDG V H++LHSLAS CGFL+L P+ S+ R
Sbjct: 1 MADGPSRSVVIFGDGFLPHVAAQHSNLHSLASDGCCGFLAL--RSPAASDGNRSAEALIQ 58
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+D Y G G FQT+S+RFMG+ AAL+T + + + G K GF
Sbjct: 59 LLDLYDAYKEGKG-------------FQTVSERFMGMNAALVTNSEQAVAVGSKAGFVVS 105
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
+ +L E G + + S+ L ++G+ G FDL+ +H+ A ++D K
Sbjct: 106 RFQDLHE------GIGAEDMPSKFLGMVGV--GDSAGGKPFDLLFLHLVA----DNDLEK 153
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
+ A E++++LV + PA L L ++L YG L S + + ID++
Sbjct: 154 SPAISTEWMDSLVGKLKN--APAKN----LLLVLILGYGNAL----SEVEIPELIDQQ-- 201
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
L L PRQSY++KG P D+ C +L Q AVTR+D +F++ G LTI
Sbjct: 202 -LRQLRPRQSYSIKGGKPVEDISKDCSLLAVFHQKAVTRRDHCTCLQLGEFRQKCGNLTI 260
Query: 301 PADRFLHEVAFKLWKAPKYGA 321
ADRFLHEVAFKLWKAPKYGA
Sbjct: 261 LADRFLHEVAFKLWKAPKYGA 281
>gi|302774625|ref|XP_002970729.1| hypothetical protein SELMODRAFT_411538 [Selaginella moellendorffii]
gi|300161440|gb|EFJ28055.1| hypothetical protein SELMODRAFT_411538 [Selaginella moellendorffii]
Length = 313
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 151/300 (50%), Gaps = 40/300 (13%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAV 60
MAD PS+++V++GDG V H++LHSLAS CGFL+L P+ S+ R
Sbjct: 1 MADGPSRSVVIFGDGFLPHVAAQHSNLHSLASDGCCGFLAL--RSPAASDGNRSAEALIQ 58
Query: 61 LIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATL 120
L+D + E KS FQT+S+RFMG+ AAL+T + + + G K GF
Sbjct: 59 LLD------------LYDADKEGKS-FQTVSERFMGMNAALVTNSEQAVAVGSKAGFVVS 105
Query: 121 QLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGK 180
+ +L E G + + S+ L ++G+ G FDL+ +H+ A ++D K
Sbjct: 106 RFQDLHE------GIGAEDMPSKFLGMVGV--GDSAGGKPFDLLFLHLVA----DNDLEK 153
Query: 181 AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSS 240
+ A E++++LV + PA L L ++L YG L S + + ID++
Sbjct: 154 SPAISTEWMDSLVGKLKNA--PAK----NLLLVLILGYGNAL----SEVEIPEFIDQQ-- 201
Query: 241 DLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTI 300
L L PRQSY++KG P D+ C +L Q AVTR+D ++F++ G + +
Sbjct: 202 -LRQLRPRQSYSIKGGKPVEDISKDCSLLAVFHQKAVTRRDHCTCLQLEEFRQTGSSMRL 260
>gi|224141629|ref|XP_002324169.1| predicted protein [Populus trichocarpa]
gi|222865603|gb|EEF02734.1| predicted protein [Populus trichocarpa]
Length = 79
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 41/51 (80%), Positives = 45/51 (88%)
Query: 262 VRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFK 312
+ +HCP+LI+QWQYAVTR DMAETFSFKD KEHGG L IPADRFL EVAFK
Sbjct: 27 ISYHCPILIAQWQYAVTRLDMAETFSFKDLKEHGGNLVIPADRFLREVAFK 77
>gi|300681520|emb|CBH32615.1| conserved hypothetical protein, expressed [Triticum aestivum]
Length = 364
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/249 (30%), Positives = 112/249 (44%), Gaps = 23/249 (9%)
Query: 1 MADKPSKALVLYGDGLARFVEPSHA---HLHSLASKAACGFLSLPIAPPSESEEER---I 54
MADKPS+ALVLY G A + L + AS A+C FLS+ +E +R
Sbjct: 1 MADKPSRALVLYAAGHAALLAGGGGGKGQLDAFASIASCSFLSVRTPVVNEDGGDRNSDS 60
Query: 55 VREFAVLIDANQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDK 114
+ E A L+D D L + + ++ P +S+RF LI S L FG +
Sbjct: 61 ILELAQLLDVC-DALYSVKDGEIARVDPQELPVPKLSERF-----DLIRGESWLPCFGTE 114
Query: 115 LGFATLQLNELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKT 174
FA Q + D+ + LLG G +++ S+FDLV VH E T
Sbjct: 115 -DFAA-QSGSGSSSKDTR-------IIDRAFTLLGFAEGNVQDASEFDLVFVH-ATMENT 164
Query: 175 NDDKGK-AVAHDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLI 233
GK + L ++ LV +++ A + V +R+H+SV+LSYG +L
Sbjct: 165 TSKLGKLGMKTYLNRLDKLVASVMEAAPVGSAVAARIHVSVILSYGSATANKEEACLILN 224
Query: 234 SIDEKSSDL 242
S E SDL
Sbjct: 225 SSIETDSDL 233
>gi|290996720|ref|XP_002680930.1| hypothetical protein NAEGRDRAFT_46748 [Naegleria gruberi]
gi|284094552|gb|EFC48186.1| hypothetical protein NAEGRDRAFT_46748 [Naegleria gruberi]
Length = 300
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 105/249 (42%), Gaps = 43/249 (17%)
Query: 78 ELKWEEKSPFQTISDRFMGLKAALITTNSRL----KSFGDKLGFATLQLNELIETSDSLS 133
EL+ E++S F +F LK + TTN+ + KS D F +N E D++
Sbjct: 90 ELELEKQSLFC----KFKKLKVEMWTTNNNVLNIFKSKKDNAKFLNFIVNNK-EEEDAIC 144
Query: 134 GSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVAHDLEYINALV 193
+ + + S D++++H N K + + +I +
Sbjct: 145 ENAVQAMLSN------------------DILILH------DNITKLQTIEEKRNHIQQID 180
Query: 194 RVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSALFPRQSYTM 253
++ Q+ T+ ++ + Y +L + N SVL S ++KSS P QSY
Sbjct: 181 HLLKQVNNLFTQKQNQFKI-----YNVILVSTPFN-SVLPSFNKKSSPFR---PLQSYEQ 231
Query: 254 KGETPRNDVRHHCPMLIS-QWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFK 312
K T D+ HH P LI Q TR D + K E G I D + E+A+K
Sbjct: 232 KNGTILTDIVHHEPSLICVQHCEGETRSDTNTQLTEKSSIELNGNCAILIDFVISEMAYK 291
Query: 313 LWKAPKYGA 321
L K+PKYGA
Sbjct: 292 LRKSPKYGA 300
>gi|66807053|ref|XP_637249.1| hypothetical protein DDB_G0287343 [Dictyostelium discoideum AX4]
gi|60465660|gb|EAL63739.1| hypothetical protein DDB_G0287343 [Dictyostelium discoideum AX4]
Length = 278
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 103/245 (42%), Gaps = 36/245 (14%)
Query: 79 LKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPID 138
K +EK Q+ +++M LK LI+ + EL SDS+
Sbjct: 68 FKHDEKLTEQSFKEKYMDLKIKLISNIEK----------------ELCIFSDSVKLDTQK 111
Query: 139 VVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVAHDLEYINALVRVILQ 198
ELL + S L+++H + +++ K + + I+ ++ +L+
Sbjct: 112 QTNQELLDTIS--------ESNEQLIMIHYQTINNSLEERRKTLFN----IDLILSNLLK 159
Query: 199 MAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSALF--PRQSYTMKGE 256
Q ++++V+ Y Q N + + ++S+++ F P+QSYT+
Sbjct: 160 KNQNDGNNNKNYYINIVIGYSQT----NIEIPLTAQQQQQSTNIPNWFNLPKQSYTLSNS 215
Query: 257 TPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKA 316
P D R P+ + TRKD ++F +F +G I + E+AFKL K
Sbjct: 216 LPY-DPRLTSPLFTIHYNSIFTRKDNTKSFCESEFI-NGSNGKILLFQHFRELAFKLGKV 273
Query: 317 PKYGA 321
PKYGA
Sbjct: 274 PKYGA 278
>gi|384253165|gb|EIE26640.1| hypothetical protein COCSUDRAFT_59159 [Coccomyxa subellipsoidea
C-169]
Length = 78
Score = 48.1 bits (113), Expect = 0.006, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 4/76 (5%)
Query: 247 PRQSYTMKG-ETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRF 305
PRQS+ G ET R +RH ++ +Q V R+D + FS + K+ GG I +R
Sbjct: 6 PRQSFEFAGLETVR--IRHDASVMFAQRLKGVVRQDSSTAFSAEQCKQGGGG-GILLERL 62
Query: 306 LHEVAFKLWKAPKYGA 321
+ E+A+KL +A KYGA
Sbjct: 63 MPEIAYKLGRAHKYGA 78
>gi|452823400|gb|EME30411.1| hypothetical protein Gasu_23170 [Galdieria sulphuraria]
Length = 284
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 131/320 (40%), Gaps = 41/320 (12%)
Query: 6 SKALVLYGDGLARFVEPS-HAHLHSLASKAACGFLSLPIAPPSESEEERIVREFAVLIDA 64
S +L+G L+ F+ S + +L +A G +L E E + +F++ D+
Sbjct: 2 SIVWILHGLDLSNFLLKSQYENLDRIAELGCSGICALY----EYDELENVTPQFSI-KDS 56
Query: 65 NQDYLSGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLG-FATLQLN 123
+SG+ +++ K F SD GL+ A++TT+ L +L F L L
Sbjct: 57 CFLQVSGLSRWN---RYQHKKNFP--SDTIFGLRTAVLTTDENLLEHKSQLSPFNLLLLA 111
Query: 124 ELIETSDSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVA 183
E + DSL+ + S L + E +++FDLV +++
Sbjct: 112 E--KDEDSLTQR---LWHSCCQLLCHTYEKEYERIAKFDLVFIYLK-------------- 152
Query: 184 HDLEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLS 243
N +VIL++A T G L + + + VL D + +
Sbjct: 153 ------NIPNQVILRVADVVT--GMLLSIPNEIPFLSVLGFDPRWNKSFEDLFPSRKEGI 204
Query: 244 ALFPRQSYTMKGET--PRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIP 301
PRQS+ E R+ H+ + S ++ TR+D F +D G I
Sbjct: 205 PFMPRQSFQSIDELILERHCFHHNTHLWASFYKEGFTRRDKCCRFLPEDCFSLGCNRVIL 264
Query: 302 ADRFLHEVAFKLWKAPKYGA 321
A+ ++E+AF L K PKYG+
Sbjct: 265 AEHLIYEIAFLLEKVPKYGS 284
>gi|325185878|emb|CCA20384.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 249
Score = 44.3 bits (103), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 37/75 (49%)
Query: 247 PRQSYTMKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFL 306
PRQS+T +D ++S +Q+ RKD + + +F GY ++ +
Sbjct: 175 PRQSFTKVDGAYVDDNTAPQYFVLSYFQHNRIRKDDVKRYELSEFAHRNGYGSMNLRALI 234
Query: 307 HEVAFKLWKAPKYGA 321
E+AF+L PKYGA
Sbjct: 235 KEIAFRLGCTPKYGA 249
>gi|440793375|gb|ELR14561.1| hypothetical protein ACA1_077920 [Acanthamoeba castellanii str.
Neff]
Length = 298
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 31/53 (58%)
Query: 269 LISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
L + + TR+D + F+ + K G I A++F+ E+AFKL K PKYGA
Sbjct: 246 LATYYHEGSTRRDDVQRFTEAEVKSKGANGNIAAEQFVAELAFKLGKTPKYGA 298
>gi|384491511|gb|EIE82707.1| hypothetical protein RO3G_07412 [Rhizopus delemar RA 99-880]
Length = 214
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 1/107 (0%)
Query: 215 VLSYGQVLEADNSNLSVLISIDEKSSDLSALFPRQSYTMKGETPRNDVRHHCPMLISQWQ 274
+++ G L+ + + ++ I++ + P+QSY MKG ++ ++ S +
Sbjct: 109 IINNGCALDRPETLVCIVSEINQNEERKAEFMPQQSYWMKGGVKVEEIEKGRSLIYSYFH 168
Query: 275 YAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
TR+D E F +D G I A FL E+ KL KYG+
Sbjct: 169 CGSTRRDSVEHFG-QDIVRLSGNKKILAWHFLAEIGNKLGFVAKYGS 214
>gi|281207162|gb|EFA81345.1| hypothetical protein PPL_05328 [Polysphondylium pallidum PN500]
Length = 278
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 107/251 (42%), Gaps = 51/251 (20%)
Query: 82 EEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVA 141
+ K + T +++ L LIT++ L + GF ++ +N +V
Sbjct: 68 DHKVTYNTFKEKYSNLSLRLITSSQSLLQDCNTKGFDSILVN--------------NVNC 113
Query: 142 SELLKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDKGKAVAHDLEYINALVRVILQMAQ 201
+LL + M+E + ++V++H+ + V+ LE AL+ + Q Q
Sbjct: 114 KDLLNEIN---KPMKE--KVEMVIIHL-----------EPVSKGLEDRRALLTTLDQTVQ 157
Query: 202 PATEVGS-----RLHLSVVLSYGQVLEADNSNLSVLIS--IDEKSSDLSALF--PRQSYT 252
+ ++++V+S+ D+ N++V ++ + + F P+QS
Sbjct: 158 SLKQQQEQQQQFETYINIVISF------DSPNVTVPLTEQQQSSQQSIPSTFIPPKQSTQ 211
Query: 253 MKGETPRNDVRHHCPMLISQWQYAVTRKDMAETFSFKDFKE--HGGYLTIPADRFLHEVA 310
P + R PM TRKD +++F F +G LTI ++L E+A
Sbjct: 212 FYNGKPV-EPRQTSPMFAIYSHAPWTRKDESKSFRLDQFIRGCNGKILTI---QYLKELA 267
Query: 311 FKLWKAPKYGA 321
FKL K PKYGA
Sbjct: 268 FKLGKIPKYGA 278
>gi|374371858|ref|ZP_09629777.1| hypothetical protein OR16_40849 [Cupriavidus basilensis OR16]
gi|373096602|gb|EHP37804.1| hypothetical protein OR16_40849 [Cupriavidus basilensis OR16]
Length = 696
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 45/101 (44%)
Query: 70 SGIGEHTTELKWEEKSPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETS 129
S +G +T LK E +PF + G A L++ ++ + G KL L I
Sbjct: 82 SALGTYTCTLKTGEVAPFFVVVTDPTGNTAPLVSVSTVTPAAGAKLTLNATPLTTAIVAQ 141
Query: 130 DSLSGSPIDVVASELLKLLGLQRGKMEEVSQFDLVLVHIGA 170
+ G+P+ V+ + + LQ+ V+Q VL IGA
Sbjct: 142 LAADGNPLTFVSGKTVDAAALQKVTANVVAQLASVLAAIGA 182
>gi|301104856|ref|XP_002901512.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100516|gb|EEY58568.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 204
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 51/113 (45%), Gaps = 23/113 (20%)
Query: 225 DNSNLSVLISIDEKSS-------DLSALFPRQSYTM---------KGETPRNDVRHHCPM 268
D+ + V +SI + +S + L P+QSY GE PR M
Sbjct: 99 DDGSSKVFVSIVKTASQRVVEPSEPHPLRPQQSYHKYDHKYPEHDSGEAPRR------LM 152
Query: 269 LISQWQYAVTRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
S +Q TR+D +TF + GGY + F+ E+AF+L APKYGA
Sbjct: 153 FASLYQ-DQTRRDAVQTFDEAQMDKLGGYGAMDVRVFVKEMAFRLGYAPKYGA 204
>gi|418872319|ref|ZP_13426661.1| urease accessory protein UreE, N-terminal domain protein
[Staphylococcus aureus subsp. aureus IS-125]
gi|418947816|ref|ZP_13500158.1| urease accessory protein UreE, N-terminal domain protein
[Staphylococcus aureus subsp. aureus IS-157]
gi|375367276|gb|EHS71241.1| urease accessory protein UreE, N-terminal domain protein
[Staphylococcus aureus subsp. aureus IS-125]
gi|375374329|gb|EHS77963.1| urease accessory protein UreE, N-terminal domain protein
[Staphylococcus aureus subsp. aureus IS-157]
Length = 150
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 10/98 (10%)
Query: 186 LEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSAL 245
LE + + R+ + TE+G RL + L YG +L AD+ N +I +D S DL +
Sbjct: 27 LENSDLVKRIQRVVTDHGTEIGIRLKQPIDLQYGDILYADDHN---MIIVDVNSEDLLVI 83
Query: 246 FPRQSYTMKGETPRNDVRHHCPM------LISQWQYAV 277
PR M G+ HH P ++ Q+ Y V
Sbjct: 84 QPRTLQEM-GDIAHQLGNHHLPAQFTETEMLVQYDYLV 120
>gi|348689196|gb|EGZ29010.1| hypothetical protein PHYSODRAFT_476234 [Phytophthora sojae]
Length = 184
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 29/44 (65%)
Query: 278 TRKDMAETFSFKDFKEHGGYLTIPADRFLHEVAFKLWKAPKYGA 321
TR+D ++F+ ++ + G Y + A F+ E+AF+L APKYGA
Sbjct: 141 TRRDGVQSFTEEEIDKLGAYGAMDARVFMKEMAFRLGCAPKYGA 184
>gi|206889471|ref|YP_002249056.1| dihydroorotase [Thermodesulfovibrio yellowstonii DSM 11347]
gi|206741409|gb|ACI20466.1| dihydroorotase [Thermodesulfovibrio yellowstonii DSM 11347]
Length = 435
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 71/154 (46%), Gaps = 22/154 (14%)
Query: 26 HLHSLASKAACGF-LSLPIAPPSESEEERIVREFAVLIDANQDYLSGIGEHTTELKWEEK 84
H +L +A G+ + + PP SEE+R + A L D D ++ +H E+K
Sbjct: 269 HYFTLTEEAVSGYNTNAKVNPPLRSEEDREAIKNA-LSDGIIDVIAT--DHAPHHIDEKK 325
Query: 85 SPFQTISDRFMGLKAALITTNSRLKSFGDKLGFATLQLNELIETSDSLSGSPIDVVASEL 144
PFQ + GL+ A + LK DK+ L L +LIE L+ +P
Sbjct: 326 VPFQEALNGISGLETAFAIS---LKLVHDKI----LNLRKLIE---KLTLNP-------- 367
Query: 145 LKLLGLQRGKMEEVSQFDLVLVHIGAGEKTNDDK 178
K+L L++ ++E +Q DL+LV I DK
Sbjct: 368 SKILNLKKEGIKEGAQGDLILVDINKNWTVKKDK 401
>gi|418564108|ref|ZP_13128533.1| urease accessory protein UreE, N-terminal domain protein
[Staphylococcus aureus subsp. aureus 21264]
gi|371977208|gb|EHO94486.1| urease accessory protein UreE, N-terminal domain protein
[Staphylococcus aureus subsp. aureus 21264]
Length = 150
Score = 37.7 bits (86), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 10/98 (10%)
Query: 186 LEYINALVRVILQMAQPATEVGSRLHLSVVLSYGQVLEADNSNLSVLISIDEKSSDLSAL 245
LE + + R+ + TE+G RL + L YG +L AD+ N +I +D S DL +
Sbjct: 27 LENSDLVKRIQRVVTDHGTEIGIRLKQPIDLQYGDILYADDHN---MIVVDVNSEDLLVI 83
Query: 246 FPRQSYTMKGETPRNDVRHHCPM------LISQWQYAV 277
PR M G+ H P ++ Q+ Y V
Sbjct: 84 HPRTLQEM-GDIAHQLGNRHLPAQFTETEMLVQYDYLV 120
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.379
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,895,686,194
Number of Sequences: 23463169
Number of extensions: 194162792
Number of successful extensions: 460540
Number of sequences better than 100.0: 59
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 460434
Number of HSP's gapped (non-prelim): 61
length of query: 321
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 179
effective length of database: 9,027,425,369
effective search space: 1615909141051
effective search space used: 1615909141051
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)