BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044000
(420 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449448711|ref|XP_004142109.1| PREDICTED: mediator of RNA polymerase II transcription subunit
4-like [Cucumis sativus]
gi|449520010|ref|XP_004167027.1| PREDICTED: mediator of RNA polymerase II transcription subunit
4-like [Cucumis sativus]
Length = 404
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 296/420 (70%), Positives = 340/420 (80%), Gaps = 18/420 (4%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
MLQHQIVQSPARLGLTNPNSPS+QNP +P K P S HH + ++ ++
Sbjct: 1 MLQHQIVQSPARLGLTNPNSPSIQNP----TPPKLPPSQQSHHQNQSNLSSSTPSSTLLP 56
Query: 61 CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
RAQALLLQMASLASKLFEVSPNRSLW++AFRGSLPTFL+SQ QS P
Sbjct: 57 LLPPLP--------RAQALLLQMASLASKLFEVSPNRSLWLTAFRGSLPTFLTSQTQSSP 108
Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
+DS+PS+ KEI++ F LQTQLFEAVAELQEILDLQD+KQKIA EI+AKD+A+ AFA
Sbjct: 109 STLLDSTPSSSKEILSLFNVLQTQLFEAVAELQEILDLQDSKQKIAYEIRAKDAALRAFA 168
Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHR 240
NK+KEAE+VLD+L DDY+DY+RP+R K+EG N+ CSTTV S+L LSDILSYAHR
Sbjct: 169 NKLKEAEQVLDVLFDDYADYQRPRRDKTEGDAENE---SLCSTTVASQLRLSDILSYAHR 225
Query: 241 ISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAI 300
ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FA+LDVGLPK VE+ EKTIE +
Sbjct: 226 ISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYNFANLDVGLPKAVESTEKTIEPM 285
Query: 301 IEPP-PLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPL 359
IEPP P++P DANPL + GLLPPNITVPSGWKPGMPVELP D P+ PPPGWKPGD V L
Sbjct: 286 IEPPAPVRP-DANPLAAIQGLLPPNITVPSGWKPGMPVELPGDLPM-PPPGWKPGDPVRL 343
Query: 360 PPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSEDD 419
PP+D+LP+ R++E QLRP GLHKPPEPIQVRHV+LDILDQDDDSSDY+S+EGSSEDD
Sbjct: 344 PPMDSLPVTRVDEPQLRPNALPGLHKPPEPIQVRHVQLDILDQDDDSSDYTSDEGSSEDD 403
>gi|357518055|ref|XP_003629316.1| hypothetical protein MTR_8g075790 [Medicago truncatula]
gi|355523338|gb|AET03792.1| hypothetical protein MTR_8g075790 [Medicago truncatula]
Length = 402
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 284/406 (69%), Positives = 326/406 (80%), Gaps = 31/406 (7%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHH--LRSATPPTAPITTGT 58
MLQHQIVQSPARLGLTNPNSPS+ NPNP K P S T+HH SATP
Sbjct: 1 MLQHQIVQSPARLGLTNPNSPSILNPNPQ----KLPPSQTNHHQDRHSATP--------- 47
Query: 59 TACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQS 118
S+ L+SLLPPLPRAQALL QMASLASKLFEVSPNRS+W++AFRGSLPTFLSSQ Q
Sbjct: 48 ----SAALISLLPPLPRAQALLSQMASLASKLFEVSPNRSVWVTAFRGSLPTFLSSQTQP 103
Query: 119 QPPPPIDSS-PSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAIL 177
++SS PST KEII+ FT+LQTQ+FEAV+ELQEI+DLQDAK+KI EI++KDSA+L
Sbjct: 104 HSFSSLESSLPSTTKEIISLFTTLQTQIFEAVSELQEIIDLQDAKKKIDGEIRSKDSALL 163
Query: 178 AFANKIKEAERVLDILVDDYSDYRRP-KRVKSEGGDGNDDDSGSCSTTVESRLNLSDILS 236
AFANK+K+AER LDILVDDYSDYRR KR+KS GDG++DDS + ST V S+L LSDILS
Sbjct: 164 AFANKLKDAERELDILVDDYSDYRRSIKRLKS--GDGSEDDSLTTST-VSSQLKLSDILS 220
Query: 237 YAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKT 296
YAHRISYTTFAPPEFGAG PLRGA+PPAPQ+EQMRASQLY FADLD+GLPK VETKEKT
Sbjct: 221 YAHRISYTTFAPPEFGAGTAPLRGAMPPAPQDEQMRASQLYNFADLDIGLPKAVETKEKT 280
Query: 297 IEAIIEPPPLQPVDANPLKDLA---GLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKP 353
+EAIIEPPP VD NPL +L+ G+LPPN +P GWKPGMPV+LP D P+ PPPGWKP
Sbjct: 281 VEAIIEPPP--SVDTNPLGNLSAIQGMLPPNFAIPPGWKPGMPVQLPIDMPIKPPPGWKP 338
Query: 354 GDAVPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
GD V LPP+D+LP+PR EEQ+L P PQ + PE IQV+HV LD+
Sbjct: 339 GDPVALPPIDSLPVPRFEEQKLPPHIPQP--RQPEIIQVQHVNLDL 382
>gi|225439765|ref|XP_002273310.1| PREDICTED: uncharacterized protein LOC100249946 [Vitis vinifera]
Length = 414
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 306/423 (72%), Positives = 341/423 (80%), Gaps = 19/423 (4%)
Query: 2 LQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTAC 61
+QHQIVQSPARLGL NPNSPSLQN T+ + + +T
Sbjct: 5 VQHQIVQSPARLGLPNPNSPSLQNS----------TAPKLSSQIPQSQQPHQQSNLSTTT 54
Query: 62 TSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPP 121
TSSTLLSLLPPL RAQ+LLLQMASLAS+LFEVSPNRSLW+++FRG+LPTFLSSQ+ P
Sbjct: 55 TSSTLLSLLPPLSRAQSLLLQMASLASRLFEVSPNRSLWLTSFRGTLPTFLSSQSIPPTP 114
Query: 122 PPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFAN 181
P++ SPS+ KEI++ FTSLQTQLFEAVAELQEILDLQDAKQKI REI++KDSAILAFAN
Sbjct: 115 TPLEHSPSSTKEILSLFTSLQTQLFEAVAELQEILDLQDAKQKITREIRSKDSAILAFAN 174
Query: 182 KIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRI 241
K+KEAE VLDILVDDYSDYRRPKR S D + +DS SC+TTV S+LNLSDILSYAHRI
Sbjct: 175 KLKEAEAVLDILVDDYSDYRRPKR--SNTAD-DSEDSSSCTTTVASQLNLSDILSYAHRI 231
Query: 242 SYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAII 301
SYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY F+DLDVGLPK V KEKTIE I
Sbjct: 232 SYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFSDLDVGLPKAVGIKEKTIETFI 291
Query: 302 EPPPLQPVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVP 358
EPPP QP D+NPL +L GLLPPNITVPSGWKPGMPVELP P+VPPPGWKPGD VP
Sbjct: 292 EPPPAQPADSNPLANLPSIQGLLPPNITVPSGWKPGMPVELPS-LPIVPPPGWKPGDPVP 350
Query: 359 LPPLDALPMP-RIEEQQLRPVPPQGLHKPPEPIQVRHVELDI-LDQDDDSSDYSSEEGSS 416
LPPL+++P+P R EEQ RP+ GL K PEPI V HV+LDI LDQDDDSSDYSSE GSS
Sbjct: 351 LPPLESIPIPSRAEEQLPRPLAAHGLQKAPEPIHVPHVQLDINLDQDDDSSDYSSEMGSS 410
Query: 417 EDD 419
ED+
Sbjct: 411 EDE 413
>gi|147807720|emb|CAN66553.1| hypothetical protein VITISV_018166 [Vitis vinifera]
Length = 1448
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 300/417 (71%), Positives = 335/417 (80%), Gaps = 19/417 (4%)
Query: 8 QSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTACTSSTLL 67
+SPARLGL NPNSPSLQN T+ + + +T TSSTLL
Sbjct: 1045 KSPARLGLPNPNSPSLQNS----------TAPKLSSQIPQSQQPHQQSNLSTTTTSSTLL 1094
Query: 68 SLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSS 127
SLLPPL RAQ+LLLQMASLAS+LFEVSPNRSLW+++FRG+LPTFLSSQ+ P P++ S
Sbjct: 1095 SLLPPLSRAQSLLLQMASLASRLFEVSPNRSLWLTSFRGTLPTFLSSQSIPPTPTPLEHS 1154
Query: 128 PSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAE 187
PS+ KEI++ FTSLQTQLFEAVAELQEILDLQDAKQKI REI++KDSAILAFANK+KEAE
Sbjct: 1155 PSSTKEILSLFTSLQTQLFEAVAELQEILDLQDAKQKITREIRSKDSAILAFANKLKEAE 1214
Query: 188 RVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFA 247
VLDILVDDYSDYRRPKR S D + +DS SC+TTV S+LNLSDILSYAHRISYTTFA
Sbjct: 1215 AVLDILVDDYSDYRRPKR--SNTAD-DSEDSSSCTTTVASQLNLSDILSYAHRISYTTFA 1271
Query: 248 PPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQ 307
PPEFGAGQ PLRGALPPAPQEEQMRASQLY F+DLDVGLPK V KEKTIE IEPPP Q
Sbjct: 1272 PPEFGAGQAPLRGALPPAPQEEQMRASQLYNFSDLDVGLPKAVGIKEKTIETFIEPPPAQ 1331
Query: 308 PVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPLDA 364
P D+NPL +L GLLPPNITVPSGWKPGMPVELP P+VPPPGWKPGD VPLPPL++
Sbjct: 1332 PADSNPLANLPSIQGLLPPNITVPSGWKPGMPVELP-SLPIVPPPGWKPGDPVPLPPLES 1390
Query: 365 LPMP-RIEEQQLRPVPPQGLHKPPEPIQVRHVELDI-LDQDDDSSDYSSEEGSSEDD 419
+P+P R EEQ RP+ GL K PEPI V HV+LDI LDQDDDSSDYSSE GSSED+
Sbjct: 1391 IPIPSRAEEQLPRPLAAHGLQKAPEPIHVPHVQLDINLDQDDDSSDYSSEMGSSEDE 1447
>gi|224138570|ref|XP_002322847.1| predicted protein [Populus trichocarpa]
gi|222867477|gb|EEF04608.1| predicted protein [Populus trichocarpa]
Length = 397
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 266/401 (66%), Positives = 308/401 (76%), Gaps = 27/401 (6%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSL--QNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGT 58
M QHQIVQSPARLGLTNP SPSL Q PNP P P HHH + T T +
Sbjct: 1 MHQHQIVQSPARLGLTNPTSPSLPVQTPNPPPPKFPNPQPPQHHHPSTTTTTTTTSSALL 60
Query: 59 TACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQS 118
+ RAQ+LLLQMASLA+KLFE+SPNR W+SA+RG+LPTFL SQ+QS
Sbjct: 61 SLLPPLP---------RAQSLLLQMASLATKLFELSPNRPHWLSAYRGTLPTFLPSQSQS 111
Query: 119 QPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILA 178
P +P++ KEII+ FTSLQTQLFE+VAELQEILDLQD+KQ+I++EIK+KDSA+L
Sbjct: 112 LDEP----TPTSTKEIISLFTSLQTQLFESVAELQEILDLQDSKQRISQEIKSKDSALLN 167
Query: 179 FANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYA 238
FANK+KEAERVLDILVDDY+DYRR + + ++++ + +TTV S+L LSDILSYA
Sbjct: 168 FANKLKEAERVLDILVDDYADYRRRRPIPP----NEEEENATNTTTVASQLKLSDILSYA 223
Query: 239 HRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIE 298
HRISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FA LDVGLPK VETKEKTIE
Sbjct: 224 HRISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFAHLDVGLPKDVETKEKTIE 283
Query: 299 AIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVP 358
AIIEP +ANPL++LAGL PPNI+VP+GWKPGMPV LP D P VPPPGWKPGD V
Sbjct: 284 AIIEP----QHEANPLENLAGLFPPNISVPAGWKPGMPVVLPTDLP-VPPPGWKPGDPV- 337
Query: 359 LPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
LD++P+PR EQ+L P+ PQGLHKPPE IQVRHVELDI
Sbjct: 338 --SLDSIPLPRAVEQKLPPIAPQGLHKPPETIQVRHVELDI 376
>gi|357500889|ref|XP_003620733.1| hypothetical protein MTR_6g089780 [Medicago truncatula]
gi|355495748|gb|AES76951.1| hypothetical protein MTR_6g089780 [Medicago truncatula]
Length = 579
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 280/403 (69%), Positives = 316/403 (78%), Gaps = 26/403 (6%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
MLQHQ+VQSPARLGLTNPNSP + NP P P T+ HH R A
Sbjct: 1 MLQHQVVQSPARLGLTNPNSPLIPNPTPPKLPPLQTTN--HHQDRH------------LA 46
Query: 61 CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
S LLSLLPPLPRAQALL QMASL++KLFEVSPN+SLW+SAFRGSLPTFLSSQ Q +
Sbjct: 47 TPSPALLSLLPPLPRAQALLQQMASLSTKLFEVSPNKSLWVSAFRGSLPTFLSSQGQPRS 106
Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
+DSSPST KEI++ FT+LQTQ+FEAV+ELQE+LD +DAKQKI +EI +KDSA+LAFA
Sbjct: 107 SASLDSSPSTTKEILSLFTNLQTQMFEAVSELQEVLDQKDAKQKIDQEICSKDSALLAFA 166
Query: 181 NKIKEAERVLDILVDDYSDYR-RPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAH 239
NK+K+AERVLDILVDDYSDYR + KR+K GDG++D S + S TV S+L LSDILSYAH
Sbjct: 167 NKLKDAERVLDILVDDYSDYRSKTKRLKL--GDGSEDVSLTTS-TVSSQLKLSDILSYAH 223
Query: 240 RISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEA 299
RISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FADLD+GLPK ET EKTIEA
Sbjct: 224 RISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYNFADLDIGLPKSAETTEKTIEA 283
Query: 300 IIEPPPLQPVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDA 356
IIEPPPLQPVDAN L +L GLLPPN TVP+GWKPGMPV+LP D P+ PPPGWKPGD
Sbjct: 284 IIEPPPLQPVDANSLANLPGIQGLLPPNFTVPAGWKPGMPVQLPIDIPIKPPPGWKPGDP 343
Query: 357 VPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
V LD+L +PRIEEQQL P PQ K PE IQV V LD+
Sbjct: 344 V---ALDSLSIPRIEEQQLHPHVPQP--KLPEIIQVAPVNLDL 381
>gi|357501019|ref|XP_003620798.1| hypothetical protein MTR_6g090580 [Medicago truncatula]
gi|355495813|gb|AES77016.1| hypothetical protein MTR_6g090580 [Medicago truncatula]
Length = 538
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 280/403 (69%), Positives = 316/403 (78%), Gaps = 26/403 (6%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
MLQHQ+VQSPARLGLTNPNSP + NP P P T+ HH R A
Sbjct: 1 MLQHQVVQSPARLGLTNPNSPLIPNPTPPKLPPLQTTN--HHQDRH------------LA 46
Query: 61 CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
S LLSLLPPLPRAQALL QMASL++KLFEVSPN+SLW+SAFRGSLPTFLSSQ Q +
Sbjct: 47 TPSPALLSLLPPLPRAQALLQQMASLSTKLFEVSPNKSLWVSAFRGSLPTFLSSQGQPRS 106
Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
+DSSPST KEI++ FT+LQTQ+FEAV+ELQE+LD +DAKQKI +EI +KDSA+LAFA
Sbjct: 107 SASLDSSPSTTKEILSLFTNLQTQMFEAVSELQEVLDQKDAKQKIDQEICSKDSALLAFA 166
Query: 181 NKIKEAERVLDILVDDYSDYR-RPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAH 239
NK+K+AERVLDILVDDYSDYR + KR+K GDG++D S + S TV S+L LSDILSYAH
Sbjct: 167 NKLKDAERVLDILVDDYSDYRSKTKRLKL--GDGSEDVSLTTS-TVSSQLKLSDILSYAH 223
Query: 240 RISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEA 299
RISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FADLD+GLPK ET EKTIEA
Sbjct: 224 RISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYNFADLDIGLPKSAETTEKTIEA 283
Query: 300 IIEPPPLQPVDANPLKDL---AGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDA 356
IIEPPPLQPVDAN L +L GLLPPN TVP+GWKPGMPV+LP D P+ PPPGWKPGD
Sbjct: 284 IIEPPPLQPVDANSLANLPGIQGLLPPNFTVPAGWKPGMPVQLPIDIPIKPPPGWKPGDP 343
Query: 357 VPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
V LD+L +PRIEEQQL P PQ K PE IQV V LD+
Sbjct: 344 V---ALDSLSIPRIEEQQLHPHVPQP--KLPEIIQVAPVNLDL 381
>gi|356519021|ref|XP_003528173.1| PREDICTED: uncharacterized protein LOC100796958 [Glycine max]
Length = 388
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 271/404 (67%), Positives = 308/404 (76%), Gaps = 40/404 (9%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHH--HLRSATPPTAPITTGT 58
MLQHQIVQSPA LGL NPNSPS+ NP +P K P + THH S+TP
Sbjct: 1 MLQHQIVQSPAMLGLANPNSPSIPNP----TPPKLPPTQTHHPQDRNSSTP--------- 47
Query: 59 TACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQS 118
SS LLSLLPP RAQALL QMASLASKLFEVSPNRSLW++AFRGSLPTFLSS + +
Sbjct: 48 ----SSALLSLLPPHLRAQALLAQMASLASKLFEVSPNRSLWVTAFRGSLPTFLSSHSST 103
Query: 119 QPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILA 178
P+D+SPS KEII+ FT LQTQ+FEA+AELQEILDLQDAKQKI EI ++DS +LA
Sbjct: 104 ----PLDTSPSDGKEIISLFTVLQTQIFEAIAELQEILDLQDAKQKIDCEIHSQDSTLLA 159
Query: 179 FANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYA 238
FANK+KEAE LDILVDDYSDY R KR KS DDDS + S+T+ S+L LSD+LSYA
Sbjct: 160 FANKLKEAESCLDILVDDYSDYHRSKRSKS-----GDDDSMT-SSTISSQLKLSDMLSYA 213
Query: 239 HRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIE 298
HRISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FADLDVGLPK VETKEK +E
Sbjct: 214 HRISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFADLDVGLPKEVETKEKIVE 273
Query: 299 AIIEPPPLQPVDANPLKDLA---GLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGD 355
AI+EPPP VD N + +L+ G+LPP +P GWKPGMPV+LP D PL PP GWKPGD
Sbjct: 274 AIVEPPP--QVDTNTVPNLSAFQGMLPP---MPPGWKPGMPVQLPIDLPLPPP-GWKPGD 327
Query: 356 AVPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
VPLPP+D+L +PR EQQ++P PQ K PE IQV+ V LD+
Sbjct: 328 PVPLPPMDSLQIPRFAEQQMQPHIPQP--KQPEVIQVQPVNLDL 369
>gi|356524055|ref|XP_003530648.1| PREDICTED: uncharacterized protein LOC100812071 [Glycine max]
Length = 388
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 258/400 (64%), Positives = 292/400 (73%), Gaps = 32/400 (8%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
MLQHQIVQSPARLGL NPNSPS+ N HH + T +
Sbjct: 1 MLQHQIVQSPARLGLANPNSPSIPN-----PTPPKLPPTQTHHPQDRNSSTPSSALLSLL 55
Query: 61 CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
RAQALL QMASLASKLF+VSPNRSLW++AFRGSLPTFLSS + +
Sbjct: 56 PPLP----------RAQALLAQMASLASKLFDVSPNRSLWVTAFRGSLPTFLSSHSST-- 103
Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
P+D+SPS KEII+ FT LQTQ+FEAVAELQEILDLQDAKQKI REI+++DS +LAFA
Sbjct: 104 --PLDTSPSNAKEIISLFTVLQTQIFEAVAELQEILDLQDAKQKIDREIRSQDSTLLAFA 161
Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHR 240
NK+KEAE LDILVDDYSDYRR KR KS DDDS + S+TV S+L LSDILSYAHR
Sbjct: 162 NKLKEAESCLDILVDDYSDYRRSKRSKS-----GDDDSMT-SSTVSSQLKLSDILSYAHR 215
Query: 241 ISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAI 300
ISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FADLDVGLPK VETKEK +EAI
Sbjct: 216 ISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFADLDVGLPKEVETKEKIVEAI 275
Query: 301 IEPPPLQPVDANP-LKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPL 359
+EPPP +A P L G+LPP +P GWKPGMPV+LP D +PPPGWKPGD VPL
Sbjct: 276 VEPPPQVDTNAVPNLSAFQGMLPP---MPPGWKPGMPVQLPID-LPLPPPGWKPGDPVPL 331
Query: 360 PPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
PP+D+L +PR EQQ++P PQ K PE IQV+ V LD+
Sbjct: 332 PPMDSLQIPRFAEQQMQPHIPQP--KQPEVIQVQPVNLDL 369
>gi|255568436|ref|XP_002525192.1| conserved hypothetical protein [Ricinus communis]
gi|223535489|gb|EEF37158.1| conserved hypothetical protein [Ricinus communis]
Length = 401
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 261/401 (65%), Positives = 303/401 (75%), Gaps = 27/401 (6%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
MLQHQIVQSPARLGLTNPNSPSLQNP +P KFP +++ ++ + +
Sbjct: 1 MLQHQIVQSPARLGLTNPNSPSLQNP----TPPKFPAASSQQQQHPSSALLSLLPPLR-- 54
Query: 61 CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
RAQ+LLLQMASLASK FEVSPNR+LW+SAFRGSLPTFLSSQ QS
Sbjct: 55 --------------RAQSLLLQMASLASKFFEVSPNRALWVSAFRGSLPTFLSSQTQS-- 98
Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
P++S+P + K+I++ FTSLQTQLFEAVAELQEILDLQDAKQ++AR+IK+ DSA+L FA
Sbjct: 99 ISPLESTPFSSKDILSLFTSLQTQLFEAVAELQEILDLQDAKQRLARDIKSHDSALLNFA 158
Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKS---EGGDGNDDDSGSCSTTVESRLNLSDILSY 237
NK+KEAE +LDILVDDYSDYR+PK KS E D ++ + + +TTV S+LNLSDILSY
Sbjct: 159 NKLKEAETLLDILVDDYSDYRKPKIAKSSLTEDDDVSNTTTSATTTTVASQLNLSDILSY 218
Query: 238 AHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTI 297
AHRISYTTFAPPEFGAG PLRGA PPAPQEEQMR SQLY FADLDVGLPK +
Sbjct: 219 AHRISYTTFAPPEFGAGTAPLRGAFPPAPQEEQMRVSQLYAFADLDVGLPKTETRERIIE 278
Query: 298 EAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAV 357
I PPP Q D NPL +LAGL PP++TVPSGWKPGMPV LP D P PP KPGD +
Sbjct: 279 PIIEPPPPPQSADTNPLGNLAGLFPPSLTVPSGWKPGMPVVLPTDLPPPPPGW-KPGDPL 337
Query: 358 PLPPLDALPMPRIEEQQLRP-VPPQGLHKPPEPIQVRHVEL 397
PLPPLD+LP+PR EQQL+P V PQGLHKPPE IQVRHV++
Sbjct: 338 PLPPLDSLPLPRTNEQQLQPAVAPQGLHKPPETIQVRHVQI 378
>gi|356550939|ref|XP_003543839.1| PREDICTED: uncharacterized protein LOC100812941 [Glycine max]
Length = 384
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 254/400 (63%), Positives = 297/400 (74%), Gaps = 36/400 (9%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
MLQHQIV SPARLGL NPNSPS+ +PT TH R+++ P++ + +
Sbjct: 1 MLQHQIVLSPARLGLANPNSPSIP------NPTPPKLPPTHTQDRNSSTPSSALLSLLPP 54
Query: 61 CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
RAQALL+Q+ASLASKLFEVSPNRSLW++AFRGSLPTFLSS + +
Sbjct: 55 LP------------RAQALLVQLASLASKLFEVSPNRSLWVTAFRGSLPTFLSSHSST-- 100
Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
P+ +S S KEII+ FT LQTQ+FEAVAELQEILDLQDAKQKI REI ++DS +L+FA
Sbjct: 101 --PLCAS-SNAKEIISLFTVLQTQIFEAVAELQEILDLQDAKQKIDREIHSQDSTLLSFA 157
Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHR 240
NK+K+AE LDILVDDYSDYRR KR+K DDDS + S+TV S+L LSDILSYAHR
Sbjct: 158 NKLKDAESCLDILVDDYSDYRRSKRLKF-----GDDDSMT-SSTVSSQLKLSDILSYAHR 211
Query: 241 ISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAI 300
ISYTTFAPPEFGAGQ PLRGALPPAPQEEQMRASQLY FADLDVGLPK VETKEK +EAI
Sbjct: 212 ISYTTFAPPEFGAGQAPLRGALPPAPQEEQMRASQLYNFADLDVGLPKEVETKEKIVEAI 271
Query: 301 IEPPPLQPVDANP-LKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPL 359
+EPPP +A P L G+LPP +P GWKPGMPV+LP D +PPPGWKPGD VPL
Sbjct: 272 VEPPPQVDTNAVPNLSAFQGMLPP---MPPGWKPGMPVQLPID-LPLPPPGWKPGDPVPL 327
Query: 360 PPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
PP+D+L +PR EQQ++P PQ K PE IQV+ V LD+
Sbjct: 328 PPMDSLQIPRFAEQQIQPHIPQP--KQPEVIQVQPVNLDL 365
>gi|15242555|ref|NP_195905.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
thaliana]
gi|75181168|sp|Q9LZ00.1|MED4_ARATH RecName: Full=Mediator of RNA polymerase II transcription subunit 4
gi|7413560|emb|CAB86039.1| putative protein [Arabidopsis thaliana]
gi|332003144|gb|AED90527.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
thaliana]
Length = 426
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 272/413 (65%), Positives = 320/413 (77%), Gaps = 17/413 (4%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNP--NPSPTKFPTSATHHHLRSATP--------P 50
MLQHQIVQSPARLGLT P SPS+QNP P + PT +S + H P
Sbjct: 1 MLQHQIVQSPARLGLTGPGSPSVQNPTPTRHGHPTSSSSSQSQHQQIQQQPNLLPSSTVA 60
Query: 51 TAPITTGTTACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPT 110
A + + A +SS LLSLLPPLPRAQALL QMA L SKLF+VSPNR++W+SAFRGSLP+
Sbjct: 61 AASSASASAAVSSSALLSLLPPLPRAQALLQQMAVLTSKLFDVSPNRAIWLSAFRGSLPS 120
Query: 111 FLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIK 170
FLSS + PPP + SPS+ KEI++ F SLQTQLFEAV ELQEILDLQDAKQK+AREIK
Sbjct: 121 FLSSHSLPPPPPLENPSPSSTKEILSQFNSLQTQLFEAVTELQEILDLQDAKQKVAREIK 180
Query: 171 AKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVK-SEGGDGNDDDSGSCSTTVESRL 229
+KDS++LAFANK+K+AERVLD+LVDDYSDYR+PKR K E + ND++S S STTV S+L
Sbjct: 181 SKDSSLLAFANKLKDAERVLDMLVDDYSDYRKPKRSKIEEDDEDNDNESSSSSTTVSSQL 240
Query: 230 NLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV 289
L DIL+YAH+ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLYTFADLD+GLPK
Sbjct: 241 KLKDILAYAHKISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYTFADLDIGLPKT 300
Query: 290 VETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPP 349
VE EK +EA+IEPPP P +A + + LLPPNI VPSGWKPGMPVEL +PPP
Sbjct: 301 VENMEKKVEALIEPPP--PPEAMDISAIHNLLPPNIAVPSGWKPGMPVEL-PRDLPLPPP 357
Query: 350 GWKPGDAVPLPPLDALPMPRIEE-QQLRPVPPQGLHKPPEPIQVRHVELDILD 401
GWKPGD V LPPL+++ PR E+ Q +R P QGLH+PP+ IQVR V+LDIL+
Sbjct: 358 GWKPGDPVVLPPLESIAAPRAEDHQHMR--PSQGLHRPPDVIQVRAVQLDILE 408
>gi|30794029|gb|AAP40461.1| unknown protein [Arabidopsis thaliana]
Length = 426
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 270/413 (65%), Positives = 317/413 (76%), Gaps = 17/413 (4%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPN----------PSPTKFPTSATHHHLRSATPP 50
MLQHQIVQSPARLGLT P SPS+QNP P + L S+T
Sbjct: 1 MLQHQIVQSPARLGLTGPGSPSVQNPTPTRHGHTTSSSSSQSQHQQIQQQPNLLPSSTVA 60
Query: 51 TAPITTGTTACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPT 110
A + + A +SS LLSLLPPLPRAQALL QMA L SKLF+VSPNR++W+SAFRGSLP+
Sbjct: 61 AASSASASAAVSSSALLSLLPPLPRAQALLQQMAVLTSKLFDVSPNRAIWLSAFRGSLPS 120
Query: 111 FLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIK 170
FLSS + PPP + SPS+ KEI++ F SLQTQLFEAV ELQEILDLQDAKQK+AREIK
Sbjct: 121 FLSSHSLPPPPPLENPSPSSTKEILSQFNSLQTQLFEAVTELQEILDLQDAKQKVAREIK 180
Query: 171 AKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVK-SEGGDGNDDDSGSCSTTVESRL 229
+KDS++LAFANK+K+AERVLD+LVDDYSDYR+PKR K E + ND++S S STTV S+L
Sbjct: 181 SKDSSLLAFANKLKDAERVLDMLVDDYSDYRKPKRSKIEEDDEDNDNESSSSSTTVSSQL 240
Query: 230 NLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV 289
L DIL+YAH+ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLYTFADLD+GLPK
Sbjct: 241 KLKDILAYAHKISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYTFADLDIGLPKT 300
Query: 290 VETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPP 349
VE EK +EA+IEPPP P +A + + LLPPNI VPSGWKPGMPVEL +PPP
Sbjct: 301 VENMEKKVEALIEPPP--PPEAMDISAIHNLLPPNIAVPSGWKPGMPVEL-PRDLPLPPP 357
Query: 350 GWKPGDAVPLPPLDALPMPRIEE-QQLRPVPPQGLHKPPEPIQVRHVELDILD 401
GWKPGD V LPPL+++ PR E+ Q +R P QGLH+PP+ IQVR V+LDIL+
Sbjct: 358 GWKPGDPVVLPPLESIAAPRAEDHQHMR--PSQGLHRPPDVIQVRAVQLDILE 408
>gi|297806219|ref|XP_002870993.1| hypothetical protein ARALYDRAFT_487063 [Arabidopsis lyrata subsp.
lyrata]
gi|297316830|gb|EFH47252.1| hypothetical protein ARALYDRAFT_487063 [Arabidopsis lyrata subsp.
lyrata]
Length = 428
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 273/414 (65%), Positives = 324/414 (78%), Gaps = 17/414 (4%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNP--NPSPTKFPTSATHHHLRSATP--------P 50
MLQHQIVQSPARLGLT P SPS+QNP P + PT +S + H P
Sbjct: 1 MLQHQIVQSPARLGLTGPGSPSVQNPTPTRHGHPTSSSSSQSQHQQIQQQPNLLPSSTVA 60
Query: 51 TAPITTGTTACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPT 110
A + +A +SS LLSLLPPLPRAQALL QMA L SKLF+VSPNR+LW+SAFRGSLP+
Sbjct: 61 AASSASAASAISSSALLSLLPPLPRAQALLQQMAVLTSKLFDVSPNRALWLSAFRGSLPS 120
Query: 111 FLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIK 170
FLSS + PPP + +PS+ KEI++ F SLQTQLFEAV ELQEILDLQDAKQK+AREIK
Sbjct: 121 FLSSHSLPPPPPLENPNPSSTKEILSQFNSLQTQLFEAVTELQEILDLQDAKQKVAREIK 180
Query: 171 AKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVK--SEGGDGNDDDSGSCSTTVESR 228
++DS++LAFANK+K+AERVLD+LV+DYSDYR+PKR K SE + ND++S S STTV S+
Sbjct: 181 SRDSSLLAFANKLKDAERVLDMLVEDYSDYRKPKRSKIVSEEEEDNDNESSSSSTTVSSQ 240
Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPK 288
L L DIL+YAH+ISYTTFAPPEFGAGQ PLRGALPPAPQ+EQMRASQLY FADLD+GLPK
Sbjct: 241 LKLKDILAYAHKISYTTFAPPEFGAGQAPLRGALPPAPQDEQMRASQLYAFADLDIGLPK 300
Query: 289 VVETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPP 348
VE EK +EA+IE PP P +A + + LLPPNI VPSGWKPGMPVELP+D +PP
Sbjct: 301 TVENMEKKVEALIE-PPPPPPEAMDISAIHNLLPPNIAVPSGWKPGMPVELPRD-LPLPP 358
Query: 349 PGWKPGDAVPLPPLDALPMPRIEEQQ-LRPVPPQGLHKPPEPIQVRHVELDILD 401
PGWKPGD V LPPL+++ PR E+QQ +R PPQGLH+PP+ IQVR V+LDIL+
Sbjct: 359 PGWKPGDPVVLPPLESIAAPRAEDQQHMR--PPQGLHRPPDVIQVRAVQLDILE 410
>gi|116789048|gb|ABK25099.1| unknown [Picea sitchensis]
Length = 393
Score = 308 bits (790), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 189/389 (48%), Positives = 249/389 (64%), Gaps = 33/389 (8%)
Query: 23 LQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTACTSSTLLSLLPPLPRAQALLLQ 82
+QNP PSP + S H +AT P+ P ++ A S+ + +L PP RAQ L+ +
Sbjct: 2 MQNPPIFPSPARLGLSVAHSP--AATVPSVPSSSQKIAERSTPIFNL-PPYRRAQLLIHE 58
Query: 83 MASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQ 142
MA L S+LFEVS NR+ W++A+RGS+PTFL Q PP ++ +T KEI++ S Q
Sbjct: 59 MAGLTSRLFEVSNNRAAWLTAYRGSIPTFL----QGSCIPPEAAAATTTKEILSILESHQ 114
Query: 143 TQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRR 202
L EAV ELQEI ++ KQ +A+EI +KDSA+ FA K+ +A++VL+ ++DY YR
Sbjct: 115 NLLEEAVTELQEIQKIEQDKQAVAQEISSKDSAMRVFARKLHDAQQVLEQTLEDYEGYRI 174
Query: 203 PKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGAL 262
KR KS D L L+DIL+YAHRISYTTFA P+FGAG+ PLRGA+
Sbjct: 175 GKRSKSLNSSPGD-------AVGLGHLELNDILTYAHRISYTTFATPDFGAGKAPLRGAI 227
Query: 263 PPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPVDANPLKDLA-GLL 321
PPAPQEEQMRASQLY +ADLDVG+PK T+ ++ AI+EP P + V NP +++ G+L
Sbjct: 228 PPAPQEEQMRASQLYQYADLDVGIPKQAVTEPLSV-AIVEPAPDR-VVVNPKQEVVPGML 285
Query: 322 PPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPL----------DALPMPRIE 371
P VP GWKPGMP+ELP V PPGWKPGD VPLPP DA+ +P+ E
Sbjct: 286 PGMPPVPPGWKPGMPIELP-----VMPPGWKPGDPVPLPPAMEVPAGWKPGDAVVLPKPE 340
Query: 372 EQ-QLRPVPPQGLHKPPEPIQVRHVELDI 399
E Q P+ P L +P EPIQV++V+LDI
Sbjct: 341 EAVQAPPLRPPALARPAEPIQVQYVQLDI 369
>gi|357159717|ref|XP_003578536.1| PREDICTED: uncharacterized protein LOC100835981 [Brachypodium
distachyon]
Length = 416
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 216/431 (50%), Positives = 273/431 (63%), Gaps = 30/431 (6%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
M+Q + SPARLG+T +SPSL PNP+P +S S+ T
Sbjct: 1 MMQSHM-PSPARLGMTA-SSPSLP---PNPAPVNPTSSPPQASSSSSAAAPGAGVPTATL 55
Query: 61 CTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQP 120
TS +LL LLPPLPRAQ+LL M+SLA+ LFE+SPNR+ WIS++RGSLPTFL S + S
Sbjct: 56 TTSPSLLPLLPPLPRAQSLLHIMSSLAANLFELSPNRTAWISSYRGSLPTFLPSSSTSGM 115
Query: 121 PPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
PPP+ + S KE ++ TS+QTQLFE V ELQE LDLQDA+ ++ RE +AKD+A+LAFA
Sbjct: 116 PPPLPTPVSNTKEAMSLLTSVQTQLFETVTELQETLDLQDARARLVREARAKDAALLAFA 175
Query: 181 NKIKEAERVLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDIL 235
K++EA VLD LVDDY+DYRR PKR + G D+ G ++ S+L L DIL
Sbjct: 176 KKLREAHHVLDRLVDDYADYRRDPKRPR--GAPVVDEPEPVSHGDFGASLHSKLKLDDIL 233
Query: 236 SYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV-VETKE 294
+YAHRISYTTFAPPE GAG PLRGALPPAPQ+ ++R SQLY FADLDVG+PK +ETK+
Sbjct: 234 TYAHRISYTTFAPPEHGAGM-PLRGALPPAPQDNELRMSQLYQFADLDVGVPKKPLETKD 292
Query: 295 KTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPG 354
A+ +LP ITVP GW G+P +LP D PL PPPGWKPG
Sbjct: 293 GA--AVETEATPLFEPPQEEPPRPSMLP--ITVPPGWPKGLPKDLPVDIPL-PPPGWKPG 347
Query: 355 DAVPLPPLDALPMPRIEEQQLRPVPPQGLHKPP------EPIQVRHVELDILDQDDDSSD 408
D + L + M ++ ++ RP P + P EPIQV VELDI + S +
Sbjct: 348 DPIELGGI----MASVKAEEPRPSMPMPVGVQPIVPRAQEPIQVAAVELDI-NISSSSDE 402
Query: 409 YSSEEGSSEDD 419
YSS+ GSSE+D
Sbjct: 403 YSSDVGSSEED 413
>gi|326509687|dbj|BAJ87059.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326515180|dbj|BAK03503.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 213/415 (51%), Positives = 270/415 (65%), Gaps = 36/415 (8%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPS-PTKFPTSATHHHLRSATPPTAPITTGTT 59
M+QH + SPARLG+T +SPSL PN +P+ PT P A +A T
Sbjct: 1 MMQH--MPSPARLGVTA-SSPSLP-PNQSPANPTSSPPQANPPSASAAAAAAGAGAAVPT 56
Query: 60 ACTSSTLLSLLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQ 119
TS LL LLPPLPRAQ+LL ++SLAS LFE+SPNR+ WIS++RGSLP FL+S + S
Sbjct: 57 LTTSPALLPLLPPLPRAQSLLHLISSLASNLFELSPNRAAWISSYRGSLPNFLASSSASA 116
Query: 120 PPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAF 179
P P+ + ST KE ++ TSLQTQLFE VAELQE LDLQDA+ K+ARE +AKD+++LAF
Sbjct: 117 APTPLPAPVSTTKEAMSMLTSLQTQLFETVAELQETLDLQDARAKLAREARAKDASLLAF 176
Query: 180 ANKIKEAERVLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDI 234
A K++EA VLD LVDDY+DYRR PKR + G D+ G ++ S+L L DI
Sbjct: 177 AKKLREAHHVLDRLVDDYADYRRDPKRPR--GAAAVDEPEPVSQGDFGASLHSKLKLDDI 234
Query: 235 LSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV-VETK 293
L+YAHRISYTTFAPPE GAG PLRGALPPAPQ+ ++R SQLY FADLDVG+PK +ETK
Sbjct: 235 LTYAHRISYTTFAPPEHGAGL-PLRGALPPAPQDNELRMSQLYQFADLDVGVPKKPLETK 293
Query: 294 E---KTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPG 350
E +E++ + P +LP ITVP GW+ G+P LP FP+ PPPG
Sbjct: 294 EGLTAEMESMPLFELPEEEPPRP-----SMLP--ITVPPGWQKGLPALLPA-FPM-PPPG 344
Query: 351 WKPGDAVPLPPLDALPMPRIEEQQLRP-VP-PQGLH----KPPEPIQVRHVELDI 399
WKPGD + L + +P ++ + RP VP P G+ + EPIQV V L+I
Sbjct: 345 WKPGDPIELGGI----IPAVKPDEQRPSVPIPVGVQPMVPRAQEPIQVAAVNLEI 395
>gi|115480431|ref|NP_001063809.1| Os09g0540500 [Oryza sativa Japonica Group]
gi|52076063|dbj|BAD46576.1| unknown protein [Oryza sativa Japonica Group]
gi|113632042|dbj|BAF25723.1| Os09g0540500 [Oryza sativa Japonica Group]
gi|125606481|gb|EAZ45517.1| hypothetical protein OsJ_30176 [Oryza sativa Japonica Group]
gi|215767917|dbj|BAH00146.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 414
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 187/355 (52%), Positives = 228/355 (64%), Gaps = 22/355 (6%)
Query: 75 RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
RAQALL +++LAS LFE+SP+R+ WISA+RGSLPTFL S + + PPP ST KE
Sbjct: 70 RAQALLQLISALASNLFELSPSRAAWISAYRGSLPTFLPSPSSAPPPPLPAPISST-KEA 128
Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
++ +LQTQLFEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA VLD LV
Sbjct: 129 LSLLNTLQTQLFEAVAELQETLDLQDARARLAREARAKDASILAFAKKLREAHHVLDRLV 188
Query: 195 DDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDILSYAHRISYTTFAPP 249
DDY+DYRR PKR + G DD G ++ S+LNL D+L+YAHRISYTTFAPP
Sbjct: 189 DDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLNLDDVLTYAHRISYTTFAPP 246
Query: 250 EFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPV 309
E GAG PLRGALPPAPQE +MR SQLY FADLDVG+PK E KE+T P QP
Sbjct: 247 EHGAGL-PLRGALPPAPQENEMRMSQLYQFADLDVGVPKSQEAKERTAAEGDATPLFQP- 304
Query: 310 DANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPLDALP--- 366
+P ++ A +LP + P GW+ G P GWKPGD + LPP L
Sbjct: 305 --SPTQE-AAVLPITVPHPHGWRNGALPLEIPLPPP----GWKPGDPITLPPDGILAGVK 357
Query: 367 --MPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSEDD 419
PR QQ+ V P + K EPIQVRHV+LDI + S S EDD
Sbjct: 358 GEEPRASVQQMPVVVPAMVPKAQEPIQVRHVDLDINNSSSSDEYSSDVGSSEEDD 412
>gi|125564538|gb|EAZ09918.1| hypothetical protein OsI_32214 [Oryza sativa Indica Group]
Length = 414
Score = 277 bits (708), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 215/421 (51%), Positives = 262/421 (62%), Gaps = 26/421 (6%)
Query: 9 SPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTACTSSTLLS 68
SPARLGLT +SPSL PNP+ +S + +A A T TS +LL
Sbjct: 8 SPARLGLTA-SSPSLP---PNPAAVNPTSSPPQGNPSAAAAVGAAAAAAPTLTTSPSLLP 63
Query: 69 LLPPLPRAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSP 128
LLPPLPRAQALL +++LAS LFE+SP+R+ WISA+RGSLPTFL S + + PPP
Sbjct: 64 LLPPLPRAQALLQLISALASNLFELSPSRAAWISAYRGSLPTFLPSPSSAPPPPLPAPIS 123
Query: 129 STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAER 188
ST KE ++ +LQTQLFEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA
Sbjct: 124 ST-KEALSLLNTLQTQLFEAVAELQETLDLQDARARLAREARAKDASILAFAKKLREAHH 182
Query: 189 VLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNLSDILSYAHRISY 243
VLD LVDDY+DYRR PKR + G DD G ++ S+LNL D+L+YAHRISY
Sbjct: 183 VLDRLVDDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLNLDDVLTYAHRISY 240
Query: 244 TTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEP 303
TTFAPPE GAG PLRGALPPAPQE +MR SQLY FADLDVG+PK E KE+T
Sbjct: 241 TTFAPPEHGAGL-PLRGALPPAPQENEMRMSQLYQFADLDVGVPKSQEAKERTAAEGDAT 299
Query: 304 PPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPPGWKPGDAVPLPPLD 363
P QP +P ++ A +LP + P GW+ G P GWKPGD + LPP
Sbjct: 300 PLFQP---SPTQE-AAVLPITVPHPHGWRNGALPLEIPLPPP----GWKPGDPITLPPDG 351
Query: 364 ALP-----MPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSED 418
L PR Q+ V P + K EPIQVRHV+LDI + S S ED
Sbjct: 352 ILAGVKGEEPRASVPQMPVVVPAMVPKAQEPIQVRHVDLDINNSSSSDEYSSDVGSSEED 411
Query: 419 D 419
D
Sbjct: 412 D 412
>gi|226498388|ref|NP_001144436.1| uncharacterized protein LOC100277397 [Zea mays]
gi|195642186|gb|ACG40561.1| hypothetical protein [Zea mays]
gi|238013174|gb|ACR37622.1| unknown [Zea mays]
gi|414590084|tpg|DAA40655.1| TPA: hypothetical protein ZEAMMB73_937469 [Zea mays]
gi|414590085|tpg|DAA40656.1| TPA: hypothetical protein ZEAMMB73_937469 [Zea mays]
Length = 402
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 178/360 (49%), Positives = 227/360 (63%), Gaps = 43/360 (11%)
Query: 75 RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
RAQ+LL ++SLA+ LFE+SPNR+ WI ++RGSLPTFL S + S PP P +S S+ K+
Sbjct: 68 RAQSLLQLISSLATNLFELSPNRAAWILSYRGSLPTFLPSASSSAPPLP--TSISSTKDA 125
Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
++ T+LQTQLFEAVAELQE LDLQDA+ ++ RE +AKD++++ FA K+ EA +VLD LV
Sbjct: 126 LSLLTTLQTQLFEAVAELQETLDLQDARARLVREARAKDASLVTFAKKLHEAHQVLDRLV 185
Query: 195 DDYSDYRR-PKRVKS--EGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEF 251
DDY+DYRR PKR + D G T++ S L L DIL+YAHRISYTTFAPPE
Sbjct: 186 DDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDDILTYAHRISYTTFAPPEH 245
Query: 252 GAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKV-VETKEKTIEAIIEPPPLQPVD 310
GAG PLR +LPP PQE QMRASQLY FADLDVG+PK ++ KE + P +P
Sbjct: 246 GAGL-PLRVSLPPVPQENQMRASQLYLFADLDVGVPKKPLDAKEGIAPEVEAAPLYEPPQ 304
Query: 311 ANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFP--LVPPPGWKPGDAVPLPPLDALPMP 368
P + +P LP FP L PPPGWKPGD + +P + LP
Sbjct: 305 EGPSR-------------------LPNTLPPMFPKELKPPPGWKPGDPITIPLDEILPGI 345
Query: 369 RIEE---------QQLRPVPPQGLHKPPEPIQVRHVELDILDQDDDSSDYSSEEGSSEDD 419
+ E +RPV P G PEPIQV+ V+LD + S +YSS+ GSSE+D
Sbjct: 346 KGAEPQPPVPQAPVSVRPVAPMG----PEPIQVKPVQLDF--ESSSSDEYSSDVGSSEED 399
>gi|224029783|gb|ACN33967.1| unknown [Zea mays]
gi|414888302|tpg|DAA64316.1| TPA: hypothetical protein ZEAMMB73_260641 [Zea mays]
Length = 403
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 170/341 (49%), Positives = 213/341 (62%), Gaps = 42/341 (12%)
Query: 75 RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
RAQ+LL ++SLAS L E+S NR+ WISA+RGSLPTFL S + + PP+ + S+ K+
Sbjct: 68 RAQSLLQLISSLASNLLELSANRAAWISAYRGSLPTFLPSASSA---PPLATPISSTKDA 124
Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
++ T+LQTQLFEAVAELQE LDLQDA+ ++ RE + KD+A+LAFA K+ EA +VLD LV
Sbjct: 125 LSLVTTLQTQLFEAVAELQETLDLQDARARLVREARGKDAALLAFAKKLHEAHQVLDRLV 184
Query: 195 DDYSDYRR-PKRVKSEGGDGNDD--DSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEF 251
DDY+DYRR PKR + + + G T++ S L L DIL+YAHRISYTTFAPPE
Sbjct: 185 DDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDDILTYAHRISYTTFAPPEH 244
Query: 252 GAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPVDA 311
GAG PLR +LPP PQE QMRASQLY FADLDVG+PK + I A +E PL
Sbjct: 245 GAGL-PLRVSLPPVPQENQMRASQLYLFADLDVGVPKKPLDAKDGIAAEVEVAPL----- 298
Query: 312 NPLKDLAGLLPPNITVPSGWKPGMPVELPKDFP--LVPPPGWKPGDAVPLPPLDALPMPR 369
P +P LP FP L PPPGWKPGD + LP + LP +
Sbjct: 299 -------------YEAPQEGPSRLPNTLPPMFPKELKPPPGWKPGDPITLPLDEILPGIK 345
Query: 370 IEEQQ-----------LRPVPPQGLHKPPEPIQVRHVELDI 399
EE++ +RPV P G PE IQV+ V+LD
Sbjct: 346 GEEREAQAPVPQAPVSVRPVAPMG----PETIQVKPVQLDF 382
>gi|242050012|ref|XP_002462750.1| hypothetical protein SORBIDRAFT_02g031320 [Sorghum bicolor]
gi|241926127|gb|EER99271.1| hypothetical protein SORBIDRAFT_02g031320 [Sorghum bicolor]
Length = 402
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 175/338 (51%), Positives = 218/338 (64%), Gaps = 41/338 (12%)
Query: 75 RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEI 134
RAQ+LL ++SLAS LFE+SPNR+ WI A+RGS PTFL S + S PP P + S+ K+
Sbjct: 68 RAQSLLQLISSLASNLFELSPNRAAWIQAYRGSFPTFLPSASSSAPPLP--TPISSTKDA 125
Query: 135 IACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILV 194
++ T+LQTQLFEAVAELQE LDLQDA+ ++ARE +AKD+A++AFA K+ EA +VLD LV
Sbjct: 126 LSLLTTLQTQLFEAVAELQETLDLQDARARLAREARAKDAALVAFAKKLHEAHQVLDRLV 185
Query: 195 DDYSDYRR-PKRVKSEGGDGNDD--DSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEF 251
DDY+DYRR PKR + + + G T++ S L L DIL+YAHRISYTTFAPPE
Sbjct: 186 DDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDDILAYAHRISYTTFAPPEH 245
Query: 252 GAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPL-QPVD 310
GAG PLR +LPP PQE QMRASQLY FADLDVG+PK ++ I A +E PL +P
Sbjct: 246 GAGL-PLRVSLPPVPQENQMRASQLYLFADLDVGVPKKPLDAKEGIAAEVEAAPLYEPPQ 304
Query: 311 ANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFP--LVPPPGWKPGDAVPLPPLDALPMP 368
P + +P LP FP L PPPGWKPGD + LPP + LP
Sbjct: 305 EGPSR-------------------LPNTLPPMFPKELKPPPGWKPGDPITLPPDEILPGI 345
Query: 369 RIEEQQ---------LRPVPPQGLHKPPEPIQVRHVEL 397
+ EE Q +RPV P G EPIQV+ V+L
Sbjct: 346 KGEEPQAPVPQPHVPVRPVAPMGA----EPIQVQPVQL 379
>gi|226506328|ref|NP_001145443.1| uncharacterized protein LOC100278821 [Zea mays]
gi|195656375|gb|ACG47655.1| hypothetical protein [Zea mays]
Length = 431
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 193/410 (47%), Positives = 245/410 (59%), Gaps = 39/410 (9%)
Query: 1 MLQHQIVQSPARLGLTNPNSPSLQNPNPNPSPTKFPTSATHHHLRSATPPTAPITTGTTA 60
M+Q + SPARLGLT +SPSL +PT P+ H L P +A GT+A
Sbjct: 29 MMQSHL-PSPARLGLTA-SSPSLPPNPVPLNPTSSPS---HGSL----PWSA--AAGTSA 77
Query: 61 CTSSTLLSLLPPLP----RAQALLLQMASLASKLFEVSPNRSLWISAFRGSLPTFLSSQA 116
+ T L PL RAQ+LL ++SLAS E+S NR+ WISA+RGSLPTFL S +
Sbjct: 78 APTLTTSPSLLPLLPPLPRAQSLLQLISSLASNFLELSANRAAWISAYRGSLPTFLPSAS 137
Query: 117 QSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAI 176
+ PP+ + S+ ++ ++ T+LQTQLFEAVAELQE LDLQDA+ ++ RE + KD+A+
Sbjct: 138 SA---PPLATPISSTEDALSLVTTLQTQLFEAVAELQETLDLQDARARLVREARGKDAAL 194
Query: 177 LAFANKIKEAERVLDILVDDYSDYRR-PKRVKS--EGGDGNDDDSGSCSTTVESRLNLSD 233
LAFA K+ EA +VLD LVDDY+DYRR PKR + D G T++ S L L D
Sbjct: 195 LAFAKKLHEAHQVLDRLVDDYADYRRDPKRPRGAAAADDPEPASEGDFGTSLHSSLKLDD 254
Query: 234 ILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPKVVETK 293
IL+YAHRISYT FAPPE GAG PLR +LPP PQE QMRASQLY FADLD+G+PK
Sbjct: 255 ILTYAHRISYTAFAPPEHGAGL-PLRVSLPPVPQENQMRASQLYLFADLDLGVPKKPLDA 313
Query: 294 EKTIEAIIEPPPLQPV-DANPLK---DLAGLLPPNITVPSGWKPGMPVELPKDFPLVPPP 349
+ I A +E PL P + LA + P + P GWKPG P+ LP D L P
Sbjct: 314 KDGIAAEVEVAPLYEAPQEGPSRLPNTLAPMFPKELKPPPGWKPGDPITLPLDEIL---P 370
Query: 350 GWKPGDAVPLPPLDALPMPRIEEQQLRPVPPQGLHKPPEPIQVRHVELDI 399
G K + P+ P+ +RPV P G PE IQV+ V+LD
Sbjct: 371 GIKGEEREAQAPVPQAPV------SVRPVAPMG----PETIQVKPVQLDF 410
>gi|168013070|ref|XP_001759224.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689537|gb|EDQ75908.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 401
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 122/331 (36%), Positives = 174/331 (52%), Gaps = 53/331 (16%)
Query: 38 SATHHHLRSATPPT----------APITTGTTACTSSTLLSLL--PPLPRAQALLLQMAS 85
SA H L+S P P+ T TA + +L+ PP RA LL +MA
Sbjct: 54 SAGHGELQSMWGPARSQQQQHDQAVPMDTAATARPAVVRSALMNQPPQRRAHMLLQEMAG 113
Query: 86 LASKLFEVSPNRSLWISAFRGSLPTFLSSQAQSQPPPPIDSSPSTIKEIIACFTSLQTQL 145
LA++L E++ +S + S+P FL+ PP + T KE+ A + Q L
Sbjct: 114 LANRLVELAATKSWQPGHMKVSIPPFLAGAIL----PPEGQAAVTSKEVQALLENHQNLL 169
Query: 146 FEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRRPKR 205
+EAV+ELQE+ D++ KQ+IA EI KD + AFA ++++A +VL+ +++Y+D ++ K
Sbjct: 170 YEAVSELQELQDIEQEKQQIAEEINTKDVVVRAFAKRVRDAHQVLEDALEEYADLKKGKH 229
Query: 206 VKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPA 265
KS + +++++SYAHRISYTTFAPP++GAG+ LRGALPPA
Sbjct: 230 RKS--------------------VPVAELVSYAHRISYTTFAPPDYGAGETILRGALPPA 269
Query: 266 PQEEQMRASQLYTFADLDVGLPKVVETKEKTIEAIIEPPPLQPVDANPLKDLAGLLPPNI 325
PQEE MRASQLY D+G +P A P+
Sbjct: 270 PQEENMRASQLYHVQVSDLGFTSAFAPPPAPEALEEA----EPPKAEPI----------- 314
Query: 326 TVPSGWKPGMPVELPKDFPLVP-PPGWKPGD 355
+P+GW+PGMPV+LP L P P GWK GD
Sbjct: 315 -MPAGWRPGMPVQLPPMAELPPMPAGWKIGD 344
>gi|62701671|gb|AAX92744.1| hypothetical protein LOC_Os11g05150 [Oryza sativa Japonica Group]
gi|77548691|gb|ABA91488.1| hydroxyproline-rich glycoprotein family protein, putative [Oryza
sativa Japonica Group]
Length = 194
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/105 (52%), Positives = 73/105 (69%), Gaps = 7/105 (6%)
Query: 132 KEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLD 191
+E ++ +LQ Q FEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA VLD
Sbjct: 86 QEALSLLNTLQMQFFEAVAELQETLDLQDARARLARETRAKDASILAFAKKLREAHHVLD 145
Query: 192 ILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNL 231
LVDDY+DYRR PKR + G DD G ++ S+L+L
Sbjct: 146 RLVDDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLDL 188
>gi|218185250|gb|EEC67677.1| hypothetical protein OsI_35109 [Oryza sativa Indica Group]
Length = 157
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/108 (50%), Positives = 74/108 (68%), Gaps = 7/108 (6%)
Query: 129 STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAER 188
S K+ ++ +LQ Q FEAVAELQE LDLQDA+ ++AR+ +AKD++ILAFA K++EA
Sbjct: 46 SYTKKALSLLNTLQMQFFEAVAELQETLDLQDARARLARKTRAKDASILAFAKKLREAHH 105
Query: 189 VLDILVDDYSDYRR-PKRVKSEGGDGNDD----DSGSCSTTVESRLNL 231
VLD LVDDY+DYRR PKR + G DD G ++ S+L+L
Sbjct: 106 VLDRLVDDYADYRRDPKRPR--GAAAADDPEPVSDGDFGASLHSKLDL 151
>gi|125576223|gb|EAZ17445.1| hypothetical protein OsJ_32973 [Oryza sativa Japonica Group]
Length = 174
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/93 (55%), Positives = 66/93 (70%), Gaps = 7/93 (7%)
Query: 144 QLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRR- 202
Q FEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA VLD LVDDY+DYRR
Sbjct: 78 QFFEAVAELQETLDLQDARARLARETRAKDASILAFAKKLREAHHVLDRLVDDYADYRRD 137
Query: 203 PKRVKSEGGDGNDD----DSGSCSTTVESRLNL 231
PKR + G DD G ++ S+L+L
Sbjct: 138 PKRPR--GAAAADDPEPVSDGDFGASLHSKLDL 168
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/63 (71%), Positives = 54/63 (85%), Gaps = 1/63 (1%)
Query: 144 QLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRR- 202
Q FEAVAELQE LDLQDA+ ++ARE +AKD++ILAFA K++EA VLD LVDDY+DYRR
Sbjct: 2 QFFEAVAELQETLDLQDARARLARETRAKDASILAFAKKLREAHHVLDRLVDDYADYRRD 61
Query: 203 PKR 205
PKR
Sbjct: 62 PKR 64
>gi|301098212|ref|XP_002898199.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105262|gb|EEY63314.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 352
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 117/277 (42%), Gaps = 56/277 (20%)
Query: 118 SQPPPPIDSSP-STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAI 176
S PP I ++ T+ E L AV +L Q +I EI+ K +
Sbjct: 31 SLPPSTITATRLRTVNETFNKVLEKDRDLLTAVKKLARHQVAQKELLRIEAEIENKKQKV 90
Query: 177 LAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILS 236
+ +A +++ ++ +D +D R RV + ++ S V L+ DI++
Sbjct: 91 VQYAAQLRCSQ-------EDIADVLRKHRVTLQ-------NAKQKSKVV---LDPRDIVA 133
Query: 237 YAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDV---GLPKVVETK 293
YAHRI+ TT AP E+ G P+ G +PPAPQE MRA L ++ G K T
Sbjct: 134 YAHRIAGTTSAPKEWQPG-FPMFGFMPPAPQEHMMRAGVLSRGIVAEIVTQGPEKYTSTA 192
Query: 294 EKT--------IEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKD--- 342
+ +E + P L DA K + P GWKPG PV+LP D
Sbjct: 193 ARVSKEGDGDAMEKLNIPIGLGASDAQIRKQM----------PPGWKPGDPVDLPLDALL 242
Query: 343 ----------FPLVPPPGWKPGDAVPLPPLDALPMPR 369
+ P WK GD PLPP DA+ + R
Sbjct: 243 HYMGRNFFTEHGITLPESWKTGD--PLPP-DAMEILR 276
>gi|348675077|gb|EGZ14895.1| hypothetical protein PHYSODRAFT_508758 [Phytophthora sojae]
Length = 353
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 124/276 (44%), Gaps = 59/276 (21%)
Query: 122 PPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAR---EIKAKDSAILA 178
PP + + ++ + F + + + +A ++++ Q A++++ R EI++K ++
Sbjct: 33 PPSTITATRLRTVNETFNKVLEKDRDLLAAVKKLARHQVAQKELLRVEAEIESKKQKVVQ 92
Query: 179 FANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYA 238
+A +++ ++ +D ++ R RV + ++ S V L+ DI++YA
Sbjct: 93 YAAQLRRSQ-------EDIANVLRKHRVVLQ-------NAKQKSKVV---LDPRDIVAYA 135
Query: 239 HRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDV---GLPKVVETKEK 295
HRI+ TT AP E+ G P+ G +PPAPQE MRA L ++ G K T +
Sbjct: 136 HRIAGTTSAPKEWQPG-FPMFGFMPPAPQEHMMRAGVLSRGIVAEIVTQGPEKYTSTAAR 194
Query: 296 T---------IEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKD---- 342
IE + P L DA K + P GWKPG PV+LP D
Sbjct: 195 VKQDGDGGEAIEKLNIPIGLGASDAQIRKQM----------PPGWKPGDPVDLPLDALLH 244
Query: 343 ---------FPLVPPPGWKPGDAVPLPPLDALPMPR 369
+ P WK GD PLPP D + + R
Sbjct: 245 YMGRSFFTEHGITLPESWKTGD--PLPP-DVMEILR 277
>gi|325187457|emb|CCA21995.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 335
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 101/225 (44%), Gaps = 50/225 (22%)
Query: 164 KIAREIKAKDSAILAFANKIKEAERVLDILVDDYS----DYRRPKRVKSEGGDGNDDDSG 219
++ EI+AK+ ++ +A +++ A+ + + + + R+PK+ +
Sbjct: 77 QLQNEIQAKEERVIEYATQLRNAQVAIQNVFEKHQYTLQQARKPKQEQ------------ 124
Query: 220 SCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTF 279
++ R+ +++YAHRI+ TT AP + G P+ G PPAPQE+ MRA
Sbjct: 125 ---VLLDPRV----VVAYAHRIAGTTSAPEHWQPGL-PMSGFAPPAPQEQIMRA------ 170
Query: 280 ADLDVGLPKVVETKE--KTIEAIIEPPPLQPVDANPLKDLAGLLPPNITV----PSGWKP 333
L G+ T+E KT + P V N LK GL + + P GWKP
Sbjct: 171 GVLSRGVIAETVTREGFKTTRIDVSSEPETSV-TNDLKTPIGLGATDAEIRNQMPPGWKP 229
Query: 334 GMPVELPKDFPLVP-------------PPGWKPGDAVPLPPLDAL 365
G PV+LP D L P WK GDAVP +D L
Sbjct: 230 GDPVDLPLDALLHYMGRSFFQENGINLPEFWKSGDAVPPEAMDLL 274
>gi|307106146|gb|EFN54393.1| hypothetical protein CHLNCDRAFT_53218 [Chlorella variabilis]
Length = 326
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 69/133 (51%), Gaps = 11/133 (8%)
Query: 234 ILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFA-DLDVGLPKVVET 292
+LSYAHR+SY++ AP + GQ PL PPAPQE +MRASQL+ F+ D +
Sbjct: 116 LLSYAHRLSYSSSAPLGYQTGQ-PLYFFKPPAPQEAEMRASQLHAFSRDWEARQQAAAAQ 174
Query: 293 KEKTIEAI---IEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPVELPKDFPLV--- 346
++ T A+ PP QP +A P A P + P G M + D +
Sbjct: 175 RQITATAVALGAAPPSAQP-EAAPGPGSATAAP--VPAPVGDVAAMAAQHGLDVAQLLAS 231
Query: 347 PPPGWKPGDAVPL 359
PPGWKPGD +PL
Sbjct: 232 IPPGWKPGDPIPL 244
>gi|412994053|emb|CCO14564.1| predicted protein [Bathycoccus prasinos]
Length = 264
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 68/143 (47%), Gaps = 17/143 (11%)
Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDVGLPK 288
+N+ +L YA R++YTT AP + G L G LPPAPQEE SQ+ + + +
Sbjct: 73 VNVETLLEYARRVAYTTSAPLNYQPGVSQLCGQLPPAPQEEHFLVSQMKKRHEEKLVHEQ 132
Query: 289 VVETK-EKTIEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPG--MPVELPKDFPL 345
++ K ++ E ++ + P ++L L WKPG P +PK
Sbjct: 133 ALQRKRQREEETTKRQKEIEEMAKLPKEELIRRL-------VQWKPGAAWPAGVPK---- 181
Query: 346 VPPPGWKPGDAVPLPPLDALPMP 368
PP GWKPGD PL L A P
Sbjct: 182 -PPVGWKPGD--PLAFLTAKAAP 201
>gi|145341318|ref|XP_001415760.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575983|gb|ABO94052.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 235
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/54 (57%), Positives = 38/54 (70%), Gaps = 1/54 (1%)
Query: 226 ESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTF 279
ES L +S++L YA RISYTTFAP + G PL G +PPAPQEE +AS+L F
Sbjct: 45 ESELKVSEVLDYARRISYTTFAPAGYVPG-APLCGIMPPAPQEEHFQASRLAAF 97
>gi|301099090|ref|XP_002898637.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105062|gb|EEY63114.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 232
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 75/165 (45%), Gaps = 38/165 (23%)
Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADLDV---G 285
L+ DI++YAHRI+ TT AP E+ G P+ G +PPAPQE MRA L ++ G
Sbjct: 6 LDPRDIVAYAHRIAGTTSAPKEWQPGF-PMFGFMPPAPQEHMMRAGVLSRGIVAEIVTQG 64
Query: 286 LPKVVETKEKT--------IEAIIEPPPLQPVDANPLKDLAGLLPPNITVPSGWKPGMPV 337
K T + +E + P L DA K + P GWKPG PV
Sbjct: 65 PEKYTSTAARVSKEGDGDAMEKLNIPIGLGASDAQIRKQM----------PPGWKPGDPV 114
Query: 338 ELPKD-------------FPLVPPPGWKPGDAVPLPPLDALPMPR 369
+LP D + P WK GD PLPP DA+ + R
Sbjct: 115 DLPLDALLHYMGRSFFTEHGITLPESWKTGD--PLPP-DAMEILR 156
>gi|330844437|ref|XP_003294132.1| hypothetical protein DICPUDRAFT_159085 [Dictyostelium purpureum]
gi|325075457|gb|EGC29342.1| hypothetical protein DICPUDRAFT_159085 [Dictyostelium purpureum]
Length = 347
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 77/156 (49%), Gaps = 5/156 (3%)
Query: 126 SSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKE 185
++P ++ +I+ + + + +L++ Q+ I +EI+ KD I AN +K+
Sbjct: 79 ATPQSVIQIMGRLIEVDKTYQKTLKKLEKHQKYQNEIANIQKEIEEKDKLISGLANNLKD 138
Query: 186 AERVLDI-LVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYT 244
E L+ L D + SE D N+ + + + +++SYAH+IS T
Sbjct: 139 IESFLENELSQDNAKMNPNISNVSEILDLNEPSTPVTPDIINREVTPEELISYAHKISGT 198
Query: 245 TFAPPEFGAG-QGPLRGAL-PPAPQEEQMRASQLYT 278
TFAP FG PL PPAPQ+E MRAS L+T
Sbjct: 199 TFAP--FGYQPHLPLISLFKPPAPQDEMMRASVLFT 232
>gi|66801189|ref|XP_629520.1| hypothetical protein DDB_G0292608 [Dictyostelium discoideum AX4]
gi|74850970|sp|Q54D00.1|MED4_DICDI RecName: Full=Putative mediator of RNA polymerase II transcription
subunit 4; AltName: Full=Putative mediator complex
subunit 4
gi|60462910|gb|EAL61107.1| hypothetical protein DDB_G0292608 [Dictyostelium discoideum AX4]
Length = 343
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 78/169 (46%), Gaps = 24/169 (14%)
Query: 125 DSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIK 184
+++P+ + +++ + + +L++ +Q +I +EI+ KD I A +K
Sbjct: 62 NATPNMVLQLMNHLIDVDKIYQNTLKKLEKHQKIQKEITEIQKEIEEKDKLISTLALNLK 121
Query: 185 EAERVLDILVDDYSDYRRPKRVKSEGGDGND-------------DDSGSCSTTVESRLNL 231
+ E L+ + V G+ +D DD+ S V ++
Sbjct: 122 DIESFLE-------NELSQDNVNINNGNASDGIVNEVLIDLKQPDDAASMDNIVNREVSP 174
Query: 232 SDILSYAHRISYTTFAPPEFGAG-QGPLRGAL-PPAPQEEQMRASQLYT 278
D++SYAH+IS TT AP FG PL PPAPQ+E MR+S L+T
Sbjct: 175 EDLISYAHKISGTTSAP--FGYQPNAPLASLFKPPAPQDEMMRSSVLFT 221
>gi|328868218|gb|EGG16598.1| putative mediator complex subunit 4 [Dictyostelium fasciculatum]
Length = 358
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 63/134 (47%), Gaps = 27/134 (20%)
Query: 144 QLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDILVDDYSDYRRP 203
QL + + +EI+D+Q +EI KD I+ A +K E+ L+ +++Y
Sbjct: 113 QLEQHLIYQKEIVDIQ-------KEIIEKDKVIMEMATTLKSLEKTLEEDIEEYG----- 160
Query: 204 KRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALP 263
K GD E + D++SYAH+IS TT AP + + P
Sbjct: 161 --AKKTKGD-------------EREVYPEDLISYAHKISGTTSAPFGYQPHLALIPLYKP 205
Query: 264 PAPQEEQMRASQLY 277
PAPQEE MRAS L+
Sbjct: 206 PAPQEEMMRASVLF 219
>gi|428183182|gb|EKX52040.1| hypothetical protein GUITHDRAFT_133791 [Guillardia theta CCMP2712]
Length = 324
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 24/145 (16%)
Query: 134 IIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLDIL 193
++ S Q +L E VAEL +I++++ + +I D+A+ F +K+A VL
Sbjct: 61 LLHLLCSFQVELRENVAELNKIVEVKKKIDDVKSKIDDSDAALKTFGRSLKDAAGVLQEA 120
Query: 194 VDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGA 253
+ S +T + + D++ YAH ISYT+ A E
Sbjct: 121 L-----------------------KTSSNTERVVEVEVEDLIKYAHVISYTSSA-QEGWE 156
Query: 254 GQGPLRGALPPAPQEEQMRASQLYT 278
PL GALPPAP + M S+L++
Sbjct: 157 PNTPLVGALPPAPHSDVMARSRLFS 181
>gi|281201187|gb|EFA75401.1| putative mediator complex subunit 4 [Polysphondylium pallidum
PN500]
Length = 355
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 28/143 (19%)
Query: 143 TQLFEAVAELQEILDLQDAKQK-------IAREIKAKDSAILAFANKIKEAERVLDILVD 195
+ L EA LQ+ L + Q I +EI+ KD I A+ +K+ E +L+ +
Sbjct: 96 SMLIEADKLLQKTLKQLEQHQTYQKEIVAIQKEIEEKDKQITTLASNLKDVEILLE---N 152
Query: 196 DYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPEFGAGQ 255
+ SD + + +K E ++ +++SYAH+IS TT AP + +
Sbjct: 153 EISDASKNEDIK------------------ERTISPEELISYAHKISGTTSAPYGYQSNL 194
Query: 256 GPLRGALPPAPQEEQMRASQLYT 278
+ PPAPQE+ MR+S L+T
Sbjct: 195 PLMPLYKPPAPQEDMMRSSTLFT 217
>gi|440799017|gb|ELR20078.1| hypothetical protein ACA1_114570 [Acanthamoeba castellanii str.
Neff]
Length = 310
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 69/147 (46%), Gaps = 14/147 (9%)
Query: 132 KEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFANKIKEAERVLD 191
+E++ L A+ +L+E Q ++ E+ KD + A ++++A+ L
Sbjct: 75 EELMERLLRTDQALLRALNQLREHQRQQQRLARLKDEVAKKDGVVEQLALQLQQAQGTLQ 134
Query: 192 ILVDDYS-DYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTFAPPE 250
++DD + RP + GG D +++ ++++YAH++SYTT APP
Sbjct: 135 RVIDDTTTKLDRP--LVPGGGSARRDG-----------VDVDEVVAYAHKVSYTTSAPPN 181
Query: 251 FGAGQGPLRGALPPAPQEEQMRASQLY 277
+ +R LPPAPQE + Y
Sbjct: 182 WNPNLPHMRMFLPPAPQEINILQGNFY 208
>gi|255073715|ref|XP_002500532.1| hypothetical protein MICPUN_56974 [Micromonas sp. RCC299]
gi|226515795|gb|ACO61790.1| hypothetical protein MICPUN_56974 [Micromonas sp. RCC299]
Length = 227
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 30/41 (73%), Gaps = 1/41 (2%)
Query: 229 LNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEE 269
++ +D++ YA RISYTTFAP + G PL G +PPAPQ+E
Sbjct: 41 VSAADVVEYARRISYTTFAPAGYQPG-APLHGIMPPAPQDE 80
>gi|299115493|emb|CBN75657.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 333
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 77/171 (45%), Gaps = 19/171 (11%)
Query: 111 FLSSQAQSQPPPPIDSSP-STIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREI 169
L + A+ Q +D++ + I++ + + +L +A L+E L + + +E+
Sbjct: 21 VLGALARCQSGTKLDANARAAIRKALVLLNGKEAKLTKASEALREHQQLHAEVESLRQEV 80
Query: 170 KAKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRL 229
+ + + A K+ EA+ L R KR+ ++G + E +
Sbjct: 81 RYGEQQVQDCARKLTEADAAL------RGPLRDAKRLL---------EAGRKAEAAE--I 123
Query: 230 NLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFA 280
++ DI+ YA RIS T AP + G + G PPAP+ E MRA L FA
Sbjct: 124 DVRDIVDYAQRISGITSAPAYWKPGMAMV-GFAPPAPRPEMMRAGALSAFA 173
>gi|239617708|ref|YP_002941030.1| Chromosome segregation ATPase-like protein [Kosmotoga olearia TBF
19.5.1]
gi|239506539|gb|ACR80026.1| Chromosome segregation ATPase-like protein [Kosmotoga olearia TBF
19.5.1]
Length = 1455
Score = 43.9 bits (102), Expect = 0.16, Method: Composition-based stats.
Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 7/124 (5%)
Query: 124 IDSSPSTIKEIIACF---TSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAILAFA 180
++S + ++E+I F L Q +EA EL+ + ++++A I +E+K D AI
Sbjct: 831 LNSRKADVEELIIKFERLIELNKQAYEADKELRRLKEIENA---IGKELKDNDDAISRCD 887
Query: 181 NKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNL-SDILSYAH 239
++K+ + IL D SD R+ + E G G + S S + ++S L S++ S++
Sbjct: 888 EELKKIHALKAILDKDLSDRRKEREFYREKGKGRNPVSLSLTEAIDSYRRLESELGSHSA 947
Query: 240 RISY 243
I +
Sbjct: 948 TIKH 951
>gi|320166264|gb|EFW43163.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 371
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 78/170 (45%), Gaps = 24/170 (14%)
Query: 110 TFLSSQAQS--QPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAR 167
T + QAQ + P+ S + + ++A LQ +AV +LQ ++ R
Sbjct: 104 TEATEQAQRLVEMADPVARSTAIAEALMATEARLQ----QAVTKLQVHQSGMARLNELKR 159
Query: 168 EIKAKDSAILAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVES 227
+ +D I +FA ++K A++ L ++VD S + ++E GD V+
Sbjct: 160 VLGEQDELIASFAAELKRADQRLGLIVD-RSKTKLDVMKQAERGD------------VDP 206
Query: 228 RLNLSDILSYAHRISYTTFAPPEFGAGQGPLRGALPPAPQEEQMRASQLY 277
R D++ YAH IS APP++ G PL P P E QMR S LY
Sbjct: 207 R----DVIRYAHYISTVATAPPDWKEGM-PLGIYSGPYPLETQMRDSLLY 251
>gi|395332718|gb|EJF65096.1| hypothetical protein DICSQDRAFT_50584 [Dichomitus squalens LYAD-421
SS1]
Length = 1081
Score = 41.6 bits (96), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 11/117 (9%)
Query: 188 RVLDILVDDYSD-YRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILSYAHRISYTTF 246
R+LD LVD+ ++ Y R R + +G DG +D + +T E+ L L +S SYT F
Sbjct: 727 RLLD-LVDELTNWYIRFNRRRLKGEDGEEDTVAALNTLFETLLTLCQTMS-----SYTPF 780
Query: 247 APPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADL-DVGLPKVVETKEKTIEAIIE 302
G LR +P AP+ +R+ F ++ + +V+E + K ++ +IE
Sbjct: 781 LTENIYQG---LREFIPEAPKAGDVRSIHFLPFPEVKEEYFDEVIERRVKRMQTVIE 834
>gi|406862031|gb|EKD15083.1| hypothetical protein MBM_06844 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 155
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 7/98 (7%)
Query: 118 SQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKDSAIL 177
SQPP +D P T++++ A L+ L +A L+++ DL+ R IK ++ +
Sbjct: 61 SQPPADLDHGPLTVRDLPAATDGLKHNLQKAKVLLKQLPDLE-------RSIKEQEDEMR 113
Query: 178 AFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGND 215
K+ E + VL LV+ + +R + ++ G +D
Sbjct: 114 DLEAKVGEQKAVLAQLVEVGASVKREREQRNRNGRADD 151
>gi|410926535|ref|XP_003976734.1| PREDICTED: mediator of RNA polymerase II transcription subunit
4-like [Takifugu rubripes]
Length = 253
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 74/171 (43%), Gaps = 30/171 (17%)
Query: 114 SQAQSQPPPPIDSSPSTIKEIIACFTSLQTQLFEAVAELQEILDLQDAKQKIAREIKAKD 173
S++Q PPP D +++ + E + Q+ + + Q + +E++ +D
Sbjct: 34 SRSQKLPPPGEDV------QVLELLVQRDAEFQELMEVAQQQGKVHEEMQVLEKEVEKRD 87
Query: 174 SAILAFANKIKEAERVLDILVDDYSDYRRPKRVKSEGGDGNDDDSGSCSTTVESRLNLSD 233
S I ++KEAE +L V Y+ +++K S + ++ +
Sbjct: 88 SDIQQLQKQLKEAEHILATAV-----YQAKEKLK------------SIDKARKGNISSEE 130
Query: 234 ILSYAHRISYTT--FAPPEFGAGQGPLRGALPPAPQEEQMRASQLYTFADL 282
I+ YAHRIS + AP + G P R P P + +MR+ L A+L
Sbjct: 131 IIKYAHRISASNAVCAPLNWVPGD-PRR----PYPTDLEMRSGLLGHMANL 176
>gi|331696553|ref|YP_004332792.1| FAD dependent oxidoreductase [Pseudonocardia dioxanivorans CB1190]
gi|326951242|gb|AEA24939.1| FAD dependent oxidoreductase [Pseudonocardia dioxanivorans CB1190]
Length = 393
Score = 38.1 bits (87), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 4/90 (4%)
Query: 281 DLDVGLPKVVETKEKTIEAIIEPPPLQPVDANPLKDLAGLLP-PNITVPSGWKPGMPVEL 339
D V P VV+ + + + + P+ PL+ A ++P + VP+G + M +E+
Sbjct: 207 DGHVEAPDVVDAAGAWADVVAQAAGVDPIGLVPLRRTAAIVPVADEAVPAGGRRSMVIEV 266
Query: 340 PKDFPLVPPPGW---KPGDAVPLPPLDALP 366
+ F PG PGD P PP DA P
Sbjct: 267 GESFYFKSEPGGLLVSPGDETPSPPCDARP 296
>gi|99082818|ref|YP_614972.1| flagellar basal body rod modification protein [Ruegeria sp. TM1040]
gi|99039098|gb|ABF65710.1| flagellar basal-body rod modification protein FlgD [Ruegeria sp.
TM1040]
Length = 231
Score = 37.7 bits (86), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 74/160 (46%), Gaps = 27/160 (16%)
Query: 102 SAFRGSLPTFL---SSQAQSQPP-PPIDSSPSTIKEIIACFTSLQTQLF--EAVAELQEI 155
SA TFL ++QA+ Q P P+DSS T + +A F+S++ Q+ E + +
Sbjct: 27 SALTSDFETFLRLMTTQAKYQDPLEPMDSSDYTAQ--LAQFSSVEQQVLTNETLTSMLTA 84
Query: 156 LDLQDAKQ---KIAREIKAKDSAILAFANKIK-------EAERVLDILVDDY-SDYRR-- 202
L L + Q + RE++A+ + ++ + A++V I+ DD ++ R
Sbjct: 85 LSLSNMSQLTGWVGREVRAQAAGYFDGSSSVSIAPNPAAVADKVEMIIYDDQEAEVGRVD 144
Query: 203 ------PKRVKSEGGDGNDDDSGSCSTTVESRLNLSDILS 236
P + E GDGN GS S +ES N ILS
Sbjct: 145 LPVSAEPYQWNGEDGDGNTLPEGSYSFVIESSSNGEVILS 184
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.312 0.132 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,422,129,033
Number of Sequences: 23463169
Number of extensions: 360503379
Number of successful extensions: 1598637
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 87
Number of HSP's successfully gapped in prelim test: 2841
Number of HSP's that attempted gapping in prelim test: 1568437
Number of HSP's gapped (non-prelim): 26288
length of query: 420
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 275
effective length of database: 8,957,035,862
effective search space: 2463184862050
effective search space used: 2463184862050
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 78 (34.7 bits)