BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020866
(320 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255561747|ref|XP_002521883.1| conserved hypothetical protein [Ricinus communis]
gi|223538921|gb|EEF40519.1| conserved hypothetical protein [Ricinus communis]
Length = 336
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 199/317 (62%), Positives = 237/317 (74%), Gaps = 17/317 (5%)
Query: 10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL---- 65
+KPWEQH+ +ISIPRFDYN+PS+LL SHSGFL+TC+IKREKSATKE +SIL K +
Sbjct: 29 MKPWEQHAAIISIPRFDYNAPSALLHNSHSGFLITCSIKREKSATKEVMSILEKYIGSYT 88
Query: 66 -ESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDF 124
+SSN + KRRK MGG CA G E + ED S +T T ET F
Sbjct: 89 KDSSNGSQGIKRRKTL---MGGTCAQGMESKDVSEDPD--------QVSEETHTVEETGF 137
Query: 125 DMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLK 184
+SLVKLTR+GLLLL F E+SP+ IVSNIFQ + SGSLKSP+WCHRIFPIQATC L
Sbjct: 138 TLSLVKLTRSGLLLLNFVGENSPDATEIVSNIFQRIESGSLKSPLWCHRIFPIQATCCLD 197
Query: 185 EKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDT-KDSDVSALLD 243
EKEL+ VSKLVL+F+ND+ NK RP+K+AVGYNRRG EE Q K KDT KDS + +LLD
Sbjct: 198 EKELRTVVSKLVLRFINDKANKFERPIKYAVGYNRRGIEETQAKNVKDTSKDSALCSLLD 257
Query: 244 RNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTK 303
RNKCF +VA+AVK+V+SDSAVDLKSPELS+LVELLP+SG+P+ LV VS+LPQ LV+ K
Sbjct: 258 RNKCFDVVASAVKDVISDSAVDLKSPELSILVELLPLSGVPNGSLVAAVSVLPQNLVSVK 317
Query: 304 PRLSIRALVSGTNAKNG 320
PRL I+ LVS NAK G
Sbjct: 318 PRLCIKPLVSDANAKKG 334
>gi|225424542|ref|XP_002285300.1| PREDICTED: uncharacterized protein LOC100267955 [Vitis vinifera]
gi|296081413|emb|CBI16846.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/335 (60%), Positives = 246/335 (73%), Gaps = 17/335 (5%)
Query: 1 MTEEKEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISI 60
M+EE+ EE +KPWEQHS VISIPRFDYN+PSSLL SHSGFL+TCTIKREKSATKEA+ I
Sbjct: 1 MSEEEREEGMKPWEQHSAVISIPRFDYNAPSSLLDHSHSGFLVTCTIKREKSATKEAMPI 60
Query: 61 LHKRL-----------ESSNTAGDSKRRKVCTDDMGGKCADGAE----INSIEEDSAGGL 105
L K + ESS+ +KRRK+CT+++ +C + E N+ ED G L
Sbjct: 61 LEKYVGSFSSCSSESLESSDANATTKRRKICTEEIDEECVNSVENKTASNNCGED-GGEL 119
Query: 106 QKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSL 165
K+ SS D E +SLVKLTR+GLLL FPR +S +T+++VS I +SL SGS+
Sbjct: 120 SKDAGVSSANRDAIVENGHVLSLVKLTRSGLLLFVFPRNNSVDTVDVVSQIIRSLQSGSV 179
Query: 166 KSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEK 225
K P+WCHRIFPIQATC L EKEL V+KLV+QFVN+EQNK +RP+KFAVGYNRRG EE
Sbjct: 180 KPPLWCHRIFPIQATCRLDEKELHEVVTKLVVQFVNNEQNKFARPIKFAVGYNRRGIEET 239
Query: 226 QNKIPKDT-KDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLP 284
+ KIPK T +D + ALLDR KCF++VA AVK VSDS VDLKSPELSVLVELLP+S +P
Sbjct: 240 EMKIPKSTPRDCNSHALLDRKKCFSVVATAVKGAVSDSVVDLKSPELSVLVELLPLSRVP 299
Query: 285 SELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKN 319
+ +VV VS+LPQ L+TTKPRL I+AL+S T N
Sbjct: 300 NGSMVVAVSVLPQNLITTKPRLCIKALLSDTKVGN 334
>gi|356575470|ref|XP_003555863.1| PREDICTED: uncharacterized protein LOC100797260 [Glycine max]
Length = 320
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 186/319 (58%), Positives = 227/319 (71%), Gaps = 17/319 (5%)
Query: 6 EEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL 65
EE+ + PWEQHS VI +PRFDYN+PSSLL+ SHSGFL+TCTIKREKSATKEAISILHK L
Sbjct: 3 EEKGMSPWEQHSAVIKLPRFDYNAPSSLLRNSHSGFLITCTIKREKSATKEAISILHKFL 62
Query: 66 ESSNTAG---------DSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKT 116
S + SKRRK+CT D +C D E +A + S VK
Sbjct: 63 GSGHYHTLNNPNEEDTTSKRRKICTQDAAEECFD-------ESPTANSGADGKLSSPVKA 115
Query: 117 DTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFP 176
+ + ++SLVKLTRNGLLLLTFP P+T+ IVSNI Q+L SG++ PVWCHRIFP
Sbjct: 116 EAEKDGIANLSLVKLTRNGLLLLTFPINTHPDTVTIVSNIIQALESGTVSLPVWCHRIFP 175
Query: 177 IQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDS 236
IQATC L EKELQ VS LV +F+ D+Q+KL RP+KFAVG+NRRG EE +++ DS
Sbjct: 176 IQATCNLNEKELQEVVSMLVKKFLADKQDKLERPLKFAVGFNRRGIEE-TTLAKENSNDS 234
Query: 237 DVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILP 296
+LLDRNKCF +VA+AV VV DS VDL+SPELSVLVELLP+SG+P ++V VS+LP
Sbjct: 235 KAFSLLDRNKCFGVVASAVNHVVEDSVVDLRSPELSVLVELLPLSGVPDASIIVAVSVLP 294
Query: 297 QKLVTTKPRLSIRALVSGT 315
+ LV+TKPRL I+AL S T
Sbjct: 295 RNLVSTKPRLCIKALNSNT 313
>gi|42561854|ref|NP_172400.2| uncharacterized protein [Arabidopsis thaliana]
gi|14334964|gb|AAK59659.1| unknown protein [Arabidopsis thaliana]
gi|332190304|gb|AEE28425.1| uncharacterized protein [Arabidopsis thaliana]
Length = 348
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 183/324 (56%), Positives = 224/324 (69%), Gaps = 12/324 (3%)
Query: 5 KEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKR 64
+E E L PWEQHS +ISIPRFDY +PSSLL SHSGFL+TC IKREKSATKE +SIL K
Sbjct: 27 EEAETLTPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCNIKREKSATKEVMSILGKY 86
Query: 65 LESSN-------TAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTD 117
+ S + + SK++KVC + G + +E D+ +N +K
Sbjct: 87 IGSMHEEKPEVLNSTASKKQKVCAQETE---EGGEKTVPLENDALQETGENPNVEDLKL- 142
Query: 118 TNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPI 177
N E + MSLVKLT++GLLL TFP E+SPNT NIVS +FQS+ SG+LK+P+WCHRIFP+
Sbjct: 143 ANEEHNSLMSLVKLTKSGLLLFTFPVENSPNTTNIVSRVFQSMESGALKAPIWCHRIFPV 202
Query: 178 QATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSD 237
QATC L EKEL+ TVSKLV +FVND+ N LS+PVKFA GY RRG EE + KI KD D
Sbjct: 203 QATCGLTEKELRETVSKLVQRFVNDKDNTLSKPVKFAAGYQRRGAEETKGKIRKDASDVL 262
Query: 238 VSA-LLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILP 296
V LLDR KCF VAA VK++V DS VDLKSPEL VLVELLP+S + S V VS+LP
Sbjct: 263 VQCPLLDRIKCFETVAAGVKDIVPDSVVDLKSPELCVLVELLPLSRISSGSFVAAVSVLP 322
Query: 297 QKLVTTKPRLSIRALVSGTNAKNG 320
+LV+TKP+L I+ LV + K G
Sbjct: 323 HRLVSTKPKLCIKPLVPESKHKKG 346
>gi|24030215|gb|AAN41286.1| unknown protein [Arabidopsis thaliana]
Length = 373
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 183/324 (56%), Positives = 224/324 (69%), Gaps = 12/324 (3%)
Query: 5 KEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKR 64
+E E L PWEQHS +ISIPRFDY +PSSLL SHSGFL+TC IKREKSATKE +SIL K
Sbjct: 52 EEAETLTPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCNIKREKSATKEVMSILGKY 111
Query: 65 LESSN-------TAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTD 117
+ S + + SK++KVC + G + +E D+ +N +K
Sbjct: 112 IGSMHEEKPEVLNSTASKKQKVCAQETE---EGGEKTVPLENDALQETGENPNVEDLKL- 167
Query: 118 TNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPI 177
N E + MSLVKLT++GLLL TFP E+SPNT NIVS +FQS+ SG+LK+P+WCHRIFP+
Sbjct: 168 ANEEHNSLMSLVKLTKSGLLLFTFPVENSPNTTNIVSRVFQSMESGALKAPIWCHRIFPV 227
Query: 178 QATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSD 237
QATC L EKEL+ TVSKLV +FVND+ N LS+PVKFA GY RRG EE + KI KD D
Sbjct: 228 QATCGLTEKELRETVSKLVQRFVNDKDNTLSKPVKFAAGYQRRGAEETKGKIRKDASDVL 287
Query: 238 VSA-LLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILP 296
V LLDR KCF VAA VK++V DS VDLKSPEL VLVELLP+S + S V VS+LP
Sbjct: 288 VQCPLLDRIKCFETVAAGVKDIVPDSVVDLKSPELCVLVELLPLSRISSGSFVAAVSVLP 347
Query: 297 QKLVTTKPRLSIRALVSGTNAKNG 320
+LV+TKP+L I+ LV + K G
Sbjct: 348 HRLVSTKPKLCIKPLVPESKHKKG 371
>gi|388515009|gb|AFK45566.1| unknown [Lotus japonicus]
Length = 369
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 179/323 (55%), Positives = 224/323 (69%), Gaps = 19/323 (5%)
Query: 6 EEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL 65
E+ + PWEQHS VI + R+DYN+P+SLL SHSGFL+TCTIKREKSATKEA+SIL K +
Sbjct: 46 EKAMMTPWEQHSGVIKLSRYDYNTPASLLLHSHSGFLITCTIKREKSATKEALSILEKFV 105
Query: 66 ESSNTAG-----------DSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSV 114
+ N G KR +V T D+GG+C D E S+ ++S G + S
Sbjct: 106 GAFNAGGFECSKNPGEYSSFKRMRVDTQDIGGECLDVKETESVTDNSGDG----KVLSPA 161
Query: 115 KTDTNTETDF-DMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHR 173
K + + D+SLVKLTRNGLLL +P P+T+NIVSNI QSL SGS PVWCHR
Sbjct: 162 KAEAEDKDGVTDLSLVKLTRNGLLLFIYPNNMFPDTVNIVSNIIQSLESGSTSLPVWCHR 221
Query: 174 IFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDT 233
IFPIQATC L EKE Q VS LV +F+ +Q KL RP+KFAVGYNRRG E+ K K+
Sbjct: 222 IFPIQATCRLNEKEFQEVVSMLVKKFLASKQGKLERPLKFAVGYNRRGIED--TKFAKEN 279
Query: 234 KD-SDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGV 292
+ S+ +LLDRNKCF++VA+AV +VV DS VDL+SPELS+LVELLP+S +P+ LVV V
Sbjct: 280 SNGSNAFSLLDRNKCFSVVASAVNDVVEDSVVDLRSPELSILVELLPLSRVPNGSLVVAV 339
Query: 293 SILPQKLVTTKPRLSIRALVSGT 315
S LP+ LV+TKPRL ++AL S T
Sbjct: 340 SALPRNLVSTKPRLCVKALTSNT 362
>gi|297849234|ref|XP_002892498.1| hypothetical protein ARALYDRAFT_471019 [Arabidopsis lyrata subsp.
lyrata]
gi|297338340|gb|EFH68757.1| hypothetical protein ARALYDRAFT_471019 [Arabidopsis lyrata subsp.
lyrata]
Length = 348
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 181/325 (55%), Positives = 226/325 (69%), Gaps = 16/325 (4%)
Query: 6 EEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL 65
E E L PWEQHS +ISIPRFDY +PSSLL SHSGFL+TCTIKREKSATKE +SIL K +
Sbjct: 28 EAETLPPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCTIKREKSATKEVMSILAKFI 87
Query: 66 ESSNT-------AGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVK--T 116
S + + SK++K+ ++ G E ED A LQ+ +V+
Sbjct: 88 GSMHEEKPQVLRSTASKKQKLSEQEI----EVGEEKTVPPEDDA--LQETGGDPNVEDLK 141
Query: 117 DTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFP 176
N E + MSLVKLT++GLLL TFP E+SP+T NIVS +FQS+ SG+LK+P+WCHRIFP
Sbjct: 142 LANEEHNSLMSLVKLTKSGLLLFTFPIENSPDTTNIVSRVFQSMESGALKAPIWCHRIFP 201
Query: 177 IQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDS 236
+QATC L EKEL+ TVSKLV +FV+++ N LS+PVKFA GY RRG EE + KI K+ +
Sbjct: 202 VQATCGLTEKELRETVSKLVQRFVDNKDNTLSKPVKFAAGYQRRGVEETKGKIQKEASEV 261
Query: 237 -DVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSIL 295
D LLDR KCF VAA VK++V DS VDLKSPEL VLVELLP+S +PS V VS+L
Sbjct: 262 LDQCPLLDRIKCFETVAAGVKDIVPDSVVDLKSPELCVLVELLPLSRIPSGSFVAAVSVL 321
Query: 296 PQKLVTTKPRLSIRALVSGTNAKNG 320
P +LV+TKP+L I+ LV + K G
Sbjct: 322 PHRLVSTKPKLCIKPLVPESKHKKG 346
>gi|449487893|ref|XP_004157853.1| PREDICTED: uncharacterized protein LOC101227649 [Cucumis sativus]
Length = 345
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 188/334 (56%), Positives = 228/334 (68%), Gaps = 34/334 (10%)
Query: 4 EKEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHK 63
E E + + PWEQHS VISIPRFDYN+PS+LL R +GFL+TCTIKREKSATKEAISIL K
Sbjct: 28 ETERKMMTPWEQHSAVISIPRFDYNAPSALLHRCQTGFLITCTIKREKSATKEAISILQK 87
Query: 64 RLESSNTAGD-----------SKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHS 112
++ N++ SKRRKV D ++ S GG + H+
Sbjct: 88 YVQYFNSSMSETLVVSDENETSKRRKVSED--------------VDHRSVGGESSTDEHA 133
Query: 113 ------SVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLK 166
S K++ E +SLVKLTR+GLLL TF ++ SP+T+ IV +I QSL + +LK
Sbjct: 134 KETSLISTKSEAKVEKCSPISLVKLTRSGLLLFTFTKDISPDTVYIVKDIMQSLEARTLK 193
Query: 167 SPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQ 226
S WCHRIFPIQATC L E +LQ VSKLVL F+ND+ N LS PVKFA+GYNRRG EE +
Sbjct: 194 SLAWCHRIFPIQATCSLNENDLQGVVSKLVLHFMNDKGNILSHPVKFAIGYNRRGIEETE 253
Query: 227 NKIPKDTKDSD-VSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPS 285
K K +DS V+ +L R+KCF+IVAAAVK VVSD+ VDLKSPEL VLVELLP+SGLPS
Sbjct: 254 MK--KTFEDSSGVNVILGRDKCFSIVAAAVKGVVSDAIVDLKSPELCVLVELLPVSGLPS 311
Query: 286 ELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKN 319
VVGVS+L LVTTKPRL I+AL S T AK+
Sbjct: 312 GSSVVGVSVLSNNLVTTKPRLCIKALTSDTKAKS 345
>gi|283132369|dbj|BAI63590.1| hypothetical protein [Lotus japonicus]
Length = 336
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 170/311 (54%), Positives = 209/311 (67%), Gaps = 28/311 (9%)
Query: 6 EEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL 65
E+ + PWEQHS VI + R+DYN+P+SLL SHSGFL+TCTIKREKSATKEA+SIL K
Sbjct: 46 EKAMMTPWEQHSGVIKLSRYDYNAPASLLLHSHSGFLITCTIKREKSATKEALSILEKNC 105
Query: 66 ESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFD 125
G K+ M GK A+ + ++D D
Sbjct: 106 R----PGFIKQAWSWALMMAGKVLSPAKAEAEDKDGVT---------------------D 140
Query: 126 MSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKE 185
+SLVKLTRNGLLL +P P+T+NIVSNI QSL SGS PVWCHRIFPIQATC L E
Sbjct: 141 LSLVKLTRNGLLLFIYPNNMFPDTVNIVSNIIQSLESGSTSLPVWCHRIFPIQATCRLNE 200
Query: 186 KELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKD-SDVSALLDR 244
KELQ VS LV +F+ +Q KL RP+KFAVGYNRRG E+ K K+ + S+ +LLDR
Sbjct: 201 KELQEVVSMLVKKFLASKQGKLERPLKFAVGYNRRGIED--TKFAKENSNGSNAFSLLDR 258
Query: 245 NKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKP 304
NKCF++VA+AV +VV DS VDL+SPELS+LVELLP+S +P+ LVV VS LP+ LV+TKP
Sbjct: 259 NKCFSVVASAVNDVVEDSVVDLRSPELSILVELLPLSRVPNGSLVVAVSALPRNLVSTKP 318
Query: 305 RLSIRALVSGT 315
RL ++AL S T
Sbjct: 319 RLCVKALTSNT 329
>gi|357136974|ref|XP_003570077.1| PREDICTED: uncharacterized protein LOC100839515 isoform 1
[Brachypodium distachyon]
Length = 359
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 163/323 (50%), Positives = 217/323 (67%), Gaps = 16/323 (4%)
Query: 10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSN 69
L+PWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAISIL + + ++
Sbjct: 37 LQPWEQHAAVINLPRYDYRASRSLLLRSHSGFLITCPIKREKSATKEAISILGEYINPAS 96
Query: 70 TAGD-----------SKRRKVCTDDMGGKCADGAEINSIEE--DSAGGLQKNECHSSVKT 116
+ SK+RK C++ G + ++ A N ++ +S G ++ KT
Sbjct: 97 SHSSENLEPCVIEVASKKRKTCSEASGIENSEDAVTNGKDDASESTGCIEVETSSLHSKT 156
Query: 117 DTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFP 176
+ ++SLVKL R+GLL +F + + +++ +F SL SG LKSP WCHRIFP
Sbjct: 157 SEIIDRTSNLSLVKLARSGLLFFSF-TSGGLHVVQMLTEVFHSLRSGKLKSPQWCHRIFP 215
Query: 177 IQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDS 236
IQ TCVL E EL ATVSKL L F+ +++++ P+KFAVGYNRRG +E K K+ DS
Sbjct: 216 IQETCVLSETELHATVSKLFLDFLRNKEDQY-EPIKFAVGYNRRGIDETVTKTQKNDNDS 274
Query: 237 DVS-ALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSIL 295
+ A++DR KCF +VAAA+K V D+ VDLKSPE++VLVELLPISG+P V GVS+L
Sbjct: 275 SIQQAVMDREKCFKVVAAAIKSVAEDAIVDLKSPEVAVLVELLPISGVPIGSSVAGVSVL 334
Query: 296 PQKLVTTKPRLSIRALVSGTNAK 318
P +LV TKPRL IRALVS T AK
Sbjct: 335 PAELVATKPRLCIRALVSDTKAK 357
>gi|115447851|ref|NP_001047705.1| Os02g0672400 [Oryza sativa Japonica Group]
gi|50251230|dbj|BAD27816.1| unknown protein [Oryza sativa Japonica Group]
gi|50251921|dbj|BAD27858.1| unknown protein [Oryza sativa Japonica Group]
gi|113537236|dbj|BAF09619.1| Os02g0672400 [Oryza sativa Japonica Group]
gi|215678818|dbj|BAG95255.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 366
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 209/322 (64%), Gaps = 16/322 (4%)
Query: 10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL---- 65
L+PWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAISIL +
Sbjct: 43 LQPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAISILENSIGHAN 102
Query: 66 -------ESSNTAGDSKRRKVC--TDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKT 116
E S+T +K+RK+C T D+ ++ G ++K+ KT
Sbjct: 103 SYSSEKSEPSDTEVSAKKRKICSETPDIENSGDAVTYEKGDASETTGSVEKDSVSPHSKT 162
Query: 117 DTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFP 176
N + ++SLVKL+R+GLL +FP + +++ +F SL SG LKSP WCHRIFP
Sbjct: 163 SKNVDDTSNLSLVKLSRSGLLFFSFP-SGGLRVVQMLTQMFHSLHSGKLKSPQWCHRIFP 221
Query: 177 IQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKD- 235
IQ TCVL E EL TVSKL L FV ++++ P+KFAVGYNRRG +E + K K+ +
Sbjct: 222 IQETCVLSEAELHTTVSKLFLDFVKSKEDQ-DEPIKFAVGYNRRGIDETEMKGQKNGNEG 280
Query: 236 SDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSIL 295
S L+DR++CF +VA AVK V +S VDL+SPE++VLVE+LPISG+ V GVS+L
Sbjct: 281 SKQQTLMDRDQCFKVVAGAVKSVAENSIVDLRSPEVAVLVEMLPISGVSLGSSVAGVSVL 340
Query: 296 PQKLVTTKPRLSIRALVSGTNA 317
P +L++TKPRL ++ALV A
Sbjct: 341 PSELISTKPRLCVKALVPDAKA 362
>gi|218191337|gb|EEC73764.1| hypothetical protein OsI_08432 [Oryza sativa Indica Group]
Length = 366
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 209/322 (64%), Gaps = 16/322 (4%)
Query: 10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL---- 65
L+PWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAISIL +
Sbjct: 43 LQPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAISILENSIGHAN 102
Query: 66 -------ESSNTAGDSKRRKVC--TDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKT 116
E S+T +K+RK+C T D+ ++ G ++K+ KT
Sbjct: 103 NYSSEKSEPSDTEVSAKKRKICSETPDIENSGDAVTYEKGDASETTGSVEKDSVSPHSKT 162
Query: 117 DTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFP 176
N + ++SLVKL+R+GLL +FP + +++ +F SL SG LKSP WCHRIFP
Sbjct: 163 SKNVDDTSNLSLVKLSRSGLLFFSFP-SGGLRVVQMLTQMFHSLHSGKLKSPQWCHRIFP 221
Query: 177 IQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKD- 235
IQ TCVL E EL TVSKL L FV ++++ P+KFAVGYNRRG +E + K K+ +
Sbjct: 222 IQETCVLSEAELHTTVSKLFLDFVKSKEDQ-DEPIKFAVGYNRRGIDETEMKGQKNGNEG 280
Query: 236 SDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSIL 295
S L+DR++CF +VA AVK V +S VDL+SPE++VLVE+LPISG+ V GVS+L
Sbjct: 281 SKQQTLMDRDQCFKVVAGAVKSVAENSIVDLRSPEVAVLVEMLPISGVSLGSSVAGVSVL 340
Query: 296 PQKLVTTKPRLSIRALVSGTNA 317
P +L++TKPRL ++ALV A
Sbjct: 341 PSELISTKPRLCVKALVPDAKA 362
>gi|125583219|gb|EAZ24150.1| hypothetical protein OsJ_07895 [Oryza sativa Japonica Group]
Length = 376
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 209/332 (62%), Gaps = 26/332 (7%)
Query: 10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL---- 65
L+PWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAISIL +
Sbjct: 43 LQPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAISILENSIGHAN 102
Query: 66 -------ESSNTAGDSKRRKVC--TDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKT 116
E S+T +K+RK+C T D+ ++ G ++K+ KT
Sbjct: 103 SYSSEKSEPSDTEVSAKKRKICSETPDIENSGDAVTYEKGDASETTGSVEKDSVSPHSKT 162
Query: 117 DTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFP 176
N + ++SLVKL+R+GLL +FP + +++ +F SL SG LKSP WCHRIFP
Sbjct: 163 SKNVDDTSNLSLVKLSRSGLLFFSFP-SGGLRVVQMLTQMFHSLHSGKLKSPQWCHRIFP 221
Query: 177 IQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKD- 235
IQ TCVL E EL TVSKL L FV ++++ P+KFAVGYNRRG +E + K K+ +
Sbjct: 222 IQETCVLSEAELHTTVSKLFLDFVKSKEDQ-DEPIKFAVGYNRRGIDETEMKGQKNGNEG 280
Query: 236 SDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPE----------LSVLVELLPISGLPS 285
S L+DR++CF +VA AVK V +S VDL+SPE ++VLVE+LPISG+
Sbjct: 281 SKQQTLMDRDQCFKVVAGAVKSVAENSIVDLRSPEHVNFWMQQMSVAVLVEMLPISGVSL 340
Query: 286 ELLVVGVSILPQKLVTTKPRLSIRALVSGTNA 317
V GVS+LP +L++TKPRL ++ALV A
Sbjct: 341 GSSVAGVSVLPSELISTKPRLCVKALVPDAKA 372
>gi|147775350|emb|CAN65717.1| hypothetical protein VITISV_023512 [Vitis vinifera]
Length = 371
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 147/256 (57%), Positives = 184/256 (71%), Gaps = 14/256 (5%)
Query: 74 SKRRKVCTDDMGGKCADGAE----INSIEEDS---------AGGLQKNECHSSVKTDTNT 120
+KRRK+CT+++ +C + AE N+ ED G L K+ SS D
Sbjct: 84 TKRRKICTEEIDEECVNSAENKTASNNCGEDGDIADLCLVQVGELSKDAGVSSANXDAIV 143
Query: 121 ETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQAT 180
E +SLVKLTR+GLLL FPR +S +T+++VS I +SL SGS+K P+WCH IFPIQAT
Sbjct: 144 ENGHVLSLVKLTRSGLLLFVFPRNNSVDTVDVVSQIIRSLQSGSVKPPLWCHXIFPIQAT 203
Query: 181 CVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDT-KDSDVS 239
C L EKEL V+KLV+QFVN+EQNK +RP+KFAVGYNRRG EE + KIPK T +D +
Sbjct: 204 CRLDEKELHEVVTKLVVQFVNNEQNKFARPIKFAVGYNRRGIEETEMKIPKSTPRDCNSH 263
Query: 240 ALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKL 299
ALLDR KCF++VA AVK VSDS VDLKSPELSVLVELLP+S +P+ +VV VS+LPQ L
Sbjct: 264 ALLDRKKCFSVVATAVKGAVSDSVVDLKSPELSVLVELLPLSRVPNGSMVVAVSVLPQNL 323
Query: 300 VTTKPRLSIRALVSGT 315
+TTKPRL I+AL+S T
Sbjct: 324 ITTKPRLCIKALLSDT 339
>gi|326510883|dbj|BAJ91789.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 157/327 (48%), Positives = 217/327 (66%), Gaps = 22/327 (6%)
Query: 9 ELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESS 68
+L+PWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAISIL + + S
Sbjct: 32 DLQPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAISILGEYI-SD 90
Query: 69 NTAGDSKRRKVCTDDMGGK----CADGAEINSIEEDSAGG------------LQKNECHS 112
++ S+ + C + K ++ +EI E+ + G ++ N HS
Sbjct: 91 ASSHSSENFEPCVMETASKKRKIFSEASEIEHSEDAATNGKGDAPEFTGSIEVETNSVHS 150
Query: 113 SVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCH 172
T N + ++SL+KL+ +GLL +FP + + +++ +F+SL SG LKSP WCH
Sbjct: 151 D--TSGNPDRTSNLSLIKLSMSGLLFFSFPT-GGVHVVQMLTEVFRSLRSGKLKSPQWCH 207
Query: 173 RIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKD 232
RIFPIQ TCVL E EL ATVSKL L F +++++ P+KFAVGY+RRG +E NK K+
Sbjct: 208 RIFPIQETCVLSETELHATVSKLFLDFFRNKEDQ-DEPIKFAVGYSRRGIDETVNKTQKN 266
Query: 233 TKDSDVS-ALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVG 291
+S V AL+DR +CF +VAAAVK V D+ VDLKSPE++VLVE+LPISG+P V G
Sbjct: 267 DNNSSVQQALMDRQQCFKVVAAAVKSVAKDAIVDLKSPEVAVLVEVLPISGVPVGSSVAG 326
Query: 292 VSILPQKLVTTKPRLSIRALVSGTNAK 318
VS+LP +LV+ KPRL ++A+V T K
Sbjct: 327 VSVLPAELVSAKPRLCVKAMVFDTKTK 353
>gi|357136976|ref|XP_003570078.1| PREDICTED: uncharacterized protein LOC100839515 isoform 2
[Brachypodium distachyon]
Length = 330
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 199/312 (63%), Gaps = 23/312 (7%)
Query: 10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSN 69
L+PWEQH+ VI++PR+DY + SLL RSHSGFL+TC LE
Sbjct: 37 LQPWEQHAAVINLPRYDYRASRSLLLRSHSGFLITCP------------------LEPCV 78
Query: 70 TAGDSKRRKVCTDDMGGKCADGAEINSIEE--DSAGGLQKNECHSSVKTDTNTETDFDMS 127
SK+RK C++ G + ++ A N ++ +S G ++ KT + ++S
Sbjct: 79 IEVASKKRKTCSEASGIENSEDAVTNGKDDASESTGCIEVETSSLHSKTSEIIDRTSNLS 138
Query: 128 LVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKE 187
LVKL R+GLL +F + + +++ +F SL SG LKSP WCHRIFPIQ TCVL E E
Sbjct: 139 LVKLARSGLLFFSF-TSGGLHVVQMLTEVFHSLRSGKLKSPQWCHRIFPIQETCVLSETE 197
Query: 188 LQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVS-ALLDRNK 246
L ATVSKL L F+ +++++ P+KFAVGYNRRG +E K K+ DS + A++DR K
Sbjct: 198 LHATVSKLFLDFLRNKEDQY-EPIKFAVGYNRRGIDETVTKTQKNDNDSSIQQAVMDREK 256
Query: 247 CFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRL 306
CF +VAAA+K V D+ VDLKSPE++VLVELLPISG+P V GVS+LP +LV TKPRL
Sbjct: 257 CFKVVAAAIKSVAEDAIVDLKSPEVAVLVELLPISGVPIGSSVAGVSVLPAELVATKPRL 316
Query: 307 SIRALVSGTNAK 318
IRALVS T AK
Sbjct: 317 CIRALVSDTKAK 328
>gi|413938203|gb|AFW72754.1| hypothetical protein ZEAMMB73_044682 [Zea mays]
Length = 353
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 203/319 (63%), Gaps = 14/319 (4%)
Query: 8 EELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHK---- 63
+ELKPWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAI IL +
Sbjct: 36 QELKPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAIPILEEYHAN 95
Query: 64 -----RLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDT 118
E + K+RK+C++ G NS + G N K +
Sbjct: 96 CCVSEHAEPCDVKNAIKKRKLCSEASGSLEEAVTNGNSSSVSESIGSTGNTSSPQSKVND 155
Query: 119 NTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQ 178
N + ++SLVKL+R+GLL FP + + +++ I SL SG LKSP WCHRIFPIQ
Sbjct: 156 NVDRASNLSLVKLSRSGLLFFKFP-SGGVHVVEMLTEILHSLRSGKLKSPQWCHRIFPIQ 214
Query: 179 ATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDV 238
TC+L E++L ATVSKL L F + N+ +P+KFAV YNRRG +E + + S+
Sbjct: 215 ETCILSEEDLHATVSKLFLDFSRSKTNE-DKPIKFAVAYNRRGIDETEKN---SNEGSNQ 270
Query: 239 SALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQK 298
L+DR +CF +VAAAVK V +S VDL+SPE++VLVE+LP+SG+P V GVS+LP +
Sbjct: 271 QTLMDREQCFKVVAAAVKSVAENSVVDLRSPEVAVLVEMLPVSGVPLGSSVAGVSVLPAE 330
Query: 299 LVTTKPRLSIRALVSGTNA 317
L++TKPRL +R+LV A
Sbjct: 331 LISTKPRLCVRSLVPDAKA 349
>gi|219363445|ref|NP_001136900.1| uncharacterized protein LOC100217057 [Zea mays]
gi|194697532|gb|ACF82850.1| unknown [Zea mays]
Length = 353
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 202/319 (63%), Gaps = 14/319 (4%)
Query: 8 EELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHK---- 63
+ELKPWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAI IL +
Sbjct: 36 QELKPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAIPILEEYHAN 95
Query: 64 -----RLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDT 118
E + K+RK+C++ G NS + G N K +
Sbjct: 96 CCVSEHAEPCDVKNAIKKRKLCSEASGSLEEAVTNGNSSSVSESIGSTGNTSSPQSKVND 155
Query: 119 NTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQ 178
N + ++SLVKL+R+GLL FP + + +++ I SL SG LKSP WCHRIFPIQ
Sbjct: 156 NVDRASNLSLVKLSRSGLLFFKFP-SGGVHVVEMLTEILHSLRSGKLKSPQWCHRIFPIQ 214
Query: 179 ATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDV 238
TC+L E++L ATVSKL L F + N+ +P+KFAV YNRRG +E + + S+
Sbjct: 215 ETCILSEEDLHATVSKLFLDFSRSKTNE-DKPIKFAVAYNRRGIDETEKN---SNEGSNQ 270
Query: 239 SALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQK 298
L+DR +CF +VAAAVK V +S VDL+SPE++VLVE+LP+SG+P V GVS+ P +
Sbjct: 271 QTLMDREQCFKVVAAAVKSVAENSVVDLRSPEVAVLVEMLPVSGVPLGSSVAGVSVPPAE 330
Query: 299 LVTTKPRLSIRALVSGTNA 317
L++TKPRL +R+LV A
Sbjct: 331 LISTKPRLCVRSLVPDAKA 349
>gi|195627716|gb|ACG35688.1| hypothetical protein [Zea mays]
gi|413938201|gb|AFW72752.1| hypothetical protein ZEAMMB73_044682 [Zea mays]
Length = 326
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 191/310 (61%), Gaps = 23/310 (7%)
Query: 8 EELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLES 67
+ELKPWEQH+ VI++PR+DY + SLL RSHSGFL+TC IK K AI
Sbjct: 36 QELKPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIK--PCDVKNAI--------- 84
Query: 68 SNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMS 127
K+RK+C++ G NS + G N K + N + ++S
Sbjct: 85 -------KKRKLCSEASGSLEEAVTNGNSSSVSESIGSTGNTSSPQSKVNDNVDRASNLS 137
Query: 128 LVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKE 187
LVKL+R+GLL FP + + +++ I SL SG LKSP WCHRIFPIQ TC+L E++
Sbjct: 138 LVKLSRSGLLFFKFP-SGGVHVVEMLTEILHSLRSGKLKSPQWCHRIFPIQETCILSEED 196
Query: 188 LQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKC 247
L ATVSKL L F + N+ +P+KFAV YNRRG +E + + S+ L+DR +C
Sbjct: 197 LHATVSKLFLDFSRSKTNE-DKPIKFAVAYNRRGIDETEKN---SNEGSNQQTLMDREQC 252
Query: 248 FTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLS 307
F +VAAAVK V +S VDL+SPE++VLVE+LP+SG+P V GVS+LP +L++TKPRL
Sbjct: 253 FKVVAAAVKSVAENSVVDLRSPEVAVLVEMLPVSGVPLGSSVAGVSVLPAELISTKPRLC 312
Query: 308 IRALVSGTNA 317
+R+LV A
Sbjct: 313 VRSLVPDAKA 322
>gi|224108177|ref|XP_002314748.1| predicted protein [Populus trichocarpa]
gi|222863788|gb|EEF00919.1| predicted protein [Populus trichocarpa]
Length = 280
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 122/207 (58%), Positives = 155/207 (74%), Gaps = 1/207 (0%)
Query: 65 LESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDF 124
LE S+ +K+R++ TD+ KC + + SI ++ +GG K++C S K D E F
Sbjct: 15 LERSDENKSAKKRRILTDETSAKCDEDVKSESIIDEISGGPAKDDCQSLSKADAPVERGF 74
Query: 125 DMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLK 184
+SLVKLT +GL+LLTFPRE+ T ++VSNIFQ L SG LKSP+WCHRIFPIQATC+L
Sbjct: 75 VLSLVKLTGSGLVLLTFPRENPSVTADVVSNIFQCLESGILKSPLWCHRIFPIQATCLLI 134
Query: 185 EKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDT-KDSDVSALLD 243
EKEL+A VSKLVLQF+ND+QNKL+RP+KFAVGYNRRG EE K KD KDSD +LD
Sbjct: 135 EKELRAVVSKLVLQFINDKQNKLARPIKFAVGYNRRGIEETCMKNLKDNPKDSDPFPMLD 194
Query: 244 RNKCFTIVAAAVKEVVSDSAVDLKSPE 270
R+KCF +VA+A+K+ V +SAVDLKSPE
Sbjct: 195 RSKCFDVVASAIKDAVPESAVDLKSPE 221
>gi|449445272|ref|XP_004140397.1| PREDICTED: uncharacterized protein LOC101219243 [Cucumis sativus]
Length = 260
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 142/253 (56%), Positives = 172/253 (67%), Gaps = 23/253 (9%)
Query: 74 SKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHS------SVKTDTNTETDFDMS 127
SKRRKV D ++ S GG + H+ S K++ E +S
Sbjct: 24 SKRRKVSED--------------VDHRSVGGESSTDEHAKETSLISTKSEAKVEKCSPIS 69
Query: 128 LVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKE 187
LVKLTR+GLLL TF ++ SP+T+ IV +I QSL + +LKS WCHRIFPIQATC L E +
Sbjct: 70 LVKLTRSGLLLFTFTKDISPDTVYIVKDIMQSLEARTLKSLAWCHRIFPIQATCSLNEND 129
Query: 188 LQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSD-VSALLDRNK 246
LQ VSKLVL F+ND+ N LS PVKFA+GYNRRG EE + K K +DS V+ +L R+K
Sbjct: 130 LQGVVSKLVLHFMNDKGNILSHPVKFAIGYNRRGIEETEMK--KTFEDSSGVNVILGRDK 187
Query: 247 CFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRL 306
CF+IVAAAVK VVSD+ VDLKSPEL VLVELLP+SGLPS VVGVS+L LVTTKPRL
Sbjct: 188 CFSIVAAAVKGVVSDAIVDLKSPELCVLVELLPVSGLPSGSSVVGVSVLSNNLVTTKPRL 247
Query: 307 SIRALVSGTNAKN 319
I+AL S T AK+
Sbjct: 248 CIKALTSDTKAKS 260
>gi|4337179|gb|AAD18100.1| This gene is continued on the 5' end of BAC T12M14, partial
[Arabidopsis thaliana]
Length = 297
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/237 (51%), Positives = 152/237 (64%), Gaps = 25/237 (10%)
Query: 5 KEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHK- 63
+E E L PWEQHS +ISIPRFDY +PSSLL SHSGFL+TC IKREKSATKE +SIL K
Sbjct: 27 EEAETLTPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCNIKREKSATKEVMSILGKV 86
Query: 64 -------------------RLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGG 104
+ E N+ SK++KVC + G + +E D+
Sbjct: 87 SSQFSIISEAKYIGSMHEEKPEVLNSTA-SKKQKVCAQETE---EGGEKTVPLENDALQE 142
Query: 105 LQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGS 164
+N +K N E + MSLVKLT++GLLL TFP E+SPNT NIVS +FQS+ SG+
Sbjct: 143 TGENPNVEDLKL-ANEEHNSLMSLVKLTKSGLLLFTFPVENSPNTTNIVSRVFQSMESGA 201
Query: 165 LKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRG 221
LK+P+WCHRIFP+QATC L EKEL+ TVSKLV +FVND+ N LS+PVK + G
Sbjct: 202 LKAPIWCHRIFPVQATCGLTEKELRETVSKLVQRFVNDKDNTLSKPVKVRKSFRYIG 258
>gi|413938202|gb|AFW72753.1| hypothetical protein ZEAMMB73_044682 [Zea mays]
Length = 313
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 126/275 (45%), Positives = 170/275 (61%), Gaps = 14/275 (5%)
Query: 8 EELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILH----- 62
+ELKPWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAI IL
Sbjct: 36 QELKPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAIPILEEYHAN 95
Query: 63 ----KRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDT 118
+ E + K+RK+C++ G NS + G N K +
Sbjct: 96 CCVSEHAEPCDVKNAIKKRKLCSEASGSLEEAVTNGNSSSVSESIGSTGNTSSPQSKVND 155
Query: 119 NTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQ 178
N + ++SLVKL+R+GLL FP + + +++ I SL SG LKSP WCHRIFPIQ
Sbjct: 156 NVDRASNLSLVKLSRSGLLFFKFP-SGGVHVVEMLTEILHSLRSGKLKSPQWCHRIFPIQ 214
Query: 179 ATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDV 238
TC+L E++L ATVSKL L F + N+ +P+KFAV YNRRG +E + + S+
Sbjct: 215 ETCILSEEDLHATVSKLFLDFSRSKTNE-DKPIKFAVAYNRRGIDETEK---NSNEGSNQ 270
Query: 239 SALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSV 273
L+DR +CF +VAAAVK V +S VDL+SPE+++
Sbjct: 271 QTLMDREQCFKVVAAAVKSVAENSVVDLRSPEVTL 305
>gi|195623584|gb|ACG33622.1| hypothetical protein [Zea mays]
Length = 313
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 126/275 (45%), Positives = 170/275 (61%), Gaps = 14/275 (5%)
Query: 8 EELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILH----- 62
+ELKPWEQH+ VI++PR+DY + SLL RSHSGFL+TC IKREKSATKEAI IL
Sbjct: 36 QELKPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIKREKSATKEAIPILEEYHAN 95
Query: 63 ----KRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDT 118
+ E + K+RK+C++ G NS + G N K +
Sbjct: 96 CCVSEHAEPCDVKNAIKKRKLCSEASGSLEEAVTNGNSSSVSESIGSTGNTSSPQSKVND 155
Query: 119 NTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQ 178
N + ++SLVKL+R+GLL FP + + +++ I SL SG LKSP WCHRIFPIQ
Sbjct: 156 NVDRASNLSLVKLSRSGLLFFKFP-SGGVHVVEMLTEILHSLRSGKLKSPQWCHRIFPIQ 214
Query: 179 ATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDV 238
TC+L E++L ATVSKL L F + N+ +P+KFAV YNRRG +E + + S+
Sbjct: 215 ETCILSEEDLHATVSKLFLDFSRSKTNE-DKPIKFAVAYNRRGIDETEK---NSNEGSNQ 270
Query: 239 SALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSV 273
L+DR +CF +VAAAVK V +S VDL+SPE+++
Sbjct: 271 QTLMDREQCFKVVAAAVKSVAENSVVDLRSPEVTL 305
>gi|413938200|gb|AFW72751.1| hypothetical protein ZEAMMB73_044682 [Zea mays]
Length = 286
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 116/266 (43%), Positives = 158/266 (59%), Gaps = 23/266 (8%)
Query: 8 EELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLES 67
+ELKPWEQH+ VI++PR+DY + SLL RSHSGFL+TC IK K AI
Sbjct: 36 QELKPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPIK--PCDVKNAI--------- 84
Query: 68 SNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMS 127
K+RK+C++ G NS + G N K + N + ++S
Sbjct: 85 -------KKRKLCSEASGSLEEAVTNGNSSSVSESIGSTGNTSSPQSKVNDNVDRASNLS 137
Query: 128 LVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKE 187
LVKL+R+GLL FP + + +++ I SL SG LKSP WCHRIFPIQ TC+L E++
Sbjct: 138 LVKLSRSGLLFFKFP-SGGVHVVEMLTEILHSLRSGKLKSPQWCHRIFPIQETCILSEED 196
Query: 188 LQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKC 247
L ATVSKL L F + N+ +P+KFAV YNRRG +E + + S+ L+DR +C
Sbjct: 197 LHATVSKLFLDFSRSKTNE-DKPIKFAVAYNRRGIDETEK---NSNEGSNQQTLMDREQC 252
Query: 248 FTIVAAAVKEVVSDSAVDLKSPELSV 273
F +VAAAVK V +S VDL+SPE+++
Sbjct: 253 FKVVAAAVKSVAENSVVDLRSPEVTL 278
>gi|413938198|gb|AFW72749.1| hypothetical protein ZEAMMB73_044682 [Zea mays]
Length = 166
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/166 (53%), Positives = 118/166 (71%), Gaps = 4/166 (2%)
Query: 152 IVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPV 211
+++ I SL SG LKSP WCHRIFPIQ TC+L E++L ATVSKL L F + N+ +P+
Sbjct: 1 MLTEILHSLRSGKLKSPQWCHRIFPIQETCILSEEDLHATVSKLFLDFSRSKTNE-DKPI 59
Query: 212 KFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPEL 271
KFAV YNRRG +E + + S+ L+DR +CF +VAAAVK V +S VDL+SPE+
Sbjct: 60 KFAVAYNRRGIDETEKN---SNEGSNQQTLMDREQCFKVVAAAVKSVAENSVVDLRSPEV 116
Query: 272 SVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNA 317
+VLVE+LP+SG+P V GVS+LP +L++TKPRL +R+LV A
Sbjct: 117 AVLVEMLPVSGVPLGSSVAGVSVLPAELISTKPRLCVRSLVPDAKA 162
>gi|388516073|gb|AFK46098.1| unknown [Lotus japonicus]
Length = 212
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 64/105 (60%), Positives = 82/105 (78%), Gaps = 3/105 (2%)
Query: 212 KFAVGYNRRGFEEKQNKIPKDTKD-SDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPE 270
KFAVGYNRRG E+ K K+ + S+ +LLDRNKCF++VA+AV +VV DS VDL+SPE
Sbjct: 103 KFAVGYNRRGIED--TKFAKENSNGSNAFSLLDRNKCFSVVASAVNDVVEDSVVDLRSPE 160
Query: 271 LSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGT 315
LS+LVELLP+S +P+ LVV VS LP+ LV+TKPRL ++AL S T
Sbjct: 161 LSILVELLPLSRVPNGSLVVAVSALPRNLVSTKPRLCVKALTSNT 205
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 79/143 (55%), Gaps = 13/143 (9%)
Query: 6 EEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL 65
E+ + PWEQHS VI + R+DYN+P+SLL S+SGFL+TCTIKREKSATKEA+SIL K
Sbjct: 46 EKAMMTPWEQHSGVIKLSRYDYNAPASLLLHSYSGFLITCTIKREKSATKEALSILEKFA 105
Query: 66 ESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFD 125
N RR + + ++G+ S+ L +N+C S V + N +
Sbjct: 106 VGYN------RRGIEDTKFAKENSNGSNAFSL-------LDRNKCFSVVASAVNDVVEDS 152
Query: 126 MSLVKLTRNGLLLLTFPREHSPN 148
+ ++ +L+ P PN
Sbjct: 153 VVDLRSPELSILVELLPLSRVPN 175
>gi|168030223|ref|XP_001767623.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681152|gb|EDQ67582.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 166/372 (44%), Gaps = 89/372 (23%)
Query: 10 LKPWEQHSRVISIPRFDYNSPS-SLLQRSHSGFLLTCTIK-----------------REK 51
LKPW+QH VI++PR+ Y+S + LL RS GFL+TC + REK
Sbjct: 27 LKPWDQHGGVINMPRYLYSSDAVPLLHRS--GFLITCAYRANIAKTWDDFVELRRVGREK 84
Query: 52 SATKEAISILHKRLESSNTAGDSKR----------------------------------- 76
SATKEA+ IL + L+ + G +KR
Sbjct: 85 SATKEAVEILREFLDFTKPNGGAKRARTESYSGSTQQNSLGLTKIEAQLRSPEERDTKVA 144
Query: 77 ---RKVCTDDMGGKCADGAEINSIEEDSAGGLQKNEC---HSSVKTDTNTETDFDMS--- 127
R+ + + GK DG+ + +E S ++KN H+ T ++ +T S
Sbjct: 145 EVERQSSSTGITGKL-DGS---NTDEGSPTAMEKNGFGMDHAETFTLSDRDTGLGCSSAP 200
Query: 128 -------LVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQAT 180
LVKL RNG++ ++ P + + I S + +P + +
Sbjct: 201 DMKGEFRLVKLARNGMVYISIPCRSAEDLFTSFIKIINDFHSRTRAAPR-SEKPINFEHR 259
Query: 181 CVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSA 240
CV L + + N + S ++AV N+RG E+K+ P T ++
Sbjct: 260 CVSFPVRLF-----IFWEASNGMLHAHSGSSQYAVAVNKRGLEDKEK--PGVT----TAS 308
Query: 241 LLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLV 300
+ + +C VA A++ + +VDL +P++ V++ELLP+ + + V VS+LP++LV
Sbjct: 309 VFGKPECIHAVAEAIQSLTEYISVDLTNPQMVVMLELLPLVRVGTS--VCAVSVLPRELV 366
Query: 301 TTKPRLSIRALV 312
TKP+L I+AL
Sbjct: 367 LTKPKLVIQALA 378
>gi|302773570|ref|XP_002970202.1| hypothetical protein SELMODRAFT_441087 [Selaginella moellendorffii]
gi|300161718|gb|EFJ28332.1| hypothetical protein SELMODRAFT_441087 [Selaginella moellendorffii]
Length = 812
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 97/177 (54%), Gaps = 18/177 (10%)
Query: 126 MSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKE 185
SL+KL+ +G++ L+ P + + ++++S + + + K WC+R+ P+QATC +E
Sbjct: 154 FSLIKLSGSGIVFLSTPDDT--DVVSVLSRVLDDIRAPHTKPFSWCNRLTPVQATCASEE 211
Query: 186 KELQATVSKLVLQFVNDEQNKLSRP-VKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDR 244
+ + TV L+ +K R +K+AVG+N R E+ + +R
Sbjct: 212 EAIFVTVLGLLRDDAGGIVSKTGRSNLKYAVGFNSRAAEDGNTGV-------------ER 258
Query: 245 NKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVT 301
+ C VA A++E ++V+L +PE+ V VEL+P+ G + L G+S+LP+++V
Sbjct: 259 DGCIAAVARAMQEFEQQASVNLSNPEIVVAVELIPLVGKTTPL--AGISLLPKEVVV 313
>gi|302793202|ref|XP_002978366.1| hypothetical protein SELMODRAFT_443858 [Selaginella moellendorffii]
gi|300153715|gb|EFJ20352.1| hypothetical protein SELMODRAFT_443858 [Selaginella moellendorffii]
Length = 742
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 96/175 (54%), Gaps = 18/175 (10%)
Query: 126 MSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKE 185
SL+KL+ +G++ L+ P + + ++++S + + + K WC+R+ P+QATC +E
Sbjct: 70 FSLIKLSGSGIVFLSTPDD--TDVVSVLSRVLDDIRAPHTKPFSWCNRLTPVQATCASEE 127
Query: 186 KELQATVSKLVLQFVNDEQNKLSRP-VKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDR 244
+ + TV L+ +K R +K+AVG+N R E+ + +R
Sbjct: 128 EAIFVTVLGLLRDDAGGIVSKTGRSNLKYAVGFNSRAAEDGNTGV-------------ER 174
Query: 245 NKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKL 299
+ C VA A++E ++V+L +PE+ V VEL+P+ G + L G+S+LP+++
Sbjct: 175 DGCIAAVARAMQEFEQQASVNLSNPEIVVAVELIPLVGKTTPL--AGISLLPKEV 227
>gi|224108173|ref|XP_002314747.1| predicted protein [Populus trichocarpa]
gi|222863787|gb|EEF00918.1| predicted protein [Populus trichocarpa]
Length = 79
Score = 80.5 bits (197), Expect = 9e-13, Method: Composition-based stats.
Identities = 32/50 (64%), Positives = 43/50 (86%)
Query: 6 EEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATK 55
E+E +KPWEQH+ VI++PRFDYN+PS+LL SHSGFL+TC+IK E+ T+
Sbjct: 27 EKERMKPWEQHAGVINMPRFDYNAPSALLHHSHSGFLITCSIKIERPTTQ 76
>gi|413938197|gb|AFW72748.1| hypothetical protein ZEAMMB73_044682 [Zea mays]
Length = 80
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/79 (56%), Positives = 62/79 (78%), Gaps = 2/79 (2%)
Query: 242 LDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVT 301
+DR +CF +VAAAVK V +S VDL+SPE++VLVE+LP+SG+P V GVS+LP +L++
Sbjct: 1 MDREQCFKVVAAAVKSVAENSVVDLRSPEVAVLVEMLPVSGVPLGSSVAGVSVLPAELIS 60
Query: 302 TKPRLSIRALVSGTNAKNG 320
TKPRL +R+LV +AK G
Sbjct: 61 TKPRLCVRSLV--PDAKAG 77
>gi|195604606|gb|ACG24133.1| hypothetical protein [Zea mays]
Length = 112
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/69 (57%), Positives = 56/69 (81%)
Query: 242 LDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVT 301
+DR +CF +VAAAVK V +S VDL+SPE++VLVE+LP+SG+P V GVS+LP +L++
Sbjct: 1 MDREQCFKVVAAAVKSVAENSVVDLRSPEVAVLVEMLPVSGVPLGSSVAGVSVLPAELIS 60
Query: 302 TKPRLSIRA 310
TKPRL +R+
Sbjct: 61 TKPRLCVRS 69
>gi|413938199|gb|AFW72750.1| hypothetical protein ZEAMMB73_044682 [Zea mays]
Length = 102
Score = 67.0 bits (162), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/40 (67%), Positives = 34/40 (85%)
Query: 8 EELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTI 47
+ELKPWEQH+ VI++PR+DY + SLL RSHSGFL+TC I
Sbjct: 36 QELKPWEQHAAVINLPRYDYRASGSLLLRSHSGFLITCPI 75
>gi|302834746|ref|XP_002948935.1| hypothetical protein VOLCADRAFT_89304 [Volvox carteri f.
nagariensis]
gi|300265680|gb|EFJ49870.1| hypothetical protein VOLCADRAFT_89304 [Volvox carteri f.
nagariensis]
Length = 576
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 93/225 (41%), Gaps = 59/225 (26%)
Query: 148 NTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKL 207
+ + +V ++ L SG+L+ P C RI P+ ATC+L + A V++ F
Sbjct: 303 DPVRVVESMLADLESGTLQPPKHCQRIVPLDATCLLTPSGIAAAVAEAAAAFKRRRYKST 362
Query: 208 SR-------------------PVKFAVGYN----------------------------RR 220
+R P +A+ Y+ R
Sbjct: 363 ARCTDSNDGNGGGGGDAAPVEPFSYAISYHSRFTESLPQSAAAGADGGNSTYAGNGAAHR 422
Query: 221 GFEEKQNKIPKDTKDSD------VSALLDRNKCFTIVA-AAVKEVVSDSA-VDLKSPELS 272
G +E+++ +P V LLDRN+ ++ A V D A V+LK P+++
Sbjct: 423 GGQEQEHALPPLPPPPQQQGQEVVDVLLDRNQIISLAARGMVDAFTGDVARVNLKKPQVA 482
Query: 273 VLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNA 317
V VE LP+ G G+++LP+ + +K +L ++ALV N+
Sbjct: 483 VRVEALPVGG----RQFAGLTLLPEHMFVSKGKLVVKALVKNANS 523
>gi|108864520|gb|ABG22531.1| THUMP domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|215767589|dbj|BAG99817.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 365
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 67/162 (41%), Gaps = 24/162 (14%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG + + R+ P + IV N+ S S + R P + TC E+E+ +
Sbjct: 223 NGCIFIQMHKRDGDPGPVEIVQNMMSSAASTRKHMSRFILRFLPTEVTCYASEEEITKAI 282
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
S LV ++ E + + KFAV Y R + +DR K I+
Sbjct: 283 SPLVEKYFPKESSSV---YKFAVLYEARS-----------------NTGIDRMK---IIN 319
Query: 253 AAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSI 294
A K V VDL SP+ +++V++ I G + + G I
Sbjct: 320 AVAKSVPQPHKVDLSSPDRTIIVQIAKIFGENQQGVCFGRCI 361
>gi|302814406|ref|XP_002988887.1| hypothetical protein SELMODRAFT_427525 [Selaginella moellendorffii]
gi|300143458|gb|EFJ10149.1| hypothetical protein SELMODRAFT_427525 [Selaginella moellendorffii]
Length = 296
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/264 (21%), Positives = 108/264 (40%), Gaps = 74/264 (28%)
Query: 37 SHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINS 96
GFL+TC RE+ ++E+I++L + E D+GG I+
Sbjct: 28 GRQGFLITCDGGRERQCSRESINLLERYFE----------------DLGGV------ISV 65
Query: 97 IEEDSAGGLQKNECHSSVKTDTNTETDFDMSL----VKLTR------------------- 133
E+++A G Q ++ K+ +E +FD L V+L
Sbjct: 66 AEDENAEGEQ-----AAAKSSDASEKNFDELLKEEIVELRDKKNDDWKIQARFVSSETGC 120
Query: 134 NGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVS 193
NG++ + + S + + +I + + S +C ++ P++ TC +E++
Sbjct: 121 NGVVFIEMVK-ASIGPVELAESIVRMVASTKKSRTRFCMKLLPMEVTCYASAEEVKIAAQ 179
Query: 194 KLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAA 253
L+L+ E + KFAV Y R + LDR T++ A
Sbjct: 180 PLILKHFPAEA---AEGTKFAVVYEARA-----------------NTGLDR---MTLIDA 216
Query: 254 AVKEVVSDSAVDLKSPELSVLVEL 277
+ V AVDLK+P+ +++V++
Sbjct: 217 VAQLVPKPHAVDLKAPQKTIIVQV 240
>gi|440800230|gb|ELR21269.1| THUMP domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 333
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 111/295 (37%), Gaps = 70/295 (23%)
Query: 23 PRFDYNS---PSSLLQRSHS--GFLLTCTIKREKSATKEAISILH---KRLESSNTAGDS 74
PR D +S ++L R + GF++TC + RE A + + +L+ ++
Sbjct: 34 PRLDRSSLKKHATLFNRGQARHGFVVTCAVHRENHALAQVLQLLNHYARKFYPEEVERHL 93
Query: 75 KRRKVCTDDMGGKCADGAEINSIEE-----DSAGGLQKNECHSSVKTDTNTETDFDMSLV 129
D GG AD + E DSA L+ + +++ D+ ET
Sbjct: 94 NPPAPTEGDAGGTAADARPDGTAAEEPSSADSAETLKLADSNTAAAADSKEETGTSHGPT 153
Query: 130 KLTRNGLLLLTFPREHSPNTINIVSN------IFQSLGSGSLKSPV-------------- 169
+ PRE +P NI + F S+ S ++ PV
Sbjct: 154 PPSDK-------PRERAPKLFNIFDSGGISGLFFVSVNSPDIR-PVEFAHRIFTDPDFPQ 205
Query: 170 ------WCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFE 223
CHR+ PI+ C E ++ + ++ E + P+KFAV +++R +
Sbjct: 206 QAAHSDACHRLVPIERVCAASEADIVSAAQAMLPDHFPSETDA---PLKFAVVFDKRNND 262
Query: 224 EKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELL 278
++R + +AA+V V S VDL +P VLV +
Sbjct: 263 R-----------------VERMPVVSQIAASVPPVHS---VDLSTPHKVVLVYIF 297
>gi|218185953|gb|EEC68380.1| hypothetical protein OsI_36525 [Oryza sativa Indica Group]
Length = 374
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 61/145 (42%), Gaps = 24/145 (16%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG + + R+ P + IV N+ S S + R P + TC E+E+ +
Sbjct: 223 NGCIFIQMHKRDGDPGPVEIVQNMMSSAASTRKHMSRFILRFLPTEVTCYASEEEITKAI 282
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
S LV ++ E + + KFAV Y R + +DR K I+
Sbjct: 283 SPLVEKYFPKESSSV---YKFAVLYEARS-----------------NTGIDRMK---IIN 319
Query: 253 AAVKEVVSDSAVDLKSPELSVLVEL 277
A K V VDL SP+ +++V++
Sbjct: 320 AVAKSVPQPHKVDLSSPDRTIIVQI 344
>gi|115485943|ref|NP_001068115.1| Os11g0568600 [Oryza sativa Japonica Group]
gi|108864519|gb|ABA94316.2| THUMP domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113645337|dbj|BAF28478.1| Os11g0568600 [Oryza sativa Japonica Group]
gi|215768400|dbj|BAH00629.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222616171|gb|EEE52303.1| hypothetical protein OsJ_34303 [Oryza sativa Japonica Group]
Length = 374
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 61/145 (42%), Gaps = 24/145 (16%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG + + R+ P + IV N+ S S + R P + TC E+E+ +
Sbjct: 223 NGCIFIQMHKRDGDPGPVEIVQNMMSSAASTRKHMSRFILRFLPTEVTCYASEEEITKAI 282
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
S LV ++ E + + KFAV Y R + +DR K I+
Sbjct: 283 SPLVEKYFPKESSSV---YKFAVLYEARS-----------------NTGIDRMK---IIN 319
Query: 253 AAVKEVVSDSAVDLKSPELSVLVEL 277
A K V VDL SP+ +++V++
Sbjct: 320 AVAKSVPQPHKVDLSSPDRTIIVQI 344
>gi|302761510|ref|XP_002964177.1| hypothetical protein SELMODRAFT_405908 [Selaginella moellendorffii]
gi|300167906|gb|EFJ34510.1| hypothetical protein SELMODRAFT_405908 [Selaginella moellendorffii]
Length = 295
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/264 (21%), Positives = 107/264 (40%), Gaps = 74/264 (28%)
Query: 37 SHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINS 96
GFL+TC RE+ ++E I++L + E D+GG I+
Sbjct: 28 GRQGFLITCDGGRERQCSRECINLLERYFE----------------DLGGV------ISV 65
Query: 97 IEEDSAGGLQKNECHSSVKTDTNTETDFDMSL----VKLTR------------------- 133
E+++A G Q ++ K+ +E +FD L V+L
Sbjct: 66 AEDENAEGEQ-----AAAKSSDASEKNFDELLKEEIVELRDKKNDDWKIQARFVSSETGC 120
Query: 134 NGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVS 193
NG++ + + S + + +I + + S +C ++ P++ TC +E++
Sbjct: 121 NGVVFIEMVK-ASIGPVELAESIVRMVASTKKSRTRFCMKLLPMEVTCYASAEEVKIAAQ 179
Query: 194 KLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAA 253
L+L+ E + KFAV Y R + LDR T++ A
Sbjct: 180 PLILKHFPAEA---AEGTKFAVVYEARA-----------------NTGLDR---MTLIDA 216
Query: 254 AVKEVVSDSAVDLKSPELSVLVEL 277
+ V +VDLK+P+ +++V++
Sbjct: 217 VAQLVPKPHSVDLKAPQKTIIVQV 240
>gi|18416857|ref|NP_568274.1| THUMP domain-containing protein [Arabidopsis thaliana]
gi|14586371|emb|CAC42902.1| putative protein [Arabidopsis thaliana]
gi|22022558|gb|AAM83236.1| At5g12410 [Arabidopsis thaliana]
gi|24797044|gb|AAN64534.1| At5g12410/At5g12410 [Arabidopsis thaliana]
gi|332004423|gb|AED91806.1| THUMP domain-containing protein [Arabidopsis thaliana]
Length = 376
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 63/146 (43%), Gaps = 24/146 (16%)
Query: 134 NGLLLLTFP-REHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NGL+ + R+ P+ IV + S + + R+ PI+ +C E+E+ +
Sbjct: 227 NGLVFIQMKKRDGDPSPKEIVQHAMTSAAATKKHMSRFILRLLPIEVSCYPSEEEISRAI 286
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
LV Q+ E P KFAV Y R + LDR K +A
Sbjct: 287 KPLVEQYFPIE---TENPRKFAVLYGAR-----------------ANTGLDRMKIINTIA 326
Query: 253 AAVKEVVSDSAVDLKSPELSVLVELL 278
K + + VDL +PE++++VE++
Sbjct: 327 ---KSIPAPHKVDLSNPEMTIVVEII 349
>gi|297811371|ref|XP_002873569.1| THUMP domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297319406|gb|EFH49828.1| THUMP domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 362
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 63/146 (43%), Gaps = 24/146 (16%)
Query: 134 NGLLLLTFPR-EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NGL+ + R + P+ +I + S + + R+ PI+ +C E+E+ +
Sbjct: 213 NGLVFIQMKRRDGDPSPKDIAQHAMTSAAATKKHMSRFILRLLPIEVSCYPSEEEISRAI 272
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
LV Q+ E P KFAV Y R + LDR K +A
Sbjct: 273 KPLVEQYFPIE---TENPRKFAVLYGAR-----------------ANTGLDRMKIINTIA 312
Query: 253 AAVKEVVSDSAVDLKSPELSVLVELL 278
K + + VDL +PE+S++VE++
Sbjct: 313 ---KSIPAPHKVDLSNPEMSIVVEVV 335
>gi|225436571|ref|XP_002278600.1| PREDICTED: uncharacterized protein LOC100244787 [Vitis vinifera]
gi|297734958|emb|CBI17192.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 65/146 (44%), Gaps = 24/146 (16%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG++ + ++ P+ IV ++ S S + R+ P++ATC E+E+ +
Sbjct: 243 NGVVFVQMRKKDGDPSPKEIVQHMMTSAASTRKHMSRFILRVLPVEATCYASEEEISIAI 302
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
LV Q+ E P KFAV Y R + +DR K I+
Sbjct: 303 KPLVEQYFPVE---TQNPKKFAVLYEARS-----------------NTGIDRMK---IIN 339
Query: 253 AAVKEVVSDSAVDLKSPELSVLVELL 278
+ K V VDL +P+++++V+++
Sbjct: 340 SVAKSVPGPHKVDLSNPDMTIVVQIV 365
>gi|328777951|ref|XP_003249423.1| PREDICTED: THUMP domain-containing protein 1-like, partial [Apis
mellifera]
Length = 254
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 52/250 (20%), Positives = 102/250 (40%), Gaps = 38/250 (15%)
Query: 34 LQRSHSGFLLTCTIKREKSATKEAISILHK---RLESSNTAGDSKRRKVCTDDMGGKCAD 90
L+ GFL TC EK ++A IL++ + ++ DS + D+ D
Sbjct: 7 LEPGMKGFLCTCNFS-EKECVRDAYKILNEFADEIYGLDSIKDSNNENIKVSDIK---KD 62
Query: 91 GAEINSIEEDSAGGLQK--NECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPN 148
+ N E+D + L K NE + N ++ G+ + F R N
Sbjct: 63 ETKYNDNEDDISVVLNKEINELKAEYSKPINARR------FQVIDTGVKNIVFIRSSLTN 116
Query: 149 TINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLS 208
+ +V+ I L + + + R+ PI+ C ++++ ++ ++ E
Sbjct: 117 PLELVTKIITELYNTKQQRTRYLLRLLPIEVICKANMNDIKSKADAMLEKYFAQE----- 171
Query: 209 RPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKS 268
P F++ +NR N I +D D++ ++++ + DLK+
Sbjct: 172 -PKTFSIVFNRHS----NNNIYRDEIIEDLAEIINKKN-------------PGNKADLKN 213
Query: 269 PELSVLVELL 278
PEL+V+VE++
Sbjct: 214 PELAVIVEMV 223
>gi|356497337|ref|XP_003517517.1| PREDICTED: uncharacterized protein LOC100776466 [Glycine max]
Length = 384
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 66/148 (44%), Gaps = 30/148 (20%)
Query: 134 NGLLLLTFPR---EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQA 190
NG++ + + + SP +IV I S S + RI PI+ +C ++E+
Sbjct: 234 NGVVFVQMRKKDGDRSPK--DIVHRIVTSAASTGKHMSRFILRILPIEVSCYASKEEISR 291
Query: 191 TVSKLVLQFVNDE-QNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFT 249
T+ LV Q+ E QN P+KFAV Y R + +DR
Sbjct: 292 TIKPLVEQYFPVETQN----PLKFAVLYEAR-----------------ANTGVDR---ME 327
Query: 250 IVAAAVKEVVSDSAVDLKSPELSVLVEL 277
I+ A K V VDLK+P+ +++VE+
Sbjct: 328 IIDAVAKSVPGPHKVDLKNPDKTIVVEI 355
>gi|226498966|ref|NP_001148536.1| LOC100282152 [Zea mays]
gi|195620092|gb|ACG31876.1| THUMP domain containing protein [Zea mays]
Length = 396
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 61/145 (42%), Gaps = 24/145 (16%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG + + R P + IV N+ S + R+ P++ C E+E+ +
Sbjct: 244 NGCIFIQMHKRAGDPGPVEIVQNMMSSAALTRKHMSRFILRVLPVEVACYASEEEITKAI 303
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
S L+ ++ E S KFAV Y R + +DR K I+
Sbjct: 304 SPLIEKYFPKE---CSSGHKFAVLYEARS-----------------NTGIDRMK---IIN 340
Query: 253 AAVKEVVSDSAVDLKSPELSVLVEL 277
AA K V VDLK+P+ +++V++
Sbjct: 341 AAAKSVPQPHKVDLKNPDKTIVVQI 365
>gi|413925319|gb|AFW65251.1| THUMP domain containing protein [Zea mays]
Length = 399
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 61/145 (42%), Gaps = 24/145 (16%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG + + R P + IV N+ S + R+ P++ C E+E+ +
Sbjct: 247 NGCIFIQMHKRAGDPGPVEIVQNMMSSAALTRKHMSRFILRVLPVEVACYASEEEITKAI 306
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
S L+ ++ E S KFAV Y R + +DR K I+
Sbjct: 307 SPLIEKYFPKE---CSSGHKFAVLYEARS-----------------NTGIDRMK---IIN 343
Query: 253 AAVKEVVSDSAVDLKSPELSVLVEL 277
AA K V VDLK+P+ +++V++
Sbjct: 344 AAAKSVPQPHKVDLKNPDKTIVVQI 368
>gi|255087744|ref|XP_002505795.1| hypothetical protein MICPUN_112716 [Micromonas sp. RCC299]
gi|226521065|gb|ACO67053.1| hypothetical protein MICPUN_112716 [Micromonas sp. RCC299]
Length = 353
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 54/279 (19%), Positives = 111/279 (39%), Gaps = 45/279 (16%)
Query: 21 SIPRFDYNSPSSLLQR-SHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKV 79
S+ R++Y +P ++++ GF++TC+ +REKSAT+EA +L RL ++ A
Sbjct: 54 SLERYEYAAPDAIVRGLGCVGFVVTCSFQREKSATREATEMLRPRLPTTPLA-------- 105
Query: 80 CTDDMGGKCADGAEINSIEEDSAGGL-----QKNECHSSVKTDTNTETDFDMSLVKLTRN 134
G A G +N ++ G + ++ + ++ + TR
Sbjct: 106 -----FGAAAKGLRLNPVKMPGRGFVLIRMSERPDPDAAAADAKGRDDPHPRPDDDRTRA 160
Query: 135 GLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSK 194
+ + + +V+ + +G +P + ++ P+ TC + A
Sbjct: 161 TV---------AAEVLGVVTRTVADVKAGRCPAPKFVEKMTPVTVTCAFDGAAIDAGAVA 211
Query: 195 LVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALL--------DRNK 246
+ D + R V FAV Y+ R F+ + + KD+ + R +
Sbjct: 212 ALAASGFDAKG---RDVSFAVSYHNR-FKTQGDPTVGAGKDAVDAGDTAATAAERHGRRE 267
Query: 247 CFTIVAAAVK-----EVVSDSAVDLKSPELSVLVELLPI 280
VA V+ + +VDL+ P++ E++ +
Sbjct: 268 VIEAVAGGVRAALTAAGATGLSVDLRDPDVVAFAEVISV 306
>gi|388518055|gb|AFK47089.1| unknown [Medicago truncatula]
Length = 378
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 30/148 (20%)
Query: 134 NGLLLLTFPR---EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQA 190
NG++ + + + SPN IV+ I S S + RI PI+ +C ++E+
Sbjct: 228 NGVVFIQMRKKDGDKSPN--KIVNRIVTSAASTRKHMSRFILRILPIEVSCYASKEEISK 285
Query: 191 TVSKLVLQFVNDE-QNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFT 249
+ LV Q+ E QN P KFAV Y R + +DR
Sbjct: 286 AIQPLVEQYFPVETQN----PQKFAVMYEARA-----------------NTGVDR---ME 321
Query: 250 IVAAAVKEVVSDSAVDLKSPELSVLVEL 277
I+ A K + + VDL +P+ +++VE+
Sbjct: 322 IIDAVAKSIPAPHKVDLSNPDRTIIVEI 349
>gi|357481361|ref|XP_003610966.1| THUMP domain containing protein [Medicago truncatula]
gi|355512301|gb|AES93924.1| THUMP domain containing protein [Medicago truncatula]
Length = 378
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 30/148 (20%)
Query: 134 NGLLLLTFPR---EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQA 190
NG++ + + + SPN IV+ I S S + RI PI+ +C ++E+
Sbjct: 228 NGVVFIQMRKKDGDKSPN--KIVNRIVTSAASTRKHMSRFILRILPIEVSCYASKEEISK 285
Query: 191 TVSKLVLQFVNDE-QNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFT 249
+ LV Q+ E QN P KFAV Y R + +DR
Sbjct: 286 AIQPLVEQYFPVETQN----PQKFAVMYEARA-----------------NTGVDR---ME 321
Query: 250 IVAAAVKEVVSDSAVDLKSPELSVLVEL 277
I+ A K + + VDL +P+ +++VE+
Sbjct: 322 IIDAVAKSIPAPHKVDLSNPDKTIIVEI 349
>gi|388507080|gb|AFK41606.1| unknown [Medicago truncatula]
Length = 378
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 30/148 (20%)
Query: 134 NGLLLLTFPR---EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQA 190
NG++ + + + SPN IV+ I S S + RI PI+ +C ++E+
Sbjct: 228 NGVVFIQMRKKDGDKSPN--KIVNRIVTSAASTRKHMSRFILRILPIEVSCYASKEEISK 285
Query: 191 TVSKLVLQFVNDE-QNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFT 249
+ LV Q+ E QN P KFAV Y R + +DR
Sbjct: 286 AIQPLVEQYFPVETQN----PQKFAVMYEARA-----------------NTGVDR---ME 321
Query: 250 IVAAAVKEVVSDSAVDLKSPELSVLVEL 277
I+ A K + + VDL +P+ +++VE+
Sbjct: 322 IIDAVAKSIPAPHKVDLSNPDKTIIVEI 349
>gi|322785995|gb|EFZ12611.1| hypothetical protein SINV_11518 [Solenopsis invicta]
Length = 316
Score = 44.7 bits (104), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 113/270 (41%), Gaps = 38/270 (14%)
Query: 15 QHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDS 74
+++R I+ R DY +LQ GFL TC +E+ EA +L+ TA D
Sbjct: 5 RNNRYIA-NRNDYKKRKFMLQPGMMGFLCTCNY-QERGCVTEAYRLLN-LFADEKTASDM 61
Query: 75 KRRKV--CTDDMGGKCADG--AEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVK 130
+ V T+ + K D ++ +ED + L+K + TE+ + K
Sbjct: 62 NKESVPSTTNVLAKKEVDKLLENCDAEDEDISTALEKE------IDELRTESKMPLFSRK 115
Query: 131 LTR--NGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKEL 188
G+ + F PN + +V I L + + + R+ PI+ C +
Sbjct: 116 FQEVDTGVKNMIFIASTLPNPLELVMKIVTKLDTTKEQCTRYLLRLLPIEVVCKAYMDNI 175
Query: 189 QATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCF 248
+ S+L ++ E P F++ +N R N I +D D++ ++
Sbjct: 176 KTKASELFEKYFAQE------PKTFSIVFNHRS----NNSIKRDEIIEDLAEII------ 219
Query: 249 TIVAAAVKEVVSDSAVDLKSPELSVLVELL 278
+K+ + A DLK+P+++V+VE++
Sbjct: 220 ------LKKNPGNKA-DLKNPDIAVIVEVI 242
>gi|307208989|gb|EFN86189.1| THUMP domain-containing protein 1-like protein [Harpegnathos
saltator]
Length = 317
Score = 44.3 bits (103), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 107/252 (42%), Gaps = 41/252 (16%)
Query: 39 SGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKV--CTDDMGGKCADGAEIN- 95
+GFL TC EK +A +L++ + + T+ K ++ + K AD + +
Sbjct: 36 TGFLCTCNFN-EKGCITDAYKLLNQFADENTTSEMIKESEISDTSSTFNKKEADKSSSDL 94
Query: 96 SIEEDSAGGLQKNECHSSVKTDTNTETDFDMSL----VKLTRNGLLLLTFPREHSPNTIN 151
EED + L+K + D +T +DM L ++ G+ + F R +
Sbjct: 95 DEEEDISTALEK-------EID-ELKTKYDMPLSSRRFQVVDTGVKNMIFVRSTLSKPLE 146
Query: 152 IVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPV 211
+V+NI L + + + R+ PI+ C +++ S L ++ + E P
Sbjct: 147 LVTNIVTELDTSKKQCTRYLLRLLPIEVVCKAYMDDIRTKASILFEKYFSQE------PK 200
Query: 212 KFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPEL 271
F++ +NR N I RN+ +A + + + DLK+PE+
Sbjct: 201 TFSIVFNRHS----NNSIK-------------RNEIIEDLAEIISKKNPGNKADLKNPEI 243
Query: 272 SVLVELLPISGL 283
+V+VE+ I GL
Sbjct: 244 AVVVEV--IRGL 253
>gi|330793640|ref|XP_003284891.1| hypothetical protein DICPUDRAFT_148721 [Dictyostelium purpureum]
gi|325085200|gb|EGC38612.1| hypothetical protein DICPUDRAFT_148721 [Dictyostelium purpureum]
Length = 323
Score = 44.3 bits (103), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 106/247 (42%), Gaps = 57/247 (23%)
Query: 37 SHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINS 96
S +G + T REK A+K+ +L + ++ ++K +DD A AE+
Sbjct: 72 SGNGLIFTFGNGREKQASKDINKVLDEYIDKFKDIAAETKKKANSDDFDFDSAFQAELQQ 131
Query: 97 IEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTR--NGLLLLTFPREHSPNTINIVS 154
I+E S+G K VK T NG+ ++ +P I + S
Sbjct: 132 IKE-SSGSKSK--------------------YVKYTLKCNGVAYMSLNSGINP--IELTS 168
Query: 155 NIFQ-SLGSGSLKSPVWCHRIFPIQATCVLKE--KELQATVSKLVLQFVNDEQNKLSRPV 211
IF+ + + +LKS +RI PIQ TC L +E Q+ LV Q+ N + ++
Sbjct: 169 KIFKDAQDTKALKSK-EINRIIPIQKTCHLSNIFEETQS----LVNQYFNVD----NKVY 219
Query: 212 KFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPEL 271
K+ + + R NKI K +++ L+D VDL +PEL
Sbjct: 220 KYKIEFKSRM----NNKINKVEYIQELAKLVD----------------PKHIVDLDNPEL 259
Query: 272 SVLVELL 278
+++VE++
Sbjct: 260 TIIVEIV 266
>gi|363749795|ref|XP_003645115.1| hypothetical protein Ecym_2582 [Eremothecium cymbalariae
DBVPG#7215]
gi|356888748|gb|AET38298.1| Hypothetical protein Ecym_2582 [Eremothecium cymbalariae
DBVPG#7215]
Length = 296
Score = 44.3 bits (103), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 90/218 (41%), Gaps = 32/218 (14%)
Query: 17 SRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKR 76
S V + R Y + L + SG TCT + EK A E +S+L ++LE A + K
Sbjct: 15 SEVKTNARKKYKVAQTTLDPNTSGIYATCTRRHEKQAGNELMSLLEEKLEY--YAQELK- 71
Query: 77 RKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGL 136
D G + + +E + EE S KNE E + +LV T+
Sbjct: 72 -----DINGAELDNPSENSKSEELSIEDQVKNELE---------ELRNNATLVDRTKKKP 117
Query: 137 LLLTFPRE---------HSP-NTINIVSNIFQSLGSGSL--KSPVWCHRIFPIQATCVLK 184
+L P E P N N V I Q L S ++ K + ++ PI ++C
Sbjct: 118 ILQQIPLESECMVFFKTRKPINPENFVLRIIQELASPTMNSKRTRYIQKLTPITSSCNAS 177
Query: 185 EKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGF 222
+EL +++ + E+ S KFA+ N+R F
Sbjct: 178 LEELTKLCQRVLPPHFHQEE---STAYKFAIEVNKRNF 212
>gi|303289096|ref|XP_003063836.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454904|gb|EEH52209.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 365
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 20 ISIPRFDYNSPSSLLQRSHS--GFLLTCTIKREKSATKEAISIL 61
+S+ R++Y +P+ +L + GF +TC REKSATKEA+++L
Sbjct: 61 VSLERYEYAAPAVILSGNGGACGFAITCQFMREKSATKEAMALL 104
>gi|332023253|gb|EGI63508.1| THUMP domain-containing protein 1-like protein [Acromyrmex
echinatior]
Length = 282
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 58/262 (22%), Positives = 103/262 (39%), Gaps = 63/262 (24%)
Query: 33 LLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGA 92
+L+ +GFL TC E+ EA +L N D K +
Sbjct: 30 VLEPGMTGFLCTCNF-HERGCITEAYRLL-------NLFADEK----------------S 65
Query: 93 EINSIEEDSAGGLQKNECHSSV------------KTDTNTETDFDMSLV----KLTRNGL 136
EI+ E S L K E SV K T+ +MSL ++ G+
Sbjct: 66 EIHKESETSTNALHKKETDKSVDDNDDDISVALEKEINELRTEREMSLYSRKFQVVDTGV 125
Query: 137 LLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLV 196
+ F PN + +V+ I L + + + R+ PI+ C ++ ++L
Sbjct: 126 KNMIFITSTLPNPLELVTKIVSKLDTTKEQCTRYLLRLLPIEVVCKAYMDNIKTKANELF 185
Query: 197 LQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVK 256
++ E P F++ +NRR N I +D D++ ++ +K
Sbjct: 186 EKYFAQE------PKTFSIVFNRRS----NNSIKRDEIIEDLAEII------------LK 223
Query: 257 EVVSDSAVDLKSPELSVLVELL 278
+ + A DLK+P+++V+VE++
Sbjct: 224 KNPGNKA-DLKNPDIAVIVEVI 244
>gi|312382047|gb|EFR27631.1| hypothetical protein AND_05556 [Anopheles darlingi]
Length = 308
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 62/264 (23%), Positives = 105/264 (39%), Gaps = 53/264 (20%)
Query: 27 YN-SPSSLLQRSHSGFLLTCT------IKREKSATKEAISILHKRLESSNTAGDSKRRKV 79
YN S + +Q GFL+TC I+ E +L+ S N A + K
Sbjct: 31 YNKSTARFMQPGQRGFLVTCNGRLHDCIRDSYRILNEYADLLYGAAPSPNEAEPAANEKS 90
Query: 80 CTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLL 139
D+ S EED + LQK + +K + + D N L +
Sbjct: 91 VDDE------------SEEEDISVTLQKEAAAAGIKKPMRFQ-NVDSGAT----NCLFIR 133
Query: 140 TFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQF 199
T +E + + + Q L + + R+ PI+A C K++ TV L
Sbjct: 134 TTLKEPTA----LAHKVLQDLSTTRKAKSRFILRMVPIEAVCRANLKDIIDTVGSL---- 185
Query: 200 VNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVV 259
+++ L P +A+ YN R N++P+D +++ L+
Sbjct: 186 --SDRHFLKEPKTYAIIYNHR----LNNELPRDDVIRELADLISSKN------------- 226
Query: 260 SDSAVDLKSPELSVLVELLPISGL 283
+ + +LK+PEL+V+VE+ I GL
Sbjct: 227 AGNKANLKNPELAVIVEV--IKGL 248
>gi|255544952|ref|XP_002513537.1| thump domain protein, putative [Ricinus communis]
gi|223547445|gb|EEF48940.1| thump domain protein, putative [Ricinus communis]
Length = 382
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 65/147 (44%), Gaps = 26/147 (17%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG++ + R+ P+ +I +I S S + R+ P++ C E+E+ +
Sbjct: 233 NGVVFVQMRKRDGDPSPKDIAQHIMTSAASTRKHMSRFILRVLPVEIACYASEEEISKAM 292
Query: 193 SKLVLQFVN-DEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIV 251
+ +V ++ D QN P KFAV Y R + +DR K I+
Sbjct: 293 APIVEKYFPVDTQN----PQKFAVLYEARA-----------------NTGIDRMK---II 328
Query: 252 AAAVKEVVSDSAVDLKSPELSVLVELL 278
+ K V VDL +P+ +++VE++
Sbjct: 329 NSVAKSVPGPHKVDLNNPDKTIVVEIV 355
>gi|363808318|ref|NP_001242247.1| uncharacterized protein LOC100812892 [Glycine max]
gi|255647118|gb|ACU24027.1| unknown [Glycine max]
Length = 377
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 63/146 (43%), Gaps = 26/146 (17%)
Query: 134 NGLLLLTFPREHSPNTI-NIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG++ + ++ ++ +IV I S S + RI PI+ C ++E+ +
Sbjct: 227 NGVVFVQMRKKDGDRSLKDIVHRIVTSAASTGKHMSRFILRILPIEIACYASKEEISRAI 286
Query: 193 SKLVLQFVNDE-QNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIV 251
LV Q+ E QN P KFAV Y R + +DR I+
Sbjct: 287 KPLVEQYFPVETQN----PHKFAVLYEARA-----------------NTGVDR---MEII 322
Query: 252 AAAVKEVVSDSAVDLKSPELSVLVEL 277
A K V VDLK+P+ +++VE+
Sbjct: 323 DAVAKSVPGPHKVDLKNPDKTIVVEI 348
>gi|312066695|ref|XP_003136392.1| hypothetical protein LOAG_00804 [Loa loa]
gi|307768439|gb|EFO27673.1| hypothetical protein LOAG_00804 [Loa loa]
Length = 265
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 61/265 (23%), Positives = 108/265 (40%), Gaps = 35/265 (13%)
Query: 24 RFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDD 83
R++ N ++ G TC EK A +EA +++ + LE G S+ D+
Sbjct: 16 RYNQNVKQRWVESGCCGLFFTCN-GNEKQAVREAYNVIEQALEIWKKDGHSEEDS--DDE 72
Query: 84 MGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPR 143
+ K G S +ED A + K C ++D+ + N L T
Sbjct: 73 LAEK---GTNFGSDQEDVADSV-KRFC-DQARSDSRNVKKRKIWQRPTGTNNCLFFTVKG 127
Query: 144 EHSPNTINIVSNIFQSLGSGSLKSPVWC---HRIFPIQATCVLKEKELQATVSKLVLQFV 200
+ N + V + + + LK C R++P++ TC + EL VS+LV +
Sbjct: 128 VNDKNVYDFVDYLVEMV----LKERC-CRLLQRVWPVERTCRVDMVELDMEVSRLVSRHF 182
Query: 201 NDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVS 260
+ ++ F F+ + N DS L + +V A+K++
Sbjct: 183 HPTEDGKWPTYLF-------DFKARNN-------DS-----LGKGDVMDMVLLALKQLAP 223
Query: 261 DSAVDLKSPELSVLVELLPISGLPS 285
DS V L +P ++VLV+++ S + S
Sbjct: 224 DSHVSLDNPHIAVLVQIVHKSVMLS 248
>gi|449476100|ref|XP_004154640.1| PREDICTED: THUMP domain-containing protein 1-like [Cucumis sativus]
Length = 384
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 64/147 (43%), Gaps = 26/147 (17%)
Query: 134 NGLLLLTFPR-EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG++ + + + P +V ++ S+ S + RI PI+ C +E+ +
Sbjct: 234 NGVIFIQMRKNDGEPGPEAVVQHMMTSIASTRKHISRFILRILPIEVACYASVEEITRAI 293
Query: 193 SKLVLQ-FVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIV 251
L+ + F + QN PVKFAV Y R ++ +DR V
Sbjct: 294 KPLIEKNFPVESQN----PVKFAVLYEARA-----------------NSGIDRATIINTV 332
Query: 252 AAAVKEVVSDSAVDLKSPELSVLVELL 278
A AV E VDL +PE +++VE++
Sbjct: 333 AKAVPE---PHKVDLNNPEKTIVVEIV 356
>gi|242071399|ref|XP_002450976.1| hypothetical protein SORBIDRAFT_05g021960 [Sorghum bicolor]
gi|241936819|gb|EES09964.1| hypothetical protein SORBIDRAFT_05g021960 [Sorghum bicolor]
Length = 420
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 59/145 (40%), Gaps = 24/145 (16%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG + + R P + IV N+ S S + R+ P + TC E+E+ +
Sbjct: 267 NGCIFIQMHKRAGDPGPVEIVQNMMSSAVSTRKHMSRFILRVLPAEVTCYASEEEITKAI 326
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
S LV ++ E KFAV Y R + +DR K I+
Sbjct: 327 SPLVEKYFPKE---CPSGHKFAVLYEARS-----------------NTGIDRMK---IIN 363
Query: 253 AAVKEVVSDSAVDLKSPELSVLVEL 277
A K V VDL +P+ +++V++
Sbjct: 364 AVAKSVPQPHKVDLSNPDKTIVVQI 388
>gi|449442547|ref|XP_004139043.1| PREDICTED: THUMP domain-containing protein 1-like [Cucumis sativus]
Length = 386
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 64/147 (43%), Gaps = 26/147 (17%)
Query: 134 NGLLLLTFPR-EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG++ + + + P +V ++ S+ S + RI PI+ C +E+ +
Sbjct: 236 NGVVFIQMRKNDGEPGPEAVVQHMMTSIASTRKHISRFILRILPIEVACYASVEEITRAI 295
Query: 193 SKLVLQ-FVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIV 251
L+ + F + QN PVKFAV Y R ++ +DR V
Sbjct: 296 KPLIEKNFPVESQN----PVKFAVLYEARA-----------------NSGIDRATIINTV 334
Query: 252 AAAVKEVVSDSAVDLKSPELSVLVELL 278
A AV E VDL +PE +++VE++
Sbjct: 335 AKAVPE---PHKVDLNNPEKTIVVEIV 358
>gi|224104217|ref|XP_002313361.1| predicted protein [Populus trichocarpa]
gi|222849769|gb|EEE87316.1| predicted protein [Populus trichocarpa]
Length = 378
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 66/147 (44%), Gaps = 26/147 (17%)
Query: 134 NGLLLLTFPR-EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG+ + + + P +IV +I S S + R+ PI+ C E+E+ +
Sbjct: 228 NGVAFIQMRKIDGDPCPKDIVQHIMTSAASTRKHMSRFIIRMLPIEVACYASEEEISRAI 287
Query: 193 SKLVLQFVN-DEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIV 251
+ +V ++ D Q+ P+KFAV Y R ++ +DR K I+
Sbjct: 288 APVVEKYFPVDTQD----PLKFAVMYEARA-----------------NSGIDRMK---II 323
Query: 252 AAAVKEVVSDSAVDLKSPELSVLVELL 278
+ K V VDL +P+ +++VE++
Sbjct: 324 NSVAKSVPGPHKVDLGNPDKTIVVEIV 350
>gi|91078880|ref|XP_972914.1| PREDICTED: similar to THUMP domain-containing protein 1 [Tribolium
castaneum]
gi|270004143|gb|EFA00591.1| hypothetical protein TcasGA2_TC003462 [Tribolium castaneum]
Length = 251
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 51/240 (21%), Positives = 95/240 (39%), Gaps = 54/240 (22%)
Query: 40 GFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEE 99
GFL +C REK KE+ ++L++ + + + DD K EI+ +++
Sbjct: 32 GFLCSCN-NREKDCIKESYNLLNEYADKLYQESNEPEAEQDIDDSLAK-----EISELKQ 85
Query: 100 DSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQS 159
D +E F ++ +G F R N + + I +
Sbjct: 86 DK------------------SEKRF-----QVIESGAKNFLFIRTSLENPVELAEAIIKD 122
Query: 160 LGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKL-SRPVKFAVGYN 218
+ + + R+ P++ TC +A VS +V FV Q P F V +N
Sbjct: 123 VDGSKTQRTKFLLRLIPVEITC-------KANVSDIVNAFVPLAQKHFVESPQTFCVIFN 175
Query: 219 RRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELL 278
R + ++ R++ ++AA V E+ D VDLK +++++VE++
Sbjct: 176 HRN-----------------NNVVSRDEVIKLIAAKVSELRPDHKVDLKEAKVAIIVEVI 218
>gi|388512383|gb|AFK44253.1| unknown [Medicago truncatula]
Length = 144
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 59/135 (43%), Gaps = 27/135 (20%)
Query: 144 EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDE 203
+ SPN IV+ I S S + RI PI+ +C ++E+ + LV Q+ E
Sbjct: 7 DKSPN--KIVNRIVTSAASTRKHMSRFILRILPIEVSCYASKEEISKAIQPLVEQYFPVE 64
Query: 204 -QNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDS 262
QN P KFAV Y R + +DR I+ A K + +
Sbjct: 65 TQN----PQKFAVMYEARA-----------------NTGVDR---MEIIDAVAKSIPAPH 100
Query: 263 AVDLKSPELSVLVEL 277
VDL +P+ +++VE+
Sbjct: 101 KVDLSNPDKTIIVEI 115
>gi|158285169|ref|XP_308171.3| AGAP007705-PA [Anopheles gambiae str. PEST]
gi|157019863|gb|EAA04205.3| AGAP007705-PA [Anopheles gambiae str. PEST]
Length = 289
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 59/264 (22%), Positives = 106/264 (40%), Gaps = 47/264 (17%)
Query: 34 LQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAE 93
++ H G L+TC + +++ IL++ + ++ R + GG
Sbjct: 1 MKPGHRGILVTCN-GHVRDCVRDSYRILNEYADELYGPVETTRCEEENQPDGG------- 52
Query: 94 INSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIV 153
S EED + LQK E ++ K F + +G + F + P+ IV
Sbjct: 53 --SDEEDISVKLQK-EAEAAGKKRNAASFRF-----QSVESGAMNCLFIQTVLPDPNEIV 104
Query: 154 SNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKF 213
+ + L + + R+ PIQA C K++ V +L Q+ L P +
Sbjct: 105 VKLMRDLSATKKHKSRFILRMLPIQAVCRANLKDIMDVVGRLGDQYF------LKEPKTY 158
Query: 214 AVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSV 273
A+ +NRR ++ L R+ +A + + + +LK+PEL+V
Sbjct: 159 AIVFNRR-----------------LNNDLSRDDVIRELADLITSKNAGNKANLKNPELAV 201
Query: 274 LVELLPISGLPSELLVVGVSILPQ 297
+VE+ I GL + ILP+
Sbjct: 202 IVEV--IKGL------CCIGILPE 217
>gi|412986887|emb|CCO15313.1| unknown protein [Bathycoccus prasinos]
Length = 291
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 28/47 (59%)
Query: 24 RFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNT 70
+FDY + S +GF LTC + RE+SA KEA+ +L + L+ T
Sbjct: 47 QFDYALEKQFRENSVAGFALTCVMHRERSAMKEALELLRECLDDDFT 93
>gi|413925320|gb|AFW65252.1| hypothetical protein ZEAMMB73_035669 [Zea mays]
Length = 218
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 61/145 (42%), Gaps = 24/145 (16%)
Query: 134 NGLLLLTF-PREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG + + R P + IV N+ S + R+ P++ C E+E+ +
Sbjct: 66 NGCIFIQMHKRAGDPGPVEIVQNMMSSAALTRKHMSRFILRVLPVEVACYASEEEITKAI 125
Query: 193 SKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVA 252
S L+ ++ E S KFAV Y R + +DR K I+
Sbjct: 126 SPLIEKYFPKE---CSSGHKFAVLYEARS-----------------NTGIDRMK---IIN 162
Query: 253 AAVKEVVSDSAVDLKSPELSVLVEL 277
AA K V VDLK+P+ +++V++
Sbjct: 163 AAAKSVPQPHKVDLKNPDKTIVVQI 187
>gi|224054516|ref|XP_002298299.1| predicted protein [Populus trichocarpa]
gi|222845557|gb|EEE83104.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 35/147 (23%), Positives = 65/147 (44%), Gaps = 26/147 (17%)
Query: 134 NGLLLLTFPR-EHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATV 192
NG++ + + + P +IV ++ S S + R+ PI+ C E E+ +
Sbjct: 230 NGVVFVQMRKIDGDPCPKDIVQHMMTSAASTRKHMSRFILRVLPIEVACYASEDEISRAI 289
Query: 193 SKLVLQFVN-DEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIV 251
+ +V ++ D Q+ P KFAV Y R ++ +DR K I+
Sbjct: 290 APVVEKYFPVDTQD----PQKFAVMYEARA-----------------NSGIDRMK---II 325
Query: 252 AAAVKEVVSDSAVDLKSPELSVLVELL 278
+ K V VDL +P+ +++VE++
Sbjct: 326 NSVAKCVPGPHKVDLSNPDKTIVVEIV 352
>gi|383866083|ref|XP_003708501.1| PREDICTED: THUMP domain-containing protein 1-like [Megachile
rotundata]
Length = 310
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 47/245 (19%), Positives = 99/245 (40%), Gaps = 28/245 (11%)
Query: 34 LQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAE 93
L+ GFL TC REK ++A +L++ + G + ++ D+ E
Sbjct: 24 LEPGMRGFLCTCNY-REKDCVRDAYKLLNEF--ADEIYGSNTKKDFNDDNTEQSMIKKHE 80
Query: 94 INSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIV 153
+ +E+ N+ + +KT+ D ++ G+ + F R + + +V
Sbjct: 81 TDQSDEEDDISTALNKEINELKTEYAKPVD--SRRFQVVDTGVKNVIFIRSSIIHPLELV 138
Query: 154 SNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKF 213
+ I L + + R+ PI+ C +++ ++ ++ E P F
Sbjct: 139 TKIITELDRTKQQRTRFLLRLLPIEVVCKAYMNDIKLKADTMLEKYFAQE------PKTF 192
Query: 214 AVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSV 273
++ +NR N I +D D++ ++ + + DLK+PEL+V
Sbjct: 193 SIVFNRHS----NNNINRDEVIEDLAEIISKKN-------------PGNKADLKAPELAV 235
Query: 274 LVELL 278
+VE++
Sbjct: 236 IVEVI 240
>gi|350404454|ref|XP_003487110.1| PREDICTED: THUMP domain-containing protein 1-like [Bombus
impatiens]
Length = 319
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 47/248 (18%), Positives = 97/248 (39%), Gaps = 43/248 (17%)
Query: 40 GFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEE 99
GFL TC EK ++A +L N D V T D D ++ S ++
Sbjct: 32 GFLCTCNFS-EKECVRDAYKLL-------NEFADEIYGPVTTKDFDN---DNSKEKSEKD 80
Query: 100 DSAGGLQKNECHSSVKTDTNTETD---------FDMSLVKLTRNGLLLLTFPREHSPNTI 150
DS + + + N + + + ++ G+ + F + N +
Sbjct: 81 DSVNETKNIDNEDDISVALNKQINELKAEYSKTINARRFQVVDTGVKNVIFIKSTLTNPL 140
Query: 151 NIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRP 210
+V+ I L + + + R+ PI+ C ++++ ++ ++ E P
Sbjct: 141 ELVTKIISELDNTKQQRTRFLLRLLPIEVICKANMNDIKSKADVMLEKYFAQE------P 194
Query: 211 VKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPE 270
F++ +NR N I +D D++ ++++ + DLK+PE
Sbjct: 195 KTFSIVFNRHS----NNNIHRDEVIEDLAEIINKKN-------------PGNKADLKNPE 237
Query: 271 LSVLVELL 278
L+V+VE++
Sbjct: 238 LAVIVEMI 245
>gi|413925321|gb|AFW65253.1| hypothetical protein ZEAMMB73_035669 [Zea mays]
Length = 120
Score = 37.7 bits (86), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 23/105 (21%)
Query: 173 RIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKD 232
R+ P++ C E+E+ +S L+ ++ E S KFAV Y R
Sbjct: 8 RVLPVEVACYASEEEITKAISPLIEKYFPKE---CSSGHKFAVLYEARS----------- 53
Query: 233 TKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVEL 277
+ +DR K I+ AA K V VDLK+P+ +++V++
Sbjct: 54 ------NTGIDRMK---IINAAAKSVPQPHKVDLKNPDKTIVVQI 89
>gi|340716465|ref|XP_003396718.1| PREDICTED: THUMP domain-containing protein 1-like [Bombus
terrestris]
Length = 320
Score = 37.7 bits (86), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 47/248 (18%), Positives = 96/248 (38%), Gaps = 43/248 (17%)
Query: 40 GFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEE 99
GFL TC EK ++A +L N D V T D D ++ S ++
Sbjct: 33 GFLCTCNFS-EKECIRDAYKLL-------NEFADEIYGPVTTKDFDN---DNSKEKSEKD 81
Query: 100 DSAGGLQKNECHSSVKTDTNTETD---------FDMSLVKLTRNGLLLLTFPREHSPNTI 150
DS + + + N + + + ++ G+ + F + N +
Sbjct: 82 DSVNETKNIDNEDDISVALNKQINELKAEYSKTINARRFQVVDTGVKNVIFIKSTLTNPL 141
Query: 151 NIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRP 210
+V+ I L + + + R+ PI+ C ++++ + ++ E P
Sbjct: 142 ELVTKIISELDNTKQQRTRFLLRLLPIEVICKANMNDIKSKADVMFEKYFAQE------P 195
Query: 211 VKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPE 270
F++ +NR N I +D D++ ++++ + DLK+PE
Sbjct: 196 KTFSIVFNRHS----NNNIHRDEVIKDLAEIINKKN-------------PGNKADLKNPE 238
Query: 271 LSVLVELL 278
L+V+VE++
Sbjct: 239 LAVIVEMI 246
>gi|45387737|ref|NP_991220.1| uncharacterized protein LOC402955 [Danio rerio]
gi|41351167|gb|AAH65594.1| Zgc:77221 [Danio rerio]
Length = 353
Score = 37.7 bits (86), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 52/240 (21%), Positives = 93/240 (38%), Gaps = 45/240 (18%)
Query: 40 GFLLTCTIKREKSATKEAISILHKRL-ESSNTAGDSKRRKVCTDDMGGKCADGAEINSIE 98
G L+TC + E+ T EA S+L + E +S +D CA E++ ++
Sbjct: 32 GVLITCNMN-ERKCTSEAFSLLSEYADELYGPEQESLSEDEEQEDEDAGCALQREVSQLQ 90
Query: 99 EDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQ 158
S G Q+ S+V + N + F R H + +V +I
Sbjct: 91 SSSKGRQQR---FSAVDSGANN------------------VVFIRTHGVDPAQLVHHILS 129
Query: 159 SLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYN 218
L + R+ P+ ATC +++Q +S + ++ R F + +
Sbjct: 130 DLHLTRKRKSRVILRMLPVSATCRAFPEDMQKLLSVFLQRWF-----LAPRHATFQICFK 184
Query: 219 RRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELL 278
R S+ R + T VA V ++ + VDL +PELS+++E++
Sbjct: 185 ARN-----------------SSHSKREEVITAVAGLVGQLNPLNKVDLTNPELSIIIEII 227
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.313 0.130 0.361
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,664,924,546
Number of Sequences: 23463169
Number of extensions: 181431296
Number of successful extensions: 344888
Number of sequences better than 100.0: 114
Number of HSP's better than 100.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 79
Number of HSP's that attempted gapping in prelim test: 344706
Number of HSP's gapped (non-prelim): 140
length of query: 320
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 178
effective length of database: 9,027,425,369
effective search space: 1606881715682
effective search space used: 1606881715682
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 77 (34.3 bits)