BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027302
(225 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255552764|ref|XP_002517425.1| conserved hypothetical protein [Ricinus communis]
gi|223543436|gb|EEF44967.1| conserved hypothetical protein [Ricinus communis]
Length = 216
Score = 218 bits (556), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/224 (54%), Positives = 158/224 (70%), Gaps = 8/224 (3%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M G D KK LLTLIRD ASEKSQGERRV+GLKKR+ ++R E+E EN E E +K+LKETIE
Sbjct: 1 MAGIDQKKQLLTLIRDCASEKSQGERRVIGLKKRLVEIRTEVETENAELEASKRLKETIE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELN 120
Q+LKG+EVELALN Q+LE+R+ I +EIS++G+EV+ LK E+ + RD FI QM +L
Sbjct: 61 QQLKGFEVELALNVAFIQSLEARVFQIQDEISSIGSEVDGLKNEERTSRDEFIEQMVKLG 120
Query: 121 DKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
+IR F + IA Q+++S GT A +E E + L D + ++VSQT +EE
Sbjct: 121 TRIRRFQEKIASEFQKENSIGTTAETENK--------VESDSRILADMVDQIVSQTTKEE 172
Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
+ Y E+ IQKQVQ E +DL+ KVSLM MI ET +LQDLTRYP
Sbjct: 173 QEYLVEQIIQKQVQQEYVDLQEKVSLMGMIMKETKALQDLTRYP 216
>gi|42565264|ref|NP_189478.2| uncharacterized protein [Arabidopsis thaliana]
gi|332643917|gb|AEE77438.1| uncharacterized protein [Arabidopsis thaliana]
Length = 292
Score = 212 bits (540), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 118/225 (52%), Positives = 161/225 (71%), Gaps = 5/225 (2%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E+AK++KE E
Sbjct: 1 MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEVEAANSEVEKAKRIKEVAE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELN 120
+EL GYEVEL+LN+ Q+LE+RISL+ +E++T+G+EV+ALK ++ LRD FI QM ELN
Sbjct: 61 EELNGYEVELSLNDATIQSLEARISLLQDEVTTIGSEVDALKNKEGLLRDQFISQMEELN 120
Query: 121 DKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
+IR F K++ +L DD G A +A + S L+ ++ ++EV SQ A+EE
Sbjct: 121 KEIREFQKTVDSSLSSDDGIGITANVKASEDGSG-----ADLEAIKGMLSEVNSQLAKEE 175
Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYPC 225
E Y E+KIQ+Q+Q EL D E+K+SLME I +T S+Q LTRYP
Sbjct: 176 EGYLAEQKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLTRYPL 220
>gi|297815134|ref|XP_002875450.1| hypothetical protein ARALYDRAFT_484614 [Arabidopsis lyrata subsp.
lyrata]
gi|297321288|gb|EFH51709.1| hypothetical protein ARALYDRAFT_484614 [Arabidopsis lyrata subsp.
lyrata]
Length = 269
Score = 207 bits (527), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/225 (52%), Positives = 158/225 (70%), Gaps = 5/225 (2%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E AK++KE E
Sbjct: 1 MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEIEAANSEVENAKRIKEVAE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELN 120
+EL GYEVEL+LN+ Q+LE+RISL+ +E+STVG+EV+ALK ++ LRD FI QM ELN
Sbjct: 61 EELNGYEVELSLNDATIQSLEARISLLQDEVSTVGSEVDALKNKEGFLRDQFISQMEELN 120
Query: 121 DKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
+IR F K++ DDS G +A + S G A+K + +++V SQ A+EE
Sbjct: 121 KEIREFQKTVDSTFGNDDSTGITPNVKAFEDSS--GADSEAIKGM---LSDVSSQLAKEE 175
Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYPC 225
E Y E+KIQ+Q+Q EL D E+K+SLME I +T S+Q L RYP
Sbjct: 176 EGYLAEKKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLARYPL 220
>gi|363806888|ref|NP_001242299.1| uncharacterized protein LOC100815720 [Glycine max]
gi|255633752|gb|ACU17236.1| unknown [Glycine max]
Length = 220
Score = 189 bits (481), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 110/216 (50%), Positives = 145/216 (67%), Gaps = 5/216 (2%)
Query: 3 GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
GSD +K L +LIR+FA+EKSQGERRVV L+K+IEKL+ EL N E E AK+ KE +EQ+
Sbjct: 9 GSDSQKQLWSLIRNFAAEKSQGERRVVNLRKQIEKLKSELSEANAELENAKRCKELVEQD 68
Query: 63 LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
LKG+EV+L L ++ Q LE+R+SLI N IS VG+++E LK E+ LR+ F M +N K
Sbjct: 69 LKGFEVQLMLTESSAQTLEARVSLIQNHISAVGSDLETLKSEEADLREQFFHNMLHMNGK 128
Query: 123 IRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEEL 182
IR F +SI D TA++ +A K+ EVA+ LE + EVVSQ A+E+E
Sbjct: 129 IRKFQESIITC----DIDSTASI-DASQVTMKENDAEVAIMALESTLLEVVSQIAKEDEE 183
Query: 183 YQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQ 218
YQ E+K + VQ +LID E+KVSLM MI ET LQ
Sbjct: 184 YQAEQKNYENVQQQLIDCEKKVSLMSMIVMETKELQ 219
>gi|356519023|ref|XP_003528174.1| PREDICTED: uncharacterized protein LOC100797493 [Glycine max]
Length = 222
Score = 187 bits (476), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 141/212 (66%), Gaps = 5/212 (2%)
Query: 3 GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
GSD +K LL+LIRDFA+EKSQGERRVV L+K+IEKL EL N E E AK+ KE +EQ+
Sbjct: 7 GSDSQKQLLSLIRDFAAEKSQGERRVVNLRKQIEKLTSELSEANAELENAKRYKELVEQD 66
Query: 63 LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
L+G+EV+L L ++ Q L +R+SLI IS V ++ E L+ E+ +LR+ F + +N K
Sbjct: 67 LRGFEVQLMLTESSVQTLAARVSLIQGHISAVESDFETLRSEEAALREQFFHNILHMNGK 126
Query: 123 IRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEEL 182
IR F +SI G+ A +A K+ EVAL+ LE + EV+SQTA+E++
Sbjct: 127 IRKFQESIITC-----DIGSTASIDASQVIMKENDAEVALRALESTLLEVISQTAKEDQE 181
Query: 183 YQEEEKIQKQVQLELIDLERKVSLMEMIAYET 214
YQ E+KI + VQ +LID ERKVSLM MI ET
Sbjct: 182 YQAEQKIYENVQQQLIDCERKVSLMSMIVMET 213
>gi|9294572|dbj|BAB02853.1| unnamed protein product [Arabidopsis thaliana]
Length = 244
Score = 183 bits (464), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 158/241 (65%), Gaps = 22/241 (9%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E+AK++KE E
Sbjct: 1 MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEVEAANSEVEKAKRIKEVAE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKK---------EQE----- 106
+EL GYEVEL+LN+ Q+LE+RISL+ +E++T+G+EV+ALK+ EQ
Sbjct: 61 EELNGYEVELSLNDATIQSLEARISLLQDEVTTIGSEVDALKRSVYKPNGRTEQGDKVIF 120
Query: 107 -SLRDGFIVQMFELNDKI--RTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALK 163
SLR + + L+ + R F K++ +L DD G A +A + S L+
Sbjct: 121 YSLRYHLVPGVTHLSCLVGNREFQKTVDSSLSSDDGIGITANVKASEDGSG-----ADLE 175
Query: 164 TLEDKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRY 223
++ ++EV SQ A+EEE Y E+KIQ+Q+Q EL D E+K+SLME I +T S+Q LTRY
Sbjct: 176 AIKGMLSEVNSQLAKEEEGYLAEQKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLTRY 235
Query: 224 P 224
P
Sbjct: 236 P 236
>gi|224060018|ref|XP_002300029.1| predicted protein [Populus trichocarpa]
gi|222847287|gb|EEE84834.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/102 (65%), Positives = 81/102 (79%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M +DP++ LLTLIRDFASEKSQGERRVVGLK RI +L +L+A N E EE K+ KET E
Sbjct: 1 MAATDPQRQLLTLIRDFASEKSQGERRVVGLKNRIVELGCQLDAANAEMEEVKRFKETTE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALK 102
ELKGYE +LA N+ + Q LE+RISLI +EIS+VG+EVE LK
Sbjct: 61 LELKGYEFQLAFNDVSIQTLEARISLIQDEISSVGSEVEGLK 102
>gi|9294570|dbj|BAB02851.1| unnamed protein product [Arabidopsis thaliana]
Length = 217
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 132/210 (62%), Gaps = 29/210 (13%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E AK++KE E
Sbjct: 1 MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEVEAANSEVEHAKRIKEVAE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKK---------EQE----- 106
+EL GYEVEL+LN++ Q+LE +E++T+G EV+ALK+ EQ
Sbjct: 61 EELNGYEVELSLNDSTIQSLE-------DEVTTIGNEVDALKRSVYKPNGRTEQGDKVIF 113
Query: 107 -SLRDGFIVQMFELNDKI--RTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALK 163
SLR ++ + L+ + R F K++ +L ++DS G A +A + S + A+K
Sbjct: 114 YSLRFHLMMLLHILSCLVGKREFQKTVDSSLADEDSTGITANIKAFEDCSGDDLE--AIK 171
Query: 164 TLEDKIAEVVSQTAREEELYQEEEKIQKQV 193
L +++V SQ A+EEE Y E+K+Q+Q+
Sbjct: 172 NL---LSDVHSQLAKEEEGYLAEQKMQEQI 198
>gi|224140891|ref|XP_002323811.1| predicted protein [Populus trichocarpa]
gi|222866813|gb|EEF03944.1| predicted protein [Populus trichocarpa]
Length = 101
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/102 (58%), Positives = 81/102 (79%), Gaps = 3/102 (2%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M +DP++ LLTLIR+FA+EKSQGERRV+GL+KRI+++ E++ N E EE K +KET E
Sbjct: 1 MAFTDPQRQLLTLIREFATEKSQGERRVIGLEKRIQEVGCEVDVANGEMEEVKCIKETTE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALK 102
QELKGYE++L LN + Q LE+RIS +EIS+VG++VE LK
Sbjct: 61 QELKGYELQLFLNEASIQTLEARIS---HEISSVGSQVEGLK 99
>gi|296087754|emb|CBI35010.3| unnamed protein product [Vitis vinifera]
Length = 115
Score = 116 bits (290), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 64/117 (54%), Positives = 84/117 (71%), Gaps = 5/117 (4%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
M G+D +K LLTLIRDFA+EKSQGERRV LKKRI++L+ E++ N E EE+K+ KE E
Sbjct: 1 MAGTDSRKQLLTLIRDFAAEKSQGERRVESLKKRIQELQSEMDIANAELEESKRSKEAAE 60
Query: 61 QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMF 117
QELKGYEVELA+N + Q LE+R + I E+S VG+++ LK + GFI +F
Sbjct: 61 QELKGYEVELAMNEASIQTLEARNAAIQYEVSMVGSDLGILK-----VFIGFISGVF 112
>gi|297724653|ref|NP_001174690.1| Os06g0244200 [Oryza sativa Japonica Group]
gi|52076860|dbj|BAD45801.1| hypothetical protein [Oryza sativa Japonica Group]
gi|255676885|dbj|BAH93418.1| Os06g0244200 [Oryza sativa Japonica Group]
Length = 337
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/238 (37%), Positives = 135/238 (56%), Gaps = 21/238 (8%)
Query: 5 DPKKHLLTLIRDFASEKSQG--------------ERRVVGLKKRIEKLRLELEAENFERE 50
+P+K LL++IRDFA+EKS G ERRV LK+R++ LR +A E E
Sbjct: 13 EPQKQLLSIIRDFAAEKSHGGFSRDPLAASELLAERRVTDLKRRLDDLRAASDAAAAELE 72
Query: 51 EAKQLKETIEQELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRD 110
AK+ +E EQEL+G +V++A+ + QALE+ IS + EIS G++++ALK + + RD
Sbjct: 73 AAKRAREGAEQELRGGQVQVAIAAASIQALEATISHLQEEISKAGSDLDALKGKGDIERD 132
Query: 111 GFIVQMFELNDKIRTFHKSIA--FNLQEDDSFGTAAVSEADH--NFSKKGVPEVALKTLE 166
FI QM +LN KIR F ++++ F Q+ + E H + SK E LK L
Sbjct: 133 EFISQMDQLNTKIRQFQQTVSVEFKRQKCSELPSG---EGQHVRDMSKIEESEGILKDLI 189
Query: 167 DKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
DK+ ++ EE Y+++ +V+ EL D + K +LME + ET LQ+L YP
Sbjct: 190 DKVNNADAELHVLEEEYKKDLLHHDEVRRELADTQAKRALMEAVMGETKQLQELGEYP 247
>gi|110743389|dbj|BAE99581.1| hypothetical protein [Arabidopsis thaliana]
Length = 182
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/141 (47%), Positives = 95/141 (67%), Gaps = 5/141 (3%)
Query: 82 SRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQEDDSFG 141
+RISL+ +E++T+G+EV+ALK ++ LRD FI QM ELN +IR F K++ +L DD G
Sbjct: 1 ARISLLQDEVTTIGSEVDALKNKEGLLRDQFISQMEELNKEIREFQKTVDSSLSSDDGIG 60
Query: 142 TAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLE 201
A +A + S G A+K + ++EV SQ A+EEE Y E+KIQ+Q+Q EL D E
Sbjct: 61 ITANVKASEDGS--GADLEAIKGM---LSEVNSQLAKEEEGYLAEQKIQEQLQKELDDYE 115
Query: 202 RKVSLMEMIAYETGSLQDLTR 222
+K+SLME I +T S+Q LTR
Sbjct: 116 KKMSLMEAITDKTNSVQVLTR 136
>gi|125596680|gb|EAZ36460.1| hypothetical protein OsJ_20791 [Oryza sativa Japonica Group]
Length = 337
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/238 (37%), Positives = 132/238 (55%), Gaps = 21/238 (8%)
Query: 5 DPKKHLLTLIRDFASEKSQG--------------ERRVVGLKKRIEKLRLELEAENFERE 50
+P+K LL++IRDFA+EKS G ERRV LK+R++ LR +A E E
Sbjct: 13 EPQKQLLSIIRDFAAEKSHGGFSRDPLAASELLAERRVTDLKRRLDDLRAASDAAAAELE 72
Query: 51 EAKQLKETIEQELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRD 110
AK+ +E EQEL G V++A+ + Q LE+ IS + EIS G++++ALK + + RD
Sbjct: 73 AAKRAREGPEQELPGGPVQVAIAAGSIQGLEATISHLQEEISKAGSDLDALKGKGDIERD 132
Query: 111 GFIVQMFELNDKIRTFHKSIA--FNLQEDDSFGTAAVSEADH--NFSKKGVPEVALKTLE 166
FI QM +LN KIR F ++++ F Q+ + E H + SK E LK L
Sbjct: 133 EFISQMDQLNTKIRQFQQTVSVEFKRQKCSELPSG---EGQHVRDMSKIEESEGILKDLI 189
Query: 167 DKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
DK+ ++ EE Y+++ +V+ EL D + K +LME + ET LQ+L YP
Sbjct: 190 DKVNNADAELHVLEEEYKKDLLHHDEVRRELADTQAKRALMEAVMGETKQLQELGEYP 247
>gi|413944208|gb|AFW76857.1| hypothetical protein ZEAMMB73_834489 [Zea mays]
Length = 240
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/227 (35%), Positives = 129/227 (56%), Gaps = 9/227 (3%)
Query: 3 GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
G D K LL++IRDFA EKS GERRV L++R+ R + E + AK+ +E EQE
Sbjct: 7 GGDTPKQLLSIIRDFAYEKSHGERRVSDLRRRLADARAAADVAAAELDAAKRAREAAEQE 66
Query: 63 LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
+G +V+ A+ + ALE+ IS +H EIS +++ALK +++S R+ FI +M+E+N K
Sbjct: 67 FRGSQVQDAIAADSILALEATISCLHEEISKASTDLDALKSKEDSEREDFISKMYEMNAK 126
Query: 123 IRTFHKSIAFNLQEDD-----SFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTA 177
IR F + ++ L E + S V + +G+ E L DK++ + ++
Sbjct: 127 IRQFQQMVSLELAEYNHCELPSTDGEHVKDKSKTVDSEGISE----ELSDKVSNIEAEVQ 182
Query: 178 REEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
EE Y+++ +V+ EL +++ K +LME + ET LQ+L YP
Sbjct: 183 LLEEEYKKDLLYHDKVRQELAEVQAKRALMEAVMEETKQLQELGEYP 229
>gi|242095308|ref|XP_002438144.1| hypothetical protein SORBIDRAFT_10g008750 [Sorghum bicolor]
gi|241916367|gb|EER89511.1| hypothetical protein SORBIDRAFT_10g008750 [Sorghum bicolor]
Length = 298
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 130/224 (58%), Gaps = 3/224 (1%)
Query: 3 GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
G D K LL++IRDFA EKS GERRV L++R+ R +A E + AK+ +E EQ+
Sbjct: 9 GGDTPKQLLSIIRDFAYEKSHGERRVSDLRRRLADARAASDAAAAELDAAKRAREAAEQD 68
Query: 63 LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
L+G +V+ A+ + ALE+ IS + EIS G ++ALK + ++ R+ FI M+E+N K
Sbjct: 69 LRGSQVQAAIAADSILALEATISHLQEEISKAGTHLDALKSKGDNEREEFISNMYEMNAK 128
Query: 123 IRTFHKSIAFNLQEDDSFGTAAVSEADH--NFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
IR F + + L E + + +E +H + SK E K L DK++ + S+ E
Sbjct: 129 IRQFQQMASLELAEPNHCELPS-TEGEHVRDKSKTVDSEGISKELADKVSNIESEVQLLE 187
Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
E Y+++ +V+ EL D++ K +LME + ET LQ+L YP
Sbjct: 188 EEYKKDLLDHDKVRQELADVQAKRALMEAVMGETKQLQELGEYP 231
>gi|147815650|emb|CAN63835.1| hypothetical protein VITISV_036109 [Vitis vinifera]
Length = 213
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 84/154 (54%), Gaps = 40/154 (25%)
Query: 1 MEGSDPKKHLLTLIRDFASEKSQ------------------------------------G 24
M G+D +K LLTLIRD A+EKS+
Sbjct: 60 MAGTDSRKRLLTLIRDLAAEKSKVGENNFSSVVFGAFFLXKRRVVRFIFDIKPFLAARYA 119
Query: 25 ERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTA-FQALESR 83
ERRVVGLKKRI++L+ E++ N E EE+K+ K+ EQELKGYEVELA+N A Q LE+R
Sbjct: 120 ERRVVGLKKRIQELQSEMDIANAELEESKRSKDAAEQELKGYEVELAMNEEASIQTLEAR 179
Query: 84 ISLIHNEISTVGAEVEALKKEQESLRDGFIVQMF 117
+ I E+S VG+++E LK GFI +F
Sbjct: 180 NTAIQYEVSMVGSDLEVLKIFXG---PGFISGVF 210
>gi|51535851|dbj|BAD37935.1| hypothetical protein [Oryza sativa Japonica Group]
gi|218197864|gb|EEC80291.1| hypothetical protein OsI_22304 [Oryza sativa Indica Group]
Length = 442
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 83/133 (62%), Gaps = 3/133 (2%)
Query: 6 PKKHLLT---LIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
P+K L + ++RDFA+++S GER + LK R++ +R +A E E AK+ +E EQE
Sbjct: 13 PQKKLPSWPSMVRDFATDESHGERGMSVLKLRLDGVRSAYDAAAAELETAKRAREDAEQE 72
Query: 63 LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
L G +V++A+ + LE+ IS + EIS V ++++ALK + + RD FI QM +LN K
Sbjct: 73 LGGNQVQVAIAVVSIHGLEATISHLQEEISQVRSDLDALKGKGDIERDEFISQMDQLNTK 132
Query: 123 IRTFHKSIAFNLQ 135
IR F + ++ L
Sbjct: 133 IRKFQQMVSVELN 145
>gi|125596625|gb|EAZ36405.1| hypothetical protein OsJ_20734 [Oryza sativa Japonica Group]
Length = 420
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 78/123 (63%)
Query: 13 LIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELAL 72
++RDFA+++S GER + LK R++ +R +A E E AK+ +E EQEL G +V++A+
Sbjct: 1 MVRDFATDESHGERGMSVLKLRLDGVRSAYDAAAAELETAKRAREDAEQELGGNQVQVAI 60
Query: 73 NNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAF 132
+ LE+ IS + EIS V ++++ALK + + RD FI QM +LN KIR F + ++
Sbjct: 61 AVVSIHGLEATISHLQEEISQVRSDLDALKGKGDIERDEFISQMDQLNTKIRKFQQMVSV 120
Query: 133 NLQ 135
L
Sbjct: 121 ELN 123
>gi|218197884|gb|EEC80311.1| hypothetical protein OsI_22349 [Oryza sativa Indica Group]
Length = 121
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 75/105 (71%)
Query: 5 DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
+P+K LL++IRDFA+EKS GERRV LK+R++ +R +A E E AK+ +E EQEL+
Sbjct: 13 EPQKQLLSIIRDFAAEKSHGERRVKDLKRRLDDVRAASDAAAAELEAAKRAREGAEQELR 72
Query: 65 GYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLR 109
G +V++A+ + QALE+ IS + EIS G++++ALK+ + + R
Sbjct: 73 GGQVQVAIAAASIQALEATISHLQEEISKAGSDLDALKEWENTGR 117
>gi|30102588|gb|AAP21212.1| At3g28370 [Arabidopsis thaliana]
Length = 148
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/107 (45%), Positives = 67/107 (62%), Gaps = 5/107 (4%)
Query: 116 MFELNDKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQ 175
M ELN +IR F K++ +L DD G A +A + S G A+K + ++EV SQ
Sbjct: 1 MEELNKEIREFQKTVDSSLSSDDGIGITANVKASEDGS--GADLEAIKGM---LSEVNSQ 55
Query: 176 TAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTR 222
A+EEE Y E+KIQ+Q+Q EL D E+K+SLME I +T S+Q LTR
Sbjct: 56 LAKEEEGYLAEQKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLTR 102
>gi|30694968|ref|NP_175554.2| uncharacterized protein [Arabidopsis thaliana]
gi|332194546|gb|AEE32667.1| uncharacterized protein [Arabidopsis thaliana]
Length = 160
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 70/147 (47%), Gaps = 42/147 (28%)
Query: 5 DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
D +K +L+LIRDF L+ +E FER +
Sbjct: 23 DTQKQVLSLIRDFT-------------------LKDLVEISGFER---------FYYSIP 54
Query: 65 GYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKI- 123
G EL+LN+ Q+LE+RISL+ +E+ST G E +ALK+ +++LN +I
Sbjct: 55 GDTGELSLNDFTIQSLEARISLLQDEVSTNGIEDDALKR-----------SVYKLNGRID 103
Query: 124 --RTFHKSIAFNLQEDDSFGTAAVSEA 148
FHK+I ++ D++ G A ++A
Sbjct: 104 QGEKFHKTIDLSVGSDETTGITANNKA 130
>gi|12325377|gb|AAG52636.1|AC024261_23 hypothetical protein; 72506-71203 [Arabidopsis thaliana]
Length = 191
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 70/147 (47%), Gaps = 42/147 (28%)
Query: 5 DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
D +K +L+LIRDF L+ +E FER +
Sbjct: 23 DTQKQVLSLIRDFT-------------------LKDLVEISGFER---------FYYSIP 54
Query: 65 GYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKI- 123
G EL+LN+ Q+LE+RISL+ +E+ST G E +ALK+ +++LN +I
Sbjct: 55 GDTGELSLNDFTIQSLEARISLLQDEVSTNGIEDDALKR-----------SVYKLNGRID 103
Query: 124 --RTFHKSIAFNLQEDDSFGTAAVSEA 148
FHK+I ++ D++ G A ++A
Sbjct: 104 QGEKFHKTIDLSVGSDETTGITANNKA 130
>gi|156083234|ref|XP_001609101.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia bovis
T2Bo]
gi|154796351|gb|EDO05533.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia
bovis]
Length = 1359
Score = 45.8 bits (107), Expect = 0.012, Method: Composition-based stats.
Identities = 44/156 (28%), Positives = 77/156 (49%), Gaps = 14/156 (8%)
Query: 25 ERRVVGLKKRI---EKLRLELEAENFEREEAKQLKETIEQ---ELKGYEVELALNNTAFQ 78
E+ V G+K+++ E+++ LEA + EEAK+ +E +E EL+G + L AF
Sbjct: 727 EKLVEGVKEKVKALEEIKKTLEAAKEKLEEAKKTQEALEAAKGELEGAKGALTTAKKAFP 786
Query: 79 ALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQEDD 138
S I N +S A ++ L K + L+D + Q+ E K T + F+
Sbjct: 787 GSSSDIDPCKNIVS---ATIDGLHKALQLLKDA-VQQIMEATRKDTTEFQDAIFSA---- 838
Query: 139 SFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVS 174
++ V+E + + +K+ V AL TL I +++S
Sbjct: 839 TYTADVVTEVEKHQAKQKVTLDALSTLVSAIEDLLS 874
>gi|156083358|ref|XP_001609163.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia bovis
T2Bo]
gi|154796413|gb|EDO05595.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia
bovis]
Length = 1360
Score = 43.9 bits (102), Expect = 0.052, Method: Composition-based stats.
Identities = 41/145 (28%), Positives = 70/145 (48%), Gaps = 11/145 (7%)
Query: 33 KRIEKLRLELEAENFEREEAKQLKETIEQ---ELKGYEVELALNNTAFQALESRISLIHN 89
K +E+++ LEA + EEAK+ +E +E EL+G + L AF S I N
Sbjct: 738 KALEEIKKTLEAAKEKLEEAKKTQEALEAAKGELEGAKGALTTAKKAFPGSSSDIDPCKN 797
Query: 90 EISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQEDDSFGTAAVSEAD 149
+S A ++ L K + L+D + Q+ E K T + F+ ++ V+E +
Sbjct: 798 IVS---ATIDGLHKALQLLKDA-VQQIMEATRKDTTEFQDAIFSA----TYTADVVTEVE 849
Query: 150 HNFSKKGVPEVALKTLEDKIAEVVS 174
+ +K+ V AL TL I +++S
Sbjct: 850 KHQAKQKVTLDALSTLVSAIEDLLS 874
>gi|407833366|gb|EKF98741.1| hypothetical protein TCSYLVIO_010352 [Trypanosoma cruzi]
Length = 902
Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 45/80 (56%), Gaps = 6/80 (7%)
Query: 27 RVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQALESRISL 86
RVV R+EK R E+EA ER+E ++ K+T+++ L+ + A F A+ S ++
Sbjct: 259 RVVAQDLRVEKCRAEIEARTAERQEKEEAKKTLKEHLQALNEDPAKREVQFSAVRSELAK 318
Query: 87 IHNEISTVGAEVEALKKEQE 106
+ E++ EAL++ +E
Sbjct: 319 VQQELA------EALRRRKE 332
>gi|260795120|ref|XP_002592554.1| hypothetical protein BRAFLDRAFT_118903 [Branchiostoma floridae]
gi|229277775|gb|EEN48565.1| hypothetical protein BRAFLDRAFT_118903 [Branchiostoma floridae]
Length = 3563
Score = 38.5 bits (88), Expect = 2.1, Method: Composition-based stats.
Identities = 41/196 (20%), Positives = 90/196 (45%), Gaps = 8/196 (4%)
Query: 20 EKSQGERRVVGLKKRIEKLRLELEAEN--FEREEAKQLKETIEQELKGYEVELALNNTAF 77
+ S+ + ++ + I +LR+++E+ N E K L E+E + E + T+
Sbjct: 2154 QSSEKDGKIQEQMEEIRRLRMKIESLNDDMTERERKMLDLAREKEELQGQYEFYFSETS- 2212
Query: 78 QALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQED 137
L+++++ I + +G EAL+ E ESLR N + H+ I L+ +
Sbjct: 2213 -KLQTQVTEIQTVQTQMGGSTEALEDELESLRRRLKSDNEHANQEAFELHRRIT-ELELE 2270
Query: 138 DSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEELYQEEEKIQKQVQLEL 197
+ T + + N S +G EV + L ++ ++ E Y E K+ ++++ ++
Sbjct: 2271 RNTATEKLKHMEVNMS-QGSGEV--EKLRQQLRRAEQDSSNTELRYNEMHKVNERLRSDM 2327
Query: 198 IDLERKVSLMEMIAYE 213
D+ + + + ++ E
Sbjct: 2328 DDILHQRDIAQRLSKE 2343
>gi|71651425|ref|XP_814391.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70879358|gb|EAN92540.1| hypothetical protein Tc00.1047053511239.90 [Trypanosoma cruzi]
Length = 869
Score = 37.4 bits (85), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 6/80 (7%)
Query: 27 RVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQALESRISL 86
RV+ R+EK R E+EA ER+E ++ K+T+++ L+ + A F A+ S ++
Sbjct: 226 RVLAQDLRVEKCRAEIEARTAERQEKEEAKKTLKEHLQALNEDPAKREVQFNAVRSELAK 285
Query: 87 IHNEISTVGAEVEALKKEQE 106
+ E++ EAL++ +E
Sbjct: 286 VQQELA------EALRRRKE 299
>gi|345482259|ref|XP_001607886.2| PREDICTED: intersectin-1-like [Nasonia vitripennis]
Length = 1728
Score = 37.0 bits (84), Expect = 5.6, Method: Composition-based stats.
Identities = 39/205 (19%), Positives = 99/205 (48%), Gaps = 17/205 (8%)
Query: 18 ASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAF 77
A E+ + R+ + ++++E + + + +R++ + + ++ + + +EL+ N
Sbjct: 569 AQEQREAARKEMERQRQLEWEKQKSQELQAQRQKEQDILLKLKAKNQSLTIELSSLNEKV 628
Query: 78 QALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQED 137
+ L +I +ST+ ++ ++ + RD + +M L +K+R ++ + QE
Sbjct: 629 KELSQKICDTRVGVSTIKTTIDGMR----TTRDQQLQEMAALKNKLREQNQRLLTISQEK 684
Query: 138 DSFGTAAVSEADHNFSK-KGVPEVALKTLEDKIAEVVSQTAREEELY----QEEEKIQKQ 192
+ AD + + + ++ALK +EDKIAE+ + +E+ ++ E++Q Q
Sbjct: 685 ARLDAKTKANADPAGQEAQKLKQIALKQMEDKIAEMEKEIKDKEKEVENSNKQLEELQNQ 744
Query: 193 VQLELIDL--------ERKVSLMEM 209
+++ + E+K ++EM
Sbjct: 745 MEVMFTNCEELNAKFEEKKAKVLEM 769
>gi|197117352|ref|YP_002137779.1| chromosome segregation ATPase SMC [Geobacter bemidjiensis Bem]
gi|197086712|gb|ACH37983.1| chromosome segregation ATPase SMC [Geobacter bemidjiensis Bem]
Length = 1176
Score = 37.0 bits (84), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 53/101 (52%), Gaps = 11/101 (10%)
Query: 21 KSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQAL 80
+ + E R + L+K +E L+ LE FE EEA+++ +++ + + + A+Q +
Sbjct: 772 RGEAEERKLALEKNVEALQGRLEGSRFEMEEAREMVTSLKVRVAALKEKGESTQRAYQRV 831
Query: 81 E-------SRISLIHNEISTVGAE----VEALKKEQESLRD 110
E SRIS NE+ GAE ++A+ +E+LRD
Sbjct: 832 EALCADLASRISSRGNELEGSGAERTRLIDAIAAGEEALRD 872
>gi|71662315|ref|XP_818166.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70883401|gb|EAN96315.1| hypothetical protein Tc00.1047053506605.100 [Trypanosoma cruzi]
Length = 869
Score = 37.0 bits (84), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 6/80 (7%)
Query: 27 RVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQALESRISL 86
RV+ R+EK R E+EA ER+E ++ K+T+++ L+ + A F A+ S ++
Sbjct: 226 RVLAQDLRVEKCRAEIEARTAERQEKEEAKKTLKEHLQALNEDPAKREVQFGAVRSELAK 285
Query: 87 IHNEISTVGAEVEALKKEQE 106
+ E++ EAL++ +E
Sbjct: 286 VQQELA------EALRRRKE 299
>gi|92112829|ref|YP_572757.1| chromosome segregation protein SMC [Chromohalobacter salexigens DSM
3043]
gi|91795919|gb|ABE58058.1| Chromosome segregation protein SMC [Chromohalobacter salexigens DSM
3043]
Length = 1164
Score = 37.0 bits (84), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 66/129 (51%), Gaps = 4/129 (3%)
Query: 5 DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
D L +RD +++ Q ER + GL+ R+E+ R++++A + + E + ++
Sbjct: 873 DRAHQLAETLRDTEAKRQQHERNLDGLRGRLEEARMQVQALKLKADAEDAQLEELGHDVA 932
Query: 65 GYEVELALNNTAFQALESRISLIHNEISTVGA-EVEALKK--EQESLRDGFIVQMFELND 121
G + L N T A ++R+ I ++ +GA + A+++ +Q RD Q EL++
Sbjct: 933 GLQEGLDPNATE-SAWQTRLDEIAEKVRRLGAINLAAIEEYDQQAERRDYLEAQHAELSE 991
Query: 122 KIRTFHKSI 130
I T K+I
Sbjct: 992 AIETLEKAI 1000
>gi|334705393|ref|ZP_08521259.1| HlyD family secretion protein [Aeromonas caviae Ae398]
Length = 471
Score = 36.6 bits (83), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 81/167 (48%), Gaps = 34/167 (20%)
Query: 17 FASEKSQGERRVVGLKKRIEKLRLEL-------EAENFEREEAKQLKETIEQELKGYE-- 67
F S+ + E+ +V LK I +LR E+ E RE+ K+ IE +GYE
Sbjct: 114 FKSDFREREQELVTLKGDIARLRAEIRSVVVKNEPNLGWREQVAIDKQAIEFP-EGYENV 172
Query: 68 -VELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTF 126
E A + Q E+R++ + N++ +G ++E +KEQE++ ELN KIRT
Sbjct: 173 FAEYAARERSVQ--EARLNNLKNQLYILGQQIE--QKEQETI---------ELNAKIRTV 219
Query: 127 HKSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
+S+ QE + +K+G VP+V L LE ++ E+
Sbjct: 220 GRSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 257
>gi|14591553|ref|NP_143635.1| chromosome assembly protein [Pyrococcus horikoshii OT3]
gi|3258234|dbj|BAA30917.1| 1179aa long hypothetical chromosome assembly protein [Pyrococcus
horikoshii OT3]
Length = 1179
Score = 36.6 bits (83), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 46/80 (57%), Gaps = 16/80 (20%)
Query: 31 LKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVEL-ALNNTAFQALESRISLIHN 89
LK+++E LR+ +KE++E E+ V+L AL N +F+ L R+S +
Sbjct: 666 LKEKVENLRI--------------MKESLEGEVNSLRVKLKALENQSFE-LRIRMSDVEK 710
Query: 90 EISTVGAEVEALKKEQESLR 109
EIS + ++E L KE+ESLR
Sbjct: 711 EISLISKDLEKLIKEEESLR 730
>gi|406676615|ref|ZP_11083801.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AMC35]
gi|423209192|ref|ZP_17195746.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AER397]
gi|404619037|gb|EKB15957.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AER397]
gi|404626838|gb|EKB23648.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AMC35]
Length = 477
Score = 36.6 bits (83), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)
Query: 17 FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
F S+ + E+ VV LK + +LR E+ + + + A +E + Q GYE
Sbjct: 120 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 179
Query: 69 -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
E A + Q R++ + N++ +G ++E +KEQE + ELN KIRT +
Sbjct: 180 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 226
Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
+S+ QE + +K+G VP+V L LE ++ E+
Sbjct: 227 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 263
>gi|423201993|ref|ZP_17188572.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AER39]
gi|404615604|gb|EKB12566.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AER39]
Length = 477
Score = 36.2 bits (82), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)
Query: 17 FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
F S+ + E+ VV LK + +LR E+ + + + A +E + Q GYE
Sbjct: 120 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 179
Query: 69 -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
E A + Q R++ + N++ +G ++E +KEQE + ELN KIRT +
Sbjct: 180 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 226
Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
+S+ QE + +K+G VP+V L LE ++ E+
Sbjct: 227 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 263
>gi|330830103|ref|YP_004393055.1| secretion protein, HlyD family [Aeromonas veronii B565]
gi|328805239|gb|AEB50438.1| Secretion protein, HlyD family [Aeromonas veronii B565]
Length = 472
Score = 36.2 bits (82), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)
Query: 17 FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
F S+ + E+ VV LK + +LR E+ + + + A +E + Q GYE
Sbjct: 115 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 174
Query: 69 -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
E A + Q R++ + N++ +G ++E +KEQE + ELN KIRT +
Sbjct: 175 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 221
Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
+S+ QE + +K+G VP+V L LE ++ E+
Sbjct: 222 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 258
>gi|423207159|ref|ZP_17193715.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AMC34]
gi|404621108|gb|EKB18000.1| HlyD family type I secretion membrane fusion protein [Aeromonas
veronii AMC34]
Length = 477
Score = 36.2 bits (82), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)
Query: 17 FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
F S+ + E+ VV LK + +LR E+ + + + A +E + Q GYE
Sbjct: 120 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 179
Query: 69 -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
E A + Q R++ + N++ +G ++E +KEQE + ELN KIRT +
Sbjct: 180 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 226
Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
+S+ QE + +K+G VP+V L LE ++ E+
Sbjct: 227 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 263
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.129 0.334
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,960,280,207
Number of Sequences: 23463169
Number of extensions: 109526849
Number of successful extensions: 706904
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 574
Number of HSP's successfully gapped in prelim test: 10109
Number of HSP's that attempted gapping in prelim test: 667638
Number of HSP's gapped (non-prelim): 45078
length of query: 225
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 88
effective length of database: 9,144,741,214
effective search space: 804737226832
effective search space used: 804737226832
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 74 (33.1 bits)