BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 027302
         (225 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255552764|ref|XP_002517425.1| conserved hypothetical protein [Ricinus communis]
 gi|223543436|gb|EEF44967.1| conserved hypothetical protein [Ricinus communis]
          Length = 216

 Score =  218 bits (556), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 122/224 (54%), Positives = 158/224 (70%), Gaps = 8/224 (3%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M G D KK LLTLIRD ASEKSQGERRV+GLKKR+ ++R E+E EN E E +K+LKETIE
Sbjct: 1   MAGIDQKKQLLTLIRDCASEKSQGERRVIGLKKRLVEIRTEVETENAELEASKRLKETIE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELN 120
           Q+LKG+EVELALN    Q+LE+R+  I +EIS++G+EV+ LK E+ + RD FI QM +L 
Sbjct: 61  QQLKGFEVELALNVAFIQSLEARVFQIQDEISSIGSEVDGLKNEERTSRDEFIEQMVKLG 120

Query: 121 DKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
            +IR F + IA   Q+++S GT A +E           E   + L D + ++VSQT +EE
Sbjct: 121 TRIRRFQEKIASEFQKENSIGTTAETENK--------VESDSRILADMVDQIVSQTTKEE 172

Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
           + Y  E+ IQKQVQ E +DL+ KVSLM MI  ET +LQDLTRYP
Sbjct: 173 QEYLVEQIIQKQVQQEYVDLQEKVSLMGMIMKETKALQDLTRYP 216


>gi|42565264|ref|NP_189478.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332643917|gb|AEE77438.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 292

 Score =  212 bits (540), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 118/225 (52%), Positives = 161/225 (71%), Gaps = 5/225 (2%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E+AK++KE  E
Sbjct: 1   MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEVEAANSEVEKAKRIKEVAE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELN 120
           +EL GYEVEL+LN+   Q+LE+RISL+ +E++T+G+EV+ALK ++  LRD FI QM ELN
Sbjct: 61  EELNGYEVELSLNDATIQSLEARISLLQDEVTTIGSEVDALKNKEGLLRDQFISQMEELN 120

Query: 121 DKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
            +IR F K++  +L  DD  G  A  +A  + S        L+ ++  ++EV SQ A+EE
Sbjct: 121 KEIREFQKTVDSSLSSDDGIGITANVKASEDGSG-----ADLEAIKGMLSEVNSQLAKEE 175

Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYPC 225
           E Y  E+KIQ+Q+Q EL D E+K+SLME I  +T S+Q LTRYP 
Sbjct: 176 EGYLAEQKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLTRYPL 220


>gi|297815134|ref|XP_002875450.1| hypothetical protein ARALYDRAFT_484614 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321288|gb|EFH51709.1| hypothetical protein ARALYDRAFT_484614 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 269

 Score =  207 bits (527), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 119/225 (52%), Positives = 158/225 (70%), Gaps = 5/225 (2%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E AK++KE  E
Sbjct: 1   MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEIEAANSEVENAKRIKEVAE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELN 120
           +EL GYEVEL+LN+   Q+LE+RISL+ +E+STVG+EV+ALK ++  LRD FI QM ELN
Sbjct: 61  EELNGYEVELSLNDATIQSLEARISLLQDEVSTVGSEVDALKNKEGFLRDQFISQMEELN 120

Query: 121 DKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
            +IR F K++      DDS G     +A  + S  G    A+K +   +++V SQ A+EE
Sbjct: 121 KEIREFQKTVDSTFGNDDSTGITPNVKAFEDSS--GADSEAIKGM---LSDVSSQLAKEE 175

Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYPC 225
           E Y  E+KIQ+Q+Q EL D E+K+SLME I  +T S+Q L RYP 
Sbjct: 176 EGYLAEKKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLARYPL 220


>gi|363806888|ref|NP_001242299.1| uncharacterized protein LOC100815720 [Glycine max]
 gi|255633752|gb|ACU17236.1| unknown [Glycine max]
          Length = 220

 Score =  189 bits (481), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 110/216 (50%), Positives = 145/216 (67%), Gaps = 5/216 (2%)

Query: 3   GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
           GSD +K L +LIR+FA+EKSQGERRVV L+K+IEKL+ EL   N E E AK+ KE +EQ+
Sbjct: 9   GSDSQKQLWSLIRNFAAEKSQGERRVVNLRKQIEKLKSELSEANAELENAKRCKELVEQD 68

Query: 63  LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
           LKG+EV+L L  ++ Q LE+R+SLI N IS VG+++E LK E+  LR+ F   M  +N K
Sbjct: 69  LKGFEVQLMLTESSAQTLEARVSLIQNHISAVGSDLETLKSEEADLREQFFHNMLHMNGK 128

Query: 123 IRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEEL 182
           IR F +SI       D   TA++ +A     K+   EVA+  LE  + EVVSQ A+E+E 
Sbjct: 129 IRKFQESIITC----DIDSTASI-DASQVTMKENDAEVAIMALESTLLEVVSQIAKEDEE 183

Query: 183 YQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQ 218
           YQ E+K  + VQ +LID E+KVSLM MI  ET  LQ
Sbjct: 184 YQAEQKNYENVQQQLIDCEKKVSLMSMIVMETKELQ 219


>gi|356519023|ref|XP_003528174.1| PREDICTED: uncharacterized protein LOC100797493 [Glycine max]
          Length = 222

 Score =  187 bits (476), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 105/212 (49%), Positives = 141/212 (66%), Gaps = 5/212 (2%)

Query: 3   GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
           GSD +K LL+LIRDFA+EKSQGERRVV L+K+IEKL  EL   N E E AK+ KE +EQ+
Sbjct: 7   GSDSQKQLLSLIRDFAAEKSQGERRVVNLRKQIEKLTSELSEANAELENAKRYKELVEQD 66

Query: 63  LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
           L+G+EV+L L  ++ Q L +R+SLI   IS V ++ E L+ E+ +LR+ F   +  +N K
Sbjct: 67  LRGFEVQLMLTESSVQTLAARVSLIQGHISAVESDFETLRSEEAALREQFFHNILHMNGK 126

Query: 123 IRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEEL 182
           IR F +SI          G+ A  +A     K+   EVAL+ LE  + EV+SQTA+E++ 
Sbjct: 127 IRKFQESIITC-----DIGSTASIDASQVIMKENDAEVALRALESTLLEVISQTAKEDQE 181

Query: 183 YQEEEKIQKQVQLELIDLERKVSLMEMIAYET 214
           YQ E+KI + VQ +LID ERKVSLM MI  ET
Sbjct: 182 YQAEQKIYENVQQQLIDCERKVSLMSMIVMET 213


>gi|9294572|dbj|BAB02853.1| unnamed protein product [Arabidopsis thaliana]
          Length = 244

 Score =  183 bits (464), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 113/241 (46%), Positives = 158/241 (65%), Gaps = 22/241 (9%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E+AK++KE  E
Sbjct: 1   MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEVEAANSEVEKAKRIKEVAE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKK---------EQE----- 106
           +EL GYEVEL+LN+   Q+LE+RISL+ +E++T+G+EV+ALK+         EQ      
Sbjct: 61  EELNGYEVELSLNDATIQSLEARISLLQDEVTTIGSEVDALKRSVYKPNGRTEQGDKVIF 120

Query: 107 -SLRDGFIVQMFELNDKI--RTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALK 163
            SLR   +  +  L+  +  R F K++  +L  DD  G  A  +A  + S        L+
Sbjct: 121 YSLRYHLVPGVTHLSCLVGNREFQKTVDSSLSSDDGIGITANVKASEDGSG-----ADLE 175

Query: 164 TLEDKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRY 223
            ++  ++EV SQ A+EEE Y  E+KIQ+Q+Q EL D E+K+SLME I  +T S+Q LTRY
Sbjct: 176 AIKGMLSEVNSQLAKEEEGYLAEQKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLTRY 235

Query: 224 P 224
           P
Sbjct: 236 P 236


>gi|224060018|ref|XP_002300029.1| predicted protein [Populus trichocarpa]
 gi|222847287|gb|EEE84834.1| predicted protein [Populus trichocarpa]
          Length = 117

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 67/102 (65%), Positives = 81/102 (79%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M  +DP++ LLTLIRDFASEKSQGERRVVGLK RI +L  +L+A N E EE K+ KET E
Sbjct: 1   MAATDPQRQLLTLIRDFASEKSQGERRVVGLKNRIVELGCQLDAANAEMEEVKRFKETTE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALK 102
            ELKGYE +LA N+ + Q LE+RISLI +EIS+VG+EVE LK
Sbjct: 61  LELKGYEFQLAFNDVSIQTLEARISLIQDEISSVGSEVEGLK 102


>gi|9294570|dbj|BAB02851.1| unnamed protein product [Arabidopsis thaliana]
          Length = 217

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 132/210 (62%), Gaps = 29/210 (13%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M G D +K LL+LIRDF SE+S+GE+RVVGLKKRIE L+ E+EA N E E AK++KE  E
Sbjct: 1   MAGIDTQKQLLSLIRDFTSERSRGEQRVVGLKKRIEILQSEVEAANSEVEHAKRIKEVAE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKK---------EQE----- 106
           +EL GYEVEL+LN++  Q+LE       +E++T+G EV+ALK+         EQ      
Sbjct: 61  EELNGYEVELSLNDSTIQSLE-------DEVTTIGNEVDALKRSVYKPNGRTEQGDKVIF 113

Query: 107 -SLRDGFIVQMFELNDKI--RTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALK 163
            SLR   ++ +  L+  +  R F K++  +L ++DS G  A  +A  + S   +   A+K
Sbjct: 114 YSLRFHLMMLLHILSCLVGKREFQKTVDSSLADEDSTGITANIKAFEDCSGDDLE--AIK 171

Query: 164 TLEDKIAEVVSQTAREEELYQEEEKIQKQV 193
            L   +++V SQ A+EEE Y  E+K+Q+Q+
Sbjct: 172 NL---LSDVHSQLAKEEEGYLAEQKMQEQI 198


>gi|224140891|ref|XP_002323811.1| predicted protein [Populus trichocarpa]
 gi|222866813|gb|EEF03944.1| predicted protein [Populus trichocarpa]
          Length = 101

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 60/102 (58%), Positives = 81/102 (79%), Gaps = 3/102 (2%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M  +DP++ LLTLIR+FA+EKSQGERRV+GL+KRI+++  E++  N E EE K +KET E
Sbjct: 1   MAFTDPQRQLLTLIREFATEKSQGERRVIGLEKRIQEVGCEVDVANGEMEEVKCIKETTE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALK 102
           QELKGYE++L LN  + Q LE+RIS   +EIS+VG++VE LK
Sbjct: 61  QELKGYELQLFLNEASIQTLEARIS---HEISSVGSQVEGLK 99


>gi|296087754|emb|CBI35010.3| unnamed protein product [Vitis vinifera]
          Length = 115

 Score =  116 bits (290), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 64/117 (54%), Positives = 84/117 (71%), Gaps = 5/117 (4%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE 60
           M G+D +K LLTLIRDFA+EKSQGERRV  LKKRI++L+ E++  N E EE+K+ KE  E
Sbjct: 1   MAGTDSRKQLLTLIRDFAAEKSQGERRVESLKKRIQELQSEMDIANAELEESKRSKEAAE 60

Query: 61  QELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMF 117
           QELKGYEVELA+N  + Q LE+R + I  E+S VG+++  LK     +  GFI  +F
Sbjct: 61  QELKGYEVELAMNEASIQTLEARNAAIQYEVSMVGSDLGILK-----VFIGFISGVF 112


>gi|297724653|ref|NP_001174690.1| Os06g0244200 [Oryza sativa Japonica Group]
 gi|52076860|dbj|BAD45801.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|255676885|dbj|BAH93418.1| Os06g0244200 [Oryza sativa Japonica Group]
          Length = 337

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/238 (37%), Positives = 135/238 (56%), Gaps = 21/238 (8%)

Query: 5   DPKKHLLTLIRDFASEKSQG--------------ERRVVGLKKRIEKLRLELEAENFERE 50
           +P+K LL++IRDFA+EKS G              ERRV  LK+R++ LR   +A   E E
Sbjct: 13  EPQKQLLSIIRDFAAEKSHGGFSRDPLAASELLAERRVTDLKRRLDDLRAASDAAAAELE 72

Query: 51  EAKQLKETIEQELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRD 110
            AK+ +E  EQEL+G +V++A+   + QALE+ IS +  EIS  G++++ALK + +  RD
Sbjct: 73  AAKRAREGAEQELRGGQVQVAIAAASIQALEATISHLQEEISKAGSDLDALKGKGDIERD 132

Query: 111 GFIVQMFELNDKIRTFHKSIA--FNLQEDDSFGTAAVSEADH--NFSKKGVPEVALKTLE 166
            FI QM +LN KIR F ++++  F  Q+     +    E  H  + SK    E  LK L 
Sbjct: 133 EFISQMDQLNTKIRQFQQTVSVEFKRQKCSELPSG---EGQHVRDMSKIEESEGILKDLI 189

Query: 167 DKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
           DK+    ++    EE Y+++     +V+ EL D + K +LME +  ET  LQ+L  YP
Sbjct: 190 DKVNNADAELHVLEEEYKKDLLHHDEVRRELADTQAKRALMEAVMGETKQLQELGEYP 247


>gi|110743389|dbj|BAE99581.1| hypothetical protein [Arabidopsis thaliana]
          Length = 182

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/141 (47%), Positives = 95/141 (67%), Gaps = 5/141 (3%)

Query: 82  SRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQEDDSFG 141
           +RISL+ +E++T+G+EV+ALK ++  LRD FI QM ELN +IR F K++  +L  DD  G
Sbjct: 1   ARISLLQDEVTTIGSEVDALKNKEGLLRDQFISQMEELNKEIREFQKTVDSSLSSDDGIG 60

Query: 142 TAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLE 201
             A  +A  + S  G    A+K +   ++EV SQ A+EEE Y  E+KIQ+Q+Q EL D E
Sbjct: 61  ITANVKASEDGS--GADLEAIKGM---LSEVNSQLAKEEEGYLAEQKIQEQLQKELDDYE 115

Query: 202 RKVSLMEMIAYETGSLQDLTR 222
           +K+SLME I  +T S+Q LTR
Sbjct: 116 KKMSLMEAITDKTNSVQVLTR 136


>gi|125596680|gb|EAZ36460.1| hypothetical protein OsJ_20791 [Oryza sativa Japonica Group]
          Length = 337

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 89/238 (37%), Positives = 132/238 (55%), Gaps = 21/238 (8%)

Query: 5   DPKKHLLTLIRDFASEKSQG--------------ERRVVGLKKRIEKLRLELEAENFERE 50
           +P+K LL++IRDFA+EKS G              ERRV  LK+R++ LR   +A   E E
Sbjct: 13  EPQKQLLSIIRDFAAEKSHGGFSRDPLAASELLAERRVTDLKRRLDDLRAASDAAAAELE 72

Query: 51  EAKQLKETIEQELKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRD 110
            AK+ +E  EQEL G  V++A+   + Q LE+ IS +  EIS  G++++ALK + +  RD
Sbjct: 73  AAKRAREGPEQELPGGPVQVAIAAGSIQGLEATISHLQEEISKAGSDLDALKGKGDIERD 132

Query: 111 GFIVQMFELNDKIRTFHKSIA--FNLQEDDSFGTAAVSEADH--NFSKKGVPEVALKTLE 166
            FI QM +LN KIR F ++++  F  Q+     +    E  H  + SK    E  LK L 
Sbjct: 133 EFISQMDQLNTKIRQFQQTVSVEFKRQKCSELPSG---EGQHVRDMSKIEESEGILKDLI 189

Query: 167 DKIAEVVSQTAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
           DK+    ++    EE Y+++     +V+ EL D + K +LME +  ET  LQ+L  YP
Sbjct: 190 DKVNNADAELHVLEEEYKKDLLHHDEVRRELADTQAKRALMEAVMGETKQLQELGEYP 247


>gi|413944208|gb|AFW76857.1| hypothetical protein ZEAMMB73_834489 [Zea mays]
          Length = 240

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 129/227 (56%), Gaps = 9/227 (3%)

Query: 3   GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
           G D  K LL++IRDFA EKS GERRV  L++R+   R   +    E + AK+ +E  EQE
Sbjct: 7   GGDTPKQLLSIIRDFAYEKSHGERRVSDLRRRLADARAAADVAAAELDAAKRAREAAEQE 66

Query: 63  LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
            +G +V+ A+   +  ALE+ IS +H EIS    +++ALK +++S R+ FI +M+E+N K
Sbjct: 67  FRGSQVQDAIAADSILALEATISCLHEEISKASTDLDALKSKEDSEREDFISKMYEMNAK 126

Query: 123 IRTFHKSIAFNLQEDD-----SFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTA 177
           IR F + ++  L E +     S     V +       +G+ E     L DK++ + ++  
Sbjct: 127 IRQFQQMVSLELAEYNHCELPSTDGEHVKDKSKTVDSEGISE----ELSDKVSNIEAEVQ 182

Query: 178 REEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
             EE Y+++     +V+ EL +++ K +LME +  ET  LQ+L  YP
Sbjct: 183 LLEEEYKKDLLYHDKVRQELAEVQAKRALMEAVMEETKQLQELGEYP 229


>gi|242095308|ref|XP_002438144.1| hypothetical protein SORBIDRAFT_10g008750 [Sorghum bicolor]
 gi|241916367|gb|EER89511.1| hypothetical protein SORBIDRAFT_10g008750 [Sorghum bicolor]
          Length = 298

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 130/224 (58%), Gaps = 3/224 (1%)

Query: 3   GSDPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
           G D  K LL++IRDFA EKS GERRV  L++R+   R   +A   E + AK+ +E  EQ+
Sbjct: 9   GGDTPKQLLSIIRDFAYEKSHGERRVSDLRRRLADARAASDAAAAELDAAKRAREAAEQD 68

Query: 63  LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
           L+G +V+ A+   +  ALE+ IS +  EIS  G  ++ALK + ++ R+ FI  M+E+N K
Sbjct: 69  LRGSQVQAAIAADSILALEATISHLQEEISKAGTHLDALKSKGDNEREEFISNMYEMNAK 128

Query: 123 IRTFHKSIAFNLQEDDSFGTAAVSEADH--NFSKKGVPEVALKTLEDKIAEVVSQTAREE 180
           IR F +  +  L E +     + +E +H  + SK    E   K L DK++ + S+    E
Sbjct: 129 IRQFQQMASLELAEPNHCELPS-TEGEHVRDKSKTVDSEGISKELADKVSNIESEVQLLE 187

Query: 181 ELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTRYP 224
           E Y+++     +V+ EL D++ K +LME +  ET  LQ+L  YP
Sbjct: 188 EEYKKDLLDHDKVRQELADVQAKRALMEAVMGETKQLQELGEYP 231


>gi|147815650|emb|CAN63835.1| hypothetical protein VITISV_036109 [Vitis vinifera]
          Length = 213

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 64/154 (41%), Positives = 84/154 (54%), Gaps = 40/154 (25%)

Query: 1   MEGSDPKKHLLTLIRDFASEKSQ------------------------------------G 24
           M G+D +K LLTLIRD A+EKS+                                     
Sbjct: 60  MAGTDSRKRLLTLIRDLAAEKSKVGENNFSSVVFGAFFLXKRRVVRFIFDIKPFLAARYA 119

Query: 25  ERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTA-FQALESR 83
           ERRVVGLKKRI++L+ E++  N E EE+K+ K+  EQELKGYEVELA+N  A  Q LE+R
Sbjct: 120 ERRVVGLKKRIQELQSEMDIANAELEESKRSKDAAEQELKGYEVELAMNEEASIQTLEAR 179

Query: 84  ISLIHNEISTVGAEVEALKKEQESLRDGFIVQMF 117
            + I  E+S VG+++E LK        GFI  +F
Sbjct: 180 NTAIQYEVSMVGSDLEVLKIFXG---PGFISGVF 210


>gi|51535851|dbj|BAD37935.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|218197864|gb|EEC80291.1| hypothetical protein OsI_22304 [Oryza sativa Indica Group]
          Length = 442

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/133 (39%), Positives = 83/133 (62%), Gaps = 3/133 (2%)

Query: 6   PKKHLLT---LIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQE 62
           P+K L +   ++RDFA+++S GER +  LK R++ +R   +A   E E AK+ +E  EQE
Sbjct: 13  PQKKLPSWPSMVRDFATDESHGERGMSVLKLRLDGVRSAYDAAAAELETAKRAREDAEQE 72

Query: 63  LKGYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDK 122
           L G +V++A+   +   LE+ IS +  EIS V ++++ALK + +  RD FI QM +LN K
Sbjct: 73  LGGNQVQVAIAVVSIHGLEATISHLQEEISQVRSDLDALKGKGDIERDEFISQMDQLNTK 132

Query: 123 IRTFHKSIAFNLQ 135
           IR F + ++  L 
Sbjct: 133 IRKFQQMVSVELN 145


>gi|125596625|gb|EAZ36405.1| hypothetical protein OsJ_20734 [Oryza sativa Japonica Group]
          Length = 420

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 78/123 (63%)

Query: 13  LIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELAL 72
           ++RDFA+++S GER +  LK R++ +R   +A   E E AK+ +E  EQEL G +V++A+
Sbjct: 1   MVRDFATDESHGERGMSVLKLRLDGVRSAYDAAAAELETAKRAREDAEQELGGNQVQVAI 60

Query: 73  NNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAF 132
              +   LE+ IS +  EIS V ++++ALK + +  RD FI QM +LN KIR F + ++ 
Sbjct: 61  AVVSIHGLEATISHLQEEISQVRSDLDALKGKGDIERDEFISQMDQLNTKIRKFQQMVSV 120

Query: 133 NLQ 135
            L 
Sbjct: 121 ELN 123


>gi|218197884|gb|EEC80311.1| hypothetical protein OsI_22349 [Oryza sativa Indica Group]
          Length = 121

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 75/105 (71%)

Query: 5   DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
           +P+K LL++IRDFA+EKS GERRV  LK+R++ +R   +A   E E AK+ +E  EQEL+
Sbjct: 13  EPQKQLLSIIRDFAAEKSHGERRVKDLKRRLDDVRAASDAAAAELEAAKRAREGAEQELR 72

Query: 65  GYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLR 109
           G +V++A+   + QALE+ IS +  EIS  G++++ALK+ + + R
Sbjct: 73  GGQVQVAIAAASIQALEATISHLQEEISKAGSDLDALKEWENTGR 117


>gi|30102588|gb|AAP21212.1| At3g28370 [Arabidopsis thaliana]
          Length = 148

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/107 (45%), Positives = 67/107 (62%), Gaps = 5/107 (4%)

Query: 116 MFELNDKIRTFHKSIAFNLQEDDSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQ 175
           M ELN +IR F K++  +L  DD  G  A  +A  + S  G    A+K +   ++EV SQ
Sbjct: 1   MEELNKEIREFQKTVDSSLSSDDGIGITANVKASEDGS--GADLEAIKGM---LSEVNSQ 55

Query: 176 TAREEELYQEEEKIQKQVQLELIDLERKVSLMEMIAYETGSLQDLTR 222
            A+EEE Y  E+KIQ+Q+Q EL D E+K+SLME I  +T S+Q LTR
Sbjct: 56  LAKEEEGYLAEQKIQEQLQKELDDYEKKMSLMEAITDKTNSVQVLTR 102


>gi|30694968|ref|NP_175554.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332194546|gb|AEE32667.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 160

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 70/147 (47%), Gaps = 42/147 (28%)

Query: 5   DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
           D +K +L+LIRDF                    L+  +E   FER             + 
Sbjct: 23  DTQKQVLSLIRDFT-------------------LKDLVEISGFER---------FYYSIP 54

Query: 65  GYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKI- 123
           G   EL+LN+   Q+LE+RISL+ +E+ST G E +ALK+            +++LN +I 
Sbjct: 55  GDTGELSLNDFTIQSLEARISLLQDEVSTNGIEDDALKR-----------SVYKLNGRID 103

Query: 124 --RTFHKSIAFNLQEDDSFGTAAVSEA 148
               FHK+I  ++  D++ G  A ++A
Sbjct: 104 QGEKFHKTIDLSVGSDETTGITANNKA 130


>gi|12325377|gb|AAG52636.1|AC024261_23 hypothetical protein; 72506-71203 [Arabidopsis thaliana]
          Length = 191

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 70/147 (47%), Gaps = 42/147 (28%)

Query: 5   DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
           D +K +L+LIRDF                    L+  +E   FER             + 
Sbjct: 23  DTQKQVLSLIRDFT-------------------LKDLVEISGFER---------FYYSIP 54

Query: 65  GYEVELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKI- 123
           G   EL+LN+   Q+LE+RISL+ +E+ST G E +ALK+            +++LN +I 
Sbjct: 55  GDTGELSLNDFTIQSLEARISLLQDEVSTNGIEDDALKR-----------SVYKLNGRID 103

Query: 124 --RTFHKSIAFNLQEDDSFGTAAVSEA 148
               FHK+I  ++  D++ G  A ++A
Sbjct: 104 QGEKFHKTIDLSVGSDETTGITANNKA 130


>gi|156083234|ref|XP_001609101.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia bovis
           T2Bo]
 gi|154796351|gb|EDO05533.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia
           bovis]
          Length = 1359

 Score = 45.8 bits (107), Expect = 0.012,   Method: Composition-based stats.
 Identities = 44/156 (28%), Positives = 77/156 (49%), Gaps = 14/156 (8%)

Query: 25  ERRVVGLKKRI---EKLRLELEAENFEREEAKQLKETIEQ---ELKGYEVELALNNTAFQ 78
           E+ V G+K+++   E+++  LEA   + EEAK+ +E +E    EL+G +  L     AF 
Sbjct: 727 EKLVEGVKEKVKALEEIKKTLEAAKEKLEEAKKTQEALEAAKGELEGAKGALTTAKKAFP 786

Query: 79  ALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQEDD 138
              S I    N +S   A ++ L K  + L+D  + Q+ E   K  T  +   F+     
Sbjct: 787 GSSSDIDPCKNIVS---ATIDGLHKALQLLKDA-VQQIMEATRKDTTEFQDAIFSA---- 838

Query: 139 SFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVS 174
           ++    V+E + + +K+ V   AL TL   I +++S
Sbjct: 839 TYTADVVTEVEKHQAKQKVTLDALSTLVSAIEDLLS 874


>gi|156083358|ref|XP_001609163.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia bovis
           T2Bo]
 gi|154796413|gb|EDO05595.1| variant erythrocyte surface antigen-1, alpha subunit [Babesia
           bovis]
          Length = 1360

 Score = 43.9 bits (102), Expect = 0.052,   Method: Composition-based stats.
 Identities = 41/145 (28%), Positives = 70/145 (48%), Gaps = 11/145 (7%)

Query: 33  KRIEKLRLELEAENFEREEAKQLKETIEQ---ELKGYEVELALNNTAFQALESRISLIHN 89
           K +E+++  LEA   + EEAK+ +E +E    EL+G +  L     AF    S I    N
Sbjct: 738 KALEEIKKTLEAAKEKLEEAKKTQEALEAAKGELEGAKGALTTAKKAFPGSSSDIDPCKN 797

Query: 90  EISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQEDDSFGTAAVSEAD 149
            +S   A ++ L K  + L+D  + Q+ E   K  T  +   F+     ++    V+E +
Sbjct: 798 IVS---ATIDGLHKALQLLKDA-VQQIMEATRKDTTEFQDAIFSA----TYTADVVTEVE 849

Query: 150 HNFSKKGVPEVALKTLEDKIAEVVS 174
            + +K+ V   AL TL   I +++S
Sbjct: 850 KHQAKQKVTLDALSTLVSAIEDLLS 874


>gi|407833366|gb|EKF98741.1| hypothetical protein TCSYLVIO_010352 [Trypanosoma cruzi]
          Length = 902

 Score = 38.9 bits (89), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 45/80 (56%), Gaps = 6/80 (7%)

Query: 27  RVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQALESRISL 86
           RVV    R+EK R E+EA   ER+E ++ K+T+++ L+    + A     F A+ S ++ 
Sbjct: 259 RVVAQDLRVEKCRAEIEARTAERQEKEEAKKTLKEHLQALNEDPAKREVQFSAVRSELAK 318

Query: 87  IHNEISTVGAEVEALKKEQE 106
           +  E++      EAL++ +E
Sbjct: 319 VQQELA------EALRRRKE 332


>gi|260795120|ref|XP_002592554.1| hypothetical protein BRAFLDRAFT_118903 [Branchiostoma floridae]
 gi|229277775|gb|EEN48565.1| hypothetical protein BRAFLDRAFT_118903 [Branchiostoma floridae]
          Length = 3563

 Score = 38.5 bits (88), Expect = 2.1,   Method: Composition-based stats.
 Identities = 41/196 (20%), Positives = 90/196 (45%), Gaps = 8/196 (4%)

Query: 20   EKSQGERRVVGLKKRIEKLRLELEAEN--FEREEAKQLKETIEQELKGYEVELALNNTAF 77
            + S+ + ++    + I +LR+++E+ N      E K L    E+E    + E   + T+ 
Sbjct: 2154 QSSEKDGKIQEQMEEIRRLRMKIESLNDDMTERERKMLDLAREKEELQGQYEFYFSETS- 2212

Query: 78   QALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQED 137
              L+++++ I    + +G   EAL+ E ESLR          N +    H+ I   L+ +
Sbjct: 2213 -KLQTQVTEIQTVQTQMGGSTEALEDELESLRRRLKSDNEHANQEAFELHRRIT-ELELE 2270

Query: 138  DSFGTAAVSEADHNFSKKGVPEVALKTLEDKIAEVVSQTAREEELYQEEEKIQKQVQLEL 197
             +  T  +   + N S +G  EV  + L  ++      ++  E  Y E  K+ ++++ ++
Sbjct: 2271 RNTATEKLKHMEVNMS-QGSGEV--EKLRQQLRRAEQDSSNTELRYNEMHKVNERLRSDM 2327

Query: 198  IDLERKVSLMEMIAYE 213
             D+  +  + + ++ E
Sbjct: 2328 DDILHQRDIAQRLSKE 2343


>gi|71651425|ref|XP_814391.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70879358|gb|EAN92540.1| hypothetical protein Tc00.1047053511239.90 [Trypanosoma cruzi]
          Length = 869

 Score = 37.4 bits (85), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 6/80 (7%)

Query: 27  RVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQALESRISL 86
           RV+    R+EK R E+EA   ER+E ++ K+T+++ L+    + A     F A+ S ++ 
Sbjct: 226 RVLAQDLRVEKCRAEIEARTAERQEKEEAKKTLKEHLQALNEDPAKREVQFNAVRSELAK 285

Query: 87  IHNEISTVGAEVEALKKEQE 106
           +  E++      EAL++ +E
Sbjct: 286 VQQELA------EALRRRKE 299


>gi|345482259|ref|XP_001607886.2| PREDICTED: intersectin-1-like [Nasonia vitripennis]
          Length = 1728

 Score = 37.0 bits (84), Expect = 5.6,   Method: Composition-based stats.
 Identities = 39/205 (19%), Positives = 99/205 (48%), Gaps = 17/205 (8%)

Query: 18  ASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAF 77
           A E+ +  R+ +  ++++E  + + +    +R++ + +   ++ + +   +EL+  N   
Sbjct: 569 AQEQREAARKEMERQRQLEWEKQKSQELQAQRQKEQDILLKLKAKNQSLTIELSSLNEKV 628

Query: 78  QALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFHKSIAFNLQED 137
           + L  +I      +ST+   ++ ++    + RD  + +M  L +K+R  ++ +    QE 
Sbjct: 629 KELSQKICDTRVGVSTIKTTIDGMR----TTRDQQLQEMAALKNKLREQNQRLLTISQEK 684

Query: 138 DSFGTAAVSEADHNFSK-KGVPEVALKTLEDKIAEVVSQTAREEELY----QEEEKIQKQ 192
                   + AD    + + + ++ALK +EDKIAE+  +   +E+      ++ E++Q Q
Sbjct: 685 ARLDAKTKANADPAGQEAQKLKQIALKQMEDKIAEMEKEIKDKEKEVENSNKQLEELQNQ 744

Query: 193 VQLELIDL--------ERKVSLMEM 209
           +++   +         E+K  ++EM
Sbjct: 745 MEVMFTNCEELNAKFEEKKAKVLEM 769


>gi|197117352|ref|YP_002137779.1| chromosome segregation ATPase SMC [Geobacter bemidjiensis Bem]
 gi|197086712|gb|ACH37983.1| chromosome segregation ATPase SMC [Geobacter bemidjiensis Bem]
          Length = 1176

 Score = 37.0 bits (84), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 53/101 (52%), Gaps = 11/101 (10%)

Query: 21  KSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQAL 80
           + + E R + L+K +E L+  LE   FE EEA+++  +++  +   + +      A+Q +
Sbjct: 772 RGEAEERKLALEKNVEALQGRLEGSRFEMEEAREMVTSLKVRVAALKEKGESTQRAYQRV 831

Query: 81  E-------SRISLIHNEISTVGAE----VEALKKEQESLRD 110
           E       SRIS   NE+   GAE    ++A+   +E+LRD
Sbjct: 832 EALCADLASRISSRGNELEGSGAERTRLIDAIAAGEEALRD 872


>gi|71662315|ref|XP_818166.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70883401|gb|EAN96315.1| hypothetical protein Tc00.1047053506605.100 [Trypanosoma cruzi]
          Length = 869

 Score = 37.0 bits (84), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 6/80 (7%)

Query: 27  RVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVELALNNTAFQALESRISL 86
           RV+    R+EK R E+EA   ER+E ++ K+T+++ L+    + A     F A+ S ++ 
Sbjct: 226 RVLAQDLRVEKCRAEIEARTAERQEKEEAKKTLKEHLQALNEDPAKREVQFGAVRSELAK 285

Query: 87  IHNEISTVGAEVEALKKEQE 106
           +  E++      EAL++ +E
Sbjct: 286 VQQELA------EALRRRKE 299


>gi|92112829|ref|YP_572757.1| chromosome segregation protein SMC [Chromohalobacter salexigens DSM
            3043]
 gi|91795919|gb|ABE58058.1| Chromosome segregation protein SMC [Chromohalobacter salexigens DSM
            3043]
          Length = 1164

 Score = 37.0 bits (84), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 66/129 (51%), Gaps = 4/129 (3%)

Query: 5    DPKKHLLTLIRDFASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIEQELK 64
            D    L   +RD  +++ Q ER + GL+ R+E+ R++++A   + +      E +  ++ 
Sbjct: 873  DRAHQLAETLRDTEAKRQQHERNLDGLRGRLEEARMQVQALKLKADAEDAQLEELGHDVA 932

Query: 65   GYEVELALNNTAFQALESRISLIHNEISTVGA-EVEALKK--EQESLRDGFIVQMFELND 121
            G +  L  N T   A ++R+  I  ++  +GA  + A+++  +Q   RD    Q  EL++
Sbjct: 933  GLQEGLDPNATE-SAWQTRLDEIAEKVRRLGAINLAAIEEYDQQAERRDYLEAQHAELSE 991

Query: 122  KIRTFHKSI 130
             I T  K+I
Sbjct: 992  AIETLEKAI 1000


>gi|334705393|ref|ZP_08521259.1| HlyD family secretion protein [Aeromonas caviae Ae398]
          Length = 471

 Score = 36.6 bits (83), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 50/167 (29%), Positives = 81/167 (48%), Gaps = 34/167 (20%)

Query: 17  FASEKSQGERRVVGLKKRIEKLRLEL-------EAENFEREEAKQLKETIEQELKGYE-- 67
           F S+  + E+ +V LK  I +LR E+       E     RE+    K+ IE   +GYE  
Sbjct: 114 FKSDFREREQELVTLKGDIARLRAEIRSVVVKNEPNLGWREQVAIDKQAIEFP-EGYENV 172

Query: 68  -VELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTF 126
             E A    + Q  E+R++ + N++  +G ++E  +KEQE++         ELN KIRT 
Sbjct: 173 FAEYAARERSVQ--EARLNNLKNQLYILGQQIE--QKEQETI---------ELNAKIRTV 219

Query: 127 HKSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
            +S+    QE +              +K+G VP+V L  LE ++ E+
Sbjct: 220 GRSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 257


>gi|14591553|ref|NP_143635.1| chromosome assembly protein [Pyrococcus horikoshii OT3]
 gi|3258234|dbj|BAA30917.1| 1179aa long hypothetical chromosome assembly protein [Pyrococcus
           horikoshii OT3]
          Length = 1179

 Score = 36.6 bits (83), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 46/80 (57%), Gaps = 16/80 (20%)

Query: 31  LKKRIEKLRLELEAENFEREEAKQLKETIEQELKGYEVEL-ALNNTAFQALESRISLIHN 89
           LK+++E LR+              +KE++E E+    V+L AL N +F+ L  R+S +  
Sbjct: 666 LKEKVENLRI--------------MKESLEGEVNSLRVKLKALENQSFE-LRIRMSDVEK 710

Query: 90  EISTVGAEVEALKKEQESLR 109
           EIS +  ++E L KE+ESLR
Sbjct: 711 EISLISKDLEKLIKEEESLR 730


>gi|406676615|ref|ZP_11083801.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AMC35]
 gi|423209192|ref|ZP_17195746.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AER397]
 gi|404619037|gb|EKB15957.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AER397]
 gi|404626838|gb|EKB23648.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AMC35]
          Length = 477

 Score = 36.6 bits (83), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)

Query: 17  FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
           F S+  + E+ VV LK  + +LR E+ +   + + A   +E +       Q   GYE   
Sbjct: 120 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 179

Query: 69  -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
            E A    + Q    R++ + N++  +G ++E  +KEQE         + ELN KIRT +
Sbjct: 180 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 226

Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
           +S+    QE +              +K+G VP+V L  LE ++ E+
Sbjct: 227 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 263


>gi|423201993|ref|ZP_17188572.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AER39]
 gi|404615604|gb|EKB12566.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AER39]
          Length = 477

 Score = 36.2 bits (82), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)

Query: 17  FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
           F S+  + E+ VV LK  + +LR E+ +   + + A   +E +       Q   GYE   
Sbjct: 120 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 179

Query: 69  -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
            E A    + Q    R++ + N++  +G ++E  +KEQE         + ELN KIRT +
Sbjct: 180 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 226

Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
           +S+    QE +              +K+G VP+V L  LE ++ E+
Sbjct: 227 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 263


>gi|330830103|ref|YP_004393055.1| secretion protein, HlyD family [Aeromonas veronii B565]
 gi|328805239|gb|AEB50438.1| Secretion protein, HlyD family [Aeromonas veronii B565]
          Length = 472

 Score = 36.2 bits (82), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)

Query: 17  FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
           F S+  + E+ VV LK  + +LR E+ +   + + A   +E +       Q   GYE   
Sbjct: 115 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 174

Query: 69  -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
            E A    + Q    R++ + N++  +G ++E  +KEQE         + ELN KIRT +
Sbjct: 175 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 221

Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
           +S+    QE +              +K+G VP+V L  LE ++ E+
Sbjct: 222 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 258


>gi|423207159|ref|ZP_17193715.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AMC34]
 gi|404621108|gb|EKB18000.1| HlyD family type I secretion membrane fusion protein [Aeromonas
           veronii AMC34]
          Length = 477

 Score = 36.2 bits (82), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 78/166 (46%), Gaps = 32/166 (19%)

Query: 17  FASEKSQGERRVVGLKKRIEKLRLELEAENFEREEAKQLKETIE------QELKGYEV-- 68
           F S+  + E+ VV LK  + +LR E+ +   + + A   +E +       Q   GYE   
Sbjct: 120 FRSDFREREQEVVTLKGDVARLRAEMASVEVKNDPALGWREQVVVGEAPIQFPDGYETVF 179

Query: 69  -ELALNNTAFQALESRISLIHNEISTVGAEVEALKKEQESLRDGFIVQMFELNDKIRTFH 127
            E A    + Q    R++ + N++  +G ++E  +KEQE         + ELN KIRT +
Sbjct: 180 AEYAHRERSVQ--RDRLNNLKNQLYILGQQIE--QKEQE---------LIELNAKIRTVN 226

Query: 128 KSIAFNLQEDDSFGTAAVSEADHNFSKKG-VPEVALKTLEDKIAEV 172
           +S+    QE +              +K+G VP+V L  LE ++ E+
Sbjct: 227 RSVDLARQELN---------ISRPLAKEGIVPQVELIKLERQVNEM 263


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.311    0.129    0.334 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,960,280,207
Number of Sequences: 23463169
Number of extensions: 109526849
Number of successful extensions: 706904
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 574
Number of HSP's successfully gapped in prelim test: 10109
Number of HSP's that attempted gapping in prelim test: 667638
Number of HSP's gapped (non-prelim): 45078
length of query: 225
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 88
effective length of database: 9,144,741,214
effective search space: 804737226832
effective search space used: 804737226832
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 74 (33.1 bits)