BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025378
         (253 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224093738|ref|XP_002309969.1| predicted protein [Populus trichocarpa]
 gi|222852872|gb|EEE90419.1| predicted protein [Populus trichocarpa]
          Length = 267

 Score =  191 bits (484), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 130/258 (50%), Positives = 154/258 (59%), Gaps = 51/258 (19%)

Query: 8   VFTNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHI-SHNDYTPFKF---IP 63
           +F NSP     R T   ++T+L  +  NL L     SRH+   + SHN    F F    P
Sbjct: 14  LFNNSP----HRITLLFTSTSL--SLRNLTL-----SRHVNTSLHSHN----FHFKPQTP 58

Query: 64  NSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISE-------------------VLWS 104
            S F  +LK      T  + TQ +  FNLD  LS++E                   VL S
Sbjct: 59  KSSFNLTLKAYQSDPT--IPTQDSKQFNLDHFLSVAELLCIFSSSIITISYALNYTVLNS 116

Query: 105 RVLACGVVSLVCG-VW----------IGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEK 153
           +    GV+    G  W          IGA IRRR W RV  E  R   RES+NLVGRIEK
Sbjct: 117 KRGVLGVIGSNTGFAWGMVVMVSGVVIGAWIRRRMWWRVSRETGREGSRESLNLVGRIEK 176

Query: 154 LEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDV 213
           LEED++SSATI+RVLSRQLEKLG+RFRVTRKALK+PI + AALAQKNS+ATRALA+Q D+
Sbjct: 177 LEEDLRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSDATRALAVQEDI 236

Query: 214 LEKELGEIQKVLLAMQVS 231
           LEKELGEIQKVLLAMQVS
Sbjct: 237 LEKELGEIQKVLLAMQVS 254


>gi|224083616|ref|XP_002307074.1| predicted protein [Populus trichocarpa]
 gi|222856523|gb|EEE94070.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/250 (47%), Positives = 150/250 (60%), Gaps = 44/250 (17%)

Query: 7   HVFTNSPPSCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPNSK 66
           H+FTNSPP  +  F+    +   +             SRH+   +  +++      P + 
Sbjct: 9   HLFTNSPPRITLLFSSSSLSLRNLS-----------LSRHVTTSLHSSNFHFKPQTPRNS 57

Query: 67  FYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVL-----------------WSRVLAC 109
           F  +LK      T  + TQ ++ FNLD  LSI+E+L                 +S+  A 
Sbjct: 58  FSFTLKAYQSDPT--IRTQVSNQFNLDQFLSIAELLCIISSSIITISYALNCTFSKTGAL 115

Query: 110 GVVSLVCG-VW----------IGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDM 158
           GV+    G  W          IGA IRRRQW R+C E  R EG  S+NLVGRIEKLE+DM
Sbjct: 116 GVIGSNTGFAWGMVVMVSGVVIGAWIRRRQWWRICRETGR-EG--SLNLVGRIEKLEQDM 172

Query: 159 KSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKEL 218
           +SSATI+RVLSRQLEKLG+RFRVTRKALK+PI + AALAQKNSEATRALA+Q ++LEKEL
Sbjct: 173 RSSATIIRVLSRQLEKLGIRFRVTRKALKEPIVETAALAQKNSEATRALALQENILEKEL 232

Query: 219 GEIQKVLLAM 228
           GE QK+LLAM
Sbjct: 233 GETQKILLAM 242


>gi|356512685|ref|XP_003525047.1| PREDICTED: uncharacterized protein LOC100790782 [Glycine max]
          Length = 293

 Score =  174 bits (441), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 91/128 (71%), Positives = 103/128 (80%), Gaps = 2/128 (1%)

Query: 102 LWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSS 161
           + +R    G   LV GV +GA IRRRQWRR C E  +  G   VNL+ RIEKLEEDM+SS
Sbjct: 118 IGTRAAPFGGALLVVGVLVGAWIRRRQWRRACVETGK--GGLEVNLLERIEKLEEDMRSS 175

Query: 162 ATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEI 221
           AT++RVLSRQLEKLGVRFRVTRKALKDPI + AALAQKNSEA RALA+Q D+LEKELGEI
Sbjct: 176 ATVVRVLSRQLEKLGVRFRVTRKALKDPIAETAALAQKNSEAARALAVQSDILEKELGEI 235

Query: 222 QKVLLAMQ 229
           Q+VLLAMQ
Sbjct: 236 QQVLLAMQ 243


>gi|18424929|ref|NP_569009.1| uncharacterized protein [Arabidopsis thaliana]
 gi|10178187|dbj|BAB11661.1| unnamed protein product [Arabidopsis thaliana]
 gi|15028313|gb|AAK76633.1| unknown protein [Arabidopsis thaliana]
 gi|19310695|gb|AAL85078.1| unknown protein [Arabidopsis thaliana]
 gi|332010646|gb|AED98029.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 300

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 106/247 (42%), Positives = 145/247 (58%), Gaps = 32/247 (12%)

Query: 8   VFTNSPP---SCSSRFTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYTPFKFIPN 64
           +F+ S P   S  S   F  SA    +  ++++ P PI  RH+ I    N      +  +
Sbjct: 11  LFSGSTPVRISRRSNLCFAVSAARASRR-DSISFPQPIFLRHV-ISTQSNFEIDRNYRAH 68

Query: 65  SKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVL---------------------- 102
           +    S + ++ SN  ++   ++D F+L S +S +E L                      
Sbjct: 69  ASSIGSFEDSSSSN--LLEDASSDGFDLGSFVSFAEALCILSSAVISVVLAVNYVVVGEI 126

Query: 103 WSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSA 162
             +VL+ G V LV  V  G+ +RRRQW R+C     +EG    NL+ R+EKLE+D+KSS 
Sbjct: 127 GKKVLSLGFVGLVGSVATGSWLRRRQWMRICKGARESEG---TNLIRRLEKLEKDLKSST 183

Query: 163 TILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQ 222
           +I+RVLSR LEKLG+RFRVTRKALK+PI++ AALAQKNSEATR L  Q ++LEKELGEIQ
Sbjct: 184 SIVRVLSRHLEKLGIRFRVTRKALKEPISETAALAQKNSEATRVLVAQQEILEKELGEIQ 243

Query: 223 KVLLAMQ 229
           KVLLAMQ
Sbjct: 244 KVLLAMQ 250


>gi|356525459|ref|XP_003531342.1| PREDICTED: uncharacterized protein LOC100809936 [Glycine max]
          Length = 287

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 89/128 (69%), Positives = 102/128 (79%), Gaps = 2/128 (1%)

Query: 102 LWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSS 161
           + +R    G   LV GV +GA IRRRQWRRV  E  +  G   VNL+ RIEKLEED++SS
Sbjct: 112 IGARAAPFGGALLVVGVLVGAWIRRRQWRRVSVEAGK--GGLEVNLLERIEKLEEDLRSS 169

Query: 162 ATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEI 221
           AT++RVLSRQLEKLGVRFRVTRK LKDPI + AALAQKNSEA RALA+Q D+LEKELGEI
Sbjct: 170 ATVVRVLSRQLEKLGVRFRVTRKGLKDPIAETAALAQKNSEAARALAVQSDILEKELGEI 229

Query: 222 QKVLLAMQ 229
           Q+VLLAMQ
Sbjct: 230 QQVLLAMQ 237


>gi|297797551|ref|XP_002866660.1| hypothetical protein ARALYDRAFT_496756 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312495|gb|EFH42919.1| hypothetical protein ARALYDRAFT_496756 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score =  169 bits (428), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 109/254 (42%), Positives = 149/254 (58%), Gaps = 32/254 (12%)

Query: 1   MSQSSQH-VFTNSPPSCSSR--FTFYHSATNLIKNPNNLALPPPITSRHLLIHISHNDYT 57
           +S  SQ+ +F+ S P   SR    F  SA    +  ++++ P PI  RH+ I    N   
Sbjct: 3   LSLPSQYALFSGSTPVRISRSNLCFAVSAARTSRR-DSISFPQPILLRHV-ISTRSNFEI 60

Query: 58  PFKFIPNSKFYHSLKPNNCSNTSVVTTQTNDNFNLDSLLSISEVL--------------- 102
              +  ++    S + ++ SN   +    +D F+L S +S +E L               
Sbjct: 61  DRDYRAHASSIGSYEDSSSSNE--LEDANSDGFDLGSFVSFAEALCILSSAVISVVLAVN 118

Query: 103 -------WSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLE 155
                    +VL+ G V LV  V  G+ +RRRQW R+C     +EG    NL+ R+EKLE
Sbjct: 119 YVVVGEIGKKVLSLGFVGLVGSVATGSWLRRRQWMRICKGARESEG---TNLIRRLEKLE 175

Query: 156 EDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLE 215
           +D+KSS +I+RVLSR LEKLG+RFRVTRKALK+PI++ AALAQKNSEATR LA Q ++LE
Sbjct: 176 KDLKSSTSIVRVLSRHLEKLGIRFRVTRKALKEPISETAALAQKNSEATRVLAAQQEILE 235

Query: 216 KELGEIQKVLLAMQ 229
           KELGEIQKVLLA+Q
Sbjct: 236 KELGEIQKVLLALQ 249


>gi|449451421|ref|XP_004143460.1| PREDICTED: uncharacterized protein LOC101207421 [Cucumis sativus]
          Length = 323

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 110/237 (46%), Positives = 137/237 (57%), Gaps = 38/237 (16%)

Query: 26  ATNLIKNPNNLALPPPITSRHLLIHISHNDY---TPFKFIPNSKFYHSLKPN-NCSNTSV 81
           +T L +NP  ++LP P +     +H    DY   +PF F   S   H   P  + S    
Sbjct: 41  STFLSRNPC-VSLPFPPSRFPNTLHFQILDYKFRSPFNF--GSINAHHFCPRVSTSGGVG 97

Query: 82  VTTQTNDNFNLDSLLSISE-----------------------------VLWSRVLACGVV 112
                  +F++DSLLS +E                             V    VL   ++
Sbjct: 98  RRPGGVADFDIDSLLSATEFFCLVASLIGSVGFALNCAKTRSKSLFLAVFGDGVLVGTIL 157

Query: 113 SLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQL 172
            LV GV IGA IRRRQW RV  E A+  G   VNL+ +  KLEED++SSAT++RVLSRQL
Sbjct: 158 FLVAGVAIGAWIRRRQWNRVFRETAK--GVLEVNLMEKTNKLEEDLRSSATLIRVLSRQL 215

Query: 173 EKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
           EKLG+RFRVTRKALK P+ + AALAQK SEATRALA++GD+LEKEL EIQKVLLAMQ
Sbjct: 216 EKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ 272


>gi|449520022|ref|XP_004167033.1| PREDICTED: uncharacterized LOC101207421, partial [Cucumis sativus]
          Length = 246

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 93/170 (54%), Positives = 112/170 (65%), Gaps = 31/170 (18%)

Query: 89  NFNLDSLLSISE-----------------------------VLWSRVLACGVVSLVCGVW 119
           +F++DSLLS +E                             V    VL   ++ LV GV 
Sbjct: 28  DFDIDSLLSATEFFCLVASLIGSVGFALNCAKTRSKSLFLAVFGDGVLVGTILFLVAGVA 87

Query: 120 IGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRF 179
           IGA IRRRQW RV  E A+  G   VNL+ +  KLEED++SSAT++RVLSRQLEKLG+RF
Sbjct: 88  IGAWIRRRQWNRVFRETAK--GVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRF 145

Query: 180 RVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
           RVTRKALK P+ + AALAQK SEATRALA++GD+LEKEL EIQKVLLAMQ
Sbjct: 146 RVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ 195


>gi|359496689|ref|XP_002269557.2| PREDICTED: uncharacterized protein LOC100244969 [Vitis vinifera]
          Length = 193

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 88/129 (68%), Positives = 107/129 (82%), Gaps = 1/129 (0%)

Query: 101 VLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKS 160
           VL +R+L    V+LV GV +G+ IRRRQW R+  + A+  G ESVNLV R+EK+EED++S
Sbjct: 3   VLGNRILLWQAVALVGGVVVGSWIRRRQWWRIFNDTAKP-GIESVNLVERMEKMEEDIRS 61

Query: 161 SATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGE 220
            AT++RV+SRQLEKLG+RFRVTRKALK PI + A LAQKNSEATRALA+Q D+LEKELGE
Sbjct: 62  MATLIRVMSRQLEKLGIRFRVTRKALKQPIAETAVLAQKNSEATRALAIQEDILEKELGE 121

Query: 221 IQKVLLAMQ 229
           IQKVLLAMQ
Sbjct: 122 IQKVLLAMQ 130


>gi|116794014|gb|ABK26972.1| unknown [Picea sitchensis]
          Length = 379

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/122 (60%), Positives = 90/122 (73%), Gaps = 5/122 (4%)

Query: 111 VVSLVCGVWIGAIIRRRQWRRVCG----EKARAEGRESVNLVGRIEKLEEDMKSSATILR 166
           ++ L+  + I AIIR RQW R+      + A +EG E  +L+ RI+KLEED+ SS TI+R
Sbjct: 137 ILPLLGALIISAIIRTRQWERLTAVGPKDHANSEGSE-FSLIHRIKKLEEDVGSSVTIIR 195

Query: 167 VLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLL 226
           VLSRQLEKLGVRFRVTR+ L+DPI + AALAQK SEA R L +Q DVLEKEL  IQ VLL
Sbjct: 196 VLSRQLEKLGVRFRVTRQTLRDPINETAALAQKTSEAIRVLELQEDVLEKELRGIQHVLL 255

Query: 227 AM 228
           AM
Sbjct: 256 AM 257


>gi|326504278|dbj|BAJ90971.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 264

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 69/120 (57%), Positives = 87/120 (72%), Gaps = 2/120 (1%)

Query: 112 VSLVCGVWIGAIIRRRQWRRVC--GEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLS 169
           V LV  V IG ++RR+QW R+C  G  A A     V+LVGR+EK+EE ++     + VLS
Sbjct: 117 VLLVGAVAIGVLVRRKQWERLCRVGAGAGASDTGGVDLVGRVEKVEESVRGVLAAVVVLS 176

Query: 170 RQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
           R +EKLGVRFRV R+ L+DPI++ AALA+KNSEATR LA Q D+LEKE+  IQKVL AMQ
Sbjct: 177 RTVEKLGVRFRVLRRTLRDPISETAALAEKNSEATRILAAQEDLLEKEISSIQKVLYAMQ 236


>gi|449458383|ref|XP_004146927.1| PREDICTED: uncharacterized protein LOC101211190 [Cucumis sativus]
 gi|449513157|ref|XP_004164247.1| PREDICTED: uncharacterized LOC101211190 [Cucumis sativus]
          Length = 303

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 71/128 (55%), Positives = 90/128 (70%), Gaps = 2/128 (1%)

Query: 101 VLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKS 160
           ++  R L  G   LV  V + + IRR QWRR+   K  ++G + VNL+ R EKLEED+KS
Sbjct: 126 LMGDRGLVWGFPLLVATVVLNSWIRRLQWRRISWGKT-SDGLK-VNLLDRFEKLEEDLKS 183

Query: 161 SATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGE 220
              ++R LSR+LEKLG+R+ VTRK LKDPI + A LAQ NSE TR LA+Q D+LEKE  E
Sbjct: 184 LMIVIRGLSRKLEKLGIRYMVTRKTLKDPIAETAVLAQINSEDTRTLAVQEDILEKEFLE 243

Query: 221 IQKVLLAM 228
           +QKVLLAM
Sbjct: 244 MQKVLLAM 251


>gi|296085648|emb|CBI29447.3| unnamed protein product [Vitis vinifera]
          Length = 139

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 60/76 (78%), Positives = 70/76 (92%)

Query: 154 LEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDV 213
           +EED++S AT++RV+SRQLEKLG+RFRVTRKALK PI + A LAQKNSEATRALA+Q D+
Sbjct: 1   MEEDIRSMATLIRVMSRQLEKLGIRFRVTRKALKQPIAETAVLAQKNSEATRALAIQEDI 60

Query: 214 LEKELGEIQKVLLAMQ 229
           LEKELGEIQKVLLAMQ
Sbjct: 61  LEKELGEIQKVLLAMQ 76


>gi|218201335|gb|EEC83762.1| hypothetical protein OsI_29654 [Oryza sativa Indica Group]
          Length = 269

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 72/135 (53%), Positives = 89/135 (65%), Gaps = 3/135 (2%)

Query: 99  SEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVC---GEKARAEGRESVNLVGRIEKLE 155
           S ++  R+     V LV  V IG++IRRRQW R+C   G          V+  GRI ++E
Sbjct: 106 SGLVGGRMFVVQYVLLVGAVAIGSLIRRRQWGRLCQVGGGGGGGAAARGVDFAGRIGEVE 165

Query: 156 EDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLE 215
           E ++     + VLSR +EKLGVRFRV R+ L+DPI + A LAQKNSEATR LA Q D+LE
Sbjct: 166 ESVRGVVAAVGVLSRTVEKLGVRFRVLRRTLRDPINETATLAQKNSEATRVLAAQEDLLE 225

Query: 216 KELGEIQKVLLAMQV 230
           KELGEIQKVL AMQV
Sbjct: 226 KELGEIQKVLYAMQV 240


>gi|42409401|dbj|BAD10714.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 296

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 70/133 (52%), Positives = 87/133 (65%), Gaps = 3/133 (2%)

Query: 99  SEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVC---GEKARAEGRESVNLVGRIEKLE 155
           S ++  R+     V LV  V IG++IRRRQW R+C   G          V+  GRI ++E
Sbjct: 106 SGLVGGRMFVVQYVLLVGAVAIGSLIRRRQWGRLCQVGGGGGGGAAARGVDFAGRIGEVE 165

Query: 156 EDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLE 215
           E ++     + VLSR +EKLGVRFRV R+ L+DPI + A LAQKNSEATR LA Q D+LE
Sbjct: 166 ESVRGVVAAVGVLSRTVEKLGVRFRVLRRTLRDPINETATLAQKNSEATRVLAAQEDLLE 225

Query: 216 KELGEIQKVLLAM 228
           KELGEIQKVL AM
Sbjct: 226 KELGEIQKVLYAM 238


>gi|242082429|ref|XP_002445983.1| hypothetical protein SORBIDRAFT_07g029060 [Sorghum bicolor]
 gi|241942333|gb|EES15478.1| hypothetical protein SORBIDRAFT_07g029060 [Sorghum bicolor]
          Length = 285

 Score =  119 bits (299), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 94/168 (55%), Gaps = 41/168 (24%)

Query: 90  FNLDSLLSISEVLWSRVLACGVVSLVCG----------------------------VWIG 121
           F  DSLLS +E+L   +    + S+VC                             V IG
Sbjct: 69  FGPDSLLSAAELLC--LAPPAICSVVCAARLVFLPASASAGSPPLLVLQYVLLVGAVAIG 126

Query: 122 AIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRV 181
           ++IRRRQ            GR    L GR+EK+EE ++     + VLSR +EKLG+RFRV
Sbjct: 127 SLIRRRQ-----------SGRLRPGLTGRVEKVEEAVRGMVAAVAVLSRTVEKLGLRFRV 175

Query: 182 TRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
            R+ L+DPIT+ AALAQKNSEATR L+ Q D+LEKE+G IQKVL AMQ
Sbjct: 176 LRRTLRDPITETAALAQKNSEATRMLSAQEDLLEKEIGAIQKVLYAMQ 223


>gi|357141684|ref|XP_003572312.1| PREDICTED: uncharacterized protein LOC100841857 [Brachypodium
           distachyon]
          Length = 290

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/115 (55%), Positives = 84/115 (73%), Gaps = 3/115 (2%)

Query: 118 VWIGAIIRRRQWRRVCGEKARAEGRES---VNLVGRIEKLEEDMKSSATILRVLSRQLEK 174
           V IGA++RRRQW R+     R  G  +   V+LVGR+EK+EE ++     + VLSR +EK
Sbjct: 116 VAIGALVRRRQWARLTRVGVRGVGASAPCGVDLVGRVEKVEESVRGVVAAVGVLSRTVEK 175

Query: 175 LGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
           LGVRFRV R+ L+DPI++ AALAQKNSEATR LA++  + EK++G IQKVL AMQ
Sbjct: 176 LGVRFRVLRRTLRDPISETAALAQKNSEATRILAVRESLFEKDIGSIQKVLYAMQ 230


>gi|115476978|ref|NP_001062085.1| Os08g0484800 [Oryza sativa Japonica Group]
 gi|113624054|dbj|BAF23999.1| Os08g0484800 [Oryza sativa Japonica Group]
          Length = 352

 Score =  113 bits (283), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 71/144 (49%), Positives = 87/144 (60%), Gaps = 14/144 (9%)

Query: 99  SEVLWSRVLACGVVSLVCGVWIGAIIRRRQWRRVC---GEKARAEGRESVNLVGRIEKLE 155
           S ++  R+     V LV  V IG++IRRRQW R+C   G          V+  GRI ++E
Sbjct: 151 SGLVGGRMFVVQYVLLVGAVAIGSLIRRRQWGRLCQVGGGGGGGAAARGVDFAGRIGEVE 210

Query: 156 EDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPIT-----------QAAALAQKNSEAT 204
           E ++     + VLSR +EKLGVRFRV R+ L+DPI            Q A LAQKNSEAT
Sbjct: 211 ESVRGVVAAVGVLSRTVEKLGVRFRVLRRTLRDPINENYLVSLSTEKQTATLAQKNSEAT 270

Query: 205 RALAMQGDVLEKELGEIQKVLLAM 228
           R LA Q D+LEKELGEIQKVL AM
Sbjct: 271 RVLAAQEDLLEKELGEIQKVLYAM 294


>gi|168011248|ref|XP_001758315.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690350|gb|EDQ76717.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/125 (40%), Positives = 82/125 (65%), Gaps = 2/125 (1%)

Query: 107 LACGVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVN--LVGRIEKLEEDMKSSATI 164
           L+  +++++  +++ AI++ R   R+       E  E  +  L  R+ +LEE+++   T 
Sbjct: 104 LSLPLLAVLGAIFLNAIVQYRVVERLTHLPQLKESTEKGDNTLKHRVVELEENVREFVTT 163

Query: 165 LRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKV 224
            R+LSRQ+EKLGVRFR T++ L+DP+ + AA+++K S+ T ALA + D LE+EL E+Q  
Sbjct: 164 ARLLSRQVEKLGVRFRATKRTLRDPMQETAAISEKTSQVTIALAQREDRLEQELRELQHA 223

Query: 225 LLAMQ 229
           LLAMQ
Sbjct: 224 LLAMQ 228


>gi|357463301|ref|XP_003601932.1| hypothetical protein MTR_3g086990 [Medicago truncatula]
 gi|355490980|gb|AES72183.1| hypothetical protein MTR_3g086990 [Medicago truncatula]
          Length = 216

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 67/139 (48%), Gaps = 36/139 (25%)

Query: 87  NDNFNLDSLLSISEVLWSRVLACGVVSLVCG------------------VWIG------- 121
           + N N DSLLS  E+  S +++  +VSLV                    VW         
Sbjct: 69  SSNLNFDSLLSAMEL--SCLISSAIVSLVLVGNGSKNWLLNVSGNRISVVWGVLTLVTGV 126

Query: 122 ---AIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVR 178
              A IR RQWRR+  E  +        LV RIEKLE++++S   ++R+LSR  EKLG R
Sbjct: 127 VVGAWIRTRQWRRISKENMKG------GLVERIEKLEKELRSHVRVIRILSRHAEKLGKR 180

Query: 179 FRVTRKALKDPITQAAALA 197
            R +  ALKD ITQ   L 
Sbjct: 181 IRGSCYALKDHITQVTVLV 199


>gi|255584267|ref|XP_002532870.1| hypothetical protein RCOM_0038200 [Ricinus communis]
 gi|223527355|gb|EEF29499.1| hypothetical protein RCOM_0038200 [Ricinus communis]
          Length = 116

 Score = 66.6 bits (161), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 33/35 (94%), Positives = 34/35 (97%)

Query: 194 AALAQKNSEATRALAMQGDVLEKELGEIQKVLLAM 228
           AALAQKNSEATRALAMQ D+LEKELGEIQKVLLAM
Sbjct: 17  AALAQKNSEATRALAMQEDILEKELGEIQKVLLAM 51


>gi|302791044|ref|XP_002977289.1| hypothetical protein SELMODRAFT_417213 [Selaginella moellendorffii]
 gi|300155265|gb|EFJ21898.1| hypothetical protein SELMODRAFT_417213 [Selaginella moellendorffii]
          Length = 313

 Score = 63.9 bits (154), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 60/90 (66%), Gaps = 2/90 (2%)

Query: 142 RESVNLV--GRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQK 199
           +ES NL    R+  LE D   S T++R L+RQLEKLGVR +VT++ L+DPI Q A LA  
Sbjct: 168 KESSNLALAQRVASLEHDFLGSVTLIRALTRQLEKLGVRVQVTKRTLQDPIHQTAKLAAD 227

Query: 200 NSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
            + A +AL+ +  +LE+EL E +  L+A+Q
Sbjct: 228 MASAVKALSHRETLLEEELKETRSALMAIQ 257


>gi|302821022|ref|XP_002992176.1| hypothetical protein SELMODRAFT_430397 [Selaginella moellendorffii]
 gi|300140102|gb|EFJ06831.1| hypothetical protein SELMODRAFT_430397 [Selaginella moellendorffii]
          Length = 313

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 60/90 (66%), Gaps = 2/90 (2%)

Query: 142 RESVNLV--GRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQK 199
           +ES NL    R+  LE D   S T++R L+RQLEKLGVR +VT++ L+DPI Q A LA  
Sbjct: 168 KESSNLALAQRVASLEHDFLGSITLIRALTRQLEKLGVRVQVTKRTLQDPIHQTAKLAAD 227

Query: 200 NSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
            + A +AL+ +  +LE+EL E +  L+A+Q
Sbjct: 228 MASAVKALSHRETLLEEELKETRSALMAIQ 257


>gi|388522165|gb|AFK49144.1| unknown [Medicago truncatula]
          Length = 186

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 34/58 (58%), Positives = 44/58 (75%), Gaps = 6/58 (10%)

Query: 121 GAIIRRRQWRRVC--GEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLG 176
           GA+IRRR+WR     G  + +E    V+ + RIEKLEED+KS+AT++RVLSRQLEKLG
Sbjct: 132 GALIRRRKWRETVLNGVVSVSE----VDFLQRIEKLEEDLKSNATVVRVLSRQLEKLG 185


>gi|357463303|ref|XP_003601933.1| hypothetical protein MTR_3g087000 [Medicago truncatula]
 gi|355490981|gb|AES72184.1| hypothetical protein MTR_3g087000 [Medicago truncatula]
          Length = 101

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/37 (83%), Positives = 34/37 (91%)

Query: 193 AAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
           +AA AQKNSEATRALAMQ ++LEKEL EIQKVLLAMQ
Sbjct: 32  SAAPAQKNSEATRALAMQSEILEKELREIQKVLLAMQ 68


>gi|145344155|ref|XP_001416603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576829|gb|ABO94896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 260

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 65/131 (49%), Gaps = 1/131 (0%)

Query: 110 GVVSLVCGVWIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLS 169
           G + L   V  G I+RR    R    + R +G  S ++  R+  LE+   S   + R  +
Sbjct: 54  GDLPLYVAVVAGTILRRLSRAREESTEGRLQGVPS-SVKARLAFLEQANDSVMEVSRRTA 112

Query: 170 RQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
           R + +LG R R+TR+ L  P+ +  A +++N++   A+A + ++LE EL E Q  +  + 
Sbjct: 113 RDVSRLGTRVRLTRRELSPPLRKVQAESKENAQILAAVAQRIELLEGELAEGQSTMSGLH 172

Query: 230 VSPITWFYHLS 240
                 F  LS
Sbjct: 173 TVSSKQFDVLS 183


>gi|115384434|ref|XP_001208764.1| chromosome segregation protein sudA [Aspergillus terreus NIH2624]
 gi|114196456|gb|EAU38156.1| chromosome segregation protein sudA [Aspergillus terreus NIH2624]
          Length = 1199

 Score = 45.4 bits (106), Expect = 0.020,   Method: Composition-based stats.
 Identities = 31/119 (26%), Positives = 60/119 (50%), Gaps = 19/119 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL------------EEDMKSSATILR 166
           RRQ++ +   ++  E R+SV       NL  R+++L            + ++K +   ++
Sbjct: 798 RRQFQELSNNRSELEARKSVLEVELRENLHPRLDQLVSQDIDMADDDNQGNLKETQREMK 857

Query: 167 VLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
            LS+ L+KL  R +   ++++   TQAA LAQ+N+E  R +      +EK    ++K +
Sbjct: 858 RLSKALDKLSQRLQQVDESIEQANTQAAELAQRNAETRREMEELAKSIEKHQRRMEKSM 916


>gi|169774329|ref|XP_001821632.1| chromosome segregation protein sudA [Aspergillus oryzae RIB40]
 gi|83769495|dbj|BAE59630.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1199

 Score = 44.3 bits (103), Expect = 0.049,   Method: Composition-based stats.
 Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 19/119 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL------------EEDMKSSATILR 166
           RRQ++ +  +++  E R+SV       NL  R+++L            + ++K +   ++
Sbjct: 798 RRQYQELSSQRSELEARKSVLEVELRENLNPRLDQLVGQDIDMADDGSQGNLKETQREMK 857

Query: 167 VLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
            LS+ LEKLG R +   ++++    + A L Q+N+E  R L      +EK    ++K +
Sbjct: 858 RLSKALEKLGQRLQQVDESIEQANARMAELGQRNAETRRELDELAKSIEKHQRRMEKSM 916


>gi|391867772|gb|EIT77012.1| structural maintenance of chromosome protein [Aspergillus oryzae
           3.042]
          Length = 1199

 Score = 44.3 bits (103), Expect = 0.050,   Method: Composition-based stats.
 Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 19/119 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL------------EEDMKSSATILR 166
           RRQ++ +  +++  E R+SV       NL  R+++L            + ++K +   ++
Sbjct: 798 RRQYQELSSQRSELEARKSVLEVELRENLNPRLDQLVGQDIDMADDGSQGNLKETQREMK 857

Query: 167 VLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
            LS+ LEKLG R +   ++++    + A L Q+N+E  R L      +EK    ++K +
Sbjct: 858 RLSKALEKLGQRLQQVDESIEQANARMAELGQRNAETRRELDELAKSIEKHQRRMEKSM 916


>gi|238497153|ref|XP_002379812.1| chromosome segregation protein SudA, putative [Aspergillus flavus
           NRRL3357]
 gi|220694692|gb|EED51036.1| chromosome segregation protein SudA, putative [Aspergillus flavus
           NRRL3357]
          Length = 1199

 Score = 44.3 bits (103), Expect = 0.050,   Method: Composition-based stats.
 Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 19/119 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL------------EEDMKSSATILR 166
           RRQ++ +  +++  E R+SV       NL  R+++L            + ++K +   ++
Sbjct: 798 RRQYQELSSQRSELEARKSVLEVELRENLNPRLDQLVGQDIDMADDGSQGNLKETQREMK 857

Query: 167 VLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
            LS+ LEKLG R +   ++++    + A L Q+N+E  R L      +EK    ++K +
Sbjct: 858 RLSKALEKLGQRLQQVDESIEQANARMAELGQRNAETRRELDELAKSIEKHQRRMEKSM 916


>gi|134056537|emb|CAK47661.1| unnamed protein product [Aspergillus niger]
          Length = 1219

 Score = 42.4 bits (98), Expect = 0.21,   Method: Composition-based stats.
 Identities = 28/118 (23%), Positives = 57/118 (48%), Gaps = 18/118 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL-----------EEDMKSSATILRV 167
           RRQ++ +  +++  EGR+S+       NL  R+++L           + ++K +   ++ 
Sbjct: 819 RRQYQELSSQRSELEGRKSILEVELRENLNPRLDQLVGQDMDLADDSQGNLKETQREMKR 878

Query: 168 LSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
           L + LEKLG R +   +++     +   L Q+N+E  R L      +EK    ++K +
Sbjct: 879 LHKALEKLGQRLQQVDESIDQANARVGDLQQRNAETRRELEELAKSIEKHQRRMEKSM 936


>gi|350634524|gb|EHA22886.1| hypothetical protein ASPNIDRAFT_46980 [Aspergillus niger ATCC 1015]
          Length = 1198

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 28/118 (23%), Positives = 57/118 (48%), Gaps = 18/118 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL-----------EEDMKSSATILRV 167
           RRQ++ +  +++  EGR+S+       NL  R+++L           + ++K +   ++ 
Sbjct: 798 RRQYQELSSQRSELEGRKSILEVELRENLNPRLDQLVGQDMDLADDSQGNLKETQREMKR 857

Query: 168 LSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
           L + LEKLG R +   +++     +   L Q+N+E  R L      +EK    ++K +
Sbjct: 858 LHKALEKLGQRLQQVDESIDQANARVGDLQQRNAETRRELEELAKSIEKHQRRMEKSM 915


>gi|317026832|ref|XP_001399622.2| chromosome segregation protein sudA [Aspergillus niger CBS 513.88]
          Length = 1198

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 28/118 (23%), Positives = 57/118 (48%), Gaps = 18/118 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL-----------EEDMKSSATILRV 167
           RRQ++ +  +++  EGR+S+       NL  R+++L           + ++K +   ++ 
Sbjct: 798 RRQYQELSSQRSELEGRKSILEVELRENLNPRLDQLVGQDMDLADDSQGNLKETQREMKR 857

Query: 168 LSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
           L + LEKLG R +   +++     +   L Q+N+E  R L      +EK    ++K +
Sbjct: 858 LHKALEKLGQRLQQVDESIDQANARVGDLQQRNAETRRELEELAKSIEKHQRRMEKSM 915


>gi|358365605|dbj|GAA82227.1| chromosome segregation protein SudA [Aspergillus kawachii IFO 4308]
          Length = 1199

 Score = 42.0 bits (97), Expect = 0.26,   Method: Composition-based stats.
 Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 19/119 (15%)

Query: 126 RRQWRRVCGEKARAEGRESV-------NLVGRIEKL------------EEDMKSSATILR 166
           RRQ++ +  +++  EGR+S+       NL  R+++L            + ++K +   ++
Sbjct: 798 RRQYQELSSQRSELEGRKSILEVELRENLNPRLDQLVGQDMDLADDDSQGNLKETQREMK 857

Query: 167 VLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVL 225
            L + LEKLG R +   +++     +   L Q+N+E  R L      +EK    ++K +
Sbjct: 858 RLHKALEKLGQRLQQVDESIDQANARVGDLQQRNAETRRELEELAKSIEKHQRRMEKSM 916


>gi|335433681|ref|ZP_08558499.1| DNA topoisomerase VI subunit B [Halorhabdus tiamatea SARL4B]
 gi|334898484|gb|EGM36590.1| DNA topoisomerase VI subunit B [Halorhabdus tiamatea SARL4B]
          Length = 1377

 Score = 40.0 bits (92), Expect = 0.91,   Method: Composition-based stats.
 Identities = 31/118 (26%), Positives = 56/118 (47%), Gaps = 10/118 (8%)

Query: 117  GVWIGAIIRRRQ----WRRVCGEKARAEGRESV--NLVGRIEKLEEDMKSSATILRVLSR 170
            G   G+ +R +     W ++ G+   A+ RE++  +L G +++   D K  ATI  +  R
Sbjct: 929  GTTFGSTVREKATAAAWAQIIGDTDEADQRETLASDLYGLVDRATSDRKDDATIEGLADR 988

Query: 171  QLEKL----GVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKV 224
               K       R R TR+ L + + +AA   ++  +AT     + +VLE+  G +  V
Sbjct: 989  IAAKFEDSDDERHRATREELAEYVERAAENTEEYDDATIGETARENVLEEIWGVMATV 1046


>gi|431906276|gb|ELK10473.1| Rootletin [Pteropus alecto]
          Length = 1976

 Score = 40.0 bits (92), Expect = 0.92,   Method: Composition-based stats.
 Identities = 25/95 (26%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 128  QWRRVCGEKARAEGR-ESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKAL 186
            Q R   GE+A  EGR E++ L  ++ K E  +++    L+   R+L++L   FR   + L
Sbjct: 1277 QGRLALGERAEKEGRREALGLRQKLLKGEASLEAMRQELQGAQRKLQELDGEFRTRERGL 1336

Query: 187  KDPITQAAALAQKNSEATRALAMQGDVLEKELGEI 221
            +  + +A +  ++  + TR+L ++ + +  E  E+
Sbjct: 1337 QGSLEEARSAEKQQLDHTRSLELKLEAVRAEAAEL 1371


>gi|307173524|gb|EFN64434.1| Myosin-IXa [Camponotus floridanus]
          Length = 2287

 Score = 39.3 bits (90), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 51/115 (44%), Gaps = 26/115 (22%)

Query: 3    QSSQHVFTNSPPSCSSRFTFYHSAT-NLIKNPNNLALPPPITSRHLLIHISHNDY---TP 58
            QS+  VF  SPP  SSR T YH+   NL  NPN     P I  R      ++NDY    P
Sbjct: 1306 QSAPPVFNTSPPFTSSRDTKYHTVNVNLTHNPN-----PDIWHRRADFSPAYNDYFMSVP 1360

Query: 59   FKFI----PNSKFYHSL-----------KPNNCSNTSVVTTQTNDNF-NLDSLLS 97
             K      PN   Y ++           KPN  +N + V   T+D F  ++SL S
Sbjct: 1361 QKATVTRSPNVSRYKNMTDSVFEQIRQDKPNEAANYN-VKVDTSDRFVQMESLTS 1414


>gi|307106458|gb|EFN54704.1| hypothetical protein CHLNCDRAFT_135363, partial [Chlorella
           variabilis]
          Length = 762

 Score = 38.5 bits (88), Expect = 3.0,   Method: Composition-based stats.
 Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 8/120 (6%)

Query: 117 GVWIG-------AIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLS 169
           G+W+        A+ R R W    G      GR +     R+ +LE  +   A  L   +
Sbjct: 69  GLWLSVPLVAALALGRLRAWLAAPGSSG-PSGRLAEPAAARLVRLEGVVSRQAAALEAAA 127

Query: 170 RQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQ 229
           RQL+KLGVR R+  + L+ PI Q  A A + SE    LA + +  E+E+   + ++ AMQ
Sbjct: 128 RQLDKLGVRSRLAGRDLRLPIQQLQAAAGQQSEVVVGLAQRMERFEQEIRGTEALVEAMQ 187


>gi|407035900|gb|EKE37918.1| SH3 domain containing protein [Entamoeba nuttalli P19]
          Length = 393

 Score = 37.0 bits (84), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 43/95 (45%), Gaps = 7/95 (7%)

Query: 123 IIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVT 182
           +++  + +R+CG+     G     L   +  LEED+     I  VLS          +V 
Sbjct: 49  MVQEFRQKRICGQTTLYYGTVKDALETYLNALEEDINKDTAITNVLS-------TSAKVF 101

Query: 183 RKALKDPITQAAALAQKNSEATRALAMQGDVLEKE 217
              +K+   Q+  L    ++ T+A+  Q DVL+KE
Sbjct: 102 ENDIKELEKQSKGLISDGNKMTKAIESQKDVLKKE 136


>gi|242035353|ref|XP_002465071.1| hypothetical protein SORBIDRAFT_01g031580 [Sorghum bicolor]
 gi|241918925|gb|EER92069.1| hypothetical protein SORBIDRAFT_01g031580 [Sorghum bicolor]
          Length = 661

 Score = 37.0 bits (84), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 1/86 (1%)

Query: 140 EGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQK 199
           E  +S  L+ R+E+L+ +M     ++ VL+RQ+E L       RK ++    Q A     
Sbjct: 3   EAADSSELLRRVEELQREMDEKVDLVEVLTRQVEDLQRERDALRKDIEQLCMQQAGPGY- 61

Query: 200 NSEATRALAMQGDVLEKELGEIQKVL 225
            S ATR L  +   LE+++  +QK L
Sbjct: 62  VSVATRMLTQRTTALEQDIENLQKKL 87


>gi|67484634|ref|XP_657537.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474795|gb|EAL52152.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449709792|gb|EMD48991.1| variant SH3 domain containing protein, putative [Entamoeba
           histolytica KU27]
          Length = 393

 Score = 37.0 bits (84), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 43/95 (45%), Gaps = 7/95 (7%)

Query: 123 IIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVRFRVT 182
           +++  + +R+CG+     G     L   +  LEED+     I  VLS          +V 
Sbjct: 49  MVQEFRQKRICGQTTLYYGTVKDALETYLNALEEDINKDTAITNVLS-------TSAKVF 101

Query: 183 RKALKDPITQAAALAQKNSEATRALAMQGDVLEKE 217
              +K+   Q+  L    ++ T+A+  Q DVL+KE
Sbjct: 102 ENDIKELEKQSKGLISDGNKMTKAIESQKDVLKKE 136


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.131    0.392 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,616,586,070
Number of Sequences: 23463169
Number of extensions: 130073571
Number of successful extensions: 437259
Number of sequences better than 100.0: 75
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 45
Number of HSP's that attempted gapping in prelim test: 437183
Number of HSP's gapped (non-prelim): 85
length of query: 253
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 114
effective length of database: 9,097,814,876
effective search space: 1037150895864
effective search space used: 1037150895864
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)