BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 039538
         (311 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|297804606|ref|XP_002870187.1| hypothetical protein ARALYDRAFT_493280 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316023|gb|EFH46446.1| hypothetical protein ARALYDRAFT_493280 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 297

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 186/243 (76%), Positives = 211/243 (86%), Gaps = 10/243 (4%)

Query: 69  IQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAP 128
           ++L DESDF ++ S D  ISI+GFGSLLSE SARSTFP+L NFR+AKL GFRRVFAH AP
Sbjct: 65  LELEDESDFEKLLSSDNRISIAGFGSLLSERSARSTFPDLENFRIAKLQGFRRVFAHSAP 124

Query: 129 IFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEF 188
           IFFE GIA P+TKEISSLSVEPCEG++L+VTVFEIK           + IPAFI+RE EF
Sbjct: 125 IFFELGIANPETKEISSLSVEPCEGESLVVTVFEIKS----------SEIPAFIQRELEF 174

Query: 189 RFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDIL 248
           RFLAV PET++GKP+TN AVLC R SDEEFFQIRCKG+KEIYF HYGR+NI+KIWRDDIL
Sbjct: 175 RFLAVVPETMEGKPYTNSAVLCGRYSDEEFFQIRCKGNKEIYFQHYGRFNIEKIWRDDIL 234

Query: 249 PCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTR 308
           PCR+YLRHCVLAAKNLGDEAYNNFLDHT LGD KTTIREYL++TG+GIMEEEPPE+LK+R
Sbjct: 235 PCRLYLRHCVLAAKNLGDEAYNNFLDHTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSR 294

Query: 309 YGG 311
           YGG
Sbjct: 295 YGG 297


>gi|145340274|ref|NP_193341.3| uncharacterized protein [Arabidopsis thaliana]
 gi|332658287|gb|AEE83687.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 289

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 187/243 (76%), Positives = 209/243 (86%), Gaps = 10/243 (4%)

Query: 69  IQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAP 128
           ++L DESDF ++ S D  ISI+GFGSLLSE SARSTFP+L NFR+AKL GFRRVFAH AP
Sbjct: 57  MELEDESDFEKLLSSDNRISITGFGSLLSERSARSTFPDLENFRIAKLQGFRRVFAHSAP 116

Query: 129 IFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEF 188
           IFFERGIA P+TKEISSLSVEPCEG++L+VTVFEIK           + IPAFI RE EF
Sbjct: 117 IFFERGIANPETKEISSLSVEPCEGESLVVTVFEIKS----------SEIPAFIGRELEF 166

Query: 189 RFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDIL 248
           RFLAV PETL+GKP+TN AVLC R SDEEFFQIRCKG+K IYF HYGR+ IDKIWRDDIL
Sbjct: 167 RFLAVVPETLEGKPYTNSAVLCGRYSDEEFFQIRCKGNKGIYFQHYGRFKIDKIWRDDIL 226

Query: 249 PCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTR 308
           PCR+YLRHCVLAAKNLGDEAYNNFLDHT LGD KTTIREYL++TG+GIMEEEPPE+LK+R
Sbjct: 227 PCRLYLRHCVLAAKNLGDEAYNNFLDHTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSR 286

Query: 309 YGG 311
           YGG
Sbjct: 287 YGG 289


>gi|225431812|ref|XP_002273099.1| PREDICTED: uncharacterized protein LOC100251759 [Vitis vinifera]
 gi|296083318|emb|CBI22954.3| unnamed protein product [Vitis vinifera]
          Length = 248

 Score =  386 bits (991), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 191/246 (77%), Positives = 210/246 (85%), Gaps = 13/246 (5%)

Query: 69  IQLNDESDFYQITSPDG---LISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAH 125
           I+L DESDF ++ SPDG   L+S+ GFGSLLSE SARSTFP+L+NFRVA+L+ FRRVFAH
Sbjct: 13  IELKDESDFEKVLSPDGGLGLLSVCGFGSLLSERSARSTFPDLMNFRVARLNSFRRVFAH 72

Query: 126 VAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKRE 185
           VAPIFFERGIAKP+T EISSLSVEPCEG+TLIVTVFEI++          + IP+FIKRE
Sbjct: 73  VAPIFFERGIAKPETMEISSLSVEPCEGETLIVTVFEIQR----------SEIPSFIKRE 122

Query: 186 HEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRD 245
           HEFRFLAV PETLDGK FT  AVLCAR SDEEFFQ RCKGSKEIYF  YGR+NI KIWRD
Sbjct: 123 HEFRFLAVFPETLDGKLFTTPAVLCARYSDEEFFQNRCKGSKEIYFQQYGRHNIHKIWRD 182

Query: 246 DILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESL 305
           D+LPCRVYLRHCVLAAKNL D AYNNFLDHT L D KTTIREYLATTG+GIMEEEPPESL
Sbjct: 183 DVLPCRVYLRHCVLAAKNLSDAAYNNFLDHTFLADRKTTIREYLATTGSGIMEEEPPESL 242

Query: 306 KTRYGG 311
           + RYGG
Sbjct: 243 RARYGG 248


>gi|363806706|ref|NP_001242012.1| uncharacterized protein LOC100781602 [Glycine max]
 gi|255641907|gb|ACU21222.1| unknown [Glycine max]
          Length = 283

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 198/294 (67%), Positives = 221/294 (75%), Gaps = 29/294 (9%)

Query: 25  FSFQLRVSTHPPLHSSHCKLHRRR-------PPPMSSDHAGTNFPANKLEPIQLNDESDF 77
           F FQL      PL S  C  H+R        P P+S     +  P       +L+DESDF
Sbjct: 12  FPFQLS-----PLASRAC--HQRATLSLSFPPLPLSKLAMSSAAPVT-----ELSDESDF 59

Query: 78  YQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAK 137
             + SPDG ISI GFGSLLSE SARSTFP+L NFR A+L+ FRRVFAHVAP+FFERGIAK
Sbjct: 60  ESLLSPDGHISICGFGSLLSERSARSTFPDLANFRTARLNRFRRVFAHVAPVFFERGIAK 119

Query: 138 PKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPET 197
           P+T EISSLSVEPCEG+TL+VTVFEI+K          + IP FIKRE EFRFLAV PET
Sbjct: 120 PETMEISSLSVEPCEGETLVVTVFEIRK----------SEIPDFIKREVEFRFLAVLPET 169

Query: 198 LDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHC 257
           LDGK F   AVLCAR SDEEFF  RCKG+KEIYF  YGR+NIDKIWRDD+LPCRVYLRHC
Sbjct: 170 LDGKAFDFPAVLCARYSDEEFFDNRCKGNKEIYFQQYGRWNIDKIWRDDVLPCRVYLRHC 229

Query: 258 VLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           VLAAKNLG+ AYNNF+DHT L D KTTIR+YLATTG+GIMEE+PPESLK RYGG
Sbjct: 230 VLAAKNLGETAYNNFVDHTYLADRKTTIRDYLATTGSGIMEEQPPESLKHRYGG 283


>gi|449464742|ref|XP_004150088.1| PREDICTED: uncharacterized protein LOC101211371 [Cucumis sativus]
          Length = 306

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 191/287 (66%), Positives = 220/287 (76%), Gaps = 18/287 (6%)

Query: 25  FSFQLRVSTHPPLHSSHCKLHRRRPPPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPD 84
           FSF +    H PL S    L R     +S  +  +N   N+ EP +L DESDF  I S  
Sbjct: 38  FSFSI---AHSPLCSGSGLLQR-----ISPINVLSNSLVNEAEPKELRDESDFEAIFSDS 89

Query: 85  GLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
             IS+ GFGSLLSE SARSTFP+LINFRVA+L+GFRR+F +VAP+FFERGIAKP+TKEIS
Sbjct: 90  DYISVCGFGSLLSERSARSTFPDLINFRVARLNGFRRIFGNVAPVFFERGIAKPETKEIS 149

Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
           SL  EPCEG+ +IVTVFEIKK          + +PAFI+RE EFRFLAV PETL GK + 
Sbjct: 150 SLCAEPCEGENIIVTVFEIKK----------SEVPAFIQREIEFRFLAVFPETLHGKAYK 199

Query: 205 NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNL 264
             AVLC+RS+DEEFFQ+RCKG+K+I+FHHYGR+NIDKIWRDDI PCRVYLRHC+LAAKNL
Sbjct: 200 KPAVLCSRSTDEEFFQVRCKGNKDIFFHHYGRHNIDKIWRDDIFPCRVYLRHCILAAKNL 259

Query: 265 GDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           GD+AYNNFLDHT LGD  TTIREYL   G GIMEEEPPESLK RYGG
Sbjct: 260 GDKAYNNFLDHTFLGDRSTTIREYLTGYGLGIMEEEPPESLKFRYGG 306


>gi|224110318|ref|XP_002315483.1| predicted protein [Populus trichocarpa]
 gi|222864523|gb|EEF01654.1| predicted protein [Populus trichocarpa]
          Length = 245

 Score =  373 bits (957), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 189/247 (76%), Positives = 208/247 (84%), Gaps = 12/247 (4%)

Query: 67  EPIQLNDESDFYQITSPDGLIS--ISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFA 124
           EPI+L D+SDF    S        + GFGSLLSE SARSTFP+L+NFRVA+L+GFRRVFA
Sbjct: 9   EPIELRDDSDFRSFLSSSDSDLISVCGFGSLLSERSARSTFPDLLNFRVARLNGFRRVFA 68

Query: 125 HVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKR 184
           HVAPIFFERGIAKP+TKEISSLSVEPCEG+ L+VTVFEI+K          + IPAF++R
Sbjct: 69  HVAPIFFERGIAKPETKEISSLSVEPCEGEGLVVTVFEIEK----------SEIPAFMER 118

Query: 185 EHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWR 244
           E EFRFLAV PETLDGKPF   AVLCAR SDEEFFQ+RCKGSKEIYF HYGRYNI+KIWR
Sbjct: 119 ELEFRFLAVVPETLDGKPFDFPAVLCARYSDEEFFQVRCKGSKEIYFQHYGRYNINKIWR 178

Query: 245 DDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPES 304
           DDILPCR YLRHCVLAAKNL D A++NFL+HT LGD KTTIREYLATTGAGIMEEEPPES
Sbjct: 179 DDILPCRAYLRHCVLAAKNLSDAAHSNFLEHTFLGDRKTTIREYLATTGAGIMEEEPPES 238

Query: 305 LKTRYGG 311
           LKTRYGG
Sbjct: 239 LKTRYGG 245


>gi|225431810|ref|XP_002271042.1| PREDICTED: uncharacterized protein LOC100265416 [Vitis vinifera]
 gi|296083319|emb|CBI22955.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  367 bits (943), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 181/249 (72%), Positives = 208/249 (83%), Gaps = 11/249 (4%)

Query: 64  NKLEPIQLNDESDFYQI-TSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
           N  E  +L+DESDF ++ +S D L+SI GFGSLLSE SARSTFP+LINFRVA+L+GFRRV
Sbjct: 70  NHSELTELHDESDFERLASSNDDLLSICGFGSLLSERSARSTFPDLINFRVARLNGFRRV 129

Query: 123 FAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFI 182
           FAHV P+FFERGIAKP+T EIS LSVEPCEG++LIVTVFEI +          + IP++I
Sbjct: 130 FAHVTPVFFERGIAKPETMEISGLSVEPCEGESLIVTVFEIHR----------SEIPSYI 179

Query: 183 KREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI 242
           KRE E+RFLAV PETLDG PFT+ AVLCAR SDEEF QIRCKGSKE++F  YGRYNI KI
Sbjct: 180 KREIEYRFLAVFPETLDGTPFTSPAVLCARYSDEEFLQIRCKGSKEMFFEQYGRYNIHKI 239

Query: 243 WRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPP 302
           WRDD+ PCRVYLRHC+LAAK+L D AYN+FLDHT L D KTTI+EYLATTG+GIMEEEPP
Sbjct: 240 WRDDVFPCRVYLRHCLLAAKSLSDAAYNDFLDHTFLADRKTTIQEYLATTGSGIMEEEPP 299

Query: 303 ESLKTRYGG 311
           ESLK RYGG
Sbjct: 300 ESLKARYGG 308


>gi|115463405|ref|NP_001055302.1| Os05g0358400 [Oryza sativa Japonica Group]
 gi|47777404|gb|AAT38038.1| unknown protein [Oryza sativa Japonica Group]
 gi|55167986|gb|AAV43854.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578853|dbj|BAF17216.1| Os05g0358400 [Oryza sativa Japonica Group]
 gi|215701221|dbj|BAG92645.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215766615|dbj|BAG98677.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 308

 Score =  366 bits (939), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 184/274 (67%), Positives = 209/274 (76%), Gaps = 24/274 (8%)

Query: 45  HRRRP-----PPMSSDHAGTNFPANKLEPIQLNDESDFYQITSP--DGLISISGFGSLLS 97
           HRR P     P  ++  +G +FP       ++N   DF  +T+P   G +S+ GFGSLLS
Sbjct: 52  HRRLPLMASSPSAAATGSGGSFP-------EMNSVDDFAAVTAPRGGGRVSVVGFGSLLS 104

Query: 98  ENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLI 157
           E SARSTFP L  FRVA L GFRRVFAH APIFFERGIA   TKE SSLSVEPCEG+ ++
Sbjct: 105 ERSARSTFPELEGFRVAALRGFRRVFAHSAPIFFERGIAIEATKEFSSLSVEPCEGELIV 164

Query: 158 VTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEE 217
           VTVFEIK+            +PAFI+REHEFRFLAV PE LDG P+ N AV+CAR SDEE
Sbjct: 165 VTVFEIKE----------DEVPAFIEREHEFRFLAVVPEGLDGVPYANPAVVCARYSDEE 214

Query: 218 FFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTS 277
           +FQ+RCKGSKEIY+  YGRYNID+IWRDDILPCR YLRHCVLAAKNLG+ AY+NFLDHT 
Sbjct: 215 YFQVRCKGSKEIYYQRYGRYNIDRIWRDDILPCRAYLRHCVLAAKNLGEPAYSNFLDHTY 274

Query: 278 LGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           LGD  TTIREYLATTGAGIMEEEPPESLK+RYGG
Sbjct: 275 LGDRTTTIREYLATTGAGIMEEEPPESLKSRYGG 308


>gi|218196630|gb|EEC79057.1| hypothetical protein OsI_19621 [Oryza sativa Indica Group]
 gi|222631273|gb|EEE63405.1| hypothetical protein OsJ_18217 [Oryza sativa Japonica Group]
          Length = 251

 Score =  364 bits (935), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 180/264 (68%), Positives = 205/264 (77%), Gaps = 19/264 (7%)

Query: 50  PPMSSDHAGTNFPANKLEPIQLNDESDFYQITSP--DGLISISGFGSLLSENSARSTFPN 107
           P  ++  +G +FP       ++N   DF  +T+P   G +S+ GFGSLLSE SARSTFP 
Sbjct: 5   PSAAATGSGGSFP-------EMNSVDDFAAVTAPRGGGRVSVVGFGSLLSERSARSTFPE 57

Query: 108 LINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIR 167
           L  FRVA L GFRRVFAH APIFFERGIA   TKE SSLSVEPCEG+ ++VTVFEIK+  
Sbjct: 58  LEGFRVAALRGFRRVFAHSAPIFFERGIAIEATKEFSSLSVEPCEGELIVVTVFEIKE-- 115

Query: 168 GIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSK 227
                     +PAFI+REHEFRFLAV PE LDG P+ N AV+CAR SDEE+FQ+RCKGSK
Sbjct: 116 --------DEVPAFIEREHEFRFLAVVPEGLDGVPYANPAVVCARYSDEEYFQVRCKGSK 167

Query: 228 EIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIRE 287
           EIY+  YGRYNID+IWRDDILPCR YLRHCVLAAKNLG+ AY+NFLDHT LGD  TTIRE
Sbjct: 168 EIYYQRYGRYNIDRIWRDDILPCRAYLRHCVLAAKNLGEPAYSNFLDHTYLGDRTTTIRE 227

Query: 288 YLATTGAGIMEEEPPESLKTRYGG 311
           YLATTGAGIMEEEPPESLK+RYGG
Sbjct: 228 YLATTGAGIMEEEPPESLKSRYGG 251


>gi|357129373|ref|XP_003566337.1| PREDICTED: uncharacterized protein LOC100832854 [Brachypodium
           distachyon]
          Length = 309

 Score =  363 bits (933), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 189/281 (67%), Positives = 211/281 (75%), Gaps = 20/281 (7%)

Query: 31  VSTHPPLHSSHCKLHRRRPPPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPDGLISIS 90
           V + PP+  SH     R  PP +   A T   A      +L   +DF  I S DG IS+ 
Sbjct: 49  VPSQPPISRSH-----RLMPPTA---AATTSEAGSF--TELTSAADFATIASADGRISVI 98

Query: 91  GFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEP 150
           GFGSLLSE SARSTFP L  FRVA L GFRRVFAH APIFF+RGIA   TKE SSLSVE 
Sbjct: 99  GFGSLLSERSARSTFPELEGFRVAALRGFRRVFAHAAPIFFDRGIAIEATKEFSSLSVEQ 158

Query: 151 CEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLC 210
           CEG+ ++VTVFEIK+      EE    +PAFI+RE+EFRFL V PE LD  PF N AV+C
Sbjct: 159 CEGEMIVVTVFEIKE------EE----VPAFIERENEFRFLVVVPEGLDQVPFPNPAVVC 208

Query: 211 ARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYN 270
           AR SDEE+FQ+RCKGSKEIY  HYGRYNIDKIWRDDILPCR+YLRHCVLAAKNLG+ AY+
Sbjct: 209 ARYSDEEYFQVRCKGSKEIYNQHYGRYNIDKIWRDDILPCRLYLRHCVLAAKNLGEAAYS 268

Query: 271 NFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           +FLDHT LGD KTTIREYLATTGAGIMEEEPPESLK+RYGG
Sbjct: 269 SFLDHTYLGDRKTTIREYLATTGAGIMEEEPPESLKSRYGG 309


>gi|449501489|ref|XP_004161381.1| PREDICTED: uncharacterized LOC101211371 [Cucumis sativus]
          Length = 304

 Score =  361 bits (926), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 190/288 (65%), Positives = 216/288 (75%), Gaps = 22/288 (7%)

Query: 25  FSFQLRVSTHPPLHSSHCKLHRRRPPPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPD 84
           FSF +    H PL S    L R     +S  +  +N   N+ EP +L DESDF  I S  
Sbjct: 38  FSFSI---AHSPLCSGSGLLQR-----ISPINVLSNSLVNEAEPKELRDESDFEAIFSDS 89

Query: 85  GLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
             IS+ GFGSLLSE SARSTFP+LINFRVA+L+GFRR+F +VAP+FFERGIAKP+TKEIS
Sbjct: 90  DYISVCGFGSLLSERSARSTFPDLINFRVARLNGFRRIFGNVAPVFFERGIAKPETKEIS 149

Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
           SL  EPCEG+ +IVTVFEIKK          + +PAFI+RE EFRFLAV PETL GK + 
Sbjct: 150 SLCAEPCEGENIIVTVFEIKK----------SEVPAFIQREIEFRFLAVFPETLHGKAYK 199

Query: 205 N-RAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKN 263
           N R VLC+RS+DEEFFQ+RCKG     FHHYGR+NIDKIWRDDI PCRVYLRHC+LAAKN
Sbjct: 200 NQRYVLCSRSTDEEFFQVRCKGH---LFHHYGRHNIDKIWRDDIFPCRVYLRHCILAAKN 256

Query: 264 LGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           LGD+AYNNFLDHT LGD  TTIREYL   G GIMEEEPPESLK RYGG
Sbjct: 257 LGDKAYNNFLDHTFLGDRSTTIREYLTGYGLGIMEEEPPESLKFRYGG 304


>gi|449464740|ref|XP_004150087.1| PREDICTED: uncharacterized protein LOC101211120 [Cucumis sativus]
          Length = 244

 Score =  357 bits (915), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 178/253 (70%), Positives = 202/253 (79%), Gaps = 10/253 (3%)

Query: 59  TNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHG 118
           +N   N+ EP +L DESDF  I S    IS+SGFGSLLS  SAR TFP LINFRVA+L+G
Sbjct: 2   SNSMLNEAEPKELRDESDFEVIFSDSDYISVSGFGSLLSVRSARGTFPELINFRVARLNG 61

Query: 119 FRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMI 178
           FRR F HVAPIFFERGIAKP+TKEISSL  EPCE + +I+TVFEIKK          + I
Sbjct: 62  FRRFFGHVAPIFFERGIAKPETKEISSLCSEPCEEENIIITVFEIKK----------SEI 111

Query: 179 PAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYN 238
           PAFI+RE EFRFLAV PETLDGK +   AVLC+R +DEEFFQ+RCKG+K+I FHHYGR+N
Sbjct: 112 PAFIQREIEFRFLAVFPETLDGKIYDKPAVLCSRFTDEEFFQVRCKGNKDILFHHYGRHN 171

Query: 239 IDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIME 298
           IDKIWRDDILPCRVYLRHCVLAAKNLG+ AYNNFLDHT LGD +TT+REYL    +GIME
Sbjct: 172 IDKIWRDDILPCRVYLRHCVLAAKNLGEIAYNNFLDHTFLGDRRTTVREYLTRYDSGIME 231

Query: 299 EEPPESLKTRYGG 311
           EEPPES+K RY G
Sbjct: 232 EEPPESVKFRYEG 244


>gi|449501496|ref|XP_004161383.1| PREDICTED: uncharacterized LOC101211120 [Cucumis sativus]
          Length = 244

 Score =  354 bits (908), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 177/253 (69%), Positives = 201/253 (79%), Gaps = 10/253 (3%)

Query: 59  TNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHG 118
           +N   N+ EP +L DESDF  I S    IS+ GFGSLLS  SAR TFP LINFRVA+L+G
Sbjct: 2   SNSMLNEAEPKELRDESDFEVIFSDSDYISVCGFGSLLSVRSARGTFPELINFRVARLNG 61

Query: 119 FRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMI 178
           FRR F HVAPIFFERGIAKP+TKEISSL  EPCE + +I+TVFEIKK          + I
Sbjct: 62  FRRFFGHVAPIFFERGIAKPETKEISSLCSEPCEEENIIITVFEIKK----------SEI 111

Query: 179 PAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYN 238
           PAFI+RE EFRFLAV PETLDGK +   AVLC+R +DEEFFQ+RCKG+K+I FHHYGR+N
Sbjct: 112 PAFIQREIEFRFLAVFPETLDGKIYDKPAVLCSRFTDEEFFQVRCKGNKDILFHHYGRHN 171

Query: 239 IDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIME 298
           IDKIWRDDILPCRVYLRHCVLAAKNLG+ AYNNFLDHT LGD +TT+REYL    +GIME
Sbjct: 172 IDKIWRDDILPCRVYLRHCVLAAKNLGEIAYNNFLDHTFLGDRRTTVREYLTRYDSGIME 231

Query: 299 EEPPESLKTRYGG 311
           EEPPES+K RY G
Sbjct: 232 EEPPESVKFRYEG 244


>gi|2244964|emb|CAB10385.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268355|emb|CAB78648.1| hypothetical protein [Arabidopsis thaliana]
          Length = 274

 Score =  354 bits (908), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 176/243 (72%), Positives = 196/243 (80%), Gaps = 25/243 (10%)

Query: 69  IQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAP 128
           ++L DESDF ++ S D  ISI+GFGSLLSE SARSTFP+L NFR+AKL GFRRVFAH AP
Sbjct: 57  MELEDESDFEKLLSSDNRISITGFGSLLSERSARSTFPDLENFRIAKLQGFRRVFAHSAP 116

Query: 129 IFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEF 188
           IFFERGIA P+TKEISSLSVEPCEG++L+VTVFEIK           + IPAFI RE EF
Sbjct: 117 IFFERGIANPETKEISSLSVEPCEGESLVVTVFEIKS----------SEIPAFIGRELEF 166

Query: 189 RFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDIL 248
           RFLAV               LC R SDEEFFQIRCKG+K IYF HYGR+ IDKIWRDDIL
Sbjct: 167 RFLAV---------------LCGRYSDEEFFQIRCKGNKGIYFQHYGRFKIDKIWRDDIL 211

Query: 249 PCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTR 308
           PCR+YLRHCVLAAKNLGDEAYNNFLDHT LGD KTTIREYL++TG+GIMEEEPPE+LK+R
Sbjct: 212 PCRLYLRHCVLAAKNLGDEAYNNFLDHTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSR 271

Query: 309 YGG 311
           YGG
Sbjct: 272 YGG 274


>gi|26452147|dbj|BAC43162.1| unknown protein [Arabidopsis thaliana]
 gi|28950779|gb|AAO63313.1| At4g16060 [Arabidopsis thaliana]
          Length = 228

 Score =  348 bits (894), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 169/217 (77%), Positives = 186/217 (85%), Gaps = 10/217 (4%)

Query: 95  LLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQ 154
           L S   ARSTFP+L NFR+AKLHGFRRVFAH APIFFERGIA P+TKEISSLSVEPCEG+
Sbjct: 22  LFSPREARSTFPDLENFRIAKLHGFRRVFAHSAPIFFERGIANPETKEISSLSVEPCEGE 81

Query: 155 TLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSS 214
           +L+VTVFEIK           + IPAFI RE EFRFLAV PETL+GKP+TN AVLC R S
Sbjct: 82  SLVVTVFEIKS----------SEIPAFIGRELEFRFLAVVPETLEGKPYTNSAVLCGRYS 131

Query: 215 DEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLD 274
           DEEFFQIRCKG+K IYF HYGR+ IDKIWRDDILPCR+YLRHCVLAAKNLGDEAYNNFLD
Sbjct: 132 DEEFFQIRCKGNKGIYFQHYGRFKIDKIWRDDILPCRLYLRHCVLAAKNLGDEAYNNFLD 191

Query: 275 HTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           HT LGD KTTIREYL++TG+GIMEEEPPE+LK+RY G
Sbjct: 192 HTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSRYRG 228


>gi|255551863|ref|XP_002516977.1| conserved hypothetical protein [Ricinus communis]
 gi|223544065|gb|EEF45591.1| conserved hypothetical protein [Ricinus communis]
          Length = 283

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 178/237 (75%), Positives = 191/237 (80%), Gaps = 24/237 (10%)

Query: 76  DFYQITSPD-GLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERG 134
           DF ++ SP  GLISI GFGSLLS               VA+L+GFRRVFAHVAPIFFERG
Sbjct: 70  DFQKLLSPSSGLISICGFGSLLSGT-------------VARLNGFRRVFAHVAPIFFERG 116

Query: 135 IAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVS 194
           IAKP+TKEISSLSVEPCEG+TLIVTVFEI K          + IPAF++RE EFRFLAV 
Sbjct: 117 IAKPETKEISSLSVEPCEGETLIVTVFEIDK----------SEIPAFMERELEFRFLAVV 166

Query: 195 PETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYL 254
           PETLDGKPF + AVLCAR SDEEFF+IRCKGSKE YF HYGRY I KIWRDDILPCR YL
Sbjct: 167 PETLDGKPFDSPAVLCARYSDEEFFRIRCKGSKETYFQHYGRYGISKIWRDDILPCRTYL 226

Query: 255 RHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           RHCVLAAKNLG  A+NNFLDHT LGD KTTIR+YLATTGAGIMEEEPPESLKTRYGG
Sbjct: 227 RHCVLAAKNLGSAAHNNFLDHTFLGDRKTTIRKYLATTGAGIMEEEPPESLKTRYGG 283


>gi|147770754|emb|CAN62470.1| hypothetical protein VITISV_016050 [Vitis vinifera]
          Length = 234

 Score =  343 bits (879), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 176/247 (71%), Positives = 193/247 (78%), Gaps = 29/247 (11%)

Query: 69  IQLNDESDFYQITSPDG---LISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAH 125
           I+L DESDF ++ SPDG   L+S+ GFGSLLSE SARSTFP+L+NFRVA+L+ FRRVFAH
Sbjct: 13  IELKDESDFEKVLSPDGGLGLLSVCGFGSLLSERSARSTFPDLMNFRVARLNSFRRVFAH 72

Query: 126 VAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKRE 185
           VAPIFFERGIAKP+T EISSLSVEPCEG+TLIVTVFEI++                    
Sbjct: 73  VAPIFFERGIAKPETMEISSLSVEPCEGETLIVTVFEIQRSE------------------ 114

Query: 186 HEFRFLAVSPETLDGKPFTNRAV-LCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWR 244
                  V PETLDGK FT  AV LCAR SDEEFFQ RCKGSKEIYF  YGR+NI KIWR
Sbjct: 115 -------VFPETLDGKLFTTPAVVLCARYSDEEFFQNRCKGSKEIYFQQYGRHNIHKIWR 167

Query: 245 DDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPES 304
           DD+LPCRVYLRHCVLAAKNL D AYNNFLDHT L D +TTIREYLATTG GIMEEEPPES
Sbjct: 168 DDVLPCRVYLRHCVLAAKNLSDAAYNNFLDHTFLADRQTTIREYLATTGTGIMEEEPPES 227

Query: 305 LKTRYGG 311
           L+ RYGG
Sbjct: 228 LRPRYGG 234


>gi|449464744|ref|XP_004150089.1| PREDICTED: uncharacterized protein LOC101211619 [Cucumis sativus]
 gi|449501484|ref|XP_004161380.1| PREDICTED: uncharacterized LOC101211619 [Cucumis sativus]
          Length = 243

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 165/245 (67%), Positives = 193/245 (78%), Gaps = 10/245 (4%)

Query: 67  EPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHV 126
           EP +L DESDF  I S    IS+ G+GSLLSE SARSTFP LINFR+A+L+ FRRVF  +
Sbjct: 9   EPNELRDESDFEAIFSDSDYISVCGYGSLLSERSARSTFPELINFRIARLNNFRRVFGVI 68

Query: 127 APIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREH 186
           APIFFE  IAKP+TKEISS+  EPCEG+T+I+TVFEIKK            IPAFI+RE 
Sbjct: 69  APIFFEHDIAKPETKEISSVFAEPCEGETIIITVFEIKKFE----------IPAFIEREF 118

Query: 187 EFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD 246
            FRFL V PETLDGK +   AVLC+RS+DEEFFQ++CKG+K+I+  +YGR+N+DKIWRDD
Sbjct: 119 AFRFLTVLPETLDGKLYHKPAVLCSRSTDEEFFQVKCKGNKDIFLQYYGRHNVDKIWRDD 178

Query: 247 ILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLK 306
           ILPCRVYLRHC+LAAK + D AYNN LDHT LGD  TTIREYL T G+ IMEEEPP+SLK
Sbjct: 179 ILPCRVYLRHCILAAKGVDDVAYNNILDHTFLGDRCTTIREYLVTNGSSIMEEEPPKSLK 238

Query: 307 TRYGG 311
            RYGG
Sbjct: 239 FRYGG 243


>gi|168064197|ref|XP_001784051.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664437|gb|EDQ51157.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 262

 Score =  314 bits (804), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 151/243 (62%), Positives = 187/243 (76%), Gaps = 11/243 (4%)

Query: 70  QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
           +L +E+DF  I + DG ISI GFGSLLSE SA  TFPNL NFRV  LHGFRRVFAHVAP+
Sbjct: 30  ELRNEADFASIANTDGEISICGFGSLLSEKSALYTFPNLRNFRVGVLHGFRRVFAHVAPV 89

Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
           F +RGIA  +TKE+SSLSVEPC G++++VTVFEI            + +PAFI+REHEFR
Sbjct: 90  FLDRGIANIETKELSSLSVEPCVGESIVVTVFEIS----------VSEVPAFIEREHEFR 139

Query: 190 FLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILP 249
           FLAV+ E  +G+PF+ +AV+CAR SDEE+ +IRC+G  E Y+  YGR+ I++IWRDDI P
Sbjct: 140 FLAVTVEDKEGQPFSQQAVICARYSDEEYRRIRCQGMNEEYYRRYGRHKIEQIWRDDIFP 199

Query: 250 CRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLA-TTGAGIMEEEPPESLKTR 308
           CRVYLRHCVLAAKNLG++AY +FLDHT + D +TTIR Y     G GIM E+PPE+L+ R
Sbjct: 200 CRVYLRHCVLAAKNLGEDAYESFLDHTYVADRRTTIRTYFKHGLGVGIMNEKPPEALRER 259

Query: 309 YGG 311
           YGG
Sbjct: 260 YGG 262


>gi|351727499|ref|NP_001236139.1| uncharacterized protein LOC100527697 [Glycine max]
 gi|255632976|gb|ACU16842.1| unknown [Glycine max]
          Length = 240

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 151/244 (61%), Positives = 177/244 (72%), Gaps = 10/244 (4%)

Query: 63  ANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
           A  L  + L   SDF  + SPDG ISI G+GSLLSE SAR+TFP+L+NFR+A+L GFRR+
Sbjct: 2   AIPLPSLDLTHHSDFEPLVSPDGYISICGYGSLLSETSARTTFPDLVNFRIARLTGFRRL 61

Query: 123 FAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFI 182
           F  V  IFF   +A  KT+EI+ LSVEPC+G+ ++V VFEIK           T IPAFI
Sbjct: 62  FNSVGRIFFTHRVANIKTQEIAGLSVEPCDGEFVVVAVFEIKN----------TEIPAFI 111

Query: 183 KREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI 242
           +RE E+RFLAV PETLDGKPF N AVLCA  +DEEFF+ RC   +E YF  YG Y I KI
Sbjct: 112 EREREYRFLAVVPETLDGKPFINPAVLCASYTDEEFFKFRCSEGREKYFEQYGEYKIHKI 171

Query: 243 WRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPP 302
           WRDD+ PCRVYLRHC LAAK+LGDE +NNFLDHT L D KTTIR+Y    G  I+EEEPP
Sbjct: 172 WRDDVFPCRVYLRHCALAAKSLGDEVHNNFLDHTFLADRKTTIRQYFEKVGTSILEEEPP 231

Query: 303 ESLK 306
           ESLK
Sbjct: 232 ESLK 235


>gi|147770755|emb|CAN62471.1| hypothetical protein VITISV_016051 [Vitis vinifera]
          Length = 301

 Score =  304 bits (778), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 162/257 (63%), Positives = 185/257 (71%), Gaps = 34/257 (13%)

Query: 64  NKLEPIQLNDESDFYQI-TSPDGLISISGFGSLLSENSARSTFPNLIN-FRVAKLHGFR- 120
           N  E  +L+DESDF ++ +S D L+SI GFGSLLSE SARSTFP+ +N  +  K+     
Sbjct: 70  NHSELTELHDESDFEKLASSNDDLLSICGFGSLLSERSARSTFPDPVNQLQSRKIERISY 129

Query: 121 RVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPA 180
           RVFAHV P+FFERGIAKP+T EIS LSVEPCEG++LIVTVFEI +               
Sbjct: 130 RVFAHVTPVFFERGIAKPETMEISGLSVEPCEGESLIVTVFEIHRSE------------- 176

Query: 181 FIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNID 240
                       V PETLDG PFT+ AVLCAR SDEEF QIRCKGSKE++F  YGRYNI 
Sbjct: 177 ------------VFPETLDGTPFTSPAVLCARYSDEEFLQIRCKGSKEMFFEQYGRYNIH 224

Query: 241 KIWRDDILPCRVYLRHC------VLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGA 294
           KIWRDD+ PCRVYLRHC      +LAAK+L D AYN+FLDHT L D KTTIREYLATTG+
Sbjct: 225 KIWRDDVFPCRVYLRHCCKTLCSLLAAKSLSDAAYNDFLDHTFLADRKTTIREYLATTGS 284

Query: 295 GIMEEEPPESLKTRYGG 311
           GIMEEEPPESLK RYGG
Sbjct: 285 GIMEEEPPESLKARYGG 301


>gi|326512630|dbj|BAJ99670.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 279

 Score =  297 bits (760), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 159/253 (62%), Positives = 175/253 (69%), Gaps = 37/253 (14%)

Query: 50  PPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLI 109
           PP+++    T+  A      +L    DF  I SPDG IS+ GFGSLLSE SARSTFP L 
Sbjct: 51  PPLTAAALATSSGAESFS--ELASADDFAAIASPDGHISVIGFGSLLSERSARSTFPELE 108

Query: 110 NFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGI 169
            FRVA L GFRRVFAH APIFFERGIA   TKE SSLSVE CEG+ ++VTVFEIK+    
Sbjct: 109 GFRVAALRGFRRVFAHAAPIFFERGIAIEATKEFSSLSVEHCEGEMIVVTVFEIKE---- 164

Query: 170 KLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEI 229
             EE    +PAFI+RE EFRFLAV PE LDG P                         EI
Sbjct: 165 --EE----VPAFIERELEFRFLAVVPEGLDGVP-------------------------EI 193

Query: 230 YFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYL 289
           Y  HYGRYNIDKIWRDDILPCR+YLRHCVLAAKNLG+ AY+NFLDHT LGD +TTIREYL
Sbjct: 194 YNQHYGRYNIDKIWRDDILPCRLYLRHCVLAAKNLGEAAYSNFLDHTYLGDRRTTIREYL 253

Query: 290 ATTGAGIMEEEPP 302
           ATTGAGIMEEEPP
Sbjct: 254 ATTGAGIMEEEPP 266


>gi|302754804|ref|XP_002960826.1| hypothetical protein SELMODRAFT_139369 [Selaginella moellendorffii]
 gi|300171765|gb|EFJ38365.1| hypothetical protein SELMODRAFT_139369 [Selaginella moellendorffii]
          Length = 242

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 141/244 (57%), Positives = 182/244 (74%), Gaps = 14/244 (5%)

Query: 70  QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
           +L + SDF  I+S +GL+SI GFGSLLSE SARSTFP L+NFR   +  FRR+FAH+API
Sbjct: 11  ELENASDFAAISS-NGLVSICGFGSLLSEKSARSTFPRLLNFRPGVVKNFRRIFAHIAPI 69

Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
           F ERGIA  +TKE+SSLSVEPC G++++VTVFEI  +           +PAFI+REHEFR
Sbjct: 70  FLERGIANVETKEMSSLSVEPCIGESIVVTVFEISLLE----------VPAFIQREHEFR 119

Query: 190 FL--AVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDI 247
           FL  AV   + +      +AV+CA+ SDEE+  IRC+G+K++++  YG++ I +IWRDDI
Sbjct: 120 FLEVAVVNSSEENAIHLCKAVICAKYSDEEYLSIRCQGNKDVFYERYGKHGIQQIWRDDI 179

Query: 248 LPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKT 307
           LPCR+YL HCVLAAKNLG +A+ +FLDHT LGD +T IREYL     GIM+E+PPE L+ 
Sbjct: 180 LPCRLYLHHCVLAAKNLGQDAFESFLDHTYLGDRRTRIREYL-DAHPGIMKEQPPEHLRD 238

Query: 308 RYGG 311
           RYGG
Sbjct: 239 RYGG 242


>gi|302804182|ref|XP_002983843.1| hypothetical protein SELMODRAFT_119517 [Selaginella moellendorffii]
 gi|300148195|gb|EFJ14855.1| hypothetical protein SELMODRAFT_119517 [Selaginella moellendorffii]
          Length = 225

 Score =  277 bits (708), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 138/242 (57%), Positives = 176/242 (72%), Gaps = 27/242 (11%)

Query: 70  QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
           +L + SDF  I+S +GL+SI GFGSLLSE SARSTFP L+NFR   +  FRR+FAH+API
Sbjct: 11  ELENASDFTAISS-NGLVSICGFGSLLSEKSARSTFPRLLNFRPGVVKNFRRIFAHIAPI 69

Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
           F ERGIA  +TKE+SSLSVEPC G++++VTVFEI  +           +PAFI+REHEFR
Sbjct: 70  FLERGIANVETKEMSSLSVEPCIGESIVVTVFEISLLE----------VPAFIQREHEFR 119

Query: 190 FLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILP 249
           FL V               +CA+ SDEE+  IRC+G+K++++  YG++ I +IWRDDILP
Sbjct: 120 FLEV---------------ICAKYSDEEYLSIRCQGNKDVFYERYGKHGIQQIWRDDILP 164

Query: 250 CRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRY 309
           CR+YLRHCVLAAKNLG +A+ +FLDHT LGD +T IREYL      IM+E+PPE L+ RY
Sbjct: 165 CRLYLRHCVLAAKNLGQDAFESFLDHTYLGDRRTRIREYL-DAHPDIMKEQPPEHLRDRY 223

Query: 310 GG 311
           GG
Sbjct: 224 GG 225


>gi|413948845|gb|AFW81494.1| hypothetical protein ZEAMMB73_001847 [Zea mays]
          Length = 255

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 145/243 (59%), Positives = 170/243 (69%), Gaps = 20/243 (8%)

Query: 17  ACIPRSKAFSFQLRVSTHP-PLHSSHCKLHRRRPPPMS--SDHAGTNFPANKLEPIQLND 73
           A +P     + +LR+   P P  S   +L    PPP +  S  AG +FP       +L  
Sbjct: 29  ATLPPVPLTAARLRLRRQPEPGPSPPTRLSHPMPPPTATMSPAAGGSFP-------ELTC 81

Query: 74  ESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFER 133
            +DF  + +P G  S+ GFGSLLSE SARSTFP L  FRVA L GFRRVFAH APIFFER
Sbjct: 82  PADFATVAAPGGRFSVVGFGSLLSERSARSTFPELEGFRVAVLRGFRRVFAHAAPIFFER 141

Query: 134 GIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAV 193
           GIA   TKE SSLSVEPC+ + ++VTVFEIK+      EE    +PAFI+REHEFRFLAV
Sbjct: 142 GIAIEATKEFSSLSVEPCDSELIVVTVFEIKE------EE----VPAFIEREHEFRFLAV 191

Query: 194 SPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVY 253
            PE LDG PFTNRAV+CAR S+EE+FQ RC+GSKEIY  +YGRYNIDKIWRDDILPCR+Y
Sbjct: 192 VPEGLDGVPFTNRAVVCARYSNEEYFQERCRGSKEIYNKYYGRYNIDKIWRDDILPCRLY 251

Query: 254 LRH 256
           LRH
Sbjct: 252 LRH 254


>gi|294462572|gb|ADE76832.1| unknown [Picea sitchensis]
          Length = 187

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 127/194 (65%), Positives = 150/194 (77%), Gaps = 10/194 (5%)

Query: 63  ANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
           A   +  +LND +DF  I SPDG +SISGFGSLLSE SARSTFPNL  FRVA L GFRRV
Sbjct: 3   ATTTQYTELNDAADFDSIASPDGYVSISGFGSLLSERSARSTFPNLQTFRVAVLRGFRRV 62

Query: 123 FAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFI 182
           FAH APIFFERGIA+ +TKEISSLSVEPC G+ +IVTVFEI K          + +PAFI
Sbjct: 63  FAHTAPIFFERGIARMETKEISSLSVEPCPGEFIIVTVFEIHK----------SEVPAFI 112

Query: 183 KREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI 242
           +REHEFRFL   PE L GKPF++ +V+C+R SDEE+  +RCKG++E +F  YG +NIDKI
Sbjct: 113 EREHEFRFLVAVPEDLFGKPFSHPSVVCSRYSDEEYRLVRCKGNEETFFQRYGHHNIDKI 172

Query: 243 WRDDILPCRVYLRH 256
           W+DDILPCRVYLRH
Sbjct: 173 WQDDILPCRVYLRH 186


>gi|307111831|gb|EFN60065.1| hypothetical protein CHLNCDRAFT_18676 [Chlorella variabilis]
          Length = 235

 Score =  210 bits (534), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 117/251 (46%), Positives = 154/251 (61%), Gaps = 25/251 (9%)

Query: 61  FPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFR 120
            PA      +L+D  DF  +    GL++I GFGSLLS+ SA +TFP+L NFR   L G+R
Sbjct: 10  IPAVTSGAFELDD--DFGGLACSQGLMTICGFGSLLSKTSALTTFPSLQNFRTGLLRGWR 67

Query: 121 RVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPA 180
           RVF H   IF+ RGIA+P+T E+SSLSVE   G  ++V++FE+              I A
Sbjct: 68  RVFTHQCDIFYARGIARPETGEVSSLSVEEHPGSEIVVSLFEVGSDP--------ASIAA 119

Query: 181 FIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNID 240
           FI+REHEFR++ V             AV+CAR  D ++ + RC   +  +   YGR+ + 
Sbjct: 120 FIEREHEFRWVGV------------LAVVCARWDDAQYRRRRCPPDE--WQRRYGRHGVQ 165

Query: 241 KIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEE 300
           +IW D+++PCRVYLRHCVLAA+ LG EA+ +FLD T L D  TTIRE+LA    GIM E 
Sbjct: 166 QIWTDNVVPCRVYLRHCVLAAQRLGPEAHASFLDGTVLSDRATTIREHLAAN-PGIMHEL 224

Query: 301 PPESLKTRYGG 311
           PP SL  RY G
Sbjct: 225 PPSSLIGRYSG 235


>gi|302850776|ref|XP_002956914.1| hypothetical protein VOLCADRAFT_119537 [Volvox carteri f.
           nagariensis]
 gi|300257795|gb|EFJ42039.1| hypothetical protein VOLCADRAFT_119537 [Volvox carteri f.
           nagariensis]
          Length = 261

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 109/259 (42%), Positives = 153/259 (59%), Gaps = 39/259 (15%)

Query: 70  QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
           +L+DE  F  +   DGL++I+GFGSLLSE SAR TFP+L+NFR+  + G+RRVF+H A +
Sbjct: 25  ELDDE--FTGLAGDDGLMTIAGFGSLLSERSARYTFPHLVNFRIGLVPGWRRVFSHTADV 82

Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
           FF RGIA+P+T             + ++V++FE+              + AFI REHEFR
Sbjct: 83  FFLRGIARPETSS---------HRRGVVVSLFEVPYTP--------ANVAAFISREHEFR 125

Query: 190 FLAVSPETLD-----------------GKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFH 232
           F+AV P  L                  G+P   +AV+CA ++D ++  +RC  ++  +  
Sbjct: 126 FVAVQPLELGTGQPLELGTGQPLELGTGQPIGRKAVVCAANTDSDYRALRCPPAE--WSR 183

Query: 233 HYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATT 292
            +  +    +WR D+LPCRVYLRHCVLAA++LG+EA  NFL  T L D +TT+ EYLA  
Sbjct: 184 RWSVHGCTSVWRTDVLPCRVYLRHCVLAARSLGEEAEENFLRATFLADRRTTVGEYLALH 243

Query: 293 GAGIMEEEPPESLKTRYGG 311
              IM+E PP  L  RY G
Sbjct: 244 -LDIMDELPPPELAERYSG 261


>gi|159485048|ref|XP_001700561.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272201|gb|EDO98005.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 205

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 103/231 (44%), Positives = 141/231 (61%), Gaps = 32/231 (13%)

Query: 87  ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSL 146
           ++++GFGSLLSE SAR TFPNL+NFR  ++HG+RRVFAH A +FF RGIA+P T   ++ 
Sbjct: 1   MTVAGFGSLLSERSARFTFPNLVNFRAGQIHGWRRVFAHTADVFFVRGIARPDTVRGAAP 60

Query: 147 SVEPCEGQTLIVTVFEIKKIRGIKLEELCT-----MIPAFIKREHEFRFLAVSPETLDGK 201
           S                       L  +C       + AFI REHEFRF+AV P  L+G+
Sbjct: 61  S-----------------------LPYMCVPATRDSVAAFIGREHEFRFVAVQPADLEGR 97

Query: 202 PFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIW-RDDILPCRVYLRHCVLA 260
               +AV+CA ++D ++  +RC   +  +   +G + I  +W R D+LPCR YLRHCVLA
Sbjct: 98  ALGRKAVVCAANTDADYKAMRCPPQE--WERRWGVHGIQAVWGRGDVLPCRAYLRHCVLA 155

Query: 261 AKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           A +LG EA ++FL  T L D +TTI ++LA     IMEE+PP +L  RY G
Sbjct: 156 AASLGPEAEHSFLHATYLADRRTTIAQHLARDPT-IMEEQPPPALAERYNG 205


>gi|428183575|gb|EKX52432.1| hypothetical protein GUITHDRAFT_65153 [Guillardia theta CCMP2712]
          Length = 228

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 111/226 (49%), Positives = 143/226 (63%), Gaps = 19/226 (8%)

Query: 89  ISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSV 148
           I GFGSLLSE SARS+FPNL NFR+A+L G++RV  H A IFF+R IA+  T+E +SLSV
Sbjct: 19  IIGFGSLLSEASARSSFPNLQNFRLARLRGYKRVMRHPASIFFQRNIARKDTREFASLSV 78

Query: 149 EPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSP-ETLDGKPFTNRA 207
           EPC     +V  F+I        EEL     A+ +RE EF F+A++P E LDG       
Sbjct: 79  EPCPDGNFLVAAFDIPS------EEL----QAYHEREEEF-FIALAPIEELDGSK-VQEG 126

Query: 208 VLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI--WRDDILPCRVYLRHCVLAAKNLG 265
           +LC   +D+E+     K  ++ +   YG Y +  I  W+ +ILPCRVYLRHCVLA    G
Sbjct: 127 LLCCAGTDDEYI---AKHGRKNFEERYGSYGLSTIWNWQGEILPCRVYLRHCVLACSKAG 183

Query: 266 DEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           +E   +FL +T L D KTTI+EYL    + IMEEEPPESL+ RY G
Sbjct: 184 EEVKEDFLVNTFLWDRKTTIKEYLQANPS-IMEEEPPESLRHRYSG 228


>gi|348664663|gb|EGZ04507.1| hypothetical protein PHYSODRAFT_343221 [Phytophthora sojae]
          Length = 216

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 106/231 (45%), Positives = 143/231 (61%), Gaps = 20/231 (8%)

Query: 86  LISISGFGSLLSENSARSTFPNLI-NFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
           +++I GFGSLLSE SARSTF + + NFR+A++  +RRVFAH A IFF+RGIA  +TKE++
Sbjct: 1   MVAIVGFGSLLSEASARSTFGDGVRNFRLARVLDYRRVFAHPASIFFQRGIADLQTKEMA 60

Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
           SLS EP  G   +V+VF+I +           ++P F +RE EF  ++   + LDG    
Sbjct: 61  SLSTEPAPGCEFLVSVFDIPE----------QLLPDFYEREEEFEIISAKFQELDGTS-G 109

Query: 205 NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD----ILPCRVYLRHCVLA 260
             A++C R SDEE+     K  +E +   Y  Y +D IW  D    ILPCRVYLRHC+LA
Sbjct: 110 EEALMCTRWSDEEYI---AKRGQETFDTKYKAYGLDTIWGWDAQSGILPCRVYLRHCLLA 166

Query: 261 AKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
            K LG + Y++F+  T LGD  TTI+EY+    + IM E PP  L  RY G
Sbjct: 167 VKKLGQDVYDDFVTTTYLGDRSTTIKEYIEANPS-IMLELPPPHLVDRYSG 216


>gi|301111564|ref|XP_002904861.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262095191|gb|EEY53243.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 205

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 101/227 (44%), Positives = 139/227 (61%), Gaps = 23/227 (10%)

Query: 86  LISISGFGSLLSENSARSTFPNLI-NFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
           +++I GFGSLLSE  ARSTF + + NFR+A +  +RRVFAH A IFF+RGIA  +TKEI+
Sbjct: 1   MVAIVGFGSLLSEAWARSTFGDGVRNFRLATVLDYRRVFAHPASIFFQRGIADLETKEIA 60

Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
           SLS EP  G   +V+VF+I        EEL   +P F +RE EF+ ++ + + LDG    
Sbjct: 61  SLSTEPAPGCKFLVSVFDIP-------EEL---LPDFYEREEEFKIISATFQELDGSA-R 109

Query: 205 NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNL 264
             A++C R SD+E+   R +G+ +  +  YG   ++ IW        VYLRHC+LA K L
Sbjct: 110 GEALMCTRWSDDEYIAKRGQGTFDTKYKAYG---LNTIW-------GVYLRHCLLAVKKL 159

Query: 265 GDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           G + Y++F   T LGD  TTI+EY+    + IM E PP  L  RY G
Sbjct: 160 GQDVYDDFFTTTYLGDRSTTIKEYVEAKPS-IMLERPPPHLVDRYSG 205


>gi|224006305|ref|XP_002292113.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972632|gb|EED90964.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 231

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 105/236 (44%), Positives = 138/236 (58%), Gaps = 17/236 (7%)

Query: 87  ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSL 146
           I+I GFGSLLS  SAR TFP+L NFR+ ++   RRVF+H A IFF+RGIA+  T E+SSL
Sbjct: 2   ITILGFGSLLSIKSARLTFPSLQNFRLGRIPNHRRVFSHPASIFFQRGIARMDTLEMSSL 61

Query: 147 SVEPCEGQTLIVTVFEI--KKIRGIKLEELCT--MIP--AFIKREHEFRFLAVSPETLDG 200
             E  EG + I +VFE+  + +  I  +   T   IP  AF++RE EF  +A+ P +   
Sbjct: 62  CAEYKEGHSFICSVFEVPNEGLTAIASDGEGTGEWIPSRAFLEREEEFE-IAMVPNSTAS 120

Query: 201 KPFTNRAVLCARSSDEEFFQIRCKGSKEIYFH-HYGRYNIDKIWRDD----ILPCRVYLR 255
                  VLC RS+DE +         E +F   Y  Y +  IW  D    + PC+VY R
Sbjct: 121 DDTVKYGVLCRRSTDESYL----AHWGEAHFQKQYLDYGVKTIWNWDEDSGLRPCQVYTR 176

Query: 256 HCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           HCVLA++ +G    ++FLD T L D +TT+REYL      IM  EPPE LK RYGG
Sbjct: 177 HCVLASEGIGGACQDSFLDDTYLVDRRTTLREYLNQY-PEIMTTEPPEELKERYGG 231


>gi|255076537|ref|XP_002501943.1| predicted protein [Micromonas sp. RCC299]
 gi|226517207|gb|ACO63201.1| predicted protein [Micromonas sp. RCC299]
          Length = 264

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 105/249 (42%), Positives = 134/249 (53%), Gaps = 34/249 (13%)

Query: 84  DGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEI 143
           DG + + G GSLLSE SAR T PNL  F+  +L G+RRVFAH AP+FFERGIA  KT+E+
Sbjct: 29  DGWVGVVGIGSLLSERSARHTSPNLRGFKTVRLRGYRRVFAHTAPVFFERGIADGKTREV 88

Query: 144 SSLSVEPCE----------GQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAV 193
           SSLS+E  E           + L+ T FEI        EE     PA   RE EF   AV
Sbjct: 89  SSLSMEALEPGDDPREAHGPRGLVATYFEIPG------EEF----PALAAREAEFELRAV 138

Query: 194 S-----PETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD-- 246
           +      +   G     +A+LC RSSD  + +  C+  K         +  ++IW     
Sbjct: 139 TYEDDEDDGEGGGSDGKKAILCVRSSDAAYAERHCERMKGTGGGGG--FGEERIWYPGRR 196

Query: 247 ----ILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPP 302
               + P R Y RHCVLAAK+LG EA  +FLDHT L D +TT+R +L    + +ME  PP
Sbjct: 197 SPPAVYPTRAYARHCVLAAKSLGPEALASFLDHTFLADRRTTLRTHLERNPS-LMETPPP 255

Query: 303 ESLKTRYGG 311
             L  RYGG
Sbjct: 256 PELAERYGG 264


>gi|323449443|gb|EGB05331.1| hypothetical protein AURANDRAFT_30996 [Aureococcus anophagefferens]
          Length = 225

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 102/235 (43%), Positives = 124/235 (52%), Gaps = 18/235 (7%)

Query: 81  TSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKT 140
           T+  G  +I GFGSLLSE SAR TFP+L  FR+ ++ G+RRVF H A IFFERGIA   T
Sbjct: 5   TARVGRTTILGFGSLLSEQSARLTFPDLDEFRLVRVRGYRRVFGHAASIFFERGIADAAT 64

Query: 141 KEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDG 200
           KE SSLS EPC   + +   FE+        +        F  RE EF         LDG
Sbjct: 65  KEFSSLSAEPCADASFVACAFEV--------DVDADAWARFEAREEEFALADAPFAALDG 116

Query: 201 KPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIW---RDDIL-PCRVYLRH 256
                  VLC R SD  F +   +  +E Y   Y   ++  IW   RD  L PC  YLRH
Sbjct: 117 GA-GGEGVLCTRGSDAAFVE---RWGRETYERKY-LAHVPSIWDWSRDSGLRPCACYLRH 171

Query: 257 CVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           CVLA    G  A ++FLD T L D +TT+R YL      +M+  PPESL  RY G
Sbjct: 172 CVLAVTKAGPAALDSFLDDTYLVDRETTLRTYL-DANPHVMDTRPPESLIGRYSG 225


>gi|397643202|gb|EJK75711.1| hypothetical protein THAOC_02554 [Thalassiosira oceanica]
          Length = 357

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 128/260 (49%), Gaps = 40/260 (15%)

Query: 87  ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSL 146
           I+I GFGSLLS  SAR TFP L NFR+ ++   RRVFAH A IFF+RGIA  +T E+SSL
Sbjct: 89  ITILGFGSLLSHKSARLTFPTLKNFRLGRVPDHRRVFAHPASIFFQRGIADMETLEMSSL 148

Query: 147 SVEPCEGQTLIVTVFEIKKIRGIKLEELCT----MIP--AFIKREHEFRFLAVSPETL-- 198
           S E  EG + + +VFE+    G+      +     +P  AF++RE EF    V  E L  
Sbjct: 149 SCEYVEGASFVCSVFEVPN-EGLSATGATSGDDNWVPSRAFLEREEEFEIAMVPYEELTS 207

Query: 199 ---DGKPFT-------NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD-- 246
              DG            R       S+          +K+ +  HY  Y ++ IW  D  
Sbjct: 208 ADVDGGDAEHGIRGPRTREGTSTTISEGATRAATTAATKDRFKRHYADYGVETIWGWDQG 267

Query: 247 --ILPCRVYLRHCVLAAKN----------------LGDEAYNNFLDHTSLGDCKTTIREY 288
             + PC  YLRHCVLA+ N                 G   Y +FLD T L D +TT+REY
Sbjct: 268 SGLRPCAAYLRHCVLASWNCGDGTGGGGDAWSADGAGGLCYGSFLDETFLVDRETTVREY 327

Query: 289 LATTGAGIMEEEPPESLKTR 308
           L      +M  EPPE L+ R
Sbjct: 328 LRRF-PDVMATEPPEGLRER 346


>gi|89093034|ref|ZP_01165985.1| hypothetical protein MED92_03118 [Neptuniibacter caesariensis]
 gi|89082684|gb|EAR61905.1| hypothetical protein MED92_03118 [Oceanospirillum sp. MED92]
          Length = 217

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 79/206 (38%), Positives = 116/206 (56%), Gaps = 13/206 (6%)

Query: 86  LISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISS 145
           +ISI G+GSLLSE+SAR T P+L NFR+ ++ G+RR+F  V  +F  R  A P++ +++S
Sbjct: 1   MISIIGYGSLLSESSARETVPDLCNFRLVRVPGYRRIFNKVGIVFLSRHGADPQSLQLAS 60

Query: 146 LSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTN 205
            + E  +   LI + FE  +    +L E          REH F ++ V  E++D      
Sbjct: 61  CATEADKTSVLICSQFECSEESFSELYE----------REHRFEWVDV--ESVDSSGVKT 108

Query: 206 RAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLG 265
            A +C +S+DE +   +C   KE Y    GRY    +WRDDILP   YL  C+ AA+  G
Sbjct: 109 VAKMCTQSTDEHYRFQKCGSEKEFY-ERVGRYYQGPLWRDDILPFPRYLAFCLQAAEGQG 167

Query: 266 DEAYNNFLDHTSLGDCKTTIREYLAT 291
            +  +NFLD + L D  T+IR YL +
Sbjct: 168 ADVLDNFLDTSYLADGCTSIRRYLDS 193


>gi|219122551|ref|XP_002181606.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406882|gb|EEC46820.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 290

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 86/237 (36%), Positives = 130/237 (54%), Gaps = 21/237 (8%)

Query: 84  DGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEI 143
           D  I+I G+GSLLSE SA+STFP L NFR+ ++  +RRVF H A IFF  GIA   + +I
Sbjct: 66  DETITILGYGSLLSEKSAKSTFPELSNFRLGRIRNYRRVFGHPASIFFATGIADLPSLQI 125

Query: 144 SSLSVE-PCEGQTLIVTVFEIKKIRGIKLEELCTMIP--AFIKRE--HEFRFLAVSPETL 198
           +S S E   +    +V+ F++         E  + +P  A ++RE  ++F  +  +  T 
Sbjct: 126 ASASAEFDQDSAGFVVSAFDVPN-------EGLSGVPSRALLEREASYDFEKVVFAGLTK 178

Query: 199 DGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWR----DDILPCRVYL 254
           +  P +    L  RS+D  + +   K  +E +   + +  I+ IW       + PC VYL
Sbjct: 179 NCNP-SMEGWLSVRSTDSAYIK---KWGQERFDRQFRKNGIETIWHWPEDSGLRPCAVYL 234

Query: 255 RHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
           RHC LAA+++G+E +N+ LD + L D  TTIREYL      +++  PP  L  RY G
Sbjct: 235 RHCYLAAQSMGEECFNSLLDESYLCDRSTTIREYL-NMYPEVLDSVPPPELALRYNG 290


>gi|428177877|gb|EKX46755.1| hypothetical protein GUITHDRAFT_70452, partial [Guillardia theta
           CCMP2712]
          Length = 207

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 72/215 (33%), Positives = 110/215 (51%), Gaps = 12/215 (5%)

Query: 100 SARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTK-EISSLSVEPCEGQTLIV 158
           SAR TFP+L +FR+ ++   RRVFAH      ++GI  P T    +SLSVEP  G  L  
Sbjct: 2   SARLTFPHLRDFRLVRVQDHRRVFAHPHNFLIQQGIIDPSTTLRFASLSVEPALGHQLTA 61

Query: 159 TVFEI--KKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDE 216
             FE+  K    ++++       +F++RE E++F   +   + G+      VLC  + D 
Sbjct: 62  AAFELEGKAAELVQVDMDGKQRQSFVEREQEYQFGIATFTDIRGEG-RREGVLCLANDDS 120

Query: 217 EFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHT 276
              +   +  K      +G        R  + P  +YLRHC+LA +  G+EA  +FLD T
Sbjct: 121 SIPEETRRKLKVPLVWGWGE-------RSGLEPAWIYLRHCILACRKGGEEAERSFLDET 173

Query: 277 SLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
            L D KTTIR+YL      +++  PP  L++R+ G
Sbjct: 174 FLVDRKTTIRDYLQLH-PEVLDSSPPPYLESRFNG 207


>gi|428178630|gb|EKX47504.1| hypothetical protein GUITHDRAFT_137296 [Guillardia theta CCMP2712]
          Length = 218

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 75/216 (34%), Positives = 108/216 (50%), Gaps = 22/216 (10%)

Query: 80  ITSPDGL-----ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERG 134
           + +PDG      ++I GFGSL+  +SA  T     N R   + GF RVF  V+ I   RG
Sbjct: 5   LRAPDGAAKGGRVTIIGFGSLMDPSSASQTMRG-SNMRRGTVDGFCRVFNLVSIINIARG 63

Query: 135 IAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVS 194
            AK   +E+++ +    E   L V++ +I K      EE      AF +RE   R   V 
Sbjct: 64  YAK--NRELATCTAMRRENSFLRVSLMDIDK------EEY----EAFARREARLRHEVVE 111

Query: 195 PETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYL 254
             + DG       V+C   SDEE+ + R +   + YF   G+    K++R+D+LP   YL
Sbjct: 112 YVSDDGDRGV--GVICTAYSDEEYRKERVRDDDD-YFEQVGKLYDGKLYRNDLLPVPNYL 168

Query: 255 RHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLA 290
           R  V A    GD+  +NFLDH+ LGD  T +R+Y A
Sbjct: 169 RMVVRAHMRHGDDFLDNFLDHSFLGD-GTRMRDYAA 203


>gi|449016611|dbj|BAM80013.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 263

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 69/251 (27%), Positives = 119/251 (47%), Gaps = 20/251 (7%)

Query: 63  ANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
           A + E +  +D   +    S +  + ++G+GSLLS  SAR T P L + R+A + G+RR+
Sbjct: 9   AAEAEVLAQSDAVSWVDAGSGERRVRLAGYGSLLSVASARRTCPQLRHHRLAWVQGYRRI 68

Query: 123 FAHVAPIFFERG--IAKPKTKEISSLSVEPCEGQT-LIVTVFEIKKIRGIKLEELCTMIP 179
           F  V+     RG   A  +T+E+++ +  P + +  L V +F+I            + + 
Sbjct: 69  FCLVSLTCVRRGPPYADWRTREVAACAAFPGDSEDRLAVALFDIPA----------SELQ 118

Query: 180 AFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNI 239
           A+++RE  + F  V+    D      +A++C    D   + + C   ++   H  G ++ 
Sbjct: 119 AYLERESRYEFTVVNCLEADSSGI-RKAIMCVAPFDR--WAVACPIREQGPLHDAGCWHW 175

Query: 240 --DKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIM 297
               +WR D+LP   YL  C+ AA     +  + FLD + L D  TT+R YL +      
Sbjct: 176 YSGPLWRKDVLPSAPYLHLCLRAAAAF--DVLDEFLDASYLADGCTTLRSYLRSLDQWGP 233

Query: 298 EEEPPESLKTR 308
             +P    KTR
Sbjct: 234 AADPISLEKTR 244


>gi|290997430|ref|XP_002681284.1| hypothetical protein NAEGRDRAFT_78353 [Naegleria gruberi]
 gi|284094908|gb|EFC48540.1| hypothetical protein NAEGRDRAFT_78353 [Naegleria gruberi]
          Length = 441

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 113/250 (45%), Gaps = 48/250 (19%)

Query: 87  ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERG-IAKPKTKEISS 145
           I+I G GSLLSE SAR TFPNLINFR A +  + R F  VA      G + K K   +++
Sbjct: 175 ITICGIGSLLSEKSARRTFPNLINFRKAIVKNYVRCFNLVAISSLNFGDLEKRKIASVAA 234

Query: 146 LSVEPCEGQTL--IVTVFEIKKIRGIKLEELCTM--IP-------AFIKREH--EFRFLA 192
           +     E +T+  I+ +    + R  ++E  C++  IP       A++ REH  EF+F+ 
Sbjct: 235 VHQSDLEEETIRKIIRLQSQDETRQ-EIEMHCSVFEIPLTEEIWCAYMTREHRYEFKFIT 293

Query: 193 VSPETLDG---------------KPFTN---RAVLCARSSDEEFFQIRCKGSKEIYFHHY 234
           +  +  +G                 F++   +A++C + +D+ +   RC   +  Y    
Sbjct: 294 IDQDFQNGVMQASCSCCENNCQCSEFSSCGCKALMCCQFNDDSYRIERCSSYENEYHERV 353

Query: 235 GRYNIDKIWRDD---------------ILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLG 279
            ++   ++W                  I P   YL  C  AA  +G  A+ NFL  + + 
Sbjct: 354 AQHFQGRLWYHTKHVQNEKHIHNNEIPIKPIPKYLELCFNAALEMGRSAFENFLQTSVMA 413

Query: 280 DCKTTIREYL 289
           D  T + +YL
Sbjct: 414 DGVTLLEDYL 423


>gi|387220189|gb|AFJ69803.1| hypothetical protein NGATSA_2070200, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 63

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/50 (62%), Positives = 43/50 (86%)

Query: 87  ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIA 136
           +++ GFGSLLSE S+R+TFP L+NFR+ +++G+RR+F H A IFFERGIA
Sbjct: 14  VTLIGFGSLLSETSSRTTFPQLVNFRLGRVNGYRRLFRHPAAIFFERGIA 63


>gi|283782143|ref|YP_003372898.1| hypothetical protein Psta_4391 [Pirellula staleyi DSM 6068]
 gi|283440596|gb|ADB19038.1| hypothetical protein Psta_4391 [Pirellula staleyi DSM 6068]
          Length = 295

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 103/257 (40%), Gaps = 57/257 (22%)

Query: 57  AGTNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLIN---FRV 113
           +G +FP   LE          +    P+  + I G GSLL+  SA      +     F V
Sbjct: 81  SGFDFPWTNLEE---------HIRQQPNQRVLIVGIGSLLNLESAAEDIQGVNEEEFFPV 131

Query: 114 AKLHGFRRVFAHVAP--IFFERGIAKPKTKEISSLSV-------EPCEGQTLIVTVFEIK 164
             + G +R+F ++ P  +  E G   P+  E ++L++       +   G+ L V V ++ 
Sbjct: 132 VAV-GAKRLFNYLIPGSVLEELG-GNPQPDERAALNMVATGRSNDWFNGRILSVAVQDLD 189

Query: 165 KIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCK 224
            +R               KREH +            +P T  ++     +DE F      
Sbjct: 190 GLR---------------KREHGYHL----------RPVT--SIRWDSPTDEPFLAYVLC 222

Query: 225 GSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTT 284
              E+     GR  ID    + I P   Y+  C+  A ++  E    FLD + L D KT+
Sbjct: 223 AEDEVV---EGRQVID----NTIEPNPAYVEICLTGASSVSQEFGAMFLDTSYLADGKTS 275

Query: 285 IREYLATTGAGIMEEEP 301
           +R +LAT    I +E P
Sbjct: 276 LRTWLATHPLPIADESP 292


>gi|291416492|ref|XP_002724481.1| PREDICTED: MUC12 protein-like, partial [Oryctolagus cuniculus]
          Length = 480

 Score = 37.4 bits (85), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 16/41 (39%), Positives = 23/41 (56%)

Query: 14  ASFACIPRSKAFSFQLRVSTHPPLHSSHCKLHRRRPPPMSS 54
            SF  +P+   FS Q R +  PP+H SHC++H     P+ S
Sbjct: 121 GSFDNLPQQPKFSLQHRATCKPPVHHSHCRIHHFPQQPIVS 161


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.138    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,982,761,301
Number of Sequences: 23463169
Number of extensions: 205992551
Number of successful extensions: 390690
Number of sequences better than 100.0: 47
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 390557
Number of HSP's gapped (non-prelim): 48
length of query: 311
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 169
effective length of database: 9,027,425,369
effective search space: 1525634887361
effective search space used: 1525634887361
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)