BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 039538
(311 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297804606|ref|XP_002870187.1| hypothetical protein ARALYDRAFT_493280 [Arabidopsis lyrata subsp.
lyrata]
gi|297316023|gb|EFH46446.1| hypothetical protein ARALYDRAFT_493280 [Arabidopsis lyrata subsp.
lyrata]
Length = 297
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/243 (76%), Positives = 211/243 (86%), Gaps = 10/243 (4%)
Query: 69 IQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAP 128
++L DESDF ++ S D ISI+GFGSLLSE SARSTFP+L NFR+AKL GFRRVFAH AP
Sbjct: 65 LELEDESDFEKLLSSDNRISIAGFGSLLSERSARSTFPDLENFRIAKLQGFRRVFAHSAP 124
Query: 129 IFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEF 188
IFFE GIA P+TKEISSLSVEPCEG++L+VTVFEIK + IPAFI+RE EF
Sbjct: 125 IFFELGIANPETKEISSLSVEPCEGESLVVTVFEIKS----------SEIPAFIQRELEF 174
Query: 189 RFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDIL 248
RFLAV PET++GKP+TN AVLC R SDEEFFQIRCKG+KEIYF HYGR+NI+KIWRDDIL
Sbjct: 175 RFLAVVPETMEGKPYTNSAVLCGRYSDEEFFQIRCKGNKEIYFQHYGRFNIEKIWRDDIL 234
Query: 249 PCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTR 308
PCR+YLRHCVLAAKNLGDEAYNNFLDHT LGD KTTIREYL++TG+GIMEEEPPE+LK+R
Sbjct: 235 PCRLYLRHCVLAAKNLGDEAYNNFLDHTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSR 294
Query: 309 YGG 311
YGG
Sbjct: 295 YGG 297
>gi|145340274|ref|NP_193341.3| uncharacterized protein [Arabidopsis thaliana]
gi|332658287|gb|AEE83687.1| uncharacterized protein [Arabidopsis thaliana]
Length = 289
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/243 (76%), Positives = 209/243 (86%), Gaps = 10/243 (4%)
Query: 69 IQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAP 128
++L DESDF ++ S D ISI+GFGSLLSE SARSTFP+L NFR+AKL GFRRVFAH AP
Sbjct: 57 MELEDESDFEKLLSSDNRISITGFGSLLSERSARSTFPDLENFRIAKLQGFRRVFAHSAP 116
Query: 129 IFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEF 188
IFFERGIA P+TKEISSLSVEPCEG++L+VTVFEIK + IPAFI RE EF
Sbjct: 117 IFFERGIANPETKEISSLSVEPCEGESLVVTVFEIKS----------SEIPAFIGRELEF 166
Query: 189 RFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDIL 248
RFLAV PETL+GKP+TN AVLC R SDEEFFQIRCKG+K IYF HYGR+ IDKIWRDDIL
Sbjct: 167 RFLAVVPETLEGKPYTNSAVLCGRYSDEEFFQIRCKGNKGIYFQHYGRFKIDKIWRDDIL 226
Query: 249 PCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTR 308
PCR+YLRHCVLAAKNLGDEAYNNFLDHT LGD KTTIREYL++TG+GIMEEEPPE+LK+R
Sbjct: 227 PCRLYLRHCVLAAKNLGDEAYNNFLDHTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSR 286
Query: 309 YGG 311
YGG
Sbjct: 287 YGG 289
>gi|225431812|ref|XP_002273099.1| PREDICTED: uncharacterized protein LOC100251759 [Vitis vinifera]
gi|296083318|emb|CBI22954.3| unnamed protein product [Vitis vinifera]
Length = 248
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 191/246 (77%), Positives = 210/246 (85%), Gaps = 13/246 (5%)
Query: 69 IQLNDESDFYQITSPDG---LISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAH 125
I+L DESDF ++ SPDG L+S+ GFGSLLSE SARSTFP+L+NFRVA+L+ FRRVFAH
Sbjct: 13 IELKDESDFEKVLSPDGGLGLLSVCGFGSLLSERSARSTFPDLMNFRVARLNSFRRVFAH 72
Query: 126 VAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKRE 185
VAPIFFERGIAKP+T EISSLSVEPCEG+TLIVTVFEI++ + IP+FIKRE
Sbjct: 73 VAPIFFERGIAKPETMEISSLSVEPCEGETLIVTVFEIQR----------SEIPSFIKRE 122
Query: 186 HEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRD 245
HEFRFLAV PETLDGK FT AVLCAR SDEEFFQ RCKGSKEIYF YGR+NI KIWRD
Sbjct: 123 HEFRFLAVFPETLDGKLFTTPAVLCARYSDEEFFQNRCKGSKEIYFQQYGRHNIHKIWRD 182
Query: 246 DILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESL 305
D+LPCRVYLRHCVLAAKNL D AYNNFLDHT L D KTTIREYLATTG+GIMEEEPPESL
Sbjct: 183 DVLPCRVYLRHCVLAAKNLSDAAYNNFLDHTFLADRKTTIREYLATTGSGIMEEEPPESL 242
Query: 306 KTRYGG 311
+ RYGG
Sbjct: 243 RARYGG 248
>gi|363806706|ref|NP_001242012.1| uncharacterized protein LOC100781602 [Glycine max]
gi|255641907|gb|ACU21222.1| unknown [Glycine max]
Length = 283
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 198/294 (67%), Positives = 221/294 (75%), Gaps = 29/294 (9%)
Query: 25 FSFQLRVSTHPPLHSSHCKLHRRR-------PPPMSSDHAGTNFPANKLEPIQLNDESDF 77
F FQL PL S C H+R P P+S + P +L+DESDF
Sbjct: 12 FPFQLS-----PLASRAC--HQRATLSLSFPPLPLSKLAMSSAAPVT-----ELSDESDF 59
Query: 78 YQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAK 137
+ SPDG ISI GFGSLLSE SARSTFP+L NFR A+L+ FRRVFAHVAP+FFERGIAK
Sbjct: 60 ESLLSPDGHISICGFGSLLSERSARSTFPDLANFRTARLNRFRRVFAHVAPVFFERGIAK 119
Query: 138 PKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPET 197
P+T EISSLSVEPCEG+TL+VTVFEI+K + IP FIKRE EFRFLAV PET
Sbjct: 120 PETMEISSLSVEPCEGETLVVTVFEIRK----------SEIPDFIKREVEFRFLAVLPET 169
Query: 198 LDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHC 257
LDGK F AVLCAR SDEEFF RCKG+KEIYF YGR+NIDKIWRDD+LPCRVYLRHC
Sbjct: 170 LDGKAFDFPAVLCARYSDEEFFDNRCKGNKEIYFQQYGRWNIDKIWRDDVLPCRVYLRHC 229
Query: 258 VLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
VLAAKNLG+ AYNNF+DHT L D KTTIR+YLATTG+GIMEE+PPESLK RYGG
Sbjct: 230 VLAAKNLGETAYNNFVDHTYLADRKTTIRDYLATTGSGIMEEQPPESLKHRYGG 283
>gi|449464742|ref|XP_004150088.1| PREDICTED: uncharacterized protein LOC101211371 [Cucumis sativus]
Length = 306
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 191/287 (66%), Positives = 220/287 (76%), Gaps = 18/287 (6%)
Query: 25 FSFQLRVSTHPPLHSSHCKLHRRRPPPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPD 84
FSF + H PL S L R +S + +N N+ EP +L DESDF I S
Sbjct: 38 FSFSI---AHSPLCSGSGLLQR-----ISPINVLSNSLVNEAEPKELRDESDFEAIFSDS 89
Query: 85 GLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
IS+ GFGSLLSE SARSTFP+LINFRVA+L+GFRR+F +VAP+FFERGIAKP+TKEIS
Sbjct: 90 DYISVCGFGSLLSERSARSTFPDLINFRVARLNGFRRIFGNVAPVFFERGIAKPETKEIS 149
Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
SL EPCEG+ +IVTVFEIKK + +PAFI+RE EFRFLAV PETL GK +
Sbjct: 150 SLCAEPCEGENIIVTVFEIKK----------SEVPAFIQREIEFRFLAVFPETLHGKAYK 199
Query: 205 NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNL 264
AVLC+RS+DEEFFQ+RCKG+K+I+FHHYGR+NIDKIWRDDI PCRVYLRHC+LAAKNL
Sbjct: 200 KPAVLCSRSTDEEFFQVRCKGNKDIFFHHYGRHNIDKIWRDDIFPCRVYLRHCILAAKNL 259
Query: 265 GDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
GD+AYNNFLDHT LGD TTIREYL G GIMEEEPPESLK RYGG
Sbjct: 260 GDKAYNNFLDHTFLGDRSTTIREYLTGYGLGIMEEEPPESLKFRYGG 306
>gi|224110318|ref|XP_002315483.1| predicted protein [Populus trichocarpa]
gi|222864523|gb|EEF01654.1| predicted protein [Populus trichocarpa]
Length = 245
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/247 (76%), Positives = 208/247 (84%), Gaps = 12/247 (4%)
Query: 67 EPIQLNDESDFYQITSPDGLIS--ISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFA 124
EPI+L D+SDF S + GFGSLLSE SARSTFP+L+NFRVA+L+GFRRVFA
Sbjct: 9 EPIELRDDSDFRSFLSSSDSDLISVCGFGSLLSERSARSTFPDLLNFRVARLNGFRRVFA 68
Query: 125 HVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKR 184
HVAPIFFERGIAKP+TKEISSLSVEPCEG+ L+VTVFEI+K + IPAF++R
Sbjct: 69 HVAPIFFERGIAKPETKEISSLSVEPCEGEGLVVTVFEIEK----------SEIPAFMER 118
Query: 185 EHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWR 244
E EFRFLAV PETLDGKPF AVLCAR SDEEFFQ+RCKGSKEIYF HYGRYNI+KIWR
Sbjct: 119 ELEFRFLAVVPETLDGKPFDFPAVLCARYSDEEFFQVRCKGSKEIYFQHYGRYNINKIWR 178
Query: 245 DDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPES 304
DDILPCR YLRHCVLAAKNL D A++NFL+HT LGD KTTIREYLATTGAGIMEEEPPES
Sbjct: 179 DDILPCRAYLRHCVLAAKNLSDAAHSNFLEHTFLGDRKTTIREYLATTGAGIMEEEPPES 238
Query: 305 LKTRYGG 311
LKTRYGG
Sbjct: 239 LKTRYGG 245
>gi|225431810|ref|XP_002271042.1| PREDICTED: uncharacterized protein LOC100265416 [Vitis vinifera]
gi|296083319|emb|CBI22955.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 181/249 (72%), Positives = 208/249 (83%), Gaps = 11/249 (4%)
Query: 64 NKLEPIQLNDESDFYQI-TSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
N E +L+DESDF ++ +S D L+SI GFGSLLSE SARSTFP+LINFRVA+L+GFRRV
Sbjct: 70 NHSELTELHDESDFERLASSNDDLLSICGFGSLLSERSARSTFPDLINFRVARLNGFRRV 129
Query: 123 FAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFI 182
FAHV P+FFERGIAKP+T EIS LSVEPCEG++LIVTVFEI + + IP++I
Sbjct: 130 FAHVTPVFFERGIAKPETMEISGLSVEPCEGESLIVTVFEIHR----------SEIPSYI 179
Query: 183 KREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI 242
KRE E+RFLAV PETLDG PFT+ AVLCAR SDEEF QIRCKGSKE++F YGRYNI KI
Sbjct: 180 KREIEYRFLAVFPETLDGTPFTSPAVLCARYSDEEFLQIRCKGSKEMFFEQYGRYNIHKI 239
Query: 243 WRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPP 302
WRDD+ PCRVYLRHC+LAAK+L D AYN+FLDHT L D KTTI+EYLATTG+GIMEEEPP
Sbjct: 240 WRDDVFPCRVYLRHCLLAAKSLSDAAYNDFLDHTFLADRKTTIQEYLATTGSGIMEEEPP 299
Query: 303 ESLKTRYGG 311
ESLK RYGG
Sbjct: 300 ESLKARYGG 308
>gi|115463405|ref|NP_001055302.1| Os05g0358400 [Oryza sativa Japonica Group]
gi|47777404|gb|AAT38038.1| unknown protein [Oryza sativa Japonica Group]
gi|55167986|gb|AAV43854.1| unknown protein [Oryza sativa Japonica Group]
gi|113578853|dbj|BAF17216.1| Os05g0358400 [Oryza sativa Japonica Group]
gi|215701221|dbj|BAG92645.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215766615|dbj|BAG98677.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 308
Score = 366 bits (939), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 184/274 (67%), Positives = 209/274 (76%), Gaps = 24/274 (8%)
Query: 45 HRRRP-----PPMSSDHAGTNFPANKLEPIQLNDESDFYQITSP--DGLISISGFGSLLS 97
HRR P P ++ +G +FP ++N DF +T+P G +S+ GFGSLLS
Sbjct: 52 HRRLPLMASSPSAAATGSGGSFP-------EMNSVDDFAAVTAPRGGGRVSVVGFGSLLS 104
Query: 98 ENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLI 157
E SARSTFP L FRVA L GFRRVFAH APIFFERGIA TKE SSLSVEPCEG+ ++
Sbjct: 105 ERSARSTFPELEGFRVAALRGFRRVFAHSAPIFFERGIAIEATKEFSSLSVEPCEGELIV 164
Query: 158 VTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEE 217
VTVFEIK+ +PAFI+REHEFRFLAV PE LDG P+ N AV+CAR SDEE
Sbjct: 165 VTVFEIKE----------DEVPAFIEREHEFRFLAVVPEGLDGVPYANPAVVCARYSDEE 214
Query: 218 FFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTS 277
+FQ+RCKGSKEIY+ YGRYNID+IWRDDILPCR YLRHCVLAAKNLG+ AY+NFLDHT
Sbjct: 215 YFQVRCKGSKEIYYQRYGRYNIDRIWRDDILPCRAYLRHCVLAAKNLGEPAYSNFLDHTY 274
Query: 278 LGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
LGD TTIREYLATTGAGIMEEEPPESLK+RYGG
Sbjct: 275 LGDRTTTIREYLATTGAGIMEEEPPESLKSRYGG 308
>gi|218196630|gb|EEC79057.1| hypothetical protein OsI_19621 [Oryza sativa Indica Group]
gi|222631273|gb|EEE63405.1| hypothetical protein OsJ_18217 [Oryza sativa Japonica Group]
Length = 251
Score = 364 bits (935), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 180/264 (68%), Positives = 205/264 (77%), Gaps = 19/264 (7%)
Query: 50 PPMSSDHAGTNFPANKLEPIQLNDESDFYQITSP--DGLISISGFGSLLSENSARSTFPN 107
P ++ +G +FP ++N DF +T+P G +S+ GFGSLLSE SARSTFP
Sbjct: 5 PSAAATGSGGSFP-------EMNSVDDFAAVTAPRGGGRVSVVGFGSLLSERSARSTFPE 57
Query: 108 LINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIR 167
L FRVA L GFRRVFAH APIFFERGIA TKE SSLSVEPCEG+ ++VTVFEIK+
Sbjct: 58 LEGFRVAALRGFRRVFAHSAPIFFERGIAIEATKEFSSLSVEPCEGELIVVTVFEIKE-- 115
Query: 168 GIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSK 227
+PAFI+REHEFRFLAV PE LDG P+ N AV+CAR SDEE+FQ+RCKGSK
Sbjct: 116 --------DEVPAFIEREHEFRFLAVVPEGLDGVPYANPAVVCARYSDEEYFQVRCKGSK 167
Query: 228 EIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIRE 287
EIY+ YGRYNID+IWRDDILPCR YLRHCVLAAKNLG+ AY+NFLDHT LGD TTIRE
Sbjct: 168 EIYYQRYGRYNIDRIWRDDILPCRAYLRHCVLAAKNLGEPAYSNFLDHTYLGDRTTTIRE 227
Query: 288 YLATTGAGIMEEEPPESLKTRYGG 311
YLATTGAGIMEEEPPESLK+RYGG
Sbjct: 228 YLATTGAGIMEEEPPESLKSRYGG 251
>gi|357129373|ref|XP_003566337.1| PREDICTED: uncharacterized protein LOC100832854 [Brachypodium
distachyon]
Length = 309
Score = 363 bits (933), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 189/281 (67%), Positives = 211/281 (75%), Gaps = 20/281 (7%)
Query: 31 VSTHPPLHSSHCKLHRRRPPPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPDGLISIS 90
V + PP+ SH R PP + A T A +L +DF I S DG IS+
Sbjct: 49 VPSQPPISRSH-----RLMPPTA---AATTSEAGSF--TELTSAADFATIASADGRISVI 98
Query: 91 GFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEP 150
GFGSLLSE SARSTFP L FRVA L GFRRVFAH APIFF+RGIA TKE SSLSVE
Sbjct: 99 GFGSLLSERSARSTFPELEGFRVAALRGFRRVFAHAAPIFFDRGIAIEATKEFSSLSVEQ 158
Query: 151 CEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLC 210
CEG+ ++VTVFEIK+ EE +PAFI+RE+EFRFL V PE LD PF N AV+C
Sbjct: 159 CEGEMIVVTVFEIKE------EE----VPAFIERENEFRFLVVVPEGLDQVPFPNPAVVC 208
Query: 211 ARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYN 270
AR SDEE+FQ+RCKGSKEIY HYGRYNIDKIWRDDILPCR+YLRHCVLAAKNLG+ AY+
Sbjct: 209 ARYSDEEYFQVRCKGSKEIYNQHYGRYNIDKIWRDDILPCRLYLRHCVLAAKNLGEAAYS 268
Query: 271 NFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
+FLDHT LGD KTTIREYLATTGAGIMEEEPPESLK+RYGG
Sbjct: 269 SFLDHTYLGDRKTTIREYLATTGAGIMEEEPPESLKSRYGG 309
>gi|449501489|ref|XP_004161381.1| PREDICTED: uncharacterized LOC101211371 [Cucumis sativus]
Length = 304
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 190/288 (65%), Positives = 216/288 (75%), Gaps = 22/288 (7%)
Query: 25 FSFQLRVSTHPPLHSSHCKLHRRRPPPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPD 84
FSF + H PL S L R +S + +N N+ EP +L DESDF I S
Sbjct: 38 FSFSI---AHSPLCSGSGLLQR-----ISPINVLSNSLVNEAEPKELRDESDFEAIFSDS 89
Query: 85 GLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
IS+ GFGSLLSE SARSTFP+LINFRVA+L+GFRR+F +VAP+FFERGIAKP+TKEIS
Sbjct: 90 DYISVCGFGSLLSERSARSTFPDLINFRVARLNGFRRIFGNVAPVFFERGIAKPETKEIS 149
Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
SL EPCEG+ +IVTVFEIKK + +PAFI+RE EFRFLAV PETL GK +
Sbjct: 150 SLCAEPCEGENIIVTVFEIKK----------SEVPAFIQREIEFRFLAVFPETLHGKAYK 199
Query: 205 N-RAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKN 263
N R VLC+RS+DEEFFQ+RCKG FHHYGR+NIDKIWRDDI PCRVYLRHC+LAAKN
Sbjct: 200 NQRYVLCSRSTDEEFFQVRCKGH---LFHHYGRHNIDKIWRDDIFPCRVYLRHCILAAKN 256
Query: 264 LGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
LGD+AYNNFLDHT LGD TTIREYL G GIMEEEPPESLK RYGG
Sbjct: 257 LGDKAYNNFLDHTFLGDRSTTIREYLTGYGLGIMEEEPPESLKFRYGG 304
>gi|449464740|ref|XP_004150087.1| PREDICTED: uncharacterized protein LOC101211120 [Cucumis sativus]
Length = 244
Score = 357 bits (915), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 178/253 (70%), Positives = 202/253 (79%), Gaps = 10/253 (3%)
Query: 59 TNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHG 118
+N N+ EP +L DESDF I S IS+SGFGSLLS SAR TFP LINFRVA+L+G
Sbjct: 2 SNSMLNEAEPKELRDESDFEVIFSDSDYISVSGFGSLLSVRSARGTFPELINFRVARLNG 61
Query: 119 FRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMI 178
FRR F HVAPIFFERGIAKP+TKEISSL EPCE + +I+TVFEIKK + I
Sbjct: 62 FRRFFGHVAPIFFERGIAKPETKEISSLCSEPCEEENIIITVFEIKK----------SEI 111
Query: 179 PAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYN 238
PAFI+RE EFRFLAV PETLDGK + AVLC+R +DEEFFQ+RCKG+K+I FHHYGR+N
Sbjct: 112 PAFIQREIEFRFLAVFPETLDGKIYDKPAVLCSRFTDEEFFQVRCKGNKDILFHHYGRHN 171
Query: 239 IDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIME 298
IDKIWRDDILPCRVYLRHCVLAAKNLG+ AYNNFLDHT LGD +TT+REYL +GIME
Sbjct: 172 IDKIWRDDILPCRVYLRHCVLAAKNLGEIAYNNFLDHTFLGDRRTTVREYLTRYDSGIME 231
Query: 299 EEPPESLKTRYGG 311
EEPPES+K RY G
Sbjct: 232 EEPPESVKFRYEG 244
>gi|449501496|ref|XP_004161383.1| PREDICTED: uncharacterized LOC101211120 [Cucumis sativus]
Length = 244
Score = 354 bits (908), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 177/253 (69%), Positives = 201/253 (79%), Gaps = 10/253 (3%)
Query: 59 TNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHG 118
+N N+ EP +L DESDF I S IS+ GFGSLLS SAR TFP LINFRVA+L+G
Sbjct: 2 SNSMLNEAEPKELRDESDFEVIFSDSDYISVCGFGSLLSVRSARGTFPELINFRVARLNG 61
Query: 119 FRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMI 178
FRR F HVAPIFFERGIAKP+TKEISSL EPCE + +I+TVFEIKK + I
Sbjct: 62 FRRFFGHVAPIFFERGIAKPETKEISSLCSEPCEEENIIITVFEIKK----------SEI 111
Query: 179 PAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYN 238
PAFI+RE EFRFLAV PETLDGK + AVLC+R +DEEFFQ+RCKG+K+I FHHYGR+N
Sbjct: 112 PAFIQREIEFRFLAVFPETLDGKIYDKPAVLCSRFTDEEFFQVRCKGNKDILFHHYGRHN 171
Query: 239 IDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIME 298
IDKIWRDDILPCRVYLRHCVLAAKNLG+ AYNNFLDHT LGD +TT+REYL +GIME
Sbjct: 172 IDKIWRDDILPCRVYLRHCVLAAKNLGEIAYNNFLDHTFLGDRRTTVREYLTRYDSGIME 231
Query: 299 EEPPESLKTRYGG 311
EEPPES+K RY G
Sbjct: 232 EEPPESVKFRYEG 244
>gi|2244964|emb|CAB10385.1| hypothetical protein [Arabidopsis thaliana]
gi|7268355|emb|CAB78648.1| hypothetical protein [Arabidopsis thaliana]
Length = 274
Score = 354 bits (908), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 176/243 (72%), Positives = 196/243 (80%), Gaps = 25/243 (10%)
Query: 69 IQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAP 128
++L DESDF ++ S D ISI+GFGSLLSE SARSTFP+L NFR+AKL GFRRVFAH AP
Sbjct: 57 MELEDESDFEKLLSSDNRISITGFGSLLSERSARSTFPDLENFRIAKLQGFRRVFAHSAP 116
Query: 129 IFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEF 188
IFFERGIA P+TKEISSLSVEPCEG++L+VTVFEIK + IPAFI RE EF
Sbjct: 117 IFFERGIANPETKEISSLSVEPCEGESLVVTVFEIKS----------SEIPAFIGRELEF 166
Query: 189 RFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDIL 248
RFLAV LC R SDEEFFQIRCKG+K IYF HYGR+ IDKIWRDDIL
Sbjct: 167 RFLAV---------------LCGRYSDEEFFQIRCKGNKGIYFQHYGRFKIDKIWRDDIL 211
Query: 249 PCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTR 308
PCR+YLRHCVLAAKNLGDEAYNNFLDHT LGD KTTIREYL++TG+GIMEEEPPE+LK+R
Sbjct: 212 PCRLYLRHCVLAAKNLGDEAYNNFLDHTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSR 271
Query: 309 YGG 311
YGG
Sbjct: 272 YGG 274
>gi|26452147|dbj|BAC43162.1| unknown protein [Arabidopsis thaliana]
gi|28950779|gb|AAO63313.1| At4g16060 [Arabidopsis thaliana]
Length = 228
Score = 348 bits (894), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 169/217 (77%), Positives = 186/217 (85%), Gaps = 10/217 (4%)
Query: 95 LLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQ 154
L S ARSTFP+L NFR+AKLHGFRRVFAH APIFFERGIA P+TKEISSLSVEPCEG+
Sbjct: 22 LFSPREARSTFPDLENFRIAKLHGFRRVFAHSAPIFFERGIANPETKEISSLSVEPCEGE 81
Query: 155 TLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSS 214
+L+VTVFEIK + IPAFI RE EFRFLAV PETL+GKP+TN AVLC R S
Sbjct: 82 SLVVTVFEIKS----------SEIPAFIGRELEFRFLAVVPETLEGKPYTNSAVLCGRYS 131
Query: 215 DEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLD 274
DEEFFQIRCKG+K IYF HYGR+ IDKIWRDDILPCR+YLRHCVLAAKNLGDEAYNNFLD
Sbjct: 132 DEEFFQIRCKGNKGIYFQHYGRFKIDKIWRDDILPCRLYLRHCVLAAKNLGDEAYNNFLD 191
Query: 275 HTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
HT LGD KTTIREYL++TG+GIMEEEPPE+LK+RY G
Sbjct: 192 HTFLGDRKTTIREYLSSTGSGIMEEEPPEALKSRYRG 228
>gi|255551863|ref|XP_002516977.1| conserved hypothetical protein [Ricinus communis]
gi|223544065|gb|EEF45591.1| conserved hypothetical protein [Ricinus communis]
Length = 283
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 178/237 (75%), Positives = 191/237 (80%), Gaps = 24/237 (10%)
Query: 76 DFYQITSPD-GLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERG 134
DF ++ SP GLISI GFGSLLS VA+L+GFRRVFAHVAPIFFERG
Sbjct: 70 DFQKLLSPSSGLISICGFGSLLSGT-------------VARLNGFRRVFAHVAPIFFERG 116
Query: 135 IAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVS 194
IAKP+TKEISSLSVEPCEG+TLIVTVFEI K + IPAF++RE EFRFLAV
Sbjct: 117 IAKPETKEISSLSVEPCEGETLIVTVFEIDK----------SEIPAFMERELEFRFLAVV 166
Query: 195 PETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYL 254
PETLDGKPF + AVLCAR SDEEFF+IRCKGSKE YF HYGRY I KIWRDDILPCR YL
Sbjct: 167 PETLDGKPFDSPAVLCARYSDEEFFRIRCKGSKETYFQHYGRYGISKIWRDDILPCRTYL 226
Query: 255 RHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
RHCVLAAKNLG A+NNFLDHT LGD KTTIR+YLATTGAGIMEEEPPESLKTRYGG
Sbjct: 227 RHCVLAAKNLGSAAHNNFLDHTFLGDRKTTIRKYLATTGAGIMEEEPPESLKTRYGG 283
>gi|147770754|emb|CAN62470.1| hypothetical protein VITISV_016050 [Vitis vinifera]
Length = 234
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 176/247 (71%), Positives = 193/247 (78%), Gaps = 29/247 (11%)
Query: 69 IQLNDESDFYQITSPDG---LISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAH 125
I+L DESDF ++ SPDG L+S+ GFGSLLSE SARSTFP+L+NFRVA+L+ FRRVFAH
Sbjct: 13 IELKDESDFEKVLSPDGGLGLLSVCGFGSLLSERSARSTFPDLMNFRVARLNSFRRVFAH 72
Query: 126 VAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKRE 185
VAPIFFERGIAKP+T EISSLSVEPCEG+TLIVTVFEI++
Sbjct: 73 VAPIFFERGIAKPETMEISSLSVEPCEGETLIVTVFEIQRSE------------------ 114
Query: 186 HEFRFLAVSPETLDGKPFTNRAV-LCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWR 244
V PETLDGK FT AV LCAR SDEEFFQ RCKGSKEIYF YGR+NI KIWR
Sbjct: 115 -------VFPETLDGKLFTTPAVVLCARYSDEEFFQNRCKGSKEIYFQQYGRHNIHKIWR 167
Query: 245 DDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPES 304
DD+LPCRVYLRHCVLAAKNL D AYNNFLDHT L D +TTIREYLATTG GIMEEEPPES
Sbjct: 168 DDVLPCRVYLRHCVLAAKNLSDAAYNNFLDHTFLADRQTTIREYLATTGTGIMEEEPPES 227
Query: 305 LKTRYGG 311
L+ RYGG
Sbjct: 228 LRPRYGG 234
>gi|449464744|ref|XP_004150089.1| PREDICTED: uncharacterized protein LOC101211619 [Cucumis sativus]
gi|449501484|ref|XP_004161380.1| PREDICTED: uncharacterized LOC101211619 [Cucumis sativus]
Length = 243
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 165/245 (67%), Positives = 193/245 (78%), Gaps = 10/245 (4%)
Query: 67 EPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHV 126
EP +L DESDF I S IS+ G+GSLLSE SARSTFP LINFR+A+L+ FRRVF +
Sbjct: 9 EPNELRDESDFEAIFSDSDYISVCGYGSLLSERSARSTFPELINFRIARLNNFRRVFGVI 68
Query: 127 APIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREH 186
APIFFE IAKP+TKEISS+ EPCEG+T+I+TVFEIKK IPAFI+RE
Sbjct: 69 APIFFEHDIAKPETKEISSVFAEPCEGETIIITVFEIKKFE----------IPAFIEREF 118
Query: 187 EFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD 246
FRFL V PETLDGK + AVLC+RS+DEEFFQ++CKG+K+I+ +YGR+N+DKIWRDD
Sbjct: 119 AFRFLTVLPETLDGKLYHKPAVLCSRSTDEEFFQVKCKGNKDIFLQYYGRHNVDKIWRDD 178
Query: 247 ILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLK 306
ILPCRVYLRHC+LAAK + D AYNN LDHT LGD TTIREYL T G+ IMEEEPP+SLK
Sbjct: 179 ILPCRVYLRHCILAAKGVDDVAYNNILDHTFLGDRCTTIREYLVTNGSSIMEEEPPKSLK 238
Query: 307 TRYGG 311
RYGG
Sbjct: 239 FRYGG 243
>gi|168064197|ref|XP_001784051.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664437|gb|EDQ51157.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 262
Score = 314 bits (804), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 151/243 (62%), Positives = 187/243 (76%), Gaps = 11/243 (4%)
Query: 70 QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
+L +E+DF I + DG ISI GFGSLLSE SA TFPNL NFRV LHGFRRVFAHVAP+
Sbjct: 30 ELRNEADFASIANTDGEISICGFGSLLSEKSALYTFPNLRNFRVGVLHGFRRVFAHVAPV 89
Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
F +RGIA +TKE+SSLSVEPC G++++VTVFEI + +PAFI+REHEFR
Sbjct: 90 FLDRGIANIETKELSSLSVEPCVGESIVVTVFEIS----------VSEVPAFIEREHEFR 139
Query: 190 FLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILP 249
FLAV+ E +G+PF+ +AV+CAR SDEE+ +IRC+G E Y+ YGR+ I++IWRDDI P
Sbjct: 140 FLAVTVEDKEGQPFSQQAVICARYSDEEYRRIRCQGMNEEYYRRYGRHKIEQIWRDDIFP 199
Query: 250 CRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLA-TTGAGIMEEEPPESLKTR 308
CRVYLRHCVLAAKNLG++AY +FLDHT + D +TTIR Y G GIM E+PPE+L+ R
Sbjct: 200 CRVYLRHCVLAAKNLGEDAYESFLDHTYVADRRTTIRTYFKHGLGVGIMNEKPPEALRER 259
Query: 309 YGG 311
YGG
Sbjct: 260 YGG 262
>gi|351727499|ref|NP_001236139.1| uncharacterized protein LOC100527697 [Glycine max]
gi|255632976|gb|ACU16842.1| unknown [Glycine max]
Length = 240
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 151/244 (61%), Positives = 177/244 (72%), Gaps = 10/244 (4%)
Query: 63 ANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
A L + L SDF + SPDG ISI G+GSLLSE SAR+TFP+L+NFR+A+L GFRR+
Sbjct: 2 AIPLPSLDLTHHSDFEPLVSPDGYISICGYGSLLSETSARTTFPDLVNFRIARLTGFRRL 61
Query: 123 FAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFI 182
F V IFF +A KT+EI+ LSVEPC+G+ ++V VFEIK T IPAFI
Sbjct: 62 FNSVGRIFFTHRVANIKTQEIAGLSVEPCDGEFVVVAVFEIKN----------TEIPAFI 111
Query: 183 KREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI 242
+RE E+RFLAV PETLDGKPF N AVLCA +DEEFF+ RC +E YF YG Y I KI
Sbjct: 112 EREREYRFLAVVPETLDGKPFINPAVLCASYTDEEFFKFRCSEGREKYFEQYGEYKIHKI 171
Query: 243 WRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPP 302
WRDD+ PCRVYLRHC LAAK+LGDE +NNFLDHT L D KTTIR+Y G I+EEEPP
Sbjct: 172 WRDDVFPCRVYLRHCALAAKSLGDEVHNNFLDHTFLADRKTTIRQYFEKVGTSILEEEPP 231
Query: 303 ESLK 306
ESLK
Sbjct: 232 ESLK 235
>gi|147770755|emb|CAN62471.1| hypothetical protein VITISV_016051 [Vitis vinifera]
Length = 301
Score = 304 bits (778), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 162/257 (63%), Positives = 185/257 (71%), Gaps = 34/257 (13%)
Query: 64 NKLEPIQLNDESDFYQI-TSPDGLISISGFGSLLSENSARSTFPNLIN-FRVAKLHGFR- 120
N E +L+DESDF ++ +S D L+SI GFGSLLSE SARSTFP+ +N + K+
Sbjct: 70 NHSELTELHDESDFEKLASSNDDLLSICGFGSLLSERSARSTFPDPVNQLQSRKIERISY 129
Query: 121 RVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPA 180
RVFAHV P+FFERGIAKP+T EIS LSVEPCEG++LIVTVFEI +
Sbjct: 130 RVFAHVTPVFFERGIAKPETMEISGLSVEPCEGESLIVTVFEIHRSE------------- 176
Query: 181 FIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNID 240
V PETLDG PFT+ AVLCAR SDEEF QIRCKGSKE++F YGRYNI
Sbjct: 177 ------------VFPETLDGTPFTSPAVLCARYSDEEFLQIRCKGSKEMFFEQYGRYNIH 224
Query: 241 KIWRDDILPCRVYLRHC------VLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGA 294
KIWRDD+ PCRVYLRHC +LAAK+L D AYN+FLDHT L D KTTIREYLATTG+
Sbjct: 225 KIWRDDVFPCRVYLRHCCKTLCSLLAAKSLSDAAYNDFLDHTFLADRKTTIREYLATTGS 284
Query: 295 GIMEEEPPESLKTRYGG 311
GIMEEEPPESLK RYGG
Sbjct: 285 GIMEEEPPESLKARYGG 301
>gi|326512630|dbj|BAJ99670.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 279
Score = 297 bits (760), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 159/253 (62%), Positives = 175/253 (69%), Gaps = 37/253 (14%)
Query: 50 PPMSSDHAGTNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLI 109
PP+++ T+ A +L DF I SPDG IS+ GFGSLLSE SARSTFP L
Sbjct: 51 PPLTAAALATSSGAESFS--ELASADDFAAIASPDGHISVIGFGSLLSERSARSTFPELE 108
Query: 110 NFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGI 169
FRVA L GFRRVFAH APIFFERGIA TKE SSLSVE CEG+ ++VTVFEIK+
Sbjct: 109 GFRVAALRGFRRVFAHAAPIFFERGIAIEATKEFSSLSVEHCEGEMIVVTVFEIKE---- 164
Query: 170 KLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEI 229
EE +PAFI+RE EFRFLAV PE LDG P EI
Sbjct: 165 --EE----VPAFIERELEFRFLAVVPEGLDGVP-------------------------EI 193
Query: 230 YFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYL 289
Y HYGRYNIDKIWRDDILPCR+YLRHCVLAAKNLG+ AY+NFLDHT LGD +TTIREYL
Sbjct: 194 YNQHYGRYNIDKIWRDDILPCRLYLRHCVLAAKNLGEAAYSNFLDHTYLGDRRTTIREYL 253
Query: 290 ATTGAGIMEEEPP 302
ATTGAGIMEEEPP
Sbjct: 254 ATTGAGIMEEEPP 266
>gi|302754804|ref|XP_002960826.1| hypothetical protein SELMODRAFT_139369 [Selaginella moellendorffii]
gi|300171765|gb|EFJ38365.1| hypothetical protein SELMODRAFT_139369 [Selaginella moellendorffii]
Length = 242
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 141/244 (57%), Positives = 182/244 (74%), Gaps = 14/244 (5%)
Query: 70 QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
+L + SDF I+S +GL+SI GFGSLLSE SARSTFP L+NFR + FRR+FAH+API
Sbjct: 11 ELENASDFAAISS-NGLVSICGFGSLLSEKSARSTFPRLLNFRPGVVKNFRRIFAHIAPI 69
Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
F ERGIA +TKE+SSLSVEPC G++++VTVFEI + +PAFI+REHEFR
Sbjct: 70 FLERGIANVETKEMSSLSVEPCIGESIVVTVFEISLLE----------VPAFIQREHEFR 119
Query: 190 FL--AVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDI 247
FL AV + + +AV+CA+ SDEE+ IRC+G+K++++ YG++ I +IWRDDI
Sbjct: 120 FLEVAVVNSSEENAIHLCKAVICAKYSDEEYLSIRCQGNKDVFYERYGKHGIQQIWRDDI 179
Query: 248 LPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKT 307
LPCR+YL HCVLAAKNLG +A+ +FLDHT LGD +T IREYL GIM+E+PPE L+
Sbjct: 180 LPCRLYLHHCVLAAKNLGQDAFESFLDHTYLGDRRTRIREYL-DAHPGIMKEQPPEHLRD 238
Query: 308 RYGG 311
RYGG
Sbjct: 239 RYGG 242
>gi|302804182|ref|XP_002983843.1| hypothetical protein SELMODRAFT_119517 [Selaginella moellendorffii]
gi|300148195|gb|EFJ14855.1| hypothetical protein SELMODRAFT_119517 [Selaginella moellendorffii]
Length = 225
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 138/242 (57%), Positives = 176/242 (72%), Gaps = 27/242 (11%)
Query: 70 QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
+L + SDF I+S +GL+SI GFGSLLSE SARSTFP L+NFR + FRR+FAH+API
Sbjct: 11 ELENASDFTAISS-NGLVSICGFGSLLSEKSARSTFPRLLNFRPGVVKNFRRIFAHIAPI 69
Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
F ERGIA +TKE+SSLSVEPC G++++VTVFEI + +PAFI+REHEFR
Sbjct: 70 FLERGIANVETKEMSSLSVEPCIGESIVVTVFEISLLE----------VPAFIQREHEFR 119
Query: 190 FLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILP 249
FL V +CA+ SDEE+ IRC+G+K++++ YG++ I +IWRDDILP
Sbjct: 120 FLEV---------------ICAKYSDEEYLSIRCQGNKDVFYERYGKHGIQQIWRDDILP 164
Query: 250 CRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRY 309
CR+YLRHCVLAAKNLG +A+ +FLDHT LGD +T IREYL IM+E+PPE L+ RY
Sbjct: 165 CRLYLRHCVLAAKNLGQDAFESFLDHTYLGDRRTRIREYL-DAHPDIMKEQPPEHLRDRY 223
Query: 310 GG 311
GG
Sbjct: 224 GG 225
>gi|413948845|gb|AFW81494.1| hypothetical protein ZEAMMB73_001847 [Zea mays]
Length = 255
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/243 (59%), Positives = 170/243 (69%), Gaps = 20/243 (8%)
Query: 17 ACIPRSKAFSFQLRVSTHP-PLHSSHCKLHRRRPPPMS--SDHAGTNFPANKLEPIQLND 73
A +P + +LR+ P P S +L PPP + S AG +FP +L
Sbjct: 29 ATLPPVPLTAARLRLRRQPEPGPSPPTRLSHPMPPPTATMSPAAGGSFP-------ELTC 81
Query: 74 ESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFER 133
+DF + +P G S+ GFGSLLSE SARSTFP L FRVA L GFRRVFAH APIFFER
Sbjct: 82 PADFATVAAPGGRFSVVGFGSLLSERSARSTFPELEGFRVAVLRGFRRVFAHAAPIFFER 141
Query: 134 GIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAV 193
GIA TKE SSLSVEPC+ + ++VTVFEIK+ EE +PAFI+REHEFRFLAV
Sbjct: 142 GIAIEATKEFSSLSVEPCDSELIVVTVFEIKE------EE----VPAFIEREHEFRFLAV 191
Query: 194 SPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVY 253
PE LDG PFTNRAV+CAR S+EE+FQ RC+GSKEIY +YGRYNIDKIWRDDILPCR+Y
Sbjct: 192 VPEGLDGVPFTNRAVVCARYSNEEYFQERCRGSKEIYNKYYGRYNIDKIWRDDILPCRLY 251
Query: 254 LRH 256
LRH
Sbjct: 252 LRH 254
>gi|294462572|gb|ADE76832.1| unknown [Picea sitchensis]
Length = 187
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 127/194 (65%), Positives = 150/194 (77%), Gaps = 10/194 (5%)
Query: 63 ANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
A + +LND +DF I SPDG +SISGFGSLLSE SARSTFPNL FRVA L GFRRV
Sbjct: 3 ATTTQYTELNDAADFDSIASPDGYVSISGFGSLLSERSARSTFPNLQTFRVAVLRGFRRV 62
Query: 123 FAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFI 182
FAH APIFFERGIA+ +TKEISSLSVEPC G+ +IVTVFEI K + +PAFI
Sbjct: 63 FAHTAPIFFERGIARMETKEISSLSVEPCPGEFIIVTVFEIHK----------SEVPAFI 112
Query: 183 KREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI 242
+REHEFRFL PE L GKPF++ +V+C+R SDEE+ +RCKG++E +F YG +NIDKI
Sbjct: 113 EREHEFRFLVAVPEDLFGKPFSHPSVVCSRYSDEEYRLVRCKGNEETFFQRYGHHNIDKI 172
Query: 243 WRDDILPCRVYLRH 256
W+DDILPCRVYLRH
Sbjct: 173 WQDDILPCRVYLRH 186
>gi|307111831|gb|EFN60065.1| hypothetical protein CHLNCDRAFT_18676 [Chlorella variabilis]
Length = 235
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 117/251 (46%), Positives = 154/251 (61%), Gaps = 25/251 (9%)
Query: 61 FPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFR 120
PA +L+D DF + GL++I GFGSLLS+ SA +TFP+L NFR L G+R
Sbjct: 10 IPAVTSGAFELDD--DFGGLACSQGLMTICGFGSLLSKTSALTTFPSLQNFRTGLLRGWR 67
Query: 121 RVFAHVAPIFFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPA 180
RVF H IF+ RGIA+P+T E+SSLSVE G ++V++FE+ I A
Sbjct: 68 RVFTHQCDIFYARGIARPETGEVSSLSVEEHPGSEIVVSLFEVGSDP--------ASIAA 119
Query: 181 FIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNID 240
FI+REHEFR++ V AV+CAR D ++ + RC + + YGR+ +
Sbjct: 120 FIEREHEFRWVGV------------LAVVCARWDDAQYRRRRCPPDE--WQRRYGRHGVQ 165
Query: 241 KIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEE 300
+IW D+++PCRVYLRHCVLAA+ LG EA+ +FLD T L D TTIRE+LA GIM E
Sbjct: 166 QIWTDNVVPCRVYLRHCVLAAQRLGPEAHASFLDGTVLSDRATTIREHLAAN-PGIMHEL 224
Query: 301 PPESLKTRYGG 311
PP SL RY G
Sbjct: 225 PPSSLIGRYSG 235
>gi|302850776|ref|XP_002956914.1| hypothetical protein VOLCADRAFT_119537 [Volvox carteri f.
nagariensis]
gi|300257795|gb|EFJ42039.1| hypothetical protein VOLCADRAFT_119537 [Volvox carteri f.
nagariensis]
Length = 261
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/259 (42%), Positives = 153/259 (59%), Gaps = 39/259 (15%)
Query: 70 QLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPI 129
+L+DE F + DGL++I+GFGSLLSE SAR TFP+L+NFR+ + G+RRVF+H A +
Sbjct: 25 ELDDE--FTGLAGDDGLMTIAGFGSLLSERSARYTFPHLVNFRIGLVPGWRRVFSHTADV 82
Query: 130 FFERGIAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFR 189
FF RGIA+P+T + ++V++FE+ + AFI REHEFR
Sbjct: 83 FFLRGIARPETSS---------HRRGVVVSLFEVPYTP--------ANVAAFISREHEFR 125
Query: 190 FLAVSPETLD-----------------GKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFH 232
F+AV P L G+P +AV+CA ++D ++ +RC ++ +
Sbjct: 126 FVAVQPLELGTGQPLELGTGQPLELGTGQPIGRKAVVCAANTDSDYRALRCPPAE--WSR 183
Query: 233 HYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATT 292
+ + +WR D+LPCRVYLRHCVLAA++LG+EA NFL T L D +TT+ EYLA
Sbjct: 184 RWSVHGCTSVWRTDVLPCRVYLRHCVLAARSLGEEAEENFLRATFLADRRTTVGEYLALH 243
Query: 293 GAGIMEEEPPESLKTRYGG 311
IM+E PP L RY G
Sbjct: 244 -LDIMDELPPPELAERYSG 261
>gi|159485048|ref|XP_001700561.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272201|gb|EDO98005.1| predicted protein [Chlamydomonas reinhardtii]
Length = 205
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 141/231 (61%), Gaps = 32/231 (13%)
Query: 87 ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSL 146
++++GFGSLLSE SAR TFPNL+NFR ++HG+RRVFAH A +FF RGIA+P T ++
Sbjct: 1 MTVAGFGSLLSERSARFTFPNLVNFRAGQIHGWRRVFAHTADVFFVRGIARPDTVRGAAP 60
Query: 147 SVEPCEGQTLIVTVFEIKKIRGIKLEELCT-----MIPAFIKREHEFRFLAVSPETLDGK 201
S L +C + AFI REHEFRF+AV P L+G+
Sbjct: 61 S-----------------------LPYMCVPATRDSVAAFIGREHEFRFVAVQPADLEGR 97
Query: 202 PFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIW-RDDILPCRVYLRHCVLA 260
+AV+CA ++D ++ +RC + + +G + I +W R D+LPCR YLRHCVLA
Sbjct: 98 ALGRKAVVCAANTDADYKAMRCPPQE--WERRWGVHGIQAVWGRGDVLPCRAYLRHCVLA 155
Query: 261 AKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
A +LG EA ++FL T L D +TTI ++LA IMEE+PP +L RY G
Sbjct: 156 AASLGPEAEHSFLHATYLADRRTTIAQHLARDPT-IMEEQPPPALAERYNG 205
>gi|428183575|gb|EKX52432.1| hypothetical protein GUITHDRAFT_65153 [Guillardia theta CCMP2712]
Length = 228
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/226 (49%), Positives = 143/226 (63%), Gaps = 19/226 (8%)
Query: 89 ISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSLSV 148
I GFGSLLSE SARS+FPNL NFR+A+L G++RV H A IFF+R IA+ T+E +SLSV
Sbjct: 19 IIGFGSLLSEASARSSFPNLQNFRLARLRGYKRVMRHPASIFFQRNIARKDTREFASLSV 78
Query: 149 EPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSP-ETLDGKPFTNRA 207
EPC +V F+I EEL A+ +RE EF F+A++P E LDG
Sbjct: 79 EPCPDGNFLVAAFDIPS------EEL----QAYHEREEEF-FIALAPIEELDGSK-VQEG 126
Query: 208 VLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKI--WRDDILPCRVYLRHCVLAAKNLG 265
+LC +D+E+ K ++ + YG Y + I W+ +ILPCRVYLRHCVLA G
Sbjct: 127 LLCCAGTDDEYI---AKHGRKNFEERYGSYGLSTIWNWQGEILPCRVYLRHCVLACSKAG 183
Query: 266 DEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
+E +FL +T L D KTTI+EYL + IMEEEPPESL+ RY G
Sbjct: 184 EEVKEDFLVNTFLWDRKTTIKEYLQANPS-IMEEEPPESLRHRYSG 228
>gi|348664663|gb|EGZ04507.1| hypothetical protein PHYSODRAFT_343221 [Phytophthora sojae]
Length = 216
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 106/231 (45%), Positives = 143/231 (61%), Gaps = 20/231 (8%)
Query: 86 LISISGFGSLLSENSARSTFPNLI-NFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
+++I GFGSLLSE SARSTF + + NFR+A++ +RRVFAH A IFF+RGIA +TKE++
Sbjct: 1 MVAIVGFGSLLSEASARSTFGDGVRNFRLARVLDYRRVFAHPASIFFQRGIADLQTKEMA 60
Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
SLS EP G +V+VF+I + ++P F +RE EF ++ + LDG
Sbjct: 61 SLSTEPAPGCEFLVSVFDIPE----------QLLPDFYEREEEFEIISAKFQELDGTS-G 109
Query: 205 NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD----ILPCRVYLRHCVLA 260
A++C R SDEE+ K +E + Y Y +D IW D ILPCRVYLRHC+LA
Sbjct: 110 EEALMCTRWSDEEYI---AKRGQETFDTKYKAYGLDTIWGWDAQSGILPCRVYLRHCLLA 166
Query: 261 AKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
K LG + Y++F+ T LGD TTI+EY+ + IM E PP L RY G
Sbjct: 167 VKKLGQDVYDDFVTTTYLGDRSTTIKEYIEANPS-IMLELPPPHLVDRYSG 216
>gi|301111564|ref|XP_002904861.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095191|gb|EEY53243.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 205
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 101/227 (44%), Positives = 139/227 (61%), Gaps = 23/227 (10%)
Query: 86 LISISGFGSLLSENSARSTFPNLI-NFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEIS 144
+++I GFGSLLSE ARSTF + + NFR+A + +RRVFAH A IFF+RGIA +TKEI+
Sbjct: 1 MVAIVGFGSLLSEAWARSTFGDGVRNFRLATVLDYRRVFAHPASIFFQRGIADLETKEIA 60
Query: 145 SLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFT 204
SLS EP G +V+VF+I EEL +P F +RE EF+ ++ + + LDG
Sbjct: 61 SLSTEPAPGCKFLVSVFDIP-------EEL---LPDFYEREEEFKIISATFQELDGSA-R 109
Query: 205 NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNL 264
A++C R SD+E+ R +G+ + + YG ++ IW VYLRHC+LA K L
Sbjct: 110 GEALMCTRWSDDEYIAKRGQGTFDTKYKAYG---LNTIW-------GVYLRHCLLAVKKL 159
Query: 265 GDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
G + Y++F T LGD TTI+EY+ + IM E PP L RY G
Sbjct: 160 GQDVYDDFFTTTYLGDRSTTIKEYVEAKPS-IMLERPPPHLVDRYSG 205
>gi|224006305|ref|XP_002292113.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972632|gb|EED90964.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 231
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 105/236 (44%), Positives = 138/236 (58%), Gaps = 17/236 (7%)
Query: 87 ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSL 146
I+I GFGSLLS SAR TFP+L NFR+ ++ RRVF+H A IFF+RGIA+ T E+SSL
Sbjct: 2 ITILGFGSLLSIKSARLTFPSLQNFRLGRIPNHRRVFSHPASIFFQRGIARMDTLEMSSL 61
Query: 147 SVEPCEGQTLIVTVFEI--KKIRGIKLEELCT--MIP--AFIKREHEFRFLAVSPETLDG 200
E EG + I +VFE+ + + I + T IP AF++RE EF +A+ P +
Sbjct: 62 CAEYKEGHSFICSVFEVPNEGLTAIASDGEGTGEWIPSRAFLEREEEFE-IAMVPNSTAS 120
Query: 201 KPFTNRAVLCARSSDEEFFQIRCKGSKEIYFH-HYGRYNIDKIWRDD----ILPCRVYLR 255
VLC RS+DE + E +F Y Y + IW D + PC+VY R
Sbjct: 121 DDTVKYGVLCRRSTDESYL----AHWGEAHFQKQYLDYGVKTIWNWDEDSGLRPCQVYTR 176
Query: 256 HCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
HCVLA++ +G ++FLD T L D +TT+REYL IM EPPE LK RYGG
Sbjct: 177 HCVLASEGIGGACQDSFLDDTYLVDRRTTLREYLNQY-PEIMTTEPPEELKERYGG 231
>gi|255076537|ref|XP_002501943.1| predicted protein [Micromonas sp. RCC299]
gi|226517207|gb|ACO63201.1| predicted protein [Micromonas sp. RCC299]
Length = 264
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 105/249 (42%), Positives = 134/249 (53%), Gaps = 34/249 (13%)
Query: 84 DGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEI 143
DG + + G GSLLSE SAR T PNL F+ +L G+RRVFAH AP+FFERGIA KT+E+
Sbjct: 29 DGWVGVVGIGSLLSERSARHTSPNLRGFKTVRLRGYRRVFAHTAPVFFERGIADGKTREV 88
Query: 144 SSLSVEPCE----------GQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAV 193
SSLS+E E + L+ T FEI EE PA RE EF AV
Sbjct: 89 SSLSMEALEPGDDPREAHGPRGLVATYFEIPG------EEF----PALAAREAEFELRAV 138
Query: 194 S-----PETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD-- 246
+ + G +A+LC RSSD + + C+ K + ++IW
Sbjct: 139 TYEDDEDDGEGGGSDGKKAILCVRSSDAAYAERHCERMKGTGGGGG--FGEERIWYPGRR 196
Query: 247 ----ILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPP 302
+ P R Y RHCVLAAK+LG EA +FLDHT L D +TT+R +L + +ME PP
Sbjct: 197 SPPAVYPTRAYARHCVLAAKSLGPEALASFLDHTFLADRRTTLRTHLERNPS-LMETPPP 255
Query: 303 ESLKTRYGG 311
L RYGG
Sbjct: 256 PELAERYGG 264
>gi|323449443|gb|EGB05331.1| hypothetical protein AURANDRAFT_30996 [Aureococcus anophagefferens]
Length = 225
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 102/235 (43%), Positives = 124/235 (52%), Gaps = 18/235 (7%)
Query: 81 TSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKT 140
T+ G +I GFGSLLSE SAR TFP+L FR+ ++ G+RRVF H A IFFERGIA T
Sbjct: 5 TARVGRTTILGFGSLLSEQSARLTFPDLDEFRLVRVRGYRRVFGHAASIFFERGIADAAT 64
Query: 141 KEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDG 200
KE SSLS EPC + + FE+ + F RE EF LDG
Sbjct: 65 KEFSSLSAEPCADASFVACAFEV--------DVDADAWARFEAREEEFALADAPFAALDG 116
Query: 201 KPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIW---RDDIL-PCRVYLRH 256
VLC R SD F + + +E Y Y ++ IW RD L PC YLRH
Sbjct: 117 GA-GGEGVLCTRGSDAAFVE---RWGRETYERKY-LAHVPSIWDWSRDSGLRPCACYLRH 171
Query: 257 CVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
CVLA G A ++FLD T L D +TT+R YL +M+ PPESL RY G
Sbjct: 172 CVLAVTKAGPAALDSFLDDTYLVDRETTLRTYL-DANPHVMDTRPPESLIGRYSG 225
>gi|397643202|gb|EJK75711.1| hypothetical protein THAOC_02554 [Thalassiosira oceanica]
Length = 357
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/260 (38%), Positives = 128/260 (49%), Gaps = 40/260 (15%)
Query: 87 ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISSL 146
I+I GFGSLLS SAR TFP L NFR+ ++ RRVFAH A IFF+RGIA +T E+SSL
Sbjct: 89 ITILGFGSLLSHKSARLTFPTLKNFRLGRVPDHRRVFAHPASIFFQRGIADMETLEMSSL 148
Query: 147 SVEPCEGQTLIVTVFEIKKIRGIKLEELCT----MIP--AFIKREHEFRFLAVSPETL-- 198
S E EG + + +VFE+ G+ + +P AF++RE EF V E L
Sbjct: 149 SCEYVEGASFVCSVFEVPN-EGLSATGATSGDDNWVPSRAFLEREEEFEIAMVPYEELTS 207
Query: 199 ---DGKPFT-------NRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDD-- 246
DG R S+ +K+ + HY Y ++ IW D
Sbjct: 208 ADVDGGDAEHGIRGPRTREGTSTTISEGATRAATTAATKDRFKRHYADYGVETIWGWDQG 267
Query: 247 --ILPCRVYLRHCVLAAKN----------------LGDEAYNNFLDHTSLGDCKTTIREY 288
+ PC YLRHCVLA+ N G Y +FLD T L D +TT+REY
Sbjct: 268 SGLRPCAAYLRHCVLASWNCGDGTGGGGDAWSADGAGGLCYGSFLDETFLVDRETTVREY 327
Query: 289 LATTGAGIMEEEPPESLKTR 308
L +M EPPE L+ R
Sbjct: 328 LRRF-PDVMATEPPEGLRER 346
>gi|89093034|ref|ZP_01165985.1| hypothetical protein MED92_03118 [Neptuniibacter caesariensis]
gi|89082684|gb|EAR61905.1| hypothetical protein MED92_03118 [Oceanospirillum sp. MED92]
Length = 217
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 116/206 (56%), Gaps = 13/206 (6%)
Query: 86 LISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEISS 145
+ISI G+GSLLSE+SAR T P+L NFR+ ++ G+RR+F V +F R A P++ +++S
Sbjct: 1 MISIIGYGSLLSESSARETVPDLCNFRLVRVPGYRRIFNKVGIVFLSRHGADPQSLQLAS 60
Query: 146 LSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTN 205
+ E + LI + FE + +L E REH F ++ V E++D
Sbjct: 61 CATEADKTSVLICSQFECSEESFSELYE----------REHRFEWVDV--ESVDSSGVKT 108
Query: 206 RAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLG 265
A +C +S+DE + +C KE Y GRY +WRDDILP YL C+ AA+ G
Sbjct: 109 VAKMCTQSTDEHYRFQKCGSEKEFY-ERVGRYYQGPLWRDDILPFPRYLAFCLQAAEGQG 167
Query: 266 DEAYNNFLDHTSLGDCKTTIREYLAT 291
+ +NFLD + L D T+IR YL +
Sbjct: 168 ADVLDNFLDTSYLADGCTSIRRYLDS 193
>gi|219122551|ref|XP_002181606.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406882|gb|EEC46820.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 290
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 130/237 (54%), Gaps = 21/237 (8%)
Query: 84 DGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTKEI 143
D I+I G+GSLLSE SA+STFP L NFR+ ++ +RRVF H A IFF GIA + +I
Sbjct: 66 DETITILGYGSLLSEKSAKSTFPELSNFRLGRIRNYRRVFGHPASIFFATGIADLPSLQI 125
Query: 144 SSLSVE-PCEGQTLIVTVFEIKKIRGIKLEELCTMIP--AFIKRE--HEFRFLAVSPETL 198
+S S E + +V+ F++ E + +P A ++RE ++F + + T
Sbjct: 126 ASASAEFDQDSAGFVVSAFDVPN-------EGLSGVPSRALLEREASYDFEKVVFAGLTK 178
Query: 199 DGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWR----DDILPCRVYL 254
+ P + L RS+D + + K +E + + + I+ IW + PC VYL
Sbjct: 179 NCNP-SMEGWLSVRSTDSAYIK---KWGQERFDRQFRKNGIETIWHWPEDSGLRPCAVYL 234
Query: 255 RHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
RHC LAA+++G+E +N+ LD + L D TTIREYL +++ PP L RY G
Sbjct: 235 RHCYLAAQSMGEECFNSLLDESYLCDRSTTIREYL-NMYPEVLDSVPPPELALRYNG 290
>gi|428177877|gb|EKX46755.1| hypothetical protein GUITHDRAFT_70452, partial [Guillardia theta
CCMP2712]
Length = 207
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 110/215 (51%), Gaps = 12/215 (5%)
Query: 100 SARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIAKPKTK-EISSLSVEPCEGQTLIV 158
SAR TFP+L +FR+ ++ RRVFAH ++GI P T +SLSVEP G L
Sbjct: 2 SARLTFPHLRDFRLVRVQDHRRVFAHPHNFLIQQGIIDPSTTLRFASLSVEPALGHQLTA 61
Query: 159 TVFEI--KKIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDE 216
FE+ K ++++ +F++RE E++F + + G+ VLC + D
Sbjct: 62 AAFELEGKAAELVQVDMDGKQRQSFVEREQEYQFGIATFTDIRGEG-RREGVLCLANDDS 120
Query: 217 EFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHT 276
+ + K +G R + P +YLRHC+LA + G+EA +FLD T
Sbjct: 121 SIPEETRRKLKVPLVWGWGE-------RSGLEPAWIYLRHCILACRKGGEEAERSFLDET 173
Query: 277 SLGDCKTTIREYLATTGAGIMEEEPPESLKTRYGG 311
L D KTTIR+YL +++ PP L++R+ G
Sbjct: 174 FLVDRKTTIRDYLQLH-PEVLDSSPPPYLESRFNG 207
>gi|428178630|gb|EKX47504.1| hypothetical protein GUITHDRAFT_137296 [Guillardia theta CCMP2712]
Length = 218
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 108/216 (50%), Gaps = 22/216 (10%)
Query: 80 ITSPDGL-----ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERG 134
+ +PDG ++I GFGSL+ +SA T N R + GF RVF V+ I RG
Sbjct: 5 LRAPDGAAKGGRVTIIGFGSLMDPSSASQTMRG-SNMRRGTVDGFCRVFNLVSIINIARG 63
Query: 135 IAKPKTKEISSLSVEPCEGQTLIVTVFEIKKIRGIKLEELCTMIPAFIKREHEFRFLAVS 194
AK +E+++ + E L V++ +I K EE AF +RE R V
Sbjct: 64 YAK--NRELATCTAMRRENSFLRVSLMDIDK------EEY----EAFARREARLRHEVVE 111
Query: 195 PETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNIDKIWRDDILPCRVYL 254
+ DG V+C SDEE+ + R + + YF G+ K++R+D+LP YL
Sbjct: 112 YVSDDGDRGV--GVICTAYSDEEYRKERVRDDDD-YFEQVGKLYDGKLYRNDLLPVPNYL 168
Query: 255 RHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLA 290
R V A GD+ +NFLDH+ LGD T +R+Y A
Sbjct: 169 RMVVRAHMRHGDDFLDNFLDHSFLGD-GTRMRDYAA 203
>gi|449016611|dbj|BAM80013.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 263
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 69/251 (27%), Positives = 119/251 (47%), Gaps = 20/251 (7%)
Query: 63 ANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRV 122
A + E + +D + S + + ++G+GSLLS SAR T P L + R+A + G+RR+
Sbjct: 9 AAEAEVLAQSDAVSWVDAGSGERRVRLAGYGSLLSVASARRTCPQLRHHRLAWVQGYRRI 68
Query: 123 FAHVAPIFFERG--IAKPKTKEISSLSVEPCEGQT-LIVTVFEIKKIRGIKLEELCTMIP 179
F V+ RG A +T+E+++ + P + + L V +F+I + +
Sbjct: 69 FCLVSLTCVRRGPPYADWRTREVAACAAFPGDSEDRLAVALFDIPA----------SELQ 118
Query: 180 AFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCKGSKEIYFHHYGRYNI 239
A+++RE + F V+ D +A++C D + + C ++ H G ++
Sbjct: 119 AYLERESRYEFTVVNCLEADSSGI-RKAIMCVAPFDR--WAVACPIREQGPLHDAGCWHW 175
Query: 240 --DKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTTIREYLATTGAGIM 297
+WR D+LP YL C+ AA + + FLD + L D TT+R YL +
Sbjct: 176 YSGPLWRKDVLPSAPYLHLCLRAAAAF--DVLDEFLDASYLADGCTTLRSYLRSLDQWGP 233
Query: 298 EEEPPESLKTR 308
+P KTR
Sbjct: 234 AADPISLEKTR 244
>gi|290997430|ref|XP_002681284.1| hypothetical protein NAEGRDRAFT_78353 [Naegleria gruberi]
gi|284094908|gb|EFC48540.1| hypothetical protein NAEGRDRAFT_78353 [Naegleria gruberi]
Length = 441
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 113/250 (45%), Gaps = 48/250 (19%)
Query: 87 ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERG-IAKPKTKEISS 145
I+I G GSLLSE SAR TFPNLINFR A + + R F VA G + K K +++
Sbjct: 175 ITICGIGSLLSEKSARRTFPNLINFRKAIVKNYVRCFNLVAISSLNFGDLEKRKIASVAA 234
Query: 146 LSVEPCEGQTL--IVTVFEIKKIRGIKLEELCTM--IP-------AFIKREH--EFRFLA 192
+ E +T+ I+ + + R ++E C++ IP A++ REH EF+F+
Sbjct: 235 VHQSDLEEETIRKIIRLQSQDETRQ-EIEMHCSVFEIPLTEEIWCAYMTREHRYEFKFIT 293
Query: 193 VSPETLDG---------------KPFTN---RAVLCARSSDEEFFQIRCKGSKEIYFHHY 234
+ + +G F++ +A++C + +D+ + RC + Y
Sbjct: 294 IDQDFQNGVMQASCSCCENNCQCSEFSSCGCKALMCCQFNDDSYRIERCSSYENEYHERV 353
Query: 235 GRYNIDKIWRDD---------------ILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLG 279
++ ++W I P YL C AA +G A+ NFL + +
Sbjct: 354 AQHFQGRLWYHTKHVQNEKHIHNNEIPIKPIPKYLELCFNAALEMGRSAFENFLQTSVMA 413
Query: 280 DCKTTIREYL 289
D T + +YL
Sbjct: 414 DGVTLLEDYL 423
>gi|387220189|gb|AFJ69803.1| hypothetical protein NGATSA_2070200, partial [Nannochloropsis
gaditana CCMP526]
Length = 63
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/50 (62%), Positives = 43/50 (86%)
Query: 87 ISISGFGSLLSENSARSTFPNLINFRVAKLHGFRRVFAHVAPIFFERGIA 136
+++ GFGSLLSE S+R+TFP L+NFR+ +++G+RR+F H A IFFERGIA
Sbjct: 14 VTLIGFGSLLSETSSRTTFPQLVNFRLGRVNGYRRLFRHPAAIFFERGIA 63
>gi|283782143|ref|YP_003372898.1| hypothetical protein Psta_4391 [Pirellula staleyi DSM 6068]
gi|283440596|gb|ADB19038.1| hypothetical protein Psta_4391 [Pirellula staleyi DSM 6068]
Length = 295
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 103/257 (40%), Gaps = 57/257 (22%)
Query: 57 AGTNFPANKLEPIQLNDESDFYQITSPDGLISISGFGSLLSENSARSTFPNLIN---FRV 113
+G +FP LE + P+ + I G GSLL+ SA + F V
Sbjct: 81 SGFDFPWTNLEE---------HIRQQPNQRVLIVGIGSLLNLESAAEDIQGVNEEEFFPV 131
Query: 114 AKLHGFRRVFAHVAP--IFFERGIAKPKTKEISSLSV-------EPCEGQTLIVTVFEIK 164
+ G +R+F ++ P + E G P+ E ++L++ + G+ L V V ++
Sbjct: 132 VAV-GAKRLFNYLIPGSVLEELG-GNPQPDERAALNMVATGRSNDWFNGRILSVAVQDLD 189
Query: 165 KIRGIKLEELCTMIPAFIKREHEFRFLAVSPETLDGKPFTNRAVLCARSSDEEFFQIRCK 224
+R KREH + +P T ++ +DE F
Sbjct: 190 GLR---------------KREHGYHL----------RPVT--SIRWDSPTDEPFLAYVLC 222
Query: 225 GSKEIYFHHYGRYNIDKIWRDDILPCRVYLRHCVLAAKNLGDEAYNNFLDHTSLGDCKTT 284
E+ GR ID + I P Y+ C+ A ++ E FLD + L D KT+
Sbjct: 223 AEDEVV---EGRQVID----NTIEPNPAYVEICLTGASSVSQEFGAMFLDTSYLADGKTS 275
Query: 285 IREYLATTGAGIMEEEP 301
+R +LAT I +E P
Sbjct: 276 LRTWLATHPLPIADESP 292
>gi|291416492|ref|XP_002724481.1| PREDICTED: MUC12 protein-like, partial [Oryctolagus cuniculus]
Length = 480
Score = 37.4 bits (85), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 23/41 (56%)
Query: 14 ASFACIPRSKAFSFQLRVSTHPPLHSSHCKLHRRRPPPMSS 54
SF +P+ FS Q R + PP+H SHC++H P+ S
Sbjct: 121 GSFDNLPQQPKFSLQHRATCKPPVHHSHCRIHHFPQQPIVS 161
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.138 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,982,761,301
Number of Sequences: 23463169
Number of extensions: 205992551
Number of successful extensions: 390690
Number of sequences better than 100.0: 47
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 390557
Number of HSP's gapped (non-prelim): 48
length of query: 311
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 169
effective length of database: 9,027,425,369
effective search space: 1525634887361
effective search space used: 1525634887361
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)