BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 030495
         (176 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225431386|ref|XP_002279284.1| PREDICTED: uncharacterized protein LOC100266100 [Vitis vinifera]
          Length = 190

 Score =  214 bits (546), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 109/165 (66%), Positives = 126/165 (76%), Gaps = 1/165 (0%)

Query: 1   MMSATILSSSSCNLIRSSGKQIPSHPKPFLGDGQRILN-QLVAKRNPSYHAFQNKKLPHR 59
           M +A++L     NLI SS K+ P HPK F G G +IL+  +  KRN ++   Q   L H+
Sbjct: 1   MAAASLLCLCDTNLIHSSRKESPWHPKLFPGVGHKILDLSVTRKRNRTHPGRQKGNLSHK 60

Query: 60  FSVFAVTEGSAKSSESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTST 119
           FSV A TEGSAKS++SEETIPSWA+PDS EPPPWA DEG G  S+ SFEIPFYVYLLTS 
Sbjct: 61  FSVSATTEGSAKSNKSEETIPSWARPDSSEPPPWAQDEGMGNESEKSFEIPFYVYLLTSA 120

Query: 120 ITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           +TAIAAIGSIFEY NK PVFG+L+SDSIFYAPLLGFF FTG PTS
Sbjct: 121 VTAIAAIGSIFEYANKRPVFGVLSSDSIFYAPLLGFFVFTGIPTS 165


>gi|449473313|ref|XP_004153846.1| PREDICTED: uncharacterized protein LOC101218426 [Cucumis sativus]
 gi|449490678|ref|XP_004158675.1| PREDICTED: uncharacterized protein LOC101229448 [Cucumis sativus]
          Length = 172

 Score =  179 bits (453), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 97/168 (57%), Positives = 115/168 (68%), Gaps = 26/168 (15%)

Query: 2   MSATILS----SSSCNLIRSSGKQIP-SHPKPFLGDGQRILNQLVAKRNPSYHAFQNKKL 56
           MSAT+LS    S  CN  R   K++   H + F             +RN  Y        
Sbjct: 1   MSATLLSYCNPSLVCNYGRMDLKRMALGHLRSF------------GRRNNGYR------- 41

Query: 57  PHRFSVFAVTEGSAKSSESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLL 116
             + +VFAVTEGSAKSSESEETIPSWAK DS+EPPPWA +EGK + +Q  F++PFYVYLL
Sbjct: 42  --KLTVFAVTEGSAKSSESEETIPSWAKLDSEEPPPWAKEEGKEIGTQQGFQVPFYVYLL 99

Query: 117 TSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
            S+ITAIAAIGS+FEYVN+ PVFGI+NSDSI YAPLLGFFA TG PT+
Sbjct: 100 ASSITAIAAIGSVFEYVNQKPVFGIINSDSILYAPLLGFFALTGIPTA 147


>gi|255575261|ref|XP_002528534.1| conserved hypothetical protein [Ricinus communis]
 gi|223532036|gb|EEF33846.1| conserved hypothetical protein [Ricinus communis]
          Length = 176

 Score =  169 bits (427), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 96/163 (58%), Positives = 117/163 (71%), Gaps = 12/163 (7%)

Query: 2   MSATILSSSSCNLIRSSGKQIPSHPKPFLGDGQRILNQLVAKRNPSYHAFQNKKLPHRFS 61
           M+A ++   + N+I++S   +        G  +RILN+   K N S      +K   R +
Sbjct: 1   MAAALIIPCNSNIIQASYSTL-------FGVRRRILNKPTGKVNVS-----TRKFCKRLT 48

Query: 62  VFAVTEGSAKSSESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTIT 121
           VFA TEGSA SS+SEE IP+WAKPDSDEPPPWA DEG G +++ + EIPF+VYLL S IT
Sbjct: 49  VFAATEGSADSSKSEENIPTWAKPDSDEPPPWARDEGSGNSAEQTVEIPFFVYLLASAIT 108

Query: 122 AIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           AIAAIGS+FEY N+NPVFGIL SDSIFYAPLLGFFAFTG PTS
Sbjct: 109 AIAAIGSVFEYANQNPVFGILKSDSIFYAPLLGFFAFTGIPTS 151


>gi|224056729|ref|XP_002298994.1| predicted protein [Populus trichocarpa]
 gi|118484433|gb|ABK94093.1| unknown [Populus trichocarpa]
 gi|222846252|gb|EEE83799.1| predicted protein [Populus trichocarpa]
          Length = 181

 Score =  159 bits (401), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 98/164 (59%), Positives = 118/164 (71%), Gaps = 8/164 (4%)

Query: 1   MMSATILSSSSCNLIRSSGKQIPSHPKPFLGDGQRILNQLVAKRNPSYHAFQNKKLPHRF 60
           M +AT+L   + +LI        S  + F+ +     NQ   K   SY+A+Q +K   +F
Sbjct: 1   MAAATLLCWCNSDLIIHG-----SCSRLFVRNKVMDSNQSTVKSRGSYYAYQKQK---KF 52

Query: 61  SVFAVTEGSAKSSESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTI 120
            V A TEGSAKSS+SEETIPSWAKPDSDEPPPWA  EGK  +S+ +FE+PF+VYLL S I
Sbjct: 53  FVCAATEGSAKSSKSEETIPSWAKPDSDEPPPWAKGEGKENSSKQNFEVPFFVYLLASAI 112

Query: 121 TAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           TAIAAIGSIFEYVN+ PVFG++N DSIFYAPLLGFFAFTG P S
Sbjct: 113 TAIAAIGSIFEYVNQRPVFGVVNPDSIFYAPLLGFFAFTGIPFS 156


>gi|116792022|gb|ABK26201.1| unknown [Picea sitchensis]
          Length = 187

 Score =  156 bits (395), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 76/108 (70%), Positives = 87/108 (80%), Gaps = 3/108 (2%)

Query: 60  FSVFAVTEGSAKSSESEETIPSWAKPDSDEPPPWATDEGKGLTS---QGSFEIPFYVYLL 116
             V A TEGSAKS  S+  IPSWAKPDSDE PPWA  EGKG T+   + +F+IPFYVYLL
Sbjct: 55  LHVVAATEGSAKSEPSDPKIPSWAKPDSDELPPWARKEGKGQTADPAEPTFQIPFYVYLL 114

Query: 117 TSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           +S I AIAA+GS+FEYVN+ PVFG+LNSDS+FYAPLLGFF FTG PTS
Sbjct: 115 SSAIIAIAAVGSVFEYVNQKPVFGLLNSDSVFYAPLLGFFTFTGIPTS 162


>gi|388496594|gb|AFK36363.1| unknown [Lotus japonicus]
          Length = 165

 Score =  148 bits (374), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 82/145 (56%), Positives = 97/145 (66%), Gaps = 17/145 (11%)

Query: 21  QIPSHPKPFLGDGQRILNQLVAKRNPSYHAFQNKKLPHRFSVFAVTEGSAKSSESEETIP 80
           Q  S P PFLG  Q   ++    R         KK P +  V AVT+      +S+E +P
Sbjct: 18  QHASLPTPFLGVPQ-CQSKFFGIR---------KKPPQKLVVVAVTK------DSDEKVP 61

Query: 81  SWAKPDSDEPPPWATDEGKGLTSQGS-FEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVF 139
           SWAKPDSDEPPPWA DE    T+  S FE+PFYVYLL S ITAIAA+GSIFEYVN+ PVF
Sbjct: 62  SWAKPDSDEPPPWARDEANNKTASSSGFEVPFYVYLLASAITAIAAVGSIFEYVNQKPVF 121

Query: 140 GILNSDSIFYAPLLGFFAFTGFPTS 164
           G+L  DS+FYAP+LGFFAFTG P+S
Sbjct: 122 GLLPYDSVFYAPVLGFFAFTGIPSS 146


>gi|351723271|ref|NP_001237273.1| uncharacterized protein LOC100499876 [Glycine max]
 gi|255627331|gb|ACU14010.1| unknown [Glycine max]
          Length = 177

 Score =  148 bits (374), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 91/145 (62%), Positives = 103/145 (71%), Gaps = 15/145 (10%)

Query: 21  QIPSHPKPFLGDGQRILNQLVAKRNPSYHAFQNKKLPHRFSVFAVTEGSAKSSESEETIP 80
           Q  SHP  FLG         V K N      + +K PHR  V AVT+GSA+SS+S+E IP
Sbjct: 22  QHVSHPTTFLG---------VPKNNQ-----RKRKTPHRLVVVAVTQGSAESSKSDEKIP 67

Query: 81  SWAKPDSDEPPPWATDEGKGLTSQGS-FEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVF 139
           SWAKPDSDEPPPWA DE    +SQ   FEIPFY YL  S ITAIAAIGSIFEYVN+ PVF
Sbjct: 68  SWAKPDSDEPPPWARDEPNNNSSQQEGFEIPFYAYLFASAITAIAAIGSIFEYVNQKPVF 127

Query: 140 GILNSDSIFYAPLLGFFAFTGFPTS 164
           G+L+SDS+FYAPLLGFF FTG P+S
Sbjct: 128 GVLSSDSVFYAPLLGFFVFTGIPSS 152


>gi|388510544|gb|AFK43338.1| unknown [Lotus japonicus]
          Length = 171

 Score =  148 bits (373), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 82/145 (56%), Positives = 97/145 (66%), Gaps = 17/145 (11%)

Query: 21  QIPSHPKPFLGDGQRILNQLVAKRNPSYHAFQNKKLPHRFSVFAVTEGSAKSSESEETIP 80
           Q  S P PFLG  Q   ++    R         KK P +  V AVT+      +S+E +P
Sbjct: 18  QHASLPTPFLGVPQ-CQSKFFGIR---------KKPPQKLVVVAVTK------DSDEKVP 61

Query: 81  SWAKPDSDEPPPWATDEGKGLTSQGS-FEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVF 139
           SWAKPDSDEPPPWA DE    T+  S FE+PFYVYLL S ITAIAA+GSIFEYVN+ PVF
Sbjct: 62  SWAKPDSDEPPPWARDEANNKTASSSGFEVPFYVYLLASAITAIAAVGSIFEYVNQKPVF 121

Query: 140 GILNSDSIFYAPLLGFFAFTGFPTS 164
           G+L  DS+FYAP+LGFFAFTG P+S
Sbjct: 122 GLLPYDSVFYAPVLGFFAFTGIPSS 146


>gi|351722993|ref|NP_001237007.1| uncharacterized protein LOC100306066 [Glycine max]
 gi|255627429|gb|ACU14059.1| unknown [Glycine max]
          Length = 182

 Score =  145 bits (366), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 94/171 (54%), Positives = 112/171 (65%), Gaps = 22/171 (12%)

Query: 2   MSATILSSSSCNLIRSSG---KQIPSHPKPFLGDGQRILNQLVAKRNPSYHAFQNKKLPH 58
           M+A+   SSS   +  +G    Q  SHP+ F G         V  +N      + +K PH
Sbjct: 1   MAASGGCSSSFLTLSCAGAAQNQHASHPRTFFG---------VPHKNQ-----RKRKTPH 46

Query: 59  RFSVFAVTEGSAKSSESEETIPSWAKPDSDEPPPWATDE-----GKGLTSQGSFEIPFYV 113
           R  V AVT+GSA+SS+S+E IPSWAKPDSDEPPPWA DE           +  FEIPFY 
Sbjct: 47  RLVVVAVTQGSAESSKSDEKIPSWAKPDSDEPPPWARDEPNNKNSNNSQQEEEFEIPFYA 106

Query: 114 YLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           YLL S ITAIAAIGSIFEYVN+ PVFG+L+SDS+FYAPLLGFF FTG P+S
Sbjct: 107 YLLASAITAIAAIGSIFEYVNQRPVFGVLSSDSVFYAPLLGFFVFTGIPSS 157


>gi|297816416|ref|XP_002876091.1| hypothetical protein ARALYDRAFT_906500 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321929|gb|EFH52350.1| hypothetical protein ARALYDRAFT_906500 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 182

 Score =  143 bits (360), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 81/129 (62%), Positives = 92/129 (71%), Gaps = 13/129 (10%)

Query: 46  PSYHAFQNKKLPH----------RFSVFAVTEGSAKSSESEETIPSWAKPDSDEPPPWAT 95
           P +H    ++  H          R  V A TEGS KS ESE   PSWA PDSDEPPPWA 
Sbjct: 32  PGHHPLAGRRKGHLLHYELTTVRRLVVTAATEGSRKSKESE---PSWANPDSDEPPPWAR 88

Query: 96  DEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGF 155
           +EG+  TSQ SFE+PFYVYLL S ITAIAAIGS+FEY +KNPVFG+L+SDSIFY P+LGF
Sbjct: 89  NEGRSSTSQESFEVPFYVYLLASAITAIAAIGSVFEYSSKNPVFGVLDSDSIFYTPVLGF 148

Query: 156 FAFTGFPTS 164
           FA TG PTS
Sbjct: 149 FALTGIPTS 157


>gi|15230505|ref|NP_190719.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6572072|emb|CAB63015.1| putative protein [Arabidopsis thaliana]
 gi|15027917|gb|AAK76489.1| unknown protein [Arabidopsis thaliana]
 gi|15450643|gb|AAK96593.1| AT3g51510/F26O13_150 [Arabidopsis thaliana]
 gi|20259289|gb|AAM14380.1| unknown protein [Arabidopsis thaliana]
 gi|21536487|gb|AAM60819.1| unknown [Arabidopsis thaliana]
 gi|332645280|gb|AEE78801.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 181

 Score =  142 bits (357), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 85/146 (58%), Positives = 96/146 (65%), Gaps = 14/146 (9%)

Query: 30  LGDGQRILNQLVAKRN-PSYHAFQNKKLPH----------RFSVFAVTEGSAKSSESEET 78
           L  G    N+   K   P +H    ++  H          R  V A TEGS KS ESE  
Sbjct: 14  LSHGDECCNRSPTKSPFPGHHPLAGRRKGHLLHYERSTVRRLVVTAATEGSKKSKESE-- 71

Query: 79  IPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPV 138
            PSWA PDSDEPPPWA +EG+  TSQ SFE+PF+VYLL S ITAIAAIGS+FEY +KNPV
Sbjct: 72  -PSWANPDSDEPPPWARNEGRSSTSQESFEVPFFVYLLASAITAIAAIGSVFEYTSKNPV 130

Query: 139 FGILNSDSIFYAPLLGFFAFTGFPTS 164
           FGIL SDSIFY P+LGFFA TG PTS
Sbjct: 131 FGILESDSIFYTPVLGFFALTGIPTS 156


>gi|413945835|gb|AFW78484.1| hypothetical protein ZEAMMB73_149525 [Zea mays]
          Length = 169

 Score =  129 bits (325), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 60/91 (65%), Positives = 69/91 (75%), Gaps = 1/91 (1%)

Query: 74  ESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYV 133
           E+EE +P+WAKP +DEPPPW   EG     Q + ++PFY YLL STITAIAAIGSIFEY 
Sbjct: 55  EAEEQVPAWAKPGADEPPPW-EREGGAARGQEARQVPFYAYLLASTITAIAAIGSIFEYT 113

Query: 134 NKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           N+ PVFGI+  DS  YAPLLGFF FTG PTS
Sbjct: 114 NQRPVFGIVGPDSALYAPLLGFFVFTGIPTS 144


>gi|117670153|gb|ABK56720.1| unknown [Hordeum vulgare]
          Length = 170

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/99 (60%), Positives = 76/99 (76%), Gaps = 3/99 (3%)

Query: 69  SAKSS--ESEETIPSWAKPDSDEPPPWATDEGKGLTSQG-SFEIPFYVYLLTSTITAIAA 125
           +AKSS  +++E +PSWA+P SDEPPPWA +EG     +G + ++PFY YLL S +TAIAA
Sbjct: 47  AAKSSGEKADEKVPSWARPGSDEPPPWARNEGGASGQEGDAAQVPFYAYLLASAVTAIAA 106

Query: 126 IGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           IGSIFEY N  PVFG++ +DS  YAP+LGFFA TG PTS
Sbjct: 107 IGSIFEYTNGRPVFGVVGTDSPLYAPILGFFAVTGIPTS 145


>gi|224117668|ref|XP_002317638.1| predicted protein [Populus trichocarpa]
 gi|222860703|gb|EEE98250.1| predicted protein [Populus trichocarpa]
          Length = 137

 Score =  121 bits (303), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/88 (75%), Positives = 77/88 (87%)

Query: 77  ETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKN 136
           + +P+WAKPDSDEPPPWA  EGK  +S+ +FE+PF+VYLL S ITAIAAIGSIFEYVN+ 
Sbjct: 25  KKLPTWAKPDSDEPPPWAKGEGKENSSKQNFEVPFFVYLLASAITAIAAIGSIFEYVNQR 84

Query: 137 PVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           PVFGI+N+DSI YAPLLGFFAFTG PTS
Sbjct: 85  PVFGIVNTDSILYAPLLGFFAFTGIPTS 112


>gi|115464611|ref|NP_001055905.1| Os05g0490900 [Oryza sativa Japonica Group]
 gi|50080307|gb|AAT69641.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579456|dbj|BAF17819.1| Os05g0490900 [Oryza sativa Japonica Group]
 gi|215767008|dbj|BAG99236.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218197018|gb|EEC79445.1| hypothetical protein OsI_20429 [Oryza sativa Indica Group]
 gi|222632059|gb|EEE64191.1| hypothetical protein OsJ_19023 [Oryza sativa Japonica Group]
          Length = 172

 Score =  114 bits (285), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 69/104 (66%), Positives = 81/104 (77%), Gaps = 3/104 (2%)

Query: 64  AVTEGSAKSS-ESEETIPSWAKPDSDEPPPWATDEGKGLTSQ--GSFEIPFYVYLLTSTI 120
           A TEGSAKSS E++E +PSWA+P SDEPPPWA + G G   Q  G+ E+PF+ YLL S I
Sbjct: 44  AATEGSAKSSGEADEQVPSWARPGSDEPPPWAREGGGGGGQQEPGAVELPFFAYLLASAI 103

Query: 121 TAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           TAIAAIGSIFEY N+ PVFG+++ DS  YAPLLGFF FTG PTS
Sbjct: 104 TAIAAIGSIFEYANQRPVFGVVSPDSALYAPLLGFFVFTGIPTS 147


>gi|302761140|ref|XP_002963992.1| hypothetical protein SELMODRAFT_69021 [Selaginella moellendorffii]
 gi|300167721|gb|EFJ34325.1| hypothetical protein SELMODRAFT_69021 [Selaginella moellendorffii]
          Length = 109

 Score =  112 bits (280), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 51/85 (60%), Positives = 65/85 (76%)

Query: 80  PSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVF 139
           P WAKP SD PPPWA+ E K ++S+G  ++P+ VYL+ S + AIAA+GSIFEY NKNPVF
Sbjct: 1   PEWAKPGSDVPPPWASGEKKQVSSEGFQDLPYIVYLVASCLVAIAAVGSIFEYFNKNPVF 60

Query: 140 GILNSDSIFYAPLLGFFAFTGFPTS 164
           G++  DS FY P+LGFF+ TG P S
Sbjct: 61  GVIQPDSPFYTPVLGFFSITGIPVS 85


>gi|357128893|ref|XP_003566104.1| PREDICTED: uncharacterized protein LOC100828480 [Brachypodium
           distachyon]
          Length = 181

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 70/111 (63%), Positives = 79/111 (71%), Gaps = 4/111 (3%)

Query: 58  HRFSVFAVTEGSAKSS-ESEETIPSWAKPDSDEPPPWATDEGKGLTSQ---GSFEIPFYV 113
            R    A TEGSAKSS E++E +PSWA+P SDEPPPWA DEG G        + ++PFY 
Sbjct: 46  RRPVAVAATEGSAKSSGEADEQVPSWARPGSDEPPPWARDEGGGGAGGRDPAAVQVPFYA 105

Query: 114 YLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           YLL S ITAIAAIGSIFEY N  PVFGI+  DS  YAP+LGFFA TG PTS
Sbjct: 106 YLLASAITAIAAIGSIFEYTNGRPVFGIVGVDSPLYAPILGFFAVTGVPTS 156


>gi|296088633|emb|CBI37624.3| unnamed protein product [Vitis vinifera]
          Length = 91

 Score =  108 bits (270), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 53/65 (81%), Positives = 57/65 (87%)

Query: 100 GLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFT 159
           G  S+ SFEIPFYVYLLTS +TAIAAIGSIFEY NK PVFG+L+SDSIFYAPLLGFF FT
Sbjct: 2   GNESEKSFEIPFYVYLLTSAVTAIAAIGSIFEYANKRPVFGVLSSDSIFYAPLLGFFVFT 61

Query: 160 GFPTS 164
           G PTS
Sbjct: 62  GIPTS 66


>gi|242090927|ref|XP_002441296.1| hypothetical protein SORBIDRAFT_09g024020 [Sorghum bicolor]
 gi|241946581|gb|EES19726.1| hypothetical protein SORBIDRAFT_09g024020 [Sorghum bicolor]
          Length = 181

 Score =  105 bits (262), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 57/91 (62%), Positives = 70/91 (76%)

Query: 74  ESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYV 133
           E+EE +P+WAKP ++EPPPWA + G     + + ++PFY YLL S ITAIAAIGSIFEY 
Sbjct: 66  EAEEKVPAWAKPGAEEPPPWAREGGAARGPEEAGQVPFYAYLLASAITAIAAIGSIFEYT 125

Query: 134 NKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           N+ PVFGI+ SDS  YAP+LGFF FTG PTS
Sbjct: 126 NQRPVFGIVGSDSALYAPILGFFVFTGIPTS 156


>gi|168040591|ref|XP_001772777.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675854|gb|EDQ62344.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 119

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 50/94 (53%), Positives = 68/94 (72%), Gaps = 2/94 (2%)

Query: 71  KSSESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIF 130
           K +E+E T PSWAKP ++E PPWA +E        S ++PF VYL+ S + AIAA+GSIF
Sbjct: 4   KPTENEPT-PSWAKPGTEELPPWARNEAAAPVDS-SGDLPFPVYLIGSCLVAIAAVGSIF 61

Query: 131 EYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           EY N+NP+FG++  DS  +AP+LG FA TGFP++
Sbjct: 62  EYFNQNPIFGVVQPDSPLWAPILGVFAITGFPSA 95


>gi|307111092|gb|EFN59327.1| hypothetical protein CHLNCDRAFT_137733 [Chlorella variabilis]
          Length = 152

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 38/60 (63%), Positives = 46/60 (76%)

Query: 105 GSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           GSF++PF  YLL S+ TAIAA+GSIFEY N+ PVF ++  DS  Y P+LGFFA TG PTS
Sbjct: 69  GSFDLPFGAYLLFSSFTAIAAVGSIFEYANQRPVFDVIQPDSPLYTPILGFFAITGLPTS 128


>gi|302835946|ref|XP_002949534.1| hypothetical protein VOLCADRAFT_104324 [Volvox carteri f.
           nagariensis]
 gi|300265361|gb|EFJ49553.1| hypothetical protein VOLCADRAFT_104324 [Volvox carteri f.
           nagariensis]
          Length = 163

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/81 (51%), Positives = 57/81 (70%), Gaps = 2/81 (2%)

Query: 86  DSDEPPPWATDEGKG--LTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILN 143
           + +E PPW   E +      QGS  +P+ +YLL S  TAIAA+GSIFE+V++NPVFG+L 
Sbjct: 58  EDEELPPWIRREKERELQAQQGSSGLPWPLYLLFSIFTAIAAVGSIFEFVDRNPVFGVLP 117

Query: 144 SDSIFYAPLLGFFAFTGFPTS 164
            D+  +AP+L FFA TGFPT+
Sbjct: 118 PDNPLWAPILLFFAVTGFPTA 138


>gi|145345461|ref|XP_001417228.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577455|gb|ABO95521.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 163

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 51/80 (63%), Gaps = 1/80 (1%)

Query: 85  PDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNS 144
           PD D+ P W   E +   +     +P+  YLL S I  IAA GS+FEY  KNP+FG++ +
Sbjct: 61  PD-DDVPIWERRELERKAADAKGGLPWPAYLLLSVIVLIAATGSMFEYAYKNPIFGVVGA 119

Query: 145 DSIFYAPLLGFFAFTGFPTS 164
           DS  YAP+LG+F FTGFP +
Sbjct: 120 DSGLYAPILGWFVFTGFPLA 139


>gi|159475535|ref|XP_001695874.1| hypothetical protein CHLREDRAFT_130491 [Chlamydomonas reinhardtii]
 gi|158275434|gb|EDP01211.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 162

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 56/94 (59%), Gaps = 2/94 (2%)

Query: 73  SESEETIPSWAKPDSDEPPPWATDEGKG--LTSQGSFEIPFYVYLLTSTITAIAAIGSIF 130
           ++ E+  P       DE PPW   E +       G   +P+ + LL S  TAIAA+GSIF
Sbjct: 44  AQPEQKTPPTVPGSEDELPPWVRREKERELQAKDGVSGLPWGLCLLFSVFTAIAAVGSIF 103

Query: 131 EYVNKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           E+V++N +FG++  DS  +AP+L FF  TGFPT+
Sbjct: 104 EFVDRNAIFGVIQPDSPLWAPILLFFGVTGFPTA 137


>gi|384247811|gb|EIE21297.1| hypothetical protein COCSUDRAFT_83521, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 105

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 56/91 (61%), Gaps = 10/91 (10%)

Query: 74  ESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYV 133
           E EE +P W + + +     A DEG  L         F VYL+ S + AIAA+GSIFE+ 
Sbjct: 1   EGEEPLPPWIRGERERK--LAADEGSDLL--------FPVYLIGSALVAIAAVGSIFEFA 50

Query: 134 NKNPVFGILNSDSIFYAPLLGFFAFTGFPTS 164
           N+NP+FG+L   +  +AP+L FF+ TGFP++
Sbjct: 51  NRNPIFGVLPPSNFLWAPILLFFSITGFPSA 81


>gi|412985780|emb|CCO16980.1| predicted protein [Bathycoccus prasinos]
          Length = 158

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/74 (45%), Positives = 48/74 (64%)

Query: 91  PPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYA 150
           PPW   E +   S+ S E P+ V+LL S IT +AA GS+FE+    P+FG++++ S  Y 
Sbjct: 60  PPWERREMEKKLSKESGEFPWPVFLLGSLITLLAATGSVFEWTFSKPIFGVVDASSGMYK 119

Query: 151 PLLGFFAFTGFPTS 164
           P+LG+F  TGFP S
Sbjct: 120 PILGWFIVTGFPLS 133


>gi|303282241|ref|XP_003060412.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457883|gb|EEH55181.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 123

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 47/79 (59%)

Query: 86  DSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNSD 145
           D ++ PPW   E +   +     +P+  YL  + I AIAAIGS FE   +NP+FG++ +D
Sbjct: 21  DVEDIPPWERREIEKKAAMEKGGLPWPAYLGLAVIVAIAAIGSCFELTYENPIFGVVGAD 80

Query: 146 SIFYAPLLGFFAFTGFPTS 164
           S  Y P+L +F  TGFP +
Sbjct: 81  SFAYKPILFWFIGTGFPLA 99


>gi|255085550|ref|XP_002505206.1| predicted protein [Micromonas sp. RCC299]
 gi|226520475|gb|ACO66464.1| predicted protein [Micromonas sp. RCC299]
          Length = 99

 Score = 58.9 bits (141), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 44/77 (57%), Gaps = 6/77 (7%)

Query: 91  PPWATDEGKGLTSQGSFE---IPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSI 147
           PPW   E   L  + + E   +P+  YL  S I +IAAIGS FE    NP+FG++  DS 
Sbjct: 2   PPWERRE---LMKKAAMEKGGLPWPAYLGLSVIVSIAAIGSCFELNYGNPIFGVVGPDSF 58

Query: 148 FYAPLLGFFAFTGFPTS 164
            Y P+L +F  TGFP +
Sbjct: 59  LYKPILYWFIGTGFPLA 75


>gi|449019211|dbj|BAM82613.1| hypothetical protein CYME_CMR438C [Cyanidioschyzon merolae strain
           10D]
          Length = 138

 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 32/51 (62%)

Query: 107 FEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYAPLLGFFA 157
           F IP Y YL  S++ AIAAIGS+FE  +  P +G++ + S+    L GF A
Sbjct: 73  FPIPAYGYLGISSVLAIAAIGSVFELSSGKPQYGVVFTASVLALSLPGFLA 123


>gi|302664314|ref|XP_003023788.1| 37S ribosomal protein S9 [Trichophyton verrucosum HKI 0517]
 gi|291187804|gb|EFE43170.1| 37S ribosomal protein S9 [Trichophyton verrucosum HKI 0517]
          Length = 403

 Score = 40.0 bits (92), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 10/117 (8%)

Query: 16  RSSGKQIPSHPKPFL--GDGQRILN-QLVAKRNPSYH-------AFQNKKLPHRFSVFAV 65
           R  G++  +H   +L  GDGQ I+N + +++  P  H       A +     H+++VFAV
Sbjct: 258 RGIGRRKTAHATAYLVEGDGQIIINGKTISEVFPRIHDRESAMWALKATNRLHKYNVFAV 317

Query: 66  TEGSAKSSESEETIPSWAKPDSDEPPPWATDEGKGLTSQGSFEIPFYVYLLTSTITA 122
             G   + ++E    + AK      P   +   +G +S     +  YV++LTST TA
Sbjct: 318 ARGGGITGQAEAVTLAVAKSLLVHEPGLKSALRRGKSSPLMLMVLNYVFILTSTSTA 374


>gi|428309243|ref|YP_007120220.1| hypothetical protein Mic7113_0915 [Microcoleus sp. PCC 7113]
 gi|428250855|gb|AFZ16814.1| hypothetical protein Mic7113_0915 [Microcoleus sp. PCC 7113]
          Length = 65

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 113 VYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYA--PLLGFFAFTGFPTSRS 166
           ++L +S + AIAA+GSIFE  + NP  G L +  I  A  PL G F +     +R+
Sbjct: 8   LFLGSSCVAAIAAVGSIFELSSGNPQLGTLVTGIILAASVPLTGLFFYAAVRDARA 63


>gi|443312300|ref|ZP_21041918.1| hypothetical protein Syn7509DRAFT_00015170 [Synechocystis sp. PCC
           7509]
 gi|442777538|gb|ELR87813.1| hypothetical protein Syn7509DRAFT_00015170 [Synechocystis sp. PCC
           7509]
          Length = 66

 Score = 35.8 bits (81), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 113 VYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYA--PLLGFFAF 158
           ++L    ITAIA++GSIFE  + NP  G LN+  I     PL  FF F
Sbjct: 10  LFLGGCCITAIASVGSIFELASGNPDLGSLNTSIILAVAIPLTVFFFF 57


>gi|255722343|ref|XP_002546106.1| hypothetical protein CTRG_00888 [Candida tropicalis MYA-3404]
 gi|240136595|gb|EER36148.1| hypothetical protein CTRG_00888 [Candida tropicalis MYA-3404]
          Length = 921

 Score = 35.8 bits (81), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 3/78 (3%)

Query: 96  DEGKGLTSQGSFEIPFYVYLLTSTITAIAAIGSIFEYVNKNPVFGI---LNSDSIFYAPL 152
           D    +  Q   E+P + YL+   I  + AI  +  Y  + PV+GI   L  + IF  PL
Sbjct: 584 DPHSKMMRQNYSEVPEWAYLIVLVIALVLAIVCVTVYPAQTPVWGIFFALGINFIFLIPL 643

Query: 153 LGFFAFTGFPTSRSIGPE 170
              +A TGF    ++  E
Sbjct: 644 TTIYARTGFAFGLNVLVE 661


>gi|119479739|ref|XP_001259898.1| phosphatase family protein [Neosartorya fischeri NRRL 181]
 gi|119408052|gb|EAW18001.1| phosphatase family protein [Neosartorya fischeri NRRL 181]
          Length = 1062

 Score = 35.8 bits (81), Expect = 7.2,   Method: Composition-based stats.
 Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 5/42 (11%)

Query: 62  VFAVTEGSAKSSESEETIPSWA--KPDSDEP-PPWATDEGKG 100
           +F +TE  A S +SE  I  W   K DSDE  PPWA D+G G
Sbjct: 767 LFRLTE--AISEQSERAIAEWGMTKSDSDEELPPWAKDQGAG 806


>gi|254409658|ref|ZP_05023439.1| hypothetical protein MC7420_7291 [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196183655|gb|EDX78638.1| hypothetical protein MC7420_7291 [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 64

 Score = 35.8 bits (81), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 113 VYLLTSTITAIAAIGSIFEYVNKNPVFGILNSDSIFYA--PLLGFFAFTGFPTSRS 166
           ++L+ S ITAIAA+GSIFE  +  P  G   ++ I     PL GF  +     +R+
Sbjct: 7   LFLMGSCITAIAAVGSIFELSSGTPQLGASVTEIILAGSIPLSGFLFYAAVRDARA 62


>gi|189241668|ref|XP_967398.2| PREDICTED: similar to brahma CG5942-PA, partial [Tribolium castaneum]
          Length = 1402

 Score = 35.4 bits (80), Expect = 8.5,   Method: Composition-based stats.
 Identities = 18/70 (25%), Positives = 27/70 (38%)

Query: 37   LNQLVAKRNPSYHAFQNKKLPHRFSVFAVTEGSAKSSESEETIPSWAKPDSDEPPPWATD 96
            +NQ+VA+    +  FQ   L  R     +             +P W   D DE  PW  D
Sbjct: 1103 VNQMVARSEAEFELFQKMDLERRREEAKLGPNRKPRMMEISELPDWLVKDDDEVDPWNYD 1162

Query: 97   EGKGLTSQGS 106
            E +    +G+
Sbjct: 1163 ETESALGRGT 1172


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.400 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,859,112,221
Number of Sequences: 23463169
Number of extensions: 117670690
Number of successful extensions: 304961
Number of sequences better than 100.0: 52
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 304911
Number of HSP's gapped (non-prelim): 57
length of query: 176
length of database: 8,064,228,071
effective HSP length: 132
effective length of query: 44
effective length of database: 9,262,057,059
effective search space: 407530510596
effective search space used: 407530510596
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 71 (32.0 bits)