BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 028862
         (202 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255559869|ref|XP_002520953.1| conserved hypothetical protein [Ricinus communis]
 gi|223539790|gb|EEF41370.1| conserved hypothetical protein [Ricinus communis]
          Length = 202

 Score =  212 bits (539), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 110/197 (55%), Positives = 144/197 (73%), Gaps = 9/197 (4%)

Query: 9   NMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
           +++EYSA++T I+FD  IPLLRGP+P    DDPSSG Y LAFR+ QSWA+AY+  ESK++
Sbjct: 8   DLDEYSASATTIEFDFQIPLLRGPIPAAESDDPSSGPYFLAFRNPQSWATAYKLSESKII 67

Query: 66  SQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGF 125
           SQCE GARIGC+++AS+KCKPPWWRNLIG + AD KE+E+CE REME CL+ AK+KC GF
Sbjct: 68  SQCEGGARIGCAISASQKCKPPWWRNLIGWKVADLKEKENCEEREMEGCLVAAKEKCVGF 127

Query: 126 AKDRLLRPFQDARIAGRVGPKEVEK-LVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
           AK++  RPF +ARIA  VG +EV K L+C  S+PE     W  L+G +   +  F + TN
Sbjct: 128 AKEKCFRPFSEARIA--VGKEEVVKRLMCLVSVPEKTGIKWFRLMGID--DKCRFGI-TN 182

Query: 185 YRAADLLGSDNNYKRFL 201
            +A++LL  D  Y  F 
Sbjct: 183 QKASELLAFDPKYTWFF 199


>gi|224062315|ref|XP_002300815.1| predicted protein [Populus trichocarpa]
 gi|222842541|gb|EEE80088.1| predicted protein [Populus trichocarpa]
          Length = 201

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 111/197 (56%), Positives = 140/197 (71%), Gaps = 10/197 (5%)

Query: 9   NMEEYSAASTVIKFDKPIPLLRGPVPH---DDPSSGSYLLAFRSHQSWASAYESCESKVL 65
           N++EYSA++T IKFD+PIPLLRGP+P    DDPS   ++LAFRS QSWA AY++CESK++
Sbjct: 8   NLDEYSASATTIKFDRPIPLLRGPIPAGPPDDPSISPHVLAFRSPQSWAVAYKTCESKII 67

Query: 66  SQCEHGARIGCSVNASKKCKPPWWRNLI-GQRPADFKEREDCEVREMEACLIVAKDKCAG 124
           SQCE GAR GC++ AS KCKPPWWR  I G R  D KERE CE RE+E CL+ AKDKC G
Sbjct: 68  SQCEEGARFGCAITASNKCKPPWWRGFIGGARLEDLKERERCEDRELEGCLVAAKDKCVG 127

Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
           FAK++  +PF+DARIA R   + V  LV   S+P+   S W  LIG    G+  F V TN
Sbjct: 128 FAKEKCWKPFRDARIAVR-EEEVVRNLVSLVSVPQ-RSSKWISLIG---FGKFNFGV-TN 181

Query: 185 YRAADLLGSDNNYKRFL 201
           +RA++ LGS+ N+  F 
Sbjct: 182 FRASEFLGSNKNHAWFF 198


>gi|356570604|ref|XP_003553475.1| PREDICTED: uncharacterized protein LOC100790558 [Glycine max]
          Length = 156

 Score =  179 bits (453), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 84/144 (58%), Positives = 113/144 (78%), Gaps = 4/144 (2%)

Query: 1   MDSTGTNSNMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAY 57
           MD++  + N+++YS +STVIKFD+P+PLLRGP+P    DDPS+G Y+LAFR  ++WAS++
Sbjct: 1   MDASEPD-NVDDYSPSSTVIKFDRPVPLLRGPLPAGPSDDPSAGPYVLAFRDSRAWASSF 59

Query: 58  ESCESKVLSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIV 117
            +CE K++ QCE GARIGC+V+AS+KCK PWW++L G   +D KERE CEVREM  C   
Sbjct: 60  AACERKIVEQCEEGARIGCAVSASRKCKLPWWKSLAGPTLSDLKEREQCEVREMADCEAA 119

Query: 118 AKDKCAGFAKDRLLRPFQDARIAG 141
           AK+KC GFA+D+ L PF+DARI G
Sbjct: 120 AKEKCVGFARDKCLVPFRDARIRG 143


>gi|449455252|ref|XP_004145367.1| PREDICTED: uncharacterized protein LOC101218153 [Cucumis sativus]
 gi|449472029|ref|XP_004153474.1| PREDICTED: uncharacterized protein LOC101205543 [Cucumis sativus]
          Length = 162

 Score =  173 bits (438), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 85/146 (58%), Positives = 107/146 (73%), Gaps = 6/146 (4%)

Query: 9   NMEEYSAASTVIKFDKPIPLLRGPV---PHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
           N  EYS++ST+I F +PIPLLRGPV     ++PS+G YLLAFR  Q+W SA+ +CESK++
Sbjct: 19  NPHEYSSSSTIITFQRPIPLLRGPVRASQSENPSAGPYLLAFRDRQAWESAFRACESKII 78

Query: 66  SQCEHGARIGCSVNASKKCKPPWWRNLI-GQRPADFKEREDCEVREMEACLIVAKDKCAG 124
            QCE GARIGCS+ AS KCKPPWW  L+  ++  D KERE CE  EMEACL VAK+KC G
Sbjct: 79  EQCEVGARIGCSITASNKCKPPWWGFLLRSKKGLDLKEREQCEELEMEACLAVAKEKCVG 138

Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEK 150
           FAK++  +PF +AR+ G  G K  EK
Sbjct: 139 FAKEKCYKPFMEARVVG--GRKLTEK 162


>gi|225437166|ref|XP_002274823.1| PREDICTED: uncharacterized protein LOC100254442 [Vitis vinifera]
 gi|296084494|emb|CBI25053.3| unnamed protein product [Vitis vinifera]
          Length = 208

 Score =  172 bits (436), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 96/199 (48%), Positives = 134/199 (67%), Gaps = 10/199 (5%)

Query: 6   TNSNMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCES 62
           T +++ +YS ++T+  FD P+PLLRGPVP    DDPS+G ++LAFR  ++W SA+++CES
Sbjct: 3   TPTDLSDYSPSATITPFDCPVPLLRGPVPASPSDDPSAGPFVLAFRDSRAWRSAFKACES 62

Query: 63  KVLSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKC 122
           K++ QCE GARIGC+V+AS KC PPWW  LIG+ P    ERE+CE REM AC+  +K+KC
Sbjct: 63  KIIEQCEAGARIGCAVSASSKCGPPWWWALIGRSPPSLAEREECEEREMAACVAASKEKC 122

Query: 123 AGFAKDRLLRPFQDARIA------GRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGE 176
             FAKD+ L PF+DARIA       R   +E  +L+ WAS+     S+  + +GS W+  
Sbjct: 123 VKFAKDKCLGPFRDARIAVNGRDLTRKKKREAVELIFWASMGGRRNSIGLENLGS-WVLS 181

Query: 177 REFRVRTNYRAADLLGSDN 195
           +     TNYR +DLL + N
Sbjct: 182 KCKLGETNYRGSDLLDTQN 200


>gi|297821264|ref|XP_002878515.1| hypothetical protein ARALYDRAFT_324763 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324353|gb|EFH54774.1| hypothetical protein ARALYDRAFT_324763 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 173

 Score =  172 bits (436), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 91/181 (50%), Positives = 118/181 (65%), Gaps = 19/181 (10%)

Query: 11  EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
           ++YSAA+T + F++PIPLLRGPVP    S GSY+LAFRS  SW++A++ CE+ +  QC+ 
Sbjct: 5   DDYSAAATTVVFERPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCQE 60

Query: 71  GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
           GARIGC+V+AS  CKPPWWR        D +ER+ CE RE +AC+  +K KCA FAKD+ 
Sbjct: 61  GARIGCAVSASNNCKPPWWRG-----SGDMRERDKCEEREFQACVAASKGKCAAFAKDKC 115

Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAADL 190
              F DARI+     KEVE +VC AS+PE  R  WRDL+G   +G           A DL
Sbjct: 116 SGAFLDARIS-----KEVEGMVCLASMPEHSR--WRDLMG---IGSLHLHTNNCCTARDL 165

Query: 191 L 191
           L
Sbjct: 166 L 166


>gi|449530293|ref|XP_004172130.1| PREDICTED: uncharacterized LOC101205543 [Cucumis sativus]
          Length = 151

 Score =  172 bits (436), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 81/137 (59%), Positives = 103/137 (75%), Gaps = 4/137 (2%)

Query: 9   NMEEYSAASTVIKFDKPIPLLRGPV---PHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
           N  EYS++ST+I F +PIPLLRGPV     ++PS+G YLLAFR  Q+W SA+ +CESK++
Sbjct: 8   NPHEYSSSSTIITFQRPIPLLRGPVRASQSENPSAGPYLLAFRDRQAWESAFRACESKII 67

Query: 66  SQCEHGARIGCSVNASKKCKPPWWRNLI-GQRPADFKEREDCEVREMEACLIVAKDKCAG 124
            QCE GARIGCS+ AS KCKPPWW  L+  ++  D KERE CE  EMEACL VAK+KC G
Sbjct: 68  EQCEVGARIGCSITASNKCKPPWWGFLLRSKKGLDLKEREQCEELEMEACLAVAKEKCVG 127

Query: 125 FAKDRLLRPFQDARIAG 141
           FAK++  +PF +AR+ G
Sbjct: 128 FAKEKCYKPFMEARVVG 144


>gi|18412754|ref|NP_567146.1| uncharacterized protein [Arabidopsis thaliana]
 gi|98960925|gb|ABF58946.1| At3g63390 [Arabidopsis thaliana]
 gi|332646953|gb|AEE80474.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 175

 Score =  150 bits (379), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 95/174 (54%), Positives = 117/174 (67%), Gaps = 17/174 (9%)

Query: 11  EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
           +++SAA+T + FD+PIPLLRGPVP    S GSY+LAFRS  SW++A++ CE+ +  QCE 
Sbjct: 6   DDFSAAATTVVFDRPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCEE 61

Query: 71  GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
           GA+IGC+V+AS  CKPPWWR   G    D +ERE CE RE E C+  AK KCA FAKD+ 
Sbjct: 62  GAKIGCAVSASNNCKPPWWRGSGG----DMREREKCEEREFEGCVAAAKGKCAAFAKDKC 117

Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
              F DARIA     KEVE LV  AS+PE  R  WRDL+G   LG    R+ TN
Sbjct: 118 SGAFLDARIAKD---KEVEGLVWLASMPEESR--WRDLMG---LGG-SLRLHTN 162


>gi|7573322|emb|CAB87792.1| putative protein [Arabidopsis thaliana]
 gi|21554112|gb|AAM63192.1| unknown [Arabidopsis thaliana]
          Length = 174

 Score =  150 bits (378), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 95/174 (54%), Positives = 117/174 (67%), Gaps = 17/174 (9%)

Query: 11  EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
           +++SAA+T + FD+PIPLLRGPVP    S GSY+LAFRS  SW++A++ CE+ +  QCE 
Sbjct: 5   DDFSAAATTVVFDRPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCEE 60

Query: 71  GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
           GA+IGC+V+AS  CKPPWWR   G    D +ERE CE RE E C+  AK KCA FAKD+ 
Sbjct: 61  GAKIGCAVSASNNCKPPWWRGSGG----DMREREKCEEREFEGCVAAAKGKCAAFAKDKC 116

Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
              F DARIA     KEVE LV  AS+PE  R  WRDL+G   LG    R+ TN
Sbjct: 117 SGAFLDARIAKD---KEVEGLVWLASMPEESR--WRDLMG---LGG-SLRLHTN 161


>gi|110736892|dbj|BAF00403.1| hypothetical protein [Arabidopsis thaliana]
          Length = 167

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 95/171 (55%), Positives = 114/171 (66%), Gaps = 17/171 (9%)

Query: 14  SAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGAR 73
           SAA+T + FD+PIPLLRGPVP    S GSY+LAFRS  SW++A++ CE+ +  QCE GA+
Sbjct: 1   SAAATTVVFDRPIPLLRGPVP----SGGSYVLAFRSLDSWSAAFKRCETLIKDQCEEGAK 56

Query: 74  IGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRLLRP 133
           IGC+V+AS  CKPPWWR   G    D +ERE CE RE E C+  AK KCA FAKD+    
Sbjct: 57  IGCAVSASNNCKPPWWRGSGG----DMREREKCEEREFEGCVAAAKGKCAAFAKDKCSGA 112

Query: 134 FQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTN 184
           F DARIA     KEVE LV  AS+PE  R  WRDL+G   LG    R+ TN
Sbjct: 113 FLDARIAKD---KEVEGLVWLASMPEESR--WRDLMG---LGG-SLRLHTN 154


>gi|125535103|gb|EAY81651.1| hypothetical protein OsI_36821 [Oryza sativa Indica Group]
          Length = 206

 Score =  143 bits (360), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 123/201 (61%), Gaps = 12/201 (5%)

Query: 5   GTNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKV 64
           G+    E+YS A+TV++FD P+PLLR PVP    S    +LAFR   SW +A+E+ E+ +
Sbjct: 8   GSVDQSEDYSPAATVVRFDPPLPLLRAPVPSSAASGEPPVLAFRDAASWRAAWEAAEASL 67

Query: 65  LSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAG 124
           LSQCE GAR GCS+ AS+KCKPPWW+ L+G  P D++ERE CE REM ACL  A++ C  
Sbjct: 68  LSQCEAGARSGCSITASRKCKPPWWKGLLGGAPTDYQERERCEEREMAACLEAAREACVK 127

Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVR-- 182
           FAK++ + PF+DARIA   G  E  K   W +      S    +  S++     F  R  
Sbjct: 128 FAKEKCIEPFRDARIASE-GLLENTKFAIWGAGSNRTSSASLCITNSQY----PFNPRPG 182

Query: 183 -TNYRAADLLGS----DNNYK 198
            TNY+ +DLL S    DNN K
Sbjct: 183 STNYKGSDLLDSFPSEDNNIK 203


>gi|115486277|ref|NP_001068282.1| Os11g0616200 [Oryza sativa Japonica Group]
 gi|77551945|gb|ABA94742.1| expressed protein [Oryza sativa Japonica Group]
 gi|113645504|dbj|BAF28645.1| Os11g0616200 [Oryza sativa Japonica Group]
 gi|125577881|gb|EAZ19103.1| hypothetical protein OsJ_34635 [Oryza sativa Japonica Group]
 gi|215697312|dbj|BAG91306.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 206

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 123/201 (61%), Gaps = 12/201 (5%)

Query: 5   GTNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKV 64
           G+    E+YS A+TV++FD P+PLLR PVP    S    +LAFR   SW +A+E+ E+ +
Sbjct: 8   GSVDQSEDYSPAATVVRFDPPLPLLRAPVPSSAASGEPPVLAFRDAASWRAAWEAAEASL 67

Query: 65  LSQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAG 124
           LSQCE GAR GCS+ AS+KCKPPWW+ L+G  P D++ERE CE REM ACL  A++ C  
Sbjct: 68  LSQCEAGARSGCSITASRKCKPPWWKGLLGGAPTDYQERERCEEREMAACLEAAREACVK 127

Query: 125 FAKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVR-- 182
           FAK++ + PF+DARIA   G  E  K   W +      S    +  S++     F  R  
Sbjct: 128 FAKEKCIEPFRDARIASE-GLLENTKFAIWGAGSNRTSSASLCIANSQY----PFNPRPG 182

Query: 183 -TNYRAADLLGS----DNNYK 198
            TNY+ +DLL S    DNN K
Sbjct: 183 STNYKGSDLLDSFPSEDNNIK 203


>gi|357156167|ref|XP_003577364.1| PREDICTED: uncharacterized protein LOC100836805 [Brachypodium
           distachyon]
          Length = 202

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/183 (43%), Positives = 112/183 (61%)

Query: 11  EEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
           E+YS A+TV++FD P+PLLR PVP         +LAFR   SW +A+E+ E+ + SQCE 
Sbjct: 14  EDYSPAATVVRFDSPLPLLRAPVPSSAAGDEPPVLAFRDAASWRAAWEAAEASLFSQCEA 73

Query: 71  GARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRL 130
           GAR GCS+ A++KCKPPWW+ L G  P D++ERE CE REM +CL  AK+ C  F+K++ 
Sbjct: 74  GARSGCSIMATRKCKPPWWKGLFGGAPTDYQEREQCEEREMASCLESAKEACIKFSKEKC 133

Query: 131 LRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAADL 190
             PF+DARIA     ++ + +V  A+    + S     + +           TNY+ +DL
Sbjct: 134 SAPFRDARIASEGLLEDTDFVVWGAAAGNNKTSSASFCVANRQHTPNPGAAATNYKGSDL 193

Query: 191 LGS 193
           L S
Sbjct: 194 LDS 196


>gi|242069173|ref|XP_002449863.1| hypothetical protein SORBIDRAFT_05g024570 [Sorghum bicolor]
 gi|241935706|gb|EES08851.1| hypothetical protein SORBIDRAFT_05g024570 [Sorghum bicolor]
          Length = 202

 Score =  126 bits (317), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 82/188 (43%), Positives = 115/188 (61%), Gaps = 2/188 (1%)

Query: 6   TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
            + + E+YSA +T+++FD P+PLLR PV    PS    +LAFR   SW +A+E+ E+ ++
Sbjct: 7   ADGSAEDYSAEATLVRFDPPLPLLRAPVRSPAPSGEGPVLAFRDAASWRAAWEAAEANLV 66

Query: 66  SQCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGF 125
           SQCE GAR GCS++AS+KCKPPWW+ L    P D++ERE CE REM ACL  AK+ C  F
Sbjct: 67  SQCEAGARSGCSISASRKCKPPWWKGLFVAAPTDYEERERCEEREMAACLEAAKEACIKF 126

Query: 126 AKDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNY 185
           AK + + PF+DARIA   G  E      W +  +   S     + ++     +    TNY
Sbjct: 127 AKAKCIGPFRDARIASE-GLVENTDFHVWGAAADKSSSTSACALNNQQSFSPDPGA-TNY 184

Query: 186 RAADLLGS 193
           R +D+L S
Sbjct: 185 RGSDVLDS 192


>gi|226530317|ref|NP_001143017.1| uncharacterized protein LOC100275481 [Zea mays]
 gi|195612990|gb|ACG28325.1| hypothetical protein [Zea mays]
          Length = 180

 Score =  122 bits (306), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 73/139 (52%), Positives = 96/139 (69%), Gaps = 4/139 (2%)

Query: 6   TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
            + + E+YSA +TV++FD P+PLLR PVP    S    +LAFR   SW +A+E+ E+ ++
Sbjct: 7   ADDSAEDYSAEATVVRFDPPLPLLRAPVPSPASSGEGPVLAFRDVASWRAAWEAAEASLV 66

Query: 66  SQCEHGARIGCSVNASKKCKPPWWRNLI----GQRPADFKEREDCEVREMEACLIVAKDK 121
           SQCE GAR GCSV+AS+KCKPPWW+ L+       P D++ER  CE REM ACL  A++ 
Sbjct: 67  SQCEAGARSGCSVSASRKCKPPWWKRLLVVVAAAPPTDYEERSRCEEREMAACLEAAREA 126

Query: 122 CAGFAKDRLLRPFQDARIA 140
           CA FAK +   PFQDARIA
Sbjct: 127 CAKFAKAKCAGPFQDARIA 145


>gi|242034065|ref|XP_002464427.1| hypothetical protein SORBIDRAFT_01g018080 [Sorghum bicolor]
 gi|241918281|gb|EER91425.1| hypothetical protein SORBIDRAFT_01g018080 [Sorghum bicolor]
          Length = 202

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/187 (43%), Positives = 110/187 (58%), Gaps = 2/187 (1%)

Query: 7   NSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLS 66
           + + ++YSA +T + FD P+PLLR PV    PS    +LAFR   SW +A+E+ E+ ++S
Sbjct: 8   DDSAQDYSAKATAVPFDPPLPLLRAPVRFPAPSGEDPVLAFRDAASWRAAWEAAEASLVS 67

Query: 67  QCEHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFA 126
           QCE GAR GCS+ AS+KCKPPWW+ L    P D++ERE CE REM ACL  AK+ C  FA
Sbjct: 68  QCEVGARSGCSIGASRKCKPPWWKGLFVAAPTDYEERERCEEREMAACLEAAKEACIKFA 127

Query: 127 KDRLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYR 186
           K +   PF DARIA   G  E      W +  +   S     + +      +  V TNYR
Sbjct: 128 KGKCTGPFGDARIASE-GLLENTDFDVWGAAVDKSSSTSGCALNNRQSFSPDPGV-TNYR 185

Query: 187 AADLLGS 193
            +D+L S
Sbjct: 186 GSDVLDS 192


>gi|23928446|gb|AAN40032.1| putative protein [Zea mays]
          Length = 165

 Score =  116 bits (290), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 72/142 (50%), Positives = 95/142 (66%), Gaps = 7/142 (4%)

Query: 6   TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
            + + E+YSA +TV++FD P+PLLR PVP    S    +LAFR   SW +A+E+ E+ ++
Sbjct: 7   ADDSAEDYSAEATVVRFDPPLPLLRAPVPSPASSGEGPVLAFRDAASWRAAWEAAEASLV 66

Query: 66  SQC---EHGARIGCSVNASKKCKPPWWRNLI----GQRPADFKEREDCEVREMEACLIVA 118
           SQC   E GAR GCSV+AS+KCKPPWW+ L+       P D++ER  CE REM ACL  A
Sbjct: 67  SQCELAEAGARSGCSVSASRKCKPPWWKRLLVVVAAAPPTDYEERSRCEEREMAACLEAA 126

Query: 119 KDKCAGFAKDRLLRPFQDARIA 140
           ++ C  FAK +   PFQDARIA
Sbjct: 127 REACVKFAKAKCAGPFQDARIA 148


>gi|168035348|ref|XP_001770172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678549|gb|EDQ65006.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 141

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 54/133 (40%), Positives = 85/133 (63%), Gaps = 6/133 (4%)

Query: 14  SAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEH 70
           S +  VI FD  +PL+R PVP    +D S G Y+LAFR+ ++W  A+++C+ K+ +QCE 
Sbjct: 6   SPSVLVIAFDPFLPLVRIPVPVGRDEDSSKGPYVLAFRNEETWRQAWKTCKEKIATQCEA 65

Query: 71  GARIGCSVNASKKCKPPWWRNLI---GQRPADFKEREDCEVREMEACLIVAKDKCAGFAK 127
           GA++GCSV+A+  C+ PWW++ +   G    D   R+DCE REM  CL+ + ++C  FA+
Sbjct: 66  GAKVGCSVSAAMACRQPWWKSYLPFSGSGSDDQFRRQDCEGREMRGCLLSSHERCFTFAR 125

Query: 128 DRLLRPFQDARIA 140
           +     + + RIA
Sbjct: 126 EICDPVYANMRIA 138


>gi|302784989|ref|XP_002974266.1| hypothetical protein SELMODRAFT_101436 [Selaginella moellendorffii]
 gi|300157864|gb|EFJ24488.1| hypothetical protein SELMODRAFT_101436 [Selaginella moellendorffii]
          Length = 238

 Score =  109 bits (273), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 101/194 (52%), Gaps = 9/194 (4%)

Query: 12  EYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQC 68
           + S ++T I F+  +P+LR PVP    DDPS G ++LAF+   SW+ A+++CE ++ SQC
Sbjct: 47  DLSPSATTIAFEPVLPVLRVPVPAGSDDDPSKGPFVLAFKDEASWSRAWQNCEKQITSQC 106

Query: 69  EHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKD 128
           E     GCS+ ASK C+ PWW+  +       + RE CE + M ACL  ++     FA+ 
Sbjct: 107 EAATMKGCSITASKACRSPWWKAFVPNFNTSTEGREACEEKAMMACLAASRRDGIKFARG 166

Query: 129 RLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAA 188
                F DARIA ++     +     ++   +  +  + LI  E  G   FR R+    A
Sbjct: 167 VCEEVFGDARIANKLQFANPQGHTNRSTENASTSNHNQVLI--ENPGSTNFRGRSLTDGA 224

Query: 189 DLLGSDNNYKRFLG 202
           D    D +Y+  LG
Sbjct: 225 D----DKHYQSSLG 234


>gi|302807885|ref|XP_002985636.1| hypothetical protein SELMODRAFT_122681 [Selaginella moellendorffii]
 gi|300146545|gb|EFJ13214.1| hypothetical protein SELMODRAFT_122681 [Selaginella moellendorffii]
          Length = 284

 Score =  109 bits (272), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 102/194 (52%), Gaps = 9/194 (4%)

Query: 12  EYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQC 68
           + S ++  I F+  +P+LR PVP    DDPS G ++LAF+   SW+ A+++CE ++ SQC
Sbjct: 47  DLSPSAITIAFEPVLPVLRVPVPAGSDDDPSKGPFVLAFKDEASWSRAWQNCEKQITSQC 106

Query: 69  EHGARIGCSVNASKKCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKD 128
           E     GCS+ ASK C+ PWW+  +       ++RE CE + M ACL  ++     FA+ 
Sbjct: 107 EAATMKGCSITASKACRSPWWKAFVPNFNTSTEDREACEEKAMMACLAASRRDGIKFARG 166

Query: 129 RLLRPFQDARIAGRVGPKEVEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAA 188
                F DARIA ++   + +     ++   +  +  + LI  E  G   FR R+    A
Sbjct: 167 VCEEVFGDARIANKLQFADPQGHTNRSTENASTSNHNQVLI--ENPGSTNFRGRSLTDGA 224

Query: 189 DLLGSDNNYKRFLG 202
           D    D +Y+  LG
Sbjct: 225 D----DKHYQSSLG 234


>gi|168044182|ref|XP_001774561.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674116|gb|EDQ60629.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 293

 Score = 99.4 bits (246), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 55/139 (39%), Positives = 89/139 (64%), Gaps = 6/139 (4%)

Query: 8   SNMEEYSAASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKV 64
           S+    S   TV+ FD  IPL R PVP    +DPS G Y+LAFR+ ++W  ++ +C+  V
Sbjct: 15  SSQVNLSPYVTVVAFDPLIPLARIPVPAGQEEDPSKGPYVLAFRNDETWRQSWRTCKENV 74

Query: 65  LSQCEHGARIGCSVNASKKCKPPWWRN---LIGQRPADFKEREDCEVREMEACLIVAKDK 121
           ++QCE GA++GCS++A+K C+ PWW+     +G    D  +RE+CE REM +CL+ + ++
Sbjct: 75  VTQCEAGAKVGCSISAAKACRQPWWKAYLPFLGSGSDDPSKREECERREMRSCLLSSHER 134

Query: 122 CAGFAKDRLLRPFQDARIA 140
           C  +A++  +  + + RIA
Sbjct: 135 CFTYARETCVPVYSNMRIA 153


>gi|302795037|ref|XP_002979282.1| hypothetical protein SELMODRAFT_110619 [Selaginella
          moellendorffii]
 gi|300153050|gb|EFJ19690.1| hypothetical protein SELMODRAFT_110619 [Selaginella
          moellendorffii]
          Length = 60

 Score = 62.8 bits (151), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 28/60 (46%), Positives = 39/60 (65%), Gaps = 3/60 (5%)

Query: 32 PVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASKKCKPPW 88
          PVP   +DDPS G ++LAF+   SW+ A++ CE ++ S CE     GCS+ ASK C+ PW
Sbjct: 1  PVPAGSNDDPSKGPFILAFKDKASWSHAWQYCEKQITSYCEAATMKGCSITASKACRSPW 60


>gi|302813810|ref|XP_002988590.1| hypothetical protein SELMODRAFT_128383 [Selaginella
          moellendorffii]
 gi|300143697|gb|EFJ10386.1| hypothetical protein SELMODRAFT_128383 [Selaginella
          moellendorffii]
          Length = 60

 Score = 61.2 bits (147), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/60 (46%), Positives = 38/60 (63%), Gaps = 3/60 (5%)

Query: 32 PVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASKKCKPPW 88
          PVP   +DDPS G ++LAF+   SW+ A++ CE ++ S CE     GCS+ ASK C  PW
Sbjct: 1  PVPTGSNDDPSKGPFILAFKDKASWSRAWQYCEKQITSYCEAATMKGCSITASKACCSPW 60


>gi|302795041|ref|XP_002979284.1| hypothetical protein SELMODRAFT_59163 [Selaginella
          moellendorffii]
 gi|300153052|gb|EFJ19692.1| hypothetical protein SELMODRAFT_59163 [Selaginella
          moellendorffii]
          Length = 61

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/57 (42%), Positives = 40/57 (70%), Gaps = 3/57 (5%)

Query: 16 ASTVIKFDKPIPLLRGPVP---HDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCE 69
          ++T+I F+  +P+LR P+P    DDPS G ++LAF+   SW+ A++ CE ++ SQC+
Sbjct: 5  SATMIAFEPVLPVLRVPIPTGSDDDPSKGPFILAFKDEASWSHAWQYCEKQITSQCK 61


>gi|307102844|gb|EFN51111.1| hypothetical protein CHLNCDRAFT_141467 [Chlorella variabilis]
          Length = 112

 Score = 56.6 bits (135), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 53/107 (49%), Gaps = 2/107 (1%)

Query: 45  LAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASKKCKPPWWRNLIGQRP-ADFKER 103
           LAF S  S    +  CE KV+  C  G R GCS+ A+++C  P    L+G  P   ++ER
Sbjct: 4   LAFASEASREQQHRGCEGKVIEVCTEGLRAGCSLQAARQCASPPLLRLLGLVPEQSWEER 63

Query: 104 EDCEVREMEACLIVAKDKCAGFAKDRLLRPFQDARIAGRVGPKEVEK 150
           E C+ R M+ACL      C   A+      F    +AG+   +E EK
Sbjct: 64  EACQQRCMDACLAQQTPACQQHAQHFCADVF-AGPVAGQQQGREQEK 109


>gi|413920417|gb|AFW60349.1| hypothetical protein ZEAMMB73_302102 [Zea mays]
          Length = 89

 Score = 53.1 bits (126), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/64 (46%), Positives = 44/64 (68%)

Query: 6  TNSNMEEYSAASTVIKFDKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVL 65
           + + E+YSA +TV++FD P+PLLR PVP    S    +LAFR   SW +A+E+ E+ ++
Sbjct: 7  ADDSAEDYSAEATVVRFDPPLPLLRAPVPSPASSGEGPVLAFRDAASWRAAWEAAEASLV 66

Query: 66 SQCE 69
          SQCE
Sbjct: 67 SQCE 70


>gi|406920230|gb|EKD58332.1| hypothetical protein ACD_56C00148G0013 [uncultured bacterium]
          Length = 327

 Score = 39.3 bits (90), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 98  ADFKEREDCEV---REMEACLIVAKDKCAGFAKDRLLRPFQDARIAGRVGPKEVEKLVCW 154
           ADF+ +E  E    RE E    ++KDK   F K++L  P+ D RI    G K+V+K++  
Sbjct: 17  ADFRSKEKIEKMLKREKEKYDGLSKDKKELFDKEKLTNPYSDTRIHFDGGAKQVKKIMAG 76

Query: 155 ASLPETERSLWRDL 168
             +  +E  + R L
Sbjct: 77  IDIDSSELMVARYL 90


>gi|413920416|gb|AFW60348.1| hypothetical protein ZEAMMB73_751749 [Zea mays]
          Length = 68

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 21/30 (70%)

Query: 111 MEACLIVAKDKCAGFAKDRLLRPFQDARIA 140
           M ACL  A++ CA FAK +   PFQDARIA
Sbjct: 1   MAACLEAAREACAKFAKAKCAGPFQDARIA 30


>gi|254385548|ref|ZP_05000874.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194344419|gb|EDX25385.1| conserved hypothetical protein [Streptomyces sp. Mg1]
          Length = 415

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 47/104 (45%), Gaps = 12/104 (11%)

Query: 88  WWRNLIGQRPADFKEREDCEVREMEACLIVAKDKCAGFAKDRLLRPFQDARIAGRVGPKE 147
           WWR   G+    F+  +     ++ A    A  + AG    R    + DA++   +    
Sbjct: 196 WWRLATGEEEFSFRPYKP----KLHAVYRTAAGEVAGLVSYRSDDHWTDAKVP--LNTLR 249

Query: 148 VEKLVCWASLPETERSLWRDLIGSEWLGEREFRVRTNYRAADLL 191
           VE+L+  A+ PE ER+LW  L   +W      R+RT  RA D L
Sbjct: 250 VEELL--AATPEAERALWHFLCSVDW----ALRIRTGLRATDDL 287


>gi|260655401|ref|ZP_05860889.1| lipopolysaccharide O-Ag biosynthesis protein FlmB [Jonquetella
           anthropi E3_33 E1]
 gi|424845213|ref|ZP_18269824.1| putative PLP-dependent enzyme possibly involved in cell wall
           biogenesis [Jonquetella anthropi DSM 22815]
 gi|260629849|gb|EEX48043.1| lipopolysaccharide O-Ag biosynthesis protein FlmB [Jonquetella
           anthropi E3_33 E1]
 gi|363986651|gb|EHM13481.1| putative PLP-dependent enzyme possibly involved in cell wall
           biogenesis [Jonquetella anthropi DSM 22815]
          Length = 386

 Score = 36.2 bits (82), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 46/111 (41%), Gaps = 8/111 (7%)

Query: 23  DKPIPLLRGPVPHDDPSSGSYLLAFRSHQSWASAYESCESKVLSQCEHGARIGCSVNASK 82
           D P+ L  GP P  + S   ++L              C    + +  + A+IGCSV+   
Sbjct: 279 DLPVQLPMGPTPEREHSWHLFMLRLSDDAPI------CRDNFIQEMSN-AKIGCSVHYIP 331

Query: 83  KCKPPWWRNLIGQRPADFKEREDCEVREMEACLIVA-KDKCAGFAKDRLLR 132
               P+W+     +P DF   ED   RE+   +  A KD    +  +++ R
Sbjct: 332 LHAQPYWKQRYNLQPQDFPVAEDAWKREVSLPIFTAMKDSDVDYVIEQVRR 382


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.425 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,378,818,237
Number of Sequences: 23463169
Number of extensions: 133925712
Number of successful extensions: 238602
Number of sequences better than 100.0: 32
Number of HSP's better than 100.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 238501
Number of HSP's gapped (non-prelim): 33
length of query: 202
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 67
effective length of database: 9,191,667,552
effective search space: 615841725984
effective search space used: 615841725984
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)