BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 030344
         (179 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255562170|ref|XP_002522093.1| conserved hypothetical protein [Ricinus communis]
 gi|223538692|gb|EEF40293.1| conserved hypothetical protein [Ricinus communis]
          Length = 192

 Score =  226 bits (576), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 134/200 (67%), Positives = 148/200 (74%), Gaps = 29/200 (14%)

Query: 1   MALNFIAPSP---------------FVIPKHKHLFKLQSQTTSC-KTFAQSQRIGAEGGG 44
           MALN    SP               F+IPKHKHLFK     +SC K FA+S+  GAE G 
Sbjct: 1   MALNISPASPCQISSCSSSSANNSHFLIPKHKHLFKFY--YSSCRKIFAKSE--GAESG- 55

Query: 45  GVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSSS---LEGYESDGSSRKLTIRDQLAQL 101
            + E+D P    SGS SS +TQLDLL+QLTS SSS     GYESDGSS KLTIR QLA+L
Sbjct: 56  -IKEQDLPP--FSGSLSSTRTQLDLLDQLTSNSSSSPDTVGYESDGSSYKLTIRQQLAKL 112

Query: 102 VGDRDD--EFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVI 159
           VG+RD+  EFSIPLGKNLKK S KFLTISQKRNIRRQAYL+EVSQRNDSVFFATIGAFVI
Sbjct: 113 VGERDEDEEFSIPLGKNLKKFSPKFLTISQKRNIRRQAYLDEVSQRNDSVFFATIGAFVI 172

Query: 160 LPPFIILGIAVLTGYVQLFP 179
           LPP +ILGIA+LTGYVQLFP
Sbjct: 173 LPPIVILGIAILTGYVQLFP 192


>gi|356501461|ref|XP_003519543.1| PREDICTED: uncharacterized protein LOC100776849 [Glycine max]
          Length = 191

 Score =  224 bits (572), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 120/169 (71%), Positives = 135/169 (79%), Gaps = 6/169 (3%)

Query: 11  FVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLL 70
           F+  KH +L K     +S  T   +Q  G + G  VT++D  +A   GS SS++ QLDLL
Sbjct: 29  FLTHKHNNLIKFYHPFSSLLTTC-AQTHGTDTG--VTQKDDASA---GSLSSSRAQLDLL 82

Query: 71  EQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQK 130
           EQLTSTSS   GYESDGSSRKLTIR+QL QL G+RDD+F+IPLGKNLKKVS KFLTISQK
Sbjct: 83  EQLTSTSSPNSGYESDGSSRKLTIREQLVQLFGERDDDFTIPLGKNLKKVSAKFLTISQK 142

Query: 131 RNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           RNIRRQ YLNEVSQRNDSVFFATIGAFVILPPFIILGIA+LTGYVQLFP
Sbjct: 143 RNIRRQTYLNEVSQRNDSVFFATIGAFVILPPFIILGIAILTGYVQLFP 191


>gi|356553792|ref|XP_003545236.1| PREDICTED: uncharacterized protein LOC100778027 [Glycine max]
          Length = 178

 Score =  222 bits (565), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 121/170 (71%), Positives = 136/170 (80%), Gaps = 9/170 (5%)

Query: 11  FVIPKHKHLFKLQSQTTSC-KTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDL 69
           F+  KH +L K+   ++S   T AQ+Q        GVT+ED  A +  GS SS++ QLDL
Sbjct: 17  FLTHKHNNLIKIYHPSSSLLTTCAQTQGTDT----GVTQEDASAGN--GSLSSSRAQLDL 70

Query: 70  LEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQ 129
           LEQLTSTSS   GYESDGS  KLTIR+QL QL G+RDD+F+IPLGKNLKKVS KFLTISQ
Sbjct: 71  LEQLTSTSSPNSGYESDGS--KLTIREQLEQLFGERDDDFTIPLGKNLKKVSAKFLTISQ 128

Query: 130 KRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           KRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIA+LTGYVQLFP
Sbjct: 129 KRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAILTGYVQLFP 178


>gi|225446815|ref|XP_002283386.1| PREDICTED: uncharacterized protein LOC100243650 isoform 1 [Vitis
           vinifera]
 gi|359485353|ref|XP_003633263.1| PREDICTED: uncharacterized protein LOC100243650 isoform 2 [Vitis
           vinifera]
 gi|302143535|emb|CBI22096.3| unnamed protein product [Vitis vinifera]
          Length = 190

 Score =  218 bits (556), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 132/171 (77%), Positives = 147/171 (85%), Gaps = 6/171 (3%)

Query: 11  FVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPP-AASLSGSSSSAQTQLDL 69
           F+IPKH+ LFKL S ++  +T AQS   G   G GVT+EDPP  A+LS S SSA++QLDL
Sbjct: 24  FLIPKHR-LFKLCSSSSCFRTSAQS---GGTEGRGVTQEDPPLPAALSESPSSARSQLDL 79

Query: 70  LEQLTSTSSSLEG-YESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTIS 128
           LEQLTST+ SLEG YESDG SR LTIRDQLAQLVG+RDD+F IPLGKNLKKVS KFLTIS
Sbjct: 80  LEQLTSTTPSLEGGYESDGMSRNLTIRDQLAQLVGERDDDFIIPLGKNLKKVSPKFLTIS 139

Query: 129 QKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           QKRNI+RQAYLNEVSQRNDSVFFATIGAFVILPP IILGIA++TGYVQLFP
Sbjct: 140 QKRNIKRQAYLNEVSQRNDSVFFATIGAFVILPPIIILGIAIITGYVQLFP 190


>gi|388509082|gb|AFK42607.1| unknown [Medicago truncatula]
 gi|388515509|gb|AFK45816.1| unknown [Medicago truncatula]
          Length = 178

 Score =  210 bits (534), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 116/174 (66%), Positives = 138/174 (79%), Gaps = 8/174 (4%)

Query: 6   IAPSPFVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQT 65
           + PSP  + +  +  K    ++  KT AQSQ  G E G  V+E+    AS  GS SS++ 
Sbjct: 13  LHPSPSFLTRKYNNIKFHHASSLTKTCAQSQ--GTEPG--VSED----ASSIGSLSSSRA 64

Query: 66  QLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFL 125
           QLDLLEQ TSTSSS+ GYESDGSS++LTIR+QLAQLVG+RDD+FS+PLGKNLKKVS KFL
Sbjct: 65  QLDLLEQFTSTSSSVIGYESDGSSQRLTIREQLAQLVGERDDDFSLPLGKNLKKVSAKFL 124

Query: 126 TISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           T SQ+RNI+RQ+YLNEVSQRNDS FFATIGAFV+LPP IILGIA+LTGYVQL P
Sbjct: 125 TTSQRRNIKRQSYLNEVSQRNDSTFFATIGAFVLLPPLIILGIAILTGYVQLLP 178


>gi|357494233|ref|XP_003617405.1| hypothetical protein MTR_5g091190 [Medicago truncatula]
 gi|355518740|gb|AET00364.1| hypothetical protein MTR_5g091190 [Medicago truncatula]
          Length = 173

 Score =  210 bits (534), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 116/174 (66%), Positives = 138/174 (79%), Gaps = 8/174 (4%)

Query: 6   IAPSPFVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQT 65
           + PSP  + +  +  K    ++  KT AQSQ  G E G  V+E+    AS  GS SS++ 
Sbjct: 8   LHPSPSFLTRKYNNIKFHHASSLTKTCAQSQ--GTEPG--VSED----ASSIGSLSSSRA 59

Query: 66  QLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFL 125
           QLDLLEQ TSTSSS+ GYESDGSS++LTIR+QLAQLVG+RDD+FS+PLGKNLKKVS KFL
Sbjct: 60  QLDLLEQFTSTSSSVIGYESDGSSQRLTIREQLAQLVGERDDDFSLPLGKNLKKVSAKFL 119

Query: 126 TISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           T SQ+RNI+RQ+YLNEVSQRNDS FFATIGAFV+LPP IILGIA+LTGYVQL P
Sbjct: 120 TTSQRRNIKRQSYLNEVSQRNDSTFFATIGAFVLLPPLIILGIAILTGYVQLLP 173


>gi|449525854|ref|XP_004169931.1| PREDICTED: uncharacterized LOC101216249 [Cucumis sativus]
          Length = 190

 Score =  205 bits (521), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 116/161 (72%), Positives = 134/161 (83%), Gaps = 5/161 (3%)

Query: 20  FKLQSQ-TTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSS 78
           FK  S+ ++S + FA S+   AEG   V EE+   +S SGSSSSA+TQLDLLEQL+S S 
Sbjct: 34  FKFHSRCSSSSRKFAVSE--SAEGR--VNEEEFLISSSSGSSSSARTQLDLLEQLSSGSQ 89

Query: 79  SLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAY 138
            ++GYESDGS  K TIRDQLAQL  DRDD+F++PLGKNLKKVS KFLTISQKRNI+RQAY
Sbjct: 90  LVDGYESDGSYGKTTIRDQLAQLFRDRDDDFTVPLGKNLKKVSAKFLTISQKRNIKRQAY 149

Query: 139 LNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           LNEVSQRNDSVFFAT+GAFVILPP +ILGIA+LTGYVQLFP
Sbjct: 150 LNEVSQRNDSVFFATVGAFVILPPIVILGIAILTGYVQLFP 190


>gi|449453780|ref|XP_004144634.1| PREDICTED: uncharacterized protein LOC101216249 [Cucumis sativus]
          Length = 189

 Score =  205 bits (521), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 116/161 (72%), Positives = 134/161 (83%), Gaps = 5/161 (3%)

Query: 20  FKLQSQ-TTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSS 78
           FK  S+ ++S + FA S+   AEG   V EE+   +S SGSSSSA+TQLDLLEQL+S S 
Sbjct: 33  FKFHSRCSSSSRKFAVSE--SAEGR--VNEEEFLISSSSGSSSSARTQLDLLEQLSSGSQ 88

Query: 79  SLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAY 138
            ++GYESDGS  K TIRDQLAQL  DRDD+F++PLGKNLKKVS KFLTISQKRNI+RQAY
Sbjct: 89  LVDGYESDGSYGKTTIRDQLAQLFRDRDDDFTVPLGKNLKKVSAKFLTISQKRNIKRQAY 148

Query: 139 LNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           LNEVSQRNDSVFFAT+GAFVILPP +ILGIA+LTGYVQLFP
Sbjct: 149 LNEVSQRNDSVFFATVGAFVILPPIVILGIAILTGYVQLFP 189


>gi|224066559|ref|XP_002302136.1| predicted protein [Populus trichocarpa]
 gi|222843862|gb|EEE81409.1| predicted protein [Populus trichocarpa]
          Length = 186

 Score =  199 bits (507), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 132/192 (68%), Positives = 149/192 (77%), Gaps = 19/192 (9%)

Query: 1   MALNFIAPSP-----------FVIPKHKHLFKLQSQTTSC-KTFAQSQRIGAEGGGGVTE 48
           MALNF A S            F+IP HKHL KL   + SC KT AQS+  GA    GVTE
Sbjct: 1   MALNFSASSLSHVSSSTTGPQFLIPAHKHLLKLYPSSFSCTKTSAQSE--GA--ASGVTE 56

Query: 49  EDPPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDE 108
           +DP + S S SS+  +TQLDLLEQLTS + S +GYESDG S KLTIR+QLAQLVG+ D++
Sbjct: 57  QDPSSFSGSLSST--RTQLDLLEQLTSATPSADGYESDGGSGKLTIREQLAQLVGEGDND 114

Query: 109 FSIPLGK-NLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILG 167
           FSIPLGK NLKKVS KFLT+SQKRNIRRQAYLN+VSQRNDSVFFATIGAF+ILPP IILG
Sbjct: 115 FSIPLGKKNLKKVSAKFLTVSQKRNIRRQAYLNQVSQRNDSVFFATIGAFIILPPIIILG 174

Query: 168 IAVLTGYVQLFP 179
           IA+LTGYVQLFP
Sbjct: 175 IAILTGYVQLFP 186


>gi|297824263|ref|XP_002880014.1| hypothetical protein ARALYDRAFT_321975 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325853|gb|EFH56273.1| hypothetical protein ARALYDRAFT_321975 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 187

 Score =  195 bits (496), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 114/190 (60%), Positives = 132/190 (69%), Gaps = 14/190 (7%)

Query: 1   MALNFIAPSPFVIP----------KHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEED 50
           MALNF + S   I           +H HL    S   + +  ++   IG       +   
Sbjct: 1   MALNFFSTSLHPISNPSSSSRSKQRHHHLLLQLSSPPATRVRSERSEIGVTDNNDSSSSS 60

Query: 51  PPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGYESD-GSSRKLTIRDQLAQLVGDRDDEF 109
              +  S SSSS +TQLDLLEQLTSTS   +GY SD G SR +TIR+QLA LVGDRDD+F
Sbjct: 61  SSFSGSSSSSSSVRTQLDLLEQLTSTS---DGYLSDAGGSRGITIRNQLAGLVGDRDDDF 117

Query: 110 SIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIA 169
           +IPLGKNLKKVS KFLTISQKRNI+RQ+YLNEVSQRNDSVFFATIGAFVILPP +IL IA
Sbjct: 118 TIPLGKNLKKVSPKFLTISQKRNIKRQSYLNEVSQRNDSVFFATIGAFVILPPLVILAIA 177

Query: 170 VLTGYVQLFP 179
           +LTGYVQLFP
Sbjct: 178 ILTGYVQLFP 187


>gi|30689207|ref|NP_850380.1| uncharacterized protein [Arabidopsis thaliana]
 gi|19347805|gb|AAL86353.1| unknown protein [Arabidopsis thaliana]
 gi|22136700|gb|AAM91669.1| unknown protein [Arabidopsis thaliana]
 gi|330255099|gb|AEC10193.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 187

 Score =  193 bits (490), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 99/118 (83%), Positives = 106/118 (89%), Gaps = 4/118 (3%)

Query: 63  AQTQLDLLEQLTSTSSSLEGYESDGS-SRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVS 121
            +TQLDLLEQLTSTS   +GY SDG  SR LTIRDQLA LVGDRDD+FSIPLGKNLKKVS
Sbjct: 73  VRTQLDLLEQLTSTS---DGYLSDGGGSRGLTIRDQLAGLVGDRDDDFSIPLGKNLKKVS 129

Query: 122 EKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
            KFLTISQKRNI+RQ+YLNEVSQRNDSVFFATIGAFVILPP +IL IA+LTGYVQLFP
Sbjct: 130 PKFLTISQKRNIKRQSYLNEVSQRNDSVFFATIGAFVILPPLVILAIAILTGYVQLFP 187


>gi|224082572|ref|XP_002306747.1| predicted protein [Populus trichocarpa]
 gi|222856196|gb|EEE93743.1| predicted protein [Populus trichocarpa]
          Length = 122

 Score =  186 bits (473), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 103/122 (84%), Positives = 112/122 (91%), Gaps = 1/122 (0%)

Query: 59  SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGK-NL 117
           S SS +TQLDLLEQLTSTSS  +GYESDGSS KLTIRDQLA+LVGDRDD+FSIPLGK NL
Sbjct: 1   SLSSTRTQLDLLEQLTSTSSPADGYESDGSSGKLTIRDQLARLVGDRDDDFSIPLGKKNL 60

Query: 118 KKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQL 177
           KKVS  FLT+SQKRNIRRQAYLNEVSQRNDSVFFATIGAF+ILPP +ILGIA++ GYVQL
Sbjct: 61  KKVSANFLTVSQKRNIRRQAYLNEVSQRNDSVFFATIGAFIILPPTLILGIAIIIGYVQL 120

Query: 178 FP 179
           FP
Sbjct: 121 FP 122


>gi|414881083|tpg|DAA58214.1| TPA: hypothetical protein ZEAMMB73_738830 [Zea mays]
          Length = 188

 Score =  140 bits (353), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/175 (46%), Positives = 109/175 (62%), Gaps = 22/175 (12%)

Query: 13  IPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQ 72
           +P    +   Q     C+  A+ +        G T+E       S ++SSA+ QLDLLEQ
Sbjct: 28  VPLKASVAPAQRALGCCRATARQE--------GATQEP------SVAASSARAQLDLLEQ 73

Query: 73  LTS-----TSSSLE-GYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKF-- 124
           LTS     T++ LE G       R+ TIR+QL+ L   + DEF++PLGK LK+  ++   
Sbjct: 74  LTSATPDDTAAGLENGMPPPEPRRRATIREQLSALANGKVDEFTLPLGKRLKEGLKRLNS 133

Query: 125 LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP  IL I VLTGY+QLFP
Sbjct: 134 LTVSQRRNIKRQALLTQVSGRNDSVFFATVGAFVLVPPLAILAIGVLTGYIQLFP 188


>gi|226497328|ref|NP_001145019.1| uncharacterized protein LOC100278190 [Zea mays]
 gi|195649997|gb|ACG44466.1| hypothetical protein [Zea mays]
          Length = 188

 Score =  138 bits (348), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 80/175 (45%), Positives = 108/175 (61%), Gaps = 22/175 (12%)

Query: 13  IPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQ 72
           +P    +   Q     C+  A+ +        G T+E       S ++SSA+ QLDLLEQ
Sbjct: 28  VPLKASVAPAQRSLGCCRATARQE--------GATQEP------SVAASSARAQLDLLEQ 73

Query: 73  LTSTS-----SSLE-GYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKF-- 124
           LTS +     + LE G       R+ TIR+QL+ L   + DEF++PLGK LK+  ++   
Sbjct: 74  LTSATPDDAAAGLENGMPPPEPRRRATIREQLSTLANGKVDEFTLPLGKRLKEGLKRLNS 133

Query: 125 LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP  IL I VLTGY+QLFP
Sbjct: 134 LTVSQRRNIKRQALLTQVSGRNDSVFFATVGAFVLVPPLAILAIGVLTGYIQLFP 188


>gi|224284915|gb|ACN40187.1| unknown [Picea sitchensis]
          Length = 195

 Score =  133 bits (335), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/158 (49%), Positives = 99/158 (62%), Gaps = 10/158 (6%)

Query: 26  TTSC--KTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGY 83
           T SC  K         ++   GV+E  P  +    + SS Q+QLDLLE+LTS    +   
Sbjct: 44  TNSCRLKRIVSPPAAASKTESGVSEGSPDESV--STESSVQSQLDLLERLTS----INNK 97

Query: 84  ESDGS--SRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNE 141
           E+D +  S   TIR+QL+ LV D   E +IPLGK  K  +   LTISQKRNI+RQ YLN+
Sbjct: 98  ENDQAKVSSTTTIREQLSALVSDEVGEVTIPLGKRFKPNNYNSLTISQKRNIKRQDYLNK 157

Query: 142 VSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
           V+QRND  FF+TI  FVILPP +ILG+AV TGYV +FP
Sbjct: 158 VAQRNDIPFFSTIALFVILPPVVILGVAVATGYVDIFP 195


>gi|242053923|ref|XP_002456107.1| hypothetical protein SORBIDRAFT_03g030570 [Sorghum bicolor]
 gi|241928082|gb|EES01227.1| hypothetical protein SORBIDRAFT_03g030570 [Sorghum bicolor]
          Length = 195

 Score =  133 bits (334), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/126 (59%), Positives = 93/126 (73%), Gaps = 7/126 (5%)

Query: 61  SSAQTQLDLLEQLTS-TSSSLEGYES---DGSSRKLTIRDQLAQLV-GDRDDEFSIPLGK 115
           SSA+ QLDLLEQLTS T   + G E+     S ++ TIR QL+ L  G   DEF++PLG+
Sbjct: 70  SSARAQLDLLEQLTSPTPDGIAGLENGVPTESRQRATIRQQLSALANGKVVDEFTLPLGR 129

Query: 116 NLKK--VSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTG 173
            LK+   S   LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PPF IL IAV+TG
Sbjct: 130 KLKEGLKSLNKLTVSQRRNIKRQALLTQVSGRNDSVFFATVGAFVLVPPFAILAIAVITG 189

Query: 174 YVQLFP 179
           YVQLFP
Sbjct: 190 YVQLFP 195


>gi|125527181|gb|EAY75295.1| hypothetical protein OsI_03186 [Oryza sativa Indica Group]
          Length = 187

 Score =  132 bits (331), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 75/126 (59%), Positives = 97/126 (76%), Gaps = 5/126 (3%)

Query: 59  SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSR---KLTIRDQLAQLVGDRDDEFSIPLGK 115
           S SSA+TQLDLLEQLT+ +S   G E+   +    + TIR+QL+ ++GDRD E+++PLGK
Sbjct: 62  SVSSARTQLDLLEQLTTPTSDGIGVENGAPTEPRVQTTIREQLSAVIGDRDGEYTLPLGK 121

Query: 116 NLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTG 173
            LK+  +K   LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP  IL IAVLTG
Sbjct: 122 KLKEGLKKLNSLTVSQRRNIKRQALLTKVSGRNDSVFFATVGAFVLVPPLAILAIAVLTG 181

Query: 174 YVQLFP 179
           YVQL P
Sbjct: 182 YVQLLP 187


>gi|115439045|ref|NP_001043802.1| Os01g0666600 [Oryza sativa Japonica Group]
 gi|56202163|dbj|BAD73641.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|113533333|dbj|BAF05716.1| Os01g0666600 [Oryza sativa Japonica Group]
 gi|125571500|gb|EAZ13015.1| hypothetical protein OsJ_02936 [Oryza sativa Japonica Group]
          Length = 187

 Score =  132 bits (331), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 75/126 (59%), Positives = 97/126 (76%), Gaps = 5/126 (3%)

Query: 59  SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSR---KLTIRDQLAQLVGDRDDEFSIPLGK 115
           S SSA+TQLDLLEQLT+ +S   G E+   +    + TIR+QL+ ++GDRD E+++PLGK
Sbjct: 62  SVSSARTQLDLLEQLTTPTSDGIGVENGAPTEPRVQTTIREQLSAVIGDRDGEYTLPLGK 121

Query: 116 NLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTG 173
            LK+  +K   LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP  IL IAVLTG
Sbjct: 122 KLKEGLKKLNSLTVSQRRNIKRQALLTKVSGRNDSVFFATVGAFVLVPPLAILAIAVLTG 181

Query: 174 YVQLFP 179
           YVQL P
Sbjct: 182 YVQLLP 187


>gi|414881082|tpg|DAA58213.1| TPA: hypothetical protein ZEAMMB73_738830 [Zea mays]
          Length = 216

 Score =  129 bits (323), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 109/203 (53%), Gaps = 50/203 (24%)

Query: 13  IPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQ 72
           +P    +   Q     C+  A+ +        G T+E       S ++SSA+ QLDLLEQ
Sbjct: 28  VPLKASVAPAQRALGCCRATARQE--------GATQEP------SVAASSARAQLDLLEQ 73

Query: 73  LTS---------------------------------TSSSLE-GYESDGSSRKLTIRDQL 98
           LTS                                 +SS LE G       R+ TIR+QL
Sbjct: 74  LTSATPDDTAAGKDPYCTCTQGYKYSVIEPNSHDKRSSSGLENGMPPPEPRRRATIREQL 133

Query: 99  AQLVGDRDDEFSIPLGKNLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGA 156
           + L   + DEF++PLGK LK+  ++   LT+SQ+RNI+RQA L +VS RNDSVFFAT+GA
Sbjct: 134 SALANGKVDEFTLPLGKRLKEGLKRLNSLTVSQRRNIKRQALLTQVSGRNDSVFFATVGA 193

Query: 157 FVILPPFIILGIAVLTGYVQLFP 179
           FV++PP  IL I VLTGY+QLFP
Sbjct: 194 FVLVPPLAILAIGVLTGYIQLFP 216


>gi|357453657|ref|XP_003597109.1| hypothetical protein MTR_2g089780 [Medicago truncatula]
 gi|357482675|ref|XP_003611624.1| hypothetical protein MTR_5g016040 [Medicago truncatula]
 gi|355486157|gb|AES67360.1| hypothetical protein MTR_2g089780 [Medicago truncatula]
 gi|355512959|gb|AES94582.1| hypothetical protein MTR_5g016040 [Medicago truncatula]
          Length = 104

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 49/80 (61%), Positives = 56/80 (70%), Gaps = 17/80 (21%)

Query: 82  GYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNE 141
           GYESDGSS++L IR+QL QL                 KVS  FLT SQ+RNI+RQ+YLNE
Sbjct: 15  GYESDGSSQRLRIREQLEQL-----------------KVSATFLTTSQRRNIKRQSYLNE 57

Query: 142 VSQRNDSVFFATIGAFVILP 161
           VSQRNDS FFATIGAF +LP
Sbjct: 58  VSQRNDSTFFATIGAFGLLP 77


>gi|168044668|ref|XP_001774802.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673826|gb|EDQ60343.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 280

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 57/81 (70%)

Query: 93  TIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFA 152
           TIRDQLA  +    +E  + L    K+ +   LTISQKRNIRRQ YLN+VS+RND+ FFA
Sbjct: 177 TIRDQLASQIEPEVEEVVVSLVSEEKEPTPPQLTISQKRNIRRQNYLNKVSERNDAPFFA 236

Query: 153 TIGAFVILPPFIILGIAVLTG 173
           T+  FV+LPP +ILG AV TG
Sbjct: 237 TVAGFVLLPPLLILGFAVATG 257


>gi|357130670|ref|XP_003566970.1| PREDICTED: uncharacterized protein LOC100823674 [Brachypodium
           distachyon]
          Length = 179

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/106 (47%), Positives = 70/106 (66%), Gaps = 6/106 (5%)

Query: 59  SSSSAQTQLDLLEQLTSTSSSLEGYESDG--SSRKLT-IRDQLAQLVGDRDDEFSIPLGK 115
           S SSA+TQLDLLEQLTS+ +S +G ES     SR LT IR+QL+ L G+R  EF++P+GK
Sbjct: 60  SVSSARTQLDLLEQLTSSPTS-DGIESGAPVESRPLTTIREQLSALYGERGGEFTLPMGK 118

Query: 116 NLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVI 159
            LK+  +    +T+SQ+RNI+RQA L +V   +   +    G F +
Sbjct: 119 KLKEGLKALNSMTVSQRRNIKRQAMLTKVFSGHLQTYHTHEGTFAL 164


>gi|302763955|ref|XP_002965399.1| hypothetical protein SELMODRAFT_439213 [Selaginella moellendorffii]
 gi|300167632|gb|EFJ34237.1| hypothetical protein SELMODRAFT_439213 [Selaginella moellendorffii]
          Length = 235

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 40/89 (44%), Positives = 58/89 (65%), Gaps = 6/89 (6%)

Query: 93  TIRDQLAQLVGDRDD--EFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVF 150
           TI  +L    G+ ++  +FS+PL  +  K  +    IS +RNI+RQ YL++VSQR+DS  
Sbjct: 151 TIASKLVDRYGNSEEGSDFSVPLVYSRYKKPK----ISVQRNIKRQEYLDKVSQRDDSGT 206

Query: 151 FATIGAFVILPPFIILGIAVLTGYVQLFP 179
           FATIG FV++PP  IL +A+  GYV + P
Sbjct: 207 FATIGLFVLVPPLAILAVAIAVGYVNILP 235


>gi|307105097|gb|EFN53348.1| hypothetical protein CHLNCDRAFT_137078 [Chlorella variabilis]
          Length = 285

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 60/123 (48%), Gaps = 12/123 (9%)

Query: 61  SSAQTQLDLLEQLTSTSSSLEGYESDGSSRK-------LTIRDQL-AQLVGDRDDEFSIP 112
           SSA+   DLL  +   +SS E  +S             LT+ ++L AQ    ++ E   P
Sbjct: 157 SSAEAADDLLSAIVQAASSGESMDSGDEGGASAGGRGGLTVAEKLEAQYRQQQEAEGGAP 216

Query: 113 LGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLT 172
                   + + LT  +KR  RR+ Y  + S+RND  FFA + A  +LPP IIL  A  +
Sbjct: 217 EAAQ----AWQALTALEKRQKRRKEYFEQASERNDGPFFAGVAALFVLPPAIILLAASAS 272

Query: 173 GYV 175
           GY+
Sbjct: 273 GYL 275


>gi|403068063|ref|ZP_10909395.1| hypothetical protein ONdio_00502 [Oceanobacillus sp. Ndiop]
          Length = 431

 Score = 36.2 bits (82), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 6/87 (6%)

Query: 59  SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLK 118
           S ++   QL L  Q      ++   E  G++  +T+ D L ++VGD  DEF      +++
Sbjct: 296 SEATPIKQLLLKMQKERIHMAIVNDEYGGTAGLVTVEDILEEIVGDIRDEFDEDEVPDIE 355

Query: 119 KVSEKFLTISQKRNIRRQAYLNEVSQR 145
           +V EK L +S K N      L EV+QR
Sbjct: 356 EVDEKTLLLSGKLN------LEEVNQR 376


>gi|68299776|ref|NP_001018874.1| protein C-ets-2 [Danio rerio]
 gi|63100226|gb|AAH95899.1| Zgc:92106 [Danio rerio]
          Length = 439

 Score = 35.8 bits (81), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 12/123 (9%)

Query: 22  LQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLD--LLEQLT-STSS 78
           LQ    S K + Q +    E G  V     PAA L+G + S   QL   LLE LT ST  
Sbjct: 294 LQKHGRSFKDYVQERNQTQETGRPVI----PAAVLAGFTGSGPIQLWQFLLELLTDSTCQ 349

Query: 79  SLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSI---PLGKNLKKVSEKFLTISQKRNIRR 135
           S+  +  DG   KLT  D++A+  G R ++  +    L + L+   +K   I  K + +R
Sbjct: 350 SIISWTGDGWEFKLTDPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDK--NIIHKTSGKR 407

Query: 136 QAY 138
             Y
Sbjct: 408 YVY 410


>gi|367017824|ref|XP_003683410.1| hypothetical protein TDEL_0H03400 [Torulaspora delbrueckii]
 gi|359751074|emb|CCE94199.1| hypothetical protein TDEL_0H03400 [Torulaspora delbrueckii]
          Length = 753

 Score = 35.8 bits (81), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 58/134 (43%), Gaps = 5/134 (3%)

Query: 47  TEEDPPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRD 106
           +  +P   S      S   + DLL+ L+ +S  LEG      SRK   R +   +     
Sbjct: 34  SHNEPLNQSDEREGHSPNVKQDLLDALSYSSKILEGMGHRDESRKRFKRQKQDSIKMKIP 93

Query: 107 DEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLN-EVSQRNDSVFFATIGAFVILPPFII 165
               I +GK+ KKV+    TIS KR   +   +N + S +ND+   A +   VI PP + 
Sbjct: 94  PAIDIAVGKSRKKVTNTTFTISSKRT--KHVTINSDGSIQNDNR--AKLTPHVIPPPLVS 149

Query: 166 LGIAVLTGYVQLFP 179
               VL   V L P
Sbjct: 150 GLDRVLYQPVTLHP 163


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.132    0.364 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,485,427,283
Number of Sequences: 23463169
Number of extensions: 89152430
Number of successful extensions: 422508
Number of sequences better than 100.0: 67
Number of HSP's better than 100.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 422424
Number of HSP's gapped (non-prelim): 80
length of query: 179
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 46
effective length of database: 9,238,593,890
effective search space: 424975318940
effective search space used: 424975318940
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 72 (32.3 bits)