BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 030344
(179 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255562170|ref|XP_002522093.1| conserved hypothetical protein [Ricinus communis]
gi|223538692|gb|EEF40293.1| conserved hypothetical protein [Ricinus communis]
Length = 192
Score = 226 bits (576), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 134/200 (67%), Positives = 148/200 (74%), Gaps = 29/200 (14%)
Query: 1 MALNFIAPSP---------------FVIPKHKHLFKLQSQTTSC-KTFAQSQRIGAEGGG 44
MALN SP F+IPKHKHLFK +SC K FA+S+ GAE G
Sbjct: 1 MALNISPASPCQISSCSSSSANNSHFLIPKHKHLFKFY--YSSCRKIFAKSE--GAESG- 55
Query: 45 GVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSSS---LEGYESDGSSRKLTIRDQLAQL 101
+ E+D P SGS SS +TQLDLL+QLTS SSS GYESDGSS KLTIR QLA+L
Sbjct: 56 -IKEQDLPP--FSGSLSSTRTQLDLLDQLTSNSSSSPDTVGYESDGSSYKLTIRQQLAKL 112
Query: 102 VGDRDD--EFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVI 159
VG+RD+ EFSIPLGKNLKK S KFLTISQKRNIRRQAYL+EVSQRNDSVFFATIGAFVI
Sbjct: 113 VGERDEDEEFSIPLGKNLKKFSPKFLTISQKRNIRRQAYLDEVSQRNDSVFFATIGAFVI 172
Query: 160 LPPFIILGIAVLTGYVQLFP 179
LPP +ILGIA+LTGYVQLFP
Sbjct: 173 LPPIVILGIAILTGYVQLFP 192
>gi|356501461|ref|XP_003519543.1| PREDICTED: uncharacterized protein LOC100776849 [Glycine max]
Length = 191
Score = 224 bits (572), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 120/169 (71%), Positives = 135/169 (79%), Gaps = 6/169 (3%)
Query: 11 FVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLL 70
F+ KH +L K +S T +Q G + G VT++D +A GS SS++ QLDLL
Sbjct: 29 FLTHKHNNLIKFYHPFSSLLTTC-AQTHGTDTG--VTQKDDASA---GSLSSSRAQLDLL 82
Query: 71 EQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQK 130
EQLTSTSS GYESDGSSRKLTIR+QL QL G+RDD+F+IPLGKNLKKVS KFLTISQK
Sbjct: 83 EQLTSTSSPNSGYESDGSSRKLTIREQLVQLFGERDDDFTIPLGKNLKKVSAKFLTISQK 142
Query: 131 RNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
RNIRRQ YLNEVSQRNDSVFFATIGAFVILPPFIILGIA+LTGYVQLFP
Sbjct: 143 RNIRRQTYLNEVSQRNDSVFFATIGAFVILPPFIILGIAILTGYVQLFP 191
>gi|356553792|ref|XP_003545236.1| PREDICTED: uncharacterized protein LOC100778027 [Glycine max]
Length = 178
Score = 222 bits (565), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 121/170 (71%), Positives = 136/170 (80%), Gaps = 9/170 (5%)
Query: 11 FVIPKHKHLFKLQSQTTSC-KTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDL 69
F+ KH +L K+ ++S T AQ+Q GVT+ED A + GS SS++ QLDL
Sbjct: 17 FLTHKHNNLIKIYHPSSSLLTTCAQTQGTDT----GVTQEDASAGN--GSLSSSRAQLDL 70
Query: 70 LEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQ 129
LEQLTSTSS GYESDGS KLTIR+QL QL G+RDD+F+IPLGKNLKKVS KFLTISQ
Sbjct: 71 LEQLTSTSSPNSGYESDGS--KLTIREQLEQLFGERDDDFTIPLGKNLKKVSAKFLTISQ 128
Query: 130 KRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
KRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIA+LTGYVQLFP
Sbjct: 129 KRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAILTGYVQLFP 178
>gi|225446815|ref|XP_002283386.1| PREDICTED: uncharacterized protein LOC100243650 isoform 1 [Vitis
vinifera]
gi|359485353|ref|XP_003633263.1| PREDICTED: uncharacterized protein LOC100243650 isoform 2 [Vitis
vinifera]
gi|302143535|emb|CBI22096.3| unnamed protein product [Vitis vinifera]
Length = 190
Score = 218 bits (556), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 132/171 (77%), Positives = 147/171 (85%), Gaps = 6/171 (3%)
Query: 11 FVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPP-AASLSGSSSSAQTQLDL 69
F+IPKH+ LFKL S ++ +T AQS G G GVT+EDPP A+LS S SSA++QLDL
Sbjct: 24 FLIPKHR-LFKLCSSSSCFRTSAQS---GGTEGRGVTQEDPPLPAALSESPSSARSQLDL 79
Query: 70 LEQLTSTSSSLEG-YESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTIS 128
LEQLTST+ SLEG YESDG SR LTIRDQLAQLVG+RDD+F IPLGKNLKKVS KFLTIS
Sbjct: 80 LEQLTSTTPSLEGGYESDGMSRNLTIRDQLAQLVGERDDDFIIPLGKNLKKVSPKFLTIS 139
Query: 129 QKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
QKRNI+RQAYLNEVSQRNDSVFFATIGAFVILPP IILGIA++TGYVQLFP
Sbjct: 140 QKRNIKRQAYLNEVSQRNDSVFFATIGAFVILPPIIILGIAIITGYVQLFP 190
>gi|388509082|gb|AFK42607.1| unknown [Medicago truncatula]
gi|388515509|gb|AFK45816.1| unknown [Medicago truncatula]
Length = 178
Score = 210 bits (534), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 116/174 (66%), Positives = 138/174 (79%), Gaps = 8/174 (4%)
Query: 6 IAPSPFVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQT 65
+ PSP + + + K ++ KT AQSQ G E G V+E+ AS GS SS++
Sbjct: 13 LHPSPSFLTRKYNNIKFHHASSLTKTCAQSQ--GTEPG--VSED----ASSIGSLSSSRA 64
Query: 66 QLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFL 125
QLDLLEQ TSTSSS+ GYESDGSS++LTIR+QLAQLVG+RDD+FS+PLGKNLKKVS KFL
Sbjct: 65 QLDLLEQFTSTSSSVIGYESDGSSQRLTIREQLAQLVGERDDDFSLPLGKNLKKVSAKFL 124
Query: 126 TISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
T SQ+RNI+RQ+YLNEVSQRNDS FFATIGAFV+LPP IILGIA+LTGYVQL P
Sbjct: 125 TTSQRRNIKRQSYLNEVSQRNDSTFFATIGAFVLLPPLIILGIAILTGYVQLLP 178
>gi|357494233|ref|XP_003617405.1| hypothetical protein MTR_5g091190 [Medicago truncatula]
gi|355518740|gb|AET00364.1| hypothetical protein MTR_5g091190 [Medicago truncatula]
Length = 173
Score = 210 bits (534), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 116/174 (66%), Positives = 138/174 (79%), Gaps = 8/174 (4%)
Query: 6 IAPSPFVIPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQT 65
+ PSP + + + K ++ KT AQSQ G E G V+E+ AS GS SS++
Sbjct: 8 LHPSPSFLTRKYNNIKFHHASSLTKTCAQSQ--GTEPG--VSED----ASSIGSLSSSRA 59
Query: 66 QLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFL 125
QLDLLEQ TSTSSS+ GYESDGSS++LTIR+QLAQLVG+RDD+FS+PLGKNLKKVS KFL
Sbjct: 60 QLDLLEQFTSTSSSVIGYESDGSSQRLTIREQLAQLVGERDDDFSLPLGKNLKKVSAKFL 119
Query: 126 TISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
T SQ+RNI+RQ+YLNEVSQRNDS FFATIGAFV+LPP IILGIA+LTGYVQL P
Sbjct: 120 TTSQRRNIKRQSYLNEVSQRNDSTFFATIGAFVLLPPLIILGIAILTGYVQLLP 173
>gi|449525854|ref|XP_004169931.1| PREDICTED: uncharacterized LOC101216249 [Cucumis sativus]
Length = 190
Score = 205 bits (521), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 116/161 (72%), Positives = 134/161 (83%), Gaps = 5/161 (3%)
Query: 20 FKLQSQ-TTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSS 78
FK S+ ++S + FA S+ AEG V EE+ +S SGSSSSA+TQLDLLEQL+S S
Sbjct: 34 FKFHSRCSSSSRKFAVSE--SAEGR--VNEEEFLISSSSGSSSSARTQLDLLEQLSSGSQ 89
Query: 79 SLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAY 138
++GYESDGS K TIRDQLAQL DRDD+F++PLGKNLKKVS KFLTISQKRNI+RQAY
Sbjct: 90 LVDGYESDGSYGKTTIRDQLAQLFRDRDDDFTVPLGKNLKKVSAKFLTISQKRNIKRQAY 149
Query: 139 LNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
LNEVSQRNDSVFFAT+GAFVILPP +ILGIA+LTGYVQLFP
Sbjct: 150 LNEVSQRNDSVFFATVGAFVILPPIVILGIAILTGYVQLFP 190
>gi|449453780|ref|XP_004144634.1| PREDICTED: uncharacterized protein LOC101216249 [Cucumis sativus]
Length = 189
Score = 205 bits (521), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 116/161 (72%), Positives = 134/161 (83%), Gaps = 5/161 (3%)
Query: 20 FKLQSQ-TTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSS 78
FK S+ ++S + FA S+ AEG V EE+ +S SGSSSSA+TQLDLLEQL+S S
Sbjct: 33 FKFHSRCSSSSRKFAVSE--SAEGR--VNEEEFLISSSSGSSSSARTQLDLLEQLSSGSQ 88
Query: 79 SLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAY 138
++GYESDGS K TIRDQLAQL DRDD+F++PLGKNLKKVS KFLTISQKRNI+RQAY
Sbjct: 89 LVDGYESDGSYGKTTIRDQLAQLFRDRDDDFTVPLGKNLKKVSAKFLTISQKRNIKRQAY 148
Query: 139 LNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
LNEVSQRNDSVFFAT+GAFVILPP +ILGIA+LTGYVQLFP
Sbjct: 149 LNEVSQRNDSVFFATVGAFVILPPIVILGIAILTGYVQLFP 189
>gi|224066559|ref|XP_002302136.1| predicted protein [Populus trichocarpa]
gi|222843862|gb|EEE81409.1| predicted protein [Populus trichocarpa]
Length = 186
Score = 199 bits (507), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 132/192 (68%), Positives = 149/192 (77%), Gaps = 19/192 (9%)
Query: 1 MALNFIAPSP-----------FVIPKHKHLFKLQSQTTSC-KTFAQSQRIGAEGGGGVTE 48
MALNF A S F+IP HKHL KL + SC KT AQS+ GA GVTE
Sbjct: 1 MALNFSASSLSHVSSSTTGPQFLIPAHKHLLKLYPSSFSCTKTSAQSE--GA--ASGVTE 56
Query: 49 EDPPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDE 108
+DP + S S SS+ +TQLDLLEQLTS + S +GYESDG S KLTIR+QLAQLVG+ D++
Sbjct: 57 QDPSSFSGSLSST--RTQLDLLEQLTSATPSADGYESDGGSGKLTIREQLAQLVGEGDND 114
Query: 109 FSIPLGK-NLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILG 167
FSIPLGK NLKKVS KFLT+SQKRNIRRQAYLN+VSQRNDSVFFATIGAF+ILPP IILG
Sbjct: 115 FSIPLGKKNLKKVSAKFLTVSQKRNIRRQAYLNQVSQRNDSVFFATIGAFIILPPIIILG 174
Query: 168 IAVLTGYVQLFP 179
IA+LTGYVQLFP
Sbjct: 175 IAILTGYVQLFP 186
>gi|297824263|ref|XP_002880014.1| hypothetical protein ARALYDRAFT_321975 [Arabidopsis lyrata subsp.
lyrata]
gi|297325853|gb|EFH56273.1| hypothetical protein ARALYDRAFT_321975 [Arabidopsis lyrata subsp.
lyrata]
Length = 187
Score = 195 bits (496), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 114/190 (60%), Positives = 132/190 (69%), Gaps = 14/190 (7%)
Query: 1 MALNFIAPSPFVIP----------KHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEED 50
MALNF + S I +H HL S + + ++ IG +
Sbjct: 1 MALNFFSTSLHPISNPSSSSRSKQRHHHLLLQLSSPPATRVRSERSEIGVTDNNDSSSSS 60
Query: 51 PPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGYESD-GSSRKLTIRDQLAQLVGDRDDEF 109
+ S SSSS +TQLDLLEQLTSTS +GY SD G SR +TIR+QLA LVGDRDD+F
Sbjct: 61 SSFSGSSSSSSSVRTQLDLLEQLTSTS---DGYLSDAGGSRGITIRNQLAGLVGDRDDDF 117
Query: 110 SIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIA 169
+IPLGKNLKKVS KFLTISQKRNI+RQ+YLNEVSQRNDSVFFATIGAFVILPP +IL IA
Sbjct: 118 TIPLGKNLKKVSPKFLTISQKRNIKRQSYLNEVSQRNDSVFFATIGAFVILPPLVILAIA 177
Query: 170 VLTGYVQLFP 179
+LTGYVQLFP
Sbjct: 178 ILTGYVQLFP 187
>gi|30689207|ref|NP_850380.1| uncharacterized protein [Arabidopsis thaliana]
gi|19347805|gb|AAL86353.1| unknown protein [Arabidopsis thaliana]
gi|22136700|gb|AAM91669.1| unknown protein [Arabidopsis thaliana]
gi|330255099|gb|AEC10193.1| uncharacterized protein [Arabidopsis thaliana]
Length = 187
Score = 193 bits (490), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 99/118 (83%), Positives = 106/118 (89%), Gaps = 4/118 (3%)
Query: 63 AQTQLDLLEQLTSTSSSLEGYESDGS-SRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVS 121
+TQLDLLEQLTSTS +GY SDG SR LTIRDQLA LVGDRDD+FSIPLGKNLKKVS
Sbjct: 73 VRTQLDLLEQLTSTS---DGYLSDGGGSRGLTIRDQLAGLVGDRDDDFSIPLGKNLKKVS 129
Query: 122 EKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
KFLTISQKRNI+RQ+YLNEVSQRNDSVFFATIGAFVILPP +IL IA+LTGYVQLFP
Sbjct: 130 PKFLTISQKRNIKRQSYLNEVSQRNDSVFFATIGAFVILPPLVILAIAILTGYVQLFP 187
>gi|224082572|ref|XP_002306747.1| predicted protein [Populus trichocarpa]
gi|222856196|gb|EEE93743.1| predicted protein [Populus trichocarpa]
Length = 122
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/122 (84%), Positives = 112/122 (91%), Gaps = 1/122 (0%)
Query: 59 SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGK-NL 117
S SS +TQLDLLEQLTSTSS +GYESDGSS KLTIRDQLA+LVGDRDD+FSIPLGK NL
Sbjct: 1 SLSSTRTQLDLLEQLTSTSSPADGYESDGSSGKLTIRDQLARLVGDRDDDFSIPLGKKNL 60
Query: 118 KKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQL 177
KKVS FLT+SQKRNIRRQAYLNEVSQRNDSVFFATIGAF+ILPP +ILGIA++ GYVQL
Sbjct: 61 KKVSANFLTVSQKRNIRRQAYLNEVSQRNDSVFFATIGAFIILPPTLILGIAIIIGYVQL 120
Query: 178 FP 179
FP
Sbjct: 121 FP 122
>gi|414881083|tpg|DAA58214.1| TPA: hypothetical protein ZEAMMB73_738830 [Zea mays]
Length = 188
Score = 140 bits (353), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/175 (46%), Positives = 109/175 (62%), Gaps = 22/175 (12%)
Query: 13 IPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQ 72
+P + Q C+ A+ + G T+E S ++SSA+ QLDLLEQ
Sbjct: 28 VPLKASVAPAQRALGCCRATARQE--------GATQEP------SVAASSARAQLDLLEQ 73
Query: 73 LTS-----TSSSLE-GYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKF-- 124
LTS T++ LE G R+ TIR+QL+ L + DEF++PLGK LK+ ++
Sbjct: 74 LTSATPDDTAAGLENGMPPPEPRRRATIREQLSALANGKVDEFTLPLGKRLKEGLKRLNS 133
Query: 125 LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP IL I VLTGY+QLFP
Sbjct: 134 LTVSQRRNIKRQALLTQVSGRNDSVFFATVGAFVLVPPLAILAIGVLTGYIQLFP 188
>gi|226497328|ref|NP_001145019.1| uncharacterized protein LOC100278190 [Zea mays]
gi|195649997|gb|ACG44466.1| hypothetical protein [Zea mays]
Length = 188
Score = 138 bits (348), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 80/175 (45%), Positives = 108/175 (61%), Gaps = 22/175 (12%)
Query: 13 IPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQ 72
+P + Q C+ A+ + G T+E S ++SSA+ QLDLLEQ
Sbjct: 28 VPLKASVAPAQRSLGCCRATARQE--------GATQEP------SVAASSARAQLDLLEQ 73
Query: 73 LTSTS-----SSLE-GYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKF-- 124
LTS + + LE G R+ TIR+QL+ L + DEF++PLGK LK+ ++
Sbjct: 74 LTSATPDDAAAGLENGMPPPEPRRRATIREQLSTLANGKVDEFTLPLGKRLKEGLKRLNS 133
Query: 125 LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP IL I VLTGY+QLFP
Sbjct: 134 LTVSQRRNIKRQALLTQVSGRNDSVFFATVGAFVLVPPLAILAIGVLTGYIQLFP 188
>gi|224284915|gb|ACN40187.1| unknown [Picea sitchensis]
Length = 195
Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 99/158 (62%), Gaps = 10/158 (6%)
Query: 26 TTSC--KTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGY 83
T SC K ++ GV+E P + + SS Q+QLDLLE+LTS +
Sbjct: 44 TNSCRLKRIVSPPAAASKTESGVSEGSPDESV--STESSVQSQLDLLERLTS----INNK 97
Query: 84 ESDGS--SRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNE 141
E+D + S TIR+QL+ LV D E +IPLGK K + LTISQKRNI+RQ YLN+
Sbjct: 98 ENDQAKVSSTTTIREQLSALVSDEVGEVTIPLGKRFKPNNYNSLTISQKRNIKRQDYLNK 157
Query: 142 VSQRNDSVFFATIGAFVILPPFIILGIAVLTGYVQLFP 179
V+QRND FF+TI FVILPP +ILG+AV TGYV +FP
Sbjct: 158 VAQRNDIPFFSTIALFVILPPVVILGVAVATGYVDIFP 195
>gi|242053923|ref|XP_002456107.1| hypothetical protein SORBIDRAFT_03g030570 [Sorghum bicolor]
gi|241928082|gb|EES01227.1| hypothetical protein SORBIDRAFT_03g030570 [Sorghum bicolor]
Length = 195
Score = 133 bits (334), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/126 (59%), Positives = 93/126 (73%), Gaps = 7/126 (5%)
Query: 61 SSAQTQLDLLEQLTS-TSSSLEGYES---DGSSRKLTIRDQLAQLV-GDRDDEFSIPLGK 115
SSA+ QLDLLEQLTS T + G E+ S ++ TIR QL+ L G DEF++PLG+
Sbjct: 70 SSARAQLDLLEQLTSPTPDGIAGLENGVPTESRQRATIRQQLSALANGKVVDEFTLPLGR 129
Query: 116 NLKK--VSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTG 173
LK+ S LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PPF IL IAV+TG
Sbjct: 130 KLKEGLKSLNKLTVSQRRNIKRQALLTQVSGRNDSVFFATVGAFVLVPPFAILAIAVITG 189
Query: 174 YVQLFP 179
YVQLFP
Sbjct: 190 YVQLFP 195
>gi|125527181|gb|EAY75295.1| hypothetical protein OsI_03186 [Oryza sativa Indica Group]
Length = 187
Score = 132 bits (331), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/126 (59%), Positives = 97/126 (76%), Gaps = 5/126 (3%)
Query: 59 SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSR---KLTIRDQLAQLVGDRDDEFSIPLGK 115
S SSA+TQLDLLEQLT+ +S G E+ + + TIR+QL+ ++GDRD E+++PLGK
Sbjct: 62 SVSSARTQLDLLEQLTTPTSDGIGVENGAPTEPRVQTTIREQLSAVIGDRDGEYTLPLGK 121
Query: 116 NLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTG 173
LK+ +K LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP IL IAVLTG
Sbjct: 122 KLKEGLKKLNSLTVSQRRNIKRQALLTKVSGRNDSVFFATVGAFVLVPPLAILAIAVLTG 181
Query: 174 YVQLFP 179
YVQL P
Sbjct: 182 YVQLLP 187
>gi|115439045|ref|NP_001043802.1| Os01g0666600 [Oryza sativa Japonica Group]
gi|56202163|dbj|BAD73641.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113533333|dbj|BAF05716.1| Os01g0666600 [Oryza sativa Japonica Group]
gi|125571500|gb|EAZ13015.1| hypothetical protein OsJ_02936 [Oryza sativa Japonica Group]
Length = 187
Score = 132 bits (331), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 75/126 (59%), Positives = 97/126 (76%), Gaps = 5/126 (3%)
Query: 59 SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSR---KLTIRDQLAQLVGDRDDEFSIPLGK 115
S SSA+TQLDLLEQLT+ +S G E+ + + TIR+QL+ ++GDRD E+++PLGK
Sbjct: 62 SVSSARTQLDLLEQLTTPTSDGIGVENGAPTEPRVQTTIREQLSAVIGDRDGEYTLPLGK 121
Query: 116 NLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLTG 173
LK+ +K LT+SQ+RNI+RQA L +VS RNDSVFFAT+GAFV++PP IL IAVLTG
Sbjct: 122 KLKEGLKKLNSLTVSQRRNIKRQALLTKVSGRNDSVFFATVGAFVLVPPLAILAIAVLTG 181
Query: 174 YVQLFP 179
YVQL P
Sbjct: 182 YVQLLP 187
>gi|414881082|tpg|DAA58213.1| TPA: hypothetical protein ZEAMMB73_738830 [Zea mays]
Length = 216
Score = 129 bits (323), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 109/203 (53%), Gaps = 50/203 (24%)
Query: 13 IPKHKHLFKLQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLDLLEQ 72
+P + Q C+ A+ + G T+E S ++SSA+ QLDLLEQ
Sbjct: 28 VPLKASVAPAQRALGCCRATARQE--------GATQEP------SVAASSARAQLDLLEQ 73
Query: 73 LTS---------------------------------TSSSLE-GYESDGSSRKLTIRDQL 98
LTS +SS LE G R+ TIR+QL
Sbjct: 74 LTSATPDDTAAGKDPYCTCTQGYKYSVIEPNSHDKRSSSGLENGMPPPEPRRRATIREQL 133
Query: 99 AQLVGDRDDEFSIPLGKNLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGA 156
+ L + DEF++PLGK LK+ ++ LT+SQ+RNI+RQA L +VS RNDSVFFAT+GA
Sbjct: 134 SALANGKVDEFTLPLGKRLKEGLKRLNSLTVSQRRNIKRQALLTQVSGRNDSVFFATVGA 193
Query: 157 FVILPPFIILGIAVLTGYVQLFP 179
FV++PP IL I VLTGY+QLFP
Sbjct: 194 FVLVPPLAILAIGVLTGYIQLFP 216
>gi|357453657|ref|XP_003597109.1| hypothetical protein MTR_2g089780 [Medicago truncatula]
gi|357482675|ref|XP_003611624.1| hypothetical protein MTR_5g016040 [Medicago truncatula]
gi|355486157|gb|AES67360.1| hypothetical protein MTR_2g089780 [Medicago truncatula]
gi|355512959|gb|AES94582.1| hypothetical protein MTR_5g016040 [Medicago truncatula]
Length = 104
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/80 (61%), Positives = 56/80 (70%), Gaps = 17/80 (21%)
Query: 82 GYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNE 141
GYESDGSS++L IR+QL QL KVS FLT SQ+RNI+RQ+YLNE
Sbjct: 15 GYESDGSSQRLRIREQLEQL-----------------KVSATFLTTSQRRNIKRQSYLNE 57
Query: 142 VSQRNDSVFFATIGAFVILP 161
VSQRNDS FFATIGAF +LP
Sbjct: 58 VSQRNDSTFFATIGAFGLLP 77
>gi|168044668|ref|XP_001774802.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673826|gb|EDQ60343.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 280
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 46/81 (56%), Positives = 57/81 (70%)
Query: 93 TIRDQLAQLVGDRDDEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFA 152
TIRDQLA + +E + L K+ + LTISQKRNIRRQ YLN+VS+RND+ FFA
Sbjct: 177 TIRDQLASQIEPEVEEVVVSLVSEEKEPTPPQLTISQKRNIRRQNYLNKVSERNDAPFFA 236
Query: 153 TIGAFVILPPFIILGIAVLTG 173
T+ FV+LPP +ILG AV TG
Sbjct: 237 TVAGFVLLPPLLILGFAVATG 257
>gi|357130670|ref|XP_003566970.1| PREDICTED: uncharacterized protein LOC100823674 [Brachypodium
distachyon]
Length = 179
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 70/106 (66%), Gaps = 6/106 (5%)
Query: 59 SSSSAQTQLDLLEQLTSTSSSLEGYESDG--SSRKLT-IRDQLAQLVGDRDDEFSIPLGK 115
S SSA+TQLDLLEQLTS+ +S +G ES SR LT IR+QL+ L G+R EF++P+GK
Sbjct: 60 SVSSARTQLDLLEQLTSSPTS-DGIESGAPVESRPLTTIREQLSALYGERGGEFTLPMGK 118
Query: 116 NLKKVSEKF--LTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVI 159
LK+ + +T+SQ+RNI+RQA L +V + + G F +
Sbjct: 119 KLKEGLKALNSMTVSQRRNIKRQAMLTKVFSGHLQTYHTHEGTFAL 164
>gi|302763955|ref|XP_002965399.1| hypothetical protein SELMODRAFT_439213 [Selaginella moellendorffii]
gi|300167632|gb|EFJ34237.1| hypothetical protein SELMODRAFT_439213 [Selaginella moellendorffii]
Length = 235
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 58/89 (65%), Gaps = 6/89 (6%)
Query: 93 TIRDQLAQLVGDRDD--EFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVF 150
TI +L G+ ++ +FS+PL + K + IS +RNI+RQ YL++VSQR+DS
Sbjct: 151 TIASKLVDRYGNSEEGSDFSVPLVYSRYKKPK----ISVQRNIKRQEYLDKVSQRDDSGT 206
Query: 151 FATIGAFVILPPFIILGIAVLTGYVQLFP 179
FATIG FV++PP IL +A+ GYV + P
Sbjct: 207 FATIGLFVLVPPLAILAVAIAVGYVNILP 235
>gi|307105097|gb|EFN53348.1| hypothetical protein CHLNCDRAFT_137078 [Chlorella variabilis]
Length = 285
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 60/123 (48%), Gaps = 12/123 (9%)
Query: 61 SSAQTQLDLLEQLTSTSSSLEGYESDGSSRK-------LTIRDQL-AQLVGDRDDEFSIP 112
SSA+ DLL + +SS E +S LT+ ++L AQ ++ E P
Sbjct: 157 SSAEAADDLLSAIVQAASSGESMDSGDEGGASAGGRGGLTVAEKLEAQYRQQQEAEGGAP 216
Query: 113 LGKNLKKVSEKFLTISQKRNIRRQAYLNEVSQRNDSVFFATIGAFVILPPFIILGIAVLT 172
+ + LT +KR RR+ Y + S+RND FFA + A +LPP IIL A +
Sbjct: 217 EAAQ----AWQALTALEKRQKRRKEYFEQASERNDGPFFAGVAALFVLPPAIILLAASAS 272
Query: 173 GYV 175
GY+
Sbjct: 273 GYL 275
>gi|403068063|ref|ZP_10909395.1| hypothetical protein ONdio_00502 [Oceanobacillus sp. Ndiop]
Length = 431
Score = 36.2 bits (82), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 6/87 (6%)
Query: 59 SSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSIPLGKNLK 118
S ++ QL L Q ++ E G++ +T+ D L ++VGD DEF +++
Sbjct: 296 SEATPIKQLLLKMQKERIHMAIVNDEYGGTAGLVTVEDILEEIVGDIRDEFDEDEVPDIE 355
Query: 119 KVSEKFLTISQKRNIRRQAYLNEVSQR 145
+V EK L +S K N L EV+QR
Sbjct: 356 EVDEKTLLLSGKLN------LEEVNQR 376
>gi|68299776|ref|NP_001018874.1| protein C-ets-2 [Danio rerio]
gi|63100226|gb|AAH95899.1| Zgc:92106 [Danio rerio]
Length = 439
Score = 35.8 bits (81), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 12/123 (9%)
Query: 22 LQSQTTSCKTFAQSQRIGAEGGGGVTEEDPPAASLSGSSSSAQTQLD--LLEQLT-STSS 78
LQ S K + Q + E G V PAA L+G + S QL LLE LT ST
Sbjct: 294 LQKHGRSFKDYVQERNQTQETGRPVI----PAAVLAGFTGSGPIQLWQFLLELLTDSTCQ 349
Query: 79 SLEGYESDGSSRKLTIRDQLAQLVGDRDDEFSI---PLGKNLKKVSEKFLTISQKRNIRR 135
S+ + DG KLT D++A+ G R ++ + L + L+ +K I K + +R
Sbjct: 350 SIISWTGDGWEFKLTDPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDK--NIIHKTSGKR 407
Query: 136 QAY 138
Y
Sbjct: 408 YVY 410
>gi|367017824|ref|XP_003683410.1| hypothetical protein TDEL_0H03400 [Torulaspora delbrueckii]
gi|359751074|emb|CCE94199.1| hypothetical protein TDEL_0H03400 [Torulaspora delbrueckii]
Length = 753
Score = 35.8 bits (81), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 58/134 (43%), Gaps = 5/134 (3%)
Query: 47 TEEDPPAASLSGSSSSAQTQLDLLEQLTSTSSSLEGYESDGSSRKLTIRDQLAQLVGDRD 106
+ +P S S + DLL+ L+ +S LEG SRK R + +
Sbjct: 34 SHNEPLNQSDEREGHSPNVKQDLLDALSYSSKILEGMGHRDESRKRFKRQKQDSIKMKIP 93
Query: 107 DEFSIPLGKNLKKVSEKFLTISQKRNIRRQAYLN-EVSQRNDSVFFATIGAFVILPPFII 165
I +GK+ KKV+ TIS KR + +N + S +ND+ A + VI PP +
Sbjct: 94 PAIDIAVGKSRKKVTNTTFTISSKRT--KHVTINSDGSIQNDNR--AKLTPHVIPPPLVS 149
Query: 166 LGIAVLTGYVQLFP 179
VL V L P
Sbjct: 150 GLDRVLYQPVTLHP 163
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.364
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,485,427,283
Number of Sequences: 23463169
Number of extensions: 89152430
Number of successful extensions: 422508
Number of sequences better than 100.0: 67
Number of HSP's better than 100.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 422424
Number of HSP's gapped (non-prelim): 80
length of query: 179
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 46
effective length of database: 9,238,593,890
effective search space: 424975318940
effective search space used: 424975318940
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 72 (32.3 bits)