BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 032332
(143 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255540207|ref|XP_002511168.1| conserved hypothetical protein [Ricinus communis]
gi|223550283|gb|EEF51770.1| conserved hypothetical protein [Ricinus communis]
Length = 196
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/162 (59%), Positives = 107/162 (66%), Gaps = 24/162 (14%)
Query: 1 MNDELCNPTLWS------------TTTTTTTTSTSNGVVHADSTSDAN-NHHLDSSGGSG 47
MNDELCNPTLWS + + + + NGVV+ DS SDA HH D +G
Sbjct: 40 MNDELCNPTLWSTTSSSNNTTSPISAPANSNSLSQNGVVNGDSVSDAVVAHHFDGTGTGV 99
Query: 48 NS------ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDK 101
ALDEAR RYK SVAALR VL AIPNSHKA SFE SS S +DE +++K
Sbjct: 100 VGGGTGNGALDEARFRYKNSVAALREVLAAIPNSHKAISFETSSS-----SPADEADVEK 154
Query: 102 LEERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
LEE+AS LRKEL NKN YLK LIDQLRDLITDISTWQSPCSV
Sbjct: 155 LEEQASNLRKELFNKNAYLKLLIDQLRDLITDISTWQSPCSV 196
>gi|224133984|ref|XP_002321708.1| predicted protein [Populus trichocarpa]
gi|222868704|gb|EEF05835.1| predicted protein [Populus trichocarpa]
Length = 177
Score = 154 bits (390), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 98/158 (62%), Positives = 107/158 (67%), Gaps = 22/158 (13%)
Query: 1 MNDELCNPTLWSTTTTT--------TTTSTSNGVVHADSTSDANNHHLDSS-------GG 45
MNDELCNPTLWSTT+T T+ + NG V DAN DS+ GG
Sbjct: 27 MNDELCNPTLWSTTSTATTAVTSPITSKNAQNGAVVITVNGDANAAVSDSTHELSGGGGG 86
Query: 46 SGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEER 105
GN ALDEAR RYK SVAALR VL AIPNSHKAK FE SSP +DE EI KLEER
Sbjct: 87 VGNGALDEARFRYKNSVAALRDVLAAIPNSHKAKPFE-TSSP------ADEAEIKKLEER 139
Query: 106 ASLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
AS LRKELA KN Y+K LIDQLR++ITDISTWQ+PCSV
Sbjct: 140 ASSLRKELAKKNTYVKLLIDQLREIITDISTWQTPCSV 177
>gi|225456359|ref|XP_002283991.1| PREDICTED: uncharacterized protein LOC100264705 [Vitis vinifera]
gi|147789866|emb|CAN73869.1| hypothetical protein VITISV_001275 [Vitis vinifera]
gi|297734439|emb|CBI15686.3| unnamed protein product [Vitis vinifera]
Length = 174
Score = 153 bits (386), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/145 (64%), Positives = 114/145 (78%), Gaps = 6/145 (4%)
Query: 1 MNDELCNPTLWSTTT--TTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHRY 58
MNDELCNPTLWST T +S S+ ++ D++SD + HH++ +GG ALD+AR RY
Sbjct: 34 MNDELCNPTLWSTPPPPATAASSHSSASLNGDASSDFS-HHMEIAGGG---ALDDARLRY 89
Query: 59 KTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNK 118
K+SVA+LRAVL AI +S KAK+FE S+ S S++D+ EI+KLEERAS LRKELA+KNK
Sbjct: 90 KSSVASLRAVLNAIQSSQKAKAFEASSAASGSASQADQAEIEKLEERASNLRKELADKNK 149
Query: 119 YLKRLIDQLRDLITDISTWQSPCSV 143
YLK LIDQLRDLITDISTWQSPCSV
Sbjct: 150 YLKLLIDQLRDLITDISTWQSPCSV 174
>gi|18424713|ref|NP_568973.1| uncharacterized protein [Arabidopsis thaliana]
gi|75171739|sp|Q9FMV4.1|MED30_ARATH RecName: Full=Mediator of RNA polymerase II transcription subunit
30
gi|9758289|dbj|BAB08813.1| unnamed protein product [Arabidopsis thaliana]
gi|190886513|gb|ACE95179.1| At1g17730 [Arabidopsis thaliana]
gi|332010374|gb|AED97757.1| uncharacterized protein [Arabidopsis thaliana]
Length = 189
Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 88/157 (56%), Positives = 107/157 (68%), Gaps = 20/157 (12%)
Query: 1 MNDELCNPTLWSTTTT-----TTTTSTSNGVVHADSTS-DANNHHLDSSGGSG-----NS 49
MNDELCNP+LWST+ T TTT S + +V AD+ + D +HH +S+GG G NS
Sbjct: 39 MNDELCNPSLWSTSATPSSAATTTGSNGSALVSADAAAIDGTSHHSESAGGGGGGGSGNS 98
Query: 50 ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEM---VSSPVDSVSRSDEVEIDKLEERA 106
LDEA RYK SV +LRAVL AIPNS KAK+ EM + SP E EI+KLEE+A
Sbjct: 99 VLDEASLRYKNSVTSLRAVLAAIPNSQKAKASEMQNGLGSP------ESEDEIEKLEEQA 152
Query: 107 SLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
LR E+A KN ++K LID+LR+LI DISTWQSPCSV
Sbjct: 153 LSLRMEIAKKNVHVKELIDKLRELIADISTWQSPCSV 189
>gi|449440544|ref|XP_004138044.1| PREDICTED: mediator of RNA polymerase II transcription subunit
30-like [Cucumis sativus]
gi|449532651|ref|XP_004173294.1| PREDICTED: mediator of RNA polymerase II transcription subunit
30-like [Cucumis sativus]
Length = 198
Score = 147 bits (370), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 86/161 (53%), Positives = 111/161 (68%), Gaps = 20/161 (12%)
Query: 1 MNDELCNPTLWS---------------TTTTTTTTSTSNGVVHADSTSDANNHHLDSSGG 45
MNDELCNPTLWS + + ++ ++NGVV+ D++S++ +HH DSSGG
Sbjct: 40 MNDELCNPTLWSTTSSTATTATSGLTIASPSGPSSHSANGVVNGDASSESTSHHTDSSGG 99
Query: 46 SGNS---ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKL 102
G+ AL+EAR RYK SV ALRA+L AIPNS KAK++E +S S +D+ EIDKL
Sbjct: 100 GGSGSGGALEEARVRYKNSVLALRAILAAIPNSQKAKAYESGTS--GHASPADQSEIDKL 157
Query: 103 EERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
E+ S LRKEL KN ++K LIDQLRDL+ D+STWQSPCSV
Sbjct: 158 EQHLSNLRKELVVKNLHIKLLIDQLRDLVADVSTWQSPCSV 198
>gi|224119684|ref|XP_002318134.1| predicted protein [Populus trichocarpa]
gi|118486707|gb|ABK95190.1| unknown [Populus trichocarpa]
gi|222858807|gb|EEE96354.1| predicted protein [Populus trichocarpa]
Length = 185
Score = 146 bits (368), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 96/149 (64%), Positives = 105/149 (70%), Gaps = 13/149 (8%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSS------GGSGNSALDEA 54
MNDELCNPTLWSTT+T+T+T TS DAN DS+ GG GN ALDEA
Sbjct: 44 MNDELCNPTLWSTTSTSTSTVTSPITSSNAQNGDANAAVSDSTLELNGGGGVGNGALDEA 103
Query: 55 RHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELA 114
R RYK SVAALR VL AIPNSHKAK FE SSP +D+ EI+KLEERAS LRKELA
Sbjct: 104 RFRYKNSVAALRDVLAAIPNSHKAKPFE-TSSP------ADQAEIEKLEERASNLRKELA 156
Query: 115 NKNKYLKRLIDQLRDLITDISTWQSPCSV 143
KN Y+K LIDQLR+LI DISTWQ P SV
Sbjct: 157 KKNTYVKLLIDQLRELIADISTWQGPYSV 185
>gi|356507364|ref|XP_003522437.1| PREDICTED: uncharacterized protein LOC100788627 [Glycine max]
Length = 164
Score = 144 bits (364), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 84/144 (58%), Positives = 100/144 (69%), Gaps = 14/144 (9%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDEA--RHRY 58
MNDELCNP LWST+ +T+ +AD+TSD +NHH D+S S + R RY
Sbjct: 32 MNDELCNPVLWSTSPSTSP--------NADATSDNSNHHADASAASAAAGGALEEARARY 83
Query: 59 KTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNK 118
K +V ALR +LTAIP S K K+F+ S+ S +DE EIDKLEERAS LR+ELANKN
Sbjct: 84 KKAVTALRTILTAIPTSQKEKTFDAGSA----ASPADEAEIDKLEERASSLRRELANKNL 139
Query: 119 YLKRLIDQLRDLITDISTWQSPCS 142
+LK LIDQLRDLITDISTWQSP S
Sbjct: 140 HLKTLIDQLRDLITDISTWQSPFS 163
>gi|297793927|ref|XP_002864848.1| hypothetical protein ARALYDRAFT_496535 [Arabidopsis lyrata subsp.
lyrata]
gi|297310683|gb|EFH41107.1| hypothetical protein ARALYDRAFT_496535 [Arabidopsis lyrata subsp.
lyrata]
Length = 178
Score = 144 bits (362), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/157 (54%), Positives = 106/157 (67%), Gaps = 20/157 (12%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNG-----VVHADSTS-DANNHHLDSSGGSG-----NS 49
MNDELCNP+LWST+ T ++ +T+ G +V D+ + D +HH +S GG G NS
Sbjct: 28 MNDELCNPSLWSTSATPSSAATATGSNGSALVSVDAAAIDGTSHHSESGGGGGGGGSGNS 87
Query: 50 ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEM---VSSPVDSVSRSDEVEIDKLEERA 106
LDEA RYK SV +LRAVL AIPNSHKAK+ EM SP E EI+KLEE+A
Sbjct: 88 VLDEASLRYKNSVTSLRAVLAAIPNSHKAKASEMENGFGSP------ESEDEIEKLEEQA 141
Query: 107 SLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
LR E+A KN ++K LID+LR+LI DISTWQSPCSV
Sbjct: 142 LSLRMEIAKKNVHVKELIDKLRELIADISTWQSPCSV 178
>gi|51969384|dbj|BAD43384.1| unknown protein [Arabidopsis thaliana]
Length = 184
Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 88/157 (56%), Positives = 106/157 (67%), Gaps = 20/157 (12%)
Query: 1 MNDELCNPTLWSTTTT-----TTTTSTSNGVVHADSTS-DANNHHLDSSGGSG-----NS 49
MNDELCNP+LWST+ T TTT S + +V AD+ + D +HH +S+GG G NS
Sbjct: 34 MNDELCNPSLWSTSATPSSAATTTGSNGSALVSADAAAIDGTSHHSESAGGGGGEGSGNS 93
Query: 50 ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEM---VSSPVDSVSRSDEVEIDKLEERA 106
LDEA RYK SV +LRAVL AIPNS KAK+ EM + SP E EI+KLEE+A
Sbjct: 94 VLDEASLRYKNSVTSLRAVLAAIPNSQKAKASEMQNGLGSP------ESEDEIEKLEEQA 147
Query: 107 SLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
LR E+A KN ++K LID LR+LI DISTWQSPCSV
Sbjct: 148 LSLRMEIAKKNVHVKELIDILRELIADISTWQSPCSV 184
>gi|356516539|ref|XP_003526951.1| PREDICTED: uncharacterized protein LOC100816835 [Glycine max]
Length = 168
Score = 137 bits (346), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/144 (56%), Positives = 103/144 (71%), Gaps = 12/144 (8%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSA--LDEARHRY 58
MNDELCNP LWST+ + +++ +AD+TS+ +NHH D++ + L++AR RY
Sbjct: 34 MNDELCNPVLWSTSPSASSSP------NADATSENSNHHADATTAAAGGGGALEQARSRY 87
Query: 59 KTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNK 118
K +VAALR +LTAIPNS KA + + + S +DE EIDKLEERAS LR+ELANKN
Sbjct: 88 KKAVAALRTILTAIPNSQKANA----FNASSAASPADEAEIDKLEERASSLRRELANKNL 143
Query: 119 YLKRLIDQLRDLITDISTWQSPCS 142
+LK LIDQLRDLITDISTW SP S
Sbjct: 144 HLKTLIDQLRDLITDISTWHSPFS 167
>gi|21553739|gb|AAM62832.1| unknown [Arabidopsis thaliana]
Length = 178
Score = 129 bits (323), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 85/154 (55%), Positives = 105/154 (68%), Gaps = 14/154 (9%)
Query: 1 MNDELCNPTLWSTTTT-----TTTTSTSNGVVHADSTS-DANNHHLDSSGGSG-----NS 49
MNDELCNP+LWST+ T TTT S + +V AD+ + D +HH +S+GG G NS
Sbjct: 28 MNDELCNPSLWSTSATPSSAATTTGSNGSALVSADAAAIDGTSHHSESAGGGGGGGSGNS 87
Query: 50 ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLL 109
LDEA RYK SV +LRAVL AIPNS KAK+ EM + + S R DE+E + E A L
Sbjct: 88 VLDEASLRYKNSVTSLRAVLAAIPNSQKAKASEM-QNGLGSPEREDEIEKLE--EEALSL 144
Query: 110 RKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
R E+A KN ++K LID+LR+LI DISTWQSPCSV
Sbjct: 145 RMEIAKKNVHVKELIDKLRELIADISTWQSPCSV 178
>gi|326520639|dbj|BAJ88360.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 177
Score = 124 bits (311), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/156 (49%), Positives = 95/156 (60%), Gaps = 25/156 (16%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVH-------------ADSTSDANNHHLDSSGGSG 47
MNDELCNP LWS++ T T T+ S H ADS +DA + + G
Sbjct: 34 MNDELCNPALWSSSATATATAASAASQHPHHQNAAPPPPHSADSDADA----MGGAAGGS 89
Query: 48 NSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERAS 107
+LDEARHRYK +VAALRA + A+ S P + S+ D+ EI++LEE AS
Sbjct: 90 GGSLDEARHRYKIAVAALRASIAAVSPS--------TQEPGPTESKGDQAEIERLEEHAS 141
Query: 108 LLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
LRKE+ +KNK LK LIDQLRDLI+DIS WQSPCSV
Sbjct: 142 SLRKEIGSKNKQLKLLIDQLRDLISDISMWQSPCSV 177
>gi|388521333|gb|AFK48728.1| unknown [Medicago truncatula]
Length = 179
Score = 121 bits (304), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 81/147 (55%), Positives = 99/147 (67%), Gaps = 13/147 (8%)
Query: 1 MNDELCNPTLWSTTTTTTTTS---TSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHR 57
MNDELCNP WST+ ++ T+ +SNG A+S + + + G AL EAR R
Sbjct: 40 MNDELCNPVWWSTSPSSATSPNAPSSNG--DANSENSGQHADGAAPSGGAGGALGEARLR 97
Query: 58 YKTSVAALRAVLTAIPNSHKAKSFEMVS--SPVDSVSRSDEVEIDKLEERASLLRKELAN 115
YK +VA LR+VL AIPNS KA +F+ S SP+D E EI+KLEE+AS LRKEL N
Sbjct: 98 YKDAVAGLRSVLAAIPNSQKANTFDGGSADSPMD------EAEIEKLEEQASSLRKELGN 151
Query: 116 KNKYLKRLIDQLRDLITDISTWQSPCS 142
KN +LK LIDQLR+LITDISTWQSP S
Sbjct: 152 KNLHLKILIDQLRELITDISTWQSPFS 178
>gi|226533480|ref|NP_001145600.1| uncharacterized protein LOC100279078 [Zea mays]
gi|195658667|gb|ACG48801.1| hypothetical protein [Zea mays]
gi|413918133|gb|AFW58065.1| hypothetical protein ZEAMMB73_297063 [Zea mays]
Length = 172
Score = 119 bits (299), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/150 (48%), Positives = 96/150 (64%), Gaps = 18/150 (12%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDS----SGG--SGNSALDEA 54
MNDELCN LWS+++ + T + G H S + H +DS +GG S +LDEA
Sbjct: 34 MNDELCNAGLWSSSSVSATAAA--GYQHHHSATLPTPHSVDSDAADAGGAPSPGGSLDEA 91
Query: 55 RHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSV-SRSDEVEIDKLEERASLLRKEL 113
RHRYK++VAALRA ++A+ + + + S S +D EI++LEE AS LRKE+
Sbjct: 92 RHRYKSAVAALRASISAVSS---------CAQDIGSTESEADHAEIERLEEHASALRKEI 142
Query: 114 ANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
+KNK +K LIDQLRDLI DI+ WQSPCSV
Sbjct: 143 ESKNKQVKLLIDQLRDLIADIAMWQSPCSV 172
>gi|357163076|ref|XP_003579617.1| PREDICTED: uncharacterized protein LOC100837441 [Brachypodium
distachyon]
Length = 177
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 69/153 (45%), Positives = 90/153 (58%), Gaps = 19/153 (12%)
Query: 1 MNDELCNPTLWSTTTTTTTTST---SNGVVHADSTSDANNHHLDSS------GGSGNSAL 51
MNDELCNP LWS++ T + + + H + N D S +L
Sbjct: 34 MNDELCNPVLWSSSATASAAAAAGSQHPHQHQAAPPPPNCADSDVSDAAGGGAAGSGGSL 93
Query: 52 DEARHRYKTSVAALRAVLTAI-PNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLR 110
DEARHRYK +VAALRA ++A+ P + + S E + D+ E ++LEERAS LR
Sbjct: 94 DEARHRYKIAVAALRASISAVAPCAQELGSTEF---------KGDQAETEQLEERASALR 144
Query: 111 KELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
KE+ +KNK+LK LIDQLR LI+DIS WQSPCS
Sbjct: 145 KEIESKNKHLKLLIDQLRGLISDISMWQSPCSA 177
>gi|242075540|ref|XP_002447706.1| hypothetical protein SORBIDRAFT_06g013860 [Sorghum bicolor]
gi|241938889|gb|EES12034.1| hypothetical protein SORBIDRAFT_06g013860 [Sorghum bicolor]
Length = 172
Score = 111 bits (278), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 66/149 (44%), Positives = 91/149 (61%), Gaps = 16/149 (10%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNS------ALDEA 54
MNDELCN LWS+++ + + S H +T+ H DS +LDEA
Sbjct: 34 MNDELCNAGLWSSSSVSAAAAGSQH--HHSATTPPPPHSADSDAADAGGGPGPGGSLDEA 91
Query: 55 RHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELA 114
RHRYK++VAALRA ++A+ + + + S +D EI++LEE AS LRKE+
Sbjct: 92 RHRYKSAVAALRASISALSSCAQDTG--------STESEADHAEIERLEEHASALRKEIE 143
Query: 115 NKNKYLKRLIDQLRDLITDISTWQSPCSV 143
+KNK +K L+DQLRDLITD++ WQSPCSV
Sbjct: 144 SKNKQVKLLMDQLRDLITDVAMWQSPCSV 172
>gi|357149130|ref|XP_003575010.1| PREDICTED: uncharacterized protein LOC100826158 [Brachypodium
distachyon]
Length = 169
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 90/141 (63%), Gaps = 8/141 (5%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHRYKT 60
+N ELCNP LWS+++++ V A + + + G +LDEARHRYKT
Sbjct: 35 INHELCNPALWSSSSSSQQQEHQPTVPAAPDSHTSADAEGGGGGAGPGGSLDEARHRYKT 94
Query: 61 SVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNKYL 120
+V+ALRA +TA+ +S S E+ P+D + DE I KLEERAS+LRKE+ KNK +
Sbjct: 95 AVSALRASITAVSSS----SQEI--RPLDDKTNKDE--IGKLEERASVLRKEIETKNKNV 146
Query: 121 KRLIDQLRDLITDISTWQSPC 141
K L++QLRD+I D+S WQSPC
Sbjct: 147 KLLMNQLRDMILDVSMWQSPC 167
>gi|293331961|ref|NP_001170098.1| hypothetical protein [Zea mays]
gi|224033425|gb|ACN35788.1| unknown [Zea mays]
gi|414587582|tpg|DAA38153.1| TPA: hypothetical protein ZEAMMB73_493217 [Zea mays]
Length = 176
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 89/154 (57%), Gaps = 22/154 (14%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVV--HADSTSDANNHHLDSSGGSGNS------ALD 52
MNDELCN LWS+++ + + + H S + H DS +LD
Sbjct: 34 MNDELCNAGLWSSSSVSAAAAAAASAGSQHHQSATPPPPHSADSDAADAGGAPGPGGSLD 93
Query: 53 EARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVD---SVSRSDEVEIDKLEERASLL 109
EARHRYK++V ALR ++A+ S P D + S D+ EI++LEE AS L
Sbjct: 94 EARHRYKSAVVALRDSISAVS-----------SCPQDIGSTESEVDQAEIERLEEHASAL 142
Query: 110 RKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
RKE+ +KNK +K L+DQLRDLITD++ WQSPCSV
Sbjct: 143 RKEIESKNKQVKLLMDQLRDLITDVAMWQSPCSV 176
>gi|218194747|gb|EEC77174.1| hypothetical protein OsI_15660 [Oryza sativa Indica Group]
gi|222628762|gb|EEE60894.1| hypothetical protein OsJ_14577 [Oryza sativa Japonica Group]
Length = 197
Score = 98.6 bits (244), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/93 (54%), Positives = 68/93 (73%), Gaps = 8/93 (8%)
Query: 51 LDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLR 110
LDEARHRYK ++AALR + A+ + + EM S+ ++D+ EI++LEE AS LR
Sbjct: 113 LDEARHRYKVAMAALRTSIAAVSSCAQ----EMGSTE----HKADQAEIERLEEHASALR 164
Query: 111 KELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
KE+ +KNK++K LIDQL DLI+DIS WQSPCSV
Sbjct: 165 KEIESKNKHVKLLIDQLHDLISDISMWQSPCSV 197
>gi|357465037|ref|XP_003602800.1| hypothetical protein MTR_3g099130 [Medicago truncatula]
gi|355491848|gb|AES73051.1| hypothetical protein MTR_3g099130 [Medicago truncatula]
Length = 140
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 78/145 (53%), Gaps = 39/145 (26%)
Query: 1 MNDELCNPTLWSTTTTTTTTS---TSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHR 57
MNDELCNP WST+ ++ T+ +SNG A+S + + + G ALDEAR R
Sbjct: 31 MNDELCNPVWWSTSPSSATSPNAPSSNG--DANSENSGQHADGAAPSGGAGGALDEARLR 88
Query: 58 YKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKN 117
YK +VA LR+VL AIPNS +KEL NKN
Sbjct: 89 YKDAVAGLRSVLAAIPNS----------------------------------QKELGNKN 114
Query: 118 KYLKRLIDQLRDLITDISTWQSPCS 142
+LK LIDQLR+LITDISTWQSP S
Sbjct: 115 LHLKILIDQLRELITDISTWQSPFS 139
>gi|302798244|ref|XP_002980882.1| hypothetical protein SELMODRAFT_420385 [Selaginella moellendorffii]
gi|300151421|gb|EFJ18067.1| hypothetical protein SELMODRAFT_420385 [Selaginella moellendorffii]
Length = 143
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 69/108 (63%), Gaps = 6/108 (5%)
Query: 36 NNHHLDSSGG---SGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVS 92
+N L GG +G +A+D+AR +YK + ALR ++ AI N + S ++ S+ +V
Sbjct: 36 SNPALWEGGGKEEAGWAAVDDARQKYKAATTALRGIVAAIFNHPQMTSLDVDSTVEKAV- 94
Query: 93 RSDEVEIDKLEERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSP 140
D+ EID+L+++ LR+E+ KN LK+LI Q+R+L+ DIS WQ+P
Sbjct: 95 --DQAEIDRLQKQTVELREEIVRKNDVLKQLITQMRELVRDISMWQTP 140
>gi|302755929|ref|XP_002961388.1| hypothetical protein SELMODRAFT_437764 [Selaginella moellendorffii]
gi|300170047|gb|EFJ36648.1| hypothetical protein SELMODRAFT_437764 [Selaginella moellendorffii]
Length = 135
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 68/107 (63%), Gaps = 6/107 (5%)
Query: 37 NHHLDSSGG---SGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSR 93
N L GG +G +A+D+AR +YK + ALR ++ AI N + S ++ S+ +V
Sbjct: 29 NPALWEGGGKEEAGWAAVDDARQKYKAATTALRGIVAAIFNHPQMMSLDVDSTVEKAV-- 86
Query: 94 SDEVEIDKLEERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSP 140
D+ EID+L+++ LR+E+ KN LK+LI Q+R+L+ DIS WQ+P
Sbjct: 87 -DQAEIDRLQKQTVELREEIVRKNDVLKQLITQMRELVRDISMWQTP 132
>gi|168037604|ref|XP_001771293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677382|gb|EDQ63853.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2035
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 64/127 (50%), Gaps = 24/127 (18%)
Query: 1 MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNH----------------HLDSSG 44
+N LC+P+ W + + SNG ADS++D N S G
Sbjct: 107 LNRALCSPSFW---VAPSLQAGSNGP--ADSSADGANEGGSVATTAGGESGVTKEKASDG 161
Query: 45 GSGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEE 104
G G +ALD R RY+ + AALRA +TAI +A E+ + V+++ D E++ LE+
Sbjct: 162 GEGLTALDATRFRYRNATAALRATVTAILKQTQAARAEIDGATVENI---DNDEVEYLEQ 218
Query: 105 RASLLRK 111
RA+ LR+
Sbjct: 219 RANDLRE 225
>gi|159486711|ref|XP_001701381.1| hypothetical protein CHLREDRAFT_179302 [Chlamydomonas reinhardtii]
gi|158271683|gb|EDO97497.1| predicted protein [Chlamydomonas reinhardtii]
Length = 60
Score = 38.1 bits (87), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 106 ASLLRKELANKNKYLKRLIDQLRDLITDISTWQS 139
+LL+ EL KN+ LK+LIDQLR ++ + W+S
Sbjct: 12 GALLQAELCLKNEALKQLIDQLRQMLESMCMWES 45
>gi|405970719|gb|EKC35598.1| MAM domain-containing glycosylphosphatidylinositol anchor protein 1
[Crassostrea gigas]
Length = 909
Score = 37.0 bits (84), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 62 VAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNKYLK 121
V ++ + I S KA++ E V VD+V+ ++DK+E+ SLL K L+ +NK +
Sbjct: 146 VTEIKKTMEGIRTSMKAEA-ETVKKLVDTVTSDQLKQVDKIEQ--SLL-KILSGQNKKID 201
Query: 122 RLIDQLRDLITDISTWQSP 140
I+ L DLI+ + SP
Sbjct: 202 DYINYLNDLISTFYGYLSP 220
>gi|401826947|ref|XP_003887566.1| Rad50-like protein [Encephalitozoon hellem ATCC 50504]
gi|392998572|gb|AFM98585.1| Rad50-like protein [Encephalitozoon hellem ATCC 50504]
Length = 1249
Score = 37.0 bits (84), Expect = 2.8, Method: Composition-based stats.
Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 9/93 (9%)
Query: 48 NSALDEARHRYKTSVAALRAVLTAIPNSHK--AKSFEMVSSPVDSVSRSDEVEIDKLEER 105
+S LDEAR + + +L + L + + A++F+ V D R + +E+ LE
Sbjct: 353 SSLLDEARRQREEVFKSLESELLVTKDFRETAAEAFKKVEE--DIRRRKESIEV--LESE 408
Query: 106 ASLLRKELANKNKYL---KRLIDQLRDLITDIS 135
LRKE NK K L KR++D+ RDL+ D S
Sbjct: 409 LLKLRKEDENKMKILSEKKRIVDEYRDLVYDES 441
>gi|260823424|ref|XP_002604183.1| hypothetical protein BRAFLDRAFT_73473 [Branchiostoma floridae]
gi|229289508|gb|EEN60194.1| hypothetical protein BRAFLDRAFT_73473 [Branchiostoma floridae]
Length = 439
Score = 36.2 bits (82), Expect = 4.1, Method: Composition-based stats.
Identities = 30/104 (28%), Positives = 45/104 (43%), Gaps = 1/104 (0%)
Query: 28 HADSTSDANNHHLDSSGGS-GNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSS 86
A +TS N GG GN+A DEA +++ SV R V+ + + ++
Sbjct: 242 QAGATSGNNPGPSGEGGGQPGNAAQDEANRKFERSVHKARTVMILVAVAFVFWFLPLLVV 301
Query: 87 PVDSVSRSDEVEIDKLEERASLLRKELANKNKYLKRLIDQLRDL 130
PV VSR D ++ E A + L N L R D R++
Sbjct: 302 PVMCVSRKDVCPVNGWGEEAFVCLNSLINPIATLIRTPDLRREI 345
>gi|403183522|gb|EAT41141.2| AAEL007186-PA [Aedes aegypti]
Length = 737
Score = 36.2 bits (82), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 61/132 (46%), Gaps = 14/132 (10%)
Query: 7 NPTLWSTTTTTTTTSTSNGVVHADSTSDA--NNHHLD--SSGGSGNSALDEARHRYKTSV 62
NP++ + +T ++ + TS G V +DS+S+ + H D SS G+ D+ +
Sbjct: 431 NPSMDTVSTCSSESVTSFGSVRSDSSSNTITSKSHPDRLSSVGTEPPETDQLQR------ 484
Query: 63 AALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNKYLKR 122
LRA L + S + + F+ D + D E +LRK+ N + L
Sbjct: 485 --LRAELGTV--SEQLRKFQQTKQAQDDIIAEQNARFDATESELKVLRKDYENNQRKLFH 540
Query: 123 LIDQLRDLITDI 134
+ +++ LIT+I
Sbjct: 541 MQTEIKRLITEI 552
>gi|417859223|ref|ZP_12504280.1| methyl-accepting chemotaxis protein [Agrobacterium tumefaciens F2]
gi|338825227|gb|EGP59194.1| methyl-accepting chemotaxis protein [Agrobacterium tumefaciens F2]
Length = 774
Score = 36.2 bits (82), Expect = 5.1, Method: Composition-based stats.
Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 3/84 (3%)
Query: 55 RHRYKTSVAALRAVLTAIPNSHKAKS--FEMVSSPVDSVSRSDEVEIDKLEERASLLRKE 112
RH + +SV LR L ++ NS + + VSS D +SR E + LEE A+ L +
Sbjct: 457 RHDFNSSVVQLREALASVGNSVQTVNGGAHEVSSASDDLSRRTEQQAASLEETAAALEQI 516
Query: 113 LANKNKYLKRLIDQLRDLITDIST 136
AN + KR D RD + + T
Sbjct: 517 TANVSATSKRTGDA-RDTVREART 539
>gi|384135188|ref|YP_005517902.1| GerA spore germination protein [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
gi|339289273|gb|AEJ43383.1| GerA spore germination protein [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
Length = 482
Score = 35.8 bits (81), Expect = 5.6, Method: Composition-based stats.
Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 17/120 (14%)
Query: 11 WSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDE---ARHRYKTSVAALRA 67
WST T G+V AD +A D+ G G + A + +T AA+RA
Sbjct: 20 WSTQRDTGVVVWLEGLVDADKLQEALLALRDAGGARGRREIPHPGVATQQVETLSAAVRA 79
Query: 68 VL-----TAIPNSHKAKSFEMVSSPVDSVSRSD---------EVEIDKLEERASLLRKEL 113
VL A + A+++ + P SV RS+ E I++L +L+RK L
Sbjct: 80 VLDGCAVIARDGASSAQAWRLDRVPARSVDRSENEPTLHGPQEAFIERLSVNLALVRKRL 139
>gi|344272772|ref|XP_003408205.1| PREDICTED: mediator of RNA polymerase II transcription subunit
30-like [Loxodonta africana]
Length = 182
Score = 35.4 bits (80), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 13/63 (20%)
Query: 87 PVDSVSRSDEVEIDKLEERASLLR-------------KELANKNKYLKRLIDQLRDLITD 133
PV+ + E + K ++RASL R K+L KN+ LK+++DQLR+LI D
Sbjct: 114 PVEQLIPYVEEDGSKNDDRASLPRFASEERREIAEVNKKLKQKNQQLKQIMDQLRNLIWD 173
Query: 134 IST 136
I+T
Sbjct: 174 INT 176
>gi|427795653|gb|JAA63278.1| Putative mediator of rna polymerase ii transcription subunit
30-like protein, partial [Rhipicephalus pulchellus]
Length = 246
Score = 35.4 bits (80), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 26/36 (72%)
Query: 102 LEERASLLRKELANKNKYLKRLIDQLRDLITDISTW 137
LEERA L +++ KN+ LK +ID LR++I +I+T
Sbjct: 192 LEERARRLTEQVVLKNRQLKEVIDALRNIIWEINTM 227
>gi|375108245|ref|ZP_09754506.1| adenylate/guanylate cyclase family protein [Burkholderiales
bacterium JOSHI_001]
gi|374668976|gb|EHR73761.1| adenylate/guanylate cyclase family protein [Burkholderiales
bacterium JOSHI_001]
Length = 1022
Score = 35.4 bits (80), Expect = 7.7, Method: Composition-based stats.
Identities = 31/102 (30%), Positives = 45/102 (44%), Gaps = 19/102 (18%)
Query: 30 DSTSDANNHHLDSSGGS---------------GNSALDEARHRYKTSVAALRAVLTAIPN 74
D DA HH D+ G S SALD AR Y+ ++AAL + +PN
Sbjct: 561 DEHLDALAHHFDAGGESTLAARYAERAGDRALAASALDRARGHYRVALAALDRL---VPN 617
Query: 75 SHKAKSFEMVSSPVDSVSRSDEVEID-KLEERASLLRKELAN 115
+A + ++ + VS D D L RA +L +E A+
Sbjct: 618 GDRALRWIAIAQRLGMVSLYDAKRADLALARRALVLAEEQAD 659
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.121 0.335
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,139,297,311
Number of Sequences: 23463169
Number of extensions: 75171035
Number of successful extensions: 636434
Number of sequences better than 100.0: 433
Number of HSP's better than 100.0 without gapping: 125
Number of HSP's successfully gapped in prelim test: 308
Number of HSP's that attempted gapping in prelim test: 632483
Number of HSP's gapped (non-prelim): 3086
length of query: 143
length of database: 8,064,228,071
effective HSP length: 107
effective length of query: 36
effective length of database: 9,848,636,284
effective search space: 354550906224
effective search space used: 354550906224
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 71 (32.0 bits)