BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020513
(325 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255547265|ref|XP_002514690.1| conserved hypothetical protein [Ricinus communis]
gi|223546294|gb|EEF47796.1| conserved hypothetical protein [Ricinus communis]
Length = 407
Score = 369 bits (946), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 204/334 (61%), Positives = 253/334 (75%), Gaps = 12/334 (3%)
Query: 1 MNCSQEI-----------DYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLE 49
+NCSQEI DYLQDQLNARN EVYSL EHVH LELKLVDM+ L K+ QL+
Sbjct: 72 LNCSQEIVWISKIITFLTDYLQDQLNARNAEVYSLGEHVHELELKLVDMDDLLVKISQLQ 131
Query: 50 EELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALE 109
EELR+SDSEC LL++EL+ KE L+ S I+KLEES++S L+SQCEIES+K+D++ALE
Sbjct: 132 EELRKSDSECFLLIQELERKEVELQKSVSFIEKLEESVASFTLDSQCEIESMKLDVMALE 191
Query: 110 QTCVEAKKVHKENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGR 169
Q C E+KK +E EK M+ L++EL+ + D++EII+CL+KENKEL+ KL + E NGR
Sbjct: 192 QACCESKKKQEETTMEKDTMDGLVQELKNQVYDAEEIIQCLEKENKELRVKLATSEMNGR 251
Query: 170 VFCQKIEEWMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDAN 229
+F QKIEEWME +D L Q SELE+ +SKE CG+V G L SKLA+VL P+++
Sbjct: 252 IFIQKIEEWMENQDNLLLSTQPYSSELEKE-NMSKEMSACGEVLGLLFSKLAIVLAPESD 310
Query: 230 LKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECK 289
LK+++K +S +I EYE+L+ QLKE+LR EK KAKEEAEDLAQEMAELR+QMT LLEEECK
Sbjct: 311 LKKQMKRLSHKIREYEVLMNQLKEDLREEKLKAKEEAEDLAQEMAELRHQMTGLLEEECK 370
Query: 290 RRACIEQASLQRIAELETQIEKGQNKFVATGRHL 323
RRACIEQASLQRIAELE QI+K Q K R L
Sbjct: 371 RRACIEQASLQRIAELEAQIQKEQRKPSFAIRTL 404
>gi|297733914|emb|CBI15161.3| unnamed protein product [Vitis vinifera]
Length = 420
Score = 359 bits (922), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 201/326 (61%), Positives = 243/326 (74%), Gaps = 4/326 (1%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLL 61
NCSQEIDYLQDQLNAR+ EV L EHVHSLELKL D + L+D VG+L +EL+RS+SEC+L
Sbjct: 92 NCSQEIDYLQDQLNARDAEVKCLGEHVHSLELKLADKDNLEDMVGRLMQELKRSNSECML 151
Query: 62 LMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKE 121
LM+EL++KE L+ S+L I KLEESISS LE QCE+ES+K++MI LEQ+C EAKK+ E
Sbjct: 152 LMQELENKEVELQMSSLCIDKLEESISSVTLEFQCEMESMKLEMITLEQSCFEAKKLQDE 211
Query: 122 NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKI----EE 177
+EK +MN LI+E +V+ QD+Q++IECLDKENKEL+ KL + E + + QKI EE
Sbjct: 212 ASEEKTKMNGLIQEFQVQLQDAQKMIECLDKENKELRGKLKTSEMDAILLRQKIKEHSEE 271
Query: 178 WMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGM 237
W+E +D +L QS ELE F +S E +V L KLA+ D LKEK++ M
Sbjct: 272 WLENKDESELKTQSSSGELESKFNLSTEMSTSAEVLVPLFPKLAVSATSDVGLKEKMEKM 331
Query: 238 SLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQA 297
S QI YELLVKQLKEELR EK KAKEEAEDLAQEMAELRYQ+T +LEEECKRRACIEQA
Sbjct: 332 SHQIHGYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRYQITGMLEEECKRRACIEQA 391
Query: 298 SLQRIAELETQIEKGQNKFVATGRHL 323
SLQRIAELE QI+K Q K A R
Sbjct: 392 SLQRIAELEAQIQKEQTKSYAAIRRF 417
>gi|147853034|emb|CAN78532.1| hypothetical protein VITISV_035305 [Vitis vinifera]
Length = 1164
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 201/330 (60%), Positives = 241/330 (73%), Gaps = 8/330 (2%)
Query: 2 NCSQEI----DYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDS 57
NCSQEI DYLQDQLNAR+ EV L EH HSLELKL D + L+D VG+L EEL+RS+S
Sbjct: 832 NCSQEIVFLVDYLQDQLNARDAEVKCLGEHAHSLELKLADKDNLEDMVGRLMEELKRSNS 891
Query: 58 ECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKK 117
EC+ LM+EL++KE L+ S+L I KLEESISS LE QCEIES+K++MI LEQ+C EAKK
Sbjct: 892 ECMFLMQELENKEVELQTSSLCIDKLEESISSVTLEFQCEIESMKLEMITLEQSCFEAKK 951
Query: 118 VHKENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKI-- 175
+ E +EK +MN LI+E +V+ QD+Q++IECLDKENKEL+ KL + E + + QKI
Sbjct: 952 LQDEASEEKTKMNGLIQEFQVQLQDAQKMIECLDKENKELRGKLKTSEMDAILLRQKIKE 1011
Query: 176 --EEWMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEK 233
EEW+E +D +L QS ELE F +S E +V L KLA+ D LKEK
Sbjct: 1012 HSEEWLENKDESELKTQSSSGELESKFNLSTEMSTSAEVLVPLFPKLAVSATSDVXLKEK 1071
Query: 234 IKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRAC 293
++ MS QI YELLVKQLKEELR EK KAKEEAEDLAQEMAELRYQ+T +LEEECKRRAC
Sbjct: 1072 MEKMSHQIHGYELLVKQLKEELREEKLKAKEEAEDLAQEMAELRYQITGMLEEECKRRAC 1131
Query: 294 IEQASLQRIAELETQIEKGQNKFVATGRHL 323
IEQASLQRIAELE QI+K Q K A R
Sbjct: 1132 IEQASLQRIAELEAQIQKEQTKSYAAIRRF 1161
>gi|449439299|ref|XP_004137423.1| PREDICTED: uncharacterized protein LOC101221046 [Cucumis sativus]
gi|449486970|ref|XP_004157457.1| PREDICTED: uncharacterized protein LOC101230337 [Cucumis sativus]
Length = 390
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 178/324 (54%), Positives = 227/324 (70%), Gaps = 16/324 (4%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLL 61
NC+QEIDYLQDQL RN E+ L +HV SLE KLV ME Q+K +LEEE++RS+SECL
Sbjct: 74 NCTQEIDYLQDQLCTRNTELTYLVDHVESLEFKLVHMEHSQEKASKLEEEVKRSNSECLF 133
Query: 62 LMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKE 121
LM++L KE+ LR S +++KLEESIS+ LESQCEIES+K+DM+A+EQ +E KK +E
Sbjct: 134 LMQKLDDKEQELRESNSNVEKLEESISAITLESQCEIESMKLDMLAMEQRYIETKKFQEE 193
Query: 122 NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEK 181
+ + +M+ LI+EL Q++Q ++ L+ EN+EL+ +LD N FC+ +EE +E
Sbjct: 194 ALSQNDKMDRLIEEL----QNAQRNVKFLETENEELQRELDVSTRNASTFCRSVEELIEN 249
Query: 182 EDRKQLDIQSLVSELERNFTVSKETCF----CGKVFGALLSKLALVLGPDANLKEKIKGM 237
++R Q + RN K T CG V G LL KLA+ L DAN + K+ M
Sbjct: 250 KERSQNTM--------RNDRDGKLTSILKNSCGDVLGHLLPKLAVALFADANSEAKMDVM 301
Query: 238 SLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQA 297
QI +YELLV+QLKEELR EK KAKEEAEDLAQEMAELRYQ+T LLEEECKRRACIEQA
Sbjct: 302 KKQILDYELLVEQLKEELREEKLKAKEEAEDLAQEMAELRYQITGLLEEECKRRACIEQA 361
Query: 298 SLQRIAELETQIEKGQNKFVATGR 321
SLQRIA+LE Q+ KGQN+ R
Sbjct: 362 SLQRIAQLEAQVLKGQNRSFPVAR 385
>gi|356513505|ref|XP_003525454.1| PREDICTED: uncharacterized protein LOC100781747 [Glycine max]
Length = 997
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 178/311 (57%), Positives = 231/311 (74%), Gaps = 2/311 (0%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NCSQEIDYLQDQL+ARN EV L EH+H+LELKL ME LQ++V +L EEL+RS+S+
Sbjct: 684 LNCSQEIDYLQDQLSARNAEVNYLEEHIHNLELKLEGMEDLQEEVFRLREELKRSESKQF 743
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
L++EL +KE+ L SAL I+KLEES SS LESQ E+ES+K+DM+ LEQ+ EAKK+
Sbjct: 744 SLIQELDTKEKELEKSALSIEKLEESFSSITLESQFEVESMKLDMMVLEQSLFEAKKIQD 803
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E + E RM+ I+EL+V QD+Q+II L++E +EL+EKLD+ N R+ QK E W+E
Sbjct: 804 ETLDENNRMSRSIEELQVALQDAQKIIITLNEEIRELEEKLDTANQNSRISSQKDEYWLE 863
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
+DR QL+ QS ++ N T+ +E +V G + +LA++L P A+LK K++ MS Q
Sbjct: 864 NKDRSQLETQSSLNVRGNNSTM-QEDISTYEVCGPHVGRLAMILDPAADLKGKME-MSQQ 921
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I EYE L+K+LKEELR EK KAKEEAEDL QEMAELRYQ T LEEECKRRAC E ASLQ
Sbjct: 922 IQEYECLIKKLKEELREEKLKAKEEAEDLVQEMAELRYQFTGSLEEECKRRACFEHASLQ 981
Query: 301 RIAELETQIEK 311
RIAELE Q+++
Sbjct: 982 RIAELEAQLKR 992
>gi|356564927|ref|XP_003550698.1| PREDICTED: uncharacterized protein LOC100805706 [Glycine max]
Length = 435
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 181/313 (57%), Positives = 228/313 (72%), Gaps = 2/313 (0%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NCSQEIDYLQDQL+A N EV L EH+ SLELKL ME LQ++V +L EEL+RS+S+
Sbjct: 73 LNCSQEIDYLQDQLSASNTEVNYLEEHIRSLELKLEGMEDLQEEVFRLREELKRSNSKHF 132
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
L++EL +KE L SAL I+KLEES SS LESQ E+ES+K+DM+ALEQ+ EAKK+
Sbjct: 133 FLIQELDTKEIELEKSALSIEKLEESFSSITLESQFEVESMKLDMMALEQSLFEAKKIQD 192
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E + E RM+ I+EL+V QD+Q+II L++EN++LKEKLD N R+ QK E W+E
Sbjct: 193 ETLDENNRMSRSIEELQVALQDAQKIIISLNEENRKLKEKLDIANKNSRISSQKDEYWLE 252
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
DR QL+ QS ++ N T+ ++ G V G + +LA++L A+LK K++ MS Q
Sbjct: 253 NNDRLQLETQSSLNGRGNNSTILEDIRTYG-VHGPHVGRLAMILYLAADLKGKME-MSQQ 310
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I EYE L+K+LKEELR EK +AKEEAEDL QEMAELRYQ TS LEEECKRRACIE ASLQ
Sbjct: 311 IQEYECLIKKLKEELREEKLRAKEEAEDLVQEMAELRYQFTSSLEEECKRRACIEHASLQ 370
Query: 301 RIAELETQIEKGQ 313
RIAELE Q GQ
Sbjct: 371 RIAELEAQAATGQ 383
>gi|224135329|ref|XP_002322042.1| predicted protein [Populus trichocarpa]
gi|222869038|gb|EEF06169.1| predicted protein [Populus trichocarpa]
Length = 295
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 177/294 (60%), Positives = 219/294 (74%), Gaps = 11/294 (3%)
Query: 38 MEILQDKVGQLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCE 97
ME LQ GQL EEL+R DSE LLL++EL+SKE L+ SAL I KLEESISS L+SQCE
Sbjct: 1 MEHLQANNGQLREELKRCDSEHLLLLQELESKEIELQESALCIGKLEESISSLTLDSQCE 60
Query: 98 IESLKIDMIALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKEL 157
IES+K+DMIALEQ C +AKK +E +QE ARMN LIKELE + +++E IEC++KEN EL
Sbjct: 61 IESMKLDMIALEQACFKAKKTQEETIQENARMNGLIKELEFQILEAKETIECVEKENIEL 120
Query: 158 KEKLDSYETNGRVFCQKIEEWMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALL 217
++KL + + N ++F Q+IEEW+E +D QL+ QS SE+E +SKE + G
Sbjct: 121 RDKLVTSDVNSKLFLQQIEEWLENKDTSQLNTQSCSSEIEHQSNMSKEM---REALGPCF 177
Query: 218 SKLALVLGPDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELR 277
SKLA +LG ++NLKE ++ MS QI +YE+LVKQLK+ELR EK KAKEEA+DLAQEMAELR
Sbjct: 178 SKLATLLGSESNLKEWMESMSHQIRKYEVLVKQLKDELREEKSKAKEEADDLAQEMAELR 237
Query: 278 YQMTSLLEEECKRRACIEQASLQRIAELETQ--------IEKGQNKFVATGRHL 323
YQMT LLEEECKRRACIEQASLQRI+ELE Q IE+ + KF A HL
Sbjct: 238 YQMTGLLEEECKRRACIEQASLQRISELEAQVFLVFPSKIERERRKFFAAVGHL 291
>gi|297810891|ref|XP_002873329.1| hypothetical protein ARALYDRAFT_487621 [Arabidopsis lyrata subsp.
lyrata]
gi|297319166|gb|EFH49588.1| hypothetical protein ARALYDRAFT_487621 [Arabidopsis lyrata subsp.
lyrata]
Length = 409
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 160/325 (49%), Positives = 229/325 (70%), Gaps = 6/325 (1%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC +EIDYL+DQL R++EV L+EH+H LE KL + L+++V L +EL S SE L
Sbjct: 88 LNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELCMSKSEHL 147
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+SKE L+ S+L ++KLEE+ISS LES CEIES+KID+ ALEQ +A K+ +
Sbjct: 148 LLLQELESKEIELQCSSLSLEKLEETISSLTLESLCEIESMKIDITALEQALFDAMKIQE 207
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QEK ++ +I+E + ++Q +QE ++ ++K+N+EL+EK ++ E + + F Q +E +E
Sbjct: 208 ESIQEKHQLKGIIEESQFQSQRAQENVKYIEKQNEELREKFNASEKSIKEFFQSTKERLE 267
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
ED + L + +EL +S E C F A++ KL L + NL +K++GM+ Q
Sbjct: 268 SEDEEPLTVGCFFAELSHVLPMSNEVRNC---FDAIMKKLE--LSQNVNLTDKVEGMAKQ 322
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I ++E +VKQLKEEL+ EK KAKEEAEDL QEMAELRY+MT LL+EE RR CIEQASLQ
Sbjct: 323 IHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVCIEQASLQ 382
Query: 301 RIAELETQIEKGQNKFVATGRHLPI 325
RIAELE QI K + K ++ LP+
Sbjct: 383 RIAELEAQI-KREIKKPSSTEMLPL 406
>gi|30682143|ref|NP_196406.2| myosin heavy chain-like protein [Arabidopsis thaliana]
gi|79327239|ref|NP_001031851.1| myosin heavy chain-like protein [Arabidopsis thaliana]
gi|222423567|dbj|BAH19753.1| AT5G07890 [Arabidopsis thaliana]
gi|332003833|gb|AED91216.1| myosin heavy chain-like protein [Arabidopsis thaliana]
gi|332003835|gb|AED91218.1| myosin heavy chain-like protein [Arabidopsis thaliana]
Length = 409
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 157/325 (48%), Positives = 227/325 (69%), Gaps = 6/325 (1%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC +EIDYL+DQL R++EV L+EH+H LE KL + L+++V L +EL S SE L
Sbjct: 88 LNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELCMSKSEHL 147
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+SKE L+ S+L ++KLEE+ISS LES CEIES+K+D+ ALEQ +A K+ +
Sbjct: 148 LLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALFDAMKIQE 207
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QEK ++ +I+E + ++Q ++E ++ ++K+N++L+EK + E + + F Q +E +E
Sbjct: 208 ESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQSTKERLE 267
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
ED + L+ +EL VS E C F A++ KL L + NL +K++GM Q
Sbjct: 268 SEDEQPLNAMCFFAELSHVLPVSNEVRNC---FDAIMKKLE--LSQNVNLIDKVEGMGKQ 322
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I ++E +VKQLKEEL+ EK KAKEEAEDL QEMAELRY+MT LL+EE RR CIEQASLQ
Sbjct: 323 IHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVCIEQASLQ 382
Query: 301 RIAELETQIEKGQNKFVATGRHLPI 325
RI+ELE QI++ K A+ LP+
Sbjct: 383 RISELEAQIKRDVKK-PASNEMLPL 406
>gi|79327231|ref|NP_001031850.1| myosin heavy chain-like protein [Arabidopsis thaliana]
gi|332003834|gb|AED91217.1| myosin heavy chain-like protein [Arabidopsis thaliana]
Length = 328
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 157/325 (48%), Positives = 227/325 (69%), Gaps = 6/325 (1%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC +EIDYL+DQL R++EV L+EH+H LE KL + L+++V L +EL S SE L
Sbjct: 7 LNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELCMSKSEHL 66
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+SKE L+ S+L ++KLEE+ISS LES CEIES+K+D+ ALEQ +A K+ +
Sbjct: 67 LLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALFDAMKIQE 126
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QEK ++ +I+E + ++Q ++E ++ ++K+N++L+EK + E + + F Q +E +E
Sbjct: 127 ESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQSTKERLE 186
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
ED + L+ +EL VS E C F A++ KL L + NL +K++GM Q
Sbjct: 187 SEDEQPLNAMCFFAELSHVLPVSNEVRNC---FDAIMKKLE--LSQNVNLIDKVEGMGKQ 241
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I ++E +VKQLKEEL+ EK KAKEEAEDL QEMAELRY+MT LL+EE RR CIEQASLQ
Sbjct: 242 IHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVCIEQASLQ 301
Query: 301 RIAELETQIEKGQNKFVATGRHLPI 325
RI+ELE QI++ K A+ LP+
Sbjct: 302 RISELEAQIKRDVKK-PASNEMLPL 325
>gi|6562303|emb|CAB62601.1| putative protein [Arabidopsis thaliana]
gi|10176723|dbj|BAB09953.1| unnamed protein product [Arabidopsis thaliana]
Length = 389
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 157/325 (48%), Positives = 227/325 (69%), Gaps = 6/325 (1%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC +EIDYL+DQL R++EV L+EH+H LE KL + L+++V L +EL S SE L
Sbjct: 68 LNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELCMSKSEHL 127
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+SKE L+ S+L ++KLEE+ISS LES CEIES+K+D+ ALEQ +A K+ +
Sbjct: 128 LLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALFDAMKIQE 187
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QEK ++ +I+E + ++Q ++E ++ ++K+N++L+EK + E + + F Q +E +E
Sbjct: 188 ESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQSTKERLE 247
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
ED + L+ +EL VS E C F A++ KL L + NL +K++GM Q
Sbjct: 248 SEDEQPLNAMCFFAELSHVLPVSNEVRNC---FDAIMKKLE--LSQNVNLIDKVEGMGKQ 302
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I ++E +VKQLKEEL+ EK KAKEEAEDL QEMAELRY+MT LL+EE RR CIEQASLQ
Sbjct: 303 IHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELRYKMTCLLDEERNRRVCIEQASLQ 362
Query: 301 RIAELETQIEKGQNKFVATGRHLPI 325
RI+ELE QI++ K A+ LP+
Sbjct: 363 RISELEAQIKRDVKK-PASNEMLPL 386
>gi|222424375|dbj|BAH20143.1| AT5G07890 [Arabidopsis thaliana]
Length = 409
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 226/325 (69%), Gaps = 6/325 (1%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC +EIDYL+DQL R++EV L+EH+H LE KL + L+++V L +EL S SE L
Sbjct: 88 LNCYKEIDYLRDQLIFRSKEVNYLNEHLHDLEFKLAESRNLEEEVNSLRDELCMSKSEHL 147
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+SKE L+ S+L ++KLEE+ISS LES CEIES+K+D+ ALEQ +A K+ +
Sbjct: 148 LLLQELESKEIELQCSSLTLEKLEETISSLTLESLCEIESMKLDITALEQALFDAMKIQE 207
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QEK ++ +I+E + ++Q ++E ++ ++K+N++L+EK + E + + F Q +E +E
Sbjct: 208 ESIQEKDQLKGIIEESQFQSQRAKENVKYIEKQNEDLREKFTASEKSIKDFFQSTKERLE 267
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
ED + L+ +EL VS E C F A++ KL L + NL +K++GM Q
Sbjct: 268 SEDEQPLNAMCFFAELSHVLPVSNEVRNC---FDAIMKKLE--LSQNVNLIDKVEGMGKQ 322
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I ++E +VKQLKEEL+ EK KAKEEAEDL QEMAEL Y+MT LL+EE RR CIEQASLQ
Sbjct: 323 IHQHEDVVKQLKEELKQEKLKAKEEAEDLTQEMAELGYKMTCLLDEERNRRVCIEQASLQ 382
Query: 301 RIAELETQIEKGQNKFVATGRHLPI 325
RI+ELE QI++ K A+ LP+
Sbjct: 383 RISELEAQIKRDVKK-PASNEMLPL 406
>gi|297793677|ref|XP_002864723.1| hypothetical protein ARALYDRAFT_332363 [Arabidopsis lyrata subsp.
lyrata]
gi|297310558|gb|EFH40982.1| hypothetical protein ARALYDRAFT_332363 [Arabidopsis lyrata subsp.
lyrata]
Length = 374
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 153/309 (49%), Positives = 210/309 (67%), Gaps = 22/309 (7%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC QEIDYL+DQ+N R +E+ LSEHV LE+K+ + L+++V L EEL S SE L
Sbjct: 69 LNCYQEIDYLRDQVNFRGQEMNDLSEHVLDLEVKVNESGRLEEEVNYLREELCTSKSEQL 128
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+S E L+ S ++KLEESISS LESQCEIES+K+D+ ALEQ +A K
Sbjct: 129 LLLQELESAETELQLSLFSVEKLEESISSLTLESQCEIESMKLDIAALEQALFDAHKFQG 188
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QE ++ ++KEL++++Q+++E ECL+K+NK+L E+ + E N + CQ +E +E
Sbjct: 189 ESIQENDKLREVVKELQLKSQEAEENAECLEKQNKKLMERCVASERNIKELCQSFKERLE 248
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
E V+ E C F ++ KL + D L++K++ M+ Q
Sbjct: 249 SEGEA---------------AVNAEEC-----FHEIIKKLE--VSRDVKLRDKMEDMARQ 286
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I +Y+ LVKQLK+EL+ EK KAKEEAEDL QEMAELRY+MT LLEEECKRRACIEQASLQ
Sbjct: 287 ILQYKDLVKQLKDELKEEKLKAKEEAEDLTQEMAELRYKMTCLLEEECKRRACIEQASLQ 346
Query: 301 RIAELETQI 309
RIA LE Q+
Sbjct: 347 RIANLEAQV 355
>gi|242050784|ref|XP_002463136.1| hypothetical protein SORBIDRAFT_02g038360 [Sorghum bicolor]
gi|241926513|gb|EER99657.1| hypothetical protein SORBIDRAFT_02g038360 [Sorghum bicolor]
Length = 559
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 198/325 (60%), Gaps = 46/325 (14%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
MNC QEIDYLQDQLN R+ E + EH+HSLELKL ++E ++V ++ EL RSDS+C
Sbjct: 188 MNCYQEIDYLQDQLNIRSVEANIMGEHIHSLELKLTELEKFPERVRLMDNELIRSDSQCW 247
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LLMEE++ KEE L+ +A I+KLE S+AL+SQCEIESLK+D+ LEQ ++A+ +
Sbjct: 248 LLMEEVRCKEEELQKAAAQIEKLE----STALDSQCEIESLKLDLTNLEQRLLDAESFTQ 303
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLD--SYETNGRVFCQKIEEW 178
+ KA++ L+ E E++ ++Q+ I+ L ENK+LKE L + + + Q++++
Sbjct: 304 HAAEHKAQIEKLLGEHELQLHEAQKTIDQLVLENKQLKELLPVRAPKQSPSRSGQQVDKT 363
Query: 179 MEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMS 238
+E R + C G V ++ M+
Sbjct: 364 LENGVRAE--------------------CESGDVI--------------------LEKMA 383
Query: 239 LQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQAS 298
+ E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CIEQA+
Sbjct: 384 KRNEESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCIEQAA 443
Query: 299 LQRIAELETQIEKGQNKFVATGRHL 323
+Q I ELETQ+ K + K R L
Sbjct: 444 IQHIQELETQVSKEKTKLSGALRRL 468
>gi|334188541|ref|NP_001190585.1| uncharacterized protein [Arabidopsis thaliana]
gi|332010052|gb|AED97435.1| uncharacterized protein [Arabidopsis thaliana]
Length = 389
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 211/324 (65%), Gaps = 23/324 (7%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC QEIDYL+DQ+N R++E+ LSEHV LE+++ L+++V L EEL S SE L
Sbjct: 87 LNCYQEIDYLRDQVNFRSQEMNDLSEHVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQL 146
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+S E L+ S ++KLEES+SS LESQCEIES+K+D++ALEQ +A+K
Sbjct: 147 LLLQELESTETELQFSLFSVEKLEESVSSLTLESQCEIESIKLDIVALEQALFDAQKFQG 206
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QE ++ ++KEL + +++++E ECL+K+NKEL E+ + E N + Q +E
Sbjct: 207 ESIQENDKLREIVKELRLNSREAEENAECLEKQNKELMERCVASERNIKDLRQSFRGRLE 266
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
E E F ++ KL + D L++K++ M+ Q
Sbjct: 267 SE---------------------SEAPVNPDCFHDIIKKLEVF--QDGKLRDKMEDMARQ 303
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I +Y+ LVKQLK+EL+ EK KAKEEAEDL QEMAELRY+MT LLEEECKRRACIEQASLQ
Sbjct: 304 ILQYKDLVKQLKDELKEEKLKAKEEAEDLTQEMAELRYEMTCLLEEECKRRACIEQASLQ 363
Query: 301 RIAELETQIEKGQNKFVATGRHLP 324
RIA LE QI++ +NK LP
Sbjct: 364 RIANLEAQIKREKNKSSTCLVPLP 387
>gi|125559058|gb|EAZ04594.1| hypothetical protein OsI_26744 [Oryza sativa Indica Group]
Length = 454
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 196/322 (60%), Gaps = 44/322 (13%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLL 61
NC QEIDYLQDQLN RN E + EH+HSLELKL ++E ++V +++EL RSDS+C L
Sbjct: 92 NCYQEIDYLQDQLNIRNVEANIMGEHIHSLELKLTELEKFPERVRVIDDELMRSDSQCWL 151
Query: 62 LMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKE 121
LMEE++ +EE+L+ +AL I+KLE + L+SQCEIESLK+D+ LEQ +A +
Sbjct: 152 LMEEVRCQEEKLKKAALQIEKLE----NVNLDSQCEIESLKLDLTTLEQRLFDADSFGQH 207
Query: 122 NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEK 181
+KA ++ ++E E++ Q++ + I+ L ENKELK R+F +
Sbjct: 208 VSADKAIADNKLREYELQLQEAHKTIDHLLLENKELK----------RLFPGGV------ 251
Query: 182 EDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQI 241
+L S+ + + T+ K + G + +L + M+ +
Sbjct: 252 -------ATALTSDEQVDKTIEK-------IDGQYYERGGAIL----------ENMAKRS 287
Query: 242 CEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQR 301
E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CIEQA++Q+
Sbjct: 288 EESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCIEQAAIQQ 347
Query: 302 IAELETQIEKGQNKFVATGRHL 323
I ELE Q+ K Q K R L
Sbjct: 348 IQELEAQVSKEQRKLSGALRKL 369
>gi|125583497|gb|EAZ24428.1| hypothetical protein OsJ_08181 [Oryza sativa Japonica Group]
Length = 462
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 198/325 (60%), Gaps = 50/325 (15%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLL 61
NC QEIDYLQDQLN RN E + EH+HSLELKL ++E ++V +++EL RSDS+C L
Sbjct: 100 NCYQEIDYLQDQLNIRNVEANIMGEHIHSLELKLTELEKFPERVRVIDDELMRSDSQCWL 159
Query: 62 LMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKE 121
LMEE++ +EE+L+ +AL I+KLE + L+SQCEIESLK+D+ LEQ +A +
Sbjct: 160 LMEEVRCQEEKLKKAALQIEKLE----NVNLDSQCEIESLKLDLTTLEQRLFDADSFGQH 215
Query: 122 NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEK 181
+KA ++ ++E E++ Q++ + I+ L ENKELK R+F +
Sbjct: 216 VSADKAIADNKLREYELQLQEAHKTIDHLLLENKELK----------RLFPGGV------ 259
Query: 182 EDRKQLDIQSLVSELERNFTVSKETCFCGKVF---GALLSKLALVLGPDANLKEKIKGMS 238
+L S+ + + T+ K G+ + GA+L + M+
Sbjct: 260 -------ATALTSDEQVDKTIEK---IDGQYYERGGAIL-----------------ENMA 292
Query: 239 LQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQAS 298
+ E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CIEQA+
Sbjct: 293 KRSEESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCIEQAA 352
Query: 299 LQRIAELETQIEKGQNKFVATGRHL 323
+Q+I ELE Q+ K Q K R L
Sbjct: 353 IQQIQELEAQVSKEQRKLSGALRKL 377
>gi|115473171|ref|NP_001060184.1| Os07g0598700 [Oryza sativa Japonica Group]
gi|113611720|dbj|BAF22098.1| Os07g0598700 [Oryza sativa Japonica Group]
Length = 454
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 196/322 (60%), Gaps = 44/322 (13%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLL 61
NC QEIDYLQDQLN RN E + EH+HSLELKL ++E ++V +++EL RSDS+C L
Sbjct: 92 NCYQEIDYLQDQLNIRNVEANIMGEHIHSLELKLTELEKFPERVRVIDDELMRSDSQCWL 151
Query: 62 LMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKE 121
LMEE++ +EE+L+ +AL I+KLE + L+SQCEIESLK+D+ LEQ +A +
Sbjct: 152 LMEEVRCQEEKLKKAALQIEKLE----NVNLDSQCEIESLKLDLTTLEQRLFDADSFGQH 207
Query: 122 NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEK 181
+KA ++ ++E E++ Q++ + I+ L ENKELK R+F +
Sbjct: 208 VSADKAIADNKLREYELQLQEAHKTIDHLLLENKELK----------RLFPGGV------ 251
Query: 182 EDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQI 241
+L S+ + + T+ K + G + +L + M+ +
Sbjct: 252 -------ATALTSDEQVDKTIEK-------IDGQYYERGGAIL----------ENMAKRS 287
Query: 242 CEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQR 301
E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CIEQA++Q+
Sbjct: 288 EESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCIEQAAIQQ 347
Query: 302 IAELETQIEKGQNKFVATGRHL 323
I ELE Q+ K Q K R L
Sbjct: 348 IQELEAQVSKEQRKLSGALRKL 369
>gi|34393592|dbj|BAC83219.1| hypothetical protein [Oryza sativa Japonica Group]
gi|50508110|dbj|BAD30356.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 418
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 196/322 (60%), Gaps = 44/322 (13%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLL 61
NC QEIDYLQDQLN RN E + EH+HSLELKL ++E ++V +++EL RSDS+C L
Sbjct: 56 NCYQEIDYLQDQLNIRNVEANIMGEHIHSLELKLTELEKFPERVRVIDDELMRSDSQCWL 115
Query: 62 LMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKE 121
LMEE++ +EE+L+ +AL I+KLE + L+SQCEIESLK+D+ LEQ +A +
Sbjct: 116 LMEEVRCQEEKLKKAALQIEKLE----NVNLDSQCEIESLKLDLTTLEQRLFDADSFGQH 171
Query: 122 NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEK 181
+KA ++ ++E E++ Q++ + I+ L ENKELK R+F +
Sbjct: 172 VSADKAIADNKLREYELQLQEAHKTIDHLLLENKELK----------RLFPGGV------ 215
Query: 182 EDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQI 241
+L S+ + + T+ K + G + +L + M+ +
Sbjct: 216 -------ATALTSDEQVDKTIEK-------IDGQYYERGGAIL----------ENMAKRS 251
Query: 242 CEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQR 301
E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CIEQA++Q+
Sbjct: 252 EESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCIEQAAIQQ 311
Query: 302 IAELETQIEKGQNKFVATGRHL 323
I ELE Q+ K Q K R L
Sbjct: 312 IQELEAQVSKEQRKLSGALRKL 333
>gi|414590751|tpg|DAA41322.1| TPA: hypothetical protein ZEAMMB73_176381 [Zea mays]
Length = 421
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 193/329 (58%), Gaps = 54/329 (16%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
MNC QEIDYLQDQLN R+ E + EH+HSLELKL ++E ++V ++ EL RSDS+C
Sbjct: 50 MNCYQEIDYLQDQLNIRSVEANFMGEHIHSLELKLTELEKFPERVRAMDNELIRSDSQCW 109
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LLMEE++ KEE L+ +A I+KLE S+AL+SQCEIESLK+D+ LEQ +A++ +
Sbjct: 110 LLMEEVRCKEEELQKAASQIEKLE----STALDSQCEIESLKLDLTNLEQRLFDAERFSQ 165
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLD------SYETNGRVFCQK 174
+ KA+ + L+ E E++ ++Q+ I L ENK+LKE L S +G +
Sbjct: 166 HAGEHKAQFDKLLGEHELQLHEAQKTIHQLVWENKQLKELLPVRAPKQSPPGSGWKVNKT 225
Query: 175 IEEWMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKI 234
+E + E C G V +
Sbjct: 226 LENGVHAE------------------------CESGDVI--------------------L 241
Query: 235 KGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACI 294
+ M+ + E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CI
Sbjct: 242 ENMAKRNEESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCI 301
Query: 295 EQASLQRIAELETQIEKGQNKFVATGRHL 323
EQA+++ I ELETQ+ K + K + L
Sbjct: 302 EQAAIKHIQELETQVSKEKTKLSGALKRL 330
>gi|195624198|gb|ACG33929.1| hypothetical protein [Zea mays]
Length = 424
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 193/329 (58%), Gaps = 54/329 (16%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
MNC QEIDYLQDQLN R+ E + EH+HSLELKL ++E ++V ++ EL RSDS+C
Sbjct: 53 MNCYQEIDYLQDQLNIRSVEANFMGEHIHSLELKLTELEKFPERVRAMDNELIRSDSQCW 112
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LLMEE++ KEE L+ +A I+KLE S+AL+SQCEIESLK+D+ LEQ +A++ +
Sbjct: 113 LLMEEVRCKEEELQKAASQIEKLE----STALDSQCEIESLKLDLTNLEQRLFDAERFSQ 168
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLD------SYETNGRVFCQK 174
+ KA+ + L+ E E++ ++Q+ I L ENK+LKE L S +G +
Sbjct: 169 HAGEHKAQFDKLLGEHELQLHEAQKTIHQLVWENKQLKELLPVRAPKQSPPGSGWKVNKT 228
Query: 175 IEEWMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKI 234
+E + E C G V +
Sbjct: 229 LENGVHAE------------------------CESGDVI--------------------L 244
Query: 235 KGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACI 294
+ M+ + E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CI
Sbjct: 245 ENMAKRNEESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCI 304
Query: 295 EQASLQRIAELETQIEKGQNKFVATGRHL 323
EQA+++ I ELETQ+ K + K + L
Sbjct: 305 EQAAIKHIQELETQVSKEKTKLSGALKRL 333
>gi|212721560|ref|NP_001132291.1| uncharacterized protein LOC100193731 [Zea mays]
gi|194693990|gb|ACF81079.1| unknown [Zea mays]
gi|194705186|gb|ACF86677.1| unknown [Zea mays]
gi|414590752|tpg|DAA41323.1| TPA: hypothetical protein ZEAMMB73_176381 [Zea mays]
gi|414590753|tpg|DAA41324.1| TPA: hypothetical protein ZEAMMB73_176381 [Zea mays]
gi|414590754|tpg|DAA41325.1| TPA: hypothetical protein ZEAMMB73_176381 [Zea mays]
Length = 424
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 193/329 (58%), Gaps = 54/329 (16%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
MNC QEIDYLQDQLN R+ E + EH+HSLELKL ++E ++V ++ EL RSDS+C
Sbjct: 53 MNCYQEIDYLQDQLNIRSVEANFMGEHIHSLELKLTELEKFPERVRAMDNELIRSDSQCW 112
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LLMEE++ KEE L+ +A I+KLE S+AL+SQCEIESLK+D+ LEQ +A++ +
Sbjct: 113 LLMEEVRCKEEELQKAASQIEKLE----STALDSQCEIESLKLDLTNLEQRLFDAERFSQ 168
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLD------SYETNGRVFCQK 174
+ KA+ + L+ E E++ ++Q+ I L ENK+LKE L S +G +
Sbjct: 169 HAGEHKAQFDKLLGEHELQLHEAQKTIHQLVWENKQLKELLPVRAPKQSPPGSGWKVNKT 228
Query: 175 IEEWMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKI 234
+E + E C G V +
Sbjct: 229 LENGVHAE------------------------CESGDVI--------------------L 244
Query: 235 KGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACI 294
+ M+ + E ELL++QLKEELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CI
Sbjct: 245 ENMAKRNEESELLIEQLKEELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCI 304
Query: 295 EQASLQRIAELETQIEKGQNKFVATGRHL 323
EQA+++ I ELETQ+ K + K + L
Sbjct: 305 EQAAIKHIQELETQVSKEKTKLSGALKRL 333
>gi|9759466|dbj|BAB10382.1| unnamed protein product [Arabidopsis thaliana]
Length = 381
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 150/325 (46%), Positives = 209/325 (64%), Gaps = 31/325 (9%)
Query: 1 MNCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECL 60
+NC QEIDYL+DQ+N R++E+ LSEHV LE+++ L+++V L EEL S SE L
Sbjct: 67 LNCYQEIDYLRDQVNFRSQEMNDLSEHVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQL 126
Query: 61 LLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHK 120
LL++EL+S E L+ S ++KLEES+SS LESQCEIES+K+D++ALEQ +A+K
Sbjct: 127 LLLQELESTETELQFSLFSVEKLEESVSSLTLESQCEIESIKLDIVALEQALFDAQKFQG 186
Query: 121 ENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWME 180
E++QE ++ ++KEL + +++++E ECL+K+NKEL E+ + E N + Q +E
Sbjct: 187 ESIQENDKLREIVKELRLNSREAEENAECLEKQNKELMERCVASERNIKDLRQSFRGRLE 246
Query: 181 KEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQ 240
E ++ F ++ KL + D L++K++ M+ Q
Sbjct: 247 SESEAPVN---------------------PDCFHDIIKKLEVF--QDGKLRDKMEDMARQ 283
Query: 241 ICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQ 300
I +Y+ LVKQLK+EL+ EK KAKEEAEDL QEMAELRY+MT LLEEECKRRACIEQASLQ
Sbjct: 284 ILQYKDLVKQLKDELKEEKLKAKEEAEDLTQEMAELRYEMTCLLEEECKRRACIEQASLQ 343
Query: 301 RIAELETQIEKGQNKFVATGRHLPI 325
RIA LE Q V +LPI
Sbjct: 344 RIANLEAQ--------VLASLYLPI 360
>gi|357122107|ref|XP_003562757.1| PREDICTED: uncharacterized protein LOC100846178 [Brachypodium
distachyon]
Length = 434
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 193/327 (59%), Gaps = 51/327 (15%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLL 61
NC QEIDYLQDQLN RN E + EH+H LELKL ++E ++V ++ +L RSDS+C L
Sbjct: 49 NCYQEIDYLQDQLNIRNIEANIMGEHIHGLELKLTELEKFPERVRVMDNDLMRSDSQCWL 108
Query: 62 LMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKE 121
LMEE+Q KEE L+ +AL I+KLE S+ L+SQCEIESLK+D+ LEQ +A+ +
Sbjct: 109 LMEEVQCKEEELQKAALQIEKLE----SATLDSQCEIESLKLDLTTLEQKLFDAESFGQH 164
Query: 122 NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEK 181
V+ KARM + + E++ Q +Q I+ L+ E K+L E+L S
Sbjct: 165 TVEFKARMEKQLWDYELQLQAAQNTIDNLELEKKQLTEELLS------------------ 206
Query: 182 EDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLSKLALVLGPDANLK-----EKIKG 236
R+ L + S +E E+ + S G D N E ++
Sbjct: 207 --RRALKLSSSTAE-EQLYKTS---------------------GHDGNANCEEDHEILEK 242
Query: 237 MSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQ 296
M+ Q E ELL++QLK ELR +K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CIEQ
Sbjct: 243 MAKQNEEPELLIEQLKVELREQKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCIEQ 302
Query: 297 ASLQRIAELETQIEKGQNKFVATGRHL 323
A++Q+I +LE QI Q K R L
Sbjct: 303 AAIQQIQQLEAQISGEQRKLSGALRRL 329
>gi|145359517|ref|NP_200928.2| uncharacterized protein [Arabidopsis thaliana]
gi|71905623|gb|AAZ52789.1| hypothetical protein At5g61200 [Arabidopsis thaliana]
gi|332010050|gb|AED97433.1| uncharacterized protein [Arabidopsis thaliana]
Length = 283
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 138/301 (45%), Positives = 193/301 (64%), Gaps = 23/301 (7%)
Query: 24 LSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKL 83
LSEHV LE+++ L+++V L EEL S SE LLL++EL+S E L+ S ++KL
Sbjct: 4 LSEHVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQLLLLQELESTETELQFSLFSVEKL 63
Query: 84 EESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDS 143
EES+SS LESQCEIES+K+D++ALEQ +A+K E++QE ++ ++KEL + ++++
Sbjct: 64 EESVSSLTLESQCEIESIKLDIVALEQALFDAQKFQGESIQENDKLREIVKELRLNSREA 123
Query: 144 QEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEKEDRKQLDIQSLVSELERNFTVS 203
+E ECL+K+NKEL E+ + E N + Q +E E ++
Sbjct: 124 EENAECLEKQNKELMERCVASERNIKDLRQSFRGRLESESEAPVN--------------- 168
Query: 204 KETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAK 263
F ++ KL + D L++K++ M+ QI +Y+ LVKQLK+EL+ EK KAK
Sbjct: 169 ------PDCFHDIIKKLEVF--QDGKLRDKMEDMARQILQYKDLVKQLKDELKEEKLKAK 220
Query: 264 EEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQRIAELETQIEKGQNKFVATGRHL 323
EEAEDL QEMAELRY+MT LLEEECKRRACIEQASLQRIA LE QI++ +NK L
Sbjct: 221 EEAEDLTQEMAELRYEMTCLLEEECKRRACIEQASLQRIANLEAQIKREKNKSSTCLVPL 280
Query: 324 P 324
P
Sbjct: 281 P 281
>gi|414590750|tpg|DAA41321.1| TPA: hypothetical protein ZEAMMB73_176381 [Zea mays]
Length = 470
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 123/306 (40%), Positives = 176/306 (57%), Gaps = 54/306 (17%)
Query: 24 LSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKL 83
+ EH+HSLELKL ++E ++V ++ EL RSDS+C LLMEE++ KEE L+ +A I+KL
Sbjct: 1 MGEHIHSLELKLTELEKFPERVRAMDNELIRSDSQCWLLMEEVRCKEEELQKAASQIEKL 60
Query: 84 EESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDS 143
E S+AL+SQCEIESLK+D+ LEQ +A++ + + KA+ + L+ E E++ ++
Sbjct: 61 E----STALDSQCEIESLKLDLTNLEQRLFDAERFSQHAGEHKAQFDKLLGEHELQLHEA 116
Query: 144 QEIIECLDKENKELKEKLD------SYETNGRVFCQKIEEWMEKEDRKQLDIQSLVSELE 197
Q+ I L ENK+LKE L S +G + +E + E
Sbjct: 117 QKTIHQLVWENKQLKELLPVRAPKQSPPGSGWKVNKTLENGVHAE--------------- 161
Query: 198 RNFTVSKETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQICEYELLVKQLKEELRA 257
C G V ++ M+ + E ELL++QLKEELR
Sbjct: 162 ---------CESGDVI--------------------LENMAKRNEESELLIEQLKEELRE 192
Query: 258 EKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQRIAELETQIEKGQNKFV 317
+K KAKE+AEDL QEMAELRYQ+T +LEEE KRR+CIEQA+++ I ELETQ+ K + K
Sbjct: 193 QKLKAKEDAEDLTQEMAELRYQITGMLEEEYKRRSCIEQAAIKHIQELETQVSKEKTKLS 252
Query: 318 ATGRHL 323
+ L
Sbjct: 253 GALKRL 258
>gi|186532628|ref|NP_001119469.1| uncharacterized protein [Arabidopsis thaliana]
gi|60547975|gb|AAX23951.1| hypothetical protein At5g61200 [Arabidopsis thaliana]
gi|71905625|gb|AAZ52790.1| hypothetical protein At5g61200 [Arabidopsis thaliana]
gi|332010051|gb|AED97434.1| uncharacterized protein [Arabidopsis thaliana]
Length = 295
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 137/302 (45%), Positives = 190/302 (62%), Gaps = 31/302 (10%)
Query: 24 LSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKL 83
LSEHV LE+++ L+++V L EEL S SE LLL++EL+S E L+ S ++KL
Sbjct: 4 LSEHVLDLEVRVTKSGKLEEEVNYLREELCSSKSEQLLLLQELESTETELQFSLFSVEKL 63
Query: 84 EESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDS 143
EES+SS LESQCEIES+K+D++ALEQ +A+K E++QE ++ ++KEL + ++++
Sbjct: 64 EESVSSLTLESQCEIESIKLDIVALEQALFDAQKFQGESIQENDKLREIVKELRLNSREA 123
Query: 144 QEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEKEDRKQLDIQSLVSELERNFTVS 203
+E ECL+K+NKEL E+ + E N + Q +E E ++
Sbjct: 124 EENAECLEKQNKELMERCVASERNIKDLRQSFRGRLESESEAPVN--------------- 168
Query: 204 KETCFCGKVFGALLSKLALVLGPDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAK 263
F ++ KL + D L++K++ M+ QI +Y+ LVKQLK+EL+ EK KAK
Sbjct: 169 ------PDCFHDIIKKLEVF--QDGKLRDKMEDMARQILQYKDLVKQLKDELKEEKLKAK 220
Query: 264 EEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQRIAELETQIEKGQNKFVATGRHL 323
EEAEDL QEMAELRY+MT LLEEECKRRACIEQASLQRIA LE Q V +L
Sbjct: 221 EEAEDLTQEMAELRYEMTCLLEEECKRRACIEQASLQRIANLEAQ--------VLASLYL 272
Query: 324 PI 325
PI
Sbjct: 273 PI 274
>gi|168051076|ref|XP_001777982.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670630|gb|EDQ57195.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 598
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/69 (56%), Positives = 52/69 (75%)
Query: 249 KQLKEELRAEKFKAKEEAEDLAQEMAELRYQMTSLLEEECKRRACIEQASLQRIAELETQ 308
K LKEE+ EK KAKEEAEDL QEMAELRYQ+ ++E+E + RA EQAS+ R+ ELE+Q
Sbjct: 233 KNLKEEVTTEKGKAKEEAEDLTQEMAELRYQLMEMIEQERELRAQAEQASVLRVVELESQ 292
Query: 309 IEKGQNKFV 317
++ + + V
Sbjct: 293 VKNARQEAV 301
>gi|353242238|emb|CCA73898.1| hypothetical protein PIIN_07851 [Piriformospora indica DSM 11827]
Length = 2056
Score = 48.5 bits (114), Expect = 0.004, Method: Composition-based stats.
Identities = 67/329 (20%), Positives = 151/329 (45%), Gaps = 40/329 (12%)
Query: 5 QEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLLLME 64
+E+ L + + + + +L + S L D+ LQ++ L+ +L ++ S
Sbjct: 1619 KEVSILPHEFGSTQDAIRALHSELASKLRTLPDIIELQNQRHDLQVQLSKARSR-----P 1673
Query: 65 ELQSKEERLR------------------NSALHIKKLEESISSSAL--ESQCEIESLKID 104
+++ ERLR + + ++ E+++ S L E+ + + +
Sbjct: 1674 RTKAERERLRMEVQATQGLVAAKDAELAAAGVKTEQAEKALQQSLLRIETSEAVSKMVKE 1733
Query: 105 MIALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKE----LKEK 160
IA +T +++ K N + +A+++SL ++ +D + E L + +E L+++
Sbjct: 1734 QIARLETA--NRELQKTNNERQAKIDSLELQMTFANRDKETAKEALARVEQERDATLEQQ 1791
Query: 161 LDSYETNGRVFCQKIEEWMEKEDRKQLD-IQSLVSELERNFTVSKETCFCGKVFGALLSK 219
D++ N ++ QK++ M+ K+ D ++ L + +++ + E K L SK
Sbjct: 1792 QDAWREN-KLMSQKLDSLMQLLMSKESDELRELRRQRDKSKVMEAELSAAKKRTAELESK 1850
Query: 220 LALVLGPDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRY- 278
L L++ +A + ++ Q+ EYE V++L++EL + D AQ+ + +
Sbjct: 1851 LELLVRSEAKTNQSLEDSRRQVDEYEDKVEKLEKELEPLR------RLDTAQKTRDREFE 1904
Query: 279 QMTSLLEEECKRRACIEQASLQRIAELET 307
Q+ S L+ + K+ + Q ++ EL T
Sbjct: 1905 QIRSQLQHQEKQENHLRQTNIMLEEELTT 1933
>gi|375092330|ref|ZP_09738611.1| hypothetical protein HMPREF9709_01473 [Helcococcus kunzii ATCC 51366]
gi|374561195|gb|EHR32542.1| hypothetical protein HMPREF9709_01473 [Helcococcus kunzii ATCC 51366]
Length = 1864
Score = 43.9 bits (102), Expect = 0.096, Method: Composition-based stats.
Identities = 53/248 (21%), Positives = 109/248 (43%), Gaps = 53/248 (21%)
Query: 39 EILQDKVGQLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEI 98
E L+++V +L+++L + E +EL SKE L S I +LE+S+ ++
Sbjct: 1469 EKLKEEVEKLKQDLAEKEKELAEKQKELDSKETELTESKDKISELEKSLEAAN------- 1521
Query: 99 ESLKIDMIALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELK 158
QE A++ I L+ + + ++ L+KE + K
Sbjct: 1522 -------------------------QEIAKLKEEINSLKEKVKALEDEKAALEKEIADTK 1556
Query: 159 EKLDSYETNGRVFCQKIEEWMEKEDRKQLDIQSLVSELERNFTVSKETCFCGKVFGALLS 218
+LD + +++E +E + + +++V+EL + F L +
Sbjct: 1557 AELDKAK-------KELENILEDPESEVAKARAVVAELTKQFE-------------ELTA 1596
Query: 219 KLALVLGPDANLKEKIKGMSLQICEYELLVKQLKEELRAEKFKAKEEAEDLAQEMAELRY 278
+ A V EK+K + ++ E E VK KE++ +K +A+++ + +E+++L+
Sbjct: 1597 QKAQVEQELKEKTEKVKSLEAKVSELEQEVKD-KEQIEKDKKEAEDKVVEKEKEISDLQK 1655
Query: 279 QMTSLLEE 286
+ L EE
Sbjct: 1656 EEARLKEE 1663
Score = 43.9 bits (102), Expect = 0.10, Method: Composition-based stats.
Identities = 41/201 (20%), Positives = 98/201 (48%), Gaps = 11/201 (5%)
Query: 5 QEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLLLME 64
+E++ L+ L + +E L+E L+ K ++ +DK+ +LE+ L ++ E L E
Sbjct: 1473 EEVEKLKQDLAEKEKE---LAEKQKELDSKETELTESKDKISELEKSLEAANQEIAKLKE 1529
Query: 65 ELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKENVQ 124
E+ S +E+++ LE+ I+ + E + L+ + E +A+ V E +
Sbjct: 1530 EINSLKEKVKALEDEKAALEKEIADTKAELDKAKKELENILEDPESEVAKARAVVAELTK 1589
Query: 125 EKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRVFCQKIEEWMEKEDR 184
+ + + ++E ++ E ++ L+ + EL++++ E +IE+ ++ +
Sbjct: 1590 QFEELTAQKAQVEQELKEKTEKVKSLEAKVSELEQEVKDKE--------QIEKDKKEAED 1641
Query: 185 KQLDIQSLVSELERNFTVSKE 205
K ++ + +S+L++ KE
Sbjct: 1642 KVVEKEKEISDLQKEEARLKE 1662
Score = 43.5 bits (101), Expect = 0.14, Method: Composition-based stats.
Identities = 37/160 (23%), Positives = 80/160 (50%), Gaps = 14/160 (8%)
Query: 4 SQEIDYLQDQLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLLLM 63
+QEI L++++N+ E+V +L + +LE ++ D + EL ++ E ++
Sbjct: 1521 NQEIAKLKEEINSLKEKVKALEDEKAALEKEIADT----------KAELDKAKKELENIL 1570
Query: 64 EELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAKKVHKENV 123
E+ +S+ + R + K E +++ + + E++ + +LE E ++ V
Sbjct: 1571 EDPESEVAKARAVVAELTKQFEELTAQKAQVEQELKEKTEKVKSLEAKVSEL----EQEV 1626
Query: 124 QEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDS 163
++K ++ KE E + + ++ I L KE LKE+L+S
Sbjct: 1627 KDKEQIEKDKKEAEDKVVEKEKEISDLQKEEARLKEELES 1666
>gi|301754611|ref|XP_002913131.1| PREDICTED: CAP-Gly domain-containing linker protein 1-like isoform
1 [Ailuropoda melanoleuca]
Length = 1427
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 99/211 (46%), Gaps = 39/211 (18%)
Query: 20 EVYSLSEHVHSLELKLV------DMEILQ--DKVGQLEEELRRSDSECLLLMEELQSKEE 71
EV + HV +E +L D +L+ K+ QL + +D E + L+ +L+ EE
Sbjct: 377 EVAKATSHVGEVEQELALARDGHDQHVLELEAKMDQLRTMVEAADREKVELLNQLE--EE 434
Query: 72 RLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTC----VEAKKVHKE------ 121
+ + L + EESI+ LE+Q ++E +I LEQ+ +A K+ +E
Sbjct: 435 KRKVEDLQFRVEEESITKGDLETQTKLEHARIK--ELEQSLLFEKTKADKLQRELEDTRV 492
Query: 122 -NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRV-----FCQKI 175
V EK+R+ L K+L +R Q E EL+ +L+S++ G V F Q+I
Sbjct: 493 ATVSEKSRIMELEKDLALRAQ-----------EVAELRRRLESHKPVGDVDMSLSFLQEI 541
Query: 176 EEWMEKEDRKQLDIQSLVSELERNFTVSKET 206
EK + D Q ++ L+ F +ET
Sbjct: 542 SSLQEKLEAAHADHQREITSLKEQFGAREET 572
>gi|67478732|ref|XP_654748.1| SMC4 protein [Entamoeba histolytica HM-1:IMSS]
gi|56471819|gb|EAL49361.1| SMC4 protein, putative [Entamoeba histolytica HM-1:IMSS]
gi|449702908|gb|EMD43452.1| Hypothetical protein EHI5A_167200 [Entamoeba histolytica KU27]
Length = 1226
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 88/168 (52%), Gaps = 27/168 (16%)
Query: 13 QLNARNEEVYSLSEHVHSLELKLVDMEILQDKVGQLEEELRRSDSECLLLMEELQSKEER 72
+LN +NEE+ + E +L + ++E +DK+G+ EE+ ++SE L E+ Q E+
Sbjct: 889 ELNEKNEELKKIEEEYGTLLKSIEELETEEDKIGEQIEEINGNNSE---LTEKRQRCEKE 945
Query: 73 LRNSALHIKKL-------------EESISSSALESQCEIESLKIDMIALEQTCVEAKKVH 119
+R+ HI++L E ++ + ++Q + E ++I I+L+ E K+++
Sbjct: 946 IRSIFKHIRELLHIAEIHEEDEYIENVLNHNETDNQMDEEKIRIIGISLKDVVDELKEIN 1005
Query: 120 KENV-----QEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLD 162
++ V EK ++ ++I+ + ++ II K NKE +EK D
Sbjct: 1006 RKEVLAMIEDEKKKIENMIENVNLK------IIATFIKVNKEYQEKWD 1047
>gi|73994485|ref|XP_859319.1| PREDICTED: CAP-Gly domain-containing linker protein 1 isoform 3
[Canis lupus familiaris]
Length = 1427
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 100/211 (47%), Gaps = 39/211 (18%)
Query: 20 EVYSLSEHVHSLELKLV------DMEILQ--DKVGQLEEELRRSDSECLLLMEELQSKEE 71
EV + HV +E +L D +L+ K+ QL + +D E + L+ +L+ EE
Sbjct: 377 EVAKATSHVGEIEQELALARDGHDQHVLELEAKMDQLRTMVEAADREKVELLNQLE--EE 434
Query: 72 RLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCV----EAKKVHKE------ 121
+ + L + EESI+ LE+Q ++E +I LEQ+ + +A K+ +E
Sbjct: 435 KRKVEDLQFRVEEESITKGDLETQTKLEHARIK--ELEQSLLFEKTKADKLQRELEDTRV 492
Query: 122 -NVQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETNGRV-----FCQKI 175
V EK+R+ L K+L +R Q E EL+ +L+S++ G V F Q+I
Sbjct: 493 ATVSEKSRIMELEKDLALRAQ-----------EVAELRRRLESHKPAGDVDMSLSFLQEI 541
Query: 176 EEWMEKEDRKQLDIQSLVSELERNFTVSKET 206
EK + D Q + L+ +F +ET
Sbjct: 542 SSLQEKLEAAHADHQREIISLKEHFGACEET 572
>gi|154248750|ref|YP_001409575.1| S-layer domain-containing protein [Fervidobacterium nodosum
Rt17-B1]
gi|154152686|gb|ABS59918.1| S-layer domain protein [Fervidobacterium nodosum Rt17-B1]
Length = 468
Score = 38.1 bits (87), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 87/173 (50%), Gaps = 17/173 (9%)
Query: 2 NCSQEIDYLQDQLNARNEEVYSLSEHVHS--LELKLVDMEILQDK-VGQLEEELR--RSD 56
N +++I L ++LNA V SLS+ + + LK D EI DK + L++EL + D
Sbjct: 277 NVNKDITALSERLNAVESNVASLSKAITTETTNLKNKDAEI--DKNINALKDELTNLKGD 334
Query: 57 SECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIESLKIDMIALEQTCVEAK 116
E +K + +A+ + KLEE + S A E + K+ I + + +E
Sbjct: 335 LEI--------NKGDLAELTAVKLPKLEEELKSKADEKAVSELNEKVSQIGNKLSNIEVS 386
Query: 117 KVHKEN--VQEKARMNSLIKELEVRTQDSQEIIECLDKENKELKEKLDSYETN 167
++ EN ++NS + E++ +T +QE I L EN++LK+KL +N
Sbjct: 387 VINVENKLSSSITQVNSKVSEIDGKTLKNQEEINKLKAENEDLKKKLQQTSSN 439
>gi|395538840|ref|XP_003771382.1| PREDICTED: ELKS/Rab6-interacting/CAST family member 1 isoform 3
[Sarcophilus harrisii]
Length = 1088
Score = 37.4 bits (85), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 72/145 (49%), Gaps = 24/145 (16%)
Query: 40 ILQDKVGQLEEELRRSDSECLLLMEELQSKEERLRNSALHIKKLEESISSSALESQCEIE 99
+++K+GQ+++EL R D+E L L +L++ + +S HI+ L+ES+ +A E + I
Sbjct: 435 FMKNKIGQVKQELSRKDTELLALQTKLETLTNQFSDSKQHIEVLKESL--TAKEQRAAIL 492
Query: 100 SLKIDMIALEQTCVEAKKVHKENVQEKARMNSLIKELEVRTQDSQEIIE---CLDKENKE 156
++D + L ++EK M L +T+ QEI+E E +
Sbjct: 493 QTEVDALRL-------------RLEEKETM------LNKKTKQIQEIVEEKGTQAGEIHD 533
Query: 157 LKEKLDSYETNGRVFCQKIEEWMEK 181
LK+ LD E V +KIE E+
Sbjct: 534 LKDMLDVKERKVNVLQKKIENLQEQ 558
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.312 0.129 0.340
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,311,390,851
Number of Sequences: 23463169
Number of extensions: 168025921
Number of successful extensions: 1242822
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1339
Number of HSP's successfully gapped in prelim test: 42611
Number of HSP's that attempted gapping in prelim test: 1008453
Number of HSP's gapped (non-prelim): 179699
length of query: 325
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 183
effective length of database: 9,027,425,369
effective search space: 1652018842527
effective search space used: 1652018842527
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 77 (34.3 bits)
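
Note on the statistics footer: the bit values printed in parentheses next to X1, X2, X3, S1 and S2 can be reproduced from the Lambda and K values listed above using the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2, and E = (effective search space) * 2^(-S'). The sketch below is only an illustration and is not part of the BLAST output. The function and variable names are hypothetical, and it assumes that the first Lambda/K/H row holds the ungapped parameters (used for X1 and S1) and the second row the gapped ones (used for X2, X3 and S2). Per-hit scores in the report may differ slightly from what this gives, because Lambda and K are printed at limited precision and the hits use compositional adjustment.

```python
import math

# Values copied from the footer of this report.
# Assumption: first Lambda/K row = ungapped, second row = gapped.
UNGAPPED = {"lam": 0.312, "K": 0.129}
GAPPED = {"lam": 0.267, "K": 0.0410}
SEARCH_SPACE = 1652018842527  # "effective search space used"


def bit_score(raw, params):
    """Normalized (bit) score: S' = (lambda * S - ln K) / ln 2."""
    return (params["lam"] * raw - math.log(params["K"])) / math.log(2)


def dropoff_bits(raw, params):
    """X drop-off values are score differences, so only lambda / ln 2 applies."""
    return params["lam"] * raw / math.log(2)


def expect(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits: E = search_space * 2^(-S')."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Each line pairs a recomputed value with the label used in the footer.
    print("X1", round(dropoff_bits(16, UNGAPPED), 1))
    print("X2", round(dropoff_bits(38, GAPPED), 1))
    print("X3", round(dropoff_bits(64, GAPPED), 1))
    print("S1", round(bit_score(42, UNGAPPED), 1))
    print("S2", round(bit_score(77, GAPPED), 1))
    # E-value estimate for a 70.9-bit hit (the Physcomitrella entry above).
    print("E(70.9 bits)", "%.1e" % expect(70.9))
```

The same two helper functions can be applied to any raw score in the report as a rough consistency check on the printed bit scores and E-values.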