BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011943
(474 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224087896|ref|XP_002308260.1| predicted protein [Populus trichocarpa]
gi|222854236|gb|EEE91783.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 238/477 (49%), Positives = 323/477 (67%), Gaps = 50/477 (10%)
Query: 20 KTTTDDVAVGADPNHLINGQK-----DDVRSQASEPADIQNVQVDADRH--QSNGAESAN 72
K + DDV+V A N+ NGQ D+ + S D+ + + D + H + NGA +
Sbjct: 25 KASEDDVSVAAATNNNSNGQNHGNVNDNQVIEVSSNGDVVDAE-DFNGHYDKPNGAAPHS 83
Query: 73 LAEAERQHLLQREAMAI------LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYK 126
A E LQ E ++ EET+K LR+E +SHIQKEATLE TV+QLQNE +
Sbjct: 84 AALQETIKHLQNETDSLTRTKDTFEETIKRLRDENDSHIQKEATLEETVKQLQNESASHT 143
Query: 127 EKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEK 186
+K +A+LE+TI QL+ N+L +QKEAT E+TIKQL+ +ND H+Q+E LE I LQSEK
Sbjct: 144 QK-EASLEDTINQLRSVNNLCIQKEATFEDTIKQLKTENDSHLQKEADLEKRIVQLQSEK 202
Query: 187 EFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLS 246
+FWLQKEA +K++ L+DE AAL AS+ E+++LLE+DKDSWT E+ +KETIA ++
Sbjct: 203 DFWLQKEAGFGEKLNHLQDEKAAL----ASIGEKIRLLESDKDSWTISENTTKETIARMN 258
Query: 247 VDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSS-DESKKHATS--------EQ 297
+D+T+LRMQVVELE+SRN+L++EN+QLKE++S+L+ QL + D S A + E+
Sbjct: 259 IDVTRLRMQVVELEDSRNSLVKENQQLKESISNLKLQLQNIDTSVSFANTSELGKLGAEK 318
Query: 298 KDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAV---TALT 354
++ ++QIEAA AL+DKLITEN +LVEKVN+L +KLD Q AA SSA G + + L
Sbjct: 319 EELNSQIEAACALVDKLITENADLVEKVNELYIKLDHQGTAASFSSATGRGVIVRNSELA 378
Query: 355 SDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQ 414
+ T PM++S+ N+++L ++LE+ + +P +V SSEA SGEIVQ
Sbjct: 379 NGTHPMADSNANLTALGHKLES---------------LEVEPAVVVQYSSEA-GSGEIVQ 422
Query: 415 IPLDDKEVQDLELQVVESYTD-KVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
IPLDD EV DLE+Q E TD K AVPLTDAPLIGAPFRL+SFVAKYVSG DLVN+
Sbjct: 423 IPLDDNEVPDLEMQAAE--TDYKSGAVPLTDAPLIGAPFRLISFVAKYVSGGDLVNR 477
>gi|255575616|ref|XP_002528708.1| conserved hypothetical protein [Ricinus communis]
gi|223531880|gb|EEF33697.1| conserved hypothetical protein [Ricinus communis]
Length = 492
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 243/493 (49%), Positives = 321/493 (65%), Gaps = 67/493 (13%)
Query: 18 QTKTTTDDVAVGADPNHLINGQKDDVRSQASEPADIQN------VQVDA-DRHQSNGAES 70
Q+KTT DD +H NG + +Q ++ A+IQN VD DRHQ+ G E
Sbjct: 22 QSKTTEDDAVFVGGVDHQ-NGASSNGNNQVTDLAEIQNGDVQNAANVDGFDRHQNTGTEI 80
Query: 71 ANLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQ 130
+ EAE+Q QREA I EET+K L+NE + HIQKEA L ++++LQ E D+ + K Q
Sbjct: 81 SISVEAEKQQWFQREA--IFEETIKRLQNENDLHIQKEAILRKSIKELQEENDIQRRK-Q 137
Query: 131 ATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQRE------------------ 172
LEET+++LQ ND +QKEATLEETIK+LRN ND +Q+E
Sbjct: 138 VMLEETVKELQNANDSALQKEATLEETIKELRNTNDSALQKEVKLEKTIKELRSAKDSAL 197
Query: 173 ---GGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKD 229
GL+ IA LQSE E +LQ +A LE+K+ +E+A+L +K ASLEE++K LE++KD
Sbjct: 198 QKEAGLKKKIAQLQSENELFLQTKAGLEEKLQLSLNENASLGLKEASLEEKIKQLESEKD 257
Query: 230 SWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQL----- 284
SWT E+ +KETIA ++VD+T+LRMQVVELEESRN+LL+EN+QL EN+S L+ Q+
Sbjct: 258 SWTLAENTTKETIARMNVDVTRLRMQVVELEESRNSLLKENQQLMENISDLQLQVQNLET 317
Query: 285 ------SSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVA 338
+S E KKHA E +D +QIEAA AL+DKLITEN ELVEKVN+L +KLD+QS
Sbjct: 318 NISSASTSCEQKKHAL-EGEDLESQIEAACALVDKLITENAELVEKVNELYIKLDQQSTT 376
Query: 339 AGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPP 398
A SS G+ +T + +D+ P +S +NMS + ++LE+ + V+ AD
Sbjct: 377 ATASS--GNAEITYI-ADSIP--QSDENMSVVVHKLES---LEVEHAAALPYAAEAD--- 425
Query: 399 LVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFV 458
SGEIVQIPLDD E++DLELQ VES DK AVP++DAPLIGAPFRL+SFV
Sbjct: 426 ----------SGEIVQIPLDDNELRDLELQAVES--DKTEAVPISDAPLIGAPFRLISFV 473
Query: 459 AKYVSGADLVNKN 471
AKYVSGADLV+K+
Sbjct: 474 AKYVSGADLVDKD 486
>gi|224139390|ref|XP_002323089.1| predicted protein [Populus trichocarpa]
gi|222867719|gb|EEF04850.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 218/437 (49%), Positives = 286/437 (65%), Gaps = 60/437 (13%)
Query: 52 DIQNVQVDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATL 111
D+++ D+ +S+ LAEAE+Q LQREA EET+K L +E +SHIQKE T
Sbjct: 66 DVEDFNTHNDKPNGVAPQSSILAEAEKQWWLQREAT--FEETIKQLEDENDSHIQKEVTF 123
Query: 112 EGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQR 171
E ETI+QLQ +ND +QKEAT E+TIKQL+ +NDLH+Q+
Sbjct: 124 E----------------------ETIKQLQDENDSHIQKEATFEDTIKQLKTENDLHMQK 161
Query: 172 EGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSW 231
E GLEM I LQ EK+FW KEA LE+K++ L DE AAL +K ASLEE++ L+++KDSW
Sbjct: 162 EAGLEMKIMQLQREKDFWFLKEAGLEEKLNLLLDEKAALGLKLASLEEKIGQLDSEKDSW 221
Query: 232 TQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKK 291
E+ +KE + +++DIT LRMQVVELE SRN+L++EN+QLKE+VS L+ QL + E+++
Sbjct: 222 AVSENTTKEIVGRMNIDITSLRMQVVELEYSRNSLVKENQQLKESVSDLKLQLQNVETQQ 281
Query: 292 HATS-----------EQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAG 340
+S E++D ++QIEAA AL+DKLITEN ELVEKVN+L +KLD+Q AA
Sbjct: 282 SISSANTSELGKNDAEKEDLNSQIEAACALVDKLITENAELVEKVNELYIKLDQQKAAAS 341
Query: 341 LSSAIGSDAV---TALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPP 397
LSS+ GSD + + L + T PMSESS++ L + + ADPP
Sbjct: 342 LSSSAGSDVILRNSELANGTPPMSESSESALGLKSE-----------------SLDADPP 384
Query: 398 PLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSF 457
VL S + EIVQIPLDD EV D+E+Q DK + VPLTDAPLIGAPFRL+SF
Sbjct: 385 AAVLPQSSEPDAEEIVQIPLDDNEVPDVEMQA----EDK-SGVPLTDAPLIGAPFRLISF 439
Query: 458 VAKYVSGADLVNKNASN 474
VAKYVSGADLV+K+ SN
Sbjct: 440 VAKYVSGADLVSKDDSN 456
>gi|297741755|emb|CBI32887.3| unnamed protein product [Vitis vinifera]
Length = 466
Score = 321 bits (823), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 228/501 (45%), Positives = 317/501 (63%), Gaps = 67/501 (13%)
Query: 1 MEDEKKKKRNKKKKNNKQTKTTTDDVAVGA-----DPNHL---INGQKDDVRSQASEPAD 52
MEDEKKKK+NKKKKN + + V VG D NHL +G ++ + AD
Sbjct: 1 MEDEKKKKKNKKKKNKQHKASEGPAVGVGEEIASLDENHLSTQTHGNQESI-------AD 53
Query: 53 IQN--VQ---VDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQK 107
+QN VQ VD + H +NG +S+ L EAE+Q LQREA L+E +K L N++E H QK
Sbjct: 54 VQNRDVQKENVDLNGHCANGTDSSILVEAEKQKWLQREAS--LQERIKQLENDKELHTQK 111
Query: 108 EATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDL 167
E+ L E D ++Q +++EATLEE IKQL+++NDL
Sbjct: 112 ESILA--------EAD-----------------KKQ---WLKREATLEERIKQLQDENDL 143
Query: 168 HIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEAD 227
H+Q+E GLEM I LQ+EK W+QKEA LE KI QL DE++ L++K ASLEE+++ E +
Sbjct: 144 HMQKEAGLEMRIVQLQTEKHSWIQKEATLEGKIQQLLDENSTLSVKWASLEEKVEHQERE 203
Query: 228 KDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQL--- 284
++SW E+ S+E I+ L+ + +L+ QV+ELEE R N+LQEN+ LKE VSSL+ Q+
Sbjct: 204 RNSWVLKENSSREIISSLNDENRKLQGQVMELEEFRINILQENQLLKEKVSSLQLQIKEL 263
Query: 285 --------SSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQS 336
+S E KHA S+ D + Q EAA LI+KL +EN +LVEKVN+L ++LD+
Sbjct: 264 EESVSSAHASTEITKHA-SKHVDLNPQTEAATVLIEKLTSENADLVEKVNELYIELDQLR 322
Query: 337 VAAGLSSAIGSD-AVTALTSD--TEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVH 393
VAAGLSSAIG D + AL + + MSE +DN S+ + R++ V + ++R +
Sbjct: 323 VAAGLSSAIGLDKKIGALQNSNVADHMSEPTDNSSASSERMDPLEAVPMNDERISADIED 382
Query: 394 ADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFR 453
+ ++ +SSE + SGEIVQIPLD+ E ++LELQ E+ D+ VP+TDAPLIGAPFR
Sbjct: 383 VEQTAVIPNSSETDDSGEIVQIPLDENEARELELQAAEN--DRNTTVPITDAPLIGAPFR 440
Query: 454 LVSFVAKYVSGADLVNKNASN 474
+SFVAKYVSGADLV K+++N
Sbjct: 441 FISFVAKYVSGADLVGKSSTN 461
>gi|356566804|ref|XP_003551617.1| PREDICTED: uncharacterized protein LOC100796148 [Glycine max]
Length = 651
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 185/417 (44%), Positives = 261/417 (62%), Gaps = 18/417 (4%)
Query: 72 NLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQA 131
NL E H+ Q+E ++ +ET+K L+ E E HIQ+EA + T++ L+ E D +KV
Sbjct: 236 NLKEKNDVHV-QKETLS--QETIKKLKEENEVHIQEEAISKETIKNLKEENDKLLQKV-V 291
Query: 132 TLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQ 191
+LEE I LQ N+L+ QK +LE I L N+ ++E GLE IA LQSE LQ
Sbjct: 292 SLEEVINNLQTDNELQTQKHESLERKINLLENELSSFSEKEVGLETRIAQLQSENNSLLQ 351
Query: 192 KEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQ 251
KEA L ++ +QL +E A L++K SLE+++ LLE+D +S + E+ +KETI+ L+ +I
Sbjct: 352 KEATLVERTNQLLNEKAVLSLKGESLEQKIYLLESDLNSLVKKENSTKETISNLNGNIAV 411
Query: 252 LRMQVVELEESRNNLLQENRQLKENVSSLRSQL-----------SSDESKKHATSEQKDF 300
L+ QV ELEESRNNL EN+QL+E VSSL+S + S D S K SE +D
Sbjct: 412 LQAQVEELEESRNNLFLENQQLREKVSSLQSTVQNHENSNTSSCSWDASVKDLASENEDL 471
Query: 301 STQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTE-- 358
++IEAA L++KL+ EN ELVEKV +L V+LD +S G S S+ +T T
Sbjct: 472 KSEIEAAFTLVEKLMAENAELVEKVTELCVELDHRSAEVGHSGVTESNGMTEFVKPTGVA 531
Query: 359 -PMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPL 417
P+ ES++ S + +L + +VK++ N + H S A+ +GEIVQIPL
Sbjct: 532 IPLPESAEYASVSSPKLNSLEETSVKDNGNSNDAKHVVGVTSNSSLVSADDAGEIVQIPL 591
Query: 418 DDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNKNASN 474
DD E+QD+ELQ ++ + AVP+TDAPLIGAPFRL+SFVA +VSGADLVN+++SN
Sbjct: 592 DDNEIQDIELQDAKNVENDADAVPITDAPLIGAPFRLISFVANFVSGADLVNQSSSN 648
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 154/300 (51%), Gaps = 42/300 (14%)
Query: 30 ADPNHLINGQKDDVRSQASEPADIQNVQVDADRHQSNGAESANLAEAERQHLLQREAMAI 89
+D N + +G+ +++ S E AD Q+ +D++ H NG E AI
Sbjct: 105 SDQNLVKSGKDENIPSL--EFADGQSTNMDSNGHPPNGKE-----------------CAI 145
Query: 90 LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
EET++ L+ E + HIQKE L KE TI++L+ +ND+ +Q
Sbjct: 146 SEETIRKLKEENDIHIQKET--------------LSKE--------TIRKLKAENDMHIQ 183
Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
KEA +EETI++L QND+H+Q+E E I L+ + + +QKEA E I L++++
Sbjct: 184 KEAIMEETIRKLTEQNDMHMQKEIASEETIRKLKEKHDMDVQKEAISEDTIRNLKEKNDV 243
Query: 210 LNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQE 269
K +E +K L+ + + Q E++SKETI L + +L +VV LEE NNL +
Sbjct: 244 HVQKETLSQETIKKLKEENEVHIQEEAISKETIKNLKEENDKLLQKVVSLEEVINNLQTD 303
Query: 270 NRQLKENVSSLRSQLSSDESKKHATSEQK-DFSTQIEAAGALIDKLITENIELVEKVNDL 328
N + SL +++ E++ + SE++ T+I + + L+ + LVE+ N L
Sbjct: 304 NELQTQKHESLERKINLLENELSSFSEKEVGLETRIAQLQSENNSLLQKEATLVERTNQL 363
>gi|359481328|ref|XP_002279159.2| PREDICTED: uncharacterized protein LOC100249907 [Vitis vinifera]
Length = 511
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 221/565 (39%), Positives = 300/565 (53%), Gaps = 150/565 (26%)
Query: 1 MEDEKKKKRNKKKKNNKQTKTTTDDVAVG-----ADPNHL---INGQKDDVRSQASEPAD 52
MEDEKKKK+NKKKKN + + V VG D NHL +G ++ + AD
Sbjct: 1 MEDEKKKKKNKKKKNKQHKASEGPAVGVGEEIASLDENHLSTQTHGNQESI-------AD 53
Query: 53 IQN--VQ---VDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQK 107
+QN VQ VD + H +NG +S+ L EAE+Q LQREA L+E +K L N++E H QK
Sbjct: 54 VQNRDVQKENVDLNGHCANGTDSSILVEAEKQKWLQREAS--LQERIKQLENDKELHTQK 111
Query: 108 EATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDL 167
E+ L E D ++Q +++EATLEE IKQL+++NDL
Sbjct: 112 ESILA--------EAD-----------------KKQ---WLKREATLEERIKQLQDENDL 143
Query: 168 HIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEAD 227
H+Q+E GLEM I LQ+EK W+QKEA LE KI QL DE++ L++K
Sbjct: 144 HMQKEAGLEMRIVQLQTEKHSWIQKEATLEGKIQQLLDENSTLSVK-------------- 189
Query: 228 KDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQL--- 284
W S+E I+ L+ + +L+ QV+ELEE R N+LQEN+ LKE VSSL+ Q+
Sbjct: 190 ---WNS----SREIISSLNDENRKLQGQVMELEEFRINILQENQLLKEKVSSLQLQIKEL 242
Query: 285 --------SSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQS 336
+S E KHA S+ D + Q EAA LI+KL +EN +LVEKVN+L ++LD+
Sbjct: 243 EESVSSAHASTEITKHA-SKHVDLNPQTEAATVLIEKLTSENADLVEKVNELYIELDQLR 301
Query: 337 VAAGLSSAIGSD-AVTALTSD--TEPMSESSDNMSSLNNRLETQGVVAVK---------- 383
VAAGLSSAIG D + AL + + MSE +DN S+ + R+E+ V +
Sbjct: 302 VAAGLSSAIGLDKKIGALQNSNVADHMSEPTDNSSASSERMESVEAVPIHGQSSQEAVPM 361
Query: 384 EDRNGINGVHADP-------PPLVLSSSEA-----EYSGEIVQI---------PLDDK-- 420
D N V P L SS EA + S E V I P++D+
Sbjct: 362 HDENSQEAVPIHDQRSLEAVPMLDESSQEAVPIHDQRSIEAVPINDKSPLEAVPMNDERI 421
Query: 421 --EVQDLELQVV---ESYTD---KVAAVPL-----------------------TDAPLIG 449
+++D+E V S TD ++ +PL TDAPLIG
Sbjct: 422 SADIEDVEQTAVIPNSSETDDSGEIVQIPLDENEARELELQAAENDRNTTVPITDAPLIG 481
Query: 450 APFRLVSFVAKYVSGADLVNKNASN 474
APFR +SFVAKYVSGADLV K+++N
Sbjct: 482 APFRFISFVAKYVSGADLVGKSSTN 506
>gi|79570413|ref|NP_181392.2| ATP synthase D chain-related protein [Arabidopsis thaliana]
gi|51968994|dbj|BAD43189.1| hypothetical protein [Arabidopsis thaliana]
gi|330254458|gb|AEC09552.1| ATP synthase D chain-related protein [Arabidopsis thaliana]
Length = 482
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 151/397 (38%), Positives = 240/397 (60%), Gaps = 29/397 (7%)
Query: 75 EAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLE 134
E E + +Q+EA+ LEE + HL+ E E+H+ EA LEG + L+ E + + + +A LE
Sbjct: 113 ETENEAHIQKEAL--LEERLVHLKTENEAHVHNEALLEGKLLHLRTENEDHIQN-EALLE 169
Query: 135 ETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEA 194
E + L+ +N+ +Q EA LEE + R +N+ H Q E LE + +++ + L++ +
Sbjct: 170 EKLLHLRTENEAHIQNEALLEERLLHFRTENEAHKQNEEKLEERLVQYKNKNDMLLREMS 229
Query: 195 ALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRM 254
+ E ++ QL DE + K ASLE++++ L+ D++S E S+E I+ L+ +I +LR
Sbjct: 230 STEAQMRQLLDERSTFTQKEASLEKKVQQLQHDEESLVAEEKSSREMISSLNNEIARLRA 289
Query: 255 QVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKL 314
QV ELE+S++NLL++N+ LKE +S+L+ Q + +S SE+ + ++QIEAA L++KL
Sbjct: 290 QVTELEKSKSNLLEQNQSLKETISNLQVQHENHDSNAKGASEE-ELNSQIEAACTLVEKL 348
Query: 315 ITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRL 374
ITEN +LVEKVN+L +KL++ A+ S AI + +L + P+ D + ++N
Sbjct: 349 ITENADLVEKVNELCIKLNQSQHASPESLAIEVEKSESL--EEIPI---HDELIRIDNSR 403
Query: 375 ETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYT 434
+ ++K RN G + PL L++ +GE+ D+E QV +
Sbjct: 404 DMD-TASIK--RNFSEGEIEETVPLSLNA-----NGEV-----------DVESQVAVAGE 444
Query: 435 DKV-AAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
D++ A VPL DAPLIGAPFRLVSFVA+YVSGADL K
Sbjct: 445 DEINAGVPLADAPLIGAPFRLVSFVARYVSGADLAAK 481
>gi|297823743|ref|XP_002879754.1| hypothetical protein ARALYDRAFT_345641 [Arabidopsis lyrata subsp.
lyrata]
gi|297325593|gb|EFH56013.1| hypothetical protein ARALYDRAFT_345641 [Arabidopsis lyrata subsp.
lyrata]
Length = 509
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 160/405 (39%), Positives = 239/405 (59%), Gaps = 44/405 (10%)
Query: 82 LQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQ 141
LQ+EA LEE + HL+ E E+HIQ EA LE + L+ E + + + +A LEE + L+
Sbjct: 132 LQKEAF--LEERLVHLKTENEAHIQNEALLEEKLLHLRTENEAHTQN-EALLEEKLLHLR 188
Query: 142 RQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEM----NIANLQSEKE---------- 187
+N+ +Q EA LEE + LR++N+ H Q E LE+ NI +Q + E
Sbjct: 189 TENEDHIQNEARLEERLLHLRSENEAHKQNEDHLEVGGQNNIVFMQEKLEERLVQYKNKN 248
Query: 188 -FWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLS 246
L++ ++ E ++ QL DE + K SLE++++ L+ D++S E S+E I+ L+
Sbjct: 249 DMLLREMSSTEAQMRQLLDERSTFTQKEVSLEKKVQQLQHDEESLVAEEKSSREMISSLN 308
Query: 247 VDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEA 306
+I +LR QV ELEES++NLL++N+ LKE VSSL+ Q + +S SE+ + ++QIEA
Sbjct: 309 NEIARLRAQVTELEESKSNLLEQNQSLKETVSSLQVQHENHDSNAKGASEE-ELNSQIEA 367
Query: 307 AGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDN 366
A L++KLITEN ELVEKVN+L +KL++ A+ S AI + +L + P+ D
Sbjct: 368 ACTLVEKLITENAELVEKVNELCIKLNQSQHASPESLAIEVEKSESL--EEIPI---HDE 422
Query: 367 MSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLE 426
+ ++N + + RN G + PL L++ +GE+ D+E
Sbjct: 423 LIRIDNSKDMDTALI---KRNLSEGEIEETVPLSLNA-----NGEV-----------DVE 463
Query: 427 LQVVESYTDKVAA-VPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
QVV + D+V+A VPL DAPLIGAPFRLVSFVA+YVSGADL K
Sbjct: 464 SQVVVAGEDEVSAGVPLADAPLIGAPFRLVSFVARYVSGADLAAK 508
>gi|356530165|ref|XP_003533654.1| PREDICTED: uncharacterized protein LOC100811378 [Glycine max]
Length = 639
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 151/439 (34%), Positives = 228/439 (51%), Gaps = 79/439 (17%)
Query: 72 NLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQA 131
NL E H+ Q+E ++ +ET++ L+ E E HIQKEA + T+++ + E D +K +
Sbjct: 241 NLKEKNDMHM-QKETLS--QETIRKLKEENEGHIQKEAISKETIKKFEEENDKLVQK-ET 296
Query: 132 TLEETIQQLQRQNDLRMQKEATLE----------------------------ETIKQLRN 163
+LE I QLQ +N+ +QKE ++ E+++ L
Sbjct: 297 SLEMRIAQLQSENNSLLQKEYIIDGILDLGPNPDHWAEQNKDQLLDPSPKEMESLEGLYQ 356
Query: 164 QNDL------HIQR--EGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRA 215
+ L +I R GLE A L SE LQKEA L ++ +QL +E L++K
Sbjct: 357 KAALWRVMFSYIGRRLHVGLETRFAQLHSENNSLLQKEATLVERTNQLLNEKEVLSLKGE 416
Query: 216 SLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKE 275
SLE+++ LLE+D S + E+ +K+TI+ L+ +I L++Q +
Sbjct: 417 SLEQKIYLLESDLSSLVEKENSTKDTISKLNGNIAVLQVQDL------------------ 458
Query: 276 NVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQ 335
SE KD ++IEAA L++KL+ EN ELVEKV +L V+LD Q
Sbjct: 459 ------------------ASENKDLKSEIEAAFTLVEKLMAENAELVEKVTELCVELDHQ 500
Query: 336 SVAAGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHAD 395
S + T + P ES++ S +L + +VK++ N N
Sbjct: 501 SAVTEPNEMTEFAKPTGVAI---PPPESAEYASVSAPKLNSLEETSVKDNGNSFNDAKHV 557
Query: 396 PPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLV 455
+ SS ++ +GEIVQIPLDD E+QD+ELQ ++ + AVP+TDAPLIGAPFRL+
Sbjct: 558 VGVMSNSSLLSDDAGEIVQIPLDDNEIQDIELQDAKNVENDADAVPITDAPLIGAPFRLI 617
Query: 456 SFVAKYVSGADLVNKNASN 474
SFVA +VSGADLV+ ++SN
Sbjct: 618 SFVANFVSGADLVDPSSSN 636
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 73/120 (60%), Gaps = 5/120 (4%)
Query: 131 ATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWL 190
A EETI++L+ +ND+ +QKE +ETI++L+ +ND+HIQ+E +E I L + + +
Sbjct: 144 AISEETIRKLKEENDIHIQKETLSKETIRKLKAENDMHIQKEAIMEETIRKLTEQNDLHM 203
Query: 191 QKEA-----ALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGL 245
QKEA A E+ I +L+++ K A E+ ++ L+ D Q E++S+ETI L
Sbjct: 204 QKEAIFPQIASEETIRKLKEKHDVHVQKEAISEDTIRNLKEKNDMHMQKETLSQETIRKL 263
>gi|3786014|gb|AAC67360.1| hypothetical protein [Arabidopsis thaliana]
Length = 377
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 198/321 (61%), Gaps = 26/321 (8%)
Query: 151 EATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAAL 210
E LEETIKQLR +N ++Q+E LE + +++ + L++ ++ E ++ QL DE +
Sbjct: 81 EVLLEETIKQLREENGSYLQKEEKLEERLVQYKNKNDMLLREMSSTEAQMRQLLDERSTF 140
Query: 211 NMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQEN 270
K ASLE++++ L+ D++S E S+E I+ L+ +I +LR QV ELE+S++NLL++N
Sbjct: 141 TQKEASLEKKVQQLQHDEESLVAEEKSSREMISSLNNEIARLRAQVTELEKSKSNLLEQN 200
Query: 271 RQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSV 330
+ LKE +S+L+ Q + +S SE+ + ++QIEAA L++KLITEN +LVEKVN+L +
Sbjct: 201 QSLKETISNLQVQHENHDSNAKGASEE-ELNSQIEAACTLVEKLITENADLVEKVNELCI 259
Query: 331 KLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGIN 390
KL++ A+ S AI + +L + P+ D + ++N + ++K RN
Sbjct: 260 KLNQSQHASPESLAIEVEKSESL--EEIPI---HDELIRIDNSRDMD-TASIK--RNFSE 311
Query: 391 GVHADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKV-AAVPLTDAPLIG 449
G + PL L++ +GE+ D+E QV + D++ A VPL DAPLIG
Sbjct: 312 GEIEETVPLSLNA-----NGEV-----------DVESQVAVAGEDEINAGVPLADAPLIG 355
Query: 450 APFRLVSFVAKYVSGADLVNK 470
APFRLVSFVA+YVSGADL K
Sbjct: 356 APFRLVSFVARYVSGADLAAK 376
>gi|449439795|ref|XP_004137671.1| PREDICTED: uncharacterized protein LOC101221440 [Cucumis sativus]
gi|449497171|ref|XP_004160333.1| PREDICTED: uncharacterized LOC101221440 [Cucumis sativus]
Length = 392
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 130/333 (39%), Positives = 196/333 (58%), Gaps = 13/333 (3%)
Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
K+A L++TIK L +N++HI+R L++ + + EK WLQKE AL KI L+++ A
Sbjct: 62 KDAKLDDTIKHLHEENNIHIKRMADLDLKLVECEGEKYSWLQKEEALMDKIRNLQEDKTA 121
Query: 210 LNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQE 269
L+++ A L +KLLE DK S E S+ETI + DI++L+ QVVELEE + +LL E
Sbjct: 122 LDLEGARLLNIIKLLERDKASLILDEKSSRETIVDKNKDISRLQAQVVELEEQKRDLLHE 181
Query: 270 NRQLKENVSSLRSQLSSDESKKHATS-------EQKDFSTQIEAAGALIDKLITENIELV 322
N+QL V+ +S+L + E K +T ++ ++Q++AA L+DKLITEN EL+
Sbjct: 182 NKQLTGKVADYQSKLLNLERKISSTYIHSSDRVTKEILNSQVDAARILVDKLITENAELI 241
Query: 323 EKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDN--MSSLNNRLETQGVV 380
KVN+L V+L R + L S + D + +DT +ES ++S+ + +
Sbjct: 242 GKVNELFVELQRVT-KTELPSGVVPDQMATEATDTTTFNESEPPVILNSVTSGKSLDALK 300
Query: 381 AVKEDRNGINGVHADPPPLVLSS--SEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVA 438
+V + I G D ++S S +GEI QI L E Q+ ++ + D+
Sbjct: 301 SVSIHSHSIGGDFVDLGSDFMASEASMPMAAGEIEQIQLHQFEDQNGTRELPATEIDE-K 359
Query: 439 AVPLTDAPLIGAPFRLVSFVAKYVSGADLVNKN 471
V L+DAPLIGAP+RL+SF+AKYVSGADLV K+
Sbjct: 360 DVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 392
>gi|326488811|dbj|BAJ98017.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 108/360 (30%), Positives = 174/360 (48%), Gaps = 53/360 (14%)
Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
LY+ K+ L +TI+QL+ + L +QK +E I++L N+ H Q E LE + NLQ
Sbjct: 132 LYEAKLDK-LHDTIKQLEDEKGLWLQKMNKMESEIEKLHNKVGYHAQNEVILEEKLNNLQ 190
Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
+ + ++KE L+ K+ + D + L + L+ERL LE +
Sbjct: 191 NGHDMLVKKEEVLDNKVKCVEDVNGVLTHQETFLKERLSGLEETNKA------------- 237
Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATS-------E 296
L+ QV L+E+ + ++EN+QL +V L S+L + E+K T
Sbjct: 238 --------LQEQVKVLDEASKSTVEENQQLLVSVDELESRLQTLEAKIALTEASITKEVP 289
Query: 297 QKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSA--IGSDAVTALT 354
+ + Q + AG+ + K + ++ K N+L+ DR GL+S+ + SD + +
Sbjct: 290 KNEVMNQTDLAGSFLHKQTAGFMTVISKGNELTA--DR-----GLNSSLVVTSDNIYSHV 342
Query: 355 SDTEPMSESSDNMSSLNNRL----ETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSG 410
S+ + +SD+ + + G + N G E SG
Sbjct: 343 SNIPVGAYASDHADETSAHFPEATSSNGAGQSLMNANARQGF-----------DEPRMSG 391
Query: 411 EIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
EIV +PLDD + + + Q S + VP TDAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 392 EIVPVPLDDILIHEDDPQPAGSDVETAEEVPFTDAPIVGAPFRLISFVARYVSGADLVNQ 451
>gi|148909438|gb|ABR17817.1| unknown [Picea sitchensis]
Length = 558
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 122/422 (28%), Positives = 199/422 (47%), Gaps = 97/422 (22%)
Query: 130 QATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFW 189
+A+ EE I++L+ + L MQKEA++E+ IK+ + + +Q+EG +E I L+ EK W
Sbjct: 143 EASKEEEIKRLKDKFKLHMQKEASIEDEIKRFKEEKISWLQKEGTMEKEIKRLKDEKSSW 202
Query: 190 LQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDI 249
LQ+EA +++I L DE K EE ++ L+A+ +S + E ++ GL +
Sbjct: 203 LQREAKTQEEIKSLTDEKQLWLQKEVFREEEIERLKAENNSLLKKEVTKEDEAKGLRFEN 262
Query: 250 TQLRMQVVELEESRNNLLQENRQLK-------------ENVSSLRSQLSSD--------- 287
L+ +VVE+++S +L E++QL E +++ SQL D
Sbjct: 263 FSLQKEVVEMKKSLESLSLEHQQLTAQAVAVQTGIQQLERNATVHSQLEQDIAIATTPEV 322
Query: 288 -------ESKKHATSE----QKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQS 336
S K +E +D Q EAA A+++KL+ EN EL+EKVN+LS +D+
Sbjct: 323 SKQWLGWGSAKDPVAEAARIAEDSRLQKEAAAAIVEKLVIENTELMEKVNELSTIVDQ-- 380
Query: 337 VAAGLSSAIGSDAVTALTS-DTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHAD 395
+ SS S T+ D +S + +SL+ +L NG+ GV
Sbjct: 381 LNKKCSSLAESKVRLQETAQDATALSPLTFKENSLSTKL-----------SNGL-GVTNY 428
Query: 396 PPPLVLSSSEAEYS-GEIVQI-----PLDDKEVQDLELQVVE-----------------S 432
P ++++ Y+ EIV+I P+++ +E ++VE S
Sbjct: 429 PSVSIVNNDVTSYTESEIVEIMASQDPVNNDVTSYMESEIVEIMPSQDPVTMEDGENGLS 488
Query: 433 YTDKVA--------------------------AVPLTDAPLIGAPFRLVSFVAKYVSGAD 466
+D + VP +DAP++GAP RLVSF+ KYV+GAD
Sbjct: 489 ASDSIGTVEEAKQITSNSVREPVDMTFQKTSETVPFSDAPVMGAPIRLVSFLTKYVTGAD 548
Query: 467 LV 468
LV
Sbjct: 549 LV 550
>gi|242035841|ref|XP_002465315.1| hypothetical protein SORBIDRAFT_01g036280 [Sorghum bicolor]
gi|241919169|gb|EER92313.1| hypothetical protein SORBIDRAFT_01g036280 [Sorghum bicolor]
Length = 458
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 170/335 (50%), Gaps = 35/335 (10%)
Query: 154 LEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMK 213
L ETIKQL N+ L +Q+ +E + L SE Q E LE K++ L+ L K
Sbjct: 140 LHETIKQLENEKSLWLQKVSIMEGELEKLHSEVGCHAQNEVLLEGKVNSLQSGYDVLIKK 199
Query: 214 RASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQL 273
L+++++ ++ D+ T E++ KE ++GL L +QV LE++ NN +EN+ L
Sbjct: 200 EEVLDDKVRRIDNINDTLTHQEALLKERLSGLEETNKALLVQVKMLEDASNNTSEENQML 259
Query: 274 KENVSSLRSQLSSDESKKHATSEQKDFSTQIEA-AGALIDKLITENIELVEKVNDLSVKL 332
+ + L S+L + E+K A SE ++ IE ++ + + +++++V+ + L
Sbjct: 260 VQKIDELDSRLQALEAKC-APSE----ASMIEKNVIKIVMQFPVPDNKVMDQVDFANSPL 314
Query: 333 DRQSVAA----------GLSSA--IGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVV 380
+Q++ GL S+ I D D + +S+ + +L G
Sbjct: 315 QQQTIGFSEGNKLIAERGLGSSVQINPDNSYGQIYDIPSNAYASNYPEEASIQLPEIGT- 373
Query: 381 AVKEDRNGINGVHADPPPLVLSSSEAEYSG-----EIVQIPLDDKEVQDLELQVVESYTD 435
N I H D +S +E + G EIV +PLDD E+ + + Q D
Sbjct: 374 -----SNSIVQGHVD-----VSVNEHRFDGPRPSEEIVPVPLDDIEIHEDDPQ-QPGAAD 422
Query: 436 KVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
++ VP +DAP+IGAPFRL+SFVA+YVSGADLVN+
Sbjct: 423 EIDEVPFSDAPIIGAPFRLISFVARYVSGADLVNQ 457
>gi|212721506|ref|NP_001131641.1| uncharacterized protein LOC100192998 [Zea mays]
gi|194692122|gb|ACF80145.1| unknown [Zea mays]
gi|414866651|tpg|DAA45208.1| TPA: hypothetical protein ZEAMMB73_881768 [Zea mays]
Length = 446
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 171/350 (48%), Gaps = 52/350 (14%)
Query: 140 LQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQK 199
+ R ++ ++ K L ETIKQL N+ L +Q+ +E + L +E Q E LE+K
Sbjct: 129 MHRLHEAKLDK---LHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHAQNEVLLEEK 185
Query: 200 ISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVEL 259
++ L+ L K ++ +++ ++ D+ T ES+ KE ++GL L +QV L
Sbjct: 186 LNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETNKSLLVQVKVL 245
Query: 260 EESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAA--GALIDKLITE 317
EE+ NN +EN+ L + + L S+L + E+K A SE + IE ++D++
Sbjct: 246 EEASNNTSEENQMLVKKIYELDSRLQALEAKA-APSE----ALMIEKFPDNKVMDQVHFA 300
Query: 318 NIELVEKVNDLS--------------VKLDRQSVAAGLSSAIGSDAVTALTSDTE---PM 360
N L ++ D S VKL+ + + +D + T +T P
Sbjct: 301 NSPLQQQTIDFSEGNKLFAERGLSSPVKLNPDNSYRQIYDIPSNDYASNYTEETSIQLPE 360
Query: 361 SESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPLDDK 420
+S+ + QG V V E R D P + EIV +PLD+
Sbjct: 361 IGTSNTI--------VQGHVDVNEHR-------FDGPRTI---------EEIVPVPLDEI 396
Query: 421 EVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
++ + ++ D++ VP +DAP+IGAPFRL+SFVA+YVSGADLVN+
Sbjct: 397 QIHE-DVPGQPGAADEIDEVPFSDAPIIGAPFRLISFVARYVSGADLVNQ 445
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 115/280 (41%), Gaps = 59/280 (21%)
Query: 1 MEDEKKKKRNKKKKNNKQTKTTTDDVAVGADPNH-----LINGQK----DDVRSQASE-- 49
M+ +KK+ + KK K T A NH I+ K DDV S E
Sbjct: 1 MDSKKKRNKKKKGNQGKNAADVTSIAGEAASQNHNHELAPIDHHKGSDADDVMSSVGEEL 60
Query: 50 -------PADIQNVQVDAD----------------------------RHQSNGA-----E 69
P N+Q D + H+++ A
Sbjct: 61 PQYKNHQP----NIQADYNGTSAYDTTSSIGECIACYQNNEPMMTQENHKASNAVPADQR 116
Query: 70 SANLAEAERQ---HLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYK 126
S L+E+ + H L + L ET+K L NE+ +QK T+EG +++L NE +
Sbjct: 117 SVGLSESSVELDMHRLHEAKLDKLHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHA 176
Query: 127 EKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEK 186
+ + LEE + LQ D+ ++KE ++ ++ + N ND +E L+ ++ L+
Sbjct: 177 QN-EVLLEEKLNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETN 235
Query: 187 EFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEA 226
+ L + LE+ + +E+ L K L+ RL+ LEA
Sbjct: 236 KSLLVQVKVLEEASNNTSEENQMLVKKIYELDSRLQALEA 275
>gi|414866652|tpg|DAA45209.1| TPA: hypothetical protein ZEAMMB73_881768 [Zea mays]
Length = 454
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/348 (29%), Positives = 174/348 (50%), Gaps = 40/348 (11%)
Query: 140 LQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQK 199
+ R ++ ++ K L ETIKQL N+ L +Q+ +E + L +E Q E LE+K
Sbjct: 129 MHRLHEAKLDK---LHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHAQNEVLLEEK 185
Query: 200 ISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVEL 259
++ L+ L K ++ +++ ++ D+ T ES+ KE ++GL L +QV L
Sbjct: 186 LNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETNKSLLVQVKVL 245
Query: 260 EESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENI 319
EE+ NN +EN+ L + + L S+L + E+K A SE +I+K +T+ +
Sbjct: 246 EEASNNTSEENQMLVKKIYELDSRLQALEAKA-APSE-----------ALMIEKSVTKTV 293
Query: 320 ------ELVEKVNDLSVKLDRQSV----AAGLSSAIGSDAVTALTSDTEP--MSESSDNM 367
+++++V+ + L +Q++ L + G + L D + + N
Sbjct: 294 MQFPDNKVMDQVHFANSPLQQQTIDFSEGNKLFAERGLSSPVKLNPDNSYRQIYDIPSND 353
Query: 368 SSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSG-----EIVQIPLDDKEV 422
+ N ET + N I H D +E + G EIV +PLD+ ++
Sbjct: 354 YASNYTEETSIQLPEIGTSNTIVQGHVD-------VNEHRFDGPRTIEEIVPVPLDEIQI 406
Query: 423 QDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
+ ++ D++ VP +DAP+IGAPFRL+SFVA+YVSGADLVN+
Sbjct: 407 HE-DVPGQPGAADEIDEVPFSDAPIIGAPFRLISFVARYVSGADLVNQ 453
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 115/280 (41%), Gaps = 59/280 (21%)
Query: 1 MEDEKKKKRNKKKKNNKQTKTTTDDVAVGADPNH-----LINGQK----DDVRSQASE-- 49
M+ +KK+ + KK K T A NH I+ K DDV S E
Sbjct: 1 MDSKKKRNKKKKGNQGKNAADVTSIAGEAASQNHNHELAPIDHHKGSDADDVMSSVGEEL 60
Query: 50 -------PADIQNVQVDAD----------------------------RHQSNGA-----E 69
P N+Q D + H+++ A
Sbjct: 61 PQYKNHQP----NIQADYNGTSAYDTTSSIGECIACYQNNEPMMTQENHKASNAVPADQR 116
Query: 70 SANLAEAERQ---HLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYK 126
S L+E+ + H L + L ET+K L NE+ +QK T+EG +++L NE +
Sbjct: 117 SVGLSESSVELDMHRLHEAKLDKLHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHA 176
Query: 127 EKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEK 186
+ + LEE + LQ D+ ++KE ++ ++ + N ND +E L+ ++ L+
Sbjct: 177 QN-EVLLEEKLNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETN 235
Query: 187 EFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEA 226
+ L + LE+ + +E+ L K L+ RL+ LEA
Sbjct: 236 KSLLVQVKVLEEASNNTSEENQMLVKKIYELDSRLQALEA 275
>gi|125543744|gb|EAY89883.1| hypothetical protein OsI_11427 [Oryza sativa Indica Group]
Length = 424
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 167/354 (47%), Gaps = 68/354 (19%)
Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
LY+ K+ L +TI++L+ + L QK +++E +++L N+ D H Q E LE + NLQ
Sbjct: 131 LYEAKLDK-LHDTIKKLEDEKSLWHQKMSSMEIEVEKLHNKVDYHAQNEVRLEEKLNNLQ 189
Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
+ + ++KE AL+ K+ + +E D+ T E+ KE ++
Sbjct: 190 NGYDVLIKKEVALDNKV---------------------RSIEVINDALTHQETSLKERLS 228
Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQ 303
GL L +QV LEE+ NN ++E+++L + L S+L
Sbjct: 229 GLEETNKVLLVQVKVLEEASNNTVEESQRLVKGFDELASRLGV----------------- 271
Query: 304 IEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLS----SAIGSDAVTALTSDT-- 357
EA AL + +T +K N+L V S AA S S I S A S+
Sbjct: 272 FEAKSALTEASVT------KKGNELIVDRSVNSSAAITSVDNYSPINSSPSNAYVSNHLE 325
Query: 358 -EPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIP 416
PM + N + ++G++ D N D E S EI+ +
Sbjct: 326 EAPMQLPETTI----NDVSSEGLI----DMNAHQRSKQD-------CDEPRTSEEILPVA 370
Query: 417 LDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
LDD ++ + + Q + D+ VP +DAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 371 LDDIQIHEEDPQPPVA-DDEAEEVPFSDAPIVGAPFRLISFVARYVSGADLVNQ 423
>gi|115452839|ref|NP_001050020.1| Os03g0333000 [Oryza sativa Japonica Group]
gi|108707985|gb|ABF95780.1| expressed protein [Oryza sativa Japonica Group]
gi|113548491|dbj|BAF11934.1| Os03g0333000 [Oryza sativa Japonica Group]
Length = 424
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 167/354 (47%), Gaps = 68/354 (19%)
Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
LY+ K+ L +TI++L+ + L QK +++E +++L N+ D H Q E LE + NLQ
Sbjct: 131 LYEAKLDK-LHDTIKKLEDEKSLWHQKMSSMEIEVEKLHNKVDYHAQNEVRLEEKLNNLQ 189
Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
+ + ++KE AL+ K+ + +E D+ T E+ KE ++
Sbjct: 190 NGYDVLIKKEVALDNKV---------------------RSIEVINDALTHQETSLKERLS 228
Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQ 303
GL L +QV LEE+ NN ++E+++L + L S+L
Sbjct: 229 GLEETNKVLLVQVKVLEEASNNTVEESQRLVKGFDELASRLGV----------------- 271
Query: 304 IEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLS----SAIGSDAVTALTSDT-- 357
EA AL + +T +K N+L V S AA S S I S A S+
Sbjct: 272 FEAKSALTEASVT------KKGNELIVDRSVNSSAAITSVDNYSPINSSPSNAYVSNHLE 325
Query: 358 -EPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIP 416
PM + N + ++G++ D N D E S EI+ +
Sbjct: 326 EAPMQLPETTI----NDVSSEGLI----DMNAHQRSKQD-------FDEPRTSEEILPVA 370
Query: 417 LDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
LDD ++ + + Q + D+ VP +DAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 371 LDDIQIHEEDPQPPVA-DDEAEEVPFSDAPIVGAPFRLISFVARYVSGADLVNQ 423
>gi|125586148|gb|EAZ26812.1| hypothetical protein OsJ_10723 [Oryza sativa Japonica Group]
Length = 429
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 168/358 (46%), Gaps = 71/358 (19%)
Query: 124 LYKEKV----QATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNI 179
LY+ K+ Q L +TI++L+ + L QK +++E +++L N+ D H Q E LE +
Sbjct: 131 LYEAKLILPMQDKLHDTIKKLEDEKSLWHQKMSSMEIEVEKLHNKVDYHAQNEVRLEEKL 190
Query: 180 ANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSK 239
NLQ+ + ++KE AL+ K+ + +E D+ T E+ K
Sbjct: 191 NNLQNGYDVLIKKEVALDNKV---------------------RSIEVINDALTHQETSLK 229
Query: 240 ETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKD 299
E ++GL L +QV LEE+ NN ++E+++L + L S+L
Sbjct: 230 ERLSGLEETNKVLLVQVKVLEEASNNTVEESQRLVKGFDELASRLGV------------- 276
Query: 300 FSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLS----SAIGSDAVTALTS 355
EA AL + +T +K N+L V S AA S S I S A S
Sbjct: 277 ----FEAKSALTEASVT------KKGNELIVDRSVNSSAAITSVDNYSPINSSPSNAYVS 326
Query: 356 DT---EPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEI 412
+ PM + N + ++G++ D N D E S EI
Sbjct: 327 NHLEEAPMQLPETTI----NDVSSEGLI----DMNAHQRSKQD-------FDEPRTSEEI 371
Query: 413 VQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
+ + LDD ++ + + Q + D+ VP +DAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 372 LPVALDDIQIHEEDPQPPVA-DDEAEEVPFSDAPIVGAPFRLISFVARYVSGADLVNQ 428
>gi|147845679|emb|CAN80589.1| hypothetical protein VITISV_040788 [Vitis vinifera]
Length = 472
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 50/76 (65%), Positives = 63/76 (82%), Gaps = 2/76 (2%)
Query: 399 LVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFV 458
++ +SSE + SGEIVQIPLD+ E ++LELQ E+ D+ VPLTDAPLIGAPFRL+SFV
Sbjct: 394 VIPNSSETDDSGEIVQIPLDENEARELELQAAEN--DRNTTVPLTDAPLIGAPFRLISFV 451
Query: 459 AKYVSGADLVNKNASN 474
AKYVSGADLV K+++N
Sbjct: 452 AKYVSGADLVGKSSTN 467
>gi|357112364|ref|XP_003557979.1| PREDICTED: uncharacterized protein LOC100829228 isoform 1
[Brachypodium distachyon]
Length = 448
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 173/382 (45%), Gaps = 98/382 (25%)
Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
LY+ K+ L T++QL+ + L ++K +E +++L N+ H Q E LE + LQ
Sbjct: 129 LYEAKLDK-LHGTVKQLEDEKSLWLKKVNMMEIELEKLHNKVGYHAQNEVLLEEKLDILQ 187
Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
+ ++KE L+ K+S + D + L + SL+ERL +++E +K
Sbjct: 188 HGHDMLVKKEEVLDNKVSCIEDANVVLTHEETSLKERL----------SELEETNK---- 233
Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHAT--------S 295
L QV L+E+ ++EN+ L ++ L S+L + E+K +
Sbjct: 234 -------ALLEQVKVLDEASKITVEENQSLLTSIYELESRLQAVEAKIFLSEVSISKEVP 286
Query: 296 EQKDFSTQIEAAGALIDKLIT--------ENIELVEKVNDLSVKLDRQSVAAGLSSAIGS 347
E K Q + G+L+ T E EL+ GL+S++
Sbjct: 287 ENKVMDHQTDLTGSLLHNQTTDFTNLISNEGNELIGN--------------RGLNSSV-- 330
Query: 348 DAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVL---SSS 404
VT S +N S +NN + + + H D L+ +SS
Sbjct: 331 -TVT-----------SENNHSHINN---SSSIAYISN--------HPDETSLLFPEATSS 367
Query: 405 EAE--------YSG--------EIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLI 448
A+ + G EI+ +PLDD ++ + +LQ D+ A VP TDAP++
Sbjct: 368 SADQGFIHENAHQGFDKPRINEEIMPVPLDDIQIHEDDLQ--PRGLDETAEVPFTDAPIV 425
Query: 449 GAPFRLVSFVAKYVSGADLVNK 470
GAPFRL+SFVA+YVSGADLV++
Sbjct: 426 GAPFRLISFVARYVSGADLVDQ 447
>gi|357112366|ref|XP_003557980.1| PREDICTED: uncharacterized protein LOC100829228 isoform 2
[Brachypodium distachyon]
Length = 416
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 86/322 (26%), Positives = 153/322 (47%), Gaps = 69/322 (21%)
Query: 175 LEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQM 234
L + L+ EK WL+K +E ++ +L ++ LEE+L +L+ D +
Sbjct: 137 LHGTVKQLEDEKSLWLKKVNMMEIELEKLHNKVGYHAQNEVLLEEKLDILQHGHDMLVKK 196
Query: 235 ESVSKETIAG-------LSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSD 287
E V ++ L+ + T L+ ++ ELEE+ LL++ + L E S+++ +
Sbjct: 197 EEVLDNKVSCIEDANVVLTHEETSLKERLSELEETNKALLEQVKVLDE-----ASKITVE 251
Query: 288 ESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGS 347
E++ TS + ++++A A K+ + + ++ N+L IG+
Sbjct: 252 ENQSLLTS-IYELESRLQAVEA---KIFLSEVSISKEGNEL----------------IGN 291
Query: 348 DAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVL---SSS 404
+ + + T S +N S +NN + + + H D L+ +SS
Sbjct: 292 RGLNSSVTVT-----SENNHSHINN---SSSIAYISN--------HPDETSLLFPEATSS 335
Query: 405 EAE--------YSG--------EIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLI 448
A+ + G EI+ +PLDD ++ + +LQ D+ A VP TDAP++
Sbjct: 336 SADQGFIHENAHQGFDKPRINEEIMPVPLDDIQIHEDDLQ--PRGLDETAEVPFTDAPIV 393
Query: 449 GAPFRLVSFVAKYVSGADLVNK 470
GAPFRL+SFVA+YVSGADLV++
Sbjct: 394 GAPFRLISFVARYVSGADLVDQ 415
>gi|357477631|ref|XP_003609101.1| hypothetical protein MTR_4g111980, partial [Medicago truncatula]
gi|355510156|gb|AES91298.1| hypothetical protein MTR_4g111980, partial [Medicago truncatula]
Length = 475
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 108/229 (47%), Gaps = 39/229 (17%)
Query: 45 SQASEPADIQNVQVDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESH 104
+Q E AD Q+ +D++ H NG E I +ET+K L+ E
Sbjct: 281 AQPMESADGQSTNMDSNGHLPNGKECD-----------------ISDETIKKLKEENAML 323
Query: 105 IQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQ 164
IQ+E E ETI++L ++ + KE EETI++L Q
Sbjct: 324 IQREVMSE----------------------ETIRKLNEDKNMHILKETISEETIRKLEEQ 361
Query: 165 NDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLL 224
D+H+++E LE +I L+ E + ++KEA LE I +L E+ K A+ +E ++ L
Sbjct: 362 KDMHVEKEVALEESIRKLKKEIDMHVKKEAILEDTIRKLNKENDTHMQKEATSKETIRNL 421
Query: 225 EADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQL 273
E + + + Q E+ ++ETIA L + + + V +EE N L R L
Sbjct: 422 EEENEVYIQKEAKAEETIAKLKEAVDRHIHKEVTMEEIINKLQTWYRAL 470
>gi|123240166|ref|XP_001287759.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121855734|gb|EAX74829.1| hypothetical protein TVAG_009050 [Trichomonas vaginalis G3]
Length = 394
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 87/141 (61%), Gaps = 2/141 (1%)
Query: 90 LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
L + VK L+++ + +K+ LEG +Q+L+++ D +++ LE+ IQ+L+ ++ + +
Sbjct: 3 LADKVKGLQDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEDAIQELKDKDAAQDE 61
Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
K+ LE+ I++L++++ +++ LE I L+ + +K+ LE I +L+D+ AA
Sbjct: 62 KDKNLEDAIQELKDKDAAQDEKDKNLEDAIQELKDKDAAQDEKDKNLEGAIQELKDKDAA 121
Query: 210 LNMKRASLEERLKLLEADKDS 230
+ K +LE+ ++ L+ DKD+
Sbjct: 122 QDEKDKNLEDAIQELK-DKDA 141
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 78/129 (60%), Gaps = 1/129 (0%)
Query: 90 LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
LE+ ++ L+++ + +K+ LEG +Q+L+++ D +++ LE+ IQ+L+ ++ + +
Sbjct: 87 LEDAIQELKDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEDAIQELKDKDAAQDE 145
Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
K+ LE I++L++++ +++ LE I L+ + +K+ LE I +L+D+ AA
Sbjct: 146 KDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAA 205
Query: 210 LNMKRASLE 218
+ K +LE
Sbjct: 206 QDEKDKNLE 214
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 125/248 (50%), Gaps = 41/248 (16%)
Query: 90 LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
LE+ ++ L+++ + +K+ LEG +Q+L+++ D +++ LE IQ+L+ ++ + +
Sbjct: 129 LEDAIQELKDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEGAIQELKDKDAAQDE 187
Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
K+ LE I++L++++ +++ LE I L+ + +K+ LE I +L+D+ AA
Sbjct: 188 KDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAA 247
Query: 210 LNMKRASLEERLKLLEADKDS-WTQMESVSKETIAGLSVDITQLRMQVVELEESR----- 263
+ K +LE ++ L+ DKD+ + + ++E I L+ + L+ ++ + EE++
Sbjct: 248 QDEKDKNLEGAIQELK-DKDAAQDEKDKANEEAIKSLADRLQDLKDKIRDAEEAKATQGA 306
Query: 264 -------------NNLLQE---NRQLK-----------------ENVSSLRSQLSSDESK 290
NNL++E + LK E++ +L Q+ E +
Sbjct: 307 EDLQGVNDALNALNNLIKEKADDDALKALEDRLNNNDNKDNKQDEDLKALADQIQEMEDR 366
Query: 291 KHATSEQK 298
K EQ+
Sbjct: 367 KKEEDEQR 374
>gi|123469108|ref|XP_001317768.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121900510|gb|EAY05545.1| hypothetical protein TVAG_319000 [Trichomonas vaginalis G3]
Length = 491
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/184 (22%), Positives = 92/184 (50%), Gaps = 7/184 (3%)
Query: 132 TLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQ 191
+LE+++ Q++ QK LE ++ Q+++ Q+ G LE ++ ++ ++ Q
Sbjct: 282 SLEQSVNQIKDDQTTMNQKLGALETSVNQIKDDQTTMNQKLGALETSVNQIKDDQTTMNQ 341
Query: 192 KEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQ 251
K ALE ++Q++D+ +N K +LE + ++ D+ + Q + ++ + D T
Sbjct: 342 KLGALETSVNQIKDDQTTMNQKLGALETSVNQIKDDQTTMNQKLGALETSVNQIKDDQTT 401
Query: 252 LRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALI 311
+ ++ LE S N Q+K++ +++ +LS +S + + + T++ A
Sbjct: 402 MNQKLGALETSVN-------QIKDDQTTMNQKLSKLDSLEQSVNNITQMLTKLNAQYETN 454
Query: 312 DKLI 315
DK+I
Sbjct: 455 DKII 458
>gi|357477629|ref|XP_003609100.1| hypothetical protein MTR_4g111960 [Medicago truncatula]
gi|355510155|gb|AES91297.1| hypothetical protein MTR_4g111960 [Medicago truncatula]
Length = 643
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 74/168 (44%), Gaps = 36/168 (21%)
Query: 31 DPNHLINGQKDDVRSQASEPADIQNVQVDADRHQ---SNGAESANLAEAERQHLLQREAM 87
D NHL+N KD E A+I V+++R + G+ESA L + +
Sbjct: 90 DHNHLVNDAKD-------EHANISEAVVNSNRGWRKFNQGSESA----IATPLLYKHSPL 138
Query: 88 AILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLR 147
I +ET+++L+ E + IQ+E E T+Q L E D
Sbjct: 139 DISDETIRNLKEENDMLIQRETISEKTIQILTEEIDT----------------------N 176
Query: 148 MQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAA 195
++KE + IK+L +N +HIQ E LE NI L+ E + + E
Sbjct: 177 IRKEVLSLKIIKELDEENHMHIQGEVALEENIRKLKKELDMHAKNEVT 224
>gi|346973208|gb|EGY16660.1| hypothetical protein VDAG_07824 [Verticillium dahliae VdLs.17]
Length = 949
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 134/294 (45%), Gaps = 63/294 (21%)
Query: 77 ERQHLLQREAMAILEETVKHLRN------ERESHIQKEATLEGTVQQLQNECDLYKEKVQ 130
E Q+ + +A + L KH R+ E + Q +A LE V +L+ + LYK+ +
Sbjct: 351 EHQYQMVDDAQSSLTTHSKHSRSPSKGLQEVQGLQQDKADLEAKVSELEKDLTLYKD-FK 409
Query: 131 ATLEETIQQLQ---RQNDLRMQKEA----TLEETIKQLR-NQNDLHIQREGGLEMNIANL 182
LE +LQ +Q + R + ++ + E ++ LR Q+ HI R LE++ +L
Sbjct: 410 VCLEAKCDELQDEIKQQEERFKTDSEKLRSDVEDVEDLRKTQDAAHIARIVSLELSKESL 469
Query: 183 QSEKEFWLQKEAALEQKIS--------------QLRDESAALNMKRASLEERLKLLEADK 228
Q+ K ALE +I+ QL + A ++ +LE R+++LE++K
Sbjct: 470 QA-------KAQALEAEIARKDFEYGEFTTERDQLSKDKDAWVEEKKNLESRVEVLESEK 522
Query: 229 DSWTQMESVSKETIAGLSVDITQLRMQVVELEES--------------RNNLLQENRQLK 274
+ +ETI GL + ++ VVELE++ ++ ++ ++ +LK
Sbjct: 523 SKLEEQTKALEETIKGLETERDSIQAHVVELEKTNGELQSKVGGLEDDKSTIVSKSEKLK 582
Query: 275 ENV-------SSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIEL 321
V +LR+Q+SS +Q D ++Q+ A D L ENI L
Sbjct: 583 TKVDRLKGENGNLRTQVSS------LLKDQDDHTSQVLTLSAERDGLKEENITL 630
>gi|449688203|ref|XP_002167031.2| PREDICTED: flagellar attachment zone protein 1-like [Hydra
magnipapillata]
Length = 751
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 148/329 (44%), Gaps = 21/329 (6%)
Query: 69 ESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEK 128
ES N E E + E LEE+V L +E + + LE +V +L+ E ++E
Sbjct: 250 ESVNKLEGENKKF--EENNKRLEESVNKLESENKKFEENNKKLEESVHKLEGENKKFEEN 307
Query: 129 VQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEF 188
LEE++ +L+ +N + LEE++ +L +N + L+ ++ L+
Sbjct: 308 -NKKLEESVHKLEGENKKFEENNKKLEESVHKLEGENKKFEENNNRLQESVNKLE----- 361
Query: 189 WLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVD 248
+ LE ++QL + + L LE K E + + + +E A L +
Sbjct: 362 --ENNNRLETSVNQLEENNKRLEDSIDKLEGENKKFEENNKKLEEENNRFQENNAKLESE 419
Query: 249 ITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLS--SDESKKHATSEQK------DF 300
+ +L ++ LEE N + N LK NVS L +++ S +K++ + +K
Sbjct: 420 VNRLGGEINRLEEQNNKFAENNENLKNNVSKLEGEVTKMSHLNKEYEINNKKLEDNVNKL 479
Query: 301 STQIEAAGALIDKLITENIELVEKVNDLS---VKLDRQSVAAGLSSAIGSDAVTALTSDT 357
++ G K N +L E+VN L+ K+ ++ ++ S VT ++ +
Sbjct: 480 EGELHKMGEENKKFEVNNKKLEEQVNQLTSEVAKMTEENNKFSENNNKLSQEVTKISKEN 539
Query: 358 EPMSESSDNMSSLNNRLETQGVVAVKEDR 386
E M+E++ N++ RL+ + V+E+R
Sbjct: 540 EKMAENNKNLADEIARLKAEIDRMVEENR 568
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 84/171 (49%), Gaps = 1/171 (0%)
Query: 111 LEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQ 170
LE +V +L+ E ++E LEE++ +L+ +N + LEE++ +L +N +
Sbjct: 185 LEASVNKLEGENKKFEEN-NKRLEESVNKLEGENKKFEENNKRLEESVNKLEGENKKFEE 243
Query: 171 REGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDS 230
LE ++ L+ E + + + LE+ +++L E+ LEE + LE +
Sbjct: 244 NNKRLEESVNKLEGENKKFEENNKRLEESVNKLESENKKFEENNKKLEESVHKLEGENKK 303
Query: 231 WTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLR 281
+ + +E++ L + + +LEES + L EN++ +EN + L+
Sbjct: 304 FEENNKKLEESVHKLEGENKKFEENNKKLEESVHKLEGENKKFEENNNRLQ 354
>gi|302423820|ref|XP_003009740.1| predicted protein [Verticillium albo-atrum VaMs.102]
gi|261352886|gb|EEY15314.1| predicted protein [Verticillium albo-atrum VaMs.102]
Length = 923
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 75/294 (25%), Positives = 132/294 (44%), Gaps = 63/294 (21%)
Query: 77 ERQHLLQREAMAILEETVKHLRN------ERESHIQKEATLEGTVQQLQNECDLYKEKVQ 130
E Q+ + +A + L KH R+ E + Q +A L+ V +L+ + LYK+ +
Sbjct: 352 EHQYQMVDDAQSSLTTHSKHSRSASKGLQEVQGLQQDKADLQAKVSELEKDLTLYKD-FK 410
Query: 131 ATLEETIQQLQ---RQNDLRMQKEA----TLEETIKQLR-NQNDLHIQREGGLEMNIANL 182
LE +LQ +Q + R + ++ + E ++ LR Q+ HI R LE++ +L
Sbjct: 411 VCLEAKCDELQDEIKQQEERFKTDSEKFRSDVEDVEDLRKTQDAAHIARIVSLELSKESL 470
Query: 183 QSEKEFWLQKEAALEQKIS--------------QLRDESAALNMKRASLEERLKLLEADK 228
Q+ K ALE +I+ QL + A ++ LE R+++LE++K
Sbjct: 471 QA-------KAQALEAEIARKDFEYGEFTTERDQLSKDKDAWVEEKKDLESRVEVLESEK 523
Query: 229 DSWTQMESVSKETIAGLSVDITQLRMQVVELEES--------------RNNLLQENRQLK 274
+ +ETI GL + ++ VVELE + ++ ++ ++ +LK
Sbjct: 524 SKLEEQTKALEETIKGLETERDSIQAHVVELENTNGELQSKVGGLEDDKSTIVSKSEKLK 583
Query: 275 ENV-------SSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIEL 321
V +LR+Q+SS +Q D ++Q+ A D L ENI L
Sbjct: 584 TKVDRLKGENGNLRTQVSS------LLKDQDDHTSQVLTLSAERDGLKEENITL 631
>gi|358395257|gb|EHK44644.1| hypothetical protein TRIATDRAFT_220313 [Trichoderma atroviride IMI
206040]
Length = 2049
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 67/127 (52%), Gaps = 5/127 (3%)
Query: 165 NDLHIQREGGLEMNIANLQSEKEFWLQKEAALE---QKISQLRDESAALNMKRASLEERL 221
N+L+ RE G+ + N Q +K+ L+K++ +E Q I L E + +++ LEE +
Sbjct: 1296 NELNTYRESGMALRTENSQLKKQI-LEKDSKIEEMVQSIQPLEAEIENIKTQKSFLEEEI 1354
Query: 222 KLLEADKDSW-TQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSL 280
K ++AD+D W + E + + ++ QL+ + +LE RN L + L+ ++ L
Sbjct: 1355 KQIQADRDRWQKRTEGILTKYGRVDPAEMEQLKQTITDLETERNALKESAEPLQAKITEL 1414
Query: 281 RSQLSSD 287
S L ++
Sbjct: 1415 ESTLETE 1421
>gi|322710763|gb|EFZ02337.1| filament-forming protein [Metarhizium anisopliae ARSEF 23]
Length = 2030
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 69/142 (48%), Gaps = 6/142 (4%)
Query: 196 LEQKISQLRDESAALNMKRASLEERLKLLEADKDSW-TQMESVSKETIAGLSVDITQLRM 254
+E K+ L E LNM+++ LEE +K ++ D+D W + E + + ++ QL+
Sbjct: 1331 MEGKVQPLEAEIDNLNMQKSYLEEEIKQIQEDRDRWQKRTEGILTKYGRVDPAEMEQLKK 1390
Query: 255 QVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKL 314
+ +LE R+ L Q + L+E ++ L S L S+ T + + Q + KL
Sbjct: 1391 TISDLEAERDCLKQGEQPLREKITELDSTLESERQAWQTT--RAKLTEQFKERSR---KL 1445
Query: 315 ITENIELVEKVNDLSVKLDRQS 336
E E ++ N L +LD+ S
Sbjct: 1446 TGEKNEAAQRANHLQEQLDKAS 1467
>gi|390337002|ref|XP_797348.3| PREDICTED: uncharacterized protein LOC592748 [Strongylocentrotus
purpuratus]
Length = 1201
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 76/149 (51%), Gaps = 19/149 (12%)
Query: 122 CDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEE-TIKQLRNQNDLHIQREGGLEMNIA 180
C+ +K+ +A + ++Q Q +N R ++ + + T KQ+ + + ++ E L+ A
Sbjct: 501 CEDFKQSERA-FRDQLKQHQAENGKRWEEAQRMTQLTNKQMESMGEDCLKAERQLKKVQA 559
Query: 181 NLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKE 240
L SE+E W Q+E LEQK+S L E A AS +E +LL+ + K
Sbjct: 560 TLASEREEWKQREVELEQKVSSLEMERA------ASFDEMQRLLQEE-----------KR 602
Query: 241 TIAGLSVDITQLRMQVVELEESRNNLLQE 269
T+ L+ D+ ++R + E + ++N +E
Sbjct: 603 TVDELNKDLIKMRREADESRKDKDNAHRE 631
>gi|313672426|ref|YP_004050537.1| chromosome segregation protein smc [Calditerrivibrio nitroreducens
DSM 19672]
gi|312939182|gb|ADR18374.1| chromosome segregation protein SMC [Calditerrivibrio nitroreducens
DSM 19672]
Length = 1118
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 92/179 (51%), Gaps = 25/179 (13%)
Query: 82 LQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQAT------LEE 135
L RE++ L++ +K + + S I A LE T+++L N+ D YKE+V + E
Sbjct: 340 LMRESIVKLQDEIKSVVETKRSEI---AELENTIEELVNQRDEYKEEVDVNRRRFLDITE 396
Query: 136 TIQQLQRQNDLRMQKEAT-LEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEA 194
I ++ R N ++ + E L + I +L N+ + +E IA+L+ +K+ + ++
Sbjct: 397 KISKI-RSNIIKHESEMNRLNKDISRLENEKSNILGEVSKIESLIADLKIKKDIFFNEQM 455
Query: 195 ALE--------------QKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSK 239
++ +++ L D+ +AL + +++ E++LKL+EA+ + E +S+
Sbjct: 456 VIQDSYTREKEIVNKIDEELKSLSDKLSALKVTKSTREQQLKLIEAEVKKQSYGEELSR 514
>gi|123407834|ref|XP_001303085.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884435|gb|EAX90155.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 1165
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 106/210 (50%), Gaps = 13/210 (6%)
Query: 85 EAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKE--KVQATLEETIQQLQR 142
E+++ILE T K+ + E I K L T ++L+ +L +E K L++ +L++
Sbjct: 64 ESISILESTKKYTKIEFVDGIIKILYL--TNKELKRYANLVREIMKENNKLKKENSELKK 121
Query: 143 QNDLRMQKEATLEETIKQLRNQNDLHIQREGGLE-------MNIANLQSEKEFWLQKEAA 195
ND ++ L+E +L+ +ND G L+ + I NL+ ++
Sbjct: 122 TNDGLKKENDELKEENDRLKKENDALKNGCGVLKEENEKSNIEINNLKKTNHDLKEENDR 181
Query: 196 LEQKISQLRDESAALNMKRASLE-ERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRM 254
+++I+ L+ E L ++ +L+ ER ++L+ D + + KE G +I +L+
Sbjct: 182 FKKEINDLKKECHTLKIESNNLKNERTRMLKEINDLKKETNDLIKEN-NGFKTEIIELKK 240
Query: 255 QVVELEESRNNLLQENRQLKENVSSLRSQL 284
Q E+EE+ + ++QEN LK+ SL+ Q+
Sbjct: 241 QQREVEENYDRIIQENINLKKENESLKKQI 270
>gi|123162154|ref|XP_001278994.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121827325|gb|EAX66064.1| hypothetical protein TVAG_538710 [Trichomonas vaginalis G3]
Length = 338
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 67/116 (57%), Gaps = 1/116 (0%)
Query: 90 LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
L + VK L+++ + +K+ LEG +Q+L+++ D +++ LE IQ+L+ ++ + +
Sbjct: 3 LADKVKGLQDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEGAIQELKDKDAAQDE 61
Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRD 205
K+ LE I++L++++ +++ LE I L+ + +K+ A E+ I L D
Sbjct: 62 KDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAAQDEKDKANEEAIKSLAD 117
>gi|365986334|ref|XP_003669999.1| hypothetical protein NDAI_0D04420 [Naumovozyma dairenensis CBS 421]
gi|343768768|emb|CCD24756.1| hypothetical protein NDAI_0D04420 [Naumovozyma dairenensis CBS 421]
Length = 1914
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 99/194 (51%), Gaps = 30/194 (15%)
Query: 45 SQASEPADIQNVQVDADRHQSNGAESANLAEAERQHLLQ-----REAMAILEETVKHLRN 99
+++SE N +++A + N E AE R++LL+ +E ++ LE T+K L +
Sbjct: 1211 TESSEKIKTLNAELEALTEKLNQNEME--AEKLRENLLKDQNASKEEISALENTLKELHS 1268
Query: 100 ERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIK 159
E+E+ +++++ LE ++ L+ E D K+K + +E +Q+++ N+ +A +E+
Sbjct: 1269 EKETQMEEKSKLESDIKSLETELDETKKKHASEIESLHEQIEKINEETKDFKAIIEKAAS 1328
Query: 160 QL-RNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQL-RDESAALNMKRAS- 216
L R ++DL + +E + + E L KI ++ E A L K+ S
Sbjct: 1329 NLKRTESDLKVLKEK---------------YTKNEEKL--KIMEVASKEHAELKKKQESE 1371
Query: 217 ---LEERLKLLEAD 227
LE RLK+LE D
Sbjct: 1372 YELLENRLKMLEED 1385
>gi|361127712|gb|EHK99672.1| putative Myosin type-2 heavy chain 1 [Glarea lozoyensis 74030]
Length = 1971
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 67/122 (54%), Gaps = 12/122 (9%)
Query: 218 EERLKLLEADKDSWTQMESVSK--ETIAGLSVD----ITQLRMQVVELEESRNNLLQENR 271
EER +L E ++++T+M+ V + E+ L++D +L+++ EL + + L +
Sbjct: 520 EERQRLDEERRNTYTEMQRVQQTLESERALALDKEEIFKRLQLRETELSDKLSGALDDQE 579
Query: 272 QLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVK 331
+L++ + L E+KK A + + TQ+E AG +I KL E EL E+++ L +
Sbjct: 580 KLEDQLDDLL------EAKKKAEQNSEQWRTQLEQAGQIIAKLELEKRELSERLDSLEAQ 633
Query: 332 LD 333
L+
Sbjct: 634 LE 635
>gi|383852228|ref|XP_003701630.1| PREDICTED: major antigen-like [Megachile rotundata]
Length = 2226
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 135/276 (48%), Gaps = 29/276 (10%)
Query: 110 TLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHI 169
+L V +L+N+ D+ + K+Q LE + +LQ++ND LE+ + LR +
Sbjct: 1007 SLNDEVSKLRNQLDIAERKIQ-ELEPLVDRLQKENDKLQNDLKALEDDARNLR------L 1059
Query: 170 QREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKD 229
+ +GG+ N ++++ + L +K+ + E+ AL + +L R KLL+ D +
Sbjct: 1060 RLDGGMSDN-ERMRNDMAMLESQVGDLNEKLKAAKAENDALQQENQTL--RAKLLDMDNE 1116
Query: 230 -SWTQMESVS-KETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSD 287
S + E K IA L+ I++LR ++ +LEE + EN +L+ + L++ L
Sbjct: 1117 LSQAKQECADLKAEIADLNNLISELRAKIAKLEEDVEHWKLENCKLQIEIDKLKADL--- 1173
Query: 288 ESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGL-SSAIG 346
+ A + + +A A +++L E EL +++ L+ +++ Q AA L SA G
Sbjct: 1174 ---EKALKDLSECQALKKAQEAELNRLQNEKAELNKQIAGLTAQIEEQKKAAELEKSAKG 1230
Query: 347 S---------DAVTALTSDTEPMS-ESSDNMSSLNN 372
D + AL + E + E++D + ++N
Sbjct: 1231 ESEAKLKALQDELNALKKELEKLRMENNDYKNEMDN 1266
>gi|363750548|ref|XP_003645491.1| hypothetical protein Ecym_3173 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889125|gb|AET38674.1| Hypothetical protein Ecym_3173 [Eremothecium cymbalariae
DBVPG#7215]
Length = 839
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 70/153 (45%), Gaps = 24/153 (15%)
Query: 191 QKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDIT 250
QK AA + + L E+ +L +K A +E L DK + +E + L+VDI
Sbjct: 564 QKFAAQHEVVKLLTQETESLKLKLAGCQESLSATNKDKQETLEKSQKLQEHLHKLNVDIV 623
Query: 251 QLRMQVVELEESRNNLL--------------QENRQLKENVSSLRSQLSSDESKKHATSE 296
QL+ +ELEE L E RQLKE V ++ D ++ +E
Sbjct: 624 QLKAHELELEEENRKLTNNIELIESTNKEEDYELRQLKEQVKTM------DLERRGFVTE 677
Query: 297 QKDFSTQIEAAGALIDKLITENIELVEKVNDLS 329
+ D+ +IE +++K I + + + K+ DLS
Sbjct: 678 RLDYQDRIE----MLEKQIMQLTKQLTKITDLS 706
>gi|322693568|gb|EFY85424.1| filament-forming protein [Metarhizium acridum CQMa 102]
Length = 2026
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 68/142 (47%), Gaps = 6/142 (4%)
Query: 196 LEQKISQLRDESAALNMKRASLEERLKLLEADKDSW-TQMESVSKETIAGLSVDITQLRM 254
+E K+ L E L M+++ LEE +K ++ D+D W + E + + ++ QL+
Sbjct: 1331 MEGKVQPLEVEIDNLKMQKSYLEEEIKQIQEDRDRWQKRTEGILTKYGRVDPAEMEQLKK 1390
Query: 255 QVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKL 314
+ +LE R+ L Q + L+E ++ L S L S+ T + + Q + KL
Sbjct: 1391 TISDLEAERDCLKQGEQPLREKITELESTLESERQAWQTT--RAKLTEQFKERSR---KL 1445
Query: 315 ITENIELVEKVNDLSVKLDRQS 336
E E ++ N L +LD+ S
Sbjct: 1446 TGEKNEAAQRANHLQEQLDKAS 1467
>gi|348680404|gb|EGZ20220.1| hypothetical protein PHYSODRAFT_489768 [Phytophthora sojae]
Length = 615
Score = 38.5 bits (88), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 43/77 (55%)
Query: 197 EQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQV 256
E+K+S L DE AL++K+A LE+RL+ L +KD + + + ++ L ++
Sbjct: 295 EEKLSALLDEGQALSVKQAQLEQRLRALRKEKDELEERALKLQSQFEASAEEVKDLTTKL 354
Query: 257 VELEESRNNLLQENRQL 273
EE + L QENRQL
Sbjct: 355 KASEEEKTRLAQENRQL 371
>gi|342319886|gb|EGU11831.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1301
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 129/275 (46%), Gaps = 47/275 (17%)
Query: 52 DIQNVQVDADRHQSNGAESANLA------EAERQHLLQREAMAI-LEETVKHLRNERESH 104
D++ ++D D S A A LA E +Q + Q E ++ L T K L +ES
Sbjct: 760 DVEIAELDRDLQASESAYEALLASDREGQEQRQQAVRQVEKLSGELAATNKELLRAQESL 819
Query: 105 IQKEAT----LEGTVQQLQNE-----CDLYKEKVQAT-LEETIQQLQRQNDLRMQKEATL 154
++ T LE ++QL+ E DL + ++T LE ++ LQ+Q +L Q++A
Sbjct: 820 AAEQRTEKGKLEKEIEQLKAELDKALLDLSTVRTRSTELEADLEGLQKQAELVGQEKAVS 879
Query: 155 EETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAAL----EQKISQLRDESAAL 210
E +++ ++N G L + LQ+ + +E AL +Q I+ + + AAL
Sbjct: 880 EAALREASSRNQQLANENGKLREEVVALQARLQATEAEEKALTQADQQAITAMNGDLAAL 939
Query: 211 NMKRASLEERLKLL---------------EADKD--SW-TQMESVSKETIAGLSVDI--- 249
+ L++ L+ L EA K+ +W Q+ES + IAGL D+
Sbjct: 940 AGENEDLQDVLEELRPLAEKYVDHEDLECEAVKEMQTWRAQVESRDAK-IAGLEGDLAAQ 998
Query: 250 ----TQLRMQVVELEESRNNLLQENRQLKENVSSL 280
T+L+ + E E RN LLQEN L+++ +L
Sbjct: 999 LCVDTELQTRAEEAESERNELLQENAALEQDKVAL 1033
>gi|111185907|ref|NP_079552.3| envoplakin [Mus musculus]
gi|341941103|sp|Q9D952.3|EVPL_MOUSE RecName: Full=Envoplakin; AltName: Full=210 kDa cornified envelope
precursor protein; AltName: Full=p210
gi|162318566|gb|AAI56423.1| Envoplakin [synthetic construct]
gi|225000372|gb|AAI72666.1| Envoplakin [synthetic construct]
Length = 2035
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 74/254 (29%), Positives = 120/254 (47%), Gaps = 48/254 (18%)
Query: 89 ILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRM 148
I+EE VK ++K+ LE + + L+ E D K +V L Q LQ Q DL +
Sbjct: 1462 IMEEVVK---------LEKDPDLERSTEALRRELDQEKNRV-TELHRECQGLQVQVDL-L 1510
Query: 149 QKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFW--LQKEAALEQKISQLRDE 206
QK + E+TI + + ++++ LE A + W L +E A + R+E
Sbjct: 1511 QKTKSQEKTI----YKEVIRVEKDPVLEGERARV------WEILNRERAARKG----REE 1556
Query: 207 SAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNL 266
SL+ER+ EA + SW++ E+ + S D +L+ Q+ ELE+ +
Sbjct: 1557 DVR------SLQERIDRAEALRRSWSREEAELQRARDQASQDCGRLQRQLRELEQQKQ-- 1608
Query: 267 LQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEK-- 324
Q+ RQL+E L SQ + E +K A Q TQ+EAA ++ E ++ EK
Sbjct: 1609 -QKARQLQEE-GRLLSQKTESERQKAAQRSQA--VTQLEAA------ILQEKDKIYEKER 1658
Query: 325 -VNDLSVKLDRQSV 337
+ DL K+ R+ +
Sbjct: 1659 TLRDLHTKVSREEL 1672
>gi|213410371|ref|XP_002175955.1| spindle assembly checkpoint component mad1 [Schizosaccharomyces
japonicus yFS275]
gi|212004002|gb|EEB09662.1| spindle assembly checkpoint component mad1 [Schizosaccharomyces
japonicus yFS275]
Length = 699
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 146/330 (44%), Gaps = 68/330 (20%)
Query: 72 NLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEA--------TLEGTVQQLQNECD 123
N+A E++ + E +LE+ +K L NER+ +Q + L TV L+ C+
Sbjct: 161 NVAAMEKKVISSEEQYTLLEKQLK-LTNERKEELQTKYQVVVEECDKLRDTVTSLEEACN 219
Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
L K Q T E+I+ LQ QN+ K +LE+ + ++ + + N L+
Sbjct: 220 LQSVKAQDT--ESIKALQIQNEQLQTKLNSLEKLV----DRQSATLSSNALEKHNFKLLE 273
Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLK-----LLEADKD--------S 230
EK+ L K + L+ RD+ A L +K LE +L+ L E ++ S
Sbjct: 274 EEKKSLLTKLSVLD----GFRDKVATLELKNNELEGKLRPYLELLGETKREPHDILHELS 329
Query: 231 WTQMESVS--------KETIAGLSVDIT--------------------QLRMQVVELEES 262
+ME+ S ET+A L ++ +L MQ+ L
Sbjct: 330 ALEMENKSLREESNRLTETVAKLKTELAGANSVPELEEEITSLNETQRELAMQLRRLTLQ 389
Query: 263 RNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALID--KLITENIE 320
++ L+E L+EN+ S S +ES + K + +I++ LID K EN+
Sbjct: 390 KDLALREVHLLRENLKS----YSEEESVLSPETYDKKKTERIDSLTKLIDDYKSTLENVS 445
Query: 321 LVEKVNDLSVKLDRQSVAAGLSSAIGSDAV 350
+ +V D+ VK R+S+ GLS + SD++
Sbjct: 446 IKPEVMDVPVKRKRESL--GLSRSNFSDSL 473
>gi|123134363|ref|XP_001277101.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121820009|gb|EAX64171.1| hypothetical protein TVAG_574510 [Trichomonas vaginalis G3]
Length = 247
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 67/116 (57%), Gaps = 1/116 (0%)
Query: 90 LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
L + VK L+++ + +K+ LEG +Q+L+++ D +++ LE IQ+L+ ++ + +
Sbjct: 3 LADKVKGLQDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEGAIQELKDKDAAQDE 61
Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRD 205
K+ LE I++L++++ +++ LE I L+ + +K+ A E+ I L D
Sbjct: 62 KDKNLEGAIQELKDKDAAQDKKDKNLEGAIQELKDKDAAQDEKDKANEEAIKSLAD 117
>gi|14270509|emb|CAC38864.2| envoplakin [Mus musculus]
Length = 2035
Score = 38.1 bits (87), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 119/254 (46%), Gaps = 48/254 (18%)
Query: 89 ILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRM 148
I+EE VK ++K+ LE + + L+ E D K +V L Q LQ Q DL +
Sbjct: 1462 IMEEVVK---------LEKDPDLERSTEALRRELDQEKNRV-TELHRECQGLQVQVDL-L 1510
Query: 149 QKEATLEETIKQ--LRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDE 206
QK + E+TI + +R + D ++ E I N ++ AA + + +R
Sbjct: 1511 QKTKSQEKTIYKEVIRVEKDPVLEGERARVWEILN---------RERAARKGREEDVR-- 1559
Query: 207 SAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNL 266
SL+ER+ EA + SW++ E+ + S D +L+ Q+ ELE+ +
Sbjct: 1560 ---------SLQERIDRAEALRRSWSREEAELQRARDQASQDCGRLQRQLRELEQQKQ-- 1608
Query: 267 LQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEK-- 324
Q+ RQL+E L SQ + E +K A Q TQ+EAA ++ E ++ EK
Sbjct: 1609 -QKARQLQEE-GRLLSQKTESERQKAAQWSQA--VTQLEAA------ILQEKDKIYEKER 1658
Query: 325 -VNDLSVKLDRQSV 337
+ DL K+ R+ +
Sbjct: 1659 TLRDLHTKVSREEL 1672
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.306 0.122 0.314
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,366,948,440
Number of Sequences: 23463169
Number of extensions: 249752456
Number of successful extensions: 1399276
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1799
Number of HSP's successfully gapped in prelim test: 57384
Number of HSP's that attempted gapping in prelim test: 1182718
Number of HSP's gapped (non-prelim): 174388
length of query: 474
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 328
effective length of database: 8,933,572,693
effective search space: 2930211843304
effective search space used: 2930211843304
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 79 (35.0 bits)