BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011943
         (474 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224087896|ref|XP_002308260.1| predicted protein [Populus trichocarpa]
 gi|222854236|gb|EEE91783.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 238/477 (49%), Positives = 323/477 (67%), Gaps = 50/477 (10%)

Query: 20  KTTTDDVAVGADPNHLINGQK-----DDVRSQASEPADIQNVQVDADRH--QSNGAESAN 72
           K + DDV+V A  N+  NGQ      D+   + S   D+ + + D + H  + NGA   +
Sbjct: 25  KASEDDVSVAAATNNNSNGQNHGNVNDNQVIEVSSNGDVVDAE-DFNGHYDKPNGAAPHS 83

Query: 73  LAEAERQHLLQREAMAI------LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYK 126
            A  E    LQ E  ++       EET+K LR+E +SHIQKEATLE TV+QLQNE   + 
Sbjct: 84  AALQETIKHLQNETDSLTRTKDTFEETIKRLRDENDSHIQKEATLEETVKQLQNESASHT 143

Query: 127 EKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEK 186
           +K +A+LE+TI QL+  N+L +QKEAT E+TIKQL+ +ND H+Q+E  LE  I  LQSEK
Sbjct: 144 QK-EASLEDTINQLRSVNNLCIQKEATFEDTIKQLKTENDSHLQKEADLEKRIVQLQSEK 202

Query: 187 EFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLS 246
           +FWLQKEA   +K++ L+DE AAL    AS+ E+++LLE+DKDSWT  E+ +KETIA ++
Sbjct: 203 DFWLQKEAGFGEKLNHLQDEKAAL----ASIGEKIRLLESDKDSWTISENTTKETIARMN 258

Query: 247 VDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSS-DESKKHATS--------EQ 297
           +D+T+LRMQVVELE+SRN+L++EN+QLKE++S+L+ QL + D S   A +        E+
Sbjct: 259 IDVTRLRMQVVELEDSRNSLVKENQQLKESISNLKLQLQNIDTSVSFANTSELGKLGAEK 318

Query: 298 KDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAV---TALT 354
           ++ ++QIEAA AL+DKLITEN +LVEKVN+L +KLD Q  AA  SSA G   +   + L 
Sbjct: 319 EELNSQIEAACALVDKLITENADLVEKVNELYIKLDHQGTAASFSSATGRGVIVRNSELA 378

Query: 355 SDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQ 414
           + T PM++S+ N+++L ++LE+               +  +P  +V  SSEA  SGEIVQ
Sbjct: 379 NGTHPMADSNANLTALGHKLES---------------LEVEPAVVVQYSSEA-GSGEIVQ 422

Query: 415 IPLDDKEVQDLELQVVESYTD-KVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           IPLDD EV DLE+Q  E  TD K  AVPLTDAPLIGAPFRL+SFVAKYVSG DLVN+
Sbjct: 423 IPLDDNEVPDLEMQAAE--TDYKSGAVPLTDAPLIGAPFRLISFVAKYVSGGDLVNR 477


>gi|255575616|ref|XP_002528708.1| conserved hypothetical protein [Ricinus communis]
 gi|223531880|gb|EEF33697.1| conserved hypothetical protein [Ricinus communis]
          Length = 492

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 243/493 (49%), Positives = 321/493 (65%), Gaps = 67/493 (13%)

Query: 18  QTKTTTDDVAVGADPNHLINGQKDDVRSQASEPADIQN------VQVDA-DRHQSNGAES 70
           Q+KTT DD       +H  NG   +  +Q ++ A+IQN        VD  DRHQ+ G E 
Sbjct: 22  QSKTTEDDAVFVGGVDHQ-NGASSNGNNQVTDLAEIQNGDVQNAANVDGFDRHQNTGTEI 80

Query: 71  ANLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQ 130
           +   EAE+Q   QREA  I EET+K L+NE + HIQKEA L  ++++LQ E D+ + K Q
Sbjct: 81  SISVEAEKQQWFQREA--IFEETIKRLQNENDLHIQKEAILRKSIKELQEENDIQRRK-Q 137

Query: 131 ATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQRE------------------ 172
             LEET+++LQ  ND  +QKEATLEETIK+LRN ND  +Q+E                  
Sbjct: 138 VMLEETVKELQNANDSALQKEATLEETIKELRNTNDSALQKEVKLEKTIKELRSAKDSAL 197

Query: 173 ---GGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKD 229
               GL+  IA LQSE E +LQ +A LE+K+    +E+A+L +K ASLEE++K LE++KD
Sbjct: 198 QKEAGLKKKIAQLQSENELFLQTKAGLEEKLQLSLNENASLGLKEASLEEKIKQLESEKD 257

Query: 230 SWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQL----- 284
           SWT  E+ +KETIA ++VD+T+LRMQVVELEESRN+LL+EN+QL EN+S L+ Q+     
Sbjct: 258 SWTLAENTTKETIARMNVDVTRLRMQVVELEESRNSLLKENQQLMENISDLQLQVQNLET 317

Query: 285 ------SSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVA 338
                 +S E KKHA  E +D  +QIEAA AL+DKLITEN ELVEKVN+L +KLD+QS  
Sbjct: 318 NISSASTSCEQKKHAL-EGEDLESQIEAACALVDKLITENAELVEKVNELYIKLDQQSTT 376

Query: 339 AGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPP 398
           A  SS  G+  +T + +D+ P  +S +NMS + ++LE+   + V+          AD   
Sbjct: 377 ATASS--GNAEITYI-ADSIP--QSDENMSVVVHKLES---LEVEHAAALPYAAEAD--- 425

Query: 399 LVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFV 458
                     SGEIVQIPLDD E++DLELQ VES  DK  AVP++DAPLIGAPFRL+SFV
Sbjct: 426 ----------SGEIVQIPLDDNELRDLELQAVES--DKTEAVPISDAPLIGAPFRLISFV 473

Query: 459 AKYVSGADLVNKN 471
           AKYVSGADLV+K+
Sbjct: 474 AKYVSGADLVDKD 486


>gi|224139390|ref|XP_002323089.1| predicted protein [Populus trichocarpa]
 gi|222867719|gb|EEF04850.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 218/437 (49%), Positives = 286/437 (65%), Gaps = 60/437 (13%)

Query: 52  DIQNVQVDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATL 111
           D+++     D+      +S+ LAEAE+Q  LQREA    EET+K L +E +SHIQKE T 
Sbjct: 66  DVEDFNTHNDKPNGVAPQSSILAEAEKQWWLQREAT--FEETIKQLEDENDSHIQKEVTF 123

Query: 112 EGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQR 171
           E                      ETI+QLQ +ND  +QKEAT E+TIKQL+ +NDLH+Q+
Sbjct: 124 E----------------------ETIKQLQDENDSHIQKEATFEDTIKQLKTENDLHMQK 161

Query: 172 EGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSW 231
           E GLEM I  LQ EK+FW  KEA LE+K++ L DE AAL +K ASLEE++  L+++KDSW
Sbjct: 162 EAGLEMKIMQLQREKDFWFLKEAGLEEKLNLLLDEKAALGLKLASLEEKIGQLDSEKDSW 221

Query: 232 TQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKK 291
              E+ +KE +  +++DIT LRMQVVELE SRN+L++EN+QLKE+VS L+ QL + E+++
Sbjct: 222 AVSENTTKEIVGRMNIDITSLRMQVVELEYSRNSLVKENQQLKESVSDLKLQLQNVETQQ 281

Query: 292 HATS-----------EQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAG 340
             +S           E++D ++QIEAA AL+DKLITEN ELVEKVN+L +KLD+Q  AA 
Sbjct: 282 SISSANTSELGKNDAEKEDLNSQIEAACALVDKLITENAELVEKVNELYIKLDQQKAAAS 341

Query: 341 LSSAIGSDAV---TALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPP 397
           LSS+ GSD +   + L + T PMSESS++   L +                   + ADPP
Sbjct: 342 LSSSAGSDVILRNSELANGTPPMSESSESALGLKSE-----------------SLDADPP 384

Query: 398 PLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSF 457
             VL  S    + EIVQIPLDD EV D+E+Q      DK + VPLTDAPLIGAPFRL+SF
Sbjct: 385 AAVLPQSSEPDAEEIVQIPLDDNEVPDVEMQA----EDK-SGVPLTDAPLIGAPFRLISF 439

Query: 458 VAKYVSGADLVNKNASN 474
           VAKYVSGADLV+K+ SN
Sbjct: 440 VAKYVSGADLVSKDDSN 456


>gi|297741755|emb|CBI32887.3| unnamed protein product [Vitis vinifera]
          Length = 466

 Score =  321 bits (823), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 228/501 (45%), Positives = 317/501 (63%), Gaps = 67/501 (13%)

Query: 1   MEDEKKKKRNKKKKNNKQTKTTTDDVAVGA-----DPNHL---INGQKDDVRSQASEPAD 52
           MEDEKKKK+NKKKKN +   +    V VG      D NHL    +G ++ +       AD
Sbjct: 1   MEDEKKKKKNKKKKNKQHKASEGPAVGVGEEIASLDENHLSTQTHGNQESI-------AD 53

Query: 53  IQN--VQ---VDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQK 107
           +QN  VQ   VD + H +NG +S+ L EAE+Q  LQREA   L+E +K L N++E H QK
Sbjct: 54  VQNRDVQKENVDLNGHCANGTDSSILVEAEKQKWLQREAS--LQERIKQLENDKELHTQK 111

Query: 108 EATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDL 167
           E+ L         E D                 ++Q    +++EATLEE IKQL+++NDL
Sbjct: 112 ESILA--------EAD-----------------KKQ---WLKREATLEERIKQLQDENDL 143

Query: 168 HIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEAD 227
           H+Q+E GLEM I  LQ+EK  W+QKEA LE KI QL DE++ L++K ASLEE+++  E +
Sbjct: 144 HMQKEAGLEMRIVQLQTEKHSWIQKEATLEGKIQQLLDENSTLSVKWASLEEKVEHQERE 203

Query: 228 KDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQL--- 284
           ++SW   E+ S+E I+ L+ +  +L+ QV+ELEE R N+LQEN+ LKE VSSL+ Q+   
Sbjct: 204 RNSWVLKENSSREIISSLNDENRKLQGQVMELEEFRINILQENQLLKEKVSSLQLQIKEL 263

Query: 285 --------SSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQS 336
                   +S E  KHA S+  D + Q EAA  LI+KL +EN +LVEKVN+L ++LD+  
Sbjct: 264 EESVSSAHASTEITKHA-SKHVDLNPQTEAATVLIEKLTSENADLVEKVNELYIELDQLR 322

Query: 337 VAAGLSSAIGSD-AVTALTSD--TEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVH 393
           VAAGLSSAIG D  + AL +    + MSE +DN S+ + R++    V + ++R   +   
Sbjct: 323 VAAGLSSAIGLDKKIGALQNSNVADHMSEPTDNSSASSERMDPLEAVPMNDERISADIED 382

Query: 394 ADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFR 453
            +   ++ +SSE + SGEIVQIPLD+ E ++LELQ  E+  D+   VP+TDAPLIGAPFR
Sbjct: 383 VEQTAVIPNSSETDDSGEIVQIPLDENEARELELQAAEN--DRNTTVPITDAPLIGAPFR 440

Query: 454 LVSFVAKYVSGADLVNKNASN 474
            +SFVAKYVSGADLV K+++N
Sbjct: 441 FISFVAKYVSGADLVGKSSTN 461


>gi|356566804|ref|XP_003551617.1| PREDICTED: uncharacterized protein LOC100796148 [Glycine max]
          Length = 651

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 185/417 (44%), Positives = 261/417 (62%), Gaps = 18/417 (4%)

Query: 72  NLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQA 131
           NL E    H+ Q+E ++  +ET+K L+ E E HIQ+EA  + T++ L+ E D   +KV  
Sbjct: 236 NLKEKNDVHV-QKETLS--QETIKKLKEENEVHIQEEAISKETIKNLKEENDKLLQKV-V 291

Query: 132 TLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQ 191
           +LEE I  LQ  N+L+ QK  +LE  I  L N+     ++E GLE  IA LQSE    LQ
Sbjct: 292 SLEEVINNLQTDNELQTQKHESLERKINLLENELSSFSEKEVGLETRIAQLQSENNSLLQ 351

Query: 192 KEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQ 251
           KEA L ++ +QL +E A L++K  SLE+++ LLE+D +S  + E+ +KETI+ L+ +I  
Sbjct: 352 KEATLVERTNQLLNEKAVLSLKGESLEQKIYLLESDLNSLVKKENSTKETISNLNGNIAV 411

Query: 252 LRMQVVELEESRNNLLQENRQLKENVSSLRSQL-----------SSDESKKHATSEQKDF 300
           L+ QV ELEESRNNL  EN+QL+E VSSL+S +           S D S K   SE +D 
Sbjct: 412 LQAQVEELEESRNNLFLENQQLREKVSSLQSTVQNHENSNTSSCSWDASVKDLASENEDL 471

Query: 301 STQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTE-- 358
            ++IEAA  L++KL+ EN ELVEKV +L V+LD +S   G S    S+ +T     T   
Sbjct: 472 KSEIEAAFTLVEKLMAENAELVEKVTELCVELDHRSAEVGHSGVTESNGMTEFVKPTGVA 531

Query: 359 -PMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPL 417
            P+ ES++  S  + +L +    +VK++ N  +  H        S   A+ +GEIVQIPL
Sbjct: 532 IPLPESAEYASVSSPKLNSLEETSVKDNGNSNDAKHVVGVTSNSSLVSADDAGEIVQIPL 591

Query: 418 DDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNKNASN 474
           DD E+QD+ELQ  ++  +   AVP+TDAPLIGAPFRL+SFVA +VSGADLVN+++SN
Sbjct: 592 DDNEIQDIELQDAKNVENDADAVPITDAPLIGAPFRLISFVANFVSGADLVNQSSSN 648



 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 154/300 (51%), Gaps = 42/300 (14%)

Query: 30  ADPNHLINGQKDDVRSQASEPADIQNVQVDADRHQSNGAESANLAEAERQHLLQREAMAI 89
           +D N + +G+ +++ S   E AD Q+  +D++ H  NG E                  AI
Sbjct: 105 SDQNLVKSGKDENIPSL--EFADGQSTNMDSNGHPPNGKE-----------------CAI 145

Query: 90  LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
            EET++ L+ E + HIQKE               L KE        TI++L+ +ND+ +Q
Sbjct: 146 SEETIRKLKEENDIHIQKET--------------LSKE--------TIRKLKAENDMHIQ 183

Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
           KEA +EETI++L  QND+H+Q+E   E  I  L+ + +  +QKEA  E  I  L++++  
Sbjct: 184 KEAIMEETIRKLTEQNDMHMQKEIASEETIRKLKEKHDMDVQKEAISEDTIRNLKEKNDV 243

Query: 210 LNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQE 269
              K    +E +K L+ + +   Q E++SKETI  L  +  +L  +VV LEE  NNL  +
Sbjct: 244 HVQKETLSQETIKKLKEENEVHIQEEAISKETIKNLKEENDKLLQKVVSLEEVINNLQTD 303

Query: 270 NRQLKENVSSLRSQLSSDESKKHATSEQK-DFSTQIEAAGALIDKLITENIELVEKVNDL 328
           N    +   SL  +++  E++  + SE++    T+I    +  + L+ +   LVE+ N L
Sbjct: 304 NELQTQKHESLERKINLLENELSSFSEKEVGLETRIAQLQSENNSLLQKEATLVERTNQL 363


>gi|359481328|ref|XP_002279159.2| PREDICTED: uncharacterized protein LOC100249907 [Vitis vinifera]
          Length = 511

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 221/565 (39%), Positives = 300/565 (53%), Gaps = 150/565 (26%)

Query: 1   MEDEKKKKRNKKKKNNKQTKTTTDDVAVG-----ADPNHL---INGQKDDVRSQASEPAD 52
           MEDEKKKK+NKKKKN +   +    V VG      D NHL    +G ++ +       AD
Sbjct: 1   MEDEKKKKKNKKKKNKQHKASEGPAVGVGEEIASLDENHLSTQTHGNQESI-------AD 53

Query: 53  IQN--VQ---VDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQK 107
           +QN  VQ   VD + H +NG +S+ L EAE+Q  LQREA   L+E +K L N++E H QK
Sbjct: 54  VQNRDVQKENVDLNGHCANGTDSSILVEAEKQKWLQREAS--LQERIKQLENDKELHTQK 111

Query: 108 EATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDL 167
           E+ L         E D                 ++Q    +++EATLEE IKQL+++NDL
Sbjct: 112 ESILA--------EAD-----------------KKQ---WLKREATLEERIKQLQDENDL 143

Query: 168 HIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEAD 227
           H+Q+E GLEM I  LQ+EK  W+QKEA LE KI QL DE++ L++K              
Sbjct: 144 HMQKEAGLEMRIVQLQTEKHSWIQKEATLEGKIQQLLDENSTLSVK-------------- 189

Query: 228 KDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQL--- 284
              W      S+E I+ L+ +  +L+ QV+ELEE R N+LQEN+ LKE VSSL+ Q+   
Sbjct: 190 ---WNS----SREIISSLNDENRKLQGQVMELEEFRINILQENQLLKEKVSSLQLQIKEL 242

Query: 285 --------SSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQS 336
                   +S E  KHA S+  D + Q EAA  LI+KL +EN +LVEKVN+L ++LD+  
Sbjct: 243 EESVSSAHASTEITKHA-SKHVDLNPQTEAATVLIEKLTSENADLVEKVNELYIELDQLR 301

Query: 337 VAAGLSSAIGSD-AVTALTSD--TEPMSESSDNMSSLNNRLETQGVVAVK---------- 383
           VAAGLSSAIG D  + AL +    + MSE +DN S+ + R+E+   V +           
Sbjct: 302 VAAGLSSAIGLDKKIGALQNSNVADHMSEPTDNSSASSERMESVEAVPIHGQSSQEAVPM 361

Query: 384 EDRNGINGVHADP-------PPLVLSSSEA-----EYSGEIVQI---------PLDDK-- 420
            D N    V           P L  SS EA     + S E V I         P++D+  
Sbjct: 362 HDENSQEAVPIHDQRSLEAVPMLDESSQEAVPIHDQRSIEAVPINDKSPLEAVPMNDERI 421

Query: 421 --EVQDLELQVV---ESYTD---KVAAVPL-----------------------TDAPLIG 449
             +++D+E   V    S TD   ++  +PL                       TDAPLIG
Sbjct: 422 SADIEDVEQTAVIPNSSETDDSGEIVQIPLDENEARELELQAAENDRNTTVPITDAPLIG 481

Query: 450 APFRLVSFVAKYVSGADLVNKNASN 474
           APFR +SFVAKYVSGADLV K+++N
Sbjct: 482 APFRFISFVAKYVSGADLVGKSSTN 506


>gi|79570413|ref|NP_181392.2| ATP synthase D chain-related protein [Arabidopsis thaliana]
 gi|51968994|dbj|BAD43189.1| hypothetical protein [Arabidopsis thaliana]
 gi|330254458|gb|AEC09552.1| ATP synthase D chain-related protein [Arabidopsis thaliana]
          Length = 482

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 151/397 (38%), Positives = 240/397 (60%), Gaps = 29/397 (7%)

Query: 75  EAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLE 134
           E E +  +Q+EA+  LEE + HL+ E E+H+  EA LEG +  L+ E + + +  +A LE
Sbjct: 113 ETENEAHIQKEAL--LEERLVHLKTENEAHVHNEALLEGKLLHLRTENEDHIQN-EALLE 169

Query: 135 ETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEA 194
           E +  L+ +N+  +Q EA LEE +   R +N+ H Q E  LE  +   +++ +  L++ +
Sbjct: 170 EKLLHLRTENEAHIQNEALLEERLLHFRTENEAHKQNEEKLEERLVQYKNKNDMLLREMS 229

Query: 195 ALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRM 254
           + E ++ QL DE +    K ASLE++++ L+ D++S    E  S+E I+ L+ +I +LR 
Sbjct: 230 STEAQMRQLLDERSTFTQKEASLEKKVQQLQHDEESLVAEEKSSREMISSLNNEIARLRA 289

Query: 255 QVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKL 314
           QV ELE+S++NLL++N+ LKE +S+L+ Q  + +S     SE+ + ++QIEAA  L++KL
Sbjct: 290 QVTELEKSKSNLLEQNQSLKETISNLQVQHENHDSNAKGASEE-ELNSQIEAACTLVEKL 348

Query: 315 ITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRL 374
           ITEN +LVEKVN+L +KL++   A+  S AI  +   +L  +  P+    D +  ++N  
Sbjct: 349 ITENADLVEKVNELCIKLNQSQHASPESLAIEVEKSESL--EEIPI---HDELIRIDNSR 403

Query: 375 ETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYT 434
           +     ++K  RN   G   +  PL L++     +GE+           D+E QV  +  
Sbjct: 404 DMD-TASIK--RNFSEGEIEETVPLSLNA-----NGEV-----------DVESQVAVAGE 444

Query: 435 DKV-AAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           D++ A VPL DAPLIGAPFRLVSFVA+YVSGADL  K
Sbjct: 445 DEINAGVPLADAPLIGAPFRLVSFVARYVSGADLAAK 481


>gi|297823743|ref|XP_002879754.1| hypothetical protein ARALYDRAFT_345641 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325593|gb|EFH56013.1| hypothetical protein ARALYDRAFT_345641 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 509

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 160/405 (39%), Positives = 239/405 (59%), Gaps = 44/405 (10%)

Query: 82  LQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQ 141
           LQ+EA   LEE + HL+ E E+HIQ EA LE  +  L+ E + + +  +A LEE +  L+
Sbjct: 132 LQKEAF--LEERLVHLKTENEAHIQNEALLEEKLLHLRTENEAHTQN-EALLEEKLLHLR 188

Query: 142 RQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEM----NIANLQSEKE---------- 187
            +N+  +Q EA LEE +  LR++N+ H Q E  LE+    NI  +Q + E          
Sbjct: 189 TENEDHIQNEARLEERLLHLRSENEAHKQNEDHLEVGGQNNIVFMQEKLEERLVQYKNKN 248

Query: 188 -FWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLS 246
              L++ ++ E ++ QL DE +    K  SLE++++ L+ D++S    E  S+E I+ L+
Sbjct: 249 DMLLREMSSTEAQMRQLLDERSTFTQKEVSLEKKVQQLQHDEESLVAEEKSSREMISSLN 308

Query: 247 VDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEA 306
            +I +LR QV ELEES++NLL++N+ LKE VSSL+ Q  + +S     SE+ + ++QIEA
Sbjct: 309 NEIARLRAQVTELEESKSNLLEQNQSLKETVSSLQVQHENHDSNAKGASEE-ELNSQIEA 367

Query: 307 AGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDN 366
           A  L++KLITEN ELVEKVN+L +KL++   A+  S AI  +   +L  +  P+    D 
Sbjct: 368 ACTLVEKLITENAELVEKVNELCIKLNQSQHASPESLAIEVEKSESL--EEIPI---HDE 422

Query: 367 MSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLE 426
           +  ++N  +    +     RN   G   +  PL L++     +GE+           D+E
Sbjct: 423 LIRIDNSKDMDTALI---KRNLSEGEIEETVPLSLNA-----NGEV-----------DVE 463

Query: 427 LQVVESYTDKVAA-VPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
            QVV +  D+V+A VPL DAPLIGAPFRLVSFVA+YVSGADL  K
Sbjct: 464 SQVVVAGEDEVSAGVPLADAPLIGAPFRLVSFVARYVSGADLAAK 508


>gi|356530165|ref|XP_003533654.1| PREDICTED: uncharacterized protein LOC100811378 [Glycine max]
          Length = 639

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 151/439 (34%), Positives = 228/439 (51%), Gaps = 79/439 (17%)

Query: 72  NLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQA 131
           NL E    H+ Q+E ++  +ET++ L+ E E HIQKEA  + T+++ + E D   +K + 
Sbjct: 241 NLKEKNDMHM-QKETLS--QETIRKLKEENEGHIQKEAISKETIKKFEEENDKLVQK-ET 296

Query: 132 TLEETIQQLQRQNDLRMQKEATLE----------------------------ETIKQLRN 163
           +LE  I QLQ +N+  +QKE  ++                            E+++ L  
Sbjct: 297 SLEMRIAQLQSENNSLLQKEYIIDGILDLGPNPDHWAEQNKDQLLDPSPKEMESLEGLYQ 356

Query: 164 QNDL------HIQR--EGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRA 215
           +  L      +I R    GLE   A L SE    LQKEA L ++ +QL +E   L++K  
Sbjct: 357 KAALWRVMFSYIGRRLHVGLETRFAQLHSENNSLLQKEATLVERTNQLLNEKEVLSLKGE 416

Query: 216 SLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKE 275
           SLE+++ LLE+D  S  + E+ +K+TI+ L+ +I  L++Q +                  
Sbjct: 417 SLEQKIYLLESDLSSLVEKENSTKDTISKLNGNIAVLQVQDL------------------ 458

Query: 276 NVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQ 335
                              SE KD  ++IEAA  L++KL+ EN ELVEKV +L V+LD Q
Sbjct: 459 ------------------ASENKDLKSEIEAAFTLVEKLMAENAELVEKVTELCVELDHQ 500

Query: 336 SVAAGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHAD 395
           S     +        T +     P  ES++  S    +L +    +VK++ N  N     
Sbjct: 501 SAVTEPNEMTEFAKPTGVAI---PPPESAEYASVSAPKLNSLEETSVKDNGNSFNDAKHV 557

Query: 396 PPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLV 455
              +  SS  ++ +GEIVQIPLDD E+QD+ELQ  ++  +   AVP+TDAPLIGAPFRL+
Sbjct: 558 VGVMSNSSLLSDDAGEIVQIPLDDNEIQDIELQDAKNVENDADAVPITDAPLIGAPFRLI 617

Query: 456 SFVAKYVSGADLVNKNASN 474
           SFVA +VSGADLV+ ++SN
Sbjct: 618 SFVANFVSGADLVDPSSSN 636



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 73/120 (60%), Gaps = 5/120 (4%)

Query: 131 ATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWL 190
           A  EETI++L+ +ND+ +QKE   +ETI++L+ +ND+HIQ+E  +E  I  L  + +  +
Sbjct: 144 AISEETIRKLKEENDIHIQKETLSKETIRKLKAENDMHIQKEAIMEETIRKLTEQNDLHM 203

Query: 191 QKEA-----ALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGL 245
           QKEA     A E+ I +L+++      K A  E+ ++ L+   D   Q E++S+ETI  L
Sbjct: 204 QKEAIFPQIASEETIRKLKEKHDVHVQKEAISEDTIRNLKEKNDMHMQKETLSQETIRKL 263


>gi|3786014|gb|AAC67360.1| hypothetical protein [Arabidopsis thaliana]
          Length = 377

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 127/321 (39%), Positives = 198/321 (61%), Gaps = 26/321 (8%)

Query: 151 EATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAAL 210
           E  LEETIKQLR +N  ++Q+E  LE  +   +++ +  L++ ++ E ++ QL DE +  
Sbjct: 81  EVLLEETIKQLREENGSYLQKEEKLEERLVQYKNKNDMLLREMSSTEAQMRQLLDERSTF 140

Query: 211 NMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQEN 270
             K ASLE++++ L+ D++S    E  S+E I+ L+ +I +LR QV ELE+S++NLL++N
Sbjct: 141 TQKEASLEKKVQQLQHDEESLVAEEKSSREMISSLNNEIARLRAQVTELEKSKSNLLEQN 200

Query: 271 RQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSV 330
           + LKE +S+L+ Q  + +S     SE+ + ++QIEAA  L++KLITEN +LVEKVN+L +
Sbjct: 201 QSLKETISNLQVQHENHDSNAKGASEE-ELNSQIEAACTLVEKLITENADLVEKVNELCI 259

Query: 331 KLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGIN 390
           KL++   A+  S AI  +   +L  +  P+    D +  ++N  +     ++K  RN   
Sbjct: 260 KLNQSQHASPESLAIEVEKSESL--EEIPI---HDELIRIDNSRDMD-TASIK--RNFSE 311

Query: 391 GVHADPPPLVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKV-AAVPLTDAPLIG 449
           G   +  PL L++     +GE+           D+E QV  +  D++ A VPL DAPLIG
Sbjct: 312 GEIEETVPLSLNA-----NGEV-----------DVESQVAVAGEDEINAGVPLADAPLIG 355

Query: 450 APFRLVSFVAKYVSGADLVNK 470
           APFRLVSFVA+YVSGADL  K
Sbjct: 356 APFRLVSFVARYVSGADLAAK 376


>gi|449439795|ref|XP_004137671.1| PREDICTED: uncharacterized protein LOC101221440 [Cucumis sativus]
 gi|449497171|ref|XP_004160333.1| PREDICTED: uncharacterized LOC101221440 [Cucumis sativus]
          Length = 392

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 130/333 (39%), Positives = 196/333 (58%), Gaps = 13/333 (3%)

Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
           K+A L++TIK L  +N++HI+R   L++ +   + EK  WLQKE AL  KI  L+++  A
Sbjct: 62  KDAKLDDTIKHLHEENNIHIKRMADLDLKLVECEGEKYSWLQKEEALMDKIRNLQEDKTA 121

Query: 210 LNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQE 269
           L+++ A L   +KLLE DK S    E  S+ETI   + DI++L+ QVVELEE + +LL E
Sbjct: 122 LDLEGARLLNIIKLLERDKASLILDEKSSRETIVDKNKDISRLQAQVVELEEQKRDLLHE 181

Query: 270 NRQLKENVSSLRSQLSSDESKKHATS-------EQKDFSTQIEAAGALIDKLITENIELV 322
           N+QL   V+  +S+L + E K  +T         ++  ++Q++AA  L+DKLITEN EL+
Sbjct: 182 NKQLTGKVADYQSKLLNLERKISSTYIHSSDRVTKEILNSQVDAARILVDKLITENAELI 241

Query: 323 EKVNDLSVKLDRQSVAAGLSSAIGSDAVTALTSDTEPMSESSDN--MSSLNNRLETQGVV 380
            KVN+L V+L R +    L S +  D +    +DT   +ES     ++S+ +      + 
Sbjct: 242 GKVNELFVELQRVT-KTELPSGVVPDQMATEATDTTTFNESEPPVILNSVTSGKSLDALK 300

Query: 381 AVKEDRNGINGVHADPPPLVLSS--SEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVA 438
           +V    + I G   D     ++S  S    +GEI QI L   E Q+   ++  +  D+  
Sbjct: 301 SVSIHSHSIGGDFVDLGSDFMASEASMPMAAGEIEQIQLHQFEDQNGTRELPATEIDE-K 359

Query: 439 AVPLTDAPLIGAPFRLVSFVAKYVSGADLVNKN 471
            V L+DAPLIGAP+RL+SF+AKYVSGADLV K+
Sbjct: 360 DVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 392


>gi|326488811|dbj|BAJ98017.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 452

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 108/360 (30%), Positives = 174/360 (48%), Gaps = 53/360 (14%)

Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
           LY+ K+   L +TI+QL+ +  L +QK   +E  I++L N+   H Q E  LE  + NLQ
Sbjct: 132 LYEAKLDK-LHDTIKQLEDEKGLWLQKMNKMESEIEKLHNKVGYHAQNEVILEEKLNNLQ 190

Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
           +  +  ++KE  L+ K+  + D +  L  +   L+ERL  LE    +             
Sbjct: 191 NGHDMLVKKEEVLDNKVKCVEDVNGVLTHQETFLKERLSGLEETNKA------------- 237

Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATS-------E 296
                   L+ QV  L+E+  + ++EN+QL  +V  L S+L + E+K   T         
Sbjct: 238 --------LQEQVKVLDEASKSTVEENQQLLVSVDELESRLQTLEAKIALTEASITKEVP 289

Query: 297 QKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSA--IGSDAVTALT 354
           + +   Q + AG+ + K     + ++ K N+L+   DR     GL+S+  + SD + +  
Sbjct: 290 KNEVMNQTDLAGSFLHKQTAGFMTVISKGNELTA--DR-----GLNSSLVVTSDNIYSHV 342

Query: 355 SDTEPMSESSDNMSSLNNRL----ETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSG 410
           S+    + +SD+    +        + G      + N   G             E   SG
Sbjct: 343 SNIPVGAYASDHADETSAHFPEATSSNGAGQSLMNANARQGF-----------DEPRMSG 391

Query: 411 EIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           EIV +PLDD  + + + Q   S  +    VP TDAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 392 EIVPVPLDDILIHEDDPQPAGSDVETAEEVPFTDAPIVGAPFRLISFVARYVSGADLVNQ 451


>gi|148909438|gb|ABR17817.1| unknown [Picea sitchensis]
          Length = 558

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 122/422 (28%), Positives = 199/422 (47%), Gaps = 97/422 (22%)

Query: 130 QATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFW 189
           +A+ EE I++L+ +  L MQKEA++E+ IK+ + +    +Q+EG +E  I  L+ EK  W
Sbjct: 143 EASKEEEIKRLKDKFKLHMQKEASIEDEIKRFKEEKISWLQKEGTMEKEIKRLKDEKSSW 202

Query: 190 LQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDI 249
           LQ+EA  +++I  L DE      K    EE ++ L+A+ +S  + E   ++   GL  + 
Sbjct: 203 LQREAKTQEEIKSLTDEKQLWLQKEVFREEEIERLKAENNSLLKKEVTKEDEAKGLRFEN 262

Query: 250 TQLRMQVVELEESRNNLLQENRQLK-------------ENVSSLRSQLSSD--------- 287
             L+ +VVE+++S  +L  E++QL              E  +++ SQL  D         
Sbjct: 263 FSLQKEVVEMKKSLESLSLEHQQLTAQAVAVQTGIQQLERNATVHSQLEQDIAIATTPEV 322

Query: 288 -------ESKKHATSE----QKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQS 336
                   S K   +E     +D   Q EAA A+++KL+ EN EL+EKVN+LS  +D+  
Sbjct: 323 SKQWLGWGSAKDPVAEAARIAEDSRLQKEAAAAIVEKLVIENTELMEKVNELSTIVDQ-- 380

Query: 337 VAAGLSSAIGSDAVTALTS-DTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHAD 395
           +    SS   S      T+ D   +S  +   +SL+ +L            NG+ GV   
Sbjct: 381 LNKKCSSLAESKVRLQETAQDATALSPLTFKENSLSTKL-----------SNGL-GVTNY 428

Query: 396 PPPLVLSSSEAEYS-GEIVQI-----PLDDKEVQDLELQVVE-----------------S 432
           P   ++++    Y+  EIV+I     P+++     +E ++VE                 S
Sbjct: 429 PSVSIVNNDVTSYTESEIVEIMASQDPVNNDVTSYMESEIVEIMPSQDPVTMEDGENGLS 488

Query: 433 YTDKVA--------------------------AVPLTDAPLIGAPFRLVSFVAKYVSGAD 466
            +D +                            VP +DAP++GAP RLVSF+ KYV+GAD
Sbjct: 489 ASDSIGTVEEAKQITSNSVREPVDMTFQKTSETVPFSDAPVMGAPIRLVSFLTKYVTGAD 548

Query: 467 LV 468
           LV
Sbjct: 549 LV 550


>gi|242035841|ref|XP_002465315.1| hypothetical protein SORBIDRAFT_01g036280 [Sorghum bicolor]
 gi|241919169|gb|EER92313.1| hypothetical protein SORBIDRAFT_01g036280 [Sorghum bicolor]
          Length = 458

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/335 (31%), Positives = 170/335 (50%), Gaps = 35/335 (10%)

Query: 154 LEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMK 213
           L ETIKQL N+  L +Q+   +E  +  L SE     Q E  LE K++ L+     L  K
Sbjct: 140 LHETIKQLENEKSLWLQKVSIMEGELEKLHSEVGCHAQNEVLLEGKVNSLQSGYDVLIKK 199

Query: 214 RASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQL 273
              L+++++ ++   D+ T  E++ KE ++GL      L +QV  LE++ NN  +EN+ L
Sbjct: 200 EEVLDDKVRRIDNINDTLTHQEALLKERLSGLEETNKALLVQVKMLEDASNNTSEENQML 259

Query: 274 KENVSSLRSQLSSDESKKHATSEQKDFSTQIEA-AGALIDKLITENIELVEKVNDLSVKL 332
            + +  L S+L + E+K  A SE    ++ IE     ++ +    + +++++V+  +  L
Sbjct: 260 VQKIDELDSRLQALEAKC-APSE----ASMIEKNVIKIVMQFPVPDNKVMDQVDFANSPL 314

Query: 333 DRQSVAA----------GLSSA--IGSDAVTALTSDTEPMSESSDNMSSLNNRLETQGVV 380
            +Q++            GL S+  I  D       D    + +S+     + +L   G  
Sbjct: 315 QQQTIGFSEGNKLIAERGLGSSVQINPDNSYGQIYDIPSNAYASNYPEEASIQLPEIGT- 373

Query: 381 AVKEDRNGINGVHADPPPLVLSSSEAEYSG-----EIVQIPLDDKEVQDLELQVVESYTD 435
                 N I   H D     +S +E  + G     EIV +PLDD E+ + + Q      D
Sbjct: 374 -----SNSIVQGHVD-----VSVNEHRFDGPRPSEEIVPVPLDDIEIHEDDPQ-QPGAAD 422

Query: 436 KVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           ++  VP +DAP+IGAPFRL+SFVA+YVSGADLVN+
Sbjct: 423 EIDEVPFSDAPIIGAPFRLISFVARYVSGADLVNQ 457


>gi|212721506|ref|NP_001131641.1| uncharacterized protein LOC100192998 [Zea mays]
 gi|194692122|gb|ACF80145.1| unknown [Zea mays]
 gi|414866651|tpg|DAA45208.1| TPA: hypothetical protein ZEAMMB73_881768 [Zea mays]
          Length = 446

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 105/350 (30%), Positives = 171/350 (48%), Gaps = 52/350 (14%)

Query: 140 LQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQK 199
           + R ++ ++ K   L ETIKQL N+  L +Q+   +E  +  L +E     Q E  LE+K
Sbjct: 129 MHRLHEAKLDK---LHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHAQNEVLLEEK 185

Query: 200 ISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVEL 259
           ++ L+     L  K   ++ +++ ++   D+ T  ES+ KE ++GL      L +QV  L
Sbjct: 186 LNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETNKSLLVQVKVL 245

Query: 260 EESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAA--GALIDKLITE 317
           EE+ NN  +EN+ L + +  L S+L + E+K  A SE    +  IE      ++D++   
Sbjct: 246 EEASNNTSEENQMLVKKIYELDSRLQALEAKA-APSE----ALMIEKFPDNKVMDQVHFA 300

Query: 318 NIELVEKVNDLS--------------VKLDRQSVAAGLSSAIGSDAVTALTSDTE---PM 360
           N  L ++  D S              VKL+  +    +     +D  +  T +T    P 
Sbjct: 301 NSPLQQQTIDFSEGNKLFAERGLSSPVKLNPDNSYRQIYDIPSNDYASNYTEETSIQLPE 360

Query: 361 SESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIPLDDK 420
             +S+ +         QG V V E R        D P  +          EIV +PLD+ 
Sbjct: 361 IGTSNTI--------VQGHVDVNEHR-------FDGPRTI---------EEIVPVPLDEI 396

Query: 421 EVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           ++ + ++       D++  VP +DAP+IGAPFRL+SFVA+YVSGADLVN+
Sbjct: 397 QIHE-DVPGQPGAADEIDEVPFSDAPIIGAPFRLISFVARYVSGADLVNQ 445



 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 69/280 (24%), Positives = 115/280 (41%), Gaps = 59/280 (21%)

Query: 1   MEDEKKKKRNKKKKNNKQTKTTTDDVAVGADPNH-----LINGQK----DDVRSQASE-- 49
           M+ +KK+ + KK    K     T      A  NH      I+  K    DDV S   E  
Sbjct: 1   MDSKKKRNKKKKGNQGKNAADVTSIAGEAASQNHNHELAPIDHHKGSDADDVMSSVGEEL 60

Query: 50  -------PADIQNVQVDAD----------------------------RHQSNGA-----E 69
                  P    N+Q D +                             H+++ A      
Sbjct: 61  PQYKNHQP----NIQADYNGTSAYDTTSSIGECIACYQNNEPMMTQENHKASNAVPADQR 116

Query: 70  SANLAEAERQ---HLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYK 126
           S  L+E+  +   H L    +  L ET+K L NE+   +QK  T+EG +++L NE   + 
Sbjct: 117 SVGLSESSVELDMHRLHEAKLDKLHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHA 176

Query: 127 EKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEK 186
           +  +  LEE +  LQ   D+ ++KE  ++  ++ + N ND    +E  L+  ++ L+   
Sbjct: 177 QN-EVLLEEKLNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETN 235

Query: 187 EFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEA 226
           +  L +   LE+  +   +E+  L  K   L+ RL+ LEA
Sbjct: 236 KSLLVQVKVLEEASNNTSEENQMLVKKIYELDSRLQALEA 275


>gi|414866652|tpg|DAA45209.1| TPA: hypothetical protein ZEAMMB73_881768 [Zea mays]
          Length = 454

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 102/348 (29%), Positives = 174/348 (50%), Gaps = 40/348 (11%)

Query: 140 LQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQK 199
           + R ++ ++ K   L ETIKQL N+  L +Q+   +E  +  L +E     Q E  LE+K
Sbjct: 129 MHRLHEAKLDK---LHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHAQNEVLLEEK 185

Query: 200 ISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVEL 259
           ++ L+     L  K   ++ +++ ++   D+ T  ES+ KE ++GL      L +QV  L
Sbjct: 186 LNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETNKSLLVQVKVL 245

Query: 260 EESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENI 319
           EE+ NN  +EN+ L + +  L S+L + E+K  A SE             +I+K +T+ +
Sbjct: 246 EEASNNTSEENQMLVKKIYELDSRLQALEAKA-APSE-----------ALMIEKSVTKTV 293

Query: 320 ------ELVEKVNDLSVKLDRQSV----AAGLSSAIGSDAVTALTSDTEP--MSESSDNM 367
                 +++++V+  +  L +Q++       L +  G  +   L  D     + +   N 
Sbjct: 294 MQFPDNKVMDQVHFANSPLQQQTIDFSEGNKLFAERGLSSPVKLNPDNSYRQIYDIPSND 353

Query: 368 SSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSG-----EIVQIPLDDKEV 422
            + N   ET   +      N I   H D        +E  + G     EIV +PLD+ ++
Sbjct: 354 YASNYTEETSIQLPEIGTSNTIVQGHVD-------VNEHRFDGPRTIEEIVPVPLDEIQI 406

Query: 423 QDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
            + ++       D++  VP +DAP+IGAPFRL+SFVA+YVSGADLVN+
Sbjct: 407 HE-DVPGQPGAADEIDEVPFSDAPIIGAPFRLISFVARYVSGADLVNQ 453



 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 69/280 (24%), Positives = 115/280 (41%), Gaps = 59/280 (21%)

Query: 1   MEDEKKKKRNKKKKNNKQTKTTTDDVAVGADPNH-----LINGQK----DDVRSQASE-- 49
           M+ +KK+ + KK    K     T      A  NH      I+  K    DDV S   E  
Sbjct: 1   MDSKKKRNKKKKGNQGKNAADVTSIAGEAASQNHNHELAPIDHHKGSDADDVMSSVGEEL 60

Query: 50  -------PADIQNVQVDAD----------------------------RHQSNGA-----E 69
                  P    N+Q D +                             H+++ A      
Sbjct: 61  PQYKNHQP----NIQADYNGTSAYDTTSSIGECIACYQNNEPMMTQENHKASNAVPADQR 116

Query: 70  SANLAEAERQ---HLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYK 126
           S  L+E+  +   H L    +  L ET+K L NE+   +QK  T+EG +++L NE   + 
Sbjct: 117 SVGLSESSVELDMHRLHEAKLDKLHETIKQLENEKSLWLQKVITMEGELEKLHNEVGCHA 176

Query: 127 EKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEK 186
           +  +  LEE +  LQ   D+ ++KE  ++  ++ + N ND    +E  L+  ++ L+   
Sbjct: 177 QN-EVLLEEKLNSLQSGYDVLIKKEEVVDNKVRCIDNINDTLTHQESLLKERLSGLEETN 235

Query: 187 EFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEA 226
           +  L +   LE+  +   +E+  L  K   L+ RL+ LEA
Sbjct: 236 KSLLVQVKVLEEASNNTSEENQMLVKKIYELDSRLQALEA 275


>gi|125543744|gb|EAY89883.1| hypothetical protein OsI_11427 [Oryza sativa Indica Group]
          Length = 424

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 106/354 (29%), Positives = 167/354 (47%), Gaps = 68/354 (19%)

Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
           LY+ K+   L +TI++L+ +  L  QK +++E  +++L N+ D H Q E  LE  + NLQ
Sbjct: 131 LYEAKLDK-LHDTIKKLEDEKSLWHQKMSSMEIEVEKLHNKVDYHAQNEVRLEEKLNNLQ 189

Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
           +  +  ++KE AL+ K+                     + +E   D+ T  E+  KE ++
Sbjct: 190 NGYDVLIKKEVALDNKV---------------------RSIEVINDALTHQETSLKERLS 228

Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQ 303
           GL      L +QV  LEE+ NN ++E+++L +    L S+L                   
Sbjct: 229 GLEETNKVLLVQVKVLEEASNNTVEESQRLVKGFDELASRLGV----------------- 271

Query: 304 IEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLS----SAIGSDAVTALTSDT-- 357
            EA  AL +  +T      +K N+L V     S AA  S    S I S    A  S+   
Sbjct: 272 FEAKSALTEASVT------KKGNELIVDRSVNSSAAITSVDNYSPINSSPSNAYVSNHLE 325

Query: 358 -EPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIP 416
             PM      +    N + ++G++    D N       D         E   S EI+ + 
Sbjct: 326 EAPMQLPETTI----NDVSSEGLI----DMNAHQRSKQD-------CDEPRTSEEILPVA 370

Query: 417 LDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           LDD ++ + + Q   +  D+   VP +DAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 371 LDDIQIHEEDPQPPVA-DDEAEEVPFSDAPIVGAPFRLISFVARYVSGADLVNQ 423


>gi|115452839|ref|NP_001050020.1| Os03g0333000 [Oryza sativa Japonica Group]
 gi|108707985|gb|ABF95780.1| expressed protein [Oryza sativa Japonica Group]
 gi|113548491|dbj|BAF11934.1| Os03g0333000 [Oryza sativa Japonica Group]
          Length = 424

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 106/354 (29%), Positives = 167/354 (47%), Gaps = 68/354 (19%)

Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
           LY+ K+   L +TI++L+ +  L  QK +++E  +++L N+ D H Q E  LE  + NLQ
Sbjct: 131 LYEAKLDK-LHDTIKKLEDEKSLWHQKMSSMEIEVEKLHNKVDYHAQNEVRLEEKLNNLQ 189

Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
           +  +  ++KE AL+ K+                     + +E   D+ T  E+  KE ++
Sbjct: 190 NGYDVLIKKEVALDNKV---------------------RSIEVINDALTHQETSLKERLS 228

Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQ 303
           GL      L +QV  LEE+ NN ++E+++L +    L S+L                   
Sbjct: 229 GLEETNKVLLVQVKVLEEASNNTVEESQRLVKGFDELASRLGV----------------- 271

Query: 304 IEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLS----SAIGSDAVTALTSDT-- 357
            EA  AL +  +T      +K N+L V     S AA  S    S I S    A  S+   
Sbjct: 272 FEAKSALTEASVT------KKGNELIVDRSVNSSAAITSVDNYSPINSSPSNAYVSNHLE 325

Query: 358 -EPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEIVQIP 416
             PM      +    N + ++G++    D N       D         E   S EI+ + 
Sbjct: 326 EAPMQLPETTI----NDVSSEGLI----DMNAHQRSKQD-------FDEPRTSEEILPVA 370

Query: 417 LDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           LDD ++ + + Q   +  D+   VP +DAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 371 LDDIQIHEEDPQPPVA-DDEAEEVPFSDAPIVGAPFRLISFVARYVSGADLVNQ 423


>gi|125586148|gb|EAZ26812.1| hypothetical protein OsJ_10723 [Oryza sativa Japonica Group]
          Length = 429

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 107/358 (29%), Positives = 168/358 (46%), Gaps = 71/358 (19%)

Query: 124 LYKEKV----QATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNI 179
           LY+ K+    Q  L +TI++L+ +  L  QK +++E  +++L N+ D H Q E  LE  +
Sbjct: 131 LYEAKLILPMQDKLHDTIKKLEDEKSLWHQKMSSMEIEVEKLHNKVDYHAQNEVRLEEKL 190

Query: 180 ANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSK 239
            NLQ+  +  ++KE AL+ K+                     + +E   D+ T  E+  K
Sbjct: 191 NNLQNGYDVLIKKEVALDNKV---------------------RSIEVINDALTHQETSLK 229

Query: 240 ETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKD 299
           E ++GL      L +QV  LEE+ NN ++E+++L +    L S+L               
Sbjct: 230 ERLSGLEETNKVLLVQVKVLEEASNNTVEESQRLVKGFDELASRLGV------------- 276

Query: 300 FSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLS----SAIGSDAVTALTS 355
                EA  AL +  +T      +K N+L V     S AA  S    S I S    A  S
Sbjct: 277 ----FEAKSALTEASVT------KKGNELIVDRSVNSSAAITSVDNYSPINSSPSNAYVS 326

Query: 356 DT---EPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVLSSSEAEYSGEI 412
           +     PM      +    N + ++G++    D N       D         E   S EI
Sbjct: 327 NHLEEAPMQLPETTI----NDVSSEGLI----DMNAHQRSKQD-------FDEPRTSEEI 371

Query: 413 VQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFVAKYVSGADLVNK 470
           + + LDD ++ + + Q   +  D+   VP +DAP++GAPFRL+SFVA+YVSGADLVN+
Sbjct: 372 LPVALDDIQIHEEDPQPPVA-DDEAEEVPFSDAPIVGAPFRLISFVARYVSGADLVNQ 428


>gi|147845679|emb|CAN80589.1| hypothetical protein VITISV_040788 [Vitis vinifera]
          Length = 472

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 50/76 (65%), Positives = 63/76 (82%), Gaps = 2/76 (2%)

Query: 399 LVLSSSEAEYSGEIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLIGAPFRLVSFV 458
           ++ +SSE + SGEIVQIPLD+ E ++LELQ  E+  D+   VPLTDAPLIGAPFRL+SFV
Sbjct: 394 VIPNSSETDDSGEIVQIPLDENEARELELQAAEN--DRNTTVPLTDAPLIGAPFRLISFV 451

Query: 459 AKYVSGADLVNKNASN 474
           AKYVSGADLV K+++N
Sbjct: 452 AKYVSGADLVGKSSTN 467


>gi|357112364|ref|XP_003557979.1| PREDICTED: uncharacterized protein LOC100829228 isoform 1
           [Brachypodium distachyon]
          Length = 448

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 173/382 (45%), Gaps = 98/382 (25%)

Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
           LY+ K+   L  T++QL+ +  L ++K   +E  +++L N+   H Q E  LE  +  LQ
Sbjct: 129 LYEAKLDK-LHGTVKQLEDEKSLWLKKVNMMEIELEKLHNKVGYHAQNEVLLEEKLDILQ 187

Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIA 243
              +  ++KE  L+ K+S + D +  L  +  SL+ERL          +++E  +K    
Sbjct: 188 HGHDMLVKKEEVLDNKVSCIEDANVVLTHEETSLKERL----------SELEETNK---- 233

Query: 244 GLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHAT--------S 295
                   L  QV  L+E+    ++EN+ L  ++  L S+L + E+K   +         
Sbjct: 234 -------ALLEQVKVLDEASKITVEENQSLLTSIYELESRLQAVEAKIFLSEVSISKEVP 286

Query: 296 EQKDFSTQIEAAGALIDKLIT--------ENIELVEKVNDLSVKLDRQSVAAGLSSAIGS 347
           E K    Q +  G+L+    T        E  EL+                 GL+S++  
Sbjct: 287 ENKVMDHQTDLTGSLLHNQTTDFTNLISNEGNELIGN--------------RGLNSSV-- 330

Query: 348 DAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVL---SSS 404
             VT           S +N S +NN   +  +  +          H D   L+    +SS
Sbjct: 331 -TVT-----------SENNHSHINN---SSSIAYISN--------HPDETSLLFPEATSS 367

Query: 405 EAE--------YSG--------EIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLI 448
            A+        + G        EI+ +PLDD ++ + +LQ      D+ A VP TDAP++
Sbjct: 368 SADQGFIHENAHQGFDKPRINEEIMPVPLDDIQIHEDDLQ--PRGLDETAEVPFTDAPIV 425

Query: 449 GAPFRLVSFVAKYVSGADLVNK 470
           GAPFRL+SFVA+YVSGADLV++
Sbjct: 426 GAPFRLISFVARYVSGADLVDQ 447


>gi|357112366|ref|XP_003557980.1| PREDICTED: uncharacterized protein LOC100829228 isoform 2
           [Brachypodium distachyon]
          Length = 416

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 86/322 (26%), Positives = 153/322 (47%), Gaps = 69/322 (21%)

Query: 175 LEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQM 234
           L   +  L+ EK  WL+K   +E ++ +L ++          LEE+L +L+   D   + 
Sbjct: 137 LHGTVKQLEDEKSLWLKKVNMMEIELEKLHNKVGYHAQNEVLLEEKLDILQHGHDMLVKK 196

Query: 235 ESVSKETIAG-------LSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSD 287
           E V    ++        L+ + T L+ ++ ELEE+   LL++ + L E      S+++ +
Sbjct: 197 EEVLDNKVSCIEDANVVLTHEETSLKERLSELEETNKALLEQVKVLDE-----ASKITVE 251

Query: 288 ESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGLSSAIGS 347
           E++   TS   +  ++++A  A   K+    + + ++ N+L                IG+
Sbjct: 252 ENQSLLTS-IYELESRLQAVEA---KIFLSEVSISKEGNEL----------------IGN 291

Query: 348 DAVTALTSDTEPMSESSDNMSSLNNRLETQGVVAVKEDRNGINGVHADPPPLVL---SSS 404
             + +  + T     S +N S +NN   +  +  +          H D   L+    +SS
Sbjct: 292 RGLNSSVTVT-----SENNHSHINN---SSSIAYISN--------HPDETSLLFPEATSS 335

Query: 405 EAE--------YSG--------EIVQIPLDDKEVQDLELQVVESYTDKVAAVPLTDAPLI 448
            A+        + G        EI+ +PLDD ++ + +LQ      D+ A VP TDAP++
Sbjct: 336 SADQGFIHENAHQGFDKPRINEEIMPVPLDDIQIHEDDLQ--PRGLDETAEVPFTDAPIV 393

Query: 449 GAPFRLVSFVAKYVSGADLVNK 470
           GAPFRL+SFVA+YVSGADLV++
Sbjct: 394 GAPFRLISFVARYVSGADLVDQ 415


>gi|357477631|ref|XP_003609101.1| hypothetical protein MTR_4g111980, partial [Medicago truncatula]
 gi|355510156|gb|AES91298.1| hypothetical protein MTR_4g111980, partial [Medicago truncatula]
          Length = 475

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 108/229 (47%), Gaps = 39/229 (17%)

Query: 45  SQASEPADIQNVQVDADRHQSNGAESANLAEAERQHLLQREAMAILEETVKHLRNERESH 104
           +Q  E AD Q+  +D++ H  NG E                   I +ET+K L+ E    
Sbjct: 281 AQPMESADGQSTNMDSNGHLPNGKECD-----------------ISDETIKKLKEENAML 323

Query: 105 IQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQ 164
           IQ+E   E                      ETI++L    ++ + KE   EETI++L  Q
Sbjct: 324 IQREVMSE----------------------ETIRKLNEDKNMHILKETISEETIRKLEEQ 361

Query: 165 NDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLL 224
            D+H+++E  LE +I  L+ E +  ++KEA LE  I +L  E+     K A+ +E ++ L
Sbjct: 362 KDMHVEKEVALEESIRKLKKEIDMHVKKEAILEDTIRKLNKENDTHMQKEATSKETIRNL 421

Query: 225 EADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQL 273
           E + + + Q E+ ++ETIA L   + +   + V +EE  N L    R L
Sbjct: 422 EEENEVYIQKEAKAEETIAKLKEAVDRHIHKEVTMEEIINKLQTWYRAL 470


>gi|123240166|ref|XP_001287759.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121855734|gb|EAX74829.1| hypothetical protein TVAG_009050 [Trichomonas vaginalis G3]
          Length = 394

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 87/141 (61%), Gaps = 2/141 (1%)

Query: 90  LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
           L + VK L+++  +  +K+  LEG +Q+L+++ D  +++    LE+ IQ+L+ ++  + +
Sbjct: 3   LADKVKGLQDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEDAIQELKDKDAAQDE 61

Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
           K+  LE+ I++L++++    +++  LE  I  L+ +     +K+  LE  I +L+D+ AA
Sbjct: 62  KDKNLEDAIQELKDKDAAQDEKDKNLEDAIQELKDKDAAQDEKDKNLEGAIQELKDKDAA 121

Query: 210 LNMKRASLEERLKLLEADKDS 230
            + K  +LE+ ++ L+ DKD+
Sbjct: 122 QDEKDKNLEDAIQELK-DKDA 141



 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 78/129 (60%), Gaps = 1/129 (0%)

Query: 90  LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
           LE+ ++ L+++  +  +K+  LEG +Q+L+++ D  +++    LE+ IQ+L+ ++  + +
Sbjct: 87  LEDAIQELKDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEDAIQELKDKDAAQDE 145

Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
           K+  LE  I++L++++    +++  LE  I  L+ +     +K+  LE  I +L+D+ AA
Sbjct: 146 KDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAA 205

Query: 210 LNMKRASLE 218
            + K  +LE
Sbjct: 206 QDEKDKNLE 214



 Score = 46.2 bits (108), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 58/248 (23%), Positives = 125/248 (50%), Gaps = 41/248 (16%)

Query: 90  LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
           LE+ ++ L+++  +  +K+  LEG +Q+L+++ D  +++    LE  IQ+L+ ++  + +
Sbjct: 129 LEDAIQELKDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEGAIQELKDKDAAQDE 187

Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAA 209
           K+  LE  I++L++++    +++  LE  I  L+ +     +K+  LE  I +L+D+ AA
Sbjct: 188 KDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAA 247

Query: 210 LNMKRASLEERLKLLEADKDS-WTQMESVSKETIAGLSVDITQLRMQVVELEESR----- 263
            + K  +LE  ++ L+ DKD+   + +  ++E I  L+  +  L+ ++ + EE++     
Sbjct: 248 QDEKDKNLEGAIQELK-DKDAAQDEKDKANEEAIKSLADRLQDLKDKIRDAEEAKATQGA 306

Query: 264 -------------NNLLQE---NRQLK-----------------ENVSSLRSQLSSDESK 290
                        NNL++E   +  LK                 E++ +L  Q+   E +
Sbjct: 307 EDLQGVNDALNALNNLIKEKADDDALKALEDRLNNNDNKDNKQDEDLKALADQIQEMEDR 366

Query: 291 KHATSEQK 298
           K    EQ+
Sbjct: 367 KKEEDEQR 374


>gi|123469108|ref|XP_001317768.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121900510|gb|EAY05545.1| hypothetical protein TVAG_319000 [Trichomonas vaginalis G3]
          Length = 491

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 41/184 (22%), Positives = 92/184 (50%), Gaps = 7/184 (3%)

Query: 132 TLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQ 191
           +LE+++ Q++       QK   LE ++ Q+++      Q+ G LE ++  ++ ++    Q
Sbjct: 282 SLEQSVNQIKDDQTTMNQKLGALETSVNQIKDDQTTMNQKLGALETSVNQIKDDQTTMNQ 341

Query: 192 KEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQ 251
           K  ALE  ++Q++D+   +N K  +LE  +  ++ D+ +  Q     + ++  +  D T 
Sbjct: 342 KLGALETSVNQIKDDQTTMNQKLGALETSVNQIKDDQTTMNQKLGALETSVNQIKDDQTT 401

Query: 252 LRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALI 311
           +  ++  LE S N       Q+K++ +++  +LS  +S + + +      T++ A     
Sbjct: 402 MNQKLGALETSVN-------QIKDDQTTMNQKLSKLDSLEQSVNNITQMLTKLNAQYETN 454

Query: 312 DKLI 315
           DK+I
Sbjct: 455 DKII 458


>gi|357477629|ref|XP_003609100.1| hypothetical protein MTR_4g111960 [Medicago truncatula]
 gi|355510155|gb|AES91297.1| hypothetical protein MTR_4g111960 [Medicago truncatula]
          Length = 643

 Score = 46.6 bits (109), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 74/168 (44%), Gaps = 36/168 (21%)

Query: 31  DPNHLINGQKDDVRSQASEPADIQNVQVDADRHQ---SNGAESANLAEAERQHLLQREAM 87
           D NHL+N  KD       E A+I    V+++R     + G+ESA         L +   +
Sbjct: 90  DHNHLVNDAKD-------EHANISEAVVNSNRGWRKFNQGSESA----IATPLLYKHSPL 138

Query: 88  AILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLR 147
            I +ET+++L+ E +  IQ+E   E T+Q L  E D                        
Sbjct: 139 DISDETIRNLKEENDMLIQRETISEKTIQILTEEIDT----------------------N 176

Query: 148 MQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAA 195
           ++KE    + IK+L  +N +HIQ E  LE NI  L+ E +   + E  
Sbjct: 177 IRKEVLSLKIIKELDEENHMHIQGEVALEENIRKLKKELDMHAKNEVT 224


>gi|346973208|gb|EGY16660.1| hypothetical protein VDAG_07824 [Verticillium dahliae VdLs.17]
          Length = 949

 Score = 46.2 bits (108), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 76/294 (25%), Positives = 134/294 (45%), Gaps = 63/294 (21%)

Query: 77  ERQHLLQREAMAILEETVKHLRN------ERESHIQKEATLEGTVQQLQNECDLYKEKVQ 130
           E Q+ +  +A + L    KH R+      E +   Q +A LE  V +L+ +  LYK+  +
Sbjct: 351 EHQYQMVDDAQSSLTTHSKHSRSPSKGLQEVQGLQQDKADLEAKVSELEKDLTLYKD-FK 409

Query: 131 ATLEETIQQLQ---RQNDLRMQKEA----TLEETIKQLR-NQNDLHIQREGGLEMNIANL 182
             LE    +LQ   +Q + R + ++    +  E ++ LR  Q+  HI R   LE++  +L
Sbjct: 410 VCLEAKCDELQDEIKQQEERFKTDSEKLRSDVEDVEDLRKTQDAAHIARIVSLELSKESL 469

Query: 183 QSEKEFWLQKEAALEQKIS--------------QLRDESAALNMKRASLEERLKLLEADK 228
           Q+       K  ALE +I+              QL  +  A   ++ +LE R+++LE++K
Sbjct: 470 QA-------KAQALEAEIARKDFEYGEFTTERDQLSKDKDAWVEEKKNLESRVEVLESEK 522

Query: 229 DSWTQMESVSKETIAGLSVDITQLRMQVVELEES--------------RNNLLQENRQLK 274
               +     +ETI GL  +   ++  VVELE++              ++ ++ ++ +LK
Sbjct: 523 SKLEEQTKALEETIKGLETERDSIQAHVVELEKTNGELQSKVGGLEDDKSTIVSKSEKLK 582

Query: 275 ENV-------SSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIEL 321
             V        +LR+Q+SS         +Q D ++Q+    A  D L  ENI L
Sbjct: 583 TKVDRLKGENGNLRTQVSS------LLKDQDDHTSQVLTLSAERDGLKEENITL 630


>gi|449688203|ref|XP_002167031.2| PREDICTED: flagellar attachment zone protein 1-like [Hydra
           magnipapillata]
          Length = 751

 Score = 45.8 bits (107), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 148/329 (44%), Gaps = 21/329 (6%)

Query: 69  ESANLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEK 128
           ES N  E E +     E    LEE+V  L +E +   +    LE +V +L+ E   ++E 
Sbjct: 250 ESVNKLEGENKKF--EENNKRLEESVNKLESENKKFEENNKKLEESVHKLEGENKKFEEN 307

Query: 129 VQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEF 188
               LEE++ +L+ +N    +    LEE++ +L  +N    +    L+ ++  L+     
Sbjct: 308 -NKKLEESVHKLEGENKKFEENNKKLEESVHKLEGENKKFEENNNRLQESVNKLE----- 361

Query: 189 WLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVD 248
             +    LE  ++QL + +  L      LE   K  E +     +  +  +E  A L  +
Sbjct: 362 --ENNNRLETSVNQLEENNKRLEDSIDKLEGENKKFEENNKKLEEENNRFQENNAKLESE 419

Query: 249 ITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLS--SDESKKHATSEQK------DF 300
           + +L  ++  LEE  N   + N  LK NVS L  +++  S  +K++  + +K        
Sbjct: 420 VNRLGGEINRLEEQNNKFAENNENLKNNVSKLEGEVTKMSHLNKEYEINNKKLEDNVNKL 479

Query: 301 STQIEAAGALIDKLITENIELVEKVNDLS---VKLDRQSVAAGLSSAIGSDAVTALTSDT 357
             ++   G    K    N +L E+VN L+    K+  ++     ++   S  VT ++ + 
Sbjct: 480 EGELHKMGEENKKFEVNNKKLEEQVNQLTSEVAKMTEENNKFSENNNKLSQEVTKISKEN 539

Query: 358 EPMSESSDNMSSLNNRLETQGVVAVKEDR 386
           E M+E++ N++    RL+ +    V+E+R
Sbjct: 540 EKMAENNKNLADEIARLKAEIDRMVEENR 568



 Score = 41.6 bits (96), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 84/171 (49%), Gaps = 1/171 (0%)

Query: 111 LEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQ 170
           LE +V +L+ E   ++E     LEE++ +L+ +N    +    LEE++ +L  +N    +
Sbjct: 185 LEASVNKLEGENKKFEEN-NKRLEESVNKLEGENKKFEENNKRLEESVNKLEGENKKFEE 243

Query: 171 REGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDS 230
               LE ++  L+ E + + +    LE+ +++L  E+         LEE +  LE +   
Sbjct: 244 NNKRLEESVNKLEGENKKFEENNKRLEESVNKLESENKKFEENNKKLEESVHKLEGENKK 303

Query: 231 WTQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLR 281
           + +     +E++  L  +  +      +LEES + L  EN++ +EN + L+
Sbjct: 304 FEENNKKLEESVHKLEGENKKFEENNKKLEESVHKLEGENKKFEENNNRLQ 354


>gi|302423820|ref|XP_003009740.1| predicted protein [Verticillium albo-atrum VaMs.102]
 gi|261352886|gb|EEY15314.1| predicted protein [Verticillium albo-atrum VaMs.102]
          Length = 923

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 75/294 (25%), Positives = 132/294 (44%), Gaps = 63/294 (21%)

Query: 77  ERQHLLQREAMAILEETVKHLRN------ERESHIQKEATLEGTVQQLQNECDLYKEKVQ 130
           E Q+ +  +A + L    KH R+      E +   Q +A L+  V +L+ +  LYK+  +
Sbjct: 352 EHQYQMVDDAQSSLTTHSKHSRSASKGLQEVQGLQQDKADLQAKVSELEKDLTLYKD-FK 410

Query: 131 ATLEETIQQLQ---RQNDLRMQKEA----TLEETIKQLR-NQNDLHIQREGGLEMNIANL 182
             LE    +LQ   +Q + R + ++    +  E ++ LR  Q+  HI R   LE++  +L
Sbjct: 411 VCLEAKCDELQDEIKQQEERFKTDSEKFRSDVEDVEDLRKTQDAAHIARIVSLELSKESL 470

Query: 183 QSEKEFWLQKEAALEQKIS--------------QLRDESAALNMKRASLEERLKLLEADK 228
           Q+       K  ALE +I+              QL  +  A   ++  LE R+++LE++K
Sbjct: 471 QA-------KAQALEAEIARKDFEYGEFTTERDQLSKDKDAWVEEKKDLESRVEVLESEK 523

Query: 229 DSWTQMESVSKETIAGLSVDITQLRMQVVELEES--------------RNNLLQENRQLK 274
               +     +ETI GL  +   ++  VVELE +              ++ ++ ++ +LK
Sbjct: 524 SKLEEQTKALEETIKGLETERDSIQAHVVELENTNGELQSKVGGLEDDKSTIVSKSEKLK 583

Query: 275 ENV-------SSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIEL 321
             V        +LR+Q+SS         +Q D ++Q+    A  D L  ENI L
Sbjct: 584 TKVDRLKGENGNLRTQVSS------LLKDQDDHTSQVLTLSAERDGLKEENITL 631


>gi|358395257|gb|EHK44644.1| hypothetical protein TRIATDRAFT_220313 [Trichoderma atroviride IMI
            206040]
          Length = 2049

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 67/127 (52%), Gaps = 5/127 (3%)

Query: 165  NDLHIQREGGLEMNIANLQSEKEFWLQKEAALE---QKISQLRDESAALNMKRASLEERL 221
            N+L+  RE G+ +   N Q +K+  L+K++ +E   Q I  L  E   +  +++ LEE +
Sbjct: 1296 NELNTYRESGMALRTENSQLKKQI-LEKDSKIEEMVQSIQPLEAEIENIKTQKSFLEEEI 1354

Query: 222  KLLEADKDSW-TQMESVSKETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSL 280
            K ++AD+D W  + E +  +       ++ QL+  + +LE  RN L +    L+  ++ L
Sbjct: 1355 KQIQADRDRWQKRTEGILTKYGRVDPAEMEQLKQTITDLETERNALKESAEPLQAKITEL 1414

Query: 281  RSQLSSD 287
             S L ++
Sbjct: 1415 ESTLETE 1421


>gi|322710763|gb|EFZ02337.1| filament-forming protein [Metarhizium anisopliae ARSEF 23]
          Length = 2030

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 69/142 (48%), Gaps = 6/142 (4%)

Query: 196  LEQKISQLRDESAALNMKRASLEERLKLLEADKDSW-TQMESVSKETIAGLSVDITQLRM 254
            +E K+  L  E   LNM+++ LEE +K ++ D+D W  + E +  +       ++ QL+ 
Sbjct: 1331 MEGKVQPLEAEIDNLNMQKSYLEEEIKQIQEDRDRWQKRTEGILTKYGRVDPAEMEQLKK 1390

Query: 255  QVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKL 314
             + +LE  R+ L Q  + L+E ++ L S L S+      T  +   + Q +       KL
Sbjct: 1391 TISDLEAERDCLKQGEQPLREKITELDSTLESERQAWQTT--RAKLTEQFKERSR---KL 1445

Query: 315  ITENIELVEKVNDLSVKLDRQS 336
              E  E  ++ N L  +LD+ S
Sbjct: 1446 TGEKNEAAQRANHLQEQLDKAS 1467


>gi|390337002|ref|XP_797348.3| PREDICTED: uncharacterized protein LOC592748 [Strongylocentrotus
           purpuratus]
          Length = 1201

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 76/149 (51%), Gaps = 19/149 (12%)

Query: 122 CDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEE-TIKQLRNQNDLHIQREGGLEMNIA 180
           C+ +K+  +A   + ++Q Q +N  R ++   + + T KQ+ +  +  ++ E  L+   A
Sbjct: 501 CEDFKQSERA-FRDQLKQHQAENGKRWEEAQRMTQLTNKQMESMGEDCLKAERQLKKVQA 559

Query: 181 NLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKE 240
            L SE+E W Q+E  LEQK+S L  E A      AS +E  +LL+ +           K 
Sbjct: 560 TLASEREEWKQREVELEQKVSSLEMERA------ASFDEMQRLLQEE-----------KR 602

Query: 241 TIAGLSVDITQLRMQVVELEESRNNLLQE 269
           T+  L+ D+ ++R +  E  + ++N  +E
Sbjct: 603 TVDELNKDLIKMRREADESRKDKDNAHRE 631


>gi|313672426|ref|YP_004050537.1| chromosome segregation protein smc [Calditerrivibrio nitroreducens
           DSM 19672]
 gi|312939182|gb|ADR18374.1| chromosome segregation protein SMC [Calditerrivibrio nitroreducens
           DSM 19672]
          Length = 1118

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 44/179 (24%), Positives = 92/179 (51%), Gaps = 25/179 (13%)

Query: 82  LQREAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQAT------LEE 135
           L RE++  L++ +K +   + S I   A LE T+++L N+ D YKE+V         + E
Sbjct: 340 LMRESIVKLQDEIKSVVETKRSEI---AELENTIEELVNQRDEYKEEVDVNRRRFLDITE 396

Query: 136 TIQQLQRQNDLRMQKEAT-LEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEA 194
            I ++ R N ++ + E   L + I +L N+    +     +E  IA+L+ +K+ +  ++ 
Sbjct: 397 KISKI-RSNIIKHESEMNRLNKDISRLENEKSNILGEVSKIESLIADLKIKKDIFFNEQM 455

Query: 195 ALE--------------QKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSK 239
            ++              +++  L D+ +AL + +++ E++LKL+EA+    +  E +S+
Sbjct: 456 VIQDSYTREKEIVNKIDEELKSLSDKLSALKVTKSTREQQLKLIEAEVKKQSYGEELSR 514


>gi|123407834|ref|XP_001303085.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121884435|gb|EAX90155.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 1165

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 106/210 (50%), Gaps = 13/210 (6%)

Query: 85  EAMAILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKE--KVQATLEETIQQLQR 142
           E+++ILE T K+ + E    I K   L  T ++L+   +L +E  K    L++   +L++
Sbjct: 64  ESISILESTKKYTKIEFVDGIIKILYL--TNKELKRYANLVREIMKENNKLKKENSELKK 121

Query: 143 QNDLRMQKEATLEETIKQLRNQNDLHIQREGGLE-------MNIANLQSEKEFWLQKEAA 195
            ND   ++   L+E   +L+ +ND      G L+       + I NL+       ++   
Sbjct: 122 TNDGLKKENDELKEENDRLKKENDALKNGCGVLKEENEKSNIEINNLKKTNHDLKEENDR 181

Query: 196 LEQKISQLRDESAALNMKRASLE-ERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRM 254
            +++I+ L+ E   L ++  +L+ ER ++L+   D   +   + KE   G   +I +L+ 
Sbjct: 182 FKKEINDLKKECHTLKIESNNLKNERTRMLKEINDLKKETNDLIKEN-NGFKTEIIELKK 240

Query: 255 QVVELEESRNNLLQENRQLKENVSSLRSQL 284
           Q  E+EE+ + ++QEN  LK+   SL+ Q+
Sbjct: 241 QQREVEENYDRIIQENINLKKENESLKKQI 270


>gi|123162154|ref|XP_001278994.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121827325|gb|EAX66064.1| hypothetical protein TVAG_538710 [Trichomonas vaginalis G3]
          Length = 338

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 67/116 (57%), Gaps = 1/116 (0%)

Query: 90  LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
           L + VK L+++  +  +K+  LEG +Q+L+++ D  +++    LE  IQ+L+ ++  + +
Sbjct: 3   LADKVKGLQDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEGAIQELKDKDAAQDE 61

Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRD 205
           K+  LE  I++L++++    +++  LE  I  L+ +     +K+ A E+ I  L D
Sbjct: 62  KDKNLEGAIQELKDKDAAQDEKDKNLEGAIQELKDKDAAQDEKDKANEEAIKSLAD 117


>gi|365986334|ref|XP_003669999.1| hypothetical protein NDAI_0D04420 [Naumovozyma dairenensis CBS 421]
 gi|343768768|emb|CCD24756.1| hypothetical protein NDAI_0D04420 [Naumovozyma dairenensis CBS 421]
          Length = 1914

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 99/194 (51%), Gaps = 30/194 (15%)

Query: 45   SQASEPADIQNVQVDADRHQSNGAESANLAEAERQHLLQ-----REAMAILEETVKHLRN 99
            +++SE     N +++A   + N  E    AE  R++LL+     +E ++ LE T+K L +
Sbjct: 1211 TESSEKIKTLNAELEALTEKLNQNEME--AEKLRENLLKDQNASKEEISALENTLKELHS 1268

Query: 100  ERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIK 159
            E+E+ +++++ LE  ++ L+ E D  K+K  + +E   +Q+++ N+     +A +E+   
Sbjct: 1269 EKETQMEEKSKLESDIKSLETELDETKKKHASEIESLHEQIEKINEETKDFKAIIEKAAS 1328

Query: 160  QL-RNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQL-RDESAALNMKRAS- 216
             L R ++DL + +E                + + E  L  KI ++   E A L  K+ S 
Sbjct: 1329 NLKRTESDLKVLKEK---------------YTKNEEKL--KIMEVASKEHAELKKKQESE 1371

Query: 217  ---LEERLKLLEAD 227
               LE RLK+LE D
Sbjct: 1372 YELLENRLKMLEED 1385


>gi|361127712|gb|EHK99672.1| putative Myosin type-2 heavy chain 1 [Glarea lozoyensis 74030]
          Length = 1971

 Score = 38.9 bits (89), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 67/122 (54%), Gaps = 12/122 (9%)

Query: 218 EERLKLLEADKDSWTQMESVSK--ETIAGLSVD----ITQLRMQVVELEESRNNLLQENR 271
           EER +L E  ++++T+M+ V +  E+   L++D      +L+++  EL +  +  L +  
Sbjct: 520 EERQRLDEERRNTYTEMQRVQQTLESERALALDKEEIFKRLQLRETELSDKLSGALDDQE 579

Query: 272 QLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVK 331
           +L++ +  L       E+KK A    + + TQ+E AG +I KL  E  EL E+++ L  +
Sbjct: 580 KLEDQLDDLL------EAKKKAEQNSEQWRTQLEQAGQIIAKLELEKRELSERLDSLEAQ 633

Query: 332 LD 333
           L+
Sbjct: 634 LE 635


>gi|383852228|ref|XP_003701630.1| PREDICTED: major antigen-like [Megachile rotundata]
          Length = 2226

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 70/276 (25%), Positives = 135/276 (48%), Gaps = 29/276 (10%)

Query: 110  TLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHI 169
            +L   V +L+N+ D+ + K+Q  LE  + +LQ++ND        LE+  + LR      +
Sbjct: 1007 SLNDEVSKLRNQLDIAERKIQ-ELEPLVDRLQKENDKLQNDLKALEDDARNLR------L 1059

Query: 170  QREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKD 229
            + +GG+  N   ++++      +   L +K+   + E+ AL  +  +L  R KLL+ D +
Sbjct: 1060 RLDGGMSDN-ERMRNDMAMLESQVGDLNEKLKAAKAENDALQQENQTL--RAKLLDMDNE 1116

Query: 230  -SWTQMESVS-KETIAGLSVDITQLRMQVVELEESRNNLLQENRQLKENVSSLRSQLSSD 287
             S  + E    K  IA L+  I++LR ++ +LEE   +   EN +L+  +  L++ L   
Sbjct: 1117 LSQAKQECADLKAEIADLNNLISELRAKIAKLEEDVEHWKLENCKLQIEIDKLKADL--- 1173

Query: 288  ESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEKVNDLSVKLDRQSVAAGL-SSAIG 346
               + A  +  +     +A  A +++L  E  EL +++  L+ +++ Q  AA L  SA G
Sbjct: 1174 ---EKALKDLSECQALKKAQEAELNRLQNEKAELNKQIAGLTAQIEEQKKAAELEKSAKG 1230

Query: 347  S---------DAVTALTSDTEPMS-ESSDNMSSLNN 372
                      D + AL  + E +  E++D  + ++N
Sbjct: 1231 ESEAKLKALQDELNALKKELEKLRMENNDYKNEMDN 1266


>gi|363750548|ref|XP_003645491.1| hypothetical protein Ecym_3173 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889125|gb|AET38674.1| Hypothetical protein Ecym_3173 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 839

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 70/153 (45%), Gaps = 24/153 (15%)

Query: 191 QKEAALEQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDIT 250
           QK AA  + +  L  E+ +L +K A  +E L     DK    +     +E +  L+VDI 
Sbjct: 564 QKFAAQHEVVKLLTQETESLKLKLAGCQESLSATNKDKQETLEKSQKLQEHLHKLNVDIV 623

Query: 251 QLRMQVVELEESRNNLL--------------QENRQLKENVSSLRSQLSSDESKKHATSE 296
           QL+   +ELEE    L                E RQLKE V ++      D  ++   +E
Sbjct: 624 QLKAHELELEEENRKLTNNIELIESTNKEEDYELRQLKEQVKTM------DLERRGFVTE 677

Query: 297 QKDFSTQIEAAGALIDKLITENIELVEKVNDLS 329
           + D+  +IE    +++K I +  + + K+ DLS
Sbjct: 678 RLDYQDRIE----MLEKQIMQLTKQLTKITDLS 706


>gi|322693568|gb|EFY85424.1| filament-forming protein [Metarhizium acridum CQMa 102]
          Length = 2026

 Score = 38.9 bits (89), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 68/142 (47%), Gaps = 6/142 (4%)

Query: 196  LEQKISQLRDESAALNMKRASLEERLKLLEADKDSW-TQMESVSKETIAGLSVDITQLRM 254
            +E K+  L  E   L M+++ LEE +K ++ D+D W  + E +  +       ++ QL+ 
Sbjct: 1331 MEGKVQPLEVEIDNLKMQKSYLEEEIKQIQEDRDRWQKRTEGILTKYGRVDPAEMEQLKK 1390

Query: 255  QVVELEESRNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKL 314
             + +LE  R+ L Q  + L+E ++ L S L S+      T  +   + Q +       KL
Sbjct: 1391 TISDLEAERDCLKQGEQPLREKITELESTLESERQAWQTT--RAKLTEQFKERSR---KL 1445

Query: 315  ITENIELVEKVNDLSVKLDRQS 336
              E  E  ++ N L  +LD+ S
Sbjct: 1446 TGEKNEAAQRANHLQEQLDKAS 1467


>gi|348680404|gb|EGZ20220.1| hypothetical protein PHYSODRAFT_489768 [Phytophthora sojae]
          Length = 615

 Score = 38.5 bits (88), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 43/77 (55%)

Query: 197 EQKISQLRDESAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQV 256
           E+K+S L DE  AL++K+A LE+RL+ L  +KD   +     +      + ++  L  ++
Sbjct: 295 EEKLSALLDEGQALSVKQAQLEQRLRALRKEKDELEERALKLQSQFEASAEEVKDLTTKL 354

Query: 257 VELEESRNNLLQENRQL 273
              EE +  L QENRQL
Sbjct: 355 KASEEEKTRLAQENRQL 371


>gi|342319886|gb|EGU11831.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
          Length = 1301

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 77/275 (28%), Positives = 129/275 (46%), Gaps = 47/275 (17%)

Query: 52   DIQNVQVDADRHQSNGAESANLA------EAERQHLLQREAMAI-LEETVKHLRNERESH 104
            D++  ++D D   S  A  A LA      E  +Q + Q E ++  L  T K L   +ES 
Sbjct: 760  DVEIAELDRDLQASESAYEALLASDREGQEQRQQAVRQVEKLSGELAATNKELLRAQESL 819

Query: 105  IQKEAT----LEGTVQQLQNE-----CDLYKEKVQAT-LEETIQQLQRQNDLRMQKEATL 154
              ++ T    LE  ++QL+ E      DL   + ++T LE  ++ LQ+Q +L  Q++A  
Sbjct: 820  AAEQRTEKGKLEKEIEQLKAELDKALLDLSTVRTRSTELEADLEGLQKQAELVGQEKAVS 879

Query: 155  EETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAAL----EQKISQLRDESAAL 210
            E  +++  ++N       G L   +  LQ+  +    +E AL    +Q I+ +  + AAL
Sbjct: 880  EAALREASSRNQQLANENGKLREEVVALQARLQATEAEEKALTQADQQAITAMNGDLAAL 939

Query: 211  NMKRASLEERLKLL---------------EADKD--SW-TQMESVSKETIAGLSVDI--- 249
              +   L++ L+ L               EA K+  +W  Q+ES   + IAGL  D+   
Sbjct: 940  AGENEDLQDVLEELRPLAEKYVDHEDLECEAVKEMQTWRAQVESRDAK-IAGLEGDLAAQ 998

Query: 250  ----TQLRMQVVELEESRNNLLQENRQLKENVSSL 280
                T+L+ +  E E  RN LLQEN  L+++  +L
Sbjct: 999  LCVDTELQTRAEEAESERNELLQENAALEQDKVAL 1033


>gi|111185907|ref|NP_079552.3| envoplakin [Mus musculus]
 gi|341941103|sp|Q9D952.3|EVPL_MOUSE RecName: Full=Envoplakin; AltName: Full=210 kDa cornified envelope
            precursor protein; AltName: Full=p210
 gi|162318566|gb|AAI56423.1| Envoplakin [synthetic construct]
 gi|225000372|gb|AAI72666.1| Envoplakin [synthetic construct]
          Length = 2035

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 74/254 (29%), Positives = 120/254 (47%), Gaps = 48/254 (18%)

Query: 89   ILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRM 148
            I+EE VK         ++K+  LE + + L+ E D  K +V   L    Q LQ Q DL +
Sbjct: 1462 IMEEVVK---------LEKDPDLERSTEALRRELDQEKNRV-TELHRECQGLQVQVDL-L 1510

Query: 149  QKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFW--LQKEAALEQKISQLRDE 206
            QK  + E+TI     +  + ++++  LE   A +      W  L +E A  +     R+E
Sbjct: 1511 QKTKSQEKTI----YKEVIRVEKDPVLEGERARV------WEILNRERAARKG----REE 1556

Query: 207  SAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNL 266
                     SL+ER+   EA + SW++ E+  +      S D  +L+ Q+ ELE+ +   
Sbjct: 1557 DVR------SLQERIDRAEALRRSWSREEAELQRARDQASQDCGRLQRQLRELEQQKQ-- 1608

Query: 267  LQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEK-- 324
             Q+ RQL+E    L SQ +  E +K A   Q    TQ+EAA      ++ E  ++ EK  
Sbjct: 1609 -QKARQLQEE-GRLLSQKTESERQKAAQRSQA--VTQLEAA------ILQEKDKIYEKER 1658

Query: 325  -VNDLSVKLDRQSV 337
             + DL  K+ R+ +
Sbjct: 1659 TLRDLHTKVSREEL 1672


>gi|213410371|ref|XP_002175955.1| spindle assembly checkpoint component mad1 [Schizosaccharomyces
           japonicus yFS275]
 gi|212004002|gb|EEB09662.1| spindle assembly checkpoint component mad1 [Schizosaccharomyces
           japonicus yFS275]
          Length = 699

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 146/330 (44%), Gaps = 68/330 (20%)

Query: 72  NLAEAERQHLLQREAMAILEETVKHLRNERESHIQKEA--------TLEGTVQQLQNECD 123
           N+A  E++ +   E   +LE+ +K L NER+  +Q +          L  TV  L+  C+
Sbjct: 161 NVAAMEKKVISSEEQYTLLEKQLK-LTNERKEELQTKYQVVVEECDKLRDTVTSLEEACN 219

Query: 124 LYKEKVQATLEETIQQLQRQNDLRMQKEATLEETIKQLRNQNDLHIQREGGLEMNIANLQ 183
           L   K Q T  E+I+ LQ QN+    K  +LE+ +    ++    +      + N   L+
Sbjct: 220 LQSVKAQDT--ESIKALQIQNEQLQTKLNSLEKLV----DRQSATLSSNALEKHNFKLLE 273

Query: 184 SEKEFWLQKEAALEQKISQLRDESAALNMKRASLEERLK-----LLEADKD--------S 230
            EK+  L K + L+      RD+ A L +K   LE +L+     L E  ++        S
Sbjct: 274 EEKKSLLTKLSVLD----GFRDKVATLELKNNELEGKLRPYLELLGETKREPHDILHELS 329

Query: 231 WTQMESVS--------KETIAGLSVDIT--------------------QLRMQVVELEES 262
             +ME+ S         ET+A L  ++                     +L MQ+  L   
Sbjct: 330 ALEMENKSLREESNRLTETVAKLKTELAGANSVPELEEEITSLNETQRELAMQLRRLTLQ 389

Query: 263 RNNLLQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALID--KLITENIE 320
           ++  L+E   L+EN+ S     S +ES     +  K  + +I++   LID  K   EN+ 
Sbjct: 390 KDLALREVHLLRENLKS----YSEEESVLSPETYDKKKTERIDSLTKLIDDYKSTLENVS 445

Query: 321 LVEKVNDLSVKLDRQSVAAGLSSAIGSDAV 350
           +  +V D+ VK  R+S+  GLS +  SD++
Sbjct: 446 IKPEVMDVPVKRKRESL--GLSRSNFSDSL 473


>gi|123134363|ref|XP_001277101.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121820009|gb|EAX64171.1| hypothetical protein TVAG_574510 [Trichomonas vaginalis G3]
          Length = 247

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 67/116 (57%), Gaps = 1/116 (0%)

Query: 90  LEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRMQ 149
           L + VK L+++  +  +K+  LEG +Q+L+++ D  +++    LE  IQ+L+ ++  + +
Sbjct: 3   LADKVKGLQDKDAAQDEKDKNLEGAIQELKDK-DAAQDEKDKNLEGAIQELKDKDAAQDE 61

Query: 150 KEATLEETIKQLRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRD 205
           K+  LE  I++L++++    +++  LE  I  L+ +     +K+ A E+ I  L D
Sbjct: 62  KDKNLEGAIQELKDKDAAQDKKDKNLEGAIQELKDKDAAQDEKDKANEEAIKSLAD 117


>gi|14270509|emb|CAC38864.2| envoplakin [Mus musculus]
          Length = 2035

 Score = 38.1 bits (87), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 73/254 (28%), Positives = 119/254 (46%), Gaps = 48/254 (18%)

Query: 89   ILEETVKHLRNERESHIQKEATLEGTVQQLQNECDLYKEKVQATLEETIQQLQRQNDLRM 148
            I+EE VK         ++K+  LE + + L+ E D  K +V   L    Q LQ Q DL +
Sbjct: 1462 IMEEVVK---------LEKDPDLERSTEALRRELDQEKNRV-TELHRECQGLQVQVDL-L 1510

Query: 149  QKEATLEETIKQ--LRNQNDLHIQREGGLEMNIANLQSEKEFWLQKEAALEQKISQLRDE 206
            QK  + E+TI +  +R + D  ++ E      I N         ++ AA + +   +R  
Sbjct: 1511 QKTKSQEKTIYKEVIRVEKDPVLEGERARVWEILN---------RERAARKGREEDVR-- 1559

Query: 207  SAALNMKRASLEERLKLLEADKDSWTQMESVSKETIAGLSVDITQLRMQVVELEESRNNL 266
                     SL+ER+   EA + SW++ E+  +      S D  +L+ Q+ ELE+ +   
Sbjct: 1560 ---------SLQERIDRAEALRRSWSREEAELQRARDQASQDCGRLQRQLRELEQQKQ-- 1608

Query: 267  LQENRQLKENVSSLRSQLSSDESKKHATSEQKDFSTQIEAAGALIDKLITENIELVEK-- 324
             Q+ RQL+E    L SQ +  E +K A   Q    TQ+EAA      ++ E  ++ EK  
Sbjct: 1609 -QKARQLQEE-GRLLSQKTESERQKAAQWSQA--VTQLEAA------ILQEKDKIYEKER 1658

Query: 325  -VNDLSVKLDRQSV 337
             + DL  K+ R+ +
Sbjct: 1659 TLRDLHTKVSREEL 1672


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.306    0.122    0.314 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,366,948,440
Number of Sequences: 23463169
Number of extensions: 249752456
Number of successful extensions: 1399276
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1799
Number of HSP's successfully gapped in prelim test: 57384
Number of HSP's that attempted gapping in prelim test: 1182718
Number of HSP's gapped (non-prelim): 174388
length of query: 474
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 328
effective length of database: 8,933,572,693
effective search space: 2930211843304
effective search space used: 2930211843304
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 79 (35.0 bits)