BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 004163
         (771 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255562854|ref|XP_002522432.1| conserved hypothetical protein [Ricinus communis]
 gi|223538317|gb|EEF39924.1| conserved hypothetical protein [Ricinus communis]
          Length = 751

 Score =  806 bits (2083), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/774 (56%), Positives = 537/774 (69%), Gaps = 35/774 (4%)

Query: 4   GEGDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALY 63
           GEG+ +R  C++C +VG E + DGFYYC  C A  D+I+ TGVADEDF+      G ALY
Sbjct: 5   GEGETKRWACRRCGHVGLEES-DGFYYCQECGAQADDIILTGVADEDFIEKDGEGGGALY 63

Query: 64  QASHARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREE 123
            A   R    +R      P SQ    ++   ++ N TTTTT    Y+++     +IK+EE
Sbjct: 64  SARFTRYSQPTRTI-QTNPSSQAWFRYTQEEEDINFTTTTTLNGTYSNI-----KIKKEE 117

Query: 124 V----SYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFN 179
                 Y D +GP EP+DFG        YEDY+ EVR++YVMGMQ MIQLQCE+LV+KFN
Sbjct: 118 RFDDDEYLDGLGPVEPEDFGGKS---LSYEDYYNEVRIRYVMGMQWMIQLQCESLVEKFN 174

Query: 180 VCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLH 239
           V PLICGVA ++W RFL +TG+    WAD+ I++SESQ   E +D++PR+ +R+EPH  +
Sbjct: 175 VSPLICGVAGNVWLRFLVATGVFKDNWADDVILESESQVQGEPEDWKPRSSHRNEPHNAY 234

Query: 240 GQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIE 299
           GQRAVM+W+K LR+ IPLSSSLAISFLACHV REAILPTDIV+WSIEGK+PYFAA VEIE
Sbjct: 235 GQRAVMVWFKYLRKTIPLSSSLAISFLACHVAREAILPTDIVRWSIEGKLPYFAAHVEIE 294

Query: 300 KRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQL 359
           KRF  +S AC +S S MFRPS++VP+QKLES AA+ AESIGLHLPPVNFY +ASRYLK L
Sbjct: 295 KRFEHSSPACPISSSLMFRPSQAVPAQKLESMAAAFAESIGLHLPPVNFYEIASRYLKNL 354

Query: 360 CLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEK 419
            LP+ KILP A +I EWSMPPDLWLSTNE R PTRV VMSILIV+IRILYN+NGFGAWE+
Sbjct: 355 ALPVEKILPHACRIYEWSMPPDLWLSTNELRLPTRVTVMSILIVAIRILYNLNGFGAWER 414

Query: 420 SLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESEL 479
           SLSS     S S+   + D  C S M+ + E  + SP   +  S  K  +N SH +  EL
Sbjct: 415 SLSSLNCSPSNSHPASRLDSMCRSVMQGDAE--TGSPFYSLDGSAEKFLRNPSHMQMPEL 472

Query: 480 DAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPH-DDPEEAIMIEKF 538
           D+  LL+ LE +YN I D+  YE++KDLP+YL+YCKDVVFAG  P H DD EE  ++EK 
Sbjct: 473 DSAELLHHLEVKYNFIADA--YEFTKDLPSYLQYCKDVVFAGAGPSHMDDLEEEELMEKL 530

Query: 539 WKFYQNEKESE-AAEDSGKRCGIASKRKRSR-DDLSLGSHYKESEKIREKEFTTRLSASA 596
           W FYQNEK+SE A E   +     S +KRSR DD S+  +  E EKI+E+      S SA
Sbjct: 531 WDFYQNEKDSELAKEPRTQSSSRLSNQKRSRNDDGSVFVNLSEKEKIKEEWHD---SPSA 587

Query: 597 DYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCY 656
           D  S + D+   QS +  + S +S E  N E  ++ S +T+   A++RLKLDME NRFCY
Sbjct: 588 DISSHNADNSSHQSFDNGHFSNNSLEDQNVEHKEKDSEKTLEGRAIRRLKLDMEENRFCY 647

Query: 657 ISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERR 716
           I PRV +KR DYL YVRK DEGA TY AHADYYILLRACA +AQVDIR MH+ VLSFERR
Sbjct: 648 IPPRVNLKRFDYLHYVRKKDEGAFTYVAHADYYILLRACARVAQVDIRIMHIGVLSFERR 707

Query: 717 LAWLEKRIDHCLHLTPPSVTCRFCSDPVELAAGDPICNATEDSADDHIGLSNLN 770
           LAWLEKRID+CLHL+PP++TC FC D  +            +S DD IGLS LN
Sbjct: 708 LAWLEKRIDYCLHLSPPTITCEFCRDMPD-----------HNSNDDVIGLSKLN 750


>gi|225425686|ref|XP_002269865.1| PREDICTED: uncharacterized protein LOC100241425 [Vitis vinifera]
          Length = 722

 Score =  752 bits (1942), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 412/769 (53%), Positives = 521/769 (67%), Gaps = 65/769 (8%)

Query: 11  LKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ 70
           L C  C +VGF    DGF+YC  C +  ++I++TGVA+EDF+A G   G A+Y ASH RQ
Sbjct: 7   LTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAKGDARG-AIYSASHRRQ 65

Query: 71  VANSRPANHFVPLSQ-QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEV---SY 126
             +  P     PLSQ QS F + L          T   +Y        R++ EE    + 
Sbjct: 66  RHSIAPKPE--PLSQSQSQFLNNL----------TLDDDY--------RVENEETREETV 105

Query: 127 TDMVGPTEPQDFGSS--GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLI 184
            D VGP+ P DFG    G     +EDY+ ++R++YVMG+Q+MI+LQC+ALV+KF   PLI
Sbjct: 106 ADEVGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLI 165

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAV 244
           CGVA +IW RF+A+T +    WAD+ I  SE Q+  ES+D +PRAKY  EPH ++GQRAV
Sbjct: 166 CGVAGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAV 225

Query: 245 MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQ 304
           +IW++ L++KIPLS SL ISFLACH+ REAILPTDI+KWS+EGK+PYFAAF+EIEK+ G 
Sbjct: 226 IIWHRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGP 285

Query: 305 TSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 364
            S  C LS SFMFRPS+++P QKLE+ AASIA+ IGLHLPPVNFYA+A RYL+QL LP+ 
Sbjct: 286 PSSPCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVE 345

Query: 365 KILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSR 424
           KILP A ++ EWSMPPDLWLS NE R PTRVCVMSILIV+IRILYN++GFG WE SLSS 
Sbjct: 346 KILPYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSS 405

Query: 425 KFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEAL 484
              SS+S+   K +   N KM D  ++   SP   +  SN +   NSSH ++SE DA  L
Sbjct: 406 SGSSSSSSQIVKLNASDNIKMMDGAKQ--GSPLHDLNGSNEEPVTNSSHAQKSEFDATEL 463

Query: 485 LYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQN 544
           L +L+ARY+++ D  TYEYSKDLPTYL+YCKDVVFAGLE P +D EE  +IE+ W+FYQN
Sbjct: 464 LCNLDARYDELID--TYEYSKDLPTYLQYCKDVVFAGLELPFEDHEEEKIIEQLWEFYQN 521

Query: 545 EKESEAAEDSGKRCGIASKRKRSRDDLS-LGSHYKESEKIREKEFTTRLSASADYESFSG 603
           +K+SE +ED G  CG A   KRSR+D   + S  KE +KIR+                  
Sbjct: 522 QKDSEPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRD------------------ 563

Query: 604 DDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQI 663
           D      L+GD++S +S+    S    +AS ET+ + A+ R+K DME NRFCYI PRV +
Sbjct: 564 DCSVPLGLDGDDTSLNSQGGQKSVPTHQASVETLKEEAILRMKADMEENRFCYIPPRVNV 623

Query: 664 KRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKR 723
           KR DYL YVRK DEG+  YAAHADYYILLRACA +AQVD+RSMH+ V+S ERRL W+EKR
Sbjct: 624 KRFDYLHYVRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKR 683

Query: 724 IDHCLHLTPPSVTCRFCSDPVELAAGDPICN--ATEDSADDHIGLSNLN 770
           IDHCLH  PP    +F SDP         CN  A E S DD++  S+LN
Sbjct: 684 IDHCLHFKPP----KFSSDP---------CNDDAPECSTDDYVEFSSLN 719


>gi|147780094|emb|CAN73288.1| hypothetical protein VITISV_027088 [Vitis vinifera]
          Length = 722

 Score =  744 bits (1922), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/767 (52%), Positives = 516/767 (67%), Gaps = 61/767 (7%)

Query: 11  LKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ 70
           L C  C +VGF    DGF+YC  C +  ++I++TGVA+EDF+A G   G A+Y ASH RQ
Sbjct: 7   LTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAKGDARG-AIYSASHRRQ 65

Query: 71  VANSRPANHFVPLSQ-QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEV---SY 126
             +  P     PLSQ QS F + L          T   +Y        R++ EE    + 
Sbjct: 66  RHSIAPKPE--PLSQSQSQFLNNL----------TLDDDY--------RVENEETREETV 105

Query: 127 TDMVGPTEPQDFGSS--GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLI 184
            D VGP+ P DFG    G     +EDY+ ++R++YVMG+Q+MI+LQC+ALV+KF   PLI
Sbjct: 106 ADEVGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLI 165

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAV 244
           CGVA +IW RF+A+T +    WAD+ I  SE Q+  ES+D +PR KY  EPH ++GQRAV
Sbjct: 166 CGVAGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRTKYSAEPHNIYGQRAV 225

Query: 245 MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQ 304
           +IW++ L++KIPLS SL ISFLACH+ REAILPTDI+KWS+EGK+PYFAAF+EIEK+ G 
Sbjct: 226 IIWHRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGP 285

Query: 305 TSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 364
            S  C LS SFMFRPS+++P QKLE+ AASIA+ IGLHLPPVNFYA+A RYL+QL LP+ 
Sbjct: 286 PSSPCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVE 345

Query: 365 KILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSR 424
           KILP A ++ EWSMPPDLWLS NE R PTRVCVMSILIV+IRILYN++GFG WE SLSS 
Sbjct: 346 KILPYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSS 405

Query: 425 KFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEAL 484
              SS+S+   K +   N KM D  ++   SP   +  SN +   NSSH ++SE DA  L
Sbjct: 406 SGSSSSSSQIVKLNASDNIKMMDGAKQ--GSPLHDLNGSNEEPVTNSSHAQKSEFDATEL 463

Query: 485 LYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQN 544
           L +L+ARY+++ D  TYEYSKDLPTYL+YCKDVVFAGLE P +D EE  +IE+ W+FYQN
Sbjct: 464 LCNLDARYDELID--TYEYSKDLPTYLQYCKDVVFAGLELPFEDHEEEKIIEQLWEFYQN 521

Query: 545 EKESEAAEDSGKRCGIASKRKRSRDDLS-LGSHYKESEKIREKEFTTRLSASADYESFSG 603
           +K+SE +ED G  CG A   KRSR+D   + S  KE +KIR+                  
Sbjct: 522 QKDSEPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRD------------------ 563

Query: 604 DDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQI 663
           D      L+GD++S +S+    S    +AS ET+ + A+ R+K DME NRFCYI  RV +
Sbjct: 564 DCSVPLGLDGDDTSLNSQGGQXSVPTHQASVETVKEEAILRMKADMEENRFCYIPXRVNV 623

Query: 664 KRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKR 723
           KR DYL YVRK DEG+  YAAHADYYILLRACA +AQVD+RSMH+ V+S ERRL W+EKR
Sbjct: 624 KRFDYLHYVRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKR 683

Query: 724 IDHCLHLTPPSVTCRFCSDPVELAAGDPICNATEDSADDHIGLSNLN 770
           IDHCLH  PP  +   C+            +A E S DD++  S+LN
Sbjct: 684 IDHCLHFKPPKFSSDXCNX-----------DAPECSTDDYVEFSSLN 719


>gi|356541066|ref|XP_003539004.1| PREDICTED: uncharacterized protein LOC100789168 [Glycine max]
          Length = 719

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/770 (48%), Positives = 502/770 (65%), Gaps = 60/770 (7%)

Query: 7   DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQAS 66
           D+ +L C+ C+NVG     DGF+YC  C +  +++++T V D+D         +A+Y  S
Sbjct: 3   DVHKLICQTCNNVGVGDGSDGFFYCLRCGSQCEDVMDTAVDDDDLF-----NKSAVYLPS 57

Query: 67  HARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREE--- 123
           H RQ + +  A    P+SQ  SF    +D  +         + N   +   ++KREE   
Sbjct: 58  HQRQRSVAIKAE---PISQYDSF----YDSHSNFIRNLGLEDENPQRNDHVQVKREEEFD 110

Query: 124 VSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPL 183
            +  D   P+ P DFG  G   A +EDYH E+RM+YVMG+Q+MI+LQCEALV +F V PL
Sbjct: 111 AAQFDEASPSVPADFG--GSQVACFEDYHNEIRMRYVMGLQMMIELQCEALVKEFKVTPL 168

Query: 184 ICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRA 243
           ICG+   IW RF++ TG+    WAD+ I  SE+Q   E +D++ RAKYR EPH + G+RA
Sbjct: 169 ICGLVGPIWLRFVSKTGIFGDDWADKVIHDSETQNEDEPEDYKLRAKYRTEPHNMFGKRA 228

Query: 244 VMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFG 303
           VMIW++ L+++IPL  ++A+SFLACHV REA+LP+D++KW++EGK+PYF+AFVEIEKR G
Sbjct: 229 VMIWFRSLKKRIPLPCTVAVSFLACHVAREAVLPSDMMKWTLEGKLPYFSAFVEIEKRMG 288

Query: 304 QTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 363
           Q + AC +S S MFRP +++P QKLESFAASIA+ IGL LPPVNFYA+A  +L++L LP+
Sbjct: 289 QPTSACPISSSVMFRPQQAIPVQKLESFAASIAQFIGLELPPVNFYAIAYLFLQKLSLPV 348

Query: 364 GKILPRALKIQEWSMPPDLW--LSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSL 421
            KILP A +I EWSMPPDLW  LS+N  R PT VCVMSIL+V+IRILYNINGFG WEKSL
Sbjct: 349 EKILPYACRIYEWSMPPDLWLSLSSNYFRLPTHVCVMSILVVAIRILYNINGFGEWEKSL 408

Query: 422 SSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDA 481
           S        S +G  F  +                    G  +  S+++    ++ ELD+
Sbjct: 409 SRDDDAKDNSEMGNAFASDNGHDF---------------GKESKDSAEDQVGSQKHELDS 453

Query: 482 EALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKF 541
             LL  L+ARYN+I D  TY YSKDL TYLKYC+DVVFAG EP + + EE  MIE  WKF
Sbjct: 454 AWLLQHLQARYNEIAD--TYGYSKDLATYLKYCRDVVFAGSEPSYGNHEEEKMIEYLWKF 511

Query: 542 YQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESF 601
           YQNE++++ +E+  ++   +  +  SRD   +G   KE EKIR+K F          E F
Sbjct: 512 YQNEEDTKPSENV-EQSNTSFNQTSSRDKGCIGKISKE-EKIRKKGFN---------ELF 560

Query: 602 SGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRV 661
             DD   + ++ +NS +S  +  +S++ + +S ++ +  A++++KL+ME NRF YI P V
Sbjct: 561 PDDDTSLEDVSNNNSHESLSDSEDSDSQEHSSGKSHVKEAIRQMKLEMEENRFYYIPPSV 620

Query: 662 QIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLE 721
           + KRLDY+ YVRK DEGALTY AHADYYILLRACA IAQVDIR +H+ VL  ERRLAWLE
Sbjct: 621 KRKRLDYIHYVRKRDEGALTYVAHADYYILLRACARIAQVDIRILHIGVLRLERRLAWLE 680

Query: 722 KRIDHCLHLTPPSVTCRFCSDPVELAAGDPICN-ATEDSADDHIGLSNLN 770
           KRID CLH  P S++C+FC            CN ATE+ +D   GLSNLN
Sbjct: 681 KRIDQCLHSKPTSISCQFC------------CNKATENGSDGLPGLSNLN 718


>gi|296086368|emb|CBI31957.3| unnamed protein product [Vitis vinifera]
          Length = 713

 Score =  674 bits (1739), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/768 (49%), Positives = 468/768 (60%), Gaps = 158/768 (20%)

Query: 11  LKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ 70
           L C  C +VGF    DGF+YC  C +  ++I++TG                         
Sbjct: 7   LTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTG------------------------- 41

Query: 71  VANSRPANHFVPLSQ-QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTDM 129
                      PLSQ QS F + L  + +       T              REE +  D 
Sbjct: 42  -----------PLSQSQSQFLNNLTLDDDYRVENEET--------------REE-TVADE 75

Query: 130 VGPTEPQDFGSS--GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGV 187
           VGP+ P DFG    G     +EDY+ ++R++YVMG+Q+MI+LQC+ALV+KF   PLICGV
Sbjct: 76  VGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGV 135

Query: 188 AASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVMIW 247
           A +IW RF+A+T +    WAD+ I  SE Q+  ES+D +PRAKY  EPH ++GQRAV+IW
Sbjct: 136 AGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAVIIW 195

Query: 248 YKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSV 307
           ++ L++KIPLS SL ISFLACH+ REAILPTDI+KWS+EGK+PYFAAF+EIEK+ G  S 
Sbjct: 196 HRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSS 255

Query: 308 ACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKIL 367
            C LS SFMFRPS+++P QKLE+ AASIA+ IGLHLPPVNFYA+A RYL+QL LP+ KIL
Sbjct: 256 PCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKIL 315

Query: 368 PRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFF 427
           P A ++ EWSMPPDLWLS NE R PTRVCVMSILIV+IRILYN++GFG WE SLS     
Sbjct: 316 PYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLS----- 370

Query: 428 SSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYD 487
                                                 K   NSSH ++SE DA  LL +
Sbjct: 371 --------------------------------------KPVTNSSHAQKSEFDATELLCN 392

Query: 488 LEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKE 547
           L+ARY+++ D  TYEYSKDLPTYL+YCKDVVFAGLE P +D EE  +IE+ W+FYQN+K+
Sbjct: 393 LDARYDELID--TYEYSKDLPTYLQYCKDVVFAGLELPFEDHEEEKIIEQLWEFYQNQKD 450

Query: 548 SEAAEDSGKRCGIASKRKRSRDDLS-LGSHYKESEKIREKEFTTRLSASADYESFSGDDY 606
           SE +ED G  CG A   KRSR+D   + S  KE +KIR                      
Sbjct: 451 SEPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIR---------------------- 488

Query: 607 PQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRL 666
                  D+ S                 ET+ + A+ R+K DME NRFCYI PRV +KR 
Sbjct: 489 -------DDCSV-------------PLVETLKEEAILRMKADMEENRFCYIPPRVNVKRF 528

Query: 667 DYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDH 726
           DYL YVRK DEG+  YAAHADYYILLRACA +AQVD+RSMH+ V+S ERRL W+EKRIDH
Sbjct: 529 DYLHYVRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDH 588

Query: 727 CLHLTPPSVTCRFCSDPVELAAGDPICN--ATEDSADDHIGLS-NLNT 771
           CLH  PP    +F SDP         CN  A E S DD++  S NLNT
Sbjct: 589 CLHFKPP----KFSSDP---------CNDDAPECSTDDYVEFSRNLNT 623


>gi|449518998|ref|XP_004166522.1| PREDICTED: TATA box-binding protein-associated factor RNA
           polymerase I subunit B-like [Cucumis sativus]
          Length = 735

 Score =  663 bits (1710), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/757 (48%), Positives = 477/757 (63%), Gaps = 58/757 (7%)

Query: 6   GDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAG-TGTALYQ 64
            D   L C  C ++G    +DGF+YC  C +  D+I++TGVA+ED +    G +G  +Y 
Sbjct: 2   ADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYS 61

Query: 65  ASHARQVANSRPANHFVPLSQQSSFW----SPLFDEPNTTTTTTTTANYNDLDSIIHRIK 120
            SH R+   +       PLSQ  S +    S  +D  N     +      D D ++    
Sbjct: 62  QSHTRR--RNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVM---- 115

Query: 121 REEVSYTDMVGPTEPQDFGSSGPV--KADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  D VGPT P+DFGS   +  K  +E+Y  EVRM+YVMG+QL+++LQCE LV +F
Sbjct: 116 -----LNDGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEF 170

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTL 238
              P+ICG+AASIW RF+ +T +  + WA + + +SESQ L   +  +  + ++DEPH  
Sbjct: 171 KATPIICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNF 230

Query: 239 HGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI 298
           +GQR V++W K LR+KIPL S+LA+SFLACHV REAILPTDI+KWS+EGK+PY+AAFV+I
Sbjct: 231 YGQRVVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDI 290

Query: 299 EKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQ 358
           E R G+TS AC +S   M RPS+    QKLES AASIA +IGL+LPPVNF+++A RYL +
Sbjct: 291 ESRIGKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNK 350

Query: 359 LCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWE 418
           L LP+ KILP A +I EWSMPPDLWLSTNE R P+RVCVMSILI+++RILYN++GFG WE
Sbjct: 351 LALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWE 410

Query: 419 KSLS---------SRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSK 469
           KSLS          +K  SS +N    F     S M+ + E      S  +   N   S 
Sbjct: 411 KSLSVDCASCFPPHQKTHSSPAN---NF-----SNMQADSENRPGFTSHDV--DNPSVSP 460

Query: 470 NSSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPH-DD 528
            + H   +E      L  +EARY++I +  TYEYSKDLPTYL+YCKDV FAG E    DD
Sbjct: 461 ENPHLTTTE-----FLRKIEARYHEIAE--TYEYSKDLPTYLQYCKDVAFAGSESLFIDD 513

Query: 529 PEEAIMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKR--SRDDLSLGSHYKESEKIREK 586
            +E  MIEK W +YQNEK+ +  ED  +      KR R  S D LS      ES+K++ +
Sbjct: 514 HDEQKMIEKLWNYYQNEKDYDQTEDVDQNAASNQKRLREGSNDRLS-----NESKKVKGE 568

Query: 587 EFTTRLS-ASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRL 645
           E   R+S  S +  + S D     S  G ++S   E+    +A   AS+ T I+ A+++L
Sbjct: 569 E--DRISRESLNNRTGSIDSRQSHSSKGLDNSDDDEQSSVDKA---ASSLTSINEAIRQL 623

Query: 646 KLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRS 705
           KLDME  RFCYI PR+  KR DYL Y RK DEGALTYAAHADYYILLRACA  AQVDIR 
Sbjct: 624 KLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLRACARAAQVDIRI 683

Query: 706 MHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCRFCSD 742
           MH+ VLS E+RL+WLE RI   L LTP S+TC FCSD
Sbjct: 684 MHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720


>gi|449470354|ref|XP_004152882.1| PREDICTED: TATA box-binding protein-associated factor RNA
           polymerase I subunit B-like [Cucumis sativus]
          Length = 735

 Score =  661 bits (1705), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/769 (47%), Positives = 475/769 (61%), Gaps = 82/769 (10%)

Query: 6   GDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAG-TGTALYQ 64
            D   L C  C ++G    +DGF+YC  C +  D+I++TGVA+ED +    G +G  +Y 
Sbjct: 2   ADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYS 61

Query: 65  ASHARQVANSRPANHFVPLSQQSSFW----SPLFDEPNTTTTTTTTANYNDLDSIIHRIK 120
            SH R+   +       PLSQ  S +    S  +D  N     +      D D ++    
Sbjct: 62  QSHTRR--RNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVM---- 115

Query: 121 REEVSYTDMVGPTEPQDFGSSGPV--KADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  D VGPT P+DFGS   +  K  +E+Y  EVRM+YVMG+QL+++LQCE LV +F
Sbjct: 116 -----LNDGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEF 170

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTL 238
              P+ICG+AASIW RF+ +T +  + WA + + +SESQ L   +  +  + ++DEPH  
Sbjct: 171 KATPIICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNF 230

Query: 239 HGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI 298
           +GQR V++W K LR+KIPL S+LA+SFLACHV REAILPTDI+KWS+EGK+PY+AAFV+I
Sbjct: 231 YGQRVVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDI 290

Query: 299 EKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQ 358
           E R G+TS AC +S   M RPS+    QKLES AASIA +IGL+LPPVNF+++A RYL +
Sbjct: 291 ESRIGKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNK 350

Query: 359 LCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWE 418
           L LP+ KILP A +I EWSMPPDLWLSTNE R P+RVCVMSILI+++RILYN++GFG WE
Sbjct: 351 LALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWE 410

Query: 419 KSLS---------SRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGD-SNAKSS 468
           KSLS          +K  SS +N    F     S M+ + E     P     D  N   S
Sbjct: 411 KSLSVDCASCFPPHQKTHSSPAN---NF-----SNMQADSEN---RPGFTSHDVDNPSVS 459

Query: 469 KNSSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPH-D 527
             + H   +E      L  +EARY++I +  TYEYSKDLPTYL+YCKDV FAG E    D
Sbjct: 460 PENPHLTTTE-----FLRKIEARYHEIAE--TYEYSKDLPTYLQYCKDVAFAGSESLFID 512

Query: 528 DPEEAIMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKR--SRDDLSLGSHYKESEKIRE 585
           D +E  MIEK W +YQNEK+ +  ED  +      KR R  S D LS      ES+K++ 
Sbjct: 513 DHDEQKMIEKLWNYYQNEKDYDQTEDVDQNAASNQKRLREGSNDRLS-----NESKKVKG 567

Query: 586 KEFTTRLSASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDE------------AS 633
           +E                D   ++SLN    S  S +  +S+++D             AS
Sbjct: 568 EE----------------DRISRESLNNRTGSIDSRQSHSSKSLDNSDDDEQSSVDKAAS 611

Query: 634 AETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLR 693
           + T I+ A+++LKLDME  RFCYI PR+  KR DYL Y RK DEGALTYAAHADYYILLR
Sbjct: 612 SLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLR 671

Query: 694 ACATIAQVDIRSMHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCRFCSD 742
           ACA  AQVDIR MH+ VLS E+RL+WLE RI   L LTP S+TC FCSD
Sbjct: 672 ACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720


>gi|114053441|gb|AAK91899.2|AC091627_12 hypothetical protein SDM1_2t00007 [Solanum demissum]
          Length = 664

 Score =  560 bits (1442), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 323/738 (43%), Positives = 435/738 (58%), Gaps = 90/738 (12%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQVA 72
           C+ C N+ F+   DGF+YCT C +   +I++TGV D+D +    GT    ++        
Sbjct: 9   CEVCGNLSFDDGGDGFFYCTRCGSQAVDIMDTGVDDDDLVNVDGGTTLPAFR-------- 60

Query: 73  NSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTDMVGP 132
                               +F+            N +D D         E +  D VGP
Sbjct: 61  --------------------VFE------------NLDDYD---------ERNGDDGVGP 79

Query: 133 TEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIW 192
             P DFGSS  +   Y DY+ E+R++YVMG+Q MIQ+QC+ LV+KFNV PLI G+A  IW
Sbjct: 80  AVPSDFGSS-QISLTYTDYYSEIRLRYVMGLQAMIQMQCKTLVEKFNVSPLIVGLAGPIW 138

Query: 193 FRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLR 252
            R LA   +LS  WAD+ + +SE+Q   E +  Q     + EPH L G+RAV IW+K LR
Sbjct: 139 LRLLAHENVLSDEWADDVVHESETQTQGEIELSQSTGSQKTEPHNLLGKRAVTIWHKSLR 198

Query: 253 QKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLS 312
             IPL  SLAISFL CHV REAILPTDI+KW++EGK+PYFAAF+EIEK+ G  S +C +S
Sbjct: 199 NLIPLPCSLAISFLVCHVAREAILPTDILKWTLEGKLPYFAAFLEIEKQLGPPSRSCPIS 258

Query: 313 PSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALK 372
            S MFRP ++V  QKLES AASIA  IGL LP VNF+A+A+RYLK L LP+ KILP+A +
Sbjct: 259 TSRMFRPIRTVTLQKLESLAASIARKIGLELPSVNFHAIAARYLKHLSLPVEKILPQACQ 318

Query: 373 IQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEK-SLSSRKFFSSTS 431
           + EWSMPP+L+LS N+ R P+RVCVMSILIV++RILY++NG G WE  S  S    S+  
Sbjct: 319 VYEWSMPPELYLSDNDSRLPSRVCVMSILIVTMRILYDLNG-GKWELISSCSNNLVSAVE 377

Query: 432 NIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEAR 491
           N  G    EC           SC+    + + ++ S  +  H   SE     LL  LE +
Sbjct: 378 NGAG----ECG---------FSCNARGAVAEEDSASRDSDPHDSTSE-----LLKILEEK 419

Query: 492 YNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEAA 551
           YN++ D  TY++SKDL +YL YCK+VVFAGLEP +DD EE  +IE FW FYQ+ K     
Sbjct: 420 YNELSD--TYDFSKDLQSYLLYCKNVVFAGLEPAYDDHEEERIIEDFWDFYQSHK----I 473

Query: 552 EDSGKRCGIASKR------KRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDD 605
               + C I ++            +L + +   +++K  E   T   S + ++   SG  
Sbjct: 474 FIHNQNCTITTRAVLNYLLTLQSINLLIYNQIIQADKASEDGKTD--SHTGNHFHHSGSR 531

Query: 606 YPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKR 665
           +   S   +N +   + Y    + D+  +    ++A+++LK DM  NRF YI PR  +K+
Sbjct: 532 HGSNSTTKENENFRDDGYKCKMSRDDGDS----NAALRQLKADMNENRFVYIPPRKNVKK 587

Query: 666 LD-YLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRI 724
            D Y++Y RK D GA  YA HADYYILLR+CA +AQVD+R+MH+ VL+FE+RLA LEKRI
Sbjct: 588 KDGYIRYARKKD-GAYLYAVHADYYILLRSCAKVAQVDVRTMHVGVLAFEKRLAMLEKRI 646

Query: 725 DHCLHLTPPSVTCRFCSD 742
           D CL    P   C FC D
Sbjct: 647 DFCLCKRLPDDFCEFCRD 664


>gi|357473537|ref|XP_003607053.1| hypothetical protein MTR_4g071660 [Medicago truncatula]
 gi|355508108|gb|AES89250.1| hypothetical protein MTR_4g071660 [Medicago truncatula]
          Length = 653

 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 332/755 (43%), Positives = 437/755 (57%), Gaps = 131/755 (17%)

Query: 26  DGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQVANSRPANHFVPLSQ 85
           DGFYYC+ C   + ++V+TG  +ED  A GAG    +Y ASH R+ A    A +  P+SQ
Sbjct: 7   DGFYYCSACGEKNLDVVDTGAEEED--AIGAG----IYLASHQRRTAAPTDAVYVQPISQ 60

Query: 86  ----QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTDMVGPTEPQDFGSS 141
               QS+F   L  E ++                  ++K E V  +    P+ P DFG S
Sbjct: 61  CNPSQSNFLRKLGLEDDSQV----------------KVKAENVDQSQ-CDPSNPADFGGS 103

Query: 142 GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGL 201
             V    E Y+ E+R++Y+MG+Q+MI+LQCEALV +F V PLICG+   IW RF++ TG+
Sbjct: 104 TVVSI--EQYYKEIRLRYIMGLQMMIELQCEALVKEFKVTPLICGLVGPIWLRFVSKTGV 161

Query: 202 LSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSL 261
               WAD+AI  SE Q   E +D+  R KY+ EPH + GQRA  IW++ LR +IP+  ++
Sbjct: 162 FDDDWADKAIHDSEMQNEGEPEDYNIRGKYKSEPHNMFGQRAAFIWFRSLRNRIPVVCTI 221

Query: 262 AISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSK 321
            +S+LACH+ REAI+P+D++KW+ EGK+PYF+AF+E+E R G   VAC +S S MFRP +
Sbjct: 222 VVSYLACHIAREAIMPSDMIKWTCEGKLPYFSAFLELESRMG-PPVACPISSSLMFRPQR 280

Query: 322 SVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMPPD 381
           ++   KLES A+SI++ IGL LPPVNFYAL              ILP A +I EWSM PD
Sbjct: 281 ALSVHKLESCASSISQFIGLELPPVNFYAL--------------ILPYACRIYEWSMSPD 326

Query: 382 LWLSTNECRF--PTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSSTSNIGGKFDP 439
           LWLS ++  F  PT VCV+S+L+V+IRILYNING+G WEKSLS                 
Sbjct: 327 LWLSLSKDYFKLPTHVCVVSVLVVAIRILYNINGYGEWEKSLSH---------------- 370

Query: 440 ECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEARYNDIHDSV 499
                                 DS   S+K+    ++ ELD   LL  L A YN+I D  
Sbjct: 371 ---------------------NDSAKDSAKDPVEQQKHELDCTGLLQHLHAIYNEIAD-- 407

Query: 500 TYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEAAEDSGKRCG 559
           T+EYSKDLPTYLKYCKDVVFAGLEP     EE  M+E  WK YQNE   E  ED      
Sbjct: 408 THEYSKDLPTYLKYCKDVVFAGLEPSLGSYEETNMMEILWKHYQNE---EVIED------ 458

Query: 560 IASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQSLNGDNSSKS 619
                    ++L    + K SE   EK++         + SFSG     +   G  S + 
Sbjct: 459 --------LNELLHRHNTKPSES--EKQY---------HSSFSGTGSRDEGCVGKTSKR- 498

Query: 620 SEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGA 679
             E    + + + +    +  A+++LKLDME NRFCYI P V+ ++L Y+ YVRK D+GA
Sbjct: 499 --EKKKRKCLSDDNPSGSLPEAIRQLKLDMEENRFCYIPPSVKQEKLGYVHYVRKKDKGA 556

Query: 680 LTYAAHADYYILLRACATIAQV-DIRSMHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCR 738
           L+Y AHADYYILLRA A  A V DIR +H+ VL+ ERRLAWLEKRI  CLHL PP+++C 
Sbjct: 557 LSYVAHADYYILLRAFARAAHVDDIRILHIGVLNLERRLAWLEKRIGQCLHLKPPNISCE 616

Query: 739 FCSDPVELAAGDPICNATEDSAD---DHIGLSNLN 770
           FCS             ATE+ +D   D   LSNLN
Sbjct: 617 FCS-----------VRATENGSDHMMDEDELSNLN 640


>gi|22325438|ref|NP_671769.1| maternal effect embryo arrest 12 protein [Arabidopsis thaliana]
 gi|75111159|sp|Q5XVF0.1|MEE12_ARATH RecName: Full=TATA box-binding protein-associated factor RNA
           polymerase I subunit B; AltName: Full=Central cell
           guidance protein; AltName: Full=Protein MATERNAL EFFECT
           EMBRYO ARREST 12; AltName: Full=TATA box-binding
           protein-associated factor 1B; Short=TBP-associated
           factor 1B
 gi|52354243|gb|AAU44442.1| hypothetical protein AT2G02955 [Arabidopsis thaliana]
 gi|330250550|gb|AEC05644.1| maternal effect embryo arrest 12 protein [Arabidopsis thaliana]
          Length = 666

 Score =  524 bits (1350), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 307/738 (41%), Positives = 423/738 (57%), Gaps = 92/738 (12%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ-- 70
           C +C+N  F+   DG+YYC  C    + +++TGV D D +  G GT  ALY   H R   
Sbjct: 3   CTECENDAFDEEDDGYYYCQRCGVQVENLIQTGVDDGDLIGEGGGTQGALYNPKHRRTEP 62

Query: 71  --VANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTD 128
             +  S+P   F   + + S +   F+  N                +   +KR   SY D
Sbjct: 63  QPITPSQP--RFTDDTSRYSQFKSQFESENGNK------------ELPREVKRAPDSYVD 108

Query: 129 MVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVA 188
              PTEP DF +       YE+Y+ E R +YV    +MI  QC+ALVDKFNV PLI G+ 
Sbjct: 109 K-EPTEPVDFAAE---TLSYENYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLIIGLV 164

Query: 189 ASIWFRFLASTGLLSQGWADEAIVQSESQ-ELAESKDFQPRAKYRDEPHTLHGQRAVMIW 247
             I  R++A +G+  + WA+ AI  SE Q E  E KD +   +++ EP  + G+RAV IW
Sbjct: 165 GPISLRYVALSGVYHKDWANNAIRDSEHQSEDGEVKDAKRLKRHKAEPRNIDGKRAVTIW 224

Query: 248 YKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSV 307
           + +L++ +PLSSSL ISFLACH     +LPTDIV+W+ EGK+PY + F++I ++ G+ S 
Sbjct: 225 FGILKKTMPLSSSLVISFLACHQAGAPVLPTDIVRWAREGKLPYLSCFLDIREQMGERSA 284

Query: 308 ACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKIL 367
           AC +  S M RP + + +Q LE+ A+ IA++IGL LPPVNFY +AS Y+KQL +P  KIL
Sbjct: 285 ACPVKVSIMARPFQVISAQMLEARASVIADTIGLPLPPVNFYGIASNYIKQLSIPEDKIL 344

Query: 368 PRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFF 427
             A  IQ WS+PP+L+LSTNE + P+RVCVMSILIV+IR+LYNING G WE+SL     F
Sbjct: 345 DLARLIQNWSLPPELYLSTNEQKLPSRVCVMSILIVAIRMLYNINGLGVWERSLG----F 400

Query: 428 SSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYD 487
            + S+                            GDS   S       K +E D + LL +
Sbjct: 401 VNASD----------------------------GDSETNSGTAE---KATEFDTQELLKN 429

Query: 488 LEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAI-MIEKFWKFYQNEK 546
           LEA+Y+++  + T E  KDL +YL   K+  FAGLE   D P++   +++  W  Y  ++
Sbjct: 430 LEAKYHEVA-AETLESEKDLVSYLSLGKNEFFAGLE--EDSPDDTYRIVDNLWNGYPKDE 486

Query: 547 ESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYES-FSGDD 605
           + E             KR R  DD                     L+  + Y+S FS  +
Sbjct: 487 DIEC----------LPKRGRDWDD------------------DVSLNQLSLYDSRFSDGN 518

Query: 606 YPQQSLNGDNSSKS-SEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIK 664
            P  S +  N S S   +  +SE  + +S E + + A+KRL  DM ++ FCYI PRV++K
Sbjct: 519 NPCSSSSRRNESVSIGLDLSSSEHRESSSPEKLKEIAIKRLITDMGDDLFCYIPPRVKVK 578

Query: 665 RLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRI 724
           RLDYLQYVRK ++GAL YAAHADYYILLR CA +A++D+R+MH  VLSFERRLAW+EKRI
Sbjct: 579 RLDYLQYVRKKEDGALIYAAHADYYILLRVCAKVAEIDVRNMHRGVLSFERRLAWIEKRI 638

Query: 725 DHCLHLTPPSVTCRFCSD 742
           D  LHLT P +TC+ C D
Sbjct: 639 DQVLHLTRPLMTCKHCCD 656


>gi|297814596|ref|XP_002875181.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297321019|gb|EFH51440.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 698

 Score =  514 bits (1325), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 315/741 (42%), Positives = 432/741 (58%), Gaps = 67/741 (9%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQVA 72
           C++CDN  F+   DG++YC  C    +++++TGV D D +  G GT  A+Y   + R + 
Sbjct: 3   CRECDNDAFDEEDDGYFYCKRCGVQAEDLIQTGVDDGDLIGDGGGTHGAIYDPRYRRTIT 62

Query: 73  NSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANY-NDLDSIIHR-------IKREEV 124
                    P++           +P  T  T    ++ + L+S I +       +KRE  
Sbjct: 63  Q--------PITPS---------QPRYTDDTIRYTHFKSQLESEIEKKKDLPREVKREPE 105

Query: 125 SYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLI 184
            Y +   PTEP DFG+       YEDY+ E R +YV    +MI  QC+ALVDKFNV PLI
Sbjct: 106 IYLEK-EPTEPVDFGAE---TLSYEDYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLI 161

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQ-ELAESKDFQPRAKYRDEPHTLHGQRA 243
            G+   I  R++A +G+    WAD+AI  SE Q E  E KD +     + EP  L G+RA
Sbjct: 162 IGLVGPISLRYVALSGVYDDDWADKAIRDSELQSEDGEVKDAKRPKTQKAEPRNLDGKRA 221

Query: 244 VMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFG 303
           V IW  LL++ +PLSSSLAISFLACH     +LPTDIV+W+ EGK+PY + F++I ++ G
Sbjct: 222 VTIWVSLLKKTLPLSSSLAISFLACHHAGAPVLPTDIVRWAREGKLPYLSCFIDIREQMG 281

Query: 304 QTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 363
           + S AC +  S M RP + + +Q LE+ AA IA+ IGL L PVNFY +AS Y+K+L +P 
Sbjct: 282 ERSAACPVKASIMSRPFQIISAQMLEARAACIADIIGLPLSPVNFYGIASNYIKRLSIPE 341

Query: 364 GKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSS 423
            KIL     IQ WS+PP+L+LS NE + PTRVCVMSILIV+IR+LYNING G WE+SL  
Sbjct: 342 DKILELVRLIQNWSLPPELYLSKNELKLPTRVCVMSILIVAIRMLYNINGLGLWERSLE- 400

Query: 424 RKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEA 483
              F + S     F+ E +S + DE           + ++N+ +++ S   K ++ D E 
Sbjct: 401 ---FVNASEADIDFE-EFDSPVHDE----------EVSNTNSGTAELSDVTKATKFDTEE 446

Query: 484 LLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAI-MIEKFWKFY 542
           LL +LEA+Y ++  + T E  KDL +YL   K+  FAGLE   D P++    ++  W  Y
Sbjct: 447 LLKNLEAKYYEVA-AETLESEKDLLSYLSLGKNEFFAGLE--EDLPDDTYRTVDNLWNGY 503

Query: 543 QNEKESEAAEDSGKRCGIASKRKRS-RDDLSLGSHYKESEKIREKEFTTRLSASADYESF 601
             +++ E       RC    KR R   DD+SL     E  K  +       S S   ES 
Sbjct: 504 PKDEDFE-------RCRTPPKRGRDWDDDVSLNQLSLEDSKFSDGNNPCS-SPSRRNESV 555

Query: 602 SGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRV 661
           S D     S +G++SS  +  +           E + + A+KRL  DM +N FCYI PRV
Sbjct: 556 SMDRDLSSSEHGESSSPENHHH---------QEEKLKERAIKRLITDMGDNLFCYIPPRV 606

Query: 662 QIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLE 721
           ++KRLDYLQYVRK ++GAL Y AHADYYILLR CA +A++D+R+MH  VLSFERRLAW+E
Sbjct: 607 KVKRLDYLQYVRKKEDGALIYTAHADYYILLRVCAKVAEIDVRNMHRGVLSFERRLAWIE 666

Query: 722 KRIDHCLHLTPPSVTCRFCSD 742
           KRID  LHL PP +TC  C D
Sbjct: 667 KRIDQVLHLPPPFMTCDHCCD 687


>gi|357490217|ref|XP_003615396.1| hypothetical protein MTR_5g067430 [Medicago truncatula]
 gi|355516731|gb|AES98354.1| hypothetical protein MTR_5g067430 [Medicago truncatula]
          Length = 734

 Score =  487 bits (1253), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 290/741 (39%), Positives = 394/741 (53%), Gaps = 142/741 (19%)

Query: 7   DIRRLKCKKCDNVGF-ECAYDGFYYCTHCNA--LDDEIVETGVADEDFMATGAGTGTALY 63
           DIR LKC+    VG+ + + DGF+YC  CN         +     + ++       +A  
Sbjct: 84  DIRILKCQNYLFVGYVDDSDDGFFYCPKCNVRFDGVFDNDVDDVIDVYILRFRRKKSAAI 143

Query: 64  QASHARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREE 123
           +A       NS+P + +         WS      N      T  N +D+  I   I    
Sbjct: 144 KAE-----PNSQPCSEY-------DAWSKFV---NNLLEDETQQN-DDIHIIEQNIDGTS 187

Query: 124 VSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPL 183
           VS+        P DFGSS    + Y   H ++RM+++MG+Q+MI+LQC+ALV +F V PL
Sbjct: 188 VSH--------PSDFGSSKHKTSKYYHNHNQIRMRHLMGLQMMIELQCKALVKEFKVTPL 239

Query: 184 ICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRA 243
           ICG+   IW RF++ TG+     A   +  SE Q+  E +D+  R KYR+EPH + GQR 
Sbjct: 240 ICGLVGPIWLRFISKTGVFDDDRAYTVMHNSEMQKEGEPEDYTIRGKYREEPHNMFGQRF 299

Query: 244 VMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFG 303
           VMIW + L+ +IPL               EAILP+D++KW++EGK+PYF+AFVE+E+   
Sbjct: 300 VMIWLRYLKNRIPL---------------EAILPSDMMKWTLEGKLPYFSAFVELERLMK 344

Query: 304 QTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 363
             S+AC +S S MF+P +++P QKLESFAASIA  I            A RYLK+L LP+
Sbjct: 345 HESLACPISSSAMFKPQRAIPVQKLESFAASIAHCI------------AYRYLKKLSLPV 392

Query: 364 GKILPRALKIQEWSMPPDLWLSTNE--CRFPTRVCVMSILIVSIRILYNINGFGAWEKSL 421
            KILP A +I EW MPPDLWLS  +   R PT VC +SIL+++IRILYNING+G WEKSL
Sbjct: 393 EKILPYACRIYEWLMPPDLWLSFTKDYFRLPTHVCAVSILVIAIRILYNINGYGEWEKSL 452

Query: 422 SSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDA 481
           S                  CN   +D             G+ +   + +  HF     D 
Sbjct: 453 S------------------CNVGDKDN------------GEMDTTFASHDEHFLVKIAD- 481

Query: 482 EALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAG-LEPPHDDPEEAIMIEKFWK 540
                              +EYSKDLPTYLKY +DVVF G  E  +D+ EE  MIE    
Sbjct: 482 ------------------NHEYSKDLPTYLKYRRDVVFDGSKEQSYDNYEEQRMIEILRN 523

Query: 541 FYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYES 600
           FYQNEK +    +   +   +  R R  D+     H + + K++                
Sbjct: 524 FYQNEKNTTKPSEHAGKSNTSFNRLRLTDE----GHIRRTSKMK---------------- 563

Query: 601 FSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPR 660
            +   Y + S+   N                 S + ++D A+ ++KLDME N FCY+ PR
Sbjct: 564 -NTSPYGEDSVFNKNG---------------GSGKAVVDEAITQMKLDMEENLFCYVPPR 607

Query: 661 VQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWL 720
           V+ K LDY+ YVRK D+GA+TY AHADYYILLR CA IAQVDIR + + VL  E+RLAWL
Sbjct: 608 VKPKNLDYVHYVRKKDKGAMTYVAHADYYILLRTCARIAQVDIRILQIGVLRLEKRLAWL 667

Query: 721 EKRIDHCLHLTPPSVTCRFCS 741
           EKR+D CLHL P S++C+FCS
Sbjct: 668 EKRVDQCLHLKPSSISCQFCS 688


>gi|52354245|gb|AAU44443.1| hypothetical protein AT2G02955 [Arabidopsis thaliana]
          Length = 526

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 260/582 (44%), Positives = 355/582 (60%), Gaps = 70/582 (12%)

Query: 165 LMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQ-ELAESK 223
           +MI  QC+ALVDKFNV PLI G+   I  R++A +G+  + WA+ AI  SE Q E  E K
Sbjct: 1   MMITYQCDALVDKFNVTPLIIGLVGPISLRYVALSGVYHKDWANNAIRDSEHQSEDGEVK 60

Query: 224 DFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKW 283
           D +   +++ EP  + G+RAV IW+ +L++ +PLSSSL ISFLACH     +LPTDIV+W
Sbjct: 61  DAKRLKRHKAEPRNIDGKRAVTIWFGILKKTMPLSSSLVISFLACHQAGAPVLPTDIVRW 120

Query: 284 SIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHL 343
           + EGK+PY + F++I ++ G+ S AC +  S M RP + + +Q LE+ A+ IA++IGL L
Sbjct: 121 AREGKLPYLSCFLDIREQMGERSAACPVKVSIMARPFQVISAQMLEARASVIADTIGLPL 180

Query: 344 PPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIV 403
           PPVNFY +AS Y+KQL +P  KIL  A  IQ WS+PP+L+LSTNE + P+RVCVMSILIV
Sbjct: 181 PPVNFYGIASNYIKQLSIPEDKILDLARLIQNWSLPPELYLSTNEQKLPSRVCVMSILIV 240

Query: 404 SIRILYNINGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDS 463
           +IR+LYNING G WE+SL     F + S+                            GDS
Sbjct: 241 AIRMLYNINGLGVWERSLG----FVNASD----------------------------GDS 268

Query: 464 NAKSSKNSSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLE 523
              S       K +E D + LL +LEA+Y+++  + T E  KDL +YL   K+  FAGLE
Sbjct: 269 ETNSGTAE---KATEFDTQELLKNLEAKYHEVA-AETLESEKDLVSYLSLGKNEFFAGLE 324

Query: 524 PPHDDPEEAI-MIEKFWKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEK 582
              D P++   +++  W  Y  +++ E             KR R  DD            
Sbjct: 325 --EDSPDDTYRIVDNLWNGYPKDEDIEC----------LPKRGRDWDD------------ 360

Query: 583 IREKEFTTRLSASADYES-FSGDDYPQQSLNGDNSSKS-SEEYPNSEAIDEASAETIIDS 640
                    L+  + Y+S FS  + P  S +  N S S   +  +SE  + +S E + + 
Sbjct: 361 ------DVSLNQLSLYDSRFSDGNNPCSSSSRRNESVSIGLDLSSSEHRESSSPEKLKEI 414

Query: 641 AVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQ 700
           A+KRL  DM ++ FCYI PRV++KRLDYLQYVRK ++GAL YAAHADYYILLR CA +A+
Sbjct: 415 AIKRLITDMGDDLFCYIPPRVKVKRLDYLQYVRKKEDGALIYAAHADYYILLRVCAKVAE 474

Query: 701 VDIRSMHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCRFCSD 742
           +D+R+MH  VLSFERRLAW+EKRID  LHLT P +TC+ C D
Sbjct: 475 IDVRNMHRGVLSFERRLAWIEKRIDQVLHLTRPLMTCKHCCD 516


>gi|224112275|ref|XP_002316139.1| predicted protein [Populus trichocarpa]
 gi|222865179|gb|EEF02310.1| predicted protein [Populus trichocarpa]
          Length = 582

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 229/427 (53%), Positives = 288/427 (67%), Gaps = 39/427 (9%)

Query: 314 SFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKI 373
           S MFRP ++VP QKLES AASIAESIGLHLPPV FY +ASRYLKQL LP+ KILP A +I
Sbjct: 175 SLMFRPVQAVPIQKLESMAASIAESIGLHLPPVYFYGIASRYLKQLSLPVEKILPHACRI 234

Query: 374 QEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSSTSNI 433
            EWSMPPD W STNE R P+RVCV+SILIV+IRI+YNINGFGAWE SLS    ++S    
Sbjct: 235 YEWSMPPDQWFSTNELRLPSRVCVLSILIVAIRIIYNINGFGAWEGSLSG---YTS---- 287

Query: 434 GGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEARYN 493
             + D  C+S+MR  V+E        + DS+ KS  N SH + SELD+  LL +L+A+YN
Sbjct: 288 --RLDFPCSSQMR--VDEKDSGSPHKLDDSDEKSIGNRSHVQRSELDSAELLRNLDAKYN 343

Query: 494 DIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEAAED 553
           +I D  TYE+++DLP+YL+YC +VVF G    H +  E  +I+K W FYQ+EK S+ AED
Sbjct: 344 EISD--TYEFTRDLPSYLQYCNEVVFCGAGSSHVNHREDELIKKLWDFYQSEKYSDPAED 401

Query: 554 SGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQSLNG 613
            G +       KRSR+D  L       +KIRE+   T    S D  + +GD+      N 
Sbjct: 402 DGAQNRTVFNGKRSRNDNGLARDQMGKKKIREE---THECLSTDTCNSTGDE------NN 452

Query: 614 DNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQYVR 673
           D+S+                 ET+ D A+++LKLDM  NRF YI PR++IKR DYL YVR
Sbjct: 453 DDST-----------------ETVKDKAIRKLKLDMAENRFYYIPPRLKIKRFDYLHYVR 495

Query: 674 KSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDHCLHLTPP 733
           +  +GALTY AHADYYILLR+ A  AQVD R MH+ V++FERR+AW+EKRID+CLHLTPP
Sbjct: 496 RVGDGALTYVAHADYYILLRSFARAAQVDTRIMHIGVMNFERRVAWMEKRIDYCLHLTPP 555

Query: 734 SVTCRFC 740
           S TC +C
Sbjct: 556 SFTCEYC 562



 Score =  122 bits (307), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 52/90 (57%), Positives = 69/90 (76%)

Query: 129 MVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVA 188
           + GPTEP+DFG     K  YEDY+ EVR+KYV+GM LMI+LQCEALV+KFNV PLI GVA
Sbjct: 68  VFGPTEPEDFGGLSKAKLSYEDYYNEVRIKYVLGMGLMIRLQCEALVEKFNVTPLIVGVA 127

Query: 189 ASIWFRFLASTGLLSQGWADEAIVQSESQE 218
           +++W RFL +TG+    WA++ ++ SE Q+
Sbjct: 128 STVWLRFLLTTGVFKDSWANDVLIDSEMQK 157



 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 27/51 (52%), Positives = 34/51 (66%), Gaps = 1/51 (1%)

Query: 1  MADGEGDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDF 51
          M   EG++R L+C+ C  +G E + DGFYYC  C A    I+ETGVADEDF
Sbjct: 1  MEREEGEVRNLQCESCGYIGLEES-DGFYYCQQCGAQAQGIMETGVADEDF 50


>gi|297814209|ref|XP_002874988.1| hypothetical protein ARALYDRAFT_912098 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320825|gb|EFH51247.1| hypothetical protein ARALYDRAFT_912098 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 546

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 252/604 (41%), Positives = 341/604 (56%), Gaps = 115/604 (19%)

Query: 133 TEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIW 192
           T P+   +  PVK  YEDY+ E R +YV G+ +MI  QC+ALV+KFNV PLI G      
Sbjct: 55  TRPEPAPNPEPVKLSYEDYYKETRDRYVKGLLMMITYQCDALVEKFNVTPLIIG------ 108

Query: 193 FRFLASTGLLSQGWADEAIVQSESQELAESKDFQ-PRAKYRDEPHTLHGQRAVMIWYKLL 251
                   L+   WAD+AI++SE Q   E ++ +  R K  +EP  L+G+RAV IW   L
Sbjct: 109 --------LVDDDWADKAILESELQSEGEVREPEVKRGKASEEPQNLYGKRAVTIWLSQL 160

Query: 252 RQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSL 311
           ++ +PLSSSLA+SFLACH   +A  PTDIV+W+ +GK PY ++F++I ++ G  S AC +
Sbjct: 161 KKSLPLSSSLALSFLACH---KAGTPTDIVRWARQGKFPYLSSFLKIREQMGDRSAACPV 217

Query: 312 SPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRAL 371
           S S MF+PS+ V +Q+LE+  ASIA+ IGL LPPVNFYA+AS YL++L +P  K+L    
Sbjct: 218 SVSVMFKPSQIVSAQRLEAQVASIADIIGLPLPPVNFYAIASNYLERLSIPKEKVLDLVR 277

Query: 372 KIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSSTS 431
            +  WSMP DL+LS NE R PTRV VMSI+IV+IR+LYNINGFG WE+SL          
Sbjct: 278 LLHNWSMPSDLYLSKNELRLPTRVYVMSIIIVAIRMLYNINGFGVWEQSLG--------- 328

Query: 432 NIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEAR 491
                        + D  EE     +                        E LL +LE +
Sbjct: 329 -------------LLDASEEDVTKAT------------------------EELLKNLETK 351

Query: 492 YNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEAA 551
           Y+++  + T E+ KDL +YL + K+ +FAGLE    D +    ++  W  Y  ++     
Sbjct: 352 YHEVA-AETVEHEKDLLSYLLHGKNEIFAGLEEASAD-DTYRTVDNLWNSYPKDQ----L 405

Query: 552 EDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQSL 611
            D    C   S++  +                                          S 
Sbjct: 406 RDRNNPCSSRSRKSET-----------------------------------------GST 424

Query: 612 NGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQY 671
           + D  S  S   P+    +E S E +I    +RL  DM +N F YI PRV++KRLDYLQY
Sbjct: 425 DCDEPSLESVSSPDKHNHEEKSKEKVI----RRLITDMRDNLFQYIPPRVKVKRLDYLQY 480

Query: 672 VRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDHCLHLT 731
           VRK+D+GAL YAAHADYYILLR CA +A++D R+MH +VLSFERRLAW+EK+IDH LHLT
Sbjct: 481 VRKNDDGALIYAAHADYYILLRVCAMVAEIDARNMHRSVLSFERRLAWIEKKIDHVLHLT 540

Query: 732 PPSV 735
           P S+
Sbjct: 541 PLSI 544



 Score = 42.7 bits (99), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 23/39 (58%)

Query: 13 CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDF 51
          C  C++  FE   DGF+YC  C+   D I++TGV  ED 
Sbjct: 3  CHICESDSFEDGGDGFFYCLQCSTQADGIIQTGVDGEDL 41


>gi|384955472|sp|Q5W770.2|TAF1B_ORYSJ RecName: Full=TATA box-binding protein-associated factor RNA
           polymerase I subunit B; AltName: Full=TATA box-binding
           protein-associated factor 1B; Short=TBP-associated
           factor 1B
          Length = 634

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 222/604 (36%), Positives = 330/604 (54%), Gaps = 87/604 (14%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 100 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 159

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVM 245
            +A +IW R++A++ +  + W  + +  + S E   S     +++   +           
Sbjct: 160 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVEEGHSASKDKQSELEGDAQKSQSSYE-F 218

Query: 246 IWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQT 305
           ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K  G +
Sbjct: 219 LFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLGSS 278

Query: 306 SVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGK 365
              C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L LP+ K
Sbjct: 279 LNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPIEK 338

Query: 366 ILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRK 425
           ILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE       
Sbjct: 339 ILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES------ 392

Query: 426 FFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALL 485
             + T N  G  DPE +            +P S+  DSN     NS  F     DA  LL
Sbjct: 393 -IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----DARELL 428

Query: 486 YDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNE 545
             L A Y+ I+  V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW  Y+  
Sbjct: 429 CTLAASYDKIN--VGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWDMYKG- 482

Query: 546 KESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDD 605
           KE    +++ K C                      EK+R                     
Sbjct: 483 KEVMLLDENAKLC---------------------QEKLR--------------------- 500

Query: 606 YPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKR 665
               + NG N       + ++    +  +  + + A++ +K  ME N FCY+SPR ++  
Sbjct: 501 ----TTNGVNKRCRDGRFADT----KCCSTPLGNCALQSIKSKMEENGFCYVSPRKRLVS 552

Query: 666 LDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRID 725
             YL Y R+   G+L Y AHADYYILLR  A +A+VD+R +H +VL  ERRL W+E+R+ 
Sbjct: 553 DGYLLYTRRESSGSLIYVAHADYYILLRPFAKLAEVDVRVLHSSVLKLERRLGWIEERVG 612

Query: 726 HCLH 729
             L+
Sbjct: 613 RSLN 616


>gi|384952682|sp|B8AX23.1|TAF1B_ORYSI RecName: Full=TATA box-binding protein-associated factor RNA
           polymerase I subunit B; AltName: Full=TATA box-binding
           protein-associated factor 1B; Short=TBP-associated
           factor 1B
 gi|218196609|gb|EEC79036.1| hypothetical protein OsI_19584 [Oryza sativa Indica Group]
          Length = 634

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 222/604 (36%), Positives = 329/604 (54%), Gaps = 87/604 (14%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 100 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 159

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVM 245
            +A +IW R++A++ +  + W  + +  + S E   S     +++   +           
Sbjct: 160 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVEEGHSASKDKQSELEGDAQKSQSSYE-F 218

Query: 246 IWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQT 305
           ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K  G +
Sbjct: 219 LFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLGSS 278

Query: 306 SVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGK 365
              C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L LP+ K
Sbjct: 279 LNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPIEK 338

Query: 366 ILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRK 425
           ILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE       
Sbjct: 339 ILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES------ 392

Query: 426 FFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALL 485
             + T N  G  DPE +            +P S+  DSN     NS  F     DA  LL
Sbjct: 393 -IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----DARELL 428

Query: 486 YDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNE 545
             L A Y+ I   V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW  Y+  
Sbjct: 429 CTLAASYDKI--DVGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWDMYKG- 482

Query: 546 KESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDD 605
           KE    +++ K C                      EK+R                     
Sbjct: 483 KEVMLLDENAKLC---------------------QEKLR--------------------- 500

Query: 606 YPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKR 665
               + NG N       + ++    +  +  + + A++ +K  ME N FCY+SPR ++  
Sbjct: 501 ----TTNGVNKRCRDGRFADT----KCCSTPLGNCALQSIKSKMEENGFCYVSPRKRLVS 552

Query: 666 LDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRID 725
             YL Y R+   G+L Y AHADYYILLR  A +A+VD+R +H +VL  ERRL W+E+R+ 
Sbjct: 553 DGYLLYTRRESSGSLIYVAHADYYILLRPFAKLAEVDVRVLHSSVLKLERRLGWIEERVG 612

Query: 726 HCLH 729
             L+
Sbjct: 613 RSLN 616


>gi|357119318|ref|XP_003561389.1| PREDICTED: uncharacterized protein LOC100826720 [Brachypodium
           distachyon]
          Length = 623

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 250/730 (34%), Positives = 357/730 (48%), Gaps = 147/730 (20%)

Query: 11  LKCKKCDNVGF---ECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASH 67
           L+C +C        + A DGF+ C  C A+      T     DF                
Sbjct: 17  LRCNECSATEVYSPDDAVDGFFSCRRCYAVHTSTQVTATDPNDF---------------- 60

Query: 68  ARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYT 127
                +S P    VP +Q     +P +  P+      T   ++D D              
Sbjct: 61  ----NDSMPFLRRVP-TQTPQTPAP-YRTPHPAHGGPTARPFDDFDE------------- 101

Query: 128 DMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGV 187
               P+ PQDF        + E+   +VR +YV+G+Q+++Q Q E LV++  V  LICGV
Sbjct: 102 ----PSAPQDFAPGADAWGNPEELVAQVRRRYVLGLQVILQRQLEVLVERHRVGALICGV 157

Query: 188 AASIWFRFLASTGLLSQGWADEAIVQSESQE-----LAESKDFQPRAKYRDEPHTLHG-- 240
           A +IW R +A++ +  + W  + + Q E+ E      ++ K  Q + +  D      G  
Sbjct: 158 AGTIWVRLVAASKVFDEMWPQKVLAQDEAVEKLKRSASDQKPHQVKYELADAALARKGTR 217

Query: 241 QRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 300
           +R   I+ + LR  +PL S+LA+SFLACH+ REAILPTDI +W+ EGK+PY AAF E++K
Sbjct: 218 KRCEFIFLRSLRTMLPLYSTLAVSFLACHIAREAILPTDISRWATEGKLPYVAAFTEVDK 277

Query: 301 RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLC 360
             G +++ C L    +FRP + + + +LE+ A SIA+ IGL LP VNFYA+A RYL +L 
Sbjct: 278 LLG-SALKCPLDARQLFRPVRVIGTWQLEAAAGSIAQRIGLRLPSVNFYAIAQRYLNELS 336

Query: 361 LPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKS 420
           LP+ +ILP A  I EW+MP +LWLS+N  R PTRV VM+ILIVS+R LYNING G WE+ 
Sbjct: 337 LPVERILPHACHIYEWAMPAELWLSSNSDRIPTRVYVMAILIVSLRFLYNINGQGIWEE- 395

Query: 421 LSSRKFFSSTSNIGGKFDPECNS--KMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESE 478
                         G  DP+ NS   M+ E              +NA  +K         
Sbjct: 396 ------ICEAGITAGGSDPDANSLPSMKPECR------------TNAFGTKE-------- 429

Query: 479 LDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKF 538
                LL  L    + I   V ++YSKDL +YLKYCKDVVF G+       EEA +IE F
Sbjct: 430 -----LLCTLAEACDKI--DVGHDYSKDLRSYLKYCKDVVFPGIA---YSVEEAHLIEIF 479

Query: 539 WKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADY 598
              Y+      A ED                         ++ K+  +E           
Sbjct: 480 HDMYK------AGED-------------------------DNPKVHMEEM---------- 498

Query: 599 ESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYIS 658
                     Q+ NG N       +  +     +       S ++R+K +ME++ FCY+ 
Sbjct: 499 ----------QTTNGVNKRCQDGTFVGARRFSTS-------SGIQRIKSEMEDHGFCYMP 541

Query: 659 PRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLA 718
           P   ++  +YL Y RK+  G+L   +HADYYIL+RA A +A+VDIR +H +VL  ERRLA
Sbjct: 542 PGRGLRSDNYLHYRRKTVSGSLVSVSHADYYILIRAFAKLAEVDIRIIHASVLKLERRLA 601

Query: 719 WLEKRIDHCL 728
            +EKRI   L
Sbjct: 602 SIEKRIGRSL 611


>gi|413944896|gb|AFW77545.1| hypothetical protein ZEAMMB73_711148 [Zea mays]
          Length = 690

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 237/724 (32%), Positives = 352/724 (48%), Gaps = 142/724 (19%)

Query: 4   GEGDIRRLKCKKC---DNVGFECAYDGFYYCTHCNALDDEIVETGVAD-EDFMATGAGTG 59
           G G I  L C  C   DN   + A DG + C  C+A+     +   AD  DF ATG    
Sbjct: 14  GGGSIH-LVCDHCGTSDNYTTDDADDGQFTCRTCSAV--HTTQATAADPHDFPATG---- 66

Query: 60  TALYQASHARQVANSRPANHFVP-LSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHR 118
                      ++  R A    P L  ++   +P    P       T A           
Sbjct: 67  ----------NISVRRVATQPTPKLGARTP--APYLRTPQAAPVPATAA----------- 103

Query: 119 IKREEVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  + D    +EP+DF S      + ED   +VR +YV G+Q+++Q Q E LV++ 
Sbjct: 104 -------FDDFTELSEPRDFVSGSGTCGEPEDLAVQVRWRYVRGLQVILQRQLEVLVERH 156

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESK---DFQP-RAKYR-D 233
            V   +C VA  +W R++A++ +    WA +  V  + + +   K   D +P   KY  D
Sbjct: 157 QVGAFVCAVAGIVWVRWVAASRVFDGIWARQ--VLEDHKAVGRQKCYDDKKPDEMKYEWD 214

Query: 234 EPHTLHGQ---RAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIP 290
           +  +L  +   +    + + LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IP
Sbjct: 215 DDMSLQRKDRHKLEFAFLRSLRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIP 274

Query: 291 YFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYA 350
           Y AAF E+++  G +   C L    +FRP + + + +LE+ A SIA+ + L LP VNFYA
Sbjct: 275 YVAAFTEVDRFLGSSFQDCPLDARQLFRPVRVIGAWQLEAAAGSIAQRVALRLPSVNFYA 334

Query: 351 LASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYN 410
           +A R LK L LP+ +ILP A +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYN
Sbjct: 335 IAQRCLKDLLLPIDRILPHACQIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYN 394

Query: 411 INGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKN 470
           ING G WEK                             + E   +PS    D+N+ +S+ 
Sbjct: 395 INGQGIWEK-----------------------------ICEEGRNPSESHYDANSSTSRK 425

Query: 471 SSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPE 530
                  E     LL  +   Y+ I+ S  ++YS DL +YLKYCK+VVFAG+       E
Sbjct: 426 LEASNSEEFGTRELLCAISVAYDKINTS--HDYSSDLRSYLKYCKEVVFAGIT---SSDE 480

Query: 531 EAIMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGS----HYKESEKIREK 586
           +  +IE FW  Y+  +E +  ED  K       +  S  D+ + +    HY++   ++  
Sbjct: 481 QTHLIEIFWDMYK-AREDDNIEDHVK------SQSHSMADIPITNGVRKHYRDGAFVKAS 533

Query: 587 EFTTRLSASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLK 646
            F    SAS      SG D                                   A++ LK
Sbjct: 534 SF----SAS------SGHD-----------------------------------AMQMLK 548

Query: 647 LDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSM 706
            +M+++ F Y+ PR   K   YL+Y R+   G   + AHADYY+LLRA A +A++DIR M
Sbjct: 549 SEMQDHGFHYMPPRKPRKSDGYLRYRRRRLSGGFIFVAHADYYLLLRAFAKLAEIDIRIM 608

Query: 707 HMAV 710
           H+ +
Sbjct: 609 HIRI 612


>gi|222631239|gb|EEE63371.1| hypothetical protein OsJ_18183 [Oryza sativa Japonica Group]
          Length = 808

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 172/433 (39%), Positives = 254/433 (58%), Gaps = 37/433 (8%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 100 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 159

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVM 245
            +A +IW R++A++ +  + W  + +  + S E   S     +++   +           
Sbjct: 160 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVEEGHSASKDKQSELEGDAQKSQSSYE-F 218

Query: 246 IWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQT 305
           ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K  G +
Sbjct: 219 LFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLGSS 278

Query: 306 SVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGK 365
              C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L LP+ K
Sbjct: 279 LNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPIEK 338

Query: 366 ILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRK 425
           ILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE       
Sbjct: 339 ILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES------ 392

Query: 426 FFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALL 485
             + T N  G  DPE +            +P S+  DSN     NS  F     DA  LL
Sbjct: 393 -IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----DARELL 428

Query: 486 YDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNE 545
             L A Y+ I+  V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW  Y+  
Sbjct: 429 CTLAASYDKIN--VGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWDMYKG- 482

Query: 546 KESEAAEDSGKRC 558
           KE    +++ K C
Sbjct: 483 KEVMLLDENAKLC 495


>gi|55168006|gb|AAV43874.1| unknown protein [Oryza sativa Japonica Group]
          Length = 1221

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 174/444 (39%), Positives = 256/444 (57%), Gaps = 36/444 (8%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 163 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 222

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVM 245
            +A +IW R++A++ +  + W  + +  + S E   S     +++   +           
Sbjct: 223 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVEEGHSASKDKQSELEGDAQKSQSSYE-F 281

Query: 246 IWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQT 305
           ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K  G +
Sbjct: 282 LFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDKLLGSS 341

Query: 306 SVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGK 365
              C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L LP+ K
Sbjct: 342 LNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELSLPIEK 401

Query: 366 ILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRK 425
           ILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE       
Sbjct: 402 ILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES------ 455

Query: 426 FFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALL 485
             + T N  G  DPE +            +P S+  DSN     NS  F     DA  LL
Sbjct: 456 -IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----DARELL 491

Query: 486 YDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNE 545
             L A Y+ I+  V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW  Y+ +
Sbjct: 492 CTLAASYDKIN--VGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWDMYKGK 546

Query: 546 KESEAAEDSGKRCGIASKRKRSRD 569
           ++  A     K        KR RD
Sbjct: 547 EDENAKLCQEKLRTTNGVNKRCRD 570


>gi|308080800|ref|NP_001183047.1| uncharacterized protein LOC100501385 [Zea mays]
 gi|238009008|gb|ACR35539.1| unknown [Zea mays]
          Length = 529

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 195/567 (34%), Positives = 285/567 (50%), Gaps = 92/567 (16%)

Query: 4   GEGDIRRLKCKKC---DNVGFECAYDGFYYCTHCNALDDEIVETGVAD-EDFMATGAGTG 59
           G G I  L C  C   DN   + A DG + C  C+A+     +   AD  DF ATG    
Sbjct: 14  GGGSIH-LVCDHCGTSDNYTTDDADDGQFTCRTCSAV--HTTQATAADPHDFPATG---- 66

Query: 60  TALYQASHARQVANSRPANHFVP-LSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHR 118
                      ++  R A    P L  ++   +P    P       T A           
Sbjct: 67  ----------NISVRRVATQPTPKLGARTP--APYLRTPQAAPVPATAA----------- 103

Query: 119 IKREEVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  + D    +EP+DF S      + ED   +VR +YV G+Q+++Q Q E LV++ 
Sbjct: 104 -------FDDFTELSEPRDFVSGSGTCGEPEDLAVQVRWRYVRGLQVILQRQLEVLVERH 156

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESK---DFQP-RAKYR-D 233
            V   +C VA  +W R++A++ +    WA +  V  + + +   K   D +P   KY  D
Sbjct: 157 QVGAFVCAVAGIVWVRWVAASRVFDGIWARQ--VLEDHKAVGRQKCYDDKKPDEMKYEWD 214

Query: 234 EPHTLHGQ---RAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIP 290
           +  +L  +   +    + + LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IP
Sbjct: 215 DDMSLQRKDRHKLEFAFLRSLRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIP 274

Query: 291 YFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYA 350
           Y AAF E+++  G +   C L    +FRP + + + +LE+ A SIA+ + L LP VNFYA
Sbjct: 275 YVAAFTEVDRFLGSSFQDCPLDARQLFRPVRVIGAWQLEAAAGSIAQRVALRLPSVNFYA 334

Query: 351 LASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYN 410
           +A R LK L LP+ +ILP A +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYN
Sbjct: 335 IAQRCLKDLLLPIDRILPHACQIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYN 394

Query: 411 INGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKN 470
           ING G WEK                             + E   +PS    D+N+ +S+ 
Sbjct: 395 INGQGIWEK-----------------------------ICEEGRNPSESHYDANSSTSRK 425

Query: 471 SSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPE 530
                  E     LL  +   Y+ I+ S  ++YS DL +YLKYCK+VVFAG+       E
Sbjct: 426 LEASNSEEFGTRELLCAISVAYDKINTS--HDYSSDLRSYLKYCKEVVFAGIT---SSDE 480

Query: 531 EAIMIEKFWKFYQNEKESEAAEDSGKR 557
           +  +IE FW  Y+      A ED  +R
Sbjct: 481 QTHLIEIFWDMYK------AREDDNRR 501


>gi|413944895|gb|AFW77544.1| hypothetical protein ZEAMMB73_711148 [Zea mays]
          Length = 459

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 182/483 (37%), Positives = 262/483 (54%), Gaps = 90/483 (18%)

Query: 251 LRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACS 310
           LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IPY AAF E+++  G +   C 
Sbjct: 45  LRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIPYVAAFTEVDRFLGSSFQDCP 104

Query: 311 LSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRA 370
           L    +FRP + + + +LE+ A SIA+ + L LP VNFYA+A R LK L LP+ +ILP A
Sbjct: 105 LDARQLFRPVRVIGAWQLEAAAGSIAQRVALRLPSVNFYAIAQRCLKDLLLPIDRILPHA 164

Query: 371 LKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSST 430
            +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNING G WEK           
Sbjct: 165 CQIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNINGQGIWEK----------- 213

Query: 431 SNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEA 490
                             + E   +PS    D+N+ +S+        E     LL  +  
Sbjct: 214 ------------------ICEEGRNPSESHYDANSSTSRKLEASNSEEFGTRELLCAISV 255

Query: 491 RYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEA 550
            Y+ I+ S  ++YS DL +YLKYCK+VVFAG+       E+  +IE FW  Y+  +E + 
Sbjct: 256 AYDKINTS--HDYSSDLRSYLKYCKEVVFAGIT---SSDEQTHLIEIFWDMYK-AREDDN 309

Query: 551 AEDSGKRCGIASKRKRSRDDLSLGS----HYKESEKIREKEFTTRLSASADYESFSGDDY 606
            ED  K       +  S  D+ + +    HY++   ++   F    SAS      SG D 
Sbjct: 310 IEDHVK------SQSHSMADIPITNGVRKHYRDGAFVKASSF----SAS------SGHD- 352

Query: 607 PQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRL 666
                                             A++ LK +M+++ F Y+ PR   K  
Sbjct: 353 ----------------------------------AMQMLKSEMQDHGFHYMPPRKPRKSD 378

Query: 667 DYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDH 726
            YL+Y R+   G   + AHADYY+LLRA A +A++DIR MH++VL  E+RLA +E+RI+ 
Sbjct: 379 GYLRYRRRRLSGGFIFVAHADYYLLLRAFAKLAEIDIRIMHISVLKLEKRLACIEERIER 438

Query: 727 CLH 729
            L+
Sbjct: 439 SLN 441


>gi|168008162|ref|XP_001756776.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692014|gb|EDQ78373.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 491

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 181/563 (32%), Positives = 282/563 (50%), Gaps = 93/563 (16%)

Query: 165 LMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKD 224
           +++Q+QCEALV  F V PL+CG+   +W +F+ ST +  +GWA+EA++ +E+Q   E+K+
Sbjct: 1   MLVQMQCEALVQSFGVTPLVCGILGPVWMKFVRSTCVYEKGWAEEALLVAEAQH--ENKE 58

Query: 225 FQPRAKYR--DEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVK 282
            +PR K R   EP T +G+R   +W + L+ +IPL SSLAI++L CH+ RE ILPTDI+ 
Sbjct: 59  VKPRRKKRVVGEPWTTYGERTRFVWLRSLKARIPLRSSLAITYLVCHLAREPILPTDIIN 118

Query: 283 WSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLH 342
           W+ EG +PY +A+  IEKR            S +F+P +   ++ +E  A+ I + IGL 
Sbjct: 119 WAFEGTLPYLSAYTAIEKRSNWIPELQLFKASKIFKPLRMTNARVVEYLASLIGDRIGLE 178

Query: 343 LPPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILI 402
           LPP+NF+A++ R+LK+L LP+ K+ P   ++ EW     LWL++   + PTRV VM+ L+
Sbjct: 179 LPPINFHAISRRFLKELDLPVEKLAPYVCRLYEWFPSSGLWLTSQVSKVPTRVYVMAALV 238

Query: 403 VSIRILYNINGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGD 462
           V  +ILYN++G       LSS+ F                               +V GD
Sbjct: 239 VVFKILYNLDG----RYRLSSKDF------------------------------DAVSGD 264

Query: 463 SNAKSSKNSSHFKESELDAEALLYDLE-ARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAG 521
             +  +      KE + +AE  +  ++   +NDI+     ++SK L  YL+YC++ +FA 
Sbjct: 265 DESMDTLEVR--KEEQWNAEEFVTKIKLLLHNDINQ--VGDFSKQLTDYLRYCRETIFAN 320

Query: 522 LEPPHDDPEEAIMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESE 581
               +D+     + + FW  Y         E S K+               LG+  KES+
Sbjct: 321 DGMSNDED----LRDYFWGIY---------EQSAKQVT------------CLGTISKESQ 355

Query: 582 KIREKEFTTRLSASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSA 641
              E +  T      D E              +NS     +         +  +T  D  
Sbjct: 356 NT-ESDIGTNGLLLLDSE--------------ENSFYKQSKPGKKSKKTSSKLQT--DPL 398

Query: 642 VKRLKLDMENNRFCYISPRVQIKRLD-YLQYVRKSDEGALTYAAHADYYILLRACATIAQ 700
           ++     ME+  FC I P   + R + Y  YV   D+       H DY  +LRACA I  
Sbjct: 399 LR----GMEDLGFCIIDPPKHVPRENEYWTYVITRDD---MRPVHEDYLFVLRACAKIID 451

Query: 701 VDIRSMHMAVLSFERRLAWLEKR 723
           VD R +HMA+L+ E+ +  +EKR
Sbjct: 452 VDPRVLHMAILNVEKGIIKVEKR 474


>gi|242034077|ref|XP_002464433.1| hypothetical protein SORBIDRAFT_01g018310 [Sorghum bicolor]
 gi|241918287|gb|EER91431.1| hypothetical protein SORBIDRAFT_01g018310 [Sorghum bicolor]
          Length = 468

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 148/427 (34%), Positives = 223/427 (52%), Gaps = 46/427 (10%)

Query: 4   GEGDIRRLKCKKC---DNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGT 60
           G G    L C  C   DN   + A DG + C  C+A+                       
Sbjct: 15  GGGGSIHLVCDHCGTSDNYNTDDADDGQFTCRTCSAV----------------------- 51

Query: 61  ALYQASHARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIK 120
                 H  Q   + P  H+ P++   S    +  +P       T A Y           
Sbjct: 52  ------HTTQATAADP--HYFPVTGSISV-RRVATQPTPKLGARTPAPYPRTPQAAS--V 100

Query: 121 REEVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNV 180
               ++ D +  +EP+DF        + ED    VR +YV G+Q+++Q Q E LV++  V
Sbjct: 101 PAAAAFDDFMELSEPRDFAPGSGTWGEPEDLAVRVRWRYVRGLQVILQRQLEVLVERHQV 160

Query: 181 CPLICGVAASIWFRFLASTGLLSQGWADEAIVQSES---QELAESKDFQP--RAKYRDEP 235
              +C VA  +W R++A++G+    W  + +   ++   ++ + S+D +     KY  + 
Sbjct: 161 GATVCAVAGIVWVRWVAASGVFDGIWVHQVLEDHKAMGREKCSSSRDNKKPDEVKYESDD 220

Query: 236 HTL---HGQRAV-MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPY 291
             L     +R V   + + LR  +P+ S+LA+ FLACH+ REAILP DI +W++EG +PY
Sbjct: 221 DMLLQRKDRRKVEFAFLRSLRMMLPVYSTLAVCFLACHIAREAILPNDIYRWAMEGNVPY 280

Query: 292 FAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYAL 351
            AAF E+++  G +   C L    +FRP + + + +LE+   SIA+ +GL LP VNFYA+
Sbjct: 281 VAAFTEVDRFLGSSFQDCPLDARQLFRPVRVIGAWQLEAAVGSIAQKVGLRLPSVNFYAI 340

Query: 352 ASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNI 411
           A R LK L LP+ +ILP A +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNI
Sbjct: 341 AHRCLKDLLLPIDRILPHACRIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNI 400

Query: 412 NGFGAWE 418
           NG G WE
Sbjct: 401 NGQGIWE 407


>gi|52354247|gb|AAU44444.1| hypothetical protein AT2G02955 [Arabidopsis thaliana]
          Length = 293

 Score =  219 bits (557), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 118/304 (38%), Positives = 170/304 (55%), Gaps = 23/304 (7%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ-- 70
           C +C+N  F+   DG+YYC  C    + +++TGV D D +  G GT  ALY   H R   
Sbjct: 3   CTECENDAFDEEDDGYYYCQRCGVQVENLIQTGVDDGDLIGEGGGTQGALYNPKHRRTEP 62

Query: 71  --VANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTD 128
             +  S+P   F   + + S +   F+  N                +   +KR   SY D
Sbjct: 63  QPITPSQP--RFTDDTSRYSQFKSQFESENGNK------------ELPREVKRAPDSYVD 108

Query: 129 MVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVA 188
              PTEP DF +       YE+Y+ E R +YV    +MI  QC+ALVDKFNV PLI G+ 
Sbjct: 109 K-EPTEPVDFAAE---TLSYENYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLIIGLV 164

Query: 189 ASIWFRFLASTGLLSQGWADEAIVQSESQ-ELAESKDFQPRAKYRDEPHTLHGQRAVMIW 247
             I  R++A +G+  + WA+ AI  SE Q E  E KD +   +++ EP  + G+RAV IW
Sbjct: 165 GPISLRYVALSGVYHKDWANNAIRDSEHQSEDGEVKDAKRLKRHKAEPRNIDGKRAVTIW 224

Query: 248 YKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSV 307
           + +L++ +PLSSSL ISFLACH     +LPTDIV+W+ EGK+PY + F++I ++ G+ S 
Sbjct: 225 FGILKKTMPLSSSLVISFLACHQAGAPVLPTDIVRWAREGKLPYLSCFLDIREQMGERSA 284

Query: 308 ACSL 311
           AC +
Sbjct: 285 ACPV 288


>gi|242090177|ref|XP_002440921.1| hypothetical protein SORBIDRAFT_09g016515 [Sorghum bicolor]
 gi|241946206|gb|EES19351.1| hypothetical protein SORBIDRAFT_09g016515 [Sorghum bicolor]
          Length = 241

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 94/169 (55%), Positives = 126/169 (74%)

Query: 251 LRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACS 310
           LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IPY AAF E+++  G +   C 
Sbjct: 52  LRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIPYVAAFTEVDRFLGNSFQDCP 111

Query: 311 LSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRA 370
           L    +FRP + + + +LE+ A SIA+ +GL LP VNFYA+A R LK L LP+ +ILP A
Sbjct: 112 LDARQLFRPVRVIGAWQLEAAAGSIAQKVGLRLPSVNFYAIAHRCLKDLLLPIDRILPHA 171

Query: 371 LKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEK 419
            +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNING G WEK
Sbjct: 172 CRIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNINGQGIWEK 220


>gi|2191148|gb|AAB61035.1| A_IG002N01.28 gene product [Arabidopsis thaliana]
          Length = 246

 Score =  176 bits (445), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 118/277 (42%), Positives = 155/277 (55%), Gaps = 61/277 (22%)

Query: 233 DEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYF 292
           +EP  L+G+RAV IW   L + +PLSSSLA+SFLACH     ILPTDIV+W+ +GKIPY 
Sbjct: 25  EEPQNLYGKRAVTIWLSQLTKSLPLSSSLALSFLACHKAGTPILPTDIVRWARQGKIPYL 84

Query: 293 AAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALA 352
           +  ++I ++ G+ S AC +S S             LE+ AASIA+ IGL LPPVNFYA+A
Sbjct: 85  S--LKIREQMGERSAACPVSASI------------LEAQAASIADIIGLPLPPVNFYAIA 130

Query: 353 SRYLKQLCLPL-GKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNI 411
           S YL +L +P   K+L     ++ WSM  DL+LS  E R PTRVCVMSI++V+IR+LYNI
Sbjct: 131 SNYLMRLSVPKEDKVLDLVRLLEYWSMRSDLYLSKKELRLPTRVCVMSIIVVAIRMLYNI 190

Query: 412 NGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNS 471
           NGFG WE+SL   +    T                                         
Sbjct: 191 NGFGVWERSLEDAELSDVT----------------------------------------- 209

Query: 472 SHFKESELDAEALLYDLEARYNDIHDSV--TYEYSKD 506
              K +ELD E LL +LEA+Y ++      T  YS+D
Sbjct: 210 ---KATELDTEELLKNLEAKYYEVAAETVGTQLYSRD 243


>gi|357473559|ref|XP_003607064.1| hypothetical protein MTR_4g071850 [Medicago truncatula]
 gi|355508119|gb|AES89261.1| hypothetical protein MTR_4g071850 [Medicago truncatula]
          Length = 211

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 84/223 (37%), Positives = 118/223 (52%), Gaps = 29/223 (13%)

Query: 7   DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQAS 66
           D+    C+ C   G     DGFYYC+ C   + ++V+TG  +ED  A GAG    +Y AS
Sbjct: 3   DVVTFTCQSCTYEGEALESDGFYYCSACGEKNLDVVDTGAEEED--AIGAG----IYLAS 56

Query: 67  HARQVANSRPANHFVPLSQ----QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKRE 122
           H R+ A    A +  P+SQ    QS+F   L  E ++                  ++K E
Sbjct: 57  HQRRTAAPTDAVYVQPISQCNPSQSNFLRKLGLEDDSQV----------------KVKAE 100

Query: 123 EVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCP 182
            V  +    P+ P DFG S  V    E Y+ E+R++Y+MG+Q+MI+LQCEALV +F V P
Sbjct: 101 NVDQS-QCDPSNPADFGGSTVVS--IEQYYKEIRLRYIMGLQMMIELQCEALVKEFKVTP 157

Query: 183 LICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDF 225
           LICG+   IW RF++ TG+    WAD+AI  SE Q    +  F
Sbjct: 158 LICGLVGPIWLRFVSKTGVFDDDWADKAIHDSEMQNEGNNVKF 200


>gi|302800339|ref|XP_002981927.1| hypothetical protein SELMODRAFT_115616 [Selaginella moellendorffii]
 gi|302802299|ref|XP_002982905.1| hypothetical protein SELMODRAFT_117213 [Selaginella moellendorffii]
 gi|300149495|gb|EFJ16150.1| hypothetical protein SELMODRAFT_117213 [Selaginella moellendorffii]
 gi|300150369|gb|EFJ17020.1| hypothetical protein SELMODRAFT_115616 [Selaginella moellendorffii]
          Length = 159

 Score =  117 bits (294), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 59/148 (39%), Positives = 90/148 (60%), Gaps = 6/148 (4%)

Query: 155 VRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQS 214
           +R  YV G+Q +IQLQCE+LV+ F V PLIC +   IW RF+AST +  + WA EAI+ +
Sbjct: 15  IREYYVQGLQTIIQLQCESLVENFGVSPLICRIVGPIWLRFVASTRVFEREWASEAILLA 74

Query: 215 ESQELAESKDFQPRAKYR---DEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVV 271
           +++ L E        K       P  +   R   +W   L++++PL  +LA+ FLAC+V+
Sbjct: 75  DARSLKEKAKKLKLKKESSTLPAPEKIEMSR---VWTDSLKRRLPLRITLAMVFLACYVI 131

Query: 272 REAILPTDIVKWSIEGKIPYFAAFVEIE 299
           RE +L TDI  W+ +G +PY  AF+ ++
Sbjct: 132 REPVLATDISTWAGQGSLPYLTAFLSMQ 159


>gi|12322421|gb|AAG51232.1|AC035249_7 unknown protein; 59131-63280 [Arabidopsis thaliana]
          Length = 579

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/138 (42%), Positives = 78/138 (56%), Gaps = 26/138 (18%)

Query: 132 PTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASI 191
           PT P DFG+       YEDY+ E R +Y     +MI  QC+ALVDKFNV PLI  +    
Sbjct: 47  PTNPVDFGAE---TLSYEDYYDETRDRYDKAFLMMITYQCDALVDKFNVTPLIIAIR--- 100

Query: 192 WFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLL 251
                            E+ +QSE  E+ ++K  +P+  ++ EP  L G+RAV IW+ +L
Sbjct: 101 -----------------ESELQSEDGEVKDTK--RPKT-HKAEPCNLDGKRAVTIWFSML 140

Query: 252 RQKIPLSSSLAISFLACH 269
           +  +PLSSSL ISFLACH
Sbjct: 141 KMTMPLSSSLVISFLACH 158


>gi|242090175|ref|XP_002440920.1| hypothetical protein SORBIDRAFT_09g016510 [Sorghum bicolor]
 gi|241946205|gb|EES19350.1| hypothetical protein SORBIDRAFT_09g016510 [Sorghum bicolor]
          Length = 159

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 42/87 (48%), Positives = 60/87 (68%)

Query: 642 VKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQV 701
           ++ LK +M+++ F Y+ PR   K   YL+Y RK   G   Y AHADYY+LLRA A +A++
Sbjct: 54  MQMLKSEMQDHGFHYMPPRKPRKSDGYLRYRRKRLSGGFVYVAHADYYMLLRAFAKLAEI 113

Query: 702 DIRSMHMAVLSFERRLAWLEKRIDHCL 728
           DIR MH++VL  ERRLA +E++I+  L
Sbjct: 114 DIRIMHISVLKLERRLACIEEQIERSL 140


>gi|242034079|ref|XP_002464434.1| hypothetical protein SORBIDRAFT_01g018320 [Sorghum bicolor]
 gi|241918288|gb|EER91432.1| hypothetical protein SORBIDRAFT_01g018320 [Sorghum bicolor]
          Length = 159

 Score = 85.9 bits (211), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 60/88 (68%)

Query: 641 AVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQ 700
           A++ L+ +M+++ F Y+ PR   K   YL+Y RK       Y AHADYY+LLRA A +A+
Sbjct: 53  AMQMLRSEMQDHGFHYMPPRKPRKSDGYLRYRRKRLSSGFIYVAHADYYMLLRAFAKLAE 112

Query: 701 VDIRSMHMAVLSFERRLAWLEKRIDHCL 728
           +DIR MH++VL  ERRLA +E+RI+  L
Sbjct: 113 IDIRIMHISVLKVERRLACIEERIERSL 140


>gi|384246021|gb|EIE19513.1| hypothetical protein COCSUDRAFT_83564 [Coccomyxa subellipsoidea
           C-169]
          Length = 300

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 97/171 (56%), Gaps = 12/171 (7%)

Query: 245 MIWY-KLLR---QKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 300
           M+WY + L+   Q +   + LA+SFL C  +REA+ P DI++W+ +G++P   +   +  
Sbjct: 1   MLWYCRCLQHALQALNPDAVLAVSFLGCWYLREAVTPMDILRWASDGRLPVL-SLPPLSA 59

Query: 301 RFGQTS--VACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQ 358
           +  Q++    C+L P+ +  P+  +  Q+L + A  I   + L LPPVN  AL  RY++ 
Sbjct: 60  KLLQSAQEADCTLPPA-LLHPTGVMGVQELVAMAGQIGSRLKLRLPPVNAGALIHRYVQD 118

Query: 359 LCLPLGKILPRALKIQE--WSMPPDLWLSTNECRFPTRVCVMSILIVSIRI 407
           L LP  ++ P AL++ +   S  P +WL  +    P    +M+IL+V++++
Sbjct: 119 LALP-EEMAPVALRLFDIYQSGSPQVWLQDDVFLHPY-AHLMAILLVTLKL 167


>gi|302853742|ref|XP_002958384.1| hypothetical protein VOLCADRAFT_99662 [Volvox carteri f.
           nagariensis]
 gi|300256264|gb|EFJ40534.1| hypothetical protein VOLCADRAFT_99662 [Volvox carteri f.
           nagariensis]
          Length = 1355

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 66/257 (25%), Positives = 111/257 (43%), Gaps = 44/257 (17%)

Query: 143 PVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLL 202
           PV A ++ Y  +V   Y+  +QL++Q Q +ALV++F   P    V  +IWF F+  TGLL
Sbjct: 104 PVMARFK-YTAQV---YLRALQLLVQAQADALVERFGGPPECRVVMRNIWFGFVPLTGLL 159

Query: 203 S-------------------QGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRA 243
                                G A E ++ +ES     S  F   A          G R 
Sbjct: 160 DLDPRVGGTTVHAYLDGSGYPGLAREGVL-AESHMGGGSDIFGRSAFGTLTSSNRPGMRR 218

Query: 244 VMIWYKL------------------LRQKIPLSSSLAISFLACHVVREAILPTDIVKWSI 285
           V +  K                   +R+ +P S++L ++ L C ++R  + P D+V+W++
Sbjct: 219 VKMTSKFHTTGASDIDWGAARMDGFIRRLMPPSATLVMTLLTCLLLRMPVTPVDVVRWAL 278

Query: 286 EGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPP 345
              +PY    +++  +    S      P+ +     +VP ++L   A  +A  +GL LPP
Sbjct: 279 NESLPYRD--LQMRGKPLLRSSPFKTFPARLLAAKGAVPPRRLLVAAVEMAGVLGLQLPP 336

Query: 346 VNFYALASRYLKQLCLP 362
           +N   L ++    L LP
Sbjct: 337 LNVEGLLAKMTDALGLP 353


>gi|297805422|ref|XP_002870595.1| hypothetical protein ARALYDRAFT_915988 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316431|gb|EFH46854.1| hypothetical protein ARALYDRAFT_915988 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 201

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 37/72 (51%), Positives = 50/72 (69%)

Query: 300 KRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQL 359
           ++ G  + AC +S S MFRPS+ V +Q LE+ AASIA+ IGL LPPVNF  +AS YLK+L
Sbjct: 15  EKMGDRAAACPVSVSVMFRPSQIVSAQSLEAQAASIADIIGLLLPPVNFCGIASNYLKRL 74

Query: 360 CLPLGKILPRAL 371
            +P  + L R +
Sbjct: 75  SIPRIRFLSRCV 86



 Score = 40.0 bits (92), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 476 ESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMI 535
           +++   E LL +LEA+Y ++    T+E  KDL  YL + KD +FAGL+    D    I +
Sbjct: 117 DTKATTEELLKNLEAKYYEVAVE-THECEKDLLPYLLHGKDEIFAGLDEASAD-ATYITV 174

Query: 536 EKFWKFYQNEK 546
           +  W  Y  E+
Sbjct: 175 DNLWNSYPKEE 185


>gi|145354102|ref|XP_001421333.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581570|gb|ABO99626.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 748

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 86/169 (50%), Gaps = 6/169 (3%)

Query: 249 KLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVA 308
           + ++ ++P+  SL + +LA    R AILP ++ K + EG +PY   +  + +  G+ +V 
Sbjct: 185 RFVKDRLPMVMSLGLLYLALARRRAAILPFELAKMAAEGSLPYLNVYDVVHEHCGENTV- 243

Query: 309 CSLSPS---FMFRPSKSVPS-QKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 364
             L PS   F+F   K +P+ Q++ + AA  A  I   LPP+N  A+ +RY+  L     
Sbjct: 244 -ELLPSDEIFLFEHPKRIPTPQRIVAAAAFAAAKIDAQLPPINAAAILTRYVGTLFNLDS 302

Query: 365 KILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNING 413
           ++L  A +     + P L              +++ ++V+++ LY ++G
Sbjct: 303 QVLHAARRTLSVYLSPALRYGAKNVVGAPESALLAYVVVALKFLYGLDG 351


>gi|297801128|ref|XP_002868448.1| hypothetical protein ARALYDRAFT_915725 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314284|gb|EFH44707.1| hypothetical protein ARALYDRAFT_915725 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 206

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 69/151 (45%), Gaps = 23/151 (15%)

Query: 316 MFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQE 375
           MFRP + V +Q+LE+ A SIA+ IGL LPPVNFY  +S YLK L  P  K+L     IQ 
Sbjct: 1   MFRPIQIVSAQRLEAQATSIADIIGLPLPPVNFY--SSNYLKLLYNPNDKVLELLCLIQN 58

Query: 376 WSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGA-------WEKSLSSRKFFS 428
           WS+P  L+                   +  R   N+NG G        WE  LS  K   
Sbjct: 59  WSIPSGLYYRRTS--------------LVTRKRRNLNGLGLHPRGGRDWEDDLSLIKLSL 104

Query: 429 STSNIGGKFDPECNSKMRDEVEEISCSPSSV 459
           +   +    +P  +   + E E + C   S+
Sbjct: 105 NDDKLRNGNNPCVSHSRKSETESMDCDEPSL 135



 Score = 47.4 bits (111), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 59/127 (46%), Gaps = 25/127 (19%)

Query: 556 KRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQSLNGDN 615
           +R  + ++++R+ +   LG H +      +     +LS + D           +  NG+N
Sbjct: 68  RRTSLVTRKRRNLN--GLGLHPRGGRDWEDDLSLIKLSLNDD-----------KLRNGNN 114

Query: 616 SSKSSEEYPNSEAID--EASAETI----------IDSAVKRLKLDMENNRFCYISPRVQI 663
              S      +E++D  E S E +           + A+KRL  DM  N FCYI PRV++
Sbjct: 115 PCVSHSRKSETESMDCDEPSLECVSSPDDHKEKSKECAIKRLITDMGYNLFCYIPPRVKV 174

Query: 664 KRLDYLQ 670
           KR DYLQ
Sbjct: 175 KRQDYLQ 181


>gi|255085772|ref|XP_002505317.1| predicted protein [Micromonas sp. RCC299]
 gi|226520586|gb|ACO66575.1| predicted protein [Micromonas sp. RCC299]
          Length = 879

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 61/117 (52%)

Query: 245 MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQ 304
           +   +L+ + +P    LA+++LAC + RE + P D  +W++EG +P+ A   ++ K   +
Sbjct: 265 LTLRRLVSRHLPRRLPLAVAYLACVLRREPLHPADFTRWALEGDLPFMAHAADVAKDLRR 324

Query: 305 TSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCL 361
                S  P+ +  P  +    K+ S A  +A ++ + LPP N   L +R+  +L L
Sbjct: 325 AGADPSPWPNALTTPRATPLPDKIASCAHDVACTLNIELPPCNALGLCARFASELGL 381


>gi|371780083|emb|CBZ39499.1| td2TF2 protein, partial [Triticum durum]
          Length = 101

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 30/63 (47%), Positives = 42/63 (66%)

Query: 648 DMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMH 707
           +ME + FCY+ PR   +   YL Y +K+  G L  A HADYY+L+R+ A +A+VDIR MH
Sbjct: 39  EMEYHGFCYMQPRKWPRSDGYLHYRKKTMTGHLVCAVHADYYLLIRSFAKLAEVDIRVMH 98

Query: 708 MAV 710
            +V
Sbjct: 99  ASV 101


>gi|260817882|ref|XP_002603814.1| hypothetical protein BRAFLDRAFT_86650 [Branchiostoma floridae]
 gi|229289137|gb|EEN59825.1| hypothetical protein BRAFLDRAFT_86650 [Branchiostoma floridae]
          Length = 628

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/178 (24%), Positives = 85/178 (47%), Gaps = 8/178 (4%)

Query: 253 QKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLS 312
           Q++ +  +L  S+L    ++E +LP+D+++W  EG IPY A    + K      + C+  
Sbjct: 240 QRMTMLRTLCFSYLGILWLQEPVLPSDMIRWVREGHIPYIAVKELLPKHMKLGIMDCN-- 297

Query: 313 PSFMFRPSKSVPS-QKLESFAASIAESIGL-HLPPVNFYALASRYLKQLCLPLGKILPRA 370
            +F     ++ PS Q +++    +A  I L  LP V+   +ASR++  L LP        
Sbjct: 298 -NFSV---QAFPSYQVIQNNTVRLARFIQLPQLPEVDISRVASRFILDLNLPAEMHGLVE 353

Query: 371 LKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFS 428
             + +  + P LW +  +         M ++IV +++L+ ++       S ++RK  S
Sbjct: 354 NLLNQRPIDPCLWCNRRDAMLNHEAIAMGLVIVVLKLLFGLDDNSERTLSETARKLQS 411


>gi|449300811|gb|EMC96823.1| hypothetical protein BAUCODRAFT_34214 [Baudoinia compniacensis UAMH
           10762]
          Length = 473

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 79/167 (47%), Gaps = 22/167 (13%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 316
           LS  LA+ ++   ++RE +   D+  W   G++ Y+ A  E+     +      L P++ 
Sbjct: 107 LSDLLALCYVGTLLLREPVTVADLHGWVTHGELLYYRAAKEVPLDMRE-----RLPPTYQ 161

Query: 317 --FRPSKSVPSQKLESFAAS----IAESIGLHLPPVNFYALASRYLKQLCLPLGKI---- 366
             F P    P+QKL          + +S G+ +PP+N   +  R+++ L LP+       
Sbjct: 162 DQFEPHHVGPAQKLHQNVVDTLKVLDDSFGMAMPPINHPLILYRWVRDLSLPIEIFAATT 221

Query: 367 -LPRALKIQ---EWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILY 409
            L RAL++       +PP       E RFP  + +M++L+ + ++LY
Sbjct: 222 NLARALELDFAYRLDVPPA--TKGVELRFP-ELRLMTLLVAATKLLY 265


>gi|296412649|ref|XP_002836035.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295629835|emb|CAZ80192.1| unnamed protein product [Tuber melanosporum]
          Length = 635

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 75/182 (41%), Gaps = 22/182 (12%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQT----------- 305
           L  S+ I +L   V+  A+   DI +W  E  I Y  A  EI     Q            
Sbjct: 206 LVGSVGICYLGLIVLGVAVSLGDIHRWVEEQGIVYLRAINEIPSEMKQRLDAEYYVALDP 265

Query: 306 SVACSLSPSFM-FRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 364
            V C L  S    RP+  +    ++        + G+  PP+N   L   ++K L LPL 
Sbjct: 266 RVCCLLPLSLTGVRPTLGLLHSTVQGLVIMYQVTFGMAFPPLNITLLVFGFMKALGLPL- 324

Query: 365 KILPRALKIQEWSMPPDLWLSTNECRFPTRVC------VMSILIVSIRILYNINGFGAWE 418
           +I     ++      P +W   N  R+  RV       +M++LI+S ++LY ++G     
Sbjct: 325 EIYYGFKRLANLISLPFMW---NGLRYRGRVTNWPEARLMALLIISTKLLYGLDGVKRTP 381

Query: 419 KS 420
           KS
Sbjct: 382 KS 383


>gi|430812269|emb|CCJ30297.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 310

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 16/146 (10%)

Query: 230 KYRDEPHTLHGQRAVMIWYKLLRQKIP-LSSSLAISFLACHVVREAILPTDIVKWSIEGK 288
           KY+++  ++ G    +I YK   +K+P L  ++ I +L    +R +I  +D+      GK
Sbjct: 140 KYQNK--SIIGNPNNIIPYK--TKKLPKLIDTIGICYLGLLNIRSSITISDLCNLMKSGK 195

Query: 289 IPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSV-PS-QKLESFAASIA----ESIGLH 342
           IPY  A   I      T V  S+ P +    S ++ PS +K+ +   SI     +S  + 
Sbjct: 196 IPYMKALFIIP-----TQVRKSMEPQYQIALSPTIFPSYEKVLNVTYSIVSAFYDSYKII 250

Query: 343 LPPVNFYALASRYLKQLCLPLGKILP 368
            PP+N Y +   Y++ LCLPL  ILP
Sbjct: 251 FPPLNVYPILVSYIQNLCLPLELILP 276


>gi|325184330|emb|CCA18821.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 548

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 54/247 (21%), Positives = 94/247 (38%), Gaps = 53/247 (21%)

Query: 160 VMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGL-----LSQGWADEAIVQS 214
           +  +Q ++  Q   L++  +       V   IWFRFL + G      L + + +   ++ 
Sbjct: 91  LQAVQYVLNYQANTLINDLDFPKEYAVVVKQIWFRFLEAWGCKSDKPLLKCFTEMDRLRL 150

Query: 215 ESQELAESKDFQPRAKYRDEPHTLHGQRAVMI---------------------------- 246
             + LA  ++  P A    E   L     + +                            
Sbjct: 151 RGKRLAGQQESNPIAPSNVEDDLLAEWDELFVGSQNSAKGIDEEIDFTIEDNTTQQEGFS 210

Query: 247 ----WYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRF 302
               W KL  + + LS+ L I +L+  ++   +LP+D   W   GK+PY     +  K+ 
Sbjct: 211 LPKAWSKL--EHLSLSTLLGILYLSSRILHLGVLPSDFSHWITNGKLPYHNLLAKCPKK- 267

Query: 303 GQTSVACSLSPSFMFRPSKSVPSQ-----KLESFAASIAESIGLHLPPVNFYALASRYLK 357
               +AC L  + +F  S    SQ     K+  F   +   + LH+PP+N    AS    
Sbjct: 268 ----LACRLDDAALFFNSNVWYSQTFNAAKISFFTNYLQYHLELHIPPLN----ASLAAH 319

Query: 358 QLCLPLG 364
            +C+ LG
Sbjct: 320 TICVNLG 326


>gi|429849272|gb|ELA24675.1| ubiquitin ribosomal fusion protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 674

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 64/296 (21%), Positives = 120/296 (40%), Gaps = 31/296 (10%)

Query: 136 QDFGSSGPVKADYEDYHFEVRMK---------YVMGMQLMIQLQCEALVDKFNVCPLICG 186
           +DFG  G V    +     VR           Y+  +QL+++ Q   LVD       +  
Sbjct: 51  EDFGQKGKVSRKEKQIKERVRRVLTGPPARDLYLECLQLILREQVMWLVDVKGHKAELET 110

Query: 187 VAASIW-FRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVM 245
           V   +W  R   +   +    A EA +   S ++  S+D QP  +           +   
Sbjct: 111 VVRDLWDLRIRGAASAVENDSASEAELSLFSSQIDLSQDEQPSNR----------TQGAQ 160

Query: 246 IWYKLLRQKIP---LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRF 302
            W      + P   L  +LA+ FL C ++R      D+ +W+  G IPY  ++ ++ +  
Sbjct: 161 NWTPESGSEWPAPRLMDTLALCFLGCMLLRMPTRTGDLARWARAGNIPYKHSYRKLPQEM 220

Query: 303 GQTSVA---CSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQL 359
                A     L  + + +         + S A S   + G+  PP+N   +  +Y+++L
Sbjct: 221 RDRLPAYYSTVLKAAHLAKFDHGELHAAVLSLALSFNSNYGIVFPPLNDVLMTLQYVREL 280

Query: 360 CLPLGKI-LPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGF 414
            LP+  I + R + +   ++P +  L  +  R    +    IL+V++ +L  I+ F
Sbjct: 281 GLPIETISVARRIPLI-VNLPYEFPLGKSRLRL---IHQPEILLVAVIVLTTIHCF 332


>gi|213406467|ref|XP_002174005.1| RNA polymerase I-specific transcription initiation factor rrn7
           [Schizosaccharomyces japonicus yFS275]
 gi|212002052|gb|EEB07712.1| RNA polymerase I-specific transcription initiation factor rrn7
           [Schizosaccharomyces japonicus yFS275]
          Length = 532

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 95/222 (42%), Gaps = 31/222 (13%)

Query: 151 YHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIW-------FRFLASTGLLS 203
           Y  E ++ ++  MQ ++QLQ  ALV++  V      V  S+W       F+  A    L+
Sbjct: 78  YGTEGKVLFLKVMQHILQLQTHALVNELGVDQRCEAVIRSLWAVIVSNVFKSSAFHQNLA 137

Query: 204 QGWADEAIVQSESQELAESKDFQ--PRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSL 261
           +   +    Q  S+E ++ +DF+  P ++ +DE          M W KL+        SL
Sbjct: 138 EAINETVQQQPNSEEDSDLEDFEHLPSSEQQDEVKV----DKEMEWPKLIH-------SL 186

Query: 262 AISFLACHVVREAILPTDIVKWSIEGKIPYFAAF------VEIEKRFGQTSVACSLSPSF 315
           A+  + C ++R  +  +DI  W+   KIPY  A+      + I    G       ++P  
Sbjct: 187 ALIQIGCLLLRLPVTTSDIQYWTSTNKIPYMQAYNLLPNTLRIRLHVGYVR---QITPK- 242

Query: 316 MFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLK 357
             +PS     + +       +E I L +P +N   +   YLK
Sbjct: 243 -VQPSTERIHESVGLMIRFFSEKISLLIPSLNTPLILQSYLK 283


>gi|121702971|ref|XP_001269750.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
 gi|119397893|gb|EAW08324.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
          Length = 518

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 75/404 (18%), Positives = 151/404 (37%), Gaps = 70/404 (17%)

Query: 134 EPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDK--FNV-CPLICGVA-- 188
           +P+DFG+ G             R+K  +  QL++  QC ALV    F V   + C  +  
Sbjct: 37  DPEDFGTQGKTS----------RVKKAI-YQLILWKQCHALVQSRGFPVQLEVPCNYSIP 85

Query: 189 ---ASIWFRFLASTGLLSQGWA-------DEAIVQSESQELAESKDFQPRAKYRDEPHTL 238
                 W+    +  ++   WA       D   V +ES    E    QP A +  +    
Sbjct: 86  DCPCRYWYLTPDAQSIVRDLWALRLESLSDRLNVSAESDREPELFSSQPAASFDKQEDAY 145

Query: 239 HGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI 298
                   W +L+        ++ + +L C ++R  +   ++ +  +   +P+      I
Sbjct: 146 RPSGRATQWPRLI-------DTIGLCYLGCLLMRLPVTVGELHRMIMREDVPFIRVLRSI 198

Query: 299 EKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAE----SIGLHLPPVNFYALASR 354
            +          LS   +   ++ + ++ L      ++       G+ LPP+N  AL  R
Sbjct: 199 PREMRDKLPQEYLS---LLETTRLIKAEHLHKATLELSLLYRIKFGVQLPPLNTPALLYR 255

Query: 355 YLKQLCLPLGKILPRALKIQ-----EWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILY 409
           Y+++L LP+ +I P   +IQ     ++  P  +         P  V +++++++S ++ +
Sbjct: 256 YIRRLALPV-EIYPAVKRIQSLLGFKFEFPASILGKRRPLHLP-EVQIITLVVISTKLFF 313

Query: 410 NINGFGA--------------WEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCS 455
             +                  WE     ++ F S   +GG+           +  EI  +
Sbjct: 314 PFDDIKRYPVSVREPSAQVLDWELWSQVQRHFDSRETVGGRI---------GKGNEILIT 364

Query: 456 PSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEARYNDIHDSV 499
              V   S A+  +    ++ S LD   +  D EA +  +   +
Sbjct: 365 EKDVFNMSPAQLDEYMDWYENSWLDHSKMEDDHEAMHTMLQTVI 408


>gi|307103437|gb|EFN51697.1| hypothetical protein CHLNCDRAFT_54966 [Chlorella variabilis]
          Length = 545

 Score = 48.5 bits (114), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 73/157 (46%), Gaps = 25/157 (15%)

Query: 268 CHVVREAILPTDIVKWSIEGKIPY--FAA-----FVEIEKRFGQTSVACSLSPSFMFRPS 320
           C    EA  P D+++W+++G++PY  FAA       +     G   +A            
Sbjct: 202 CWCQLEAAGPLDVLRWALDGRLPYVAFAAEEGAVLQQYRNILGGELIAV----------- 250

Query: 321 KSVPSQK-LESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMP 379
           + VPS K L   A+++ + +GL  PP++     +RYL  L L   ++LP AL++     P
Sbjct: 251 RGVPSPKALLLNASTMGKRLGLACPPLSPALWLARYLADLELS-QELLPVALQLHALYQP 309

Query: 380 PDLWLSTNECRFPTR---VCVMSILIVSIRILYNING 413
               L   E   P R     +M+ L+V+ ++ Y + G
Sbjct: 310 AP--LVPTEGERPGRHPWALLMASLVVAAKLCYGVGG 344


>gi|303282013|ref|XP_003060298.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457769|gb|EEH55067.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 810

 Score = 48.5 bits (114), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 11/104 (10%)

Query: 260 SLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRP 319
           +LAI++LAC  +++ +   D+V+W+++G +PY AA   +    G             F+ 
Sbjct: 238 TLAIAYLACAWLKQPVHVGDLVRWALDGDMPYLAASARVHASLGGDD-------ELPFKG 290

Query: 320 ---SKSVPSQKL-ESFAASIAESIGLHLPPVNFYALASRYLKQL 359
              ++ VP   L  + A  ++   G+ LPP N     +R+  ++
Sbjct: 291 ALVARCVPQPHLIATCAMHVSSVAGIALPPANVAGFVARFASEV 334


>gi|453086938|gb|EMF14979.1| hypothetical protein SEPMUDRAFT_79174 [Mycosphaerella populorum
           SO2202]
          Length = 568

 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 47/230 (20%), Positives = 104/230 (45%), Gaps = 27/230 (11%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI-----EKRFGQTSVACSL 311
           L+  L++ ++   ++R  I   DI KW  +G + Y+ A  E+     ++  GQ  +  SL
Sbjct: 177 LTDLLSLIYVGILLLRIPITTADIYKWVNDGDLLYYRAAREVPLSMRDRLPGQ--LQESL 234

Query: 312 SPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRAL 371
            P  + R    +  + +     S+   +G+  PP+N   +  R++K + LPL ++   A 
Sbjct: 235 EPQTLLR--MEILQRNVLDLLTSLDSDVGMQAPPLNHPLVLYRWVKTMHLPL-EVYVTAQ 291

Query: 372 KIQEWSMPPDLWLSTN---ECRFPTR---VCVMSILIVSIRILYNINGFGAWEKSLSSRK 425
           ++    +  D   + +    CR   R   V +M +++++I++L+  +          +++
Sbjct: 292 RLGGL-LGLDFQYNVDAKERCRISLRCPEVRLMVMIVIAIKMLFPFD---------HAKR 341

Query: 426 FFSSTSNIGG-KFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHF 474
           + +S S++   K D +   +++D    +       +G +   S   SSH 
Sbjct: 342 YPTSASDMTALKMDWDAWQRIQDGRNVVHDGNRENVGSNRLASQDESSHL 391


>gi|407920931|gb|EKG14108.1| Ubiquitin [Macrophomina phaseolina MS6]
          Length = 583

 Score = 47.8 bits (112), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 78/171 (45%), Gaps = 17/171 (9%)

Query: 254 KIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSP 313
           ++ L  ++A+ +    +++      ++  W   G +PY++A +++      + +   L  
Sbjct: 144 RVKLVDTVALCYFGMLILQLPAGLRELYDWVNNGDMPYYSALLDMP-----SDLLHQLPA 198

Query: 314 SFMFR--PSKSVPSQKLESFAASIAE----SIGLHLPPVNFYALASRYLKQLCLPLG--K 365
            +  R  P + +  ++L+S    +A      +G+  PP+N+  LA +Y+K L LPL    
Sbjct: 199 PYRNRLEPHRRLRPERLQSAVLDLASWYHLDVGMVFPPLNYVLLARKYIKDLSLPLEVYH 258

Query: 366 ILPRALKIQEWSMP-PDLWLSTNE--CRFPTRVCVMSILIVSIRILYNING 413
           +L   L I   +   PD      +  C  P      SI I++ ++LY  NG
Sbjct: 259 MLECILHIAGCTFAFPDFTAVGQDPICELPEAQLAASI-IIATKMLYPFNG 308


>gi|297814594|ref|XP_002875180.1| hypothetical protein ARALYDRAFT_904560 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297321018|gb|EFH51439.1| hypothetical protein ARALYDRAFT_904560 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 150

 Score = 45.8 bits (107), Expect = 0.090,   Method: Composition-based stats.
 Identities = 20/44 (45%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 7  DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADED 50
          D RR+ C +C+N  FE   DGF+YC  C+   + I++T V DED
Sbjct: 2  DTRRIICNECENDAFELE-DGFFYCQRCSVRVEGIIQTVVIDED 44


>gi|254578868|ref|XP_002495420.1| ZYRO0B10934p [Zygosaccharomyces rouxii]
 gi|238938310|emb|CAR26487.1| ZYRO0B10934p [Zygosaccharomyces rouxii]
          Length = 534

 Score = 45.1 bits (105), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 34/141 (24%), Positives = 65/141 (46%), Gaps = 14/141 (9%)

Query: 154 EVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQ 213
           E R+ ++  +Q  ++ Q   L+++  +      +   IW + L S  L +    DE   Q
Sbjct: 93  EARLLFIRSLQFTLKRQSRWLIEEQKLPQEFDKLVKIIWMKLLKS--LENDDNNDEFGEQ 150

Query: 214 SESQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVRE 273
            + Q    ++D       +D  H L  +R     Y+L      L S+LAI +++   +  
Sbjct: 151 EDGQTFDSNED-------QDRFHELARRRKDKNRYRL-----SLVSTLAILYMSSVQLGI 198

Query: 274 AILPTDIVKWSIEGKIPYFAA 294
            I   D++KW++ G++PYF +
Sbjct: 199 PIYTCDLIKWTMSGRLPYFKS 219


>gi|443701088|gb|ELT99720.1| hypothetical protein CAPTEDRAFT_184938 [Capitella teleta]
          Length = 880

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 37/162 (22%), Positives = 73/162 (45%), Gaps = 13/162 (8%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 316
           LS +L    L        ++P D+V+W+  GK+P  +A   +             +   M
Sbjct: 409 LSCTLGFCMLGLKYTNPLVMPGDLVRWAKLGKVPLQSAIKSLPDDMKLIG-----NDRQM 463

Query: 317 FRPSKSVPSQKLESFAASIAESIG-LHLPPVNFYALASRYLKQLCLPLGKILPRALKIQE 375
           F  ++   + +L      ++ S+G L   PV+   LA RY+++L LP G++      + E
Sbjct: 464 FSWARVPSTDELREQCWRLSTSLGILDFAPVDIKLLADRYIEELRLP-GELHALVRHLIE 522

Query: 376 WSMPPDLWLSTNECR-----FPTRVCVMSILIVSIRILYNIN 412
            S P D  ++  + R     F      M+ +++ +++L+ +N
Sbjct: 523 -SNPIDTHMTQADSRQFNSGFEVSRIAMTYVVIMLKLLFCLN 563


>gi|225678065|gb|EEH16349.1| predicted protein [Paracoccidioides brasiliensis Pb03]
          Length = 664

 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 78/175 (44%), Gaps = 27/175 (15%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 316
           L  SLA+ +L   ++R  +    + +W +  +IP   A      RF    +   L  ++ 
Sbjct: 201 LIDSLALCYLGVLLLRVPLGLGRVERWILNDEIPLIRAI-----RFIPQDMKDRLPAAYH 255

Query: 317 FR-PSKSVPS-----QKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRA 370
           F   +K+VP+     + +    A   +  G+ +PP+N   +   Y+KQL LPL    P  
Sbjct: 256 FALDTKNVPTGYQLHRAIADLGAFYHKEFGVEIPPLNAPLMLLTYIKQLSLPLEPFPPVL 315

Query: 371 LKIQE-------WSMPPDLWLSTNECRFPTR---------VCVMSILIVSIRILY 409
            ++Q+       +S P     +T+  R   R         V ++S+LI+++++ Y
Sbjct: 316 NRLQQLAGYSFSYSSPISYSTTTSARRKCKRKFHRNHLPEVQLVSLLIIAVKLFY 370


>gi|198433254|ref|XP_002121631.1| PREDICTED: similar to TATA box binding protein (Tbp)-associated
           factor, RNA polymerase I, B [Ciona intestinalis]
          Length = 637

 Score = 43.9 bits (102), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 34/165 (20%), Positives = 77/165 (46%), Gaps = 18/165 (10%)

Query: 261 LAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKR-----FGQTSVACSLSPSF 315
           L+I ++A    +E I P D++ W  +G +PY  A + +E       +   ++  ++ PS+
Sbjct: 264 LSILYVALQWCKELITPGDLLNWVKQGYLPYIDALLCLEPHMKISDYDGANLQRNIIPSY 323

Query: 316 MFRPSKSVPSQKLESFAASIAESIGL-HLPPVNFYALASRYLKQLCLPLGKI--LPRALK 372
                     ++    +  + E + +  LP V+  +L +R++ Q+ LPL  I  +   ++
Sbjct: 324 ----------KQFVKVSRLLEEYVEMPPLPDVDISSLMARFVIQVNLPLEIISFINNLIR 373

Query: 373 IQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAW 417
           I  +        +T    +   +   ++++V++++LY IN    W
Sbjct: 374 INNFKYTRANVRTTALTGYAPDLISCALIVVALKLLYGINDVTEW 418


>gi|396487145|ref|XP_003842569.1| hypothetical protein LEMA_P083290.1 [Leptosphaeria maculans JN3]
 gi|312219146|emb|CBX99090.1| hypothetical protein LEMA_P083290.1 [Leptosphaeria maculans JN3]
          Length = 614

 Score = 43.9 bits (102), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 61/279 (21%), Positives = 122/279 (43%), Gaps = 30/279 (10%)

Query: 152 HFEVRMK---YVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWAD 208
           HF  R     Y+  +QL+++ Q   LV + ++   +  V   +W   +A           
Sbjct: 137 HFSGRQAFDLYLKSLQLILRHQVWFLVHEKSLPAELEIVIYDLWALRIAQLADKMTNTNL 196

Query: 209 EAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQK----IP-LSSSLAI 263
           ++ +QS++Q  +  +         D+  T   +R ++  ++  R++    IP L+ +LA+
Sbjct: 197 DSDLQSQAQIFSTLET--------DDSDTADEKRGLLKNFEHRRERKLASIPNLTDALAL 248

Query: 264 SFLACHVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLSPSFMFRP 319
            +L    +R  I P D   W  +GK+ Y  A     + +  R   T  A  L+   +F+ 
Sbjct: 249 CYLGIRTLRLPITPGDFYAWVTDGKMAYRRAIRLVPLAMRDRLPATFHAV-LNRQALFKH 307

Query: 320 SKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMP 379
            +   +  +      +++  G+  P +N   L  RYL+++ LPL ++     ++ E  + 
Sbjct: 308 KQFYDT--VVDLQIGLSKDHGIVWPALNVPMLLYRYLREMALPL-ELYDATTRLAEL-LG 363

Query: 380 PDLWLSTN-----ECRFPTRVCVMSILIVSIRILYNING 413
            D  L  N       R      ++  LIVS+++LY ++G
Sbjct: 364 YDFALHPNGKKRLGIRHLPEAQLVGCLIVSVKLLYPLDG 402


>gi|169605689|ref|XP_001796265.1| hypothetical protein SNOG_05869 [Phaeosphaeria nodorum SN15]
 gi|111065813|gb|EAT86933.1| hypothetical protein SNOG_05869 [Phaeosphaeria nodorum SN15]
          Length = 523

 Score = 43.5 bits (101), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 83/210 (39%), Gaps = 14/210 (6%)

Query: 159 YVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTG-LLSQGWADEAIVQSESQ 217
           Y+  +QL+++ Q   LV +  +   +  V   +W   +A  G  +    A+    QS+  
Sbjct: 66  YLKCLQLILRHQVWFLVQEKGLPSELEMVVFDLWALRVAQLGNRIGSNNAEAESQQSQVF 125

Query: 218 ELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILP 277
              ES+  Q     R    T  G+       + L     L   LA+ +L    +R  I P
Sbjct: 126 NTLESEGEQTTDNERGTISTPKGRD------RQLHGTPNLYDCLALCYLGMSTLRLLITP 179

Query: 278 TDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAA 333
            D+  W  +GK+ Y  A     + +  R   T  A  L P  M   +  +    L +   
Sbjct: 180 GDVHAWVTDGKLAYRRAIKLIPLGMRDRLPSTYHAV-LDPQTML--NYRIFYTTLTNLEI 236

Query: 334 SIAESIGLHLPPVNFYALASRYLKQLCLPL 363
           S  +   +  P +N   L  RYLKQL LPL
Sbjct: 237 SFEKDHEMQWPALNVQPLLFRYLKQLALPL 266


>gi|330917748|ref|XP_003297944.1| hypothetical protein PTT_08505 [Pyrenophora teres f. teres 0-1]
 gi|311329140|gb|EFQ93979.1| hypothetical protein PTT_08505 [Pyrenophora teres f. teres 0-1]
          Length = 417

 Score = 43.5 bits (101), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 70/160 (43%), Gaps = 12/160 (7%)

Query: 261 LAISFLACHVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLSPSFM 316
           +A+ +L    +R  + P DI +W  +  +PY  A     + + KR     +A  L P   
Sbjct: 165 VALCYLGFITLRLPVTPGDIHRWVTDEMLPYRKAGGLIPLNMRKRLPPNYIAI-LHP--- 220

Query: 317 FRPSK-SVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG--KILPRALKI 373
            RP + +   + L     S +   G+  P +N   L  RYLK+L LPL       R  K+
Sbjct: 221 -RPLRFNRFYRTLTDLQRSYSNDYGIAWPALNVPLLLHRYLKELALPLNIYDATRRLGKM 279

Query: 374 QEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNING 413
             +   PD      E        ++S L+V +++L+ ++G
Sbjct: 280 IGYDFVPDPKNKRPEVHQLPEAQLVSCLLVCVKLLHPLDG 319


>gi|212540280|ref|XP_002150295.1| RNA polymerase I specific transcription initiation factor Rrn7,
           putative [Talaromyces marneffei ATCC 18224]
 gi|210067594|gb|EEA21686.1| RNA polymerase I specific transcription initiation factor Rrn7,
           putative [Talaromyces marneffei ATCC 18224]
          Length = 544

 Score = 43.1 bits (100), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 27/111 (24%), Positives = 51/111 (45%), Gaps = 9/111 (8%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 316
           L  +L + +LAC ++R  +   DI K++I  +IP+  A   + +      +   +SP   
Sbjct: 166 LKETLGLCYLACLLLRLPVSIGDIYKYAIRNEIPFVKALNSVPQ-----EMRDRMSPQHT 220

Query: 317 FRPSKSVPSQKLE----SFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 363
              S    S+++        A  +    +  PPVN   +  ++++QL LPL
Sbjct: 221 RDLSIQAKSEEIHETVNQLVAMYSREYNVVFPPVNMPLMLFQFIRQLALPL 271


>gi|307186494|gb|EFN72064.1| hypothetical protein EAG_13797 [Camponotus floridanus]
          Length = 950

 Score = 43.1 bits (100), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 48/228 (21%), Positives = 96/228 (42%), Gaps = 17/228 (7%)

Query: 210 AIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACH 269
           +  + E++++ E     PR K      T + Q  +   YK   + I      AI +LA  
Sbjct: 228 SFAKEETRKIKEMAKKIPRYK-----RTKYKQNHITTQYKTGPELITPMKLWAILYLALR 282

Query: 270 VVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLE 329
           +  + I  +D++++  EG + Y+        R     VA + +       +  +  + + 
Sbjct: 283 IHNQDIHLSDMLRYGREGHLSYYKL-----DRLIPPEVALTRNDINFLSRAMDITHKGMR 337

Query: 330 SFAASIAESIGL-HLPPVNFYALASRYLKQLCLPLGKIL--PRALKIQEWSMPPDLWLST 386
                +A+ +G+  +   +F  L +RY  +L LP G +L   R + +     PP +    
Sbjct: 338 RIIGRMAKFLGVTKIICPDFLPLINRYCSELNLPRGILLYTERFVALS----PPKMKFDK 393

Query: 387 NECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSSTSNIG 434
             C        M+ +IV ++ L +++G   +E S  + K  S+ +N G
Sbjct: 394 KSCIPNYEGRAMAFIIVVLKTLLSLDGITEYEISNVADKINSAINNEG 441


>gi|398391290|ref|XP_003849105.1| hypothetical protein MYCGRDRAFT_111069 [Zymoseptoria tritici
           IPO323]
 gi|339468981|gb|EGP84081.1| hypothetical protein MYCGRDRAFT_111069 [Zymoseptoria tritici
           IPO323]
          Length = 551

 Score = 43.1 bits (100), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 9/108 (8%)

Query: 261 LAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI-----EKRFGQTSVACSLSPSF 315
           L + F A  ++R  +   D+  W  +G + Y+ A  EI     E+  G   +   L P  
Sbjct: 179 LCLDFTAFLLLRIPLTVADLTSWVNDGTLCYYRASKEIPLSMRERLPGH--LQEQLEPQD 236

Query: 316 MFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 363
           +  P      +K+  +  ++A++ G+ +PPVN   L  R++ +L LPL
Sbjct: 237 LVAPE--TLQRKILEWLTTLADNFGMAVPPVNHPLLLYRWVTELSLPL 282


>gi|406604863|emb|CCH43738.1| RNA polymerase I-specific transcription initiation factor RRN7
           [Wickerhamomyces ciferrii]
          Length = 521

 Score = 43.1 bits (100), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 42/207 (20%), Positives = 86/207 (41%), Gaps = 26/207 (12%)

Query: 156 RMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSE 215
           R  Y    Q +++LQ ++L+++  V  +   VA +++ ++L +     + +       ++
Sbjct: 90  RELYFKCFQTILKLQIKSLINEHGVPEVFEKVAKNVYLQYLNAFFNTDEAFESLGYETAD 149

Query: 216 SQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLS-SSLAISFLACHVVREA 274
           S   ++ +   P                     KL  +K+P    +LAI +LA   ++  
Sbjct: 150 SHVTSDEESQNP---------------------KLFTKKLPSHIDTLAICYLAARYLQLP 188

Query: 275 ILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAAS 334
           +   D+V+W+   +IPY  A  ++ K          L+   +  P K      + +   +
Sbjct: 189 LYLNDLVQWATSRRIPYMRASHDLPKNLSSRLPNIMLT---LLDPGKPPVKGDICNSVTT 245

Query: 335 IAESIGLHLPPVNFYALASRYLKQLCL 361
           I  SI + L   N+  L  R +K+L L
Sbjct: 246 IMSSIDMDL-KFNYKPLLFRSIKELIL 271


>gi|380490508|emb|CCF35965.1| forms a complex with the other RRN protein RRN6 and RRN11
           [Colletotrichum higginsianum]
          Length = 545

 Score = 42.4 bits (98), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 74/317 (23%), Positives = 127/317 (40%), Gaps = 53/317 (16%)

Query: 136 QDFGSSGPVKADYEDYHFEV---------RMKYVMGMQLMIQLQCEALVDKFNVCPLICG 186
           +DFG  G V    +     V         R  Y+  +QL+++ Q   LV +    P +  
Sbjct: 51  EDFGQKGRVSRKEKQIKEHVKQVLTGQQARDLYLECLQLILRQQVLWLVTEKGHKPELET 110

Query: 187 VAASIW-FRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVM 245
           V   +W  R   +  +     A E      S +L  ++D QP          +  +RA  
Sbjct: 111 VVRDLWDLRICGAATVTHDDSASEGEPSLFSSQLDLAQDEQPH---------MPTKRA-Q 160

Query: 246 IWYKLLRQKIP---LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRF 302
            W        P   +  +L + FL C ++R      D+ +W+  G IPY  A+  + +  
Sbjct: 161 SWAAESDNGWPAPRVMDTLGLCFLGCVLLRIPTRIGDLARWARVGSIPYTQAYRRLPREM 220

Query: 303 GQTSVACSLSPSFMFRPSKSVPSQKL---ESFAASIAESIGLHL------PPVNFYALAS 353
                     P +   P KS         E  A+ +  ++  HL      PP+N   +  
Sbjct: 221 RDR------LPPWYTAPLKSAHLATFDQGELHASVLNLALSYHLNYSMVFPPLNDVLMTM 274

Query: 354 RYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFP---TRVCVM---SILIVSIRI 407
           +Y ++L LP+  I   A+  +   +PP   ++    RFP   TRV ++    IL+V+I +
Sbjct: 275 QYARELGLPVETI---AIARR---IPP---IADLAFRFPLEKTRVRLIHQPEILLVAILV 325

Query: 408 LYNINGFGAWEKSLSSR 424
           L  I+ F   + + S+R
Sbjct: 326 LATIHCFPFEDSTCSAR 342


>gi|115390963|ref|XP_001212986.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114193910|gb|EAU35610.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 405

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 66/308 (21%), Positives = 123/308 (39%), Gaps = 65/308 (21%)

Query: 134 EPQDFGSSGP----VKADYEDYH--FEVRMKYVMGMQ---LMIQLQCEALVDKFNVCPLI 184
           +P DFG+ G      KA  E  H  +  R  Y++ +Q   L++  QC ALV         
Sbjct: 42  DPDDFGTQGKRNRVKKAVIEKGHKTYRGRHAYILFLQTYQLILWKQCSALVQNKGFPSQF 101

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQELAESKDFQPRAKYRDEPHTLHGQRAV 244
             V   +W        L  Q +  +    SE  +  E    QP  +  D+ H L      
Sbjct: 102 EHVVRDLW-------ALRLQKYEAKLREVSEDADTLEFFSSQPAGEPDDQLHLLTSGGKR 154

Query: 245 MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK---- 300
           + W +L+        ++A+ +L   ++R  I   D  +  +  ++PY      I +    
Sbjct: 155 IQWPRLV-------DTVALCYLGALLMRLPISVGDFHRMVMREEVPYIRVIKSIPRDIRD 207

Query: 301 RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLH-------LPPVNFYALAS 353
           +  Q  +A   +   +          K E    ++ E + L+        PP+N  AL  
Sbjct: 208 KLPQEYLALIETTRLL----------KAEHLHKAVFELLLLYNQCFDIEFPPLNVPALLY 257

Query: 354 RYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVC------------VMSIL 401
            Y+++L LP+ ++ P   +++      DL   T E  FPT+V             +++++
Sbjct: 258 NYIRRLALPV-ELYPAVKRLR------DLLGFTFE--FPTKVIGKRKPLHLPEVQLVTLI 308

Query: 402 IVSIRILY 409
           ++S ++L+
Sbjct: 309 VISTKLLF 316


>gi|451993473|gb|EMD85946.1| hypothetical protein COCHEDRAFT_1186964 [Cochliobolus
           heterostrophus C5]
          Length = 468

 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 51/118 (43%), Gaps = 5/118 (4%)

Query: 249 KLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAA--FVEIEKRFGQTS 306
           K+L     L+  L + +L    +R  I P DI  W  + K+PY+ A   V +  R     
Sbjct: 167 KVLSTAPNLNDCLLLCYLGIKTLRLPITPGDIHAWVTDNKLPYWRAIKLVPLPMRDRLLP 226

Query: 307 V-ACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 363
           V    L PS  F  ++    + L     S     G+  P +N   L  RYLK+L LPL
Sbjct: 227 VYRAVLDPSAPFTYTRFY--RLLTDLQISYDADHGIIWPSLNVPLLLFRYLKELALPL 282


>gi|440798393|gb|ELR19461.1| hypothetical protein ACA1_267200 [Acanthamoeba castellanii str.
           Neff]
          Length = 613

 Score = 41.2 bits (95), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 54/107 (50%), Gaps = 15/107 (14%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 316
           ++++L  +++AC++++E ++  D+++W   G++PY   +   + +  Q         SF 
Sbjct: 215 VTATLTFNYVACNILQEPVVAHDLIRWVRSGRLPYEEVYSLKDFKDAQ---------SFE 265

Query: 317 FRPSKSVPSQKLESFAASIAESIGLHL-PPVNFYALASRYLKQLCLP 362
            R     P  KLE  A ++    G+    P  FY    R++++L LP
Sbjct: 266 VR-----PLDKLEYQAYNLINYFGIEQEAPPPFYPTLCRFVEELHLP 307


>gi|194902452|ref|XP_001980701.1| GG17298 [Drosophila erecta]
 gi|190652404|gb|EDV49659.1| GG17298 [Drosophila erecta]
          Length = 872

 Score = 40.8 bits (94), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 7  DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADED-FMATGAGTGTA 61
          +++ ++C  C+   F+    GFYYC  C    D+I    +  ED F  TGAG  TA
Sbjct: 9  ELQNMQCDVCEGTAFQERV-GFYYCVECGTQKDQIRAVDITAEDNFDETGAGRYTA 63


>gi|346325092|gb|EGX94689.1| ubiquitin fusion protein [Cordyceps militaris CM01]
          Length = 428

 Score = 40.4 bits (93), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 28/121 (23%), Positives = 57/121 (47%), Gaps = 17/121 (14%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLS 312
           +S +LA+ +L C+++R      +++KW+ + + PY   F     E+++R     V     
Sbjct: 178 ISDTLALCYLGCYLLRIPTRLGELIKWANDFQFPYRHTFQQLPTEMQERMPSIYVKVLKI 237

Query: 313 PSFMFRPSKSVPSQKLESFAASIAESIGLHL------PPVNFYALASRYLKQLCLPLGKI 366
           P         +P +  ++++A +  +   HL      P +N+  L  +Y K L LP+  I
Sbjct: 238 PI-------RLPLEGAQAYSAVMDLACSYHLNYNMLFPDLNYMPLLVQYAKLLALPVESI 290

Query: 367 L 367
           +
Sbjct: 291 I 291


>gi|366992866|ref|XP_003676198.1| hypothetical protein NCAS_0D02560 [Naumovozyma castellii CBS 4309]
 gi|342302064|emb|CCC69837.1| hypothetical protein NCAS_0D02560 [Naumovozyma castellii CBS 4309]
          Length = 533

 Score = 39.7 bits (91), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 31/144 (21%), Positives = 60/144 (41%), Gaps = 16/144 (11%)

Query: 151 YHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEA 210
           Y  +  + ++   QL+++ QC  L++ F    L   V   IW ++L S         +  
Sbjct: 91  YGQDANLLFLKAFQLILRKQCSTLINDFGFPTLFTDVVKLIWMKYLESIDR-----DNRK 145

Query: 211 IVQSESQELAESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHV 270
           +V +E  +     D +      D  HT H  R +          + +SS++A+ ++A   
Sbjct: 146 LVSNEFSDGEIGGDTEVDVPIND--HTEHKVRKL---------GLHMSSTIAMLYMASVH 194

Query: 271 VREAILPTDIVKWSIEGKIPYFAA 294
           +   +   D ++W    K+ YF A
Sbjct: 195 LGLPVYMNDFIQWISTTKLLYFKA 218


>gi|325096356|gb|EGC49666.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 667

 Score = 38.9 bits (89), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 28/113 (24%), Positives = 51/113 (45%), Gaps = 11/113 (9%)

Query: 257 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 316
           L  SLA+ +L   ++R  I    +  W ++ +IP   A      R     +   L P++ 
Sbjct: 195 LIDSLALCYLGLMLLRVPIGVGALESWILKDEIPLIRAI-----RLIPQDMKDRLPPAYH 249

Query: 317 FR-PSKSVPS-----QKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 363
           +   +K+ P+     + +   A    +  G+ +PP+N   +   Y+KQL LPL
Sbjct: 250 YALDTKNAPTGNQLHRSISDLATFYRKEFGVQIPPLNAPLMLFTYIKQLSLPL 302


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.132    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,134,456,076
Number of Sequences: 23463169
Number of extensions: 502988928
Number of successful extensions: 1356525
Number of sequences better than 100.0: 136
Number of HSP's better than 100.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 93
Number of HSP's that attempted gapping in prelim test: 1355977
Number of HSP's gapped (non-prelim): 445
length of query: 771
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 620
effective length of database: 8,816,256,848
effective search space: 5466079245760
effective search space used: 5466079245760
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)