BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 004180
         (770 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255562854|ref|XP_002522432.1| conserved hypothetical protein [Ricinus communis]
 gi|223538317|gb|EEF39924.1| conserved hypothetical protein [Ricinus communis]
          Length = 751

 Score =  803 bits (2073), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/774 (56%), Positives = 538/774 (69%), Gaps = 36/774 (4%)

Query: 4   GEGDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALY 63
           GEG+ +R  C++C +VG E + DGFYYC  C A  D+I+ TGVADEDF+      G ALY
Sbjct: 5   GEGETKRWACRRCGHVGLEES-DGFYYCQECGAQADDIILTGVADEDFIEKDGEGGGALY 63

Query: 64  QASHARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREE 123
            A   R    +R      P SQ    ++   ++ N TTTTT    Y+++     +IK+EE
Sbjct: 64  SARFTRYSQPTRTI-QTNPSSQAWFRYTQEEEDINFTTTTTLNGTYSNI-----KIKKEE 117

Query: 124 V----SYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFN 179
                 Y D +GP EP+DFG        YEDY+ EVR++YVMGMQ MIQLQCE+LV+KFN
Sbjct: 118 RFDDDEYLDGLGPVEPEDFGGKS---LSYEDYYNEVRIRYVMGMQWMIQLQCESLVEKFN 174

Query: 180 VCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQ-ELESKDFQPRAKYRDEPHTLH 238
           V PLICGVA ++W RFL +TG+    WAD+ I++SESQ + E +D++PR+ +R+EPH  +
Sbjct: 175 VSPLICGVAGNVWLRFLVATGVFKDNWADDVILESESQVQGEPEDWKPRSSHRNEPHNAY 234

Query: 239 GQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIE 298
           GQRAVM+W+K LR+ IPLSSSLAISFLACHV REAILPTDIV+WSIEGK+PYFAA VEIE
Sbjct: 235 GQRAVMVWFKYLRKTIPLSSSLAISFLACHVAREAILPTDIVRWSIEGKLPYFAAHVEIE 294

Query: 299 KRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQL 358
           KRF  +S AC +S S MFRPS++VP+QKLES AA+ AESIGLHLPPVNFY +ASRYLK L
Sbjct: 295 KRFEHSSPACPISSSLMFRPSQAVPAQKLESMAAAFAESIGLHLPPVNFYEIASRYLKNL 354

Query: 359 CLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEK 418
            LP+ KILP A +I EWSMPPDLWLSTNE R PTRV VMSILIV+IRILYN+NGFGAWE+
Sbjct: 355 ALPVEKILPHACRIYEWSMPPDLWLSTNELRLPTRVTVMSILIVAIRILYNLNGFGAWER 414

Query: 419 SLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESEL 478
           SLSS     S S+   + D  C S M+ + E  + SP   +  S  K  +N SH +  EL
Sbjct: 415 SLSSLNCSPSNSHPASRLDSMCRSVMQGDAE--TGSPFYSLDGSAEKFLRNPSHMQMPEL 472

Query: 479 DAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPH-DDPEEAIMIEKF 537
           D+  LL+ LE +YN I D+  YE++KDLP+YL+YCKDVVFAG  P H DD EE  ++EK 
Sbjct: 473 DSAELLHHLEVKYNFIADA--YEFTKDLPSYLQYCKDVVFAGAGPSHMDDLEEEELMEKL 530

Query: 538 WKFYQNEKESE-AAEDSGKRCGIASKRKRSR-DDLSLGSHYKESEKIREKEFTTRLSASA 595
           W FYQNEK+SE A E   +     S +KRSR DD S+  +  E EKI+E+      S SA
Sbjct: 531 WDFYQNEKDSELAKEPRTQSSSRLSNQKRSRNDDGSVFVNLSEKEKIKEEWHD---SPSA 587

Query: 596 DYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCY 655
           D  S + D+   QS +  + S +S E  N E  ++ S +T+   A++RLKLDME NRFCY
Sbjct: 588 DISSHNADNSSHQSFDNGHFSNNSLEDQNVEHKEKDSEKTLEGRAIRRLKLDMEENRFCY 647

Query: 656 ISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERR 715
           I PRV +KR DYL YVRK DEGA TY AHADYYILLRACA +AQVDIR MH+ VLSFERR
Sbjct: 648 IPPRVNLKRFDYLHYVRKKDEGAFTYVAHADYYILLRACARVAQVDIRIMHIGVLSFERR 707

Query: 716 LAWLEKRIDHCLHLTPPSVTCRFCSDPVELAAGDPICNATEDSADDHIGLSNLN 769
           LAWLEKRID+CLHL+PP++TC FC D  +            +S DD IGLS LN
Sbjct: 708 LAWLEKRIDYCLHLSPPTITCEFCRDMPD-----------HNSNDDVIGLSKLN 750


>gi|225425686|ref|XP_002269865.1| PREDICTED: uncharacterized protein LOC100241425 [Vitis vinifera]
          Length = 722

 Score =  748 bits (1930), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 412/769 (53%), Positives = 521/769 (67%), Gaps = 66/769 (8%)

Query: 11  LKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ 70
           L C  C +VGF    DGF+YC  C +  ++I++TGVA+EDF+A G   G A+Y ASH RQ
Sbjct: 7   LTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAKGDARG-AIYSASHRRQ 65

Query: 71  VANSRPANHFVPLSQ-QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEV---SY 126
             +  P     PLSQ QS F + L          T   +Y        R++ EE    + 
Sbjct: 66  RHSIAPKPE--PLSQSQSQFLNNL----------TLDDDY--------RVENEETREETV 105

Query: 127 TDMVGPTEPQDFGSS--GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLI 184
            D VGP+ P DFG    G     +EDY+ ++R++YVMG+Q+MI+LQC+ALV+KF   PLI
Sbjct: 106 ADEVGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLI 165

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQEL-ESKDFQPRAKYRDEPHTLHGQRAV 243
           CGVA +IW RF+A+T +    WAD+ I  SE Q+  ES+D +PRAKY  EPH ++GQRAV
Sbjct: 166 CGVAGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAV 225

Query: 244 MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQ 303
           +IW++ L++KIPLS SL ISFLACH+ REAILPTDI+KWS+EGK+PYFAAF+EIEK+ G 
Sbjct: 226 IIWHRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGP 285

Query: 304 TSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 363
            S  C LS SFMFRPS+++P QKLE+ AASIA+ IGLHLPPVNFYA+A RYL+QL LP+ 
Sbjct: 286 PSSPCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVE 345

Query: 364 KILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSR 423
           KILP A ++ EWSMPPDLWLS NE R PTRVCVMSILIV+IRILYN++GFG WE SLSS 
Sbjct: 346 KILPYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSS 405

Query: 424 KFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEAL 483
              SS+S+   K +   N KM D  ++   SP   +  SN +   NSSH ++SE DA  L
Sbjct: 406 SGSSSSSSQIVKLNASDNIKMMDGAKQ--GSPLHDLNGSNEEPVTNSSHAQKSEFDATEL 463

Query: 484 LYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQN 543
           L +L+ARY+++ D  TYEYSKDLPTYL+YCKDVVFAGLE P +D EE  +IE+ W+FYQN
Sbjct: 464 LCNLDARYDELID--TYEYSKDLPTYLQYCKDVVFAGLELPFEDHEEEKIIEQLWEFYQN 521

Query: 544 EKESEAAEDSGKRCGIASKRKRSRDDLS-LGSHYKESEKIREKEFTTRLSASADYESFSG 602
           +K+SE +ED G  CG A   KRSR+D   + S  KE +KIR+                  
Sbjct: 522 QKDSEPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRD------------------ 563

Query: 603 DDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQI 662
           D      L+GD++S +S+    S    +AS ET+ + A+ R+K DME NRFCYI PRV +
Sbjct: 564 DCSVPLGLDGDDTSLNSQGGQKSVPTHQASVETLKEEAILRMKADMEENRFCYIPPRVNV 623

Query: 663 KRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKR 722
           KR DYL YVRK DEG+  YAAHADYYILLRACA +AQVD+RSMH+ V+S ERRL W+EKR
Sbjct: 624 KRFDYLHYVRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKR 683

Query: 723 IDHCLHLTPPSVTCRFCSDPVELAAGDPICN--ATEDSADDHIGLSNLN 769
           IDHCLH  PP    +F SDP         CN  A E S DD++  S+LN
Sbjct: 684 IDHCLHFKPP----KFSSDP---------CNDDAPECSTDDYVEFSSLN 719


>gi|147780094|emb|CAN73288.1| hypothetical protein VITISV_027088 [Vitis vinifera]
          Length = 722

 Score =  739 bits (1909), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/767 (52%), Positives = 516/767 (67%), Gaps = 62/767 (8%)

Query: 11  LKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ 70
           L C  C +VGF    DGF+YC  C +  ++I++TGVA+EDF+A G   G A+Y ASH RQ
Sbjct: 7   LTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTGVAEEDFVAKGDARG-AIYSASHRRQ 65

Query: 71  VANSRPANHFVPLSQ-QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEV---SY 126
             +  P     PLSQ QS F + L          T   +Y        R++ EE    + 
Sbjct: 66  RHSIAPKPE--PLSQSQSQFLNNL----------TLDDDY--------RVENEETREETV 105

Query: 127 TDMVGPTEPQDFGSS--GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLI 184
            D VGP+ P DFG    G     +EDY+ ++R++YVMG+Q+MI+LQC+ALV+KF   PLI
Sbjct: 106 ADEVGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLI 165

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQEL-ESKDFQPRAKYRDEPHTLHGQRAV 243
           CGVA +IW RF+A+T +    WAD+ I  SE Q+  ES+D +PR KY  EPH ++GQRAV
Sbjct: 166 CGVAGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRTKYSAEPHNIYGQRAV 225

Query: 244 MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQ 303
           +IW++ L++KIPLS SL ISFLACH+ REAILPTDI+KWS+EGK+PYFAAF+EIEK+ G 
Sbjct: 226 IIWHRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGP 285

Query: 304 TSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 363
            S  C LS SFMFRPS+++P QKLE+ AASIA+ IGLHLPPVNFYA+A RYL+QL LP+ 
Sbjct: 286 PSSPCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVE 345

Query: 364 KILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSR 423
           KILP A ++ EWSMPPDLWLS NE R PTRVCVMSILIV+IRILYN++GFG WE SLSS 
Sbjct: 346 KILPYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLSSS 405

Query: 424 KFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEAL 483
              SS+S+   K +   N KM D  ++   SP   +  SN +   NSSH ++SE DA  L
Sbjct: 406 SGSSSSSSQIVKLNASDNIKMMDGAKQ--GSPLHDLNGSNEEPVTNSSHAQKSEFDATEL 463

Query: 484 LYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQN 543
           L +L+ARY+++ D  TYEYSKDLPTYL+YCKDVVFAGLE P +D EE  +IE+ W+FYQN
Sbjct: 464 LCNLDARYDELID--TYEYSKDLPTYLQYCKDVVFAGLELPFEDHEEEKIIEQLWEFYQN 521

Query: 544 EKESEAAEDSGKRCGIASKRKRSRDDLS-LGSHYKESEKIREKEFTTRLSASADYESFSG 602
           +K+SE +ED G  CG A   KRSR+D   + S  KE +KIR+                  
Sbjct: 522 QKDSEPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIRD------------------ 563

Query: 603 DDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQI 662
           D      L+GD++S +S+    S    +AS ET+ + A+ R+K DME NRFCYI  RV +
Sbjct: 564 DCSVPLGLDGDDTSLNSQGGQXSVPTHQASVETVKEEAILRMKADMEENRFCYIPXRVNV 623

Query: 663 KRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKR 722
           KR DYL YVRK DEG+  YAAHADYYILLRACA +AQVD+RSMH+ V+S ERRL W+EKR
Sbjct: 624 KRFDYLHYVRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKR 683

Query: 723 IDHCLHLTPPSVTCRFCSDPVELAAGDPICNATEDSADDHIGLSNLN 769
           IDHCLH  PP  +   C+            +A E S DD++  S+LN
Sbjct: 684 IDHCLHFKPPKFSSDXCNX-----------DAPECSTDDYVEFSSLN 719


>gi|356541066|ref|XP_003539004.1| PREDICTED: uncharacterized protein LOC100789168 [Glycine max]
          Length = 719

 Score =  699 bits (1804), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/770 (48%), Positives = 503/770 (65%), Gaps = 61/770 (7%)

Query: 7   DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQAS 66
           D+ +L C+ C+NVG     DGF+YC  C +  +++++T V D+D         +A+Y  S
Sbjct: 3   DVHKLICQTCNNVGVGDGSDGFFYCLRCGSQCEDVMDTAVDDDDLF-----NKSAVYLPS 57

Query: 67  HARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREE--- 123
           H RQ + +  A    P+SQ  SF    +D  +         + N   +   ++KREE   
Sbjct: 58  HQRQRSVAIKAE---PISQYDSF----YDSHSNFIRNLGLEDENPQRNDHVQVKREEEFD 110

Query: 124 VSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPL 183
            +  D   P+ P DFG  G   A +EDYH E+RM+YVMG+Q+MI+LQCEALV +F V PL
Sbjct: 111 AAQFDEASPSVPADFG--GSQVACFEDYHNEIRMRYVMGLQMMIELQCEALVKEFKVTPL 168

Query: 184 ICGVAASIWFRFLASTGLLSQGWADEAIVQSESQ-ELESKDFQPRAKYRDEPHTLHGQRA 242
           ICG+   IW RF++ TG+    WAD+ I  SE+Q E E +D++ RAKYR EPH + G+RA
Sbjct: 169 ICGLVGPIWLRFVSKTGIFGDDWADKVIHDSETQNEDEPEDYKLRAKYRTEPHNMFGKRA 228

Query: 243 VMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFG 302
           VMIW++ L+++IPL  ++A+SFLACHV REA+LP+D++KW++EGK+PYF+AFVEIEKR G
Sbjct: 229 VMIWFRSLKKRIPLPCTVAVSFLACHVAREAVLPSDMMKWTLEGKLPYFSAFVEIEKRMG 288

Query: 303 QTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 362
           Q + AC +S S MFRP +++P QKLESFAASIA+ IGL LPPVNFYA+A  +L++L LP+
Sbjct: 289 QPTSACPISSSVMFRPQQAIPVQKLESFAASIAQFIGLELPPVNFYAIAYLFLQKLSLPV 348

Query: 363 GKILPRALKIQEWSMPPDLW--LSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSL 420
            KILP A +I EWSMPPDLW  LS+N  R PT VCVMSIL+V+IRILYNINGFG WEKSL
Sbjct: 349 EKILPYACRIYEWSMPPDLWLSLSSNYFRLPTHVCVMSILVVAIRILYNINGFGEWEKSL 408

Query: 421 SSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDA 480
           S        S +G  F  +                    G  +  S+++    ++ ELD+
Sbjct: 409 SRDDDAKDNSEMGNAFASDNGHDF---------------GKESKDSAEDQVGSQKHELDS 453

Query: 481 EALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKF 540
             LL  L+ARYN+I D  TY YSKDL TYLKYC+DVVFAG EP + + EE  MIE  WKF
Sbjct: 454 AWLLQHLQARYNEIAD--TYGYSKDLATYLKYCRDVVFAGSEPSYGNHEEEKMIEYLWKF 511

Query: 541 YQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESF 600
           YQNE++++ +E+  ++   +  +  SRD   +G   KE EKIR+K F          E F
Sbjct: 512 YQNEEDTKPSENV-EQSNTSFNQTSSRDKGCIGKISKE-EKIRKKGFN---------ELF 560

Query: 601 SGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRV 660
             DD   + ++ +NS +S  +  +S++ + +S ++ +  A++++KL+ME NRF YI P V
Sbjct: 561 PDDDTSLEDVSNNNSHESLSDSEDSDSQEHSSGKSHVKEAIRQMKLEMEENRFYYIPPSV 620

Query: 661 QIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLE 720
           + KRLDY+ YVRK DEGALTY AHADYYILLRACA IAQVDIR +H+ VL  ERRLAWLE
Sbjct: 621 KRKRLDYIHYVRKRDEGALTYVAHADYYILLRACARIAQVDIRILHIGVLRLERRLAWLE 680

Query: 721 KRIDHCLHLTPPSVTCRFCSDPVELAAGDPICN-ATEDSADDHIGLSNLN 769
           KRID CLH  P S++C+FC            CN ATE+ +D   GLSNLN
Sbjct: 681 KRIDQCLHSKPTSISCQFC------------CNKATENGSDGLPGLSNLN 718


>gi|296086368|emb|CBI31957.3| unnamed protein product [Vitis vinifera]
          Length = 713

 Score =  669 bits (1726), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/768 (49%), Positives = 468/768 (60%), Gaps = 159/768 (20%)

Query: 11  LKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ 70
           L C  C +VGF    DGF+YC  C +  ++I++TG                         
Sbjct: 7   LTCHVCGSVGFSDGADGFFYCGRCGSQAEDIIDTG------------------------- 41

Query: 71  VANSRPANHFVPLSQ-QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTDM 129
                      PLSQ QS F + L  + +       T              REE +  D 
Sbjct: 42  -----------PLSQSQSQFLNNLTLDDDYRVENEET--------------REE-TVADE 75

Query: 130 VGPTEPQDFGSS--GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGV 187
           VGP+ P DFG    G     +EDY+ ++R++YVMG+Q+MI+LQC+ALV+KF   PLICGV
Sbjct: 76  VGPSGPSDFGLGLDGSDGLSFEDYYTQLRIRYVMGVQIMIELQCQALVEKFKASPLICGV 135

Query: 188 AASIWFRFLASTGLLSQGWADEAIVQSESQEL-ESKDFQPRAKYRDEPHTLHGQRAVMIW 246
           A +IW RF+A+T +    WAD+ I  SE Q+  ES+D +PRAKY  EPH ++GQRAV+IW
Sbjct: 136 AGTIWLRFVATTRVFDDEWADKVIQDSEMQKPGESEDLKPRAKYSAEPHNIYGQRAVIIW 195

Query: 247 YKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSV 306
           ++ L++KIPLS SL ISFLACH+ REAILPTDI+KWS+EGK+PYFAAF+EIEK+ G  S 
Sbjct: 196 HRSLKKKIPLSCSLVISFLACHIAREAILPTDILKWSLEGKLPYFAAFIEIEKQIGPPSS 255

Query: 307 ACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKIL 366
            C LS SFMFRPS+++P QKLE+ AASIA+ IGLHLPPVNFYA+A RYL+QL LP+ KIL
Sbjct: 256 PCPLSSSFMFRPSEAIPLQKLEAQAASIADFIGLHLPPVNFYAIAFRYLEQLFLPVEKIL 315

Query: 367 PRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFF 426
           P A ++ EWSMPPDLWLS NE R PTRVCVMSILIV+IRILYN++GFG WE SLS     
Sbjct: 316 PYACRVYEWSMPPDLWLSANELRLPTRVCVMSILIVTIRILYNVHGFGKWEMSLS----- 370

Query: 427 SSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYD 486
                                                 K   NSSH ++SE DA  LL +
Sbjct: 371 --------------------------------------KPVTNSSHAQKSEFDATELLCN 392

Query: 487 LEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKE 546
           L+ARY+++ D  TYEYSKDLPTYL+YCKDVVFAGLE P +D EE  +IE+ W+FYQN+K+
Sbjct: 393 LDARYDELID--TYEYSKDLPTYLQYCKDVVFAGLELPFEDHEEEKIIEQLWEFYQNQKD 450

Query: 547 SEAAEDSGKRCGIASKRKRSRDDLS-LGSHYKESEKIREKEFTTRLSASADYESFSGDDY 605
           SE +ED G  CG A   KRSR+D   + S  KE +KIR                      
Sbjct: 451 SEPSEDLGVECGSALNEKRSRNDEGCINSIPKEKKKIR---------------------- 488

Query: 606 PQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRL 665
                  D+ S                 ET+ + A+ R+K DME NRFCYI PRV +KR 
Sbjct: 489 -------DDCSV-------------PLVETLKEEAILRMKADMEENRFCYIPPRVNVKRF 528

Query: 666 DYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDH 725
           DYL YVRK DEG+  YAAHADYYILLRACA +AQVD+RSMH+ V+S ERRL W+EKRIDH
Sbjct: 529 DYLHYVRKKDEGSYIYAAHADYYILLRACARVAQVDVRSMHVGVMSLERRLGWIEKRIDH 588

Query: 726 CLHLTPPSVTCRFCSDPVELAAGDPICN--ATEDSADDHIGLS-NLNT 770
           CLH  PP    +F SDP         CN  A E S DD++  S NLNT
Sbjct: 589 CLHFKPP----KFSSDP---------CNDDAPECSTDDYVEFSRNLNT 623


>gi|449518998|ref|XP_004166522.1| PREDICTED: TATA box-binding protein-associated factor RNA
           polymerase I subunit B-like [Cucumis sativus]
          Length = 735

 Score =  661 bits (1706), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/757 (48%), Positives = 478/757 (63%), Gaps = 59/757 (7%)

Query: 6   GDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAG-TGTALYQ 64
            D   L C  C ++G    +DGF+YC  C +  D+I++TGVA+ED +    G +G  +Y 
Sbjct: 2   ADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYS 61

Query: 65  ASHARQVANSRPANHFVPLSQQSSFW----SPLFDEPNTTTTTTTTANYNDLDSIIHRIK 120
            SH R+   +       PLSQ  S +    S  +D  N     +      D D ++    
Sbjct: 62  QSHTRR--RNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVM---- 115

Query: 121 REEVSYTDMVGPTEPQDFGSSGPV--KADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  D VGPT P+DFGS   +  K  +E+Y  EVRM+YVMG+QL+++LQCE LV +F
Sbjct: 116 -----LNDGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEF 170

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDFQPR-AKYRDEPHTL 237
              P+ICG+AASIW RF+ +T +  + WA + + +SESQ L+ +  +   + ++DEPH  
Sbjct: 171 KATPIICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNF 230

Query: 238 HGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI 297
           +GQR V++W K LR+KIPL S+LA+SFLACHV REAILPTDI+KWS+EGK+PY+AAFV+I
Sbjct: 231 YGQRVVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDI 290

Query: 298 EKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQ 357
           E R G+TS AC +S   M RPS+    QKLES AASIA +IGL+LPPVNF+++A RYL +
Sbjct: 291 ESRIGKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNK 350

Query: 358 LCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWE 417
           L LP+ KILP A +I EWSMPPDLWLSTNE R P+RVCVMSILI+++RILYN++GFG WE
Sbjct: 351 LALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWE 410

Query: 418 KSLS---------SRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSK 468
           KSLS          +K  SS +N    F     S M+ + E      S  +   N   S 
Sbjct: 411 KSLSVDCASCFPPHQKTHSSPAN---NF-----SNMQADSENRPGFTSHDV--DNPSVSP 460

Query: 469 NSSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPH-DD 527
            + H   +E      L  +EARY++I +  TYEYSKDLPTYL+YCKDV FAG E    DD
Sbjct: 461 ENPHLTTTE-----FLRKIEARYHEIAE--TYEYSKDLPTYLQYCKDVAFAGSESLFIDD 513

Query: 528 PEEAIMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKR--SRDDLSLGSHYKESEKIREK 585
            +E  MIEK W +YQNEK+ +  ED  +      KR R  S D LS      ES+K++ +
Sbjct: 514 HDEQKMIEKLWNYYQNEKDYDQTEDVDQNAASNQKRLREGSNDRLS-----NESKKVKGE 568

Query: 586 EFTTRLS-ASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRL 644
           E   R+S  S +  + S D     S  G ++S   E+    +A   AS+ T I+ A+++L
Sbjct: 569 E--DRISRESLNNRTGSIDSRQSHSSKGLDNSDDDEQSSVDKA---ASSLTSINEAIRQL 623

Query: 645 KLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRS 704
           KLDME  RFCYI PR+  KR DYL Y RK DEGALTYAAHADYYILLRACA  AQVDIR 
Sbjct: 624 KLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLRACARAAQVDIRI 683

Query: 705 MHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCRFCSD 741
           MH+ VLS E+RL+WLE RI   L LTP S+TC FCSD
Sbjct: 684 MHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720


>gi|449470354|ref|XP_004152882.1| PREDICTED: TATA box-binding protein-associated factor RNA
           polymerase I subunit B-like [Cucumis sativus]
          Length = 735

 Score =  660 bits (1702), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/769 (47%), Positives = 476/769 (61%), Gaps = 83/769 (10%)

Query: 6   GDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAG-TGTALYQ 64
            D   L C  C ++G    +DGF+YC  C +  D+I++TGVA+ED +    G +G  +Y 
Sbjct: 2   ADPSNLSCYNCGSIGLTDGFDGFFYCLQCGSQADDIIDTGVAEEDLVLRDVGKSGAPIYS 61

Query: 65  ASHARQVANSRPANHFVPLSQQSSFW----SPLFDEPNTTTTTTTTANYNDLDSIIHRIK 120
            SH R+   +       PLSQ  S +    S  +D  N     +      D D ++    
Sbjct: 62  QSHTRR--RNPTVLKVEPLSQSQSLFGTSQSEFWDSLNLMEDPSGNVGGKDGDIVM---- 115

Query: 121 REEVSYTDMVGPTEPQDFGSSGPV--KADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  D VGPT P+DFGS   +  K  +E+Y  EVRM+YVMG+QL+++LQCE LV +F
Sbjct: 116 -----LNDGVGPTGPEDFGSGDVLSGKPSFEEYADEVRMRYVMGLQLIMELQCEVLVKEF 170

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDFQPR-AKYRDEPHTL 237
              P+ICG+AASIW RF+ +T +  + WA + + +SESQ L+ +  +   + ++DEPH  
Sbjct: 171 KATPIICGLAASIWLRFVTATRVFDEDWAFQTVQESESQCLDPERIRRVCSSHKDEPHNF 230

Query: 238 HGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI 297
           +GQR V++W K LR+KIPL S+LA+SFLACHV REAILPTDI+KWS+EGK+PY+AAFV+I
Sbjct: 231 YGQRVVVLWVKSLRKKIPLFSTLAVSFLACHVAREAILPTDIIKWSLEGKLPYYAAFVDI 290

Query: 298 EKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQ 357
           E R G+TS AC +S   M RPS+    QKLES AASIA +IGL+LPPVNF+++A RYL +
Sbjct: 291 ESRIGKTSRACPISSKLMHRPSRISSLQKLESLAASIAHTIGLNLPPVNFHSIACRYLNK 350

Query: 358 LCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWE 417
           L LP+ KILP A +I EWSMPPDLWLSTNE R P+RVCVMSILI+++RILYN++GFG WE
Sbjct: 351 LALPVDKILPHACRIYEWSMPPDLWLSTNELRLPSRVCVMSILIIAMRILYNLHGFGEWE 410

Query: 418 KSLS---------SRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGD-SNAKSS 467
           KSLS          +K  SS +N    F     S M+ + E     P     D  N   S
Sbjct: 411 KSLSVDCASCFPPHQKTHSSPAN---NF-----SNMQADSEN---RPGFTSHDVDNPSVS 459

Query: 468 KNSSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPH-D 526
             + H   +E      L  +EARY++I +  TYEYSKDLPTYL+YCKDV FAG E    D
Sbjct: 460 PENPHLTTTE-----FLRKIEARYHEIAE--TYEYSKDLPTYLQYCKDVAFAGSESLFID 512

Query: 527 DPEEAIMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKR--SRDDLSLGSHYKESEKIRE 584
           D +E  MIEK W +YQNEK+ +  ED  +      KR R  S D LS      ES+K++ 
Sbjct: 513 DHDEQKMIEKLWNYYQNEKDYDQTEDVDQNAASNQKRLREGSNDRLS-----NESKKVKG 567

Query: 585 KEFTTRLSASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDE------------AS 632
           +E                D   ++SLN    S  S +  +S+++D             AS
Sbjct: 568 EE----------------DRISRESLNNRTGSIDSRQSHSSKSLDNSDDDEQSSVDKAAS 611

Query: 633 AETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLR 692
           + T I+ A+++LKLDME  RFCYI PR+  KR DYL Y RK DEGALTYAAHADYYILLR
Sbjct: 612 SLTSINEAIRQLKLDMEEKRFCYIPPRINPKRFDYLHYSRKIDEGALTYAAHADYYILLR 671

Query: 693 ACATIAQVDIRSMHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCRFCSD 741
           ACA  AQVDIR MH+ VLS E+RL+WLE RI   L LTP S+TC FCSD
Sbjct: 672 ACARAAQVDIRIMHIGVLSLEKRLSWLEDRIHKSLRLTPTSITCEFCSD 720


>gi|114053441|gb|AAK91899.2|AC091627_12 hypothetical protein SDM1_2t00007 [Solanum demissum]
          Length = 664

 Score =  555 bits (1430), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 323/738 (43%), Positives = 436/738 (59%), Gaps = 91/738 (12%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQVA 72
           C+ C N+ F+   DGF+YCT C +   +I++TGV D+D +    GT    ++        
Sbjct: 9   CEVCGNLSFDDGGDGFFYCTRCGSQAVDIMDTGVDDDDLVNVDGGTTLPAFR-------- 60

Query: 73  NSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTDMVGP 132
                               +F+            N +D D         E +  D VGP
Sbjct: 61  --------------------VFE------------NLDDYD---------ERNGDDGVGP 79

Query: 133 TEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIW 192
             P DFGSS  +   Y DY+ E+R++YVMG+Q MIQ+QC+ LV+KFNV PLI G+A  IW
Sbjct: 80  AVPSDFGSS-QISLTYTDYYSEIRLRYVMGLQAMIQMQCKTLVEKFNVSPLIVGLAGPIW 138

Query: 193 FRFLASTGLLSQGWADEAIVQSESQ-ELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLR 251
            R LA   +LS  WAD+ + +SE+Q + E +  Q     + EPH L G+RAV IW+K LR
Sbjct: 139 LRLLAHENVLSDEWADDVVHESETQTQGEIELSQSTGSQKTEPHNLLGKRAVTIWHKSLR 198

Query: 252 QKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLS 311
             IPL  SLAISFL CHV REAILPTDI+KW++EGK+PYFAAF+EIEK+ G  S +C +S
Sbjct: 199 NLIPLPCSLAISFLVCHVAREAILPTDILKWTLEGKLPYFAAFLEIEKQLGPPSRSCPIS 258

Query: 312 PSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALK 371
            S MFRP ++V  QKLES AASIA  IGL LP VNF+A+A+RYLK L LP+ KILP+A +
Sbjct: 259 TSRMFRPIRTVTLQKLESLAASIARKIGLELPSVNFHAIAARYLKHLSLPVEKILPQACQ 318

Query: 372 IQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEK-SLSSRKFFSSTS 430
           + EWSMPP+L+LS N+ R P+RVCVMSILIV++RILY++NG G WE  S  S    S+  
Sbjct: 319 VYEWSMPPELYLSDNDSRLPSRVCVMSILIVTMRILYDLNG-GKWELISSCSNNLVSAVE 377

Query: 431 NIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEAR 490
           N  G    EC           SC+    + + ++ S  +  H   SE     LL  LE +
Sbjct: 378 NGAG----ECG---------FSCNARGAVAEEDSASRDSDPHDSTSE-----LLKILEEK 419

Query: 491 YNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEAA 550
           YN++ D  TY++SKDL +YL YCK+VVFAGLEP +DD EE  +IE FW FYQ+ K     
Sbjct: 420 YNELSD--TYDFSKDLQSYLLYCKNVVFAGLEPAYDDHEEERIIEDFWDFYQSHK----I 473

Query: 551 EDSGKRCGIASKR------KRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDD 604
               + C I ++            +L + +   +++K  E   T   S + ++   SG  
Sbjct: 474 FIHNQNCTITTRAVLNYLLTLQSINLLIYNQIIQADKASEDGKTD--SHTGNHFHHSGSR 531

Query: 605 YPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKR 664
           +   S   +N +   + Y    + D+  +    ++A+++LK DM  NRF YI PR  +K+
Sbjct: 532 HGSNSTTKENENFRDDGYKCKMSRDDGDS----NAALRQLKADMNENRFVYIPPRKNVKK 587

Query: 665 LD-YLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRI 723
            D Y++Y RK D GA  YA HADYYILLR+CA +AQVD+R+MH+ VL+FE+RLA LEKRI
Sbjct: 588 KDGYIRYARKKD-GAYLYAVHADYYILLRSCAKVAQVDVRTMHVGVLAFEKRLAMLEKRI 646

Query: 724 DHCLHLTPPSVTCRFCSD 741
           D CL    P   C FC D
Sbjct: 647 DFCLCKRLPDDFCEFCRD 664


>gi|357473537|ref|XP_003607053.1| hypothetical protein MTR_4g071660 [Medicago truncatula]
 gi|355508108|gb|AES89250.1| hypothetical protein MTR_4g071660 [Medicago truncatula]
          Length = 653

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 333/755 (44%), Positives = 438/755 (58%), Gaps = 132/755 (17%)

Query: 26  DGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQVANSRPANHFVPLSQ 85
           DGFYYC+ C   + ++V+TG  +ED  A GAG    +Y ASH R+ A    A +  P+SQ
Sbjct: 7   DGFYYCSACGEKNLDVVDTGAEEED--AIGAG----IYLASHQRRTAAPTDAVYVQPISQ 60

Query: 86  ----QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTDMVGPTEPQDFGSS 141
               QS+F   L  E ++                  ++K E V  +    P+ P DFG S
Sbjct: 61  CNPSQSNFLRKLGLEDDSQV----------------KVKAENVDQSQ-CDPSNPADFGGS 103

Query: 142 GPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGL 201
             V    E Y+ E+R++Y+MG+Q+MI+LQCEALV +F V PLICG+   IW RF++ TG+
Sbjct: 104 TVVSI--EQYYKEIRLRYIMGLQMMIELQCEALVKEFKVTPLICGLVGPIWLRFVSKTGV 161

Query: 202 LSQGWADEAIVQSESQ-ELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSL 260
               WAD+AI  SE Q E E +D+  R KY+ EPH + GQRA  IW++ LR +IP+  ++
Sbjct: 162 FDDDWADKAIHDSEMQNEGEPEDYNIRGKYKSEPHNMFGQRAAFIWFRSLRNRIPVVCTI 221

Query: 261 AISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSK 320
            +S+LACH+ REAI+P+D++KW+ EGK+PYF+AF+E+E R G   VAC +S S MFRP +
Sbjct: 222 VVSYLACHIAREAIMPSDMIKWTCEGKLPYFSAFLELESRMG-PPVACPISSSLMFRPQR 280

Query: 321 SVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMPPD 380
           ++   KLES A+SI++ IGL LPPVNFYAL              ILP A +I EWSM PD
Sbjct: 281 ALSVHKLESCASSISQFIGLELPPVNFYAL--------------ILPYACRIYEWSMSPD 326

Query: 381 LWLSTNECRF--PTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSSTSNIGGKFDP 438
           LWLS ++  F  PT VCV+S+L+V+IRILYNING+G WEKSLS                 
Sbjct: 327 LWLSLSKDYFKLPTHVCVVSVLVVAIRILYNINGYGEWEKSLSH---------------- 370

Query: 439 ECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEARYNDIHDSV 498
                                 DS   S+K+    ++ ELD   LL  L A YN+I D  
Sbjct: 371 ---------------------NDSAKDSAKDPVEQQKHELDCTGLLQHLHAIYNEIAD-- 407

Query: 499 TYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEAAEDSGKRCG 558
           T+EYSKDLPTYLKYCKDVVFAGLEP     EE  M+E  WK YQNE   E  ED      
Sbjct: 408 THEYSKDLPTYLKYCKDVVFAGLEPSLGSYEETNMMEILWKHYQNE---EVIED------ 458

Query: 559 IASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQSLNGDNSSKS 618
                    ++L    + K SE   EK++         + SFSG     +   G  S + 
Sbjct: 459 --------LNELLHRHNTKPSES--EKQY---------HSSFSGTGSRDEGCVGKTSKR- 498

Query: 619 SEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGA 678
             E    + + + +    +  A+++LKLDME NRFCYI P V+ ++L Y+ YVRK D+GA
Sbjct: 499 --EKKKRKCLSDDNPSGSLPEAIRQLKLDMEENRFCYIPPSVKQEKLGYVHYVRKKDKGA 556

Query: 679 LTYAAHADYYILLRACATIAQV-DIRSMHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCR 737
           L+Y AHADYYILLRA A  A V DIR +H+ VL+ ERRLAWLEKRI  CLHL PP+++C 
Sbjct: 557 LSYVAHADYYILLRAFARAAHVDDIRILHIGVLNLERRLAWLEKRIGQCLHLKPPNISCE 616

Query: 738 FCSDPVELAAGDPICNATEDSAD---DHIGLSNLN 769
           FCS             ATE+ +D   D   LSNLN
Sbjct: 617 FCS-----------VRATENGSDHMMDEDELSNLN 640


>gi|22325438|ref|NP_671769.1| maternal effect embryo arrest 12 protein [Arabidopsis thaliana]
 gi|75111159|sp|Q5XVF0.1|MEE12_ARATH RecName: Full=TATA box-binding protein-associated factor RNA
           polymerase I subunit B; AltName: Full=Central cell
           guidance protein; AltName: Full=Protein MATERNAL EFFECT
           EMBRYO ARREST 12; AltName: Full=TATA box-binding
           protein-associated factor 1B; Short=TBP-associated
           factor 1B
 gi|52354243|gb|AAU44442.1| hypothetical protein AT2G02955 [Arabidopsis thaliana]
 gi|330250550|gb|AEC05644.1| maternal effect embryo arrest 12 protein [Arabidopsis thaliana]
          Length = 666

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 306/738 (41%), Positives = 424/738 (57%), Gaps = 93/738 (12%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ-- 70
           C +C+N  F+   DG+YYC  C    + +++TGV D D +  G GT  ALY   H R   
Sbjct: 3   CTECENDAFDEEDDGYYYCQRCGVQVENLIQTGVDDGDLIGEGGGTQGALYNPKHRRTEP 62

Query: 71  --VANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTD 128
             +  S+P   F   + + S +   F+  N                +   +KR   SY D
Sbjct: 63  QPITPSQP--RFTDDTSRYSQFKSQFESENGNK------------ELPREVKRAPDSYVD 108

Query: 129 MVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVA 188
              PTEP DF +       YE+Y+ E R +YV    +MI  QC+ALVDKFNV PLI G+ 
Sbjct: 109 K-EPTEPVDFAAE---TLSYENYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLIIGLV 164

Query: 189 ASIWFRFLASTGLLSQGWADEAIVQSE--SQELESKDFQPRAKYRDEPHTLHGQRAVMIW 246
             I  R++A +G+  + WA+ AI  SE  S++ E KD +   +++ EP  + G+RAV IW
Sbjct: 165 GPISLRYVALSGVYHKDWANNAIRDSEHQSEDGEVKDAKRLKRHKAEPRNIDGKRAVTIW 224

Query: 247 YKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSV 306
           + +L++ +PLSSSL ISFLACH     +LPTDIV+W+ EGK+PY + F++I ++ G+ S 
Sbjct: 225 FGILKKTMPLSSSLVISFLACHQAGAPVLPTDIVRWAREGKLPYLSCFLDIREQMGERSA 284

Query: 307 ACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKIL 366
           AC +  S M RP + + +Q LE+ A+ IA++IGL LPPVNFY +AS Y+KQL +P  KIL
Sbjct: 285 ACPVKVSIMARPFQVISAQMLEARASVIADTIGLPLPPVNFYGIASNYIKQLSIPEDKIL 344

Query: 367 PRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFF 426
             A  IQ WS+PP+L+LSTNE + P+RVCVMSILIV+IR+LYNING G WE+SL     F
Sbjct: 345 DLARLIQNWSLPPELYLSTNEQKLPSRVCVMSILIVAIRMLYNINGLGVWERSLG----F 400

Query: 427 SSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYD 486
            + S+                            GDS   S       K +E D + LL +
Sbjct: 401 VNASD----------------------------GDSETNSGTAE---KATEFDTQELLKN 429

Query: 487 LEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAI-MIEKFWKFYQNEK 545
           LEA+Y+++  + T E  KDL +YL   K+  FAGLE   D P++   +++  W  Y  ++
Sbjct: 430 LEAKYHEVA-AETLESEKDLVSYLSLGKNEFFAGLE--EDSPDDTYRIVDNLWNGYPKDE 486

Query: 546 ESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYES-FSGDD 604
           + E             KR R  DD                     L+  + Y+S FS  +
Sbjct: 487 DIEC----------LPKRGRDWDD------------------DVSLNQLSLYDSRFSDGN 518

Query: 605 YPQQSLNGDNSSKS-SEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIK 663
            P  S +  N S S   +  +SE  + +S E + + A+KRL  DM ++ FCYI PRV++K
Sbjct: 519 NPCSSSSRRNESVSIGLDLSSSEHRESSSPEKLKEIAIKRLITDMGDDLFCYIPPRVKVK 578

Query: 664 RLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRI 723
           RLDYLQYVRK ++GAL YAAHADYYILLR CA +A++D+R+MH  VLSFERRLAW+EKRI
Sbjct: 579 RLDYLQYVRKKEDGALIYAAHADYYILLRVCAKVAEIDVRNMHRGVLSFERRLAWIEKRI 638

Query: 724 DHCLHLTPPSVTCRFCSD 741
           D  LHLT P +TC+ C D
Sbjct: 639 DQVLHLTRPLMTCKHCCD 656


>gi|297814596|ref|XP_002875181.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297321019|gb|EFH51440.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 698

 Score =  513 bits (1322), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 314/741 (42%), Positives = 433/741 (58%), Gaps = 68/741 (9%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQVA 72
           C++CDN  F+   DG++YC  C    +++++TGV D D +  G GT  A+Y   + R + 
Sbjct: 3   CRECDNDAFDEEDDGYFYCKRCGVQAEDLIQTGVDDGDLIGDGGGTHGAIYDPRYRRTIT 62

Query: 73  NSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANY-NDLDSIIHR-------IKREEV 124
                    P++           +P  T  T    ++ + L+S I +       +KRE  
Sbjct: 63  Q--------PITPS---------QPRYTDDTIRYTHFKSQLESEIEKKKDLPREVKREPE 105

Query: 125 SYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLI 184
            Y +   PTEP DFG+       YEDY+ E R +YV    +MI  QC+ALVDKFNV PLI
Sbjct: 106 IYLEK-EPTEPVDFGAE---TLSYEDYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLI 161

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSE--SQELESKDFQPRAKYRDEPHTLHGQRA 242
            G+   I  R++A +G+    WAD+AI  SE  S++ E KD +     + EP  L G+RA
Sbjct: 162 IGLVGPISLRYVALSGVYDDDWADKAIRDSELQSEDGEVKDAKRPKTQKAEPRNLDGKRA 221

Query: 243 VMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFG 302
           V IW  LL++ +PLSSSLAISFLACH     +LPTDIV+W+ EGK+PY + F++I ++ G
Sbjct: 222 VTIWVSLLKKTLPLSSSLAISFLACHHAGAPVLPTDIVRWAREGKLPYLSCFIDIREQMG 281

Query: 303 QTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 362
           + S AC +  S M RP + + +Q LE+ AA IA+ IGL L PVNFY +AS Y+K+L +P 
Sbjct: 282 ERSAACPVKASIMSRPFQIISAQMLEARAACIADIIGLPLSPVNFYGIASNYIKRLSIPE 341

Query: 363 GKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSS 422
            KIL     IQ WS+PP+L+LS NE + PTRVCVMSILIV+IR+LYNING G WE+SL  
Sbjct: 342 DKILELVRLIQNWSLPPELYLSKNELKLPTRVCVMSILIVAIRMLYNINGLGLWERSLE- 400

Query: 423 RKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEA 482
              F + S     F+ E +S + DE           + ++N+ +++ S   K ++ D E 
Sbjct: 401 ---FVNASEADIDFE-EFDSPVHDE----------EVSNTNSGTAELSDVTKATKFDTEE 446

Query: 483 LLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAI-MIEKFWKFY 541
           LL +LEA+Y ++  + T E  KDL +YL   K+  FAGLE   D P++    ++  W  Y
Sbjct: 447 LLKNLEAKYYEVA-AETLESEKDLLSYLSLGKNEFFAGLE--EDLPDDTYRTVDNLWNGY 503

Query: 542 QNEKESEAAEDSGKRCGIASKRKRS-RDDLSLGSHYKESEKIREKEFTTRLSASADYESF 600
             +++ E       RC    KR R   DD+SL     E  K  +       S S   ES 
Sbjct: 504 PKDEDFE-------RCRTPPKRGRDWDDDVSLNQLSLEDSKFSDGNNPCS-SPSRRNESV 555

Query: 601 SGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRV 660
           S D     S +G++SS  +  +           E + + A+KRL  DM +N FCYI PRV
Sbjct: 556 SMDRDLSSSEHGESSSPENHHH---------QEEKLKERAIKRLITDMGDNLFCYIPPRV 606

Query: 661 QIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLE 720
           ++KRLDYLQYVRK ++GAL Y AHADYYILLR CA +A++D+R+MH  VLSFERRLAW+E
Sbjct: 607 KVKRLDYLQYVRKKEDGALIYTAHADYYILLRVCAKVAEIDVRNMHRGVLSFERRLAWIE 666

Query: 721 KRIDHCLHLTPPSVTCRFCSD 741
           KRID  LHL PP +TC  C D
Sbjct: 667 KRIDQVLHLPPPFMTCDHCCD 687


>gi|357490217|ref|XP_003615396.1| hypothetical protein MTR_5g067430 [Medicago truncatula]
 gi|355516731|gb|AES98354.1| hypothetical protein MTR_5g067430 [Medicago truncatula]
          Length = 734

 Score =  484 bits (1245), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 291/741 (39%), Positives = 394/741 (53%), Gaps = 143/741 (19%)

Query: 7   DIRRLKCKKCDNVGF-ECAYDGFYYCTHCNA--LDDEIVETGVADEDFMATGAGTGTALY 63
           DIR LKC+    VG+ + + DGF+YC  CN         +     + ++       +A  
Sbjct: 84  DIRILKCQNYLFVGYVDDSDDGFFYCPKCNVRFDGVFDNDVDDVIDVYILRFRRKKSAAI 143

Query: 64  QASHARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREE 123
           +A       NS+P + +         WS      N      T  N +D+  I   I    
Sbjct: 144 KAE-----PNSQPCSEY-------DAWSKFV---NNLLEDETQQN-DDIHIIEQNIDGTS 187

Query: 124 VSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPL 183
           VS+        P DFGSS    + Y   H ++RM+++MG+Q+MI+LQC+ALV +F V PL
Sbjct: 188 VSH--------PSDFGSSKHKTSKYYHNHNQIRMRHLMGLQMMIELQCKALVKEFKVTPL 239

Query: 184 ICGVAASIWFRFLASTGLLSQGWADEAIVQSESQ-ELESKDFQPRAKYRDEPHTLHGQRA 242
           ICG+   IW RF++ TG+     A   +  SE Q E E +D+  R KYR+EPH + GQR 
Sbjct: 240 ICGLVGPIWLRFISKTGVFDDDRAYTVMHNSEMQKEGEPEDYTIRGKYREEPHNMFGQRF 299

Query: 243 VMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFG 302
           VMIW + L+ +IPL               EAILP+D++KW++EGK+PYF+AFVE+E+   
Sbjct: 300 VMIWLRYLKNRIPL---------------EAILPSDMMKWTLEGKLPYFSAFVELERLMK 344

Query: 303 QTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 362
             S+AC +S S MF+P +++P QKLESFAASIA  I            A RYLK+L LP+
Sbjct: 345 HESLACPISSSAMFKPQRAIPVQKLESFAASIAHCI------------AYRYLKKLSLPV 392

Query: 363 GKILPRALKIQEWSMPPDLWLSTNE--CRFPTRVCVMSILIVSIRILYNINGFGAWEKSL 420
            KILP A +I EW MPPDLWLS  +   R PT VC +SIL+++IRILYNING+G WEKSL
Sbjct: 393 EKILPYACRIYEWLMPPDLWLSFTKDYFRLPTHVCAVSILVIAIRILYNINGYGEWEKSL 452

Query: 421 SSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDA 480
           S                  CN   +D             G+ +   + +  HF     D 
Sbjct: 453 S------------------CNVGDKDN------------GEMDTTFASHDEHFLVKIAD- 481

Query: 481 EALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAG-LEPPHDDPEEAIMIEKFWK 539
                              +EYSKDLPTYLKY +DVVF G  E  +D+ EE  MIE    
Sbjct: 482 ------------------NHEYSKDLPTYLKYRRDVVFDGSKEQSYDNYEEQRMIEILRN 523

Query: 540 FYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYES 599
           FYQNEK +    +   +   +  R R  D+     H + + K++                
Sbjct: 524 FYQNEKNTTKPSEHAGKSNTSFNRLRLTDE----GHIRRTSKMK---------------- 563

Query: 600 FSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPR 659
            +   Y + S+   N                 S + ++D A+ ++KLDME N FCY+ PR
Sbjct: 564 -NTSPYGEDSVFNKNG---------------GSGKAVVDEAITQMKLDMEENLFCYVPPR 607

Query: 660 VQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWL 719
           V+ K LDY+ YVRK D+GA+TY AHADYYILLR CA IAQVDIR + + VL  E+RLAWL
Sbjct: 608 VKPKNLDYVHYVRKKDKGAMTYVAHADYYILLRTCARIAQVDIRILQIGVLRLEKRLAWL 667

Query: 720 EKRIDHCLHLTPPSVTCRFCS 740
           EKR+D CLHL P S++C+FCS
Sbjct: 668 EKRVDQCLHLKPSSISCQFCS 688


>gi|52354245|gb|AAU44443.1| hypothetical protein AT2G02955 [Arabidopsis thaliana]
          Length = 526

 Score =  457 bits (1176), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 259/582 (44%), Positives = 356/582 (61%), Gaps = 71/582 (12%)

Query: 165 LMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSE--SQELESK 222
           +MI  QC+ALVDKFNV PLI G+   I  R++A +G+  + WA+ AI  SE  S++ E K
Sbjct: 1   MMITYQCDALVDKFNVTPLIIGLVGPISLRYVALSGVYHKDWANNAIRDSEHQSEDGEVK 60

Query: 223 DFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKW 282
           D +   +++ EP  + G+RAV IW+ +L++ +PLSSSL ISFLACH     +LPTDIV+W
Sbjct: 61  DAKRLKRHKAEPRNIDGKRAVTIWFGILKKTMPLSSSLVISFLACHQAGAPVLPTDIVRW 120

Query: 283 SIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHL 342
           + EGK+PY + F++I ++ G+ S AC +  S M RP + + +Q LE+ A+ IA++IGL L
Sbjct: 121 AREGKLPYLSCFLDIREQMGERSAACPVKVSIMARPFQVISAQMLEARASVIADTIGLPL 180

Query: 343 PPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIV 402
           PPVNFY +AS Y+KQL +P  KIL  A  IQ WS+PP+L+LSTNE + P+RVCVMSILIV
Sbjct: 181 PPVNFYGIASNYIKQLSIPEDKILDLARLIQNWSLPPELYLSTNEQKLPSRVCVMSILIV 240

Query: 403 SIRILYNINGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDS 462
           +IR+LYNING G WE+SL     F + S+                            GDS
Sbjct: 241 AIRMLYNINGLGVWERSLG----FVNASD----------------------------GDS 268

Query: 463 NAKSSKNSSHFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLE 522
              S       K +E D + LL +LEA+Y+++  + T E  KDL +YL   K+  FAGLE
Sbjct: 269 ETNSGTAE---KATEFDTQELLKNLEAKYHEVA-AETLESEKDLVSYLSLGKNEFFAGLE 324

Query: 523 PPHDDPEEAI-MIEKFWKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEK 581
              D P++   +++  W  Y  +++ E             KR R  DD            
Sbjct: 325 --EDSPDDTYRIVDNLWNGYPKDEDIEC----------LPKRGRDWDD------------ 360

Query: 582 IREKEFTTRLSASADYES-FSGDDYPQQSLNGDNSSKS-SEEYPNSEAIDEASAETIIDS 639
                    L+  + Y+S FS  + P  S +  N S S   +  +SE  + +S E + + 
Sbjct: 361 ------DVSLNQLSLYDSRFSDGNNPCSSSSRRNESVSIGLDLSSSEHRESSSPEKLKEI 414

Query: 640 AVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQ 699
           A+KRL  DM ++ FCYI PRV++KRLDYLQYVRK ++GAL YAAHADYYILLR CA +A+
Sbjct: 415 AIKRLITDMGDDLFCYIPPRVKVKRLDYLQYVRKKEDGALIYAAHADYYILLRVCAKVAE 474

Query: 700 VDIRSMHMAVLSFERRLAWLEKRIDHCLHLTPPSVTCRFCSD 741
           +D+R+MH  VLSFERRLAW+EKRID  LHLT P +TC+ C D
Sbjct: 475 IDVRNMHRGVLSFERRLAWIEKRIDQVLHLTRPLMTCKHCCD 516


>gi|224112275|ref|XP_002316139.1| predicted protein [Populus trichocarpa]
 gi|222865179|gb|EEF02310.1| predicted protein [Populus trichocarpa]
          Length = 582

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 229/427 (53%), Positives = 288/427 (67%), Gaps = 39/427 (9%)

Query: 313 SFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKI 372
           S MFRP ++VP QKLES AASIAESIGLHLPPV FY +ASRYLKQL LP+ KILP A +I
Sbjct: 175 SLMFRPVQAVPIQKLESMAASIAESIGLHLPPVYFYGIASRYLKQLSLPVEKILPHACRI 234

Query: 373 QEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSSTSNI 432
            EWSMPPD W STNE R P+RVCV+SILIV+IRI+YNINGFGAWE SLS    ++S    
Sbjct: 235 YEWSMPPDQWFSTNELRLPSRVCVLSILIVAIRIIYNINGFGAWEGSLSG---YTS---- 287

Query: 433 GGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEARYN 492
             + D  C+S+MR  V+E        + DS+ KS  N SH + SELD+  LL +L+A+YN
Sbjct: 288 --RLDFPCSSQMR--VDEKDSGSPHKLDDSDEKSIGNRSHVQRSELDSAELLRNLDAKYN 343

Query: 493 DIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEAAED 552
           +I D  TYE+++DLP+YL+YC +VVF G    H +  E  +I+K W FYQ+EK S+ AED
Sbjct: 344 EISD--TYEFTRDLPSYLQYCNEVVFCGAGSSHVNHREDELIKKLWDFYQSEKYSDPAED 401

Query: 553 SGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQSLNG 612
            G +       KRSR+D  L       +KIRE+   T    S D  + +GD+      N 
Sbjct: 402 DGAQNRTVFNGKRSRNDNGLARDQMGKKKIREE---THECLSTDTCNSTGDE------NN 452

Query: 613 DNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQYVR 672
           D+S+                 ET+ D A+++LKLDM  NRF YI PR++IKR DYL YVR
Sbjct: 453 DDST-----------------ETVKDKAIRKLKLDMAENRFYYIPPRLKIKRFDYLHYVR 495

Query: 673 KSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDHCLHLTPP 732
           +  +GALTY AHADYYILLR+ A  AQVD R MH+ V++FERR+AW+EKRID+CLHLTPP
Sbjct: 496 RVGDGALTYVAHADYYILLRSFARAAQVDTRIMHIGVMNFERRVAWMEKRIDYCLHLTPP 555

Query: 733 SVTCRFC 739
           S TC +C
Sbjct: 556 SFTCEYC 562



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 53/96 (55%), Positives = 72/96 (75%)

Query: 129 MVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVA 188
           + GPTEP+DFG     K  YEDY+ EVR+KYV+GM LMI+LQCEALV+KFNV PLI GVA
Sbjct: 68  VFGPTEPEDFGGLSKAKLSYEDYYNEVRIKYVLGMGLMIRLQCEALVEKFNVTPLIVGVA 127

Query: 189 ASIWFRFLASTGLLSQGWADEAIVQSESQELESKDF 224
           +++W RFL +TG+    WA++ ++ SE Q+ E+ + 
Sbjct: 128 STVWLRFLLTTGVFKDSWANDVLIDSEMQKPETTEL 163



 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 27/51 (52%), Positives = 34/51 (66%), Gaps = 1/51 (1%)

Query: 1  MADGEGDIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDF 51
          M   EG++R L+C+ C  +G E + DGFYYC  C A    I+ETGVADEDF
Sbjct: 1  MEREEGEVRNLQCESCGYIGLEES-DGFYYCQQCGAQAQGIMETGVADEDF 50


>gi|297814209|ref|XP_002874988.1| hypothetical protein ARALYDRAFT_912098 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320825|gb|EFH51247.1| hypothetical protein ARALYDRAFT_912098 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 546

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 253/605 (41%), Positives = 341/605 (56%), Gaps = 118/605 (19%)

Query: 133 TEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIW 192
           T P+   +  PVK  YEDY+ E R +YV G+ +MI  QC+ALV+KFNV PLI G      
Sbjct: 55  TRPEPAPNPEPVKLSYEDYYKETRDRYVKGLLMMITYQCDALVEKFNVTPLIIG------ 108

Query: 193 FRFLASTGLLSQGWADEAIVQSESQELESKDFQP---RAKYRDEPHTLHGQRAVMIWYKL 249
                   L+   WAD+AI++SE Q  E +  +P   R K  +EP  L+G+RAV IW   
Sbjct: 109 --------LVDDDWADKAILESELQS-EGEVREPEVKRGKASEEPQNLYGKRAVTIWLSQ 159

Query: 250 LRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACS 309
           L++ +PLSSSLA+SFLACH   +A  PTDIV+W+ +GK PY ++F++I ++ G  S AC 
Sbjct: 160 LKKSLPLSSSLALSFLACH---KAGTPTDIVRWARQGKFPYLSSFLKIREQMGDRSAACP 216

Query: 310 LSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRA 369
           +S S MF+PS+ V +Q+LE+  ASIA+ IGL LPPVNFYA+AS YL++L +P  K+L   
Sbjct: 217 VSVSVMFKPSQIVSAQRLEAQVASIADIIGLPLPPVNFYAIASNYLERLSIPKEKVLDLV 276

Query: 370 LKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSST 429
             +  WSMP DL+LS NE R PTRV VMSI+IV+IR+LYNINGFG WE+SL         
Sbjct: 277 RLLHNWSMPSDLYLSKNELRLPTRVYVMSIIIVAIRMLYNINGFGVWEQSLG-------- 328

Query: 430 SNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEA 489
                         + D  EE     +                        E LL +LE 
Sbjct: 329 --------------LLDASEEDVTKAT------------------------EELLKNLET 350

Query: 490 RYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEA 549
           +Y+++  + T E+ KDL +YL + K+ +FAGLE    D +    ++  W  Y  ++    
Sbjct: 351 KYHEVA-AETVEHEKDLLSYLLHGKNEIFAGLEEASAD-DTYRTVDNLWNSYPKDQ---- 404

Query: 550 AEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQS 609
             D    C   S++  +                                          S
Sbjct: 405 LRDRNNPCSSRSRKSET-----------------------------------------GS 423

Query: 610 LNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRLDYLQ 669
            + D  S  S   P+    +E S E +I    +RL  DM +N F YI PRV++KRLDYLQ
Sbjct: 424 TDCDEPSLESVSSPDKHNHEEKSKEKVI----RRLITDMRDNLFQYIPPRVKVKRLDYLQ 479

Query: 670 YVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDHCLHL 729
           YVRK+D+GAL YAAHADYYILLR CA +A++D R+MH +VLSFERRLAW+EK+IDH LHL
Sbjct: 480 YVRKNDDGALIYAAHADYYILLRVCAMVAEIDARNMHRSVLSFERRLAWIEKKIDHVLHL 539

Query: 730 TPPSV 734
           TP S+
Sbjct: 540 TPLSI 544



 Score = 42.7 bits (99), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 23/39 (58%)

Query: 13 CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDF 51
          C  C++  FE   DGF+YC  C+   D I++TGV  ED 
Sbjct: 3  CHICESDSFEDGGDGFFYCLQCSTQADGIIQTGVDGEDL 41


>gi|384955472|sp|Q5W770.2|TAF1B_ORYSJ RecName: Full=TATA box-binding protein-associated factor RNA
           polymerase I subunit B; AltName: Full=TATA box-binding
           protein-associated factor 1B; Short=TBP-associated
           factor 1B
          Length = 634

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 225/609 (36%), Positives = 335/609 (55%), Gaps = 98/609 (16%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 100 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 159

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDFQPRAKYRDEPHTLHG--QRAV 243
            +A +IW R++A++ +  + W  + +  + S E      +  +  +D+   L G  Q++ 
Sbjct: 160 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVE------EGHSASKDKQSELEGDAQKSQ 213

Query: 244 ----MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 299
                ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K
Sbjct: 214 SSYEFLFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDK 273

Query: 300 RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLC 359
             G +   C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L 
Sbjct: 274 LLGSSLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELS 333

Query: 360 LPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKS 419
           LP+ KILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE  
Sbjct: 334 LPIEKILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES- 392

Query: 420 LSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELD 479
                  + T N  G  DPE +            +P S+  DSN     NS  F     D
Sbjct: 393 ------IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----D 423

Query: 480 AEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWK 539
           A  LL  L A Y+ I+  V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW 
Sbjct: 424 ARELLCTLAASYDKIN--VGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWD 478

Query: 540 FYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYES 599
            Y+  KE    +++ K C                      EK+R                
Sbjct: 479 MYKG-KEVMLLDENAKLC---------------------QEKLR---------------- 500

Query: 600 FSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPR 659
                    + NG N       + ++    +  +  + + A++ +K  ME N FCY+SPR
Sbjct: 501 ---------TTNGVNKRCRDGRFADT----KCCSTPLGNCALQSIKSKMEENGFCYVSPR 547

Query: 660 VQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWL 719
            ++    YL Y R+   G+L Y AHADYYILLR  A +A+VD+R +H +VL  ERRL W+
Sbjct: 548 KRLVSDGYLLYTRRESSGSLIYVAHADYYILLRPFAKLAEVDVRVLHSSVLKLERRLGWI 607

Query: 720 EKRIDHCLH 728
           E+R+   L+
Sbjct: 608 EERVGRSLN 616


>gi|384952682|sp|B8AX23.1|TAF1B_ORYSI RecName: Full=TATA box-binding protein-associated factor RNA
           polymerase I subunit B; AltName: Full=TATA box-binding
           protein-associated factor 1B; Short=TBP-associated
           factor 1B
 gi|218196609|gb|EEC79036.1| hypothetical protein OsI_19584 [Oryza sativa Indica Group]
          Length = 634

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 225/609 (36%), Positives = 334/609 (54%), Gaps = 98/609 (16%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 100 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 159

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDFQPRAKYRDEPHTLHG--QRAV 243
            +A +IW R++A++ +  + W  + +  + S E      +  +  +D+   L G  Q++ 
Sbjct: 160 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVE------EGHSASKDKQSELEGDAQKSQ 213

Query: 244 ----MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 299
                ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K
Sbjct: 214 SSYEFLFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDK 273

Query: 300 RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLC 359
             G +   C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L 
Sbjct: 274 LLGSSLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELS 333

Query: 360 LPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKS 419
           LP+ KILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE  
Sbjct: 334 LPIEKILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES- 392

Query: 420 LSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELD 479
                  + T N  G  DPE +            +P S+  DSN     NS  F     D
Sbjct: 393 ------IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----D 423

Query: 480 AEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWK 539
           A  LL  L A Y+ I   V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW 
Sbjct: 424 ARELLCTLAASYDKI--DVGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWD 478

Query: 540 FYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYES 599
            Y+  KE    +++ K C                      EK+R                
Sbjct: 479 MYKG-KEVMLLDENAKLC---------------------QEKLR---------------- 500

Query: 600 FSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPR 659
                    + NG N       + ++    +  +  + + A++ +K  ME N FCY+SPR
Sbjct: 501 ---------TTNGVNKRCRDGRFADT----KCCSTPLGNCALQSIKSKMEENGFCYVSPR 547

Query: 660 VQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWL 719
            ++    YL Y R+   G+L Y AHADYYILLR  A +A+VD+R +H +VL  ERRL W+
Sbjct: 548 KRLVSDGYLLYTRRESSGSLIYVAHADYYILLRPFAKLAEVDVRVLHSSVLKLERRLGWI 607

Query: 720 EKRIDHCLH 728
           E+R+   L+
Sbjct: 608 EERVGRSLN 616


>gi|357119318|ref|XP_003561389.1| PREDICTED: uncharacterized protein LOC100826720 [Brachypodium
           distachyon]
          Length = 623

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 250/730 (34%), Positives = 356/730 (48%), Gaps = 148/730 (20%)

Query: 11  LKCKKCDNVGF---ECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASH 67
           L+C +C        + A DGF+ C  C A+      T     DF                
Sbjct: 17  LRCNECSATEVYSPDDAVDGFFSCRRCYAVHTSTQVTATDPNDF---------------- 60

Query: 68  ARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYT 127
                +S P    VP +Q     +P +  P+      T   ++D D              
Sbjct: 61  ----NDSMPFLRRVP-TQTPQTPAP-YRTPHPAHGGPTARPFDDFDE------------- 101

Query: 128 DMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGV 187
               P+ PQDF        + E+   +VR +YV+G+Q+++Q Q E LV++  V  LICGV
Sbjct: 102 ----PSAPQDFAPGADAWGNPEELVAQVRRRYVLGLQVILQRQLEVLVERHRVGALICGV 157

Query: 188 AASIWFRFLASTGLLSQGWADEAIVQSESQEL------ESKDFQPRAKYRDEPHTLHG-- 239
           A +IW R +A++ +  + W  + + Q E+ E       + K  Q + +  D      G  
Sbjct: 158 AGTIWVRLVAASKVFDEMWPQKVLAQDEAVEKLKRSASDQKPHQVKYELADAALARKGTR 217

Query: 240 QRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 299
           +R   I+ + LR  +PL S+LA+SFLACH+ REAILPTDI +W+ EGK+PY AAF E++K
Sbjct: 218 KRCEFIFLRSLRTMLPLYSTLAVSFLACHIAREAILPTDISRWATEGKLPYVAAFTEVDK 277

Query: 300 RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLC 359
             G +++ C L    +FRP + + + +LE+ A SIA+ IGL LP VNFYA+A RYL +L 
Sbjct: 278 LLG-SALKCPLDARQLFRPVRVIGTWQLEAAAGSIAQRIGLRLPSVNFYAIAQRYLNELS 336

Query: 360 LPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKS 419
           LP+ +ILP A  I EW+MP +LWLS+N  R PTRV VM+ILIVS+R LYNING G WE+ 
Sbjct: 337 LPVERILPHACHIYEWAMPAELWLSSNSDRIPTRVYVMAILIVSLRFLYNINGQGIWEE- 395

Query: 420 LSSRKFFSSTSNIGGKFDPECNS--KMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESE 477
                         G  DP+ NS   M+ E              +NA  +K         
Sbjct: 396 ------ICEAGITAGGSDPDANSLPSMKPECR------------TNAFGTKE-------- 429

Query: 478 LDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKF 537
                LL  L    + I   V ++YSKDL +YLKYCKDVVF G+       EEA +IE F
Sbjct: 430 -----LLCTLAEACDKI--DVGHDYSKDLRSYLKYCKDVVFPGIA---YSVEEAHLIEIF 479

Query: 538 WKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADY 597
              Y+      A ED                         ++ K+  +E           
Sbjct: 480 HDMYK------AGED-------------------------DNPKVHMEEM---------- 498

Query: 598 ESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYIS 657
                     Q+ NG N       +  +     +       S ++R+K +ME++ FCY+ 
Sbjct: 499 ----------QTTNGVNKRCQDGTFVGARRFSTS-------SGIQRIKSEMEDHGFCYMP 541

Query: 658 PRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLA 717
           P   ++  +YL Y RK+  G+L   +HADYYIL+RA A +A+VDIR +H +VL  ERRLA
Sbjct: 542 PGRGLRSDNYLHYRRKTVSGSLVSVSHADYYILIRAFAKLAEVDIRIIHASVLKLERRLA 601

Query: 718 WLEKRIDHCL 727
            +EKRI   L
Sbjct: 602 SIEKRIGRSL 611


>gi|413944896|gb|AFW77545.1| hypothetical protein ZEAMMB73_711148 [Zea mays]
          Length = 690

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 235/722 (32%), Positives = 351/722 (48%), Gaps = 139/722 (19%)

Query: 4   GEGDIRRLKCKKC---DNVGFECAYDGFYYCTHCNALDDEIVETGVAD-EDFMATGAGTG 59
           G G I  L C  C   DN   + A DG + C  C+A+     +   AD  DF ATG    
Sbjct: 14  GGGSIH-LVCDHCGTSDNYTTDDADDGQFTCRTCSAV--HTTQATAADPHDFPATG---- 66

Query: 60  TALYQASHARQVANSRPANHFVP-LSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHR 118
                      ++  R A    P L  ++   +P    P       T A           
Sbjct: 67  ----------NISVRRVATQPTPKLGARTP--APYLRTPQAAPVPATAA----------- 103

Query: 119 IKREEVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  + D    +EP+DF S      + ED   +VR +YV G+Q+++Q Q E LV++ 
Sbjct: 104 -------FDDFTELSEPRDFVSGSGTCGEPEDLAVQVRWRYVRGLQVILQRQLEVLVERH 156

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELES--KDFQP-RAKYR-DEP 234
            V   +C VA  +W R++A++ +    WA + +   ++   +    D +P   KY  D+ 
Sbjct: 157 QVGAFVCAVAGIVWVRWVAASRVFDGIWARQVLEDHKAVGRQKCYDDKKPDEMKYEWDDD 216

Query: 235 HTLHGQ---RAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYF 291
            +L  +   +    + + LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IPY 
Sbjct: 217 MSLQRKDRHKLEFAFLRSLRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIPYV 276

Query: 292 AAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALA 351
           AAF E+++  G +   C L    +FRP + + + +LE+ A SIA+ + L LP VNFYA+A
Sbjct: 277 AAFTEVDRFLGSSFQDCPLDARQLFRPVRVIGAWQLEAAAGSIAQRVALRLPSVNFYAIA 336

Query: 352 SRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNIN 411
            R LK L LP+ +ILP A +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNIN
Sbjct: 337 QRCLKDLLLPIDRILPHACQIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNIN 396

Query: 412 GFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSS 471
           G G WEK                             + E   +PS    D+N+ +S+   
Sbjct: 397 GQGIWEK-----------------------------ICEEGRNPSESHYDANSSTSRKLE 427

Query: 472 HFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEA 531
                E     LL  +   Y+ I+ S  ++YS DL +YLKYCK+VVFAG+       E+ 
Sbjct: 428 ASNSEEFGTRELLCAISVAYDKINTS--HDYSSDLRSYLKYCKEVVFAGIT---SSDEQT 482

Query: 532 IMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGS----HYKESEKIREKEF 587
            +IE FW  Y+  +E +  ED  K       +  S  D+ + +    HY++   ++   F
Sbjct: 483 HLIEIFWDMYK-AREDDNIEDHVK------SQSHSMADIPITNGVRKHYRDGAFVKASSF 535

Query: 588 TTRLSASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLD 647
               SAS      SG D                                   A++ LK +
Sbjct: 536 ----SAS------SGHD-----------------------------------AMQMLKSE 550

Query: 648 MENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHM 707
           M+++ F Y+ PR   K   YL+Y R+   G   + AHADYY+LLRA A +A++DIR MH+
Sbjct: 551 MQDHGFHYMPPRKPRKSDGYLRYRRRRLSGGFIFVAHADYYLLLRAFAKLAEIDIRIMHI 610

Query: 708 AV 709
            +
Sbjct: 611 RI 612


>gi|222631239|gb|EEE63371.1| hypothetical protein OsJ_18183 [Oryza sativa Japonica Group]
          Length = 808

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 175/438 (39%), Positives = 259/438 (59%), Gaps = 48/438 (10%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 100 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 159

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDFQPRAKYRDEPHTLHG--QRAV 243
            +A +IW R++A++ +  + W  + +  + S E      +  +  +D+   L G  Q++ 
Sbjct: 160 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVE------EGHSASKDKQSELEGDAQKSQ 213

Query: 244 ----MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 299
                ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K
Sbjct: 214 SSYEFLFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDK 273

Query: 300 RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLC 359
             G +   C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L 
Sbjct: 274 LLGSSLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELS 333

Query: 360 LPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKS 419
           LP+ KILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE  
Sbjct: 334 LPIEKILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES- 392

Query: 420 LSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELD 479
                  + T N  G  DPE +            +P S+  DSN     NS  F     D
Sbjct: 393 ------IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----D 423

Query: 480 AEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWK 539
           A  LL  L A Y+ I+  V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW 
Sbjct: 424 ARELLCTLAASYDKIN--VGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWD 478

Query: 540 FYQNEKESEAAEDSGKRC 557
            Y+  KE    +++ K C
Sbjct: 479 MYKG-KEVMLLDENAKLC 495


>gi|55168006|gb|AAV43874.1| unknown protein [Oryza sativa Japonica Group]
          Length = 1221

 Score =  308 bits (789), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 177/449 (39%), Positives = 261/449 (58%), Gaps = 47/449 (10%)

Query: 126 YTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLIC 185
           + D+  P+EP+DF +      + ED    VR +YV G+Q+++Q Q EALV++  V  L  
Sbjct: 163 FDDLGEPSEPRDFATGANAWGNPEDVAARVRWRYVRGLQVILQRQLEALVERHRVGSLAA 222

Query: 186 GVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDFQPRAKYRDEPHTLHG--QRAV 243
            +A +IW R++A++ +  + W  + +  + S E      +  +  +D+   L G  Q++ 
Sbjct: 223 SLAGTIWLRWVAASKVFDEMWVHKMLAIAASVE------EGHSASKDKQSELEGDAQKSQ 276

Query: 244 ----MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 299
                ++ + LR  +P+ S+LA+ FLACHV RE ILPTDI +W++EGK+PY AAF +++K
Sbjct: 277 SSYEFLFLRSLRMMLPVYSTLAVCFLACHVARETILPTDICRWAMEGKLPYVAAFTQVDK 336

Query: 300 RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLC 359
             G +   C LS   +FRP++ + + +LE+ A SIA+ IGL LP VNFY +A R+LK+L 
Sbjct: 337 LLGSSLNDCPLSSRQLFRPTRVIGAWQLEAAAGSIAQKIGLLLPSVNFYLIAQRFLKELS 396

Query: 360 LPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKS 419
           LP+ KILP A +I EW+MP +LWLS+N  R P+RVCVM+ILIV++R+LY ING G WE  
Sbjct: 397 LPIEKILPHACRIYEWAMPAELWLSSNPGRVPSRVCVMAILIVALRVLYGINGQGIWES- 455

Query: 420 LSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELD 479
                  + T N  G  DPE +            +P S+  DSN     NS  F     D
Sbjct: 456 ------IAQTENAVGS-DPEAS------------APHSIEPDSN-----NSEEF-----D 486

Query: 480 AEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWK 539
           A  LL  L A Y+ I+  V ++YSK++ +YLKYCKDVVF G+       EE  +I+ FW 
Sbjct: 487 ARELLCTLAASYDKIN--VGHDYSKEVHSYLKYCKDVVFTGMT---FSLEEEHLIDIFWD 541

Query: 540 FYQNEKESEAAEDSGKRCGIASKRKRSRD 568
            Y+ +++  A     K        KR RD
Sbjct: 542 MYKGKEDENAKLCQEKLRTTNGVNKRCRD 570


>gi|308080800|ref|NP_001183047.1| uncharacterized protein LOC100501385 [Zea mays]
 gi|238009008|gb|ACR35539.1| unknown [Zea mays]
          Length = 529

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 192/565 (33%), Positives = 281/565 (49%), Gaps = 89/565 (15%)

Query: 4   GEGDIRRLKCKKC---DNVGFECAYDGFYYCTHCNALDDEIVETGVAD-EDFMATGAGTG 59
           G G I  L C  C   DN   + A DG + C  C+A+     +   AD  DF ATG    
Sbjct: 14  GGGSIH-LVCDHCGTSDNYTTDDADDGQFTCRTCSAV--HTTQATAADPHDFPATG---- 66

Query: 60  TALYQASHARQVANSRPANHFVP-LSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHR 118
                      ++  R A    P L  ++   +P    P       T A           
Sbjct: 67  ----------NISVRRVATQPTPKLGARTP--APYLRTPQAAPVPATAA----------- 103

Query: 119 IKREEVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKF 178
                  + D    +EP+DF S      + ED   +VR +YV G+Q+++Q Q E LV++ 
Sbjct: 104 -------FDDFTELSEPRDFVSGSGTCGEPEDLAVQVRWRYVRGLQVILQRQLEVLVERH 156

Query: 179 NVCPLICGVAASIWFRFLASTGLLSQGWA-----DEAIVQSESQELESKDFQPRAKYRDE 233
            V   +C VA  +W R++A++ +    WA     D   V  +    + K  + + ++ D+
Sbjct: 157 QVGAFVCAVAGIVWVRWVAASRVFDGIWARQVLEDHKAVGRQKCYDDKKPDEMKYEWDDD 216

Query: 234 PHTLHGQRAVM--IWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYF 291
                  R  +   + + LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IPY 
Sbjct: 217 MSLQRKDRHKLEFAFLRSLRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIPYV 276

Query: 292 AAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALA 351
           AAF E+++  G +   C L    +FRP + + + +LE+ A SIA+ + L LP VNFYA+A
Sbjct: 277 AAFTEVDRFLGSSFQDCPLDARQLFRPVRVIGAWQLEAAAGSIAQRVALRLPSVNFYAIA 336

Query: 352 SRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNIN 411
            R LK L LP+ +ILP A +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNIN
Sbjct: 337 QRCLKDLLLPIDRILPHACQIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNIN 396

Query: 412 GFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSS 471
           G G WEK                             + E   +PS    D+N+ +S+   
Sbjct: 397 GQGIWEK-----------------------------ICEEGRNPSESHYDANSSTSRKLE 427

Query: 472 HFKESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEA 531
                E     LL  +   Y+ I+ S  ++YS DL +YLKYCK+VVFAG+       E+ 
Sbjct: 428 ASNSEEFGTRELLCAISVAYDKINTS--HDYSSDLRSYLKYCKEVVFAGIT---SSDEQT 482

Query: 532 IMIEKFWKFYQNEKESEAAEDSGKR 556
            +IE FW  Y+      A ED  +R
Sbjct: 483 HLIEIFWDMYK------AREDDNRR 501


>gi|413944895|gb|AFW77544.1| hypothetical protein ZEAMMB73_711148 [Zea mays]
          Length = 459

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 182/483 (37%), Positives = 262/483 (54%), Gaps = 90/483 (18%)

Query: 250 LRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACS 309
           LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IPY AAF E+++  G +   C 
Sbjct: 45  LRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIPYVAAFTEVDRFLGSSFQDCP 104

Query: 310 LSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRA 369
           L    +FRP + + + +LE+ A SIA+ + L LP VNFYA+A R LK L LP+ +ILP A
Sbjct: 105 LDARQLFRPVRVIGAWQLEAAAGSIAQRVALRLPSVNFYAIAQRCLKDLLLPIDRILPHA 164

Query: 370 LKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFSST 429
            +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNING G WEK           
Sbjct: 165 CQIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNINGQGIWEK----------- 213

Query: 430 SNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEA 489
                             + E   +PS    D+N+ +S+        E     LL  +  
Sbjct: 214 ------------------ICEEGRNPSESHYDANSSTSRKLEASNSEEFGTRELLCAISV 255

Query: 490 RYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMIEKFWKFYQNEKESEA 549
            Y+ I+ S  ++YS DL +YLKYCK+VVFAG+       E+  +IE FW  Y+  +E + 
Sbjct: 256 AYDKINTS--HDYSSDLRSYLKYCKEVVFAGIT---SSDEQTHLIEIFWDMYK-AREDDN 309

Query: 550 AEDSGKRCGIASKRKRSRDDLSLGS----HYKESEKIREKEFTTRLSASADYESFSGDDY 605
            ED  K       +  S  D+ + +    HY++   ++   F    SAS      SG D 
Sbjct: 310 IEDHVK------SQSHSMADIPITNGVRKHYRDGAFVKASSF----SAS------SGHD- 352

Query: 606 PQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAVKRLKLDMENNRFCYISPRVQIKRL 665
                                             A++ LK +M+++ F Y+ PR   K  
Sbjct: 353 ----------------------------------AMQMLKSEMQDHGFHYMPPRKPRKSD 378

Query: 666 DYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMHMAVLSFERRLAWLEKRIDH 725
            YL+Y R+   G   + AHADYY+LLRA A +A++DIR MH++VL  E+RLA +E+RI+ 
Sbjct: 379 GYLRYRRRRLSGGFIFVAHADYYLLLRAFAKLAEIDIRIMHISVLKLEKRLACIEERIER 438

Query: 726 CLH 728
            L+
Sbjct: 439 SLN 441


>gi|168008162|ref|XP_001756776.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692014|gb|EDQ78373.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 491

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 181/562 (32%), Positives = 282/562 (50%), Gaps = 92/562 (16%)

Query: 165 LMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDF 224
           +++Q+QCEALV  F V PL+CG+   +W +F+ ST +  +GWA+EA++ +E+Q  E+K+ 
Sbjct: 1   MLVQMQCEALVQSFGVTPLVCGILGPVWMKFVRSTCVYEKGWAEEALLVAEAQH-ENKEV 59

Query: 225 QPRAKYR--DEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKW 282
           +PR K R   EP T +G+R   +W + L+ +IPL SSLAI++L CH+ RE ILPTDI+ W
Sbjct: 60  KPRRKKRVVGEPWTTYGERTRFVWLRSLKARIPLRSSLAITYLVCHLAREPILPTDIINW 119

Query: 283 SIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHL 342
           + EG +PY +A+  IEKR            S +F+P +   ++ +E  A+ I + IGL L
Sbjct: 120 AFEGTLPYLSAYTAIEKRSNWIPELQLFKASKIFKPLRMTNARVVEYLASLIGDRIGLEL 179

Query: 343 PPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIV 402
           PP+NF+A++ R+LK+L LP+ K+ P   ++ EW     LWL++   + PTRV VM+ L+V
Sbjct: 180 PPINFHAISRRFLKELDLPVEKLAPYVCRLYEWFPSSGLWLTSQVSKVPTRVYVMAALVV 239

Query: 403 SIRILYNINGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDS 462
             +ILYN++G       LSS+ F                               +V GD 
Sbjct: 240 VFKILYNLDG----RYRLSSKDF------------------------------DAVSGDD 265

Query: 463 NAKSSKNSSHFKESELDAEALLYDLE-ARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGL 521
            +  +      KE + +AE  +  ++   +NDI+     ++SK L  YL+YC++ +FA  
Sbjct: 266 ESMDTLEVR--KEEQWNAEEFVTKIKLLLHNDINQ--VGDFSKQLTDYLRYCRETIFAND 321

Query: 522 EPPHDDPEEAIMIEKFWKFYQNEKESEAAEDSGKRCGIASKRKRSRDDLSLGSHYKESEK 581
              +D+     + + FW  Y         E S K+               LG+  KES+ 
Sbjct: 322 GMSNDED----LRDYFWGIY---------EQSAKQVT------------CLGTISKESQN 356

Query: 582 IREKEFTTRLSASADYESFSGDDYPQQSLNGDNSSKSSEEYPNSEAIDEASAETIIDSAV 641
             E +  T      D E              +NS     +         +  +T  D  +
Sbjct: 357 T-ESDIGTNGLLLLDSE--------------ENSFYKQSKPGKKSKKTSSKLQT--DPLL 399

Query: 642 KRLKLDMENNRFCYISPRVQIKRLD-YLQYVRKSDEGALTYAAHADYYILLRACATIAQV 700
           +     ME+  FC I P   + R + Y  YV   D+       H DY  +LRACA I  V
Sbjct: 400 R----GMEDLGFCIIDPPKHVPRENEYWTYVITRDD---MRPVHEDYLFVLRACAKIIDV 452

Query: 701 DIRSMHMAVLSFERRLAWLEKR 722
           D R +HMA+L+ E+ +  +EKR
Sbjct: 453 DPRVLHMAILNVEKGIIKVEKR 474


>gi|242034077|ref|XP_002464433.1| hypothetical protein SORBIDRAFT_01g018310 [Sorghum bicolor]
 gi|241918287|gb|EER91431.1| hypothetical protein SORBIDRAFT_01g018310 [Sorghum bicolor]
          Length = 468

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 149/427 (34%), Positives = 221/427 (51%), Gaps = 47/427 (11%)

Query: 4   GEGDIRRLKCKKC---DNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGT 60
           G G    L C  C   DN   + A DG + C  C+A+                       
Sbjct: 15  GGGGSIHLVCDHCGTSDNYNTDDADDGQFTCRTCSAV----------------------- 51

Query: 61  ALYQASHARQVANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIK 120
                 H  Q   + P  H+ P++   S    +  +P       T A Y           
Sbjct: 52  ------HTTQATAADP--HYFPVTGSISV-RRVATQPTPKLGARTPAPYPRTPQAAS--V 100

Query: 121 REEVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNV 180
               ++ D +  +EP+DF        + ED    VR +YV G+Q+++Q Q E LV++  V
Sbjct: 101 PAAAAFDDFMELSEPRDFAPGSGTWGEPEDLAVRVRWRYVRGLQVILQRQLEVLVERHQV 160

Query: 181 CPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQELE----SKDFQP--RAKYRDEP 234
              +C VA  +W R++A++G+    W  + +   ++   E    S+D +     KY  + 
Sbjct: 161 GATVCAVAGIVWVRWVAASGVFDGIWVHQVLEDHKAMGREKCSSSRDNKKPDEVKYESDD 220

Query: 235 HTL---HGQRAV-MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPY 290
             L     +R V   + + LR  +P+ S+LA+ FLACH+ REAILP DI +W++EG +PY
Sbjct: 221 DMLLQRKDRRKVEFAFLRSLRMMLPVYSTLAVCFLACHIAREAILPNDIYRWAMEGNVPY 280

Query: 291 FAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYAL 350
            AAF E+++  G +   C L    +FRP + + + +LE+   SIA+ +GL LP VNFYA+
Sbjct: 281 VAAFTEVDRFLGSSFQDCPLDARQLFRPVRVIGAWQLEAAVGSIAQKVGLRLPSVNFYAI 340

Query: 351 ASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNI 410
           A R LK L LP+ +ILP A +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNI
Sbjct: 341 AHRCLKDLLLPIDRILPHACRIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNI 400

Query: 411 NGFGAWE 417
           NG G WE
Sbjct: 401 NGQGIWE 407


>gi|52354247|gb|AAU44444.1| hypothetical protein AT2G02955 [Arabidopsis thaliana]
          Length = 293

 Score =  217 bits (553), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 117/304 (38%), Positives = 171/304 (56%), Gaps = 24/304 (7%)

Query: 13  CKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQASHARQ-- 70
           C +C+N  F+   DG+YYC  C    + +++TGV D D +  G GT  ALY   H R   
Sbjct: 3   CTECENDAFDEEDDGYYYCQRCGVQVENLIQTGVDDGDLIGEGGGTQGALYNPKHRRTEP 62

Query: 71  --VANSRPANHFVPLSQQSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKREEVSYTD 128
             +  S+P   F   + + S +   F+  N                +   +KR   SY D
Sbjct: 63  QPITPSQP--RFTDDTSRYSQFKSQFESENGNK------------ELPREVKRAPDSYVD 108

Query: 129 MVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVA 188
              PTEP DF +       YE+Y+ E R +YV    +MI  QC+ALVDKFNV PLI G+ 
Sbjct: 109 K-EPTEPVDFAAE---TLSYENYYDEARDRYVKAFLMMITYQCDALVDKFNVTPLIIGLV 164

Query: 189 ASIWFRFLASTGLLSQGWADEAIVQSE--SQELESKDFQPRAKYRDEPHTLHGQRAVMIW 246
             I  R++A +G+  + WA+ AI  SE  S++ E KD +   +++ EP  + G+RAV IW
Sbjct: 165 GPISLRYVALSGVYHKDWANNAIRDSEHQSEDGEVKDAKRLKRHKAEPRNIDGKRAVTIW 224

Query: 247 YKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSV 306
           + +L++ +PLSSSL ISFLACH     +LPTDIV+W+ EGK+PY + F++I ++ G+ S 
Sbjct: 225 FGILKKTMPLSSSLVISFLACHQAGAPVLPTDIVRWAREGKLPYLSCFLDIREQMGERSA 284

Query: 307 ACSL 310
           AC +
Sbjct: 285 ACPV 288


>gi|242090177|ref|XP_002440921.1| hypothetical protein SORBIDRAFT_09g016515 [Sorghum bicolor]
 gi|241946206|gb|EES19351.1| hypothetical protein SORBIDRAFT_09g016515 [Sorghum bicolor]
          Length = 241

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 94/169 (55%), Positives = 126/169 (74%)

Query: 250 LRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACS 309
           LR  +P+ S+LA+ FLACH+ REAILP+DI +W++EG IPY AAF E+++  G +   C 
Sbjct: 52  LRMMLPVYSTLAVCFLACHIAREAILPSDIYRWAMEGNIPYVAAFTEVDRFLGNSFQDCP 111

Query: 310 LSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRA 369
           L    +FRP + + + +LE+ A SIA+ +GL LP VNFYA+A R LK L LP+ +ILP A
Sbjct: 112 LDARQLFRPVRVIGAWQLEAAAGSIAQKVGLRLPSVNFYAIAHRCLKDLLLPIDRILPHA 171

Query: 370 LKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEK 418
            +I EW+MP +LWLS+N  R PTRVCVM+ILIV++R+LYNING G WEK
Sbjct: 172 CRIYEWAMPAELWLSSNPARVPTRVCVMAILIVALRLLYNINGQGIWEK 220


>gi|2191148|gb|AAB61035.1| A_IG002N01.28 gene product [Arabidopsis thaliana]
          Length = 246

 Score =  176 bits (445), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 118/277 (42%), Positives = 155/277 (55%), Gaps = 61/277 (22%)

Query: 232 DEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYF 291
           +EP  L+G+RAV IW   L + +PLSSSLA+SFLACH     ILPTDIV+W+ +GKIPY 
Sbjct: 25  EEPQNLYGKRAVTIWLSQLTKSLPLSSSLALSFLACHKAGTPILPTDIVRWARQGKIPYL 84

Query: 292 AAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALA 351
           +  ++I ++ G+ S AC +S S             LE+ AASIA+ IGL LPPVNFYA+A
Sbjct: 85  S--LKIREQMGERSAACPVSASI------------LEAQAASIADIIGLPLPPVNFYAIA 130

Query: 352 SRYLKQLCLPL-GKILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNI 410
           S YL +L +P   K+L     ++ WSM  DL+LS  E R PTRVCVMSI++V+IR+LYNI
Sbjct: 131 SNYLMRLSVPKEDKVLDLVRLLEYWSMRSDLYLSKKELRLPTRVCVMSIIVVAIRMLYNI 190

Query: 411 NGFGAWEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNS 470
           NGFG WE+SL   +    T                                         
Sbjct: 191 NGFGVWERSLEDAELSDVT----------------------------------------- 209

Query: 471 SHFKESELDAEALLYDLEARYNDIHDSV--TYEYSKD 505
              K +ELD E LL +LEA+Y ++      T  YS+D
Sbjct: 210 ---KATELDTEELLKNLEAKYYEVAAETVGTQLYSRD 243


>gi|357473559|ref|XP_003607064.1| hypothetical protein MTR_4g071850 [Medicago truncatula]
 gi|355508119|gb|AES89261.1| hypothetical protein MTR_4g071850 [Medicago truncatula]
          Length = 211

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 83/215 (38%), Positives = 116/215 (53%), Gaps = 29/215 (13%)

Query: 7   DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADEDFMATGAGTGTALYQAS 66
           D+    C+ C   G     DGFYYC+ C   + ++V+TG  +ED  A GAG    +Y AS
Sbjct: 3   DVVTFTCQSCTYEGEALESDGFYYCSACGEKNLDVVDTGAEEED--AIGAG----IYLAS 56

Query: 67  HARQVANSRPANHFVPLSQ----QSSFWSPLFDEPNTTTTTTTTANYNDLDSIIHRIKRE 122
           H R+ A    A +  P+SQ    QS+F   L  E ++                  ++K E
Sbjct: 57  HQRRTAAPTDAVYVQPISQCNPSQSNFLRKLGLEDDSQV----------------KVKAE 100

Query: 123 EVSYTDMVGPTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCP 182
            V  +    P+ P DFG S  V    E Y+ E+R++Y+MG+Q+MI+LQCEALV +F V P
Sbjct: 101 NVDQS-QCDPSNPADFGGSTVVS--IEQYYKEIRLRYIMGLQMMIELQCEALVKEFKVTP 157

Query: 183 LICGVAASIWFRFLASTGLLSQGWADEAIVQSESQ 217
           LICG+   IW RF++ TG+    WAD+AI  SE Q
Sbjct: 158 LICGLVGPIWLRFVSKTGVFDDDWADKAIHDSEMQ 192


>gi|302800339|ref|XP_002981927.1| hypothetical protein SELMODRAFT_115616 [Selaginella moellendorffii]
 gi|302802299|ref|XP_002982905.1| hypothetical protein SELMODRAFT_117213 [Selaginella moellendorffii]
 gi|300149495|gb|EFJ16150.1| hypothetical protein SELMODRAFT_117213 [Selaginella moellendorffii]
 gi|300150369|gb|EFJ17020.1| hypothetical protein SELMODRAFT_115616 [Selaginella moellendorffii]
          Length = 159

 Score =  117 bits (294), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 59/148 (39%), Positives = 93/148 (62%), Gaps = 7/148 (4%)

Query: 155 VRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQS 214
           +R  YV G+Q +IQLQCE+LV+ F V PLIC +   IW RF+AST +  + WA EAI+ +
Sbjct: 15  IREYYVQGLQTIIQLQCESLVENFGVSPLICRIVGPIWLRFVASTRVFEREWASEAILLA 74

Query: 215 ESQELESKDFQPRAKYR----DEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVV 270
           +++ L+ K  + + K        P  +   R   +W   L++++PL  +LA+ FLAC+V+
Sbjct: 75  DARSLKEKAKKLKLKKESSTLPAPEKIEMSR---VWTDSLKRRLPLRITLAMVFLACYVI 131

Query: 271 REAILPTDIVKWSIEGKIPYFAAFVEIE 298
           RE +L TDI  W+ +G +PY  AF+ ++
Sbjct: 132 REPVLATDISTWAGQGSLPYLTAFLSMQ 159


>gi|12322421|gb|AAG51232.1|AC035249_7 unknown protein; 59131-63280 [Arabidopsis thaliana]
          Length = 579

 Score =  100 bits (249), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 75/137 (54%), Gaps = 25/137 (18%)

Query: 132 PTEPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASI 191
           PT P DFG+       YEDY+ E R +Y     +MI  QC+ALVDKFNV PLI  +    
Sbjct: 47  PTNPVDFGAE---TLSYEDYYDETRDRYDKAFLMMITYQCDALVDKFNVTPLIIAIR--- 100

Query: 192 WFRFLASTGLLSQGWADEAIVQSESQELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLR 251
                            E+ +QSE  E+  KD +    ++ EP  L G+RAV IW+ +L+
Sbjct: 101 -----------------ESELQSEDGEV--KDTKRPKTHKAEPCNLDGKRAVTIWFSMLK 141

Query: 252 QKIPLSSSLAISFLACH 268
             +PLSSSL ISFLACH
Sbjct: 142 MTMPLSSSLVISFLACH 158


>gi|242090175|ref|XP_002440920.1| hypothetical protein SORBIDRAFT_09g016510 [Sorghum bicolor]
 gi|241946205|gb|EES19350.1| hypothetical protein SORBIDRAFT_09g016510 [Sorghum bicolor]
          Length = 159

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 42/87 (48%), Positives = 60/87 (68%)

Query: 641 VKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQV 700
           ++ LK +M+++ F Y+ PR   K   YL+Y RK   G   Y AHADYY+LLRA A +A++
Sbjct: 54  MQMLKSEMQDHGFHYMPPRKPRKSDGYLRYRRKRLSGGFVYVAHADYYMLLRAFAKLAEI 113

Query: 701 DIRSMHMAVLSFERRLAWLEKRIDHCL 727
           DIR MH++VL  ERRLA +E++I+  L
Sbjct: 114 DIRIMHISVLKLERRLACIEEQIERSL 140


>gi|242034079|ref|XP_002464434.1| hypothetical protein SORBIDRAFT_01g018320 [Sorghum bicolor]
 gi|241918288|gb|EER91432.1| hypothetical protein SORBIDRAFT_01g018320 [Sorghum bicolor]
          Length = 159

 Score = 86.3 bits (212), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 60/88 (68%)

Query: 640 AVKRLKLDMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQ 699
           A++ L+ +M+++ F Y+ PR   K   YL+Y RK       Y AHADYY+LLRA A +A+
Sbjct: 53  AMQMLRSEMQDHGFHYMPPRKPRKSDGYLRYRRKRLSSGFIYVAHADYYMLLRAFAKLAE 112

Query: 700 VDIRSMHMAVLSFERRLAWLEKRIDHCL 727
           +DIR MH++VL  ERRLA +E+RI+  L
Sbjct: 113 IDIRIMHISVLKVERRLACIEERIERSL 140


>gi|384246021|gb|EIE19513.1| hypothetical protein COCSUDRAFT_83564 [Coccomyxa subellipsoidea
           C-169]
          Length = 300

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 97/171 (56%), Gaps = 12/171 (7%)

Query: 244 MIWY-KLLR---QKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK 299
           M+WY + L+   Q +   + LA+SFL C  +REA+ P DI++W+ +G++P   +   +  
Sbjct: 1   MLWYCRCLQHALQALNPDAVLAVSFLGCWYLREAVTPMDILRWASDGRLPVL-SLPPLSA 59

Query: 300 RFGQTS--VACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQ 357
           +  Q++    C+L P+ +  P+  +  Q+L + A  I   + L LPPVN  AL  RY++ 
Sbjct: 60  KLLQSAQEADCTLPPA-LLHPTGVMGVQELVAMAGQIGSRLKLRLPPVNAGALIHRYVQD 118

Query: 358 LCLPLGKILPRALKIQE--WSMPPDLWLSTNECRFPTRVCVMSILIVSIRI 406
           L LP  ++ P AL++ +   S  P +WL  +    P    +M+IL+V++++
Sbjct: 119 LALP-EEMAPVALRLFDIYQSGSPQVWLQDDVFLHPY-AHLMAILLVTLKL 167


>gi|302853742|ref|XP_002958384.1| hypothetical protein VOLCADRAFT_99662 [Volvox carteri f.
           nagariensis]
 gi|300256264|gb|EFJ40534.1| hypothetical protein VOLCADRAFT_99662 [Volvox carteri f.
           nagariensis]
          Length = 1355

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 65/257 (25%), Positives = 112/257 (43%), Gaps = 45/257 (17%)

Query: 143 PVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLL 202
           PV A ++ Y  +V   Y+  +QL++Q Q +ALV++F   P    V  +IWF F+  TGLL
Sbjct: 104 PVMARFK-YTAQV---YLRALQLLVQAQADALVERFGGPPECRVVMRNIWFGFVPLTGLL 159

Query: 203 S-------------------QGWADEAIVQSESQELESKDFQPRAKYRD-EPHTLHGQRA 242
                                G A E ++ +ES      D   R+ +         G R 
Sbjct: 160 DLDPRVGGTTVHAYLDGSGYPGLAREGVL-AESHMGGGSDIFGRSAFGTLTSSNRPGMRR 218

Query: 243 VMIWYKL------------------LRQKIPLSSSLAISFLACHVVREAILPTDIVKWSI 284
           V +  K                   +R+ +P S++L ++ L C ++R  + P D+V+W++
Sbjct: 219 VKMTSKFHTTGASDIDWGAARMDGFIRRLMPPSATLVMTLLTCLLLRMPVTPVDVVRWAL 278

Query: 285 EGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPP 344
              +PY    +++  +    S      P+ +     +VP ++L   A  +A  +GL LPP
Sbjct: 279 NESLPYRD--LQMRGKPLLRSSPFKTFPARLLAAKGAVPPRRLLVAAVEMAGVLGLQLPP 336

Query: 345 VNFYALASRYLKQLCLP 361
           +N   L ++    L LP
Sbjct: 337 LNVEGLLAKMTDALGLP 353


>gi|297805422|ref|XP_002870595.1| hypothetical protein ARALYDRAFT_915988 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316431|gb|EFH46854.1| hypothetical protein ARALYDRAFT_915988 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 201

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 37/72 (51%), Positives = 50/72 (69%)

Query: 299 KRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQL 358
           ++ G  + AC +S S MFRPS+ V +Q LE+ AASIA+ IGL LPPVNF  +AS YLK+L
Sbjct: 15  EKMGDRAAACPVSVSVMFRPSQIVSAQSLEAQAASIADIIGLLLPPVNFCGIASNYLKRL 74

Query: 359 CLPLGKILPRAL 370
            +P  + L R +
Sbjct: 75  SIPRIRFLSRCV 86



 Score = 40.0 bits (92), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 475 ESELDAEALLYDLEARYNDIHDSVTYEYSKDLPTYLKYCKDVVFAGLEPPHDDPEEAIMI 534
           +++   E LL +LEA+Y ++    T+E  KDL  YL + KD +FAGL+    D    I +
Sbjct: 117 DTKATTEELLKNLEAKYYEVAVE-THECEKDLLPYLLHGKDEIFAGLDEASAD-ATYITV 174

Query: 535 EKFWKFYQNEK 545
           +  W  Y  E+
Sbjct: 175 DNLWNSYPKEE 185


>gi|145354102|ref|XP_001421333.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581570|gb|ABO99626.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 748

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 86/169 (50%), Gaps = 6/169 (3%)

Query: 248 KLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVA 307
           + ++ ++P+  SL + +LA    R AILP ++ K + EG +PY   +  + +  G+ +V 
Sbjct: 185 RFVKDRLPMVMSLGLLYLALARRRAAILPFELAKMAAEGSLPYLNVYDVVHEHCGENTV- 243

Query: 308 CSLSPS---FMFRPSKSVPS-QKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 363
             L PS   F+F   K +P+ Q++ + AA  A  I   LPP+N  A+ +RY+  L     
Sbjct: 244 -ELLPSDEIFLFEHPKRIPTPQRIVAAAAFAAAKIDAQLPPINAAAILTRYVGTLFNLDS 302

Query: 364 KILPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNING 412
           ++L  A +     + P L              +++ ++V+++ LY ++G
Sbjct: 303 QVLHAARRTLSVYLSPALRYGAKNVVGAPESALLAYVVVALKFLYGLDG 351


>gi|297801128|ref|XP_002868448.1| hypothetical protein ARALYDRAFT_915725 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314284|gb|EFH44707.1| hypothetical protein ARALYDRAFT_915725 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 206

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 69/151 (45%), Gaps = 23/151 (15%)

Query: 315 MFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQE 374
           MFRP + V +Q+LE+ A SIA+ IGL LPPVNFY  +S YLK L  P  K+L     IQ 
Sbjct: 1   MFRPIQIVSAQRLEAQATSIADIIGLPLPPVNFY--SSNYLKLLYNPNDKVLELLCLIQN 58

Query: 375 WSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGA-------WEKSLSSRKFFS 427
           WS+P  L+                   +  R   N+NG G        WE  LS  K   
Sbjct: 59  WSIPSGLYYRRTS--------------LVTRKRRNLNGLGLHPRGGRDWEDDLSLIKLSL 104

Query: 428 STSNIGGKFDPECNSKMRDEVEEISCSPSSV 458
           +   +    +P  +   + E E + C   S+
Sbjct: 105 NDDKLRNGNNPCVSHSRKSETESMDCDEPSL 135



 Score = 47.4 bits (111), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 59/127 (46%), Gaps = 25/127 (19%)

Query: 555 KRCGIASKRKRSRDDLSLGSHYKESEKIREKEFTTRLSASADYESFSGDDYPQQSLNGDN 614
           +R  + ++++R+ +   LG H +      +     +LS + D           +  NG+N
Sbjct: 68  RRTSLVTRKRRNLN--GLGLHPRGGRDWEDDLSLIKLSLNDD-----------KLRNGNN 114

Query: 615 SSKSSEEYPNSEAID--EASAETI----------IDSAVKRLKLDMENNRFCYISPRVQI 662
              S      +E++D  E S E +           + A+KRL  DM  N FCYI PRV++
Sbjct: 115 PCVSHSRKSETESMDCDEPSLECVSSPDDHKEKSKECAIKRLITDMGYNLFCYIPPRVKV 174

Query: 663 KRLDYLQ 669
           KR DYLQ
Sbjct: 175 KRQDYLQ 181


>gi|255085772|ref|XP_002505317.1| predicted protein [Micromonas sp. RCC299]
 gi|226520586|gb|ACO66575.1| predicted protein [Micromonas sp. RCC299]
          Length = 879

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 61/117 (52%)

Query: 244 MIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQ 303
           +   +L+ + +P    LA+++LAC + RE + P D  +W++EG +P+ A   ++ K   +
Sbjct: 265 LTLRRLVSRHLPRRLPLAVAYLACVLRREPLHPADFTRWALEGDLPFMAHAADVAKDLRR 324

Query: 304 TSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCL 360
                S  P+ +  P  +    K+ S A  +A ++ + LPP N   L +R+  +L L
Sbjct: 325 AGADPSPWPNALTTPRATPLPDKIASCAHDVACTLNIELPPCNALGLCARFASELGL 381


>gi|371780083|emb|CBZ39499.1| td2TF2 protein, partial [Triticum durum]
          Length = 101

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 30/63 (47%), Positives = 42/63 (66%)

Query: 647 DMENNRFCYISPRVQIKRLDYLQYVRKSDEGALTYAAHADYYILLRACATIAQVDIRSMH 706
           +ME + FCY+ PR   +   YL Y +K+  G L  A HADYY+L+R+ A +A+VDIR MH
Sbjct: 39  EMEYHGFCYMQPRKWPRSDGYLHYRKKTMTGHLVCAVHADYYLLIRSFAKLAEVDIRVMH 98

Query: 707 MAV 709
            +V
Sbjct: 99  ASV 101


>gi|260817882|ref|XP_002603814.1| hypothetical protein BRAFLDRAFT_86650 [Branchiostoma floridae]
 gi|229289137|gb|EEN59825.1| hypothetical protein BRAFLDRAFT_86650 [Branchiostoma floridae]
          Length = 628

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/178 (24%), Positives = 85/178 (47%), Gaps = 8/178 (4%)

Query: 252 QKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLS 311
           Q++ +  +L  S+L    ++E +LP+D+++W  EG IPY A    + K      + C+  
Sbjct: 240 QRMTMLRTLCFSYLGILWLQEPVLPSDMIRWVREGHIPYIAVKELLPKHMKLGIMDCN-- 297

Query: 312 PSFMFRPSKSVPS-QKLESFAASIAESIGL-HLPPVNFYALASRYLKQLCLPLGKILPRA 369
            +F     ++ PS Q +++    +A  I L  LP V+   +ASR++  L LP        
Sbjct: 298 -NFSV---QAFPSYQVIQNNTVRLARFIQLPQLPEVDISRVASRFILDLNLPAEMHGLVE 353

Query: 370 LKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAWEKSLSSRKFFS 427
             + +  + P LW +  +         M ++IV +++L+ ++       S ++RK  S
Sbjct: 354 NLLNQRPIDPCLWCNRRDAMLNHEAIAMGLVIVVLKLLFGLDDNSERTLSETARKLQS 411


>gi|449300811|gb|EMC96823.1| hypothetical protein BAUCODRAFT_34214 [Baudoinia compniacensis UAMH
           10762]
          Length = 473

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 79/167 (47%), Gaps = 22/167 (13%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 315
           LS  LA+ ++   ++RE +   D+  W   G++ Y+ A  E+     +      L P++ 
Sbjct: 107 LSDLLALCYVGTLLLREPVTVADLHGWVTHGELLYYRAAKEVPLDMRE-----RLPPTYQ 161

Query: 316 --FRPSKSVPSQKLESFAAS----IAESIGLHLPPVNFYALASRYLKQLCLPLGKI---- 365
             F P    P+QKL          + +S G+ +PP+N   +  R+++ L LP+       
Sbjct: 162 DQFEPHHVGPAQKLHQNVVDTLKVLDDSFGMAMPPINHPLILYRWVRDLSLPIEIFAATT 221

Query: 366 -LPRALKIQ---EWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILY 408
            L RAL++       +PP       E RFP  + +M++L+ + ++LY
Sbjct: 222 NLARALELDFAYRLDVPPA--TKGVELRFP-ELRLMTLLVAATKLLY 265


>gi|430812269|emb|CCJ30297.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 310

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 79/159 (49%), Gaps = 16/159 (10%)

Query: 216 SQELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIP-LSSSLAISFLACHVVREAI 274
           S +L  K+ +   KY+++  ++ G    +I YK   +K+P L  ++ I +L    +R +I
Sbjct: 127 SNKLFYKNAKNINKYQNK--SIIGNPNNIIPYK--TKKLPKLIDTIGICYLGLLNIRSSI 182

Query: 275 LPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRPSKSV-PS-QKLESFAA 332
             +D+      GKIPY  A   I      T V  S+ P +    S ++ PS +K+ +   
Sbjct: 183 TISDLCNLMKSGKIPYMKALFIIP-----TQVRKSMEPQYQIALSPTIFPSYEKVLNVTY 237

Query: 333 SIA----ESIGLHLPPVNFYALASRYLKQLCLPLGKILP 367
           SI     +S  +  PP+N Y +   Y++ LCLPL  ILP
Sbjct: 238 SIVSAFYDSYKIIFPPLNVYPILVSYIQNLCLPLELILP 276


>gi|296412649|ref|XP_002836035.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295629835|emb|CAZ80192.1| unnamed protein product [Tuber melanosporum]
          Length = 635

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 75/182 (41%), Gaps = 22/182 (12%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQT----------- 304
           L  S+ I +L   V+  A+   DI +W  E  I Y  A  EI     Q            
Sbjct: 206 LVGSVGICYLGLIVLGVAVSLGDIHRWVEEQGIVYLRAINEIPSEMKQRLDAEYYVALDP 265

Query: 305 SVACSLSPSFM-FRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG 363
            V C L  S    RP+  +    ++        + G+  PP+N   L   ++K L LPL 
Sbjct: 266 RVCCLLPLSLTGVRPTLGLLHSTVQGLVIMYQVTFGMAFPPLNITLLVFGFMKALGLPL- 324

Query: 364 KILPRALKIQEWSMPPDLWLSTNECRFPTRVC------VMSILIVSIRILYNINGFGAWE 417
           +I     ++      P +W   N  R+  RV       +M++LI+S ++LY ++G     
Sbjct: 325 EIYYGFKRLANLISLPFMW---NGLRYRGRVTNWPEARLMALLIISTKLLYGLDGVKRTP 381

Query: 418 KS 419
           KS
Sbjct: 382 KS 383


>gi|325184330|emb|CCA18821.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 548

 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 16/123 (13%)

Query: 246 WYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTS 305
           W KL  + + LS+ L I +L+  ++   +LP+D   W   GK+PY     +  K+     
Sbjct: 215 WSKL--EHLSLSTLLGILYLSSRILHLGVLPSDFSHWITNGKLPYHNLLAKCPKK----- 267

Query: 306 VACSLSPSFMFRPSKSVPSQ-----KLESFAASIAESIGLHLPPVNFYALASRYLKQLCL 360
           +AC L  + +F  S    SQ     K+  F   +   + LH+PP+N    AS     +C+
Sbjct: 268 LACRLDDAALFFNSNVWYSQTFNAAKISFFTNYLQYHLELHIPPLN----ASLAAHTICV 323

Query: 361 PLG 363
            LG
Sbjct: 324 NLG 326


>gi|429849272|gb|ELA24675.1| ubiquitin ribosomal fusion protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 674

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 64/296 (21%), Positives = 121/296 (40%), Gaps = 32/296 (10%)

Query: 136 QDFGSSGPVKADYEDYHFEVRMK---------YVMGMQLMIQLQCEALVDKFNVCPLICG 186
           +DFG  G V    +     VR           Y+  +QL+++ Q   LVD       +  
Sbjct: 51  EDFGQKGKVSRKEKQIKERVRRVLTGPPARDLYLECLQLILREQVMWLVDVKGHKAELET 110

Query: 187 VAASIW-FRFLASTGLLSQGWADEAIVQSESQELE-SKDFQPRAKYRDEPHTLHGQRAVM 244
           V   +W  R   +   +    A EA +   S +++ S+D QP  +           +   
Sbjct: 111 VVRDLWDLRIRGAASAVENDSASEAELSLFSSQIDLSQDEQPSNR----------TQGAQ 160

Query: 245 IWYKLLRQKIP---LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRF 301
            W      + P   L  +LA+ FL C ++R      D+ +W+  G IPY  ++ ++ +  
Sbjct: 161 NWTPESGSEWPAPRLMDTLALCFLGCMLLRMPTRTGDLARWARAGNIPYKHSYRKLPQEM 220

Query: 302 GQTSVA---CSLSPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQL 358
                A     L  + + +         + S A S   + G+  PP+N   +  +Y+++L
Sbjct: 221 RDRLPAYYSTVLKAAHLAKFDHGELHAAVLSLALSFNSNYGIVFPPLNDVLMTLQYVREL 280

Query: 359 CLPLGKI-LPRALKIQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGF 413
            LP+  I + R + +   ++P +  L  +  R    +    IL+V++ +L  I+ F
Sbjct: 281 GLPIETISVARRIPLI-VNLPYEFPLGKSRLRL---IHQPEILLVAVIVLTTIHCF 332


>gi|307103437|gb|EFN51697.1| hypothetical protein CHLNCDRAFT_54966 [Chlorella variabilis]
          Length = 545

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 73/157 (46%), Gaps = 25/157 (15%)

Query: 267 CHVVREAILPTDIVKWSIEGKIPY--FAA-----FVEIEKRFGQTSVACSLSPSFMFRPS 319
           C    EA  P D+++W+++G++PY  FAA       +     G   +A            
Sbjct: 202 CWCQLEAAGPLDVLRWALDGRLPYVAFAAEEGAVLQQYRNILGGELIAV----------- 250

Query: 320 KSVPSQK-LESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMP 378
           + VPS K L   A+++ + +GL  PP++     +RYL  L L   ++LP AL++     P
Sbjct: 251 RGVPSPKALLLNASTMGKRLGLACPPLSPALWLARYLADLELS-QELLPVALQLHALYQP 309

Query: 379 PDLWLSTNECRFPTR---VCVMSILIVSIRILYNING 412
               L   E   P R     +M+ L+V+ ++ Y + G
Sbjct: 310 AP--LVPTEGERPGRHPWALLMASLVVAAKLCYGVGG 344


>gi|303282013|ref|XP_003060298.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457769|gb|EEH55067.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 810

 Score = 48.5 bits (114), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 11/104 (10%)

Query: 259 SLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFRP 318
           +LAI++LAC  +++ +   D+V+W+++G +PY AA   +    G             F+ 
Sbjct: 238 TLAIAYLACAWLKQPVHVGDLVRWALDGDMPYLAASARVHASLGGDD-------ELPFKG 290

Query: 319 ---SKSVPSQKL-ESFAASIAESIGLHLPPVNFYALASRYLKQL 358
              ++ VP   L  + A  ++   G+ LPP N     +R+  ++
Sbjct: 291 ALVARCVPQPHLIATCAMHVSSVAGIALPPANVAGFVARFASEV 334


>gi|213406467|ref|XP_002174005.1| RNA polymerase I-specific transcription initiation factor rrn7
           [Schizosaccharomyces japonicus yFS275]
 gi|212002052|gb|EEB07712.1| RNA polymerase I-specific transcription initiation factor rrn7
           [Schizosaccharomyces japonicus yFS275]
          Length = 532

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 96/222 (43%), Gaps = 32/222 (14%)

Query: 151 YHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLAS---TGLLSQGWA 207
           Y  E ++ ++  MQ ++QLQ  ALV++  V      V  S+W   +++   +    Q  A
Sbjct: 78  YGTEGKVLFLKVMQHILQLQTHALVNELGVDQRCEAVIRSLWAVIVSNVFKSSAFHQNLA 137

Query: 208 ---DEAIVQSESQELES--KDFQ--PRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSL 260
              +E + Q  + E +S  +DF+  P ++ +DE          M W KL+        SL
Sbjct: 138 EAINETVQQQPNSEEDSDLEDFEHLPSSEQQDEVKV----DKEMEWPKLIH-------SL 186

Query: 261 AISFLACHVVREAILPTDIVKWSIEGKIPYFAAF------VEIEKRFGQTSVACSLSPSF 314
           A+  + C ++R  +  +DI  W+   KIPY  A+      + I    G       ++P  
Sbjct: 187 ALIQIGCLLLRLPVTTSDIQYWTSTNKIPYMQAYNLLPNTLRIRLHVGYVR---QITPK- 242

Query: 315 MFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLK 356
             +PS     + +       +E I L +P +N   +   YLK
Sbjct: 243 -VQPSTERIHESVGLMIRFFSEKISLLIPSLNTPLILQSYLK 283


>gi|453086938|gb|EMF14979.1| hypothetical protein SEPMUDRAFT_79174 [Mycosphaerella populorum
           SO2202]
          Length = 568

 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 47/230 (20%), Positives = 104/230 (45%), Gaps = 27/230 (11%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI-----EKRFGQTSVACSL 310
           L+  L++ ++   ++R  I   DI KW  +G + Y+ A  E+     ++  GQ  +  SL
Sbjct: 177 LTDLLSLIYVGILLLRIPITTADIYKWVNDGDLLYYRAAREVPLSMRDRLPGQ--LQESL 234

Query: 311 SPSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRAL 370
            P  + R    +  + +     S+   +G+  PP+N   +  R++K + LPL ++   A 
Sbjct: 235 EPQTLLR--MEILQRNVLDLLTSLDSDVGMQAPPLNHPLVLYRWVKTMHLPL-EVYVTAQ 291

Query: 371 KIQEWSMPPDLWLSTN---ECRFPTR---VCVMSILIVSIRILYNINGFGAWEKSLSSRK 424
           ++    +  D   + +    CR   R   V +M +++++I++L+  +          +++
Sbjct: 292 RLGGL-LGLDFQYNVDAKERCRISLRCPEVRLMVMIVIAIKMLFPFD---------HAKR 341

Query: 425 FFSSTSNIGG-KFDPECNSKMRDEVEEISCSPSSVIGDSNAKSSKNSSHF 473
           + +S S++   K D +   +++D    +       +G +   S   SSH 
Sbjct: 342 YPTSASDMTALKMDWDAWQRIQDGRNVVHDGNRENVGSNRLASQDESSHL 391


>gi|121702971|ref|XP_001269750.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
 gi|119397893|gb|EAW08324.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
          Length = 518

 Score = 47.8 bits (112), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 76/404 (18%), Positives = 153/404 (37%), Gaps = 71/404 (17%)

Query: 134 EPQDFGSSGPVKADYEDYHFEVRMKYVMGMQLMIQLQCEALVDK--FNV-CPLICGVA-- 188
           +P+DFG+ G             R+K  +  QL++  QC ALV    F V   + C  +  
Sbjct: 37  DPEDFGTQGKTS----------RVKKAI-YQLILWKQCHALVQSRGFPVQLEVPCNYSIP 85

Query: 189 ---ASIWFRFLASTGLLSQGWA-------DEAIVQSES-QELESKDFQPRAKYRDEPHTL 237
                 W+    +  ++   WA       D   V +ES +E E    QP A +  +    
Sbjct: 86  DCPCRYWYLTPDAQSIVRDLWALRLESLSDRLNVSAESDREPELFSSQPAASFDKQEDAY 145

Query: 238 HGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI 297
                   W +L+        ++ + +L C ++R  +   ++ +  +   +P+      I
Sbjct: 146 RPSGRATQWPRLI-------DTIGLCYLGCLLMRLPVTVGELHRMIMREDVPFIRVLRSI 198

Query: 298 EKRFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAE----SIGLHLPPVNFYALASR 353
            +          LS   +   ++ + ++ L      ++       G+ LPP+N  AL  R
Sbjct: 199 PREMRDKLPQEYLS---LLETTRLIKAEHLHKATLELSLLYRIKFGVQLPPLNTPALLYR 255

Query: 354 YLKQLCLPLGKILPRALKIQ-----EWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILY 408
           Y+++L LP+ +I P   +IQ     ++  P  +         P  V +++++++S ++ +
Sbjct: 256 YIRRLALPV-EIYPAVKRIQSLLGFKFEFPASILGKRRPLHLP-EVQIITLVVISTKLFF 313

Query: 409 NINGFGA--------------WEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCS 454
             +                  WE     ++ F S   +GG+           +  EI  +
Sbjct: 314 PFDDIKRYPVSVREPSAQVLDWELWSQVQRHFDSRETVGGRI---------GKGNEILIT 364

Query: 455 PSSVIGDSNAKSSKNSSHFKESELDAEALLYDLEARYNDIHDSV 498
              V   S A+  +    ++ S LD   +  D EA +  +   +
Sbjct: 365 EKDVFNMSPAQLDEYMDWYENSWLDHSKMEDDHEAMHTMLQTVI 408


>gi|407920931|gb|EKG14108.1| Ubiquitin [Macrophomina phaseolina MS6]
          Length = 583

 Score = 47.8 bits (112), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 78/171 (45%), Gaps = 17/171 (9%)

Query: 253 KIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSP 312
           ++ L  ++A+ +    +++      ++  W   G +PY++A +++      + +   L  
Sbjct: 144 RVKLVDTVALCYFGMLILQLPAGLRELYDWVNNGDMPYYSALLDMP-----SDLLHQLPA 198

Query: 313 SFMFR--PSKSVPSQKLESFAASIAE----SIGLHLPPVNFYALASRYLKQLCLPLG--K 364
            +  R  P + +  ++L+S    +A      +G+  PP+N+  LA +Y+K L LPL    
Sbjct: 199 PYRNRLEPHRRLRPERLQSAVLDLASWYHLDVGMVFPPLNYVLLARKYIKDLSLPLEVYH 258

Query: 365 ILPRALKIQEWSMP-PDLWLSTNE--CRFPTRVCVMSILIVSIRILYNING 412
           +L   L I   +   PD      +  C  P      SI I++ ++LY  NG
Sbjct: 259 MLECILHIAGCTFAFPDFTAVGQDPICELPEAQLAASI-IIATKMLYPFNG 308


>gi|254578868|ref|XP_002495420.1| ZYRO0B10934p [Zygosaccharomyces rouxii]
 gi|238938310|emb|CAR26487.1| ZYRO0B10934p [Zygosaccharomyces rouxii]
          Length = 534

 Score = 46.2 bits (108), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 65/140 (46%), Gaps = 13/140 (9%)

Query: 154 EVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQ 213
           E R+ ++  +Q  ++ Q   L+++  +      +   IW + L S  L +    DE   Q
Sbjct: 93  EARLLFIRSLQFTLKRQSRWLIEEQKLPQEFDKLVKIIWMKLLKS--LENDDNNDEFGEQ 150

Query: 214 SESQELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREA 273
            + Q  +S + Q R       H L  +R     Y+L      L S+LAI +++   +   
Sbjct: 151 EDGQTFDSNEDQDRF------HELARRRKDKNRYRL-----SLVSTLAILYMSSVQLGIP 199

Query: 274 ILPTDIVKWSIEGKIPYFAA 293
           I   D++KW++ G++PYF +
Sbjct: 200 IYTCDLIKWTMSGRLPYFKS 219


>gi|297814594|ref|XP_002875180.1| hypothetical protein ARALYDRAFT_904560 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297321018|gb|EFH51439.1| hypothetical protein ARALYDRAFT_904560 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 150

 Score = 45.8 bits (107), Expect = 0.093,   Method: Composition-based stats.
 Identities = 20/44 (45%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 7  DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADED 50
          D RR+ C +C+N  FE   DGF+YC  C+   + I++T V DED
Sbjct: 2  DTRRIICNECENDAFELE-DGFFYCQRCSVRVEGIIQTVVIDED 44


>gi|317144339|ref|XP_001820052.2| RNA polymerase I specific transcription initiation factor Rrn7
           [Aspergillus oryzae RIB40]
          Length = 543

 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 67/357 (18%), Positives = 141/357 (39%), Gaps = 53/357 (14%)

Query: 134 EPQDFGSSGP---VKADYEDYH---FEVRMKYVMGMQ---LMIQLQCEALVDKFNVCPLI 184
           EP DFG+ G    VK +  +     +  R  Y + +Q   L++  QC ALV      P +
Sbjct: 42  EPDDFGTQGKTNRVKKEVTEKGAKTYRGRQAYRLFLQVYQLILWKQCHALVQSRGFPPQL 101

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDFQPRAKYRDEPHTLHGQRAVM 244
             V   +W        L  + + ++    S+  E+E    QP A  ++EP  +      +
Sbjct: 102 EHVVRDLW-------ALRLETYLNKIQDSSDGGEIEFFSSQPTAD-QEEPDIVKFGGKHL 153

Query: 245 IWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQT 304
            W +L+        S+ + +L   ++R  +   D  +  + G IPY      I +     
Sbjct: 154 QWPRLV-------DSVGLCYLGALLMRLPVGIGDFHRMIMYGDIPYIRITRSIPREMRDK 206

Query: 305 SVACSLSPSFMFRPSKSVPSQKLESFAASIA----ESIGLHLPPVNFYALASRYLKQLCL 360
                LS   +   ++ + ++ L      ++       G+H P +N   +   Y+++L L
Sbjct: 207 LPQEYLS---IMETTRLLKAEHLHKAVLQLSLLYRHKFGVHFPALNLPPILYHYIRRLAL 263

Query: 361 PLGKILPRALKIQE-----WSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGA 415
           P+  I     ++Q+     +  P  L   +     P  + +++++++S ++L+  +    
Sbjct: 264 PV-DIYSAVKRLQDLLGFSFEFPKGLASRSRPHDLP-EIQLVTLIVISTKLLFPFDDLKR 321

Query: 416 --------------WEKSLSSRKFFSSTSNIGGKFDPECNSKMRDEVEEISCSPSSV 458
                         W+    ++K F S     G+   + N  + +E +  + +PS +
Sbjct: 322 YPASAKEPATQVIDWKHWAQAQKQFDSRETARGRIG-KGNEILVNEHDVFNMTPSQL 377


>gi|443701088|gb|ELT99720.1| hypothetical protein CAPTEDRAFT_184938 [Capitella teleta]
          Length = 880

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 37/162 (22%), Positives = 73/162 (45%), Gaps = 13/162 (8%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 315
           LS +L    L        ++P D+V+W+  GK+P  +A   +             +   M
Sbjct: 409 LSCTLGFCMLGLKYTNPLVMPGDLVRWAKLGKVPLQSAIKSLPDDMKLIG-----NDRQM 463

Query: 316 FRPSKSVPSQKLESFAASIAESIG-LHLPPVNFYALASRYLKQLCLPLGKILPRALKIQE 374
           F  ++   + +L      ++ S+G L   PV+   LA RY+++L LP G++      + E
Sbjct: 464 FSWARVPSTDELREQCWRLSTSLGILDFAPVDIKLLADRYIEELRLP-GELHALVRHLIE 522

Query: 375 WSMPPDLWLSTNECR-----FPTRVCVMSILIVSIRILYNIN 411
            S P D  ++  + R     F      M+ +++ +++L+ +N
Sbjct: 523 -SNPIDTHMTQADSRQFNSGFEVSRIAMTYVVIMLKLLFCLN 563


>gi|396487145|ref|XP_003842569.1| hypothetical protein LEMA_P083290.1 [Leptosphaeria maculans JN3]
 gi|312219146|emb|CBX99090.1| hypothetical protein LEMA_P083290.1 [Leptosphaeria maculans JN3]
          Length = 614

 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 122/278 (43%), Gaps = 29/278 (10%)

Query: 152 HFEVRMK---YVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWAD 208
           HF  R     Y+  +QL+++ Q   LV + ++   +  V   +W   +A           
Sbjct: 137 HFSGRQAFDLYLKSLQLILRHQVWFLVHEKSLPAELEIVIYDLWALRIAQLADKMTNTNL 196

Query: 209 EAIVQSESQELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQK----IP-LSSSLAIS 263
           ++ +QS++Q   + +        D+  T   +R ++  ++  R++    IP L+ +LA+ 
Sbjct: 197 DSDLQSQAQIFSTLE-------TDDSDTADEKRGLLKNFEHRRERKLASIPNLTDALALC 249

Query: 264 FLACHVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLSPSFMFRPS 319
           +L    +R  I P D   W  +GK+ Y  A     + +  R   T  A  L+   +F+  
Sbjct: 250 YLGIRTLRLPITPGDFYAWVTDGKMAYRRAIRLVPLAMRDRLPATFHAV-LNRQALFKHK 308

Query: 320 KSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRALKIQEWSMPP 379
           +   +  +      +++  G+  P +N   L  RYL+++ LPL ++     ++ E  +  
Sbjct: 309 QFYDT--VVDLQIGLSKDHGIVWPALNVPMLLYRYLREMALPL-ELYDATTRLAEL-LGY 364

Query: 380 DLWLSTN-----ECRFPTRVCVMSILIVSIRILYNING 412
           D  L  N       R      ++  LIVS+++LY ++G
Sbjct: 365 DFALHPNGKKRLGIRHLPEAQLVGCLIVSVKLLYPLDG 402


>gi|225678065|gb|EEH16349.1| predicted protein [Paracoccidioides brasiliensis Pb03]
          Length = 664

 Score = 44.7 bits (104), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 78/175 (44%), Gaps = 27/175 (15%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 315
           L  SLA+ +L   ++R  +    + +W +  +IP   A      RF    +   L  ++ 
Sbjct: 201 LIDSLALCYLGVLLLRVPLGLGRVERWILNDEIPLIRAI-----RFIPQDMKDRLPAAYH 255

Query: 316 FR-PSKSVPS-----QKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLGKILPRA 369
           F   +K+VP+     + +    A   +  G+ +PP+N   +   Y+KQL LPL    P  
Sbjct: 256 FALDTKNVPTGYQLHRAIADLGAFYHKEFGVEIPPLNAPLMLLTYIKQLSLPLEPFPPVL 315

Query: 370 LKIQE-------WSMPPDLWLSTNECRFPTR---------VCVMSILIVSIRILY 408
            ++Q+       +S P     +T+  R   R         V ++S+LI+++++ Y
Sbjct: 316 NRLQQLAGYSFSYSSPISYSTTTSARRKCKRKFHRNHLPEVQLVSLLIIAVKLFY 370


>gi|451848996|gb|EMD62300.1| hypothetical protein COCSADRAFT_38255 [Cochliobolus sativus ND90Pr]
          Length = 535

 Score = 44.7 bits (104), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 86/208 (41%), Gaps = 9/208 (4%)

Query: 159 YVMGMQLMIQLQCEALVDKFNVCPLICGVAASIW-FRFLASTGLLSQGWADEAIVQSESQ 217
           Y+  +QL+++ Q   LV    +   +  V   +W  R       ++      A  Q +SQ
Sbjct: 80  YLKCLQLILRHQIWFLVQDKGLPAELETVVYDLWALRIAQLESRIASNENPNADSQRQSQ 139

Query: 218 ELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPT 277
              + D +      ++ H +HG++      K+L     L+  L + +L    +R  + P 
Sbjct: 140 TFSTIDSEDDHTDTEKGH-VHGKKRKR--GKVLSTAPNLNDCLLLCYLGIKTLRLPVTPG 196

Query: 278 DIVKWSIEGKIPYFAA--FVEIEKRFGQTSV-ACSLSPSFMFRPSKSVPSQKLESFAASI 334
           DI     + K+PY+ A   V +  R     V    L PS  F  ++    + L     S 
Sbjct: 197 DIHALVTDNKLPYWRAIKLVPLSMRDRLLPVYRAVLDPSAPFTYARFY--RLLTDLQISY 254

Query: 335 AESIGLHLPPVNFYALASRYLKQLCLPL 362
               G+  PP+N   L  RYLK+L LPL
Sbjct: 255 GADHGIIWPPLNVPLLLFRYLKELALPL 282


>gi|378733857|gb|EHY60316.1| hypothetical protein HMPREF1120_08282 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 723

 Score = 44.3 bits (103), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 54/276 (19%), Positives = 114/276 (41%), Gaps = 46/276 (16%)

Query: 159 YVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLL-------SQGWADEA- 210
           ++   QL+++ QC  LV +  +   +  +   +W  +L+  G         S    D A 
Sbjct: 75  FLQAWQLILRKQCYILVHQKGLPAELWTIVRDLWILWLSQLGHRLHDPAGGSHPPTDSAS 134

Query: 211 ---IVQSESQELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLAC 267
               + + + E +S     R + R      HG   V               ++ +++L  
Sbjct: 135 EADAITTSAHETDSGGEAARERKRRGEAASHGPILV--------------DTVVLNYLGI 180

Query: 268 HVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRF-GQTSVACSLSPSFMFRPS--K 320
            ++R  I    I+ W  +  IP+  A      E++ R  G+  +A  L   +M  P   +
Sbjct: 181 LLLRRPIGLATILNWIQKEHIPFIRAIRHVPAEMKDRLPGEYHIA--LDTIWMLEPDGLQ 238

Query: 321 SVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLP-----LGKILPRALKIQEW 375
           +   ++++ F+ S     G+ +PP+N   +   Y++ L LP     + + L  A+   ++
Sbjct: 239 AAVYERIKMFSLSF----GMTMPPLNHNLVLLHYVRSLALPIEVYSISQRLNAAITKYQF 294

Query: 376 SMPPDLWLSTNECRFPT---RVCVMSILIVSIRILY 408
             P    L +   R PT      +MS+++++ ++L+
Sbjct: 295 RCPDKTALKSMTRRQPTTYPEAQLMSLVVIATKLLF 330


>gi|198433254|ref|XP_002121631.1| PREDICTED: similar to TATA box binding protein (Tbp)-associated
           factor, RNA polymerase I, B [Ciona intestinalis]
          Length = 637

 Score = 43.9 bits (102), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 34/165 (20%), Positives = 77/165 (46%), Gaps = 18/165 (10%)

Query: 260 LAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKR-----FGQTSVACSLSPSF 314
           L+I ++A    +E I P D++ W  +G +PY  A + +E       +   ++  ++ PS+
Sbjct: 264 LSILYVALQWCKELITPGDLLNWVKQGYLPYIDALLCLEPHMKISDYDGANLQRNIIPSY 323

Query: 315 MFRPSKSVPSQKLESFAASIAESIGL-HLPPVNFYALASRYLKQLCLPLGKI--LPRALK 371
                     ++    +  + E + +  LP V+  +L +R++ Q+ LPL  I  +   ++
Sbjct: 324 ----------KQFVKVSRLLEEYVEMPPLPDVDISSLMARFVIQVNLPLEIISFINNLIR 373

Query: 372 IQEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNINGFGAW 416
           I  +        +T    +   +   ++++V++++LY IN    W
Sbjct: 374 INNFKYTRANVRTTALTGYAPDLISCALIVVALKLLYGINDVTEW 418


>gi|451993473|gb|EMD85946.1| hypothetical protein COCHEDRAFT_1186964 [Cochliobolus
           heterostrophus C5]
          Length = 468

 Score = 43.5 bits (101), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 85/208 (40%), Gaps = 9/208 (4%)

Query: 159 YVMGMQLMIQLQCEALVDKFNVCPLICGVAASIW-FRFLASTGLLSQGWADEAIVQSESQ 217
           Y+  +QL+++ Q   LV    +   +  V   +W  R       ++      A  Q +SQ
Sbjct: 80  YLKCLQLILRHQIWFLVQHKGLPAELETVVYDLWALRIAQLENRIASNENSNADSQPQSQ 139

Query: 218 ELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPT 277
              + + +      ++ H +HG +      K+L     L+  L + +L    +R  I P 
Sbjct: 140 TFSTLESEDDVTDTEKSH-IHGVKRKR--GKVLSTAPNLNDCLLLCYLGIKTLRLPITPG 196

Query: 278 DIVKWSIEGKIPYFAA--FVEIEKRFGQTSV-ACSLSPSFMFRPSKSVPSQKLESFAASI 334
           DI  W  + K+PY+ A   V +  R     V    L PS  F  ++    + L     S 
Sbjct: 197 DIHAWVTDNKLPYWRAIKLVPLPMRDRLLPVYRAVLDPSAPFTYTRFY--RLLTDLQISY 254

Query: 335 AESIGLHLPPVNFYALASRYLKQLCLPL 362
               G+  P +N   L  RYLK+L LPL
Sbjct: 255 DADHGIIWPSLNVPLLLFRYLKELALPL 282


>gi|169605689|ref|XP_001796265.1| hypothetical protein SNOG_05869 [Phaeosphaeria nodorum SN15]
 gi|111065813|gb|EAT86933.1| hypothetical protein SNOG_05869 [Phaeosphaeria nodorum SN15]
          Length = 523

 Score = 43.5 bits (101), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 47/111 (42%), Gaps = 7/111 (6%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLS 311
           L   LA+ +L    +R  I P D+  W  +GK+ Y  A     + +  R   T  A  L 
Sbjct: 159 LYDCLALCYLGMSTLRLLITPGDVHAWVTDGKLAYRRAIKLIPLGMRDRLPSTYHAV-LD 217

Query: 312 PSFMFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 362
           P  M   +  +    L +   S  +   +  P +N   L  RYLKQL LPL
Sbjct: 218 PQTML--NYRIFYTTLTNLEISFEKDHEMQWPALNVQPLLFRYLKQLALPL 266


>gi|330917748|ref|XP_003297944.1| hypothetical protein PTT_08505 [Pyrenophora teres f. teres 0-1]
 gi|311329140|gb|EFQ93979.1| hypothetical protein PTT_08505 [Pyrenophora teres f. teres 0-1]
          Length = 417

 Score = 43.5 bits (101), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 70/160 (43%), Gaps = 12/160 (7%)

Query: 260 LAISFLACHVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLSPSFM 315
           +A+ +L    +R  + P DI +W  +  +PY  A     + + KR     +A  L P   
Sbjct: 165 VALCYLGFITLRLPVTPGDIHRWVTDEMLPYRKAGGLIPLNMRKRLPPNYIAI-LHP--- 220

Query: 316 FRPSK-SVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPLG--KILPRALKI 372
            RP + +   + L     S +   G+  P +N   L  RYLK+L LPL       R  K+
Sbjct: 221 -RPLRFNRFYRTLTDLQRSYSNDYGIAWPALNVPLLLHRYLKELALPLNIYDATRRLGKM 279

Query: 373 QEWSMPPDLWLSTNECRFPTRVCVMSILIVSIRILYNING 412
             +   PD      E        ++S L+V +++L+ ++G
Sbjct: 280 IGYDFVPDPKNKRPEVHQLPEAQLVSCLLVCVKLLHPLDG 319


>gi|212540280|ref|XP_002150295.1| RNA polymerase I specific transcription initiation factor Rrn7,
           putative [Talaromyces marneffei ATCC 18224]
 gi|210067594|gb|EEA21686.1| RNA polymerase I specific transcription initiation factor Rrn7,
           putative [Talaromyces marneffei ATCC 18224]
          Length = 544

 Score = 43.1 bits (100), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 27/111 (24%), Positives = 51/111 (45%), Gaps = 9/111 (8%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 315
           L  +L + +LAC ++R  +   DI K++I  +IP+  A   + +      +   +SP   
Sbjct: 166 LKETLGLCYLACLLLRLPVSIGDIYKYAIRNEIPFVKALNSVPQ-----EMRDRMSPQHT 220

Query: 316 FRPSKSVPSQKLE----SFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 362
              S    S+++        A  +    +  PPVN   +  ++++QL LPL
Sbjct: 221 RDLSIQAKSEEIHETVNQLVAMYSREYNVVFPPVNMPLMLFQFIRQLALPL 271


>gi|296815252|ref|XP_002847963.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238840988|gb|EEQ30650.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 512

 Score = 43.1 bits (100), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 60/263 (22%), Positives = 111/263 (42%), Gaps = 37/263 (14%)

Query: 159 YVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLASTGLLSQGWADEAIVQSESQE 218
           ++   QL++  QC ALV    + P + G+   +W   L     + +G  D  +    S +
Sbjct: 76  FLEAYQLLLWKQCHALVHVKGLPPELEGIVKDLWVLRLDK---IYEGRDDIYVEDENSTQ 132

Query: 219 LESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTD 278
           L S       +   E H  H ++        L     +  SLA+ +L   ++R   LP  
Sbjct: 133 LFSSQTGVSNEIDKEEHQYHKRK--------LSTSPRVLDSLALCYLGILLLR---LPVS 181

Query: 279 IVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFMFR-PSKSVPSQKLESFAASIAES 337
           I  W  E +IP+         RF  + +   L   +     +K+VP    + F  ++A+ 
Sbjct: 182 I-GWVTEDEIPFMRPL-----RFIPSEMKDRLPAIYHVAFETKNVPIG--DQFHRAVADL 233

Query: 338 IGLH-------LPPVNFYALASRYLKQLCLPLG-----KILPRALKIQEWSMPPDLWLST 385
           + L+        P +N   L   Y+KQL LP+      K L ++++   ++ P ++    
Sbjct: 234 VMLYYRGFSIEFPAINSSLLLFSYVKQLALPIEVFSAVKQLMKSVEFT-YTFPKEVKGKF 292

Query: 386 NECRFPTRVCVMSILIVSIRILY 408
                P  V +MS++IV+ ++L+
Sbjct: 293 RRIYLP-EVQLMSLVIVATKLLF 314


>gi|398391290|ref|XP_003849105.1| hypothetical protein MYCGRDRAFT_111069 [Zymoseptoria tritici
           IPO323]
 gi|339468981|gb|EGP84081.1| hypothetical protein MYCGRDRAFT_111069 [Zymoseptoria tritici
           IPO323]
          Length = 551

 Score = 43.1 bits (100), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 9/108 (8%)

Query: 260 LAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEI-----EKRFGQTSVACSLSPSF 314
           L + F A  ++R  +   D+  W  +G + Y+ A  EI     E+  G   +   L P  
Sbjct: 179 LCLDFTAFLLLRIPLTVADLTSWVNDGTLCYYRASKEIPLSMRERLPGH--LQEQLEPQD 236

Query: 315 MFRPSKSVPSQKLESFAASIAESIGLHLPPVNFYALASRYLKQLCLPL 362
           +  P      +K+  +  ++A++ G+ +PPVN   L  R++ +L LPL
Sbjct: 237 LVAPE--TLQRKILEWLTTLADNFGMAVPPVNHPLLLYRWVTELSLPL 282


>gi|115390963|ref|XP_001212986.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114193910|gb|EAU35610.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 405

 Score = 42.7 bits (99), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 67/310 (21%), Positives = 129/310 (41%), Gaps = 70/310 (22%)

Query: 134 EPQDFGSSGP----VKADYEDYH--FEVRMKYVMGMQ---LMIQLQCEALVDKFNVCPLI 184
           +P DFG+ G      KA  E  H  +  R  Y++ +Q   L++  QC ALV         
Sbjct: 42  DPDDFGTQGKRNRVKKAVIEKGHKTYRGRHAYILFLQTYQLILWKQCSALVQNKGFPSQF 101

Query: 185 CGVAASIWFRFLASTGLLSQGWADEAIVQSESQELESKDF---QPRAKYRDEPHTLHGQR 241
             V   +W        L  Q +  EA ++  S++ ++ +F   QP  +  D+ H L    
Sbjct: 102 EHVVRDLW-------ALRLQKY--EAKLREVSEDADTLEFFSSQPAGEPDDQLHLLTSGG 152

Query: 242 AVMIWYKLLRQKIPLSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEK-- 299
             + W +L+        ++A+ +L   ++R  I   D  +  +  ++PY      I +  
Sbjct: 153 KRIQWPRLV-------DTVALCYLGALLMRLPISVGDFHRMVMREEVPYIRVIKSIPRDI 205

Query: 300 --RFGQTSVACSLSPSFMFRPSKSVPSQKLESFAASIAESIGLH-------LPPVNFYAL 350
             +  Q  +A   +   +          K E    ++ E + L+        PP+N  AL
Sbjct: 206 RDKLPQEYLALIETTRLL----------KAEHLHKAVFELLLLYNQCFDIEFPPLNVPAL 255

Query: 351 ASRYLKQLCLPLGKILPRALKIQEWSMPPDLWLSTNECRFPTRVC------------VMS 398
              Y+++L LP+ ++ P   +++      DL   T E  FPT+V             +++
Sbjct: 256 LYNYIRRLALPV-ELYPAVKRLR------DLLGFTFE--FPTKVIGKRKPLHLPEVQLVT 306

Query: 399 ILIVSIRILY 408
           ++++S ++L+
Sbjct: 307 LIVISTKLLF 316


>gi|19112921|ref|NP_596129.1| RNA polymerase I upstream activation factor complex subunit Rrn7
           [Schizosaccharomyces pombe 972h-]
 gi|26399334|sp|Q9UST5.1|RRN7_SCHPO RecName: Full=RNA polymerase I-specific transcription initiation
           factor rrn7
 gi|6048298|emb|CAB58161.1| RNA polymerase I upstream activation factor complex subunit Rrn7
           [Schizosaccharomyces pombe]
          Length = 537

 Score = 41.2 bits (95), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 98/232 (42%), Gaps = 39/232 (16%)

Query: 156 RMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLA------STGLLSQGWADE 209
           R  Y+   Q+++QLQC+AL  K      I G+   +W  +L+      ++  LS    ++
Sbjct: 80  RSLYLQAFQIILQLQCQALTTKLGFDQRIEGMIRDLWALYLSLSYESFTSSFLS---LNQ 136

Query: 210 AIVQSESQELESKDFQPRAKYRDEP---HTLHGQRAVMIWYKLLRQKIPLSSSLAISFLA 266
              QSES + +     P ++   +P    T    ++  I Y  L     L SS A  ++A
Sbjct: 137 NSTQSESSDSDFDLIDPESQPGADPSSRKTKETSQSHSISYPRL-----LYSS-AFIYVA 190

Query: 267 CHVVREAILPTDIVKWSI---EGKIPYFAAF----VEIEKRFGQTSVACSLSPSFMFRPS 319
           C ++R   LP  I K  +   +  IPY+ A+    ++I KR  +  V        M  P 
Sbjct: 191 CLLLR---LPLTIHKLEVLIRKNIIPYYRAYKQIPLKIFKRLQKNYVR-------MLIPF 240

Query: 320 KSVPSQKLESFAASIAESI----GLHLPPVNFYALASRYLKQLCLPLGKILP 367
                Q+++S   ++ + +     L +PP N   +    +     PL   +P
Sbjct: 241 HYPTYQRIQSAVLTLVDVLVSKYELKVPPPNEPLILFELINSFFFPLEIFIP 292


>gi|440798393|gb|ELR19461.1| hypothetical protein ACA1_267200 [Acanthamoeba castellanii str.
           Neff]
          Length = 613

 Score = 41.2 bits (95), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 54/107 (50%), Gaps = 15/107 (14%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAFVEIEKRFGQTSVACSLSPSFM 315
           ++++L  +++AC++++E ++  D+++W   G++PY   +   + +  Q         SF 
Sbjct: 215 VTATLTFNYVACNILQEPVVAHDLIRWVRSGRLPYEEVYSLKDFKDAQ---------SFE 265

Query: 316 FRPSKSVPSQKLESFAASIAESIGLHL-PPVNFYALASRYLKQLCLP 361
            R     P  KLE  A ++    G+    P  FY    R++++L LP
Sbjct: 266 VR-----PLDKLEYQAYNLINYFGIEQEAPPPFYPTLCRFVEELHLP 307


>gi|194902452|ref|XP_001980701.1| GG17298 [Drosophila erecta]
 gi|190652404|gb|EDV49659.1| GG17298 [Drosophila erecta]
          Length = 872

 Score = 40.8 bits (94), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 7  DIRRLKCKKCDNVGFECAYDGFYYCTHCNALDDEIVETGVADED-FMATGAGTGTA 61
          +++ ++C  C+   F+    GFYYC  C    D+I    +  ED F  TGAG  TA
Sbjct: 9  ELQNMQCDVCEGTAFQERV-GFYYCVECGTQKDQIRAVDITAEDNFDETGAGRYTA 63


>gi|366992866|ref|XP_003676198.1| hypothetical protein NCAS_0D02560 [Naumovozyma castellii CBS 4309]
 gi|342302064|emb|CCC69837.1| hypothetical protein NCAS_0D02560 [Naumovozyma castellii CBS 4309]
          Length = 533

 Score = 40.4 bits (93), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 32/148 (21%), Positives = 61/148 (41%), Gaps = 25/148 (16%)

Query: 151 YHFEVRMKYVMGMQLMIQLQCEALVDKFNVCPLICGVAASIWFRFLAST-----GLLSQG 205
           Y  +  + ++   QL+++ QC  L++ F    L   V   IW ++L S       L+S  
Sbjct: 91  YGQDANLLFLKAFQLILRKQCSTLINDFGFPTLFTDVVKLIWMKYLESIDRDNRKLVSNE 150

Query: 206 WADEAIVQSESQELESKDFQPRAKYRDEPHTLHGQRAVMIWYKLLRQKIPLSSSLAISFL 265
           ++D  I      ++   D           HT H  R +          + +SS++A+ ++
Sbjct: 151 FSDGEIGGDTEVDVPIND-----------HTEHKVRKL---------GLHMSSTIAMLYM 190

Query: 266 ACHVVREAILPTDIVKWSIEGKIPYFAA 293
           A   +   +   D ++W    K+ YF A
Sbjct: 191 ASVHLGLPVYMNDFIQWISTTKLLYFKA 218


>gi|346325092|gb|EGX94689.1| ubiquitin fusion protein [Cordyceps militaris CM01]
          Length = 428

 Score = 40.4 bits (93), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 28/121 (23%), Positives = 57/121 (47%), Gaps = 17/121 (14%)

Query: 256 LSSSLAISFLACHVVREAILPTDIVKWSIEGKIPYFAAF----VEIEKRFGQTSVACSLS 311
           +S +LA+ +L C+++R      +++KW+ + + PY   F     E+++R     V     
Sbjct: 178 ISDTLALCYLGCYLLRIPTRLGELIKWANDFQFPYRHTFQQLPTEMQERMPSIYVKVLKI 237

Query: 312 PSFMFRPSKSVPSQKLESFAASIAESIGLHL------PPVNFYALASRYLKQLCLPLGKI 365
           P         +P +  ++++A +  +   HL      P +N+  L  +Y K L LP+  I
Sbjct: 238 PI-------RLPLEGAQAYSAVMDLACSYHLNYNMLFPDLNYMPLLVQYAKLLALPVESI 290

Query: 366 L 366
           +
Sbjct: 291 I 291


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.132    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,139,046,059
Number of Sequences: 23463169
Number of extensions: 503834097
Number of successful extensions: 1350930
Number of sequences better than 100.0: 142
Number of HSP's better than 100.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 99
Number of HSP's that attempted gapping in prelim test: 1350367
Number of HSP's gapped (non-prelim): 454
length of query: 770
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 619
effective length of database: 8,816,256,848
effective search space: 5457262988912
effective search space used: 5457262988912
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)