BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 042741
         (166 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449432628|ref|XP_004134101.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
           [Cucumis sativus]
 gi|449432630|ref|XP_004134102.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
           [Cucumis sativus]
 gi|449504102|ref|XP_004162253.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
           [Cucumis sativus]
 gi|449504105|ref|XP_004162254.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
           [Cucumis sativus]
          Length = 190

 Score =  149 bits (377), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 79/177 (44%), Positives = 109/177 (61%), Gaps = 14/177 (7%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF+EVEL+RD+ +  +   R+     QRY +  LLE  + +KA KDHGYFLSVT  +SI 
Sbjct: 1   MFYEVELVRDVEITVEKEKRDAHNF-QRYIITCLLENLLKEKANKDHGYFLSVTSLRSIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
           K G   N    VSFP+ F+CRTFLP +GEILHGV            CGP+KY  +S R+M
Sbjct: 60  K-GIVKNESQCVSFPITFICRTFLPFEGEILHGVVRHIFQRGLLLKCGPIKYVFLSARKM 118

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFGRAKGES 165
           PTY++V G+   FS+ +   IGN VVV+F V  VRW  +   +K+E+++    +G +
Sbjct: 119 PTYQYVGGENPVFSSKEFATIGNDVVVRFSVLGVRWIEKRGCIKKEFVMLASLEGNN 175


>gi|224107919|ref|XP_002314653.1| predicted protein [Populus trichocarpa]
 gi|222863693|gb|EEF00824.1| predicted protein [Populus trichocarpa]
          Length = 189

 Score =  143 bits (361), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/178 (43%), Positives = 110/178 (61%), Gaps = 14/178 (7%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF EVE+   + + A+++DRN L VPQR  V  LL+  ++ KA KDHGYFL+VT  KSI 
Sbjct: 1   MFGEVEVCSTVRIIAENLDRNGL-VPQRSIVTHLLKDLLSMKASKDHGYFLAVTNLKSIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
           K G  VN  G V F V F CRTF+P+KGEIL GV            CGP+KY  +S R+M
Sbjct: 60  K-GEVVNKSGDVLFHVEFKCRTFMPMKGEILQGVVHRTFRHGVLLRCGPVKYIFLSARKM 118

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFGRAKGESI 166
           P Y++ S +   F ND+  +I N V+V+F V  VRW  +  +++R++++     G+S+
Sbjct: 119 PNYQYTSEENPVFLNDELARIENNVLVRFSVLDVRWIEKMWDMRRDFMMLASLVGDSL 176


>gi|30682881|ref|NP_193188.2| DNA-directed RNA polymerase II-like protein [Arabidopsis thaliana]
 gi|30682886|ref|NP_849385.1| DNA-directed RNA polymerase II-like protein [Arabidopsis thaliana]
 gi|334186527|ref|NP_001190729.1| DNA-directed RNA polymerase II-like protein [Arabidopsis thaliana]
 gi|38603938|gb|AAR24714.1| At4g14520 [Arabidopsis thaliana]
 gi|44681416|gb|AAS47648.1| At4g14520 [Arabidopsis thaliana]
 gi|110741153|dbj|BAE98669.1| hypothetical protein [Arabidopsis thaliana]
 gi|332658054|gb|AEE83454.1| DNA-directed RNA polymerase II-like protein [Arabidopsis thaliana]
 gi|332658055|gb|AEE83455.1| DNA-directed RNA polymerase II-like protein [Arabidopsis thaliana]
 gi|332658056|gb|AEE83456.1| DNA-directed RNA polymerase II-like protein [Arabidopsis thaliana]
          Length = 200

 Score =  109 bits (273), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 65/186 (34%), Positives = 99/186 (53%), Gaps = 26/186 (13%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSI- 59
           MF EVE+ RD+A+ AK ++      P +  + RLL+  I++KAC++HG++L +T  KSI 
Sbjct: 1   MFSEVEMARDVAICAKHLNGQS---PHQPILCRLLQDLIHEKACREHGFYLGITALKSIG 57

Query: 60  -----DKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYAL 102
                + +    +   +++FPV F CRTFLP +G+IL G              GP++YA 
Sbjct: 58  NNKNNNIDNENNHQAKILTFPVSFTCRTFLPARGDILQGTVKKVLWNGAFIRSGPLRYAY 117

Query: 103 MSPRRMPTYRHVSG-----KKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLV 157
           +S  +MP Y +V       +K  F  D   KI  GVVV+F V AVR+       + +Y V
Sbjct: 118 LSLLKMPHYHYVHSPLSEDEKPHFQKDDLSKIAVGVVVRFQVLAVRFKERPHKRRNDYYV 177

Query: 158 FGRAKG 163
               +G
Sbjct: 178 LATLEG 183


>gi|225432916|ref|XP_002284221.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7 [Vitis
           vinifera]
 gi|147822056|emb|CAN61549.1| hypothetical protein VITISV_043525 [Vitis vinifera]
          Length = 177

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 93/164 (56%), Gaps = 14/164 (8%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+ +D   L + QR  + RLL+ + N+KA ++ GYFL+VT  ++I 
Sbjct: 1   MFLKVQLPWNVIIPAECLDAKGLML-QRSIIIRLLDAFSNKKATQELGYFLAVTTLENIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG      G V FPV+F C TF   +GEIL GV            CGP++   +S ++M
Sbjct: 60  -EGKVRQHSGDVLFPVVFSCVTFKLFRGEILDGVVHKVLKHGVILRCGPVENIYLSCQKM 118

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           P YR+V G+   F N++  KI   VVV+F+V   +W    R  +
Sbjct: 119 PDYRYVPGENPVFLNEKLSKIEKDVVVRFIVMGTKWLEAEREFQ 162


>gi|297737162|emb|CBI26363.3| unnamed protein product [Vitis vinifera]
          Length = 242

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 93/164 (56%), Gaps = 14/164 (8%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+ +D   L + QR  + RLL+ + N+KA ++ GYFL+VT  ++I 
Sbjct: 66  MFLKVQLPWNVIIPAECLDAKGLML-QRSIIIRLLDAFSNKKATQELGYFLAVTTLENIG 124

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG      G V FPV+F C TF   +GEIL GV            CGP++   +S ++M
Sbjct: 125 -EGKVRQHSGDVLFPVVFSCVTFKLFRGEILDGVVHKVLKHGVILRCGPVENIYLSCQKM 183

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           P YR+V G+   F N++  KI   VVV+F+V   +W    R  +
Sbjct: 184 PDYRYVPGENPVFLNEKLSKIEKDVVVRFIVMGTKWLEAEREFQ 227


>gi|449465372|ref|XP_004150402.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
           [Cucumis sativus]
 gi|449522228|ref|XP_004168129.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
           [Cucumis sativus]
          Length = 175

 Score =  100 bits (248), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 64/164 (39%), Positives = 92/164 (56%), Gaps = 15/164 (9%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+++D   L + QR  + RLL+ +  +KA KD GYFL+VT  ++I 
Sbjct: 1   MFLKVQLPWNVIIPAENLDAKGLML-QRSIIIRLLDEFATKKATKDLGYFLAVTTLENIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG  V   G V FPVIF   TF   +GEIL GV            CGP++   +S  +M
Sbjct: 60  -EG-KVRQTGDVLFPVIFSGITFKLYRGEILEGVVHKVLKHGVFLRCGPVENIYLSYLKM 117

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           P YR+V G+   F ND+  KI   VVV+F+V   +W    R  +
Sbjct: 118 PDYRYVPGENPVFLNDKLSKIEKDVVVRFIVIGTKWLEAEREFQ 161


>gi|2244808|emb|CAB10231.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268158|emb|CAB78494.1| hypothetical protein [Arabidopsis thaliana]
          Length = 194

 Score =  100 bits (248), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 60/181 (33%), Positives = 94/181 (51%), Gaps = 26/181 (14%)

Query: 7   LLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSI------D 60
           + RD+A+ AK ++      P +  + RLL+  I++KAC++HG++L +T  KSI      +
Sbjct: 1   MARDVAICAKHLNGQS---PHQPILCRLLQDLIHEKACREHGFYLGITALKSIGNNKNNN 57

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            +    +   +++FPV F CRTFLP +G+IL G              GP++YA +S  +M
Sbjct: 58  IDNENNHQAKILTFPVSFTCRTFLPARGDILQGTVKKVLWNGAFIRSGPLRYAYLSLLKM 117

Query: 109 PTYRHVSG-----KKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFGRAKG 163
           P Y +V       +K  F  D   KI  GVVV+F V AVR+       + +Y V    +G
Sbjct: 118 PHYHYVHSPLSEDEKPHFQKDDLSKIAVGVVVRFQVLAVRFKERPHKRRNDYYVLATLEG 177

Query: 164 E 164
            
Sbjct: 178 N 178


>gi|224102153|ref|XP_002312568.1| predicted protein [Populus trichocarpa]
 gi|222852388|gb|EEE89935.1| predicted protein [Populus trichocarpa]
          Length = 176

 Score = 98.2 bits (243), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 90/164 (54%), Gaps = 14/164 (8%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+++D   L + QR  V RLL+ +  + A KD GY+L+V+  +SI 
Sbjct: 1   MFLKVQLPWNVIIPAENLDAKGLML-QRSIVVRLLDDFAKKGATKDLGYYLAVSTLESIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG      G V FPV+F   TF   +GEIL G+            CGP++   +S  +M
Sbjct: 60  -EGKVRQHTGDVLFPVVFSGITFKIFRGEILDGIVHKVLKHGVLLRCGPIENIYLSCMKM 118

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           P YR+V G+   F ND+  KI   VVV+F+V   +W    R  +
Sbjct: 119 PDYRYVPGENPVFLNDKTSKIEKDVVVRFVVLGTKWLEAEREFQ 162


>gi|297804848|ref|XP_002870308.1| hypothetical protein ARALYDRAFT_493454 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316144|gb|EFH46567.1| hypothetical protein ARALYDRAFT_493454 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 194

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 93/176 (52%), Gaps = 26/176 (14%)

Query: 7   LLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSI------D 60
           + RD+A+ A  ++    + P    + RLL+  I++KAC++HG++L +T  KSI      +
Sbjct: 1   MARDVAICANHLNG---QAPHHQILGRLLKDLIHEKACREHGFYLGITALKSIGNNKNNN 57

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            E    +   +++FPV F CRTFLP +G+IL G              GP++YA +S  +M
Sbjct: 58  DENKDNHQAHLLTFPVSFTCRTFLPARGDILQGTVKKVLWNGAFIRSGPLRYAYLSFLKM 117

Query: 109 PTYRHVSG-----KKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFG 159
           P Y +V       +K +F  D   KI  GVVV+F V AVR+  +    + +Y V  
Sbjct: 118 PDYHYVHSPLLEDEKPYFQKDDLSKIAVGVVVRFGVLAVRFKEKPHKRRNDYYVLA 173


>gi|255563208|ref|XP_002522607.1| DNA-directed RNA polymerase II 19 kD polypeptide rpb7, putative
           [Ricinus communis]
 gi|223538083|gb|EEF39694.1| DNA-directed RNA polymerase II 19 kD polypeptide rpb7, putative
           [Ricinus communis]
          Length = 176

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 89/164 (54%), Gaps = 14/164 (8%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           M+ +V+L  ++ + A+ +D   L + QR  + RLLE + ++KA KD GY+L+VT  +SI 
Sbjct: 1   MYLKVQLPWNVIISAEHLDAKGLML-QRSIIIRLLEDFASKKATKDLGYYLAVTTLESIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG      G V FPV+F   TF   +GEIL GV            CGP++   +S  +M
Sbjct: 60  -EGKVREHTGDVLFPVVFNGITFKIFRGEILEGVVHKVLKHGVFIRCGPIENIYLSCMKM 118

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           P Y +V G+   F ND+  KI   VVV+F+V   +W    R  +
Sbjct: 119 PDYHYVPGENPVFLNDKTSKIEKDVVVRFIVIGTKWLEAEREFQ 162


>gi|351724841|ref|NP_001238607.1| uncharacterized protein LOC100527862 [Glycine max]
 gi|255633396|gb|ACU17055.1| unknown [Glycine max]
          Length = 180

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 86/164 (52%), Gaps = 14/164 (8%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ + A+++    L +  R  + RLL  +  +KA KD GYFL+VT  + I 
Sbjct: 1   MFLKVQLHWNVIIAAENLQPEGLML-HRAIIVRLLSDFAVKKATKDLGYFLAVTTLEKIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG      G V FPV+F   TF   KGEIL GV            CGP++   +S  +M
Sbjct: 60  -EGKVRQHTGDVLFPVVFNVITFKFFKGEILEGVVHKVLKHGVFMRCGPIENVYLSNLKM 118

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           P YR+V G+ + F ND+  KIG  V V+F V   +W    R  +
Sbjct: 119 PDYRYVPGENACFMNDKMSKIGKDVTVRFSVIGTKWMEAEREFQ 162


>gi|118487294|gb|ABK95475.1| unknown [Populus trichocarpa]
          Length = 176

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 88/164 (53%), Gaps = 14/164 (8%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+++D   L + QR  V  LL  +  ++A KD GY+L+V+  +SI 
Sbjct: 1   MFLKVQLPWNVIIPAENLDAKGLML-QRSIVVCLLADFAKKRATKDLGYYLAVSTLESIG 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG      G V FPV+F   TF   KGEIL GV            CGP++   +S  +M
Sbjct: 60  -EGKVRQHTGDVLFPVVFSGITFKIFKGEILEGVVHKVLKHGVLLRCGPIENIYLSSMKM 118

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
             YR+V G+   F ND+  KI   VVV+F+V   +W    R  +
Sbjct: 119 LDYRYVPGENPVFLNDKTSKIEKDVVVRFVVLGTKWLEAEREFQ 162


>gi|224107923|ref|XP_002314654.1| predicted protein [Populus trichocarpa]
 gi|222863694|gb|EEF00825.1| predicted protein [Populus trichocarpa]
          Length = 193

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 88/164 (53%), Gaps = 14/164 (8%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+++D   L + QR  V  LL  +  ++A KD GY+L+V+  +SI 
Sbjct: 18  MFLKVQLPWNVIIPAENLDAKGLML-QRSIVVCLLADFAKKRATKDLGYYLAVSTLESIG 76

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
            EG      G V FPV+F   TF   KGEIL GV            CGP++   +S  +M
Sbjct: 77  -EGKVRQHTGDVLFPVVFSGITFKIFKGEILEGVVHKVLKHGVLLRCGPIENIYLSSMKM 135

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
             YR+V G+   F ND+  KI   VVV+F+V   +W    R  +
Sbjct: 136 LDYRYVPGENPVFLNDKTSKIEKDVVVRFVVLGTKWLEAEREFQ 179


>gi|388504534|gb|AFK40333.1| unknown [Lotus japonicus]
          Length = 180

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 55/166 (33%), Positives = 89/166 (53%), Gaps = 18/166 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ + A+++ +  L + QR  + RLL  +  ++A KD GYF +VT   ++D
Sbjct: 1   MFLKVQLSWNVIIAAENLQQGSLML-QRAILIRLLGDFAAKRATKDLGYFTAVT---TLD 56

Query: 61  K--EGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPR 106
           K  EG      G V FPV+F   TF   KGEIL GV            CGP+++  +S  
Sbjct: 57  KVGEGKVRQHTGDVLFPVVFNGVTFKLFKGEILEGVVHKVLKHGVFLRCGPIEHVYLSNM 116

Query: 107 RMPTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           +M  YR+  G+ ++F N++  KI   V ++F+V   +W    R  +
Sbjct: 117 KMADYRYFPGENAYFMNEKASKIAKDVTIRFVVIGTKWMEAEREFQ 162


>gi|42566796|ref|NP_193202.2| DNA-directed RNA polymerase II subunit G [Arabidopsis thaliana]
 gi|149944323|gb|ABR46204.1| At4g14660 [Arabidopsis thaliana]
 gi|332658072|gb|AEE83472.1| DNA-directed RNA polymerase II subunit G [Arabidopsis thaliana]
          Length = 178

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 98/180 (54%), Gaps = 22/180 (12%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+++D   L + +R  +  LLE + ++KA K+ GY+++VT   ++D
Sbjct: 1   MFLKVQLPWNVMIPAENMDAKGLML-KRAILVELLEAFASKKATKELGYYVAVT---TLD 56

Query: 61  K--EGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPR 106
           K  EG      G V FPV+F   TF   KGEI+HGV            CGP++   +S  
Sbjct: 57  KIGEGKIREHTGEVLFPVMFSGMTFKIFKGEIIHGVVHKVLKHGVFMRCGPIENVYLSYT 116

Query: 107 RMPTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFGRAKGESI 166
           +MP Y+++ G+   F N++  +I     V+ +V  ++W      ++RE+      +G+ +
Sbjct: 117 KMPDYKYIPGENPIFMNEKTSRIQVETTVRVVVIGIKW----MEVEREFQALASLEGDYL 172


>gi|388498028|gb|AFK37080.1| unknown [Medicago truncatula]
          Length = 180

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 87/166 (52%), Gaps = 18/166 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ + A+++    L + QR  + RLL  +  +KA KD GYFL+VT   ++D
Sbjct: 1   MFLKVQLPWNVIIAAENLKPGSLML-QRAILIRLLSDFAAKKATKDMGYFLAVT---TLD 56

Query: 61  K--EGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVC------------GPMKYALMSPR 106
           K  EG      G V FPV+F   TF   KGE+L GV             GP++ A +S  
Sbjct: 57  KIGEGKVRQHTGDVLFPVVFNAVTFKIFKGEVLEGVVHKVLKHGVFMRIGPIENAYLSSS 116

Query: 107 RMPTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           +MP Y +V G+  +F N + PKI   V V+ +V   +W    R  +
Sbjct: 117 KMPGYVYVLGENPYFMNQKMPKIAKDVKVRVVVIGTKWMEAEREFQ 162


>gi|297800772|ref|XP_002868270.1| RNA polymerase Rpb7 N-terminal domain-containing protein
           [Arabidopsis lyrata subsp. lyrata]
 gi|297314106|gb|EFH44529.1| RNA polymerase Rpb7 N-terminal domain-containing protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 178

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 97/180 (53%), Gaps = 22/180 (12%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ +PA+++D   L + +R  +  LL+ + ++KA K+ GY+++VT   ++D
Sbjct: 1   MFLKVQLPWNVMIPAENMDAKGL-ILKRAILVELLDAFASKKATKELGYYVAVT---TLD 56

Query: 61  K--EGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPR 106
           K  EG      G V FPV+F   TF   KGEI+HGV            CGP++   +S  
Sbjct: 57  KIGEGKIREHTGEVLFPVMFSGMTFKIFKGEIIHGVVHKVLKHGVFMRCGPIENVYLSYT 116

Query: 107 RMPTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFGRAKGESI 166
           +MP Y++V G+   F N++  +I     V+ +V  ++W       +RE+      +G+ +
Sbjct: 117 KMPDYKYVPGENPIFMNEKTSRIQVETTVRVVVIGIKW----MEAEREFQALASLEGDYL 172


>gi|217074784|gb|ACJ85752.1| unknown [Medicago truncatula]
          Length = 180

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 57/166 (34%), Positives = 86/166 (51%), Gaps = 18/166 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ + A+++    L + QR  + RLL  +  +KA KD GYFL+VT   ++D
Sbjct: 1   MFLKVQLPWNVIIAAENLKPGSLML-QRAILIRLLSDFAAKKATKDMGYFLAVT---TLD 56

Query: 61  K--EGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVC------------GPMKYALMSPR 106
           K  EG      G V FPV+F   TF   KGE+L GV             GP++ A +S  
Sbjct: 57  KIGEGKVRQHTGDVLFPVVFNAVTFKIFKGEVLEGVVHKVLKHGVFMRIGPIENAYLSSS 116

Query: 107 RMPTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           +MP Y +V G+  +F N + PKI   V  + +V   +W    R  +
Sbjct: 117 KMPGYVYVLGENPYFMNQKMPKIAKDVKARVVVIGTKWMEAEREFQ 162


>gi|18403570|ref|NP_566719.1| DNA-directed RNA polymerase II subunit G [Arabidopsis thaliana]
 gi|21553765|gb|AAM62858.1| RNA polymerase II fifth largest subunit-like protein [Arabidopsis
           thaliana]
 gi|332643168|gb|AEE76689.1| DNA-directed RNA polymerase II subunit G [Arabidopsis thaliana]
          Length = 174

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 88/178 (49%), Gaps = 19/178 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  D+ +PA+ +D   +   QR  V RLLE +  +KA KD GY ++ T  ++I 
Sbjct: 1   MFIKVKLPWDVTIPAEDMDTGLML--QRAIVIRLLEAFSKEKATKDLGYLITPTILENIG 58

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVC------------GPMKYALMSPRRM 108
            EG      G + FPV+F    F   KGEI+HGV             GP +   +S  +M
Sbjct: 59  -EGKIKEQTGEIQFPVVFNGICFKMFKGEIVHGVVHKVHKTGVFLKSGPYEIIYLSHMKM 117

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFGRAKGESI 166
           P Y  + G+  FF N    +I  G  V+F+V    W    R  +++++      G+++
Sbjct: 118 PGYEFIPGENPFFMNQYMSRIQIGARVRFVVLDTEW----REAEKDFMALASIDGDNL 171


>gi|242086985|ref|XP_002439325.1| hypothetical protein SORBIDRAFT_09g004460 [Sorghum bicolor]
 gi|241944610|gb|EES17755.1| hypothetical protein SORBIDRAFT_09g004460 [Sorghum bicolor]
          Length = 175

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 86/164 (52%), Gaps = 15/164 (9%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           +F E E+  ++ +    +DR  L + ++  + RLLE   N++A K+HGY+++V++ K+I 
Sbjct: 2   VFLEAEMSWNVLISPSQLDRKGL-LLRKAIIVRLLEDVTNRRASKEHGYYVAVSQLKAI- 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEIL---------HGV---CGPMKYALMSPRRM 108
            EG      G V FPV F C T  P+KGE++         HGV    GP++   ++ + M
Sbjct: 60  SEGKVRELTGDVLFPVSFTCITLKPMKGEVMVGHVDRILKHGVFLKSGPVESIFLAEKSM 119

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
             Y+++ G+ + F ND   K+     V+F V   RW    R  +
Sbjct: 120 SDYKYIGGENAMFMNDHS-KLEKDTAVRFKVLGFRWMEADRQFQ 162


>gi|297831004|ref|XP_002883384.1| RNA polymerase Rpb7 N-terminal domain-containing protein
           [Arabidopsis lyrata subsp. lyrata]
 gi|297329224|gb|EFH59643.1| RNA polymerase Rpb7 N-terminal domain-containing protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 174

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 55/178 (30%), Positives = 88/178 (49%), Gaps = 19/178 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  D+ +PA+ +D   +   QR  V RLLE +  +KA KD GY ++ T  ++I 
Sbjct: 1   MFIKVKLPWDVTIPAEDMDTGLML--QRAIVIRLLEAFGTKKATKDLGYLITPTILENIG 58

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVC------------GPMKYALMSPRRM 108
            EG      G + FPV+F    F   KGE++HGV             GP +   +S  +M
Sbjct: 59  -EGKIKEQTGEIQFPVVFNGICFKMFKGEVVHGVVQKVHKSGVFLRSGPYEIIYLSHVKM 117

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLKREYLVFGRAKGESI 166
           P Y  + G+K  F N    +I  G  V+F+V    W    R  +++++      G+++
Sbjct: 118 PGYEFIPGEKPIFMNQNMSRIQIGARVRFIVLDTEW----REAEKDFMALASIDGDNL 171


>gi|226493930|ref|NP_001150375.1| LOC100284005 [Zea mays]
 gi|194705372|gb|ACF86770.1| unknown [Zea mays]
 gi|195638760|gb|ACG38848.1| DNA-directed RNA polymerase II 19 kDa polypeptide [Zea mays]
 gi|195658229|gb|ACG48582.1| DNA-directed RNA polymerase II 19 kDa polypeptide [Zea mays]
 gi|413944600|gb|AFW77249.1| DNA-directed RNA polymerase II polypeptide isoform 1 [Zea mays]
 gi|413944601|gb|AFW77250.1| DNA-directed RNA polymerase II polypeptide isoform 2 [Zea mays]
 gi|413944602|gb|AFW77251.1| DNA-directed RNA polymerase II polypeptide isoform 3 [Zea mays]
          Length = 175

 Score = 85.1 bits (209), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 85/164 (51%), Gaps = 15/164 (9%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           +F E E+  ++ +    +DR  L + ++  + RLLE   N++A K+HGY+++V + K+I 
Sbjct: 2   VFLEAEMSWNVLISPSQLDRKGL-LLRKAIIVRLLEDVTNRRASKEHGYYIAVNQLKAIS 60

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEIL---------HGV---CGPMKYALMSPRRM 108
            EG      G V FPV F C T  P+KGE++         HGV    GP++   ++ + M
Sbjct: 61  -EGKVRELTGDVLFPVSFTCITQKPMKGEVMVGHVDRILKHGVFLKSGPVESIFLAEKSM 119

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
             Y+++ G+ + F ND   K+     V+F V   RW    R  +
Sbjct: 120 SNYKYIGGENAMFMND-HSKLEKDTAVRFKVLGFRWMEADRQFQ 162


>gi|115462207|ref|NP_001054703.1| Os05g0157100 [Oryza sativa Japonica Group]
 gi|45267865|gb|AAS55764.1| putative RNA polymerase II [Oryza sativa Japonica Group]
 gi|113578254|dbj|BAF16617.1| Os05g0157100 [Oryza sativa Japonica Group]
 gi|125550915|gb|EAY96624.1| hypothetical protein OsI_18537 [Oryza sativa Indica Group]
 gi|215692867|dbj|BAG88287.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222630264|gb|EEE62396.1| hypothetical protein OsJ_17187 [Oryza sativa Japonica Group]
          Length = 175

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 86/164 (52%), Gaps = 15/164 (9%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           +F +VE+  ++ +    +    L + ++  + RLLE   N+KA KDHGY+++V++ K+I 
Sbjct: 2   VFLKVEMSLNVLISPSQLSPQGLLL-RKAVIVRLLEDIANRKASKDHGYYIAVSELKAIS 60

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEIL---------HGV---CGPMKYALMSPRRM 108
            EG      G V FPV F C T  P+KGE+L         HG+    GP++   +S + M
Sbjct: 61  -EGKVRELTGDVLFPVTFTCITQKPMKGEVLVGSVDKILKHGIFLKSGPIESIFLSEKTM 119

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
             ++++ G+ + F N +  K+    VV+F V   RW    R  +
Sbjct: 120 SDFKYIGGENAVFMN-EHSKLEKDTVVRFKVMGFRWMEADRQFQ 162


>gi|357134424|ref|XP_003568817.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like isoform
           1 [Brachypodium distachyon]
 gi|357134426|ref|XP_003568818.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like isoform
           2 [Brachypodium distachyon]
          Length = 175

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 71/138 (51%), Gaps = 14/138 (10%)

Query: 27  QRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPV 86
           +++ + RLLE   N+KA K+HGY+++V + K I  EG      G V FPV F C T  P+
Sbjct: 27  RKFIIVRLLEDVTNRKASKEHGYYIAVNELKEI-SEGKVRELTGDVLFPVTFTCITLKPM 85

Query: 87  KGEIL---------HGV---CGPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVV 134
           KGEIL         HG+    GP++   +S + M  Y+++ G+   F  D   K+    +
Sbjct: 86  KGEILVGSVDKILKHGMFLKSGPIENIFLSEKTMNDYKYIGGENPMFMKDHS-KLEKDTI 144

Query: 135 VQFLVTAVRWSGEGRNLK 152
           V+F V   RW    R  +
Sbjct: 145 VRFRVMGFRWMEGDRQFQ 162


>gi|2244822|emb|CAB10245.1| RNA polymerase II fifth largest subunit like protein [Arabidopsis
           thaliana]
 gi|7268172|emb|CAB78508.1| RNA polymerase II fifth largest subunit like protein [Arabidopsis
           thaliana]
          Length = 161

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 81/154 (52%), Gaps = 21/154 (13%)

Query: 27  QRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDK--EGPTVNGPGVVSFPVIFMCRTFL 84
           +R  +  LLE + ++KA K+ GY+++VT   ++DK  EG      G V FPV+F   TF 
Sbjct: 9   KRAILVELLEAFASKKATKELGYYVAVT---TLDKIGEGKIREHTGEVLFPVMFSGMTFK 65

Query: 85  PVKGEILHGV------------CGPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNG 132
             KGEI+HGV            CGP++   +S  +MP Y+++ G+   F N++  +I   
Sbjct: 66  IFKGEIIHGVVHKVLKHGVFMRCGPIENVYLSYTKMPDYKYIPGENPIFMNEKTSRIQVE 125

Query: 133 VVVQFLVTAVRWSGEGRNLKREYLVFGRAKGESI 166
             V+ +V  ++W      ++RE+      +G+ +
Sbjct: 126 TTVRVVVIGIKW----MEVEREFQALASLEGDYL 155


>gi|218196310|gb|EEC78737.1| hypothetical protein OsI_18944 [Oryza sativa Indica Group]
          Length = 175

 Score = 79.0 bits (193), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 48/138 (34%), Positives = 70/138 (50%), Gaps = 14/138 (10%)

Query: 27  QRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPV 86
           ++  +  LLE   N+KA KDHGY+++V++ K+I  EG      G V FPV F C T  P 
Sbjct: 27  RKAVIVSLLEEIANRKASKDHGYYIAVSELKAIS-EGKVRELTGDVLFPVTFTCITQKPT 85

Query: 87  KGEIL---------HGV---CGPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVV 134
           KGEIL         HGV    GP++   +S + +  Y+++ G+   F ND   K+     
Sbjct: 86  KGEILVGSVDKILKHGVFLKSGPIESIFLSEKTLSDYKYIGGENPMFMNDHS-KLEKDTA 144

Query: 135 VQFLVTAVRWSGEGRNLK 152
           V+F V   RW    R  +
Sbjct: 145 VRFKVMGFRWMEADRQFQ 162


>gi|57863806|gb|AAS98454.2| putative receptor kinase [Oryza sativa Japonica Group]
          Length = 715

 Score = 78.2 bits (191), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 48/138 (34%), Positives = 70/138 (50%), Gaps = 14/138 (10%)

Query: 27  QRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPV 86
           ++  +  LLE   N+KA KDHGY+++V++ K+I  EG      G V FPV F C T  P 
Sbjct: 62  RKAVIVSLLEEIANRKASKDHGYYIAVSELKAI-SEGKVRELTGDVLFPVTFTCITQKPT 120

Query: 87  KGEIL---------HGV---CGPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVV 134
           KGEIL         HGV    GP++   +S + +  Y+++ G+   F ND   K+     
Sbjct: 121 KGEILVGSVDKILKHGVFLKSGPIESIFLSEKTLSDYKYIGGENPMFMNDHS-KLEKDTA 179

Query: 135 VQFLVTAVRWSGEGRNLK 152
           V+F V   RW    R  +
Sbjct: 180 VRFKVMGFRWMEADRQFQ 197


>gi|222630667|gb|EEE62799.1| hypothetical protein OsJ_17602 [Oryza sativa Japonica Group]
          Length = 796

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 82/164 (50%), Gaps = 15/164 (9%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           +F +V++  ++ +    +    L + ++  +  LLE   N+KA KDHGY+++V++ K+I 
Sbjct: 2   VFLKVDMSWNVLISPSELSPKGLLL-RKAVIVSLLEEIANRKASKDHGYYIAVSELKAI- 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEIL---------HGV---CGPMKYALMSPRRM 108
            EG      G V FPV F C T  P KGEIL         HGV    GP++   +S + +
Sbjct: 60  SEGKVRELTGDVLFPVTFTCITQKPTKGEILVGSVDKILKHGVFLKSGPIESIFLSEKTL 119

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
             Y+++ G+   F ND   K+     V+F V   RW    R  +
Sbjct: 120 SDYKYIGGENPMFMNDHS-KLEKDTAVRFKVMGFRWMEADRQFQ 162


>gi|357134557|ref|XP_003568883.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
           [Brachypodium distachyon]
          Length = 175

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 69/138 (50%), Gaps = 14/138 (10%)

Query: 27  QRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPV 86
           ++  + RLLE   N+KA K+HGY+++V + K I  EG      G V FPV F C T  P+
Sbjct: 27  RKSILVRLLEDIANRKASKEHGYYIAVNELKEIS-EGKVRELTGDVLFPVTFTCITLKPM 85

Query: 87  KGEIL---------HGV---CGPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVV 134
           KGEIL         HGV    GP++   +S + M  Y+++ G+   F  D   K+    V
Sbjct: 86  KGEILVGSVEKILKHGVFLKSGPIENVFLSEKTMNDYKYIGGENPMFMKDHS-KLEKDTV 144

Query: 135 VQFLVTAVRWSGEGRNLK 152
           ++F     RW    R  +
Sbjct: 145 LRFKAMGFRWMEADRQFQ 162


>gi|357441001|ref|XP_003590778.1| DNA-directed RNA polymerase II subunit RPB7 [Medicago truncatula]
 gi|355479826|gb|AES61029.1| DNA-directed RNA polymerase II subunit RPB7 [Medicago truncatula]
          Length = 173

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 77/164 (46%), Gaps = 28/164 (17%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF +V+L  ++ + A+++    L +               Q+A KD GYFL+VT    I 
Sbjct: 8   MFLKVQLPWNVIIAAENLKPGSLML---------------QRATKDMGYFLAVTTLDKIG 52

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVC------------GPMKYALMSPRRM 108
            EG      G V FPV+F   TF   KGE+L GV             GP++ A +S  +M
Sbjct: 53  -EGKVRQHTGDVLFPVVFNAVTFKIFKGEVLEGVVHKVLKHGVFMRIGPIENAYLSSSKM 111

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEGRNLK 152
           P Y +V G+  +F N + PKI   V V+ +V   +W    R  +
Sbjct: 112 PGYVYVLGENPYFMNQKMPKIAKDVKVRVVVIGTKWMEAEREFQ 155


>gi|11994719|dbj|BAB03035.1| RNA polymerase II fifth largest subunit-like protein [Arabidopsis
           thaliana]
          Length = 157

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 73/152 (48%), Gaps = 17/152 (11%)

Query: 27  QRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPV 86
           QR  V RLLE +  +KA KD GY ++ T  ++I  EG      G + FPV+F    F   
Sbjct: 8   QRAIVIRLLEAFSKEKATKDLGYLITPTILENIG-EGKIKEQTGEIQFPVVFNGICFKMF 66

Query: 87  KGEILHGVC------------GPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVV 134
           KGEI+HGV             GP +   +S  +MP Y  + G+  FF N    +I  G  
Sbjct: 67  KGEIVHGVVHKVHKTGVFLKSGPYEIIYLSHMKMPGYEFIPGENPFFMNQYMSRIQIGAR 126

Query: 135 VQFLVTAVRWSGEGRNLKREYLVFGRAKGESI 166
           V+F+V    W    R  +++++      G+++
Sbjct: 127 VRFVVLDTEW----REAEKDFMALASIDGDNL 154


>gi|326527317|dbj|BAK04600.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 175

 Score = 75.5 bits (184), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 69/138 (50%), Gaps = 14/138 (10%)

Query: 27  QRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPV 86
           ++  + RLLE   N+KA  +HGY+++V + K+I  EG      G V FPV F C T  P+
Sbjct: 27  RKSIIVRLLEDITNRKASNEHGYYIAVNELKTIS-EGKVRELTGDVLFPVTFTCITQRPM 85

Query: 87  KGEIL---------HGV---CGPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVV 134
           KGEIL         HGV    GP++   +S + M  Y+++ G+   F  D   K+     
Sbjct: 86  KGEILVGSVEKILKHGVFLKSGPIESIFLSEKSMSDYKYMGGENPMFMKDHS-KLERDTA 144

Query: 135 VQFLVTAVRWSGEGRNLK 152
           V+F V   RW    R  +
Sbjct: 145 VRFKVMGFRWMEAERQFQ 162


>gi|115462713|ref|NP_001054956.1| Os05g0224700 [Oryza sativa Japonica Group]
 gi|113578507|dbj|BAF16870.1| Os05g0224700 [Oryza sativa Japonica Group]
          Length = 782

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 77/149 (51%), Gaps = 15/149 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           +F +V++  ++ +    +    L + ++  +  LLE   N+KA KDHGY+++V++ K+I 
Sbjct: 2   VFLKVDMSWNVLISPSELSPKGLLL-RKAVIVSLLEEIANRKASKDHGYYIAVSELKAI- 59

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEIL---------HGV---CGPMKYALMSPRRM 108
            EG      G V FPV F C T  P KGEIL         HGV    GP++   +S + +
Sbjct: 60  SEGKVRELTGDVLFPVTFTCITQKPTKGEILVGSVDKILKHGVFLKSGPIESIFLSEKTL 119

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQF 137
             Y+++ G+   F ND   K+     V+F
Sbjct: 120 SDYKYIGGENPMFMNDHS-KLEKDTAVRF 147


>gi|168006688|ref|XP_001756041.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692971|gb|EDQ79326.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 190

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 52/93 (55%), Gaps = 3/93 (3%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFFEVEL R + V  + +  +EL   +R  + RLL+ +  ++  ++HG+ ++VT  + + 
Sbjct: 1  MFFEVELRRLVVVEPQELG-DELHT-RRAMIRRLLKDFDAERCSEEHGFHVTVTTLEDVS 58

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHG 93
            G   +G G V F V F C  F P++ EIL  
Sbjct: 59 P-GKIRSGTGSVIFWVTFKCIVFRPIRNEILEA 90


>gi|302783046|ref|XP_002973296.1| hypothetical protein SELMODRAFT_98932 [Selaginella moellendorffii]
 gi|302789680|ref|XP_002976608.1| hypothetical protein SELMODRAFT_105350 [Selaginella moellendorffii]
 gi|300155646|gb|EFJ22277.1| hypothetical protein SELMODRAFT_105350 [Selaginella moellendorffii]
 gi|300159049|gb|EFJ25670.1| hypothetical protein SELMODRAFT_98932 [Selaginella moellendorffii]
          Length = 175

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/160 (23%), Positives = 68/160 (42%), Gaps = 16/160 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           M++E+ L RD+ V  + +D +     +R+ +  LLE   + +  ++HGY+++ T    + 
Sbjct: 1   MWWEISLRRDVIVHPRHMDPSGGH--RRWIIQTLLEDMEDLQCSREHGYYVAPTTLSKV- 57

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEIL------------HGVCGPMKYALMSPRRM 108
             G  +   G VS+ V F C  F  +K EI+               CGP     +    M
Sbjct: 58  -SGGMIRESGGVSYKVDFKCMVFKLIKNEIVELEIENVKQSGAEASCGPYTAIFLHHSLM 116

Query: 109 PTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEG 148
             + + +     F N     I  G  V+  +   ++ G+G
Sbjct: 117 TGFVYSNENGPCFKNSDGVVISKGCAVRAKILGGQFMGDG 156


>gi|297796859|ref|XP_002866314.1| DNA-directed RNA polymerase II [Arabidopsis lyrata subsp. lyrata]
 gi|297312149|gb|EFH42573.1| DNA-directed RNA polymerase II [Arabidopsis lyrata subsp. lyrata]
          Length = 176

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN LR      + + +EG  + +    HG+ +++T  +SI 
Sbjct: 1  MFFHIVLERNMQLHPRFFGRN-LRENLVSKLMKDVEGTCSGR----HGFVVAITGIESIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|255558950|ref|XP_002520498.1| DNA-directed RNA polymerase II 19 kD polypeptide rpb7, putative
          [Ricinus communis]
 gi|223540340|gb|EEF41911.1| DNA-directed RNA polymerase II 19 kD polypeptide rpb7, putative
          [Ricinus communis]
          Length = 176

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN LR      + + +EG  + +    HG+ +++T  +S+ 
Sbjct: 1  MFFHIVLERNMQLHPRYFGRN-LRENLVSKLMKDVEGTCSGR----HGFVVAITGIESVG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|449466050|ref|XP_004150740.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
          [Cucumis sativus]
 gi|449518487|ref|XP_004166273.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
          [Cucumis sativus]
          Length = 176

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN LR      + + +EG  + +    HG+ +++T  ++I 
Sbjct: 1  MFFHIVLERNMQLHPRHFGRN-LRENLVSKLMKDVEGTCSGR----HGFVVAITGIENIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|351722267|ref|NP_001236982.1| DNA-directed RNA polymerase II subunit RPB7 [Glycine max]
 gi|356512183|ref|XP_003524800.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
          [Glycine max]
 gi|1173137|sp|P46279.1|RPB7_SOYBN RecName: Full=DNA-directed RNA polymerase II subunit RPB7;
          Short=RNA polymerase II subunit B7
 gi|170052|gb|AAA34005.1| RNA polymerase II [Glycine max]
 gi|255626843|gb|ACU13766.1| unknown [Glycine max]
          Length = 176

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN LR      + + +EG  + +    HG+ ++VT  ++I 
Sbjct: 1  MFFHIVLERNMQLHPRYFGRN-LRDNLVSKLMKDVEGTCSGR----HGFVVAVTGIENIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|357476433|ref|XP_003608502.1| DNA-directed RNA polymerase II subunit RPB7 [Medicago truncatula]
 gi|217071674|gb|ACJ84197.1| unknown [Medicago truncatula]
 gi|355509557|gb|AES90699.1| DNA-directed RNA polymerase II subunit RPB7 [Medicago truncatula]
 gi|388514911|gb|AFK45517.1| unknown [Medicago truncatula]
          Length = 176

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN LR      + + +EG  + +    HG+ ++VT  ++I 
Sbjct: 1  MFFHIVLERNMQLHPRYFGRN-LRDNLVSKLMKDVEGTCSGR----HGFVVAVTGIENIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GIIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|352681821|ref|YP_004892345.1| DNA-directed RNA polymerase subunit E' [Thermoproteus tenax Kra 1]
 gi|350274620|emb|CCC81265.1| DNA-directed RNA polymerase, subunit E' [Thermoproteus tenax Kra 1]
          Length = 189

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 56/123 (45%), Gaps = 21/123 (17%)

Query: 42  KACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVC------ 95
           K+ +D GY ++V  +K +++EG  + G G      IF   TF+P+ GE++HG+       
Sbjct: 38  KSFRDLGYVVAVLDAK-VNREGVIIFGDGATYHKAIFHILTFMPLDGEVVHGIVESAREV 96

Query: 96  ------GPM-----KYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRW 144
                 GP+     K  LM     P     +  KSF     + K+  G VV+  +T + +
Sbjct: 97  GVMVRIGPVLGFINKIHLMDE---PNVLFDASTKSFIGERTKKKLSIGDVVRARITGISY 153

Query: 145 SGE 147
             +
Sbjct: 154 VAQ 156


>gi|388502464|gb|AFK39298.1| unknown [Lotus japonicus]
          Length = 176

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN LR      + + +EG  + +    HG+ ++VT  +++ 
Sbjct: 1  MFFHIVLERNMQLHPRYFGRN-LRDNLVAKLMKDVEGTCSGR----HGFVVAVTGIENVG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|224056563|ref|XP_002298912.1| predicted protein [Populus trichocarpa]
 gi|222846170|gb|EEE83717.1| predicted protein [Populus trichocarpa]
          Length = 176

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN LR      + + +EG  + +    HG+ +++T  ++I 
Sbjct: 1  MFFHIVLERNMQLHPRFFGRN-LRENIVSKLMKDVEGTCSGR----HGFVVAITGIENIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|15237831|ref|NP_200726.1| DNA-directed RNA polymerase II subunit RPB7 [Arabidopsis
          thaliana]
 gi|585917|sp|P38421.1|RPB7_ARATH RecName: Full=DNA-directed RNA polymerase II subunit RPB7;
          Short=RNA polymerase II subunit B7
 gi|166854|gb|AAA32861.1| RNA polymerase II [Arabidopsis thaliana]
 gi|9759239|dbj|BAB09763.1| DNA-directed RNA polymerase II 19 kD polypeptide (RNA polymerase
          II subunit 5) [Arabidopsis thaliana]
 gi|26452908|dbj|BAC43532.1| putative RNA polymerase II [Arabidopsis thaliana]
 gi|28973525|gb|AAO64087.1| putative RNA polymerase II [Arabidopsis thaliana]
 gi|332009771|gb|AED97154.1| DNA-directed RNA polymerase II subunit RPB7 [Arabidopsis
          thaliana]
          Length = 176

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 50/94 (53%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   RN L+      + + +EG  + +    HG+ +++T   +I 
Sbjct: 1  MFFHIVLERNMQLHPRFFGRN-LKENLVSKLMKDVEGTCSGR----HGFVVAITGIDTIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|168056096|ref|XP_001780058.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668556|gb|EDQ55161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 177

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 26/140 (18%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKD--------HGYFLS 52
           MFF + L           DRN L++  R+  P L +  I  K  +D        HG+ ++
Sbjct: 1   MFFHIYL-----------DRN-LQLHPRHFGPHLRDKLI-AKLIQDVEGTCSGRHGFVVA 47

Query: 53  VTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVCGPMK----YALMSPRRM 108
           VT  ++I   G   +G G V+FPV + C  F P KGEIL  V   +     +A   P ++
Sbjct: 48  VTAVETI-GSGLIRDGTGFVTFPVKYQCVVFRPFKGEILESVVTMVNKMGFFAEAGPVQI 106

Query: 109 PTYRHVSGKKSFFSNDQQPK 128
               H+      F +D  P 
Sbjct: 107 FVSNHLIPDDMAFQSDDVPN 126


>gi|302754896|ref|XP_002960872.1| hypothetical protein SELMODRAFT_75793 [Selaginella
          moellendorffii]
 gi|302767440|ref|XP_002967140.1| hypothetical protein SELMODRAFT_87027 [Selaginella
          moellendorffii]
 gi|300165131|gb|EFJ31739.1| hypothetical protein SELMODRAFT_87027 [Selaginella
          moellendorffii]
 gi|300171811|gb|EFJ38411.1| hypothetical protein SELMODRAFT_75793 [Selaginella
          moellendorffii]
          Length = 177

 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 20/101 (19%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWIN------QKACKD-HGYFLSV 53
          MFF + L R+            L++  R+  P+L +  +       +  C   HG+ ++V
Sbjct: 1  MFFHIRLERN------------LQLHPRFFGPQLRDKLVEKLIHDVEGTCSGRHGFVVAV 48

Query: 54 TKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          T   +I K G   +G G V+FP+ + C  F P KGEIL GV
Sbjct: 49 TGVDTIGK-GLIRDGTGYVTFPIHYQCVVFRPFKGEILEGV 88


>gi|225434800|ref|XP_002282298.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7 [Vitis
          vinifera]
 gi|147841138|emb|CAN75204.1| hypothetical protein VITISV_042914 [Vitis vinifera]
 gi|297745997|emb|CBI16053.3| unnamed protein product [Vitis vinifera]
          Length = 176

 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 51/94 (54%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +   R+ LR      + + +EG  + +    HG+ +++T  +++ 
Sbjct: 1  MFFHIVLERNMQLHPRHFGRH-LRDNLVAKLVKDVEGTCSGR----HGFVVAITGIENVG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G   +G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIRDGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|126644185|ref|XP_001388228.1| DNA-directed RNA polymerase II [Cryptosporidium parvum Iowa II]
 gi|126117301|gb|EAZ51401.1| DNA-directed RNA polymerase II, putative [Cryptosporidium parvum
           Iowa II]
          Length = 178

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 65/147 (44%), Gaps = 26/147 (17%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYT--VPRLLEGWINQKACKDHGYFLSVTKSKS 58
           MFF VEL R+I+V          ++  RY   +  +L   +  +    +GY + V K   
Sbjct: 1   MFFFVELWRNISVKPS-------QLGPRYNEHIDDILRSQVEGQRAPPYGYVVCVIKI-I 52

Query: 59  IDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPR 106
           + + G   +  G++  PV +    + P+KGE++ GV            CGP+K   +S  
Sbjct: 53  LKQPGRVQDSTGLIIVPVKYQAIVYRPIKGEVVDGVVESVNELGVIVDCGPLKRVFVSQS 112

Query: 107 RMP---TYRH-VSGKKSFFSNDQQPKI 129
            +P    Y+  + G+ S    D + +I
Sbjct: 113 ALPENMVYKSGIDGQSSRIYIDTKTQI 139


>gi|326496519|dbj|BAJ94721.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 236

 Score = 41.2 bits (95), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MFF + L R++ +  +    + LR      + + +EG  + +    HG+ +++T  + I 
Sbjct: 60  MFFHIVLERNMQLHPRHFGPH-LRDKLVAKLMKDVEGTCSGR----HGFVVAITGVEDIG 114

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
           K G    G G V+FPV + C  F P KGEIL  V
Sbjct: 115 K-GLIREGTGFVTFPVKYQCVVFRPFKGEILEAV 147


>gi|111140005|gb|ABH06364.1| RNA polymerase II [Sorbus aucuparia]
          Length = 110

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 46 DHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
           HG+ +++T  ++I K G   +G G VSFP+ + C  F P KGEIL  V
Sbjct: 37 QHGFVVAITGIENIGK-GMIRDGTGFVSFPMKYQCVVFRPFKGEILEAV 84


>gi|413944603|gb|AFW77252.1| hypothetical protein ZEAMMB73_465202 [Zea mays]
          Length = 82

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          +F E E+  ++ +    +DR  L + ++  + RLLE   N++A K+HGY+++V + K+I 
Sbjct: 2  VFLEAEMSWNVLISPSQLDRKGLLL-RKAIIVRLLEDVTNRRASKEHGYYIAVNQLKAIS 60

Query: 61 K 61
          +
Sbjct: 61 E 61


>gi|357133761|ref|XP_003568492.1| PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like
          [Brachypodium distachyon]
          Length = 177

 Score = 40.8 bits (94), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +    + LR      + + +EG  + +    HG+ +++T  + I 
Sbjct: 1  MFFHIVLERNMQLHPRHFGPH-LRDKLVAKLMKDVEGTCSGR----HGFVVAITGVEDIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G    G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIREGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|326497319|dbj|BAK02244.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326505504|dbj|BAJ95423.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 177

 Score = 40.8 bits (94), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +    + LR      + + +EG  + +    HG+ +++T  + I 
Sbjct: 1  MFFHIVLERNMQLHPRHFGPH-LRDKLVAKLMKDVEGTCSGR----HGFVVAITGVEDIG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G    G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIREGTGFVTFPVKYQCVVFRPFKGEILEAV 88


>gi|226500588|ref|NP_001149597.1| DNA-directed RNA polymerase II 19 kDa polypeptide [Zea mays]
 gi|242056199|ref|XP_002457245.1| hypothetical protein SORBIDRAFT_03g003990 [Sorghum bicolor]
 gi|242090461|ref|XP_002441063.1| hypothetical protein SORBIDRAFT_09g019690 [Sorghum bicolor]
 gi|194708012|gb|ACF88090.1| unknown [Zea mays]
 gi|195628342|gb|ACG36001.1| DNA-directed RNA polymerase II 19 kDa polypeptide [Zea mays]
 gi|241929220|gb|EES02365.1| hypothetical protein SORBIDRAFT_03g003990 [Sorghum bicolor]
 gi|241946348|gb|EES19493.1| hypothetical protein SORBIDRAFT_09g019690 [Sorghum bicolor]
 gi|413945277|gb|AFW77926.1| DNA-directed RNA polymerase II polypeptide [Zea mays]
          Length = 177

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 22/102 (21%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKD--------HGYFLS 52
          MFF + L R+            +++  R+  P L +  ++ K  KD        HG+ ++
Sbjct: 1  MFFHIVLERN------------MQLHPRHFGPHLRDKLVS-KLIKDVEGTCSGRHGFVVA 47

Query: 53 VTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          +T  + I K G    G G V+FPV + C  F P KGEIL  V
Sbjct: 48 ITGVEDIGK-GLIREGTGYVTFPVKYQCVVFRPFKGEILEAV 88


>gi|432330728|ref|YP_007248871.1| hypothetical protein Metfor_1324 [Methanoregula formicicum SMSP]
 gi|432137437|gb|AGB02364.1| hypothetical protein Metfor_1324 [Methanoregula formicicum SMSP]
          Length = 140

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 15/117 (12%)

Query: 6   ELLRDIAVPAKSVDRNELRV------PQRYTVPRLLEGWINQKACKD---HGYF--LSVT 54
           EL+  + V AKS DR+ +RV       + Y  P LL+     +  K     G +  + VT
Sbjct: 24  ELIHRLPVTAKSADRDGIRVEDGRVIDRAYNGPVLLDAIARNQTIKTTPASGAYKGVPVT 83

Query: 55  KSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVCG----PMKYALMSPRR 107
            +   D+EG  +   G+V    IF   T +  +  IL  VCG    P+    +S RR
Sbjct: 84  VTPIRDREGNAIGAIGIVDITGIFDLATLMEHQSAILKQVCGKDPCPLDSEKISSRR 140


>gi|115463767|ref|NP_001055483.1| Os05g0400700 [Oryza sativa Japonica Group]
 gi|50878367|gb|AAT85142.1| putative RNA polymerase II subunit 5 [Oryza sativa Japonica
          Group]
 gi|113579034|dbj|BAF17397.1| Os05g0400700 [Oryza sativa Japonica Group]
 gi|125552268|gb|EAY97977.1| hypothetical protein OsI_19896 [Oryza sativa Indica Group]
 gi|215704281|dbj|BAG93121.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218196761|gb|EEC79188.1| hypothetical protein OsI_19893 [Oryza sativa Indica Group]
 gi|222631528|gb|EEE63660.1| hypothetical protein OsJ_18478 [Oryza sativa Japonica Group]
          Length = 177

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MFF + L R++ +  +    + LR      + + +EG  + +    HG+ +++T  + + 
Sbjct: 1  MFFHIVLERNMQLHPRHFGPH-LRDKLVSKLIKDVEGTCSGR----HGFVVAITGVEDVG 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          K G    G G V+FPV + C  F P KGEIL  V
Sbjct: 56 K-GLIREGTGYVTFPVKYQCVVFRPFKGEILEAV 88


>gi|327310158|ref|YP_004337055.1| DNA-directed RNA polymerase subunit E' [Thermoproteus uzoniensis
           768-20]
 gi|326946637|gb|AEA11743.1| DNA-directed RNA polymerase subunit E' [Thermoproteus uzoniensis
           768-20]
          Length = 189

 Score = 39.3 bits (90), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 31/127 (24%), Positives = 57/127 (44%), Gaps = 21/127 (16%)

Query: 42  KACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVC------ 95
           K+ +D G+ ++V  +K + +EG  + G G      +F   T++P+ GE+++G+       
Sbjct: 38  KSFRDLGFVVAVLGAK-VAREGVILFGDGATYHRAVFDLLTYVPLDGEVVYGIVESAREV 96

Query: 96  ------GPM-----KYALMSPRRMPTYRHVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRW 144
                 GP+     K  LM     P     +  KSF     + K+G G +V+  +T V +
Sbjct: 97  GVMVRIGPVLGFINKIHLMEE---PNILFDASTKSFIGERSKRKVGVGDMVRARITGVSY 153

Query: 145 SGEGRNL 151
             +   L
Sbjct: 154 VAQKEGL 160


>gi|255073033|ref|XP_002500191.1| predicted protein [Micromonas sp. RCC299]
 gi|226515453|gb|ACO61449.1| predicted protein [Micromonas sp. RCC299]
          Length = 185

 Score = 38.9 bits (89), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 43/94 (45%), Gaps = 5/94 (5%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MF+ ++L R+I +  +   RN      R T+ + L+  +       HGY + VT   +I 
Sbjct: 1  MFWHIKLERNIVLEPRFFGRN-----MRDTLMQRLKHEVEGSCTGKHGYIVMVTDFTNIS 55

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
          +   T +G     F V F    F P KG++L  V
Sbjct: 56 EGMVTDDGTARAKFRVEFDAIAFRPFKGQVLDAV 89


>gi|209882951|ref|XP_002142910.1| DNA-directed RNA polymerase II subunit RPB7 [Cryptosporidium muris
           RN66]
 gi|209558516|gb|EEA08561.1| DNA-directed RNA polymerase II subunit RPB7, putative
           [Cryptosporidium muris RN66]
          Length = 176

 Score = 38.9 bits (89), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 29/121 (23%), Positives = 53/121 (43%), Gaps = 18/121 (14%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MFF VEL ++I     S+  N+L       +  +L   +  +     GY + V +   + 
Sbjct: 1   MFFFVELWKNI-----SLSPNQLGPRYDEHIDDILRSQVEGQWITSFGYVVCVVRIL-LR 54

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
           + G   +G G++  P+ +    + P+KGE++ G             CGP+K   +S   +
Sbjct: 55  QPGRIQDGTGLIIVPIKYQAIVYRPMKGEVIDGTVESVNELGIIVNCGPLKRVFVSQSAL 114

Query: 109 P 109
           P
Sbjct: 115 P 115


>gi|159467891|ref|XP_001692125.1| DNA-directed RNA polymerase II, 19 kDa polypeptide [Chlamydomonas
           reinhardtii]
 gi|158278852|gb|EDP04615.1| DNA-directed RNA polymerase II, 19 kDa polypeptide [Chlamydomonas
           reinhardtii]
          Length = 176

 Score = 38.5 bits (88), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 37/158 (23%), Positives = 66/158 (41%), Gaps = 18/158 (11%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MF+ + L + + +  K    ++LR   R  +    EG    K    +GY ++VTK   I 
Sbjct: 1   MFYFLTLSKSLDIHPKHFG-SKLREVIREKLIAETEGTCTGK----YGYVVAVTKVDDIG 55

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVCGPMK----YALMSPRRMPTYRHV-- 114
           +     +  G  +F V + C    P KGE+L  V   +     +A   P ++   +H+  
Sbjct: 56  RGRIRQDQSGYATFEVSYGCIVCRPYKGEVLDAVVTSVNKMGFFAQAGPLQLFVTQHLIP 115

Query: 115 -------SGKKSFFSNDQQPKIGNGVVVQFLVTAVRWS 145
                  S   S+ S DQ  +I  G  V+  +  +++ 
Sbjct: 116 DEFEFDTSDDNSWISMDQTTRIQGGTHVRIRIVGIKYD 153


>gi|145351559|ref|XP_001420140.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580373|gb|ABO98433.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 179

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/173 (20%), Positives = 66/173 (38%), Gaps = 22/173 (12%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
           MFF V L R+I +  +           +  +   L   +       +G+ + VT+   + 
Sbjct: 1   MFFHVNLERNITLEPR-----HFGARMKAVLEEKLRHEVEGTCSGRYGFIVMVTRLAEVS 55

Query: 61  KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV------------CGPMKYALMSPRRM 108
           +   T +G     F + + C  F P KGE++  V             GP++  + +    
Sbjct: 56  EGVVTDDGTARAKFHIKYDCVVFRPFKGEVMDAVVTQVNKFGFFAEAGPLQLFVSNALIT 115

Query: 109 PTYR-HVSGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSGEG----RNLKREYL 156
              +   SG+  + S+DQQ +I     V+  +  +R           +K +YL
Sbjct: 116 EDMQFDSSGENCYVSDDQQIRIQRDASVRVRIVGMRIDANDIFCIATIKDDYL 168


>gi|395646496|ref|ZP_10434356.1| RNA 3'-phosphate cyclase [Methanofollis liminatans DSM 4140]
 gi|395443236|gb|EJG07993.1| RNA 3'-phosphate cyclase [Methanofollis liminatans DSM 4140]
          Length = 321

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/149 (22%), Positives = 62/149 (41%), Gaps = 18/149 (12%)

Query: 30  TVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVI----------FM 79
           ++P +L+ W+       HG  +++T    ++K  PT++    V  P++            
Sbjct: 94  SIPLVLQAWLPVALV--HGGSITLTGGTEVEKS-PTIDYFMQVFLPLLRAHGAEVRVEVR 150

Query: 80  CRTFLPVKGEILHGVCGPMKYALMSPRRMPTYRHVSGKKSFFSN---DQQPKIGNGVVVQ 136
            R + P  G ++H   GP   A + P R+P  + +        +   D+Q    + V+  
Sbjct: 151 ARGYYPAGGGVVHVTAGPSALAPIDPARLPNEQGIVSCTQNLPDHVADRQASAASAVLPD 210

Query: 137 FLVTAVRWSGEGRNLKREYLVFGRAKGES 165
           F VT  R +  GR+       +  AKG S
Sbjct: 211 FPVTIDRRT--GRSTGTSVTAWAGAKGAS 237


>gi|171185361|ref|YP_001794280.1| DNA-directed RNA polymerase subunit E' [Pyrobaculum neutrophilum
           V24Sta]
 gi|170934573|gb|ACB39834.1| RNA polymerase Rpb7 domain protein [Pyrobaculum neutrophilum
           V24Sta]
          Length = 191

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/126 (23%), Positives = 56/126 (44%), Gaps = 17/126 (13%)

Query: 42  KACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVCGPMKYA 101
           +  ++ GY ++V  +K + +EG  V G G      +F    ++P+ GEI+ GV   ++  
Sbjct: 38  RVLREVGYVVAVLTAK-VSREGKIVFGDGGTYHKALFTMLAYMPLDGEIVEGVVENVREM 96

Query: 102 LMSPRRMPTYRHV--------------SGKKSFFSNDQQPKIGNGVVVQFLVTAVRWSG- 146
            M  R  P    +              +  KS+     + ++G G VV+  +T V ++  
Sbjct: 97  GMLVRIGPVLGFINKIHVMDEPNILFDASTKSYIGERTKRRVGVGDVVRARITGVSFTTP 156

Query: 147 -EGRNL 151
            EG +L
Sbjct: 157 REGTDL 162


>gi|326426973|gb|EGD72543.1| hypothetical protein PTSG_00566 [Salpingoeca sp. ATCC 50818]
          Length = 171

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 12/113 (10%)

Query: 1   MFFEVELLRDIAVPAKSVDRNELRVPQ-RYTVPRLLEGWINQKACKDHGYFLSVTKSKSI 59
           MFF++ L  D+ +P + +       PQ   ++ R L   +  K    HGY +SV +   I
Sbjct: 1   MFFKLTLEHDVLLPPRFLG------PQLAESIRRKLYEDVESKCIGKHGYIVSVIEIVKI 54

Query: 60  DKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVCGPMK----YALMSPRRM 108
             EG  +   G   FPV +    F PVK E++  +   +     +A + P R+
Sbjct: 55  -GEGEILVARGETLFPVTYRALVFRPVKNEVVDAIVSTVTKMGIFAEVGPLRV 106


>gi|374635070|ref|ZP_09706675.1| DNA-directed RNA polymerase [Methanotorris formicicus Mc-S-70]
 gi|373563472|gb|EHP89666.1| DNA-directed RNA polymerase [Methanotorris formicicus Mc-S-70]
          Length = 187

 Score = 36.6 bits (83), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 33/133 (24%), Positives = 56/133 (42%), Gaps = 16/133 (12%)

Query: 26  PQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLP 85
           P + T+ ++L         KD G+ LS+   K I  EG  ++G G    PV+F    ++P
Sbjct: 21  PLKETITKILREKYEGILDKDIGFILSIVDLKEI-GEGKVIHGDGAAYHPVVFDTLIYVP 79

Query: 86  -----VKGEILHGV-------CGPMKYALMSPRRMPTYRHVSGKKSFFSNDQQPK---IG 130
                V+GEI+  V        GP+   +   + M  Y     K+      +  K   IG
Sbjct: 80  ELHEVVEGEIVDIVEFGAFVRLGPLDGLIHISQIMDDYVSYDPKREAIIGRETGKVLEIG 139

Query: 131 NGVVVQFLVTAVR 143
           + V  + +  ++R
Sbjct: 140 DKVRARIVAISLR 152


>gi|15668573|ref|NP_247371.1| DNA-directed RNA polymerase subunit E' [Methanocaldococcus
          jannaschii DSM 2661]
 gi|2500642|sp|Q57840.1|RPOE1_METJA RecName: Full=DNA-directed RNA polymerase subunit E'
 gi|17943304|pdb|1GO3|E Chain E, Structure Of An Archeal Homolog Of The Eukaryotic Rna
          Polymerase Ii Rpb4RPB7 COMPLEX
 gi|17943306|pdb|1GO3|M Chain M, Structure Of An Archeal Homolog Of The Eukaryotic Rna
          Polymerase Ii Rpb4RPB7 COMPLEX
 gi|1591102|gb|AAB98387.1| DNA-directed RNA polymerase, subunit E' (rpoE1)
          [Methanocaldococcus jannaschii DSM 2661]
          Length = 187

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 28 RYTVPRLLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVK 87
          + TV ++L      +  KD G+ LS+   K I  EG  V+G G    PV+F    ++P  
Sbjct: 23 KETVKKILMEKYEGRLDKDVGFVLSIVDVKDIG-EGKVVHGDGSAYHPVVFETLVYIPEM 81

Query: 88 GEILHG 93
           E++ G
Sbjct: 82 YELIEG 87


>gi|145592427|ref|YP_001154429.1| DNA-directed RNA polymerase subunit E' [Pyrobaculum arsenaticum DSM
           13514]
 gi|379005594|ref|YP_005261266.1| DNA-directed RNA polymerase (rpoE), archaeal and eukaryotic form
           [Pyrobaculum oguniense TE7]
 gi|145284195|gb|ABP51777.1| DNA-directed RNA polymerase, subunit E' [Pyrobaculum arsenaticum
           DSM 13514]
 gi|375161047|gb|AFA40659.1| DNA-directed RNA polymerase (rpoE), archaeal and eukaryotic form
           [Pyrobaculum oguniense TE7]
          Length = 191

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/118 (22%), Positives = 51/118 (43%), Gaps = 15/118 (12%)

Query: 42  KACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGVCGPMKYA 101
           +  ++ GY ++V  +K + +EG  V G G      +F    F+P+ GEI+ GV   ++  
Sbjct: 38  RVLREVGYVVAVLNTK-VSREGKIVFGDGGTYHKAVFTMLAFMPLDGEIVEGVVENVREM 96

Query: 102 LMSPRRMPTYRHV--------------SGKKSFFSNDQQPKIGNGVVVQFLVTAVRWS 145
            M  R  P    +              +  KS+     + K+  G +V+  +T V ++
Sbjct: 97  GMLVRIGPVLGFINKIHVMDEPNIFFDASTKSYIGERTKRKVTVGDIVRARITGVSFT 154


>gi|313230246|emb|CBY07950.1| unnamed protein product [Oikopleura dioica]
          Length = 172

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 24/94 (25%), Positives = 41/94 (43%), Gaps = 6/94 (6%)

Query: 1  MFFEVELLRDIAVPAKSVDRNELRVPQRYTVPRLLEGWINQKACKDHGYFLSVTKSKSID 60
          MF+ + L  ++ +       N        T+ + L   +      +HG+ + VT   SI 
Sbjct: 1  MFYHINLEHELLIHPMYFGENLTD-----TIRQKLYSEVEGSCTGEHGFVIGVTNIHSI- 54

Query: 61 KEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHGV 94
           +G  + G G   FP+ +    F P KGE++ GV
Sbjct: 55 GDGDILAGRGFCLFPIKYAAIVFRPFKGEVVDGV 88


>gi|384247619|gb|EIE21105.1| RNA polymerase II subunit B7 [Coccomyxa subellipsoidea C-169]
          Length = 176

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 39/162 (24%), Positives = 64/162 (39%), Gaps = 28/162 (17%)

Query: 28  RYTVPRLLEGWINQKACKD--------HGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFM 79
           R+  PRL E  + QK   +        +G+ + VT    + K G    G G   F V + 
Sbjct: 16  RFFGPRLRE-VLEQKLISEVEGTCSGKYGFIICVTGMGHVGK-GSIREGSGTALFKVQYS 73

Query: 80  CRTFLPVKGEILHGVCGPMK----YALMSPRRMPTYRHV---------SGKKSFFSNDQQ 126
           C    P KGE+L  V   +     +A   P ++    H+         +G+ +F S D+ 
Sbjct: 74  CVVLRPFKGEVLDCVVSSVNKVGFFADAGPLQLFVSNHLIPEDFEFNATGEPAFVSTDEA 133

Query: 127 PKIGNGVVVQFLVTAVRWSGEG----RNLKREYL-VFGRAKG 163
            ++  G  V+  +   R           +K +YL V  +A G
Sbjct: 134 VRVQAGAEVRLRIVGTRMDASEIFCVGTIKEDYLGVISQAAG 175


>gi|374633213|ref|ZP_09705580.1| DNA-directed RNA polymerase subunit E' [Metallosphaera
           yellowstonensis MK1]
 gi|373524697|gb|EHP69574.1| DNA-directed RNA polymerase subunit E' [Metallosphaera
           yellowstonensis MK1]
          Length = 176

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 7/86 (8%)

Query: 34  LLEGWINQKACKDHGYFLSVTKSKSIDKEGPTVNGPGVVSFPVIFMCRTFLPVKGEILHG 93
           +L+    ++  KD G  L+VTK+K + +EG  V G G     V F   TF P+  E++ G
Sbjct: 29  ILKNEYQERLFKDLGLVLTVTKAK-VSEEGMIVFGDGATYHEVEFELLTFSPIIQEVIEG 87

Query: 94  VCGPMK----YALMSPRRMPTYRHVS 115
               +     Y  M P  M    HVS
Sbjct: 88  DITQVDNYGVYVNMGP--MDGLVHVS 111


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.140    0.430 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,699,173,083
Number of Sequences: 23463169
Number of extensions: 105445639
Number of successful extensions: 159276
Number of sequences better than 100.0: 72
Number of HSP's better than 100.0 without gapping: 26
Number of HSP's successfully gapped in prelim test: 46
Number of HSP's that attempted gapping in prelim test: 159165
Number of HSP's gapped (non-prelim): 72
length of query: 166
length of database: 8,064,228,071
effective HSP length: 127
effective length of query: 39
effective length of database: 9,379,372,904
effective search space: 365795543256
effective search space used: 365795543256
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 71 (32.0 bits)