BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 032332
         (143 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255540207|ref|XP_002511168.1| conserved hypothetical protein [Ricinus communis]
 gi|223550283|gb|EEF51770.1| conserved hypothetical protein [Ricinus communis]
          Length = 196

 Score =  165 bits (418), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 96/162 (59%), Positives = 107/162 (66%), Gaps = 24/162 (14%)

Query: 1   MNDELCNPTLWS------------TTTTTTTTSTSNGVVHADSTSDAN-NHHLDSSGGSG 47
           MNDELCNPTLWS            +    + + + NGVV+ DS SDA   HH D +G   
Sbjct: 40  MNDELCNPTLWSTTSSSNNTTSPISAPANSNSLSQNGVVNGDSVSDAVVAHHFDGTGTGV 99

Query: 48  NS------ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDK 101
                   ALDEAR RYK SVAALR VL AIPNSHKA SFE  SS     S +DE +++K
Sbjct: 100 VGGGTGNGALDEARFRYKNSVAALREVLAAIPNSHKAISFETSSS-----SPADEADVEK 154

Query: 102 LEERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           LEE+AS LRKEL NKN YLK LIDQLRDLITDISTWQSPCSV
Sbjct: 155 LEEQASNLRKELFNKNAYLKLLIDQLRDLITDISTWQSPCSV 196


>gi|224133984|ref|XP_002321708.1| predicted protein [Populus trichocarpa]
 gi|222868704|gb|EEF05835.1| predicted protein [Populus trichocarpa]
          Length = 177

 Score =  154 bits (390), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 98/158 (62%), Positives = 107/158 (67%), Gaps = 22/158 (13%)

Query: 1   MNDELCNPTLWSTTTTT--------TTTSTSNGVVHADSTSDANNHHLDSS-------GG 45
           MNDELCNPTLWSTT+T         T+ +  NG V      DAN    DS+       GG
Sbjct: 27  MNDELCNPTLWSTTSTATTAVTSPITSKNAQNGAVVITVNGDANAAVSDSTHELSGGGGG 86

Query: 46  SGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEER 105
            GN ALDEAR RYK SVAALR VL AIPNSHKAK FE  SSP      +DE EI KLEER
Sbjct: 87  VGNGALDEARFRYKNSVAALRDVLAAIPNSHKAKPFE-TSSP------ADEAEIKKLEER 139

Query: 106 ASLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           AS LRKELA KN Y+K LIDQLR++ITDISTWQ+PCSV
Sbjct: 140 ASSLRKELAKKNTYVKLLIDQLREIITDISTWQTPCSV 177


>gi|225456359|ref|XP_002283991.1| PREDICTED: uncharacterized protein LOC100264705 [Vitis vinifera]
 gi|147789866|emb|CAN73869.1| hypothetical protein VITISV_001275 [Vitis vinifera]
 gi|297734439|emb|CBI15686.3| unnamed protein product [Vitis vinifera]
          Length = 174

 Score =  153 bits (386), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 93/145 (64%), Positives = 114/145 (78%), Gaps = 6/145 (4%)

Query: 1   MNDELCNPTLWSTTT--TTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHRY 58
           MNDELCNPTLWST     T  +S S+  ++ D++SD + HH++ +GG    ALD+AR RY
Sbjct: 34  MNDELCNPTLWSTPPPPATAASSHSSASLNGDASSDFS-HHMEIAGGG---ALDDARLRY 89

Query: 59  KTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNK 118
           K+SVA+LRAVL AI +S KAK+FE  S+   S S++D+ EI+KLEERAS LRKELA+KNK
Sbjct: 90  KSSVASLRAVLNAIQSSQKAKAFEASSAASGSASQADQAEIEKLEERASNLRKELADKNK 149

Query: 119 YLKRLIDQLRDLITDISTWQSPCSV 143
           YLK LIDQLRDLITDISTWQSPCSV
Sbjct: 150 YLKLLIDQLRDLITDISTWQSPCSV 174


>gi|18424713|ref|NP_568973.1| uncharacterized protein [Arabidopsis thaliana]
 gi|75171739|sp|Q9FMV4.1|MED30_ARATH RecName: Full=Mediator of RNA polymerase II transcription subunit
           30
 gi|9758289|dbj|BAB08813.1| unnamed protein product [Arabidopsis thaliana]
 gi|190886513|gb|ACE95179.1| At1g17730 [Arabidopsis thaliana]
 gi|332010374|gb|AED97757.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 189

 Score =  147 bits (371), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 88/157 (56%), Positives = 107/157 (68%), Gaps = 20/157 (12%)

Query: 1   MNDELCNPTLWSTTTT-----TTTTSTSNGVVHADSTS-DANNHHLDSSGGSG-----NS 49
           MNDELCNP+LWST+ T     TTT S  + +V AD+ + D  +HH +S+GG G     NS
Sbjct: 39  MNDELCNPSLWSTSATPSSAATTTGSNGSALVSADAAAIDGTSHHSESAGGGGGGGSGNS 98

Query: 50  ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEM---VSSPVDSVSRSDEVEIDKLEERA 106
            LDEA  RYK SV +LRAVL AIPNS KAK+ EM   + SP        E EI+KLEE+A
Sbjct: 99  VLDEASLRYKNSVTSLRAVLAAIPNSQKAKASEMQNGLGSP------ESEDEIEKLEEQA 152

Query: 107 SLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
             LR E+A KN ++K LID+LR+LI DISTWQSPCSV
Sbjct: 153 LSLRMEIAKKNVHVKELIDKLRELIADISTWQSPCSV 189


>gi|449440544|ref|XP_004138044.1| PREDICTED: mediator of RNA polymerase II transcription subunit
           30-like [Cucumis sativus]
 gi|449532651|ref|XP_004173294.1| PREDICTED: mediator of RNA polymerase II transcription subunit
           30-like [Cucumis sativus]
          Length = 198

 Score =  147 bits (370), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 86/161 (53%), Positives = 111/161 (68%), Gaps = 20/161 (12%)

Query: 1   MNDELCNPTLWS---------------TTTTTTTTSTSNGVVHADSTSDANNHHLDSSGG 45
           MNDELCNPTLWS                + +  ++ ++NGVV+ D++S++ +HH DSSGG
Sbjct: 40  MNDELCNPTLWSTTSSTATTATSGLTIASPSGPSSHSANGVVNGDASSESTSHHTDSSGG 99

Query: 46  SGNS---ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKL 102
            G+    AL+EAR RYK SV ALRA+L AIPNS KAK++E  +S     S +D+ EIDKL
Sbjct: 100 GGSGSGGALEEARVRYKNSVLALRAILAAIPNSQKAKAYESGTS--GHASPADQSEIDKL 157

Query: 103 EERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           E+  S LRKEL  KN ++K LIDQLRDL+ D+STWQSPCSV
Sbjct: 158 EQHLSNLRKELVVKNLHIKLLIDQLRDLVADVSTWQSPCSV 198


>gi|224119684|ref|XP_002318134.1| predicted protein [Populus trichocarpa]
 gi|118486707|gb|ABK95190.1| unknown [Populus trichocarpa]
 gi|222858807|gb|EEE96354.1| predicted protein [Populus trichocarpa]
          Length = 185

 Score =  146 bits (368), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 96/149 (64%), Positives = 105/149 (70%), Gaps = 13/149 (8%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSS------GGSGNSALDEA 54
           MNDELCNPTLWSTT+T+T+T TS          DAN    DS+      GG GN ALDEA
Sbjct: 44  MNDELCNPTLWSTTSTSTSTVTSPITSSNAQNGDANAAVSDSTLELNGGGGVGNGALDEA 103

Query: 55  RHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELA 114
           R RYK SVAALR VL AIPNSHKAK FE  SSP      +D+ EI+KLEERAS LRKELA
Sbjct: 104 RFRYKNSVAALRDVLAAIPNSHKAKPFE-TSSP------ADQAEIEKLEERASNLRKELA 156

Query: 115 NKNKYLKRLIDQLRDLITDISTWQSPCSV 143
            KN Y+K LIDQLR+LI DISTWQ P SV
Sbjct: 157 KKNTYVKLLIDQLRELIADISTWQGPYSV 185


>gi|356507364|ref|XP_003522437.1| PREDICTED: uncharacterized protein LOC100788627 [Glycine max]
          Length = 164

 Score =  144 bits (364), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 84/144 (58%), Positives = 100/144 (69%), Gaps = 14/144 (9%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDEA--RHRY 58
           MNDELCNP LWST+ +T+         +AD+TSD +NHH D+S  S  +       R RY
Sbjct: 32  MNDELCNPVLWSTSPSTSP--------NADATSDNSNHHADASAASAAAGGALEEARARY 83

Query: 59  KTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNK 118
           K +V ALR +LTAIP S K K+F+  S+     S +DE EIDKLEERAS LR+ELANKN 
Sbjct: 84  KKAVTALRTILTAIPTSQKEKTFDAGSA----ASPADEAEIDKLEERASSLRRELANKNL 139

Query: 119 YLKRLIDQLRDLITDISTWQSPCS 142
           +LK LIDQLRDLITDISTWQSP S
Sbjct: 140 HLKTLIDQLRDLITDISTWQSPFS 163


>gi|297793927|ref|XP_002864848.1| hypothetical protein ARALYDRAFT_496535 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310683|gb|EFH41107.1| hypothetical protein ARALYDRAFT_496535 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 178

 Score =  144 bits (362), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/157 (54%), Positives = 106/157 (67%), Gaps = 20/157 (12%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNG-----VVHADSTS-DANNHHLDSSGGSG-----NS 49
           MNDELCNP+LWST+ T ++ +T+ G     +V  D+ + D  +HH +S GG G     NS
Sbjct: 28  MNDELCNPSLWSTSATPSSAATATGSNGSALVSVDAAAIDGTSHHSESGGGGGGGGSGNS 87

Query: 50  ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEM---VSSPVDSVSRSDEVEIDKLEERA 106
            LDEA  RYK SV +LRAVL AIPNSHKAK+ EM     SP        E EI+KLEE+A
Sbjct: 88  VLDEASLRYKNSVTSLRAVLAAIPNSHKAKASEMENGFGSP------ESEDEIEKLEEQA 141

Query: 107 SLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
             LR E+A KN ++K LID+LR+LI DISTWQSPCSV
Sbjct: 142 LSLRMEIAKKNVHVKELIDKLRELIADISTWQSPCSV 178


>gi|51969384|dbj|BAD43384.1| unknown protein [Arabidopsis thaliana]
          Length = 184

 Score =  142 bits (359), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 88/157 (56%), Positives = 106/157 (67%), Gaps = 20/157 (12%)

Query: 1   MNDELCNPTLWSTTTT-----TTTTSTSNGVVHADSTS-DANNHHLDSSGGSG-----NS 49
           MNDELCNP+LWST+ T     TTT S  + +V AD+ + D  +HH +S+GG G     NS
Sbjct: 34  MNDELCNPSLWSTSATPSSAATTTGSNGSALVSADAAAIDGTSHHSESAGGGGGEGSGNS 93

Query: 50  ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEM---VSSPVDSVSRSDEVEIDKLEERA 106
            LDEA  RYK SV +LRAVL AIPNS KAK+ EM   + SP        E EI+KLEE+A
Sbjct: 94  VLDEASLRYKNSVTSLRAVLAAIPNSQKAKASEMQNGLGSP------ESEDEIEKLEEQA 147

Query: 107 SLLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
             LR E+A KN ++K LID LR+LI DISTWQSPCSV
Sbjct: 148 LSLRMEIAKKNVHVKELIDILRELIADISTWQSPCSV 184


>gi|356516539|ref|XP_003526951.1| PREDICTED: uncharacterized protein LOC100816835 [Glycine max]
          Length = 168

 Score =  137 bits (346), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/144 (56%), Positives = 103/144 (71%), Gaps = 12/144 (8%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSA--LDEARHRY 58
           MNDELCNP LWST+ + +++       +AD+TS+ +NHH D++  +      L++AR RY
Sbjct: 34  MNDELCNPVLWSTSPSASSSP------NADATSENSNHHADATTAAAGGGGALEQARSRY 87

Query: 59  KTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNK 118
           K +VAALR +LTAIPNS KA +     +   + S +DE EIDKLEERAS LR+ELANKN 
Sbjct: 88  KKAVAALRTILTAIPNSQKANA----FNASSAASPADEAEIDKLEERASSLRRELANKNL 143

Query: 119 YLKRLIDQLRDLITDISTWQSPCS 142
           +LK LIDQLRDLITDISTW SP S
Sbjct: 144 HLKTLIDQLRDLITDISTWHSPFS 167


>gi|21553739|gb|AAM62832.1| unknown [Arabidopsis thaliana]
          Length = 178

 Score =  129 bits (323), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 85/154 (55%), Positives = 105/154 (68%), Gaps = 14/154 (9%)

Query: 1   MNDELCNPTLWSTTTT-----TTTTSTSNGVVHADSTS-DANNHHLDSSGGSG-----NS 49
           MNDELCNP+LWST+ T     TTT S  + +V AD+ + D  +HH +S+GG G     NS
Sbjct: 28  MNDELCNPSLWSTSATPSSAATTTGSNGSALVSADAAAIDGTSHHSESAGGGGGGGSGNS 87

Query: 50  ALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLL 109
            LDEA  RYK SV +LRAVL AIPNS KAK+ EM  + + S  R DE+E  +  E A  L
Sbjct: 88  VLDEASLRYKNSVTSLRAVLAAIPNSQKAKASEM-QNGLGSPEREDEIEKLE--EEALSL 144

Query: 110 RKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           R E+A KN ++K LID+LR+LI DISTWQSPCSV
Sbjct: 145 RMEIAKKNVHVKELIDKLRELIADISTWQSPCSV 178


>gi|326520639|dbj|BAJ88360.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 177

 Score =  124 bits (311), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/156 (49%), Positives = 95/156 (60%), Gaps = 25/156 (16%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVH-------------ADSTSDANNHHLDSSGGSG 47
           MNDELCNP LWS++ T T T+ S    H             ADS +DA    +  + G  
Sbjct: 34  MNDELCNPALWSSSATATATAASAASQHPHHQNAAPPPPHSADSDADA----MGGAAGGS 89

Query: 48  NSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERAS 107
             +LDEARHRYK +VAALRA + A+  S           P  + S+ D+ EI++LEE AS
Sbjct: 90  GGSLDEARHRYKIAVAALRASIAAVSPS--------TQEPGPTESKGDQAEIERLEEHAS 141

Query: 108 LLRKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
            LRKE+ +KNK LK LIDQLRDLI+DIS WQSPCSV
Sbjct: 142 SLRKEIGSKNKQLKLLIDQLRDLISDISMWQSPCSV 177


>gi|388521333|gb|AFK48728.1| unknown [Medicago truncatula]
          Length = 179

 Score =  121 bits (304), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 81/147 (55%), Positives = 99/147 (67%), Gaps = 13/147 (8%)

Query: 1   MNDELCNPTLWSTTTTTTTTS---TSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHR 57
           MNDELCNP  WST+ ++ T+    +SNG   A+S +   +    +  G    AL EAR R
Sbjct: 40  MNDELCNPVWWSTSPSSATSPNAPSSNG--DANSENSGQHADGAAPSGGAGGALGEARLR 97

Query: 58  YKTSVAALRAVLTAIPNSHKAKSFEMVS--SPVDSVSRSDEVEIDKLEERASLLRKELAN 115
           YK +VA LR+VL AIPNS KA +F+  S  SP+D      E EI+KLEE+AS LRKEL N
Sbjct: 98  YKDAVAGLRSVLAAIPNSQKANTFDGGSADSPMD------EAEIEKLEEQASSLRKELGN 151

Query: 116 KNKYLKRLIDQLRDLITDISTWQSPCS 142
           KN +LK LIDQLR+LITDISTWQSP S
Sbjct: 152 KNLHLKILIDQLRELITDISTWQSPFS 178


>gi|226533480|ref|NP_001145600.1| uncharacterized protein LOC100279078 [Zea mays]
 gi|195658667|gb|ACG48801.1| hypothetical protein [Zea mays]
 gi|413918133|gb|AFW58065.1| hypothetical protein ZEAMMB73_297063 [Zea mays]
          Length = 172

 Score =  119 bits (299), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/150 (48%), Positives = 96/150 (64%), Gaps = 18/150 (12%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDS----SGG--SGNSALDEA 54
           MNDELCN  LWS+++ + T +   G  H  S +    H +DS    +GG  S   +LDEA
Sbjct: 34  MNDELCNAGLWSSSSVSATAAA--GYQHHHSATLPTPHSVDSDAADAGGAPSPGGSLDEA 91

Query: 55  RHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSV-SRSDEVEIDKLEERASLLRKEL 113
           RHRYK++VAALRA ++A+ +          +  + S  S +D  EI++LEE AS LRKE+
Sbjct: 92  RHRYKSAVAALRASISAVSS---------CAQDIGSTESEADHAEIERLEEHASALRKEI 142

Query: 114 ANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
            +KNK +K LIDQLRDLI DI+ WQSPCSV
Sbjct: 143 ESKNKQVKLLIDQLRDLIADIAMWQSPCSV 172


>gi|357163076|ref|XP_003579617.1| PREDICTED: uncharacterized protein LOC100837441 [Brachypodium
           distachyon]
          Length = 177

 Score =  112 bits (281), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 69/153 (45%), Positives = 90/153 (58%), Gaps = 19/153 (12%)

Query: 1   MNDELCNPTLWSTTTTTTTTST---SNGVVHADSTSDANNHHLDSS------GGSGNSAL 51
           MNDELCNP LWS++ T +  +     +   H  +    N    D S            +L
Sbjct: 34  MNDELCNPVLWSSSATASAAAAAGSQHPHQHQAAPPPPNCADSDVSDAAGGGAAGSGGSL 93

Query: 52  DEARHRYKTSVAALRAVLTAI-PNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLR 110
           DEARHRYK +VAALRA ++A+ P + +  S E          + D+ E ++LEERAS LR
Sbjct: 94  DEARHRYKIAVAALRASISAVAPCAQELGSTEF---------KGDQAETEQLEERASALR 144

Query: 111 KELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           KE+ +KNK+LK LIDQLR LI+DIS WQSPCS 
Sbjct: 145 KEIESKNKHLKLLIDQLRGLISDISMWQSPCSA 177


>gi|242075540|ref|XP_002447706.1| hypothetical protein SORBIDRAFT_06g013860 [Sorghum bicolor]
 gi|241938889|gb|EES12034.1| hypothetical protein SORBIDRAFT_06g013860 [Sorghum bicolor]
          Length = 172

 Score =  111 bits (278), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 66/149 (44%), Positives = 91/149 (61%), Gaps = 16/149 (10%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNS------ALDEA 54
           MNDELCN  LWS+++ +   + S    H  +T+    H  DS             +LDEA
Sbjct: 34  MNDELCNAGLWSSSSVSAAAAGSQH--HHSATTPPPPHSADSDAADAGGGPGPGGSLDEA 91

Query: 55  RHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELA 114
           RHRYK++VAALRA ++A+ +  +            + S +D  EI++LEE AS LRKE+ 
Sbjct: 92  RHRYKSAVAALRASISALSSCAQDTG--------STESEADHAEIERLEEHASALRKEIE 143

Query: 115 NKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           +KNK +K L+DQLRDLITD++ WQSPCSV
Sbjct: 144 SKNKQVKLLMDQLRDLITDVAMWQSPCSV 172


>gi|357149130|ref|XP_003575010.1| PREDICTED: uncharacterized protein LOC100826158 [Brachypodium
           distachyon]
          Length = 169

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 90/141 (63%), Gaps = 8/141 (5%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHRYKT 60
           +N ELCNP LWS+++++        V  A  +  + +      G     +LDEARHRYKT
Sbjct: 35  INHELCNPALWSSSSSSQQQEHQPTVPAAPDSHTSADAEGGGGGAGPGGSLDEARHRYKT 94

Query: 61  SVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNKYL 120
           +V+ALRA +TA+ +S    S E+   P+D  +  DE  I KLEERAS+LRKE+  KNK +
Sbjct: 95  AVSALRASITAVSSS----SQEI--RPLDDKTNKDE--IGKLEERASVLRKEIETKNKNV 146

Query: 121 KRLIDQLRDLITDISTWQSPC 141
           K L++QLRD+I D+S WQSPC
Sbjct: 147 KLLMNQLRDMILDVSMWQSPC 167


>gi|293331961|ref|NP_001170098.1| hypothetical protein [Zea mays]
 gi|224033425|gb|ACN35788.1| unknown [Zea mays]
 gi|414587582|tpg|DAA38153.1| TPA: hypothetical protein ZEAMMB73_493217 [Zea mays]
          Length = 176

 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/154 (42%), Positives = 89/154 (57%), Gaps = 22/154 (14%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVV--HADSTSDANNHHLDSSGGSGNS------ALD 52
           MNDELCN  LWS+++ +   + +      H  S +    H  DS             +LD
Sbjct: 34  MNDELCNAGLWSSSSVSAAAAAAASAGSQHHQSATPPPPHSADSDAADAGGAPGPGGSLD 93

Query: 53  EARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVD---SVSRSDEVEIDKLEERASLL 109
           EARHRYK++V ALR  ++A+            S P D   + S  D+ EI++LEE AS L
Sbjct: 94  EARHRYKSAVVALRDSISAVS-----------SCPQDIGSTESEVDQAEIERLEEHASAL 142

Query: 110 RKELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           RKE+ +KNK +K L+DQLRDLITD++ WQSPCSV
Sbjct: 143 RKEIESKNKQVKLLMDQLRDLITDVAMWQSPCSV 176


>gi|218194747|gb|EEC77174.1| hypothetical protein OsI_15660 [Oryza sativa Indica Group]
 gi|222628762|gb|EEE60894.1| hypothetical protein OsJ_14577 [Oryza sativa Japonica Group]
          Length = 197

 Score = 98.6 bits (244), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 51/93 (54%), Positives = 68/93 (73%), Gaps = 8/93 (8%)

Query: 51  LDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLR 110
           LDEARHRYK ++AALR  + A+ +  +    EM S+      ++D+ EI++LEE AS LR
Sbjct: 113 LDEARHRYKVAMAALRTSIAAVSSCAQ----EMGSTE----HKADQAEIERLEEHASALR 164

Query: 111 KELANKNKYLKRLIDQLRDLITDISTWQSPCSV 143
           KE+ +KNK++K LIDQL DLI+DIS WQSPCSV
Sbjct: 165 KEIESKNKHVKLLIDQLHDLISDISMWQSPCSV 197


>gi|357465037|ref|XP_003602800.1| hypothetical protein MTR_3g099130 [Medicago truncatula]
 gi|355491848|gb|AES73051.1| hypothetical protein MTR_3g099130 [Medicago truncatula]
          Length = 140

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 64/145 (44%), Positives = 78/145 (53%), Gaps = 39/145 (26%)

Query: 1   MNDELCNPTLWSTTTTTTTTS---TSNGVVHADSTSDANNHHLDSSGGSGNSALDEARHR 57
           MNDELCNP  WST+ ++ T+    +SNG   A+S +   +    +  G    ALDEAR R
Sbjct: 31  MNDELCNPVWWSTSPSSATSPNAPSSNG--DANSENSGQHADGAAPSGGAGGALDEARLR 88

Query: 58  YKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKN 117
           YK +VA LR+VL AIPNS                                  +KEL NKN
Sbjct: 89  YKDAVAGLRSVLAAIPNS----------------------------------QKELGNKN 114

Query: 118 KYLKRLIDQLRDLITDISTWQSPCS 142
            +LK LIDQLR+LITDISTWQSP S
Sbjct: 115 LHLKILIDQLRELITDISTWQSPFS 139


>gi|302798244|ref|XP_002980882.1| hypothetical protein SELMODRAFT_420385 [Selaginella moellendorffii]
 gi|300151421|gb|EFJ18067.1| hypothetical protein SELMODRAFT_420385 [Selaginella moellendorffii]
          Length = 143

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 69/108 (63%), Gaps = 6/108 (5%)

Query: 36  NNHHLDSSGG---SGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVS 92
           +N  L   GG   +G +A+D+AR +YK +  ALR ++ AI N  +  S ++ S+   +V 
Sbjct: 36  SNPALWEGGGKEEAGWAAVDDARQKYKAATTALRGIVAAIFNHPQMTSLDVDSTVEKAV- 94

Query: 93  RSDEVEIDKLEERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSP 140
             D+ EID+L+++   LR+E+  KN  LK+LI Q+R+L+ DIS WQ+P
Sbjct: 95  --DQAEIDRLQKQTVELREEIVRKNDVLKQLITQMRELVRDISMWQTP 140


>gi|302755929|ref|XP_002961388.1| hypothetical protein SELMODRAFT_437764 [Selaginella moellendorffii]
 gi|300170047|gb|EFJ36648.1| hypothetical protein SELMODRAFT_437764 [Selaginella moellendorffii]
          Length = 135

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 68/107 (63%), Gaps = 6/107 (5%)

Query: 37  NHHLDSSGG---SGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSR 93
           N  L   GG   +G +A+D+AR +YK +  ALR ++ AI N  +  S ++ S+   +V  
Sbjct: 29  NPALWEGGGKEEAGWAAVDDARQKYKAATTALRGIVAAIFNHPQMMSLDVDSTVEKAV-- 86

Query: 94  SDEVEIDKLEERASLLRKELANKNKYLKRLIDQLRDLITDISTWQSP 140
            D+ EID+L+++   LR+E+  KN  LK+LI Q+R+L+ DIS WQ+P
Sbjct: 87  -DQAEIDRLQKQTVELREEIVRKNDVLKQLITQMRELVRDISMWQTP 132


>gi|168037604|ref|XP_001771293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677382|gb|EDQ63853.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 2035

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 64/127 (50%), Gaps = 24/127 (18%)

Query: 1   MNDELCNPTLWSTTTTTTTTSTSNGVVHADSTSDANNH----------------HLDSSG 44
           +N  LC+P+ W      +  + SNG   ADS++D  N                    S G
Sbjct: 107 LNRALCSPSFW---VAPSLQAGSNGP--ADSSADGANEGGSVATTAGGESGVTKEKASDG 161

Query: 45  GSGNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEE 104
           G G +ALD  R RY+ + AALRA +TAI    +A   E+  + V+++   D  E++ LE+
Sbjct: 162 GEGLTALDATRFRYRNATAALRATVTAILKQTQAARAEIDGATVENI---DNDEVEYLEQ 218

Query: 105 RASLLRK 111
           RA+ LR+
Sbjct: 219 RANDLRE 225


>gi|159486711|ref|XP_001701381.1| hypothetical protein CHLREDRAFT_179302 [Chlamydomonas reinhardtii]
 gi|158271683|gb|EDO97497.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 60

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 24/34 (70%)

Query: 106 ASLLRKELANKNKYLKRLIDQLRDLITDISTWQS 139
            +LL+ EL  KN+ LK+LIDQLR ++  +  W+S
Sbjct: 12  GALLQAELCLKNEALKQLIDQLRQMLESMCMWES 45


>gi|405970719|gb|EKC35598.1| MAM domain-containing glycosylphosphatidylinositol anchor protein 1
           [Crassostrea gigas]
          Length = 909

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 62  VAALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNKYLK 121
           V  ++  +  I  S KA++ E V   VD+V+     ++DK+E+  SLL K L+ +NK + 
Sbjct: 146 VTEIKKTMEGIRTSMKAEA-ETVKKLVDTVTSDQLKQVDKIEQ--SLL-KILSGQNKKID 201

Query: 122 RLIDQLRDLITDISTWQSP 140
             I+ L DLI+    + SP
Sbjct: 202 DYINYLNDLISTFYGYLSP 220


>gi|401826947|ref|XP_003887566.1| Rad50-like protein [Encephalitozoon hellem ATCC 50504]
 gi|392998572|gb|AFM98585.1| Rad50-like protein [Encephalitozoon hellem ATCC 50504]
          Length = 1249

 Score = 37.0 bits (84), Expect = 2.8,   Method: Composition-based stats.
 Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 9/93 (9%)

Query: 48  NSALDEARHRYKTSVAALRAVLTAIPNSHK--AKSFEMVSSPVDSVSRSDEVEIDKLEER 105
           +S LDEAR + +    +L + L    +  +  A++F+ V    D   R + +E+  LE  
Sbjct: 353 SSLLDEARRQREEVFKSLESELLVTKDFRETAAEAFKKVEE--DIRRRKESIEV--LESE 408

Query: 106 ASLLRKELANKNKYL---KRLIDQLRDLITDIS 135
              LRKE  NK K L   KR++D+ RDL+ D S
Sbjct: 409 LLKLRKEDENKMKILSEKKRIVDEYRDLVYDES 441


>gi|260823424|ref|XP_002604183.1| hypothetical protein BRAFLDRAFT_73473 [Branchiostoma floridae]
 gi|229289508|gb|EEN60194.1| hypothetical protein BRAFLDRAFT_73473 [Branchiostoma floridae]
          Length = 439

 Score = 36.2 bits (82), Expect = 4.1,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 45/104 (43%), Gaps = 1/104 (0%)

Query: 28  HADSTSDANNHHLDSSGGS-GNSALDEARHRYKTSVAALRAVLTAIPNSHKAKSFEMVSS 86
            A +TS  N       GG  GN+A DEA  +++ SV   R V+  +  +       ++  
Sbjct: 242 QAGATSGNNPGPSGEGGGQPGNAAQDEANRKFERSVHKARTVMILVAVAFVFWFLPLLVV 301

Query: 87  PVDSVSRSDEVEIDKLEERASLLRKELANKNKYLKRLIDQLRDL 130
           PV  VSR D   ++   E A +    L N    L R  D  R++
Sbjct: 302 PVMCVSRKDVCPVNGWGEEAFVCLNSLINPIATLIRTPDLRREI 345


>gi|403183522|gb|EAT41141.2| AAEL007186-PA [Aedes aegypti]
          Length = 737

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 61/132 (46%), Gaps = 14/132 (10%)

Query: 7   NPTLWSTTTTTTTTSTSNGVVHADSTSDA--NNHHLD--SSGGSGNSALDEARHRYKTSV 62
           NP++ + +T ++ + TS G V +DS+S+   +  H D  SS G+     D+ +       
Sbjct: 431 NPSMDTVSTCSSESVTSFGSVRSDSSSNTITSKSHPDRLSSVGTEPPETDQLQR------ 484

Query: 63  AALRAVLTAIPNSHKAKSFEMVSSPVDSVSRSDEVEIDKLEERASLLRKELANKNKYLKR 122
             LRA L  +  S + + F+      D +        D  E    +LRK+  N  + L  
Sbjct: 485 --LRAELGTV--SEQLRKFQQTKQAQDDIIAEQNARFDATESELKVLRKDYENNQRKLFH 540

Query: 123 LIDQLRDLITDI 134
           +  +++ LIT+I
Sbjct: 541 MQTEIKRLITEI 552


>gi|417859223|ref|ZP_12504280.1| methyl-accepting chemotaxis protein [Agrobacterium tumefaciens F2]
 gi|338825227|gb|EGP59194.1| methyl-accepting chemotaxis protein [Agrobacterium tumefaciens F2]
          Length = 774

 Score = 36.2 bits (82), Expect = 5.1,   Method: Composition-based stats.
 Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 3/84 (3%)

Query: 55  RHRYKTSVAALRAVLTAIPNSHKAKS--FEMVSSPVDSVSRSDEVEIDKLEERASLLRKE 112
           RH + +SV  LR  L ++ NS +  +     VSS  D +SR  E +   LEE A+ L + 
Sbjct: 457 RHDFNSSVVQLREALASVGNSVQTVNGGAHEVSSASDDLSRRTEQQAASLEETAAALEQI 516

Query: 113 LANKNKYLKRLIDQLRDLITDIST 136
            AN +   KR  D  RD + +  T
Sbjct: 517 TANVSATSKRTGDA-RDTVREART 539


>gi|384135188|ref|YP_005517902.1| GerA spore germination protein [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
 gi|339289273|gb|AEJ43383.1| GerA spore germination protein [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
          Length = 482

 Score = 35.8 bits (81), Expect = 5.6,   Method: Composition-based stats.
 Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 17/120 (14%)

Query: 11  WSTTTTTTTTSTSNGVVHADSTSDANNHHLDSSGGSGNSALDE---ARHRYKTSVAALRA 67
           WST   T       G+V AD   +A     D+ G  G   +     A  + +T  AA+RA
Sbjct: 20  WSTQRDTGVVVWLEGLVDADKLQEALLALRDAGGARGRREIPHPGVATQQVETLSAAVRA 79

Query: 68  VL-----TAIPNSHKAKSFEMVSSPVDSVSRSD---------EVEIDKLEERASLLRKEL 113
           VL      A   +  A+++ +   P  SV RS+         E  I++L    +L+RK L
Sbjct: 80  VLDGCAVIARDGASSAQAWRLDRVPARSVDRSENEPTLHGPQEAFIERLSVNLALVRKRL 139


>gi|344272772|ref|XP_003408205.1| PREDICTED: mediator of RNA polymerase II transcription subunit
           30-like [Loxodonta africana]
          Length = 182

 Score = 35.4 bits (80), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 13/63 (20%)

Query: 87  PVDSVSRSDEVEIDKLEERASLLR-------------KELANKNKYLKRLIDQLRDLITD 133
           PV+ +    E +  K ++RASL R             K+L  KN+ LK+++DQLR+LI D
Sbjct: 114 PVEQLIPYVEEDGSKNDDRASLPRFASEERREIAEVNKKLKQKNQQLKQIMDQLRNLIWD 173

Query: 134 IST 136
           I+T
Sbjct: 174 INT 176


>gi|427795653|gb|JAA63278.1| Putative mediator of rna polymerase ii transcription subunit
           30-like protein, partial [Rhipicephalus pulchellus]
          Length = 246

 Score = 35.4 bits (80), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 102 LEERASLLRKELANKNKYLKRLIDQLRDLITDISTW 137
           LEERA  L +++  KN+ LK +ID LR++I +I+T 
Sbjct: 192 LEERARRLTEQVVLKNRQLKEVIDALRNIIWEINTM 227


>gi|375108245|ref|ZP_09754506.1| adenylate/guanylate cyclase family protein [Burkholderiales
           bacterium JOSHI_001]
 gi|374668976|gb|EHR73761.1| adenylate/guanylate cyclase family protein [Burkholderiales
           bacterium JOSHI_001]
          Length = 1022

 Score = 35.4 bits (80), Expect = 7.7,   Method: Composition-based stats.
 Identities = 31/102 (30%), Positives = 45/102 (44%), Gaps = 19/102 (18%)

Query: 30  DSTSDANNHHLDSSGGS---------------GNSALDEARHRYKTSVAALRAVLTAIPN 74
           D   DA  HH D+ G S                 SALD AR  Y+ ++AAL  +   +PN
Sbjct: 561 DEHLDALAHHFDAGGESTLAARYAERAGDRALAASALDRARGHYRVALAALDRL---VPN 617

Query: 75  SHKAKSFEMVSSPVDSVSRSDEVEID-KLEERASLLRKELAN 115
             +A  +  ++  +  VS  D    D  L  RA +L +E A+
Sbjct: 618 GDRALRWIAIAQRLGMVSLYDAKRADLALARRALVLAEEQAD 659


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.308    0.121    0.335 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,139,297,311
Number of Sequences: 23463169
Number of extensions: 75171035
Number of successful extensions: 636434
Number of sequences better than 100.0: 433
Number of HSP's better than 100.0 without gapping: 125
Number of HSP's successfully gapped in prelim test: 308
Number of HSP's that attempted gapping in prelim test: 632483
Number of HSP's gapped (non-prelim): 3086
length of query: 143
length of database: 8,064,228,071
effective HSP length: 107
effective length of query: 36
effective length of database: 9,848,636,284
effective search space: 354550906224
effective search space used: 354550906224
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 71 (32.0 bits)