BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029467
         (193 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224069860|ref|XP_002303058.1| predicted protein [Populus trichocarpa]
 gi|222844784|gb|EEE82331.1| predicted protein [Populus trichocarpa]
          Length = 201

 Score =  269 bits (688), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 139/203 (68%), Positives = 159/203 (78%), Gaps = 12/203 (5%)

Query: 1   MSQGDSKKLEGDAPSA-----SSSSDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLAN 55
           M     K LE +A +      SS S+  KERI++P +LAG+ GG  GL+SKHRKVHGLAN
Sbjct: 1   MGSSGMKPLEENAKAPPPPPPSSFSENWKERILIPTLLAGITGGGVGLVSKHRKVHGLAN 60

Query: 56  ISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALR 115
           IS TYATN SIVT CYCGAREFVRV RK+ PDDLVNSA+AGFGSGALLGRLQGGQ GA R
Sbjct: 61  ISTTYATNFSIVTGCYCGAREFVRVVRKSEPDDLVNSAVAGFGSGALLGRLQGGQFGAFR 120

Query: 116 YSVIFAVVGTTVDYATLRLAPIIRNFRES-----HKDGDWLKLPEWSPIQVLDEEALAEK 170
           YSVIFAVVGTTVD+AT++L P + NF+ES      K   WLKLPEWSPIQVLDEEALA K
Sbjct: 121 YSVIFAVVGTTVDFATIKLRPALSNFKESIFKEKEKKPGWLKLPEWSPIQVLDEEALAAK 180

Query: 171 KAREQKLYAQTQQALGKLRKEES 193
           +ARE++LYA++  ALGKL KEES
Sbjct: 181 EAREKELYARS--ALGKLSKEES 201


>gi|449464128|ref|XP_004149781.1| PREDICTED: uncharacterized protein LOC101211202 [Cucumis sativus]
 gi|449529275|ref|XP_004171626.1| PREDICTED: uncharacterized protein LOC101232073 [Cucumis sativus]
          Length = 197

 Score =  262 bits (669), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 133/198 (67%), Positives = 160/198 (80%), Gaps = 8/198 (4%)

Query: 1   MSQGDSKKLEGDAPS----ASSSSDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANI 56
           M   D K    D P      SS   G KERI  P +LAG+VGG +GLLSK+RKVHGLANI
Sbjct: 1   MDSPDPKTPNSDTPPFSSSTSSDYGGWKERIFYPTLLAGIVGGGAGLLSKYRKVHGLANI 60

Query: 57  SATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRY 116
           SATYATNLSIV ACYCGAREFVRVSR++ PDDL+NSAIAGFG+GALLGRLQGG+ G++RY
Sbjct: 61  SATYATNLSIVAACYCGAREFVRVSRRSEPDDLMNSAIAGFGTGALLGRLQGGRLGSVRY 120

Query: 117 SVIFAVVGTTVDYATLRLAPIIRNFRESHKDG--DWLKLPEWSPIQVLDEEALAEKKARE 174
           S++F +VGTTVDYATL+L P +R+++ES  +G   W+KLPEWSPIQVLDEEALA K+ARE
Sbjct: 121 SLMFTIVGTTVDYATLKLKPTLRSYKESIIEGSSSWMKLPEWSPIQVLDEEALAAKQARE 180

Query: 175 QKLYAQTQQALGKLRKEE 192
           Q+LYA  Q+ALG+L K++
Sbjct: 181 QQLYA--QRALGQLNKKD 196


>gi|359495519|ref|XP_002265905.2| PREDICTED: uncharacterized protein LOC100245423 [Vitis vinifera]
 gi|296085621|emb|CBI29410.3| unnamed protein product [Vitis vinifera]
          Length = 193

 Score =  253 bits (646), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 122/172 (70%), Positives = 144/172 (83%), Gaps = 4/172 (2%)

Query: 24  KERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRK 83
           KER  +P +LAG+VGG +GL+SK+RKVHGLANISATYA N +IVT CYCGAREFVR +R 
Sbjct: 24  KERFFIPTLLAGIVGGGTGLVSKYRKVHGLANISATYAANFAIVTGCYCGAREFVRRNRG 83

Query: 84  TGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNFRE 143
           TGPDDL++SA+ GFGSGA+LG LQGG+ GA+RYSV+FAV GT+VD+A L+L P+ +NF E
Sbjct: 84  TGPDDLISSAVGGFGSGAILGCLQGGRAGAVRYSVMFAVAGTSVDFAILKLKPVWKNFSE 143

Query: 144 --SHKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEES 193
             S K  DWLKLPEWSPIQVLDEEALA KKAREQKLY   Q++LG L KEES
Sbjct: 144 SISQKSEDWLKLPEWSPIQVLDEEALAAKKAREQKLYG--QRSLGSLNKEES 193


>gi|255583982|ref|XP_002532737.1| protein translocase, putative [Ricinus communis]
 gi|223527514|gb|EEF29639.1| protein translocase, putative [Ricinus communis]
          Length = 203

 Score =  237 bits (605), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 130/201 (64%), Positives = 154/201 (76%), Gaps = 13/201 (6%)

Query: 6   SKKLEGDAPSASSS-------SDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISA 58
           SK L+G+A             S+  KERI++P ILAG+ GG  GL+SKHR+ HGLA+IS 
Sbjct: 3   SKPLQGEATHPPPPPSSSSSFSENWKERILIPTILAGIGGGGVGLVSKHRQTHGLASIST 62

Query: 59  TYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSV 118
            YATN +IVT CYCGAREFVRVSRK+ PDDLVNSA AGFG+GALLGRLQGGQ GA RYS+
Sbjct: 63  AYATNFAIVTGCYCGAREFVRVSRKSEPDDLVNSAFAGFGTGALLGRLQGGQFGAFRYSI 122

Query: 119 IFAVVGTTVDYATLRLAPIIRNFRES-----HKDGDWLKLPEWSPIQVLDEEALAEKKAR 173
           IFAVVGTTVD+AT++L P +R+F++S      K G WLKLP W PIQVLDEEALA K+ R
Sbjct: 123 IFAVVGTTVDFATIKLKPKLRSFKQSLYGDKEKKGGWLKLPVWFPIQVLDEEALAAKEVR 182

Query: 174 EQKLYAQ-TQQALGKLRKEES 193
           E++LY Q  Q+ALGKL KEES
Sbjct: 183 EKQLYEQRMQRALGKLSKEES 203


>gi|351723463|ref|NP_001236000.1| uncharacterized protein LOC100526920 [Glycine max]
 gi|255631151|gb|ACU15941.1| unknown [Glycine max]
          Length = 203

 Score =  202 bits (514), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 105/178 (58%), Positives = 134/178 (75%), Gaps = 9/178 (5%)

Query: 24  KERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRK 83
           K RI++P +LAG+ G  +GL+SKHR+  GLA + + YA N +IVT CYCGAREFV  +RK
Sbjct: 27  KSRIVIPTLLAGVAGAGAGLISKHRRTLGLATVCSNYAVNFTIVTGCYCGAREFVAATRK 86

Query: 84  TGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRN--- 140
           TGPDD++NSA+AGFG+GALLGRLQGGQ GA+RYSV+FAVVGTT D+A L+L   +R+   
Sbjct: 87  TGPDDILNSAVAGFGTGALLGRLQGGQLGAIRYSVMFAVVGTTADFAILKLKDALRDHTK 146

Query: 141 -----FRESHKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEES 193
                   S K G WLKLPEW PIQVLDEEALA K+A+E++ + Q +  +  LR+EES
Sbjct: 147 TIYQHIENSKKSGTWLKLPEWFPIQVLDEEALAAKRAQEEQ-FLQQRARIRSLREEES 203


>gi|297835600|ref|XP_002885682.1| mitochondrial import inner membrane translocase subunit
           Tim17/Tim22/Tim23 family protein [Arabidopsis lyrata
           subsp. lyrata]
 gi|297331522|gb|EFH61941.1| mitochondrial import inner membrane translocase subunit
           Tim17/Tim22/Tim23 family protein [Arabidopsis lyrata
           subsp. lyrata]
          Length = 189

 Score =  194 bits (494), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 101/197 (51%), Positives = 139/197 (70%), Gaps = 14/197 (7%)

Query: 1   MSQGDSKKLEGDAPSASSSSDGRKE------RIIVPAILAGLVGGVSGLLSKHRKVHGLA 54
           M+ GD K  E      S  +   +E      R+++ ++LAG++GG +GL+SKHR  H   
Sbjct: 1   MASGDRKSPEQTNQPLSPPTPIVQETGTPTKRVLITSLLAGVIGGGAGLVSKHRIAH--P 58

Query: 55  NISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGAL 114
           NI   YATN +IV  CYCGARE VR++R++  DDL+NSAI G  SGALLGRLQGG +GA+
Sbjct: 59  NIPTVYATNCAIVAGCYCGARESVRITRRSEHDDLMNSAIGGLFSGALLGRLQGGPQGAI 118

Query: 115 RYSVIFAVVGTTVDYATLRLAPIIRNFRESHKDGDWLKLPEWSPIQVLDEEALAEKKARE 174
           RYS++FA VGT  DYA+L+  P++    ES+++ +  KLPEWSPI+VLDEEALA+KKA+E
Sbjct: 119 RYSLVFAAVGTAFDYASLKAKPML----ESYRNMESFKLPEWSPIKVLDEEALAKKKAQE 174

Query: 175 QKLYAQTQQALGKLRKE 191
           +K++   ++ LGKL KE
Sbjct: 175 EKIFP--ERVLGKLNKE 189


>gi|18404423|ref|NP_566759.1| Mitochondrial import inner membrane translocase subunit
           Tim17/Tim22/Tim23 family protein [Arabidopsis thaliana]
 gi|9294168|dbj|BAB02070.1| unnamed protein product [Arabidopsis thaliana]
 gi|89213243|gb|ABD64061.1| At3g25120 [Arabidopsis thaliana]
 gi|208879524|gb|ACI31307.1| At2g45250 [Arabidopsis thaliana]
 gi|332643460|gb|AEE76981.1| Mitochondrial import inner membrane translocase subunit
           Tim17/Tim22/Tim23 family protein [Arabidopsis thaliana]
          Length = 189

 Score =  194 bits (492), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 101/197 (51%), Positives = 138/197 (70%), Gaps = 14/197 (7%)

Query: 1   MSQGDSKKLEG-----DAPSASSSSDGRK-ERIIVPAILAGLVGGVSGLLSKHRKVHGLA 54
           M+ GD K  E        P+     +G   +R+++ ++LAG++GG +GL+SKHR  H   
Sbjct: 1   MALGDRKSPEQTNQALSPPTPIVQENGTPTKRVLITSLLAGVIGGGAGLVSKHRIAH--P 58

Query: 55  NISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGAL 114
           NI   YA N +IV  CYCGARE VR++R++  DDL+NSAI G  SGALLGRLQGG +GA+
Sbjct: 59  NIPTVYAANFAIVAGCYCGARESVRITRRSEHDDLMNSAIGGLFSGALLGRLQGGPKGAI 118

Query: 115 RYSVIFAVVGTTVDYATLRLAPIIRNFRESHKDGDWLKLPEWSPIQVLDEEALAEKKARE 174
           RYS++FA VGT  DYATL+  P++    ES+++ +  KLPEWSPI+VLDEEALA+KKA E
Sbjct: 119 RYSLVFAAVGTAFDYATLKGKPML----ESYRNMESFKLPEWSPIKVLDEEALAKKKAHE 174

Query: 175 QKLYAQTQQALGKLRKE 191
           +K++   ++ LGKL KE
Sbjct: 175 EKIFP--ERVLGKLNKE 189


>gi|21554035|gb|AAM63116.1| unknown [Arabidopsis thaliana]
          Length = 189

 Score =  192 bits (488), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 94/167 (56%), Positives = 127/167 (76%), Gaps = 8/167 (4%)

Query: 25  ERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRKT 84
           +R+++ ++LAG++GG +GL+SKHR  H   NI   YA N +IV  CYCGARE VR++R++
Sbjct: 31  KRVLITSLLAGVIGGGAGLVSKHRIAH--PNIPTVYAANFAIVAGCYCGARESVRITRRS 88

Query: 85  GPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNFRES 144
             DDL+NSAI G  SGALL RLQGG +GA+RYS++FA VGT  DYATL+  PI+    ES
Sbjct: 89  EHDDLMNSAIGGLFSGALLARLQGGPKGAIRYSLVFAAVGTAFDYATLKGKPIL----ES 144

Query: 145 HKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKE 191
           +++ +  KLPEWSPI+VLDEEALA+KKA E+K++   ++ LGKL KE
Sbjct: 145 YRNMESFKLPEWSPIKVLDEEALAKKKAHEEKIFP--ERVLGKLNKE 189


>gi|326495594|dbj|BAJ85893.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 196

 Score =  176 bits (447), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 100/192 (52%), Positives = 129/192 (67%), Gaps = 5/192 (2%)

Query: 4   GDSKKLEGDAPSASSS-SDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYAT 62
           GD +      P A++  S   KERI+VP + AG+VG   GLLS+HR   G    +ATYA 
Sbjct: 6   GDGEATGAGTPEAAAPFSADWKERILVPVVAAGVVGAGFGLLSRHRARLGPIRAAATYAA 65

Query: 63  NLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAV 122
           NLSIV  CY GARE  R +R T PDDL+NS + G  SGA+LGR+QGG  GA++Y+V  AV
Sbjct: 66  NLSIVAGCYGGARELARDARATTPDDLMNSVVGGLASGAVLGRIQGGHFGAVKYAVTLAV 125

Query: 123 VGTTVDYATLRLAPIIRNFRE--SHKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQ 180
            GT +DY  ++L P   +++E  S    DW  +PEWSPIQVLDEEALA+K+ RE+ L+A 
Sbjct: 126 AGTALDYFAMKLRPQWLDWKEHLSVDTKDWFTVPEWSPIQVLDEEALAKKREREKLLFA- 184

Query: 181 TQQALGKLRKEE 192
            Q+ALGK+ KEE
Sbjct: 185 -QRALGKVSKEE 195


>gi|115457958|ref|NP_001052579.1| Os04g0376100 [Oryza sativa Japonica Group]
 gi|38346114|emb|CAE04592.2| OSJNBb0006N15.9 [Oryza sativa Japonica Group]
 gi|113564150|dbj|BAF14493.1| Os04g0376100 [Oryza sativa Japonica Group]
 gi|125590093|gb|EAZ30443.1| hypothetical protein OsJ_14491 [Oryza sativa Japonica Group]
 gi|215734903|dbj|BAG95625.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 197

 Score =  173 bits (439), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 104/184 (56%), Positives = 130/184 (70%), Gaps = 8/184 (4%)

Query: 14  PSASSSSDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCG 73
           P+A  S+D  KERI++PA +AG+VG   GLLS+HR   G A  +ATYA NL+IV  CY G
Sbjct: 16  PAAPFSAD-WKERILLPAAVAGVVGAGFGLLSRHRARLGAARATATYAANLAIVAGCYGG 74

Query: 74  AREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLR 133
           ARE  R +R + PDD +NS + G  SGA+LGRLQGG  GA++Y+V FA  GT +DYA L+
Sbjct: 75  ARELARDARASTPDDPMNSVVGGLASGAVLGRLQGGHFGAVKYAVTFAAAGTALDYAALK 134

Query: 134 LAPIIRNFRESHK-----DGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKL 188
           L+P I     + K     D DW  LPEWSPIQVLDEEALA+K+ARE+KL+A  Q+AL KL
Sbjct: 135 LSPQINGKWHALKEHFSGDKDWFTLPEWSPIQVLDEEALAKKRAREEKLFA--QRALSKL 192

Query: 189 RKEE 192
            KEE
Sbjct: 193 SKEE 196


>gi|125547970|gb|EAY93792.1| hypothetical protein OsI_15570 [Oryza sativa Indica Group]
          Length = 197

 Score =  171 bits (432), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/184 (55%), Positives = 130/184 (70%), Gaps = 8/184 (4%)

Query: 14  PSASSSSDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCG 73
           P+A  S+D  KERI++PA +AG+VG   GLLS+HR   G A  +ATYA NL+IV  CY G
Sbjct: 16  PAAPFSAD-WKERILLPAAVAGVVGAGFGLLSRHRVRLGAARATATYAANLAIVAGCYGG 74

Query: 74  AREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLR 133
           ARE  R +R + PDD +NS + G  SGA+LGR+QGG  GA++Y+V FA  GT +DYA L+
Sbjct: 75  ARELARDARASTPDDPMNSVVGGLASGAVLGRIQGGHFGAVKYAVTFAAAGTALDYAALK 134

Query: 134 LAPIIRN----FRESHKDG-DWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKL 188
           L P +       +E    G DW  LPEWSPIQVLDEEALA+K+ARE+KL+A  Q+ALGKL
Sbjct: 135 LRPQLNGKWHALKEHFSGGKDWFTLPEWSPIQVLDEEALAKKRAREEKLFA--QRALGKL 192

Query: 189 RKEE 192
            KEE
Sbjct: 193 SKEE 196


>gi|357162965|ref|XP_003579580.1| PREDICTED: uncharacterized protein LOC100825365 [Brachypodium
           distachyon]
          Length = 194

 Score =  168 bits (426), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 101/181 (55%), Positives = 127/181 (70%), Gaps = 4/181 (2%)

Query: 15  SASSSSDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGA 74
           +A+  S   KERI+VPA  AG+ G   GLLS+HR   G    +ATYA NL IV  CY GA
Sbjct: 16  AAAPFSADWKERIMVPAAAAGVFGAGFGLLSRHRARLGALRAAATYAANLFIVAGCYGGA 75

Query: 75  REFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRL 134
           RE  R +R T PDDL+NS + G  SGA+LGR+QGG  GA++Y+V  AVVGT +DY+ L+L
Sbjct: 76  RELARDARATTPDDLMNSVVGGLASGAVLGRIQGGHFGAVKYAVTLAVVGTALDYSALKL 135

Query: 135 APIIRNFRE--SHKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEE 192
           +P    ++E  S    DWL LPEWSPIQVLDEEALA+K+ RE+ L+A  Q+ALGKL K+E
Sbjct: 136 SPQWHGWKEHLSIDTKDWLTLPEWSPIQVLDEEALAKKREREEILFA--QRALGKLNKDE 193

Query: 193 S 193
           S
Sbjct: 194 S 194


>gi|239985625|ref|NP_001150698.1| mitochondrial import inner membrane translocase subunit Tim17
           family protein [Zea mays]
 gi|195641154|gb|ACG40045.1| mitochondrial import inner membrane translocase subunit Tim17
           family protein [Zea mays]
          Length = 193

 Score =  167 bits (422), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 98/170 (57%), Positives = 124/170 (72%), Gaps = 3/170 (1%)

Query: 24  KERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRK 83
           +ERII+PA+ AG++G   GLLS+HR   G A  + TYA NL+IV  CY GARE  R +R 
Sbjct: 25  RERIILPAVAAGVLGAAFGLLSRHRARLGAARAAVTYAANLAIVAGCYGGARELARDARA 84

Query: 84  TGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNFRE 143
           T PDD +NS + G  SGA+LGR+QGG  GA++Y+V FA  GT +DYA L+LAP   + +E
Sbjct: 85  TTPDDPMNSVVGGLASGAVLGRIQGGHFGAVKYAVTFAAAGTALDYAALKLAPEWHSLKE 144

Query: 144 S-HKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEE 192
              +  DW  LPEWSPIQVLDEEALA+KKARE+KL+A  Q+ALGKL KE+
Sbjct: 145 QLSEKKDWFTLPEWSPIQVLDEEALAKKKAREEKLFA--QRALGKLDKED 192


>gi|242072624|ref|XP_002446248.1| hypothetical protein SORBIDRAFT_06g009810 [Sorghum bicolor]
 gi|241937431|gb|EES10576.1| hypothetical protein SORBIDRAFT_06g009810 [Sorghum bicolor]
          Length = 193

 Score =  160 bits (406), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 99/171 (57%), Positives = 123/171 (71%), Gaps = 5/171 (2%)

Query: 24  KERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRK 83
           +ERII+PA+ AG+VG   GLLS+HR   G A  + TYA NL+IV  CY GARE  R +R 
Sbjct: 25  RERIIIPAVAAGVVGAAFGLLSRHRARLGAARAAVTYAANLAIVAGCYGGARELARDARA 84

Query: 84  TGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNFRE 143
           T P+D +NS + G  SGA+LGR+QGG  GA++Y+V FA  GT +DYA L+ AP     +E
Sbjct: 85  TTPNDPMNSVVGGLASGAVLGRIQGGHFGAVKYAVTFAAAGTALDYAALKYAPEWHALKE 144

Query: 144 SHKDG--DWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEE 192
            H  G  DW  LPEWSPIQVLDEEALA+KKARE+KL+A  Q+ALGKL KE+
Sbjct: 145 -HLSGKKDWFTLPEWSPIQVLDEEALAKKKAREEKLFA--QRALGKLDKED 192


>gi|218194710|gb|EEC77137.1| hypothetical protein OsI_15575 [Oryza sativa Indica Group]
          Length = 152

 Score =  139 bits (350), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 83/162 (51%), Positives = 103/162 (63%), Gaps = 20/162 (12%)

Query: 36  LVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIA 95
           +VG   GLLS+HR   G A  +ATYA NL+IV  CY GARE  R +R + PDD +NS + 
Sbjct: 5   VVGAGFGLLSRHRARLGAARATATYAANLAIVAGCYGGARELARDARASTPDDPMNSVV- 63

Query: 96  GFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNFRESHK-----DGDW 150
                       GG  GA++Y+V FA  GT +DYA L+L+P I     + K     D DW
Sbjct: 64  ------------GGHFGAVKYAVTFAAAGTALDYAALKLSPQINGKWHALKEHFSGDKDW 111

Query: 151 LKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEE 192
             LPEWSPIQVLDEEALA+K+ARE+KL+A  Q+AL KL KEE
Sbjct: 112 FTLPEWSPIQVLDEEALAKKRAREEKLFA--QRALSKLSKEE 151


>gi|116781489|gb|ABK22121.1| unknown [Picea sitchensis]
          Length = 208

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 114/190 (60%), Gaps = 23/190 (12%)

Query: 20  SDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVR 79
           S G +ERI +P + AG++GG +GLL  + + HG+A +S+ YA NL+IVT+C+CGA+E VR
Sbjct: 25  SKGWQERIYIPTLTAGVLGGGAGLLGGYSRGHGMARLSSAYAINLAIVTSCFCGAQELVR 84

Query: 80  VSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIR 139
             R   P D +NS + G  SG  LGR+QGG+  A +Y+ +FA+VGT + Y   +     +
Sbjct: 85  ELRAAEPGDPINSVLGGIASGGFLGRIQGGRPRAFQYAFMFALVGTGLQYGAHKF----Q 140

Query: 140 NFR----------------ESHKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQ 183
            +R                E   +  W   PEW+PIQ LDEEA A K+A+E+ L+   Q+
Sbjct: 141 EYRSQQYLESSSTDTLGEAEKPSEKKWWSWPEWAPIQKLDEEAAA-KRAQERDLH--IQR 197

Query: 184 ALGKLRKEES 193
            + KL+K ES
Sbjct: 198 TIDKLKKGES 207


>gi|414587685|tpg|DAA38256.1| TPA: import inner membrane translocase subunit Tim17 family protein
           [Zea mays]
          Length = 103

 Score =  125 bits (313), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 62/104 (59%), Positives = 79/104 (75%), Gaps = 3/104 (2%)

Query: 90  VNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNFRES-HKDG 148
           +NS + G  SGA+LGR+QGG  GA++Y+V FA  GT +DYA L+LAP   + +E   +  
Sbjct: 1   MNSVVGGLASGAVLGRIQGGHFGAVKYAVTFAAAGTALDYAALKLAPEWHSLKEQLSEKK 60

Query: 149 DWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEE 192
           DW  LPEWSPIQVLDEEALA+KKARE+KL+A  Q+ALGKL KE+
Sbjct: 61  DWFTLPEWSPIQVLDEEALAKKKAREEKLFA--QRALGKLDKED 102


>gi|27754316|gb|AAO22611.1| unknown protein [Arabidopsis thaliana]
          Length = 76

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 63/81 (77%), Gaps = 6/81 (7%)

Query: 111 RGALRYSVIFAVVGTTVDYATLRLAPIIRNFRESHKDGDWLKLPEWSPIQVLDEEALAEK 170
           +GA+RYS++FA VGT  DYATL+  P++    ES+++ +  KLPEWSPI+VLDEEALA+K
Sbjct: 2   KGAIRYSLVFAAVGTAFDYATLKGKPML----ESYRNMESFKLPEWSPIKVLDEEALAKK 57

Query: 171 KAREQKLYAQTQQALGKLRKE 191
           KA E+K++   ++ LGKL KE
Sbjct: 58  KAHEEKIF--PERVLGKLNKE 76


>gi|168024805|ref|XP_001764926.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683962|gb|EDQ70368.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 323

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 71/153 (46%), Gaps = 24/153 (15%)

Query: 56  ISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALR 115
           +  +  TN  I  +C+  A+E  R  R   P++L+++A+ GF SGALLG   GG+   L 
Sbjct: 68  LGLSVGTNFLIAASCFGAAQEISRELRAAEPENLIDAAVGGFASGALLGHFHGGRARTLP 127

Query: 116 YSVIFAVVGT-----TVDYATLRLAPIIRNFRESHKDGD------------------WLK 152
             ++FAVVGT       +Y   R+   +          D                  W K
Sbjct: 128 MGILFAVVGTGLQLGAAEYKEYRIRHFLNTLPSEPLIADANVEVPVVKEEETTEESSW-K 186

Query: 153 LPEWSPIQVLDEEALAEKKAREQKLYAQTQQAL 185
           LP+W PIQ+L  E  A++ A ++K   +T + L
Sbjct: 187 LPDWFPIQMLSAEQAAKRAAEQEKKRQKTVENL 219


>gi|383157768|gb|AFG61221.1| Pinus taeda anonymous locus 2_8173_01 genomic sequence
 gi|383157770|gb|AFG61222.1| Pinus taeda anonymous locus 2_8173_01 genomic sequence
 gi|383157772|gb|AFG61223.1| Pinus taeda anonymous locus 2_8173_01 genomic sequence
 gi|383157774|gb|AFG61224.1| Pinus taeda anonymous locus 2_8173_01 genomic sequence
          Length = 113

 Score = 70.9 bits (172), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 44/115 (38%), Positives = 65/115 (56%), Gaps = 15/115 (13%)

Query: 91  NSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYAT-----LRLAPIIRN----- 140
           NS + G  SG  LGR+QGG+    +Y+ +FA+VGT + +        R  P + +     
Sbjct: 1   NSVLGGIASGGFLGRIQGGRPRVFQYAFMFALVGTGLQFGANKFQEYRSQPYLESTTTDA 60

Query: 141 FRESHKDGD--WLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQALGKLRKEES 193
             E+ K  +  W   PEW+PIQ LDEEA A K+A+E+ L+   Q+ + KL+K ES
Sbjct: 61  LSEAEKPSEKKWWSWPEWAPIQKLDEEAAA-KRAQERDLH--IQRTIDKLKKGES 112


>gi|302811763|ref|XP_002987570.1| hypothetical protein SELMODRAFT_426311 [Selaginella moellendorffii]
 gi|300144724|gb|EFJ11406.1| hypothetical protein SELMODRAFT_426311 [Selaginella moellendorffii]
          Length = 209

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 74/192 (38%), Gaps = 25/192 (13%)

Query: 1   MSQGDSKKLEGDAPSASSSSDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATY 60
           ++ GD ++    A    +   G  +R++V +  A  +G   GL+    K H +   +   
Sbjct: 5   INSGDGEEKRRAAFKHKTGGQGWAKRVLVGSSAATGIGATIGLIGGAAKGHPVRAFALPV 64

Query: 61  ATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIF 120
             N +I  +   G  E  R   +T P    N+   G  SG +L RLQG         ++F
Sbjct: 65  TINFAIAGSFLSGMTEVTRAMLRTEPGSYTNTIGGGMISGIVLARLQGIAASPFSMGLLF 124

Query: 121 AVVGTTVDYAT-----LRLAPII---------RNFRES--------HKDGDWLKLPEWSP 158
           A VGT   +A       RL   +           FRE+        +K+  W   P W P
Sbjct: 125 AAVGTGCQFAANQWDEYRLKKFVDTLPDDAFPEEFRETRKEMVEEANKERSW---PSWLP 181

Query: 159 IQVLDEEALAEK 170
            + L EE    K
Sbjct: 182 FRKLSEEEFQHK 193


>gi|302822353|ref|XP_002992835.1| hypothetical protein SELMODRAFT_448917 [Selaginella moellendorffii]
 gi|300139383|gb|EFJ06125.1| hypothetical protein SELMODRAFT_448917 [Selaginella moellendorffii]
          Length = 289

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 74/192 (38%), Gaps = 25/192 (13%)

Query: 1   MSQGDSKKLEGDAPSASSSSDGRKERIIVPAILAGLVGGVSGLLSKHRKVHGLANISATY 60
           ++ GD ++    A    +   G  +R++V +  A  +G   GL+    K H +   +   
Sbjct: 85  INSGDGEEKRRAAFKHKTGGQGWAKRVLVGSSAATGIGATIGLIGGAAKGHPVRAFALPV 144

Query: 61  ATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIF 120
             N +I  +   G  E  R   +T P    N+   G  SG +L RLQG         ++F
Sbjct: 145 TINFAIAGSFLSGMTEVTRAMLRTEPGSYTNTIGGGMISGIVLARLQGIAASPFSMGLLF 204

Query: 121 AVVGTTVDYAT-----LRLAPII---------RNFRES--------HKDGDWLKLPEWSP 158
           A VGT   +A       RL   +           FRE+        +K+  W   P W P
Sbjct: 205 AAVGTGCQFAANQWDEYRLKKFVDTLPDDAFPEEFRETRKEMVEEANKERSW---PSWLP 261

Query: 159 IQVLDEEALAEK 170
            + L EE    K
Sbjct: 262 FRKLSEEEFQHK 273


>gi|255075705|ref|XP_002501527.1| predicted protein [Micromonas sp. RCC299]
 gi|226516791|gb|ACO62785.1| predicted protein [Micromonas sp. RCC299]
          Length = 292

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 50/172 (29%), Positives = 77/172 (44%), Gaps = 22/172 (12%)

Query: 35  GLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAI 94
           GL GG +  +  H  +    NI+A  A N++I    +  A+E  R+ R+T  DD VNSA 
Sbjct: 66  GLAGGAAYCV-YHESLDFSHNIAA-IAGNVTIAGVAFLSAKELTRLVRQT--DDWVNSAT 121

Query: 95  AGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYA------------TLRLAPIIRNFR 142
           AG   G+ L  +Q G R +   + +   V   + YA             L    +I + R
Sbjct: 122 AGALVGSQLAVMQRGPRYSAAGAALCGGVAGAIHYAWESDKGSDSLWRALGFRAVI-DPR 180

Query: 143 ESHKDGDWLKLPEWSPIQVLDEEALAEKKAREQKLYAQTQQAL-GKLRKEES 193
                 DW+  P W P++ L +   AE +  E     + Q  L G+L +EE+
Sbjct: 181 TGQATDDWVT-PRWFPVRRLTD---AEAETNEINFQMRVQSVLDGRLSEEEA 228


>gi|242062956|ref|XP_002452767.1| hypothetical protein SORBIDRAFT_04g032110 [Sorghum bicolor]
 gi|241932598|gb|EES05743.1| hypothetical protein SORBIDRAFT_04g032110 [Sorghum bicolor]
          Length = 218

 Score = 43.1 bits (100), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 50/99 (50%), Gaps = 7/99 (7%)

Query: 44  LSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALL 103
           L  +R ++   ++   Y   L +V   +   +  V   R +G DD +N+A+AG G+GAL 
Sbjct: 120 LRANRVLNSCGSVGRAYGNRLGVVAMLFSATKSGVSACR-SGADDWINTAVAGVGTGALY 178

Query: 104 GRLQGGQRGALRYSVIFAVVGTTVDYATLRLA-PIIRNF 141
            RL GG R A    ++  +VG  +  AT+    P++  F
Sbjct: 179 -RLPGGPRSA----IVGGIVGGLLSGATVVAGRPVLGKF 212


>gi|84998644|ref|XP_954043.1| mitochondrial inner membrane protein [Theileria annulata]
 gi|65305041|emb|CAI73366.1| mitochondrial inner membrane protein, putative [Theileria annulata]
          Length = 175

 Score = 39.7 bits (91), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 13/107 (12%)

Query: 36  LVGGVSGLLSKHRK--------VHGLANISATYA----TNLSIVTACYCGAREFVRVSRK 83
           L GG  GLL    K        V+ + N  +TY       L  +T  YCG    V+  R 
Sbjct: 64  LCGGSFGLLKGISKGGATTKLFVNSILNNCSTYGPKLGNRLGCLTIIYCGINSLVKTIRG 123

Query: 84  TGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYA 130
           +   D  ++ IAG  SGA+  ++ G  +   +YS+   +V T +DYA
Sbjct: 124 SDESDKYSAPIAGLLSGAVY-KISGPLKATAKYSLGTGLVFTAIDYA 169


>gi|71033225|ref|XP_766254.1| mitochondrial import inner membrane translocase [Theileria parva
           strain Muguga]
 gi|68353211|gb|EAN33971.1| mitochondrial import inner membrane translocase, putative
           [Theileria parva]
          Length = 175

 Score = 39.3 bits (90), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 13/107 (12%)

Query: 36  LVGGVSGLLSKHRK--------VHGLANISATYA----TNLSIVTACYCGAREFVRVSRK 83
           L GG  GLL    K        V+ + N  +TY       L  +T  YCG    V+  R 
Sbjct: 64  LCGGSFGLLKGISKGGATTKLFVNSILNNCSTYGPKLGNRLGCLTIIYCGINSLVKTVRG 123

Query: 84  TGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYA 130
           +   D  ++ IAG  SGA+  +L G      +YS+   +V T +DYA
Sbjct: 124 SDESDKYSAPIAGLLSGAVY-KLSGPLAATAKYSLGTGLVFTAIDYA 169


>gi|115447853|ref|NP_001047706.1| Os02g0672500 [Oryza sativa Japonica Group]
 gi|50251231|dbj|BAD27817.1| putative mitochondrial inner membrane translocase [Oryza sativa
           Japonica Group]
 gi|50251922|dbj|BAD27859.1| putative mitochondrial inner membrane translocase [Oryza sativa
           Japonica Group]
 gi|113537237|dbj|BAF09620.1| Os02g0672500 [Oryza sativa Japonica Group]
 gi|125540638|gb|EAY87033.1| hypothetical protein OsI_08433 [Oryza sativa Indica Group]
 gi|125583220|gb|EAZ24151.1| hypothetical protein OsJ_07896 [Oryza sativa Japonica Group]
 gi|215766422|dbj|BAG98650.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215766828|dbj|BAG99056.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 199

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 6/103 (5%)

Query: 39  GVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFG 98
           G S  L  +R ++   ++   +     IV   + G   FVR  R  G DD VN+  AG  
Sbjct: 91  GESAKLRINRALNQSGSVGRAFGNRFGIVAMLFAGTESFVRDQRD-GADDWVNTVAAGAS 149

Query: 99  SGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNF 141
           +GAL  R+  G R      ++  ++G  +  A +   P+++ F
Sbjct: 150 AGALY-RIASGPRSM----IVAGILGGVLSGAAVAGKPMLQRF 187


>gi|449302073|gb|EMC98082.1| hypothetical protein BAUCODRAFT_22919 [Baudoinia compniacensis UAMH
           10762]
          Length = 207

 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 88/193 (45%), Gaps = 23/193 (11%)

Query: 18  SSSDGRKERIIVPAILAG--------LVGGVSGLLSKHRKVHGLANISATYATNLSIVTA 69
           ++S+G    ++ P++  G        L+GG SG+L K       A  SA      ++ T 
Sbjct: 8   NASNGTAVDVLAPSLRVGAICGCAGFLIGGTSGIL-KGTTPFLFATASAIQ--TFALGTT 64

Query: 70  CYCGAREFV--RVSRKTGPDDLVN-SAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTT 126
            +     F+  R +    P D    S +AG  SG ++G +  G+R  L   ++++++G T
Sbjct: 65  YWTCRSAFLQSRFAIGQSPRDCTKASTVAGGVSGGIIGLVTRGRRNLLPGMLVWSLLGCT 124

Query: 127 VDYATLRLAPIIRNFRESHKDGDWLKLPE--WSPIQVLDEEALAEK-KAREQKLYAQT-- 181
              A  R +      ++   +G W +L +  W+P++ +  E  AE  + R  KL A+   
Sbjct: 125 GQLAYNRFSTAD---QKQPTEGFWTRLAKQSWTPVKFMTNEEYAEVLRERVLKLDAEIAV 181

Query: 182 -QQALGKLRKEES 193
               + +L+ EE+
Sbjct: 182 LDDKISELKSEEA 194


>gi|428672858|gb|EKX73771.1| mitochondrial import inner membrane translocase subunit, putative
           [Babesia equi]
          Length = 177

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 6/101 (5%)

Query: 35  GLVGGVS-GLLSKHRKVHGLANISATYATNLSIVTAC----YCGAREFVRVSRKTGPDDL 89
           GL+ GVS G  ++   ++ + N    Y  N+    AC    YC      +  R  G  D+
Sbjct: 72  GLIKGVSKGGATQKLFINSILNNCGKYGPNVGNKAACLTLLYCAFNNAFKFIRNAGDHDV 131

Query: 90  VNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYA 130
            N+ IAG  +GA+  +  G      +YS     V T +DYA
Sbjct: 132 YNAPIAGLVTGAVY-KCSGPLMTCAKYSAGTGAVFTAIDYA 171


>gi|395647656|ref|ZP_10435506.1| putative transcriptional regulator [Pseudomonas extremaustralis
           14-3 substr. 14-3b]
          Length = 484

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 24/40 (60%)

Query: 73  GAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRG 112
           G R F+ V+  + PD L  S + G+  GA  G LQGG+RG
Sbjct: 213 GDRPFIHVNCGSIPDSLFESEMFGYERGAFTGALQGGKRG 252


>gi|339442121|ref|YP_004708126.1| transcriptional regulator containing PAS [Clostridium sp. SY8519]
 gi|338901522|dbj|BAK47024.1| transcriptional regulator containing PAS [Clostridium sp. SY8519]
          Length = 459

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 75  REFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRG 112
           + FVRV+    P+DL+ S   G+  GA  G L+GG++G
Sbjct: 195 QNFVRVNCSAIPEDLMESEFFGYSDGAFTGALKGGKKG 232


>gi|403221099|dbj|BAM39232.1| mitochondrial inner membrane protein [Theileria orientalis strain
           Shintoku]
          Length = 175

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 42/93 (45%), Gaps = 1/93 (1%)

Query: 38  GGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGF 97
           GG +G L  +  ++  +         L  +T  YCG    V++ R    D+  N+ +AG 
Sbjct: 79  GGATGKLFLNSVLNNCSTYGPKLGNRLGCLTIIYCGLNGLVKLVRGEKADERFNAPVAGL 138

Query: 98  GSGALLGRLQGGQRGALRYSVIFAVVGTTVDYA 130
            SGA+  ++ G      +YS    VV T +DY 
Sbjct: 139 LSGAVY-KIAGPIAKTAKYSAGTGVVFTAIDYV 170


>gi|340966639|gb|EGS22146.1| NADH-ubiquinone oxidoreductase-like protein [Chaetomium
           thermophilum var. thermophilum DSM 1495]
          Length = 199

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 63/142 (44%), Gaps = 10/142 (7%)

Query: 34  AGLVGGVSGLLSKHRKVHGLANIS--ATYATN--LSIVTACYCGAREFVRVSRKT--GPD 87
           A +VGG+  L +  R      N+   A +  N  L+   A   GA +F R +       +
Sbjct: 24  AAVVGGIGLLFAAVRTSLARKNVGPWAIFTRNGKLAATFAAVGGAYDFTRAAAANLREKE 83

Query: 88  DLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAPIIRNFRESHKD 147
           D VN+ I G  +GA +G   G     L ++ +  VV  T +YA   L  + +   + ++ 
Sbjct: 84  DWVNNGIGGLFAGATMGLTTGRIPRVLGFAALTGVVLATAEYAGSGLRGVFKRDVDEYER 143

Query: 148 GDWLKLPEWSPIQVLDEEALAE 169
            ++L+     PI    EE LAE
Sbjct: 144 KEFLRKNRRRPI----EETLAE 161


>gi|412986866|emb|CCO15292.1| predicted protein [Bathycoccus prasinos]
          Length = 239

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 38/144 (26%), Positives = 62/144 (43%), Gaps = 14/144 (9%)

Query: 20  SDGRKERIIVPAILAGLVGGVSGL---LSKHRKVHGLANISATYATNLSIVTACYCGARE 76
           + G  ER++      G  G  SGL   L + R V    + +   ATN  +V   + G++E
Sbjct: 50  NPGWLERVVEGTFYCGCAGVASGLAFALVRGRPVSVAPHYALNMATNFVVVGGTFLGSKE 109

Query: 77  FVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYATLRLAP 136
             ++ R+   D ++N  IAG  +G+ +  L  G  GA      F + G      +  L+ 
Sbjct: 110 TAKLLRQR--DGVMNDMIAGGVTGSAIWGLYRGVSGA---PSGFLLGGVVAGMGSAMLS- 163

Query: 137 IIRNFRESHKDGDWLKLPEWSPIQ 160
                 E  K  D  +LP W P++
Sbjct: 164 -----EELGKAFDAFELPGWFPVR 182


>gi|320589849|gb|EFX02305.1| hypothetical protein CMQ_2354 [Grosmannia clavigera kw1407]
          Length = 216

 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 45/162 (27%), Positives = 71/162 (43%), Gaps = 31/162 (19%)

Query: 26  RIIVPAILAGLVGGVSGLLSKHRKVHGLANISATYATNL-SIVTACYCGAREFVRVSRKT 84
            I+VP++ AGL  G  GL      V   + I+ + A  L S+VT    GA+ FV  S  +
Sbjct: 27  EILVPSLQAGLGAGTVGLF-----VGAASGIARSAAPVLFSVVT----GAQWFVLGSSYS 77

Query: 85  G-----------------PDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTV 127
                             PD +  SA+AG  +G  +G L  G R  L    +F++ G T 
Sbjct: 78  ASRTALIKAWGGNDNVSQPDKVKVSALAGASAG-FVGGLIRGPRNILPAIAVFSIFGATG 136

Query: 128 DYATLRLAPIIRNFRESHKDGDWLKLPEWSPIQVLDEEALAE 169
             A   ++   R    + +  DWL    WSP++ + ++  A+
Sbjct: 137 QVAANAMSS--RAAESTGEKTDWLA-SRWSPLRAMSDDEYAQ 175


>gi|242398456|ref|YP_002993880.1| Hydroxyethylthiazole kinase [Thermococcus sibiricus MM 739]
 gi|242264849|gb|ACS89531.1| Hydroxyethylthiazole kinase [Thermococcus sibiricus MM 739]
          Length = 268

 Score = 36.6 bits (83), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 48/94 (51%), Gaps = 8/94 (8%)

Query: 31  AILAGLVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAREFVRVSRKTGPDDLV 90
           +++ G  G ++ LL +H K  G+   SATY  N++  T     AREF  V   TGP D V
Sbjct: 118 SVVRGNFGEIAALLGEHGKTRGVD--SATYDENIA-KTIAKEAAREFNTVVAVTGPVDYV 174

Query: 91  NSA--IAGFGSGA-LLGRLQGGQRGALRYSVIFA 121
                + G  +G  +LGR+ G   G +  S+I A
Sbjct: 175 GDGRRVYGVANGTPMLGRVTG--TGCMVTSIIGA 206


>gi|357157802|ref|XP_003577919.1| PREDICTED: uncharacterized protein LOC100826278 isoform 2
           [Brachypodium distachyon]
          Length = 143

 Score = 36.2 bits (82), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 56  ISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLG 104
           +  TYA +L+++ A Y G  + +   R     D VN A+AGF +GA +G
Sbjct: 60  VCGTYAASLAVLGATYVGVEQALENYRM--KKDFVNGAVAGFTAGATMG 106


>gi|357157799|ref|XP_003577918.1| PREDICTED: uncharacterized protein LOC100826278 isoform 1
           [Brachypodium distachyon]
          Length = 144

 Score = 36.2 bits (82), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 56  ISATYATNLSIVTACYCGAREFVRVSRKTGPDDLVNSAIAGFGSGALLG 104
           +  TYA +L+++ A Y G  + +   R     D VN A+AGF +GA +G
Sbjct: 60  VCGTYAASLAVLGATYVGVEQALENYRM--KKDFVNGAVAGFTAGATMG 106


>gi|387888828|ref|YP_006319126.1| putative transcriptional regulator [Escherichia blattae DSM 4481]
 gi|414592894|ref|ZP_11442543.1| putative transcriptional regulator protein [Escherichia blattae
           NBRC 105725]
 gi|386923661|gb|AFJ46615.1| putative transcriptional regulator [Escherichia blattae DSM 4481]
 gi|403196375|dbj|GAB80195.1| putative transcriptional regulator protein [Escherichia blattae
           NBRC 105725]
          Length = 486

 Score = 36.2 bits (82), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 73  GAREFVRVSRKTGPDDLVNSAIAGFGSGALLGRLQGGQRG 112
           G R F+ V+  + P+ L  S + G+  GA  G LQGG+RG
Sbjct: 213 GDRPFIHVNCGSIPESLFESEMFGYERGAFTGALQGGKRG 252


>gi|398407941|ref|XP_003855436.1| hypothetical protein MYCGRDRAFT_36861, partial [Zymoseptoria
           tritici IPO323]
 gi|339475320|gb|EGP90412.1| hypothetical protein MYCGRDRAFT_36861 [Zymoseptoria tritici IPO323]
          Length = 186

 Score = 36.2 bits (82), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 39/144 (27%), Positives = 62/144 (43%), Gaps = 19/144 (13%)

Query: 36  LVGGVSGLLSKHRKVHGLANISATYATNLSIVTACYCGAR-----EFVRVSRKTGPDDLV 90
           L GG +G++   R   G    +A+     +I  + + G R      +   SR T P D  
Sbjct: 3   LFGGAAGIV---RGAPGFLFATASAIQTFAI-GSTFTGVRLGIIHAWTTESRSTTPLDRT 58

Query: 91  N-SAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVD--YATLRLAPIIRNFRESHKD 147
             SAIAG  +G L+G +  G+  A+  +++F + G      Y     AP         K 
Sbjct: 59  KASAIAGGFTGGLIGFITRGRSNAIPGAIMFTLFGAGGQHVYNKRTAAP-----SAEPKK 113

Query: 148 GDWLKLPE--WSPIQVLDEEALAE 169
             W ++ +  W+P +VL  E  AE
Sbjct: 114 NFWRRMSDKSWTPFKVLTNEEYAE 137


>gi|294896754|ref|XP_002775715.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239881938|gb|EER07531.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 178

 Score = 35.8 bits (81), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 46/107 (42%), Gaps = 13/107 (12%)

Query: 36  LVGGVSGLLSKHRK--------VHGLANISAT----YATNLSIVTACYCGAREFVRVSRK 83
           LVGG  GLL   +K        V+ + N  +T     A   +I+T  Y      V  +R 
Sbjct: 67  LVGGTFGLLKGLQKGGATSKLRVNAVMNAMSTQGPALANQAAIITMYYVAFNGLVSWARG 126

Query: 84  TGPDDLVNSAIAGFGSGALLGRLQGGQRGALRYSVIFAVVGTTVDYA 130
            G DD  N+  AG  SGAL     G      +YS   A V T VD+A
Sbjct: 127 -GKDDQFNAVGAGALSGALFKATSGSWIAVAKYSAASAAVFTVVDFA 172


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.132    0.375 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,981,971,357
Number of Sequences: 23463169
Number of extensions: 119884992
Number of successful extensions: 362350
Number of sequences better than 100.0: 82
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 362275
Number of HSP's gapped (non-prelim): 88
length of query: 193
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 59
effective length of database: 9,215,130,721
effective search space: 543692712539
effective search space used: 543692712539
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 72 (32.3 bits)