BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 030190
         (181 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225424758|ref|XP_002268027.1| PREDICTED: uncharacterized protein LOC100260319 [Vitis vinifera]
 gi|296086507|emb|CBI32096.3| unnamed protein product [Vitis vinifera]
          Length = 192

 Score =  226 bits (577), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 109/138 (78%), Positives = 120/138 (86%), Gaps = 2/138 (1%)

Query: 43  LHDSSTISKKAQLSPLSFSAS-FRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAK 101
           LH S T   + QL  L FS+S F NHICRAAEYKFPDPIPEFA  ET+KFRTHLLNK +K
Sbjct: 53  LHSSLTTKLQPQLHSLVFSSSSFANHICRAAEYKFPDPIPEFAQVETEKFRTHLLNKFSK 112

Query: 102 KDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAA 161
           KD++G+S+E+VVGICTEIFSTFLH+EYGGPGTLLV+PFIDMADTLNERGLPG  QAARAA
Sbjct: 113 KDIYGESVEEVVGICTEIFSTFLHTEYGGPGTLLVIPFIDMADTLNERGLPGS-QAARAA 171

Query: 162 VKWAQRHVDKDWKEWTGD 179
           VKWAQ HVDKDW EWTGD
Sbjct: 172 VKWAQDHVDKDWNEWTGD 189


>gi|255588186|ref|XP_002534527.1| conserved hypothetical protein [Ricinus communis]
 gi|223525108|gb|EEF27857.1| conserved hypothetical protein [Ricinus communis]
          Length = 188

 Score =  217 bits (552), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 101/125 (80%), Positives = 108/125 (86%)

Query: 54  QLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVV 113
            L P        NHICRAAEYKFPDPIPEFAD ET+KFRTHL  KL+KK + GDS+E+VV
Sbjct: 60  HLFPYVTKNKGNNHICRAAEYKFPDPIPEFADVETEKFRTHLQKKLSKKGVCGDSIEEVV 119

Query: 114 GICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDW 173
           GICTEIFSTFLHSEYGGPGTLLV+PF DMADT+NERGLPGGPQAARAAVKWAQ HVDKDW
Sbjct: 120 GICTEIFSTFLHSEYGGPGTLLVIPFCDMADTINERGLPGGPQAARAAVKWAQDHVDKDW 179

Query: 174 KEWTG 178
           KEWTG
Sbjct: 180 KEWTG 184


>gi|449434881|ref|XP_004135224.1| PREDICTED: uncharacterized protein LOC101210245 [Cucumis sativus]
 gi|449527396|ref|XP_004170697.1| PREDICTED: uncharacterized LOC101210245 [Cucumis sativus]
          Length = 169

 Score =  216 bits (550), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 95/119 (79%), Positives = 109/119 (91%)

Query: 61  SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIF 120
           S+SF  HIC AAEYKFPDPIPEFA +ET+KFR HLL KL+K+D++GDSL+ VVGICTEIF
Sbjct: 48  SSSFPTHICTAAEYKFPDPIPEFAVAETEKFRAHLLQKLSKRDIYGDSLQQVVGICTEIF 107

Query: 121 STFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
           +TF+H+EYGGPGTLLVLPFIDM+D +NERGLPGGPQAARAAVKWAQ HVDKDW +WTGD
Sbjct: 108 NTFMHTEYGGPGTLLVLPFIDMSDAINERGLPGGPQAARAAVKWAQDHVDKDWNQWTGD 166


>gi|351734434|ref|NP_001237024.1| uncharacterized protein LOC100306067 precursor [Glycine max]
 gi|255627431|gb|ACU14060.1| unknown [Glycine max]
          Length = 173

 Score =  215 bits (547), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 100/135 (74%), Positives = 120/135 (88%), Gaps = 2/135 (1%)

Query: 48  TISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGD 107
           ++  + + S L FS++ R  + RAAEYKFPDPIPEFAD+ETQKF++HLL KL+KKDM+G+
Sbjct: 38  SVKGRRKFSTLPFSSTKR-LVTRAAEYKFPDPIPEFADAETQKFKSHLLKKLSKKDMYGE 96

Query: 108 SLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQR 167
           S+E+VVGICTEIFSTFLHSEYGGPGTLLVLPF+DMAD LNE+GLPGGPQAARAA+KWAQ 
Sbjct: 97  SIEEVVGICTEIFSTFLHSEYGGPGTLLVLPFVDMADALNEQGLPGGPQAARAAIKWAQD 156

Query: 168 HVDKDWKEWT-GDSD 181
           HVDKDWK WT GD++
Sbjct: 157 HVDKDWKGWTNGDNN 171


>gi|18391169|ref|NP_563872.1| uncharacterized protein [Arabidopsis thaliana]
 gi|145323826|ref|NP_001077502.1| uncharacterized protein [Arabidopsis thaliana]
 gi|5091545|gb|AAD39574.1|AC007067_14 T10O24.14 [Arabidopsis thaliana]
 gi|21554792|gb|AAM63692.1| unknown [Arabidopsis thaliana]
 gi|149944329|gb|ABR46207.1| At1g10522 [Arabidopsis thaliana]
 gi|332190468|gb|AEE28589.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332190469|gb|AEE28590.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 179

 Score =  213 bits (541), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 93/118 (78%), Positives = 108/118 (91%)

Query: 61  SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIF 120
           S S R  +CRAAEYKFPDPIPEFA++ET+KFR H+LNKL+K+D+F DS++++VG+CTEIF
Sbjct: 60  SLSRRGFVCRAAEYKFPDPIPEFAEAETEKFRDHMLNKLSKRDLFEDSVDEIVGVCTEIF 119

Query: 121 STFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
            TFL SEYGGPGTLLV+PFIDMADTLNER LPGGPQAARAA+KWAQ HVDKDWKEWTG
Sbjct: 120 ETFLRSEYGGPGTLLVIPFIDMADTLNERELPGGPQAARAAIKWAQDHVDKDWKEWTG 177


>gi|388507390|gb|AFK41761.1| unknown [Lotus japonicus]
          Length = 182

 Score =  208 bits (529), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 93/134 (69%), Positives = 116/134 (86%), Gaps = 1/134 (0%)

Query: 48  TISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGD 107
           +++ + +L+ L FS++ R  + RAAEYKFPDPIPEFAD+ET+KF++HLLNKL+K+D++GD
Sbjct: 48  SVNGRRKLATLPFSSTKR-FVIRAAEYKFPDPIPEFADAETEKFKSHLLNKLSKRDIYGD 106

Query: 108 SLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQR 167
           S+ +VVG+CTEIF +FLHSEYGGPGTLLV PF DMAD ++ERGLPGGPQAAR AVKWAQ 
Sbjct: 107 SVGEVVGVCTEIFGSFLHSEYGGPGTLLVNPFADMADAVSERGLPGGPQAARVAVKWAQN 166

Query: 168 HVDKDWKEWTGDSD 181
           HVDKDW EWTG  D
Sbjct: 167 HVDKDWNEWTGGGD 180


>gi|297843868|ref|XP_002889815.1| hypothetical protein ARALYDRAFT_471166 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335657|gb|EFH66074.1| hypothetical protein ARALYDRAFT_471166 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 180

 Score =  206 bits (523), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 91/118 (77%), Positives = 105/118 (88%)

Query: 61  SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIF 120
           S S R  +CRAAEYKFPDPIPEFA++ET+KFR H+  KL+K+D+F DS++++VGICTEIF
Sbjct: 62  SLSRRGFVCRAAEYKFPDPIPEFAEAETEKFRDHMSKKLSKRDLFEDSVDEIVGICTEIF 121

Query: 121 STFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
            TFL SEYGGPGTLLV+PFIDMADTL ER LPGGPQAARAA+KWAQ HVDKDWKEWTG
Sbjct: 122 GTFLRSEYGGPGTLLVIPFIDMADTLKERELPGGPQAARAAIKWAQDHVDKDWKEWTG 179


>gi|357516739|ref|XP_003628658.1| hypothetical protein MTR_8g063150 [Medicago truncatula]
 gi|355522680|gb|AET03134.1| hypothetical protein MTR_8g063150 [Medicago truncatula]
 gi|388491344|gb|AFK33738.1| unknown [Medicago truncatula]
          Length = 172

 Score =  191 bits (486), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 98/131 (74%), Positives = 112/131 (85%), Gaps = 3/131 (2%)

Query: 48  TISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGD 107
           +I++K    P S   S +  I RAAEYKFPDPIPEFADSET+KF+ HLLNKL+KKD+F +
Sbjct: 40  SINRKVSTLPFS---SIKRSITRAAEYKFPDPIPEFADSETEKFQNHLLNKLSKKDVFEE 96

Query: 108 SLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQR 167
           S+E+VVG+CTEIFSTFLHSEYGGPGTLLV PFIDMAD +NERGLPGGPQAARAA+ WAQ 
Sbjct: 97  SVEEVVGVCTEIFSTFLHSEYGGPGTLLVDPFIDMADIVNERGLPGGPQAARAAINWAQA 156

Query: 168 HVDKDWKEWTG 178
           HVDKDW EWTG
Sbjct: 157 HVDKDWNEWTG 167


>gi|224108113|ref|XP_002314726.1| predicted protein [Populus trichocarpa]
 gi|222863766|gb|EEF00897.1| predicted protein [Populus trichocarpa]
          Length = 96

 Score =  171 bits (433), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 76/92 (82%), Positives = 85/92 (92%)

Query: 87  ETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTL 146
           ET+KF+THL  KL KKD++GDSLE+VVGICTEIFSTFLH+EYGGPGTLLV PFIDMADT+
Sbjct: 1   ETEKFKTHLSKKLTKKDIYGDSLEEVVGICTEIFSTFLHAEYGGPGTLLVTPFIDMADTV 60

Query: 147 NERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
           NE+GLPGGP+AAR AVKWAQ HVD DWKEWTG
Sbjct: 61  NEQGLPGGPEAARVAVKWAQAHVDNDWKEWTG 92


>gi|147777968|emb|CAN65133.1| hypothetical protein VITISV_018708 [Vitis vinifera]
          Length = 170

 Score =  171 bits (432), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 79/93 (84%), Positives = 87/93 (93%), Gaps = 1/93 (1%)

Query: 87  ETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTL 146
           ET+KFRTHLLNK +KKD++G S+E+VVGICTEIFSTFLH+EYGGPGTLLV+PFIDMADTL
Sbjct: 76  ETEKFRTHLLNKFSKKDIYGXSVEEVVGICTEIFSTFLHTEYGGPGTLLVIPFIDMADTL 135

Query: 147 NERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
           NERGLP G QAARAAVKWAQ HVDKDW EWTGD
Sbjct: 136 NERGLP-GSQAARAAVKWAQDHVDKDWNEWTGD 167


>gi|225424768|ref|XP_002266853.1| PREDICTED: uncharacterized protein LOC100251854 [Vitis vinifera]
 gi|296086501|emb|CBI32090.3| unnamed protein product [Vitis vinifera]
          Length = 188

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 85/185 (45%), Positives = 108/185 (58%), Gaps = 25/185 (13%)

Query: 3   CYCNRSLISSKLDPCSPFSSSSLRLAFHRNSS-----VLLGGGIKLHDSST----ISKKA 53
           C    +L +S LD       S LR+A  +N       +L    + +H SST    +S+K 
Sbjct: 12  CQVPMALQTSLLD-------SVLRIAPAKNWPSPNILILRTPCVNVHPSSTSFPILSRKI 64

Query: 54  QLSPLSFSASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDV 112
              PL  S           +Y +PDPIPEFA++ET+KFR  LL KL+K KD F D L  V
Sbjct: 65  FPKPLRASTP--------QKYVYPDPIPEFAEAETEKFRVELLKKLSKEKDTFRDDLHAV 116

Query: 113 VGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKD 172
           V +C +IFS FLH EYGGPGTLLV PF DM   L E+ LPG P AARA++ WAQ +VD+D
Sbjct: 117 VAVCAQIFSEFLHKEYGGPGTLLVEPFTDMLVALKEKKLPGAPLAARASLLWAQNYVDQD 176

Query: 173 WKEWT 177
           W+ W 
Sbjct: 177 WEVWN 181


>gi|255581190|ref|XP_002531408.1| conserved hypothetical protein [Ricinus communis]
 gi|223529001|gb|EEF30992.1| conserved hypothetical protein [Ricinus communis]
          Length = 162

 Score =  139 bits (350), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 65/120 (54%), Positives = 83/120 (69%), Gaps = 8/120 (6%)

Query: 61  SASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEI 119
           + SF+N++       + DPIPEFA SET+KFR  +L KL+K K  FG  L+ VV +C+EI
Sbjct: 47  TPSFQNYV-------YSDPIPEFAQSETEKFRAEILKKLSKDKQTFGGDLDKVVDVCSEI 99

Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
           FS FLH EYGGPGTLLV PF DM   L E+ LPG P AAR ++ WAQ +VD+DW+ W  +
Sbjct: 100 FSDFLHKEYGGPGTLLVEPFTDMLIALKEKNLPGAPLAARESLLWAQNYVDQDWEAWNSN 159


>gi|357516745|ref|XP_003628661.1| hypothetical protein MTR_8g063190 [Medicago truncatula]
 gi|355522683|gb|AET03137.1| hypothetical protein MTR_8g063190 [Medicago truncatula]
          Length = 182

 Score =  135 bits (339), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 62/109 (56%), Positives = 79/109 (72%), Gaps = 3/109 (2%)

Query: 71  AAEYKFPDPIPEFADSETQKFRTHLLNKLAKK---DMFGDSLEDVVGICTEIFSTFLHSE 127
           + ++ +PDPIPEFA+ ETQKF+  LL KL+     D FGD L+ VV +C +IFS FLH E
Sbjct: 69  SQKHVYPDPIPEFAEFETQKFKVQLLQKLSDSEDVDEFGDELDAVVHVCGQIFSEFLHKE 128

Query: 128 YGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
           YGGPGTLLV PF DM   L ++ LPG P AARA++ WAQ +VD+DW+ W
Sbjct: 129 YGGPGTLLVTPFTDMFVALKKKKLPGAPLAARASLLWAQNYVDEDWEIW 177


>gi|326509753|dbj|BAJ87092.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 171

 Score =  133 bits (335), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 69/109 (63%), Positives = 86/109 (78%), Gaps = 2/109 (1%)

Query: 73  EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEIFSTFLHSEYGG 130
           +YKFPDPIPEFA+ ET KFR H+  +L +K  D FG+ +ED+V +CTE+  TFL  EY G
Sbjct: 62  DYKFPDPIPEFAEQETSKFREHMTWRLEQKKEDYFGEHVEDIVDVCTEVLGTFLQHEYCG 121

Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
           PGTLLV PF+DM   + ERGLPG PQAARAA+ WA++++DKDWKEWTGD
Sbjct: 122 PGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKEWTGD 170


>gi|449478547|ref|XP_004155348.1| PREDICTED: uncharacterized LOC101211725 [Cucumis sativus]
          Length = 170

 Score =  133 bits (335), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 59/106 (55%), Positives = 76/106 (71%), Gaps = 1/106 (0%)

Query: 72  AEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEIFSTFLHSEYGG 130
           A Y +P+PIPEFA+ ETQKF+  L  KLAK ++ FG+  + VV +C++IF  +LH EYGG
Sbjct: 58  ASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYLHVEYGG 117

Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
           PGTL+V PF +M   LNER LPG P AAR ++ WAQ H+D DW  W
Sbjct: 118 PGTLIVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDNDWNIW 163


>gi|326523125|dbj|BAJ88603.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 168

 Score =  133 bits (334), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 69/109 (63%), Positives = 86/109 (78%), Gaps = 2/109 (1%)

Query: 73  EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEIFSTFLHSEYGG 130
           +YKFPDPIPEFA+ ET KFR H+  +L +K  D FG+ +ED+V +CTE+  TFL  EY G
Sbjct: 59  DYKFPDPIPEFAEQETSKFREHMTWRLEQKKEDYFGEHVEDIVDVCTEVLGTFLQHEYCG 118

Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
           PGTLLV PF+DM   + ERGLPG PQAARAA+ WA++++DKDWKEWTGD
Sbjct: 119 PGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKEWTGD 167


>gi|226501364|ref|NP_001145504.1| hypothetical protein [Zea mays]
 gi|195657185|gb|ACG48060.1| hypothetical protein [Zea mays]
 gi|414589435|tpg|DAA40006.1| TPA: hypothetical protein ZEAMMB73_406956 [Zea mays]
          Length = 173

 Score =  131 bits (330), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 71/120 (59%), Positives = 90/120 (75%), Gaps = 8/120 (6%)

Query: 68  ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
           +CR A      +YKFPDPIPEFA+ ET KFR H+  +L +K  D FGD +E++V +CTEI
Sbjct: 53  VCRRAKNAGYEDYKFPDPIPEFAEQETSKFREHMAWRLEQKKEDYFGDHVEEIVDVCTEI 112

Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
             TFL ++Y GPGTLLV PF+DM   + ERGLPG PQAARAA+ WA+++VDKDWK WTG+
Sbjct: 113 MGTFLENDYRGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNVDKDWKAWTGE 172


>gi|195637804|gb|ACG38370.1| hypothetical protein [Zea mays]
          Length = 173

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 71/120 (59%), Positives = 90/120 (75%), Gaps = 8/120 (6%)

Query: 68  ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
           +CR A      +YKFPDPIPEFA+ ET KFR H+  +L +K  D FGD +E++V +CTEI
Sbjct: 53  VCRRAKNAGYEDYKFPDPIPEFAEQETSKFREHMAWRLEQKKEDYFGDHVEEIVDVCTEI 112

Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
             TFL ++Y GPGTLLV PF+DM   + ERGLPG PQAARAA+ WA+++VDKDWK WTG+
Sbjct: 113 LGTFLENDYRGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNVDKDWKAWTGE 172


>gi|388513625|gb|AFK44874.1| unknown [Lotus japonicus]
          Length = 172

 Score =  131 bits (329), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 68/148 (45%), Positives = 91/148 (61%), Gaps = 7/148 (4%)

Query: 28  AFHRNSSVLLGGGIKLHDSSTISKKAQLSPLSFSASFRNHICRAAEYKFPDPIPEFADSE 87
            F  N+ +L     +L  SS   K      L  S + R+ +    ++ +PDP PEFA+SE
Sbjct: 24  TFPSNTLILPHHCFRLSSSSLFVK------LRPSFALRDSLTTHKKHVYPDPSPEFAESE 77

Query: 88  TQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTL 146
           TQKFR  L  KL++ +D FGD L  V+ +C +IF+ FL  EYGGPGTLLV PF DM   L
Sbjct: 78  TQKFRIELFQKLSQDEDEFGDHLHAVIDVCAQIFNEFLQKEYGGPGTLLVEPFTDMLVAL 137

Query: 147 NERGLPGGPQAARAAVKWAQRHVDKDWK 174
            ++ LPG P AARA++ WAQ +VD DW+
Sbjct: 138 KKKKLPGAPLAARASLLWAQNYVDHDWE 165


>gi|242049158|ref|XP_002462323.1| hypothetical protein SORBIDRAFT_02g023810 [Sorghum bicolor]
 gi|241925700|gb|EER98844.1| hypothetical protein SORBIDRAFT_02g023810 [Sorghum bicolor]
          Length = 173

 Score =  131 bits (329), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 71/120 (59%), Positives = 90/120 (75%), Gaps = 8/120 (6%)

Query: 68  ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
           +CR A      +YKFPDPIPEFA+ ET KFR H+  +L +K  D FGD +E++V ICTEI
Sbjct: 53  VCRRAKNAGYEDYKFPDPIPEFAEQETSKFREHMAWRLEQKKEDYFGDHVEEIVDICTEI 112

Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
             TFL ++Y GPGTLLV PF+DM   + ERGLPG PQAARAA+ WA++++DKDWK WTG+
Sbjct: 113 LGTFLENDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKAWTGE 172


>gi|449434893|ref|XP_004135230.1| PREDICTED: uncharacterized protein LOC101211725 [Cucumis sativus]
          Length = 170

 Score =  130 bits (326), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 58/106 (54%), Positives = 75/106 (70%), Gaps = 1/106 (0%)

Query: 72  AEYKFPDPIPEFADSETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTEIFSTFLHSEYGG 130
           A Y +P+PIPEFA+ ETQKF+  L  KLAK ++ FG+  + VV +C++IF  +LH EYGG
Sbjct: 58  ASYVYPEPIPEFAEVETQKFKEQLSKKLAKDRETFGNDFDSVVDVCSKIFGEYLHVEYGG 117

Query: 131 PGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
           PGTL+V PF +M   LNER L G P AAR ++ WAQ H+D DW  W
Sbjct: 118 PGTLIVEPFTNMFIALNERKLSGAPLAARTSLLWAQNHLDNDWNIW 163


>gi|357153554|ref|XP_003576489.1| PREDICTED: uncharacterized protein LOC100837297 [Brachypodium
           distachyon]
          Length = 169

 Score =  127 bits (318), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/120 (55%), Positives = 89/120 (74%), Gaps = 8/120 (6%)

Query: 68  ICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICTEI 119
           +CR A      +YKFPDPIPEFA+ ET KFR H+  +L +K  + FG+ +E++V +CTE+
Sbjct: 49  VCRRAKNAAFDDYKFPDPIPEFAEQETSKFREHMTLRLEQKKEEYFGEHVEEIVDVCTEV 108

Query: 120 FSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTGD 179
             TFL  +Y GPGTLLV PF+DM   + ERGLPG PQAARAA+ WA++++DKDWK WTG+
Sbjct: 109 LGTFLQHDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKAWTGE 168


>gi|115478941|ref|NP_001063064.1| Os09g0382400 [Oryza sativa Japonica Group]
 gi|49387643|dbj|BAD25837.1| unknown protein [Oryza sativa Japonica Group]
 gi|113631297|dbj|BAF24978.1| Os09g0382400 [Oryza sativa Japonica Group]
 gi|215693061|dbj|BAG88481.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641488|gb|EEE69620.1| hypothetical protein OsJ_29199 [Oryza sativa Japonica Group]
          Length = 172

 Score =  125 bits (314), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 68/122 (55%), Positives = 90/122 (73%), Gaps = 8/122 (6%)

Query: 66  NHICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICT 117
           + +CR A      +YKFPDPIPEFA  ET KF+ H++ +L +K  D FG+ +E++V +CT
Sbjct: 50  DFVCRRAKNAAYDDYKFPDPIPEFAAQETSKFKEHMMWRLEQKKDDYFGEHVEEIVDVCT 109

Query: 118 EIFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWT 177
           EI  TFL  +Y GPGTLLV PF+DM   + ERGLPG PQAARAA+ WA++++DKDWK WT
Sbjct: 110 EILGTFLEHDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAIAWAEKNIDKDWKAWT 169

Query: 178 GD 179
           G+
Sbjct: 170 GE 171


>gi|218202081|gb|EEC84508.1| hypothetical protein OsI_31201 [Oryza sativa Indica Group]
          Length = 188

 Score =  116 bits (290), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 68/137 (49%), Positives = 90/137 (65%), Gaps = 23/137 (16%)

Query: 66  NHICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICT 117
           + +CR A      +YKFPDPIPEFA  ET KF+ H++ +L +K  D FG+ +E++V +CT
Sbjct: 51  DFVCRRAKNAAYDDYKFPDPIPEFAAQETSKFKEHMMWRLEQKKDDYFGEHVEEIVDVCT 110

Query: 118 E---------------IFSTFLHSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAV 162
           E               I  TFL  +Y GPGTLLV PF+DM   + ERGLPG PQAARAA+
Sbjct: 111 EASSCNFCTIACIGFLILGTFLEHDYCGPGTLLVHPFLDMKGEIKERGLPGAPQAARAAI 170

Query: 163 KWAQRHVDKDWKEWTGD 179
            WA++++DKDWK WTG+
Sbjct: 171 AWAEKNIDKDWKAWTGE 187


>gi|356526856|ref|XP_003532032.1| PREDICTED: uncharacterized protein LOC100778043 [Glycine max]
          Length = 163

 Score =  114 bits (285), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/105 (56%), Positives = 75/105 (71%), Gaps = 2/105 (1%)

Query: 74  YKFPDPIPEFADSETQKFRTHLLNKLAKKDM--FGDSLEDVVGICTEIFSTFLHSEYGGP 131
           + +PDP+PEFA+ ET KF+  L  KL++ D+  FGD L+ VV +C +IFS FLH EYGGP
Sbjct: 55  HVYPDPVPEFAEHETHKFKVELFRKLSEDDVDEFGDDLDAVVAVCAQIFSEFLHKEYGGP 114

Query: 132 GTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEW 176
           GTLLV PF DM   L ++ LPG   AARA++ WAQ  VDKDW+ W
Sbjct: 115 GTLLVEPFTDMMVALKKKKLPGAALAARASLLWAQNFVDKDWEVW 159


>gi|357454523|ref|XP_003597542.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 gi|355486590|gb|AES67793.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
          Length = 280

 Score =  106 bits (264), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 45/54 (83%), Positives = 48/54 (88%)

Query: 125 HSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKDWKEWTG 178
           HSEYGGPGTLLV PFIDMAD +N+RGLPGGPQAAR A+ WAQ HVDKDW EWTG
Sbjct: 222 HSEYGGPGTLLVDPFIDMADIVNDRGLPGGPQAARTAINWAQAHVDKDWNEWTG 275


>gi|224108121|ref|XP_002314729.1| predicted protein [Populus trichocarpa]
 gi|222863769|gb|EEF00900.1| predicted protein [Populus trichocarpa]
          Length = 97

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 51/98 (52%), Gaps = 2/98 (2%)

Query: 76  FPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGIC-TEIFSTFLHSEYGGPGTL 134
            PD + E  + + + +    +N        G S E+    C  +IFS FLH +YGG  TL
Sbjct: 1   MPDSVIE-NEQDVRLWEMMQMNNCMLLQWHGCSGEEKTTFCWNQIFSDFLHKDYGGSETL 59

Query: 135 LVLPFIDMADTLNERGLPGGPQAARAAVKWAQRHVDKD 172
           +V PF D   +L E   PG P AA A++ WAQ + D+D
Sbjct: 60  MVEPFTDALVSLEENFFPGAPSAASASLLWAQNYDDQD 97


>gi|125563560|gb|EAZ08940.1| hypothetical protein OsI_31203 [Oryza sativa Indica Group]
          Length = 118

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/66 (46%), Positives = 44/66 (66%), Gaps = 8/66 (12%)

Query: 66  NHICRAA------EYKFPDPIPEFADSETQKFRTHLLNKLAKK--DMFGDSLEDVVGICT 117
           + +CR A      +YKFPDPIPEFA  ET KF+ H++ +L +K  D FG+ +E++V +CT
Sbjct: 50  DFVCRRAKNAAYDDYKFPDPIPEFAAQETSKFKEHMMWRLEQKKDDYFGEHVEEIVDVCT 109

Query: 118 EIFSTF 123
           E  S F
Sbjct: 110 EASSWF 115


>gi|116792287|gb|ABK26304.1| unknown [Picea sitchensis]
          Length = 177

 Score = 62.4 bits (150), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 56/112 (50%), Gaps = 6/112 (5%)

Query: 69  CRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDVVGICTEIFSTFLHSEY 128
           CRA          +FAD E  +FR  LL  LA     GD    +V  CT+IF  F  S  
Sbjct: 66  CRAVATHKIVAQSDFADHEATRFRDGLLKMLADDQNIGDLRVRMVDTCTKIFREFAMSY- 124

Query: 129 GGPGTLLVLPFIDMADTLNE-RGLPGG--PQAARAAVKWAQRHVDKDWKEWT 177
              G L + PF++M   L E +G+P     +  R A+ W++ ++ +DWK+W+
Sbjct: 125 --SGELPMEPFVEMRRVLEEGQGIPLSIVSKVTRGAIGWSRYNLYRDWKKWS 174


>gi|386721713|ref|YP_006188038.1| hypothetical protein B2K_05970 [Paenibacillus mucilaginosus K02]
 gi|384088837|gb|AFH60273.1| hypothetical protein B2K_05970 [Paenibacillus mucilaginosus K02]
          Length = 764

 Score = 39.3 bits (90), Expect = 0.68,   Method: Composition-based stats.
 Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 14/87 (16%)

Query: 98  KLAKKDMFGDSLEDVVGICTEI--FSTFLHSEYGGPGTLLVLPFIDMAD----------- 144
           +L + D + D   D +G CT +  +S +   +YGGPGT   L  ID  D           
Sbjct: 427 ELGRTD-YIDYAADHIGQCTRLGEYSEWDAQQYGGPGTNHQLALIDSLDDCGSFGATMLR 485

Query: 145 TLNERGLPGGPQAARAAVKWAQRHVDK 171
            L ER LPGG  AA    ++  R  D+
Sbjct: 486 VLRERELPGGRWAADRIAEYITREQDR 512


>gi|337748029|ref|YP_004642191.1| hypothetical protein KNP414_03782 [Paenibacillus mucilaginosus
           KNP414]
 gi|379719123|ref|YP_005311254.1| hypothetical protein PM3016_1175 [Paenibacillus mucilaginosus 3016]
 gi|336299218|gb|AEI42321.1| YesR2 [Paenibacillus mucilaginosus KNP414]
 gi|378567795|gb|AFC28105.1| YesR2 [Paenibacillus mucilaginosus 3016]
          Length = 764

 Score = 38.9 bits (89), Expect = 0.78,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 35/78 (44%), Gaps = 13/78 (16%)

Query: 107 DSLEDVVGICTEI--FSTFLHSEYGGPGTLLVLPFIDMAD-----------TLNERGLPG 153
           D   D +G CT +  +S +   +YGGPGT   L  ID  D            L ER LPG
Sbjct: 435 DYAADHIGQCTRLGEYSEWDAQQYGGPGTNHQLALIDSLDDCGSFGATMLRVLRERELPG 494

Query: 154 GPQAARAAVKWAQRHVDK 171
           G  AA    ++  R  D+
Sbjct: 495 GRWAADRIAEYITREQDR 512


>gi|189238626|ref|XP_001810169.1| PREDICTED: similar to CG12581 CG12581-PA [Tribolium castaneum]
 gi|270008388|gb|EFA04836.1| hypothetical protein TcasGA2_TC014886 [Tribolium castaneum]
          Length = 567

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 48/113 (42%), Gaps = 17/113 (15%)

Query: 45  DSSTISKKAQLSPLSFSASFRN-HICRA----AEYKFPDPIP--EFADSETQKFRTHLLN 97
           ++    KK  L+  +FS S R  H  R+    A  KF  PIP  +  D E  K    ++ 
Sbjct: 462 NTGIYRKKGHLNERAFSYSIRQEHRSRSYGSLANLKFATPIPNGDVEDREDMKKEREIMQ 521

Query: 98  KLAKKDMFGDSLEDVVGICTEIFSTFLHSEYGGPGTLLVLPFIDMADTLNERG 150
            +   D+ GD LE      +E+  T   +  G PG     P I  A  +N RG
Sbjct: 522 MVQDLDLSGDELER-----SEVPRTMYEARGGPPG-----PVIVAAPQMNGRG 564


>gi|424864330|ref|ZP_18288234.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
 gi|400759759|gb|EJP73940.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
          Length = 224

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 27/111 (24%), Positives = 46/111 (41%), Gaps = 3/111 (2%)

Query: 2   ACYCNRSLISSKLDPCSPFSSSSLRLAFHRNSSVLLGGGIKLHDSSTISKKAQLSPLSFS 61
           A  CNR L+      C      ++   FHR SSVL   G  +H  S +  +  +   + S
Sbjct: 115 AIICNRQLLHGSFPNCGYEQRVTVNFGFHRRSSVLGTQGGGIHSESQVFTEEVIQRRARS 174

Query: 62  ASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDV 112
             +      A + K+    P   +       + + N+ A+KD+   +LED+
Sbjct: 175 IGY---AIDARKQKYTSETPYSYEPFESTNTSFVWNEEARKDLKDYNLEDL 222


>gi|147801414|emb|CAN65778.1| hypothetical protein VITISV_014488 [Vitis vinifera]
          Length = 155

 Score = 37.0 bits (84), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 23/34 (67%), Gaps = 1/34 (2%)

Query: 86  SETQKFRTHLLNKLAK-KDMFGDSLEDVVGICTE 118
           +ET+KFR  LL KL+K KD F D L  VV +C +
Sbjct: 34  NETEKFRVELLKKLSKEKDTFRDDLHAVVAVCAQ 67


>gi|424864172|ref|ZP_18288076.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
 gi|400759601|gb|EJP73782.1| phytanoyl-CoA dioxygenase [SAR86 cluster bacterium SAR86B]
          Length = 389

 Score = 36.6 bits (83), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 27/111 (24%), Positives = 46/111 (41%), Gaps = 3/111 (2%)

Query: 2   ACYCNRSLISSKLDPCSPFSSSSLRLAFHRNSSVLLGGGIKLHDSSTISKKAQLSPLSFS 61
           A  CNR L+      C      ++   FHR SSVL   G  +H  S +  +  +   + S
Sbjct: 280 AIICNRQLLHGSFPNCGYEQRVTVNFGFHRRSSVLGTQGGGIHSESQVFTEEIIQRRARS 339

Query: 62  ASFRNHICRAAEYKFPDPIPEFADSETQKFRTHLLNKLAKKDMFGDSLEDV 112
             +      A + K+    P   +       + + N+ A+KD+   +LED+
Sbjct: 340 IGY---AIDARKQKYTSETPYSYEPFESTNTSFVWNEEARKDLKDYNLEDL 387


>gi|226947191|ref|YP_002802264.1| thiol oxidoreductase [Azotobacter vinelandii DJ]
 gi|226722118|gb|ACO81289.1| thiol oxidoreductase [Azotobacter vinelandii DJ]
          Length = 452

 Score = 36.2 bits (82), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 5/42 (11%)

Query: 125 HSEYGGPGTLLVLPFIDMADTLNERGLPGGPQAARAAVKWAQ 166
            +++GGP     LP  D  D LN+ G+PG P   RAA+ W +
Sbjct: 120 ENDHGGP-----LPHPDYGDQLNDTGIPGVPPEGRAAIAWEE 156


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,975,847,763
Number of Sequences: 23463169
Number of extensions: 118300207
Number of successful extensions: 241547
Number of sequences better than 100.0: 39
Number of HSP's better than 100.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 241495
Number of HSP's gapped (non-prelim): 39
length of query: 181
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 48
effective length of database: 9,238,593,890
effective search space: 443452506720
effective search space used: 443452506720
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)