BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025316
         (254 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|15217806|ref|NP_176678.1| Nucleotide-diphospho-sugar transferases superfamily protein
           [Arabidopsis thaliana]
 gi|5042435|gb|AAD38274.1|AC006193_30 Unknown protein [Arabidopsis thaliana]
 gi|34146814|gb|AAQ62415.1| At1g64980 [Arabidopsis thaliana]
 gi|51970684|dbj|BAD44034.1| hypothetical protein [Arabidopsis thaliana]
 gi|332196190|gb|AEE34311.1| Nucleotide-diphospho-sugar transferases superfamily protein
           [Arabidopsis thaliana]
          Length = 259

 Score =  446 bits (1147), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 206/249 (82%), Positives = 222/249 (89%)

Query: 5   VREVTANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGL 64
           V+  T N  + + KPF+IFVGYDPREDLAY+VCH SI KRSSIPV+ITPI+QSDLRK GL
Sbjct: 8   VKSETCNNGSSEKKPFRIFVGYDPREDLAYQVCHHSITKRSSIPVEITPIIQSDLRKKGL 67

Query: 65  YWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVH 124
           YWRERGQ ESTEFSFSRFLTP+L+ + GWA+FVDCDFLYLADIKEL DLIDDKYAIMCV 
Sbjct: 68  YWRERGQLESTEFSFSRFLTPHLSDYQGWAMFVDCDFLYLADIKELTDLIDDKYAIMCVQ 127

Query: 125 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE 184
           HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PE+VNTQTGAFLHRFQWLE
Sbjct: 128 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKTLSPEIVNTQTGAFLHRFQWLE 187

Query: 185 DEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           DEEIGSIPFVWNFLEGHN VVE DPTT PKA+HYTRGGPWF+AWK CEFADLWL EMEEY
Sbjct: 188 DEEIGSIPFVWNFLEGHNRVVEKDPTTQPKAVHYTRGGPWFDAWKDCEFADLWLNEMEEY 247

Query: 245 KNEANKTVE 253
             E  K  +
Sbjct: 248 NKENKKEAD 256


>gi|297840831|ref|XP_002888297.1| hypothetical protein ARALYDRAFT_475501 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334138|gb|EFH64556.1| hypothetical protein ARALYDRAFT_475501 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 260

 Score =  442 bits (1136), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 204/249 (81%), Positives = 221/249 (88%)

Query: 5   VREVTANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGL 64
           V+  T N  + + KPF+IFVGYDPREDLAY+VC  S+ KRSSIPV+I PIVQSDLRK GL
Sbjct: 8   VKSDTCNNGSSEKKPFRIFVGYDPREDLAYQVCRHSVTKRSSIPVEIIPIVQSDLRKKGL 67

Query: 65  YWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVH 124
           YWRERGQ ESTEFSFSRFLTP+L+ + GWA+FVDCDFLYLADIKEL DLIDDKYAIMCV 
Sbjct: 68  YWRERGQLESTEFSFSRFLTPHLSDYQGWAMFVDCDFLYLADIKELTDLIDDKYAIMCVQ 127

Query: 125 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE 184
           HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PE+VNTQTGAFLHRFQWLE
Sbjct: 128 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKTLSPEIVNTQTGAFLHRFQWLE 187

Query: 185 DEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           DEEIGSIPFVWNFLEGHN VVE DPTT PKA+HYTRGGPWF+AWK CEFADLWL EMEEY
Sbjct: 188 DEEIGSIPFVWNFLEGHNRVVEKDPTTQPKAVHYTRGGPWFDAWKDCEFADLWLNEMEEY 247

Query: 245 KNEANKTVE 253
             E+ K  +
Sbjct: 248 NKESKKEAD 256


>gi|224117146|ref|XP_002317490.1| predicted protein [Populus trichocarpa]
 gi|222860555|gb|EEE98102.1| predicted protein [Populus trichocarpa]
          Length = 253

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 210/246 (85%), Positives = 222/246 (90%)

Query: 5   VREVTANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGL 64
           V  VTAN     +KPFKIFVGYDPRED+AY VCH SILKRSSIPV+ITPIVQSDLR +GL
Sbjct: 8   VHPVTANGGFTIDKPFKIFVGYDPREDIAYEVCHHSILKRSSIPVEITPIVQSDLRTNGL 67

Query: 65  YWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVH 124
           YWRERGQ ESTEFSF+RFLTP+LA +DGWA+FVDCDFLYLADIKEL DLIDDKYAIMCV 
Sbjct: 68  YWRERGQLESTEFSFTRFLTPHLANYDGWAMFVDCDFLYLADIKELRDLIDDKYAIMCVQ 127

Query: 125 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE 184
           HDYTPKE TKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPEVVNTQTGAFLHRFQWLE
Sbjct: 128 HDYTPKEATKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEVVNTQTGAFLHRFQWLE 187

Query: 185 DEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           DEEIGSIPFVWNFLEGHN VVE D TTFPKAIHYTRGGPWFEAWK+CEFA+LWLKE +E 
Sbjct: 188 DEEIGSIPFVWNFLEGHNKVVEGDTTTFPKAIHYTRGGPWFEAWKNCEFAELWLKEKDEC 247

Query: 245 KNEANK 250
             E  K
Sbjct: 248 VKEEKK 253


>gi|449461229|ref|XP_004148344.1| PREDICTED: uncharacterized protein LOC101206166 [Cucumis sativus]
 gi|449531569|ref|XP_004172758.1| PREDICTED: uncharacterized protein LOC101224062 [Cucumis sativus]
          Length = 252

 Score =  438 bits (1127), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 203/239 (84%), Positives = 219/239 (91%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           D +PF+IFVGYD REDLAY+VC  SILKRSSIPV+I PI Q+DLRK+G+YWRERGQTEST
Sbjct: 14  DEQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRKNGVYWRERGQTEST 73

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
           EFSFSRFLTPYLA F GWA+FVDCDFLYLADIKEL DLID+K+A+MCVHHDYTPKETTKM
Sbjct: 74  EFSFSRFLTPYLANFKGWAMFVDCDFLYLADIKELRDLIDNKFAVMCVHHDYTPKETTKM 133

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVW 195
           DGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPE+VNTQTGAFLHRFQWLED EIGS+PFVW
Sbjct: 134 DGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQWLEDNEIGSVPFVW 193

Query: 196 NFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKTVEK 254
           NFLEGHN  VE D TT PKAIHYTRGGPWFEAWK+CEFADLWLKEMEEY  EA K  E+
Sbjct: 194 NFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYNKEAEKKSEE 252


>gi|224144884|ref|XP_002325449.1| predicted protein [Populus trichocarpa]
 gi|222862324|gb|EEE99830.1| predicted protein [Populus trichocarpa]
          Length = 234

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 205/233 (87%), Positives = 215/233 (92%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           KPFKIFVGYDP ED+AY VC  SILKRSSIPV ITPIVQS+LR  GLYWRERGQ ESTEF
Sbjct: 2   KPFKIFVGYDPCEDVAYEVCRHSILKRSSIPVDITPIVQSELRTKGLYWRERGQLESTEF 61

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDG 137
           SFSRFLTP+LA +DGWA+FVDCDFLYLADIKEL DLIDDKYAIMCVHHDY PKETTKMDG
Sbjct: 62  SFSRFLTPHLANYDGWAMFVDCDFLYLADIKELRDLIDDKYAIMCVHHDYAPKETTKMDG 121

Query: 138 AVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNF 197
           AVQTVYPRKNWSSMVLYNCGHPKNK+LTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNF
Sbjct: 122 AVQTVYPRKNWSSMVLYNCGHPKNKVLTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNF 181

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANK 250
           LEGHN VVE D TTFPKAIHYTRGGPWFEAWKHCEFA++WLKE +E   EA+K
Sbjct: 182 LEGHNKVVEGDTTTFPKAIHYTRGGPWFEAWKHCEFAEMWLKEKDECVKEADK 234


>gi|357498109|ref|XP_003619343.1| hypothetical protein MTR_6g046960 [Medicago truncatula]
 gi|355494358|gb|AES75561.1| hypothetical protein MTR_6g046960 [Medicago truncatula]
          Length = 263

 Score =  433 bits (1113), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 198/232 (85%), Positives = 214/232 (92%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           PFKIFVGYDPRED+A++VC  SI+KRSSIPV+I PI QSDLRKSGLYWRERGQ ESTEFS
Sbjct: 27  PFKIFVGYDPREDIAFQVCRHSIMKRSSIPVEIIPIKQSDLRKSGLYWRERGQFESTEFS 86

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFLTP LA + GWA+FVDCDFLYLADIKEL DLI+DKYAIMCV HDYTPKETTKMDGA
Sbjct: 87  FTRFLTPSLANYQGWAMFVDCDFLYLADIKELVDLIEDKYAIMCVQHDYTPKETTKMDGA 146

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFL 198
           VQTVYPRKNWSSMVLYNCGHPKNK+LTP+ VN+QTGAFLHRFQWLED+EIGS+PFVWNFL
Sbjct: 147 VQTVYPRKNWSSMVLYNCGHPKNKVLTPDAVNSQTGAFLHRFQWLEDDEIGSVPFVWNFL 206

Query: 199 EGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANK 250
           EGHN  VENDPTT PKAIHYTRGGPWFEAWK+CEFADLWL EMEEY  +A K
Sbjct: 207 EGHNRAVENDPTTSPKAIHYTRGGPWFEAWKNCEFADLWLNEMEEYLTQAKK 258


>gi|225454680|ref|XP_002269835.1| PREDICTED: uncharacterized protein LOC100264930 isoform 1 [Vitis
           vinifera]
 gi|359489862|ref|XP_003633988.1| PREDICTED: uncharacterized protein LOC100264930 [Vitis vinifera]
          Length = 254

 Score =  432 bits (1112), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 200/236 (84%), Positives = 213/236 (90%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           KPFKIFVGYDPREDLAY VC  S+LKRSSIP++ITPI QSDLRKSGLYWRERGQTESTEF
Sbjct: 18  KPFKIFVGYDPREDLAYEVCCHSLLKRSSIPIEITPIKQSDLRKSGLYWRERGQTESTEF 77

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDG 137
           SFSRFLTPYLA ++GWA+FVDCDFLYL DIKEL DL+DD YA+MCV HDYTPKE TKMDG
Sbjct: 78  SFSRFLTPYLANYEGWAMFVDCDFLYLGDIKELRDLVDDNYAMMCVQHDYTPKEKTKMDG 137

Query: 138 AVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNF 197
            VQTVYPRKNWSSMVLYNCGHPKN++LTP+VVN+QTGAFLHRFQWLED EIGSIPFVWNF
Sbjct: 138 VVQTVYPRKNWSSMVLYNCGHPKNRVLTPDVVNSQTGAFLHRFQWLEDHEIGSIPFVWNF 197

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKTVE 253
           L GHN   ENDP+TFPKAIHYT GGPWFEAWK CEF DLWL EMEEYK EANK  E
Sbjct: 198 LVGHNKAEENDPSTFPKAIHYTLGGPWFEAWKDCEFGDLWLNEMEEYKKEANKKTE 253


>gi|255558119|ref|XP_002520088.1| conserved hypothetical protein [Ricinus communis]
 gi|223540852|gb|EEF42412.1| conserved hypothetical protein [Ricinus communis]
          Length = 257

 Score =  432 bits (1112), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 211/249 (84%), Positives = 223/249 (89%)

Query: 5   VREVTANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGL 64
           V  V+AN     NK FK FVGYDPRED+AY+VC  SILKRSSIPV+I PI+QS+LRK  L
Sbjct: 8   VHTVSANGGLTSNKSFKFFVGYDPREDIAYQVCRHSILKRSSIPVEIIPIIQSELRKKNL 67

Query: 65  YWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVH 124
           YWRERGQ ESTEFSFSRFLTPYLA ++GWAVFVDCDFLYLAD+KEL DLIDDKYAIMCV 
Sbjct: 68  YWRERGQLESTEFSFSRFLTPYLANYEGWAVFVDCDFLYLADVKELRDLIDDKYAIMCVQ 127

Query: 125 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE 184
           HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPEVVN QTGAFLHRFQWLE
Sbjct: 128 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEVVNAQTGAFLHRFQWLE 187

Query: 185 DEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           DE+IGSIPFVWNFLEGHN VVE D TTFPKAIHYTRGGPWFEAWK+CEFADLWLKEMEEY
Sbjct: 188 DEDIGSIPFVWNFLEGHNKVVEGDATTFPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEY 247

Query: 245 KNEANKTVE 253
            NE NK  E
Sbjct: 248 MNEENKVAE 256


>gi|357481009|ref|XP_003610790.1| hypothetical protein MTR_5g007050 [Medicago truncatula]
 gi|355512125|gb|AES93748.1| hypothetical protein MTR_5g007050 [Medicago truncatula]
          Length = 279

 Score =  432 bits (1111), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 198/232 (85%), Positives = 214/232 (92%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           PFKIFVGYDPRED+A++VC  SI+KRSSIPV+I PI QSDLRKSGLYWRERGQ ESTEFS
Sbjct: 43  PFKIFVGYDPREDIAFQVCRHSIMKRSSIPVEIIPIKQSDLRKSGLYWRERGQFESTEFS 102

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFLTP LA + GWA+FVDCDFLYLADIKEL DLI+DKYAIMCV HDYTPKETTKMDGA
Sbjct: 103 FTRFLTPSLANYQGWAMFVDCDFLYLADIKELLDLIEDKYAIMCVQHDYTPKETTKMDGA 162

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFL 198
           VQTVYPRKNWSSMVLYNCGHPKNK+LTP+ VN+QTGAFLHRFQWLED+EIGS+PFVWNFL
Sbjct: 163 VQTVYPRKNWSSMVLYNCGHPKNKVLTPDAVNSQTGAFLHRFQWLEDDEIGSVPFVWNFL 222

Query: 199 EGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANK 250
           EGHN  VENDPTT PKAIHYTRGGPWFEAWK+CEFADLWL EMEEY  +A K
Sbjct: 223 EGHNRAVENDPTTSPKAIHYTRGGPWFEAWKNCEFADLWLNEMEEYLTQAKK 274


>gi|225454682|ref|XP_002269871.1| PREDICTED: uncharacterized protein LOC100264930 isoform 2 [Vitis
           vinifera]
          Length = 268

 Score =  432 bits (1110), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 200/236 (84%), Positives = 213/236 (90%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           KPFKIFVGYDPREDLAY VC  S+LKRSSIP++ITPI QSDLRKSGLYWRERGQTESTEF
Sbjct: 32  KPFKIFVGYDPREDLAYEVCCHSLLKRSSIPIEITPIKQSDLRKSGLYWRERGQTESTEF 91

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDG 137
           SFSRFLTPYLA ++GWA+FVDCDFLYL DIKEL DL+DD YA+MCV HDYTPKE TKMDG
Sbjct: 92  SFSRFLTPYLANYEGWAMFVDCDFLYLGDIKELRDLVDDNYAMMCVQHDYTPKEKTKMDG 151

Query: 138 AVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNF 197
            VQTVYPRKNWSSMVLYNCGHPKN++LTP+VVN+QTGAFLHRFQWLED EIGSIPFVWNF
Sbjct: 152 VVQTVYPRKNWSSMVLYNCGHPKNRVLTPDVVNSQTGAFLHRFQWLEDHEIGSIPFVWNF 211

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKTVE 253
           L GHN   ENDP+TFPKAIHYT GGPWFEAWK CEF DLWL EMEEYK EANK  E
Sbjct: 212 LVGHNKAEENDPSTFPKAIHYTLGGPWFEAWKDCEFGDLWLNEMEEYKKEANKKTE 267


>gi|356497421|ref|XP_003517559.1| PREDICTED: uncharacterized protein LOC100798782 [Glycine max]
          Length = 264

 Score =  430 bits (1106), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 201/241 (83%), Positives = 215/241 (89%)

Query: 10  ANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRER 69
           A  +  + KPFKIFVGYDPREDLAY VC  SILKRSSIPV+I PI QSDLRKSGLYWRER
Sbjct: 20  AEIDNNEKKPFKIFVGYDPREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKSGLYWRER 79

Query: 70  GQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTP 129
           GQ ESTEFSFSRFLTP LA + GWA+FVDCDFLYLADIKEL DLI+DKYAIMCV HDY P
Sbjct: 80  GQFESTEFSFSRFLTPCLANYQGWAMFVDCDFLYLADIKELKDLIEDKYAIMCVQHDYAP 139

Query: 130 KETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIG 189
           KETTKMDGAVQTVYPRKNWSSMVLYNC HPKN++LTP+ VNTQTGAFLHRFQWLED+ IG
Sbjct: 140 KETTKMDGAVQTVYPRKNWSSMVLYNCAHPKNRVLTPDTVNTQTGAFLHRFQWLEDDLIG 199

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEAN 249
           SIPFVWNFLEGHN VV+NDPTT PKAIHYTRGGPWFEAWK+CEFADLWL EME+Y  +A 
Sbjct: 200 SIPFVWNFLEGHNRVVDNDPTTSPKAIHYTRGGPWFEAWKNCEFADLWLNEMEDYMKQAK 259

Query: 250 K 250
           K
Sbjct: 260 K 260


>gi|255639816|gb|ACU20201.1| unknown [Glycine max]
          Length = 264

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 200/241 (82%), Positives = 214/241 (88%)

Query: 10  ANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRER 69
           A  +  + KPFKIFVGYDPREDLAY VC  SILKRSSIPV+I PI QSDLRKSGLYWRER
Sbjct: 20  AEIDNNEKKPFKIFVGYDPREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKSGLYWRER 79

Query: 70  GQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTP 129
           GQ ESTEFSFSRFLTP LA + GWA+FVDCDFLYLA IKEL DLI+DKYAIMCV HDY P
Sbjct: 80  GQFESTEFSFSRFLTPCLANYQGWAMFVDCDFLYLAGIKELKDLIEDKYAIMCVQHDYAP 139

Query: 130 KETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIG 189
           KETTKMDGAVQTVYPRKNWSSMVLYNC HPKN++LTP+ VNTQTGAFLHRFQWLED+ IG
Sbjct: 140 KETTKMDGAVQTVYPRKNWSSMVLYNCAHPKNRVLTPDTVNTQTGAFLHRFQWLEDDLIG 199

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEAN 249
           SIPFVWNFLEGHN VV+NDPTT PKAIHYTRGGPWFEAWK+CEFADLWL EME+Y  +A 
Sbjct: 200 SIPFVWNFLEGHNRVVDNDPTTSPKAIHYTRGGPWFEAWKNCEFADLWLNEMEDYMKQAK 259

Query: 250 K 250
           K
Sbjct: 260 K 260


>gi|356538331|ref|XP_003537657.1| PREDICTED: uncharacterized protein LOC100813045 isoform 1 [Glycine
           max]
 gi|356538333|ref|XP_003537658.1| PREDICTED: uncharacterized protein LOC100813045 isoform 2 [Glycine
           max]
          Length = 257

 Score =  426 bits (1096), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 197/235 (83%), Positives = 212/235 (90%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           + KPFKIFVGYDPREDLAY VC  SILKRSSIPV+I PI QSDLRKSGLYWRERGQ EST
Sbjct: 19  EKKPFKIFVGYDPREDLAYEVCRHSILKRSSIPVEIIPIKQSDLRKSGLYWRERGQFEST 78

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
           EFSFSRFLTP LA + GWA+FVDCDFLYLADIKEL +LI+DKYAIMCV HDY PKETTKM
Sbjct: 79  EFSFSRFLTPCLANYQGWAMFVDCDFLYLADIKELKNLIEDKYAIMCVQHDYAPKETTKM 138

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVW 195
           DGAVQTVYPRKNWSSMVLYNC HPKN++LTP+ VNTQ GAFLHRFQWLED++IGSIPFVW
Sbjct: 139 DGAVQTVYPRKNWSSMVLYNCAHPKNRVLTPDTVNTQNGAFLHRFQWLEDDQIGSIPFVW 198

Query: 196 NFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANK 250
           NFLEGHN +V NDPTT PKAIHYTRGGPWFEAWK+CEFADLWL EME+Y  +A K
Sbjct: 199 NFLEGHNRIVGNDPTTSPKAIHYTRGGPWFEAWKNCEFADLWLNEMEDYMKQAKK 253


>gi|388522263|gb|AFK49193.1| unknown [Medicago truncatula]
          Length = 261

 Score =  426 bits (1095), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 195/232 (84%), Positives = 212/232 (91%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           PFKIFVGYDPRED+A++VC  SI+KRSSIPV+I PI QSDLRKSGLYWRERGQ ESTEFS
Sbjct: 25  PFKIFVGYDPREDIAFQVCRHSIMKRSSIPVEIIPIKQSDLRKSGLYWRERGQFESTEFS 84

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFLTP LA + GWA+FVDCDFLYLADIKEL DLI+DKYAI+CV HDYTPKETTKMDG 
Sbjct: 85  FTRFLTPSLANYQGWAMFVDCDFLYLADIKELLDLIEDKYAIVCVQHDYTPKETTKMDGV 144

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFL 198
           VQTVYPRKNWSSMVLYNCGHPKNK+LTP+ VN+QTGAFLHRFQWLED+EIGS+PFVWNFL
Sbjct: 145 VQTVYPRKNWSSMVLYNCGHPKNKVLTPDAVNSQTGAFLHRFQWLEDDEIGSVPFVWNFL 204

Query: 199 EGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANK 250
           EGHN  VENDPTT PKAIHYTRGGPWF AWK+CEFADLWL EMEEY  +A K
Sbjct: 205 EGHNRAVENDPTTSPKAIHYTRGGPWFGAWKNCEFADLWLNEMEEYLTQAKK 256


>gi|356543158|ref|XP_003540030.1| PREDICTED: uncharacterized protein LOC100780640 [Glycine max]
          Length = 248

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 194/237 (81%), Positives = 217/237 (91%), Gaps = 2/237 (0%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           KPFKIFVGYD RED+A+ VC  SILK+SSIPV+I PI QSDLRKSGLYWRE+GQ ESTEF
Sbjct: 10  KPFKIFVGYDSREDVAFEVCRYSILKKSSIPVEIIPIKQSDLRKSGLYWREKGQYESTEF 69

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLI-DDKYAIMCVHHDYTPKETTKMD 136
           SFSRFLTPYLA ++GWA+FVDCDFLYLADIKEL DLI D+KYAIMCV HDY PKETTKMD
Sbjct: 70  SFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIVDEKYAIMCVQHDYAPKETTKMD 129

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWN 196
           GAVQTVYPRKNWSSMVLYNCGHPKN++LTP++VN+QTGAFLHRFQWL+D+EIGSIPFVWN
Sbjct: 130 GAVQTVYPRKNWSSMVLYNCGHPKNRVLTPDLVNSQTGAFLHRFQWLQDDEIGSIPFVWN 189

Query: 197 FLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY-KNEANKTV 252
           FLEGHN VVE+DP+T P+AIHYTRGGPWFEAWK+C+FADLWL E +EY  NEA K +
Sbjct: 190 FLEGHNRVVESDPSTLPRAIHYTRGGPWFEAWKNCDFADLWLNERDEYLNNEAKKAI 246


>gi|225442729|ref|XP_002284941.1| PREDICTED: uncharacterized protein LOC100258273 isoform 2 [Vitis
           vinifera]
 gi|225442731|ref|XP_002284936.1| PREDICTED: uncharacterized protein LOC100258273 isoform 1 [Vitis
           vinifera]
          Length = 258

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 187/239 (78%), Positives = 211/239 (88%)

Query: 15  KDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTES 74
           K +KPFKIFVGYDPRED+AY VC  SILKR+SIPV++ PI QS+LRK GLYWRERG+ ES
Sbjct: 20  KTSKPFKIFVGYDPREDIAYEVCRYSILKRASIPVEVIPIKQSELRKKGLYWRERGKLES 79

Query: 75  TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTK 134
           TEFSFSRFLTPYLA +DGWA+FVDCDFLYLADIKEL DL+DD  AIMCV HDY PKETTK
Sbjct: 80  TEFSFSRFLTPYLANYDGWAMFVDCDFLYLADIKELRDLVDDNCAIMCVQHDYAPKETTK 139

Query: 135 MDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFV 194
           MDGAVQTVYPRKNWSSMVLYNCGHPKN++LTP+VVN+QTGA+LHRF WL+D EIGS+PFV
Sbjct: 140 MDGAVQTVYPRKNWSSMVLYNCGHPKNRVLTPDVVNSQTGAYLHRFMWLDDHEIGSVPFV 199

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKTVE 253
           WNFL GHN V+E+DP+T PKAIHYT GGPWFE+WK+CEFADLW+ EMEE + E     E
Sbjct: 200 WNFLVGHNKVIEDDPSTLPKAIHYTLGGPWFESWKNCEFADLWVNEMEECQKEMKTKTE 258


>gi|357481011|ref|XP_003610791.1| hypothetical protein MTR_5g007050 [Medicago truncatula]
 gi|355512126|gb|AES93749.1| hypothetical protein MTR_5g007050 [Medicago truncatula]
          Length = 214

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 180/209 (86%), Positives = 193/209 (92%)

Query: 42  LKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDF 101
           +KRSSIPV+I PI QSDLRKSGLYWRERGQ ESTEFSF+RFLTP LA + GWA+FVDCDF
Sbjct: 1   MKRSSIPVEIIPIKQSDLRKSGLYWRERGQFESTEFSFTRFLTPSLANYQGWAMFVDCDF 60

Query: 102 LYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN 161
           LYLADIKEL DLI+DKYAIMCV HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN
Sbjct: 61  LYLADIKELLDLIEDKYAIMCVQHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN 120

Query: 162 KILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRG 221
           K+LTP+ VN+QTGAFLHRFQWLED+EIGS+PFVWNFLEGHN  VENDPTT PKAIHYTRG
Sbjct: 121 KVLTPDAVNSQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNRAVENDPTTSPKAIHYTRG 180

Query: 222 GPWFEAWKHCEFADLWLKEMEEYKNEANK 250
           GPWFEAWK+CEFADLWL EMEEY  +A K
Sbjct: 181 GPWFEAWKNCEFADLWLNEMEEYLTQAKK 209


>gi|148907640|gb|ABR16949.1| unknown [Picea sitchensis]
          Length = 412

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 169/225 (75%), Positives = 194/225 (86%), Gaps = 2/225 (0%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            KIF+GYDPREDLAY VC  SILK +++PV+I PI Q DLRK+GLYWR R +TESTEFSF
Sbjct: 117 IKIFIGYDPREDLAYEVCRYSILKHATMPVEIIPIKQPDLRKTGLYWRNRDKTESTEFSF 176

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFLTP+LAGF+GWA+F+DCDFLY ADI+EL + IDDKYAIMCV HDYTPK  TKMDG  
Sbjct: 177 TRFLTPFLAGFEGWALFIDCDFLYTADIRELAEFIDDKYAIMCVKHDYTPKNATKMDGVP 236

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QTVYPRKNWSSMVLYNC HPKN++LTP VVN+Q+GAFLHRF WL++++IGSIPFVWNFL 
Sbjct: 237 QTVYPRKNWSSMVLYNCSHPKNRVLTPSVVNSQSGAFLHRFTWLDNDDIGSIPFVWNFLV 296

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           GHN V  +D   FP+AIHYT GGPWFEAWK CEFADLWLKE +EY
Sbjct: 297 GHNEV--DDENRFPRAIHYTSGGPWFEAWKDCEFADLWLKERDEY 339


>gi|116789082|gb|ABK25110.1| unknown [Picea sitchensis]
          Length = 412

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 169/225 (75%), Positives = 194/225 (86%), Gaps = 2/225 (0%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            KIF+GYDPREDLAY VC  SILK +++PV+I PI Q DLRK+GLYWR R +TESTEFSF
Sbjct: 117 IKIFIGYDPREDLAYEVCRYSILKHATMPVEIIPIKQPDLRKTGLYWRNRDKTESTEFSF 176

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFLTP+LAGF+GWA+F+DCDFLY ADI+EL + IDDKYAIMCV HDYTPK  TKMDG  
Sbjct: 177 TRFLTPFLAGFEGWALFIDCDFLYTADIRELTEFIDDKYAIMCVKHDYTPKNATKMDGVP 236

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QTVYPRKNWSSMVLYNC HPKN++LTP VVN+Q+GAFLHRF WL++++IGSIPFVWNFL 
Sbjct: 237 QTVYPRKNWSSMVLYNCSHPKNRVLTPSVVNSQSGAFLHRFTWLDNDDIGSIPFVWNFLV 296

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           GHN V  +D   FP+AIHYT GGPWFEAWK CEFADLWLKE +EY
Sbjct: 297 GHNEV--DDENRFPRAIHYTSGGPWFEAWKDCEFADLWLKERDEY 339


>gi|148905858|gb|ABR16091.1| unknown [Picea sitchensis]
          Length = 409

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 164/226 (72%), Positives = 189/226 (83%), Gaps = 3/226 (1%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            KIFVGYDPREDLAY VC  SILK ++ PV++ PI Q DLR++GLYWR R  TEST+FSF
Sbjct: 115 LKIFVGYDPREDLAYEVCRHSILKHATFPVEVIPIKQEDLRRAGLYWRTRDPTESTQFSF 174

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFLTP+LAGF+GWA+FVDCDFLY  DI+EL + IDDKYAIMCV HDY PK  TKMDG V
Sbjct: 175 TRFLTPFLAGFEGWAMFVDCDFLYTVDIRELAEFIDDKYAIMCVKHDYKPKSATKMDGVV 234

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QT YPRKNWSSMVLYNC HPKN++LTP VVN+Q+GAFLHRF WL+D +IGSIP+ WNFL 
Sbjct: 235 QTTYPRKNWSSMVLYNCKHPKNRVLTPSVVNSQSGAFLHRFTWLDDNDIGSIPYTWNFLV 294

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYK 245
           GHN + E D    PKAIHYT GGPWFEAWK C+FADLWL+E++EYK
Sbjct: 295 GHNEIDEKD---LPKAIHYTMGGPWFEAWKDCQFADLWLQEVDEYK 337


>gi|357118488|ref|XP_003560986.1| PREDICTED: uncharacterized protein LOC100821670 [Brachypodium
           distachyon]
          Length = 280

 Score =  354 bits (909), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 158/242 (65%), Positives = 202/242 (83%), Gaps = 5/242 (2%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           ++PF++FVGYD RED+AYRVC RS+L+RSSIP++I PIVQ +LR +GLYWRERG TESTE
Sbjct: 20  DEPFRVFVGYDSREDIAYRVCRRSLLRRSSIPLEIIPIVQDELRSAGLYWRERGPTESTE 79

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL---CDLIDDKYAIMCVHHDYTPKETT 133
           FSF+RFLTP+LAG+ GWA+F+DCDFL++AD+ EL   C   + ++A++CVHHDY PKE T
Sbjct: 80  FSFTRFLTPHLAGYRGWALFIDCDFLFVADVAELARMCAAANPRHAVLCVHHDYAPKEAT 139

Query: 134 KMDGAVQTVYPRKNWSSMVLYNCGHPKNK-ILTPEVVNTQTGAFLHRFQWLEDEEIGSIP 192
           KMDGAVQT+YPRKNWSSMVL+NCGHPKN+  LTP+ V+T++GA LHRF WL+D+E+G +P
Sbjct: 140 KMDGAVQTLYPRKNWSSMVLFNCGHPKNRAALTPDEVSTRSGAHLHRFAWLDDDEVGEVP 199

Query: 193 FVWNFLEGHNMVVEND-PTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKT 251
           FVWNFL GHN V  +D   T P+AIHYT GGPWFE +K CEFADLW++E + Y++E  + 
Sbjct: 200 FVWNFLVGHNRVDPDDVAGTTPRAIHYTSGGPWFERYKDCEFADLWIQERDAYESEEKQA 259

Query: 252 VE 253
            E
Sbjct: 260 EE 261


>gi|226496930|ref|NP_001144035.1| uncharacterized protein LOC100276859 [Zea mays]
 gi|195635807|gb|ACG37372.1| hypothetical protein [Zea mays]
          Length = 271

 Score =  354 bits (909), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 163/240 (67%), Positives = 200/240 (83%), Gaps = 6/240 (2%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           +PF++FVGYD RED+AYRVC RS+L+RSSIP++ITPIVQ +LR++GLYWRERG TESTEF
Sbjct: 15  EPFRVFVGYDSREDVAYRVCRRSLLRRSSIPLEITPIVQQELREAGLYWRERGPTESTEF 74

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDL--IDDKYAIMCVHHDYTPKETTKM 135
           SF+RFLTPYLAG+ GWA+FVDCDFL++AD+  L  L   D + A++CVHHDYTP E TKM
Sbjct: 75  SFTRFLTPYLAGYRGWALFVDCDFLFVADVAGLVALARADPRRAVLCVHHDYTPTEATKM 134

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNK-ILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFV 194
           DGAVQT YPRKNWSSMVL+NCGHPKN+  LTPE V+T++GA+LHRF WL+D+EIG +PFV
Sbjct: 135 DGAVQTAYPRKNWSSMVLFNCGHPKNRAALTPEAVSTRSGAYLHRFMWLDDDEIGEVPFV 194

Query: 195 WNFLEGHNMVVENDPT-TFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKTVE 253
           WNFL GHN V   D   T P+AIHYT GGPWF+ +K CEFADLW++E + Y  E +K V+
Sbjct: 195 WNFLVGHNRVDPADEAGTAPRAIHYTSGGPWFDRYKDCEFADLWVQERDAY--EVDKDVD 252


>gi|413924227|gb|AFW64159.1| hypothetical protein ZEAMMB73_266582 [Zea mays]
          Length = 271

 Score =  354 bits (908), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 163/240 (67%), Positives = 200/240 (83%), Gaps = 6/240 (2%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           +PF++FVGYD RED+AYRVC RS+L+RSSIP++ITPIVQ +LR++GLYWRERG TESTEF
Sbjct: 15  EPFRVFVGYDSREDVAYRVCRRSLLRRSSIPLEITPIVQQELREAGLYWRERGPTESTEF 74

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDL--IDDKYAIMCVHHDYTPKETTKM 135
           SF+RFLTPYLAG+ GWA+FVDCDFL++AD+  L  L   D + A++CVHHDYTP E TKM
Sbjct: 75  SFTRFLTPYLAGYRGWALFVDCDFLFVADVAGLVALARADPRRAVLCVHHDYTPTEATKM 134

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNK-ILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFV 194
           DGAVQT YPRKNWSSMVL+NCGHPKN+  LTPE V+T++GA+LHRF WL+D+EIG +PFV
Sbjct: 135 DGAVQTAYPRKNWSSMVLFNCGHPKNRAALTPEAVSTRSGAYLHRFMWLDDDEIGEVPFV 194

Query: 195 WNFLEGHNMVVENDPT-TFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKTVE 253
           WNFL GHN V   D   T P+AIHYT GGPWF+ +K CEFADLW++E + Y  E +K V+
Sbjct: 195 WNFLVGHNRVDPADEAGTAPRAIHYTSGGPWFDRYKDCEFADLWVQERDAY--EVDKDVD 252


>gi|115471711|ref|NP_001059454.1| Os07g0413800 [Oryza sativa Japonica Group]
 gi|23617177|dbj|BAC20854.1| unknown protein [Oryza sativa Japonica Group]
 gi|113610990|dbj|BAF21368.1| Os07g0413800 [Oryza sativa Japonica Group]
 gi|215679005|dbj|BAG96435.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215692534|dbj|BAG87954.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 285

 Score =  353 bits (907), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 156/234 (66%), Positives = 193/234 (82%), Gaps = 2/234 (0%)

Query: 8   VTANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWR 67
           V    + +  + F++FVGYD RED+AYRVC RS+L+RSS+PV + PIVQ +LR +GLYWR
Sbjct: 15  VAGAGDVQAAETFRVFVGYDSREDIAYRVCRRSLLQRSSVPVAVIPIVQQELRSAGLYWR 74

Query: 68  ERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDY 127
           ERG TESTEFSF+RFLTP+LAG+ GWA+FVDCDFL++AD+ EL  + D +YA++CVHHDY
Sbjct: 75  ERGPTESTEFSFTRFLTPHLAGYRGWALFVDCDFLFVADVAELARMADPRYAVLCVHHDY 134

Query: 128 TPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK-ILTPEVVNTQTGAFLHRFQWLEDE 186
            PKE TKMDGAVQTVYPRKNWSSMVL+NC HPKN+  LTPE V+TQ+GA+LHRF WL+D 
Sbjct: 135 APKEATKMDGAVQTVYPRKNWSSMVLFNCAHPKNRAALTPEAVSTQSGAYLHRFMWLDDA 194

Query: 187 EIGSIPFVWNFLEGHNMVVENDPT-TFPKAIHYTRGGPWFEAWKHCEFADLWLK 239
           +IG +PFVWNFL GHN V   D   T P+AIHYT GGPWFE +K+CEFA+LW++
Sbjct: 195 DIGEVPFVWNFLVGHNRVDPADTAGTAPRAIHYTSGGPWFEQYKNCEFAELWVQ 248


>gi|115482940|ref|NP_001065063.1| Os10g0516400 [Oryza sativa Japonica Group]
 gi|13786447|gb|AAK39572.1|AC025296_7 unknown protein [Oryza sativa Japonica Group]
 gi|31433063|gb|AAP54623.1| expressed protein [Oryza sativa Japonica Group]
 gi|113639672|dbj|BAF26977.1| Os10g0516400 [Oryza sativa Japonica Group]
 gi|125575401|gb|EAZ16685.1| hypothetical protein OsJ_32160 [Oryza sativa Japonica Group]
 gi|215737262|dbj|BAG96191.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 268

 Score =  353 bits (907), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 162/243 (66%), Positives = 193/243 (79%), Gaps = 6/243 (2%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           +PF++FVGYDPRED AY VC RS+L+ +SIPV + PI Q DLR +GLYWRERG TESTEF
Sbjct: 10  EPFRVFVGYDPREDEAYEVCRRSLLRHASIPVDVRPIRQPDLRAAGLYWRERGPTESTEF 69

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLI-----DDKYAIMCVHHDYTPKET 132
           SF+RFLTPYLAG+ GWA+FVDCDFLYLADI  L   +     D + A+ CV H+Y P E 
Sbjct: 70  SFTRFLTPYLAGYRGWALFVDCDFLYLADIAGLLACLPSSDPDHRLAVACVKHEYAPAEA 129

Query: 133 TKMDGAVQTVYPRKNWSSMVLYNCGHPKN-KILTPEVVNTQTGAFLHRFQWLEDEEIGSI 191
           TKMDGA+QTVYPRKNWSSMVLYNCGHPKN   LTP+ V+TQTGAFLHRF WL+D+EIG I
Sbjct: 130 TKMDGAIQTVYPRKNWSSMVLYNCGHPKNVAALTPDAVSTQTGAFLHRFAWLDDDEIGEI 189

Query: 192 PFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKT 251
           PF WNFL GHN V   DP+T PKAIHYT GGPWFE +++C+FA+LW+KE +E K +  K 
Sbjct: 190 PFAWNFLVGHNKVDPADPSTQPKAIHYTSGGPWFERYRNCDFAELWIKEADELKADKEKQ 249

Query: 252 VEK 254
            ++
Sbjct: 250 KQQ 252


>gi|125532642|gb|EAY79207.1| hypothetical protein OsI_34319 [Oryza sativa Indica Group]
          Length = 270

 Score =  353 bits (907), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 162/243 (66%), Positives = 193/243 (79%), Gaps = 6/243 (2%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           +PF++FVGYDPRED AY VC RS+L+ +SIPV + PI Q DLR +GLYWRERG TESTEF
Sbjct: 12  EPFRVFVGYDPREDEAYEVCRRSLLRHASIPVDVRPIRQPDLRAAGLYWRERGPTESTEF 71

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLI-----DDKYAIMCVHHDYTPKET 132
           SF+RFLTPYLAG+ GWA+FVDCDFLYLADI  L   +     D + A+ CV H+Y P E 
Sbjct: 72  SFTRFLTPYLAGYRGWALFVDCDFLYLADIAGLLACLPSSDPDHRLAVACVKHEYAPAEA 131

Query: 133 TKMDGAVQTVYPRKNWSSMVLYNCGHPKN-KILTPEVVNTQTGAFLHRFQWLEDEEIGSI 191
           TKMDGA+QTVYPRKNWSSMVLYNCGHPKN   LTP+ V+TQTGAFLHRF WL+D+EIG I
Sbjct: 132 TKMDGAIQTVYPRKNWSSMVLYNCGHPKNVAALTPDAVSTQTGAFLHRFAWLDDDEIGEI 191

Query: 192 PFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKT 251
           PF WNFL GHN V   DP+T PKAIHYT GGPWFE +++C+FA+LW+KE +E K +  K 
Sbjct: 192 PFAWNFLVGHNKVDPADPSTQPKAIHYTSGGPWFERYRNCDFAELWIKEADELKADKEKQ 251

Query: 252 VEK 254
            ++
Sbjct: 252 KQQ 254


>gi|222636888|gb|EEE67020.1| hypothetical protein OsJ_23952 [Oryza sativa Japonica Group]
          Length = 275

 Score =  353 bits (906), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 156/234 (66%), Positives = 193/234 (82%), Gaps = 2/234 (0%)

Query: 8   VTANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWR 67
           V    + +  + F++FVGYD RED+AYRVC RS+L+RSS+PV + PIVQ +LR +GLYWR
Sbjct: 5   VAGAGDVQAAETFRVFVGYDSREDIAYRVCRRSLLQRSSVPVAVIPIVQQELRSAGLYWR 64

Query: 68  ERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDY 127
           ERG TESTEFSF+RFLTP+LAG+ GWA+FVDCDFL++AD+ EL  + D +YA++CVHHDY
Sbjct: 65  ERGPTESTEFSFTRFLTPHLAGYRGWALFVDCDFLFVADVAELARMADPRYAVLCVHHDY 124

Query: 128 TPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK-ILTPEVVNTQTGAFLHRFQWLEDE 186
            PKE TKMDGAVQTVYPRKNWSSMVL+NC HPKN+  LTPE V+TQ+GA+LHRF WL+D 
Sbjct: 125 APKEATKMDGAVQTVYPRKNWSSMVLFNCAHPKNRAALTPEAVSTQSGAYLHRFMWLDDA 184

Query: 187 EIGSIPFVWNFLEGHNMVVENDPT-TFPKAIHYTRGGPWFEAWKHCEFADLWLK 239
           +IG +PFVWNFL GHN V   D   T P+AIHYT GGPWFE +K+CEFA+LW++
Sbjct: 185 DIGEVPFVWNFLVGHNRVDPADTAGTAPRAIHYTSGGPWFEQYKNCEFAELWVQ 238


>gi|357125012|ref|XP_003564190.1| PREDICTED: uncharacterized protein LOC100824782 [Brachypodium
           distachyon]
          Length = 279

 Score =  348 bits (892), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 156/238 (65%), Positives = 197/238 (82%), Gaps = 5/238 (2%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           +PF++FVGYD RED+AYRVC RS+L+RSSIP++I PIVQ +LR +GLYWRER  TESTEF
Sbjct: 20  EPFRVFVGYDSREDIAYRVCRRSLLRRSSIPLEIIPIVQDELRSAGLYWREREPTESTEF 79

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL---CDLIDDKYAIMCVHHDYTPKETTK 134
           SF+RFLTP+LAG+ GWA+F+DCDFL++AD+ EL   C     ++A++CVHHDY PKE TK
Sbjct: 80  SFTRFLTPHLAGYRGWALFIDCDFLFVADVAELARMCAAAKPRHAVLCVHHDYAPKEATK 139

Query: 135 MDGAVQTVYPRKNWSSMVLYNCGHPKNK-ILTPEVVNTQTGAFLHRFQWLEDEEIGSIPF 193
           MDGAVQT+YPRKNWSSMVL+NCGHPKN+  LTP+ V+T++GA LHRF WL+D+E+G +PF
Sbjct: 140 MDGAVQTLYPRKNWSSMVLFNCGHPKNRAALTPDEVSTRSGAHLHRFAWLDDDEVGEVPF 199

Query: 194 VWNFLEGHNMVVEND-PTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANK 250
           VWNFL GHN V  +D   T P+AIHYT GGPWFE +K CEFADLW++E + Y+ E  +
Sbjct: 200 VWNFLVGHNRVDPDDVAGTTPRAIHYTSGGPWFERYKDCEFADLWIQERDAYEAEEKQ 257


>gi|242032543|ref|XP_002463666.1| hypothetical protein SORBIDRAFT_01g003910 [Sorghum bicolor]
 gi|241917520|gb|EER90664.1| hypothetical protein SORBIDRAFT_01g003910 [Sorghum bicolor]
          Length = 283

 Score =  347 bits (890), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 161/238 (67%), Positives = 187/238 (78%), Gaps = 7/238 (2%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
            +PF++FVGYDPRE  AY VC RS+L+RSSIP+ + PI Q+DLR +GLY R RG TESTE
Sbjct: 19  GEPFRVFVGYDPREHEAYEVCRRSLLRRSSIPLDVRPIRQADLRAAGLYTRARGPTESTE 78

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELC------DLIDDKYAIMCVHHDYTPK 130
           FSF+RFLTPYLAG+ GWA+FVDCDFLYLADI EL           D+ A+ CV H+Y P 
Sbjct: 79  FSFTRFLTPYLAGYRGWALFVDCDFLYLADIAELLAAAVPPADAADRIAVACVKHEYQPA 138

Query: 131 ETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN-KILTPEVVNTQTGAFLHRFQWLEDEEIG 189
           E TKMDGA+QTVYPRKNWSSMVLYNC HPKN  +LTP+ V+TQTGAFLHRF WL+D+EIG
Sbjct: 139 EATKMDGAIQTVYPRKNWSSMVLYNCAHPKNVAVLTPDAVSTQTGAFLHRFAWLDDDEIG 198

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNE 247
            +PF WNFL GHN V   DP T PKAIHYT GGPWFE ++ CEFADLWLKE +E + E
Sbjct: 199 EVPFAWNFLVGHNRVDPADPATRPKAIHYTSGGPWFERYRDCEFADLWLKEADELRAE 256


>gi|302817843|ref|XP_002990596.1| hypothetical protein SELMODRAFT_47888 [Selaginella moellendorffii]
 gi|300141518|gb|EFJ08228.1| hypothetical protein SELMODRAFT_47888 [Selaginella moellendorffii]
          Length = 222

 Score =  347 bits (889), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 158/224 (70%), Positives = 184/224 (82%), Gaps = 2/224 (0%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            +IF+GYDPRE  A+ VC  SIL+ +++PV+ITPI QS LR SGLY R +  TESTEFSF
Sbjct: 1   LQIFIGYDPREHDAFEVCRHSILRHATMPVEITPIKQSQLRASGLYTRTKDPTESTEFSF 60

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFLTP+LAGF GWA+FVDCDFLY AD++EL  LIDD+YA+MCV HDYTPK  TKMDG V
Sbjct: 61  TRFLTPFLAGFHGWAMFVDCDFLYTADVRELAALIDDRYAVMCVQHDYTPKSATKMDGVV 120

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QT YPRKNWSSMVLYNC HPKN ILTP +V++QTGAFLHRF WL+D +IGSIPFVWNFL 
Sbjct: 121 QTSYPRKNWSSMVLYNCAHPKNMILTPALVSSQTGAFLHRFLWLDDADIGSIPFVWNFLV 180

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
           GHN +   D    PKAIHYT GGPWFEAWK CE+A+LWL+E E+
Sbjct: 181 GHNKM--GDGAGLPKAIHYTSGGPWFEAWKDCEYAELWLRERED 222


>gi|302770583|ref|XP_002968710.1| hypothetical protein SELMODRAFT_67913 [Selaginella moellendorffii]
 gi|300163215|gb|EFJ29826.1| hypothetical protein SELMODRAFT_67913 [Selaginella moellendorffii]
          Length = 222

 Score =  346 bits (888), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 158/224 (70%), Positives = 184/224 (82%), Gaps = 2/224 (0%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            +IF+GYDPRE  A+ VC  SIL+ +++PV+ITPI QS LR SGLY R +  TESTEFSF
Sbjct: 1   LQIFIGYDPREHDAFEVCRHSILRHATMPVEITPIKQSQLRASGLYTRTKDPTESTEFSF 60

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFLTP+LAGF GWA+FVDCDFLY AD++EL  LIDD YA+MCV HDYTPK  TKMDG V
Sbjct: 61  TRFLTPFLAGFHGWAMFVDCDFLYTADVRELAALIDDHYAVMCVQHDYTPKSATKMDGVV 120

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QT YPRKNWSSMVLYNC HPKN+ILTP +V++QTGAFLHRF WL+D +IGSIPFVWNFL 
Sbjct: 121 QTSYPRKNWSSMVLYNCAHPKNRILTPALVSSQTGAFLHRFLWLDDADIGSIPFVWNFLV 180

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
           GHN +   D    PKAIHYT GGPWFEAWK CE+A+LWL+E E+
Sbjct: 181 GHNKM--GDGAGLPKAIHYTSGGPWFEAWKDCEYAELWLRERED 222


>gi|242063368|ref|XP_002452973.1| hypothetical protein SORBIDRAFT_04g035800 [Sorghum bicolor]
 gi|241932804|gb|EES05949.1| hypothetical protein SORBIDRAFT_04g035800 [Sorghum bicolor]
          Length = 270

 Score =  343 bits (881), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 161/252 (63%), Positives = 204/252 (80%), Gaps = 7/252 (2%)

Query: 9   TANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRE 68
           + N  +   +PF++FVGYD RED+AYRVC RS+L+RSSIP++ITPIVQ +LR +GLYWRE
Sbjct: 3   STNLASAVAEPFRVFVGYDSREDIAYRVCRRSLLRRSSIPLEITPIVQQELRDAGLYWRE 62

Query: 69  RGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDD---KYAIMCVHH 125
           RG TESTEFSF+RFLTPYLAG+ GWA+FVDCDFL++ D+  L  L  D   ++A++CVHH
Sbjct: 63  RGPTESTEFSFTRFLTPYLAGYRGWALFVDCDFLFVDDVAALARLAADADPRHAVLCVHH 122

Query: 126 DYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK-ILTPEVVNTQTGAFLHRFQWLE 184
           DYTP E TKMDGAVQT YPRKNWSSMVL++CGHPKN+  LTPE V+T++GA+LHRF WL+
Sbjct: 123 DYTPTEATKMDGAVQTAYPRKNWSSMVLFDCGHPKNRAALTPEAVSTRSGAYLHRFMWLD 182

Query: 185 DEEIGSIPFVWNFLEGHNMVVENDPT-TFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
           D+E+G +PFVWNFL GHN V   D   T P+AIHYT GGPWFE ++ CEFA+LW++E + 
Sbjct: 183 DDEVGEVPFVWNFLVGHNRVDPADEAGTAPRAIHYTSGGPWFERYRDCEFANLWVQERDA 242

Query: 244 YKNE--ANKTVE 253
           Y+ E  A+K V+
Sbjct: 243 YEAEEAADKDVD 254


>gi|357146963|ref|XP_003574173.1| PREDICTED: uncharacterized protein LOC100823495 [Brachypodium
           distachyon]
          Length = 276

 Score =  336 bits (862), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 154/246 (62%), Positives = 185/246 (75%), Gaps = 12/246 (4%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
             PF++FVGYDPRE  AY VC RS+L+ +++P+ + P+ Q DLR +GLYWR RG TESTE
Sbjct: 11  GSPFRVFVGYDPREHEAYEVCRRSLLRHATVPLDVRPVRQPDLRAAGLYWRARGPTESTE 70

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELC-----------DLIDDKYAIMCVHH 125
           FSF+RFLTPYLAG+ GWA+FVDCDFLYLAD+ EL                D+ A++CV H
Sbjct: 71  FSFTRFLTPYLAGYRGWALFVDCDFLYLADVAELLASAVPASADDPAAAADRLAVVCVKH 130

Query: 126 DYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN-KILTPEVVNTQTGAFLHRFQWLE 184
           +Y P E TKMDGA+QTVYPRKNWSSMVLYNC HPKN   LTPE V+TQTGAFLHRF WL+
Sbjct: 131 EYAPAEATKMDGAIQTVYPRKNWSSMVLYNCAHPKNVAALTPEAVSTQTGAFLHRFSWLD 190

Query: 185 DEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           D+EIG +PF WNFL GHN V   DP T PKA+HYT GGPWFE +K C+FADLW+KE E  
Sbjct: 191 DDEIGEVPFAWNFLVGHNKVDPADPATQPKALHYTCGGPWFERYKDCDFADLWIKEAEGL 250

Query: 245 KNEANK 250
           + +  +
Sbjct: 251 RADKER 256


>gi|326531020|dbj|BAK04861.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 305

 Score =  333 bits (855), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 150/241 (62%), Positives = 185/241 (76%), Gaps = 8/241 (3%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           +PF++FVGYDPRE  AY VC RS+++ +++P+ + P+ Q DLR SGLYWR RG TESTEF
Sbjct: 46  EPFRVFVGYDPREHEAYEVCRRSLIRHATVPLDVRPVRQPDLRASGLYWRARGPTESTEF 105

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELC-------DLIDDKYAIMCVHHDYTPK 130
           SF+RFLTP+LA + GWA+F+DCDFLYLAD+ EL             + A++CV H Y P 
Sbjct: 106 SFTRFLTPFLARYRGWALFIDCDFLYLADVAELIASAVPSDAAAASRLAVVCVKHVYQPL 165

Query: 131 ETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN-KILTPEVVNTQTGAFLHRFQWLEDEEIG 189
           E TKMDGA+QTVYPRKNWSSMVLYNC HPKN   LTP+ V+T+ GAFLHRF WL+D+EIG
Sbjct: 166 EATKMDGAIQTVYPRKNWSSMVLYNCAHPKNVAALTPDAVSTKPGAFLHRFSWLDDDEIG 225

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEAN 249
            +PFVWNFL GHN V  +DP T PKA+HYT GGPWFE ++ CEFADLW+KE EE + +  
Sbjct: 226 EVPFVWNFLVGHNRVDPDDPATQPKALHYTCGGPWFERYRDCEFADLWIKEAEELRADKE 285

Query: 250 K 250
           K
Sbjct: 286 K 286


>gi|414873566|tpg|DAA52123.1| TPA: hypothetical protein ZEAMMB73_071261 [Zea mays]
          Length = 291

 Score =  333 bits (854), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 160/247 (64%), Positives = 189/247 (76%), Gaps = 9/247 (3%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
            +PF++FVGYDPRE  AY VC RS+L+RSSIP+ + PI Q++LR +GLY R RG TESTE
Sbjct: 15  GEPFRVFVGYDPREHEAYEVCRRSLLRRSSIPLDVRPIRQAELRAAGLYTRARGPTESTE 74

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKY--------AIMCVHHDYT 128
           FSF+RFLTPYLAG+ GWA+FVDCDFLYLAD+ +L                A+ CV H+Y 
Sbjct: 75  FSFTRFLTPYLAGYRGWALFVDCDFLYLADLADLLAAAAAADRAAGAGRIAVACVKHEYQ 134

Query: 129 PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN-KILTPEVVNTQTGAFLHRFQWLEDEE 187
           P E TKMDGA+QTVYPRKNWSSMVLYNCGHPKN   LTPE V+TQTGAFLHRF WL+D+E
Sbjct: 135 PAEATKMDGAIQTVYPRKNWSSMVLYNCGHPKNVAALTPEAVSTQTGAFLHRFAWLDDDE 194

Query: 188 IGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNE 247
           IG +PF WNFL GHN V   DP T P+AIHYT GGPWFE ++ CEFADLWLKE EE + +
Sbjct: 195 IGEVPFAWNFLVGHNRVDPADPATRPRAIHYTSGGPWFERYRDCEFADLWLKEAEELRAD 254

Query: 248 ANKTVEK 254
            +K  +K
Sbjct: 255 KDKENDK 261


>gi|168016548|ref|XP_001760811.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688171|gb|EDQ74550.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 226

 Score =  329 bits (843), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 148/225 (65%), Positives = 182/225 (80%), Gaps = 1/225 (0%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            +IFVG+D RE++AY VC  SIL+ ++I V+I P+   +L + GLY R R  TESTEFSF
Sbjct: 1   LRIFVGFDSREEIAYDVCRHSILRHATIDVEIIPLKLQNLVEQGLYARSRESTESTEFSF 60

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFLTP+LAGF+GWAVFVDCDFLY  D++EL +L+DD++AIMCV H+YTPK T KMDG V
Sbjct: 61  TRFLTPFLAGFEGWAVFVDCDFLYTTDVRELAELVDDQFAIMCVKHNYTPKATVKMDGVV 120

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QT Y RKNWSSMVLYNC HPKN++LTPE++N+++GAFLHRF WLED EIG IPF WNFL 
Sbjct: 121 QTSYLRKNWSSMVLYNCSHPKNRVLTPELINSESGAFLHRFMWLEDVEIGEIPFTWNFLV 180

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY 244
           GHN V +    + PKAIH+T GGPWFEAW+ CEFAD WLKE ++Y
Sbjct: 181 GHNEVPDGG-ASLPKAIHFTCGGPWFEAWRDCEFADCWLKERDDY 224


>gi|226504082|ref|NP_001144588.1| uncharacterized protein LOC100277603 [Zea mays]
 gi|195644204|gb|ACG41570.1| hypothetical protein [Zea mays]
          Length = 290

 Score =  325 bits (833), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 157/238 (65%), Positives = 183/238 (76%), Gaps = 9/238 (3%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
            +PF++FVGYDPRE  AY VC RS+L+RSSIP+ + PI Q++LR +GLY R RG TESTE
Sbjct: 15  GEPFRVFVGYDPREHEAYEVCRRSLLRRSSIPLDVRPIRQAELRAAGLYTRARGPTESTE 74

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKY--------AIMCVHHDYT 128
           FSF+RFLTPYLAG+ GWA+FVDCDFLYLAD+ +L                A+ CV H+Y 
Sbjct: 75  FSFTRFLTPYLAGYRGWALFVDCDFLYLADLADLLAAAAAADRAAGAGRIAVACVKHEYQ 134

Query: 129 PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKN-KILTPEVVNTQTGAFLHRFQWLEDEE 187
           P E TKMDGA+QTVYPR+NWSSMVLYNCGHPKN   LTPE V+TQTGAFLHRF WL+D+E
Sbjct: 135 PAEATKMDGAIQTVYPRENWSSMVLYNCGHPKNVAALTPEAVSTQTGAFLHRFAWLDDDE 194

Query: 188 IGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYK 245
           IG +PF WNFL GHN V   DP T P AIHYT GGPWFE ++ CEFADLWLKE EE +
Sbjct: 195 IGEVPFAWNFLVGHNRVDPADPDTRPXAIHYTSGGPWFERYRDCEFADLWLKEAEELR 252


>gi|297737263|emb|CBI26464.3| unnamed protein product [Vitis vinifera]
          Length = 229

 Score =  306 bits (783), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 141/164 (85%), Positives = 153/164 (93%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
           KPFKIFVGYDPREDLAY VC  S+LKRSSIP++ITPI QSDLRKSGLYWRERGQTESTEF
Sbjct: 32  KPFKIFVGYDPREDLAYEVCCHSLLKRSSIPIEITPIKQSDLRKSGLYWRERGQTESTEF 91

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDG 137
           SFSRFLTPYLA ++GWA+FVDCDFLYL DIKEL DL+DD YA+MCV HDYTPKE TKMDG
Sbjct: 92  SFSRFLTPYLANYEGWAMFVDCDFLYLGDIKELRDLVDDNYAMMCVQHDYTPKEKTKMDG 151

Query: 138 AVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQ 181
            VQTVYPRKNWSSMVLYNCGHPKN++LTP+VVN+QTGAFLHRFQ
Sbjct: 152 VVQTVYPRKNWSSMVLYNCGHPKNRVLTPDVVNSQTGAFLHRFQ 195


>gi|163795698|ref|ZP_02189663.1| hypothetical protein BAL199_07348 [alpha proteobacterium BAL199]
 gi|159178994|gb|EDP63529.1| hypothetical protein BAL199_07348 [alpha proteobacterium BAL199]
          Length = 242

 Score =  301 bits (772), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 134/237 (56%), Positives = 170/237 (71%), Gaps = 3/237 (1%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
             PF++FVGYD RED+A+RVC  S+  RSSI + I PIVQS++R  GLYWR +    STE
Sbjct: 8   GSPFRVFVGYDSREDVAFRVCESSLRHRSSIALDIRPIVQSEMRSRGLYWRSKDPLASTE 67

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           F++SRFLTPYLAG+ GWA+F DCDFL+L+DI EL  L D   A+MCVHHD+ P E TKMD
Sbjct: 68  FTYSRFLTPYLAGYQGWALFCDCDFLWLSDIAELVALADPATAVMCVHHDHQPTEATKMD 127

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWN 196
           GAVQT YPRKNWSS++L+NCGHP  + LTPE VNT TGA LHR QW +D +IG++P  WN
Sbjct: 128 GAVQTTYPRKNWSSLMLFNCGHPMTRSLTPETVNTATGAHLHRMQWADDAQIGALPPSWN 187

Query: 197 FLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNEANKTVE 253
           +LEG     E   +  P AIH+TRGGPWF+ W+  ++ D+W  E        N +++
Sbjct: 188 WLEGSMAKPE---SGTPDAIHFTRGGPWFDGWQDVDYGDVWRHERARLPETQNVSIQ 241


>gi|110634464|ref|YP_674672.1| hypothetical protein Meso_2115 [Chelativorans sp. BNC1]
 gi|110285448|gb|ABG63507.1| hypothetical protein Meso_2115 [Chelativorans sp. BNC1]
          Length = 455

 Score =  267 bits (682), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 120/222 (54%), Positives = 160/222 (72%), Gaps = 3/222 (1%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           I++G+D RE +AY V  ++IL R+S+PV++ PI  SDL + G Y R+     STEF++SR
Sbjct: 229 IYIGWDSREPIAYDVAKKTILDRASVPVEVHPIKLSDLVEKGAYTRDIDPLASTEFTYSR 288

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           F TP+LAG+ GWA+F DCDFL+L D+ +L +  D   A++CV HDYTPK T KMDG VQT
Sbjct: 289 FFTPWLAGYKGWALFCDCDFLFLDDVAKLLEYRDSSKAVLCVKHDYTPKATVKMDGKVQT 348

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLEGH 201
            YPRKNWSS +L+NC HP  K LTPEV+N +TGA+LHR QW +DEEIG IP  WN+LEG 
Sbjct: 349 TYPRKNWSSFMLFNCEHPSTKTLTPEVINRETGAYLHRMQWAKDEEIGGIPEAWNWLEGW 408

Query: 202 NMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
           +   E   +  P AIH+T GGPWF+ W++ ++ DLW  E ++
Sbjct: 409 S---EKPESGTPSAIHFTNGGPWFKDWQNVDYGDLWRAEADK 447


>gi|153007761|ref|YP_001368976.1| hypothetical protein Oant_0416 [Ochrobactrum anthropi ATCC 49188]
 gi|404317290|ref|ZP_10965223.1| hypothetical protein OantC_03842 [Ochrobactrum anthropi CTS-325]
 gi|151559649|gb|ABS13147.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188]
          Length = 233

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 122/226 (53%), Positives = 160/226 (70%), Gaps = 3/226 (1%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           ++P +IF+G+D RE +AY V   S LK SSIP++I PI   DL   G+Y RE     STE
Sbjct: 2   SEPLRIFIGWDSREPIAYEVAKASALKHSSIPLEIVPIKLQDLVDRGVYTREVDPLASTE 61

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           F++SRF TP+LAG+ GWA+F DCDFL+L DI  L    D   A+ CV HDY PK++ KMD
Sbjct: 62  FTYSRFFTPWLAGYKGWALFCDCDFLFLGDIAGLQAYNDPSKAVYCVQHDYQPKDSVKMD 121

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWN 196
           G VQT YPRKNWSS +L+NC HP  + LTPE++N++TGA+LHR QW  D+EIG++P  +N
Sbjct: 122 GKVQTSYPRKNWSSCMLFNCEHPSTRQLTPELINSETGAYLHRMQWAADDEIGALPTDYN 181

Query: 197 FLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           +LEG N   E   T  PK +HYTRGGPWF+ W++ ++ADLW  E +
Sbjct: 182 WLEGWN---EKPETGLPKVVHYTRGGPWFKDWQNVDYADLWRAEAD 224


>gi|239831176|ref|ZP_04679505.1| Hypothetical protein OINT_1000354 [Ochrobactrum intermedium LMG
           3301]
 gi|444309472|ref|ZP_21145109.1| hypothetical protein D584_06742 [Ochrobactrum intermedium M86]
 gi|239823443|gb|EEQ95011.1| Hypothetical protein OINT_1000354 [Ochrobactrum intermedium LMG
           3301]
 gi|443487139|gb|ELT49904.1| hypothetical protein D584_06742 [Ochrobactrum intermedium M86]
          Length = 233

 Score =  263 bits (673), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 122/226 (53%), Positives = 159/226 (70%), Gaps = 3/226 (1%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           ++P +IF+G+D RE +AY V   S LK SSIP++I PI   DL + G Y RE     STE
Sbjct: 2   SEPLRIFIGWDSREPIAYDVAKASALKHSSIPLEIVPIKLQDLVEQGAYTREVDPLASTE 61

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           F++SRF TP+LAG+ GWA+F DCDFL+L DI  L    D   A+ CV HDY PK++ KMD
Sbjct: 62  FTYSRFFTPWLAGYKGWALFCDCDFLFLGDIAGLQSYNDPSKAVYCVQHDYQPKDSVKMD 121

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWN 196
           G VQT YPRKNWSS +L+NC HP  + LTPE+VN ++GA+LHR QW  D+EIG++P  +N
Sbjct: 122 GKVQTSYPRKNWSSCMLFNCEHPSTRKLTPELVNRESGAYLHRMQWAADDEIGALPTDYN 181

Query: 197 FLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           +LEG N   E   T  PK +HYTRGGPWF+ W++ ++ADLW  E +
Sbjct: 182 WLEGWN---EKPETGLPKVVHYTRGGPWFKDWQNVDYADLWRAEAD 224


>gi|398831564|ref|ZP_10589742.1| hypothetical protein PMI41_04642 [Phyllobacterium sp. YR531]
 gi|398212271|gb|EJM98880.1| hypothetical protein PMI41_04642 [Phyllobacterium sp. YR531]
          Length = 233

 Score =  261 bits (667), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 116/231 (50%), Positives = 158/231 (68%), Gaps = 3/231 (1%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           ++   I++G+D RE +AY V   +ILK +SIPV + PI+  DL   G Y RE     STE
Sbjct: 2   DRSLDIYIGWDSREPIAYEVAKSTILKNASIPVNVHPIILQDLVDKGAYTREVDPLASTE 61

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           F++SRF TPYLAG+ GWA+F DCDFL+ +D+ EL    D+  A+ CV HDY PK   KMD
Sbjct: 62  FTYSRFFTPYLAGYKGWALFCDCDFLFFSDVAELLQYRDESKAVACVKHDYQPKAGVKMD 121

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWN 196
           G VQT YPRKNWSS +L+NC HP  + LTPE++N ++GA+LHR QW +D+EIG IP  WN
Sbjct: 122 GKVQTTYPRKNWSSFMLFNCEHPSTRQLTPELINRESGAYLHRMQWAQDDEIGEIPTEWN 181

Query: 197 FLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKNE 247
           +LEG N   +   + +P A+H+T GGPWF+ W++ ++ D+W K+  E   E
Sbjct: 182 WLEGWN---DKPASGYPHAVHFTNGGPWFKDWQNVDYGDIWTKKALEIDPE 229


>gi|359789739|ref|ZP_09292674.1| hypothetical protein MAXJ12_10183 [Mesorhizobium alhagi CCNWXJ12-2]
 gi|359254417|gb|EHK57429.1| hypothetical protein MAXJ12_10183 [Mesorhizobium alhagi CCNWXJ12-2]
          Length = 233

 Score =  260 bits (665), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 117/224 (52%), Positives = 158/224 (70%), Gaps = 3/224 (1%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
             I++G+D RE +AY V  +++L R+S+PV + PI  ++L + G Y RE     STEF++
Sbjct: 5   LDIYIGWDSREPIAYDVAKKTLLDRASVPVGVHPIKLAELVEKGAYTREVDPLASTEFTY 64

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           SRF TP+LAG+ GWA+F DCDFL+L D+ EL   +D   A++CV HDYTPK T KMDG V
Sbjct: 65  SRFFTPWLAGYKGWALFCDCDFLFLDDVAELTGYLDPSKAVLCVKHDYTPKATVKMDGKV 124

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QT YPRKNWSS +L+NC HP  K LTPEV+N+QTGA+LHR QW  D+EIG++P  WN+LE
Sbjct: 125 QTNYPRKNWSSFMLFNCEHPSTKTLTPEVINSQTGAYLHRMQWAADDEIGALPERWNWLE 184

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
           G +   E   T  P A+H+T GGPWF+ W++ ++ D W  E  +
Sbjct: 185 GWS---EKPETGTPSAVHFTNGGPWFKDWQNVDYGDEWRAEASK 225


>gi|359409646|ref|ZP_09202114.1| LPS:glycosyltransferase [SAR116 cluster alpha proteobacterium
           HIMB100]
 gi|356676399|gb|EHI48752.1| LPS:glycosyltransferase [SAR116 cluster alpha proteobacterium
           HIMB100]
          Length = 229

 Score =  253 bits (647), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 119/220 (54%), Positives = 153/220 (69%), Gaps = 6/220 (2%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            ++F+GYD RED+AY+VC  SI  R+   V I P++Q +LR SG Y R   +  STEF+F
Sbjct: 1   MRVFIGYDKREDIAYQVCKYSIKSRNQ-NVDIRPLIQQELRDSGWYSRPIDKLASTEFTF 59

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFL P L  F GWAVF+DCD L  +DI EL  L DD YA+MCV HDY PK T KMDG  
Sbjct: 60  TRFLVPELCDFKGWAVFMDCDMLLFSDISELFSLADDNYALMCVKHDYNPKATLKMDGQK 119

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQ--TGAFLHRFQWLEDEEIGSIPFVWNF 197
           QT+YPRKNWSS++L+NC HP N++LT E+VN +  +GA+LHRF WL+DEEIG +P  WN+
Sbjct: 120 QTIYPRKNWSSVMLFNCAHPSNRVLTRELVNKEDISGAYLHRFSWLKDEEIGELPCTWNW 179

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLW 237
           L GH    E+     P  +HYT GGPWFE ++ C + + W
Sbjct: 180 LVGHYQEPEDGK---PDLVHYTEGGPWFENYRFCGYHEEW 216


>gi|399043217|ref|ZP_10737642.1| hypothetical protein PMI09_05213 [Rhizobium sp. CF122]
 gi|398058464|gb|EJL50362.1| hypothetical protein PMI09_05213 [Rhizobium sp. CF122]
          Length = 232

 Score =  253 bits (647), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 113/219 (51%), Positives = 148/219 (67%), Gaps = 3/219 (1%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
             +++G+D RED+AY V   S+L+ +SI V + PI   DL   G+Y R      STEF++
Sbjct: 4   LSVYIGWDSREDIAYEVAKESLLEHASIEVDVHPIKLQDLIDRGVYTRAVDPLASTEFTY 63

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           SRF TP+LA + GWA+F DCDFL+  D+ +L    ++  A+ CVHHDYTPKE  KMDG +
Sbjct: 64  SRFFTPWLADYKGWALFCDCDFLFFGDVADLLQYRNEDQAVACVHHDYTPKEGVKMDGKI 123

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QT YPRKNWSS +L+NCGHP  + LTPE +N +TGAFLHR QW +D EIG IP  WN+LE
Sbjct: 124 QTTYPRKNWSSFMLFNCGHPSTRNLTPERINAETGAFLHRLQWADDSEIGEIPEEWNWLE 183

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
           G +         +PK +HYT GGPWF+ W+  E+A  WL
Sbjct: 184 GWS---SKPSAGYPKGVHYTNGGPWFKNWQDVEYAQKWL 219


>gi|409441201|ref|ZP_11268196.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
 gi|408747496|emb|CCM79393.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
          Length = 232

 Score =  252 bits (644), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 111/219 (50%), Positives = 150/219 (68%), Gaps = 3/219 (1%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
             +++G+D RED+AY+V   S+L+ +SI V + PI   +L   G+Y R      STEF++
Sbjct: 4   LSVYIGWDSREDIAYQVAKESLLEHASIDVDVHPIKLQELVDRGVYTRPVDPLASTEFTY 63

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           SRF TP+LAG+ GWA+F DCDFL+  D+ +L    ++  A+ CVHHDYTPKE  KMDG +
Sbjct: 64  SRFFTPWLAGYKGWALFCDCDFLFFGDVADLLQYRNEDQAVACVHHDYTPKEGVKMDGKI 123

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
           QT YPRKNWSS +L+NC HP  + LTP ++N +TGAFLHR QW +D EIG IP  WN+LE
Sbjct: 124 QTTYPRKNWSSFMLFNCDHPSTRKLTPALINAETGAFLHRLQWADDREIGEIPEEWNWLE 183

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
           G +         +PK +HYT GGPWF+ W+  E+A+ WL
Sbjct: 184 GWS---SKPSAGYPKGVHYTNGGPWFKNWQEVEYAEKWL 219


>gi|412986604|emb|CCO15030.1| unknown [Bathycoccus prasinos]
          Length = 278

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 115/228 (50%), Positives = 153/228 (67%), Gaps = 3/228 (1%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           PF+ +VGYD  ED+ + V   S+ K SS+P+++ P+ + ++RK G+Y R +   +STEF+
Sbjct: 14  PFRCYVGYDSHEDITFEVAKYSMEKHSSVPIQVIPLKREEMRKQGIYTRTQDPKQSTEFT 73

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           + RF  PYL  + GWA+FVD DFL+L DIKEL D IDDKYAIMCV HDY P    K+ G 
Sbjct: 74  YCRFFVPYLQNYKGWAMFVDDDFLWLGDIKELIDQIDDKYAIMCVQHDYKPTVDVKLAGK 133

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-IGSIPFVWNF 197
            Q  YPRKNWSSMVLYNCGHP N+ +   ++N+Q G+FLHRF W+ D+  IG I + WNF
Sbjct: 134 AQEAYPRKNWSSMVLYNCGHPANQGVNLSMINSQPGSFLHRFSWITDDSLIGKIDYEWNF 193

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYK 245
           L       E +    PKAIHYT GGPWF  ++  ++A  W   M+EY+
Sbjct: 194 LVEWYKPYEGERK--PKAIHYTEGGPWFPDYRETDYAKEWFDHMKEYE 239


>gi|222085660|ref|YP_002544190.1| hypothetical protein Arad_1952 [Agrobacterium radiobacter K84]
 gi|398381947|ref|ZP_10540048.1| hypothetical protein PMI03_05703 [Rhizobium sp. AP16]
 gi|221723108|gb|ACM26264.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
 gi|397718339|gb|EJK78929.1| hypothetical protein PMI03_05703 [Rhizobium sp. AP16]
          Length = 228

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 114/219 (52%), Positives = 152/219 (69%), Gaps = 10/219 (4%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFS 80
           +IFVG+D +E +AY V  +SI++RSSIPV  +PIV S+L   G++ RER   +STEFSFS
Sbjct: 7   RIFVGFDSKEVVAYHVLAQSIIERSSIPVVFSPIVLSNL--EGVFTRERNPLQSTEFSFS 64

Query: 81  RFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQ 140
           RFL PYL+ F+GW++F+DCD L  ADI EL  L DD+YA MCV HDY PK  TK  G  Q
Sbjct: 65  RFLVPYLSDFEGWSIFMDCDMLARADIAELWKLRDDRYAAMCVKHDYQPKVETKFLGQTQ 124

Query: 141 TVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNFLE 199
           T Y +KNWSSM+L+N  + K K LT + VNT TG  LH+F+WLE D++IG +P  WN+L 
Sbjct: 125 TKYEKKNWSSMILFN--NAKCKALTKDFVNTATGLQLHQFKWLESDDQIGEVPVTWNYL- 181

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
               V E D     K +H+T GGP+F+ +++ ++AD W 
Sbjct: 182 ----VNEYDRREDAKLVHFTDGGPYFDEYRNDDYADEWF 216


>gi|86358596|ref|YP_470488.1| hypothetical protein RHE_CH02994 [Rhizobium etli CFN 42]
 gi|86282698|gb|ABC91761.1| hypothetical conserved protein [Rhizobium etli CFN 42]
          Length = 229

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 113/228 (49%), Positives = 158/228 (69%), Gaps = 10/228 (4%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           + K  +IFVG+DP+E +AY V  +SI+++SSIPV+ + I  S+L   G++ RER   +ST
Sbjct: 2   EQKTVRIFVGFDPKEVVAYHVLVQSIIEKSSIPVEFSLIALSNL--GGIFTRERNALQST 59

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
           EFSFSRFLTPYL+G++GW++F+DCD L  ADI EL  L D++YA+MCV HDY PK  TK 
Sbjct: 60  EFSFSRFLTPYLSGYEGWSIFMDCDMLMRADIAELWALRDERYAVMCVKHDYQPKIDTKF 119

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-IGSIPFV 194
            G VQT Y +KNWSS +L+N  + + + LTP+ VNT  G  LH+F+WLE+E+ IG +P  
Sbjct: 120 LGQVQTKYEKKNWSSFILFN--NDECRALTPDYVNTAAGLQLHQFKWLENEDLIGELPVT 177

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           WN+L     V E D     K +H+T GGP+FE +K+ ++A+ W    E
Sbjct: 178 WNYL-----VNEYDYREDAKNVHFTDGGPYFEEYKNDDYAEEWFAARE 220


>gi|421591179|ref|ZP_16036078.1| hypothetical protein RCCGEPOP_19183 [Rhizobium sp. Pop5]
 gi|403703417|gb|EJZ19651.1| hypothetical protein RCCGEPOP_19183 [Rhizobium sp. Pop5]
          Length = 229

 Score =  223 bits (569), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 113/228 (49%), Positives = 155/228 (67%), Gaps = 10/228 (4%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           + +  +IFVG+D +E +AY V  +SI+++SSIPV+ +PIV S+L   G++ RER   +ST
Sbjct: 2   EQQTVRIFVGFDSKEVVAYHVLAQSIIEKSSIPVEFSPIVLSNL--GGVFTRERNALQST 59

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
           EFSFSRFLTPYL+G+ GW++F+DCD L   DI EL  L DD+YA MCV HDY PK  TK 
Sbjct: 60  EFSFSRFLTPYLSGYQGWSIFMDCDMLMRVDIAELWALRDDRYAAMCVKHDYQPKIETKF 119

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFV 194
            G  QT Y +KNWSS +L+N  + K + LTP+ VNT TG  LH+F+WL+ D+ IG +P  
Sbjct: 120 LGQTQTKYEKKNWSSFILFN--NDKCRALTPDFVNTATGLQLHQFKWLDNDDLIGELPVT 177

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           WN+L     V E D     K +H+T GGP+FE +K+ ++A+ W    E
Sbjct: 178 WNYL-----VNEYDYREDAKNVHFTDGGPYFEEYKNDDYAEEWFAARE 220


>gi|424885395|ref|ZP_18309006.1| hypothetical protein Rleg10DRAFT_5896 [Rhizobium leguminosarum bv.
           trifolii WSM2012]
 gi|393177157|gb|EJC77198.1| hypothetical protein Rleg10DRAFT_5896 [Rhizobium leguminosarum bv.
           trifolii WSM2012]
          Length = 229

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 114/223 (51%), Positives = 154/223 (69%), Gaps = 10/223 (4%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFS 80
           +IFVG+DP+E +AY V  +SI+++SSIPV+ +PI  S+L   G++ RER   +STEFSFS
Sbjct: 7   RIFVGFDPKEVVAYHVLVQSIIEKSSIPVEFSPIALSNL--GGIFTRERNALQSTEFSFS 64

Query: 81  RFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQ 140
           RFL PYL+ + GW++F+DCD L  ADI EL  L DD+YAIMCV HDY PK  TK  G  Q
Sbjct: 65  RFLAPYLSDYRGWSLFMDCDMLMRADIAELWALRDDRYAIMCVKHDYQPKIDTKFLGQTQ 124

Query: 141 TVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-IGSIPFVWNFLE 199
           T Y +KNWSS +L+N  + + + LTP+ VNT TG  LH+F+WLE+E+ IG +P  WN+L 
Sbjct: 125 TKYEKKNWSSFILFN--NDECRALTPDYVNTATGLQLHQFKWLENEDLIGELPVTWNYL- 181

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
               V E D     K IH+T GGP+FE +K+ ++A+ W    E
Sbjct: 182 ----VNEYDYREDAKNIHFTDGGPYFEEYKNDDYAEEWFAARE 220


>gi|424896200|ref|ZP_18319774.1| hypothetical protein Rleg4DRAFT_2096 [Rhizobium leguminosarum bv.
           trifolii WSM2297]
 gi|393180427|gb|EJC80466.1| hypothetical protein Rleg4DRAFT_2096 [Rhizobium leguminosarum bv.
           trifolii WSM2297]
          Length = 229

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 112/228 (49%), Positives = 156/228 (68%), Gaps = 10/228 (4%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           + +  +IFVG+DP+E +AY V  +SI+++SSIPV+ +PI  S+L   G++ RER   +ST
Sbjct: 2   EQQNVRIFVGFDPKEVVAYHVLVQSIIEKSSIPVEFSPIALSNL--GGIFTRERNTLQST 59

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
           EFSFSRFLTPYL+ + GW++F+DCD L  ADI EL  L D +YA+MCV HDY PK  TK 
Sbjct: 60  EFSFSRFLTPYLSDYQGWSIFMDCDMLMRADIAELWALRDHRYAVMCVKHDYQPKVETKF 119

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-IGSIPFV 194
            G  QT Y +KNWSS +L+N  + + + LTP+ VNT TG  LH+F+WLE+E+ IG +P  
Sbjct: 120 LGQTQTKYEKKNWSSFILFN--NDECRALTPDYVNTATGLQLHQFKWLENEDLIGELPVT 177

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           WN+L     V E D     K +H+T GGP+FE +K+ ++A+ W    E
Sbjct: 178 WNYL-----VNEYDYREDAKIVHFTDGGPYFEEYKNDDYAEEWFATRE 220


>gi|420238957|ref|ZP_14743321.1| hypothetical protein PMI07_01064 [Rhizobium sp. CF080]
 gi|398083459|gb|EJL74165.1| hypothetical protein PMI07_01064 [Rhizobium sp. CF080]
          Length = 230

 Score =  219 bits (557), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 108/229 (47%), Positives = 154/229 (67%), Gaps = 10/229 (4%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           D K   IF+GYD +E +AY V  +S++K S++P+KITPI    ++   ++ R+R   +ST
Sbjct: 3   DQKTVNIFIGYDSKEVVAYHVLSQSLIKHSTLPLKITPIALQHMQ--NIFKRDRNNLQST 60

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
           EFSFSRFLTPY++ ++GW++F+DCD L  ADI EL  L DD Y++MC  HDY PKE TK 
Sbjct: 61  EFSFSRFLTPYISDYEGWSIFMDCDMLARADIAELWSLRDDNYSVMCCKHDYQPKEDTKF 120

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFV 194
            G VQT Y +KNWSS++++N  + + + LTPE V+T+TG  LH+F+WLE D  IG +P  
Sbjct: 121 LGQVQTKYEKKNWSSVMMFN--NKRCQSLTPEYVSTRTGLELHQFKWLEGDHLIGDLPLK 178

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
           WN L G   V ++D       +HYT GGP+F  + +C++AD W   + E
Sbjct: 179 WNHLVG---VYDHDQNA--ALVHYTEGGPYFNDYANCDYADEWRSLLGE 222


>gi|147788154|emb|CAN78229.1| hypothetical protein VITISV_020275 [Vitis vinifera]
          Length = 121

 Score =  218 bits (555), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 101/120 (84%), Positives = 106/120 (88%), Gaps = 1/120 (0%)

Query: 135 MDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFV 194
           MDG VQTVYPRKNWSSMVLYNCGHPKN++LTP+VVN+QTGAFLHRFQWLED EIGSIPFV
Sbjct: 1   MDGVVQTVYPRKNWSSMVLYNCGHPKNRVLTPDVVNSQTGAFLHRFQWLEDHEIGSIPFV 60

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEY-KNEANKTVE 253
           WNFL GHN   ENDP+TFPKAIHYT GGPWFEAWK CEF DLWL EMEEY K EANK  E
Sbjct: 61  WNFLVGHNKAEENDPSTFPKAIHYTLGGPWFEAWKDCEFGDLWLNEMEEYKKKEANKKTE 120


>gi|53804386|ref|YP_113969.1| hypothetical protein MCA1513 [Methylococcus capsulatus str. Bath]
 gi|53758147|gb|AAU92438.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath]
          Length = 232

 Score =  217 bits (552), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 111/224 (49%), Positives = 150/224 (66%), Gaps = 10/224 (4%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P +IF+GYDPRE LA+ V   SI  R+S PV ITP++ S L +  +Y RER   +ST+FS
Sbjct: 6   PIRIFIGYDPREALAFSVLAHSIHVRASRPVSITPLMLSQLGQ--VYRRERNPLQSTDFS 63

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           FSRFL PYL+GF GW+VF+DCD L L DI +L DL D++YA+  V HD+ P E  K   A
Sbjct: 64  FSRFLVPYLSGFSGWSVFLDCDMLVLDDIAKLWDLRDERYAVQVVKHDHVPAEEIKFLDA 123

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-IGSIPFVWNF 197
            QT Y +KNWSS++L++  + K K LTP+ VNT +G  LH+F+WL+DE  IG IP  WN 
Sbjct: 124 AQTRYEKKNWSSVMLFD--NAKCKALTPDYVNTASGLELHQFKWLDDEALIGEIPHRWNH 181

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEM 241
           L G++      P+      HYT GGP+FE ++ CE+++ W  E+
Sbjct: 182 LVGYDT-----PSPEVSLAHYTIGGPYFEEYRDCEYSEEWRAEL 220


>gi|222148348|ref|YP_002549305.1| hypothetical protein Avi_1814 [Agrobacterium vitis S4]
 gi|221735336|gb|ACM36299.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 229

 Score =  217 bits (552), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 114/228 (50%), Positives = 153/228 (67%), Gaps = 11/228 (4%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           DNK  +IF+G+D +E +AY V  +SIL+ SSIPV  TPIV  +L  +G++ RER   +ST
Sbjct: 3   DNK-LQIFIGFDTKEVVAYHVLAQSILEHSSIPVAFTPIVLDNL--AGIFTRERNPLQST 59

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
           EFSFSRFL PYL+ + GW++F+DCD L  AD+ EL  L DD+YA+MCV HDY PK  TK 
Sbjct: 60  EFSFSRFLVPYLSDYQGWSMFMDCDMLARADLAELWALRDDRYAVMCVKHDYQPKVETKF 119

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFV 194
            G  QT Y +KNWSS++L+N  + K   LT + VNT TG  LH+F+WLE D+ IG +P  
Sbjct: 120 LGQTQTKYEKKNWSSVMLFN--NAKCTALTKDYVNTATGLQLHQFKWLENDDLIGGLPLH 177

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           WN+L     V E D     K +H+T GGP+FE +K+ ++A+ W    E
Sbjct: 178 WNYL-----VNEYDHDPAAKIVHFTDGGPYFEQYKNDDYAEEWFAARE 220


>gi|440226321|ref|YP_007333412.1| hypothetical protein RTCIAT899_CH07330 [Rhizobium tropici CIAT 899]
 gi|440037832|gb|AGB70866.1| hypothetical protein RTCIAT899_CH07330 [Rhizobium tropici CIAT 899]
          Length = 228

 Score =  216 bits (550), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 112/225 (49%), Positives = 152/225 (67%), Gaps = 11/225 (4%)

Query: 15  KDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTES 74
           KDN   +IFVG+D +E +AY V  +SI++RSSIPV  +PIV ++L   G++ RER   +S
Sbjct: 2   KDNVA-RIFVGFDSKEVVAYHVLTQSIIERSSIPVVFSPIVLNNL--EGVFTRERNPLQS 58

Query: 75  TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTK 134
           TEFSFSRFL PYL+ + GW++F+DCD L   DI EL  L DD+YA MCV HDY PK  TK
Sbjct: 59  TEFSFSRFLVPYLSDYQGWSIFMDCDMLARTDIAELWALRDDRYAAMCVKHDYQPKVETK 118

Query: 135 MDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPF 193
             G  QT Y +KNWSSM+L+N  + K   LT + VNT TG  LH+F+WLE D++IG +P 
Sbjct: 119 FLGQTQTKYEKKNWSSMILFN--NAKCTALTKDFVNTATGLQLHQFKWLENDDQIGEVPV 176

Query: 194 VWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
            WN+L     V E D     K +H+T GGP+F+ +++ ++A+ W 
Sbjct: 177 TWNYL-----VNEYDYRDDAKLVHFTDGGPYFDEYRNDDYAEEWF 216


>gi|254468847|ref|ZP_05082253.1| conserved hypothetical protein [beta proteobacterium KB13]
 gi|207087657|gb|EDZ64940.1| conserved hypothetical protein [beta proteobacterium KB13]
          Length = 223

 Score =  214 bits (544), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 107/224 (47%), Positives = 143/224 (63%), Gaps = 9/224 (4%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           ++VGYD RE +AY V   S++K +SIPVKITP+V S L++    + E  Q  S +F +SR
Sbjct: 4   LYVGYDEREAIAYHVFCHSVIKNTSIPVKITPLVLSQLKE----FNETHQDRSNDFVYSR 59

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           FLTPYL  F+GWA+F D D +  AD+KEL  + D   A+M V HDY  K + K  G +  
Sbjct: 60  FLTPYLNEFNGWAIFADGDMICQADLKELIGMADPNKALMVVKHDYQTKASIKYLGNINE 119

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLEGH 201
            YPRKNWSS++L+NC HPK+KILTPE V+ QTG FLHRF WL+D +IG +P  WN+L   
Sbjct: 120 NYPRKNWSSVILWNCSHPKHKILTPEFVSNQTGKFLHRFSWLDDNDIGELPVEWNWL--- 176

Query: 202 NMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYK 245
               E +     K IHYT G P F+ ++  + A++W    E  K
Sbjct: 177 --ACEYEKNADAKLIHYTLGTPCFKDFRDTDMAEIWYDYYESAK 218


>gi|144900377|emb|CAM77241.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense
           MSR-1]
          Length = 229

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 106/224 (47%), Positives = 147/224 (65%), Gaps = 8/224 (3%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            +IF+G+DPRE +A+ V   SIL+R+S P  + P+  + L   G  WRER   +ST+FSF
Sbjct: 2   LRIFIGFDPREAVAFNVLQYSILRRASQPTAVIPLTLNQL--GGKMWRERNPLQSTDFSF 59

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           SRFLTP+L+ ++GW++F+DCD L L D+  L  L DDKYA+  + HD+ P E  K  GAV
Sbjct: 60  SRFLTPHLSNYEGWSLFIDCDMLVLDDVANLFALADDKYAVQVIKHDHRPVEDEKFLGAV 119

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-IGSIPFVWNFL 198
           QT Y +KNWSS+VL+N  + K K LTPE VNT +G  LH+F+WL D+  IG +P  W  L
Sbjct: 120 QTKYEKKNWSSVVLFN--NAKCKALTPEYVNTASGLDLHQFKWLGDDALIGELPHRWGHL 177

Query: 199 EGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
             ++  V  D  +    IHYT GGP+F+ ++ C +ADLW  E +
Sbjct: 178 VDYDAEVPVDQLSL---IHYTIGGPYFDDYRDCGYADLWNAERD 218


>gi|428219227|ref|YP_007103692.1| hypothetical protein Pse7367_3014 [Pseudanabaena sp. PCC 7367]
 gi|427991009|gb|AFY71264.1| hypothetical protein Pse7367_3014 [Pseudanabaena sp. PCC 7367]
          Length = 225

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 106/225 (47%), Positives = 150/225 (66%), Gaps = 10/225 (4%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
             +++GYDPRE +A+ V   SI  R+S PV+I P++ S L   G+  R+R   +ST+FSF
Sbjct: 2   INVYIGYDPREAIAFHVLAHSIHCRASQPVQIVPLMLSQL--GGIMTRDRHPLQSTDFSF 59

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           SRFLTPYL+G+ GW++F+DCD L L D+  L  L DD+YA+M V HDYTP+E+TK  G  
Sbjct: 60  SRFLTPYLSGYQGWSIFMDCDMLMLDDVANLWALCDDRYAVMVVKHDYTPQESTKFLGQA 119

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-IGSIPFVWNFL 198
           QT Y +KNWSS++L+N  + K + L+P  VN  TG  LH+F+WLE E  IG +P  WN L
Sbjct: 120 QTKYVKKNWSSVMLFN--NAKCRSLSPNYVNGATGLNLHQFKWLESEALIGDLPKRWNHL 177

Query: 199 EGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
            G   V + DP      +HYT GGP+F+ +  C++A+ W  E+++
Sbjct: 178 VG---VYDYDPDV--SLVHYTLGGPYFKEYSDCDYAERWNAELQK 217


>gi|90417777|ref|ZP_01225689.1| conserved hypothetical protein [Aurantimonas manganoxydans
           SI85-9A1]
 gi|90337449|gb|EAS51100.1| conserved hypothetical protein [Aurantimonas manganoxydans
           SI85-9A1]
          Length = 230

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 106/223 (47%), Positives = 152/223 (68%), Gaps = 10/223 (4%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFS 80
           ++F+G+D +E +AY V  +SIL+RSS+PV ++P+   +L   G + RER   +STEFSFS
Sbjct: 7   RVFIGFDSKEVIAYHVLCQSILERSSVPVVLSPLYLPNLE--GAFQRERNALQSTEFSFS 64

Query: 81  RFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQ 140
           RFL PYL+GF GW++F+DCD L  ADI EL  L DD++++MCV HDY PK  TK  G  Q
Sbjct: 65  RFLVPYLSGFTGWSLFMDCDMLMRADIAELWALRDDRHSVMCVKHDYQPKVETKFLGQTQ 124

Query: 141 TVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNFLE 199
           T Y +KNWSS++L+N  + K + LTP+ VNT TG  LH+F+WL+ DE IG +P  WN+L 
Sbjct: 125 TKYEKKNWSSVMLFN--NAKCQALTPDYVNTATGLQLHQFKWLDGDENIGELPAEWNWL- 181

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
               V E D     + +H+T GGP+F+ +++ ++A+ W    E
Sbjct: 182 ----VNEYDHNPDARNVHFTDGGPYFDEYRNDDYAEEWFAARE 220


>gi|405379789|ref|ZP_11033636.1| LPS:glycosyltransferase [Rhizobium sp. CF142]
 gi|397323819|gb|EJJ28210.1| LPS:glycosyltransferase [Rhizobium sp. CF142]
          Length = 228

 Score =  211 bits (536), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 104/222 (46%), Positives = 153/222 (68%), Gaps = 10/222 (4%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           I+VG+D +E +A+ V  +SIL++SS+PV+ITPI   ++  +  + RER   +STEFSFSR
Sbjct: 7   IYVGFDSKEVVAHHVLSQSILEKSSVPVRITPIYLRNIENT--FTRERNALQSTEFSFSR 64

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           FLTP+L+ ++GW++F+DCD L   DI EL  L D++Y++MCV HDY PK  TK  G VQT
Sbjct: 65  FLTPHLSQYEGWSLFMDCDMLARTDIAELWALRDERYSVMCVKHDYQPKTETKFLGQVQT 124

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNFLEG 200
            Y +KNWSS++L+N  + K + LTP+ VNT TG  LH+F+WL+ D+ IG +P  WN+L  
Sbjct: 125 KYEKKNWSSLILFN--NAKCRALTPDYVNTATGLQLHQFKWLDGDDLIGELPITWNYL-- 180

Query: 201 HNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
              V E D     K +H+T GGP+F+ +++ ++A+ W    E
Sbjct: 181 ---VNEYDKRDDAKIVHFTDGGPYFDEYRNDDYAEEWFATRE 219


>gi|148241387|ref|YP_001226544.1| hypothetical protein SynRCC307_0288 [Synechococcus sp. RCC307]
 gi|147849697|emb|CAK27191.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 233

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 100/212 (47%), Positives = 135/212 (63%), Gaps = 7/212 (3%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
            +P  IF+G DPRE  A  V   S++  SS P+ ITP+V   L +  LYWR+R   +ST 
Sbjct: 3   QQPISIFIGMDPRERAATNVLIDSLVDTSSAPLAITPLVTPQLEQQKLYWRQRDPKQSTA 62

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           FSF+RFL P L GF GWA+F+DCD L  ADI +L  L D+++A+MCV HD+ P E +K  
Sbjct: 63  FSFTRFLVPQLMGFQGWAIFMDCDMLCRADISDLWALRDERFAVMCVQHDHQPTEASKFL 122

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVW 195
           G  Q+ YP+KNWSS++L NC   + + LTPE VN+ +G  LHRF WL+ D +IG++P  W
Sbjct: 123 GETQSAYPKKNWSSLMLLNCS--RCQALTPEYVNSASGLDLHRFHWLDGDHQIGALPNRW 180

Query: 196 NFLEGHNMVVENDPTTFPKAIHYTRGGPWFEA 227
           N L G    V+   +     +H+T GGPWF  
Sbjct: 181 NHLVG----VQEPASEEAALLHWTLGGPWFRG 208


>gi|78211692|ref|YP_380471.1| hypothetical protein Syncc9605_0140 [Synechococcus sp. CC9605]
 gi|78211731|ref|YP_380510.1| hypothetical protein Syncc9605_0179 [Synechococcus sp. CC9605]
 gi|78196151|gb|ABB33916.1| conserved hypothetical protein [Synechococcus sp. CC9605]
 gi|78196190|gb|ABB33955.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 236

 Score =  209 bits (533), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 108/212 (50%), Positives = 137/212 (64%), Gaps = 9/212 (4%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           ++P  IF+GYDPRE  A  V   S+ + SS+P+ ITP+V   L   GL+ RER   +ST 
Sbjct: 5   SQPIPIFIGYDPRERAATNVLIDSLYQNSSVPLAITPLVTPQLEAQGLFRRERDPKQSTA 64

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           FSF+RFL PYL G++GWA+F+DCD L  ADIK L D  DD Y  MCV H++ P ET K  
Sbjct: 65  FSFTRFLVPYLMGYEGWALFMDCDMLCRADIKALWDQRDDAYGAMCVQHEHVPGETVKFL 124

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVW 195
           G VQ+ YP+KNWSS++L NC   +   LTP+ VNT TG  LHRF WLE D EIG+I   W
Sbjct: 125 GEVQSPYPKKNWSSLMLLNCS--RCTKLTPDYVNTATGLELHRFHWLEGDHEIGAIQGGW 182

Query: 196 NFLEGHNMVVENDPTTFPKA--IHYTRGGPWF 225
           N    H + V+  P +   A  +H+T GGPWF
Sbjct: 183 N----HLVDVQAPPESEQAAPMLHWTLGGPWF 210


>gi|78183792|ref|YP_376226.1| hypothetical protein Syncc9902_0208 [Synechococcus sp. CC9902]
 gi|78168086|gb|ABB25183.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 236

 Score =  206 bits (523), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 107/210 (50%), Positives = 133/210 (63%), Gaps = 9/210 (4%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P  IF+GYDPRE  A  V   S+ + SS+P+ ITP+V   L   GL+ RER   +ST FS
Sbjct: 7   PIPIFIGYDPRERAATNVLIDSLYQHSSVPLAITPLVTPQLENQGLFQRERDPKQSTAFS 66

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFL PYL  + GWA+F+DCD L  ADIK L D  DD+Y  MCV H++ P ET K  G 
Sbjct: 67  FTRFLVPYLMEYQGWALFMDCDMLCQADIKALWDQRDDRYGAMCVQHEHVPGETVKFLGE 126

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNF 197
           VQ+ YP+KNWSS++L NC   KN  LT + VNT +G  LHRF WLE D EIG+I   WN 
Sbjct: 127 VQSAYPKKNWSSLMLLNCSRCKN--LTVDYVNTASGLELHRFHWLEGDHEIGAIEGGWN- 183

Query: 198 LEGHNMVVENDPTTFPKA--IHYTRGGPWF 225
              H + V+  P     A  +H+T GGPWF
Sbjct: 184 ---HLVDVQRPPAQEEPAPMLHWTLGGPWF 210


>gi|148238568|ref|YP_001223955.1| hypothetical protein SynWH7803_0232 [Synechococcus sp. WH 7803]
 gi|147847107|emb|CAK22658.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 238

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 105/213 (49%), Positives = 135/213 (63%), Gaps = 12/213 (5%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P  IF+GYDPRE  A  V   S+ + SS P+ ITP+V   L   GLY R+R   +ST FS
Sbjct: 6   PIPIFIGYDPRERAATNVLIDSLYQHSSSPLAITPLVTPQLEAQGLYRRQRDPKQSTAFS 65

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFL PYL G++GWAVF+DCD L   DIK+L D  DD+Y  MCV H++ P ET K  G 
Sbjct: 66  FTRFLVPYLMGYEGWAVFMDCDMLCRGDIKQLWDQRDDRYGAMCVQHEHVPGETVKFLGE 125

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNF 197
           VQ+ YP+KNWSS++L NC   +   L+ + VNT +G  LHRF WLE D EIG++   WN 
Sbjct: 126 VQSAYPKKNWSSLMLLNCS--RCTKLSVDYVNTASGLELHRFHWLEGDHEIGALDGGWN- 182

Query: 198 LEGHNMVVENDPTTF-----PKAIHYTRGGPWF 225
              H + V+  PT       P+ +H+T GGPWF
Sbjct: 183 ---HLVDVQAPPTAAAQEGGPRLLHWTLGGPWF 212


>gi|87124522|ref|ZP_01080371.1| hypothetical protein RS9917_12950 [Synechococcus sp. RS9917]
 gi|86168094|gb|EAQ69352.1| hypothetical protein RS9917_12950 [Synechococcus sp. RS9917]
          Length = 233

 Score =  204 bits (518), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 103/222 (46%), Positives = 143/222 (64%), Gaps = 9/222 (4%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFS 80
           ++F+G+DPRED+A  V   S+ + +S+PV+I  I  S LR   +Y R     +STEFSFS
Sbjct: 9   RVFIGFDPREDVAVNVLTDSMQRHASVPVQIAQIRLSQLRS--VYTRPHNPLQSTEFSFS 66

Query: 81  RFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQ 140
           RFL P+L G++GWA+F+D D L L D+ EL  L DD+YA+  V H +  +  TK  G  Q
Sbjct: 67  RFLVPWLCGYEGWALFIDADMLCLGDLAELWALRDDRYAVQLVKHQHDCESGTKFQGMPQ 126

Query: 141 TVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLEG 200
           T Y RKNWSS++L+NCG  + + LTP+VVNT +G  LH+FQWLED+ IG +P  WN L G
Sbjct: 127 TPYRRKNWSSVMLFNCG--RCRALTPQVVNTASGLELHQFQWLEDDAIGELPPQWNVLVG 184

Query: 201 HNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
              V E+      + +HYT GGPWF+  +    ++LW +  E
Sbjct: 185 VQDVPED-----ARILHYTLGGPWFDDCQTMPRSELWTQARE 221


>gi|88808106|ref|ZP_01123617.1| hypothetical protein WH7805_08086 [Synechococcus sp. WH 7805]
 gi|88788145|gb|EAR19301.1| hypothetical protein WH7805_08086 [Synechococcus sp. WH 7805]
          Length = 238

 Score =  204 bits (518), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 103/213 (48%), Positives = 134/213 (62%), Gaps = 12/213 (5%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P  IF+GYDPRE  A  V   S+ + S+ P+ ITP+V   L   GLY R+R   +ST FS
Sbjct: 6   PIPIFIGYDPRERAATNVLIDSLYQHSTSPLAITPLVTPQLEAQGLYRRQRDPKQSTAFS 65

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFL P+L G+ GWA+F+DCD L   DIK+L D  DD+Y  MCV H++ P ET K  G 
Sbjct: 66  FTRFLVPHLMGYQGWAIFMDCDMLCRGDIKQLWDQRDDRYGAMCVQHEHVPGETVKFLGE 125

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNF 197
           VQ+ YP+KNWSS++L NC   +   LT + VN  TG  LHRF WLE D EIG++   WN 
Sbjct: 126 VQSAYPKKNWSSLMLLNCS--RCTALTVDYVNRATGLELHRFHWLEGDHEIGALDAGWN- 182

Query: 198 LEGHNMVVENDPTTF-----PKAIHYTRGGPWF 225
              H + V++ PT       P+ +H+T GGPWF
Sbjct: 183 ---HLVDVQDAPTAAAEEGGPRLLHWTLGGPWF 212


>gi|116074465|ref|ZP_01471727.1| hypothetical protein RS9916_38482 [Synechococcus sp. RS9916]
 gi|116069770|gb|EAU75522.1| hypothetical protein RS9916_38482 [Synechococcus sp. RS9916]
          Length = 234

 Score =  203 bits (517), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 106/210 (50%), Positives = 133/210 (63%), Gaps = 9/210 (4%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P  IF+GYDPRE  A  V   S+ + SS P+ ITP+V   L   GLY RER   +ST FS
Sbjct: 5   PIPIFIGYDPRERAATNVLIDSLYQHSSAPLAITPLVTPQLEAQGLYRRERDPKQSTAFS 64

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFL P+L G+ GWA+F+DCD L  ADI+ L D  DD+Y  MCV H++ P ET K  G 
Sbjct: 65  FTRFLVPHLMGYQGWALFMDCDMLCRADIQALWDQRDDRYGAMCVQHEHVPGETVKFLGE 124

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNF 197
           VQ+ YP+KNWSS++L NC   +   LT + VNT TG  LHRF WLE D EIG+I   WN 
Sbjct: 125 VQSAYPKKNWSSLMLLNCS--RCTKLTVDYVNTATGLELHRFHWLEGDHEIGAIQGGWN- 181

Query: 198 LEGHNMVVENDPTTFPKA--IHYTRGGPWF 225
              H + V+  P T   +  +H+T GGPWF
Sbjct: 182 ---HLVDVQAPPETQQASPMLHWTLGGPWF 208


>gi|33864719|ref|NP_896278.1| hypothetical protein SYNW0183 [Synechococcus sp. WH 8102]
 gi|33632242|emb|CAE06698.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 256

 Score =  203 bits (517), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 101/209 (48%), Positives = 132/209 (63%), Gaps = 5/209 (2%)

Query: 18  KPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEF 77
            P  IF+GYDPRE  A  V   S+ + SS+P+ ITP+V   L   GLY RER   +ST F
Sbjct: 26  SPIPIFIGYDPRERAATNVLIDSLYQLSSVPLAITPLVTPQLEAQGLYQRERDPKQSTAF 85

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDG 137
           SF+RFL P+L G++GWA+F+DCD L   DIK+L D  DD+Y  MCV H++ P ET K  G
Sbjct: 86  SFTRFLVPHLMGYEGWALFMDCDMLCRGDIKQLWDQRDDRYGAMCVQHEHVPGETVKFLG 145

Query: 138 AVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWN 196
            VQ+ YP+KNWSS++L NC   +   LT + VN+ +G  LHRF WL  D EIG+I   WN
Sbjct: 146 EVQSAYPKKNWSSLMLLNCS--RCSTLTVDYVNSASGLELHRFHWLAGDHEIGAIQGGWN 203

Query: 197 FLEGHNMVVENDPTTFPKAIHYTRGGPWF 225
            L   ++    +P      +H+T GGPWF
Sbjct: 204 HLV--DVQAPPEPLDASPMLHWTLGGPWF 230


>gi|116071599|ref|ZP_01468867.1| hypothetical protein BL107_05604 [Synechococcus sp. BL107]
 gi|116065222|gb|EAU70980.1| hypothetical protein BL107_05604 [Synechococcus sp. BL107]
          Length = 236

 Score =  203 bits (516), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 106/212 (50%), Positives = 134/212 (63%), Gaps = 9/212 (4%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           + P  IF+GYDPRE  A  V   S+ + SS+P+ ITP+V   L   GL+ RER   +ST 
Sbjct: 5   HPPIPIFIGYDPRERAATNVLIDSLYQHSSVPLAITPLVTPQLENQGLFHRERDPKQSTA 64

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           FSF+RFL PYL G+ GWA+F+DCD L  ADIK L D  DD+Y  MCV H++ P ET K  
Sbjct: 65  FSFTRFLVPYLMGYQGWALFMDCDMLCQADIKALWDQRDDRYGAMCVQHEHVPGETVKFL 124

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVW 195
           G VQ+ YP+KNWSS++L NC    N  LT + VNT +G  LHRF WL+ D EIG+I   W
Sbjct: 125 GEVQSPYPKKNWSSLMLLNCSRCTN--LTVDYVNTASGLELHRFHWLKGDHEIGAIQGGW 182

Query: 196 NFLEGHNMVVENDPTTFPKA--IHYTRGGPWF 225
           N    H + V+  P     A  +H+T GGPWF
Sbjct: 183 N----HLVDVQRPPEQEEPAPMLHWTLGGPWF 210


>gi|87302001|ref|ZP_01084835.1| hypothetical protein WH5701_01710 [Synechococcus sp. WH 5701]
 gi|87283569|gb|EAQ75524.1| hypothetical protein WH5701_01710 [Synechococcus sp. WH 5701]
          Length = 250

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 103/217 (47%), Positives = 133/217 (61%), Gaps = 12/217 (5%)

Query: 15  KDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTES 74
           ++  P  IF+GYDPRE  A  V   S+ + SS P+ ITP+V   L + GLY RER   +S
Sbjct: 14  ENPSPIPIFIGYDPRERAATTVLIDSLYQHSSAPLAITPLVTPQLERQGLYRRERDPKQS 73

Query: 75  TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTK 134
           T FSF+RFL P L G+ GWA+F+DCD L   DI EL     D YA+MCV H++ P ET K
Sbjct: 74  TAFSFTRFLVPALMGYQGWAIFMDCDMLCRGDIAELWAERSDDYALMCVQHEHVPAETVK 133

Query: 135 MDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPF 193
             G VQ+ YP+KNWSS++L NC   + + LTP+ VN+ +G  LHRF WL  D EIG +  
Sbjct: 134 FLGEVQSAYPKKNWSSLMLLNCD--RCQALTPDYVNSASGLELHRFHWLAGDHEIGGLDG 191

Query: 194 VWNFLEGHNMVVENDPTTF-----PKAIHYTRGGPWF 225
            WN    H + V+  PT       P  +H+T GGPWF
Sbjct: 192 RWN----HLVDVQAAPTAAVAEGGPTLLHWTLGGPWF 224


>gi|418399509|ref|ZP_12973058.1| hypothetical protein SM0020_05340 [Sinorhizobium meliloti
           CCNWSX0020]
 gi|359506602|gb|EHK79115.1| hypothetical protein SM0020_05340 [Sinorhizobium meliloti
           CCNWSX0020]
          Length = 229

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 102/218 (46%), Positives = 148/218 (67%), Gaps = 10/218 (4%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           I++G+D +E +AY V  +SIL++SSIPV  TPI  +++    ++ RER   +STEFSFSR
Sbjct: 7   IYIGFDSKEVVAYHVLCQSILEKSSIPVSFTPISLNNVE--NIFTRERNPLQSTEFSFSR 64

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           FL PYL+ ++GW++F DCD L   DI +L +L DD+YA MCV HDY PK  TK  G VQT
Sbjct: 65  FLVPYLSRYEGWSLFADCDMLMRTDIAQLWELRDDRYAAMCVKHDYVPKVETKFLGQVQT 124

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNFLEG 200
            Y +KNWSS++L+N  + K + L  + VNT +G  LH+F+WLE D+ IG +P  WN+L  
Sbjct: 125 KYEKKNWSSVILFN--NAKCRKLDLDYVNTASGLELHQFKWLESDDLIGELPASWNWL-- 180

Query: 201 HNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
              V E + +   K +H+T GGP+F+ +K+ ++A+ W 
Sbjct: 181 ---VNEYEYSPDAKLVHFTDGGPYFDEYKNDDYAEEWF 215


>gi|16264294|ref|NP_437086.1| hypothetical protein SM_b20806 [Sinorhizobium meliloti 1021]
 gi|334321078|ref|YP_004557707.1| hypothetical protein [Sinorhizobium meliloti AK83]
 gi|384533565|ref|YP_005716229.1| hypothetical protein [Sinorhizobium meliloti BL225C]
 gi|384539305|ref|YP_005723389.1| hypothetical protein SM11_pD1056 [Sinorhizobium meliloti SM11]
 gi|433611286|ref|YP_007194747.1| hypothetical protein C770_GR4pD1041 [Sinorhizobium meliloti GR4]
 gi|15140431|emb|CAC48946.1| hypothetical protein SM_b20806 [Sinorhizobium meliloti 1021]
 gi|333815741|gb|AEG08408.1| hypothetical protein SinmeB_4118 [Sinorhizobium meliloti BL225C]
 gi|334098817|gb|AEG56827.1| hypothetical protein Sinme_5205 [Sinorhizobium meliloti AK83]
 gi|336037958|gb|AEH83888.1| hypothetical protein SM11_pD1056 [Sinorhizobium meliloti SM11]
 gi|429556228|gb|AGA11148.1| hypothetical protein C770_GR4pD1041 [Sinorhizobium meliloti GR4]
          Length = 229

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 102/218 (46%), Positives = 148/218 (67%), Gaps = 10/218 (4%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           I++G+D +E +AY V  +SIL++SSIPV  TPI  +++    ++ RER   +STEFSFSR
Sbjct: 7   IYIGFDSKEVVAYHVLCQSILEKSSIPVTFTPISLNNVE--NIFTRERNPLQSTEFSFSR 64

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           FL PYL+ ++GW++F DCD L   DI +L +L DD+YA MCV HDY PK  TK  G VQT
Sbjct: 65  FLVPYLSRYEGWSLFADCDMLMRTDIAQLWELRDDRYAAMCVKHDYVPKVETKFLGQVQT 124

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNFLEG 200
            Y +KNWSS++L+N  + K + L  + VNT +G  LH+F+WLE D+ IG +P  WN+L  
Sbjct: 125 KYEKKNWSSVILFN--NAKCRKLDLDYVNTASGLELHQFKWLESDDLIGELPASWNWL-- 180

Query: 201 HNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
              V E + +   K +H+T GGP+F+ +K+ ++A+ W 
Sbjct: 181 ---VNEYEYSPDAKLVHFTDGGPYFDEYKNDDYAEEWF 215


>gi|87123160|ref|ZP_01079011.1| hypothetical protein RS9917_04855 [Synechococcus sp. RS9917]
 gi|86168880|gb|EAQ70136.1| hypothetical protein RS9917_04855 [Synechococcus sp. RS9917]
          Length = 237

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 101/208 (48%), Positives = 130/208 (62%), Gaps = 6/208 (2%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P  IF+GYDPRE  A  V   S+++ SS+P+ ITP+V   L   GLY RER   +ST FS
Sbjct: 9   PIPIFIGYDPRERAATNVLIDSLVQHSSMPLAITPLVTPQLEAQGLYRRERDPKQSTAFS 68

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFL P+L G++GWA+F+DCD L   DI  L    DD+YA MCV H++ P ET K  G 
Sbjct: 69  FTRFLVPHLIGYEGWAMFMDCDMLCRGDIAALWQQRDDQYAAMCVQHEHVPGETVKFLGE 128

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNF 197
           VQ+ YP+KNWSS++L N    +   LT + VNT +G  LHRF WLE D EIG++   WN 
Sbjct: 129 VQSAYPKKNWSSLMLLNGS--RCTKLTVDYVNTASGLELHRFHWLEGDHEIGALDSGWNH 186

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWF 225
           L     V + D       +H+T GGPWF
Sbjct: 187 LVD---VQDADGAAAASLLHWTLGGPWF 211


>gi|407723747|ref|YP_006843408.1| hypothetical protein BN406_06126 [Sinorhizobium meliloti Rm41]
 gi|407323807|emb|CCM72408.1| hypothetical protein BN406_06126 [Sinorhizobium meliloti Rm41]
          Length = 229

 Score =  197 bits (502), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 102/218 (46%), Positives = 147/218 (67%), Gaps = 10/218 (4%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           I++G+D +E +AY V  +SIL++SSIPV  TPI  +++    ++ RER   +STEFSFSR
Sbjct: 7   IYIGFDSKEVVAYHVLCQSILEKSSIPVTFTPISLNNVE--NIFTRERNPLQSTEFSFSR 64

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           FL PYL+ ++GW++F DCD L   DI +L +L DD+YA MCV HDY PK  TK  G VQT
Sbjct: 65  FLVPYLSRYEGWSLFADCDMLMRTDIAQLWELRDDRYAAMCVKHDYVPKVETKFLGQVQT 124

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNFLEG 200
            Y +KNWSS++L+N  + K + L  + VNT +G  LH+F+WLE D  IG +P  WN+L  
Sbjct: 125 KYEKKNWSSVILFN--NAKCRKLDLDYVNTASGLELHQFKWLESDNLIGELPASWNWL-- 180

Query: 201 HNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
              V E + +   K +H+T GGP+F+ +K+ ++A+ W 
Sbjct: 181 ---VNEYEYSPDAKLVHFTDGGPYFDEYKNDDYAEEWF 215


>gi|150377136|ref|YP_001313732.1| hypothetical protein Smed_5018 [Sinorhizobium medicae WSM419]
 gi|150031683|gb|ABR63799.1| conserved hypothetical protein [Sinorhizobium medicae WSM419]
          Length = 229

 Score =  196 bits (499), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 101/218 (46%), Positives = 148/218 (67%), Gaps = 10/218 (4%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           I++G+D +E +AY V  +SIL++SSIPV  TPI  +++    ++ RER   +STEFSFSR
Sbjct: 7   IYIGFDSKEVVAYHVLCQSILEKSSIPVTFTPISLNNVE--NIFTRERNPLQSTEFSFSR 64

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           FL PYL+ ++GW++F DCD L   DI +L +L DD+YA MCV HDY PK  TK  G VQT
Sbjct: 65  FLVPYLSRYEGWSLFADCDMLMRTDIAQLWELRDDRYAAMCVKHDYIPKVETKFLGQVQT 124

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVWNFLEG 200
            Y +KNWSS++L+N  + K + L  + VNT +G  LH+F+WL+ D+ IG +P  WN+L  
Sbjct: 125 KYEKKNWSSVILFN--NAKCRKLDLDYVNTASGLELHQFKWLDSDDLIGELPASWNWL-- 180

Query: 201 HNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWL 238
              V E + +   K +H+T GGP+F+ +K+ ++A+ W 
Sbjct: 181 ---VNEYEYSPDAKLVHFTDGGPYFDEYKNDDYAEEWF 215


>gi|171057660|ref|YP_001790009.1| hypothetical protein Lcho_0972 [Leptothrix cholodnii SP-6]
 gi|170775105|gb|ACB33244.1| conserved hypothetical protein [Leptothrix cholodnii SP-6]
          Length = 223

 Score =  194 bits (493), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 98/218 (44%), Positives = 135/218 (61%), Gaps = 9/218 (4%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            ++FVG+DPRE +AY    +S+L+ +S+PV  TP+V   L +    ++E     S  F +
Sbjct: 2   IRLFVGFDPREAVAYHTFCQSVLESASVPVAFTPLVLQSLPR----YKETHNDGSNTFIY 57

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           SRFLTP+L  F+GWA+F D D +  +DI EL +L D+  A+  V H Y  K TTK  G  
Sbjct: 58  SRFLTPHLCEFEGWAIFADGDMVCRSDIAELWNLRDESKAVQVVQHSYQTKATTKYLGNK 117

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
              YPRKNWSS+VL+NCGH K+++LTPE V  QTGAFLHRF WL D +IG++P  WN+L 
Sbjct: 118 NQNYPRKNWSSVVLWNCGHAKHRMLTPEFVMGQTGAFLHRFSWLGDADIGALPVEWNWL- 176

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLW 237
                 E +     K +HYT G P F+ +   + A+ W
Sbjct: 177 ----TTEYEDNADAKLLHYTLGTPCFKDYVDADMAEEW 210


>gi|433772054|ref|YP_007302521.1| hypothetical protein Mesau_00657 [Mesorhizobium australicum
           WSM2073]
 gi|433664069|gb|AGB43145.1| hypothetical protein Mesau_00657 [Mesorhizobium australicum
           WSM2073]
          Length = 248

 Score =  194 bits (493), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 104/232 (44%), Positives = 134/232 (57%), Gaps = 16/232 (6%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTE-------- 73
           I++GYDPRE     V   S  +R+S P++I P+V  DL   G+  R     +        
Sbjct: 4   IYIGYDPRESRVLDVARWSAYRRTSTPLQILPLVLKDLEAQGIMKRPMVVRDGKLWCPIS 63

Query: 74  ----STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTP 129
               +TEF+ SRF  P++    GWA+F DCD L+L D  +L DL+D  YAIM V H    
Sbjct: 64  EAPMATEFAISRFAVPFIRQ-GGWAMFCDCDVLFLKDPAKLLDLVDPSYAIMAVKHQQRE 122

Query: 130 KETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIG 189
            E  KMDG +QT Y RKNWSS+VL+N  HP NK LT E++NT  G  LH F+WLED EIG
Sbjct: 123 SEAVKMDGQMQTFYRRKNWSSVVLWNLDHPSNKRLTNEMLNTWPGRDLHAFKWLEDHEIG 182

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKH-CEFADLWLKE 240
            +P  WN+L G +   E DP     A HYT GGPWF  W    ++ +LW +E
Sbjct: 183 ELPLAWNYLVGAS-ASELDPADVSLA-HYTLGGPWFAGWSGTTQWDELWRRE 232


>gi|319778000|ref|YP_004134430.1| hypothetical protein Mesci_6262 [Mesorhizobium ciceri biovar
           biserrulae WSM1271]
 gi|317171719|gb|ADV15256.1| hypothetical protein Mesci_6262 [Mesorhizobium ciceri biovar
           biserrulae WSM1271]
          Length = 248

 Score =  194 bits (492), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 104/232 (44%), Positives = 133/232 (57%), Gaps = 16/232 (6%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTE-------- 73
           I++GYDPRE     V   S  +R+S P++I P+V  DL   G+  R     +        
Sbjct: 4   IYIGYDPREARVLEVARWSAFRRTSTPLQIHPLVLKDLEAQGIMKRPMEVRDGRLWCPIS 63

Query: 74  ----STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTP 129
               +TEF+ SRF  P++    GWA+F DCD L+L D  +L DL+D  YAIM V H    
Sbjct: 64  EAPMATEFAISRFAVPFMRQ-GGWAMFCDCDVLFLQDPVKLLDLVDPSYAIMAVKHRQRE 122

Query: 130 KETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIG 189
            E  KMDG  QT Y RKNWSS+VL+N  HP NK LT E++NT  G  LH F+WLED EIG
Sbjct: 123 SEAVKMDGQTQTFYRRKNWSSVVLWNLDHPANKRLTNEMLNTWPGRDLHAFKWLEDHEIG 182

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHC-EFADLWLKE 240
            +P  WN+L G +   E DP     A HYT GGPWF  W    ++ +LW +E
Sbjct: 183 ELPLAWNYLVGAS-ASELDPAEVSLA-HYTLGGPWFAGWSGATQWDELWSRE 232


>gi|124024243|ref|YP_001018550.1| hypothetical protein P9303_25541 [Prochlorococcus marinus str. MIT
           9303]
 gi|123964529|gb|ABM79285.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9303]
          Length = 237

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 100/214 (46%), Positives = 134/214 (62%), Gaps = 13/214 (6%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P  I++GYDPRE  A  V   S+ + SS+P+ ITP+V   L   GL+ RER   +ST FS
Sbjct: 4   PIPIYIGYDPRERAATNVLIDSLYQHSSMPLAITPLVTPQLEGQGLFRRERDPKQSTAFS 63

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F+RFL P L G+ GWA+F+DCD L   DI EL    D++ A+MCV H++ P ET K  G 
Sbjct: 64  FTRFLVPKLMGYQGWAIFMDCDMLCRGDIAELWAQRDEQMAVMCVKHEHEPGETKKFLGE 123

Query: 139 VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSI-PFVWN 196
           VQ+ YP+KNWSS+++ NC   +   LT + V+T TG  LHRF WL  D EIG+I   +WN
Sbjct: 124 VQSPYPKKNWSSLMMLNCS--RCTALTVDYVSTATGLDLHRFHWLAGDHEIGAIEGGLWN 181

Query: 197 FLEGHNMVVENDP-----TTFPKAIHYTRGGPWF 225
               H + V+ DP      + P+ +H+T GGPWF
Sbjct: 182 ----HLVAVQEDPDLSVGQSRPQLLHWTLGGPWF 211


>gi|332529494|ref|ZP_08405452.1| hypothetical protein HGR_06256 [Hylemonella gracilis ATCC 19624]
 gi|332041014|gb|EGI77382.1| hypothetical protein HGR_06256 [Hylemonella gracilis ATCC 19624]
          Length = 243

 Score =  192 bits (487), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 100/231 (43%), Positives = 141/231 (61%), Gaps = 11/231 (4%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           N+P +IF+GYD R  +A++    SI  R++ PV   PI  ++L    LY R+    +STE
Sbjct: 2   NEPIRIFIGYDRRIPVAFQTLVHSITARATAPVTFCPISLTNL--GSLYTRDVNPLQSTE 59

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           FSFSRF TPYL+ ++GW++F+D D + + DI +L  L DDKYA M   H++ P E  K  
Sbjct: 60  FSFSRFFTPYLSNYEGWSIFMDNDIIAVDDIAKLWALRDDKYAAMVCKHEHDPGEAEKFM 119

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIPFVW 195
            A QT Y +KNWSS++L+N  + K K LTP+ VNT TG  LH+F+WLE D  IGS+P  W
Sbjct: 120 HATQTRYEKKNWSSVILFN--NAKCKALTPDYVNTATGLQLHQFKWLENDNLIGSLPLEW 177

Query: 196 NFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEEYKN 246
           N L  +N          P  +HYT GGP++  +K C +++ W +E     N
Sbjct: 178 NTLVDYNHFKN------PSMLHYTEGGPFYPDYKDCGYSEEWFREYAAANN 222


>gi|33864181|ref|NP_895741.1| hypothetical protein PMT1915 [Prochlorococcus marinus str. MIT
           9313]
 gi|33635765|emb|CAE22090.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 237

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 101/216 (46%), Positives = 134/216 (62%), Gaps = 13/216 (6%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
            KP  I++GYDPRE  A  V   S+ + SS+P+ ITP+V   L   GL+ RER   +ST 
Sbjct: 2   TKPIPIYIGYDPRERAATNVLIDSLYQHSSMPLAITPLVTPQLEGQGLFRRERDPKQSTA 61

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           FSF+RFL P L G+  WA+F+DCD L   DI EL    D++ A+MCV H++ P ET K  
Sbjct: 62  FSFTRFLVPKLMGYQDWAIFMDCDMLCRGDIAELWAQRDEQMAVMCVKHEHEPGETKKFL 121

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE-DEEIGSIP-FV 194
           G VQ+ YP+KNWSS+++ NC   +   LT + V+T TG  LHRF WL  D EIG+I   +
Sbjct: 122 GEVQSPYPKKNWSSLMMLNCS--RCTALTVDYVSTATGLDLHRFHWLAGDHEIGAIEGGL 179

Query: 195 WNFLEGHNMVVENDP-----TTFPKAIHYTRGGPWF 225
           WN L G    V+ DP      + P+ +H+T GGPWF
Sbjct: 180 WNHLVG----VQEDPELSVGQSRPQLLHWTLGGPWF 211


>gi|118593721|ref|ZP_01551096.1| hypothetical protein SIAM614_27707 [Stappia aggregata IAM 12614]
 gi|118433731|gb|EAV40393.1| hypothetical protein SIAM614_27707 [Stappia aggregata IAM 12614]
          Length = 230

 Score =  190 bits (483), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 95/221 (42%), Positives = 133/221 (60%), Gaps = 9/221 (4%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
             IF+G+DPRE +AY VC  S+ + SS  + ITP+   +L+     + E     S  F +
Sbjct: 2   LNIFIGFDPREAIAYHVCANSLTRLSSDVLAITPLSLHNLKT----YNETHTDGSNHFIY 57

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           +RFL P L  ++GWA+F+D D +   DI +L  + D+  A+MCVHHDY  ++TTK  G+ 
Sbjct: 58  TRFLIPSLMNYNGWALFMDGDMIVRDDIAKLFAMRDESKAVMCVHHDYQTRQTTKYLGSK 117

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
              YPRKNWSS+VL+NCGHP NK++TP+ V   TGA LHRF WL+D+ IG +P  WN+L 
Sbjct: 118 NENYPRKNWSSVVLWNCGHPANKVVTPDFVMNATGAQLHRFTWLDDDLIGELPKEWNWLP 177

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKE 240
                 E  P    K +H+T G P F  + +   A  W +E
Sbjct: 178 D-----EFGPNPDAKLLHFTLGTPCFHEYANAPMAAEWHRE 213


>gi|398806488|ref|ZP_10565394.1| LPS:glycosyltransferase [Polaromonas sp. CF318]
 gi|398088006|gb|EJL78579.1| LPS:glycosyltransferase [Polaromonas sp. CF318]
          Length = 228

 Score =  187 bits (474), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 101/219 (46%), Positives = 128/219 (58%), Gaps = 9/219 (4%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           IF+GYDPRE +AY VC  SI+++ S PV + P+    LR  G Y  E+    S  F +SR
Sbjct: 4   IFIGYDPREAVAYHVCSNSIVRQVSQPVSLNPLA---LRNLGFY-VEKHTDGSNHFIYSR 59

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQT 141
           FL P+L  + GWA+F+D D L   DI  L +L D   A+M V HDY  + T K  G+   
Sbjct: 60  FLVPHLMEYRGWAIFMDGDMLLRDDITRLWELRDPTKAVMVVKHDYKTRMTEKYLGSKNE 119

Query: 142 VYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLEGH 201
            YPRKNWSS+VL+NC HP N+ LTPE V   TGA LHRF WLED+ IG +P VWN+L   
Sbjct: 120 NYPRKNWSSVVLWNCEHPANRALTPEFVQGATGAQLHRFTWLEDDLIGELPKVWNWLPD- 178

Query: 202 NMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKE 240
               E       K +H+T G P F  +     AD W +E
Sbjct: 179 ----EFGANPEAKLLHFTLGTPCFHEFATTPMADEWHRE 213


>gi|218677603|ref|ZP_03525500.1| hypothetical protein RetlC8_01575 [Rhizobium etli CIAT 894]
          Length = 190

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 91/180 (50%), Positives = 122/180 (67%), Gaps = 8/180 (4%)

Query: 64  LYWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCV 123
           ++ R+R   +STEFSFSRFLTPYL+GF+GW++F+DCD L  ADI EL  L DD+YA MCV
Sbjct: 9   IFTRQRNALQSTEFSFSRFLTPYLSGFEGWSIFMDCDMLMRADIAELWALRDDRYAAMCV 68

Query: 124 HHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWL 183
            HDY PK  TK  G  QT Y +KNWSS++L+N  + K + LT + VNT TG  LH+F+WL
Sbjct: 69  KHDYQPKIDTKFLGHTQTKYEKKNWSSLILFN--NDKCRALTRDYVNTATGLQLHQFKWL 126

Query: 184 E-DEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           E D+ IG +P  WN+L     V E D     K +H+T GGP+F+ +K+ ++A+ W    E
Sbjct: 127 ENDDLIGELPVTWNYL-----VNEYDYREDAKNVHFTDGGPYFDQYKNDDYAEEWFAARE 181


>gi|318041934|ref|ZP_07973890.1| hypothetical protein SCB01_09494 [Synechococcus sp. CB0101]
          Length = 228

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 93/223 (41%), Positives = 134/223 (60%), Gaps = 9/223 (4%)

Query: 15  KDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTES 74
           + + P K+F+G+D RED+A  V   SI   +S PV+I  +  + +   G+Y R +   +S
Sbjct: 2   QPSDPIKVFIGFDSREDVAGNVLTDSIQAHASKPVQIGQVRLTQIE--GVYHRSKHPLQS 59

Query: 75  TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTK 134
           TEFSFSRFL P++ G+  WA+F+D D L   DI EL DL D++YA+  V H++      K
Sbjct: 60  TEFSFSRFLVPWICGYQDWAIFMDADMLCQGDITELWDLRDERYAVQVVQHNHHCSSGRK 119

Query: 135 MDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFV 194
             G  Q+ Y RKNWSS++L+NC   + K L+P  VNT +G  LH+F+W   EEIG +P  
Sbjct: 120 FQGMPQSPYRRKNWSSVMLFNCA--RCKALSPHYVNTASGLELHQFEWTTTEEIGPLPPQ 177

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLW 237
           WN L G    V+  P    + +HYT GGPWF+  +    ++LW
Sbjct: 178 WNVLVG----VQEVPAD-ARLLHYTLGGPWFDDCRSMPRSELW 215


>gi|456062462|ref|YP_007501432.1| hypothetical protein D521_0127 [beta proteobacterium CB]
 gi|455439759|gb|AGG32697.1| hypothetical protein D521_0127 [beta proteobacterium CB]
          Length = 225

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 127/217 (58%), Gaps = 9/217 (4%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFS 80
            I VG+D RE +AY V  +S++  SS PV+  P+ ++ L      ++E     S +F +S
Sbjct: 6   SIVVGFDQREAIAYHVFCQSVIDSSSNPVQFLPLAENTLSS----YKEVHTDGSNKFIYS 61

Query: 81  RFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQ 140
           RFLTP+L  F GWA+F D D +  ADI EL  L D+  A+  V HDY  K + K  G   
Sbjct: 62  RFLTPFLMNFSGWAIFADGDMVCQADISELWALRDESKAVQVVKHDYQTKASKKYLGNRN 121

Query: 141 TVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLEG 200
             YPRKNWSS++L+NC HP N +LTP+ +++Q G++LHRF WL D+ IG +P  WN+L  
Sbjct: 122 ENYPRKNWSSVILWNCSHPANLLLTPDFIHSQPGSYLHRFSWLGDDLIGELPAEWNWL-- 179

Query: 201 HNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLW 237
               +E       K IHYT G P F  + +   ++ W
Sbjct: 180 ---AIEYPENPNAKLIHYTLGTPCFFDYANESMSENW 213


>gi|171909567|ref|ZP_02925037.1| hypothetical protein VspiD_00305 [Verrucomicrobium spinosum DSM
           4136]
          Length = 192

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 86/180 (47%), Positives = 114/180 (63%), Gaps = 6/180 (3%)

Query: 63  GLYWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMC 122
           G++ R +   +STEFSFSRFL P+L  + GWAVFVDCD L L DI  L  L D+ +A+  
Sbjct: 6   GVFTRPQDARQSTEFSFSRFLVPWLCDYQGWAVFVDCDTLVLDDIARLWALRDESFAVQV 65

Query: 123 VHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQW 182
           V HD+ P+ETTK  G  Q+ YP+KNWSS++L NC   +   LTPE VN  +G  LHRF W
Sbjct: 66  VKHDHRPRETTKFLGQPQSTYPKKNWSSVMLMNCA--RCTALTPEYVNQASGLDLHRFHW 123

Query: 183 LEDEE-IGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEM 241
           LE E  IG IP  WN L  ++  +   P +    +H+T GGPWF+ +  C +AD+W +E 
Sbjct: 124 LEGEHLIGEIPHRWNHLVDYDPPL---PESELSCLHFTSGGPWFQDYAECGYADVWHREF 180


>gi|297743326|emb|CBI36193.3| unnamed protein product [Vitis vinifera]
          Length = 103

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 76/96 (79%), Positives = 85/96 (88%)

Query: 95  VFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLY 154
           +FVDCDFLYLADIKEL DL+DD  AIMCV HDY PKETTKMDGAVQTVYPRKNWSSMVLY
Sbjct: 1   MFVDCDFLYLADIKELRDLVDDNCAIMCVQHDYAPKETTKMDGAVQTVYPRKNWSSMVLY 60

Query: 155 NCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGS 190
           NCGHPKN++LTP+VVN+QTGA+LHRF  L   ++ S
Sbjct: 61  NCGHPKNRVLTPDVVNSQTGAYLHRFMLLRMIQVRS 96


>gi|414088185|ref|YP_006988363.1| putative nucleotide-diphospho-sugar transferase [Caulobacter phage
           CcrColossus]
 gi|408731555|gb|AFU87999.1| putative nucleotide-diphospho-sugar transferase [Caulobacter phage
           CcrColossus]
          Length = 261

 Score =  163 bits (412), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 135/252 (53%), Gaps = 28/252 (11%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILK------RSSIPVKITPIVQSDLRKSGLY---WR--E 68
             +FVGYD RE  A+ VC  S+L        + + V++ P+   +LR++G++   WR  E
Sbjct: 3   LHVFVGYDVREHAAWMVCRDSLLNPIAYEGDAPLDVQVHPVSHRELRRAGVFDRPWRIDE 62

Query: 69  RGQTE--------STEFSFSRFLTPYLA---GFDGWAVFVDCDFLYLADIKELCDLIDDK 117
           +GQT         STEFS SRFL P LA   G  G  VFVDCDF+++  + ++ D +D  
Sbjct: 63  KGQTWDERDGRPFSTEFSHSRFLVPSLAQAMGLTGQVVFVDCDFMFMHSVVDMVDSLDPT 122

Query: 118 YAIMCVHHDYTP-KETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAF 176
             +  V HD+      TKMDG  Q+ Y RK WSS++++N  H    +  P   NT  G+ 
Sbjct: 123 KLVSVVKHDFERIAAGTKMDGMSQSRYYRKLWSSLMVFNLDHEDAHLFHPHYANTWDGSS 182

Query: 177 LHRFQWLEDEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADL 236
           LH FQ L D EIG IP  WN + GH+   E      P AIH++ GGPW  A++   +A L
Sbjct: 183 LHGFQPLTDSEIGDIPSEWNHIPGHSSANE-----VPSAIHWSFGGPWMPAYEKAAWAPL 237

Query: 237 WLKEMEEYKNEA 248
           W     +   +A
Sbjct: 238 WRARYVQVMQDA 249


>gi|354335005|gb|AER23947.1| hypothetical protein var070 [Variovorax sp. HH01]
          Length = 221

 Score =  133 bits (334), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 118/227 (51%), Gaps = 22/227 (9%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           +F G+D RE   +    +S+++R+S PV+ITP+    L +            S  F+ SR
Sbjct: 4   LFCGFDEREAAGFHTFCQSVIRRASKPVRITPLSAMGLHQG-----------SNTFTLSR 52

Query: 82  FLTPYLAGFDGWAVFVD-CDFLYLADIKELCDLIDDKYAIMCVHH-DYTPKETTKMDGA- 138
           FL PYL GF G A+F+D CD L L D+ EL  L D ++A+  V H DY  +   K  G  
Sbjct: 53  FLVPYLMGFKGRAIFMDACDMLMLDDVSELDALFDKRFAVQVVKHGDYQSQHARKYVGTD 112

Query: 139 ---VQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVW 195
               Q+ Y RKNW+S++  NC H     +TP++++      L + ++ ED+EIG++P  W
Sbjct: 113 MECKQSNYSRKNWASVMSINCEHSAWFAMTPKMISLAEPIDLLQLRYFEDKEIGALPAEW 172

Query: 196 NFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           N L     + E       K +H+T G P F  +++   +  W  E E
Sbjct: 173 NVL-----IDEGQEREGAKVLHWTAGLPTFHHYRNARGSADWFAEHE 214


>gi|119502716|ref|ZP_01624801.1| hypothetical protein MGP2080_05407 [marine gamma proteobacterium
           HTCC2080]
 gi|119461062|gb|EAW42152.1| hypothetical protein MGP2080_05407 [marine gamma proteobacterium
           HTCC2080]
          Length = 251

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 110/223 (49%), Gaps = 31/223 (13%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
            K+FVG+D R+ +++     SI + +S PV ITP+V   L  S      RG T    F+F
Sbjct: 4   LKVFVGFDERQRVSFTTLTTSIFETASRPVSITPLVLHTLPIS-----RRGLTP---FTF 55

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
           SRFL P+L  F+G AVF+D D L  +DI EL  LI D  A+  V    T ++T       
Sbjct: 56  SRFLVPWLCNFEGAAVFMDADMLLASDITELEQLISDDLAVSVVRSLETYEQT------- 108

Query: 140 QTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
                     S +L NC HP ++ LTPE +  +T   LH  +W+  +EIGS+   WN L 
Sbjct: 109 ----------SFMLLNCAHPSHRKLTPEFIQ-ETDIGLHALEWVGRDEIGSLDSKWNQLV 157

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEME 242
           G+      D       IHYT G P F        A LWLK  +
Sbjct: 158 GY-----QDVDFTQGNIHYTMGVPAFPETSTSPGAQLWLKNAQ 195


>gi|291336000|gb|ADD95590.1| hypothetical protein [uncultured phage MedDCM-OCT-S09-C7]
          Length = 543

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/224 (35%), Positives = 120/224 (53%), Gaps = 16/224 (7%)

Query: 12  RETKDNKPFKIFVGYDPR----EDLAYRVCHRSILK----RSSIPVKITPIVQSDLRKSG 63
            E    KP  I++GYD      E++      +SI K      S    I  I   D  K  
Sbjct: 298 HELSKGKPLNIYIGYDSTHNNIEEITKFSIEQSIKKTFSGNESANRFIPEIKYLDYSKIP 357

Query: 64  LYWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCV 123
           +Y RE  + +ST F++SRFL PYL  ++G+++F+D D L+      +   ++   A+ C+
Sbjct: 358 VYNREY-KNQSTAFTYSRFLIPYLENYEGFSMFIDDDILFEETPLPMFYYLNPDDAVACI 416

Query: 124 HHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWL 183
            +     ++ K +G +   YP K WSS++++N GH   K LTPEVVNT TGA LH+F+W 
Sbjct: 417 QYPQYEHDSVKFNGEINIDYPCKLWSSLMIFNNGHEDCKKLTPEVVNTWTGAQLHQFEWT 476

Query: 184 EDEEIGSIPFVWNFLEGHNMVVENDPTTFPKA-IHYTRGGPWFE 226
             ++I  IP  + F EG+    +N  T +  A  HYTRGGPW +
Sbjct: 477 --DKISKIPEHYVFTEGY----DNPETKWQCAGYHYTRGGPWID 514


>gi|291334635|gb|ADD94283.1| hypothetical protein MCA1513 [uncultured phage MedDCM-OCT-S04-C64]
          Length = 227

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 71/226 (31%), Positives = 114/226 (50%), Gaps = 39/226 (17%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIV--QSDLRKSGLYWRERGQTESTEF 77
            +I++G DPR+ +AY V   SI +R+S PV I P+V  Q  +++ GL          T+F
Sbjct: 1   MRIYIGVDPRQPIAYNVLQWSITRRASKPVSIIPLVLFQLPIKRVGL----------TDF 50

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDG 137
           +FSR+L P L+G+ G +VF+D D L L DI +L + + D  A+                 
Sbjct: 51  TFSRYLAPALSGYQGVSVFMDADMLVLDDIHKLKEFVKDDCAV----------------- 93

Query: 138 AVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNF 197
           +V+    +  W+S++++N    K   LT + +N ++      F W   E IG IP  +N+
Sbjct: 94  SVRKSIHKFEWASLMVFNNSLCKK--LTADYINDESNT-PQNFDWA--ESIGDIPQKFNY 148

Query: 198 LEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
             G+     ++PT     +HYT G P F   + C++A  W  E E 
Sbjct: 149 CVGY-----DEPTEPASIVHYTAGIPHFPETRDCDYAKEWWNEYES 189


>gi|291334841|gb|ADD94481.1| hypothetical protein [uncultured phage MedDCM-OCT-S06-C1041]
          Length = 106

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 62/103 (60%), Gaps = 12/103 (11%)

Query: 20  FKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIV--QSDLRKSGLYWRERGQTESTEF 77
            +I++G DPR+ +AY V   SI +R+S PV I P+V  Q  +++ GL          T+F
Sbjct: 1   MRIYIGVDPRQPIAYNVLQWSITRRASKPVSIIPLVLFQLPIKRVGL----------TDF 50

Query: 78  SFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAI 120
           +FSR+L P L G+ G +VF+D D L L DI +L + + D  A+
Sbjct: 51  TFSRYLAPALCGYQGVSVFMDADMLVLDDIHKLKEFVKDDCAV 93


>gi|220909569|ref|YP_002484880.1| glycosyl transferase protein [Cyanothece sp. PCC 7425]
 gi|219866180|gb|ACL46519.1| glycosyl transferase family 8 [Cyanothece sp. PCC 7425]
          Length = 306

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 66/229 (28%), Positives = 111/229 (48%), Gaps = 25/229 (10%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFS 80
           ++FVG D  + LA +V   SI + +S+PV++ P++  D+++      +      T FSFS
Sbjct: 8   RVFVGADRSQQLAVKVLEHSIHRHTSVPVQVLPMIDLDVKQPA----DPKNWGRTGFSFS 63

Query: 81  RFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQ 140
           RF  P LAG+ G A+++D D L + DI  L + I    A + +  D  P++         
Sbjct: 64  RFRIPKLAGYQGKAIYLDADMLVMKDIAALWN-IPFNGAKIVIQAD-LPEKHQATHHKYG 121

Query: 141 TVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAF-----LHRFQWLEDEEIG-SIPFV 194
               R    +++L +CG+   ++    + +   G +     +++F  L+++EI  S+PF 
Sbjct: 122 APTQRIKQCAVMLLDCGNLDWEV-DRIIADLDQGKYTYEDLMYQFCLLKEDEIRYSVPFE 180

Query: 195 WNFLEGHNMVVENDPTTFPKAIHYTRGG--PWFEAWKHCEFADLWLKEM 241
           WN LE        D  T    IHYT     PW  A      A LW++E+
Sbjct: 181 WNSLEYF------DAETC--LIHYTDMATQPWVSA--RNPLAYLWMEEV 219


>gi|56750290|ref|YP_170991.1| hypothetical protein syc0281_d [Synechococcus elongatus PCC 6301]
 gi|81300080|ref|YP_400288.1| hypothetical protein Synpcc7942_1271 [Synechococcus elongatus PCC
           7942]
 gi|56685249|dbj|BAD78471.1| unknown protein [Synechococcus elongatus PCC 6301]
 gi|81168961|gb|ABB57301.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 245

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 68/234 (29%), Positives = 101/234 (43%), Gaps = 38/234 (16%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIP-VKITPIVQSDLRKSGLYWRERGQTESTEFSF 79
           +I++G +  + LA +    SI K  S P ++I P+ Q+  R  G           T F F
Sbjct: 9   RIYIGTEANQYLAEQTLIYSIRKHLSDPSIEIIPLTQTQQRVGG-----------TRFGF 57

Query: 80  SRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAV 139
            RF  P   G+ G A+++D D L L D+  + DL+ D  AI CV   Y  +        V
Sbjct: 58  VRFTLPQAMGYQGKALYLDADQLALTDLAPIFDLLPDDKAIACV---YEAEGLYNGKPHV 114

Query: 140 QTVYPRKNWSSMVLYNCGH---PKNKILTPEVVNTQ---TGAFLHR----FQWLEDEEIG 189
           QT +     +S++L +C       +++    V N Q    G   +R     QW E E I 
Sbjct: 115 QTWH-----TSVMLLDCEQLAWQPDELFATVVPNKQEPKPGQIRYRNFMELQWFEQERIA 169

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCEFADLWLKEMEE 243
            +P  WN     N+V    P T      + R  PW    +H E  D WL  ++E
Sbjct: 170 KLPPSWNHF---NIVT---PETNIVHFSHVRSQPWKNP-RH-ELTDFWLGYLKE 215


>gi|427720463|ref|YP_007068457.1| glycosyl transferase family protein [Calothrix sp. PCC 7507]
 gi|427352899|gb|AFY35623.1| glycosyl transferase family 8 [Calothrix sp. PCC 7507]
          Length = 303

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 65/236 (27%), Positives = 106/236 (44%), Gaps = 30/236 (12%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           N+  +++VG D  + LA +V   SI + + + V + P++   ++K     +     + T 
Sbjct: 2   NEQVRVYVGTDTSQLLAVKVLEHSIKRCTKLDVTVYPMLDLPIKKP----QNPNNWQRTG 57

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD 136
           FSFSRF  P LAG+ G A+++D D L   DI  L ++  D  A + +  D  P E  K +
Sbjct: 58  FSFSRFCIPELAGYTGKALYLDADMLVFQDIASLWNIPFDG-AKVVIQED-IPNEYQKTN 115

Query: 137 GAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWL--------EDEEI 188
             +     R    S++L +C     KI   E +  +     + ++ L        E  +I
Sbjct: 116 KKLGAPSQRIKQCSVMLLDCTRLNWKI---ETIIDELDQDKYNYEQLMYELCILNESSDI 172

Query: 189 G-SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCE--FADLWLKEM 241
             SIPF WN LE ++             IHYT      + W  C+  +A LW  E+
Sbjct: 173 KYSIPFKWNSLEHYDQ--------HTCLIHYTDMAT--QPWASCQNKYAYLWFNEL 218


>gi|427720479|ref|YP_007068473.1| glycosyl transferase family protein [Calothrix sp. PCC 7507]
 gi|427352915|gb|AFY35639.1| glycosyl transferase family 8 [Calothrix sp. PCC 7507]
          Length = 314

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 71/232 (30%), Positives = 108/232 (46%), Gaps = 34/232 (14%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTE---ST 75
           P +IFVG    + LA +V   SI K +S+PV++TP+  + +RK+G+   E    +    T
Sbjct: 14  PIQIFVGTQEEQMLAVKVLEYSIKKYASMPVEVTPLFIA-VRKAGIRIPEPQDPKIKPQT 72

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
            FSF RF  P L G+ G A+++D D     DIKEL  L  +   ++ V   Y P ++ ++
Sbjct: 73  PFSFQRFAIPALNGYRGRAIYLDSDMQVFRDIKELWLLPFNGADLLSV---YEPGDSQRV 129

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLE---DEEIGS-I 191
                       +S MV+ NC    N  +   V + + G + ++   LE     +I S I
Sbjct: 130 S----------QFSVMVI-NCEQ-LNWNVEQLVRDLEMGKWTYKQFILEMSPASKIASVI 177

Query: 192 PFVWNFLEGHNMVVENDPTTFPKAIHYTR--GGPWFEAWKHCEFADLWLKEM 241
           P  WN LE ++     D T      HYT     PW     H     LW +E+
Sbjct: 178 PTEWNDLERYS----EDKTAL---THYTDMPNQPWLNT--HNPLGWLWCQEL 220


>gi|33864189|ref|NP_895749.1| hypothetical protein PMT1923 [Prochlorococcus marinus str. MIT
           9313]
 gi|33635773|emb|CAE22098.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 119

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 122 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQ 181
           CV       ET K  G VQ+ Y +KNWSS++L NCG  +   LT + VNT TG   HRF 
Sbjct: 7   CVCSMNISGETVKFLGEVQSAYLKKNWSSLMLLNCG--RCNKLTVDYVNTSTGLEPHRFH 64

Query: 182 WLE-DEEIGSIPFVWNFL 198
           WLE D +IG I   WN L
Sbjct: 65  WLEGDHKIGVIEGGWNHL 82


>gi|398903210|ref|ZP_10651520.1| LPS:glycosyltransferase [Pseudomonas sp. GM50]
 gi|398177405|gb|EJM65086.1| LPS:glycosyltransferase [Pseudomonas sp. GM50]
          Length = 314

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/247 (25%), Positives = 114/247 (46%), Gaps = 26/247 (10%)

Query: 16  DNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTEST 75
           D++  ++FVG D  + LA  V   SI + ++  V++ P++  DL          GQ   T
Sbjct: 3   DDEVIRVFVGVDRSQLLAVPVLEYSIRRHTTAKVEVIPML--DLPVPVPLDPRNGQ--RT 58

Query: 76  EFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
            FSFSRF  P L  + G A+++D D L + DI+EL ++  +   ++        +ET + 
Sbjct: 59  GFSFSRFCIPKLCEYKGKAIYMDADMLVMHDIRELWNIPFEGAKVIIQEEVKFVEETLQK 118

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAF-----LHRFQWLEDEEIG- 189
            GA +    RK   +++L +C +    +    V     G +     +     L+++ +  
Sbjct: 119 VGAPKL---RKKQCAVMLLDCANLDWDV-NAIVAGMDEGKYNYDELMAGLCILDEQHVKY 174

Query: 190 SIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHC--EFADLWLKEMEEYKNE 247
            +PF WN LE        D  T  + +HYT    + + W  C  +FA +W +E+     +
Sbjct: 175 GVPFEWNSLE------HWDANT--RLLHYT--DVYTQPWSCCGNKFAWVWFQEVRRMLAD 224

Query: 248 ANKTVEK 254
            + T E+
Sbjct: 225 GSLTEER 231


>gi|402490220|ref|ZP_10837009.1| glycosyl transferase protein [Rhizobium sp. CCGE 510]
 gi|401810246|gb|EJT02619.1| glycosyl transferase protein [Rhizobium sp. CCGE 510]
          Length = 383

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 90/201 (44%), Gaps = 16/201 (7%)

Query: 9   TANRETKDNKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRE 68
           T      DN    IFVG D  + LA  V   SI + +S+ +K+  +    L       ++
Sbjct: 46  TGMTSENDNDTAYIFVGADRSQLLAVPVLEHSIRRHTSMNIKLRSMDDVVLPDP----KD 101

Query: 69  RGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYT 128
             Q + T FSF+RF  P L G+ G A+++D D L   D +EL +L  +   I+    +  
Sbjct: 102 IRQGKRTGFSFARFAIPQLMGYKGRAIYLDADMLVFRDFRELWNLPFNSAKIII--QEEL 159

Query: 129 PKETTKMD--GAVQTVYPRKNWSSMVLYNCGHPKN----KILTPEVVNTQTGAFLHRFQW 182
           P E  K    GA++    R    S++L +C    +    KI+            L+    
Sbjct: 160 PDEAQKQSKPGALKK---RVKQCSVMLLDCEALSDWDPVKIIAGLDGQYTYQDLLYDMCI 216

Query: 183 LEDEEIG-SIPFVWNFLEGHN 202
           L++ EI   +PF WN LE ++
Sbjct: 217 LDESEINYGVPFEWNSLEHYD 237


>gi|418937385|ref|ZP_13491027.1| glycosyl transferase family 8 [Rhizobium sp. PDO1-076]
 gi|375055904|gb|EHS52122.1| glycosyl transferase family 8 [Rhizobium sp. PDO1-076]
          Length = 336

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 86/185 (46%), Gaps = 16/185 (8%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           IFVG D  + LA  V   SI + +S+ +K+  +    L       ++  Q + T FSF+R
Sbjct: 12  IFVGADRSQLLAVPVLEHSIRRHTSMNIKLRSMDDVVLPDP----KDVRQGKRTGFSFTR 67

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMD--GAV 139
           F  P L G+ G A+++D D L   D +EL +L  +   I+    +  P +  K D  GA+
Sbjct: 68  FAIPQLMGYKGRAIYLDADMLVFRDFRELWNLPFNSAKIII--QEELPDQAQKQDKPGAL 125

Query: 140 QTVYPRKNWSSMVLYNCGHPKN----KILTPEVVNTQTGAFLHRFQWLEDEEIG-SIPFV 194
           +    R    S++L +C    +    KI+            L+    L++ EI   +PF 
Sbjct: 126 KK---RVKQCSVMLLDCEALSDWDPVKIIEGLDGKYTYQDLLYDMCILDESEISYGVPFE 182

Query: 195 WNFLE 199
           WN LE
Sbjct: 183 WNSLE 187


>gi|282892058|ref|ZP_06300535.1| hypothetical protein pah_c205o095 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|338174718|ref|YP_004651528.1| hypothetical protein PUV_07240 [Parachlamydia acanthamoebae UV-7]
 gi|281498100|gb|EFB40442.1| hypothetical protein pah_c205o095 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|336479076|emb|CCB85674.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
          Length = 234

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/235 (24%), Positives = 100/235 (42%), Gaps = 47/235 (20%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P ++F+G +  + L   V  RSIL  ++ PV+   ++   L+ +        +   T FS
Sbjct: 3   PIRVFIGTEDAQFLPTEVLKRSILHHTTAPVEFHLLIDIPLKLN--------RKMYTGFS 54

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGA 138
           F RF  P   G+ G A+++D D + + DI EL +L                    +  GA
Sbjct: 55  FYRFAIPEACGYQGRAIYLDADMVMVGDISELYNL------------------PMQGKGA 96

Query: 139 VQTVYPRKNW-SSMVLYNCGHPKNKILTP--EVVNTQTGAFLHRFQW-----LEDEEIGS 190
           +    P   W +S +L +C   K+  L+    ++N    ++     W     L  ++ G 
Sbjct: 97  LARPCPPNAWYTSAMLLDCEQLKHWNLSKWETLINAGIASYKEAL-WGEPGSLTHKDFGL 155

Query: 191 IPFVWNFLEGHNMVVENDPTTFPKAIHYTR--GGPWFEAWKHCEFADLWLKEMEE 243
           +P  WN L+  +   +N        IHYT     PW        FA ++LK+++E
Sbjct: 156 LPEYWNHLDTFDNTTKN--------IHYTNVPRQPWKTPGH--PFAKIFLKQLKE 200


>gi|220909560|ref|YP_002484871.1| glycosyl transferase protein [Cyanothece sp. PCC 7425]
 gi|219866171|gb|ACL46510.1| glycosyl transferase family 8 [Cyanothece sp. PCC 7425]
          Length = 302

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 66/234 (28%), Positives = 99/234 (42%), Gaps = 34/234 (14%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTE--- 73
             P KI+VG    + LAY++   SIL+ ++ PV+I P+ ++ +  + L      Q     
Sbjct: 4   QTPIKIYVGTQKEQLLAYKILEYSILQHTAHPVEIVPLHEA-IAAAQLTIPIPAQPALRP 62

Query: 74  STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETT 133
            T FSF RF  P L  + G A+++D D     DI EL         ++ VH    P ++ 
Sbjct: 63  RTPFSFQRFTIPALENYQGRAIYLDSDMQVFRDIAELWHWDFAGADLLAVHE---PADSG 119

Query: 134 KMDGAVQTVYPRKNWSSMVLYNCGHPKNKI--LTPEVVNTQTGAFLHRFQWLEDEEIGSI 191
           +          R  +S MVL +C      I  L  E+   +       FQ     ++GS+
Sbjct: 120 R----------RPQFSVMVL-DCEQLHWDINELVRELEQGRWSYEQFVFQMAPARQVGSV 168

Query: 192 -PFVWNFLEGHNMVVENDPTTFPKAI-HYT--RGGPWFEAWKHCEFADLWLKEM 241
            P  WN LE +        T    A+ HYT  R  PW         A LW +E+
Sbjct: 169 LPTEWNDLERY--------TPGKTALTHYTDMRTQPWLSTAN--PLAGLWCREL 212


>gi|423013883|ref|ZP_17004604.1| hypothetical protein AXXA_05508 [Achromobacter xylosoxidans AXX-A]
 gi|338782814|gb|EGP47183.1| hypothetical protein AXXA_05508 [Achromobacter xylosoxidans AXX-A]
          Length = 289

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 88/197 (44%), Gaps = 29/197 (14%)

Query: 19  PFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYW---RERG-QT 72
           P +IFVG DP + DL    V   S  K +S PV+IT +  S    S  Y    R RG +T
Sbjct: 4   PIRIFVGCDPNDCDLEQMMVLDYSARKHASRPVEITWMRLSRDPASPWYCDPERGRGWRT 63

Query: 73  E--STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPK 130
           E  ST FS  R+  P  AGF G A+++D D L L D+ E+ DL  D  AI+         
Sbjct: 64  EKWSTPFSAFRWAVPAAAGFQGRALYMDADMLVLCDLAEIWDLPLDGEAIV--------- 114

Query: 131 ETTKMDGAVQTVYPRKNWSSMV-LYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEE-- 187
              + DG          W S V L++C   +  + T E +  +  A      +  D    
Sbjct: 115 -AGRRDG--------DGWRSCVALWDCARARAHLPTLEALRAKRHANREMKHYFADRPRL 165

Query: 188 IGSIPFVWNFLEGHNMV 204
           +  +   +N ++G N+ 
Sbjct: 166 VQHLDARYNSIDGENLA 182


>gi|398903201|ref|ZP_10651511.1| LPS:glycosyltransferase [Pseudomonas sp. GM50]
 gi|398177396|gb|EJM65077.1| LPS:glycosyltransferase [Pseudomonas sp. GM50]
          Length = 297

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 96/230 (41%), Gaps = 46/230 (20%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE---- 76
           +IFV   P E L  RV   SI + S++P++++ I          + RE  Q ++ E    
Sbjct: 13  RIFVAATPAEWLPMRVLEYSIREHSTLPIELSTIYS--------HGREIPQPKALENRAR 64

Query: 77  --FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTK 134
             FSF RFL P L  F G A+++D D     DI EL D          +  ++    +T 
Sbjct: 65  TPFSFQRFLIPELCHFTGRAIYLDADMQVFKDIAELWD--------HPLEGNHLQTVSTA 116

Query: 135 MDGAVQTVYPRKNWSSMVLYNCGHPKNKI------LTPEVVNTQTGAFLHRFQWLEDEEI 188
            DG       R++  S++L +C     K+      L    ++ QT  F  +       EI
Sbjct: 117 NDG-------RRSQFSVMLLDCARLDWKVEDIVARLDDGSLDYQTLMFDMKVA----REI 165

Query: 189 G-SIPFVWNFLEGH-----NMVVENDPTTFP-KAIHYTRGGPWFEAWKHC 231
           G  IP  WN LE +      ++   D  T P  ++H T G  W    +  
Sbjct: 166 GYGIPLKWNSLEAYMPDQTALLHYTDMNTQPWVSLHNTNGVLWISCLRRA 215


>gi|313575241|emb|CBI71183.1| hypothetical protein [uncultured bacterium]
          Length = 62

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 37/61 (60%)

Query: 17 NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
          ++   I++G+D RE +AY V   +ILK +SIPV + PIV  DL   G Y R+     STE
Sbjct: 2  DRSLDIYIGWDSREPIAYEVAKSTILKNASIPVNVHPIVLQDLVDKGAYTRDVDPLASTE 61

Query: 77 F 77
          F
Sbjct: 62 F 62


>gi|413932695|gb|AFW67246.1| hypothetical protein ZEAMMB73_571848, partial [Zea mays]
          Length = 69

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/41 (60%), Positives = 30/41 (73%), Gaps = 1/41 (2%)

Query: 211 TFPKAIHYTRGGPWFEAWKHCEFADL-WLKEMEEYKNEANK 250
           T   AIHYT GGPWF+ ++ CEFADL WLKE EE + E +K
Sbjct: 1   TSSSAIHYTSGGPWFQRYRDCEFADLIWLKEAEELRVEKDK 41


>gi|220909573|ref|YP_002484884.1| glycosyl transferase protein [Cyanothece sp. PCC 7425]
 gi|219866184|gb|ACL46523.1| glycosyl transferase family 8 [Cyanothece sp. PCC 7425]
          Length = 241

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/232 (21%), Positives = 102/232 (43%), Gaps = 36/232 (15%)

Query: 22  IFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFSR 81
           + +G +  + +  +V   +I +     V I    QS+ R  G           T+F F R
Sbjct: 4   VVIGTEDNQLIPQKVLEYTIRQHCKGEVDIRACRQSEPRMGG-----------TKFGFVR 52

Query: 82  FLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCV--------HHDYTPKETT 133
           F  P +  ++G A+++D D + L D++EL D +++++ +  V        +    P+  T
Sbjct: 53  FHVPSIFNYEGKAIYLDADQIVLTDLQELVDHLEEQFCLALVKDPVGFFGNKPAPPRYET 112

Query: 134 KMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSIPF 193
            +   V      K+W    L+N   P + +L P  ++ +   +L   +W++  EI  +  
Sbjct: 113 SV--MVLNCDKLKHWQPQTLFNNVVPNDSVLQPGQIHYRDFMWL---EWMDKSEIQLLEP 167

Query: 194 VWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCE--FADLWLKEMEE 243
            WN    H  +++ D     K IH++      + WK+ +    + W K +++
Sbjct: 168 GWN----HFNILKED----TKLIHFSHVAS--QPWKNPQHPLTEFWAKWLQQ 209


>gi|319788045|ref|YP_004147520.1| hypothetical protein Psesu_2457 [Pseudoxanthomonas suwonensis 11-1]
 gi|317466557|gb|ADV28289.1| hypothetical protein Psesu_2457 [Pseudoxanthomonas suwonensis 11-1]
          Length = 479

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 4/94 (4%)

Query: 17  NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTE 76
           N P ++FVG D  + L  ++   SI K +S+ VK+ PI    +        +      T 
Sbjct: 210 NAPVRVFVGTDAAQMLGVKMLEYSIKKFASLSVKVEPIDDRGIPVPA----DPANRSRTG 265

Query: 77  FSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
           FSFSRF  P L  + G  ++VD D    +DI +L
Sbjct: 266 FSFSRFKIPELCRYHGRGIYVDADMQVFSDILDL 299


>gi|81301103|ref|YP_401311.1| hypothetical protein Synpcc7942_2294 [Synechococcus elongatus PCC
           7942]
 gi|81169984|gb|ABB58324.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 278

 Score = 50.4 bits (119), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 96/249 (38%), Gaps = 53/249 (21%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKIT--------------PIVQSDLRKSGL 64
           P +IF+G      +  +V   S+ + +  P+ IT              P   + +     
Sbjct: 4   PIRIFIGSGEASRIERQVAIDSLQRMTDRPLAITVFNGTHNALEVLGEPPRPAPMSLRVK 63

Query: 65  YWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVH 124
           YW       +TEFS  RFL P L G+ G A+++D D + L DI EL D  D K A     
Sbjct: 64  YW------NATEFSLYRFLIPELCGYQGRAIYIDSDVVCLHDIGELFD-TDLKGADFLAK 116

Query: 125 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLH------ 178
            +  P     + G            S+VL++C   +   L   +   + G + +      
Sbjct: 117 AEAYPDRAGALWGL-----------SVVLFDCDRCRFP-LEQFIDELEAGVYDYRDLLCM 164

Query: 179 --RFQWLEDEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYT--RGGPWFEAWKHCEFA 234
             RF      +IG++   WN  +  +           + IHYT  +  PW        + 
Sbjct: 165 GPRFLNKHPYQIGALDPRWNSFDFADRET--------RLIHYTDLKRQPWKRPGH--PYG 214

Query: 235 DLWLKEMEE 243
           DLW K M++
Sbjct: 215 DLWFKAMQQ 223


>gi|422319914|ref|ZP_16400987.1| family 8 glycosyl transferase [Achromobacter xylosoxidans C54]
 gi|317405367|gb|EFV85685.1| family 8 glycosyl transferase [Achromobacter xylosoxidans C54]
          Length = 291

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 42/120 (35%), Positives = 58/120 (48%), Gaps = 12/120 (10%)

Query: 17  NKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKS--------GLYW 66
           N   +IFVG DP + DL    V   S  K +S PV+IT +  S    S        GL W
Sbjct: 2   NDAIRIFVGCDPNDCDLEQMMVLDYSARKHASRPVEITWMRLSRDPASPWYCDPAHGLGW 61

Query: 67  RERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHD 126
           R   +  +T FS  R+  P  AGF G A ++D D L L D+ E+ D+   + AI+    +
Sbjct: 62  RT--EKWATPFSALRWAVPAAAGFQGRAFYMDADMLVLCDLAEIWDMPLAEGAIVAARRE 119


>gi|56751815|ref|YP_172516.1| hypothetical protein syc1806_c [Synechococcus elongatus PCC 6301]
 gi|56686774|dbj|BAD79996.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 278

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 96/249 (38%), Gaps = 53/249 (21%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKIT--------------PIVQSDLRKSGL 64
           P +IF+G      +  +V   S+ + +  P+ IT              P   + +     
Sbjct: 4   PIRIFIGSGEASRIERQVAIDSLQRMTDRPLAITAFNGTHNALEVLGEPPRPAPMSLRVK 63

Query: 65  YWRERGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVH 124
           YW       +TEFS  RFL P L G+ G A+++D D + L DI EL D  D + A     
Sbjct: 64  YW------NATEFSLYRFLIPELCGYQGRAIYIDSDVVCLHDIGELFD-TDLEGADFLAK 116

Query: 125 HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLH------ 178
            +  P     + G            S+VL++C   +   L   +   + G + +      
Sbjct: 117 AEAYPDRAGALWGL-----------SVVLFDCDRCRFP-LEQFIDELEAGVYDYRDLPCM 164

Query: 179 --RFQWLEDEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYT--RGGPWFEAWKHCEFA 234
             RF      +IG++   WN  +  +           + IHYT  +  PW        + 
Sbjct: 165 GPRFLNKRPYQIGALDPRWNSFDFADRET--------RLIHYTDLKRQPWKRPGH--PYG 214

Query: 235 DLWLKEMEE 243
           DLW K M++
Sbjct: 215 DLWFKAMQQ 223


>gi|258541421|ref|YP_003186854.1| hypothetical protein APA01_03220 [Acetobacter pasteurianus IFO
           3283-01]
 gi|384041342|ref|YP_005480086.1| hypothetical protein APA12_03220 [Acetobacter pasteurianus IFO
           3283-12]
 gi|384049857|ref|YP_005476920.1| hypothetical protein APA03_03220 [Acetobacter pasteurianus IFO
           3283-03]
 gi|384052967|ref|YP_005486061.1| hypothetical protein APA07_03220 [Acetobacter pasteurianus IFO
           3283-07]
 gi|384056199|ref|YP_005488866.1| hypothetical protein APA22_03220 [Acetobacter pasteurianus IFO
           3283-22]
 gi|384058840|ref|YP_005497968.1| hypothetical protein APA26_03220 [Acetobacter pasteurianus IFO
           3283-26]
 gi|384062134|ref|YP_005482776.1| hypothetical protein APA32_03220 [Acetobacter pasteurianus IFO
           3283-32]
 gi|384118210|ref|YP_005500834.1| hypothetical protein APA42C_03220 [Acetobacter pasteurianus IFO
           3283-01-42C]
 gi|256632499|dbj|BAH98474.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01]
 gi|256635556|dbj|BAI01525.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-03]
 gi|256638611|dbj|BAI04573.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-07]
 gi|256641665|dbj|BAI07620.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-22]
 gi|256644720|dbj|BAI10668.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-26]
 gi|256647775|dbj|BAI13716.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-32]
 gi|256650828|dbj|BAI16762.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01-42C]
 gi|256653819|dbj|BAI19746.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-12]
          Length = 288

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 54/107 (50%), Gaps = 8/107 (7%)

Query: 12  RETKDNKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIV----QSDLRKSGLY 65
           ++ + N   K+F+G DP + DL    V H SI K SS+PV IT +      S    SGL 
Sbjct: 2   KDLRMNTVVKLFIGCDPNDCDLEQLMVLHYSITKYSSLPVDITFMQLTRDPSSFWYSGLK 61

Query: 66  WRERGQTES--TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
             E   T    T FS  R+  P  +G  G A+++D D   L+DI EL
Sbjct: 62  PEEGWNTTEWPTPFSGFRWGIPAYSGGKGRALYMDADIFALSDIAEL 108


>gi|358010759|ref|ZP_09142569.1| hypothetical protein AP8-3_04515 [Acinetobacter sp. P8-3-8]
          Length = 301

 Score = 46.6 bits (109), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 54/215 (25%), Positives = 88/215 (40%), Gaps = 30/215 (13%)

Query: 20  FKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTE---- 73
            KIFVG DP   DL    V   SI K +  PV+IT +  S  + S  Y  +   T     
Sbjct: 6   IKIFVGCDPNNCDLEQMMVLDYSIRKNTQHPVEITWMQLSHDQNSPWYSDQIKHTGWCTE 65

Query: 74  --STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKE 131
             +T FS  R+  P L  F G A+++D D + L DI EL +   D+  ++        K 
Sbjct: 66  KWATPFSGFRWAIPELCNFQGRAIYMDADVIVLTDIAELWNTAFDQNTMVIAK---GGKS 122

Query: 132 TTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGSI 191
           T ++                 ++NC   +N +   + +     A     + ++++     
Sbjct: 123 TARL--------------CTCIWNCAVAENYLPKLQQLKQDPNAHKKLMKLIKEQS---- 164

Query: 192 PFVWNFLEGHNMV-VENDPTTFPKAIHYTRGGPWF 225
             V N+ + +N +  E  P    K +HY+  G  F
Sbjct: 165 NLVQNYTDNYNCIDGEELPINEIKILHYSDMGTQF 199


>gi|421853057|ref|ZP_16285738.1| hypothetical protein APS_1543 [Acetobacter pasteurianus subsp.
           pasteurianus LMG 1262 = NBRC 106471]
 gi|371478767|dbj|GAB30941.1| hypothetical protein APS_1543 [Acetobacter pasteurianus subsp.
           pasteurianus LMG 1262 = NBRC 106471]
          Length = 283

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 50/102 (49%), Gaps = 8/102 (7%)

Query: 17  NKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIV----QSDLRKSGLYWRERG 70
           N   K+FVG DP + DL    V H SI K SS+PV IT +      S    SGL   E  
Sbjct: 2   NAVVKLFVGCDPNDCDLEQLMVLHYSITKYSSLPVDITFMQLTRDPSSFWYSGLNPEEGW 61

Query: 71  QTES--TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            T    T FS  R+  P   G  G A+++D D   L+DI EL
Sbjct: 62  NTTEWPTPFSGFRWGIPAYCGGKGRALYMDADIFALSDIAEL 103


>gi|421849364|ref|ZP_16282345.1| hypothetical protein APT_1038 [Acetobacter pasteurianus NBRC
           101655]
 gi|371459838|dbj|GAB27548.1| hypothetical protein APT_1038 [Acetobacter pasteurianus NBRC
           101655]
          Length = 283

 Score = 45.8 bits (107), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 50/102 (49%), Gaps = 8/102 (7%)

Query: 17  NKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIV----QSDLRKSGLYWRERG 70
           N   K+F+G DP + DL    V H SI K SS+PV IT +      S    SGL   E  
Sbjct: 2   NTVVKLFIGCDPNDCDLEQLMVLHYSITKYSSLPVDITFMQLTRDPSSFWYSGLKPEEGW 61

Query: 71  QTES--TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            T    T FS  R+  P   G  G A+++D D   L+DI EL
Sbjct: 62  NTTEWPTPFSGFRWGIPAYCGGKGRALYMDADIFALSDIAEL 103


>gi|381197763|ref|ZP_09905102.1| hypothetical protein AlwoW_10934 [Acinetobacter lwoffii WJ10621]
          Length = 301

 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 8/103 (7%)

Query: 16  DNKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRE----- 68
           +N+  ++FVG DP   DL    V   S+ K +S PV IT +  S    S  Y  +     
Sbjct: 2   NNEVIQVFVGCDPNNCDLEQMMVLEYSLRKHTSHPVNITWMQLSHDTNSPWYTNQETGEG 61

Query: 69  -RGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            R +  +T FS  R+  P    F+G A+++D D + L DI EL
Sbjct: 62  WRTEKWATPFSGFRWAIPESCNFEGRAIYMDADVIVLCDIAEL 104


>gi|329115006|ref|ZP_08243761.1| Hypothetical protein APO_1816 [Acetobacter pomorum DM001]
 gi|326695449|gb|EGE47135.1| Hypothetical protein APO_1816 [Acetobacter pomorum DM001]
          Length = 288

 Score = 45.4 bits (106), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 12/112 (10%)

Query: 12  RETKDNKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRER 69
           ++ + N   K+F+G DP + DL    V H SI K SS+PV IT +  +  R    +W   
Sbjct: 2   KDLRMNAVVKLFIGCDPNDCDLEQLMVLHYSITKYSSLPVDITFMQLT--RDPSSFWYSG 59

Query: 70  GQTES--------TEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDL 113
            + E         T FS  R+  P   G  G A+++D D   L+DI +L +L
Sbjct: 60  FKPEEGWNTTEWPTPFSGFRWGIPAYCGGKGRALYMDADIFALSDIAKLWNL 111


>gi|257455409|ref|ZP_05620644.1| conserved hypothetical protein [Enhydrobacter aerosaccus SK60]
 gi|257447371|gb|EEV22379.1| conserved hypothetical protein [Enhydrobacter aerosaccus SK60]
          Length = 289

 Score = 45.4 bits (106), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 50/104 (48%), Gaps = 12/104 (11%)

Query: 17  NKPFKIFVGYDPR-EDL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTE- 73
           ++P KIF+G DP   DL    V   S+ K +S PV+I  +  S  R     W    QT  
Sbjct: 5   DQPIKIFIGCDPNNSDLEQMMVLDYSLHKHTSEPVEIIWMQLS--RDPSSVWYSNPQTGE 62

Query: 74  -------STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
                  +T FS  R+  P   GF G A+++D D + L DI +L
Sbjct: 63  GWHTEKWATPFSGFRWAIPEYCGFSGRAIYMDADMVVLDDIAKL 106


>gi|226954445|ref|ZP_03824909.1| family 8 glycosyl transferase [Acinetobacter sp. ATCC 27244]
 gi|406038079|ref|ZP_11045443.1| hypothetical protein AparD1_14278 [Acinetobacter parvus DSM 16617 =
           CIP 108168]
 gi|226834794|gb|EEH67177.1| family 8 glycosyl transferase [Acinetobacter sp. ATCC 27244]
          Length = 320

 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 51/103 (49%), Gaps = 8/103 (7%)

Query: 16  DNKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLY----WRER 69
           +N+  K+FVG DP   DL    V   SI K +  PV+I  +  S   KS  Y     RE 
Sbjct: 20  NNEVIKVFVGCDPNNCDLEQMMVLDYSIRKHTQHPVEIVWMQLSHDPKSPWYSNQETREG 79

Query: 70  GQTE--STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
             TE  +T FS  R+  P    F G A+++D D + L DI EL
Sbjct: 80  WHTEKWATPFSGFRWAIPESCNFQGRAIYMDADVVVLCDIAEL 122


>gi|294651462|ref|ZP_06728775.1| glycosyl transferase [Acinetobacter haemolyticus ATCC 19194]
 gi|292822612|gb|EFF81502.1| glycosyl transferase [Acinetobacter haemolyticus ATCC 19194]
          Length = 320

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 51/103 (49%), Gaps = 8/103 (7%)

Query: 16  DNKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLY----WRER 69
           +N+  K+FVG DP   DL    V   SI K +  PV+I  +  S   KS  Y     RE 
Sbjct: 20  NNEVIKVFVGCDPNNCDLEQMMVLDYSIRKHTQHPVEIVWMQLSHDPKSPWYSNQETREG 79

Query: 70  GQTE--STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
             TE  +T FS  R+  P    F G A+++D D + L DI EL
Sbjct: 80  WHTEKWATPFSGFRWAIPESCNFQGRAIYMDADVVVLCDIAEL 122


>gi|333367690|ref|ZP_08459936.1| hypothetical protein HMPREF9373_0341 [Psychrobacter sp. 1501(2011)]
 gi|332978466|gb|EGK15179.1| hypothetical protein HMPREF9373_0341 [Psychrobacter sp. 1501(2011)]
          Length = 286

 Score = 45.1 bits (105), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 12/104 (11%)

Query: 17  NKPFKIFVGYDPR-EDL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWR---ERG- 70
           N+  K+FVG DP   DL    V   SI K ++ PV+I  +  S  R    +W    E+G 
Sbjct: 3   NETIKVFVGCDPNNSDLEQMMVLDYSIKKHTTAPVEIVWMQLS--RDPESFWYSNPEKGE 60

Query: 71  ----QTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
               Q  +T FS  R+  P    F+G A+++D D + L+D+K+L
Sbjct: 61  GWNTQKWATPFSGFRWAIPEYCNFEGRAIYMDADMVILSDLKDL 104


>gi|386824884|ref|ZP_10112013.1| hypothetical protein Q5A_11766 [Serratia plymuthica PRI-2C]
 gi|386378329|gb|EIJ19137.1| hypothetical protein Q5A_11766 [Serratia plymuthica PRI-2C]
          Length = 292

 Score = 45.1 bits (105), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 48/98 (48%), Gaps = 11/98 (11%)

Query: 22  IFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERG-----QTE- 73
           +F+G DP + DL    V   S+ K +SIPV I  +  S  R    +W   G      TE 
Sbjct: 12  VFIGCDPNDCDLEQMMVLEYSLRKHASIPVHIEWMRLS--RDPASFWHSSGAGVGWNTEC 69

Query: 74  -STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            +T FS  R+  P    F G A + D D LYL+DI E+
Sbjct: 70  WATPFSAFRWGIPAFCNFQGRAFYSDADVLYLSDIAEI 107


>gi|445420896|ref|ZP_21435718.1| glycosyltransferase family 8 domain protein [Acinetobacter sp.
           WC-743]
 gi|444758463|gb|ELW82963.1| glycosyltransferase family 8 domain protein [Acinetobacter sp.
           WC-743]
          Length = 301

 Score = 44.3 bits (103), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 50/102 (49%), Gaps = 8/102 (7%)

Query: 17  NKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQ--- 71
           N   KIFVG DP   DL    V   SI K +  PV+IT +  S    S  +  +      
Sbjct: 3   NDVIKIFVGCDPNNCDLEQMMVLDYSIRKNTQYPVEITWMQLSHDPDSPWFSDQTHHLGW 62

Query: 72  -TE--STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            TE  +T FS  R+  P L  F+G A+++D D + L DI EL
Sbjct: 63  HTEKWATPFSGFRWAIPQLCNFEGRAIYMDADVIVLTDIAEL 104


>gi|403050008|ref|ZP_10904492.1| hypothetical protein AberL1_00345 [Acinetobacter bereziniae LMG
           1003]
          Length = 301

 Score = 44.3 bits (103), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 50/102 (49%), Gaps = 8/102 (7%)

Query: 17  NKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQ--- 71
           N   KIFVG DP   DL    V   SI K +  PV+IT +  S    S  +  +      
Sbjct: 3   NDVIKIFVGCDPNNCDLEQMMVLDYSIRKNTQYPVEITWMQLSHDPDSPWFSDQTHHLGW 62

Query: 72  -TE--STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            TE  +T FS  R+  P L  F+G A+++D D + L DI EL
Sbjct: 63  HTEKWATPFSGFRWAIPQLCNFEGRAIYMDADVIVLTDIAEL 104


>gi|434392115|ref|YP_007127062.1| glycosyl transferase family 8 [Gloeocapsa sp. PCC 7428]
 gi|428263956|gb|AFZ29902.1| glycosyl transferase family 8 [Gloeocapsa sp. PCC 7428]
          Length = 290

 Score = 43.9 bits (102), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 81/185 (43%), Gaps = 36/185 (19%)

Query: 74  STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETT 133
           +TEFS  RF+ P +  + G A++VD D + L DI +L D   +     C   D+  K+  
Sbjct: 68  ATEFSLYRFIIPEVCNYQGKAIYVDSDIVSLTDIGQLFDTPMNS----C---DFLAKK-- 118

Query: 134 KMDGAVQTVYPRKNWS-SMVLYNCGHPKNKILT-----PEVVNTQTGAFLHRFQWLEDE- 186
              GA +       W  S++L +C   +  + T      + + TQT   +    +L    
Sbjct: 119 ---GAQKGYRGSNFWGLSVMLIDCEKSQFNLETIFNEIDQKLYTQTDFMVMNQTFLTHHP 175

Query: 187 -EIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRGGPWFEAWKHCE--FADLWLKEMEE 243
             IG +   WN L+        D  T  K IHYT  G + + WK+    + D+W K    
Sbjct: 176 YTIGELDPGWNMLDSW------DKNT--KLIHYT--GLFSQPWKYPNHPYGDIWFK---- 221

Query: 244 YKNEA 248
           Y NEA
Sbjct: 222 YFNEA 226


>gi|56751822|ref|YP_172523.1| hypothetical protein syc1813_d [Synechococcus elongatus PCC 6301]
 gi|81301096|ref|YP_401304.1| hypothetical protein Synpcc7942_2287 [Synechococcus elongatus PCC
           7942]
 gi|56686781|dbj|BAD80003.1| unknown protein [Synechococcus elongatus PCC 6301]
 gi|81169977|gb|ABB58317.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 324

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 16/106 (15%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQ------- 71
           P ++F+G   R  +  RV   SI + +  P++    V S L   G   R  G        
Sbjct: 5   PIRLFIGSSARNQVEERVLVYSIQRLTQRPLE----VWSLLGDRGTVRRPDGSELPLPEA 60

Query: 72  -----TESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCD 112
                  +T FS +R+  P+  G++G A++ D D +   D+ EL D
Sbjct: 61  IRDRLGGATRFSTARYAIPHFCGYEGRAIYCDSDQILFTDLAELFD 106


>gi|332705755|ref|ZP_08425831.1| lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase
           [Moorea producens 3L]
 gi|332355547|gb|EGJ35011.1| lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase
           [Moorea producens 3L]
          Length = 277

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%)

Query: 74  STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCD 112
           +TEFS  R+L P L  + G A+++D D + L DI EL D
Sbjct: 69  TTEFSLYRYLIPQLCNYQGKAIYIDSDTICLTDIGELLD 107


>gi|262371672|ref|ZP_06064951.1| predicted protein [Acinetobacter junii SH205]
 gi|262311697|gb|EEY92782.1| predicted protein [Acinetobacter junii SH205]
          Length = 302

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 49/99 (49%), Gaps = 8/99 (8%)

Query: 20  FKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQ----TE 73
            K+FVG DP   DL    V   SI K +  PV+IT +  S    S  Y  +       TE
Sbjct: 8   IKVFVGCDPNNCDLEQMMVLDYSIRKNTQHPVEITWMQLSHDPASPWYSNQETHEGWHTE 67

Query: 74  --STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
             +T FS  R+  P  + F+G ++++D D + L DI EL
Sbjct: 68  KWATPFSGFRWAIPESSNFEGRSIYMDADVIVLCDIAEL 106


>gi|389704855|ref|ZP_10185959.1| hypothetical protein HADU_03034 [Acinetobacter sp. HA]
 gi|388611099|gb|EIM40208.1| hypothetical protein HADU_03034 [Acinetobacter sp. HA]
          Length = 300

 Score = 41.2 bits (95), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 49/99 (49%), Gaps = 8/99 (8%)

Query: 20  FKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQ----TE 73
            K+FVG DP   DL    V   SI K +  PV+IT +  S    S  Y  +       TE
Sbjct: 6   IKVFVGCDPNNCDLEQMMVLDYSIRKNTQHPVEITWMQLSHDPASPWYSNQETHEGWHTE 65

Query: 74  --STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
             +T FS  R+  P  + F+G ++++D D + L DI EL
Sbjct: 66  KWATPFSGFRWAIPESSNFEGRSIYMDADVIVLCDIAEL 104


>gi|33864190|ref|NP_895750.1| hypothetical protein PMT1924 [Prochlorococcus marinus str. MIT
          9313]
 gi|33635774|emb|CAE22099.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
          9313]
          Length = 101

 Score = 41.2 bits (95), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 30/58 (51%), Gaps = 1/58 (1%)

Query: 17 NKPFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSG-LYWRERGQTE 73
            P  IF+GYD RE  A  V   S+ + SS+P+ ITP+    L   G L  R R + E
Sbjct: 3  TAPIPIFIGYDSRERAATNVLIDSLYQHSSVPLAITPLETPQLEAQGPLPTRARPEAE 60


>gi|425745654|ref|ZP_18863697.1| hypothetical protein ACINWC323_3192 [Acinetobacter baumannii
           WC-323]
 gi|425488092|gb|EKU54432.1| hypothetical protein ACINWC323_3192 [Acinetobacter baumannii
           WC-323]
          Length = 300

 Score = 41.2 bits (95), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 48/102 (47%), Gaps = 8/102 (7%)

Query: 17  NKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERG---- 70
           N   KIFVG DP   DL    V   S+ K +  PV+I  +  S    S  Y  ++     
Sbjct: 3   NSTIKIFVGCDPNNCDLEQMMVLDYSVRKHTKHPVEIIWMQLSHDPNSPWYTNQQTGEGW 62

Query: 71  QTE--STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            TE  +T FS  R+  P    F G A+++D D + L DI EL
Sbjct: 63  HTEHWATPFSGFRWAIPESCNFQGRAIYMDADVVVLCDIAEL 104


>gi|226946462|ref|YP_002801535.1| hypothetical protein Avin_44420 [Azotobacter vinelandii DJ]
 gi|226721389|gb|ACO80560.1| Conserved hypothetical protein [Azotobacter vinelandii DJ]
          Length = 779

 Score = 40.8 bits (94), Expect = 0.61,   Method: Composition-based stats.
 Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 6/92 (6%)

Query: 21  KIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFSFS 80
           +IF+G +P +  A R+   S L+R   P++   +     R +GL   +R +   T F+  
Sbjct: 64  RIFLGTEPAQHRAERIFFYS-LERVRDPLRRYEV----YRMTGLPGFDR-KGWRTNFTNF 117

Query: 81  RFLTPYLAGFDGWAVFVDCDFLYLADIKELCD 112
           RF  P LAG  G A++ D D ++ AD  EL D
Sbjct: 118 RFAIPDLAGRQGRAIYTDVDQIFTADPAELFD 149


>gi|359793228|ref|ZP_09295993.1| hypothetical protein MAXJ12_27053, partial [Mesorhizobium alhagi
           CCNWXJ12-2]
 gi|359250580|gb|EHK54056.1| hypothetical protein MAXJ12_27053, partial [Mesorhizobium alhagi
           CCNWXJ12-2]
          Length = 41

 Score = 40.4 bits (93), Expect = 0.68,   Method: Composition-based stats.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 102 LYLADIKELCDLIDDKYAIMCVHHDYTPKE 131
           L+L DI EL + +D   A +CV HDYTPKE
Sbjct: 8   LFLDDIAELTEYLDLSKAALCVKHDYTPKE 37


>gi|291334842|gb|ADD94482.1| hypothetical protein [uncultured phage MedDCM-OCT-S06-C1041]
          Length = 120

 Score = 40.0 bits (92), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 162 KILTPEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLEGHNMVVENDPTTFPKAIHYTRG 221
           K LT + +N ++      F W +   I  IP  +NF  G++ V E      P  +HYT G
Sbjct: 9   KKLTADYINDESNT-PQNFDWAD--SINDIPQKYNFCVGYDEVTEP-----PSIVHYTAG 60

Query: 222 GPWFEAWKHCEFADLWLKEME 242
            P F   + C++A  W  E E
Sbjct: 61  IPHFPETRDCDYAKEWWNEYE 81


>gi|297621531|ref|YP_003709668.1| hypothetical protein wcw_1310 [Waddlia chondrophila WSU 86-1044]
 gi|297376832|gb|ADI38662.1| conserved hypothetical protein [Waddlia chondrophila WSU 86-1044]
 gi|337292895|emb|CCB90895.1| putative uncharacterized protein [Waddlia chondrophila 2032/99]
          Length = 269

 Score = 39.7 bits (91), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 70  GQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHH 125
           G+  +T F+  RF  P L  F+G A+ +D DF+   D++++ + ID  + IM  HH
Sbjct: 80  GEGWATPFTCFRFAIPELCSFEGRAIHMDADFIVQNDLRKIFE-IDMTHPIMSPHH 134


>gi|302926322|ref|XP_003054272.1| hypothetical protein NECHADRAFT_74821 [Nectria haematococca mpVI
           77-13-4]
 gi|256735213|gb|EEU48559.1| hypothetical protein NECHADRAFT_74821 [Nectria haematococca mpVI
           77-13-4]
          Length = 370

 Score = 39.3 bits (90), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 11/82 (13%)

Query: 124 HHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAF--LHRFQ 181
           H D   +ETTKMDG+  TV+               P   +L PEVVN  TG F    R +
Sbjct: 171 HADEVFQETTKMDGSSMTVF---------YVRADSPHYPLLAPEVVNDPTGCFGVCSRNR 221

Query: 182 WLEDEEIGSIPFVWNFLEGHNM 203
            L +  I S P  W+ +   N+
Sbjct: 222 TLIEGHIRSPPLFWSTVRKLNL 243


>gi|359429852|ref|ZP_09220870.1| hypothetical protein ACT4_031_00020 [Acinetobacter sp. NBRC 100985]
 gi|358234645|dbj|GAB02409.1| hypothetical protein ACT4_031_00020 [Acinetobacter sp. NBRC 100985]
          Length = 301

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 50/103 (48%), Gaps = 8/103 (7%)

Query: 16  DNKPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWR----ER 69
           +++  KIFVG DP   DL    V   S+ K +  PV+I  +  S   +S  Y      E 
Sbjct: 2   NSEAIKIFVGCDPNNCDLEQMMVLDYSVRKHTQHPVEIIWMQLSHDPESPWYTNQATGEG 61

Query: 70  GQTE--STEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
             TE  +T FS  R+  P    F G A+++D D + L DI EL
Sbjct: 62  WHTEKWATPFSGFRWAIPESCNFQGRAIYMDADVVVLCDIVEL 104


>gi|392951371|ref|ZP_10316926.1| family 8 glycosyl transferase [Hydrocarboniphaga effusa AP103]
 gi|391860333|gb|EIT70861.1| family 8 glycosyl transferase [Hydrocarboniphaga effusa AP103]
          Length = 293

 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 12/101 (11%)

Query: 20  FKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKS--------GLYWRER 69
            +IFVG DP + DL    V   S+   +S+P+++  +  S   +S        G  WR  
Sbjct: 11  IRIFVGCDPNDCDLEQMMVLEHSLRAHASLPIELHWMRLSRDPESLWYSNPETGAGWRT- 69

Query: 70  GQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            +  +T FS  R+  P   G+ G A+++D D L L+DI  L
Sbjct: 70  -ERWATPFSGFRWAVPAACGYRGRALYMDADTLVLSDIARL 109


>gi|407006446|gb|EKE22356.1| hypothetical protein ACD_6C00859G0009 [uncultured bacterium]
          Length = 299

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 46/103 (44%), Gaps = 12/103 (11%)

Query: 18  KPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRE------- 68
           K  KIF+G DP   DL    V   SI K +  PV+I  +  S  R    YW         
Sbjct: 4   KTIKIFIGCDPNNCDLEQMMVLDYSIKKHTQQPVEIIWMQLS--RDPSSYWYSDPKSNQG 61

Query: 69  -RGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            R +  +T FS  R+  P    F G A+++D D + L D+  L
Sbjct: 62  WRTENWATPFSGFRWAIPEYCQFQGKAIYMDTDVIVLDDLALL 104


>gi|88811562|ref|ZP_01126816.1| Predicted nucleoside-diphosphate-sugar epimerase [Nitrococcus
           mobilis Nb-231]
 gi|88790953|gb|EAR22066.1| Predicted nucleoside-diphosphate-sugar epimerase [Nitrococcus
           mobilis Nb-231]
          Length = 851

 Score = 38.5 bits (88), Expect = 2.4,   Method: Composition-based stats.
 Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 10/117 (8%)

Query: 19  PFKIFVGYDPREDLAYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRERGQTESTEFS 78
           P +IF+G +  +  A RV   SI K      +    +  DL      W   G      F+
Sbjct: 45  PVRIFLGTEGAQFRAERVFVWSIAKVRDPARRYEIYLLKDLPGFDRRWWLTG------FT 98

Query: 79  FSRFLTPYLAGFDGWAVFVDCDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKM 135
             RF  P LAG  G A++ D D +YLAD  EL D     +  + +     P++T+ M
Sbjct: 99  AYRFAIPELAGGVGRAIYNDTDQIYLADPAELFDTPMQGHGFLSIQ----PRDTSVM 151


>gi|262374907|ref|ZP_06068141.1| predicted protein [Acinetobacter lwoffii SH145]
 gi|262309920|gb|EEY91049.1| predicted protein [Acinetobacter lwoffii SH145]
          Length = 299

 Score = 38.5 bits (88), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 46/103 (44%), Gaps = 12/103 (11%)

Query: 18  KPFKIFVGYDPRE-DL-AYRVCHRSILKRSSIPVKITPIVQSDLRKSGLYWRE------- 68
           K  KIF+G DP   DL    V   SI K +  PV+I  +  S  R    YW         
Sbjct: 4   KTIKIFIGCDPNNCDLEQMMVLDYSIKKHTQQPVEIIWMQLS--RDPSSYWYSDPKSNQG 61

Query: 69  -RGQTESTEFSFSRFLTPYLAGFDGWAVFVDCDFLYLADIKEL 110
            R +  +T FS  R+  P    F G A+++D D + L D+  L
Sbjct: 62  WRTENWATPFSGFRWAIPEYCQFQGKAIYMDTDVIVLDDLALL 104


>gi|54291777|gb|AAV32146.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1029

 Score = 37.4 bits (85), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 43/177 (24%), Positives = 71/177 (40%), Gaps = 28/177 (15%)

Query: 89  GFDGWAVFVD--CDFLYLADIKELCDLIDDKYAIMCVHHDYTPKETTKMDGAVQTVYPRK 146
           G+D +  F D    + Y+  IKE  + +D K+ I     +       K+D       P +
Sbjct: 431 GYDSFITFTDDYSRYGYIYPIKERSEALD-KFKIFKAEVENQHDIKIKVDSTEMVATPEE 489

Query: 147 NWSSMVLYNCGHPKNKILT-------PEVVNTQTGAFLHRFQWLEDEEIGSIPFVWNFLE 199
                      HP+   +T       P+V +        R + +    I     V+N  E
Sbjct: 490 ELQ--------HPQTDNVTVQETQQEPQVQDVPNVEAPRRSERVRRSAIRDDYKVYNIEE 541

Query: 200 GHNMVVENDPTTFPKAIHYTRGGPWFEA----WKHCEFADLWLKEMEEYKNEANKTV 252
            H   +E+DPT++ +A+   R   W EA     K  +  D+W  ++EE  N A KT+
Sbjct: 542 SH---MEDDPTSYEEAMRSARSSEWLEAMKDEMKSMKLNDVW--DLEEIPNGA-KTI 592


>gi|164688674|ref|ZP_02212702.1| hypothetical protein CLOBAR_02320 [Clostridium bartlettii DSM
           16795]
 gi|164602150|gb|EDQ95615.1| glycosyltransferase, group 2 family protein [Clostridium bartlettii
           DSM 16795]
          Length = 365

 Score = 37.0 bits (84), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 10/75 (13%)

Query: 136 DGAVQTVYPRKNWSSMVLYNCGHPKNKILTPEVVNTQTGAFLHRFQWLEDEEIGS---IP 192
           DGA+  +YP+ N S  ++Y       K+L P +V     +F   + W   E + S    P
Sbjct: 127 DGAINKIYPKLNKS--IIYE--EDIKKLLIPSIVGI---SFEKLYDWFSGESLTSNNEFP 179

Query: 193 FVWNFLEGHNMVVEN 207
            VW FL   N++ +N
Sbjct: 180 SVWRFLYSRNIIQKN 194


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.137    0.446 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,417,026,303
Number of Sequences: 23463169
Number of extensions: 186558631
Number of successful extensions: 333620
Number of sequences better than 100.0: 148
Number of HSP's better than 100.0 without gapping: 118
Number of HSP's successfully gapped in prelim test: 30
Number of HSP's that attempted gapping in prelim test: 333294
Number of HSP's gapped (non-prelim): 148
length of query: 254
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 115
effective length of database: 9,097,814,876
effective search space: 1046248710740
effective search space used: 1046248710740
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)