BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 048617
         (281 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255566896|ref|XP_002524431.1| conserved hypothetical protein [Ricinus communis]
 gi|223536315|gb|EEF37966.1| conserved hypothetical protein [Ricinus communis]
          Length = 278

 Score =  320 bits (821), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 166/227 (73%), Positives = 192/227 (84%), Gaps = 7/227 (3%)

Query: 48  KIPFLTFPSNPRRFLKIPIMSRTNHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGT 107
           +IPF+  PSN RRFLK+P MS + +   +H FATPQSLSDWLKPRLPSDSFASWGVKPGT
Sbjct: 50  RIPFICLPSNRRRFLKVPAMSISKN---NHPFATPQSLSDWLKPRLPSDSFASWGVKPGT 106

Query: 108 KNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRP 167
           KNV NLWLELS+GET LAD+ PP+RTV+VVTVR+IG+ N +LVESHQELSDGSVRNRCRP
Sbjct: 107 KNVQNLWLELSEGETSLADSTPPIRTVSVVTVRVIGEKNTVLVESHQELSDGSVRNRCRP 166

Query: 168 LAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYV 227
           L+EKMK NETPE AV RA+KEELGSILND   VRIV GSY +K++ERNSLSYPGLPARYV
Sbjct: 167 LSEKMKPNETPEAAVFRAIKEELGSILNDDANVRIVDGSYKEKIEERNSLSYPGLPARYV 226

Query: 228 LHSVDAFVDGLPEEGEFCTEEAEEYADSS---AADKALFVKRHYWKW 271
           L+S+D  VDGLP + EF TEE EEY+DS     ADKA+ V++H+WKW
Sbjct: 227 LYSMDVIVDGLPND-EFSTEEVEEYSDSEDKRTADKAVSVRKHFWKW 272


>gi|359490753|ref|XP_003634156.1| PREDICTED: uncharacterized protein LOC100854009 [Vitis vinifera]
          Length = 292

 Score =  316 bits (810), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 168/253 (66%), Positives = 195/253 (77%), Gaps = 6/253 (2%)

Query: 22  PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQLHHHSFAT 81
           P     A S+L LF  S P   T   KI  L+FP NPRRFLK P M   ++      F++
Sbjct: 37  PDLFFTALSLLFLFP-SSPKPQTLFSKISSLSFPQNPRRFLKTPTMISPSNPKPITQFSS 95

Query: 82  PQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRI 141
           PQSL+DWLK RLPSDSFASWGVKPGTKNVHNLWLELS+GET LAD+IPP+RTV VVTVRI
Sbjct: 96  PQSLTDWLKHRLPSDSFASWGVKPGTKNVHNLWLELSEGETSLADSIPPIRTVRVVTVRI 155

Query: 142 IGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVR 201
           I +DN+IL+ESHQELSDG++R RCRPL+EKMK  ET E AV RAVKEELGSI+  +  VR
Sbjct: 156 ISEDNRILLESHQELSDGTIRERCRPLSEKMKPGETLECAVHRAVKEELGSIIQGNGNVR 215

Query: 202 IVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSA---A 258
           IVPGSY +K++ER S SYPGLPA YVLHSV+A+VDGLP +GEFCTEE EEY D +    A
Sbjct: 216 IVPGSYEQKVEERVSASYPGLPACYVLHSVNAWVDGLP-DGEFCTEE-EEYRDWNGMGIA 273

Query: 259 DKALFVKRHYWKW 271
           + A+ VKRHYWKW
Sbjct: 274 EMAVSVKRHYWKW 286


>gi|224134953|ref|XP_002321946.1| predicted protein [Populus trichocarpa]
 gi|224134957|ref|XP_002321947.1| predicted protein [Populus trichocarpa]
 gi|222868942|gb|EEF06073.1| predicted protein [Populus trichocarpa]
 gi|222868943|gb|EEF06074.1| predicted protein [Populus trichocarpa]
          Length = 216

 Score =  300 bits (767), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 145/210 (69%), Positives = 175/210 (83%), Gaps = 7/210 (3%)

Query: 67  MSRTNHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLAD 126
           MS +    + H F+TPQSLSDWLKPRLPSDSFASWG+KPGTKN+HNLWLE+S GET LAD
Sbjct: 1   MSVSRKTKNSHHFSTPQSLSDWLKPRLPSDSFASWGIKPGTKNIHNLWLEISQGETFLAD 60

Query: 127 AIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAV 186
           + PP+RTVNVVTV+II K NQ L+ESHQELSDGSVRNRCRPL+EKMK NE+ ++A+ RA+
Sbjct: 61  STPPIRTVNVVTVKIINK-NQTLIESHQELSDGSVRNRCRPLSEKMKPNESFKDAIFRAI 119

Query: 187 KEELGSILNDS--ITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEF 244
            EELGSIL D   +++ IV GSY +K++ERNS+SYPGLPARYVL+S D  V+GLP +GEF
Sbjct: 120 NEELGSILKDGNEVSINIVNGSYKEKVEERNSMSYPGLPARYVLYSADVEVNGLP-DGEF 178

Query: 245 CTEEAEEYADSS---AADKALFVKRHYWKW 271
           CTEEAEEY DS     A+KA+ VK+H+WKW
Sbjct: 179 CTEEAEEYPDSEEKRVAEKAVSVKKHFWKW 208


>gi|297808463|ref|XP_002872115.1| hypothetical protein ARALYDRAFT_489310 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317952|gb|EFH48374.1| hypothetical protein ARALYDRAFT_489310 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 295

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 165/264 (62%), Positives = 200/264 (75%), Gaps = 22/264 (8%)

Query: 22  PASILAAFSVLCLFTCSKPYNTTSTLKIP--FLTFPSNPRRFLKIPIMSRTNHQLHHHSF 79
           P   LAA S+L L++  KP      L +P    +FP NPRR  +I  MSR +       F
Sbjct: 32  PDLFLAAVSLLFLWSSPKPL-----LSLPPNRFSFPLNPRR--RITAMSRRSPPPPSQRF 84

Query: 80  ATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTV 139
           A PQSLSDWL+ RLPSDSFA+WGVKPGTKNVHNLWLELSDGET LAD+ PPVRTVNVVTV
Sbjct: 85  ANPQSLSDWLESRLPSDSFAAWGVKPGTKNVHNLWLELSDGETSLADSTPPVRTVNVVTV 144

Query: 140 RIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN-DSI 198
           R+IGK+ +ILVE+HQELSDGS+R+R RPL+EKMK  E+P+EAV RA+KEELGSI N D +
Sbjct: 145 RVIGKNGRILVEAHQELSDGSIRDRFRPLSEKMKPEESPDEAVFRAIKEELGSIFNGDDV 204

Query: 199 T--VRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEE-------- 248
              +RI+PG+YS++++ERNSLSYPGLPARY LHSVDA V+GLPEE +FCT+E        
Sbjct: 205 GQRIRILPGTYSRRVEERNSLSYPGLPARYALHSVDATVEGLPEE-DFCTDEKEYDGGDS 263

Query: 249 -AEEYADSSAADKALFVKRHYWKW 271
             E+  ++ AA KA+ VKRHYWKW
Sbjct: 264 TTEDSVETRAAGKAVTVKRHYWKW 287


>gi|357464217|ref|XP_003602390.1| hypothetical protein MTR_3g092810 [Medicago truncatula]
 gi|355491438|gb|AES72641.1| hypothetical protein MTR_3g092810 [Medicago truncatula]
          Length = 293

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 163/286 (56%), Positives = 196/286 (68%), Gaps = 21/286 (7%)

Query: 9   PIQSSGSIKIINPPASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMS 68
           P+Q   ++ + + P  +L A SV  LFT S+ +NT +T   PFL+FP  PRRFLKIP MS
Sbjct: 4   PLQPLTNLNLPSLPDLLLTALSVCFLFTSSRSHNTPTTTNCPFLSFPLKPRRFLKIPAMS 63

Query: 69  RTNHQLHHHS-------------FATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWL 115
                    S             F +PQSLS+WLKPRLPSDSFA+WGVKPGTKNVHNLWL
Sbjct: 64  NNISTSSLSSSSSSNPNRNNRRNFVSPQSLSEWLKPRLPSDSFATWGVKPGTKNVHNLWL 123

Query: 116 ELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSN 175
           ELS GET L D+ PPVRTV VVTVR++GKD +ILVESHQELSDG VR R RPL+EKMK N
Sbjct: 124 ELSQGETSLDDSEPPVRTVQVVTVRVVGKDGKILVESHQELSDGKVRERGRPLSEKMKPN 183

Query: 176 ETPEEAVIRAVKEELGSILNDSI----TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSV 231
           E PE A +R +KEELGS++N        V I P SY  +++ERNS SYPGLP  YVLH +
Sbjct: 184 EDPESAAVRGIKEELGSVINGETEICDIVMIDPNSYEMRVEERNSGSYPGLPGCYVLHIL 243

Query: 232 DAFVDGLPEEGEFCTEEAEEYADSS---AADKALFVKRHYWKWRRA 274
            A V+GLP EG+FCT E +EY DS     A +A+ VK+H+WKW  A
Sbjct: 244 SATVEGLP-EGDFCTYEVDEYDDSDDKKVAHQAVSVKKHFWKWVSA 288


>gi|356509151|ref|XP_003523315.1| PREDICTED: uncharacterized protein LOC100783319 [Glycine max]
          Length = 292

 Score =  280 bits (715), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 155/282 (54%), Positives = 184/282 (65%), Gaps = 37/282 (13%)

Query: 22  PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQL------- 74
           P  +L A SV  L + SKP+   ST + PF      PRRFLKIP MS TN          
Sbjct: 14  PDLLLTALSVCFLVSSSKPHIGGST-RCPF------PRRFLKIPAMSLTNTTTSKSSKNN 66

Query: 75  ---------HHHSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLA 125
                      HSFA+PQSLS+WLKPRLPS+SFASWGVKPGTKNVHNLWLELS GET LA
Sbjct: 67  NLNNTLLPPRRHSFASPQSLSEWLKPRLPSESFASWGVKPGTKNVHNLWLELSQGETSLA 126

Query: 126 DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRA 185
           D+ PPVRTV+VV VR+ GK  +ILVESHQELSDG+VR R RPL+EKMK NE PE A +R 
Sbjct: 127 DSTPPVRTVHVVLVRVTGKHGKILVESHQELSDGNVRKRGRPLSEKMKPNEDPESAAVRG 186

Query: 186 VKEELGSILNDSI---------TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVD 236
           + EELGS +              V I P SY  +++ER+S SYPGLP  YVLH++ A V+
Sbjct: 187 IMEELGSAIGGGFRAANFEIDDIVTIDPNSYEMRVEERDSGSYPGLPGCYVLHTLSATVE 246

Query: 237 GLPEEGEFCTEEAEEYA----DSSAADKALFVKRHYWKWRRA 274
           GLP EG+F T E +EY     D   AD+A+ VK+H+W W  A
Sbjct: 247 GLP-EGDFSTYEVDEYGGVFQDKIVADEAVSVKKHHWTWVSA 287


>gi|15238541|ref|NP_197834.1| uncharacterized protein [Arabidopsis thaliana]
 gi|9758534|dbj|BAB08929.1| unnamed protein product [Arabidopsis thaliana]
 gi|17065180|gb|AAL32744.1| Unknown protein [Arabidopsis thaliana]
 gi|20259932|gb|AAM13313.1| unknown protein [Arabidopsis thaliana]
 gi|332005930|gb|AED93313.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 300

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 164/265 (61%), Positives = 199/265 (75%), Gaps = 20/265 (7%)

Query: 22  PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQLHHHS--F 79
           P   LAA S+L L++  KP     +L     +FP NPRR   I  MSR +      +  F
Sbjct: 33  PDIFLAAVSLLFLWSSPKPL---VSLPPNRFSFPLNPRR-RSITAMSRRSPPPPPPTQRF 88

Query: 80  ATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTV 139
           A PQSLSDWL+ RLPSDSFA+WGVKPGTKNVHNLWLELSDGET LAD+ PPVRTVNVVTV
Sbjct: 89  ANPQSLSDWLESRLPSDSFAAWGVKPGTKNVHNLWLELSDGETSLADSTPPVRTVNVVTV 148

Query: 140 RIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN---D 196
           R+IGK+ +ILVE+HQELSDGS+R R RPL+EKMK  E+P+EAV RA+KEELGSI N   D
Sbjct: 149 RVIGKNGRILVEAHQELSDGSIRERFRPLSEKMKPEESPDEAVFRAIKEELGSIFNGDGD 208

Query: 197 SIT--VRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAE-EYA 253
           +I   ++I+PG+Y+++++ERNSLSYPGLPARY LHSV+A V+GLPE+ +FCTEE E E  
Sbjct: 209 NIGQRIKILPGTYNRRVEERNSLSYPGLPARYALHSVNATVEGLPEQ-DFCTEEKEYEGG 267

Query: 254 DSS-------AADKALFVKRHYWKW 271
           DS+       AA  A+ VKRHYWKW
Sbjct: 268 DSTKDSVETLAAGNAVTVKRHYWKW 292


>gi|449456435|ref|XP_004145955.1| PREDICTED: uncharacterized protein LOC101208896 [Cucumis sativus]
          Length = 294

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 140/230 (60%), Positives = 178/230 (77%), Gaps = 9/230 (3%)

Query: 50  PFLTFPSNPRRFLKIPIMSRT--NHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGT 107
           P  +   NPRRF KIP +S    N      +F +PQSLS+WL+PRLPS SFASWGV PGT
Sbjct: 58  PAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQPRLPSHSFASWGVIPGT 117

Query: 108 KNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRP 167
           KN+HNLWLE+S GET LAD+ PP+RT++V+++RII   +++L+ESHQ+LSDG++RNR RP
Sbjct: 118 KNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRP 177

Query: 168 LAEKMKSNETPEEAVIRAVKEELGSILND---SITVRIVPGSYSKKLDERNSLSYPGLPA 224
           L+EKMK NETPE AV RAV+EELGSIL D   S  VRIVP SY  K++ER+S+SYPGL A
Sbjct: 178 LSEKMKPNETPESAVYRAVQEELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSA 237

Query: 225 RYVLHSVDAFVDGLPEEGEFCTEEAEEYA---DSSAADKALFVKRHYWKW 271
            YVLHS+D +V+GLP +G+FCT E EEY    D++ AD A+ VK+H+WKW
Sbjct: 238 SYVLHSMDVWVEGLP-DGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW 286


>gi|449497388|ref|XP_004160388.1| PREDICTED: uncharacterized protein LOC101231604 [Cucumis sativus]
          Length = 294

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 140/230 (60%), Positives = 177/230 (76%), Gaps = 9/230 (3%)

Query: 50  PFLTFPSNPRRFLKIPIMSRT--NHQLHHHSFATPQSLSDWLKPRLPSDSFASWGVKPGT 107
           P  +   NPRRF KIP +S    N      +F +PQSLS+WL+PRLPS SFASWGV PGT
Sbjct: 58  PAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQPRLPSHSFASWGVIPGT 117

Query: 108 KNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRP 167
           KN+HNLWLE+S GET LAD+ PP+RT++V+++RII   +++L+ESHQ+LSDG++RNR RP
Sbjct: 118 KNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRP 177

Query: 168 LAEKMKSNETPEEAVIRAVKEELGSILND---SITVRIVPGSYSKKLDERNSLSYPGLPA 224
           L+EKMK NETPE AV RAV+EELGSIL D   S  VRIVP SY  K++ER+S+SYPGL A
Sbjct: 178 LSEKMKPNETPESAVYRAVQEELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSA 237

Query: 225 RYVLHSVDAFVDGLPEEGEFCTEEAEEYA---DSSAADKALFVKRHYWKW 271
            YVLHS+D  V+GLP +G+FCT E EEY    D++ AD A+ VK+H+WKW
Sbjct: 238 SYVLHSMDVRVEGLP-DGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW 286


>gi|356516158|ref|XP_003526763.1| PREDICTED: uncharacterized protein LOC100804392 [Glycine max]
          Length = 291

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 155/281 (55%), Positives = 183/281 (65%), Gaps = 36/281 (12%)

Query: 22  PASILAAFSVLCLFTCSKPYNTTSTLKIPFLTFPSNPRRFLKIPIMSRTNHQL------- 74
           P  +L A SV  L + SKP+   ST + PF      PRRFLKIP MS T           
Sbjct: 14  PDLLLTALSVCFLVSSSKPHIGGST-RCPF------PRRFLKIPAMSLTTITTTSTSSKN 66

Query: 75  ----------HHHSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCL 124
                       HSFA+PQSLS+WLKPRLPS+SFASWGVKPGTKNVHNLWLELS GET L
Sbjct: 67  NNNLNSHLPPRRHSFASPQSLSEWLKPRLPSESFASWGVKPGTKNVHNLWLELSQGETSL 126

Query: 125 ADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIR 184
           AD+ PPVRTV+VV VR+ GK  +ILVESHQ+LSDGSVR R RPL+EKMK NE PE A +R
Sbjct: 127 ADSTPPVRTVHVVLVRVTGKHGRILVESHQDLSDGSVRKRGRPLSEKMKPNEDPESAAVR 186

Query: 185 AVKEELGSILNDSI-------TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDG 237
            + EELGS +            V I P SY  +++ER+S SYPGLP  YVLH++ A V+ 
Sbjct: 187 GIMEELGSAIGGGAKNFEIDDIVTIDPNSYEMRVEERDSGSYPGLPGCYVLHTLSATVEC 246

Query: 238 LPEEGEFCTEEAEEYA----DSSAADKALFVKRHYWKWRRA 274
           LP EG+FCT E +EY     D   AD+A+ VK+HYW W  A
Sbjct: 247 LP-EGDFCTYEVDEYGGVFEDKIVADEAVSVKKHYWTWVSA 286


>gi|294463008|gb|ADE77042.1| unknown [Picea sitchensis]
          Length = 237

 Score =  200 bits (508), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 101/204 (49%), Positives = 135/204 (66%), Gaps = 11/204 (5%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVV 137
           +  + Q L+ WL+ R+ S     WGV+PGTKNV NLW+EL++GE  L D+ PP RTVNV 
Sbjct: 25  AIGSAQGLARWLRTRMISAHLDMWGVEPGTKNVINLWVELAEGEITLEDSCPPKRTVNVA 84

Query: 138 TVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDS 197
           +V I  K  Q+L+ESHQE+ DGS+R R RPL+EKMK+ ET EE   R + EELG +L   
Sbjct: 85  SVNIRNKSGQVLLESHQEMGDGSIRFRNRPLSEKMKAGETVEEGCKRGICEELGLVLGSP 144

Query: 198 ITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEE------ 251
             + ++  SY KK++ER+SLSYPGL   YV+HSVDA V  LPE  +F TEE E+      
Sbjct: 145 DKIELLMDSYEKKVEERDSLSYPGLLTCYVVHSVDAVVADLPET-DFFTEEDEDGRINGN 203

Query: 252 ----YADSSAADKALFVKRHYWKW 271
               ++ S   +KA+ VK+H+WKW
Sbjct: 204 HGFSFSSSGTREKAIGVKKHFWKW 227


>gi|115469912|ref|NP_001058555.1| Os06g0712200 [Oryza sativa Japonica Group]
 gi|53792644|dbj|BAD53658.1| unknown protein [Oryza sativa Japonica Group]
 gi|53792882|dbj|BAD54059.1| unknown protein [Oryza sativa Japonica Group]
 gi|113596595|dbj|BAF20469.1| Os06g0712200 [Oryza sativa Japonica Group]
 gi|125598474|gb|EAZ38254.1| hypothetical protein OsJ_22631 [Oryza sativa Japonica Group]
 gi|215692431|dbj|BAG87851.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 241

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 104/213 (48%), Positives = 136/213 (63%), Gaps = 24/213 (11%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAI--------- 128
           +F T  SL+ +L  RLP+ + ASWG  PGTK + NL+LELS GE  L  A          
Sbjct: 11  AFPTLDSLAAFLGSRLPASALASWGTAPGTKTLLNLFLELSQGECVLISAAAAAAPPSQQ 70

Query: 129 ----PPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNR--CRPLAEKMKSNETPEEAV 182
               P VR V+V +VRI      +L+E+ Q LSDG++R+R   RPL+EKM+  ETPE A 
Sbjct: 71  QQQHPVVRAVHVASVRIRNGRGALLMETGQLLSDGTLRSRGGLRPLSEKMRPGETPEAAA 130

Query: 183 IRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEE 241
           +RAV+EELG    + + VRI+ G    +++ER+S SYPGL ARYVLH+VDA  V+G+PE+
Sbjct: 131 VRAVREELG----ERVRVRIL-GGEEARVEERDSASYPGLHARYVLHAVDAEVVEGVPED 185

Query: 242 GEFCTEEAEEYADSSAAD---KALFVKRHYWKW 271
           GEF TEE  E+ D    D    A+ VKRHYWKW
Sbjct: 186 GEFDTEEGGEHEDEVVVDGAAAAITVKRHYWKW 218


>gi|125556726|gb|EAZ02332.1| hypothetical protein OsI_24435 [Oryza sativa Indica Group]
          Length = 244

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/216 (48%), Positives = 136/216 (62%), Gaps = 27/216 (12%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAI--------- 128
           +F T  SL+ +L  RLP+ + ASWG  PGTK + NL+LELS GE  L  A          
Sbjct: 11  AFPTLDSLAAFLGSRLPASALASWGTAPGTKTLLNLFLELSQGECVLISAAAAAAPPSQQ 70

Query: 129 -------PPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNR--CRPLAEKMKSNETPE 179
                  P VR V+V +VRI      +L+E+ Q LSDG++R+R   RPL+EKM+  ETPE
Sbjct: 71  QQQQQQHPVVRAVHVASVRIRNGRGALLMETGQLLSDGTLRSRGGLRPLSEKMRPGETPE 130

Query: 180 EAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGL 238
            A +RAV+EELG    + + VRI+ G    +++ER+S SYPGL ARYVLH+VDA  V+G+
Sbjct: 131 AAAVRAVREELG----ERVRVRIL-GGEEARVEERDSASYPGLHARYVLHAVDAEVVEGV 185

Query: 239 PEEGEFCTEEAEEYADSSAAD---KALFVKRHYWKW 271
           PE+GEF TEE  E+ D    D    A+ VKRHYWKW
Sbjct: 186 PEDGEFDTEEGGEHEDEVVVDGAAAAITVKRHYWKW 221


>gi|226507796|ref|NP_001142985.1| uncharacterized protein LOC100275445 [Zea mays]
 gi|195612554|gb|ACG28107.1| hypothetical protein [Zea mays]
          Length = 225

 Score =  167 bits (424), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 107/198 (54%), Positives = 136/198 (68%), Gaps = 14/198 (7%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP--VRTVN 135
           +F T  SL+ +L+PRLP  + +SWG  PGTK + NL+LEL+ G+ C  DA  P  VR V+
Sbjct: 16  AFPTLDSLAAFLRPRLPPGALSSWGAVPGTKTLANLFLELAVGD-CALDAASPRLVRAVH 74

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEELGSIL 194
           V TVRI  +    LVE+ Q LSDG+VR R  RPL+EKM+  E+PE A  RAV+EELG+  
Sbjct: 75  VATVRIRNRRGARLVETRQLLSDGTVRRRGPRPLSEKMRPGESPEAAAARAVREELGA-- 132

Query: 195 NDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTEEAEEYA 253
                VRI+ G+   +++ER S SYPGLPARYVLH+VDA  VDG+PE+GEF TEEA E  
Sbjct: 133 --RARVRILGGA-GARVEERESASYPGLPARYVLHAVDAEVVDGVPEDGEFETEEAGE-- 187

Query: 254 DSSAADKALFVKRHYWKW 271
           D +A   A+ VKRHYW W
Sbjct: 188 DDAAG--AITVKRHYWDW 203


>gi|326501076|dbj|BAJ98769.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326510433|dbj|BAJ87433.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326526951|dbj|BAK00864.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 226

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 106/201 (52%), Positives = 135/201 (67%), Gaps = 11/201 (5%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCL--ADAIPP---VR 132
           +F T  SL+ +L+PRLP  + ASWG  PGTKN+ NL+LELS G+  L  A + PP   VR
Sbjct: 11  AFPTLDSLAAYLRPRLPGPALASWGSAPGTKNLLNLFLELSCGDCTLLPAASSPPALVVR 70

Query: 133 TVNVVTVRIIGKDNQILVESHQELSDGSVRNR-CRPLAEKMKSNETPEEAVIRAVKEELG 191
            V+V TVRI  +   +LVE+ Q LSDG++R R  RPL+EKM+  ETPE A  RAV EELG
Sbjct: 71  AVHVATVRIRNRRGALLVETRQLLSDGTLRRRAARPLSEKMRPGETPEAAAARAVCEELG 130

Query: 192 SILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTEEAE 250
               +   VRI+  +   +++E+ S+SYPGLPARYVLH+VDA  V+G+PEEGEF TEEA 
Sbjct: 131 ----ERARVRILGAAPEPRVEEQESVSYPGLPARYVLHAVDAELVEGVPEEGEFETEEAG 186

Query: 251 EYADSSAADKALFVKRHYWKW 271
           E         A+ VKRHYW W
Sbjct: 187 EGEGHDGGGAAITVKRHYWSW 207


>gi|168020019|ref|XP_001762541.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686274|gb|EDQ72664.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 184

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/177 (48%), Positives = 113/177 (63%), Gaps = 3/177 (1%)

Query: 97  SFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKDNQILVESHQEL 156
           S +SWGV  GTK + NLW EL DGE  L D+ PP RTV+V  V+I  +    L+ES QE+
Sbjct: 1   SVSSWGVDSGTKRIGNLWTELVDGEISLEDSKPPKRTVHVAVVKIRNEAGMFLIESSQEM 60

Query: 157 SDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNS 216
            DG +++R RPL+EKM+  E  E+A  R V EELG  +     V ++  +Y ++ +ER+S
Sbjct: 61  DDGRMKSRNRPLSEKMRPGENVEDACRRGVFEELGPEMGARDRVEMILETYQREEEERDS 120

Query: 217 LSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSAADK--ALFVKRHYWKW 271
            SYPGL  RYVLHS+ A V GLP   EFCTEE E   +  A+    A+ VK+HYWKW
Sbjct: 121 FSYPGLLTRYVLHSMVASVRGLP-SSEFCTEENEGKRNGHASGIRCAIGVKKHYWKW 176


>gi|413943158|gb|AFW75807.1| hypothetical protein ZEAMMB73_771450 [Zea mays]
          Length = 252

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 105/195 (53%), Positives = 133/195 (68%), Gaps = 14/195 (7%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP--VRTVN 135
           +F T  SL+ +L+PRLP  + +SWG  PGTK + NL+LEL+ G+ C  DA  P  VR V+
Sbjct: 43  AFPTLDSLAAFLRPRLPPGALSSWGAVPGTKTLANLFLELAVGD-CALDAASPRVVRAVH 101

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEELGSIL 194
           V TVRI  +    LVE+ Q LSDG+VR R  RPL+EKM+  E+PE A  RAV+EELG+  
Sbjct: 102 VATVRIRNRRGARLVETRQLLSDGTVRRRGPRPLSEKMRPGESPEAAAARAVREELGA-- 159

Query: 195 NDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTEEAEEYA 253
                VRI+ G    +++ER S SYPGLPARYVLH+VDA  VDG+PE+GEF TEEA E  
Sbjct: 160 --RARVRIL-GGAGARVEERESASYPGLPARYVLHAVDAEVVDGVPEDGEFETEEAGE-- 214

Query: 254 DSSAADKALFVKRHY 268
           D +A   A+ VKRHY
Sbjct: 215 DDAAG--AITVKRHY 227


>gi|357117587|ref|XP_003560546.1| PREDICTED: uncharacterized protein LOC100835592 [Brachypodium
           distachyon]
          Length = 233

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 136/204 (66%), Gaps = 14/204 (6%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP------- 130
           +F T  SL+ +LKPRLP  + ASWG  PGTKN+ NL+LELS G+  L  A          
Sbjct: 11  AFPTLDSLAAYLKPRLPGPALASWGTAPGTKNLLNLFLELSCGDCTLLPAAASSPPAALV 70

Query: 131 VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEE 189
           VR V+V TVRI  +   +LVE+ Q LSDG++R R  RPL+EKM+  ETPE A  RAV+EE
Sbjct: 71  VRAVHVATVRIRNRRGALLVETRQLLSDGTLRRRGPRPLSEKMRPGETPEAAAARAVREE 130

Query: 190 LGSILNDSITVRIV-PGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCTE 247
           LG +    + VRI+ P +  ++++ER+S SYPGLPARYVLH+VDA  V+G+PE+GEF TE
Sbjct: 131 LGEL----VRVRILGPEAEPRRVEERDSASYPGLPARYVLHAVDAEVVEGVPEDGEFDTE 186

Query: 248 EAEEYADSSAADKALFVKRHYWKW 271
           EA E         A+ VKRHYW W
Sbjct: 187 EAGEGEGHDGGGVAITVKRHYWAW 210


>gi|302822847|ref|XP_002993079.1| hypothetical protein SELMODRAFT_136514 [Selaginella moellendorffii]
 gi|300139079|gb|EFJ05827.1| hypothetical protein SELMODRAFT_136514 [Selaginella moellendorffii]
          Length = 237

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 85/207 (41%), Positives = 121/207 (58%), Gaps = 27/207 (13%)

Query: 88  WLKPRLPSDS--FASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKD 145
           WLK RLP        WG  PGTK V NLW EL +GE  L D+ PP RTV+V +V+I+ + 
Sbjct: 30  WLKERLPQGGALLDLWGTAPGTKRVANLWRELLEGEISLEDSRPPKRTVHVASVQIVNES 89

Query: 146 NQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPG 205
            ++LVE++QE++DG +R R RPL+EKM+  E+ EEA +R + EELG  ++    V ++  
Sbjct: 90  GEMLVEAYQEMADGRIRPRNRPLSEKMRPGESVEEACLRGISEELGCAIDQ---VALLRE 146

Query: 206 SYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYAD----------- 254
           SY +  +ER S SYPGL  RYV+H++ A V  LP+  +F TEE E+              
Sbjct: 147 SYQRVEEERESFSYPGLSTRYVIHTITAHVKQLPQT-DFDTEEDEDGNGGGGGGGGAAVL 205

Query: 255 ----------SSAADKALFVKRHYWKW 271
                     +S    A+ V++H+WKW
Sbjct: 206 VAASGRTATATSCLGGAVGVRKHFWKW 232


>gi|302787050|ref|XP_002975295.1| hypothetical protein SELMODRAFT_103171 [Selaginella moellendorffii]
 gi|300156869|gb|EFJ23496.1| hypothetical protein SELMODRAFT_103171 [Selaginella moellendorffii]
          Length = 240

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/210 (40%), Positives = 121/210 (57%), Gaps = 30/210 (14%)

Query: 88  WLKPRLPSDS--FASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRIIGKD 145
           WLK RLP        WG  PGTK V NLW EL +GE  L D+ PP RTV+V +V+I+ + 
Sbjct: 30  WLKERLPQGGAQLDLWGTAPGTKRVANLWRELLEGEISLEDSRPPKRTVHVASVQIVNES 89

Query: 146 NQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPG 205
            ++LVE++QE++DG +R R RPL+EKM+  E+ EEA +R + EELG  ++    V ++  
Sbjct: 90  GEMLVEAYQEMADGRIRPRNRPLSEKMRPGESVEEACLRGISEELGCAIDQ---VALLRE 146

Query: 206 SYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYAD----------- 254
           SY +  +ER S SYPGL  RYV+H++ A V  LP+  +F TEE E+              
Sbjct: 147 SYQRVEEERESFSYPGLSTRYVIHTITAHVKQLPQT-DFDTEEDEDGNGGGGGSGGGGGA 205

Query: 255 -------------SSAADKALFVKRHYWKW 271
                        +S    A+ V++H+WKW
Sbjct: 206 AVLVVASGRTATATSCLGGAVGVRKHFWKW 235


>gi|242094184|ref|XP_002437582.1| hypothetical protein SORBIDRAFT_10g029760 [Sorghum bicolor]
 gi|241915805|gb|EER88949.1| hypothetical protein SORBIDRAFT_10g029760 [Sorghum bicolor]
          Length = 260

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 97/202 (48%), Positives = 121/202 (59%), Gaps = 19/202 (9%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPP------- 130
           +F T  SL+ +L+PRLP  + ASWG  PGTK + NL+LEL+ G+  L             
Sbjct: 42  AFPTLDSLAAFLRPRLPPGALASWGAVPGTKTLSNLFLELAVGDCALHAPADADPPPAAP 101

Query: 131 --VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVK 187
             VR V+V TVRI       LVE+ Q LSDG+VR R  RPL+EKM+  E+PE A  RAV+
Sbjct: 102 RVVRAVHVATVRIRNSRGARLVETRQLLSDGTVRRRGPRPLSEKMRPGESPEAAAARAVR 161

Query: 188 EELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA-FVDGLPEEGEFCT 246
           EELG+       VRI+ G    +++ER S SYPGLPARYVLH+VDA  VDGLPE+     
Sbjct: 162 EELGA----RARVRIL-GGDGARVEERESASYPGLPARYVLHAVDAEVVDGLPED-GEFE 215

Query: 247 EEAEEYADSSAADKALFVKRHY 268
            E     D  A   A+ VKRHY
Sbjct: 216 TEETGEDDGDAG--AITVKRHY 235


>gi|361066237|gb|AEW07430.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
 gi|383166917|gb|AFG66456.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
 gi|383166918|gb|AFG66457.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
 gi|383166919|gb|AFG66458.1| Pinus taeda anonymous locus 0_1427_02 genomic sequence
          Length = 130

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 50/94 (53%), Positives = 66/94 (70%)

Query: 78  SFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVV 137
           +  + Q L+ WL  R+ S     WG++PGTKNV NLW+EL++GE  L D+ PP RTVNVV
Sbjct: 37  AIGSAQGLARWLNTRMISTRLDMWGIEPGTKNVTNLWVELAEGEITLEDSCPPKRTVNVV 96

Query: 138 TVRIIGKDNQILVESHQELSDGSVRNRCRPLAEK 171
           +V I  K  Q+L+ESHQE+ DGS R R RPL++K
Sbjct: 97  SVNIRNKSGQVLLESHQEMGDGSNRVRNRPLSQK 130


>gi|145341152|ref|XP_001415678.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575901|gb|ABO93970.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 224

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 73/195 (37%), Positives = 97/195 (49%), Gaps = 25/195 (12%)

Query: 77  HSFATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLA----------- 125
           H  AT  +L  WL  R    +   +    GTKN+ +L +EL  GE+ L            
Sbjct: 6   HEGATTANLGAWLARR--GVAVEKFDRARGTKNLEDLLIELKSGESVLVSERERASGENG 63

Query: 126 -----DAIPP-VRTVNVVTVRII-----GKDNQILVESHQELSDGSV-RNRCRPLAEKMK 173
                DA    VR V+V+T+R+       +D   L+E  Q      + R R RPL+EKM 
Sbjct: 64  GEMNEDARETCVRYVDVLTLRVRRPGSNAEDGMCLIEKEQIFGKNELKRRRNRPLSEKMN 123

Query: 174 SNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDA 233
             E   + V RAV+EELGS L D   V  +  +Y   + E  S SYPGL +R+ LH VDA
Sbjct: 124 FGEHWRDCVERAVREELGSALGDDYVVETLEDTYKLCVSEEMSASYPGLRSRFALHRVDA 183

Query: 234 FVDGLPEEGEFCTEE 248
            V GLP+E EF +EE
Sbjct: 184 IVHGLPDEDEFQSEE 198


>gi|255089965|ref|XP_002506904.1| predicted protein [Micromonas sp. RCC299]
 gi|226522177|gb|ACO68162.1| predicted protein [Micromonas sp. RCC299]
          Length = 234

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 104/199 (52%), Gaps = 13/199 (6%)

Query: 84  SLSDWLKPR-LPSDSFASWGVKPGTKNVHNLWLELSDGETCLA---DAIPPVRTVNVVTV 139
           SLS WL    + +D+F     + G K++ +L +E+  GET L        P R V V+ +
Sbjct: 32  SLSAWLTANGVDADAFG----EDGAKSLVDLAVEIERGETTLVLDPQTSAPRRDVRVLRL 87

Query: 140 RIIGKD---NQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN- 195
            +  +D     +L+E+ QE   G VR R  PL+EK+  NE    A  RAV EELGS ++ 
Sbjct: 88  LVTDEDARPGMVLIEARQEWESGRVRTRRTPLSEKLLGNEDWRVACARAVNEELGSAIDA 147

Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGE-FCTEEAEEYAD 254
            + ++R+   S  + + +R+SLSYPGL +RYV+++V   V GLPE  E        E   
Sbjct: 148 RTFSMRVDESSVKETMMDRDSLSYPGLRSRYVMYTVACDVKGLPEPTEQVFGSSTGEMGF 207

Query: 255 SSAADKALFVKRHYWKWRR 273
           +S  D      +  W WRR
Sbjct: 208 TSVEDTPAGWLKATWTWRR 226


>gi|307107732|gb|EFN55974.1| hypothetical protein CHLNCDRAFT_57724 [Chlorella variabilis]
          Length = 201

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 101/195 (51%), Gaps = 16/195 (8%)

Query: 80  ATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCL-ADAIPPVRTVNVVT 138
           A  Q L+ WL       S   +G  P  K++  LW E+  GET L  ++  P+R V+V+ 
Sbjct: 21  ADAQELAQWLAR--GGISLDRYGKGP-AKSLDQLWEEVEAGETQLFLESGRPLREVSVLN 77

Query: 139 VRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSI 198
           V +     Q L E+ Q L +GS R R  PL+EK+   E  +EA +R V EELG +L    
Sbjct: 78  VLLKNSRGQTLYEAEQVLPNGSKRTRGLPLSEKLLPGERWQEAAVRGVLEELGPVLTADP 137

Query: 199 TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFV-DGLPEEGEFCTEEAEEYADSSA 257
            V +  GS  + ++ + S SYPGL ++YV   V A V  GLPE+ +F T+  EE +D   
Sbjct: 138 QVEVDEGSMVESVEMKESQSYPGLLSKYVCKRVSARVLSGLPED-DFSTQ--EERSDG-- 192

Query: 258 ADKALFVKRHYWKWR 272
                 +  H W WR
Sbjct: 193 ------LLEHCWVWR 201


>gi|159463302|ref|XP_001689881.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283869|gb|EDP09619.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 203

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/142 (41%), Positives = 82/142 (57%), Gaps = 2/142 (1%)

Query: 108 KNVHNLWLELSDGETCLA-DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCR 166
           K VH L+ E+S  E+ L  +    +R VNV+++ I+    QIL E  Q L DG  R R  
Sbjct: 42  KTVHELFDEVSKQESVLELEGGKALRIVNVLSLHILNSRGQILFEDEQVLPDGRSRRRNV 101

Query: 167 PLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARY 226
           P++EKM +NE   EA+ RAVKEEL S L    TV ++   + +  +  +S+SYPGL  +Y
Sbjct: 102 PVSEKMVANEPWHEALDRAVKEELSSALPLDYTVTLLEEPFMRT-EYSSSMSYPGLLTKY 160

Query: 227 VLHSVDAFVDGLPEEGEFCTEE 248
           + H V A V G+PE     TEE
Sbjct: 161 IFHRVKARVSGIPENPFSTTEE 182


>gi|302854431|ref|XP_002958723.1| hypothetical protein VOLCADRAFT_100035 [Volvox carteri f.
           nagariensis]
 gi|300255898|gb|EFJ40179.1| hypothetical protein VOLCADRAFT_100035 [Volvox carteri f.
           nagariensis]
          Length = 210

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/151 (38%), Positives = 87/151 (57%), Gaps = 2/151 (1%)

Query: 108 KNVHNLWLELSDGETCLA-DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCR 166
           K VH L+ E++  E+ L  +    +R VNV+++ I+    QIL E  Q L DG  R R  
Sbjct: 60  KTVHELFDEVTKQESVLELEGGKALRIVNVLSLHILNSRGQILFEDEQVLPDGRSRRRNV 119

Query: 167 PLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPARY 226
           P++EKM  +E+   A+ RAV+EEL S L +   V ++  SY  + +  +S+SYPGL  +Y
Sbjct: 120 PVSEKMVGDESWLAALHRAVREELSSALPEDYKVTLLDDSYFLRTEYSSSMSYPGLLTKY 179

Query: 227 VLHSVDAFVDGLPEEGEFCTEEAEEYADSSA 257
           + H V A V G+PE   F T E  E  +S++
Sbjct: 180 IFHRVRARVSGIPES-PFSTVEPNEGGESAS 209


>gi|224112030|ref|XP_002316059.1| predicted protein [Populus trichocarpa]
 gi|222865099|gb|EEF02230.1| predicted protein [Populus trichocarpa]
          Length = 117

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 56/138 (40%), Positives = 79/138 (57%), Gaps = 33/138 (23%)

Query: 137 VTVRIIGK--DNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSIL 194
           + VRI+      +IL+E+ QELSD S R + R   EKMK+ E+      R+++E+     
Sbjct: 1   MVVRILSSCSTKKILLEAQQELSDDSFRPQGREPGEKMKAGES------RSLREK----- 49

Query: 195 NDSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYAD 254
                VR   GS           SYPGLP+RY LH V+A+VDGLP+E +FCT+E +EY D
Sbjct: 50  -----VRGKSGS----------TSYPGLPSRYTLHWVNAYVDGLPDE-DFCTDE-DEYKD 92

Query: 255 SSAAD---KALFVKRHYW 269
             A     +A+ VK+H+W
Sbjct: 93  CGAGGYSSRAVTVKKHFW 110


>gi|428163378|gb|EKX32452.1| hypothetical protein GUITHDRAFT_121382 [Guillardia theta CCMP2712]
          Length = 311

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/193 (35%), Positives = 92/193 (47%), Gaps = 14/193 (7%)

Query: 83  QSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRII 142
           +SL+ WL+ R    S   WG +   K V   + EL  GE C        R ++VV VRI+
Sbjct: 114 ESLTAWLEIRDIDTSL--WG-QGEAKTVQEFFHELWAGE-CEPLTESAERVISVVKVRIL 169

Query: 143 GKDNQ---ILVESHQEL-SDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSI 198
             D      L E+ Q+L  DG  R R RPLAEK +  E+  +A +R VKEELGS +  + 
Sbjct: 170 DHDEAGTWQLYETKQKLCRDGRTRPRNRPLAEKRRPEESSFDAAVRGVKEELGSSIKSTS 229

Query: 199 TVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSAA 258
            + +   S  +  + R S S+PGL   Y L  VD  V+GL       T E EE  + S  
Sbjct: 230 RIVVKEDSLQEWTERRMSKSFPGLETVYKLSQVDVEVEGLSRAATGDTFETEETPELSNG 289

Query: 259 DKALFVKRHYWKW 271
                 K H W W
Sbjct: 290 ------KVHTWTW 296


>gi|412992265|emb|CCO19978.1| PREDICTED: hypothetical protein [Bathycoccus prasinos]
          Length = 248

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 88/188 (46%), Gaps = 39/188 (20%)

Query: 106 GTKNVHNLWLELSDGETCLAD----------AIPPVRTVNVVTVRIIGKD--NQILVESH 153
           G K++ +L  E+ +GET L +           +  +R V+VV V I  K   N+ LVE  
Sbjct: 77  GLKSIDDLLKEIENGETVLTEHETTHDDGTVKLSCIRRVSVVVVEITSKSKPNKKLVEYE 136

Query: 154 QELSDGSVRNRCRPLAEKMKS--NETPEEAVIRAVKEELGSILN-------DSITVRIVP 204
           Q L  G  R R R L+EK+ S   ETP EA  R ++EELG  L+       D  ++R +P
Sbjct: 137 QTLPSGLARKRERFLSEKIMSGKGETPLEAAKRGIREELGDALSPKAVINIDETSLRSLP 196

Query: 205 GSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEE-GEFCTEEAEEYADSSAADKALF 263
            S         S SY  LP RY  + V+  +DGL E+  EF + E           KA+ 
Sbjct: 197 LSSKPSF----SQSYKALPTRYRFYEVNCEIDGLAEDKDEFTSTE-------KCGTKAV- 244

Query: 264 VKRHYWKW 271
                WKW
Sbjct: 245 -----WKW 247


>gi|428175671|gb|EKX44560.1| hypothetical protein GUITHDRAFT_139784 [Guillardia theta CCMP2712]
          Length = 290

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 90/198 (45%), Gaps = 16/198 (8%)

Query: 79  FATPQSLSDWLKPRLPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVT 138
            A+ ++L  WL      D    WG     K+   L  EL  GE+ L       R V V  
Sbjct: 83  LASEEALERWLVEEAGCD-LNKWG-HGKAKSARELLEELRAGESFLRP--EGTRGVKVAK 138

Query: 139 VRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSI 198
           VR+   D + L E+ Q L +G V+ R + LAEK +  ETP  A  R V EELGSI+  + 
Sbjct: 139 VRVFDGDYE-LFETSQILDNGVVKERNKQLAEKFRPGETPLHACGRGVSEELGSIVGSNP 197

Query: 199 TVRIVPGSYSKKLDE-RNSLSYPGLPARYVLHSVDAFVDGLPEEG---EFCTEEAEEYAD 254
            +  +        +E   S SYPGL + Y L+ ++A + GL        F T E+ E +D
Sbjct: 198 AIHFLRNGEVVAWEETETSSSYPGLRSVYELYWMEAEIQGLHRAATGMRFYTRESSE-SD 256

Query: 255 SSAADKALFVKRHYWKWR 272
                     K H W+W+
Sbjct: 257 GHCE------KIHVWEWK 268


>gi|428304217|ref|YP_007141042.1| hypothetical protein Cri9333_0586 [Crinalium epipsammum PCC 9333]
 gi|428245752|gb|AFZ11532.1| hypothetical protein Cri9333_0586 [Crinalium epipsammum PCC 9333]
          Length = 176

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 83/172 (48%), Gaps = 17/172 (9%)

Query: 104 KPGTKNVHNLWLELSDGET-CLADAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVR 162
           K   K+  +L  E+   E+  + D   P+R + V  VR+I  D Q L E  Q  SDG VR
Sbjct: 15  KGQAKDYSSLLQEIEKKESQIIWDNNRPIRLIKVCRVRVICGD-QELYEDRQVFSDGRVR 73

Query: 163 NR-CRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPG 221
            R  R LAEK+  +E  E A  RA+KEELG  L+D  +  +         +E+ S SYPG
Sbjct: 74  KRGIRGLAEKLFKDEKEEIAAKRALKEELG--LSDEASTLVSIHFNGIDNEEKESPSYPG 131

Query: 222 LPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADSSAADKALFVKRHYWKWRR 273
           L  RY+      F+  LP          EE+ +    D    VK+ Y+ WR+
Sbjct: 132 LKTRYLFWD---FIAYLP-----VQYWQEEFVEGCEDD----VKQTYFVWRK 171


>gi|332662886|ref|YP_004445674.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332331700|gb|AEE48801.1| hypothetical protein Halhy_0900 [Haliscomenobacter hydrossis DSM
           1100]
          Length = 190

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 76/153 (49%), Gaps = 17/153 (11%)

Query: 87  DWLKPRLPSDSFAS--WGVKPGTKNVHNLWLELSDGETCLADAIPP--VRTVNVVTVRII 142
           D L+ RL +   ++  WG K   K + +L  E+ DGET L  +     +R V V    I 
Sbjct: 13  DDLRQRLENAGISTTDWG-KGTAKTLLHLVKEMEDGETFLEKSADGQLLRYVVVAAADIY 71

Query: 143 ----GKDNQILVESHQELSDGSVRNR--CRPLAEKMKSNETPEEAVIRAVKEELGSILND 196
                 +   L E  Q   DG  R R   R ++EK+K  E P EA+IR ++EELG  +  
Sbjct: 72  YDAPSGERYRLKEDRQVFKDGRERKRNLGRAVSEKIKLAEIPAEAMIRGIREELG--IAG 129

Query: 197 SITVRIVPGSYSKKLDERNSLSYPGLPARYVLH 229
           +I   +  G+  + +D   S SYPGL +RYV H
Sbjct: 130 AINW-VEEGATEETVD---SPSYPGLTSRYVSH 158


>gi|294892790|ref|XP_002774235.1| hypothetical protein Pmar_PMAR028239 [Perkinsus marinus ATCC 50983]
 gi|239879452|gb|EER06051.1| hypothetical protein Pmar_PMAR028239 [Perkinsus marinus ATCC 50983]
          Length = 2924

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 90/196 (45%), Gaps = 38/196 (19%)

Query: 108 KNVHNLWLELSDGETCLA--DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC 165
           KNV +L+ EL   E C    D     R ++VV VR+  KD  +LVESH+++SDG VR+R 
Sbjct: 127 KNVSDLYFELQH-EICWLERDDRGLRRVLDVVIVRLSYKD-LVLVESHEQMSDGRVRSRN 184

Query: 166 RPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPAR 225
                K + +ET   A+ R + +ELG  LN     +    SYS++ +  +SLSYP +   
Sbjct: 185 LLPGVKKRLSETIGAALERWIHKELG--LNFDCLKQAAAYSYSEQ-EADSSLSYP-IRVL 240

Query: 226 YVLHSV------DAFVD------GLPEEGEFCTEE--AEEYADSS--AADKALFVKR--- 266
           + LH +      DA         GLP+   F T E   E   D S   AD    + R   
Sbjct: 241 FRLHEMSYQLIPDALSKEQLARIGLPKGHSFQTSEDLCEAVNDDSDNNADHGAVISRGGS 300

Query: 267 -----------HYWKW 271
                      H+W W
Sbjct: 301 SSCCGYLKSAVHFWGW 316


>gi|294887459|ref|XP_002772120.1| hypothetical protein Pmar_PMAR017350 [Perkinsus marinus ATCC 50983]
 gi|239876058|gb|EER03936.1| hypothetical protein Pmar_PMAR017350 [Perkinsus marinus ATCC 50983]
          Length = 2503

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 90/196 (45%), Gaps = 38/196 (19%)

Query: 108 KNVHNLWLELSDGETCLA--DAIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC 165
           KNV +L+ EL   E C    D     R ++VV VR+  KD  +LVESH+++SDG VR+R 
Sbjct: 127 KNVSDLYFELQH-EICWLERDDRGLRRVLDVVIVRLSYKD-LVLVESHEQMSDGRVRSRN 184

Query: 166 RPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSKKLDERNSLSYPGLPAR 225
                K + +ET   A+ R + +ELG  LN     +    SYS++ +  +SLSYP +   
Sbjct: 185 LLPGVKKRLSETIGAALERWIHKELG--LNFDCLKQAAAYSYSEQ-EADSSLSYP-IRVL 240

Query: 226 YVLHSV------DAFVD------GLPEEGEFCTEE--AEEYADSS--AADKALFVKR--- 266
           + LH +      DA         GLP+   F T E   E   D S   AD    + R   
Sbjct: 241 FRLHEMSYQLIPDALSKEQLARIGLPKGHSFQTSEDLCEAVNDDSDNNADHGAVISRGGS 300

Query: 267 -----------HYWKW 271
                      H+W W
Sbjct: 301 SSCCGYLKSAVHFWGW 316


>gi|406899724|gb|EKD42919.1| hypothetical protein ACD_72C00533G0001 [uncultured bacterium]
          Length = 183

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 47/88 (53%), Gaps = 11/88 (12%)

Query: 149 LVESHQELSDGSVRNR--CRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGS 206
           L E  Q   DG VR R   R ++EKMK  E   +A++R ++EELG      I+  +    
Sbjct: 75  LKEEKQVFGDGRVRVRDFGRAVSEKMKFGEYSHDAMVRGIQEELG------ISSNVTLHE 128

Query: 207 YSKKLDERNSLSYPGLPARYVLHSVDAF 234
               +D+++S SYPGL + YV   VD F
Sbjct: 129 IETIVDKKDSPSYPGLMSNYV---VDVF 153


>gi|303287598|ref|XP_003063088.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455724|gb|EEH53027.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 250

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 87/212 (41%), Gaps = 59/212 (27%)

Query: 84  SLSDWLKPR-LPSDSFASWGVKPGTKNVHNLWLELSDGETCLADAIPPVRTVNVVTVRII 142
           SL+ WLK   + +D +     K   K + +L LE+  GE+ L      V   NV  +R++
Sbjct: 26  SLATWLKTHGVDADRYG----KNNAKGLADLKLEVDLGESTLEVGPGGVPRRNVRVLRLL 81

Query: 143 GKDN--QILVESHQELSD---GSV---------------------RNRCR---------- 166
            +D+  + LVE+ Q  +D   G V                     R R R          
Sbjct: 82  IRDDAGRTLVEATQRWNDPNTGEVIAERMRGSPLRRVHYLSRRFPRYRVRASLRIGVHNA 141

Query: 167 -----------------PLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYSK 209
                            P +EK+   E    A  RAV EELGS L    T+ +   S  +
Sbjct: 142 NAVVWGPVCSDAFETHHPDSEKLIGAEDWRAAAPRAVAEELGSALAPGYTLAVDEASLVR 201

Query: 210 KLDERNSLSYPGLPARYVLHSVDAFVD-GLPE 240
              +R SLSYP L +RY + SV A ++ GLPE
Sbjct: 202 ADVDRVSLSYPDLSSRYAMFSVAARIENGLPE 233


>gi|260495745|ref|ZP_05815867.1| mutator MutT protein [Fusobacterium sp. 3_1_33]
 gi|260196703|gb|EEW94228.1| mutator MutT protein [Fusobacterium sp. 3_1_33]
          Length = 252

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 41/73 (56%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G IL 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLILI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|294867978|ref|XP_002765323.1| hypothetical protein Pmar_PMAR016118 [Perkinsus marinus ATCC 50983]
 gi|239865336|gb|EEQ98040.1| hypothetical protein Pmar_PMAR016118 [Perkinsus marinus ATCC 50983]
          Length = 1543

 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 56/102 (54%), Gaps = 11/102 (10%)

Query: 180 EAVIRAVKEELGSILNDSITVRIVPGSYSKKLDE--RNSLSYPGLPARYVLHSVDAFVDG 237
           E    AVK+E+   ++D ++VR V     K L E  R+S +Y GL  +YVL++ +  +DG
Sbjct: 670 EVCSEAVKKEIAHSVSD-VSVRKVVMRNRKPLVELERSSPTYEGLATKYVLYTTEVLLDG 728

Query: 238 LPEE-GEFCTEEA-EEYADSSAADKALFV------KRHYWKW 271
           LP    EF T EA     +++++D AL        + H+W+W
Sbjct: 729 LPSGVEEFETTEAPVSGGNANSSDWALENGHKSSPRVHHWEW 770


>gi|237743627|ref|ZP_04574108.1| mutator mutT protein [Fusobacterium sp. 7_1]
 gi|229432658|gb|EEO42870.1| mutator mutT protein [Fusobacterium sp. 7_1]
          Length = 252

 Score = 41.6 bits (96), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 63/143 (44%), Gaps = 20/143 (13%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
           D I   IV  +++   DE         P    L++   F+  + E    C+E      D 
Sbjct: 62  DYIHRGIVIFNFNN--DE---------PLYMYLYTSKNFLGEVQE----CSE-----GDL 101

Query: 256 SAADKALFVKRHYWKWRRAMREL 278
              DK+     + WK  +   +L
Sbjct: 102 KWIDKSKIFNLNLWKGDKIFLDL 124


>gi|294784232|ref|ZP_06749527.1| mutator MutT protein [Fusobacterium sp. 3_1_27]
 gi|294488098|gb|EFG35449.1| mutator MutT protein [Fusobacterium sp. 3_1_27]
          Length = 252

 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHKGIVIFNFN 74


>gi|421527177|ref|ZP_15973781.1| Mutator MutT protein [Fusobacterium nucleatum ChDC F128]
 gi|402256611|gb|EJU07089.1| Mutator MutT protein [Fusobacterium nucleatum ChDC F128]
          Length = 252

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLV 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIYRGIVIFNFN 74


>gi|284434558|gb|ADB85305.1| hypothetical protein [Phyllostachys edulis]
          Length = 175

 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 9/71 (12%)

Query: 158 DGSVRN-RCRPLAEKMKSNETPEEAVIRAVKEELGSILNDSITVRIVPGSYS----KKLD 212
           DG++R    RPL+EKM+  E+PE     A++EE    L++ + VRI+ G+      ++++
Sbjct: 82  DGTLRRWGQRPLSEKMRPGESPEATAACAIREE----LSERVRVRILGGAGGAEAKQRVE 137

Query: 213 ERNSLSYPGLP 223
           E  S SYP  P
Sbjct: 138 EWESASYPASP 148


>gi|209965935|ref|YP_002298850.1| hydrolase [Rhodospirillum centenum SW]
 gi|209959401|gb|ACJ00038.1| hydrolase, NUDIX family, putative [Rhodospirillum centenum SW]
          Length = 151

 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 37/65 (56%), Gaps = 2/65 (3%)

Query: 127 AIPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAV 186
           ++PPVRTV VV   ++  D ++L+   Q     S+         K+   ETPE+A++R +
Sbjct: 16  SLPPVRTVLVVAAALVDVDGRVLLA--QRPPGKSLAGLWEFPGGKVHEGETPEQALVREL 73

Query: 187 KEELG 191
           +EELG
Sbjct: 74  REELG 78


>gi|421144275|ref|ZP_15604191.1| Mutator MutT protein [Fusobacterium nucleatum subsp. fusiforme ATCC
           51190]
 gi|395489376|gb|EJG10215.1| Mutator MutT protein [Fusobacterium nucleatum subsp. fusiforme ATCC
           51190]
          Length = 252

 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|237742919|ref|ZP_04573400.1| mutator mutT protein [Fusobacterium sp. 4_1_13]
 gi|256846891|ref|ZP_05552345.1| mutator mutT protein [Fusobacterium sp. 3_1_36A2]
 gi|229430567|gb|EEO40779.1| mutator mutT protein [Fusobacterium sp. 4_1_13]
 gi|256717689|gb|EEU31248.1| mutator mutT protein [Fusobacterium sp. 3_1_36A2]
          Length = 252

 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|423137651|ref|ZP_17125294.1| hypothetical protein HMPREF9942_01432 [Fusobacterium nucleatum
           subsp. animalis F0419]
 gi|371959445|gb|EHO77131.1| hypothetical protein HMPREF9942_01432 [Fusobacterium nucleatum
           subsp. animalis F0419]
          Length = 252

 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|289765984|ref|ZP_06525362.1| mutator MutT protein [Fusobacterium sp. D11]
 gi|289717539|gb|EFD81551.1| mutator MutT protein [Fusobacterium sp. D11]
          Length = 252

 Score = 41.2 bits (95), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|422315135|ref|ZP_16396574.1| hypothetical protein FPOG_00064 [Fusobacterium periodonticum D10]
 gi|404592845|gb|EKA94571.1| hypothetical protein FPOG_00064 [Fusobacterium periodonticum D10]
          Length = 153

 Score = 41.2 bits (95), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 20/143 (13%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + K+N+ L+    +  +   +N+   +  K++  ETPE+ +IR VKEE G  L 
Sbjct: 2   ITTLCYLEKENKYLMLHRTKKENDINKNKWLGVGGKLEKGETPEQCLIREVKEETGLDLI 61

Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
           D +   IV  +Y+   DE         P    L++   F   + E    C+E      D 
Sbjct: 62  DYVHRGIVIFNYND--DE---------PLDMYLYTSKNFFGEIQE----CSE-----GDL 101

Query: 256 SAADKALFVKRHYWKWRRAMREL 278
              DK+   K + W+  R   EL
Sbjct: 102 KWIDKSEIYKLNLWEGDRIFLEL 124


>gi|336420283|ref|ZP_08600519.1| mutator MutT protein [Fusobacterium sp. 11_3_2]
 gi|336161324|gb|EGN64330.1| mutator MutT protein [Fusobacterium sp. 11_3_2]
          Length = 252

 Score = 41.2 bits (95), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 64/143 (44%), Gaps = 20/143 (13%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G IL 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLILI 61

Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
           + +   IV  +++   DE         P    L++   F+  + E    C+E      D 
Sbjct: 62  NYVHRGIVIFNFND--DE---------PLYMYLYTSKNFLGEVQE----CSE-----GDL 101

Query: 256 SAADKALFVKRHYWKWRRAMREL 278
              DK+     + WK  +   +L
Sbjct: 102 KWIDKSKIFNLNLWKGDKIFLDL 124


>gi|294783695|ref|ZP_06749019.1| mutator MutT protein [Fusobacterium sp. 1_1_41FAA]
 gi|294480573|gb|EFG28350.1| mutator MutT protein [Fusobacterium sp. 1_1_41FAA]
          Length = 153

 Score = 40.8 bits (94), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 64/143 (44%), Gaps = 20/143 (13%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + K+N+ L+    +  +   +N+   +  K++  ETPE+ +IR VKEE G  L 
Sbjct: 2   ITTLCYLEKENKYLMLHRTKKENDINKNKWLGVGGKLEKGETPEQCLIREVKEETGLDLI 61

Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
           D +   IV  +Y++  DE         P    L++   F   + E    C+E      D 
Sbjct: 62  DYVHRGIVIFNYNE--DE---------PLEMYLYTSKNFSGEIQE----CSE-----GDL 101

Query: 256 SAADKALFVKRHYWKWRRAMREL 278
              DK+   K + W+  R   EL
Sbjct: 102 KWIDKSEIYKLNLWEGDRIFLEL 124


>gi|428768996|ref|YP_007160786.1| hypothetical protein Cyan10605_0603 [Cyanobacterium aponinum PCC
           10605]
 gi|428683275|gb|AFZ52742.1| hypothetical protein Cyan10605_0603 [Cyanobacterium aponinum PCC
           10605]
          Length = 193

 Score = 40.4 bits (93), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 5/97 (5%)

Query: 100 SWGVKPGTKNVHNLWLELSDGETCLADAIPP-VRTVNVVTVRII---GKDNQILVESHQE 155
            W  + G K +  L+ E+ +GE+ L       VR + V  + +I   G +   L+E  Q 
Sbjct: 26  QWNHQQGNKTIEELYQEIKEGESQLKLIQGKLVRLMKVSAIEVIFNLGNNYFQLIEDKQI 85

Query: 156 LSDGSVRNRCRP-LAEKMKSNETPEEAVIRAVKEELG 191
              G VR R +  + EK+K  E   +   R ++EE+G
Sbjct: 86  FLTGIVRRREQNFITEKIKGGENSLQGAYRGLEEEIG 122


>gi|389695973|ref|ZP_10183615.1| mutator mutT protein [Microvirga sp. WSM3557]
 gi|388584779|gb|EIM25074.1| mutator mutT protein [Microvirga sp. WSM3557]
          Length = 182

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 55/119 (46%), Gaps = 17/119 (14%)

Query: 128 IPPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVK 187
           +PP++   VV V +I  DN+IL+    E     +         K++  E PEE +IR ++
Sbjct: 47  VPPLKLTVVVAVALIDTDNRILLAQRPE--GKQLAGLWEFPGGKLEPGERPEETLIRELR 104

Query: 188 EELGSILNDSITVR---IVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGE 243
           EELG      ITV+   + P +++    E   L  P     Y+    + FV   P EG+
Sbjct: 105 EELG------ITVKEPCLAPLTFASHAYETFHLLMP----LYICRRWEGFVQ--PLEGQ 151


>gi|422338999|ref|ZP_16419959.1| mutator MutT protein [Fusobacterium nucleatum subsp. polymorphum
           F0401]
 gi|355372126|gb|EHG19469.1| mutator MutT protein [Fusobacterium nucleatum subsp. polymorphum
           F0401]
          Length = 80

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 37/68 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIV 203
           D I   IV
Sbjct: 62  DYIHRGIV 69


>gi|340753337|ref|ZP_08690123.1| mutator mutT protein [Fusobacterium sp. 2_1_31]
 gi|340566963|gb|EEO37977.2| mutator mutT protein [Fusobacterium sp. 2_1_31]
          Length = 153

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 20/143 (13%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + K+N+ L+    +  +   +N+   +  K++  ETPE+ +IR VKEE G  L 
Sbjct: 2   ITTLCYLEKENKYLMLHRTKKENDINKNKWLGVGGKLEKEETPEQCLIREVKEETGLDLI 61

Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
           D +   IV  +Y+   DE         P    L++   F   + E    C+E      D 
Sbjct: 62  DYVHRGIVIFNYND--DE---------PLDMYLYTSKNFSGEIQE----CSE-----GDL 101

Query: 256 SAADKALFVKRHYWKWRRAMREL 278
              DK+   K + W+  R   EL
Sbjct: 102 KWIDKSEIYKLNLWEGDRIFLEL 124


>gi|262066430|ref|ZP_06026042.1| mutator MutT protein [Fusobacterium periodonticum ATCC 33693]
 gi|291379857|gb|EFE87375.1| mutator MutT protein [Fusobacterium periodonticum ATCC 33693]
          Length = 153

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 64/143 (44%), Gaps = 20/143 (13%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + K+++ L+    +  +   +N+   +  K++  ETPE+ +IR VKEE G  L 
Sbjct: 2   ITTLCYLEKESKYLMLHRTKKENDINKNKWLGVGGKLEKGETPEQCLIREVKEETGLDLI 61

Query: 196 DSITVRIVPGSYSKKLDERNSLSYPGLPARYVLHSVDAFVDGLPEEGEFCTEEAEEYADS 255
           D +   IV  +Y++  DE         P    L++   F   + E    C+E      D 
Sbjct: 62  DYVHRGIVIFNYNE--DE---------PLEMYLYTSKNFSGEIQE----CSE-----GDL 101

Query: 256 SAADKALFVKRHYWKWRRAMREL 278
              DK+   K + W+  R   EL
Sbjct: 102 KWIDKSQVYKLNLWEGDRIFLEL 124


>gi|34762713|ref|ZP_00143703.1| Mutator mutT protein (7,8-dihydro-8-oxoguanine-triphosphatase)
           [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
 gi|27887612|gb|EAA24691.1| Mutator mutT protein (7,8-dihydro-8-oxoguanine-triphosphatase)
           [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
          Length = 252

 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ +ETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKSETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|19705096|ref|NP_602591.1| Mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
           25586]
 gi|19713020|gb|AAL93890.1| Mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
           25586]
          Length = 252

 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ +ETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKSETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|296329356|ref|ZP_06871857.1| mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
           23726]
 gi|296153712|gb|EFG94529.1| mutator MutT protein [Fusobacterium nucleatum subsp. nucleatum ATCC
           23726]
          Length = 252

 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ +ETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKRENDINKNKWLGVGGKLEKSETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           D I   IV  +++
Sbjct: 62  DYIHRGIVIFNFN 74


>gi|345297877|ref|YP_004827235.1| mutator MutT protein [Enterobacter asburiae LF7a]
 gi|345091814|gb|AEN63450.1| mutator MutT protein [Enterobacter asburiae LF7a]
          Length = 130

 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 35/61 (57%), Gaps = 2/61 (3%)

Query: 131 VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEEL 190
           ++T+ +    I    NQI +   Q  +D  + N+      K++S ETPE+A+IR ++EE+
Sbjct: 1   MKTLQIAVGIIRNPQNQIFIT--QRAADAHMANKWEFPGGKIESGETPEQALIRELQEEV 58

Query: 191 G 191
           G
Sbjct: 59  G 59


>gi|336399973|ref|ZP_08580765.1| hypothetical protein HMPREF0404_00056 [Fusobacterium sp. 21_1A]
 gi|336159973|gb|EGN63043.1| hypothetical protein HMPREF0404_00056 [Fusobacterium sp. 21_1A]
          Length = 252

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + KDN+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G  L 
Sbjct: 2   ITTLCYLEKDNKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLTLI 61

Query: 196 DSITVRIVPGSYS 208
           + +   IV  +++
Sbjct: 62  NYVHRGIVIFNFN 74


>gi|357011931|ref|ZP_09076930.1| MutT/NUDIX family protein [Paenibacillus elgii B69]
          Length = 156

 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 39/63 (61%), Gaps = 7/63 (11%)

Query: 131 VRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRC-RPLAEKMKSNETPEEAVIRAVKEE 189
           +RTV++  +R   +D+QILVE   E+ D  ++ R  RP+   ++  E  ++ VIR V EE
Sbjct: 3   IRTVSLCLIR---RDDQILVE---EIVDEGLKKRLYRPIGGTVEHGENSKQTVIREVMEE 56

Query: 190 LGS 192
           +G+
Sbjct: 57  IGA 59


>gi|126728002|ref|ZP_01743818.1| mutator mutT protein [Sagittula stellata E-37]
 gi|126710967|gb|EBA10017.1| mutator mutT protein [Sagittula stellata E-37]
          Length = 142

 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%)

Query: 129 PPVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKE 188
           PP++ + V  V +I  D ++L+    E    S+         K++S ETPE A+IR + E
Sbjct: 9   PPMKIILVSAVALIDPDGRVLLAQRPE--GKSMAGLWEFPGGKVESGETPEAALIRELHE 66

Query: 189 ELGSILNDSITVRIVPGSYS 208
           ELG    DS    +   S++
Sbjct: 67  ELGIETWDSCLAPLTFASHT 86


>gi|422939573|ref|ZP_16966951.1| mutator MutT protein [Fusobacterium nucleatum subsp. animalis ATCC
           51191]
 gi|339890380|gb|EGQ79513.1| mutator MutT protein [Fusobacterium nucleatum subsp. animalis ATCC
           51191]
          Length = 252

 Score = 37.4 bits (85), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 40/73 (54%)

Query: 136 VVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEELGSILN 195
           + T+  + K N+ L+    +  +   +N+   +  K++ NETPE+ + R VKEE G IL 
Sbjct: 2   ITTLCYLEKANKYLMLHRTKKENDINKNKWLGVGGKLEKNETPEQCLFREVKEETGLILI 61

Query: 196 DSITVRIVPGSYS 208
           + +   IV  +++
Sbjct: 62  NYVHRGIVIFNFN 74


>gi|407771274|ref|ZP_11118634.1| hydrolase [Thalassospira xiamenensis M-5 = DSM 17429]
 gi|407285720|gb|EKF11216.1| hydrolase [Thalassospira xiamenensis M-5 = DSM 17429]
          Length = 143

 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 130 PVRTVNVVTVRIIGKDNQILVESHQELSDGSVRNRCRPLAEKMKSNETPEEAVIRAVKEE 189
           P +TV V  V ++  D+++L+    E    S+         K+++ ETPE A++R +KEE
Sbjct: 11  PEKTVFVSAVALVDADHRVLIAQRPE--GKSMAGLWEFPGGKVEAGETPEMALVRELKEE 68

Query: 190 LGSILNDSITVRIVPGSYS 208
           LG  + +S        S++
Sbjct: 69  LGIDITESCLAPFTFASFT 87


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,724,982,507
Number of Sequences: 23463169
Number of extensions: 195756176
Number of successful extensions: 473240
Number of sequences better than 100.0: 72
Number of HSP's better than 100.0 without gapping: 51
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 473097
Number of HSP's gapped (non-prelim): 75
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)