BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019135
         (345 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255567088|ref|XP_002524526.1| conserved hypothetical protein [Ricinus communis]
 gi|223536200|gb|EEF37853.1| conserved hypothetical protein [Ricinus communis]
          Length = 484

 Score =  513 bits (1322), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 254/346 (73%), Positives = 281/346 (81%), Gaps = 8/346 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVADTLNLAIRKIGDAGVTTIAGGKSN AG+RDGPSEDAKFS DFDVVYV  T
Sbjct: 146 MDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNTAGYRDGPSEDAKFSTDFDVVYVHST 205

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLV+DRGN ALRQISLNQ+DC+YQ +SI+ TD+LMVVGAV  GYVTCMLQQGFGP FF
Sbjct: 206 CSLLVVDRGNVALRQISLNQEDCDYQSSSITVTDLLMVVGAVFTGYVTCMLQQGFGPSFF 265

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSM 180
           S+TQ  SES      EI E  + EKP PI  SMKEEP WPSFGQL+IDLSKL LEA+  M
Sbjct: 266 SKTQHFSES------EILEHQSMEKPTPITGSMKEEPRWPSFGQLMIDLSKLTLEALADM 319

Query: 181 FLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGDK 240
           FL  +P   RS+G++  LTPL+DTLRMPEDE  PP V RQ  +VPLSET QV  P   DK
Sbjct: 320 FLYLIPSWLRSNGSRKGLTPLKDTLRMPEDEVEPPSVHRQSISVPLSETRQVHNPNASDK 379

Query: 241 YPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRD 300
           Y +MKPPKIKSASFKD SLS+KHRSSKRQEYAEFYGSGE+P    +SKS KE+TRHRQRD
Sbjct: 380 YSEMKPPKIKSASFKDPSLSSKHRSSKRQEYAEFYGSGEMPSS-GRSKSHKEKTRHRQRD 438

Query: 301 KSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGD-DSYRF 345
           KSGE+    +G EPKPV MK VDYDN KFDHYNMRSKYG  +SY+F
Sbjct: 439 KSGEVAPAATGAEPKPVNMKHVDYDNPKFDHYNMRSKYGSGNSYQF 484


>gi|225461726|ref|XP_002285515.1| PREDICTED: uncharacterized protein LOC100245328 [Vitis vinifera]
 gi|302142876|emb|CBI20171.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  506 bits (1302), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 253/350 (72%), Positives = 292/350 (83%), Gaps = 13/350 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVADT NLAIRKIGD+GVTTIAGGKSNVAG+RDGPSEDAKFS+DFDVVYVRPT
Sbjct: 143 MDDKGNVYVADTSNLAIRKIGDSGVTTIAGGKSNVAGYRDGPSEDAKFSSDFDVVYVRPT 202

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLV+DRGNAALRQISLNQ+DC+YQ +SIS TDI MV+GAV+VGY +C+LQ+GFGP  F
Sbjct: 203 CSLLVVDRGNAALRQISLNQEDCDYQNSSISATDIFMVIGAVMVGYASCLLQKGFGPSAF 262

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMG 178
           S+TQ  SESEF ++        KEKP PIVES+KEEP  GWPSFGQLIIDLSK  LEA+ 
Sbjct: 263 SKTQH-SESEFEDQL------IKEKPTPIVESIKEEPDAGWPSFGQLIIDLSKFTLEALT 315

Query: 179 SMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTG 238
            +FL F+P RF  +  +  LTPL+D L MPEDEA PP+ Q+Q++  PLSET Q  TP T 
Sbjct: 316 GIFLYFIPSRFMPTRARKGLTPLKDHLIMPEDEADPPLAQKQRAPPPLSETRQAHTPNTS 375

Query: 239 DKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYS--KSKSQKERTRH 296
           +KY +MKPPKIKS SFKD SLS+KHRSSKRQEYAEFY SGEVPPPY+  +SKSQKER+RH
Sbjct: 376 EKYSEMKPPKIKSYSFKDPSLSSKHRSSKRQEYAEFYHSGEVPPPYTQVRSKSQKERSRH 435

Query: 297 RQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYG-DDSYRF 345
           RQRDKSGE+ +G  G E KPVE+K VDYD+ KFDHYN+RSKYG DDS+RF
Sbjct: 436 RQRDKSGEM-FGAVGAESKPVEIKAVDYDDPKFDHYNIRSKYGSDDSFRF 484


>gi|356549639|ref|XP_003543199.1| PREDICTED: uncharacterized protein LOC100797841 [Glycine max]
          Length = 487

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 244/348 (70%), Positives = 288/348 (82%), Gaps = 10/348 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVADT NLAIRKIGDAGVTTIAGGKSNVAG+RDGPSEDAKFSNDFDVVYVRPT
Sbjct: 144 VDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKSNVAGYRDGPSEDAKFSNDFDVVYVRPT 203

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAALRQISL+Q+DC+YQ NSIS TDIL VVGAV+VGY TCMLQQGFG  FF
Sbjct: 204 CSLLVIDRGNAALRQISLDQEDCDYQSNSISSTDILTVVGAVIVGYATCMLQQGFGSSFF 263

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSM 180
           S+TQQPS+ +F      K L++ EK +PI+ES KEEPGWPSFGQLI+DLSKL+LEA+ S 
Sbjct: 264 SKTQQPSQKQF------KGLASNEKHMPILESSKEEPGWPSFGQLIVDLSKLSLEALAST 317

Query: 181 FLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSA--VPLSETHQVRTPGTG 238
           F+ F+P  FR S +K  LTPL+D L MPED+  PP+V RQ +    PL+E   V TP   
Sbjct: 318 FIQFIPSHFRPSNSKRGLTPLKDRLVMPEDDVPPPLVNRQNAQGHTPLTENRMVHTPTIA 377

Query: 239 DKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQ 298
           +KY +MKPPKIKS+SFKD S+S+KHRSS+R EYAEFYGS E+ PPY+KSKSQKER RHRQ
Sbjct: 378 EKYSEMKPPKIKSSSFKDPSMSSKHRSSRRPEYAEFYGSSEI-PPYTKSKSQKERPRHRQ 436

Query: 299 RDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYG-DDSYRF 345
           R+KSGE+V G  G E KPVE + VD++N KFDHY+MR+KY  ++++RF
Sbjct: 437 REKSGEVVMGAVGAEAKPVETRAVDHNNPKFDHYSMRTKYASEETFRF 484


>gi|224117004|ref|XP_002317450.1| predicted protein [Populus trichocarpa]
 gi|222860515|gb|EEE98062.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  479 bits (1232), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 247/347 (71%), Positives = 284/347 (81%), Gaps = 9/347 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVAD+LN AIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV  T
Sbjct: 121 MDDKGNIYVADSLNHAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVHST 180

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPT-DILMVVGAVLVGYVTCMLQQGFGPFF 119
           CSLLV+DRGNAALRQISLNQ+DC+YQ +S + T D+LMVVGAVL+GY TCMLQ GFG   
Sbjct: 181 CSLLVVDRGNAALRQISLNQEDCDYQSSSFTMTEDVLMVVGAVLIGYATCMLQLGFGSSS 240

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGS 179
            SR QQ SESE+      KE S+KEKPIPIV++MKEEP WPSFGQL IDLSKLALEA+  
Sbjct: 241 SSRMQQSSESEY------KEKSSKEKPIPIVDNMKEEPKWPSFGQLFIDLSKLALEALVG 294

Query: 180 MFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGD 239
           + L+FVP  ++  G +  LTPL+D+L MP+DE  PP VQRQ +  P+SE  QV+TP T D
Sbjct: 295 ILLSFVPSWYKPGGARKGLTPLKDSLIMPDDEVEPPSVQRQSTPAPVSENRQVQTPTTSD 354

Query: 240 KYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQR 299
           KY +MK PKIKSASFKD S  +KHRSSK+QEYAEFYGSGEV P + +SKS KE++RHRQR
Sbjct: 355 KYSEMKLPKIKSASFKDPSPLSKHRSSKQQEYAEFYGSGEV-PSHGRSKSHKEKSRHRQR 413

Query: 300 DKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 345
           DKSGE+V+G  G EPKP EM PVDY++ KF+HYN RSKYG DS YRF
Sbjct: 414 DKSGEVVFGAVGAEPKPAEMNPVDYNSPKFNHYNNRSKYGSDSLYRF 460


>gi|449438911|ref|XP_004137231.1| PREDICTED: uncharacterized protein LOC101210502 [Cucumis sativus]
          Length = 480

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 222/341 (65%), Positives = 272/341 (79%), Gaps = 14/341 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVADTLNLAIRKIGDAGVTTIAGGKSNV G+RDGP EDAKFSNDFDV+YVR T
Sbjct: 145 VDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVVGYRDGPGEDAKFSNDFDVMYVRST 204

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAA+RQISLNQ+DCEYQ +SIS +D+LM++GAVL GY T M+Q+GFG    
Sbjct: 205 CSLLVIDRGNAAIRQISLNQEDCEYQDSSISNSDVLMIIGAVLAGYATYMIQRGFGTSNV 264

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSM 180
           S+T  P E+E+ E+    E S+      +++S+KE+PGWPSFG+LIIDLSKLALEA+ S+
Sbjct: 265 SQTNPPLETEYREKPYKPESSS------VMDSVKEDPGWPSFGRLIIDLSKLALEAVASI 318

Query: 181 FLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR-QKSAVPLSETHQVRTPGTGD 239
           FL+ VP RFR+  T+  LTPL+D+LRMPEDE   P VQ  Q++ VPL+ET Q       D
Sbjct: 319 FLSVVPARFRARNTRKGLTPLKDSLRMPEDEPEQPTVQMLQRTPVPLTETRQAHV-NARD 377

Query: 240 KYPD-MKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQ 298
            +P+ MKP K+ S+SFKD SL +KHRSSKRQE+A+FY SGE+PPPYS+SKSQKER RHRQ
Sbjct: 378 PFPELMKPSKLNSSSFKDPSLQSKHRSSKRQEHADFYRSGEIPPPYSRSKSQKERPRHRQ 437

Query: 299 RDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYG 339
           R+KS E+ YG  G      E+KP DYDN K++HYN+R+KYG
Sbjct: 438 REKSAEISYGAVGS-----ELKPADYDNPKYEHYNIRNKYG 473


>gi|449530424|ref|XP_004172195.1| PREDICTED: uncharacterized LOC101210502, partial [Cucumis sativus]
          Length = 437

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 222/341 (65%), Positives = 272/341 (79%), Gaps = 14/341 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVADTLNLAIRKIGDAGVTTIAGGKSNV G+RDGP EDAKFSNDFDV+YVR T
Sbjct: 102 VDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVVGYRDGPGEDAKFSNDFDVMYVRST 161

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAA+RQISLNQ+DCEYQ +SIS +D+LM++GAVL GY T M+Q+GFG    
Sbjct: 162 CSLLVIDRGNAAIRQISLNQEDCEYQDSSISNSDVLMIIGAVLAGYATYMIQRGFGTSNV 221

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSM 180
           S+T  P E+E+ E+    E S+      +++S+KE+PGWPSFG+LIIDLSKLALEA+ S+
Sbjct: 222 SQTNPPLETEYREKPYKPESSS------VMDSVKEDPGWPSFGRLIIDLSKLALEAVASI 275

Query: 181 FLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR-QKSAVPLSETHQVRTPGTGD 239
           FL+ VP RFR+  T+  LTPL+D+LRMPEDE   P VQ  Q++ VPL+ET Q       D
Sbjct: 276 FLSVVPARFRARNTRKGLTPLKDSLRMPEDEPEQPTVQMLQRTPVPLTETRQAHV-NARD 334

Query: 240 KYPD-MKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQ 298
            +P+ MKP K+ S+SFKD SL +KHRSSKRQE+A+FY SGE+PPPYS+SKSQKER RHRQ
Sbjct: 335 PFPELMKPSKLNSSSFKDPSLQSKHRSSKRQEHADFYRSGEIPPPYSRSKSQKERPRHRQ 394

Query: 299 RDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYG 339
           R+KS E+ YG  G      E+KP DYDN K++HYN+R+KYG
Sbjct: 395 REKSAEISYGAVGS-----ELKPADYDNPKYEHYNIRNKYG 430


>gi|357452081|ref|XP_003596317.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
 gi|355485365|gb|AES66568.1| hypothetical protein MTR_2g075860 [Medicago truncatula]
          Length = 493

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 228/351 (64%), Positives = 270/351 (76%), Gaps = 16/351 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVADT NLAIRKIGDAGVTTIAGGKSNVAG+RDGPSEDAKFSNDFDVVYVRPT
Sbjct: 150 MDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKSNVAGYRDGPSEDAKFSNDFDVVYVRPT 209

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAALR+I L+Q+DC+YQ +SIS TDIL+VVGAVLVGY TCMLQQGFG  FF
Sbjct: 210 CSLLVIDRGNAALRKIILDQEDCDYQSSSISSTDILIVVGAVLVGYATCMLQQGFGSSFF 269

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSM 180
           S+T+   +       E K   + +K +PI ES KE+PGWPSFGQLI DLSKL+LEA+ S 
Sbjct: 270 SKTRSSGQ-------EFKGRESNDKRMPIPESSKEDPGWPSFGQLIADLSKLSLEALASA 322

Query: 181 FLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETH---QVRTPGT 237
           F  F+P  F+ +  K  LTPL+D L MPEDE  PP+V+R+ + V ++E     QV T   
Sbjct: 323 FTQFMPSHFKFNSRKTGLTPLKDRLVMPEDEVQPPLVKRKTTPVTVTENRQMPQVHTATI 382

Query: 238 GDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHR 297
            +KY + KPPK+KS+SFKD S   KHRSSKR EYAEFYGSGEVP  Y+KSKSQKER RHR
Sbjct: 383 TEKYSEAKPPKVKSSSFKDPS---KHRSSKRSEYAEFYGSGEVPSSYAKSKSQKERPRHR 439

Query: 298 QRDKSGELVYGVSGMEPKPVEMK-PVDYDNLKFDHYNMRSK--YGDDSYRF 345
            R+KSGE+V+  +G E KPVE +  VD+ N KFD Y+MR+      +S+RF
Sbjct: 440 HREKSGEVVFPTNGAEAKPVEPRAAVDHSNSKFDRYSMRTGGYVPGESFRF 490


>gi|224117390|ref|XP_002331700.1| predicted protein [Populus trichocarpa]
 gi|222874306|gb|EEF11437.1| predicted protein [Populus trichocarpa]
          Length = 495

 Score =  413 bits (1062), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 227/374 (60%), Positives = 263/374 (70%), Gaps = 54/374 (14%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY+  T
Sbjct: 147 MDDKGNIYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYLHST 206

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI-------------------LMVVGA 101
           CSLLV+DRGNAALRQISLNQ+DC+YQ      T I                   +MVVGA
Sbjct: 207 CSLLVVDRGNAALRQISLNQEDCDYQSKCTRKTTIESTRLVSKTRCWGVLFDYVIMVVGA 266

Query: 102 VLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPS 161
           VL+GY TCMLQQGFG  FFSR +Q      + ++E K+ S+KEKPIPI+E+MKEEP WPS
Sbjct: 267 VLIGYATCMLQQGFGSSFFSRMKQ------SSDSEFKKKSSKEKPIPIMENMKEEPKWPS 320

Query: 162 FGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQK 221
           FGQL+IDLSKLALEA+  + L FVP   R    +  LTPL+D+L +PED+  PP VQRQ 
Sbjct: 321 FGQLLIDLSKLALEALVGILLCFVPSWNRPGEARTGLTPLKDSLTLPEDKVEPPSVQRQS 380

Query: 222 SAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVP 281
           +  P+SE+ QV TP T DKY + KPPKIKSASFKD SL +KH SSKRQEYA FYGSGEVP
Sbjct: 381 TPAPVSESRQVHTPTTSDKYLEGKPPKIKSASFKDPSLLSKHWSSKRQEYAGFYGSGEVP 440

Query: 282 PPYSKSKSQKERTRHRQRDKSGELVYGVSGM---------EPKPVEMKPVDYDNLKFDHY 332
                                GE   G++ +         +PKP EMK VDY++ KF+HY
Sbjct: 441 -------------------SHGEARTGLTPLKDSLTLPEDKPKPAEMKHVDYESPKFEHY 481

Query: 333 NMRSKYG-DDSYRF 345
           N+RSKYG D SYRF
Sbjct: 482 NIRSKYGPDSSYRF 495


>gi|42572439|ref|NP_974315.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332642057|gb|AEE75578.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 493

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 219/356 (61%), Positives = 259/356 (72%), Gaps = 30/356 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVADTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSEDAKFSNDFDVVYVRPT
Sbjct: 157 MDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSEDAKFSNDFDVVYVRPT 216

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYN-SISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           CSLLVIDRGNAALRQISL+++DC+YQ + SIS TDIL+V+GAVL+GY TCMLQQGFG  F
Sbjct: 217 CSLLVIDRGNAALRQISLSEEDCDYQDDSSISLTDILLVIGAVLIGYATCMLQQGFGNSF 276

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGS 179
           FS+TQ  SE+ + EE   KE     +P+      KEEPGWPSFGQL+ DL KLALE + S
Sbjct: 277 FSKTQVGSETSYEEEHPGKE--KLSRPVHEKTITKEEPGWPSFGQLLTDLCKLALEFITS 334

Query: 180 MFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGD 239
              + VP RF+   T  NL PL+D L MPEDE  PP VQR  +  P+SE+     P   D
Sbjct: 335 ---HLVPARFQ---TNPNLRPLKDRLIMPEDEQEPPRVQRHTAPAPISESRHAHLPKADD 388

Query: 240 KYPDMKPPKIKSAS-FKDSSLST--KHR-SSKRQEYAEFYGSGEVPPPYSKSKSQKERTR 295
            YP+ K PK++S+S  KD +LS+   HR SSKRQ+YA+FY SGEV  P    K  KER+R
Sbjct: 389 SYPEHKTPKLRSSSVMKDPTLSSSKHHRTSSKRQDYAQFYASGEVAQP----KIHKERSR 444

Query: 296 HRQRDKSGELVYGVSGMEPKPV---EMKPVDYDN-LKFDHYNMR-SKYGDDS-YRF 345
            R RDK+ E        EPKP     +KPV+Y N  KFDHYNMR SKYG ++ +RF
Sbjct: 445 RRHRDKTTE-------TEPKPTPSDTVKPVEYSNSSKFDHYNMRSSKYGPETPFRF 493


>gi|297834352|ref|XP_002885058.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330898|gb|EFH61317.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  392 bits (1008), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 218/356 (61%), Positives = 258/356 (72%), Gaps = 30/356 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YV DTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSEDAKFSNDFDVVYVR T
Sbjct: 153 MDDKGNVYVGDTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSEDAKFSNDFDVVYVRST 212

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYN-SISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           CSLLVIDRGNAALRQISL+ +DC+YQ + SIS TDIL+V+GAVL+GY TC+LQQGFG  F
Sbjct: 213 CSLLVIDRGNAALRQISLSDEDCDYQDDSSISLTDILLVIGAVLIGYATCLLQQGFGNSF 272

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGS 179
           FS+TQ  SE+ F EE   KE      P+   +  KEEPGWPSFGQLIIDL KLAL+ + S
Sbjct: 273 FSKTQLESETSFEEEHPGKE--KLSLPVHETKVTKEEPGWPSFGQLIIDLCKLALDFITS 330

Query: 180 MFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGD 239
              + VP RF +S    NL PL+D L+MPEDE  PP VQR  +  P+SE+     P   D
Sbjct: 331 ---HLVPTRFTTS---HNLRPLKDRLKMPEDEQEPPRVQRHTAPAPISESRHAHLPKADD 384

Query: 240 KYPDMKPPKIKSAS-FKDSSLS-TKHR--SSKRQEYAEFYGSGEVPPPYSKSKSQKERTR 295
            YP+ K PK++S+S  KD +LS +KH   SSKRQ+YA+FY SGEV PP    K  KER+R
Sbjct: 385 SYPEHKTPKLRSSSVMKDPALSASKHHRSSSKRQDYAQFYASGEVAPP----KIHKERSR 440

Query: 296 HRQRDKSGELVYGVSGMEPKPV---EMKPVDYDN-LKFDHYNMR-SKYGDDS-YRF 345
            R RDK+ E        EPK      +KPV+Y N  KFDHYNMR SKYG ++ +RF
Sbjct: 441 RRHRDKTTE-------TEPKLTPSDTVKPVEYSNSSKFDHYNMRSSKYGPETPFRF 489


>gi|22331093|ref|NP_188104.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|20260314|gb|AAM13055.1| unknown protein [Arabidopsis thaliana]
 gi|25083516|gb|AAN72090.1| unknown protein [Arabidopsis thaliana]
 gi|332642056|gb|AEE75577.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 492

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 218/356 (61%), Positives = 258/356 (72%), Gaps = 31/356 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVADTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSEDAKFSNDFDVVYVRPT
Sbjct: 157 MDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSEDAKFSNDFDVVYVRPT 216

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYN-SISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           CSLLVIDRGNAALRQISL+++DC+YQ + SIS TDIL+V+GAVL+GY TCMLQQGFG  F
Sbjct: 217 CSLLVIDRGNAALRQISLSEEDCDYQDDSSISLTDILLVIGAVLIGYATCMLQQGFGNSF 276

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGS 179
           FS+T   SE+ + EE   KE     +P+      KEEPGWPSFGQL+ DL KLALE + S
Sbjct: 277 FSKTVG-SETSYEEEHPGKE--KLSRPVHEKTITKEEPGWPSFGQLLTDLCKLALEFITS 333

Query: 180 MFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGD 239
              + VP RF+   T  NL PL+D L MPEDE  PP VQR  +  P+SE+     P   D
Sbjct: 334 ---HLVPARFQ---TNPNLRPLKDRLIMPEDEQEPPRVQRHTAPAPISESRHAHLPKADD 387

Query: 240 KYPDMKPPKIKSAS-FKDSSLST--KHR-SSKRQEYAEFYGSGEVPPPYSKSKSQKERTR 295
            YP+ K PK++S+S  KD +LS+   HR SSKRQ+YA+FY SGEV  P    K  KER+R
Sbjct: 388 SYPEHKTPKLRSSSVMKDPTLSSSKHHRTSSKRQDYAQFYASGEVAQP----KIHKERSR 443

Query: 296 HRQRDKSGELVYGVSGMEPKPV---EMKPVDYDN-LKFDHYNMR-SKYGDDS-YRF 345
            R RDK+ E        EPKP     +KPV+Y N  KFDHYNMR SKYG ++ +RF
Sbjct: 444 RRHRDKTTE-------TEPKPTPSDTVKPVEYSNSSKFDHYNMRSSKYGPETPFRF 492


>gi|11994597|dbj|BAB02652.1| unnamed protein product [Arabidopsis thaliana]
          Length = 511

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 219/374 (58%), Positives = 259/374 (69%), Gaps = 48/374 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDDKGN+YVADTLNLAIRKIGD+GVTTIAGGKSN+AG+RDGPSEDAKFSNDFDVVYVRPT
Sbjct: 157 MDDKGNVYVADTLNLAIRKIGDSGVTTIAGGKSNIAGYRDGPSEDAKFSNDFDVVYVRPT 216

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYN-SISPT------------------DILMVVGA 101
           CSLLVIDRGNAALRQISL+++DC+YQ + SIS T                  DIL+V+GA
Sbjct: 217 CSLLVIDRGNAALRQISLSEEDCDYQDDSSISLTGVDLRVLTEDHCVLTFGADILLVIGA 276

Query: 102 VLVGYVTCMLQQGFGPFFFSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPS 161
           VL+GY TCMLQQGFG  FFS+TQ  SE+ + EE   KE     +P+      KEEPGWPS
Sbjct: 277 VLIGYATCMLQQGFGNSFFSKTQVGSETSYEEEHPGKE--KLSRPVHEKTITKEEPGWPS 334

Query: 162 FGQLIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQK 221
           FGQL+ DL KLALE + S   + VP RF+   T  NL PL+D L MPEDE  PP VQR  
Sbjct: 335 FGQLLTDLCKLALEFITS---HLVPARFQ---TNPNLRPLKDRLIMPEDEQEPPRVQRHT 388

Query: 222 SAVPLSETHQVRTPGTGDKYPDMKPPKIKSAS-FKDSSLST--KHR-SSKRQEYAEFYGS 277
           +  P+SE+     P   D YP+ K PK++S+S  KD +LS+   HR SSKRQ+YA+FY S
Sbjct: 389 APAPISESRHAHLPKADDSYPEHKTPKLRSSSVMKDPTLSSSKHHRTSSKRQDYAQFYAS 448

Query: 278 GEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEPKPV---EMKPVDYDN-LKFDHYN 333
           GEV  P    K  KER+R R RDK+ E        EPKP     +KPV+Y N  KFDHYN
Sbjct: 449 GEVAQP----KIHKERSRRRHRDKTTE-------TEPKPTPSDTVKPVEYSNSSKFDHYN 497

Query: 334 MR-SKYGDDS-YRF 345
           MR SKYG ++ +RF
Sbjct: 498 MRSSKYGPETPFRF 511


>gi|357140648|ref|XP_003571876.1| PREDICTED: uncharacterized protein LOC100824092 [Brachypodium
           distachyon]
          Length = 491

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 192/349 (55%), Positives = 247/349 (70%), Gaps = 15/349 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD GN+Y+ADT NLAIRKIGD+GVTTIAGGKSN+ G+RDGPSEDAKFS DFDVVYV+  
Sbjct: 154 VDDTGNVYIADTANLAIRKIGDSGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKM 213

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAALR+I+L Q+DC YQ  ++  +DI++V+GAV+ GY+  ++Q GFGP   
Sbjct: 214 CSLLVIDRGNAALRKIALPQEDCTYQDAALLSSDIILVIGAVVAGYLFSVVQHGFGPSSS 273

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMG 178
            + + P       E E +E S   KP  +VES+KEEP  GWPSFG L+ DL KLA+E +G
Sbjct: 274 EKIEAP-------EDEKQESSTVGKPPLVVESLKEEPSAGWPSFGTLVADLLKLAIEGVG 326

Query: 179 SMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTG 238
           S+  N VP R +    K  LTPL+D L MPED    P+ Q+  S+ P+  T  +  P T 
Sbjct: 327 SLLFNIVPSRLQRVKRKTGLTPLKDRLVMPEDREETPIAQKL-SSTPM-RTETLHAPNTV 384

Query: 239 DKYPDMKPPKIKSAS-FKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHR 297
           ++        IKS+S F+DS+LS+KHRS+KR EYAEFYGS E   P + +K  K+R RHR
Sbjct: 385 NETAPKAQKSIKSSSKFRDSTLSSKHRSTKRPEYAEFYGSSET--PQASAKVPKDRLRHR 442

Query: 298 QRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 345
            R+KSGE+ YG    EPKP EMKP DY++ K+DHYNMR+KYG D+ +R+
Sbjct: 443 HREKSGEVAYGSGHPEPKPAEMKPADYNDPKYDHYNMRNKYGTDTGFRY 491


>gi|115445197|ref|NP_001046378.1| Os02g0234200 [Oryza sativa Japonica Group]
 gi|50251216|dbj|BAD27660.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|113535909|dbj|BAF08292.1| Os02g0234200 [Oryza sativa Japonica Group]
 gi|125538730|gb|EAY85125.1| hypothetical protein OsI_06475 [Oryza sativa Indica Group]
 gi|222622487|gb|EEE56619.1| hypothetical protein OsJ_06002 [Oryza sativa Japonica Group]
          Length = 493

 Score =  363 bits (932), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 192/348 (55%), Positives = 239/348 (68%), Gaps = 14/348 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDAKFS DFDVVYV+  
Sbjct: 157 VDDTGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKM 216

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAALR+I+L Q+DC YQ  ++  +DI++V+GAV+ GY+  ++Q GFG    
Sbjct: 217 CSLLVIDRGNAALRKIALPQEDCTYQDATLLSSDIILVIGAVVAGYIFSVVQHGFGSSTA 276

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMG 178
            + + P       E E +E S   KP  +VES+KEEP  GWPS G LI DL KL +E +G
Sbjct: 277 EKIEAP-------EDEHQESSTVGKPPLVVESLKEEPSAGWPSLGTLIGDLLKLVIEGVG 329

Query: 179 SMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTG 238
           +  L  VP R +    K +LTPL+D L MPED    PV Q+  S     ET  +  P   
Sbjct: 330 NQLLRLVPSRLQHGKRKTDLTPLKDRLVMPEDTEETPVAQKLSSTPMRPET--LHGPNPV 387

Query: 239 DKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQ 298
           ++        +KS+ F+DS+LS+KHRSSKRQEYAEFYGS E   P   SK  K+R RHR 
Sbjct: 388 NETAPKAQKSVKSSKFRDSTLSSKHRSSKRQEYAEFYGSSET--PQVSSKVPKDRLRHRH 445

Query: 299 RDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 345
           R+KSGE VYG S  EPKP E+KP DY + K+D YN+RSKYG DS YR+
Sbjct: 446 REKSGEAVYGTSHPEPKPAEVKPADYSDPKYDPYNIRSKYGADSGYRY 493


>gi|449518065|ref|XP_004166064.1| PREDICTED: uncharacterized protein LOC101228723 [Cucumis sativus]
          Length = 449

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 241/346 (69%), Gaps = 14/346 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVADTLNLAIRKI DAGVTTIAGGK+NV G+ DGP E+AKFSNDFDV+YVR T
Sbjct: 115 IDDKGNVYVADTLNLAIRKIVDAGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRT 174

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLV+DRGNAALRQISLN++DC+YQY S+S +D+ M +GA+L+GY T MLQ GF   FF
Sbjct: 175 CSLLVVDRGNAALRQISLNKEDCDYQYGSVSTSDVAMFIGALLIGYFTYMLQHGFRLSFF 234

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSM 180
           +   Q      + ETE KELS K K   +V ++KEE  W SFGQ++ +L K A+E +   
Sbjct: 235 TFMVQSE----HLETETKELS-KGKQTKLVSTIKEETWWESFGQVVAELYKQAIELLPGN 289

Query: 181 FLNFVPFRFRSSGTK-GNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGD 239
             +F+   FRS   K   LTPL+D L+MPEDE    V  +QK+  PLSET          
Sbjct: 290 LKSFLRPYFRSEDNKEKGLTPLKDALKMPEDEIKTNVSLKQKTVTPLSETKHASI----- 344

Query: 240 KYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQR 299
           K+ ++KPPK+KS S K+ SL  KH  S  QEYAEFYG+G V    S+SK QK+R+RHRQ+
Sbjct: 345 KHDELKPPKMKS-SIKNPSLLNKHSHSG-QEYAEFYGTGMVSSSLSRSKGQKDRSRHRQK 402

Query: 300 DKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDSYRF 345
           +K  +++ G  G EPK  EM+  DY+  KFD YN+R+KY  DS  F
Sbjct: 403 EKGLDILTGTLGAEPKLAEMR-TDYNEPKFDQYNIRNKYRYDSSSF 447


>gi|449456567|ref|XP_004146020.1| PREDICTED: uncharacterized protein LOC101206392 [Cucumis sativus]
          Length = 477

 Score =  353 bits (907), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 241/346 (69%), Gaps = 14/346 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVADTLNLAIRKI DAGVTTIAGGK+NV G+ DGP E+AKFSNDFDV+YVR T
Sbjct: 143 IDDKGNVYVADTLNLAIRKIVDAGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRT 202

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLV+DRGNAALRQISLN++DC+YQY S+S +D+ M +GA+L+GY T MLQ GF   FF
Sbjct: 203 CSLLVVDRGNAALRQISLNKEDCDYQYGSVSTSDVAMFIGALLIGYFTYMLQHGFRLSFF 262

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSM 180
           +   Q      + ETE KELS K K   +V ++KEE  W SFGQ++ +L K A+E +   
Sbjct: 263 TFMVQSE----HLETETKELS-KGKQTKLVSTIKEETWWESFGQVVAELYKQAIELLPGN 317

Query: 181 FLNFVPFRFRSSGTK-GNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTGD 239
             +F+   FRS   K   LTPL+D L+MPEDE    V  +QK+  PLSET          
Sbjct: 318 LKSFLRPYFRSEDNKEKGLTPLKDALKMPEDEIKTNVSLKQKTVTPLSETKHASI----- 372

Query: 240 KYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQR 299
           K+ ++KPPK+KS S K+ SL  KH  S  QEYAEFYG+G V    S+SK QK+R+RHRQ+
Sbjct: 373 KHDELKPPKMKS-SIKNPSLLNKHSHSG-QEYAEFYGTGMVSSSLSRSKGQKDRSRHRQK 430

Query: 300 DKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDSYRF 345
           +K  +++ G  G EPK  EM+  DY+  KFD YN+R+KY  DS  F
Sbjct: 431 EKGLDILTGTLGAEPKLAEMR-TDYNEPKFDQYNIRNKYRYDSSSF 475


>gi|242061092|ref|XP_002451835.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
 gi|241931666|gb|EES04811.1| hypothetical protein SORBIDRAFT_04g008420 [Sorghum bicolor]
          Length = 495

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 189/350 (54%), Positives = 238/350 (68%), Gaps = 17/350 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDAKFS DFDVVYV+  
Sbjct: 158 VDDMGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKM 217

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAALR+ISL Q+DC YQ +++  +D+++V+GAV+ GY+    Q GFG F  
Sbjct: 218 CSLLVIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGAVVAGYIFSGFQHGFG-FSG 276

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMG 178
           S   +  E+E +E + I       KP  +VES+KEEP  GWPS G LI DL KLA+E +G
Sbjct: 277 SEKVEAPENEQHESSTIG------KPPLVVESLKEEPGAGWPSLGTLIADLLKLAIEGVG 330

Query: 179 SMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQVRTPGTG 238
            + L+ VP R +    K +LTPLRD L MPED    P  Q+  S     ET  V  P   
Sbjct: 331 KLLLSVVPQRMQHGKKKTDLTPLRDRLVMPEDREETPAAQKLSSTPMRPET--VHAPNAV 388

Query: 239 DKYPDMKPPKIKSAS-FKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHR 297
            +        IKS+S F+DS+LS+KHRSSKRQEYA+FYG+ E P P   +K  K+R RHR
Sbjct: 389 PETAVKAQKSIKSSSKFRDSTLSSKHRSSKRQEYADFYGTSE-PAPVG-AKVPKDRLRHR 446

Query: 298 -QRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 345
             R+KSGE+ YG +  + KP E KP DY + K+D Y MRSKY  +S YR+
Sbjct: 447 HHREKSGEVAYGAAHPDLKPAEAKPADYSDPKYDPY-MRSKYAAESGYRY 495


>gi|226504928|ref|NP_001143593.1| uncharacterized protein LOC100276297 precursor [Zea mays]
 gi|195623016|gb|ACG33338.1| hypothetical protein [Zea mays]
          Length = 501

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 183/352 (51%), Positives = 235/352 (66%), Gaps = 16/352 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDAKFS DFDVVYV+  
Sbjct: 159 VDDMGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKM 218

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAALR+ISL Q+DC YQ +++  +D+++V+GAV+ GY+    Q GFG F  
Sbjct: 219 CSLLVIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGAVVAGYIFSGFQHGFG-FSG 277

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMG 178
           S   +  E+E +E + I       KP  +VES+KEEP  GWPS G LI DL KLA+E +G
Sbjct: 278 SEKVEAPENEQHESSTIG------KPPLVVESLKEEPGAGWPSLGTLIADLLKLAIEGVG 331

Query: 179 SMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS-ETHQVRTPGT 237
            + L+ VP R +    K +LTPLRD L MPED        ++ S+ P+  ET     P T
Sbjct: 332 KLLLSVVPQRMQHGKRKTDLTPLRDRLVMPEDREETAAAAQKLSSTPMRPETAHAPNPVT 391

Query: 238 GDKYPDMKPPKIKSASFKDSS-LSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKE--RT 294
               P   P   K +  +DSS LS+KHRSSKRQEYA+FYG+ E  P  + +K  K+  R 
Sbjct: 392 ETAAPKA-PKSTKPSKLRDSSTLSSKHRSSKRQEYADFYGTSEPAPVGAAAKVPKDRLRH 450

Query: 295 RHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 345
           RH  R+KSGE+ YG +  + KP E KP DY +  +D Y +RSKY  +S YR+
Sbjct: 451 RHHHREKSGEVAYGAAHHDLKPAEAKPADYSDPSYDPY-VRSKYAAESGYRY 501


>gi|413936635|gb|AFW71186.1| hypothetical protein ZEAMMB73_395034 [Zea mays]
          Length = 501

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 183/352 (51%), Positives = 235/352 (66%), Gaps = 16/352 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDAKFS DFDVVYV+  
Sbjct: 159 VDDMGNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKM 218

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGNAALR+ISL Q+DC YQ +++  +D+++V+GAV+ GY+    Q GFG F  
Sbjct: 219 CSLLVIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGAVVAGYIFSGFQHGFG-FSG 277

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMG 178
           S   +  E+E +E + I       KP  +VES+KEEP  GWPS G LI DL KLA+E +G
Sbjct: 278 SEKVEAPENEQHESSTIG------KPPLVVESLKEEPGAGWPSLGTLIADLLKLAIEGVG 331

Query: 179 SMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS-ETHQVRTPGT 237
            + L+ VP R +    K +LTPLRD L MPED        ++ S+ P+  ET     P T
Sbjct: 332 KLLLSVVPQRMQHGKRKTDLTPLRDRLVMPEDREETAAAAQKLSSTPMRPETAHAPNPVT 391

Query: 238 GDKYPDMKPPKIKSASFKDSS-LSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKE--RT 294
               P   P   K +  +DSS LS+KHRSSKRQEYA+FYG+ E  P  + +K  K+  R 
Sbjct: 392 ETAAPKA-PKSTKPSKLRDSSTLSSKHRSSKRQEYADFYGTSEPAPVGAAAKVPKDRLRH 450

Query: 295 RHRQRDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 345
           RH  R+KSGE+ YG +  + KP E KP DY +  +D Y +RSKY  +S YR+
Sbjct: 451 RHHHREKSGEVAYGAAHHDLKPAEAKPADYSDPSYDPY-VRSKYAAESGYRY 501


>gi|238009310|gb|ACR35690.1| unknown [Zea mays]
          Length = 340

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 181/348 (52%), Positives = 232/348 (66%), Gaps = 16/348 (4%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           GN+YVADT NLAIRKIG++GVTTIAGGKSN+ G+RDGPSEDAKFS DFDVVYV+  CSLL
Sbjct: 2   GNVYVADTANLAIRKIGESGVTTIAGGKSNIPGYRDGPSEDAKFSTDFDVVYVKKMCSLL 61

Query: 65  VIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQ 124
           VIDRGNAALR+ISL Q+DC YQ +++  +D+++V+GAV+ GY+    Q GFG F  S   
Sbjct: 62  VIDRGNAALRKISLPQEDCTYQDSALLSSDLILVIGAVVAGYIFSGFQHGFG-FSGSEKV 120

Query: 125 QPSESEFNEETEIKELSNKEKPIPIVESMKEEP--GWPSFGQLIIDLSKLALEAMGSMFL 182
           +  E+E +E + I       KP  +VES+KEEP  GWPS G LI DL KLA+E +G + L
Sbjct: 121 EAPENEQHESSTIG------KPPLVVESLKEEPGAGWPSLGTLIADLLKLAIEGVGKLLL 174

Query: 183 NFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS-ETHQVRTPGTGDKY 241
           + VP R +    K +LTPLRD L MPED        ++ S+ P+  ET     P T    
Sbjct: 175 SVVPQRMQHGKRKTDLTPLRDRLVMPEDREETAAAAQKLSSTPMRPETAHAPNPVTETAA 234

Query: 242 PDMKPPKIKSASFKDSS-LSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKE--RTRHRQ 298
           P   P   K +  +DSS LS+KHRSSKRQEYA+FYG+ E  P  + +K  K+  R RH  
Sbjct: 235 PKA-PKSTKPSKLRDSSTLSSKHRSSKRQEYADFYGTSEPAPVGAAAKVPKDRLRHRHHH 293

Query: 299 RDKSGELVYGVSGMEPKPVEMKPVDYDNLKFDHYNMRSKYGDDS-YRF 345
           R+KSGE+ YG +  + KP E KP DY +  +D Y +RSKY  +S YR+
Sbjct: 294 REKSGEVAYGAAHHDLKPAEAKPADYSDPSYDPY-VRSKYAAESGYRY 340


>gi|357516379|ref|XP_003628478.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
 gi|355522500|gb|AET02954.1| hypothetical protein MTR_8g058630 [Medicago truncatula]
          Length = 150

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 79/95 (83%), Positives = 87/95 (91%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVADT N+AIRKIGDAGVTTIAGGKSNVAG+RDGP EDAK SNDFDVVY+RPT
Sbjct: 46  LDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKLSNDFDVVYIRPT 105

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI 95
           CSLLVIDRGNAALRQI LNQ+DC YQ +SIS T +
Sbjct: 106 CSLLVIDRGNAALRQIFLNQEDCNYQSSSISLTGL 140


>gi|357516309|ref|XP_003628443.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
 gi|355522465|gb|AET02919.1| hypothetical protein MTR_8g058300 [Medicago truncatula]
          Length = 160

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 79/93 (84%), Positives = 86/93 (92%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVADT N+AIRKIGDAGVTTIAGGKSNVAG+RDGP EDAK SNDFDVVY+RPT
Sbjct: 46  LDDKGNVYVADTQNMAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKLSNDFDVVYIRPT 105

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 93
           CSLLVIDRGNAALRQI LNQ+DC YQ +SIS T
Sbjct: 106 CSLLVIDRGNAALRQIFLNQEDCNYQSSSISLT 138


>gi|147798773|emb|CAN69794.1| hypothetical protein VITISV_022544 [Vitis vinifera]
          Length = 491

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 129/364 (35%), Positives = 179/364 (49%), Gaps = 47/364 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADTLN+AIRKI DAGVTTIAGGK    G   DGPSEDAKFSNDFDVVY+  
Sbjct: 150 VDDRGNIYVADTLNMAIRKISDAGVTTIAGGKLGRGGGHVDGPSEDAKFSNDFDVVYIGS 209

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P  I +++ A   GY+  +LQ+  G   
Sbjct: 210 SCSLLVIDRGNQAIREIQLHFDDCAYQYGSGFPLGIAVLIAAGFFGYMLALLQRRVGTIV 269

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKP-IPIVESM--KEEPGWPSFGQLIIDLSKLALEA 176
            S   Q + S     T  K L +   P IP  + M  +EE  + S G+L +       E 
Sbjct: 270 SSENDQANPS-IAHSTYQKPLKSVRPPLIPTEDEMEKQEEGFFGSLGKLFVYAGACIAEI 328

Query: 177 MGSMF--LNFVPFRFRSSGT--------KGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPL 226
            G M   L   P  ++              N  PL+++  +P DE  PP +         
Sbjct: 329 FGGMIPGLKKKPHSYQYQNQQNYQQPQKHSNAWPLQESFVIP-DEDEPPSI--------- 378

Query: 227 SETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSK 286
                 RTP     YP M          KD+     H+  + + +   +           
Sbjct: 379 ----DTRTPTPRKTYPFMS---------KDA--EKMHQIRQSRAFVSGWDGDFQQQQKQH 423

Query: 287 SKSQKERTRH----RQRDKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSKYG 339
                  T H    +  +K+ E+V+G      +  +PV++KPV+Y +  +DH+N+RS+  
Sbjct: 424 HHRHYSSTPHTYYEQNCEKTNEIVFGAVQEQQVRREPVDIKPVNYGDPIYDHHNIRSRIH 483

Query: 340 DDSY 343
              Y
Sbjct: 484 SKGY 487


>gi|225424717|ref|XP_002265467.1| PREDICTED: uncharacterized protein LOC100263777 [Vitis vinifera]
 gi|296086531|emb|CBI32120.3| unnamed protein product [Vitis vinifera]
          Length = 491

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 132/365 (36%), Positives = 183/365 (50%), Gaps = 49/365 (13%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADTLN+AIRKI DAGVTTIAGGK    G   DGPSEDAKFSNDFDVVY+  
Sbjct: 150 VDDRGNIYVADTLNMAIRKISDAGVTTIAGGKLGRGGGHVDGPSEDAKFSNDFDVVYIGS 209

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P  I +++ A   GY+  +LQ+  G   
Sbjct: 210 SCSLLVIDRGNQAIREIQLHFDDCAYQYGSGFPLGIAVLIAAGFFGYMLALLQRRVGTIV 269

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKP-IPIVESM--KEEPGWPSFGQLIIDLSKLALEA 176
            S   Q + S     T  K L +   P IP  + M  +EE  + S G+L +       E 
Sbjct: 270 SSENDQANPS-IAHSTYQKPLKSVRPPLIPTEDEMERQEEGFFGSLGKLFVYAGACIAEI 328

Query: 177 MGSMF--LNFVPFRFRSSGT--------KGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPL 226
            G M   L   P  ++              N  PL+++  +P DE  PP +         
Sbjct: 329 FGGMIPGLKKKPHSYQYQNQQNYQQPQKHSNAWPLQESFVIP-DEDEPPSI--------- 378

Query: 227 SETHQVRTPGTGDKYPDM-----KPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVP 281
                 RTP     YP M     K  +I+ +    S      +  ++Q +   Y S   P
Sbjct: 379 ----DTRTPTPRKTYPFMSKDAEKMHQIRQSRAFVSGWDGDFQQQQKQHHHRHYSS--TP 432

Query: 282 PPYSKSKSQKERTRHRQRDKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSKY 338
             Y +   +          K+ E+V+G      +  +PV++KPV+Y +  +DH+N+RS+ 
Sbjct: 433 HTYYEQNCE----------KTNEIVFGAVQEQQVRREPVDIKPVNYGDPIYDHHNIRSRI 482

Query: 339 GDDSY 343
               Y
Sbjct: 483 HSKGY 487


>gi|357516307|ref|XP_003628442.1| hypothetical protein MTR_8g058280 [Medicago truncatula]
 gi|355522464|gb|AET02918.1| hypothetical protein MTR_8g058280 [Medicago truncatula]
          Length = 283

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 37/284 (13%)

Query: 71  AALRQISLNQDDCEYQYNS-ISPTDILMVVGAVLVGYVTCMLQQGF---GPFFFSRTQQP 126
           A L  I  N    +Y YN     + I  +  +  +G +  +  + +   G +     QQP
Sbjct: 25  APLIIICGNIVAVKYPYNCWCCDSRICCIHASAGIGILFLLKNELYFILGVYIAYMLQQP 84

Query: 127 SESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGSMFLNFVP 186
           SE +F  E      ++ +K    +E  KEE GWPSF Q ++DL                 
Sbjct: 85  SERDFKGE------ASSDKSKSTLERTKEETGWPSFRQ-VVDL----------------- 120

Query: 187 FRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKS-AVPLSETHQVRTPGTGDKYPDMK 245
              +   T+  LTPL+D L MPEDE  P +V+RQ + A PL+ET ++      +KY + K
Sbjct: 121 --LKPDSTRRGLTPLKDRLLMPEDELEPLLVKRQSALAPPLTETRKIHLKSAAEKYSETK 178

Query: 246 PPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGEL 305
             K+KS++ KD SL +KH SSKRQEYAEFYGS E+P P +KSK QK+R+RHR+R+KS E 
Sbjct: 179 IAKVKSSTPKDPSLPSKHHSSKRQEYAEFYGSSEIPAP-TKSKIQKQRSRHRRREKSEE- 236

Query: 306 VYGVSGMEPKPVEMKPVDYDNLKFDHYN---MRSKY-GDDSYRF 345
           V G  G E KP+EM+   Y N K+D YN   MR KY  +D+ RF
Sbjct: 237 VSGAVGTEQKPLEMRAAGYFNTKYDQYNYNMMRPKYVPEDTSRF 280


>gi|356565931|ref|XP_003551189.1| PREDICTED: uncharacterized protein LOC100781836 [Glycine max]
          Length = 493

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 129/381 (33%), Positives = 190/381 (49%), Gaps = 79/381 (20%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT N+AIRKI D+GVTTIAGGK N  G   DGPSE+AKFS+D DVVYV  
Sbjct: 154 VDDRGNIYVADTTNMAIRKISDSGVTTIAGGKWNRGGGHVDGPSEEAKFSDDLDVVYVGS 213

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P  I M+VGA   GY+  +LQ+  G   
Sbjct: 214 SCSLLVIDRGNRAIREIQLHFDDCAYQYGSGFPLGIAMLVGAGFFGYMLALLQRRLGTIV 273

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPI-----PIVES-----MKEEPGWPSFGQLIIDL 169
            S+             E+   S  +KP+     P++ S      +EE  + S G+L+ + 
Sbjct: 274 ASQ-------------EVIPPSPYQKPLKSVRPPLIPSEYEPDKQEEGFFGSLGKLLANT 320

Query: 170 SKLALEAMGSMFLNF----VPFRFRSS------GTKGNLTPLRDTLRMPEDEAAPPVVQR 219
               +E +G +F +F    + ++F+          + N  P++++  +P DE  PP +  
Sbjct: 321 GASMVEIIGGLFPSFRRKSLRYQFQRQPLIQQPQKQVNAWPVQESFVIP-DEDQPPSI-- 377

Query: 220 QKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGE 279
                        R P     YP M          KD+    + R S+      FY SG 
Sbjct: 378 -----------DTRAPTPRKTYPFMS---------KDAEKMQQLRQSR-----AFY-SGW 411

Query: 280 VPPPYSKSKSQKERTRHRQR------------DKSGELVYGV---SGMEPKPVEMKPVDY 324
                 + + Q++  RH+ R            + + E+V+G       + + V +KPVDY
Sbjct: 412 DGDLQQQQQQQQKHHRHQYRSSIPHTYYEQSHETTNEIVFGAVEEQDRKQESVVIKPVDY 471

Query: 325 DNLKFDHYNMRSKYGDDSYRF 345
               ++H N+RS+     Y +
Sbjct: 472 GESMYEH-NIRSRMSSMGYGY 491


>gi|356497860|ref|XP_003517774.1| PREDICTED: uncharacterized protein LOC100799483 [Glycine max]
          Length = 508

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 97/234 (41%), Positives = 138/234 (58%), Gaps = 18/234 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D++GN+YVAD +N+AIRKI D+GVTTIAGGK S   G  DGPSE+AKFSNDFDVVYV  
Sbjct: 150 VDNRGNIYVADIMNMAIRKISDSGVTTIAGGKWSRGGGHVDGPSEEAKFSNDFDVVYVGS 209

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY +  P  I +++GA   GY+  +LQ+  G   
Sbjct: 210 SCSLLVIDRGNQAIREIQLHFDDCAYQYENGFPLGIAVLIGAGFFGYMLALLQRRLGTIV 269

Query: 120 FSR--TQQPSESEFNEETEIKELSNKEKPIPIVE---SMKEEPGWPSFGQLIIDLSKLAL 174
            S+  +   S S  +     K L++   P+   E     +EE  + S G+L+ +     +
Sbjct: 270 ASQDVSAHSSMSGISPSPYQKPLNSARPPLIPSEDESDKQEESFFGSIGKLLTNAGASVV 329

Query: 175 EAMGSMFLNF-----------VPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVV 217
           E MG++F  F            P  F+    + N  P++++  +P DE  PP +
Sbjct: 330 EIMGALFPGFRKNPPQSYEFQSPPLFQQPQKQVNAWPVQESFVIP-DEDEPPSI 382


>gi|224102017|ref|XP_002312513.1| predicted protein [Populus trichocarpa]
 gi|222852333|gb|EEE89880.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 121/356 (33%), Positives = 176/356 (49%), Gaps = 43/356 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVADT+N+AIRKI DAGVTTIAGGK       DG SEDA FSNDFDVVY+  +
Sbjct: 153 VDDRGNIYVADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDANFSNDFDVVYIGSS 212

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY S  P  I ++V A   GY+  +LQ+  G    
Sbjct: 213 CSLLVIDRGNRAIREIQLHFDDCAYQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGMIVS 272

Query: 121 SR--TQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMG 178
            +  + + S +    +  IK +     P    +   EE  + S G+L I+     +E  G
Sbjct: 273 PQNVSMKMSTTGIPYQKPIKSIRPPLIPTEDEQEKHEEGLFGSLGKLFINTGASVMEIFG 332

Query: 179 SMFLNF-----------VPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS 227
            +  +F              +      + +  P++D+  +P DE  PP ++         
Sbjct: 333 GIVPSFRKKPVSYQYQNYQQQQYQHQKQLSSWPVQDSFVIP-DEDEPPSIES-------- 383

Query: 228 ETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYG--SGEVPPPYS 285
                RTP     YP M          KD+    + R   R  Y+ + G    +    + 
Sbjct: 384 -----RTPTPRKTYPFMS---------KDTEKMHQWRQG-RSIYSGWDGDLQQQQHQHHH 428

Query: 286 KSKSQKERTRHRQR-DKSGELVYGVSGMEPKPVE---MKPVDYDNLKFDHYNMRSK 337
           + +S    T + Q  +K+ E+V+G    +    E    KPVDY + K  H+N+RS+
Sbjct: 429 RYQSSTPHTYYEQSYEKTNEIVFGAVQEQDGKYETMVTKPVDYGDPKHYHHNIRSR 484


>gi|449449204|ref|XP_004142355.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101209861 [Cucumis sativus]
          Length = 454

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/365 (30%), Positives = 186/365 (50%), Gaps = 47/365 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV-AGFRDGPSEDAKFSNDFDVVYVRP 59
           +D++GN+Y+ADT+N+AIRKI D GVTTIAGG+ N  +G  DGPSEDAKFSNDFDVVYV  
Sbjct: 101 LDERGNIYIADTMNMAIRKISDTGVTTIAGGRWNQGSGHIDGPSEDAKFSNDFDVVYVGS 160

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I LN DDC  QY       ++++V A L GY+  +LQ+     F
Sbjct: 161 SCSLLVIDRGNKAIREIELNYDDCNTQYADSLNLGVVLLVAAGLFGYLLALLQRRVQAMF 220

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMK-------------EEPGWPSFGQLI 166
                  S+ E   +  +K         P ++S++             EE  + S G+L 
Sbjct: 221 ------SSQKEIRSQQMMKATPVAPYQRPPLKSVRPSLIPSEDEPEKLEEGFFGSLGRLF 274

Query: 167 IDLSKLALEAMGSMFLNF--------VPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQ 218
           ++      +  G +   F        +  +F+      N  PL+++  +P DE  PP + 
Sbjct: 275 VNSGSSMADIFGGLLSGFRRKPLNHQIHQQFQPVNRHPNAWPLQESFVIP-DEDEPPSI- 332

Query: 219 RQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKH--RSSKRQEYAEFYG 276
                       + +TP     YP M     +S  FK ++ +     +   +Q++ + + 
Sbjct: 333 ------------ETKTPTIKKTYPFMTQDLDRSHQFKPNNSNNNSXIQHHHQQQHIQHHH 380

Query: 277 SGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEP---KPVEMKPVDYDNLKFDHYN 333
             +    + +  S    T + +  ++ E+V+G    +    + + +K VDY + +++H+N
Sbjct: 381 HQQQQQYHHRQYSAGPTTYYEKSCETNEIVFGAVQEQDGRREAMVIKAVDYGDPRYNHHN 440

Query: 334 MRSKY 338
           +R++Y
Sbjct: 441 IRARY 445


>gi|356501873|ref|XP_003519748.1| PREDICTED: uncharacterized protein LOC100797570 [Glycine max]
          Length = 507

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 122/369 (33%), Positives = 179/369 (48%), Gaps = 53/369 (14%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D++GN+Y+AD +N+AIRKI D+GVTTIAGGK S   G  DGPSE+AKFSNDFDVVY+  
Sbjct: 150 VDNRGNIYIADIMNMAIRKISDSGVTTIAGGKWSRGGGHIDGPSEEAKFSNDFDVVYIGS 209

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY +     I M+VGA   GY+  +LQ   G   
Sbjct: 210 SCSLLVIDRGNQAIREIQLHFDDCAYQYENGLTLGIAMLVGAGFFGYMLALLQHRLGTIV 269

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVES---MKEEPGWPSFGQLIIDLSKLALEA 176
            S+    + S  +     K L +   P+   E     +EE    S G+L+ +     +E 
Sbjct: 270 ASQ-DGAAMSGISPSPYQKPLKSVRPPLIPSEDESDKQEESFIRSIGKLLTNAGASVVEI 328

Query: 177 MGSMFLNF------VPFR----FRSSGTKGNLTPLRDTLRMPEDEAAPPVV--------- 217
           MG++F  F        F+    F+    + N  P++++  +P DE  PP +         
Sbjct: 329 MGALFPGFRKKPQSYEFQSQPLFQQPQKQVNAWPVQESFVIP-DEDEPPSIDPRSPTPRK 387

Query: 218 ------QRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEY 271
                 Q  +    L ++H   +   GD     +    +    +       HR       
Sbjct: 388 TYAFMSQDAEKMPQLWQSHAFYSGWGGDLQQQQQLKLQQQQQQQQQQQLKHHR------- 440

Query: 272 AEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGELVYG-VSGMEPK--PVEMKPVDYDNLK 328
              Y S  VP  Y +          +  ++  E+V+G V   E K   V +KPVDY    
Sbjct: 441 -HLYHS-SVPHTYYE----------QSHEEINEIVFGAVQEQEGKKETVVIKPVDYGQSL 488

Query: 329 FDHYNMRSK 337
           +DH+ +R +
Sbjct: 489 YDHHYIRPR 497


>gi|357485963|ref|XP_003613269.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
 gi|355514604|gb|AES96227.1| hypothetical protein MTR_5g034750 [Medicago truncatula]
          Length = 589

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 110/263 (41%), Positives = 142/263 (53%), Gaps = 39/263 (14%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVAD +N+AIRKI D+GVTTIAGGK S   G  DGPSE+AKFSNDFDVVYV  
Sbjct: 182 VDDRGNIYVADIMNMAIRKISDSGVTTIAGGKLSRGGGHVDGPSEEAKFSNDFDVVYVGS 241

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L  DDC YQY S  P  I M++GA   GY+  +LQ+      
Sbjct: 242 SCSLLVIDRGNQAIREIQLRFDDCAYQYESGFPLGIAMLLGAGFFGYMLALLQRRLSTIV 301

Query: 120 FSRTQQPSE----SEFNEETEIKELSNKEKP-IPIV-ESMKEEPG-WPSFGQLIIDLSKL 172
            S+    +E    S+F+     K L +   P IP   ES K+E G + S G+L+ +    
Sbjct: 302 ASQDMTLAESSAMSDFSPSPYQKPLKSVRPPLIPSEDESYKQEEGLFASIGKLLTNAGAS 361

Query: 173 ALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQV 232
            +E MG        FR +    +    PL      PE         RQ +A P+ E+  +
Sbjct: 362 VVEIMG--------FRKKPQSYEFQSQPL---FHQPE---------RQINAWPVQESFVI 401

Query: 233 -----------RTPGTGDKYPDM 244
                      RTP     YP M
Sbjct: 402 TNEDEPPSIDPRTPTPKKTYPFM 424


>gi|115448467|ref|NP_001048013.1| Os02g0730400 [Oryza sativa Japonica Group]
 gi|46390465|dbj|BAD15926.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|46390861|dbj|BAD16365.1| putative NHL repeat-containing protein [Oryza sativa Japonica
           Group]
 gi|113537544|dbj|BAF09927.1| Os02g0730400 [Oryza sativa Japonica Group]
 gi|125541002|gb|EAY87397.1| hypothetical protein OsI_08804 [Oryza sativa Indica Group]
 gi|125583566|gb|EAZ24497.1| hypothetical protein OsJ_08258 [Oryza sativa Japonica Group]
 gi|215715228|dbj|BAG94979.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 508

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 90/232 (38%), Positives = 131/232 (56%), Gaps = 15/232 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+YVAD +N+AIRKI D GVTTIAGGKS+  G  DGPS+DAKFS DF+V Y+  +
Sbjct: 156 VDGRGNIYVADAMNMAIRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIGSS 215

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I LN DDC YQY +  P  + +++ A   GY+  +LQ+       
Sbjct: 216 CSLLVIDRGNQAIREIQLNFDDCVYQYEAGFPLGVAVLLAAAFFGYMLALLQRRV--LGM 273

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPI------PIV----ESMK---EEPGWPSFGQLII 167
             T+   E++   +  I  +   +KP+      P++    ES K   EE  + S G+LI+
Sbjct: 274 VSTEDEPETQTPLKASIASIPPYQKPLKQSVRPPLIPNEDESEKQEVEEGFFTSIGKLIV 333

Query: 168 DLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
                  E +G+ F             +    P++++  +P DE  PP+  R
Sbjct: 334 GAKSSVAEIVGAAFSRKKRLNIHQQQARVRSWPVQESYAIPRDETPPPLDTR 385


>gi|255560011|ref|XP_002521024.1| conserved hypothetical protein [Ricinus communis]
 gi|223539861|gb|EEF41441.1| conserved hypothetical protein [Ricinus communis]
          Length = 500

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 99/254 (38%), Positives = 142/254 (55%), Gaps = 24/254 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+Y+AD +N+AIRKI DAGVTTIAGGK    G   DG SEDAKFSNDFDVVY+  
Sbjct: 158 VDDRGNIYIADAMNMAIRKISDAGVTTIAGGKWGRGGSHVDGASEDAKFSNDFDVVYIGS 217

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P  + ++V A   GY+  +LQ+  G   
Sbjct: 218 SCSLLVIDRGNRAIREIQLHFDDCAYQYESGFPLGVAVLVAAGFFGYMLALLQRRVGKIV 277

Query: 120 FSRTQQPS-ESEFNEETEIKELSNKEKPIPIVESMK---EEPGWPSFGQLIIDLSKLALE 175
            S+  + + ++  +     K L +   P+   E  +   EE  + S G+L  +     +E
Sbjct: 278 SSQNDRDAMKTSISGSPYQKPLRSVRPPLIPTEDEQEKHEEGFFGSLGKLFANAGACVVE 337

Query: 176 AMGSMFLNF----VPFRFRSSGTKGNLT-PLRDTLRMPEDEAAPPVVQRQKSAVPLSETH 230
            +G +   F    + +++ S   K + T P++D+  +P DE  PP +             
Sbjct: 338 ILGGIVPGFRKKPLNYQYLSQQQKHSSTWPVQDSFVIP-DEDEPPSI------------- 383

Query: 231 QVRTPGTGDKYPDM 244
           + RTP     YP M
Sbjct: 384 ETRTPTPKKTYPFM 397


>gi|255552640|ref|XP_002517363.1| conserved hypothetical protein [Ricinus communis]
 gi|223543374|gb|EEF44905.1| conserved hypothetical protein [Ricinus communis]
          Length = 494

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 120/358 (33%), Positives = 176/358 (49%), Gaps = 41/358 (11%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D++GN+Y+ADT+N+AIRKI D GVTTIAGGK +   G  DGPSEDAKFSNDFD+ Y+  
Sbjct: 149 VDNRGNIYIADTMNMAIRKISDGGVTTIAGGKWTRAGGHVDGPSEDAKFSNDFDIAYIGS 208

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I LN DDC  QY+      I ++V A   GY+  +LQ      F
Sbjct: 209 SCSLLVIDRGNQAIREIQLNDDDCNNQYDGTFHLGIAVLVAAAFFGYMLALLQWRVQTLF 268

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPI--PIVESMKEEPGWPSFGQLIIDLSKLALEAM 177
             R      +     T +       KP+  P+V S  +EP  P  G     L KL L  +
Sbjct: 269 SPRND--PRTYMKNGTPMAPYQRPPKPVRPPLVPS-DDEPDKPDEG-FFGSLGKLVLGGL 324

Query: 178 GSMF----LNFVPFRFRSSGTK-GNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLSETHQV 232
            S F    L++   +      K  N  P++++  +P DE  PP +             + 
Sbjct: 325 FSGFRRKPLHYQFQQQYQQQLKHSNPWPMQESFVIP-DEDEPPSL-------------ET 370

Query: 233 RTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKSQKE 292
           RTP +   YP M     K   FK SS    + +    +Y +              K Q+ 
Sbjct: 371 RTPTSKKAYPSMTNGVEKHYQFKQSS---GYYNGWDGDYHQQQQMQMQVQMQQHQKQQQH 427

Query: 293 RTRHRQRD---------KSGELVYGVSGMEP---KPVEMKPVDYDNLKFDHYNMRSKY 338
             RH   +         ++ E+V+G    +    + V +K VDY + +++H+N+R ++
Sbjct: 428 HHRHYTSNPKTYYEKSCETNEIVFGAVQEQDGRREAVVIKAVDYADPRYNHHNIRPRF 485


>gi|224108107|ref|XP_002314724.1| predicted protein [Populus trichocarpa]
 gi|222863764|gb|EEF00895.1| predicted protein [Populus trichocarpa]
          Length = 487

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 120/356 (33%), Positives = 177/356 (49%), Gaps = 48/356 (13%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+Y+ADT+N+AIRKI DAGVTTIAGGK       DG SEDAKFSNDFDV+Y+  +
Sbjct: 156 VDDRGNIYIADTMNMAIRKISDAGVTTIAGGKWGRGSHVDGASEDAKFSNDFDVLYIGSS 215

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY S  P  I +++ A   GY+  +LQ+  G    
Sbjct: 216 CSLLVIDRGNQAIREIQLHFDDCAYQYGSGFPLGIAVLLAAGFFGYMLALLQRRVGMIV- 274

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPI-----PIV----ESMKEEPG-WPSFGQLIIDLS 170
                   S  N   E     N  +PI     P++    E  K E G + S G+L I+  
Sbjct: 275 --------SPQNVAMETSTTGNPYQPIKSFRPPLIPTEDEQEKHEEGLFGSLGKLFINTW 326

Query: 171 KLALEAMGSMFLNFV---PFRFRSSGTKGNLT-PLRDTLRMPEDEAAPPVVQRQKSAVPL 226
               E +G +  +F    P     +  + + + P++D+  +P DE  PP           
Sbjct: 327 ASIAEILGGIVPSFRKKPPNYQYQNYQQQSTSWPVQDSFVIP-DEDEPP----------- 374

Query: 227 SETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPY-S 285
             + + RTP     YP M     K   ++ S      RS       +F    +    Y  
Sbjct: 375 --STETRTPTPRKTYPFMSKDTEKMHQWRQS------RSIYSGWDGDFQQQQQQKQQYHH 426

Query: 286 KSKSQKERTRHRQR-DKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSK 337
           + +S    T + Q  +K+ E+V+G      ++     + PV+Y +   + +N+RS+
Sbjct: 427 RYQSSTPHTYYEQSYEKTNEIVFGAVQEQDVKHGTTVILPVNYGDPIHNRHNIRSR 482


>gi|449492692|ref|XP_004159073.1| PREDICTED: uncharacterized protein LOC101226879 [Cucumis sativus]
          Length = 516

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 185/377 (49%), Gaps = 57/377 (15%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV-AGFRDGPSEDAKFSNDFDVVYVRP 59
           +D++GN+Y+ADT+N+AIRKI D GVTTIAGG+ N  +G  DGPSEDAKFSNDFDVVYV  
Sbjct: 149 LDERGNIYIADTMNMAIRKISDTGVTTIAGGRWNQGSGHIDGPSEDAKFSNDFDVVYVGS 208

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I LN DDC  QY       ++++V A L GY+  +LQ+     F
Sbjct: 209 SCSLLVIDRGNKAIREIELNYDDCNTQYADSLNLGVVLLVAAGLFGYLLALLQRRVQAMF 268

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMK-------------EEPGWPSFGQLI 166
            S+  Q    E   +  +K         P ++S++             EE  + S G+L 
Sbjct: 269 SSQKDQ----EIRSQQMMKATPVAPYQRPPLKSVRPSLIPSEDEPEKLEEGFFGSLGRLF 324

Query: 167 IDLSKLALEAMGSMFLNF--------VPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQ 218
           ++      +  G +   F        +  +F+      N  PL+++  +P DE  PP + 
Sbjct: 325 VNSGSSMADIFGGLLSGFRRKPLNHQIHQQFQPVNRHPNAWPLQESFVIP-DEDEPPSI- 382

Query: 219 RQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFK--------------DSSLSTKHR 264
                       + +TP     YP M     +S  FK                    + +
Sbjct: 383 ------------ETKTPTIKKTYPFMTQDLDRSHQFKPNRSYFSGWDGEFHQQQQQQQIQ 430

Query: 265 SSKRQEYAEFYGSGEVPPPYSKSKSQKERTRHRQRDKSGELVYGVSGMEP---KPVEMKP 321
              +Q++ + +   +    + +  S    T + +  ++ E+V+G    +    + + +K 
Sbjct: 431 HHHQQQHIQHHHHQQQQQYHHRQYSAGPTTYYEKSCETNEIVFGAVQEQDGRREAMVIKA 490

Query: 322 VDYDNLKFDHYNMRSKY 338
           VDY + +++H+N+R++Y
Sbjct: 491 VDYGDPRYNHHNIRARY 507


>gi|218197942|gb|EEC80369.1| hypothetical protein OsI_22479 [Oryza sativa Indica Group]
          Length = 517

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 107/354 (30%), Positives = 166/354 (46%), Gaps = 33/354 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS+DAKFS DF++ Y+  +
Sbjct: 161 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSS 220

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I+L+ DDCEYQY +  P  I ++  A   GY+  +LQ+       
Sbjct: 221 CSLLVIDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQ----VL 276

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPI------PIV-------ESMKEEPGWPSFGQLII 167
                  E +      I  +   +KP+      P++       +   EE  + S G+LI 
Sbjct: 277 GMVSTADEPQTPPRPSIASIPPYQKPLKPSLRPPLIPTEDQAGKHEAEEGFFTSIGKLIG 336

Query: 168 DLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS 227
                A+E           +       + N  P++++  +P DE  PP+  R        
Sbjct: 337 GAKSSAVEIFSRKKRPTHQYHHHLQQQRANPWPVQESYAIPHDETPPPLDMR-------- 388

Query: 228 ETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKS 287
                 TP     +   +P K+          +        Q+  E     +      + 
Sbjct: 389 ----AATPRKNYAFMTKEPEKVHHVCHGRPYFNGWDMQHAPQQQPEQQMYHQQHLQQHRQ 444

Query: 288 KSQKERTRHRQR-DKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSK 337
            S   +T + Q  + + E+V+G       + + VE+K V+Y +  ++ Y MR +
Sbjct: 445 YSAGPQTFYEQSCETTNEIVFGAVQEVDTKRRMVEIKAVNYGDTFYEQYGMRYR 498


>gi|115467514|ref|NP_001057356.1| Os06g0269300 [Oryza sativa Japonica Group]
 gi|53793193|dbj|BAD54399.1| NHL repeat-containing protein-like [Oryza sativa Japonica Group]
 gi|113595396|dbj|BAF19270.1| Os06g0269300 [Oryza sativa Japonica Group]
 gi|215768552|dbj|BAH00781.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222635365|gb|EEE65497.1| hypothetical protein OsJ_20925 [Oryza sativa Japonica Group]
          Length = 517

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/354 (30%), Positives = 166/354 (46%), Gaps = 33/354 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS+DAKFS DF++ Y+  +
Sbjct: 161 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSMRGGHLDGPSDDAKFSTDFEIRYISSS 220

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I+L+ DDCEYQY +  P  I ++  A   GY+  +LQ+       
Sbjct: 221 CSLLVIDRGNQAIREIALHNDDCEYQYEAGFPLGIALLFAAGFFGYMLALLQRQ----VL 276

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPI------PIV-------ESMKEEPGWPSFGQLII 167
                  E +      I  +   +KP+      P++       +   EE  + S G+LI 
Sbjct: 277 GMVSTADEPQTPPRPSIASIPPYQKPLKPSLRPPLIPTEDQAGKHEAEEGFFTSIGKLIG 336

Query: 168 DLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS 227
                A+E           +       + N  P++++  +P DE  PP+  R        
Sbjct: 337 GAKSSAVEIFSRKKRPTHQYHHHLQQQRANPWPVQESYAIPHDETPPPLDMR-------- 388

Query: 228 ETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKS 287
                 TP     +   +P K+          +        Q+  E     +      + 
Sbjct: 389 ----AATPRKNYAFMTKEPEKVHHVRHGRPYFNGWDMQHAPQQQPEQQMYHQQHLQQHRQ 444

Query: 288 KSQKERTRHRQR-DKSGELVYGV---SGMEPKPVEMKPVDYDNLKFDHYNMRSK 337
            S   +T + Q  + + E+V+G       + + VE+K V+Y +  ++ Y MR +
Sbjct: 445 YSAGPQTFYEQSCETTNEIVFGAVQEVDTKRRMVEIKAVNYGDTFYEQYGMRYR 498


>gi|297845436|ref|XP_002890599.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336441|gb|EFH66858.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 156/323 (48%), Gaps = 34/323 (10%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT+N AIRKI +AGVTTIAGGK  +  G  DGPSEDAKFSNDFDVVY+  
Sbjct: 152 VDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVHGGGHVDGPSEDAKFSNDFDVVYLGS 211

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC  QY S  P  I ++V A+  GY+  +LQ+      
Sbjct: 212 SCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGIAVLVAAIFFGYMLALLQRRLSSII 271

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSFGQLIIDLSKLALEAMGS 179
              T Q       ++  IK +     P    +  +EE    +    I ++   ++E    
Sbjct: 272 SYHTDQEVFEAVPDQDPIKPVRPPLIPTGDEQEKQEESFLGTLQIFISNVWVFSVELFSG 331

Query: 180 MFLNF-----VPFRFRSSGTKGNL-----TPLRDTLRMPEDEAAPPVVQRQKSAVPLSET 229
           MF        V F F    TK +       P++++  +   +  PP+             
Sbjct: 332 MFPGLRKKQTVGFNFNHQETKHSAFSTTSWPIQESFVIHNKDEPPPI------------- 378

Query: 230 HQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYGSGEVPPPYSKSKS 289
            + R P  G  YP M     K A+ K   L  + R+  R   AEF    +        + 
Sbjct: 379 -ESRNPTPGKIYPFMS----KDAAEKMQQLR-QSRALYRSLDAEFLQEQQQQKHQQHHRR 432

Query: 290 QKERTRH----RQRDKSGELVYG 308
                 H    R  +KS E+V+G
Sbjct: 433 HHATIPHTVYERSSEKSNEIVFG 455


>gi|30698020|ref|NP_177185.3| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332196920|gb|AEE35041.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 509

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 75/126 (59%), Positives = 90/126 (71%), Gaps = 1/126 (0%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT+N AIRKI + GVTTIAGGK+    G  DGPSEDAKFSNDFDVVYV  
Sbjct: 150 VDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGS 209

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P  I ++V A   GY+  +LQ+  G   
Sbjct: 210 SCSLLVIDRGNKAIREIQLHFDDCAYQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIV 269

Query: 120 FSRTQQ 125
            S   Q
Sbjct: 270 SSHNDQ 275


>gi|30698018|ref|NP_850974.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|17065224|gb|AAL32766.1| Unknown protein [Arabidopsis thaliana]
 gi|21387163|gb|AAM47985.1| unknown protein [Arabidopsis thaliana]
 gi|332196919|gb|AEE35040.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 447

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 75/126 (59%), Positives = 90/126 (71%), Gaps = 1/126 (0%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT+N AIRKI + GVTTIAGGK+    G  DGPSEDAKFSNDFDVVYV  
Sbjct: 88  VDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGS 147

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P  I ++V A   GY+  +LQ+  G   
Sbjct: 148 SCSLLVIDRGNKAIREIQLHFDDCAYQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIV 207

Query: 120 FSRTQQ 125
            S   Q
Sbjct: 208 SSHNDQ 213


>gi|297841763|ref|XP_002888763.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334604|gb|EFH65022.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 503

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 75/126 (59%), Positives = 90/126 (71%), Gaps = 1/126 (0%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT+N AIRKI + GVTTIAGGK+    G  DGPSEDAKFSNDFDVVYV  
Sbjct: 150 VDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGS 209

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P  I ++V A   GY+  +LQ+  G   
Sbjct: 210 SCSLLVIDRGNKAIREIQLHFDDCAYQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIV 269

Query: 120 FSRTQQ 125
            S   Q
Sbjct: 270 SSHNDQ 275


>gi|22329761|ref|NP_173800.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|18700091|gb|AAL77657.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
 gi|20856013|gb|AAM26643.1| At1g23880/T23E23_8 [Arabidopsis thaliana]
 gi|332192324|gb|AEE30445.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 545

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 97/237 (40%), Positives = 130/237 (54%), Gaps = 19/237 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT+N AIRKI +AGVTTIAGGK     G  DGPSEDAKFSNDFDVVY+  
Sbjct: 212 VDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVYLGS 271

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ DDC  QY S  P  I ++V AV  GY+  +LQ+      
Sbjct: 272 SCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGIAVLVAAVFFGYMLALLQRRLSSIV 331

Query: 120 FSRTQQPSESEFNEETEIKELSNKEKPIPIVESMKEEPGWPSF-GQLIIDLSK---LALE 175
              T Q       ++  IK +    +P  I+   ++E    SF G L I +S     ++E
Sbjct: 332 SYHTDQEVFEAVPDQDPIKPV----RPPLILTGDEQEKQEESFLGTLQIFISNAWVFSVE 387

Query: 176 AMGSMF----------LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKS 222
               MF          LNF     + S       P++++  +   +  PPV  R  +
Sbjct: 388 LFSGMFPGLRKKQTVGLNFNHQETKHSAFSTTSWPIQESFVIHNKDEPPPVESRNAT 444


>gi|296087695|emb|CBI34951.3| unnamed protein product [Vitis vinifera]
          Length = 811

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/271 (37%), Positives = 142/271 (52%), Gaps = 28/271 (10%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           MD++GN+Y+ADT+N+AIRKI DAGVTTIAGGK    G   DGPSEDAKFSNDFDVVY+  
Sbjct: 149 MDERGNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGS 208

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ +DC YQYN      I ++V A   GY+  +LQ+     F
Sbjct: 209 SCSLLVIDRGNQAIREIQLHYEDCAYQYNGSFHLGIAVLVAAGFFGYMLALLQRRVAAMF 268

Query: 120 FSR--TQQPSESEFNEETEIKELSNKEKPIPIVESMKE---EPGWPSFGQLIIDLSKLAL 174
            S+  +  P +     E+  + L +   P+   E   E   E  + S G+L ++      
Sbjct: 269 SSQYDSSTPMKKGMPPESYQRPLKSVRAPLIPTEDEYEKADEGFFGSLGRLFLNTGSTLA 328

Query: 175 EAMGSMFLN--------FVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPL 226
           E  G +F           +  ++       N  P++++  +P DE  PP ++        
Sbjct: 329 EIFGGLFSGSRKKPPHQQIQQQYGQPNVHSNGWPMQESYVIP-DEDEPPSIES------- 380

Query: 227 SETHQVRTPGTGDKYPDMKPPKIKSASFKDS 257
                 R P     YP M P   K+  F+ S
Sbjct: 381 ------RAPTPKKTYPFMTPEMEKTHHFRQS 405


>gi|242095458|ref|XP_002438219.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
 gi|241916442|gb|EER89586.1| hypothetical protein SORBIDRAFT_10g009760 [Sorghum bicolor]
          Length = 507

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/231 (38%), Positives = 123/231 (53%), Gaps = 17/231 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS+DAKFS DF++ Y+  +
Sbjct: 159 VDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSIRGGHIDGPSDDAKFSTDFEIQYISSS 218

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I LN DDC YQY +  P    ++  A   GY+  +LQ+      F
Sbjct: 219 CSLLVIDRGNQAIREIPLNDDDCAYQYETGFPLGFALLCAAGFFGYMLALLQRR----LF 274

Query: 121 SRTQQPSESEFNEETEIKELSNKEKP---------IPIVESMK---EEPGWPSFGQLIID 168
                  E +      I  +   +KP         IP  E+ K   EE  + S G+LI  
Sbjct: 275 GMASTTDEPQAPPRPSIASIPPYQKPLNPYVRQPFIPREETAKQETEEGFFTSAGKLIGG 334

Query: 169 LSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
                 E  G +    +   +     + N  P++++  +P DE  P +  R
Sbjct: 335 AKSSVAEIFG-LKKKRLSNPYHQQQRRANPWPVQESYAIPHDEHPPALDTR 384


>gi|225452518|ref|XP_002274991.1| PREDICTED: uncharacterized protein LOC100266244 [Vitis vinifera]
          Length = 677

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/270 (37%), Positives = 142/270 (52%), Gaps = 29/270 (10%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           MD++GN+Y+ADT+N+AIRKI DAGVTTIAGGK    G   DGPSEDAKFSNDFDVVY+  
Sbjct: 149 MDERGNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGS 208

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFF 119
           +CSLLVIDRGN A+R+I L+ +DC YQYN      I ++V A   GY+  +LQ+     F
Sbjct: 209 SCSLLVIDRGNQAIREIQLHYEDCAYQYNGSFHLGIAVLVAAGFFGYMLALLQRRVAAMF 268

Query: 120 FSR-TQQPSESEFNEETEIKELSNKEKPIPIVESMKE---EPGWPSFGQLIIDLSKLALE 175
            S+    P  + F  ++  + L +   P+   E   E   E  + S G+L ++      E
Sbjct: 269 SSQYVSDP--AFFTLQSYQRPLKSVRAPLIPTEDEYEKADEGFFGSLGRLFLNTGSTLAE 326

Query: 176 AMGSMFLN--------FVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAVPLS 227
             G +F           +  ++       N  P++++  +P DE  PP ++         
Sbjct: 327 IFGGLFSGSRKKPPHQQIQQQYGQPNVHSNGWPMQESYVIP-DEDEPPSIES-------- 377

Query: 228 ETHQVRTPGTGDKYPDMKPPKIKSASFKDS 257
                R P     YP M P   K+  F+ S
Sbjct: 378 -----RAPTPKKTYPFMTPEMEKTHHFRQS 402


>gi|195621162|gb|ACG32411.1| hypothetical protein [Zea mays]
 gi|219884599|gb|ACL52674.1| unknown [Zea mays]
 gi|413944257|gb|AFW76906.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
          Length = 508

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 66/113 (58%), Positives = 83/113 (73%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS+DAKFS DF++ Y+  +
Sbjct: 156 VDDRGNIYVADGMNMAIRKISDTGVTTIAGGKSIRGGHIDGPSDDAKFSTDFEIKYISSS 215

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQ 113
           CSLLVIDRGN A+R+I LN DDC YQY +  P    ++  A   GY+  MLQ+
Sbjct: 216 CSLLVIDRGNQAIREIPLNDDDCAYQYEAGFPLGFALLCAAGFFGYMLAMLQR 268


>gi|357118152|ref|XP_003560822.1| PREDICTED: uncharacterized protein LOC100834977 [Brachypodium
           distachyon]
          Length = 544

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 68/128 (53%), Positives = 89/128 (69%), Gaps = 1/128 (0%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS+DAKFS DF++ Y+  +
Sbjct: 162 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSVRGGHTDGPSDDAKFSTDFEIRYISSS 221

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQG-FGPFF 119
           CSLLVIDRGN A+R+I L  DDCEYQ+ +  P  + ++  A   GY+  +LQ+  FG   
Sbjct: 222 CSLLVIDRGNQAIREIPLQLDDCEYQHEAGFPLGVALLFAAGFFGYMLALLQRRVFGMVS 281

Query: 120 FSRTQQPS 127
            +    PS
Sbjct: 282 TTEEHHPS 289


>gi|242066098|ref|XP_002454338.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
 gi|241934169|gb|EES07314.1| hypothetical protein SORBIDRAFT_04g029020 [Sorghum bicolor]
          Length = 518

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 91/233 (39%), Positives = 130/233 (55%), Gaps = 15/233 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS++AKFS DF+V YV  +
Sbjct: 158 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYVGSS 217

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY +  P  + +++ A   GY+  +LQ+     + 
Sbjct: 218 CSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMYS 277

Query: 121 SRTQQPSESEFNEETEI------KELSNKEKP--IPI----VESMKEEPGWPSFGQLIID 168
           +  +Q  ES    +         K L    +P  IP     V+  +EE  + S G+L+  
Sbjct: 278 NGDEQEHESPVKAKLSSIPPPYQKPLKPSLRPPLIPTDDEPVKQEEEEGLFTSIGKLVGG 337

Query: 169 LSKLALEAMGSMF--LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
                 E +G+ F     V       G   +  P++++  +P DE  PP+  R
Sbjct: 338 AKSSIAEIIGAAFSRKKRVNIHHHQLGRPTSW-PVQESYAIPRDETPPPLDTR 389


>gi|326512916|dbj|BAK03365.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529495|dbj|BAK04694.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529987|dbj|BAK08273.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 503

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 89/237 (37%), Positives = 125/237 (52%), Gaps = 29/237 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+YVAD +N+A+RKI D GVTTIAGGKS+  G  DGPS+DAKFS DF+V Y+  +
Sbjct: 153 VDGRGNIYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASS 212

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I LN DDC YQY +  P  + +++ A   GY+  +LQ+       
Sbjct: 213 CSLLVIDRGNQAIREIQLNFDDCVYQYEAGFPLGVALLLAAGFFGYMLALLQR------- 265

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMK-------------------EEPGWPS 161
            R      +E   +T +K       P  I + +K                   EE  + S
Sbjct: 266 -RVLGMVSTEDELQTPMKASIASIPPYQIQKPLKPSLRPPLIPNEDESEKPEVEEGFFTS 324

Query: 162 FGQLIIDLSKLALEAMGSMF-LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVV 217
            G+LI        E +G+ F     P        +    P++++  +P DE  PPVV
Sbjct: 325 LGKLIGGAKSSVAEIVGAAFSRKKRPSVHHYQQGRSGSWPVQESYAIPRDE-TPPVV 380


>gi|293331401|ref|NP_001168100.1| uncharacterized protein LOC100381837 precursor [Zea mays]
 gi|223946007|gb|ACN27087.1| unknown [Zea mays]
 gi|413938717|gb|AFW73268.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
          Length = 510

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 91/233 (39%), Positives = 130/233 (55%), Gaps = 15/233 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS++AKFS DF+V Y+  +
Sbjct: 157 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSS 216

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY +  P  + +++ A   GY+  +LQ+     + 
Sbjct: 217 CSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMYS 276

Query: 121 SRTQQPSESEFNEE-TEI-----KELSNKEKPIPI------VESMKEEPGWPSFGQLIID 168
           +  +Q  ES    + T I     K L     P+ I      V+  +EE  + S G+L+  
Sbjct: 277 NGDEQDHESPVKAKLTSIPPPYQKPLKPYVHPLLIPNDDEPVKQEEEEGLFTSIGKLVGG 336

Query: 169 LSKLALEAMGSMF--LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
                 E +G+ F     V       G   +  P +++  +P DE  PP+  R
Sbjct: 337 AKSSIAEIIGAAFSRKKRVNVHHHQLGRPTSW-PAQESYAIPRDETPPPLDTR 388


>gi|413938718|gb|AFW73269.1| hypothetical protein ZEAMMB73_298159 [Zea mays]
          Length = 511

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 91/233 (39%), Positives = 130/233 (55%), Gaps = 15/233 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS++AKFS DF+V Y+  +
Sbjct: 158 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSS 217

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY +  P  + +++ A   GY+  +LQ+     + 
Sbjct: 218 CSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMYS 277

Query: 121 SRTQQPSESEFNEE-TEI-----KELSNKEKPIPI------VESMKEEPGWPSFGQLIID 168
           +  +Q  ES    + T I     K L     P+ I      V+  +EE  + S G+L+  
Sbjct: 278 NGDEQDHESPVKAKLTSIPPPYQKPLKPYVHPLLIPNDDEPVKQEEEEGLFTSIGKLVGG 337

Query: 169 LSKLALEAMGSMF--LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
                 E +G+ F     V       G   +  P +++  +P DE  PP+  R
Sbjct: 338 AKSSIAEIIGAAFSRKKRVNVHHHQLGRPTSW-PAQESYAIPRDETPPPLDTR 389


>gi|293332279|ref|NP_001170567.1| uncharacterized protein LOC100384592 precursor [Zea mays]
 gi|238006082|gb|ACR34076.1| unknown [Zea mays]
          Length = 505

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 90/235 (38%), Positives = 125/235 (53%), Gaps = 23/235 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS--EDAKFSNDFDVVYVR 58
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G+ D PS  +DAKFS DF+V Y+ 
Sbjct: 159 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSIRGGYIDEPSVSDDAKFSTDFEVQYIS 218

Query: 59  PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGF--G 116
            TCSLLVIDRGN A+R+I LN DDC YQY +  P    ++  A   GY+  +LQ      
Sbjct: 219 STCSLLVIDRGNQAIREIPLNDDDCAYQYEAGFPLGFALLCAAGFFGYMLALLQHRLLGM 278

Query: 117 PFFFSRTQQPSESEFNEETEIKELSNKEKP---------IPIVESMKEEPG---WPSFGQ 164
           P   ++ Q P          I  +S  +KP         IP  E+ ++EP    + S G+
Sbjct: 279 PSTINKPQAP------PRPSIASISPYQKPLNPYVRQPFIPREETARQEPEEGFFTSAGK 332

Query: 165 LIIDLSKLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
           L+        E  G      +   +     + N  P++++  +P DE  P +  R
Sbjct: 333 LVGGAKSSVAEVFG-FKKKSLSNPYHQQQRRANPWPVQESYAIPHDEPPPALDTR 386


>gi|413923861|gb|AFW63793.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
          Length = 516

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 89/233 (38%), Positives = 131/233 (56%), Gaps = 16/233 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS++AKFS DF+V Y+  +
Sbjct: 158 VDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSS 217

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY +  P  + +++ A   GY+  +LQ+     + 
Sbjct: 218 CSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMYS 277

Query: 121 SRTQQPSESEFNEETEI------KELSNKEKPIPIVES------MKEEPGWPSFGQLIID 168
           +  +Q  ES    +         K L +  +P P++ +       +EE  + S G+L+  
Sbjct: 278 NGDEQEDESPVKAKLSSIPPPYQKPLKSSLRP-PLIPNDDEPVKQEEEGLFTSIGKLVGV 336

Query: 169 LSKLALEAMGSMFL--NFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
                 E +G+ F     V       G   +  P++++  +P DE  PP+  R
Sbjct: 337 ARSSVAEIVGATFSRKKRVNVHHHQLGRPTSW-PVQESYAIPRDETPPPLDTR 388


>gi|3176691|gb|AAC18814.1| Contains homology to serine/threonine protein kinase gb|X99618 from
           Mycobacterium tuberculosis. ESTs gb|F14403, gb|F14404,
           and gb|N96730 come from this gene [Arabidopsis thaliana]
          Length = 493

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/137 (54%), Positives = 90/137 (65%), Gaps = 12/137 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT+N AIRKI + GVTTIAGGK+    G  DGPSEDAKFSNDFDVVYV  
Sbjct: 123 VDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTVRNGGHVDGPSEDAKFSNDFDVVYVGS 182

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD-----------ILMVVGAVLVGYVT 108
           +CSLLVIDRGN A+R+I L+ DDC YQY S  P             I ++V A   GY+ 
Sbjct: 183 SCSLLVIDRGNKAIREIQLHFDDCAYQYGSGFPLGKLTKVPNFFLWIAVLVAAGFFGYML 242

Query: 109 CMLQQGFGPFFFSRTQQ 125
            +LQ+  G    S   Q
Sbjct: 243 ALLQRRVGSIVSSHNDQ 259


>gi|413923863|gb|AFW63795.1| hypothetical protein ZEAMMB73_706161 [Zea mays]
          Length = 515

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 89/233 (38%), Positives = 131/233 (56%), Gaps = 16/233 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS++AKFS DF+V Y+  +
Sbjct: 157 VDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSS 216

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY +  P  + +++ A   GY+  +LQ+     + 
Sbjct: 217 CSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQRRVLGMYS 276

Query: 121 SRTQQPSESEFNEETEI------KELSNKEKPIPIVES------MKEEPGWPSFGQLIID 168
           +  +Q  ES    +         K L +  +P P++ +       +EE  + S G+L+  
Sbjct: 277 NGDEQEDESPVKAKLSSIPPPYQKPLKSSLRP-PLIPNDDEPVKQEEEGLFTSIGKLVGV 335

Query: 169 LSKLALEAMGSMFL--NFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
                 E +G+ F     V       G   +  P++++  +P DE  PP+  R
Sbjct: 336 ARSSVAEIVGATFSRKKRVNVHHHQLGRPTSW-PVQESYAIPRDETPPPLDTR 387


>gi|356573394|ref|XP_003554846.1| PREDICTED: uncharacterized protein LOC100793916 [Glycine max]
          Length = 528

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 98/273 (35%), Positives = 146/273 (53%), Gaps = 30/273 (10%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+Y+ADTLN+AIRKI D GVTTIAGGK   AG   DGPSEDAKFSNDFDVVYV  
Sbjct: 154 VDDRGNIYIADTLNMAIRKISDEGVTTIAGGKRGYAGGHVDGPSEDAKFSNDFDVVYVGS 213

Query: 60  TCSLLVIDRGNAALRQISLNQDDC---EYQYNSISPTDILMVVGAVLVGYVTCMLQQGFG 116
           +CSLLV+DRGN A+R+I L+QDDC   +   NS +   I+++V A   GY+  +LQ    
Sbjct: 214 SCSLLVVDRGNHAIREIQLHQDDCTSYDEDDNSFN-LGIVVLVAAAFFGYMLALLQWRVR 272

Query: 117 PFFFS-RTQQPSESEFNEETEIKELSNKEKPIPIVESM--KEEPGWPSFGQLIIDLSKLA 173
             F S   + P  ++  +       S +   IP  +    ++E  + S G+L ++     
Sbjct: 273 AMFSSPDVRAPFVAQQMQRPPPTTKSVRPPLIPNEDEFEKQDEGFFVSLGRLFLNSGTCM 332

Query: 174 LEAMGSMF---------LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQRQKSAV 224
            E +G +F          +    +++ +    N  P++++  +P+++  PP +       
Sbjct: 333 SEILGGLFSGSKRKSLQYHQYQQQYQYANRYPNAWPMQESFVIPDEDEPPPSL------- 385

Query: 225 PLSETHQVRTPGTGDKYPDMKPPKIKSASFKDS 257
                 + +TP   + YP M     K   FK S
Sbjct: 386 ------ETKTPTPRETYPIMTKELEKPQHFKPS 412


>gi|224140207|ref|XP_002323476.1| predicted protein [Populus trichocarpa]
 gi|222868106|gb|EEF05237.1| predicted protein [Populus trichocarpa]
          Length = 453

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 111/380 (29%), Positives = 181/380 (47%), Gaps = 72/380 (18%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D+ GN+YVADT+N+AIRKI  D GVTTIAGGK +   G  DGPSEDAKFSNDFDVVY  
Sbjct: 95  VDEGGNIYVADTMNMAIRKISTDGGVTTIAGGKWARGGGHVDGPSEDAKFSNDFDVVYSA 154

Query: 59  PTCSLLVIDRGNAALRQISLNQDDCEYQYNS----------ISPTDILMVVGAVLVGYVT 108
            +CSLL+IDRG+ A+R+I L+ DDC Y ++           I+   + ++V A   GY+ 
Sbjct: 155 SSCSLLIIDRGSQAIREIQLHDDDCNYPHDDCFHLDLDNILINIAGLAVLVAAGFFGYML 214

Query: 109 CMLQQGFGPFFFS-RTQQPSESEFNE---ETEIKELSNKEKPIPIVESMKEEPGWPSFGQ 164
            +LQ+     F S R + P ++ +           + ++++P+   E +     + S G+
Sbjct: 215 ALLQRRVQILFSSTRGKGPPKAPYQSPPMSVRPPFIPDEDEPVKSDEGL-----FGSLGR 269

Query: 165 LIIDLSKLALEAMGSMFLNF--VPFRFRSSGT------KGNLTPLRDTLRMPEDEAAPPV 216
           LI++ S    E  G +F  F   P  ++            N  P++D+  +P DE  PP 
Sbjct: 270 LILNTSSTVGEIFGGIFSGFRRKPIHYQFQQHYQQPLKHSNTWPVQDSFVIP-DEDEPPS 328

Query: 217 VQRQKSAVPLSETHQVRTPGTGDKYPDMKPPKIKSASFKDSSLSTKHRSSKRQEYAEFYG 276
           +             + R+P +   YP           F    +   H   + Q Y   +G
Sbjct: 329 I-------------ETRSPTSQKTYP-----------FMTKDVEQNHHLEQNQGYYSNWG 364

Query: 277 SG---------------EVPPPYSKSKSQKERTRHRQRDKSGELVYGV---SGMEPKPVE 318
            G               +    + +      +T + +  ++ E+V+G         + V 
Sbjct: 365 GGYHQQQQQQMHLQRYKQQQQQHHRHYMPNPKTYYEKSCETNEIVFGAVQEQNGRREAVV 424

Query: 319 MKPVDYDNLKFDHYNMRSKY 338
           +K VDY + +++H+N+R ++
Sbjct: 425 IKAVDYGDPRYNHHNIRPRF 444


>gi|413953662|gb|AFW86311.1| hypothetical protein ZEAMMB73_666053 [Zea mays]
          Length = 506

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 87/229 (37%), Positives = 124/229 (54%), Gaps = 11/229 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPS--EDAKFSNDFDVVYVR 58
           +DD+GN+YVAD +N+AIRKI D GVTTIAGGKS   G+ D PS  +DAKFS DF+V Y+ 
Sbjct: 159 VDDRGNIYVADAMNMAIRKISDTGVTTIAGGKSIRGGYIDEPSVSDDAKFSTDFEVQYIS 218

Query: 59  PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGF--G 116
            TCSLLVIDRGN A+R+I LN DDC YQY +  P    ++  A   GY+  +LQ      
Sbjct: 219 STCSLLVIDRGNQAIREIPLNDDDCAYQYEAGFPLGFALLCAAGFFGYMLALLQHRLLGM 278

Query: 117 PFFFSRTQQPSESEFNEETEIKELSN---KEKPIPIVESMKEEPG---WPSFGQLIIDLS 170
           P   ++ Q P           ++  N   ++  IP  E+ ++EP    + S G+L+    
Sbjct: 279 PSTINKPQAPPRPSIASIPPYQKPLNPYVRQPFIPREETARQEPEEGFFTSAGKLVGGAK 338

Query: 171 KLALEAMGSMFLNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
               E  G      +   +     + N  P++++  +P DE  P +  R
Sbjct: 339 SSVAEVFG-FKKKSLSNPYHQQQRRANPWPVQESYAIPHDEPPPALDTR 386


>gi|357137830|ref|XP_003570502.1| PREDICTED: uncharacterized protein LOC100839308 [Brachypodium
           distachyon]
          Length = 495

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 127/239 (53%), Gaps = 28/239 (11%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+YVAD +N+A+RKI D GVTTIAGGKS+  G  DGPS+DAKFS DF+V Y+  +
Sbjct: 152 VDGRGNIYVADAMNMAVRKISDTGVTTIAGGKSSRGGHVDGPSDDAKFSTDFEVRYIASS 211

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLVIDRGN A+R+I L+ DDC YQY +  P  + +++ A   GY+  +LQ+       
Sbjct: 212 CSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQR----RVL 267

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPIPIVESMK---------------EEPGWPSFGQL 165
                  E +   +  I  +   +   P+  S++               EE  + S G+L
Sbjct: 268 GMVSTEDELQTPPKASIASIPPYQIQKPLKSSLRPPLIPNEDDSEKQEVEEGFFSSVGKL 327

Query: 166 IIDLSKLALEAMGSMF-----LNFVPFRFRSSGTKGNLTPLRDTLRMPEDEAAPPVVQR 219
           I        E +G+ F     LN   ++      + N  P++++  +P  E  PP+  R
Sbjct: 328 IGGAKSSVAEIIGAAFSRKKRLNTHHYQ----QARANSWPVQESYAIPHGETPPPLDTR 382


>gi|226530993|ref|NP_001143690.1| uncharacterized protein LOC100276420 precursor [Zea mays]
 gi|195624742|gb|ACG34201.1| hypothetical protein [Zea mays]
          Length = 515

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 65/113 (57%), Positives = 85/113 (75%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DDKGN+YVAD +N+AIRKI D GVTTIAGGKS   G  DGPS++AKFS DF+V Y+  +
Sbjct: 158 VDDKGNIYVADAMNMAIRKISDTGVTTIAGGKSGRGGHVDGPSDEAKFSTDFEVRYIGSS 217

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQ 113
           CSLLVIDRGN A+R+I L+ DDC YQY +  P  + +++ A   GY+  +LQ+
Sbjct: 218 CSLLVIDRGNQAIREIQLHFDDCVYQYEAGFPLGVALLLAAAFFGYMLALLQR 270


>gi|449446849|ref|XP_004141183.1| PREDICTED: uncharacterized protein LOC101216407 [Cucumis sativus]
 gi|449489545|ref|XP_004158343.1| PREDICTED: uncharacterized protein LOC101227198 [Cucumis sativus]
          Length = 417

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/190 (41%), Positives = 109/190 (57%), Gaps = 11/190 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+Y+ADT N AIRKI DAGVTTIAGGK   +G  DGPSE++KFSNDFD+VYV  +
Sbjct: 88  VDQRGNIYIADTKNKAIRKISDAGVTTIAGGKWRKSGHIDGPSEESKFSNDFDLVYVGSS 147

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           CSLLV+DRGN A+R+I L  +DC  +Y+      I ++  A+L+GY+    Q      F 
Sbjct: 148 CSLLVVDRGNQAIREIQLRAEDCT-EYDGSFLLGIALLTAAMLLGYMLARFQFRVLATFS 206

Query: 121 SRTQQPSESEFNEETEIKELSNKEKPI--PIVESMKEEPGWP------SFGQLIIDLSKL 172
           S+    S  +      I      EK I  P++ S + E   P      S G+L ++    
Sbjct: 207 SKND--SRVDLRNIPSIPPYGRVEKSIRRPLIPSEEAEDNQPEENIICSLGKLFLNTGSS 264

Query: 173 ALEAMGSMFL 182
           A E   ++ L
Sbjct: 265 AAEIFVALLL 274


>gi|9369390|gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]
          Length = 493

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 71/123 (57%), Positives = 87/123 (70%), Gaps = 4/123 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD+GN+YVADT+N AIRKI +AGVTTIAGGK     G  DGPSEDAKFSNDFDVVY+  
Sbjct: 123 VDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVYLGS 182

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT-DILMVVGAVLVGYVTCMLQQGFGPF 118
           +CSLLVIDRGN A+R+I L+ DDC  QY S  P  +I +   +V V   +C+LQ      
Sbjct: 183 SCSLLVIDRGNQAIREIQLHFDDCADQYGSGFPLGNIYIPKFSVWVP--SCILQSSLSQL 240

Query: 119 FFS 121
            F+
Sbjct: 241 LFN 243


>gi|147860395|emb|CAN80443.1| hypothetical protein VITISV_043282 [Vitis vinifera]
          Length = 527

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 63/89 (70%), Positives = 75/89 (84%), Gaps = 1/89 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           MD++GN+Y+ADT+N+AIRKI DAGVTTIAGGK    G   DGPSEDAKFSNDFDVVY+  
Sbjct: 132 MDERGNIYIADTMNMAIRKISDAGVTTIAGGKWGRGGGHVDGPSEDAKFSNDFDVVYIGS 191

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYN 88
           +CSLLVIDRGN A+R+I L+ +DC YQYN
Sbjct: 192 SCSLLVIDRGNQAIREIQLHYEDCAYQYN 220


>gi|317015034|gb|ADU85604.1| NHL repeat-containing protein, partial [Wolffia arrhiza]
          Length = 239

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/96 (67%), Positives = 81/96 (84%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+YVAD  NLAIRKIGD+GV+TIAGGK+  AGFRDGPSE+A+FS DFDVVYV+  
Sbjct: 144 VDRRGNIYVADVSNLAIRKIGDSGVSTIAGGKAGAAGFRDGPSEEARFSADFDVVYVKSL 203

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL 96
           CSLLV+DRGNAALR+I L+ DDC   ++S+  +DIL
Sbjct: 204 CSLLVVDRGNAALRKIFLHDDDCTQDFSSVGFSDIL 239


>gi|359493626|ref|XP_003634638.1| PREDICTED: LOW QUALITY PROTEIN: protein argonaute 1B-like [Vitis
           vinifera]
          Length = 867

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 68/126 (53%), Positives = 79/126 (62%), Gaps = 35/126 (27%)

Query: 14  NLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAAL 73
           NLAIRKIGD+GVTTI GGKSNVAG+ +GP EDAKFS DFDVVYVR               
Sbjct: 11  NLAIRKIGDSGVTTIVGGKSNVAGYGNGPCEDAKFSRDFDVVYVR--------------- 55

Query: 74  RQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSRTQQPSESEFNE 133
                             PT I MV+GAV+VGY +C+LQ+GFGP  FS+TQ  SESE +E
Sbjct: 56  ------------------PTHIFMVIGAVMVGYASCLLQKGFGPSVFSKTQH-SESE-SE 95

Query: 134 ETEIKE 139
           +  IKE
Sbjct: 96  DQLIKE 101


>gi|30685100|ref|NP_196993.2| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332004703|gb|AED92086.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 754

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 118/232 (50%), Gaps = 18/232 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDD+GN+YVADT+N+AIRKI D GV+TIA G     G ++   E  +FS+DFD++YV  +
Sbjct: 159 MDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSGGSKE---ESMRFSDDFDLIYVSSS 215

Query: 61  CSLLVIDRGNAALRQISLNQDDC---EYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 117
           CSLLVIDRGN  +++I L+  DC   E   +S+     L+ V AV  GY+  +L +    
Sbjct: 216 CSLLVIDRGNQLIKEIQLHDHDCSQPEPDTDSLHLGTALL-VAAVFFGYMLALLVRRVRS 274

Query: 118 FFFSRTQQPSESEFNEETEIKELSNKEKPIPI--------VESMKEEPGWPSFGQLIIDL 169
            F S +             +     +  P P+         ES KEE    S G+L++  
Sbjct: 275 LFSSSSHDTKSKRHVATPSMTMAPYQRYPRPVRQPLIPPQHESEKEEGFLGSLGKLVVKT 334

Query: 170 SKLALEAMGSMFLNFVPFRFRSSGTKG--NLTPLRDTLRMPEDEAAPPVVQR 219
                E M S   N +P  F     +   N  P++++  +PE++  P +  R
Sbjct: 335 GSSVSEMM-SGSRNVIPPNFHQYHHQQEPNQWPVQESFAIPEEDGPPALEPR 385


>gi|9755656|emb|CAC01808.1| putative protein [Arabidopsis thaliana]
          Length = 733

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 118/232 (50%), Gaps = 18/232 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MDD+GN+YVADT+N+AIRKI D GV+TIA G     G ++   E  +FS+DFD++YV  +
Sbjct: 138 MDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSGGSKE---ESMRFSDDFDLIYVSSS 194

Query: 61  CSLLVIDRGNAALRQISLNQDDC---EYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 117
           CSLLVIDRGN  +++I L+  DC   E   +S+     L+ V AV  GY+  +L +    
Sbjct: 195 CSLLVIDRGNQLIKEIQLHDHDCSQPEPDTDSLHLGTALL-VAAVFFGYMLALLVRRVRS 253

Query: 118 FFFSRTQQPSESEFNEETEIKELSNKEKPIPI--------VESMKEEPGWPSFGQLIIDL 169
            F S +             +     +  P P+         ES KEE    S G+L++  
Sbjct: 254 LFSSSSHDTKSKRHVATPSMTMAPYQRYPRPVRQPLIPPQHESEKEEGFLGSLGKLVVKT 313

Query: 170 SKLALEAMGSMFLNFVPFRFRSSGTKG--NLTPLRDTLRMPEDEAAPPVVQR 219
                E M S   N +P  F     +   N  P++++  +PE++  P +  R
Sbjct: 314 GSSVSEMM-SGSRNVIPPNFHQYHHQQEPNQWPVQESFAIPEEDGPPALEPR 364


>gi|297807503|ref|XP_002871635.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317472|gb|EFH47894.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 704

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 73/229 (31%), Positives = 111/229 (48%), Gaps = 14/229 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD GN+YVADT N+AIRKI D GV+TI  G     G ++   E  +FS+DFD++YV  +
Sbjct: 159 IDDSGNIYVADTNNMAIRKISDDGVSTITAGGRWSGGSKE---ESMRFSDDFDLIYVSSS 215

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI--LMVVGAVLVGYVTCMLQQGFGPF 118
           CSLLVIDRGN  +++I L+  DC +Q        +   ++V A   GY+  +L +     
Sbjct: 216 CSLLVIDRGNQMIKEIQLHDHDCSHQEPDTDNLHLGTALLVAAAFFGYMFALLVRRVRSL 275

Query: 119 FFSRTQQPSESEFNEETEIKELSNKEKPI------PIVESMKEEPGWPSFGQLIIDLSKL 172
           F S               +       +P+      P  E  KEE    S G+L++     
Sbjct: 276 FSSSGHDNKRHVAKPSMTMAPYQRYPRPVRQPLIPPQHEPEKEEGFLGSLGKLVVKTGSS 335

Query: 173 ALEAMGSMFLNFVPFRFRSSGTKG--NLTPLRDTLRMPEDEAAPPVVQR 219
             E M S   N +P  F     +   N  P++++  +PE++  P +  R
Sbjct: 336 VSEMM-SGSRNVIPPNFHQYHHQQEPNQWPVQESYAIPEEDGPPALEPR 383


>gi|42571623|ref|NP_973902.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|332192325|gb|AEE30446.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 400

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/117 (47%), Positives = 74/117 (63%), Gaps = 5/117 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KGN+YVAD  N AIRKI  +G VTTIAGG S   G RDGP+++A FS+DF++ +V  
Sbjct: 108 VDAKGNVYVADKSNKAIRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQ 167

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFG 116
            C LLV D GN  +RQI+L ++DC    +S   T  L  +G VL    +C+L    G
Sbjct: 168 RCCLLVSDHGNEMIRQINLKEEDCLENSHSNLGTYSLWSIGIVL----SCILGVAIG 220


>gi|18395326|ref|NP_564209.1| NHL domain-containing protein [Arabidopsis thaliana]
 gi|15724248|gb|AAL06517.1|AF412064_1 At1g23890/T23E23_13 [Arabidopsis thaliana]
 gi|16974539|gb|AAL31179.1| At1g23890/T23E23_13 [Arabidopsis thaliana]
 gi|110742589|dbj|BAE99208.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192326|gb|AEE30447.1| NHL domain-containing protein [Arabidopsis thaliana]
          Length = 261

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 56/117 (47%), Positives = 74/117 (63%), Gaps = 5/117 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KGN+YVAD  N AIRKI  +G VTTIAGG S   G RDGP+++A FS+DF++ +V  
Sbjct: 108 VDAKGNVYVADKSNKAIRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQ 167

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFG 116
            C LLV D GN  +RQI+L ++DC    +S   T  L  +G VL    +C+L    G
Sbjct: 168 RCCLLVSDHGNEMIRQINLKEEDCLENSHSNLGTYSLWSIGIVL----SCILGVAIG 220


>gi|115476416|ref|NP_001061804.1| Os08g0414600 [Oryza sativa Japonica Group]
 gi|37806208|dbj|BAC99711.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623773|dbj|BAF23718.1| Os08g0414600 [Oryza sativa Japonica Group]
 gi|215715379|dbj|BAG95130.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215715380|dbj|BAG95131.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 275

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 79/129 (61%), Gaps = 3/129 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD +N A+RKI  +G  TTIAGG+S   G +DGP+++A FS+DF++VYV  
Sbjct: 118 VDHADNVYVADRMNGAVRKIAPSGFTTTIAGGRSKGPGRKDGPAQNATFSSDFELVYVPK 177

Query: 60  TCSLLVIDRGNAALRQISLNQDDC--EYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 117
            C+LLV DRGN  +RQI+L Q+DC  E Q    + +  ++ V   L+G V     + F P
Sbjct: 178 MCALLVTDRGNRLIRQINLKQEDCARETQPGLGTTSVSIIAVLCALLGSVIGFSIRHFYP 237

Query: 118 FFFSRTQQP 126
                  QP
Sbjct: 238 AHAGSLHQP 246


>gi|148909539|gb|ABR17864.1| unknown [Picea sitchensis]
          Length = 264

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 59/83 (71%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D KGN+YVAD  N AIRKI  +GV+TIAGG +   G  DGPS++A FSND+D+ ++   
Sbjct: 108 LDSKGNIYVADRANHAIRKISKSGVSTIAGGTAGKTGHADGPSQEAMFSNDYDLTFIPSI 167

Query: 61  CSLLVIDRGNAALRQISLNQDDC 83
           C+LLV DRGN  +RQI L   DC
Sbjct: 168 CALLVSDRGNRMIRQIKLPPGDC 190



 Score = 38.5 bits (88), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 42/79 (53%), Gaps = 6/79 (7%)

Query: 4   KGNLYVADTLNLAIRKIG-----DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
            G++ + D++N  + +IG     D+ +   AG ++ V GF DG   DA+F N      + 
Sbjct: 51  NGDILLLDSVNSTLFRIGLPLSQDSTIKVFAGSRNTVPGFADGGFLDAQF-NHPKSFTLD 109

Query: 59  PTCSLLVIDRGNAALRQIS 77
              ++ V DR N A+R+IS
Sbjct: 110 SKGNIYVADRANHAIRKIS 128


>gi|255560013|ref|XP_002521025.1| conserved hypothetical protein [Ricinus communis]
 gi|223539862|gb|EEF41442.1| conserved hypothetical protein [Ricinus communis]
          Length = 408

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 74/116 (63%), Gaps = 2/116 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D+KGN+YVAD LN  IRKI ++GV+TIAGG S   G  DGP+++A FS+DF+V +V   
Sbjct: 115 VDNKGNIYVADRLNGTIRKITNSGVSTIAGGYSKGFGREDGPAQNATFSSDFEVAFVAEE 174

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFG 116
           C+LL+ D GN  +R++ L  DDC    +S        V+G  LV  ++C++    G
Sbjct: 175 CALLISDHGNQLVRRLPLKPDDCATASHSALGAVSFWVLGLGLV--MSCLIGIAIG 228


>gi|9369410|gb|AAF87158.1|AC002423_23 T23E23.6 [Arabidopsis thaliana]
          Length = 370

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 50/92 (54%), Positives = 66/92 (71%), Gaps = 2/92 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KGN+YVAD  N AIRKI  +G VTTIAGG S   G RDGP+++A FS+DF++ +V  
Sbjct: 108 VDAKGNVYVADKSNKAIRKISSSGSVTTIAGGISKAFGHRDGPAQNATFSSDFEITFVPQ 167

Query: 60  TCSLLVIDRGNAALRQISLNQDDC-EYQYNSI 90
            C LLV D GN  +RQI+L ++DC E  ++SI
Sbjct: 168 RCCLLVSDHGNEMIRQINLKEEDCLENSHSSI 199


>gi|222640551|gb|EEE68683.1| hypothetical protein OsJ_27311 [Oryza sativa Japonica Group]
          Length = 420

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 58/135 (42%), Positives = 81/135 (60%), Gaps = 10/135 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD +N A+RKI  +G  TTIAGG+S   G +DGP+++A FS+DF++VYV  
Sbjct: 118 VDHADNVYVADRMNGAVRKIAPSGFTTTIAGGRSKGPGRKDGPAQNATFSSDFELVYVPK 177

Query: 60  TCSLLVIDRGNAALRQISLNQDDC--EYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 117
            C+LLV DRGN  +RQI+L Q+DC  E Q    + +  ++ V   L+G V     + F P
Sbjct: 178 MCALLVTDRGNRLIRQINLKQEDCARETQPGLGTTSVSIIAVLCALLGSVIGFSIRHFYP 237

Query: 118 F-------FFSRTQQ 125
                   FF R Q 
Sbjct: 238 AHEVSINRFFRRMQM 252


>gi|224065156|ref|XP_002301692.1| predicted protein [Populus trichocarpa]
 gi|222843418|gb|EEE80965.1| predicted protein [Populus trichocarpa]
          Length = 264

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 55/140 (39%), Positives = 77/140 (55%), Gaps = 8/140 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+YVAD LN A+RKI  +G+TT   G  +  G +DGP E A FS DF+V++V   
Sbjct: 107 VDLRGNVYVADQLNHAVRKISSSGMTTTIAGNYSQIGRQDGPGETATFSTDFEVLFVPQI 166

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCM--LQQGFG-- 116
           C+LL+ D GN  LRQ+ L Q+DC     S        V+G VL    +C+  L  GF   
Sbjct: 167 CALLISDHGNQLLRQVDLKQEDCIIGSQSALGAVKFWVLGLVL----SCLFGLATGFAIR 222

Query: 117 PFFFSRTQQPSESEFNEETE 136
           P+    T +     F ++ E
Sbjct: 223 PYVIPHTGRRQTPSFQQDME 242


>gi|226495023|ref|NP_001144050.1| uncharacterized protein LOC100276874 precursor [Zea mays]
 gi|195636116|gb|ACG37526.1| hypothetical protein [Zea mays]
 gi|414870559|tpg|DAA49116.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 272

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 83/139 (59%), Gaps = 6/139 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD L+ A+RK+  +G  TTIAGG S+  G RDGP+++A FS DF +VYV  
Sbjct: 115 VDAADNVYVADRLHGAVRKVAPSGYTTTIAGGFSSGPGHRDGPAQNATFSPDFALVYVPK 174

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL--MVVGAVLVGYVTCMLQQGFGP 117
            C+LLV DRGN  +RQI+L  +DC ++  S     ++  + V   L+G +   L + F P
Sbjct: 175 ICALLVADRGNRMVRQINLKPEDCAHEKQSGLGNTLVSFIAVLCTLLGSLIGFLARHFYP 234

Query: 118 FFFSRTQQPSESEFNEETE 136
                 +QP    F ++TE
Sbjct: 235 VNAGSLRQP---LFQQDTE 250


>gi|296086532|emb|CBI32121.3| unnamed protein product [Vitis vinifera]
          Length = 369

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/137 (41%), Positives = 81/137 (59%), Gaps = 12/137 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTT-IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KGN+YVAD  N  IRKI + GVTT IAGG +   G  DGP+++A FS DF++V+V  
Sbjct: 112 VDLKGNVYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPE 171

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVV----GAVLVGYVTCML---- 111
            C++LV DRG+  +RQI L  +DC     S+     L V+     + LVG++  ++    
Sbjct: 172 KCAVLVSDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWVLLGLGVSCLVGFIVGIISRPY 231

Query: 112 ---QQGFGPFFFSRTQQ 125
              Q+ F P FFS T +
Sbjct: 232 VIPQEVFCPLFFSETWK 248


>gi|297850862|ref|XP_002893312.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339154|gb|EFH69571.1| NHL repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 398

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 73/117 (62%), Gaps = 5/117 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KGN+YVAD  N AIRKI  +G VTTIAGG S   G RDGP+++A FS+DF++ +V  
Sbjct: 108 IDAKGNVYVADRSNKAIRKISSSGYVTTIAGGISKEFGHRDGPAQNATFSSDFEITFVPQ 167

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFG 116
            C LLV D GN  +RQI+L ++DC    +S      L  +G     +++C+L    G
Sbjct: 168 RCCLLVSDHGNEMVRQINLKEEDCLESSHSNLGAYSLWSIGI----FLSCILGVAIG 220


>gi|224028415|gb|ACN33283.1| unknown [Zea mays]
 gi|414870560|tpg|DAA49117.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 418

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/135 (41%), Positives = 81/135 (60%), Gaps = 10/135 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD L+ A+RK+  +G  TTIAGG S+  G RDGP+++A FS DF +VYV  
Sbjct: 115 VDAADNVYVADRLHGAVRKVAPSGYTTTIAGGFSSGPGHRDGPAQNATFSPDFALVYVPK 174

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDIL--MVVGAVLVGYVTCMLQQGFGP 117
            C+LLV DRGN  +RQI+L  +DC ++  S     ++  + V   L+G +   L + F P
Sbjct: 175 ICALLVADRGNRMVRQINLKPEDCAHEKQSGLGNTLVSFIAVLCTLLGSLIGFLARHFYP 234

Query: 118 F-------FFSRTQQ 125
                   FFSR ++
Sbjct: 235 VNEVSVNHFFSRIRK 249


>gi|225424715|ref|XP_002265272.1| PREDICTED: uncharacterized protein LOC100243227 [Vitis vinifera]
          Length = 438

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 57/137 (41%), Positives = 81/137 (59%), Gaps = 12/137 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTT-IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KGN+YVAD  N  IRKI + GVTT IAGG +   G  DGP+++A FS DF++V+V  
Sbjct: 112 VDLKGNVYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPE 171

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVV----GAVLVGYVTCML---- 111
            C++LV DRG+  +RQI L  +DC     S+     L V+     + LVG++  ++    
Sbjct: 172 KCAVLVSDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWVLLGLGVSCLVGFIVGIISRPY 231

Query: 112 ---QQGFGPFFFSRTQQ 125
              Q+ F P FFS T +
Sbjct: 232 VIPQEVFCPLFFSETWK 248


>gi|414870561|tpg|DAA49118.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 280

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 50/115 (43%), Positives = 73/115 (63%), Gaps = 7/115 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD L+ A+RK+  +G  TTIAGG S+  G RDGP+++A FS DF +VYV  
Sbjct: 115 VDAADNVYVADRLHGAVRKVAPSGYTTTIAGGFSSGPGHRDGPAQNATFSPDFALVYVPK 174

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNS------ISPTDILMVVGAVLVGYVT 108
            C+LLV DRGN  +RQI+L  +DC ++  S      +S   +L  +   L+G++ 
Sbjct: 175 ICALLVADRGNRMVRQINLKPEDCAHEKQSGLGNTLVSFIAVLCTLLGSLIGFLA 229


>gi|326503362|dbj|BAJ99306.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 452

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/136 (40%), Positives = 78/136 (57%), Gaps = 12/136 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD  N +IRK+  +G  TTIAG  S   G RD P+++A FS DF+++YV  
Sbjct: 132 VDGADNVYVADRANGSIRKVAPSGYTTTIAGAYSAGTGHRDEPAQNATFSADFELIYVPQ 191

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYN---SISPTDILMVVGAVLVGYVTCMLQQGFG 116
            C+LLV DRGN  +RQI L  +DC ++       + T I+ ++ A L G +   L + F 
Sbjct: 192 ICALLVADRGNRLIRQIKLKPEDCAHENQKGLGTTSTSIIAILAA-LFGSIIGFLVRHFY 250

Query: 117 PF-------FFSRTQQ 125
           PF       FF R Q+
Sbjct: 251 PFHEVSINRFFIRIQR 266


>gi|242079119|ref|XP_002444328.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
 gi|241940678|gb|EES13823.1| hypothetical protein SORBIDRAFT_07g020240 [Sorghum bicolor]
          Length = 416

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 79/135 (58%), Gaps = 10/135 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD  + A+RK+  +G  TT+AGG S+  G RDG +++A FS DF++VYV  
Sbjct: 113 VDAADNVYVADQRHGAVRKVAPSGYTTTVAGGLSSGPGHRDGLAQNATFSADFELVYVPK 172

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVG--AVLVGYVTCMLQQGFGP 117
            C+LLV DRGN  +RQI+L  +DC ++  S   T  + V+     L+G +   L + F P
Sbjct: 173 ICALLVADRGNRMVRQINLKPEDCAHEKQSGLGTTSVSVIAILCALLGLIIGFLVRHFYP 232

Query: 118 F-------FFSRTQQ 125
                   FF R Q+
Sbjct: 233 VNEVSINHFFIRIQK 247


>gi|414870562|tpg|DAA49119.1| TPA: hypothetical protein ZEAMMB73_641473 [Zea mays]
          Length = 207

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 46/90 (51%), Positives = 63/90 (70%), Gaps = 1/90 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D   N+YVAD L+ A+RK+  +G  TTIAGG S+  G RDGP+++A FS DF +VYV  
Sbjct: 115 VDAADNVYVADRLHGAVRKVAPSGYTTTIAGGFSSGPGHRDGPAQNATFSPDFALVYVPK 174

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNS 89
            C+LLV DRGN  +RQI+L  +DC ++  S
Sbjct: 175 ICALLVADRGNRMVRQINLKPEDCAHEKQS 204


>gi|388505270|gb|AFK40701.1| unknown [Medicago truncatula]
          Length = 384

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 97/194 (50%), Gaps = 15/194 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+YVAD +N  IRKI   GVTTIAGG S  +  +DGP ++A FSNDF++ ++   
Sbjct: 113 VDFRGNVYVADRVNKVIRKISTNGVTTIAGGSSEKSSIKDGPVQNASFSNDFELTFIPAL 172

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFG---- 116
           C+LLV D  +  + QI+L  +DC     S     +   +G  L    +C+L    G    
Sbjct: 173 CALLVSDHMHQLVHQINLKGEDCTLGSKSALGAVMTWTLGLGL----SCILGLVIGIVIR 228

Query: 117 PFFFSRTQQPSESEFNEETEIKELSNKEKPIPI----VESMKEEPGWPSFGQLIIDLSKL 172
           P+     +  S   F   T     +N  K +P     ++S     G  S   + + L +L
Sbjct: 229 PYIIPH-EHTSRCHFT-ATRKHCQTNLVKLVPTLYSGIKSAVASCGCSSVFTVAVRLWEL 286

Query: 173 ALEAMGSMF-LNFV 185
           +L  +  MF ++FV
Sbjct: 287 SLSLLVLMFNIDFV 300


>gi|147798774|emb|CAN69795.1| hypothetical protein VITISV_022545 [Vitis vinifera]
          Length = 459

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 54/141 (38%), Positives = 80/141 (56%), Gaps = 8/141 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTT-IAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KGN+YVAD  N  IRKI + GVTT IAGG +   G  DGP+++A FS DF++V+V  
Sbjct: 112 VDLKGNVYVADQSNGVIRKITNRGVTTTIAGGYAQKTGKVDGPAQNASFSKDFELVFVPE 171

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVV----GAVLVGYVTCMLQQGF 115
            C++LV DRG+  +RQI L  +DC     S+     L V+     + LVG++  ++ +  
Sbjct: 172 KCAVLVSDRGSQLVRQIDLKVEDCRRSPQSVLGGAFLWVLLGLGVSCLVGFIVGIISR-- 229

Query: 116 GPFFFSRTQQPSESEFNEETE 136
            P+    T     + F  + E
Sbjct: 230 -PYVIPHTGSLLPTLFQRDVE 249


>gi|224079468|ref|XP_002305875.1| predicted protein [Populus trichocarpa]
 gi|222848839|gb|EEE86386.1| predicted protein [Populus trichocarpa]
          Length = 241

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 73/136 (53%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+YVAD  N A+RKI ++GVT+   G  +  G +DGP + A FS+DF+V++V   
Sbjct: 84  VDLRGNVYVADQQNHAVRKISNSGVTSTIVGNYSQTGRQDGPGKTATFSSDFEVLFVPQI 143

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
           C+LL+ D GN  LRQI L  +DC     S        V+G  L   +  ++     P+  
Sbjct: 144 CALLISDHGNQLLRQIDLKPEDCVIGSQSALGAVKFWVLGLALSCLLGIVIGIATRPYVI 203

Query: 121 SRTQQPSESEFNEETE 136
             T +     F ++ E
Sbjct: 204 PHTGRLQTPSFQQDME 219


>gi|356526828|ref|XP_003532018.1| PREDICTED: uncharacterized protein LOC100816542 [Glycine max]
          Length = 400

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 59/84 (70%), Gaps = 1/84 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGK-SNVAGFRDGPSEDAKFSNDFDVVYVRP 59
            D +GN+YVAD  N AIRKI   GVTTIAGG+ S  +  +DGP+ +A FSNDFD+ ++  
Sbjct: 109 FDMRGNVYVADKSNRAIRKISAKGVTTIAGGEFSEKSRTKDGPALNASFSNDFDLTFIPG 168

Query: 60  TCSLLVIDRGNAALRQISLNQDDC 83
            C+LLV D  +  +RQI+L ++DC
Sbjct: 169 LCALLVSDHMHRLVRQINLMEEDC 192


>gi|357485931|ref|XP_003613253.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
 gi|355514588|gb|AES96211.1| hypothetical protein MTR_5g034550 [Medicago truncatula]
          Length = 154

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 61/118 (51%), Gaps = 18/118 (15%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +DD+GN+YVAD +N+AIRKI         G       F    S    +   FDV+YV  +
Sbjct: 36  VDDRGNIYVADIINMAIRKIS-------LGNNMTYLSFLYEESLILFYLLLFDVIYVGSS 88

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD-----------ILMVVGAVLVGYV 107
            SLLVIDRG  A+R+I L  DDC YQY S  P             I M+VGA   GY+
Sbjct: 89  YSLLVIDRGKQAIREIQLRFDDCAYQYESRFPLGKLNKFKVCLYRIAMLVGAGFFGYM 146


>gi|168018878|ref|XP_001761972.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686689|gb|EDQ73076.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 386

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 49/82 (59%), Gaps = 5/82 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 56
           +D  GN+YVAD  N AIR I   G    VTTIAGG  N  G  DG      FSNDF V Y
Sbjct: 128 LDSDGNIYVADVRNTAIRMITTQGKHSFVTTIAGGM-NRTGHNDGEGRVVTFSNDFGVTY 186

Query: 57  VRPTCSLLVIDRGNAALRQISL 78
           +R  C+LL++DRGN  +R + L
Sbjct: 187 LRKNCTLLIVDRGNRMVRAMKL 208


>gi|125561542|gb|EAZ06990.1| hypothetical protein OsI_29235 [Oryza sativa Indica Group]
          Length = 498

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 56/87 (64%), Gaps = 6/87 (6%)

Query: 26  TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEY 85
           TTIAGG+S   G +DGP+++A FS DF++VYV   C+LL+ DRGN  +RQI+L ++DC  
Sbjct: 223 TTIAGGRSKGPGRKDGPAQNATFSPDFELVYVPKMCALLITDRGNRLIRQINLKREDCAR 282

Query: 86  Q------YNSISPTDILMVVGAVLVGY 106
           +        S+S   +L  +   +VG+
Sbjct: 283 ETQPGLGTTSVSIIAVLCALLGSVVGF 309


>gi|414870491|tpg|DAA49048.1| TPA: hypothetical protein ZEAMMB73_516870 [Zea mays]
          Length = 271

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 57/90 (63%), Gaps = 4/90 (4%)

Query: 24  GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY-VRPTCSLLVIDRGNAALRQISLNQDD 82
           GVT    G+    G  DGPS++AK S DF+V Y +  +CSLLVIDRGN  +R+I L+ DD
Sbjct: 182 GVTVDDKGR---GGQVDGPSDEAKSSTDFEVCYYIGSSCSLLVIDRGNQTIREIQLHFDD 238

Query: 83  CEYQYNSISPTDILMVVGAVLVGYVTCMLQ 112
           C YQ+ +  P  + ++  A  +GY+  +LQ
Sbjct: 239 CVYQHEADFPLGVALLAVAAFLGYMLALLQ 268


>gi|302768697|ref|XP_002967768.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
 gi|300164506|gb|EFJ31115.1| hypothetical protein SELMODRAFT_67906 [Selaginella moellendorffii]
          Length = 186

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 54/79 (68%), Gaps = 5/79 (6%)

Query: 3   DKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D G ++VADT NLAIRKI  D  VTTIAGG S   GF DGP + A+FS++F +     +C
Sbjct: 101 DNGAVFVADTRNLAIRKISKDGEVTTIAGGSSRKPGFADGPGDTARFSSEFSLAC---SC 157

Query: 62  -SLLVIDRGNAALRQISLN 79
            SLL+ DRGN  +R+I ++
Sbjct: 158 GSLLIADRGNRLIREIQID 176


>gi|302768701|ref|XP_002967770.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
 gi|300164508|gb|EFJ31117.1| hypothetical protein SELMODRAFT_17931 [Selaginella moellendorffii]
          Length = 128

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 60/93 (64%), Gaps = 7/93 (7%)

Query: 3   DKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D G ++VADT NLAIRKI     G+TTIAGG S   GF DGP + A+FS++F +     +
Sbjct: 39  DNGAVFVADTRNLAIRKISKDGEGMTTIAGGSSRKPGFADGPGDTARFSSEFRLAC---S 95

Query: 61  C-SLLVIDRGNAALRQISLNQ-DDCEYQYNSIS 91
           C SLL+ DRGN  +R+I ++    C+   +++S
Sbjct: 96  CGSLLIADRGNRLIREIQIDDPKSCDSSDSAVS 128


>gi|302821973|ref|XP_002992647.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
 gi|300139611|gb|EFJ06349.1| hypothetical protein SELMODRAFT_7079 [Selaginella moellendorffii]
          Length = 124

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 40/92 (43%), Positives = 57/92 (61%), Gaps = 9/92 (9%)

Query: 3   DKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D G ++VADT NLAIRKI  D  VTTIAGG S   GF     + A+FS +F +     +C
Sbjct: 39  DNGAVFVADTRNLAIRKISKDGEVTTIAGGSSRKPGF---AGDTARFSGEFSLAC---SC 92

Query: 62  -SLLVIDRGNAALRQISLNQ-DDCEYQYNSIS 91
            SLL+ DRGN  +R+I ++  + C+   +++S
Sbjct: 93  GSLLIADRGNRLIREIQIDDPNSCDSSDSAVS 124


>gi|302821389|ref|XP_002992357.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
 gi|300139773|gb|EFJ06507.1| hypothetical protein SELMODRAFT_430570 [Selaginella moellendorffii]
          Length = 220

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 61/120 (50%), Gaps = 15/120 (12%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SL 63
           G L +A   N       DA VTTI GG S   GF DGP + A+FS++  +     +C SL
Sbjct: 10  GALLLASLAN-------DASVTTIVGGSSRKPGFADGPGDTARFSSESSLAC---SCGSL 59

Query: 64  LVIDRGNAALRQISLNQ-DDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFFSR 122
           L+ DRGN  +R+I ++    C+   +++S +    ++  VL+G     L    GPF   +
Sbjct: 60  LIADRGNRLIREIQIDDPKSCDSSDSAVSGSQKWALIPVVLLGVC---LGMPLGPFIIWK 116


>gi|302821461|ref|XP_002992393.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
 gi|300139809|gb|EFJ06543.1| hypothetical protein SELMODRAFT_47902 [Selaginella moellendorffii]
          Length = 183

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 50/79 (63%), Gaps = 8/79 (10%)

Query: 3   DKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D G ++VADT NLAIRKI  D  VTTIAGG S   GF     + A+FS++F +     +C
Sbjct: 101 DNGAVFVADTRNLAIRKISKDGEVTTIAGGSSRKPGF---AGDTARFSSEFSLAC---SC 154

Query: 62  -SLLVIDRGNAALRQISLN 79
            SLL+ D GN  +R+I ++
Sbjct: 155 GSLLIADCGNRLIREIQID 173


>gi|421861496|ref|ZP_16293498.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
 gi|410828922|dbj|GAC43935.1| hypothetical protein PPOP_3335 [Paenibacillus popilliae ATCC 14706]
          Length = 575

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 4/79 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D +GNLY+AD  N  IRK+  +G VTT+AG  S +AG+RDG   +A+F+   DV  V   
Sbjct: 158 DRQGNLYIADAGNHVIRKLDKSGRVTTVAG--SGLAGWRDGTGSEARFNEPRDVA-VAED 214

Query: 61  CSLLVIDRGNAALRQISLN 79
            SL V D  N  LR+I  N
Sbjct: 215 GSLYVADALNHVLRRIDAN 233


>gi|374606289|ref|ZP_09679172.1| copper amine oxidase domain-containing protein [Paenibacillus
           dendritiformis C454]
 gi|374388103|gb|EHQ59542.1| copper amine oxidase domain-containing protein [Paenibacillus
           dendritiformis C454]
          Length = 564

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 48/79 (60%), Gaps = 4/79 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D +GNLY+AD  N  IRK+  +G VTT+AG  S +AG++DG   +A+F+   DV  V   
Sbjct: 147 DRQGNLYIADAANHVIRKLDKSGRVTTVAG--SGLAGWKDGKGAEARFNEPRDVA-VAED 203

Query: 61  CSLLVIDRGNAALRQISLN 79
            SL V D  N  +R+I  N
Sbjct: 204 GSLYVADALNHVIRRIDAN 222



 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 45/90 (50%), Gaps = 13/90 (14%)

Query: 3   DKGNLYVADTLNLAIRKI-GDAGVTTI----------AGGKSNVAG-FRDGPSEDAKFSN 50
           + G+LYVAD LN  IR+I  +  VTT+          A G    AG + DG    +KF+ 
Sbjct: 202 EDGSLYVADALNHVIRRIDANGNVTTLNARSKRIVEYAPGAVTAAGDYADGKLMVSKFNE 261

Query: 51  DFDVVYVRPTCSLLVIDRGNAALRQISLNQ 80
              + +  P+  L+V D GN  LR + L Q
Sbjct: 262 PSSLAFT-PSGDLVVSDTGNQRLRLVDLKQ 290


>gi|340788705|ref|YP_004754170.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
 gi|340553972|gb|AEK63347.1| hypothetical protein CFU_3523 [Collimonas fungivorans Ter331]
          Length = 394

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLY+ADT N  IR I  AGV T   G +  AG  DG + DA+F+  + V      
Sbjct: 118 VDAGGNLYLADTQNHTIRMITSAGVVTTIAGSAGQAGQNDGTAGDARFNQPWGVAR-DGA 176

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D GNA +R+I+
Sbjct: 177 GNLYVTDTGNATVRKIT 193


>gi|431798813|ref|YP_007225717.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
 gi|430789578|gb|AGA79707.1| NHL repeat protein [Echinicola vietnamensis DSM 17526]
          Length = 655

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 53/80 (66%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +GN++VAD  N  IRKI  +G V+T AG  + +AGF DG +  AKF++ + V  V  
Sbjct: 364 IDQQGNIFVADRDNNVIRKISSSGDVSTFAG--TGIAGFTDGVAGVAKFNSPWKVA-VDN 420

Query: 60  TCSLLVIDRGNAALRQISLN 79
             +++V DRGN ++R+I+ N
Sbjct: 421 QGNVIVADRGNHSIRKITPN 440



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 8/93 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D++GN+ VAD  N +IRKI   G  +   G +N  G++DG    AKF    DV  V P 
Sbjct: 418 VDNQGNVIVADRGNHSIRKITPNGTVSTLAGTTN--GYQDGSGNQAKFDQPTDVA-VLPN 474

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 93
            ++++ D  N  +R I     D   Q ++I+ T
Sbjct: 475 GNIVIADNRNHCIRMI-----DSSVQVSTIAGT 502



 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGD-AGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GNL+VAD  N AIRKI     V+T+AGG  N  G ++G    AKF + + V  V   
Sbjct: 526 DPNGNLFVADRKNHAIRKIDSYHNVSTVAGG--NGEGIQNGGIAVAKFDDPYGVA-VGQN 582

Query: 61  CSLLVIDRGNAALRQIS 77
             +LV D  N  +R+I+
Sbjct: 583 GKVLVADLDNNVIREIN 599


>gi|182412380|ref|YP_001817446.1| immunoglobulin I-set domain-containing protein [Opitutus terrae
           PB90-1]
 gi|177839594|gb|ACB73846.1| Immunoglobulin I-set domain protein [Opitutus terrae PB90-1]
          Length = 1130

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D   NLYVADT N  IRK+  AGV T   G ++  G  DGP   A F N + +  + P 
Sbjct: 550 IDGADNLYVADTNNWKIRKVTPAGVVTTFAGHTSTQGANDGPIGIASFFNPYGLA-IGPN 608

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V+D     LR IS
Sbjct: 609 GALYVVDLAGDTLRMIS 625



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVAD  N  IRKI  AG  T   G +  +GF DGP+ +A+F     +  + P 
Sbjct: 242 IDAAGNVYVADINNHTIRKINPAGEVTTLAGAAGESGFADGPAANARFFCPTSLA-IDPA 300

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D  N A+R+IS
Sbjct: 301 GAIWVNDAINRAIRKIS 317



 Score = 47.0 bits (110), Expect = 0.013,   Method: Composition-based stats.
 Identities = 28/72 (38%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           G+LYVAD+   AIRK+  AGV T   G     G+ DGP  +A+FS    +  V  T  + 
Sbjct: 136 GDLYVADSNGHAIRKVTPAGVVTTLAGGPLRYGYMDGPGTEAQFSYPRGIA-VNATGVIF 194

Query: 65  VIDRGNAALRQI 76
           V DR    +R++
Sbjct: 195 VSDRSAHTIRRV 206



 Score = 46.6 bits (109), Expect = 0.017,   Method: Composition-based stats.
 Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           G +YVAD  N  IR I  AGV T   G +NV G  DG   +A F N   +V V P+  L 
Sbjct: 81  GLVYVADLANSTIRAITPAGVVTTLAGVANVHGCIDGVGSNALFHNPSALV-VGPSGDLY 139

Query: 65  VIDRGNAALRQIS 77
           V D    A+R+++
Sbjct: 140 VADSNGHAIRKVT 152



 Score = 45.1 bits (105), Expect = 0.053,   Method: Composition-based stats.
 Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 3/76 (3%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GNL V D     IRK+  AG  T   G     G  DGP+  A+F    D + V  + 
Sbjct: 444 DKEGNLLVGD--GRTIRKVTLAGAVTTIAGADGEDGDTDGPAASARF-RAVDGLAVDSSG 500

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V+DRG + +R+IS
Sbjct: 501 NIFVVDRGASTIRKIS 516


>gi|380693545|ref|ZP_09858404.1| hypothetical protein BfaeM_06134 [Bacteroides faecis MAJ27]
          Length = 454

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++V +  N  IRKI  AG  T+  G+   +GF DG  E+AKF N  + V V P 
Sbjct: 377 FDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGDGLPEEAKF-NQPECVTVYPD 435

Query: 61  CSLLVIDRGNAALRQISL 78
            S+ V DR N  +R++++
Sbjct: 436 NSIYVADRDNHVIRRVTV 453


>gi|298386893|ref|ZP_06996448.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
 gi|298260567|gb|EFI03436.1| IPT/TIG domain-containing protein [Bacteroides sp. 1_1_14]
          Length = 454

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++V +  N  IRKI  AG  T+  G+   +GF DG  E+AKF N  + V V P 
Sbjct: 377 FDEDGNMFVPERNNHIIRKITPAGSATLYAGRPEQSGFGDGLPEEAKF-NQPECVTVYPD 435

Query: 61  CSLLVIDRGNAALRQISL 78
            S+ V DR N  +R++++
Sbjct: 436 NSIYVADRDNHVIRRVTV 453


>gi|302821467|ref|XP_002992396.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
 gi|300139812|gb|EFJ06546.1| hypothetical protein SELMODRAFT_135182 [Selaginella moellendorffii]
          Length = 161

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 49/80 (61%), Gaps = 9/80 (11%)

Query: 3   DKGNLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D G ++VADT NLAIRKI      VTTIAGG S   GF     + A+FS++F +     +
Sbjct: 53  DNGAVFVADTRNLAIRKISKFLPTVTTIAGGSSRKPGF---AGDTARFSSEFSLAC---S 106

Query: 61  C-SLLVIDRGNAALRQISLN 79
           C SLL+ D GN  +R+I ++
Sbjct: 107 CGSLLIADCGNRLIREIQID 126


>gi|373953102|ref|ZP_09613062.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
 gi|373889702|gb|EHQ25599.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
          Length = 3731

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 41/76 (53%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GN+YVAD  N  IRKI   GV     G S  +GF +G +  AKF N   +       
Sbjct: 693 DAQGNVYVADNFNQTIRKITREGVVNTFAGTSGSSGFVNGTAAAAKFKNPIGIA-TDTQG 751

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D GN A+R+I+
Sbjct: 752 NVYVADNGNLAIRKIT 767



 Score = 46.6 bits (109), Expect = 0.017,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 41/76 (53%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GN+YVAD  N  IRKI  AGV T   G    +GF +G    AKF N    V      
Sbjct: 199 DMQGNVYVADYGNQLIRKITPAGVVTTLAGTVGSSGFVNGTGTAAKF-NYPRSVATDAAG 257

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D+ N A+R+I+
Sbjct: 258 NVYVADQVNQAIRKIT 273



 Score = 45.1 bits (105), Expect = 0.045,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 2    DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
            D +GN+YVAD  N AIRKI   GV T   G  ++ G  DG    A F N  + V      
Sbjct: 1285 DAQGNVYVADLYNNAIRKITPGGVVTTLAGTGSI-GSADGVGTSASFYNP-NAVATDAVG 1342

Query: 62   SLLVIDRGNAALRQIS 77
            ++ V+D  N  +R+I+
Sbjct: 1343 NIYVVDTYNQLIRKIT 1358



 Score = 45.1 bits (105), Expect = 0.057,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YVAD +N AIRKI  AGV T   G S V G  +G    A F N   V  +    
Sbjct: 254 DAAGNVYVADQVNQAIRKITPAGVVTTFAG-SGVPGALNGTGTAATFYNPTGVT-MDAQG 311

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D  N ++R+I+
Sbjct: 312 NVYVADSQNHSIRKIT 327



 Score = 43.5 bits (101), Expect = 0.16,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 4/78 (5%)

Query: 1    MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
            +D  GN+YVAD LN  +RKI  D  VTT+AG  S  AG  DG    A F    + + V  
Sbjct: 2165 VDASGNVYVADLLNSMVRKITPDGTVTTLAGSTS--AGSADGTGAAAGFHYPTN-LQVDD 2221

Query: 60   TCSLLVIDRGNAALRQIS 77
              +++V D+ N  +R+IS
Sbjct: 2222 QGNIIVADQLNNKIRKIS 2239



 Score = 43.1 bits (100), Expect = 0.19,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 4/77 (5%)

Query: 2    DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  GNLY+AD  N  IRKI  +G VTT AG  S  AG  +G    A F+N F +      
Sbjct: 1068 DASGNLYIADFNNRLIRKITPSGLVTTFAG--SGAAGSENGNGAAASFNNPFGLT-TDAQ 1124

Query: 61   CSLLVIDRGNAALRQIS 77
             ++ V D  N  +R+I+
Sbjct: 1125 GNIYVSDANNNTIRKIT 1141



 Score = 42.7 bits (99), Expect = 0.23,   Method: Composition-based stats.
 Identities = 28/76 (36%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           MD +GN+YVAD+ N +IRKI  AGV T   G  ++ G  +G   +A F    + V     
Sbjct: 307 MDAQGNVYVADSQNHSIRKITPAGVVTTLAGTGSM-GSANGAGTNASFYYP-NAVVADAL 364

Query: 61  CSLLVIDRGNAALRQI 76
            +L + D  N  +R+I
Sbjct: 365 GNLYIADTNNHLIRKI 380



 Score = 42.0 bits (97), Expect = 0.45,   Method: Composition-based stats.
 Identities = 32/76 (42%), Positives = 45/76 (59%), Gaps = 1/76 (1%)

Query: 2    DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
            D +GN+YVAD  N  IRKI   GV T   G + VAG  +G +  AKF++ +DV  V  T 
Sbjct: 1176 DAQGNIYVADFGNQVIRKITPDGVVTTFAGTTGVAGNVNGAAAAAKFNSPYDVA-VDVTG 1234

Query: 62   SLLVIDRGNAALRQIS 77
            ++ V D  N  +R+I+
Sbjct: 1235 NVYVADELNQVIRKIT 1250



 Score = 42.0 bits (97), Expect = 0.48,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 40/77 (51%), Gaps = 4/77 (5%)

Query: 2    DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  GNLYVADT N  IRK+  AG VTT AG  S  A   DG    A F N    + +  +
Sbjct: 1726 DAAGNLYVADTYNNLIRKVTSAGSVTTFAG--SGAASSVDGTGTAASF-NYPSAISIDAS 1782

Query: 61   CSLLVIDRGNAALRQIS 77
             +L V +     +R+IS
Sbjct: 1783 GNLYVAELNGNVIRKIS 1799



 Score = 41.6 bits (96), Expect = 0.51,   Method: Composition-based stats.
 Identities = 22/39 (56%), Positives = 26/39 (66%), Gaps = 4/39 (10%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD 40
           D +GN+YVAD  NLAIRKI  AGV T   G    +GF+D
Sbjct: 748 DTQGNVYVADNGNLAIRKITPAGVVTTLAG----SGFKD 782



 Score = 41.6 bits (96), Expect = 0.59,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 2    DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
            D +GN+YV+D  N  IRKI  +GV T   G S  +G  DG    A F++ + +       
Sbjct: 1122 DAQGNIYVSDANNNTIRKITPSGVVTTFAG-SGSSGAADGIGMAASFNSPYGLA-TDAQG 1179

Query: 62   SLLVIDRGNAALRQIS 77
            ++ V D GN  +R+I+
Sbjct: 1180 NIYVADFGNQVIRKIT 1195



 Score = 41.2 bits (95), Expect = 0.64,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVAD  N  IRKI   GV +   G S VAG  +G    A F+N F +      
Sbjct: 90  VDASGNVYVADAGNNLIRKISPVGVVSTFAG-SGVAGSANGTGTAASFNNPFGIA-TDVQ 147

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D  +  +R+I+
Sbjct: 148 GNLYVSDVNSNLIRKIT 164



 Score = 40.8 bits (94), Expect = 0.98,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            +D  GN+YVAD LN  IRKI  AG+ T   G   + G  +G    A F N   +      
Sbjct: 1230 VDVTGNVYVADELNQVIRKITPAGLVTTFAGSGGI-GALNGTGTAASFHNPTGIT-TDAQ 1287

Query: 61   CSLLVIDRGNAALRQIS 77
             ++ V D  N A+R+I+
Sbjct: 1288 GNVYVADLYNNAIRKIT 1304



 Score = 40.8 bits (94), Expect = 0.98,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            +D  GN+YVAD  N  IRKI  AGV T   GK   AG  DG    A F+    V  V  +
Sbjct: 2111 VDAAGNIYVADQGNNMIRKITAAGVVTTLAGKL-TAGSADGVGAAASFNLPAGVA-VDAS 2168

Query: 61   CSLLVIDRGNAALRQIS 77
             ++ V D  N+ +R+I+
Sbjct: 2169 GNVYVADLLNSMVRKIT 2185



 Score = 40.8 bits (94), Expect = 1.1,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 14/77 (18%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            +DD+GN+ VAD LN  IRKI  AGV T   G +              F+N +DV  +  T
Sbjct: 2219 VDDQGNIIVADQLNNKIRKISPAGVVTTIAGPTG-------------FNNPYDVA-ISKT 2264

Query: 61   CSLLVIDRGNAALRQIS 77
              + V D  + +++ IS
Sbjct: 2265 GIIYVADYNSNSIKAIS 2281



 Score = 38.9 bits (89), Expect = 4.0,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 32/55 (58%), Gaps = 3/55 (5%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 54
            +D  GNLYVA+     IRKI  AG VTTIAG  S  +G  +G  + A F N + +
Sbjct: 1779 IDASGNLYVAELNGNVIRKISPAGVVTTIAG--SGASGIANGIGKAATFGNLYSI 1831



 Score = 38.1 bits (87), Expect = 5.6,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 4/77 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D +GNLYV+D  +  IRKI   G VTT+AG  S  AG  +G    A F+  + +      
Sbjct: 145 DVQGNLYVSDVNSNLIRKITPGGVVTTLAGSGS--AGSVNGTGTAASFNTPYSLT-TDMQ 201

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D GN  +R+I+
Sbjct: 202 GNVYVADYGNQLIRKIT 218


>gi|302791603|ref|XP_002977568.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
 gi|300154938|gb|EFJ21572.1| hypothetical protein SELMODRAFT_417419 [Selaginella moellendorffii]
          Length = 288

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 62/107 (57%), Gaps = 11/107 (10%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           M   G ++VADT  +AIRKI  + VTTIAGG S   G  DG    A+FS++F +     +
Sbjct: 117 MCGNGAIFVADT-RMAIRKI--SKVTTIAGGSSRKPGIADGT---ARFSSEFSLAC---S 167

Query: 61  C-SLLVIDRGNAALRQISLNQ-DDCEYQYNSISPTDILMVVGAVLVG 105
           C SL + DRGN  +R+I ++    C+   +++S +   +V+  V +G
Sbjct: 168 CGSLPIADRGNWLIREIQIDDPKSCDSSDSTVSGSQKCVVLLGVCLG 214


>gi|444912221|ref|ZP_21232386.1| putative hemolysin [Cystobacter fuscus DSM 2262]
 gi|444717129|gb|ELW57964.1| putative hemolysin [Cystobacter fuscus DSM 2262]
          Length = 844

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GN+YV+D+ N  IR+I   G TT+  G     G  DG +  A+F N    ++V P  
Sbjct: 617 DAQGNVYVSDSDNYCIRRITPDGTTTVFAGAEMQPGGMDGSALQARF-NQPAGLFVTPAQ 675

Query: 62  SLLVIDRGNAALRQISL 78
            LLV D GN  +R+I L
Sbjct: 676 ELLVADLGNGVIRRIDL 692


>gi|302848538|ref|XP_002955801.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
           nagariensis]
 gi|300258994|gb|EFJ43226.1| hypothetical protein VOLCADRAFT_119185 [Volvox carteri f.
           nagariensis]
          Length = 1149

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 30/78 (38%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D  GN++VAD+ N  +R++   D+ V+T+ G   N AGFRDG   DA+F N        
Sbjct: 288 VDGDGNIFVADSSNHCVRQLTAADSIVSTLVGSPGN-AGFRDGTGTDARFRNPCGAAVNL 346

Query: 59  PTCSLLVIDRGNAALRQI 76
              +L+V D GN  LR++
Sbjct: 347 QDGTLVVADAGNNRLRKV 364


>gi|383124185|ref|ZP_09944852.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
 gi|251839311|gb|EES67395.1| hypothetical protein BSIG_3780 [Bacteroides sp. 1_1_6]
          Length = 454

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++V +  N  IRKI  AG  ++  G+   +GF DG  E+AKF N  + V V P 
Sbjct: 377 FDEDGNMFVPERNNHIIRKITPAGSASLYAGQPEQSGFGDGLPEEAKF-NQPECVTVYPD 435

Query: 61  CSLLVIDRGNAALRQISL 78
            S+ V DR N  +R++++
Sbjct: 436 NSVYVADRDNHVIRRVTV 453


>gi|261404592|ref|YP_003240833.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
           Y412MC10]
 gi|261281055|gb|ACX63026.1| copper amine oxidase domain protein [Paenibacillus sp. Y412MC10]
          Length = 1280

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KG +Y+AD+ N AIRKI  AG VTTIAG  +   G +DG ++DA F    DV  V  
Sbjct: 148 VDGKGQVYIADSGNHAIRKIDQAGRVTTIAG--NGRIGLKDGKAQDALFHEPQDVA-VTE 204

Query: 60  TCSLLVIDRGNAALRQIS 77
              L V D  N  +R+IS
Sbjct: 205 DGILYVADTLNHVIRRIS 222



 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 3/72 (4%)

Query: 6   NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 65
            L +AD+ N A+R++ D  V+T+AGG     G+ DG   + +F+   D+     +  L +
Sbjct: 342 GLLIADSHNHAVRQLKDERVSTMAGGTR---GYADGIESEVRFNAPADIAVASDSGELFI 398

Query: 66  IDRGNAALRQIS 77
            D+ N A+R  S
Sbjct: 399 ADQRNGAVRAWS 410



 Score = 40.8 bits (94), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 14/89 (15%)

Query: 1   MDDKGNLYVADTLNLAIRKI--GDAGVTTIAG------GKSNVA-----GFRDGPSEDAK 47
           +D KGNLYV+D+ N AIR I  G   V+T AG      G +  A     G+ DG ++ A+
Sbjct: 269 LDGKGNLYVSDSGNHAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYADPGYLDGAADAAR 328

Query: 48  FSNDFDVVYVRPTCSLLVIDRGNAALRQI 76
           F++   + +      LL+ D  N A+RQ+
Sbjct: 329 FNSPRGLAW-SVEDGLLIADSHNHAVRQL 356



 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 50/104 (48%), Gaps = 13/104 (12%)

Query: 3   DKGNLYVADTLNLAIRKIG-DAGVTTIA----------GGKSNVAG-FRDGPSEDAKFSN 50
           + G LYVADTLN  IR+I  D  VTTI            G   +AG +++G   +A+F N
Sbjct: 204 EDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGSLAEAQF-N 262

Query: 51  DFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 94
           +   + +    +L V D GN A+R I   +        S+  +D
Sbjct: 263 EPAGLALDGKGNLYVSDSGNHAIRYIDFGKGTVSTAAGSVPSSD 306


>gi|428185691|gb|EKX54543.1| hypothetical protein GUITHDRAFT_63359, partial [Guillardia theta
           CCMP2712]
          Length = 305

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 6   NLYVADTLNLAIRKIGDAGVTTIAGGKS--NVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           ++ VAD+ N A+R+I +  VTT+AGG       GF DG SE AKF     V++ +   +L
Sbjct: 124 SIIVADSGNNAVRRIANGMVTTLAGGSGPDRAGGFVDGESEGAKFRRPTFVMFDKEE-TL 182

Query: 64  LVIDRGNAALRQIS 77
           LVID GN  LR +S
Sbjct: 183 LVIDSGNHCLRVMS 196


>gi|302768699|ref|XP_002967769.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
 gi|300164507|gb|EFJ31116.1| hypothetical protein SELMODRAFT_409048 [Selaginella moellendorffii]
          Length = 199

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/56 (48%), Positives = 38/56 (67%), Gaps = 4/56 (7%)

Query: 25  VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC-SLLVIDRGNAALRQISLN 79
           VTTIAGG S   GF DGP + A+FS++F +     +C SLL+ DRGN  +R+I ++
Sbjct: 68  VTTIAGGSSRKPGFADGPGDTARFSSEFSLAC---SCGSLLIADRGNRLIREIQID 120


>gi|408823744|ref|ZP_11208634.1| NHL repeat-containing protein [Pseudomonas geniculata N1]
          Length = 681

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/48 (58%), Positives = 35/48 (72%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+IG D  VTT+AGG+    G+ DGP+  A+F
Sbjct: 120 DAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GYADGPAAQARF 164



 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG+    G  DG    A+F     + +   
Sbjct: 172 VDAQGQVYVADTYNDRIRVIGTDGNVRTLAGGER--PGMADGVGAAARFDTPVALAF-DA 228

Query: 60  TCSLLVIDRGNAALRQIS 77
             +LLV D  N A+R+I 
Sbjct: 229 QGALLVADLFNNAVRRIG 246


>gi|330824371|ref|YP_004387674.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
 gi|329309743|gb|AEB84158.1| NHL repeat containing protein [Alicycliphilus denitrificans K601]
          Length = 362

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 48/78 (61%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G L+V D LN A+R++  AG VTT+AG  +  AG+ DGP+   +F N   V  + P
Sbjct: 228 IDGQGRLWVPDQLNHAVRRVTPAGEVTTVAG--AGKAGYADGPAATVRFDNPTGVAPL-P 284

Query: 60  TCSLLVIDRGNAALRQIS 77
             +++V DR N  LR ++
Sbjct: 285 NGAVVVADRNNNRLRLVT 302



 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 3/93 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           ++  G +YVAD  N  IRKI  D  V T+AGG S  AGF DG  + A+F+  + V     
Sbjct: 118 VERSGAVYVADRNNHRIRKIRPDGTVITLAGGDS--AGFVDGDLKSARFNQPYGVALDAA 175

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISP 92
             +L V D  N A+R+I L  D  +    + +P
Sbjct: 176 QTTLYVADYLNHAIRRIDLVLDKVDTLAGNGAP 208



 Score = 41.6 bits (96), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 43/75 (57%), Gaps = 4/75 (5%)

Query: 3   DKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           + G+LY AD  N  +R +   G V T+AG  +  AGF DGP+  A+F N+   V V  + 
Sbjct: 66  NDGSLYFADRGNHQVRVLRSGGTVATVAG--TGKAGFADGPAHAARF-NEPIAVAVERSG 122

Query: 62  SLLVIDRGNAALRQI 76
           ++ V DR N  +R+I
Sbjct: 123 AVYVADRNNHRIRKI 137


>gi|255531263|ref|YP_003091635.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255344247|gb|ACU03573.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 646

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+Y+AD  N  IRKI   G V+T AG  + VAGF DG    AKF+  + V  V  
Sbjct: 357 IDASGNMYIADKNNNVIRKITSTGTVSTFAG--TGVAGFADGAGSIAKFNGPWKVA-VDA 413

Query: 60  TCSLLVIDRGNAALRQIS 77
           T ++ V DR N  +R+I+
Sbjct: 414 TGNVYVADRDNFKIRKIT 431



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+YVAD  N  IRKI  AG V+T+AG   + AG+ DG    AKF    DV  + P
Sbjct: 411 VDATGNVYVADRDNFKIRKITPAGIVSTLAG---STAGYADGTGSAAKFMQPLDVA-IDP 466

Query: 60  TCSLLVIDRGNAALRQIS 77
           + +++V D  +  +R+I+
Sbjct: 467 SGNVIVADNTSHRIRKIT 484



 Score = 43.9 bits (102), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 5/78 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVAD ++  IRKI    V+T+AG    + G+++G S  AKF+   D+V     
Sbjct: 572 VDVSGNVYVADLISSRIRKISSGQVSTLAG---TIPGYQNGTSTIAKFNQPTDLVI--QG 626

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ + D  N ++R + L
Sbjct: 627 SNIYIADHSNNSIRLVKL 644



 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+ VAD  +  IRKI  AG VTTIAG  +  AG+ +G    A+F N    V V  
Sbjct: 464 IDPSGNVIVADNTSHRIRKITAAGVVTTIAGNGT--AGYTNGTGTAAQFKNPSG-VDVDA 520

Query: 60  TCSLLVIDRGNAALRQIS 77
           + ++ V DR N  +R+I+
Sbjct: 521 SGNIYVADRLNHRIRKIT 538


>gi|194364186|ref|YP_002026796.1| SMP-30/gluconolaconase/LRE domain-containing protein
           [Stenotrophomonas maltophilia R551-3]
 gi|194346990|gb|ACF50113.1| SMP-30/Gluconolaconase/LRE domain protein [Stenotrophomonas
           maltophilia R551-3]
          Length = 693

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/48 (58%), Positives = 35/48 (72%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+IG D  VTT+AGG+    G+ DGP+  A+F
Sbjct: 132 DAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GYADGPAAQARF 176



 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG+    G  DG    A+F     + +   
Sbjct: 184 VDAQGQVYVADTFNDRIRVIGTDGNVRTLAGGER--PGLADGLGVAARFDTPVALAF-DA 240

Query: 60  TCSLLVIDRGNAALRQIS 77
             +LLV D  N A+R++ 
Sbjct: 241 HGALLVADLFNNAVRRVG 258


>gi|423301816|ref|ZP_17279839.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
           CL09T03C10]
 gi|408470907|gb|EKJ89439.1| hypothetical protein HMPREF1057_02980 [Bacteroides finegoldii
           CL09T03C10]
          Length = 469

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           MD + N+YVAD  N  IRKI  DAG  T   GK+   G++DG   +A F +   +  V  
Sbjct: 391 MDSQNNIYVADRYNCCIRKISLDAGTITTFAGKAQTEGYQDGLISEALFDDPMGIA-VDK 449

Query: 60  TCSLLVIDRGNAALRQISL 78
              + V D  N A+R+I+L
Sbjct: 450 NGVIYVADSDNYAIRRIAL 468


>gi|255531264|ref|YP_003091636.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255344248|gb|ACU03574.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 439

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 48/78 (61%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +GN+ VAD  N  IRKI  AG VTTIAG     AG+ DG    AKFS+ + +  + P
Sbjct: 148 VDAQGNVIVADRQNHTIRKISPAGQVTTIAG--DGTAGYADGTGTAAKFSSPWKLA-IDP 204

Query: 60  TCSLLVIDRGNAALRQIS 77
             +++V DR N  +R+I+
Sbjct: 205 LGNIIVADRDNFKIRKIA 222



 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +GN+ VAD LN  IRKI  AG V+TIAG  +   G  DG +  AKF++ + V  V  
Sbjct: 309 VDQQGNIIVADRLNHRIRKITPAGTVSTIAG--AGTTGLLDGDALTAKFADPYGVA-VDA 365

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++LV +  NA +R+I+
Sbjct: 366 GGNILVAELTNARIRKIT 383



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/56 (50%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 55
           +D  GN+ VAD  N  IRKI  D  VTT+AG   + AGF DG    AKF    DVV
Sbjct: 202 IDPLGNIIVADRDNFKIRKIAPDGAVTTLAG---STAGFADGTGSAAKFMQPLDVV 254



 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D +GN+YVAD     IRK+  +G VTT+AG     AG+ D     AKF N   +  V   
Sbjct: 256 DAQGNIYVADNTAHRIRKVSPSGTVTTLAG--DGTAGYLDATGVQAKFRNPSGLT-VDQQ 312

Query: 61  CSLLVIDRGNAALRQIS 77
            +++V DR N  +R+I+
Sbjct: 313 GNIIVADRLNHRIRKIT 329



 Score = 37.7 bits (86), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 34/57 (59%), Gaps = 6/57 (10%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS--NDFDV 54
           +D  GN+ VA+  N  IRKI   G V+T+AG     AGF DG S +AKF+   D D+
Sbjct: 363 VDAGGNILVAELTNARIRKITPVGQVSTLAGSS---AGFADGLSVNAKFNQPTDLDI 416


>gi|329908045|ref|ZP_08274805.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327546778|gb|EGF31712.1| hypothetical protein IMCC9480_3485 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 685

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 45/77 (58%), Gaps = 1/77 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GNL+VADT N  IRKI  AGV T   G  + +G  DGP   A+F  + + + +  + 
Sbjct: 446 DALGNLFVADTFNSRIRKITPAGVVTTIAGAGSSSGSTDGPGNIARFF-EPEAIAIDASR 504

Query: 62  SLLVIDRGNAALRQISL 78
           +L V D GN  +R+I++
Sbjct: 505 NLYVADTGNHTVRKITV 521



 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  G LYVADT N  IRKI   GV T   G+  V G  DG    A+FS    +  V   
Sbjct: 125 IDRAGTLYVADTNNQTIRKITPQGVVTTIAGRVGVDGSTDGRGNAARFSYPQGIA-VDVA 183

Query: 61  CSLLVIDRGNAALRQIS 77
            ++LV D  N  +R IS
Sbjct: 184 GTVLVSDTYNHTVRTIS 200



 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G + V+DT N  +R I   GV  T+AG   N  G  DG    A+FSN   +V    
Sbjct: 180 VDVAGTVLVSDTYNHTVRTISPGGVVGTLAGSAGNF-GVLDGVRSAARFSNPQGLV-TDA 237

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ V D GN  LR+++
Sbjct: 238 ARNIYVADAGNGVLRKVT 255


>gi|190572588|ref|YP_001970433.1| NHL repeat-containing protein [Stenotrophomonas maltophilia K279a]
 gi|190010510|emb|CAQ44119.1| putative NHL repeat protein [Stenotrophomonas maltophilia K279a]
          Length = 1267

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 28/48 (58%), Positives = 34/48 (70%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+IG D  VTT+AGG+    G  DGP+  A+F
Sbjct: 706 DAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GHADGPAVQARF 750



 Score = 42.4 bits (98), Expect = 0.29,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 42/77 (54%), Gaps = 4/77 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG  +  G  DG    A+F     + +   
Sbjct: 758 VDAQGQVYVADTFNDRIRVIGTDGMVRTLAGG--DRPGLADGVGAAARFDTPVALAF-DA 814

Query: 60  TCSLLVIDRGNAALRQI 76
             +LLV D  N A+R++
Sbjct: 815 QGALLVADLFNNAVRRV 831


>gi|431798812|ref|YP_007225716.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
 gi|430789577|gb|AGA79706.1| gluconolactonase [Echinicola vietnamensis DSM 17526]
          Length = 428

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 48/80 (60%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +GN+YVAD  N  IRKI   G V+T+AG  +  AGF DGP+E+A+F    D+  V  
Sbjct: 193 VDMEGNIYVADRDNHMIRKIDPQGMVSTVAG--TGEAGFADGPAEEAQFDQPLDIA-VTA 249

Query: 60  TCSLLVIDRGNAALRQISLN 79
              L V D  N  +R+I ++
Sbjct: 250 EGVLYVTDNRNHRIRKIEVD 269



 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 5/78 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +DD GN+YVAD +N  IRKI  +AG V+T+AG  S   G RDG    A+F+N +  + V 
Sbjct: 300 VDDMGNIYVADRINHLIRKIDLNAGQVSTVAGDGSQ--GTRDGQVMTAQFNNPYG-ISVA 356

Query: 59  PTCSLLVIDRGNAALRQI 76
               L+V D  N  +R I
Sbjct: 357 DNGQLVVADLSNHKIRLI 374



 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 38/74 (51%), Gaps = 5/74 (6%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           D G L VAD  N  IR I    V TIAG   +VAGF DG    ++F N  DV Y      
Sbjct: 357 DNGQLVVADLSNHKIRLIQGENVITIAG---SVAGFLDGVGVTSQFYNPTDVTY--HDGV 411

Query: 63  LLVIDRGNAALRQI 76
           + V D GN  +R+I
Sbjct: 412 IYVADLGNHRVRKI 425


>gi|357008076|ref|ZP_09073075.1| copper amine oxidase domain-containing protein [Paenibacillus elgii
           B69]
          Length = 533

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 46/77 (59%), Gaps = 4/77 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GN+YVAD+ N AIRKI  AG V+T+AG  + + G +DG  +DA F    DV  V   
Sbjct: 136 DANGNVYVADSGNHAIRKIDTAGQVSTVAG--NGLLGRKDGEGKDALFYRPTDVA-VAAD 192

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D  N A+R IS
Sbjct: 193 GTLYVADSLNHAIRSIS 209



 Score = 45.4 bits (106), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           + ++G L +AD+ N AIR + D  V+TIAG    + G  DG    A      DV  V   
Sbjct: 321 LTEEGGLVIADSQNHAIRYLLDGQVSTIAGAADRITGNADGIEGSAALHRPMDVA-VLAD 379

Query: 61  CSLLVIDRGNAALRQISL 78
            S+L  D  N  LR++SL
Sbjct: 380 GSILAADTYNNKLREVSL 397



 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 40/84 (47%), Gaps = 11/84 (13%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNV--------AGFRDGPSEDAKFSN 50
           +D KGNL V+D+ N  IR I      VTT+AGG             GF DG + DA+FS 
Sbjct: 256 LDAKGNLIVSDSGNQRIRYIDLQQGKVTTLAGGGQAATNKELHVQGGFADGSASDARFSF 315

Query: 51  DFDVVYVRPTCSLLVIDRGNAALR 74
              +        L++ D  N A+R
Sbjct: 316 PMGLALTEE-GGLVIADSQNHAIR 338


>gi|375146761|ref|YP_005009202.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361060807|gb|AEV99798.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 439

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 3/96 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD  NLYVAD  N  IRKI  +A VTT  G  S  AG+ DG    AKFS  +DV +   
Sbjct: 159 VDDNLNLYVADPGNHCIRKIDSNANVTTFCGSPSG-AGYADGKGTAAKFSLPYDVAF-DA 216

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI 95
             ++  +D  N  +R+I+ +     + + S SP  +
Sbjct: 217 QGNIWCVDPANWDIRKIAPDGTATTWAWGSQSPWSV 252


>gi|424666862|ref|ZP_18103887.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401069531|gb|EJP78052.1| hypothetical protein A1OC_00420 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 693

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/48 (58%), Positives = 34/48 (70%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+IG D  VTT+AGG+    G  DGP+  A+F
Sbjct: 132 DAQGNLYVADTGNHAIRRIGIDGQVTTLAGGEQ---GHADGPAAQARF 176



 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG     G  DG    A+F     + +   
Sbjct: 184 VDAQGQVYVADTFNDRIRVIGTDGNVRTLAGGDR--PGLADGVGAAARFDTPVALAF-DA 240

Query: 60  TCSLLVIDRGNAALRQISLN 79
             +LLV D  N A+R++  +
Sbjct: 241 QGALLVADLFNNAVRRVGAD 260


>gi|329908044|ref|ZP_08274804.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327546777|gb|EGF31711.1| hypothetical protein IMCC9480_3484 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 642

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 42/76 (55%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN++VADT N  IRKI  AGV T   G  +  G  DGP   A+FS   + + +    
Sbjct: 405 DPLGNVFVADTYNSTIRKITPAGVVTTIAGAPSSTGSTDGPGNLARFSGP-EAIAIDAQR 463

Query: 62  SLLVIDRGNAALRQIS 77
           +L V D GN  +R+IS
Sbjct: 464 NLYVGDTGNHTIRKIS 479



 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD--VVYVR 58
           +D  G LYVAD++N  IRK+   GV T   G++   G  DG    A+F   FD   V V 
Sbjct: 84  IDRAGMLYVADSVNHTIRKVSPQGVVTTLAGRAGEPGSADGRGSAARF---FDPKGVAVD 140

Query: 59  PTCSLLVIDRGNAALRQIS 77
              +++V D  N  +R+IS
Sbjct: 141 VAGNVVVSDNANHTIRKIS 159



 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+ V+D  N  IRKI  D  VTT+AG   N +   DG    A+F+  F    V  
Sbjct: 139 VDVAGNVVVSDNANHTIRKISPDGNVTTLAGSARNASAL-DGIGSAARFA--FPQALVTD 195

Query: 60  TC-SLLVIDRGNAALRQIS 77
              ++ V+DRGN  LR+I+
Sbjct: 196 AARNVYVVDRGNGLLRKIT 214



 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN YVAD  N  IRKI  AGV +   G S   G  DG    A FS    +V   P  
Sbjct: 351 DATGNWYVADAGNYMIRKITPAGVVSQLAG-SGKWGSVDGTGAGASFSAPKGIV-ADPLG 408

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D  N+ +R+I+
Sbjct: 409 NVFVADTYNSTIRKIT 424


>gi|456737758|gb|EMF62435.1| Hypothetical protein EPM1_0109 [Stenotrophomonas maltophilia EPM1]
          Length = 693

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/48 (58%), Positives = 34/48 (70%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+IG D  VTT+AGG+    G  DGP+  A+F
Sbjct: 132 DAQGNLYVADTGNHAIRRIGTDGQVTTLAGGEQ---GHADGPAAQARF 176



 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG     G  DG    A+F     + +   
Sbjct: 184 VDAQGQVYVADTFNDRIRVIGTDGNVRTLAGGDR--PGLADGVGATARFDTPVALAF-DA 240

Query: 60  TCSLLVIDRGNAALRQISLN 79
             +LLV D  N A+R++  +
Sbjct: 241 QGALLVADLFNNAVRRVGAD 260


>gi|451980854|ref|ZP_21929239.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
           gracilis 3/211]
 gi|451761976|emb|CCQ90480.1| conserved hypothetical protein, contains NHL repeats [Nitrospina
           gracilis 3/211]
          Length = 729

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRP 59
           MD +GNLY+ DT N  IRKI   G+ T   G  N     D GP+  A F +  D+V+  P
Sbjct: 565 MDKEGNLYIGDTANNVIRKIDKNGIVTTYAGNGNHEHSGDGGPALKAGFMSIGDIVF-SP 623

Query: 60  TCSLLVIDRGNAALRQIS 77
           +  L V++ GN  +R+I+
Sbjct: 624 SGELHVVEPGNHTVRKIT 641



 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 45/89 (50%), Gaps = 2/89 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D KGNLYV+D +N  IRK+   G + TIAG  S+      GP++DA F  D   + +   
Sbjct: 510 DKKGNLYVSDRINHQIRKVTPEGKIITIAGNGSSDYTGDGGPAKDASFR-DPQSLTMDKE 568

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNS 89
            +L + D  N  +R+I  N     Y  N 
Sbjct: 569 GNLYIGDTANNVIRKIDKNGIVTTYAGNG 597



 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 4/77 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           D +GNLYVAD  N  +RK+   G +TT AG GK+  +G + GP+  AK +   D+  V  
Sbjct: 106 DKEGNLYVADRENHVVRKVDTNGIITTFAGTGKAGYSGDK-GPATQAKLNLPSDMT-VDH 163

Query: 60  TCSLLVIDRGNAALRQI 76
             +L + DR N  +R++
Sbjct: 164 KGNLFISDRSNNVIRKV 180


>gi|329927682|ref|ZP_08281814.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
           HGF5]
 gi|328938318|gb|EGG34710.1| copper amine oxidase N-terminal domain protein [Paenibacillus sp.
           HGF5]
          Length = 1251

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KG +Y+AD+ N AIRKI  AG V TIAG  +   G +DG ++DA F    DV  V  
Sbjct: 119 VDGKGQVYIADSGNHAIRKIDQAGRVNTIAG--NGRIGLKDGKAQDALFHEPQDVA-VTE 175

Query: 60  TCSLLVIDRGNAALRQIS 77
              L V D  N  +R+IS
Sbjct: 176 DGILYVADTLNHVIRRIS 193



 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 7   LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 66
           L +AD+ N A+R++ D  VTT+ GG     G+ DG   + +F+   D+     +  L + 
Sbjct: 314 LLIADSHNHAVRQLKDERVTTMVGGTR---GYADGIESEVRFNAPADIGVASDSGELFIA 370

Query: 67  DRGNAALRQIS 77
           D+ N A+R  S
Sbjct: 371 DQRNGAVRAWS 381



 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 14/89 (15%)

Query: 1   MDDKGNLYVADTLNLAIRKI--GDAGVTTIAG------GKSNVA-----GFRDGPSEDAK 47
           +D KGNLYV+D+ N AIR I  G   V+T AG      G +  A     G+ DG ++ A+
Sbjct: 240 LDGKGNLYVSDSGNQAIRYIDFGKGTVSTAAGSVPSSDGYAKDALYADPGYLDGAADAAR 299

Query: 48  FSNDFDVVYVRPTCSLLVIDRGNAALRQI 76
           F++   + +      LL+ D  N A+RQ+
Sbjct: 300 FNSPRGLAW-SVEDGLLIADSHNHAVRQL 327



 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 50/104 (48%), Gaps = 13/104 (12%)

Query: 3   DKGNLYVADTLNLAIRKIG-DAGVTTIA----------GGKSNVAG-FRDGPSEDAKFSN 50
           + G LYVADTLN  IR+I  D  VTTI            G   +AG +++G   +A+F N
Sbjct: 175 EDGILYVADTLNHVIRRISPDGEVTTIGSPSTRAVQVRAGVIALAGDYKNGSLAEAQF-N 233

Query: 51  DFDVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 94
           +   + +    +L V D GN A+R I   +        S+  +D
Sbjct: 234 EPAGLALDGKGNLYVSDSGNQAIRYIDFGKGTVSTAAGSVPSSD 277


>gi|223935789|ref|ZP_03627704.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223895390|gb|EEF61836.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 755

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN YVADT N  IRK+   G  T   G + V G  DG   +A+FS     V +   
Sbjct: 433 VDTGGNAYVADTANNVIRKVTSGGTVTTLAGTAGVEGQGDGTGSNAQFSGP-QAVALDGA 491

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D GN  +R+IS
Sbjct: 492 ANVYVSDTGNHTIRKIS 508



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLYVADT N  IRKI    V+T A G + V+G  +G    A+F N    V V   
Sbjct: 56  VDKTGNLYVADTANHTIRKISGGVVSTFA-GLAGVSGSANGKGSAARF-NQPQGVAVDTN 113

Query: 61  CSLLVIDRGNAALRQISL 78
             + V D GN  +R+I+L
Sbjct: 114 GIVYVADTGNHIIRKIAL 131



 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 37/74 (50%), Gaps = 2/74 (2%)

Query: 4   KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
            G +YVAD LN  IR +   GV T   G +   G +DG   +A+F     V     T  +
Sbjct: 273 NGTVYVADNLNHTIRAVTSGGVVTTLAGLAGNYGSKDGTGSNARFYAPQGVAVSGST--V 330

Query: 64  LVIDRGNAALRQIS 77
            V+D GN  +RQIS
Sbjct: 331 FVVDTGNGTIRQIS 344



 Score = 45.4 bits (106), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 45/79 (56%), Gaps = 7/79 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDF--DVVYV 57
           +D  GN+YVADT N  IRKI  D  V+T+A G   V G  DG + DA+F   F  + + +
Sbjct: 548 VDSSGNVYVADTGNHTIRKITADGSVSTLA-GLPGVWGNADGTNRDARF---FQPEGISI 603

Query: 58  RPTCSLLVIDRGNAALRQI 76
               +L V+D GN  +R +
Sbjct: 604 DSQGNLFVMDSGNHTMRML 622


>gi|373488690|ref|ZP_09579354.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
 gi|372005635|gb|EHP06271.1| NHL repeat containing protein [Holophaga foetida DSM 6591]
          Length = 438

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 40/78 (51%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLYVAD  N  IRKI   G  T   G++  +G  DG    A+F++   + +    
Sbjct: 185 LDASGNLYVADYGNHTIRKIDTLGAVTTFAGQAEASGTVDGDRLSARFNHPIGLAFNASY 244

Query: 61  CSLLVIDRGNAALRQISL 78
             L V D GN  +R I++
Sbjct: 245 SVLYVADSGNHTIRAINI 262


>gi|427400231|ref|ZP_18891469.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
 gi|425720971|gb|EKU83886.1| hypothetical protein HMPREF9710_01065 [Massilia timonae CCUG 45783]
          Length = 367

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/58 (46%), Positives = 33/58 (56%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D   NLYVADT N  IRKI  AGV T   G +  +GF DG + +A+F N   V   R
Sbjct: 82  IDADDNLYVADTGNFVIRKITPAGVVTTVAGAAGTSGFVDGDAGNARFGNPVAVAVNR 139


>gi|86610239|ref|YP_479001.1| NHL repeat-containing protein [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86558781|gb|ABD03738.1| NHL repeat protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 649

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 46/77 (59%), Gaps = 4/77 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GNL+VAD  N  IR++   G V+T+AG   N  G++DGP+  A+F     +  V P 
Sbjct: 561 DAAGNLWVADRDNHRIRRLQPGGQVSTLAG--QNQPGWQDGPTSVARFDQPAGLA-VLPD 617

Query: 61  CSLLVIDRGNAALRQIS 77
            S++V+D G   LR+IS
Sbjct: 618 GSVVVVDAGLPGLRRIS 634



 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
            D KG L+VAD+ N  +R I     V+T AG  + +AG+RDG  + A+F     +  +  
Sbjct: 401 WDPKGGLWVADSGNHCLRYINLQRQVSTFAG--TCIAGYRDGGLDRAQFREPSGLA-LGS 457

Query: 60  TCSLLVIDRGNAALRQIS 77
             SL V DR N  +R+I+
Sbjct: 458 DGSLYVADRANRRIRRIT 475



 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 48
           +   G+LYVAD  N  IR+I  AG VTT+AG  +   G  DGP+E A+ 
Sbjct: 455 LGSDGSLYVADRANRRIRRITPAGEVTTVAG--TGQPGSVDGPAEQAQL 501


>gi|430809992|ref|ZP_19437107.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
 gi|429497580|gb|EKZ96110.1| NHL repeat containing protein [Cupriavidus sp. HMR-1]
          Length = 536

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 47/78 (60%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN++VADT N  +R +   G VTT+AG  +  AGF DGP++ A F+    +  V  
Sbjct: 352 VDVAGNVFVADTYNHLVRMVTPTGVVTTLAG--TGTAGFADGPAQAAAFNRPAGIA-VDA 408

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ V D GNA +R+IS
Sbjct: 409 AGNIYVSDLGNARIRKIS 426



 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+YVAD  N  IRKI   G VTT+AG  S VAG  DG    A F N    V V  
Sbjct: 244 VDGTGNVYVADYGNHRIRKISPYGAVTTLAG--SGVAGSTDGTGTAASF-NLPSGVAVDG 300

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ V D  N  +R+I+
Sbjct: 301 AGNVYVADTYNHLIRKIT 318


>gi|312198348|ref|YP_004018409.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229684|gb|ADP82539.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 782

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +GNLY+AD  N  +R++   G +TT+AG      G  DGP+ DA F    + + + P
Sbjct: 529 VDGRGNLYIADGENERVRRVDTHGIITTVAGSGVEAGGDGDGPATDAFFKYT-EGLTIGP 587

Query: 60  TCSLLVIDRGNAALRQIS 77
             SL V D+GN+ +R+I+
Sbjct: 588 DGSLYVGDQGNSRIRRIT 605


>gi|373955434|ref|ZP_09615394.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
 gi|373892034|gb|EHQ27931.1| NHL repeat containing protein [Mucilaginibacter paludis DSM 18603]
          Length = 459

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GN+YVAD+ N  IRKI  AGVTT   G   + G+ DGP+  A+F     V       
Sbjct: 171 DAQGNMYVADSYNNVIRKITAAGVTTTYAGTGTL-GYLDGPAATAQFYAPKGVA-ADAQG 228

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D GN  +R+IS
Sbjct: 229 NIYVADMGNNMIRKIS 244



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GN+YVAD  N  IRKI  AGV T   GK + AG+ DG   DA F +   +  V  + 
Sbjct: 225 DAQGNIYVADMGNNMIRKISAAGVVTTLAGKGS-AGYADGTGADAVFKSPAGLA-VDASG 282

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D+G   +R+++
Sbjct: 283 NIYVADQGTNTIRKVT 298



 Score = 38.9 bits (89), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSE-DAKFSNDFDVVYVR 58
           +D  GN+YVAD     IRK+  AG VTT+AG  +  +G  D  +  DA+FS+   V  V 
Sbjct: 278 VDASGNIYVADQGTNTIRKVTSAGVVTTLAGAAA--SGQVDATTNTDARFSSPSGVT-VD 334

Query: 59  PTCSLLVIDRGNAALRQIS 77
            + ++ V D  N A+R+++
Sbjct: 335 ASGNVYVADLANHAIRKVT 353


>gi|29347940|ref|NP_811443.1| hypothetical protein BT_2530 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383121619|ref|ZP_09942326.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
 gi|29339842|gb|AAO77637.1| conserved hypothetical protein, with a conserved domain
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|251837946|gb|EES66035.1| hypothetical protein BSIG_4948 [Bacteroides sp. 1_1_6]
          Length = 441

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++VAD  N  IR I    + T   G+  VAG++DG   ++ F N + V  V   
Sbjct: 364 FDNDGNMFVADYGNHCIRMISADNIVTTVAGQPGVAGYKDGGPVESLFKNPWGVA-VNEQ 422

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D GNA +R++ +
Sbjct: 423 GDIYIADWGNARIRKLVI 440


>gi|296136309|ref|YP_003643551.1| NHL repeat containing protein [Thiomonas intermedia K12]
 gi|295796431|gb|ADG31221.1| NHL repeat containing protein [Thiomonas intermedia K12]
          Length = 366

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +G +YV++  N  IRKI   G  T   GK    GF DG  E+A+F +   + +  P 
Sbjct: 232 LDAQGQIYVSEYFNNVIRKITPDGTVTTFAGKPGKGGFADGKVEEAQFLHPQTLSFA-PD 290

Query: 61  CSLLVIDRGNAALRQIS 77
            SL+V D GN  +R+IS
Sbjct: 291 GSLIVADTGNNRVRRIS 307



 Score = 45.8 bits (107), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN++VAD +N  +RK+G  G  +I  G+      R+GP+  A+F +  + V V   
Sbjct: 67  LDAQGNIFVADYVNSVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSP-ECVAVATD 125

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D G+  +R IS
Sbjct: 126 GTLFVSDSGSNTVRCIS 142


>gi|115376708|ref|ZP_01463935.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
 gi|310822897|ref|YP_003955255.1| hypothetical protein STAUR_5665 [Stigmatella aurantiaca DW4/3-1]
 gi|115366257|gb|EAU65265.1| hypothetical protein STIAU_5350 [Stigmatella aurantiaca DW4/3-1]
 gi|309395969|gb|ADO73428.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 856

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YVAD+ N  IR+I   G  T+  G     G  DG ++ A+F+    + +  P  
Sbjct: 625 DASGNIYVADSDNHCIRRIAPDGTATVFAGAVMEPGQGDGTAKAARFNQPAGLAF-GPGG 683

Query: 62  SLLVIDRGNAALRQISLN 79
            LLV D GN  +R++ L+
Sbjct: 684 ELLVADLGNGVIRRVDLS 701


>gi|254522386|ref|ZP_05134441.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
 gi|219719977|gb|EED38502.1| NHL repeat containing protein [Stenotrophomonas sp. SKA14]
          Length = 643

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/48 (60%), Positives = 33/48 (68%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+IG D  VTT+AGG   V G  DGP   A+F
Sbjct: 82  DAQGNLYVADTGNHAIRRIGIDGQVTTLAGG---VQGHGDGPVAQARF 126



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG+    G  DG  + A+F     + +   
Sbjct: 134 VDAQGQVYVADTFNDRIRVIGTDGSVRTLAGGER--PGLADGAGDAARFDTPVALAF-DA 190

Query: 60  TCSLLVIDRGNAALRQIS 77
             +LLV D  N A+R++ 
Sbjct: 191 QGALLVADLFNNAVRRVG 208


>gi|409204160|ref|ZP_11232357.1| putative outer membrane adhesin-like protein [Pseudoalteromonas
           flavipulchra JG1]
          Length = 2380

 Score = 48.9 bits (115), Expect = 0.003,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D   NLYV++     IRKI  AGV T   G + V+G  DG +  A+FS  F +  V   
Sbjct: 786 LDSADNLYVSEQTGHTIRKITSAGVVTTFAGSAGVSGSTDGLASVARFSQPFGIA-VDSN 844

Query: 61  CSLLVIDRGNAALRQIS 77
             + V D GN  +R+IS
Sbjct: 845 GIVYVADTGNHRIRKIS 861



 Score = 43.9 bits (102), Expect = 0.099,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNL+V +T    +RKI  AGV T   G    +GF D     AKF+  ++       
Sbjct: 677 IDSNGNLFVVETNPHIVRKITPAGVVTTFAGSKGSSGFTDATGTSAKFNFPYN-GGSSSN 735

Query: 61  CSLLVIDRGNAALRQIS 77
             L + DR N A+R+++
Sbjct: 736 NDLFIADRNNHAIRKVT 752



 Score = 38.9 bits (89), Expect = 3.8,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 37/76 (48%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D   N+YVAD  N  IRKI   GV T   G S  AG  +G    A F N    V V  + 
Sbjct: 570 DSNNNVYVADYSNHVIRKITPEGVVTTLAG-SGTAGSDEGTGSAASF-NFPKAVTVDASG 627

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D  N  +R+I+
Sbjct: 628 NVYVADSSNNKIRKIT 643



 Score = 37.7 bits (86), Expect = 7.3,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVAD+ N  IRKI  AGV T   G S   G  DG    A F+    +  +   
Sbjct: 623 VDASGNVYVADSSNNKIRKITPAGVVTTFAG-SGSPGSTDGTGTAATFAAPTGIT-IDSN 680

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V++     +R+I+
Sbjct: 681 GNLFVVETNPHIVRKIT 697


>gi|423215200|ref|ZP_17201728.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692463|gb|EIY85701.1| hypothetical protein HMPREF1074_03260 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 451

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++V +     IRKI  AGV ++  G    +GF DG  E AKF N  + V V P 
Sbjct: 374 FDEDGNMFVPEKSAHIIRKITPAGVASLYAGIPGQSGFGDGLPELAKF-NSPECVTVYPD 432

Query: 61  CSLLVIDRGNAALRQISL 78
            S+ V DR N  +R++++
Sbjct: 433 NSVYVADRENHVIRRVTV 450


>gi|380692527|ref|ZP_09857386.1| hypothetical protein BfaeM_00899 [Bacteroides faecis MAJ27]
          Length = 441

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++VAD  N  IR +    + T   G+  VAG++DG   ++ F N + V  V   
Sbjct: 364 FDNDGNMFVADYGNHCIRMVSADNIVTTVAGQPGVAGYKDGGPLESLFKNPWGVA-VNEQ 422

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D GNA +R++ +
Sbjct: 423 GDIYIADWGNARIRKLVI 440


>gi|298387816|ref|ZP_06997366.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
 gi|298259421|gb|EFI02295.1| hypothetical protein HMPREF9007_04616 [Bacteroides sp. 1_1_14]
          Length = 440

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++VAD  N  IR +    + T   G+  VAG++DG   ++ F N + V  V   
Sbjct: 363 FDNDGNMFVADYGNHCIRMVSADNIVTTVAGQPGVAGYKDGGPVESLFKNPWGVA-VNEQ 421

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D GNA +R++ +
Sbjct: 422 GDIYIADWGNARIRKLVI 439


>gi|302768705|ref|XP_002967772.1| hypothetical protein SELMODRAFT_409051 [Selaginella
          moellendorffii]
 gi|302826179|ref|XP_002994615.1| hypothetical protein SELMODRAFT_432523 [Selaginella
          moellendorffii]
 gi|300137312|gb|EFJ04320.1| hypothetical protein SELMODRAFT_432523 [Selaginella
          moellendorffii]
 gi|300164510|gb|EFJ31119.1| hypothetical protein SELMODRAFT_409051 [Selaginella
          moellendorffii]
          Length = 172

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 36/58 (62%), Gaps = 8/58 (13%)

Query: 25 VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC---SLLVIDRGNAALRQISLN 79
          VTTIAGG S   GF DGP + A+FS++F +      C   SLL+ DRGN  + +I ++
Sbjct: 41 VTTIAGGSSRKPGFADGPGDTARFSSEFSL-----ACSFGSLLIADRGNRLIHEIQID 93


>gi|402814801|ref|ZP_10864394.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
 gi|402507172|gb|EJW17694.1| copper amine oxidase domain protein [Paenibacillus alvei DSM 29]
          Length = 571

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 4/76 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D KGNLYVAD  N  IRKI  AG VTT+AG    + G++DG +  A+F      + V   
Sbjct: 150 DKKGNLYVADAGNHVIRKIDGAGKVTTVAG--DGIPGWKDGAAGTARFHTP-RAIAVADD 206

Query: 61  CSLLVIDRGNAALRQI 76
            ++ V D  N  +R+I
Sbjct: 207 GAIYVTDSLNHVIRRI 222


>gi|428179226|gb|EKX48098.1| hypothetical protein GUITHDRAFT_69001 [Guillardia theta CCMP2712]
          Length = 334

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 44/77 (57%), Gaps = 5/77 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           MD +GN+  AD+ N  IRK+  D  VTTIAG K    GF+DGP+  A F N    V V  
Sbjct: 109 MDAQGNIIFADSGNQRIRKVSPDGTVTTIAGSKK---GFKDGPAGKALF-NYPAYVAVDS 164

Query: 60  TCSLLVIDRGNAALRQI 76
             S+ V D GN  +R+I
Sbjct: 165 KGSIFVSDFGNHCIRKI 181



 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D KG+++V+D  N  IRKI G+  VTT+AG  +   G+ DG    A+F N    + +  
Sbjct: 162 VDSKGSIFVSDFGNHCIRKIDGEGMVTTVAG--NGKMGWADGKGAKARF-NSPQGMCIDK 218

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ + D GN  +R++S
Sbjct: 219 EDTVYIADYGNQRIRKMS 236


>gi|405345633|ref|ZP_11022426.1| hypothetical protein A176_0011 [Chondromyces apiculatus DSM 436]
 gi|397093682|gb|EJJ24379.1| hypothetical protein A176_0011 [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 870

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 42/76 (55%), Gaps = 5/76 (6%)

Query: 5   GNLYVADTLNLAIRKI---GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           G LYV DT++  +R++   G   V T+AG G     GF+DGP  DA+F     +V + P 
Sbjct: 735 GTLYVLDTVSQRLRRVSLQGSRAVVTLAGTGAGTPFGFQDGPGSDARFRAQLGMV-MGPQ 793

Query: 61  CSLLVIDRGNAALRQI 76
             LL+ D  N  LR+I
Sbjct: 794 GELLLADTANLRLRKI 809


>gi|290978674|ref|XP_002672060.1| predicted protein [Naegleria gruberi]
 gi|284085634|gb|EFC39316.1| predicted protein [Naegleria gruberi]
          Length = 1219

 Score = 48.1 bits (113), Expect = 0.006,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 44/77 (57%), Gaps = 4/77 (5%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G ++ +DT N  IR++ ++G +TTIAG GKS   G  DG + D   +   ++ YV  T  
Sbjct: 487 GEVFFSDTENFRIRRVSNSGIITTIAGTGKSKFNG--DGMATDTNLATPTEIQYVPSTTE 544

Query: 63  LLVIDRGNAALRQISLN 79
           +   D GN  +R+ ++N
Sbjct: 545 IYFADSGNNRIRKFTIN 561


>gi|223934698|ref|ZP_03626618.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223896653|gb|EEF63094.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 847

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           ++  GNLYVAD  N  IR+I  AGV T   G S  AG  DG +  A+F     V  V  T
Sbjct: 284 INSAGNLYVADYFNNTIRQISPAGVVTTLAGLSGTAGSADGTNSSARFLGPQGVA-VDST 342

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ + D  N+ +R ++
Sbjct: 343 GTVFIADTANSTIRVMT 359



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D + N+YVADT N  IRKI   GV ++  G + V G  DG   +A FS    +  V   
Sbjct: 393 VDGQNNIYVADTQNSVIRKITPFGVVSVLAGTTGVFGSADGSGANALFSGPQGIA-VDGG 451

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D GN+ +R+I+
Sbjct: 452 GNIYVADTGNSTIRKIT 468



 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLYV D  N  IRK+  AG  T   G + + G  DG    A F      + V   
Sbjct: 558 VDGAGNLYVTDYNNDTIRKVTSAGAVTTLAGWTGMWGSIDGAGNSALFFGPSG-ISVDAL 616

Query: 61  CSLLVIDRGNAALRQISL 78
            +L VID GN+ LR+++L
Sbjct: 617 GNLYVIDSGNSTLRKLTL 634



 Score = 41.2 bits (95), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 42/82 (51%), Gaps = 11/82 (13%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDF----DVV 55
           +D  GN+YVADT N AIR I   G VTT+AG     AGF    S DA  +N        +
Sbjct: 229 VDQAGNVYVADTGNAAIRMIMPGGSVTTLAG----AAGFVG--SADASGTNALFHQPAGI 282

Query: 56  YVRPTCSLLVIDRGNAALRQIS 77
            +    +L V D  N  +RQIS
Sbjct: 283 GINSAGNLYVADYFNNTIRQIS 304



 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVADT N  IRKI  +G T+   G +   G  DG    A+F     V  V   
Sbjct: 448 VDGGGNIYVADTGNSTIRKITPSGSTSTLAGSAGNPGNADGAGITAQFYQPQGVA-VDSA 506

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D GN  +R ++
Sbjct: 507 NNVYVADTGNHTVRMVT 523



 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D+  NLYVAD  N  IRKI  + V +   G + V G  DG     +F N+ + V V   
Sbjct: 176 VDNATNLYVADYGNHLIRKITPSNVVSTLAGVTGVPGSADG--LGGQF-NEPEAVAVDQA 232

Query: 61  CSLLVIDRGNAALRQI 76
            ++ V D GNAA+R I
Sbjct: 233 GNVYVADTGNAAIRMI 248


>gi|153868931|ref|ZP_01998653.1| receptor protein kinase [Beggiatoa sp. PS]
 gi|152074497|gb|EDN71345.1| receptor protein kinase [Beggiatoa sp. PS]
          Length = 3115

 Score = 48.1 bits (113), Expect = 0.007,   Method: Composition-based stats.
 Identities = 46/129 (35%), Positives = 64/129 (49%), Gaps = 14/129 (10%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
            +D +GNLY+ADTLN  IRK+   G +TT+AG GK+   G  +G +  AK  N   +V+  
Sbjct: 2191 IDHEGNLYIADTLNHRIRKVDSNGIITTVAGIGKAGNTG-DNGLATAAKLRNPTAIVFDN 2249

Query: 59   PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPF 118
                L + D GN  +R++S  Q   +   NSI    I  V G    GY     Q   GP 
Sbjct: 2250 -NGHLYIADSGNHRIRKVS-GQRTRKPSANSI----ITTVAGNGRSGY-----QGDNGPA 2298

Query: 119  FFSRTQQPS 127
              +R   P+
Sbjct: 2299 TGARLSNPT 2307



 Score = 40.8 bits (94), Expect = 1.1,   Method: Composition-based stats.
 Identities = 25/76 (32%), Positives = 36/76 (47%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            +D + NLY+ADT N  IRK+   G  T   G  N     DG    A   N    + V  T
Sbjct: 2311 VDSQNNLYIADTDNHRIRKVDLTGTITTVAGNGNKGYSGDGDPATAAQINTPTGLEVDST 2370

Query: 61   CSLLVIDRGNAALRQI 76
             +L + D+ N  +R++
Sbjct: 2371 GNLYIADKNNHRIRKV 2386



 Score = 37.7 bits (86), Expect = 7.3,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVRP 59
            +D  GNLY+AD  N  IRK+   G+ T   G        DG  +  A+ S   DV  +  
Sbjct: 2367 VDSTGNLYIADKNNHRIRKVDTEGIITTFTGTGKPGTATDGIIASVAQISQPTDVA-LDQ 2425

Query: 60   TCSLLVIDRGNAALRQI 76
              +L + D+GN  +R+I
Sbjct: 2426 YGNLYIADKGNDTIRKI 2442


>gi|255589882|ref|XP_002535117.1| nhl repeat-containing protein, putative [Ricinus communis]
 gi|223523999|gb|EEF27264.1| nhl repeat-containing protein, putative [Ricinus communis]
          Length = 435

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GNLYVADT N AIRKIG D  V+T+AG    +AG +DG    A+F+    V  V  
Sbjct: 127 IDAAGNLYVADTGNNAIRKIGPDGTVSTLAG--DGLAGDKDGKGAGAQFNGPIGVA-VDA 183

Query: 60  TCSLLVIDRGNAALRQIS 77
              + V D  N  +R+I+
Sbjct: 184 AGVVYVTDTYNDRIRRIA 201



 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +YV DT N  IR+I  D  VTT+AGGK   AG  DG    A F     +  +  
Sbjct: 181 VDAAGVVYVTDTYNDRIRRIAPDGTVTTVAGGKR--AGMADGAGAQALFDTPTGIA-LGA 237

Query: 60  TCSLLVIDRGNAALRQIS 77
           + +L + D GN+A+R+I 
Sbjct: 238 SGALYIADTGNSAIRKIG 255



 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 6/79 (7%)

Query: 1   MDDKGNLYVADTL-NLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D +GNLYVAD   N +IRKI   GV TT+AGG     G+ +G    A F N    + + 
Sbjct: 73  LDRQGNLYVADGGDNNSIRKIDLDGVTTTLAGGTE---GYAEGAPTAAAF-NTPSGLAID 128

Query: 59  PTCSLLVIDRGNAALRQIS 77
              +L V D GN A+R+I 
Sbjct: 129 AAGNLYVADTGNNAIRKIG 147


>gi|196230675|ref|ZP_03129536.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
 gi|196225016|gb|EDY19525.1| Peptidase S53 propeptide [Chthoniobacter flavus Ellin428]
          Length = 1120

 Score = 47.8 bits (112), Expect = 0.009,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVAD  N  IRKI   G  T   G+  V G  +G   +A F N  + V +   
Sbjct: 695 VDSSGNIYVADFNNDTIRKITPGGTVTTPYGQPGVIGATNGTGTNATF-NTPNGVAIDSA 753

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D GN+A+R+I+
Sbjct: 754 NNIYVADSGNSAIRKIT 770



 Score = 39.3 bits (90), Expect = 2.4,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D   N+YVAD+ N AIRKI    V +   G+  ++G  DG ++ A F+N   V  V   
Sbjct: 750 IDSANNIYVADSGNSAIRKITPGRVVSTLAGQPGLSGSADGTTK-ALFNNPQGVA-VDSA 807

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D  N  +R+I+
Sbjct: 808 GNVYVADTTNETIRKIT 824



 Score = 39.3 bits (90), Expect = 2.9,   Method: Composition-based stats.
 Identities = 20/49 (40%), Positives = 27/49 (55%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 49
           +D  GN+YVAD  N  +RKI   GV T   G++ + G  DG    A F+
Sbjct: 859 VDSAGNVYVADLYNFVVRKITPGGVVTTPYGQAGMPGRLDGIGTAALFN 907


>gi|384245978|gb|EIE19470.1| hypothetical protein COCSUDRAFT_67905 [Coccomyxa subellipsoidea
          C-169]
          Length = 838

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 32/67 (47%)

Query: 1  MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
          M   G L VAD+ N  IRK+   G  T   G+    G  DG    A FSN  + V   P 
Sbjct: 1  MSPNGFLAVADSTNACIRKVTLEGEVTTLAGECGFPGDEDGEMHSALFSNGIEDVACLPN 60

Query: 61 CSLLVID 67
          C++LV D
Sbjct: 61 CTILVTD 67


>gi|197120448|ref|YP_002132399.1| fibronectin type III domain-containing protein [Anaeromyxobacter
           sp. K]
 gi|196170297|gb|ACG71270.1| Fibronectin type III domain protein [Anaeromyxobacter sp. K]
          Length = 460

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 3   DKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D   LYVADT+N AIR +  A   V+T+AG  ++ AG+ DG    A+FS  + +      
Sbjct: 277 DGTTLYVADTMNNAIRALDLATGVVSTLAGDPASYAGYADGRGTAARFSAPYGL--AMHG 334

Query: 61  CSLLVIDRGNAALRQI 76
             LLV D GN+A+R +
Sbjct: 335 GELLVADSGNSAIRAV 350


>gi|158315873|ref|YP_001508381.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158111278|gb|ABW13475.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 850

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GN+Y+AD+ N  IR++G  G+  T+AGG    A    GP+ DA  +    V   RP 
Sbjct: 661 DPTGNIYIADSGNNRIRRVGTDGIIQTVAGGDGEGAFGDGGPAADALLAFPISVAMDRP- 719

Query: 61  CSLLVIDRGNAALRQISLN 79
             L + D GN  +R+I L+
Sbjct: 720 GRLYIADSGNNRIRRIGLD 738



 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVRP 59
           +D  GN+ +ADT N  IR++  +G  T   GK +     DG P+ +A       VV   P
Sbjct: 604 LDSAGNILIADTFNQRIRRVDPSGTITTVAGKDDRGFSEDGVPATEATLWYPGGVV-ADP 662

Query: 60  TCSLLVIDRGNAALRQISLN 79
           T ++ + D GN  +R++  +
Sbjct: 663 TGNIYIADSGNNRIRRVGTD 682


>gi|182414669|ref|YP_001819735.1| SMP-30/gluconolaconase/LRE domain-containing protein [Opitutus
           terrae PB90-1]
 gi|177841883|gb|ACB76135.1| SMP-30/Gluconolaconase/LRE domain protein [Opitutus terrae PB90-1]
          Length = 1292

 Score = 47.4 bits (111), Expect = 0.010,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  G +YVADT N  IRK+  +GV T   G   ++G  DG  + A+F+   D+  V   
Sbjct: 225 VDANGTVYVADTSNHIIRKVTPSGVVTTLAGSPGISGNSDGAGDAARFNAPTDIA-VDDA 283

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V+D+ + +LR+I+
Sbjct: 284 GTIYVVDQ-SGSLRKIT 299



 Score = 47.0 bits (110), Expect = 0.013,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLYVADT N +IRKI  AG  T   G S V G  DG    A+F +   + + R  
Sbjct: 486 VDQHGNLYVADTFNHSIRKIDAAGQVTTPYGVSGVEGTVDGIGNAARFGSPTALAFDRDG 545

Query: 61  CSLLVID 67
            SL V D
Sbjct: 546 -SLFVAD 551



 Score = 40.8 bits (94), Expect = 0.86,   Method: Composition-based stats.
 Identities = 28/76 (36%), Positives = 36/76 (47%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GNLYV D     IRKI  A   +   G     G  DGP+  A+F     V  V    
Sbjct: 63  DADGNLYVVDPGTNLIRKITPAASVSTFAGTPTGWGLVDGPAASARFGLPQGVA-VGADG 121

Query: 62  SLLVIDRGNAALRQIS 77
           ++ + D GNAA+R I+
Sbjct: 122 TVYIADTGNAAIRIIT 137



 Score = 38.5 bits (88), Expect = 5.2,   Method: Composition-based stats.
 Identities = 36/144 (25%), Positives = 59/144 (40%), Gaps = 5/144 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN++VA+     IR+I   G     GG +   G  DG    A+F+  + +   R  
Sbjct: 594 VDRAGNVFVAENTTHVIRRITPDGTVVTIGGLAGSIGTADGVGSAARFNEPWGLALDR-F 652

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGPFFF 120
             L ++D GN  +R+  +          +I+   +    GA      T        P F 
Sbjct: 653 GHLYIVDSGNNTIRKGVIVGATAP----AITKDPVAKAAGAGRQASFTLTATGLPAPQFQ 708

Query: 121 SRTQQPSESEFNEETEIKELSNKE 144
            + ++  ES+F   +EI E S  E
Sbjct: 709 WQRKRAGESDFVALSEIAEFSGVE 732


>gi|386716891|ref|YP_006183217.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
 gi|384076453|emb|CCH11034.1| NHL repeat protein [Stenotrophomonas maltophilia D457]
          Length = 652

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 27/48 (56%), Positives = 33/48 (68%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+I  D  VTT+AGG+    G  DGP+  A+F
Sbjct: 91  DAQGNLYVADTGNHAIRRISTDGQVTTLAGGEQ---GHADGPAAQARF 135



 Score = 42.0 bits (97), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG     G  DG    A+F     + +   
Sbjct: 143 VDAQGQVYVADTFNDRIRVIGTDGSVRTLAGGDR--PGLADGVGAAARFDTPVALAF-DA 199

Query: 60  TCSLLVIDRGNAALRQISLN 79
             +LLV D  N A+R++  +
Sbjct: 200 QGALLVADLFNNAVRRVGAD 219


>gi|29348037|ref|NP_811540.1| cell surface protein [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339939|gb|AAO77734.1| putative cell surface protein, have conserved domain [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 434

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN+Y+AD+ N  IR I   G+ T   G+   AG+ DG S D    ND   V V   
Sbjct: 357 LDMEGNIYIADSGNHCIRMIDKNGIVTTPIGQPGEAGYADG-SPDMALLNDPRGVAVNSE 415

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D GN  +R+++L
Sbjct: 416 GDVYIADLGNRCIRKLTL 433


>gi|338536093|ref|YP_004669427.1| putative lipoprotein [Myxococcus fulvus HW-1]
 gi|337262189|gb|AEI68349.1| putative lipoprotein [Myxococcus fulvus HW-1]
          Length = 855

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+YVADT +  IR+I   G V   AGG     G +DGP++ A F+    +     
Sbjct: 624 VDAAGNVYVADTDHYVIRRIDVEGKVEVFAGGTP---GLQDGPAKQAAFNQPTGMTVTPD 680

Query: 60  TCSLLVIDRGNAALRQISL 78
             +LLV D  N  +R+I L
Sbjct: 681 GTALLVADMNNGVIRRIDL 699


>gi|223934991|ref|ZP_03626910.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223896444|gb|EEF62886.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 1064

 Score = 47.4 bits (111), Expect = 0.010,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVADTLN  IR+I   G  T   G + V+G  +G + DA+F     +  V  T
Sbjct: 462 VDGVGNIYVADTLNHIIRRITPDGAATTFAGSAGVSGTANGTNTDAQFYAPQGLA-VDGT 520

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D  N  +R+I+
Sbjct: 521 GNVFVADTFNNLIRKIT 537



 Score = 47.0 bits (110), Expect = 0.012,   Method: Composition-based stats.
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 1/81 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D   N++VADT N  IRKI   G+ T   G     G RDG   +A F     +  V  +
Sbjct: 78  IDISNNVFVADTENHVIRKISCTGIITTLAGSLGTHGSRDGSGTNALFFRPAGIA-VDAS 136

Query: 61  CSLLVIDRGNAALRQISLNQD 81
            ++LV D GN  +R+I+   D
Sbjct: 137 GNVLVADTGNNTVRKITATGD 157



 Score = 45.4 bits (106), Expect = 0.042,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 40/77 (51%), Gaps = 4/77 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GNLY+ADT N  IR I   G VTT AG  S   G  DG S DA+F      V V   
Sbjct: 354 DATGNLYIADTGNNTIRTITPGGSVTTFAGLPS--IGSADGLSSDARFRFP-QAVAVDAA 410

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D  N  +R+IS
Sbjct: 411 TNVYVADTANQTIRKIS 427



 Score = 44.3 bits (103), Expect = 0.094,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D+  N++VADT N  IRKI  +G      G + V G  D    +A FS    +  V  +
Sbjct: 188 IDNFNNIFVADTGNNTIRKITPSGNVNTMAGSAGVYGNLDNSGANALFSGPQGLT-VDSS 246

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V+D GN  +R+I+
Sbjct: 247 GNLYVVDTGNGTIRKIT 263



 Score = 42.7 bits (99), Expect = 0.24,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLYV DT N  IRKI  +GV T   G +   G  +G   +A F     +      
Sbjct: 243 VDSSGNLYVVDTGNGTIRKITSSGVVTTFAGSAGNYGATNGIGANALFYAPQGITIDLFG 302

Query: 61  CSLLVIDRGNAALRQIS 77
           C + V D GN  +R+I+
Sbjct: 303 C-VYVADTGNHTIRKIT 318



 Score = 42.4 bits (98), Expect = 0.29,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR---DGPSEDAKFSNDFDVVYV 57
           +D+ GN++VAD +N  IR++  +G   I    + +AGF    DG +  A+F     +  V
Sbjct: 572 VDNAGNVFVADYMNHTIRELIPSGTNWIVNTVAGLAGFWGSIDGTNTSARFFQPRSLS-V 630

Query: 58  RPTCSLLVIDRGNAALRQIS 77
             + +L V D GN A+R+I+
Sbjct: 631 DASGALYVADSGNHAIRKIT 650



 Score = 41.2 bits (95), Expect = 0.75,   Method: Composition-based stats.
 Identities = 24/49 (48%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 48
           +D  GN++VADT N  IRKI   G VTT+AG   N  G  DG + +A+F
Sbjct: 517 VDGTGNVFVADTFNNLIRKITPGGAVTTLAGNFENF-GSSDGTNSNARF 564



 Score = 39.7 bits (91), Expect = 2.3,   Method: Composition-based stats.
 Identities = 24/53 (45%), Positives = 31/53 (58%), Gaps = 3/53 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFSN 50
           +D  G LYVAD+ N AIRKI  +G   V T   G +  AG  DG   +A+FS+
Sbjct: 630 VDASGALYVADSGNHAIRKITPSGTNWVVTTVAGLAGAAGSVDGTGINAEFSH 682



 Score = 38.9 bits (89), Expect = 3.8,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +YVADT N  IRKI  D  VTT+AG   N  G  D  +  A F N   +     
Sbjct: 298 IDLFGCVYVADTGNHTIRKITSDGTVTTLAGLAGNY-GSADSVNSSASFWNPQGIT-SDA 355

Query: 60  TCSLLVIDRGNAALRQIS 77
           T +L + D GN  +R I+
Sbjct: 356 TGNLYIADTGNNTIRTIT 373


>gi|298386946|ref|ZP_06996500.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
 gi|298260096|gb|EFI02966.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
          Length = 457

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D++ NLY+AD+ N  IRK+   GV +   G  N +G++DG  E A F+  + +  +   
Sbjct: 380 LDEEENLYIADSGNHCIRKVTPEGVVSTVIGNPNTSGYKDGTPEIALFTEPWGLA-IDSE 438

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ + D+ N  +R++S+
Sbjct: 439 GTIYIGDKDNRCVRKLSI 456


>gi|398331208|ref|ZP_10515913.1| hypothetical protein LalesM3_04029 [Leptospira alexanderi serovar
           Manhao 3 str. L 60]
          Length = 357

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG   V+G+ DG    A+F +   + Y +
Sbjct: 184 LDRERNLYVGELGNHAIRKINLNSGTVSTLSGG---VSGYLDGDLASAQFKSPLGITYDQ 240

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  N  +R+I L
Sbjct: 241 KTDSLLVADLQNHRIRKIDL 260


>gi|291612976|ref|YP_003523133.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
 gi|291583088|gb|ADE10746.1| NHL repeat-containing protein [Sideroxydans lithotrophicus ES-1]
          Length = 385

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 3/77 (3%)

Query: 3   DKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  NLYVADT N  IRKI  A G  T   G + V G  DG    A+F++ F ++Y+    
Sbjct: 311 DGSNLYVADTNNHTIRKIVIATGNVTTPAGAALVPGTADGAGPTARFNHPFGIIYIN--G 368

Query: 62  SLLVIDRGNAALRQISL 78
           +L V D  N ++R++ L
Sbjct: 369 ALYVADYTNGSIRKVQL 385



 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 37/81 (45%), Gaps = 5/81 (6%)

Query: 3   DKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRD---GPSEDAKFSNDFDVVYV 57
           D  NLYV D     IRKI      VTT AG  +  +G  D   GP   A F+N + +V  
Sbjct: 75  DGTNLYVTDMYANNIRKIVIATGQVTTFAGSITGASGVTDAPSGPGTSALFNNPYGIVID 134

Query: 58  RPTCSLLVIDRGNAALRQISL 78
                L V D  N  +RQI +
Sbjct: 135 ATNTKLYVADGYNHTIRQIVI 155


>gi|86605024|ref|YP_473787.1| NHL repeat-containing protein [Synechococcus sp. JA-3-3Ab]
 gi|86553566|gb|ABC98524.1| NHL repeat domain protein [Synechococcus sp. JA-3-3Ab]
          Length = 637

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
            D KG L+VAD+ N  +R I     VTT AG  + VAG+RDG  ++A+F   F +  +  
Sbjct: 389 WDPKGGLWVADSGNHCLRHISRQRQVTTFAG--TCVAGYRDGERDEAQFREPFGLA-LGL 445

Query: 60  TCSLLVIDRGNAALRQIS 77
             SL V DR N  +R+I+
Sbjct: 446 DGSLYVADRANRRIRRIT 463



 Score = 38.1 bits (87), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +G  Y+AD     I ++   G   +  G S  AG RDG ++ A+F +   +++  P 
Sbjct: 335 VDPEGFFYLADPAQHRIFRLSPEGELEVWAG-SGKAGHRDGAADQAQFDSPQGLLW-DPK 392

Query: 61  CSLLVIDRGNAALRQIS 77
             L V D GN  LR IS
Sbjct: 393 GGLWVADSGNHCLRHIS 409


>gi|255532869|ref|YP_003093241.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255345853|gb|ACU05179.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 2296

 Score = 47.0 bits (110), Expect = 0.012,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN++VAD  N  IRK+  AGV +   GK   +GF +G S DA+F   F +     +
Sbjct: 726 VDANGNVFVADRNNFRIRKVTPAGVVSTYAGKG-TSGFAEGLSGDAQFKLVFGLAMSSIS 784

Query: 61  CSLLVIDRGN 70
            +L + D+GN
Sbjct: 785 GNLYISDQGN 794



 Score = 45.8 bits (107), Expect = 0.026,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YVAD  N  IRKI  AGV T   G +  A F DG   +A+F+    V  V    
Sbjct: 673 DAAGNVYVADRDNHRIRKITAAGVVTTLAG-NGTADFADGTQANARFNMPVGVA-VDANG 730

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V DR N  +R+++
Sbjct: 731 NVFVADRNNFRIRKVT 746



 Score = 42.0 bits (97), Expect = 0.41,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D+ GNLYV D    +IRKI   GV +   G S V GF+DG    A+F+    +      
Sbjct: 618 IDNSGNLYVTDQDVNSIRKITPGGVVSTFAG-SGVKGFKDGTGTAAQFAFPTGLA-ADAA 675

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V DR N  +R+I+
Sbjct: 676 GNVYVADRDNHRIRKIT 692


>gi|110639161|ref|YP_679370.1| hypothetical protein CHU_2779 [Cytophaga hutchinsonii ATCC 33406]
 gi|110281842|gb|ABG60028.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
          Length = 1051

 Score = 47.0 bits (110), Expect = 0.012,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D   N+Y+AD  N  IRKI  AGV     G S  AG  DG    A+F+N + +  +  +
Sbjct: 116 LDAAQNIYIADNGNHCIRKITPAGVVITFAG-SGTAGSNDGTGTAAQFNNPYGMA-IDAS 173

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D GN  +R+I+
Sbjct: 174 GNLYVADSGNNLIRKIT 190



 Score = 44.7 bits (104), Expect = 0.061,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN ++ D  N  IRK+  AGV +   G S  AG+ +G    A+F+  + +V V   
Sbjct: 224 LDVSGNFFITDNRNHRIRKMTSAGVVSTVAG-SGSAGYMNGTGVTAQFNRPYGIV-VDAF 281

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D  N  +R+I+
Sbjct: 282 SNLYVTDTNNGVIRKIT 298



 Score = 42.4 bits (98), Expect = 0.35,   Method: Composition-based stats.
 Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           +D   NLYV DT N  IRKI  +GV +   G +   GF DGP+ +A+F
Sbjct: 278 VDAFSNLYVTDTNNGVIRKITSSGVVSTYAG-TGTPGFADGPAANAQF 324



 Score = 41.6 bits (96), Expect = 0.55,   Method: Composition-based stats.
 Identities = 28/78 (35%), Positives = 44/78 (56%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GNLYVAD+ N  IRKI  +G VTTIAG  +   G+ DG    A+F    ++  +  
Sbjct: 170 IDASGNLYVADSGNNLIRKITSSGVVTTIAG--NTTPGYVDGTGTAARFYLPVNIT-LDV 226

Query: 60  TCSLLVIDRGNAALRQIS 77
           + +  + D  N  +R+++
Sbjct: 227 SGNFFITDNRNHRIRKMT 244


>gi|290979906|ref|XP_002672674.1| predicted protein [Naegleria gruberi]
 gi|284086252|gb|EFC39930.1| predicted protein [Naegleria gruberi]
          Length = 747

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 48/74 (64%), Gaps = 5/74 (6%)

Query: 4   KGNLYVADTLNLAIRKI-GDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
            G+LY+ D++N AIRKI  + G ++TIAG  + +AGF+DG +  A+F+++  +  V P  
Sbjct: 510 NGDLYICDSINHAIRKIDHETGIISTIAG--NGIAGFKDGNASIAQFNSNLGLS-VLPNG 566

Query: 62  SLLVIDRGNAALRQ 75
            LL+ D  N  +R+
Sbjct: 567 DLLIADYNNNRIRK 580


>gi|29347802|ref|NP_811305.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383120108|ref|ZP_09940841.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
 gi|29339703|gb|AAO77499.1| NHL repeat-containing protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251838367|gb|EES66454.1| hypothetical protein BSIG_4606 [Bacteroides sp. 1_1_6]
          Length = 440

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 48/80 (60%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDA--GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D +GNLY+AD  N  IRK+      VTTIAG  S  AG++DG   +A+F+  +  VY+ 
Sbjct: 363 IDKEGNLYIADVGNNRIRKVDTKLNVVTTIAG--SGAAGYKDGDPLEAQFNQPWG-VYLD 419

Query: 59  PTCSLLVIDRGNAALRQISL 78
               L + D+ N  +R++++
Sbjct: 420 KNEFLYIADQNNHCIRKLAI 439


>gi|344205849|ref|YP_004790990.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
 gi|343777211|gb|AEM49764.1| NHL repeat containing protein [Stenotrophomonas maltophilia JV3]
          Length = 693

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/48 (56%), Positives = 33/48 (68%), Gaps = 4/48 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           D +GNLYVADT N AIR+I  D  VTT+AGG+    G  DGP+  A+F
Sbjct: 132 DVQGNLYVADTGNHAIRRISTDGQVTTLAGGEQ---GHADGPAAQARF 176



 Score = 44.3 bits (103), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G +YVADT N  IR IG D  V T+AGG     GF DG    A+F     + +   
Sbjct: 184 VDAQGQVYVADTFNDRIRVIGTDGSVRTLAGGDR--PGFADGVGTAARFDTPVALAF-DA 240

Query: 60  TCSLLVIDRGNAALRQISLN 79
             +LLV D  N A+R++  +
Sbjct: 241 HGALLVADLFNNAVRRVGAD 260


>gi|42522312|ref|NP_967692.1| hypothetical protein Bd0727 [Bdellovibrio bacteriovorus HD100]
 gi|39574843|emb|CAE78685.1| hypothetical protein with NHL repeat [Bdellovibrio bacteriovorus
           HD100]
          Length = 709

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 1/77 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  G+LYV+D+ N  +RK+  AGV T   G+  + G  DG    A FS    +  V P+ 
Sbjct: 522 DGAGSLYVSDSGNYTVRKVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGIT-VTPSG 580

Query: 62  SLLVIDRGNAALRQISL 78
           ++ V D  N  +R+I++
Sbjct: 581 NIFVADTDNNVIRKITV 597



 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 4   KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
            GN++VADT N  IRKI  AGV T   G +   G  DG   +A+FS     V    + +L
Sbjct: 579 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPH-FVATDSSGNL 637

Query: 64  LVIDRGNAALRQIS 77
            V + G A +R+I+
Sbjct: 638 YVAEWGEATIRKIT 651



 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 49
           D  GNLYVADT+N  IRKI  +G VTTIAG    + G  DG    A+FS
Sbjct: 414 DPAGNLYVADTMNRTIRKITPSGNVTTIAGSPGQI-GSADGTGAAARFS 461



 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNL+V +  N  IRKI  A V T   G    AG  DG    A+F+N  D+  +   
Sbjct: 137 IDGSGNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDIT-LAAD 195

Query: 61  CSLLVIDRGNAALRQIS 77
            +  + D+ N  +R+++
Sbjct: 196 GNFYITDKNNNMIRKMT 212



 Score = 38.1 bits (87), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GNL+V  +    IRKI  AGV T   G++   G  DG    A+FS    +  +  + 
Sbjct: 247 DSAGNLFVVCSSCSTIRKITPAGVVTTFAGQAYTTGAVDGTGTAARFSWPVGIT-IDSSD 305

Query: 62  SLLVIDRGNAALRQIS 77
           +L V D  N+A+R+++
Sbjct: 306 NLYVADYSNSAIRKVT 321



 Score = 37.7 bits (86), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D   NLYVAD  N AIRK+  + V +   G     G  DG    A+F+     V +  +
Sbjct: 301 IDSSDNLYVADYSNSAIRKVTSSAVVSNFAGSYGDYGAVDGTGTAARFAGPAG-VGIDAS 359

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D  NA++R+++
Sbjct: 360 GNLFVTDSDNASIRKVT 376


>gi|116749588|ref|YP_846275.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
 gi|116698652|gb|ABK17840.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
          Length = 1750

 Score = 47.0 bits (110), Expect = 0.015,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+Y AD+ N  IRKI  +G +TT+AG  S  +    GP+  A+ +  F +  V P
Sbjct: 102 VDAAGNVYFADSNNHCIRKIDTSGIITTVAGTGSAGSNGDGGPAASARLAYPFGIA-VDP 160

Query: 60  TCSLLVIDRGNAALRQI 76
           + ++ V D GN  +R+I
Sbjct: 161 SGNIYVADLGNHKVRRI 177


>gi|383454552|ref|YP_005368541.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
 gi|380728713|gb|AFE04715.1| putative lipoprotein [Corallococcus coralloides DSM 2259]
          Length = 861

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YV+DT N  IR+I   G   +  G +   G++DGP+  A F+    + +     
Sbjct: 632 DALGNVYVSDTENAVIRRISPEGKAEVFAGTT--WGYQDGPALSAAFNQPAGLSFTPDGT 689

Query: 62  SLLVIDRGNAALRQISL 78
           +LLV D  N+ +R+I +
Sbjct: 690 ALLVADLNNSVIRRIDM 706


>gi|410694120|ref|YP_003624742.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
           3As]
 gi|294340545|emb|CAZ88930.1| putative Soluble quinoprotein glucose dehydrogenase [Thiomonas sp.
           3As]
          Length = 366

 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 45/77 (58%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GN++VAD +N  +RK+G  G  +I  G+      R+GP+  A+F +  + V V   
Sbjct: 67  LDAQGNIFVADYVNSVVRKLGTDGQVSIVAGQVEQRDARNGPALQARFYSP-ECVAVATD 125

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D G+  +R+IS
Sbjct: 126 GTLFVSDSGSNTVRRIS 142



 Score = 45.1 bits (105), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +G +YV++  N  IRKI   G  T   GK    GF DG   +A F +   + +  P 
Sbjct: 232 LDAQGQIYVSEYFNNVIRKITPDGTVTTFAGKPGKGGFADGNVGEALFLHPQTLSFA-PD 290

Query: 61  CSLLVIDRGNAALRQIS 77
            SL+V D GN  +R+IS
Sbjct: 291 GSLIVADTGNNRVRRIS 307


>gi|421097867|ref|ZP_15558546.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
           str. 200901122]
 gi|410799150|gb|EKS01231.1| hypothetical protein LEP1GSC125_1130 [Leptospira borgpetersenii
           str. 200901122]
          Length = 357

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG   V G+ DG    A+F +   + Y +
Sbjct: 184 LDRERNLYVGELSNHAIRKINLNSGTVSTLSGG---VLGYLDGDLASAQFKSPLGITYDQ 240

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  N  +R+I L
Sbjct: 241 KTDSLLVADIQNHNIRKIDL 260


>gi|325186506|emb|CCA21046.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1881

 Score = 46.6 bits (109), Expect = 0.016,   Method: Composition-based stats.
 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGD-AGVTTIAGGKSNVAGFRDGPSEDAK--FSNDFDVVYV 57
           +D  G  YV DT N  IR+I   + VTT AG ++   GFRDGP + A+  F +   VV  
Sbjct: 258 IDSTGVTYVVDTGNHCIRRIDQRSRVTTFAGNRTR--GFRDGPLDIAQYNFPSGIAVVNE 315

Query: 58  RPTCSLLVIDRGNAALRQISLNQDDC 83
           +   ++ V D GN  +R+I   Q  C
Sbjct: 316 QNRVTVYVADTGNHRIRRIRDGQVAC 341



 Score = 40.4 bits (93), Expect = 1.2,   Method: Composition-based stats.
 Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNV--AGFRDGPSEDAKFSNDFDVVYVRPT 60
           ++  +YVADT N  IR+I D  V  IAG   ++   GF DG + +++F     +  +   
Sbjct: 317 NRVTVYVADTGNHRIRRIRDGQVACIAGRCDSIPHPGFSDGNASESRFDTPLGLA-IDVD 375

Query: 61  CSLLVIDRGNAALRQI 76
            +L+V D GN  +R I
Sbjct: 376 GNLIVADSGNNLIRLI 391


>gi|195641946|gb|ACG40441.1| hypothetical protein [Zea mays]
          Length = 79

 Score = 46.6 bits (109), Expect = 0.016,   Method: Composition-based stats.
 Identities = 22/34 (64%), Positives = 25/34 (73%)

Query: 24 GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 57
          GVTTI  GKS   G  DGPS +AKFSNDF+V Y+
Sbjct: 43 GVTTIDWGKSGRGGHVDGPSGEAKFSNDFEVHYI 76


>gi|426402696|ref|YP_007021667.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
           Tiberius]
 gi|425859364|gb|AFY00400.1| hypothetical protein Bdt_0693 [Bdellovibrio bacteriovorus str.
           Tiberius]
          Length = 692

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 1/77 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  G+LYV+D+ N  +RK+  AGV T   G+  + G  DG    A FS    +  V P+ 
Sbjct: 505 DGAGSLYVSDSGNYTVRKVTLAGVVTTLAGQVGIQGSDDGTGTGATFSRVAGIT-VTPSG 563

Query: 62  SLLVIDRGNAALRQISL 78
           ++ V D  N  +R+I++
Sbjct: 564 NIFVADTDNNVIRKITV 580



 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 41/76 (53%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GNL+V  +    IRKI  AGV T   G +N  G  DG    A+FS    +  +  + 
Sbjct: 230 DSAGNLFVVCSSCSTIRKITPAGVVTTFAGLANATGALDGTGTAARFSWPIGIT-IDSSD 288

Query: 62  SLLVIDRGNAALRQIS 77
           +L V D GN+A+R+++
Sbjct: 289 NLYVADYGNSAIRKVT 304



 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 4   KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
            GN++VADT N  IRKI  AGV T   G +   G  DG   +A+FS     V    + +L
Sbjct: 562 SGNIFVADTDNNVIRKITVAGVVTTFAGAAGQGGNDDGMGSNARFSQPH-FVATDSSGNL 620

Query: 64  LVIDRGNAALRQIS 77
            V + G A +R+I+
Sbjct: 621 YVAEWGEATIRKIT 634



 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 49
           D  GNLYVADT+N  IRKI  +G VTTIAG    + G  DG    A+FS
Sbjct: 397 DPAGNLYVADTMNRTIRKITPSGNVTTIAGSPGQI-GSADGTGAAARFS 444



 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNL+V +  N  IRKI  A V T   G    AG  DG    A+F+N  D+  +   
Sbjct: 120 IDGSGNLFVTEGNNHTIRKITPAAVVTTVAGSPGNAGTADGTGSAARFNNPEDIT-LAAD 178

Query: 61  CSLLVIDRGNAALRQIS 77
            +  + D+ N  +R+++
Sbjct: 179 GNFYITDKNNNMIRKMT 195


>gi|380692993|ref|ZP_09857852.1| cell surface protein [Bacteroides faecis MAJ27]
          Length = 457

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D++ NLY+AD+ N  IRK+   GV +   G  N +G++DG  E A F+  + +  +   
Sbjct: 380 LDEEENLYIADSGNHCIRKVTPEGVVSTVIGNPNSSGYKDGTPEIALFTEPWGLA-IDSE 438

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ + D+ N  +R++S+
Sbjct: 439 GTIYIGDKDNRCVRKLSI 456


>gi|182416423|ref|YP_001821489.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
 gi|177843637|gb|ACB77889.1| NHL repeat containing protein [Opitutus terrae PB90-1]
          Length = 1026

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 2/79 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+ VAD  N AIRK+  +GV +   GK+ ++G  DG  + A+FS    V   R T
Sbjct: 277 IDAYGNILVADEGNCAIRKVSPSGVVSTVAGKTGLSGSDDGV-DAARFSLPRGVAVSR-T 334

Query: 61  CSLLVIDRGNAALRQISLN 79
             + V D GN+ +R+I++ 
Sbjct: 335 GDIYVADSGNSTVRRIAVG 353



 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 39/72 (54%), Gaps = 3/72 (4%)

Query: 5   GNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           G+LY+ADT N  IR+I  D  + TI GG     G +DG   +A+F N + V  V     L
Sbjct: 604 GSLYIADTGNSTIRQIRPDGSMVTIGGGMRQ-EGKQDGRGGEARFLNPYGVA-VDAAGHL 661

Query: 64  LVIDRGNAALRQ 75
            V D GN  +R+
Sbjct: 662 YVADSGNNLVRK 673


>gi|427400920|ref|ZP_18892158.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
 gi|425720099|gb|EKU83025.1| hypothetical protein HMPREF9710_01754 [Massilia timonae CCUG 45783]
          Length = 717

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 40/68 (58%), Gaps = 10/68 (14%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFS-------NDF 52
           +D +GNLYVADT N AIRK+  D  VTT+AG  S   G+ DG    A+F+       +D 
Sbjct: 136 LDHEGNLYVADTGNHAIRKVAPDGTVTTVAG--SGSPGYLDGIGRAAQFNGPVGIAVDDA 193

Query: 53  DVVYVRPT 60
            +VYV  T
Sbjct: 194 GIVYVADT 201



 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 42/77 (54%), Gaps = 4/77 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +DD G +YVADT N  IR+I  D  VTT+AG  +   G  DG   DA F     +   R 
Sbjct: 190 VDDAGIVYVADTYNDRIRRIAPDGMVTTLAG--NGKPGLLDGALLDAGFDTPSALAAGR- 246

Query: 60  TCSLLVIDRGNAALRQI 76
             +L V D GN A+R+I
Sbjct: 247 DGTLYVADTGNHAVRRI 263


>gi|223937279|ref|ZP_03629185.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223894064|gb|EEF60519.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 429

 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D+ GN+YVADT N  IRKI  AG+ +   G +   G  DG +  A+F N    + V   
Sbjct: 169 VDNGGNVYVADTWNHTIRKITPAGLVSTLAGLAGNPGSADGTNSKARF-NRPSGIAVDNA 227

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D  N  +R+I+
Sbjct: 228 TNLFVTDFHNHTIRKIT 244



 Score = 40.8 bits (94), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 6/77 (7%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 57
           D+ GNL+VAD+ N  IRKI  +G    V+T+A G S +AG  +G +  A+F    DV   
Sbjct: 280 DNAGNLFVADSGNQTIRKISPSGTNWIVSTVA-GLSGIAGNANGTNNTARFYFPADVAQ- 337

Query: 58  RPTCSLLVIDRGNAALR 74
                + V D GN A+R
Sbjct: 338 DIAGYIYVADLGNNAIR 354



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 37/77 (48%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D+  NL+V D  N  IRKI   G  T   G   V G  DG +  A+F     +V     
Sbjct: 224 VDNATNLFVTDFHNHTIRKITPGGTVTTIAGLPGVWGNADGTNNVARFFQPQGIV-ADNA 282

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D GN  +R+IS
Sbjct: 283 GNLFVADSGNQTIRKIS 299



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +YVADT N  IRKI  AG V+T+AG   N+  F DG   +A F     +  V  
Sbjct: 114 VDSAGFIYVADTANATIRKITPAGVVSTLAGSAGNINSF-DGTGINANFYQPRSLA-VDN 171

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ V D  N  +R+I+
Sbjct: 172 GGNVYVADTWNHTIRKIT 189



 Score = 37.7 bits (86), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 36/76 (47%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YVADT N  IRKI   G  +   G +   G  DG   +A F     +  V    
Sbjct: 60  DSAGNIYVADTENSTIRKITPNGSVSTFAGFAGTFGSADGVGTNALFYAPQGIA-VDSAG 118

Query: 62  SLLVIDRGNAALRQIS 77
            + V D  NA +R+I+
Sbjct: 119 FIYVADTANATIRKIT 134


>gi|380692925|ref|ZP_09857784.1| hypothetical protein BfaeM_02978 [Bacteroides faecis MAJ27]
          Length = 454

 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GNL + D +N  IRKI   G  T+  G+   +G  DG  + AKF     V +    
Sbjct: 378 LDPEGNLLIPDKMNHCIRKITPEGEVTLYAGQPQTSGHTDGLPDKAKFYEPEAVTF--SG 435

Query: 61  CSLLVIDRGNAALRQISL 78
            +L+V DRGN  +R + +
Sbjct: 436 NALIVADRGNHCVRNVVI 453


>gi|147865073|emb|CAN83232.1| hypothetical protein VITISV_021876 [Vitis vinifera]
          Length = 1694

 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 1/46 (2%)

Query: 24   GVTTIAGGKSNVA-GFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDR 68
            G   IA GK     G  DGP +D K SNDFD VY+  +CSL+VID+
Sbjct: 1646 GGYNIARGKWGRGEGHVDGPIQDTKVSNDFDKVYIGSSCSLMVIDK 1691


>gi|392945712|ref|ZP_10311354.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
 gi|392289006|gb|EIV95030.1| protein kinase family protein,NHL repeat protein [Frankia sp. QA3]
          Length = 770

 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           + D G+LYVADT N  +R+I  +G  T+  G+    G R DG +  A   +D   + + P
Sbjct: 622 LGDDGSLYVADTGNETVRRIDPSGRITLVAGRPGTYGHRGDGKAATAALLDDPRGLALGP 681

Query: 60  TCSLLVIDRGNAALRQI 76
             SL + D GN  +R++
Sbjct: 682 DGSLYIADSGNDVIRRV 698


>gi|392540748|ref|ZP_10287885.1| Ig family protein [Pseudoalteromonas piscicida JCM 20779]
          Length = 2384

 Score = 45.8 bits (107), Expect = 0.026,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLYV +T    +RKI  AGV T   G  N +GF D     A F+  ++       
Sbjct: 681 IDSNGNLYVVETNPHIVRKITPAGVVTTFAGSKNSSGFTDATGTSATFNFPYN-GGSNSN 739

Query: 61  CSLLVIDRGNAALRQIS 77
             L + DR N A+R+++
Sbjct: 740 NDLFIADRNNHAIRKVT 756



 Score = 41.6 bits (96), Expect = 0.59,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 36/76 (47%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D   NLYV +     IRKI   GV T   G +  +G  DG    A+FS  + +  V    
Sbjct: 791 DSADNLYVTEQAAHTIRKITSTGVVTTYAGSAGASGNTDGLVSVARFSQPYGIA-VDSND 849

Query: 62  SLLVIDRGNAALRQIS 77
            + V D GN  +R+IS
Sbjct: 850 VVYVADTGNHRIRKIS 865



 Score = 39.7 bits (91), Expect = 1.9,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YVAD  N  IRKI   GV T   G S  AG  DG    A F N    V +  + 
Sbjct: 574 DSNGNVYVADYGNHVIRKITPEGVVTTLAG-SGSAGSDDGTGSAASF-NFPKAVTLDSSG 631

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D  N  +R+++
Sbjct: 632 NVYVADSSNNNIRKVT 647



 Score = 37.7 bits (86), Expect = 8.4,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 49
           +D  GN+YVAD+ N  IRK+  AGV T   G S   G  DG    A F+
Sbjct: 627 LDSSGNVYVADSSNNNIRKVTPAGVVTTFAG-SGTYGSDDGTGTAATFA 674


>gi|108761192|ref|YP_633033.1| lipoprotein [Myxococcus xanthus DK 1622]
 gi|108465072|gb|ABF90257.1| putative lipoprotein [Myxococcus xanthus DK 1622]
          Length = 868

 Score = 45.8 bits (107), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 40/78 (51%), Gaps = 4/78 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GN+YVADT +  IR+I   G V   AGG     G  DGP++ A F+    +      
Sbjct: 638 DAAGNVYVADTDHYVIRRIDPEGKVEVFAGGTP---GLMDGPAKQAAFNQPTGLAVTPDG 694

Query: 61  CSLLVIDRGNAALRQISL 78
            +LLV D  N  +R+I L
Sbjct: 695 TALLVADMNNGVIRRIDL 712


>gi|284038675|ref|YP_003388605.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
 gi|283817968|gb|ADB39806.1| NHL repeat containing protein [Spirosoma linguale DSM 74]
          Length = 359

 Score = 45.8 bits (107), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
            D   NLY+ +T    +RKI  D  V+ + G  S + G++DGP + A+F +  D+ +  P
Sbjct: 283 FDSNDNLYIGETGAGIVRKIATDGQVSDVTG--SRLGGYKDGPLQAAEFGSVEDLAF-SP 339

Query: 60  TCSLLVIDRGNAALRQISL 78
           + SL V D  N A+R+I+ 
Sbjct: 340 SGSLYVADNRNGAIRKITF 358



 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 31/55 (56%), Gaps = 5/55 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR----DGPSEDAKFSN 50
            D  GNLYVAD+ N AIRKI  D  VTT AG   +V        DG  ++A+F N
Sbjct: 110 FDRAGNLYVADSGNQAIRKISPDGQVTTFAGQPYDVTSITNVSVDGIGKEARFYN 164


>gi|182414082|ref|YP_001819148.1| NHL repeat-containing protein [Opitutus terrae PB90-1]
 gi|177841296|gb|ACB75548.1| NHL repeat containing protein [Opitutus terrae PB90-1]
          Length = 963

 Score = 45.8 bits (107), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKI----GDA--GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 54
           +D  GN++VAD+LN  IR++    G A  GV T   G++ V G  DG    A+F+  + +
Sbjct: 289 VDRDGNIFVADSLNNTIRRVTPLNGPAPLGVVTTVAGQAGVTGSADGVGSQARFNLPYGI 348

Query: 55  VYVRPTCSLLVIDRGNAALRQIS 77
             V    ++ V D GN  +R+I+
Sbjct: 349 A-VDAAGNIFVADLGNTTIRKIA 370



 Score = 42.4 bits (98), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN++VAD  N  IRKI  +G  T   G+++V G  DGP   A+F N  + V V   
Sbjct: 350 VDAAGNIFVADLGNTTIRKIAPSGAVTTLAGEASV-GTADGPGPMARF-NYPNGVAVDLA 407

Query: 61  CSLLVIDRGNAALRQIS 77
            +  V D  NA +R+I+
Sbjct: 408 GNTYVADTFNATIRKIT 424



 Score = 42.0 bits (97), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNL+VADT N  IRKI  +G  +   G     G  DG    A+F +   V  +   
Sbjct: 70  VDQAGNLFVADTTNHTIRKITPSGTVSTFAGMGGQPGSVDGTGNAARFLSPHGVA-LDEA 128

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D GN  +R+I+
Sbjct: 129 GNLYVADSGNNTIRKIT 145



 Score = 40.8 bits (94), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 26/48 (54%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           +D  GN YVADT N  IRKI  AGV +   G +   G  DG    A+F
Sbjct: 404 VDLAGNTYVADTFNATIRKITPAGVVSTLAGAAGQIGSADGTGSAARF 451



 Score = 40.8 bits (94), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GNLYV+D  +  +RKI  AG  T   G+  +AG  DG    A+F+    +   R  
Sbjct: 568 IDAAGNLYVSDRGDFTVRKITPAGEVTTVAGQHGIAGGADGTGSAAQFAYAGGIAIDR-R 626

Query: 61  CSLLVIDRGNAALRQIS 77
            +L V D  N  +RQI+
Sbjct: 627 GTLYVADSNN-RIRQIT 642


>gi|403382146|ref|ZP_10924203.1| copper amine oxidase domain-containing protein [Paenibacillus sp.
           JC66]
          Length = 531

 Score = 45.8 bits (107), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 44/77 (57%), Gaps = 4/77 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GNL+VAD  N A+RK+  +  VTT AG    V G +DG +E++ F    DVV V  
Sbjct: 132 LDHNGNLFVADKGNHAVRKVDAEGNVTTYAG--QGVLGHKDGTAEESLFYAPEDVV-VAS 188

Query: 60  TCSLLVIDRGNAALRQI 76
             ++ V D  N  +R+I
Sbjct: 189 DGTVYVADTLNHVIRKI 205



 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 13/102 (12%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPSEDAKFSNDF 52
           G +YVADTLN  IRKI   G V+T+            G++ +AG ++DGP ++AKF N+ 
Sbjct: 190 GTVYVADTLNHVIRKIDPEGKVSTLNALPQRYIEVFPGEAVLAGDYKDGPLQEAKF-NEP 248

Query: 53  DVVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTD 94
             + +    +L + D GN  +R + L  D       S+   D
Sbjct: 249 TGLAIDHLGNLYISDTGNRVIRYMDLANDRVSTVAGSVQLYD 290



 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 1/80 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           + ++G L +AD+LN AIR + +  V T+AGG     G +DG +   + ++  D V V   
Sbjct: 322 ITEEGGLVIADSLNHAIRYLFEGRVITLAGGHEAEHGQQDGINGYNRLNHPQD-VQVAAD 380

Query: 61  CSLLVIDRGNAALRQISLNQ 80
            S+++ D  N  LR   L Q
Sbjct: 381 GSIIIADAYNNQLRAFQLYQ 400


>gi|320105416|ref|YP_004181006.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319923937|gb|ADV81012.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 822

 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 43/78 (55%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GNLYVADT N  IRKI    +TT+AG G+   AG  DG    A + +  D V V  
Sbjct: 180 VDTGGNLYVADTDNHRIRKISGTTITTVAGSGEQGFAG--DGGPATAAWLDSPDGVAVDA 237

Query: 60  TCSLLVIDRGNAALRQIS 77
           T +L + D  N  +R +S
Sbjct: 238 TGNLYIADTHNQRIRVVS 255


>gi|307110397|gb|EFN58633.1| hypothetical protein CHLNCDRAFT_140870 [Chlorella variabilis]
          Length = 415

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 4/87 (4%)

Query: 4   KGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
            G+L VADT N  IR+I  A   V+T+A G+    G  DGP+ +A+F +    +     C
Sbjct: 118 HGHLAVADTGNACIRQIDAATKQVSTLA-GRCGEPGAADGPAAEAQFGSSIKSIACA-NC 175

Query: 62  SLLVIDRGNAALRQISLNQDDCEYQYN 88
           S+ V D     LR + ++  +C    N
Sbjct: 176 SVFVGDVSTGRLRLVRVDDAECLGASN 202


>gi|116672456|ref|YP_833389.1| NHL repeat-containing protein [Arthrobacter sp. FB24]
 gi|116612565|gb|ABK05289.1| NHL repeat containing protein [Arthrobacter sp. FB24]
          Length = 672

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 45/79 (56%), Gaps = 6/79 (7%)

Query: 2   DDKGNLYVADTLNLAIRK--IGDAGVTTI--AGGKSNVA-GFRDGPSEDAKFSNDFDVVY 56
           D  GN++VAD+   A+RK  I DAG  T+  A GK     GFRDGP+ +A+  +   V  
Sbjct: 406 DADGNIWVADSETSALRKLVIDDAGTVTVESAVGKGLFDFGFRDGPAAEARLQHPLGVT- 464

Query: 57  VRPTCSLLVIDRGNAALRQ 75
           V P  S+ + D  N A+R+
Sbjct: 465 VLPDGSVAIADTYNGAVRR 483


>gi|421094122|ref|ZP_15554843.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
 gi|410363263|gb|EKP14295.1| putative lipoprotein [Leptospira borgpetersenii str. 200801926]
 gi|456886734|gb|EMF97860.1| putative lipoprotein [Leptospira borgpetersenii str. 200701203]
          Length = 358

 Score = 45.4 bits (106), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   V+G+ DG    A+F +   ++Y +
Sbjct: 185 LDRERNLYVGELGNHTIRKINLNSETVSTLSGG---VSGYLDGDLTSAQFKSPSGIIYDQ 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  N  +R+I L
Sbjct: 242 KTDSLLVADLQNHRIRKIDL 261


>gi|116619658|ref|YP_821814.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116222820|gb|ABJ81529.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 585

 Score = 45.1 bits (105), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           +D  GN Y+AD+LN  IRK+    +TT+AG  +   G + DG   +    N+   V V P
Sbjct: 114 LDSSGNFYIADSLNSVIRKVTGGTITTVAGDYTQFPGDQGDGGQANVAVLNNPTSVMVDP 173

Query: 60  TCSLLVIDRGNAALRQI 76
             +  + D GN  +R++
Sbjct: 174 AGNYYIADSGNNRIRKV 190


>gi|302821969|ref|XP_002992645.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
 gi|300139609|gb|EFJ06347.1| hypothetical protein SELMODRAFT_6806 [Selaginella moellendorffii]
          Length = 79

 Score = 44.7 bits (104), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 25/41 (60%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 3  DKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGP 42
          D G ++VADT NLAIRKI  D  VTTIAGG S   G  D P
Sbjct: 39 DNGAVFVADTTNLAIRKISKDGEVTTIAGGSSRRPGIADSP 79


>gi|290992324|ref|XP_002678784.1| predicted protein [Naegleria gruberi]
 gi|284092398|gb|EFC46040.1| predicted protein [Naegleria gruberi]
          Length = 2331

 Score = 44.7 bits (104), Expect = 0.062,   Method: Composition-based stats.
 Identities = 22/80 (27%), Positives = 44/80 (55%), Gaps = 1/80 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVRP 59
           +++ G++Y++D  N  IRK+  +G  T   G++ ++GF D   ++ A  +  F + Y   
Sbjct: 539 INNVGDMYISDYANNVIRKMSSSGAITTIAGQAKISGFADSLNAKTALLNGPFGLYYQES 598

Query: 60  TCSLLVIDRGNAALRQISLN 79
           T +L   D  N  +R+++ N
Sbjct: 599 TQTLFFADSLNGRIRKLTKN 618


>gi|344924046|ref|ZP_08777507.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 368

 Score = 44.7 bits (104), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  G +YV DT    +RKI   GVT++  G SN  G+ +G    A+FS+ + +  V  +
Sbjct: 224 VDSSGTVYVTDTYTQRVRKITSGGVTSLLAG-SNTIGYAEGTGASARFSSPYGIA-VDSS 281

Query: 61  CSLLVIDRGNAALRQIS 77
            +  V D  N  +R+I+
Sbjct: 282 GTAYVADSDNHRIRKIT 298



 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVT-TIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +YVAD  N  IR+I  AGVT T+AG  S+  G+ +G    A+F    DV  V  
Sbjct: 170 VDSSGTVYVADFFNSRIRRITSAGVTSTLAG--SSTGGYLEGTGGAAQFGTPIDVA-VDS 226

Query: 60  TCSLLVIDRGNAALRQIS 77
           + ++ V D     +R+I+
Sbjct: 227 SGTVYVTDTYTQRVRKIT 244



 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 3/77 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  G LYV++  N  IRKI  AGVT++  G +   G+ +G    A+F   + V  V  +
Sbjct: 117 VDSSGTLYVSEYTNHRIRKITSAGVTSLLAGSAQ--GYAEGTGSGARFDRPYSVA-VDSS 173

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ V D  N+ +R+I+
Sbjct: 174 GTVYVADFFNSRIRRIT 190


>gi|348676369|gb|EGZ16187.1| hypothetical protein PHYSODRAFT_506757 [Phytophthora sojae]
          Length = 1981

 Score = 44.7 bits (104), Expect = 0.065,   Method: Composition-based stats.
 Identities = 33/83 (39%), Positives = 44/83 (53%), Gaps = 9/83 (10%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPS-EDAKFSNDFDVVYV 57
           +D  G +YVADT N  IRKI      V+T+AG    + GF DG +   A+FS   DV  +
Sbjct: 233 VDSNGVVYVADTANHRIRKINPTTKMVSTLAG--DGIEGFADGAALSAARFSYPSDVAVL 290

Query: 58  R----PTCSLLVIDRGNAALRQI 76
                 T ++ V D GN  +RQI
Sbjct: 291 ETNGGATVTVFVADTGNHRIRQI 313


>gi|428172619|gb|EKX41527.1| hypothetical protein GUITHDRAFT_153825 [Guillardia theta CCMP2712]
          Length = 285

 Score = 44.7 bits (104), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 6/76 (7%)

Query: 6   NLYVADTLNLAIRKIG-----DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           N+++ADT N AIR I      +  VT IAG +S  +GF DG  E A+F+    +  V   
Sbjct: 134 NIFIADTYNHAIRHITRDEDRNWTVTLIAGSRSGQSGFEDGEGESARFNCPTGLAVVNEQ 193

Query: 61  CSLLVIDRGNAALRQI 76
             +L+ D  N+A+R +
Sbjct: 194 -EILISDFSNSAIRLL 208


>gi|421111981|ref|ZP_15572448.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
           JET]
 gi|410802632|gb|EKS08783.1| hypothetical protein LEP1GSC071_4063 [Leptospira santarosai str.
           JET]
          Length = 356

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG   + G+ DG    A+F +   + Y +
Sbjct: 183 LDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---ILGYLDGDLTSAQFKSPLGITYDQ 239

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 240 KTDSLLVADLQDHRIRKIDL 259


>gi|365121900|ref|ZP_09338810.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643613|gb|EHL82920.1| hypothetical protein HMPREF1033_02156 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 443

 Score = 44.3 bits (103), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D+  NLY+ADT N  IRKI   G  +   G++ VAG++DG  E A F     V  +   
Sbjct: 366 LDEDDNLYIADTENHVIRKITPQGQVSTVIGQAGVAGYQDGDPEVALFDRPHGVC-INKE 424

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D  N  +R++++
Sbjct: 425 GIIYIGDYENQCIRRLAI 442


>gi|351729452|ref|ZP_08947143.1| NHL repeat-containing protein [Acidovorax radicis N35]
          Length = 659

 Score = 44.3 bits (103), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 28/48 (58%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF 48
           +D  GN+YVADT N  +R+I   G  T   GK  V G +DG S+ A F
Sbjct: 65  VDTAGNMYVADTENHLVRRISPTGAVTTLAGKPGVCGNQDGTSDTATF 112



 Score = 38.9 bits (89), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 8/77 (10%)

Query: 2   DDKGNLYVADTL-------NLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD 53
           D  GN+YV D L          IRKI  +G V+T+AG  +   G+ DG    A F+ D  
Sbjct: 513 DAAGNVYVVDGLFHEVNKIGPTIRKITPSGMVSTLAGNPAAAPGYADGTGAAAVFTVDLG 572

Query: 54  VVYVRPTCSLLVIDRGN 70
             Y + T +L V  +GN
Sbjct: 573 FRYPQQTAALAVDAQGN 589


>gi|301064161|ref|ZP_07204608.1| RHS repeat-associated core domain protein [delta proteobacterium
           NaphS2]
 gi|300441781|gb|EFK06099.1| RHS repeat-associated core domain protein [delta proteobacterium
           NaphS2]
          Length = 2050

 Score = 44.3 bits (103), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 45/82 (54%), Gaps = 6/82 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYV 57
           +D  GNLY++DT N  IRK+   G +TT+AG  + VAGF    GP+ +A       +  +
Sbjct: 893 IDSAGNLYISDTGNYCIRKVDTNGIITTVAG--NGVAGFSGDGGPAVEASLGWAMGIA-I 949

Query: 58  RPTCSLLVIDRGNAALRQISLN 79
               +L ++D  N  +R++  N
Sbjct: 950 DSAGNLYILDGSNHRVRKVDTN 971



 Score = 44.3 bits (103), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRP 59
           +D  GNLY+ D+ N  +RK+   G+ T   G  N +   D GP+ +A  S+    + +  
Sbjct: 836 IDSAGNLYILDSDNNRVRKVDTNGIITTVAGNGNWSYNGDGGPAVEASLSSAASGIAIDS 895

Query: 60  TCSLLVIDRGNAALRQISLN 79
             +L + D GN  +R++  N
Sbjct: 896 AGNLYISDTGNYCIRKVDTN 915


>gi|333383538|ref|ZP_08475196.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332827477|gb|EGK00223.1| hypothetical protein HMPREF9455_03362 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 434

 Score = 44.3 bits (103), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D +GNLY+AD  N  IR I   G+ T   G+    G+ DG  E A F     V  V   
Sbjct: 357 LDLEGNLYIADEGNHCIRMIDKDGIVTTPIGQPGQEGYMDGSPEIALFKKPRGVA-VDKN 415

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D GN  LR+++L
Sbjct: 416 GDVYIADYGNRCLRKLTL 433


>gi|456865078|gb|EMF83443.1| putative lipoprotein [Leptospira weilii serovar Topaz str. LT2116]
          Length = 357

 Score = 44.3 bits (103), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG   V+G+ DG    A+F +   + Y +
Sbjct: 184 LDRERNLYVGELGNHAIRKINLNSETVSTLSGG---VSGYLDGDLVSAQFKSPLGITYDQ 240

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 241 KTDSLLVADLQDHRIRKIDL 260


>gi|359728310|ref|ZP_09267006.1| hypothetical protein Lwei2_15904 [Leptospira weilii str.
           2006001855]
 gi|417778551|ref|ZP_12426356.1| putative lipoprotein [Leptospira weilii str. 2006001853]
 gi|410781344|gb|EKR65918.1| putative lipoprotein [Leptospira weilii str. 2006001853]
          Length = 357

 Score = 44.3 bits (103), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG   V+G+ DG    A+F +   + Y +
Sbjct: 184 LDRERNLYVGELGNHAIRKINLNSETVSTLSGG---VSGYLDGDLVSAQFKSPLGITYDQ 240

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 241 KTDSLLVADLQDHRIRKIDL 260


>gi|289664052|ref|ZP_06485633.1| putative NHL repeat protein [Xanthomonas campestris pv.
          vasculorum NCPPB 702]
          Length = 263

 Score = 44.3 bits (103), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 3/56 (5%)

Query: 1  MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 55
          +D  GNLY+ADT N AIRK+   G VTT+AG    VAG R+G +   +F+    V 
Sbjct: 29 LDTAGNLYIADTGNHAIRKLTPQGKVTTLAG--DGVAGDRNGAAAQVRFNGPVGVA 82


>gi|372267905|ref|ZP_09503953.1| hypothetical protein AlS89_08394 [Alteromonas sp. S89]
          Length = 2227

 Score = 44.3 bits (103), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 3    DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPT 60
            D G++YVADT N  +R+I   GV T   G   +  F    GP+ DA  +  +DV+   P 
Sbjct: 977  DDGSIYVADTGNDRVRRIDRRGVITTIAGSEVIDTFAGDGGPATDASLNAPYDVI-CGPH 1035

Query: 61   CSLLVIDRGNAALRQISLN 79
             S+ + D  N  +R++ +N
Sbjct: 1036 GSIYIADSRNHRIRRVDVN 1054



 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 5    GNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGF--RDGPSEDAKFSNDFDVVYVRPTC 61
            G +Y++D+    +R+IG D  + T+AG KS   GF    GP+  A  +   DV  + P  
Sbjct: 1148 GGIYISDSGEGGVRRIGTDGTIVTVAGNKSPYWGFGGDGGPAIQADMTGVTDVA-IGPEG 1206

Query: 62   SLLVIDRGNAALRQIS 77
            SL ++D  N  +R++S
Sbjct: 1207 SLYLVDAYNFRIRKVS 1222



 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 1    MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
            +D +GNLY+ D+ N  IR++G D  +TTIAG          G + +A F +   + Y   
Sbjct: 1088 LDPEGNLYIVDSGNRRIRRVGVDGRITTIAGNGGYKNTGDGGSALEAGFDDPLGIAYA-A 1146

Query: 60   TCSLLVIDRGNAALRQI 76
               + + D G   +R+I
Sbjct: 1147 DGGIYISDSGEGGVRRI 1163



 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 45/78 (57%), Gaps = 6/78 (7%)

Query: 5    GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTC 61
            G++Y+AD+ N  IR++   G ++T+AG  S   GF    GP+ DA  S    +  + P  
Sbjct: 1036 GSIYIADSRNHRIRRVDVNGIISTVAG--SGARGFSGDGGPATDASLSAPSGIT-LDPEG 1092

Query: 62   SLLVIDRGNAALRQISLN 79
            +L ++D GN  +R++ ++
Sbjct: 1093 NLYIVDSGNRRIRRVGVD 1110


>gi|171913873|ref|ZP_02929343.1| NHL repeat containing protein [Verrucomicrobium spinosum DSM 4136]
          Length = 370

 Score = 44.3 bits (103), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 7/81 (8%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYV 57
           D  GN Y+AD  N AIRK+ DA    +TT+AG G+    G  DGPSE A+      + + 
Sbjct: 122 DKAGNYYIADMSNHAIRKV-DAKTGIITTLAGTGRGGYTG-DDGPSEKAELKQPHSIQF- 178

Query: 58  RPTCSLLVIDRGNAALRQISL 78
            P   L + D GN  +R+I +
Sbjct: 179 GPDGDLYICDIGNHVIRKIDM 199


>gi|390960114|ref|YP_006423871.1| gluconolactonase [Terriglobus roseus DSM 18391]
 gi|390415032|gb|AFL90536.1| gluconolactonase [Terriglobus roseus DSM 18391]
          Length = 1836

 Score = 44.3 bits (103), Expect = 0.089,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVR 58
           +D  GNLYVAD+ N  IR+I   GV TT+AG  +N + +  DG    A   N    V + 
Sbjct: 245 IDGAGNLYVADSGNNVIRRIDTNGVMTTVAGQVANSSPYAGDGLLATAATLNAPWSVALD 304

Query: 59  PTCSLLVIDRGNAALRQI 76
           P  +L + D GN A+R++
Sbjct: 305 PDGNLFIADTGNNAVRRV 322


>gi|445496654|ref|ZP_21463509.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
 gi|444786649|gb|ELX08197.1| NHL repeat containing protein [Janthinobacterium sp. HH01]
          Length = 741

 Score = 44.3 bits (103), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 45/79 (56%), Gaps = 6/79 (7%)

Query: 1   MDDKGNLYVADTL-NLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D  GNLYVAD   N +IRKI  D   TT+AGG   V G+ +G  + A F N    + + 
Sbjct: 111 IDGAGNLYVADGGDNNSIRKIALDGATTTLAGG---VEGYAEGAGKAAAF-NTPSGLAID 166

Query: 59  PTCSLLVIDRGNAALRQIS 77
              +L V D GN A+R+++
Sbjct: 167 AAGNLYVADTGNNAIRKVT 185



 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +YVADT N  IR+I   G VTTIAGG    AG  DG +  A F     +  +  
Sbjct: 219 VDAAGVVYVADTYNDRIRRIAPNGDVTTIAGGSR--AGKADGAAAQALFDTPTGLA-LSA 275

Query: 60  TCSLLVIDRGNAALRQIS 77
              L + D GN A+R++ 
Sbjct: 276 AGDLYIADTGNHAIRKLG 293



 Score = 41.6 bits (96), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 4/82 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GNLYVADT N AIRK+   G V+T+AG    + G +DG    A+F+    +  V  
Sbjct: 165 IDAAGNLYVADTGNNAIRKVTPEGVVSTLAG--DGLPGDKDGRGAAAQFNGPVGIA-VDA 221

Query: 60  TCSLLVIDRGNAALRQISLNQD 81
              + V D  N  +R+I+ N D
Sbjct: 222 AGVVYVADTYNDRIRRIAPNGD 243


>gi|325916326|ref|ZP_08178604.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
 gi|325537497|gb|EGD09215.1| gluconolactonase [Xanthomonas vesicatoria ATCC 35937]
          Length = 699

 Score = 44.3 bits (103), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 54
           +D  GNLY+ADT N AIRK    G VTT+AG     AGF DG +  A+F+    V
Sbjct: 133 LDTAGNLYIADTGNHAIRKRTPQGVVTTLAG--DGTAGFGDGAASQARFNGPMGV 185


>gi|418677567|ref|ZP_13238841.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|418687786|ref|ZP_13248945.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|418742122|ref|ZP_13298495.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
 gi|400320757|gb|EJO68617.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. RM52]
 gi|410738110|gb|EKQ82849.1| putative lipoprotein [Leptospira kirschneri serovar Grippotyphosa
           str. Moskva]
 gi|410750480|gb|EKR07460.1| putative lipoprotein [Leptospira kirschneri serovar Valbuzzi str.
           200702274]
          Length = 358

 Score = 44.3 bits (103), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 47/80 (58%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI  D+G V+T +GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDLERNLYVGELSNHTIRKINLDSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLL  D  +  +R+I+L
Sbjct: 242 KTDSLLAADIQDHRIRKINL 261


>gi|418737091|ref|ZP_13293489.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
           str. 200801910]
 gi|410747250|gb|EKR00156.1| NHL repeat protein [Leptospira borgpetersenii serovar Castellonis
           str. 200801910]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   V+G+ DG    A+F     ++Y +
Sbjct: 185 LDRERNLYVGELGNHTIRKINLNSETVSTLSGG---VSGYLDGDLTSAQFKFPSGIIYDQ 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  N  +R+I L
Sbjct: 242 KTDSLLVADLQNHRIRKIDL 261


>gi|393778566|ref|ZP_10366837.1| Ig family protein [Ralstonia sp. PBA]
 gi|392714468|gb|EIZ02071.1| Ig family protein [Ralstonia sp. PBA]
          Length = 342

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 43/76 (56%), Gaps = 1/76 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVADT N  +R+I  AGV +   G++ V G ++G    A+F   + VV V   
Sbjct: 262 LDGDGNVYVADTGNNLLRRITRAGVVSTLAGQAGVTGAQNGIGSAARFKQPYGVV-VDAD 320

Query: 61  CSLLVIDRGNAALRQI 76
            ++ V D  N  +R++
Sbjct: 321 GTVYVADTFNNLIRKV 336


>gi|444306537|ref|ZP_21142300.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
 gi|443481173|gb|ELT44105.1| NHL repeat containing protein [Arthrobacter sp. SJCon]
          Length = 673

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 45/79 (56%), Gaps = 6/79 (7%)

Query: 2   DDKGNLYVADTLNLAIRK--IGDAGVTTI--AGGKSNVA-GFRDGPSEDAKFSNDFDVVY 56
           D  GN++VAD+   A+RK  I DAG  T+  A GK     GFRDGP+ +A+  +   V  
Sbjct: 407 DGDGNIWVADSETSALRKLVIDDAGSITVESALGKGLFDFGFRDGPAVEARLQHPLGVT- 465

Query: 57  VRPTCSLLVIDRGNAALRQ 75
           V P  S+ + D  N A+R+
Sbjct: 466 VLPDGSVAIADTYNGAVRR 484


>gi|405375498|ref|ZP_11029528.1| putative hemolysin [Chondromyces apiculatus DSM 436]
 gi|397086225|gb|EJJ17355.1| putative hemolysin [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 855

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YV+DT +  IR+I   G   +  G +   G +DGP++ A F+    +       
Sbjct: 625 DAAGNVYVSDTDHYVIRRIDPEGKVELFAGST--PGLQDGPAKQAAFNQPAGLTVTPDGS 682

Query: 62  SLLVIDRGNAALRQISL 78
           +LLV D  N  +R+I L
Sbjct: 683 ALLVADLNNGVIRRIDL 699


>gi|124002516|ref|ZP_01687369.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
 gi|123992345|gb|EAY31713.1| hypothetical protein M23134_05219 [Microscilla marina ATCC 23134]
          Length = 2385

 Score = 43.9 bits (102), Expect = 0.10,   Method: Composition-based stats.
 Identities = 29/80 (36%), Positives = 43/80 (53%), Gaps = 3/80 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVA---GFRDGPSEDAKFSNDFDVVYV 57
           +D  GN+YVAD  N  I+KI +  VTT+AG  ++ A   G  DG ++ A+F     V   
Sbjct: 723 VDAAGNIYVADKNNHRIKKIANGMVTTLAGPMNDAASIPGRTDGAADAARFFFPTSVALD 782

Query: 58  RPTCSLLVIDRGNAALRQIS 77
                L V D+ N  +RQ++
Sbjct: 783 ITGAQLYVADKLNNIIRQVN 802



 Score = 38.9 bits (89), Expect = 3.4,   Method: Composition-based stats.
 Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 4/79 (5%)

Query: 5   GNLYVADTLNLAIRKI--GDAGVTTIAGGKSN-VAGFRDGPSEDAKFSNDFDVVYVRPTC 61
             LYVAD LN  IR++   D    T AG  +N +AG +DG +  AKF +   +  V    
Sbjct: 786 AQLYVADKLNNIIRQVNTADGHTLTYAGDVANGIAGHQDGNAASAKFRSPAGIT-VNAVG 844

Query: 62  SLLVIDRGNAALRQISLNQ 80
            + + D  N  +R+IS  Q
Sbjct: 845 DVYIADTHNQVIRKISQGQ 863


>gi|421106984|ref|ZP_15567544.1| putative lipoprotein [Leptospira kirschneri str. H2]
 gi|410007898|gb|EKO61577.1| putative lipoprotein [Leptospira kirschneri str. H2]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI  D+G V+T +GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLDSGTVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLL  D  +  +R+I L
Sbjct: 242 KTDSLLAADIQDHRIRKIDL 261


>gi|118161380|gb|ABK64062.1| conserved hypothetical protein [Janthinobacterium lividum]
          Length = 779

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GNLY+ D+LN  +RKI  AGV T   G + VAG  DG    A F+    +   +   
Sbjct: 652 DASGNLYLTDSLNHNVRKITPAGVVTTLAGTTGVAGDADGRGSAASFNGPHGIALDKA-G 710

Query: 62  SLLVIDRGNAALRQIS 77
           ++ V D  N  +R+IS
Sbjct: 711 NVYVADTENNLVRRIS 726



 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 47/83 (56%), Gaps = 9/83 (10%)

Query: 1   MDDKGNLYVADT-----LNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDV 54
           +D  GN+YV+D+     LN  +RKI  AG VTTI G  + + G  DG + +A+F+     
Sbjct: 423 VDGAGNVYVSDSNPGLQLN-GVRKITPAGEVTTITGSNAGI-GVTDGLASEARFAGP-QA 479

Query: 55  VYVRPTCSLLVIDRGNAALRQIS 77
           + +    +L V D GN  +R++S
Sbjct: 480 IALHSDGTLFVADTGNQLIRRVS 502


>gi|421129432|ref|ZP_15589632.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
 gi|410358807|gb|EKP05916.1| putative lipoprotein [Leptospira kirschneri str. 2008720114]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI  D+G V+T +GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDLERNLYVGELSNHTIRKINLDSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLL  D  +  +R+I L
Sbjct: 242 KTDSLLAADIQDHRIRKIDL 261


>gi|418693423|ref|ZP_13254475.1| putative lipoprotein [Leptospira kirschneri str. H1]
 gi|409958780|gb|EKO17669.1| putative lipoprotein [Leptospira kirschneri str. H1]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI  D+G V+T +GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLDSGTVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLL  D  +  +R+I L
Sbjct: 242 KTDSLLAADIQDHRIRKIDL 261


>gi|290995061|ref|XP_002680150.1| predicted protein [Naegleria gruberi]
 gi|284093769|gb|EFC47406.1| predicted protein [Naegleria gruberi]
          Length = 880

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 4/75 (5%)

Query: 4   KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGP-SEDAKFSNDFDVVYVRPTC 61
           KG++Y+AD+ N  IRK+ +  ++TIAG  S  AGF  DG  +  AK    + V Y   + 
Sbjct: 441 KGDVYIADSYNNRIRKVSNGFISTIAGTGS--AGFTGDGELAIAAKLDTPYSVAYSNTSG 498

Query: 62  SLLVIDRGNAALRQI 76
            + ++D  NA +R I
Sbjct: 499 LVYILDTNNARIRNI 513


>gi|301122083|ref|XP_002908768.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099530|gb|EEY57582.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1981

 Score = 43.9 bits (102), Expect = 0.12,   Method: Composition-based stats.
 Identities = 36/83 (43%), Positives = 45/83 (54%), Gaps = 10/83 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPS-EDAKFSNDFDVVYV 57
           +D  G +YVADT N  IR I      V+T+AG    V GF DGP+   A+FS   DV  V
Sbjct: 234 VDSNGVVYVADTANHRIRIIDPTTKRVSTLAG--DGVEGFVDGPALTAARFSYPSDVA-V 290

Query: 58  RP----TCSLLVIDRGNAALRQI 76
           R     T ++ V D GN  +RQI
Sbjct: 291 RESNSGTITVFVADTGNHRIRQI 313


>gi|399034109|ref|ZP_10732477.1| gluconolactonase [Flavobacterium sp. CF136]
 gi|398067543|gb|EJL59038.1| gluconolactonase [Flavobacterium sp. CF136]
          Length = 418

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 48/79 (60%), Gaps = 2/79 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D K NLY+++     I K+   G+ TI  G +N  GF++G  ++AKFS    +V+ +  
Sbjct: 246 LDSKNNLYISEDATSKIMKLTPTGILTIFAG-ANGTGFQNGNIKEAKFSKPSGIVFDKMD 304

Query: 61  CSLLVIDRGNAALRQISLN 79
            +L V+DR N  +R+I+++
Sbjct: 305 -NLYVVDRYNNRIRKIAVD 322


>gi|421091013|ref|ZP_15551797.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
 gi|410000210|gb|EKO50873.1| putative lipoprotein [Leptospira kirschneri str. 200802841]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI  D+G V+T +GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDLERNLYVGELSNHTIRKINLDSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLL  D  +  +R+I L
Sbjct: 242 KTDSLLAADIQDHRIRKIDL 261


>gi|124008176|ref|ZP_01692873.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
 gi|123986275|gb|EAY26097.1| hypothetical protein M23134_05970 [Microscilla marina ATCC 23134]
          Length = 2548

 Score = 43.9 bits (102), Expect = 0.12,   Method: Composition-based stats.
 Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  G++YVADT N  IRKI +  VTTIA G +N  G  +G ++ AKF N    VY    
Sbjct: 864 VDMVGDVYVADTRNQVIRKISEGQVTTIA-GIANDRGLTNGTAQAAKF-NYPGSVYADLE 921

Query: 61  CSLLVIDRGNAALRQ 75
            ++ V D+ N  +R+
Sbjct: 922 QNIYVGDKVNQLVRR 936



 Score = 43.5 bits (101), Expect = 0.14,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 7/85 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 56
           +DD G LYVAD  N  IR I    G A VTT+AG     +G   G S  A+F    DV  
Sbjct: 689 IDDAGILYVADKDNHVIRAIANPDGAATVTTVAG--DGTSGDAIGASTSARFREPSDVA- 745

Query: 57  VRPTCSLLVIDRGNAALRQISLNQD 81
           V  + +L V D+ N  ++++ LN +
Sbjct: 746 VDFSGNLYVADKNNHKIKKVDLNTN 770



 Score = 40.4 bits (93), Expect = 1.1,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 8/107 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTI------AGGKSNVAGFRDGPSEDAKFSNDFDV 54
           +D  GNLYVAD  N  I+K+ D    T+      A G    AG  DG +  A+F     +
Sbjct: 746 VDFSGNLYVADKNNHKIKKV-DLNTNTVSILSGPAVGTVFPAGATDGTASIARFFFPTSI 804

Query: 55  VYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGA 101
              R + +L V DR N  +RQ++ +          IS T+ L V GA
Sbjct: 805 ALDR-SGNLFVADRHNNLIRQVATSSGATSTYAGDISQTNALYVDGA 850


>gi|197118445|ref|YP_002138872.1| lipoprotein [Geobacter bemidjiensis Bem]
 gi|197087805|gb|ACH39076.1| repeat-containing lipoprotein, putative [Geobacter bemidjiensis
           Bem]
          Length = 652

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 44/76 (57%), Gaps = 6/76 (7%)

Query: 3   DKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  NLYVAD+LN  +RKI    A VTT+AG  S +  F DG    A F++   +      
Sbjct: 193 DGANLYVADSLNHRVRKIVLASAAVTTLAG--SGLQDFADGTGIAASFNSPRGI--ATDG 248

Query: 61  CSLLVIDRGNAALRQI 76
            SL + D+GN+A+R+I
Sbjct: 249 VSLYLADQGNSAVRRI 264


>gi|398339828|ref|ZP_10524531.1| hypothetical protein LkirsB1_10395 [Leptospira kirschneri serovar
           Bim str. 1051]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI  D+G V+T +GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDLERNLYVGELSNHTIRKINLDSGMVSTFSGG---ISGYLDGDLTSARFKSPLGIAYDR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLL  D  +  +R+I L
Sbjct: 242 KTDSLLAADIQDHRIRKIDL 261


>gi|224536973|ref|ZP_03677512.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521412|gb|EEF90517.1| hypothetical protein BACCELL_01849 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 446

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGP-SEDAKFSNDFDVVYVRP 59
            D+  N YV D  N  IRK+  +G  +   G+    G+ DG   ++A+F   F + Y R 
Sbjct: 367 FDNDDNFYVCDQNNNLIRKVEPSGQVSTFAGRREEWGWADGDLRKEARFDRPFGIAYNRN 426

Query: 60  TCSLLVIDRGNAALRQIS 77
           T    + D+ N  +R I+
Sbjct: 427 TSEFYIADKNNKRIRIIT 444


>gi|255542920|ref|XP_002512523.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
 gi|223548484|gb|EEF49975.1| 2-deoxyglucose-6-phosphate phosphatase, putative [Ricinus communis]
          Length = 1016

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 48/84 (57%), Gaps = 5/84 (5%)

Query: 5   GNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +YVAD+ N  I+K+  A   V+TIAG  +  AGF+DG +  A+ S    ++       
Sbjct: 776 GQIYVADSYNHKIKKLDPATKRVSTIAG--TGKAGFKDGKALAAQLSEPSGIIEAE-NGR 832

Query: 63  LLVIDRGNAALRQISLNQDDCEYQ 86
           L++ D  N+ +R + LN+++ E +
Sbjct: 833 LIIADTNNSIIRYLDLNKEEAELR 856


>gi|159471860|ref|XP_001694074.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277241|gb|EDP03010.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1591

 Score = 43.5 bits (101), Expect = 0.14,   Method: Composition-based stats.
 Identities = 29/78 (37%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 1    MDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
            +D +GN Y +D+ N  +R +   D  V+T AG     AGFRDG   +A+F N   +    
Sbjct: 1268 VDGEGNCYCSDSSNHCVRLLHAADGMVSTFAGSPGQ-AGFRDGAGTEARFRNPCGIAINL 1326

Query: 59   PTCSLLVIDRGNAALRQI 76
               SL V D  N  LR+I
Sbjct: 1327 QDGSLAVADAENNRLRRI 1344


>gi|375146530|ref|YP_005008971.1| NHL repeat containing protein [Niastella koreensis GR20-10]
 gi|361060576|gb|AEV99567.1| NHL repeat containing protein [Niastella koreensis GR20-10]
          Length = 434

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 50/90 (55%), Gaps = 4/90 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D+ GN+YV D  N  IRKI  D  VTTIAG   N  G+ +GP+  AKF+N      +  
Sbjct: 155 VDNIGNVYVPDGGNQRIRKIAPDGTVTTIAGTGEN--GYNEGPASGAKFNNPCATA-MDA 211

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNS 89
             ++ V +R    +R+I+ + +   + Y+S
Sbjct: 212 NGNMYVAERNGRRIRKITPDGNVSTHAYSS 241


>gi|410448598|ref|ZP_11302672.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
           LV3954]
 gi|410017668|gb|EKO79726.1| hypothetical protein LEP1GSC068_3151 [Leptospira sp. Fiocruz
           LV3954]
          Length = 356

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG     G+ DG    A+F +   + Y +
Sbjct: 183 LDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQ 239

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 240 KTDSLLVADLQDHRIRKIDL 259


>gi|410096564|ref|ZP_11291551.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409226528|gb|EKN19437.1| hypothetical protein HMPREF1076_00729 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 457

 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GN+Y+AD  N  I K+ +    TI  G    +G++DG  ++A F+  +DV    P  
Sbjct: 381 DQEGNVYLADYTNHVIWKVDEEFNATIFAGVPGESGYKDGKPQEALFNKPYDVA-ATPDG 439

Query: 62  SLLVIDRGNAALRQISL 78
            L V D  N  +R I++
Sbjct: 440 ILYVADTYNYLIRCIAI 456


>gi|422003150|ref|ZP_16350382.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417258114|gb|EKT87507.1| hypothetical protein LSS_06614 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 356

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG     G+ DG    A+F +   + Y +
Sbjct: 183 LDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQ 239

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 240 KTDSLLVADLQDHRIRKIDL 259


>gi|223937730|ref|ZP_03629631.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223893523|gb|EEF59983.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 1474

 Score = 43.5 bits (101), Expect = 0.16,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            +D  G L+V+DT N  IRKI   GV T   G S VAG  DG   +A F     +  +  +
Sbjct: 1073 VDGLGTLFVSDTGNNVIRKISTNGVVTTFAG-SGVAGTHDGVGTNASFLAPTGIA-LDSS 1130

Query: 61   CSLLVIDRGNAALRQIS 77
             +L V D GN+ +R+++
Sbjct: 1131 NNLYVADSGNSLIRKVT 1147


>gi|418754100|ref|ZP_13310334.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
           MOR084]
 gi|409965522|gb|EKO33385.1| hypothetical protein LEP1GSC179_2637 [Leptospira santarosai str.
           MOR084]
          Length = 356

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG     G+ DG    A+F +   + Y +
Sbjct: 183 LDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQ 239

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 240 KTDSLLVADLQDHRIRKIDL 259


>gi|293372451|ref|ZP_06618835.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
 gi|292632634|gb|EFF51228.1| IPT/TIG domain protein [Bacteroides ovatus SD CMC 3f]
          Length = 446

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 7/83 (8%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVV 55
           D  + Y+ D  N  IRKI   G+ T   G+ +      V+G+ DG P E A+F+    + 
Sbjct: 363 DIYDFYLTDAANHCIRKITPDGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLC 422

Query: 56  YVRPTCSLLVIDRGNAALRQISL 78
           Y   T +  V D GN  +R I+L
Sbjct: 423 YEESTGTFYVGDNGNHRVRTIAL 445


>gi|456874551|gb|EMF89837.1| hypothetical protein LEP1GSC005_3835 [Leptospira santarosai str.
           ST188]
          Length = 356

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG     G+ DG    A+F +   + Y +
Sbjct: 183 LDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQ 239

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 240 KTDSLLVADLQDHRIRKIDL 259


>gi|336322170|ref|YP_004602138.1| alkyl hydroperoxide reductase [[Cellvibrio] gilvus ATCC 13127]
 gi|336105751|gb|AEI13570.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [[Cellvibrio] gilvus ATCC 13127]
          Length = 638

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 13/113 (11%)

Query: 1   MDDKGNLYVADTLNLAIRKI-------GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFD 53
           +DD+G +++AD+   A+R +       GD  V T+ G      G RDG +++A+  +   
Sbjct: 375 VDDEGGIWLADSETSALRVVEPVHGSAGDGSVRTVVGAGLFEFGHRDGAADEARLQHPLG 434

Query: 54  VVYVRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSIS-----PTDILMVVGA 101
           V  V P  S+ V D  N A+R+     D       +I+     P+ ++++ GA
Sbjct: 435 VA-VLPDGSVAVADTYNGAVRRFVAGTDGAGGAVTTIATGLAEPSGLVVLDGA 486


>gi|290995346|ref|XP_002680256.1| predicted protein [Naegleria gruberi]
 gi|284093876|gb|EFC47512.1| predicted protein [Naegleria gruberi]
          Length = 1226

 Score = 43.5 bits (101), Expect = 0.16,   Method: Composition-based stats.
 Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           G+LY  D+ N AIRKI   GV T   G +  AGF+DG +  A F++   +  + P    L
Sbjct: 397 GDLYFCDSNNHAIRKIDTQGVITTLTG-NGTAGFQDGDAASALFNSPASIALL-PNGDFL 454

Query: 65  VIDRGNAALRQ 75
           V D  N+ +R+
Sbjct: 455 VADFTNSRIRK 465


>gi|418747486|ref|ZP_13303786.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
           CBC379]
 gi|410791609|gb|EKR89564.1| hypothetical protein LEP1GSC163_2583 [Leptospira santarosai str.
           CBC379]
          Length = 356

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG     G+ DG    A+F +   + Y +
Sbjct: 183 LDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQ 239

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 240 KTDSLLVADLQDHRIRKIDL 259


>gi|294647916|ref|ZP_06725468.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294806312|ref|ZP_06765159.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345510467|ref|ZP_08790034.1| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
 gi|292636824|gb|EFF55290.1| IPT/TIG domain protein [Bacteroides ovatus SD CC 2a]
 gi|294446568|gb|EFG15188.1| IPT/TIG domain protein [Bacteroides xylanisolvens SD CC 1b]
 gi|295085612|emb|CBK67135.1| IPT/TIG domain./NHL repeat. [Bacteroides xylanisolvens XB1A]
 gi|345454427|gb|EEO48966.2| hypothetical protein BSAG_00676 [Bacteroides sp. D1]
          Length = 476

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 7/83 (8%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVV 55
           D  + Y+ D  N  IRKI   G+ T   G+ +      V+G+ DG P E A+F+    + 
Sbjct: 393 DVYDFYLTDAANHCIRKITPDGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLC 452

Query: 56  YVRPTCSLLVIDRGNAALRQISL 78
           Y   T +  V D GN  +R I+L
Sbjct: 453 YEESTGTFYVGDNGNHRVRTIAL 475


>gi|262405960|ref|ZP_06082510.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|262356835|gb|EEZ05925.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
          Length = 459

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 7/83 (8%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVV 55
           D  + Y+ D  N  IRKI   G+ T   G+ +      V+G+ DG P E A+F+    + 
Sbjct: 376 DVYDFYLTDAANHCIRKITPDGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLC 435

Query: 56  YVRPTCSLLVIDRGNAALRQISL 78
           Y   T +  V D GN  +R I+L
Sbjct: 436 YEESTGTFYVGDNGNHRVRTIAL 458


>gi|359685014|ref|ZP_09255015.1| hypothetical protein Lsan2_10264 [Leptospira santarosai str.
           2000030832]
          Length = 356

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N AIRKI      V+T++GG     G+ DG    A+F +   + Y +
Sbjct: 183 LDRERNLYVGELGNHAIRKINLNSGSVSTLSGG---TLGYLDGDLTSAQFKSPLGITYDQ 239

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLLV D  +  +R+I L
Sbjct: 240 KTDSLLVADLQDHRIRKIDL 259


>gi|290975813|ref|XP_002670636.1| predicted protein [Naegleria gruberi]
 gi|284084197|gb|EFC37892.1| predicted protein [Naegleria gruberi]
          Length = 2442

 Score = 43.1 bits (100), Expect = 0.17,   Method: Composition-based stats.
 Identities = 30/82 (36%), Positives = 47/82 (57%), Gaps = 6/82 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGF--RDGPSEDAKFSNDFDVVYV 57
           ++ KG +YV+DT+N  +RKI   G +TT+AG  S +AGF    G + +AK S     +YV
Sbjct: 894 VNSKGEIYVSDTVNHRVRKIDLNGTITTVAG--SGLAGFYGDGGLATNAKLSYPMG-IYV 950

Query: 58  RPTCSLLVIDRGNAALRQISLN 79
                + + D GN  +R++  N
Sbjct: 951 SKNGDIYIADYGNHRIRKVFSN 972



 Score = 39.3 bits (90), Expect = 2.7,   Method: Composition-based stats.
 Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 4/74 (5%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCSL 63
           G++YVA+++N  IRKI +  +TT+AG  ++VAG+  D  +  +   N    V+   T   
Sbjct: 571 GDIYVAESINSRIRKISNGIITTVAG--TSVAGYNGDDIAATSASLNGPPAVFATSTAYY 628

Query: 64  LVIDRGNAALRQIS 77
           L  D GN  +R++S
Sbjct: 629 LA-DAGNRRVRKVS 641


>gi|218190788|gb|EEC73215.1| hypothetical protein OsI_07299 [Oryza sativa Indica Group]
          Length = 1103

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 5/82 (6%)

Query: 7   LYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           +YVAD+ N  I+++      VTTIAG  +  AG++DGP+  A+ S    +V V     LL
Sbjct: 837 IYVADSYNHKIKRLDPVTRKVTTIAG--TGRAGYKDGPALSAQLSEPAGLVEVGDG-RLL 893

Query: 65  VIDRGNAALRQISLNQDDCEYQ 86
           V D  N+ +R I LN+   E +
Sbjct: 894 VADTNNSTIRYIVLNEKGAEVR 915


>gi|289667480|ref|ZP_06488555.1| putative NHL repeat protein [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 310

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 34/56 (60%), Gaps = 3/56 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVV 55
           +D  GNLY+ADT N AIRK+   G VTT+AG    VAG R+G +   +F+    V 
Sbjct: 76  LDTAGNLYIADTGNHAIRKLTPQGKVTTLAG--DGVAGDRNGAAAQVRFNGPVGVA 129


>gi|29349195|ref|NP_812698.1| hypothetical protein BT_3787 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341103|gb|AAO78892.1| conserved hypothetical protein with NHL repeat [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 452

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  GN++V +     IRKI   G  ++  G    AGF DG  E A+F N  + V V   
Sbjct: 375 FDQDGNMFVPEYGRHTIRKITPTGEVSLYAGLPGQAGFTDGLPEKARF-NKPECVTVYLD 433

Query: 61  CSLLVIDRGNAALRQISL 78
            SL V DR N  +R++++
Sbjct: 434 NSLYVADRDNHLIRRVTV 451


>gi|288921012|ref|ZP_06415304.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288347587|gb|EFC81872.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 882

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 46/81 (56%), Gaps = 6/81 (7%)

Query: 2   DDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVR 58
           D KGN+Y+ D  N  IRKI  D  ++TIAG  + VAG+    GP+  A+  +  + + V 
Sbjct: 630 DGKGNIYIPDARNYRIRKIAADGTISTIAG--TGVAGYSGDGGPATAAQLKS-AEKLAVA 686

Query: 59  PTCSLLVIDRGNAALRQISLN 79
           P  S+ + D  N  +R+IS +
Sbjct: 687 PDGSIYIADYENHRIRKISTD 707



 Score = 37.7 bits (86), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 4   KGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
            G LY+AD  N  IRK+   G +TTIAG  S  +    GP+ +AKF++   VV    T +
Sbjct: 799 NGTLYIADYGNNRIRKVDANGTITTIAGTGSEGSSGDGGPATEAKFTDPSSVVE-DSTGA 857

Query: 63  LLVIDRGNAALRQIS 77
           L + D GN  +R+I+
Sbjct: 858 LYIADSGNNRIRRIA 872


>gi|312200909|ref|YP_004020970.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311232245|gb|ADP85100.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 847

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYV 57
           +D  GN+YV DT N  IRKI  D  +TT+ G  +  AGF    GP+  A+  N  + + V
Sbjct: 591 VDKNGNIYVPDTANNRIRKITPDGKITTVVG--NGTAGFSGDGGPATQAEI-NSVEGIAV 647

Query: 58  RPTCSLLVIDRGNAALRQIS 77
            P  SL + D  N  +R+++
Sbjct: 648 GPDGSLYLADYSNERIRKVT 667


>gi|299144811|ref|ZP_07037879.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
 gi|298515302|gb|EFI39183.1| putative IPT/TIG domain protein [Bacteroides sp. 3_1_23]
          Length = 459

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 7/83 (8%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN------VAGFRDG-PSEDAKFSNDFDVV 55
           D  + Y+ D  N  IRKI   G+ T   G+ +      V+G+ DG P E A+F+    + 
Sbjct: 376 DIYDFYLTDAANHCIRKITPDGIVTTFAGRGSYSTDQIVSGYIDGDPRETARFNYPLGLC 435

Query: 56  YVRPTCSLLVIDRGNAALRQISL 78
           Y   T +  V D GN  +R I+L
Sbjct: 436 YEESTGTFYVGDNGNHRVRTIAL 458


>gi|290977429|ref|XP_002671440.1| predicted protein [Naegleria gruberi]
 gi|284085009|gb|EFC38696.1| predicted protein [Naegleria gruberi]
          Length = 1494

 Score = 43.1 bits (100), Expect = 0.20,   Method: Composition-based stats.
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           G +Y++D+ N  IRKI   G ++T+ G  S   GF D  +  A+ +N   + ++ P   L
Sbjct: 508 GEVYISDSSNDVIRKIDTNGAISTVVGSSS---GFLDASARRAQLANPMGIAFL-PNGDL 563

Query: 64  LVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVT 108
           ++ D  N  +R+ S++  +      +I+ T +    G  LVG  T
Sbjct: 564 IISDAKNNRIRKFSISSGNV----TTIAGTGLTTYNGEGLVGVAT 604


>gi|320106842|ref|YP_004182432.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925363|gb|ADV82438.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 382

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           G  Y+ADT N  IR++  AG  T   G    +G  DG S  AKF     +        L 
Sbjct: 301 GAYYIADTFNNCIRRMDSAGNVTTYAGIGGQSGLVDGASTSAKFDKPTGIAIAN--GYLY 358

Query: 65  VIDRGNAALRQISLNQD 81
           V D GN A+R++ +N +
Sbjct: 359 VADSGNNAIRRVDMNAN 375


>gi|222624800|gb|EEE58932.1| hypothetical protein OsJ_10596 [Oryza sativa Japonica Group]
          Length = 1017

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 5/82 (6%)

Query: 7   LYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           +YVAD+ N  I+++      VTTIAG  +  AG++DGP+  A+ S    +V V     LL
Sbjct: 751 IYVADSYNHKIKRLDPVTRKVTTIAG--TGRAGYKDGPALSAQLSEPAGLVEVGDG-RLL 807

Query: 65  VIDRGNAALRQISLNQDDCEYQ 86
           V D  N+ +R I LN+   E +
Sbjct: 808 VADTNNSTIRYIVLNEKGAEVR 829


>gi|425734502|ref|ZP_18852820.1| NHL repeat-containing protein [Brevibacterium casei S18]
 gi|425481116|gb|EKU48277.1| NHL repeat-containing protein [Brevibacterium casei S18]
          Length = 666

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 4/97 (4%)

Query: 5   GNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G + VAD+   AIR++  A    TT+ G      GFRDGP+ +A+  +   V  + P  S
Sbjct: 398 GGVIVADSETSAIRRLDPASGEATTLVGTGLFDFGFRDGPAAEARLQHPLGVRTL-PDGS 456

Query: 63  LLVIDRGNAALRQISLNQDDCEYQYNSI-SPTDILMV 98
           L + D  N A+R+     ++       +  P+DIL+V
Sbjct: 457 LAIADTYNGAIRRYDFTTNEVSTLARGLREPSDILVV 493


>gi|429737840|ref|ZP_19271682.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
 gi|429161715|gb|EKY04093.1| IPT/TIG domain protein [Prevotella saccharolytica F0055]
          Length = 438

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 41/78 (52%), Gaps = 1/78 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D+ GN+YVAD  N  IR+I    +     G     G++DG  EDA F N+   + +    
Sbjct: 362 DNDGNIYVADRGNHCIRRITPENMVETVLGMPGTKGWKDGKKEDALF-NEPTGIGIAQDG 420

Query: 62  SLLVIDRGNAALRQISLN 79
           S+ V D  N  +R++++N
Sbjct: 421 SVYVADFKNGRVRKLTIN 438


>gi|332878080|ref|ZP_08445810.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
 gi|357047673|ref|ZP_09109271.1| NHL repeat protein [Paraprevotella clara YIT 11840]
 gi|332684042|gb|EGJ56909.1| NHL repeat protein [Capnocytophaga sp. oral taxon 329 str. F0087]
 gi|355529361|gb|EHG98795.1| NHL repeat protein [Paraprevotella clara YIT 11840]
          Length = 461

 Score = 42.7 bits (99), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D++ NLY+AD+ N  IRKI   GV +   G    +G++DG  + A F+  + +  +   
Sbjct: 384 LDEEENLYIADSENNCIRKITPEGVVSTVIGIPGKSGYKDGTPDVALFTQPWGLA-IDSE 442

Query: 61  CSLLVIDRGNAALRQISL 78
             + V D+ N  +RQ+S+
Sbjct: 443 GIIYVGDKDNLCVRQLSI 460


>gi|290975887|ref|XP_002670673.1| predicted protein [Naegleria gruberi]
 gi|284084234|gb|EFC37929.1| predicted protein [Naegleria gruberi]
          Length = 1074

 Score = 42.7 bits (99), Expect = 0.22,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRPTCS 62
           G +++AD+ N  IRKI   G ++T+AG  S VAGF  DG    +   N+   V V     
Sbjct: 807 GEIFIADSNNHRIRKIATNGDISTVAG--SGVAGFSGDGGLATSATLNNPSYVAVNSNGE 864

Query: 63  LLVIDRGNAALRQISLN 79
           LL+ D  N  +R++SLN
Sbjct: 865 LLISDTNNHRIRKVSLN 881


>gi|325104251|ref|YP_004273905.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973099|gb|ADY52083.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 532

 Score = 42.7 bits (99), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+ V +     IRKI  D  V+TIAG  S  AG+ DG    A+F N  D V V  
Sbjct: 106 VDAAGNIIVVEKDGARIRKIALDGTVSTIAGTGS--AGYSDGAVSVARFDNALDAV-VDS 162

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ + DR N  +R+I+
Sbjct: 163 DGNIFIADRNNRRVRKIT 180


>gi|406944252|gb|EKD76066.1| Serine/threonine protein kinase, partial [uncultured bacterium]
          Length = 212

 Score = 42.7 bits (99), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 4   KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV-VYVRPTCS 62
            G +Y+ DT+N  + KIG  G+ T   G  +  G+RDG S+ A F+   D+ +Y      
Sbjct: 84  NGEIYLVDTINNRVEKIGTDGILTNVAGAGDY-GYRDGSSDYALFAQPQDIAIYGDTASE 142

Query: 63  LLVIDRGNAALRQI 76
           L + D  N  +R+I
Sbjct: 143 LFIADTNNNVIRKI 156


>gi|108707786|gb|ABF95581.1| HAD-superfamily hydrolase, subfamily IA, variant 3 containing
           protein, expressed [Oryza sativa Japonica Group]
          Length = 1064

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 7   LYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           +YVAD+ N  I+++      VTTIAG  +  AG++DGP+  A+ S    +V V     LL
Sbjct: 822 IYVADSYNHKIKRLDPVTRKVTTIAG--TGRAGYKDGPALSAQLSEPAGLVEVGDG-RLL 878

Query: 65  VIDRGNAALRQISLNQDDCE 84
           V D  N+ +R I LN+   E
Sbjct: 879 VADTNNSTIRYIVLNEKGAE 898


>gi|153864226|ref|ZP_01997193.1| NHL repeat protein [Beggiatoa sp. SS]
 gi|152146280|gb|EDN72808.1| NHL repeat protein [Beggiatoa sp. SS]
          Length = 313

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +DD G+LY+ DT N  IRK+  A   +TT+AGG S  +G  DG +  A+F N  D + + 
Sbjct: 131 IDDSGSLYIIDTNNHRIRKVEKATNIITTLAGGVSGFSG-DDGQALSAQF-NTPDGIAIS 188

Query: 59  PTCSLLVIDRGNAALRQIS 77
               LL+ D  N  +R I+
Sbjct: 189 DEGHLLIADSENHRVRMIT 207


>gi|298387997|ref|ZP_06997545.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
 gi|298259263|gb|EFI02139.1| NHL repeat-containing protein [Bacteroides sp. 1_1_14]
          Length = 439

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 47/81 (58%), Gaps = 8/81 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYV 57
           +D +GNLY+AD  N  IRK+      VTTIAG GK    G++DG   +A+F   +  VY+
Sbjct: 363 IDKEGNLYIADVNNHRIRKVDTKLNIVTTIAGSGK----GYKDGDPLEAQFDQPWG-VYL 417

Query: 58  RPTCSLLVIDRGNAALRQISL 78
                L + D+ N  +R++++
Sbjct: 418 DKNEFLYIADQNNHCIRKLAI 438


>gi|322434821|ref|YP_004217033.1| hypothetical protein AciX9_1190 [Granulicella tundricola MP5ACTX9]
 gi|321162548|gb|ADW68253.1| NHL repeat containing protein [Granulicella tundricola MP5ACTX9]
          Length = 1631

 Score = 42.7 bits (99), Expect = 0.24,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 4/78 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           D +GNLY  DT N  +R+I DAG   +TT AG      G+ DG    A   N    + V 
Sbjct: 289 DAQGNLYFCDTNNNVVRRI-DAGTGVITTFAGNGVTTGGYGDGGPAAAAMLNAPWGIAVS 347

Query: 59  PTCSLLVIDRGNAALRQI 76
               + + D+GN+ +R++
Sbjct: 348 SKGEIYIADQGNSLIRKV 365


>gi|223939820|ref|ZP_03631690.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891508|gb|EEF57999.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 805

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 44/81 (54%), Gaps = 6/81 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 56
           +D  GNL VADT N  IRKI   G    VTT+AG   NV G  DG +  A+F N    + 
Sbjct: 358 IDKAGNLLVADTQNSEIRKIAPVGTNYVVTTLAGFAQNVGG-ADGTNAVARF-NSPRGIA 415

Query: 57  VRPTCSLLVIDRGNAALRQIS 77
           V    ++ V D+ N  +R+++
Sbjct: 416 VDAAGNVFVADQNNNTIRKLT 436



 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 46/81 (56%), Gaps = 6/81 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 56
           +D  GN++VAD  N  IRK+   G    VTTIA G++ +A + DG   +A F+    +  
Sbjct: 416 VDAAGNVFVADQNNNTIRKLTPVGTNWAVTTIA-GQAGMAYYGDGNGTNAYFNYPAGIA- 473

Query: 57  VRPTCSLLVIDRGNAALRQIS 77
           V  + +L V D GN  +R+++
Sbjct: 474 VDASGNLFVTDAGNHVIRKLT 494



 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 57
           D  GN+YVAD+ N  IRK+   G    VTTIAG   N  G  DG +  A+F      V V
Sbjct: 69  DSAGNIYVADSGNHVIRKMAAVGTNWVVTTIAGLPGN-PGSADGTNSAAQFYYP-AAVAV 126

Query: 58  RPTCSLLVIDRGNAALRQIS 77
             + ++ V D  N  +R+I+
Sbjct: 127 DNSGNVFVADSANYTIRKIA 146



 Score = 38.1 bits (87), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 43/81 (53%), Gaps = 6/81 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 56
           +D  G+++VAD  N  IRKI  AG    VTTIA GK  VA   +G   +A F N    V 
Sbjct: 242 VDAAGSIFVADYFNSTIRKITSAGTNWLVTTIA-GKVGVADSAEGTGTNAVF-NYPHAVA 299

Query: 57  VRPTCSLLVIDRGNAALRQIS 77
           V    ++ V D  N  +R+++
Sbjct: 300 VDTNGNVFVADSENYTIRKLT 320


>gi|423298428|ref|ZP_17276486.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
           CL03T12C18]
 gi|392663340|gb|EIY56891.1| hypothetical protein HMPREF1070_05151 [Bacteroides ovatus
           CL03T12C18]
          Length = 507

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS 49
           DD GN+Y+ D  N AIR + + GV+T+ GG  N +G  +G  +DAK  
Sbjct: 433 DDAGNIYIVDQDNHAIRMVNNTGVSTVIGG--NGSGAVNGVGKDAKLC 478


>gi|393782357|ref|ZP_10370541.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673185|gb|EIY66648.1| hypothetical protein HMPREF1071_01409 [Bacteroides salyersiae
           CL02T12C01]
          Length = 446

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 7   LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVI 66
           LY+ADT N  IRKI   G+ +   G +  +GF DG  E+A F   F V  +    ++ + 
Sbjct: 375 LYLADTNNHVIRKISQDGIVSTVIGLAGQSGFMDGTPEEALFDKPFGVA-LDTDGTIYIG 433

Query: 67  DRGNAALRQISL 78
           D  N  +R++++
Sbjct: 434 DSENQCVRRLAI 445


>gi|224125014|ref|XP_002319481.1| predicted protein [Populus trichocarpa]
 gi|222857857|gb|EEE95404.1| predicted protein [Populus trichocarpa]
          Length = 1065

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 46/82 (56%), Gaps = 5/82 (6%)

Query: 5   GNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +Y+AD+ N  I+K+  A   VTTIAG  +  AGF+DG +  A+ S    ++       
Sbjct: 824 GLIYIADSYNHKIKKLDLATKRVTTIAG--TGKAGFKDGKALTAQLSEPAGLIEAE-NGR 880

Query: 63  LLVIDRGNAALRQISLNQDDCE 84
           L++ D  N+ +R + LN+ + E
Sbjct: 881 LIIADTNNSVIRYLDLNKGEAE 902


>gi|380692433|ref|ZP_09857292.1| NHL repeat-containing protein [Bacteroides faecis MAJ27]
          Length = 439

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 47/81 (58%), Gaps = 8/81 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDA--GVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYV 57
           +D +GNLY+AD  N  IRK+      VTTIAG GK    G++DG   +A+F   +  VY+
Sbjct: 363 IDKEGNLYIADVNNHRIRKVDTKLNIVTTIAGSGK----GYKDGDPLEAQFDQPWG-VYL 417

Query: 58  RPTCSLLVIDRGNAALRQISL 78
                L + D+ N  +R++++
Sbjct: 418 DKNEFLYIADQNNHCIRKLAI 438


>gi|156399714|ref|XP_001638646.1| predicted protein [Nematostella vectensis]
 gi|156225768|gb|EDO46583.1| predicted protein [Nematostella vectensis]
          Length = 286

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 6/83 (7%)

Query: 1   MDDKGN-LYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 57
           +DD+G+ L+V DT N AIR I   D  V T+ GG     GF+DG    +KF +   + Y 
Sbjct: 160 LDDEGDRLFVGDTDNHAIRVISLKDGSVQTLVGGS---LGFKDGVGLKSKFYHPTGIAYD 216

Query: 58  RPTCSLLVIDRGNAALRQISLNQ 80
           R    L V D  N  +R + +++
Sbjct: 217 RENDILYVSDHYNHVIRAVKVSE 239


>gi|428175994|gb|EKX44881.1| hypothetical protein GUITHDRAFT_60229, partial [Guillardia theta
           CCMP2712]
          Length = 181

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 65/148 (43%), Gaps = 21/148 (14%)

Query: 3   DKGNLYVADTLNLAIRKIGDAG---VTTIAGG--KSNVAGFRDGPSEDAKFSNDFDVVYV 57
           D   L VADT N AIR+I DA    VTTIAG    S+  G +DGP+  A F++   +   
Sbjct: 30  DGSWLAVADTNNHAIRRI-DAMTGVVTTIAGCPRSSSCLGSQDGPASAASFNSPTSIALD 88

Query: 58  RPTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTCMLQQGFGP 117
                + V D  N  +RQI L            +   +  + GA   GY   +L      
Sbjct: 89  PQDKFIFVADTSNNMIRQIDL------------TAMTVTTIAGATRSGYADGLLA---AS 133

Query: 118 FFFSRTQQPSESEFNEETEIKELSNKEK 145
            F++ T   +  +F  E ++  +++ + 
Sbjct: 134 LFYNPTGITAHPDFTNENKLVMVADTQN 161


>gi|288923319|ref|ZP_06417453.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288345335|gb|EFC79730.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 752

 Score = 42.4 bits (98), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 43/79 (54%), Gaps = 6/79 (7%)

Query: 4   KGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGF--RDGPSEDAKFSNDFDVVYVRPT 60
           +G +YVADT N  IR+I   G +TT+AG  +  AGF    GP+  A       VV V P 
Sbjct: 429 RGVIYVADTENNRIRRIDRTGRITTVAG--NGTAGFDGDGGPATAASLQEPSGVV-VAPD 485

Query: 61  CSLLVIDRGNAALRQISLN 79
            +L + D GN  +R+I  N
Sbjct: 486 GTLFIADNGNHRIRRIGTN 504


>gi|158319004|ref|YP_001511512.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114409|gb|ABW16606.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 892

 Score = 42.4 bits (98), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           D  GN+Y+ D  N  IRKI  D  +TTIAG  +  AGF  DG    A   N  + V + P
Sbjct: 640 DKNGNIYIPDAQNYRIRKITPDGIITTIAG--TGTAGFSGDGGPATAAQINSAEKVAIGP 697

Query: 60  TCSLLVIDRGNAALRQIS 77
             S+ + D  N  +R+I+
Sbjct: 698 DGSIYIADYDNHRIRKIT 715


>gi|74317309|ref|YP_315049.1| hypothetical protein Tbd_1291 [Thiobacillus denitrificans ATCC
           25259]
 gi|74056804|gb|AAZ97244.1| conserved hypothetical protein containing twin-arginine
           translocation pathway signal [Thiobacillus denitrificans
           ATCC 25259]
          Length = 510

 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 13/98 (13%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS--NDFDVVYVR 58
           ++D G++ VAD  N  IR I + G  TI  G      F+DG + D  F+  ND D +   
Sbjct: 106 IEDNGSIKVADARNSLIRNISNLGDVTIFSGSMQQYPFKDGSAVDTNFNSPNDIDKL--- 162

Query: 59  PTCSLLVIDRGNAALRQISLN--------QDDCEYQYN 88
              + +V DR N A+R++  +        Q +C+ +YN
Sbjct: 163 SDGTYVVGDRENNAIRRVFADGSVKTIAGQGNCKNKYN 200


>gi|300772471|ref|ZP_07082341.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300760774|gb|EFK57600.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 441

 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 41/78 (52%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  GN+++AD  N  IR +   G+ +   G+   AG++DG   +++F+  + V  V   
Sbjct: 364 FDRAGNMFIADYGNHCIRMLSADGIVSTVAGQPTKAGYKDGGPVESQFNQPWGVA-VNDQ 422

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D  NA +R++ +
Sbjct: 423 GDIYIADWNNARIRKLVI 440


>gi|227537797|ref|ZP_03967846.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227242411|gb|EEI92426.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 441

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 41/78 (52%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  GN+++AD  N  IR +   G+ +   G+   AG++DG   +++F+  + V  V   
Sbjct: 364 FDRAGNMFIADYGNHCIRMLSADGIVSTVAGQPTKAGYKDGGPVESQFNQPWGVA-VNDQ 422

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D  NA +R++ +
Sbjct: 423 GDIYIADWNNARIRKLVI 440


>gi|255070387|ref|XP_002507275.1| predicted protein [Micromonas sp. RCC299]
 gi|226522550|gb|ACO68533.1| predicted protein [Micromonas sp. RCC299]
          Length = 131

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 46/81 (56%), Gaps = 4/81 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +   G++YVAD+ N  IR++   G VT IAG  S +AGF D     A+F+    +V + P
Sbjct: 35  VSSDGSVYVADSGNNRIRRVSSKGHVTWIAG--SGLAGFSDDKLLRAEFNRPQGIVTI-P 91

Query: 60  TCSLLVIDRGNAALRQISLNQ 80
           T  + V D  N  +R +SL++
Sbjct: 92  TGLIFVADTLNHRIRLVSLHE 112


>gi|455790313|gb|EMF42184.1| putative lipoprotein [Leptospira interrogans serovar Lora str. TE
           1992]
          Length = 294

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|302760605|ref|XP_002963725.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
 gi|300168993|gb|EFJ35596.1| hypothetical protein SELMODRAFT_79954 [Selaginella moellendorffii]
          Length = 1052

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 5/88 (5%)

Query: 5   GNLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +YVAD+ N  I+   +    V T+AG  + VAG+ DG S  A+FS    +  + P  S
Sbjct: 817 GAIYVADSYNHKIKLMDLASKTVRTVAG--TGVAGYEDGKSVKAQFSEPAGLA-LGPNGS 873

Query: 63  LLVIDRGNAALRQISLNQDDCEYQYNSI 90
           L V D  N  +R +    DD   Q  ++
Sbjct: 874 LFVADTNNNLIRLLKPGADDNAIQVTTL 901


>gi|410938688|ref|ZP_11370531.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
 gi|410786234|gb|EKR75182.1| putative lipoprotein [Leptospira noguchii str. 2006001870]
          Length = 358

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y  
Sbjct: 185 IDPERNLYVGELGNHTIRKIDLNSGNVSTLSGG---ISGYLDGDLASARFKSPLGIAYNH 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
            T SLL  D  +  +R+I+L
Sbjct: 242 KTDSLLAADIQDHRIRKINL 261


>gi|455647406|gb|EMF26379.1| hypothetical protein H114_24527 [Streptomyces gancidicus BKS 13-15]
          Length = 627

 Score = 42.0 bits (97), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 40/71 (56%), Gaps = 2/71 (2%)

Query: 6   NLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
            L+VAD+ + A+R++G D  V T  G      G++DGP++ A F +   V  + P  SL 
Sbjct: 364 GLWVADSESSALRRLGPDGTVRTAVGAGLFAFGYQDGPADGALFQHPLGVTAL-PDGSLA 422

Query: 65  VIDRGNAALRQ 75
           V D  N ALR+
Sbjct: 423 VSDTYNHALRR 433


>gi|24215844|ref|NP_713325.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45656856|ref|YP_000942.1| hypothetical protein LIC10968 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386074980|ref|YP_005989298.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417765006|ref|ZP_12412972.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
           Mallika]
 gi|418666993|ref|ZP_13228411.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
           2006006960]
 gi|418699338|ref|ZP_13260303.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
           L1111]
 gi|418723579|ref|ZP_13282417.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
 gi|418727930|ref|ZP_13286513.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
 gi|421083673|ref|ZP_15544545.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
 gi|421102269|ref|ZP_15562874.1| putative lipoprotein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421122244|ref|ZP_15582528.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
 gi|24197042|gb|AAN50343.1| hypothetical protein LA_3145 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45600092|gb|AAS69579.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353458770|gb|AER03315.1| hypothetical protein LIF_A2535 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400352606|gb|EJP04785.1| putative lipoprotein [Leptospira interrogans serovar Bulgarica str.
           Mallika]
 gi|409962948|gb|EKO26678.1| putative lipoprotein [Leptospira interrogans str. UI 12621]
 gi|410344796|gb|EKO95953.1| putative lipoprotein [Leptospira interrogans str. Brem 329]
 gi|410367934|gb|EKP23317.1| putative lipoprotein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410433778|gb|EKP78117.1| putative lipoprotein [Leptospira santarosai str. HAI1594]
 gi|410757315|gb|EKR18927.1| putative lipoprotein [Leptospira interrogans serovar Pyrogenes str.
           2006006960]
 gi|410761708|gb|EKR27881.1| putative lipoprotein [Leptospira interrogans serovar Bataviae str.
           L1111]
 gi|410777322|gb|EKR57287.1| putative lipoprotein [Leptospira interrogans str. UI 12758]
 gi|456822102|gb|EMF70597.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           LT1962]
          Length = 358

 Score = 42.0 bits (97), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|390957686|ref|YP_006421443.1| gluconolactonase [Terriglobus roseus DSM 18391]
 gi|390412604|gb|AFL88108.1| gluconolactonase [Terriglobus roseus DSM 18391]
          Length = 738

 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 37/76 (48%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D +GNLYVAD+ N  I ++   GV  +  G  +     DG +  A   N    V V P  
Sbjct: 78  DHEGNLYVADSRNHQIDRVSSGGVLAVVAGTGHQGYVGDGGAATAAELNAPTAVAVAPDG 137

Query: 62  SLLVIDRGNAALRQIS 77
           S+   D GN  +R+I+
Sbjct: 138 SVYFADSGNHCIRRIA 153


>gi|418714767|ref|ZP_13275259.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
 gi|410788941|gb|EKR82646.1| putative lipoprotein [Leptospira interrogans str. UI 08452]
          Length = 358

 Score = 42.0 bits (97), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|417772389|ref|ZP_12420278.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|418684105|ref|ZP_13245295.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|400324210|gb|EJO76509.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Kennewicki LC82-25]
 gi|409945760|gb|EKN95775.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Pomona]
 gi|455670390|gb|EMF35375.1| putative lipoprotein [Leptospira interrogans serovar Pomona str.
           Fox 32256]
          Length = 358

 Score = 42.0 bits (97), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|418707100|ref|ZP_13267936.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|410763306|gb|EKR34037.1| putative lipoprotein [Leptospira interrogans serovar Hebdomadis
           str. R499]
          Length = 358

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|410098158|ref|ZP_11293137.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222431|gb|EKN15373.1| hypothetical protein HMPREF1076_02315 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 446

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  G +YVAD  N  IRKI   GV +   G + V G+ DG  EDA F     V  +   
Sbjct: 369 FDQDGVMYVADAGNHVIRKITRDGVVSTVIGIAGVKGYVDGSPEDALFQYPTGVA-IDKE 427

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ V D  N  +R++++
Sbjct: 428 GTIYVGDARNNCVRKLAI 445


>gi|290989149|ref|XP_002677205.1| predicted protein [Naegleria gruberi]
 gi|284090811|gb|EFC44461.1| predicted protein [Naegleria gruberi]
          Length = 691

 Score = 41.6 bits (96), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +  +G +   D  N  I KI  D  + TIAG  +   G++DGP   AKF+N  D+     
Sbjct: 314 ISQEGEILFTDIYNNRICKINRDRTLVTIAG--TTEEGYQDGPVRMAKFNNPSDIAIDNE 371

Query: 60  TCSLLVIDRGNAALRQIS 77
           T  + V+D GN  +R+IS
Sbjct: 372 TGDVYVVDGGNNYIRKIS 389


>gi|417783289|ref|ZP_12431009.1| putative lipoprotein [Leptospira interrogans str. C10069]
 gi|409953418|gb|EKO07917.1| putative lipoprotein [Leptospira interrogans str. C10069]
          Length = 358

 Score = 41.6 bits (96), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI  ++G V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGMVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|291223251|ref|XP_002731624.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 434

 Score = 41.6 bits (96), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 4   KGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           K   +VADT N AIR++   D  VTTIAGG+    GF+DG    A F +   V  + P  
Sbjct: 194 KDRFFVADTDNHAIREVSLPDGEVTTIAGGEK---GFKDGKGTGATFYHPAGVT-IDPIR 249

Query: 62  SLLVI-DRGNAALRQISL 78
           ++L I D  N A+R I +
Sbjct: 250 NILFIADHYNHAIRMIGV 267


>gi|421117009|ref|ZP_15577380.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           Fiocruz LV133]
 gi|410011441|gb|EKO69561.1| putative lipoprotein [Leptospira interrogans serovar Canicola str.
           Fiocruz LV133]
          Length = 358

 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|418688943|ref|ZP_13250072.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
 gi|421128059|ref|ZP_15588277.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. 2006006986]
 gi|421134520|ref|ZP_15594655.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. Andaman]
 gi|400361935|gb|EJP17894.1| putative lipoprotein [Leptospira interrogans str. FPW2026]
 gi|410021314|gb|EKO88104.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. Andaman]
 gi|410434526|gb|EKP83664.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. 2006006986]
          Length = 358

 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|225872818|ref|YP_002754275.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
           51196]
 gi|225794459|gb|ACO34549.1| hypothetical protein ACP_1177 [Acidobacterium capsulatum ATCC
           51196]
          Length = 855

 Score = 41.6 bits (96), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           +D  GN+Y+AD+ N  IR++ +  + T+AG  +  AG+  DG +  +   +D   V V  
Sbjct: 142 VDSNGNIYIADSHNNRIREVSNGIINTVAG--NGTAGYSGDGAAATSAMLDDPTAVAVDA 199

Query: 60  TCSLLVIDRGNAALRQIS 77
           + ++ + D GN  +R ++
Sbjct: 200 SGNIYIADTGNQRIRNVA 217


>gi|442323754|ref|YP_007363775.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
 gi|441491396|gb|AGC48091.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
          Length = 906

 Score = 41.6 bits (96), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           G L + DT N  IR +    V T AG  S   G RDG +E A FS    +  + P  ++L
Sbjct: 734 GRLLLTDTGNERIRMLVGGRVRTYAG--SGAHGARDGTAEQATFSLPTGIAAL-PNGNML 790

Query: 65  VIDRGNAALRQISLNQDD 82
           V+D+G + LR++ +   D
Sbjct: 791 VVDQGASTLRELRMPAAD 808



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 6/80 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYV 57
           +D  GN++VADT N A+R+I  DA   VTTIAG      G  +GP       +   +  V
Sbjct: 509 VDGLGNIFVADTGNHAVRRIAPDAARTVTTIAG--LGTPGVGEGPGATTALRSPQSIA-V 565

Query: 58  RPTCSLLVIDRGNAALRQIS 77
            P  +L V D GN  + +I+
Sbjct: 566 APDGTLYVADTGNHRIVRIA 585


>gi|260904571|ref|ZP_05912893.1| NHL repeat-containing protein [Brevibacterium linens BL2]
          Length = 647

 Score = 41.6 bits (96), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 16/81 (19%)

Query: 6   NLYVADTLNLAIRKIG--DAGVTTIAGGKS--------NVAGFR------DGPSEDAKFS 49
           +L VADT+N  +R I      VTT+AG  S        NV G        DGP+ D K S
Sbjct: 258 HLVVADTVNHTLRGINLDTETVTTVAGTGSQHMVGAIDNVVGTHGELGRYDGPALDVKLS 317

Query: 50  NDFDVVYVRPTCSLLVIDRGN 70
           + +DV+Y+  T  ++V   GN
Sbjct: 318 SPWDVLYIPATAEVVVAMAGN 338



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 52/104 (50%), Gaps = 4/104 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +   G++++AD+   AIR++      V+T+ G      GFRDGP+ +A+  +   V  + 
Sbjct: 380 LSSDGDVFIADSETSAIRRLDPSTGAVSTLIGEGLFDFGFRDGPAAEARLQHPLGVRSL- 438

Query: 59  PTCSLLVIDRGNAALRQISLNQDDCEYQYNSI-SPTDILMVVGA 101
           P  S+ + D  N A+R+     ++       +  P+DI ++  A
Sbjct: 439 PDGSIAIADTYNGAIRRYDFTTNEVSTLARGLREPSDIFVLEAA 482


>gi|417763451|ref|ZP_12411429.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
 gi|417776933|ref|ZP_12424763.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
 gi|418671130|ref|ZP_13232484.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
 gi|409940657|gb|EKN86296.1| putative lipoprotein [Leptospira interrogans str. 2002000624]
 gi|410573292|gb|EKQ36344.1| putative lipoprotein [Leptospira interrogans str. 2002000621]
 gi|410581799|gb|EKQ49606.1| putative lipoprotein [Leptospira interrogans str. 2002000623]
          Length = 358

 Score = 41.6 bits (96), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D + NLYV +  N  IRKI      V+T++GG   ++G+ DG    A+F +   + Y R
Sbjct: 185 IDPERNLYVGELNNHTIRKINLNSGTVSTLSGG---ISGYLDGDLASARFKSPLGIAYNR 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              SLL  D  +  +R+I L
Sbjct: 242 KMNSLLAADIQDHRIRKIDL 261


>gi|347754243|ref|YP_004861807.1| gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
 gi|347586761|gb|AEP11291.1| Gluconolactonase [Candidatus Chloracidobacterium thermophilum B]
          Length = 1222

 Score = 41.6 bits (96), Expect = 0.61,   Method: Composition-based stats.
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRK-IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D +G LY+AD  N AIRK + D  + T+AGG     GF+DG   +A+F+    +   R 
Sbjct: 648 LDAQGVLYIADHFNHAIRKLLPDGRLVTLAGGGQR--GFQDGYGPEAQFNGPLGLAVGRD 705

Query: 60  TCSLLVIDRGNAALRQIS 77
              L V D  N  +R+++
Sbjct: 706 G-ELYVADHLNMRIRKVT 722


>gi|325965074|ref|YP_004242980.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
           phenanthrenivorans Sphe3]
 gi|323471161|gb|ADX74846.1| thiol-disulfide isomerase-like thioredoxin [Arthrobacter
           phenanthrenivorans Sphe3]
          Length = 674

 Score = 41.6 bits (96), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 2   DDKGNLYVADTLNLAIRKI-----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 56
           D  GN++VAD+   A+RK+     G   V +  G      GFRDGP+ +A+  +   V  
Sbjct: 408 DADGNIWVADSETSALRKLVIDDDGGVAVESAIGKGLFDFGFRDGPAAEARLQHPLGVT- 466

Query: 57  VRPTCSLLVIDRGNAALRQ 75
           V P  S+ + D  N A+R+
Sbjct: 467 VLPDGSVAIADTYNGAVRR 485


>gi|251771416|gb|EES51995.1| conserved protein of unknown function [Leptospirillum
           ferrodiazotrophum]
          Length = 373

 Score = 41.2 bits (95), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 5   GNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           G LYVADT N  IRK+    G A VTTIAG  ++ A FRDG    A+F+N   +      
Sbjct: 68  GTLYVADTGNDLIRKVTVSGGTATVTTIAG-VNHHARFRDGNGTAARFNNPEGLAISSDG 126

Query: 61  CSLLVIDRGNAALRQISLNQ 80
            +L + D  N  +R++ L+ 
Sbjct: 127 KTLYIADSRNNRIRKMDLSS 146


>gi|386722458|ref|YP_006188784.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus K02]
 gi|384089583|gb|AFH61019.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus K02]
          Length = 537

 Score = 41.2 bits (95), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           G+LYVAD  N AIR+I   G VTT+AG  S   G +DG    A+F    DV    P  +L
Sbjct: 139 GSLYVADAGNHAIRRIDAKGNVTTVAG--SGRLGVKDGKGAAAEFYRPGDVAAA-PDGTL 195

Query: 64  LVIDRGNAALRQIS 77
            V D     +R+IS
Sbjct: 196 YVADTLGHTIRRIS 209



 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 43/88 (48%), Gaps = 13/88 (14%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPSEDAKFSNDF 52
           G LYVADTL   IR+I   G VTT+            G+   AG F DGP   AKF N+ 
Sbjct: 193 GTLYVADTLGHTIRRISPQGEVTTLTAPSKRVVEATPGQVAAAGDFADGPLAQAKF-NEP 251

Query: 53  DVVYVRPTCSLLVIDRGNAALRQISLNQ 80
             + +    +L V D GN  +R I L +
Sbjct: 252 TGIALDAKGNLYVSDSGNQRVRYIDLAK 279


>gi|409099442|ref|ZP_11219466.1| NHL repeat containing protein [Pedobacter agri PB92]
          Length = 754

 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
            D  G+LY+AD  N  I+KI   G VTT+AG  S   G  DG +  A F+N   V   R 
Sbjct: 174 FDLAGDLYLADQKNNKIKKISSTGFVTTVAG--SGATGSLDGNAASATFNNPAGVAVDRA 231

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ V D GN  +R+I+
Sbjct: 232 -GNIYVADAGNNKIRKIT 248



 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  GN+YVAD  N  IRKI  AG V+T+AG  +  AG  DG   +A+      +  + P
Sbjct: 228 VDRAGNIYVADAGNNKIRKITPAGVVSTLAGNGT--AGSTDGTGTNARLYYPTGIT-IDP 284

Query: 60  TCSLLVIDRGNAALRQIS 77
             +L + D  N  +R+IS
Sbjct: 285 LDNLYIADYRNNRIRKIS 302


>gi|303274116|ref|XP_003056382.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462466|gb|EEH59758.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 490

 Score = 41.2 bits (95), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 1  MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSN 50
          +D  G ++VAD+ N  IR+I   GV T+  G S  AGF DG   +A+F+N
Sbjct: 9  LDGNGAIFVADSGNNRIRQIKSGGVVTVLAG-SGTAGFLDGSLLEAEFNN 57


>gi|302786076|ref|XP_002974809.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
 gi|300157704|gb|EFJ24329.1| hypothetical protein SELMODRAFT_101985 [Selaginella moellendorffii]
          Length = 1049

 Score = 41.2 bits (95), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 5   GNLYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +YVADT N  I+   +    V T+AG  + VAG+ DG    A+FS    +  + P  S
Sbjct: 814 GAVYVADTYNHKIKLMDLASKTVRTVAG--TGVAGYEDGKGVKAQFSEPAGLA-LGPNGS 870

Query: 63  LLVIDRGNAALRQISLNQDDCEYQYNSI 90
           L V D  N  +R +    DD   Q  ++
Sbjct: 871 LFVADTNNNVIRLLKPRPDDNAIQVTTL 898


>gi|312198347|ref|YP_004018408.1| serine/threonine protein kinase [Frankia sp. EuI1c]
 gi|311229683|gb|ADP82538.1| serine/threonine protein kinase [Frankia sp. EuI1c]
          Length = 814

 Score = 40.8 bits (94), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           MD  GNLY AD  N  +R+I  D  +TT+AG G+   +G  DG    A   N    V + 
Sbjct: 511 MDGFGNLYFADFDNNRVRRISPDGTITTVAGNGQPGFSG--DGGPATAAMLNKPVAVAIG 568

Query: 59  PTCSLLVIDRGNAALRQIS 77
           P  +L ++D  N  +RQ+S
Sbjct: 569 PGGTLYIVDTFNMRVRQVS 587



 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVR 58
           +D +G +Y+AD+ N  IR+IG D  + TIAG  + VAG+  DG    A    D   V V 
Sbjct: 681 LDRQGRIYIADSYNHKIRRIGLDGVIETIAG--TGVAGYSGDGGKATAATLRDPRGVTVD 738

Query: 59  PTCSLLVIDRGNAALRQI 76
              ++ + D GN  +R+I
Sbjct: 739 AAGNVYITDSGNNRVRRI 756



 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVRP 59
           +D  G+L++AD  N  +R++G  G+ T   G+     + DG P+  A  S  F+V   R 
Sbjct: 625 IDSAGDLFIADNGNDIVRRVGVDGIITTVAGRFGYGSWGDGKPATQAMISKPFNVALDR- 683

Query: 60  TCSLLVIDRGNAALRQISLN 79
              + + D  N  +R+I L+
Sbjct: 684 QGRIYIADSYNHKIRRIGLD 703


>gi|255530479|ref|YP_003090851.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
 gi|255343463|gb|ACU02789.1| NHL repeat containing protein [Pedobacter heparinus DSM 2366]
          Length = 453

 Score = 40.8 bits (94), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D+ GN++VAD  N  IR +    + T   G+   +G++DG   ++ F+  + V  V   
Sbjct: 376 FDNSGNMFVADYGNHCIRMVSADNIVTTVAGQPGKSGYKDGGPVESLFNQPWGVA-VNEQ 434

Query: 61  CSLLVIDRGNAALRQISL 78
             + + D  NA +R++ +
Sbjct: 435 GDIYIADWSNARIRKLVI 452


>gi|428186137|gb|EKX54988.1| hypothetical protein GUITHDRAFT_59145, partial [Guillardia theta
          CCMP2712]
          Length = 231

 Score = 40.8 bits (94), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 6/71 (8%)

Query: 8  YVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 65
          +V D L+  +RK+  +   V+  AG  S + GFRDG + DA+FS    +++ R    L V
Sbjct: 1  FVTDRLDHRVRKVSKSNNFVSYFAG--SGIQGFRDGAAGDARFSEPLGIIHFRN--KLYV 56

Query: 66 IDRGNAALRQI 76
           D  N A+R I
Sbjct: 57 ADSNNHAIRSI 67


>gi|320107313|ref|YP_004182903.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
 gi|319925834|gb|ADV82909.1| NHL repeat containing protein [Terriglobus saanensis SP1PR4]
          Length = 1821

 Score = 40.8 bits (94), Expect = 0.89,   Method: Composition-based stats.
 Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYV 57
           MD  G LY+ADT N  IR+I    G+ T+  G   VA F    GP+   +F+N   +  +
Sbjct: 300 MDAAGFLYIADTGNDTIRRIDPSTGMITLFAGIPTVADFAGDGGPASAGRFNNPLGLA-I 358

Query: 58  RPTCSLLVIDRGNAALRQIS 77
               SL + D+ N  +R I+
Sbjct: 359 DSGGSLYIADQANHRIRMIT 378


>gi|405960880|gb|EKC26754.1| NHL repeat-containing protein 2 [Crassostrea gigas]
          Length = 699

 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 7/90 (7%)

Query: 7   LYVADTLNLAIRKIG--DAGVTTIAGGKSN-----VAGFRDGPSEDAKFSNDFDVVYVRP 59
           L+VAD+ +  +R +   D  V  + GG+ +       G +DG   DAK  +   V +V  
Sbjct: 414 LFVADSESSTVRSVALKDGAVKHVVGGERDPMNLFAYGDKDGTGIDAKLQHPLGVAWVAD 473

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNS 89
           + S+LV D  N  L+ + +++  C   + S
Sbjct: 474 SGSILVADSYNHKLKLVDISKKQCTTVWGS 503


>gi|290983166|ref|XP_002674300.1| predicted protein [Naegleria gruberi]
 gi|284087889|gb|EFC41556.1| predicted protein [Naegleria gruberi]
          Length = 1435

 Score = 40.8 bits (94), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 20/78 (25%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV-VYVRP 59
           + + G +Y+ADT N  IRK+ + G+     G     G+        +   +F   + V  
Sbjct: 228 VSNSGEIYIADTFNCVIRKVSNLGIIGTVAGVGGQCGYNQDSINATESKLNFPTGITVSD 287

Query: 60  TCSLLVIDRGNAALRQIS 77
           +  L ++D+GN  +R++S
Sbjct: 288 SGDLYIVDKGNHRIRKVS 305


>gi|325104746|ref|YP_004274400.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973594|gb|ADY52578.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 1769

 Score = 40.8 bits (94), Expect = 0.95,   Method: Composition-based stats.
 Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 3/49 (6%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 48
            MD   N+YV+++ N  IRKI  AG V+T+AG    V G RDG  E+A+F
Sbjct: 1114 MDASDNIYVSESSNHRIRKITPAGEVSTVAG--KGVQGNRDGTKEEAQF 1160



 Score = 38.1 bits (87), Expect = 5.8,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 44/77 (57%), Gaps = 5/77 (6%)

Query: 2   DDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D +GN+YVA+  N  IRKI  D  V+T AG   +  G+ DG    AKF N    + +  +
Sbjct: 901 DAEGNIYVAEYGNSLIRKITADGQVSTFAG---STFGYADGIGTAAKF-NGPQGMAIDAS 956

Query: 61  CSLLVIDRGNAALRQIS 77
            ++ + +RG + +R+I+
Sbjct: 957 GNIYIAERGASRIRKIT 973



 Score = 37.7 bits (86), Expect = 7.7,   Method: Composition-based stats.
 Identities = 23/50 (46%), Positives = 32/50 (64%), Gaps = 3/50 (6%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFS 49
            +D  GN+Y+AD LN +IRKI   G V+T AG  +  AG+ DG    A+F+
Sbjct: 1007 IDPWGNIYIADDLNHSIRKITPNGTVSTFAG--NGTAGYADGVGVLARFN 1054


>gi|337745906|ref|YP_004640068.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus KNP414]
 gi|336297095|gb|AEI40198.1| copper amine oxidase domain protein [Paenibacillus mucilaginosus
           KNP414]
          Length = 537

 Score = 40.8 bits (94), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           G+LYVAD  N AIR+I   G VTT+AG  S   G +DG    A+F    DV    P  +L
Sbjct: 139 GSLYVADAGNHAIRRIDAKGNVTTVAG--SGRLGAKDGKGAAAEFYRPGDVAAA-PDGTL 195

Query: 64  LVIDRGNAALRQIS 77
            V D     +R+IS
Sbjct: 196 YVADTLGHTIRRIS 209



 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 43/88 (48%), Gaps = 13/88 (14%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPSEDAKFSNDF 52
           G LYVADTL   IR+I   G VTT+            G+   AG F DGP   AKF N+ 
Sbjct: 193 GTLYVADTLGHTIRRISPQGEVTTLTAPSRRVVEATPGQVAAAGDFADGPLAQAKF-NEP 251

Query: 53  DVVYVRPTCSLLVIDRGNAALRQISLNQ 80
             + +    +L V D GN  +R I L +
Sbjct: 252 TGIALDAKGNLYVSDSGNQRIRYIDLAK 279



 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 11/86 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKI--GDAGVTTIAGGKSNV--------AGFRDGPSEDAKFSN 50
           +D KGNLYV+D+ N  IR I      VTT+AGG +           GF +G +  A+ + 
Sbjct: 256 LDAKGNLYVSDSGNQRIRYIDLAKGTVTTVAGGGTAAELKDMYVPGGFSNGAALQARLNY 315

Query: 51  DFDVVYVRPTCSLLVIDRGNAALRQI 76
              +  V     LL+ D  N A+R +
Sbjct: 316 PMGIA-VTEEGGLLIADSQNHAVRYL 340


>gi|379719861|ref|YP_005311992.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus 3016]
 gi|378568533|gb|AFC28843.1| copper amine oxidase domain-containing protein [Paenibacillus
           mucilaginosus 3016]
          Length = 535

 Score = 40.8 bits (94), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           G+LYVAD  N AIR+I   G VTT+AG  S   G +DG    A+F    DV    P  +L
Sbjct: 139 GSLYVADAGNHAIRRIDAKGNVTTVAG--SGRLGAKDGKGAAAEFYRPGDVAAA-PDGTL 195

Query: 64  LVIDRGNAALRQIS 77
            V D     +R+IS
Sbjct: 196 YVADTLGHTIRRIS 209



 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 43/88 (48%), Gaps = 13/88 (14%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAG----------GKSNVAG-FRDGPSEDAKFSNDF 52
           G LYVADTL   IR+I   G VTT+            G+   AG F DGP   AKF N+ 
Sbjct: 193 GTLYVADTLGHTIRRISPQGEVTTLTAPSRRVVEATPGQVAAAGDFADGPLAQAKF-NEP 251

Query: 53  DVVYVRPTCSLLVIDRGNAALRQISLNQ 80
             + +    +L V D GN  +R I L +
Sbjct: 252 TGIALDAKGNLYVSDSGNQRIRYIDLAK 279


>gi|290995833|ref|XP_002680487.1| serine/threonine kinase [Naegleria gruberi]
 gi|284094108|gb|EFC47743.1| serine/threonine kinase [Naegleria gruberi]
          Length = 1078

 Score = 40.8 bits (94), Expect = 0.99,   Method: Composition-based stats.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 34
           +D KG++Y ADT N  IRK+ +  +TTIAG K N
Sbjct: 197 VDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQN 230


>gi|37522270|ref|NP_925647.1| hypothetical protein gll2701 [Gloeobacter violaceus PCC 7421]
 gi|35213270|dbj|BAC90642.1| gll2701 [Gloeobacter violaceus PCC 7421]
          Length = 351

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 4/79 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D  GN+ VADT N  IR IG D  + T+AG G++  +G   GP+  A+    + V  V 
Sbjct: 161 VDALGNVLVADTYNHRIRTIGPDGTIRTVAGTGQAGYSG-DGGPATAARLDFPWGVA-VA 218

Query: 59  PTCSLLVIDRGNAALRQIS 77
           P   +L+ D GN  +R I 
Sbjct: 219 PDGRILIADTGNNRIRSIG 237



 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GNLY+AD  +  +R++G D  + TIAG      G   GP++ A+F ND   V V   
Sbjct: 106 DRAGNLYIADHNHHRVRRVGTDGRIETIAGTGEADYGGDGGPAKQARF-NDPAGVAVDAL 164

Query: 61  CSLLVIDRGNAALRQIS 77
            ++LV D  N  +R I 
Sbjct: 165 GNVLVADTYNHRIRTIG 181


>gi|373955343|ref|ZP_09615303.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
 gi|373891943|gb|EHQ27840.1| Ig family protein [Mucilaginibacter paludis DSM 18603]
          Length = 929

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/49 (53%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKF 48
           +D  GNLYV+D  N  IRKI  AG VTT+AG  S +A F DG    A F
Sbjct: 261 VDGAGNLYVSDNGNNLIRKIDLAGAVTTLAG--SGMAAFADGTGTAASF 307



 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  GN+YVAD    +I+KI   G  T+  G  ++ G  +G    A F   F V  V   
Sbjct: 207 VDLNGNVYVADAATNSIKKITPTGSVTVVAGNGSM-GSNNGIGSAASFYYPFSVT-VDGA 264

Query: 61  CSLLVIDRGNAALRQISL 78
            +L V D GN  +R+I L
Sbjct: 265 GNLYVSDNGNNLIRKIDL 282


>gi|413944258|gb|AFW76907.1| hypothetical protein ZEAMMB73_849037 [Zea mays]
          Length = 187

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/24 (66%), Positives = 21/24 (87%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG 24
           +DD+GN+YVAD +N+AIRKI D G
Sbjct: 156 VDDRGNIYVADGMNMAIRKISDTG 179


>gi|433610083|ref|YP_007042452.1| Alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
           allergen [Saccharothrix espanaensis DSM 44229]
 gi|407887936|emb|CCH35579.1| Alkyl hydroperoxide reductase/ thiol specific antioxidant/ Mal
           allergen [Saccharothrix espanaensis DSM 44229]
          Length = 630

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 3/82 (3%)

Query: 6   NLYVADTLNLAIRKIG-DAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           ++ VADT+N  +R +  D G VTT+AG G+    G  DGP+ D   ++ +DV +  P   
Sbjct: 271 HVVVADTVNHLLRGLNLDTGEVTTVAGTGEQWRGGETDGPALDIPLTSPWDVAWWEPAGG 330

Query: 63  LLVIDRGNAALRQISLNQDDCE 84
           +++   GN  L   +  +D  E
Sbjct: 331 VVIALAGNHTLGLFNPAEDRLE 352


>gi|223939666|ref|ZP_03631540.1| NHL repeat containing protein [bacterium Ellin514]
 gi|223891717|gb|EEF58204.1| NHL repeat containing protein [bacterium Ellin514]
          Length = 452

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/53 (47%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG----VTTIAGGKSNVAGFRDGPSEDAKFS 49
           +D  GN+YVADT N AIRK+   G    VTTIAG  +   G  DG +  A F+
Sbjct: 240 LDSAGNMYVADTGNNAIRKLTLEGTNWVVTTIAGSTNQQNGSLDGTNNQALFT 292



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 2   DDKGNLYVADTLNLAIRKI---GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           D  GNL+V D  +  IR++   G   V T   G + + GF+DG + DA F +   +  + 
Sbjct: 184 DTNGNLFVTDGYD-TIRQLRPDGTNWVVTTIAGAAGIHGFKDGTNNDALFYSPIGLA-LD 241

Query: 59  PTCSLLVIDRGNAALRQISL 78
              ++ V D GN A+R+++L
Sbjct: 242 SAGNMYVADTGNNAIRKLTL 261



 Score = 37.4 bits (85), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 20/100 (20%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG---VTTIAGGKSNVAGFR---DGPSEDAKFSNDFDV 54
           +D  GN+YVAD+ N  IRK+   G   V +  GG+   AGF    DG S  A F N   V
Sbjct: 299 VDSAGNVYVADSYNYTIRKVTVVGTDYVVSTVGGR---AGFSYPLDGVSTKAAFVNPVSV 355

Query: 55  VYVRPTCSLLVIDR-GNAALRQISLNQDDCEYQYNSISPT 93
                      +D+ GN  +   S N   C +  +  SPT
Sbjct: 356 ----------ALDKNGNLYVADHSYNTVRCGWVESMASPT 385


>gi|418707795|ref|ZP_13268612.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. UI 08368]
 gi|410771897|gb|EKR47094.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. UI 08368]
 gi|456968671|gb|EMG09839.1| putative lipoprotein [Leptospira interrogans serovar Grippotyphosa
           str. LT2186]
          Length = 358

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D + NLYV +  N  IRKI  ++G  +I  G   ++G+ DG    A+F +   + Y R 
Sbjct: 185 IDPERNLYVGELSNHTIRKINLNSGTVSILSG--GISGYLDGDLASARFKSPLGIAYNRK 242

Query: 60  TCSLLVIDRGNAALRQISL 78
             SLL  D  +  +R+I L
Sbjct: 243 MNSLLAADIQDHRIRKIDL 261


>gi|295394954|ref|ZP_06805167.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
           49030]
 gi|294972287|gb|EFG48149.1| NHL repeat containing protein [Brevibacterium mcbrellneri ATCC
           49030]
          Length = 655

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 54/106 (50%), Gaps = 18/106 (16%)

Query: 5   GNLYVADTLNLAIRKI--GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           GN++VAD+   A+RK+      VT+  G      GF+DGP+E A+  +    + + P  +
Sbjct: 399 GNVWVADSETSALRKLDPSTGAVTSYVGVGLFDFGFQDGPAESARLQHPLGALEL-PDGT 457

Query: 63  LLVIDRGNAALRQISLNQDDCEYQYNSIS--------PTDILMVVG 100
           + + D  N A+R+        + Q N++S        P+DIL++ G
Sbjct: 458 VAIADTYNGAIRRF-------DPQTNTVSTLARGLKEPSDILLMQG 496


>gi|342182683|emb|CCC92162.1| putative flagellum-adhesion glycoprotein [Trypanosoma congolense
           IL3000]
          Length = 585

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)

Query: 17  IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 76
           IR +   G+ TIAG K+ V G +DGP+  A F+N   V  +     + V DR N  +R+I
Sbjct: 97  IRSVNTTGIDTIAGSKT-VRGNQDGPAAAALFNNPTSVAGIGD--DIFVADRDNKCIRRI 153

Query: 77  SLNQDDCEY-QYNSISPTDILM 97
             + +   + Q     PTDI++
Sbjct: 154 DASGNVTTFGQVTLNQPTDIIV 175


>gi|383784367|ref|YP_005468936.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
 gi|383083279|dbj|BAM06806.1| hypothetical protein LFE_1113 [Leptospirillum ferrooxidans C2-3]
          Length = 382

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 5   GNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           G++YVADT N  IRKI    G   V TIAG  ++ A FRDG    A+F+N   +      
Sbjct: 77  GSIYVADTGNDMIRKITMSSGTGTVETIAG-VNHHARFRDGAGAAARFNNPEGLAISPDG 135

Query: 61  CSLLVIDRGNAALRQISL 78
            +L V D  N  +R+I L
Sbjct: 136 KTLYVADSRNNRIRKIDL 153


>gi|359777541|ref|ZP_09280822.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
           12137]
 gi|359305319|dbj|GAB14651.1| hypothetical protein ARGLB_073_01730 [Arthrobacter globiformis NBRC
           12137]
          Length = 666

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 6/79 (7%)

Query: 2   DDKGNLYVADTLNLAIRK--IGDAGVTTI--AGGKSNVA-GFRDGPSEDAKFSNDFDVVY 56
           D  GN++VAD+   A+RK  I D G  T+  A GK     GFRDG + +A+  +   V  
Sbjct: 400 DADGNIWVADSETSALRKLVISDDGAVTVESAVGKGLFDFGFRDGEASEARLQHPLGVT- 458

Query: 57  VRPTCSLLVIDRGNAALRQ 75
           V P  S+ + D  N A+R+
Sbjct: 459 VLPDGSVAIADTYNGAVRR 477


>gi|323343289|ref|ZP_08083516.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
           33269]
 gi|323095108|gb|EFZ37682.1| hypothetical protein HMPREF0663_10051 [Prevotella oralis ATCC
           33269]
          Length = 436

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  GN+YVAD  N  IR+I    +     G     G++DG   +A F N+   + +    
Sbjct: 360 DADGNIYVADRNNHCIRRISPDDMVETVLGMPETKGWKDGAKSEALF-NEPTGIGIGKDG 418

Query: 62  SLLVIDRGNAALRQISLN 79
           ++ V D GN  +R++++N
Sbjct: 419 AVYVADWGNGRVRKLTIN 436


>gi|116621305|ref|YP_823461.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224467|gb|ABJ83176.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 1042

 Score = 40.0 bits (92), Expect = 1.7,   Method: Composition-based stats.
 Identities = 29/79 (36%), Positives = 45/79 (56%), Gaps = 5/79 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D  GNLY+A   N  + K+  AG VTT+AG G S       GP+ +A FS  +  + + 
Sbjct: 95  LDSSGNLYLAGGNN--VFKLTSAGKVTTVAGNGTSGTYTGDGGPAINAGFSGAY-AIALD 151

Query: 59  PTCSLLVIDRGNAALRQIS 77
            T +L + D GN A+R+++
Sbjct: 152 STGNLYICDSGNNAIRKVT 170



 Score = 38.9 bits (89), Expect = 3.3,   Method: Composition-based stats.
 Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 4/75 (5%)

Query: 5   GNLYVADTLNLAIRKIG-DAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G++Y  DT N  +RKIG D  +TTIAG G    +G   GP+  A  S   D + +  T +
Sbjct: 375 GSIYFTDTANHRVRKIGTDGKITTIAGTGDLGFSG-DGGPATQATMSFP-DALAIDSTNN 432

Query: 63  LLVIDRGNAALRQIS 77
           L VID+    +R+I+
Sbjct: 433 LYVIDQKQLRVRKIT 447


>gi|375146499|ref|YP_005008940.1| cell surface receptor IPT/TIG domain-containing protein [Niastella
           koreensis GR20-10]
 gi|361060545|gb|AEV99536.1| cell surface receptor IPT/TIG domain protein [Niastella koreensis
           GR20-10]
          Length = 500

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           MD  GN+YV D     I ++   G ++ IAGG    +G+ DG  +DA+FS    +V +  
Sbjct: 424 MDGVGNIYVTDDSTNGIYRVDPNGKLSLIAGGVR--SGYIDGKPQDARFSGPRGIV-IDA 480

Query: 60  TCSLLVIDRGNAALRQISL 78
           + +L V D GN  +R+I +
Sbjct: 481 SGNLFVADIGNNCIRKIIM 499


>gi|292491957|ref|YP_003527396.1| hypothetical protein Nhal_1896 [Nitrosococcus halophilus Nc4]
 gi|291580552|gb|ADE15009.1| YD repeat protein [Nitrosococcus halophilus Nc4]
          Length = 2539

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 4/81 (4%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVR 58
            +D  GNLY+A++ N  IR++G  G+ T+  G     GF    G +  A+ ++  D+  + 
Sbjct: 1330 LDSDGNLYIAESQNNVIRRVGPDGLITLVAGNG-TRGFSGDGGLATQARLNHPNDLA-IG 1387

Query: 59   PTCSLLVIDRGNAALRQISLN 79
            P  SL + D  N  +R +S N
Sbjct: 1388 PDGSLYIADYDNHRIRLVSPN 1408


>gi|153806007|ref|ZP_01958675.1| hypothetical protein BACCAC_00252 [Bacteroides caccae ATCC 43185]
 gi|149130684|gb|EDM21890.1| IPT/TIG domain protein [Bacteroides caccae ATCC 43185]
          Length = 440

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFS 49
           +DD GN+YV D +N  +RKI  D  VTT+AG   N  G  DG    A+F+
Sbjct: 169 VDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAGN-KGCIDGTGVQARFN 217


>gi|393789630|ref|ZP_10377750.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
           CL02T12C05]
 gi|392650346|gb|EIY44015.1| hypothetical protein HMPREF1068_04030 [Bacteroides nordii
           CL02T12C05]
          Length = 435

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 1/75 (1%)

Query: 4   KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           + +L++AD+ N  IRKI   GV +   G++ + G +DG  E A F+  F V  V    ++
Sbjct: 361 ENDLFLADSNNHVIRKITQDGVVSTVIGQAGMTGNQDGTPEVALFNEPFGVA-VDTDGTI 419

Query: 64  LVIDRGNAALRQISL 78
            + D  N ++R++++
Sbjct: 420 YIGDSKNQSIRRLAI 434


>gi|423219560|ref|ZP_17206056.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
           CL03T12C61]
 gi|392624765|gb|EIY18843.1| hypothetical protein HMPREF1061_02829 [Bacteroides caccae
           CL03T12C61]
          Length = 436

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFS 49
           +DD GN+YV D +N  +RKI  D  VTT+AG   N  G  DG    A+F+
Sbjct: 165 VDDDGNVYVGDCVNYCVRKITPDGTVTTLAGLAGN-KGCIDGTGVQARFN 213


>gi|452910311|ref|ZP_21958992.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
 gi|452834558|gb|EME37358.1| hypothetical protein C884_01866 [Kocuria palustris PEL]
          Length = 632

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 50/103 (48%), Gaps = 15/103 (14%)

Query: 5   GNLYVADTLNLAIRKIGDA-----GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           G ++ AD+ + A+R I  A     GV T AG      GFRDG S  A+  +   V  V P
Sbjct: 373 GTIWAADSESSALRSIAVADGALSGVATAAGLGLYDFGFRDGDSTQARLQHPLGVA-VLP 431

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSI-----SPTDILM 97
             S+ V D  N A+R+     D  E Q +++      P+D+L+
Sbjct: 432 DGSIAVADTYNGAIRR----WDPVEGQLSTLERDFDEPSDLLV 470


>gi|343476735|emb|CCD12251.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 582

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)

Query: 17  IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 76
           IR +   G+ TIAG K+ V G +DGP+  A F+N   V  +     + V DR N  +R+I
Sbjct: 97  IRSVNTTGIDTIAGSKT-VRGNQDGPAAAALFNNPTSVAGIGD--DIFVADRDNKCIRRI 153

Query: 77  SLNQDDCEY-QYNSISPTDILM 97
             + +   + Q     PTDI++
Sbjct: 154 DASGNVTIFGQVTLNQPTDIIV 175


>gi|344923600|ref|ZP_08777061.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 1226

 Score = 39.7 bits (91), Expect = 2.3,   Method: Composition-based stats.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 1/76 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D  G +YVAD+ N  I+KI  AG  ++  G S V+ + +G    A+FS    V      
Sbjct: 33  VDGSGTVYVADSNNNVIKKITPAGTVSVLAG-SGVSDYAEGTGTAARFSYPEGVGVDVAG 91

Query: 61  CSLLVIDRGNAALRQI 76
            ++ V D  N  +R+I
Sbjct: 92  TTVYVADSQNGVIRKI 107



 Score = 39.3 bits (90), Expect = 2.4,   Method: Composition-based stats.
 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 10/81 (12%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD----VV 55
           +D   NLY+ DTLN  IRKI     VTT+AGG  ++       +  +  S DF+    V 
Sbjct: 145 VDSSSNLYIGDTLNHCIRKIAPGNVVTTLAGGGGSLPN-----TTSSNLSADFNGPNGVA 199

Query: 56  YVRPTCSLLVIDRGNAALRQI 76
                 ++ V D  N+ LR+I
Sbjct: 200 VDAGGTTVYVADTSNSMLRKI 220


>gi|297568314|ref|YP_003689658.1| hypothetical protein [Desulfurivibrio alkaliphilus AHT2]
 gi|296924229|gb|ADH85039.1| YD repeat protein [Desulfurivibrio alkaliphilus AHT2]
          Length = 2439

 Score = 39.7 bits (91), Expect = 2.3,   Method: Composition-based stats.
 Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 17/88 (19%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR------------DGPSEDAKF 48
            +D  G +Y+ADT N  IR +G  G+ T   G    +GFR            DGP+ +A+ 
Sbjct: 1098 IDPDGTIYIADTGNHRIRSVGTDGIITTHAG----SGFRAGGLGDSGYSGDDGPAVNARL 1153

Query: 49   SNDFDVVYVRPTCSLLVIDRGNAALRQI 76
            ++  D+  + P  S+ + D  N  +R++
Sbjct: 1154 NSPTDIA-LGPDGSIYIADSINHRIRRV 1180



 Score = 38.9 bits (89), Expect = 3.9,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 5    GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
            G++Y+ADT N  IR++G  G +TTIAG          GP+++A+       V + P  S+
Sbjct: 1226 GSIYIADTYNHRIRRVGTDGIITTIAGTGYRGFSGDGGPADEARLFGPI-AVAIGPDGSI 1284

Query: 64   LVIDRGNAALRQI 76
             + D  N  +R++
Sbjct: 1285 YIADTYNHRIRRV 1297


>gi|423072738|ref|ZP_17061487.1| fibronectin type III domain protein [Desulfitobacterium hafniense
           DP7]
 gi|361856502|gb|EHL08400.1| fibronectin type III domain protein [Desulfitobacterium hafniense
           DP7]
          Length = 1926

 Score = 39.7 bits (91), Expect = 2.4,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 26/46 (56%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 47
           D  GN+Y++D+ N  IRK+   G+ T   GK        GP+ED+K
Sbjct: 307 DGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPTPGPAEDSK 352


>gi|114568006|ref|YP_755160.1| leucine-rich repeat-containing protein [Syntrophomonas wolfei
           subsp. wolfei str. Goettingen]
 gi|114338941|gb|ABI69789.1| Leucine-rich repeat (LRR) protein-like protein [Syntrophomonas
           wolfei subsp. wolfei str. Goettingen]
          Length = 1351

 Score = 39.7 bits (91), Expect = 2.4,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D+ GN+Y+AD  N  IRK+  +G +TT+AG  S       GP+E A   N   V     +
Sbjct: 595 DNNGNMYIADMNNKRIRKVDPSGIITTVAGNGSWKYSGDGGPAEAAGLCNAVGVA-CDSS 653

Query: 61  CSLLVIDRGNAALRQISL 78
            +L + D  +  +R++ L
Sbjct: 654 GNLYIADSHSNCIRKVVL 671



 Score = 38.1 bits (87), Expect = 5.5,   Method: Composition-based stats.
 Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  GNLY+AD  N  IRK+  +G ++T+AG  +       GP+  A+  +   V      
Sbjct: 483 DGSGNLYIADRGNHRIRKVDTSGIISTVAGNGTAGYSGDGGPATSAQLKDPVGVA-CDKN 541

Query: 61  CSLLVIDRGNAALRQI 76
            +L ++D+ N  +R++
Sbjct: 542 GNLYIVDKDNNRIRKV 557


>gi|219667746|ref|YP_002458181.1| hypothetical protein Dhaf_1699 [Desulfitobacterium hafniense DCB-2]
 gi|219538006|gb|ACL19745.1| protein of unknown function DUF1533 [Desulfitobacterium hafniense
           DCB-2]
          Length = 1916

 Score = 39.7 bits (91), Expect = 2.4,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 26/46 (56%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAK 47
           D  GN+Y++D+ N  IRK+   G+ T   GK        GP+ED+K
Sbjct: 297 DGAGNMYISDSGNRRIRKVDPQGIITTIAGKETNGDPTPGPAEDSK 342


>gi|284044371|ref|YP_003394711.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
 gi|283948592|gb|ADB51336.1| NHL repeat containing protein [Conexibacter woesei DSM 14684]
          Length = 623

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 37/87 (42%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           G   +ADTLN  IR++  AG+ T   G    A   DG    A   N    V   P  S L
Sbjct: 143 GGFLIADTLNNRIRRVDAAGLITTVAGTGAAAFTGDGGPATAAALNAPQGVAALPDGSFL 202

Query: 65  VIDRGNAALRQISLNQDDCEYQYNSIS 91
           + D GN  +R++       E    +++
Sbjct: 203 IADTGNHRIRRVDAASGRIERVVGNLT 229


>gi|111219554|ref|YP_710348.1| protein serine/threonine kinase [Frankia alni ACN14a]
 gi|111147086|emb|CAJ58733.1| putative Protein serine/threonine kinase [Frankia alni ACN14a]
          Length = 765

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR-DGPSEDAKFSNDFDVVYVRP 59
           + D G+LYVAD     +R+I  +G  T+  G+    G R DG +  A   +D   + + P
Sbjct: 619 LGDDGSLYVADRGEDTVRRIDRSGRITLVAGRPGTYGNRGDGRAATAALLDDPTGLALGP 678

Query: 60  TCSLLVIDRGNAALRQI 76
             SL + D GN  +R++
Sbjct: 679 DGSLYIADAGNDVIRRV 695


>gi|342182680|emb|CCC92159.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 565

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 4/82 (4%)

Query: 17  IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 76
           IR +   G+ TIAG K+ V G +DGP+  A F+N   V  +     + V DR N  +R I
Sbjct: 97  IRSVSTTGIDTIAGNKT-VRGNQDGPAAAALFNNPTSVAGIGD--DIFVADRDNNCIRHI 153

Query: 77  SLNQDDCEYQYNSIS-PTDILM 97
             + +    +   ++ PTDI++
Sbjct: 154 DASGNVTTLELIDLNQPTDIIV 175


>gi|111219921|ref|YP_710715.1| ascorbate-dependent monooxygenase [Frankia alni ACN14a]
 gi|111147453|emb|CAJ59103.1| Putative ascorbate-dependent monooxygenase [Frankia alni ACN14a]
          Length = 865

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  G++Y+ADT N  IRK+   GV TT+AG +        GP+  A    D   + V   
Sbjct: 719 DAAGDVYIADTDNSRIRKVDTHGVITTVAGNRLRGFAGDGGPAVKASL-QDPRGIAVDAV 777

Query: 61  CSLLVIDRGNAALRQI 76
            +L + DRGN+ +R++
Sbjct: 778 GNLYITDRGNSRIRKV 793


>gi|290992574|ref|XP_002678909.1| predicted protein [Naegleria gruberi]
 gi|284092523|gb|EFC46165.1| predicted protein [Naegleria gruberi]
          Length = 1626

 Score = 39.3 bits (90), Expect = 2.7,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 2/80 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D    +Y AD+ N  IRKI  D  V TIAG K+      + P   A+ +N +D V +  
Sbjct: 132 VDSNDEVYFADSKNHRIRKILKDGTVITIAGTKTGGYNGDNIPQTTAQLNNPYD-VKIGS 190

Query: 60  TCSLLVIDRGNAALRQISLN 79
             ++ + D GN  +R+I+ N
Sbjct: 191 DGAIYIADYGNNRIRKIANN 210


>gi|158319005|ref|YP_001511513.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
 gi|158114410|gb|ABW16607.1| serine/threonine protein kinase [Frankia sp. EAN1pec]
          Length = 807

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 3/79 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVR 58
           + D G LYVAD+    +R+I   G  T+  GK   +G+   DGP+ DA  ++  D+  + 
Sbjct: 659 LADDGTLYVADSSAHTVRRITKDGKITLVAGKPEQSGYEGDDGPAVDALLASPQDLA-LG 717

Query: 59  PTCSLLVIDRGNAALRQIS 77
           P   L + D  N  +R ++
Sbjct: 718 PAGELYIADTYNNVIRMVT 736


>gi|225450563|ref|XP_002277564.1| PREDICTED: NHL repeat-containing protein 2-like [Vitis vinifera]
          Length = 1078

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 5   GNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +YVAD+ N  I+K+  A   V+T+AG  +  AGF+DG +  A+ S    +V V     
Sbjct: 838 GQIYVADSYNHKIKKLDPATGRVSTLAG--TGKAGFKDGRALAAQLSEPSGIVEVE-NGV 894

Query: 63  LLVIDRGNAALRQISLNQDDCE 84
           L + D  N+ +R + L + + +
Sbjct: 895 LFIADTNNSVIRYLDLKKKEAD 916


>gi|256380948|ref|YP_003104608.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Actinosynnema mirum DSM 43827]
 gi|255925251|gb|ACU40762.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Actinosynnema mirum DSM 43827]
          Length = 612

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 40/68 (58%), Gaps = 3/68 (4%)

Query: 9   VADTLNLAIRKIG-DAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 65
           VADT+N  +R +  D G VTT+AG G+    G  DGP+++   ++ +DVV+  P  ++ +
Sbjct: 257 VADTVNHLLRGLNLDTGEVTTVAGTGEQWRDGATDGPADEIPLTSPWDVVWWEPANAVAI 316

Query: 66  IDRGNAAL 73
              GN  L
Sbjct: 317 ALAGNHTL 324


>gi|344924044|ref|ZP_08777505.1| NHL repeat containing protein [Candidatus Odyssella
           thessalonicensis L13]
          Length = 383

 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 9/68 (13%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFS-------NDFDV 54
           D+ GN+YV+DT N  +RKI   G T++  G +  +G+++G    A+ S       N    
Sbjct: 121 DNSGNVYVSDTTNNRVRKITSGGTTSLLAGST--SGYQEGTGAGARLSSPRGLAVNSAGT 178

Query: 55  VYVRPTCS 62
           VYV    S
Sbjct: 179 VYVATATS 186



 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDV-VYVRP 59
           +D  G LYVAD  N  IRKI  +G T++  G S   G+ +G    A+F N F   +    
Sbjct: 65  IDSSGTLYVADNGNYRIRKITSSGTTSLLAG-SGTTGYAEGTGASAQF-NTFQWGIAADN 122

Query: 60  TCSLLVIDRGNAALRQIS 77
           + ++ V D  N  +R+I+
Sbjct: 123 SGNVYVSDTTNNRVRKIT 140


>gi|299473428|emb|CBN77825.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 2120

 Score = 38.9 bits (89), Expect = 3.4,   Method: Composition-based stats.
 Identities = 29/80 (36%), Positives = 38/80 (47%), Gaps = 4/80 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +D   N+YVADT N  IR+I   GV T   G     G  DG + +A FS    +     +
Sbjct: 230 VDAHANVYVADTGNHRIRRISPEGVVTTVAGDGE-EGSDDGDAMEASFSFPGGIALYYDS 288

Query: 61  CSLLVI---DRGNAALRQIS 77
              LV+   D  N  LR+IS
Sbjct: 289 SEGLVLYVADTNNHRLRKIS 308


>gi|374311754|ref|YP_005058184.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
 gi|358753764|gb|AEU37154.1| NHL repeat containing protein [Granulicella mallensis MP5ACTX8]
          Length = 1456

 Score = 38.9 bits (89), Expect = 3.4,   Method: Composition-based stats.
 Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 3/79 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D  GN+Y+ DT+N  +RK+      +TT+AG  S  +    GP+ +A       +  + 
Sbjct: 150 LDAAGNMYITDTINSRVRKVEATTGIITTVAGNGSVTSSGDGGPATNAGLVQPAGIA-LD 208

Query: 59  PTCSLLVIDRGNAALRQIS 77
              ++ V D G   +R+IS
Sbjct: 209 SANNIFVTDSGEPGVREIS 227


>gi|222056335|ref|YP_002538697.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
 gi|221565624|gb|ACM21596.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
          Length = 372

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 43/94 (45%), Gaps = 8/94 (8%)

Query: 3   DKGNLYVADTLNLAIRK--IGDAGVTTIAG----GKSNVAGFRDGPSEDAKFSNDFDVVY 56
           D  NLYVADT N  IRK  I    VTT+AG    G +  +G  DG    AKF+  F +  
Sbjct: 81  DGTNLYVADTGNNLIRKVVITTGAVTTLAGTVGTGTAQTSGSTDGTGSAAKFNAPFAI-- 138

Query: 57  VRPTCSLLVIDRGNAALRQISLNQDDCEYQYNSI 90
                +L V D  N  +R++ +          S+
Sbjct: 139 TTDGTNLYVADTNNNTIRKVVIATGTVTTLAGSV 172



 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 45/94 (47%), Gaps = 3/94 (3%)

Query: 3   DKGNLYVADTLNLAIRKIGDA-GVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTC 61
           D  +L+VADT N  IRKI  A G+ T   G + V+G  DG    AKF+  F +       
Sbjct: 251 DGTSLFVADTDNSTIRKIVIATGMVTTLAGSAGVSGIADGTGSTAKFNAPFGI--TTDGT 308

Query: 62  SLLVIDRGNAALRQISLNQDDCEYQYNSISPTDI 95
           +L V D    ++R++++         N  S T I
Sbjct: 309 NLYVTDSRQGSIRKVAIATGVVTTLVNGSSLTTI 342



 Score = 37.4 bits (85), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 38/78 (48%), Gaps = 5/78 (6%)

Query: 3   DKGNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D  NLYV+DT N  IRK  I    VTT+A G +   G  DG    A F   F +      
Sbjct: 196 DGTNLYVSDTGNRTIRKVVIATGAVTTLA-GSAGTPGSTDGVGPSALFGTVFGI--TTDG 252

Query: 61  CSLLVIDRGNAALRQISL 78
            SL V D  N+ +R+I +
Sbjct: 253 TSLFVADTDNSTIRKIVI 270


>gi|158520859|ref|YP_001528729.1| YD repeat-containing protein [Desulfococcus oleovorans Hxd3]
 gi|158509685|gb|ABW66652.1| YD repeat protein [Desulfococcus oleovorans Hxd3]
          Length = 2831

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVR 58
            +D  GN+Y+ADT N  +R+I   G+     G    AG+    G + DA+  +   +  V 
Sbjct: 1610 IDASGNIYIADTDNHCVRRISPDGIIEAFAGMGVDAGYSGDGGLAVDARLQSPTGLA-VD 1668

Query: 59   PTCSLLVIDRGNAALRQI 76
             T +L + D GN ++R++
Sbjct: 1669 KTGNLFIADSGNFSIRKV 1686


>gi|114566902|ref|YP_754056.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
 gi|114337837|gb|ABI68685.1| hypothetical protein Swol_1378 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
          Length = 1030

 Score = 38.9 bits (89), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRP 59
            D  GN+Y+AD+ N  IRK+  +G+ +   G      F D GP+  A+  N   V  +  
Sbjct: 752 FDSSGNMYIADSNNNCIRKVDHSGMISTFAGNGTSGHFGDGGPATSAQLRNPVGVA-LDN 810

Query: 60  TCSLLVIDRGNAALRQISLNQDDCEYQYNSISPT 93
           + +L + D  + ++R++ L       Q ++ISPT
Sbjct: 811 SGNLFIADYFDHSIRKVVL----AAQQNSTISPT 840


>gi|153808835|ref|ZP_01961503.1| hypothetical protein BACCAC_03135 [Bacteroides caccae ATCC 43185]
 gi|149128661|gb|EDM19879.1| NHL repeat protein [Bacteroides caccae ATCC 43185]
          Length = 426

 Score = 38.9 bits (89), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  G +Y+AD+ N  IR I   G  +   G    AG+ DG  +DA F   + V  V   
Sbjct: 349 FDLDGIMYIADSSNHCIRSIDREGAVSTVIGVPGRAGYVDGTPDDALFDEPWGVA-VDEE 407

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ + D  N  +R++++
Sbjct: 408 GTIYIADTKNKCIRKLAI 425


>gi|290997788|ref|XP_002681463.1| predicted protein [Naegleria gruberi]
 gi|284095087|gb|EFC48719.1| predicted protein [Naegleria gruberi]
          Length = 2297

 Score = 38.9 bits (89), Expect = 3.8,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRPTCSL 63
           G+LY+AD     IRK+ +  + TIAG  S+   F D G +  AK +    + +     +L
Sbjct: 81  GDLYIADKTYHRIRKVSNGIIKTIAGTGSSGGYFGDNGVATSAKLNKPQALAFYSGDGTL 140

Query: 64  LVIDRGNAALRQISLNQ 80
            + D  N  +R+IS NQ
Sbjct: 141 YLGDSLNYRIRKISTNQ 157


>gi|356571070|ref|XP_003553704.1| PREDICTED: NHL repeat-containing protein 2-like [Glycine max]
          Length = 1098

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 3   DKGNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           + G +Y+AD+ N  I+K+      V+TIAG  +  AGF+DG +  A+ S    +V     
Sbjct: 855 NDGEIYIADSYNHKIKKLDPTSKRVSTIAG--TGKAGFKDGTAVKAQLSEPSGIVEGNKG 912

Query: 61  CSLLVIDRGNAALRQISLNQDDCEYQ 86
             L + D  N+ +R + LN ++ E +
Sbjct: 913 -RLFIADTNNSLIRYLDLNINETELR 937


>gi|147818279|emb|CAN64724.1| hypothetical protein VITISV_026725 [Vitis vinifera]
          Length = 423

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 5   GNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +YVAD+ N  I+K+  A   V+T+AG  +  AGF+DG +  A+ S    +V V     
Sbjct: 183 GQIYVADSYNHKIKKLDPATGRVSTLAG--TGKAGFKDGRALAAQLSEPSGIVEVE-NGV 239

Query: 63  LLVIDRGNAALRQISLNQDDCE 84
           L + D  N+ +R + L + + +
Sbjct: 240 LFIADTNNSVIRYLDLKKKEAD 261


>gi|423220897|ref|ZP_17207391.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
           CL03T12C61]
 gi|392622375|gb|EIY16503.1| hypothetical protein HMPREF1061_04164 [Bacteroides caccae
           CL03T12C61]
          Length = 426

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
            D  G +Y+AD+ N  IR I   G  +   G    AG+ DG  +DA F   + V  V   
Sbjct: 349 FDLDGIMYIADSSNHCIRSIDREGAVSTVIGVPGRAGYVDGTPDDALFDEPWGVA-VDEE 407

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ + D  N  +R++++
Sbjct: 408 GTIYIADTKNKCIRKLAI 425


>gi|296089788|emb|CBI39607.3| unnamed protein product [Vitis vinifera]
          Length = 1002

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 5   GNLYVADTLNLAIRKIGDAG--VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +YVAD+ N  I+K+  A   V+T+AG  +  AGF+DG +  A+ S    +V V     
Sbjct: 762 GQIYVADSYNHKIKKLDPATGRVSTLAG--TGKAGFKDGRALAAQLSEPSGIVEVENGV- 818

Query: 63  LLVIDRGNAALRQISLNQDDCE 84
           L + D  N+ +R + L + + +
Sbjct: 819 LFIADTNNSVIRYLDLKKKEAD 840


>gi|302821385|ref|XP_002992355.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
 gi|300139771|gb|EFJ06505.1| hypothetical protein SELMODRAFT_6804 [Selaginella moellendorffii]
          Length = 78

 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 26/41 (63%), Gaps = 2/41 (4%)

Query: 3  DKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGP 42
          D G ++V DT NLAIRKI   G VTTIAGG S   G  D P
Sbjct: 39 DNGAVFV-DTTNLAIRKISKNGEVTTIAGGSSRRPGIADSP 78


>gi|116620348|ref|YP_822504.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223510|gb|ABJ82219.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 915

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 44/77 (57%), Gaps = 2/77 (2%)

Query: 2   DDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           D KGNLY+ADT N +IRK+  D  ++T+ G  +  A   +G +  A+ ++   +  V   
Sbjct: 548 DGKGNLYIADTGNHSIRKVTADGVISTVVGNGTVGASGDEGAAASARLASPRGLT-VDDN 606

Query: 61  CSLLVIDRGNAALRQIS 77
            +L + D GN  +RQ++
Sbjct: 607 GNLYIGDTGNNRVRQVT 623


>gi|290971380|ref|XP_002668485.1| predicted protein [Naegleria gruberi]
 gi|284081915|gb|EFC35741.1| predicted protein [Naegleria gruberi]
          Length = 277

 Score = 38.5 bits (88), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSN 34
           +D KG++Y ADT N  IRK+ +  +TTIAG K N
Sbjct: 197 VDSKGDVYFADTYNGLIRKVSNGYITTIAGSKQN 230


>gi|196229731|ref|ZP_03128595.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
 gi|196226057|gb|EDY20563.1| NHL repeat containing protein [Chthoniobacter flavus Ellin428]
          Length = 360

 Score = 38.5 bits (88), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 7/82 (8%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG---VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVY 56
            D  GN Y+AD  N AIR++ DA    +TT AG GK   +G   GP+  A+      + +
Sbjct: 111 FDRAGNYYIADMANHAIRRV-DAKTKIITTFAGTGKPGYSG-DGGPAAQAQLKQPHSIQF 168

Query: 57  VRPTCSLLVIDRGNAALRQISL 78
             P  SL + D GN  +R++ +
Sbjct: 169 -GPDGSLYICDVGNNCIRKVDM 189


>gi|320103301|ref|YP_004178892.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
 gi|319750583|gb|ADV62343.1| NHL repeat containing protein [Isosphaera pallida ATCC 43644]
          Length = 676

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 3/81 (3%)

Query: 4   KGNLYVADTLNLAIRKI--GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           +  LYVADT N AIR I      VTT+AG G+    G   GP+     ++ +D+V +  T
Sbjct: 269 RNRLYVADTENHAIRAIDLTTRSVTTVAGTGEMVYPGLPGGPARRFGLNSPWDLVQIPET 328

Query: 61  CSLLVIDRGNAALRQISLNQD 81
              LV   G   + +I  +Q+
Sbjct: 329 NQFLVAMAGTHQIYKIDFDQN 349


>gi|398335921|ref|ZP_10520626.1| hypothetical protein LkmesMB_09964 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 947

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 28/52 (53%), Positives = 30/52 (57%), Gaps = 4/52 (7%)

Query: 7   LYVADTLNLAIR--KIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVY 56
           LYVADT N  IR  KI D  VTTIAG  S  AG  DG    AKF+    +VY
Sbjct: 749 LYVADTGNHKIRKIKISDNTVTTIAG--SGTAGTLDGLGTAAKFNQPSHLVY 798


>gi|290975356|ref|XP_002670409.1| predicted protein [Naegleria gruberi]
 gi|284083967|gb|EFC37665.1| predicted protein [Naegleria gruberi]
          Length = 2200

 Score = 38.5 bits (88), Expect = 5.0,   Method: Composition-based stats.
 Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 4/81 (4%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGV-TTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +   G +Y+ADTLN  IRKI   GV +TIAG G+++  G  DG        N  + V++ 
Sbjct: 186 ISQNGEIYIADTLNHRIRKINSYGVISTIAGTGRASFGG--DGGFAVLSPLNSPNGVHIS 243

Query: 59  PTCSLLVIDRGNAALRQISLN 79
               + + D  N  +R+I +N
Sbjct: 244 QNGEIYIADTLNHRIRKIFVN 264


>gi|357400623|ref|YP_004912548.1| hypothetical protein SCAT_3039 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|386356676|ref|YP_006054922.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
 gi|337767032|emb|CCB75743.1| conserved protein of unknown function [Streptomyces cattleya NRRL
           8057 = DSM 46488]
 gi|365807185|gb|AEW95401.1| hypothetical protein SCATT_30300 [Streptomyces cattleya NRRL 8057 =
           DSM 46488]
          Length = 613

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 39/74 (52%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLL 64
           GNL V+D+   A+ ++   G T +    S   G  DGP++ A+FS    +  +    S++
Sbjct: 193 GNLLVSDSTRHALVELAPDGETVVRRIGSGERGLADGPADRARFSEPQGLALLPDGESVI 252

Query: 65  VIDRGNAALRQISL 78
           V D  N ALR++ L
Sbjct: 253 VADTVNHALRRVDL 266


>gi|29349488|ref|NP_812991.1| hypothetical protein BT_4080 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|298384984|ref|ZP_06994543.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|383120427|ref|ZP_09941155.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
 gi|29341397|gb|AAO79185.1| putative cell surface protein [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251840523|gb|EES68605.1| hypothetical protein BSIG_2566 [Bacteroides sp. 1_1_6]
 gi|298262128|gb|EFI04993.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 456

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 18/87 (20%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF--------SND 51
           +D +GN+ + DT N  IRKI    G  +   GK   +G+ +G +EDA+F         ND
Sbjct: 378 VDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNSGYVNGSAEDAQFKKPLGICIDND 437

Query: 52  FDVVYVRPTCSLLVIDRGNAALRQISL 78
            DV+Y+         D  N A+R++++
Sbjct: 438 -DVMYIG--------DSENRAIRRLAV 455


>gi|290974673|ref|XP_002670069.1| predicted protein [Naegleria gruberi]
 gi|284083624|gb|EFC37325.1| predicted protein [Naegleria gruberi]
          Length = 821

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 4   KGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
            G+LY+AD+ N  +R+I  +G +TTIAG  ++      G ++ AK  N  +V+ V  T  
Sbjct: 229 NGDLYIADSDNNVVRRINSSGIITTIAGDTTSGYSGDGGDAKLAKMKNPINVI-VSKTNE 287

Query: 63  LLVIDRGNAALRQIS 77
           + + D  N  +R++S
Sbjct: 288 IFISDADNNVIRKVS 302


>gi|222054715|ref|YP_002537077.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
 gi|221564004|gb|ACM19976.1| NHL repeat containing protein [Geobacter daltonii FRC-32]
          Length = 2393

 Score = 38.5 bits (88), Expect = 5.3,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 41/79 (51%), Gaps = 5/79 (6%)

Query: 1    MDDKGNLYVADTLNLAIRKIGDAG--VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYV 57
            +D  GN+Y++D LN  +RKI  A   +TT+AG G S   G  DG    A    D   + V
Sbjct: 1651 LDTSGNIYISDNLNYRVRKIDLASFIITTVAGNGTSGATG--DGGLATAASLGDIRGLAV 1708

Query: 58   RPTCSLLVIDRGNAALRQI 76
                ++ + D G+  +R++
Sbjct: 1709 DTAGNIYIADSGSNGIRKV 1727


>gi|290976760|ref|XP_002671107.1| predicted protein [Naegleria gruberi]
 gi|284084673|gb|EFC38363.1| predicted protein [Naegleria gruberi]
          Length = 1349

 Score = 38.1 bits (87), Expect = 5.4,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 5/77 (6%)

Query: 5   GNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVRPTCS 62
           G+LY+A   N  I K  ++ ++T+AG  +  AG+    GP+ +AK +   DV  V  T  
Sbjct: 536 GDLYIASQGNHKIMKYSNSQLSTVAG--NGYAGYSGDGGPARNAKLNYPADVT-VSSTGE 592

Query: 63  LLVIDRGNAALRQISLN 79
           + + D GN A+R++  N
Sbjct: 593 VYIADWGNRAVRKVFRN 609


>gi|116751477|ref|YP_848164.1| NHL repeat-containing protein [Syntrophobacter fumaroxidans MPOB]
 gi|116700541|gb|ABK19729.1| NHL repeat containing protein [Syntrophobacter fumaroxidans MPOB]
          Length = 1763

 Score = 38.1 bits (87), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 49/95 (51%), Gaps = 19/95 (20%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGG-------KSNVAGFRD----------GP 42
           +D + NLY++DT +  IRK+G D  +TT+AG        +    G+++          GP
Sbjct: 536 VDSRENLYISDTESHRIRKVGPDGVITTVAGSGLCHLKVEDGYGGYQEYDAPCFDGDQGP 595

Query: 43  SEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQIS 77
           ++ AK +N   V     + +L + D GN  LR+++
Sbjct: 596 ADQAKLNNPSGVA-TDASGNLYIADTGNRRLRKVA 629


>gi|290990564|ref|XP_002677906.1| NHL domain-containing protein kinase [Naegleria gruberi]
 gi|284091516|gb|EFC45162.1| NHL domain-containing protein kinase [Naegleria gruberi]
          Length = 2733

 Score = 38.1 bits (87), Expect = 5.6,   Method: Composition-based stats.
 Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 5   GNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           G +Y+ADT N  IRK+   G ++TIAG  +       G +  AK S+   V  V  T  +
Sbjct: 800 GEIYIADTENHRIRKVSTLGKISTIAGTGAMGYSGDGGLAITAKLSSPVGVA-VSSTGEV 858

Query: 64  LVIDRGNAALRQISLN 79
            + DR N  +R+++L+
Sbjct: 859 FIADRDNHRIRKVTLS 874


>gi|451982091|ref|ZP_21930422.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
 gi|451760645|emb|CCQ91702.1| hypothetical protein NITGR_780006 [Nitrospina gracilis 3/211]
          Length = 724

 Score = 38.1 bits (87), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           DD+GNLY+AD  N  +RK+   G +TT+ G GK+   G  +  SE       F + +  P
Sbjct: 287 DDRGNLYIADRNNNRVRKVDTNGIITTVMGTGKNEYNGDNEIASE-TSLHLPFALAFT-P 344

Query: 60  TCSLLVIDRGNAALRQISLNQ 80
              LL++DR +  +R + L Q
Sbjct: 345 DQHLLIVDRNHHRVRSMHLVQ 365



 Score = 37.4 bits (85), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 2/80 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D+ GNLY+AD  N  +RK+  D  +TT+AG  ++      GP+     +   DVV    
Sbjct: 230 VDESGNLYIADRGNNRVRKVTPDGIITTVAGEGTHFFSGDFGPATRCSLAYPTDVV-TDD 288

Query: 60  TCSLLVIDRGNAALRQISLN 79
             +L + DR N  +R++  N
Sbjct: 289 RGNLYIADRNNNRVRKVDTN 308


>gi|290995104|ref|XP_002680171.1| predicted protein [Naegleria gruberi]
 gi|284093791|gb|EFC47427.1| predicted protein [Naegleria gruberi]
          Length = 928

 Score = 38.1 bits (87), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 40/71 (56%), Gaps = 2/71 (2%)

Query: 7   LYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 65
           +Y+AD  N  IRKI + G + TIAG  +      +GP+ +A+ +N +  V+V     + +
Sbjct: 80  VYIADFSNHRIRKILENGKIVTIAGNGTGGFSGDNGPATNAQLNNPYS-VFVSSNNEVYI 138

Query: 66  IDRGNAALRQI 76
           +D  N  +R+I
Sbjct: 139 VDYNNHRIRKI 149


>gi|290976675|ref|XP_002671065.1| predicted protein [Naegleria gruberi]
 gi|284084630|gb|EFC38321.1| predicted protein [Naegleria gruberi]
          Length = 343

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 45/89 (50%), Gaps = 19/89 (21%)

Query: 4   KGNLYVADTLNLAIRKIGDAGV-TTIAG-------GKSNVA--GFRDGPSEDAKFSNDFD 53
            GN+Y +D  N  IRKI + G+ +TI G       G  N+A   F + P      SND +
Sbjct: 80  NGNVYFSDLGNNRIRKIDNNGIISTIVGCGDYGLVGDGNLAINSFLNSP-RGICLSNDGN 138

Query: 54  VVYVRPTCSLLVIDRGNAALRQISLNQDD 82
            +Y+         DR N A+R++SLN DD
Sbjct: 139 YLYIA--------DRDNHAIRKVSLNDDD 159


>gi|116619867|ref|YP_822023.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223029|gb|ABJ81738.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 592

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGV-TTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           D  GNLY+A+ LN  +RK+   GV TT+AG G +  AG   GP+  A+ +    +  +  
Sbjct: 52  DANGNLYLAEELNNRVRKVDTGGVITTLAGIGTAGFAG-DGGPAAQAQLNGPLGLC-IDG 109

Query: 60  TCSLLVIDRGNAALRQIS 77
           + ++ V D+GN  +R+I+
Sbjct: 110 SGNIYVSDQGNKRVRKIA 127


>gi|392945711|ref|ZP_10311353.1| serine/threonine protein kinase [Frankia sp. QA3]
 gi|392289005|gb|EIV95029.1| serine/threonine protein kinase [Frankia sp. QA3]
          Length = 859

 Score = 38.1 bits (87), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 66/162 (40%), Gaps = 32/162 (19%)

Query: 2   DDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYVR 58
           D  GN+Y+ D  N  IRKI  AGV +TIAG  +  AG+    GP+  A+  N+ + V   
Sbjct: 604 DKAGNIYIGDAQNNRIRKISPAGVISTIAG--TGTAGYSGDGGPATAAQL-NNAEQVTTG 660

Query: 59  PTCSLLVIDRGNAALRQISLNQDDCEYQYNSISPTDILMV-VGAVLVGYVTCMLQQGFGP 117
           P  S+   D  N  +R+              I P  I+   VG  + GY         GP
Sbjct: 661 PDGSVYFSDYENHRIRK--------------IDPAGIITTYVGTGVAGYTGA-----GGP 701

Query: 118 FFFSRTQQPSESEFNEETEI------KELSNKEKPIPIVESM 153
              +R   P E    ++  +       E   K  P  I+ S+
Sbjct: 702 ATQARIDGPHEITMTDDGTLYFADLRSETIQKVTPDGIISSV 743


>gi|290982388|ref|XP_002673912.1| predicted protein [Naegleria gruberi]
 gi|284087499|gb|EFC41168.1| predicted protein [Naegleria gruberi]
          Length = 2313

 Score = 38.1 bits (87), Expect = 6.6,   Method: Composition-based stats.
 Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 5/75 (6%)

Query: 5   GNLYVADTLNLAIRK--IGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G +Y+AD  N  I K  IG   +TT+AGG  ++ GF DG   +A   N  D + +     
Sbjct: 475 GIIYIADYYNHRIAKFVIGGTSLTTLAGG--SLKGFADGVGSNANL-NYPDSISIGLNNM 531

Query: 63  LLVIDRGNAALRQIS 77
           L   DR N A+R +S
Sbjct: 532 LYFSDRDNHAIRSVS 546


>gi|116620673|ref|YP_822829.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223835|gb|ABJ82544.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 912

 Score = 38.1 bits (87), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 63/140 (45%), Gaps = 13/140 (9%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D  GN+YVA+T +  +R++   G +TTIAG G+    G  DG    A   N    + V 
Sbjct: 617 VDRAGNIYVAETGHNRVRRVSTGGTITTIAGNGQCCYTG--DGGLGTAAQLNQPWGIAVD 674

Query: 59  PTCSLLVIDRGNAALR---------QISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTC 109
              ++ V D GN A+R         Q+    +        ++P +++++ GA L G  + 
Sbjct: 675 SAGNIYVADSGNNAIRLLAPVSANIQVGAVVNAASNLPGPVAPGELVVLYGAGLAGVQSV 734

Query: 110 MLQQGFGPFFFSRTQQPSES 129
           +     GP  ++   Q   +
Sbjct: 735 LFNGVAGPLVYTSAGQTGTA 754



 Score = 38.1 bits (87), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVVYVR 58
           +D  G LY+ADT N  IR++  D  +TT AG G + V G  +GP   A  S   DV   R
Sbjct: 217 VDANGVLYIADTFNGRIRRVAADGTITTAAGVGSTGVFGGDNGPPASAALSLPTDVAVDR 276

Query: 59  PTCSLLVIDRGNAALRQIS 77
            + +  + D GN+ +R ++
Sbjct: 277 -SGNPYIADFGNSRVRMVA 294


>gi|380693812|ref|ZP_09858671.1| hypothetical protein BfaeM_07498 [Bacteroides faecis MAJ27]
          Length = 456

 Score = 38.1 bits (87), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 18/87 (20%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKF--------SND 51
           +D +GN+ + DT N  IRKI    G  +   GK   +G+ +G +EDA+F         ND
Sbjct: 378 VDSEGNIILTDTENHCIRKITMSTGYVSTLAGKPQNSGYVNGSAEDAQFKKPLGICIDND 437

Query: 52  FDVVYVRPTCSLLVIDRGNAALRQISL 78
            DV+Y+         D  N A+R++++
Sbjct: 438 -DVMYIG--------DSENRAIRRLAV 455


>gi|254390149|ref|ZP_05005369.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
           27064]
 gi|197703856|gb|EDY49668.1| NHL repeat-containing protein [Streptomyces clavuligerus ATCC
           27064]
          Length = 527

 Score = 38.1 bits (87), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +Y+ADT N  IRKI  D  ++T+AG  +       GP+  A+ +    VV +  
Sbjct: 236 VDGAGAVYIADTDNHRIRKITADGTISTVAGTGTGGFDGDGGPATAARLNRPMGVV-IDS 294

Query: 60  TCSLLVIDRGNAALRQIS 77
           T +L V D GN  +R+I+
Sbjct: 295 TGTLYVADYGNHRIRKIT 312


>gi|288923318|ref|ZP_06417452.1| serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288345334|gb|EFC79729.1| serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 849

 Score = 37.7 bits (86), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 2/80 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRD-GPSEDAKFSNDFDVVYVRP 59
           +D  G +++ADT N  IR++G  G+ T   G+     F D GP+  A  +    V   R 
Sbjct: 659 VDGNGTVFIADTANNRIRRVGSDGIITTLAGQDGEGSFGDGGPASKALLAFPLAVALDR- 717

Query: 60  TCSLLVIDRGNAALRQISLN 79
              L + D  N  +R+I L+
Sbjct: 718 FGRLYIADTSNNRIRRIGLD 737


>gi|290975266|ref|XP_002670364.1| predicted protein [Naegleria gruberi]
 gi|284083922|gb|EFC37620.1| predicted protein [Naegleria gruberi]
          Length = 1584

 Score = 37.7 bits (86), Expect = 7.3,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +   G +Y+ADTLN  IRKI   GV +T+AG      G   G + +++ ++ +  V+V  
Sbjct: 242 ISQNGEIYIADTLNHRIRKINSYGVISTVAGTGRASFGGDGGLAINSQLNSPYG-VHVSQ 300

Query: 60  TCSLLVIDRGNAALRQISLN 79
              + + D  N  +R+I +N
Sbjct: 301 NGEIYIADTLNHRIRKIFVN 320



 Score = 37.7 bits (86), Expect = 7.3,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +   G +Y+ADTLN  IRKI   GV +T+AG      G   G + +++ ++ +  V+V  
Sbjct: 637 ISQNGEIYIADTLNHRIRKINSYGVISTVAGTGRASFGGDGGLAINSQLNSPYG-VHVSQ 695

Query: 60  TCSLLVIDRGNAALRQISLN 79
              + + D  N  +R+I +N
Sbjct: 696 NGEIYIADTLNHRIRKIFVN 715


>gi|153867703|ref|ZP_01997987.1| NHL repeat protein [Beggiatoa sp. SS]
 gi|152144978|gb|EDN72014.1| NHL repeat protein [Beggiatoa sp. SS]
          Length = 263

 Score = 37.7 bits (86), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSE-DAKFSNDFDVVYVRP 59
           +D  GNLY+ADT N  IRKI  +G  +   G  N     DG +   AK +N    V +  
Sbjct: 38  IDSAGNLYIADTENHRIRKIDSSGNISTVAGDGNRGYSGDGAAAVSAKLNNPM-WVSLDS 96

Query: 60  TCSLLVIDRGNAALRQISL 78
             +L + D GN  +R++ +
Sbjct: 97  AGNLYIADTGNNVVRKLDI 115


>gi|145592666|ref|YP_001156963.1| alkyl hydroperoxide reductase [Salinispora tropica CNB-440]
 gi|145302003|gb|ABP52585.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Salinispora tropica CNB-440]
          Length = 612

 Score = 37.7 bits (86), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 3/104 (2%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           D   L+VAD+   A+R + +  +TT  G      G  DGP+  A   +   V  + P  S
Sbjct: 360 DGSRLWVADSETSAVRYVENGILTTAVGQGLFEFGHVDGPAAQALLQHPLGVCAL-PDGS 418

Query: 63  LLVIDRGNAALRQISLNQDDCEYQYNSIS-PTDILMVV-GAVLV 104
           +L+ D  N A+R+     D      + ++ P+D+L+   G VLV
Sbjct: 419 VLIADTYNGAVRRYDPESDSVGTVADGLAEPSDLLLTPDGGVLV 462


>gi|290972390|ref|XP_002668936.1| predicted protein [Naegleria gruberi]
 gi|284082474|gb|EFC36192.1| predicted protein [Naegleria gruberi]
          Length = 727

 Score = 37.7 bits (86), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 6/76 (7%)

Query: 4   KGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDGPSEDAKFSNDFD--VVYVRPT 60
           K  +Y+AD+ N +IRKI + G + TIAG         +GP+ +A     FD   V+V P 
Sbjct: 131 KNEVYIADSGNHSIRKILENGNIVTIAGNGQQGYNGDNGPAVNAPL---FDPSSVFVTPN 187

Query: 61  CSLLVIDRGNAALRQI 76
             + + D GN  +R+I
Sbjct: 188 DEVYIADTGNHRIRKI 203


>gi|223993873|ref|XP_002286620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977935|gb|EED96261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1937

 Score = 37.7 bits (86), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 26/101 (25%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAGGKSNVAGFRDG-PSEDAKFSNDFDVVYVR 58
           +D  G +YVADT N AIR I  +G V+T+AG  + V G++DG  +  A+ S+  D+   R
Sbjct: 258 VDHDGYVYVADTGNHAIRMISPSGRVSTLAG--NGVPGYKDGLANSGAQLSSPTDIAVWR 315

Query: 59  P----------------------TCSLLVIDRGNAALRQIS 77
                                  T +LLV D  N  +R+I+
Sbjct: 316 DWAWWPYENPVDPDSFIYKNGNGTLALLVADTDNHRVRKIT 356


>gi|160942444|ref|ZP_02089750.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
           BAA-613]
 gi|158434643|gb|EDP12410.1| hypothetical protein CLOBOL_07327 [Clostridium bolteae ATCC
           BAA-613]
          Length = 329

 Score = 37.7 bits (86), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 36/73 (49%), Gaps = 3/73 (4%)

Query: 4   KGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSL 63
           +G LYVADT N  I K  D  V   AG   +  GF DGP   A FS    +       +L
Sbjct: 212 EGALYVADTGNNRIVKAMDGAVVWSAGTGED--GFADGPVSQAMFSGPQRITAAEDG-AL 268

Query: 64  LVIDRGNAALRQI 76
            V D GN+ +R+I
Sbjct: 269 YVSDTGNSVVRKI 281


>gi|153868997|ref|ZP_01998705.1| protein kinase-like protein [Beggiatoa sp. PS]
 gi|152074439|gb|EDN71293.1| protein kinase-like protein [Beggiatoa sp. PS]
          Length = 1432

 Score = 37.7 bits (86), Expect = 7.8,   Method: Composition-based stats.
 Identities = 41/148 (27%), Positives = 62/148 (41%), Gaps = 31/148 (20%)

Query: 1   MDDKGNLYVADTLNLAIRKIGDAG-VTTIAG-GKSNVAGFRDGPSEDAKFSNDFDVV--Y 56
           +D +GNLY+ADT N  IRKI   G +TT+ G G++  AG            N+F +    
Sbjct: 475 IDAQGNLYIADTNNHRIRKIDSDGIITTVVGTGEAGYAG-----------DNEFAIAAQL 523

Query: 57  VRPTCSLL-------VIDRGNAALRQISLNQDDCEYQYNSISPTDILMVVGAVLVGYVTC 109
            +PT  +        + D GN ++R+I+          NS+    I  + G    GY   
Sbjct: 524 KKPTAIVFDHNGHFYIADSGNNSIRKINYQPGTSPLNANSL----ITTIAGDGRSGY--- 576

Query: 110 MLQQGFGPFFFSRTQQPSESEFNEETEI 137
                 GP   +R   PS    + E  +
Sbjct: 577 --SGDNGPAIQARLGNPSSLVVDNENNL 602


>gi|291000698|ref|XP_002682916.1| dual specificity protein phosphatase [Naegleria gruberi]
 gi|284096544|gb|EFC50172.1| dual specificity protein phosphatase [Naegleria gruberi]
          Length = 1502

 Score = 37.7 bits (86), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 6   NLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLV 65
           N++V+DT N  + +I + G  T A G    AGF DG  +++  ++  D+     T  L +
Sbjct: 454 NVFVSDTGNHRVVRINN-GFVTNAAGNPRTAGFMDGKVQNSILNSPTDLFSHENT--LYI 510

Query: 66  IDRGNAALRQISLN 79
            D GN+ +RQ+  N
Sbjct: 511 NDFGNSRIRQVFAN 524


>gi|326445969|ref|ZP_08220703.1| hypothetical protein SclaA2_33112 [Streptomyces clavuligerus ATCC
           27064]
          Length = 508

 Score = 37.7 bits (86), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +Y+ADT N  IRKI  D  ++T+AG  +       GP+  A+ +    VV +  
Sbjct: 217 VDGAGAVYIADTDNHRIRKITADGTISTVAGTGTGGFDGDGGPATAARLNRPMGVV-IDS 275

Query: 60  TCSLLVIDRGNAALRQIS 77
           T +L V D GN  +R+I+
Sbjct: 276 TGTLYVADYGNHRIRKIT 293


>gi|345882347|ref|ZP_08833836.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
 gi|345045612|gb|EGW49528.1| hypothetical protein HMPREF0666_00012 [Prevotella sp. C561]
          Length = 496

 Score = 37.7 bits (86), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 4/78 (5%)

Query: 5   GNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           G LY+AD+ N  IR I    G     T   G    AGF+DG  E AKF+    V      
Sbjct: 418 GKLYIADSGNQCIRMIDTTQGKNARVTTPIGVPQSAGFQDGGVELAKFNWPTGVAVSADG 477

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ V D  N  +R++S+
Sbjct: 478 STVYVADSKNQVIRELSI 495


>gi|294817139|ref|ZP_06775781.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
 gi|294321954|gb|EFG04089.1| Receptor protein kinase [Streptomyces clavuligerus ATCC 27064]
          Length = 510

 Score = 37.7 bits (86), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 1   MDDKGNLYVADTLNLAIRKI-GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           +D  G +Y+ADT N  IRKI  D  ++T+AG  +       GP+  A+ +    VV +  
Sbjct: 219 VDGAGAVYIADTDNHRIRKITADGTISTVAGTGTGGFDGDGGPATAARLNRPMGVV-IDS 277

Query: 60  TCSLLVIDRGNAALRQIS 77
           T +L V D GN  +R+I+
Sbjct: 278 TGTLYVADYGNHRIRKIT 295


>gi|336178839|ref|YP_004584214.1| serine/threonine protein kinase [Frankia symbiont of Datisca
           glomerata]
 gi|334859819|gb|AEH10293.1| serine/threonine protein kinase [Frankia symbiont of Datisca
           glomerata]
          Length = 776

 Score = 37.7 bits (86), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFR--DGPSEDAKFSNDFDVVYV 57
           +D+ G +Y  DT N  IR+IG D  +TT+AG  +   GF   +GP+  A F+   DV   
Sbjct: 472 VDNTGAVYFTDTGNNRIRRIGADGIITTVAG--TGTYGFSGDNGPAAQAHFATPTDVARD 529

Query: 58  RPTCSLLVIDRGNAALRQIS 77
           R   +L + D  N  +R+I+
Sbjct: 530 R-AGNLYIADTDNNRIRRIN 548


>gi|288801999|ref|ZP_06407440.1| conserved hypothetical protein [Prevotella melaninogenica D18]
 gi|288335434|gb|EFC73868.1| conserved hypothetical protein [Prevotella melaninogenica D18]
          Length = 486

 Score = 37.7 bits (86), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 4/78 (5%)

Query: 5   GNLYVADTLNLAIRKI----GDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPT 60
           G LY+AD+ N  IR I    G     T   G    AGF+DG  E AKF+    V      
Sbjct: 408 GKLYIADSGNHCIRMIDTTLGKNARVTTPIGVPQSAGFQDGGVELAKFNWPTGVAVSADG 467

Query: 61  CSLLVIDRGNAALRQISL 78
            ++ V D  N  +R++S+
Sbjct: 468 STVYVADSKNQVIRELSI 485


>gi|290991466|ref|XP_002678356.1| predicted protein [Naegleria gruberi]
 gi|284091968|gb|EFC45612.1| predicted protein [Naegleria gruberi]
          Length = 1017

 Score = 37.7 bits (86), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 11/73 (15%)

Query: 7   LYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKF--SNDFDVVYVRPTCSLL 64
           +Y AD+LN  +RKI +  +TTIAG      G  DG +  + +  SN F      P+   +
Sbjct: 784 VYFADSLNSRVRKISNGKITTIAG------GIGDGLAATSAYLNSNSFTTT---PSGEFI 834

Query: 65  VIDRGNAALRQIS 77
           + D  N  +R+IS
Sbjct: 835 IADSNNNLIRKIS 847


>gi|116625892|ref|YP_828048.1| NHL repeat-containing protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116229054|gb|ABJ87763.1| NHL repeat containing protein [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 2384

 Score = 37.7 bits (86), Expect = 8.9,   Method: Composition-based stats.
 Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 2   DDKGNLYVADTLNLAIRKI-GDAGV-TTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           D  GNLY+AD  N  IR+I G  GV TTIAG  SN        +  A  S     ++V P
Sbjct: 174 DSAGNLYIADMGNFRIRRIDGQTGVITTIAGDGSNTTSPDGVLAAGAGLSQP---IWVAP 230

Query: 60  TCS--LLVIDRGNAALRQISL 78
             S  LLV + G   +R++ L
Sbjct: 231 DRSGGLLVSEMGAMRIRRVDL 251


>gi|325104254|ref|YP_004273908.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
 gi|324973102|gb|ADY52086.1| NHL repeat containing protein [Pedobacter saltans DSM 12145]
          Length = 439

 Score = 37.4 bits (85), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 1   MDDKGNLYVADTLNLAIRKIG-DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRP 59
           MD  GN+ V +     IRKI  D  V+ IAG  S   GF DG    A+F++  D V V  
Sbjct: 203 MDAAGNIIVIEKDGGRIRKIAPDGAVSLIAGTGS--LGFTDGNVSVARFNHALDGV-VDS 259

Query: 60  TCSLLVIDRGNAALRQIS 77
             ++ V DR N  +R+I+
Sbjct: 260 EGNIFVADRNNYRIRKIT 277


>gi|72392873|ref|XP_847237.1| flagellum-adhesion glycoprotein [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62358465|gb|AAX78927.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei]
 gi|70803267|gb|AAZ13171.1| flagellum-adhesion glycoprotein, putative [Trypanosoma brucei
           brucei strain 927/4 GUTat10.1]
          Length = 590

 Score = 37.4 bits (85), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 4/81 (4%)

Query: 17  IRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRQI 76
           +R +   G+ TIAG  +   G +DGP  DA F+N   VV V     + V DR N  +R+I
Sbjct: 90  VRLVNKTGIYTIAGSLTQ-RGNKDGPKGDALFNNPTSVVSVND--DIYVADRDNNCIRRI 146

Query: 77  SLNQDDCEYQYNSIS-PTDIL 96
               +   Y    ++ P DIL
Sbjct: 147 DAEGNVTRYGPQDLNKPKDIL 167


>gi|290975027|ref|XP_002670245.1| predicted protein [Naegleria gruberi]
 gi|284083802|gb|EFC37501.1| predicted protein [Naegleria gruberi]
          Length = 1363

 Score = 37.4 bits (85), Expect = 9.6,   Method: Composition-based stats.
 Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 3/76 (3%)

Query: 5   GNLYVADTLNLAIRKIG--DAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           G++ V+D L  AI+KI      +TTIAG  +   G  +GP+ +A  +  F +  V  T  
Sbjct: 57  GDILVSDALGHAIKKINVTSGVITTIAGNGTAGFGGDNGPAVNALVNKPFGIA-VSQTDE 115

Query: 63  LLVIDRGNAALRQISL 78
           +   D GN  +R+I L
Sbjct: 116 IYFADSGNNRIRKIDL 131


>gi|159035770|ref|YP_001535023.1| alkyl hydroperoxide reductase [Salinispora arenicola CNS-205]
 gi|157914605|gb|ABV96032.1| alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal
           allergen [Salinispora arenicola CNS-205]
          Length = 612

 Score = 37.4 bits (85), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 3/104 (2%)

Query: 3   DKGNLYVADTLNLAIRKIGDAGVTTIAGGKSNVAGFRDGPSEDAKFSNDFDVVYVRPTCS 62
           D   L+VAD+   AIR + D  + T  G      G  DGP+  A   +   V  + P  S
Sbjct: 360 DGSRLWVADSETSAIRYVQDGVLNTAVGQGLFEFGHVDGPAAQALLQHPLGVCAL-PDGS 418

Query: 63  LLVIDRGNAALRQISLNQDDCEYQYNSIS-PTDILMVV-GAVLV 104
           +L+ D  N A+R+     D      + ++ P+D+++   G VLV
Sbjct: 419 VLIADTYNGAVRRYDPESDSVGTVADGLAEPSDLVLTPDGGVLV 462


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.133    0.386 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,740,551,108
Number of Sequences: 23463169
Number of extensions: 245489854
Number of successful extensions: 513898
Number of sequences better than 100.0: 510
Number of HSP's better than 100.0 without gapping: 182
Number of HSP's successfully gapped in prelim test: 328
Number of HSP's that attempted gapping in prelim test: 512318
Number of HSP's gapped (non-prelim): 1606
length of query: 345
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 202
effective length of database: 9,003,962,200
effective search space: 1818800364400
effective search space used: 1818800364400
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 77 (34.3 bits)