BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037682
         (397 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225438716|ref|XP_002282505.1| PREDICTED: uncharacterized protein LOC100262755 [Vitis vinifera]
          Length = 397

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/400 (88%), Positives = 378/400 (94%), Gaps = 6/400 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTT-ATSLPVRNGLVSFSSLADKVITHLKNSG 59
           MVDVDRRMTGLNPAHIAGLRRLSARAAAP+T +T+LPVRNGL+SF++LADKVI HL+NSG
Sbjct: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPSTVSTALPVRNGLISFTTLADKVIAHLQNSG 60

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           I+VQ GLSDAEFARAEAEFGF FPPDL+A+LSAGLP+GPGFPDWRA G ARL LRASLDL
Sbjct: 61  IQVQAGLSDAEFARAEAEFGFAFPPDLKAVLSAGLPVGPGFPDWRAAG-ARLRLRASLDL 119

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           PIAAI+FQIARN+LWSKSWGPRPSDPEKALR+ARNALKRAPLLIPIFNHCYIPCNP+LAG
Sbjct: 120 PIAAISFQIARNSLWSKSWGPRPSDPEKALRIARNALKRAPLLIPIFNHCYIPCNPSLAG 179

Query: 180 NPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSL 239
           NPIF+VDENRIFCCGLDLSDFFERESLF+S  SDPQVL KQRSVSE+TAGSSSNFSRRSL
Sbjct: 180 NPIFFVDENRIFCCGLDLSDFFERESLFQS--SDPQVLKKQRSVSEKTAGSSSNFSRRSL 237

Query: 240 DAG--LGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIG 297
           D G   G RTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVD+Y+ QIG
Sbjct: 238 DTGGLAGGRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDEYIEQIG 297

Query: 298 SVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSE 357
           SVLREGGW E DI EIVTVSASGFFEGEMV+LDNQ+VLD LLLKADR S+SLRK+GWSSE
Sbjct: 298 SVLREGGWDESDIAEIVTVSASGFFEGEMVLLDNQAVLDALLLKADRFSDSLRKAGWSSE 357

Query: 358 EVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           EVSDA GFD+RPEKERKPAKKL+PELVERIGKLAESVSRS
Sbjct: 358 EVSDALGFDYRPEKERKPAKKLAPELVERIGKLAESVSRS 397


>gi|255565188|ref|XP_002523586.1| conserved hypothetical protein [Ricinus communis]
 gi|223537148|gb|EEF38781.1| conserved hypothetical protein [Ricinus communis]
          Length = 398

 Score =  661 bits (1706), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/402 (85%), Positives = 372/402 (92%), Gaps = 9/402 (2%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGLNPAHIAGLRRLSARAAAP+TA   PVRNGLVSFSSLADKVITHL+NS I
Sbjct: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPSTA---PVRNGLVSFSSLADKVITHLRNSAI 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           +VQPGLSD EFARAEAEFGF FPPDLRA+LSAGLP+GPGFPDWR+TG ARLHLRASLDLP
Sbjct: 58  QVQPGLSDVEFARAEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRSTG-ARLHLRASLDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI+FQIARN LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPC+P+LAGN
Sbjct: 117 IAAISFQIARNTLWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCHPSLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLD 240
           PIF+VDENRIFCCGLDLSDFF+RESLFRS+ESDP +L KQRSVSE++AGSS+N+SRRSLD
Sbjct: 177 PIFFVDENRIFCCGLDLSDFFDRESLFRSSESDPVILKKQRSVSEKSAGSSTNYSRRSLD 236

Query: 241 AGL--GSRTPRWVEFWSDAAID--RRRRNSSSSSSSSPERFFDM-PRSEIPKWVDDYMGQ 295
            G+  GSRTPRWVEFWSDAA+D  RR   SSSS SSSPERFFDM PRSE+P WV++Y+ Q
Sbjct: 237 TGIAFGSRTPRWVEFWSDAAVDRRRRNSASSSSESSSPERFFDMPPRSEMPTWVNEYIEQ 296

Query: 296 IGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWS 355
           IGSVLREGGWSE DI EIV VSASGFFEGEMVMLDNQ+V D LLLKADR S+SLRK+GWS
Sbjct: 297 IGSVLREGGWSETDISEIVHVSASGFFEGEMVMLDNQAVRDALLLKADRFSDSLRKAGWS 356

Query: 356 SEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           SEEVSDA GFDFRPEKERKP KKLSPEL+E+IGKLAESVSRS
Sbjct: 357 SEEVSDALGFDFRPEKERKPVKKLSPELIEKIGKLAESVSRS 398


>gi|224096758|ref|XP_002310724.1| predicted protein [Populus trichocarpa]
 gi|222853627|gb|EEE91174.1| predicted protein [Populus trichocarpa]
          Length = 399

 Score =  646 bits (1666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/400 (81%), Positives = 363/400 (90%), Gaps = 5/400 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGLNP HIAGLRRLSARAAAP+T T+LP RN L+SFSSLADKVITHL++SGI
Sbjct: 1   MVDVDRRMTGLNPGHIAGLRRLSARAAAPSTTTTLPPRNSLLSFSSLADKVITHLRSSGI 60

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           +VQPGL+++EFAR+EAEFGF FPPDLRA+LSAGLP+GPGFPDWR+ G ARLHLRASLDLP
Sbjct: 61  QVQPGLTESEFARSEAEFGFAFPPDLRAVLSAGLPVGPGFPDWRSAG-ARLHLRASLDLP 119

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI FQIARN LWSKSWGPRPSDPEKALRVARN+LKRAPLLIPIFNHCYIPC P+LAGN
Sbjct: 120 IAAICFQIARNTLWSKSWGPRPSDPEKALRVARNSLKRAPLLIPIFNHCYIPCQPSLAGN 179

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLD 240
           PIF++DE+RIFCCGLDLSDFF+RESLFRS+ES P ++ KQ+SVSE++ G S+N SR+SLD
Sbjct: 180 PIFFIDESRIFCCGLDLSDFFDRESLFRSSESHPIIIKKQKSVSEKSTGLSNNLSRKSLD 239

Query: 241 AGL--GSRTPRWVEFWSDAAID--RRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQI 296
            GL  GSRTPRWVEFWSDAA+D  RR   SSSS SSSPERFF+M RSEIPKWV DY+ +I
Sbjct: 240 TGLVNGSRTPRWVEFWSDAAVDRRRRNSASSSSGSSSPERFFEMRRSEIPKWVGDYIEKI 299

Query: 297 GSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSS 356
           GSVLREGGW E DI E+V VSASGFFEGEMV+LDNQ+VLD LLLK DR S+SLRK+GWSS
Sbjct: 300 GSVLREGGWKESDIEEMVEVSASGFFEGEMVILDNQAVLDALLLKVDRFSDSLRKAGWSS 359

Query: 357 EEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSR 396
           EEVSDA GFDFRPEKERKP KKLSPELVE+IGKLAESVSR
Sbjct: 360 EEVSDALGFDFRPEKERKPVKKLSPELVEKIGKLAESVSR 399


>gi|224081647|ref|XP_002306469.1| predicted protein [Populus trichocarpa]
 gi|222855918|gb|EEE93465.1| predicted protein [Populus trichocarpa]
          Length = 399

 Score =  644 bits (1661), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/400 (81%), Positives = 360/400 (90%), Gaps = 5/400 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGLNPAHIAGLRRLSARAAAP+T T+LP RN L+SFSSLADKVITHL+NSGI
Sbjct: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPSTTTTLPARNSLLSFSSLADKVITHLRNSGI 60

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           +VQPGL+D+EFAR+EAEFGF FPPDL A+LSAGLP+G GFPDWR+ G ARLHLRASLDLP
Sbjct: 61  QVQPGLTDSEFARSEAEFGFAFPPDLHAVLSAGLPVGAGFPDWRSAG-ARLHLRASLDLP 119

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI FQIARN LWSKSWGPRPSDPEKALR+ARN+LKRAPLLIPIFNHCYIPC P+LAGN
Sbjct: 120 IAAICFQIARNTLWSKSWGPRPSDPEKALRIARNSLKRAPLLIPIFNHCYIPCQPSLAGN 179

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLD 240
           PIF+VDENRIFCCGLDLSDFF+RE LFRS++SDP ++ KQ+SVSE++ G S+N SR+SLD
Sbjct: 180 PIFFVDENRIFCCGLDLSDFFDREYLFRSSKSDPIIIKKQKSVSEKSTGLSNNPSRKSLD 239

Query: 241 AGL--GSRTPRWVEFWSDAAID--RRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQI 296
            GL  GSRTPRWVEFWSDA +D  RR   SSSS SSSPERFF+MPRSEIP+WV DY+ QI
Sbjct: 240 TGLVNGSRTPRWVEFWSDAVVDRRRRNSASSSSGSSSPERFFEMPRSEIPRWVGDYIQQI 299

Query: 297 GSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSS 356
           GSVLREGGW E DI E+V VSASGFFEGEMV+LDNQ++LD LLLK DR S+SLRK+GWSS
Sbjct: 300 GSVLREGGWRESDIDEMVQVSASGFFEGEMVILDNQAILDALLLKVDRFSDSLRKAGWSS 359

Query: 357 EEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSR 396
           EEVSDA GFDFR EKERKP K LSPELVE+IGKLAESVSR
Sbjct: 360 EEVSDALGFDFRAEKERKPVKTLSPELVEKIGKLAESVSR 399


>gi|356496892|ref|XP_003517299.1| PREDICTED: uncharacterized protein LOC100793662 [Glycine max]
          Length = 406

 Score =  615 bits (1586), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 323/402 (80%), Positives = 366/402 (91%), Gaps = 11/402 (2%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGLNPAHIAGLRRLSARA+A   A S+PVRNG+VSF+SLADKVI HL++SGI
Sbjct: 1   MVDVDRRMTGLNPAHIAGLRRLSARASA---APSVPVRNGVVSFASLADKVINHLRDSGI 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
            VQ GLSD EFARAEAEFGFVFPPDLRA+L+AGLP+GPGFPDWR +G ARLHLRASLDLP
Sbjct: 58  HVQHGLSDTEFARAEAEFGFVFPPDLRAVLAAGLPVGPGFPDWR-SGGARLHLRASLDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI+FQIARNA+WSKSWGPRP +PEKALRVARNALKRAPLLIPIFNHCYIP NP+LAGN
Sbjct: 117 IAAISFQIARNAVWSKSWGPRPCEPEKALRVARNALKRAPLLIPIFNHCYIPSNPSLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSS--SNFSRRS 238
           PIF+VDENRIFCCGLDLSDFF+RESLFRS+E+DP +L KQRSVSE++ G S  + FSRRS
Sbjct: 177 PIFFVDENRIFCCGLDLSDFFDRESLFRSSEADPILLKKQRSVSEKSTGVSVSAAFSRRS 236

Query: 239 LDAGLGSRTPRWVEFWSDAAIDRRRRN---SSSSSSSSPERFFDMPRSEIPKWVDDYMGQ 295
           LD+G  +RTPRWVEFWSDAA D+RRRN   SS S+SSSPERFF+MPRS++P WV++Y+GQ
Sbjct: 237 LDSG--TRTPRWVEFWSDAATDKRRRNSSSSSLSASSSPERFFEMPRSKVPGWVEEYIGQ 294

Query: 296 IGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWS 355
           IGSVL+ GGWSE DI E+V VSASGFFEGEMV+LDNQ++LD LLLKAD+ S+SLRK+GWS
Sbjct: 295 IGSVLKAGGWSESDINEMVEVSASGFFEGEMVVLDNQALLDALLLKADKFSDSLRKAGWS 354

Query: 356 SEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           SEEVS+A GFDFRPEKERKPAKKLSP+L ERI KLA+SVSRS
Sbjct: 355 SEEVSEALGFDFRPEKERKPAKKLSPQLAERIEKLAQSVSRS 396


>gi|449448320|ref|XP_004141914.1| PREDICTED: uncharacterized protein LOC101216316 [Cucumis sativus]
 gi|449516621|ref|XP_004165345.1| PREDICTED: uncharacterized protein LOC101228801 [Cucumis sativus]
          Length = 393

 Score =  601 bits (1549), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 316/398 (79%), Positives = 350/398 (87%), Gaps = 6/398 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRM GLNPAHIAGLRRLSARAAA T   S P R GL+SFSSLAD VITHL+N+G+
Sbjct: 1   MVDVDRRMAGLNPAHIAGLRRLSARAAAVTP--SHPSRAGLLSFSSLADNVITHLRNTGV 58

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
            VQ GLS A+FARAEAEFGFVFPPDLRA+LSAGLP+GPGFPDWR++G AR HLRA+LDLP
Sbjct: 59  EVQTGLSIADFARAEAEFGFVFPPDLRAVLSAGLPIGPGFPDWRSSG-ARQHLRATLDLP 117

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI+FQIA+N  WSKSWGPRP DPEKALRVARNALKRAPLLIP+FNHCYIPCNP+LAGN
Sbjct: 118 IAAISFQIAKNTFWSKSWGPRPLDPEKALRVARNALKRAPLLIPLFNHCYIPCNPSLAGN 177

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLD 240
           PIF VDENRI   GLDLSDFFERE LFRS++SD   L KQRS+SE++AGSSSNFSRRSLD
Sbjct: 178 PIFSVDENRISFSGLDLSDFFEREFLFRSSQSDAHHLKKQRSISEKSAGSSSNFSRRSLD 237

Query: 241 AGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGSVL 300
              G+RTPRWVEFWSDA +DRRRRNSSSSSSSSP+R  +MPRS IPKWV++Y+ +IGS L
Sbjct: 238 T--GARTPRWVEFWSDAVVDRRRRNSSSSSSSSPDRVIEMPRSGIPKWVNEYIEEIGSTL 295

Query: 301 REGGWSEPDIVEIVTVSASGFFEG-EMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEEV 359
           REGGWSE DI EIV VSASGFFEG  MV++DNQ+VLD LLLK DR S+ LRK+GWSSEEV
Sbjct: 296 REGGWSETDITEIVQVSASGFFEGAAMVLVDNQAVLDALLLKTDRFSDVLRKAGWSSEEV 355

Query: 360 SDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           S A GFD R E+ERKPAKKLSPELVERIGKLAESV+RS
Sbjct: 356 SYALGFDHRAERERKPAKKLSPELVERIGKLAESVTRS 393


>gi|357482915|ref|XP_003611744.1| hypothetical protein MTR_5g017410 [Medicago truncatula]
 gi|355513079|gb|AES94702.1| hypothetical protein MTR_5g017410 [Medicago truncatula]
          Length = 395

 Score =  599 bits (1544), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 316/397 (79%), Positives = 358/397 (90%), Gaps = 4/397 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGLNPAHIAGLRRLSARAA+ +T++S   RN L SFSS+ DKVIT L NSG+
Sbjct: 1   MVDVDRRMTGLNPAHIAGLRRLSARAASSSTSSSSSERNCLFSFSSVVDKVITQLHNSGV 60

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           +VQPGLSDAEFARAEAEF FVFPPDLRAIL+AG+P+GPGFPDWR+T +ARL LRASLDLP
Sbjct: 61  QVQPGLSDAEFARAEAEFSFVFPPDLRAILAAGIPVGPGFPDWRST-AARLRLRASLDLP 119

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI+FQIARNA WSKSWG RPS+PEKAL++ARNALK+APLLIPIFNHCYIP NP+LAGN
Sbjct: 120 IAAISFQIARNAFWSKSWGLRPSEPEKALKIARNALKKAPLLIPIFNHCYIPSNPSLAGN 179

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLD 240
           PIFYVDENRIFCCG DLSDFF+RESLFR +ESDP++L+KQRSVSE++AGSS+ FSRRSLD
Sbjct: 180 PIFYVDENRIFCCGFDLSDFFQRESLFRRSESDPKILMKQRSVSEKSAGSSTAFSRRSLD 239

Query: 241 AGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDM-PRSEIPKWVDDYMGQIGSV 299
              G RTPRWVEFW++AA DRRRRNS SS S+SPERFFD+ PRS IP WVD+Y+ +IGSV
Sbjct: 240 T--GGRTPRWVEFWTEAATDRRRRNSLSSESTSPERFFDIPPRSVIPGWVDEYIDKIGSV 297

Query: 300 LREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEEV 359
           L+ GGWSEPDI E+V VSA GFFEGEMVMLDNQ++LD LLLKADR S+SLRK+GWSSEEV
Sbjct: 298 LKAGGWSEPDITEMVQVSACGFFEGEMVMLDNQALLDALLLKADRFSDSLRKAGWSSEEV 357

Query: 360 SDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSR 396
           S+A GFDFRPEKERKP KKLSPELVE+I KLA+SVS 
Sbjct: 358 SEALGFDFRPEKERKPVKKLSPELVEKIEKLAQSVSH 394


>gi|356537591|ref|XP_003537310.1| PREDICTED: uncharacterized protein LOC100776346 [Glycine max]
          Length = 412

 Score =  591 bits (1524), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 316/405 (78%), Positives = 359/405 (88%), Gaps = 14/405 (3%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGLNPAHIAGL      +A    A S+PVRNG+VSF+SLADKVI HL++SGI
Sbjct: 1   MVDVDRRMTGLNPAHIAGL---RRLSARAAAAPSVPVRNGVVSFASLADKVINHLRDSGI 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
            VQ GLSDAEFARAEAEFGFVFPPDLRA+L+AGLP+GPGFPDWR +G ARLHLRASLDLP
Sbjct: 58  HVQHGLSDAEFARAEAEFGFVFPPDLRAVLAAGLPVGPGFPDWR-SGGARLHLRASLDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI+FQIARNA+WSKSWGPRP +PEKALRVARNALKRAPLLIPIFNHCYIP NP+LAGN
Sbjct: 117 IAAISFQIARNAVWSKSWGPRPCEPEKALRVARNALKRAPLLIPIFNHCYIPSNPSLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSS--SNFSRRS 238
           PIF+VDE+RIFCCGLDLSDFF+RESLFRS+E+DP +L KQRSVSE+TAG S  + FSRRS
Sbjct: 177 PIFFVDESRIFCCGLDLSDFFDRESLFRSSEADPILLKKQRSVSEKTAGVSVSAAFSRRS 236

Query: 239 LDAGLGSRTPRWVEFWSDAAIDRRRRN------SSSSSSSSPERFFDMPRSEIPKWVDDY 292
           LD+G   RTPRWVEFWSDAA D+RRRN      SS S+SSSPERFF+MPRS++P WV++Y
Sbjct: 237 LDSG--ERTPRWVEFWSDAATDKRRRNSSSSSSSSLSASSSPERFFEMPRSKVPGWVEEY 294

Query: 293 MGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKS 352
           +G+IGSVL+ GGW E DI E+V VSASGFFEGEMV+LDNQ++LD LLLKAD+ S+SLRK+
Sbjct: 295 IGKIGSVLKAGGWGESDINEMVEVSASGFFEGEMVVLDNQALLDALLLKADKFSDSLRKA 354

Query: 353 GWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           GWSSEEVS+A GFDFRPEKERKPAKKLSP+L ERI KLA+SVSRS
Sbjct: 355 GWSSEEVSEALGFDFRPEKERKPAKKLSPQLAERIEKLAQSVSRS 399


>gi|15240167|ref|NP_201503.1| uncharacterized protein [Arabidopsis thaliana]
 gi|9758547|dbj|BAB08941.1| unnamed protein product [Arabidopsis thaliana]
 gi|115646816|gb|ABJ17128.1| At5g67020 [Arabidopsis thaliana]
 gi|332010908|gb|AED98291.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 394

 Score =  588 bits (1516), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 301/399 (75%), Positives = 342/399 (85%), Gaps = 7/399 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGL PAH AGLRRLSARAAAP+T T   +RN L SFS  ADKVI HLKNSGI
Sbjct: 1   MVDVDRRMTGLTPAHAAGLRRLSARAAAPSTPT---IRNSLQSFSPFADKVINHLKNSGI 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           ++QPGLSD EFAR EAEFGF FPPDLR ILSAGL +G GFPDWR+ G ARLHLRA +DLP
Sbjct: 58  KIQPGLSDTEFARVEAEFGFTFPPDLRVILSAGLSVGAGFPDWRSPG-ARLHLRAMIDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           +AA++FQIA+N+LW KSWG +P DPEKALRVARNALKRAPLLIPIF+HCYIPCNP+LAGN
Sbjct: 117 VAAVSFQIAKNSLWCKSWGLKPPDPEKALRVARNALKRAPLLIPIFDHCYIPCNPSLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLD 240
           P+F++DE RIFCCG DLS+FFERES FRS+E  P++L KQRSVSE++AGSSSNFSRRSLD
Sbjct: 177 PVFFIDETRIFCCGSDLSEFFERESAFRSSEFFPRILTKQRSVSEKSAGSSSNFSRRSLD 236

Query: 241 AGL--GSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGS 298
            G   G+   RWVEFWSDAA+DR RRNS+S+SSSS     D+P++E PKWV+ Y+ +IGS
Sbjct: 237 LGRANGAGKSRWVEFWSDAAVDRCRRNSASTSSSS-SSSPDLPKTETPKWVNQYVNRIGS 295

Query: 299 VLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEE 358
           VLR GGWSE DI EI+ VSASGFFEGEMV++DNQ+VLD LLLKA R+SESLRKSGWSSEE
Sbjct: 296 VLRRGGWSESDIDEIIHVSASGFFEGEMVIIDNQTVLDVLLLKAGRISESLRKSGWSSEE 355

Query: 359 VSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           VSDA GFDFRPEKERKP KKLSP LVE+  KLAE VS+S
Sbjct: 356 VSDALGFDFRPEKERKPVKKLSPMLVEQFEKLAEWVSQS 394


>gi|449463975|ref|XP_004149705.1| PREDICTED: uncharacterized protein LOC101213140 [Cucumis sativus]
 gi|449508304|ref|XP_004163277.1| PREDICTED: uncharacterized LOC101213140 [Cucumis sativus]
          Length = 393

 Score =  580 bits (1494), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 303/398 (76%), Positives = 350/398 (87%), Gaps = 6/398 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRM+ LNPAHIAGLRRLSARA      +S P+RN L+SFSSLADKV+THL+NSG+
Sbjct: 1   MVDVDRRMSTLNPAHIAGLRRLSARA---AATSSAPLRNSLLSFSSLADKVLTHLRNSGV 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           +VQPGLSDAEFARAEAEF F FPPDLRA+LSAGLP+GPGFPDWR+ G ARLHLR+SLDLP
Sbjct: 58  QVQPGLSDAEFARAEAEFCFSFPPDLRAVLSAGLPVGPGFPDWRSAG-ARLHLRSSLDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAAI+FQIA+N LWS SWG +P++PEKALR+ARN LKRAP+LIPIFNHCYIPCNP LAGN
Sbjct: 117 IAAISFQIAKNTLWSCSWGHKPAEPEKALRIARNLLKRAPVLIPIFNHCYIPCNPPLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLI-KQRSVSERTAGSSSNFSRRSL 239
           PIF+VDENR+ CCG DLSDFFERESLFR + SD   L  KQ S+++++   S+NFSRRS+
Sbjct: 177 PIFFVDENRVLCCGFDLSDFFERESLFRCSVSDSDSLFSKQSSLAQKSIVPSANFSRRSM 236

Query: 240 DAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGSV 299
           D+G+  RTPRWVEFWSDAAIDRRRRNSSSSS+SSP+RFF+MPRSE+PKWV  Y+ ++GSV
Sbjct: 237 DSGV-VRTPRWVEFWSDAAIDRRRRNSSSSSNSSPDRFFEMPRSEVPKWVGKYLVELGSV 295

Query: 300 LREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEEV 359
           LR GGWSE ++ E+V VSA+G F+GEMVMLDNQ+VLD LLLK DR S SLR+SGWSSEEV
Sbjct: 296 LRTGGWSESEVAEMVEVSAAGIFDGEMVMLDNQAVLDALLLKVDRFSGSLRRSGWSSEEV 355

Query: 360 SDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           S+AFGFDFRPEK +K AKKLS ELVERIGKLAESVSRS
Sbjct: 356 SEAFGFDFRPEKGKKLAKKLSAELVERIGKLAESVSRS 393


>gi|297797673|ref|XP_002866721.1| hypothetical protein ARALYDRAFT_496899 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312556|gb|EFH42980.1| hypothetical protein ARALYDRAFT_496899 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 394

 Score =  575 bits (1481), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 294/399 (73%), Positives = 339/399 (84%), Gaps = 7/399 (1%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGL PAH AGLRRLSARAAAP+T     +RN L SFS  ADKVI HLKNS I
Sbjct: 1   MVDVDRRMTGLTPAHAAGLRRLSARAAAPSTPM---IRNSLQSFSPFADKVINHLKNSCI 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           ++QPGLSD EFAR EAEFGF FPPDLR IL+AGL +GPGFPDWR+ G ARLHLRA +DLP
Sbjct: 58  KIQPGLSDTEFARVEAEFGFTFPPDLRVILTAGLSVGPGFPDWRSPG-ARLHLRAMIDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAA++FQIA+N+LW KSWG +P DP KALRVARNAL+RAPL+IPIF+HCYIPCNP+LAGN
Sbjct: 117 IAAVSFQIAKNSLWCKSWGFKPPDPVKALRVARNALRRAPLMIPIFDHCYIPCNPSLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLD 240
           P+F++DE RIFCCG DLS+FFERES FRS+E  P++L KQRSVSE++AGSSSNFSRRSLD
Sbjct: 177 PVFFIDETRIFCCGSDLSEFFERESAFRSSEFFPRILTKQRSVSEKSAGSSSNFSRRSLD 236

Query: 241 AGL--GSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGS 298
           +G   G+   RWVEFWSDAA+DR RRNS+SSSSSS     D+P++E PKWV+ Y+ +IGS
Sbjct: 237 SGRANGTGKSRWVEFWSDAAVDRCRRNSASSSSSS-SSSPDLPKTETPKWVNQYVNRIGS 295

Query: 299 VLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEE 358
           +LR GGWSE DI EI+ VSASGFF+GEMV +DNQ+VLD LLLKA R+SESLRKSGWSSEE
Sbjct: 296 ILRGGGWSESDIDEIIHVSASGFFDGEMVFMDNQTVLDVLLLKAGRISESLRKSGWSSEE 355

Query: 359 VSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           VS+A GF FRPEKERKP +KLSP LVE+  KLAE VSRS
Sbjct: 356 VSEALGFGFRPEKERKPVQKLSPVLVEQFEKLAEWVSRS 394


>gi|297819714|ref|XP_002877740.1| hypothetical protein ARALYDRAFT_485385 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323578|gb|EFH53999.1| hypothetical protein ARALYDRAFT_485385 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 403

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 307/407 (75%), Positives = 348/407 (85%), Gaps = 14/407 (3%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGL PAH AGLRRLSARAAAPTT T   VRN LVSFSSLA++VI+HL  S I
Sbjct: 1   MVDVDRRMTGLRPAHAAGLRRLSARAAAPTTPT---VRNSLVSFSSLAEQVISHLHTSRI 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           +VQPGL+D+EFA+AEAEF F FPPDLRA+L+AGLP+  GFPDWR+ G ARLHLRA +DLP
Sbjct: 58  QVQPGLTDSEFAKAEAEFAFSFPPDLRAVLTAGLPVSAGFPDWRSPG-ARLHLRAMIDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAA++FQIARN LWSKSWG RPSDPEKALRVARNALKRAPL+IPIF+HCYIPCNP+LAGN
Sbjct: 117 IAAVSFQIARNTLWSKSWGLRPSDPEKALRVARNALKRAPLMIPIFDHCYIPCNPSLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSV----SERTAGSSSNFSR 236
           P+FY+DE RIFCCG DLSDFFERES+FR +++ P VL KQRSV    +  ++ SSSNFSR
Sbjct: 177 PVFYIDETRIFCCGSDLSDFFERESVFRGSDTCPVVLTKQRSVSEKSAGSSSSSSSNFSR 236

Query: 237 RSLDAG--LGSRTPRWVEFWSDAAID----RRRRNSSSSSSSSPERFFDMPRSEIPKWVD 290
            SLD+G   GS TPRWVEFWSDAA+D        + SSS SSSPER+ ++PRSE PKWVD
Sbjct: 237 MSLDSGRVHGSSTPRWVEFWSDAAVDRRRRNSASSMSSSHSSSPERYLELPRSETPKWVD 296

Query: 291 DYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLR 350
           DY+ +IGSVLR GGWSE D+ +IV VSASGFFEGEMV+LDNQ+VLD LLLKA R SESLR
Sbjct: 297 DYVSRIGSVLRGGGWSESDVDDIVHVSASGFFEGEMVILDNQAVLDALLLKAGRFSESLR 356

Query: 351 KSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           K+GWSSEEVSDA GFDFRPEKE+KP KKLSPELV+RIGKLAESVSRS
Sbjct: 357 KAGWSSEEVSDALGFDFRPEKEKKPVKKLSPELVQRIGKLAESVSRS 403


>gi|358349440|ref|XP_003638745.1| hypothetical protein MTR_142s1018 [Medicago truncatula]
 gi|355504680|gb|AES85883.1| hypothetical protein MTR_142s1018 [Medicago truncatula]
          Length = 401

 Score =  546 bits (1407), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 288/408 (70%), Positives = 340/408 (83%), Gaps = 18/408 (4%)

Query: 1   MVDVDRRMTGLNPA--HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNS 58
           MVDVDRRMTGLNPA  H+AGLRRLSARAA+ +TAT+   RNGL+SFS +ADK+IT+L+NS
Sbjct: 1   MVDVDRRMTGLNPARAHLAGLRRLSARAASISTATTT-ARNGLLSFSPIADKIITNLRNS 59

Query: 59  GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLD 118
           GI+VQPGLS+ EFAR EAEFGF+FPPDL+AIL+AGLP+G GFPDWR     RLHLRASLD
Sbjct: 60  GIQVQPGLSEPEFARLEAEFGFIFPPDLKAILTAGLPVGAGFPDWRT----RLHLRASLD 115

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
           LP+AAI+FQIARN LW++ WG +P++PEKALR+ARNALK+APLLIPIFNHCYIPCNP+LA
Sbjct: 116 LPMAAISFQIARNTLWARCWGLKPTEPEKALRIARNALKKAPLLIPIFNHCYIPCNPSLA 175

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERESLFRSTE--SDPQVLIKQRSVSER---TAGSSSN 233
           GNP+FYVDENRIFCCG DLSDFF+RES  RS+E    P V  KQRS +E+   T  S +N
Sbjct: 176 GNPVFYVDENRIFCCGFDLSDFFQRESPNRSSEFSPGPVVYKKQRSFTEKSVTTFCSEAN 235

Query: 234 FSRRSLDAGLGSRTPRWVEFWSDAAID---RRRRNSSSSSSSSPERFFDMPRSEIPKWVD 290
           F+RRSLDA  G RTPRWVEFWSDAA+D   R   +   S + SPERFF + + E PKWV+
Sbjct: 236 FNRRSLDA--GGRTPRWVEFWSDAAVDRRRRISSSRVESRAVSPERFFYIRKFEEPKWVE 293

Query: 291 DYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLR 350
           +Y+ +IGSVLREGGWSEPDI E+V VS SGFFEG+MVML+NQ+VLD +LLK DR S+SLR
Sbjct: 294 NYVEEIGSVLREGGWSEPDITEMVEVSGSGFFEGDMVMLNNQAVLDAILLKVDRFSDSLR 353

Query: 351 KSGWSSEEVSDAFGFDFRP-EKERKPAKKLSPELVERIGKLAESVSRS 397
           KSGWSSEEVSDA GFDF+P +KE+K  KKLSPELV  I KL +SV+RS
Sbjct: 354 KSGWSSEEVSDALGFDFQPEKKEKKLVKKLSPELVCSIEKLVQSVTRS 401


>gi|15229727|ref|NP_190603.1| uncharacterized protein [Arabidopsis thaliana]
 gi|6523045|emb|CAB62313.1| putative protein [Arabidopsis thaliana]
 gi|28416689|gb|AAO42875.1| At3g50340 [Arabidopsis thaliana]
 gi|110743322|dbj|BAE99549.1| hypothetical protein [Arabidopsis thaliana]
 gi|332645134|gb|AEE78655.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 403

 Score =  538 bits (1387), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 311/407 (76%), Positives = 349/407 (85%), Gaps = 14/407 (3%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRRMTGL PAH AGLRRLSARAAAPTT T   VRN LVSFSSLAD+VI+HL  S I
Sbjct: 1   MVDVDRRMTGLRPAHAAGLRRLSARAAAPTTPT---VRNSLVSFSSLADQVISHLHTSRI 57

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           +VQPGL+D+EFARAEAEF F FPPDLRA+L+AGLP+G GFPDWR+ G ARLHLRA +DLP
Sbjct: 58  QVQPGLTDSEFARAEAEFAFAFPPDLRAVLTAGLPVGAGFPDWRSPG-ARLHLRAMIDLP 116

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
           IAA++FQIARN LWSKSWG RPSDPEKALRVARNALKRAPL+IPIF+HCYIPCNP+LAGN
Sbjct: 117 IAAVSFQIARNTLWSKSWGLRPSDPEKALRVARNALKRAPLMIPIFDHCYIPCNPSLAGN 176

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSV----SERTAGSSSNFSR 236
           P+FY+DE RIFCCG DLSDFFERES+FR +++ P VL KQRSV    +  ++ SSSNFSR
Sbjct: 177 PVFYIDETRIFCCGSDLSDFFERESVFRGSDTCPVVLTKQRSVSEKSAGSSSSSSSNFSR 236

Query: 237 RSLDAG--LGSRTPRWVEFWSDAAID----RRRRNSSSSSSSSPERFFDMPRSEIPKWVD 290
            SLD+G   GS TPRWVEFWSDAA+D        + SSS SSSPER+ D+PRSE PKWVD
Sbjct: 237 MSLDSGRVHGSSTPRWVEFWSDAAVDRRRRNSASSMSSSHSSSPERYLDLPRSETPKWVD 296

Query: 291 DYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLR 350
           DY+ +IGSVLR GGWSE D+ +IV VSASGFFEGEMV+LDNQ+VLD LLLKA R SESLR
Sbjct: 297 DYVNRIGSVLRGGGWSESDVDDIVHVSASGFFEGEMVILDNQAVLDALLLKAGRFSESLR 356

Query: 351 KSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           K+GWSSEEVSDA GFDFRPEKE+KP KKLSPELV+RIGKLAESVSRS
Sbjct: 357 KAGWSSEEVSDALGFDFRPEKEKKPVKKLSPELVQRIGKLAESVSRS 403


>gi|296082421|emb|CBI21426.3| unnamed protein product [Vitis vinifera]
          Length = 372

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 284/373 (76%), Positives = 311/373 (83%), Gaps = 46/373 (12%)

Query: 24  ARAAAPTT-ATSLPVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVF 82
           ARAAAP+T +T+LPVRNGL+SF++LADKVI HL+NSGI+VQ GLSDAEFARAEAEFGF F
Sbjct: 35  ARAAAPSTVSTALPVRNGLISFTTLADKVIAHLQNSGIQVQAGLSDAEFARAEAEFGFAF 94

Query: 83  PPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRP 142
           PPDL+A+LSAGLP+GPGFPDWRA G ARL LRASLDLPIAAI+FQIARN+LWSKSWGPRP
Sbjct: 95  PPDLKAVLSAGLPVGPGFPDWRAAG-ARLRLRASLDLPIAAISFQIARNSLWSKSWGPRP 153

Query: 143 SDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFE 202
           SDPEKALR+ARNALKRAPLLIPIFNHCYIPCNP+LAGNPIF+VDENRIFCC         
Sbjct: 154 SDPEKALRIARNALKRAPLLIPIFNHCYIPCNPSLAGNPIFFVDENRIFCCD-------- 205

Query: 203 RESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDAGLGSRTPRWVEFWSDAAIDRR 262
                                   T G +            G RTPRWVEFWSDAAIDRR
Sbjct: 206 ------------------------TGGLA------------GGRTPRWVEFWSDAAIDRR 229

Query: 263 RRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFF 322
           RRNSSSSSSSSPERFFDMPRSEIPKWVD+Y+ QIGSVLREGGW E DI EIVTVSASGFF
Sbjct: 230 RRNSSSSSSSSPERFFDMPRSEIPKWVDEYIEQIGSVLREGGWDESDIAEIVTVSASGFF 289

Query: 323 EGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEEVSDAFGFDFRPEKERKPAKKLSPE 382
           EGEMV+LDNQ+VLD LLLKADR S+SLRK+GWSSEEVSDA GFD+RPEKERKPAKKL+PE
Sbjct: 290 EGEMVLLDNQAVLDALLLKADRFSDSLRKAGWSSEEVSDALGFDYRPEKERKPAKKLAPE 349

Query: 383 LVERIGKLAESVS 395
           LVERIGKLAES++
Sbjct: 350 LVERIGKLAESLN 362


>gi|413921868|gb|AFW61800.1| hypothetical protein ZEAMMB73_719518 [Zea mays]
          Length = 405

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 208/413 (50%), Positives = 283/413 (68%), Gaps = 26/413 (6%)

Query: 1   MVDVDRRMTGLNPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSG 59
           MVDVDRRM  L PA H AGLRRLS RAAA  ++ S   R GL SF+ LA  V+ HL+ SG
Sbjct: 1   MVDVDRRMASLTPAAHAAGLRRLSTRAAASPSSASASPRQGLHSFAPLAAAVLGHLRASG 60

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + V PGL++ E A AEAE GF FPPDLRA+L+AGLP GPGF DWR+       LR++ D+
Sbjct: 61  VAVLPGLTELELAHAEAEMGFAFPPDLRAVLAAGLPSGPGFADWRSCAG----LRSAFDM 116

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           PI A + QIAR ALW + WGPRP+DP++ALR+AR+A++RAPLL+P+F+ C++PC P LAG
Sbjct: 117 PIVAASLQIARGALWPRCWGPRPADPDRALRLARSAIRRAPLLVPLFDRCFLPCRPCLAG 176

Query: 180 NPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSL 239
           NP+F+V ++R+ CCGLD+  F  R+S F+  +         +         ++  +RRSL
Sbjct: 177 NPVFFVTDDRVLCCGLDILHFVTRDSCFQPPDLRAPPPPVAQQQQYAGEAVATPCTRRSL 236

Query: 240 DAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSS------------PERFFDMPRSEIPK 287
           DA    + PRW+EFWSDAA D RRRNSS S +S+            P R+   PR+  P 
Sbjct: 237 DAACRGKAPRWIEFWSDAASDHRRRNSSFSDASTASSLSSGCASPPPARW---PRN--PH 291

Query: 288 WVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGE---MVMLDNQSVLDGLLLKADR 344
           WVD Y+ ++G VLR+GGW + ++ E+V V+ASG F+GE   + ++D+ +VLD LLLKAD+
Sbjct: 292 WVDSYLDRLGHVLRQGGWRDTEVTEMVEVAASGVFDGEEAAVPVVDSDAVLDALLLKADQ 351

Query: 345 LSESLRKSGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
            S+SLR++GWSSE+VSDA G D R  KER +PA  +  E+  ++ +LA+SV+R
Sbjct: 352 CSDSLRRAGWSSEDVSDALGLDLRWCKERPRPAMLVPQEIAVKVERLAQSVAR 404


>gi|42407374|dbj|BAD09363.1| unknown protein [Oryza sativa Japonica Group]
          Length = 398

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 224/409 (54%), Positives = 294/409 (71%), Gaps = 25/409 (6%)

Query: 1   MVDVDRRMTGLNPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSG 59
           MVDVDRRM GL+PA H AGLRRLS RAAA  ++ S   R+GL SF +LA  V++HL+ SG
Sbjct: 1   MVDVDRRMAGLSPAAHAAGLRRLSTRAAAGPSSASASPRHGLHSFHALAGAVLSHLRASG 60

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + V PGLSDAE ARAEAE GF FPPDLRA+L+ GLP GPGFPDWR     R  LR++ DL
Sbjct: 61  VAVLPGLSDAELARAEAEMGFAFPPDLRAVLAMGLPSGPGFPDWRT----RAGLRSAFDL 116

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           PIAA + QIAR ALW + WGPRP+DP++ALR+AR++++RAPLL+P+F+ C++PC P LAG
Sbjct: 117 PIAAASLQIARGALWPRCWGPRPADPDRALRLARSSIRRAPLLVPLFDRCFLPCRPCLAG 176

Query: 180 NPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSL 239
           NPIF+V ++R+ CCGLD+  FF R+S F+  +          S    ++G ++ + RRSL
Sbjct: 177 NPIFFVTDDRVLCCGLDILHFFTRDSSFQPLD---LRPPSSSSSVAPSSGEATPYMRRSL 233

Query: 240 DAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSS----------PERFFDMPRSEIPKWV 289
           DA  G + PRW+EFWSDAA DRRRR+SSSS +S+          P R     RS  P WV
Sbjct: 234 DAACGGKAPRWIEFWSDAASDRRRRDSSSSEASTASSSSGCASPPAR-----RSRTPHWV 288

Query: 290 DDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMV-MLDNQSVLDGLLLKADRLSES 348
           D Y+ ++GSVL+ GGW + ++ E+V V+ASG F+GE    +D  +VLD LLLKADR S+S
Sbjct: 289 DTYLDRLGSVLKSGGWRDTEVNEMVEVTASGLFDGEEAPAVDADAVLDALLLKADRCSDS 348

Query: 349 LRKSGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           LR++GWSSE+VSDA G D R  KER +PA +L PE+  ++ +LA+SV+R
Sbjct: 349 LRRAGWSSEDVSDALGLDLRRCKERPRPAVQLPPEIAVKVERLAKSVAR 397


>gi|115476846|ref|NP_001062019.1| Os08g0472400 [Oryza sativa Japonica Group]
 gi|113623988|dbj|BAF23933.1| Os08g0472400 [Oryza sativa Japonica Group]
          Length = 391

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 217/402 (53%), Positives = 287/402 (71%), Gaps = 25/402 (6%)

Query: 8   MTGLNPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGIRVQPGL 66
           M GL+PA H AGLRRLS RAAA  ++ S   R+GL SF +LA  V++HL+ SG+ V PGL
Sbjct: 1   MAGLSPAAHAAGLRRLSTRAAAGPSSASASPRHGLHSFHALAGAVLSHLRASGVAVLPGL 60

Query: 67  SDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITF 126
           SDAE ARAEAE GF FPPDLRA+L+ GLP GPGFPDWR     R  LR++ DLPIAA + 
Sbjct: 61  SDAELARAEAEMGFAFPPDLRAVLAMGLPSGPGFPDWRT----RAGLRSAFDLPIAAASL 116

Query: 127 QIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVD 186
           QIAR ALW + WGPRP+DP++ALR+AR++++RAPLL+P+F+ C++PC P LAGNPIF+V 
Sbjct: 117 QIARGALWPRCWGPRPADPDRALRLARSSIRRAPLLVPLFDRCFLPCRPCLAGNPIFFVT 176

Query: 187 ENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDAGLGSR 246
           ++R+ CCGLD+  FF R+S F+  +          S    ++G ++ + RRSLDA  G +
Sbjct: 177 DDRVLCCGLDILHFFTRDSSFQPLD---LRPPSSSSSVAPSSGEATPYMRRSLDAACGGK 233

Query: 247 TPRWVEFWSDAAIDRRRRNSSSSSSSS----------PERFFDMPRSEIPKWVDDYMGQI 296
            PRW+EFWSDAA DRRRR+SSSS +S+          P R     RS  P WVD Y+ ++
Sbjct: 234 APRWIEFWSDAASDRRRRDSSSSEASTASSSSGCASPPAR-----RSRTPHWVDTYLDRL 288

Query: 297 GSVLREGGWSEPDIVEIVTVSASGFFEGEMV-MLDNQSVLDGLLLKADRLSESLRKSGWS 355
           GSVL+ GGW + ++ E+V V+ASG F+GE    +D  +VLD LLLKADR S+SLR++GWS
Sbjct: 289 GSVLKSGGWRDTEVNEMVEVTASGLFDGEEAPAVDADAVLDALLLKADRCSDSLRRAGWS 348

Query: 356 SEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           SE+VSDA G D R  KER +PA +L PE+  ++ +LA+SV+R
Sbjct: 349 SEDVSDALGLDLRRCKERPRPAVQLPPEIAVKVERLAKSVAR 390


>gi|115479539|ref|NP_001063363.1| Os09g0456700 [Oryza sativa Japonica Group]
 gi|51536303|dbj|BAD38471.1| unknown protein [Oryza sativa Japonica Group]
 gi|113631596|dbj|BAF25277.1| Os09g0456700 [Oryza sativa Japonica Group]
          Length = 396

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 208/406 (51%), Positives = 279/406 (68%), Gaps = 21/406 (5%)

Query: 1   MVDVDRRMTGL-NPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNS 58
           MVDVD RM GL +PA H AGLRRLS RAAA   + S   R+GL SF  +A  V++HL+ +
Sbjct: 1   MVDVDSRMAGLAHPAAHAAGLRRLSTRAAAGPPSASASPRHGLHSFDGVAAAVLSHLRKT 60

Query: 59  GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLD 118
           G+ V PGLSDAEFAR EAE GF FPPDLRA+L+ GLP G GFPDWR     R  LRA+ D
Sbjct: 61  GVVVLPGLSDAEFARVEAEMGFTFPPDLRAVLAMGLPSGAGFPDWRG----RAWLRAAFD 116

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
           LPIAA + QIA+ ALW + WG RPSDP++A R+AR+A++RAPLL+P+F+ CY+PC P LA
Sbjct: 117 LPIAAASLQIAKGALWPRCWGRRPSDPDRARRLARSAIRRAPLLVPLFDRCYLPCRPCLA 176

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRS 238
           GNP+F++ ++R+ CCGLD+  FF RES F       Q L      +  ++G  +  +RRS
Sbjct: 177 GNPVFFITDDRVLCCGLDILHFFTRESSF-------QPLDVSSPSATPSSGICTPHTRRS 229

Query: 239 LDAGLGSRTPRWVEFWSDAA------IDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDY 292
           LDA  G +  RW+EFWSDAA      +      S++SSSSS       P    P WVD+Y
Sbjct: 230 LDAVCGGQALRWIEFWSDAASDRRRRVSSSSEASTASSSSSSSGCPSPPPRSTPLWVDNY 289

Query: 293 MGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMV-MLDNQSVLDGLLLKADRLSESLRK 351
           + ++GSVL++GGW + ++ E+V V+ASG F+GE     D  +VLD L LK DR S+SLR+
Sbjct: 290 LDKLGSVLKKGGWRDREVDEMVEVTASGLFDGEEAPPADADAVLDALFLKTDRCSDSLRR 349

Query: 352 SGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           +GW+SE+VSDAFG D R  KE+ +PA ++ PE+  ++ +LA++V+R
Sbjct: 350 AGWTSEDVSDAFGLDLRRRKEQPRPAVQIPPEIAAKVQRLAQAVAR 395


>gi|218202275|gb|EEC84702.1| hypothetical protein OsI_31642 [Oryza sativa Indica Group]
          Length = 396

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 205/406 (50%), Positives = 275/406 (67%), Gaps = 21/406 (5%)

Query: 1   MVDVDRRMTGL-NPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNS 58
           MVDVD RM GL +PA H AGLRRLS RAAA   + S   R+GL SF  +A  V++HL+ +
Sbjct: 1   MVDVDSRMAGLAHPAAHAAGLRRLSTRAAAGPPSASASPRHGLHSFDGVAAAVLSHLRKT 60

Query: 59  GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLD 118
           G+ V PGLSDAEFAR EAE GF FPPDLRA+L+ GLP G GFPDWR     R  LRA+ D
Sbjct: 61  GVVVLPGLSDAEFARVEAEMGFTFPPDLRAVLAMGLPSGAGFPDWRG----RAGLRAAFD 116

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
           LPIAA + QIA+ ALW + WG RPSDP++A R+AR+A++RAPLL+P+F+ CY+PC P LA
Sbjct: 117 LPIAAASLQIAKGALWPRCWGRRPSDPDRARRLARSAIRRAPLLVPLFDRCYLPCRPCLA 176

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRS 238
           GNP+F++ ++R+ CCGLD+  FF RES F       Q L      +  ++G  +  +RRS
Sbjct: 177 GNPVFFITDDRVLCCGLDILHFFTRESSF-------QPLDVSSPSATPSSGICTPHTRRS 229

Query: 239 LDAGLGSRTPRWVEFW------SDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDY 292
           LDA  G + PRW+EFW                 S++SSSSS       P    P WVD+Y
Sbjct: 230 LDAVCGGQAPRWIEFWSDAASDRRRRDSSSSEASTASSSSSSSGCPSPPPRSTPLWVDNY 289

Query: 293 MGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMV-MLDNQSVLDGLLLKADRLSESLRK 351
           + ++GSVL++GGW + ++ E+V V+ASG F+GE     D  +VLD L LK DR S+SLR+
Sbjct: 290 LDKLGSVLKKGGWRDREVDEMVEVTASGLFDGEEAPPADADAVLDALFLKTDRCSDSLRR 349

Query: 352 SGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           +GW+SE+VSDAFG D R  KE+ +PA ++ PE+  ++ +LA++V+R
Sbjct: 350 AGWTSEDVSDAFGLDLRRRKEQPRPAVQIPPEIAAKVQRLAQAVAR 395


>gi|357148094|ref|XP_003574625.1| PREDICTED: uncharacterized protein LOC100834941 [Brachypodium
           distachyon]
          Length = 406

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 216/413 (52%), Positives = 293/413 (70%), Gaps = 25/413 (6%)

Query: 1   MVDVDRRMTGLNPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSG 59
           MVDVDRRM GL+PA H AGLRRLS RAAA  ++ S   R+GL SF +LA  V++HL+ SG
Sbjct: 1   MVDVDRRMAGLSPAAHAAGLRRLSTRAAAGPSSASASPRHGLHSFDALAAAVLSHLRASG 60

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + V PGL+DAE ARAEAE GF FPPDLRA+L+ GLP GPGFPDWR     R  LRA+ DL
Sbjct: 61  VAVLPGLTDAELARAEAEMGFAFPPDLRAVLAMGLPSGPGFPDWRT----RAGLRAAFDL 116

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           PIAA + QIAR ALW + WGPRP+DP++ALR+AR+A++RAPLL+P+F+ C++PC P LAG
Sbjct: 117 PIAAASLQIARGALWPRCWGPRPADPDRALRLARSAIRRAPLLVPLFDRCFLPCRPCLAG 176

Query: 180 NPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTA--GSSSNFSRR 237
           NP+F+V ++R+ CCGLD+  FF R+S F+       + ++  S    +A   +++ + RR
Sbjct: 177 NPVFFVTDDRVLCCGLDILHFFTRDSSFQPA----NMALQSPSFPPSSALGEATTPYMRR 232

Query: 238 SLDAGLGSR-TPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMP--------RSEIPKW 288
           SLDA  G +  PRW+EFWSDAA DRRRR+SSSS +S+                R+  P W
Sbjct: 233 SLDAACGGKLAPRWIEFWSDAASDRRRRDSSSSEASNVSSSSSGCSSPPPGGRRTRTPHW 292

Query: 289 VDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEG----EMVMLDNQSVLDGLLLKADR 344
           VD Y+ ++GSVL++GGW + ++ E+V V+ASG F+G        +D  +VLD L+LKADR
Sbjct: 293 VDSYLDKLGSVLKKGGWRDMEVNEMVEVTASGLFDGGEEAPAAAVDADAVLDALVLKADR 352

Query: 345 LSESLRKSGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
            S+SLR++GWS+E+VSDA G D R  KER +PA ++  E+  ++ +LA+SV+R
Sbjct: 353 CSDSLRRAGWSAEDVSDALGLDLRRCKERSRPAVRIPSEIAAKVERLAQSVAR 405


>gi|357153901|ref|XP_003576604.1| PREDICTED: uncharacterized protein LOC100829031 [Brachypodium
           distachyon]
          Length = 402

 Score =  364 bits (935), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 213/410 (51%), Positives = 286/410 (69%), Gaps = 21/410 (5%)

Query: 1   MVDVDRRMTGLNPAHIAG-LRRLSARAAAPTTATSLPV-RNGLVSFSSLADKVITHLKNS 58
           MVDVD RM GL PA  A  LRRLS RAAA   + S    R+GL SF  +A  V++HL++S
Sbjct: 1   MVDVDHRMAGLGPAAHAAALRRLSTRAAAGPASASASTPRHGLYSFHGVASSVLSHLRSS 60

Query: 59  GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLD 118
           G+ + PGLSDAEFARAEAEFGF FPPDLRA+L+ GLP GPGFPDWR     R  LRA+ D
Sbjct: 61  GVAILPGLSDAEFARAEAEFGFTFPPDLRAVLALGLPSGPGFPDWRG----RSGLRAAFD 116

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
           LP+AA + QIAR ALW + WGPRP+D ++A R+AR+A++RAPLL+P+F+ CY+PC P LA
Sbjct: 117 LPVAAASLQIARGALWPRCWGPRPADTDRARRLARSAIRRAPLLVPLFDRCYLPCRPCLA 176

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRS 238
           GNP+F+V ++R+ CCGLD+  FF RES F+   ++  +             S++  +RRS
Sbjct: 177 GNPVFFVTDDRVLCCGLDVVHFFTRESSFQPM-AEISISSPLAKTHSSGTSSTTPCTRRS 235

Query: 239 LDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDM--------PRSEIPKWVD 290
           LDA    + PRW+EFWSDA+ DRRRR+SSSS +S+               PR   P+WVD
Sbjct: 236 LDA---VQAPRWIEFWSDASSDRRRRDSSSSEASTASSSSSSSSAGCPSPPRRSTPRWVD 292

Query: 291 DYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVM--LDNQSVLDGLLLKADRLSES 348
            Y+ ++GSVL+EGGW + ++ E+V V+ASG F+GE      D ++VLD LLLK DR S+S
Sbjct: 293 SYLDKLGSVLKEGGWRDREVDEMVEVTASGMFDGEETTPASDAEAVLDALLLKTDRCSDS 352

Query: 349 LRKSGWSSEEVSDAFGFDF-RPEKERKPAKKLSPELVERIGKLAESVSRS 397
           LR++GWSSE+VSDA G DF R +K  + A ++ PE+  ++ +LA++V+RS
Sbjct: 353 LRRAGWSSEDVSDALGLDFRRGKKPLRSAVRVPPEIAAKVQRLAQAVARS 402


>gi|242081727|ref|XP_002445632.1| hypothetical protein SORBIDRAFT_07g022970 [Sorghum bicolor]
 gi|241941982|gb|EES15127.1| hypothetical protein SORBIDRAFT_07g022970 [Sorghum bicolor]
          Length = 422

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 206/425 (48%), Positives = 280/425 (65%), Gaps = 33/425 (7%)

Query: 1   MVDVDRRMTGLN---PAHIAGLRRLSARAAAPTTATSLPVR-NGLVSFSSLADKVITHLK 56
           MVDVDRRM+GL+    A  AGLRRLS RAAA  ++ S   R +GL SF+ +A  V+ HL+
Sbjct: 1   MVDVDRRMSGLSLTPAAQAAGLRRLSTRAAAGPSSASASPRQHGLHSFAPVAAAVLDHLR 60

Query: 57  NSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRAS 116
            + + V PGL+D E ARAEAE GF FPPDLRA+L+AG+P GPGFPDWR+    R  LRA+
Sbjct: 61  AAHVPVLPGLTDLELARAEAELGFAFPPDLRAVLAAGVPSGPGFPDWRS----RSGLRAA 116

Query: 117 LDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPT 176
            DLPIAA + QIAR ALW + WG RP+DP++ALR+AR+A++RAPLL+P+F+ C++PC P 
Sbjct: 117 FDLPIAAASLQIARGALWPRCWGARPADPDRALRLARSAIRRAPLLVPLFDRCFLPCRPC 176

Query: 177 LAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERT----AGSSS 232
           LAGNP+F+V ++R+ C GLDL  FF R++ F+        L    SV  +     A +++
Sbjct: 177 LAGNPVFFVTDDRVLCVGLDLLHFFTRDACFQPMLDPRSALPSSSSVPPQQHAGEAAAAT 236

Query: 233 NFSRRSLDAGLGSR-TPRWVEFW------------SDAAIDRRRRNSSSSSSSSPERFFD 279
             +RRSLDA  G R  PRW+EFW            S +        SS  +S  P     
Sbjct: 237 QCTRRSLDAACGGRKAPRWIEFWSDAASDRRRRDSSSSEASTASSRSSGCASPPPPPPAP 296

Query: 280 MPRSEIPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEG-------EMVMLDNQ 332
             RS  P WVD Y+ ++G VLR+GGW + ++ E+V V+ASG F+G       +   +D+ 
Sbjct: 297 ARRSRNPHWVDSYLDRLGHVLRQGGWRDSEVTEMVEVAASGVFDGGEDAAATDPAAVDSD 356

Query: 333 SVLDGLLLKADRLSESLRKSGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLA 391
           +VLD LLLKADR S+SLR++GWSSE+VSDA G D R  KER +PA ++ PE+  ++ +LA
Sbjct: 357 AVLDALLLKADRCSDSLRRAGWSSEDVSDALGLDLRRCKERPRPAVRVPPEIAVKVERLA 416

Query: 392 ESVSR 396
           +SV+R
Sbjct: 417 QSVAR 421


>gi|242049488|ref|XP_002462488.1| hypothetical protein SORBIDRAFT_02g026580 [Sorghum bicolor]
 gi|241925865|gb|EER99009.1| hypothetical protein SORBIDRAFT_02g026580 [Sorghum bicolor]
          Length = 412

 Score =  358 bits (920), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 209/416 (50%), Positives = 292/416 (70%), Gaps = 25/416 (6%)

Query: 1   MVDVDRRMTGLNP--AHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNS 58
           MVDVD RM GL P  AH A LRRLS RAA   ++ S   R+GL SF ++A  V++HL+++
Sbjct: 1   MVDVDHRMAGLAPSVAHTAALRRLSTRAAGGASSASASPRHGLHSFHAVAAGVLSHLRDA 60

Query: 59  -GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASL 117
            G+ V PGL+DAE ARAEAEFGF FPPDLRA+L+ G+P GPGFPDWR  G  R  LRA+ 
Sbjct: 61  AGVTVLPGLTDAELARAEAEFGFTFPPDLRAVLALGVPSGPGFPDWR--GRGRAGLRAAF 118

Query: 118 DLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTL 177
           DLP AA + Q+AR ALW + WG RP+DP++ALR+AR+A++RAPLL+P+F+ CY+PC P L
Sbjct: 119 DLPAAAASLQVARGALWPRCWGRRPADPDRALRLARSAIRRAPLLVPLFDRCYLPCRPCL 178

Query: 178 AGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSER-----TAGSSS 232
           AGNP+F+V ++R+ CCGLDL  FF R+S F+ T +   V+     ++          S  
Sbjct: 179 AGNPVFFVADDRVLCCGLDLLHFFTRDSSFQPTTAMDHVVSSSSPLASPLLSAGATTSRP 238

Query: 233 NFSRRSLDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDM-------PRSEI 285
           + +RRSLDA    + PRW+EFWSDAA DRRRR+SSSS +S+              PR   
Sbjct: 239 SCTRRSLDA---VQAPRWIEFWSDAASDRRRRDSSSSEASTASSSSSSSSSCSSPPRRST 295

Query: 286 PKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVML---DNQSVLDGLLLKA 342
           P+WVD+Y+ ++GS+L++GGW + ++ E+V V+ASGFF+GE       D++++LD L+LK 
Sbjct: 296 PRWVDNYLDELGSMLKKGGWRDREVDEMVEVTASGFFDGEEAGAPAPDSEAILDALVLKT 355

Query: 343 DRLSESLRKSGWSSEEVSDAFGFDFR--PEKERKPAKKLSPELVERIGKLAESVSR 396
           DR S+SLR++GW+SE+VSDA G DFR   E+ R  A ++ PE+  ++ +LA++++R
Sbjct: 356 DRCSDSLRRAGWTSEDVSDALGLDFRRGKERSRSAAVRIPPEIAAKVQRLAQALAR 411


>gi|222640718|gb|EEE68850.1| hypothetical protein OsJ_27643 [Oryza sativa Japonica Group]
          Length = 398

 Score =  352 bits (902), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 210/409 (51%), Positives = 278/409 (67%), Gaps = 25/409 (6%)

Query: 1   MVDVDRRMTGLNPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSG 59
           MVDVDRRM GL+PA H AGLRRLS RAAA  ++ S   R+GL SF +LA  V++HL+ SG
Sbjct: 1   MVDVDRRMAGLSPAAHAAGLRRLSTRAAAGPSSASASPRHGLHSFHALAGAVLSHLRASG 60

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + V PGLSDAE ARAEAE G  F P             PG      TG  R  LR++ DL
Sbjct: 61  VAVLPGLSDAELARAEAEMGSRFRPTCARCSPWASRRVPG----SRTGRTRAGLRSAFDL 116

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           PIAA + QIAR ALW + WGPRP+DP++ALR+AR++++RAPLL+P+F+ C++PC P LAG
Sbjct: 117 PIAAASLQIARGALWPRCWGPRPADPDRALRLARSSIRRAPLLVPLFDRCFLPCRPCLAG 176

Query: 180 NPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSL 239
           NPIF+V ++R+ CCGLD+  FF R+S F+  +          S    ++G ++ + RRSL
Sbjct: 177 NPIFFVTDDRVLCCGLDILHFFTRDSSFQPLD---LRPPSSSSSVAPSSGEATPYMRRSL 233

Query: 240 DAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSS----------PERFFDMPRSEIPKWV 289
           DA  G + PRW+EFWSDAA DRRRR+SSSS +S+          P R     RS  P WV
Sbjct: 234 DAACGGKAPRWIEFWSDAASDRRRRDSSSSEASTASSSSGCASPPAR-----RSRTPHWV 288

Query: 290 DDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMV-MLDNQSVLDGLLLKADRLSES 348
           D Y+ ++GSVL+ GGW + ++ E+V V+ASG F+GE    +D  +VLD LLLKADR S+S
Sbjct: 289 DTYLDRLGSVLKSGGWRDTEVNEMVEVTASGLFDGEEAPAVDADAVLDALLLKADRCSDS 348

Query: 349 LRKSGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           LR++GWSSE+VSDA G D R  KER +PA +L PE+  ++ +LA+SV+R
Sbjct: 349 LRRAGWSSEDVSDALGLDLRRCKERPRPAVQLPPEIAVKVERLAKSVAR 397


>gi|414869881|tpg|DAA48438.1| TPA: hypothetical protein ZEAMMB73_731708 [Zea mays]
          Length = 399

 Score =  350 bits (897), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 204/414 (49%), Positives = 281/414 (67%), Gaps = 34/414 (8%)

Query: 1   MVDVDRRMTGLNPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSG 59
           MVDVDRRM GL PA H AGLRRLS RAAA  ++ S   R+GL SF+ LA  V+ HL+ SG
Sbjct: 1   MVDVDRRMAGLTPAAHAAGLRRLSTRAAAGPSSASASPRHGLHSFAPLAAAVLGHLRASG 60

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + V PGL++ E ARAEAE GF FPPDLRA+L+AGLP GPGFPDWR+    R  LR++ DL
Sbjct: 61  VAVLPGLTELELARAEAEMGFAFPPDLRAVLAAGLPSGPGFPDWRS----RAGLRSAFDL 116

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           PIAA + QIAR ALW + WG RP+DP++ALR+AR+A++RAPLL+P+F+ C++PC P LAG
Sbjct: 117 PIAAASLQIARGALWPRCWGARPADPDRALRLARSAIRRAPLLVPLFDRCFLPCRPCLAG 176

Query: 180 NPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSL 239
           NP+F+V ++R+ CCGLD+  FF R+S F+    DP+        S      ++  +RRSL
Sbjct: 177 NPVFFVTDDRVLCCGLDVLHFFARDSCFQPL--DPR--------SRDGEAGATPCTRRSL 226

Query: 240 DAGLGS-RTPRWVEFW----------SDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKW 288
           DA  GS + PRW+EFW            ++ +    +S SS  +SP       R+  P W
Sbjct: 227 DAACGSGKAPRWIEFWSDAASDRRRRDSSSSETSTASSLSSGCASPPPAAR--RARNPHW 284

Query: 289 VDDYMGQIGSVLREGGWSEPDIVEIVTVSA--SGFFEGEMV---MLDNQSVLDGLLLKAD 343
           VD Y+ ++G VLR+GGW + ++ E+V V+A  SG  +GE      +D+ + LD LLL AD
Sbjct: 285 VDSYLDRLGHVLRQGGWRDTEVTEMVEVAASGSGVLDGEEAAAPAVDSDAALDALLLNAD 344

Query: 344 RLSESLRKSGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           R S++LR++GWSSE+VSDA G D R  K+R +PA ++ PE+  ++ +LA++V+R
Sbjct: 345 RCSDALRRAGWSSEDVSDALGLDLRRCKQRPRPAVRVPPEIAVKVERLAQAVAR 398


>gi|222641717|gb|EEE69849.1| hypothetical protein OsJ_29625 [Oryza sativa Japonica Group]
          Length = 375

 Score =  338 bits (867), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 196/406 (48%), Positives = 261/406 (64%), Gaps = 42/406 (10%)

Query: 1   MVDVDRRMTGL-NPA-HIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNS 58
           MVDVD RM GL +PA H AGLRRLS RAAA   + S   R+GL SF  +A  V++HL+ +
Sbjct: 1   MVDVDSRMAGLAHPAAHAAGLRRLSTRAAAGPPSASASPRHGLHSFDGVAAAVLSHLRKT 60

Query: 59  GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLD 118
           G+ V PGLSDAEFAR EAE GF FPPDLRA+L+ GLP G GFPDWR     R  LRA+ D
Sbjct: 61  GVVVLPGLSDAEFARVEAEMGFTFPPDLRAVLAMGLPSGAGFPDWR----GRAWLRAAFD 116

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
           LPIAA + QIA+ ALW                      +RAPLL+P+F+ CY+PC P LA
Sbjct: 117 LPIAAASLQIAKGALWP---------------------RRAPLLVPLFDRCYLPCRPCLA 155

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRS 238
           GNP+F++ ++R+ CCGLD+  FF RES F       Q L      +  ++G  +  +RRS
Sbjct: 156 GNPVFFITDDRVLCCGLDILHFFTRESSF-------QPLDVSSPSATPSSGICTPHTRRS 208

Query: 239 LDAGLGSRTPRWVEFWSDAA------IDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDY 292
           LDA  G +  RW+EFWSDAA      +      S++SSSSS       P    P WVD+Y
Sbjct: 209 LDAVCGGQALRWIEFWSDAASDRRRRVSSSSEASTASSSSSSSGCPSPPPRSTPLWVDNY 268

Query: 293 MGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMV-MLDNQSVLDGLLLKADRLSESLRK 351
           + ++GSVL++GGW + ++ E+V V+ASG F+GE     D  +VLD L LK DR S+SLR+
Sbjct: 269 LDKLGSVLKKGGWRDREVDEMVEVTASGLFDGEEAPPADADAVLDALFLKTDRCSDSLRR 328

Query: 352 SGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           +GW+SE+VSDAFG D R  KE+ +PA ++ PE+  ++ +LA++V+R
Sbjct: 329 AGWTSEDVSDAFGLDLRRRKEQPRPAVQIPPEIAAKVQRLAQAVAR 374


>gi|414589663|tpg|DAA40234.1| TPA: hypothetical protein ZEAMMB73_566138 [Zea mays]
          Length = 400

 Score =  337 bits (863), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 207/411 (50%), Positives = 290/411 (70%), Gaps = 27/411 (6%)

Query: 1   MVDVDRRMTGLNP--AHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNS 58
           MVD+D RM GL P  AH A LRRLS RAAA  ++ S   R+GL SF ++A  V++HL+ S
Sbjct: 1   MVDIDHRMAGLAPSVAHTAALRRLSTRAAAGPSSASASPRHGLHSFHAVAAGVLSHLRES 60

Query: 59  GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLD 118
           G+ V PGLSDAE  RAEAEFGF FPPDLRA+L+ G+P GPGFPDWR     R  LRA+ D
Sbjct: 61  GVAVLPGLSDAELVRAEAEFGFTFPPDLRAVLALGVPSGPGFPDWRG----RAGLRAAFD 116

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
           LP AA + Q+AR ALW + WG RP+DP +ALR+AR+A++RAP L+P+F+ CY+PC P LA
Sbjct: 117 LPAAAASLQVARGALWPRCWGRRPADPGRALRLARSAVRRAPPLVPLFDRCYLPCRPCLA 176

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSS-NFSRR 237
           GNP+F+V ++R+ CCGLDL  FF R+S F+  +   + ++   S S  +AG+S  + +RR
Sbjct: 177 GNPVFFVADDRVLCCGLDLLYFFTRDSSFQPAD---RAVVS--SSSPLSAGASRPSCTRR 231

Query: 238 SLDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPER---------FFDMPRSEIPKW 288
           SLDA    + PRW+EFWSDAA DRRRR+SSSS +S+                PR   P+W
Sbjct: 232 SLDA---VQAPRWIEFWSDAASDRRRRDSSSSEASTASSSSSSSSSSGCPSPPRRPTPRW 288

Query: 289 VDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEG--EMVMLDNQSVLDGLLLKADRLS 346
           VD Y+ ++GS+L++GGW + ++ E+V V+ASGF +   E    D++++LD L+LK DR S
Sbjct: 289 VDSYLDELGSMLKKGGWRDREVDEMVEVTASGFLDDGEEAPTPDSEAILDALVLKTDRCS 348

Query: 347 ESLRKSGWSSEEVSDAFGFDFRPEKER-KPAKKLSPELVERIGKLAESVSR 396
           +SLR++GW++E+VSDA G DFR  +ER + A ++ PE+  ++ +LA++++R
Sbjct: 349 DSLRRAGWTAEDVSDALGLDFRRGEERSRSAVRIPPEIAAKVQRLAQALAR 399


>gi|414885752|tpg|DAA61766.1| TPA: hypothetical protein ZEAMMB73_208208 [Zea mays]
          Length = 499

 Score =  307 bits (787), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 199/406 (49%), Positives = 283/406 (69%), Gaps = 18/406 (4%)

Query: 1   MVDVDRRMTGLNP--AHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNS 58
           MVDVD+RM GL P  AH A LRRLS RAAA  ++ S   R+GL SF  +A  V++HL+ +
Sbjct: 99  MVDVDQRMAGLAPSVAHTAALRRLSTRAAAGPSSASASPRHGLHSFDGVAAGVLSHLREA 158

Query: 59  GIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLD 118
           G+ V PGLSDAE ARAEAEFGF FPPDLRA+L+ G+P GP FPDWR     R  LRA+LD
Sbjct: 159 GVVVLPGLSDAELARAEAEFGFAFPPDLRAVLALGVPSGPRFPDWRG----RAGLRAALD 214

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
           LP AA + Q AR ALW + WG RP+DP++ALR+AR A++RAP L+P+F+ CY+PC P LA
Sbjct: 215 LPAAAASLQAARGALWPRCWGRRPADPDRALRLARAAVRRAPPLVPLFDRCYLPCRPCLA 274

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERE-SLFRSTESDPQVLIKQRSVSERTAGSS-SNFSR 236
           GNP+F+V ++R+ CCGLDL  FF RE S F+ T      +    + S  +AG+S  + +R
Sbjct: 275 GNPVFFVADDRVLCCGLDLLHFFARESSSFQPTVEHRHAVSSSLASSPLSAGASRPSCTR 334

Query: 237 RSLDAGLGSRTPRWVEFW---SDAAIDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYM 293
           RSLD   G + PRW+EFW   +     R   +S +S++S+       PR   P+WVD+Y+
Sbjct: 335 RSLD---GVQAPRWIEFWSDAASDRRRRDSSSSEASTASATSSSSSPPRRSTPRWVDNYL 391

Query: 294 GQIGSVLREGGWSEPDIVEIVTVSASGFFEG-EMVMLDNQSVLDGLLLKADRLSESLRKS 352
            ++GS+L++GGW + ++ E+V V+ASG F+G E    D+++VLD L+LK DR S+SLR++
Sbjct: 392 DELGSMLKKGGWRDREVDEMVEVTASGLFDGVEAPAPDSEAVLDALVLKTDRCSDSLRRA 451

Query: 353 GWSSEEVSDAFGFDFRPEKERKPAK---KLSPELVERIGKLAESVS 395
           GW+SE+VSDA G DFR  K+++ ++   ++ PE+  ++ +LA++++
Sbjct: 452 GWTSEDVSDALGLDFRRCKDKERSRSGVRIPPEIAAKVQRLAQALA 497


>gi|302797489|ref|XP_002980505.1| hypothetical protein SELMODRAFT_112997 [Selaginella moellendorffii]
 gi|300151511|gb|EFJ18156.1| hypothetical protein SELMODRAFT_112997 [Selaginella moellendorffii]
          Length = 393

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 161/402 (40%), Positives = 225/402 (55%), Gaps = 31/402 (7%)

Query: 14  AHIAGLRRLSARA-AAPTTATSLPVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFA 72
           AH+AGL+RLSARA A P    S   R    SFS     VI  L+   + +  GLS+ EF 
Sbjct: 3   AHVAGLKRLSARANAGPALPASPGSRKVSFSFSGYGINVIERLRKCRVHIDKGLSEEEFE 62

Query: 73  RAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNA 132
           R EA F F FPPDLR IL  GLPLG GFP+WR+ G  +L +   ++ P A +   + R  
Sbjct: 63  RIEALFQFTFPPDLRGILREGLPLGDGFPNWRSGGPQQLKM--WINKPRAGLLADVERGV 120

Query: 133 LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFC 192
            W K WGPRP++  KA+++AR  LK+AP+L+P++ HCY+   P  AGNP+F+V    ++C
Sbjct: 121 FWMKQWGPRPAELHKAVQIARGQLKKAPVLLPVYGHCYVAAAPVRAGNPVFFVQGRDVYC 180

Query: 193 CGLDLSDFFERESLF-RSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDAGLGSRTPRWV 251
           CGLD+SDFF R+S   RS  S           S   + S  +   R+      SR  R +
Sbjct: 181 CGLDVSDFFHRDSFASRSVLSPRSSSGSSCDGSCEDSPSVRDAQGRA----RASRVRR-I 235

Query: 252 EFWSDAAIDRRRRNSSSSSS---------SSPERFFDM-------PRSEIPKWVDDYMGQ 295
           E WSD +   +  NS +S+S         +  + +F+          S + + V+D    
Sbjct: 236 ELWSDLSEKGQSMNSVASTSDLEDDEQAATDCDDYFEFDGEWENGAESLMKETVED---- 291

Query: 296 IGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWS 355
           + S LR+GGW E DI EIV  +AS   + +  M D Q+VL+GL    D LS SLRK+GWS
Sbjct: 292 MSSKLRQGGWKEEDIAEIVG-AASSCSDHKRGMPDRQTVLEGLAFHLDYLSCSLRKAGWS 350

Query: 356 SEEVSDAFGFDFRPEKERKP-AKKLSPELVERIGKLAESVSR 396
             +V++   FDF    ER+  AKK+SP++ +RIG LA  V++
Sbjct: 351 VHDVAETLCFDFGASGERRARAKKVSPQVAQRIGMLASFVAQ 392


>gi|302805807|ref|XP_002984654.1| hypothetical protein SELMODRAFT_120823 [Selaginella moellendorffii]
 gi|300147636|gb|EFJ14299.1| hypothetical protein SELMODRAFT_120823 [Selaginella moellendorffii]
          Length = 393

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 161/402 (40%), Positives = 225/402 (55%), Gaps = 31/402 (7%)

Query: 14  AHIAGLRRLSARA-AAPTTATSLPVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFA 72
           AH+AGL+RLSARA A P    S   R    SFS     VI  L+   + +  GLS+ EF 
Sbjct: 3   AHVAGLKRLSARANAGPALPASPGSRKVSFSFSGYGINVIERLRKCRVHIDKGLSEEEFE 62

Query: 73  RAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNA 132
           R EA F F FPPDLR IL  GLPLG GFP+WR+ G  +L +   ++ P A +   + R  
Sbjct: 63  RIEALFQFTFPPDLRGILREGLPLGDGFPNWRSGGPQQLKM--WINKPRAGLLADVERGV 120

Query: 133 LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFC 192
            W K WGPRP++  KA+++AR  LK+AP+L+P++ HCY+   P  AGNP+F+V    ++C
Sbjct: 121 FWLKQWGPRPAELHKAVQIARGQLKKAPVLLPVYGHCYVAAAPVRAGNPVFFVQGRDVYC 180

Query: 193 CGLDLSDFFERESLF-RSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDAGLGSRTPRWV 251
           CGLD+SDFF R+S   RS  S           S   + S  +   R+      SR  R +
Sbjct: 181 CGLDVSDFFHRDSFASRSVLSPRSSSGSSCDGSCEDSPSVRDAQGRA----RASRVRR-I 235

Query: 252 EFWSDAAIDRRRRNSSSSSS---------SSPERFFDM-------PRSEIPKWVDDYMGQ 295
           E WSD +   +  NS +S+S         +  + +F+          S + + V+D    
Sbjct: 236 ELWSDLSEKGQSMNSVASTSDLEDDEQAATDCDDYFEFDGEWENGAESLMKETVED---- 291

Query: 296 IGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWS 355
           + S LR+GGW E DI EIV  +AS   + +  M D Q+VL+GL    D LS SLRK+GWS
Sbjct: 292 MSSKLRQGGWKEEDIAEIVG-AASSCSDHKRGMPDRQTVLEGLAFHLDYLSCSLRKAGWS 350

Query: 356 SEEVSDAFGFDFRPEKERKP-AKKLSPELVERIGKLAESVSR 396
             +V++   FDF    ER+  AKK+SP++ +RIG LA  V++
Sbjct: 351 VHDVAETLCFDFGASGERRARAKKVSPQVAQRIGMLASFVAQ 392


>gi|168035032|ref|XP_001770015.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678736|gb|EDQ65191.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 144/360 (40%), Positives = 213/360 (59%), Gaps = 18/360 (5%)

Query: 14  AHIAGLRRLSARAAA-PTTATSLPVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFA 72
           A +AGL+RLSARA+A P+T  +   R    SFS+ A  VI HLK   + +  GLSD EF 
Sbjct: 21  AQLAGLKRLSARASAGPSTPGT---RRVFFSFSAYARSVIDHLKKCEVPIAEGLSDEEFE 77

Query: 73  RAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNA 132
           + EA +GF FPPDL+ IL  GLP G GFP+WR TG+ +  LR  ++LPI  +  ++A + 
Sbjct: 78  KIEATYGFTFPPDLKGILQEGLPTGSGFPNWR-TGNVQ-QLRMRINLPILRLLHEVAHSR 135

Query: 133 LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFC 192
            W K WGPRP + + A+R+AR+AL++APLL+P+  HCYI   P  AGNP+F V +N +  
Sbjct: 136 FWWKPWGPRPLEIDHAVRIARSALRKAPLLVPMHGHCYISSAPNDAGNPVFLVYQNNVVY 195

Query: 193 CGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDAGL---GSRTPR 249
           CG D++DFFERE+ FR+ + +P   + +   S  +  S    +  +L   +    +R+PR
Sbjct: 196 CGYDVADFFEREA-FRAHDGEPPFELDEWQGSMTSEKSLPPKALINLSMAIPPWAARSPR 254

Query: 250 WVEFWSDAA--IDRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGSVLREGGWSE 307
            +EFWS+      ++R +S     + P      P  E  KW+  Y  ++  VLR+GGW E
Sbjct: 255 RIEFWSELVDKYQKQRNDSLKECRTLP----PTPNLESSKWLTGYFEEMSLVLRQGGWEE 310

Query: 308 PDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEEVSDAFGFDF 367
            DI +++   AS  F  +   LD ++VL  L  + + LS SL+K+GWS  +V++     F
Sbjct: 311 NDISDMMDPKASPEFWNQ--QLDAEAVLVTLAREVELLSTSLKKAGWSVPDVTETMKTGF 368


>gi|168063785|ref|XP_001783849.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664627|gb|EDQ51339.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 143/372 (38%), Positives = 216/372 (58%), Gaps = 30/372 (8%)

Query: 14  AHIAGLRRLSARAAA-PTTATSLPVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFA 72
           A +AGL+RLSARA+A P T      R    SFS+ A  V+  LK   + +  GLSD EF 
Sbjct: 24  AQLAGLKRLSARASAGPITPPG--TRRVFFSFSAYARSVLDKLKKCKVPIAEGLSDEEFE 81

Query: 73  RAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNA 132
           R EA +G  FPPDL+ IL  GLP+G GFP+WRA    R HL   ++LP+  +  ++A ++
Sbjct: 82  RIEATYGITFPPDLKGILHEGLPVGAGFPNWRAGN--RDHLSMRINLPVVGLLREVANSS 139

Query: 133 LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFC 192
            W K+WGPRP D  +A+R+AR+A++++P+L+P++ HCYIP +P LAGNP+F+V +     
Sbjct: 140 FWWKAWGPRPKDINQAVRIARSAIRKSPILVPMYGHCYIPSSPNLAGNPVFFVYQKNAVY 199

Query: 193 CGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTA--GSSSNFSRRSLDAGLGSRTPRW 250
           CG D++DFF RE+          ++    S     +  G +   + R +D    +RTPR 
Sbjct: 200 CGYDVADFFNREAFLSHKFEFGDLMYDMGSFRNEKSIEGGALEHANRDVDFEPDARTPRR 259

Query: 251 VEFWSDAAIDRRRRN--SSSSSSSSPERF-FDM--PRSEIP-------------KWVDDY 292
           +EFWSD A D+  +   S     SSP R  F++  PR++               KW++ Y
Sbjct: 260 IEFWSDLA-DKVHKQIFSRVEGFSSPRRQDFNLLKPRTDSTNENLEEQDGIKPSKWLNVY 318

Query: 293 MGQIGSVLREGGWSEPDIVE-IVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRK 351
           + +I  VLR+GGW E DI + IV+ S S  +  +    D+++VL  L  + + LS SL++
Sbjct: 319 LDEISLVLRQGGWREDDINDMIVSESPSKRWNQQH---DSEAVLVSLAREVELLSTSLKR 375

Query: 352 SGWSSEEVSDAF 363
           +GWS  +V++  
Sbjct: 376 AGWSVPDVTETM 387


>gi|168005714|ref|XP_001755555.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693262|gb|EDQ79615.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 382

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 127/325 (39%), Positives = 185/325 (56%), Gaps = 34/325 (10%)

Query: 14  AHIAGLRRLSARAAA-PTTATSLPVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFA 72
           AH+AGL+RLSARA+A P T      R    SFS+ A  V+  LK+  + +  GLSD EF 
Sbjct: 24  AHLAGLKRLSARASAGPVTPPG--TRRVFFSFSAYARNVVDKLKSCKVPISEGLSDEEFD 81

Query: 73  RAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNA 132
           + EA FG  FPPDL+ IL   LP+G GFP+WRA   +R HL   ++LPI  +  ++A ++
Sbjct: 82  KIEATFGITFPPDLKGILHEALPVGAGFPNWRA--GSRDHLSMRINLPIVGLLREVANSS 139

Query: 133 LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFC 192
            W K+WGPRP+D + A+RVAR AL+R+P+L+PI+ HCYIP +P LAGNP+F+  +  +  
Sbjct: 140 FWWKAWGPRPTDTKLAVRVARTALRRSPILVPIYGHCYIPSSPGLAGNPVFFAYQKNVVF 199

Query: 193 CGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDAGLGSRTPRWVE 252
           CG D++DFF+RE    S +      I ++S+      + S   R        +RT R +E
Sbjct: 200 CGYDVADFFDREDFESSDDITKYPHIVEKSLPSEILINVSMVQR-------DARTSRHIE 252

Query: 253 FWSDAAIDRRRRNSSSSSS-SSPERFF-------DMPRSEIP--------------KWVD 290
           FWSD A    ++ +    S  SPE+           P  + P              +W++
Sbjct: 253 FWSDLAEKVHKQFAVHGESFFSPEKIVSDSSKRSSSPVKKHPERDRMEEKNEAQSLRWLN 312

Query: 291 DYMGQIGSVLREGGWSEPDIVEIVT 315
            Y+ +I  VLR+GGW E DI +++ 
Sbjct: 313 GYLDEISLVLRQGGWREDDINDMIV 337


>gi|302757759|ref|XP_002962303.1| hypothetical protein SELMODRAFT_77776 [Selaginella moellendorffii]
 gi|300170962|gb|EFJ37563.1| hypothetical protein SELMODRAFT_77776 [Selaginella moellendorffii]
          Length = 382

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 187/348 (53%), Gaps = 24/348 (6%)

Query: 36  PVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLP 95
           P R    SF++ A  ++ HL+  G+RV+PGLS+ E    E++ G +FPPDLRAIL+ GLP
Sbjct: 21  PRRQCCFSFAAYARSLMDHLRRCGVRVEPGLSETELGHLESQLGLLFPPDLRAILTEGLP 80

Query: 96  LGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNA 155
            G  FPDWR   +A   LR  +  P+A +  Q+ R+  W   WG RPS  E+A  +A  A
Sbjct: 81  AGSSFPDWRHEPAA--SLRTRIQQPLADLCAQVRRSRFWCDKWGDRPSGGEQAAAMAAAA 138

Query: 156 LKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDP- 214
           L+R   L+P++ +CYIP + + AGNP+  V    +   G D++DFFERE      E DP 
Sbjct: 139 LERVAPLVPVYGNCYIPSSGS-AGNPVLLVQSGEVSVVGNDVADFFEREVFHGDQEEDPN 197

Query: 215 -QVLIKQRSVSERTAGSSSNFSRRSLDAGLGSRTPRWVEFWSDAAID---RRRRNSSSS- 269
            ++L ++ S S  +    S    RS D G   +       W   ++D   RR++  S S 
Sbjct: 198 SRLLDEEDSSSSSSCLMGSMKMGRSSDGGDRDKISS-SSLWGMRSLDFAFRRKKGISRSM 256

Query: 270 ----SSSSPERFFDMP-RSEIPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVS------A 318
               S+ SP+    +P +SE  +W+  Y+  +   LR  GW E +I E++ +        
Sbjct: 257 AGTRSAESPQVDRSIPIQSEQQRWLHGYLDDMAGRLRIAGWKEDEICEMIVLPPIDDPVV 316

Query: 319 SGFFEGEMVMLDNQS---VLDGLLLKADRLSESLRKSGWSSEEVSDAF 363
           S      +V    QS   V+ GL+ K   LS+SL++SGWS ++V++AF
Sbjct: 317 SRSSSNAIVPSSKQSRQAVIQGLVFKVQELSDSLKRSGWSVQDVAEAF 364


>gi|302763601|ref|XP_002965222.1| hypothetical protein SELMODRAFT_82965 [Selaginella moellendorffii]
 gi|300167455|gb|EFJ34060.1| hypothetical protein SELMODRAFT_82965 [Selaginella moellendorffii]
          Length = 373

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 122/341 (35%), Positives = 181/341 (53%), Gaps = 19/341 (5%)

Query: 36  PVRNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLP 95
           P R    SF++ A  ++ HL+  G+RV+PGLS+ E    E++ G +FPPDLRAIL+ GLP
Sbjct: 21  PRRQCCFSFAAYARSLMDHLRRCGVRVEPGLSETELGHLESQLGLLFPPDLRAILTEGLP 80

Query: 96  LGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNA 155
            G  FPDWR   +A   LRA +  P+A +  Q+ R+  W   WG RPS  E+A  +A  A
Sbjct: 81  AGSSFPDWRHEPAA--SLRARIQQPLADLCAQVRRSRFWCDKWGDRPSGGEQAAAMAAAA 138

Query: 156 LKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDP- 214
           L+R   L+P++ +CYIP + + AGNP+  V    +   G D++DFFERE      E DP 
Sbjct: 139 LERVAPLVPVYGNCYIPSSGS-AGNPVLLVQSGEVSVVGNDVADFFEREVFHGDQEEDPN 197

Query: 215 -QVLIKQRSVSERTAGSSSNFSRRSLDAGLGSRTPRWVEFWSDAAID--RRRRNSSSSSS 271
            ++L ++ S S  +    S    RS D G   +       W   ++D   RR+   S S 
Sbjct: 198 SRLLDEEDSSSSSSCLMGSMKMGRSSDGGDRDKISS-SSLWGMRSLDFAFRRKKGISRSM 256

Query: 272 SSPERFFDMPRSEIPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVS------ASGFFEGE 325
           +        PR    +W+  Y+  +   LR  GW E +I E++ +        S      
Sbjct: 257 AGTRSAEVSPRPS--QWLHGYLDDMAGRLRIAGWKEDEICEMIVLPPIDDPVVSRSSSNA 314

Query: 326 MVMLDNQS---VLDGLLLKADRLSESLRKSGWSSEEVSDAF 363
           +V  + QS   V+ GL+ K   LS+SL++SGWS +EV++AF
Sbjct: 315 IVPSNKQSRQAVIQGLVFKVQELSDSLKRSGWSVQEVAEAF 355


>gi|224028629|gb|ACN33390.1| unknown [Zea mays]
          Length = 378

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 141/414 (34%), Positives = 207/414 (50%), Gaps = 66/414 (15%)

Query: 1   MVDVDRRMT---GLN--PAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHL 55
           MVDVDRR +   GL   P+H AGLRRLS RA+AP++           + +     ++ HL
Sbjct: 1   MVDVDRRPSHPHGLPRPPSHAAGLRRLSTRASAPSSPAP----PSPSASAPSPTALLAHL 56

Query: 56  KNSGIRVQPGLSDAEFARAEAEFGFV-FPPDLRAILSAGLPLGPGFPDWRATGSARLHLR 114
             +G+ V PGLS  EFA AEA  G V  PPDLR +L+ GLP G GFPD+R     RL   
Sbjct: 57  AAAGVAVLPGLSATEFALAEAALGGVQLPPDLRELLALGLPSGDGFPDYRTPAGLRLLRF 116

Query: 115 ASLDLPIA-AITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPC 173
           A+ ++P A A T  +A                    R  R A    P L+P+    Y+P 
Sbjct: 117 AAQEVPAAVAATLPLA-----------------PGRRAGRAA---PPPLVPLCGRHYVPA 156

Query: 174 NPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQR--SVSERTAGSS 231
            P LAGNP+FYV ++ +   G + +DF  R     + E  P   ++++  ++    +G+ 
Sbjct: 157 TPCLAGNPVFYVSDSGVTFAGANAADFLLRAF---AAEPPPGAPLRRQLSALVPPPSGAP 213

Query: 232 SNFSRRSLDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEI------ 285
            + +RRSLD+  G R PRW+EFW+DAA             ++ +RF ++P          
Sbjct: 214 PSTARRSLDSVTG-RAPRWIEFWTDAA-------------AAGDRFLEVPTGATSAAASS 259

Query: 286 --PKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKAD 343
             P+W+   + +  S+LR GGW   ++ E+VT        GE      +  +  L L  D
Sbjct: 260 AAPEWLRPSLEEAASMLRRGGWGLCEVEEMVT--------GEGPSGAGEVNVVALALTVD 311

Query: 344 RLSESLRKSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           R    L+  GW +EEV +  G    P K R+    L P++  R+G+LAE+VSR+
Sbjct: 312 RCCRDLKSGGWGAEEVVEMLGALLGPRKPRRAVAALPPDVAARMGRLAEAVSRA 365


>gi|226491486|ref|NP_001143327.1| uncharacterized protein LOC100275916 [Zea mays]
 gi|195617878|gb|ACG30769.1| hypothetical protein [Zea mays]
 gi|414867141|tpg|DAA45698.1| TPA: hypothetical protein ZEAMMB73_931724 [Zea mays]
          Length = 378

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 141/414 (34%), Positives = 206/414 (49%), Gaps = 66/414 (15%)

Query: 1   MVDVDRRMT---GLN--PAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHL 55
           MVDVDRR +   GL   P+H AGLRRLS RA+AP++           + +     ++ HL
Sbjct: 1   MVDVDRRPSHPHGLPRPPSHAAGLRRLSTRASAPSSPAP----PSPSASAPSPTALLAHL 56

Query: 56  KNSGIRVQPGLSDAEFARAEAEFGFV-FPPDLRAILSAGLPLGPGFPDWRATGSARLHLR 114
             +G+ V PGLS  EFA AEA  G V  PPDLR +L+ GLP G GFPD+R     RL   
Sbjct: 57  AAAGVAVLPGLSATEFALAEAALGGVQLPPDLRELLALGLPSGDGFPDYRTPAGLRLLRF 116

Query: 115 ASLDLPIA-AITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPC 173
           A+ ++P A A T  +A                    R  R A    P L+P+    Y+P 
Sbjct: 117 AAQEVPAAVAATLPLA-----------------PGRRAGRAA---PPPLVPLCGRHYVPA 156

Query: 174 NPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQR--SVSERTAGSS 231
            P LAGNP+FYV ++ +   G + +DF  R     + E  P   ++++  +     +G+ 
Sbjct: 157 TPCLAGNPVFYVSDSGVTFAGANAADFLLRAF---AAEPPPGAPLRRQLSAPVPPPSGAP 213

Query: 232 SNFSRRSLDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPRSEI------ 285
            + +RRSLD+  G R PRW+EFW+DAA             ++ +RF ++P          
Sbjct: 214 PSTARRSLDSVTG-RAPRWIEFWTDAA-------------AAGDRFLEVPTGATSAAASS 259

Query: 286 --PKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKAD 343
             P+W+   + +  S+LR GGW   ++ E+VT        GE      +  +  L L  D
Sbjct: 260 AAPEWLRPSLEEAASMLRRGGWGLCEVEEMVT--------GEGPSGAGEVNVVALALTVD 311

Query: 344 RLSESLRKSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           R    L+  GW +EEV +  G    P K R+    L P++  R+G+LAE+VSR+
Sbjct: 312 RCCRDLKSGGWGAEEVVEMLGALLGPRKPRRAVAALPPDVAARMGRLAEAVSRA 365


>gi|449451563|ref|XP_004143531.1| PREDICTED: uncharacterized protein LOC101204059 [Cucumis sativus]
          Length = 365

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 107/335 (31%), Positives = 170/335 (50%), Gaps = 40/335 (11%)

Query: 28  APTTATSLPVRNGLV--SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPD 85
           A  T T  P R  L   SF++ A  VI HLK+  I V PGLSD EF   E+ F F FPPD
Sbjct: 2   ATATVTVNPPRPKLACFSFAAYAKTVIDHLKSLQIPVHPGLSDPEFTSVESTFRFSFPPD 61

Query: 86  LRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDP 145
           LR+IL  GLP+G GFP+WR++ + +LH+   ++LP   +  +I++   W +SWG +P D 
Sbjct: 62  LRSILQEGLPIGSGFPNWRSSSTQQLHIL--INLPKFCLLKEISQRKFWCQSWGAQPDDT 119

Query: 146 EKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERES 205
             A+ +A+  L RAP+L+PI+ + YIP  P +AGNP+F++D+  I     DL+ FF+   
Sbjct: 120 NDAVALAKQFLDRAPVLVPIYKNWYIPSAPNMAGNPVFHLDDGEIRVSSFDLAGFFQ--- 176

Query: 206 LFRSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDA-GLGSRTPRWVEFWSDAAIDRRRR 264
                              E +    +   R  +D+    +   R VEFW++ A  ++  
Sbjct: 177 -----------------THEYSQLGKAETDRLVIDSPAWAATEARAVEFWTEVASRKKAT 219

Query: 265 NSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEG 324
               +     E  F+M        +D  +  +   LREGGW E D+ +++ +        
Sbjct: 220 GREVTEGWWNEGEFEMG-------LDGCLEDVFWKLREGGWREEDVRDMMMMD-----RH 267

Query: 325 EMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEEV 359
           +  +  N++ ++ L +    + E L   GWS ++V
Sbjct: 268 DRSLEQNEATMEKLRVS---VCEILLSGGWSRDDV 299


>gi|115453391|ref|NP_001050296.1| Os03g0396000 [Oryza sativa Japonica Group]
 gi|14029026|gb|AAK52567.1|AC079853_20 Unknown protein [Oryza sativa Japonica Group]
 gi|108708626|gb|ABF96421.1| expressed protein [Oryza sativa Japonica Group]
 gi|113548767|dbj|BAF12210.1| Os03g0396000 [Oryza sativa Japonica Group]
          Length = 391

 Score =  167 bits (423), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 140/408 (34%), Positives = 200/408 (49%), Gaps = 48/408 (11%)

Query: 1   MVDVDRRMT---GLN--PAHIAGLRRLSARAAAPTT---ATSLPVRNGLVSFSSLADKVI 52
           MVDVDRR     GL   P+H AGLRRLS RA+APTT   + + P  +   + +     ++
Sbjct: 1   MVDVDRRPPLPHGLPRPPSHAAGLRRLSTRASAPTTPRASPATPSPSSAAAAAPSPSALV 60

Query: 53  THLKNSGIRVQPGLSDAEFARAEAEFG-FVFPPDLRAILSAGLPLGPGFPDWRATGSARL 111
            HL  +G+ V PGLSD E A AEA  G    PPDLR +L+ G+P G GFPD+R+    RL
Sbjct: 61  AHLAAAGVSVLPGLSDPELAHAEAALGGLQLPPDLRDLLAIGVPSGDGFPDYRSPAGLRL 120

Query: 112 HLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYI 171
              A+ ++P A       R A     W                     P L+P++   Y+
Sbjct: 121 LRFAAEEVPAAVAAALPGRRAGGRGRW----------------CSSSPPPLVPLYGRHYV 164

Query: 172 PCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSS 231
           P  P LAGNP+F+V +  +   G +++ F  R          P        +       +
Sbjct: 165 PAVPCLAGNPVFHVSDTGVAVAGANIAAFLLRAFAAEPPRGAPLRRQLSAPMPPPAPSPA 224

Query: 232 SNFSRRSLDAGLGSRTPRWVEFWSDAAI--DRRRRNSSSSSSSSPERFFDMPRSEIPKWV 289
            + +RRSLD+  G + PRW+EFW+DAA   DR    S+ +S+ +         +  P+WV
Sbjct: 225 PSTARRSLDSATG-KAPRWIEFWTDAAAAGDRFVEVSTCTSTRA---------NAAPQWV 274

Query: 290 DDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESL 349
             Y+   GSVLR GGW   ++ E+ T S  G  E              L L  DR    L
Sbjct: 275 RSYLEWAGSVLRRGGWGGGEVEEMTTGSGGGGEEAV-----------ALALTVDRCCGEL 323

Query: 350 RKSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
            ++GW +EEV +A G    P   ++PA  L P++  R+G+LAE+VSR+
Sbjct: 324 GRAGWGAEEVVEALGALLGPRTRKRPAVALPPDVAARVGRLAEAVSRA 371


>gi|125586548|gb|EAZ27212.1| hypothetical protein OsJ_11150 [Oryza sativa Japonica Group]
          Length = 391

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 139/408 (34%), Positives = 199/408 (48%), Gaps = 48/408 (11%)

Query: 1   MVDVDRRMT---GLN--PAHIAGLRRLSARAAAPTT---ATSLPVRNGLVSFSSLADKVI 52
           MVDVDRR     GL   P+H AGLRRLS RA+APTT   + + P  +   + +     ++
Sbjct: 1   MVDVDRRPPLPHGLPRPPSHAAGLRRLSTRASAPTTPRASPATPSPSSAAAAAPSPSALV 60

Query: 53  THLKNSGIRVQPGLSDAEFARAEAEFG-FVFPPDLRAILSAGLPLGPGFPDWRATGSARL 111
            HL  +G+ V PGLSD E A AEA  G    PPDLR +L+ G+P G GFPD+R+    RL
Sbjct: 61  AHLAAAGVSVLPGLSDPELAHAEAALGGLQLPPDLRDLLAIGVPSGDGFPDYRSPAGLRL 120

Query: 112 HLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYI 171
              A+ ++P A       R A     W                     P L+P++   Y+
Sbjct: 121 LRFAAEEVPAAVAAALPGRRAGGRGRW----------------CSSSPPPLVPLYGRHYV 164

Query: 172 PCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSS 231
           P  P LAGNP+F+V +  +   G +++ F  R          P        +       +
Sbjct: 165 PAVPCLAGNPVFHVSDTGVAVAGANIAAFLLRAFAAEPPRGAPLRRQLSAPMPPPAPSPA 224

Query: 232 SNFSRRSLDAGLGSRTPRWVEFWSDAAI--DRRRRNSSSSSSSSPERFFDMPRSEIPKWV 289
            + +RRSLD+  G + PRW+EFW+DAA   DR    S+ +S+ +         +  P+WV
Sbjct: 225 PSTARRSLDSATG-KAPRWIEFWTDAAAAGDRFVEVSTCTSTRA---------NAAPQWV 274

Query: 290 DDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESL 349
             Y+   GSVLR GGW   ++ E+ T S  G  E              L L  DR    L
Sbjct: 275 RSYLEWAGSVLRRGGWGGGEVEEMTTGSGGGGEEAV-----------ALALTVDRCCGEL 323

Query: 350 RKSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
            ++GW +EEV +A G    P   ++PA  L P++  R+G+LAE+V R+
Sbjct: 324 GRAGWGAEEVVEALGALLGPRTRKRPAVALPPDVAARVGRLAEAVLRA 371


>gi|224095001|ref|XP_002310322.1| predicted protein [Populus trichocarpa]
 gi|222853225|gb|EEE90772.1| predicted protein [Populus trichocarpa]
          Length = 348

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 99/294 (33%), Positives = 150/294 (51%), Gaps = 44/294 (14%)

Query: 43  SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPD 102
           SF++ A  ++ HLK+  I + PGL+D+EF   E+ F F FPPDLR+IL  GLP+GP FP+
Sbjct: 14  SFAAYAKTLLDHLKSLNIPILPGLTDSEFTSIESTFHFTFPPDLRSILQEGLPIGPHFPN 73

Query: 103 WRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLL 162
           WR++   +L  +  L+LP   +   I+ N  W  SWG RP D  KAL  A+  L +AP+L
Sbjct: 74  WRSSSLQQL--QILLNLPSLNLCKNISLNNFWVDSWGHRPQDTNKALDFAKQFLDKAPVL 131

Query: 163 IPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRS 222
           +PI+ +CYIP +P ++GNP+F+VD+ ++     D++ FF++           QV    R 
Sbjct: 132 VPIYRNCYIPSSPNVSGNPVFHVDDEQVCVLSFDVTRFFQQVDFL-------QVGFPIR- 183

Query: 223 VSERTAGSSSNFSRRSLDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMPR 282
            S R    S N           +   R +EFW++ A +R RR               + R
Sbjct: 184 -SSRNENVSMNV------PAWAATEARKIEFWTEVA-ERGRRV--------------VAR 221

Query: 283 SEIPKWVDDYMG------------QIGSVLREGGWSEPDIVEIVTVSASGFFEG 324
              P+W  D  G            ++   LR+GGW E ++ E++     G  E 
Sbjct: 222 GNTPRWWKDVGGGSDHFELRECLEEVFWRLRDGGWREEEVREMMNGCDQGIREN 275


>gi|449499893|ref|XP_004160946.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101229907 [Cucumis sativus]
          Length = 259

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 78/177 (44%), Positives = 112/177 (63%), Gaps = 5/177 (2%)

Query: 28  APTTATSLPVRNGLV--SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPD 85
           A  T T  P R  L   SF++ A  VI HLK+  I V PGLSD EF   E+ F F FPPD
Sbjct: 2   ATATVTVNPPRPKLACFSFAAYAKTVIDHLKSLQIPVHPGLSDPEFTSVESTFRFSFPPD 61

Query: 86  LRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDP 145
           LR+IL  GLP+G GFP+WR++ + +LH+   ++LP   +  +I++   W +SWG +P D 
Sbjct: 62  LRSILQEGLPIGSGFPNWRSSSTQQLHIL--INLPNCLLK-EISQRKFWCQSWGAQPDDT 118

Query: 146 EKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFE 202
             A+ +A+  L RAP+L+PI+ + YIP  P +AGNP+F++D+  I     DL+ FF+
Sbjct: 119 NDAVALAKQFLDRAPVLVPIYKNWYIPSAPNMAGNPVFHLDDGEIRVSSFDLAGFFQ 175


>gi|302819108|ref|XP_002991225.1| hypothetical protein SELMODRAFT_429564 [Selaginella moellendorffii]
 gi|300140936|gb|EFJ07653.1| hypothetical protein SELMODRAFT_429564 [Selaginella moellendorffii]
          Length = 524

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 119/208 (57%), Gaps = 18/208 (8%)

Query: 17  AGLRRLSARA-----------AAPTTATSL----PVRNGLVSFSSLADKVITHLKNSGIR 61
           A L+RLS RA           A PT + SL      R   +SFS L   +I  L++ G+R
Sbjct: 32  AALKRLSERANNTSCSSTSLSAPPTFSGSLVDPSSPRKISLSFSELGQAIIARLQSYGVR 91

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRAT-GSARLHLRASLDLP 120
           + PGLSD E   A+A F   FPPDLR IL  GLP+GPGFPDWR+  GS +L L   +  P
Sbjct: 92  IDPGLSDEEIELAQAAFQITFPPDLRGILQQGLPIGPGFPDWRSQLGSHKLKLW--IAAP 149

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
              +         W + WG RP +PE A R +R AL+RAP+++P+F+ CY+P  P  AGN
Sbjct: 150 KTGLCSAAELGLFWWRGWGARPPEPEGAARASRAALRRAPVMVPVFSQCYVPSQPVRAGN 209

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFR 208
           P+ YVD +  F C  +LS+FF +E   R
Sbjct: 210 PVMYVDRSEAFYCAHELSEFFGKECFKR 237



 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 76/167 (45%), Gaps = 23/167 (13%)

Query: 249 RWVEFWSDAAIDRRRRNSSSSSSSSPERFF---------DMPRSEIPKW--VDDYMGQIG 297
           R +EFWSD        N+++  S+S +  F         D   ++   W  V + +G++ 
Sbjct: 361 RRIEFWSDLVEKGPSMNAAAGRSASHDDGFVQVGGETDLDWENAKSNPWQLVKESVGEMA 420

Query: 298 SVLREGGWSEPDIVEIVTVSASGFFEGEMVM---LDNQSVLDGLLLKADRLSESLRKSGW 354
             LR  GW+E D+ EI+          +       D +SV++GL    D LS  L  +GW
Sbjct: 421 IKLRRAGWNERDVAEIMDSCEQLDHRCDDPRPNPADRESVIEGLAAHLDFLSSCLHDAGW 480

Query: 355 SSEEV-----SDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSR 396
           S  ++     SD  GF   P +      ++SPE+  RIG LA  V++
Sbjct: 481 SIHDISETLYSDLTGFSEHPGRR----ARISPEVASRIGLLAAFVAQ 523


>gi|255582413|ref|XP_002531995.1| conserved hypothetical protein [Ricinus communis]
 gi|223528354|gb|EEF30394.1| conserved hypothetical protein [Ricinus communis]
          Length = 374

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 173/348 (49%), Gaps = 27/348 (7%)

Query: 43  SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPD 102
           SF++ A  +I +L++  I + PGL+D EF   E+ F F FPPDLR+IL  GLP+GP FP+
Sbjct: 17  SFAAYAKTLIDNLRSLNIPILPGLTDQEFTSIESNFHFSFPPDLRSILQEGLPVGPLFPN 76

Query: 103 WRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLL 162
           WR++   +L L  +L  P   ++  I  N  W  SWG RP D ++ L  ++    +AP+L
Sbjct: 77  WRSSSPQQLQLLLNL--PFLNLSKNITHNNFWVDSWGYRPDDSQQMLATSKRLFTKAPVL 134

Query: 163 IPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRS 222
           +PI+ +CYIP +P  AGNP+FYVD+  +     DL+ FF+   LF++          +R+
Sbjct: 135 VPIYGNCYIPSSPNTAGNPVFYVDDCSVRVLNFDLARFFQEAELFKTGTGRVNNFNWRRN 194

Query: 223 VSERTAGSSSNFSRRSLDA-GLGSRTPRWVEFWSDAAIDRRR---RNSSSSSSSSPERFF 278
                     ++++ S++A    +   R VEFW++ A   R+   R  +    +  E   
Sbjct: 195 YHH-------SYNKVSINAPAWAATKARKVEFWTEVAERGRKVVARADTLGWWNGGEYLE 247

Query: 279 DMPRSEIPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQS--VLD 336
           D         + D +  +   LR+GGW E D+ E++ +             D     V +
Sbjct: 248 DSS-------LGDCLEDVFWKLRDGGWREEDVREMMMIDGCDEKRENGCGGDGDGGRVQN 300

Query: 337 G---LLLKADRLSESLRKSGWSSEEVSDAFGFDFRPEKERKPAKKLSP 381
           G   ++     LS  L ++GWS E+V   +  D +  ++ K      P
Sbjct: 301 GKLDVVWHVRVLSVVLLRAGWSREDV--VYSLDLQDLEDSKCCLDFPP 346


>gi|302818993|ref|XP_002991168.1| hypothetical protein SELMODRAFT_429511 [Selaginella moellendorffii]
 gi|300140996|gb|EFJ07712.1| hypothetical protein SELMODRAFT_429511 [Selaginella moellendorffii]
          Length = 524

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 119/208 (57%), Gaps = 18/208 (8%)

Query: 17  AGLRRLSARAAAP-TTATSL----PVRNGLV----------SFSSLADKVITHLKNSGIR 61
           A L+RLS RA     ++TSL    P    LV          SFS L   +I  L++ G+R
Sbjct: 32  AALKRLSERANNTLCSSTSLSAPPPFSGSLVDPSSPRKISFSFSELGQAIIARLRSYGVR 91

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRAT-GSARLHLRASLDLP 120
           + PGLSD E   A+A F   FPPDLR IL  GLP+GPGFPDWR+  GS +L L   +  P
Sbjct: 92  IDPGLSDEEIELAQAAFQITFPPDLRGILQQGLPIGPGFPDWRSQLGSHKLKLW--IAAP 149

Query: 121 IAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGN 180
              +         W + WG RP +PE A R +R AL+RAP+++P+F+ CY+P  P  AGN
Sbjct: 150 KTGLCSAAELGLFWWRGWGARPPEPEGAARASRAALRRAPVMVPVFSQCYVPSQPVRAGN 209

Query: 181 PIFYVDENRIFCCGLDLSDFFERESLFR 208
           P+ YVD +  F C  +LS+FF +E   R
Sbjct: 210 PVMYVDRSEAFYCAHELSEFFGKECFKR 237



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 76/167 (45%), Gaps = 23/167 (13%)

Query: 249 RWVEFWSDAAIDRRRRNSSSSSSSSP---------ERFFDMPRSEIPKW--VDDYMGQIG 297
           R +EFWSD        N+++  S+S          E   D   ++   W  V + +G++ 
Sbjct: 361 RRIEFWSDLVEKGPSMNAAAGRSASHDDGLVQVGGETDLDWENAKSNPWQLVKESVGEMA 420

Query: 298 SVLREGGWSEPDIVEIVTVSASGFFEGEMVM---LDNQSVLDGLLLKADRLSESLRKSGW 354
             LR  GW+E D+ EI+          +       D +SV++GL    D LS  LR +GW
Sbjct: 421 IKLRRAGWNERDVAEIMDSCEQLDHRCDDPRPNPADRESVIEGLAAHLDFLSSCLRDAGW 480

Query: 355 SSEEV-----SDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSR 396
           S  ++     SD  GF   P +      ++SPE+  RIG LA  V++
Sbjct: 481 SIHDISETLYSDLTGFSEHPGRR----ARISPEVASRIGLLAAFVAQ 523


>gi|357482459|ref|XP_003611516.1| hypothetical protein MTR_5g014810 [Medicago truncatula]
 gi|355512851|gb|AES94474.1| hypothetical protein MTR_5g014810 [Medicago truncatula]
          Length = 314

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 91/272 (33%), Positives = 129/272 (47%), Gaps = 57/272 (20%)

Query: 43  SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPD 102
           SF++ A+ +I +LK+S I ++ GL+++EF   E++F   FPPDL AIL  GLP+ PGFP+
Sbjct: 32  SFTTYANDLIQNLKSSNIIIEQGLTESEFQHLESKFNLKFPPDLHAILQQGLPVSPGFPN 91

Query: 103 WRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLL 162
           WR+  S+   L+  L++P+++I  ++  N+ W  SWGP P D    L  A+  L  AP L
Sbjct: 92  WRS--SSHQQLQILLNIPVSSILRRVKNNSFWHPSWGPIPKD---KLTAAQRILDPAPQL 146

Query: 163 IPIFNHCYIPCNPTLAGNPIFYVDEN-RIFCCGLDLSDFFERESLFRSTESDPQVLIKQR 221
           +PIF HCYIP NP + GNP+FYVD +  +   G D+  FF         E          
Sbjct: 147 VPIFRHCYIPMNPFVTGNPVFYVDHSGDVRLVGYDIVGFFRDGGFLDGVEE--------- 197

Query: 222 SVSERTAGSSSNFSRRSLDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDMP 281
                             D    +R  R +E W++ A  R  R                 
Sbjct: 198 ----------------VDDPVWAAREARRIEVWTEVADGRGERGWKW------------- 228

Query: 282 RSEIPKWVDDYMGQIGSV-------LREGGWS 306
                 W DD  G +G         LREGGW 
Sbjct: 229 ------WWDDRRGVVGRCMDGVLRRLREGGWK 254


>gi|356565606|ref|XP_003551030.1| PREDICTED: uncharacterized protein LOC100817113 [Glycine max]
          Length = 215

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 94/191 (49%), Positives = 113/191 (59%), Gaps = 47/191 (24%)

Query: 208 RSTESDPQVLIKQRSVSERTAGSSSNFSRRSLDAGLGSRTPRWVEFWSDAAIDRRRRNSS 267
           RS +SDP +L KQRSVS         FSR SLDA                A+D RRR+  
Sbjct: 71  RSADSDP-ILRKQRSVSA-ARFPVPGFSRHSLDA----------------AVDLRRRS-- 110

Query: 268 SSSSSSPERFFDMPRSEIPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMV 327
                              KWV+ Y+G +GSVLREGGWSE DI E+V+ S S       V
Sbjct: 111 -------------------KWVEGYVGMMGSVLREGGWSESDISEMVSGSGS-------V 144

Query: 328 MLDNQSVLDGLLLKADRLSESLRKSGWSSEEVSDAFGFDFRPEKE-RKPAKKLSPELVER 386
           +LDN+ VLD L+ KA+R S+SLRKSGWSSEEV DA G D RPEK+  +P KK++ +LV R
Sbjct: 145 LLDNEEVLDALVSKAERFSDSLRKSGWSSEEVKDALGLDLRPEKKGGRPPKKVATQLVGR 204

Query: 387 IGKLAESVSRS 397
           IGKL ESVSRS
Sbjct: 205 IGKLVESVSRS 215



 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 61/137 (44%), Positives = 82/137 (59%), Gaps = 5/137 (3%)

Query: 1   MVDVDRRMTGLNPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHLKNSGI 60
           MVDVDRR TG NPAH AGLRRLSARAA+ +T T +  RN L SFS+L++KVITHL +SGI
Sbjct: 1   MVDVDRRPTGPNPAHAAGLRRLSARAASVSTPT-VRNRNSLHSFSALSEKVITHLSSSGI 59

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLP 120
           ++ P    AEFAR+ A+   +     R++ +A  P+ PGF       +  L  R+     
Sbjct: 60  QLHPS-HGAEFARS-ADSDPILRKQ-RSVSAARFPV-PGFSRHSLDAAVDLRRRSKWVEG 115

Query: 121 IAAITFQIARNALWSKS 137
              +   + R   WS+S
Sbjct: 116 YVGMMGSVLREGGWSES 132


>gi|414877283|tpg|DAA54414.1| TPA: putative translation elongation factor family protein [Zea
           mays]
          Length = 408

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/122 (54%), Positives = 84/122 (68%), Gaps = 4/122 (3%)

Query: 38  RNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLG 97
           R+GL S + LA  V+ HL+ SG+ V PGL++ E A AEAE GF FP DLRA+L+ GLPLG
Sbjct: 96  RHGLHSSAPLAAAVLGHLRASGVAVLPGLTELELAGAEAEMGFAFPLDLRAVLATGLPLG 155

Query: 98  PGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALK 157
           P FPD R    +R+ LR++ DLPIAA   QI R  LW + WG RP+D +  LR AR+A+ 
Sbjct: 156 PKFPDTR----SRVGLRSAFDLPIAAAALQITRGTLWLRCWGARPADLDCVLRPARSAIC 211

Query: 158 RA 159
           RA
Sbjct: 212 RA 213


>gi|404447071|ref|ZP_11012157.1| hypothetical protein MVAC_27349 [Mycobacterium vaccae ATCC 25954]
 gi|403649438|gb|EJZ04815.1| hypothetical protein MVAC_27349 [Mycobacterium vaccae ATCC 25954]
          Length = 187

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 60/145 (41%), Positives = 81/145 (55%), Gaps = 2/145 (1%)

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + +  GLSD E A  E  FGF F  D RA L+AGLP+GPG+PDWR+ G  R  L   L L
Sbjct: 22  VTIDTGLSDDELAGVETGFGFEFADDHRAFLAAGLPVGPGWPDWRSEG--RRSLTKRLQL 79

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           P+  + F +     W+  WG RP+    ALR AR  L R P L+P+  H Y+P     +G
Sbjct: 80  PVEGVLFAVEWGQFWAPGWGQRPARMRDALRTARYQLARVPQLVPVCAHRYLPAGRGTSG 139

Query: 180 NPIFYVDENRIFCCGLDLSDFFERE 204
           +P+  V    +  CG DL+D+ + E
Sbjct: 140 HPVLSVVRTDVHTCGADLADYVDEE 164


>gi|414877284|tpg|DAA54415.1| TPA: putative translation elongation factor family protein [Zea
           mays]
          Length = 368

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/122 (54%), Positives = 84/122 (68%), Gaps = 4/122 (3%)

Query: 38  RNGLVSFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLG 97
           R+GL S + LA  V+ HL+ SG+ V PGL++ E A AEAE GF FP DLRA+L+ GLPLG
Sbjct: 96  RHGLHSSAPLAAAVLGHLRASGVAVLPGLTELELAGAEAEMGFAFPLDLRAVLATGLPLG 155

Query: 98  PGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALK 157
           P FPD R    +R+ LR++ DLPIAA   QI R  LW + WG RP+D +  LR AR+A+ 
Sbjct: 156 PKFPDTR----SRVGLRSAFDLPIAAAALQITRGTLWLRCWGARPADLDCVLRPARSAIC 211

Query: 158 RA 159
           RA
Sbjct: 212 RA 213


>gi|164514901|emb|CAP47646.1| putative integron gene cassette protein [uncultured bacterium]
          Length = 195

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 64/145 (44%), Positives = 80/145 (55%), Gaps = 2/145 (1%)

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           I    GL+DAE A  E+ F F FP DLR+ L AG+P G GFPDWR  G+    L+  LD 
Sbjct: 31  INFDAGLNDAEIASIESRFDFQFPADLRSFLQAGMPQGEGFPDWR--GADETVLQEWLDT 88

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           P   I F +  N  W   WG RP   ++A++ A   +K AP LIPIF H  +P  P  AG
Sbjct: 89  PRKGIVFDVEHNGFWLPEWGSRPPSLDQAMQHASQMVKAAPTLIPIFEHRMLPVEPQSAG 148

Query: 180 NPIFYVDENRIFCCGLDLSDFFERE 204
           NP+F V +  I   G DL  +  RE
Sbjct: 149 NPVFSVQQTDIIYYGFDLVSYLCRE 173


>gi|312200342|ref|YP_004020403.1| hypothetical protein FraEuI1c_6557 [Frankia sp. EuI1c]
 gi|311231678|gb|ADP84533.1| hypothetical protein FraEuI1c_6557 [Frankia sp. EuI1c]
          Length = 190

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 92/159 (57%), Gaps = 3/159 (1%)

Query: 49  DKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGS 108
           ++ +  L     R++PGLS AEF   E+ +G  F P+ R +L+AGLP+G  +P+WR    
Sbjct: 13  EEAVALLARLPTRLEPGLSPAEFDAVESRYGLQFAPEHRTLLTAGLPVGQAWPNWRHGSE 72

Query: 109 ARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNH 168
           + L  R  L  P+    F + RN  W +SWGPRP +  +A+ VAR+ L++ P+++P++ H
Sbjct: 73  SSL--RDRLAWPVDGTLFDVERNGFWDESWGPRPKEMARAVDVARDCLRQVPVMVPVYGH 130

Query: 169 CYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLF 207
            Y+P    + G+P+  + +  I   G DL+D+   E  F
Sbjct: 131 RYLPSG-AVTGHPVLSMHQTDIIIYGADLADYLAAEFQF 168


>gi|297821503|ref|XP_002878634.1| hypothetical protein ARALYDRAFT_900728 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324473|gb|EFH54893.1| hypothetical protein ARALYDRAFT_900728 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 321

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 70/187 (37%), Positives = 103/187 (55%), Gaps = 9/187 (4%)

Query: 20  RRLSARAAAPTTATSLPVRNGLVSFSS--LADKVITHLK-NSGIRVQPGLSDAEFARAEA 76
            RL+   +     T+ P+R+  V   S      +I HLK  +GI V PGL++ E +  E+
Sbjct: 3   HRLAGIISPLGHITTDPIRSSSVKPPSPVYYKTIINHLKSQTGIHVSPGLTNQEISAVES 62

Query: 77  EFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAITFQIARNALWSK 136
             GF FP DLR+IL  GLP+G  FP WR TGS R HL   L      ++  + RN  W  
Sbjct: 63  SLGFSFPLDLRSILQTGLPVGTNFPIWR-TGSNRNHLLLPL----LNLSQIVVRNGFWVD 117

Query: 137 SWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIP-CNPTLAGNPIFYVDENRIFCCGL 195
           SWG RP +  +AL + +  ++ AP+L+P+F   Y+P   P LAGNP+F +D + +     
Sbjct: 118 SWGIRPGNDSEALLLVKKLIEIAPVLVPVFGDFYVPSTTPNLAGNPVFQIDGDGVRELSC 177

Query: 196 DLSDFFE 202
           D++ F +
Sbjct: 178 DVAGFLK 184


>gi|386851820|ref|YP_006269833.1| hypothetical protein ACPL_6883 [Actinoplanes sp. SE50/110]
 gi|359839324|gb|AEV87765.1| hypothetical protein ACPL_6883 [Actinoplanes sp. SE50/110]
          Length = 173

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/143 (41%), Positives = 83/143 (58%), Gaps = 7/143 (4%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPI 121
           + PGL+D EF   EAEFGF F PD RA L+AGLP G G+PDWR  G+ R  LR++L  P+
Sbjct: 15  IAPGLTDREFTDLEAEFGFTFAPDHRAFLAAGLPTGIGWPDWR--GADRTALRSALIAPV 72

Query: 122 AAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNP 181
             + F +A N  W + WG    D    +  AR  L  AP +IP++ H Y+P    + G+P
Sbjct: 73  EGVLFDVAENDFWYEGWGAPFGD---RVATARAGLMIAPRMIPLYAHRYLPA--AIPGHP 127

Query: 182 IFYVDENRIFCCGLDLSDFFERE 204
           +  + +  + C G D+ D+  RE
Sbjct: 128 VLSIYQTDVLCYGADIGDWLHRE 150


>gi|168702569|ref|ZP_02734846.1| hypothetical protein GobsU_23777 [Gemmata obscuriglobus UQM 2246]
          Length = 192

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 83/153 (54%), Gaps = 5/153 (3%)

Query: 55  LKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLR 114
            +N G+ + PGL+ AE ARAE   G  FPPDLR+ L  GLP G G+PDWR   S  +  R
Sbjct: 10  FRNWGVELAPGLTTAELARAEERVGCRFPPDLRSFLQTGLPTGRGWPDWRNPESEYIAYR 69

Query: 115 ASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCN 174
             L  P   + F I RN  W   WGPRP    +A  V    L+  P LIP+F H ++P  
Sbjct: 70  --LAWPEQQMLFDIERNKWWPPVWGPRPPALAEAFAVGCERLRSEPKLIPVFGHRFLPAE 127

Query: 175 PTLAGNPI--FYVDENRIFCCGLDLSDFFERES 205
           P  AGNP+   Y   + ++  G DL  +F RE+
Sbjct: 128 PDEAGNPVISMYQMVDSVY-YGRDLHTYFAREN 159


>gi|418048172|ref|ZP_12686260.1| hypothetical protein MycrhDRAFT_1782 [Mycobacterium rhodesiae JS60]
 gi|353193842|gb|EHB59346.1| hypothetical protein MycrhDRAFT_1782 [Mycobacterium rhodesiae JS60]
          Length = 173

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/158 (39%), Positives = 85/158 (53%), Gaps = 4/158 (2%)

Query: 57  NSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRAS 116
           +  + + PGL+D E  R E EFGF F  D RA L+ GLPLG  +PDWR+  + R  L+  
Sbjct: 4   DGTVTIAPGLTDEELTRIEGEFGFEFADDHRAFLACGLPLGESWPDWRS--APRRSLQQR 61

Query: 117 LDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPT 176
           L LP   I F +     W+  WG RP+  + ALR A   L R P L+P++ + Y+P    
Sbjct: 62  LKLPTDGILFAVEWREFWAAGWGVRPARTKDALRSANYHLARVPQLVPVYANRYLPAGRD 121

Query: 177 LAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDP 214
             G+P+  V +  +  CG DL  F E+E  F   E DP
Sbjct: 122 TYGHPVLSVYQADVSSCGADLFAFIEQE--FGLAEPDP 157


>gi|310640896|ref|YP_003945654.1| hypothetical protein [Paenibacillus polymyxa SC2]
 gi|386039998|ref|YP_005958952.1| hypothetical protein PPM_1308 [Paenibacillus polymyxa M1]
 gi|309245846|gb|ADO55413.1| hypothetical protein PPSC2_c1434 [Paenibacillus polymyxa SC2]
 gi|343096036|emb|CCC84245.1| hypothetical protein PPM_1308 [Paenibacillus polymyxa M1]
          Length = 185

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 54/155 (34%), Positives = 80/155 (51%), Gaps = 1/155 (0%)

Query: 51  VITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSAR 110
           +I  L+   + + PGLS  E    E  +   FPPDLR +L   LP+G  F  WR T   R
Sbjct: 6   IIDLLRKDDVTLTPGLSTQEITEVEDRYDIHFPPDLRELLMNVLPIGKSFIPWRDTSPQR 65

Query: 111 LH-LRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHC 169
           +  +   L+ P+  + F + +N  W   WG RP+D ++A+   +      P LIPI+ H 
Sbjct: 66  MGVIWERLNWPLEGMIFDVEQNIFWPSEWGERPTDLQEAIDTCKREFLHVPKLIPIYGHR 125

Query: 170 YIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           YIP  P   GNP+F V +  I   G  L ++F++E
Sbjct: 126 YIPEQPCEEGNPVFSVYQTDIIVYGESLQEYFKQE 160


>gi|392415652|ref|YP_006452257.1| hypothetical protein Mycch_1787 [Mycobacterium chubuense NBB4]
 gi|390615428|gb|AFM16578.1| hypothetical protein Mycch_1787 [Mycobacterium chubuense NBB4]
          Length = 190

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 57/140 (40%), Positives = 80/140 (57%), Gaps = 2/140 (1%)

Query: 65  GLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAI 124
           GL+D EF R E EFGF F  D RA L+AGLP+G  +P+WR    AR  L   L LP+  I
Sbjct: 23  GLTDDEFVRIEHEFGFEFADDHRAFLAAGLPVGGTWPNWR--DDARRTLATRLRLPVDGI 80

Query: 125 TFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFY 184
            F +     W  +WGPRP+  + ALR A   L R P ++P+++H Y+P      G+P+  
Sbjct: 81  LFAVEWRQFWHDAWGPRPARMKDALRSASYQLARVPQMVPVYSHHYLPAGRGACGHPVLS 140

Query: 185 VDENRIFCCGLDLSDFFERE 204
           + +  I   G DL D+ +R+
Sbjct: 141 IYQADIEPAGADLLDYVDRQ 160


>gi|145224871|ref|YP_001135549.1| hypothetical protein Mflv_4292 [Mycobacterium gilvum PYR-GCK]
 gi|145217357|gb|ABP46761.1| hypothetical protein Mflv_4292 [Mycobacterium gilvum PYR-GCK]
          Length = 191

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 79/145 (54%), Gaps = 2/145 (1%)

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + ++ G+SD E  RAE + G  F  D RA L+AGLP G  +P+WR  G  R  L   L L
Sbjct: 22  VTIERGMSDDELDRAETDLGIEFADDHRAFLAAGLPTGGSWPNWRDEG--RRSLTKRLHL 79

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           P+  I F +  N  W  SWG RP+  + ALR AR  L+R P+L+PI +H Y+P      G
Sbjct: 80  PVEGILFAVEWNRFWHDSWGRRPAQMKHALRTARYQLERVPMLLPICSHHYLPAGRGSFG 139

Query: 180 NPIFYVDENRIFCCGLDLSDFFERE 204
           +P+  V    +   G DL D+   E
Sbjct: 140 HPVLSVVRTDVVVRGADLVDYVTAE 164


>gi|315445199|ref|YP_004078078.1| hypothetical protein Mspyr1_36350 [Mycobacterium gilvum Spyr1]
 gi|315263502|gb|ADU00244.1| hypothetical protein Mspyr1_36350 [Mycobacterium gilvum Spyr1]
          Length = 191

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/145 (40%), Positives = 79/145 (54%), Gaps = 2/145 (1%)

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + ++ G+SD E  RAEA+ G  F  D RA L+AGLP G  +P+WR  G  R  L   L L
Sbjct: 22  VTIERGMSDDELDRAEADLGIEFADDHRAFLAAGLPTGGSWPNWRDEG--RRSLTKRLHL 79

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           P   I F +  N  W  SWG RP+  + ALR AR  L+R P+L+PI +H Y+P      G
Sbjct: 80  PAEGILFAVEWNRFWHDSWGRRPAQMKHALRTARYQLERVPMLLPICSHHYLPAGRGSFG 139

Query: 180 NPIFYVDENRIFCCGLDLSDFFERE 204
           +P+  V    +   G DL D+   E
Sbjct: 140 HPVLSVVRTDVVVRGADLVDYVTAE 164


>gi|294460688|gb|ADE75918.1| unknown [Picea sitchensis]
          Length = 110

 Score =  113 bits (282), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 55/113 (48%), Positives = 85/113 (75%), Gaps = 3/113 (2%)

Query: 285 IPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADR 344
           +P+W+ DY+ ++ ++LR GGW E DI +++  S+S   + E + LD+Q++L+GLLLKAD 
Sbjct: 1   MPRWLMDYLEELATLLRNGGWKEQDINDMIQASSSPSSDTEDIYLDSQTILEGLLLKADL 60

Query: 345 LSESLRKSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           +S SLRK+GWSS+++++ F  ++ P K  KP KK+SPEL ERIGKLAE V ++
Sbjct: 61  MSNSLRKAGWSSQDIAEIFDINY-PTK--KPTKKISPELAERIGKLAEYVVQA 110


>gi|126434390|ref|YP_001070081.1| hypothetical protein Mjls_1802 [Mycobacterium sp. JLS]
 gi|126234190|gb|ABN97590.1| hypothetical protein Mjls_1802 [Mycobacterium sp. JLS]
          Length = 206

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 56/144 (38%), Positives = 79/144 (54%), Gaps = 2/144 (1%)

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + + PGLSD E AR E+ F F F  D R  L+A LP+G G+P+WR  G  R  L   L L
Sbjct: 43  VPMAPGLSDDEVARIESSFAFTFADDHREFLAACLPVGEGWPNWREAG--RRTLETLLRL 100

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           P+  I F +     W  SWG RP+  + ALR A   L R P L+P+ ++CY+P     +G
Sbjct: 101 PVDGILFAVEWKQFWDASWGRRPARMKDALRSAAYQLARVPRLVPVHSNCYLPAGHDSSG 160

Query: 180 NPIFYVDENRIFCCGLDLSDFFER 203
           +P+  + +  I     DL D+ +R
Sbjct: 161 HPVLSIYQADIHVVAADLFDYVDR 184


>gi|116830479|gb|ABK28197.1| unknown [Arabidopsis thaliana]
          Length = 326

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 89/154 (57%), Gaps = 7/154 (4%)

Query: 51  VITHLK-NSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSA 109
           ++ H K  +G  V PGL++ E +  E+  GF FP DLR+IL  GLP+G  FP+WR TGS 
Sbjct: 36  IVNHFKSQTGNHVSPGLTNQEISAVESSHGFSFPLDLRSILQTGLPVGTNFPNWR-TGSN 94

Query: 110 RLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHC 169
           R +L   L      ++  + RN  W  SWG RP +  +AL + +  ++ AP+L+P++   
Sbjct: 95  RNNLLLPL----LNLSQHVVRNGFWVDSWGIRPGNDAEALSLVKKLIEIAPVLVPVYGDF 150

Query: 170 YIP-CNPTLAGNPIFYVDENRIFCCGLDLSDFFE 202
           Y+P   P LAGNP+F +D + +     D+  F +
Sbjct: 151 YVPSTTPNLAGNPVFQIDGDGVRELSCDVVGFLK 184


>gi|404212792|ref|YP_006666967.1| hypothetical protein KTR9_0163 [Gordonia sp. KTR9]
 gi|403643591|gb|AFR46831.1| hypothetical protein KTR9_0163 [Gordonia sp. KTR9]
          Length = 196

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 83/163 (50%), Gaps = 4/163 (2%)

Query: 44  FSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDW 103
            S + +K    L+ +G+ + PGL+DAEF     + GF F PD  ++L+  LP+G  +PDW
Sbjct: 4   ISDVGEKAAVILRTAGVELAPGLTDAEFDAVTEQHGFRFNPDHHSLLATALPVGDRWPDW 63

Query: 104 RATGSARLHLRASLDLPIAAITFQIARN--ALWSKSWGPRPSDPEKALRVARNALKRAPL 161
           R       +LR  LD P   + F   R     W  SWG RP+       V R  L + P 
Sbjct: 64  R--NGDETYLRQMLDWPARGMVFDALRQDPPFWGASWGHRPAAESDVEAVTRRELAKWPR 121

Query: 162 LIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           LIPIF H   P  P+ +G+P+F V +  +   G DL ++ E E
Sbjct: 122 LIPIFGHRMTPAAPSPSGSPVFSVWQTDVIFYGADLLEYLENE 164


>gi|15227753|ref|NP_179864.1| uncharacterized protein [Arabidopsis thaliana]
 gi|3738093|gb|AAC63590.1| hypothetical protein [Arabidopsis thaliana]
 gi|91805461|gb|ABE65459.1| unknown [Arabidopsis thaliana]
 gi|330252260|gb|AEC07354.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 325

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 89/154 (57%), Gaps = 7/154 (4%)

Query: 51  VITHLK-NSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSA 109
           ++ H K  +G  V PGL++ E +  E+  GF FP DLR+IL  GLP+G  FP+WR TGS 
Sbjct: 36  IVNHFKSQTGNHVSPGLTNQEISAVESSHGFSFPLDLRSILQTGLPVGTNFPNWR-TGSN 94

Query: 110 RLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHC 169
           R +L   L      ++  + RN  W  SWG RP +  +AL + +  ++ AP+L+P++   
Sbjct: 95  RNNLLLPL----LNLSQHVVRNGFWVDSWGIRPGNDAEALSLVKKLIEIAPVLVPVYGDF 150

Query: 170 YIP-CNPTLAGNPIFYVDENRIFCCGLDLSDFFE 202
           Y+P   P LAGNP+F +D + +     D+  F +
Sbjct: 151 YVPSTTPNLAGNPVFQIDGDGVRELSCDVVGFLK 184


>gi|408679908|ref|YP_006879735.1| hypothetical protein SVEN_4190 [Streptomyces venezuelae ATCC 10712]
 gi|328884237|emb|CCA57476.1| hypothetical protein SVEN_4190 [Streptomyces venezuelae ATCC 10712]
          Length = 199

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 10/149 (6%)

Query: 64  PGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLG--------PGFPDWRATGSARLHLRA 115
           PGLSDAEF R E EFGF F  D R  L+AGLP+G          +P+WR  G  R  LR 
Sbjct: 25  PGLSDAEFDRIEEEFGFAFSDDHRVFLAAGLPVGRPDPHGRPSPWPNWR--GPDRDDLRW 82

Query: 116 SLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNP 175
            L+ P+  + F +  N+ W + WGPRP+ P +A+  AR  L   P ++P++ H Y+P   
Sbjct: 83  RLERPVDGVLFDVEHNSFWRREWGPRPAAPAEAVTAARAELATVPQMVPVYAHRYLPAGR 142

Query: 176 TLAGNPIFYVDENRIFCCGLDLSDFFERE 204
              G+P+  + +  +   G DL D+ +RE
Sbjct: 143 GTHGHPVLSIQQTDVIYYGADLPDYVDRE 171


>gi|377559988|ref|ZP_09789517.1| hypothetical protein GOOTI_118_00170 [Gordonia otitidis NBRC
           100426]
 gi|377522856|dbj|GAB34682.1| hypothetical protein GOOTI_118_00170 [Gordonia otitidis NBRC
           100426]
          Length = 196

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 83/163 (50%), Gaps = 4/163 (2%)

Query: 44  FSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDW 103
            S + +K    L+ +G+ + PGL+DAEF     + GF F PD R++L+  LP G  +PDW
Sbjct: 4   ISDVGEKAAVILRTAGVELAPGLTDAEFDAVTEQHGFRFNPDHRSLLATALPAGDRWPDW 63

Query: 104 RATGSARLHLRASLDLPIAAITFQIARN--ALWSKSWGPRPSDPEKALRVARNALKRAPL 161
           R    A  HLR  LD P   + F   R     W  SWG RP+       V R  L + P 
Sbjct: 64  RNGDEA--HLRQMLDWPARGMVFDALRQDPPFWGASWGHRPATEGDVEAVTRRELAKWPQ 121

Query: 162 LIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           LIPI+ H   P  P+ +G+P+F V +  +   G +L ++   E
Sbjct: 122 LIPIYGHRMTPAAPSPSGSPVFSVWQTDVIFYGANLLEYLANE 164


>gi|254382951|ref|ZP_04998306.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194341851|gb|EDX22817.1| conserved hypothetical protein [Streptomyces sp. Mg1]
          Length = 330

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 56/151 (37%), Positives = 78/151 (51%), Gaps = 3/151 (1%)

Query: 61  RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGP-GFPDWRATGSARLHLRASLDL 119
            + PGL++ E    E  FGF F  D R  LSAGLP G  G+PDWR      L  R  L  
Sbjct: 16  EIGPGLTERELDAVETRFGFTFATDHRVFLSAGLPHGSSGWPDWRNGDPEDLAGR--LAW 73

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           P+  + F +  N  W  +W  RP++  +AL VAR  L+  P L+P++ H Y+P      G
Sbjct: 74  PVEGVLFDVEHNGFWHPAWPSRPAETSEALHVARTELESVPQLVPVYGHRYLPGTAGEQG 133

Query: 180 NPIFYVDENRIFCCGLDLSDFFERESLFRST 210
           +P+  V +  I   G DL+D+   E   RS+
Sbjct: 134 HPVLSVYQTDIILYGNDLADYIHHEFAGRSS 164


>gi|120403048|ref|YP_952877.1| hypothetical protein Mvan_2054 [Mycobacterium vanbaalenii PYR-1]
 gi|119955866|gb|ABM12871.1| hypothetical protein Mvan_2054 [Mycobacterium vanbaalenii PYR-1]
          Length = 187

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 59/145 (40%), Positives = 81/145 (55%), Gaps = 2/145 (1%)

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDL 119
           + ++ G+SD EFAR E +FGF F  D RA L+ GLP+G  +P+WR  G  R  L   L L
Sbjct: 23  VVIERGMSDDEFARVETDFGFEFADDHRAFLAVGLPVGASWPNWR--GEGRRSLAKRLQL 80

Query: 120 PIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAG 179
           P   + F +     W   WG RPS  + ALR AR  L R P LIP+++H Y+P      G
Sbjct: 81  PADGVLFAVEWGGFWGDGWGQRPSRMKDALRTARYQLARVPQLIPVYSHHYLPAGRGSFG 140

Query: 180 NPIFYVDENRIFCCGLDLSDFFERE 204
           +P+  V    + C G DL+D+ + E
Sbjct: 141 HPVLSVVRTDVTCRGTDLADYVDNE 165


>gi|300791004|ref|YP_003771295.1| hypothetical protein AMED_9204 [Amycolatopsis mediterranei U32]
 gi|384154547|ref|YP_005537363.1| hypothetical protein RAM_47205 [Amycolatopsis mediterranei S699]
 gi|399542883|ref|YP_006555544.1| hypothetical protein AMES_9068 [Amycolatopsis mediterranei S699]
 gi|299800518|gb|ADJ50893.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340532701|gb|AEK47906.1| hypothetical protein RAM_47205 [Amycolatopsis mediterranei S699]
 gi|398323653|gb|AFO82600.1| hypothetical protein AMES_9068 [Amycolatopsis mediterranei S699]
          Length = 191

 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 55/143 (38%), Positives = 73/143 (51%), Gaps = 2/143 (1%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPI 121
           V+PGLS+ E    E  FGF F  D R  L+AG+P+G  +PDWR     +L  R  L  P+
Sbjct: 26  VRPGLSETELDDVERRFGFRFAADHRTFLAAGVPIGDRWPDWRCGNPEQL--RKRLAWPV 83

Query: 122 AAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNP 181
             + + +  N  W   WG RP  PE A+R AR  L   P L+P+  H Y+P      G P
Sbjct: 84  DGVLYDVEHNGFWLPDWGTRPVGPEDAVREARRRLADVPQLVPVCGHRYLPGLADTVGYP 143

Query: 182 IFYVDENRIFCCGLDLSDFFERE 204
           +  V +  I   G DL D+  RE
Sbjct: 144 VLSVYQTDIIVYGSDLRDYLHRE 166


>gi|361069837|gb|AEW09230.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|361069839|gb|AEW09231.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|376340829|gb|AFB34901.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340831|gb|AFB34902.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340833|gb|AFB34903.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340835|gb|AFB34904.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340837|gb|AFB34905.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340839|gb|AFB34906.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340841|gb|AFB34907.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340843|gb|AFB34908.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340845|gb|AFB34909.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
 gi|376340849|gb|AFB34911.1| hypothetical protein UMN_801_01, partial [Pinus mugo]
 gi|376340851|gb|AFB34912.1| hypothetical protein UMN_801_01, partial [Pinus mugo]
 gi|376340853|gb|AFB34913.1| hypothetical protein UMN_801_01, partial [Pinus mugo]
 gi|376340855|gb|AFB34914.1| hypothetical protein UMN_801_01, partial [Pinus mugo]
 gi|376340857|gb|AFB34915.1| hypothetical protein UMN_801_01, partial [Pinus mugo]
 gi|376340859|gb|AFB34916.1| hypothetical protein UMN_801_01, partial [Pinus mugo]
          Length = 124

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/106 (45%), Positives = 70/106 (66%), Gaps = 2/106 (1%)

Query: 43  SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPD 102
           SF++ A  VI HL+   + +  GL+D EF R EA FGF FPPDLR+IL  GLP+G GFP+
Sbjct: 21  SFAAYAKNVIDHLRRCQVPIARGLTDEEFGRIEATFGFAFPPDLRSILREGLPVGVGFPN 80

Query: 103 WRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKA 148
           WR+   A   LR  +DLP+A +++++++   W K WG +P   ++A
Sbjct: 81  WRS--GATQQLRMMMDLPMAGLSYEVSKGRFWWKDWGSQPYSSDEA 124


>gi|376340847|gb|AFB34910.1| hypothetical protein UMN_801_01, partial [Pinus cembra]
          Length = 124

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/106 (45%), Positives = 70/106 (66%), Gaps = 2/106 (1%)

Query: 43  SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPD 102
           SF++ A  VI HL+   + +  GL+D EF R EA FGF FPPDLR+IL  GLP+G GFP+
Sbjct: 21  SFAAYAKNVIDHLRRCQVPIARGLTDEEFVRIEATFGFAFPPDLRSILREGLPVGVGFPN 80

Query: 103 WRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKA 148
           WR+   A   LR  +DLP+A +++++++   W K WG +P   ++A
Sbjct: 81  WRS--GATQQLRMMMDLPMAGLSYEVSKGRFWWKDWGSQPYSSDEA 124


>gi|383176064|gb|AFG71514.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176065|gb|AFG71515.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176066|gb|AFG71516.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176067|gb|AFG71517.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176070|gb|AFG71520.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
          Length = 124

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/106 (45%), Positives = 70/106 (66%), Gaps = 2/106 (1%)

Query: 43  SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPD 102
           SF++ A  VI HL+   + +  GL+D EF R EA FGF FPPDLR+IL  GLP+G GFP+
Sbjct: 21  SFAAYAKNVIDHLRRCQVPIARGLTDEEFERIEATFGFAFPPDLRSILREGLPVGVGFPN 80

Query: 103 WRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKA 148
           WR+   A   LR  +DLP+A +++++++   W K WG +P   ++A
Sbjct: 81  WRS--GATQQLRMMMDLPMAGLSYEVSKGRFWWKDWGSQPYSSDEA 124


>gi|451341295|ref|ZP_21911749.1| hypothetical protein C791_1159 [Amycolatopsis azurea DSM 43854]
 gi|449415824|gb|EMD21662.1| hypothetical protein C791_1159 [Amycolatopsis azurea DSM 43854]
          Length = 189

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/143 (39%), Positives = 74/143 (51%), Gaps = 2/143 (1%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPI 121
           V+PGLS+AE    EA FGF F  D R  LSAG+P+G  +PDWR     +L  R  LD P+
Sbjct: 24  VRPGLSEAELDDVEARFGFRFAADHRTFLSAGVPIGDRWPDWRCGNPDQL--RKRLDWPV 81

Query: 122 AAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNP 181
             + + +  N  W   WG RP D   AL  AR+ L  AP L+P+  H Y+P      G P
Sbjct: 82  DGVLYDVEHNGFWLPDWGMRPIDLADALSRARSHLAHAPQLVPVCGHRYLPGIAGSTGYP 141

Query: 182 IFYVDENRIFCCGLDLSDFFERE 204
           +  V +  I   G DL  +   +
Sbjct: 142 VLSVYQTDIAYYGYDLRGYLRHD 164


>gi|383176068|gb|AFG71518.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176069|gb|AFG71519.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176071|gb|AFG71521.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176072|gb|AFG71522.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
 gi|383176073|gb|AFG71523.1| Pinus taeda anonymous locus UMN_801_01 genomic sequence
          Length = 124

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/106 (45%), Positives = 70/106 (66%), Gaps = 2/106 (1%)

Query: 43  SFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPD 102
           SF++ A  VI HL+   + +  GL+D EF R EA FGF FPPDLR+IL  GLP+G GFP+
Sbjct: 21  SFAAYAKNVIDHLRRCQVPIARGLTDEEFERIEATFGFAFPPDLRSILREGLPVGVGFPN 80

Query: 103 WRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKA 148
           WR+   A   LR  +DLP+A +++++++   W K WG +P   ++A
Sbjct: 81  WRS--GATQQLRMMMDLPMAGLSYEVSKGRFWWKDWGSQPYSSDEA 124


>gi|443290539|ref|ZP_21029633.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
 gi|385886094|emb|CCH17707.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
          Length = 205

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/173 (34%), Positives = 86/173 (49%), Gaps = 17/173 (9%)

Query: 47  LADKVITHLKNSGI-RVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGF----- 100
           L  +    L  +G   V PGL+D EF R E  +GF F  D RA L+AGLPL   F     
Sbjct: 9   LGTEAARRLAQTGCCEVAPGLTDEEFDRIERMYGFEFADDHRAFLAAGLPLNVPFEPEEG 68

Query: 101 ---------PDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRV 151
                    PDWR      +  R  L+LP+  + F +  N +W  +WG RP+  ++AL  
Sbjct: 69  VSYAWERPWPDWRNGDPGAI--RERLELPVEGVLFDVEHNTVWHDTWGDRPAGLDEALET 126

Query: 152 ARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           AR  L +AP L+P++ H ++P      G+P+  + +  I   G DL D+  +E
Sbjct: 127 ARLKLGQAPRLVPVYGHRFLPAGRGSVGHPVLSIVQTDIIYYGTDLVDYIHQE 179


>gi|452951304|gb|EME56754.1| hypothetical protein H074_22834 [Amycolatopsis decaplanina DSM
           44594]
          Length = 189

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 55/143 (38%), Positives = 75/143 (52%), Gaps = 2/143 (1%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPI 121
           V+PGLS+AE    EA +GF F  D R  LSAG+P+G  +PDWR     +L  R  LD P+
Sbjct: 24  VRPGLSEAELDDVEARYGFRFAADHRTFLSAGVPIGDRWPDWRCGNPDQL--RKRLDWPV 81

Query: 122 AAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNP 181
             + + +  N  W   WG RP D   AL  AR+ L  AP L+P+  H Y+P      G P
Sbjct: 82  DGVLYDVEHNGFWLPDWGMRPIDLADALSRARSHLALAPQLVPVCGHRYLPGIAGSTGYP 141

Query: 182 IFYVDENRIFCCGLDLSDFFERE 204
           +  V +  I   G DL  + + +
Sbjct: 142 VLSVYQTDIVYYGYDLRGYLQHD 164


>gi|159901563|ref|YP_001547809.1| hypothetical protein Haur_5053 [Herpetosiphon aurantiacus DSM 785]
 gi|159894602|gb|ABX07681.1| conserved hypothetical protein [Herpetosiphon aurantiacus DSM 785]
          Length = 198

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 60/166 (36%), Positives = 85/166 (51%), Gaps = 13/166 (7%)

Query: 55  LKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGP-----------GFPDW 103
           L+  G+ ++PGLS AE    EA++ F FPPDLRA L  GLP+              F DW
Sbjct: 11  LERHGVTIEPGLSAAEVRTIEADYAFRFPPDLRAFLMTGLPIPIIDETIPLSYLRSFVDW 70

Query: 104 RATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLI 163
           R     R H+   L  P+  + F I +  +W  +WG RP+D + A+     A   AP LI
Sbjct: 71  R--HGKREHILERLAWPLHDVEFAIKQQCMWLDAWGERPADRDAAVERGLRAFAAAPRLI 128

Query: 164 PIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRS 209
           PI++H +IP  P    NP+F +    I   G +L+D+   E + RS
Sbjct: 129 PIYSHRFIPATPHEVDNPVFSIYGIDIMHYGRNLADYVVHEFVPRS 174


>gi|302531209|ref|ZP_07283551.1| conserved hypothetical protein [Streptomyces sp. AA4]
 gi|302440104|gb|EFL11920.1| conserved hypothetical protein [Streptomyces sp. AA4]
          Length = 190

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 77/149 (51%), Gaps = 2/149 (1%)

Query: 56  KNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRA 115
           +N G  ++PGL+ AE +  E +FGF F  D R  L+AG+P+G  +PDWR     +L  R 
Sbjct: 20  QNLGEPLRPGLTPAELSDVEDQFGFTFSDDHRIFLTAGVPVGDRWPDWRHGDPDQL--RK 77

Query: 116 SLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNP 175
            LD P+  + + +  N LW   WG RP     AL VAR  L   P L+P+  H Y+P   
Sbjct: 78  RLDWPVDGVLYDVENNDLWLPEWGARPRSMCNALAVARRQLATVPQLVPVCGHRYLPGAA 137

Query: 176 TLAGNPIFYVDENRIFCCGLDLSDFFERE 204
             +G P+  V +  +   G DL  +   E
Sbjct: 138 GESGYPVLSVYQTDVVYYGWDLRSYLRHE 166


>gi|312140034|ref|YP_004007370.1| hypothetical protein REQ_26640 [Rhodococcus equi 103S]
 gi|311889373|emb|CBH48689.1| hypothetical protein REQ_26640 [Rhodococcus equi 103S]
          Length = 194

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/152 (36%), Positives = 76/152 (50%), Gaps = 4/152 (2%)

Query: 55  LKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLR 114
           L   G+ + PGLSDAEF      FGF F PD R++L+A LPLG G+PDWR+   +   L 
Sbjct: 12  LDADGVALGPGLSDAEFTSVHERFGFHFNPDHRSLLAAALPLGDGWPDWRSGDDSE--LT 69

Query: 115 ASLDLPIAAITFQIARNA--LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIP 172
             LD       +         W  SWG  P+ PE+     R  L   P LIPI+ H + P
Sbjct: 70  TWLDRVAEGFIWDALHQTPPFWPASWGELPATPEEVATTVRRQLGSWPRLIPIYRHRFTP 129

Query: 173 CNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
             P+ AG+P+  V +  +   G DL ++   E
Sbjct: 130 AAPSPAGSPVLSVWQTDVIYYGADLVEYLRNE 161


>gi|399070235|ref|ZP_10749710.1| hypothetical protein PMI01_00743 [Caulobacter sp. AP07]
 gi|398044441|gb|EJL37260.1| hypothetical protein PMI01_00743 [Caulobacter sp. AP07]
          Length = 179

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/143 (40%), Positives = 75/143 (52%), Gaps = 9/143 (6%)

Query: 64  PGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLH--LRASLDLPI 121
           P  ++AE   A+A FGF FPPDLRA+L      G       A   AR H  L  S   P 
Sbjct: 11  PPYTEAELDAAQARFGFFFPPDLRALLLKHRIAG-------ALDWARDHEKLAESFAWPY 63

Query: 122 AAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNP 181
           A + F +  N LW   WGPRP DP +   V R A+  AP LIP++ H Y+P  P  AGNP
Sbjct: 64  AGLLFDVEENDLWWPEWGPRPDDPAERAVVLRRAVDAAPKLIPVYGHRYMPETPHEAGNP 123

Query: 182 IFYVDENRIFCCGLDLSDFFERE 204
           +F V +  I   G +L+++   E
Sbjct: 124 VFSVYQADIIVYGRNLANYISHE 146


>gi|414588103|tpg|DAA38674.1| TPA: hypothetical protein ZEAMMB73_780075 [Zea mays]
          Length = 249

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 67/176 (38%), Positives = 105/176 (59%), Gaps = 19/176 (10%)

Query: 174 NPTLAG--NPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQVLIKQRSVSERTAGSS 231
            P LAG  NP+F+V ++R+ CCGL +  FF R+S F+  +   + ++   S S  +AG+S
Sbjct: 81  QPDLAGYGNPVFFVADDRVLCCGLGMLHFFMRDSPFQPAD---RAVVS--SSSPFSAGAS 135

Query: 232 -SNFSRRSLDAGLGSRTPRWVEFWSDAAIDRRRRNSSSSSSSSPERFFDM------PRSE 284
             + +RRSLD    SR   W+EFWSDAA DR  R+SSS  +S+             P   
Sbjct: 136 RPSCTRRSLDVVQASR---WIEFWSDAASDRCHRDSSSLEASTASSSSSSLGCSSPPWRP 192

Query: 285 IPKWVDDYMGQIGSVLREGGWSEPDIVEIVTVSASGFFEG--EMVMLDNQSVLDGL 338
            P+WVD Y+ ++GS+L++GGW + ++ E+V  +ASGF +   E    +++++LD L
Sbjct: 193 TPRWVDSYLDELGSMLKKGGWRDREVDEMVEGTASGFLDDGEEAPTPNSKAILDAL 248


>gi|383781795|ref|YP_005466362.1| hypothetical protein AMIS_66260 [Actinoplanes missouriensis 431]
 gi|381375028|dbj|BAL91846.1| hypothetical protein AMIS_66260 [Actinoplanes missouriensis 431]
          Length = 172

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/151 (38%), Positives = 81/151 (53%), Gaps = 8/151 (5%)

Query: 55  LKNSG-IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHL 113
           L  SG   + PGLS+ E +  E EFGF F PD R  L+AG+P G G+PDWR+   A L  
Sbjct: 7   LAASGRFTIAPGLSETEISDIEREFGFTFAPDHREFLAAGVPTGVGWPDWRSPDRAGLRA 66

Query: 114 RASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPC 173
             +   P+  + F +A N  W + WGP   D    L  AR  L  AP +IP++ H Y+P 
Sbjct: 67  ALA--APVEGVLFDVAENDFWYEGWGPAFGD---RLATARAGLLIAPRMIPLYAHRYLPA 121

Query: 174 NPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
              +AG+P+  + +  +   G DL D+  RE
Sbjct: 122 --VIAGHPVLSIYQTDVVVYGADLCDWLNRE 150


>gi|422807378|ref|ZP_16855808.1| hypothetical protein ERIG_03520 [Escherichia fergusonii B253]
 gi|324111773|gb|EGC05753.1| hypothetical protein ERIG_03520 [Escherichia fergusonii B253]
          Length = 227

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/186 (29%), Positives = 95/186 (51%), Gaps = 7/186 (3%)

Query: 44  FSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPL-GPGFPD 102
           ++    K    L+ +GI +  GL+ AE   AE  + F FP D +  L  G+PL    F D
Sbjct: 2   YTQQVAKSFKLLRENGIFITQGLTTAEIEAAEKRYNFKFPNDYKEFLMQGMPLYDYDFTD 61

Query: 103 WRATGSARLHL--RASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAP 160
           WR      +        D  IA + + I +   W ++WG RP + E+ L  AR  +K+ P
Sbjct: 62  WRNLSDEYVEYIQYKMFDETIAGVLYCITKFDAWPEAWGNRPDNLEERLNFAREKIKKNP 121

Query: 161 LLIPIFNHCYIPCNPTLAGNPIFYVDE-NRIFCCGLDLSDFFERESLFRSTESDPQVLIK 219
           L IP+ ++ Y+  +P  +GNP++ + E + + C G +L D+F +E    ++E +  + I 
Sbjct: 122 LFIPMKSNRYVSLSPADSGNPVYSIHEGDDVICYGENLWDYFTKE---YASEDECDIHIN 178

Query: 220 QRSVSE 225
            +++ E
Sbjct: 179 LKTMPE 184


>gi|134103379|ref|YP_001109040.1| hypothetical protein SACE_6953 [Saccharopolyspora erythraea NRRL
           2338]
 gi|291004402|ref|ZP_06562375.1| hypothetical protein SeryN2_07772 [Saccharopolyspora erythraea NRRL
           2338]
 gi|133916002|emb|CAM06115.1| hypothetical protein SACE_6953 [Saccharopolyspora erythraea NRRL
           2338]
          Length = 196

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/152 (36%), Positives = 82/152 (53%), Gaps = 14/152 (9%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLG-------PGFPDWRA--TGSARLH 112
           ++PGL+DAEF   EA  GF F  D RA L+ GLP+G         +PDWR    G  R  
Sbjct: 25  IEPGLTDAEFDWVEARHGFEFAADHRAFLAVGLPVGRPRSERSGPWPDWRNGDPGELRAQ 84

Query: 113 LRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIP 172
           +  + DL +  +     R+  W +SWGPRP  PE+A+ VA+  + +AP ++P++ H Y+P
Sbjct: 85  IGKAADLLLCDV-----RHGHWRRSWGPRPDKPEEAVEVAKVHVAQAPAIVPVYAHRYLP 139

Query: 173 CNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
                 G+P+  +        G DL+D+  RE
Sbjct: 140 AGRGSYGHPVLSLYGADCIYYGADLADYVRRE 171


>gi|425747104|ref|ZP_18865122.1| hypothetical protein ACINWC323_0966 [Acinetobacter baumannii
           WC-323]
 gi|425484264|gb|EKU50673.1| hypothetical protein ACINWC323_0966 [Acinetobacter baumannii
           WC-323]
          Length = 194

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 60/174 (34%), Positives = 81/174 (46%), Gaps = 12/174 (6%)

Query: 53  THLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPG---FPDWR---AT 106
           T L   GI+   GL++AE    E  F F FPPDL+  L  GLP+      FP WR     
Sbjct: 8   TELLKHGIQFDTGLTEAEIEAIEQLFRFHFPPDLKLFLKYGLPVSENKWKFPRWREALHN 67

Query: 107 GSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIF 166
             A+  L   L  P   I F I +N  W   WG  P +    L +     K  P +IP++
Sbjct: 68  DQAKNILIEQLKAPHEGIAFDI-KNGFWLDQWGAAPEEISDCLVIFEEQFKAYPKMIPVY 126

Query: 167 NHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE-----SLFRSTESDPQ 215
           +H YIP  P   GNPIF + +  I   G DL ++F  E     +LF   ++ P+
Sbjct: 127 SHRYIPSTPLENGNPIFSIMQTDIIYYGTDLINYFCNEFNLDKTLFDQPQATPK 180


>gi|238059031|ref|ZP_04603743.1| hypothetical protein MCAG_05622 [Micromonospora sp. ATCC 39149]
 gi|237886467|gb|EEP75295.1| hypothetical protein MCAG_05622 [Micromonospora sp. ATCC 39149]
          Length = 177

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 16/153 (10%)

Query: 66  LSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPG--------------FPDWRATGSARL 111
           ++D EF R E  +GF F  D RA L+AGLPL                 +PDWR    + +
Sbjct: 1   MTDVEFDRIERTYGFEFADDHRAFLAAGLPLNVPVEPEEGVSYAWERPWPDWRDEDPSAI 60

Query: 112 HLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYI 171
             R  L+ P+  + F +  NA+W  +WG RP+  ++AL  AR  L + P L+P++ H ++
Sbjct: 61  --REQLEWPVEGVLFDVEHNAVWHDTWGERPAGRDEALATARLRLAQVPRLVPVYAHRFL 118

Query: 172 PCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           P      G+P+  + +  I   G +L+D+  +E
Sbjct: 119 PAGRGTVGHPVLSIWQTDIIYYGTNLADYIHQE 151


>gi|417691806|ref|ZP_12341014.1| hypothetical protein SB521682_4087 [Shigella boydii 5216-82]
 gi|332085353|gb|EGI90525.1| hypothetical protein SB521682_4087 [Shigella boydii 5216-82]
          Length = 227

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/186 (29%), Positives = 97/186 (52%), Gaps = 7/186 (3%)

Query: 44  FSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPL-GPGFPD 102
           ++   ++    L+ +GI +  GL+ AE   AE  + F FP D +A+L  G+PL    F D
Sbjct: 2   YTQQVEQSFKLLRENGIFITRGLTAAEIDAAEKRYNFKFPNDYKALLIQGMPLYDYDFTD 61

Query: 103 WRATGSARLHL--RASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAP 160
           WR      +        D  I+ I + I +   W ++WG RP + E+ L  +R  +K+ P
Sbjct: 62  WRNLSDEYVEYIQYKMFDETISGILYCIDKFDAWPQAWGNRPDNLEERLNFSREKIKKNP 121

Query: 161 LLIPIFNHCYIPCNPTLAGNPIFYVDE-NRIFCCGLDLSDFFERESLFRSTESDPQVLIK 219
           L IPI ++ Y+  +P  +GNP++ + E + + C G +L D+F +  +   +E +  + I 
Sbjct: 122 LFIPIKSNRYLALSPAESGNPVYSIHEGDDVICYGENLWDYFTKMYV---SEDECDIHIN 178

Query: 220 QRSVSE 225
            +++ E
Sbjct: 179 LKTMPE 184


>gi|194431247|ref|ZP_03063540.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|416280401|ref|ZP_11645368.1| expressed protein [Shigella boydii ATCC 9905]
 gi|417674329|ref|ZP_12323762.1| hypothetical protein SD15574_4175 [Shigella dysenteriae 155-74]
 gi|194420702|gb|EDX36778.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|320181944|gb|EFW56850.1| expressed protein [Shigella boydii ATCC 9905]
 gi|332085613|gb|EGI90777.1| hypothetical protein SD15574_4175 [Shigella dysenteriae 155-74]
          Length = 227

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/186 (29%), Positives = 97/186 (52%), Gaps = 7/186 (3%)

Query: 44  FSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPL-GPGFPD 102
           ++   ++    L+ +GI +  GL+ AE   AE  + F FP D +A+L  G+PL    F D
Sbjct: 2   YTQQVEQSFKLLRENGIFITRGLTAAEIDAAEKRYNFKFPNDYKALLIQGMPLYDYDFTD 61

Query: 103 WRATGSARLHL--RASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAP 160
           WR      +        D  I+ I + I +   W ++WG RP + E+ L  +R  +K+ P
Sbjct: 62  WRNLSDEYVEYIQYKMFDETISGILYCIDKFDAWPQAWGNRPDNLEERLNFSREKIKKNP 121

Query: 161 LLIPIFNHCYIPCNPTLAGNPIFYVDE-NRIFCCGLDLSDFFERESLFRSTESDPQVLIK 219
           L IPI ++ Y+  +P  +GNP++ + E + + C G +L D+F +  +   +E +  + I 
Sbjct: 122 LFIPIKSNRYLALSPADSGNPVYSIHEGDDVICYGENLWDYFTKMYV---SEDECDIHIN 178

Query: 220 QRSVSE 225
            +++ E
Sbjct: 179 LKTMPE 184


>gi|226349777|ref|YP_002776891.1| hypothetical protein ROP_pROB01-05400 [Rhodococcus opacus B4]
 gi|226245692|dbj|BAH56039.1| hypothetical protein [Rhodococcus opacus B4]
          Length = 194

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 83/163 (50%), Gaps = 4/163 (2%)

Query: 44  FSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDW 103
             ++ D  ++ L+ +G+ +  GL+D+E     A FG  F  D   +L   LPLG  +PDW
Sbjct: 1   MGAVGDAAVSVLREAGVTLAAGLTDSEIDSVHARFGVDFACDHLDLLKTALPLGERWPDW 60

Query: 104 RATGSARLHLRASLDLPIAAITFQIARNAL--WSKSWGPRPSDPEKALRVARNALKRAPL 161
           R    A L  +  L  P+ +  + +    +  W  SWGPRP++  +  R AR  L + P+
Sbjct: 61  RDGDDAEL--QRMLGWPVESFVWDVLHQPVPFWPASWGPRPTEAVEIERTARKNLAKWPV 118

Query: 162 LIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           L+PI+ H Y+P  P   G P+F V +  +   G DL ++   E
Sbjct: 119 LVPIYGHRYLPAAPAPPGCPVFSVYQTDVIYYGPDLVEYLRNE 161


>gi|432865494|ref|ZP_20088591.1| hypothetical protein A311_04354 [Escherichia coli KTE146]
 gi|431402240|gb|ELG85553.1| hypothetical protein A311_04354 [Escherichia coli KTE146]
          Length = 235

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 96/188 (51%), Gaps = 7/188 (3%)

Query: 42  VSFSSLADKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPL-GPGF 100
           V ++    +    L+ +GI +  GL+ AE   AE  + F FP D +A+L  G+PL    F
Sbjct: 8   VMYTQQVAQSFKLLRENGIFITQGLTTAEIEAAEKRYNFKFPNDYKALLMQGMPLYDYDF 67

Query: 101 PDWRATGSARLHL--RASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKR 158
            DWR      +        D  I+ +   I +   W ++WG RP + E+ L  +R  +K+
Sbjct: 68  TDWRNLSDEYVKYIQYKMFDETISGVLCCITKFDAWPEAWGNRPDNLEERLNFSREKIKK 127

Query: 159 APLLIPIFNHCYIPCNPTLAGNPIFYVDE-NRIFCCGLDLSDFFERESLFRSTESDPQVL 217
            PL IPI ++ Y+  +P  +GNP++ + E + + C G +L D+F +  +   +E +  + 
Sbjct: 128 NPLFIPIKSNRYLALSPADSGNPVYSIHEGDDVICYGENLWDYFTKMYV---SEDECDIH 184

Query: 218 IKQRSVSE 225
           I  +++ E
Sbjct: 185 INLKTMPE 192


>gi|218548277|ref|YP_002382068.1| hypothetical protein EFER_0896 [Escherichia fergusonii ATCC 35469]
 gi|218355818|emb|CAQ88431.1| hypothetical protein EFER_0896 [Escherichia fergusonii ATCC 35469]
          Length = 191

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 74/157 (47%), Gaps = 3/157 (1%)

Query: 51  VITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRAT---G 107
           +I  LK  GI    GLS+AE    E  F F FP D +A L   LP   GF  WR T   G
Sbjct: 12  LINKLKAQGIIFYAGLSEAEITAIEQTFNFRFPLDYKAFLHNALPATEGFIHWRQTLHSG 71

Query: 108 SARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
                ++  L +P+  I + + +N  W   WG +  + +           + P+LIP++ 
Sbjct: 72  KMEREVKQRLKIPLDGILYDVMKNNFWFDIWGEKLLNLDSRKDHFDKISNQCPVLIPLYK 131

Query: 168 HCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           H Y+  +    GNP++ +  + I C G DLS + + E
Sbjct: 132 HRYMSTSSYTGGNPVYSIYNSDIICAGNDLSSWIKTE 168


>gi|422806268|ref|ZP_16854700.1| hypothetical protein ERIG_02411 [Escherichia fergusonii B253]
 gi|324112806|gb|EGC06782.1| hypothetical protein ERIG_02411 [Escherichia fergusonii B253]
          Length = 191

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 74/157 (47%), Gaps = 3/157 (1%)

Query: 51  VITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRAT---G 107
           ++  LK  GI    GLS+AE    E  F F FP D +A L   LP   GF  WR T   G
Sbjct: 12  LVNKLKTQGIIFYAGLSEAEITAIEQTFNFRFPLDCKAFLHNALPATEGFIHWRQTLHSG 71

Query: 108 SARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
                ++  L +P+  I + + +N  W   WG +  + +           + P+LIP++ 
Sbjct: 72  KMEREVKQRLKIPLDGILYDVIKNNFWLDIWGEKLLNLDSRKDHFDKISNQCPVLIPLYK 131

Query: 168 HCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           H Y+  +    GNP++ +  + I C G DLS + + E
Sbjct: 132 HRYMSTSSYTGGNPVYSIYNSDIICAGNDLSSWIKTE 168


>gi|315501458|ref|YP_004080345.1| hypothetical protein ML5_0644 [Micromonospora sp. L5]
 gi|315408077|gb|ADU06194.1| hypothetical protein ML5_0644 [Micromonospora sp. L5]
          Length = 192

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 80/166 (48%), Gaps = 19/166 (11%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPG----------FPDWRATG--SA 109
           ++ GL+DAEFAR E EFGF F  D RA L+ GLP+             +PDWR     + 
Sbjct: 17  IEDGLTDAEFARIEREFGFEFAEDHRAFLAVGLPVSSAPEDGATWSNPWPDWRGGDPEAL 76

Query: 110 RLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHC 169
           R+H+   LD  I  +      +  W   WG RPS  + A R AR  L  AP ++P++ H 
Sbjct: 77  RMHVNWELDFLIERV-----EDGEWDPRWGSRPSTRDMASREARRVLLAAPKMVPVYGHR 131

Query: 170 YIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERESLFRSTESDPQ 215
           ++P      G+P+  +    I   G DL D+   E  F  T  D Q
Sbjct: 132 FLPAGRGSYGHPVLSMWGWDIIVYGADLLDYIGNE--FDRTYDDRQ 175


>gi|218201298|gb|EEC83725.1| hypothetical protein OsI_29565 [Oryza sativa Indica Group]
          Length = 87

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 65/86 (75%), Gaps = 2/86 (2%)

Query: 313 IVTVSASGFFEGEMV-MLDNQSVLDGLLLKADRLSESLRKSGWSSEEVSDAFGFDFRPEK 371
           +V V+ASG F+GE    +D  +VLD LLLKADR S+SLR++GWSSE+VSDA G D R  K
Sbjct: 1   MVEVTASGLFDGEEAPAVDADAVLDALLLKADRCSDSLRRAGWSSEDVSDALGLDLRRCK 60

Query: 372 ER-KPAKKLSPELVERIGKLAESVSR 396
           ER +PA +L PE+  ++ +LA+SV+R
Sbjct: 61  ERPRPAVQLPPEIAVKVERLAKSVAR 86


>gi|422780807|ref|ZP_16833592.1| hypothetical protein ERFG_01047 [Escherichia coli TW10509]
 gi|323977525|gb|EGB72611.1| hypothetical protein ERFG_01047 [Escherichia coli TW10509]
          Length = 191

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 3/157 (1%)

Query: 51  VITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRAT---G 107
           ++  LK  GI    GLS+ E    E  F F FP D +A L   LP   GF  WR T   G
Sbjct: 12  LVNKLKAQGIIFYAGLSETEITAIEQTFSFRFPLDCKAFLHNALPTTEGFIHWRQTLHSG 71

Query: 108 SARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
                ++ +L +P+  I + + +N  W   WG +  + +           + P+LI ++ 
Sbjct: 72  KMEREVKQTLKIPLDGILYDVMKNNFWLDIWGEKLLNLDSRKDHFDKISNQCPVLILLYK 131

Query: 168 HCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           H Y+  +    GNP++ +  + I C G DLS + + E
Sbjct: 132 HRYMSTSSYTGGNPVYSIYNSDIICAGNDLSSWIKTE 168


>gi|424815671|ref|ZP_18240822.1| hypothetical protein ECD227_0788 [Escherichia fergusonii ECD227]
 gi|325496691|gb|EGC94550.1| hypothetical protein ECD227_0788 [Escherichia fergusonii ECD227]
          Length = 191

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 73/157 (46%), Gaps = 3/157 (1%)

Query: 51  VITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRAT---G 107
           ++  LK  GI    GLS+AE    E  F F FP D +A L   LP   GF  WR T   G
Sbjct: 12  LVNKLKTQGIIFYAGLSEAEITAIEQTFNFRFPLDCKAFLHNALPATEGFIHWRQTLHSG 71

Query: 108 SARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
                ++  L +P+  I + + +N      WG +  + +           + P+LIP++ 
Sbjct: 72  KMEREVKQRLKIPLDGILYDVMKNNFLLDIWGEKLLNLDSRKDHFDKISNQCPVLIPLYK 131

Query: 168 HCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           H Y+  +    GNP++ +  + I C G DLS + + E
Sbjct: 132 HRYMSTSSYTGGNPVYSIYNSDIICAGNDLSSWIKTE 168


>gi|114568722|ref|YP_755402.1| hypothetical protein Mmar10_0168 [Maricaulis maris MCS10]
 gi|114339184|gb|ABI64464.1| hypothetical protein Mmar10_0168 [Maricaulis maris MCS10]
          Length = 199

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 55/165 (33%), Positives = 82/165 (49%), Gaps = 8/165 (4%)

Query: 45  SSLADKVITHLKNSGIRVQP-----GLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPG 99
             L D++  H +++G R  P     G+S+ E    E + GF  PPDLR +L+     G  
Sbjct: 6   QDLFDELNAH-QHTGNRQAPKYGVKGVSNHELDAIERDLGFDLPPDLRFLLTHVQDPGGV 64

Query: 100 FPDWRATGSARLHLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRA 159
           F  W     A    +AS++   + I F I  N +W   WG RPS   +A  +A   +K  
Sbjct: 65  FFPWAEFTLASY--KASIERVYSGIHFDIEHNKIWLSRWGQRPSSLAEAKAIADADMKTW 122

Query: 160 PLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           P L+P+  H Y+P +P L  NP+F + +  I   G DL+D+  RE
Sbjct: 123 PRLVPVSGHRYLPVDPCLDRNPVFSIWQTDIIYYGADLADYLVRE 167


>gi|456358142|dbj|BAM92587.1| hypothetical protein S58_66200 [Agromonas oligotrophica S58]
          Length = 180

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 74/146 (50%), Gaps = 8/146 (5%)

Query: 60  IRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPL-GPGFPDWRATGSARLHLRASLD 118
           +R   G + +E  RA+  FG VFPPDL A+L+   P+ G  + D RA       +R +L+
Sbjct: 9   VRWTKGYTRSELDRAQRRFGLVFPPDLIALLTRRRPVDGHDWTDERA-------IRRALE 61

Query: 119 LPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLA 178
            P  ++   +    LW   WG  PS          + + RAP LIP+  H Y+P  P  +
Sbjct: 62  WPFESLHHAVETGQLWWSEWGRLPSTARARADALPDIVSRAPKLIPLIAHRYLPEQPHES 121

Query: 179 GNPIFYVDENRIFCCGLDLSDFFERE 204
           GNP+F +        G +L+D+FERE
Sbjct: 122 GNPVFSIYGVDAIYYGANLNDYFERE 147


>gi|309798709|ref|ZP_07692974.1| conserved hypothetical protein [Streptococcus infantis SK1302]
 gi|308117652|gb|EFO55063.1| conserved hypothetical protein [Streptococcus infantis SK1302]
          Length = 187

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 3/157 (1%)

Query: 49  DKVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGS 108
           D++ T L  +GI ++PGLSD E  + E  +G  FP    A+    LP+  GF +WR   +
Sbjct: 9   DRIKTLLLENGISLEPGLSDEEIQKVEIIYGIEFPEQWLAVYQQLLPISEGFYNWRDFSA 68

Query: 109 ARL-HLRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
             + +++ +L  P   I  +     +W  SWG  P+         R  L+ AP LIP++ 
Sbjct: 69  ENIEYIKRNLAAPYDGI-LESLDELVWDASWGNEPTTLLDRNVQIRKMLESAPRLIPLYG 127

Query: 168 HCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           H ++P        PI  V +  I   G DL D+FE E
Sbjct: 128 HRFLPSYEN-QEMPILSVVDLDIIYYGKDLYDYFEIE 163


>gi|222625071|gb|EEE59203.1| hypothetical protein OsJ_11151 [Oryza sativa Japonica Group]
          Length = 199

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 55/154 (35%), Positives = 80/154 (51%), Gaps = 22/154 (14%)

Query: 246 RTPRWVEFWSDAAI--DRRRRNSSSSSSSSPERFFDMPRSEIPKWVDDYMGQIGSVLREG 303
           + PRW+EFW+DAA   DR    S+ +S+ +         +  P+WV  Y+   GSVLR G
Sbjct: 46  KAPRWIEFWTDAAAAGDRFVEVSTCTSTRA---------NAAPQWVRWYLEWAGSVLRRG 96

Query: 304 GWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLLLKADRLSESLRKSGWSSEEVSDAF 363
           GW   ++ E+ T S  G  E              L L  DR    L ++GW +EEV +A 
Sbjct: 97  GWGGGEVEEMTTGSGGGGEEAV-----------ALALTVDRCCGELGRAGWGAEEVVEAL 145

Query: 364 GFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           G    P   ++PA  L P++  R+G+LAE+VSR+
Sbjct: 146 GALLGPRTRKRPAVALPPDVAARVGRLAEAVSRA 179


>gi|411006118|ref|ZP_11382447.1| hypothetical protein SgloC_25216 [Streptomyces globisporus C-1027]
          Length = 222

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/170 (35%), Positives = 83/170 (48%), Gaps = 30/170 (17%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPG------------FPDWRATGSA 109
           ++PGLSDAEF R EAE+G VF  D R  L+AGLP+               +P+WR     
Sbjct: 29  LEPGLSDAEFDRIEAEYGIVFAADHRGFLAAGLPVDRAAPREQGESPRNPWPNWRDGDPE 88

Query: 110 RLHLRASLDLPIAAITFQIARNALW--SKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
           R  +R  L  P+  + F +  +  W     WGPRP+DP +A+  AR AL   P LIP+++
Sbjct: 89  R--IRERLGRPVDELLFSV-EHGWWLPGGRWGPRPADPAEAVAAARAALSSVPALIPLYS 145

Query: 168 HCYIPCNPTLA-------------GNPIFYVDENRIFCCGLDLSDFFERE 204
           H Y+     LA             G P+  V        G DL+++ ERE
Sbjct: 146 HRYLAAGTGLAAGAGSGTEAGAAHGRPVLSVVGADTIHYGRDLAEWVERE 195


>gi|239989656|ref|ZP_04710320.1| hypothetical protein SrosN1_20310 [Streptomyces roseosporus NRRL
           11379]
          Length = 213

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 81/161 (50%), Gaps = 21/161 (13%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPG------------FPDWRATGSA 109
           ++PGLSDAEF R EAE+G VF P  R  L+AGLP+               +P+WR  G  
Sbjct: 29  LEPGLSDAEFDRIEAEYGIVFAPGHRGFLAAGLPVDRAAPPEEGASPRNPWPNWR-DGDP 87

Query: 110 RLHLRASLDLPIAAITFQIARNALW--SKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
            L +R  L  P+  + F +  +  W     WG RP+DP +A+  AR AL   P LIP+++
Sbjct: 88  DL-IRERLGRPVEELLFSV-EHGWWLPGGRWGQRPADPAEAVAAARAALASVPTLIPLYS 145

Query: 168 HCYIPCNPTLA----GNPIFYVDENRIFCCGLDLSDFFERE 204
           H Y+      A    G  +  V        G DL+++ ERE
Sbjct: 146 HRYLASGAGEAGVARGRAVLSVVGADTIHYGRDLAEWVERE 186


>gi|291446673|ref|ZP_06586063.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
           15998]
 gi|291349620|gb|EFE76524.1| conserved hypothetical protein [Streptomyces roseosporus NRRL
           15998]
          Length = 193

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 81/161 (50%), Gaps = 21/161 (13%)

Query: 62  VQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPG------------FPDWRATGSA 109
           ++PGLSDAEF R EAE+G VF P  R  L+AGLP+               +P+WR  G  
Sbjct: 9   LEPGLSDAEFDRIEAEYGIVFAPGHRGFLAAGLPVDRAAPPEEGASPRNPWPNWR-DGDP 67

Query: 110 RLHLRASLDLPIAAITFQIARNALW--SKSWGPRPSDPEKALRVARNALKRAPLLIPIFN 167
            L +R  L  P+  + F +  +  W     WG RP+DP +A+  AR AL   P LIP+++
Sbjct: 68  DL-IRERLGRPVEELLFSV-EHGWWLPGGRWGQRPADPAEAVAAARAALASVPTLIPLYS 125

Query: 168 HCYIPCNPTLA----GNPIFYVDENRIFCCGLDLSDFFERE 204
           H Y+      A    G  +  V        G DL+++ ERE
Sbjct: 126 HRYLASGAGEAGVARGRAVLSVVGADTIHYGRDLAEWVERE 166


>gi|242040701|ref|XP_002467745.1| hypothetical protein SORBIDRAFT_01g033286 [Sorghum bicolor]
 gi|241921599|gb|EER94743.1| hypothetical protein SORBIDRAFT_01g033286 [Sorghum bicolor]
          Length = 140

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 57/120 (47%), Positives = 71/120 (59%), Gaps = 9/120 (7%)

Query: 1   MVDVDRRMT---GLN--PAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLADKVITHL 55
           MVDVDRR +   GL   P+H AGLRRLS RA+AP+T  +    +   + +  A      L
Sbjct: 3   MVDVDRRPSHPHGLPRPPSHAAGLRRLSTRASAPSTPRTAAPPSPSAASAGPAPSPSALL 62

Query: 56  KNS---GIRVQPGLSDAEFARAEAEFGFV-FPPDLRAILSAGLPLGPGFPDWRATGSARL 111
            +    G+ V PGLSDAE A AEA  G V  PPDLR +L+ GLP G GFPD+R+    RL
Sbjct: 63  AHLAAAGVTVLPGLSDAELALAEAALGGVQLPPDLRDLLALGLPSGDGFPDYRSPAGLRL 122


>gi|347732537|ref|ZP_08865616.1| hypothetical protein DA2_1907 [Desulfovibrio sp. A2]
 gi|347518711|gb|EGY25877.1| hypothetical protein DA2_1907 [Desulfovibrio sp. A2]
          Length = 181

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 63/150 (42%), Gaps = 2/150 (1%)

Query: 55  LKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLR 114
           L+   +  + GL+  E A  E  F F FPPDLR +L+  LP+   F  WR   ++  +  
Sbjct: 8   LRKKDVVFESGLTPEELAHVENIFTFRFPPDLRFLLNMALPVQDHFIHWRQALTSEAYRT 67

Query: 115 ASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCN 174
             L      +   +       +  G      + AL    N     P LIPI+   YIP  
Sbjct: 68  DILRRIATGLEEMVEAGKQLHEEMGISFDGTDTALET--NPFPGIPRLIPIYGRRYIPET 125

Query: 175 PTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           P  A NP++ +    +   G DL+ +F  E
Sbjct: 126 PHEAANPVYSIFAADMIYYGFDLASYFRNE 155


>gi|417629539|ref|ZP_12279776.1| hypothetical protein ECSTECMHI813_2461 [Escherichia coli
           STEC_MHI813]
 gi|345372286|gb|EGX04250.1| hypothetical protein ECSTECMHI813_2461 [Escherichia coli
           STEC_MHI813]
          Length = 106

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 31/95 (32%), Positives = 45/95 (47%), Gaps = 3/95 (3%)

Query: 51  VITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWR-ATGSA 109
           ++  LK  GI    GLS+AE    E  F F FP D +A L   LP   GF  WR A  S+
Sbjct: 12  LVNKLKAQGIIFYAGLSEAEITAIEQTFNFCFPLDFKAFLHTALPSTEGFIHWRQALHSS 71

Query: 110 RL--HLRASLDLPIAAITFQIARNALWSKSWGPRP 142
           ++   ++     P+  + + + +N  W   WG  P
Sbjct: 72  KMENEVKQKFKTPLEGVLYDVIKNNFWLDVWGAPP 106


>gi|108798789|ref|YP_638986.1| hypothetical protein Mmcs_1821 [Mycobacterium sp. MCS]
 gi|108769208|gb|ABG07930.1| hypothetical protein Mmcs_1821 [Mycobacterium sp. MCS]
          Length = 132

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 40/71 (56%)

Query: 133 LWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYVDENRIFC 192
           LW  SWG RP+  + ALR A   L R P L+P+ ++CY+P     +G+P+  + +  I  
Sbjct: 40  LWDDSWGRRPARMKDALRSAAYQLARVPRLVPVHSNCYLPAGHDSSGHPVLSIYQADIHV 99

Query: 193 CGLDLSDFFER 203
              DL D+ +R
Sbjct: 100 VAADLFDYVDR 110


>gi|256424149|ref|YP_003124802.1| hypothetical protein Cpin_5170 [Chitinophaga pinensis DSM 2588]
 gi|256039057|gb|ACU62601.1| hypothetical protein Cpin_5170 [Chitinophaga pinensis DSM 2588]
          Length = 249

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 62/156 (39%), Gaps = 19/156 (12%)

Query: 65  GLSDAEFARAEAEFGFVFPPDLRAILSAGLPLG---------------PGFPDWRATGSA 109
           G++D +  + + ++  VF P+ RA L     +                P   +W     A
Sbjct: 47  GMTDEQIDQVQEKYAVVFTPEHRAFLKILHTIDRKERVYNAGGDYYEQPFLFNWLEDEEA 106

Query: 110 RLHLRASLDLPIAAITFQIARNALW-SKSWGPRPSDPEKALRVARNALKRAPLLIPIFNH 168
              +R  L+ P   I   + R   W +++WG RP   E    V       AP +IP+  H
Sbjct: 107 ---IRLKLNFPYNTILDDVLRMGFWHNRTWGERPDSDEDRTTVFTELYNTAPKVIPLTGH 163

Query: 169 CYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
            Y+  + TL   P+  V    I C GLDL  +   E
Sbjct: 164 RYLVADATLERKPVLSVWGADIVCYGLDLRKYLLHE 199


>gi|312867344|ref|ZP_07727553.1| hypothetical protein HMPREF9626_0780 [Streptococcus parasanguinis
           F0405]
 gi|311097045|gb|EFQ55280.1| hypothetical protein HMPREF9626_0780 [Streptococcus parasanguinis
           F0405]
          Length = 138

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 62/135 (45%), Gaps = 3/135 (2%)

Query: 67  SDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLH-LRASLDLPIAAIT 125
           +D  F + E  +G + P + + +      L  G+ +W          L+  ++  +  IT
Sbjct: 6   TDEVFTKIEETYGIILPNEFKQVYCDLEQLPNGWYNWADFSRENTDCLKKQMEEVMGEIT 65

Query: 126 FQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIPCNPTLAGNPIFYV 185
            +I     W+ +WG  P+   +     ++ +K +P L+PI  H YI C PT A +P+F +
Sbjct: 66  DEIDE-IEWNDNWGECPNTIHEVHHFIKSKMKSSPALMPIVGHRYIACEPT-AISPVFSI 123

Query: 186 DENRIFCCGLDLSDF 200
             + I      L+DF
Sbjct: 124 VGSEIIYYSASLTDF 138


>gi|157165653|ref|YP_001466113.1| hypothetical protein CCC13826_1677 [Campylobacter concisus 13826]
 gi|112800543|gb|EAT97887.1| hypothetical protein CCC13826_1677 [Campylobacter concisus 13826]
          Length = 188

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 71/161 (44%), Gaps = 8/161 (4%)

Query: 50  KVITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSA 109
           ++I  LK+SGI+   GL+  E  +    +   FP  L++ L+  LP+  GF +WR   S 
Sbjct: 4   EIIKLLKDSGIKFDAGLTQNELEKIYEIYKIKFPLSLKSFLATALPVSDGFYNWRDFSSG 63

Query: 110 RL----HLRASLDLPIAAITFQ--IARNALWSKSWGPRPSDPEKALRVARNALKRAPLLI 163
            +    ++       + + +F+  I  NA     W       E+ + +     K AP +I
Sbjct: 64  NVTKIKNMMQHFYEFVKSASFEEYIPNNAYSPDQWCKISR--EQRVEIVAELYKNAPKII 121

Query: 164 PIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
           P+ +H ++P        P+  +  + I   G +L ++   E
Sbjct: 122 PVCSHRFMPEMNDDNDPPVLSIYYDDIVYYGRNLQEYLLHE 162


>gi|242040699|ref|XP_002467744.1| hypothetical protein SORBIDRAFT_01g033283 [Sorghum bicolor]
 gi|241921598|gb|EER94742.1| hypothetical protein SORBIDRAFT_01g033283 [Sorghum bicolor]
          Length = 112

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 36/60 (60%)

Query: 338 LLLKADRLSESLRKSGWSSEEVSDAFGFDFRPEKERKPAKKLSPELVERIGKLAESVSRS 397
           L L  DR    L++ GW++EEV +  G    P K R+    L P++  R+G+LAE+VSR+
Sbjct: 40  LALTVDRCCGDLKRGGWAAEEVVEMLGALLGPRKPRRAVAALPPDVAARVGRLAEAVSRA 99


>gi|222623217|gb|EEE57349.1| hypothetical protein OsJ_07479 [Oryza sativa Japonica Group]
          Length = 153

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 56/119 (47%), Gaps = 13/119 (10%)

Query: 1   MVDVDRRM---TGL--NPAHIAGLRRLSARAAAPTTATSLPVRNGLVSFSSLA---DKVI 52
           MVDVD R+    GL  +P+H  GL RL+   +A TT    P      S+ ++A     V+
Sbjct: 1   MVDVDHRLPVPHGLLHSPSHATGLHRLTTHGSAITTLRVSPATASPSSYPTVAPLPSTVL 60

Query: 53  THLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARL 111
            HL  + + V PGLSD E  R  + F        R+          GFP++R++   RL
Sbjct: 61  AHLATARVSVLPGLSDPELFRVASSFLLTSVTSSRSTCHLD-----GFPNYRSSAGLRL 114


>gi|170090294|ref|XP_001876369.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164647862|gb|EDR12105.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 692

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 52/94 (55%), Gaps = 20/94 (21%)

Query: 284 EIPKWVDDYM----GQIGSVLREGGWSEPDIVEIVTVSASGFFEGEMVMLDNQSVLDGLL 339
           +I KW D+Y+    GQ  +V+  G      +  ++T+  + F+E +MV+LDN  V   L 
Sbjct: 342 DIQKWADEYLRVRFGQESTVI--GAL----VPTLLTMIVNSFYEAQMVLLDN-IVSKALE 394

Query: 340 LKADRLSESLRKSGWSSEEVSD-AFGFDFRPEKE 372
           LK DRL  SL        EVSD AFGFDF+  +E
Sbjct: 395 LKNDRLGSSL--------EVSDFAFGFDFKMTQE 420


>gi|262283110|ref|ZP_06060877.1| predicted protein [Streptococcus sp. 2_1_36FAA]
 gi|262261362|gb|EEY80061.1| predicted protein [Streptococcus sp. 2_1_36FAA]
          Length = 164

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 64/154 (41%), Gaps = 5/154 (3%)

Query: 66  LSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAIT 125
            ++  F   E  +    P + + +      L   + DW       + + ++    I    
Sbjct: 5   FNERTFRNIEQIYRVYLPDEFKKVYGNMTKLPKNWYDWSDFSPQNVKMLSNYIQIIKENI 64

Query: 126 FQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIP-CNPTLAGNPIFY 184
            +      WS +WG  PSD E   R  R+ L  +P L PI  H YI  CN  ++  P+F 
Sbjct: 65  AEEIEYVDWSDNWGEAPSDLELMKREIRSRLINSPTLFPISGHRYIASCNTPIS--PVFS 122

Query: 185 VDENRIFCCGLDLSDFFERESLFRSTE-SD-PQV 216
           +  + I      L+D+F   ++ R T  SD PQ+
Sbjct: 123 IVGSDIIYYSKSLTDYFHGIAISRETNLSDLPQI 156


>gi|125540243|gb|EAY86638.1| hypothetical protein OsI_08018 [Oryza sativa Indica Group]
          Length = 152

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 58/121 (47%), Gaps = 18/121 (14%)

Query: 1   MVDVDRRM---TGL--NPAHIAGLRRLSARAAAPTT-----ATSLPVRNGLVSFSSLADK 50
           MVD+D R+    GL  +P+H  GL RL+   +A TT     AT+ P      + + L   
Sbjct: 1   MVDIDHRLPVPHGLLHSPSHATGLHRLTTHGSAITTLRVSPATASP---SYPTVAPLPST 57

Query: 51  VITHLKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSAR 110
           V  HL  + + V PGLSD E  R  + F        R  L+  L    GFP++R++   R
Sbjct: 58  VRAHLATARVSVLPGLSDPELFRVASSFLLTSATSSR--LTCHL---DGFPNYRSSAGLR 112

Query: 111 L 111
           L
Sbjct: 113 L 113


>gi|270292136|ref|ZP_06198351.1| hypothetical protein HMPREF0850_01301 [Streptococcus sp. M143]
 gi|270279664|gb|EFA25506.1| hypothetical protein HMPREF0850_01301 [Streptococcus sp. M143]
          Length = 164

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 33/146 (22%), Positives = 60/146 (41%), Gaps = 3/146 (2%)

Query: 66  LSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARLHLRASLDLPIAAIT 125
            ++  F + E  +    P + + +      L   + DW       + + ++    I    
Sbjct: 5   FNNRTFRKIEEIYSVYLPDEFKKVYGNMEELPENWYDWSDFSPQNVKVLSNYIQVIKENI 64

Query: 126 FQIARNALWSKSWGPRPSDPEKALRVARNALKRAPLLIPIFNHCYIP-CNPTLAGNPIFY 184
            +      WS +WG  PS+ E       + L  +P L+PIF H YI  CN  ++  P+F 
Sbjct: 65  AEEIEYVDWSDNWGEAPSNLELTKGEILSRLMNSPTLLPIFGHRYIASCNTPIS--PVFS 122

Query: 185 VDENRIFCCGLDLSDFFERESLFRST 210
           +  + I      L+D+F   ++ R T
Sbjct: 123 IVGSDIIYYSKSLTDYFHGITVSRET 148


>gi|257461335|ref|ZP_05626432.1| conserved hypothetical protein [Campylobacter gracilis RM3268]
 gi|257441363|gb|EEV16509.1| conserved hypothetical protein [Campylobacter gracilis RM3268]
          Length = 105

 Score = 43.9 bits (102), Expect = 0.15,   Method: Composition-based stats.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 13/100 (13%)

Query: 55  LKNSGIRVQPGLSDAEFARAEAEFGFVFPPDLRAILSAGLPLGPGFPDWRATGSARL-HL 113
           L+ SGI +  GL ++E  + E  + F FP  LR  LS  LP+   F +WR      +  +
Sbjct: 9   LEKSGIELSEGLKESEIDKIEQIYEFKFPKSLRDFLSYTLPISVEFYNWRDFSDENIKEI 68

Query: 114 RASLDL--------PIAAITFQIARNALW-SKSWGPRPSD 144
           + +++         PI  I         W ++ WGP P D
Sbjct: 69  KQAMNYVFEYLKNDPIDEI---FPNENYWNTQKWGPMPED 105


>gi|224367963|ref|YP_002602126.1| protein GlcD1 [Desulfobacterium autotrophicum HRM2]
 gi|223690679|gb|ACN13962.1| GlcD1 [Desulfobacterium autotrophicum HRM2]
          Length = 457

 Score = 42.4 bits (98), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 40/66 (60%), Gaps = 2/66 (3%)

Query: 22  LSARAAAPTTATS-LPVRNGLVSFSSLADKVI-THLKNSGIRVQPGLSDAEFARAEAEFG 79
           ++AR A  +   + +P+ +G+V   S  DK+I  + ++  + VQPG+ +A+  +A   FG
Sbjct: 66  VTARGAGTSVCGAPVPILHGIVVCFSKMDKIIEVNTRDRYVIVQPGVINADVQKALQPFG 125

Query: 80  FVFPPD 85
           F +PPD
Sbjct: 126 FFYPPD 131


>gi|392396229|ref|YP_006432830.1| hypothetical protein Fleli_0571 [Flexibacter litoralis DSM 6794]
 gi|390527307|gb|AFM03037.1| hypothetical protein Fleli_0571 [Flexibacter litoralis DSM 6794]
          Length = 304

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/164 (21%), Positives = 60/164 (36%), Gaps = 27/164 (16%)

Query: 65  GLSDAEFARAEAEFGFVFPPDLRAILSAGLPLG----------------------PGFPD 102
           GL ++E    E EF   F    RA L     +                       P F +
Sbjct: 47  GLDESEIDSIEKEFDINFTFHHRAFLKILHAIDKVEVVEYTESFDEDAEILKKEVPYFYN 106

Query: 103 WRATGSARLH-LRASLDLPIAAITFQIARNALWSKSWGP-RPSDPEKALRVARNALKRAP 160
           W+      +  L+ S +     +   +  N +W +SWG  RP    +  ++      + P
Sbjct: 107 WKTDKEEIIRRLKWSYETIFEDV---MGANKVWLESWGKIRPKSNTEKEKIFSEWFAKTP 163

Query: 161 LLIPIFNHCYIPCNPTLAGNPIFYVDENRIFCCGLDLSDFFERE 204
            LIP+ +H ++  N   A +PI  V  + I   G  + D+   E
Sbjct: 164 TLIPMCSHRFVVSNDLEADSPILSVYGSDIIVYGWSMRDYLLNE 207


>gi|417779642|ref|ZP_12427421.1| hypothetical protein LEP1GSC036_4525 [Leptospira weilii str.
           2006001853]
 gi|410780218|gb|EKR64818.1| hypothetical protein LEP1GSC036_4525 [Leptospira weilii str.
           2006001853]
          Length = 50

 Score = 38.5 bits (88), Expect = 5.8,   Method: Composition-based stats.
 Identities = 14/48 (29%), Positives = 25/48 (52%)

Query: 113 LRASLDLPIAAITFQIARNALWSKSWGPRPSDPEKALRVARNALKRAP 160
           ++  +D P   + F I  N+ W   WG +PS+  K +++A+     AP
Sbjct: 2   IQEKIDWPFEGMCFDIKNNSFWIDEWGEKPSEDSKLIQIAKYYYDLAP 49


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,283,620,527
Number of Sequences: 23463169
Number of extensions: 268191198
Number of successful extensions: 758194
Number of sequences better than 100.0: 139
Number of HSP's better than 100.0 without gapping: 118
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 757709
Number of HSP's gapped (non-prelim): 165
length of query: 397
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 252
effective length of database: 8,957,035,862
effective search space: 2257173037224
effective search space used: 2257173037224
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)