BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001659
         (1036 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|156365737|ref|XP_001626800.1| predicted protein [Nematostella vectensis]
 gi|156213689|gb|EDO34700.1| predicted protein [Nematostella vectensis]
          Length = 226

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 81/212 (38%), Positives = 132/212 (62%), Gaps = 2/212 (0%)

Query: 230 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 289
           SW  +DGI S  ++DG +SWA +D  +SW  K+G +    +DG +SW  +DG +S   +D
Sbjct: 4   SWAIKDGIHSWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKD 63

Query: 290 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 349
           G +SW  +DG +S A + G +SW  +D  +SW  +D  + W+ +DG +SW  +DG +S A
Sbjct: 64  GINSWAIKDGINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWA 123

Query: 350 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 409
            +DG +SW  +DG +S   +DG +SW+ +DG +SW  +DG +SW  +DG +SW  +DG +
Sbjct: 124 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGIN 183

Query: 410 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 439
            W+ +   +    K+G +SW  +DG +SW+ +
Sbjct: 184 YWAIKDGINSWAIKDGINSWAIKDGINSWAIK 215



 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 81/212 (38%), Positives = 131/212 (61%), Gaps = 2/212 (0%)

Query: 230 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 289
           SW  +DGI S  ++D  +SWA +DG +SW  K+G +    +DG +SW  +DG +S   +D
Sbjct: 13  SWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKD 72

Query: 290 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 349
           G +SW  + G +S A +DG +SW  +D  + W  +D  +SW+ +DG +SW  +DG +S A
Sbjct: 73  GINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWA 132

Query: 350 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 409
            +DG +SW  +DG +S   +DG +SW+ +DG +SW  +DG +SW  +DG + W  +DG +
Sbjct: 133 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINYWAIKDGIN 192

Query: 410 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 439
           SW+ +   +    K+G +SW  +DG  SW+ +
Sbjct: 193 SWAIKDGINSWAIKDGINSWAIKDGIHSWAIK 224



 Score =  133 bits (335), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 82/207 (39%), Positives = 128/207 (61%), Gaps = 7/207 (3%)

Query: 226 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 285
           DV +SW  +DGI S  ++DG +SWA +DG +SW  K+G +    +DG +SW  + G +S 
Sbjct: 27  DVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKDGINSWAIKYGINSW 86

Query: 286 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 345
             +DG +SW  +DG +  A +DG +SW  +D  +SW  +D  +SW+ +DG +SW  +DG 
Sbjct: 87  AIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGI 146

Query: 346 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 405
           +S A +DG +SW  +DG +S   +DG +SW+ +DG + W  +DG +SW  +DG +SW  +
Sbjct: 147 NSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINYWAIKDGINSWAIKDGINSWAIK 206

Query: 406 DGGSSWSKEPDQQHRKNGGSSWGNRDG 432
           DG +SW+        K+G  SW  +DG
Sbjct: 207 DGINSWA-------IKDGIHSWAIKDG 226


>gi|156349300|ref|XP_001622001.1| predicted protein [Nematostella vectensis]
 gi|156208385|gb|EDO29901.1| predicted protein [Nematostella vectensis]
          Length = 235

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 80/212 (37%), Positives = 131/212 (61%), Gaps = 2/212 (0%)

Query: 230 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 289
           SW  +DGI S  ++DG +SWA +D  +SW  K+G +    +DG +SW  +DG +S   +D
Sbjct: 4   SWAIKDGINSWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKD 63

Query: 290 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 349
           G +SW  +DG +S A + G +SW  +D  +SW  +D  + W+ +DG +SW  +DG +S A
Sbjct: 64  GINSWAIKDGINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWA 123

Query: 350 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 409
            +DG +SW  +DG +S   +DG +SW+ +DG +SW  +DG +SW  +DG +SW  +D  +
Sbjct: 124 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGVNSWAIKDRIN 183

Query: 410 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 439
            W+ +   +    K+G +SW  +DG +SW+ +
Sbjct: 184 YWAIKDGINSWAIKDGINSWAIKDGINSWAIK 215



 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 80/212 (37%), Positives = 131/212 (61%), Gaps = 2/212 (0%)

Query: 230 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 289
           SW  +DGI S  ++D  +SWA +DG +SW  K+G +    +DG +SW  +DG +S   +D
Sbjct: 13  SWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKD 72

Query: 290 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 349
           G +SW  + G +S A +DG +SW  +D  + W  +D  +SW+ +DG +SW  +DG +S A
Sbjct: 73  GINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWA 132

Query: 350 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 409
            +DG +SW  +DG +S   +DG +SW+ +DG +SW  +DG +SW  +D  + W  +DG +
Sbjct: 133 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGVNSWAIKDRINYWAIKDGIN 192

Query: 410 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 439
           SW+ +   +    K+G +SW  +DG +SW+ +
Sbjct: 193 SWAIKDGINSWAIKDGINSWAIKDGINSWAIK 224



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 81/207 (39%), Positives = 128/207 (61%), Gaps = 7/207 (3%)

Query: 226 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 285
           DV +SW  +DGI S  ++DG +SWA +DG +SW  K+G +    +DG +SW  + G +S 
Sbjct: 27  DVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKDGINSWAIKYGINSW 86

Query: 286 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 345
             +DG +SW  +DG +  A +DG +SW  +D  +SW  +D  +SW+ +DG +SW  +DG 
Sbjct: 87  AIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGI 146

Query: 346 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 405
           +S A +DG +SW  +DG +S   +DG +SW+ +D  + W  +DG +SW  +DG +SW  +
Sbjct: 147 NSWAIKDGINSWAIKDGINSWAIKDGVNSWAIKDRINYWAIKDGINSWAIKDGINSWAIK 206

Query: 406 DGGSSWSKEPDQQHRKNGGSSWGNRDG 432
           DG +SW+        K+G +SW  +DG
Sbjct: 207 DGINSWA-------IKDGINSWAIKDG 226


>gi|297806403|ref|XP_002871085.1| hypothetical protein ARALYDRAFT_487210 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297316922|gb|EFH47344.1| hypothetical protein ARALYDRAFT_487210 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1476

 Score =  110 bits (275), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 128/286 (44%), Positives = 149/286 (52%), Gaps = 85/286 (29%)

Query: 201  SGWKK---DDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSS 257
            SGW K   D + N     GW KP     D GSSW KQD       Q+G +SW K+D G S
Sbjct: 923  SGWNKSAEDSNANSKGVPGWGKP-----DGGSSWGKQDK------QEGVASWGKKDDGGS 971

Query: 258  WGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDR 317
            WG K+        DG SSWGK+D G    K DGGSSWGK+D        DGGSSWGK+D 
Sbjct: 972  WGNKD--------DGVSSWGKKDDGQ---KDDGGSSWGKKD--------DGGSSWGKKDD 1012

Query: 318  G-SSWGKQDEG-SSWSKRD-GGSSWGKQDGGSSLAKQDGGSSWGKQDGG----------- 363
            G  SWGK+D+G S W K+D GGSSWGK+D        DGGSSWGK+D G           
Sbjct: 1013 GGYSWGKKDDGGSLWGKKDDGGSSWGKKD--------DGGSSWGKKDDGGYSEQTFDMGG 1064

Query: 364  ---------------------SSLGKQDGGSSWSKQDGGSSWGKQD---GGSSWGKQD-- 397
                                 SS    +  + W+K  GGSSWG+QD   GGSSWGK++  
Sbjct: 1065 RGFGGRRGGGRRGGRDQFGRGSSFSNSEDLAPWNKPSGGSSWGQQDSDGGGSSWGKENDT 1124

Query: 398  -GGSSWGKQDGGSSWSKEPDQQHRKNGGSSWGNRD---GGSSWSKQ 439
             GGS WGKQD G   S    Q      GSSWG ++   GGSSW KQ
Sbjct: 1125 GGGSGWGKQDSGGGGSSWGKQNDASGSGSSWGKQNNAGGGSSWGKQ 1170


>gi|374583946|ref|ZP_09657038.1| pseudouridine synthase Rsu [Leptonema illini DSM 21528]
 gi|373872807|gb|EHQ04801.1| pseudouridine synthase Rsu [Leptonema illini DSM 21528]
          Length = 708

 Score =  103 bits (257), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 129/215 (60%), Gaps = 2/215 (0%)

Query: 226 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 285
           D G S++++D   S D ++GG S+ + D G S+ +K+ G    ++D G S+ ++DGG S 
Sbjct: 44  DSGRSFDRKDSGRSFDRKEGGRSFDRGDSGRSFDRKDSGRSFDRKDSGRSFDRKDGGRSF 103

Query: 286 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 345
            ++D G S+ ++D   S  ++D G S+ +++ G S+ ++D G S+ + D   S+ + +GG
Sbjct: 104 DRKDSGRSFDRKDSGRSFDRKDSGRSFDRKEGGRSFDRKDSGRSFDRGDSRRSFDRPEGG 163

Query: 346 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 405
            S  +++GG S+ + D   S  + +GG S+ +++GG S+ + D   S+ + +GG S+ ++
Sbjct: 164 RSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRK 223

Query: 406 DGGSSWSKEPDQQ--HRKNGGSSWGNRDGGSSWSK 438
           +GG S+ +   ++   R  GG S+  ++GG S+ +
Sbjct: 224 EGGRSFDRGDSRRSFDRPEGGRSYDRKEGGRSFDR 258



 Score = 86.3 bits (212), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 62/216 (28%), Positives = 124/216 (57%), Gaps = 15/216 (6%)

Query: 212 DQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQD 271
           D R  +++P+      G S+++++G  S D  D   S+ + +GG S+ +K GG    + D
Sbjct: 179 DSRRSFDRPEG-----GRSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRKEGGRSFDRGD 233

Query: 272 GGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWS 331
              S+ + +GG S  +++GG S+ + D R S  + +GG S+ +++ G S+ + D   S+ 
Sbjct: 234 SRRSFDRPEGGRSYDRKEGGRSFDRGDSRRSSDRPEGGRSFDRKEGGRSFDRGDSRRSFD 293

Query: 332 KRDGGSSWGKQDGG---------SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGS 382
           + +GG S+ +++GG          S  + +GG S+ ++DGG S  +++GG S+ + D   
Sbjct: 294 RPEGGRSFDRREGGGGFDRGDSRRSFDRPEGGRSFDRKDGGRSFDRKEGGRSFDRGDSRR 353

Query: 383 SWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQ 418
           S+ + +GG S+ +++GGSS+ + D G  + + P+++
Sbjct: 354 SFDRPEGGRSFDRKEGGSSFDRGDRGRGFDR-PERK 388


>gi|224109844|ref|XP_002315331.1| predicted protein [Populus trichocarpa]
 gi|222864371|gb|EEF01502.1| predicted protein [Populus trichocarpa]
          Length = 1697

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/186 (46%), Positives = 111/186 (59%), Gaps = 8/186 (4%)

Query: 490  GTGGMGDQGNGWNRRDKGTDWNKKCNWNSGSSDGDGNNGSGGWGKKSNWNSGSSGAGESK 549
            G  G+G Q + W    K T+ ++  +W+S     D N  SG  G KS+W  G+  + + K
Sbjct: 1135 GASGIGTQKSDWGA-PKTTEVDQLSSWDSKVGHVDANQSSGS-GNKSSW--GTQKSSQEK 1190

Query: 550  DTDWNKKSNLNCGSSDGDGNNSSGWDKKGNWNAGSSGDGESKDTDWNKKCNWNSGSNDGD 609
               WN++S      S  DGN  S W  K  WN+GSS  G + D+ W KK  WNS S++ D
Sbjct: 1191 -PGWNQESPELEKDSKRDGNQESSWGNKSGWNSGSSDAGRNSDSAWGKKSIWNSESSNAD 1249

Query: 610  GNNGSGWGKKSNWNSGSNVAGESNDSNWAKKGNWNSGSDDANQESSWGKKQGNWNSGSRD 669
            GN  SGW  KSNWNSGS  A +   S+WAKK NWNSGS DANQES WGKK  +W+SG  D
Sbjct: 1250 GNQDSGWATKSNWNSGSKDANQG--SSWAKKNNWNSGSSDANQESGWGKK-SSWSSGYGD 1306

Query: 670  GHQESS 675
            G+Q+SS
Sbjct: 1307 GNQDSS 1312



 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 101/231 (43%), Gaps = 61/231 (26%)

Query: 1   MWSCAQINV--KKMLY------------HAFFLL---------LQLHAESNPANAFGSGD 37
           M +C  + +  K++LY             A  ++         L    +++P N   S D
Sbjct: 674 MENCYSVEICEKEILYAWVMGMGKSVVNPAVLIMGWTSWPSLGLMDQPKTSPINPSSSAD 733

Query: 38  NGANKDEEDSAWGSKVNAIQNSSWGLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGK--- 94
           N  +KD    AWGS+    Q SSWG  A      D WNKAA  NI S +GA  GWGK   
Sbjct: 734 NELSKD---GAWGSQATGNQTSSWGAVAG-----DSWNKAA-SNIGSTSGASVGWGKATL 784

Query: 95  --EDAGSSLQDSQDNWGKNKDACDNQANWKKSDSWDKGKKIIGNSTSSWGDKTAEKNEPD 152
             ED   S + + DNWG+     +N      + +WDKGK + GN TSSWGD    KN+ D
Sbjct: 785 PNEDLAGSSRGTGDNWGRGNLRAENSL-IDSAVAWDKGKTVSGNQTSSWGDAATGKNQVD 843

Query: 153 SWGKGKDGS----------SGSKSDWNSSALATENPTVSWGNASGGWTQQK 193
           SWGK  D            SG+  D              W N S GW QQK
Sbjct: 844 SWGKCNDAIGAGSWEKKKRSGTGED-------------CWSNKSTGWNQQK 881


>gi|15237667|ref|NP_196049.1| kow domain-containing transcription factor 1 [Arabidopsis thaliana]
 gi|332003341|gb|AED90724.1| kow domain-containing transcription factor 1 [Arabidopsis thaliana]
          Length = 1493

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 187/467 (40%), Positives = 225/467 (48%), Gaps = 141/467 (30%)

Query: 36   GDNGANKDEEDSAWGSKVNAIQNSSWGLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGKE 95
            G +G NK +EDS WG    A ++S       + K E  W K    + ES+      WG +
Sbjct: 823  GSSGGNKQDEDSVWGKLCEASESS-------QKKEESSWGKKGGSDGESS------WGNK 869

Query: 96   DAGSSLQDSQ-DNWGKNKDACDNQ---ANWK-KSDSWDKGKKIIGNSTSSWGDKTAE--- 147
            D  SS       +WG+     D     + W  +   +  GKK  G  +S W +K+AE   
Sbjct: 870  DGNSSASKKDGVSWGQQDKGSDESKGGSAWSNQCGDFGSGKKKDG--SSGW-NKSAEDSN 926

Query: 148  ---KNEPDSWGKGKDGSS-GSKSDWNSSALATENPTVSWGNAS--GGWTQQKGGNMDE-R 200
               K  PD WG+  DGSS G K D             SWG     G W ++  GN D+  
Sbjct: 927  ANSKGVPD-WGQPNDGSSWGKKGD----------GAASWGKKDDGGSWGKKDDGNKDDGG 975

Query: 201  SGWKKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQ-DGGSSWG 259
            S W K D G +D             D GSSW K+         DGGSSW K+ DGGSSWG
Sbjct: 976  SSWGKKDDGQKD-------------DGGSSWEKK--------FDGGSSWGKKDDGGSSWG 1014

Query: 260  KK-NGGSLMGKQ-DGGSSWGKQDGGSSL--GKQDGGSSWGKQ-DGRSSLAKQ-DGGSSWG 313
            KK +GGSL GK+ DGGSSWGK+D G SL   K DG SSWGK+ DG SS  K+ DGGSSWG
Sbjct: 1015 KKDDGGSLWGKKDDGGSSWGKEDDGGSLWGKKDDGESSWGKKDDGESSWGKKDDGGSSWG 1074

Query: 314  KQDRG--------------------------------SSWGKQDEGSSWSKRDGGSSWGK 341
            K+D G                                SS+G  ++ + WSK  GGSSWGK
Sbjct: 1075 KKDEGGYSEQTFDRGGRGFGGRRGGGRRGGRDQFGRGSSFGNSEDPAPWSKPSGGSSWGK 1134

Query: 342  QDGGSSLAKQD------GGSSWGKQDGGSSLGKQDGGSSWSKQD---------------- 379
            QDG    +         GGSSWGKQD G        GSSW KQ+                
Sbjct: 1135 QDGDGGGSSWGKENDAGGGSSWGKQDNGV-------GSSWGKQNDGSGGGSSWGKQNDAG 1187

Query: 380  GGSSWGKQDG---GSSWGKQDG----GSSWGKQD---GGSSWSKEPD 416
            GGSSWGKQD    GSSWGKQDG    GS+WGKQ+   GGSSW K+ D
Sbjct: 1188 GGSSWGKQDSGGDGSSWGKQDGGGDSGSAWGKQNNTSGGSSWGKQSD 1234


>gi|224100723|ref|XP_002311988.1| hypothetical protein POPTRDRAFT_765239 [Populus trichocarpa]
 gi|222851808|gb|EEE89355.1| hypothetical protein POPTRDRAFT_765239 [Populus trichocarpa]
          Length = 1853

 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 78/162 (48%), Positives = 93/162 (57%), Gaps = 16/162 (9%)

Query: 538  WNSGSSGAGESKDTDW--------NKKSNLNCGSSDGDGNNSSGWDKKGNWNAGSSGDGE 589
            W +G SGAG ++ +DW        ++ S+ +  +   D N SSGW  K  WN  S    +
Sbjct: 1230 WKTGMSGAG-TQPSDWGAPKASKGDQSSSWDNKTGHVDANQSSGWGSKSCWNQKSPELEK 1288

Query: 590  SKDTD------WNKKCNWNSGSNDGDGNNGSGWGKKSNWNSGSNVAGESNDSNWAKKGNW 643
              + D      W KK N  S S+D  GN  S WGKK NWNS SN A  + DS WA K NW
Sbjct: 1289 DSEIDGNRNSSWGKKSNLKSESSDVGGNADSDWGKKGNWNSESNNADGNQDSGWANKSNW 1348

Query: 644  NSGSDDANQESSWGKKQGNWNSGSRDGHQESSWGKKSDWNSR 685
            NSGS DANQ SSW KK  NWNSGS D +QES W KKS W+SR
Sbjct: 1349 NSGSKDANQGSSWAKK-SNWNSGSSDVNQESGWDKKSSWSSR 1389



 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 119/375 (31%), Positives = 159/375 (42%), Gaps = 126/375 (33%)

Query: 772  QGGGWNNNDSGDYKSFDSSQGVKNG-GEWSRSNDGAGSWSQGGG-------TWKSG---- 819
            Q GGW++N+S + K+FD   G  N  G W ++N G  SW++GGG       +W SG    
Sbjct: 1495 QNGGWSDNNSAEDKTFDWKNGANNSSGGW-KNNGGGSSWNRGGGDRGHQQNSWNSGSGGT 1553

Query: 820  ---------------------NSGASSQDGGWSS-----------QGSGWNNSN------ 841
                                 +SG S   GGW+            QG+ W +SN      
Sbjct: 1554 SNEGGGWSSQGSGWNQSRTAKDSGGSDLAGGWNKGTCANSDVAWGQGNSWKSSNPSGEGW 1613

Query: 842  --TTNEVKGLSDQGGGWNKGAGGSAQAGGWGRQGSGWSSGTSTGNRGSNDSSIANDVKGP 899
              ++ E+KG  DQGGGWNKG  G+     WG QG+ W S   +G   S  S    ++KG 
Sbjct: 1614 SKSSKEIKGSEDQGGGWNKGPSGANSDAAWG-QGNSWKSSNPSGEGWSQSS---KEIKGS 1669

Query: 900  NDQVVGRNKGSNGSAQ--------------------------SGGWGN-QGSGWSSGTGS 932
             DQ  G NKG   SAQ                          SG  G  Q SGWS G+  
Sbjct: 1670 EDQGGGWNKGPGSSAQGGGWGTKGAGSGEAGMTGGDAMTWNQSGASGRGQSSGWS-GSTE 1728

Query: 933  GNKGSND-SNISN------------------KGPNDQGGGWNKGSGGSAQSGAWGNQGSG 973
            G +G+N    +++                  +G +DQG GWNKG   +AQ+G WG++G+G
Sbjct: 1729 GKEGTNTGRELTDPCGKASSTSSWNQSSKDVEGSDDQGSGWNKGPSSNAQAGGWGDKGAG 1788

Query: 974  WNGGTDSGNRGSNSDQPKSWNQSSVATDGGRSKDAGEGSSRG----------WG-KTAGS 1022
                        +    K+WNQSS    G  S        +G          WG K + S
Sbjct: 1789 ----------LGDGGDAKTWNQSSAFGGGQSSGWGQSSEVKGANETGKPADPWGNKASTS 1838

Query: 1023 SW-EKGNDGSGKGGW 1036
            SW  +GNDGS KGGW
Sbjct: 1839 SWGNEGNDGSSKGGW 1853


>gi|255586801|ref|XP_002534015.1| suppressor of ty, putative [Ricinus communis]
 gi|223525980|gb|EEF28368.1| suppressor of ty, putative [Ricinus communis]
          Length = 1547

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 118/325 (36%), Positives = 139/325 (42%), Gaps = 102/325 (31%)

Query: 61   WGLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGKEDAGSSLQDSQDNWGKNKDACDNQAN 120
            W       ++ED WNK +  NIES+ G  GGWGK  A  S +DS D WG+ K    N + 
Sbjct: 700  WNAGGITAESEDGWNKTST-NIESSGGTSGGWGK--AADSSKDSGDGWGQAKLDPGN-ST 755

Query: 121  WKKSDSWDKGKKIIGNSTSSWGDKTAEKNEPDSWG-----------KGKDGSSG----SK 165
               + +W+K K +  N TSSWGD    KN+ DSW            K K  ++G    SK
Sbjct: 756  LDAAAAWNKEKNVAENPTSSWGDVATAKNQQDSWTSKDTVESRSWEKSKSFTAGEDNLSK 815

Query: 166  S-------------DWNSSALATENPTV-------------------------------S 181
            S              W S+A A    TV                               S
Sbjct: 816  STGWNQQKSQNKWDTWRSTAEAQNKNTVQGDSWGKAKDSSVGGKVDWKSSTATAEKPTKS 875

Query: 182  WGNASGGWTQQKGGNMDERSGWK--KDDSGNQD-------------------------QR 214
            WGN  G W Q+   + DE S W   K D  NQ                          Q 
Sbjct: 876  WGNEGGSWAQE-SKSTDEASDWMNGKVDGANQTANWSNQKNQSEDAAGWTTGGSGSQSQT 934

Query: 215  SGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGS 274
              WNKPK+ GAD GSSW KQ          G       DGGSSW KK G S + KQ+GGS
Sbjct: 935  DNWNKPKSSGADGGSSWGKQ----------GKPETFDADGGSSWNKK-GESSLEKQEGGS 983

Query: 275  SWGKQDGGSSLGKQDGGSSWGKQDG 299
            SWGKQ G SS GKQ+GGSSW KQDG
Sbjct: 984  SWGKQGGASSWGKQEGGSSWSKQDG 1008



 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 55/123 (44%), Positives = 69/123 (56%), Gaps = 17/123 (13%)

Query: 526  NNGSGGWGKKSNWNSGSSGAGESKDTDWNKKSNLNCGSSDGDGNNSSGWDKKGNWNAGSS 585
            N+ +GGWGKK N + G +G G      WNK    +    +   ++S  W KKGNWN+ S 
Sbjct: 1085 NDQAGGWGKKPNTSWGDNGPG------WNKSHGADAKIGESKSHDSE-WGKKGNWNSASG 1137

Query: 586  GDGESKDTDWNKKCNWNSGSNDGDGNNGSGWGKKSNWNSGSNVAGESNDSN----WAKKG 641
              G +  + W KK NWNSGSN+GDGN  SGWG KS+ N       ES D+N    W KK 
Sbjct: 1138 DSGGNAGSSWGKKSNWNSGSNNGDGNQDSGWGNKSSLNL------ESGDANQSSGWGKKS 1191

Query: 642  NWN 644
            NWN
Sbjct: 1192 NWN 1194


>gi|296085969|emb|CBI31410.3| unnamed protein product [Vitis vinifera]
          Length = 1065

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 87/242 (35%), Positives = 114/242 (47%), Gaps = 59/242 (24%)

Query: 68  GKNEDCWNKAAVKNIESNNGAYGGWGKEDAGSS-----LQDSQDNWGKNKDACDNQAN-W 121
           GK+E+CWN++AV      + A   W K     S     L+D+ DNWGK K+     +N W
Sbjct: 169 GKSENCWNRSAV-TTGFGSSASDSWEKSKVSDSNQAGSLKDAGDNWGKGKNVAGTPSNGW 227

Query: 122 K-------KSDSWDKGKKI------IGNSTSSWGD-------------KTAEKNEPD--- 152
                   + D+W KGK +        + + S G+             K+ +K E     
Sbjct: 228 NDATTGNDQLDAWGKGKNVGEASCWEKSKSPSIGEDRWNNGGPGWNQQKSGDKREDTGGG 287

Query: 153 ---SWGKGKD------GSSGSKSDWNSSALATENPTVSWGNASGGWTQQKGGNMDERSGW 203
              +WGK  +      GSS SK DW SSA   EN T       GGW QQ+G   DE SGW
Sbjct: 288 DGSTWGKALESQEKGSGSSASKVDWKSSAARPENQT-------GGWAQQEGVGEDE-SGW 339

Query: 204 KKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNG 263
           +K    +QDQ+  WNKPKTF    GS+WN+Q    + D + GGS     D   SWGK NG
Sbjct: 340 RKGGFSSQDQKGSWNKPKTFDVTRGSAWNQQADGTNEDFK-GGS-----DQNGSWGKPNG 393

Query: 264 GS 265
            S
Sbjct: 394 FS 395



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 95/270 (35%), Positives = 121/270 (44%), Gaps = 67/270 (24%)

Query: 785  KSFDSSQGVKNGGEWSRSNDGAGSWSQGGG-----TWKSGNSGASSQDGGWSSQGSGWNN 839
            KSF+ ++G  N GE  + +DGA SW QGGG      W SG  G S+Q GGW       N 
Sbjct: 797  KSFEWNKGSNNNGEGWKGHDGAVSWGQGGGDKGPRNWNSGTGGTSNQYGGW-------NQ 849

Query: 840  SNTTNEVKGLSDQGGGWNKGAGGSAQAGGWGRQGSGWSSGTSTGNRGSNDSSIANDVKGP 899
            S    E  G   Q G WNKGA  +   GG   + S  +S +S  N+GS  +S+A      
Sbjct: 850  SKGVVESSGTGSQAGSWNKGASPAKDYGGSSVEASNLNSQSSGWNKGSAQTSVA------ 903

Query: 900  NDQVVGRNKGSNGSAQSGGWGNQGSGWSSGTGSGNKGSNDSNISNKGPNDQGGGWNKG-- 957
               VV  +              QG+ W     SG                  G W+ G  
Sbjct: 904  ---VVSND--------------QGNDWKQSNASGK-----------------GQWSAGNQ 929

Query: 958  SGGSAQSGAWGNQGSGWNGGTDSGNRGSNSDQPKSWNQSSVATDGGRS------KDAGEG 1011
            S   AQ+ AW  QGSGWN  T +G+     D  ++WNQS  A DG +S      K++ EG
Sbjct: 930  SIAGAQADAWDKQGSGWNRETSTGSGSMTRDGAETWNQSK-APDGAQSSAWNQTKNSKEG 988

Query: 1012 SSR------GWGKTAGSSWEKGNDGSGKGG 1035
            +S        WGK A +SW K  DGS KGG
Sbjct: 989  TSNFREATDSWGKAAANSWSKEKDGSSKGG 1018


>gi|50405439|ref|XP_456355.1| DEHA2A00484p [Debaryomyces hansenii CBS767]
 gi|49652019|emb|CAG84300.1| DEHA2A00484p [Debaryomyces hansenii CBS767]
          Length = 298

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/187 (25%), Positives = 65/187 (34%)

Query: 229 SSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQ 288
             WN  DG      +DG   W  +DG   W    G       +G   W   DG       
Sbjct: 102 EHWNDNDGKEHWKDRDGNDCWNDKDGKKHWKDHEGKKHWKDHEGKEHWKDHDGKEHWKDH 161

Query: 289 DGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSL 348
           DG   W  +DG+      +G   W  +D    W   +    W+ +DG   W   +G    
Sbjct: 162 DGNDCWNDKDGKKHWKDHEGKEHWNDKDGKEHWKDHEGNDCWNDKDGKKHWKDHEGKEHW 221

Query: 349 AKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGG 408
              +G   W   +G      +DG   W   DG   W  +DG   W   +G   W  QDG 
Sbjct: 222 KDHEGKKHWKDHEGKEHWNDKDGKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWKDQDGK 281

Query: 409 SSWSKEP 415
             + K P
Sbjct: 282 EHFYKPP 288



 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/195 (24%), Positives = 70/195 (35%), Gaps = 2/195 (1%)

Query: 245 GGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLA 304
           G  +W   +G   W   +G       DG   W  +DG       +G   W   DG+    
Sbjct: 55  GKKNWKDHEGKQHWNDNDGKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWNDNDGKEHWK 114

Query: 305 KQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGS 364
            +DG   W  +D    W   +    W   +G   W   DG       DG   W  +DG  
Sbjct: 115 DRDGNDCWNDKDGKKHWKDHEGKKHWKDHEGKEHWKDHDGKEHWKDHDGNDCWNDKDGKK 174

Query: 365 SLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQHRKN-- 422
                +G   W+ +DG   W   +G   W  +DG   W   +G   W     ++H K+  
Sbjct: 175 HWKDHEGKEHWNDKDGKEHWKDHEGNDCWNDKDGKKHWKDHEGKEHWKDHEGKKHWKDHE 234

Query: 423 GGSSWGNRDGGSSWS 437
           G   W ++DG   W 
Sbjct: 235 GKEHWNDKDGKEHWK 249



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/209 (23%), Positives = 72/209 (34%), Gaps = 7/209 (3%)

Query: 230 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 289
            WN +DG       +G   W   DG   W  ++G      +DG   W   +G       +
Sbjct: 85  CWNDKDGKKHWKDHEGKEHWNDNDGKEHWKDRDGNDCWNDKDGKKHWKDHEGKKHWKDHE 144

Query: 290 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 349
           G   W   DG+      DG   W  +D    W   +    W+ +DG   W   +G     
Sbjct: 145 GKEHWKDHDGKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWNDKDGKEHWKDHEGNDCWN 204

Query: 350 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 409
            +DG   W   +G       +G   W   +G   W  +DG   W   DG   W  +DG  
Sbjct: 205 DKDGKKHWKDHEGKEHWKDHEGKKHWKDHEGKEHWNDKDGKEHWKDHDGNDCWNDKDGKK 264

Query: 410 SWSKEPDQQHRKNGGSSWGNRDGGSSWSK 438
            W           G   W ++DG   + K
Sbjct: 265 HWKDHE-------GKEHWKDQDGKEHFYK 286



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/187 (25%), Positives = 62/187 (33%), Gaps = 7/187 (3%)

Query: 251 KQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGS 310
           KQ    S+G K+  S      G  +W   +G       DG   W   DG      +DG  
Sbjct: 34  KQLDSRSFGLKDFLSDHMPWSGKKNWKDHEGKQHWNDNDGKEHWKDHDGNDCWNDKDGKK 93

Query: 311 SWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQD 370
            W   +    W   D    W  RDG   W  +DG       +G   W   +G       D
Sbjct: 94  HWKDHEGKEHWNDNDGKEHWKDRDGNDCWNDKDGKKHWKDHEGKKHWKDHEGKEHWKDHD 153

Query: 371 GGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQHRKNGGSSWGNR 430
           G   W   DG   W  +DG   W   +G   W  +DG   W           G   W ++
Sbjct: 154 GKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWNDKDGKEHWKDHE-------GNDCWNDK 206

Query: 431 DGGSSWS 437
           DG   W 
Sbjct: 207 DGKKHWK 213


>gi|156385476|ref|XP_001633656.1| predicted protein [Nematostella vectensis]
 gi|156220729|gb|EDO41593.1| predicted protein [Nematostella vectensis]
          Length = 215

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 74/211 (35%), Positives = 101/211 (47%), Gaps = 7/211 (3%)

Query: 226 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 285
           D G S   +D   S  ++D G S   ++   S G ++ G   G +D G S G +D G S 
Sbjct: 1   DAGKSQGFRDAGESQGLRDEGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESQ 60

Query: 286 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 345
           G ++ G S G ++   S   +D G S G +D G S G +DEG S   R+   S G +D G
Sbjct: 61  GLRNAGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESRGLRNADESQGLRDAG 120

Query: 346 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 405
            S   +D G S G +D G S G +D G S   ++   S G +D G S G +D G S G +
Sbjct: 121 KSQGFRDAGESQGLRDEGESQGLRDAGESRGLRNADESQGLRDAGKSQGFRDHGESQGLR 180

Query: 406 DGGSSWSKEPDQQHRKNGGSSWGNRDGGSSW 436
           D G S          +N G S G RD G S 
Sbjct: 181 DEGESRGL-------RNAGESRGLRDAGESQ 204



 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 66/178 (37%), Positives = 89/178 (50%)

Query: 226 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 285
           D G S   +D   S  ++D G S   ++ G S G +N     G +D G S G +D G S 
Sbjct: 37  DAGKSQGFRDAGESQGLRDEGESQGLRNAGESRGLRNADESQGLRDAGKSQGFRDAGESQ 96

Query: 286 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 345
           G +D G S G ++   S   +D G S G +D G S G +DEG S   RD G S G ++  
Sbjct: 97  GLRDEGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESQGLRDAGESRGLRNAD 156

Query: 346 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWG 403
            S   +D G S G +D G S G +D G S   ++ G S G +D G S G +D G S G
Sbjct: 157 ESQGLRDAGKSQGFRDHGESQGLRDEGESRGLRNAGESRGLRDAGESQGFRDAGESRG 214



 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 84/169 (49%)

Query: 242 VQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRS 301
            +D G S   +D G S G +N G   G ++   S G +D G S G +D G S G +D   
Sbjct: 44  FRDAGESQGLRDEGESQGLRNAGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGE 103

Query: 302 SLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQD 361
           S   ++   S G +D G S G +D G S   RD G S G +D G S   ++   S G +D
Sbjct: 104 SRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESQGLRDAGESRGLRNADESQGLRD 163

Query: 362 GGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSS 410
            G S G +D G S   +D G S G ++ G S G +D G S G +D G S
Sbjct: 164 AGKSQGFRDHGESQGLRDEGESRGLRNAGESRGLRDAGESQGFRDAGES 212


>gi|359486871|ref|XP_002272748.2| PREDICTED: uncharacterized protein LOC100256388 [Vitis vinifera]
          Length = 2667

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/257 (31%), Positives = 111/257 (43%), Gaps = 89/257 (34%)

Query: 68   GKNEDCWNKAAVKNIESNNGAYGGWGKEDAGSSLQDSQDNWGKNKDACDNQANWKKS--D 125
            GK+E+CWN++AV           G+G          + D+W K+K +  NQA   K   D
Sbjct: 1817 GKSENCWNRSAVTT---------GFG--------SSASDSWEKSKVSDSNQAGSLKDAGD 1859

Query: 126  SWDKGKKIIGNSTSSWGDKTAEKNEPDSWGKGKDGSSGSKSDWNSSALATENPTV----- 180
            +W KGK + G  ++ W D T   ++ D+WGKGK+   G  S W  S    ++P++     
Sbjct: 1860 NWGKGKNVAGTPSNGWNDATTGNDQLDAWGKGKN--VGEASCWEKS----KSPSIGEDRW 1913

Query: 181  -------------------------SWGNA---------------------------SGG 188
                                     +WG A                           +GG
Sbjct: 1914 NNGGPGWNQQKSGDKREDTGGGDGSTWGKALESQEKGSGSSASKVDWKSSAARPENQTGG 1973

Query: 189  WTQQKGGNMDERSGWKKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSS 248
            W QQ+G   DE SGW+K    +QDQ+  WNKPKTF    GS+WN+Q    + D + GGS 
Sbjct: 1974 WAQQEGVGEDE-SGWRKGGFSSQDQKGSWNKPKTFDVTRGSAWNQQADGTNEDFK-GGS- 2030

Query: 249  WAKQDGGSSWGKKNGGS 265
                D   SWGK NG S
Sbjct: 2031 ----DQNGSWGKPNGFS 2043


>gi|356574169|ref|XP_003555224.1| PREDICTED: uncharacterized protein LOC100818587 [Glycine max]
          Length = 1495

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/209 (36%), Positives = 98/209 (46%), Gaps = 59/209 (28%)

Query: 522  DGDGNNGSGGWGKKSNWNSGSSGAGESKDTDWNKKSNLNCGSSDGDGNNSSGWDKKGNWN 581
            D  GN GS GWG+KSNWNSGS    E++++ W+   N         GN  S  DKK NWN
Sbjct: 1004 DSSGNQGSNGWGQKSNWNSGSRSGNENQNSHWSSGRN-------EPGNQDSNLDKKSNWN 1056

Query: 582  AGSSGDGESKDTDWNKKCNWNSGSNDGD--------GNNGSGWG-----KKSNWNSGSNV 628
            +G+SG+  S      K  NWNSGS + +         NN S WG     K S+W+SG + 
Sbjct: 1057 SGNSGNLASDP----KSSNWNSGSGNSNENSNWGTNVNNKSSWGTGNENKNSSWSSGHSD 1112

Query: 629  AGESNDSNWAKKGNWNSGS---------------------------------DDANQESS 655
             G + D+N  KK NWNSG+                                 D  N +S+
Sbjct: 1113 PG-NQDANQGKKSNWNSGNSGNQPSDPNSNWNSNKSSWSAGNENKKSNWSSGDPGNTDSN 1171

Query: 656  WGKKQGNWNSGSRDGHQESSWGKKSDWNS 684
            WG K  N  SGS D +Q +SW   S WN+
Sbjct: 1172 WGNK-NNCISGSGDANQNTSWRSNSSWNT 1199


>gi|260940439|ref|XP_002614519.1| hypothetical protein CLUG_05296 [Clavispora lusitaniae ATCC 42720]
 gi|238851705|gb|EEQ41169.1| hypothetical protein CLUG_05296 [Clavispora lusitaniae ATCC 42720]
          Length = 708

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 96/258 (37%), Positives = 164/258 (63%), Gaps = 53/258 (20%)

Query: 187 GGWTQQKGGNMDERSGWKKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGG 246
           GG +++ GG  +E  G  ++D G  ++  G ++      D G+S  ++DG  S   +DGG
Sbjct: 347 GGTSEEDGGTSEEDGGTSEEDGGTSEEDGGTSE-----EDGGTS--EEDGGTSE--EDGG 397

Query: 247 SSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQ 306
           +S  ++DGG+S  +++GG+   ++DGG+S  ++DGG+S  ++DGG+S  ++DG +S  ++
Sbjct: 398 TS--EEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS--EE 443

Query: 307 DGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSL 366
           DGG+S  ++D G+S  ++D G+  S+ DGG+S  ++DGG+S  ++DGG+S  ++DGG+S 
Sbjct: 444 DGGTS--EEDGGTS--EEDGGT--SEEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS- 490

Query: 367 GKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWG---KQDGGSS-----WSKEPDQQ 418
            ++DGG+  S++DGG+S  ++DGG S  + DG S  G    +DGG+S      S E D  
Sbjct: 491 -EEDGGT--SEEDGGTS--EEDGGISV-EDDGASEVGGTSVEDGGTSEEVGGISVE-DDG 543

Query: 419 HRKNGGSSWGNRDGGSSW 436
             + GG+S    DGG+S 
Sbjct: 544 ASEVGGTSV--EDGGTSV 559


>gi|260820124|ref|XP_002605385.1| hypothetical protein BRAFLDRAFT_74205 [Branchiostoma floridae]
 gi|229290718|gb|EEN61395.1| hypothetical protein BRAFLDRAFT_74205 [Branchiostoma floridae]
          Length = 504

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/231 (23%), Positives = 100/231 (43%), Gaps = 8/231 (3%)

Query: 225 ADVGSSWNKQD--GICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGG 282
           +D G  +++ D  G+CS    D G   ++ D G  + + + G +  + D G  + + D G
Sbjct: 61  SDSGGVYSESDSGGVCSE--SDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSG 118

Query: 283 SSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQ 342
               + D G  + + D     ++ D G  + + D G  + + D G  +S+ D G  + + 
Sbjct: 119 GVYSESDSGGVYSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSES 178

Query: 343 DGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSW 402
           D G   ++ D G  + + D G    + D G  +S+ D G  + + D G    + D G  +
Sbjct: 179 DSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVCSESDSGGVY 238

Query: 403 GKQDGGSSWSKEPD---QQHRKNGGSSWGNRDGGSSWSKQTDQQDNQEKPL 450
            + D G  +S E D        + G  +   D G   S+     +N+EK L
Sbjct: 239 SESDSGGVYS-ESDSGGVYSESDSGGVYSESDSGGVCSESDSGGENREKFL 288



 Score = 43.9 bits (102), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 45/205 (21%), Positives = 87/205 (42%), Gaps = 18/205 (8%)

Query: 225 ADVGSSWNKQD--GICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGG 282
           +D G  +++ D  G+CS    D G  +++ D G  + + + G +  + D G  + + D G
Sbjct: 214 SDSGGVYSESDSGGVCSE--SDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSG 271

Query: 283 SSLGKQDGG--------------SSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGS 328
               + D G              + + + D     ++ D G    + D G  + + D G 
Sbjct: 272 GVCSESDSGGENREKFLWFPSSAAVYSESDSGGVYSESDSGGVCSESDSGGVYSESDSGG 331

Query: 329 SWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQD 388
            +S+ D G    + D G   ++ D G  + + D G    + D G  +S+ D G  + + D
Sbjct: 332 VYSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESD 391

Query: 389 GGSSWGKQDGGSSWGKQDGGSSWSK 413
            G  + + D G  + + D G  +S+
Sbjct: 392 SGGVYSESDSGGVYSESDSGGVYSE 416



 Score = 40.8 bits (94), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 37/172 (21%), Positives = 74/172 (43%)

Query: 242 VQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRS 301
           V + G  +++ D G  + + N G +  + D G  + + + G    + +    + + D   
Sbjct: 6   VSNSGGVYSESDSGGVYSESNSGGVYSESDSGGVYSESNSGGVCSESNSRGVYSESDSGG 65

Query: 302 SLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQD 361
             ++ D G    + D G    + D G  +S+ D G  + + D G   ++ D G  + + D
Sbjct: 66  VYSESDSGGVCSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESD 125

Query: 362 GGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSK 413
            G    + D G   S+ D G  + + D G  + + D G  + + D G  +S+
Sbjct: 126 SGGVYSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSE 177


>gi|366993156|ref|XP_003676343.1| hypothetical protein NCAS_0D04010 [Naumovozyma castellii CBS 4309]
 gi|342302209|emb|CCC69982.1| hypothetical protein NCAS_0D04010 [Naumovozyma castellii CBS 4309]
          Length = 560

 Score = 46.2 bits (108), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 87/342 (25%), Positives = 139/342 (40%), Gaps = 47/342 (13%)

Query: 204 KKDDSGNQDQRSGW---NKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGK 260
           KK   GN D    +   NK  ++G D     N  D   SS+ +   SS+   D   S+G 
Sbjct: 182 KKSSYGNDDNNDSYGSSNKKSSYGDD-----NNTDSYGSSNKK---SSYGNDDNTDSYGS 233

Query: 261 KNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSS 320
            N     G  +  SS+G  D   S G  +   S+G  + +SS    D   S+G  +   S
Sbjct: 234 SNNDDSYGSSNKKSSYGNDDNTDSYGSSNNDDSYGSSNKKSSYGNDDNTDSYGSSNNDDS 293

Query: 321 WGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDG 380
           +G  ++ SS+   D   S+G  +   S    +  SS+G  D   S G  +   S+   + 
Sbjct: 294 YGSSNKKSSYGNDDNTDSYGSSNNNDSYGSSNKKSSYGNDDNTDSYGSSNINDSYGSSNK 353

Query: 381 GSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQHRKNGGSSWGNRDGGSSW--SK 438
            SS+G  D   S+G  +  SS+G  D   S+       ++KN   S+GN D  +S+  S 
Sbjct: 354 KSSYGNDDNNDSYGSSNKKSSYGNDDNTDSYG----SSNKKN---SYGNDDNNNSYGSSN 406

Query: 439 QTDQQDNQEKPLESDGGRGSGGRWGQGGGQG--GGQEVSDQYGRGSFDQGSEKGTGGMGD 496
           + D            G   +   +G    +   G  + +D YG       ++K + G  D
Sbjct: 407 KNDSY----------GSSNNNDSYGSSNKKSSYGNDDNTDSYG-----SSNKKNSYGNDD 451

Query: 497 QGNGWNRRDKGTDWNKKCNWNSGSSDGDGNNGSGGWGKKSNW 538
             N +   +K   +        GSS+   N+  G   KKS++
Sbjct: 452 NNNSYGSSNKNDSY--------GSSNN--NDSYGSSNKKSSY 483



 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 87/217 (40%), Gaps = 26/217 (11%)

Query: 208 SGNQDQRSGWNKPKTFGAD----------VGSSW---NKQDGICSSDVQDGGSSWAKQDG 254
           S N D     NK  ++G D          +  S+   NK+            SS+   D 
Sbjct: 315 SNNNDSYGSSNKKSSYGNDDNTDSYGSSNINDSYGSSNKK------------SSYGNDDN 362

Query: 255 GSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGK 314
             S+G  N  S  G  D   S+G  +  +S G  D  +S+G  +   S    +   S+G 
Sbjct: 363 NDSYGSSNKKSSYGNDDNTDSYGSSNKKNSYGNDDNNNSYGSSNKNDSYGSSNNNDSYGS 422

Query: 315 QDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSS 374
            ++ SS+G  D   S+   +  +S+G  D  +S    +   S+G  +   S G  +  SS
Sbjct: 423 SNKKSSYGNDDNTDSYGSSNKKNSYGNDDNNNSYGSSNKNDSYGSSNNNDSYGSSNKKSS 482

Query: 375 WSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSW 411
           +   D   S+G      S+G  +  SS+G  D   S+
Sbjct: 483 YGNDDNTDSYGSN-NNDSYGSANKNSSYGNDDNDDSY 518


>gi|356534384|ref|XP_003535735.1| PREDICTED: uncharacterized protein LOC100788244 [Glycine max]
          Length = 715

 Score = 44.3 bits (103), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 71/188 (37%), Positives = 97/188 (51%), Gaps = 40/188 (21%)

Query: 532 WGKKSNWNSGSSGAGESKDTDW--------NKKSNLNCGSS--DGDGNNSSGWDKKGNWN 581
           WG KSNWNSGS    E++++ W        N+ SNL+  ++    +G N +   K   WN
Sbjct: 263 WGPKSNWNSGSRSGNENQNSHWSSGHTEPRNQDSNLDKKNNWISVNGANLASDPKSSKWN 322

Query: 582 AGS--------SGDGESKDTDW-----NKKCNWNSGSNDGDGNNGSGWGKKSNWNSGS-- 626
           +GS         G+  +  + W     NK  NW+SG N G  +  S  GKKSNWNSG   
Sbjct: 323 SGSGNSNENSNWGNNGNNKSSWGAGNENKNSNWSSGRN-GPEDQESNQGKKSNWNSGDSD 381

Query: 627 NVAGESNDS------------NWAKKGNWNSGSDDANQESSWGKKQGNWNSGSRDGHQES 674
           N A + N+S            N  +  NW+SG D  N++S+WGKK  NWNSGS D +Q +
Sbjct: 382 NQASDPNNSNWNSNKSSWSAGNENRNSNWSSG-DPGNKDSNWGKKS-NWNSGSGDANQNT 439

Query: 675 SWGKKSDW 682
           +WG  S W
Sbjct: 440 TWGSNSSW 447


>gi|323456761|gb|EGB12627.1| hypothetical protein AURANDRAFT_70522 [Aureococcus anophagefferens]
          Length = 2743

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 55/134 (41%), Positives = 73/134 (54%), Gaps = 14/134 (10%)

Query: 244 DGGSSWAK-QDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSS 302
           DGG+S++   DGG+S+   +        DGG+S+   D GSS    DGGS+    DG SS
Sbjct: 52  DGGTSYSSNDDGGTSYSSND--------DGGTSYSSNDDGSSASNDDGGSA-SNDDG-SS 101

Query: 303 LAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDG 362
            +  DGGS     D GSS    D G+S+S  D G S    DGGS+ +  DGG+S+   D 
Sbjct: 102 ASNDDGGS--ASNDDGSSASNDDGGTSYSSNDDGGSASNDDGGSA-SNDDGGTSYSSNDD 158

Query: 363 GSSLGKQDGGSSWS 376
           G S    DGG+S+S
Sbjct: 159 GGSASNDDGGTSYS 172


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.302    0.127    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,433,958,181
Number of Sequences: 23463169
Number of extensions: 1418552727
Number of successful extensions: 11201367
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 17696
Number of HSP's successfully gapped in prelim test: 50020
Number of HSP's that attempted gapping in prelim test: 4646196
Number of HSP's gapped (non-prelim): 2054431
length of query: 1036
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 883
effective length of database: 8,769,330,510
effective search space: 7743318840330
effective search space used: 7743318840330
T: 11
A: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 82 (36.2 bits)