BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001496
(1066 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|156365737|ref|XP_001626800.1| predicted protein [Nematostella vectensis]
gi|156213689|gb|EDO34700.1| predicted protein [Nematostella vectensis]
Length = 226
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/212 (38%), Positives = 132/212 (62%), Gaps = 2/212 (0%)
Query: 260 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 319
SW +DGI S ++DG +SWA +D +SW K+G + +DG +SW +DG +S +D
Sbjct: 4 SWAIKDGIHSWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKD 63
Query: 320 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 379
G +SW +DG +S A + G +SW +D +SW +D + W+ +DG +SW +DG +S A
Sbjct: 64 GINSWAIKDGINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWA 123
Query: 380 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 439
+DG +SW +DG +S +DG +SW+ +DG +SW +DG +SW +DG +SW +DG +
Sbjct: 124 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGIN 183
Query: 440 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 469
W+ + + K+G +SW +DG +SW+ +
Sbjct: 184 YWAIKDGINSWAIKDGINSWAIKDGINSWAIK 215
Score = 134 bits (337), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 81/212 (38%), Positives = 131/212 (61%), Gaps = 2/212 (0%)
Query: 260 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 319
SW +DGI S ++D +SWA +DG +SW K+G + +DG +SW +DG +S +D
Sbjct: 13 SWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKD 72
Query: 320 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 379
G +SW + G +S A +DG +SW +D + W +D +SW+ +DG +SW +DG +S A
Sbjct: 73 GINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWA 132
Query: 380 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 439
+DG +SW +DG +S +DG +SW+ +DG +SW +DG +SW +DG + W +DG +
Sbjct: 133 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINYWAIKDGIN 192
Query: 440 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 469
SW+ + + K+G +SW +DG SW+ +
Sbjct: 193 SWAIKDGINSWAIKDGINSWAIKDGIHSWAIK 224
Score = 134 bits (336), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 128/207 (61%), Gaps = 7/207 (3%)
Query: 256 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 315
DV +SW +DGI S ++DG +SWA +DG +SW K+G + +DG +SW + G +S
Sbjct: 27 DVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKDGINSWAIKYGINSW 86
Query: 316 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 375
+DG +SW +DG + A +DG +SW +D +SW +D +SW+ +DG +SW +DG
Sbjct: 87 AIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGI 146
Query: 376 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 435
+S A +DG +SW +DG +S +DG +SW+ +DG + W +DG +SW +DG +SW +
Sbjct: 147 NSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINYWAIKDGINSWAIKDGINSWAIK 206
Query: 436 DGGSSWSKEPDQQHRKNGGSSWGNRDG 462
DG +SW+ K+G SW +DG
Sbjct: 207 DGINSWA-------IKDGIHSWAIKDG 226
>gi|156349300|ref|XP_001622001.1| predicted protein [Nematostella vectensis]
gi|156208385|gb|EDO29901.1| predicted protein [Nematostella vectensis]
Length = 235
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/212 (37%), Positives = 131/212 (61%), Gaps = 2/212 (0%)
Query: 260 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 319
SW +DGI S ++DG +SWA +D +SW K+G + +DG +SW +DG +S +D
Sbjct: 4 SWAIKDGINSWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKD 63
Query: 320 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 379
G +SW +DG +S A + G +SW +D +SW +D + W+ +DG +SW +DG +S A
Sbjct: 64 GINSWAIKDGINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWA 123
Query: 380 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 439
+DG +SW +DG +S +DG +SW+ +DG +SW +DG +SW +DG +SW +D +
Sbjct: 124 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGVNSWAIKDRIN 183
Query: 440 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 469
W+ + + K+G +SW +DG +SW+ +
Sbjct: 184 YWAIKDGINSWAIKDGINSWAIKDGINSWAIK 215
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/212 (37%), Positives = 131/212 (61%), Gaps = 2/212 (0%)
Query: 260 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 319
SW +DGI S ++D +SWA +DG +SW K+G + +DG +SW +DG +S +D
Sbjct: 13 SWAIKDGINSWAIKDVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKD 72
Query: 320 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 379
G +SW + G +S A +DG +SW +D + W +D +SW+ +DG +SW +DG +S A
Sbjct: 73 GINSWAIKYGINSWAIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWA 132
Query: 380 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 439
+DG +SW +DG +S +DG +SW+ +DG +SW +DG +SW +D + W +DG +
Sbjct: 133 IKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGVNSWAIKDRINYWAIKDGIN 192
Query: 440 SWSKE--PDQQHRKNGGSSWGNRDGGSSWSKQ 469
SW+ + + K+G +SW +DG +SW+ +
Sbjct: 193 SWAIKDGINSWAIKDGINSWAIKDGINSWAIK 224
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/207 (39%), Positives = 128/207 (61%), Gaps = 7/207 (3%)
Query: 256 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 315
DV +SW +DGI S ++DG +SWA +DG +SW K+G + +DG +SW + G +S
Sbjct: 27 DVINSWAIKDGINSWAIKDGINSWAFKDGINSWAIKDGINSWAIKDGINSWAIKYGINSW 86
Query: 316 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 375
+DG +SW +DG + A +DG +SW +D +SW +D +SW+ +DG +SW +DG
Sbjct: 87 AIKDGINSWAIKDGINFWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGINSWAIKDGI 146
Query: 376 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 435
+S A +DG +SW +DG +S +DG +SW+ +D + W +DG +SW +DG +SW +
Sbjct: 147 NSWAIKDGINSWAIKDGINSWAIKDGVNSWAIKDRINYWAIKDGINSWAIKDGINSWAIK 206
Query: 436 DGGSSWSKEPDQQHRKNGGSSWGNRDG 462
DG +SW+ K+G +SW +DG
Sbjct: 207 DGINSWA-------IKDGINSWAIKDG 226
>gi|297806403|ref|XP_002871085.1| hypothetical protein ARALYDRAFT_487210 [Arabidopsis lyrata subsp.
lyrata]
gi|297316922|gb|EFH47344.1| hypothetical protein ARALYDRAFT_487210 [Arabidopsis lyrata subsp.
lyrata]
Length = 1476
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 149/286 (52%), Gaps = 85/286 (29%)
Query: 231 SGWKK---DDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSS 287
SGW K D + N GW KP D GSSW KQD Q+G +SW K+D G S
Sbjct: 923 SGWNKSAEDSNANSKGVPGWGKP-----DGGSSWGKQDK------QEGVASWGKKDDGGS 971
Query: 288 WGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDR 347
WG K+ DG SSWGK+D G K DGGSSWGK+D DGGSSWGK+D
Sbjct: 972 WGNKD--------DGVSSWGKKDDGQ---KDDGGSSWGKKD--------DGGSSWGKKDD 1012
Query: 348 G-SSWGKQDEG-SSWSKRD-GGSSWGKQDGGSSLAKQDGGSSWGKQDGG----------- 393
G SWGK+D+G S W K+D GGSSWGK+D DGGSSWGK+D G
Sbjct: 1013 GGYSWGKKDDGGSLWGKKDDGGSSWGKKD--------DGGSSWGKKDDGGYSEQTFDMGG 1064
Query: 394 ---------------------SSLGKQDGGSSWSKQDGGSSWGKQD---GGSSWGKQD-- 427
SS + + W+K GGSSWG+QD GGSSWGK++
Sbjct: 1065 RGFGGRRGGGRRGGRDQFGRGSSFSNSEDLAPWNKPSGGSSWGQQDSDGGGSSWGKENDT 1124
Query: 428 -GGSSWGKQDGGSSWSKEPDQQHRKNGGSSWGNRD---GGSSWSKQ 469
GGS WGKQD G S Q GSSWG ++ GGSSW KQ
Sbjct: 1125 GGGSGWGKQDSGGGGSSWGKQNDASGSGSSWGKQNNAGGGSSWGKQ 1170
>gi|255586801|ref|XP_002534015.1| suppressor of ty, putative [Ricinus communis]
gi|223525980|gb|EEF28368.1| suppressor of ty, putative [Ricinus communis]
Length = 1547
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 170/358 (47%), Gaps = 74/358 (20%)
Query: 14 GGKDWMSAATTSAEGDRWNAGGASAGRSS-WPSFPVVGTSLHAESNPANAFGSGDNGANK 72
G K W A TSA+GDRWNAGG +A W TS + ES+ + G G +
Sbjct: 683 GSKGWTDGAGTSADGDRWNAGGITAESEDGWNK-----TSTNIESSGGTSGGWGKAADSS 737
Query: 73 DEEDSAWGSKVNAIQNSSWGLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGKEDAGSSLQ 132
+ WG NS+ AAA WNK KN+ N WG D ++ +
Sbjct: 738 KDSGDGWGQAKLDPGNSTLDAAAA-------WNKE--KNVAEN--PTSSWG--DVATA-K 783
Query: 133 DSQDNWGKNKDACDNQANWKKSDSWDKGKKIIGNST-----------SSWGDKTAEKNEP 181
+ QD+W +KD ++++ W+KS S+ G+ + ST +W +N+
Sbjct: 784 NQQDSW-TSKDTVESRS-WEKSKSFTAGEDNLSKSTGWNQQKSQNKWDTWRSTAEAQNKN 841
Query: 182 ----DSWGKGKDGSSGSKSDWNSSALATENPTVSWGNASGGWTQQ----------KGGNM 227
DSWGK KD S G K DW SS E PT SWGN G W Q+ G +
Sbjct: 842 TVQGDSWGKAKDSSVGGKVDWKSSTATAEKPTKSWGNEGGSWAQESKSTDEASDWMNGKV 901
Query: 228 D----------------ERSGWKKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSD 271
D + +GW SG+Q Q WNKPK+ GAD GSSW KQ
Sbjct: 902 DGANQTANWSNQKNQSEDAAGWTTGGSGSQSQTDNWNKPKSSGADGGSSWGKQ------- 954
Query: 272 VQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDG 329
G DGGSSW KK G S + KQ+GGSSWGKQ G SS GKQ+GGSSW KQDG
Sbjct: 955 ---GKPETFDADGGSSWNKK-GESSLEKQEGGSSWGKQGGASSWGKQEGGSSWSKQDG 1008
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 55/123 (44%), Positives = 69/123 (56%), Gaps = 17/123 (13%)
Query: 556 NNGSGGWGKKSNWNSGSSGAGESKDTDWNKKSNLNCGSSDGDGNNSSGWDKKGNWNAGSS 615
N+ +GGWGKK N + G +G G WNK + + ++S W KKGNWN+ S
Sbjct: 1085 NDQAGGWGKKPNTSWGDNGPG------WNKSHGADAKIGESKSHDSE-WGKKGNWNSASG 1137
Query: 616 GDGESKDTDWNKKCNWNSGSNDGDGNNGSGWGKKSNWNSGSNVAGESNDSN----WAKKG 671
G + + W KK NWNSGSN+GDGN SGWG KS+ N ES D+N W KK
Sbjct: 1138 DSGGNAGSSWGKKSNWNSGSNNGDGNQDSGWGNKSSLNL------ESGDANQSSGWGKKS 1191
Query: 672 NWN 674
NWN
Sbjct: 1192 NWN 1194
>gi|374583946|ref|ZP_09657038.1| pseudouridine synthase Rsu [Leptonema illini DSM 21528]
gi|373872807|gb|EHQ04801.1| pseudouridine synthase Rsu [Leptonema illini DSM 21528]
Length = 708
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 129/215 (60%), Gaps = 2/215 (0%)
Query: 256 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 315
D G S++++D S D ++GG S+ + D G S+ +K+ G ++D G S+ ++DGG S
Sbjct: 44 DSGRSFDRKDSGRSFDRKEGGRSFDRGDSGRSFDRKDSGRSFDRKDSGRSFDRKDGGRSF 103
Query: 316 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 375
++D G S+ ++D S ++D G S+ +++ G S+ ++D G S+ + D S+ + +GG
Sbjct: 104 DRKDSGRSFDRKDSGRSFDRKDSGRSFDRKEGGRSFDRKDSGRSFDRGDSRRSFDRPEGG 163
Query: 376 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 435
S +++GG S+ + D S + +GG S+ +++GG S+ + D S+ + +GG S+ ++
Sbjct: 164 RSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRK 223
Query: 436 DGGSSWSKEPDQQ--HRKNGGSSWGNRDGGSSWSK 468
+GG S+ + ++ R GG S+ ++GG S+ +
Sbjct: 224 EGGRSFDRGDSRRSFDRPEGGRSYDRKEGGRSFDR 258
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 173/363 (47%), Gaps = 64/363 (17%)
Query: 256 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 315
D G S++++D S D +DGG S+ ++D G S+ +K+ G ++D G S+ +++GG S
Sbjct: 80 DSGRSFDRKDSGRSFDRKDGGRSFDRKDSGRSFDRKDSGRSFDRKDSGRSFDRKEGGRSF 139
Query: 316 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 375
++D G S+ + D R S + +GG S+ +++ G S+ + D S+ + +GG S+ +++GG
Sbjct: 140 DRKDSGRSFDRGDSRRSFDRPEGGRSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRKEGG 199
Query: 376 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 435
S + D S+ + +GG S +++GG S+ + D S+ + +GG S+ +++GG S+ +
Sbjct: 200 RSFDRGDSRRSFDRPEGGRSFDRKEGGRSFDRGDSRRSFDRPEGGRSYDRKEGGRSFDRG 259
Query: 436 DGGSSWSKEPDQQHRKNGGSSWGNRDGGSSWSKQTDQQDNQEKPLESDGGRGSGGRWGQG 495
D S R GG S+ ++GG S+ + D + + ++P +GGR R G G
Sbjct: 260 DSRRS-------SDRPEGGRSFDRKEGGRSFDR-GDSRRSFDRP---EGGRSFDRREGGG 308
Query: 496 GG--------------------QGGGQEVSDQYGRGSFDQGS-----EKGTGGM------ 524
G + GG+ + G SFD+G ++ GG
Sbjct: 309 GFDRGDSRRSFDRPEGGRSFDRKDGGRSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRKE 368
Query: 525 -------GDQGNGWNRRDKGTDWNKKCNWNSGSSDGDGNNGSGGWGKKSNWNSGSSGAGE 577
GD+G G++R ++ K S D W SG++ E
Sbjct: 369 GGSSFDRGDRGRGFDRPERKGPAPKATRLRPMSED---------------WESGAAAIEE 413
Query: 578 SKD 580
D
Sbjct: 414 EPD 416
Score = 86.3 bits (212), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 124/216 (57%), Gaps = 15/216 (6%)
Query: 242 DQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQD 301
D R +++P+ G S+++++G S D D S+ + +GG S+ +K GG + D
Sbjct: 179 DSRRSFDRPEG-----GRSFDRKEGGRSFDRGDSRRSFDRPEGGRSFDRKEGGRSFDRGD 233
Query: 302 GGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWS 361
S+ + +GG S +++GG S+ + D R S + +GG S+ +++ G S+ + D S+
Sbjct: 234 SRRSFDRPEGGRSYDRKEGGRSFDRGDSRRSSDRPEGGRSFDRKEGGRSFDRGDSRRSFD 293
Query: 362 KRDGGSSWGKQDGG---------SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGS 412
+ +GG S+ +++GG S + +GG S+ ++DGG S +++GG S+ + D
Sbjct: 294 RPEGGRSFDRREGGGGFDRGDSRRSFDRPEGGRSFDRKDGGRSFDRKEGGRSFDRGDSRR 353
Query: 413 SWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQ 448
S+ + +GG S+ +++GGSS+ + D G + + P+++
Sbjct: 354 SFDRPEGGRSFDRKEGGSSFDRGDRGRGFDR-PERK 388
>gi|224109844|ref|XP_002315331.1| predicted protein [Populus trichocarpa]
gi|222864371|gb|EEF01502.1| predicted protein [Populus trichocarpa]
Length = 1697
Score = 99.4 bits (246), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 87/186 (46%), Positives = 111/186 (59%), Gaps = 8/186 (4%)
Query: 520 GTGGMGDQGNGWNRRDKGTDWNKKCNWNSGSSDGDGNNGSGGWGKKSNWNSGSSGAGESK 579
G G+G Q + W K T+ ++ +W+S D N SG G KS+W G+ + + K
Sbjct: 1135 GASGIGTQKSDWGA-PKTTEVDQLSSWDSKVGHVDANQSSGS-GNKSSW--GTQKSSQEK 1190
Query: 580 DTDWNKKSNLNCGSSDGDGNNSSGWDKKGNWNAGSSGDGESKDTDWNKKCNWNSGSNDGD 639
WN++S S DGN S W K WN+GSS G + D+ W KK WNS S++ D
Sbjct: 1191 -PGWNQESPELEKDSKRDGNQESSWGNKSGWNSGSSDAGRNSDSAWGKKSIWNSESSNAD 1249
Query: 640 GNNGSGWGKKSNWNSGSNVAGESNDSNWAKKGNWNSGSDDANQESSWGKKQGNWNSGSRD 699
GN SGW KSNWNSGS A + S+WAKK NWNSGS DANQES WGKK +W+SG D
Sbjct: 1250 GNQDSGWATKSNWNSGSKDANQG--SSWAKKNNWNSGSSDANQESGWGKK-SSWSSGYGD 1306
Query: 700 GHQESS 705
G+Q+SS
Sbjct: 1307 GNQDSS 1312
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 95/207 (45%), Gaps = 41/207 (19%)
Query: 32 NAGGASAGRSSWPSFPVVGTSLHAESNPANAFGSGDNGANKDEEDSAWGSKVNAIQNSSW 91
N G +SWPS G +++P N S DN +KD AWGS+ Q SSW
Sbjct: 701 NPAVLIMGWTSWPSL---GLMDQPKTSPINPSSSADNELSKD---GAWGSQATGNQTSSW 754
Query: 92 GLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGK-----EDAGSSLQDSQDNWGKNKDACD 146
G A D WNKAA NI S +GA GWGK ED S + + DNWG+ +
Sbjct: 755 GAVAG-----DSWNKAA-SNIGSTSGASVGWGKATLPNEDLAGSSRGTGDNWGRGNLRAE 808
Query: 147 NQANWKKSDSWDKGKKIIGNSTSSWGDKTAEKNEPDSWGKGKDGS----------SGSKS 196
N + +WDKGK + GN TSSWGD KN+ DSWGK D SG+
Sbjct: 809 NSL-IDSAVAWDKGKTVSGNQTSSWGDAATGKNQVDSWGKCNDAIGAGSWEKKKRSGTGE 867
Query: 197 DWNSSALATENPTVSWGNASGGWTQQK 223
D W N S GW QQK
Sbjct: 868 D-------------CWSNKSTGWNQQK 881
>gi|15237667|ref|NP_196049.1| kow domain-containing transcription factor 1 [Arabidopsis thaliana]
gi|332003341|gb|AED90724.1| kow domain-containing transcription factor 1 [Arabidopsis thaliana]
Length = 1493
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 198/520 (38%), Positives = 240/520 (46%), Gaps = 165/520 (31%)
Query: 18 WMSAATTSAEGDRWNAGGASAGRSSWPSFPVVGTSLHAESNPANAFGS-----GDNGANK 72
W + ++ E W GAS SN A+ GS G +G NK
Sbjct: 789 WGDTSASNVEASSWEKQGAST------------------SNVAD-LGSWGTHGGSSGGNK 829
Query: 73 DEEDSAWGSKVNAIQNSSWGLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGKEDAGSSLQ 132
+EDS WG A ++S + K E W K + ES+ WG +D SS
Sbjct: 830 QDEDSVWGKLCEASESS-------QKKEESSWGKKGGSDGESS------WGNKDGNSSAS 876
Query: 133 DSQ-DNWGKNKDACDNQ---ANWK-KSDSWDKGKKIIGNSTSSWGDKTAE------KNEP 181
+WG+ D + W + + GKK G +S W +K+AE K P
Sbjct: 877 KKDGVSWGQQDKGSDESKGGSAWSNQCGDFGSGKKKDG--SSGW-NKSAEDSNANSKGVP 933
Query: 182 DSWGKGKDGSS-GSKSDWNSSALATENPTVSWGNAS--GGWTQQKGGNMDE-RSGWKKDD 237
D WG+ DGSS G K D SWG G W ++ GN D+ S W K D
Sbjct: 934 D-WGQPNDGSSWGKKGD----------GAASWGKKDDGGSWGKKDDGNKDDGGSSWGKKD 982
Query: 238 SGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQ-DGGSSWGKK-NGGS 295
G +D D GSSW K+ DGGSSW K+ DGGSSWGKK +GGS
Sbjct: 983 DGQKD-------------DGGSSWEKK--------FDGGSSWGKKDDGGSSWGKKDDGGS 1021
Query: 296 LMGKQ-DGGSSWGKQDGGSSL--GKQDGGSSWGKQ-DGRSSLAKQ-DGGSSWGKQDRG-- 348
L GK+ DGGSSWGK+D G SL K DG SSWGK+ DG SS K+ DGGSSWGK+D G
Sbjct: 1022 LWGKKDDGGSSWGKEDDGGSLWGKKDDGESSWGKKDDGESSWGKKDDGGSSWGKKDEGGY 1081
Query: 349 ------------------------------SSWGKQDEGSSWSKRDGGSSWGKQDGGSSL 378
SS+G ++ + WSK GGSSWGKQDG
Sbjct: 1082 SEQTFDRGGRGFGGRRGGGRRGGRDQFGRGSSFGNSEDPAPWSKPSGGSSWGKQDGDGGG 1141
Query: 379 AKQD------GGSSWGKQDGGSSLGKQDGGSSWSKQD----------------GGSSWGK 416
+ GGSSWGKQD G GSSW KQ+ GGSSWGK
Sbjct: 1142 SSWGKENDAGGGSSWGKQDNGV-------GSSWGKQNDGSGGGSSWGKQNDAGGGSSWGK 1194
Query: 417 QDG---GSSWGKQDG----GSSWGKQD---GGSSWSKEPD 446
QD GSSWGKQDG GS+WGKQ+ GGSSW K+ D
Sbjct: 1195 QDSGGDGSSWGKQDGGGDSGSAWGKQNNTSGGSSWGKQSD 1234
>gi|224100723|ref|XP_002311988.1| hypothetical protein POPTRDRAFT_765239 [Populus trichocarpa]
gi|222851808|gb|EEE89355.1| hypothetical protein POPTRDRAFT_765239 [Populus trichocarpa]
Length = 1853
Score = 93.2 bits (230), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 85/202 (42%), Positives = 105/202 (51%), Gaps = 20/202 (9%)
Query: 27 EGDRWNAGGASAGRSSWPSFPVVGTSLHAESNPANAFGSGDNGANKDEEDSAWGSKVNAI 86
EGD WN GG S R+SW S G +L E+NP N S DN NKD+ WGS+ A
Sbjct: 711 EGDGWNVGGLSTERTSWSS---PGFTLQPETNPVNPSSSVDNEPNKDD---TWGSQAKAK 764
Query: 87 QNSSWGLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGK-----EDAGSSLQDSQDNWGKN 141
Q SS G AAA D WNKAA NI S++GA GWGK ED S + S DNWG+
Sbjct: 765 QTSSRGAAAA-----DSWNKAA-SNIGSSSGASVGWGKATLSNEDLPGSSRGSGDNWGQG 818
Query: 142 KDACDNQANWKKSDSWDKGKKIIGNSTSSWGDKTAEKNEPDSWGKGKDGSSGSKSDWNSS 201
+ + + +WDKGK +IGN SWG+ KN+ SWGK D + N S
Sbjct: 819 ILRDEKSSFDAAASAWDKGKTVIGNQNGSWGEAATGKNQVGSWGKCNDAVEAGSWEKNKS 878
Query: 202 ALATENPTVSWGNASGGWTQQK 223
+ E+ N + GW QQK
Sbjct: 879 SGTGED---CLSNKTTGWNQQK 897
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 119/375 (31%), Positives = 159/375 (42%), Gaps = 126/375 (33%)
Query: 802 QGGGWNNNDSGDYKSFDSSQGVKNG-GEWSRSNDGAGSWSQGGG-------TWKSG---- 849
Q GGW++N+S + K+FD G N G W ++N G SW++GGG +W SG
Sbjct: 1495 QNGGWSDNNSAEDKTFDWKNGANNSSGGW-KNNGGGSSWNRGGGDRGHQQNSWNSGSGGT 1553
Query: 850 ---------------------NSGASSQDGGWSS-----------QGSGWNNSN------ 871
+SG S GGW+ QG+ W +SN
Sbjct: 1554 SNEGGGWSSQGSGWNQSRTAKDSGGSDLAGGWNKGTCANSDVAWGQGNSWKSSNPSGEGW 1613
Query: 872 --TTNEVKGLSDQGGGWNKGAGGSAQAGGWGRQGSGWSSGTSTGNRGSNDSSIANDVKGP 929
++ E+KG DQGGGWNKG G+ WG QG+ W S +G S S ++KG
Sbjct: 1614 SKSSKEIKGSEDQGGGWNKGPSGANSDAAWG-QGNSWKSSNPSGEGWSQSS---KEIKGS 1669
Query: 930 NDQVVGRNKGSNGSAQ--------------------------SGGWGN-QGSGWSSGTGS 962
DQ G NKG SAQ SG G Q SGWS G+
Sbjct: 1670 EDQGGGWNKGPGSSAQGGGWGTKGAGSGEAGMTGGDAMTWNQSGASGRGQSSGWS-GSTE 1728
Query: 963 GNKGSND-SNISN------------------KGPNDQGGGWNKGSGGSAQSGAWGNQGSG 1003
G +G+N +++ +G +DQG GWNKG +AQ+G WG++G+G
Sbjct: 1729 GKEGTNTGRELTDPCGKASSTSSWNQSSKDVEGSDDQGSGWNKGPSSNAQAGGWGDKGAG 1788
Query: 1004 WNGGTDSGNRGSNSDQPKSWNQSSVATDGGRSKDAGEGSSRG----------WG-KTAGS 1052
+ K+WNQSS G S +G WG K + S
Sbjct: 1789 ----------LGDGGDAKTWNQSSAFGGGQSSGWGQSSEVKGANETGKPADPWGNKASTS 1838
Query: 1053 SW-EKGNDGSGKGGW 1066
SW +GNDGS KGGW
Sbjct: 1839 SWGNEGNDGSSKGGW 1853
>gi|50405439|ref|XP_456355.1| DEHA2A00484p [Debaryomyces hansenii CBS767]
gi|49652019|emb|CAG84300.1| DEHA2A00484p [Debaryomyces hansenii CBS767]
Length = 298
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 65/187 (34%)
Query: 259 SSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQ 318
WN DG +DG W +DG W G +G W DG
Sbjct: 102 EHWNDNDGKEHWKDRDGNDCWNDKDGKKHWKDHEGKKHWKDHEGKEHWKDHDGKEHWKDH 161
Query: 319 DGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSL 378
DG W +DG+ +G W +D W + W+ +DG W +G
Sbjct: 162 DGNDCWNDKDGKKHWKDHEGKEHWNDKDGKEHWKDHEGNDCWNDKDGKKHWKDHEGKEHW 221
Query: 379 AKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGG 438
+G W +G +DG W DG W +DG W +G W QDG
Sbjct: 222 KDHEGKKHWKDHEGKEHWNDKDGKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWKDQDGK 281
Query: 439 SSWSKEP 445
+ K P
Sbjct: 282 EHFYKPP 288
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 70/195 (35%), Gaps = 2/195 (1%)
Query: 275 GGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLA 334
G +W +G W +G DG W +DG +G W DG+
Sbjct: 55 GKKNWKDHEGKQHWNDNDGKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWNDNDGKEHWK 114
Query: 335 KQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGS 394
+DG W +D W + W +G W DG DG W +DG
Sbjct: 115 DRDGNDCWNDKDGKKHWKDHEGKKHWKDHEGKEHWKDHDGKEHWKDHDGNDCWNDKDGKK 174
Query: 395 SLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQHRKN-- 452
+G W+ +DG W +G W +DG W +G W ++H K+
Sbjct: 175 HWKDHEGKEHWNDKDGKEHWKDHEGNDCWNDKDGKKHWKDHEGKEHWKDHEGKKHWKDHE 234
Query: 453 GGSSWGNRDGGSSWS 467
G W ++DG W
Sbjct: 235 GKEHWNDKDGKEHWK 249
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/209 (23%), Positives = 72/209 (34%), Gaps = 7/209 (3%)
Query: 260 SWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQD 319
WN +DG +G W DG W ++G +DG W +G +
Sbjct: 85 CWNDKDGKKHWKDHEGKEHWNDNDGKEHWKDRDGNDCWNDKDGKKHWKDHEGKKHWKDHE 144
Query: 320 GGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLA 379
G W DG+ DG W +D W + W+ +DG W +G
Sbjct: 145 GKEHWKDHDGKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWNDKDGKEHWKDHEGNDCWN 204
Query: 380 KQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGS 439
+DG W +G +G W +G W +DG W DG W +DG
Sbjct: 205 DKDGKKHWKDHEGKEHWKDHEGKKHWKDHEGKEHWNDKDGKEHWKDHDGNDCWNDKDGKK 264
Query: 440 SWSKEPDQQHRKNGGSSWGNRDGGSSWSK 468
W G W ++DG + K
Sbjct: 265 HWKDHE-------GKEHWKDQDGKEHFYK 286
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 62/187 (33%), Gaps = 7/187 (3%)
Query: 281 KQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGS 340
KQ S+G K+ S G +W +G DG W DG +DG
Sbjct: 34 KQLDSRSFGLKDFLSDHMPWSGKKNWKDHEGKQHWNDNDGKEHWKDHDGNDCWNDKDGKK 93
Query: 341 SWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQD 400
W + W D W RDG W +DG +G W +G D
Sbjct: 94 HWKDHEGKEHWNDNDGKEHWKDRDGNDCWNDKDGKKHWKDHEGKKHWKDHEGKEHWKDHD 153
Query: 401 GGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQHRKNGGSSWGNR 460
G W DG W +DG W +G W +DG W G W ++
Sbjct: 154 GKEHWKDHDGNDCWNDKDGKKHWKDHEGKEHWNDKDGKEHWKDHE-------GNDCWNDK 206
Query: 461 DGGSSWS 467
DG W
Sbjct: 207 DGKKHWK 213
>gi|156385476|ref|XP_001633656.1| predicted protein [Nematostella vectensis]
gi|156220729|gb|EDO41593.1| predicted protein [Nematostella vectensis]
Length = 215
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 101/211 (47%), Gaps = 7/211 (3%)
Query: 256 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 315
D G S +D S ++D G S ++ S G ++ G G +D G S G +D G S
Sbjct: 1 DAGKSQGFRDAGESQGLRDEGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESQ 60
Query: 316 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 375
G ++ G S G ++ S +D G S G +D G S G +DEG S R+ S G +D G
Sbjct: 61 GLRNAGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESRGLRNADESQGLRDAG 120
Query: 376 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQ 435
S +D G S G +D G S G +D G S ++ S G +D G S G +D G S G +
Sbjct: 121 KSQGFRDAGESQGLRDEGESQGLRDAGESRGLRNADESQGLRDAGKSQGFRDHGESQGLR 180
Query: 436 DGGSSWSKEPDQQHRKNGGSSWGNRDGGSSW 466
D G S +N G S G RD G S
Sbjct: 181 DEGESRGL-------RNAGESRGLRDAGESQ 204
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 89/178 (50%)
Query: 256 DVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSL 315
D G S +D S ++D G S ++ G S G +N G +D G S G +D G S
Sbjct: 37 DAGKSQGFRDAGESQGLRDEGESQGLRNAGESRGLRNADESQGLRDAGKSQGFRDAGESQ 96
Query: 316 GKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGG 375
G +D G S G ++ S +D G S G +D G S G +DEG S RD G S G ++
Sbjct: 97 GLRDEGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESQGLRDAGESRGLRNAD 156
Query: 376 SSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWG 433
S +D G S G +D G S G +D G S ++ G S G +D G S G +D G S G
Sbjct: 157 ESQGLRDAGKSQGFRDHGESQGLRDEGESRGLRNAGESRGLRDAGESQGFRDAGESRG 214
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 84/169 (49%)
Query: 272 VQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRS 331
+D G S +D G S G +N G G ++ S G +D G S G +D G S G +D
Sbjct: 44 FRDAGESQGLRDEGESQGLRNAGESRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGE 103
Query: 332 SLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQD 391
S ++ S G +D G S G +D G S RD G S G +D G S ++ S G +D
Sbjct: 104 SRGLRNADESQGLRDAGKSQGFRDAGESQGLRDEGESQGLRDAGESRGLRNADESQGLRD 163
Query: 392 GGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSS 440
G S G +D G S +D G S G ++ G S G +D G S G +D G S
Sbjct: 164 AGKSQGFRDHGESQGLRDEGESRGLRNAGESRGLRDAGESQGFRDAGES 212
>gi|296085969|emb|CBI31410.3| unnamed protein product [Vitis vinifera]
Length = 1065
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/242 (35%), Positives = 114/242 (47%), Gaps = 59/242 (24%)
Query: 98 GKNEDCWNKAAVKNIESNNGAYGGWGKEDAGSS-----LQDSQDNWGKNKDACDNQAN-W 151
GK+E+CWN++AV + A W K S L+D+ DNWGK K+ +N W
Sbjct: 169 GKSENCWNRSAV-TTGFGSSASDSWEKSKVSDSNQAGSLKDAGDNWGKGKNVAGTPSNGW 227
Query: 152 K-------KSDSWDKGKKI------IGNSTSSWGD-------------KTAEKNEPD--- 182
+ D+W KGK + + + S G+ K+ +K E
Sbjct: 228 NDATTGNDQLDAWGKGKNVGEASCWEKSKSPSIGEDRWNNGGPGWNQQKSGDKREDTGGG 287
Query: 183 ---SWGKGKD------GSSGSKSDWNSSALATENPTVSWGNASGGWTQQKGGNMDERSGW 233
+WGK + GSS SK DW SSA EN T GGW QQ+G DE SGW
Sbjct: 288 DGSTWGKALESQEKGSGSSASKVDWKSSAARPENQT-------GGWAQQEGVGEDE-SGW 339
Query: 234 KKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNG 293
+K +QDQ+ WNKPKTF GS+WN+Q + D + GGS D SWGK NG
Sbjct: 340 RKGGFSSQDQKGSWNKPKTFDVTRGSAWNQQADGTNEDFK-GGS-----DQNGSWGKPNG 393
Query: 294 GS 295
S
Sbjct: 394 FS 395
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/270 (35%), Positives = 121/270 (44%), Gaps = 67/270 (24%)
Query: 815 KSFDSSQGVKNGGEWSRSNDGAGSWSQGGG-----TWKSGNSGASSQDGGWSSQGSGWNN 869
KSF+ ++G N GE + +DGA SW QGGG W SG G S+Q GGW N
Sbjct: 797 KSFEWNKGSNNNGEGWKGHDGAVSWGQGGGDKGPRNWNSGTGGTSNQYGGW-------NQ 849
Query: 870 SNTTNEVKGLSDQGGGWNKGAGGSAQAGGWGRQGSGWSSGTSTGNRGSNDSSIANDVKGP 929
S E G Q G WNKGA + GG + S +S +S N+GS +S+A
Sbjct: 850 SKGVVESSGTGSQAGSWNKGASPAKDYGGSSVEASNLNSQSSGWNKGSAQTSVA------ 903
Query: 930 NDQVVGRNKGSNGSAQSGGWGNQGSGWSSGTGSGNKGSNDSNISNKGPNDQGGGWNKG-- 987
VV + QG+ W SG G W+ G
Sbjct: 904 ---VVSND--------------QGNDWKQSNASGK-----------------GQWSAGNQ 929
Query: 988 SGGSAQSGAWGNQGSGWNGGTDSGNRGSNSDQPKSWNQSSVATDGGRS------KDAGEG 1041
S AQ+ AW QGSGWN T +G+ D ++WNQS A DG +S K++ EG
Sbjct: 930 SIAGAQADAWDKQGSGWNRETSTGSGSMTRDGAETWNQSK-APDGAQSSAWNQTKNSKEG 988
Query: 1042 SSR------GWGKTAGSSWEKGNDGSGKGG 1065
+S WGK A +SW K DGS KGG
Sbjct: 989 TSNFREATDSWGKAAANSWSKEKDGSSKGG 1018
>gi|359486871|ref|XP_002272748.2| PREDICTED: uncharacterized protein LOC100256388 [Vitis vinifera]
Length = 2667
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/257 (31%), Positives = 111/257 (43%), Gaps = 89/257 (34%)
Query: 98 GKNEDCWNKAAVKNIESNNGAYGGWGKEDAGSSLQDSQDNWGKNKDACDNQANWKKS--D 155
GK+E+CWN++AV G+G + D+W K+K + NQA K D
Sbjct: 1817 GKSENCWNRSAVTT---------GFG--------SSASDSWEKSKVSDSNQAGSLKDAGD 1859
Query: 156 SWDKGKKIIGNSTSSWGDKTAEKNEPDSWGKGKDGSSGSKSDWNSSALATENPTV----- 210
+W KGK + G ++ W D T ++ D+WGKGK+ G S W S ++P++
Sbjct: 1860 NWGKGKNVAGTPSNGWNDATTGNDQLDAWGKGKN--VGEASCWEKS----KSPSIGEDRW 1913
Query: 211 -------------------------SWGNA---------------------------SGG 218
+WG A +GG
Sbjct: 1914 NNGGPGWNQQKSGDKREDTGGGDGSTWGKALESQEKGSGSSASKVDWKSSAARPENQTGG 1973
Query: 219 WTQQKGGNMDERSGWKKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGGSS 278
W QQ+G DE SGW+K +QDQ+ WNKPKTF GS+WN+Q + D + GGS
Sbjct: 1974 WAQQEGVGEDE-SGWRKGGFSSQDQKGSWNKPKTFDVTRGSAWNQQADGTNEDFK-GGS- 2030
Query: 279 WAKQDGGSSWGKKNGGS 295
D SWGK NG S
Sbjct: 2031 ----DQNGSWGKPNGFS 2043
>gi|356574169|ref|XP_003555224.1| PREDICTED: uncharacterized protein LOC100818587 [Glycine max]
Length = 1495
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 98/209 (46%), Gaps = 59/209 (28%)
Query: 552 DGDGNNGSGGWGKKSNWNSGSSGAGESKDTDWNKKSNLNCGSSDGDGNNSSGWDKKGNWN 611
D GN GS GWG+KSNWNSGS E++++ W+ N GN S DKK NWN
Sbjct: 1004 DSSGNQGSNGWGQKSNWNSGSRSGNENQNSHWSSGRN-------EPGNQDSNLDKKSNWN 1056
Query: 612 AGSSGDGESKDTDWNKKCNWNSGSNDGD--------GNNGSGWG-----KKSNWNSGSNV 658
+G+SG+ S K NWNSGS + + NN S WG K S+W+SG +
Sbjct: 1057 SGNSGNLASDP----KSSNWNSGSGNSNENSNWGTNVNNKSSWGTGNENKNSSWSSGHSD 1112
Query: 659 AGESNDSNWAKKGNWNSGS---------------------------------DDANQESS 685
G + D+N KK NWNSG+ D N +S+
Sbjct: 1113 PG-NQDANQGKKSNWNSGNSGNQPSDPNSNWNSNKSSWSAGNENKKSNWSSGDPGNTDSN 1171
Query: 686 WGKKQGNWNSGSRDGHQESSWGKKSDWNS 714
WG K N SGS D +Q +SW S WN+
Sbjct: 1172 WGNK-NNCISGSGDANQNTSWRSNSSWNT 1199
>gi|449465322|ref|XP_004150377.1| PREDICTED: uncharacterized protein LOC101208989 [Cucumis sativus]
Length = 1690
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 123/384 (32%), Positives = 171/384 (44%), Gaps = 67/384 (17%)
Query: 9 LVPFC------GGKDWMSAATTSAEGDRWNAGGASAGRSSWPSFPVVGTSLHAESNPANA 62
L PF G +DW+ +SA GD WN+ S+ RS WPSFP GTS S+ N
Sbjct: 644 LKPFDILGNEGGSQDWIGGGGSSAGGDGWNSARPSSERSPWPSFPESGTSNGPGSSSTNP 703
Query: 63 FGSGDNGANKDEEDSAWGSKVNAIQNSSWGLAAAEGKNEDCWNKAAVKNIESNNGAYGGW 122
FGS K++EDS W SK+ ++SWG A K++V +N+G GW
Sbjct: 704 FGSD----AKNDEDSPWISKLTPEASTSWGAA-----------KSSVDT--ANDGQASGW 746
Query: 123 GKEDAGSSLQ-DSQDNWGKNKDACDNQANWKKSDS--WDKGKKIIGNSTSSWGDKTAE-K 178
GK D+ ++ GK + A + S+S W K N ++++GD A +
Sbjct: 747 GKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKK------NQSANFGDDNAPVE 800
Query: 179 NEPDSWGKGKDGSSGSKSDWNSSALATENPTVSWGNA-----------SGGWTQQKGGNM 227
D WG K SSGS D N+S +E GNA SGGW + K
Sbjct: 801 TSADRWG-SKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGD 859
Query: 228 DERSGWKKDDSGNQD-QRSGWNKPKTFGAD--VGSSWNKQDGICSSDVQDGGSSW--AKQ 282
S W K +G+ D Q WNKPK D VG K++ ++ D G+ W ++
Sbjct: 860 VGSSAWNKSTAGDGDGQNGSWNKPKPSNHDGNVG----KKEWGQGNEASDNGNKWQSSRS 915
Query: 283 DGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDG---G 339
DGG WG +++GGSSW +S G +SW + SSL G
Sbjct: 916 DGGKKWGTNE-----AEREGGSSW-----NTSKSSDVGPASWKDKPDSSSLTAPKGDQWA 965
Query: 340 SSWGKQDRGSSWGKQDEGSSWSKR 363
W KQ + D+ SSW+K+
Sbjct: 966 EGWDKQHSSNDTKASDDNSSWNKK 989
>gi|260940439|ref|XP_002614519.1| hypothetical protein CLUG_05296 [Clavispora lusitaniae ATCC 42720]
gi|238851705|gb|EEQ41169.1| hypothetical protein CLUG_05296 [Clavispora lusitaniae ATCC 42720]
Length = 708
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 86/227 (37%), Positives = 151/227 (66%), Gaps = 45/227 (19%)
Query: 217 GGWTQQKGGNMDERSGWKKDDSGNQDQRSGWNKPKTFGADVGSSWNKQDGICSSDVQDGG 276
GG +++ GG +E G ++D G ++ G ++ D G+S ++DG S +DGG
Sbjct: 347 GGTSEEDGGTSEEDGGTSEEDGGTSEEDGGTSE-----EDGGTS--EEDGGTSE--EDGG 397
Query: 277 SSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQ 336
+S ++DGG+S +++GG+ ++DGG+S ++DGG+S ++DGG+S ++DG +S ++
Sbjct: 398 TS--EEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS--EE 443
Query: 337 DGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSL 396
DGG+S ++D G+S ++D G+ S+ DGG+S ++DGG+S ++DGG+S ++DGG+S
Sbjct: 444 DGGTS--EEDGGTS--EEDGGT--SEEDGGTS--EEDGGTS--EEDGGTS--EEDGGTS- 490
Query: 397 GKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWG---KQDGGSS 440
++DGG+ S++DGG+S ++DGG S + DG S G +DGG+S
Sbjct: 491 -EEDGGT--SEEDGGTS--EEDGGISV-EDDGASEVGGTSVEDGGTS 531
>gi|260820124|ref|XP_002605385.1| hypothetical protein BRAFLDRAFT_74205 [Branchiostoma floridae]
gi|229290718|gb|EEN61395.1| hypothetical protein BRAFLDRAFT_74205 [Branchiostoma floridae]
Length = 504
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 100/231 (43%), Gaps = 8/231 (3%)
Query: 255 ADVGSSWNKQD--GICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGG 312
+D G +++ D G+CS D G ++ D G + + + G + + D G + + D G
Sbjct: 61 SDSGGVYSESDSGGVCSE--SDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSG 118
Query: 313 SSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQ 372
+ D G + + D ++ D G + + D G + + D G +S+ D G + +
Sbjct: 119 GVYSESDSGGVYSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSES 178
Query: 373 DGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSW 432
D G ++ D G + + D G + D G +S+ D G + + D G + D G +
Sbjct: 179 DSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVCSESDSGGVY 238
Query: 433 GKQDGGSSWSKEPD---QQHRKNGGSSWGNRDGGSSWSKQTDQQDNQEKPL 480
+ D G +S E D + G + D G S+ +N+EK L
Sbjct: 239 SESDSGGVYS-ESDSGGVYSESDSGGVYSESDSGGVCSESDSGGENREKFL 288
Score = 44.3 bits (103), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 45/205 (21%), Positives = 87/205 (42%), Gaps = 18/205 (8%)
Query: 255 ADVGSSWNKQD--GICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGG 312
+D G +++ D G+CS D G +++ D G + + + G + + D G + + D G
Sbjct: 214 SDSGGVYSESDSGGVCSE--SDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSG 271
Query: 313 SSLGKQDGG--------------SSWGKQDGRSSLAKQDGGSSWGKQDRGSSWGKQDEGS 358
+ D G + + + D ++ D G + D G + + D G
Sbjct: 272 GVCSESDSGGENREKFLWFPSSAAVYSESDSGGVYSESDSGGVCSESDSGGVYSESDSGG 331
Query: 359 SWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDGGSSWGKQD 418
+S+ D G + D G ++ D G + + D G + D G +S+ D G + + D
Sbjct: 332 VYSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESD 391
Query: 419 GGSSWGKQDGGSSWGKQDGGSSWSK 443
G + + D G + + D G +S+
Sbjct: 392 SGGVYSESDSGGVYSESDSGGVYSE 416
Score = 41.2 bits (95), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 37/172 (21%), Positives = 74/172 (43%)
Query: 272 VQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRS 331
V + G +++ D G + + N G + + D G + + + G + + + + D
Sbjct: 6 VSNSGGVYSESDSGGVYSESNSGGVYSESDSGGVYSESNSGGVCSESNSRGVYSESDSGG 65
Query: 332 SLAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQD 391
++ D G + D G + D G +S+ D G + + D G ++ D G + + D
Sbjct: 66 VYSESDSGGVCSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSESD 125
Query: 392 GGSSLGKQDGGSSWSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSK 443
G + D G S+ D G + + D G + + D G + + D G +S+
Sbjct: 126 SGGVYSESDSGGVCSESDSGGVYSESDSGGVYSESDSGGVYSESDSGGVYSE 177
>gi|366993156|ref|XP_003676343.1| hypothetical protein NCAS_0D04010 [Naumovozyma castellii CBS 4309]
gi|342302209|emb|CCC69982.1| hypothetical protein NCAS_0D04010 [Naumovozyma castellii CBS 4309]
Length = 560
Score = 46.2 bits (108), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 139/342 (40%), Gaps = 47/342 (13%)
Query: 234 KKDDSGNQDQRSGW---NKPKTFGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGK 290
KK GN D + NK ++G D N D SS+ + SS+ D S+G
Sbjct: 182 KKSSYGNDDNNDSYGSSNKKSSYGDD-----NNTDSYGSSNKK---SSYGNDDNTDSYGS 233
Query: 291 KNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGKQDRGSS 350
N G + SS+G D S G + S+G + +SS D S+G + S
Sbjct: 234 SNNDDSYGSSNKKSSYGNDDNTDSYGSSNNDDSYGSSNKKSSYGNDDNTDSYGSSNNDDS 293
Query: 351 WGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSSWSKQDG 410
+G ++ SS+ D S+G + S + SS+G D S G + S+ +
Sbjct: 294 YGSSNKKSSYGNDDNTDSYGSSNNNDSYGSSNKKSSYGNDDNTDSYGSSNINDSYGSSNK 353
Query: 411 GSSWGKQDGGSSWGKQDGGSSWGKQDGGSSWSKEPDQQHRKNGGSSWGNRDGGSSW--SK 468
SS+G D S+G + SS+G D S+ ++KN S+GN D +S+ S
Sbjct: 354 KSSYGNDDNNDSYGSSNKKSSYGNDDNTDSYG----SSNKKN---SYGNDDNNNSYGSSN 406
Query: 469 QTDQQDNQEKPLESDGGRGSGGRWGQGGGQG--GGQEVSDQYGRGSFDQGSEKGTGGMGD 526
+ D G + +G + G + +D YG ++K + G D
Sbjct: 407 KNDSY----------GSSNNNDSYGSSNKKSSYGNDDNTDSYG-----SSNKKNSYGNDD 451
Query: 527 QGNGWNRRDKGTDWNKKCNWNSGSSDGDGNNGSGGWGKKSNW 568
N + +K + GSS+ N+ G KKS++
Sbjct: 452 NNNSYGSSNKNDSY--------GSSNN--NDSYGSSNKKSSY 483
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 87/217 (40%), Gaps = 26/217 (11%)
Query: 238 SGNQDQRSGWNKPKTFGAD----------VGSSW---NKQDGICSSDVQDGGSSWAKQDG 284
S N D NK ++G D + S+ NK+ SS+ D
Sbjct: 315 SNNNDSYGSSNKKSSYGNDDNTDSYGSSNINDSYGSSNKK------------SSYGNDDN 362
Query: 285 GSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSSLAKQDGGSSWGK 344
S+G N S G D S+G + +S G D +S+G + S + S+G
Sbjct: 363 NDSYGSSNKKSSYGNDDNTDSYGSSNKKNSYGNDDNNNSYGSSNKNDSYGSSNNNDSYGS 422
Query: 345 QDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDGGSSLGKQDGGSS 404
++ SS+G D S+ + +S+G D +S + S+G + S G + SS
Sbjct: 423 SNKKSSYGNDDNTDSYGSSNKKNSYGNDDNNNSYGSSNKNDSYGSSNNNDSYGSSNKKSS 482
Query: 405 WSKQDGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSSW 441
+ D S+G S+G + SS+G D S+
Sbjct: 483 YGNDDNTDSYGSN-NNDSYGSANKNSSYGNDDNDDSY 518
>gi|356534384|ref|XP_003535735.1| PREDICTED: uncharacterized protein LOC100788244 [Glycine max]
Length = 715
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 97/188 (51%), Gaps = 40/188 (21%)
Query: 562 WGKKSNWNSGSSGAGESKDTDW--------NKKSNLNCGSS--DGDGNNSSGWDKKGNWN 611
WG KSNWNSGS E++++ W N+ SNL+ ++ +G N + K WN
Sbjct: 263 WGPKSNWNSGSRSGNENQNSHWSSGHTEPRNQDSNLDKKNNWISVNGANLASDPKSSKWN 322
Query: 612 AGS--------SGDGESKDTDW-----NKKCNWNSGSNDGDGNNGSGWGKKSNWNSGS-- 656
+GS G+ + + W NK NW+SG N G + S GKKSNWNSG
Sbjct: 323 SGSGNSNENSNWGNNGNNKSSWGAGNENKNSNWSSGRN-GPEDQESNQGKKSNWNSGDSD 381
Query: 657 NVAGESNDS------------NWAKKGNWNSGSDDANQESSWGKKQGNWNSGSRDGHQES 704
N A + N+S N + NW+SG D N++S+WGKK NWNSGS D +Q +
Sbjct: 382 NQASDPNNSNWNSNKSSWSAGNENRNSNWSSG-DPGNKDSNWGKKS-NWNSGSGDANQNT 439
Query: 705 SWGKKSDW 712
+WG S W
Sbjct: 440 TWGSNSSW 447
>gi|323456761|gb|EGB12627.1| hypothetical protein AURANDRAFT_70522 [Aureococcus anophagefferens]
Length = 2743
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 55/134 (41%), Positives = 73/134 (54%), Gaps = 14/134 (10%)
Query: 274 DGGSSWAK-QDGGSSWGKKNGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWGKQDGRSS 332
DGG+S++ DGG+S+ + DGG+S+ D GSS DGGS+ DG SS
Sbjct: 52 DGGTSYSSNDDGGTSYSSND--------DGGTSYSSNDDGSSASNDDGGSA-SNDDG-SS 101
Query: 333 LAKQDGGSSWGKQDRGSSWGKQDEGSSWSKRDGGSSWGKQDGGSSLAKQDGGSSWGKQDG 392
+ DGGS D GSS D G+S+S D G S DGGS+ + DGG+S+ D
Sbjct: 102 ASNDDGGS--ASNDDGSSASNDDGGTSYSSNDDGGSASNDDGGSA-SNDDGGTSYSSNDD 158
Query: 393 GSSLGKQDGGSSWS 406
G S DGG+S+S
Sbjct: 159 GGSASNDDGGTSYS 172
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.302 0.127 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,106,958,630
Number of Sequences: 23463169
Number of extensions: 1453316648
Number of successful extensions: 11275603
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 17699
Number of HSP's successfully gapped in prelim test: 50058
Number of HSP's that attempted gapping in prelim test: 4711023
Number of HSP's gapped (non-prelim): 2061306
length of query: 1066
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 913
effective length of database: 8,769,330,510
effective search space: 8006398755630
effective search space used: 8006398755630
T: 11
A: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 83 (36.6 bits)