BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023513
(281 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255580004|ref|XP_002530836.1| conserved hypothetical protein [Ricinus communis]
gi|223529600|gb|EEF31549.1| conserved hypothetical protein [Ricinus communis]
Length = 280
Score = 299 bits (766), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 147/208 (70%), Positives = 171/208 (82%), Gaps = 8/208 (3%)
Query: 52 TKAAVAAQMKQNWLDSLTFPPLHVH-------DLTANQTNADSQWALGIDPDLSGALAVL 104
++A VA Q+K+NWL SL+ P D TA +N S W +GIDPDLSGALA+L
Sbjct: 62 SEAEVARQLKENWLHSLSCPFNQTESSASKGVDSTA-PSNVGSNWVIGIDPDLSGALALL 120
Query: 105 KSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQD 164
K D +GCSA+VFD+PHL VLVGKR+RKRLDAKS++ LL S DAP+GTTAY+EQSIP+PQD
Sbjct: 121 KIDDSGCSAQVFDSPHLKVLVGKRIRKRLDAKSIVQLLHSFDAPLGTTAYIEQSIPFPQD 180
Query: 165 GKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPS 224
GKQGWWSGGFGYGLWIGILVASGFSVVP+PSL WKN + LSG +KDDSR+VASTLFPS
Sbjct: 181 GKQGWWSGGFGYGLWIGILVASGFSVVPVPSLAWKNVFELSGSKFSKDDSRKVASTLFPS 240
Query: 225 LCSQLKRKKDHGKADAVLIAAYGKGLKL 252
L S LKRKKDHG+A+A+LIAAYGKGLKL
Sbjct: 241 LSSLLKRKKDHGRAEALLIAAYGKGLKL 268
>gi|225435862|ref|XP_002263460.1| PREDICTED: uncharacterized protein LOC100242692 isoform 1 [Vitis
vinifera]
Length = 291
Score = 290 bits (742), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 146/262 (55%), Positives = 188/262 (71%), Gaps = 11/262 (4%)
Query: 1 METTVKIQPQLQSHCISMNTFVSSKIRTLCTY---PFAQHGRTLLVRAAAAGDTTKAAVA 57
M KI P+L F ++ + + T+ P T+L ++G
Sbjct: 18 MNPISKITPKLALSAPKFRVFCATSLNSTNTHDPIPAPSSTDTVL---RSSGKARARVSD 74
Query: 58 AQMKQNWLDSLTFPPLHVHD-----LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS 112
Q ++NWL SL+ P ++ + + ++NA +Q +GIDPD+SGALA+LK+ +GCS
Sbjct: 75 TQYRENWLASLSCPFPDENERPGIRMDSAESNAGTQCVIGIDPDISGALALLKTGDSGCS 134
Query: 113 AEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSG 172
A+VFD+PHL +LVGKRVRKRLDAKS++ LLR DAPIGT AY+EQSIP+PQDGKQGWWSG
Sbjct: 135 AQVFDSPHLQILVGKRVRKRLDAKSIVQLLRGFDAPIGTIAYIEQSIPFPQDGKQGWWSG 194
Query: 173 GFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRK 232
GFGYGLWIGILVASGFSVVP+PS WKN + LSG ++KDDSRRVAST+FPS+ S LKRK
Sbjct: 195 GFGYGLWIGILVASGFSVVPVPSSLWKNEFKLSGNGTSKDDSRRVASTIFPSMSSLLKRK 254
Query: 233 KDHGKADAVLIAAYGKGLKLDS 254
KDHG+A+A+LIAA+GKGLK+ S
Sbjct: 255 KDHGRAEALLIAAFGKGLKMKS 276
>gi|296083889|emb|CBI24277.3| unnamed protein product [Vitis vinifera]
Length = 192
Score = 278 bits (710), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 128/177 (72%), Positives = 156/177 (88%)
Query: 78 LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKS 137
+ + ++NA +Q +GIDPD+SGALA+LK+ +GCSA+VFD+PHL +LVGKRVRKRLDAKS
Sbjct: 1 MDSAESNAGTQCVIGIDPDISGALALLKTGDSGCSAQVFDSPHLQILVGKRVRKRLDAKS 60
Query: 138 MIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLT 197
++ LLR DAPIGT AY+EQSIP+PQDGKQGWWSGGFGYGLWIGILVASGFSVVP+PS
Sbjct: 61 IVQLLRGFDAPIGTIAYIEQSIPFPQDGKQGWWSGGFGYGLWIGILVASGFSVVPVPSSL 120
Query: 198 WKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLKLDS 254
WKN + LSG ++KDDSRRVAST+FPS+ S LKRKKDHG+A+A+LIAA+GKGLK+ S
Sbjct: 121 WKNEFKLSGNGTSKDDSRRVASTIFPSMSSLLKRKKDHGRAEALLIAAFGKGLKMKS 177
>gi|356528579|ref|XP_003532877.1| PREDICTED: uncharacterized protein LOC100775189 [Glycine max]
Length = 274
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 131/203 (64%), Positives = 163/203 (80%), Gaps = 7/203 (3%)
Query: 53 KAAVAAQMKQNWLDSLTFP-PLHVHDLTANQT----NADSQWALGIDPDLSGALAVLKSD 107
KA+ Q K+NWL SL++P P H L+ N S+W LGIDPD+SGA+A+LK+
Sbjct: 52 KASSDTQFKENWLASLSYPFPEKTHLLSGEHEPTHQNDGSKWVLGIDPDVSGAVALLKTH 111
Query: 108 HNGCSAEVFDTPHLPVLVGK--RVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDG 165
+ CSA+VFD+PH+ +LVGK R R+RLDAKS++ L+R DAPIGTTA++EQS+PYPQDG
Sbjct: 112 GSVCSAQVFDSPHVKILVGKNKRTRRRLDAKSVVELVRGFDAPIGTTAFIEQSLPYPQDG 171
Query: 166 KQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSL 225
KQGWWSGGFGYGLWIGILVASGFSV+P+PS TWK + LSG +TKDDSRR+ASTLFPS+
Sbjct: 172 KQGWWSGGFGYGLWIGILVASGFSVIPVPSFTWKAKFELSGNGTTKDDSRRIASTLFPSM 231
Query: 226 CSQLKRKKDHGKADAVLIAAYGK 248
S L RKKDHG+A+A+LIAAYG+
Sbjct: 232 TSMLSRKKDHGRAEALLIAAYGQ 254
>gi|30683279|ref|NP_180252.2| uncharacterized protein [Arabidopsis thaliana]
gi|26452962|dbj|BAC43557.1| unknown protein [Arabidopsis thaliana]
gi|28973431|gb|AAO64040.1| unknown protein [Arabidopsis thaliana]
gi|330252802|gb|AEC07896.1| uncharacterized protein [Arabidopsis thaliana]
Length = 273
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/283 (53%), Positives = 192/283 (67%), Gaps = 20/283 (7%)
Query: 1 METTVKIQ---PQLQSHCISMNTFVSSKIRTLCTYPFAQHGRTLLVRAAAAGDTTKAAVA 57
M TTV Q PQ+ S + F+S T PF + R+ A TTKA A
Sbjct: 1 MATTVYGQASLPQMHSLFSKVRPFLSHSPS--FTAPFTRR-RSFPFSALP---TTKAIDA 54
Query: 58 AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS--AEV 115
A MK+ WLDSL+ L D NA+S +GIDPDLSGALA+LK DH G S A+V
Sbjct: 55 ALMKEKWLDSLS---LTSQDEDTTPENAESSCIIGIDPDLSGALALLKFDHLGSSSFAQV 111
Query: 116 FDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFG 175
+DTPH+PVLVGKRVRKRLDAKS++ L++SLD P G+ Y+EQS P+P+DGKQGW+SGGFG
Sbjct: 112 YDTPHIPVLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQGWYSGGFG 171
Query: 176 YGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDH 235
YGLWIG LVASGF V+P+ + WK + L+ G+ TKDDSRRVA+ LFPSL SQLKRKKDH
Sbjct: 172 YGLWIGTLVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAELFPSLSSQLKRKKDH 231
Query: 236 GKADAVLIAAYGKGLKLDSSHLSGELVPQRSEVVTTAQLLQKE 278
G+A+A+LIAAYG+ LK + +L+ Q E+++ L+ +
Sbjct: 232 GRAEALLIAAYGEALKTE------KLLIQPKELLSQVNYLENQ 268
>gi|3426042|gb|AAC32241.1| unknown protein [Arabidopsis thaliana]
Length = 286
Score = 263 bits (671), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 151/296 (51%), Positives = 193/296 (65%), Gaps = 33/296 (11%)
Query: 1 METTVKIQ---PQLQSHCISMNTFVSSKIRTLCTYPFAQHGRTLLVRAAAAGDTTKAAVA 57
M TTV Q PQ+ S + F+S T PF + R+ +A TTKA A
Sbjct: 1 MATTVYGQASLPQMHSLFSKVRPFLSHSPS--FTAPFTRR-RSF---PFSALPTTKAIDA 54
Query: 58 AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS--AEV 115
A MK+ WLDSL+ L D NA+S +GIDPDLSGALA+LK DH G S A+V
Sbjct: 55 ALMKEKWLDSLS---LTSQDEDTTPENAESSCIIGIDPDLSGALALLKFDHLGSSSFAQV 111
Query: 116 FDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ-------- 167
+DTPH+PVLVGKRVRKRLDAKS++ L++SLD P G+ Y+EQS P+P+DGKQ
Sbjct: 112 YDTPHIPVLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQVYATAHIR 171
Query: 168 -----GWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLF 222
GW+SGGFGYGLWIG LVASGF V+P+ + WK + L+ G+ TKDDSRRVA+ LF
Sbjct: 172 FSSAMGWYSGGFGYGLWIGTLVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAELF 231
Query: 223 PSLCSQLKRKKDHGKADAVLIAAYGKGLKLDSSHLSGELVPQRSEVVTTAQLLQKE 278
PSL SQLKRKKDHG+A+A+LIAAYG+ LK + +L+ Q E+++ L+ +
Sbjct: 232 PSLSSQLKRKKDHGRAEALLIAAYGEALKTE------KLLIQPKELLSQVNYLENQ 281
>gi|297825913|ref|XP_002880839.1| hypothetical protein ARALYDRAFT_320472 [Arabidopsis lyrata subsp.
lyrata]
gi|297326678|gb|EFH57098.1| hypothetical protein ARALYDRAFT_320472 [Arabidopsis lyrata subsp.
lyrata]
Length = 287
Score = 262 bits (669), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 150/297 (50%), Positives = 192/297 (64%), Gaps = 34/297 (11%)
Query: 1 METTVKIQPQL-QSHCISMNTFVSSKIR---TLCTYPFAQHGRTLLVRAAAAGDTTKAAV 56
M TT QP L Q H + SK+R +L + F R + +A TTKA
Sbjct: 1 MATTAYGQPSLPQMHSLF------SKLRPFLSLSSPSFTAPFRRRRDFSFSALPTTKAID 54
Query: 57 AAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS--AE 114
A MK+ WLDSL+ L D NA+S +GIDPDLSGALA+LK DH G S A+
Sbjct: 55 VALMKEKWLDSLS---LTSQDEDTTPENAESSCVIGIDPDLSGALALLKFDHLGSSSSAQ 111
Query: 115 VFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ------- 167
V+DTPH+PVLVGKRVRKRLDAKS++ L++SLD P G+ Y+EQS P+P+DGKQ
Sbjct: 112 VYDTPHIPVLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQVYATTHI 171
Query: 168 ------GWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTL 221
GW+SGGFGYGLWIG LVASGF V+P+ + WK + L+ G+ TKDDSRRVA+ L
Sbjct: 172 HFSSVMGWYSGGFGYGLWIGTLVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAEL 231
Query: 222 FPSLCSQLKRKKDHGKADAVLIAAYGKGLKLDSSHLSGELVPQRSEVVTTAQLLQKE 278
FPSL +QLKRKKDHG+A+A+LIAAYG+ LK + +L+ Q E+V+ L+ +
Sbjct: 232 FPSLSTQLKRKKDHGRAEALLIAAYGEALKTE------KLLIQPQELVSQVNYLENQ 282
>gi|356555307|ref|XP_003545975.1| PREDICTED: uncharacterized protein LOC100790125 [Glycine max]
Length = 290
Score = 261 bits (667), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 128/201 (63%), Positives = 156/201 (77%), Gaps = 7/201 (3%)
Query: 55 AVAAQMKQNWLDSLTFP-PLHVHDLTANQT----NADSQWALGIDPDLSGALAVLKSDHN 109
A Q K+NWL SL+ P P H L N S+W LGIDPD+SGA+A+LK+ +
Sbjct: 70 ASDTQFKENWLASLSNPFPEKTHLLNGEHEPTHQNDGSKWVLGIDPDVSGAVALLKTHGS 129
Query: 110 GCSAEVFDTPHLPVLVGKR--VRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ 167
CSA+VFD+PH+ +LVGK R+RLDAKS++ L+ APIGTTAY+EQS+PYPQDGKQ
Sbjct: 130 VCSAQVFDSPHVKILVGKNKTTRRRLDAKSVVELVCGFRAPIGTTAYIEQSLPYPQDGKQ 189
Query: 168 GWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCS 227
GWWSGGFGYGLWIGILVASGFSV+P+PS TWK + LSG +TKDDSRR+ASTLFPS+ S
Sbjct: 190 GWWSGGFGYGLWIGILVASGFSVIPVPSFTWKAKFKLSGNGTTKDDSRRLASTLFPSMTS 249
Query: 228 QLKRKKDHGKADAVLIAAYGK 248
L RKKDHG+A+A+LIAAYG+
Sbjct: 250 MLSRKKDHGRAEALLIAAYGQ 270
>gi|224105997|ref|XP_002314007.1| predicted protein [Populus trichocarpa]
gi|222850415|gb|EEE87962.1| predicted protein [Populus trichocarpa]
Length = 235
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 112/167 (67%), Positives = 131/167 (78%), Gaps = 10/167 (5%)
Query: 53 KAAVAAQMKQNWLDSLTFP--------PLHVHDLTANQTNADSQWALGIDPDLSGALAVL 104
K A AQ+KQNWLDSLTFP L DL N N S W +G+DPD+SGALA+L
Sbjct: 67 KVADVAQLKQNWLDSLTFPLPNETENTNLGGDDLARN--NVGSNWVIGVDPDVSGALALL 124
Query: 105 KSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQD 164
K D +GCSA+VFD+PHL V+VGK +RKRLD KS++ L+RS DAPIGTTAYVEQS P+PQD
Sbjct: 125 KIDESGCSAQVFDSPHLKVMVGKGIRKRLDVKSIVQLIRSFDAPIGTTAYVEQSTPFPQD 184
Query: 165 GKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTK 211
GKQGWWSGGFGYGLWIG+LVASGFSVVP+PS+TWK+ L+GG TK
Sbjct: 185 GKQGWWSGGFGYGLWIGVLVASGFSVVPVPSMTWKSDLELAGGRCTK 231
>gi|334184493|ref|NP_001189612.1| uncharacterized protein [Arabidopsis thaliana]
gi|330252803|gb|AEC07897.1| uncharacterized protein [Arabidopsis thaliana]
Length = 326
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 187/336 (55%), Gaps = 73/336 (21%)
Query: 1 METTVKIQ---PQLQSHCISMNTFVSSKIRTLCTYPFAQHGRTLLVRAAAAGDTTKAAVA 57
M TTV Q PQ+ S + F+S T PF + R+ +A TTKA A
Sbjct: 1 MATTVYGQASLPQMHSLFSKVRPFLSHSPSF--TAPFTRR-RSF---PFSALPTTKAIDA 54
Query: 58 AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPD--------------------- 96
A MK+ WLDSL+ L D NA+S +GIDPD
Sbjct: 55 ALMKEKWLDSLS---LTSQDEDTTPENAESSCIIGIDPDLSGALALLKFDHLGSSSFAQN 111
Query: 97 ----------------------LSGALAVLKSDHNG---CSA---------EVFDTPHLP 122
L G+ + + + C + EV+DTPH+P
Sbjct: 112 QTYYQSWLCFEHLVLMLRNYLSLQGSYVICEKNRECEGFCESFMLLCLVQFEVYDTPHIP 171
Query: 123 VLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGI 182
VLVGKRVRKRLDAKS++ L++SLD P G+ Y+EQS P+P+DGKQGW+SGGFGYGLWIG
Sbjct: 172 VLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQGWYSGGFGYGLWIGT 231
Query: 183 LVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVL 242
LVASGF V+P+ + WK + L+ G+ TKDDSRRVA+ LFPSL SQLKRKKDHG+A+A+L
Sbjct: 232 LVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAELFPSLSSQLKRKKDHGRAEALL 291
Query: 243 IAAYGKGLKLDSSHLSGELVPQRSEVVTTAQLLQKE 278
IAAYG+ LK + +L+ Q E+++ L+ +
Sbjct: 292 IAAYGEALKTE------KLLIQPKELLSQVNYLENQ 321
>gi|449444851|ref|XP_004140187.1| PREDICTED: uncharacterized protein LOC101204284 [Cucumis sativus]
gi|449481000|ref|XP_004156052.1| PREDICTED: uncharacterized LOC101204284 [Cucumis sativus]
Length = 273
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 146/263 (55%), Positives = 185/263 (70%), Gaps = 19/263 (7%)
Query: 11 LQSHCIS-MNTFVSSKI-------RTLCT-------YPFAQHGRTLLVRAAAAGDTTKAA 55
LQSH S MN+ SSK+ R LCT + VR ++G A
Sbjct: 6 LQSHSNSFMNSLPSSKLKLKLHHFRFLCTSSFSSICSSEISTISSSSVRKDSSGKAKLAI 65
Query: 56 VAAQMKQNWLDSLTFPPLHVHDLTANQT----NADSQWALGIDPDLSGALAVLKSDHNGC 111
AQ+K NWL SL+ P HD ++N + NA S+ +G+DPD+SGA+A+L++D +
Sbjct: 66 AHAQLKDNWLASLSCPFPLGHDYSSNSSSPDRNAASECVIGVDPDVSGAVALLRTDESIS 125
Query: 112 SAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWS 171
SA+V+D+PH+ +LVG R RKRLDAKS++ LL S +APIGTTAY+EQS P+P+DGKQGWW
Sbjct: 126 SAQVYDSPHVQILVGGRKRKRLDAKSIVQLLHSFNAPIGTTAYLEQSNPFPKDGKQGWWG 185
Query: 172 GGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKR 231
GGFGYGLWIG+LV GFSVVP+P L WKN + LSG ++KDDSRR+AS LFPSL LKR
Sbjct: 186 GGFGYGLWIGVLVGLGFSVVPVPPLAWKNKFELSGKDTSKDDSRRIASELFPSLTPLLKR 245
Query: 232 KKDHGKADAVLIAAYGKGLKLDS 254
KKDHG+ADA+LIAAYGKGLKL+S
Sbjct: 246 KKDHGRADALLIAAYGKGLKLNS 268
>gi|359479130|ref|XP_003632222.1| PREDICTED: uncharacterized protein LOC100242692 isoform 2 [Vitis
vinifera]
Length = 261
Score = 223 bits (567), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 117/242 (48%), Positives = 158/242 (65%), Gaps = 16/242 (6%)
Query: 1 METTVKIQPQLQSHCISMNTFVSSKIRTLCTY---PFAQHGRTLLVRAAAAGDTTKAAVA 57
M KI P+L F ++ + + T+ P T+L ++G
Sbjct: 18 MNPISKITPKLALSAPKFRVFCATSLNSTNTHDPIPAPSSTDTVL---RSSGKARARVSD 74
Query: 58 AQMKQNWLDSLTFPPLHVHD-----LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS 112
Q ++NWL SL+ P ++ + + ++NA +Q +GIDPD+SGALA+LK+ +GCS
Sbjct: 75 TQYRENWLASLSCPFPDENERPGIRMDSAESNAGTQCVIGIDPDISGALALLKTGDSGCS 134
Query: 113 AEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSG 172
A+VFD+PHL +LVGKRVRKRLDAKS++ LLR DAPIGT AY+EQSIP+PQDGKQGWWSG
Sbjct: 135 AQVFDSPHLQILVGKRVRKRLDAKSIVQLLRGFDAPIGTIAYIEQSIPFPQDGKQGWWSG 194
Query: 173 GFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRK 232
GFGYGLWIGILVASGFSVVP+PS WKN + LSG ++K V ++FP + + R+
Sbjct: 195 GFGYGLWIGILVASGFSVVPVPSSLWKNEFKLSGNGTSK-----VFLSIFPLVFPHIFRE 249
Query: 233 KD 234
+
Sbjct: 250 PN 251
>gi|242052573|ref|XP_002455432.1| hypothetical protein SORBIDRAFT_03g010690 [Sorghum bicolor]
gi|241927407|gb|EES00552.1| hypothetical protein SORBIDRAFT_03g010690 [Sorghum bicolor]
Length = 282
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 118/224 (52%), Positives = 144/224 (64%), Gaps = 7/224 (3%)
Query: 35 AQHGRTLLVRAAAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGID 94
A RT RA+A K A+ + WL SL+ P D A A + WA+G+D
Sbjct: 62 AATARTPRSRASAKA-RAKLLAEAEPRDPWLASLSLLPT---DDVAGADAAPNGWAIGVD 117
Query: 95 PDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAY 154
PD GA+AVL D S++VFD P + ++V + +RKRLD KS+I LLR LD P GTTAY
Sbjct: 118 PDTRGAIAVLSPD---GSSQVFDNPFVHIVVSEVIRKRLDTKSIIQLLRGLDVPPGTTAY 174
Query: 155 VEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDS 214
+E+S P+P DGK GWWS GF YGLWI LVASGFSVVPI S TWK +GLS + KDDS
Sbjct: 175 IEKSSPFPTDGKLGWWSTGFSYGLWIASLVASGFSVVPIASQTWKAHFGLSRSETPKDDS 234
Query: 215 RRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLKLDSSHLS 258
R+ AS LFP LK KK HG+A+A+L+AAYGKGL L S S
Sbjct: 235 RQAASILFPDKAQSLKLKKHHGRAEALLLAAYGKGLVLPSRKFS 278
>gi|218187952|gb|EEC70379.1| hypothetical protein OsI_01327 [Oryza sativa Indica Group]
Length = 287
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/204 (53%), Positives = 138/204 (67%), Gaps = 3/204 (1%)
Query: 58 AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFD 117
A + WL SL+ P+ + A A + WA+G+DPD GA+AVL D S++VFD
Sbjct: 87 ADTRDPWLASLSLLPVDDGEAAAAAAAASTGWAIGVDPDTGGAIAVLSPD---GSSQVFD 143
Query: 118 TPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYG 177
P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DGK GWWS GF YG
Sbjct: 144 NPFVHIVVSEFIRKRLDTKSIIELLRGLDAPPGTTAYIEKSSPFPTDGKLGWWSTGFSYG 203
Query: 178 LWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGK 237
LWI LVASGFSVVP+ S TWK ++GL+ S KD SR+ AS LFP L RKK HG+
Sbjct: 204 LWIAALVASGFSVVPVASQTWKAYFGLTRSGSPKDYSRQAASILFPDKALSLNRKKHHGR 263
Query: 238 ADAVLIAAYGKGLKLDSSHLSGEL 261
A+A+L+AAYGKGL L S S L
Sbjct: 264 AEALLLAAYGKGLVLPSGEFSKTL 287
>gi|212723270|ref|NP_001131429.1| uncharacterized protein LOC100192759 [Zea mays]
gi|194691492|gb|ACF79830.1| unknown [Zea mays]
gi|195609328|gb|ACG26494.1| hypothetical protein [Zea mays]
gi|414877006|tpg|DAA54137.1| TPA: hypothetical protein ZEAMMB73_409152 [Zea mays]
Length = 280
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 112/213 (52%), Positives = 139/213 (65%), Gaps = 6/213 (2%)
Query: 46 AAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLK 105
A+A K + + WL SL+ P TN W +G+DPD+ GA+AVL
Sbjct: 70 ASAKARAKLLAEVEARDPWLASLSLLPTDDIPSADATTNG---WVIGVDPDIGGAIAVLS 126
Query: 106 SDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDG 165
D S++VFD P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DG
Sbjct: 127 PD---GSSQVFDNPFVHIVVSEVIRKRLDTKSIIQLLRGLDAPPGTTAYIEKSSPFPTDG 183
Query: 166 KQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSL 225
KQGWWS GF YGLWI LVASGFSVVPI S TWK ++GL + KDDSR+ AS LFP
Sbjct: 184 KQGWWSTGFSYGLWIASLVASGFSVVPIASQTWKAYFGLMRSETPKDDSRQAASILFPDK 243
Query: 226 CSQLKRKKDHGKADAVLIAAYGKGLKLDSSHLS 258
LK KK HG+A+A+L+AAYGKGL L S S
Sbjct: 244 DQSLKLKKHHGRAEALLLAAYGKGLVLPSGKFS 276
>gi|115435874|ref|NP_001042695.1| Os01g0268900 [Oryza sativa Japonica Group]
gi|56783734|dbj|BAD81146.1| unknown protein [Oryza sativa Japonica Group]
gi|56783755|dbj|BAD81167.1| unknown protein [Oryza sativa Japonica Group]
gi|113532226|dbj|BAF04609.1| Os01g0268900 [Oryza sativa Japonica Group]
gi|222618167|gb|EEE54299.1| hypothetical protein OsJ_01234 [Oryza sativa Japonica Group]
Length = 287
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/204 (53%), Positives = 138/204 (67%), Gaps = 3/204 (1%)
Query: 58 AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFD 117
A + WL SL+ P+ + A A + WA+G+DPD GA+AVL D S++VFD
Sbjct: 87 ADTRDPWLASLSLLPVDDGEAAAAAAAASTGWAIGVDPDTGGAIAVLSPD---GSSQVFD 143
Query: 118 TPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYG 177
P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DGK GWWS GF YG
Sbjct: 144 NPFVHIVVSEFIRKRLDTKSIIELLRGLDAPPGTTAYIEKSSPFPTDGKLGWWSTGFSYG 203
Query: 178 LWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGK 237
LWI LVASGFSVVP+ S TWK ++GL+ S KD SR+ AS LFP L RKK HG+
Sbjct: 204 LWIAALVASGFSVVPVASQTWKAYFGLTRSGSPKDYSRQAASILFPDKALSLNRKKHHGR 263
Query: 238 ADAVLIAAYGKGLKLDSSHLSGEL 261
A+A+L+AAYGKGL L S S L
Sbjct: 264 AEALLLAAYGKGLVLPSGEFSKTL 287
>gi|215692956|dbj|BAG88376.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/204 (53%), Positives = 138/204 (67%), Gaps = 3/204 (1%)
Query: 58 AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFD 117
A + WL SL+ P+ + A A + WA+G+DPD GA+AVL D S++VFD
Sbjct: 67 ADTRDPWLASLSLLPVDDGEAAAAAAAASTGWAIGVDPDTGGAIAVLSPD---GSSQVFD 123
Query: 118 TPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYG 177
P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DGK GWWS GF YG
Sbjct: 124 NPFVHIVVSEFIRKRLDTKSIIELLRGLDAPPGTTAYIEKSSPFPTDGKLGWWSTGFSYG 183
Query: 178 LWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGK 237
LWI LVASGFSVVP+ S TWK ++GL+ S KD SR+ AS LFP L RKK HG+
Sbjct: 184 LWIAALVASGFSVVPVASQTWKAYFGLTRSGSPKDYSRQAASILFPDKALSLNRKKHHGR 243
Query: 238 ADAVLIAAYGKGLKLDSSHLSGEL 261
A+A+L+AAYGKGL L S S L
Sbjct: 244 AEALLLAAYGKGLVLPSGEFSKTL 267
>gi|357130450|ref|XP_003566861.1| PREDICTED: uncharacterized protein LOC100835936 [Brachypodium
distachyon]
Length = 285
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/211 (52%), Positives = 138/211 (65%), Gaps = 7/211 (3%)
Query: 64 WLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPV 123
WL SL+ P A + WALGIDPD GA+AVL D S++VFD P + +
Sbjct: 81 WLASLSLLPAD----DGEAAPAPTGWALGIDPDTGGAIAVLSPD---GSSQVFDNPFVHI 133
Query: 124 LVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGIL 183
+V + RKRLD KS+I LLR LDAP+GTTAY+E+S P+P DGK GWWS GF YGLWI L
Sbjct: 134 VVSEFTRKRLDTKSIIQLLRGLDAPLGTTAYIEKSSPFPTDGKLGWWSTGFSYGLWIAAL 193
Query: 184 VASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLI 243
VASGFSVVP+ S TWK ++GL+ S KDDSR+ A+ LFP LK KK HG+A+A+L+
Sbjct: 194 VASGFSVVPVASQTWKAYFGLTRSKSRKDDSRQAATILFPDKALSLKLKKHHGRAEALLL 253
Query: 244 AAYGKGLKLDSSHLSGELVPQRSEVVTTAQL 274
AAYGKGL L L + Q + + +L
Sbjct: 254 AAYGKGLVLPQEQLPTIIHKQETNLTVAGRL 284
>gi|302767670|ref|XP_002967255.1| hypothetical protein SELMODRAFT_67545 [Selaginella moellendorffii]
gi|300165246|gb|EFJ31854.1| hypothetical protein SELMODRAFT_67545 [Selaginella moellendorffii]
Length = 193
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 103/199 (51%), Positives = 139/199 (69%), Gaps = 6/199 (3%)
Query: 52 TKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGC 111
+K +A ++NWLDSLT P D+ A NA + +GIDPD GALAVL+ + +
Sbjct: 1 SKKVIAEISRKNWLDSLTLPLGGEGDVDAELDNA---FVIGIDPDFKGALAVLRLEDDRF 57
Query: 112 SAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWS 171
EVFD P L V +G +R R DAK+++ LL++++AP G+ AYVE S+P+ DGKQ WWS
Sbjct: 58 IPEVFDVPTLKVAIGGTMRLRPDAKAIVNLLKTINAPHGSVAYVENSLPHSTDGKQSWWS 117
Query: 172 GGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKR 231
GFG+G+W G L+ASG SVVP+ WK GLS S+KD+SR VAS+ FPS+ SQL R
Sbjct: 118 SGFGFGMWTGTLLASGISVVPVSPRAWKTAMGLS---SSKDESRLVASSCFPSIASQLTR 174
Query: 232 KKDHGKADAVLIAAYGKGL 250
KKDHG+A+A+L+AA+G+GL
Sbjct: 175 KKDHGRAEALLLAAFGRGL 193
>gi|302754086|ref|XP_002960467.1| hypothetical protein SELMODRAFT_71406 [Selaginella moellendorffii]
gi|300171406|gb|EFJ38006.1| hypothetical protein SELMODRAFT_71406 [Selaginella moellendorffii]
Length = 193
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 102/199 (51%), Positives = 138/199 (69%), Gaps = 6/199 (3%)
Query: 52 TKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGC 111
+K +A ++NWLDSLT P D+ A NA +GIDPD GALAVL+ + +
Sbjct: 1 SKKVIAEISRKNWLDSLTLPLGDEGDVDAELDNA---LVIGIDPDFKGALAVLRLEDDRF 57
Query: 112 SAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWS 171
EVFD P L V +G +R R DAK+++ LL++++AP G+ AYVE S+P+ DGKQ WWS
Sbjct: 58 IPEVFDVPTLKVAIGGTMRLRPDAKAIVNLLKTINAPHGSVAYVENSLPHSTDGKQSWWS 117
Query: 172 GGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKR 231
GFG+G+W G L+ASG SVVP+ WK GLS S+KD+SR VAS+ FP++ SQL R
Sbjct: 118 SGFGFGMWTGTLLASGISVVPVSPRAWKTAMGLS---SSKDESRLVASSCFPAIASQLTR 174
Query: 232 KKDHGKADAVLIAAYGKGL 250
KKDHG+A+A+L+AA+G+GL
Sbjct: 175 KKDHGRAEALLLAAFGRGL 193
>gi|168017937|ref|XP_001761503.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687187|gb|EDQ73571.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 191
Score = 193 bits (490), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 91/160 (56%), Positives = 120/160 (75%), Gaps = 4/160 (2%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
LGIDPD SGA+AVL+ + EV D P + + VGK +R+R DA+S++ L+ ++AP G
Sbjct: 16 LGIDPDASGAIAVLRDG----TQEVLDVPCVKIQVGKTMRRRHDARSIVDLVNKINAPEG 71
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
+ AYVEQ++P+P DGKQGW+ GFGYG+W+GIL+A GF VVP+ + WK+ G++G T
Sbjct: 72 SVAYVEQAMPFPMDGKQGWYGCGFGYGMWVGILMALGFKVVPVRAQVWKSAMGIAGKQYT 131
Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGL 250
KDDSR A LFP L QLKRKKDHG+ADA+LIAA+GKGL
Sbjct: 132 KDDSRATAMALFPDLGPQLKRKKDHGRADAILIAAFGKGL 171
>gi|297815494|ref|XP_002875630.1| hypothetical protein ARALYDRAFT_484823 [Arabidopsis lyrata subsp.
lyrata]
gi|297321468|gb|EFH51889.1| hypothetical protein ARALYDRAFT_484823 [Arabidopsis lyrata subsp.
lyrata]
Length = 199
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 117/189 (61%), Gaps = 5/189 (2%)
Query: 60 MKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNG-CSAEVFDT 118
MKQ L+ L++ + + A Q N W +GIDP LSGALAVLK D G CSA+V+DT
Sbjct: 1 MKQELLNPLSYSSPNYDE--AIQENFKPHWIIGIDPTLSGALAVLKFDDKGSCSAQVYDT 58
Query: 119 PHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGL 178
P V+V R R KSM+ L+R+LD P GT A+V + I Y D ++ G GYGL
Sbjct: 59 PRRQVVVQNIPRSRFKEKSMLELIRTLDVPSGTKAFVAKMILYEDDNIMAAYNAGLGYGL 118
Query: 179 WIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKA 238
W GIL+ S SV + TWK + LS ++D R++A +FPSL SQL R++D+ +A
Sbjct: 119 WTGILLTSNISVSSVAPSTWKKHFKLS--IYSRDGGRKLALKMFPSLSSQLTRRRDYARA 176
Query: 239 DAVLIAAYG 247
+A+LIAAYG
Sbjct: 177 NALLIAAYG 185
>gi|147856114|emb|CAN82440.1| hypothetical protein VITISV_006122 [Vitis vinifera]
Length = 1030
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 109/175 (62%), Gaps = 11/175 (6%)
Query: 1 METTVKIQPQLQSHCISMNTFVSSKIRTLCTY---PFAQHGRTLLVRAAAAGDTTKAAVA 57
M KI P+L F ++ + + T+ P T+L ++G
Sbjct: 18 MNPISKITPKLALSAPKFRVFCATSLNSTNTHDPIPAPSSTGTVL---RSSGKARVRVSD 74
Query: 58 AQMKQNWLDSLTFPPLHVHD-----LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS 112
Q ++NWL SL+ P ++ + + ++NA +Q +GIDPD+SGALA+LK+ +GCS
Sbjct: 75 TQYRENWLASLSCPFPDENERPGIRMDSAESNAGTQCVIGIDPDISGALALLKTGDSGCS 134
Query: 113 AEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ 167
A+VFD+PHL +LVGKRVRKRLDAKS++ LLR DAPIGT AY+EQSIP PQDGKQ
Sbjct: 135 AQVFDSPHLQILVGKRVRKRLDAKSIVQLLRGFDAPIGTIAYIEQSIPXPQDGKQ 189
>gi|255078906|ref|XP_002503033.1| predicted protein [Micromonas sp. RCC299]
gi|226518299|gb|ACO64291.1| predicted protein [Micromonas sp. RCC299]
Length = 386
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 95/159 (59%), Gaps = 2/159 (1%)
Query: 91 LGIDPDLSGALAVLK--SDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAP 148
LG D D GALAV++ S N S +V D P V R+RKRL + M+ L+ SL+ P
Sbjct: 78 LGCDSDAGGALAVVRGPSLGNVASVDVHDCPTYKAEVNGRIRKRLCVEDMVELVASLNIP 137
Query: 149 IGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
GT A++E+ Q + G+ +GLW G+L A+G V + WK+ GL+
Sbjct: 138 RGTVAFLEEGGVEYGFSAQTAFVQGYNFGLWKGVLAAAGLRVEVVKPQAWKHALGLARSG 197
Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
S+KDDSR +AS +FP + L RKKDHG+A+A+LIAAYG
Sbjct: 198 SSKDDSRDMASAMFPEVGDDLGRKKDHGRAEALLIAAYG 236
>gi|303278846|ref|XP_003058716.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459876|gb|EEH57171.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 114/213 (53%), Gaps = 7/213 (3%)
Query: 73 LHVHDLTANQTNADSQWAL-GIDPDLSGALAVLKSDHNGC--SAEVFDTPHLPVLVGKRV 129
+ + + A + W L G D D GALAV++ G S +V D P + V V R
Sbjct: 84 VAARERVVEEERAPNGWFLVGCDSDAGGALAVVRGPSTGVITSVDVIDAPTMKVEVNGRA 143
Query: 130 RKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS 189
R RLD +M +R L P GT A+VE+ Q + G+ +GLW G+L ++G +
Sbjct: 144 RVRLDVDAMCAAVRDLRLPPGTVAHVEEGGVEYGFSAQTAFVQGYNFGLWKGVLASAGLT 203
Query: 190 VVPIPSLTWKNWYGLS--GGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
VV + WK GL+ G +KD+SR +A LFP + LKRKKDHG+A+++LIAAYG
Sbjct: 204 VVVVKPQAWKWALGLARKGAGGSKDESRAMAKALFPEVEDALKRKKDHGRAESLLIAAYG 263
Query: 248 KGLKLDSSHLSGELVPQRSEVV-TTAQLLQKEE 279
+ ++ H+ G P + V A +++EE
Sbjct: 264 H-MAANARHVGGASDPLCARVRDMVAGAVEREE 295
>gi|308806632|ref|XP_003080627.1| unnamed protein product [Ostreococcus tauri]
gi|116059088|emb|CAL54795.1| unnamed protein product [Ostreococcus tauri]
Length = 312
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/159 (42%), Positives = 88/159 (55%), Gaps = 2/159 (1%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVF--DTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAP 148
+G D D +GAL VL G V D P LV R R RLD ++M ++RSL P
Sbjct: 67 VGCDSDGNGALCVLTGPSVGAIERVIVRDNPTRTELVNGRARPRLDPEAMTSVVRSLGVP 126
Query: 149 IGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
GTT Y+E+ Q + G+ +GLW G+ A+ V + WK GL
Sbjct: 127 KGTTVYLEEGGVEFGFSAQTAFVQGYNFGLWRGVFAAAELEVRVVKPQAWKAAMGLKSKE 186
Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
STKDDS +A TLFP + +LKRKKDHG+A+++LIAAYG
Sbjct: 187 STKDDSIAMAKTLFPEIAHELKRKKDHGRAESLLIAAYG 225
>gi|15229814|ref|NP_189977.1| uncharacterized protein [Arabidopsis thaliana]
gi|7594552|emb|CAB88119.1| putative protein [Arabidopsis thaliana]
gi|332644320|gb|AEE77841.1| uncharacterized protein [Arabidopsis thaliana]
Length = 195
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 102/168 (60%), Gaps = 10/168 (5%)
Query: 82 QTNADSQWALGIDPDLSGALAVLKSDHNG-CSAEVFDTPHLPVLVGKRVRKRLDAKSMIM 140
Q N W +GIDP+LSGALAVLK D G C A+V+DTP L V+V + + KSM+
Sbjct: 21 QENFKPHWIIGIDPNLSGALAVLKFDDKGSCFAQVYDTPQLEVVVQNIRTRSFNEKSMLE 80
Query: 141 LLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKN 200
L+RSLD P GT A+V + I +P++ ++ G G GLW L+ S SV+ + TW
Sbjct: 81 LIRSLDVPSGTKAFVAKYI-HPENAITA-YNDGLGCGLW--TLLTSSISVIYVTPSTWDK 136
Query: 201 WYGLSGGTSTKDDSRRVASTLFPSLCSQLKRK-KDHGKADAVLIAAYG 247
+ LS + D R++A +FPSL +L K D+ +A+A+LIAAYG
Sbjct: 137 HFNLS--IWSLDGGRKLALEMFPSL--KLTTKIDDNARANALLIAAYG 180
>gi|145349281|ref|XP_001419065.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579296|gb|ABO97358.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 195
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 90/174 (51%), Gaps = 2/174 (1%)
Query: 76 HDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGC--SAEVFDTPHLPVLVGKRVRKRL 133
+D A + D +G D D GAL V+ G S V D P V V R R RL
Sbjct: 3 NDADAEARSRDDWIVVGCDSDAHGALCVISGASVGSVRSVNVRDNPTSTVTVNGRARLRL 62
Query: 134 DAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPI 193
D +M +R+L P GT Y+E+ Q + G+ +GLW G+L A F V +
Sbjct: 63 DPDAMRGAVRALGVPKGTRVYLEEGGVEFGFSAQTAFVQGYNFGLWRGVLAAEEFEVRVV 122
Query: 194 PSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
WK GL STKDDS +A LFP + +LKRKKDHG+A+++LIAAYG
Sbjct: 123 KPQAWKAAMGLKFKGSTKDDSIAMAKMLFPEIAGELKRKKDHGRAESLLIAAYG 176
>gi|412991560|emb|CCO16405.1| predicted protein [Bathycoccus prasinos]
Length = 410
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 62/173 (35%), Positives = 91/173 (52%), Gaps = 13/173 (7%)
Query: 88 QWAL-GIDPDLSGALAVLKSDHNG--CSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRS 144
+WA+ G D D GALA+++ D G + E+ D P + V V + R RLD MI ++
Sbjct: 124 KWAVVGCDSDAGGALAIIRGDFAGDVQTVEIVDVPTMTVTVNGKKRIRLDVAKMIEKVKE 183
Query: 145 LDAPI------GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVA----SGFSVVPIP 194
LD P + +VE+ Q + G+ +GLW G+L + V +
Sbjct: 184 LDLPSDGNGGHSSVVFVEEGGVEFGFSAQTAFVQGYNFGLWKGVLESCVNPETTRVEVVK 243
Query: 195 SLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
WK GLSG ++KDDS +A +FP L+RKKDHG+A+++LIAAYG
Sbjct: 244 PQAWKLALGLSGKKTSKDDSIEMAKMVFPESEDMLQRKKDHGRAESLLIAAYG 296
>gi|374623246|ref|ZP_09695760.1| crossover junction endodeoxyribonuclease [Ectothiorhodospira sp.
PHS-1]
gi|373942361|gb|EHQ52906.1| crossover junction endodeoxyribonuclease [Ectothiorhodospira sp.
PHS-1]
Length = 159
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/159 (39%), Positives = 82/159 (51%), Gaps = 11/159 (6%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLV--GKRVRKRLDAKSMIMLLRSLDAP 148
LGIDP L+GA+A L SDH E+ + LP + RVR+ LDA + LLR +
Sbjct: 3 LGIDPGLTGAVAALASDHR---VELLED--LPTVARGSGRVRRELDAAGLAHLLRPIAGD 57
Query: 149 IGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
+ A VEQ P G +S G G G+L A G +V +P WK + G
Sbjct: 58 V-RLAVVEQVASRPGQGVASMFSLGHTAGAICGVLAALGIPLVLVPPAAWKKALNVPTG- 115
Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
S KD SR +A+ P L L RK+DH ADA+L+A YG
Sbjct: 116 SDKDMSRTIAARFHPDL--NLSRKRDHNMADALLLARYG 152
>gi|384254147|gb|EIE27621.1| hypothetical protein COCSUDRAFT_52223 [Coccomyxa subellipsoidea
C-169]
Length = 131
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 74/127 (58%), Gaps = 9/127 (7%)
Query: 159 IPYPQ--DGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRR 216
+P P +GK W++ G YG+W G L + GF V + S WK L+G K+ SR
Sbjct: 1 MPQPNRLNGKWSWYACGHSYGVWHGALASQGFKVETVTSRAWKIDLQLNG--KGKEGSRL 58
Query: 217 VASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLKLDSSHLSGELV---PQRSEVVTTAQ 273
+A LFPS LKRKKDHG+A+A+LIAA+ GL++ + SG+++ P + +
Sbjct: 59 LAIRLFPSAAELLKRKKDHGRAEAMLIAAWALGLRIQAD--SGDVIVAPPVEPQEASLDD 116
Query: 274 LLQKEEL 280
LL +E++
Sbjct: 117 LLSEEDV 123
>gi|340779655|ref|ZP_08699598.1| hypothetical protein AaceN1_17488 [Acetobacter aceti NBRC 14818]
Length = 157
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 55/154 (35%), Positives = 81/154 (52%), Gaps = 8/154 (5%)
Query: 93 IDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTT 152
IDP SGA+A+L SD G E+FD P + V V K+ R + A + +LR+ D
Sbjct: 6 IDPGNSGAIAIL-SDA-GDLIEIFDMPTVAVTVAKKQRNIVSAPLLATMLRAHDP---AE 60
Query: 153 AYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKD 212
++E+ P +G G ++ G G G+ G+ A G P+ ++T W +G + K
Sbjct: 61 IWLEKVSTRPGEGAVGAFAFGRGVGVIEGVAAALG---KPLSTVTPAVWKKATGCPADKG 117
Query: 213 DSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAY 246
+R A LFP+ Q KR KD G+A+A LI AY
Sbjct: 118 GARARAMQLFPTAADQFKRVKDDGRAEAALIGAY 151
>gi|262277495|ref|ZP_06055288.1| crossover junction endodeoxyribonuclease [alpha proteobacterium
HIMB114]
gi|262224598|gb|EEY75057.1| crossover junction endodeoxyribonuclease [alpha proteobacterium
HIMB114]
Length = 165
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 80/157 (50%), Gaps = 8/157 (5%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRS-LDAPI 149
GIDP +SGAL++ + N EV D P + GK+ +K+++A + + ++ ++
Sbjct: 4 FGIDPGVSGALSIYE---NKKLKEVIDMPTMSE--GKKNKKQINAAQLSLEIKQRINGSS 58
Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
T +E P G ++ G +G+ GI A + + + WK + L G S
Sbjct: 59 ETAVVIEHVTAMPGQGVTSMFNFGQSFGVIKGICSALTLPIYFVRPVKWKKHFNLIG--S 116
Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAY 246
+KD SR A LFPS + + +KKD KADA+LI Y
Sbjct: 117 SKDASRTKAIELFPSFSNDMAKKKDANKADAILIGRY 153
>gi|27379639|ref|NP_771168.1| hypothetical protein blr4528 [Bradyrhizobium japonicum USDA 110]
gi|27352791|dbj|BAC49793.1| blr4528 [Bradyrhizobium japonicum USDA 110]
Length = 158
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 78/157 (49%), Gaps = 15/157 (9%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
LGIDP SGA+A H E +P+ G+ +DA + L R + A
Sbjct: 6 LGIDPGASGAIAFYFPSHP----ERVAVEDMPIAAGE-----VDA---VNLARRVAAMAP 53
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
++E+ P G + G YG+ +G++ G + +P+ +T W G +S
Sbjct: 54 DLVFLERVNAMPGQGVSSTFKFGRAYGVVLGVI---GAAALPLHLVTPAKWKGHLRLSSD 110
Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
K+++R +A LFP+ KRKKDHG+A+A LIA YG
Sbjct: 111 KEEARALALRLFPACADHFKRKKDHGRAEAALIARYG 147
>gi|307107556|gb|EFN55798.1| hypothetical protein CHLNCDRAFT_57716 [Chlorella variabilis]
Length = 361
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 2/105 (1%)
Query: 148 PIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGG 207
P A VE S+P GK W+ GF GL G+L A G +P+ TWK GL
Sbjct: 228 PPVVRAVVEYSMPGHVCGKYAWYGIGFAVGLLNGLLAAQGIPYHRVPASTWKQAMGLK-- 285
Query: 208 TSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLKL 252
KD S +A L P+ LKRKKDHG+A+A+L+AA+ G+++
Sbjct: 286 KQGKDGSIALARHLLPTASIFLKRKKDHGRAEALLMAAWTLGIRM 330
>gi|71083298|ref|YP_266017.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
ubique HTCC1062]
gi|71062411|gb|AAZ21414.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
ubique HTCC1062]
Length = 164
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 78/159 (49%), Gaps = 10/159 (6%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPI- 149
+GIDP ++G++ +G +V + P++P GK+ +K+++ + + S I
Sbjct: 4 IGIDPGITGSICFF---QDGKITDVIEMPNMPE--GKKNKKQVNGAQIYYEISSRIKDIK 58
Query: 150 --GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGG 207
+EQ P G ++ G +G+ GI A + + WK ++ L
Sbjct: 59 KEDIKVIIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLPMYFVRPTKWKKYFNLI-- 116
Query: 208 TSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAY 246
S KD SR A +FP S+L RKKD KADA+LIA++
Sbjct: 117 NSEKDASRTKAIEVFPYFSSELSRKKDSNKADAILIASF 155
>gi|330814399|ref|YP_004358638.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
sp. IMCC9063]
gi|327487494|gb|AEA81899.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
sp. IMCC9063]
Length = 165
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 77/158 (48%), Gaps = 8/158 (5%)
Query: 90 ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPI 149
GIDP +SGA++V + FD P + + GK+ +++++A + ++S
Sbjct: 3 VFGIDPGVSGAISVFEDKK---ITTFFDMPTM--IEGKKNKRQVNAAQLAHEIKSRVVDR 57
Query: 150 GTTAY-VEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
+A +EQ P G ++ G +G+ GI A G + I + WK + L G
Sbjct: 58 DNSAVLIEQVGAMPGQGVTSMFNFGQSFGVIKGICSALGIPIYFIRPVKWKKHFNLIG-- 115
Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAY 246
+ KD SR +FP L QL +KKD K DA+LI Y
Sbjct: 116 TEKDASRTKVIEMFPDLSPQLTKKKDSNKTDAILIGKY 153
>gi|91762272|ref|ZP_01264237.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
ubique HTCC1002]
gi|91718074|gb|EAS84724.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
ubique HTCC1002]
Length = 164
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 78/159 (49%), Gaps = 10/159 (6%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPI- 149
+GIDP ++G++ +G +V + P++P GK+ +K+++ + + S I
Sbjct: 4 IGIDPGITGSICFF---QDGKIIDVIEMPNMPE--GKKNKKQVNGAQIYYEISSKIKDIK 58
Query: 150 --GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGG 207
+EQ P G ++ G +G+ GI A + + WK ++ L
Sbjct: 59 KEDIKVIIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLPMYFVRPTKWKKYFNLI-- 116
Query: 208 TSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAY 246
S KD SR A +FP S+L RKKD KADA+LIA++
Sbjct: 117 NSEKDASRTKAIEVFPYFSSELSRKKDSNKADAILIASF 155
>gi|371496244|ref|YP_004957392.1| unnamed protein product [Planktothrix phage PaV-LD]
gi|325587611|gb|ADZ31626.1| crossover junction endodeoxyribonuclease [Planktothrix phage
PaV-LD]
Length = 159
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 79/166 (47%), Gaps = 17/166 (10%)
Query: 84 NADSQWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDA---KSMIM 140
N ++ +GIDP +GA+ + NG + D P + + R L A K MI
Sbjct: 3 NIMTKTFIGIDPGATGAVCRIS---NG-EVKFLDCPIIKIGSKIRPNPTLMASGLKEMIT 58
Query: 141 LLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKN 200
L T +E+ P+ G ++ G G+G+WIGI+ A G + + WK
Sbjct: 59 L--------NTHLIIEKVHAMPKQGVSSTFNFGMGFGIWIGIIAALGIPMEFVTPQAWKK 110
Query: 201 WYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAY 246
Y L G K SR +A +FP + LK +K HG+A+A+L+A Y
Sbjct: 111 HYSLIG--KDKKASRVMALQIFPGETNNLKLEKHHGRAEALLLAEY 154
>gi|406706639|ref|YP_006756992.1| Lactococcus phage M3 protein [alpha proteobacterium HIMB5]
gi|406652415|gb|AFS47815.1| Lactococcus phage M3 protein [alpha proteobacterium HIMB5]
Length = 164
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 79/160 (49%), Gaps = 12/160 (7%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDA----KSMIMLLRSLD 146
+GIDP +SG++ NG +V + P + GK+ +K+++ ++ +++ D
Sbjct: 4 IGIDPGISGSICFF---DNGRIIKVIEMPTMTD--GKKNKKQVNGAQIYHEILEGIKNYD 58
Query: 147 APIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSG 206
+EQ P G ++ G +G+ GI A S+ + WK ++ L
Sbjct: 59 KK-DIKIVIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLSMYFVRPAKWKKYFNLI- 116
Query: 207 GTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAY 246
S KD SR A +FP SQL +KKD KADA+LIA++
Sbjct: 117 -NSEKDASRTRAIEIFPYFSSQLSKKKDSNKADAILIASF 155
>gi|254455456|ref|ZP_05068885.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
sp. HTCC7211]
gi|207082458|gb|EDZ59884.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
sp. HTCC7211]
Length = 164
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 79/169 (46%), Gaps = 13/169 (7%)
Query: 90 ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLV-GKRVRKRLDAKSM---IMLLRSL 145
+GIDP +SG++ + ++ D +P ++ GK+ +K+++ + I
Sbjct: 3 VIGIDPGISGSICFFQD------GKIIDVVEMPTMIEGKKNKKQVNGSQIFNEISYRIKK 56
Query: 146 DAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLS 205
+EQ P G ++ G +G+ GI A + + WK ++ L
Sbjct: 57 IDKKDIKVVIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLPMYFVRPAKWKKYFNLI 116
Query: 206 GGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAA-YGKGLKLD 253
S KD SR A +FP SQL RKKD KADA+L+A+ Y + KLD
Sbjct: 117 --NSEKDASRTRAIEVFPYFSSQLSRKKDSNKADAILLASFYYETYKLD 163
>gi|299534268|ref|ZP_07047617.1| hypothetical protein CTS44_25676 [Comamonas testosteroni S44]
gi|298717726|gb|EFI58734.1| hypothetical protein CTS44_25676 [Comamonas testosteroni S44]
Length = 166
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 81/165 (49%), Gaps = 13/165 (7%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKR--VRKRLDAKSMI-MLLRSLD 146
+GIDP L+GA AV+ DHNG A +FD P +PV G++ V++++D K+ +LL+
Sbjct: 4 IGIDPGLTGACAVM--DHNGVRA-IFDLPTMPVPGAGEKALVKRKIDGKAFCQLLLKHCP 60
Query: 147 APIGTTAYVEQSIPYPQDGKQGWWSGG---FGYGLWIGILVASGFSVVPIPSLTWKNWYG 203
A G V + + + G G +L G+ + +WK +
Sbjct: 61 AGEGKPLAVLEKVNTMGGANNAVQTQGSLMRSLGAIESVLECLGYPTEHVAPQSWKKLFR 120
Query: 204 LSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGK 248
L S K + +A L P L R+KDH +A+AVL+A +GK
Sbjct: 121 LG---SEKTKALEMARKLHPEAAGNLTRQKDHNRAEAVLLAHWGK 162
>gi|414088500|ref|YP_006988678.1| putative RNaseH-like domain protein [Caulobacter phage CcrColossus]
gi|408731870|gb|AFU88314.1| putative RNaseH-like domain protein [Caulobacter phage CcrColossus]
Length = 168
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 73/157 (46%), Gaps = 12/157 (7%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
LGIDP ++GA+A++ +D S + D P P GK +A S R DA
Sbjct: 5 LGIDPGVNGAIALIDTDD--WSICITDMPREPGKGGK------NAVSPAGCARIFDATAP 56
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
++E PQ G F +G +GIL + S + + ++W +
Sbjct: 57 DYTFIEDVWSSPQMGV----VSAFSFGRNLGILEGAAASRSILTKVRPQDWKAATKTPKD 112
Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
K+++RR A LFP KR KD G+A+A +++ YG
Sbjct: 113 KNEARRRAMQLFPCAYDLFKRVKDDGRAEAAILSFYG 149
>gi|301060903|ref|ZP_07201705.1| conserved hypothetical protein [delta proteobacterium NaphS2]
gi|300444946|gb|EFK08909.1| conserved hypothetical protein [delta proteobacterium NaphS2]
Length = 153
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 72/159 (45%), Gaps = 18/159 (11%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
+G+DP +GA A++ + +V D P P L+ R L ++M
Sbjct: 7 IGVDPGKTGAAALITDE---GIHDVLDYPGDPSLIVATFRGWLLNYKIVM---------- 53
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS-VVPIPSLTWKNWYGLSGGTS 209
A +E+ P G ++ G G W GI+ A G ++P P K G
Sbjct: 54 --AALEKVSARPGQGVVSMFTFGRNLGAWEGIISAFGIPFMMPTPRQWQKGLIDQKAGND 111
Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGK 248
TK S A LFP ++L RKKDHG+ADA+L+A + +
Sbjct: 112 TKARSLNTARRLFPD--AELARKKDHGRADALLLAFWAR 148
>gi|418846730|ref|ZP_13401496.1| crossover junction endodeoxyribonuclease [Salmonella enterica
subsp. enterica serovar Newport str. CVM 19443]
gi|392809773|gb|EJA65806.1| crossover junction endodeoxyribonuclease [Salmonella enterica
subsp. enterica serovar Newport str. CVM 19443]
Length = 154
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 75/162 (46%), Gaps = 13/162 (8%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKRVRKRLDAKSMIMLLRSLDAPI 149
LGIDP SG+L VL ++ G D +P + VG + R++ ++ LR
Sbjct: 5 LGIDPGCSGSL-VLITEQGG----YIDHLAMPTIKVG--TKSRVNGAAVAAWLRKYGI-- 55
Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
T AY+EQ P G ++ G G+ GIL + WK GL G S
Sbjct: 56 -THAYLEQVGAMPGQGTASMFTFGHAAGVAEGILQGLNIPYTLVTPQAWKKSAGLIG--S 112
Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLK 251
KD +R A L+P L + + K ADA+LIA +G G+K
Sbjct: 113 DKDAARSRAIQLYPELRALDAKAKGQAIADALLIARHGIGIK 154
>gi|30387422|ref|NP_848251.1| crossover junction endodeoxyribonuclease [Enterobacteria phage
epsilon15]
gi|30266077|gb|AAO06106.1| crossover junction endodeoxyribonuclease [Salmonella phage
epsilon15]
Length = 154
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 73/162 (45%), Gaps = 13/162 (8%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKRVRKRLDAKSMIMLLRSLDAPI 149
LGIDP SG+L ++ D +P + VG + R++ ++ +R +
Sbjct: 5 LGIDPGCSGSLVLITEQ-----GHYIDHLAMPTIKVG--TKSRVNGAAVAAWVRQY---V 54
Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
T AY+EQ P G ++ G G+ GIL + WK GL G S
Sbjct: 55 ITHAYLEQVGAMPGQGTASMFTFGHAAGVAEGILQGLNIPYTLVTPQAWKKSAGLIG--S 112
Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLK 251
KD +R A L+P L + + K ADA+LIA +G G+K
Sbjct: 113 DKDAARSRAIQLYPELRALDAKAKGQAIADALLIARHGIGIK 154
>gi|301059600|ref|ZP_07200512.1| conserved hypothetical protein [delta proteobacterium NaphS2]
gi|300446365|gb|EFK10218.1| conserved hypothetical protein [delta proteobacterium NaphS2]
Length = 153
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 20/160 (12%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLP-VLVGKRVRKRLDAKSMIMLLRSLDAPI 149
+GIDP +GA A++ + + E+ D P P ++V K +L +++ L + A
Sbjct: 7 IGIDPGKTGAAALITDE---GTHEILDYPGDPSLIVAKFTEWKLHHHIVMVALEKVSA-- 61
Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS-VVPIPSLTWKNWYGLSGGT 208
P G +S G G W GI+ A G ++P P + G
Sbjct: 62 -----------RPGQGVVSMFSFGRNLGTWEGIISAFGIPFMMPTPRQWQQGLIDQKAGG 110
Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGK 248
K S A LFP ++L RKKDHG+ADA+L+A + +
Sbjct: 111 DPKARSLNTARRLFPD--AELSRKKDHGRADALLMAFWAR 148
>gi|301060290|ref|ZP_07201155.1| conserved hypothetical protein [delta proteobacterium NaphS2]
gi|300445603|gb|EFK09503.1| conserved hypothetical protein [delta proteobacterium NaphS2]
Length = 153
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 73/160 (45%), Gaps = 20/160 (12%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLP-VLVGKRVRKRLDAKSMIMLLRSLDAPI 149
+G+DP +GA A++ + + E+ D P P ++V K +L +++ L + A
Sbjct: 7 IGVDPGKTGAAALITDE---GTHEILDYPGDPSLIVAKFTEWKLHHHIVMVALEKVSA-- 61
Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS-VVPIPSLTWKNWYGLSGGT 208
P G +S G G W GI+ A G ++P P + G
Sbjct: 62 -----------RPGQGVVSMFSFGRNLGTWEGIISAFGIPFMMPTPRQWQQGLIDQKAGG 110
Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGK 248
K S A LFP ++L RKKDHG+ADA+L+A + +
Sbjct: 111 DPKARSLNTARRLFPD--AELSRKKDHGRADALLMAFWAR 148
>gi|15229812|ref|NP_189976.1| uncharacterized protein [Arabidopsis thaliana]
gi|7594551|emb|CAB88118.1| putative protein [Arabidopsis thaliana]
gi|332644319|gb|AEE77840.1| uncharacterized protein [Arabidopsis thaliana]
Length = 84
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 17/94 (18%)
Query: 23 SSKIRTLCTYPFAQHGRTLLVRAAAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQ 82
SSK+++ C+ F+ H K A MKQ L+ L+ + ++ Q
Sbjct: 4 SSKLKSFCSRYFSSH--------------EKDKYKALMKQKLLNPLSSSSPNYDEMI--Q 47
Query: 83 TNADSQWALGIDPDLSGALAVLKSDHNG-CSAEV 115
N W +GIDP+LSGALAVLK D G CSA+V
Sbjct: 48 ENFKPHWIIGIDPNLSGALAVLKFDDKGSCSAQV 81
>gi|126444857|ref|YP_001063354.1| hypothetical protein BURPS668_A2360 [Burkholderia pseudomallei 668]
gi|167907357|ref|ZP_02494562.1| hypothetical protein BpseN_34325 [Burkholderia pseudomallei NCTC
13177]
gi|126224348|gb|ABN87853.1| conserved hypothetical protein [Burkholderia pseudomallei 668]
Length = 164
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 73/164 (44%), Gaps = 12/164 (7%)
Query: 90 ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPI 149
ALGIDP + GALA L DHNG + D P P +VR +DA ++ LR L P
Sbjct: 4 ALGIDPGIRGALAAL--DHNG-HLRIADMPTRPKQGNGKVRNEIDAAALQRQLREL-IPA 59
Query: 150 GTTAYV--EQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGL--S 205
A V E + Q S + + SG++ + W+ +YG+ S
Sbjct: 60 DEQAIVVMEALNTFAGGSVQTMASLEATKAVIATVCELSGYTPAIVTPQCWQRFYGIKRS 119
Query: 206 GGTSTKDDSRRVASTLF-PSLCSQLKRKKDHGKADAVLIAAYGK 248
TK S R+A L+ P+ K G+ADA LIA + +
Sbjct: 120 ASEDTKAQSLRLARDLYGPAF---FPLAKHDGRADATLIARWAQ 160
>gi|455641397|gb|EMF20568.1| Lactococcus phage M3 protein [Citrobacter freundii GTC 09479]
Length = 154
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 71/162 (43%), Gaps = 13/162 (8%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKRVRKRLDAKSMIMLLRSLDAPI 149
LGIDP SG+L ++ D +P + VG + R++ ++ +R
Sbjct: 5 LGIDPGCSGSLVLITEQ-----GHYIDHLAMPTIKVG--TKSRVNGAAVAAWVRKYGI-- 55
Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
T AY+E P G ++ G G+ GIL + WK GL G S
Sbjct: 56 -THAYLEHVGAMPGQGTASMFTFGHAAGVAEGILQGLNIPYTLVTPQAWKKSAGLIG--S 112
Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLK 251
KD +R A L+P L + + K ADA+LIA +G G+K
Sbjct: 113 DKDAARSRAIQLYPELRALDAKAKGQAIADALLIARHGIGIK 154
>gi|429100617|ref|ZP_19162591.1| crossover junction endodeoxyribonuclease [Cronobacter turicensis
564]
gi|426287266|emb|CCJ88704.1| crossover junction endodeoxyribonuclease [Cronobacter turicensis
564]
Length = 155
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 71/161 (44%), Gaps = 11/161 (6%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
+GIDP SGAL + D G + P + V + R++ + L+S
Sbjct: 5 IGIDPGCSGALVAV--DETGEYVAHLNMPTIKV----GSKARVNGAQLAAWLQSWSI--- 55
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
+ AY+EQ P G ++ G G+ GIL + + WK L G S
Sbjct: 56 SHAYLEQVGALPGQGTASMFTFGHAAGIAEGILQGAYIPYTLVTPQAWKKAASLIG--SD 113
Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLK 251
KD +R A L+P+L + + K ADA+LIA +G LK
Sbjct: 114 KDAARSRAIQLYPALRALDAKAKGQAIADALLIARHGLMLK 154
>gi|414088027|ref|YP_006988210.1| putative RNaseH-like domain protein [Caulobacter phage phiCbK]
gi|414088833|ref|YP_006989006.1| putative RNaseH-like domain protein [Caulobacter phage CcrMagneto]
gi|414089557|ref|YP_006989709.1| putative RNaseH-like domain protein [Caulobacter phage CcrKarma]
gi|414089907|ref|YP_006990054.1| putative RNaseH-like domain protein [Caulobacter phage CcrSwift]
gi|398256506|gb|AFO71520.1| crossover junction endodeoxyribonuclease [Caulobacter phage phiCbK]
gi|408730699|gb|AFU87146.1| putative RNaseH-like domain protein [Caulobacter phage phiCbK]
gi|408731048|gb|AFU87494.1| putative RNaseH-like domain protein [Caulobacter phage CcrMagneto]
gi|408731401|gb|AFU87846.1| putative RNaseH-like domain protein [Caulobacter phage CcrKarma]
gi|408732196|gb|AFU88639.1| putative RNaseH-like domain protein [Caulobacter phage CcrSwift]
Length = 171
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 79/164 (48%), Gaps = 22/164 (13%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
+G+DP ++G + +L D C+ V D P P G+R L + +++ + AP
Sbjct: 4 IGVDPGIAGGVVLL--DTKACTLAVHDMPTEPSTKGRR----LASSALLTVFLHDVAP-- 55
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
++E+ P +G G +S G G G G VA+G + P+ +T + W + T T
Sbjct: 56 DHIFLEEVGVRPGEGAVGAFSFGRGMGRLEG--VAAG-TRTPLTMVTPQEWKRV---TKT 109
Query: 211 KDDSRRV---ASTLFPSLCSQL----KRKKDHGKADAVLIAAYG 247
D +R A LFP C +L + + G+A+A LIA YG
Sbjct: 110 PADKKRAIARAYQLFPR-CVKLFQGPRGGEKDGRAEAALIAFYG 152
>gi|414089188|ref|YP_006989356.1| putative RNaseH-like domain protein [Caulobacter phage CcrRogue]
gi|408730361|gb|AFU86809.1| putative RNaseH-like domain protein [Caulobacter phage CcrRogue]
Length = 171
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 73/164 (44%), Gaps = 22/164 (13%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
+G+DP +SG + +L D CS V D P P G +RL + +++ + P
Sbjct: 4 IGVDPGISGGVVLL--DTEACSLTVADMPTEPSTKG----RRLASSALLTVFLHDVQP-- 55
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
++E+ P +G G +S G G G G+ + ++ + WK T T
Sbjct: 56 DHIFLEEVGVRPGEGAVGAFSFGRGLGRLEGVAAGTRTALTTVTPQEWKRV------TKT 109
Query: 211 KDDSRRV---ASTLFPSLCSQL----KRKKDHGKADAVLIAAYG 247
D +R A LFP C +L + + G+A+A LIA YG
Sbjct: 110 PADKKRAIARAYQLFPR-CVKLFQGPRGGEKDGRAEAALIAFYG 152
>gi|255090110|ref|XP_002506976.1| predicted protein [Micromonas sp. RCC299]
gi|226522250|gb|ACO68234.1| predicted protein [Micromonas sp. RCC299]
Length = 206
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 37/76 (48%), Gaps = 9/76 (11%)
Query: 179 WIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTL-------FPSLCSQLKR 231
W G+L + G V + + WKN GL+ KD SR +A L P L K
Sbjct: 55 WHGVLASEGLDVRSVEPMRWKNALGLTA--RDKDGSRDMAVELCRELWPQCPDLQDASKL 112
Query: 232 KKDHGKADAVLIAAYG 247
KK H +A A LIAAYG
Sbjct: 113 KKHHRRAAAFLIAAYG 128
>gi|429120775|ref|ZP_19181439.1| hypothetical protein BN126_2357 [Cronobacter sakazakii 680]
gi|426324790|emb|CCK12176.1| hypothetical protein BN126_2357 [Cronobacter sakazakii 680]
Length = 140
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 47/99 (47%), Gaps = 2/99 (2%)
Query: 153 AYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKD 212
AY+EQ P G ++ G G+ GIL + + WK GL G S KD
Sbjct: 43 AYLEQVGAMPGQGTASMFTFGHAAGIAEGILQGAHIPYTLVTPQAWKKAAGLIG--SDKD 100
Query: 213 DSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYGKGLK 251
+R A L+P+L + + K ADA+LIA +G LK
Sbjct: 101 AARSRAIQLYPALRALDAKAKGQAIADALLIARHGLMLK 139
>gi|117530297|ref|YP_851140.1| hypothetical protein MaLMM01_gp126 [Microcystis phage Ma-LMM01]
gi|117165909|dbj|BAF36217.1| hypothetical protein [Microcystis phage Ma-LMM01]
Length = 181
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 79/185 (42%), Gaps = 30/185 (16%)
Query: 88 QWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDA 147
++ GIDP L+G L +L+ G + V D +PV K +D M ++ L
Sbjct: 7 KFICGIDPGLAGGLCILQ---RGYTYNVVDVRVMPV-----SDKDID---MGVIKNYLSC 55
Query: 148 PIGTTAYVEQSI---PYPQDGK---QGWWSGG---FGYGLWIGILVASGFSVVPIPSLTW 198
+E+ I P +GK QG S F YG IG+L G + V + S W
Sbjct: 56 YKPHLTIIERQIVVRPRMVEGKMQRQGLGSSAKTMFNYGRLIGLLEGIGLNYVVVDSREW 115
Query: 199 KNWYG-------LSGGTSTKDDSRRVASTLFPS---LCSQLKRKKDHGKADAVLIAAYGK 248
++ Y + TK S +A L+P+ + L K G A+AVL+A YG
Sbjct: 116 QSHYADMAPPIKWTYNKPTKTRSVSIAVDLYPAANLFRTPLCHKPHDGMAEAVLLAHYGA 175
Query: 249 GLKLD 253
+ D
Sbjct: 176 EVVYD 180
>gi|406871771|gb|EKD22498.1| Crossover junction endodeoxyribonuclease [uncultured bacterium]
Length = 151
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 66/157 (42%), Gaps = 12/157 (7%)
Query: 91 LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
+GIDP SGAL VL+ + P + + RV A L AP+
Sbjct: 6 IGIDPGSSGALVVLEDTK---PVAWLNMPLIKIGTNTRVNA---AHIADFLAPHKGAPV- 58
Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
++E P+ G ++ G G+ G++ +G + + WK GL G +
Sbjct: 59 ---FIELVHIMPKQGAGSAFTFGHAAGVVQGLVQGAGHPLTMVTPQKWKKSAGLIG--TE 113
Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGKADAVLIAAYG 247
KD +R A L+P + + K ADA LIA YG
Sbjct: 114 KDVARSTAILLWPEWRALDTKAKGGALADAALIARYG 150
>gi|341882819|gb|EGT38754.1| hypothetical protein CAEBREN_01743 [Caenorhabditis brenneri]
Length = 896
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 32/149 (21%), Positives = 60/149 (40%), Gaps = 25/149 (16%)
Query: 30 CTYPFAQHGRTLLVRAAAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQW 89
CT F +H +RA A D + Q ++ L + + H+H+ + N + +
Sbjct: 264 CTDKFPKHA---TLRAVAYVDQRRIGEFVQHREEQLQAHDY---HIHE----ENNPEGEQ 313
Query: 90 ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIM--------- 140
++P S ++ D G + V HL V+ G KR D + +I+
Sbjct: 314 VFSVNPPSSRSIVCSVVDIRGAGSAVLTPVHLEVVAG----KRFDLQFLILTRGTYDVYS 369
Query: 141 --LLRSLDAPIGTTAYVEQSIPYPQDGKQ 167
D +G YV+ +P+ +D ++
Sbjct: 370 KKFFEIRDLQLGDLVYVDIIVPFHEDVRK 398
>gi|312880395|ref|ZP_07740195.1| conserved hypothetical protein [Aminomonas paucivorans DSM 12260]
gi|310783686|gb|EFQ24084.1| conserved hypothetical protein [Aminomonas paucivorans DSM 12260]
Length = 163
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 72/157 (45%), Gaps = 8/157 (5%)
Query: 92 GIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGT 151
GIDP L GA+A+L++D + E +P+ + R+ +D ++ L ++A
Sbjct: 5 GIDPGLRGAVAILEADGSPWRIE-----DMPLRSLGKTRQEVDGAALARFL--VEAGEIR 57
Query: 152 TAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTK 211
A VE P G +S G G+ +G+L A G ++ + TW+ K
Sbjct: 58 LAVVEAVHAMPGQGVSSCFSFGRNVGVVVGVLEALGVPLLEVSPRTWQGDVLRGTPGDGK 117
Query: 212 DDSRRVASTLFPSLCSQLKRKKD-HGKADAVLIAAYG 247
+ + R AS FP R K G+ADA+ +A YG
Sbjct: 118 ERALRWASLAFPGASLFTPRGKGLDGRADALALAWYG 154
>gi|158337376|ref|YP_001518551.1| hypothetical protein AM1_4255 [Acaryochloris marina MBIC11017]
gi|158307617|gb|ABW29234.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 170
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 63/162 (38%), Gaps = 18/162 (11%)
Query: 88 QWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDA 147
Q + IDP L G +A +H A+ T K LD ++ L ++A
Sbjct: 11 QSLIAIDPGLKGGIAF--QNHQATIAKPLPTSG----------KDLD---LVTLATWVEA 55
Query: 148 PIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGG 207
A VE+ P+ G + G GYG +GI A V + WK L+G
Sbjct: 56 AQPQLAIVEKVGAMPKQGVASTFKFGSGYGGLLGICAALQIPVELVTPQRWKRVV-LAGT 114
Query: 208 TSTKDDSRRVASTLFP--SLCSQLKRKKDHGKADAVLIAAYG 247
T KD + FP L RK G ADA+ + +G
Sbjct: 115 TKDKDAAIAYCRRAFPDIELIQPGCRKPHDGVADALCLLKFG 156
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.131 0.399
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,567,579,123
Number of Sequences: 23463169
Number of extensions: 193434351
Number of successful extensions: 452768
Number of sequences better than 100.0: 64
Number of HSP's better than 100.0 without gapping: 43
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 452639
Number of HSP's gapped (non-prelim): 67
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)