BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025662
         (249 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255580004|ref|XP_002530836.1| conserved hypothetical protein [Ricinus communis]
 gi|223529600|gb|EEF31549.1| conserved hypothetical protein [Ricinus communis]
          Length = 280

 Score =  273 bits (697), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 132/192 (68%), Positives = 158/192 (82%), Gaps = 8/192 (4%)

Query: 52  TKAAVAAQMKQNWLDSLTFPPLHVHDLTANQ-------TNADSQWALGIDPDLSGALAVL 104
           ++A VA Q+K+NWL SL+ P  +  + +A++       +N  S W +GIDPDLSGALA+L
Sbjct: 62  SEAEVARQLKENWLHSLSCP-FNQTESSASKGVDSTAPSNVGSNWVIGIDPDLSGALALL 120

Query: 105 KSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQD 164
           K D +GCSA+VFD+PHL VLVGKR+RKRLDAKS++ LL S DAP+GTTAY+EQSIP+PQD
Sbjct: 121 KIDDSGCSAQVFDSPHLKVLVGKRIRKRLDAKSIVQLLHSFDAPLGTTAYIEQSIPFPQD 180

Query: 165 GKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPS 224
           GKQGWWSGGFGYGLWIGILVASGFSVVP+PSL WKN + LSG   +KDDSR+VASTLFPS
Sbjct: 181 GKQGWWSGGFGYGLWIGILVASGFSVVPVPSLAWKNVFELSGSKFSKDDSRKVASTLFPS 240

Query: 225 LCSQLKRKKDHG 236
           L S LKRKKDHG
Sbjct: 241 LSSLLKRKKDHG 252


>gi|225435862|ref|XP_002263460.1| PREDICTED: uncharacterized protein LOC100242692 isoform 1 [Vitis
           vinifera]
          Length = 291

 Score =  264 bits (675), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 134/244 (54%), Positives = 171/244 (70%), Gaps = 11/244 (4%)

Query: 1   METTVKIQPQLQSHCISMNTFVSSKIRTLCTY---PFAQHGRTLLVRAAAAGDTTKAAVA 57
           M    KI P+L         F ++ + +  T+   P      T+L    ++G        
Sbjct: 18  MNPISKITPKLALSAPKFRVFCATSLNSTNTHDPIPAPSSTDTVL---RSSGKARARVSD 74

Query: 58  AQMKQNWLDSLTFPPLHVHD-----LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS 112
            Q ++NWL SL+ P    ++     + + ++NA +Q  +GIDPD+SGALA+LK+  +GCS
Sbjct: 75  TQYRENWLASLSCPFPDENERPGIRMDSAESNAGTQCVIGIDPDISGALALLKTGDSGCS 134

Query: 113 AEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSG 172
           A+VFD+PHL +LVGKRVRKRLDAKS++ LLR  DAPIGT AY+EQSIP+PQDGKQGWWSG
Sbjct: 135 AQVFDSPHLQILVGKRVRKRLDAKSIVQLLRGFDAPIGTIAYIEQSIPFPQDGKQGWWSG 194

Query: 173 GFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRK 232
           GFGYGLWIGILVASGFSVVP+PS  WKN + LSG  ++KDDSRRVAST+FPS+ S LKRK
Sbjct: 195 GFGYGLWIGILVASGFSVVPVPSSLWKNEFKLSGNGTSKDDSRRVASTIFPSMSSLLKRK 254

Query: 233 KDHG 236
           KDHG
Sbjct: 255 KDHG 258


>gi|356528579|ref|XP_003532877.1| PREDICTED: uncharacterized protein LOC100775189 [Glycine max]
          Length = 274

 Score =  255 bits (652), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 123/191 (64%), Positives = 151/191 (79%), Gaps = 7/191 (3%)

Query: 53  KAAVAAQMKQNWLDSLTFP-PLHVHDLTANQT----NADSQWALGIDPDLSGALAVLKSD 107
           KA+   Q K+NWL SL++P P   H L+        N  S+W LGIDPD+SGA+A+LK+ 
Sbjct: 52  KASSDTQFKENWLASLSYPFPEKTHLLSGEHEPTHQNDGSKWVLGIDPDVSGAVALLKTH 111

Query: 108 HNGCSAEVFDTPHLPVLVGK--RVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDG 165
            + CSA+VFD+PH+ +LVGK  R R+RLDAKS++ L+R  DAPIGTTA++EQS+PYPQDG
Sbjct: 112 GSVCSAQVFDSPHVKILVGKNKRTRRRLDAKSVVELVRGFDAPIGTTAFIEQSLPYPQDG 171

Query: 166 KQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSL 225
           KQGWWSGGFGYGLWIGILVASGFSV+P+PS TWK  + LSG  +TKDDSRR+ASTLFPS+
Sbjct: 172 KQGWWSGGFGYGLWIGILVASGFSVIPVPSFTWKAKFELSGNGTTKDDSRRIASTLFPSM 231

Query: 226 CSQLKRKKDHG 236
            S L RKKDHG
Sbjct: 232 TSMLSRKKDHG 242


>gi|296083889|emb|CBI24277.3| unnamed protein product [Vitis vinifera]
          Length = 192

 Score =  253 bits (645), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 116/159 (72%), Positives = 139/159 (87%)

Query: 78  LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKS 137
           + + ++NA +Q  +GIDPD+SGALA+LK+  +GCSA+VFD+PHL +LVGKRVRKRLDAKS
Sbjct: 1   MDSAESNAGTQCVIGIDPDISGALALLKTGDSGCSAQVFDSPHLQILVGKRVRKRLDAKS 60

Query: 138 MIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLT 197
           ++ LLR  DAPIGT AY+EQSIP+PQDGKQGWWSGGFGYGLWIGILVASGFSVVP+PS  
Sbjct: 61  IVQLLRGFDAPIGTIAYIEQSIPFPQDGKQGWWSGGFGYGLWIGILVASGFSVVPVPSSL 120

Query: 198 WKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHG 236
           WKN + LSG  ++KDDSRRVAST+FPS+ S LKRKKDHG
Sbjct: 121 WKNEFKLSGNGTSKDDSRRVASTIFPSMSSLLKRKKDHG 159


>gi|30683279|ref|NP_180252.2| uncharacterized protein [Arabidopsis thaliana]
 gi|26452962|dbj|BAC43557.1| unknown protein [Arabidopsis thaliana]
 gi|28973431|gb|AAO64040.1| unknown protein [Arabidopsis thaliana]
 gi|330252802|gb|AEC07896.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 273

 Score =  248 bits (632), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 137/241 (56%), Positives = 166/241 (68%), Gaps = 14/241 (5%)

Query: 1   METTVKIQ---PQLQSHCISMNTFVSSKIRTLCTYPFAQHGRTLLVRAAAAGDTTKAAVA 57
           M TTV  Q   PQ+ S    +  F+S       T PF +  R+    A     TTKA  A
Sbjct: 1   MATTVYGQASLPQMHSLFSKVRPFLSHSPS--FTAPFTRR-RSFPFSALP---TTKAIDA 54

Query: 58  AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS--AEV 115
           A MK+ WLDSL+   L   D      NA+S   +GIDPDLSGALA+LK DH G S  A+V
Sbjct: 55  ALMKEKWLDSLS---LTSQDEDTTPENAESSCIIGIDPDLSGALALLKFDHLGSSSFAQV 111

Query: 116 FDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFG 175
           +DTPH+PVLVGKRVRKRLDAKS++ L++SLD P G+  Y+EQS P+P+DGKQGW+SGGFG
Sbjct: 112 YDTPHIPVLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQGWYSGGFG 171

Query: 176 YGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDH 235
           YGLWIG LVASGF V+P+ +  WK  + L+ G+ TKDDSRRVA+ LFPSL SQLKRKKDH
Sbjct: 172 YGLWIGTLVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAELFPSLSSQLKRKKDH 231

Query: 236 G 236
           G
Sbjct: 232 G 232


>gi|356555307|ref|XP_003545975.1| PREDICTED: uncharacterized protein LOC100790125 [Glycine max]
          Length = 290

 Score =  243 bits (619), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 120/189 (63%), Positives = 144/189 (76%), Gaps = 7/189 (3%)

Query: 55  AVAAQMKQNWLDSLTFP-PLHVHDLTANQT----NADSQWALGIDPDLSGALAVLKSDHN 109
           A   Q K+NWL SL+ P P   H L         N  S+W LGIDPD+SGA+A+LK+  +
Sbjct: 70  ASDTQFKENWLASLSNPFPEKTHLLNGEHEPTHQNDGSKWVLGIDPDVSGAVALLKTHGS 129

Query: 110 GCSAEVFDTPHLPVLVGKR--VRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ 167
            CSA+VFD+PH+ +LVGK    R+RLDAKS++ L+    APIGTTAY+EQS+PYPQDGKQ
Sbjct: 130 VCSAQVFDSPHVKILVGKNKTTRRRLDAKSVVELVCGFRAPIGTTAYIEQSLPYPQDGKQ 189

Query: 168 GWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCS 227
           GWWSGGFGYGLWIGILVASGFSV+P+PS TWK  + LSG  +TKDDSRR+ASTLFPS+ S
Sbjct: 190 GWWSGGFGYGLWIGILVASGFSVIPVPSFTWKAKFKLSGNGTTKDDSRRLASTLFPSMTS 249

Query: 228 QLKRKKDHG 236
            L RKKDHG
Sbjct: 250 MLSRKKDHG 258


>gi|3426042|gb|AAC32241.1| unknown protein [Arabidopsis thaliana]
          Length = 286

 Score =  239 bits (609), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 137/254 (53%), Positives = 166/254 (65%), Gaps = 27/254 (10%)

Query: 1   METTVKIQ---PQLQSHCISMNTFVSSKIRTLCTYPFAQHGRTLLVRAAAAGDTTKAAVA 57
           M TTV  Q   PQ+ S    +  F+S       T PF +  R+    A     TTKA  A
Sbjct: 1   MATTVYGQASLPQMHSLFSKVRPFLSHSPS--FTAPFTRR-RSFPFSALP---TTKAIDA 54

Query: 58  AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS--AEV 115
           A MK+ WLDSL+   L   D      NA+S   +GIDPDLSGALA+LK DH G S  A+V
Sbjct: 55  ALMKEKWLDSLS---LTSQDEDTTPENAESSCIIGIDPDLSGALALLKFDHLGSSSFAQV 111

Query: 116 FDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ-------- 167
           +DTPH+PVLVGKRVRKRLDAKS++ L++SLD P G+  Y+EQS P+P+DGKQ        
Sbjct: 112 YDTPHIPVLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQVYATAHIR 171

Query: 168 -----GWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLF 222
                GW+SGGFGYGLWIG LVASGF V+P+ +  WK  + L+ G+ TKDDSRRVA+ LF
Sbjct: 172 FSSAMGWYSGGFGYGLWIGTLVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAELF 231

Query: 223 PSLCSQLKRKKDHG 236
           PSL SQLKRKKDHG
Sbjct: 232 PSLSSQLKRKKDHG 245


>gi|297825913|ref|XP_002880839.1| hypothetical protein ARALYDRAFT_320472 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326678|gb|EFH57098.1| hypothetical protein ARALYDRAFT_320472 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 287

 Score =  237 bits (605), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 135/255 (52%), Positives = 166/255 (65%), Gaps = 28/255 (10%)

Query: 1   METTVKIQPQL-QSHCISMNTFVSSKIR---TLCTYPFAQHGRTLLVRAAAAGDTTKAAV 56
           M TT   QP L Q H +       SK+R   +L +  F    R     + +A  TTKA  
Sbjct: 1   MATTAYGQPSLPQMHSLF------SKLRPFLSLSSPSFTAPFRRRRDFSFSALPTTKAID 54

Query: 57  AAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS--AE 114
            A MK+ WLDSL+   L   D      NA+S   +GIDPDLSGALA+LK DH G S  A+
Sbjct: 55  VALMKEKWLDSLS---LTSQDEDTTPENAESSCVIGIDPDLSGALALLKFDHLGSSSSAQ 111

Query: 115 VFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ------- 167
           V+DTPH+PVLVGKRVRKRLDAKS++ L++SLD P G+  Y+EQS P+P+DGKQ       
Sbjct: 112 VYDTPHIPVLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQVYATTHI 171

Query: 168 ------GWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTL 221
                 GW+SGGFGYGLWIG LVASGF V+P+ +  WK  + L+ G+ TKDDSRRVA+ L
Sbjct: 172 HFSSVMGWYSGGFGYGLWIGTLVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAEL 231

Query: 222 FPSLCSQLKRKKDHG 236
           FPSL +QLKRKKDHG
Sbjct: 232 FPSLSTQLKRKKDHG 246


>gi|224105997|ref|XP_002314007.1| predicted protein [Populus trichocarpa]
 gi|222850415|gb|EEE87962.1| predicted protein [Populus trichocarpa]
          Length = 235

 Score =  231 bits (589), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 112/167 (67%), Positives = 131/167 (78%), Gaps = 10/167 (5%)

Query: 53  KAAVAAQMKQNWLDSLTFP--------PLHVHDLTANQTNADSQWALGIDPDLSGALAVL 104
           K A  AQ+KQNWLDSLTFP         L   DL  N  N  S W +G+DPD+SGALA+L
Sbjct: 67  KVADVAQLKQNWLDSLTFPLPNETENTNLGGDDLARN--NVGSNWVIGVDPDVSGALALL 124

Query: 105 KSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQD 164
           K D +GCSA+VFD+PHL V+VGK +RKRLD KS++ L+RS DAPIGTTAYVEQS P+PQD
Sbjct: 125 KIDESGCSAQVFDSPHLKVMVGKGIRKRLDVKSIVQLIRSFDAPIGTTAYVEQSTPFPQD 184

Query: 165 GKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTK 211
           GKQGWWSGGFGYGLWIG+LVASGFSVVP+PS+TWK+   L+GG  TK
Sbjct: 185 GKQGWWSGGFGYGLWIGVLVASGFSVVPVPSMTWKSDLELAGGRCTK 231


>gi|359479130|ref|XP_003632222.1| PREDICTED: uncharacterized protein LOC100242692 isoform 2 [Vitis
           vinifera]
          Length = 261

 Score =  222 bits (565), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 117/242 (48%), Positives = 158/242 (65%), Gaps = 16/242 (6%)

Query: 1   METTVKIQPQLQSHCISMNTFVSSKIRTLCTY---PFAQHGRTLLVRAAAAGDTTKAAVA 57
           M    KI P+L         F ++ + +  T+   P      T+L    ++G        
Sbjct: 18  MNPISKITPKLALSAPKFRVFCATSLNSTNTHDPIPAPSSTDTVL---RSSGKARARVSD 74

Query: 58  AQMKQNWLDSLTFPPLHVHD-----LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS 112
            Q ++NWL SL+ P    ++     + + ++NA +Q  +GIDPD+SGALA+LK+  +GCS
Sbjct: 75  TQYRENWLASLSCPFPDENERPGIRMDSAESNAGTQCVIGIDPDISGALALLKTGDSGCS 134

Query: 113 AEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSG 172
           A+VFD+PHL +LVGKRVRKRLDAKS++ LLR  DAPIGT AY+EQSIP+PQDGKQGWWSG
Sbjct: 135 AQVFDSPHLQILVGKRVRKRLDAKSIVQLLRGFDAPIGTIAYIEQSIPFPQDGKQGWWSG 194

Query: 173 GFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRK 232
           GFGYGLWIGILVASGFSVVP+PS  WKN + LSG  ++K     V  ++FP +   + R+
Sbjct: 195 GFGYGLWIGILVASGFSVVPVPSSLWKNEFKLSGNGTSK-----VFLSIFPLVFPHIFRE 249

Query: 233 KD 234
            +
Sbjct: 250 PN 251


>gi|334184493|ref|NP_001189612.1| uncharacterized protein [Arabidopsis thaliana]
 gi|330252803|gb|AEC07897.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 326

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 113/241 (46%), Positives = 140/241 (58%), Gaps = 58/241 (24%)

Query: 51  TTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPD-------------- 96
           TTKA  AA MK+ WLDSL+   L   D      NA+S   +GIDPD              
Sbjct: 48  TTKAIDAALMKEKWLDSLS---LTSQDEDTTPENAESSCIIGIDPDLSGALALLKFDHLG 104

Query: 97  -----------------------------LSGALAVLKSDHNG---CSA---------EV 115
                                        L G+  + + +      C +         EV
Sbjct: 105 SSSFAQNQTYYQSWLCFEHLVLMLRNYLSLQGSYVICEKNRECEGFCESFMLLCLVQFEV 164

Query: 116 FDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFG 175
           +DTPH+PVLVGKRVRKRLDAKS++ L++SLD P G+  Y+EQS P+P+DGKQGW+SGGFG
Sbjct: 165 YDTPHIPVLVGKRVRKRLDAKSIVQLIQSLDVPSGSRVYIEQSNPFPKDGKQGWYSGGFG 224

Query: 176 YGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDH 235
           YGLWIG LVASGF V+P+ +  WK  + L+ G+ TKDDSRRVA+ LFPSL SQLKRKKDH
Sbjct: 225 YGLWIGTLVASGFCVIPVSASLWKRHFQLASGSCTKDDSRRVAAELFPSLSSQLKRKKDH 284

Query: 236 G 236
           G
Sbjct: 285 G 285


>gi|449444851|ref|XP_004140187.1| PREDICTED: uncharacterized protein LOC101204284 [Cucumis sativus]
 gi|449481000|ref|XP_004156052.1| PREDICTED: uncharacterized LOC101204284 [Cucumis sativus]
          Length = 273

 Score =  200 bits (509), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 131/245 (53%), Positives = 167/245 (68%), Gaps = 19/245 (7%)

Query: 11  LQSHCIS-MNTFVSSKI-------RTLCT-------YPFAQHGRTLLVRAAAAGDTTKAA 55
           LQSH  S MN+  SSK+       R LCT               +  VR  ++G    A 
Sbjct: 6   LQSHSNSFMNSLPSSKLKLKLHHFRFLCTSSFSSICSSEISTISSSSVRKDSSGKAKLAI 65

Query: 56  VAAQMKQNWLDSLTFPPLHVHDLTANQT----NADSQWALGIDPDLSGALAVLKSDHNGC 111
             AQ+K NWL SL+ P    HD ++N +    NA S+  +G+DPD+SGA+A+L++D +  
Sbjct: 66  AHAQLKDNWLASLSCPFPLGHDYSSNSSSPDRNAASECVIGVDPDVSGAVALLRTDESIS 125

Query: 112 SAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWS 171
           SA+V+D+PH+ +LVG R RKRLDAKS++ LL S +APIGTTAY+EQS P+P+DGKQGWW 
Sbjct: 126 SAQVYDSPHVQILVGGRKRKRLDAKSIVQLLHSFNAPIGTTAYLEQSNPFPKDGKQGWWG 185

Query: 172 GGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKR 231
           GGFGYGLWIG+LV  GFSVVP+P L WKN + LSG  ++KDDSRR+AS LFPSL   LKR
Sbjct: 186 GGFGYGLWIGVLVGLGFSVVPVPPLAWKNKFELSGKDTSKDDSRRIASELFPSLTPLLKR 245

Query: 232 KKDHG 236
           KKDHG
Sbjct: 246 KKDHG 250


>gi|242052573|ref|XP_002455432.1| hypothetical protein SORBIDRAFT_03g010690 [Sorghum bicolor]
 gi|241927407|gb|EES00552.1| hypothetical protein SORBIDRAFT_03g010690 [Sorghum bicolor]
          Length = 282

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 105/202 (51%), Positives = 127/202 (62%), Gaps = 7/202 (3%)

Query: 35  AQHGRTLLVRAAAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGID 94
           A   RT   RA+A     K    A+ +  WL SL+  P    D  A    A + WA+G+D
Sbjct: 62  AATARTPRSRASAKA-RAKLLAEAEPRDPWLASLSLLPT---DDVAGADAAPNGWAIGVD 117

Query: 95  PDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAY 154
           PD  GA+AVL  D    S++VFD P + ++V + +RKRLD KS+I LLR LD P GTTAY
Sbjct: 118 PDTRGAIAVLSPD---GSSQVFDNPFVHIVVSEVIRKRLDTKSIIQLLRGLDVPPGTTAY 174

Query: 155 VEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDS 214
           +E+S P+P DGK GWWS GF YGLWI  LVASGFSVVPI S TWK  +GLS   + KDDS
Sbjct: 175 IEKSSPFPTDGKLGWWSTGFSYGLWIASLVASGFSVVPIASQTWKAHFGLSRSETPKDDS 234

Query: 215 RRVASTLFPSLCSQLKRKKDHG 236
           R+ AS LFP     LK KK HG
Sbjct: 235 RQAASILFPDKAQSLKLKKHHG 256


>gi|212723270|ref|NP_001131429.1| uncharacterized protein LOC100192759 [Zea mays]
 gi|194691492|gb|ACF79830.1| unknown [Zea mays]
 gi|195609328|gb|ACG26494.1| hypothetical protein [Zea mays]
 gi|414877006|tpg|DAA54137.1| TPA: hypothetical protein ZEAMMB73_409152 [Zea mays]
          Length = 280

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 99/191 (51%), Positives = 122/191 (63%), Gaps = 6/191 (3%)

Query: 46  AAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLK 105
           A+A    K     + +  WL SL+  P          TN    W +G+DPD+ GA+AVL 
Sbjct: 70  ASAKARAKLLAEVEARDPWLASLSLLPTDDIPSADATTNG---WVIGVDPDIGGAIAVLS 126

Query: 106 SDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDG 165
            D    S++VFD P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DG
Sbjct: 127 PD---GSSQVFDNPFVHIVVSEVIRKRLDTKSIIQLLRGLDAPPGTTAYIEKSSPFPTDG 183

Query: 166 KQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSL 225
           KQGWWS GF YGLWI  LVASGFSVVPI S TWK ++GL    + KDDSR+ AS LFP  
Sbjct: 184 KQGWWSTGFSYGLWIASLVASGFSVVPIASQTWKAYFGLMRSETPKDDSRQAASILFPDK 243

Query: 226 CSQLKRKKDHG 236
              LK KK HG
Sbjct: 244 DQSLKLKKHHG 254


>gi|218187952|gb|EEC70379.1| hypothetical protein OsI_01327 [Oryza sativa Indica Group]
          Length = 287

 Score =  186 bits (471), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 120/179 (67%), Gaps = 3/179 (1%)

Query: 58  AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFD 117
           A  +  WL SL+  P+   +  A    A + WA+G+DPD  GA+AVL  D    S++VFD
Sbjct: 87  ADTRDPWLASLSLLPVDDGEAAAAAAAASTGWAIGVDPDTGGAIAVLSPD---GSSQVFD 143

Query: 118 TPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYG 177
            P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DGK GWWS GF YG
Sbjct: 144 NPFVHIVVSEFIRKRLDTKSIIELLRGLDAPPGTTAYIEKSSPFPTDGKLGWWSTGFSYG 203

Query: 178 LWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHG 236
           LWI  LVASGFSVVP+ S TWK ++GL+   S KD SR+ AS LFP     L RKK HG
Sbjct: 204 LWIAALVASGFSVVPVASQTWKAYFGLTRSGSPKDYSRQAASILFPDKALSLNRKKHHG 262


>gi|215692956|dbj|BAG88376.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 120/179 (67%), Gaps = 3/179 (1%)

Query: 58  AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFD 117
           A  +  WL SL+  P+   +  A    A + WA+G+DPD  GA+AVL  D    S++VFD
Sbjct: 67  ADTRDPWLASLSLLPVDDGEAAAAAAAASTGWAIGVDPDTGGAIAVLSPD---GSSQVFD 123

Query: 118 TPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYG 177
            P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DGK GWWS GF YG
Sbjct: 124 NPFVHIVVSEFIRKRLDTKSIIELLRGLDAPPGTTAYIEKSSPFPTDGKLGWWSTGFSYG 183

Query: 178 LWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHG 236
           LWI  LVASGFSVVP+ S TWK ++GL+   S KD SR+ AS LFP     L RKK HG
Sbjct: 184 LWIAALVASGFSVVPVASQTWKAYFGLTRSGSPKDYSRQAASILFPDKALSLNRKKHHG 242


>gi|302767670|ref|XP_002967255.1| hypothetical protein SELMODRAFT_67545 [Selaginella moellendorffii]
 gi|300165246|gb|EFJ31854.1| hypothetical protein SELMODRAFT_67545 [Selaginella moellendorffii]
          Length = 193

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 95/185 (51%), Positives = 125/185 (67%), Gaps = 6/185 (3%)

Query: 52  TKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGC 111
           +K  +A   ++NWLDSLT P     D+ A   NA   + +GIDPD  GALAVL+ + +  
Sbjct: 1   SKKVIAEISRKNWLDSLTLPLGGEGDVDAELDNA---FVIGIDPDFKGALAVLRLEDDRF 57

Query: 112 SAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWS 171
             EVFD P L V +G  +R R DAK+++ LL++++AP G+ AYVE S+P+  DGKQ WWS
Sbjct: 58  IPEVFDVPTLKVAIGGTMRLRPDAKAIVNLLKTINAPHGSVAYVENSLPHSTDGKQSWWS 117

Query: 172 GGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKR 231
            GFG+G+W G L+ASG SVVP+    WK   GLS   S+KD+SR VAS+ FPS+ SQL R
Sbjct: 118 SGFGFGMWTGTLLASGISVVPVSPRAWKTAMGLS---SSKDESRLVASSCFPSIASQLTR 174

Query: 232 KKDHG 236
           KKDHG
Sbjct: 175 KKDHG 179


>gi|115435874|ref|NP_001042695.1| Os01g0268900 [Oryza sativa Japonica Group]
 gi|56783734|dbj|BAD81146.1| unknown protein [Oryza sativa Japonica Group]
 gi|56783755|dbj|BAD81167.1| unknown protein [Oryza sativa Japonica Group]
 gi|113532226|dbj|BAF04609.1| Os01g0268900 [Oryza sativa Japonica Group]
 gi|222618167|gb|EEE54299.1| hypothetical protein OsJ_01234 [Oryza sativa Japonica Group]
          Length = 287

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 120/179 (67%), Gaps = 3/179 (1%)

Query: 58  AQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFD 117
           A  +  WL SL+  P+   +  A    A + WA+G+DPD  GA+AVL  D    S++VFD
Sbjct: 87  ADTRDPWLASLSLLPVDDGEAAAAAAAASTGWAIGVDPDTGGAIAVLSPD---GSSQVFD 143

Query: 118 TPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYG 177
            P + ++V + +RKRLD KS+I LLR LDAP GTTAY+E+S P+P DGK GWWS GF YG
Sbjct: 144 NPFVHIVVSEFIRKRLDTKSIIELLRGLDAPPGTTAYIEKSSPFPTDGKLGWWSTGFSYG 203

Query: 178 LWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHG 236
           LWI  LVASGFSVVP+ S TWK ++GL+   S KD SR+ AS LFP     L RKK HG
Sbjct: 204 LWIAALVASGFSVVPVASQTWKAYFGLTRSGSPKDYSRQAASILFPDKALSLNRKKHHG 262


>gi|357130450|ref|XP_003566861.1| PREDICTED: uncharacterized protein LOC100835936 [Brachypodium
           distachyon]
          Length = 285

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 96/173 (55%), Positives = 116/173 (67%), Gaps = 7/173 (4%)

Query: 64  WLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPV 123
           WL SL+  P            A + WALGIDPD  GA+AVL  D    S++VFD P + +
Sbjct: 81  WLASLSLLPAD----DGEAAPAPTGWALGIDPDTGGAIAVLSPD---GSSQVFDNPFVHI 133

Query: 124 LVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGIL 183
           +V +  RKRLD KS+I LLR LDAP+GTTAY+E+S P+P DGK GWWS GF YGLWI  L
Sbjct: 134 VVSEFTRKRLDTKSIIQLLRGLDAPLGTTAYIEKSSPFPTDGKLGWWSTGFSYGLWIAAL 193

Query: 184 VASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHG 236
           VASGFSVVP+ S TWK ++GL+   S KDDSR+ A+ LFP     LK KK HG
Sbjct: 194 VASGFSVVPVASQTWKAYFGLTRSKSRKDDSRQAATILFPDKALSLKLKKHHG 246


>gi|302754086|ref|XP_002960467.1| hypothetical protein SELMODRAFT_71406 [Selaginella moellendorffii]
 gi|300171406|gb|EFJ38006.1| hypothetical protein SELMODRAFT_71406 [Selaginella moellendorffii]
          Length = 193

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 94/185 (50%), Positives = 124/185 (67%), Gaps = 6/185 (3%)

Query: 52  TKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGC 111
           +K  +A   ++NWLDSLT P     D+ A   NA     +GIDPD  GALAVL+ + +  
Sbjct: 1   SKKVIAEISRKNWLDSLTLPLGDEGDVDAELDNA---LVIGIDPDFKGALAVLRLEDDRF 57

Query: 112 SAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWS 171
             EVFD P L V +G  +R R DAK+++ LL++++AP G+ AYVE S+P+  DGKQ WWS
Sbjct: 58  IPEVFDVPTLKVAIGGTMRLRPDAKAIVNLLKTINAPHGSVAYVENSLPHSTDGKQSWWS 117

Query: 172 GGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKR 231
            GFG+G+W G L+ASG SVVP+    WK   GLS   S+KD+SR VAS+ FP++ SQL R
Sbjct: 118 SGFGFGMWTGTLLASGISVVPVSPRAWKTAMGLS---SSKDESRLVASSCFPAIASQLTR 174

Query: 232 KKDHG 236
           KKDHG
Sbjct: 175 KKDHG 179


>gi|168017937|ref|XP_001761503.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687187|gb|EDQ73571.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 191

 Score =  170 bits (430), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 80/146 (54%), Positives = 106/146 (72%), Gaps = 4/146 (2%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
           LGIDPD SGA+AVL+      + EV D P + + VGK +R+R DA+S++ L+  ++AP G
Sbjct: 16  LGIDPDASGAIAVLRDG----TQEVLDVPCVKIQVGKTMRRRHDARSIVDLVNKINAPEG 71

Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
           + AYVEQ++P+P DGKQGW+  GFGYG+W+GIL+A GF VVP+ +  WK+  G++G   T
Sbjct: 72  SVAYVEQAMPFPMDGKQGWYGCGFGYGMWVGILMALGFKVVPVRAQVWKSAMGIAGKQYT 131

Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHG 236
           KDDSR  A  LFP L  QLKRKKDHG
Sbjct: 132 KDDSRATAMALFPDLGPQLKRKKDHG 157


>gi|297815494|ref|XP_002875630.1| hypothetical protein ARALYDRAFT_484823 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321468|gb|EFH51889.1| hypothetical protein ARALYDRAFT_484823 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 199

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 81/190 (42%), Positives = 110/190 (57%), Gaps = 5/190 (2%)

Query: 60  MKQNWLDSLTFPPLHVHDLTANQTNADSQWALGIDPDLSGALAVLKSDHNG-CSAEVFDT 118
           MKQ  L+ L++   +  +  A Q N    W +GIDP LSGALAVLK D  G CSA+V+DT
Sbjct: 1   MKQELLNPLSYSSPNYDE--AIQENFKPHWIIGIDPTLSGALAVLKFDDKGSCSAQVYDT 58

Query: 119 PHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGL 178
           P   V+V    R R   KSM+ L+R+LD P GT A+V + I Y  D     ++ G GYGL
Sbjct: 59  PRRQVVVQNIPRSRFKEKSMLELIRTLDVPSGTKAFVAKMILYEDDNIMAAYNAGLGYGL 118

Query: 179 WIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGTI 238
           W GIL+ S  SV  +   TWK  + LS    ++D  R++A  +FPSL SQL R++D+   
Sbjct: 119 WTGILLTSNISVSSVAPSTWKKHFKLS--IYSRDGGRKLALKMFPSLSSQLTRRRDYARA 176

Query: 239 SSPYCILYHH 248
           ++     Y H
Sbjct: 177 NALLIAAYGH 186


>gi|147856114|emb|CAN82440.1| hypothetical protein VITISV_006122 [Vitis vinifera]
          Length = 1030

 Score =  145 bits (366), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 79/175 (45%), Positives = 109/175 (62%), Gaps = 11/175 (6%)

Query: 1   METTVKIQPQLQSHCISMNTFVSSKIRTLCTY---PFAQHGRTLLVRAAAAGDTTKAAVA 57
           M    KI P+L         F ++ + +  T+   P      T+L    ++G        
Sbjct: 18  MNPISKITPKLALSAPKFRVFCATSLNSTNTHDPIPAPSSTGTVL---RSSGKARVRVSD 74

Query: 58  AQMKQNWLDSLTFPPLHVHD-----LTANQTNADSQWALGIDPDLSGALAVLKSDHNGCS 112
            Q ++NWL SL+ P    ++     + + ++NA +Q  +GIDPD+SGALA+LK+  +GCS
Sbjct: 75  TQYRENWLASLSCPFPDENERPGIRMDSAESNAGTQCVIGIDPDISGALALLKTGDSGCS 134

Query: 113 AEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQ 167
           A+VFD+PHL +LVGKRVRKRLDAKS++ LLR  DAPIGT AY+EQSIP PQDGKQ
Sbjct: 135 AQVFDSPHLQILVGKRVRKRLDAKSIVQLLRGFDAPIGTIAYIEQSIPXPQDGKQ 189


>gi|255078906|ref|XP_002503033.1| predicted protein [Micromonas sp. RCC299]
 gi|226518299|gb|ACO64291.1| predicted protein [Micromonas sp. RCC299]
          Length = 386

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 63/161 (39%), Positives = 88/161 (54%), Gaps = 2/161 (1%)

Query: 91  LGIDPDLSGALAVLK--SDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAP 148
           LG D D  GALAV++  S  N  S +V D P     V  R+RKRL  + M+ L+ SL+ P
Sbjct: 78  LGCDSDAGGALAVVRGPSLGNVASVDVHDCPTYKAEVNGRIRKRLCVEDMVELVASLNIP 137

Query: 149 IGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
            GT A++E+         Q  +  G+ +GLW G+L A+G  V  +    WK+  GL+   
Sbjct: 138 RGTVAFLEEGGVEYGFSAQTAFVQGYNFGLWKGVLAAAGLRVEVVKPQAWKHALGLARSG 197

Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHHL 249
           S+KDDSR +AS +FP +   L RKKDHG   +     Y H+
Sbjct: 198 SSKDDSRDMASAMFPEVGDDLGRKKDHGRAEALLIAAYGHM 238


>gi|303278846|ref|XP_003058716.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459876|gb|EEH57171.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 409

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/182 (36%), Positives = 92/182 (50%), Gaps = 5/182 (2%)

Query: 73  LHVHDLTANQTNADSQWAL-GIDPDLSGALAVLKSDHNGC--SAEVFDTPHLPVLVGKRV 129
           +   +    +  A + W L G D D  GALAV++    G   S +V D P + V V  R 
Sbjct: 84  VAARERVVEEERAPNGWFLVGCDSDAGGALAVVRGPSTGVITSVDVIDAPTMKVEVNGRA 143

Query: 130 RKRLDAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS 189
           R RLD  +M   +R L  P GT A+VE+         Q  +  G+ +GLW G+L ++G +
Sbjct: 144 RVRLDVDAMCAAVRDLRLPPGTVAHVEEGGVEYGFSAQTAFVQGYNFGLWKGVLASAGLT 203

Query: 190 VVPIPSLTWKNWYGLS--GGTSTKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYH 247
           VV +    WK   GL+  G   +KD+SR +A  LFP +   LKRKKDHG   S     Y 
Sbjct: 204 VVVVKPQAWKWALGLARKGAGGSKDESRAMAKALFPEVEDALKRKKDHGRAESLLIAAYG 263

Query: 248 HL 249
           H+
Sbjct: 264 HM 265


>gi|308806632|ref|XP_003080627.1| unnamed protein product [Ostreococcus tauri]
 gi|116059088|emb|CAL54795.1| unnamed protein product [Ostreococcus tauri]
          Length = 312

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 63/161 (39%), Positives = 81/161 (50%), Gaps = 2/161 (1%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVF--DTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAP 148
           +G D D +GAL VL     G    V   D P    LV  R R RLD ++M  ++RSL  P
Sbjct: 67  VGCDSDGNGALCVLTGPSVGAIERVIVRDNPTRTELVNGRARPRLDPEAMTSVVRSLGVP 126

Query: 149 IGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
            GTT Y+E+         Q  +  G+ +GLW G+  A+   V  +    WK   GL    
Sbjct: 127 KGTTVYLEEGGVEFGFSAQTAFVQGYNFGLWRGVFAAAELEVRVVKPQAWKAAMGLKSKE 186

Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHHL 249
           STKDDS  +A TLFP +  +LKRKKDHG   S     Y H+
Sbjct: 187 STKDDSIAMAKTLFPEIAHELKRKKDHGRAESLLIAAYGHV 227


>gi|145349281|ref|XP_001419065.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579296|gb|ABO97358.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 195

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 83/176 (47%), Gaps = 2/176 (1%)

Query: 76  HDLTANQTNADSQWALGIDPDLSGALAVLKSDHNGC--SAEVFDTPHLPVLVGKRVRKRL 133
           +D  A   + D    +G D D  GAL V+     G   S  V D P   V V  R R RL
Sbjct: 3   NDADAEARSRDDWIVVGCDSDAHGALCVISGASVGSVRSVNVRDNPTSTVTVNGRARLRL 62

Query: 134 DAKSMIMLLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPI 193
           D  +M   +R+L  P GT  Y+E+         Q  +  G+ +GLW G+L A  F V  +
Sbjct: 63  DPDAMRGAVRALGVPKGTRVYLEEGGVEFGFSAQTAFVQGYNFGLWRGVLAAEEFEVRVV 122

Query: 194 PSLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHHL 249
               WK   GL    STKDDS  +A  LFP +  +LKRKKDHG   S     Y H+
Sbjct: 123 KPQAWKAAMGLKFKGSTKDDSIAMAKMLFPEIAGELKRKKDHGRAESLLIAAYGHV 178


>gi|15229814|ref|NP_189977.1| uncharacterized protein [Arabidopsis thaliana]
 gi|7594552|emb|CAB88119.1| putative protein [Arabidopsis thaliana]
 gi|332644320|gb|AEE77841.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 195

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 62/145 (42%), Positives = 86/145 (59%), Gaps = 7/145 (4%)

Query: 82  QTNADSQWALGIDPDLSGALAVLKSDHNG-CSAEVFDTPHLPVLVGKRVRKRLDAKSMIM 140
           Q N    W +GIDP+LSGALAVLK D  G C A+V+DTP L V+V     +  + KSM+ 
Sbjct: 21  QENFKPHWIIGIDPNLSGALAVLKFDDKGSCFAQVYDTPQLEVVVQNIRTRSFNEKSMLE 80

Query: 141 LLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKN 200
           L+RSLD P GT A+V + I +P++     ++ G G GLW   L+ S  SV+ +   TW  
Sbjct: 81  LIRSLDVPSGTKAFVAKYI-HPENAITA-YNDGLGCGLW--TLLTSSISVIYVTPSTWDK 136

Query: 201 WYGLSGGTSTKDDSRRVASTLFPSL 225
            + LS    + D  R++A  +FPSL
Sbjct: 137 HFNLS--IWSLDGGRKLALEMFPSL 159


>gi|412991560|emb|CCO16405.1| predicted protein [Bathycoccus prasinos]
          Length = 410

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/174 (33%), Positives = 83/174 (47%), Gaps = 13/174 (7%)

Query: 88  QWAL-GIDPDLSGALAVLKSDHNG--CSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRS 144
           +WA+ G D D  GALA+++ D  G   + E+ D P + V V  + R RLD   MI  ++ 
Sbjct: 124 KWAVVGCDSDAGGALAIIRGDFAGDVQTVEIVDVPTMTVTVNGKKRIRLDVAKMIEKVKE 183

Query: 145 LDAPI------GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVA----SGFSVVPIP 194
           LD P        +  +VE+         Q  +  G+ +GLW G+L +        V  + 
Sbjct: 184 LDLPSDGNGGHSSVVFVEEGGVEFGFSAQTAFVQGYNFGLWKGVLESCVNPETTRVEVVK 243

Query: 195 SLTWKNWYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHH 248
              WK   GLSG  ++KDDS  +A  +FP     L+RKKDHG   S     Y H
Sbjct: 244 PQAWKLALGLSGKKTSKDDSIEMAKMVFPESEDMLQRKKDHGRAESLLIAAYGH 297


>gi|374623246|ref|ZP_09695760.1| crossover junction endodeoxyribonuclease [Ectothiorhodospira sp.
           PHS-1]
 gi|373942361|gb|EHQ52906.1| crossover junction endodeoxyribonuclease [Ectothiorhodospira sp.
           PHS-1]
          Length = 159

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/147 (38%), Positives = 73/147 (49%), Gaps = 11/147 (7%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLV--GKRVRKRLDAKSMIMLLRSLDAP 148
           LGIDP L+GA+A L SDH     E+ +   LP +     RVR+ LDA  +  LLR +   
Sbjct: 3   LGIDPGLTGAVAALASDHR---VELLED--LPTVARGSGRVRRELDAAGLAHLLRPIAGD 57

Query: 149 IGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
           +   A VEQ    P  G    +S G   G   G+L A G  +V +P   WK    +  G 
Sbjct: 58  V-RLAVVEQVASRPGQGVASMFSLGHTAGAICGVLAALGIPLVLVPPAAWKKALNVPTG- 115

Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDH 235
           S KD SR +A+   P L   L RK+DH
Sbjct: 116 SDKDMSRTIAARFHPDL--NLSRKRDH 140


>gi|340779655|ref|ZP_08699598.1| hypothetical protein AaceN1_17488 [Acetobacter aceti NBRC 14818]
          Length = 157

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 73/144 (50%), Gaps = 8/144 (5%)

Query: 93  IDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIGTT 152
           IDP  SGA+A+L SD  G   E+FD P + V V K+ R  + A  +  +LR+ D      
Sbjct: 6   IDPGNSGAIAIL-SDA-GDLIEIFDMPTVAVTVAKKQRNIVSAPLLATMLRAHDP---AE 60

Query: 153 AYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKD 212
            ++E+    P +G  G ++ G G G+  G+  A G    P+ ++T   W   +G  + K 
Sbjct: 61  IWLEKVSTRPGEGAVGAFAFGRGVGVIEGVAAALG---KPLSTVTPAVWKKATGCPADKG 117

Query: 213 DSRRVASTLFPSLCSQLKRKKDHG 236
            +R  A  LFP+   Q KR KD G
Sbjct: 118 GARARAMQLFPTAADQFKRVKDDG 141


>gi|262277495|ref|ZP_06055288.1| crossover junction endodeoxyribonuclease [alpha proteobacterium
           HIMB114]
 gi|262224598|gb|EEY75057.1| crossover junction endodeoxyribonuclease [alpha proteobacterium
           HIMB114]
          Length = 165

 Score = 63.5 bits (153), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 8/145 (5%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRS-LDAPI 149
            GIDP +SGAL++ +   N    EV D P +    GK+ +K+++A  + + ++  ++   
Sbjct: 4   FGIDPGVSGALSIYE---NKKLKEVIDMPTMSE--GKKNKKQINAAQLSLEIKQRINGSS 58

Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
            T   +E     P  G    ++ G  +G+  GI  A    +  +  + WK  + L G  S
Sbjct: 59  ETAVVIEHVTAMPGQGVTSMFNFGQSFGVIKGICSALTLPIYFVRPVKWKKHFNLIG--S 116

Query: 210 TKDDSRRVASTLFPSLCSQLKRKKD 234
           +KD SR  A  LFPS  + + +KKD
Sbjct: 117 SKDASRTKAIELFPSFSNDMAKKKD 141


>gi|27379639|ref|NP_771168.1| hypothetical protein blr4528 [Bradyrhizobium japonicum USDA 110]
 gi|27352791|dbj|BAC49793.1| blr4528 [Bradyrhizobium japonicum USDA 110]
          Length = 158

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 71/156 (45%), Gaps = 15/156 (9%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
           LGIDP  SGA+A     H     E      +P+  G+     +DA   + L R + A   
Sbjct: 6   LGIDPGASGAIAFYFPSHP----ERVAVEDMPIAAGE-----VDA---VNLARRVAAMAP 53

Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
              ++E+    P  G    +  G  YG+ +G++   G + +P+  +T   W G    +S 
Sbjct: 54  DLVFLERVNAMPGQGVSSTFKFGRAYGVVLGVI---GAAALPLHLVTPAKWKGHLRLSSD 110

Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILY 246
           K+++R +A  LFP+     KRKKDHG   +     Y
Sbjct: 111 KEEARALALRLFPACADHFKRKKDHGRAEAALIARY 146


>gi|384254147|gb|EIE27621.1| hypothetical protein COCSUDRAFT_52223 [Coccomyxa subellipsoidea
           C-169]
          Length = 131

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 45/80 (56%), Gaps = 4/80 (5%)

Query: 159 IPYPQ--DGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKDDSRR 216
           +P P   +GK  W++ G  YG+W G L + GF V  + S  WK    L+G    K+ SR 
Sbjct: 1   MPQPNRLNGKWSWYACGHSYGVWHGALASQGFKVETVTSRAWKIDLQLNG--KGKEGSRL 58

Query: 217 VASTLFPSLCSQLKRKKDHG 236
           +A  LFPS    LKRKKDHG
Sbjct: 59  LAIRLFPSAAELLKRKKDHG 78


>gi|330814399|ref|YP_004358638.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           sp. IMCC9063]
 gi|327487494|gb|AEA81899.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           sp. IMCC9063]
          Length = 165

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 70/146 (47%), Gaps = 8/146 (5%)

Query: 90  ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPI 149
             GIDP +SGA++V +          FD P +  + GK+ +++++A  +   ++S     
Sbjct: 3   VFGIDPGVSGAISVFEDKK---ITTFFDMPTM--IEGKKNKRQVNAAQLAHEIKSRVVDR 57

Query: 150 GTTAY-VEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGT 208
             +A  +EQ    P  G    ++ G  +G+  GI  A G  +  I  + WK  + L G  
Sbjct: 58  DNSAVLIEQVGAMPGQGVTSMFNFGQSFGVIKGICSALGIPIYFIRPVKWKKHFNLIG-- 115

Query: 209 STKDDSRRVASTLFPSLCSQLKRKKD 234
           + KD SR     +FP L  QL +KKD
Sbjct: 116 TEKDASRTKVIEMFPDLSPQLTKKKD 141


>gi|307107556|gb|EFN55798.1| hypothetical protein CHLNCDRAFT_57716 [Chlorella variabilis]
          Length = 361

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 43/84 (51%), Gaps = 2/84 (2%)

Query: 153 AYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTSTKD 212
           A VE S+P    GK  W+  GF  GL  G+L A G     +P+ TWK   GL      KD
Sbjct: 233 AVVEYSMPGHVCGKYAWYGIGFAVGLLNGLLAAQGIPYHRVPASTWKQAMGLK--KQGKD 290

Query: 213 DSRRVASTLFPSLCSQLKRKKDHG 236
            S  +A  L P+    LKRKKDHG
Sbjct: 291 GSIALARHLLPTASIFLKRKKDHG 314


>gi|371496244|ref|YP_004957392.1| unnamed protein product [Planktothrix phage PaV-LD]
 gi|325587611|gb|ADZ31626.1| crossover junction endodeoxyribonuclease [Planktothrix phage
           PaV-LD]
          Length = 159

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 70/156 (44%), Gaps = 17/156 (10%)

Query: 84  NADSQWALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDA---KSMIM 140
           N  ++  +GIDP  +GA+  +    NG   +  D P + +    R    L A   K MI 
Sbjct: 3   NIMTKTFIGIDPGATGAVCRIS---NG-EVKFLDCPIIKIGSKIRPNPTLMASGLKEMIT 58

Query: 141 LLRSLDAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKN 200
           L         T   +E+    P+ G    ++ G G+G+WIGI+ A G  +  +    WK 
Sbjct: 59  L--------NTHLIIEKVHAMPKQGVSSTFNFGMGFGIWIGIIAALGIPMEFVTPQAWKK 110

Query: 201 WYGLSGGTSTKDDSRRVASTLFPSLCSQLKRKKDHG 236
            Y L G    K  SR +A  +FP   + LK +K HG
Sbjct: 111 HYSLIG--KDKKASRVMALQIFPGETNNLKLEKHHG 144


>gi|71083298|ref|YP_266017.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           ubique HTCC1062]
 gi|71062411|gb|AAZ21414.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           ubique HTCC1062]
          Length = 164

 Score = 56.6 bits (135), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 10/147 (6%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
           +GIDP ++G++       +G   +V + P++P   GK+ +K+++   +   + S    I 
Sbjct: 4   IGIDPGITGSICFF---QDGKITDVIEMPNMPE--GKKNKKQVNGAQIYYEISSRIKDIK 58

Query: 151 TT---AYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGG 207
                  +EQ    P  G    ++ G  +G+  GI  A    +  +    WK ++ L   
Sbjct: 59  KEDIKVIIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLPMYFVRPTKWKKYFNLI-- 116

Query: 208 TSTKDDSRRVASTLFPSLCSQLKRKKD 234
            S KD SR  A  +FP   S+L RKKD
Sbjct: 117 NSEKDASRTKAIEVFPYFSSELSRKKD 143


>gi|91762272|ref|ZP_01264237.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           ubique HTCC1002]
 gi|91718074|gb|EAS84724.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           ubique HTCC1002]
          Length = 164

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 10/147 (6%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPI- 149
           +GIDP ++G++       +G   +V + P++P   GK+ +K+++   +   + S    I 
Sbjct: 4   IGIDPGITGSICFF---QDGKIIDVIEMPNMPE--GKKNKKQVNGAQIYYEISSKIKDIK 58

Query: 150 --GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGG 207
                  +EQ    P  G    ++ G  +G+  GI  A    +  +    WK ++ L   
Sbjct: 59  KEDIKVIIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLPMYFVRPTKWKKYFNLI-- 116

Query: 208 TSTKDDSRRVASTLFPSLCSQLKRKKD 234
            S KD SR  A  +FP   S+L RKKD
Sbjct: 117 NSEKDASRTKAIEVFPYFSSELSRKKD 143


>gi|406706639|ref|YP_006756992.1| Lactococcus phage M3 protein [alpha proteobacterium HIMB5]
 gi|406652415|gb|AFS47815.1| Lactococcus phage M3 protein [alpha proteobacterium HIMB5]
          Length = 164

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 12/148 (8%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAK----SMIMLLRSLD 146
           +GIDP +SG++       NG   +V + P +    GK+ +K+++       ++  +++ D
Sbjct: 4   IGIDPGISGSICFF---DNGRIIKVIEMPTMTD--GKKNKKQVNGAQIYHEILEGIKNYD 58

Query: 147 APIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSG 206
                   +EQ    P  G    ++ G  +G+  GI  A   S+  +    WK ++ L  
Sbjct: 59  KK-DIKIVIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLSMYFVRPAKWKKYFNLI- 116

Query: 207 GTSTKDDSRRVASTLFPSLCSQLKRKKD 234
             S KD SR  A  +FP   SQL +KKD
Sbjct: 117 -NSEKDASRTRAIEIFPYFSSQLSKKKD 143


>gi|254455456|ref|ZP_05068885.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           sp. HTCC7211]
 gi|207082458|gb|EDZ59884.1| crossover junction endodeoxyribonuclease [Candidatus Pelagibacter
           sp. HTCC7211]
          Length = 164

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 65/149 (43%), Gaps = 12/149 (8%)

Query: 90  ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLV-GKRVRKRLDAKSM---IMLLRSL 145
            +GIDP +SG++   +        ++ D   +P ++ GK+ +K+++   +   I      
Sbjct: 3   VIGIDPGISGSICFFQD------GKIIDVVEMPTMIEGKKNKKQVNGSQIFNEISYRIKK 56

Query: 146 DAPIGTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLS 205
                    +EQ    P  G    ++ G  +G+  GI  A    +  +    WK ++ L 
Sbjct: 57  IDKKDIKVVIEQVSAMPGQGVTSMFNFGQSFGILKGICSAMQLPMYFVRPAKWKKYFNLI 116

Query: 206 GGTSTKDDSRRVASTLFPSLCSQLKRKKD 234
              S KD SR  A  +FP   SQL RKKD
Sbjct: 117 --NSEKDASRTRAIEVFPYFSSQLSRKKD 143


>gi|15229812|ref|NP_189976.1| uncharacterized protein [Arabidopsis thaliana]
 gi|7594551|emb|CAB88118.1| putative protein [Arabidopsis thaliana]
 gi|332644319|gb|AEE77840.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 84

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 48/95 (50%), Gaps = 17/95 (17%)

Query: 22  VSSKIRTLCTYPFAQHGRTLLVRAAAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTAN 81
            SSK+++ C+  F+ H               K    A MKQ  L+ L+    +  ++   
Sbjct: 3   FSSKLKSFCSRYFSSH--------------EKDKYKALMKQKLLNPLSSSSPNYDEMI-- 46

Query: 82  QTNADSQWALGIDPDLSGALAVLKSDHNG-CSAEV 115
           Q N    W +GIDP+LSGALAVLK D  G CSA+V
Sbjct: 47  QENFKPHWIIGIDPNLSGALAVLKFDDKGSCSAQV 81


>gi|299534268|ref|ZP_07047617.1| hypothetical protein CTS44_25676 [Comamonas testosteroni S44]
 gi|298717726|gb|EFI58734.1| hypothetical protein CTS44_25676 [Comamonas testosteroni S44]
          Length = 166

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 70/152 (46%), Gaps = 13/152 (8%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKR--VRKRLDAKSMI-MLLRSLD 146
           +GIDP L+GA AV+  DHNG  A +FD P +PV   G++  V++++D K+   +LL+   
Sbjct: 4   IGIDPGLTGACAVM--DHNGVRA-IFDLPTMPVPGAGEKALVKRKIDGKAFCQLLLKHCP 60

Query: 147 APIGTTAYVEQSIPYPQDGKQGWWSGG---FGYGLWIGILVASGFSVVPIPSLTWKNWYG 203
           A  G    V + +           + G      G    +L   G+    +   +WK  + 
Sbjct: 61  AGEGKPLAVLEKVNTMGGANNAVQTQGSLMRSLGAIESVLECLGYPTEHVAPQSWKKLFR 120

Query: 204 LSGGTSTKDDSRRVASTLFPSLCSQLKRKKDH 235
           L    S K  +  +A  L P     L R+KDH
Sbjct: 121 LG---SEKTKALEMARKLHPEAAGNLTRQKDH 149


>gi|414088500|ref|YP_006988678.1| putative RNaseH-like domain protein [Caulobacter phage CcrColossus]
 gi|408731870|gb|AFU88314.1| putative RNaseH-like domain protein [Caulobacter phage CcrColossus]
          Length = 168

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 67/159 (42%), Gaps = 12/159 (7%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
           LGIDP ++GA+A++ +D    S  + D P  P   GK      +A S     R  DA   
Sbjct: 5   LGIDPGVNGAIALIDTDD--WSICITDMPREPGKGGK------NAVSPAGCARIFDATAP 56

Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
              ++E     PQ G        F +G  +GIL  +  S   +  +  ++W   +     
Sbjct: 57  DYTFIEDVWSSPQMGV----VSAFSFGRNLGILEGAAASRSILTKVRPQDWKAATKTPKD 112

Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHHL 249
           K+++RR A  LFP      KR KD G   +     Y  L
Sbjct: 113 KNEARRRAMQLFPCAYDLFKRVKDDGRAEAAILSFYGLL 151


>gi|126444857|ref|YP_001063354.1| hypothetical protein BURPS668_A2360 [Burkholderia pseudomallei 668]
 gi|167907357|ref|ZP_02494562.1| hypothetical protein BpseN_34325 [Burkholderia pseudomallei NCTC
           13177]
 gi|126224348|gb|ABN87853.1| conserved hypothetical protein [Burkholderia pseudomallei 668]
          Length = 164

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 60/137 (43%), Gaps = 8/137 (5%)

Query: 90  ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPI 149
           ALGIDP + GALA L  DHNG    + D P  P     +VR  +DA ++   LR L  P 
Sbjct: 4   ALGIDPGIRGALAAL--DHNG-HLRIADMPTRPKQGNGKVRNEIDAAALQRQLREL-IPA 59

Query: 150 GTTAYV--EQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGL--S 205
              A V  E    +     Q   S      +   +   SG++   +    W+ +YG+  S
Sbjct: 60  DEQAIVVMEALNTFAGGSVQTMASLEATKAVIATVCELSGYTPAIVTPQCWQRFYGIKRS 119

Query: 206 GGTSTKDDSRRVASTLF 222
               TK  S R+A  L+
Sbjct: 120 ASEDTKAQSLRLARDLY 136


>gi|301060903|ref|ZP_07201705.1| conserved hypothetical protein [delta proteobacterium NaphS2]
 gi|300444946|gb|EFK08909.1| conserved hypothetical protein [delta proteobacterium NaphS2]
          Length = 153

 Score = 45.8 bits (107), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 62/147 (42%), Gaps = 18/147 (12%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
           +G+DP  +GA A++  +      +V D P  P L+    R  L    ++M          
Sbjct: 7   IGVDPGKTGAAALITDE---GIHDVLDYPGDPSLIVATFRGWLLNYKIVM---------- 53

Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS-VVPIPSLTWKNWYGLSGGTS 209
             A +E+    P  G    ++ G   G W GI+ A G   ++P P    K       G  
Sbjct: 54  --AALEKVSARPGQGVVSMFTFGRNLGAWEGIISAFGIPFMMPTPRQWQKGLIDQKAGND 111

Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHG 236
           TK  S   A  LFP   ++L RKKDHG
Sbjct: 112 TKARSLNTARRLFPD--AELARKKDHG 136


>gi|418846730|ref|ZP_13401496.1| crossover junction endodeoxyribonuclease [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 19443]
 gi|392809773|gb|EJA65806.1| crossover junction endodeoxyribonuclease [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 19443]
          Length = 154

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 14/160 (8%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKRVRKRLDAKSMIMLLRSLDAPI 149
           LGIDP  SG+L VL ++  G      D   +P + VG   + R++  ++   LR      
Sbjct: 5   LGIDPGCSGSL-VLITEQGG----YIDHLAMPTIKVG--TKSRVNGAAVAAWLRKYGI-- 55

Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
            T AY+EQ    P  G    ++ G   G+  GIL         +    WK   GL G  S
Sbjct: 56  -THAYLEQVGAMPGQGTASMFTFGHAAGVAEGILQGLNIPYTLVTPQAWKKSAGLIG--S 112

Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHHL 249
            KD +R  A  L+P L + L  K     I+    I  H +
Sbjct: 113 DKDAARSRAIQLYPELRA-LDAKAKGQAIADALLIARHGI 151


>gi|301059600|ref|ZP_07200512.1| conserved hypothetical protein [delta proteobacterium NaphS2]
 gi|300446365|gb|EFK10218.1| conserved hypothetical protein [delta proteobacterium NaphS2]
          Length = 153

 Score = 42.4 bits (98), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 63/148 (42%), Gaps = 20/148 (13%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLP-VLVGKRVRKRLDAKSMIMLLRSLDAPI 149
           +GIDP  +GA A++  +    + E+ D P  P ++V K    +L    +++ L  + A  
Sbjct: 7   IGIDPGKTGAAALITDE---GTHEILDYPGDPSLIVAKFTEWKLHHHIVMVALEKVSA-- 61

Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS-VVPIPSLTWKNWYGLSGGT 208
                       P  G    +S G   G W GI+ A G   ++P P    +       G 
Sbjct: 62  -----------RPGQGVVSMFSFGRNLGTWEGIISAFGIPFMMPTPRQWQQGLIDQKAGG 110

Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHG 236
             K  S   A  LFP   ++L RKKDHG
Sbjct: 111 DPKARSLNTARRLFPD--AELSRKKDHG 136


>gi|301060290|ref|ZP_07201155.1| conserved hypothetical protein [delta proteobacterium NaphS2]
 gi|300445603|gb|EFK09503.1| conserved hypothetical protein [delta proteobacterium NaphS2]
          Length = 153

 Score = 42.0 bits (97), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 20/148 (13%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLP-VLVGKRVRKRLDAKSMIMLLRSLDAPI 149
           +G+DP  +GA A++  +    + E+ D P  P ++V K    +L    +++ L  + A  
Sbjct: 7   IGVDPGKTGAAALITDE---GTHEILDYPGDPSLIVAKFTEWKLHHHIVMVALEKVSA-- 61

Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFS-VVPIPSLTWKNWYGLSGGT 208
                       P  G    +S G   G W GI+ A G   ++P P    +       G 
Sbjct: 62  -----------RPGQGVVSMFSFGRNLGTWEGIISAFGIPFMMPTPRQWQQGLIDQKAGG 110

Query: 209 STKDDSRRVASTLFPSLCSQLKRKKDHG 236
             K  S   A  LFP   ++L RKKDHG
Sbjct: 111 DPKARSLNTARRLFPD--AELSRKKDHG 136


>gi|30387422|ref|NP_848251.1| crossover junction endodeoxyribonuclease [Enterobacteria phage
           epsilon15]
 gi|30266077|gb|AAO06106.1| crossover junction endodeoxyribonuclease [Salmonella phage
           epsilon15]
          Length = 154

 Score = 40.4 bits (93), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 44/160 (27%), Positives = 66/160 (41%), Gaps = 14/160 (8%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKRVRKRLDAKSMIMLLRSLDAPI 149
           LGIDP  SG+L ++            D   +P + VG   + R++  ++   +R     +
Sbjct: 5   LGIDPGCSGSLVLITEQ-----GHYIDHLAMPTIKVG--TKSRVNGAAVAAWVRQY---V 54

Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
            T AY+EQ    P  G    ++ G   G+  GIL         +    WK   GL G  S
Sbjct: 55  ITHAYLEQVGAMPGQGTASMFTFGHAAGVAEGILQGLNIPYTLVTPQAWKKSAGLIG--S 112

Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHHL 249
            KD +R  A  L+P L + L  K     I+    I  H +
Sbjct: 113 DKDAARSRAIQLYPELRA-LDAKAKGQAIADALLIARHGI 151


>gi|429100617|ref|ZP_19162591.1| crossover junction endodeoxyribonuclease [Cronobacter turicensis
           564]
 gi|426287266|emb|CCJ88704.1| crossover junction endodeoxyribonuclease [Cronobacter turicensis
           564]
          Length = 155

 Score = 40.4 bits (93), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 65/159 (40%), Gaps = 12/159 (7%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
           +GIDP  SGAL  +  D  G      + P + V      + R++   +   L+S      
Sbjct: 5   IGIDPGCSGALVAV--DETGEYVAHLNMPTIKV----GSKARVNGAQLAAWLQSWSI--- 55

Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
           + AY+EQ    P  G    ++ G   G+  GIL  +      +    WK    L G  S 
Sbjct: 56  SHAYLEQVGALPGQGTASMFTFGHAAGIAEGILQGAYIPYTLVTPQAWKKAASLIG--SD 113

Query: 211 KDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYHHL 249
           KD +R  A  L+P+L + L  K     I+    I  H L
Sbjct: 114 KDAARSRAIQLYPALRA-LDAKAKGQAIADALLIARHGL 151


>gi|414089188|ref|YP_006989356.1| putative RNaseH-like domain protein [Caulobacter phage CcrRogue]
 gi|408730361|gb|AFU86809.1| putative RNaseH-like domain protein [Caulobacter phage CcrRogue]
          Length = 171

 Score = 38.9 bits (89), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 61/142 (42%), Gaps = 18/142 (12%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKRVRKRLDAKSMIMLLRSLDAPIG 150
           +G+DP +SG + +L  D   CS  V D P  P   G    +RL + +++ +      P  
Sbjct: 4   IGVDPGISGGVVLL--DTEACSLTVADMPTEPSTKG----RRLASSALLTVFLHDVQP-- 55

Query: 151 TTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTST 210
              ++E+    P +G  G +S G G G   G+   +  ++  +    WK        T T
Sbjct: 56  DHIFLEEVGVRPGEGAVGAFSFGRGLGRLEGVAAGTRTALTTVTPQEWKRV------TKT 109

Query: 211 KDDSRRV---ASTLFPSLCSQL 229
             D +R    A  LFP  C +L
Sbjct: 110 PADKKRAIARAYQLFPR-CVKL 130


>gi|455641397|gb|EMF20568.1| Lactococcus phage M3 protein [Citrobacter freundii GTC 09479]
          Length = 154

 Score = 38.1 bits (87), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 63/158 (39%), Gaps = 14/158 (8%)

Query: 91  LGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVL-VGKRVRKRLDAKSMIMLLRSLDAPI 149
           LGIDP  SG+L ++            D   +P + VG   + R++  ++   +R      
Sbjct: 5   LGIDPGCSGSLVLITEQ-----GHYIDHLAMPTIKVG--TKSRVNGAAVAAWVRKYGI-- 55

Query: 150 GTTAYVEQSIPYPQDGKQGWWSGGFGYGLWIGILVASGFSVVPIPSLTWKNWYGLSGGTS 209
            T AY+E     P  G    ++ G   G+  GIL         +    WK   GL G  S
Sbjct: 56  -THAYLEHVGAMPGQGTASMFTFGHAAGVAEGILQGLNIPYTLVTPQAWKKSAGLIG--S 112

Query: 210 TKDDSRRVASTLFPSLCSQLKRKKDHGTISSPYCILYH 247
            KD +R  A  L+P L + L  K     I+    I  H
Sbjct: 113 DKDAARSRAIQLYPELRA-LDAKAKGQAIADALLIARH 149


>gi|341882819|gb|EGT38754.1| hypothetical protein CAEBREN_01743 [Caenorhabditis brenneri]
          Length = 896

 Score = 37.0 bits (84), Expect = 8.5,   Method: Composition-based stats.
 Identities = 33/146 (22%), Positives = 58/146 (39%), Gaps = 19/146 (13%)

Query: 30  CTYPFAQHGRTLLVRAAAAGDTTKAAVAAQMKQNWLDSLTFPPLHVHDLTANQTNADSQW 89
           CT  F +H     +RA A  D  +     Q ++  L +  +   H+H+    + N + + 
Sbjct: 264 CTDKFPKHAT---LRAVAYVDQRRIGEFVQHREEQLQAHDY---HIHE----ENNPEGEQ 313

Query: 90  ALGIDPDLSGALAVLKSDHNGCSAEVFDTPHLPVLVGKR--------VRKRLDAKSMIML 141
              ++P  S ++     D  G  + V    HL V+ GKR         R   D  S    
Sbjct: 314 VFSVNPPSSRSIVCSVVDIRGAGSAVLTPVHLEVVAGKRFDLQFLILTRGTYDVYSK-KF 372

Query: 142 LRSLDAPIGTTAYVEQSIPYPQDGKQ 167
               D  +G   YV+  +P+ +D ++
Sbjct: 373 FEIRDLQLGDLVYVDIIVPFHEDVRK 398


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.133    0.419 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,157,612,572
Number of Sequences: 23463169
Number of extensions: 175093840
Number of successful extensions: 390146
Number of sequences better than 100.0: 60
Number of HSP's better than 100.0 without gapping: 34
Number of HSP's successfully gapped in prelim test: 26
Number of HSP's that attempted gapping in prelim test: 390039
Number of HSP's gapped (non-prelim): 63
length of query: 249
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 110
effective length of database: 9,097,814,876
effective search space: 1000759636360
effective search space used: 1000759636360
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 75 (33.5 bits)