BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024416
         (268 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224133834|ref|XP_002321672.1| predicted protein [Populus trichocarpa]
 gi|222868668|gb|EEF05799.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 192/268 (71%), Positives = 227/268 (84%), Gaps = 2/268 (0%)

Query: 1   MGRNTQIFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIE 60
           MG+  +I   +FL+ L S Q +     +  A+RT+KQE G AH+VHCSRERSRAAW  IE
Sbjct: 1   MGKAARITQILFLISLLSKQILSQDVGEHVAARTVKQEEGHAHKVHCSRERSRAAWQAIE 60

Query: 61  EYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQH 120
           EYLMPFVE+ QY+ISSKCRLHP ND+FRDQE+HKIHVDINEW+CGYCKK FRAEK+LDQH
Sbjct: 61  EYLMPFVEREQYQISSKCRLHPSNDLFRDQEEHKIHVDINEWKCGYCKKHFRAEKYLDQH 120

Query: 121 FDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFP 180
           FDNRH NLLN+S GKC+ADLCGALHCDF+M+SKS+++KCNPAAVAKN HLCESLA+ CFP
Sbjct: 121 FDNRHYNLLNISDGKCMADLCGALHCDFMMDSKSTKTKCNPAAVAKNHHLCESLADSCFP 180

Query: 181 INQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLI 240
           +NQGPSASRLHELFL QFCDAH C    + FPKGG+K+ SVFYL+ISILT+M++P+FYLI
Sbjct: 181 LNQGPSASRLHELFLHQFCDAHTCSGNKKLFPKGGKKKTSVFYLAISILTMMMIPLFYLI 240

Query: 241 VYLYQREMRGGTQELKRIPRVGRKAKPS 268
           VYL+Q EMR GTQEL+RI +  RK KPS
Sbjct: 241 VYLHQSEMRKGTQELRRISK--RKTKPS 266


>gi|359491160|ref|XP_002262859.2| PREDICTED: uncharacterized protein LOC100244377 [Vitis vinifera]
          Length = 254

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 191/260 (73%), Positives = 218/260 (83%), Gaps = 8/260 (3%)

Query: 10  SIFLLLLFSPQEILIQGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVE 68
           +IF L +F       Q F++SA+ R LKQ+   AHEVHCSRERSRAAW +IEEY+MPFVE
Sbjct: 2   TIFNLFMF-------QDFEESAAARNLKQQKDHAHEVHCSRERSRAAWKIIEEYMMPFVE 54

Query: 69  QVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNL 128
           + QY+ISS C+LHPDN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH  L
Sbjct: 55  REQYQISSSCKLHPDNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHYGL 114

Query: 129 LNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 188
           LNVSH KCLADLCGALHCD VMNSK+ ++KCNPAA  +NRHLCESLA+ CFP+N G SAS
Sbjct: 115 LNVSHSKCLADLCGALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNHGLSAS 174

Query: 189 RLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREM 248
           RLH++FLRQFCDAH C    +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYLYQREM
Sbjct: 175 RLHDIFLRQFCDAHTCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYLYQREM 234

Query: 249 RGGTQELKRIPRVGRKAKPS 268
           R GTQELKRI R G K KPS
Sbjct: 235 RNGTQELKRISRHGLKTKPS 254


>gi|297733623|emb|CBI14870.3| unnamed protein product [Vitis vinifera]
          Length = 276

 Score =  406 bits (1044), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 187/246 (76%), Positives = 212/246 (86%), Gaps = 1/246 (0%)

Query: 24  IQGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHP 82
            Q F++SA+ R LKQ+   AHEVHCSRERSRAAW +IEEY+MPFVE+ QY+ISS C+LHP
Sbjct: 31  TQDFEESAAARNLKQQKDHAHEVHCSRERSRAAWKIIEEYMMPFVEREQYQISSSCKLHP 90

Query: 83  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCG 142
           DN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH  LLNVSH KCLADLCG
Sbjct: 91  DNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHYGLLNVSHSKCLADLCG 150

Query: 143 ALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAH 202
           ALHCD VMNSK+ ++KCNPAA  +NRHLCESLA+ CFP+N G SASRLH++FLRQFCDAH
Sbjct: 151 ALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNHGLSASRLHDIFLRQFCDAH 210

Query: 203 KCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVG 262
            C    +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYLYQREMR GTQELKRI R G
Sbjct: 211 TCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYLYQREMRNGTQELKRISRHG 270

Query: 263 RKAKPS 268
            K KPS
Sbjct: 271 LKTKPS 276


>gi|296081316|emb|CBI17698.3| unnamed protein product [Vitis vinifera]
          Length = 409

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 179/279 (64%), Positives = 211/279 (75%), Gaps = 11/279 (3%)

Query: 1   MGRNTQIFTSIFLLLL----------FSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSR 49
           MGR+  +F+ +  L +           SP     QGF +S  SR +KQ+    HEVHCSR
Sbjct: 131 MGRSASLFSMVLWLWVCLQLRPTTSSISPPLHTNQGFQESTNSRNVKQDQDGVHEVHCSR 190

Query: 50  ERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKK 109
           ERSRAAW +IEEYLMPFVE+  Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKK
Sbjct: 191 ERSRAAWKIIEEYLMPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKK 250

Query: 110 SFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRH 169
           SF AE +LD+HFDNRH NLLNVS  KCLAD+CGALHCD VM+SK  ++KCNPAA A+N H
Sbjct: 251 SFYAENYLDKHFDNRHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHH 310

Query: 170 LCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISIL 229
           LCESLA+ CFP++ GPSASRLHE FLRQFCDAH C    +PF +G +K+ SVFYL ISIL
Sbjct: 311 LCESLADGCFPVSGGPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISIL 370

Query: 230 TLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           TLMLLP+FY  VYLYQR +R G Q  +RI   GRK KP+
Sbjct: 371 TLMLLPLFYFFVYLYQRGLRRGAQNFRRISESGRKKKPT 409


>gi|359488643|ref|XP_002271064.2| PREDICTED: uncharacterized protein LOC100257742 [Vitis vinifera]
          Length = 279

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 179/279 (64%), Positives = 211/279 (75%), Gaps = 11/279 (3%)

Query: 1   MGRNTQIFTSIFLLLL----------FSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSR 49
           MGR+  +F+ +  L +           SP     QGF +S  SR +KQ+    HEVHCSR
Sbjct: 1   MGRSASLFSMVLWLWVCLQLRPTTSSISPPLHTNQGFQESTNSRNVKQDQDGVHEVHCSR 60

Query: 50  ERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKK 109
           ERSRAAW +IEEYLMPFVE+  Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKK
Sbjct: 61  ERSRAAWKIIEEYLMPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKK 120

Query: 110 SFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRH 169
           SF AE +LD+HFDNRH NLLNVS  KCLAD+CGALHCD VM+SK  ++KCNPAA A+N H
Sbjct: 121 SFYAENYLDKHFDNRHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHH 180

Query: 170 LCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISIL 229
           LCESLA+ CFP++ GPSASRLHE FLRQFCDAH C    +PF +G +K+ SVFYL ISIL
Sbjct: 181 LCESLADGCFPVSGGPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISIL 240

Query: 230 TLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           TLMLLP+FY  VYLYQR +R G Q  +RI   GRK KP+
Sbjct: 241 TLMLLPLFYFFVYLYQRGLRRGAQNFRRISESGRKKKPT 279


>gi|226531528|ref|NP_001142988.1| uncharacterized protein LOC100275449 precursor [Zea mays]
 gi|195612594|gb|ACG28127.1| zinc finger protein [Zea mays]
          Length = 276

 Score =  369 bits (947), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 168/238 (70%), Positives = 200/238 (84%)

Query: 31  ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
            SR L Q    AHEVHCSRERSRAAW  I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39  GSRILLQTNPDAHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 98

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
           EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 99  EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 158

Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQP 210
             K  +SKCN AA  +NRHLCESLA+ CFPINQG +ASRLHE FLRQFCDAH C + ++P
Sbjct: 159 QFKKPKSKCNAAAATRNRHLCESLADSCFPINQGLAASRLHEFFLRQFCDAHTCSKGTKP 218

Query: 211 FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           FPKGGRKQ +  YL++SILTL+LLP+FYLIV+L+QREMR GTQ+LKR  ++G+K KPS
Sbjct: 219 FPKGGRKQTNRLYLALSILTLILLPLFYLIVFLHQREMRKGTQDLKRFSKIGQKKKPS 276


>gi|242083250|ref|XP_002442050.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
 gi|241942743|gb|EES15888.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
          Length = 276

 Score =  369 bits (947), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 173/255 (67%), Positives = 207/255 (81%), Gaps = 1/255 (0%)

Query: 15  LLFSPQEILIQGFDDS-ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYE 73
           LLF       Q F ++  SR L Q    AHEVHCSRERSRAAW  I+EYLMPFVE+ +YE
Sbjct: 22  LLFVDGSHQDQEFREAVGSRILLQTNPDAHEVHCSRERSRAAWEAIDEYLMPFVEKEKYE 81

Query: 74  ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 133
           + SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKFLDQHF+NRH NL++ S 
Sbjct: 82  LPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFLDQHFENRHKNLVDNSE 141

Query: 134 GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHEL 193
           G+C+ADLCGALHCD +M  K  +SKCN AA  +NRHLCESLA+ CFP+NQG +ASRLHE 
Sbjct: 142 GRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPVNQGLAASRLHEF 201

Query: 194 FLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQ 253
           FLRQFCDAH C + ++PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L+QREMR GTQ
Sbjct: 202 FLRQFCDAHTCNKGTKPFPKGGRKQTNRFYLALCILTLILLPLFYLIVFLHQREMRKGTQ 261

Query: 254 ELKRIPRVGRKAKPS 268
           +LKR  ++G+K KPS
Sbjct: 262 DLKRFSKIGQKKKPS 276


>gi|238013592|gb|ACR37831.1| unknown [Zea mays]
 gi|413916779|gb|AFW56711.1| zinc finger protein [Zea mays]
          Length = 275

 Score =  368 bits (945), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 168/238 (70%), Positives = 201/238 (84%), Gaps = 1/238 (0%)

Query: 31  ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
            SR L+  P  AHEVHCSRERSRAAW  I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39  GSRILQTNPD-AHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 97

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
           EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 98  EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 157

Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQP 210
             K  +SKCN AA  +NRHLCESLA+ CFPINQG +ASRLHE FLRQFCDAH C + ++P
Sbjct: 158 QFKKPKSKCNAAAATRNRHLCESLADSCFPINQGLAASRLHEFFLRQFCDAHTCSKGTKP 217

Query: 211 FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           FPKGGRKQ +  YL++SILTL+LLP+FYLIV+L+QREMR GTQ+LKR  ++G+K KPS
Sbjct: 218 FPKGGRKQTNRLYLALSILTLILLPLFYLIVFLHQREMRKGTQDLKRFSKIGQKKKPS 275


>gi|449440588|ref|XP_004138066.1| PREDICTED: uncharacterized protein LOC101218367 [Cucumis sativus]
          Length = 280

 Score =  365 bits (936), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 166/244 (68%), Positives = 202/244 (82%), Gaps = 1/244 (0%)

Query: 25  QGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
           Q  + SA+ R L+Q      EVHCSRERSR AW +IEE+L+PF+E+  YE+S++CRLHP+
Sbjct: 36  QDEEQSATLRPLEQNEEHVDEVHCSRERSRTAWNIIEEHLLPFMEKENYEVSTQCRLHPN 95

Query: 84  NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
           ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDNRH+NLLNVSHGKCLADLCGA
Sbjct: 96  NDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHSNLLNVSHGKCLADLCGA 155

Query: 144 LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK 203
           LHCD  M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+GPSA+RLHELFL QFC AH 
Sbjct: 156 LHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINEGPSANRLHELFLHQFCGAHS 215

Query: 204 CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGR 263
           C  K +PF +G  +QP +FY++ SIL LMLLPIFY+IVYL++RE R G + LKRI + GR
Sbjct: 216 CTGKQKPFSRGAARQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLKRISKAGR 275

Query: 264 KAKP 267
           K KP
Sbjct: 276 KNKP 279


>gi|218186624|gb|EEC69051.1| hypothetical protein OsI_37889 [Oryza sativa Indica Group]
          Length = 330

 Score =  363 bits (932), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 173/269 (64%), Positives = 211/269 (78%), Gaps = 2/269 (0%)

Query: 2   GRN-TQIFTSIFLLLLFSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSRERSRAAWGVI 59
           GR+ +  +  +F   L +      Q F ++A SRTL       ++VHCSRERSRAAW  I
Sbjct: 62  GRSRSAAYLVLFASCLLAVAAASHQEFHEAAGSRTLLMSHEHTNQVHCSRERSRAAWKAI 121

Query: 60  EEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQ 119
           +EYLMPFVE+ +YE+ SKCRLHPDNDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQ
Sbjct: 122 DEYLMPFVEKEKYELPSKCRLHPDNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQ 181

Query: 120 HFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCF 179
           HF NRHNNL++ S G+CLADLCGALHCD ++  K  +SKC+  A A+NRHLCESLA+ CF
Sbjct: 182 HFHNRHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCF 241

Query: 180 PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYL 239
           PINQG SASRLHE FLRQFCDAH C   S+PFPKGGRKQ + FYL++  LT++LLP+FYL
Sbjct: 242 PINQGQSASRLHEFFLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYL 301

Query: 240 IVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           IV+L+QREM+ G Q L+RI +V +K KPS
Sbjct: 302 IVFLHQREMKKGGQNLRRISKVVQKKKPS 330


>gi|326487255|dbj|BAJ89612.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326532646|dbj|BAJ89168.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 273

 Score =  362 bits (930), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 174/256 (67%), Positives = 201/256 (78%), Gaps = 4/256 (1%)

Query: 13  LLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQY 72
           LL+  S QE+     + S SR L Q     HEVHCSRERSRAAW  I+EYLMPFVE+ +Y
Sbjct: 22  LLVAGSHQEL----HEASGSRILHQTKPHTHEVHCSRERSRAAWKAIDEYLMPFVEKEKY 77

Query: 73  EISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVS 132
           E+ SKCRL P+NDMFR+QEQHKIH D+NEWRCG+CKKSFRAEKF+DQHF NRHNNLL+ S
Sbjct: 78  ELPSKCRLRPENDMFREQEQHKIHFDVNEWRCGFCKKSFRAEKFIDQHFANRHNNLLDNS 137

Query: 133 HGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHE 192
            G+CLADLCGALHCD  M  K  +SKCN AA A+NRH CESLA+ CFPINQG SASRLHE
Sbjct: 138 QGRCLADLCGALHCDLTMEFKKPKSKCNDAAAARNRHRCESLADSCFPINQGHSASRLHE 197

Query: 193 LFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGT 252
            FL QFCDAH C   S+PFPKGGRKQ + FYL+I ILT++LLP+FYLIV+L+QREM+ G 
Sbjct: 198 FFLHQFCDAHTCNGVSKPFPKGGRKQTNRFYLAICILTVLLLPVFYLIVFLHQREMKKGD 257

Query: 253 QELKRIPRVGRKAKPS 268
           Q  KRI +   K KPS
Sbjct: 258 QVFKRIGKTVHKKKPS 273


>gi|297793919|ref|XP_002864844.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297310679|gb|EFH41103.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 270

 Score =  359 bits (921), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 181/270 (67%), Positives = 215/270 (79%), Gaps = 2/270 (0%)

Query: 1   MGRNTQIFTSIFLLLLFSPQEILIQGFDDSASRTLKQEP--GRAHEVHCSRERSRAAWGV 58
           MGR  +    I LL      + L QGF++S S  L  E   G + E+HCSRERSRAAW +
Sbjct: 1   MGRLCESLPVILLLFALLLHQSLSQGFEESESTRLVNEEVGGNSPEIHCSRERSRAAWQI 60

Query: 59  IEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLD 118
           I++YL PFVE+ +Y+I   CRLHPDND++RDQEQHKIHVD+ EW+CGYCKKSF  EKFLD
Sbjct: 61  IQDYLTPFVERERYQIPKNCRLHPDNDLYRDQEQHKIHVDVFEWKCGYCKKSFNDEKFLD 120

Query: 119 QHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRC 178
           +HF  RH NLLN +  KCLADLCGALHCDFV+NSK  ++KCNPAAVAKNRHLCES+AN C
Sbjct: 121 KHFTTRHYNLLNTTDTKCLADLCGALHCDFVLNSKKPKTKCNPAAVAKNRHLCESVANSC 180

Query: 179 FPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFY 238
           FP++QGPSASRLHE FLRQFCDAH C  K +PFP+GG+K+  VFYL+ISILTLMLLP+FY
Sbjct: 181 FPVSQGPSASRLHEHFLRQFCDAHTCTGKDKPFPRGGKKKSGVFYLAISILTLMLLPLFY 240

Query: 239 LIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           L+V+LYQRE R GTQEL+RI + GRK KPS
Sbjct: 241 LLVFLYQREKRSGTQELRRIRKTGRKPKPS 270


>gi|115487926|ref|NP_001066450.1| Os12g0233400 [Oryza sativa Japonica Group]
 gi|108862374|gb|ABA96879.2| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|108862375|gb|ABG21933.1| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|108862376|gb|ABG21934.1| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|108862377|gb|ABG21935.1| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113648957|dbj|BAF29469.1| Os12g0233400 [Oryza sativa Japonica Group]
 gi|215697514|dbj|BAG91508.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 275

 Score =  358 bits (920), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 171/269 (63%), Positives = 209/269 (77%), Gaps = 2/269 (0%)

Query: 2   GRN-TQIFTSIFLLLLFSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSRERSRAAWGVI 59
           GR+ +  +  +F   L +      Q F ++A SRTL       ++VHCSRERSRAAW  I
Sbjct: 7   GRSRSAAYLVLFASCLLAVAAASHQEFHEAAGSRTLLMSHEHTNQVHCSRERSRAAWKAI 66

Query: 60  EEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQ 119
           +EYLMPFVE+ +YE+ SKCRLHP NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQ
Sbjct: 67  DEYLMPFVEKEKYELPSKCRLHPGNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQ 126

Query: 120 HFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCF 179
           HF NRHNNL++ S G+CLADLCGALHCD ++  K  +SKC+  A A+NRHLCESLA+ CF
Sbjct: 127 HFHNRHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCF 186

Query: 180 PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYL 239
           PINQG SASRLHE  LRQFCDAH C   S+PFPKGGRKQ + FYL++  LT++LLP+FYL
Sbjct: 187 PINQGQSASRLHEFLLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYL 246

Query: 240 IVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           IV+L+QREM+ G Q L+RI +V +K KPS
Sbjct: 247 IVFLHQREMKKGGQNLRRISKVVQKKKPS 275


>gi|356548383|ref|XP_003542582.1| PREDICTED: uncharacterized protein LOC100780668 [Glycine max]
          Length = 244

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 161/247 (65%), Positives = 197/247 (79%), Gaps = 6/247 (2%)

Query: 22  ILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLH 81
           +L++G +       +QE   A EVHCSRERSRAA  VIEEYL PF+E+  Y++SSKC+LH
Sbjct: 4   LLVEGIEQ------EQEQKHAQEVHCSRERSRAARKVIEEYLTPFMERENYQLSSKCKLH 57

Query: 82  PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLC 141
           P+ND+F DQE+HKI++D +EWRCGYCKKSFR EKFLDQHFD+RH N LN+SHGKCL DLC
Sbjct: 58  PENDIFTDQEEHKIYIDRHEWRCGYCKKSFREEKFLDQHFDSRHYNFLNLSHGKCLGDLC 117

Query: 142 GALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDA 201
           GALHCD VMN KSSRSKCNPAA A+NRHLCE+LA+ CFPI++GPSA RLHELFL QFCDA
Sbjct: 118 GALHCDAVMNFKSSRSKCNPAAAARNRHLCENLADNCFPISEGPSAGRLHELFLHQFCDA 177

Query: 202 HKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 261
           H C  K +PF +GG+ Q S F L+   L L+LLP+FYL +YL Q +++  TQEL+RI + 
Sbjct: 178 HTCSGKHKPFSRGGKDQSSFFRLAAGALILVLLPVFYLFLYLVQSDVKSRTQELRRISKA 237

Query: 262 GRKAKPS 268
           G K+KPS
Sbjct: 238 GWKSKPS 244


>gi|18424689|ref|NP_568969.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
 gi|17381006|gb|AAL36315.1| unknown protein [Arabidopsis thaliana]
 gi|21536913|gb|AAM61245.1| zinc finger-like protein [Arabidopsis thaliana]
 gi|22136886|gb|AAM91787.1| unknown protein [Arabidopsis thaliana]
 gi|332010344|gb|AED97727.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
          Length = 271

 Score =  344 bits (882), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 168/247 (68%), Positives = 202/247 (81%), Gaps = 3/247 (1%)

Query: 25  QGFDDSASRTLKQEP---GRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLH 81
           QGF++S S  L  E      A E+HCSRERSRAAW +I++YL PFVE+ +YEI   CRLH
Sbjct: 25  QGFEESESTRLVNEEVEVSNAPEIHCSRERSRAAWQIIQDYLTPFVERERYEIPKNCRLH 84

Query: 82  PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLC 141
           PDND++RDQE HK+HVD+ EW+CGYCKKSF  EKFLD+HF  RH NLLN +  KCLADLC
Sbjct: 85  PDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFSTRHYNLLNTTDTKCLADLC 144

Query: 142 GALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDA 201
           GALHCDFV++SK  +SKCNP AVAKNRHLCES+AN CFP++QGPSASRLHE FLRQFCDA
Sbjct: 145 GALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQGPSASRLHEHFLRQFCDA 204

Query: 202 HKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 261
           H C    +PFP+GG+K+  VFYL+ISILTLMLLP+FYL+V+L+QRE R GTQ+L+RI + 
Sbjct: 205 HTCTGNDKPFPRGGKKKSGVFYLAISILTLMLLPLFYLLVFLHQREKRSGTQDLRRIIKS 264

Query: 262 GRKAKPS 268
           G+K KPS
Sbjct: 265 GKKTKPS 271


>gi|302398695|gb|ADL36642.1| C2H2L domain class transcription factor [Malus x domestica]
          Length = 274

 Score =  339 bits (870), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 158/250 (63%), Positives = 199/250 (79%), Gaps = 6/250 (2%)

Query: 25  QGFDDSAS----RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRL 80
           QG +DSA+    +T+ +E G   EVHCSRERS+AAW +I+EYLMPFVE+ +Y+IS  C+L
Sbjct: 25  QGHEDSAATSFFKTINKELGDPPEVHCSRERSKAAWKIIQEYLMPFVEKERYQISRTCKL 84

Query: 81  HPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADL 140
           HPDND++RDQE+HKI VD+NEW+CGYCKK F  +KFLD+HFDNRH NLLNVSH +CLAD+
Sbjct: 85  HPDNDLYRDQEEHKIQVDLNEWQCGYCKKRFYDDKFLDKHFDNRHYNLLNVSHTRCLADV 144

Query: 141 CGALHCDFVMNS-KSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFC 199
           CGALHCD  M+S    ++KCNPAA A+NRHLCE LA+ CFP+ QGP+ASRLHE FLRQFC
Sbjct: 145 CGALHCDLEMDSVPPKKTKCNPAAAARNRHLCEGLADSCFPVKQGPAASRLHEFFLRQFC 204

Query: 200 DAHKCPRKSQPFPKGGRKQ-PSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRI 258
           D+H C  + +PF +G RK+  S+ YL ISILTL LL +FY  +Y+Y R ++ GTQELKR+
Sbjct: 205 DSHTCTGRQKPFSRGRRKKRASISYLVISILTLALLFLFYSYIYMYSRGVKRGTQELKRV 264

Query: 259 PRVGRKAKPS 268
             +GRK KPS
Sbjct: 265 TPIGRKKKPS 274


>gi|222616852|gb|EEE52984.1| hypothetical protein OsJ_35656 [Oryza sativa Japonica Group]
          Length = 1075

 Score =  338 bits (867), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 154/222 (69%), Positives = 183/222 (82%), Gaps = 1/222 (0%)

Query: 25   QGFDDSA-SRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
            Q F ++A SRTL       ++VHCSRERSRAAW  I+EYLMPFVE+ +YE+ SKCRLHP 
Sbjct: 819  QEFHEAAGSRTLLMSHEHTNQVHCSRERSRAAWKAIDEYLMPFVEKEKYELPSKCRLHPG 878

Query: 84   NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
            NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF NRHNNL++ S G+CLADLCGA
Sbjct: 879  NDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHNRHNNLVDNSQGRCLADLCGA 938

Query: 144  LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK 203
            LHCD ++  K  +SKC+  A A+NRHLCESLA+ CFPINQG SASRLHE  LRQFCDAH 
Sbjct: 939  LHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQGQSASRLHEFLLRQFCDAHT 998

Query: 204  CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
            C   S+PFPKGGRKQ + FYL++  LT++LLP+FYLIV+L+Q
Sbjct: 999  CKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFLHQ 1040


>gi|224060201|ref|XP_002300082.1| predicted protein [Populus trichocarpa]
 gi|222847340|gb|EEE84887.1| predicted protein [Populus trichocarpa]
          Length = 224

 Score =  334 bits (856), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 146/223 (65%), Positives = 185/223 (82%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           EVHCSRERSRAAW +I+EYLMPFVE+ +Y+ISS+CRLHP+ND++RDQEQHK+HVDINEWR
Sbjct: 1   EVHCSRERSRAAWKIIDEYLMPFVEKERYKISSRCRLHPENDLYRDQEQHKMHVDINEWR 60

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAA 163
           CGYCKK+F  EK+LD+HFDNRH +LLNVSH KCLAD+CG LHCD V++S   ++KCNPAA
Sbjct: 61  CGYCKKTFYEEKYLDKHFDNRHYDLLNVSHSKCLADVCGVLHCDLVLDSAPHKTKCNPAA 120

Query: 164 VAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFY 223
             +N+HLCESLA+ CFP+N+G SA RL+E FLRQFCDAH C    +PF KGG+K+ S  Y
Sbjct: 121 TGRNKHLCESLADSCFPVNEGTSALRLNEFFLRQFCDAHTCSGGRKPFSKGGKKETSTLY 180

Query: 224 LSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAK 266
           + IS+L LMLL +FY+ +YLY R ++ G+Q LK I + G+K +
Sbjct: 181 VIISVLVLMLLALFYIFMYLYLRGIKRGSQGLKHISKSGQKKR 223


>gi|357160391|ref|XP_003578749.1| PREDICTED: uncharacterized protein LOC100823211 [Brachypodium
           distachyon]
          Length = 274

 Score =  328 bits (841), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 173/262 (66%), Positives = 204/262 (77%), Gaps = 4/262 (1%)

Query: 7   IFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPF 66
           +  S  LL+  S QEI     + + SR   Q     HEVHCSRERSRAAW  I+EYLMPF
Sbjct: 17  LAASCLLLVAGSHQEI----HEATGSRIHHQTNQHTHEVHCSRERSRAAWKAIDEYLMPF 72

Query: 67  VEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHN 126
           VE+ +YE+ SKCRLH +NDMFR+QE+HKIH D+NEWRCG+CKKSFRAEK++DQHF NRHN
Sbjct: 73  VEKEKYELPSKCRLHAENDMFREQEEHKIHFDVNEWRCGFCKKSFRAEKYIDQHFSNRHN 132

Query: 127 NLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPS 186
           NLL+ S G+CLADLCGALHCD  M  K  +SKCN AA A+N HLCESLA+ CFP+NQG S
Sbjct: 133 NLLDNSQGRCLADLCGALHCDLTMEFKKPKSKCNAAAAARNCHLCESLADNCFPVNQGHS 192

Query: 187 ASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQR 246
           ASRLHE FLRQFCDAH C   S+PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L+QR
Sbjct: 193 ASRLHEFFLRQFCDAHTCNGDSKPFPKGGRKQTNRFYLALCILTLLLLPLFYLIVFLHQR 252

Query: 247 EMRGGTQELKRIPRVGRKAKPS 268
           EM+ G Q  KRIP+   K KPS
Sbjct: 253 EMKKGVQVFKRIPKTVHKKKPS 274


>gi|449455886|ref|XP_004145681.1| PREDICTED: uncharacterized protein LOC101219582 [Cucumis sativus]
 gi|449492886|ref|XP_004159131.1| PREDICTED: uncharacterized LOC101219582 [Cucumis sativus]
          Length = 281

 Score =  327 bits (838), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 160/253 (63%), Positives = 197/253 (77%), Gaps = 3/253 (1%)

Query: 19  PQEILIQGFD--DSASR-TLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEIS 75
           P + L QG +  DSAS  T KQ+   AHEVHCSRERSRAAW VIEEYLMPFVE+ +Y+IS
Sbjct: 29  PLQSLNQGSNSKDSASESTPKQDWNNAHEVHCSRERSRAAWKVIEEYLMPFVEKKKYKIS 88

Query: 76  SKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK 135
           +KCRLHPDNDMFRDQEQHK H+D N+W+CGYC+K F  EK++DQHFDNRH NLLNVS  +
Sbjct: 89  TKCRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYIDQHFDNRHYNLLNVSRNR 148

Query: 136 CLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFL 195
           CLADLCGALHCD V+++ S +SKCNPAA A+N+H+CE LA+ CFP+++G  AS LHE FL
Sbjct: 149 CLADLCGALHCDHVIDAVSQKSKCNPAAAARNKHMCEGLADSCFPVDEGALASHLHEFFL 208

Query: 196 RQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQEL 255
            QFCDAH C  K +PF +G + + SVFY+ IS+LT++ +  FY+  YLY R MR   Q L
Sbjct: 209 HQFCDAHTCSGKPKPFSRGRQVRRSVFYIVISVLTILFVMFFYVFFYLYNRGMRTRPQVL 268

Query: 256 KRIPRVGRKAKPS 268
           KR+ + GRK KPS
Sbjct: 269 KRLSQSGRKKKPS 281


>gi|255540381|ref|XP_002511255.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223550370|gb|EEF51857.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 217

 Score =  325 bits (832), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 147/191 (76%), Positives = 172/191 (90%)

Query: 30  SASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRD 89
           SA RT K++   AHEVHCSRERSRAAW +IEEYLMPFVE+ +Y+ISSKCRLHP+ND+FRD
Sbjct: 6   SAFRTGKKDKEHAHEVHCSRERSRAAWQIIEEYLMPFVERERYQISSKCRLHPENDLFRD 65

Query: 90  QEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFV 149
           QE+HKIHVDINEW+CGYCKKSFRAE+FLDQHF+NRH NLLN+S GKCLADLCGALHCDFV
Sbjct: 66  QEEHKIHVDINEWKCGYCKKSFRAERFLDQHFENRHYNLLNISDGKCLADLCGALHCDFV 125

Query: 150 MNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQ 209
           +N+K+ ++KCNPAAVAKNRHLCESLA+ CFP+NQGPSASRLHELF+ QFCDAH C  K +
Sbjct: 126 LNTKTMKTKCNPAAVAKNRHLCESLADSCFPLNQGPSASRLHELFVHQFCDAHTCSGKRK 185

Query: 210 PFPKGGRKQPS 220
            FPKGG++  +
Sbjct: 186 LFPKGGKRNEA 196


>gi|297805600|ref|XP_002870684.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316520|gb|EFH46943.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 271

 Score =  324 bits (830), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 149/244 (61%), Positives = 192/244 (78%), Gaps = 8/244 (3%)

Query: 24  IQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
           ++GF D          G  HE+HCSRERSR AW +I+EYLMP+VE+ +Y++ S CR+H D
Sbjct: 35  LKGFKDPVD-------GSFHEIHCSRERSRVAWKIIQEYLMPYVEKERYQLPSSCRVHRD 87

Query: 84  NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
           ND++R+QE+HK+H DINEWRCG+CKK+F  EK+LD+HFD+RH NLLN SHGKCLADLCGA
Sbjct: 88  NDIYREQEEHKVHSDINEWRCGFCKKAFYEEKYLDKHFDSRHYNLLNASHGKCLADLCGA 147

Query: 144 LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK 203
           LHCD V+N+   +SKCNPAA A+NRHLCESLAN CFP+N+GPSA+RLH+ FLRQFCDAH 
Sbjct: 148 LHCDLVVNTAQLKSKCNPAASARNRHLCESLANSCFPVNKGPSANRLHDFFLRQFCDAHT 207

Query: 204 CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGR 263
           C   S+PF +  +K+  + Y+ ISI TL++L ++Y  VYL+QR ++ G+QELKRI R G 
Sbjct: 208 CSGGSRPFSQKPKKRGKL-YIIISISTLIVLLLYYSFVYLFQRGLKRGSQELKRIRRNGL 266

Query: 264 KAKP 267
           K KP
Sbjct: 267 KKKP 270


>gi|356567332|ref|XP_003551875.1| PREDICTED: uncharacterized protein LOC100527694 [Glycine max]
          Length = 268

 Score =  320 bits (821), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 164/263 (62%), Positives = 202/263 (76%), Gaps = 7/263 (2%)

Query: 11  IFLL-----LLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
           +FLL     L FS Q       D +A RTL QE G  H++HCSRERSR AW +I+EYLMP
Sbjct: 8   LFLLCIVCGLYFSLQ-FTTGHADSNAGRTLNQEGG-DHQIHCSRERSRTAWKIIQEYLMP 65

Query: 66  FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
           FVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF  EK LDQHFDNRH
Sbjct: 66  FVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDNRH 125

Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP 185
           +NLLN+S  +C+AD+CGALHCD  MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +GP
Sbjct: 126 SNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTEGP 185

Query: 186 SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
           +ASRLHE FL QFCDAH C    +PF +G RK+ +VFY+ +SIL ++LL ++YL +YLYQ
Sbjct: 186 AASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYLYQ 245

Query: 246 REMRGGTQELKRIPRVGRKAKPS 268
           R M+  TQ LKR+ +  RK KPS
Sbjct: 246 RGMKRETQVLKRVSQASRKKKPS 268


>gi|10177309|dbj|BAB10570.1| unnamed protein product [Arabidopsis thaliana]
          Length = 277

 Score =  317 bits (811), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 153/253 (60%), Positives = 186/253 (73%), Gaps = 9/253 (3%)

Query: 25  QGFDDSASRTLKQEP---GRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLH 81
           QGF++S S  L  E      A E+HCSRERSRAAW +I++YL PFVE+ +YEI   CRLH
Sbjct: 25  QGFEESESTRLVNEEVEVSNAPEIHCSRERSRAAWQIIQDYLTPFVERERYEIPKNCRLH 84

Query: 82  PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLC 141
           PDND++RDQE HK+HVD+ EW+CGYCKKSF  EKFLD+HF  RH NLLN +  KCLADLC
Sbjct: 85  PDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFSTRHYNLLNTTDTKCLADLC 144

Query: 142 GALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDA 201
           GALHCDFV++SK  +SKCNP AVAKNRHLCES+AN CFP++QGPSASRLHE FLRQFCDA
Sbjct: 145 GALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQGPSASRLHEHFLRQFCDA 204

Query: 202 HKCPRKSQPFPKGGR----KQPSVFYLSISILTLMLLPIFYLIVY--LYQREMRGGTQEL 255
           H C    +PFP+GG+    K     YL +  +  +   IF  I    ++ RE R GTQ+L
Sbjct: 205 HTCTGNDKPFPRGGKSSYIKGTVWCYLFLFSIIALSSEIFGHIAVSTVFNREKRSGTQDL 264

Query: 256 KRIPRVGRKAKPS 268
           +RI + G+K KPS
Sbjct: 265 RRIIKSGKKTKPS 277


>gi|255632970|gb|ACU16839.1| unknown [Glycine max]
          Length = 266

 Score =  315 bits (807), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 161/259 (62%), Positives = 199/259 (76%), Gaps = 7/259 (2%)

Query: 11  IFLL-----LLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
           +FLL     L FS Q       D +A RTL QE G  H++HCSRERSR AW +I+EYLMP
Sbjct: 8   LFLLCIVCGLYFSLQ-FTTGHADSNAGRTLNQEGG-DHQIHCSRERSRTAWKIIQEYLMP 65

Query: 66  FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
           FVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF  EK LDQHFDNRH
Sbjct: 66  FVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDNRH 125

Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP 185
           +NLLN+S  +C+AD+CGALHCD  MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +GP
Sbjct: 126 SNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTEGP 185

Query: 186 SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
           +ASRLHE FL QFCDAH C    +PF +G RK+ +VFY+ +SIL ++LL ++YL +YLYQ
Sbjct: 186 AASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYLYQ 245

Query: 246 REMRGGTQELKRIPRVGRK 264
           R M+  TQ LKR+ +  RK
Sbjct: 246 RGMKRETQVLKRVSQASRK 264


>gi|356527071|ref|XP_003532137.1| PREDICTED: uncharacterized protein LOC100792257 [Glycine max]
          Length = 271

 Score =  310 bits (795), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 155/239 (64%), Positives = 192/239 (80%), Gaps = 1/239 (0%)

Query: 30  SASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRD 89
           +A RTL QE G  H++HCSRERSR AW +I+EYLMPFVE+ +Y IS +C LHPDND++RD
Sbjct: 34  NAGRTLNQEGG-DHQIHCSRERSRTAWKIIQEYLMPFVEKEKYHISKRCMLHPDNDIYRD 92

Query: 90  QEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFV 149
           QEQHK H+DINEW+C YCKKSF  EK LDQHFDNRH+NLLN+S  +CLAD+CGALHCD  
Sbjct: 93  QEQHKSHIDINEWQCRYCKKSFYEEKHLDQHFDNRHSNLLNLSESQCLADVCGALHCDHE 152

Query: 150 MNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQ 209
           MNS S +SKCNPAA AK++HLCESLA+ CFP++ GP+AS+LHE FL QFCDAH C    +
Sbjct: 153 MNSGSKKSKCNPAAAAKHKHLCESLADSCFPVSDGPAASQLHEFFLHQFCDAHSCTGSRK 212

Query: 210 PFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
           PF +G RK+ +VFY+ +SIL ++LL ++YL +YLYQR M+  TQ LKRI +  RK KPS
Sbjct: 213 PFSRGHRKKTNVFYIFVSILLVILLLLYYLYIYLYQRGMKRETQVLKRISQASRKKKPS 271


>gi|255571984|ref|XP_002526933.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223533685|gb|EEF35420.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 240

 Score =  307 bits (786), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 138/205 (67%), Positives = 166/205 (80%)

Query: 8   FTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFV 67
            T+   LL  S    L    D +ASR L+ E G AHEVHCSRERSRAAW +IEEYLMPFV
Sbjct: 16  LTNSLHLLTLSCSTSLQGSKDYAASRALQHEQGNAHEVHCSRERSRAAWKIIEEYLMPFV 75

Query: 68  EQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 127
           E+  Y++S+ CRLHPDND++RDQEQHKIH D NEWRCGYC+KSF  E++L++HFDNRH N
Sbjct: 76  EKEHYQVSTGCRLHPDNDLYRDQEQHKIHEDGNEWRCGYCRKSFYEERYLNKHFDNRHYN 135

Query: 128 LLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSA 187
           LLNVSH +CLAD+CGALHCDFVM+S + ++KCNPAA A+N+HLCESLAN CFP+N+GPSA
Sbjct: 136 LLNVSHSRCLADVCGALHCDFVMDSATRKTKCNPAAAARNKHLCESLANSCFPVNEGPSA 195

Query: 188 SRLHELFLRQFCDAHKCPRKSQPFP 212
            RL+ELFL QFCDAH C  + +PFP
Sbjct: 196 HRLNELFLHQFCDAHTCTGRQKPFP 220


>gi|294462744|gb|ADE76916.1| unknown [Picea sitchensis]
          Length = 277

 Score =  306 bits (784), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 144/259 (55%), Positives = 185/259 (71%), Gaps = 3/259 (1%)

Query: 7   IFTSIFLLLLFSPQEILIQGFDD-SASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
           +   +FL ++  P   L Q   + +A R  +     ++E HCSRERSR AW +IEEYLMP
Sbjct: 18  LLCVVFLCII--PVGGLAQELQEFAAVRAQQDSVVHSYETHCSRERSRTAWKIIEEYLMP 75

Query: 66  FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
           FVEQ  Y IS  CRLH + DMFR+QE HK H+ +NEW+CG+C+K F +EK LDQHFDNRH
Sbjct: 76  FVEQENYNISRTCRLHLEIDMFREQEPHKDHLQVNEWQCGFCRKIFISEKLLDQHFDNRH 135

Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP 185
           +NLLN+S  +CLAD+CGALHCD +  SK  ++KCNPAA  KN HLCESLAN CFP  + P
Sbjct: 136 HNLLNISRSRCLADMCGALHCDAMDKSKPIKTKCNPAAADKNCHLCESLANSCFPPQESP 195

Query: 186 SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
           SA RLH+ FLRQFCDAH C R SQPF +G  K  ++ Y ++ +L +M+LPIFY IVY++Q
Sbjct: 196 SARRLHDFFLRQFCDAHTCKRGSQPFARGSGKTTNLLYYAVCLLLMMMLPIFYCIVYVHQ 255

Query: 246 REMRGGTQELKRIPRVGRK 264
           REM    +  KRIP++ +K
Sbjct: 256 REMGKHIKNFKRIPKMNQK 274


>gi|15237477|ref|NP_198887.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
 gi|9758099|dbj|BAB08543.1| unnamed protein product [Arabidopsis thaliana]
 gi|90962968|gb|ABE02408.1| At5g40710 [Arabidopsis thaliana]
 gi|332007203|gb|AED94586.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
          Length = 272

 Score =  305 bits (782), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 150/280 (53%), Positives = 201/280 (71%), Gaps = 22/280 (7%)

Query: 1   MGRNTQIFTSIFLLLLFS----------PQEIL---IQGFDDSASRTLKQEPGRAHEVHC 47
           MGR+     S ++LL F           P  I+   ++GF D        E G  HE+HC
Sbjct: 1   MGRSNSQCPSHWILLCFFISSQFWGFSLPTSIIQQNLEGFKDP-------EDG-FHEIHC 52

Query: 48  SRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYC 107
           SRERSR AW +I+EYLMP+VE+ +Y++ S CR+H DND++R+QE+HK+  DINEWRCG+C
Sbjct: 53  SRERSRVAWKIIQEYLMPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFC 112

Query: 108 KKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKN 167
           KK+F  EK+LD+HFD+RH NLLN SHGKCL+DLCGALHCD V+++   +SKCNPAA AKN
Sbjct: 113 KKAFYEEKYLDKHFDSRHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKN 172

Query: 168 RHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSIS 227
           RHLCESLAN CFP+N+G SA+RLH+ FLRQFCDAH C   S+P  +  +K+ S+ Y+  S
Sbjct: 173 RHLCESLANSCFPVNKGSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFS 231

Query: 228 ILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKP 267
           I+ L++L ++Y  VYL++R ++  +Q+LKRI   G K KP
Sbjct: 232 IIVLVVLLLYYSFVYLFRRGLKRRSQDLKRIRHNGLKKKP 271


>gi|26452757|dbj|BAC43459.1| putative zinc finger protein [Arabidopsis thaliana]
          Length = 272

 Score =  304 bits (778), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 149/280 (53%), Positives = 201/280 (71%), Gaps = 22/280 (7%)

Query: 1   MGRNTQIFTSIFLLLLFS----------PQEIL---IQGFDDSASRTLKQEPGRAHEVHC 47
           MGR+     S ++LL F           P  I+   ++GF D        E G  H++HC
Sbjct: 1   MGRSNSQCPSHWILLCFFISSQFWGFSLPTSIIQQNLEGFKDP-------EDG-FHKIHC 52

Query: 48  SRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYC 107
           SRERSR AW +I+EYLMP+VE+ +Y++ S CR+H DND++R+QE+HK+  DINEWRCG+C
Sbjct: 53  SRERSRVAWKIIQEYLMPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFC 112

Query: 108 KKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKN 167
           KK+F  EK+LD+HFD+RH NLLN SHGKCL+DLCGALHCD V+++   +SKCNPAA AKN
Sbjct: 113 KKAFYEEKYLDKHFDSRHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKN 172

Query: 168 RHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSIS 227
           RHLCESLAN CFP+N+G SA+RLH+ FLRQFCDAH C   S+P  +  +K+ S+ Y+  S
Sbjct: 173 RHLCESLANSCFPVNKGSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFS 231

Query: 228 ILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKP 267
           I+ L++L ++Y  VYL++R ++  +Q+LKRI   G K KP
Sbjct: 232 IIVLVVLLLYYSFVYLFRRGLKRRSQDLKRIRHNGLKKKP 271


>gi|357477231|ref|XP_003608901.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
 gi|355509956|gb|AES91098.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
          Length = 241

 Score =  301 bits (770), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 145/201 (72%), Positives = 163/201 (81%), Gaps = 2/201 (0%)

Query: 28  DDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMF 87
           + S  R L QE   A EVHCSRERSR A  VIEEYL PFVE+  Y++S KCRLHP+NDMF
Sbjct: 36  ESSGERNLNQEHEHAPEVHCSRERSRVASKVIEEYLTPFVEKENYQLSRKCRLHPENDMF 95

Query: 88  RDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCD 147
           RDQE+H+I++D NEWRCGYCKKSFR EKFLDQH DNRH NLLN+SHGKCLADLCGALHCD
Sbjct: 96  RDQEEHRIYLDRNEWRCGYCKKSFREEKFLDQHLDNRHCNLLNLSHGKCLADLCGALHCD 155

Query: 148 FVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRK 207
            V NSKSSRSKCNPAA A+NRHLCESLA+ CFPI+ G SASRLHELFL QFCDAH C  K
Sbjct: 156 AVTNSKSSRSKCNPAAAARNRHLCESLADSCFPISGGLSASRLHELFLHQFCDAHSCSGK 215

Query: 208 SQPFPKGGRKQPSVFYLSISI 228
            +PF +GG+ +P   YL+  I
Sbjct: 216 HKPFSRGGKVRPE--YLTGEI 234


>gi|388514801|gb|AFK45462.1| unknown [Lotus japonicus]
          Length = 263

 Score =  272 bits (695), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 130/218 (59%), Positives = 159/218 (72%), Gaps = 12/218 (5%)

Query: 27  FDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDM 86
            +    RTL+QE    H++HCSRERSR AW +I+EYLMPFVE+ +Y IS +CRLHPDND+
Sbjct: 43  INSDVGRTLQQEQDGNHQIHCSRERSRVAWKIIQEYLMPFVEKEKYHISKRCRLHPDNDI 102

Query: 87  FRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHC 146
           +RDQE HK H+DINEW+CG+CKKSF  EK LDQHFDNRH+NLLN S  +CLAD+CGALHC
Sbjct: 103 YRDQEDHKSHIDINEWQCGFCKKSFYEEKHLDQHFDNRHSNLLNSSESRCLADVCGALHC 162

Query: 147 DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPR 206
           D  +NS S +SKCNPAA A+N+HLCESLA+ CFP+N+GP+ASRLHE FL QFCD     +
Sbjct: 163 DLEINSGSKKSKCNPAAAARNKHLCESLADSCFPVNEGPAASRLHEFFLHQFCDHTAVLQ 222

Query: 207 KSQPFPKG-GRKQ------PSVF-----YLSISILTLM 232
                P+G GR Q      P  F     Y +IS  T +
Sbjct: 223 VGNLSPEGAGRGQMCSTSWPPFFLWCCCYFTISTFTCI 260


>gi|449529367|ref|XP_004171671.1| PREDICTED: uncharacterized protein LOC101223370, partial [Cucumis
           sativus]
          Length = 203

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 120/168 (71%), Positives = 145/168 (86%), Gaps = 1/168 (0%)

Query: 25  QGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
           Q  + SA+ R L+Q      EVHCSRERSR AW +IEE+L+PF+E+  YE+S++CRLHP+
Sbjct: 36  QDEEQSATLRPLEQNEEHVDEVHCSRERSRTAWNIIEEHLLPFMEKENYEVSTQCRLHPN 95

Query: 84  NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
           ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDNRH+NLLNVSHGKCLADLCGA
Sbjct: 96  NDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHSNLLNVSHGKCLADLCGA 155

Query: 144 LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLH 191
           LHCD  M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+GPSA+RLH
Sbjct: 156 LHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINEGPSANRLH 203


>gi|147776347|emb|CAN76470.1| hypothetical protein VITISV_016787 [Vitis vinifera]
          Length = 483

 Score =  254 bits (649), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 113/156 (72%), Positives = 131/156 (83%)

Query: 59  IEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLD 118
             +YLMPFVE+  Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD
Sbjct: 33  TNQYLMPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLD 92

Query: 119 QHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRC 178
           +HFDNRH NLLNVS  KCLAD+CGALHCD VM+SK  ++KCNPAA A+N HLCESLA+ C
Sbjct: 93  KHFDNRHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGC 152

Query: 179 FPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKG 214
           FP++ GPSASRLHE FLRQFCDAH C    +PF +G
Sbjct: 153 FPVSGGPSASRLHEFFLRQFCDAHTCTGGRKPFSRG 188


>gi|413916780|gb|AFW56712.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
          Length = 222

 Score =  253 bits (645), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 118/172 (68%), Positives = 139/172 (80%), Gaps = 1/172 (0%)

Query: 31  ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
            SR L+  P  AHEVHCSRERSRAAW  I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39  GSRILQTNPD-AHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 97

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
           EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 98  EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 157

Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAH 202
             K  +SKCN AA  +NRHLCESLA+ CFPINQG +ASRLH  +    C+ H
Sbjct: 158 QFKKPKSKCNAAAATRNRHLCESLADSCFPINQGLAASRLHGKWNVSECNDH 209


>gi|357436185|ref|XP_003588368.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
 gi|355477416|gb|AES58619.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
          Length = 306

 Score =  251 bits (640), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 108/177 (61%), Positives = 139/177 (78%)

Query: 40  GRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDI 99
            R  +   +++ S  A    +EYL PFVE+ +Y IS +CRLHPDND++RDQEQHK H+DI
Sbjct: 121 ARVFQCSSTKQESSNAQARRQEYLTPFVEKEKYNISRRCRLHPDNDIYRDQEQHKSHIDI 180

Query: 100 NEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKC 159
           NEW+CGYCKK+F  EK LDQHFDNRH+NLLN+   +CLAD+CGALHCD  +NS S +SKC
Sbjct: 181 NEWQCGYCKKTFYEEKHLDQHFDNRHSNLLNLKESRCLADVCGALHCDHEINSGSKKSKC 240

Query: 160 NPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGR 216
           NPAA A+N+H+CE++A+ CFP+N+GP+ASRLHE FL QFCDAH C  + +PF +G R
Sbjct: 241 NPAAAARNKHICETIADSCFPVNEGPAASRLHEFFLHQFCDAHSCTGRGKPFSRGRR 297


>gi|168063136|ref|XP_001783530.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664965|gb|EDQ51666.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  223 bits (568), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 117/261 (44%), Positives = 160/261 (61%), Gaps = 13/261 (4%)

Query: 5   TQIFTSIFLLLLFSPQEILIQGF--DDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEY 62
             +  S+ L+L  S        F   +   R L++E GR   VHCSR RSR    ++ EY
Sbjct: 117 AHVLISLALVLCLSSFTACADKFRVSEERQRQLQEEKGR---VHCSRSRSRTVRAIVSEY 173

Query: 63  LMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFD 122
           LMPFV+  +Y +   CRLHPDND++ +QE++ I +   +W+CGYCKK FR + FLD HFD
Sbjct: 174 LMPFVKDQKYTLPKTCRLHPDNDVYHEQEENMIELRPMQWQCGYCKKIFRNQGFLDLHFD 233

Query: 123 NRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR----SKCNPAAVAKNRHLCESLANRC 178
           NRH+  L  S   CLAD CGALHCD+  NS SS+    + C  A V KNRH CE LAN C
Sbjct: 234 NRHSEKLEPSSNSCLADACGALHCDY-YNSLSSKPQIQATCKLAVVEKNRHSCEVLANTC 292

Query: 179 FPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGG--RKQPSVFYLSISILTLMLLPI 236
           FP +  P+A RL+  F RQFCDAH C +K + FP G    +  S+ Y ++++ T+++L I
Sbjct: 293 FPADHSPAAQRLNHFFKRQFCDAHTCKKKLKIFPSGSGINRNKSLIY-ALTLFTVIVLAI 351

Query: 237 FYLIVYLYQREMRGGTQELKR 257
           FY I+YL +R+     + L+R
Sbjct: 352 FYAILYLIKRDTNLLRKGLRR 372


>gi|413916778|gb|AFW56710.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
          Length = 177

 Score =  216 bits (551), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 99/140 (70%), Positives = 116/140 (82%), Gaps = 1/140 (0%)

Query: 31  ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
            SR L+  P  AHEVHCSRERSRAAW  I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39  GSRILQTNPD-AHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 97

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
           EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 98  EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 157

Query: 151 NSKSSRSKCNPAAVAKNRHL 170
             K  +SKCN AA  +NRHL
Sbjct: 158 QFKKPKSKCNAAAATRNRHL 177


>gi|302808117|ref|XP_002985753.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
 gi|300146662|gb|EFJ13331.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
          Length = 171

 Score =  200 bits (508), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 97/165 (58%), Positives = 120/165 (72%), Gaps = 4/165 (2%)

Query: 43  HEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEW 102
           H V CSR RSR A  ++EEYL+PFVE   Y +S +CRLH  ND FR+QE+ K    I++W
Sbjct: 3   HGVQCSRARSRTARHILEEYLLPFVENENYALSPECRLHASNDAFREQEREKQFYHIHDW 62

Query: 103 RCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKC 159
           RCGYC K F +E++LD HFDNRH+  LNVS   CLADLCGALHCD++      K S++KC
Sbjct: 63  RCGYCHKIFESEEYLDLHFDNRHSETLNVSRHNCLADLCGALHCDYMETKDKHKFSKNKC 122

Query: 160 NPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
           +P +V +NRHLCE LAN CFP  QG  A+RL++ FLRQFCDAH C
Sbjct: 123 SP-SVDRNRHLCEKLANSCFPPQQGAQATRLNDFFLRQFCDAHSC 166


>gi|302785357|ref|XP_002974450.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
 gi|300158048|gb|EFJ24672.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
          Length = 171

 Score =  199 bits (507), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 96/165 (58%), Positives = 120/165 (72%), Gaps = 4/165 (2%)

Query: 43  HEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEW 102
           H V CSR RSR A  ++EEYL+PFVE   Y +S +CRLH  ND FR+QE+ K    I++W
Sbjct: 3   HGVQCSRARSRTARHILEEYLLPFVENENYALSPQCRLHASNDAFREQEREKQFYHIHDW 62

Query: 103 RCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKC 159
           RCGYC K F +E++LD HFDNRH+  LNVS   CLAD+CGALHCD++      K S++KC
Sbjct: 63  RCGYCHKIFESEEYLDLHFDNRHSETLNVSRDNCLADVCGALHCDYMETKDKHKFSKNKC 122

Query: 160 NPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
           +P +V +NRHLCE LAN CFP  QG  A+RL++ FLRQFCDAH C
Sbjct: 123 SP-SVDRNRHLCEKLANSCFPPQQGAQATRLNDFFLRQFCDAHSC 166


>gi|168040762|ref|XP_001772862.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675773|gb|EDQ62264.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 235

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 69/132 (52%), Positives = 90/132 (68%), Gaps = 4/132 (3%)

Query: 45  VHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRC 104
           VHCSR RSR    ++ EYL PFVE  +Y +   CRLHP ND++R+QE++   +   +W+C
Sbjct: 50  VHCSRSRSRTVREIVHEYLTPFVEAEKYTLPESCRLHPANDIYREQEENMDELRPMQWQC 109

Query: 105 GYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSK----CN 160
            YCKK FR + FLD HFDNRH+NLL+ S   CLAD CGALHCD+  +  +S+S+    C 
Sbjct: 110 RYCKKIFRNQGFLDTHFDNRHSNLLDKSLNMCLADACGALHCDYYNSLSTSKSQKQSTCK 169

Query: 161 PAAVAKNRHLCE 172
           PA V +NRH CE
Sbjct: 170 PAVVERNRHSCE 181


>gi|428163164|gb|EKX32251.1| hypothetical protein GUITHDRAFT_91074 [Guillardia theta CCMP2712]
          Length = 297

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 76/255 (29%), Positives = 118/255 (46%), Gaps = 40/255 (15%)

Query: 23  LIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHP 82
            + G +DS SR            +C+R RSR A  ++ E   P V ++ +++   C    
Sbjct: 23  FLHGAEDSDSR----------HANCNRGRSRLARNILREEFYPHVRKLGFQLPPGCPFSE 72

Query: 83  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCG 142
           + DMF D E+HK  V  N W+C YC K F++E +L+ H+ NRH   +    G CLAD C 
Sbjct: 73  ERDMFLDNEKHKKEVRFNNWKCLYCNKVFKSESYLEHHYVNRHPETIK-EDGFCLADYCD 131

Query: 143 ALHCDFVMNS-------KSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFL 195
            L CD   N        K+S  +C+   + K RH C S+ ++CFP +   +A+RLH  F 
Sbjct: 132 VLECDMQENVIGSEGMFKTSSYRCDAKRMQKRRHRCHSVLDKCFPPHTSDAANRLHHEFE 191

Query: 196 RQFCDAHKCPRKSQPFPKGGRKQPSVFYLS----------------------ISILTLML 233
           + +C+   C           R+  ++   S                        +L  ++
Sbjct: 192 KLYCEHLSCEASPDGLGDVARRPHTLLSQSKARSEYHMKKTKHSPWRKLAIVFGVLFAII 251

Query: 234 LPIFYLIVYLYQREM 248
           L IFYL V LY+++M
Sbjct: 252 LFIFYLGVCLYRKDM 266


>gi|255077084|ref|XP_002502194.1| predicted protein [Micromonas sp. RCC299]
 gi|226517459|gb|ACO63452.1| predicted protein [Micromonas sp. RCC299]
          Length = 301

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 70/222 (31%), Positives = 105/222 (47%), Gaps = 19/222 (8%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSRERSR A GV++++  P  ++ +Y +   C      DM+ + E+HK  V   +++  Y
Sbjct: 41  CSRERSRHARGVLDDFFHPVADRNRYRLPPNCPFDRSKDMYLEHEKHKEVVRRTQYKSLY 100

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCD-----------FVMNSKSS 155
             K+F  E ++D+H DNRH + +      CLAD C  L CD             + S +S
Sbjct: 101 SDKTFYNEYYVDKHMDNRHMDKVPPGADVCLADYCEVLRCDEHQAWKHPELRKTVGSSAS 160

Query: 156 RSKCNPAAVAKNRHLCESLANRCFPINQGP-----SASRLHELFLRQFCDAHKCPRKSQP 210
           RS+C+   +   RH CE L + CFP   GP     +A+RL E F R  CD   C      
Sbjct: 161 RSRCSQERMRGLRHFCEVLMSGCFPTGGGPDGDAGTAARLREYFTRHHCDMLTCDGVRDM 220

Query: 211 FPKGGRKQPSV---FYLSISILTLMLLPIFYLIVYLYQREMR 249
           F   G          Y+ + I+    +  +YL V+   R+ R
Sbjct: 221 FGVLGSHHAVAGGGGYVLVLIVVFAAVGCYYLFVWHAVRDGR 262


>gi|145350216|ref|XP_001419510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579742|gb|ABO97803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 270

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 66/224 (29%), Positives = 112/224 (50%), Gaps = 15/224 (6%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRC 104
           CSRERSRAA  V+ E+  P  +  +Y +   C      D+F + E HK  V  ++  WR 
Sbjct: 18  CSRERSRAARTVLNEFFYPVADGARYALDLGCPFSHARDLFYEHETHKEPVRHSKLYWRS 77

Query: 105 GYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSR-SKC 159
            Y  K F++E ++D+H + RH + +  +   CLAD C  L CD    +V N + S+ ++C
Sbjct: 78  LYSGKVFKSEYYVDKHMERRHGDKIPSTADVCLADYCDVLQCDKHAEYVKNGRGSKPTRC 137

Query: 160 NPAAVAKNRHLCESLANRCFPINQGPS---ASRLHELFLRQFCDAHKCPRKSQPFP--KG 214
           +   ++  R  C  +  +CFP   G +   A +L+  F + +CD   C      F     
Sbjct: 138 DEEQMSLVRDKCHEMLTQCFPNENGSTRDNAGKLNVYFTKYYCDQMTCADVDGVFELMSA 197

Query: 215 GRKQPSV-FYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKR 257
               P V +Y++++ L   ++ ++YL VY +    R  + ++KR
Sbjct: 198 HHDSPGVGYYIALAFL-FFIVAMYYLTVYSWTNGRR-VSSDIKR 239


>gi|443688435|gb|ELT91124.1| hypothetical protein CAPTEDRAFT_220314 [Capitella teleta]
          Length = 284

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 67/212 (31%), Positives = 95/212 (44%), Gaps = 15/212 (7%)

Query: 42  AHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE 101
           A    C R+RSR    +  E ++P + + Q   S  C L  D DM+ +QEQ K     ++
Sbjct: 36  AISAECPRDRSRIVRNIFIEDVLPILAEFQVTASESCPLSADRDMYAEQEQRKFMESSSK 95

Query: 102 WRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSHGKCLADLCGALHCDFVMNSKSS----R 156
           W C +C KSF  E+ LD HF+NRH  N+L      C AD C    CD +  +  +     
Sbjct: 96  WTCNFCGKSFYDERSLDNHFENRHPKNILTGPRATCPADFCDIFRCDVISGAHRAGYWEV 155

Query: 157 SKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQF----CDAHKCPRKSQPFP 212
           + C  A +    H CE L   C P N  P A+R  +L L +F    C    C    +  P
Sbjct: 156 ALCRDADLIHTHHRCEDLLRNCLPSNI-PHATR--DLILARFEKAICGYLTCDMFWE-IP 211

Query: 213 KGGRKQP--SVFYLSISILTLMLLPIFYLIVY 242
           +     P  +  Y+ + IL    L I+Y   Y
Sbjct: 212 ENRVSNPVGTAIYIVLFILLSFGLLIYYYAAY 243


>gi|168022196|ref|XP_001763626.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685119|gb|EDQ71516.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 285

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 52/128 (40%), Positives = 69/128 (53%), Gaps = 12/128 (9%)

Query: 45  VHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRC 104
           VHCSR RSR A   I EYLM +V   +Y +   C  H DND  R+QE+    + + EW+C
Sbjct: 67  VHCSRSRSRTARTTIFEYLMLYVHSQKYTLPKNCHFHFDNDFSREQEKSMDELGLTEWQC 126

Query: 105 GYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAV 164
             CK  FR+ +FLD HFDNRH+  L+     CL+            +    ++ C PA +
Sbjct: 127 RCCKIVFRSREFLDTHFDNRHSEKLDTVVFCCLS------------SQPKKQATCKPAVM 174

Query: 165 AKNRHLCE 172
            KNRH CE
Sbjct: 175 EKNRHSCE 182


>gi|195999976|ref|XP_002109856.1| hypothetical protein TRIADDRAFT_6027 [Trichoplax adhaerens]
 gi|190587980|gb|EDV28022.1| hypothetical protein TRIADDRAFT_6027, partial [Trichoplax
           adhaerens]
          Length = 202

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 54/169 (31%), Positives = 83/169 (49%), Gaps = 12/169 (7%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR  SR A  +++  L   ++ +Q ++S  C  HP+ D+FR QE+ K +   + W C Y
Sbjct: 1   CSRINSRYARNILKRSLYDLIQSMQVQLSFDCPFHPERDLFRKQEELKDNAYQSSWTCSY 60

Query: 107 CKKSFRAEKFLDQHFDNRHNNLL-NVSHGKCLADLCGALHCDFVMNSKSSRSK------- 158
           C K F  E+FLDQH DNRH+ LL  V +  CLA+ C  L CD      ++ +K       
Sbjct: 61  CGKWFYRERFLDQHLDNRHSALLGTVMNATCLANYCDILGCDLAHVQDTTLAKGNDLWWK 120

Query: 159 ---CNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
              C    + + R  C  +A +C P     ++S +  + +   C    C
Sbjct: 121 TALCRSTQMVELRDQCLQIAEQCTP-KSSKASSGVRNIIISNICSRLTC 168


>gi|156386657|ref|XP_001634028.1| predicted protein [Nematostella vectensis]
 gi|156221106|gb|EDO41965.1| predicted protein [Nematostella vectensis]
          Length = 257

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/237 (25%), Positives = 100/237 (42%), Gaps = 21/237 (8%)

Query: 6   QIFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
           +I+  +F LL+++P    + G D                 HCSR  SR     + E+  P
Sbjct: 4   EIWPVVFRLLVWAPLYAFLVGADGE---------------HCSRYHSRLVRNALREHFFP 48

Query: 66  FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
             +Q + ++   C LH  +D+F   E +K    ++ W C +C K+F +E ++D H  NRH
Sbjct: 49  AYDQHKVDVPLSCLLHETHDVFIRHEINKEEEHVSRWTCNFCGKAFYSEHYIDMHMQNRH 108

Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSKCNPAAVAKNRHLCESLANRCFPI 181
            + L      CLAD C    C  + + K      R+ C    +A  +  CE L + C P+
Sbjct: 109 KDFLIQDSSVCLADYCDVFRCVVLNDQKKEWFWERALCKHEKMAALKRRCEVLMHGCMPV 168

Query: 182 NQGPSAS--RLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 236
             G S +  +++E      C    C    +P  +       + Y   S   +M L +
Sbjct: 169 ANGTSDTPYKVYETVWSNVCSLLTCKDYWKPPYEEMPTWRIIVYAVFSPFFVMALIV 225


>gi|281201704|gb|EFA75912.1| hypothetical protein PPL_10484 [Polysphondylium pallidum PN500]
          Length = 353

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 101/208 (48%), Gaps = 14/208 (6%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CS E S     +++    P V   +Y++   C+L+P  DM    E  KI++  + W+C  
Sbjct: 126 CSSELSNKVRAILDNIFYPVVVDHKYQLPVNCQLNPSLDMLAYLETQKINL-YSIWKCRN 184

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAK 166
           C K F  E+FLD H  N H +L+  +   CL+D C  L+C            C+   + K
Sbjct: 185 CSKQFNTEEFLDLHLKNNHASLIPSNATVCLSDFCDMLNC-----KDDPVIACDNKRMDK 239

Query: 167 NRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC-------PRKSQPFPKGGRKQP 219
            +H C+S+   CFP+NQ  S   L+  F +  CD   C       P++S        +  
Sbjct: 240 LKHHCQSIMYNCFPLNQEQS-RMLNSHFTKNICDHLTCDGYKRHIPKQSDSNQSNSGRSW 298

Query: 220 SVFYLSISILTLMLLPIFYLIVYLYQRE 247
           S+     +++T++++ +FYLIV  Y++E
Sbjct: 299 SLMKYIAALITIIIIVLFYLIVCFYRKE 326


>gi|308807625|ref|XP_003081123.1| zinc finger (ISS) [Ostreococcus tauri]
 gi|116059585|emb|CAL55292.1| zinc finger (ISS) [Ostreococcus tauri]
          Length = 241

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 106/220 (48%), Gaps = 14/220 (6%)

Query: 58  VIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEK 115
           ++EE+  P  E  + EI++ C      D+F + E+HK  V  ++  WR  Y  K F++E 
Sbjct: 1   MLEEFFYPVAEGAKTEIAATCPFSRIRDVFYEHERHKERVRHSKQYWRSLYSGKVFKSEY 60

Query: 116 FLDQHFDNRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLC 171
           ++D+H + RH + +      CLAD C  L CD    ++   +    +C+ A +A  +  C
Sbjct: 61  YVDKHMERRHGDKIPPGANVCLADHCEVLQCDRHASYMNGERGKLVRCDAAHMADVQAEC 120

Query: 172 ESLANRCFPINQGPSASR---LHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV---FYLS 225
             +   CFP   G +A R   +H+   + +CD   C      F        +    +Y+S
Sbjct: 121 RKMLKSCFPHESGRAAERSRKVHDYLNKYYCDQLTCVGVGGVFELMSTHHETAGASYYVS 180

Query: 226 ISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKA 265
           +  + ++L+ ++Y +VY +    RG + ++KR  R  R +
Sbjct: 181 LGFV-MLLVALYYFLVYSWV-NGRGVSLDVKRAHRRARAS 218


>gi|170038289|ref|XP_001846984.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881843|gb|EDS45226.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 616

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 94/183 (51%), Gaps = 4/183 (2%)

Query: 7   IFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPG-RAHEVHCSRERSRAAWGVIEEYLMP 65
           I T+++ + +  P++ L+         ++   PG    ++ CSR+ SR    +++   +P
Sbjct: 62  IITTVYAVAVSPPRKPLMMSVIVICFLSIIPWPGPSVFKISCSRDSSRVVRKIVQSKWLP 121

Query: 66  FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
            +E+ Q ++  +C  HP  D+F  Q+  K     ++W CG+C KSF  EK LD HF++RH
Sbjct: 122 ILEKYQVKLPLECPFHPLRDIFGPQQTAKKQHRPSQWTCGFCGKSFFEEKHLDMHFEHRH 181

Query: 126 NNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFPIN 182
              +N++    CLAD C  + C+ ++   ++ S   +P  ++ +  +  ES A R     
Sbjct: 182 KGNINMAEDAICLADYCDMMRCEVLIAKDATLSFGVDPNMMSTDIEVWSESTAYRTALTT 241

Query: 183 QGP 185
            GP
Sbjct: 242 SGP 244


>gi|298711019|emb|CBJ26414.1| C2H2 zinc finger protein [Ectocarpus siliculosus]
          Length = 232

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 36/183 (19%)

Query: 42  AHEVHCSRERSRAAWGVIEEYLMPFVEQ-VQYEISSKCRLHPDNDMFRDQEQHK--IHVD 98
           A+E  CSRE SR A  +I + + PF+++     I   C LHP+ D+F+DQE HK  +   
Sbjct: 3   ANESSCSREHSRQARKIIAKSVYPFLDKDPALAIPLACPLHPELDVFKDQELHKRLVPGK 62

Query: 99  INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH------------GKCLADLCGALHC 146
           ++ ++CGYC+K F+ E ++D+H DN+H + LN++             G+CL DLC  L C
Sbjct: 63  LDTFQCGYCQKQFKTEFYMDRHMDNKHADRLNLAGSDAIGHGIEPAPGRCLGDLCPELGC 122

Query: 147 -DFVMNSKSSR--------------------SKCNPAAVAKNRHLCESLANRCFPINQGP 185
            D+  +S S                        C+PA + + R  C +L + CF  + G 
Sbjct: 123 GDYAADSCSVEIHAGGGSSLGTGAGTIRRGCGACDPADMERRRRRCRALFHGCFDDSTGS 182

Query: 186 SAS 188
           + S
Sbjct: 183 AGS 185


>gi|312378286|gb|EFR24907.1| hypothetical protein AND_10215 [Anopheles darlingi]
          Length = 630

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/145 (31%), Positives = 78/145 (53%), Gaps = 3/145 (2%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ CSRE SR    ++    +P +E+ Q ++  +C  HP  D+F  Q+  K     ++W 
Sbjct: 45  KISCSRESSRVVRKIVHSRWLPILEKYQVKLPLECPFHPMRDIFGPQQSAKKQHRPSQWT 104

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNP 161
           CG+C KSF  E+ LD HF+NRH   +N++    CLAD C  + C+ ++   ++ +   +P
Sbjct: 105 CGFCGKSFFEEQHLDMHFENRHKTNINMAEDAICLADYCDMMRCEVLIAKDATLAFGGDP 164

Query: 162 AAVAKNRHL-CESLANRCFPINQGP 185
            AV+ +  +  E+ A R      GP
Sbjct: 165 NAVSTDIEVWSEATAYRTALTTAGP 189


>gi|384247793|gb|EIE21279.1| hypothetical protein COCSUDRAFT_57182 [Coccomyxa subellipsoidea
           C-169]
          Length = 220

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 53/183 (28%), Positives = 89/183 (48%), Gaps = 19/183 (10%)

Query: 100 NEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR--- 156
           N + C  CK+ F +E+  ++H+D +H+  ++     CLAD C  LHCD +  + +SR   
Sbjct: 20  NRFVCSLCKERFSSEEAWERHYDRKHHIHIHPGRDVCLADYCEVLHCDKINLTPTSRQLR 79

Query: 157 ------------SKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
                       + C  +A  + R  C  LA  CFP + GP+A++LH LF+   C AH C
Sbjct: 80  AQQAGDPTLAPHAPCRESAAEQLRESCRDLAAACFPPDAGPAAAKLHRLFVEGVCAAHTC 139

Query: 205 PRKSQP---FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 261
             + Q          ++ ++ Y  +  L L  + IFY  +++Y  + R G+     + R+
Sbjct: 140 DLELQQDLLVQLAASEKGTLGYKLLIGLALGAMAIFYACLWVYH-QYRSGSLSRGDVRRL 198

Query: 262 GRK 264
            RK
Sbjct: 199 RRK 201


>gi|158296785|ref|XP_317135.4| AGAP008328-PA [Anopheles gambiae str. PEST]
 gi|157014878|gb|EAA12581.4| AGAP008328-PA [Anopheles gambiae str. PEST]
          Length = 410

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 54/179 (30%), Positives = 85/179 (47%), Gaps = 17/179 (9%)

Query: 39  PG-RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
           PG    ++ CSRE SR    ++    +P +E+ Q ++  +C  HP  D+F  Q+  K   
Sbjct: 20  PGPSVFKISCSRESSRVVRKIVHSRWLPILEKYQVKLPLECPFHPLRDIFGPQQSAKKQH 79

Query: 98  DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSR 156
             ++W CG+C KSF  EK LD HF+NRH   +N +    CLAD C  + C+ ++   ++ 
Sbjct: 80  RPSQWTCGFCGKSFFEEKHLDMHFENRHRTNINTAEDAVCLADYCDMMRCEVLIAKDATL 139

Query: 157 S-KCNPAAVAKNRHL-CESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPK 213
           +   +P AV+ +  +  E+ A R      GP              D  K P +    PK
Sbjct: 140 AFGSDPNAVSTDIEVWSEATAYRTALSTSGPR-------------DVAKVPERKSLLPK 185


>gi|383857833|ref|XP_003704408.1| PREDICTED: uncharacterized protein LOC100876283 [Megachile
           rotundata]
          Length = 440

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ C R+R+     ++++  MP +++ Q E+  +C  H   D+FR Q++ K     ++W 
Sbjct: 25  KISCPRDRASVVRRIVQKRWMPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 84

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
           CG C KSF AEK LD HFDNRH + +N +    CLAD C  + CD + N
Sbjct: 85  CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 133


>gi|66522619|ref|XP_393929.2| PREDICTED: hypothetical protein LOC410449 [Apis mellifera]
          Length = 454

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ C R+R+     ++++  MP +++ Q E+  +C  H   D+FR Q++ K     ++W 
Sbjct: 25  KISCPRDRASVVRRIVQKRWMPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 84

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
           CG C KSF AEK LD HFDNRH + +N +    CLAD C  + CD + N
Sbjct: 85  CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 133


>gi|380017517|ref|XP_003692701.1| PREDICTED: uncharacterized protein LOC100864708 [Apis florea]
          Length = 454

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ C R+R+     ++++  MP +++ Q E+  +C  H   D+FR Q++ K     ++W 
Sbjct: 25  KISCPRDRASVVRRIVQKRWMPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRSSQWT 84

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
           CG C KSF AEK LD HFDNRH + +N +    CLAD C  + CD + N
Sbjct: 85  CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 133


>gi|340723433|ref|XP_003400094.1| PREDICTED: hypothetical protein LOC100643092 [Bombus terrestris]
          Length = 453

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ C R+R+     ++++  MP +++ Q E+  +C  H   D+FR Q++ K     ++W 
Sbjct: 28  KISCPRDRASVVRRIVQKRWMPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 87

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
           CG C KSF AEK LD HFDNRH + +N +    CLAD C  + CD + N
Sbjct: 88  CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 136


>gi|350427747|ref|XP_003494865.1| PREDICTED: hypothetical protein LOC100742329 [Bombus impatiens]
          Length = 450

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ C R+R+     ++++  MP +++ Q E+  +C  H   D+FR Q++ K     ++W 
Sbjct: 28  KISCPRDRASVVRRIVQKRWMPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 87

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
           CG C KSF AEK LD HFDNRH + +N +    CLAD C  + CD + N
Sbjct: 88  CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 136


>gi|307188422|gb|EFN73179.1| hypothetical protein EAG_05305 [Camponotus floridanus]
          Length = 441

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 63/109 (57%), Gaps = 1/109 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ C R+R+     ++ +  MP +++ Q E+  +C  H   D+FR Q++ K     ++W 
Sbjct: 19  KISCPRDRASVVRRIVHKRWMPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 78

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
           CG C KSF AEK LD HFDNRH + +N +    CLAD C  + CD + N
Sbjct: 79  CGLCGKSFYAEKHLDVHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 127


>gi|270007090|gb|EFA03538.1| hypothetical protein TcasGA2_TC013541 [Tribolium castaneum]
          Length = 423

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 2/115 (1%)

Query: 39  PG-RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
           PG    ++ C R+ +R    +I+   +P +E+ +  +  +C  HP  D+F  Q+  K   
Sbjct: 22  PGPTVFKISCPRDNARIVRKIIQNKWIPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQH 81

Query: 98  DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
             ++W CG C KSF  EK+LD HFDNRH   +N++    CLAD C  + CD ++ 
Sbjct: 82  RPSQWTCGLCGKSFFEEKYLDMHFDNRHKGFINMAEDAVCLADYCDIMRCDVLIT 136


>gi|91082837|ref|XP_969705.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 350

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 2/115 (1%)

Query: 39  PG-RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
           PG    ++ C R+ +R    +I+   +P +E+ +  +  +C  HP  D+F  Q+  K   
Sbjct: 22  PGPTVFKISCPRDNARIVRKIIQNKWIPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQH 81

Query: 98  DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMN 151
             ++W CG C KSF  EK+LD HFDNRH   +N++    CLAD C  + CD ++ 
Sbjct: 82  RPSQWTCGLCGKSFFEEKYLDMHFDNRHKGFINMAEDAVCLADYCDIMRCDVLIT 136


>gi|226481637|emb|CAX73716.1| Zinc finger, C2H2-type domain-containing protein [Schistosoma
           japonicum]
          Length = 254

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 89/205 (43%), Gaps = 10/205 (4%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CS  +SR A  ++ E + P  +     ++ +C  +P +D++   EQ K  +   +W C  
Sbjct: 26  CSISQSRLARSILNELVFPLFQYANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQL 85

Query: 107 CKKSFRAEKFLDQHFDNRHN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSK 158
           C K F  E+  D H  NRH  N  +++   CL+  C  L C       D+        + 
Sbjct: 86  CGKRFYTERSFDLHIGNRHKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEAL 145

Query: 159 CNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQ 218
           C+P +       CE + N+C P+N   S  +L +L     CD   C R     P    K 
Sbjct: 146 CDPLSFEDISKQCEDVINKCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKT 203

Query: 219 PSVFYLSISILTLMLLPIFYLIVYL 243
           P +     +++    L I+Y ++++
Sbjct: 204 PVLQKFLTALIVTAGLLIYYSVIFM 228


>gi|56754925|gb|AAW25645.1| SJCHGC03203 protein [Schistosoma japonicum]
          Length = 254

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 93/206 (45%), Gaps = 12/206 (5%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CS  +SR A  ++ E + P  +     ++ +C  +P +D++   EQ K  +   +W C  
Sbjct: 26  CSISQSRLARSILNELVFPLFQFANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQL 85

Query: 107 CKKSFRAEKFLDQHFDNRHN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSK 158
           C K F  E+  D H  NRH  N  +++   CL+  C  L C       D+        + 
Sbjct: 86  CGKRFYTERSFDLHIGNRHKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEAL 145

Query: 159 CNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQ 218
           C+P +       CE + N+C P+N   S  +L +L     CD   C R     P    K 
Sbjct: 146 CDPLSFEDISKQCEDVINKCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKT 203

Query: 219 PSV-FYLSISILTLMLLPIFYLIVYL 243
           P +  +L+  I+T  LL I+Y ++++
Sbjct: 204 PVLQKFLTAFIVTAGLL-IYYSVIFM 228


>gi|157134250|ref|XP_001663208.1| hypothetical protein AaeL_AAEL013023 [Aedes aegypti]
 gi|108870545|gb|EAT34770.1| AAEL013023-PA [Aedes aegypti]
          Length = 1069

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 65/115 (56%), Gaps = 1/115 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ CSR+ SR    +++   +P +E+ Q  +  +C  HP  D+F  Q+  K     ++W 
Sbjct: 41  KISCSRDSSRVVRKIVQSKWLPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWT 100

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS 157
           CG C KSF  EK LD HF+NRH + +N++    CLAD C  + C+ ++   ++ S
Sbjct: 101 CGLCGKSFFEEKHLDMHFENRHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLS 155



 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 60/109 (55%), Gaps = 1/109 (0%)

Query: 50  ERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKK 109
           E SR    +++   +P +E+ Q  +  +C  HP  D+F  Q+  K     ++W CG C K
Sbjct: 557 ESSRVVRKIVQSKWLPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWTCGLCGK 616

Query: 110 SFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS 157
           SF  EK LD HF+NRH + +N++    CLAD C  + C+ ++   ++ S
Sbjct: 617 SFFEEKHLDMHFENRHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLS 665


>gi|328865036|gb|EGG13422.1| hypothetical protein DFA_11183 [Dictyostelium fasciculatum]
          Length = 306

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 55/216 (25%), Positives = 104/216 (48%), Gaps = 24/216 (11%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CS   S     ++   +  FV+  +Y++ S C  + + DM    +  K   + +++RC  
Sbjct: 75  CSTPLSNRVREILNHVVYSFVDYNEYKLPSTCVFNKNLDMLAVFQSLKFLQNDHQYRCKS 134

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHC--DFVMNSKSSRSKCNPAAV 164
           C K F++E ++DQH  N H +L+  +   CL D C  L+C  D++++       C+   +
Sbjct: 135 CSKQFKSEDYIDQHLKNNHADLIPSNATICLGDYCEMLNCNNDYIIS-------CDERKM 187

Query: 165 AKNRHLCESLANRCFPINQGPSASR-LHELFLRQFCDAHKCPRKSQPFPKGG-------- 215
            K ++ C+ +   CFP    PS ++  ++ F    CD   C   S    + G        
Sbjct: 188 EKLKYACQGIMYNCFP----PSVNKNYNKHFNALICDKLTCKGGSSTTGQDGIIRDADIE 243

Query: 216 --RKQPSVFYLSISILTLMLLPIFYLIVYLYQREMR 249
             + Q S+     +++ L++L IFYLI++ +++E R
Sbjct: 244 EKQHQWSLMKYLSALVILVILIIFYLILWFFKKETR 279


>gi|242021247|ref|XP_002431057.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212516286|gb|EEB18319.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 428

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 71/134 (52%), Gaps = 13/134 (9%)

Query: 39  PGRA-HEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
           PG+   ++ C R+ +     +I+   +P +E+ Q +I  +C  HP  D+F  Q+  K   
Sbjct: 17  PGQTVFKISCPRDSASVVRKLIQRKWIPILEKYQVKIPVECPFHPIRDIFGPQQAAKHQF 76

Query: 98  DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSR 156
             ++W CG+C KSF  E++L+ HFDN H N +NV+    CLAD C  + C+ ++  +   
Sbjct: 77  RSSQWSCGFCGKSFYEERYLELHFDNIHKNQINVAEDAVCLADYCDIMRCEVLVTQE--- 133

Query: 157 SKCNPAAVAKNRHL 170
                    KN+HL
Sbjct: 134 --------IKNKHL 139


>gi|390365205|ref|XP_003730770.1| PREDICTED: uncharacterized protein LOC100893077 [Strongylocentrotus
           purpuratus]
          Length = 233

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 94/195 (48%), Gaps = 14/195 (7%)

Query: 75  SSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSH 133
            ++C L P+ +M+ D E+HK     N W C YC KSF  E+FLD+H+DNRH + +L    
Sbjct: 9   GTECLLKPERNMYWDDERHKWSEIANNWNCLYCGKSFYNEQFLDKHYDNRHGSTVLTGPE 68

Query: 134 GKCLADLCGALHCDFVMNSKSS----RSKCNPAAVAKNRHLCESLANRCFP--INQGPSA 187
             CLAD C  L CD + ++K +     + C    +   R  C+++   C P  I      
Sbjct: 69  AVCLADHCDYLRCDIIASTKKTSFWEEALCKQKDLLVLRKKCQNIMKSCIPSHITDAEKN 128

Query: 188 SRLHELFLRQFCDAHKCPRKSQ-PFPKGGRKQPSVFYLSISILTLMLLPIFYLI----VY 242
           S L +L     C    C +  + P+ +    + + ++++  I T+    ++Y++    +Y
Sbjct: 129 SLLDKL-DDTVCSFLTCEKYWENPYKEVPPSRTAAYFIA-GIFTVFGFIMYYVMACSHLY 186

Query: 243 LYQREMRGGTQELKR 257
             ++E  G +    R
Sbjct: 187 ADEKEFLGDSDGFVR 201


>gi|195388338|ref|XP_002052837.1| GJ17779 [Drosophila virilis]
 gi|194149294|gb|EDW64992.1| GJ17779 [Drosophila virilis]
          Length = 625

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 82/187 (43%), Gaps = 14/187 (7%)

Query: 39  PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
           P    ++ CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K    
Sbjct: 18  PSNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPLRDVFAPRQDEKQRDR 77

Query: 99  INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRS 157
             +W C  C KSF  EK+LD HFD RH +++N      CLAD C  + C+      +S  
Sbjct: 78  PTQWTCRLCGKSFYQEKYLDLHFDTRHKHIINEAEDAVCLADFCDIMRCEVFQTEDASSL 137

Query: 158 KCNPA-----------AVAKNRHLCESLAN--RCFPINQGPSASRLHELFLRQFCDAHKC 204
           K               ++ +N  L ++ A      P     SA+R  ++  RQ       
Sbjct: 138 KFGDQHIVTDIEVWGDSLGQNSALAKANAAYLSLIPRTSTLSATRAAKVQNRQLLQEKPS 197

Query: 205 PRKSQPF 211
           P K QP 
Sbjct: 198 PSKEQPL 204


>gi|195118254|ref|XP_002003655.1| GI21701 [Drosophila mojavensis]
 gi|193914230|gb|EDW13097.1| GI21701 [Drosophila mojavensis]
          Length = 639

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 64/121 (52%), Gaps = 1/121 (0%)

Query: 39  PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
           P    ++ CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  +++ K    
Sbjct: 18  PSNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPLRDVFAPRQEEKQRDR 77

Query: 99  INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS 157
             +W C  C KSF  EK+LD HFD RH +++N +    CLADLC  + C+      +S  
Sbjct: 78  PTQWTCRLCGKSFYQEKYLDLHFDTRHKHIINEAEDSVCLADLCDIMRCEVFQTEDASSL 137

Query: 158 K 158
           K
Sbjct: 138 K 138


>gi|357621060|gb|EHJ73029.1| hypothetical protein KGM_12603 [Danaus plexippus]
          Length = 425

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 1/108 (0%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           ++ CSR+ S+    VIE   +P +E+ Q ++  +C  HP  D+F  Q   K     ++W 
Sbjct: 28  KITCSRDNSKIVRKVIENKWLPVLERFQVKLPLECPFHPARDIFAPQHAAKQQHRSSQWT 87

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVM 150
           C +C KSF  E+ LD HFD RH + +N      CLAD C  + C  ++
Sbjct: 88  CAFCGKSFYEERHLDTHFDQRHRHQINKAEDAVCLADYCDIMRCQVLV 135


>gi|195032746|ref|XP_001988553.1| GH11227 [Drosophila grimshawi]
 gi|193904553|gb|EDW03420.1| GH11227 [Drosophila grimshawi]
          Length = 635

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 61/121 (50%), Gaps = 1/121 (0%)

Query: 39  PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
           P    ++ CSR+ SR    ++     P +++ Q ++   C LHP  D+F  ++  K    
Sbjct: 18  PSNVIKLLCSRDNSRLVRKIVRSKWTPILDKHQVKLPLDCPLHPLRDVFAPRQDEKQRDR 77

Query: 99  INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS 157
             +W C  C KSF  EKFLD HFD RH +++N +    CLAD C  + C+      +S  
Sbjct: 78  PTQWTCRRCGKSFYQEKFLDLHFDTRHKHIINEAEDSVCLADFCDIMRCEVFQTEDASSL 137

Query: 158 K 158
           K
Sbjct: 138 K 138


>gi|303280527|ref|XP_003059556.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459392|gb|EEH56688.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 324

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 78/192 (40%), Gaps = 45/192 (23%)

Query: 58  VIEEYLMPFVEQ--VQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEK 115
           V+++Y  P V++    +E+   C      DM+ +QE+HK  V   +W+  Y +K FR E 
Sbjct: 44  VLDDYFYPVVDRHAGSFEVPPACPFQRSKDMYLEQEKHKERVRRTQWKSLYSEKVFRTEW 103

Query: 116 FLDQHFDNRHNNLLNVSHGKCLADLCGALHCD-------------FVMNSKSSRSK---- 158
            +D+H DNRH + +      CLAD C  L CD                    SRSK    
Sbjct: 104 HVDKHMDNRHADKIPPGADVCLADYCAVLRCDEHHAHRNPDDPSRSGSRGAPSRSKGGGG 163

Query: 159 ------------CNPAAVAKNRHLCESLANRCFPINQGPSA--------------SRLHE 192
                       C+ + +A  RH CE+L + CFP +    +               RL  
Sbjct: 164 GGAGARRARRERCDESKLAGARHFCETLMHACFPSDDDAESDGDRGGGARGHGAAGRLRA 223

Query: 193 LFLRQFCDAHKC 204
            F R  CD   C
Sbjct: 224 YFTRHHCDQLTC 235


>gi|195437634|ref|XP_002066745.1| GK24396 [Drosophila willistoni]
 gi|194162830|gb|EDW77731.1| GK24396 [Drosophila willistoni]
          Length = 754

 Score = 76.6 bits (187), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 58/113 (51%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPMECPLHPFRDIFAPRQDSKQRDRPTQWTCRR 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK+LD HFD RH  L+N V    CL+D C  + CD      ++  K
Sbjct: 91  CGKSFYQEKYLDLHFDIRHETLINEVEDSICLSDFCDIMRCDVFQTEDATSLK 143


>gi|442624983|ref|NP_001259827.1| CG4133, isoform C [Drosophila melanogaster]
 gi|440213075|gb|AGB92364.1| CG4133, isoform C [Drosophila melanogaster]
          Length = 583

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 1/121 (0%)

Query: 39  PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
           P     + CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K    
Sbjct: 23  PTNVIRILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDR 82

Query: 99  INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRS 157
             +W C  C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  
Sbjct: 83  PTQWTCRKCGKSFYQEKHLDLHFDTRHKSIINEAEDAVCLADFCDIMRCEVFETEDASSL 142

Query: 158 K 158
           K
Sbjct: 143 K 143


>gi|195575557|ref|XP_002077644.1| GD23028 [Drosophila simulans]
 gi|194189653|gb|EDX03229.1| GD23028 [Drosophila simulans]
          Length = 667

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  K
Sbjct: 91  CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|194853511|ref|XP_001968176.1| GG24723 [Drosophila erecta]
 gi|190660043|gb|EDV57235.1| GG24723 [Drosophila erecta]
          Length = 669

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  K
Sbjct: 91  CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|195470274|ref|XP_002087433.1| GE16858 [Drosophila yakuba]
 gi|194173534|gb|EDW87145.1| GE16858 [Drosophila yakuba]
          Length = 669

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  K
Sbjct: 91  CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|194766505|ref|XP_001965365.1| GF20663 [Drosophila ananassae]
 gi|190617975|gb|EDV33499.1| GF20663 [Drosophila ananassae]
          Length = 701

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRR 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  K
Sbjct: 91  CGKSFYQEKHLDLHFDTRHKHIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|442624981|ref|NP_001259826.1| CG4133, isoform B [Drosophila melanogaster]
 gi|440213074|gb|AGB92363.1| CG4133, isoform B [Drosophila melanogaster]
          Length = 642

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  K
Sbjct: 91  CGKSFYQEKHLDLHFDTRHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|195350111|ref|XP_002041585.1| GM16746 [Drosophila sechellia]
 gi|194123358|gb|EDW45401.1| GM16746 [Drosophila sechellia]
          Length = 652

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  K
Sbjct: 91  CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|19920466|ref|NP_608526.1| CG4133, isoform A [Drosophila melanogaster]
 gi|7296200|gb|AAF51492.1| CG4133, isoform A [Drosophila melanogaster]
 gi|18447493|gb|AAL68309.1| RE51073p [Drosophila melanogaster]
 gi|220948886|gb|ACL86986.1| CG4133-PA [synthetic construct]
          Length = 667

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K      +W C  
Sbjct: 31  CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
           C KSF  EK LD HFD RH +++N      CLAD C  + C+      +S  K
Sbjct: 91  CGKSFYQEKHLDLHFDTRHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|326435960|gb|EGD81530.1| hypothetical protein PTSG_02249 [Salpingoeca sp. ATCC 50818]
          Length = 334

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 70/162 (43%), Gaps = 28/162 (17%)

Query: 41  RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISS------------------KCRLHP 82
           +A + +C R  SR   GV+ + ++P +      +SS                   C L P
Sbjct: 29  QAADFNCDRATSRRVRGVLTQSILPLLGARGLHLSSPAKHNNDSRAEPGSALTPSCPLDP 88

Query: 83  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK--CLADL 140
             D++  Q++H+      +  C YC KSFR E FLD H  NRH   L V      CLAD 
Sbjct: 89  TRDVYARQDEHRDMEGTTKLTCLYCGKSFRGEPFLDLHMHNRHAGELLVPDTGPLCLADY 148

Query: 141 CGALHCDFVMN--------SKSSRSKCNPAAVAKNRHLCESL 174
           C A  C+  +         S     KC+P A+++ +  CE L
Sbjct: 149 CDATMCETQLEYRQFARLFSPEHGGKCDPDAMSRRKESCERL 190


>gi|195147218|ref|XP_002014577.1| GL18880 [Drosophila persimilis]
 gi|194106530|gb|EDW28573.1| GL18880 [Drosophila persimilis]
          Length = 683

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 57/110 (51%), Gaps = 1/110 (0%)

Query: 39  PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
           P    ++ CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K    
Sbjct: 23  PTNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDR 82

Query: 99  INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCD 147
             +W C  C KSF  E+ LD HFD RH  ++N +    CLAD C  + C+
Sbjct: 83  PTQWTCRRCGKSFYQERHLDLHFDTRHKTIINEAMDSICLADFCDIMRCE 132


>gi|198473764|ref|XP_001356431.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
 gi|198138098|gb|EAL33495.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
          Length = 684

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 60/121 (49%), Gaps = 1/121 (0%)

Query: 39  PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
           P    ++ CSR+ SR    ++     P +++ Q ++  +C LHP  D+F  ++  K    
Sbjct: 23  PTNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDR 82

Query: 99  INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS 157
             +W C  C KSF  E+ LD HFD RH  ++N +    CLAD C  + C+      ++  
Sbjct: 83  PTQWTCRRCGKSFYQERHLDLHFDTRHKTIINEAMDSICLADFCDIMRCEVFETEDATSL 142

Query: 158 K 158
           K
Sbjct: 143 K 143


>gi|449682242|ref|XP_004210029.1| PREDICTED: uncharacterized protein LOC100205425 [Hydra
            magnipapillata]
          Length = 1324

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 57/244 (23%), Positives = 101/244 (41%), Gaps = 33/244 (13%)

Query: 7    IFTSIFLLLL-----FSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEE 61
            +F  IF  LL     FSP+ +L   +  S+    K+        +CS +RS     ++ +
Sbjct: 1053 LFCYIFFSLLDKKRNFSPRNML---WSKSSRIDFKR--------NCSLQRSALVSNILSQ 1101

Query: 62   YLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHF 121
             L P  E+ Q +I  +C    + +++R  E  K  +     +C +C K F  E +  +H 
Sbjct: 1102 TLYPIYEKYQLKIPRRCIFLQERNIYRGVEMGKTELSGGHIKCEFCGKLFENENYFYKHM 1161

Query: 122  DNRHNNLLNVSHG-KCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLCESLAN 176
            D +H N+   +    CL+D C    CD    + +N       CN   V K +  CE +  
Sbjct: 1162 DRKHENVYVFNKDFVCLSDYCDIFRCDLHEMYPVNP-LDFDVCNKKDVRKLQRKCEKIIY 1220

Query: 177  RCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 236
             C P         +++   +  C    C      F     +  S++Y++     LML+P 
Sbjct: 1221 TCLPTEM---QHYVYDSLTKSLCAPLNCNNFYADF-----EDVSIWYIA---FMLMLVPA 1269

Query: 237  FYLI 240
             +++
Sbjct: 1270 TFIL 1273


>gi|307110360|gb|EFN58596.1| hypothetical protein CHLNCDRAFT_140777 [Chlorella variabilis]
          Length = 192

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 72/176 (40%), Gaps = 40/176 (22%)

Query: 74  ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 133
           + + C L P +D +R QE  K  +    WRC  C K F AE+ +D H  NRH +L     
Sbjct: 1   MPAGCPLSPASDRYRHQEAAKETLAQGRWRCILCGKRFTAERHIDLHLANRHAHLEEQQQ 60

Query: 134 GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP-SASRLHE 192
                     L  D                       C  LA RCFP + G  SA RLH 
Sbjct: 61  ----------LQED-----------------------CLELAARCFPPHGGDGSAQRLHA 87

Query: 193 LFLRQFCDAHKCPRKSQ-----PFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 243
           L   + C AH C  + +        +  R+    +  +++   L++L  F+L+++L
Sbjct: 88  LLAGRLCKAHTCDTRRKRSRTLAVAQAHRRWRGRWSFTLAA-GLVVLATFWLLLWL 142


>gi|256083558|ref|XP_002578009.1| hypothetical protein [Schistosoma mansoni]
 gi|350644825|emb|CCD60456.1| hypothetical protein Smp_067940 [Schistosoma mansoni]
          Length = 129

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 51/121 (42%), Gaps = 8/121 (6%)

Query: 78  CRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSHGKC 136
           C  HP +D++   EQ K  +   +W C  C K F  E   D H  NRH  N  + S   C
Sbjct: 5   CPFHPKHDIYAIHEQMKNKISDYDWECQMCGKRFYTENTFDLHIGNRHETNAYSTSRTIC 64

Query: 137 LADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASR 189
           L+  C  L C       D+        + C+P +       CE + N+C P     S+++
Sbjct: 65  LSSYCPLLRCSVLKPDVDYGYQVFWDEALCDPKSFEAISKQCEDILNKCIPSGNDSSSTQ 124

Query: 190 L 190
           L
Sbjct: 125 L 125


>gi|391348561|ref|XP_003748515.1| PREDICTED: uncharacterized protein LOC100905503 [Metaseiulus
           occidentalis]
          Length = 505

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 56/228 (24%), Positives = 86/228 (37%), Gaps = 35/228 (15%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           C R  S     V+   L P +++ +  +   C  HP  D+     Q   +   + WRC  
Sbjct: 34  CDRHSSAIVRHVVYTRLQPVLDRYRVSMPEDCPFHPARDVLV-WPQRDANSITDVWRCPV 92

Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGK---------CLADLCGALHCDFVMNS----- 152
           C KSF  E+ L QHF           H K         CL   C  + CD +        
Sbjct: 93  CGKSFFGEENLTQHFHAHTGTRGGTQHVKRHITEGDRVCLGTFCDIMRCDVIERQLSEAL 152

Query: 153 ---KSSRSKCNPAAVAK-NRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK----- 203
               SSR   +P   A  NR     L NRC  + Q  +   L+ L ++++ D  +     
Sbjct: 153 TPRTSSRKTESPKPKADCNRQSMAVLENRCKAVMQQCTIGLLNVLSVKEYKDVEEALHQN 212

Query: 204 ------CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIF--YLIVYL 243
                 C R  +  P+  R  P   +L+ +I  L+    F  Y +V++
Sbjct: 213 ICSYLTCERYWEEGPRDTRFSP---FLASTIGFLIASGFFGCYWVVWI 257


>gi|358337329|dbj|GAA55703.1| hypothetical protein CLF_108755 [Clonorchis sinensis]
          Length = 463

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 3/97 (3%)

Query: 56  WGVIEEYLM--PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRA 113
           W    ++L+  P  +     + ++C  +P  D++  QE  K      +W+C  C K F +
Sbjct: 262 WVPTTDFLVFFPLYDFHHVPVPNECPWNPHLDIYNLQELMKTQNSARDWQCNQCWKRFYS 321

Query: 114 EKFLDQHFDNRHNNLLNVS-HGKCLADLCGALHCDFV 149
           E  LD H DNRH + +    +  CLA  C  L CD +
Sbjct: 322 EHALDLHMDNRHRSTVYTGPNATCLALFCPLLRCDVL 358


>gi|412990909|emb|CCO18281.1| predicted protein [Bathycoccus prasinos]
          Length = 305

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 63/269 (23%), Positives = 99/269 (36%), Gaps = 51/269 (18%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQE-----QHKIHVDINE 101
           C + +SR A   +EE+  P     + E+S +C      D++R+ E     +H       E
Sbjct: 17  CDQRKSRTARQALEEFFFPLFTNEKEEVSERCPFKKSRDVYREIESRKTRRHNKRKKEEE 76

Query: 102 WRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNS-------K 153
           W+     K F++E  +D +   R       + G  CLAD C  L CD    S        
Sbjct: 77  WK-DVDGKVFKSEADIDAYLSGR---FFKSTKGDVCLADYCDVLQCDEYYESLETSSSSS 132

Query: 154 SSRSKCNPAAVAKNRHLCESLANRCFPIN-----QGPSASRLHELFLRQFCDAHKCPR-- 206
           +S  + + AA  +    C +L   CFP       Q  +A R   +F   F   H+  R  
Sbjct: 133 TSSGEKDYAAKTRAGAHCYALTKTCFPERSESAVQEKNARRFESIFCDTFVGLHQDGRNA 192

Query: 207 ----KSQPFPKGGRKQPS--------------------VFYLSISILTLMLLPIFYLIVY 242
               + Q +   G  + +                      +L IS L   L  IFYL+ +
Sbjct: 193 FAHIRKQEYKYPGHVRHANTNIDASASPSSSSSSSSSSSSFLIISTLFTFLGFIFYLLFF 252

Query: 243 L---YQREMRGGTQELKRIPRVGRKAKPS 268
           L     R   G   + ++  R   K K +
Sbjct: 253 LAPSINRRASGSDLKARKGRRTASKKKST 281


>gi|167519981|ref|XP_001744330.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777416|gb|EDQ91033.1| predicted protein [Monosiga brevicollis MX1]
          Length = 272

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 57/142 (40%), Gaps = 11/142 (7%)

Query: 47  CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
           C++  SR A   +E+ L PF        S  C    ++D+F +  +        +  C +
Sbjct: 41  CNQTESRRAMAALEQVLYPFQPLTP---SESCPWIVEHDLFYELNRDADVTAKGQPACLF 97

Query: 107 CKKSFRAEKFLDQHFDNRH----NNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSK 158
           C K+F    FLDQH+  RH     +L N     CLAD C  L C            +   
Sbjct: 98  CGKAFENAVFLDQHYPRRHLPTFLDLPNPDRRLCLADSCDVLQCAAYSEGAELWAFAHQP 157

Query: 159 CNPAAVAKNRHLCESLANRCFP 180
           C+P      +  C +  + CFP
Sbjct: 158 CSPGQYDMAQVRCRATIDTCFP 179


>gi|303390934|ref|XP_003073697.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
           50506]
 gi|303302845|gb|ADM12337.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
           50506]
          Length = 388

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 33/50 (66%), Gaps = 5/50 (10%)

Query: 80  LHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRHNN 127
           LHP     +++E + +   +NE  ++CG+C KSF +EKF+  HF+N+H +
Sbjct: 278 LHPKT---KEEELNNMTTIVNENHYKCGFCGKSFESEKFIFNHFNNKHED 324


>gi|68489815|ref|XP_711247.1| potential zinc finger protein [Candida albicans SC5314]
 gi|46432535|gb|EAK92012.1| potential zinc finger protein [Candida albicans SC5314]
          Length = 132

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 8/75 (10%)

Query: 78  CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 134
           C  HP  N  F D +Q+ +HV  I+E  C  C K F  +KFL  H D  HN +L +    
Sbjct: 28  CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPDKKFLSLHIDENHNPILQIKQER 87

Query: 135 -----KCLADLCGAL 144
                KC  + C A+
Sbjct: 88  GDKIFKCYTESCKAV 102


>gi|19074624|ref|NP_586130.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
 gi|51702181|sp|Q8SUH6.1|ZA15_ENCCU RecName: Full=Zinc finger C2H2 protein ECU10_0150
 gi|19069266|emb|CAD25734.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
          Length = 388

 Score = 41.6 bits (96), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 24/31 (77%)

Query: 97  VDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 127
           V+ N ++CG+C K+F +EKF+  HF+N+H N
Sbjct: 294 VNENHYKCGFCGKAFESEKFIFNHFNNKHEN 324


>gi|68489768|ref|XP_711271.1| potential zinc finger protein [Candida albicans SC5314]
 gi|46432560|gb|EAK92036.1| potential zinc finger protein [Candida albicans SC5314]
          Length = 132

 Score = 41.2 bits (95), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 8/75 (10%)

Query: 78  CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 134
           C  HP  N  F D +Q+ +HV  I+E  C  C K F  + FL  H D  HN LL +    
Sbjct: 28  CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPNKSFLSLHIDENHNPLLQIKQER 87

Query: 135 -----KCLADLCGAL 144
                KC  + C A+
Sbjct: 88  GDKIFKCYTESCKAV 102


>gi|347967063|ref|XP_550770.4| AGAP002041-PA [Anopheles gambiae str. PEST]
 gi|333469774|gb|EAL38493.4| AGAP002041-PA [Anopheles gambiae str. PEST]
          Length = 1932

 Score = 41.2 bits (95), Expect = 0.45,   Method: Composition-based stats.
 Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)

Query: 92  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
           +HK+ H D  ++ CG C K+F+ +  L+ H     N        +C AD CG  +CD   
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316

Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 188
            ++S R         +N H    LAN     + GP  S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347


>gi|347967065|ref|XP_003436012.1| AGAP002041-PB [Anopheles gambiae str. PEST]
 gi|333469775|gb|EGK97401.1| AGAP002041-PB [Anopheles gambiae str. PEST]
          Length = 1876

 Score = 41.2 bits (95), Expect = 0.50,   Method: Composition-based stats.
 Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)

Query: 92  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
           +HK+ H D  ++ CG C K+F+ +  L+ H     N        +C AD CG  +CD   
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316

Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 188
            ++S R         +N H    LAN     + GP  S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347


>gi|401827595|ref|XP_003888090.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
           50504]
 gi|392999290|gb|AFM99109.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
           50504]
          Length = 388

 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 15/29 (51%), Positives = 23/29 (79%)

Query: 97  VDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
           V+ N ++CG+C KSF +EKF+  HF+N+H
Sbjct: 294 VNENHYKCGFCGKSFESEKFIFNHFNNKH 322


>gi|66358034|ref|XP_626195.1| hypothetical protein [Cryptosporidium parvum Iowa II]
 gi|46227028|gb|EAK87978.1| hypothetical protein with signal peptide, possible transmembrane
           domain near C-terminus and predicted SMART Znf_C2H2
           domain [Cryptosporidium parvum Iowa II]
          Length = 233

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 44/85 (51%), Gaps = 1/85 (1%)

Query: 44  EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
           +++CSR+ SR     I + L+   + +   +  +C  + ++ +F  +E+  I     E +
Sbjct: 25  DINCSRQVSRRVSHAIRK-LLKDKKDISKHLLPECAFNQEHHIFNYEEKINIVYPTGERQ 83

Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNL 128
           CG+C + F+ EK  DQH    H +L
Sbjct: 84  CGFCGEIFQDEKTYDQHMGKSHPSL 108


>gi|357631664|gb|EHJ79133.1| putative zinc finger protein 91-like protein [Danaus plexippus]
          Length = 390

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 28/37 (75%), Gaps = 1/37 (2%)

Query: 92  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 127
           QH + H+DIN+++CG+C K++++ + L +HF   H++
Sbjct: 189 QHLVSHIDINQFKCGFCNKTYKSRQSLKEHFRVAHSS 225


>gi|241956272|ref|XP_002420856.1| zinc finger protein, putative [Candida dubliniensis CD36]
 gi|223644199|emb|CAX41009.1| zinc finger protein, putative [Candida dubliniensis CD36]
          Length = 143

 Score = 38.1 bits (87), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 8/75 (10%)

Query: 78  CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 134
           C  HP  N  F + +Q+ +HV   +E  C  C+K F  + +L  H D  HN LL +    
Sbjct: 39  CNYHPCLNSAFENYQQYHLHVLSTHEHTCESCQKVFPNKNYLSLHIDENHNPLLQIKQER 98

Query: 135 -----KCLADLCGAL 144
                KC  + C A+
Sbjct: 99  GDKIFKCYTETCKAV 113


>gi|443690749|gb|ELT92809.1| hypothetical protein CAPTEDRAFT_214413 [Capitella teleta]
          Length = 947

 Score = 38.1 bits (87), Expect = 3.7,   Method: Composition-based stats.
 Identities = 32/113 (28%), Positives = 54/113 (47%), Gaps = 6/113 (5%)

Query: 15  LLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCS-RERSRAAWGVIEEYLMPFVEQVQYE 73
           LLFS + +  +G D  A+   + + G  +E+ CS  ER + A  V+++      +Q++ E
Sbjct: 106 LLFSQEYLNNKGKDLDAAFKREADVGIVNEISCSCNERMQQAHKVLKKEF----DQLKEE 161

Query: 74  ISS-KCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
           ++  K   H    M   Q+Q   +   +  +C YC K+F    FL  H   RH
Sbjct: 162 LAEVKKESHKRKKMILAQQQMLQNGQNSYNKCPYCSKAFVNASFLQSHLLRRH 214


>gi|392568613|gb|EIW61787.1| hypothetical protein TRAVEDRAFT_27265 [Trametes versicolor
           FP-101664 SS1]
          Length = 904

 Score = 38.1 bits (87), Expect = 3.9,   Method: Composition-based stats.
 Identities = 15/47 (31%), Positives = 29/47 (61%)

Query: 83  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
           + ++ R  E H    D  ++RC  C+K F+A  F+++H  N+H++L+
Sbjct: 670 EEELSRACEAHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHSDLI 716


>gi|195107861|ref|XP_001998512.1| GI23595 [Drosophila mojavensis]
 gi|193915106|gb|EDW13973.1| GI23595 [Drosophila mojavensis]
          Length = 2064

 Score = 38.1 bits (87), Expect = 4.2,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 5/88 (5%)

Query: 92  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
           +HK+ H D  ++ C  C K+F+ +  L+ H     N        +C A+ CG  +CD   
Sbjct: 351 KHKLTHSDERKYICALCSKAFKRQDHLNGHMMTHRNK----KPYECKAEGCGKSYCDARS 406

Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRC 178
             + S +    AA   N  L  + +  C
Sbjct: 407 LRRHSENHHGGAATPNNNSLSPTTSTNC 434


>gi|409049944|gb|EKM59421.1| hypothetical protein PHACADRAFT_249892 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 924

 Score = 37.7 bits (86), Expect = 5.5,   Method: Composition-based stats.
 Identities = 15/39 (38%), Positives = 25/39 (64%)

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
           EQH    D  ++RC  C+K F+A  F+++H  N+H +L+
Sbjct: 693 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPDLV 731


>gi|67612273|ref|XP_667210.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis TU502]
 gi|54658324|gb|EAL36982.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis]
          Length = 668

 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
           EQH +    + W+CG C K F+ E+++ +H   +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571


>gi|449549636|gb|EMD40601.1| hypothetical protein CERSUDRAFT_111186 [Ceriporiopsis subvermispora
           B]
          Length = 926

 Score = 37.4 bits (85), Expect = 6.9,   Method: Composition-based stats.
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
           EQH    D  ++RC  C+K F+A  F+++H  N+H  L+
Sbjct: 694 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPELV 732


>gi|66475454|ref|XP_627543.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
           [Cryptosporidium parvum Iowa II]
 gi|46228996|gb|EAK89845.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
           [Cryptosporidium parvum Iowa II]
          Length = 668

 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
           EQH +    + W+CG C K F+ E+++ +H   +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571


>gi|32398760|emb|CAD98720.1| hypothetical predicted zf-C2H2 zinc finger protein, unknown
           function [Cryptosporidium parvum]
          Length = 598

 Score = 37.0 bits (84), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 91  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
           EQH +    + W+CG C K F+ E+++ +H   +H + L
Sbjct: 463 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 501


>gi|157136418|ref|XP_001663747.1| hypothetical protein AaeL_AAEL013548 [Aedes aegypti]
 gi|108869956|gb|EAT34181.1| AAEL013548-PA [Aedes aegypti]
          Length = 566

 Score = 37.0 bits (84), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 95  IHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNL 128
           IHVD   +RC  C K+F+ +++L +H +N H ++
Sbjct: 303 IHVDPKTYRCEPCDKTFKGKRYLQEHLENVHGSI 336


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.325    0.138    0.435 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,156,223,143
Number of Sequences: 23463169
Number of extensions: 163957519
Number of successful extensions: 606517
Number of sequences better than 100.0: 181
Number of HSP's better than 100.0 without gapping: 112
Number of HSP's successfully gapped in prelim test: 69
Number of HSP's that attempted gapping in prelim test: 606103
Number of HSP's gapped (non-prelim): 381
length of query: 268
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 128
effective length of database: 9,074,351,707
effective search space: 1161517018496
effective search space used: 1161517018496
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 75 (33.5 bits)