BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 028680
         (205 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|297733623|emb|CBI14870.3| unnamed protein product [Vitis vinifera]
          Length = 276

 Score =  352 bits (902), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 160/205 (78%), Positives = 179/205 (87%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ QY+ISS C+LHPDN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN
Sbjct: 72  MPFVEREQYQISSSCKLHPDNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 131

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH  LLNVSH KCLADLCGALHCD VMNSK+ ++KCNPAA  +NRHLCESLA+ CFP+N 
Sbjct: 132 RHYGLLNVSHSKCLADLCGALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNH 191

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SASRLH++FLRQFCDAH C    +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYL
Sbjct: 192 GLSASRLHDIFLRQFCDAHTCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYL 251

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           YQREMR GTQELKRI R G K KPS
Sbjct: 252 YQREMRNGTQELKRISRHGLKTKPS 276


>gi|359491160|ref|XP_002262859.2| PREDICTED: uncharacterized protein LOC100244377 [Vitis vinifera]
          Length = 254

 Score =  350 bits (898), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 160/205 (78%), Positives = 179/205 (87%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ QY+ISS C+LHPDN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN
Sbjct: 50  MPFVEREQYQISSSCKLHPDNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 109

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH  LLNVSH KCLADLCGALHCD VMNSK+ ++KCNPAA  +NRHLCESLA+ CFP+N 
Sbjct: 110 RHYGLLNVSHSKCLADLCGALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNH 169

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SASRLH++FLRQFCDAH C    +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYL
Sbjct: 170 GLSASRLHDIFLRQFCDAHTCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYL 229

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           YQREMR GTQELKRI R G K KPS
Sbjct: 230 YQREMRNGTQELKRISRHGLKTKPS 254


>gi|224133834|ref|XP_002321672.1| predicted protein [Populus trichocarpa]
 gi|222868668|gb|EEF05799.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  348 bits (894), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 158/205 (77%), Positives = 184/205 (89%), Gaps = 2/205 (0%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ QY+ISSKCRLHP ND+FRDQE+HKIHVDINEW+CGYCKK FRAEK+LDQHFDN
Sbjct: 64  MPFVEREQYQISSKCRLHPSNDLFRDQEEHKIHVDINEWKCGYCKKHFRAEKYLDQHFDN 123

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN+S GKC+ADLCGALHCDF+M+SKS+++KCNPAAVAKN HLCESLA+ CFP+NQ
Sbjct: 124 RHYNLLNISDGKCMADLCGALHCDFMMDSKSTKTKCNPAAVAKNHHLCESLADSCFPLNQ 183

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSASRLHELFL QFCDAH C    + FPKGG+K+ SVFYL+ISILT+M++P+FYLIVYL
Sbjct: 184 GPSASRLHELFLHQFCDAHTCSGNKKLFPKGGKKKTSVFYLAISILTMMMIPLFYLIVYL 243

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +Q EMR GTQEL+RI +  RK KPS
Sbjct: 244 HQSEMRKGTQELRRISK--RKTKPS 266


>gi|226531528|ref|NP_001142988.1| uncharacterized protein LOC100275449 precursor [Zea mays]
 gi|195612594|gb|ACG28127.1| zinc finger protein [Zea mays]
          Length = 276

 Score =  324 bits (830), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 145/205 (70%), Positives = 176/205 (85%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 72  MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 131

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NL++ S G+C+ADLCGALHCD +M  K  +SKCN AA  +NRHLCESLA+ CFPINQ
Sbjct: 132 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPINQ 191

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G +ASRLHE FLRQFCDAH C + ++PFPKGGRKQ +  YL++SILTL+LLP+FYLIV+L
Sbjct: 192 GLAASRLHEFFLRQFCDAHTCSKGTKPFPKGGRKQTNRLYLALSILTLILLPLFYLIVFL 251

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QREMR GTQ+LKR  ++G+K KPS
Sbjct: 252 HQREMRKGTQDLKRFSKIGQKKKPS 276


>gi|449440588|ref|XP_004138066.1| PREDICTED: uncharacterized protein LOC101218367 [Cucumis sativus]
          Length = 280

 Score =  324 bits (830), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 144/204 (70%), Positives = 175/204 (85%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +PF+E+  YE+S++CRLHP+ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDN
Sbjct: 76  LPFMEKENYEVSTQCRLHPNNDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDN 135

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLNVSHGKCLADLCGALHCD  M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+
Sbjct: 136 RHSNLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINE 195

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSA+RLHELFL QFC AH C  K +PF +G  +QP +FY++ SIL LMLLPIFY+IVYL
Sbjct: 196 GPSANRLHELFLHQFCGAHSCTGKQKPFSRGAARQPGIFYMASSILILMLLPIFYVIVYL 255

Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
           ++RE R G + LKRI + GRK KP
Sbjct: 256 HRRESRNGIEVLKRISKAGRKNKP 279


>gi|238013592|gb|ACR37831.1| unknown [Zea mays]
 gi|413916779|gb|AFW56711.1| zinc finger protein [Zea mays]
          Length = 275

 Score =  323 bits (829), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 145/205 (70%), Positives = 176/205 (85%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 71  MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 130

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NL++ S G+C+ADLCGALHCD +M  K  +SKCN AA  +NRHLCESLA+ CFPINQ
Sbjct: 131 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPINQ 190

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G +ASRLHE FLRQFCDAH C + ++PFPKGGRKQ +  YL++SILTL+LLP+FYLIV+L
Sbjct: 191 GLAASRLHEFFLRQFCDAHTCSKGTKPFPKGGRKQTNRLYLALSILTLILLPLFYLIVFL 250

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QREMR GTQ+LKR  ++G+K KPS
Sbjct: 251 HQREMRKGTQDLKRFSKIGQKKKPS 275


>gi|242083250|ref|XP_002442050.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
 gi|241942743|gb|EES15888.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
          Length = 276

 Score =  323 bits (829), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 145/205 (70%), Positives = 176/205 (85%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKFLDQHF+N
Sbjct: 72  MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFLDQHFEN 131

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NL++ S G+C+ADLCGALHCD +M  K  +SKCN AA  +NRHLCESLA+ CFP+NQ
Sbjct: 132 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPVNQ 191

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G +ASRLHE FLRQFCDAH C + ++PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L
Sbjct: 192 GLAASRLHEFFLRQFCDAHTCNKGTKPFPKGGRKQTNRFYLALCILTLILLPLFYLIVFL 251

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QREMR GTQ+LKR  ++G+K KPS
Sbjct: 252 HQREMRKGTQDLKRFSKIGQKKKPS 276


>gi|218186624|gb|EEC69051.1| hypothetical protein OsI_37889 [Oryza sativa Indica Group]
          Length = 330

 Score =  321 bits (823), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 146/205 (71%), Positives = 174/205 (84%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRLHPDNDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF N
Sbjct: 126 MPFVEKEKYELPSKCRLHPDNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHN 185

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RHNNL++ S G+CLADLCGALHCD ++  K  +SKC+  A A+NRHLCESLA+ CFPINQ
Sbjct: 186 RHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQ 245

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SASRLHE FLRQFCDAH C   S+PFPKGGRKQ + FYL++  LT++LLP+FYLIV+L
Sbjct: 246 GQSASRLHEFFLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFL 305

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QREM+ G Q L+RI +V +K KPS
Sbjct: 306 HQREMKKGGQNLRRISKVVQKKKPS 330


>gi|296081316|emb|CBI17698.3| unnamed protein product [Vitis vinifera]
          Length = 409

 Score =  318 bits (814), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 145/205 (70%), Positives = 168/205 (81%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+  Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD+HFDN
Sbjct: 205 MPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLDKHFDN 264

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLNVS  KCLAD+CGALHCD VM+SK  ++KCNPAA A+N HLCESLA+ CFP++ 
Sbjct: 265 RHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGCFPVSG 324

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSASRLHE FLRQFCDAH C    +PF +G +K+ SVFYL ISILTLMLLP+FY  VYL
Sbjct: 325 GPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISILTLMLLPLFYFFVYL 384

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           YQR +R G Q  +RI   GRK KP+
Sbjct: 385 YQRGLRRGAQNFRRISESGRKKKPT 409


>gi|115487926|ref|NP_001066450.1| Os12g0233400 [Oryza sativa Japonica Group]
 gi|108862374|gb|ABA96879.2| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|108862375|gb|ABG21933.1| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|108862376|gb|ABG21934.1| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|108862377|gb|ABG21935.1| zinc finger family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113648957|dbj|BAF29469.1| Os12g0233400 [Oryza sativa Japonica Group]
 gi|215697514|dbj|BAG91508.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 275

 Score =  316 bits (809), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 144/205 (70%), Positives = 172/205 (83%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRLHP NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF N
Sbjct: 71  MPFVEKEKYELPSKCRLHPGNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHN 130

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RHNNL++ S G+CLADLCGALHCD ++  K  +SKC+  A A+NRHLCESLA+ CFPINQ
Sbjct: 131 RHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQ 190

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SASRLHE  LRQFCDAH C   S+PFPKGGRKQ + FYL++  LT++LLP+FYLIV+L
Sbjct: 191 GQSASRLHEFLLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFL 250

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QREM+ G Q L+RI +V +K KPS
Sbjct: 251 HQREMKKGGQNLRRISKVVQKKKPS 275


>gi|359488643|ref|XP_002271064.2| PREDICTED: uncharacterized protein LOC100257742 [Vitis vinifera]
          Length = 279

 Score =  315 bits (807), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 145/205 (70%), Positives = 168/205 (81%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+  Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD+HFDN
Sbjct: 75  MPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLDKHFDN 134

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLNVS  KCLAD+CGALHCD VM+SK  ++KCNPAA A+N HLCESLA+ CFP++ 
Sbjct: 135 RHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGCFPVSG 194

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSASRLHE FLRQFCDAH C    +PF +G +K+ SVFYL ISILTLMLLP+FY  VYL
Sbjct: 195 GPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISILTLMLLPLFYFFVYL 254

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           YQR +R G Q  +RI   GRK KP+
Sbjct: 255 YQRGLRRGAQNFRRISESGRKKKPT 279


>gi|326487255|dbj|BAJ89612.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326532646|dbj|BAJ89168.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 273

 Score =  314 bits (805), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 146/205 (71%), Positives = 169/205 (82%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRL P+NDMFR+QEQHKIH D+NEWRCG+CKKSFRAEKF+DQHF N
Sbjct: 69  MPFVEKEKYELPSKCRLRPENDMFREQEQHKIHFDVNEWRCGFCKKSFRAEKFIDQHFAN 128

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RHNNLL+ S G+CLADLCGALHCD  M  K  +SKCN AA A+NRH CESLA+ CFPINQ
Sbjct: 129 RHNNLLDNSQGRCLADLCGALHCDLTMEFKKPKSKCNDAAAARNRHRCESLADSCFPINQ 188

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SASRLHE FL QFCDAH C   S+PFPKGGRKQ + FYL+I ILT++LLP+FYLIV+L
Sbjct: 189 GHSASRLHEFFLHQFCDAHTCNGVSKPFPKGGRKQTNRFYLAICILTVLLLPVFYLIVFL 248

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QREM+ G Q  KRI +   K KPS
Sbjct: 249 HQREMKKGDQVFKRIGKTVHKKKPS 273


>gi|297793919|ref|XP_002864844.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297310679|gb|EFH41103.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 270

 Score =  311 bits (796), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 151/205 (73%), Positives = 176/205 (85%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            PFVE+ +Y+I   CRLHPDND++RDQEQHKIHVD+ EW+CGYCKKSF  EKFLD+HF  
Sbjct: 66  TPFVERERYQIPKNCRLHPDNDLYRDQEQHKIHVDVFEWKCGYCKKSFNDEKFLDKHFTT 125

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN +  KCLADLCGALHCDFV+NSK  ++KCNPAAVAKNRHLCES+AN CFP++Q
Sbjct: 126 RHYNLLNTTDTKCLADLCGALHCDFVLNSKKPKTKCNPAAVAKNRHLCESVANSCFPVSQ 185

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSASRLHE FLRQFCDAH C  K +PFP+GG+K+  VFYL+ISILTLMLLP+FYL+V+L
Sbjct: 186 GPSASRLHEHFLRQFCDAHTCTGKDKPFPRGGKKKSGVFYLAISILTLMLLPLFYLLVFL 245

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           YQRE R GTQEL+RI + GRK KPS
Sbjct: 246 YQREKRSGTQELRRIRKTGRKPKPS 270


>gi|356548383|ref|XP_003542582.1| PREDICTED: uncharacterized protein LOC100780668 [Glycine max]
          Length = 244

 Score =  306 bits (784), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 138/205 (67%), Positives = 169/205 (82%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            PF+E+  Y++SSKC+LHP+ND+F DQE+HKI++D +EWRCGYCKKSFR EKFLDQHFD+
Sbjct: 40  TPFMERENYQLSSKCKLHPENDIFTDQEEHKIYIDRHEWRCGYCKKSFREEKFLDQHFDS 99

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH N LN+SHGKCL DLCGALHCD VMN KSSRSKCNPAA A+NRHLCE+LA+ CFPI++
Sbjct: 100 RHYNFLNLSHGKCLGDLCGALHCDAVMNFKSSRSKCNPAAAARNRHLCENLADNCFPISE 159

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSA RLHELFL QFCDAH C  K +PF +GG+ Q S F L+   L L+LLP+FYL +YL
Sbjct: 160 GPSAGRLHELFLHQFCDAHTCSGKHKPFSRGGKDQSSFFRLAAGALILVLLPVFYLFLYL 219

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
            Q +++  TQEL+RI + G K+KPS
Sbjct: 220 VQSDVKSRTQELRRISKAGWKSKPS 244


>gi|18424689|ref|NP_568969.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
 gi|17381006|gb|AAL36315.1| unknown protein [Arabidopsis thaliana]
 gi|21536913|gb|AAM61245.1| zinc finger-like protein [Arabidopsis thaliana]
 gi|22136886|gb|AAM91787.1| unknown protein [Arabidopsis thaliana]
 gi|332010344|gb|AED97727.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
          Length = 271

 Score =  300 bits (769), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 145/205 (70%), Positives = 173/205 (84%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            PFVE+ +YEI   CRLHPDND++RDQE HK+HVD+ EW+CGYCKKSF  EKFLD+HF  
Sbjct: 67  TPFVERERYEIPKNCRLHPDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFST 126

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN +  KCLADLCGALHCDFV++SK  +SKCNP AVAKNRHLCES+AN CFP++Q
Sbjct: 127 RHYNLLNTTDTKCLADLCGALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQ 186

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSASRLHE FLRQFCDAH C    +PFP+GG+K+  VFYL+ISILTLMLLP+FYL+V+L
Sbjct: 187 GPSASRLHEHFLRQFCDAHTCTGNDKPFPRGGKKKSGVFYLAISILTLMLLPLFYLLVFL 246

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QRE R GTQ+L+RI + G+K KPS
Sbjct: 247 HQREKRSGTQDLRRIIKSGKKTKPS 271


>gi|222616852|gb|EEE52984.1| hypothetical protein OsJ_35656 [Oryza sativa Japonica Group]
          Length = 1075

 Score =  296 bits (759), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 131/182 (71%), Positives = 155/182 (85%)

Query: 1    MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            MPFVE+ +YE+ SKCRLHP NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF N
Sbjct: 859  MPFVEKEKYELPSKCRLHPGNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHN 918

Query: 61   RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
            RHNNL++ S G+CLADLCGALHCD ++  K  +SKC+  A A+NRHLCESLA+ CFPINQ
Sbjct: 919  RHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQ 978

Query: 121  GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
            G SASRLHE  LRQFCDAH C   S+PFPKGGRKQ + FYL++  LT++LLP+FYLIV+L
Sbjct: 979  GQSASRLHEFLLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFL 1038

Query: 181  YQ 182
            +Q
Sbjct: 1039 HQ 1040


>gi|224060201|ref|XP_002300082.1| predicted protein [Populus trichocarpa]
 gi|222847340|gb|EEE84887.1| predicted protein [Populus trichocarpa]
          Length = 224

 Score =  296 bits (758), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 129/203 (63%), Positives = 166/203 (81%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y+ISS+CRLHP+ND++RDQEQHK+HVDINEWRCGYCKK+F  EK+LD+HFDN
Sbjct: 21  MPFVEKERYKISSRCRLHPENDLYRDQEQHKMHVDINEWRCGYCKKTFYEEKYLDKHFDN 80

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH +LLNVSH KCLAD+CG LHCD V++S   ++KCNPAA  +N+HLCESLA+ CFP+N+
Sbjct: 81  RHYDLLNVSHSKCLADVCGVLHCDLVLDSAPHKTKCNPAATGRNKHLCESLADSCFPVNE 140

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SA RL+E FLRQFCDAH C    +PF KGG+K+ S  Y+ IS+L LMLL +FY+ +YL
Sbjct: 141 GTSALRLNEFFLRQFCDAHTCSGGRKPFSKGGKKETSTLYVIISVLVLMLLALFYIFMYL 200

Query: 181 YQREMRGGTQELKRIPRVGRKAK 203
           Y R ++ G+Q LK I + G+K +
Sbjct: 201 YLRGIKRGSQGLKHISKSGQKKR 223


>gi|302398695|gb|ADL36642.1| C2H2L domain class transcription factor [Malus x domestica]
          Length = 274

 Score =  292 bits (748), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 134/207 (64%), Positives = 167/207 (80%), Gaps = 2/207 (0%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y+IS  C+LHPDND++RDQE+HKI VD+NEW+CGYCKK F  +KFLD+HFDN
Sbjct: 68  MPFVEKERYQISRTCKLHPDNDLYRDQEEHKIQVDLNEWQCGYCKKRFYDDKFLDKHFDN 127

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNS-KSSRSKCNPAAVAKNRHLCESLANRCFPIN 119
           RH NLLNVSH +CLAD+CGALHCD  M+S    ++KCNPAA A+NRHLCE LA+ CFP+ 
Sbjct: 128 RHYNLLNVSHTRCLADVCGALHCDLEMDSVPPKKTKCNPAAAARNRHLCEGLADSCFPVK 187

Query: 120 QGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQ-PSVFYLSISILTLMLLPIFYLIV 178
           QGP+ASRLHE FLRQFCD+H C  + +PF +G RK+  S+ YL ISILTL LL +FY  +
Sbjct: 188 QGPAASRLHEFFLRQFCDSHTCTGRQKPFSRGRRKKRASISYLVISILTLALLFLFYSYI 247

Query: 179 YLYQREMRGGTQELKRIPRVGRKAKPS 205
           Y+Y R ++ GTQELKR+  +GRK KPS
Sbjct: 248 YMYSRGVKRGTQELKRVTPIGRKKKPS 274


>gi|297805600|ref|XP_002870684.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316520|gb|EFH46943.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 271

 Score =  283 bits (723), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 129/204 (63%), Positives = 167/204 (81%), Gaps = 1/204 (0%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP+VE+ +Y++ S CR+H DND++R+QE+HK+H DINEWRCG+CKK+F  EK+LD+HFD+
Sbjct: 68  MPYVEKERYQLPSSCRVHRDNDIYREQEEHKVHSDINEWRCGFCKKAFYEEKYLDKHFDS 127

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN SHGKCLADLCGALHCD V+N+   +SKCNPAA A+NRHLCESLAN CFP+N+
Sbjct: 128 RHYNLLNASHGKCLADLCGALHCDLVVNTAQLKSKCNPAASARNRHLCESLANSCFPVNK 187

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GPSA+RLH+ FLRQFCDAH C   S+PF +  +K+  + Y+ ISI TL++L ++Y  VYL
Sbjct: 188 GPSANRLHDFFLRQFCDAHTCSGGSRPFSQKPKKRGKL-YIIISISTLIVLLLYYSFVYL 246

Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
           +QR ++ G+QELKRI R G K KP
Sbjct: 247 FQRGLKRGSQELKRIRRNGLKKKP 270


>gi|357160391|ref|XP_003578749.1| PREDICTED: uncharacterized protein LOC100823211 [Brachypodium
           distachyon]
          Length = 274

 Score =  280 bits (717), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 145/205 (70%), Positives = 171/205 (83%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRLH +NDMFR+QE+HKIH D+NEWRCG+CKKSFRAEK++DQHF N
Sbjct: 70  MPFVEKEKYELPSKCRLHAENDMFREQEEHKIHFDVNEWRCGFCKKSFRAEKYIDQHFSN 129

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RHNNLL+ S G+CLADLCGALHCD  M  K  +SKCN AA A+N HLCESLA+ CFP+NQ
Sbjct: 130 RHNNLLDNSQGRCLADLCGALHCDLTMEFKKPKSKCNAAAAARNCHLCESLADNCFPVNQ 189

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SASRLHE FLRQFCDAH C   S+PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L
Sbjct: 190 GHSASRLHEFFLRQFCDAHTCNGDSKPFPKGGRKQTNRFYLALCILTLLLLPLFYLIVFL 249

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           +QREM+ G Q  KRIP+   K KPS
Sbjct: 250 HQREMKKGVQVFKRIPKTVHKKKPS 274


>gi|255540381|ref|XP_002511255.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223550370|gb|EEF51857.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 217

 Score =  275 bits (704), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 122/157 (77%), Positives = 144/157 (91%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y+ISSKCRLHP+ND+FRDQE+HKIHVDINEW+CGYCKKSFRAE+FLDQHF+N
Sbjct: 40  MPFVERERYQISSKCRLHPENDLFRDQEEHKIHVDINEWKCGYCKKSFRAERFLDQHFEN 99

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN+S GKCLADLCGALHCDFV+N+K+ ++KCNPAAVAKNRHLCESLA+ CFP+NQ
Sbjct: 100 RHYNLLNISDGKCLADLCGALHCDFVLNTKTMKTKCNPAAVAKNRHLCESLADSCFPLNQ 159

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPS 157
           GPSASRLHELF+ QFCDAH C  K + FPKGG++  +
Sbjct: 160 GPSASRLHELFVHQFCDAHTCSGKRKLFPKGGKRNEA 196


>gi|449455886|ref|XP_004145681.1| PREDICTED: uncharacterized protein LOC101219582 [Cucumis sativus]
 gi|449492886|ref|XP_004159131.1| PREDICTED: uncharacterized LOC101219582 [Cucumis sativus]
          Length = 281

 Score =  274 bits (701), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 128/205 (62%), Positives = 162/205 (79%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y+IS+KCRLHPDNDMFRDQEQHK H+D N+W+CGYC+K F  EK++DQHFDN
Sbjct: 77  MPFVEKKKYKISTKCRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYIDQHFDN 136

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLNVS  +CLADLCGALHCD V+++ S +SKCNPAA A+N+H+CE LA+ CFP+++
Sbjct: 137 RHYNLLNVSRNRCLADLCGALHCDHVIDAVSQKSKCNPAAAARNKHMCEGLADSCFPVDE 196

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G  AS LHE FL QFCDAH C  K +PF +G + + SVFY+ IS+LT++ +  FY+  YL
Sbjct: 197 GALASHLHEFFLHQFCDAHTCSGKPKPFSRGRQVRRSVFYIVISVLTILFVMFFYVFFYL 256

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           Y R MR   Q LKR+ + GRK KPS
Sbjct: 257 YNRGMRTRPQVLKRLSQSGRKKKPS 281


>gi|10177309|dbj|BAB10570.1| unnamed protein product [Arabidopsis thaliana]
          Length = 277

 Score =  273 bits (699), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 129/211 (61%), Positives = 160/211 (75%), Gaps = 6/211 (2%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            PFVE+ +YEI   CRLHPDND++RDQE HK+HVD+ EW+CGYCKKSF  EKFLD+HF  
Sbjct: 67  TPFVERERYEIPKNCRLHPDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFST 126

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN +  KCLADLCGALHCDFV++SK  +SKCNP AVAKNRHLCES+AN CFP++Q
Sbjct: 127 RHYNLLNTTDTKCLADLCGALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQ 186

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRK---QPSVF-YLSISILTLMLLPIF-- 174
           GPSASRLHE FLRQFCDAH C    +PFP+GG+    + +V+ YL +  +  +   IF  
Sbjct: 187 GPSASRLHEHFLRQFCDAHTCTGNDKPFPRGGKSSYIKGTVWCYLFLFSIIALSSEIFGH 246

Query: 175 YLIVYLYQREMRGGTQELKRIPRVGRKAKPS 205
             +  ++ RE R GTQ+L+RI + G+K KPS
Sbjct: 247 IAVSTVFNREKRSGTQDLRRIIKSGKKTKPS 277


>gi|356567332|ref|XP_003551875.1| PREDICTED: uncharacterized protein LOC100527694 [Glycine max]
          Length = 268

 Score =  272 bits (695), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 134/205 (65%), Positives = 166/205 (80%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF  EK LDQHFDN
Sbjct: 64  MPFVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDN 123

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLN+S  +C+AD+CGALHCD  MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +
Sbjct: 124 RHSNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTE 183

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GP+ASRLHE FL QFCDAH C    +PF +G RK+ +VFY+ +SIL ++LL ++YL +YL
Sbjct: 184 GPAASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYL 243

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           YQR M+  TQ LKR+ +  RK KPS
Sbjct: 244 YQRGMKRETQVLKRVSQASRKKKPS 268


>gi|255632970|gb|ACU16839.1| unknown [Glycine max]
          Length = 266

 Score =  267 bits (682), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 131/201 (65%), Positives = 163/201 (81%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF  EK LDQHFDN
Sbjct: 64  MPFVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDN 123

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLN+S  +C+AD+CGALHCD  MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +
Sbjct: 124 RHSNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTE 183

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GP+ASRLHE FL QFCDAH C    +PF +G RK+ +VFY+ +SIL ++LL ++YL +YL
Sbjct: 184 GPAASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYL 243

Query: 181 YQREMRGGTQELKRIPRVGRK 201
           YQR M+  TQ LKR+ +  RK
Sbjct: 244 YQRGMKRETQVLKRVSQASRK 264


>gi|294462744|gb|ADE76916.1| unknown [Picea sitchensis]
          Length = 277

 Score =  266 bits (680), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 121/201 (60%), Positives = 152/201 (75%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVEQ  Y IS  CRLH + DMFR+QE HK H+ +NEW+CG+C+K F +EK LDQHFDN
Sbjct: 74  MPFVEQENYNISRTCRLHLEIDMFREQEPHKDHLQVNEWQCGFCRKIFISEKLLDQHFDN 133

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLN+S  +CLAD+CGALHCD +  SK  ++KCNPAA  KN HLCESLAN CFP  +
Sbjct: 134 RHHNLLNISRSRCLADMCGALHCDAMDKSKPIKTKCNPAAADKNCHLCESLANSCFPPQE 193

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
            PSA RLH+ FLRQFCDAH C R SQPF +G  K  ++ Y ++ +L +M+LPIFY IVY+
Sbjct: 194 SPSARRLHDFFLRQFCDAHTCKRGSQPFARGSGKTTNLLYYAVCLLLMMMLPIFYCIVYV 253

Query: 181 YQREMRGGTQELKRIPRVGRK 201
           +QREM    +  KRIP++ +K
Sbjct: 254 HQREMGKHIKNFKRIPKMNQK 274


>gi|26452757|dbj|BAC43459.1| putative zinc finger protein [Arabidopsis thaliana]
          Length = 272

 Score =  265 bits (678), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 120/204 (58%), Positives = 162/204 (79%), Gaps = 1/204 (0%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP+VE+ +Y++ S CR+H DND++R+QE+HK+  DINEWRCG+CKK+F  EK+LD+HFD+
Sbjct: 69  MPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFCKKAFYEEKYLDKHFDS 128

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN SHGKCL+DLCGALHCD V+++   +SKCNPAA AKNRHLCESLAN CFP+N+
Sbjct: 129 RHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKNRHLCESLANSCFPVNK 188

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SA+RLH+ FLRQFCDAH C   S+P  +  +K+ S+ Y+  SI+ L++L ++Y  VYL
Sbjct: 189 GSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFSIIVLVVLLLYYSFVYL 247

Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
           ++R ++  +Q+LKRI   G K KP
Sbjct: 248 FRRGLKRRSQDLKRIRHNGLKKKP 271


>gi|15237477|ref|NP_198887.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
 gi|9758099|dbj|BAB08543.1| unnamed protein product [Arabidopsis thaliana]
 gi|90962968|gb|ABE02408.1| At5g40710 [Arabidopsis thaliana]
 gi|332007203|gb|AED94586.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
          Length = 272

 Score =  265 bits (678), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 120/204 (58%), Positives = 162/204 (79%), Gaps = 1/204 (0%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP+VE+ +Y++ S CR+H DND++R+QE+HK+  DINEWRCG+CKK+F  EK+LD+HFD+
Sbjct: 69  MPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFCKKAFYEEKYLDKHFDS 128

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN SHGKCL+DLCGALHCD V+++   +SKCNPAA AKNRHLCESLAN CFP+N+
Sbjct: 129 RHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKNRHLCESLANSCFPVNK 188

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           G SA+RLH+ FLRQFCDAH C   S+P  +  +K+ S+ Y+  SI+ L++L ++Y  VYL
Sbjct: 189 GSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFSIIVLVVLLLYYSFVYL 247

Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
           ++R ++  +Q+LKRI   G K KP
Sbjct: 248 FRRGLKRRSQDLKRIRHNGLKKKP 271


>gi|356527071|ref|XP_003532137.1| PREDICTED: uncharacterized protein LOC100792257 [Glycine max]
          Length = 271

 Score =  265 bits (676), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 133/205 (64%), Positives = 165/205 (80%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y IS +C LHPDND++RDQEQHK H+DINEW+C YCKKSF  EK LDQHFDN
Sbjct: 67  MPFVEKEKYHISKRCMLHPDNDIYRDQEQHKSHIDINEWQCRYCKKSFYEEKHLDQHFDN 126

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLN+S  +CLAD+CGALHCD  MNS S +SKCNPAA AK++HLCESLA+ CFP++ 
Sbjct: 127 RHSNLLNLSESQCLADVCGALHCDHEMNSGSKKSKCNPAAAAKHKHLCESLADSCFPVSD 186

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           GP+AS+LHE FL QFCDAH C    +PF +G RK+ +VFY+ +SIL ++LL ++YL +YL
Sbjct: 187 GPAASQLHEFFLHQFCDAHSCTGSRKPFSRGHRKKTNVFYIFVSILLVILLLLYYLYIYL 246

Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
           YQR M+  TQ LKRI +  RK KPS
Sbjct: 247 YQRGMKRETQVLKRISQASRKKKPS 271


>gi|357477231|ref|XP_003608901.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
 gi|355509956|gb|AES91098.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
          Length = 241

 Score =  260 bits (664), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 122/165 (73%), Positives = 139/165 (84%), Gaps = 2/165 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            PFVE+  Y++S KCRLHP+NDMFRDQE+H+I++D NEWRCGYCKKSFR EKFLDQH DN
Sbjct: 72  TPFVEKENYQLSRKCRLHPENDMFRDQEEHRIYLDRNEWRCGYCKKSFREEKFLDQHLDN 131

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLN+SHGKCLADLCGALHCD V NSKSSRSKCNPAA A+NRHLCESLA+ CFPI+ 
Sbjct: 132 RHCNLLNLSHGKCLADLCGALHCDAVTNSKSSRSKCNPAAAARNRHLCESLADSCFPISG 191

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISI 165
           G SASRLHELFL QFCDAH C  K +PF +GG+ +P   YL+  I
Sbjct: 192 GLSASRLHELFLHQFCDAHSCSGKHKPFSRGGKVRPE--YLTGEI 234


>gi|147776347|emb|CAN76470.1| hypothetical protein VITISV_016787 [Vitis vinifera]
          Length = 483

 Score =  248 bits (633), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 111/151 (73%), Positives = 128/151 (84%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+  Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD+HFDN
Sbjct: 38  MPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLDKHFDN 97

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLNVS  KCLAD+CGALHCD VM+SK  ++KCNPAA A+N HLCESLA+ CFP++ 
Sbjct: 98  RHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGCFPVSG 157

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKG 151
           GPSASRLHE FLRQFCDAH C    +PF +G
Sbjct: 158 GPSASRLHEFFLRQFCDAHTCTGGRKPFSRG 188


>gi|255571984|ref|XP_002526933.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223533685|gb|EEF35420.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 240

 Score =  247 bits (631), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 106/149 (71%), Positives = 130/149 (87%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+  Y++S+ CRLHPDND++RDQEQHKIH D NEWRCGYC+KSF  E++L++HFDN
Sbjct: 72  MPFVEKEHYQVSTGCRLHPDNDLYRDQEQHKIHEDGNEWRCGYCRKSFYEERYLNKHFDN 131

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NLLNVSH +CLAD+CGALHCDFVM+S + ++KCNPAA A+N+HLCESLAN CFP+N+
Sbjct: 132 RHYNLLNVSHSRCLADVCGALHCDFVMDSATRKTKCNPAAAARNKHLCESLANSCFPVNE 191

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFP 149
           GPSA RL+ELFL QFCDAH C  + +PFP
Sbjct: 192 GPSAHRLNELFLHQFCDAHTCTGRQKPFP 220


>gi|357436185|ref|XP_003588368.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
 gi|355477416|gb|AES58619.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
          Length = 306

 Score =  241 bits (614), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 102/153 (66%), Positives = 128/153 (83%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            PFVE+ +Y IS +CRLHPDND++RDQEQHK H+DINEW+CGYCKK+F  EK LDQHFDN
Sbjct: 145 TPFVEKEKYNISRRCRLHPDNDIYRDQEQHKSHIDINEWQCGYCKKTFYEEKHLDQHFDN 204

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLN+   +CLAD+CGALHCD  +NS S +SKCNPAA A+N+H+CE++A+ CFP+N+
Sbjct: 205 RHSNLLNLKESRCLADVCGALHCDHEINSGSKKSKCNPAAAARNKHICETIADSCFPVNE 264

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGR 153
           GP+ASRLHE FL QFCDAH C  + +PF +G R
Sbjct: 265 GPAASRLHEFFLHQFCDAHSCTGRGKPFSRGRR 297


>gi|388514801|gb|AFK45462.1| unknown [Lotus japonicus]
          Length = 263

 Score =  227 bits (579), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 110/181 (60%), Positives = 133/181 (73%), Gaps = 12/181 (6%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +Y IS +CRLHPDND++RDQE HK H+DINEW+CG+CKKSF  EK LDQHFDN
Sbjct: 80  MPFVEKEKYHISKRCRLHPDNDIYRDQEDHKSHIDINEWQCGFCKKSFYEEKHLDQHFDN 139

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLN S  +CLAD+CGALHCD  +NS S +SKCNPAA A+N+HLCESLA+ CFP+N+
Sbjct: 140 RHSNLLNSSESRCLADVCGALHCDLEINSGSKKSKCNPAAAARNKHLCESLADSCFPVNE 199

Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKG-GRKQ------PSVF-----YLSISILTL 168
           GP+ASRLHE FL QFCD     +     P+G GR Q      P  F     Y +IS  T 
Sbjct: 200 GPAASRLHEFFLHQFCDHTAVLQVGNLSPEGAGRGQMCSTSWPPFFLWCCCYFTISTFTC 259

Query: 169 M 169
           +
Sbjct: 260 I 260


>gi|449529367|ref|XP_004171671.1| PREDICTED: uncharacterized protein LOC101223370, partial [Cucumis
           sativus]
          Length = 203

 Score =  224 bits (571), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 98/128 (76%), Positives = 118/128 (92%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +PF+E+  YE+S++CRLHP+ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDN
Sbjct: 76  LPFMEKENYEVSTQCRLHPNNDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDN 135

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH+NLLNVSHGKCLADLCGALHCD  M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+
Sbjct: 136 RHSNLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINE 195

Query: 121 GPSASRLH 128
           GPSA+RLH
Sbjct: 196 GPSANRLH 203


>gi|413916780|gb|AFW56712.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
          Length = 222

 Score =  208 bits (530), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 95/139 (68%), Positives = 114/139 (82%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 71  MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 130

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
           RH NL++ S G+C+ADLCGALHCD +M  K  +SKCN AA  +NRHLCESLA+ CFPINQ
Sbjct: 131 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPINQ 190

Query: 121 GPSASRLHELFLRQFCDAH 139
           G +ASRLH  +    C+ H
Sbjct: 191 GLAASRLHGKWNVSECNDH 209


>gi|168063136|ref|XP_001783530.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664965|gb|EDQ51666.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  192 bits (488), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 131/200 (65%), Gaps = 8/200 (4%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFV+  +Y +   CRLHPDND++ +QE++ I +   +W+CGYCKK FR + FLD HFDN
Sbjct: 175 MPFVKDQKYTLPKTCRLHPDNDVYHEQEENMIELRPMQWQCGYCKKIFRNQGFLDLHFDN 234

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR----SKCNPAAVAKNRHLCESLANRCF 116
           RH+  L  S   CLAD CGALHCD+  NS SS+    + C  A V KNRH CE LAN CF
Sbjct: 235 RHSEKLEPSSNSCLADACGALHCDY-YNSLSSKPQIQATCKLAVVEKNRHSCEVLANTCF 293

Query: 117 PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGG--RKQPSVFYLSISILTLMLLPIF 174
           P +  P+A RL+  F RQFCDAH C +K + FP G    +  S+ Y ++++ T+++L IF
Sbjct: 294 PADHSPAAQRLNHFFKRQFCDAHTCKKKLKIFPSGSGINRNKSLIY-ALTLFTVIVLAIF 352

Query: 175 YLIVYLYQREMRGGTQELKR 194
           Y I+YL +R+     + L+R
Sbjct: 353 YAILYLIKRDTNLLRKGLRR 372


>gi|302808117|ref|XP_002985753.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
 gi|300146662|gb|EFJ13331.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
          Length = 171

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 84/144 (58%), Positives = 105/144 (72%), Gaps = 4/144 (2%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +PFVE   Y +S +CRLH  ND FR+QE+ K    I++WRCGYC K F +E++LD HFDN
Sbjct: 24  LPFVENENYALSPECRLHASNDAFREQEREKQFYHIHDWRCGYCHKIFESEEYLDLHFDN 83

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKCNPAAVAKNRHLCESLANRCFP 117
           RH+  LNVS   CLADLCGALHCD++      K S++KC+P +V +NRHLCE LAN CFP
Sbjct: 84  RHSETLNVSRHNCLADLCGALHCDYMETKDKHKFSKNKCSP-SVDRNRHLCEKLANSCFP 142

Query: 118 INQGPSASRLHELFLRQFCDAHKC 141
             QG  A+RL++ FLRQFCDAH C
Sbjct: 143 PQQGAQATRLNDFFLRQFCDAHSC 166


>gi|302785357|ref|XP_002974450.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
 gi|300158048|gb|EFJ24672.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
          Length = 171

 Score =  174 bits (441), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 83/144 (57%), Positives = 105/144 (72%), Gaps = 4/144 (2%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +PFVE   Y +S +CRLH  ND FR+QE+ K    I++WRCGYC K F +E++LD HFDN
Sbjct: 24  LPFVENENYALSPQCRLHASNDAFREQEREKQFYHIHDWRCGYCHKIFESEEYLDLHFDN 83

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKCNPAAVAKNRHLCESLANRCFP 117
           RH+  LNVS   CLAD+CGALHCD++      K S++KC+P +V +NRHLCE LAN CFP
Sbjct: 84  RHSETLNVSRDNCLADVCGALHCDYMETKDKHKFSKNKCSP-SVDRNRHLCEKLANSCFP 142

Query: 118 INQGPSASRLHELFLRQFCDAHKC 141
             QG  A+RL++ FLRQFCDAH C
Sbjct: 143 PQQGAQATRLNDFFLRQFCDAHSC 166


>gi|413916778|gb|AFW56710.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
          Length = 177

 Score =  172 bits (437), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 76/107 (71%), Positives = 91/107 (85%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 71  MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 130

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHL 107
           RH NL++ S G+C+ADLCGALHCD +M  K  +SKCN AA  +NRHL
Sbjct: 131 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHL 177


>gi|168040762|ref|XP_001772862.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675773|gb|EDQ62264.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 235

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 58/112 (51%), Positives = 77/112 (68%), Gaps = 4/112 (3%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           PFVE  +Y +   CRLHP ND++R+QE++   +   +W+C YCKK FR + FLD HFDNR
Sbjct: 70  PFVEAEKYTLPESCRLHPANDIYREQEENMDELRPMQWQCRYCKKIFRNQGFLDTHFDNR 129

Query: 62  HNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSK----CNPAAVAKNRHLCE 109
           H+NLL+ S   CLAD CGALHCD+  +  +S+S+    C PA V +NRH CE
Sbjct: 130 HSNLLDKSLNMCLADACGALHCDYYNSLSTSKSQKQSTCKPAVVERNRHSCE 181


>gi|428163164|gb|EKX32251.1| hypothetical protein GUITHDRAFT_91074 [Guillardia theta CCMP2712]
          Length = 297

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 100/213 (46%), Gaps = 30/213 (14%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           P V ++ +++   C    + DMF D E+HK  V  N W+C YC K F++E +L+ H+ NR
Sbjct: 55  PHVRKLGFQLPPGCPFSEERDMFLDNEKHKKEVRFNNWKCLYCNKVFKSESYLEHHYVNR 114

Query: 62  HNNLLNVSHGKCLADLCGALHCDFVMNS-------KSSRSKCNPAAVAKNRHLCESLANR 114
           H   +    G CLAD C  L CD   N        K+S  +C+   + K RH C S+ ++
Sbjct: 115 HPETIK-EDGFCLADYCDVLECDMQENVIGSEGMFKTSSYRCDAKRMQKRRHRCHSVLDK 173

Query: 115 CFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLS------------ 162
           CFP +   +A+RLH  F + +C+   C           R+  ++   S            
Sbjct: 174 CFPPHTSDAANRLHHEFEKLYCEHLSCEASPDGLGDVARRPHTLLSQSKARSEYHMKKTK 233

Query: 163 ----------ISILTLMLLPIFYLIVYLYQREM 185
                       +L  ++L IFYL V LY+++M
Sbjct: 234 HSPWRKLAIVFGVLFAIILFIFYLGVCLYRKDM 266


>gi|255077084|ref|XP_002502194.1| predicted protein [Micromonas sp. RCC299]
 gi|226517459|gb|ACO63452.1| predicted protein [Micromonas sp. RCC299]
          Length = 301

 Score = 95.9 bits (237), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 91/204 (44%), Gaps = 19/204 (9%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           P  ++ +Y +   C      DM+ + E+HK  V   +++  Y  K+F  E ++D+H DNR
Sbjct: 59  PVADRNRYRLPPNCPFDRSKDMYLEHEKHKEVVRRTQYKSLYSDKTFYNEYYVDKHMDNR 118

Query: 62  HNNLLNVSHGKCLADLCGALHCDF-----------VMNSKSSRSKCNPAAVAKNRHLCES 110
           H + +      CLAD C  L CD             + S +SRS+C+   +   RH CE 
Sbjct: 119 HMDKVPPGADVCLADYCEVLRCDEHQAWKHPELRKTVGSSASRSRCSQERMRGLRHFCEV 178

Query: 111 LANRCFPINQGP-----SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV---FYLS 162
           L + CFP   GP     +A+RL E F R  CD   C      F   G          Y+ 
Sbjct: 179 LMSGCFPTGGGPDGDAGTAARLREYFTRHHCDMLTCDGVRDMFGVLGSHHAVAGGGGYVL 238

Query: 163 ISILTLMLLPIFYLIVYLYQREMR 186
           + I+    +  +YL V+   R+ R
Sbjct: 239 VLIVVFAAVGCYYLFVWHAVRDGR 262


>gi|443688435|gb|ELT91124.1| hypothetical protein CAPTEDRAFT_220314 [Capitella teleta]
          Length = 284

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 85/190 (44%), Gaps = 15/190 (7%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P + + Q   S  C L  D DM+ +QEQ K     ++W C +C KSF  E+ LD HF+N
Sbjct: 58  LPILAEFQVTASESCPLSADRDMYAEQEQRKFMESSSKWTCNFCGKSFYDERSLDNHFEN 117

Query: 61  RH-NNLLNVSHGKCLADLCGALHCDFVMNSKSSR----SKCNPAAVAKNRHLCESLANRC 115
           RH  N+L      C AD C    CD +  +  +     + C  A +    H CE L   C
Sbjct: 118 RHPKNILTGPRATCPADFCDIFRCDVISGAHRAGYWEVALCRDADLIHTHHRCEDLLRNC 177

Query: 116 FPINQGPSASRLHELFLRQF----CDAHKCPRKSQPFPKGGRKQP--SVFYLSISILTLM 169
            P N  P A+R  +L L +F    C    C    +  P+     P  +  Y+ + IL   
Sbjct: 178 LPSNI-PHATR--DLILARFEKAICGYLTCDMFWE-IPENRVSNPVGTAIYIVLFILLSF 233

Query: 170 LLPIFYLIVY 179
            L I+Y   Y
Sbjct: 234 GLLIYYYAAY 243


>gi|384247793|gb|EIE21279.1| hypothetical protein COCSUDRAFT_57182 [Coccomyxa subellipsoidea
           C-169]
          Length = 220

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/183 (28%), Positives = 89/183 (48%), Gaps = 19/183 (10%)

Query: 37  NEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR--- 93
           N + C  CK+ F +E+  ++H+D +H+  ++     CLAD C  LHCD +  + +SR   
Sbjct: 20  NRFVCSLCKERFSSEEAWERHYDRKHHIHIHPGRDVCLADYCEVLHCDKINLTPTSRQLR 79

Query: 94  ------------SKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 141
                       + C  +A  + R  C  LA  CFP + GP+A++LH LF+   C AH C
Sbjct: 80  AQQAGDPTLAPHAPCRESAAEQLRESCRDLAAACFPPDAGPAAAKLHRLFVEGVCAAHTC 139

Query: 142 PRKSQP---FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 198
             + Q          ++ ++ Y  +  L L  + IFY  +++Y  + R G+     + R+
Sbjct: 140 DLELQQDLLVQLAASEKGTLGYKLLIGLALGAMAIFYACLWVYH-QYRSGSLSRGDVRRL 198

Query: 199 GRK 201
            RK
Sbjct: 199 RRK 201


>gi|308807625|ref|XP_003081123.1| zinc finger (ISS) [Ostreococcus tauri]
 gi|116059585|emb|CAL55292.1| zinc finger (ISS) [Ostreococcus tauri]
          Length = 241

 Score = 87.0 bits (214), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 101/213 (47%), Gaps = 14/213 (6%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFD 59
           P  E  + EI++ C      D+F + E+HK  V  ++  WR  Y  K F++E ++D+H +
Sbjct: 8   PVAEGAKTEIAATCPFSRIRDVFYEHERHKERVRHSKQYWRSLYSGKVFKSEYYVDKHME 67

Query: 60  NRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLCESLANRC 115
            RH + +      CLAD C  L CD    ++   +    +C+ A +A  +  C  +   C
Sbjct: 68  RRHGDKIPPGANVCLADHCEVLQCDRHASYMNGERGKLVRCDAAHMADVQAECRKMLKSC 127

Query: 116 FPINQGPSASR---LHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV---FYLSISILTLM 169
           FP   G +A R   +H+   + +CD   C      F        +    +Y+S+  + ++
Sbjct: 128 FPHESGRAAERSRKVHDYLNKYYCDQLTCVGVGGVFELMSTHHETAGASYYVSLGFV-ML 186

Query: 170 LLPIFYLIVYLYQREMRGGTQELKRIPRVGRKA 202
           L+ ++Y +VY +    RG + ++KR  R  R +
Sbjct: 187 LVALYYFLVYSWV-NGRGVSLDVKRAHRRARAS 218


>gi|195999976|ref|XP_002109856.1| hypothetical protein TRIADDRAFT_6027 [Trichoplax adhaerens]
 gi|190587980|gb|EDV28022.1| hypothetical protein TRIADDRAFT_6027, partial [Trichoplax
           adhaerens]
          Length = 202

 Score = 87.0 bits (214), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 73/150 (48%), Gaps = 12/150 (8%)

Query: 3   FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 62
            ++ +Q ++S  C  HP+ D+FR QE+ K +   + W C YC K F  E+FLDQH DNRH
Sbjct: 20  LIQSMQVQLSFDCPFHPERDLFRKQEELKDNAYQSSWTCSYCGKWFYRERFLDQHLDNRH 79

Query: 63  NNLL-NVSHGKCLADLCGALHCDFVMNSKSSRSK----------CNPAAVAKNRHLCESL 111
           + LL  V +  CLA+ C  L CD      ++ +K          C    + + R  C  +
Sbjct: 80  SALLGTVMNATCLANYCDILGCDLAHVQDTTLAKGNDLWWKTALCRSTQMVELRDQCLQI 139

Query: 112 ANRCFPINQGPSASRLHELFLRQFCDAHKC 141
           A +C P     ++S +  + +   C    C
Sbjct: 140 AEQCTP-KSSKASSGVRNIIISNICSRLTC 168


>gi|145350216|ref|XP_001419510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579742|gb|ABO97803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 270

 Score = 86.7 bits (213), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 59/217 (27%), Positives = 104/217 (47%), Gaps = 16/217 (7%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFD 59
           P  +  +Y +   C      D+F + E HK  V  ++  WR  Y  K F++E ++D+H +
Sbjct: 36  PVADGARYALDLGCPFSHARDLFYEHETHKEPVRHSKLYWRSLYSGKVFKSEYYVDKHME 95

Query: 60  NRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSR-SKCNPAAVAKNRHLCESLANR 114
            RH + +  +   CLAD C  L CD    +V N + S+ ++C+   ++  R  C  +  +
Sbjct: 96  RRHGDKIPSTADVCLADYCDVLQCDKHAEYVKNGRGSKPTRCDEEQMSLVRDKCHEMLTQ 155

Query: 115 CFPINQGPS---ASRLHELFLRQFCDAHKCPRKSQPFP--KGGRKQPSV-FYLSISILTL 168
           CFP   G +   A +L+  F + +CD   C      F         P V +Y++++ L  
Sbjct: 156 CFPNENGSTRDNAGKLNVYFTKYYCDQMTCADVDGVFELMSAHHDSPGVGYYIALAFL-F 214

Query: 169 MLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 205
            ++ ++YL VY +    R  + ++KR  RV  +A  S
Sbjct: 215 FIVAMYYLTVYSWTNGRR-VSSDIKR-ARVRARASWS 249


>gi|281201704|gb|EFA75912.1| hypothetical protein PPL_10484 [Polysphondylium pallidum PN500]
          Length = 353

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 94/190 (49%), Gaps = 14/190 (7%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           P V   +Y++   C+L+P  DM    E  KI++  + W+C  C K F  E+FLD H  N 
Sbjct: 144 PVVVDHKYQLPVNCQLNPSLDMLAYLETQKINL-YSIWKCRNCSKQFNTEEFLDLHLKNN 202

Query: 62  HNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQG 121
           H +L+  +   CL+D C  L+C            C+   + K +H C+S+   CFP+NQ 
Sbjct: 203 HASLIPSNATVCLSDFCDMLNC-----KDDPVIACDNKRMDKLKHHCQSIMYNCFPLNQE 257

Query: 122 PSASRLHELFLRQFCDAHKC-------PRKSQPFPKGGRKQPSVFYLSISILTLMLLPIF 174
            S   L+  F +  CD   C       P++S        +  S+     +++T++++ +F
Sbjct: 258 QS-RMLNSHFTKNICDHLTCDGYKRHIPKQSDSNQSNSGRSWSLMKYIAALITIIIIVLF 316

Query: 175 YLIVYLYQRE 184
           YLIV  Y++E
Sbjct: 317 YLIVCFYRKE 326


>gi|390365205|ref|XP_003730770.1| PREDICTED: uncharacterized protein LOC100893077 [Strongylocentrotus
           purpuratus]
          Length = 233

 Score = 83.2 bits (204), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 94/195 (48%), Gaps = 14/195 (7%)

Query: 12  SSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSH 70
            ++C L P+ +M+ D E+HK     N W C YC KSF  E+FLD+H+DNRH + +L    
Sbjct: 9   GTECLLKPERNMYWDDERHKWSEIANNWNCLYCGKSFYNEQFLDKHYDNRHGSTVLTGPE 68

Query: 71  GKCLADLCGALHCDFVMNSKSS----RSKCNPAAVAKNRHLCESLANRCFP--INQGPSA 124
             CLAD C  L CD + ++K +     + C    +   R  C+++   C P  I      
Sbjct: 69  AVCLADHCDYLRCDIIASTKKTSFWEEALCKQKDLLVLRKKCQNIMKSCIPSHITDAEKN 128

Query: 125 SRLHELFLRQFCDAHKCPRKSQ-PFPKGGRKQPSVFYLSISILTLMLLPIFYLI----VY 179
           S L +L     C    C +  + P+ +    + + ++++  I T+    ++Y++    +Y
Sbjct: 129 SLLDKL-DDTVCSFLTCEKYWENPYKEVPPSRTAAYFIA-GIFTVFGFIMYYVMACSHLY 186

Query: 180 LYQREMRGGTQELKR 194
             ++E  G +    R
Sbjct: 187 ADEKEFLGDSDGFVR 201


>gi|328865036|gb|EGG13422.1| hypothetical protein DFA_11183 [Dictyostelium fasciculatum]
          Length = 306

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/198 (26%), Positives = 98/198 (49%), Gaps = 24/198 (12%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
            FV+  +Y++ S C  + + DM    +  K   + +++RC  C K F++E ++DQH  N 
Sbjct: 93  SFVDYNEYKLPSTCVFNKNLDMLAVFQSLKFLQNDHQYRCKSCSKQFKSEDYIDQHLKNN 152

Query: 62  HNNLLNVSHGKCLADLCGALHC--DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPIN 119
           H +L+  +   CL D C  L+C  D++++       C+   + K ++ C+ +   CFP  
Sbjct: 153 HADLIPSNATICLGDYCEMLNCNNDYIIS-------CDERKMEKLKYACQGIMYNCFP-- 203

Query: 120 QGPSASR-LHELFLRQFCDAHKCPRKSQPFPKGG----------RKQPSVFYLSISILTL 168
             PS ++  ++ F    CD   C   S    + G          + Q S+     +++ L
Sbjct: 204 --PSVNKNYNKHFNALICDKLTCKGGSSTTGQDGIIRDADIEEKQHQWSLMKYLSALVIL 261

Query: 169 MLLPIFYLIVYLYQREMR 186
           ++L IFYLI++ +++E R
Sbjct: 262 VILIIFYLILWFFKKETR 279


>gi|383857833|ref|XP_003704408.1| PREDICTED: uncharacterized protein LOC100876283 [Megachile
           rotundata]
          Length = 440

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP +++ Q E+  +C  H   D+FR Q++ K     ++W CG C KSF AEK LD HFDN
Sbjct: 45  MPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 104

Query: 61  RHNNLLNVSH-GKCLADLCGALHCDFVMN 88
           RH + +N +    CLAD C  + CD + N
Sbjct: 105 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 133


>gi|66522619|ref|XP_393929.2| PREDICTED: hypothetical protein LOC410449 [Apis mellifera]
          Length = 454

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP +++ Q E+  +C  H   D+FR Q++ K     ++W CG C KSF AEK LD HFDN
Sbjct: 45  MPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 104

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
           RH + +N +    CLAD C  + CD + N
Sbjct: 105 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 133


>gi|380017517|ref|XP_003692701.1| PREDICTED: uncharacterized protein LOC100864708 [Apis florea]
          Length = 454

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP +++ Q E+  +C  H   D+FR Q++ K     ++W CG C KSF AEK LD HFDN
Sbjct: 45  MPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRSSQWTCGLCGKSFYAEKHLDAHFDN 104

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
           RH + +N +    CLAD C  + CD + N
Sbjct: 105 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 133


>gi|307188422|gb|EFN73179.1| hypothetical protein EAG_05305 [Camponotus floridanus]
          Length = 441

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP +++ Q E+  +C  H   D+FR Q++ K     ++W CG C KSF AEK LD HFDN
Sbjct: 39  MPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDVHFDN 98

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
           RH + +N +    CLAD C  + CD + N
Sbjct: 99  RHKSNVNTAEDAVCLADYCDIMRCDVLGN 127


>gi|340723433|ref|XP_003400094.1| PREDICTED: hypothetical protein LOC100643092 [Bombus terrestris]
          Length = 453

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP +++ Q E+  +C  H   D+FR Q++ K     ++W CG C KSF AEK LD HFDN
Sbjct: 48  MPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 107

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
           RH + +N +    CLAD C  + CD + N
Sbjct: 108 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 136


>gi|350427747|ref|XP_003494865.1| PREDICTED: hypothetical protein LOC100742329 [Bombus impatiens]
          Length = 450

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           MP +++ Q E+  +C  H   D+FR Q++ K     ++W CG C KSF AEK LD HFDN
Sbjct: 48  MPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 107

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
           RH + +N +    CLAD C  + CD + N
Sbjct: 108 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 136


>gi|156386657|ref|XP_001634028.1| predicted protein [Nematostella vectensis]
 gi|156221106|gb|EDO41965.1| predicted protein [Nematostella vectensis]
          Length = 257

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/179 (25%), Positives = 77/179 (43%), Gaps = 6/179 (3%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P  +Q + ++   C LH  +D+F   E +K    ++ W C +C K+F +E ++D H  N
Sbjct: 47  FPAYDQHKVDVPLSCLLHETHDVFIRHEINKEEEHVSRWTCNFCGKAFYSEHYIDMHMQN 106

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSKCNPAAVAKNRHLCESLANRCF 116
           RH + L      CLAD C    C  + + K      R+ C    +A  +  CE L + C 
Sbjct: 107 RHKDFLIQDSSVCLADYCDVFRCVVLNDQKKEWFWERALCKHEKMAALKRRCEVLMHGCM 166

Query: 117 PINQGPSAS--RLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 173
           P+  G S +  +++E      C    C    +P  +       + Y   S   +M L +
Sbjct: 167 PVANGTSDTPYKVYETVWSNVCSLLTCKDYWKPPYEEMPTWRIIVYAVFSPFFVMALIV 225


>gi|242021247|ref|XP_002431057.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212516286|gb|EEB18319.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 428

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 59/108 (54%), Gaps = 12/108 (11%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ Q +I  +C  HP  D+F  Q+  K     ++W CG+C KSF  E++L+ HFDN
Sbjct: 43  IPILEKYQVKIPVECPFHPIRDIFGPQQAAKHQFRSSQWSCGFCGKSFYEERYLELHFDN 102

Query: 61  RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHL 107
            H N +NV+    CLAD C  + C+ ++  +            KN+HL
Sbjct: 103 IHKNQINVAEDAVCLADYCDIMRCEVLVTQE-----------IKNKHL 139


>gi|312378286|gb|EFR24907.1| hypothetical protein AND_10215 [Anopheles darlingi]
          Length = 630

 Score = 76.3 bits (186), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 68/125 (54%), Gaps = 3/125 (2%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ Q ++  +C  HP  D+F  Q+  K     ++W CG+C KSF  E+ LD HF+N
Sbjct: 65  LPILEKYQVKLPLECPFHPMRDIFGPQQSAKKQHRPSQWTCGFCGKSFFEEQHLDMHFEN 124

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFP 117
           RH   +N++    CLAD C  + C+ ++   ++ +   +P AV+ +  +  E+ A R   
Sbjct: 125 RHKTNINMAEDAICLADYCDMMRCEVLIAKDATLAFGGDPNAVSTDIEVWSEATAYRTAL 184

Query: 118 INQGP 122
              GP
Sbjct: 185 TTAGP 189


>gi|270007090|gb|EFA03538.1| hypothetical protein TcasGA2_TC013541 [Tribolium castaneum]
          Length = 423

 Score = 76.3 bits (186), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ +  +  +C  HP  D+F  Q+  K     ++W CG C KSF  EK+LD HFDN
Sbjct: 48  IPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQHRPSQWTCGLCGKSFFEEKYLDMHFDN 107

Query: 61  RHNNLLNVSH-GKCLADLCGALHCDFVMN 88
           RH   +N++    CLAD C  + CD ++ 
Sbjct: 108 RHKGFINMAEDAVCLADYCDIMRCDVLIT 136


>gi|158296785|ref|XP_317135.4| AGAP008328-PA [Anopheles gambiae str. PEST]
 gi|157014878|gb|EAA12581.4| AGAP008328-PA [Anopheles gambiae str. PEST]
          Length = 410

 Score = 76.3 bits (186), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 73/153 (47%), Gaps = 16/153 (10%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ Q ++  +C  HP  D+F  Q+  K     ++W CG+C KSF  EK LD HF+N
Sbjct: 46  LPILEKYQVKLPLECPFHPLRDIFGPQQSAKKQHRPSQWTCGFCGKSFFEEKHLDMHFEN 105

Query: 61  RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFP 117
           RH   +N +    CLAD C  + C+ ++   ++ +   +P AV+ +  +  E+ A R   
Sbjct: 106 RHRTNINTAEDAVCLADYCDMMRCEVLIAKDATLAFGSDPNAVSTDIEVWSEATAYRTAL 165

Query: 118 INQGPSASRLHELFLRQFCDAHKCPRKSQPFPK 150
              GP              D  K P +    PK
Sbjct: 166 STSGPR-------------DVAKVPERKSLLPK 185


>gi|91082837|ref|XP_969705.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 350

 Score = 75.9 bits (185), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 1/89 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ +  +  +C  HP  D+F  Q+  K     ++W CG C KSF  EK+LD HFDN
Sbjct: 48  IPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQHRPSQWTCGLCGKSFFEEKYLDMHFDN 107

Query: 61  RHNNLLNVSH-GKCLADLCGALHCDFVMN 88
           RH   +N++    CLAD C  + CD ++ 
Sbjct: 108 RHKGFINMAEDAVCLADYCDIMRCDVLIT 136


>gi|168022196|ref|XP_001763626.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685119|gb|EDQ71516.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 285

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 12/109 (11%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           M +V   +Y +   C  H DND  R+QE+    + + EW+C  CK  FR+ +FLD HFDN
Sbjct: 86  MLYVHSQKYTLPKNCHFHFDNDFSREQEKSMDELGLTEWQCRCCKIVFRSREFLDTHFDN 145

Query: 61  RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCE 109
           RH+  L+     CL+            +    ++ C PA + KNRH CE
Sbjct: 146 RHSEKLDTVVFCCLS------------SQPKKQATCKPAVMEKNRHSCE 182


>gi|170038289|ref|XP_001846984.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881843|gb|EDS45226.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 616

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 67/125 (53%), Gaps = 3/125 (2%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ Q ++  +C  HP  D+F  Q+  K     ++W CG+C KSF  EK LD HF++
Sbjct: 120 LPILEKYQVKLPLECPFHPLRDIFGPQQTAKKQHRPSQWTCGFCGKSFFEEKHLDMHFEH 179

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFP 117
           RH   +N++    CLAD C  + C+ ++   ++ S   +P  ++ +  +  ES A R   
Sbjct: 180 RHKGNINMAEDAICLADYCDMMRCEVLIAKDATLSFGVDPNMMSTDIEVWSESTAYRTAL 239

Query: 118 INQGP 122
              GP
Sbjct: 240 TTSGP 244


>gi|226481637|emb|CAX73716.1| Zinc finger, C2H2-type domain-containing protein [Schistosoma
           japonicum]
          Length = 254

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/187 (24%), Positives = 79/187 (42%), Gaps = 10/187 (5%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           P  +     ++ +C  +P +D++   EQ K  +   +W C  C K F  E+  D H  NR
Sbjct: 44  PLFQYANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQLCGKRFYTERSFDLHIGNR 103

Query: 62  HN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLAN 113
           H  N  +++   CL+  C  L C       D+        + C+P +       CE + N
Sbjct: 104 HKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEALCDPLSFEDISKQCEDVIN 163

Query: 114 RCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 173
           +C P+N   S  +L +L     CD   C R     P    K P +     +++    L I
Sbjct: 164 KCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKTPVLQKFLTALIVTAGLLI 221

Query: 174 FYLIVYL 180
           +Y ++++
Sbjct: 222 YYSVIFM 228


>gi|56754925|gb|AAW25645.1| SJCHGC03203 protein [Schistosoma japonicum]
          Length = 254

 Score = 73.2 bits (178), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 83/189 (43%), Gaps = 12/189 (6%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P  +     ++ +C  +P +D++   EQ K  +   +W C  C K F  E+  D H  N
Sbjct: 43  FPLFQFANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQLCGKRFYTERSFDLHIGN 102

Query: 61  RHN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLA 112
           RH  N  +++   CL+  C  L C       D+        + C+P +       CE + 
Sbjct: 103 RHKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEALCDPLSFEDISKQCEDVI 162

Query: 113 NRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV-FYLSISILTLMLL 171
           N+C P+N   S  +L +L     CD   C R     P    K P +  +L+  I+T  LL
Sbjct: 163 NKCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKTPVLQKFLTAFIVTAGLL 220

Query: 172 PIFYLIVYL 180
            I+Y ++++
Sbjct: 221 -IYYSVIFM 228


>gi|195118254|ref|XP_002003655.1| GI21701 [Drosophila mojavensis]
 gi|193914230|gb|EDW13097.1| GI21701 [Drosophila mojavensis]
          Length = 639

 Score = 73.2 bits (178), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 1/95 (1%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           P +++ Q ++  +C LHP  D+F  +++ K      +W C  C KSF  EK+LD HFD R
Sbjct: 44  PILDKHQVKLPLECPLHPLRDVFAPRQEEKQRDRPTQWTCRLCGKSFYQEKYLDLHFDTR 103

Query: 62  HNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSK 95
           H +++N +    CLADLC  + C+      +S  K
Sbjct: 104 HKHIINEAEDSVCLADLCDIMRCEVFQTEDASSLK 138


>gi|157134250|ref|XP_001663208.1| hypothetical protein AaeL_AAEL013023 [Aedes aegypti]
 gi|108870545|gb|EAT34770.1| AAEL013023-PA [Aedes aegypti]
          Length = 1069

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 66/125 (52%), Gaps = 3/125 (2%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ Q  +  +C  HP  D+F  Q+  K     ++W CG C KSF  EK LD HF+N
Sbjct: 61  LPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWTCGLCGKSFFEEKHLDMHFEN 120

Query: 61  RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHLC-ESLANRCFP 117
           RH + +N++    CLAD C  + C+ ++   ++ S   +P  ++ +  +  E+ A R   
Sbjct: 121 RHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLSFGGDPNMMSTDIEVWNEATAYRTAL 180

Query: 118 INQGP 122
              GP
Sbjct: 181 STAGP 185



 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 66/125 (52%), Gaps = 3/125 (2%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ Q  +  +C  HP  D+F  Q+  K     ++W CG C KSF  EK LD HF+N
Sbjct: 571 LPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWTCGLCGKSFFEEKHLDMHFEN 630

Query: 61  RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHLC-ESLANRCFP 117
           RH + +N++    CLAD C  + C+ ++   ++ S   +P  ++ +  +  E+ A R   
Sbjct: 631 RHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLSFGGDPNMMSTDIEVWNEATAYRTAL 690

Query: 118 INQGP 122
              GP
Sbjct: 691 STAGP 695


>gi|298711019|emb|CBJ26414.1| C2H2 zinc finger protein [Ectocarpus siliculosus]
          Length = 232

 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 79/160 (49%), Gaps = 36/160 (22%)

Query: 2   PFVEQ-VQYEISSKCRLHPDNDMFRDQEQHK--IHVDINEWRCGYCKKSFRAEKFLDQHF 58
           PF+++     I   C LHP+ D+F+DQE HK  +   ++ ++CGYC+K F+ E ++D+H 
Sbjct: 26  PFLDKDPALAIPLACPLHPELDVFKDQELHKRLVPGKLDTFQCGYCQKQFKTEFYMDRHM 85

Query: 59  DNRHNNLLNVSH------------GKCLADLCGALHC-DFVMNSKSSR------------ 93
           DN+H + LN++             G+CL DLC  L C D+  +S S              
Sbjct: 86  DNKHADRLNLAGSDAIGHGIEPAPGRCLGDLCPELGCGDYAADSCSVEIHAGGGSSLGTG 145

Query: 94  --------SKCNPAAVAKNRHLCESLANRCFPINQGPSAS 125
                     C+PA + + R  C +L + CF  + G + S
Sbjct: 146 AGTIRRGCGACDPADMERRRRRCRALFHGCFDDSTGSAGS 185


>gi|195437634|ref|XP_002066745.1| GK24396 [Drosophila willistoni]
 gi|194162830|gb|EDW77731.1| GK24396 [Drosophila willistoni]
          Length = 754

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 34/95 (35%), Positives = 50/95 (52%), Gaps = 1/95 (1%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK+LD HFD R
Sbjct: 49  PILDKHQVKLPMECPLHPFRDIFAPRQDSKQRDRPTQWTCRRCGKSFYQEKYLDLHFDIR 108

Query: 62  HNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           H  L+N V    CL+D C  + CD      ++  K
Sbjct: 109 HETLINEVEDSICLSDFCDIMRCDVFQTEDATSLK 143


>gi|195388338|ref|XP_002052837.1| GJ17779 [Drosophila virilis]
 gi|194149294|gb|EDW64992.1| GJ17779 [Drosophila virilis]
          Length = 625

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 71/162 (43%), Gaps = 14/162 (8%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK+LD HFD 
Sbjct: 43  TPILDKHQVKLPLECPLHPLRDVFAPRQDEKQRDRPTQWTCRLCGKSFYQEKYLDLHFDT 102

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSKCNPA-----------AVAKNRHLC 108
           RH +++N      CLAD C  + C+      +S  K               ++ +N  L 
Sbjct: 103 RHKHIINEAEDAVCLADFCDIMRCEVFQTEDASSLKFGDQHIVTDIEVWGDSLGQNSALA 162

Query: 109 ESLAN--RCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPF 148
           ++ A      P     SA+R  ++  RQ       P K QP 
Sbjct: 163 KANAAYLSLIPRTSTLSATRAAKVQNRQLLQEKPSPSKEQPL 204


>gi|303280527|ref|XP_003059556.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459392|gb|EEH56688.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 324

 Score = 70.9 bits (172), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 69/177 (38%), Gaps = 43/177 (24%)

Query: 8   QYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLN 67
            +E+   C      DM+ +QE+HK  V   +W+  Y +K FR E  +D+H DNRH + + 
Sbjct: 59  SFEVPPACPFQRSKDMYLEQEKHKERVRRTQWKSLYSEKVFRTEWHVDKHMDNRHADKIP 118

Query: 68  VSHGKCLADLCGALHCD-------------FVMNSKSSRSK----------------CNP 98
                CLAD C  L CD                    SRSK                C+ 
Sbjct: 119 PGADVCLADYCAVLRCDEHHAHRNPDDPSRSGSRGAPSRSKGGGGGGAGARRARRERCDE 178

Query: 99  AAVAKNRHLCESLANRCFPINQGPSA--------------SRLHELFLRQFCDAHKC 141
           + +A  RH CE+L + CFP +    +               RL   F R  CD   C
Sbjct: 179 SKLAGARHFCETLMHACFPSDDDAESDGDRGGGARGHGAAGRLRAYFTRHHCDQLTC 235


>gi|195032746|ref|XP_001988553.1| GH11227 [Drosophila grimshawi]
 gi|193904553|gb|EDW03420.1| GH11227 [Drosophila grimshawi]
          Length = 635

 Score = 68.2 bits (165), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++   C LHP  D+F  ++  K      +W C  C KSF  EKFLD HFD 
Sbjct: 43  TPILDKHQVKLPLDCPLHPLRDVFAPRQDEKQRDRPTQWTCRRCGKSFYQEKFLDLHFDT 102

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSK 95
           RH +++N +    CLAD C  + C+      +S  K
Sbjct: 103 RHKHIINEAEDSVCLADFCDIMRCEVFQTEDASSLK 138


>gi|357621060|gb|EHJ73029.1| hypothetical protein KGM_12603 [Danaus plexippus]
          Length = 425

 Score = 67.4 bits (163), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 47/88 (53%), Gaps = 1/88 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
           +P +E+ Q ++  +C  HP  D+F  Q   K     ++W C +C KSF  E+ LD HFD 
Sbjct: 48  LPVLERFQVKLPLECPFHPARDIFAPQHAAKQQHRSSQWTCAFCGKSFYEERHLDTHFDQ 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVM 87
           RH + +N      CLAD C  + C  ++
Sbjct: 108 RHRHQINKAEDAVCLADYCDIMRCQVLV 135


>gi|326435960|gb|EGD81530.1| hypothetical protein PTSG_02249 [Salpingoeca sp. ATCC 50818]
          Length = 334

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 53/111 (47%), Gaps = 10/111 (9%)

Query: 11  ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 70
           ++  C L P  D++  Q++H+      +  C YC KSFR E FLD H  NRH   L V  
Sbjct: 80  LTPSCPLDPTRDVYARQDEHRDMEGTTKLTCLYCGKSFRGEPFLDLHMHNRHAGELLVPD 139

Query: 71  GK--CLADLCGALHCDFVMN--------SKSSRSKCNPAAVAKNRHLCESL 111
               CLAD C A  C+  +         S     KC+P A+++ +  CE L
Sbjct: 140 TGPLCLADYCDATMCETQLEYRQFARLFSPEHGGKCDPDAMSRRKESCERL 190


>gi|442624983|ref|NP_001259827.1| CG4133, isoform C [Drosophila melanogaster]
 gi|440213075|gb|AGB92364.1| CG4133, isoform C [Drosophila melanogaster]
          Length = 583

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           RH +++N      CLAD C  + C+      +S  K
Sbjct: 108 RHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|442624981|ref|NP_001259826.1| CG4133, isoform B [Drosophila melanogaster]
 gi|440213074|gb|AGB92363.1| CG4133, isoform B [Drosophila melanogaster]
          Length = 642

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           RH +++N      CLAD C  + C+      +S  K
Sbjct: 108 RHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|195350111|ref|XP_002041585.1| GM16746 [Drosophila sechellia]
 gi|194123358|gb|EDW45401.1| GM16746 [Drosophila sechellia]
          Length = 652

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           RH +++N      CLAD C  + C+      +S  K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|194853511|ref|XP_001968176.1| GG24723 [Drosophila erecta]
 gi|190660043|gb|EDV57235.1| GG24723 [Drosophila erecta]
          Length = 669

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           RH +++N      CLAD C  + C+      +S  K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|195470274|ref|XP_002087433.1| GE16858 [Drosophila yakuba]
 gi|194173534|gb|EDW87145.1| GE16858 [Drosophila yakuba]
          Length = 669

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           RH +++N      CLAD C  + C+      +S  K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|195575557|ref|XP_002077644.1| GD23028 [Drosophila simulans]
 gi|194189653|gb|EDX03229.1| GD23028 [Drosophila simulans]
          Length = 667

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           RH +++N      CLAD C  + C+      +S  K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|19920466|ref|NP_608526.1| CG4133, isoform A [Drosophila melanogaster]
 gi|7296200|gb|AAF51492.1| CG4133, isoform A [Drosophila melanogaster]
 gi|18447493|gb|AAL68309.1| RE51073p [Drosophila melanogaster]
 gi|220948886|gb|ACL86986.1| CG4133-PA [synthetic construct]
          Length = 667

 Score = 63.9 bits (154), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSK 95
           RH +++N +    CLAD C  + C+      +S  K
Sbjct: 108 RHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|194766505|ref|XP_001965365.1| GF20663 [Drosophila ananassae]
 gi|190617975|gb|EDV33499.1| GF20663 [Drosophila ananassae]
          Length = 701

 Score = 63.9 bits (154), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  EK LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRRCGKSFYQEKHLDLHFDT 107

Query: 61  RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
           RH +++N      CLAD C  + C+      +S  K
Sbjct: 108 RHKHIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143


>gi|307110360|gb|EFN58596.1| hypothetical protein CHLNCDRAFT_140777 [Chlorella variabilis]
          Length = 192

 Score = 63.2 bits (152), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 72/176 (40%), Gaps = 40/176 (22%)

Query: 11  ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 70
           + + C L P +D +R QE  K  +    WRC  C K F AE+ +D H  NRH +L     
Sbjct: 1   MPAGCPLSPASDRYRHQEAAKETLAQGRWRCILCGKRFTAERHIDLHLANRHAHLEEQQQ 60

Query: 71  GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQG-PSASRLHE 129
                     L  D                       C  LA RCFP + G  SA RLH 
Sbjct: 61  ----------LQED-----------------------CLELAARCFPPHGGDGSAQRLHA 87

Query: 130 LFLRQFCDAHKCPRKSQ-----PFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
           L   + C AH C  + +        +  R+    +  +++   L++L  F+L+++L
Sbjct: 88  LLAGRLCKAHTCDTRRKRSRTLAVAQAHRRWRGRWSFTLAA-GLVVLATFWLLLWL 142


>gi|195147218|ref|XP_002014577.1| GL18880 [Drosophila persimilis]
 gi|194106530|gb|EDW28573.1| GL18880 [Drosophila persimilis]
          Length = 683

 Score = 62.4 bits (150), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 1/113 (0%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  E+ LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRRCGKSFYQERHLDLHFDT 107

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLA 112
           RH  ++N +    CLAD C  + C+      ++  K     V       +SL 
Sbjct: 108 RHKTIINEAMDSICLADFCDIMRCEVFETEDATSLKFGDHIVTDIEVWGDSLG 160


>gi|198473764|ref|XP_001356431.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
 gi|198138098|gb|EAL33495.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
          Length = 684

 Score = 62.0 bits (149), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 1/113 (0%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
            P +++ Q ++  +C LHP  D+F  ++  K      +W C  C KSF  E+ LD HFD 
Sbjct: 48  TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRRCGKSFYQERHLDLHFDT 107

Query: 61  RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLA 112
           RH  ++N +    CLAD C  + C+      ++  K     V       +SL 
Sbjct: 108 RHKTIINEAMDSICLADFCDIMRCEVFETEDATSLKFGDHIVTDIEVWGDSLG 160


>gi|256083558|ref|XP_002578009.1| hypothetical protein [Schistosoma mansoni]
 gi|350644825|emb|CCD60456.1| hypothetical protein Smp_067940 [Schistosoma mansoni]
          Length = 129

 Score = 61.2 bits (147), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 51/121 (42%), Gaps = 8/121 (6%)

Query: 15  CRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSHGKC 73
           C  HP +D++   EQ K  +   +W C  C K F  E   D H  NRH  N  + S   C
Sbjct: 5   CPFHPKHDIYAIHEQMKNKISDYDWECQMCGKRFYTENTFDLHIGNRHETNAYSTSRTIC 64

Query: 74  LADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASR 126
           L+  C  L C       D+        + C+P +       CE + N+C P     S+++
Sbjct: 65  LSSYCPLLRCSVLKPDVDYGYQVFWDEALCDPKSFEAISKQCEDILNKCIPSGNDSSSTQ 124

Query: 127 L 127
           L
Sbjct: 125 L 125


>gi|449682242|ref|XP_004210029.1| PREDICTED: uncharacterized protein LOC100205425 [Hydra
            magnipapillata]
          Length = 1324

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 43/185 (23%), Positives = 77/185 (41%), Gaps = 18/185 (9%)

Query: 2    PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
            P  E+ Q +I  +C    + +++R  E  K  +     +C +C K F  E +  +H D +
Sbjct: 1105 PIYEKYQLKIPRRCIFLQERNIYRGVEMGKTELSGGHIKCEFCGKLFENENYFYKHMDRK 1164

Query: 62   HNNLLNVSHG-KCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLCESLANRCF 116
            H N+   +    CL+D C    CD    + +N       CN   V K +  CE +   C 
Sbjct: 1165 HENVYVFNKDFVCLSDYCDIFRCDLHEMYPVNPLDF-DVCNKKDVRKLQRKCEKIIYTCL 1223

Query: 117  PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYL 176
            P         +++   +  C    C      F     +  S++Y++     LML+P  + 
Sbjct: 1224 PTEM---QHYVYDSLTKSLCAPLNCNNFYADF-----EDVSIWYIA---FMLMLVPATF- 1271

Query: 177  IVYLY 181
            I++L+
Sbjct: 1272 ILFLF 1276


>gi|358337329|dbj|GAA55703.1| hypothetical protein CLF_108755 [Clonorchis sinensis]
          Length = 463

 Score = 53.5 bits (127), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 11  ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVS- 69
           + ++C  +P  D++  QE  K      +W+C  C K F +E  LD H DNRH + +    
Sbjct: 282 VPNECPWNPHLDIYNLQELMKTQNSARDWQCNQCWKRFYSEHALDLHMDNRHRSTVYTGP 341

Query: 70  HGKCLADLCGALHCDFV 86
           +  CLA  C  L CD +
Sbjct: 342 NATCLALFCPLLRCDVL 358


>gi|391348561|ref|XP_003748515.1| PREDICTED: uncharacterized protein LOC100905503 [Metaseiulus
           occidentalis]
          Length = 505

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 80/210 (38%), Gaps = 35/210 (16%)

Query: 2   PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
           P +++ +  +   C  HP  D+     Q   +   + WRC  C KSF  E+ L QHF   
Sbjct: 52  PVLDRYRVSMPEDCPFHPARDVLV-WPQRDANSITDVWRCPVCGKSFFGEENLTQHFHAH 110

Query: 62  HNNLLNVSHGK---------CLADLCGALHCDFVMNS--------KSSRSKCNPAAVAK- 103
                   H K         CL   C  + CD +            SSR   +P   A  
Sbjct: 111 TGTRGGTQHVKRHITEGDRVCLGTFCDIMRCDVIERQLSEALTPRTSSRKTESPKPKADC 170

Query: 104 NRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK-----------CPRKSQPFPKGG 152
           NR     L NRC  + Q  +   L+ L ++++ D  +           C R  +  P+  
Sbjct: 171 NRQSMAVLENRCKAVMQQCTIGLLNVLSVKEYKDVEEALHQNICSYLTCERYWEEGPRDT 230

Query: 153 RKQPSVFYLSISILTLMLLPIF--YLIVYL 180
           R  P   +L+ +I  L+    F  Y +V++
Sbjct: 231 RFSP---FLASTIGFLIASGFFGCYWVVWI 257


>gi|167519981|ref|XP_001744330.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777416|gb|EDQ91033.1| predicted protein [Monosiga brevicollis MX1]
          Length = 272

 Score = 46.2 bits (108), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 51/130 (39%), Gaps = 13/130 (10%)

Query: 1   MPFVEQVQYEI-----SSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLD 55
           M  +EQV Y       S  C    ++D+F +  +        +  C +C K+F    FLD
Sbjct: 50  MAALEQVLYPFQPLTPSESCPWIVEHDLFYELNRDADVTAKGQPACLFCGKAFENAVFLD 109

Query: 56  QHFDNRH----NNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSKCNPAAVAKNRHL 107
           QH+  RH     +L N     CLAD C  L C            +   C+P      +  
Sbjct: 110 QHYPRRHLPTFLDLPNPDRRLCLADSCDVLQCAAYSEGAELWAFAHQPCSPGQYDMAQVR 169

Query: 108 CESLANRCFP 117
           C +  + CFP
Sbjct: 170 CRATIDTCFP 179


>gi|303390934|ref|XP_003073697.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
           50506]
 gi|303302845|gb|ADM12337.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
           50506]
          Length = 388

 Score = 42.7 bits (99), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 33/50 (66%), Gaps = 5/50 (10%)

Query: 17  LHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRHNN 64
           LHP     +++E + +   +NE  ++CG+C KSF +EKF+  HF+N+H +
Sbjct: 278 LHPKT---KEEELNNMTTIVNENHYKCGFCGKSFESEKFIFNHFNNKHED 324


>gi|68489815|ref|XP_711247.1| potential zinc finger protein [Candida albicans SC5314]
 gi|46432535|gb|EAK92012.1| potential zinc finger protein [Candida albicans SC5314]
          Length = 132

 Score = 42.4 bits (98), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 8/75 (10%)

Query: 15  CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 71
           C  HP  N  F D +Q+ +HV  I+E  C  C K F  +KFL  H D  HN +L +    
Sbjct: 28  CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPDKKFLSLHIDENHNPILQIKQER 87

Query: 72  -----KCLADLCGAL 81
                KC  + C A+
Sbjct: 88  GDKIFKCYTESCKAV 102


>gi|19074624|ref|NP_586130.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
 gi|51702181|sp|Q8SUH6.1|ZA15_ENCCU RecName: Full=Zinc finger C2H2 protein ECU10_0150
 gi|19069266|emb|CAD25734.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
          Length = 388

 Score = 41.2 bits (95), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 30/42 (71%), Gaps = 2/42 (4%)

Query: 25  RDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRHNN 64
           +++E + +   +NE  ++CG+C K+F +EKF+  HF+N+H N
Sbjct: 283 KEEELNNMTTIVNENHYKCGFCGKAFESEKFIFNHFNNKHEN 324


>gi|412990909|emb|CCO18281.1| predicted protein [Bathycoccus prasinos]
          Length = 305

 Score = 41.2 bits (95), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 58/252 (23%), Positives = 88/252 (34%), Gaps = 51/252 (20%)

Query: 1   MPFVEQVQYEISSKCRLHPDNDMFRDQE-----QHKIHVDINEWRCGYCKKSFRAEKFLD 55
            P     + E+S +C      D++R+ E     +H       EW+     K F++E  +D
Sbjct: 34  FPLFTNEKEEVSERCPFKKSRDVYREIESRKTRRHNKRKKEEEWK-DVDGKVFKSEADID 92

Query: 56  QHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNS-------KSSRSKCNPAAVAKNRHL 107
            +   R       + G  CLAD C  L CD    S        +S  + + AA  +    
Sbjct: 93  AYLSGR---FFKSTKGDVCLADYCDVLQCDEYYESLETSSSSSTSSGEKDYAAKTRAGAH 149

Query: 108 CESLANRCFPIN-----QGPSASRLHELFLRQFCDAHKCPR------KSQPFPKGG---- 152
           C +L   CFP       Q  +A R   +F   F   H+  R      + Q +   G    
Sbjct: 150 CYALTKTCFPERSESAVQEKNARRFESIFCDTFVGLHQDGRNAFAHIRKQEYKYPGHVRH 209

Query: 153 ----------------RKQPSVFYLSISILTLMLLPIFYLIVYL---YQREMRGGTQELK 193
                               S  +L IS L   L  IFYL+ +L     R   G   + +
Sbjct: 210 ANTNIDASASPSSSSSSSSSSSSFLIISTLFTFLGFIFYLLFFLAPSINRRASGSDLKAR 269

Query: 194 RIPRVGRKAKPS 205
           +  R   K K +
Sbjct: 270 KGRRTASKKKST 281


>gi|347967063|ref|XP_550770.4| AGAP002041-PA [Anopheles gambiae str. PEST]
 gi|333469774|gb|EAL38493.4| AGAP002041-PA [Anopheles gambiae str. PEST]
          Length = 1932

 Score = 41.2 bits (95), Expect = 0.27,   Method: Composition-based stats.
 Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)

Query: 29  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 87
           +HK+ H D  ++ CG C K+F+ +  L+ H     N        +C AD CG  +CD   
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316

Query: 88  NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 125
            ++S R         +N H    LAN     + GP  S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347


>gi|347967065|ref|XP_003436012.1| AGAP002041-PB [Anopheles gambiae str. PEST]
 gi|333469775|gb|EGK97401.1| AGAP002041-PB [Anopheles gambiae str. PEST]
          Length = 1876

 Score = 41.2 bits (95), Expect = 0.28,   Method: Composition-based stats.
 Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)

Query: 29  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 87
           +HK+ H D  ++ CG C K+F+ +  L+ H     N        +C AD CG  +CD   
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316

Query: 88  NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 125
            ++S R         +N H    LAN     + GP  S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347


>gi|68489768|ref|XP_711271.1| potential zinc finger protein [Candida albicans SC5314]
 gi|46432560|gb|EAK92036.1| potential zinc finger protein [Candida albicans SC5314]
          Length = 132

 Score = 40.8 bits (94), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 8/75 (10%)

Query: 15  CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 71
           C  HP  N  F D +Q+ +HV  I+E  C  C K F  + FL  H D  HN LL +    
Sbjct: 28  CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPNKSFLSLHIDENHNPLLQIKQER 87

Query: 72  -----KCLADLCGAL 81
                KC  + C A+
Sbjct: 88  GDKIFKCYTESCKAV 102


>gi|401827595|ref|XP_003888090.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
           50504]
 gi|392999290|gb|AFM99109.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
           50504]
          Length = 388

 Score = 40.4 bits (93), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 29/40 (72%), Gaps = 2/40 (5%)

Query: 25  RDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRH 62
           +++E + +   +NE  ++CG+C KSF +EKF+  HF+N+H
Sbjct: 283 KEEELNNMTTIVNENHYKCGFCGKSFESEKFIFNHFNNKH 322


>gi|66475454|ref|XP_627543.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
           [Cryptosporidium parvum Iowa II]
 gi|46228996|gb|EAK89845.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
           [Cryptosporidium parvum Iowa II]
          Length = 668

 Score = 40.4 bits (93), Expect = 0.43,   Method: Composition-based stats.
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 28  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           EQH +    + W+CG C K F+ E+++ +H   +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571


>gi|357631664|gb|EHJ79133.1| putative zinc finger protein 91-like protein [Danaus plexippus]
          Length = 390

 Score = 38.5 bits (88), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 28/37 (75%), Gaps = 1/37 (2%)

Query: 29  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 64
           QH + H+DIN+++CG+C K++++ + L +HF   H++
Sbjct: 189 QHLVSHIDINQFKCGFCNKTYKSRQSLKEHFRVAHSS 225


>gi|392568613|gb|EIW61787.1| hypothetical protein TRAVEDRAFT_27265 [Trametes versicolor
           FP-101664 SS1]
          Length = 904

 Score = 38.1 bits (87), Expect = 2.1,   Method: Composition-based stats.
 Identities = 15/47 (31%), Positives = 29/47 (61%)

Query: 20  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           + ++ R  E H    D  ++RC  C+K F+A  F+++H  N+H++L+
Sbjct: 670 EEELSRACEAHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHSDLI 716


>gi|195107861|ref|XP_001998512.1| GI23595 [Drosophila mojavensis]
 gi|193915106|gb|EDW13973.1| GI23595 [Drosophila mojavensis]
          Length = 2064

 Score = 38.1 bits (87), Expect = 2.3,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 5/88 (5%)

Query: 29  QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 87
           +HK+ H D  ++ C  C K+F+ +  L+ H     N        +C A+ CG  +CD   
Sbjct: 351 KHKLTHSDERKYICALCSKAFKRQDHLNGHMMTHRNK----KPYECKAEGCGKSYCDARS 406

Query: 88  NSKSSRSKCNPAAVAKNRHLCESLANRC 115
             + S +    AA   N  L  + +  C
Sbjct: 407 LRRHSENHHGGAATPNNNSLSPTTSTNC 434


>gi|241956272|ref|XP_002420856.1| zinc finger protein, putative [Candida dubliniensis CD36]
 gi|223644199|emb|CAX41009.1| zinc finger protein, putative [Candida dubliniensis CD36]
          Length = 143

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 8/75 (10%)

Query: 15  CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 71
           C  HP  N  F + +Q+ +HV   +E  C  C+K F  + +L  H D  HN LL +    
Sbjct: 39  CNYHPCLNSAFENYQQYHLHVLSTHEHTCESCQKVFPNKNYLSLHIDENHNPLLQIKQER 98

Query: 72  -----KCLADLCGAL 81
                KC  + C A+
Sbjct: 99  GDKIFKCYTETCKAV 113


>gi|409049944|gb|EKM59421.1| hypothetical protein PHACADRAFT_249892 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 924

 Score = 37.7 bits (86), Expect = 3.0,   Method: Composition-based stats.
 Identities = 15/39 (38%), Positives = 25/39 (64%)

Query: 28  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           EQH    D  ++RC  C+K F+A  F+++H  N+H +L+
Sbjct: 693 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPDLV 731


>gi|449549636|gb|EMD40601.1| hypothetical protein CERSUDRAFT_111186 [Ceriporiopsis subvermispora
           B]
          Length = 926

 Score = 37.0 bits (84), Expect = 4.2,   Method: Composition-based stats.
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 28  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           EQH    D  ++RC  C+K F+A  F+++H  N+H  L+
Sbjct: 694 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPELV 732


>gi|67612273|ref|XP_667210.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis TU502]
 gi|54658324|gb|EAL36982.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis]
          Length = 668

 Score = 37.0 bits (84), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 28  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           EQH +    + W+CG C K F+ E+++ +H   +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571


>gi|148698279|gb|EDL30226.1| mCG12246 [Mus musculus]
          Length = 334

 Score = 37.0 bits (84), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 20/35 (57%), Gaps = 1/35 (2%)

Query: 29  QH-KIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 62
           QH +IH+ +  + C YC KSFR    L QH  N H
Sbjct: 270 QHLRIHLGVKPYHCSYCDKSFRQLSHLQQHTRNPH 304


>gi|32398760|emb|CAD98720.1| hypothetical predicted zf-C2H2 zinc finger protein, unknown
           function [Cryptosporidium parvum]
          Length = 598

 Score = 36.6 bits (83), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 28  EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           EQH +    + W+CG C K F+ E+++ +H   +H + L
Sbjct: 463 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 501


>gi|157136418|ref|XP_001663747.1| hypothetical protein AaeL_AAEL013548 [Aedes aegypti]
 gi|108869956|gb|EAT34181.1| AAEL013548-PA [Aedes aegypti]
          Length = 566

 Score = 36.6 bits (83), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 32  IHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNL 65
           IHVD   +RC  C K+F+ +++L +H +N H ++
Sbjct: 303 IHVDPKTYRCEPCDKTFKGKRYLQEHLENVHGSI 336


>gi|409075328|gb|EKM75709.1| hypothetical protein AGABI1DRAFT_87630 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 864

 Score = 36.2 bits (82), Expect = 7.7,   Method: Composition-based stats.
 Identities = 15/47 (31%), Positives = 27/47 (57%)

Query: 20  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           + ++ +  E H    D  ++RC  C+K FRA  F+++H  N+H  L+
Sbjct: 619 EEELSKAAEPHIKQEDEGKFRCKTCQKLFRATPFVEKHIANKHPELV 665


>gi|50424891|ref|XP_461036.1| DEHA2F15532p [Debaryomyces hansenii CBS767]
 gi|49656705|emb|CAG89406.1| DEHA2F15532p [Debaryomyces hansenii CBS767]
          Length = 147

 Score = 36.2 bits (82), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 24 FRDQEQHKIH-VDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK 72
          F D   ++ H VD + + C  C+K F +E FLD H D  HN    +S  K
Sbjct: 40 FNDYLAYEAHIVDTHNFLCLECQKKFPSETFLDIHIDENHNPFFQISKEK 89


>gi|426197981|gb|EKV47907.1| hypothetical protein AGABI2DRAFT_191625 [Agaricus bisporus var.
           bisporus H97]
          Length = 898

 Score = 36.2 bits (82), Expect = 8.2,   Method: Composition-based stats.
 Identities = 15/47 (31%), Positives = 27/47 (57%)

Query: 20  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           + ++ +  E H    D  ++RC  C+K FRA  F+++H  N+H  L+
Sbjct: 653 EEELSKAAEPHIKQEDEGKFRCKTCQKLFRATPFVEKHIANKHPELV 699


>gi|242223936|ref|XP_002477512.1| predicted protein [Postia placenta Mad-698-R]
 gi|220722825|gb|EED77289.1| predicted protein [Postia placenta Mad-698-R]
          Length = 1194

 Score = 36.2 bits (82), Expect = 8.5,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 27/47 (57%)

Query: 20  DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           + ++ +  E H    D  ++RC  C+K F+A  F+++H  N+H  L+
Sbjct: 83  EEELSKAAEPHMKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPELV 129


>gi|393246564|gb|EJD54073.1| hypothetical protein AURDEDRAFT_110707 [Auricularia delicata
           TFB-10046 SS5]
          Length = 850

 Score = 36.2 bits (82), Expect = 8.5,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 27/48 (56%)

Query: 19  PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
           PD ++ +  E      D  ++RC  C K F+A  F+++H  N+H+ L+
Sbjct: 637 PDEELTKAVEPFIKQEDEGKFRCKTCSKLFKATSFVEKHIANKHSELV 684


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.325    0.138    0.444 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,197,713,580
Number of Sequences: 23463169
Number of extensions: 122365442
Number of successful extensions: 498701
Number of sequences better than 100.0: 201
Number of HSP's better than 100.0 without gapping: 122
Number of HSP's successfully gapped in prelim test: 79
Number of HSP's that attempted gapping in prelim test: 498242
Number of HSP's gapped (non-prelim): 437
length of query: 205
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 69
effective length of database: 9,168,204,383
effective search space: 632606102427
effective search space used: 632606102427
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 73 (32.7 bits)