BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028680
(205 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297733623|emb|CBI14870.3| unnamed protein product [Vitis vinifera]
Length = 276
Score = 352 bits (902), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 160/205 (78%), Positives = 179/205 (87%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ QY+ISS C+LHPDN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN
Sbjct: 72 MPFVEREQYQISSSCKLHPDNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 131
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH LLNVSH KCLADLCGALHCD VMNSK+ ++KCNPAA +NRHLCESLA+ CFP+N
Sbjct: 132 RHYGLLNVSHSKCLADLCGALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNH 191
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SASRLH++FLRQFCDAH C +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYL
Sbjct: 192 GLSASRLHDIFLRQFCDAHTCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYL 251
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
YQREMR GTQELKRI R G K KPS
Sbjct: 252 YQREMRNGTQELKRISRHGLKTKPS 276
>gi|359491160|ref|XP_002262859.2| PREDICTED: uncharacterized protein LOC100244377 [Vitis vinifera]
Length = 254
Score = 350 bits (898), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 160/205 (78%), Positives = 179/205 (87%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ QY+ISS C+LHPDN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN
Sbjct: 50 MPFVEREQYQISSSCKLHPDNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 109
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH LLNVSH KCLADLCGALHCD VMNSK+ ++KCNPAA +NRHLCESLA+ CFP+N
Sbjct: 110 RHYGLLNVSHSKCLADLCGALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNH 169
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SASRLH++FLRQFCDAH C +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYL
Sbjct: 170 GLSASRLHDIFLRQFCDAHTCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYL 229
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
YQREMR GTQELKRI R G K KPS
Sbjct: 230 YQREMRNGTQELKRISRHGLKTKPS 254
>gi|224133834|ref|XP_002321672.1| predicted protein [Populus trichocarpa]
gi|222868668|gb|EEF05799.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 348 bits (894), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 158/205 (77%), Positives = 184/205 (89%), Gaps = 2/205 (0%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ QY+ISSKCRLHP ND+FRDQE+HKIHVDINEW+CGYCKK FRAEK+LDQHFDN
Sbjct: 64 MPFVEREQYQISSKCRLHPSNDLFRDQEEHKIHVDINEWKCGYCKKHFRAEKYLDQHFDN 123
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN+S GKC+ADLCGALHCDF+M+SKS+++KCNPAAVAKN HLCESLA+ CFP+NQ
Sbjct: 124 RHYNLLNISDGKCMADLCGALHCDFMMDSKSTKTKCNPAAVAKNHHLCESLADSCFPLNQ 183
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSASRLHELFL QFCDAH C + FPKGG+K+ SVFYL+ISILT+M++P+FYLIVYL
Sbjct: 184 GPSASRLHELFLHQFCDAHTCSGNKKLFPKGGKKKTSVFYLAISILTMMMIPLFYLIVYL 243
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+Q EMR GTQEL+RI + RK KPS
Sbjct: 244 HQSEMRKGTQELRRISK--RKTKPS 266
>gi|226531528|ref|NP_001142988.1| uncharacterized protein LOC100275449 precursor [Zea mays]
gi|195612594|gb|ACG28127.1| zinc finger protein [Zea mays]
Length = 276
Score = 324 bits (830), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 145/205 (70%), Positives = 176/205 (85%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 72 MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 131
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NL++ S G+C+ADLCGALHCD +M K +SKCN AA +NRHLCESLA+ CFPINQ
Sbjct: 132 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPINQ 191
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G +ASRLHE FLRQFCDAH C + ++PFPKGGRKQ + YL++SILTL+LLP+FYLIV+L
Sbjct: 192 GLAASRLHEFFLRQFCDAHTCSKGTKPFPKGGRKQTNRLYLALSILTLILLPLFYLIVFL 251
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QREMR GTQ+LKR ++G+K KPS
Sbjct: 252 HQREMRKGTQDLKRFSKIGQKKKPS 276
>gi|449440588|ref|XP_004138066.1| PREDICTED: uncharacterized protein LOC101218367 [Cucumis sativus]
Length = 280
Score = 324 bits (830), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 144/204 (70%), Positives = 175/204 (85%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+PF+E+ YE+S++CRLHP+ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDN
Sbjct: 76 LPFMEKENYEVSTQCRLHPNNDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDN 135
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLNVSHGKCLADLCGALHCD M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+
Sbjct: 136 RHSNLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINE 195
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSA+RLHELFL QFC AH C K +PF +G +QP +FY++ SIL LMLLPIFY+IVYL
Sbjct: 196 GPSANRLHELFLHQFCGAHSCTGKQKPFSRGAARQPGIFYMASSILILMLLPIFYVIVYL 255
Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
++RE R G + LKRI + GRK KP
Sbjct: 256 HRRESRNGIEVLKRISKAGRKNKP 279
>gi|238013592|gb|ACR37831.1| unknown [Zea mays]
gi|413916779|gb|AFW56711.1| zinc finger protein [Zea mays]
Length = 275
Score = 323 bits (829), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 145/205 (70%), Positives = 176/205 (85%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 71 MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 130
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NL++ S G+C+ADLCGALHCD +M K +SKCN AA +NRHLCESLA+ CFPINQ
Sbjct: 131 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPINQ 190
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G +ASRLHE FLRQFCDAH C + ++PFPKGGRKQ + YL++SILTL+LLP+FYLIV+L
Sbjct: 191 GLAASRLHEFFLRQFCDAHTCSKGTKPFPKGGRKQTNRLYLALSILTLILLPLFYLIVFL 250
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QREMR GTQ+LKR ++G+K KPS
Sbjct: 251 HQREMRKGTQDLKRFSKIGQKKKPS 275
>gi|242083250|ref|XP_002442050.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
gi|241942743|gb|EES15888.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
Length = 276
Score = 323 bits (829), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 145/205 (70%), Positives = 176/205 (85%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKFLDQHF+N
Sbjct: 72 MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFLDQHFEN 131
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NL++ S G+C+ADLCGALHCD +M K +SKCN AA +NRHLCESLA+ CFP+NQ
Sbjct: 132 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPVNQ 191
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G +ASRLHE FLRQFCDAH C + ++PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L
Sbjct: 192 GLAASRLHEFFLRQFCDAHTCNKGTKPFPKGGRKQTNRFYLALCILTLILLPLFYLIVFL 251
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QREMR GTQ+LKR ++G+K KPS
Sbjct: 252 HQREMRKGTQDLKRFSKIGQKKKPS 276
>gi|218186624|gb|EEC69051.1| hypothetical protein OsI_37889 [Oryza sativa Indica Group]
Length = 330
Score = 321 bits (823), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 146/205 (71%), Positives = 174/205 (84%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRLHPDNDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF N
Sbjct: 126 MPFVEKEKYELPSKCRLHPDNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHN 185
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RHNNL++ S G+CLADLCGALHCD ++ K +SKC+ A A+NRHLCESLA+ CFPINQ
Sbjct: 186 RHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQ 245
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SASRLHE FLRQFCDAH C S+PFPKGGRKQ + FYL++ LT++LLP+FYLIV+L
Sbjct: 246 GQSASRLHEFFLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFL 305
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QREM+ G Q L+RI +V +K KPS
Sbjct: 306 HQREMKKGGQNLRRISKVVQKKKPS 330
>gi|296081316|emb|CBI17698.3| unnamed protein product [Vitis vinifera]
Length = 409
Score = 318 bits (814), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 145/205 (70%), Positives = 168/205 (81%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD+HFDN
Sbjct: 205 MPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLDKHFDN 264
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLNVS KCLAD+CGALHCD VM+SK ++KCNPAA A+N HLCESLA+ CFP++
Sbjct: 265 RHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGCFPVSG 324
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSASRLHE FLRQFCDAH C +PF +G +K+ SVFYL ISILTLMLLP+FY VYL
Sbjct: 325 GPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISILTLMLLPLFYFFVYL 384
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
YQR +R G Q +RI GRK KP+
Sbjct: 385 YQRGLRRGAQNFRRISESGRKKKPT 409
>gi|115487926|ref|NP_001066450.1| Os12g0233400 [Oryza sativa Japonica Group]
gi|108862374|gb|ABA96879.2| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|108862375|gb|ABG21933.1| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|108862376|gb|ABG21934.1| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|108862377|gb|ABG21935.1| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113648957|dbj|BAF29469.1| Os12g0233400 [Oryza sativa Japonica Group]
gi|215697514|dbj|BAG91508.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 316 bits (809), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 144/205 (70%), Positives = 172/205 (83%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRLHP NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF N
Sbjct: 71 MPFVEKEKYELPSKCRLHPGNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHN 130
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RHNNL++ S G+CLADLCGALHCD ++ K +SKC+ A A+NRHLCESLA+ CFPINQ
Sbjct: 131 RHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQ 190
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SASRLHE LRQFCDAH C S+PFPKGGRKQ + FYL++ LT++LLP+FYLIV+L
Sbjct: 191 GQSASRLHEFLLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFL 250
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QREM+ G Q L+RI +V +K KPS
Sbjct: 251 HQREMKKGGQNLRRISKVVQKKKPS 275
>gi|359488643|ref|XP_002271064.2| PREDICTED: uncharacterized protein LOC100257742 [Vitis vinifera]
Length = 279
Score = 315 bits (807), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 145/205 (70%), Positives = 168/205 (81%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD+HFDN
Sbjct: 75 MPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLDKHFDN 134
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLNVS KCLAD+CGALHCD VM+SK ++KCNPAA A+N HLCESLA+ CFP++
Sbjct: 135 RHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGCFPVSG 194
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSASRLHE FLRQFCDAH C +PF +G +K+ SVFYL ISILTLMLLP+FY VYL
Sbjct: 195 GPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISILTLMLLPLFYFFVYL 254
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
YQR +R G Q +RI GRK KP+
Sbjct: 255 YQRGLRRGAQNFRRISESGRKKKPT 279
>gi|326487255|dbj|BAJ89612.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326532646|dbj|BAJ89168.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 273
Score = 314 bits (805), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 146/205 (71%), Positives = 169/205 (82%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRL P+NDMFR+QEQHKIH D+NEWRCG+CKKSFRAEKF+DQHF N
Sbjct: 69 MPFVEKEKYELPSKCRLRPENDMFREQEQHKIHFDVNEWRCGFCKKSFRAEKFIDQHFAN 128
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RHNNLL+ S G+CLADLCGALHCD M K +SKCN AA A+NRH CESLA+ CFPINQ
Sbjct: 129 RHNNLLDNSQGRCLADLCGALHCDLTMEFKKPKSKCNDAAAARNRHRCESLADSCFPINQ 188
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SASRLHE FL QFCDAH C S+PFPKGGRKQ + FYL+I ILT++LLP+FYLIV+L
Sbjct: 189 GHSASRLHEFFLHQFCDAHTCNGVSKPFPKGGRKQTNRFYLAICILTVLLLPVFYLIVFL 248
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QREM+ G Q KRI + K KPS
Sbjct: 249 HQREMKKGDQVFKRIGKTVHKKKPS 273
>gi|297793919|ref|XP_002864844.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310679|gb|EFH41103.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 270
Score = 311 bits (796), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 151/205 (73%), Positives = 176/205 (85%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
PFVE+ +Y+I CRLHPDND++RDQEQHKIHVD+ EW+CGYCKKSF EKFLD+HF
Sbjct: 66 TPFVERERYQIPKNCRLHPDNDLYRDQEQHKIHVDVFEWKCGYCKKSFNDEKFLDKHFTT 125
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN + KCLADLCGALHCDFV+NSK ++KCNPAAVAKNRHLCES+AN CFP++Q
Sbjct: 126 RHYNLLNTTDTKCLADLCGALHCDFVLNSKKPKTKCNPAAVAKNRHLCESVANSCFPVSQ 185
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSASRLHE FLRQFCDAH C K +PFP+GG+K+ VFYL+ISILTLMLLP+FYL+V+L
Sbjct: 186 GPSASRLHEHFLRQFCDAHTCTGKDKPFPRGGKKKSGVFYLAISILTLMLLPLFYLLVFL 245
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
YQRE R GTQEL+RI + GRK KPS
Sbjct: 246 YQREKRSGTQELRRIRKTGRKPKPS 270
>gi|356548383|ref|XP_003542582.1| PREDICTED: uncharacterized protein LOC100780668 [Glycine max]
Length = 244
Score = 306 bits (784), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 138/205 (67%), Positives = 169/205 (82%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
PF+E+ Y++SSKC+LHP+ND+F DQE+HKI++D +EWRCGYCKKSFR EKFLDQHFD+
Sbjct: 40 TPFMERENYQLSSKCKLHPENDIFTDQEEHKIYIDRHEWRCGYCKKSFREEKFLDQHFDS 99
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH N LN+SHGKCL DLCGALHCD VMN KSSRSKCNPAA A+NRHLCE+LA+ CFPI++
Sbjct: 100 RHYNFLNLSHGKCLGDLCGALHCDAVMNFKSSRSKCNPAAAARNRHLCENLADNCFPISE 159
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSA RLHELFL QFCDAH C K +PF +GG+ Q S F L+ L L+LLP+FYL +YL
Sbjct: 160 GPSAGRLHELFLHQFCDAHTCSGKHKPFSRGGKDQSSFFRLAAGALILVLLPVFYLFLYL 219
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
Q +++ TQEL+RI + G K+KPS
Sbjct: 220 VQSDVKSRTQELRRISKAGWKSKPS 244
>gi|18424689|ref|NP_568969.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
gi|17381006|gb|AAL36315.1| unknown protein [Arabidopsis thaliana]
gi|21536913|gb|AAM61245.1| zinc finger-like protein [Arabidopsis thaliana]
gi|22136886|gb|AAM91787.1| unknown protein [Arabidopsis thaliana]
gi|332010344|gb|AED97727.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
Length = 271
Score = 300 bits (769), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 145/205 (70%), Positives = 173/205 (84%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
PFVE+ +YEI CRLHPDND++RDQE HK+HVD+ EW+CGYCKKSF EKFLD+HF
Sbjct: 67 TPFVERERYEIPKNCRLHPDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFST 126
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN + KCLADLCGALHCDFV++SK +SKCNP AVAKNRHLCES+AN CFP++Q
Sbjct: 127 RHYNLLNTTDTKCLADLCGALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQ 186
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSASRLHE FLRQFCDAH C +PFP+GG+K+ VFYL+ISILTLMLLP+FYL+V+L
Sbjct: 187 GPSASRLHEHFLRQFCDAHTCTGNDKPFPRGGKKKSGVFYLAISILTLMLLPLFYLLVFL 246
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QRE R GTQ+L+RI + G+K KPS
Sbjct: 247 HQREKRSGTQDLRRIIKSGKKTKPS 271
>gi|222616852|gb|EEE52984.1| hypothetical protein OsJ_35656 [Oryza sativa Japonica Group]
Length = 1075
Score = 296 bits (759), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 131/182 (71%), Positives = 155/182 (85%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRLHP NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF N
Sbjct: 859 MPFVEKEKYELPSKCRLHPGNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHN 918
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RHNNL++ S G+CLADLCGALHCD ++ K +SKC+ A A+NRHLCESLA+ CFPINQ
Sbjct: 919 RHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQ 978
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SASRLHE LRQFCDAH C S+PFPKGGRKQ + FYL++ LT++LLP+FYLIV+L
Sbjct: 979 GQSASRLHEFLLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFL 1038
Query: 181 YQ 182
+Q
Sbjct: 1039 HQ 1040
>gi|224060201|ref|XP_002300082.1| predicted protein [Populus trichocarpa]
gi|222847340|gb|EEE84887.1| predicted protein [Populus trichocarpa]
Length = 224
Score = 296 bits (758), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 129/203 (63%), Positives = 166/203 (81%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y+ISS+CRLHP+ND++RDQEQHK+HVDINEWRCGYCKK+F EK+LD+HFDN
Sbjct: 21 MPFVEKERYKISSRCRLHPENDLYRDQEQHKMHVDINEWRCGYCKKTFYEEKYLDKHFDN 80
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH +LLNVSH KCLAD+CG LHCD V++S ++KCNPAA +N+HLCESLA+ CFP+N+
Sbjct: 81 RHYDLLNVSHSKCLADVCGVLHCDLVLDSAPHKTKCNPAATGRNKHLCESLADSCFPVNE 140
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SA RL+E FLRQFCDAH C +PF KGG+K+ S Y+ IS+L LMLL +FY+ +YL
Sbjct: 141 GTSALRLNEFFLRQFCDAHTCSGGRKPFSKGGKKETSTLYVIISVLVLMLLALFYIFMYL 200
Query: 181 YQREMRGGTQELKRIPRVGRKAK 203
Y R ++ G+Q LK I + G+K +
Sbjct: 201 YLRGIKRGSQGLKHISKSGQKKR 223
>gi|302398695|gb|ADL36642.1| C2H2L domain class transcription factor [Malus x domestica]
Length = 274
Score = 292 bits (748), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 134/207 (64%), Positives = 167/207 (80%), Gaps = 2/207 (0%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y+IS C+LHPDND++RDQE+HKI VD+NEW+CGYCKK F +KFLD+HFDN
Sbjct: 68 MPFVEKERYQISRTCKLHPDNDLYRDQEEHKIQVDLNEWQCGYCKKRFYDDKFLDKHFDN 127
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNS-KSSRSKCNPAAVAKNRHLCESLANRCFPIN 119
RH NLLNVSH +CLAD+CGALHCD M+S ++KCNPAA A+NRHLCE LA+ CFP+
Sbjct: 128 RHYNLLNVSHTRCLADVCGALHCDLEMDSVPPKKTKCNPAAAARNRHLCEGLADSCFPVK 187
Query: 120 QGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQ-PSVFYLSISILTLMLLPIFYLIV 178
QGP+ASRLHE FLRQFCD+H C + +PF +G RK+ S+ YL ISILTL LL +FY +
Sbjct: 188 QGPAASRLHEFFLRQFCDSHTCTGRQKPFSRGRRKKRASISYLVISILTLALLFLFYSYI 247
Query: 179 YLYQREMRGGTQELKRIPRVGRKAKPS 205
Y+Y R ++ GTQELKR+ +GRK KPS
Sbjct: 248 YMYSRGVKRGTQELKRVTPIGRKKKPS 274
>gi|297805600|ref|XP_002870684.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297316520|gb|EFH46943.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 271
Score = 283 bits (723), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 129/204 (63%), Positives = 167/204 (81%), Gaps = 1/204 (0%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP+VE+ +Y++ S CR+H DND++R+QE+HK+H DINEWRCG+CKK+F EK+LD+HFD+
Sbjct: 68 MPYVEKERYQLPSSCRVHRDNDIYREQEEHKVHSDINEWRCGFCKKAFYEEKYLDKHFDS 127
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN SHGKCLADLCGALHCD V+N+ +SKCNPAA A+NRHLCESLAN CFP+N+
Sbjct: 128 RHYNLLNASHGKCLADLCGALHCDLVVNTAQLKSKCNPAASARNRHLCESLANSCFPVNK 187
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GPSA+RLH+ FLRQFCDAH C S+PF + +K+ + Y+ ISI TL++L ++Y VYL
Sbjct: 188 GPSANRLHDFFLRQFCDAHTCSGGSRPFSQKPKKRGKL-YIIISISTLIVLLLYYSFVYL 246
Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
+QR ++ G+QELKRI R G K KP
Sbjct: 247 FQRGLKRGSQELKRIRRNGLKKKP 270
>gi|357160391|ref|XP_003578749.1| PREDICTED: uncharacterized protein LOC100823211 [Brachypodium
distachyon]
Length = 274
Score = 280 bits (717), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/205 (70%), Positives = 171/205 (83%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRLH +NDMFR+QE+HKIH D+NEWRCG+CKKSFRAEK++DQHF N
Sbjct: 70 MPFVEKEKYELPSKCRLHAENDMFREQEEHKIHFDVNEWRCGFCKKSFRAEKYIDQHFSN 129
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RHNNLL+ S G+CLADLCGALHCD M K +SKCN AA A+N HLCESLA+ CFP+NQ
Sbjct: 130 RHNNLLDNSQGRCLADLCGALHCDLTMEFKKPKSKCNAAAAARNCHLCESLADNCFPVNQ 189
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SASRLHE FLRQFCDAH C S+PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L
Sbjct: 190 GHSASRLHEFFLRQFCDAHTCNGDSKPFPKGGRKQTNRFYLALCILTLLLLPLFYLIVFL 249
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
+QREM+ G Q KRIP+ K KPS
Sbjct: 250 HQREMKKGVQVFKRIPKTVHKKKPS 274
>gi|255540381|ref|XP_002511255.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223550370|gb|EEF51857.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 217
Score = 275 bits (704), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 122/157 (77%), Positives = 144/157 (91%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y+ISSKCRLHP+ND+FRDQE+HKIHVDINEW+CGYCKKSFRAE+FLDQHF+N
Sbjct: 40 MPFVERERYQISSKCRLHPENDLFRDQEEHKIHVDINEWKCGYCKKSFRAERFLDQHFEN 99
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN+S GKCLADLCGALHCDFV+N+K+ ++KCNPAAVAKNRHLCESLA+ CFP+NQ
Sbjct: 100 RHYNLLNISDGKCLADLCGALHCDFVLNTKTMKTKCNPAAVAKNRHLCESLADSCFPLNQ 159
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPS 157
GPSASRLHELF+ QFCDAH C K + FPKGG++ +
Sbjct: 160 GPSASRLHELFVHQFCDAHTCSGKRKLFPKGGKRNEA 196
>gi|449455886|ref|XP_004145681.1| PREDICTED: uncharacterized protein LOC101219582 [Cucumis sativus]
gi|449492886|ref|XP_004159131.1| PREDICTED: uncharacterized LOC101219582 [Cucumis sativus]
Length = 281
Score = 274 bits (701), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 128/205 (62%), Positives = 162/205 (79%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y+IS+KCRLHPDNDMFRDQEQHK H+D N+W+CGYC+K F EK++DQHFDN
Sbjct: 77 MPFVEKKKYKISTKCRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYIDQHFDN 136
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLNVS +CLADLCGALHCD V+++ S +SKCNPAA A+N+H+CE LA+ CFP+++
Sbjct: 137 RHYNLLNVSRNRCLADLCGALHCDHVIDAVSQKSKCNPAAAARNKHMCEGLADSCFPVDE 196
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G AS LHE FL QFCDAH C K +PF +G + + SVFY+ IS+LT++ + FY+ YL
Sbjct: 197 GALASHLHEFFLHQFCDAHTCSGKPKPFSRGRQVRRSVFYIVISVLTILFVMFFYVFFYL 256
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
Y R MR Q LKR+ + GRK KPS
Sbjct: 257 YNRGMRTRPQVLKRLSQSGRKKKPS 281
>gi|10177309|dbj|BAB10570.1| unnamed protein product [Arabidopsis thaliana]
Length = 277
Score = 273 bits (699), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 129/211 (61%), Positives = 160/211 (75%), Gaps = 6/211 (2%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
PFVE+ +YEI CRLHPDND++RDQE HK+HVD+ EW+CGYCKKSF EKFLD+HF
Sbjct: 67 TPFVERERYEIPKNCRLHPDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFST 126
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN + KCLADLCGALHCDFV++SK +SKCNP AVAKNRHLCES+AN CFP++Q
Sbjct: 127 RHYNLLNTTDTKCLADLCGALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQ 186
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRK---QPSVF-YLSISILTLMLLPIF-- 174
GPSASRLHE FLRQFCDAH C +PFP+GG+ + +V+ YL + + + IF
Sbjct: 187 GPSASRLHEHFLRQFCDAHTCTGNDKPFPRGGKSSYIKGTVWCYLFLFSIIALSSEIFGH 246
Query: 175 YLIVYLYQREMRGGTQELKRIPRVGRKAKPS 205
+ ++ RE R GTQ+L+RI + G+K KPS
Sbjct: 247 IAVSTVFNREKRSGTQDLRRIIKSGKKTKPS 277
>gi|356567332|ref|XP_003551875.1| PREDICTED: uncharacterized protein LOC100527694 [Glycine max]
Length = 268
Score = 272 bits (695), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 134/205 (65%), Positives = 166/205 (80%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF EK LDQHFDN
Sbjct: 64 MPFVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDN 123
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLN+S +C+AD+CGALHCD MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +
Sbjct: 124 RHSNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTE 183
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GP+ASRLHE FL QFCDAH C +PF +G RK+ +VFY+ +SIL ++LL ++YL +YL
Sbjct: 184 GPAASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYL 243
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
YQR M+ TQ LKR+ + RK KPS
Sbjct: 244 YQRGMKRETQVLKRVSQASRKKKPS 268
>gi|255632970|gb|ACU16839.1| unknown [Glycine max]
Length = 266
Score = 267 bits (682), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 131/201 (65%), Positives = 163/201 (81%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF EK LDQHFDN
Sbjct: 64 MPFVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDN 123
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLN+S +C+AD+CGALHCD MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +
Sbjct: 124 RHSNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTE 183
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GP+ASRLHE FL QFCDAH C +PF +G RK+ +VFY+ +SIL ++LL ++YL +YL
Sbjct: 184 GPAASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYL 243
Query: 181 YQREMRGGTQELKRIPRVGRK 201
YQR M+ TQ LKR+ + RK
Sbjct: 244 YQRGMKRETQVLKRVSQASRK 264
>gi|294462744|gb|ADE76916.1| unknown [Picea sitchensis]
Length = 277
Score = 266 bits (680), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 121/201 (60%), Positives = 152/201 (75%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVEQ Y IS CRLH + DMFR+QE HK H+ +NEW+CG+C+K F +EK LDQHFDN
Sbjct: 74 MPFVEQENYNISRTCRLHLEIDMFREQEPHKDHLQVNEWQCGFCRKIFISEKLLDQHFDN 133
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLN+S +CLAD+CGALHCD + SK ++KCNPAA KN HLCESLAN CFP +
Sbjct: 134 RHHNLLNISRSRCLADMCGALHCDAMDKSKPIKTKCNPAAADKNCHLCESLANSCFPPQE 193
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
PSA RLH+ FLRQFCDAH C R SQPF +G K ++ Y ++ +L +M+LPIFY IVY+
Sbjct: 194 SPSARRLHDFFLRQFCDAHTCKRGSQPFARGSGKTTNLLYYAVCLLLMMMLPIFYCIVYV 253
Query: 181 YQREMRGGTQELKRIPRVGRK 201
+QREM + KRIP++ +K
Sbjct: 254 HQREMGKHIKNFKRIPKMNQK 274
>gi|26452757|dbj|BAC43459.1| putative zinc finger protein [Arabidopsis thaliana]
Length = 272
Score = 265 bits (678), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 162/204 (79%), Gaps = 1/204 (0%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP+VE+ +Y++ S CR+H DND++R+QE+HK+ DINEWRCG+CKK+F EK+LD+HFD+
Sbjct: 69 MPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFCKKAFYEEKYLDKHFDS 128
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN SHGKCL+DLCGALHCD V+++ +SKCNPAA AKNRHLCESLAN CFP+N+
Sbjct: 129 RHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKNRHLCESLANSCFPVNK 188
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SA+RLH+ FLRQFCDAH C S+P + +K+ S+ Y+ SI+ L++L ++Y VYL
Sbjct: 189 GSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFSIIVLVVLLLYYSFVYL 247
Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
++R ++ +Q+LKRI G K KP
Sbjct: 248 FRRGLKRRSQDLKRIRHNGLKKKP 271
>gi|15237477|ref|NP_198887.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
gi|9758099|dbj|BAB08543.1| unnamed protein product [Arabidopsis thaliana]
gi|90962968|gb|ABE02408.1| At5g40710 [Arabidopsis thaliana]
gi|332007203|gb|AED94586.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
Length = 272
Score = 265 bits (678), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 162/204 (79%), Gaps = 1/204 (0%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP+VE+ +Y++ S CR+H DND++R+QE+HK+ DINEWRCG+CKK+F EK+LD+HFD+
Sbjct: 69 MPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFCKKAFYEEKYLDKHFDS 128
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN SHGKCL+DLCGALHCD V+++ +SKCNPAA AKNRHLCESLAN CFP+N+
Sbjct: 129 RHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKNRHLCESLANSCFPVNK 188
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
G SA+RLH+ FLRQFCDAH C S+P + +K+ S+ Y+ SI+ L++L ++Y VYL
Sbjct: 189 GSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFSIIVLVVLLLYYSFVYL 247
Query: 181 YQREMRGGTQELKRIPRVGRKAKP 204
++R ++ +Q+LKRI G K KP
Sbjct: 248 FRRGLKRRSQDLKRIRHNGLKKKP 271
>gi|356527071|ref|XP_003532137.1| PREDICTED: uncharacterized protein LOC100792257 [Glycine max]
Length = 271
Score = 265 bits (676), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 133/205 (64%), Positives = 165/205 (80%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y IS +C LHPDND++RDQEQHK H+DINEW+C YCKKSF EK LDQHFDN
Sbjct: 67 MPFVEKEKYHISKRCMLHPDNDIYRDQEQHKSHIDINEWQCRYCKKSFYEEKHLDQHFDN 126
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLN+S +CLAD+CGALHCD MNS S +SKCNPAA AK++HLCESLA+ CFP++
Sbjct: 127 RHSNLLNLSESQCLADVCGALHCDHEMNSGSKKSKCNPAAAAKHKHLCESLADSCFPVSD 186
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
GP+AS+LHE FL QFCDAH C +PF +G RK+ +VFY+ +SIL ++LL ++YL +YL
Sbjct: 187 GPAASQLHEFFLHQFCDAHSCTGSRKPFSRGHRKKTNVFYIFVSILLVILLLLYYLYIYL 246
Query: 181 YQREMRGGTQELKRIPRVGRKAKPS 205
YQR M+ TQ LKRI + RK KPS
Sbjct: 247 YQRGMKRETQVLKRISQASRKKKPS 271
>gi|357477231|ref|XP_003608901.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
gi|355509956|gb|AES91098.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
Length = 241
Score = 260 bits (664), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 122/165 (73%), Positives = 139/165 (84%), Gaps = 2/165 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
PFVE+ Y++S KCRLHP+NDMFRDQE+H+I++D NEWRCGYCKKSFR EKFLDQH DN
Sbjct: 72 TPFVEKENYQLSRKCRLHPENDMFRDQEEHRIYLDRNEWRCGYCKKSFREEKFLDQHLDN 131
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLN+SHGKCLADLCGALHCD V NSKSSRSKCNPAA A+NRHLCESLA+ CFPI+
Sbjct: 132 RHCNLLNLSHGKCLADLCGALHCDAVTNSKSSRSKCNPAAAARNRHLCESLADSCFPISG 191
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISI 165
G SASRLHELFL QFCDAH C K +PF +GG+ +P YL+ I
Sbjct: 192 GLSASRLHELFLHQFCDAHSCSGKHKPFSRGGKVRPE--YLTGEI 234
>gi|147776347|emb|CAN76470.1| hypothetical protein VITISV_016787 [Vitis vinifera]
Length = 483
Score = 248 bits (633), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 111/151 (73%), Positives = 128/151 (84%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD+HFDN
Sbjct: 38 MPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLDKHFDN 97
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLNVS KCLAD+CGALHCD VM+SK ++KCNPAA A+N HLCESLA+ CFP++
Sbjct: 98 RHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGCFPVSG 157
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKG 151
GPSASRLHE FLRQFCDAH C +PF +G
Sbjct: 158 GPSASRLHEFFLRQFCDAHTCTGGRKPFSRG 188
>gi|255571984|ref|XP_002526933.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223533685|gb|EEF35420.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 240
Score = 247 bits (631), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 106/149 (71%), Positives = 130/149 (87%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ Y++S+ CRLHPDND++RDQEQHKIH D NEWRCGYC+KSF E++L++HFDN
Sbjct: 72 MPFVEKEHYQVSTGCRLHPDNDLYRDQEQHKIHEDGNEWRCGYCRKSFYEERYLNKHFDN 131
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NLLNVSH +CLAD+CGALHCDFVM+S + ++KCNPAA A+N+HLCESLAN CFP+N+
Sbjct: 132 RHYNLLNVSHSRCLADVCGALHCDFVMDSATRKTKCNPAAAARNKHLCESLANSCFPVNE 191
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFP 149
GPSA RL+ELFL QFCDAH C + +PFP
Sbjct: 192 GPSAHRLNELFLHQFCDAHTCTGRQKPFP 220
>gi|357436185|ref|XP_003588368.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
gi|355477416|gb|AES58619.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
Length = 306
Score = 241 bits (614), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 102/153 (66%), Positives = 128/153 (83%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
PFVE+ +Y IS +CRLHPDND++RDQEQHK H+DINEW+CGYCKK+F EK LDQHFDN
Sbjct: 145 TPFVEKEKYNISRRCRLHPDNDIYRDQEQHKSHIDINEWQCGYCKKTFYEEKHLDQHFDN 204
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLN+ +CLAD+CGALHCD +NS S +SKCNPAA A+N+H+CE++A+ CFP+N+
Sbjct: 205 RHSNLLNLKESRCLADVCGALHCDHEINSGSKKSKCNPAAAARNKHICETIADSCFPVNE 264
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKGGR 153
GP+ASRLHE FL QFCDAH C + +PF +G R
Sbjct: 265 GPAASRLHEFFLHQFCDAHSCTGRGKPFSRGRR 297
>gi|388514801|gb|AFK45462.1| unknown [Lotus japonicus]
Length = 263
Score = 227 bits (579), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 110/181 (60%), Positives = 133/181 (73%), Gaps = 12/181 (6%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +Y IS +CRLHPDND++RDQE HK H+DINEW+CG+CKKSF EK LDQHFDN
Sbjct: 80 MPFVEKEKYHISKRCRLHPDNDIYRDQEDHKSHIDINEWQCGFCKKSFYEEKHLDQHFDN 139
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLN S +CLAD+CGALHCD +NS S +SKCNPAA A+N+HLCESLA+ CFP+N+
Sbjct: 140 RHSNLLNSSESRCLADVCGALHCDLEINSGSKKSKCNPAAAARNKHLCESLADSCFPVNE 199
Query: 121 GPSASRLHELFLRQFCDAHKCPRKSQPFPKG-GRKQ------PSVF-----YLSISILTL 168
GP+ASRLHE FL QFCD + P+G GR Q P F Y +IS T
Sbjct: 200 GPAASRLHEFFLHQFCDHTAVLQVGNLSPEGAGRGQMCSTSWPPFFLWCCCYFTISTFTC 259
Query: 169 M 169
+
Sbjct: 260 I 260
>gi|449529367|ref|XP_004171671.1| PREDICTED: uncharacterized protein LOC101223370, partial [Cucumis
sativus]
Length = 203
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 98/128 (76%), Positives = 118/128 (92%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+PF+E+ YE+S++CRLHP+ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDN
Sbjct: 76 LPFMEKENYEVSTQCRLHPNNDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDN 135
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH+NLLNVSHGKCLADLCGALHCD M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+
Sbjct: 136 RHSNLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINE 195
Query: 121 GPSASRLH 128
GPSA+RLH
Sbjct: 196 GPSANRLH 203
>gi|413916780|gb|AFW56712.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
Length = 222
Score = 208 bits (530), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 95/139 (68%), Positives = 114/139 (82%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 71 MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 130
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQ 120
RH NL++ S G+C+ADLCGALHCD +M K +SKCN AA +NRHLCESLA+ CFPINQ
Sbjct: 131 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPINQ 190
Query: 121 GPSASRLHELFLRQFCDAH 139
G +ASRLH + C+ H
Sbjct: 191 GLAASRLHGKWNVSECNDH 209
>gi|168063136|ref|XP_001783530.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664965|gb|EDQ51666.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 192 bits (488), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 131/200 (65%), Gaps = 8/200 (4%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFV+ +Y + CRLHPDND++ +QE++ I + +W+CGYCKK FR + FLD HFDN
Sbjct: 175 MPFVKDQKYTLPKTCRLHPDNDVYHEQEENMIELRPMQWQCGYCKKIFRNQGFLDLHFDN 234
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR----SKCNPAAVAKNRHLCESLANRCF 116
RH+ L S CLAD CGALHCD+ NS SS+ + C A V KNRH CE LAN CF
Sbjct: 235 RHSEKLEPSSNSCLADACGALHCDY-YNSLSSKPQIQATCKLAVVEKNRHSCEVLANTCF 293
Query: 117 PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGG--RKQPSVFYLSISILTLMLLPIF 174
P + P+A RL+ F RQFCDAH C +K + FP G + S+ Y ++++ T+++L IF
Sbjct: 294 PADHSPAAQRLNHFFKRQFCDAHTCKKKLKIFPSGSGINRNKSLIY-ALTLFTVIVLAIF 352
Query: 175 YLIVYLYQREMRGGTQELKR 194
Y I+YL +R+ + L+R
Sbjct: 353 YAILYLIKRDTNLLRKGLRR 372
>gi|302808117|ref|XP_002985753.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
gi|300146662|gb|EFJ13331.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
Length = 171
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/144 (58%), Positives = 105/144 (72%), Gaps = 4/144 (2%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+PFVE Y +S +CRLH ND FR+QE+ K I++WRCGYC K F +E++LD HFDN
Sbjct: 24 LPFVENENYALSPECRLHASNDAFREQEREKQFYHIHDWRCGYCHKIFESEEYLDLHFDN 83
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKCNPAAVAKNRHLCESLANRCFP 117
RH+ LNVS CLADLCGALHCD++ K S++KC+P +V +NRHLCE LAN CFP
Sbjct: 84 RHSETLNVSRHNCLADLCGALHCDYMETKDKHKFSKNKCSP-SVDRNRHLCEKLANSCFP 142
Query: 118 INQGPSASRLHELFLRQFCDAHKC 141
QG A+RL++ FLRQFCDAH C
Sbjct: 143 PQQGAQATRLNDFFLRQFCDAHSC 166
>gi|302785357|ref|XP_002974450.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
gi|300158048|gb|EFJ24672.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
Length = 171
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 83/144 (57%), Positives = 105/144 (72%), Gaps = 4/144 (2%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+PFVE Y +S +CRLH ND FR+QE+ K I++WRCGYC K F +E++LD HFDN
Sbjct: 24 LPFVENENYALSPQCRLHASNDAFREQEREKQFYHIHDWRCGYCHKIFESEEYLDLHFDN 83
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKCNPAAVAKNRHLCESLANRCFP 117
RH+ LNVS CLAD+CGALHCD++ K S++KC+P +V +NRHLCE LAN CFP
Sbjct: 84 RHSETLNVSRDNCLADVCGALHCDYMETKDKHKFSKNKCSP-SVDRNRHLCEKLANSCFP 142
Query: 118 INQGPSASRLHELFLRQFCDAHKC 141
QG A+RL++ FLRQFCDAH C
Sbjct: 143 PQQGAQATRLNDFFLRQFCDAHSC 166
>gi|413916778|gb|AFW56710.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
Length = 177
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 76/107 (71%), Positives = 91/107 (85%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MPFVE+ +YE+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKF+DQHF+N
Sbjct: 71 MPFVEKEKYELPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFIDQHFEN 130
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHL 107
RH NL++ S G+C+ADLCGALHCD +M K +SKCN AA +NRHL
Sbjct: 131 RHKNLVDNSEGRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHL 177
>gi|168040762|ref|XP_001772862.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675773|gb|EDQ62264.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 235
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/112 (51%), Positives = 77/112 (68%), Gaps = 4/112 (3%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
PFVE +Y + CRLHP ND++R+QE++ + +W+C YCKK FR + FLD HFDNR
Sbjct: 70 PFVEAEKYTLPESCRLHPANDIYREQEENMDELRPMQWQCRYCKKIFRNQGFLDTHFDNR 129
Query: 62 HNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSK----CNPAAVAKNRHLCE 109
H+NLL+ S CLAD CGALHCD+ + +S+S+ C PA V +NRH CE
Sbjct: 130 HSNLLDKSLNMCLADACGALHCDYYNSLSTSKSQKQSTCKPAVVERNRHSCE 181
>gi|428163164|gb|EKX32251.1| hypothetical protein GUITHDRAFT_91074 [Guillardia theta CCMP2712]
Length = 297
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 100/213 (46%), Gaps = 30/213 (14%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P V ++ +++ C + DMF D E+HK V N W+C YC K F++E +L+ H+ NR
Sbjct: 55 PHVRKLGFQLPPGCPFSEERDMFLDNEKHKKEVRFNNWKCLYCNKVFKSESYLEHHYVNR 114
Query: 62 HNNLLNVSHGKCLADLCGALHCDFVMNS-------KSSRSKCNPAAVAKNRHLCESLANR 114
H + G CLAD C L CD N K+S +C+ + K RH C S+ ++
Sbjct: 115 HPETIK-EDGFCLADYCDVLECDMQENVIGSEGMFKTSSYRCDAKRMQKRRHRCHSVLDK 173
Query: 115 CFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLS------------ 162
CFP + +A+RLH F + +C+ C R+ ++ S
Sbjct: 174 CFPPHTSDAANRLHHEFEKLYCEHLSCEASPDGLGDVARRPHTLLSQSKARSEYHMKKTK 233
Query: 163 ----------ISILTLMLLPIFYLIVYLYQREM 185
+L ++L IFYL V LY+++M
Sbjct: 234 HSPWRKLAIVFGVLFAIILFIFYLGVCLYRKDM 266
>gi|255077084|ref|XP_002502194.1| predicted protein [Micromonas sp. RCC299]
gi|226517459|gb|ACO63452.1| predicted protein [Micromonas sp. RCC299]
Length = 301
Score = 95.9 bits (237), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 91/204 (44%), Gaps = 19/204 (9%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P ++ +Y + C DM+ + E+HK V +++ Y K+F E ++D+H DNR
Sbjct: 59 PVADRNRYRLPPNCPFDRSKDMYLEHEKHKEVVRRTQYKSLYSDKTFYNEYYVDKHMDNR 118
Query: 62 HNNLLNVSHGKCLADLCGALHCDF-----------VMNSKSSRSKCNPAAVAKNRHLCES 110
H + + CLAD C L CD + S +SRS+C+ + RH CE
Sbjct: 119 HMDKVPPGADVCLADYCEVLRCDEHQAWKHPELRKTVGSSASRSRCSQERMRGLRHFCEV 178
Query: 111 LANRCFPINQGP-----SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV---FYLS 162
L + CFP GP +A+RL E F R CD C F G Y+
Sbjct: 179 LMSGCFPTGGGPDGDAGTAARLREYFTRHHCDMLTCDGVRDMFGVLGSHHAVAGGGGYVL 238
Query: 163 ISILTLMLLPIFYLIVYLYQREMR 186
+ I+ + +YL V+ R+ R
Sbjct: 239 VLIVVFAAVGCYYLFVWHAVRDGR 262
>gi|443688435|gb|ELT91124.1| hypothetical protein CAPTEDRAFT_220314 [Capitella teleta]
Length = 284
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 85/190 (44%), Gaps = 15/190 (7%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P + + Q S C L D DM+ +QEQ K ++W C +C KSF E+ LD HF+N
Sbjct: 58 LPILAEFQVTASESCPLSADRDMYAEQEQRKFMESSSKWTCNFCGKSFYDERSLDNHFEN 117
Query: 61 RH-NNLLNVSHGKCLADLCGALHCDFVMNSKSSR----SKCNPAAVAKNRHLCESLANRC 115
RH N+L C AD C CD + + + + C A + H CE L C
Sbjct: 118 RHPKNILTGPRATCPADFCDIFRCDVISGAHRAGYWEVALCRDADLIHTHHRCEDLLRNC 177
Query: 116 FPINQGPSASRLHELFLRQF----CDAHKCPRKSQPFPKGGRKQP--SVFYLSISILTLM 169
P N P A+R +L L +F C C + P+ P + Y+ + IL
Sbjct: 178 LPSNI-PHATR--DLILARFEKAICGYLTCDMFWE-IPENRVSNPVGTAIYIVLFILLSF 233
Query: 170 LLPIFYLIVY 179
L I+Y Y
Sbjct: 234 GLLIYYYAAY 243
>gi|384247793|gb|EIE21279.1| hypothetical protein COCSUDRAFT_57182 [Coccomyxa subellipsoidea
C-169]
Length = 220
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/183 (28%), Positives = 89/183 (48%), Gaps = 19/183 (10%)
Query: 37 NEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR--- 93
N + C CK+ F +E+ ++H+D +H+ ++ CLAD C LHCD + + +SR
Sbjct: 20 NRFVCSLCKERFSSEEAWERHYDRKHHIHIHPGRDVCLADYCEVLHCDKINLTPTSRQLR 79
Query: 94 ------------SKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 141
+ C +A + R C LA CFP + GP+A++LH LF+ C AH C
Sbjct: 80 AQQAGDPTLAPHAPCRESAAEQLRESCRDLAAACFPPDAGPAAAKLHRLFVEGVCAAHTC 139
Query: 142 PRKSQP---FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 198
+ Q ++ ++ Y + L L + IFY +++Y + R G+ + R+
Sbjct: 140 DLELQQDLLVQLAASEKGTLGYKLLIGLALGAMAIFYACLWVYH-QYRSGSLSRGDVRRL 198
Query: 199 GRK 201
RK
Sbjct: 199 RRK 201
>gi|308807625|ref|XP_003081123.1| zinc finger (ISS) [Ostreococcus tauri]
gi|116059585|emb|CAL55292.1| zinc finger (ISS) [Ostreococcus tauri]
Length = 241
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 101/213 (47%), Gaps = 14/213 (6%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFD 59
P E + EI++ C D+F + E+HK V ++ WR Y K F++E ++D+H +
Sbjct: 8 PVAEGAKTEIAATCPFSRIRDVFYEHERHKERVRHSKQYWRSLYSGKVFKSEYYVDKHME 67
Query: 60 NRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLCESLANRC 115
RH + + CLAD C L CD ++ + +C+ A +A + C + C
Sbjct: 68 RRHGDKIPPGANVCLADHCEVLQCDRHASYMNGERGKLVRCDAAHMADVQAECRKMLKSC 127
Query: 116 FPINQGPSASR---LHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV---FYLSISILTLM 169
FP G +A R +H+ + +CD C F + +Y+S+ + ++
Sbjct: 128 FPHESGRAAERSRKVHDYLNKYYCDQLTCVGVGGVFELMSTHHETAGASYYVSLGFV-ML 186
Query: 170 LLPIFYLIVYLYQREMRGGTQELKRIPRVGRKA 202
L+ ++Y +VY + RG + ++KR R R +
Sbjct: 187 LVALYYFLVYSWV-NGRGVSLDVKRAHRRARAS 218
>gi|195999976|ref|XP_002109856.1| hypothetical protein TRIADDRAFT_6027 [Trichoplax adhaerens]
gi|190587980|gb|EDV28022.1| hypothetical protein TRIADDRAFT_6027, partial [Trichoplax
adhaerens]
Length = 202
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 73/150 (48%), Gaps = 12/150 (8%)
Query: 3 FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 62
++ +Q ++S C HP+ D+FR QE+ K + + W C YC K F E+FLDQH DNRH
Sbjct: 20 LIQSMQVQLSFDCPFHPERDLFRKQEELKDNAYQSSWTCSYCGKWFYRERFLDQHLDNRH 79
Query: 63 NNLL-NVSHGKCLADLCGALHCDFVMNSKSSRSK----------CNPAAVAKNRHLCESL 111
+ LL V + CLA+ C L CD ++ +K C + + R C +
Sbjct: 80 SALLGTVMNATCLANYCDILGCDLAHVQDTTLAKGNDLWWKTALCRSTQMVELRDQCLQI 139
Query: 112 ANRCFPINQGPSASRLHELFLRQFCDAHKC 141
A +C P ++S + + + C C
Sbjct: 140 AEQCTP-KSSKASSGVRNIIISNICSRLTC 168
>gi|145350216|ref|XP_001419510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579742|gb|ABO97803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 270
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 104/217 (47%), Gaps = 16/217 (7%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFD 59
P + +Y + C D+F + E HK V ++ WR Y K F++E ++D+H +
Sbjct: 36 PVADGARYALDLGCPFSHARDLFYEHETHKEPVRHSKLYWRSLYSGKVFKSEYYVDKHME 95
Query: 60 NRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSR-SKCNPAAVAKNRHLCESLANR 114
RH + + + CLAD C L CD +V N + S+ ++C+ ++ R C + +
Sbjct: 96 RRHGDKIPSTADVCLADYCDVLQCDKHAEYVKNGRGSKPTRCDEEQMSLVRDKCHEMLTQ 155
Query: 115 CFPINQGPS---ASRLHELFLRQFCDAHKCPRKSQPFP--KGGRKQPSV-FYLSISILTL 168
CFP G + A +L+ F + +CD C F P V +Y++++ L
Sbjct: 156 CFPNENGSTRDNAGKLNVYFTKYYCDQMTCADVDGVFELMSAHHDSPGVGYYIALAFL-F 214
Query: 169 MLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 205
++ ++YL VY + R + ++KR RV +A S
Sbjct: 215 FIVAMYYLTVYSWTNGRR-VSSDIKR-ARVRARASWS 249
>gi|281201704|gb|EFA75912.1| hypothetical protein PPL_10484 [Polysphondylium pallidum PN500]
Length = 353
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 94/190 (49%), Gaps = 14/190 (7%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P V +Y++ C+L+P DM E KI++ + W+C C K F E+FLD H N
Sbjct: 144 PVVVDHKYQLPVNCQLNPSLDMLAYLETQKINL-YSIWKCRNCSKQFNTEEFLDLHLKNN 202
Query: 62 HNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQG 121
H +L+ + CL+D C L+C C+ + K +H C+S+ CFP+NQ
Sbjct: 203 HASLIPSNATVCLSDFCDMLNC-----KDDPVIACDNKRMDKLKHHCQSIMYNCFPLNQE 257
Query: 122 PSASRLHELFLRQFCDAHKC-------PRKSQPFPKGGRKQPSVFYLSISILTLMLLPIF 174
S L+ F + CD C P++S + S+ +++T++++ +F
Sbjct: 258 QS-RMLNSHFTKNICDHLTCDGYKRHIPKQSDSNQSNSGRSWSLMKYIAALITIIIIVLF 316
Query: 175 YLIVYLYQRE 184
YLIV Y++E
Sbjct: 317 YLIVCFYRKE 326
>gi|390365205|ref|XP_003730770.1| PREDICTED: uncharacterized protein LOC100893077 [Strongylocentrotus
purpuratus]
Length = 233
Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 94/195 (48%), Gaps = 14/195 (7%)
Query: 12 SSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSH 70
++C L P+ +M+ D E+HK N W C YC KSF E+FLD+H+DNRH + +L
Sbjct: 9 GTECLLKPERNMYWDDERHKWSEIANNWNCLYCGKSFYNEQFLDKHYDNRHGSTVLTGPE 68
Query: 71 GKCLADLCGALHCDFVMNSKSS----RSKCNPAAVAKNRHLCESLANRCFP--INQGPSA 124
CLAD C L CD + ++K + + C + R C+++ C P I
Sbjct: 69 AVCLADHCDYLRCDIIASTKKTSFWEEALCKQKDLLVLRKKCQNIMKSCIPSHITDAEKN 128
Query: 125 SRLHELFLRQFCDAHKCPRKSQ-PFPKGGRKQPSVFYLSISILTLMLLPIFYLI----VY 179
S L +L C C + + P+ + + + ++++ I T+ ++Y++ +Y
Sbjct: 129 SLLDKL-DDTVCSFLTCEKYWENPYKEVPPSRTAAYFIA-GIFTVFGFIMYYVMACSHLY 186
Query: 180 LYQREMRGGTQELKR 194
++E G + R
Sbjct: 187 ADEKEFLGDSDGFVR 201
>gi|328865036|gb|EGG13422.1| hypothetical protein DFA_11183 [Dictyostelium fasciculatum]
Length = 306
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 98/198 (49%), Gaps = 24/198 (12%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
FV+ +Y++ S C + + DM + K + +++RC C K F++E ++DQH N
Sbjct: 93 SFVDYNEYKLPSTCVFNKNLDMLAVFQSLKFLQNDHQYRCKSCSKQFKSEDYIDQHLKNN 152
Query: 62 HNNLLNVSHGKCLADLCGALHC--DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPIN 119
H +L+ + CL D C L+C D++++ C+ + K ++ C+ + CFP
Sbjct: 153 HADLIPSNATICLGDYCEMLNCNNDYIIS-------CDERKMEKLKYACQGIMYNCFP-- 203
Query: 120 QGPSASR-LHELFLRQFCDAHKCPRKSQPFPKGG----------RKQPSVFYLSISILTL 168
PS ++ ++ F CD C S + G + Q S+ +++ L
Sbjct: 204 --PSVNKNYNKHFNALICDKLTCKGGSSTTGQDGIIRDADIEEKQHQWSLMKYLSALVIL 261
Query: 169 MLLPIFYLIVYLYQREMR 186
++L IFYLI++ +++E R
Sbjct: 262 VILIIFYLILWFFKKETR 279
>gi|383857833|ref|XP_003704408.1| PREDICTED: uncharacterized protein LOC100876283 [Megachile
rotundata]
Length = 440
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP +++ Q E+ +C H D+FR Q++ K ++W CG C KSF AEK LD HFDN
Sbjct: 45 MPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 104
Query: 61 RHNNLLNVSH-GKCLADLCGALHCDFVMN 88
RH + +N + CLAD C + CD + N
Sbjct: 105 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 133
>gi|66522619|ref|XP_393929.2| PREDICTED: hypothetical protein LOC410449 [Apis mellifera]
Length = 454
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP +++ Q E+ +C H D+FR Q++ K ++W CG C KSF AEK LD HFDN
Sbjct: 45 MPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 104
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
RH + +N + CLAD C + CD + N
Sbjct: 105 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 133
>gi|380017517|ref|XP_003692701.1| PREDICTED: uncharacterized protein LOC100864708 [Apis florea]
Length = 454
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP +++ Q E+ +C H D+FR Q++ K ++W CG C KSF AEK LD HFDN
Sbjct: 45 MPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRSSQWTCGLCGKSFYAEKHLDAHFDN 104
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
RH + +N + CLAD C + CD + N
Sbjct: 105 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 133
>gi|307188422|gb|EFN73179.1| hypothetical protein EAG_05305 [Camponotus floridanus]
Length = 441
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP +++ Q E+ +C H D+FR Q++ K ++W CG C KSF AEK LD HFDN
Sbjct: 39 MPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDVHFDN 98
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
RH + +N + CLAD C + CD + N
Sbjct: 99 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 127
>gi|340723433|ref|XP_003400094.1| PREDICTED: hypothetical protein LOC100643092 [Bombus terrestris]
Length = 453
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP +++ Q E+ +C H D+FR Q++ K ++W CG C KSF AEK LD HFDN
Sbjct: 48 MPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 107
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
RH + +N + CLAD C + CD + N
Sbjct: 108 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 136
>gi|350427747|ref|XP_003494865.1| PREDICTED: hypothetical protein LOC100742329 [Bombus impatiens]
Length = 450
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
MP +++ Q E+ +C H D+FR Q++ K ++W CG C KSF AEK LD HFDN
Sbjct: 48 MPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWTCGLCGKSFYAEKHLDAHFDN 107
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMN 88
RH + +N + CLAD C + CD + N
Sbjct: 108 RHKSNVNTAEDAVCLADYCDIMRCDVLGN 136
>gi|156386657|ref|XP_001634028.1| predicted protein [Nematostella vectensis]
gi|156221106|gb|EDO41965.1| predicted protein [Nematostella vectensis]
Length = 257
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 77/179 (43%), Gaps = 6/179 (3%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +Q + ++ C LH +D+F E +K ++ W C +C K+F +E ++D H N
Sbjct: 47 FPAYDQHKVDVPLSCLLHETHDVFIRHEINKEEEHVSRWTCNFCGKAFYSEHYIDMHMQN 106
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSKCNPAAVAKNRHLCESLANRCF 116
RH + L CLAD C C + + K R+ C +A + CE L + C
Sbjct: 107 RHKDFLIQDSSVCLADYCDVFRCVVLNDQKKEWFWERALCKHEKMAALKRRCEVLMHGCM 166
Query: 117 PINQGPSAS--RLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 173
P+ G S + +++E C C +P + + Y S +M L +
Sbjct: 167 PVANGTSDTPYKVYETVWSNVCSLLTCKDYWKPPYEEMPTWRIIVYAVFSPFFVMALIV 225
>gi|242021247|ref|XP_002431057.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516286|gb|EEB18319.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 428
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 59/108 (54%), Gaps = 12/108 (11%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ Q +I +C HP D+F Q+ K ++W CG+C KSF E++L+ HFDN
Sbjct: 43 IPILEKYQVKIPVECPFHPIRDIFGPQQAAKHQFRSSQWSCGFCGKSFYEERYLELHFDN 102
Query: 61 RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHL 107
H N +NV+ CLAD C + C+ ++ + KN+HL
Sbjct: 103 IHKNQINVAEDAVCLADYCDIMRCEVLVTQE-----------IKNKHL 139
>gi|312378286|gb|EFR24907.1| hypothetical protein AND_10215 [Anopheles darlingi]
Length = 630
Score = 76.3 bits (186), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 68/125 (54%), Gaps = 3/125 (2%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ Q ++ +C HP D+F Q+ K ++W CG+C KSF E+ LD HF+N
Sbjct: 65 LPILEKYQVKLPLECPFHPMRDIFGPQQSAKKQHRPSQWTCGFCGKSFFEEQHLDMHFEN 124
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFP 117
RH +N++ CLAD C + C+ ++ ++ + +P AV+ + + E+ A R
Sbjct: 125 RHKTNINMAEDAICLADYCDMMRCEVLIAKDATLAFGGDPNAVSTDIEVWSEATAYRTAL 184
Query: 118 INQGP 122
GP
Sbjct: 185 TTAGP 189
>gi|270007090|gb|EFA03538.1| hypothetical protein TcasGA2_TC013541 [Tribolium castaneum]
Length = 423
Score = 76.3 bits (186), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ + + +C HP D+F Q+ K ++W CG C KSF EK+LD HFDN
Sbjct: 48 IPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQHRPSQWTCGLCGKSFFEEKYLDMHFDN 107
Query: 61 RHNNLLNVSH-GKCLADLCGALHCDFVMN 88
RH +N++ CLAD C + CD ++
Sbjct: 108 RHKGFINMAEDAVCLADYCDIMRCDVLIT 136
>gi|158296785|ref|XP_317135.4| AGAP008328-PA [Anopheles gambiae str. PEST]
gi|157014878|gb|EAA12581.4| AGAP008328-PA [Anopheles gambiae str. PEST]
Length = 410
Score = 76.3 bits (186), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 73/153 (47%), Gaps = 16/153 (10%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ Q ++ +C HP D+F Q+ K ++W CG+C KSF EK LD HF+N
Sbjct: 46 LPILEKYQVKLPLECPFHPLRDIFGPQQSAKKQHRPSQWTCGFCGKSFFEEKHLDMHFEN 105
Query: 61 RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFP 117
RH +N + CLAD C + C+ ++ ++ + +P AV+ + + E+ A R
Sbjct: 106 RHRTNINTAEDAVCLADYCDMMRCEVLIAKDATLAFGSDPNAVSTDIEVWSEATAYRTAL 165
Query: 118 INQGPSASRLHELFLRQFCDAHKCPRKSQPFPK 150
GP D K P + PK
Sbjct: 166 STSGPR-------------DVAKVPERKSLLPK 185
>gi|91082837|ref|XP_969705.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 350
Score = 75.9 bits (185), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 51/89 (57%), Gaps = 1/89 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ + + +C HP D+F Q+ K ++W CG C KSF EK+LD HFDN
Sbjct: 48 IPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQHRPSQWTCGLCGKSFFEEKYLDMHFDN 107
Query: 61 RHNNLLNVSH-GKCLADLCGALHCDFVMN 88
RH +N++ CLAD C + CD ++
Sbjct: 108 RHKGFINMAEDAVCLADYCDIMRCDVLIT 136
>gi|168022196|ref|XP_001763626.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685119|gb|EDQ71516.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 285
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 12/109 (11%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
M +V +Y + C H DND R+QE+ + + EW+C CK FR+ +FLD HFDN
Sbjct: 86 MLYVHSQKYTLPKNCHFHFDNDFSREQEKSMDELGLTEWQCRCCKIVFRSREFLDTHFDN 145
Query: 61 RHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCE 109
RH+ L+ CL+ + ++ C PA + KNRH CE
Sbjct: 146 RHSEKLDTVVFCCLS------------SQPKKQATCKPAVMEKNRHSCE 182
>gi|170038289|ref|XP_001846984.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167881843|gb|EDS45226.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 616
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 67/125 (53%), Gaps = 3/125 (2%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ Q ++ +C HP D+F Q+ K ++W CG+C KSF EK LD HF++
Sbjct: 120 LPILEKYQVKLPLECPFHPLRDIFGPQQTAKKQHRPSQWTCGFCGKSFFEEKHLDMHFEH 179
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFP 117
RH +N++ CLAD C + C+ ++ ++ S +P ++ + + ES A R
Sbjct: 180 RHKGNINMAEDAICLADYCDMMRCEVLIAKDATLSFGVDPNMMSTDIEVWSESTAYRTAL 239
Query: 118 INQGP 122
GP
Sbjct: 240 TTSGP 244
>gi|226481637|emb|CAX73716.1| Zinc finger, C2H2-type domain-containing protein [Schistosoma
japonicum]
Length = 254
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 79/187 (42%), Gaps = 10/187 (5%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P + ++ +C +P +D++ EQ K + +W C C K F E+ D H NR
Sbjct: 44 PLFQYANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQLCGKRFYTERSFDLHIGNR 103
Query: 62 HN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLAN 113
H N +++ CL+ C L C D+ + C+P + CE + N
Sbjct: 104 HKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEALCDPLSFEDISKQCEDVIN 163
Query: 114 RCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 173
+C P+N S +L +L CD C R P K P + +++ L I
Sbjct: 164 KCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKTPVLQKFLTALIVTAGLLI 221
Query: 174 FYLIVYL 180
+Y ++++
Sbjct: 222 YYSVIFM 228
>gi|56754925|gb|AAW25645.1| SJCHGC03203 protein [Schistosoma japonicum]
Length = 254
Score = 73.2 bits (178), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 83/189 (43%), Gaps = 12/189 (6%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P + ++ +C +P +D++ EQ K + +W C C K F E+ D H N
Sbjct: 43 FPLFQFANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQLCGKRFYTERSFDLHIGN 102
Query: 61 RHN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLA 112
RH N +++ CL+ C L C D+ + C+P + CE +
Sbjct: 103 RHKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEALCDPLSFEDISKQCEDVI 162
Query: 113 NRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV-FYLSISILTLMLL 171
N+C P+N S +L +L CD C R P K P + +L+ I+T LL
Sbjct: 163 NKCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKTPVLQKFLTAFIVTAGLL 220
Query: 172 PIFYLIVYL 180
I+Y ++++
Sbjct: 221 -IYYSVIFM 228
>gi|195118254|ref|XP_002003655.1| GI21701 [Drosophila mojavensis]
gi|193914230|gb|EDW13097.1| GI21701 [Drosophila mojavensis]
Length = 639
Score = 73.2 bits (178), Expect = 6e-11, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P +++ Q ++ +C LHP D+F +++ K +W C C KSF EK+LD HFD R
Sbjct: 44 PILDKHQVKLPLECPLHPLRDVFAPRQEEKQRDRPTQWTCRLCGKSFYQEKYLDLHFDTR 103
Query: 62 HNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSK 95
H +++N + CLADLC + C+ +S K
Sbjct: 104 HKHIINEAEDSVCLADLCDIMRCEVFQTEDASSLK 138
>gi|157134250|ref|XP_001663208.1| hypothetical protein AaeL_AAEL013023 [Aedes aegypti]
gi|108870545|gb|EAT34770.1| AAEL013023-PA [Aedes aegypti]
Length = 1069
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 66/125 (52%), Gaps = 3/125 (2%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ Q + +C HP D+F Q+ K ++W CG C KSF EK LD HF+N
Sbjct: 61 LPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWTCGLCGKSFFEEKHLDMHFEN 120
Query: 61 RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHLC-ESLANRCFP 117
RH + +N++ CLAD C + C+ ++ ++ S +P ++ + + E+ A R
Sbjct: 121 RHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLSFGGDPNMMSTDIEVWNEATAYRTAL 180
Query: 118 INQGP 122
GP
Sbjct: 181 STAGP 185
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 66/125 (52%), Gaps = 3/125 (2%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ Q + +C HP D+F Q+ K ++W CG C KSF EK LD HF+N
Sbjct: 571 LPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWTCGLCGKSFFEEKHLDMHFEN 630
Query: 61 RHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHLC-ESLANRCFP 117
RH + +N++ CLAD C + C+ ++ ++ S +P ++ + + E+ A R
Sbjct: 631 RHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLSFGGDPNMMSTDIEVWNEATAYRTAL 690
Query: 118 INQGP 122
GP
Sbjct: 691 STAGP 695
>gi|298711019|emb|CBJ26414.1| C2H2 zinc finger protein [Ectocarpus siliculosus]
Length = 232
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 79/160 (49%), Gaps = 36/160 (22%)
Query: 2 PFVEQ-VQYEISSKCRLHPDNDMFRDQEQHK--IHVDINEWRCGYCKKSFRAEKFLDQHF 58
PF+++ I C LHP+ D+F+DQE HK + ++ ++CGYC+K F+ E ++D+H
Sbjct: 26 PFLDKDPALAIPLACPLHPELDVFKDQELHKRLVPGKLDTFQCGYCQKQFKTEFYMDRHM 85
Query: 59 DNRHNNLLNVSH------------GKCLADLCGALHC-DFVMNSKSSR------------ 93
DN+H + LN++ G+CL DLC L C D+ +S S
Sbjct: 86 DNKHADRLNLAGSDAIGHGIEPAPGRCLGDLCPELGCGDYAADSCSVEIHAGGGSSLGTG 145
Query: 94 --------SKCNPAAVAKNRHLCESLANRCFPINQGPSAS 125
C+PA + + R C +L + CF + G + S
Sbjct: 146 AGTIRRGCGACDPADMERRRRRCRALFHGCFDDSTGSAGS 185
>gi|195437634|ref|XP_002066745.1| GK24396 [Drosophila willistoni]
gi|194162830|gb|EDW77731.1| GK24396 [Drosophila willistoni]
Length = 754
Score = 71.2 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 34/95 (35%), Positives = 50/95 (52%), Gaps = 1/95 (1%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK+LD HFD R
Sbjct: 49 PILDKHQVKLPMECPLHPFRDIFAPRQDSKQRDRPTQWTCRRCGKSFYQEKYLDLHFDIR 108
Query: 62 HNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
H L+N V CL+D C + CD ++ K
Sbjct: 109 HETLINEVEDSICLSDFCDIMRCDVFQTEDATSLK 143
>gi|195388338|ref|XP_002052837.1| GJ17779 [Drosophila virilis]
gi|194149294|gb|EDW64992.1| GJ17779 [Drosophila virilis]
Length = 625
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 71/162 (43%), Gaps = 14/162 (8%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK+LD HFD
Sbjct: 43 TPILDKHQVKLPLECPLHPLRDVFAPRQDEKQRDRPTQWTCRLCGKSFYQEKYLDLHFDT 102
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSKCNPA-----------AVAKNRHLC 108
RH +++N CLAD C + C+ +S K ++ +N L
Sbjct: 103 RHKHIINEAEDAVCLADFCDIMRCEVFQTEDASSLKFGDQHIVTDIEVWGDSLGQNSALA 162
Query: 109 ESLAN--RCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPF 148
++ A P SA+R ++ RQ P K QP
Sbjct: 163 KANAAYLSLIPRTSTLSATRAAKVQNRQLLQEKPSPSKEQPL 204
>gi|303280527|ref|XP_003059556.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459392|gb|EEH56688.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 324
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 69/177 (38%), Gaps = 43/177 (24%)
Query: 8 QYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLN 67
+E+ C DM+ +QE+HK V +W+ Y +K FR E +D+H DNRH + +
Sbjct: 59 SFEVPPACPFQRSKDMYLEQEKHKERVRRTQWKSLYSEKVFRTEWHVDKHMDNRHADKIP 118
Query: 68 VSHGKCLADLCGALHCD-------------FVMNSKSSRSK----------------CNP 98
CLAD C L CD SRSK C+
Sbjct: 119 PGADVCLADYCAVLRCDEHHAHRNPDDPSRSGSRGAPSRSKGGGGGGAGARRARRERCDE 178
Query: 99 AAVAKNRHLCESLANRCFPINQGPSA--------------SRLHELFLRQFCDAHKC 141
+ +A RH CE+L + CFP + + RL F R CD C
Sbjct: 179 SKLAGARHFCETLMHACFPSDDDAESDGDRGGGARGHGAAGRLRAYFTRHHCDQLTC 235
>gi|195032746|ref|XP_001988553.1| GH11227 [Drosophila grimshawi]
gi|193904553|gb|EDW03420.1| GH11227 [Drosophila grimshawi]
Length = 635
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ C LHP D+F ++ K +W C C KSF EKFLD HFD
Sbjct: 43 TPILDKHQVKLPLDCPLHPLRDVFAPRQDEKQRDRPTQWTCRRCGKSFYQEKFLDLHFDT 102
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSK 95
RH +++N + CLAD C + C+ +S K
Sbjct: 103 RHKHIINEAEDSVCLADFCDIMRCEVFQTEDASSLK 138
>gi|357621060|gb|EHJ73029.1| hypothetical protein KGM_12603 [Danaus plexippus]
Length = 425
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 47/88 (53%), Gaps = 1/88 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
+P +E+ Q ++ +C HP D+F Q K ++W C +C KSF E+ LD HFD
Sbjct: 48 LPVLERFQVKLPLECPFHPARDIFAPQHAAKQQHRSSQWTCAFCGKSFYEERHLDTHFDQ 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVM 87
RH + +N CLAD C + C ++
Sbjct: 108 RHRHQINKAEDAVCLADYCDIMRCQVLV 135
>gi|326435960|gb|EGD81530.1| hypothetical protein PTSG_02249 [Salpingoeca sp. ATCC 50818]
Length = 334
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 53/111 (47%), Gaps = 10/111 (9%)
Query: 11 ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 70
++ C L P D++ Q++H+ + C YC KSFR E FLD H NRH L V
Sbjct: 80 LTPSCPLDPTRDVYARQDEHRDMEGTTKLTCLYCGKSFRGEPFLDLHMHNRHAGELLVPD 139
Query: 71 GK--CLADLCGALHCDFVMN--------SKSSRSKCNPAAVAKNRHLCESL 111
CLAD C A C+ + S KC+P A+++ + CE L
Sbjct: 140 TGPLCLADYCDATMCETQLEYRQFARLFSPEHGGKCDPDAMSRRKESCERL 190
>gi|442624983|ref|NP_001259827.1| CG4133, isoform C [Drosophila melanogaster]
gi|440213075|gb|AGB92364.1| CG4133, isoform C [Drosophila melanogaster]
Length = 583
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
RH +++N CLAD C + C+ +S K
Sbjct: 108 RHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|442624981|ref|NP_001259826.1| CG4133, isoform B [Drosophila melanogaster]
gi|440213074|gb|AGB92363.1| CG4133, isoform B [Drosophila melanogaster]
Length = 642
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
RH +++N CLAD C + C+ +S K
Sbjct: 108 RHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|195350111|ref|XP_002041585.1| GM16746 [Drosophila sechellia]
gi|194123358|gb|EDW45401.1| GM16746 [Drosophila sechellia]
Length = 652
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
RH +++N CLAD C + C+ +S K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|194853511|ref|XP_001968176.1| GG24723 [Drosophila erecta]
gi|190660043|gb|EDV57235.1| GG24723 [Drosophila erecta]
Length = 669
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
RH +++N CLAD C + C+ +S K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|195470274|ref|XP_002087433.1| GE16858 [Drosophila yakuba]
gi|194173534|gb|EDW87145.1| GE16858 [Drosophila yakuba]
Length = 669
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
RH +++N CLAD C + C+ +S K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|195575557|ref|XP_002077644.1| GD23028 [Drosophila simulans]
gi|194189653|gb|EDX03229.1| GD23028 [Drosophila simulans]
Length = 667
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
RH +++N CLAD C + C+ +S K
Sbjct: 108 RHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|19920466|ref|NP_608526.1| CG4133, isoform A [Drosophila melanogaster]
gi|7296200|gb|AAF51492.1| CG4133, isoform A [Drosophila melanogaster]
gi|18447493|gb|AAL68309.1| RE51073p [Drosophila melanogaster]
gi|220948886|gb|ACL86986.1| CG4133-PA [synthetic construct]
Length = 667
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRKCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSK 95
RH +++N + CLAD C + C+ +S K
Sbjct: 108 RHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|194766505|ref|XP_001965365.1| GF20663 [Drosophila ananassae]
gi|190617975|gb|EDV33499.1| GF20663 [Drosophila ananassae]
Length = 701
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF EK LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRRCGKSFYQEKHLDLHFDT 107
Query: 61 RHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 95
RH +++N CLAD C + C+ +S K
Sbjct: 108 RHKHIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|307110360|gb|EFN58596.1| hypothetical protein CHLNCDRAFT_140777 [Chlorella variabilis]
Length = 192
Score = 63.2 bits (152), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 72/176 (40%), Gaps = 40/176 (22%)
Query: 11 ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 70
+ + C L P +D +R QE K + WRC C K F AE+ +D H NRH +L
Sbjct: 1 MPAGCPLSPASDRYRHQEAAKETLAQGRWRCILCGKRFTAERHIDLHLANRHAHLEEQQQ 60
Query: 71 GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQG-PSASRLHE 129
L D C LA RCFP + G SA RLH
Sbjct: 61 ----------LQED-----------------------CLELAARCFPPHGGDGSAQRLHA 87
Query: 130 LFLRQFCDAHKCPRKSQ-----PFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 180
L + C AH C + + + R+ + +++ L++L F+L+++L
Sbjct: 88 LLAGRLCKAHTCDTRRKRSRTLAVAQAHRRWRGRWSFTLAA-GLVVLATFWLLLWL 142
>gi|195147218|ref|XP_002014577.1| GL18880 [Drosophila persimilis]
gi|194106530|gb|EDW28573.1| GL18880 [Drosophila persimilis]
Length = 683
Score = 62.4 bits (150), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 1/113 (0%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF E+ LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRRCGKSFYQERHLDLHFDT 107
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLA 112
RH ++N + CLAD C + C+ ++ K V +SL
Sbjct: 108 RHKTIINEAMDSICLADFCDIMRCEVFETEDATSLKFGDHIVTDIEVWGDSLG 160
>gi|198473764|ref|XP_001356431.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
gi|198138098|gb|EAL33495.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
Length = 684
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 1/113 (0%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDN 60
P +++ Q ++ +C LHP D+F ++ K +W C C KSF E+ LD HFD
Sbjct: 48 TPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRRCGKSFYQERHLDLHFDT 107
Query: 61 RHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLA 112
RH ++N + CLAD C + C+ ++ K V +SL
Sbjct: 108 RHKTIINEAMDSICLADFCDIMRCEVFETEDATSLKFGDHIVTDIEVWGDSLG 160
>gi|256083558|ref|XP_002578009.1| hypothetical protein [Schistosoma mansoni]
gi|350644825|emb|CCD60456.1| hypothetical protein Smp_067940 [Schistosoma mansoni]
Length = 129
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 51/121 (42%), Gaps = 8/121 (6%)
Query: 15 CRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSHGKC 73
C HP +D++ EQ K + +W C C K F E D H NRH N + S C
Sbjct: 5 CPFHPKHDIYAIHEQMKNKISDYDWECQMCGKRFYTENTFDLHIGNRHETNAYSTSRTIC 64
Query: 74 LADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASR 126
L+ C L C D+ + C+P + CE + N+C P S+++
Sbjct: 65 LSSYCPLLRCSVLKPDVDYGYQVFWDEALCDPKSFEAISKQCEDILNKCIPSGNDSSSTQ 124
Query: 127 L 127
L
Sbjct: 125 L 125
>gi|449682242|ref|XP_004210029.1| PREDICTED: uncharacterized protein LOC100205425 [Hydra
magnipapillata]
Length = 1324
Score = 55.8 bits (133), Expect = 1e-05, Method: Composition-based stats.
Identities = 43/185 (23%), Positives = 77/185 (41%), Gaps = 18/185 (9%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P E+ Q +I +C + +++R E K + +C +C K F E + +H D +
Sbjct: 1105 PIYEKYQLKIPRRCIFLQERNIYRGVEMGKTELSGGHIKCEFCGKLFENENYFYKHMDRK 1164
Query: 62 HNNLLNVSHG-KCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLCESLANRCF 116
H N+ + CL+D C CD + +N CN V K + CE + C
Sbjct: 1165 HENVYVFNKDFVCLSDYCDIFRCDLHEMYPVNPLDF-DVCNKKDVRKLQRKCEKIIYTCL 1223
Query: 117 PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYL 176
P +++ + C C F + S++Y++ LML+P +
Sbjct: 1224 PTEM---QHYVYDSLTKSLCAPLNCNNFYADF-----EDVSIWYIA---FMLMLVPATF- 1271
Query: 177 IVYLY 181
I++L+
Sbjct: 1272 ILFLF 1276
>gi|358337329|dbj|GAA55703.1| hypothetical protein CLF_108755 [Clonorchis sinensis]
Length = 463
Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 11 ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVS- 69
+ ++C +P D++ QE K +W+C C K F +E LD H DNRH + +
Sbjct: 282 VPNECPWNPHLDIYNLQELMKTQNSARDWQCNQCWKRFYSEHALDLHMDNRHRSTVYTGP 341
Query: 70 HGKCLADLCGALHCDFV 86
+ CLA C L CD +
Sbjct: 342 NATCLALFCPLLRCDVL 358
>gi|391348561|ref|XP_003748515.1| PREDICTED: uncharacterized protein LOC100905503 [Metaseiulus
occidentalis]
Length = 505
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 80/210 (38%), Gaps = 35/210 (16%)
Query: 2 PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNR 61
P +++ + + C HP D+ Q + + WRC C KSF E+ L QHF
Sbjct: 52 PVLDRYRVSMPEDCPFHPARDVLV-WPQRDANSITDVWRCPVCGKSFFGEENLTQHFHAH 110
Query: 62 HNNLLNVSHGK---------CLADLCGALHCDFVMNS--------KSSRSKCNPAAVAK- 103
H K CL C + CD + SSR +P A
Sbjct: 111 TGTRGGTQHVKRHITEGDRVCLGTFCDIMRCDVIERQLSEALTPRTSSRKTESPKPKADC 170
Query: 104 NRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK-----------CPRKSQPFPKGG 152
NR L NRC + Q + L+ L ++++ D + C R + P+
Sbjct: 171 NRQSMAVLENRCKAVMQQCTIGLLNVLSVKEYKDVEEALHQNICSYLTCERYWEEGPRDT 230
Query: 153 RKQPSVFYLSISILTLMLLPIF--YLIVYL 180
R P +L+ +I L+ F Y +V++
Sbjct: 231 RFSP---FLASTIGFLIASGFFGCYWVVWI 257
>gi|167519981|ref|XP_001744330.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777416|gb|EDQ91033.1| predicted protein [Monosiga brevicollis MX1]
Length = 272
Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 51/130 (39%), Gaps = 13/130 (10%)
Query: 1 MPFVEQVQYEI-----SSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLD 55
M +EQV Y S C ++D+F + + + C +C K+F FLD
Sbjct: 50 MAALEQVLYPFQPLTPSESCPWIVEHDLFYELNRDADVTAKGQPACLFCGKAFENAVFLD 109
Query: 56 QHFDNRH----NNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSKCNPAAVAKNRHL 107
QH+ RH +L N CLAD C L C + C+P +
Sbjct: 110 QHYPRRHLPTFLDLPNPDRRLCLADSCDVLQCAAYSEGAELWAFAHQPCSPGQYDMAQVR 169
Query: 108 CESLANRCFP 117
C + + CFP
Sbjct: 170 CRATIDTCFP 179
>gi|303390934|ref|XP_003073697.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
50506]
gi|303302845|gb|ADM12337.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
50506]
Length = 388
Score = 42.7 bits (99), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 33/50 (66%), Gaps = 5/50 (10%)
Query: 17 LHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRHNN 64
LHP +++E + + +NE ++CG+C KSF +EKF+ HF+N+H +
Sbjct: 278 LHPKT---KEEELNNMTTIVNENHYKCGFCGKSFESEKFIFNHFNNKHED 324
>gi|68489815|ref|XP_711247.1| potential zinc finger protein [Candida albicans SC5314]
gi|46432535|gb|EAK92012.1| potential zinc finger protein [Candida albicans SC5314]
Length = 132
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 8/75 (10%)
Query: 15 CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 71
C HP N F D +Q+ +HV I+E C C K F +KFL H D HN +L +
Sbjct: 28 CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPDKKFLSLHIDENHNPILQIKQER 87
Query: 72 -----KCLADLCGAL 81
KC + C A+
Sbjct: 88 GDKIFKCYTESCKAV 102
>gi|19074624|ref|NP_586130.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
gi|51702181|sp|Q8SUH6.1|ZA15_ENCCU RecName: Full=Zinc finger C2H2 protein ECU10_0150
gi|19069266|emb|CAD25734.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
Length = 388
Score = 41.2 bits (95), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 30/42 (71%), Gaps = 2/42 (4%)
Query: 25 RDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRHNN 64
+++E + + +NE ++CG+C K+F +EKF+ HF+N+H N
Sbjct: 283 KEEELNNMTTIVNENHYKCGFCGKAFESEKFIFNHFNNKHEN 324
>gi|412990909|emb|CCO18281.1| predicted protein [Bathycoccus prasinos]
Length = 305
Score = 41.2 bits (95), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 88/252 (34%), Gaps = 51/252 (20%)
Query: 1 MPFVEQVQYEISSKCRLHPDNDMFRDQE-----QHKIHVDINEWRCGYCKKSFRAEKFLD 55
P + E+S +C D++R+ E +H EW+ K F++E +D
Sbjct: 34 FPLFTNEKEEVSERCPFKKSRDVYREIESRKTRRHNKRKKEEEWK-DVDGKVFKSEADID 92
Query: 56 QHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNS-------KSSRSKCNPAAVAKNRHL 107
+ R + G CLAD C L CD S +S + + AA +
Sbjct: 93 AYLSGR---FFKSTKGDVCLADYCDVLQCDEYYESLETSSSSSTSSGEKDYAAKTRAGAH 149
Query: 108 CESLANRCFPIN-----QGPSASRLHELFLRQFCDAHKCPR------KSQPFPKGG---- 152
C +L CFP Q +A R +F F H+ R + Q + G
Sbjct: 150 CYALTKTCFPERSESAVQEKNARRFESIFCDTFVGLHQDGRNAFAHIRKQEYKYPGHVRH 209
Query: 153 ----------------RKQPSVFYLSISILTLMLLPIFYLIVYL---YQREMRGGTQELK 193
S +L IS L L IFYL+ +L R G + +
Sbjct: 210 ANTNIDASASPSSSSSSSSSSSSFLIISTLFTFLGFIFYLLFFLAPSINRRASGSDLKAR 269
Query: 194 RIPRVGRKAKPS 205
+ R K K +
Sbjct: 270 KGRRTASKKKST 281
>gi|347967063|ref|XP_550770.4| AGAP002041-PA [Anopheles gambiae str. PEST]
gi|333469774|gb|EAL38493.4| AGAP002041-PA [Anopheles gambiae str. PEST]
Length = 1932
Score = 41.2 bits (95), Expect = 0.27, Method: Composition-based stats.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)
Query: 29 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 87
+HK+ H D ++ CG C K+F+ + L+ H N +C AD CG +CD
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316
Query: 88 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 125
++S R +N H LAN + GP S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347
>gi|347967065|ref|XP_003436012.1| AGAP002041-PB [Anopheles gambiae str. PEST]
gi|333469775|gb|EGK97401.1| AGAP002041-PB [Anopheles gambiae str. PEST]
Length = 1876
Score = 41.2 bits (95), Expect = 0.28, Method: Composition-based stats.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)
Query: 29 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 87
+HK+ H D ++ CG C K+F+ + L+ H N +C AD CG +CD
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316
Query: 88 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 125
++S R +N H LAN + GP S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347
>gi|68489768|ref|XP_711271.1| potential zinc finger protein [Candida albicans SC5314]
gi|46432560|gb|EAK92036.1| potential zinc finger protein [Candida albicans SC5314]
Length = 132
Score = 40.8 bits (94), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 8/75 (10%)
Query: 15 CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 71
C HP N F D +Q+ +HV I+E C C K F + FL H D HN LL +
Sbjct: 28 CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPNKSFLSLHIDENHNPLLQIKQER 87
Query: 72 -----KCLADLCGAL 81
KC + C A+
Sbjct: 88 GDKIFKCYTESCKAV 102
>gi|401827595|ref|XP_003888090.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
50504]
gi|392999290|gb|AFM99109.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
50504]
Length = 388
Score = 40.4 bits (93), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 29/40 (72%), Gaps = 2/40 (5%)
Query: 25 RDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRH 62
+++E + + +NE ++CG+C KSF +EKF+ HF+N+H
Sbjct: 283 KEEELNNMTTIVNENHYKCGFCGKSFESEKFIFNHFNNKH 322
>gi|66475454|ref|XP_627543.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
[Cryptosporidium parvum Iowa II]
gi|46228996|gb|EAK89845.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
[Cryptosporidium parvum Iowa II]
Length = 668
Score = 40.4 bits (93), Expect = 0.43, Method: Composition-based stats.
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 28 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
EQH + + W+CG C K F+ E+++ +H +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571
>gi|357631664|gb|EHJ79133.1| putative zinc finger protein 91-like protein [Danaus plexippus]
Length = 390
Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 29 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 64
QH + H+DIN+++CG+C K++++ + L +HF H++
Sbjct: 189 QHLVSHIDINQFKCGFCNKTYKSRQSLKEHFRVAHSS 225
>gi|392568613|gb|EIW61787.1| hypothetical protein TRAVEDRAFT_27265 [Trametes versicolor
FP-101664 SS1]
Length = 904
Score = 38.1 bits (87), Expect = 2.1, Method: Composition-based stats.
Identities = 15/47 (31%), Positives = 29/47 (61%)
Query: 20 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
+ ++ R E H D ++RC C+K F+A F+++H N+H++L+
Sbjct: 670 EEELSRACEAHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHSDLI 716
>gi|195107861|ref|XP_001998512.1| GI23595 [Drosophila mojavensis]
gi|193915106|gb|EDW13973.1| GI23595 [Drosophila mojavensis]
Length = 2064
Score = 38.1 bits (87), Expect = 2.3, Method: Composition-based stats.
Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 5/88 (5%)
Query: 29 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 87
+HK+ H D ++ C C K+F+ + L+ H N +C A+ CG +CD
Sbjct: 351 KHKLTHSDERKYICALCSKAFKRQDHLNGHMMTHRNK----KPYECKAEGCGKSYCDARS 406
Query: 88 NSKSSRSKCNPAAVAKNRHLCESLANRC 115
+ S + AA N L + + C
Sbjct: 407 LRRHSENHHGGAATPNNNSLSPTTSTNC 434
>gi|241956272|ref|XP_002420856.1| zinc finger protein, putative [Candida dubliniensis CD36]
gi|223644199|emb|CAX41009.1| zinc finger protein, putative [Candida dubliniensis CD36]
Length = 143
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 8/75 (10%)
Query: 15 CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 71
C HP N F + +Q+ +HV +E C C+K F + +L H D HN LL +
Sbjct: 39 CNYHPCLNSAFENYQQYHLHVLSTHEHTCESCQKVFPNKNYLSLHIDENHNPLLQIKQER 98
Query: 72 -----KCLADLCGAL 81
KC + C A+
Sbjct: 99 GDKIFKCYTETCKAV 113
>gi|409049944|gb|EKM59421.1| hypothetical protein PHACADRAFT_249892 [Phanerochaete carnosa
HHB-10118-sp]
Length = 924
Score = 37.7 bits (86), Expect = 3.0, Method: Composition-based stats.
Identities = 15/39 (38%), Positives = 25/39 (64%)
Query: 28 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
EQH D ++RC C+K F+A F+++H N+H +L+
Sbjct: 693 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPDLV 731
>gi|449549636|gb|EMD40601.1| hypothetical protein CERSUDRAFT_111186 [Ceriporiopsis subvermispora
B]
Length = 926
Score = 37.0 bits (84), Expect = 4.2, Method: Composition-based stats.
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 28 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
EQH D ++RC C+K F+A F+++H N+H L+
Sbjct: 694 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPELV 732
>gi|67612273|ref|XP_667210.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis TU502]
gi|54658324|gb|EAL36982.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis]
Length = 668
Score = 37.0 bits (84), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 28 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
EQH + + W+CG C K F+ E+++ +H +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571
>gi|148698279|gb|EDL30226.1| mCG12246 [Mus musculus]
Length = 334
Score = 37.0 bits (84), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 29 QH-KIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 62
QH +IH+ + + C YC KSFR L QH N H
Sbjct: 270 QHLRIHLGVKPYHCSYCDKSFRQLSHLQQHTRNPH 304
>gi|32398760|emb|CAD98720.1| hypothetical predicted zf-C2H2 zinc finger protein, unknown
function [Cryptosporidium parvum]
Length = 598
Score = 36.6 bits (83), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 28 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
EQH + + W+CG C K F+ E+++ +H +H + L
Sbjct: 463 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 501
>gi|157136418|ref|XP_001663747.1| hypothetical protein AaeL_AAEL013548 [Aedes aegypti]
gi|108869956|gb|EAT34181.1| AAEL013548-PA [Aedes aegypti]
Length = 566
Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 32 IHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNL 65
IHVD +RC C K+F+ +++L +H +N H ++
Sbjct: 303 IHVDPKTYRCEPCDKTFKGKRYLQEHLENVHGSI 336
>gi|409075328|gb|EKM75709.1| hypothetical protein AGABI1DRAFT_87630 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 864
Score = 36.2 bits (82), Expect = 7.7, Method: Composition-based stats.
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 20 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
+ ++ + E H D ++RC C+K FRA F+++H N+H L+
Sbjct: 619 EEELSKAAEPHIKQEDEGKFRCKTCQKLFRATPFVEKHIANKHPELV 665
>gi|50424891|ref|XP_461036.1| DEHA2F15532p [Debaryomyces hansenii CBS767]
gi|49656705|emb|CAG89406.1| DEHA2F15532p [Debaryomyces hansenii CBS767]
Length = 147
Score = 36.2 bits (82), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 24 FRDQEQHKIH-VDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK 72
F D ++ H VD + + C C+K F +E FLD H D HN +S K
Sbjct: 40 FNDYLAYEAHIVDTHNFLCLECQKKFPSETFLDIHIDENHNPFFQISKEK 89
>gi|426197981|gb|EKV47907.1| hypothetical protein AGABI2DRAFT_191625 [Agaricus bisporus var.
bisporus H97]
Length = 898
Score = 36.2 bits (82), Expect = 8.2, Method: Composition-based stats.
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 20 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
+ ++ + E H D ++RC C+K FRA F+++H N+H L+
Sbjct: 653 EEELSKAAEPHIKQEDEGKFRCKTCQKLFRATPFVEKHIANKHPELV 699
>gi|242223936|ref|XP_002477512.1| predicted protein [Postia placenta Mad-698-R]
gi|220722825|gb|EED77289.1| predicted protein [Postia placenta Mad-698-R]
Length = 1194
Score = 36.2 bits (82), Expect = 8.5, Method: Composition-based stats.
Identities = 14/47 (29%), Positives = 27/47 (57%)
Query: 20 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
+ ++ + E H D ++RC C+K F+A F+++H N+H L+
Sbjct: 83 EEELSKAAEPHMKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPELV 129
>gi|393246564|gb|EJD54073.1| hypothetical protein AURDEDRAFT_110707 [Auricularia delicata
TFB-10046 SS5]
Length = 850
Score = 36.2 bits (82), Expect = 8.5, Method: Composition-based stats.
Identities = 15/48 (31%), Positives = 27/48 (56%)
Query: 19 PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 66
PD ++ + E D ++RC C K F+A F+++H N+H+ L+
Sbjct: 637 PDEELTKAVEPFIKQEDEGKFRCKTCSKLFKATSFVEKHIANKHSELV 684
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.138 0.444
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,197,713,580
Number of Sequences: 23463169
Number of extensions: 122365442
Number of successful extensions: 498701
Number of sequences better than 100.0: 201
Number of HSP's better than 100.0 without gapping: 122
Number of HSP's successfully gapped in prelim test: 79
Number of HSP's that attempted gapping in prelim test: 498242
Number of HSP's gapped (non-prelim): 437
length of query: 205
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 69
effective length of database: 9,168,204,383
effective search space: 632606102427
effective search space used: 632606102427
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 73 (32.7 bits)