BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 024416
(268 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224133834|ref|XP_002321672.1| predicted protein [Populus trichocarpa]
gi|222868668|gb|EEF05799.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/268 (71%), Positives = 227/268 (84%), Gaps = 2/268 (0%)
Query: 1 MGRNTQIFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIE 60
MG+ +I +FL+ L S Q + + A+RT+KQE G AH+VHCSRERSRAAW IE
Sbjct: 1 MGKAARITQILFLISLLSKQILSQDVGEHVAARTVKQEEGHAHKVHCSRERSRAAWQAIE 60
Query: 61 EYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQH 120
EYLMPFVE+ QY+ISSKCRLHP ND+FRDQE+HKIHVDINEW+CGYCKK FRAEK+LDQH
Sbjct: 61 EYLMPFVEREQYQISSKCRLHPSNDLFRDQEEHKIHVDINEWKCGYCKKHFRAEKYLDQH 120
Query: 121 FDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFP 180
FDNRH NLLN+S GKC+ADLCGALHCDF+M+SKS+++KCNPAAVAKN HLCESLA+ CFP
Sbjct: 121 FDNRHYNLLNISDGKCMADLCGALHCDFMMDSKSTKTKCNPAAVAKNHHLCESLADSCFP 180
Query: 181 INQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLI 240
+NQGPSASRLHELFL QFCDAH C + FPKGG+K+ SVFYL+ISILT+M++P+FYLI
Sbjct: 181 LNQGPSASRLHELFLHQFCDAHTCSGNKKLFPKGGKKKTSVFYLAISILTMMMIPLFYLI 240
Query: 241 VYLYQREMRGGTQELKRIPRVGRKAKPS 268
VYL+Q EMR GTQEL+RI + RK KPS
Sbjct: 241 VYLHQSEMRKGTQELRRISK--RKTKPS 266
>gi|359491160|ref|XP_002262859.2| PREDICTED: uncharacterized protein LOC100244377 [Vitis vinifera]
Length = 254
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/260 (73%), Positives = 218/260 (83%), Gaps = 8/260 (3%)
Query: 10 SIFLLLLFSPQEILIQGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVE 68
+IF L +F Q F++SA+ R LKQ+ AHEVHCSRERSRAAW +IEEY+MPFVE
Sbjct: 2 TIFNLFMF-------QDFEESAAARNLKQQKDHAHEVHCSRERSRAAWKIIEEYMMPFVE 54
Query: 69 QVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNL 128
+ QY+ISS C+LHPDN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH L
Sbjct: 55 REQYQISSSCKLHPDNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHYGL 114
Query: 129 LNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 188
LNVSH KCLADLCGALHCD VMNSK+ ++KCNPAA +NRHLCESLA+ CFP+N G SAS
Sbjct: 115 LNVSHSKCLADLCGALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNHGLSAS 174
Query: 189 RLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREM 248
RLH++FLRQFCDAH C +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYLYQREM
Sbjct: 175 RLHDIFLRQFCDAHTCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYLYQREM 234
Query: 249 RGGTQELKRIPRVGRKAKPS 268
R GTQELKRI R G K KPS
Sbjct: 235 RNGTQELKRISRHGLKTKPS 254
>gi|297733623|emb|CBI14870.3| unnamed protein product [Vitis vinifera]
Length = 276
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 187/246 (76%), Positives = 212/246 (86%), Gaps = 1/246 (0%)
Query: 24 IQGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHP 82
Q F++SA+ R LKQ+ AHEVHCSRERSRAAW +IEEY+MPFVE+ QY+ISS C+LHP
Sbjct: 31 TQDFEESAAARNLKQQKDHAHEVHCSRERSRAAWKIIEEYMMPFVEREQYQISSSCKLHP 90
Query: 83 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCG 142
DN +F DQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH LLNVSH KCLADLCG
Sbjct: 91 DNHLFSDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHYGLLNVSHSKCLADLCG 150
Query: 143 ALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAH 202
ALHCD VMNSK+ ++KCNPAA +NRHLCESLA+ CFP+N G SASRLH++FLRQFCDAH
Sbjct: 151 ALHCDLVMNSKTPKAKCNPAAAVRNRHLCESLADSCFPVNHGLSASRLHDIFLRQFCDAH 210
Query: 203 KCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVG 262
C +PF +GG KQPS+FYL+ISILTLM+LP+FY+IVYLYQREMR GTQELKRI R G
Sbjct: 211 TCSGGGKPFSRGGMKQPSIFYLAISILTLMMLPVFYVIVYLYQREMRNGTQELKRISRHG 270
Query: 263 RKAKPS 268
K KPS
Sbjct: 271 LKTKPS 276
>gi|296081316|emb|CBI17698.3| unnamed protein product [Vitis vinifera]
Length = 409
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/279 (64%), Positives = 211/279 (75%), Gaps = 11/279 (3%)
Query: 1 MGRNTQIFTSIFLLLL----------FSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSR 49
MGR+ +F+ + L + SP QGF +S SR +KQ+ HEVHCSR
Sbjct: 131 MGRSASLFSMVLWLWVCLQLRPTTSSISPPLHTNQGFQESTNSRNVKQDQDGVHEVHCSR 190
Query: 50 ERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKK 109
ERSRAAW +IEEYLMPFVE+ Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKK
Sbjct: 191 ERSRAAWKIIEEYLMPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKK 250
Query: 110 SFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRH 169
SF AE +LD+HFDNRH NLLNVS KCLAD+CGALHCD VM+SK ++KCNPAA A+N H
Sbjct: 251 SFYAENYLDKHFDNRHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHH 310
Query: 170 LCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISIL 229
LCESLA+ CFP++ GPSASRLHE FLRQFCDAH C +PF +G +K+ SVFYL ISIL
Sbjct: 311 LCESLADGCFPVSGGPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISIL 370
Query: 230 TLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
TLMLLP+FY VYLYQR +R G Q +RI GRK KP+
Sbjct: 371 TLMLLPLFYFFVYLYQRGLRRGAQNFRRISESGRKKKPT 409
>gi|359488643|ref|XP_002271064.2| PREDICTED: uncharacterized protein LOC100257742 [Vitis vinifera]
Length = 279
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 179/279 (64%), Positives = 211/279 (75%), Gaps = 11/279 (3%)
Query: 1 MGRNTQIFTSIFLLLL----------FSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSR 49
MGR+ +F+ + L + SP QGF +S SR +KQ+ HEVHCSR
Sbjct: 1 MGRSASLFSMVLWLWVCLQLRPTTSSISPPLHTNQGFQESTNSRNVKQDQDGVHEVHCSR 60
Query: 50 ERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKK 109
ERSRAAW +IEEYLMPFVE+ Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKK
Sbjct: 61 ERSRAAWKIIEEYLMPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKK 120
Query: 110 SFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRH 169
SF AE +LD+HFDNRH NLLNVS KCLAD+CGALHCD VM+SK ++KCNPAA A+N H
Sbjct: 121 SFYAENYLDKHFDNRHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHH 180
Query: 170 LCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISIL 229
LCESLA+ CFP++ GPSASRLHE FLRQFCDAH C +PF +G +K+ SVFYL ISIL
Sbjct: 181 LCESLADGCFPVSGGPSASRLHEFFLRQFCDAHTCTGGRKPFSRGRKKRTSVFYLCISIL 240
Query: 230 TLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
TLMLLP+FY VYLYQR +R G Q +RI GRK KP+
Sbjct: 241 TLMLLPLFYFFVYLYQRGLRRGAQNFRRISESGRKKKPT 279
>gi|226531528|ref|NP_001142988.1| uncharacterized protein LOC100275449 precursor [Zea mays]
gi|195612594|gb|ACG28127.1| zinc finger protein [Zea mays]
Length = 276
Score = 369 bits (947), Expect = e-100, Method: Compositional matrix adjust.
Identities = 168/238 (70%), Positives = 200/238 (84%)
Query: 31 ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
SR L Q AHEVHCSRERSRAAW I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39 GSRILLQTNPDAHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 98
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 99 EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 158
Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQP 210
K +SKCN AA +NRHLCESLA+ CFPINQG +ASRLHE FLRQFCDAH C + ++P
Sbjct: 159 QFKKPKSKCNAAAATRNRHLCESLADSCFPINQGLAASRLHEFFLRQFCDAHTCSKGTKP 218
Query: 211 FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
FPKGGRKQ + YL++SILTL+LLP+FYLIV+L+QREMR GTQ+LKR ++G+K KPS
Sbjct: 219 FPKGGRKQTNRLYLALSILTLILLPLFYLIVFLHQREMRKGTQDLKRFSKIGQKKKPS 276
>gi|242083250|ref|XP_002442050.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
gi|241942743|gb|EES15888.1| hypothetical protein SORBIDRAFT_08g008280 [Sorghum bicolor]
Length = 276
Score = 369 bits (947), Expect = e-100, Method: Compositional matrix adjust.
Identities = 173/255 (67%), Positives = 207/255 (81%), Gaps = 1/255 (0%)
Query: 15 LLFSPQEILIQGFDDS-ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYE 73
LLF Q F ++ SR L Q AHEVHCSRERSRAAW I+EYLMPFVE+ +YE
Sbjct: 22 LLFVDGSHQDQEFREAVGSRILLQTNPDAHEVHCSRERSRAAWEAIDEYLMPFVEKEKYE 81
Query: 74 ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 133
+ SKCRL PDNDMFR+QEQHKIH DINEW CG+CKK+FRAEKFLDQHF+NRH NL++ S
Sbjct: 82 LPSKCRLRPDNDMFREQEQHKIHFDINEWHCGFCKKAFRAEKFLDQHFENRHKNLVDNSE 141
Query: 134 GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHEL 193
G+C+ADLCGALHCD +M K +SKCN AA +NRHLCESLA+ CFP+NQG +ASRLHE
Sbjct: 142 GRCMADLCGALHCDLMMQFKKPKSKCNAAAATRNRHLCESLADSCFPVNQGLAASRLHEF 201
Query: 194 FLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQ 253
FLRQFCDAH C + ++PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L+QREMR GTQ
Sbjct: 202 FLRQFCDAHTCNKGTKPFPKGGRKQTNRFYLALCILTLILLPLFYLIVFLHQREMRKGTQ 261
Query: 254 ELKRIPRVGRKAKPS 268
+LKR ++G+K KPS
Sbjct: 262 DLKRFSKIGQKKKPS 276
>gi|238013592|gb|ACR37831.1| unknown [Zea mays]
gi|413916779|gb|AFW56711.1| zinc finger protein [Zea mays]
Length = 275
Score = 368 bits (945), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 168/238 (70%), Positives = 201/238 (84%), Gaps = 1/238 (0%)
Query: 31 ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
SR L+ P AHEVHCSRERSRAAW I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39 GSRILQTNPD-AHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 97
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 98 EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 157
Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQP 210
K +SKCN AA +NRHLCESLA+ CFPINQG +ASRLHE FLRQFCDAH C + ++P
Sbjct: 158 QFKKPKSKCNAAAATRNRHLCESLADSCFPINQGLAASRLHEFFLRQFCDAHTCSKGTKP 217
Query: 211 FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
FPKGGRKQ + YL++SILTL+LLP+FYLIV+L+QREMR GTQ+LKR ++G+K KPS
Sbjct: 218 FPKGGRKQTNRLYLALSILTLILLPLFYLIVFLHQREMRKGTQDLKRFSKIGQKKKPS 275
>gi|449440588|ref|XP_004138066.1| PREDICTED: uncharacterized protein LOC101218367 [Cucumis sativus]
Length = 280
Score = 365 bits (936), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 166/244 (68%), Positives = 202/244 (82%), Gaps = 1/244 (0%)
Query: 25 QGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
Q + SA+ R L+Q EVHCSRERSR AW +IEE+L+PF+E+ YE+S++CRLHP+
Sbjct: 36 QDEEQSATLRPLEQNEEHVDEVHCSRERSRTAWNIIEEHLLPFMEKENYEVSTQCRLHPN 95
Query: 84 NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDNRH+NLLNVSHGKCLADLCGA
Sbjct: 96 NDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHSNLLNVSHGKCLADLCGA 155
Query: 144 LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK 203
LHCD M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+GPSA+RLHELFL QFC AH
Sbjct: 156 LHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINEGPSANRLHELFLHQFCGAHS 215
Query: 204 CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGR 263
C K +PF +G +QP +FY++ SIL LMLLPIFY+IVYL++RE R G + LKRI + GR
Sbjct: 216 CTGKQKPFSRGAARQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLKRISKAGR 275
Query: 264 KAKP 267
K KP
Sbjct: 276 KNKP 279
>gi|218186624|gb|EEC69051.1| hypothetical protein OsI_37889 [Oryza sativa Indica Group]
Length = 330
Score = 363 bits (932), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 173/269 (64%), Positives = 211/269 (78%), Gaps = 2/269 (0%)
Query: 2 GRN-TQIFTSIFLLLLFSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSRERSRAAWGVI 59
GR+ + + +F L + Q F ++A SRTL ++VHCSRERSRAAW I
Sbjct: 62 GRSRSAAYLVLFASCLLAVAAASHQEFHEAAGSRTLLMSHEHTNQVHCSRERSRAAWKAI 121
Query: 60 EEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQ 119
+EYLMPFVE+ +YE+ SKCRLHPDNDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQ
Sbjct: 122 DEYLMPFVEKEKYELPSKCRLHPDNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQ 181
Query: 120 HFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCF 179
HF NRHNNL++ S G+CLADLCGALHCD ++ K +SKC+ A A+NRHLCESLA+ CF
Sbjct: 182 HFHNRHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCF 241
Query: 180 PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYL 239
PINQG SASRLHE FLRQFCDAH C S+PFPKGGRKQ + FYL++ LT++LLP+FYL
Sbjct: 242 PINQGQSASRLHEFFLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYL 301
Query: 240 IVYLYQREMRGGTQELKRIPRVGRKAKPS 268
IV+L+QREM+ G Q L+RI +V +K KPS
Sbjct: 302 IVFLHQREMKKGGQNLRRISKVVQKKKPS 330
>gi|326487255|dbj|BAJ89612.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326532646|dbj|BAJ89168.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 273
Score = 362 bits (930), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 174/256 (67%), Positives = 201/256 (78%), Gaps = 4/256 (1%)
Query: 13 LLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQY 72
LL+ S QE+ + S SR L Q HEVHCSRERSRAAW I+EYLMPFVE+ +Y
Sbjct: 22 LLVAGSHQEL----HEASGSRILHQTKPHTHEVHCSRERSRAAWKAIDEYLMPFVEKEKY 77
Query: 73 EISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVS 132
E+ SKCRL P+NDMFR+QEQHKIH D+NEWRCG+CKKSFRAEKF+DQHF NRHNNLL+ S
Sbjct: 78 ELPSKCRLRPENDMFREQEQHKIHFDVNEWRCGFCKKSFRAEKFIDQHFANRHNNLLDNS 137
Query: 133 HGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHE 192
G+CLADLCGALHCD M K +SKCN AA A+NRH CESLA+ CFPINQG SASRLHE
Sbjct: 138 QGRCLADLCGALHCDLTMEFKKPKSKCNDAAAARNRHRCESLADSCFPINQGHSASRLHE 197
Query: 193 LFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGT 252
FL QFCDAH C S+PFPKGGRKQ + FYL+I ILT++LLP+FYLIV+L+QREM+ G
Sbjct: 198 FFLHQFCDAHTCNGVSKPFPKGGRKQTNRFYLAICILTVLLLPVFYLIVFLHQREMKKGD 257
Query: 253 QELKRIPRVGRKAKPS 268
Q KRI + K KPS
Sbjct: 258 QVFKRIGKTVHKKKPS 273
>gi|297793919|ref|XP_002864844.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310679|gb|EFH41103.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 270
Score = 359 bits (921), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 181/270 (67%), Positives = 215/270 (79%), Gaps = 2/270 (0%)
Query: 1 MGRNTQIFTSIFLLLLFSPQEILIQGFDDSASRTLKQEP--GRAHEVHCSRERSRAAWGV 58
MGR + I LL + L QGF++S S L E G + E+HCSRERSRAAW +
Sbjct: 1 MGRLCESLPVILLLFALLLHQSLSQGFEESESTRLVNEEVGGNSPEIHCSRERSRAAWQI 60
Query: 59 IEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLD 118
I++YL PFVE+ +Y+I CRLHPDND++RDQEQHKIHVD+ EW+CGYCKKSF EKFLD
Sbjct: 61 IQDYLTPFVERERYQIPKNCRLHPDNDLYRDQEQHKIHVDVFEWKCGYCKKSFNDEKFLD 120
Query: 119 QHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRC 178
+HF RH NLLN + KCLADLCGALHCDFV+NSK ++KCNPAAVAKNRHLCES+AN C
Sbjct: 121 KHFTTRHYNLLNTTDTKCLADLCGALHCDFVLNSKKPKTKCNPAAVAKNRHLCESVANSC 180
Query: 179 FPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFY 238
FP++QGPSASRLHE FLRQFCDAH C K +PFP+GG+K+ VFYL+ISILTLMLLP+FY
Sbjct: 181 FPVSQGPSASRLHEHFLRQFCDAHTCTGKDKPFPRGGKKKSGVFYLAISILTLMLLPLFY 240
Query: 239 LIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
L+V+LYQRE R GTQEL+RI + GRK KPS
Sbjct: 241 LLVFLYQREKRSGTQELRRIRKTGRKPKPS 270
>gi|115487926|ref|NP_001066450.1| Os12g0233400 [Oryza sativa Japonica Group]
gi|108862374|gb|ABA96879.2| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|108862375|gb|ABG21933.1| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|108862376|gb|ABG21934.1| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|108862377|gb|ABG21935.1| zinc finger family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113648957|dbj|BAF29469.1| Os12g0233400 [Oryza sativa Japonica Group]
gi|215697514|dbj|BAG91508.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 275
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 171/269 (63%), Positives = 209/269 (77%), Gaps = 2/269 (0%)
Query: 2 GRN-TQIFTSIFLLLLFSPQEILIQGFDDSA-SRTLKQEPGRAHEVHCSRERSRAAWGVI 59
GR+ + + +F L + Q F ++A SRTL ++VHCSRERSRAAW I
Sbjct: 7 GRSRSAAYLVLFASCLLAVAAASHQEFHEAAGSRTLLMSHEHTNQVHCSRERSRAAWKAI 66
Query: 60 EEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQ 119
+EYLMPFVE+ +YE+ SKCRLHP NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQ
Sbjct: 67 DEYLMPFVEKEKYELPSKCRLHPGNDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQ 126
Query: 120 HFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCF 179
HF NRHNNL++ S G+CLADLCGALHCD ++ K +SKC+ A A+NRHLCESLA+ CF
Sbjct: 127 HFHNRHNNLVDNSQGRCLADLCGALHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCF 186
Query: 180 PINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYL 239
PINQG SASRLHE LRQFCDAH C S+PFPKGGRKQ + FYL++ LT++LLP+FYL
Sbjct: 187 PINQGQSASRLHEFLLRQFCDAHTCKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYL 246
Query: 240 IVYLYQREMRGGTQELKRIPRVGRKAKPS 268
IV+L+QREM+ G Q L+RI +V +K KPS
Sbjct: 247 IVFLHQREMKKGGQNLRRISKVVQKKKPS 275
>gi|356548383|ref|XP_003542582.1| PREDICTED: uncharacterized protein LOC100780668 [Glycine max]
Length = 244
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 161/247 (65%), Positives = 197/247 (79%), Gaps = 6/247 (2%)
Query: 22 ILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLH 81
+L++G + +QE A EVHCSRERSRAA VIEEYL PF+E+ Y++SSKC+LH
Sbjct: 4 LLVEGIEQ------EQEQKHAQEVHCSRERSRAARKVIEEYLTPFMERENYQLSSKCKLH 57
Query: 82 PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLC 141
P+ND+F DQE+HKI++D +EWRCGYCKKSFR EKFLDQHFD+RH N LN+SHGKCL DLC
Sbjct: 58 PENDIFTDQEEHKIYIDRHEWRCGYCKKSFREEKFLDQHFDSRHYNFLNLSHGKCLGDLC 117
Query: 142 GALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDA 201
GALHCD VMN KSSRSKCNPAA A+NRHLCE+LA+ CFPI++GPSA RLHELFL QFCDA
Sbjct: 118 GALHCDAVMNFKSSRSKCNPAAAARNRHLCENLADNCFPISEGPSAGRLHELFLHQFCDA 177
Query: 202 HKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 261
H C K +PF +GG+ Q S F L+ L L+LLP+FYL +YL Q +++ TQEL+RI +
Sbjct: 178 HTCSGKHKPFSRGGKDQSSFFRLAAGALILVLLPVFYLFLYLVQSDVKSRTQELRRISKA 237
Query: 262 GRKAKPS 268
G K+KPS
Sbjct: 238 GWKSKPS 244
>gi|18424689|ref|NP_568969.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
gi|17381006|gb|AAL36315.1| unknown protein [Arabidopsis thaliana]
gi|21536913|gb|AAM61245.1| zinc finger-like protein [Arabidopsis thaliana]
gi|22136886|gb|AAM91787.1| unknown protein [Arabidopsis thaliana]
gi|332010344|gb|AED97727.1| C2H2-like zinc finger protein [Arabidopsis thaliana]
Length = 271
Score = 344 bits (882), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 168/247 (68%), Positives = 202/247 (81%), Gaps = 3/247 (1%)
Query: 25 QGFDDSASRTLKQEP---GRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLH 81
QGF++S S L E A E+HCSRERSRAAW +I++YL PFVE+ +YEI CRLH
Sbjct: 25 QGFEESESTRLVNEEVEVSNAPEIHCSRERSRAAWQIIQDYLTPFVERERYEIPKNCRLH 84
Query: 82 PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLC 141
PDND++RDQE HK+HVD+ EW+CGYCKKSF EKFLD+HF RH NLLN + KCLADLC
Sbjct: 85 PDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFSTRHYNLLNTTDTKCLADLC 144
Query: 142 GALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDA 201
GALHCDFV++SK +SKCNP AVAKNRHLCES+AN CFP++QGPSASRLHE FLRQFCDA
Sbjct: 145 GALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQGPSASRLHEHFLRQFCDA 204
Query: 202 HKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 261
H C +PFP+GG+K+ VFYL+ISILTLMLLP+FYL+V+L+QRE R GTQ+L+RI +
Sbjct: 205 HTCTGNDKPFPRGGKKKSGVFYLAISILTLMLLPLFYLLVFLHQREKRSGTQDLRRIIKS 264
Query: 262 GRKAKPS 268
G+K KPS
Sbjct: 265 GKKTKPS 271
>gi|302398695|gb|ADL36642.1| C2H2L domain class transcription factor [Malus x domestica]
Length = 274
Score = 339 bits (870), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 158/250 (63%), Positives = 199/250 (79%), Gaps = 6/250 (2%)
Query: 25 QGFDDSAS----RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRL 80
QG +DSA+ +T+ +E G EVHCSRERS+AAW +I+EYLMPFVE+ +Y+IS C+L
Sbjct: 25 QGHEDSAATSFFKTINKELGDPPEVHCSRERSKAAWKIIQEYLMPFVEKERYQISRTCKL 84
Query: 81 HPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADL 140
HPDND++RDQE+HKI VD+NEW+CGYCKK F +KFLD+HFDNRH NLLNVSH +CLAD+
Sbjct: 85 HPDNDLYRDQEEHKIQVDLNEWQCGYCKKRFYDDKFLDKHFDNRHYNLLNVSHTRCLADV 144
Query: 141 CGALHCDFVMNS-KSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFC 199
CGALHCD M+S ++KCNPAA A+NRHLCE LA+ CFP+ QGP+ASRLHE FLRQFC
Sbjct: 145 CGALHCDLEMDSVPPKKTKCNPAAAARNRHLCEGLADSCFPVKQGPAASRLHEFFLRQFC 204
Query: 200 DAHKCPRKSQPFPKGGRKQ-PSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRI 258
D+H C + +PF +G RK+ S+ YL ISILTL LL +FY +Y+Y R ++ GTQELKR+
Sbjct: 205 DSHTCTGRQKPFSRGRRKKRASISYLVISILTLALLFLFYSYIYMYSRGVKRGTQELKRV 264
Query: 259 PRVGRKAKPS 268
+GRK KPS
Sbjct: 265 TPIGRKKKPS 274
>gi|222616852|gb|EEE52984.1| hypothetical protein OsJ_35656 [Oryza sativa Japonica Group]
Length = 1075
Score = 338 bits (867), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 154/222 (69%), Positives = 183/222 (82%), Gaps = 1/222 (0%)
Query: 25 QGFDDSA-SRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
Q F ++A SRTL ++VHCSRERSRAAW I+EYLMPFVE+ +YE+ SKCRLHP
Sbjct: 819 QEFHEAAGSRTLLMSHEHTNQVHCSRERSRAAWKAIDEYLMPFVEKEKYELPSKCRLHPG 878
Query: 84 NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
NDMFR+QEQHKIH DINEWRCG+CKK+FRAEKFLDQHF NRHNNL++ S G+CLADLCGA
Sbjct: 879 NDMFREQEQHKIHFDINEWRCGFCKKAFRAEKFLDQHFHNRHNNLVDNSQGRCLADLCGA 938
Query: 144 LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK 203
LHCD ++ K +SKC+ A A+NRHLCESLA+ CFPINQG SASRLHE LRQFCDAH
Sbjct: 939 LHCDLMLEFKKPKSKCSATAAARNRHLCESLADSCFPINQGQSASRLHEFLLRQFCDAHT 998
Query: 204 CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
C S+PFPKGGRKQ + FYL++ LT++LLP+FYLIV+L+Q
Sbjct: 999 CKNGSKPFPKGGRKQTNRFYLALCALTIVLLPLFYLIVFLHQ 1040
>gi|224060201|ref|XP_002300082.1| predicted protein [Populus trichocarpa]
gi|222847340|gb|EEE84887.1| predicted protein [Populus trichocarpa]
Length = 224
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 146/223 (65%), Positives = 185/223 (82%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
EVHCSRERSRAAW +I+EYLMPFVE+ +Y+ISS+CRLHP+ND++RDQEQHK+HVDINEWR
Sbjct: 1 EVHCSRERSRAAWKIIDEYLMPFVEKERYKISSRCRLHPENDLYRDQEQHKMHVDINEWR 60
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAA 163
CGYCKK+F EK+LD+HFDNRH +LLNVSH KCLAD+CG LHCD V++S ++KCNPAA
Sbjct: 61 CGYCKKTFYEEKYLDKHFDNRHYDLLNVSHSKCLADVCGVLHCDLVLDSAPHKTKCNPAA 120
Query: 164 VAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFY 223
+N+HLCESLA+ CFP+N+G SA RL+E FLRQFCDAH C +PF KGG+K+ S Y
Sbjct: 121 TGRNKHLCESLADSCFPVNEGTSALRLNEFFLRQFCDAHTCSGGRKPFSKGGKKETSTLY 180
Query: 224 LSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAK 266
+ IS+L LMLL +FY+ +YLY R ++ G+Q LK I + G+K +
Sbjct: 181 VIISVLVLMLLALFYIFMYLYLRGIKRGSQGLKHISKSGQKKR 223
>gi|357160391|ref|XP_003578749.1| PREDICTED: uncharacterized protein LOC100823211 [Brachypodium
distachyon]
Length = 274
Score = 328 bits (841), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/262 (66%), Positives = 204/262 (77%), Gaps = 4/262 (1%)
Query: 7 IFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPF 66
+ S LL+ S QEI + + SR Q HEVHCSRERSRAAW I+EYLMPF
Sbjct: 17 LAASCLLLVAGSHQEI----HEATGSRIHHQTNQHTHEVHCSRERSRAAWKAIDEYLMPF 72
Query: 67 VEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHN 126
VE+ +YE+ SKCRLH +NDMFR+QE+HKIH D+NEWRCG+CKKSFRAEK++DQHF NRHN
Sbjct: 73 VEKEKYELPSKCRLHAENDMFREQEEHKIHFDVNEWRCGFCKKSFRAEKYIDQHFSNRHN 132
Query: 127 NLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPS 186
NLL+ S G+CLADLCGALHCD M K +SKCN AA A+N HLCESLA+ CFP+NQG S
Sbjct: 133 NLLDNSQGRCLADLCGALHCDLTMEFKKPKSKCNAAAAARNCHLCESLADNCFPVNQGHS 192
Query: 187 ASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQR 246
ASRLHE FLRQFCDAH C S+PFPKGGRKQ + FYL++ ILTL+LLP+FYLIV+L+QR
Sbjct: 193 ASRLHEFFLRQFCDAHTCNGDSKPFPKGGRKQTNRFYLALCILTLLLLPLFYLIVFLHQR 252
Query: 247 EMRGGTQELKRIPRVGRKAKPS 268
EM+ G Q KRIP+ K KPS
Sbjct: 253 EMKKGVQVFKRIPKTVHKKKPS 274
>gi|449455886|ref|XP_004145681.1| PREDICTED: uncharacterized protein LOC101219582 [Cucumis sativus]
gi|449492886|ref|XP_004159131.1| PREDICTED: uncharacterized LOC101219582 [Cucumis sativus]
Length = 281
Score = 327 bits (838), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 160/253 (63%), Positives = 197/253 (77%), Gaps = 3/253 (1%)
Query: 19 PQEILIQGFD--DSASR-TLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEIS 75
P + L QG + DSAS T KQ+ AHEVHCSRERSRAAW VIEEYLMPFVE+ +Y+IS
Sbjct: 29 PLQSLNQGSNSKDSASESTPKQDWNNAHEVHCSRERSRAAWKVIEEYLMPFVEKKKYKIS 88
Query: 76 SKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK 135
+KCRLHPDNDMFRDQEQHK H+D N+W+CGYC+K F EK++DQHFDNRH NLLNVS +
Sbjct: 89 TKCRLHPDNDMFRDQEQHKSHLDFNDWKCGYCRKRFYEEKYIDQHFDNRHYNLLNVSRNR 148
Query: 136 CLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFL 195
CLADLCGALHCD V+++ S +SKCNPAA A+N+H+CE LA+ CFP+++G AS LHE FL
Sbjct: 149 CLADLCGALHCDHVIDAVSQKSKCNPAAAARNKHMCEGLADSCFPVDEGALASHLHEFFL 208
Query: 196 RQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQEL 255
QFCDAH C K +PF +G + + SVFY+ IS+LT++ + FY+ YLY R MR Q L
Sbjct: 209 HQFCDAHTCSGKPKPFSRGRQVRRSVFYIVISVLTILFVMFFYVFFYLYNRGMRTRPQVL 268
Query: 256 KRIPRVGRKAKPS 268
KR+ + GRK KPS
Sbjct: 269 KRLSQSGRKKKPS 281
>gi|255540381|ref|XP_002511255.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223550370|gb|EEF51857.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 217
Score = 325 bits (832), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 147/191 (76%), Positives = 172/191 (90%)
Query: 30 SASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRD 89
SA RT K++ AHEVHCSRERSRAAW +IEEYLMPFVE+ +Y+ISSKCRLHP+ND+FRD
Sbjct: 6 SAFRTGKKDKEHAHEVHCSRERSRAAWQIIEEYLMPFVERERYQISSKCRLHPENDLFRD 65
Query: 90 QEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFV 149
QE+HKIHVDINEW+CGYCKKSFRAE+FLDQHF+NRH NLLN+S GKCLADLCGALHCDFV
Sbjct: 66 QEEHKIHVDINEWKCGYCKKSFRAERFLDQHFENRHYNLLNISDGKCLADLCGALHCDFV 125
Query: 150 MNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQ 209
+N+K+ ++KCNPAAVAKNRHLCESLA+ CFP+NQGPSASRLHELF+ QFCDAH C K +
Sbjct: 126 LNTKTMKTKCNPAAVAKNRHLCESLADSCFPLNQGPSASRLHELFVHQFCDAHTCSGKRK 185
Query: 210 PFPKGGRKQPS 220
FPKGG++ +
Sbjct: 186 LFPKGGKRNEA 196
>gi|297805600|ref|XP_002870684.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297316520|gb|EFH46943.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 271
Score = 324 bits (830), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 149/244 (61%), Positives = 192/244 (78%), Gaps = 8/244 (3%)
Query: 24 IQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
++GF D G HE+HCSRERSR AW +I+EYLMP+VE+ +Y++ S CR+H D
Sbjct: 35 LKGFKDPVD-------GSFHEIHCSRERSRVAWKIIQEYLMPYVEKERYQLPSSCRVHRD 87
Query: 84 NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
ND++R+QE+HK+H DINEWRCG+CKK+F EK+LD+HFD+RH NLLN SHGKCLADLCGA
Sbjct: 88 NDIYREQEEHKVHSDINEWRCGFCKKAFYEEKYLDKHFDSRHYNLLNASHGKCLADLCGA 147
Query: 144 LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK 203
LHCD V+N+ +SKCNPAA A+NRHLCESLAN CFP+N+GPSA+RLH+ FLRQFCDAH
Sbjct: 148 LHCDLVVNTAQLKSKCNPAASARNRHLCESLANSCFPVNKGPSANRLHDFFLRQFCDAHT 207
Query: 204 CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGR 263
C S+PF + +K+ + Y+ ISI TL++L ++Y VYL+QR ++ G+QELKRI R G
Sbjct: 208 CSGGSRPFSQKPKKRGKL-YIIISISTLIVLLLYYSFVYLFQRGLKRGSQELKRIRRNGL 266
Query: 264 KAKP 267
K KP
Sbjct: 267 KKKP 270
>gi|356567332|ref|XP_003551875.1| PREDICTED: uncharacterized protein LOC100527694 [Glycine max]
Length = 268
Score = 320 bits (821), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 164/263 (62%), Positives = 202/263 (76%), Gaps = 7/263 (2%)
Query: 11 IFLL-----LLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
+FLL L FS Q D +A RTL QE G H++HCSRERSR AW +I+EYLMP
Sbjct: 8 LFLLCIVCGLYFSLQ-FTTGHADSNAGRTLNQEGG-DHQIHCSRERSRTAWKIIQEYLMP 65
Query: 66 FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
FVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF EK LDQHFDNRH
Sbjct: 66 FVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDNRH 125
Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP 185
+NLLN+S +C+AD+CGALHCD MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +GP
Sbjct: 126 SNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTEGP 185
Query: 186 SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
+ASRLHE FL QFCDAH C +PF +G RK+ +VFY+ +SIL ++LL ++YL +YLYQ
Sbjct: 186 AASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYLYQ 245
Query: 246 REMRGGTQELKRIPRVGRKAKPS 268
R M+ TQ LKR+ + RK KPS
Sbjct: 246 RGMKRETQVLKRVSQASRKKKPS 268
>gi|10177309|dbj|BAB10570.1| unnamed protein product [Arabidopsis thaliana]
Length = 277
Score = 317 bits (811), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 153/253 (60%), Positives = 186/253 (73%), Gaps = 9/253 (3%)
Query: 25 QGFDDSASRTLKQEP---GRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLH 81
QGF++S S L E A E+HCSRERSRAAW +I++YL PFVE+ +YEI CRLH
Sbjct: 25 QGFEESESTRLVNEEVEVSNAPEIHCSRERSRAAWQIIQDYLTPFVERERYEIPKNCRLH 84
Query: 82 PDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLC 141
PDND++RDQE HK+HVD+ EW+CGYCKKSF EKFLD+HF RH NLLN + KCLADLC
Sbjct: 85 PDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEKFLDKHFSTRHYNLLNTTDTKCLADLC 144
Query: 142 GALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDA 201
GALHCDFV++SK +SKCNP AVAKNRHLCES+AN CFP++QGPSASRLHE FLRQFCDA
Sbjct: 145 GALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVANSCFPVSQGPSASRLHEHFLRQFCDA 204
Query: 202 HKCPRKSQPFPKGGR----KQPSVFYLSISILTLMLLPIFYLIVY--LYQREMRGGTQEL 255
H C +PFP+GG+ K YL + + + IF I ++ RE R GTQ+L
Sbjct: 205 HTCTGNDKPFPRGGKSSYIKGTVWCYLFLFSIIALSSEIFGHIAVSTVFNREKRSGTQDL 264
Query: 256 KRIPRVGRKAKPS 268
+RI + G+K KPS
Sbjct: 265 RRIIKSGKKTKPS 277
>gi|255632970|gb|ACU16839.1| unknown [Glycine max]
Length = 266
Score = 315 bits (807), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/259 (62%), Positives = 199/259 (76%), Gaps = 7/259 (2%)
Query: 11 IFLL-----LLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
+FLL L FS Q D +A RTL QE G H++HCSRERSR AW +I+EYLMP
Sbjct: 8 LFLLCIVCGLYFSLQ-FTTGHADSNAGRTLNQEGG-DHQIHCSRERSRTAWKIIQEYLMP 65
Query: 66 FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
FVE+ +Y IS +CRLHPDND++RDQEQHK H DINEW+CG+CKKSF EK LDQHFDNRH
Sbjct: 66 FVEKEKYHISKRCRLHPDNDIYRDQEQHKFHTDINEWQCGFCKKSFYEEKHLDQHFDNRH 125
Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP 185
+NLLN+S +C+AD+CGALHCD MNS S +SKCNPAA AKN+HLCESLA+ CFP+ +GP
Sbjct: 126 SNLLNLSESQCIADVCGALHCDHEMNSGSKKSKCNPAAAAKNKHLCESLADSCFPVTEGP 185
Query: 186 SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
+ASRLHE FL QFCDAH C +PF +G RK+ +VFY+ +SIL ++LL ++YL +YLYQ
Sbjct: 186 AASRLHEFFLHQFCDAHSCTGNRKPFSRGRRKKTNVFYIFVSILLVILLLLYYLYIYLYQ 245
Query: 246 REMRGGTQELKRIPRVGRK 264
R M+ TQ LKR+ + RK
Sbjct: 246 RGMKRETQVLKRVSQASRK 264
>gi|356527071|ref|XP_003532137.1| PREDICTED: uncharacterized protein LOC100792257 [Glycine max]
Length = 271
Score = 310 bits (795), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 155/239 (64%), Positives = 192/239 (80%), Gaps = 1/239 (0%)
Query: 30 SASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRD 89
+A RTL QE G H++HCSRERSR AW +I+EYLMPFVE+ +Y IS +C LHPDND++RD
Sbjct: 34 NAGRTLNQEGG-DHQIHCSRERSRTAWKIIQEYLMPFVEKEKYHISKRCMLHPDNDIYRD 92
Query: 90 QEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFV 149
QEQHK H+DINEW+C YCKKSF EK LDQHFDNRH+NLLN+S +CLAD+CGALHCD
Sbjct: 93 QEQHKSHIDINEWQCRYCKKSFYEEKHLDQHFDNRHSNLLNLSESQCLADVCGALHCDHE 152
Query: 150 MNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQ 209
MNS S +SKCNPAA AK++HLCESLA+ CFP++ GP+AS+LHE FL QFCDAH C +
Sbjct: 153 MNSGSKKSKCNPAAAAKHKHLCESLADSCFPVSDGPAASQLHEFFLHQFCDAHSCTGSRK 212
Query: 210 PFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKPS 268
PF +G RK+ +VFY+ +SIL ++LL ++YL +YLYQR M+ TQ LKRI + RK KPS
Sbjct: 213 PFSRGHRKKTNVFYIFVSILLVILLLLYYLYIYLYQRGMKRETQVLKRISQASRKKKPS 271
>gi|255571984|ref|XP_002526933.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223533685|gb|EEF35420.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 240
Score = 307 bits (786), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 138/205 (67%), Positives = 166/205 (80%)
Query: 8 FTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFV 67
T+ LL S L D +ASR L+ E G AHEVHCSRERSRAAW +IEEYLMPFV
Sbjct: 16 LTNSLHLLTLSCSTSLQGSKDYAASRALQHEQGNAHEVHCSRERSRAAWKIIEEYLMPFV 75
Query: 68 EQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 127
E+ Y++S+ CRLHPDND++RDQEQHKIH D NEWRCGYC+KSF E++L++HFDNRH N
Sbjct: 76 EKEHYQVSTGCRLHPDNDLYRDQEQHKIHEDGNEWRCGYCRKSFYEERYLNKHFDNRHYN 135
Query: 128 LLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSA 187
LLNVSH +CLAD+CGALHCDFVM+S + ++KCNPAA A+N+HLCESLAN CFP+N+GPSA
Sbjct: 136 LLNVSHSRCLADVCGALHCDFVMDSATRKTKCNPAAAARNKHLCESLANSCFPVNEGPSA 195
Query: 188 SRLHELFLRQFCDAHKCPRKSQPFP 212
RL+ELFL QFCDAH C + +PFP
Sbjct: 196 HRLNELFLHQFCDAHTCTGRQKPFP 220
>gi|294462744|gb|ADE76916.1| unknown [Picea sitchensis]
Length = 277
Score = 306 bits (784), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 144/259 (55%), Positives = 185/259 (71%), Gaps = 3/259 (1%)
Query: 7 IFTSIFLLLLFSPQEILIQGFDD-SASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
+ +FL ++ P L Q + +A R + ++E HCSRERSR AW +IEEYLMP
Sbjct: 18 LLCVVFLCII--PVGGLAQELQEFAAVRAQQDSVVHSYETHCSRERSRTAWKIIEEYLMP 75
Query: 66 FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
FVEQ Y IS CRLH + DMFR+QE HK H+ +NEW+CG+C+K F +EK LDQHFDNRH
Sbjct: 76 FVEQENYNISRTCRLHLEIDMFREQEPHKDHLQVNEWQCGFCRKIFISEKLLDQHFDNRH 135
Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP 185
+NLLN+S +CLAD+CGALHCD + SK ++KCNPAA KN HLCESLAN CFP + P
Sbjct: 136 HNLLNISRSRCLADMCGALHCDAMDKSKPIKTKCNPAAADKNCHLCESLANSCFPPQESP 195
Query: 186 SASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQ 245
SA RLH+ FLRQFCDAH C R SQPF +G K ++ Y ++ +L +M+LPIFY IVY++Q
Sbjct: 196 SARRLHDFFLRQFCDAHTCKRGSQPFARGSGKTTNLLYYAVCLLLMMMLPIFYCIVYVHQ 255
Query: 246 REMRGGTQELKRIPRVGRK 264
REM + KRIP++ +K
Sbjct: 256 REMGKHIKNFKRIPKMNQK 274
>gi|15237477|ref|NP_198887.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
gi|9758099|dbj|BAB08543.1| unnamed protein product [Arabidopsis thaliana]
gi|90962968|gb|ABE02408.1| At5g40710 [Arabidopsis thaliana]
gi|332007203|gb|AED94586.1| C2H2 type zinc finger protein [Arabidopsis thaliana]
Length = 272
Score = 305 bits (782), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 150/280 (53%), Positives = 201/280 (71%), Gaps = 22/280 (7%)
Query: 1 MGRNTQIFTSIFLLLLFS----------PQEIL---IQGFDDSASRTLKQEPGRAHEVHC 47
MGR+ S ++LL F P I+ ++GF D E G HE+HC
Sbjct: 1 MGRSNSQCPSHWILLCFFISSQFWGFSLPTSIIQQNLEGFKDP-------EDG-FHEIHC 52
Query: 48 SRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYC 107
SRERSR AW +I+EYLMP+VE+ +Y++ S CR+H DND++R+QE+HK+ DINEWRCG+C
Sbjct: 53 SRERSRVAWKIIQEYLMPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFC 112
Query: 108 KKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKN 167
KK+F EK+LD+HFD+RH NLLN SHGKCL+DLCGALHCD V+++ +SKCNPAA AKN
Sbjct: 113 KKAFYEEKYLDKHFDSRHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKN 172
Query: 168 RHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSIS 227
RHLCESLAN CFP+N+G SA+RLH+ FLRQFCDAH C S+P + +K+ S+ Y+ S
Sbjct: 173 RHLCESLANSCFPVNKGSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFS 231
Query: 228 ILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKP 267
I+ L++L ++Y VYL++R ++ +Q+LKRI G K KP
Sbjct: 232 IIVLVVLLLYYSFVYLFRRGLKRRSQDLKRIRHNGLKKKP 271
>gi|26452757|dbj|BAC43459.1| putative zinc finger protein [Arabidopsis thaliana]
Length = 272
Score = 304 bits (778), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 149/280 (53%), Positives = 201/280 (71%), Gaps = 22/280 (7%)
Query: 1 MGRNTQIFTSIFLLLLFS----------PQEIL---IQGFDDSASRTLKQEPGRAHEVHC 47
MGR+ S ++LL F P I+ ++GF D E G H++HC
Sbjct: 1 MGRSNSQCPSHWILLCFFISSQFWGFSLPTSIIQQNLEGFKDP-------EDG-FHKIHC 52
Query: 48 SRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYC 107
SRERSR AW +I+EYLMP+VE+ +Y++ S CR+H DND++R+QE+HK+ DINEWRCG+C
Sbjct: 53 SRERSRVAWKIIQEYLMPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFC 112
Query: 108 KKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKN 167
KK+F EK+LD+HFD+RH NLLN SHGKCL+DLCGALHCD V+++ +SKCNPAA AKN
Sbjct: 113 KKAFYEEKYLDKHFDSRHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKN 172
Query: 168 RHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSIS 227
RHLCESLAN CFP+N+G SA+RLH+ FLRQFCDAH C S+P + +K+ S+ Y+ S
Sbjct: 173 RHLCESLANSCFPVNKGSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFS 231
Query: 228 ILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKAKP 267
I+ L++L ++Y VYL++R ++ +Q+LKRI G K KP
Sbjct: 232 IIVLVVLLLYYSFVYLFRRGLKRRSQDLKRIRHNGLKKKP 271
>gi|357477231|ref|XP_003608901.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
gi|355509956|gb|AES91098.1| hypothetical protein MTR_4g104230 [Medicago truncatula]
Length = 241
Score = 301 bits (770), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 145/201 (72%), Positives = 163/201 (81%), Gaps = 2/201 (0%)
Query: 28 DDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMF 87
+ S R L QE A EVHCSRERSR A VIEEYL PFVE+ Y++S KCRLHP+NDMF
Sbjct: 36 ESSGERNLNQEHEHAPEVHCSRERSRVASKVIEEYLTPFVEKENYQLSRKCRLHPENDMF 95
Query: 88 RDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCD 147
RDQE+H+I++D NEWRCGYCKKSFR EKFLDQH DNRH NLLN+SHGKCLADLCGALHCD
Sbjct: 96 RDQEEHRIYLDRNEWRCGYCKKSFREEKFLDQHLDNRHCNLLNLSHGKCLADLCGALHCD 155
Query: 148 FVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRK 207
V NSKSSRSKCNPAA A+NRHLCESLA+ CFPI+ G SASRLHELFL QFCDAH C K
Sbjct: 156 AVTNSKSSRSKCNPAAAARNRHLCESLADSCFPISGGLSASRLHELFLHQFCDAHSCSGK 215
Query: 208 SQPFPKGGRKQPSVFYLSISI 228
+PF +GG+ +P YL+ I
Sbjct: 216 HKPFSRGGKVRPE--YLTGEI 234
>gi|388514801|gb|AFK45462.1| unknown [Lotus japonicus]
Length = 263
Score = 272 bits (695), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 130/218 (59%), Positives = 159/218 (72%), Gaps = 12/218 (5%)
Query: 27 FDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDM 86
+ RTL+QE H++HCSRERSR AW +I+EYLMPFVE+ +Y IS +CRLHPDND+
Sbjct: 43 INSDVGRTLQQEQDGNHQIHCSRERSRVAWKIIQEYLMPFVEKEKYHISKRCRLHPDNDI 102
Query: 87 FRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHC 146
+RDQE HK H+DINEW+CG+CKKSF EK LDQHFDNRH+NLLN S +CLAD+CGALHC
Sbjct: 103 YRDQEDHKSHIDINEWQCGFCKKSFYEEKHLDQHFDNRHSNLLNSSESRCLADVCGALHC 162
Query: 147 DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPR 206
D +NS S +SKCNPAA A+N+HLCESLA+ CFP+N+GP+ASRLHE FL QFCD +
Sbjct: 163 DLEINSGSKKSKCNPAAAARNKHLCESLADSCFPVNEGPAASRLHEFFLHQFCDHTAVLQ 222
Query: 207 KSQPFPKG-GRKQ------PSVF-----YLSISILTLM 232
P+G GR Q P F Y +IS T +
Sbjct: 223 VGNLSPEGAGRGQMCSTSWPPFFLWCCCYFTISTFTCI 260
>gi|449529367|ref|XP_004171671.1| PREDICTED: uncharacterized protein LOC101223370, partial [Cucumis
sativus]
Length = 203
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 120/168 (71%), Positives = 145/168 (86%), Gaps = 1/168 (0%)
Query: 25 QGFDDSAS-RTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPD 83
Q + SA+ R L+Q EVHCSRERSR AW +IEE+L+PF+E+ YE+S++CRLHP+
Sbjct: 36 QDEEQSATLRPLEQNEEHVDEVHCSRERSRTAWNIIEEHLLPFMEKENYEVSTQCRLHPN 95
Query: 84 NDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGA 143
ND+FRDQEQHKIH+DIN W+CGYC+KSFRAEKFLD+HFDNRH+NLLNVSHGKCLADLCGA
Sbjct: 96 NDLFRDQEQHKIHLDINHWQCGYCRKSFRAEKFLDKHFDNRHSNLLNVSHGKCLADLCGA 155
Query: 144 LHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLH 191
LHCD M+ KS +SKC PAA A+N+HLCESLA+ CFPIN+GPSA+RLH
Sbjct: 156 LHCDLKMDIKSRKSKCKPAAAARNKHLCESLADSCFPINEGPSANRLH 203
>gi|147776347|emb|CAN76470.1| hypothetical protein VITISV_016787 [Vitis vinifera]
Length = 483
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 113/156 (72%), Positives = 131/156 (83%)
Query: 59 IEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLD 118
+YLMPFVE+ Y+ISS CRLH +ND++RDQEQHKIHVDINEWRCGYCKKSF AE +LD
Sbjct: 33 TNQYLMPFVEKEGYQISSSCRLHHENDLYRDQEQHKIHVDINEWRCGYCKKSFYAENYLD 92
Query: 119 QHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRC 178
+HFDNRH NLLNVS KCLAD+CGALHCD VM+SK ++KCNPAA A+N HLCESLA+ C
Sbjct: 93 KHFDNRHYNLLNVSQNKCLADVCGALHCDHVMDSKLQKTKCNPAAAARNHHLCESLADGC 152
Query: 179 FPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKG 214
FP++ GPSASRLHE FLRQFCDAH C +PF +G
Sbjct: 153 FPVSGGPSASRLHEFFLRQFCDAHTCTGGRKPFSRG 188
>gi|413916780|gb|AFW56712.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
Length = 222
Score = 253 bits (645), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 118/172 (68%), Positives = 139/172 (80%), Gaps = 1/172 (0%)
Query: 31 ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
SR L+ P AHEVHCSRERSRAAW I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39 GSRILQTNPD-AHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 97
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 98 EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 157
Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAH 202
K +SKCN AA +NRHLCESLA+ CFPINQG +ASRLH + C+ H
Sbjct: 158 QFKKPKSKCNAAAATRNRHLCESLADSCFPINQGLAASRLHGKWNVSECNDH 209
>gi|357436185|ref|XP_003588368.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
gi|355477416|gb|AES58619.1| hypothetical protein MTR_1g006480 [Medicago truncatula]
Length = 306
Score = 251 bits (640), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 108/177 (61%), Positives = 139/177 (78%)
Query: 40 GRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDI 99
R + +++ S A +EYL PFVE+ +Y IS +CRLHPDND++RDQEQHK H+DI
Sbjct: 121 ARVFQCSSTKQESSNAQARRQEYLTPFVEKEKYNISRRCRLHPDNDIYRDQEQHKSHIDI 180
Query: 100 NEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKC 159
NEW+CGYCKK+F EK LDQHFDNRH+NLLN+ +CLAD+CGALHCD +NS S +SKC
Sbjct: 181 NEWQCGYCKKTFYEEKHLDQHFDNRHSNLLNLKESRCLADVCGALHCDHEINSGSKKSKC 240
Query: 160 NPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGR 216
NPAA A+N+H+CE++A+ CFP+N+GP+ASRLHE FL QFCDAH C + +PF +G R
Sbjct: 241 NPAAAARNKHICETIADSCFPVNEGPAASRLHEFFLHQFCDAHSCTGRGKPFSRGRR 297
>gi|168063136|ref|XP_001783530.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664965|gb|EDQ51666.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 223 bits (568), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 117/261 (44%), Positives = 160/261 (61%), Gaps = 13/261 (4%)
Query: 5 TQIFTSIFLLLLFSPQEILIQGF--DDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEY 62
+ S+ L+L S F + R L++E GR VHCSR RSR ++ EY
Sbjct: 117 AHVLISLALVLCLSSFTACADKFRVSEERQRQLQEEKGR---VHCSRSRSRTVRAIVSEY 173
Query: 63 LMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFD 122
LMPFV+ +Y + CRLHPDND++ +QE++ I + +W+CGYCKK FR + FLD HFD
Sbjct: 174 LMPFVKDQKYTLPKTCRLHPDNDVYHEQEENMIELRPMQWQCGYCKKIFRNQGFLDLHFD 233
Query: 123 NRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR----SKCNPAAVAKNRHLCESLANRC 178
NRH+ L S CLAD CGALHCD+ NS SS+ + C A V KNRH CE LAN C
Sbjct: 234 NRHSEKLEPSSNSCLADACGALHCDY-YNSLSSKPQIQATCKLAVVEKNRHSCEVLANTC 292
Query: 179 FPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGG--RKQPSVFYLSISILTLMLLPI 236
FP + P+A RL+ F RQFCDAH C +K + FP G + S+ Y ++++ T+++L I
Sbjct: 293 FPADHSPAAQRLNHFFKRQFCDAHTCKKKLKIFPSGSGINRNKSLIY-ALTLFTVIVLAI 351
Query: 237 FYLIVYLYQREMRGGTQELKR 257
FY I+YL +R+ + L+R
Sbjct: 352 FYAILYLIKRDTNLLRKGLRR 372
>gi|413916778|gb|AFW56710.1| hypothetical protein ZEAMMB73_479170 [Zea mays]
Length = 177
Score = 216 bits (551), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 99/140 (70%), Positives = 116/140 (82%), Gaps = 1/140 (0%)
Query: 31 ASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQ 90
SR L+ P AHEVHCSRERSRAAW I+EYLMPFVE+ +YE+ SKCRL PDNDMFR+Q
Sbjct: 39 GSRILQTNPD-AHEVHCSRERSRAAWEAIDEYLMPFVEKEKYELPSKCRLRPDNDMFREQ 97
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
EQHKIH DINEW CG+CKK+FRAEKF+DQHF+NRH NL++ S G+C+ADLCGALHCD +M
Sbjct: 98 EQHKIHFDINEWHCGFCKKAFRAEKFIDQHFENRHKNLVDNSEGRCMADLCGALHCDLMM 157
Query: 151 NSKSSRSKCNPAAVAKNRHL 170
K +SKCN AA +NRHL
Sbjct: 158 QFKKPKSKCNAAAATRNRHL 177
>gi|302808117|ref|XP_002985753.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
gi|300146662|gb|EFJ13331.1| hypothetical protein SELMODRAFT_47592 [Selaginella moellendorffii]
Length = 171
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 97/165 (58%), Positives = 120/165 (72%), Gaps = 4/165 (2%)
Query: 43 HEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEW 102
H V CSR RSR A ++EEYL+PFVE Y +S +CRLH ND FR+QE+ K I++W
Sbjct: 3 HGVQCSRARSRTARHILEEYLLPFVENENYALSPECRLHASNDAFREQEREKQFYHIHDW 62
Query: 103 RCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKC 159
RCGYC K F +E++LD HFDNRH+ LNVS CLADLCGALHCD++ K S++KC
Sbjct: 63 RCGYCHKIFESEEYLDLHFDNRHSETLNVSRHNCLADLCGALHCDYMETKDKHKFSKNKC 122
Query: 160 NPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
+P +V +NRHLCE LAN CFP QG A+RL++ FLRQFCDAH C
Sbjct: 123 SP-SVDRNRHLCEKLANSCFPPQQGAQATRLNDFFLRQFCDAHSC 166
>gi|302785357|ref|XP_002974450.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
gi|300158048|gb|EFJ24672.1| hypothetical protein SELMODRAFT_57652 [Selaginella moellendorffii]
Length = 171
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 96/165 (58%), Positives = 120/165 (72%), Gaps = 4/165 (2%)
Query: 43 HEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEW 102
H V CSR RSR A ++EEYL+PFVE Y +S +CRLH ND FR+QE+ K I++W
Sbjct: 3 HGVQCSRARSRTARHILEEYLLPFVENENYALSPQCRLHASNDAFREQEREKQFYHIHDW 62
Query: 103 RCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNS---KSSRSKC 159
RCGYC K F +E++LD HFDNRH+ LNVS CLAD+CGALHCD++ K S++KC
Sbjct: 63 RCGYCHKIFESEEYLDLHFDNRHSETLNVSRDNCLADVCGALHCDYMETKDKHKFSKNKC 122
Query: 160 NPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
+P +V +NRHLCE LAN CFP QG A+RL++ FLRQFCDAH C
Sbjct: 123 SP-SVDRNRHLCEKLANSCFPPQQGAQATRLNDFFLRQFCDAHSC 166
>gi|168040762|ref|XP_001772862.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675773|gb|EDQ62264.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 235
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 69/132 (52%), Positives = 90/132 (68%), Gaps = 4/132 (3%)
Query: 45 VHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRC 104
VHCSR RSR ++ EYL PFVE +Y + CRLHP ND++R+QE++ + +W+C
Sbjct: 50 VHCSRSRSRTVREIVHEYLTPFVEAEKYTLPESCRLHPANDIYREQEENMDELRPMQWQC 109
Query: 105 GYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSK----CN 160
YCKK FR + FLD HFDNRH+NLL+ S CLAD CGALHCD+ + +S+S+ C
Sbjct: 110 RYCKKIFRNQGFLDTHFDNRHSNLLDKSLNMCLADACGALHCDYYNSLSTSKSQKQSTCK 169
Query: 161 PAAVAKNRHLCE 172
PA V +NRH CE
Sbjct: 170 PAVVERNRHSCE 181
>gi|428163164|gb|EKX32251.1| hypothetical protein GUITHDRAFT_91074 [Guillardia theta CCMP2712]
Length = 297
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 118/255 (46%), Gaps = 40/255 (15%)
Query: 23 LIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHP 82
+ G +DS SR +C+R RSR A ++ E P V ++ +++ C
Sbjct: 23 FLHGAEDSDSR----------HANCNRGRSRLARNILREEFYPHVRKLGFQLPPGCPFSE 72
Query: 83 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCG 142
+ DMF D E+HK V N W+C YC K F++E +L+ H+ NRH + G CLAD C
Sbjct: 73 ERDMFLDNEKHKKEVRFNNWKCLYCNKVFKSESYLEHHYVNRHPETIK-EDGFCLADYCD 131
Query: 143 ALHCDFVMNS-------KSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFL 195
L CD N K+S +C+ + K RH C S+ ++CFP + +A+RLH F
Sbjct: 132 VLECDMQENVIGSEGMFKTSSYRCDAKRMQKRRHRCHSVLDKCFPPHTSDAANRLHHEFE 191
Query: 196 RQFCDAHKCPRKSQPFPKGGRKQPSVFYLS----------------------ISILTLML 233
+ +C+ C R+ ++ S +L ++
Sbjct: 192 KLYCEHLSCEASPDGLGDVARRPHTLLSQSKARSEYHMKKTKHSPWRKLAIVFGVLFAII 251
Query: 234 LPIFYLIVYLYQREM 248
L IFYL V LY+++M
Sbjct: 252 LFIFYLGVCLYRKDM 266
>gi|255077084|ref|XP_002502194.1| predicted protein [Micromonas sp. RCC299]
gi|226517459|gb|ACO63452.1| predicted protein [Micromonas sp. RCC299]
Length = 301
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/222 (31%), Positives = 105/222 (47%), Gaps = 19/222 (8%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSRERSR A GV++++ P ++ +Y + C DM+ + E+HK V +++ Y
Sbjct: 41 CSRERSRHARGVLDDFFHPVADRNRYRLPPNCPFDRSKDMYLEHEKHKEVVRRTQYKSLY 100
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCD-----------FVMNSKSS 155
K+F E ++D+H DNRH + + CLAD C L CD + S +S
Sbjct: 101 SDKTFYNEYYVDKHMDNRHMDKVPPGADVCLADYCEVLRCDEHQAWKHPELRKTVGSSAS 160
Query: 156 RSKCNPAAVAKNRHLCESLANRCFPINQGP-----SASRLHELFLRQFCDAHKCPRKSQP 210
RS+C+ + RH CE L + CFP GP +A+RL E F R CD C
Sbjct: 161 RSRCSQERMRGLRHFCEVLMSGCFPTGGGPDGDAGTAARLREYFTRHHCDMLTCDGVRDM 220
Query: 211 FPKGGRKQPSV---FYLSISILTLMLLPIFYLIVYLYQREMR 249
F G Y+ + I+ + +YL V+ R+ R
Sbjct: 221 FGVLGSHHAVAGGGGYVLVLIVVFAAVGCYYLFVWHAVRDGR 262
>gi|145350216|ref|XP_001419510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579742|gb|ABO97803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 270
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 112/224 (50%), Gaps = 15/224 (6%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRC 104
CSRERSRAA V+ E+ P + +Y + C D+F + E HK V ++ WR
Sbjct: 18 CSRERSRAARTVLNEFFYPVADGARYALDLGCPFSHARDLFYEHETHKEPVRHSKLYWRS 77
Query: 105 GYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSR-SKC 159
Y K F++E ++D+H + RH + + + CLAD C L CD +V N + S+ ++C
Sbjct: 78 LYSGKVFKSEYYVDKHMERRHGDKIPSTADVCLADYCDVLQCDKHAEYVKNGRGSKPTRC 137
Query: 160 NPAAVAKNRHLCESLANRCFPINQGPS---ASRLHELFLRQFCDAHKCPRKSQPFP--KG 214
+ ++ R C + +CFP G + A +L+ F + +CD C F
Sbjct: 138 DEEQMSLVRDKCHEMLTQCFPNENGSTRDNAGKLNVYFTKYYCDQMTCADVDGVFELMSA 197
Query: 215 GRKQPSV-FYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKR 257
P V +Y++++ L ++ ++YL VY + R + ++KR
Sbjct: 198 HHDSPGVGYYIALAFL-FFIVAMYYLTVYSWTNGRR-VSSDIKR 239
>gi|443688435|gb|ELT91124.1| hypothetical protein CAPTEDRAFT_220314 [Capitella teleta]
Length = 284
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 95/212 (44%), Gaps = 15/212 (7%)
Query: 42 AHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE 101
A C R+RSR + E ++P + + Q S C L D DM+ +QEQ K ++
Sbjct: 36 AISAECPRDRSRIVRNIFIEDVLPILAEFQVTASESCPLSADRDMYAEQEQRKFMESSSK 95
Query: 102 WRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSHGKCLADLCGALHCDFVMNSKSS----R 156
W C +C KSF E+ LD HF+NRH N+L C AD C CD + + +
Sbjct: 96 WTCNFCGKSFYDERSLDNHFENRHPKNILTGPRATCPADFCDIFRCDVISGAHRAGYWEV 155
Query: 157 SKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQF----CDAHKCPRKSQPFP 212
+ C A + H CE L C P N P A+R +L L +F C C + P
Sbjct: 156 ALCRDADLIHTHHRCEDLLRNCLPSNI-PHATR--DLILARFEKAICGYLTCDMFWE-IP 211
Query: 213 KGGRKQP--SVFYLSISILTLMLLPIFYLIVY 242
+ P + Y+ + IL L I+Y Y
Sbjct: 212 ENRVSNPVGTAIYIVLFILLSFGLLIYYYAAY 243
>gi|168022196|ref|XP_001763626.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685119|gb|EDQ71516.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 285
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/128 (40%), Positives = 69/128 (53%), Gaps = 12/128 (9%)
Query: 45 VHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRC 104
VHCSR RSR A I EYLM +V +Y + C H DND R+QE+ + + EW+C
Sbjct: 67 VHCSRSRSRTARTTIFEYLMLYVHSQKYTLPKNCHFHFDNDFSREQEKSMDELGLTEWQC 126
Query: 105 GYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAV 164
CK FR+ +FLD HFDNRH+ L+ CL+ + ++ C PA +
Sbjct: 127 RCCKIVFRSREFLDTHFDNRHSEKLDTVVFCCLS------------SQPKKQATCKPAVM 174
Query: 165 AKNRHLCE 172
KNRH CE
Sbjct: 175 EKNRHSCE 182
>gi|195999976|ref|XP_002109856.1| hypothetical protein TRIADDRAFT_6027 [Trichoplax adhaerens]
gi|190587980|gb|EDV28022.1| hypothetical protein TRIADDRAFT_6027, partial [Trichoplax
adhaerens]
Length = 202
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 54/169 (31%), Positives = 83/169 (49%), Gaps = 12/169 (7%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR SR A +++ L ++ +Q ++S C HP+ D+FR QE+ K + + W C Y
Sbjct: 1 CSRINSRYARNILKRSLYDLIQSMQVQLSFDCPFHPERDLFRKQEELKDNAYQSSWTCSY 60
Query: 107 CKKSFRAEKFLDQHFDNRHNNLL-NVSHGKCLADLCGALHCDFVMNSKSSRSK------- 158
C K F E+FLDQH DNRH+ LL V + CLA+ C L CD ++ +K
Sbjct: 61 CGKWFYRERFLDQHLDNRHSALLGTVMNATCLANYCDILGCDLAHVQDTTLAKGNDLWWK 120
Query: 159 ---CNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
C + + R C +A +C P ++S + + + C C
Sbjct: 121 TALCRSTQMVELRDQCLQIAEQCTP-KSSKASSGVRNIIISNICSRLTC 168
>gi|156386657|ref|XP_001634028.1| predicted protein [Nematostella vectensis]
gi|156221106|gb|EDO41965.1| predicted protein [Nematostella vectensis]
Length = 257
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 100/237 (42%), Gaps = 21/237 (8%)
Query: 6 QIFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEEYLMP 65
+I+ +F LL+++P + G D HCSR SR + E+ P
Sbjct: 4 EIWPVVFRLLVWAPLYAFLVGADGE---------------HCSRYHSRLVRNALREHFFP 48
Query: 66 FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
+Q + ++ C LH +D+F E +K ++ W C +C K+F +E ++D H NRH
Sbjct: 49 AYDQHKVDVPLSCLLHETHDVFIRHEINKEEEHVSRWTCNFCGKAFYSEHYIDMHMQNRH 108
Query: 126 NNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSKCNPAAVAKNRHLCESLANRCFPI 181
+ L CLAD C C + + K R+ C +A + CE L + C P+
Sbjct: 109 KDFLIQDSSVCLADYCDVFRCVVLNDQKKEWFWERALCKHEKMAALKRRCEVLMHGCMPV 168
Query: 182 NQGPSAS--RLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 236
G S + +++E C C +P + + Y S +M L +
Sbjct: 169 ANGTSDTPYKVYETVWSNVCSLLTCKDYWKPPYEEMPTWRIIVYAVFSPFFVMALIV 225
>gi|281201704|gb|EFA75912.1| hypothetical protein PPL_10484 [Polysphondylium pallidum PN500]
Length = 353
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 101/208 (48%), Gaps = 14/208 (6%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CS E S +++ P V +Y++ C+L+P DM E KI++ + W+C
Sbjct: 126 CSSELSNKVRAILDNIFYPVVVDHKYQLPVNCQLNPSLDMLAYLETQKINL-YSIWKCRN 184
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSRSKCNPAAVAK 166
C K F E+FLD H N H +L+ + CL+D C L+C C+ + K
Sbjct: 185 CSKQFNTEEFLDLHLKNNHASLIPSNATVCLSDFCDMLNC-----KDDPVIACDNKRMDK 239
Query: 167 NRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC-------PRKSQPFPKGGRKQP 219
+H C+S+ CFP+NQ S L+ F + CD C P++S +
Sbjct: 240 LKHHCQSIMYNCFPLNQEQS-RMLNSHFTKNICDHLTCDGYKRHIPKQSDSNQSNSGRSW 298
Query: 220 SVFYLSISILTLMLLPIFYLIVYLYQRE 247
S+ +++T++++ +FYLIV Y++E
Sbjct: 299 SLMKYIAALITIIIIVLFYLIVCFYRKE 326
>gi|308807625|ref|XP_003081123.1| zinc finger (ISS) [Ostreococcus tauri]
gi|116059585|emb|CAL55292.1| zinc finger (ISS) [Ostreococcus tauri]
Length = 241
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 106/220 (48%), Gaps = 14/220 (6%)
Query: 58 VIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEK 115
++EE+ P E + EI++ C D+F + E+HK V ++ WR Y K F++E
Sbjct: 1 MLEEFFYPVAEGAKTEIAATCPFSRIRDVFYEHERHKERVRHSKQYWRSLYSGKVFKSEY 60
Query: 116 FLDQHFDNRHNNLLNVSHGKCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLC 171
++D+H + RH + + CLAD C L CD ++ + +C+ A +A + C
Sbjct: 61 YVDKHMERRHGDKIPPGANVCLADHCEVLQCDRHASYMNGERGKLVRCDAAHMADVQAEC 120
Query: 172 ESLANRCFPINQGPSASR---LHELFLRQFCDAHKCPRKSQPFPKGGRKQPSV---FYLS 225
+ CFP G +A R +H+ + +CD C F + +Y+S
Sbjct: 121 RKMLKSCFPHESGRAAERSRKVHDYLNKYYCDQLTCVGVGGVFELMSTHHETAGASYYVS 180
Query: 226 ISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRVGRKA 265
+ + ++L+ ++Y +VY + RG + ++KR R R +
Sbjct: 181 LGFV-MLLVALYYFLVYSWV-NGRGVSLDVKRAHRRARAS 218
>gi|170038289|ref|XP_001846984.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167881843|gb|EDS45226.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 616
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 94/183 (51%), Gaps = 4/183 (2%)
Query: 7 IFTSIFLLLLFSPQEILIQGFDDSASRTLKQEPG-RAHEVHCSRERSRAAWGVIEEYLMP 65
I T+++ + + P++ L+ ++ PG ++ CSR+ SR +++ +P
Sbjct: 62 IITTVYAVAVSPPRKPLMMSVIVICFLSIIPWPGPSVFKISCSRDSSRVVRKIVQSKWLP 121
Query: 66 FVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
+E+ Q ++ +C HP D+F Q+ K ++W CG+C KSF EK LD HF++RH
Sbjct: 122 ILEKYQVKLPLECPFHPLRDIFGPQQTAKKQHRPSQWTCGFCGKSFFEEKHLDMHFEHRH 181
Query: 126 NNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNPAAVAKNRHL-CESLANRCFPIN 182
+N++ CLAD C + C+ ++ ++ S +P ++ + + ES A R
Sbjct: 182 KGNINMAEDAICLADYCDMMRCEVLIAKDATLSFGVDPNMMSTDIEVWSESTAYRTALTT 241
Query: 183 QGP 185
GP
Sbjct: 242 SGP 244
>gi|298711019|emb|CBJ26414.1| C2H2 zinc finger protein [Ectocarpus siliculosus]
Length = 232
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 36/183 (19%)
Query: 42 AHEVHCSRERSRAAWGVIEEYLMPFVEQ-VQYEISSKCRLHPDNDMFRDQEQHK--IHVD 98
A+E CSRE SR A +I + + PF+++ I C LHP+ D+F+DQE HK +
Sbjct: 3 ANESSCSREHSRQARKIIAKSVYPFLDKDPALAIPLACPLHPELDVFKDQELHKRLVPGK 62
Query: 99 INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH------------GKCLADLCGALHC 146
++ ++CGYC+K F+ E ++D+H DN+H + LN++ G+CL DLC L C
Sbjct: 63 LDTFQCGYCQKQFKTEFYMDRHMDNKHADRLNLAGSDAIGHGIEPAPGRCLGDLCPELGC 122
Query: 147 -DFVMNSKSSR--------------------SKCNPAAVAKNRHLCESLANRCFPINQGP 185
D+ +S S C+PA + + R C +L + CF + G
Sbjct: 123 GDYAADSCSVEIHAGGGSSLGTGAGTIRRGCGACDPADMERRRRRCRALFHGCFDDSTGS 182
Query: 186 SAS 188
+ S
Sbjct: 183 AGS 185
>gi|312378286|gb|EFR24907.1| hypothetical protein AND_10215 [Anopheles darlingi]
Length = 630
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 78/145 (53%), Gaps = 3/145 (2%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ CSRE SR ++ +P +E+ Q ++ +C HP D+F Q+ K ++W
Sbjct: 45 KISCSRESSRVVRKIVHSRWLPILEKYQVKLPLECPFHPMRDIFGPQQSAKKQHRPSQWT 104
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS-KCNP 161
CG+C KSF E+ LD HF+NRH +N++ CLAD C + C+ ++ ++ + +P
Sbjct: 105 CGFCGKSFFEEQHLDMHFENRHKTNINMAEDAICLADYCDMMRCEVLIAKDATLAFGGDP 164
Query: 162 AAVAKNRHL-CESLANRCFPINQGP 185
AV+ + + E+ A R GP
Sbjct: 165 NAVSTDIEVWSEATAYRTALTTAGP 189
>gi|384247793|gb|EIE21279.1| hypothetical protein COCSUDRAFT_57182 [Coccomyxa subellipsoidea
C-169]
Length = 220
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/183 (28%), Positives = 89/183 (48%), Gaps = 19/183 (10%)
Query: 100 NEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVMNSKSSR--- 156
N + C CK+ F +E+ ++H+D +H+ ++ CLAD C LHCD + + +SR
Sbjct: 20 NRFVCSLCKERFSSEEAWERHYDRKHHIHIHPGRDVCLADYCEVLHCDKINLTPTSRQLR 79
Query: 157 ------------SKCNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKC 204
+ C +A + R C LA CFP + GP+A++LH LF+ C AH C
Sbjct: 80 AQQAGDPTLAPHAPCRESAAEQLRESCRDLAAACFPPDAGPAAAKLHRLFVEGVCAAHTC 139
Query: 205 PRKSQP---FPKGGRKQPSVFYLSISILTLMLLPIFYLIVYLYQREMRGGTQELKRIPRV 261
+ Q ++ ++ Y + L L + IFY +++Y + R G+ + R+
Sbjct: 140 DLELQQDLLVQLAASEKGTLGYKLLIGLALGAMAIFYACLWVYH-QYRSGSLSRGDVRRL 198
Query: 262 GRK 264
RK
Sbjct: 199 RRK 201
>gi|158296785|ref|XP_317135.4| AGAP008328-PA [Anopheles gambiae str. PEST]
gi|157014878|gb|EAA12581.4| AGAP008328-PA [Anopheles gambiae str. PEST]
Length = 410
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 85/179 (47%), Gaps = 17/179 (9%)
Query: 39 PG-RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
PG ++ CSRE SR ++ +P +E+ Q ++ +C HP D+F Q+ K
Sbjct: 20 PGPSVFKISCSRESSRVVRKIVHSRWLPILEKYQVKLPLECPFHPLRDIFGPQQSAKKQH 79
Query: 98 DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSR 156
++W CG+C KSF EK LD HF+NRH +N + CLAD C + C+ ++ ++
Sbjct: 80 RPSQWTCGFCGKSFFEEKHLDMHFENRHRTNINTAEDAVCLADYCDMMRCEVLIAKDATL 139
Query: 157 S-KCNPAAVAKNRHL-CESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPK 213
+ +P AV+ + + E+ A R GP D K P + PK
Sbjct: 140 AFGSDPNAVSTDIEVWSEATAYRTALSTSGPR-------------DVAKVPERKSLLPK 185
>gi|383857833|ref|XP_003704408.1| PREDICTED: uncharacterized protein LOC100876283 [Megachile
rotundata]
Length = 440
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ C R+R+ ++++ MP +++ Q E+ +C H D+FR Q++ K ++W
Sbjct: 25 KISCPRDRASVVRRIVQKRWMPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 84
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
CG C KSF AEK LD HFDNRH + +N + CLAD C + CD + N
Sbjct: 85 CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 133
>gi|66522619|ref|XP_393929.2| PREDICTED: hypothetical protein LOC410449 [Apis mellifera]
Length = 454
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ C R+R+ ++++ MP +++ Q E+ +C H D+FR Q++ K ++W
Sbjct: 25 KISCPRDRASVVRRIVQKRWMPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 84
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
CG C KSF AEK LD HFDNRH + +N + CLAD C + CD + N
Sbjct: 85 CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 133
>gi|380017517|ref|XP_003692701.1| PREDICTED: uncharacterized protein LOC100864708 [Apis florea]
Length = 454
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ C R+R+ ++++ MP +++ Q E+ +C H D+FR Q++ K ++W
Sbjct: 25 KISCPRDRASVVRRIVQKRWMPILKKHQVELPLECPFHESRDIFRPQQKAKHQHRSSQWT 84
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
CG C KSF AEK LD HFDNRH + +N + CLAD C + CD + N
Sbjct: 85 CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 133
>gi|340723433|ref|XP_003400094.1| PREDICTED: hypothetical protein LOC100643092 [Bombus terrestris]
Length = 453
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ C R+R+ ++++ MP +++ Q E+ +C H D+FR Q++ K ++W
Sbjct: 28 KISCPRDRASVVRRIVQKRWMPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 87
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
CG C KSF AEK LD HFDNRH + +N + CLAD C + CD + N
Sbjct: 88 CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 136
>gi|350427747|ref|XP_003494865.1| PREDICTED: hypothetical protein LOC100742329 [Bombus impatiens]
Length = 450
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 64/109 (58%), Gaps = 1/109 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ C R+R+ ++++ MP +++ Q E+ +C H D+FR Q++ K ++W
Sbjct: 28 KISCPRDRASVVRRIVQKRWMPILKRHQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 87
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
CG C KSF AEK LD HFDNRH + +N + CLAD C + CD + N
Sbjct: 88 CGLCGKSFYAEKHLDAHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 136
>gi|307188422|gb|EFN73179.1| hypothetical protein EAG_05305 [Camponotus floridanus]
Length = 441
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 63/109 (57%), Gaps = 1/109 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ C R+R+ ++ + MP +++ Q E+ +C H D+FR Q++ K ++W
Sbjct: 19 KISCPRDRASVVRRIVHKRWMPILKKYQVELPLECPFHESRDIFRPQQKAKHQHRPSQWT 78
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
CG C KSF AEK LD HFDNRH + +N + CLAD C + CD + N
Sbjct: 79 CGLCGKSFYAEKHLDVHFDNRHKSNVNTAEDAVCLADYCDIMRCDVLGN 127
>gi|270007090|gb|EFA03538.1| hypothetical protein TcasGA2_TC013541 [Tribolium castaneum]
Length = 423
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 2/115 (1%)
Query: 39 PG-RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
PG ++ C R+ +R +I+ +P +E+ + + +C HP D+F Q+ K
Sbjct: 22 PGPTVFKISCPRDNARIVRKIIQNKWIPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQH 81
Query: 98 DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMN 151
++W CG C KSF EK+LD HFDNRH +N++ CLAD C + CD ++
Sbjct: 82 RPSQWTCGLCGKSFFEEKYLDMHFDNRHKGFINMAEDAVCLADYCDIMRCDVLIT 136
>gi|91082837|ref|XP_969705.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 350
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 2/115 (1%)
Query: 39 PG-RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
PG ++ C R+ +R +I+ +P +E+ + + +C HP D+F Q+ K
Sbjct: 22 PGPTVFKISCPRDNARIVRKIIQNKWIPILEKYKVSLPIECPFHPIRDIFGPQQAAKQQH 81
Query: 98 DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMN 151
++W CG C KSF EK+LD HFDNRH +N++ CLAD C + CD ++
Sbjct: 82 RPSQWTCGLCGKSFFEEKYLDMHFDNRHKGFINMAEDAVCLADYCDIMRCDVLIT 136
>gi|226481637|emb|CAX73716.1| Zinc finger, C2H2-type domain-containing protein [Schistosoma
japonicum]
Length = 254
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 89/205 (43%), Gaps = 10/205 (4%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CS +SR A ++ E + P + ++ +C +P +D++ EQ K + +W C
Sbjct: 26 CSISQSRLARSILNELVFPLFQYANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQL 85
Query: 107 CKKSFRAEKFLDQHFDNRHN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSK 158
C K F E+ D H NRH N +++ CL+ C L C D+ +
Sbjct: 86 CGKRFYTERSFDLHIGNRHKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEAL 145
Query: 159 CNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQ 218
C+P + CE + N+C P+N S +L +L CD C R P K
Sbjct: 146 CDPLSFEDISKQCEDVINKCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKT 203
Query: 219 PSVFYLSISILTLMLLPIFYLIVYL 243
P + +++ L I+Y ++++
Sbjct: 204 PVLQKFLTALIVTAGLLIYYSVIFM 228
>gi|56754925|gb|AAW25645.1| SJCHGC03203 protein [Schistosoma japonicum]
Length = 254
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 93/206 (45%), Gaps = 12/206 (5%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CS +SR A ++ E + P + ++ +C +P +D++ EQ K + +W C
Sbjct: 26 CSISQSRLARSILNELVFPLFQFANVVVNEQCPFNPKHDIYSIHEQMKNKISDYDWECQL 85
Query: 107 CKKSFRAEKFLDQHFDNRHN-NLLNVSHGKCLADLCGALHC-------DFVMNSKSSRSK 158
C K F E+ D H NRH N +++ CL+ C L C D+ +
Sbjct: 86 CGKRFYTERSFDLHIGNRHKANAYSINRTICLSSYCSLLRCSVLKPEIDYGNQIFWDEAL 145
Query: 159 CNPAAVAKNRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQ 218
C+P + CE + N+C P+N S +L +L CD C R P K
Sbjct: 146 CDPLSFEDISKQCEDVINKCAPVNDS-SGIQLRQLLRSTLCDHLSCDRY-WILPDSHVKT 203
Query: 219 PSV-FYLSISILTLMLLPIFYLIVYL 243
P + +L+ I+T LL I+Y ++++
Sbjct: 204 PVLQKFLTAFIVTAGLL-IYYSVIFM 228
>gi|157134250|ref|XP_001663208.1| hypothetical protein AaeL_AAEL013023 [Aedes aegypti]
gi|108870545|gb|EAT34770.1| AAEL013023-PA [Aedes aegypti]
Length = 1069
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 65/115 (56%), Gaps = 1/115 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ CSR+ SR +++ +P +E+ Q + +C HP D+F Q+ K ++W
Sbjct: 41 KISCSRDSSRVVRKIVQSKWLPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWT 100
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS 157
CG C KSF EK LD HF+NRH + +N++ CLAD C + C+ ++ ++ S
Sbjct: 101 CGLCGKSFFEEKHLDMHFENRHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLS 155
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 60/109 (55%), Gaps = 1/109 (0%)
Query: 50 ERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKK 109
E SR +++ +P +E+ Q + +C HP D+F Q+ K ++W CG C K
Sbjct: 557 ESSRVVRKIVQSKWLPILEKYQVTLPLECPFHPLRDIFGPQQAAKKQNRPSQWTCGLCGK 616
Query: 110 SFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSRS 157
SF EK LD HF+NRH + +N++ CLAD C + C+ ++ ++ S
Sbjct: 617 SFFEEKHLDMHFENRHKSNINMAEDAVCLADYCDMMRCEVLIAKDATLS 665
>gi|328865036|gb|EGG13422.1| hypothetical protein DFA_11183 [Dictyostelium fasciculatum]
Length = 306
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 104/216 (48%), Gaps = 24/216 (11%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CS S ++ + FV+ +Y++ S C + + DM + K + +++RC
Sbjct: 75 CSTPLSNRVREILNHVVYSFVDYNEYKLPSTCVFNKNLDMLAVFQSLKFLQNDHQYRCKS 134
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHC--DFVMNSKSSRSKCNPAAV 164
C K F++E ++DQH N H +L+ + CL D C L+C D++++ C+ +
Sbjct: 135 CSKQFKSEDYIDQHLKNNHADLIPSNATICLGDYCEMLNCNNDYIIS-------CDERKM 187
Query: 165 AKNRHLCESLANRCFPINQGPSASR-LHELFLRQFCDAHKCPRKSQPFPKGG-------- 215
K ++ C+ + CFP PS ++ ++ F CD C S + G
Sbjct: 188 EKLKYACQGIMYNCFP----PSVNKNYNKHFNALICDKLTCKGGSSTTGQDGIIRDADIE 243
Query: 216 --RKQPSVFYLSISILTLMLLPIFYLIVYLYQREMR 249
+ Q S+ +++ L++L IFYLI++ +++E R
Sbjct: 244 EKQHQWSLMKYLSALVILVILIIFYLILWFFKKETR 279
>gi|242021247|ref|XP_002431057.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516286|gb|EEB18319.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 428
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 71/134 (52%), Gaps = 13/134 (9%)
Query: 39 PGRA-HEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHV 97
PG+ ++ C R+ + +I+ +P +E+ Q +I +C HP D+F Q+ K
Sbjct: 17 PGQTVFKISCPRDSASVVRKLIQRKWIPILEKYQVKIPVECPFHPIRDIFGPQQAAKHQF 76
Query: 98 DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH-GKCLADLCGALHCDFVMNSKSSR 156
++W CG+C KSF E++L+ HFDN H N +NV+ CLAD C + C+ ++ +
Sbjct: 77 RSSQWSCGFCGKSFYEERYLELHFDNIHKNQINVAEDAVCLADYCDIMRCEVLVTQE--- 133
Query: 157 SKCNPAAVAKNRHL 170
KN+HL
Sbjct: 134 --------IKNKHL 139
>gi|390365205|ref|XP_003730770.1| PREDICTED: uncharacterized protein LOC100893077 [Strongylocentrotus
purpuratus]
Length = 233
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 94/195 (48%), Gaps = 14/195 (7%)
Query: 75 SSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSH 133
++C L P+ +M+ D E+HK N W C YC KSF E+FLD+H+DNRH + +L
Sbjct: 9 GTECLLKPERNMYWDDERHKWSEIANNWNCLYCGKSFYNEQFLDKHYDNRHGSTVLTGPE 68
Query: 134 GKCLADLCGALHCDFVMNSKSS----RSKCNPAAVAKNRHLCESLANRCFP--INQGPSA 187
CLAD C L CD + ++K + + C + R C+++ C P I
Sbjct: 69 AVCLADHCDYLRCDIIASTKKTSFWEEALCKQKDLLVLRKKCQNIMKSCIPSHITDAEKN 128
Query: 188 SRLHELFLRQFCDAHKCPRKSQ-PFPKGGRKQPSVFYLSISILTLMLLPIFYLI----VY 242
S L +L C C + + P+ + + + ++++ I T+ ++Y++ +Y
Sbjct: 129 SLLDKL-DDTVCSFLTCEKYWENPYKEVPPSRTAAYFIA-GIFTVFGFIMYYVMACSHLY 186
Query: 243 LYQREMRGGTQELKR 257
++E G + R
Sbjct: 187 ADEKEFLGDSDGFVR 201
>gi|195388338|ref|XP_002052837.1| GJ17779 [Drosophila virilis]
gi|194149294|gb|EDW64992.1| GJ17779 [Drosophila virilis]
Length = 625
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 82/187 (43%), Gaps = 14/187 (7%)
Query: 39 PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
P ++ CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K
Sbjct: 18 PSNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPLRDVFAPRQDEKQRDR 77
Query: 99 INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRS 157
+W C C KSF EK+LD HFD RH +++N CLAD C + C+ +S
Sbjct: 78 PTQWTCRLCGKSFYQEKYLDLHFDTRHKHIINEAEDAVCLADFCDIMRCEVFQTEDASSL 137
Query: 158 KCNPA-----------AVAKNRHLCESLAN--RCFPINQGPSASRLHELFLRQFCDAHKC 204
K ++ +N L ++ A P SA+R ++ RQ
Sbjct: 138 KFGDQHIVTDIEVWGDSLGQNSALAKANAAYLSLIPRTSTLSATRAAKVQNRQLLQEKPS 197
Query: 205 PRKSQPF 211
P K QP
Sbjct: 198 PSKEQPL 204
>gi|195118254|ref|XP_002003655.1| GI21701 [Drosophila mojavensis]
gi|193914230|gb|EDW13097.1| GI21701 [Drosophila mojavensis]
Length = 639
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 64/121 (52%), Gaps = 1/121 (0%)
Query: 39 PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
P ++ CSR+ SR ++ P +++ Q ++ +C LHP D+F +++ K
Sbjct: 18 PSNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPLRDVFAPRQEEKQRDR 77
Query: 99 INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS 157
+W C C KSF EK+LD HFD RH +++N + CLADLC + C+ +S
Sbjct: 78 PTQWTCRLCGKSFYQEKYLDLHFDTRHKHIINEAEDSVCLADLCDIMRCEVFQTEDASSL 137
Query: 158 K 158
K
Sbjct: 138 K 138
>gi|357621060|gb|EHJ73029.1| hypothetical protein KGM_12603 [Danaus plexippus]
Length = 425
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 58/108 (53%), Gaps = 1/108 (0%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
++ CSR+ S+ VIE +P +E+ Q ++ +C HP D+F Q K ++W
Sbjct: 28 KITCSRDNSKIVRKVIENKWLPVLERFQVKLPLECPFHPARDIFAPQHAAKQQHRSSQWT 87
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVM 150
C +C KSF E+ LD HFD RH + +N CLAD C + C ++
Sbjct: 88 CAFCGKSFYEERHLDTHFDQRHRHQINKAEDAVCLADYCDIMRCQVLV 135
>gi|195032746|ref|XP_001988553.1| GH11227 [Drosophila grimshawi]
gi|193904553|gb|EDW03420.1| GH11227 [Drosophila grimshawi]
Length = 635
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 39 PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
P ++ CSR+ SR ++ P +++ Q ++ C LHP D+F ++ K
Sbjct: 18 PSNVIKLLCSRDNSRLVRKIVRSKWTPILDKHQVKLPLDCPLHPLRDVFAPRQDEKQRDR 77
Query: 99 INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS 157
+W C C KSF EKFLD HFD RH +++N + CLAD C + C+ +S
Sbjct: 78 PTQWTCRRCGKSFYQEKFLDLHFDTRHKHIINEAEDSVCLADFCDIMRCEVFQTEDASSL 137
Query: 158 K 158
K
Sbjct: 138 K 138
>gi|303280527|ref|XP_003059556.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459392|gb|EEH56688.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 324
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 78/192 (40%), Gaps = 45/192 (23%)
Query: 58 VIEEYLMPFVEQ--VQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEK 115
V+++Y P V++ +E+ C DM+ +QE+HK V +W+ Y +K FR E
Sbjct: 44 VLDDYFYPVVDRHAGSFEVPPACPFQRSKDMYLEQEKHKERVRRTQWKSLYSEKVFRTEW 103
Query: 116 FLDQHFDNRHNNLLNVSHGKCLADLCGALHCD-------------FVMNSKSSRSK---- 158
+D+H DNRH + + CLAD C L CD SRSK
Sbjct: 104 HVDKHMDNRHADKIPPGADVCLADYCAVLRCDEHHAHRNPDDPSRSGSRGAPSRSKGGGG 163
Query: 159 ------------CNPAAVAKNRHLCESLANRCFPINQGPSA--------------SRLHE 192
C+ + +A RH CE+L + CFP + + RL
Sbjct: 164 GGAGARRARRERCDESKLAGARHFCETLMHACFPSDDDAESDGDRGGGARGHGAAGRLRA 223
Query: 193 LFLRQFCDAHKC 204
F R CD C
Sbjct: 224 YFTRHHCDQLTC 235
>gi|195437634|ref|XP_002066745.1| GK24396 [Drosophila willistoni]
gi|194162830|gb|EDW77731.1| GK24396 [Drosophila willistoni]
Length = 754
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 58/113 (51%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPMECPLHPFRDIFAPRQDSKQRDRPTQWTCRR 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK+LD HFD RH L+N V CL+D C + CD ++ K
Sbjct: 91 CGKSFYQEKYLDLHFDIRHETLINEVEDSICLSDFCDIMRCDVFQTEDATSLK 143
>gi|442624983|ref|NP_001259827.1| CG4133, isoform C [Drosophila melanogaster]
gi|440213075|gb|AGB92364.1| CG4133, isoform C [Drosophila melanogaster]
Length = 583
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 1/121 (0%)
Query: 39 PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
P + CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K
Sbjct: 23 PTNVIRILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDR 82
Query: 99 INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRS 157
+W C C KSF EK LD HFD RH +++N CLAD C + C+ +S
Sbjct: 83 PTQWTCRKCGKSFYQEKHLDLHFDTRHKSIINEAEDAVCLADFCDIMRCEVFETEDASSL 142
Query: 158 K 158
K
Sbjct: 143 K 143
>gi|195575557|ref|XP_002077644.1| GD23028 [Drosophila simulans]
gi|194189653|gb|EDX03229.1| GD23028 [Drosophila simulans]
Length = 667
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK LD HFD RH +++N CLAD C + C+ +S K
Sbjct: 91 CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|194853511|ref|XP_001968176.1| GG24723 [Drosophila erecta]
gi|190660043|gb|EDV57235.1| GG24723 [Drosophila erecta]
Length = 669
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK LD HFD RH +++N CLAD C + C+ +S K
Sbjct: 91 CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|195470274|ref|XP_002087433.1| GE16858 [Drosophila yakuba]
gi|194173534|gb|EDW87145.1| GE16858 [Drosophila yakuba]
Length = 669
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK LD HFD RH +++N CLAD C + C+ +S K
Sbjct: 91 CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|194766505|ref|XP_001965365.1| GF20663 [Drosophila ananassae]
gi|190617975|gb|EDV33499.1| GF20663 [Drosophila ananassae]
Length = 701
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRR 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK LD HFD RH +++N CLAD C + C+ +S K
Sbjct: 91 CGKSFYQEKHLDLHFDTRHKHIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|442624981|ref|NP_001259826.1| CG4133, isoform B [Drosophila melanogaster]
gi|440213074|gb|AGB92363.1| CG4133, isoform B [Drosophila melanogaster]
Length = 642
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK LD HFD RH +++N CLAD C + C+ +S K
Sbjct: 91 CGKSFYQEKHLDLHFDTRHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|195350111|ref|XP_002041585.1| GM16746 [Drosophila sechellia]
gi|194123358|gb|EDW45401.1| GM16746 [Drosophila sechellia]
Length = 652
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK LD HFD RH +++N CLAD C + C+ +S K
Sbjct: 91 CGKSFYQEKHLDLHFDTRHKSIINQAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|19920466|ref|NP_608526.1| CG4133, isoform A [Drosophila melanogaster]
gi|7296200|gb|AAF51492.1| CG4133, isoform A [Drosophila melanogaster]
gi|18447493|gb|AAL68309.1| RE51073p [Drosophila melanogaster]
gi|220948886|gb|ACL86986.1| CG4133-PA [synthetic construct]
Length = 667
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K +W C
Sbjct: 31 CSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDRPTQWTCRK 90
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLN-VSHGKCLADLCGALHCDFVMNSKSSRSK 158
C KSF EK LD HFD RH +++N CLAD C + C+ +S K
Sbjct: 91 CGKSFYQEKHLDLHFDTRHKSIINEAEDAVCLADFCDIMRCEVFETEDASSLK 143
>gi|326435960|gb|EGD81530.1| hypothetical protein PTSG_02249 [Salpingoeca sp. ATCC 50818]
Length = 334
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 70/162 (43%), Gaps = 28/162 (17%)
Query: 41 RAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISS------------------KCRLHP 82
+A + +C R SR GV+ + ++P + +SS C L P
Sbjct: 29 QAADFNCDRATSRRVRGVLTQSILPLLGARGLHLSSPAKHNNDSRAEPGSALTPSCPLDP 88
Query: 83 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK--CLADL 140
D++ Q++H+ + C YC KSFR E FLD H NRH L V CLAD
Sbjct: 89 TRDVYARQDEHRDMEGTTKLTCLYCGKSFRGEPFLDLHMHNRHAGELLVPDTGPLCLADY 148
Query: 141 CGALHCDFVMN--------SKSSRSKCNPAAVAKNRHLCESL 174
C A C+ + S KC+P A+++ + CE L
Sbjct: 149 CDATMCETQLEYRQFARLFSPEHGGKCDPDAMSRRKESCERL 190
>gi|195147218|ref|XP_002014577.1| GL18880 [Drosophila persimilis]
gi|194106530|gb|EDW28573.1| GL18880 [Drosophila persimilis]
Length = 683
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 57/110 (51%), Gaps = 1/110 (0%)
Query: 39 PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
P ++ CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K
Sbjct: 23 PTNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDR 82
Query: 99 INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCD 147
+W C C KSF E+ LD HFD RH ++N + CLAD C + C+
Sbjct: 83 PTQWTCRRCGKSFYQERHLDLHFDTRHKTIINEAMDSICLADFCDIMRCE 132
>gi|198473764|ref|XP_001356431.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
gi|198138098|gb|EAL33495.2| GA17979 [Drosophila pseudoobscura pseudoobscura]
Length = 684
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 60/121 (49%), Gaps = 1/121 (0%)
Query: 39 PGRAHEVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVD 98
P ++ CSR+ SR ++ P +++ Q ++ +C LHP D+F ++ K
Sbjct: 23 PTNVIKILCSRDNSRLVRKIVRSKWTPILDKHQVKLPLECPLHPFRDVFAPRQDAKQRDR 82
Query: 99 INEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNSKSSRS 157
+W C C KSF E+ LD HFD RH ++N + CLAD C + C+ ++
Sbjct: 83 PTQWTCRRCGKSFYQERHLDLHFDTRHKTIINEAMDSICLADFCDIMRCEVFETEDATSL 142
Query: 158 K 158
K
Sbjct: 143 K 143
>gi|449682242|ref|XP_004210029.1| PREDICTED: uncharacterized protein LOC100205425 [Hydra
magnipapillata]
Length = 1324
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 101/244 (41%), Gaps = 33/244 (13%)
Query: 7 IFTSIFLLLL-----FSPQEILIQGFDDSASRTLKQEPGRAHEVHCSRERSRAAWGVIEE 61
+F IF LL FSP+ +L + S+ K+ +CS +RS ++ +
Sbjct: 1053 LFCYIFFSLLDKKRNFSPRNML---WSKSSRIDFKR--------NCSLQRSALVSNILSQ 1101
Query: 62 YLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHF 121
L P E+ Q +I +C + +++R E K + +C +C K F E + +H
Sbjct: 1102 TLYPIYEKYQLKIPRRCIFLQERNIYRGVEMGKTELSGGHIKCEFCGKLFENENYFYKHM 1161
Query: 122 DNRHNNLLNVSHG-KCLADLCGALHCD----FVMNSKSSRSKCNPAAVAKNRHLCESLAN 176
D +H N+ + CL+D C CD + +N CN V K + CE +
Sbjct: 1162 DRKHENVYVFNKDFVCLSDYCDIFRCDLHEMYPVNP-LDFDVCNKKDVRKLQRKCEKIIY 1220
Query: 177 RCFPINQGPSASRLHELFLRQFCDAHKCPRKSQPFPKGGRKQPSVFYLSISILTLMLLPI 236
C P +++ + C C F + S++Y++ LML+P
Sbjct: 1221 TCLPTEM---QHYVYDSLTKSLCAPLNCNNFYADF-----EDVSIWYIA---FMLMLVPA 1269
Query: 237 FYLI 240
+++
Sbjct: 1270 TFIL 1273
>gi|307110360|gb|EFN58596.1| hypothetical protein CHLNCDRAFT_140777 [Chlorella variabilis]
Length = 192
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 72/176 (40%), Gaps = 40/176 (22%)
Query: 74 ISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSH 133
+ + C L P +D +R QE K + WRC C K F AE+ +D H NRH +L
Sbjct: 1 MPAGCPLSPASDRYRHQEAAKETLAQGRWRCILCGKRFTAERHIDLHLANRHAHLEEQQQ 60
Query: 134 GKCLADLCGALHCDFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGP-SASRLHE 192
L D C LA RCFP + G SA RLH
Sbjct: 61 ----------LQED-----------------------CLELAARCFPPHGGDGSAQRLHA 87
Query: 193 LFLRQFCDAHKCPRKSQ-----PFPKGGRKQPSVFYLSISILTLMLLPIFYLIVYL 243
L + C AH C + + + R+ + +++ L++L F+L+++L
Sbjct: 88 LLAGRLCKAHTCDTRRKRSRTLAVAQAHRRWRGRWSFTLAA-GLVVLATFWLLLWL 142
>gi|256083558|ref|XP_002578009.1| hypothetical protein [Schistosoma mansoni]
gi|350644825|emb|CCD60456.1| hypothetical protein Smp_067940 [Schistosoma mansoni]
Length = 129
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 51/121 (42%), Gaps = 8/121 (6%)
Query: 78 CRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH-NNLLNVSHGKC 136
C HP +D++ EQ K + +W C C K F E D H NRH N + S C
Sbjct: 5 CPFHPKHDIYAIHEQMKNKISDYDWECQMCGKRFYTENTFDLHIGNRHETNAYSTSRTIC 64
Query: 137 LADLCGALHC-------DFVMNSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSASR 189
L+ C L C D+ + C+P + CE + N+C P S+++
Sbjct: 65 LSSYCPLLRCSVLKPDVDYGYQVFWDEALCDPKSFEAISKQCEDILNKCIPSGNDSSSTQ 124
Query: 190 L 190
L
Sbjct: 125 L 125
>gi|391348561|ref|XP_003748515.1| PREDICTED: uncharacterized protein LOC100905503 [Metaseiulus
occidentalis]
Length = 505
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 86/228 (37%), Gaps = 35/228 (15%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
C R S V+ L P +++ + + C HP D+ Q + + WRC
Sbjct: 34 CDRHSSAIVRHVVYTRLQPVLDRYRVSMPEDCPFHPARDVLV-WPQRDANSITDVWRCPV 92
Query: 107 CKKSFRAEKFLDQHFDNRHNNLLNVSHGK---------CLADLCGALHCDFVMNS----- 152
C KSF E+ L QHF H K CL C + CD +
Sbjct: 93 CGKSFFGEENLTQHFHAHTGTRGGTQHVKRHITEGDRVCLGTFCDIMRCDVIERQLSEAL 152
Query: 153 ---KSSRSKCNPAAVAK-NRHLCESLANRCFPINQGPSASRLHELFLRQFCDAHK----- 203
SSR +P A NR L NRC + Q + L+ L ++++ D +
Sbjct: 153 TPRTSSRKTESPKPKADCNRQSMAVLENRCKAVMQQCTIGLLNVLSVKEYKDVEEALHQN 212
Query: 204 ------CPRKSQPFPKGGRKQPSVFYLSISILTLMLLPIF--YLIVYL 243
C R + P+ R P +L+ +I L+ F Y +V++
Sbjct: 213 ICSYLTCERYWEEGPRDTRFSP---FLASTIGFLIASGFFGCYWVVWI 257
>gi|358337329|dbj|GAA55703.1| hypothetical protein CLF_108755 [Clonorchis sinensis]
Length = 463
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 3/97 (3%)
Query: 56 WGVIEEYLM--PFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRA 113
W ++L+ P + + ++C +P D++ QE K +W+C C K F +
Sbjct: 262 WVPTTDFLVFFPLYDFHHVPVPNECPWNPHLDIYNLQELMKTQNSARDWQCNQCWKRFYS 321
Query: 114 EKFLDQHFDNRHNNLLNVS-HGKCLADLCGALHCDFV 149
E LD H DNRH + + + CLA C L CD +
Sbjct: 322 EHALDLHMDNRHRSTVYTGPNATCLALFCPLLRCDVL 358
>gi|412990909|emb|CCO18281.1| predicted protein [Bathycoccus prasinos]
Length = 305
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 63/269 (23%), Positives = 99/269 (36%), Gaps = 51/269 (18%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQE-----QHKIHVDINE 101
C + +SR A +EE+ P + E+S +C D++R+ E +H E
Sbjct: 17 CDQRKSRTARQALEEFFFPLFTNEKEEVSERCPFKKSRDVYREIESRKTRRHNKRKKEEE 76
Query: 102 WRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGK-CLADLCGALHCDFVMNS-------K 153
W+ K F++E +D + R + G CLAD C L CD S
Sbjct: 77 WK-DVDGKVFKSEADIDAYLSGR---FFKSTKGDVCLADYCDVLQCDEYYESLETSSSSS 132
Query: 154 SSRSKCNPAAVAKNRHLCESLANRCFPIN-----QGPSASRLHELFLRQFCDAHKCPR-- 206
+S + + AA + C +L CFP Q +A R +F F H+ R
Sbjct: 133 TSSGEKDYAAKTRAGAHCYALTKTCFPERSESAVQEKNARRFESIFCDTFVGLHQDGRNA 192
Query: 207 ----KSQPFPKGGRKQPS--------------------VFYLSISILTLMLLPIFYLIVY 242
+ Q + G + + +L IS L L IFYL+ +
Sbjct: 193 FAHIRKQEYKYPGHVRHANTNIDASASPSSSSSSSSSSSSFLIISTLFTFLGFIFYLLFF 252
Query: 243 L---YQREMRGGTQELKRIPRVGRKAKPS 268
L R G + ++ R K K +
Sbjct: 253 LAPSINRRASGSDLKARKGRRTASKKKST 281
>gi|167519981|ref|XP_001744330.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777416|gb|EDQ91033.1| predicted protein [Monosiga brevicollis MX1]
Length = 272
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 57/142 (40%), Gaps = 11/142 (7%)
Query: 47 CSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWRCGY 106
C++ SR A +E+ L PF S C ++D+F + + + C +
Sbjct: 41 CNQTESRRAMAALEQVLYPFQPLTP---SESCPWIVEHDLFYELNRDADVTAKGQPACLF 97
Query: 107 CKKSFRAEKFLDQHFDNRH----NNLLNVSHGKCLADLCGALHCDFVMNSKS----SRSK 158
C K+F FLDQH+ RH +L N CLAD C L C +
Sbjct: 98 CGKAFENAVFLDQHYPRRHLPTFLDLPNPDRRLCLADSCDVLQCAAYSEGAELWAFAHQP 157
Query: 159 CNPAAVAKNRHLCESLANRCFP 180
C+P + C + + CFP
Sbjct: 158 CSPGQYDMAQVRCRATIDTCFP 179
>gi|303390934|ref|XP_003073697.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
50506]
gi|303302845|gb|ADM12337.1| hypothetical protein Eint_100070 [Encephalitozoon intestinalis ATCC
50506]
Length = 388
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 33/50 (66%), Gaps = 5/50 (10%)
Query: 80 LHPDNDMFRDQEQHKIHVDINE--WRCGYCKKSFRAEKFLDQHFDNRHNN 127
LHP +++E + + +NE ++CG+C KSF +EKF+ HF+N+H +
Sbjct: 278 LHPKT---KEEELNNMTTIVNENHYKCGFCGKSFESEKFIFNHFNNKHED 324
>gi|68489815|ref|XP_711247.1| potential zinc finger protein [Candida albicans SC5314]
gi|46432535|gb|EAK92012.1| potential zinc finger protein [Candida albicans SC5314]
Length = 132
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 8/75 (10%)
Query: 78 CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 134
C HP N F D +Q+ +HV I+E C C K F +KFL H D HN +L +
Sbjct: 28 CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPDKKFLSLHIDENHNPILQIKQER 87
Query: 135 -----KCLADLCGAL 144
KC + C A+
Sbjct: 88 GDKIFKCYTESCKAV 102
>gi|19074624|ref|NP_586130.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
gi|51702181|sp|Q8SUH6.1|ZA15_ENCCU RecName: Full=Zinc finger C2H2 protein ECU10_0150
gi|19069266|emb|CAD25734.1| Zinc finger domain [Encephalitozoon cuniculi GB-M1]
Length = 388
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 24/31 (77%)
Query: 97 VDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 127
V+ N ++CG+C K+F +EKF+ HF+N+H N
Sbjct: 294 VNENHYKCGFCGKAFESEKFIFNHFNNKHEN 324
>gi|68489768|ref|XP_711271.1| potential zinc finger protein [Candida albicans SC5314]
gi|46432560|gb|EAK92036.1| potential zinc finger protein [Candida albicans SC5314]
Length = 132
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 8/75 (10%)
Query: 78 CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 134
C HP N F D +Q+ +HV I+E C C K F + FL H D HN LL +
Sbjct: 28 CNYHPCFNQAFEDYQQYHLHVVSIHEHTCESCLKVFPNKSFLSLHIDENHNPLLQIKQER 87
Query: 135 -----KCLADLCGAL 144
KC + C A+
Sbjct: 88 GDKIFKCYTESCKAV 102
>gi|347967063|ref|XP_550770.4| AGAP002041-PA [Anopheles gambiae str. PEST]
gi|333469774|gb|EAL38493.4| AGAP002041-PA [Anopheles gambiae str. PEST]
Length = 1932
Score = 41.2 bits (95), Expect = 0.45, Method: Composition-based stats.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)
Query: 92 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
+HK+ H D ++ CG C K+F+ + L+ H N +C AD CG +CD
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316
Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 188
++S R +N H LAN + GP S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347
>gi|347967065|ref|XP_003436012.1| AGAP002041-PB [Anopheles gambiae str. PEST]
gi|333469775|gb|EGK97401.1| AGAP002041-PB [Anopheles gambiae str. PEST]
Length = 1876
Score = 41.2 bits (95), Expect = 0.50, Method: Composition-based stats.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 15/98 (15%)
Query: 92 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
+HK+ H D ++ CG C K+F+ + L+ H N +C AD CG +CD
Sbjct: 264 KHKLTHSDERKYVCGMCSKAFKRQDHLNGHMLTHRNK----KPYECKADGCGKSYCD--- 316
Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRCFPINQGPSAS 188
++S R +N H LAN + GP S
Sbjct: 317 -ARSLRRH------TENHHSALGLANSSGTASPGPGGS 347
>gi|401827595|ref|XP_003888090.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
50504]
gi|392999290|gb|AFM99109.1| hypothetical protein EHEL_100120 [Encephalitozoon hellem ATCC
50504]
Length = 388
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 23/29 (79%)
Query: 97 VDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
V+ N ++CG+C KSF +EKF+ HF+N+H
Sbjct: 294 VNENHYKCGFCGKSFESEKFIFNHFNNKH 322
>gi|66358034|ref|XP_626195.1| hypothetical protein [Cryptosporidium parvum Iowa II]
gi|46227028|gb|EAK87978.1| hypothetical protein with signal peptide, possible transmembrane
domain near C-terminus and predicted SMART Znf_C2H2
domain [Cryptosporidium parvum Iowa II]
Length = 233
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/85 (27%), Positives = 44/85 (51%), Gaps = 1/85 (1%)
Query: 44 EVHCSRERSRAAWGVIEEYLMPFVEQVQYEISSKCRLHPDNDMFRDQEQHKIHVDINEWR 103
+++CSR+ SR I + L+ + + + +C + ++ +F +E+ I E +
Sbjct: 25 DINCSRQVSRRVSHAIRK-LLKDKKDISKHLLPECAFNQEHHIFNYEEKINIVYPTGERQ 83
Query: 104 CGYCKKSFRAEKFLDQHFDNRHNNL 128
CG+C + F+ EK DQH H +L
Sbjct: 84 CGFCGEIFQDEKTYDQHMGKSHPSL 108
>gi|357631664|gb|EHJ79133.1| putative zinc finger protein 91-like protein [Danaus plexippus]
Length = 390
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 92 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNN 127
QH + H+DIN+++CG+C K++++ + L +HF H++
Sbjct: 189 QHLVSHIDINQFKCGFCNKTYKSRQSLKEHFRVAHSS 225
>gi|241956272|ref|XP_002420856.1| zinc finger protein, putative [Candida dubliniensis CD36]
gi|223644199|emb|CAX41009.1| zinc finger protein, putative [Candida dubliniensis CD36]
Length = 143
Score = 38.1 bits (87), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 8/75 (10%)
Query: 78 CRLHP-DNDMFRDQEQHKIHV-DINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHG- 134
C HP N F + +Q+ +HV +E C C+K F + +L H D HN LL +
Sbjct: 39 CNYHPCLNSAFENYQQYHLHVLSTHEHTCESCQKVFPNKNYLSLHIDENHNPLLQIKQER 98
Query: 135 -----KCLADLCGAL 144
KC + C A+
Sbjct: 99 GDKIFKCYTETCKAV 113
>gi|443690749|gb|ELT92809.1| hypothetical protein CAPTEDRAFT_214413 [Capitella teleta]
Length = 947
Score = 38.1 bits (87), Expect = 3.7, Method: Composition-based stats.
Identities = 32/113 (28%), Positives = 54/113 (47%), Gaps = 6/113 (5%)
Query: 15 LLFSPQEILIQGFDDSASRTLKQEPGRAHEVHCS-RERSRAAWGVIEEYLMPFVEQVQYE 73
LLFS + + +G D A+ + + G +E+ CS ER + A V+++ +Q++ E
Sbjct: 106 LLFSQEYLNNKGKDLDAAFKREADVGIVNEISCSCNERMQQAHKVLKKEF----DQLKEE 161
Query: 74 ISS-KCRLHPDNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRH 125
++ K H M Q+Q + + +C YC K+F FL H RH
Sbjct: 162 LAEVKKESHKRKKMILAQQQMLQNGQNSYNKCPYCSKAFVNASFLQSHLLRRH 214
>gi|392568613|gb|EIW61787.1| hypothetical protein TRAVEDRAFT_27265 [Trametes versicolor
FP-101664 SS1]
Length = 904
Score = 38.1 bits (87), Expect = 3.9, Method: Composition-based stats.
Identities = 15/47 (31%), Positives = 29/47 (61%)
Query: 83 DNDMFRDQEQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
+ ++ R E H D ++RC C+K F+A F+++H N+H++L+
Sbjct: 670 EEELSRACEAHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHSDLI 716
>gi|195107861|ref|XP_001998512.1| GI23595 [Drosophila mojavensis]
gi|193915106|gb|EDW13973.1| GI23595 [Drosophila mojavensis]
Length = 2064
Score = 38.1 bits (87), Expect = 4.2, Method: Composition-based stats.
Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 5/88 (5%)
Query: 92 QHKI-HVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLLNVSHGKCLADLCGALHCDFVM 150
+HK+ H D ++ C C K+F+ + L+ H N +C A+ CG +CD
Sbjct: 351 KHKLTHSDERKYICALCSKAFKRQDHLNGHMMTHRNK----KPYECKAEGCGKSYCDARS 406
Query: 151 NSKSSRSKCNPAAVAKNRHLCESLANRC 178
+ S + AA N L + + C
Sbjct: 407 LRRHSENHHGGAATPNNNSLSPTTSTNC 434
>gi|409049944|gb|EKM59421.1| hypothetical protein PHACADRAFT_249892 [Phanerochaete carnosa
HHB-10118-sp]
Length = 924
Score = 37.7 bits (86), Expect = 5.5, Method: Composition-based stats.
Identities = 15/39 (38%), Positives = 25/39 (64%)
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
EQH D ++RC C+K F+A F+++H N+H +L+
Sbjct: 693 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPDLV 731
>gi|67612273|ref|XP_667210.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis TU502]
gi|54658324|gb|EAL36982.1| zf-C2H2 zinc finger protein [Cryptosporidium hominis]
Length = 668
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
EQH + + W+CG C K F+ E+++ +H +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571
>gi|449549636|gb|EMD40601.1| hypothetical protein CERSUDRAFT_111186 [Ceriporiopsis subvermispora
B]
Length = 926
Score = 37.4 bits (85), Expect = 6.9, Method: Composition-based stats.
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
EQH D ++RC C+K F+A F+++H N+H L+
Sbjct: 694 EQHIKQEDEGKFRCKTCQKLFKATSFVEKHIANKHPELV 732
>gi|66475454|ref|XP_627543.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
[Cryptosporidium parvum Iowa II]
gi|46228996|gb|EAK89845.1| protein with C2H2 Zn finger, ASR2B-like N- and C-terminus
[Cryptosporidium parvum Iowa II]
Length = 668
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
EQH + + W+CG C K F+ E+++ +H +H + L
Sbjct: 533 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 571
>gi|32398760|emb|CAD98720.1| hypothetical predicted zf-C2H2 zinc finger protein, unknown
function [Cryptosporidium parvum]
Length = 598
Score = 37.0 bits (84), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 91 EQHKIHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNLL 129
EQH + + W+CG C K F+ E+++ +H +H + L
Sbjct: 463 EQHTLRKKADRWQCGKCLKQFKGEEYVHKHLAKKHKDFL 501
>gi|157136418|ref|XP_001663747.1| hypothetical protein AaeL_AAEL013548 [Aedes aegypti]
gi|108869956|gb|EAT34181.1| AAEL013548-PA [Aedes aegypti]
Length = 566
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 95 IHVDINEWRCGYCKKSFRAEKFLDQHFDNRHNNL 128
IHVD +RC C K+F+ +++L +H +N H ++
Sbjct: 303 IHVDPKTYRCEPCDKTFKGKRYLQEHLENVHGSI 336
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.138 0.435
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,156,223,143
Number of Sequences: 23463169
Number of extensions: 163957519
Number of successful extensions: 606517
Number of sequences better than 100.0: 181
Number of HSP's better than 100.0 without gapping: 112
Number of HSP's successfully gapped in prelim test: 69
Number of HSP's that attempted gapping in prelim test: 606103
Number of HSP's gapped (non-prelim): 381
length of query: 268
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 128
effective length of database: 9,074,351,707
effective search space: 1161517018496
effective search space used: 1161517018496
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 75 (33.5 bits)