BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023577
(280 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224128117|ref|XP_002320248.1| predicted protein [Populus trichocarpa]
gi|222861021|gb|EEE98563.1| predicted protein [Populus trichocarpa]
Length = 271
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 219/274 (79%), Positives = 242/274 (88%), Gaps = 8/274 (2%)
Query: 10 EKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA 69
+KK+NFRKR++EE+E + DDE+ERRLALEE+KFLQKQRERKSGIPA+ + Q+A
Sbjct: 3 QKKRNFRKRTFEEDEHSKAS-DDDEQERRLALEEVKFLQKQRERKSGIPALATTSQTATT 61
Query: 70 AGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
K++EK +GDGEK+ELVLQDTFAQETAVMVEDPNML+YVEQELAKKRGKNID
Sbjct: 62 VAA----KLTEKADGDGEKEELVLQDTFAQETAVMVEDPNMLQYVEQELAKKRGKNIDAT 117
Query: 130 DRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 187
D+VE +LK AEDELYKIPEHLK KRNSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAK
Sbjct: 118 DQVETELKRAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAK 177
Query: 188 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG-AGSRP 246
KLLQEKRLMGR KS+FSIPSSYSADYFQRGRDYAEKLRR+HPELYKDR QDD AGS+P
Sbjct: 178 KLLQEKRLMGRPKSEFSIPSSYSADYFQRGRDYAEKLRRDHPELYKDRSLQDDAVAGSKP 237
Query: 247 TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
DNSTDAAG RQAATD+FMLERFRKRERHRVMRR
Sbjct: 238 ADNSTDAAGRRQAATDEFMLERFRKRERHRVMRR 271
>gi|356498525|ref|XP_003518101.1| PREDICTED: uncharacterized protein C9orf78 homolog [Glycine max]
Length = 287
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 220/284 (77%), Positives = 243/284 (85%), Gaps = 11/284 (3%)
Query: 7 QKKEKKKNFRKRSY-----EEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
+++++KKN+RKRS E + +N SDDE ERR+ALEEIK LQKQRERKSGIPA P
Sbjct: 5 KQQQRKKNYRKRSAPTDKDELPQSQSNNESDDERERRMALEEIKLLQKQRERKSGIPANP 64
Query: 62 SALQSAAAAGGGGLTKVSEKNEGDG-EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
S LQ + GGG K +EKN+GDG EKDELVLQDTFAQETAVM EDPNM+KY+E ELAK
Sbjct: 65 S-LQVQSGTGGGLAAKAAEKNDGDGGEKDELVLQDTFAQETAVMDEDPNMVKYIEHELAK 123
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLK 178
KRG+ ID D+VEN+LK AEDELYKIPEHLK +RNSEESSTQWTTGIAEVQLPIEYKLK
Sbjct: 124 KRGRKIDAADQVENELKRAEDELYKIPEHLKVKRRNSEESSTQWTTGIAEVQLPIEYKLK 183
Query: 179 NIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQ 238
NIEETEAAKKLLQEKRLMGR KSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDR Q
Sbjct: 184 NIEETEAAKKLLQEKRLMGRTKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRSHQ 243
Query: 239 DDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 280
DD +GS+ D+STDAAG+ RQAATD+FMLERFRKRERHRVMRR
Sbjct: 244 DDSSGSKQNDSSTDAAGAVQRQAATDEFMLERFRKRERHRVMRR 287
>gi|358248108|ref|NP_001239815.1| uncharacterized protein LOC100812323 [Glycine max]
gi|255645199|gb|ACU23097.1| unknown [Glycine max]
Length = 288
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 218/284 (76%), Positives = 242/284 (85%), Gaps = 11/284 (3%)
Query: 7 QKKEKKKNFRKRSYEEEEE-----TTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
Q++++KKN+RKRS +E+ +N SDDE ERR+ALEEIK LQKQRERKSGIPA P
Sbjct: 6 QQQQRKKNYRKRSAPTDEDELPQSQSNNESDDERERRMALEEIKLLQKQRERKSGIPANP 65
Query: 62 SALQSAAAAGGGGLTKVSEKNEGDG-EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
S LQ + GGG K +EKN+GDG +KDELVLQDTFAQETAVM EDPNM+ Y+E ELAK
Sbjct: 66 S-LQVQSGTGGGLAAKAAEKNDGDGGDKDELVLQDTFAQETAVMDEDPNMVNYIEHELAK 124
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLK 178
KRG+ ID D+ EN+LK AEDELYKIPEHLK +RNSEESSTQWTTGIAEVQLPIEYKLK
Sbjct: 125 KRGRKIDAADQAENELKRAEDELYKIPEHLKVKRRNSEESSTQWTTGIAEVQLPIEYKLK 184
Query: 179 NIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQ 238
NIEETEAAKKLLQEKRLMGR KSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDR Q
Sbjct: 185 NIEETEAAKKLLQEKRLMGRTKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRNHQ 244
Query: 239 DDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 280
DD +GS+ D+STDAAG+ RQAATD+FMLERFRKRERHRVMRR
Sbjct: 245 DDSSGSKKNDSSTDAAGAVQRQAATDEFMLERFRKRERHRVMRR 288
>gi|225437728|ref|XP_002280535.1| PREDICTED: uncharacterized protein LOC100250416 [Vitis vinifera]
Length = 270
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 211/273 (77%), Positives = 239/273 (87%), Gaps = 7/273 (2%)
Query: 11 KKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAA 70
+KKNFRKRS E+++ N S+DEEERRLALEE+KFLQKQRERK GIPAIP+ LQ+
Sbjct: 2 QKKNFRKRSIEDDQAKDNNNSEDEEERRLALEEVKFLQKQRERKLGIPAIPT-LQTT--- 57
Query: 71 GGGGLTKVSEKNE-GDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
G KV+EKNE DG+K+ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG+NID
Sbjct: 58 GVTPTKKVAEKNEVPDGDKEELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGRNIDAT 117
Query: 130 DRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 187
++V NDLK A+DELY +PEHLK +RNSEESSTQWTTGIAEVQLP+EYKL+NIEETEAAK
Sbjct: 118 NQVGNDLKRADDELYVVPEHLKVKRRNSEESSTQWTTGIAEVQLPVEYKLRNIEETEAAK 177
Query: 188 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPT 247
KLLQ+KRLMGR K++F+IPSSYSADYFQRGRDYAEKLRREHPELYKD+G QD+G GSR
Sbjct: 178 KLLQDKRLMGRTKTEFNIPSSYSADYFQRGRDYAEKLRREHPELYKDKGVQDNGGGSRLP 237
Query: 248 DNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
D ST+ AG RQAATD+FML+RFRKRERHRVMRR
Sbjct: 238 DASTEVAGRRQAATDEFMLDRFRKRERHRVMRR 270
>gi|297744059|emb|CBI37029.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 212/276 (76%), Positives = 241/276 (87%), Gaps = 7/276 (2%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+ +KKNFRKRS E+++ N S+DEEERRLALEE+KFLQKQRERK GIPAIP+ LQ+
Sbjct: 27 KEMQKKNFRKRSIEDDQAKDNNNSEDEEERRLALEEVKFLQKQRERKLGIPAIPT-LQTT 85
Query: 68 AAAGGGGLTKVSEKNE-GDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI 126
G KV+EKNE DG+K+ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG+NI
Sbjct: 86 ---GVTPTKKVAEKNEVPDGDKEELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGRNI 142
Query: 127 DVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETE 184
D ++V NDLK A+DELY +PEHLK +RNSEESSTQWTTGIAEVQLP+EYKL+NIEETE
Sbjct: 143 DATNQVGNDLKRADDELYVVPEHLKVKRRNSEESSTQWTTGIAEVQLPVEYKLRNIEETE 202
Query: 185 AAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGS 244
AAKKLLQ+KRLMGR K++F+IPSSYSADYFQRGRDYAEKLRREHPELYKD+G QD+G GS
Sbjct: 203 AAKKLLQDKRLMGRTKTEFNIPSSYSADYFQRGRDYAEKLRREHPELYKDKGVQDNGGGS 262
Query: 245 RPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
R D ST+ AG RQAATD+FML+RFRKRERHRVMRR
Sbjct: 263 RLPDASTEVAGRRQAATDEFMLDRFRKRERHRVMRR 298
>gi|255556659|ref|XP_002519363.1| Protein C9orf78, putative [Ricinus communis]
gi|223541430|gb|EEF42980.1| Protein C9orf78, putative [Ricinus communis]
Length = 293
Score = 392 bits (1007), Expect = e-107, Method: Compositional matrix adjust.
Identities = 222/292 (76%), Positives = 243/292 (83%), Gaps = 24/292 (8%)
Query: 11 KKKNFRKRSYEEEEE-----TTNKLS--DDEEERRLALEEIKFLQKQRERKSGIPAIPSA 63
KKKNFRKRS EE E+ N + DDEEERRLALEE+KFLQKQRERKSGIPAI +
Sbjct: 4 KKKNFRKRSIEEAEDPESSRNNNNATPDDDEEERRLALEEVKFLQKQRERKSGIPAILTP 63
Query: 64 LQSAAAA----------GGGGLT---KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNM 110
SA+++ GL KV+EKN+GDGEK++LVLQDTFAQETAVMVEDPNM
Sbjct: 64 SSSASSSAAAAAAQLQQNSSGLVSSKKVTEKNDGDGEKEDLVLQDTFAQETAVMVEDPNM 123
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAE 168
L YVEQELAKK GKN+D +VEN+LK AEDELY IPEHLK +RNSEESSTQWTTGIAE
Sbjct: 124 LMYVEQELAKKSGKNVDAT-QVENELKRAEDELYTIPEHLKVKRRNSEESSTQWTTGIAE 182
Query: 169 VQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREH 228
VQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKS+FSIPSSYSADYFQRGRDYAEKLRREH
Sbjct: 183 VQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSEFSIPSSYSADYFQRGRDYAEKLRREH 242
Query: 229 PELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
PELYKDR SQD+ AGS+P DN+TDA R+AATD+FMLERFRKRERHRVMRR
Sbjct: 243 PELYKDRNSQDESAGSKPADNNTDAT-RREAATDEFMLERFRKRERHRVMRR 293
>gi|449463519|ref|XP_004149481.1| PREDICTED: uncharacterized protein LOC101215146 [Cucumis sativus]
Length = 293
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 202/288 (70%), Positives = 230/288 (79%), Gaps = 18/288 (6%)
Query: 11 KKKNFRKRSYEEEEE-------TTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSA 63
KKKNFRKR+ + E+ T ++EEE R+ALEE+KFLQKQRER++GIPA+P
Sbjct: 6 KKKNFRKRNCYDSEDGGDEANSVTAISEEEEEEHRMALEEVKFLQKQRERRAGIPAVPPV 65
Query: 64 LQSAAAAGGGGL------TKVSEKNE---GDGEKDELVLQDTFAQETAVMVEDPNMLKYV 114
AG GG KNE G+G+KD+LVLQDTFAQETAVMVEDPNMLKY+
Sbjct: 66 SAQTTTAGAGGARSHKTGGGGGNKNESAGGEGDKDDLVLQDTFAQETAVMVEDPNMLKYI 125
Query: 115 EQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLP 172
EQELAKKRG+ ++ + ENDLK AEDELYKIPEHLK +RNS ESSTQWTTGIAEVQLP
Sbjct: 126 EQELAKKRGRTVETVEGAENDLKQAEDELYKIPEHLKVKRRNSNESSTQWTTGIAEVQLP 185
Query: 173 IEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELY 232
IE+KLKNIEETEAAKKLLQEKR +GR+ S+FSIPSSYSADYF RGRDYAEKLRREHPELY
Sbjct: 186 IEFKLKNIEETEAAKKLLQEKRFVGRSTSEFSIPSSYSADYFHRGRDYAEKLRREHPELY 245
Query: 233 KDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
KDR QDDG+GS+P + T+AAG RQAATD+FMLERFRKRERHRVMRR
Sbjct: 246 KDRSLQDDGSGSKPAETGTEAAGQRQAATDEFMLERFRKRERHRVMRR 293
>gi|297848438|ref|XP_002892100.1| hypothetical protein ARALYDRAFT_887370 [Arabidopsis lyrata subsp.
lyrata]
gi|297337942|gb|EFH68359.1| hypothetical protein ARALYDRAFT_887370 [Arabidopsis lyrata subsp.
lyrata]
Length = 277
Score = 385 bits (990), Expect = e-105, Method: Compositional matrix adjust.
Identities = 206/280 (73%), Positives = 235/280 (83%), Gaps = 17/280 (6%)
Query: 12 KKNFRKRSYEEE--EETTNK--LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+NFRKRS+EEE + NK +SD+EE+RRLALEE+KFLQK RERK GIPA+ +A S
Sbjct: 4 KRNFRKRSFEEEEEDNDVNKAAISDEEEKRRLALEEVKFLQKLRERKLGIPALSTAQSSI 63
Query: 68 AAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G K EK E +GEK+ELVLQDTFAQETAV++EDPNM+KY+EQELAKKRGKNID
Sbjct: 64 ------GKVKPVEKTEAEGEKEELVLQDTFAQETAVLIEDPNMVKYIEQELAKKRGKNID 117
Query: 128 VNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 185
+ VEN+LK EDELYKIP+HLK KR+SEESSTQWTTGIAEVQLPIEYKLKNIEETEA
Sbjct: 118 DAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 177
Query: 186 AKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGS-QDDGAGS 244
AKKLLQE+RLMGR KS+FSIPSSYSADYFQRG+DYAEKLRREHPELYKDRG Q DG G+
Sbjct: 178 AKKLLQERRLMGRPKSEFSIPSSYSADYFQRGKDYAEKLRREHPELYKDRGGPQADGEGA 237
Query: 245 RP----TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
+P ++N+ D+ SRQAATDQ MLERFRKRER+RVMRR
Sbjct: 238 KPSTSSSNNNADSGKSRQAATDQIMLERFRKRERNRVMRR 277
>gi|449481099|ref|XP_004156081.1| PREDICTED: uncharacterized LOC101215146 [Cucumis sativus]
Length = 305
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 199/300 (66%), Positives = 230/300 (76%), Gaps = 30/300 (10%)
Query: 11 KKKNFRKRSYEEEEE-------TTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSA 63
KKKNFRKR+ + E+ T ++EEE R+ALEE+KFLQKQRER++GIPA+P
Sbjct: 6 KKKNFRKRNCYDSEDGGDEANSVTAISEEEEEEHRMALEEVKFLQKQRERRAGIPAVPPV 65
Query: 64 ------------------LQSAAAAGGGGLTKVSEKNE---GDGEKDELVLQDTFAQETA 102
++ + A KNE G+G+KD+LVLQDTFAQETA
Sbjct: 66 SAQTTTAGAGGASGGGGLVRKSTDANSKTGGGGGNKNESAGGEGDKDDLVLQDTFAQETA 125
Query: 103 VMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESST 160
VMVEDPNMLKY+EQELAKKRG+ ++ + ENDLK AEDELYKIPEHLK +RNS ESST
Sbjct: 126 VMVEDPNMLKYIEQELAKKRGRTVETVEGAENDLKQAEDELYKIPEHLKVKRRNSNESST 185
Query: 161 QWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 220
QWTTGIAEVQLPIE+KLKNIEETEAAKKLLQEKR +GR+ S+FSIPSSYSADYF RGRDY
Sbjct: 186 QWTTGIAEVQLPIEFKLKNIEETEAAKKLLQEKRFVGRSTSEFSIPSSYSADYFHRGRDY 245
Query: 221 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
AEKLRREHPELYKDR QDDG+GS+P + T+AAG RQAATD+FMLERFRKRERHRVMRR
Sbjct: 246 AEKLRREHPELYKDRSLQDDGSGSKPAETGTEAAGQRQAATDEFMLERFRKRERHRVMRR 305
>gi|217071714|gb|ACJ84217.1| unknown [Medicago truncatula]
Length = 291
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 207/292 (70%), Positives = 234/292 (80%), Gaps = 14/292 (4%)
Query: 1 MENPIPQKKEKKKNFRKRSYEEEEETT-----NKLSDDEEERRLALEEIKFLQKQRERKS 55
MEN ++K ++KN+RKR+ EE + N SDDE ERRLALEEIK LQKQRERKS
Sbjct: 1 MENS-KEEKPRRKNYRKRTPTEEHDQPPQSQQNNDSDDESERRLALEEIKLLQKQRERKS 59
Query: 56 GIPAIPSALQSAAAAGGGGLTKV---SEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLK 112
GIPA + QS G +K ++ G+KD+LVLQDTFAQETAVM EDPNM+K
Sbjct: 60 GIPATLTLQQSQPGISSGLASKAVDKNDAGGDGGDKDDLVLQDTFAQETAVMDEDPNMVK 119
Query: 113 YVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL--KKRNSEESSTQWTTGIAEVQ 170
Y+ QELAKKRG+NID D+VEN+LK AEDELY IP+HL KKRNSEESSTQWTTGIAE+Q
Sbjct: 120 YIGQELAKKRGRNIDEEDQVENELKRAEDELYTIPDHLKVKKRNSEESSTQWTTGIAEIQ 179
Query: 171 LPIEYKLKNIEETEAAKKLLQEKRLM-GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 229
LPIEYKLKNIEETEAAKKLLQEKRLM GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP
Sbjct: 180 LPIEYKLKNIEETEAAKKLLQEKRLMVGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 239
Query: 230 ELYKDRGSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMR 279
ELYKDR QDD + S+ ++S+DA G+ RQAATDQFMLERF+KRERHRV R
Sbjct: 240 ELYKDRSQQDDNSASKQNESSSDAPGAVQRQAATDQFMLERFKKRERHRVRR 291
>gi|388514547|gb|AFK45335.1| unknown [Medicago truncatula]
Length = 291
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 206/292 (70%), Positives = 232/292 (79%), Gaps = 14/292 (4%)
Query: 1 MENPIPQKKEKKKNFRKRSYEEEEETT-----NKLSDDEEERRLALEEIKFLQKQRERKS 55
MEN ++K ++KN+RKR+ EE + N SDDE ERRLALEEIK LQKQRERKS
Sbjct: 1 MENS-KEEKPRRKNYRKRTPTEEHDQPPQSQQNNDSDDESERRLALEEIKLLQKQRERKS 59
Query: 56 GIPAIPSALQSAAAAGGGGLTKV---SEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLK 112
GIPA + QS G +K ++ G+KD+LVLQDTFAQETAVM E PNM+K
Sbjct: 60 GIPATLTLQQSQPGISSGLASKAVDKNDAGGDGGDKDDLVLQDTFAQETAVMDEGPNMVK 119
Query: 113 YVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQ 170
Y+ QELAKKRG+NID D+VEN+LK AEDELY IP+HLK KRNSEESSTQWTTGIAE+Q
Sbjct: 120 YIGQELAKKRGRNIDEEDQVENELKRAEDELYTIPDHLKVKKRNSEESSTQWTTGIAEIQ 179
Query: 171 LPIEYKLKNIEETEAAKKLLQEKRLM-GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 229
LPIEYKLKNIEETEAAKKLLQEKRLM GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP
Sbjct: 180 LPIEYKLKNIEETEAAKKLLQEKRLMVGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 239
Query: 230 ELYKDRGSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMR 279
ELYKDR QDD S+ ++S+DA G+ RQAATDQFMLERF+KRERHRV R
Sbjct: 240 ELYKDRSQQDDNFASKQNESSSDAPGAVQRQAATDQFMLERFKKRERHRVRR 291
>gi|18378951|ref|NP_563649.1| uncharacterized protein [Arabidopsis thaliana]
gi|332189295|gb|AEE27416.1| uncharacterized protein [Arabidopsis thaliana]
Length = 279
Score = 368 bits (944), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 205/281 (72%), Positives = 235/281 (83%), Gaps = 17/281 (6%)
Query: 12 KKNFRKRSYEEE--EETTNK--LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+NFRKRS+EEE + NK +S++EE+RRLALEE+KFLQK RERK GIPA+ S QS+
Sbjct: 4 KRNFRKRSFEEEEEDNDVNKAAISEEEEKRRLALEEVKFLQKLRERKLGIPALSSTAQSS 63
Query: 68 AAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G K EK E +GEK+ELVLQDTFAQETAV++EDPNM+KY+EQELAKKRG+NID
Sbjct: 64 I-----GKVKPVEKTETEGEKEELVLQDTFAQETAVLIEDPNMVKYIEQELAKKRGRNID 118
Query: 128 VNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 185
+ VEN+LK EDELYKIP+HLK KR+SEESSTQWTTGIAEVQLPIEYKLKNIEETEA
Sbjct: 119 DAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 178
Query: 186 AKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGS-QDDGAGS 244
AKKLLQE+RLMGR KS+FSIPSSYSADYFQRG+DYAEKLRREHPELYKDRG Q DG +
Sbjct: 179 AKKLLQERRLMGRPKSEFSIPSSYSADYFQRGKDYAEKLRREHPELYKDRGGPQADGEAA 238
Query: 245 RP-----TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
+P T+N+ D+ SRQAATDQ MLERFRKRER+RVMRR
Sbjct: 239 KPSTSSSTNNNADSGKSRQAATDQIMLERFRKRERNRVMRR 279
>gi|326512722|dbj|BAK03268.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531604|dbj|BAJ97806.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 272
Score = 303 bits (777), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 172/276 (62%), Positives = 207/276 (75%), Gaps = 14/276 (5%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
KNFRKRS E + SDDE+ RR+ALEEI+++QK RERK GIPA A +A+AAG
Sbjct: 3 KNFRKRSLESDAADN---SDDEDTRRVALEEIRYMQKLRERKLGIPAASVATGAASAAGA 59
Query: 73 GGLTKVSEKNEGDGE---KDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
+ + +++LVLQDTFAQETAV +EDPNML+YVE EL KKRGK I+VN
Sbjct: 60 TDGSSARGRGGSGAGAAGEEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRGKAIEVN 119
Query: 130 DRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 187
D+ D K DELY +P+HLK K+N EESSTQWTTGIAEVQLPIEYKL+NIEETEAAK
Sbjct: 120 DK---DDKDQVDELYVVPDHLKVRKKNMEESSTQWTTGIAEVQLPIEYKLRNIEETEAAK 176
Query: 188 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPT 247
K+LQE+RL G+ KSD +IPSSYSAD+F RGRDYAEKLRREHPELYK + SQ + G +PT
Sbjct: 177 KMLQERRLAGKTKSDANIPSSYSADFFHRGRDYAEKLRREHPELYKGQDSQANETGGKPT 236
Query: 248 DNSTDA---AGSRQAATDQFMLERFRKRERHRVMRR 280
D++ A R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 237 DSNNPGGPPAAHREAATDELLLERFRKREKFRVMRR 272
>gi|115453221|ref|NP_001050211.1| Os03g0374100 [Oryza sativa Japonica Group]
gi|31249717|gb|AAP46210.1| unknown protein [Oryza sativa Japonica Group]
gi|108708408|gb|ABF96203.1| Hepatocellular carcinoma-associated antigen 59 family protein,
expressed [Oryza sativa Japonica Group]
gi|113548682|dbj|BAF12125.1| Os03g0374100 [Oryza sativa Japonica Group]
gi|125586427|gb|EAZ27091.1| hypothetical protein OsJ_11022 [Oryza sativa Japonica Group]
gi|215708801|dbj|BAG94070.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740836|dbj|BAG96992.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 277
Score = 303 bits (775), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 172/281 (61%), Positives = 210/281 (74%), Gaps = 17/281 (6%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG 71
+KNFRKR+ E + + ++ RR+ALEEIK++QK RERK GIPA +A +++AA
Sbjct: 2 RKNFRKRNLEADAAADHSDD--DDARRVALEEIKYMQKLRERKLGIPAAAAAAGASSAAS 59
Query: 72 GGGLT-------KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK 124
G + GD EK++LVLQDTFAQETAV +EDPNML+YVE EL KKRGK
Sbjct: 60 ADGASPRGRGGGGGGLAAGGDAEKEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRGK 119
Query: 125 NIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEE 182
+DV D+ E D DELY +P+HLK K+NSEESSTQWTTGIAEVQLPIEYKL+NIEE
Sbjct: 120 KVDVKDKEEKD---QVDELYTVPDHLKVRKKNSEESSTQWTTGIAEVQLPIEYKLRNIEE 176
Query: 183 TEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGA 242
TEAAKK+LQEKRL G+ KSD +IPSSY+AD+F RG+DY EKLRREHPELYKD+GSQ +G
Sbjct: 177 TEAAKKMLQEKRLAGKTKSDANIPSSYNADFFHRGKDYTEKLRREHPELYKDQGSQANGT 236
Query: 243 GSRPT-DNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 280
G + N D AG+ R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 237 GGKSMGGNHPDGAGAGRREAATDELLLERFRKREKFRVMRR 277
>gi|125544064|gb|EAY90203.1| hypothetical protein OsI_11769 [Oryza sativa Indica Group]
Length = 278
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 172/282 (60%), Positives = 211/282 (74%), Gaps = 18/282 (6%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG 71
+KNFRKR+ E ++ + ++ RR+ALEEIK++QK RERK GIPA +A +++AA
Sbjct: 2 RKNFRKRNLEADDAADHSDD--DDARRVALEEIKYMQKLRERKLGIPAAAAAAGASSAAS 59
Query: 72 GGGLT--------KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG 123
G + GD EK++LVLQDTFAQETAV +EDPNML+YVE EL KKRG
Sbjct: 60 ADGASPRGRGGGGGGGLAAGGDAEKEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRG 119
Query: 124 KNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIE 181
K +DV D+ E D DELY +P+HLK K+NSEESSTQWTTGIAEVQLPIEYKL+NIE
Sbjct: 120 KKVDVKDKEEKD---QVDELYIVPDHLKVRKKNSEESSTQWTTGIAEVQLPIEYKLRNIE 176
Query: 182 ETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG 241
ETEAAKK+LQEKRL G+ KSD +IPSSY+AD+F RG+DY EKLRREHPELYKD+GSQ +G
Sbjct: 177 ETEAAKKMLQEKRLAGKTKSDANIPSSYNADFFHRGKDYTEKLRREHPELYKDQGSQANG 236
Query: 242 AGSRPT-DNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 280
G + N D AG+ R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 237 TGGKSMGGNHPDGAGAGRREAATDELLLERFRKREKFRVMRR 278
>gi|357112143|ref|XP_003557869.1| PREDICTED: uncharacterized protein LOC100821850 [Brachypodium
distachyon]
Length = 273
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 176/278 (63%), Positives = 207/278 (74%), Gaps = 15/278 (5%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG 71
+KNFRKR+ E + T SDDE+ RR+ALEEIK++QK RERK GIPA A +AA
Sbjct: 2 QKNFRKRNLEPD---TADHSDDEDVRRVALEEIKYMQKLRERKLGIPAASVATGAAATTT 58
Query: 72 GGGLTKVSEKNEG----DGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G + + +K++LVLQDTFAQETAV +EDPNML+YVE EL KKRGK I+
Sbjct: 59 DGSSARGRGGGGAAAASETDKEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRGKTIE 118
Query: 128 VNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 185
VND+ E K DELY +P+HLK K+N EESSTQWTTGIAEVQLPIEYKL+NIEETEA
Sbjct: 119 VNDKDE---KDDVDELYVVPDHLKVKKKNMEESSTQWTTGIAEVQLPIEYKLRNIEETEA 175
Query: 186 AKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR 245
AKKLLQEKRL G+ KSD +IPSSYSADYF RGRDYAEKLRREHPELYK + Q + G +
Sbjct: 176 AKKLLQEKRLAGKTKSDANIPSSYSADYFHRGRDYAEKLRREHPELYKGQDLQANETGGK 235
Query: 246 PT-DNSTDA--AGSRQAATDQFMLERFRKRERHRVMRR 280
PT N+ D A R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 236 PTGSNNPDGPPARRREAATDELLLERFRKREKFRVMRR 273
>gi|148907301|gb|ABR16788.1| unknown [Picea sitchensis]
Length = 272
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 166/250 (66%), Positives = 191/250 (76%), Gaps = 8/250 (3%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPA--IPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVL 93
+RRLALEE+KFLQKQRERK+GI A I + S + K EG+GEK+ELVL
Sbjct: 26 QRRLALEELKFLQKQRERKAGIAANEISEVVVSKIGDNNSNNNNNNNKAEGEGEKEELVL 85
Query: 94 QDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK-- 151
QDTFAQETAV VEDPNMLKYVEQELAKKRGK I N + K ED+LY +P+HLK
Sbjct: 86 QDTFAQETAVTVEDPNMLKYVEQELAKKRGKQIGKNT---EETKPPEDDLYVVPDHLKVR 142
Query: 152 KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR-LMGRAKSDFSIPSSYS 210
+RNSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAKK LQ+KR +GR + SIPSSYS
Sbjct: 143 RRNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAKKQLQDKRPFVGRGRPQSSIPSSYS 202
Query: 211 ADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFR 270
ADYFQRGR+YAEKLRR+HPELYKD+ +Q+ G+ S + RQAATD+ MLERFR
Sbjct: 203 ADYFQRGREYAEKLRRDHPELYKDKDAQNSGSISGEIAPEGNVGNRRQAATDEIMLERFR 262
Query: 271 KRERHRVMRR 280
KRER R+MRR
Sbjct: 263 KRERSRLMRR 272
>gi|226493466|ref|NP_001148850.1| LOC100282469 [Zea mays]
gi|194701872|gb|ACF85020.1| unknown [Zea mays]
gi|195622612|gb|ACG33136.1| hepatocellular carcinoma-associated antigen 59 family protein [Zea
mays]
gi|414866976|tpg|DAA45533.1| TPA: Hepatocellular carcinoma-associated antigen 59 family [Zea
mays]
Length = 269
Score = 289 bits (739), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 166/274 (60%), Positives = 201/274 (73%), Gaps = 13/274 (4%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
+NFRKR E++ + + DE+ RR+ALEEIK++QK RERK GIPA +A S +
Sbjct: 3 RNFRKRGIEQDTDDRSD---DEDTRRIALEEIKYMQKLRERKLGIPADLAA-ASTNGSSA 58
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
GL G+ EK++LVLQDTFAQETAV +EDPNML+YVE ELAKKRGK +DV +
Sbjct: 59 RGLLGTGAAVAGEAEKEDLVLQDTFAQETAVTIEDPNMLRYVETELAKKRGKMVDVGHKE 118
Query: 133 ENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 190
E D H DELY +P+HLK K+NSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAKKLL
Sbjct: 119 EMD--HV-DELYTVPDHLKVKKKNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAKKLL 175
Query: 191 QEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG-AGSRPTDN 249
QEKRL + K D +IPSSYSADYF RG++Y EKLRRE+P LYKD S+ G G + TD
Sbjct: 176 QEKRLARKPKPDANIPSSYSADYFHRGKEYDEKLRRENPGLYKDNDSRPSGNPGGKATDT 235
Query: 250 STD---AAGSRQAATDQFMLERFRKRERHRVMRR 280
AG R+AA+D+ ML+RFRKRE+ R +RR
Sbjct: 236 KNPDGVGAGRREAASDELMLQRFRKREKFRALRR 269
>gi|242035643|ref|XP_002465216.1| hypothetical protein SORBIDRAFT_01g034230 [Sorghum bicolor]
gi|241919070|gb|EER92214.1| hypothetical protein SORBIDRAFT_01g034230 [Sorghum bicolor]
Length = 270
Score = 288 bits (736), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 167/274 (60%), Positives = 204/274 (74%), Gaps = 12/274 (4%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
+NFRKR E + + + DE+ RR+ALEEIK++QK RERK GIPA +A + ++
Sbjct: 3 RNFRKRGIEPDTDDRSD---DEDTRRVALEEIKYMQKLRERKLGIPAGTAAASTNGSSAR 59
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
GG G+ EK++LVLQDTFAQETAV +EDPNML+YVE ELAKKRGK +DV +
Sbjct: 60 GGRVGSGAAAAGEAEKEDLVLQDTFAQETAVTIEDPNMLRYVETELAKKRGKMVDVGHKE 119
Query: 133 ENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 190
E D H DELY +P+HLK K+NSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAKK+L
Sbjct: 120 EMD--HV-DELYTVPDHLKVKKKNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAKKVL 176
Query: 191 QEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG-AGSRPTDN 249
QEKRL + KSD +IPSSYSADYF RG++Y EKLRRE+P LYKD S+ G +G + TD
Sbjct: 177 QEKRLASKPKSDANIPSSYSADYFHRGKEYDEKLRRENPGLYKDNDSRPRGSSGGKATDT 236
Query: 250 STD---AAGSRQAATDQFMLERFRKRERHRVMRR 280
AG R+AA+D+FMLERFRKRE+ R +RR
Sbjct: 237 KNPGGVGAGRREAASDEFMLERFRKREKFRALRR 270
>gi|168003604|ref|XP_001754502.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694123|gb|EDQ80472.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 267
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 170/274 (62%), Positives = 203/274 (74%), Gaps = 18/274 (6%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
K FRKR+ E E SDD+EE R LEE+KFLQKQRER++G+ A + G
Sbjct: 6 KRFRKRNAPEAGEQ----SDDDEEIRSTLEEVKFLQKQRERRNGVVA-----NQLGQSLG 56
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
G KV++K EG+GEK+E VLQDTFAQETAV +EDPNMLKY+EQE+AKKRG+ + V V
Sbjct: 57 GLNPKVADKGEGEGEKEEQVLQDTFAQETAVTIEDPNMLKYIEQEMAKKRGRELGV---V 113
Query: 133 ENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 190
E + K ED+LY IPEHLK +RN+EESSTQWTTGIAEVQLPIEYKLKNIEETEAAKK L
Sbjct: 114 EEESKPPEDDLYVIPEHLKVRRRNAEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKQL 173
Query: 191 QEKR-LMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDN 249
Q+KR +GR ++ SIP+SYSADYFQRGR+YAEKLR EHPE +KD+G P +
Sbjct: 174 QDKRPFVGRGRTQSSIPASYSADYFQRGREYAEKLRSEHPEPFKDKGRGGGAGRGDPIGS 233
Query: 250 ST---DAAGSRQAATDQFMLERFRKRERHRVMRR 280
++ D RQAATD+ MLERFRKRER R+MRR
Sbjct: 234 NSEKLDLGNRRQAATDEIMLERFRKRERSRLMRR 267
>gi|13937157|gb|AAK50072.1|AF372932_1 At1g02330/T6A9_12 [Arabidopsis thaliana]
gi|22137212|gb|AAM91451.1| At1g02330/T6A9_12 [Arabidopsis thaliana]
Length = 179
Score = 280 bits (715), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 141/179 (78%), Positives = 157/179 (87%), Gaps = 8/179 (4%)
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNSEESSTQWTTGIA 167
M+KY+EQELAKKRG+NID + VEN+LK EDELYKIP+HLK KR+SEESSTQWTTGIA
Sbjct: 1 MVKYIEQELAKKRGRNIDDAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIA 60
Query: 168 EVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRRE 227
EVQLPIEYKLKNIEETEAAKKLLQE+RLMGR KS+FSIPSSYSADYFQRG+DYAEKLRRE
Sbjct: 61 EVQLPIEYKLKNIEETEAAKKLLQERRLMGRPKSEFSIPSSYSADYFQRGKDYAEKLRRE 120
Query: 228 HPELYKDRGS-QDDGAGSRP-----TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 280
HPELYKDRG Q DG ++P T+N+ D+ SRQAATDQ MLERFRKRER+RVMRR
Sbjct: 121 HPELYKDRGGPQADGEAAKPSTSSSTNNNADSGKSRQAATDQIMLERFRKRERNRVMRR 179
>gi|388490564|gb|AFK33348.1| unknown [Lotus japonicus]
Length = 228
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 136/198 (68%), Positives = 155/198 (78%), Gaps = 10/198 (5%)
Query: 8 KKEKKKNFRKRSYEEEEET------TNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
K+++KKN+RKRS E++ N SDDE ERR+ALEEIK LQKQRERKSGI A P
Sbjct: 5 KQQRKKNYRKRSAPVEQDQLPQSQDNNNESDDERERRMALEEIKLLQKQRERKSGIAANP 64
Query: 62 SALQSAAAAGGGGLTKVSEKN-EGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
S LQS A G K +EKN G+KD+LVLQDTFAQETAVM EDPNM+KYVEQELAK
Sbjct: 65 S-LQSQAVVTAGSAAKPAEKNDGDGGDKDDLVLQDTFAQETAVMDEDPNMVKYVEQELAK 123
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKIPEHL--KKRNSEESSTQWTTGIAEVQLPIEYKLK 178
KRG+ ID D++EN+LK AEDELYKIPEHL KKRNSEESSTQWTTGIAE+QLPIEYKLK
Sbjct: 124 KRGRKIDEADQIENELKRAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEIQLPIEYKLK 183
Query: 179 NIEETEAAKKLLQEKRLM 196
NIEETEAAK +++ L
Sbjct: 184 NIEETEAAKNFYRKRGLW 201
>gi|168015241|ref|XP_001760159.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688539|gb|EDQ74915.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 247
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 143/241 (59%), Positives = 169/241 (70%), Gaps = 16/241 (6%)
Query: 33 DEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELV 92
D E R LEE+KFLQKQRER +G+ A GG + V+EK EG+GE +E V
Sbjct: 1 DGEFCRSTLEEVKFLQKQRERSNGVVA-----NQLGQPAGGANSNVAEKGEGEGENEEQV 55
Query: 93 LQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK- 151
LQDTFAQETAV +EDPNMLKY+EQE+AKKRG+ V ++K E +LY IPEHLK
Sbjct: 56 LQDTFAQETAVTIEDPNMLKYIEQEMAKKRGRE---TSEVGEEVKPPEVDLYVIPEHLKV 112
Query: 152 -KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR-LMGRAKSDFSIPSSY 209
KRN EESSTQWTTGIAEVQLP+EYKLKNIEETEAAKK LQ KR +GR +S SIP+SY
Sbjct: 113 RKRNGEESSTQWTTGIAEVQLPVEYKLKNIEETEAAKKQLQGKRPFVGRGRSQSSIPASY 172
Query: 210 SADYFQRGRDYAEKLRREHPELYKDR--GSQDDGAGSRPTDNST---DAAGSRQAATDQF 264
+ADYFQRGR+YAEKLR +HPE Y+D+ G PT + + D RQAATD+
Sbjct: 173 NADYFQRGREYAEKLRSDHPEGYRDKGRGEGRGRGRGGPTGSKSETYDVRNRRQAATDEI 232
Query: 265 M 265
M
Sbjct: 233 M 233
>gi|9857529|gb|AAG00884.1|AC064879_2 Hypothetical protein [Arabidopsis thaliana]
Length = 178
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 118/170 (69%), Positives = 139/170 (81%), Gaps = 11/170 (6%)
Query: 12 KKNFRKRSYEEE--EETTNK--LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+NFRKRS+EEE + NK +S++EE+RRLALEE+KFLQK RERK GIPA+ S QS+
Sbjct: 4 KRNFRKRSFEEEEEDNDVNKAAISEEEEKRRLALEEVKFLQKLRERKLGIPALSSTAQSS 63
Query: 68 AAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G K EK E +GEK+ELVLQDTFAQETAV++EDPNM+KY+EQELAKKRG+NID
Sbjct: 64 I-----GKVKPVEKTETEGEKEELVLQDTFAQETAVLIEDPNMVKYIEQELAKKRGRNID 118
Query: 128 VNDRVENDLKHAEDELYKIPEHL--KKRNSEESSTQWTTGIAEVQLPIEY 175
+ VEN+LK EDELYKIP+HL KKR+SEESSTQWTTGIAEVQLPIEY
Sbjct: 119 DAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIAEVQLPIEY 168
>gi|147765932|emb|CAN62422.1| hypothetical protein VITISV_020607 [Vitis vinifera]
Length = 128
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/124 (71%), Positives = 100/124 (80%), Gaps = 14/124 (11%)
Query: 171 LPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRR---- 226
L + YKL+NIEETEAAKKLLQ+KRLMGR K++F+IPSSYSADYFQRGRDYAEKLRR
Sbjct: 5 LSVWYKLRNIEETEAAKKLLQDKRLMGRTKTEFNIPSSYSADYFQRGRDYAEKLRRECHF 64
Query: 227 ----------EHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHR 276
EHPELYKD+G QD+G GSR D ST+ AG RQAATD+FML+RFRKRERHR
Sbjct: 65 LLLTRYEIFAEHPELYKDKGVQDNGGGSRLPDASTEVAGRRQAATDEFMLDRFRKRERHR 124
Query: 277 VMRR 280
VMRR
Sbjct: 125 VMRR 128
>gi|302753264|ref|XP_002960056.1| hypothetical protein SELMODRAFT_75683 [Selaginella moellendorffii]
gi|302804660|ref|XP_002984082.1| hypothetical protein SELMODRAFT_119432 [Selaginella moellendorffii]
gi|300148434|gb|EFJ15094.1| hypothetical protein SELMODRAFT_119432 [Selaginella moellendorffii]
gi|300170995|gb|EFJ37595.1| hypothetical protein SELMODRAFT_75683 [Selaginella moellendorffii]
Length = 132
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 94/139 (67%), Positives = 107/139 (76%), Gaps = 9/139 (6%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTF 97
RLALEE+K LQKQR R+ G+ A P AA+ G K S+K E +GEK+ELVLQDTF
Sbjct: 1 RLALEEVKLLQKQRGRRCGVMANP-----VAASPGLDRVKSSDKVEVEGEKEELVLQDTF 55
Query: 98 AQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--KRNS 155
AQETAV VEDPNMLKYVEQELAKKRG+ + V+ D K AED+LY IP+HLK KRNS
Sbjct: 56 AQETAVNVEDPNMLKYVEQELAKKRGRQ-ESGGTVDAD-KPAEDDLYVIPDHLKVRKRNS 113
Query: 156 EESSTQWTTGIAEVQLPIE 174
EESSTQWTTGIAEVQLP+E
Sbjct: 114 EESSTQWTTGIAEVQLPLE 132
>gi|395540548|ref|XP_003772215.1| PREDICTED: uncharacterized protein C9orf78 homolog [Sarcophilus
harrisii]
Length = 288
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 146/289 (50%), Gaps = 34/289 (11%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP----------- 61
K FR+R + E E+ ++ D EE RL LEE K +Q R R +G+ A+
Sbjct: 5 KTFRRRRADSESESDDQ---DSEEVRLKLEETKEVQSLRRRPNGVSAVALLVGEKVQEET 61
Query: 62 SALQSAAAAGGGGLT---KVSEKNEGD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQE 117
+ + GG+ K+ E+N+ E+++L L +F+ ET ED +M+KY+E E
Sbjct: 62 ALVDDPFQVKTGGMVDMKKLKERNKDRISEEEDLNLGTSFSAETNRRDEDADMMKYIETE 121
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPI 173
L K++G I N+ + LK+AED LY++PE ++ K+ E S Q +GI EV L I
Sbjct: 122 LKKRKG--IVENEEQKVKLKNAEDCLYELPESIRVSSAKKTEEMLSNQMLSGIPEVDLGI 179
Query: 174 EYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLR---REHP 229
E K+KNI TE AK +LL E+R + +P++ + +Y Q R Y E+L R H
Sbjct: 180 EAKIKNIISTEDAKARLLAEQRNKKKDSETSFVPTNMAVNYVQHNRFYHEELHAPVRRHK 239
Query: 230 ELYKDR----GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
E K R G + R N + ATD + E+F+K R
Sbjct: 240 EEPKTRPLRVGDTEKPEAERSPPNRKRPPNEK--ATDDYHYEKFKKMNR 286
>gi|126277132|ref|XP_001372334.1| PREDICTED: uncharacterized protein C9orf78-like [Monodelphis
domestica]
Length = 288
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 146/288 (50%), Gaps = 32/288 (11%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA---- 68
K FRKR + E E+ + D EE RL LEE K +Q R R +G+ A+ +
Sbjct: 5 KTFRKRRDDSESESDEQ---DSEEVRLKLEETKEVQSLRRRPNGVSAVALLVGEKVQEET 61
Query: 69 ----------AAGGGGLTKVSEKNEGD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQE 117
A G + K+ E+N+ E+++L L +F+ ET ED +M+KY E E
Sbjct: 62 TLVDDPFKIKAGGMVDMKKLKERNKDRINEEEDLNLGTSFSAETNRRDEDADMMKYFETE 121
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPI 173
L K++G I N+ + LK+AED LY++PE+++ K+ E S Q +GI EV L I
Sbjct: 122 LKKRKG--IVENEEQKVKLKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGI 179
Query: 174 EYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLR---REHP 229
+ K+KNI TE AK +LL E++ + +P++ + +Y Q R Y E+L R H
Sbjct: 180 DAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRHK 239
Query: 230 ELYKDRGSQDDGAGSRPTDNSTDAAGSR---QAATDQFMLERFRKRER 274
E K R + G +P + R + ATD + E+F+K R
Sbjct: 240 EEPKTRPLR-VGDTEKPEPERSPPNRKRPPNEKATDDYHYEKFKKMNR 286
>gi|156401402|ref|XP_001639280.1| predicted protein [Nematostella vectensis]
gi|156226407|gb|EDO47217.1| predicted protein [Nematostella vectensis]
Length = 285
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 152/297 (51%), Gaps = 54/297 (18%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRL-ALEEIKFLQKQRERKSGIPAIPSALQSAA-- 68
K+N+R++ E+E DDE E + ALEE + +QK R+R G+ A+ AL
Sbjct: 3 KRNYRRKRITEDE-------DDEAEVAIEALEERREIQKFRKRPKGVSAVGLALGKKVDI 55
Query: 69 ---------AAGGGGLTKVSE-------KNEGDGEKDELVLQDTFAQETAVMVEDPNMLK 112
GGL ++++ EG+ + L + FA ET ED +MLK
Sbjct: 56 EDEVESDPFKLKTGGLVQINDLIQDRERDREGEDSGKSINLGENFAAETNRREEDTHMLK 115
Query: 113 YVEQELAKKRGK---NIDVNDRVENDLKHAEDELYKIPEHLKKRN----SEES-STQWTT 164
Y+E+E++K++G+ D+N +V + K ED L+++P+H+ R+ SEE S Q +
Sbjct: 116 YIEEEISKRKGQAESGEDIN-KVRDKFKTKEDLLFQVPKHIDVRSRLMKSEEMLSNQMLS 174
Query: 165 GIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR----- 218
GI EV L I K++NIE TE AK K+++E+R + +P++ ++++ R
Sbjct: 175 GIPEVDLGISAKIRNIEATEEAKMKVIEEQRSKRKHGPTEMVPTNMASNFMLHSRFMDEQ 234
Query: 219 DYAEKLRREHPELYKDRGSQDDG-AGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
AE RR+ L R ++DDG A +P + ATD F E+FRKR R
Sbjct: 235 KNAEAERRKTATL---RATKDDGKAKPQPV---------VEKATDDFYYEKFRKRAR 279
>gi|195427179|ref|XP_002061656.1| GK17111 [Drosophila willistoni]
gi|194157741|gb|EDW72642.1| GK17111 [Drosophila willistoni]
Length = 296
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 148/303 (48%), Gaps = 55/303 (18%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL 64
I KK+ +KN R+R + DD++E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 14 IVFKKKSRKNLRQRKNSD---------DDDKEEQLTLDEIKERQRLRQRPNGVSLVGLAL 64
Query: 65 QSAAA------------AGGGGLT-----KVSEKNEGDGEKDELVLQDTFAQETAVMVED 107
A GGL K + E D D + + F+ ET ED
Sbjct: 65 GKKIAPEEELAIKDPFNVKTGGLVNMQTLKSGKMKEADDAYD-VGIGTQFSAETNKRDED 123
Query: 108 PNMLKYVEQELAKKRGKNIDVNDRVEND-------LKHAEDELYKIPEHLKKRNSEES-- 158
M+KY+EQEL K++G D + +ND L + LY +P+HL++ +S S
Sbjct: 124 EEMMKYIEQELQKRKGGATDADTGGDNDDSDAHKYLTPEDAALYALPDHLRQSSSHRSEE 183
Query: 159 --STQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQ 215
S Q GI EV L I+ K++NIE TE AK KLLQ+ + S F +P++ + ++ Q
Sbjct: 184 MLSNQMLNGIPEVDLGIQAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQ 242
Query: 216 RGR----DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRK 271
R D E+ RR+ E KD G++ + T+ G ++ ATD + ++FRK
Sbjct: 243 HNRFNIEDNNEQRRRKREE--KD--------GNKAAHHQTNPNGVKR-ATDDYHYDKFRK 291
Query: 272 RER 274
+ R
Sbjct: 292 QFR 294
>gi|241105597|ref|XP_002410015.1| conserved hypothetical protein [Ixodes scapularis]
gi|215492857|gb|EEC02498.1| conserved hypothetical protein [Ixodes scapularis]
Length = 249
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 125/264 (47%), Gaps = 47/264 (17%)
Query: 41 LEEIKFLQKQRERKSGIPAI------------PSALQSAAAAGGGGLTKVSE---KNEGD 85
LE+ K +QK R+R +G+ I ++ GG+ + K
Sbjct: 1 LEDTKEIQKLRKRPNGVSVIGLNLGKKLTTKEELVIEDPFKLKTGGMIDMKALKGKRITM 60
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E D + L +TF+ ET ED +M+KY+E+ELAK+RGK D ++ ED L+
Sbjct: 61 EELDAVNLGNTFSVETNQRDEDADMMKYIEEELAKRRGKGQDTETDSRDEGVDPEDVLFH 120
Query: 146 IPEHLKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKS 201
+PEHL+K +S++S S Q +GI EV L IE +++NIE TE AK L +R+ + +
Sbjct: 121 VPEHLRKSSSKKSEEMLSNQMLSGIPEVDLGIEERIRNIEATEEAKLKLLRERMAKKERE 180
Query: 202 DFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR-----------PTDNS 250
+P++ + ++ Q +R + DDG+ SR P
Sbjct: 181 TSFVPTNMAVNFVQH-----------------NRFNIDDGSRSRYARRVPREKEPPVAKP 223
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
+AATD F E+F+K+ R
Sbjct: 224 VVVIAEAEAATDDFHFEKFKKQFR 247
>gi|260797455|ref|XP_002593718.1| hypothetical protein BRAFLDRAFT_63995 [Branchiostoma floridae]
gi|229278946|gb|EEN49729.1| hypothetical protein BRAFLDRAFT_63995 [Branchiostoma floridae]
Length = 291
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 147/297 (49%), Gaps = 40/297 (13%)
Query: 10 EKKKNFRKR--SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
+K+KNFR+R S +EE+E+ ++S LEE K QK R+R G+ A AL
Sbjct: 4 QKRKNFRRRRDSSDEEDESVQEVSS-------ILEEAKEAQKFRQRPKGVSATALALGKK 56
Query: 68 AAAGG-----------GGLT---KVSEKN-EGDGEKDELVLQD---TFAQETAVMVEDPN 109
+ GG+ + ++N + GE+D+ L D +F+ ET E
Sbjct: 57 LSGNAALVNDPFKLRTGGMVDMKAIKDRNRDRTGEEDDKDLSDLGTSFSAETNTRDEHAE 116
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTG 165
M+KY+E E+ K++G+ + + + +K AED LY++P+ LK R+ E S Q +G
Sbjct: 117 MMKYIEVEMKKRKGQEKE-KEASQAKIKGAEDLLYELPDRLKAATSTRSEEMLSNQMLSG 175
Query: 166 IAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEK-- 223
I EV L I+ K++NIE TE AK+ LQE+ R K +P + + +Y Q R Y E
Sbjct: 176 IPEVDLGIQEKIRNIEATEDAKQRLQEQMRKKRDKGTSFVPVNMAVNYVQHNRFYREDTE 235
Query: 224 ----LRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHR 276
+++E P+ + + T + G R ATD F E+F+K+ R
Sbjct: 236 TKKVVKQEAPKPRPLKVGDTEPPIMEETSQTKKRPGER--ATDDFHFEKFKKQMTRR 290
>gi|395506250|ref|XP_003757448.1| PREDICTED: uncharacterized protein C9orf78 homolog [Sarcophilus
harrisii]
Length = 287
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/263 (32%), Positives = 134/263 (50%), Gaps = 29/263 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE K +Q R R G+ A+ + + GG+ K+ E+N+
Sbjct: 26 RLKLEETKEVQSLRRRPKGVSAVALLVGEKVQEETTLVDDPFNINTGGMVDMKKIKERNK 85
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ ++ LK+AED
Sbjct: 86 DRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IMENEELKVKLKNAEDC 143
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE ++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 144 LYELPESIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 203
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAA 254
+ +P++ + +Y Q R Y E+L R H E K R + G +P +
Sbjct: 204 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRHKEEPKTRPLR-VGDTEKPEPEKSPPN 262
Query: 255 GSR---QAATDQFMLERFRKRER 274
R + ATD + E+F+K R
Sbjct: 263 RKRPPNEKATDDYHYEKFKKMNR 285
>gi|224073514|ref|XP_002198575.1| PREDICTED: uncharacterized protein C9orf78 homolog [Taeniopygia
guttata]
Length = 289
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 90/266 (33%), Positives = 136/266 (51%), Gaps = 29/266 (10%)
Query: 35 EERRLALEEIKFLQKQRERKSGIPAIP----SALQSAAA-------AGGGGLTKVSEKNE 83
EE RL LEE K +Q R+R +G+ A+ LQ A GG+ + + E
Sbjct: 25 EEVRLKLEEAKEVQSLRKRPNGVSAVALLVGEKLQEEATLVDDPFKIKSGGMVDMKKLKE 84
Query: 84 GD----GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+A
Sbjct: 85 RGKDRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNA 142
Query: 140 EDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 194
ED LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK KLL E++
Sbjct: 143 EDSLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKAKLLAEQQ 202
Query: 195 LMGRAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNST 251
+ +P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 203 NKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERS 261
Query: 252 DAAGSR---QAATDQFMLERFRKRER 274
R + ATD + E+F+K R
Sbjct: 262 PPNRKRPLNEKATDDYHYEKFKKMNR 287
>gi|195375178|ref|XP_002046380.1| GJ12867 [Drosophila virilis]
gi|194153538|gb|EDW68722.1| GJ12867 [Drosophila virilis]
Length = 298
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 144/297 (48%), Gaps = 51/297 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDD + E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKSSRKNLRQR----------KNSDDGDNEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGD-GEKD---ELVLQDTFAQETAVMVEDPNM 110
A GGL + + G E D ++ + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKIGGLVNMQQMKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNIDVNDRVEND----LKHAEDELYKIPEHLKKRNSEES----STQW 162
+KY+EQEL K++G D N+ ++D L + LY +P+HL++ +S S S Q
Sbjct: 131 MKYIEQELQKRKGGAADENENDDSDAHKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQM 190
Query: 163 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR--- 218
GI EV L I K+ NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 LNGIPEVDLGIHAKIHNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFNI 249
Query: 219 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D E+ RR+ +D ++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 250 EDNNEQRRRKR---------EDKDGNNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 296
>gi|195135385|ref|XP_002012113.1| GI16613 [Drosophila mojavensis]
gi|193918377|gb|EDW17244.1| GI16613 [Drosophila mojavensis]
Length = 300
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 146/299 (48%), Gaps = 53/299 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK+ +KN R+R K SDD + E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKKPRKNLRQR----------KNSDDGDNEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGD----GEKDELVLQDTFAQETAVMVEDPNM 110
A GGL + G + ++ + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKIGGLVNMQTIKSGKMKEVDDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNID--VNDRVEND----LKHAEDELYKIPEHLKKRNSEES----ST 160
+KY+EQEL K++G D NDR + D + + LY +PEHL++ +S S S
Sbjct: 131 MKYIEQELQKRKGGAADENSNDRDDRDAHKYMSPEDAALYALPEHLRQSSSHRSEEMLSN 190
Query: 161 QWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR- 218
Q GI EV L I+ K+ NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 QMLNGIPEVDLGIQAKIHNIEATEEAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRF 249
Query: 219 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D +E+ RR+ ++ A ++ N T+ G ++ ATD + ++FRK+ R
Sbjct: 250 KIEDSSEQRRRKR---------ENREADNKSARNQTNPNGVKR-ATDDYHYDKFRKQFR 298
>gi|195012263|ref|XP_001983556.1| GH15962 [Drosophila grimshawi]
gi|193897038|gb|EDV95904.1| GH15962 [Drosophila grimshawi]
Length = 300
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 145/299 (48%), Gaps = 53/299 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDD E+E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKSSRKNLRQR----------KNSDDGEKEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNM 110
A GGL + G ++ E + + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKMGGLVNMQTLKSGKMKEPEDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNI-DVNDRVENDLKH----AED-ELYKIPEHLKKRNSEES----ST 160
+KY+EQEL K++G D D ++ H ED LY +P+HL++ +S S S
Sbjct: 131 MKYIEQELQKRKGGGADDSTDNADDGDAHKYLTPEDAALYALPDHLRQSSSHRSEEMLSN 190
Query: 161 QWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR- 218
Q GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 QMLNGIPEVDLGIHAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRF 249
Query: 219 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D E+ RR+ +D A S+ + T+ G ++ ATD + ++FRK+ R
Sbjct: 250 NIEDSNEQRRRKR---------EDKEAKSKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 298
>gi|147904475|ref|NP_001087905.1| chromosome 9 open reading frame 78 [Xenopus laevis]
gi|51950077|gb|AAH82454.1| MGC84248 protein [Xenopus laevis]
Length = 290
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 146/292 (50%), Gaps = 38/292 (13%)
Query: 13 KNFRKRSYEEEEETTNKLSDDE---EERRLALEEIKFLQKQRERKSGIPA--------IP 61
+NFR+R +E +DE E R+ LEE K +Q R+R++G+ A +P
Sbjct: 5 RNFRRRKASSSDEEV----EDEGVTREVRMKLEEAKEVQSLRKRQNGVSAAALLVGEKLP 60
Query: 62 SALQSA----AAAGGGGLTKVSEKNEGD---GEKDELVLQDTFAQETAVMVEDPNMLKYV 114
+ A GG + K+ G GE+++L L +F+ ET ED +M+KY+
Sbjct: 61 EEVNMADDPFKMQNGGMVDMKKLKDRGKDRIGEEEDLNLGTSFSAETNRRDEDADMMKYI 120
Query: 115 EQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQ 170
E EL K++G I N+ + K AED LY++PE +K K+ E S Q +GI EV
Sbjct: 121 ETELKKRKG--IVENEEKKVKPKSAEDCLYELPESIKVSSAKKTEEMLSNQMLSGIPEVD 178
Query: 171 LPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAE----KLR 225
L I+ K+KNI TE AK +LL E++ + K +P++ + +Y Q R Y E +R
Sbjct: 179 LGIDAKIKNIISTEEAKARLLAEQQNKKKDKHTSFVPTNMAVNYVQHNRFYQEDQNTPMR 238
Query: 226 REHPELYKDRGSQDDGAGSRPTDNSTDAA---GSRQAATDQFMLERFRKRER 274
R H E K R + G +P + S + ATD + E+F+K R
Sbjct: 239 R-HKEEPKPRPLR-VGDTEKPEPEKSPPNRKRPSNEKATDDYHYEKFKKMNR 288
>gi|229365864|gb|ACQ57912.1| C9orf78 [Anoplopoma fimbria]
Length = 292
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 149/300 (49%), Gaps = 52/300 (17%)
Query: 13 KNFRKR---SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAI--------- 60
KNFR+R S EE+ETT ++ R LEE K LQ R+R++G+
Sbjct: 5 KNFRRRRDSSDVEEDETTIEV-------RSKLEEAKELQSLRKRQTGVSVTALLVGEKLP 57
Query: 61 --------PSALQSAAAAGGGGLTKVSEKNEGDGEKD-ELVLQDTFAQETAVMVEDPNML 111
P L++ G + K ++N E++ +L L +F+ ET ED +M+
Sbjct: 58 PEDEIDNDPFKLKTG---GVVDMKKAKDRNRDMTEEETDLNLGTSFSAETNRRDEDADMM 114
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIA 167
KY+E EL KK+G +V+ +K+AED LY++PE ++ K+ E S Q +GI
Sbjct: 115 KYIETELKKKKGLVEAEEQKVK--VKNAEDHLYELPESIRVNSAKKTEEMLSNQMLSGIP 172
Query: 168 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL-- 224
EV L I+ K+KNI +TE AK KLL E+R + + +P++ + +Y Q R Y E +
Sbjct: 173 EVDLGIDAKIKNIIQTEDAKAKLLAEQRNKKKDQGTSFVPTNIAVNYVQHSRFYREDVNA 232
Query: 225 ------RREHPELYKDRGSQDDGAGSRPTDNSTDAA----GSRQAATDQFMLERFRKRER 274
RE P+ R + G P + +T A + + ATD + E+F+K R
Sbjct: 233 PQRHHRHREEPKARPLRVGDTEKPG--PEEVTTPANFRKRPNNEKATDDYHYEKFKKMNR 290
>gi|326930340|ref|XP_003211305.1| PREDICTED: uncharacterized protein C9orf78-like [Meleagris
gallopavo]
Length = 289
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/263 (33%), Positives = 134/263 (50%), Gaps = 29/263 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP----SALQSAAA-------AGGGGLTKVSEKNEGD- 85
RL LEE K +Q R+R +G+ A+ LQ A GG+ + + E
Sbjct: 28 RLKLEEAKEVQSLRKRPNGVSAVALLVGEKLQEEATLVDDPFKIKSGGMVDMKKLKERGK 87
Query: 86 ---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+AED
Sbjct: 88 DRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNAEDS 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK KLL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKAKLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAA 254
+ +P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERSPPN 264
Query: 255 GSR---QAATDQFMLERFRKRER 274
R + ATD + E+F+K R
Sbjct: 265 RKRPPNEKATDDYHYEKFKKMNR 287
>gi|410925759|ref|XP_003976347.1| PREDICTED: uncharacterized protein C9orf78 homolog [Takifugu
rubripes]
Length = 291
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 149/297 (50%), Gaps = 47/297 (15%)
Query: 13 KNFRKR--SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAI---------- 60
KN R+R S ++EE +D EE R +EE K LQ R+R++G+
Sbjct: 5 KNLRRRRDSSDDEE------NDIAEELRSKVEEAKELQSLRKRQTGVSLTALLVGEKLPP 58
Query: 61 -------PSALQSAAAAGGGGLTKVSEKNEGDGEKDE--LVLQDTFAQETAVMVEDPNML 111
P L++ G + KV ++N D +DE L L +F+ ET ED +M+
Sbjct: 59 DAEIDNDPFKLKTG---GVVDMKKVKDRNR-DMTEDETDLNLGTSFSVETNRRDEDADMM 114
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIA 167
KY+E EL K++G+ +V+ +K+AED LY++PE+++ K+ E S Q +GI
Sbjct: 115 KYIETELKKRKGQVEAEEQKVK--VKNAEDHLYELPENIRVNSAKKTEEMLSNQMLSGIP 172
Query: 168 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL-- 224
EV L I+ K+KNI TE AK +LL E+R + + +P++ + +Y Q R Y E +
Sbjct: 173 EVDLGIDAKIKNIINTEEAKARLLAEQRNKKKDQGTSFVPTNIAVNYVQHNRFYHEDMNA 232
Query: 225 ------RREHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
RE P++ R G + P+ + + + ATD + E+F+K R
Sbjct: 233 PQRHHRHREEPKVRPLRVGDTEKPGPEAPSPPNYRKRPNNEKATDDYHYEKFKKMNR 289
>gi|50757325|ref|XP_415471.1| PREDICTED: uncharacterized protein C9orf78 [Gallus gallus]
Length = 289
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/263 (33%), Positives = 134/263 (50%), Gaps = 29/263 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPA----IPSALQSAAA-------AGGGGLTKVSEKNEGD- 85
RL LEE K +Q R+R +G+ A + LQ A GG+ + + E
Sbjct: 28 RLKLEEAKEVQSLRKRPNGVSAAALLVGEKLQEEATLVDDPFKIKSGGMVDMKKLKERGK 87
Query: 86 ---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+AED
Sbjct: 88 DRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNAEDS 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK KLL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKAKLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAA 254
+ +P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERSPPN 264
Query: 255 GSR---QAATDQFMLERFRKRER 274
R + ATD + E+F+K R
Sbjct: 265 RKRPPNEKATDDYHYEKFKKMNR 287
>gi|195095931|ref|XP_001997854.1| GH17986 [Drosophila grimshawi]
gi|193905556|gb|EDW04423.1| GH17986 [Drosophila grimshawi]
Length = 300
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 145/299 (48%), Gaps = 53/299 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDD E+E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKSSRKNLRQR----------KNSDDCEKEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNM 110
A GGL + G ++ E + + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKMGGLVNMQTLKSGKMKEPEDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNI-DVNDRVENDLKH----AED-ELYKIPEHLKKRNSEES----ST 160
+KY+EQEL K++G D D ++ H ED LY +P+HL++ +S S S
Sbjct: 131 MKYIEQELQKRKGGGADDSTDNADDGDAHKYLTPEDAALYALPDHLRQSSSHRSEEMLSN 190
Query: 161 QWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR- 218
Q GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 QMLNGIPEVDLGIHAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRF 249
Query: 219 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D E+ RR+ +D A ++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 250 NIEDSNEQRRRKR---------EDKEAKNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 298
>gi|332016923|gb|EGI57732.1| Uncharacterized protein C9orf78 [Acromyrmex echinatior]
Length = 297
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 86/274 (31%), Positives = 130/274 (47%), Gaps = 40/274 (14%)
Query: 32 DDEEERRLAL----EEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GGGGLT 76
+D EE +++L EE+K +QK RER +G+ + AL A+ GG +
Sbjct: 31 NDSEEEKMSLREKVEEMKIIQKLRERPAGVDVVGLALGENVASDTITSDPFNMKTGGMIN 90
Query: 77 KVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI-----DVN 129
+ KN D E + F ET ED M+KY+E+EL+K++ KN D N
Sbjct: 91 MAALKNTKHKPNDAYETGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKNSNDAANDAN 150
Query: 130 DRVENDLKHAEDELYKIPEHLKK----RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 185
+ + E L +PEHL++ R+ E S Q +GI EV L IE K++NIE TE
Sbjct: 151 NEKGSYCSPEEAALRAVPEHLRQSSANRSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEE 210
Query: 186 AK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGS 244
AK KLL ++ S F +P++ + ++ Q R E + +K SQ +
Sbjct: 211 AKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNR-----FNIEDSDFHK---SQQESGDK 261
Query: 245 RPTDNSTDAAGSR----QAATDQFMLERFRKRER 274
+ D G R + ATD + ERF+K+ R
Sbjct: 262 KKCTTKEDIRGKRKDNGEKATDDYHYERFKKQFR 295
>gi|390353053|ref|XP_001177304.2| PREDICTED: uncharacterized protein C9orf78-like [Strongylocentrotus
purpuratus]
Length = 244
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/244 (34%), Positives = 125/244 (51%), Gaps = 44/244 (18%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K +KK++ R+R + E + ++D R LEE K QK RER G+ A
Sbjct: 5 KAKKKRSIRQRKTSSDSEDDGQSNED---IRNILEETKEAQKFRERPHGVSA-------T 54
Query: 68 AAAGGGGLTKVSEKNEGD-------------------------GEKDELVLQDTFAQETA 102
A G +TKV E N+ D ++D + TFA ET
Sbjct: 55 ALLTGKKMTKVEEMNDDDPFNLKVGGMLSLKEIKDRNRDRSDESDRDVANMGSTFAVETN 114
Query: 103 VMVEDPNMLKYVEQELAKKRGKNIDV-NDRVENDLKH--AEDELYKIPEHLK---KRNSE 156
ED M+KY+E E+ KK+G ++D +D + KH ED+LY++P++LK +++SE
Sbjct: 115 QRDEDAEMMKYIEIEMNKKKGLDLDKESDPTKEGAKHKTPEDKLYELPDNLKVEAQKSSE 174
Query: 157 ES-STQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYF 214
E S Q +GI EV L IE K+KNIE TE AK K L+E+R + + F +P++ + +Y
Sbjct: 175 EMLSNQMLSGIPEVDLGIEAKIKNIEATEDAKQKHLEERRNKKKNTTSF-VPANMAVNYV 233
Query: 215 QRGR 218
Q R
Sbjct: 234 QHSR 237
>gi|442760949|gb|JAA72633.1| Hypothetical protein, partial [Ixodes ricinus]
Length = 303
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 131/279 (46%), Gaps = 45/279 (16%)
Query: 38 RLALEEIKFLQKQRERKSGIPAI------------------PSALQSAAAAGGGGLTKVS 79
R LE+ K +QK R+R +G+ I P L++ L
Sbjct: 26 REILEDTKEIQKLRKRPNGVSVIGLNLGKKLTTKEELVIEDPFKLKTGGMIDMKALKGXX 85
Query: 80 EKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
E E D + L +TF+ ET ED +M+KY+E+ELAK+RGK D ++
Sbjct: 86 ITME---ELDAVNLGNTFSVETNQRDEDADMMKYIEEELAKRRGKGQDTETDSRDEGVDP 142
Query: 140 EDELYKIPEHLKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL 195
ED L+ +PEHL+K +S++S S Q +GI EV L IE +++NIE TE AK L +R+
Sbjct: 143 EDVLFHVPEHLRKSSSKKSEEMLSNQMLSGIPEVDLGIEERIRNIEATEEAKLKLLRERM 202
Query: 196 MGRAKSDFSIPSSYSADYFQRGR---------DYAEKLRRE-HPELYKDR---GSQDDGA 242
+ + +P++ + ++ Q R YA ++ RE P + K + A
Sbjct: 203 AKKERETSFVPTNMAVNFVQHNRFNIDDGSRSRYARRVPREKEPPVAKPVVVIAEAEAVA 262
Query: 243 GSRPTDNSTDAAG-SR------QAATDQFMLERFRKRER 274
S P G SR + ATD F E+F+K+ R
Sbjct: 263 HSIPGRQGKGGKGLSRGKGNDDEKATDDFHFEKFKKQFR 301
>gi|194747083|ref|XP_001955982.1| GF24824 [Drosophila ananassae]
gi|190623264|gb|EDV38788.1| GF24824 [Drosophila ananassae]
Length = 295
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 148/301 (49%), Gaps = 53/301 (17%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSA 63
I KK +KN R+R K SDD E+E ++ LEEIK Q+ R+R +G+ + A
Sbjct: 15 IVFKKSSRKNLRQR----------KSSDDGEKEEQVTLEEIKERQRLRQRPNGVSLVGLA 64
Query: 64 LQSAAA------------AGGGGLTKVSEKNEGD-GEKD---ELVLQDTFAQETAVMVED 107
L A GGL + + G E D ++ + F+ ET ED
Sbjct: 65 LGKKMAPEEELAIKDPFNVKTGGLVNMQQLKSGKMKEADDAYDVGIGTQFSAETNKRDED 124
Query: 108 PNMLKYVEQELAK-KRG----KNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEES---- 158
M+KY+EQEL K KRG D + V L + LY +P+HL++ +S S
Sbjct: 125 EEMMKYIEQELQKRKRGGTDASAADDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEML 184
Query: 159 STQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRG 217
S Q GI EV L I+ K++NIE TE AK KLLQ+ + S F +P++ + ++ Q
Sbjct: 185 SNQMLNGIPEVDLGIQAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHN 243
Query: 218 R----DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRE 273
R D +E+ RR+ ++D G++ + T+ G ++ ATD + ++FRK+
Sbjct: 244 RFNIEDNSEQKRRK----------REDREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQF 292
Query: 274 R 274
R
Sbjct: 293 R 293
>gi|288684380|ref|NP_001165770.1| uncharacterized protein LOC733913 [Xenopus (Silurana) tropicalis]
gi|170285295|gb|AAI61311.1| Unknown (protein for MGC:186018) [Xenopus (Silurana) tropicalis]
Length = 290
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 89/267 (33%), Positives = 136/267 (50%), Gaps = 31/267 (11%)
Query: 35 EERRLALEEIKFLQKQRERKSGIPA--------IPSALQSA----AAAGGGGLTKVSEKN 82
+E R+ LEE K +Q R+R++G+ A +P + A GG + K+
Sbjct: 26 QEVRIKLEEAKEVQSLRKRQNGVSAAALLVGERLPEEVIMADDPFKMQSGGMVDMKKLKD 85
Query: 83 EGD---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
G GE+++L L +F+ ET ED +M+KY+E EL K++G I N+ + K A
Sbjct: 86 RGKDRLGEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEKKVKPKSA 143
Query: 140 EDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 194
ED LY++PE +K K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 144 EDCLYELPESIKVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKARLLAEQQ 203
Query: 195 LMGRAKSDFSIPSSYSADYFQRGRDYAE----KLRREHPELYKDRGSQDDGAGSRPTDNS 250
+ K +P++ + +Y Q R Y E +RR H E K R + G +P
Sbjct: 204 NKKKDKHTSFVPTNMAVNYVQHNRFYQEDQNTPMRR-HKEEPKPRPLR-VGDTEKPEPEK 261
Query: 251 TDAAGSRQA---ATDQFMLERFRKRER 274
+ R + ATD + E+F+K R
Sbjct: 262 SPPNRKRPSNEKATDDYHYEKFKKMNR 288
>gi|431898902|gb|ELK07272.1| hypothetical protein PAL_GLEAN10012522 [Pteropus alecto]
Length = 289
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 143/289 (49%), Gaps = 34/289 (11%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP----------- 61
K FR+R + E E + D +E RL LEE + +Q R+R +G+ A+
Sbjct: 6 KTFRRRRADSESEEDEQ---DSQEVRLKLEETREVQNLRKRPNGVSAVALLVGEKVQEET 62
Query: 62 SALQSAAAAGGGGLTKVSEKNEGDGEK----DELVLQDTFAQETAVMVEDPNMLKYVEQE 117
+ + GG+ + + E +K ++L L +F+ ET ED +M+KY+E E
Sbjct: 63 TLVDDPFQMKTGGMVDMKKLKERGKDKISDEEDLHLGTSFSAETNRRDEDADMMKYIETE 122
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPI 173
L K++G I ++ + K+AED LY++PE+++ K+ E S Q +GI EV L I
Sbjct: 123 LKKRKG--IVEHEEQKVKQKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGI 180
Query: 174 EYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL------RR 226
+ K+KNI TE AK +LL E++ + +P++ + +Y Q R Y E+L +
Sbjct: 181 DAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNK 240
Query: 227 EHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
E P+ R G + R N A + ATD + E+F+K R
Sbjct: 241 EEPKARPLRVGDTEKPEPERSPPNRKRPANEK--ATDDYHYEKFKKMNR 287
>gi|47212056|emb|CAF90174.1| unnamed protein product [Tetraodon nigroviridis]
Length = 291
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 136/267 (50%), Gaps = 33/267 (12%)
Query: 38 RLALEEIKFLQKQRERKSGIPA--------IPSALQ------SAAAAGGGGLTKVSEKNE 83
R +EE K LQ R+R++G+ +P ++ G + +V ++N
Sbjct: 26 RSKVEEAKELQSLRKRQTGVSLTALLVGEKLPPEVEIDNDPFKLKTGGVVDMKRVKDRNR 85
Query: 84 GDGEKDE--LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED 141
D +DE L L +F+ ET ED +M+KY+E EL K++G+ +V+ +K+AED
Sbjct: 86 -DMTEDETDLNLGTSFSVETNRRDEDADMMKYIETELKKRKGQVEAEEQKVK--VKNAED 142
Query: 142 ELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLM 196
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E+R
Sbjct: 143 HLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIINTEDAKARLLAEQRNK 202
Query: 197 GRAKSDFSIPSSYSADYFQRGRDYAEKL--------RREHPELYKDR-GSQDDGAGSRPT 247
+ +S +P++ + +Y Q R Y E + RE P+ R G + P+
Sbjct: 203 KKDQSTSFVPTNIAVNYVQHNRFYHEDMNAPQRHHRHREEPKARPLRVGDTEKPGPEAPS 262
Query: 248 DNSTDAAGSRQAATDQFMLERFRKRER 274
++ + + ATD + E+F+K R
Sbjct: 263 PSNHRKRPNNEKATDDYHYEKFKKMNR 289
>gi|440797240|gb|ELR18335.1| hepatocellular carcinomaassociated antigen 59, putative
[Acanthamoeba castellanii str. Neff]
Length = 309
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 140/297 (47%), Gaps = 39/297 (13%)
Query: 9 KEKKKNFRKRSYEEEEETT------NKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPS 62
+++K RK+ E+E ET + +D+ L L E + LQ++RER G A +
Sbjct: 20 RKQKARLRKKIVEDEPETEADQEAETEEGEDDAPLGLMLRETRKLQRERERVKGCEAAAT 79
Query: 63 ALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVE---QELA 119
+ + + ++ D K+ +L+ TF + V +P + Y+E +E
Sbjct: 80 STEVLQKIATSSFVRPVANDDDD--KETHLLESTFTVQAEQDVVNPLLENYIEARLREFR 137
Query: 120 KKRGK----------NIDVNDRVEN-----DLKHAEDELYKIPEHLK----KRNSEESST 160
+ R K +D D+ E DL+ E +LY+IPEHLK R+ ++ S
Sbjct: 138 ETRAKEAIEKAKAERGVDWRDKEETTEKEFDLREEERKLYEIPEHLKVSETMRSDDQVSE 197
Query: 161 QWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 220
W TGI EV+LPIEYKLKNIE TE AK+LL +++ + P +Y+ F R
Sbjct: 198 AWLTGIQEVELPIEYKLKNIEATEDAKRLLLKRKEGPKPPPQ---PDAYNT-RFGRPSTQ 253
Query: 221 AEKLRRE---HPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
+ +RR+ H + +DR + G D+ + ATD ERF+KR R
Sbjct: 254 TQIVRRDRNAHRDSNRDRNNDGQQQGGWRGDHHR--GKKSEQATDDIAFERFKKRFR 308
>gi|307199470|gb|EFN80083.1| Uncharacterized protein C9orf78 [Harpegnathos saltator]
Length = 295
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 130/278 (46%), Gaps = 42/278 (15%)
Query: 30 LSDDEEER------RLALEEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GG 72
+SDD+E R +EE+K +QK RER +GI + AL A+ G
Sbjct: 25 ISDDDESESEKTSLREKVEEMKIVQKLRERPTGINVVGLALGENVASDVIMSDPFNMKTG 84
Query: 73 GGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI-DVN 129
G + KN + D + + F ET ED M+KY+E+EL+K++ KN DV
Sbjct: 85 GIVNMAVLKNTKHRQNDAYDTGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKNNNDVT 144
Query: 130 DRVEND----LKHAEDELYKIPEHLKK----RNSEESSTQWTTGIAEVQLPIEYKLKNIE 181
+ N+ E L +PEHL++ R+ E S Q +GI EV L IE K++NIE
Sbjct: 145 NGTNNEKGSYCSPEEAALRAVPEHLRQSSAHRSEEMLSNQMLSGIPEVDLGIEAKIRNIE 204
Query: 182 ETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD 240
TE AK KLL ++ S F +P++ + ++ Q R E + K R DD
Sbjct: 205 ATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNR-----FNIEDSDFQKSRQDSDD 258
Query: 241 GAGSRPTDNSTDAAGSR----QAATDQFMLERFRKRER 274
+ D G R + ATD + ERF+K+ R
Sbjct: 259 ---KKKCVTKEDIRGKRKDNGEKATDDYHYERFKKQFR 293
>gi|348530434|ref|XP_003452716.1| PREDICTED: uncharacterized protein C9orf78 homolog [Oreochromis
niloticus]
Length = 291
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 144/295 (48%), Gaps = 43/295 (14%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAI------------ 60
KNFR+R ++ + + + EE R LEE K LQ R+R+SGI
Sbjct: 5 KNFRRR----KDSSDVEEDETTEEVRHKLEEAKELQSLRKRQSGISVTALLVGEKLPPEA 60
Query: 61 -----PSALQSAAAAGGGGLTKVSEKNEGDGEKDE--LVLQDTFAQETAVMVEDPNMLKY 113
P L++ G + KV ++N D +DE L L +F+ ET ED +M+KY
Sbjct: 61 EIENDPFKLKTG---GIVDMKKVKDRNR-DMTEDETDLNLGTSFSAETNRRDEDADMMKY 116
Query: 114 VEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEV 169
+E EL KK+G +V+ +K+ ED LY++PE+++ K+ E S Q +GI EV
Sbjct: 117 IETELKKKKGLVEAEEQKVK--VKNPEDHLYELPENIRVNSAKKTEEMLSNQMLSGIPEV 174
Query: 170 QLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY-----AEK 223
L I+ K+KNI +TE AK KLL E+R + +P++ + +Y Q R Y A +
Sbjct: 175 DLGIDAKIKNIIQTEDAKAKLLAEQRNKKKDHGTSFVPTNIAVNYVQHNRFYHEDANAPQ 234
Query: 224 LRREHPELYKDR----GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
H E K R G + P+ + + + ATD + E+F+K R
Sbjct: 235 RHHRHKEEPKARPLRVGDTEKPGPEAPSPPNYRKRPNNEKATDDYHYEKFKKMNR 289
>gi|383853293|ref|XP_003702157.1| PREDICTED: uncharacterized protein C9orf78-like [Megachile
rotundata]
Length = 310
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 145/299 (48%), Gaps = 42/299 (14%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL 64
I K++ +K+ RKR +E+ ++E R +EE+K +QK RER G+ + AL
Sbjct: 23 IEFKRKSRKSLRKRHVSSDEDDN---ENEETSIREKVEEMKIIQKLRERPKGVNVVGLAL 79
Query: 65 QSAA-----------AAGGGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNML 111
GG + + KN + D E + F ET ED M+
Sbjct: 80 GENVTPDVMTSDPFNVKTGGMVNMAALKNTKLKQNDAYETGIGTQFNAETNKRDEDEEMV 139
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLKK----RNSEESSTQW 162
KY+E+EL+K++ KN D + ++ K + E L +PEHL++ R+ E S Q
Sbjct: 140 KYIEEELSKRKSKNEDKTENGSSNDKGSYCSPEEAALQAVPEHLRQSSAHRSEEMLSNQM 199
Query: 163 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR--- 218
+GI EV L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R
Sbjct: 200 LSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNRFNI 258
Query: 219 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D+ +K +++ + K +DD G R DN + ATD + ERF+K+ R
Sbjct: 259 EDADF-QKSKQDSDDRKKVTAPRDDFKGKRK-DNG-------EKATDDYHYERFKKQFR 308
>gi|125979391|ref|XP_001353728.1| GA20734 [Drosophila pseudoobscura pseudoobscura]
gi|195169146|ref|XP_002025386.1| GL11930 [Drosophila persimilis]
gi|54640711|gb|EAL29462.1| GA20734 [Drosophila pseudoobscura pseudoobscura]
gi|194108854|gb|EDW30897.1| GL11930 [Drosophila persimilis]
Length = 296
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 115/237 (48%), Gaps = 37/237 (15%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEE-ERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDDEE E +L L++IK Q+ R R +G+ + AL
Sbjct: 20 KKSSRKNLRQR----------KNSDDEEKEEKLTLDDIKERQRLRHRPNGVSLVGLALGK 69
Query: 67 AAA------------AGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNM 110
A GGL ++ G ++ E + + F+ ET ED M
Sbjct: 70 KIAPEEELAIKDPFNVKSGGLVNMATLKSGKMKEAEDPYDVGIGTQFSAETNKRDEDEEM 129
Query: 111 LKYVEQELAKKRGKNIDVNDRVEND----LKHAEDELYKIPEHLKK----RNSEESSTQW 162
+KY+E EL K++G D D + D L + LY +P+HL++ R+ E S Q
Sbjct: 130 MKYIELELQKRKGGGTDAADNDDGDVNKYLTPEDAALYALPDHLRQSSTHRSEEMLSNQM 189
Query: 163 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 218
GI EV L I+ K+ NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 190 LNGIPEVDLGIQAKICNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNR 245
>gi|238231821|ref|NP_001154097.1| CI078 protein [Oncorhynchus mykiss]
gi|225704000|gb|ACO07846.1| C9orf78 [Oncorhynchus mykiss]
Length = 295
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 138/274 (50%), Gaps = 40/274 (14%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA---AGG---------GGLTKVSE--- 80
E R L+E K LQ R+R++G+ ++ + L+ GG GG+ + +
Sbjct: 25 EVRSKLDEAKELQSLRKRQTGV-SVAALLEGEKLRLDEGGDNDPFKLKTGGVVDMKKVKD 83
Query: 81 --KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH 138
++ D + +L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+
Sbjct: 84 RARDMTDDDTGDLNLGTSFSAETNRRDEDADMMKYIETELKKKKGSVEAEEQKVK--VKN 141
Query: 139 AEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEK 193
AED LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK KLLQ++
Sbjct: 142 AEDLLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIFTEEAKAKLLQDQ 201
Query: 194 RLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD--------GAGSR 245
R + +P++ + +Y Q R Y E + P+ + +R + G +
Sbjct: 202 RNKKKDNGTSFVPTNITVNYVQHNRFYREDV--NAPQRHHNRHKPKEPEARPLRVGDTEK 259
Query: 246 PTDNSTDAAGSR-----QAATDQFMLERFRKRER 274
P + + A R + ATD + E+F+K R
Sbjct: 260 PGPEAVEPANHRKRPNNEKATDDYHYEKFKKMNR 293
>gi|21358507|ref|NP_647643.1| CG7974, isoform A [Drosophila melanogaster]
gi|442629510|ref|NP_001261273.1| CG7974, isoform B [Drosophila melanogaster]
gi|7292132|gb|AAF47544.1| CG7974, isoform A [Drosophila melanogaster]
gi|17861842|gb|AAL39398.1| GM02612p [Drosophila melanogaster]
gi|220943288|gb|ACL84187.1| CG7974-PA [synthetic construct]
gi|220953398|gb|ACL89242.1| CG7974-PA [synthetic construct]
gi|440215140|gb|AGB93968.1| CG7974, isoform B [Drosophila melanogaster]
Length = 294
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 142/294 (48%), Gaps = 46/294 (15%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +EEE +E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDEEE---------KEEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMKQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLKKRNSEES----STQ 161
KY+EQEL K++G + D E+D L + LY +P+HL++ +S S S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAAEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 220
GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIVAKIRNIEATEEAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFN 245
Query: 221 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 246 IEDNSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 292
>gi|387914846|gb|AFK11032.1| uncharacterized protein C9orf78-like protein [Callorhinchus milii]
Length = 289
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 143/293 (48%), Gaps = 41/293 (13%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEER-----RLALEEIKFLQKQRERKSGI---------- 57
K++R+R EE SD+E+E+ R L+E+K +Q R R++G+
Sbjct: 5 KSYRRRRLEE--------SDEEDEQTTVLVRSKLDELKEIQSMRRRQNGVSAAALLVGEK 56
Query: 58 -PAIPSALQSAAAAGGGGLT-----KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNML 111
P S + GG+ K + + E+ +L L +F+ ET ED +M+
Sbjct: 57 TPEEASTVDDPFKLKTGGMIDMKKIKDRNRERVEEEETDLNLGTSFSVETNRRDEDADMM 116
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIA 167
KY+E EL KR K I N+ + +K+ ED LY++P+++ KR E S Q +GI
Sbjct: 117 KYIETEL--KRRKGILENEEQKVKIKNPEDMLYELPDNINVSSAKRTEEMLSNQMLSGIP 174
Query: 168 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL-- 224
EV L I+ K+KNI TE AK +LL E+R + + +P++ + +Y Q R Y E++
Sbjct: 175 EVDLGIDAKIKNIISTEEAKAQLLAEQRNKKKDNATSFVPTNIAVNYVQHNRFYREEIHA 234
Query: 225 --RREHPELYKDRGSQDDGAGSRPTDNSTD-AAGSRQAATDQFMLERFRKRER 274
RR EL D + P + + S + ATD + E+F+K R
Sbjct: 235 PARRHKEELKPKPLRVGDTEKTEPDQSPPNRKRPSNEKATDDYHYEKFKKMSR 287
>gi|195490506|ref|XP_002093169.1| GE21178 [Drosophila yakuba]
gi|194179270|gb|EDW92881.1| GE21178 [Drosophila yakuba]
Length = 294
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 142/294 (48%), Gaps = 46/294 (15%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +E E +E ++ L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDETE---------KEEQITLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMQQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLKKRNSEES----STQ 161
KY+EQEL K++G + D VE+D L + LY +P+HL++ +S S S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAVEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 220
GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIVAKIRNIEATEEAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFN 245
Query: 221 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 246 IEDSSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 292
>gi|311246626|ref|XP_003122268.1| PREDICTED: uncharacterized protein C9orf78-like [Sus scrofa]
Length = 289
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ KR E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKRTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|226442876|ref|NP_001139972.1| CI078 protein [Salmo salar]
gi|221220608|gb|ACM08965.1| C9orf78 [Salmo salar]
Length = 295
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 138/274 (50%), Gaps = 40/274 (14%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA---AGG---------GGLTKVSE--- 80
E R L+E K LQ R+R++G+ ++ + L+ GG GG+ + +
Sbjct: 25 EVRSKLDEAKELQSLRKRQTGV-SVAALLEGEKLRLDEGGDNDPFKLKTGGVVDMKKVKD 83
Query: 81 --KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH 138
++ D + +L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+
Sbjct: 84 RARDMTDDDTGDLNLGTSFSAETNRRDEDADMMKYIETELKKKKGLVEAEEQKVK--VKN 141
Query: 139 AEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEK 193
AED LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK KLLQ++
Sbjct: 142 AEDLLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIFTEEAKAKLLQDQ 201
Query: 194 RLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD--------GAGSR 245
R + +P++ + +Y Q R Y E + P+ + +R + G +
Sbjct: 202 RNKKKDNGTSFVPTNIAVNYVQHNRFYREDV--NAPQRHHNRHKPKEPEARPLRVGDTEK 259
Query: 246 PTDNSTDAAGSR-----QAATDQFMLERFRKRER 274
P + + A R + ATD + E+F+K R
Sbjct: 260 PGPEAVEPANHRKRPNNEKATDDYHYEKFKKMNR 293
>gi|195336668|ref|XP_002034957.1| GM14436 [Drosophila sechellia]
gi|194128050|gb|EDW50093.1| GM14436 [Drosophila sechellia]
Length = 294
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 142/294 (48%), Gaps = 46/294 (15%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +EEE +E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDEEE---------KEEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMKQLKSGKIKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLKKRNSEES----STQ 161
KY+EQEL K++G + D E+D L + LY +P+HL++ +S S S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAAEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 220
GI EV L I K++NIE TE AK KL+Q+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIVAKIRNIEATEEAKQKLMQDAKNKKDGPSQF-VPTNMAVNFMQHNRFN 245
Query: 221 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 246 IEDNSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 292
>gi|17068385|gb|AAH17570.1| Chromosome 9 open reading frame 78 [Homo sapiens]
Length = 289
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G +V++ K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKGIVEHEEQKVKS--KNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQSKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|380018473|ref|XP_003693152.1| PREDICTED: uncharacterized protein C9orf78-like [Apis florea]
Length = 296
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 128/266 (48%), Gaps = 45/266 (16%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GGGGLTKVSEKNEGDGEKD 89
+EE+K +QK RER G+ + AL GG + + KN + D
Sbjct: 42 VEEMKIIQKLRERPKGVNVVGLALGENVTPDVMMSDPFNVKTGGMVNMAALKNTKLKQND 101
Query: 90 --ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH--------A 139
E + F ET ED M+KY+E+EL+K++ KN D+ EN L +
Sbjct: 102 AYETGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKN---EDKTENGLNNDKGSYCSPE 158
Query: 140 EDELYKIPEHLKK----RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 194
E L +PEHL++ R+ E S Q +GI EV L IE K++NIE TE AK KLL ++
Sbjct: 159 EAALQAVPEHLRQSSAHRSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRH 218
Query: 195 LMGRAKSDFSIPSSYSADYFQRGR------DYAEKLRREHPELYKDRGSQDDGAGSRPTD 248
S F +P++ + ++ Q R D+ +K +++ + K +DD G R D
Sbjct: 219 RKKDGPSQF-VPTNMAVNFVQHNRFNIEDTDF-QKSKQDSDDRKKIIAPRDDYKGKR-KD 275
Query: 249 NSTDAAGSRQAATDQFMLERFRKRER 274
N + ATD + ERF+K+ R
Sbjct: 276 NG-------EKATDDYHYERFKKQFR 294
>gi|410979370|ref|XP_003996058.1| PREDICTED: uncharacterized protein C9orf78 homolog, partial [Felis
catus]
Length = 286
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 25 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 84
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 85 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 142
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 143 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 202
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 203 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 262
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 263 KRPANEK--ATDDYHYEKFKKMNR 284
>gi|432874672|ref|XP_004072535.1| PREDICTED: uncharacterized protein C9orf78 homolog [Oryzias
latipes]
Length = 294
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 132/271 (48%), Gaps = 39/271 (14%)
Query: 38 RLALEEIKFLQKQRERKSGIPAI-----------------PSALQSAAAAGGGGLTKVSE 80
R LEE K +Q R+R++G+ P L++ G + KV +
Sbjct: 27 RSKLEEAKEIQSLRKRQTGVSVTALLVGEKLPPEAEIDNDPFKLKTG---GVIDMKKVKD 83
Query: 81 KNEGDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
+N E + EL L +F+ ET ED +M+KY+E EL KK+G +++ +K+
Sbjct: 84 RNRDMTEDETELNLGTSFSAETNRRDEDADMMKYIETELKKKKGLVEAEEQKIK--VKNP 141
Query: 140 EDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 194
ED LY++PE+++ K+ E S Q +GI EV L I+ K+KNI +TE AK KL+ E+R
Sbjct: 142 EDHLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIQTEEAKAKLIAEQR 201
Query: 195 LMGRAKSDFSIPSSYSADYFQRGRDYAE---KLRREH--------PELYKDRGSQDDGAG 243
+ +P++ + +Y Q R Y E +R H P + ++ G
Sbjct: 202 NKKKDNGTSFVPTNIAVNYVQHNRFYHEDSNAAQRHHRHKEPEPKPRPLRVGDTEKPGLE 261
Query: 244 SRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
+ P+ + + + ATD + E+F+K R
Sbjct: 262 AAPSPPNFRKRPNNEKATDDYHYEKFKKMNR 292
>gi|115496926|ref|NP_001069516.1| uncharacterized protein C9orf78 homolog [Bos taurus]
gi|338720610|ref|XP_003364207.1| PREDICTED: uncharacterized protein C9orf78-like isoform 2 [Equus
caballus]
gi|94574208|gb|AAI16054.1| Chromosome 9 open reading frame 78 ortholog [Bos taurus]
gi|296482067|tpg|DAA24182.1| TPA: chromosome 9 open reading frame 78 [Bos taurus]
Length = 265
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/250 (31%), Positives = 128/250 (51%), Gaps = 27/250 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGD-GEKDELVLQDT 96
RL LEE + +Q R+R +G+ G + K+ E+ + E+++L L +
Sbjct: 28 RLKLEETREVQNLRKRPNGM----------KTGGMVDMKKLKERGKDKISEEEDLHLGTS 77
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----K 152
F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY++PE+++ K
Sbjct: 78 FSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYELPENIRVSSAK 135
Query: 153 RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSA 211
+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ + +P++ +
Sbjct: 136 KTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAV 195
Query: 212 DYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQF 264
+Y Q R Y E+L +E P+ R G + R N A + ATD +
Sbjct: 196 NYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRPANEK--ATDDY 253
Query: 265 MLERFRKRER 274
E+F+K R
Sbjct: 254 HYEKFKKMNR 263
>gi|395844397|ref|XP_003794948.1| PREDICTED: uncharacterized protein C9orf78 homolog [Otolemur
garnettii]
Length = 289
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKIQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|355732023|gb|AES10570.1| hypothetical protein [Mustela putorius furo]
Length = 288
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|440894378|gb|ELR46847.1| hypothetical protein M91_13534 [Bos grunniens mutus]
Length = 289
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|7706557|ref|NP_057604.1| uncharacterized protein C9orf78 [Homo sapiens]
gi|388452692|ref|NP_001253951.1| uncharacterized protein LOC716542 [Macaca mulatta]
gi|114627167|ref|XP_520311.2| PREDICTED: uncharacterized protein C9orf78 homolog isoform 2 [Pan
troglodytes]
gi|332230227|ref|XP_003264289.1| PREDICTED: uncharacterized protein C9orf78 homolog [Nomascus
leucogenys]
gi|397503617|ref|XP_003822417.1| PREDICTED: uncharacterized protein C9orf78 homolog [Pan paniscus]
gi|402896312|ref|XP_003911247.1| PREDICTED: uncharacterized protein C9orf78 homolog [Papio anubis]
gi|426363280|ref|XP_004048771.1| PREDICTED: uncharacterized protein C9orf78 homolog [Gorilla gorilla
gorilla]
gi|74753081|sp|Q9NZ63.1|CI078_HUMAN RecName: Full=Uncharacterized protein C9orf78; AltName:
Full=Hepatocellular carcinoma-associated antigen 59
gi|7158847|gb|AAF37561.1| hepatocellular carcinoma-associated antigen 59 [Homo sapiens]
gi|14043339|gb|AAH07664.1| Chromosome 9 open reading frame 78 [Homo sapiens]
gi|119608316|gb|EAW87910.1| chromosome 9 open reading frame 78, isoform CRA_b [Homo sapiens]
gi|193787017|dbj|BAG51840.1| unnamed protein product [Homo sapiens]
gi|355570051|gb|EHH25578.1| hypothetical protein EGK_21433 [Macaca mulatta]
gi|355753000|gb|EHH57046.1| hypothetical protein EGM_06606 [Macaca fascicularis]
gi|380785079|gb|AFE64415.1| uncharacterized protein C9orf78 [Macaca mulatta]
gi|380808288|gb|AFE76019.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|380813696|gb|AFE78722.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411663|gb|AFH29045.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411665|gb|AFH29046.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411667|gb|AFH29047.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411669|gb|AFH29048.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|384942542|gb|AFI34876.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|384942544|gb|AFI34877.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|384942546|gb|AFI34878.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|410223530|gb|JAA08984.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410223532|gb|JAA08985.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410223534|gb|JAA08986.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410256334|gb|JAA16134.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410256336|gb|JAA16135.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410256338|gb|JAA16136.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410295782|gb|JAA26491.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410295784|gb|JAA26492.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410355223|gb|JAA44215.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410355225|gb|JAA44216.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410355227|gb|JAA44217.1| chromosome 9 open reading frame 78 [Pan troglodytes]
Length = 289
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|427797465|gb|JAA64184.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 326
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 115/231 (49%), Gaps = 28/231 (12%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEE--RRLALEEIKFLQKQRERKSGIPAI--------- 60
K +KR + ++++ DEEE R L++ K +QK R+R +G+ I
Sbjct: 22 KSGLKKRKCFRQHKSSDGSESDEEEGVSREILQDTKEIQKLRKRPNGVSVIGLNLGKKLT 81
Query: 61 ---------PSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNML 111
P L++ L E E D + L +TF+ ET ED +M+
Sbjct: 82 PKEELVIDDPFKLKTGGMIDMKALKGKRVTME---ELDAVNLGNTFSVETNQRDEDADMM 138
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEES----STQWTTGIA 167
KY+E+ELAK+RG+ + +N + +D L+ +PEHL+K S++S S Q +GI
Sbjct: 139 KYIEEELAKRRGRVQEPQPTPQNTVDE-KDVLFHVPEHLRKSTSKKSEEMLSNQMLSGIP 197
Query: 168 EVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 218
EV L IE +++NIE TE AK L R+ + + +P++ + ++ Q R
Sbjct: 198 EVDLGIEERIRNIEATEEAKLKLIRDRMARKERETSFVPTNMAVNFVQHNR 248
>gi|149738228|ref|XP_001499857.1| PREDICTED: uncharacterized protein C9orf78-like isoform 1 [Equus
caballus]
Length = 289
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|255083575|ref|XP_002508362.1| predicted protein [Micromonas sp. RCC299]
gi|226523639|gb|ACO69620.1| predicted protein [Micromonas sp. RCC299]
Length = 246
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/197 (38%), Positives = 101/197 (51%), Gaps = 26/197 (13%)
Query: 31 SDDEEERRLA------LEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEG 84
SDDEE+ + A +E+ K L K R + G+ A A A G G ++++
Sbjct: 14 SDDEEDEQGAQALRERMEDAKTLIKNRVKSKGVGA------EALALGSGKKDVDADEDAD 67
Query: 85 DGEKDELVLQDTFAQETAVMV--EDPNMLKYVEQELAKKRGKNIDVNDRVEND----LKH 138
DG+ + FA AV V EDPNML+Y+EQELAK+RG D K
Sbjct: 68 DGKHAQ------FAAGAAVDVDGEDPNMLRYIEQELAKRRGAGGDEGGDGAGTSGGGAKS 121
Query: 139 AEDELYKIPEHL--KKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLM 196
AE+ L+ P+ L KK EE++ +W TGI EVQLP +YK+KNIE TE AK + EK
Sbjct: 122 AEERLWDTPDELRVKKTEGEETADRWLTGIVEVQLPADYKIKNIEATERAKAKMLEKIHG 181
Query: 197 GRAKSDFSIPSSYSADY 213
G + P S A+
Sbjct: 182 GGDGAAMDHPHSRQAEL 198
>gi|348570394|ref|XP_003470982.1| PREDICTED: uncharacterized protein C9orf78-like [Cavia porcellus]
Length = 292
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/269 (31%), Positives = 130/269 (48%), Gaps = 38/269 (14%)
Query: 38 RLALEEIKFLQKQRERKSGIPA---------------IPSALQSAAAAGGGGLTKVSEKN 82
RL LEE + +Q R+R +G+ A + SA S GG + K
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVVSARSSFPMKTGGMVDMKKLKE 87
Query: 83 EGD---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLK-- 137
G E+++L L +F+ ET ED +M+KY+E EL K++G + + E +K
Sbjct: 88 RGKDKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG----IVEHEEQKVKPR 143
Query: 138 HAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQE 192
+AED LY++PE ++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E
Sbjct: 144 NAEDCLYELPESIRVASAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAE 203
Query: 193 KRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSR 245
++ + +P++ + +Y Q R Y E+L +E P+ R G + R
Sbjct: 204 QQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPER 263
Query: 246 PTDNSTDAAGSRQAATDQFMLERFRKRER 274
N A + ATD + E+F+K R
Sbjct: 264 SPPNRKRPANEK--ATDDYHYEKFKKMNR 290
>gi|296191004|ref|XP_002743423.1| PREDICTED: uncharacterized protein C9orf78-like [Callithrix
jacchus]
Length = 289
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 131/264 (49%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLTKVSEKNEGDG 86
RL LEE + +Q R+R +G+ A+ + + GG+ + + E
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 87 EK----DELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+K ++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISDEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|225704712|gb|ACO08202.1| C9orf78 [Oncorhynchus mykiss]
Length = 295
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 137/273 (50%), Gaps = 44/273 (16%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA---AGG---------GGLTKVSE--- 80
E R L+E K LQ R+R++G+ ++ + L+ GG GG+ + +
Sbjct: 25 EVRSKLDEAKELQSLRKRQTGV-SVAALLEGEKLRLDEGGDNDPFKLKTGGVVDMKKVKD 83
Query: 81 --KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH 138
++ D + +L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+
Sbjct: 84 RARDMTDDDTGDLNLGTSFSAETNRRDEDADMVKYIETELKKKKGLVEAEEQKVK--VKN 141
Query: 139 AEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEK 193
AED LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK KLLQ++
Sbjct: 142 AEDLLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIFTEEAKAKLLQDQ 201
Query: 194 RLMGRAKSDFSIPSSYSADYFQRGRDYAEKL---RREH-------PELYKDRGSQDDGAG 243
R + +P++ + +Y Q R Y E + +R H PE R G
Sbjct: 202 RNKKKDNGTSFVPTNIAVNYVQHNRFYREDVNAPQRHHSRHKPKEPEARPLRV----GDT 257
Query: 244 SRPTDNSTDAAGSR-----QAATDQFMLERFRK 271
+P + + A R + ATD + E+F+K
Sbjct: 258 EKPGPEAVEPANHRKRPNNEKATDDYHYEKFKK 290
>gi|443714105|gb|ELU06673.1| hypothetical protein CAPTEDRAFT_168725 [Capitella teleta]
Length = 342
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 142/300 (47%), Gaps = 39/300 (13%)
Query: 9 KEKKKNFRKRSYEEEEETTNKLSDDEEERRL-ALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+ KKNFR++ + E + + + E ++E K LQK R+R+ G+ A A+
Sbjct: 3 KKPKKNFRRKVESSDSEDNSDVEEKSNETLCDRIKEAKELQKLRQRQRGVSAEDLAVAKI 62
Query: 68 AA-------------AGGGGLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVEDPNMLKY 113
GG K +K +K+++ + TFA ET ED +MLKY
Sbjct: 63 TPKDSKKKEDPLKLKTGGYIELKTLKKEISKADKEDVEQIGTTFAAETNRRDEDADMLKY 122
Query: 114 VEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK------KRNSEESSTQWTTGIA 167
VE+EL K++G I E + ED LY++PEH+K +N + S Q +GI
Sbjct: 123 VEEELNKRKG--ITKEFESETLKRKPEDALYELPEHVKALTAKKSKNEDMLSNQMLSGIP 180
Query: 168 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRR 226
EV L IE K+ NIE TE AK KL++EKR + +P++ + ++ Q R L R
Sbjct: 181 EVDLGIEVKIHNIEMTEVAKQKLIEEKRRKKDSGISEFVPTNIAVNFMQHNR---FTLHR 237
Query: 227 EHPELYKDRGSQDD-------GAGSRP-----TDNSTDAAGSRQAATDQFMLERFRKRER 274
+ + K + ++ G RP T + A S + ATD + ERF+K R
Sbjct: 238 DEKAVVKKKVVEEPKPEPLRVGDIQRPDTVPSTSEFSRPAASTEKATDDYHYERFKKAVR 297
>gi|344271642|ref|XP_003407646.1| PREDICTED: uncharacterized protein C9orf78-like [Loxodonta
africana]
Length = 289
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
S + ATD + E+F+K R
Sbjct: 266 KRP--SNEKATDDYHYEKFKKMNR 287
>gi|21450249|ref|NP_659134.1| uncharacterized protein C9orf78 homolog [Mus musculus]
gi|408360017|sp|Q3TQI7.2|CI078_MOUSE RecName: Full=Uncharacterized protein C9orf78 homolog
gi|13542853|gb|AAH05624.1| CDNA sequence BC005624 [Mus musculus]
gi|74177493|dbj|BAE34621.1| unnamed protein product [Mus musculus]
gi|74207670|dbj|BAE40080.1| unnamed protein product [Mus musculus]
gi|148676552|gb|EDL08499.1| mCG19001 [Mus musculus]
gi|149039065|gb|EDL93285.1| similar to Hypothetical protein MGC11690 [Rattus norvegicus]
Length = 289
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 132/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPA----IPSALQ----------SAAAAGGGGLTKVSEKNE 83
RL LEE + +Q R+R +G+ A + +Q A G + K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMATGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED
Sbjct: 88 DKVSEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|426226107|ref|XP_004007195.1| PREDICTED: uncharacterized protein C9orf78 homolog [Ovis aries]
Length = 341
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 80 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 139
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 140 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 197
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 198 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 257
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 258 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 317
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 318 KRPANEK--ATDDYHYEKFKKMNR 339
>gi|321477023|gb|EFX87982.1| hypothetical protein DAPPUDRAFT_305639 [Daphnia pulex]
Length = 305
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 109/233 (46%), Gaps = 24/233 (10%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQ-- 65
KK +K RKR E EE + DE + LEE K LQK RER GI A+ A+
Sbjct: 18 KKPSRKPMRKRL--EIEEDDDAGGSDELDVLSKLEETKELQKLRERPHGISAVALAIGKR 75
Query: 66 -------------SAAAAGGGGLTKVSEKNEGDGEKD---ELVLQDTFAQETAVMVEDPN 109
G + V + D E + F+ ET ED
Sbjct: 76 ITVEEEVTVNDPFKVTTGGMADMKAVKAGKQNSSSVDDAYETGIGTQFSVETNTRDEDAE 135
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEES----STQWTTG 165
M+KY+E++LAK++G + D+ L E +PE+L+ ++S +S S Q +G
Sbjct: 136 MMKYIEEQLAKRKGLMQEDEDKSNKYLTPEEIAFSSVPEYLRVKSSVQSEEMLSNQMLSG 195
Query: 166 IAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 218
I EV L IE K+KNIE TE AK+ L ++RL + +P++ + ++ Q R
Sbjct: 196 IPEVDLGIEAKIKNIEATEEAKQKLLQERLRKKDGPSMFVPTNMAVNFVQHNR 248
>gi|194864932|ref|XP_001971179.1| GG14814 [Drosophila erecta]
gi|190652962|gb|EDV50205.1| GG14814 [Drosophila erecta]
Length = 294
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 77/236 (32%), Positives = 117/236 (49%), Gaps = 35/236 (14%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +E ET E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDETET---------EEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMQQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRG---KNIDVNDRVENDLKHAED-ELYKIPEHLKKRNSEES----STQWT 163
KY+EQEL K++G +++ +D N ED LY +P+HL++ +S S S Q
Sbjct: 129 KYIEQELQKRKGGGTEDVPEDDGDMNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQML 188
Query: 164 TGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 218
GI EV L I K++NIE TE AK KL+Q+ + S F +P++ + ++ Q R
Sbjct: 189 NGIPEVDLGIVAKIRNIEATEEAKQKLMQDAKNKKDGPSQF-VPTNMAVNFMQHNR 243
>gi|57092043|ref|XP_537817.1| PREDICTED: uncharacterized protein C9orf78 isoform 1 [Canis lupus
familiaris]
Length = 289
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L ++ P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKDEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|313227794|emb|CBY22942.1| unnamed protein product [Oikopleura dioica]
Length = 329
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 127/267 (47%), Gaps = 42/267 (15%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGI---------- 57
K + K+NFRKR E EE E E L L ++K ++ ++R +G+
Sbjct: 4 KSKSKRNFRKRRTEVNEEEPENSQVYEVENGLELAKLK--RELKKRTAGVNSESLAKGVK 61
Query: 58 ------PAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNML 111
P P L S GGGLT++ EK G+ EKD + TF E + E+ M
Sbjct: 62 TPRFDDPNDPYKLNS-----GGGLTQIREKRLGNNEKDVTQISSTFKTEKKIRDEEEEMN 116
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDE-----LYKIPE-------HLKKRNSEESS 159
K++E E+ K+RG + ++ +L+ ED LY+IPE HL R S
Sbjct: 117 KFIESEILKRRGIESATKESMKQNLR-LEDIVDPKFLYEIPEKYRATSKHL--REDGLLS 173
Query: 160 TQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMG--RAKSDFSIPSSYS--ADYFQ 215
Q +GI EV L + KL+NIE TEAAK+LL +K + A SD S SY+ A +
Sbjct: 174 AQMLSGIPEVDLGVNNKLQNIERTEAAKRLLVDKFIKDEKEASSDKSHERSYAREAAVNR 233
Query: 216 RGRDYAEKLRREHPELYKDRGSQDDGA 242
G ++ ++ +H YK ++ D
Sbjct: 234 GGNEFTDQFYSQHMRFYKGEEAETDAV 260
>gi|432116600|gb|ELK37393.1| hypothetical protein MDA_GLEAN10011232 [Myotis davidii]
Length = 289
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 135/263 (51%), Gaps = 29/263 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAA 254
+ +P++ + +Y Q R Y E+L R + E K R + G +P + +
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLR-VGDTEKPEPDRSPPN 264
Query: 255 GSR---QAATDQFMLERFRKRER 274
R + ATD + E+F+K R
Sbjct: 265 RKRPPNEKATDDYHYEKFKKMNR 287
>gi|351697008|gb|EHA99926.1| hypothetical protein GW7_01886 [Heterocephalus glaber]
Length = 289
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 132/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKINEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|391325854|ref|XP_003737442.1| PREDICTED: uncharacterized protein C9orf78 homolog [Metaseiulus
occidentalis]
Length = 277
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 27 TNKLSDDEEER--RLALEEIKFLQKQRERKSGIPAIPSALQSAAAA------------GG 72
T L DDE++ R LE+ K LQK R+R G+ L A G
Sbjct: 37 TTPLIDDEDDHVDRSVLEDTKELQKLRKRPHGVSVEALILGKPVADTEEKVSDPFKIDSG 96
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
GGLT + + + +V+ + FA ET ED +M+KY+E EL K++G
Sbjct: 97 GGLTDMKASS-----TETIVIGNQFASETNERDEDADMMKYIEAELKKRQGTQQQTEAEA 151
Query: 133 ENDLKHAEDELYKI-PEHLKK----RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 187
+ +ED L +I P HL++ +N E S Q GI EV L +E +++NIE TE AK
Sbjct: 152 KPLSLKSEDLLMQILPNHLERSQGQKNEEMLSNQMLAGIPEVDLGMEERIRNIEATEEAK 211
Query: 188 KLLQEKRLMGRAKSDFSIPSSYSADY-FQRGRDYAEKLRRE 227
+ +R+ G+ K +P++ S ++ Q+ + E++RRE
Sbjct: 212 MKMLHERMSGKRKETSLVPTNISVNFESQQKKPKKEQVRRE 252
>gi|354503906|ref|XP_003514021.1| PREDICTED: uncharacterized protein C9orf78 homolog [Cricetulus
griseus]
gi|344258464|gb|EGW14568.1| Uncharacterized protein C9orf78-like [Cricetulus griseus]
Length = 289
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 131/264 (49%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMTTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED
Sbjct: 88 DKVSEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|350396974|ref|XP_003484725.1| PREDICTED: uncharacterized protein C9orf78-like [Bombus impatiens]
Length = 296
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 140/299 (46%), Gaps = 42/299 (14%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL 64
I KK+ +K RKR +E+ ++E R +EE+K +QK RER GI + AL
Sbjct: 9 IEFKKKSRKPIRKRQVSSDEDDN---ENEEASVREKVEEMKTIQKLRERPKGINVVGLAL 65
Query: 65 QSAAAA-----------GGGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNML 111
GG + KN + D E + F ET ED M+
Sbjct: 66 GENVTPDVMTSDPFNVKTGGMVNMTVLKNTKLKQNDAYETGIGTQFNAETNKRDEDEEMV 125
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLKK----RNSEESSTQW 162
KY+E+EL+K++ K + N+ K + E L +PEHL++ R+ E S Q
Sbjct: 126 KYIEEELSKRKSKTEGTTENGSNNDKGSYCSPEEAALQAVPEHLRQSSAHRSEEMLSNQM 185
Query: 163 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR--- 218
+GI EV L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R
Sbjct: 186 LSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNRFNI 244
Query: 219 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D+ +K +++ E K +DD R DN + ATD + ERF+K+ R
Sbjct: 245 EDTDF-QKSKQDSDERKKVAAPRDDYKSKR-KDNG-------EKATDDYHYERFKKQFR 294
>gi|346468277|gb|AEO33983.1| hypothetical protein [Amblyomma maculatum]
Length = 336
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 81/136 (59%), Gaps = 4/136 (2%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKI 146
E D + L +TF+ ET ED +M+KY+E+ELAK+RG+ + + + +D L+ +
Sbjct: 123 ELDAVNLGNTFSVETNQRDEDADMMKYIEEELAKRRGRVQETPAEEKTQVVDEKDVLFHV 182
Query: 147 PEHLKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 202
PEHL+K S++S S Q +GI EV L IE +++NIE TE AK L +R+ + +
Sbjct: 183 PEHLRKSTSKKSEEMLSNQMLSGIPEVDLGIEERIRNIEATEEAKLKLIRERMARKERET 242
Query: 203 FSIPSSYSADYFQRGR 218
+P++ + ++ Q R
Sbjct: 243 SFVPTNMAVNFVQHNR 258
>gi|196010774|ref|XP_002115251.1| hypothetical protein TRIADDRAFT_50676 [Trichoplax adhaerens]
gi|190582022|gb|EDV22096.1| hypothetical protein TRIADDRAFT_50676 [Trichoplax adhaerens]
Length = 270
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 139/286 (48%), Gaps = 42/286 (14%)
Query: 12 KKNFRKR---SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPA-------IP 61
K+N+RKR S ++E+ +SD ++ K LQK R + GI + +P
Sbjct: 3 KRNYRKRRDSSDDDEKVGDESISD-------VIKRAKELQKFRAKPRGIDSSELDKGNVP 55
Query: 62 SAL----QSAAAAGGGGLTKVSEKNEG--DGEKDELVLQDTFAQETAVMVEDPNMLKYVE 115
+ GGL + +G D D++ L F+ ET ED M+KY+E
Sbjct: 56 DVEVPEDEDPFKLKTGGLIDMDHAKKGGVDEMGDKISLGKNFSAETNTFDEDAAMMKYIE 115
Query: 116 QELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQL 171
ELAKK+G + +D K ED LY++PE+L+ K++ E S Q +GI E+ L
Sbjct: 116 VELAKKKGV-VSQDDEDSRSGKVLEDSLYELPENLRITSAKKSEEMLSNQMLSGIPEIDL 174
Query: 172 PIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFS--IPSSYSADYFQRGRDYAEKLRREH 228
I+ KL+NIE TE AK ++L ++R R K++ S +P + + +Y Q R Y + + E
Sbjct: 175 GIDAKLRNIEATENAKLEMLMKRR---RKKNEISSMVPINIAVNYVQHTR-YVD-IDVEE 229
Query: 229 PELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
+ + + D + N + ATD + E+FRK+ R
Sbjct: 230 EVISRKTATNDRATTQKRRHN------WKNTATDDYHFEKFRKQMR 269
>gi|74201040|dbj|BAE37395.1| unnamed protein product [Mus musculus]
Length = 289
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 131/264 (49%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPA----IPSALQ----------SAAAAGGGGLTKVSEKNE 83
RL LEE +Q R+R +G+ A + +Q A G + K+ E+ +
Sbjct: 28 RLKLEETGEVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMATGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED
Sbjct: 88 DKVSEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|301758846|ref|XP_002915284.1| PREDICTED: uncharacterized protein C9orf78-like [Ailuropoda
melanoleuca]
Length = 385
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 124 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 183
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 184 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 241
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 242 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 301
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 302 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 361
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 362 KRPANEK--ATDDYHYEKFKKMNR 383
>gi|198428614|ref|XP_002128903.1| PREDICTED: similar to Uncharacterized protein C9orf78
(Hepatocellular carcinoma-associated antigen 59) [Ciona
intestinalis]
Length = 291
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/267 (31%), Positives = 132/267 (49%), Gaps = 34/267 (12%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQSAAA-------------AGGGGLTK---VSEK 81
R LE K LQK R+R+ G+ A+ A + GGL + V ++
Sbjct: 27 RDMLEATKELQKIRKRQMGVNAVSLATGAKLKKVDNLDVEADPFKMTTGGLVEMGNVKDR 86
Query: 82 NEG----DGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLK 137
N D ++D L TF+ ET ED + Y+E EL +++G+ + +D+ +
Sbjct: 87 NRDRTYEDVDRDVTNLGHTFSVETNRRDEDAELTAYIENELKRRKGETSNGDDKKAKE-- 144
Query: 138 HAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQE 192
AED+LY++PEHL+ K++ E S Q +GI EV L I+ K+KNIE TE AK KL+ E
Sbjct: 145 SAEDKLYQLPEHLQIKVGKQSEEMLSNQMLSGIPEVDLGIDTKIKNIERTEEAKQKLITE 204
Query: 193 KRLMGRAKSDFSIPSSYSADYFQRGR----DYAEKLRREHPELYKDRGSQDDGAGSRP-T 247
++ F +P++ + +Y Q R D + ++ PE D +G RP +
Sbjct: 205 LSKKKEKRTSF-VPTNMAVNYVQHKRFMHNDGHKNATKKEPETEAPPLVVGD-SGRRPAS 262
Query: 248 DNSTDAAGSRQAATDQFMLERFRKRER 274
+ + A + +TD F ++FRK R
Sbjct: 263 EVAQHRADNSGKSTDNFHYDKFRKAAR 289
>gi|197102566|ref|NP_001125335.1| uncharacterized protein C9orf78 homolog [Pongo abelii]
gi|75042142|sp|Q5RC87.1|CI078_PONAB RecName: Full=Uncharacterized protein C9orf78 homolog
gi|55727739|emb|CAH90620.1| hypothetical protein [Pongo abelii]
Length = 289
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 132/264 (50%), Gaps = 31/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 197
LY++PE+++ K+ E S Q +GI EV I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDQGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 198 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|221131461|ref|XP_002156012.1| PREDICTED: uncharacterized protein C9orf78 homolog [Hydra
magnipapillata]
Length = 292
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 126/273 (46%), Gaps = 47/273 (17%)
Query: 40 ALEEIKFLQKQRERKSGIPA--IPSALQSAA------------AAGGGGLTKVSEKNEGD 85
++E+ K LQK R R G+ + S L + + GGGL +
Sbjct: 24 SIEDRKELQKFRSRPKGVSVEVLASLLDTVSQNKEKTNDDPFKLNSGGGLV------DNG 77
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
+D F+ ET ED +LKY+E+ L KKRG N N + + ED LY+
Sbjct: 78 KSRDLSNFGTNFSTETNQRDEDKQLLKYIEEGLMKKRGVNQQENP--DTKVLSKEDLLYQ 135
Query: 146 IPEHLK-----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRA 199
+PE+LK ++ E S+Q GI EV L IE K+KNIE TE AK K+++E + +
Sbjct: 136 LPENLKVQSKIMKSEEMLSSQVLCGIPEVDLGIEAKIKNIEATEEAKMKMIEESKNRKQQ 195
Query: 200 KSDFSIPSSYSADYFQRGRDY----------AEKLRREHPELYKDRGSQDDG------AG 243
S+F +P++ ++++ R Y E+ R + D+G G
Sbjct: 196 ASEF-VPTNMASNFMHHSRFYDEKKAIEKEKKEEKERLENAVVIDKGPTVGGDIIENAED 254
Query: 244 SRPTDNSTDAAGSR--QAATDQFMLERFRKRER 274
S+ N+ + G R + +D FM E F+KR R
Sbjct: 255 SKFIRNTMSSGGKRNKKGTSDDFMFESFKKRAR 287
>gi|340716340|ref|XP_003396657.1| PREDICTED: uncharacterized protein C9orf78-like isoform 1 [Bombus
terrestris]
gi|340716342|ref|XP_003396658.1| PREDICTED: uncharacterized protein C9orf78-like isoform 2 [Bombus
terrestris]
Length = 296
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 136/292 (46%), Gaps = 42/292 (14%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAA- 70
+K RKR +E+ ++E R +EE+K +QK RER GI + AL
Sbjct: 16 RKPIRKRQVSSDEDDN---ENEEASVREKVEEMKTIQKLRERPKGINVVGLALGENVTPD 72
Query: 71 ----------GGGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQEL 118
GG + KN + D E + F ET ED M+KY+E+EL
Sbjct: 73 VMTSDPFNVKTGGMVNMTVLKNTKLKQNDAYETGIGTQFNAETNKRDEDEEMVKYIEEEL 132
Query: 119 AKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLKK----RNSEESSTQWTTGIAEV 169
+K++ K + N+ K + E L +PEHL++ R+ E S Q +GI EV
Sbjct: 133 SKRKSKTEGTTENGSNNDKGSYCSPEEAALQAVPEHLRQSSAHRSEEMLSNQMLSGIPEV 192
Query: 170 QLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR------DYAE 222
L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R D+ +
Sbjct: 193 DLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNRFNIEDTDF-Q 250
Query: 223 KLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
K +++ E K +DD R DN + ATD + ERF+K+ R
Sbjct: 251 KSKQDSDERKKVAAPRDDYKSKR-KDNG-------EKATDDYHYERFKKQFR 294
>gi|320169949|gb|EFW46848.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 388
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 133/281 (47%), Gaps = 59/281 (20%)
Query: 31 SDD----EEERRLALEEIKFLQKQRERKSGI-PA-------IPSALQSAAAA-------G 71
SDD EE R LE +K LQ+ R+RKSG+ PA + S + AA
Sbjct: 131 SDDADASEESHRERLERMKELQRFRQRKSGVTPAGLALGQRVKSVAEELLAASDPFKLKS 190
Query: 72 GGGL-------TKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK 124
GGGL K +++ + L TF+ ET ED ML Y++++LA +RG
Sbjct: 191 GGGLVDKAAIRVKDRDRDRDADDDKNFSLTSTFSHETKARDEDKMMLSYIDEQLAIRRGT 250
Query: 125 NIDVNDRVENDLKHAEDELYKIPEHLK-----KRNSEES-STQWTTGIAEVQLPIEYKLK 178
N + N+ +LY +P++L+ K SE+S S+ +GI EV L ++ +++
Sbjct: 251 N---ANDNANNANDPTAQLYVVPKNLEATSALKNVSEDSISSALLSGIPEVDLGVQSRIR 307
Query: 179 NIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYK--DRG 236
NIEETE A+ L++KRL G S+ + H + DR
Sbjct: 308 NIEETEKARIELEQKRLSGNQSSNVVL---------------------HHHRFFNTADRS 346
Query: 237 SQDDGAGSRPTDNSTDAAGSRQA-ATDQFMLERFRKRERHR 276
+ D AG++ + +A +RQ ATDQ + ++F+K + R
Sbjct: 347 DRSDSAGAQQHGSQNSSAAARQPRATDQLVFDKFKKAQLGR 387
>gi|449266760|gb|EMC77776.1| hypothetical protein A306_15006, partial [Columba livia]
Length = 275
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 131/262 (50%), Gaps = 33/262 (12%)
Query: 35 EERRLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLTKVSEKNE 83
EE RL LEE K +Q R+R +G+ A+ + + GG+ + + E
Sbjct: 23 EEVRLKLEEAKEVQSLRKRPNGVSAVALLVGEKVQEEATLVDDPFKMKSGGMVDMKKLKE 82
Query: 84 GD----GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+A
Sbjct: 83 RGKDRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNA 140
Query: 140 EDELYKIPEHLKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGR 198
ED LY++PE+++ SS + T E+ L I+ K+KNI TE AK KLL E++ +
Sbjct: 141 EDSLYELPENIRV-----SSAKKT---EEMLLGIDAKIKNIISTEEAKAKLLAEQQNKKK 192
Query: 199 AKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAAG 255
+P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 193 DSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERSPPNR 251
Query: 256 SR---QAATDQFMLERFRKRER 274
R + ATD + E+F+K R
Sbjct: 252 KRPLNEKATDDYHYEKFKKMNR 273
>gi|54400388|ref|NP_001005945.1| chromosome 9 open reading frame 78 [Danio rerio]
gi|53734462|gb|AAH83464.1| Zgc:103692 [Danio rerio]
gi|148725502|emb|CAN88766.1| novel protein (zgc:103692) [Danio rerio]
Length = 289
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 128/264 (48%), Gaps = 30/264 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQS---------------AAAAGGGGLTKVSEKN 82
R L+E K LQ R+R+ G+ +I + L G + KV +++
Sbjct: 27 RSKLDEAKELQSLRKRQHGV-SIATLLVGEKLPLEAELEDDPFKLKTGGVVDMKKVKDRS 85
Query: 83 -EGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED 141
+ ++++L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+ ED
Sbjct: 86 RDMTVDENDLNLGTSFSAETNRRDEDADMMKYIETELKKKKGMVEAEEQKVK--VKNPED 143
Query: 142 ELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLM 196
LY++PE++ K+ E S Q +GI EV L I+ K+KNI TE AK KLL E+R
Sbjct: 144 LLYELPENINVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIINTEEAKAKLLAEQRNK 203
Query: 197 GRAKSDFSIPSSYSADYFQRGRDYAE------KLRREHPELYKDRGSQDDGAGSRPTDNS 250
+ +P++ + +Y Q R Y E + RE P+ R + + +
Sbjct: 204 KKDSGTSFVPTNIAVNYVQHNRFYHEDSNAPQRRNREEPKARPLRVGDTEKPAPEASPPN 263
Query: 251 TDAAGSRQAATDQFMLERFRKRER 274
+ + ATD + E+F+K R
Sbjct: 264 FRKRPNNEKATDDYHYEKFKKMNR 287
>gi|240849477|ref|NP_001155632.1| uncharacterized protein LOC100164612 [Acyrthosiphon pisum]
gi|239792059|dbj|BAH72414.1| ACYPI005606 [Acyrthosiphon pisum]
Length = 321
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 125/267 (46%), Gaps = 44/267 (16%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAA------------AGGGGLTKVSEKNEGD--- 85
LEE+K +QK R+R +G+ I AL + GGL ++ G
Sbjct: 64 LEEMKTMQKLRDRPNGVNIISLALGEKLSQEEEKLMVDPFKVKTGGLINMNALKTGQVTQ 123
Query: 86 -GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVND--RVENDLKHAEDE 142
+ + + F+ ET ED M+KY+++++A + G+ +D++D N + E
Sbjct: 124 VDDAYDTGIGTQFSAETNKRDEDEEMMKYIDEQVAVRTGRTVDIDDDNVSLNKSNYCPPE 183
Query: 143 LYK---IPEHLKK----RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 194
L +P HL+ R+ E S Q +GI EV L I+ K+KNIE TE AK KL+++KR
Sbjct: 184 LAALQAVPSHLRNSTTHRSEEMLSNQMLSGIPEVDLGIDAKIKNIEATEEAKMKLIRDKR 243
Query: 195 LMGRAKSDFSIPSSYSADYFQRGRDYAE-------KLRREHPELYKDRGSQDDGAGSRPT 247
S F +P++ + ++ Q R E K ++ P + +D + + +
Sbjct: 244 NKKDGPSQF-VPTNMAVNFVQHNRFNIEITGPDGKKNYKQQPAVKQDHSDEKNIDKRKKK 302
Query: 248 DNSTDAAGSRQAATDQFMLERFRKRER 274
DN ATD F ERF+K+ R
Sbjct: 303 DN----------ATDDFHYERFKKQFR 319
>gi|7106830|gb|AAF36140.1|AF151054_1 HSPC220 [Homo sapiens]
Length = 176
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 94/165 (56%), Gaps = 9/165 (5%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 16 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 73
Query: 146 IPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 200
+PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 74 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 133
Query: 201 SDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR 245
+P++ + +Y Q R Y E+L H E K R +Q SR
Sbjct: 134 ETSFVPTNMAVNYVQHNRFYHEELNCAHTE--KQRRAQGPALESR 176
>gi|119608315|gb|EAW87909.1| chromosome 9 open reading frame 78, isoform CRA_a [Homo sapiens]
Length = 219
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 107/201 (53%), Gaps = 16/201 (7%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 21 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 78
Query: 146 IPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 200
+PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 79 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 138
Query: 201 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 253
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 139 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 198
Query: 254 AGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 199 ANEK--ATDDYHYEKFKKMNR 217
>gi|281349488|gb|EFB25072.1| hypothetical protein PANDA_003241 [Ailuropoda melanoleuca]
Length = 201
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 107/201 (53%), Gaps = 16/201 (7%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 3 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 60
Query: 146 IPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 200
+PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 61 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 120
Query: 201 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 253
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 121 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 180
Query: 254 AGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 181 ANEK--ATDDYHYEKFKKMNR 199
>gi|349603477|gb|AEP99304.1| Uncharacterized protein C9orf78-like protein, partial [Equus
caballus]
Length = 253
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 107/201 (53%), Gaps = 16/201 (7%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 55 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 112
Query: 146 IPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 200
+PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 113 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 172
Query: 201 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 253
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 173 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 232
Query: 254 AGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 233 ANEK--ATDDYHYEKFKKMNR 251
>gi|6808233|emb|CAB70805.1| hypothetical protein [Homo sapiens]
Length = 241
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 107/201 (53%), Gaps = 16/201 (7%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 43 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 100
Query: 146 IPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 200
+PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 101 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 160
Query: 201 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 253
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 161 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 220
Query: 254 AGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 221 ANEK--ATDDYHYEKFKKMNR 239
>gi|345495075|ref|XP_001606209.2| PREDICTED: uncharacterized protein C9orf78-like [Nasonia
vitripennis]
Length = 297
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 142/297 (47%), Gaps = 40/297 (13%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEER--RLALEEIKFLQKQRERKSGIP---- 58
I KK+ +K RKR +E + DEEE R LEE+K LQ+ RER G+
Sbjct: 12 IEFKKKSRKPLRKRRASSDESNS-----DEEETGVRSKLEELKTLQRLRERPKGVNIAGL 66
Query: 59 AIPSALQSAAAAG------GGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNM 110
A+ + + AA GG+ ++ + D + + F ET ED M
Sbjct: 67 ALGEVVNDSIAASDPFNVKTGGMVNMAALKNVSKQDDAYDTGIGTQFNAETNKRDEDEEM 126
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLKKRNSEES----STQ 161
+KY+E++L+K++ KN + N K E L +PEHL++ ++ +S S Q
Sbjct: 127 VKYIEEQLSKRKNKNNGEKEDESNKNKPTYCSPEEAALQAVPEHLRQSSTHKSEEMLSNQ 186
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR-- 218
+GI EV L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R
Sbjct: 187 MLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNRFN 245
Query: 219 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
+ A+ +R+ K +Q + + DN + ATD + ERF+K+ R
Sbjct: 246 IEDADAQKRKQDAAAKRHAAQASHSKEKRKDND-------EKATDDYHYERFKKQFR 295
>gi|444517774|gb|ELV11788.1| hypothetical protein TREES_T100018213 [Tupaia chinensis]
Length = 269
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 107/201 (53%), Gaps = 16/201 (7%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 71 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 128
Query: 146 IPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 200
+PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 129 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 188
Query: 201 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 253
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 189 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 248
Query: 254 AGSRQAATDQFMLERFRKRER 274
A + ATD + E+F+K R
Sbjct: 249 ANEK--ATDDYHYEKFKKMNR 267
>gi|403298580|ref|XP_003940093.1| PREDICTED: uncharacterized protein C9orf78-like [Saimiri
boliviensis boliviensis]
Length = 358
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 107/200 (53%), Gaps = 16/200 (8%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKI 146
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY++
Sbjct: 78 EEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYEL 135
Query: 147 PEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKS 201
PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 136 PENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSE 195
Query: 202 DFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDAA 254
+P++ + +Y Q R Y E+L +E P+ R G + R N A
Sbjct: 196 TSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRPA 255
Query: 255 GSRQAATDQFMLERFRKRER 274
+ ATD + E+F+K R
Sbjct: 256 NEK--ATDDYHYEKFKKMNR 273
>gi|332375184|gb|AEE62733.1| unknown [Dendroctonus ponderosae]
Length = 290
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 136/297 (45%), Gaps = 56/297 (18%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLA--LEEIKFLQKQRERKSGIPAIPSALQ 65
K KK+N R++ + SDDEE +++ L E+K Q R+R G+ I AL
Sbjct: 19 KSVKKRNLRQKVKSD--------SDDEETAQISNKLGEMKERQNLRKRPHGVSVIGLALG 70
Query: 66 SAAAAGGGGLTKVSEKNEGDGEKDELVLQ-----------DT-----FAQETAVMVEDPN 109
+ +AG +K K E G + L+ DT F+ ET ED
Sbjct: 71 TKFSAGDEASSKDPFKVEAGGMVNMQALKSGKVKQVDDAYDTGIGTQFSVETNKRDEDEE 130
Query: 110 MLKYVEQELAKKRGK----NIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQ 161
M+K++E EL+KK+GK + + + L E L +P+HL+ KR+ E S Q
Sbjct: 131 MMKFIENELSKKKGKVGQEEPILPTKKSSYLSPEEAALQAVPDHLRESSTKRSEEMLSNQ 190
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR-- 218
+GI EV L IE K+KNIE TE AK +LL E + S F +P++ + ++ Q R
Sbjct: 191 MLSGIPEVDLGIEAKIKNIEATEEAKLRLLWESQNKKNGPSQF-VPTNMAVNFVQHKRYN 249
Query: 219 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D AE R++ T+ + ATD + E+F+K+ R
Sbjct: 250 NDRAEMARKKA-----------------KTEVEDKQKKKDEKATDDYHFEKFKKQFR 289
>gi|48772899|gb|AAT46619.1| hepatocellular carcinoma-associated antigen 59 [Homo sapiens]
Length = 195
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 86/144 (59%), Gaps = 7/144 (4%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 12 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 69
Query: 146 IPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 200
+PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 70 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 129
Query: 201 SDFSIPSSYSADYFQRGRDYAEKL 224
+P++ + +Y Q R Y E+L
Sbjct: 130 ETSFVPTNMAVNYVQHNRFYHEEL 153
>gi|157127876|ref|XP_001655062.1| hypothetical protein AaeL_AAEL010964 [Aedes aegypti]
gi|108872762|gb|EAT36987.1| AAEL010964-PA [Aedes aegypti]
Length = 289
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 137/297 (46%), Gaps = 53/297 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLA-LEEIKFLQKQRERKSGIPAIPSAL-- 64
K + +K FRKR E+E D +E L+ LEE K QK R + +G+ + A+
Sbjct: 14 KSKARKQFRKRIKSEDE-------DKPDEDILSKLEETKEKQKLRNKPNGVNILTLAVGK 66
Query: 65 ----------QSAAAAGGGGLTKV----SEKNEGDGEKDELVLQDTFAQETAVMVEDPNM 110
+ A GG+ + S K + + + + F+ ET ED M
Sbjct: 67 KITVEEEVTNKDLFNAKAGGMVNMQALKSGKIKAVDDAYDTGIGTQFSAETNKRDEDEEM 126
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLKKRNSEES----STQ 161
+KY+E++L+KK+G D E + H E L +P HL + +++ S S Q
Sbjct: 127 MKYIEEQLSKKKGVAKDTTKEPEAESSHKYLSPEEAALLSLPAHLSQTSTQRSEEMLSNQ 186
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR-- 218
+GI EV L IE K+KNIE TE AK K LQE++ S F +PS+ + ++ Q R
Sbjct: 187 MLSGIPEVDLGIEAKIKNIEATEDAKIKFLQEQQRKKDLPSHF-VPSNMAVNFMQHNRFK 245
Query: 219 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
D + +R + E+ + R G P + ATD + ++F+K+ R
Sbjct: 246 IDQPVQQKRRYTEVQQHRS----GDEKIP-----------KKATDDYHFDKFKKQYR 287
>gi|56757920|gb|AAW27100.1| SJCHGC04993 protein [Schistosoma japonicum]
Length = 312
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 31/213 (14%)
Query: 39 LALEEIKFLQKQRERKSGI-----------PAIPSALQS----AAAAGGGGLTKVSEKNE 83
+ + I+ LQK R+R +GI P + A+ + G L +S +
Sbjct: 39 VVVGAIRELQKLRKRPAGISLSALATGKEVPDVNLAIANDPFKLKTGGLVDLNSISSTKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+ + D E L TFA ET ED M++Y+E+ELAK++G DR E+ D
Sbjct: 99 AEEDDDVEAHLAKTFATETNKRDEDAEMIRYIEEELAKRKGLTKPSLDRAED-----SDL 153
Query: 143 LYKIPEHLKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL--- 195
L +PE+LK ++ S Q GI E+ L +E K+KNIE TE AK++L +KR
Sbjct: 154 LQDVPEYLKPSIGQQKEDMLSNQMLCGIPEIDLGVEAKMKNIEATEEAKQILLKKRFNRK 213
Query: 196 MGRAKSDFSIPSSYSADYFQRGR--DYAEKLRR 226
G + + + P + + ++ Q R Y +++ R
Sbjct: 214 HGHSVDEIA-PINMALNFVQHSRWNSYTDRVSR 245
>gi|226469096|emb|CAX70027.1| hypothetical protein [Schistosoma japonicum]
Length = 312
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 31/213 (14%)
Query: 39 LALEEIKFLQKQRERKSGI-----------PAIPSALQS----AAAAGGGGLTKVSEKNE 83
+ + I+ LQK R+R +GI P + A+ + G L +S +
Sbjct: 39 VVVGAIRELQKLRKRPAGISLSALATGKEVPDVNLAIANDPFKLKTGGLVDLNSISSTKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+ + D E L TFA ET ED M++Y+E+ELAK++G DR E+ D
Sbjct: 99 AEEDDDVEAHLAKTFATETNKRDEDAEMIRYIEEELAKRKGLTKPSLDRAED-----SDL 153
Query: 143 LYKIPEHLKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL--- 195
L +PE+LK ++ S Q GI E+ L +E K+KNIE TE AK++L +KR
Sbjct: 154 LQDVPEYLKPSIGQQKEDMLSNQMLCGIPEIDLGVEAKMKNIEATEEAKQILLKKRFNRK 213
Query: 196 MGRAKSDFSIPSSYSADYFQRGR--DYAEKLRR 226
G + + + P + + ++ Q R Y +++ R
Sbjct: 214 HGHSVDEIA-PINMALNFVQHSRWNSYTDRVSR 245
>gi|334311928|ref|XP_001369366.2| PREDICTED: uncharacterized protein C9orf78-like [Monodelphis
domestica]
Length = 240
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 94/178 (52%), Gaps = 24/178 (13%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA---- 68
K FRKR + E E+ + D EE RL LEE K +Q R R +G+ A+ +
Sbjct: 5 KTFRKRRDDSESESDEQ---DSEEVRLKLEETKEVQSLRRRPNGVSAVALLVGEKVQEET 61
Query: 69 ----------AAGGGGLTKVSEKNEGD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQE 117
A G + K+ E+N+ E+++L L +F+ ET ED +M+KY+E E
Sbjct: 62 TLVDDPFKIKAGGMVDMKKLKERNKDRINEEEDLNLGTSFSAETNRRDEDADMMKYIETE 121
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQL 171
L K++G I N+ + LK+AED LY++PE+++ K+ E S Q +GI EV L
Sbjct: 122 LKKRKG--IVENEEQKVKLKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDL 177
>gi|428169346|gb|EKX38281.1| hypothetical protein GUITHDRAFT_115622 [Guillardia theta CCMP2712]
Length = 299
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 80/158 (50%), Gaps = 20/158 (12%)
Query: 139 AEDELYKIPEHL----KKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
AEDELY IPE K R +++ W TGI EV+LP+E KLKNIEETE AKK + E+R
Sbjct: 142 AEDELYVIPEEYRVKSKTRQLGDAAETWLTGIVEVELPLEEKLKNIEETEKAKKKILEER 201
Query: 195 LMGRAKSDFSIPSSYSADYFQ---RGRDYAEKLRREHPELYKDRGSQDDGAGSRPTD--- 248
+ G +S F + ++ Y + R + + +++ L G G+ +
Sbjct: 202 INGVRQSTFVVDTASEKGYMRMEGEMRKGKKVMAKQNIPLISKEAEAVGGVGNFNANYIK 261
Query: 249 -NSTDAAGSRQAA---------TDQFMLERFRKRERHR 276
N + G AA +D ++ERF+KR +HR
Sbjct: 262 RNGKEREGRPAAASEVKRPDLSSDDLVMERFKKRLKHR 299
>gi|91078372|ref|XP_974116.1| PREDICTED: similar to CG7974 CG7974-PA [Tribolium castaneum]
gi|270003886|gb|EFA00334.1| hypothetical protein TcasGA2_TC003173 [Tribolium castaneum]
Length = 299
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 108/213 (50%), Gaps = 28/213 (13%)
Query: 32 DDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDEL 91
+D EE LEE+K LQ R+R G+ A+ AL + ++K K + G +
Sbjct: 41 EDLEEVSTKLEEMKELQNLRKRPHGVNALGLALGTKITIEDECISKDPFKVKSGGMVNMQ 100
Query: 92 VLQ-----------DT-----FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVEND 135
L+ DT F+ ET ED M+K++E+EL+KK+ K ++ ++ E +
Sbjct: 101 ALKSGKVKQVDDAYDTGIGTQFSVETNKRDEDEEMMKFIEEELSKKKRK-VEPQEQAEAE 159
Query: 136 LKHA-----EDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAA 186
K A E L +P+HL+ KR+ E S Q GI EV L IE K+KNIE TE A
Sbjct: 160 NKSAYTSPEEAALRAVPDHLRESSTKRSEEMLSNQMLNGIPEVDLGIEAKIKNIEATEEA 219
Query: 187 K-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 218
K +LL EK+ S F +P++ + ++ Q R
Sbjct: 220 KLRLLWEKQNKKDGPSPF-VPTNMAVNFVQHNR 251
>gi|195587004|ref|XP_002083257.1| GD13638 [Drosophila simulans]
gi|194195266|gb|EDX08842.1| GD13638 [Drosophila simulans]
Length = 286
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 136/293 (46%), Gaps = 52/293 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +EEE +E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDEEE---------KEEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMKQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLKKRNSEES----STQ 161
KY+EQEL K++G + D E+D L + LY +P+HL++ +S S S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAAEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYA 221
GI EV L I + EA +KLLQ+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIRPR-------EAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFNI 238
Query: 222 EKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 239 EDNSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 284
>gi|312375465|gb|EFR22835.1| hypothetical protein AND_14137 [Anopheles darlingi]
Length = 263
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 96/189 (50%), Gaps = 19/189 (10%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRG------KNIDVNDRVENDLKHAEDELYKIPEHL 150
F+ ET ED M+KY+E+EL+K++G K ID + L E L +P HL
Sbjct: 81 FSAETNKRDEDEEMMKYIEEELSKRKGIAQQQDKPIDGESSTKY-LSPEEAALLSLPAHL 139
Query: 151 KK----RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSI 205
+ R+ E S Q +GI E+ L IE K+KNIE TE AK K +QE++ S F +
Sbjct: 140 SQTSSLRSEEMLSNQMLSGIPEIDLGIEAKIKNIEATEEAKLKYMQEQQRKKNLPSHF-V 198
Query: 206 PSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFM 265
PS+ + ++ Q R R ++P K R D G + D D ++ ATD +
Sbjct: 199 PSNMAVNFMQHNR-----YRIDNPAPPKRRYQDDHHRGGQRGDQRNDDRIPKK-ATDDYH 252
Query: 266 LERFRKRER 274
++F+K+ R
Sbjct: 253 FDKFKKQYR 261
>gi|170035810|ref|XP_001845760.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878197|gb|EDS41580.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 309
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/238 (30%), Positives = 112/238 (47%), Gaps = 35/238 (14%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL--- 64
K + K+N R+R E+ SDD++E LEE K Q+ R + +G+ + A+
Sbjct: 15 KPKSKRNLRQRIKTED-------SDDDQEVLTKLEETKEKQRLRNKTNGVNLLSLAMGKK 67
Query: 65 ---------QSAAAAGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNML 111
+ GG+ + G + E + F+ ET ED M+
Sbjct: 68 ITIEEEVTNKDPFNTKSGGMVNMQALKSGKIKTVEDPYDTGIGTQFSAETNKRDEDEEMM 127
Query: 112 KYVEQELAKKRGKNIDV---NDRVENDLKHAEDE---LYKIPEHLKKRNSEES----STQ 161
KY+EQ+L KK+G + + D E+ K+ E L +P HL +S+ S S Q
Sbjct: 128 KYIEQQLGKKKGLDKETAGDGDAGESSAKYLSPEEAALLSLPAHLSHTSSQRSEEMLSNQ 187
Query: 162 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 218
+GI EV L IE K+KNIE TE AK K +QE++ S F +P++ + ++ Q R
Sbjct: 188 MLSGIPEVDLGIEAKIKNIEATEDAKLKFMQEQQRKKDMPSHF-VPTNMAVNFMQHNR 244
>gi|322793759|gb|EFZ17143.1| hypothetical protein SINV_07529 [Solenopsis invicta]
Length = 293
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/213 (32%), Positives = 106/213 (49%), Gaps = 30/213 (14%)
Query: 30 LSDDEEER------RLALEEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GG 72
LSDD + R +EE+K +QK RER +G+ + AL + A+ G
Sbjct: 28 LSDDNDSEGEKMSLREKVEEMKIIQKLRERPAGVDIVGLALGESVASDVITSDPFNMKTG 87
Query: 73 GGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVND 130
G + + KN D E + F ET ED M+KY+E+EL+K++ KN +
Sbjct: 88 GMVNMTALKNTKHKPNDAYETGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKNNNDAA 147
Query: 131 RVENDLKHA-----EDELYKIPEHLKK----RNSEESSTQWTTGIAEVQLPIEYKLKNIE 181
N+ K + E L +PEHL++ R+ E S Q +GI EV L IE K++NIE
Sbjct: 148 NSANNEKGSYCSPEEAALRAVPEHLRQSSANRSEEMLSNQMLSGIPEVDLGIEAKIRNIE 207
Query: 182 ETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADY 213
TE AK KLL ++ S F +P++ + ++
Sbjct: 208 ATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNF 239
>gi|353228848|emb|CCD75019.1| hypothetical protein Smp_035150 [Schistosoma mansoni]
Length = 315
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 98/204 (48%), Gaps = 30/204 (14%)
Query: 39 LALEEIKFLQKQRERKSGIPA--------IPSALQSAA-------AAGGGGLTKVSEKNE 83
+ +E I+ LQK R+R +G+ +P + A G L +S +
Sbjct: 39 VVVEAIRELQKLRKRPAGVSLSALATGKEVPDINLTIANDPFRLKTGGLVDLNSISSAKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK-NIDVNDRVENDLKHAED 141
+ + D E L TF ET ED M+KY+E+E+AK++G DR E+ D
Sbjct: 99 SEEDDDVEARLAKTFTTETNKRDEDAEMIKYIEEEVAKRKGLIKPSTLDRDED-----SD 153
Query: 142 ELYKIPEHLKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMG 197
L +PE+LK ++ S Q GI EV L +E K+KNIE TE AK+ L KRL G
Sbjct: 154 LLQDVPEYLKPSIGQQKEDMLSNQMLCGIPEVDLGVEAKMKNIEATEEAKQTLFRKRL-G 212
Query: 198 RA---KSDFSIPSSYSADYFQRGR 218
R ++ P+S + ++ Q R
Sbjct: 213 RKHGYSTNHIAPTSMAVNFVQHSR 236
>gi|340378972|ref|XP_003388001.1| PREDICTED: uncharacterized protein C9orf78 homolog, partial
[Amphimedon queenslandica]
Length = 237
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 67/238 (28%), Positives = 107/238 (44%), Gaps = 54/238 (22%)
Query: 89 DELV--LQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVND-----------RVEND 135
DE+V L F+ ET ++ +MLKY++ E+A+++GK + R E
Sbjct: 4 DEVVKRLTSQFSAETQTRDDETHMLKYIDDEIARRKGKQDEETLQLYLKLLPLFYRYEAK 63
Query: 136 LKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQ 191
+ LYKIPE + KR+ + S Q +GI EV L ++ K KNIEETE AKK +
Sbjct: 64 IA----SLYKIPEKYQVEDSKRSEDMLSNQMLSGIPEVDLGLDAKFKNIEETEIAKKKMA 119
Query: 192 EKRLMGRAKSDFSIPSSYSADY----------------------FQRGRDYAEKLRREHP 229
E +L + K IP+++++++ +R ++ E R+ P
Sbjct: 120 EDKLKMKDKQTSMIPTNFASNFTHHSLRFFKDRGRGHHRRGGGGGKRSQEEEETESRDQP 179
Query: 230 ELYKDRGS----------QDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER-HR 276
GS +D+G G T A Q TD + ++FRK+ + HR
Sbjct: 180 SFIPVVGSFDEPELKPTTRDEGGGGPNTKKRKPGADHSQLPTDDYHFDKFRKKAKSHR 237
>gi|226486444|emb|CAX74351.1| hypothetical protein [Schistosoma japonicum]
Length = 312
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 103/213 (48%), Gaps = 31/213 (14%)
Query: 39 LALEEIKFLQKQRERKSGI-----------PAIPSALQS----AAAAGGGGLTKVSEKNE 83
+ + I+ LQK R+R +GI P + A+ + G L +S +
Sbjct: 39 VVVGAIRELQKLRKRPAGISLSALATGKEVPDVNLAIANDPFKLKTGGLVDLNSISSTKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+ + D E L TFA ET ED M++Y+E+ELAK++G DR E+ D
Sbjct: 99 AEEDDDVEAHLAKTFATETNKRDEDAEMIRYIEEELAKRKGLTKPSLDRAED-----SDL 153
Query: 143 LYKIPEHLKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL--- 195
L +PE+LK ++ S Q GI E+ L +E K+KNIE E AK++L +KR
Sbjct: 154 LQDVPEYLKPSIGQQKEDMLSNQMLCGIPEIDLGVEAKMKNIEAPEEAKQILLKKRFNRK 213
Query: 196 MGRAKSDFSIPSSYSADYFQRGR--DYAEKLRR 226
G + + + P + + ++ Q R Y +++ R
Sbjct: 214 HGHSVDEIA-PINMALNFVQHSRWNSYTDRVSR 245
>gi|256092888|ref|XP_002582109.1| hypothetical protein [Schistosoma mansoni]
Length = 314
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 106/227 (46%), Gaps = 42/227 (18%)
Query: 22 EEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG---------- 71
EE+ N + + + +E I+ LQK R+R +G+ SA A G
Sbjct: 22 EEDHLENSTAPVVADSTVVVEAIRELQKLRKRPAGVSL------SALATGKEVPDINLTI 75
Query: 72 --------GGGLTKVSEKNEG-DGEKDELV---LQDTFAQETAVMVEDPNMLKYVEQELA 119
GGL ++ + E+D+ V L TF ET ED M+KY+E+E+A
Sbjct: 76 ANDPFRLKTGGLVDLNSISSAKQSEEDDDVEARLAKTFTTETNKRDEDAEMIKYIEEEVA 135
Query: 120 KKRG-KNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEES----STQWTTGIAEVQLPIE 174
K++G DR E+ D L +PE+LK ++ S Q GI EV L +E
Sbjct: 136 KRKGLIKPSTLDRDED-----SDLLQDVPEYLKPSIGQQKEDMLSNQMLCGIPEVDLGVE 190
Query: 175 YKLKNIEETEAAKKLLQEKRLMGRA---KSDFSIPSSYSADYFQRGR 218
K+KNIE TE AK+ L KRL GR ++ P+S + ++ Q R
Sbjct: 191 AKMKNIEATEEAKQTLFRKRL-GRKHGYSTNHIAPTSMAVNFVQHSR 236
>gi|58388944|ref|XP_316650.2| AGAP006620-PA [Anopheles gambiae str. PEST]
gi|55239374|gb|EAA11347.2| AGAP006620-PA [Anopheles gambiae str. PEST]
Length = 295
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 97/188 (51%), Gaps = 23/188 (12%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVEND-----LKHAEDELYKIPEHLK 151
F+ ET ED M+KY+E+EL K++G + +++ E + L E L +P HL
Sbjct: 119 FSAETNKRDEDEEMMKYIEEELGKRKGIAQEQDNQAEGESSGKYLSPEEAALLSLPAHLS 178
Query: 152 KRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIP 206
+ +S+ S S Q +GI E+ L IE K+KNIE TE AK K +QE++ S F +P
Sbjct: 179 QTSSQRSEEMLSNQMLSGIPEIDLGIEAKIKNIEATEDAKLKYMQEQQRKKDLPSHF-VP 237
Query: 207 SSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFML 266
S+ + ++ Q R R ++P K R Q+D R D + ATD +
Sbjct: 238 SNMAVNFMQHNR-----YRIDNPAPAKRR-YQEDHRDQRHDDRVP------KKATDDYHF 285
Query: 267 ERFRKRER 274
++F+K+ R
Sbjct: 286 DKFKKQYR 293
>gi|342318949|gb|EGU10904.1| Hypothetical Protein RTG_03298 [Rhodotorula glutinis ATCC 204091]
Length = 359
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 109/231 (47%), Gaps = 36/231 (15%)
Query: 71 GGGGLTKVSEKNEG-DGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
G GG ++ + +EG + + +++ D F +T + D +ML Y+E ELAKKRG+ +
Sbjct: 129 GTGGADRIRDDSEGPEAKARKIIKTDNFTGQTNTVDVDKHMLAYIEAELAKKRGEASGSS 188
Query: 130 DRVENDLKHAE--DELYKIPEHLK------------KRNSEESSTQWTTG----IAEVQL 171
D N + + DELY++ E K +R+ EE + +TG I EV L
Sbjct: 189 DPSSNPSRPYDPRDELYRVAEKYKFADIAEQEGKKKERDEEEGNVTLSTGMLMGIPEVDL 248
Query: 172 PIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFS-IPSS---YSADYFQRGRDYAEKLRRE 227
I+ KLKNIE TE AK+ L+E G + + +P ++ D F R R
Sbjct: 249 GIDTKLKNIEATEKAKRALREGSRRGSGPEEAAGLPPDKDEFAVDRFYR--------HRR 300
Query: 228 HPELYKDRGSQDDGAGSRPTDNSTDA-----AGSRQAATDQFMLERFRKRE 273
E + ++ + P + DA R+ ATD+ + RF+KR+
Sbjct: 301 PLESDQSALARARYLAANPPETDPDAELRKRKPGRETATDEMAVARFKKRQ 351
>gi|68163441|ref|NP_001020174.1| uncharacterized protein LOC311855 [Rattus norvegicus]
gi|60552111|gb|AAH91189.1| Similar to Hypothetical protein MGC11690 [Rattus norvegicus]
Length = 211
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 81/137 (59%), Gaps = 7/137 (5%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKI 146
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED LY++
Sbjct: 46 EEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDCLYEL 103
Query: 147 PEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKS 201
PE+++ K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 104 PENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSE 163
Query: 202 DFSIPSSYSADYFQRGR 218
+P++ + +Y Q R
Sbjct: 164 TSFVPTNMAVNYVQHNR 180
>gi|71024789|ref|XP_762624.1| hypothetical protein UM06477.1 [Ustilago maydis 521]
gi|46100513|gb|EAK85746.1| hypothetical protein UM06477.1 [Ustilago maydis 521]
Length = 320
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 107/232 (46%), Gaps = 40/232 (17%)
Query: 64 LQSAAAAGGGGLTKVSEKNEGDGEKD----ELVLQDTFAQETAVMVEDPNMLKYVEQELA 119
+Q+AA G T +E+ E D + V ++ F ET + D +M+ Y+EQE+
Sbjct: 105 IQAAALRGSTSHTADNEQEESDDDNPTKPRRRVRKNHFQSETGTVDVDKHMMAYIEQEIK 164
Query: 120 KKRGKNI--DVN-DRVENDLKHAEDELYKIPEHLK--KRNSEESSTQ------------W 162
K+ G N+ D N D V +++ + +LY + E + +R+ + TQ
Sbjct: 165 KRTGTNMQSDSNSDSVSKPIQNPDHQLYAVAEKYRELQRSIQPEQTQEEREGNVALSSAM 224
Query: 163 TTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAE 222
+ I EV L I+ ++ NI++TE A++ L + R D P + D + A
Sbjct: 225 LSSIPEVDLGIDNRMHNIQQTELARRKLHQHRTSNAHHQDAHAPQAARGDAADQALANA- 283
Query: 223 KLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
R +H + RP +D + +Q ATDQ +L+RFRKR+R
Sbjct: 284 --RFQH-------------SKQRPL---SDPSARQQMATDQLVLDRFRKRQR 317
>gi|291414331|ref|XP_002723414.1| PREDICTED: chromosome 9 open reading frame 78-like [Oryctolagus
cuniculus]
Length = 291
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 108/218 (49%), Gaps = 20/218 (9%)
Query: 73 GGLT---KVSEKN-EGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDV 128
GG+ K+ E+ E E+++L L +F+ ET ED +M+K EL KR K+I
Sbjct: 76 GGMVDMKKLKERGKEKISEEEDLHLGTSFSAETNRRDEDADMMKVHRTEL--KRRKSIVE 133
Query: 129 NDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETE 184
+ + AED LY++PE ++ KR E S Q +GI EV L I+ K+KNI TE
Sbjct: 134 CEEQRVKPRSAEDCLYELPESIRVRSAKRTEEMLSNQMLSGIPEVDLGIDAKIKNIISTE 193
Query: 185 AAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-G 236
AK +LL E++ + +P++ + +Y Q R Y E+L +E P+ R G
Sbjct: 194 DAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVG 253
Query: 237 SQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
+ R N A + ATD + E+F+K R
Sbjct: 254 DTEKPEPERSPPNRKRPANEK--ATDDYHYEKFKKMNR 289
>gi|345306099|ref|XP_001508082.2| PREDICTED: uncharacterized protein C9orf78-like [Ornithorhynchus
anatinus]
Length = 174
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 91/176 (51%), Gaps = 16/176 (9%)
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK----KRNSEESSTQWTTGI 166
+KY+E EL K++G I N+ + K+AED LY++PE+++ K+ E S Q +GI
Sbjct: 1 MKYIETELKKRKG--IVENEEQKVKPKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGI 58
Query: 167 AEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL- 224
EV L I+ K+KNI TE AK +LL E++ + +P++ + +Y Q R Y E+L
Sbjct: 59 PEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELN 118
Query: 225 -----RREHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 274
+E P+ R G + R N + ATD + E+F+K R
Sbjct: 119 APVRRNKEEPKARPLRVGDTEKPEPERSPPNRKRPHNEK--ATDDYHYEKFKKMNR 172
>gi|170573241|ref|XP_001892395.1| Hepatocellular carcinoma-associated antigen 59 family protein
[Brugia malayi]
gi|158602086|gb|EDP38774.1| Hepatocellular carcinoma-associated antigen 59 family protein
[Brugia malayi]
Length = 426
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 119/244 (48%), Gaps = 30/244 (12%)
Query: 9 KEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA 68
K+++++ R+R + ++ TT ++E E LE +K LQ+ R RK+G+ A+ AL
Sbjct: 9 KKRQRHLRERIIDNDDSTT----EEEAEIACKLEGVKELQESRIRKNGLNAVECALGKEL 64
Query: 69 AA------------GGGGLTKVSEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNMLKYV 114
AA GGG+ ++SE + ++ ++D F +E+ + E M KYV
Sbjct: 65 AAEFIAMDDDPFRQRGGGMLRLSEGRQAQMHAADIEAGIRDQFKKESFLRDEHEEMKKYV 124
Query: 115 EQELAKKRG-KNIDVNDRVENDLKHAEDEL-YKIPEHLK----KRNSEESSTQWTTGIAE 168
+ EL K++ ++++ ND + + ED L +K E ++ +RN E S Q GI E
Sbjct: 125 QAELRKRKAVQDLEDNDATTSKVSSMEDTLMWKAAEKVRLFRSERNDELLSNQMLAGIPE 184
Query: 169 VQLPIEYKLKNIEETEAAK----KLLQEKRLMGRAKSDFSIPSS--YSADYFQRGRDYAE 222
V L I ++ NI ETE K K + EKR S FS + + DY Q Y E
Sbjct: 185 VDLGINARMSNIIETEKKKSDMLKEVVEKRRNLAQDSLFSQDRAKDLAKDYVQHSIFYME 244
Query: 223 KLRR 226
R
Sbjct: 245 STTR 248
>gi|357610714|gb|EHJ67110.1| hypothetical protein KGM_02139 [Danaus plexippus]
Length = 214
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/193 (35%), Positives = 97/193 (50%), Gaps = 22/193 (11%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKR-----GKNIDVNDRVENDLKHAEDELYKIPEHLK 151
F+ ET ED M+KY+E++LAK++ K + V L E L +PEHL+
Sbjct: 27 FSAETNKRDEDEEMMKYIEEQLAKRKEGSDSSKKESDDSEVLKYLAPEEAALLSLPEHLR 86
Query: 152 K----RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIP 206
R+ E S Q +GI EV L I+ K+KNIE TE AK KLL EK S F +P
Sbjct: 87 SSSMHRSEEMLSNQMLSGIPEVDLGIDAKIKNIEATEEAKMKLLWEKHNKKDGPSHF-VP 145
Query: 207 SSYSADYFQRGR-----DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAAT 261
++ + ++ Q R +++K E P + K S D + ++ A G R AT
Sbjct: 146 TNMAVNFVQHNRFNLDSIHSKKRPAERP-IQKVEVSVIDESVNKIVKK---AKGER--AT 199
Query: 262 DQFMLERFRKRER 274
D + ERFRK+ R
Sbjct: 200 DDYHYERFRKQFR 212
>gi|307109350|gb|EFN57588.1| hypothetical protein CHLNCDRAFT_143273 [Chlorella variabilis]
Length = 271
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 98/198 (49%), Gaps = 26/198 (13%)
Query: 11 KKKNFRK-RSYEEEEETTNKLSDDEE-ERRLALEEIKFLQKQRERKS---------GI-P 58
K++N RK R+ +E+EE + ++ E + +L ++IK LQKQR+R++ G+ P
Sbjct: 2 KQRNIRKKRALDEQEELSEGEAEGAEGQPKLTADDIKLLQKQRQRRTVRLATAETRGVQP 61
Query: 59 AIPSALQSAAAAGGGGL----TKVSEKNEGDGEKDELVLQDTFAQETAVMVED--PNMLK 112
A LQ+ + G L +V K G VLQ F +E + ED NM K
Sbjct: 62 AW--RLQAGSGVDVGSLMVADVRVERKEAGAAAVVGDVLQAAFKRERRLHSEDEDVNMKK 119
Query: 113 YVEQELAKKRGK----NIDVNDRVENDLKHAEDELYK-IPEHLKKRNSE-ESSTQWTTGI 166
YVE++LAK+ G+ + +D L +PE L+KR + E W GI
Sbjct: 120 YVEEQLAKRMGRPGQEEEEAAAAEAERRARMQDPLLAAMPEGLQKRQQDTELGPSWVAGI 179
Query: 167 AEVQLPIEYKLKNIEETE 184
EV L +E KL NIE TE
Sbjct: 180 TEVPLSMEQKLANIEATE 197
>gi|388582301|gb|EIM22606.1| hypothetical protein WALSEDRAFT_56792 [Wallemia sebi CBS 633.66]
Length = 285
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 114/240 (47%), Gaps = 44/240 (18%)
Query: 11 KKKNFRKRSYEEE--EETTNKLSDDEEERRLALEEIKFLQKQRERKSGI----------- 57
KK+N K + E+ EE+ N +DD+ +EE ++K + + +GI
Sbjct: 3 KKRNINKSTRREKSIEESNNVDNDDDSTISDTIEEKHLIRKLKRQAAGIDSEKLVQTTSN 62
Query: 58 ---PAIPSALQSAA---AAGGGGLTKVSEK-NEGDGEKDELVLQDTFAQETAVMVEDPNM 110
P + ++ A +A GG+ +EK E +D+LV F ++A + D +M
Sbjct: 63 SKKPKLDNSQSKEAHGWSASSGGIVDNTEKLKEASNPQDKLVKTSNFTGQSATIDVDKHM 122
Query: 111 LKYVEQELAKKRGK------NIDV---NDRVENDLKHAEDELYKIPEH--LKKRNSEESS 159
+ Y+E+++ KK + N+++ N+R+ N +DELY I + K RN ++ S
Sbjct: 123 MSYIEEQMLKKHQQQGLPTDNLNLGITNERINN----PQDELYDIAQKYTYKSRNLDDGS 178
Query: 160 T----QWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQ 215
T I EV L +E KL+NI+ETE AK +R+ K P++ DY Q
Sbjct: 179 ITNSESMLTKIPEVDLGVEAKLRNIQETEKAK-----QRMRDLEKHTSRKPANTDPDYTQ 233
>gi|242007288|ref|XP_002424473.1| protein C9orf78, putative [Pediculus humanus corporis]
gi|212507891|gb|EEB11735.1| protein C9orf78, putative [Pediculus humanus corporis]
Length = 314
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 129/278 (46%), Gaps = 54/278 (19%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSAL--------------QSAAAAGGGGLTKVSEKNE 83
R LEE+K LQK R R +G+ + AL + GG+ +
Sbjct: 48 RTKLEEMKMLQKLRARPNGVNIVGLALGRKIGEEEEEDIDVKDPFKTKSGGMINMKTLKS 107
Query: 84 GDGEKDE----LVLQDTFAQETAVMVEDPNMLKYVEQELAKKR--------GKNIDVNDR 131
G +K + + F+ ET ED M+K++E +L+KK+ GK+ D ++
Sbjct: 108 GKIKKMDDAYDTGIGTQFSAETNKRDEDEEMMKFIEDQLSKKKGLMKEKKSGKSDDQDES 167
Query: 132 VENDLKHAEDE-LYKIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAA 186
++ E+ L IP+HL+ +R+ E S Q +GI EV L I+ K++NIE TE A
Sbjct: 168 SKSKYCSPEEAALQAIPDHLRSSSMQRSEEMLSHQMLSGIPEVDLGIDAKIRNIEATEEA 227
Query: 187 K-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR---DYAEKLRREHPELYKDRGSQDDGA 242
K KLL + S F +PS+ + ++ Q+ + D E LR+ YK
Sbjct: 228 KLKLLWSEHNKKEGPSQF-VPSNITVNFMQQNKMNQDDLEPLRKRQKN-YK--------- 276
Query: 243 GSRPT----DNSTDAAGSRQA--ATDQFMLERFRKRER 274
RPT D++ A ++ ATD + ERF+K+ R
Sbjct: 277 --RPTVQILDDAKIAIKRKEGEKATDDYHYERFKKQFR 312
>gi|343426688|emb|CBQ70217.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 275
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 88/199 (44%), Gaps = 50/199 (25%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN-DRVENDLKHAEDELYKIPEH 149
LV ++ F ET + D +M+ Y+E E+AK+ G + + D V + L + D+LY + E
Sbjct: 109 LVRKNNFQGETGTVDVDKHMMAYIEAEMAKRTGTSTASSADTVRSALANPHDQLYALAEE 168
Query: 150 LK--KRNSEESSTQ------------WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL 195
+ +R + TQ + I EV L I+ ++KNI+ TE AK+ L ++
Sbjct: 169 YRQLQRQIKPDQTQDEREGNVALSAAMLSSIPEVDLGIDERMKNIQHTEDAKRALAQRAK 228
Query: 196 MGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAG 255
A + ++++ FQ+ +
Sbjct: 229 AANADG-LGVDTAFAGARFQQ----------------------------------VSGSN 253
Query: 256 SRQAATDQFMLERFRKRER 274
SRQ ATDQ +L+RFRKR+R
Sbjct: 254 SRQMATDQLVLDRFRKRQR 272
>gi|170090594|ref|XP_001876519.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648012|gb|EDR12255.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 252
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 97/195 (49%), Gaps = 22/195 (11%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
+V + F Q+T + D +M+ Y+E+ L + D +D E+ ++ LYKI EH
Sbjct: 68 VVRANNFTQQTNTLDVDKHMMAYIEENLKIRSKPREDSDD--EDKPHDPQEALYKIAEHW 125
Query: 151 K--------KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 202
K S +S T I EV L ++ +LKNIE+TE AK+++ E+R D
Sbjct: 126 KVGKPQPKTDEGSVTNSMTMLTAIPEVDLGMDTRLKNIEDTEKAKRVVAEER------HD 179
Query: 203 FSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGS--RQA 259
P++ ++ R Y +R + ++ +D ++ + G P D+S + Q
Sbjct: 180 RKKPNN-DEEHLVASRFYRPNMRAKSDADILRD--AKLEAMGMPPQDDSPQRSNQERTQM 236
Query: 260 ATDQFMLERFRKRER 274
ATD+ ++ERF+KR R
Sbjct: 237 ATDEIVMERFKKRMR 251
>gi|159489200|ref|XP_001702585.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280607|gb|EDP06364.1| predicted protein [Chlamydomonas reinhardtii]
Length = 261
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 96/193 (49%), Gaps = 19/193 (9%)
Query: 11 KKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA-- 68
K++N RKR E+E+ + +S + E +R L E + +Q+ R+R +G A+
Sbjct: 4 KQRNIRKRVASEDEDAPD-VSAEPECQRDKLAETRLMQQLRKRSAGTGVGALAMGGGGPG 62
Query: 69 ---AAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMV---EDPNMLKYVEQELAKKR 122
A+ G + SE EG G V+ D + + ++ V ED +M KYVE++LA +
Sbjct: 63 IGPASREGSVEPGSEGGEGAG----TVVMDAYVKAKSIAVQMDEDAHMQKYVEEQLAARL 118
Query: 123 GKNIDVNDRVEND----LKHAEDELYKIPEHLKKRNSEESSTQ-WTTGIAEVQLPIEYKL 177
GK + END + E ELY +P + +E + +AEV L + KL
Sbjct: 119 GKTAEAEAE-ENDPEVKRRKLEQELYALPSDFTTQLEQELVLPGMVSTLAEVPLAAKDKL 177
Query: 178 KNIEETEAAKKLL 190
K+IE TEA K+ L
Sbjct: 178 KSIEATEALKRSL 190
>gi|443927092|gb|ELU45623.1| hepatocellular carcinoma-associated antigen 59 domain-containing
protein [Rhizoctonia solani AG-1 IA]
Length = 349
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 94/193 (48%), Gaps = 31/193 (16%)
Query: 31 SDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA--------------AAAGGGGLT 76
S+ +E ++ +EE+ L+K R ++ GI + S G GL
Sbjct: 40 SEGVDEEKMTIEELLELRKLRRQRQGIDSTKLNAGSTKKKKRRDEDEEAEDENEGKYGLR 99
Query: 77 KVSEKNEGD-------GEKD---ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI 126
K ++ +GD G +D +++ + F Q+T + D +M+KY+E+EL K+RGK
Sbjct: 100 KGGQRQDGDDDEASADGAEDVAKKIIKSNNFTQQTNKLDVDKHMMKYIEEELEKRRGKPN 159
Query: 127 DVNDRVENDLKHAEDELYKIPEHLKKRNSEE-------SSTQWTTGIAEVQLPIEYKLKN 179
D ++ EL++I E K + +E +S+ T I EV L ++ +LKN
Sbjct: 160 ASGDTGNSNSSDPYAELFRISEKYKLQKKQELEEGSVTNSSAMLTAIPEVDLGMDTRLKN 219
Query: 180 IEETEAAKKLLQE 192
IEETE AK+ + E
Sbjct: 220 IEETEKAKRTVSE 232
>gi|148298871|ref|NP_001091802.1| uncharacterized protein LOC778507 [Bombyx mori]
gi|116272507|gb|ABJ97189.1| hypothetical protein [Bombyx mori]
Length = 226
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 95/188 (50%), Gaps = 13/188 (6%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAK-KRGKNIDVNDRVEND-LKHAEDE---LYKIPEHLK 151
F+ ET ED M+KY+E++LAK K G + D D + LK+ E L +P+HL+
Sbjct: 40 FSAETNKRDEDEEMMKYIEEQLAKRKEGCDKDNKDHNHTETLKYLSPEEAALLSLPDHLR 99
Query: 152 ----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIP 206
+R+ E S Q +GI EV L I+ K+KNIE TE AK KL+ E++ S F +P
Sbjct: 100 VSSNQRSEEMLSNQMLSGIPEVDLGIDAKIKNIEATEEAKMKLIWERQNKKDGPSQF-VP 158
Query: 207 SSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFML 266
++ + ++ Q R E ++ E ++ A G R ATD +
Sbjct: 159 TNMAVNFVQHNRFNMENDKKRKIEKVVVPKTEISVIDENVDKIVKKAKGER--ATDDYHY 216
Query: 267 ERFRKRER 274
E+F+K+ R
Sbjct: 217 EKFKKQFR 224
>gi|134108060|ref|XP_777412.1| hypothetical protein CNBB2130 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260102|gb|EAL22765.1| hypothetical protein CNBB2130 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 303
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 63/113 (55%), Gaps = 9/113 (7%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
LV + F Q+T + D +M+ Y+E ELAK+RG+ D D+ + + ELY+I E
Sbjct: 123 LVRVNNFTQQTNALDVDKHMMAYIEAELAKRRGQAADTTDKSAIEDNDPQAELYRIAEKY 182
Query: 151 -----KKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
KK+ +E +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 183 QFETRKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 235
>gi|58264334|ref|XP_569323.1| hypothetical protein CNB03570 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57223973|gb|AAW42016.1| hypothetical protein CNB03570 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 303
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 63/113 (55%), Gaps = 9/113 (7%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
LV + F Q+T + D +M+ Y+E ELAK+RG+ D D+ + + ELY+I E
Sbjct: 123 LVRVNNFTQQTNALDVDKHMMAYIEAELAKRRGQAADTTDKSAIEDNDPQAELYRIAEKY 182
Query: 151 -----KKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
KK+ +E +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 183 QFETRKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 235
>gi|328871809|gb|EGG20179.1| hypothetical protein DFA_07299 [Dictyostelium fasciculatum]
Length = 310
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 97/206 (47%), Gaps = 23/206 (11%)
Query: 12 KKNFRKRSYEEEEETTNKLS-------DDEEERRLALEEI-KFLQKQRERKSGI------ 57
K + RK+ ++E TN S DD+++ + EI K QK RE+ GI
Sbjct: 54 KPSLRKKDGDDESSLTNDTSSIASENGDDQQQDLSTIIEITKERQKMREKGKGIIAGVLA 113
Query: 58 --PAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVE 115
P I + L+ T +EKNE + ++ + ++ ++ + + L
Sbjct: 114 EGPHIKAHLRELEHKLDDSFTIATEKNETNVHLEKFLAKEMEKKKIEIKHKLTGGLMGTT 173
Query: 116 QELAKKRGKNIDVNDRVEND----LKHAEDELYKIPEHL---KKRNSEESSTQWTTGIAE 168
+ ++ +N D N+ +K ED LY+ PEHL K R EE T W GI+E
Sbjct: 174 EHRKEEDDENKQTKDNNANNTTTKIKTDEDSLYETPEHLAVKKTRKKEEDKTNWLAGISE 233
Query: 169 VQLPIEYKLKNIEETEAAKKLLQEKR 194
V LP YK+KNI+ETE A+ +++ +
Sbjct: 234 VSLPTSYKIKNIQETEDARSKIKDSK 259
>gi|302693901|ref|XP_003036629.1| hypothetical protein SCHCODRAFT_49879 [Schizophyllum commune H4-8]
gi|300110326|gb|EFJ01727.1| hypothetical protein SCHCODRAFT_49879 [Schizophyllum commune H4-8]
Length = 292
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 96/197 (48%), Gaps = 27/197 (13%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPE-- 148
+V + F Q+T + D +M+ Y+EQ L K R + +D D + + ++ LY + +
Sbjct: 109 VVRNNNFTQQTNALDVDKHMIAYIEQNL-KVRSRPLDDEDEKKQEPLDPQEALYHLSDKW 167
Query: 149 HLKKRNSEE-----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDF 203
+L K+ E +S T I EV L ++ +LKNIEETE AK+L+ E++ GR +D
Sbjct: 168 NLNKQTHPEDGSVTNSMTMLTAIPEVDLGMDARLKNIEETEKAKRLIAEEK-QGRKLTD- 225
Query: 204 SIPSSYSADYFQRGRDYAEKLRREHPELYKD----RGSQDDGAGSRPTDNSTDAAGSR-- 257
+ A + R H + D R ++ G +P D S +
Sbjct: 226 -----------EEAHLVATRFYRPHLKTKSDADIMRDAKLAAMGMQPKDQSQRWSNHDRP 274
Query: 258 QAATDQFMLERFRKRER 274
Q ATD+ ++ERF+KR R
Sbjct: 275 QMATDEIVMERFKKRMR 291
>gi|443897955|dbj|GAC75293.1| uncharacterized conserved protein [Pseudozyma antarctica T-34]
Length = 290
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 94/204 (46%), Gaps = 41/204 (20%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK-NIDVNDRVENDLKHAEDELYK 145
+K LV ++ F ET + D +M+ Y+E E+ K+ G + + + ED LY
Sbjct: 111 DKPRLVRKNNFQGETGTVDVDKHMMAYIEDEMRKRTGSADTVDAAAIVAAVNDPEDALYA 170
Query: 146 IPE-------HLKKRNSEES-------STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQ 191
+ E +K ++E S+ T I EV L I+ ++ NI++TEAA++
Sbjct: 171 VAEKYKELHRSIKPEQTQEQREGNVAFSSAMLTSIPEVDLGIDARMANIQDTEAARREAS 230
Query: 192 EKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNST 251
+ + + + ++ FQR K R++H +Q GSRP
Sbjct: 231 QPK-----PAHHDVDEDFANARFQRA-----KPRQDH--------AQSSNQGSRPE---- 268
Query: 252 DAAGSRQAATDQFMLERFRKRERH 275
RQ ATDQ +L+RF+KR+R+
Sbjct: 269 ----RRQMATDQLVLDRFKKRQRN 288
>gi|336373766|gb|EGO02104.1| hypothetical protein SERLA73DRAFT_132900 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386581|gb|EGO27727.1| hypothetical protein SERLADRAFT_383135 [Serpula lacrymans var.
lacrymans S7.9]
Length = 276
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 99/193 (51%), Gaps = 22/193 (11%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQEL-AKKRGKNIDVNDRVENDLKHAEDELYKIPEH 149
+V + F Q+T + D +M+ Y+E+ L + + ++ + R + +DELY + E
Sbjct: 96 VVRANNFTQQTNALDVDKHMMAYIEENLKIRSKPQSPEPTSRSSD----PQDELYNVSER 151
Query: 150 LK--KRNSEESSTQ----WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDF 203
K KR +EE S T I EV L ++ +LKNIEETE AK+++ E+R R K
Sbjct: 152 WKVEKRMAEEGSVTNSLTMLTAIPEVDLGMDTRLKNIEETEKAKRMVAEER-KDRKK--- 207
Query: 204 SIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR-QAAT 261
++ ++ R Y L+ + ++ +D ++ + G P D S R Q AT
Sbjct: 208 ---ATNDEEHLAAARFYRPNLKQKSDADIMRD--AKLEAMGLPPQDESRRHHHDRPQMAT 262
Query: 262 DQFMLERFRKRER 274
D+ ++ERF+KR R
Sbjct: 263 DEAVMERFKKRMR 275
>gi|393216067|gb|EJD01558.1| hypothetical protein FOMMEDRAFT_135745 [Fomitiporia mediterranea
MF3/22]
Length = 309
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 95/195 (48%), Gaps = 22/195 (11%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID-VNDRVENDLKHAEDELYKIPEHL 150
V + F Q+T + D +M+ Y+E+ + ++RG+ + + D ++ELY+I E
Sbjct: 124 VRTNNFTQQTNALDVDKHMMAYIEENMRQRRGERGEKIEDEEAPKPLDPQEELYRIAEKF 183
Query: 151 KKRN--------SEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 202
K +N S +S T I EV L ++ +LKNIEETE AK+ + E + R +D
Sbjct: 184 KTQNNARGQEEGSVTNSLSMLTAIPEVDLGMDARLKNIEETEKAKRAVAEAKKERRQNND 243
Query: 203 FSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNST--DAAGSRQA 259
++ R Y ++ + ++ +D ++ + G RP D Q
Sbjct: 244 --------EEHLAATRFYKPNIKQKSDADIIRD--AKLEAMGLRPDDYEPRRHHPEKVQT 293
Query: 260 ATDQFMLERFRKRER 274
ATD+ ++ERF+KR R
Sbjct: 294 ATDEMIMERFKKRMR 308
>gi|357017455|gb|AET50756.1| hypothetical protein [Eimeria tenella]
Length = 325
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 59/108 (54%), Gaps = 3/108 (2%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK 151
+L+ FA D ++ +++ + L K+ ++ + E+D+ +LY IP+HLK
Sbjct: 148 LLEKNFASGGPGSATDKHLEEFLRERLKDKQHESREEKALREHDMVDKMRDLYAIPDHLK 207
Query: 152 KRNSEE---SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLM 196
+ E W TG+ EV+LP+E KLKNIE TE AK+ L K L+
Sbjct: 208 VADKTEEYKDQMNWVTGLVEVELPMETKLKNIEATERAKRQLLRKGLL 255
>gi|358054938|dbj|GAA99005.1| hypothetical protein E5Q_05694 [Mixia osmundae IAM 14324]
Length = 307
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 63/111 (56%), Gaps = 11/111 (9%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN---IDVNDRVENDLKHAEDELYKIP 147
LV + F Q+T + D +M+ Y+E EL K+ KN +DV + + H DELY++
Sbjct: 118 LVKSNNFTQQTNTLDVDKHMMAYIEAELRKRTQKNPGQLDVEEELGKLDPH--DELYQVA 175
Query: 148 EHLK------KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQE 192
E + + +E +S+ T + EV L I+ +L+NIEETE AK+ L+E
Sbjct: 176 ERYRVAKMPVREGNETTSSAMLTAVQEVDLGIDARLRNIEETEKAKQRLRE 226
>gi|403413736|emb|CCM00436.1| predicted protein [Fibroporia radiculosa]
Length = 294
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 97/195 (49%), Gaps = 23/195 (11%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG-KNIDVNDRVENDLKHAEDELYKIPEH 149
+V + F Q+T V+ D +M+ Y+E+ + +RG +N ++D D EL+ IP+
Sbjct: 111 VVRANNFTQQTNVLDVDKHMMAYIEENMKLRRGNQNEPMSDDGPLD---PYAELFSIPDK 167
Query: 150 LKKRNSEE-------SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 202
+ +E +S T I EV L ++ +LKNIEETE AK+++ E+R + K D
Sbjct: 168 YRLTQEQEQDEGNVTNSLAMLTAIPEVDLGMDTRLKNIEETEKAKRMITEERKERKKKVD 227
Query: 203 FSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR--QA 259
++ R Y L+ + ++ +D ++ + G P D+ Q
Sbjct: 228 -------DEEHLAAARFYRPNLKMKSDADIIRD--AKLEAMGLLPEDHEYRRPQHERMQM 278
Query: 260 ATDQFMLERFRKRER 274
ATD+ ++ERF+KR R
Sbjct: 279 ATDELVMERFKKRMR 293
>gi|321248874|ref|XP_003191271.1| hypothetical protein CGB_A2540W [Cryptococcus gattii WM276]
gi|317457738|gb|ADV19484.1| hypothetical protein CNB03570 [Cryptococcus gattii WM276]
Length = 311
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 63/113 (55%), Gaps = 9/113 (7%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
LV + F Q+T + D +M+ Y+E ELAK+RG+ D+ + + + ELY+I E
Sbjct: 125 LVRVNNFTQQTNALDVDKHMMAYIETELAKRRGQAAAPTDKSKVEDNDPQAELYRIAEKY 184
Query: 151 -----KKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
KK+ +E +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 185 QFETKKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 237
>gi|405118594|gb|AFR93368.1| hypothetical protein CNAG_03868 [Cryptococcus neoformans var.
grubii H99]
Length = 304
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 61/113 (53%), Gaps = 9/113 (7%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
LV + F Q+T + D +M+ Y+E ELAK+RG+ D + + ELY+I E
Sbjct: 124 LVRVNNFTQQTNALDVDKHMMAYIEAELAKRRGQAAATTDESALEDNDPQAELYRIAEKY 183
Query: 151 -----KKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
KK+ +E +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 184 QFETRKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 236
>gi|390601598|gb|EIN10992.1| hypothetical protein PUNSTDRAFT_64949 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 292
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 89/191 (46%), Gaps = 22/191 (11%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK 151
V F Q+T + D +M+ Y+E+ +AK RG + +D EL ++ + K
Sbjct: 111 VRTSNFTQQTNTLDVDRHMMAYIEENMAKLRGAK---REEKSDDPADPYAELNRLADRYK 167
Query: 152 --KRNSEE------SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAK--S 201
K+N +E +S T I EV L ++ +LKNIEETE AK+++ E+R R K
Sbjct: 168 FSKKNEKEEEGNVTNSLAMLTAIPEVDLGMDARLKNIEETERAKRIVAEERKDNRRKRED 227
Query: 202 DFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAAT 261
+ IP S + AE LR E G RP + T Q AT
Sbjct: 228 EEQIPGSRFYRPNHNAKTDAEILRNAKLEAM---GLPPQEENRRPHNERT------QMAT 278
Query: 262 DQFMLERFRKR 272
D+ ++ERF+KR
Sbjct: 279 DEMVMERFKKR 289
>gi|449549468|gb|EMD40433.1| hypothetical protein CERSUDRAFT_148439 [Ceriporiopsis subvermispora
B]
Length = 287
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 94/196 (47%), Gaps = 26/196 (13%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED--ELYKIPE 148
+V + F Q+T + D +M+ Y+E+ + ++G + E D + EL++I +
Sbjct: 105 VVRANNFTQQTNTLDVDKHMMAYIEENMKLRQG----AQNESEKDDGPLDPYAELFRIAD 160
Query: 149 HLKKRNSEE---------SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRA 199
K R E +S T I EV L ++ +LKNIEETE AK+L++E++ +
Sbjct: 161 KYKPRQDSEKEKEEGSVTNSLSMLTAIPEVDLGMDTRLKNIEETEKAKRLIEERKERKKQ 220
Query: 200 KSDFSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQ 258
D + + R Y L+ R ++ +D ++ + G P D+ Q
Sbjct: 221 ADDEA--------HLAATRFYRPNLKTRSDADIIRD--AKLEAMGLPPEDHDRPRHDRPQ 270
Query: 259 AATDQFMLERFRKRER 274
ATD+ ++ERF+KR R
Sbjct: 271 MATDEMVMERFKKRMR 286
>gi|393240981|gb|EJD48505.1| hypothetical protein AURDEDRAFT_112943 [Auricularia delicata
TFB-10046 SS5]
Length = 306
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 108/237 (45%), Gaps = 38/237 (16%)
Query: 66 SAAAAGGGGLTKVSEKNEGDGEKDE-----LVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
+AA GGL +V E + + E DE V + F Q+T + D +M+ Y+E+ + K
Sbjct: 73 TAAIVTPGGLREVREPVDEELEGDEAKARRTVRSNNFTQQTNALDVDKHMMNYIEENMRK 132
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKI--------------------PEHLKKRNSEE--- 157
+ G D +D +N+ EL+++ P KK+++EE
Sbjct: 133 RYG-GTDDDDEKKNEPWDPLAELFRVDPVFPSKPNVKPDASKSATAPVVSKKKDNEEGSV 191
Query: 158 -SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQE-KRLMGRAKSDFSIPSSYSADYFQ 215
+S T I EV L ++ +LKNIE+TE AK+ E K+ R D A F
Sbjct: 192 TNSATMLTAIPEVDLGMDTRLKNIEDTEKAKRTAAELKQERQRVDKD-----DLVAGRFY 246
Query: 216 RGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKR 272
RG+D + KL+ + E + + + G P + ++ ATD +ERF+KR
Sbjct: 247 RGQDKS-KLKTDE-ERLRSAMAVEMGNAPDPARRHEHRSQRKEVATDDIAMERFKKR 301
>gi|392568454|gb|EIW61628.1| hypothetical protein TRAVEDRAFT_163006 [Trametes versicolor
FP-101664 SS1]
Length = 285
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 96/196 (48%), Gaps = 25/196 (12%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED--ELYKIPE 148
+V ++ F +T + D +M+ Y+E+ + +RG D + D A+ E+++I E
Sbjct: 102 VVRENNFTHQTNALDVDKHMMAYIEENMKLRRG----TQDESKKDDGQADPYAEVFRITE 157
Query: 149 HLK----KRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAK 200
K K+ EE +S T I EV L ++ +LKNIEETE AK+ + E+R + +
Sbjct: 158 KYKPPTQKKEQEEGNVTNSLAMLTAIPEVDLGMDARLKNIEETEKAKRQIAEQRKDKQKQ 217
Query: 201 SDFSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR-Q 258
D + R Y LR + ++ +D ++ + G P D+ R Q
Sbjct: 218 GD-------DEAHLAGSRFYRPNLRAKSDADILRD--AKLEAMGLNPEDHEVRRHSDRPQ 268
Query: 259 AATDQFMLERFRKRER 274
ATD+ ++ERF+KR R
Sbjct: 269 MATDEMVMERFKKRMR 284
>gi|299748182|ref|XP_001837525.2| hypothetical protein CC1G_01437 [Coprinopsis cinerea okayama7#130]
gi|298407852|gb|EAU84441.2| hypothetical protein CC1G_01437 [Coprinopsis cinerea okayama7#130]
Length = 289
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 107/221 (48%), Gaps = 26/221 (11%)
Query: 73 GGLTK--VSEKNEGDGEKDE-----LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN 125
GGL K +E +E D E E +V + F Q+T + D +M+ Y+E+ L K RGK
Sbjct: 75 GGLRKPLAAEGDEDDEEAKEARARRVVRTNNFTQQTNALDVDKHMMAYIEENL-KVRGKP 133
Query: 126 IDVNDRVENDLKHAEDELYKI---------PEHLKKRNSEESSTQWTTGIAEVQLPIEYK 176
+ D + +D LY+ P+ + + +S T I EV L ++ +
Sbjct: 134 RNEEDDEDKKPLDPQDILYRQVADRFRLDKPKAATEEGNVTNSMSMLTAIPEVDLGMDTR 193
Query: 177 LKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLR-REHPELYKDR 235
LKNIEETE AK+++ E+R R K + + Y+ Y LR + ++ +D
Sbjct: 194 LKNIEETEKAKRVVAEER-QERKKVNPDEEHLVATRYWTV---YRPNLRAKSDADILRD- 248
Query: 236 GSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRER 274
++ + G P D++ + Q ATD+ ++ERF+KR R
Sbjct: 249 -AKLEAMGLPPQDDAHHRSNHDRAQTATDEIVMERFKKRMR 288
>gi|312090668|ref|XP_003146699.1| hypothetical protein LOAG_11128 [Loa loa]
Length = 288
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 102/202 (50%), Gaps = 25/202 (12%)
Query: 6 PQKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQ 65
P KK K+++ R+R ++++E T + E E LE IK LQ+ R K+G+ A+ AL
Sbjct: 7 PTKK-KQRHLRERIFDDDEITAEE----EAEIACKLEGIKELQESRVCKNGLNAVECALG 61
Query: 66 SAAAA------------GGGGLTKVSEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNML 111
AA GGG+ ++SE + ++ ++D F +E+ + E M
Sbjct: 62 KELAAEFIAMDDDPFRQRGGGMLRLSEGRQAQMHAADIEAGIRDQFKKESFLRDEHEEMK 121
Query: 112 KYVEQELAKKRG-KNIDVNDRVENDLKHAEDEL-YKIPEHLK----KRNSEESSTQWTTG 165
KYV+ EL K++ ++++ D + + ED L +K E ++ +RN E S Q G
Sbjct: 122 KYVQAELRKRKAVQDLEDGDATTSKVPSMEDSLMWKAAEKVRLFRSERNDELLSNQMLAG 181
Query: 166 IAEVQLPIEYKLKNIEETEAAK 187
I EV L I ++ NI ETE K
Sbjct: 182 IPEVDLGINARMSNIIETEKKK 203
>gi|291229542|ref|XP_002734736.1| PREDICTED: glycosyltransferase 25 domain containing 2-like, partial
[Saccoglossus kowalevskii]
Length = 576
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 72/138 (52%), Gaps = 30/138 (21%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQE 100
LEE K QK R+R+SG+ ++E D ++D + + TFA E
Sbjct: 464 LEETKEAQKFRKRQSGV-----------------------RSEEDTDRDVVDMGSTFAAE 500
Query: 101 TAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNS----- 155
T ED +M+KY+E+++AK++GK + + E LK AED LY++P L+ +S
Sbjct: 501 TNRRDEDADMMKYIEEQMAKRKGKAMTKEE--ERRLKSAEDLLYELPARLQVESSSQKTE 558
Query: 156 EESSTQWTTGIAEVQLPI 173
E S Q +GI EV L I
Sbjct: 559 EMMSHQMLSGIPEVDLGI 576
>gi|328768488|gb|EGF78534.1| hypothetical protein BATDEDRAFT_26637 [Batrachochytrium
dendrobatidis JAM81]
Length = 273
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 86/207 (41%), Gaps = 51/207 (24%)
Query: 31 SDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG------------------ 72
SD++E RL L+E L+K R+ K GI A + G
Sbjct: 48 SDEDEHSRLTLQEALELRKLRKPKPGISAASLETGKVSLPNGKHEVSVETLQDQDDPWKL 107
Query: 73 --GGLTKVSE------KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK 124
GGL +S+ EG G F + M + +M ++E+EL K+RG
Sbjct: 108 KNGGLINISDIRGRSFGEEGSG-------TGGFETASKAMDTEKHMKAFIEKELRKRRGD 160
Query: 125 NID------------VNDRVENDLKHAEDELYKIPEHLK------KRNSEESSTQWTTGI 166
+ND ++ ++ELY+IP+ L K ++ ST I
Sbjct: 161 APSTTSDTSLPSLRKLNDELKTGPTDYDEELYRIPDALTIPVKPIKEDNVTLSTGMLMSI 220
Query: 167 AEVQLPIEYKLKNIEETEAAKKLLQEK 193
EV L + KLKNIEETE AK+ L EK
Sbjct: 221 PEVDLGVSNKLKNIEETEQAKRSLLEK 247
>gi|393908085|gb|EFO17373.2| hypothetical protein LOAG_11128 [Loa loa]
Length = 266
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 101/199 (50%), Gaps = 24/199 (12%)
Query: 9 KEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA 68
K+K+++ R+R ++++E T + E E LE IK LQ+ R K+G+ A+ AL
Sbjct: 9 KKKQRHLRERIFDDDEITAEE----EAEIACKLEGIKELQESRVCKNGLNAVECALGKEL 64
Query: 69 AA------------GGGGLTKVSEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNMLKYV 114
AA GGG+ ++SE + ++ ++D F +E+ + E M KYV
Sbjct: 65 AAEFIAMDDDPFRQRGGGMLRLSEGRQAQMHAADIEAGIRDQFKKESFLRDEHEEMKKYV 124
Query: 115 EQELAKKRG-KNIDVNDRVENDLKHAEDEL-YKIPEHLK----KRNSEESSTQWTTGIAE 168
+ EL K++ ++++ D + + ED L +K E ++ +RN E S Q GI E
Sbjct: 125 QAELRKRKAVQDLEDGDATTSKVPSMEDSLMWKAAEKVRLFRSERNDELLSNQMLAGIPE 184
Query: 169 VQLPIEYKLKNIEETEAAK 187
V L I ++ NI ETE K
Sbjct: 185 VDLGINARMSNIIETEKKK 203
>gi|281201418|gb|EFA75630.1| hypothetical protein PPL_11136 [Polysphondylium pallidum PN500]
Length = 313
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 64/139 (46%), Gaps = 30/139 (21%)
Query: 143 LYKIPEHL---KKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRA 199
LY+ P HL K R EE T W GI+EV LP +YK+KNI ETE A R
Sbjct: 199 LYETPSHLLVNKGRKKEEDKTNWVAGISEVVLPTKYKIKNILETEEA-----------RE 247
Query: 200 KSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAA---GS 256
K D Q G + +L+ +R + ++ S ++S+D S
Sbjct: 248 KID------------QSGNTNTTTTNKNQSKLH-NRCNYNNVHASYTNESSSDTNQYEKS 294
Query: 257 RQAATDQFMLERFRKRERH 275
ATD+ + ERF+K+ R+
Sbjct: 295 SDKATDEEVYERFKKKFRY 313
>gi|66827193|ref|XP_646951.1| hypothetical protein DDB_G0268774 [Dictyostelium discoideum AX4]
gi|60475040|gb|EAL72976.1| hypothetical protein DDB_G0268774 [Dictyostelium discoideum AX4]
Length = 341
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 143 LYKIPEHLKKRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKS 201
L++ PEHLK + + E T W GIAEVQLP YK KNI ETE AK L ++ R
Sbjct: 240 LFETPEHLKSQKGKVEEKTNWVAGIAEVQLPDVYKYKNIVETEKAKDALDKQ---PRNYE 296
Query: 202 DFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD 240
P +++ +Y R YA ++ + + D+ + D+
Sbjct: 297 KLLTPQNFNQNYQYHNR-YANNKKQRNEDKATDKEAMDN 334
>gi|326428178|gb|EGD73748.1| hypothetical protein PTSG_11504 [Salpingoeca sp. ATCC 50818]
Length = 301
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 95/206 (46%), Gaps = 21/206 (10%)
Query: 4 PIPQKKEKKK--NFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
P PQ K+KK+ N +KR ++ D E+ ALEE QK R R+ GI A
Sbjct: 8 PAPQFKKKKRRGNLKKREKASMDDILKADEDVAEKIEAALEE----QKLRSREQGISAEE 63
Query: 62 SALQSAAAAGGG------GLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVEDPNMLKYV 114
A + GL K G + D + ++ FA+ET + D ML+Y+
Sbjct: 64 LAKRDEEIDEEEEQLIQYGLQTKKSKTSGGADDDAMKGIEKAFAEETGRLDRDKEMLQYI 123
Query: 115 EQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEES------STQWTTGIAE 168
+++L + GK + + + LYK+PE L++ EE S+ GI E
Sbjct: 124 KEKLQESEGKK--PTGKQTSKYEQMMASLYKVPERLQEEKQEEEVERGMLSSAVLQGIPE 181
Query: 169 VQLPIEYKLKNIEETEAAKKLLQEKR 194
V L I+ K+KNIEETE AK + R
Sbjct: 182 VSLGIDEKIKNIEETERAKASIHRSR 207
>gi|409079546|gb|EKM79907.1| hypothetical protein AGABI1DRAFT_39047 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426192503|gb|EKV42439.1| hypothetical protein AGABI2DRAFT_78642 [Agaricus bisporus var.
bisporus H97]
Length = 262
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 104/225 (46%), Gaps = 39/225 (17%)
Query: 73 GGLTKVSEKNEGDGEKDE------LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI 126
GGL K + E D E ++ +V + F Q+T VM D +M+ Y+E+ L K RG+
Sbjct: 52 GGLRKGAPVEEDDEEANKEAKARRVVRTNNFTQQTNVMDVDKHMMAYIEENL-KIRGRPR 110
Query: 127 D--VNDRVENDLKHAEDELYKIPEHLK---------KRNSEESSTQWTTGIAEVQLPIEY 175
D DR D + A LY+I + K + S +S T I EV L ++
Sbjct: 111 DDEAKDRKPLDPQEA---LYRIVDKFKVNKEGQTKGEEGSVTNSLTMLTAIPEVDLGMDN 167
Query: 176 KLKNIEETEAAKKLLQEKRLM-GRAKSD---FSIPSSYSADYFQRGRDYAEKLRREHPEL 231
+LKNIEETE AK+ + E R +AK+D Y Y + + A+ L
Sbjct: 168 RLKNIEETEKAKRSVDEHRQQRKKAKTDEEHLIAQRFYRPGY--KAKSDADIL------- 218
Query: 232 YKDRGSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRER 274
R ++ + G P + S + Q ATD+ ++ERF+K R
Sbjct: 219 ---RDAKLEAMGLPPKEESPRRSNHERNQMATDEIVMERFKKHVR 260
>gi|392587037|gb|EIW76372.1| hypothetical protein CONPUDRAFT_130968 [Coniophora puteana
RWD-64-598 SS2]
Length = 188
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 91/181 (50%), Gaps = 13/181 (7%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAE--DELYKIPEHLKKRN 154
F Q+T + D +M++Y+E+ + K R D+++ + D +E + + I + L +
Sbjct: 17 FTQQTNTLDVDKHMMEYIEENIKKMRNAP-DLSEDIPADASESEHKSDKWNIEKKLAEEG 75
Query: 155 SEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYF 214
S +S T I EV L ++ +LKNIEETE AK++ E+R K ++ +A+ F
Sbjct: 76 SVTNSMAMLTAIPEVDLGMDARLKNIEETEKAKRIHAEER--KEKKRVYNDEEHLAANRF 133
Query: 215 QRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRE 273
R LR + E+ +D +Q + G P Q +TD+ ++ERF+KR
Sbjct: 134 FRP-----NLRQKTDAEIMRD--AQREAMGLPPIQERQRNYERPQMSTDEAVMERFKKRM 186
Query: 274 R 274
R
Sbjct: 187 R 187
>gi|395329965|gb|EJF62350.1| hypothetical protein DICSQDRAFT_126606 [Dichomitus squalens
LYAD-421 SS1]
Length = 300
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 94/195 (48%), Gaps = 22/195 (11%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
+V + F Q+T + D +M+ Y+E+ + +RG D D+ + EL+++ +
Sbjct: 116 VVRTNNFTQQTNALDVDKHMMAYIEENMKLRRGAKDD--DKKDEGPADPYAELFRLTKAA 173
Query: 151 KKRNSE---ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPS 207
+K+ E +S T I EV L ++ +LKNIEETE AK+ + E R K+D
Sbjct: 174 QKKEEEGNVTNSLAMLTAIPEVDLGMDTRLKNIEETEKAKRQISELRKERSKKADDEA-- 231
Query: 208 SYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR-QAATDQFM 265
+ R Y L+ + ++ +D ++ + G RP D+ R Q ATD+ +
Sbjct: 232 -----HLAAARFYRPNLKAKSDADIMRD--AKLEAMGLRPDDHEYRRPSDRAQMATDEIL 284
Query: 266 ------LERFRKRER 274
+ERF+KR R
Sbjct: 285 THLHQVMERFKKRMR 299
>gi|324508095|gb|ADY43422.1| Unknown [Ascaris suum]
Length = 287
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 97/212 (45%), Gaps = 26/212 (12%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA------------GGGGLTKVSEKNEGD--G 86
L +IK +Q R R++G+ A+ AL AA GGG+ ++SE +
Sbjct: 37 LADIKEVQLSRLRRNGLNAVECALGKELAAEFVAMDDDPFRQRGGGMLRLSEGRQAQLHA 96
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVEN-DLKHAEDEL-Y 144
E ++D F +E+ + E M KYV+ EL K++ ND + + ED L +
Sbjct: 97 ADIEAGIRDQFKKESFLRDEHEEMKKYVQAELRKRKADYEPDNDESTSVKVPSVEDNLMW 156
Query: 145 KIPEHLK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRA 199
K E ++ RN E S Q GI EV L I ++ NI ETE K ++L++ GR+
Sbjct: 157 KAAEKVRFFKSMRNDELLSNQMLAGIPEVDLGINARMSNIIETEKKKSEMLKDVIEHGRS 216
Query: 200 KSDFSI-----PSSYSADYFQRGRDYAEKLRR 226
++ ++ S DY Q Y E R
Sbjct: 217 LTEETLFQQERAKDLSKDYVQHSIFYMESTTR 248
>gi|389746703|gb|EIM87882.1| hypothetical protein STEHIDRAFT_94716 [Stereum hirsutum FP-91666
SS1]
Length = 317
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 98/202 (48%), Gaps = 30/202 (14%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH--AEDELYKIPEH 149
V + F Q+T + D +M+ Y+E+ + + +N D E K+ ++ELY++ E
Sbjct: 126 VRTNNFTQQTNALDVDKHMMAYIEENMKLRHAQNSSTPD-PEAAPKYLDPQEELYRLSEK 184
Query: 150 LK-----KRNSEESSTQ---WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKS 201
K + N E S T T I EV L ++ +LKNIEETE AK+ L + R + +
Sbjct: 185 YKVEKKAQPNEEGSVTNSLAMLTAIPEVDLGMDTRLKNIEETEKAKQALTQARKDRQKRQ 244
Query: 202 DFSIPSSYSADYFQ---RGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSR- 257
+ +A +F+ R + A+ + R ++ + G P +++ G+R
Sbjct: 245 NDDEEHLAAARFFRPNTRMKSDADII----------RDAKLEAMGLPPAEDNEPHRGNRP 294
Query: 258 -----QAATDQFMLERFRKRER 274
Q ATD+ ++ERF+KR R
Sbjct: 295 RHDRPQMATDEMVMERFKKRMR 316
>gi|384500335|gb|EIE90826.1| hypothetical protein RO3G_15537 [Rhizopus delemar RA 99-880]
Length = 263
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 63/108 (58%), Gaps = 14/108 (12%)
Query: 95 DTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH-----AEDELYKIPEH 149
D+F +T + D +M++Y+E E+ K++G + +E + K +ELY++P+
Sbjct: 103 DSFTTQTNKLDVDKHMMEYIESEMRKRKG--YKPQEEIEEEYKDKGFVDIYEELYRLPDQ 160
Query: 150 LK--KRNSE-----ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 190
LK K+ SE + S+Q T I EV L I+ +L+NIEETE AK+ L
Sbjct: 161 LKGEKKESENEGNVQLSSQMLTAIPEVDLGIDTRLQNIEETEKAKRKL 208
>gi|301102378|ref|XP_002900276.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102017|gb|EEY60069.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 301
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 93/213 (43%), Gaps = 47/213 (22%)
Query: 92 VLQDTFAQETAVMVEDPN---MLKYVEQELAKKR---GK-NIDVNDRVENDLKHAEDELY 144
+L F ++A +D + M K++E+ L KKR GK N LK AED L+
Sbjct: 108 LLDGQFTGQSATTEKDQHVELMNKFIEERLQKKRKIDGKQNAGDVGDAAAALKTAEDRLF 167
Query: 145 KIPEHLK------KRNSEESS----TQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
++PEHL +N +E + TGIAEV+LP Y E TE A K E
Sbjct: 168 ELPEHLNPDVPSSSKNYDEGTEGGMLMGNTGIAEVELPSSY----AERTEKATKRALEAN 223
Query: 195 LMGRAKSDF-------SIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPT 247
AK D +P ++S D+ + +Y +++ + + ++RG + G
Sbjct: 224 KPRAAKLDAIGGLASSVVPGNFSTDFNRHKTNYVAEMKSLNKDEQRERGFRQVG------ 277
Query: 248 DNSTDAAGSRQAATDQFMLERFRKRE----RHR 276
+ ATD + +FRK E RHR
Sbjct: 278 ---------KNRATDNHAVSQFRKLESRKLRHR 301
>gi|298713725|emb|CBJ48916.1| hypothetical protein (Partial) [Ectocarpus siliculosus]
Length = 463
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 21/82 (25%)
Query: 134 NDLKHAEDELYKIPEHLKKRNSEE---------------------SSTQWTTGIAEVQLP 172
N++ ED LY IPE KK+ E + W TG+AE+ LP
Sbjct: 248 NNMLSEEDRLYTIPEDFKKKVEEAKVEFDTGANRGDEMDGEVGSGAQIAWNTGLAEIALP 307
Query: 173 IEYKLKNIEETEAAKKLLQEKR 194
IE+KLKN+E+T AA+ ++ R
Sbjct: 308 IEFKLKNMEDTLAARDKMESVR 329
>gi|349805921|gb|AEQ18433.1| hypothetical protein [Hymenochirus curtipes]
Length = 157
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 52/88 (59%), Gaps = 6/88 (6%)
Query: 90 ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEH 149
+L L +F+ ET ED +M+KY+E EL KK+G D +V+ LK AED LY++P+
Sbjct: 37 DLNLGTSFSAETNRRDEDADMMKYIETELKKKKGIVEDEEKKVK--LKSAEDCLYELPDS 94
Query: 150 LK----KRNSEESSTQWTTGIAEVQLPI 173
+K K+ E S Q +GI EV L I
Sbjct: 95 IKVSSAKKTEEMLSNQMLSGIPEVDLGI 122
>gi|328860548|gb|EGG09654.1| hypothetical protein MELLADRAFT_74358 [Melampsora larici-populina
98AG31]
Length = 385
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 65/122 (53%), Gaps = 18/122 (14%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI--DVNDRVENDLKHAE-------- 140
++ + F Q+T + D +M+ Y+E+EL ++R I ++ E L AE
Sbjct: 150 IIKSNNFTQQTNTLDVDKHMMHYIEEELKQRRKAAIAAGADESSEPILTGAEAVASLDPR 209
Query: 141 DELYKIPEHLK-------KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 193
DELYKI E + + N S+T T+ I EV L I+ ++KNIE TE AK+ L ++
Sbjct: 210 DELYKIAEKYRIDRKPVVEGNVTLSATMLTS-IPEVDLGIDTRIKNIEATEKAKRKLADE 268
Query: 194 RL 195
RL
Sbjct: 269 RL 270
>gi|348672270|gb|EGZ12090.1| hypothetical protein PHYSODRAFT_317356 [Phytophthora sojae]
Length = 297
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 94/212 (44%), Gaps = 45/212 (21%)
Query: 92 VLQDTFAQETAVMVEDPN---MLKYVEQELAKKR-----GKNIDVNDRVENDLKHAEDEL 143
+L F ++A +D + M +++E+ L KKR N D L+ AED+L
Sbjct: 104 LLDGQFTGQSATTDKDQHEELMNQFIEERLQKKRKTQQVSANGDGASDAAAALRTAEDKL 163
Query: 144 YKIPEHLKKRNSEESSTQW-----------TTGIAEVQLPIEYKLKNIEETEAAKKLLQE 192
+++P++LK SS + GIAEV+LP Y E TE A + E
Sbjct: 164 FELPDNLKPDVPSSSSAGYDDTAEGGMLMGNAGIAEVELPASY----AERTERATRTALE 219
Query: 193 KRLMGRAKSDF-------SIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR 245
+ G K D ++P+++SAD+ + DY +++ + + ++RG + G
Sbjct: 220 QSKAGGVKRDAVGGLANSALPTNFSADFNRHKTDYVAEMKSLNKDEQRERGFRTVG---- 275
Query: 246 PTDNSTDAAGSRQAATDQFMLERFRKRERHRV 277
+ A+D + RFRK E ++
Sbjct: 276 -----------KNQASDDRAVSRFRKFESRKL 296
>gi|358336454|dbj|GAA54958.1| hypothetical protein CLF_106187 [Clonorchis sinensis]
Length = 337
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 131/307 (42%), Gaps = 70/307 (22%)
Query: 34 EEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG-------------GGGLTKV-- 78
+E+ +E I+ LQK R+R GI SAL + AA GGL ++
Sbjct: 34 QEDSTHVVEAIRELQKVRKRPPGISL--SALSTGKAAPEETIIVSDPFKLKTGGLVEIRK 91
Query: 79 ---SEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNM--------LKYVEQELAKKR--- 122
S+K E E+D++ L TFA ET ED M + + ++KK+
Sbjct: 92 AIRSKKTE---EEDDVEARLAKTFATETNKRDEDAEMFVPFHASLIPFTGLSISKKKSLD 148
Query: 123 GKNIDV----NDRVENDLKHA-----------EDELYKIPEHLK----KRNSEESSTQWT 163
GK+ DV N + + D+ ++ D L +PE+L+ ++ + S Q
Sbjct: 149 GKDYDVLHLQNRKSDADVFYSAHSPDAVPNAGADLLRDVPEYLRPVIGQQKEDMLSNQML 208
Query: 164 TGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL---MGRAKSDFSIPSSYSADYFQRGR-- 218
GI EV L ++ K++NIE TE AK+ L + R G A SD P++ + ++ Q R
Sbjct: 209 CGIPEVDLGVDAKMRNIEATEEAKQTLLKHRFNRGYGMA-SDGLAPTNVAVNFVQHSRWN 267
Query: 219 DYAEKLRREHPELYKDRGSQDDGAGSRPTD------NSTDAAGSRQAA---TDQFMLERF 269
+ + +D S A TD DA R A TD +L+RF
Sbjct: 268 SHNATTTFSSGDYTRDLLSIASKANPHKTDIVHQQTTGLDAERERLGAERSTDSLVLQRF 327
Query: 270 RKRERHR 276
+ R R
Sbjct: 328 KSHMRGR 334
>gi|401406410|ref|XP_003882654.1| hypothetical protein NCLIV_024100 [Neospora caninum Liverpool]
gi|325117070|emb|CBZ52622.1| hypothetical protein NCLIV_024100 [Neospora caninum Liverpool]
Length = 344
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 37/56 (66%), Gaps = 3/56 (5%)
Query: 141 DELYKIPEHLK--KRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 193
+ELY++P+ L+ R+ E W TG+ EVQLP+ KLKNIE TE AK+ L +K
Sbjct: 224 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVQLPMTVKLKNIEATEKAKRALLKK 279
>gi|238578425|ref|XP_002388713.1| hypothetical protein MPER_12237 [Moniliophthora perniciosa FA553]
gi|215450245|gb|EEB89643.1| hypothetical protein MPER_12237 [Moniliophthora perniciosa FA553]
Length = 191
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 91/189 (48%), Gaps = 24/189 (12%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSE 156
Q+T + D +M+ Y+E+ L K R + D + D + A LY++PE K +
Sbjct: 15 LTQQTNALDVDKHMMTYIEEHL-KIRSRPKDEEKKKPLDPQEA---LYQVPERWKVEQKK 70
Query: 157 E--------SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSS 208
+ +S T I EV L ++ +LKNIEETE AK+++ E R R ++P
Sbjct: 71 QETDDGSITNSMTMLTAIPEVDLGMDARLKNIEETEKAKRVVAEDRSDKR-----TVPK- 124
Query: 209 YSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR--QAATDQFM 265
++ R Y L+ + ++ +D ++ + G + D A Q ATD+ +
Sbjct: 125 -GEEHLVAARFYRPNLKAKSDADIMRD--AKLEAMGLQLQDEQPRRANQDRPQIATDELV 181
Query: 266 LERFRKRER 274
+ERF+KR R
Sbjct: 182 MERFKKRMR 190
>gi|237832377|ref|XP_002365486.1| hypothetical protein TGME49_063860 [Toxoplasma gondii ME49]
gi|211963150|gb|EEA98345.1| hypothetical protein TGME49_063860 [Toxoplasma gondii ME49]
Length = 455
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 36/56 (64%), Gaps = 3/56 (5%)
Query: 141 DELYKIPEHLK--KRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 193
+ELY++P+ L+ R+ E W TG+ EV LP+ KLKNIE TE AK+ L +K
Sbjct: 335 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVHLPMTVKLKNIEATEKAKRALLKK 390
>gi|221502202|gb|EEE27940.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 455
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 36/56 (64%), Gaps = 3/56 (5%)
Query: 141 DELYKIPEHLK--KRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 193
+ELY++P+ L+ R+ E W TG+ EV LP+ KLKNIE TE AK+ L +K
Sbjct: 335 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVHLPMTVKLKNIEATEKAKRALLKK 390
>gi|221481742|gb|EEE20118.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 455
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 36/56 (64%), Gaps = 3/56 (5%)
Query: 141 DELYKIPEHLK--KRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 193
+ELY++P+ L+ R+ E W TG+ EV LP+ KLKNIE TE AK+ L +K
Sbjct: 335 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVHLPMTVKLKNIEATEKAKRALLKK 390
>gi|392576231|gb|EIW69362.1| hypothetical protein TREMEDRAFT_17281, partial [Tremella
mesenterica DSM 1558]
Length = 288
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 61/113 (53%), Gaps = 9/113 (7%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG-KNIDVNDRVENDLKHAEDEL------ 143
L+ + F Q+T + D +ML ++E+EL K+RG + +N+ + EL
Sbjct: 110 LIRTNNFTQQTNALDVDKHMLAFIEKELNKRRGAEAASKTSNTQNESFDPQSELLEVTKK 169
Query: 144 YKIPEHLK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
YKI +++K + S +S T I EV L +E +L+NIE TE AK+ + E R
Sbjct: 170 YKIEKNMKLEEEGSLTNSMGMLTTIPEVDLGMENRLRNIEATEKAKREMLESR 222
>gi|156099820|ref|XP_001615706.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804580|gb|EDL45979.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 225
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 33/46 (71%), Gaps = 3/46 (6%)
Query: 142 ELYKIPEHLKKRNSEESSTQ---WTTGIAEVQLPIEYKLKNIEETE 184
+LYK+ +HLK ++S S+ + TGI E+ LPIE KLKNIEETE
Sbjct: 166 DLYKLSDHLKVKSSVASNPEKLNCITGITEIPLPIEVKLKNIEETE 211
>gi|83285872|ref|XP_729913.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23489078|gb|EAA21478.1| Arabidopsis thaliana At1g02330/T6A9_12-related [Plasmodium yoelii
yoelii]
Length = 209
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 39/59 (66%), Gaps = 4/59 (6%)
Query: 143 LYKIPEHLKKRNSEESSTQ---WTTGIAEVQLPIEYKLKNIEETEAAKK-LLQEKRLMG 197
LYK+P+ LK + S ++ + TGI EV LP+E KLKNIEETE K+ LL++ + M
Sbjct: 147 LYKLPDDLKVKTSTNNAQERLNCFTGINEVPLPLEMKLKNIEETEKIKRELLKKAKFMN 205
>gi|221059073|ref|XP_002260182.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193810255|emb|CAQ41449.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 227
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 33/46 (71%), Gaps = 3/46 (6%)
Query: 142 ELYKIPEHLKKRNSEESSTQ---WTTGIAEVQLPIEYKLKNIEETE 184
+LYK+ +HLK ++S ++ + TGI EV LPIE KLKNIEETE
Sbjct: 168 DLYKLSDHLKVKSSVAANPEKLNCITGITEVPLPIEVKLKNIEETE 213
>gi|402226280|gb|EJU06340.1| hypothetical protein DACRYDRAFT_97824 [Dacryopinax sp. DJM-731 SS1]
Length = 299
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 83/181 (45%), Gaps = 23/181 (12%)
Query: 34 EEERRLALEEIKFLQKQRE--RKSGIPAI-----------PSALQSAAAAGGGGLTKVSE 80
E+ R L LE+I L+K R+ R GI P A GGL +
Sbjct: 47 EDARSLGLEDIIALRKYRQGMRHEGIDVGKLSKGEKRKRNPEGEDDGAVVEKGGLKRREF 106
Query: 81 KNEGDGEKDELVLQ----DTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDL 136
+ E + + E + + + F Q+T + + +M++Y+E E+ K+RG + +
Sbjct: 107 EGEVESSEAESIAKKLRANNFTQQTNALDVNKHMMEYIEGEIRKRRGDTASTEEGQKTGA 166
Query: 137 KHAEDELYKIPEHLKKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQE 192
+L+K +K + EE +S T I EV L +E +L+NIEETE AK+ E
Sbjct: 167 YDPYAQLFKT--DVKPDSREEAAISTSMAMLTAIPEVDLGMETRLRNIEETEKAKRQAAE 224
Query: 193 K 193
+
Sbjct: 225 R 225
>gi|389585171|dbj|GAB67902.1| hypothetical protein PCYB_124680, partial [Plasmodium cynomolgi
strain B]
Length = 223
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 33/46 (71%), Gaps = 3/46 (6%)
Query: 142 ELYKIPEHLKKRNSEESSTQ---WTTGIAEVQLPIEYKLKNIEETE 184
+LYK+ +HLK ++S S+ + TGI E+ LPIE KLKNIEETE
Sbjct: 164 DLYKLSDHLKVKSSVVSNQEKLNCITGITEIPLPIEVKLKNIEETE 209
>gi|115444733|ref|NP_001046146.1| Os02g0189900 [Oryza sativa Japonica Group]
gi|113535677|dbj|BAF08060.1| Os02g0189900 [Oryza sativa Japonica Group]
gi|218190225|gb|EEC72652.1| hypothetical protein OsI_06177 [Oryza sativa Indica Group]
Length = 105
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/29 (79%), Positives = 25/29 (86%), Gaps = 1/29 (3%)
Query: 146 IPEHLKKR-NSEESSTQWTTGIAEVQLPI 173
+ +HLK R NSEESSTQWTTGIAEVQ PI
Sbjct: 3 VADHLKVRKNSEESSTQWTTGIAEVQPPI 31
>gi|68066195|ref|XP_675081.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56494056|emb|CAH95798.1| conserved hypothetical protein [Plasmodium berghei]
Length = 209
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 39/59 (66%), Gaps = 4/59 (6%)
Query: 143 LYKIPEHLKKRNSEESSTQ---WTTGIAEVQLPIEYKLKNIEETEAAKK-LLQEKRLMG 197
LYK+P+ LK + S ++ + TGI EV LP+E KL+NIEETE K+ LL++ + M
Sbjct: 147 LYKLPDDLKVKTSTNNAQERLNCFTGINEVPLPLEMKLQNIEETEKIKRQLLKKAKFMS 205
>gi|124810282|ref|XP_001348824.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
gi|23497725|gb|AAN37263.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
Length = 226
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 82/166 (49%), Gaps = 35/166 (21%)
Query: 44 IKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQE-TA 102
+K LQ R +K GI + A TKV EK+ D EK +L F + T
Sbjct: 59 LKVLQHMRMKKKGI----------STANLNYETKVIEKH--DNEKK--LLDKHFTKNITE 104
Query: 103 VMVEDPNMLKYVEQ-------ELAKKRGKNID----------VNDRVENDLKHAEDELYK 145
+E+ ++ ++++ EL K+ + I+ +N + EN+ + LYK
Sbjct: 105 KEIEEAHIESFIKENMKEFYDELNNKKKQQIEQEHEHEQDQEINKKQENNDNDLINNLYK 164
Query: 146 IPEHLKKRNSEESSTQ---WTTGIAEVQLPIEYKLKNIEETEAAKK 188
+ +HLK + + E +++ TGI EV +P+E K+KNIEETE K+
Sbjct: 165 LSDHLKIKTTHEDTSEKLNCITGITEVPIPLEIKMKNIEETEKFKR 210
>gi|17509215|ref|NP_492142.1| Protein T23G11.4 [Caenorhabditis elegans]
gi|3880111|emb|CAB03415.1| Protein T23G11.4 [Caenorhabditis elegans]
Length = 240
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 85/189 (44%), Gaps = 33/189 (17%)
Query: 30 LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAA--------GGGGLTKVSEK 81
++ DEE R++ +I+ LQ+ RERK+G+ + A+ + AA GGG+ S+K
Sbjct: 23 VAADEESSRVS--DIRDLQRSRERKNGLTELECAVGISKAAALEDGIQMAGGGMVMTSKK 80
Query: 82 NEG-DGEKDELVLQDTFAQETAVMVEDPNMLKYVE---QELAKKRGK------------- 124
+ E L++ F +ET + E + KY++ QE K
Sbjct: 81 KAAMEAASIEQGLREQFEKETMLRDEHEELRKYIDDGLQEYTADTSKIEKQKQPSSSAAA 140
Query: 125 ---NIDVNDRVENDLKHAEDELYKIPEHLKKRNSEESSTQWTTGIAEVQLPIEYKLKNIE 181
+++ DR LK A K+ + K+ +E S GI EV L I ++ NI
Sbjct: 141 KFSSLNAEDRDVELLKQAAG---KVKGNQSKKETELLSEHMLAGIPEVDLGISTRITNIL 197
Query: 182 ETEAAKKLL 190
ETE K+ L
Sbjct: 198 ETEKKKRFL 206
>gi|268562986|ref|XP_002638721.1| Hypothetical protein CBG00304 [Caenorhabditis briggsae]
Length = 243
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 81/184 (44%), Gaps = 34/184 (18%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA---------GGGGLTKVSEKNEGDGEKDEL 91
+ +I+ LQ+ RERK+G+ + A+ AA GGG + +K + E
Sbjct: 32 VADIRDLQRSRERKNGLTELECAVGITKAAALEDGIQMTGGGMIMTAKKKAAMEAASIEH 91
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQEL----------AKKRGKN------------IDVN 129
L+D F +ET + E + KY++ L ++ RG+ ++ +
Sbjct: 92 GLRDQFEKETMLRDEHEELRKYIDDGLTHYTKDTSTPSQPRGETPQAASASSRFSSLNAD 151
Query: 130 DRVENDLKHAEDELYKIPEHLKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKL 189
DR LK A K+ + K+ +E S GI EV L I ++ NI ETE K+
Sbjct: 152 DRDVELLKEAAG---KLKANQSKKETELLSEHMLAGIPEVDLGISTRITNILETEKKKRF 208
Query: 190 LQEK 193
L +K
Sbjct: 209 LMQK 212
>gi|222622342|gb|EEE56474.1| hypothetical protein OsJ_05693 [Oryza sativa Japonica Group]
Length = 135
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 20/24 (83%), Positives = 22/24 (91%)
Query: 150 LKKRNSEESSTQWTTGIAEVQLPI 173
L ++NSEESSTQWTTGIAEVQ PI
Sbjct: 38 LVRKNSEESSTQWTTGIAEVQPPI 61
>gi|403169032|ref|XP_003328586.2| hypothetical protein PGTG_10545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167772|gb|EFP84167.2| hypothetical protein PGTG_10545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 399
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 99/222 (44%), Gaps = 50/222 (22%)
Query: 23 EEETTNKLSDDEEERRLALEEIKFLQKQRERKSGI---------PAIPSALQSAAAAGGG 73
+E + +++ ++E E R +EE+ L++ ++ + GI P ++ AAG G
Sbjct: 50 DEVSKDRVKEEEVEARRTVEEMIALRRLKQARVGIELQRLNAGEPKKKKKKKNPNAAGEG 109
Query: 74 G---------LTKVSEKNEGDGEKDE----------------LVLQDTFAQETAVMVEDP 108
+ ++ + D KD+ ++ + F Q+T + D
Sbjct: 110 ADGSEQGGKPVGANTDPLDDDPVKDDRLADDEPEDEDARTRKIIKSNHFTQQTNTLDVDK 169
Query: 109 NMLKYVEQELAKKRGKNIDVN--DRVENDLKHAE--------DELYKIPE--HLKKRNSE 156
+M+ Y+E+EL ++R I + E LK E DELYKI E ++K+
Sbjct: 170 HMMAYIEEELQRRRTDAIAAGTIESSEPILKGLEAIASLDPRDELYKIAEKYRIQKKPVV 229
Query: 157 ES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 194
E S T I EV L I+ +++N E TE AK+ L E+R
Sbjct: 230 EGNVTLSATMLTSIPEVDLGIDNRIRNFEATEKAKRQLTEQR 271
>gi|308476864|ref|XP_003100647.1| hypothetical protein CRE_20426 [Caenorhabditis remanei]
gi|308264665|gb|EFP08618.1| hypothetical protein CRE_20426 [Caenorhabditis remanei]
Length = 246
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 79/185 (42%), Gaps = 35/185 (18%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA---------GGGGLTKVSEKNEGDGEKDEL 91
+ +I+ LQ+ RERK+G+ + A+ AA GGG + +K + E
Sbjct: 32 VSDIRDLQRSRERKNGLTELECAVGITKAAALEDGIQMTGGGMMMTAKKKAAMEAASIEH 91
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN-----------------------IDV 128
L+D F +ET + E + KY++ L N ++
Sbjct: 92 GLRDQFEKETMLRDEHEELRKYIDDGLTHYTKDNSSNSTQKTEKEPKIQSTSSKFSSLNA 151
Query: 129 NDRVENDLKHAEDELYKIPEHLKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKK 188
+DR D++ ++ K+ + K+ +E S GI EV L I ++ NI ETE K+
Sbjct: 152 DDR---DVELLKEAATKVRANQGKKETELLSEHMLAGIPEVDLGISTRITNILETEKKKR 208
Query: 189 LLQEK 193
L +K
Sbjct: 209 FLLQK 213
>gi|308459412|ref|XP_003092026.1| hypothetical protein CRE_23169 [Caenorhabditis remanei]
gi|308254444|gb|EFO98396.1| hypothetical protein CRE_23169 [Caenorhabditis remanei]
Length = 246
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 79/185 (42%), Gaps = 35/185 (18%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA---------GGGGLTKVSEKNEGDGEKDEL 91
+ +I+ LQ+ RERK+G+ + A+ AA GGG + +K + E
Sbjct: 32 VSDIRDLQRSRERKNGLTELECAVGITKAAALEDGIQMTGGGMMMTAKKKAAMEAASIEH 91
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN-----------------------IDV 128
L+D F +ET + E + KY++ L N ++
Sbjct: 92 GLRDQFEKETMLRDEHEELRKYIDDGLTHYTKDNSSNSTQKTEKEPKIQSTSYKFSSLNA 151
Query: 129 NDRVENDLKHAEDELYKIPEHLKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKK 188
+DR D++ ++ K+ + K+ +E S GI EV L I ++ NI ETE K+
Sbjct: 152 DDR---DVELLKEAAAKVRANQGKKETELLSEHMLAGIPEVDLGISTRITNILETEKKKR 208
Query: 189 LLQEK 193
L +K
Sbjct: 209 FLLQK 213
>gi|341883974|gb|EGT39909.1| hypothetical protein CAEBREN_32234 [Caenorhabditis brenneri]
gi|341886393|gb|EGT42328.1| hypothetical protein CAEBREN_25065 [Caenorhabditis brenneri]
Length = 244
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 95/218 (43%), Gaps = 42/218 (19%)
Query: 7 QKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
+K +KK RK S EEE+ D++E R++ +I+ LQK RERK+G+ + A+
Sbjct: 6 RKPKKKIQQRKISAEEEQ------FDEDESTRVS--DIRDLQKSRERKNGLTELECAVGI 57
Query: 67 AAAAG--------GGGLTKVSEKNEGDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQE 117
AA GGG+ ++K E L++ F +ET + E + KY++
Sbjct: 58 TKAAALEDGIQMSGGGMQMTAKKKAAMEAASIEHGLREQFEKETMLRDEHEELRKYIDDG 117
Query: 118 L----------------------AKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNS 155
L + +++ DR LK A K+ + K+ +
Sbjct: 118 LTHYTKDTSKGSSSRRDPPPEQSTSSKFSSLNAEDRDVELLKQAAG---KVRANQGKKET 174
Query: 156 EESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 193
E S GI EV L I ++ NI ETE K+ L EK
Sbjct: 175 ELLSEHMLAGIPEVDLGIGSRITNILETEKKKRFLLEK 212
>gi|325182770|emb|CCA17225.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325189176|emb|CCA23700.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 267
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 8/83 (9%)
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL------KKRNSEESSTQWT 163
M +Y+E L + + D D+ ++ ++ +D LY + L + NS + W
Sbjct: 111 MNRYIEDRLGSIKVDSKD--DKSQDSIEKEDDALYALSTDLAPTPTTNETNSSDGVLIWN 168
Query: 164 TGIAEVQLPIEYKLKNIEETEAA 186
TGIAEV+LP YK K +E T++A
Sbjct: 169 TGIAEVELPSTYKNKIVEATKSA 191
>gi|353238535|emb|CCA70478.1| hypothetical protein PIIN_04416 [Piriformospora indica DSM 11827]
Length = 299
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 126/326 (38%), Gaps = 96/326 (29%)
Query: 9 KEKKKNFRKR--SYEEEEETTNKLSDDEEE--------RRLALEEIKFLQKQRERKSGIP 58
K++ KN RK + E++EE +S +E R+ +LE++ L+ R+ + GI
Sbjct: 5 KQRPKNQRKHEENVEKQEEQDGHVSGEETPAKEATPAIRQSSLEDMIALRNMRKARQGID 64
Query: 59 AIPSALQSAAAAGGGGLTK--------------VSEKNEGDGEKD---------ELVLQD 95
A S A GG K + E DGE D V
Sbjct: 65 A------SKLATGGQKKRKNEEEEYESEKPRYGLHTPKEDDGEDDLEGAMAKARRAVRMS 118
Query: 96 TFAQETAVMVEDPNMLKYVEQEL-AKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--- 151
F Q+T + D +M+ Y+E+ L K+ + + E+ + K E K
Sbjct: 119 NFTQQTNALDVDKHMMAYIEENLKLMKQQAGTSIEEDASKAPPTTEEGMLKFGERYKTHG 178
Query: 152 ---KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAK-------- 200
K S +S + I EV L ++ +L+NIEETE AK++ E++ A+
Sbjct: 179 IELKEGSVGNSLAMLSAIPEVDLGMDARLRNIEETEKAKRIAAEEKRAREAQRMNPDEAR 238
Query: 201 ---SDFSIP---------SSYSADYFQRGRDYAEKLRREH--PELYKDRGSQDDGAGSRP 246
+ F P + A Y G D EK +R+H PEL
Sbjct: 239 LAATRFYNPHLRQESDQEAIKQAKYKALGIDVPEKSQRKHERPEL--------------- 283
Query: 247 TDNSTDAAGSRQAATDQFMLERFRKR 272
A+D+ ++ERFRKR
Sbjct: 284 -------------ASDEAVMERFRKR 296
>gi|119480413|ref|XP_001260235.1| hypothetical protein NFIA_082890 [Neosartorya fischeri NRRL 181]
gi|119408389|gb|EAW18338.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 344
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 86/199 (43%), Gaps = 32/199 (16%)
Query: 50 QRERKSGIPAIPSALQSAAAAGG-GGLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVE- 106
QR RK GI ++ Q G +T V+ + D E +++ + D F T V+
Sbjct: 74 QRARKGGIEFSNTSRQRTDKTGNQAAVTTVTAE---DLENEKIRAMCDRFTAYTGQTVDV 130
Query: 107 DPNMLKYVEQELAKKR---------GKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEE 157
D +M+ Y+E E+AK+ NI + +D L + P L K
Sbjct: 131 DKHMMAYIETEMAKRHRQQMPANTTDSNISTTSQASSDGLSTTVALQREPASLGK----- 185
Query: 158 SSTQWTTGIAEVQLPIEYKLKNIEETEAA-KKLLQEKRLMGRAKSDFSIPSSYSADYFQR 216
+ E+ L E KL+NI TEAA ++L+ + R + A D +S A +
Sbjct: 186 --------LHEIDLGQEAKLQNIARTEAATRRLVGDDRDVSPANED---STSSIAASGKD 234
Query: 217 GRDYAEKLRREHPELYKDR 235
GR + + RR ++ +DR
Sbjct: 235 GRPWRNRKRRNSEDIERDR 253
>gi|121715222|ref|XP_001275220.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119403377|gb|EAW13794.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 336
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 85/197 (43%), Gaps = 30/197 (15%)
Query: 50 QRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVE-DP 108
QR RK GI ++ Q++ G G + + + E+ + D F T V+ D
Sbjct: 68 QRGRKGGIEFSTTSRQTSDRTGSQGAGAMVSAEDLENERIR-AMCDRFTVHTGQTVDVDK 126
Query: 109 NMLKYVEQELAKK---------RGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEESS 159
+M+ Y+E E+AK+ G + ++R +D + P L K
Sbjct: 127 HMMAYIENEMAKRYRPKMPTDTTGSDDAASNRTTSDGFATAVASKREPASLGK------- 179
Query: 160 TQWTTGIAEVQLPIEYKLKNIEETEAA-KKLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 218
+ E+ L E K++NI TEAA +KL+ E G +S +S +A + GR
Sbjct: 180 ------LHEIDLGQETKMQNIARTEAATRKLVGEDMSPGLGES-----ASSTAGAGKAGR 228
Query: 219 DYAEKLRREHPELYKDR 235
+ + RR ++ +DR
Sbjct: 229 PWRNRKRRNSEDIERDR 245
>gi|302843629|ref|XP_002953356.1| hypothetical protein VOLCADRAFT_105874 [Volvox carteri f.
nagariensis]
gi|300261453|gb|EFJ45666.1| hypothetical protein VOLCADRAFT_105874 [Volvox carteri f.
nagariensis]
Length = 562
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 56/110 (50%), Gaps = 9/110 (8%)
Query: 93 LQDTFAQETAVMV---EDPNMLKYVEQELAKKRGKN---IDVNDRVENDLK--HAEDELY 144
+ DT+ + ++ ED +M KY+E++LA + GK + ++ ++ + K E ELY
Sbjct: 195 VMDTYVKAKSIATQQDEDAHMQKYIEEQLAVRLGKTAAQVSEDEELDPEAKKRKIEAELY 254
Query: 145 KIPEHLKKRNSEESSTQ-WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 193
+P K +E + ++EV L KL +IE TEA K+ L K
Sbjct: 255 AVPADFKNTLEQEVVLPGLVSTLSEVPLSARDKLASIEATEALKRKLLAK 304
>gi|330840141|ref|XP_003292079.1| hypothetical protein DICPUDRAFT_156758 [Dictyostelium purpureum]
gi|325077714|gb|EGC31409.1| hypothetical protein DICPUDRAFT_156758 [Dictyostelium purpureum]
Length = 265
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 116/279 (41%), Gaps = 59/279 (21%)
Query: 7 QKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
+KK+K +N RK+ +E +N + EE+ LE K QK RE+ G+ +
Sbjct: 36 KKKDKVRNLRKK----DETNSNLEVEGEEDEENILELTKEKQKLREKGKGL--------N 83
Query: 67 AAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK------ 120
G K + + + D + QE + +E KY+ ++L
Sbjct: 84 VGILAEGPHIKQNFRELENKLDDSFTAHNEDKQEVNLHLE-----KYINEQLELKKQKQK 138
Query: 121 --KRGKNIDVNDRVENDLKHAEDELYKIPEHLKK--RNSEESSTQWTTGIAEVQLPIEYK 176
K N + N+ ++ + E L++ PEHLK+ + E T W GIAEVQLP +K
Sbjct: 139 QSKTDNNDNNNNENNSNTELKESSLFETPEHLKRNEKKQNEDKTNWVAGIAEVQLPEVFK 198
Query: 177 LKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRG 236
KN+ ETE A+ +++ + + P +++ +Y + H ++
Sbjct: 199 YKNMVETEKARDAMEKDS--DKHTEKLNTPQNFNQNY------------QYHNRFINNKK 244
Query: 237 SQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERH 275
DD ATDQ +E F+KR R+
Sbjct: 245 RSDD------------------KATDQEAVENFKKRFRY 265
>gi|317027296|ref|XP_001400603.2| hypothetical protein ANI_1_2038024 [Aspergillus niger CBS 513.88]
Length = 341
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 66/150 (44%), Gaps = 23/150 (15%)
Query: 50 QRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVE-D 107
QR RK GI + S AG T VS E D E + L + D F T V+ D
Sbjct: 70 QRARKGGIEF---SATSRPPAGKNSQTAVSTVAEEDQENERLRAMCDRFTAHTGQTVDVD 126
Query: 108 PNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEESSTQW----- 162
+M+ ++E E+AK+ +++ E ++ P + + + SS
Sbjct: 127 KHMMDFIESEMAKRYRRDMPT-----------EIAMHDTPSATEPKAAALSSADLPQRGP 175
Query: 163 -TTG-IAEVQLPIEYKLKNIEETEAAKKLL 190
+ G + E+ L E KL+NI TEAA K L
Sbjct: 176 ASLGKLHEIDLGHETKLQNIARTEAATKRL 205
>gi|212528790|ref|XP_002144552.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210073950|gb|EEA28037.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 332
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 63/143 (44%), Gaps = 13/143 (9%)
Query: 50 QRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVE-DP 108
QR R+ GI S S AA LT E E D K L D F T V+ D
Sbjct: 75 QRMRRGGIEF--STTSSQAADKNHSLTLSVEDTEADILKARL---DRFTAHTGQKVDVDK 129
Query: 109 NMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKKRNSEESSTQWTTG-IA 167
+M++Y+E ELA+++ + +ND A + + N+ T G +
Sbjct: 130 HMMEYIESELARRQNRT------QKNDENSAASRFSQSDANDSFTNAFSKREPATLGKLH 183
Query: 168 EVQLPIEYKLKNIEETEAAKKLL 190
E+ L E KL+NI TEAA + +
Sbjct: 184 EIDLGQETKLQNIARTEAATRRM 206
>gi|168179428|ref|ZP_02614092.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
gi|182669596|gb|EDT81572.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916]
Length = 143
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 43/95 (45%), Gaps = 12/95 (12%)
Query: 116 QELAKKRGKNIDVN-------DRVENDLKHAEDELYKIPEHLKKRNS----EESSTQWTT 164
+EL + KNI N + + ND D+LY++ E++K NS E + +
Sbjct: 49 KELNIRYKKNISFNIHYFSDKEDLNNDCNDMTDKLYEVLEYIKTNNSLYRANEMTHEVID 108
Query: 165 GIAEVQLPIEYK-LKNIEETEAAKKLLQEKRLMGR 198
G+ L Y LK IEE KL QE L GR
Sbjct: 109 GVLHFMLQFNYHVLKEIEEAPKMNKLKQEVYLNGR 143
>gi|315041258|ref|XP_003170006.1| hypothetical protein MGYG_08185 [Arthroderma gypseum CBS 118893]
gi|311345968|gb|EFR05171.1| hypothetical protein MGYG_08185 [Arthroderma gypseum CBS 118893]
Length = 326
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 23/124 (18%)
Query: 92 VLQDTFAQETAVMVE-DPNMLKYVEQELAKKR-----GKNIDVNDRVE--NDLKHAEDEL 143
+ D F + V+ D +M+ ++E E+AK+R +N D +D E + L+H +
Sbjct: 101 AISDRFVGHSGQKVDVDKHMMAFIEAEMAKRRHGTLPSENNDPSDPAESQSQLRHGAEHT 160
Query: 144 YKIPEHLKKRNSEESSTQWTTG-IAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 202
HL +R T G + E+ L + KL+NI TEAA + L AK D
Sbjct: 161 PAADLHLPQRQP------ATLGKLHEIDLGPDSKLQNIARTEAATRNL--------AKGD 206
Query: 203 FSIP 206
+ P
Sbjct: 207 SAAP 210
>gi|357608205|gb|EHJ65877.1| hypothetical protein KGM_17821 [Danaus plexippus]
Length = 168
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 16/111 (14%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAA-----------AAGGGGLTKVSEKNEGDGEKD 89
LEE K +QK RER +G+ + A A + GG+ + G ++
Sbjct: 43 LEEAKEIQKLRERPNGVSVVALATGQATISEEITCKDPFSVKSGGMINMQALKSGKVKQV 102
Query: 90 E----LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDL 136
E + F+ ET ED M+KY+E++LAK++G+ + VN+ + +L
Sbjct: 103 EDAYDTGIGTQFSAETNKRDEDEEMMKYIEEQLAKRKGRCM-VNNSIYYNL 152
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.127 0.341
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,399,976,953
Number of Sequences: 23463169
Number of extensions: 195055966
Number of successful extensions: 1130934
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 174
Number of HSP's successfully gapped in prelim test: 3999
Number of HSP's that attempted gapping in prelim test: 1090995
Number of HSP's gapped (non-prelim): 25465
length of query: 280
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 140
effective length of database: 9,074,351,707
effective search space: 1270409238980
effective search space used: 1270409238980
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 76 (33.9 bits)