BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023468
(282 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224128117|ref|XP_002320248.1| predicted protein [Populus trichocarpa]
gi|222861021|gb|EEE98563.1| predicted protein [Populus trichocarpa]
Length = 271
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 221/274 (80%), Positives = 244/274 (89%), Gaps = 6/274 (2%)
Query: 10 EKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA 69
+KK+NFRKR++EE+E + DDE+ERRLALEE+KFLQKQRERKSGIPA+ + Q+A
Sbjct: 3 QKKRNFRKRTFEEDEHSKAS-DDDEQERRLALEEVKFLQKQRERKSGIPALATTSQTATT 61
Query: 70 AGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
K++EK +GDGEK+ELVLQDTFAQETAVMVEDPNML+YVEQELAKKRGKNID
Sbjct: 62 VAA----KLTEKADGDGEKEELVLQDTFAQETAVMVEDPNMLQYVEQELAKKRGKNIDAT 117
Query: 130 DRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 189
D+VE +LK AEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAK
Sbjct: 118 DQVETELKRAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAK 177
Query: 190 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG-AGSRP 248
KLLQEKRLMGR KS+FSIPSSYSADYFQRGRDYAEKLRR+HPELYKDR QDD AGS+P
Sbjct: 178 KLLQEKRLMGRPKSEFSIPSSYSADYFQRGRDYAEKLRRDHPELYKDRSLQDDAVAGSKP 237
Query: 249 TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
DNSTDAAG RQAATD+FMLERFRKRERHRVMRR
Sbjct: 238 ADNSTDAAGRRQAATDEFMLERFRKRERHRVMRR 271
>gi|356498525|ref|XP_003518101.1| PREDICTED: uncharacterized protein C9orf78 homolog [Glycine max]
Length = 287
Score = 433 bits (1113), Expect = e-119, Method: Compositional matrix adjust.
Identities = 222/284 (78%), Positives = 245/284 (86%), Gaps = 9/284 (3%)
Query: 7 QKKEKKKNFRKRSY-----EEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
+++++KKN+RKRS E + +N SDDE ERR+ALEEIK LQKQRERKSGIPA P
Sbjct: 5 KQQQRKKNYRKRSAPTDKDELPQSQSNNESDDERERRMALEEIKLLQKQRERKSGIPANP 64
Query: 62 SALQSAAAAGGGGLTKVSEKNEGDG-EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
S LQ + GGG K +EKN+GDG EKDELVLQDTFAQETAVM EDPNM+KY+E ELAK
Sbjct: 65 S-LQVQSGTGGGLAAKAAEKNDGDGGEKDELVLQDTFAQETAVMDEDPNMVKYIEHELAK 123
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLK 180
KRG+ ID D+VEN+LK AEDELYKIPEHLKVK+RNSEESSTQWTTGIAEVQLPIEYKLK
Sbjct: 124 KRGRKIDAADQVENELKRAEDELYKIPEHLKVKRRNSEESSTQWTTGIAEVQLPIEYKLK 183
Query: 181 NIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQ 240
NIEETEAAKKLLQEKRLMGR KSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDR Q
Sbjct: 184 NIEETEAAKKLLQEKRLMGRTKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRSHQ 243
Query: 241 DDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 282
DD +GS+ D+STDAAG+ RQAATD+FMLERFRKRERHRVMRR
Sbjct: 244 DDSSGSKQNDSSTDAAGAVQRQAATDEFMLERFRKRERHRVMRR 287
>gi|358248108|ref|NP_001239815.1| uncharacterized protein LOC100812323 [Glycine max]
gi|255645199|gb|ACU23097.1| unknown [Glycine max]
Length = 288
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 220/284 (77%), Positives = 244/284 (85%), Gaps = 9/284 (3%)
Query: 7 QKKEKKKNFRKRSYEEEEE-----TTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
Q++++KKN+RKRS +E+ +N SDDE ERR+ALEEIK LQKQRERKSGIPA P
Sbjct: 6 QQQQRKKNYRKRSAPTDEDELPQSQSNNESDDERERRMALEEIKLLQKQRERKSGIPANP 65
Query: 62 SALQSAAAAGGGGLTKVSEKNEGDG-EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
S LQ + GGG K +EKN+GDG +KDELVLQDTFAQETAVM EDPNM+ Y+E ELAK
Sbjct: 66 S-LQVQSGTGGGLAAKAAEKNDGDGGDKDELVLQDTFAQETAVMDEDPNMVNYIEHELAK 124
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLK 180
KRG+ ID D+ EN+LK AEDELYKIPEHLKVK+RNSEESSTQWTTGIAEVQLPIEYKLK
Sbjct: 125 KRGRKIDAADQAENELKRAEDELYKIPEHLKVKRRNSEESSTQWTTGIAEVQLPIEYKLK 184
Query: 181 NIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQ 240
NIEETEAAKKLLQEKRLMGR KSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDR Q
Sbjct: 185 NIEETEAAKKLLQEKRLMGRTKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRNHQ 244
Query: 241 DDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 282
DD +GS+ D+STDAAG+ RQAATD+FMLERFRKRERHRVMRR
Sbjct: 245 DDSSGSKKNDSSTDAAGAVQRQAATDEFMLERFRKRERHRVMRR 288
>gi|297744059|emb|CBI37029.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 214/276 (77%), Positives = 243/276 (88%), Gaps = 5/276 (1%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+ +KKNFRKRS E+++ N S+DEEERRLALEE+KFLQKQRERK GIPAIP+ LQ+
Sbjct: 27 KEMQKKNFRKRSIEDDQAKDNNNSEDEEERRLALEEVKFLQKQRERKLGIPAIPT-LQTT 85
Query: 68 AAAGGGGLTKVSEKNE-GDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI 126
G KV+EKNE DG+K+ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG+NI
Sbjct: 86 ---GVTPTKKVAEKNEVPDGDKEELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGRNI 142
Query: 127 DVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETE 186
D ++V NDLK A+DELY +PEHLKVK+RNSEESSTQWTTGIAEVQLP+EYKL+NIEETE
Sbjct: 143 DATNQVGNDLKRADDELYVVPEHLKVKRRNSEESSTQWTTGIAEVQLPVEYKLRNIEETE 202
Query: 187 AAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGS 246
AAKKLLQ+KRLMGR K++F+IPSSYSADYFQRGRDYAEKLRREHPELYKD+G QD+G GS
Sbjct: 203 AAKKLLQDKRLMGRTKTEFNIPSSYSADYFQRGRDYAEKLRREHPELYKDKGVQDNGGGS 262
Query: 247 RPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
R D ST+ AG RQAATD+FML+RFRKRERHRVMRR
Sbjct: 263 RLPDASTEVAGRRQAATDEFMLDRFRKRERHRVMRR 298
>gi|225437728|ref|XP_002280535.1| PREDICTED: uncharacterized protein LOC100250416 [Vitis vinifera]
Length = 270
Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust.
Identities = 213/273 (78%), Positives = 241/273 (88%), Gaps = 5/273 (1%)
Query: 11 KKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAA 70
+KKNFRKRS E+++ N S+DEEERRLALEE+KFLQKQRERK GIPAIP+ LQ+
Sbjct: 2 QKKNFRKRSIEDDQAKDNNNSEDEEERRLALEEVKFLQKQRERKLGIPAIPT-LQTT--- 57
Query: 71 GGGGLTKVSEKNE-GDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
G KV+EKNE DG+K+ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG+NID
Sbjct: 58 GVTPTKKVAEKNEVPDGDKEELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGRNIDAT 117
Query: 130 DRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 189
++V NDLK A+DELY +PEHLKVK+RNSEESSTQWTTGIAEVQLP+EYKL+NIEETEAAK
Sbjct: 118 NQVGNDLKRADDELYVVPEHLKVKRRNSEESSTQWTTGIAEVQLPVEYKLRNIEETEAAK 177
Query: 190 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPT 249
KLLQ+KRLMGR K++F+IPSSYSADYFQRGRDYAEKLRREHPELYKD+G QD+G GSR
Sbjct: 178 KLLQDKRLMGRTKTEFNIPSSYSADYFQRGRDYAEKLRREHPELYKDKGVQDNGGGSRLP 237
Query: 250 DNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
D ST+ AG RQAATD+FML+RFRKRERHRVMRR
Sbjct: 238 DASTEVAGRRQAATDEFMLDRFRKRERHRVMRR 270
>gi|255556659|ref|XP_002519363.1| Protein C9orf78, putative [Ricinus communis]
gi|223541430|gb|EEF42980.1| Protein C9orf78, putative [Ricinus communis]
Length = 293
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 224/292 (76%), Positives = 245/292 (83%), Gaps = 22/292 (7%)
Query: 11 KKKNFRKRSYEEEEE-----TTNKLS--DDEEERRLALEEIKFLQKQRERKSGIPAIPSA 63
KKKNFRKRS EE E+ N + DDEEERRLALEE+KFLQKQRERKSGIPAI +
Sbjct: 4 KKKNFRKRSIEEAEDPESSRNNNNATPDDDEEERRLALEEVKFLQKQRERKSGIPAILTP 63
Query: 64 LQSAAAA----------GGGGLT---KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNM 110
SA+++ GL KV+EKN+GDGEK++LVLQDTFAQETAVMVEDPNM
Sbjct: 64 SSSASSSAAAAAAQLQQNSSGLVSSKKVTEKNDGDGEKEDLVLQDTFAQETAVMVEDPNM 123
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAE 170
L YVEQELAKK GKN+D +VEN+LK AEDELY IPEHLKVK+RNSEESSTQWTTGIAE
Sbjct: 124 LMYVEQELAKKSGKNVDAT-QVENELKRAEDELYTIPEHLKVKRRNSEESSTQWTTGIAE 182
Query: 171 VQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREH 230
VQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKS+FSIPSSYSADYFQRGRDYAEKLRREH
Sbjct: 183 VQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSEFSIPSSYSADYFQRGRDYAEKLRREH 242
Query: 231 PELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
PELYKDR SQD+ AGS+P DN+TDA R+AATD+FMLERFRKRERHRVMRR
Sbjct: 243 PELYKDRNSQDESAGSKPADNNTDAT-RREAATDEFMLERFRKRERHRVMRR 293
>gi|449463519|ref|XP_004149481.1| PREDICTED: uncharacterized protein LOC101215146 [Cucumis sativus]
Length = 293
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 204/288 (70%), Positives = 232/288 (80%), Gaps = 16/288 (5%)
Query: 11 KKKNFRKRSYEEEEE-------TTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSA 63
KKKNFRKR+ + E+ T ++EEE R+ALEE+KFLQKQRER++GIPA+P
Sbjct: 6 KKKNFRKRNCYDSEDGGDEANSVTAISEEEEEEHRMALEEVKFLQKQRERRAGIPAVPPV 65
Query: 64 LQSAAAAGGGGL------TKVSEKNE---GDGEKDELVLQDTFAQETAVMVEDPNMLKYV 114
AG GG KNE G+G+KD+LVLQDTFAQETAVMVEDPNMLKY+
Sbjct: 66 SAQTTTAGAGGARSHKTGGGGGNKNESAGGEGDKDDLVLQDTFAQETAVMVEDPNMLKYI 125
Query: 115 EQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLP 174
EQELAKKRG+ ++ + ENDLK AEDELYKIPEHLKVK+RNS ESSTQWTTGIAEVQLP
Sbjct: 126 EQELAKKRGRTVETVEGAENDLKQAEDELYKIPEHLKVKRRNSNESSTQWTTGIAEVQLP 185
Query: 175 IEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELY 234
IE+KLKNIEETEAAKKLLQEKR +GR+ S+FSIPSSYSADYF RGRDYAEKLRREHPELY
Sbjct: 186 IEFKLKNIEETEAAKKLLQEKRFVGRSTSEFSIPSSYSADYFHRGRDYAEKLRREHPELY 245
Query: 235 KDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
KDR QDDG+GS+P + T+AAG RQAATD+FMLERFRKRERHRVMRR
Sbjct: 246 KDRSLQDDGSGSKPAETGTEAAGQRQAATDEFMLERFRKRERHRVMRR 293
>gi|297848438|ref|XP_002892100.1| hypothetical protein ARALYDRAFT_887370 [Arabidopsis lyrata subsp.
lyrata]
gi|297337942|gb|EFH68359.1| hypothetical protein ARALYDRAFT_887370 [Arabidopsis lyrata subsp.
lyrata]
Length = 277
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 208/280 (74%), Positives = 237/280 (84%), Gaps = 15/280 (5%)
Query: 12 KKNFRKRSYEEE--EETTNK--LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+NFRKRS+EEE + NK +SD+EE+RRLALEE+KFLQK RERK GIPA+ +A S
Sbjct: 4 KRNFRKRSFEEEEEDNDVNKAAISDEEEKRRLALEEVKFLQKLRERKLGIPALSTAQSSI 63
Query: 68 AAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G K EK E +GEK+ELVLQDTFAQETAV++EDPNM+KY+EQELAKKRGKNID
Sbjct: 64 ------GKVKPVEKTEAEGEKEELVLQDTFAQETAVLIEDPNMVKYIEQELAKKRGKNID 117
Query: 128 VNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 187
+ VEN+LK EDELYKIP+HLKVKKR+SEESSTQWTTGIAEVQLPIEYKLKNIEETEA
Sbjct: 118 DAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 177
Query: 188 AKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGS-QDDGAGS 246
AKKLLQE+RLMGR KS+FSIPSSYSADYFQRG+DYAEKLRREHPELYKDRG Q DG G+
Sbjct: 178 AKKLLQERRLMGRPKSEFSIPSSYSADYFQRGKDYAEKLRREHPELYKDRGGPQADGEGA 237
Query: 247 RP----TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
+P ++N+ D+ SRQAATDQ MLERFRKRER+RVMRR
Sbjct: 238 KPSTSSSNNNADSGKSRQAATDQIMLERFRKRERNRVMRR 277
>gi|449481099|ref|XP_004156081.1| PREDICTED: uncharacterized LOC101215146 [Cucumis sativus]
Length = 305
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 201/300 (67%), Positives = 232/300 (77%), Gaps = 28/300 (9%)
Query: 11 KKKNFRKRSYEEEEE-------TTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSA 63
KKKNFRKR+ + E+ T ++EEE R+ALEE+KFLQKQRER++GIPA+P
Sbjct: 6 KKKNFRKRNCYDSEDGGDEANSVTAISEEEEEEHRMALEEVKFLQKQRERRAGIPAVPPV 65
Query: 64 ------------------LQSAAAAGGGGLTKVSEKNE---GDGEKDELVLQDTFAQETA 102
++ + A KNE G+G+KD+LVLQDTFAQETA
Sbjct: 66 SAQTTTAGAGGASGGGGLVRKSTDANSKTGGGGGNKNESAGGEGDKDDLVLQDTFAQETA 125
Query: 103 VMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESST 162
VMVEDPNMLKY+EQELAKKRG+ ++ + ENDLK AEDELYKIPEHLKVK+RNS ESST
Sbjct: 126 VMVEDPNMLKYIEQELAKKRGRTVETVEGAENDLKQAEDELYKIPEHLKVKRRNSNESST 185
Query: 163 QWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 222
QWTTGIAEVQLPIE+KLKNIEETEAAKKLLQEKR +GR+ S+FSIPSSYSADYF RGRDY
Sbjct: 186 QWTTGIAEVQLPIEFKLKNIEETEAAKKLLQEKRFVGRSTSEFSIPSSYSADYFHRGRDY 245
Query: 223 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
AEKLRREHPELYKDR QDDG+GS+P + T+AAG RQAATD+FMLERFRKRERHRVMRR
Sbjct: 246 AEKLRREHPELYKDRSLQDDGSGSKPAETGTEAAGQRQAATDEFMLERFRKRERHRVMRR 305
>gi|217071714|gb|ACJ84217.1| unknown [Medicago truncatula]
Length = 291
Score = 385 bits (990), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/292 (71%), Positives = 236/292 (80%), Gaps = 12/292 (4%)
Query: 1 MENPIPQKKEKKKNFRKRSYEEEEETT-----NKLSDDEEERRLALEEIKFLQKQRERKS 55
MEN ++K ++KN+RKR+ EE + N SDDE ERRLALEEIK LQKQRERKS
Sbjct: 1 MENS-KEEKPRRKNYRKRTPTEEHDQPPQSQQNNDSDDESERRLALEEIKLLQKQRERKS 59
Query: 56 GIPAIPSALQSAAAAGGGGLTKV---SEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLK 112
GIPA + QS G +K ++ G+KD+LVLQDTFAQETAVM EDPNM+K
Sbjct: 60 GIPATLTLQQSQPGISSGLASKAVDKNDAGGDGGDKDDLVLQDTFAQETAVMDEDPNMVK 119
Query: 113 YVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQ 172
Y+ QELAKKRG+NID D+VEN+LK AEDELY IP+HLKVKKRNSEESSTQWTTGIAE+Q
Sbjct: 120 YIGQELAKKRGRNIDEEDQVENELKRAEDELYTIPDHLKVKKRNSEESSTQWTTGIAEIQ 179
Query: 173 LPIEYKLKNIEETEAAKKLLQEKRLM-GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 231
LPIEYKLKNIEETEAAKKLLQEKRLM GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP
Sbjct: 180 LPIEYKLKNIEETEAAKKLLQEKRLMVGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 239
Query: 232 ELYKDRGSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMR 281
ELYKDR QDD + S+ ++S+DA G+ RQAATDQFMLERF+KRERHRV R
Sbjct: 240 ELYKDRSQQDDNSASKQNESSSDAPGAVQRQAATDQFMLERFKKRERHRVRR 291
>gi|388514547|gb|AFK45335.1| unknown [Medicago truncatula]
Length = 291
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 208/292 (71%), Positives = 234/292 (80%), Gaps = 12/292 (4%)
Query: 1 MENPIPQKKEKKKNFRKRSYEEEEETT-----NKLSDDEEERRLALEEIKFLQKQRERKS 55
MEN ++K ++KN+RKR+ EE + N SDDE ERRLALEEIK LQKQRERKS
Sbjct: 1 MENS-KEEKPRRKNYRKRTPTEEHDQPPQSQQNNDSDDESERRLALEEIKLLQKQRERKS 59
Query: 56 GIPAIPSALQSAAAAGGGGLTKV---SEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLK 112
GIPA + QS G +K ++ G+KD+LVLQDTFAQETAVM E PNM+K
Sbjct: 60 GIPATLTLQQSQPGISSGLASKAVDKNDAGGDGGDKDDLVLQDTFAQETAVMDEGPNMVK 119
Query: 113 YVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQ 172
Y+ QELAKKRG+NID D+VEN+LK AEDELY IP+HLKVKKRNSEESSTQWTTGIAE+Q
Sbjct: 120 YIGQELAKKRGRNIDEEDQVENELKRAEDELYTIPDHLKVKKRNSEESSTQWTTGIAEIQ 179
Query: 173 LPIEYKLKNIEETEAAKKLLQEKRLM-GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 231
LPIEYKLKNIEETEAAKKLLQEKRLM GRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP
Sbjct: 180 LPIEYKLKNIEETEAAKKLLQEKRLMVGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHP 239
Query: 232 ELYKDRGSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRERHRVMR 281
ELYKDR QDD S+ ++S+DA G+ RQAATDQFMLERF+KRERHRV R
Sbjct: 240 ELYKDRSQQDDNFASKQNESSSDAPGAVQRQAATDQFMLERFKKRERHRVRR 291
>gi|18378951|ref|NP_563649.1| uncharacterized protein [Arabidopsis thaliana]
gi|332189295|gb|AEE27416.1| uncharacterized protein [Arabidopsis thaliana]
Length = 279
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 207/281 (73%), Positives = 237/281 (84%), Gaps = 15/281 (5%)
Query: 12 KKNFRKRSYEEE--EETTNK--LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+NFRKRS+EEE + NK +S++EE+RRLALEE+KFLQK RERK GIPA+ S QS+
Sbjct: 4 KRNFRKRSFEEEEEDNDVNKAAISEEEEKRRLALEEVKFLQKLRERKLGIPALSSTAQSS 63
Query: 68 AAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G K EK E +GEK+ELVLQDTFAQETAV++EDPNM+KY+EQELAKKRG+NID
Sbjct: 64 I-----GKVKPVEKTETEGEKEELVLQDTFAQETAVLIEDPNMVKYIEQELAKKRGRNID 118
Query: 128 VNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 187
+ VEN+LK EDELYKIP+HLKVKKR+SEESSTQWTTGIAEVQLPIEYKLKNIEETEA
Sbjct: 119 DAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 178
Query: 188 AKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGS-QDDGAGS 246
AKKLLQE+RLMGR KS+FSIPSSYSADYFQRG+DYAEKLRREHPELYKDRG Q DG +
Sbjct: 179 AKKLLQERRLMGRPKSEFSIPSSYSADYFQRGKDYAEKLRREHPELYKDRGGPQADGEAA 238
Query: 247 RP-----TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
+P T+N+ D+ SRQAATDQ MLERFRKRER+RVMRR
Sbjct: 239 KPSTSSSTNNNADSGKSRQAATDQIMLERFRKRERNRVMRR 279
>gi|326512722|dbj|BAK03268.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531604|dbj|BAJ97806.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 272
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 173/276 (62%), Positives = 209/276 (75%), Gaps = 12/276 (4%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
KNFRKRS E + SDDE+ RR+ALEEI+++QK RERK GIPA A +A+AAG
Sbjct: 3 KNFRKRSLESDAADN---SDDEDTRRVALEEIRYMQKLRERKLGIPAASVATGAASAAGA 59
Query: 73 GGLTKVSEKNEGDGE---KDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
+ + +++LVLQDTFAQETAV +EDPNML+YVE EL KKRGK I+VN
Sbjct: 60 TDGSSARGRGGSGAGAAGEEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRGKAIEVN 119
Query: 130 DRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 189
D+ D K DELY +P+HLKV+K+N EESSTQWTTGIAEVQLPIEYKL+NIEETEAAK
Sbjct: 120 DK---DDKDQVDELYVVPDHLKVRKKNMEESSTQWTTGIAEVQLPIEYKLRNIEETEAAK 176
Query: 190 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPT 249
K+LQE+RL G+ KSD +IPSSYSAD+F RGRDYAEKLRREHPELYK + SQ + G +PT
Sbjct: 177 KMLQERRLAGKTKSDANIPSSYSADFFHRGRDYAEKLRREHPELYKGQDSQANETGGKPT 236
Query: 250 DNSTDA---AGSRQAATDQFMLERFRKRERHRVMRR 282
D++ A R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 237 DSNNPGGPPAAHREAATDELLLERFRKREKFRVMRR 272
>gi|115453221|ref|NP_001050211.1| Os03g0374100 [Oryza sativa Japonica Group]
gi|31249717|gb|AAP46210.1| unknown protein [Oryza sativa Japonica Group]
gi|108708408|gb|ABF96203.1| Hepatocellular carcinoma-associated antigen 59 family protein,
expressed [Oryza sativa Japonica Group]
gi|113548682|dbj|BAF12125.1| Os03g0374100 [Oryza sativa Japonica Group]
gi|125586427|gb|EAZ27091.1| hypothetical protein OsJ_11022 [Oryza sativa Japonica Group]
gi|215708801|dbj|BAG94070.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740836|dbj|BAG96992.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 277
Score = 310 bits (794), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 173/281 (61%), Positives = 212/281 (75%), Gaps = 15/281 (5%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG 71
+KNFRKR+ E + + ++ RR+ALEEIK++QK RERK GIPA +A +++AA
Sbjct: 2 RKNFRKRNLEADAAADHSDD--DDARRVALEEIKYMQKLRERKLGIPAAAAAAGASSAAS 59
Query: 72 GGGLT-------KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK 124
G + GD EK++LVLQDTFAQETAV +EDPNML+YVE EL KKRGK
Sbjct: 60 ADGASPRGRGGGGGGLAAGGDAEKEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRGK 119
Query: 125 NIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEE 184
+DV D+ E D DELY +P+HLKV+K+NSEESSTQWTTGIAEVQLPIEYKL+NIEE
Sbjct: 120 KVDVKDKEEKD---QVDELYTVPDHLKVRKKNSEESSTQWTTGIAEVQLPIEYKLRNIEE 176
Query: 185 TEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGA 244
TEAAKK+LQEKRL G+ KSD +IPSSY+AD+F RG+DY EKLRREHPELYKD+GSQ +G
Sbjct: 177 TEAAKKMLQEKRLAGKTKSDANIPSSYNADFFHRGKDYTEKLRREHPELYKDQGSQANGT 236
Query: 245 GSRPT-DNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 282
G + N D AG+ R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 237 GGKSMGGNHPDGAGAGRREAATDELLLERFRKREKFRVMRR 277
>gi|125544064|gb|EAY90203.1| hypothetical protein OsI_11769 [Oryza sativa Indica Group]
Length = 278
Score = 308 bits (790), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 173/282 (61%), Positives = 213/282 (75%), Gaps = 16/282 (5%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG 71
+KNFRKR+ E ++ + ++ RR+ALEEIK++QK RERK GIPA +A +++AA
Sbjct: 2 RKNFRKRNLEADDAADHSDD--DDARRVALEEIKYMQKLRERKLGIPAAAAAAGASSAAS 59
Query: 72 GGGLT--------KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG 123
G + GD EK++LVLQDTFAQETAV +EDPNML+YVE EL KKRG
Sbjct: 60 ADGASPRGRGGGGGGGLAAGGDAEKEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRG 119
Query: 124 KNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIE 183
K +DV D+ E D DELY +P+HLKV+K+NSEESSTQWTTGIAEVQLPIEYKL+NIE
Sbjct: 120 KKVDVKDKEEKD---QVDELYIVPDHLKVRKKNSEESSTQWTTGIAEVQLPIEYKLRNIE 176
Query: 184 ETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG 243
ETEAAKK+LQEKRL G+ KSD +IPSSY+AD+F RG+DY EKLRREHPELYKD+GSQ +G
Sbjct: 177 ETEAAKKMLQEKRLAGKTKSDANIPSSYNADFFHRGKDYTEKLRREHPELYKDQGSQANG 236
Query: 244 AGSRPT-DNSTDAAGS--RQAATDQFMLERFRKRERHRVMRR 282
G + N D AG+ R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 237 TGGKSMGGNHPDGAGAGRREAATDELLLERFRKREKFRVMRR 278
>gi|357112143|ref|XP_003557869.1| PREDICTED: uncharacterized protein LOC100821850 [Brachypodium
distachyon]
Length = 273
Score = 303 bits (776), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 178/278 (64%), Positives = 209/278 (75%), Gaps = 13/278 (4%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG 71
+KNFRKR+ E + T SDDE+ RR+ALEEIK++QK RERK GIPA A +AA
Sbjct: 2 QKNFRKRNLEPD---TADHSDDEDVRRVALEEIKYMQKLRERKLGIPAASVATGAAATTT 58
Query: 72 GGGLTKVSEKNEG----DGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G + + +K++LVLQDTFAQETAV +EDPNML+YVE EL KKRGK I+
Sbjct: 59 DGSSARGRGGGGAAAASETDKEDLVLQDTFAQETAVTIEDPNMLRYVENELLKKRGKTIE 118
Query: 128 VNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEA 187
VND+ E K DELY +P+HLKVKK+N EESSTQWTTGIAEVQLPIEYKL+NIEETEA
Sbjct: 119 VNDKDE---KDDVDELYVVPDHLKVKKKNMEESSTQWTTGIAEVQLPIEYKLRNIEETEA 175
Query: 188 AKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR 247
AKKLLQEKRL G+ KSD +IPSSYSADYF RGRDYAEKLRREHPELYK + Q + G +
Sbjct: 176 AKKLLQEKRLAGKTKSDANIPSSYSADYFHRGRDYAEKLRREHPELYKGQDLQANETGGK 235
Query: 248 PT-DNSTDA--AGSRQAATDQFMLERFRKRERHRVMRR 282
PT N+ D A R+AATD+ +LERFRKRE+ RVMRR
Sbjct: 236 PTGSNNPDGPPARRREAATDELLLERFRKREKFRVMRR 273
>gi|148907301|gb|ABR16788.1| unknown [Picea sitchensis]
Length = 272
Score = 298 bits (762), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 167/250 (66%), Positives = 193/250 (77%), Gaps = 6/250 (2%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPA--IPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVL 93
+RRLALEE+KFLQKQRERK+GI A I + S + K EG+GEK+ELVL
Sbjct: 26 QRRLALEELKFLQKQRERKAGIAANEISEVVVSKIGDNNSNNNNNNNKAEGEGEKEELVL 85
Query: 94 QDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK 153
QDTFAQETAV VEDPNMLKYVEQELAKKRGK I N + K ED+LY +P+HLKV+
Sbjct: 86 QDTFAQETAVTVEDPNMLKYVEQELAKKRGKQIGKNT---EETKPPEDDLYVVPDHLKVR 142
Query: 154 KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR-LMGRAKSDFSIPSSYS 212
+RNSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAKK LQ+KR +GR + SIPSSYS
Sbjct: 143 RRNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAKKQLQDKRPFVGRGRPQSSIPSSYS 202
Query: 213 ADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFR 272
ADYFQRGR+YAEKLRR+HPELYKD+ +Q+ G+ S + RQAATD+ MLERFR
Sbjct: 203 ADYFQRGREYAEKLRRDHPELYKDKDAQNSGSISGEIAPEGNVGNRRQAATDEIMLERFR 262
Query: 273 KRERHRVMRR 282
KRER R+MRR
Sbjct: 263 KRERSRLMRR 272
>gi|226493466|ref|NP_001148850.1| LOC100282469 [Zea mays]
gi|194701872|gb|ACF85020.1| unknown [Zea mays]
gi|195622612|gb|ACG33136.1| hepatocellular carcinoma-associated antigen 59 family protein [Zea
mays]
gi|414866976|tpg|DAA45533.1| TPA: Hepatocellular carcinoma-associated antigen 59 family [Zea
mays]
Length = 269
Score = 297 bits (761), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 168/274 (61%), Positives = 203/274 (74%), Gaps = 11/274 (4%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
+NFRKR E++ + + DE+ RR+ALEEIK++QK RERK GIPA +A S +
Sbjct: 3 RNFRKRGIEQDTDDRSD---DEDTRRIALEEIKYMQKLRERKLGIPADLAA-ASTNGSSA 58
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
GL G+ EK++LVLQDTFAQETAV +EDPNML+YVE ELAKKRGK +DV +
Sbjct: 59 RGLLGTGAAVAGEAEKEDLVLQDTFAQETAVTIEDPNMLRYVETELAKKRGKMVDVGHKE 118
Query: 133 ENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 192
E D H DELY +P+HLKVKK+NSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAKKLL
Sbjct: 119 EMD--HV-DELYTVPDHLKVKKKNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAKKLL 175
Query: 193 QEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG-AGSRPTDN 251
QEKRL + K D +IPSSYSADYF RG++Y EKLRRE+P LYKD S+ G G + TD
Sbjct: 176 QEKRLARKPKPDANIPSSYSADYFHRGKEYDEKLRRENPGLYKDNDSRPSGNPGGKATDT 235
Query: 252 STD---AAGSRQAATDQFMLERFRKRERHRVMRR 282
AG R+AA+D+ ML+RFRKRE+ R +RR
Sbjct: 236 KNPDGVGAGRREAASDELMLQRFRKREKFRALRR 269
>gi|242035643|ref|XP_002465216.1| hypothetical protein SORBIDRAFT_01g034230 [Sorghum bicolor]
gi|241919070|gb|EER92214.1| hypothetical protein SORBIDRAFT_01g034230 [Sorghum bicolor]
Length = 270
Score = 296 bits (758), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 169/274 (61%), Positives = 206/274 (75%), Gaps = 10/274 (3%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
+NFRKR E + + + DE+ RR+ALEEIK++QK RERK GIPA +A + ++
Sbjct: 3 RNFRKRGIEPDTDDRSD---DEDTRRVALEEIKYMQKLRERKLGIPAGTAAASTNGSSAR 59
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
GG G+ EK++LVLQDTFAQETAV +EDPNML+YVE ELAKKRGK +DV +
Sbjct: 60 GGRVGSGAAAAGEAEKEDLVLQDTFAQETAVTIEDPNMLRYVETELAKKRGKMVDVGHKE 119
Query: 133 ENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 192
E D H DELY +P+HLKVKK+NSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAKK+L
Sbjct: 120 EMD--HV-DELYTVPDHLKVKKKNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAKKVL 176
Query: 193 QEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG-AGSRPTDN 251
QEKRL + KSD +IPSSYSADYF RG++Y EKLRRE+P LYKD S+ G +G + TD
Sbjct: 177 QEKRLASKPKSDANIPSSYSADYFHRGKEYDEKLRRENPGLYKDNDSRPRGSSGGKATDT 236
Query: 252 STD---AAGSRQAATDQFMLERFRKRERHRVMRR 282
AG R+AA+D+FMLERFRKRE+ R +RR
Sbjct: 237 KNPGGVGAGRREAASDEFMLERFRKREKFRALRR 270
>gi|168003604|ref|XP_001754502.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694123|gb|EDQ80472.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 267
Score = 293 bits (750), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 171/274 (62%), Positives = 205/274 (74%), Gaps = 16/274 (5%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG 72
K FRKR+ E E SDD+EE R LEE+KFLQKQRER++G+ A + G
Sbjct: 6 KRFRKRNAPEAGEQ----SDDDEEIRSTLEEVKFLQKQRERRNGVVA-----NQLGQSLG 56
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
G KV++K EG+GEK+E VLQDTFAQETAV +EDPNMLKY+EQE+AKKRG+ + V V
Sbjct: 57 GLNPKVADKGEGEGEKEEQVLQDTFAQETAVTIEDPNMLKYIEQEMAKKRGRELGV---V 113
Query: 133 ENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 192
E + K ED+LY IPEHLKV++RN+EESSTQWTTGIAEVQLPIEYKLKNIEETEAAKK L
Sbjct: 114 EEESKPPEDDLYVIPEHLKVRRRNAEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKQL 173
Query: 193 QEKR-LMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDN 251
Q+KR +GR ++ SIP+SYSADYFQRGR+YAEKLR EHPE +KD+G P +
Sbjct: 174 QDKRPFVGRGRTQSSIPASYSADYFQRGREYAEKLRSEHPEPFKDKGRGGGAGRGDPIGS 233
Query: 252 ST---DAAGSRQAATDQFMLERFRKRERHRVMRR 282
++ D RQAATD+ MLERFRKRER R+MRR
Sbjct: 234 NSEKLDLGNRRQAATDEIMLERFRKRERSRLMRR 267
>gi|13937157|gb|AAK50072.1|AF372932_1 At1g02330/T6A9_12 [Arabidopsis thaliana]
gi|22137212|gb|AAM91451.1| At1g02330/T6A9_12 [Arabidopsis thaliana]
Length = 179
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 143/179 (79%), Positives = 159/179 (88%), Gaps = 6/179 (3%)
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIA 169
M+KY+EQELAKKRG+NID + VEN+LK EDELYKIP+HLKVKKR+SEESSTQWTTGIA
Sbjct: 1 MVKYIEQELAKKRGRNIDDAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIA 60
Query: 170 EVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRRE 229
EVQLPIEYKLKNIEETEAAKKLLQE+RLMGR KS+FSIPSSYSADYFQRG+DYAEKLRRE
Sbjct: 61 EVQLPIEYKLKNIEETEAAKKLLQERRLMGRPKSEFSIPSSYSADYFQRGKDYAEKLRRE 120
Query: 230 HPELYKDRGS-QDDGAGSRP-----TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
HPELYKDRG Q DG ++P T+N+ D+ SRQAATDQ MLERFRKRER+RVMRR
Sbjct: 121 HPELYKDRGGPQADGEAAKPSTSSSTNNNADSGKSRQAATDQIMLERFRKRERNRVMRR 179
>gi|388490564|gb|AFK33348.1| unknown [Lotus japonicus]
Length = 228
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/198 (69%), Positives = 157/198 (79%), Gaps = 8/198 (4%)
Query: 8 KKEKKKNFRKRSYEEEEET------TNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
K+++KKN+RKRS E++ N SDDE ERR+ALEEIK LQKQRERKSGI A P
Sbjct: 5 KQQRKKNYRKRSAPVEQDQLPQSQDNNNESDDERERRMALEEIKLLQKQRERKSGIAANP 64
Query: 62 SALQSAAAAGGGGLTKVSEKN-EGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
S LQS A G K +EKN G+KD+LVLQDTFAQETAVM EDPNM+KYVEQELAK
Sbjct: 65 S-LQSQAVVTAGSAAKPAEKNDGDGGDKDDLVLQDTFAQETAVMDEDPNMVKYVEQELAK 123
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLK 180
KRG+ ID D++EN+LK AEDELYKIPEHLKVKKRNSEESSTQWTTGIAE+QLPIEYKLK
Sbjct: 124 KRGRKIDEADQIENELKRAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEIQLPIEYKLK 183
Query: 181 NIEETEAAKKLLQEKRLM 198
NIEETEAAK +++ L
Sbjct: 184 NIEETEAAKNFYRKRGLW 201
>gi|168015241|ref|XP_001760159.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688539|gb|EDQ74915.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 247
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 144/241 (59%), Positives = 171/241 (70%), Gaps = 14/241 (5%)
Query: 33 DEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELV 92
D E R LEE+KFLQKQRER +G+ A GG + V+EK EG+GE +E V
Sbjct: 1 DGEFCRSTLEEVKFLQKQRERSNGVVA-----NQLGQPAGGANSNVAEKGEGEGENEEQV 55
Query: 93 LQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKV 152
LQDTFAQETAV +EDPNMLKY+EQE+AKKRG+ V ++K E +LY IPEHLKV
Sbjct: 56 LQDTFAQETAVTIEDPNMLKYIEQEMAKKRGRE---TSEVGEEVKPPEVDLYVIPEHLKV 112
Query: 153 KKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR-LMGRAKSDFSIPSSY 211
+KRN EESSTQWTTGIAEVQLP+EYKLKNIEETEAAKK LQ KR +GR +S SIP+SY
Sbjct: 113 RKRNGEESSTQWTTGIAEVQLPVEYKLKNIEETEAAKKQLQGKRPFVGRGRSQSSIPASY 172
Query: 212 SADYFQRGRDYAEKLRREHPELYKDR--GSQDDGAGSRPTDNST---DAAGSRQAATDQF 266
+ADYFQRGR+YAEKLR +HPE Y+D+ G PT + + D RQAATD+
Sbjct: 173 NADYFQRGREYAEKLRSDHPEGYRDKGRGEGRGRGRGGPTGSKSETYDVRNRRQAATDEI 232
Query: 267 M 267
M
Sbjct: 233 M 233
>gi|9857529|gb|AAG00884.1|AC064879_2 Hypothetical protein [Arabidopsis thaliana]
Length = 178
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 120/170 (70%), Positives = 141/170 (82%), Gaps = 9/170 (5%)
Query: 12 KKNFRKRSYEEE--EETTNK--LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+NFRKRS+EEE + NK +S++EE+RRLALEE+KFLQK RERK GIPA+ S QS+
Sbjct: 4 KRNFRKRSFEEEEEDNDVNKAAISEEEEKRRLALEEVKFLQKLRERKLGIPALSSTAQSS 63
Query: 68 AAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID 127
G K EK E +GEK+ELVLQDTFAQETAV++EDPNM+KY+EQELAKKRG+NID
Sbjct: 64 I-----GKVKPVEKTETEGEKEELVLQDTFAQETAVLIEDPNMVKYIEQELAKKRGRNID 118
Query: 128 VNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEY 177
+ VEN+LK EDELYKIP+HLKVKKR+SEESSTQWTTGIAEVQLPIEY
Sbjct: 119 DAEEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIAEVQLPIEY 168
>gi|147765932|emb|CAN62422.1| hypothetical protein VITISV_020607 [Vitis vinifera]
Length = 128
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/124 (71%), Positives = 100/124 (80%), Gaps = 14/124 (11%)
Query: 173 LPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRR---- 228
L + YKL+NIEETEAAKKLLQ+KRLMGR K++F+IPSSYSADYFQRGRDYAEKLRR
Sbjct: 5 LSVWYKLRNIEETEAAKKLLQDKRLMGRTKTEFNIPSSYSADYFQRGRDYAEKLRRECHF 64
Query: 229 ----------EHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHR 278
EHPELYKD+G QD+G GSR D ST+ AG RQAATD+FML+RFRKRERHR
Sbjct: 65 LLLTRYEIFAEHPELYKDKGVQDNGGGSRLPDASTEVAGRRQAATDEFMLDRFRKRERHR 124
Query: 279 VMRR 282
VMRR
Sbjct: 125 VMRR 128
>gi|302753264|ref|XP_002960056.1| hypothetical protein SELMODRAFT_75683 [Selaginella moellendorffii]
gi|302804660|ref|XP_002984082.1| hypothetical protein SELMODRAFT_119432 [Selaginella moellendorffii]
gi|300148434|gb|EFJ15094.1| hypothetical protein SELMODRAFT_119432 [Selaginella moellendorffii]
gi|300170995|gb|EFJ37595.1| hypothetical protein SELMODRAFT_75683 [Selaginella moellendorffii]
Length = 132
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/139 (68%), Positives = 109/139 (78%), Gaps = 7/139 (5%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTF 97
RLALEE+K LQKQR R+ G+ A P AA+ G K S+K E +GEK+ELVLQDTF
Sbjct: 1 RLALEEVKLLQKQRGRRCGVMANP-----VAASPGLDRVKSSDKVEVEGEKEELVLQDTF 55
Query: 98 AQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNS 157
AQETAV VEDPNMLKYVEQELAKKRG+ + V+ D K AED+LY IP+HLKV+KRNS
Sbjct: 56 AQETAVNVEDPNMLKYVEQELAKKRGRQ-ESGGTVDAD-KPAEDDLYVIPDHLKVRKRNS 113
Query: 158 EESSTQWTTGIAEVQLPIE 176
EESSTQWTTGIAEVQLP+E
Sbjct: 114 EESSTQWTTGIAEVQLPLE 132
>gi|395540548|ref|XP_003772215.1| PREDICTED: uncharacterized protein C9orf78 homolog [Sarcophilus
harrisii]
Length = 288
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/289 (33%), Positives = 147/289 (50%), Gaps = 32/289 (11%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP----------- 61
K FR+R + E E+ ++ D EE RL LEE K +Q R R +G+ A+
Sbjct: 5 KTFRRRRADSESESDDQ---DSEEVRLKLEETKEVQSLRRRPNGVSAVALLVGEKVQEET 61
Query: 62 SALQSAAAAGGGGLT---KVSEKNEGD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQE 117
+ + GG+ K+ E+N+ E+++L L +F+ ET ED +M+KY+E E
Sbjct: 62 ALVDDPFQVKTGGMVDMKKLKERNKDRISEEEDLNLGTSFSAETNRRDEDADMMKYIETE 121
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPI 175
L K++G I N+ + LK+AED LY++PE ++V K+ E S Q +GI EV L I
Sbjct: 122 LKKRKG--IVENEEQKVKLKNAEDCLYELPESIRVSSAKKTEEMLSNQMLSGIPEVDLGI 179
Query: 176 EYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLR---REHP 231
E K+KNI TE AK +LL E+R + +P++ + +Y Q R Y E+L R H
Sbjct: 180 EAKIKNIISTEDAKARLLAEQRNKKKDSETSFVPTNMAVNYVQHNRFYHEELHAPVRRHK 239
Query: 232 ELYKDR----GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
E K R G + R N + ATD + E+F+K R
Sbjct: 240 EEPKTRPLRVGDTEKPEAERSPPNRKRPPNEK--ATDDYHYEKFKKMNR 286
>gi|126277132|ref|XP_001372334.1| PREDICTED: uncharacterized protein C9orf78-like [Monodelphis
domestica]
Length = 288
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 97/288 (33%), Positives = 147/288 (51%), Gaps = 30/288 (10%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA---- 68
K FRKR + E E+ + D EE RL LEE K +Q R R +G+ A+ +
Sbjct: 5 KTFRKRRDDSESESDEQ---DSEEVRLKLEETKEVQSLRRRPNGVSAVALLVGEKVQEET 61
Query: 69 ----------AAGGGGLTKVSEKNEGD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQE 117
A G + K+ E+N+ E+++L L +F+ ET ED +M+KY E E
Sbjct: 62 TLVDDPFKIKAGGMVDMKKLKERNKDRINEEEDLNLGTSFSAETNRRDEDADMMKYFETE 121
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPI 175
L K++G I N+ + LK+AED LY++PE+++V K+ E S Q +GI EV L I
Sbjct: 122 LKKRKG--IVENEEQKVKLKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGI 179
Query: 176 EYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLR---REHP 231
+ K+KNI TE AK +LL E++ + +P++ + +Y Q R Y E+L R H
Sbjct: 180 DAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRHK 239
Query: 232 ELYKDRGSQDDGAGSRPTDNSTDAAGSR---QAATDQFMLERFRKRER 276
E K R + G +P + R + ATD + E+F+K R
Sbjct: 240 EEPKTRPLR-VGDTEKPEPERSPPNRKRPPNEKATDDYHYEKFKKMNR 286
>gi|156401402|ref|XP_001639280.1| predicted protein [Nematostella vectensis]
gi|156226407|gb|EDO47217.1| predicted protein [Nematostella vectensis]
Length = 285
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 153/297 (51%), Gaps = 52/297 (17%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRL-ALEEIKFLQKQRERKSGIPAIPSALQSAA-- 68
K+N+R++ E+E DDE E + ALEE + +QK R+R G+ A+ AL
Sbjct: 3 KRNYRRKRITEDE-------DDEAEVAIEALEERREIQKFRKRPKGVSAVGLALGKKVDI 55
Query: 69 ---------AAGGGGLTKVSE-------KNEGDGEKDELVLQDTFAQETAVMVEDPNMLK 112
GGL ++++ EG+ + L + FA ET ED +MLK
Sbjct: 56 EDEVESDPFKLKTGGLVQINDLIQDRERDREGEDSGKSINLGENFAAETNRREEDTHMLK 115
Query: 113 YVEQELAKKRGK---NIDVNDRVENDLKHAEDELYKIPEHLKVKKR--NSEES-STQWTT 166
Y+E+E++K++G+ D+N +V + K ED L+++P+H+ V+ R SEE S Q +
Sbjct: 116 YIEEEISKRKGQAESGEDIN-KVRDKFKTKEDLLFQVPKHIDVRSRLMKSEEMLSNQMLS 174
Query: 167 GIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR----- 220
GI EV L I K++NIE TE AK K+++E+R + +P++ ++++ R
Sbjct: 175 GIPEVDLGISAKIRNIEATEEAKMKVIEEQRSKRKHGPTEMVPTNMASNFMLHSRFMDEQ 234
Query: 221 DYAEKLRREHPELYKDRGSQDDG-AGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
AE RR+ L R ++DDG A +P + ATD F E+FRKR R
Sbjct: 235 KNAEAERRKTATL---RATKDDGKAKPQPV---------VEKATDDFYYEKFRKRAR 279
>gi|395506250|ref|XP_003757448.1| PREDICTED: uncharacterized protein C9orf78 homolog [Sarcophilus
harrisii]
Length = 287
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 135/263 (51%), Gaps = 27/263 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE K +Q R R G+ A+ + + GG+ K+ E+N+
Sbjct: 26 RLKLEETKEVQSLRRRPKGVSAVALLVGEKVQEETTLVDDPFNINTGGMVDMKKIKERNK 85
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ ++ LK+AED
Sbjct: 86 DRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IMENEELKVKLKNAEDC 143
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE ++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 144 LYELPESIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 203
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAA 256
+ +P++ + +Y Q R Y E+L R H E K R + G +P +
Sbjct: 204 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRHKEEPKTRPLR-VGDTEKPEPEKSPPN 262
Query: 257 GSR---QAATDQFMLERFRKRER 276
R + ATD + E+F+K R
Sbjct: 263 RKRPPNEKATDDYHYEKFKKMNR 285
>gi|224073514|ref|XP_002198575.1| PREDICTED: uncharacterized protein C9orf78 homolog [Taeniopygia
guttata]
Length = 289
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 91/266 (34%), Positives = 137/266 (51%), Gaps = 27/266 (10%)
Query: 35 EERRLALEEIKFLQKQRERKSGIPAIP----SALQSAAA-------AGGGGLTKVSEKNE 83
EE RL LEE K +Q R+R +G+ A+ LQ A GG+ + + E
Sbjct: 25 EEVRLKLEEAKEVQSLRKRPNGVSAVALLVGEKLQEEATLVDDPFKIKSGGMVDMKKLKE 84
Query: 84 GD----GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+A
Sbjct: 85 RGKDRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNA 142
Query: 140 EDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 196
ED LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK KLL E++
Sbjct: 143 EDSLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKAKLLAEQQ 202
Query: 197 LMGRAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNST 253
+ +P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 203 NKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERS 261
Query: 254 DAAGSR---QAATDQFMLERFRKRER 276
R + ATD + E+F+K R
Sbjct: 262 PPNRKRPLNEKATDDYHYEKFKKMNR 287
>gi|260797455|ref|XP_002593718.1| hypothetical protein BRAFLDRAFT_63995 [Branchiostoma floridae]
gi|229278946|gb|EEN49729.1| hypothetical protein BRAFLDRAFT_63995 [Branchiostoma floridae]
Length = 291
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 147/297 (49%), Gaps = 38/297 (12%)
Query: 10 EKKKNFRKR--SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
+K+KNFR+R S +EE+E+ ++S LEE K QK R+R G+ A AL
Sbjct: 4 QKRKNFRRRRDSSDEEDESVQEVSS-------ILEEAKEAQKFRQRPKGVSATALALGKK 56
Query: 68 AAAGG-----------GGLT---KVSEKN-EGDGEKDELVLQD---TFAQETAVMVEDPN 109
+ GG+ + ++N + GE+D+ L D +F+ ET E
Sbjct: 57 LSGNAALVNDPFKLRTGGMVDMKAIKDRNRDRTGEEDDKDLSDLGTSFSAETNTRDEHAE 116
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--VKKRNSEESSTQWTTG 167
M+KY+E E+ K++G+ + + + +K AED LY++P+ LK R+ E S Q +G
Sbjct: 117 MMKYIEVEMKKRKGQEKE-KEASQAKIKGAEDLLYELPDRLKAATSTRSEEMLSNQMLSG 175
Query: 168 IAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEK-- 225
I EV L I+ K++NIE TE AK+ LQE+ R K +P + + +Y Q R Y E
Sbjct: 176 IPEVDLGIQEKIRNIEATEDAKQRLQEQMRKKRDKGTSFVPVNMAVNYVQHNRFYREDTE 235
Query: 226 ----LRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERHR 278
+++E P+ + + T + G R ATD F E+F+K+ R
Sbjct: 236 TKKVVKQEAPKPRPLKVGDTEPPIMEETSQTKKRPGER--ATDDFHFEKFKKQMTRR 290
>gi|195427179|ref|XP_002061656.1| GK17111 [Drosophila willistoni]
gi|194157741|gb|EDW72642.1| GK17111 [Drosophila willistoni]
Length = 296
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 147/303 (48%), Gaps = 53/303 (17%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL 64
I KK+ +KN R+R + DD++E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 14 IVFKKKSRKNLRQRKNSD---------DDDKEEQLTLDEIKERQRLRQRPNGVSLVGLAL 64
Query: 65 QSAAA------------AGGGGLT-----KVSEKNEGDGEKDELVLQDTFAQETAVMVED 107
A GGL K + E D D + + F+ ET ED
Sbjct: 65 GKKIAPEEELAIKDPFNVKTGGLVNMQTLKSGKMKEADDAYD-VGIGTQFSAETNKRDED 123
Query: 108 PNMLKYVEQELAKKRGKNIDVNDRVEND-------LKHAEDELYKIPEHLK--VKKRNSE 158
M+KY+EQEL K++G D + +ND L + LY +P+HL+ R+ E
Sbjct: 124 EEMMKYIEQELQKRKGGATDADTGGDNDDSDAHKYLTPEDAALYALPDHLRQSSSHRSEE 183
Query: 159 ESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQ 217
S Q GI EV L I+ K++NIE TE AK KLLQ+ + S F +P++ + ++ Q
Sbjct: 184 MLSNQMLNGIPEVDLGIQAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQ 242
Query: 218 RGR----DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRK 273
R D E+ RR+ E KD G++ + T+ G ++ ATD + ++FRK
Sbjct: 243 HNRFNIEDNNEQRRRKREE--KD--------GNKAAHHQTNPNGVKR-ATDDYHYDKFRK 291
Query: 274 RER 276
+ R
Sbjct: 292 QFR 294
>gi|229365864|gb|ACQ57912.1| C9orf78 [Anoplopoma fimbria]
Length = 292
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/300 (32%), Positives = 150/300 (50%), Gaps = 50/300 (16%)
Query: 13 KNFRKR---SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAI--------- 60
KNFR+R S EE+ETT ++ R LEE K LQ R+R++G+
Sbjct: 5 KNFRRRRDSSDVEEDETTIEV-------RSKLEEAKELQSLRKRQTGVSVTALLVGEKLP 57
Query: 61 --------PSALQSAAAAGGGGLTKVSEKNEGDGEKD-ELVLQDTFAQETAVMVEDPNML 111
P L++ G + K ++N E++ +L L +F+ ET ED +M+
Sbjct: 58 PEDEIDNDPFKLKTG---GVVDMKKAKDRNRDMTEEETDLNLGTSFSAETNRRDEDADMM 114
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIA 169
KY+E EL KK+G +V+ +K+AED LY++PE ++V K+ E S Q +GI
Sbjct: 115 KYIETELKKKKGLVEAEEQKVK--VKNAEDHLYELPESIRVNSAKKTEEMLSNQMLSGIP 172
Query: 170 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL-- 226
EV L I+ K+KNI +TE AK KLL E+R + + +P++ + +Y Q R Y E +
Sbjct: 173 EVDLGIDAKIKNIIQTEDAKAKLLAEQRNKKKDQGTSFVPTNIAVNYVQHSRFYREDVNA 232
Query: 227 ------RREHPELYKDRGSQDDGAGSRPTDNSTDAA----GSRQAATDQFMLERFRKRER 276
RE P+ R + G P + +T A + + ATD + E+F+K R
Sbjct: 233 PQRHHRHREEPKARPLRVGDTEKPG--PEEVTTPANFRKRPNNEKATDDYHYEKFKKMNR 290
>gi|147904475|ref|NP_001087905.1| chromosome 9 open reading frame 78 [Xenopus laevis]
gi|51950077|gb|AAH82454.1| MGC84248 protein [Xenopus laevis]
Length = 290
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 147/292 (50%), Gaps = 36/292 (12%)
Query: 13 KNFRKRSYEEEEETTNKLSDDE---EERRLALEEIKFLQKQRERKSGIPA--------IP 61
+NFR+R +E +DE E R+ LEE K +Q R+R++G+ A +P
Sbjct: 5 RNFRRRKASSSDEEV----EDEGVTREVRMKLEEAKEVQSLRKRQNGVSAAALLVGEKLP 60
Query: 62 SALQSA----AAAGGGGLTKVSEKNEGD---GEKDELVLQDTFAQETAVMVEDPNMLKYV 114
+ A GG + K+ G GE+++L L +F+ ET ED +M+KY+
Sbjct: 61 EEVNMADDPFKMQNGGMVDMKKLKDRGKDRIGEEEDLNLGTSFSAETNRRDEDADMMKYI 120
Query: 115 EQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQ 172
E EL K++G I N+ + K AED LY++PE +KV K+ E S Q +GI EV
Sbjct: 121 ETELKKRKG--IVENEEKKVKPKSAEDCLYELPESIKVSSAKKTEEMLSNQMLSGIPEVD 178
Query: 173 LPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAE----KLR 227
L I+ K+KNI TE AK +LL E++ + K +P++ + +Y Q R Y E +R
Sbjct: 179 LGIDAKIKNIISTEEAKARLLAEQQNKKKDKHTSFVPTNMAVNYVQHNRFYQEDQNTPMR 238
Query: 228 REHPELYKDRGSQDDGAGSRPTDNSTDAA---GSRQAATDQFMLERFRKRER 276
R H E K R + G +P + S + ATD + E+F+K R
Sbjct: 239 R-HKEEPKPRPLR-VGDTEKPEPEKSPPNRKRPSNEKATDDYHYEKFKKMNR 288
>gi|326930340|ref|XP_003211305.1| PREDICTED: uncharacterized protein C9orf78-like [Meleagris
gallopavo]
Length = 289
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 135/263 (51%), Gaps = 27/263 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP----SALQSAAA-------AGGGGLTKVSEKNEGD- 85
RL LEE K +Q R+R +G+ A+ LQ A GG+ + + E
Sbjct: 28 RLKLEEAKEVQSLRKRPNGVSAVALLVGEKLQEEATLVDDPFKIKSGGMVDMKKLKERGK 87
Query: 86 ---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+AED
Sbjct: 88 DRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNAEDS 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK KLL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKAKLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAA 256
+ +P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERSPPN 264
Query: 257 GSR---QAATDQFMLERFRKRER 276
R + ATD + E+F+K R
Sbjct: 265 RKRPPNEKATDDYHYEKFKKMNR 287
>gi|410925759|ref|XP_003976347.1| PREDICTED: uncharacterized protein C9orf78 homolog [Takifugu
rubripes]
Length = 291
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 150/297 (50%), Gaps = 45/297 (15%)
Query: 13 KNFRKR--SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAI---------- 60
KN R+R S ++EE +D EE R +EE K LQ R+R++G+
Sbjct: 5 KNLRRRRDSSDDEE------NDIAEELRSKVEEAKELQSLRKRQTGVSLTALLVGEKLPP 58
Query: 61 -------PSALQSAAAAGGGGLTKVSEKNEGDGEKDE--LVLQDTFAQETAVMVEDPNML 111
P L++ G + KV ++N D +DE L L +F+ ET ED +M+
Sbjct: 59 DAEIDNDPFKLKTG---GVVDMKKVKDRNR-DMTEDETDLNLGTSFSVETNRRDEDADMM 114
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIA 169
KY+E EL K++G+ +V+ +K+AED LY++PE+++V K+ E S Q +GI
Sbjct: 115 KYIETELKKRKGQVEAEEQKVK--VKNAEDHLYELPENIRVNSAKKTEEMLSNQMLSGIP 172
Query: 170 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL-- 226
EV L I+ K+KNI TE AK +LL E+R + + +P++ + +Y Q R Y E +
Sbjct: 173 EVDLGIDAKIKNIINTEEAKARLLAEQRNKKKDQGTSFVPTNIAVNYVQHNRFYHEDMNA 232
Query: 227 ------RREHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
RE P++ R G + P+ + + + ATD + E+F+K R
Sbjct: 233 PQRHHRHREEPKVRPLRVGDTEKPGPEAPSPPNYRKRPNNEKATDDYHYEKFKKMNR 289
>gi|390353053|ref|XP_001177304.2| PREDICTED: uncharacterized protein C9orf78-like [Strongylocentrotus
purpuratus]
Length = 244
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/244 (34%), Positives = 124/244 (50%), Gaps = 42/244 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
K +KK++ R+R + E + ++D R LEE K QK RER G+ A
Sbjct: 5 KAKKKRSIRQRKTSSDSEDDGQSNED---IRNILEETKEAQKFRERPHGVSA-------T 54
Query: 68 AAAGGGGLTKVSEKNEGD-------------------------GEKDELVLQDTFAQETA 102
A G +TKV E N+ D ++D + TFA ET
Sbjct: 55 ALLTGKKMTKVEEMNDDDPFNLKVGGMLSLKEIKDRNRDRSDESDRDVANMGSTFAVETN 114
Query: 103 VMVEDPNMLKYVEQELAKKRGKNIDV-NDRVENDLKH--AEDELYKIPEHLKVKKRNSEE 159
ED M+KY+E E+ KK+G ++D +D + KH ED+LY++P++LKV+ + S E
Sbjct: 115 QRDEDAEMMKYIEIEMNKKKGLDLDKESDPTKEGAKHKTPEDKLYELPDNLKVEAQKSSE 174
Query: 160 S--STQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYF 216
S Q +GI EV L IE K+KNIE TE AK K L+E+R + + F +P++ + +Y
Sbjct: 175 EMLSNQMLSGIPEVDLGIEAKIKNIEATEDAKQKHLEERRNKKKNTTSF-VPANMAVNYV 233
Query: 217 QRGR 220
Q R
Sbjct: 234 QHSR 237
>gi|50757325|ref|XP_415471.1| PREDICTED: uncharacterized protein C9orf78 [Gallus gallus]
Length = 289
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 135/263 (51%), Gaps = 27/263 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPA----IPSALQSAAA-------AGGGGLTKVSEKNEGD- 85
RL LEE K +Q R+R +G+ A + LQ A GG+ + + E
Sbjct: 28 RLKLEEAKEVQSLRKRPNGVSAAALLVGEKLQEEATLVDDPFKIKSGGMVDMKKLKERGK 87
Query: 86 ---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+AED
Sbjct: 88 DRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNAEDS 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK KLL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKAKLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDAA 256
+ +P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERSPPN 264
Query: 257 GSR---QAATDQFMLERFRKRER 276
R + ATD + E+F+K R
Sbjct: 265 RKRPPNEKATDDYHYEKFKKMNR 287
>gi|241105597|ref|XP_002410015.1| conserved hypothetical protein [Ixodes scapularis]
gi|215492857|gb|EEC02498.1| conserved hypothetical protein [Ixodes scapularis]
Length = 249
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 123/264 (46%), Gaps = 45/264 (17%)
Query: 41 LEEIKFLQKQRERKSGIPAIPS------------ALQSAAAAGGGGLTKVSE---KNEGD 85
LE+ K +QK R+R +G+ I ++ GG+ + K
Sbjct: 1 LEDTKEIQKLRKRPNGVSVIGLNLGKKLTTKEELVIEDPFKLKTGGMIDMKALKGKRITM 60
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E D + L +TF+ ET ED +M+KY+E+ELAK+RGK D ++ ED L+
Sbjct: 61 EELDAVNLGNTFSVETNQRDEDADMMKYIEEELAKRRGKGQDTETDSRDEGVDPEDVLFH 120
Query: 146 IPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKS 203
+PEHL+ K++ E S Q +GI EV L IE +++NIE TE AK L +R+ + +
Sbjct: 121 VPEHLRKSSSKKSEEMLSNQMLSGIPEVDLGIEERIRNIEATEEAKLKLLRERMAKKERE 180
Query: 204 DFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR-----------PTDNS 252
+P++ + ++ Q +R + DDG+ SR P
Sbjct: 181 TSFVPTNMAVNFVQH-----------------NRFNIDDGSRSRYARRVPREKEPPVAKP 223
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
+AATD F E+F+K+ R
Sbjct: 224 VVVIAEAEAATDDFHFEKFKKQFR 247
>gi|288684380|ref|NP_001165770.1| uncharacterized protein LOC733913 [Xenopus (Silurana) tropicalis]
gi|170285295|gb|AAI61311.1| Unknown (protein for MGC:186018) [Xenopus (Silurana) tropicalis]
Length = 290
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/267 (33%), Positives = 137/267 (51%), Gaps = 29/267 (10%)
Query: 35 EERRLALEEIKFLQKQRERKSGIPA--------IPSALQSA----AAAGGGGLTKVSEKN 82
+E R+ LEE K +Q R+R++G+ A +P + A GG + K+
Sbjct: 26 QEVRIKLEEAKEVQSLRKRQNGVSAAALLVGERLPEEVIMADDPFKMQSGGMVDMKKLKD 85
Query: 83 EGD---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
G GE+++L L +F+ ET ED +M+KY+E EL K++G I N+ + K A
Sbjct: 86 RGKDRLGEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEKKVKPKSA 143
Query: 140 EDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 196
ED LY++PE +KV K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 144 EDCLYELPESIKVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKARLLAEQQ 203
Query: 197 LMGRAKSDFSIPSSYSADYFQRGRDYAE----KLRREHPELYKDRGSQDDGAGSRPTDNS 252
+ K +P++ + +Y Q R Y E +RR H E K R + G +P
Sbjct: 204 NKKKDKHTSFVPTNMAVNYVQHNRFYQEDQNTPMRR-HKEEPKPRPLR-VGDTEKPEPEK 261
Query: 253 TDAAGSRQA---ATDQFMLERFRKRER 276
+ R + ATD + E+F+K R
Sbjct: 262 SPPNRKRPSNEKATDDYHYEKFKKMNR 288
>gi|440797240|gb|ELR18335.1| hepatocellular carcinomaassociated antigen 59, putative
[Acanthamoeba castellanii str. Neff]
Length = 309
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 142/297 (47%), Gaps = 37/297 (12%)
Query: 9 KEKKKNFRKRSYEEEEETT------NKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPS 62
+++K RK+ E+E ET + +D+ L L E + LQ++RER G A +
Sbjct: 20 RKQKARLRKKIVEDEPETEADQEAETEEGEDDAPLGLMLRETRKLQRERERVKGCEAAAT 79
Query: 63 ALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVE---QELA 119
+ + + ++ D K+ +L+ TF + V +P + Y+E +E
Sbjct: 80 STEVLQKIATSSFVRPVANDDDD--KETHLLESTFTVQAEQDVVNPLLENYIEARLREFR 137
Query: 120 KKRGK----------NIDVNDRVEN-----DLKHAEDELYKIPEHLKVKK--RNSEESST 162
+ R K +D D+ E DL+ E +LY+IPEHLKV + R+ ++ S
Sbjct: 138 ETRAKEAIEKAKAERGVDWRDKEETTEKEFDLREEERKLYEIPEHLKVSETMRSDDQVSE 197
Query: 163 QWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 222
W TGI EV+LPIEYKLKNIE TE AK+LL +++ + P +Y+ F R
Sbjct: 198 AWLTGIQEVELPIEYKLKNIEATEDAKRLLLKRKEGPKPPPQ---PDAYNT-RFGRPSTQ 253
Query: 223 AEKLRRE---HPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
+ +RR+ H + +DR + G D+ + ATD ERF+KR R
Sbjct: 254 TQIVRRDRNAHRDSNRDRNNDGQQQGGWRGDHHR--GKKSEQATDDIAFERFKKRFR 308
>gi|431898902|gb|ELK07272.1| hypothetical protein PAL_GLEAN10012522 [Pteropus alecto]
Length = 289
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 144/289 (49%), Gaps = 32/289 (11%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP----------- 61
K FR+R + E E + D +E RL LEE + +Q R+R +G+ A+
Sbjct: 6 KTFRRRRADSESEEDEQ---DSQEVRLKLEETREVQNLRKRPNGVSAVALLVGEKVQEET 62
Query: 62 SALQSAAAAGGGGLTKVSEKNEGDGEK----DELVLQDTFAQETAVMVEDPNMLKYVEQE 117
+ + GG+ + + E +K ++L L +F+ ET ED +M+KY+E E
Sbjct: 63 TLVDDPFQMKTGGMVDMKKLKERGKDKISDEEDLHLGTSFSAETNRRDEDADMMKYIETE 122
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPI 175
L K++G I ++ + K+AED LY++PE+++V K+ E S Q +GI EV L I
Sbjct: 123 LKKRKG--IVEHEEQKVKQKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGI 180
Query: 176 EYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL------RR 228
+ K+KNI TE AK +LL E++ + +P++ + +Y Q R Y E+L +
Sbjct: 181 DAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNK 240
Query: 229 EHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
E P+ R G + R N A + ATD + E+F+K R
Sbjct: 241 EEPKARPLRVGDTEKPEPERSPPNRKRPANEK--ATDDYHYEKFKKMNR 287
>gi|195375178|ref|XP_002046380.1| GJ12867 [Drosophila virilis]
gi|194153538|gb|EDW68722.1| GJ12867 [Drosophila virilis]
Length = 298
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 143/297 (48%), Gaps = 49/297 (16%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDD + E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKSSRKNLRQR----------KNSDDGDNEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGD-GEKD---ELVLQDTFAQETAVMVEDPNM 110
A GGL + + G E D ++ + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKIGGLVNMQQMKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNIDVNDRVEND----LKHAEDELYKIPEHLK--VKKRNSEESSTQW 164
+KY+EQEL K++G D N+ ++D L + LY +P+HL+ R+ E S Q
Sbjct: 131 MKYIEQELQKRKGGAADENENDDSDAHKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQM 190
Query: 165 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR--- 220
GI EV L I K+ NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 LNGIPEVDLGIHAKIHNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFNI 249
Query: 221 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
D E+ RR+ +D ++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 250 EDNNEQRRRKR---------EDKDGNNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 296
>gi|47212056|emb|CAF90174.1| unnamed protein product [Tetraodon nigroviridis]
Length = 291
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/267 (31%), Positives = 137/267 (51%), Gaps = 31/267 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPA--------IPSALQ------SAAAAGGGGLTKVSEKNE 83
R +EE K LQ R+R++G+ +P ++ G + +V ++N
Sbjct: 26 RSKVEEAKELQSLRKRQTGVSLTALLVGEKLPPEVEIDNDPFKLKTGGVVDMKRVKDRNR 85
Query: 84 GDGEKDE--LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED 141
D +DE L L +F+ ET ED +M+KY+E EL K++G+ +V+ +K+AED
Sbjct: 86 -DMTEDETDLNLGTSFSVETNRRDEDADMMKYIETELKKRKGQVEAEEQKVK--VKNAED 142
Query: 142 ELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLM 198
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E+R
Sbjct: 143 HLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIINTEDAKARLLAEQRNK 202
Query: 199 GRAKSDFSIPSSYSADYFQRGRDYAEKL--------RREHPELYKDR-GSQDDGAGSRPT 249
+ +S +P++ + +Y Q R Y E + RE P+ R G + P+
Sbjct: 203 KKDQSTSFVPTNIAVNYVQHNRFYHEDMNAPQRHHRHREEPKARPLRVGDTEKPGPEAPS 262
Query: 250 DNSTDAAGSRQAATDQFMLERFRKRER 276
++ + + ATD + E+F+K R
Sbjct: 263 PSNHRKRPNNEKATDDYHYEKFKKMNR 289
>gi|255083575|ref|XP_002508362.1| predicted protein [Micromonas sp. RCC299]
gi|226523639|gb|ACO69620.1| predicted protein [Micromonas sp. RCC299]
Length = 246
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/197 (38%), Positives = 103/197 (52%), Gaps = 24/197 (12%)
Query: 31 SDDEEERRLA------LEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEG 84
SDDEE+ + A +E+ K L K R + G+ A A A G G ++++
Sbjct: 14 SDDEEDEQGAQALRERMEDAKTLIKNRVKSKGVGA------EALALGSGKKDVDADEDAD 67
Query: 85 DGEKDELVLQDTFAQETAVMV--EDPNMLKYVEQELAKKRGKNIDVNDRVEND----LKH 138
DG+ + FA AV V EDPNML+Y+EQELAK+RG D K
Sbjct: 68 DGKHAQ------FAAGAAVDVDGEDPNMLRYIEQELAKRRGAGGDEGGDGAGTSGGGAKS 121
Query: 139 AEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLM 198
AE+ L+ P+ L+VKK EE++ +W TGI EVQLP +YK+KNIE TE AK + EK
Sbjct: 122 AEERLWDTPDELRVKKTEGEETADRWLTGIVEVQLPADYKIKNIEATERAKAKMLEKIHG 181
Query: 199 GRAKSDFSIPSSYSADY 215
G + P S A+
Sbjct: 182 GGDGAAMDHPHSRQAEL 198
>gi|195135385|ref|XP_002012113.1| GI16613 [Drosophila mojavensis]
gi|193918377|gb|EDW17244.1| GI16613 [Drosophila mojavensis]
Length = 300
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 145/299 (48%), Gaps = 51/299 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK+ +KN R+R K SDD + E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKKPRKNLRQR----------KNSDDGDNEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGD----GEKDELVLQDTFAQETAVMVEDPNM 110
A GGL + G + ++ + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKIGGLVNMQTIKSGKMKEVDDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNID--VNDRVEND----LKHAEDELYKIPEHLK--VKKRNSEESST 162
+KY+EQEL K++G D NDR + D + + LY +PEHL+ R+ E S
Sbjct: 131 MKYIEQELQKRKGGAADENSNDRDDRDAHKYMSPEDAALYALPEHLRQSSSHRSEEMLSN 190
Query: 163 QWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR- 220
Q GI EV L I+ K+ NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 QMLNGIPEVDLGIQAKIHNIEATEEAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRF 249
Query: 221 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
D +E+ RR+ ++ A ++ N T+ G ++ ATD + ++FRK+ R
Sbjct: 250 KIEDSSEQRRRKR---------ENREADNKSARNQTNPNGVKR-ATDDYHYDKFRKQFR 298
>gi|195012263|ref|XP_001983556.1| GH15962 [Drosophila grimshawi]
gi|193897038|gb|EDV95904.1| GH15962 [Drosophila grimshawi]
Length = 300
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 144/299 (48%), Gaps = 51/299 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDD E+E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKSSRKNLRQR----------KNSDDGEKEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNM 110
A GGL + G ++ E + + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKMGGLVNMQTLKSGKMKEPEDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNI-DVNDRVENDLKH----AED-ELYKIPEHLK--VKKRNSEESST 162
+KY+EQEL K++G D D ++ H ED LY +P+HL+ R+ E S
Sbjct: 131 MKYIEQELQKRKGGGADDSTDNADDGDAHKYLTPEDAALYALPDHLRQSSSHRSEEMLSN 190
Query: 163 QWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR- 220
Q GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 QMLNGIPEVDLGIHAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRF 249
Query: 221 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
D E+ RR+ +D A S+ + T+ G ++ ATD + ++FRK+ R
Sbjct: 250 NIEDSNEQRRRKR---------EDKEAKSKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 298
>gi|348530434|ref|XP_003452716.1| PREDICTED: uncharacterized protein C9orf78 homolog [Oreochromis
niloticus]
Length = 291
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/295 (32%), Positives = 145/295 (49%), Gaps = 41/295 (13%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAI------------ 60
KNFR+R ++ + + + EE R LEE K LQ R+R+SGI
Sbjct: 5 KNFRRR----KDSSDVEEDETTEEVRHKLEEAKELQSLRKRQSGISVTALLVGEKLPPEA 60
Query: 61 -----PSALQSAAAAGGGGLTKVSEKNEGDGEKDE--LVLQDTFAQETAVMVEDPNMLKY 113
P L++ G + KV ++N D +DE L L +F+ ET ED +M+KY
Sbjct: 61 EIENDPFKLKTG---GIVDMKKVKDRNR-DMTEDETDLNLGTSFSAETNRRDEDADMMKY 116
Query: 114 VEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEV 171
+E EL KK+G +V+ +K+ ED LY++PE+++V K+ E S Q +GI EV
Sbjct: 117 IETELKKKKGLVEAEEQKVK--VKNPEDHLYELPENIRVNSAKKTEEMLSNQMLSGIPEV 174
Query: 172 QLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY-----AEK 225
L I+ K+KNI +TE AK KLL E+R + +P++ + +Y Q R Y A +
Sbjct: 175 DLGIDAKIKNIIQTEDAKAKLLAEQRNKKKDHGTSFVPTNIAVNYVQHNRFYHEDANAPQ 234
Query: 226 LRREHPELYKDR----GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
H E K R G + P+ + + + ATD + E+F+K R
Sbjct: 235 RHHRHKEEPKARPLRVGDTEKPGPEAPSPPNYRKRPNNEKATDDYHYEKFKKMNR 289
>gi|332016923|gb|EGI57732.1| Uncharacterized protein C9orf78 [Acromyrmex echinatior]
Length = 297
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/274 (31%), Positives = 128/274 (46%), Gaps = 38/274 (13%)
Query: 32 DDEEERRLAL----EEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GGGGLT 76
+D EE +++L EE+K +QK RER +G+ + AL A+ GG +
Sbjct: 31 NDSEEEKMSLREKVEEMKIIQKLRERPAGVDVVGLALGENVASDTITSDPFNMKTGGMIN 90
Query: 77 KVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI-----DVN 129
+ KN D E + F ET ED M+KY+E+EL+K++ KN D N
Sbjct: 91 MAALKNTKHKPNDAYETGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKNSNDAANDAN 150
Query: 130 DRVENDLKHAEDELYKIPEHLKVKKRNSEES--STQWTTGIAEVQLPIEYKLKNIEETEA 187
+ + E L +PEHL+ N E S Q +GI EV L IE K++NIE TE
Sbjct: 151 NEKGSYCSPEEAALRAVPEHLRQSSANRSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEE 210
Query: 188 AK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGS 246
AK KLL ++ S F +P++ + ++ Q R E + +K SQ +
Sbjct: 211 AKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNR-----FNIEDSDFHK---SQQESGDK 261
Query: 247 RPTDNSTDAAGSR----QAATDQFMLERFRKRER 276
+ D G R + ATD + ERF+K+ R
Sbjct: 262 KKCTTKEDIRGKRKDNGEKATDDYHYERFKKQFR 295
>gi|195095931|ref|XP_001997854.1| GH17986 [Drosophila grimshawi]
gi|193905556|gb|EDW04423.1| GH17986 [Drosophila grimshawi]
Length = 300
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 144/299 (48%), Gaps = 51/299 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDD E+E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 21 KKSSRKNLRQR----------KNSDDCEKEEQLTLDEIKERQRLRQRPNGVSLVGLALGK 70
Query: 67 AAA------------AGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNM 110
A GGL + G ++ E + + F+ ET ED M
Sbjct: 71 KVAPEEELAIKDPFNVKMGGLVNMQTLKSGKMKEPEDAYDVGIGTQFSAETNKRDEDEEM 130
Query: 111 LKYVEQELAKKRGKNI-DVNDRVENDLKH----AED-ELYKIPEHLK--VKKRNSEESST 162
+KY+EQEL K++G D D ++ H ED LY +P+HL+ R+ E S
Sbjct: 131 MKYIEQELQKRKGGGADDSTDNADDGDAHKYLTPEDAALYALPDHLRQSSSHRSEEMLSN 190
Query: 163 QWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR- 220
Q GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 191 QMLNGIPEVDLGIHAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRF 249
Query: 221 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
D E+ RR+ +D A ++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 250 NIEDSNEQRRRKR---------EDKEAKNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 298
>gi|238231821|ref|NP_001154097.1| CI078 protein [Oncorhynchus mykiss]
gi|225704000|gb|ACO07846.1| C9orf78 [Oncorhynchus mykiss]
Length = 295
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 84/274 (30%), Positives = 139/274 (50%), Gaps = 38/274 (13%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA---AGG---------GGLTKVSE--- 80
E R L+E K LQ R+R++G+ ++ + L+ GG GG+ + +
Sbjct: 25 EVRSKLDEAKELQSLRKRQTGV-SVAALLEGEKLRLDEGGDNDPFKLKTGGVVDMKKVKD 83
Query: 81 --KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH 138
++ D + +L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+
Sbjct: 84 RARDMTDDDTGDLNLGTSFSAETNRRDEDADMMKYIETELKKKKGSVEAEEQKVK--VKN 141
Query: 139 AEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEK 195
AED LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK KLLQ++
Sbjct: 142 AEDLLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIFTEEAKAKLLQDQ 201
Query: 196 RLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD--------GAGSR 247
R + +P++ + +Y Q R Y E + P+ + +R + G +
Sbjct: 202 RNKKKDNGTSFVPTNITVNYVQHNRFYREDV--NAPQRHHNRHKPKEPEARPLRVGDTEK 259
Query: 248 PTDNSTDAAGSR-----QAATDQFMLERFRKRER 276
P + + A R + ATD + E+F+K R
Sbjct: 260 PGPEAVEPANHRKRPNNEKATDDYHYEKFKKMNR 293
>gi|387914846|gb|AFK11032.1| uncharacterized protein C9orf78-like protein [Callorhinchus milii]
Length = 289
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/293 (31%), Positives = 144/293 (49%), Gaps = 39/293 (13%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEER-----RLALEEIKFLQKQRERKSGI---------- 57
K++R+R EE SD+E+E+ R L+E+K +Q R R++G+
Sbjct: 5 KSYRRRRLEE--------SDEEDEQTTVLVRSKLDELKEIQSMRRRQNGVSAAALLVGEK 56
Query: 58 -PAIPSALQSAAAAGGGGLT-----KVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNML 111
P S + GG+ K + + E+ +L L +F+ ET ED +M+
Sbjct: 57 TPEEASTVDDPFKLKTGGMIDMKKIKDRNRERVEEEETDLNLGTSFSVETNRRDEDADMM 116
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIA 169
KY+E EL KR K I N+ + +K+ ED LY++P+++ V KR E S Q +GI
Sbjct: 117 KYIETEL--KRRKGILENEEQKVKIKNPEDMLYELPDNINVSSAKRTEEMLSNQMLSGIP 174
Query: 170 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL-- 226
EV L I+ K+KNI TE AK +LL E+R + + +P++ + +Y Q R Y E++
Sbjct: 175 EVDLGIDAKIKNIISTEEAKAQLLAEQRNKKKDNATSFVPTNIAVNYVQHNRFYREEIHA 234
Query: 227 --RREHPELYKDRGSQDDGAGSRPTDNSTD-AAGSRQAATDQFMLERFRKRER 276
RR EL D + P + + S + ATD + E+F+K R
Sbjct: 235 PARRHKEELKPKPLRVGDTEKTEPDQSPPNRKRPSNEKATDDYHYEKFKKMSR 287
>gi|194747083|ref|XP_001955982.1| GF24824 [Drosophila ananassae]
gi|190623264|gb|EDV38788.1| GF24824 [Drosophila ananassae]
Length = 295
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 147/301 (48%), Gaps = 51/301 (16%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDD-EEERRLALEEIKFLQKQRERKSGIPAIPSA 63
I KK +KN R+R K SDD E+E ++ LEEIK Q+ R+R +G+ + A
Sbjct: 15 IVFKKSSRKNLRQR----------KSSDDGEKEEQVTLEEIKERQRLRQRPNGVSLVGLA 64
Query: 64 LQSAAA------------AGGGGLTKVSEKNEGD-GEKD---ELVLQDTFAQETAVMVED 107
L A GGL + + G E D ++ + F+ ET ED
Sbjct: 65 LGKKMAPEEELAIKDPFNVKTGGLVNMQQLKSGKMKEADDAYDVGIGTQFSAETNKRDED 124
Query: 108 PNMLKYVEQELAK-KRG----KNIDVNDRVENDLKHAEDELYKIPEHLK--VKKRNSEES 160
M+KY+EQEL K KRG D + V L + LY +P+HL+ R+ E
Sbjct: 125 EEMMKYIEQELQKRKRGGTDASAADDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEML 184
Query: 161 STQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRG 219
S Q GI EV L I+ K++NIE TE AK KLLQ+ + S F +P++ + ++ Q
Sbjct: 185 SNQMLNGIPEVDLGIQAKIRNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHN 243
Query: 220 R----DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRE 275
R D +E+ RR+ ++D G++ + T+ G ++ ATD + ++FRK+
Sbjct: 244 RFNIEDNSEQKRRK----------REDREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQF 292
Query: 276 R 276
R
Sbjct: 293 R 293
>gi|226442876|ref|NP_001139972.1| CI078 protein [Salmo salar]
gi|221220608|gb|ACM08965.1| C9orf78 [Salmo salar]
Length = 295
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/274 (30%), Positives = 139/274 (50%), Gaps = 38/274 (13%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA---AGG---------GGLTKVSE--- 80
E R L+E K LQ R+R++G+ ++ + L+ GG GG+ + +
Sbjct: 25 EVRSKLDEAKELQSLRKRQTGV-SVAALLEGEKLRLDEGGDNDPFKLKTGGVVDMKKVKD 83
Query: 81 --KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH 138
++ D + +L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+
Sbjct: 84 RARDMTDDDTGDLNLGTSFSAETNRRDEDADMMKYIETELKKKKGLVEAEEQKVK--VKN 141
Query: 139 AEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEK 195
AED LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK KLLQ++
Sbjct: 142 AEDLLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIFTEEAKAKLLQDQ 201
Query: 196 RLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD--------GAGSR 247
R + +P++ + +Y Q R Y E + P+ + +R + G +
Sbjct: 202 RNKKKDNGTSFVPTNIAVNYVQHNRFYREDV--NAPQRHHNRHKPKEPEARPLRVGDTEK 259
Query: 248 PTDNSTDAAGSR-----QAATDQFMLERFRKRER 276
P + + A R + ATD + E+F+K R
Sbjct: 260 PGPEAVEPANHRKRPNNEKATDDYHYEKFKKMNR 293
>gi|311246626|ref|XP_003122268.1| PREDICTED: uncharacterized protein C9orf78-like [Sus scrofa]
Length = 289
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V KR E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKRTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|442760949|gb|JAA72633.1| Hypothetical protein, partial [Ixodes ricinus]
Length = 303
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 43/279 (15%)
Query: 38 RLALEEIKFLQKQRERKSGIPAI------------------PSALQSAAAAGGGGLTKVS 79
R LE+ K +QK R+R +G+ I P L++ L
Sbjct: 26 REILEDTKEIQKLRKRPNGVSVIGLNLGKKLTTKEELVIEDPFKLKTGGMIDMKALKGXX 85
Query: 80 EKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
E E D + L +TF+ ET ED +M+KY+E+ELAK+RGK D ++
Sbjct: 86 ITME---ELDAVNLGNTFSVETNQRDEDADMMKYIEEELAKRRGKGQDTETDSRDEGVDP 142
Query: 140 EDELYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL 197
ED L+ +PEHL+ K++ E S Q +GI EV L IE +++NIE TE AK L +R+
Sbjct: 143 EDVLFHVPEHLRKSSSKKSEEMLSNQMLSGIPEVDLGIEERIRNIEATEEAKLKLLRERM 202
Query: 198 MGRAKSDFSIPSSYSADYFQRGR---------DYAEKLRRE-HPELYKDR---GSQDDGA 244
+ + +P++ + ++ Q R YA ++ RE P + K + A
Sbjct: 203 AKKERETSFVPTNMAVNFVQHNRFNIDDGSRSRYARRVPREKEPPVAKPVVVIAEAEAVA 262
Query: 245 GSRPTDNSTDAAG-SR------QAATDQFMLERFRKRER 276
S P G SR + ATD F E+F+K+ R
Sbjct: 263 HSIPGRQGKGGKGLSRGKGNDDEKATDDFHFEKFKKQFR 301
>gi|432874672|ref|XP_004072535.1| PREDICTED: uncharacterized protein C9orf78 homolog [Oryzias
latipes]
Length = 294
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 81/271 (29%), Positives = 133/271 (49%), Gaps = 37/271 (13%)
Query: 38 RLALEEIKFLQKQRERKSGIPAI-----------------PSALQSAAAAGGGGLTKVSE 80
R LEE K +Q R+R++G+ P L++ G + KV +
Sbjct: 27 RSKLEEAKEIQSLRKRQTGVSVTALLVGEKLPPEAEIDNDPFKLKTG---GVIDMKKVKD 83
Query: 81 KNEGDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
+N E + EL L +F+ ET ED +M+KY+E EL KK+G +++ +K+
Sbjct: 84 RNRDMTEDETELNLGTSFSAETNRRDEDADMMKYIETELKKKKGLVEAEEQKIK--VKNP 141
Query: 140 EDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 196
ED LY++PE+++V K+ E S Q +GI EV L I+ K+KNI +TE AK KL+ E+R
Sbjct: 142 EDHLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIQTEEAKAKLIAEQR 201
Query: 197 LMGRAKSDFSIPSSYSADYFQRGRDYAE---KLRREH--------PELYKDRGSQDDGAG 245
+ +P++ + +Y Q R Y E +R H P + ++ G
Sbjct: 202 NKKKDNGTSFVPTNIAVNYVQHNRFYHEDSNAAQRHHRHKEPEPKPRPLRVGDTEKPGLE 261
Query: 246 SRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
+ P+ + + + ATD + E+F+K R
Sbjct: 262 AAPSPPNFRKRPNNEKATDDYHYEKFKKMNR 292
>gi|307199470|gb|EFN80083.1| Uncharacterized protein C9orf78 [Harpegnathos saltator]
Length = 295
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 129/278 (46%), Gaps = 40/278 (14%)
Query: 30 LSDDEEER------RLALEEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GG 72
+SDD+E R +EE+K +QK RER +GI + AL A+ G
Sbjct: 25 ISDDDESESEKTSLREKVEEMKIVQKLRERPTGINVVGLALGENVASDVIMSDPFNMKTG 84
Query: 73 GGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI-DVN 129
G + KN + D + + F ET ED M+KY+E+EL+K++ KN DV
Sbjct: 85 GIVNMAVLKNTKHRQNDAYDTGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKNNNDVT 144
Query: 130 DRVEND----LKHAEDELYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIE 183
+ N+ E L +PEHL+ R+ E S Q +GI EV L IE K++NIE
Sbjct: 145 NGTNNEKGSYCSPEEAALRAVPEHLRQSSAHRSEEMLSNQMLSGIPEVDLGIEAKIRNIE 204
Query: 184 ETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD 242
TE AK KLL ++ S F +P++ + ++ Q R E + K R DD
Sbjct: 205 ATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNR-----FNIEDSDFQKSRQDSDD 258
Query: 243 GAGSRPTDNSTDAAGSR----QAATDQFMLERFRKRER 276
+ D G R + ATD + ERF+K+ R
Sbjct: 259 ---KKKCVTKEDIRGKRKDNGEKATDDYHYERFKKQFR 293
>gi|17068385|gb|AAH17570.1| Chromosome 9 open reading frame 78 [Homo sapiens]
Length = 289
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G +V++ K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKGIVEHEEQKVKS--KNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQSKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|395844397|ref|XP_003794948.1| PREDICTED: uncharacterized protein C9orf78 homolog [Otolemur
garnettii]
Length = 289
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKIQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|115496926|ref|NP_001069516.1| uncharacterized protein C9orf78 homolog [Bos taurus]
gi|338720610|ref|XP_003364207.1| PREDICTED: uncharacterized protein C9orf78-like isoform 2 [Equus
caballus]
gi|94574208|gb|AAI16054.1| Chromosome 9 open reading frame 78 ortholog [Bos taurus]
gi|296482067|tpg|DAA24182.1| TPA: chromosome 9 open reading frame 78 [Bos taurus]
Length = 265
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 80/250 (32%), Positives = 129/250 (51%), Gaps = 25/250 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGD-GEKDELVLQDT 96
RL LEE + +Q R+R +G+ G + K+ E+ + E+++L L +
Sbjct: 28 RLKLEETREVQNLRKRPNGM----------KTGGMVDMKKLKERGKDKISEEEDLHLGTS 77
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--K 154
F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY++PE+++V K
Sbjct: 78 FSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYELPENIRVSSAK 135
Query: 155 RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSA 213
+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ + +P++ +
Sbjct: 136 KTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAV 195
Query: 214 DYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQF 266
+Y Q R Y E+L +E P+ R G + R N A + ATD +
Sbjct: 196 NYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRPANEK--ATDDY 253
Query: 267 MLERFRKRER 276
E+F+K R
Sbjct: 254 HYEKFKKMNR 263
>gi|410979370|ref|XP_003996058.1| PREDICTED: uncharacterized protein C9orf78 homolog, partial [Felis
catus]
Length = 286
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 25 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 84
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 85 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 142
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 143 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 202
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 203 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 262
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 263 KRPANEK--ATDDYHYEKFKKMNR 284
>gi|355732023|gb|AES10570.1| hypothetical protein [Mustela putorius furo]
Length = 288
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|149738228|ref|XP_001499857.1| PREDICTED: uncharacterized protein C9orf78-like isoform 1 [Equus
caballus]
Length = 289
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|440894378|gb|ELR46847.1| hypothetical protein M91_13534 [Bos grunniens mutus]
Length = 289
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|7706557|ref|NP_057604.1| uncharacterized protein C9orf78 [Homo sapiens]
gi|388452692|ref|NP_001253951.1| uncharacterized protein LOC716542 [Macaca mulatta]
gi|114627167|ref|XP_520311.2| PREDICTED: uncharacterized protein C9orf78 homolog isoform 2 [Pan
troglodytes]
gi|332230227|ref|XP_003264289.1| PREDICTED: uncharacterized protein C9orf78 homolog [Nomascus
leucogenys]
gi|397503617|ref|XP_003822417.1| PREDICTED: uncharacterized protein C9orf78 homolog [Pan paniscus]
gi|402896312|ref|XP_003911247.1| PREDICTED: uncharacterized protein C9orf78 homolog [Papio anubis]
gi|426363280|ref|XP_004048771.1| PREDICTED: uncharacterized protein C9orf78 homolog [Gorilla gorilla
gorilla]
gi|74753081|sp|Q9NZ63.1|CI078_HUMAN RecName: Full=Uncharacterized protein C9orf78; AltName:
Full=Hepatocellular carcinoma-associated antigen 59
gi|7158847|gb|AAF37561.1| hepatocellular carcinoma-associated antigen 59 [Homo sapiens]
gi|14043339|gb|AAH07664.1| Chromosome 9 open reading frame 78 [Homo sapiens]
gi|119608316|gb|EAW87910.1| chromosome 9 open reading frame 78, isoform CRA_b [Homo sapiens]
gi|193787017|dbj|BAG51840.1| unnamed protein product [Homo sapiens]
gi|355570051|gb|EHH25578.1| hypothetical protein EGK_21433 [Macaca mulatta]
gi|355753000|gb|EHH57046.1| hypothetical protein EGM_06606 [Macaca fascicularis]
gi|380785079|gb|AFE64415.1| uncharacterized protein C9orf78 [Macaca mulatta]
gi|380808288|gb|AFE76019.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|380813696|gb|AFE78722.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411663|gb|AFH29045.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411665|gb|AFH29046.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411667|gb|AFH29047.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|383411669|gb|AFH29048.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|384942542|gb|AFI34876.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|384942544|gb|AFI34877.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|384942546|gb|AFI34878.1| chromosome 9 open reading frame 78 [Macaca mulatta]
gi|410223530|gb|JAA08984.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410223532|gb|JAA08985.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410223534|gb|JAA08986.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410256334|gb|JAA16134.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410256336|gb|JAA16135.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410256338|gb|JAA16136.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410295782|gb|JAA26491.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410295784|gb|JAA26492.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410355223|gb|JAA44215.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410355225|gb|JAA44216.1| chromosome 9 open reading frame 78 [Pan troglodytes]
gi|410355227|gb|JAA44217.1| chromosome 9 open reading frame 78 [Pan troglodytes]
Length = 289
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|296191004|ref|XP_002743423.1| PREDICTED: uncharacterized protein C9orf78-like [Callithrix
jacchus]
Length = 289
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 132/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLTKVSEKNEGDG 86
RL LEE + +Q R+R +G+ A+ + + GG+ + + E
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 87 EK----DELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+K ++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISDEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|125979391|ref|XP_001353728.1| GA20734 [Drosophila pseudoobscura pseudoobscura]
gi|195169146|ref|XP_002025386.1| GL11930 [Drosophila persimilis]
gi|54640711|gb|EAL29462.1| GA20734 [Drosophila pseudoobscura pseudoobscura]
gi|194108854|gb|EDW30897.1| GL11930 [Drosophila persimilis]
Length = 296
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 114/237 (48%), Gaps = 35/237 (14%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEE-ERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
KK +KN R+R K SDDEE E +L L++IK Q+ R R +G+ + AL
Sbjct: 20 KKSSRKNLRQR----------KNSDDEEKEEKLTLDDIKERQRLRHRPNGVSLVGLALGK 69
Query: 67 AAA------------AGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNM 110
A GGL ++ G ++ E + + F+ ET ED M
Sbjct: 70 KIAPEEELAIKDPFNVKSGGLVNMATLKSGKMKEAEDPYDVGIGTQFSAETNKRDEDEEM 129
Query: 111 LKYVEQELAKKRGKNIDVNDRVEND----LKHAEDELYKIPEHLK--VKKRNSEESSTQW 164
+KY+E EL K++G D D + D L + LY +P+HL+ R+ E S Q
Sbjct: 130 MKYIELELQKRKGGGTDAADNDDGDVNKYLTPEDAALYALPDHLRQSSTHRSEEMLSNQM 189
Query: 165 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 220
GI EV L I+ K+ NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 190 LNGIPEVDLGIQAKICNIEATEDAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNR 245
>gi|383853293|ref|XP_003702157.1| PREDICTED: uncharacterized protein C9orf78-like [Megachile
rotundata]
Length = 310
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 140/298 (46%), Gaps = 38/298 (12%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL 64
I K++ +K+ RKR +E+ ++E R +EE+K +QK RER G+ + AL
Sbjct: 23 IEFKRKSRKSLRKRHVSSDEDDN---ENEETSIREKVEEMKIIQKLRERPKGVNVVGLAL 79
Query: 65 QSAAAA-----------GGGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNML 111
GG + + KN + D E + F ET ED M+
Sbjct: 80 GENVTPDVMTSDPFNVKTGGMVNMAALKNTKLKQNDAYETGIGTQFNAETNKRDEDEEMV 139
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLK--VKKRNSEESSTQW 164
KY+E+EL+K++ KN D + ++ K + E L +PEHL+ R+ E S Q
Sbjct: 140 KYIEEELSKRKSKNEDKTENGSSNDKGSYCSPEEAALQAVPEHLRQSSAHRSEEMLSNQM 199
Query: 165 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYA 223
+GI EV L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R
Sbjct: 200 LSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNR--- 255
Query: 224 EKLRREHPELYKDRGSQDD-GAGSRPTDNSTDAAGSR----QAATDQFMLERFRKRER 276
E + K + DD + P D D G R + ATD + ERF+K+ R
Sbjct: 256 --FNIEDADFQKSKQDSDDRKKVTAPRD---DFKGKRKDNGEKATDDYHYERFKKQFR 308
>gi|348570394|ref|XP_003470982.1| PREDICTED: uncharacterized protein C9orf78-like [Cavia porcellus]
Length = 292
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 131/269 (48%), Gaps = 36/269 (13%)
Query: 38 RLALEEIKFLQKQRERKSGIPA---------------IPSALQSAAAAGGGGLTKVSEKN 82
RL LEE + +Q R+R +G+ A + SA S GG + K
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVVSARSSFPMKTGGMVDMKKLKE 87
Query: 83 EGD---GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLK-- 137
G E+++L L +F+ ET ED +M+KY+E EL K++G + + E +K
Sbjct: 88 RGKDKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG----IVEHEEQKVKPR 143
Query: 138 HAEDELYKIPEHLKV--KKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQE 194
+AED LY++PE ++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E
Sbjct: 144 NAEDCLYELPESIRVASAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAE 203
Query: 195 KRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSR 247
++ + +P++ + +Y Q R Y E+L +E P+ R G + R
Sbjct: 204 QQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPER 263
Query: 248 PTDNSTDAAGSRQAATDQFMLERFRKRER 276
N A + ATD + E+F+K R
Sbjct: 264 SPPNRKRPANEK--ATDDYHYEKFKKMNR 290
>gi|225704712|gb|ACO08202.1| C9orf78 [Oncorhynchus mykiss]
Length = 295
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 138/273 (50%), Gaps = 42/273 (15%)
Query: 36 ERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA---AGG---------GGLTKVSE--- 80
E R L+E K LQ R+R++G+ ++ + L+ GG GG+ + +
Sbjct: 25 EVRSKLDEAKELQSLRKRQTGV-SVAALLEGEKLRLDEGGDNDPFKLKTGGVVDMKKVKD 83
Query: 81 --KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH 138
++ D + +L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+
Sbjct: 84 RARDMTDDDTGDLNLGTSFSAETNRRDEDADMVKYIETELKKKKGLVEAEEQKVK--VKN 141
Query: 139 AEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEK 195
AED LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK KLLQ++
Sbjct: 142 AEDLLYELPENIRVNSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIIFTEEAKAKLLQDQ 201
Query: 196 RLMGRAKSDFSIPSSYSADYFQRGRDYAEKL---RREH-------PELYKDRGSQDDGAG 245
R + +P++ + +Y Q R Y E + +R H PE R G
Sbjct: 202 RNKKKDNGTSFVPTNIAVNYVQHNRFYREDVNAPQRHHSRHKPKEPEARPLRV----GDT 257
Query: 246 SRPTDNSTDAAGSR-----QAATDQFMLERFRK 273
+P + + A R + ATD + E+F+K
Sbjct: 258 EKPGPEAVEPANHRKRPNNEKATDDYHYEKFKK 290
>gi|344271642|ref|XP_003407646.1| PREDICTED: uncharacterized protein C9orf78-like [Loxodonta
africana]
Length = 289
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
S + ATD + E+F+K R
Sbjct: 266 KRP--SNEKATDDYHYEKFKKMNR 287
>gi|426226107|ref|XP_004007195.1| PREDICTED: uncharacterized protein C9orf78 homolog [Ovis aries]
Length = 341
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 80 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 139
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 140 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 197
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 198 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 257
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 258 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 317
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 318 KRPANEK--ATDDYHYEKFKKMNR 339
>gi|443714105|gb|ELU06673.1| hypothetical protein CAPTEDRAFT_168725 [Capitella teleta]
Length = 342
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 143/300 (47%), Gaps = 37/300 (12%)
Query: 9 KEKKKNFRKRSYEEEEETTNKLSDDEEERRL-ALEEIKFLQKQRERKSGIPAIPSALQSA 67
K+ KKNFR++ + E + + + E ++E K LQK R+R+ G+ A A+
Sbjct: 3 KKPKKNFRRKVESSDSEDNSDVEEKSNETLCDRIKEAKELQKLRQRQRGVSAEDLAVAKI 62
Query: 68 AA-------------AGGGGLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVEDPNMLKY 113
GG K +K +K+++ + TFA ET ED +MLKY
Sbjct: 63 TPKDSKKKEDPLKLKTGGYIELKTLKKEISKADKEDVEQIGTTFAAETNRRDEDADMLKY 122
Query: 114 VEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKV----KKRNSEESSTQWTTGIA 169
VE+EL K++G I E + ED LY++PEH+K K +N + S Q +GI
Sbjct: 123 VEEELNKRKG--ITKEFESETLKRKPEDALYELPEHVKALTAKKSKNEDMLSNQMLSGIP 180
Query: 170 EVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRR 228
EV L IE K+ NIE TE AK KL++EKR + +P++ + ++ Q R L R
Sbjct: 181 EVDLGIEVKIHNIEMTEVAKQKLIEEKRRKKDSGISEFVPTNIAVNFMQHNR---FTLHR 237
Query: 229 EHPELYKDRGSQDD-------GAGSRP-----TDNSTDAAGSRQAATDQFMLERFRKRER 276
+ + K + ++ G RP T + A S + ATD + ERF+K R
Sbjct: 238 DEKAVVKKKVVEEPKPEPLRVGDIQRPDTVPSTSEFSRPAASTEKATDDYHYERFKKAVR 297
>gi|313227794|emb|CBY22942.1| unnamed protein product [Oikopleura dioica]
Length = 329
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 128/265 (48%), Gaps = 36/265 (13%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGI---------- 57
K + K+NFRKR E EE E E L L ++K ++ ++R +G+
Sbjct: 4 KSKSKRNFRKRRTEVNEEEPENSQVYEVENGLELAKLK--RELKKRTAGVNSESLAKGVK 61
Query: 58 ------PAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNML 111
P P L S GGGLT++ EK G+ EKD + TF E + E+ M
Sbjct: 62 TPRFDDPNDPYKLNS-----GGGLTQIREKRLGNNEKDVTQISSTFKTEKKIRDEEEEMN 116
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAED-----ELYKIPEHLKVKKRNSEES---STQ 163
K++E E+ K+RG + ++ +L+ ED LY+IPE + ++ E S Q
Sbjct: 117 KFIESEILKRRGIESATKESMKQNLR-LEDIVDPKFLYEIPEKYRATSKHLREDGLLSAQ 175
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMG--RAKSDFSIPSSYS--ADYFQRG 219
+GI EV L + KL+NIE TEAAK+LL +K + A SD S SY+ A + G
Sbjct: 176 MLSGIPEVDLGVNNKLQNIERTEAAKRLLVDKFIKDEKEASSDKSHERSYAREAAVNRGG 235
Query: 220 RDYAEKLRREHPELYKDRGSQDDGA 244
++ ++ +H YK ++ D
Sbjct: 236 NEFTDQFYSQHMRFYKGEEAETDAV 260
>gi|21450249|ref|NP_659134.1| uncharacterized protein C9orf78 homolog [Mus musculus]
gi|408360017|sp|Q3TQI7.2|CI078_MOUSE RecName: Full=Uncharacterized protein C9orf78 homolog
gi|13542853|gb|AAH05624.1| CDNA sequence BC005624 [Mus musculus]
gi|74177493|dbj|BAE34621.1| unnamed protein product [Mus musculus]
gi|74207670|dbj|BAE40080.1| unnamed protein product [Mus musculus]
gi|148676552|gb|EDL08499.1| mCG19001 [Mus musculus]
gi|149039065|gb|EDL93285.1| similar to Hypothetical protein MGC11690 [Rattus norvegicus]
Length = 289
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/264 (31%), Positives = 133/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPA----IPSALQ----------SAAAAGGGGLTKVSEKNE 83
RL LEE + +Q R+R +G+ A + +Q A G + K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMATGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED
Sbjct: 88 DKVSEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|57092043|ref|XP_537817.1| PREDICTED: uncharacterized protein C9orf78 isoform 1 [Canis lupus
familiaris]
Length = 289
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L ++ P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKDEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|198428614|ref|XP_002128903.1| PREDICTED: similar to Uncharacterized protein C9orf78
(Hepatocellular carcinoma-associated antigen 59) [Ciona
intestinalis]
Length = 291
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 134/267 (50%), Gaps = 32/267 (11%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQSAAA-------------AGGGGLTK---VSEK 81
R LE K LQK R+R+ G+ A+ A + GGL + V ++
Sbjct: 27 RDMLEATKELQKIRKRQMGVNAVSLATGAKLKKVDNLDVEADPFKMTTGGLVEMGNVKDR 86
Query: 82 NEG----DGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLK 137
N D ++D L TF+ ET ED + Y+E EL +++G+ + +D+ +
Sbjct: 87 NRDRTYEDVDRDVTNLGHTFSVETNRRDEDAELTAYIENELKRRKGETSNGDDKKAKE-- 144
Query: 138 HAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQE 194
AED+LY++PEHL++K K++ E S Q +GI EV L I+ K+KNIE TE AK KL+ E
Sbjct: 145 SAEDKLYQLPEHLQIKVGKQSEEMLSNQMLSGIPEVDLGIDTKIKNIERTEEAKQKLITE 204
Query: 195 KRLMGRAKSDFSIPSSYSADYFQRGR----DYAEKLRREHPELYKDRGSQDDGAGSRP-T 249
++ F +P++ + +Y Q R D + ++ PE D +G RP +
Sbjct: 205 LSKKKEKRTSF-VPTNMAVNYVQHKRFMHNDGHKNATKKEPETEAPPLVVGD-SGRRPAS 262
Query: 250 DNSTDAAGSRQAATDQFMLERFRKRER 276
+ + A + +TD F ++FRK R
Sbjct: 263 EVAQHRADNSGKSTDNFHYDKFRKAAR 289
>gi|432116600|gb|ELK37393.1| hypothetical protein MDA_GLEAN10011232 [Myotis davidii]
Length = 289
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/266 (30%), Positives = 135/266 (50%), Gaps = 33/266 (12%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDRGSQDDGAGSRPTDNST 253
+ +P++ + +Y Q R Y E+L +E P+ R G +P + +
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRV----GDTEKPEPDRS 261
Query: 254 DAAGSR---QAATDQFMLERFRKRER 276
R + ATD + E+F+K R
Sbjct: 262 PPNRKRPPNEKATDDYHYEKFKKMNR 287
>gi|351697008|gb|EHA99926.1| hypothetical protein GW7_01886 [Heterocephalus glaber]
Length = 289
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 133/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKINEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|21358507|ref|NP_647643.1| CG7974, isoform A [Drosophila melanogaster]
gi|442629510|ref|NP_001261273.1| CG7974, isoform B [Drosophila melanogaster]
gi|7292132|gb|AAF47544.1| CG7974, isoform A [Drosophila melanogaster]
gi|17861842|gb|AAL39398.1| GM02612p [Drosophila melanogaster]
gi|220943288|gb|ACL84187.1| CG7974-PA [synthetic construct]
gi|220953398|gb|ACL89242.1| CG7974-PA [synthetic construct]
gi|440215140|gb|AGB93968.1| CG7974, isoform B [Drosophila melanogaster]
Length = 294
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 141/294 (47%), Gaps = 44/294 (14%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +EEE +E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDEEE---------KEEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMKQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLK--VKKRNSEESSTQ 163
KY+EQEL K++G + D E+D L + LY +P+HL+ R+ E S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAAEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 222
GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIVAKIRNIEATEEAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFN 245
Query: 223 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 246 IEDNSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 292
>gi|354503906|ref|XP_003514021.1| PREDICTED: uncharacterized protein C9orf78 homolog [Cricetulus
griseus]
gi|344258464|gb|EGW14568.1| Uncharacterized protein C9orf78-like [Cricetulus griseus]
Length = 289
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 132/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMTTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED
Sbjct: 88 DKVSEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|195490506|ref|XP_002093169.1| GE21178 [Drosophila yakuba]
gi|194179270|gb|EDW92881.1| GE21178 [Drosophila yakuba]
Length = 294
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 141/294 (47%), Gaps = 44/294 (14%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +E E +E ++ L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDETE---------KEEQITLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMQQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLK--VKKRNSEESSTQ 163
KY+EQEL K++G + D VE+D L + LY +P+HL+ R+ E S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAVEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 222
GI EV L I K++NIE TE AK KLLQ+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIVAKIRNIEATEEAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFN 245
Query: 223 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 246 IEDSSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 292
>gi|74201040|dbj|BAE37395.1| unnamed protein product [Mus musculus]
Length = 289
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/264 (31%), Positives = 132/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPA----IPSALQ----------SAAAAGGGGLTKVSEKNE 83
RL LEE +Q R+R +G+ A + +Q A G + K+ E+ +
Sbjct: 28 RLKLEETGEVQNLRKRPNGVSAAALLVGEKVQEETTLVDDPFQMATGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED
Sbjct: 88 DKVSEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|321477023|gb|EFX87982.1| hypothetical protein DAPPUDRAFT_305639 [Daphnia pulex]
Length = 305
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 109/233 (46%), Gaps = 22/233 (9%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQ-- 65
KK +K RKR E EE + DE + LEE K LQK RER GI A+ A+
Sbjct: 18 KKPSRKPMRKRL--EIEEDDDAGGSDELDVLSKLEETKELQKLRERPHGISAVALAIGKR 75
Query: 66 -------------SAAAAGGGGLTKVSEKNEGDGEKD---ELVLQDTFAQETAVMVEDPN 109
G + V + D E + F+ ET ED
Sbjct: 76 ITVEEEVTVNDPFKVTTGGMADMKAVKAGKQNSSSVDDAYETGIGTQFSVETNTRDEDAE 135
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKK--RNSEESSTQWTTG 167
M+KY+E++LAK++G + D+ L E +PE+L+VK ++ E S Q +G
Sbjct: 136 MMKYIEEQLAKRKGLMQEDEDKSNKYLTPEEIAFSSVPEYLRVKSSVQSEEMLSNQMLSG 195
Query: 168 IAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 220
I EV L IE K+KNIE TE AK+ L ++RL + +P++ + ++ Q R
Sbjct: 196 IPEVDLGIEAKIKNIEATEEAKQKLLQERLRKKDGPSMFVPTNMAVNFVQHNR 248
>gi|221131461|ref|XP_002156012.1| PREDICTED: uncharacterized protein C9orf78 homolog [Hydra
magnipapillata]
Length = 292
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 130/275 (47%), Gaps = 49/275 (17%)
Query: 40 ALEEIKFLQKQRERKSGIPA--IPSALQSAA------------AAGGGGLTKVSEKNEGD 85
++E+ K LQK R R G+ + S L + + GGGL +
Sbjct: 24 SIEDRKELQKFRSRPKGVSVEVLASLLDTVSQNKEKTNDDPFKLNSGGGLV------DNG 77
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLK--HAEDEL 143
+D F+ ET ED +LKY+E+ L KKRG VN + D K ED L
Sbjct: 78 KSRDLSNFGTNFSTETNQRDEDKQLLKYIEEGLMKKRG----VNQQENPDTKVLSKEDLL 133
Query: 144 YKIPEHLKVKKR--NSEES-STQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
Y++PE+LKV+ + SEE S+Q GI EV L IE K+KNIE TE AK K+++E +
Sbjct: 134 YQLPENLKVQSKIMKSEEMLSSQVLCGIPEVDLGIEAKIKNIEATEEAKMKMIEESKNRK 193
Query: 200 RAKSDFSIPSSYSADYFQRGRDY----------AEKLRREHPELYKDRGSQDDG------ 243
+ S+F +P++ ++++ R Y E+ R + D+G G
Sbjct: 194 QQASEF-VPTNMASNFMHHSRFYDEKKAIEKEKKEEKERLENAVVIDKGPTVGGDIIENA 252
Query: 244 AGSRPTDNSTDAAGSR--QAATDQFMLERFRKRER 276
S+ N+ + G R + +D FM E F+KR R
Sbjct: 253 EDSKFIRNTMSSGGKRNKKGTSDDFMFESFKKRAR 287
>gi|196010774|ref|XP_002115251.1| hypothetical protein TRIADDRAFT_50676 [Trichoplax adhaerens]
gi|190582022|gb|EDV22096.1| hypothetical protein TRIADDRAFT_50676 [Trichoplax adhaerens]
Length = 270
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 140/286 (48%), Gaps = 40/286 (13%)
Query: 12 KKNFRKR---SYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPA-------IP 61
K+N+RKR S ++E+ +SD ++ K LQK R + GI + +P
Sbjct: 3 KRNYRKRRDSSDDDEKVGDESISD-------VIKRAKELQKFRAKPRGIDSSELDKGNVP 55
Query: 62 SAL----QSAAAAGGGGLTKVSEKNEG--DGEKDELVLQDTFAQETAVMVEDPNMLKYVE 115
+ GGL + +G D D++ L F+ ET ED M+KY+E
Sbjct: 56 DVEVPEDEDPFKLKTGGLIDMDHAKKGGVDEMGDKISLGKNFSAETNTFDEDAAMMKYIE 115
Query: 116 QELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQL 173
ELAKK+G + +D K ED LY++PE+L++ K++ E S Q +GI E+ L
Sbjct: 116 VELAKKKGV-VSQDDEDSRSGKVLEDSLYELPENLRITSAKKSEEMLSNQMLSGIPEIDL 174
Query: 174 PIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFS--IPSSYSADYFQRGRDYAEKLRREH 230
I+ KL+NIE TE AK ++L ++R R K++ S +P + + +Y Q R Y + + E
Sbjct: 175 GIDAKLRNIEATENAKLEMLMKRR---RKKNEISSMVPINIAVNYVQHTR-YVD-IDVEE 229
Query: 231 PELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
+ + + D + N + ATD + E+FRK+ R
Sbjct: 230 EVISRKTATNDRATTQKRRHN------WKNTATDDYHFEKFRKQMR 269
>gi|195336668|ref|XP_002034957.1| GM14436 [Drosophila sechellia]
gi|194128050|gb|EDW50093.1| GM14436 [Drosophila sechellia]
Length = 294
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 141/294 (47%), Gaps = 44/294 (14%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +EEE +E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDEEE---------KEEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMKQLKSGKIKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLK--VKKRNSEESSTQ 163
KY+EQEL K++G + D E+D L + LY +P+HL+ R+ E S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAAEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDY 222
GI EV L I K++NIE TE AK KL+Q+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIVAKIRNIEATEEAKQKLMQDAKNKKDGPSQF-VPTNMAVNFMQHNRFN 245
Query: 223 AEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 246 IEDNSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 292
>gi|301758846|ref|XP_002915284.1| PREDICTED: uncharacterized protein C9orf78-like [Ailuropoda
melanoleuca]
Length = 385
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 124 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 183
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 184 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 241
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++
Sbjct: 242 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKK 301
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 302 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 361
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 362 KRPANEK--ATDDYHYEKFKKMNR 383
>gi|197102566|ref|NP_001125335.1| uncharacterized protein C9orf78 homolog [Pongo abelii]
gi|75042142|sp|Q5RC87.1|CI078_PONAB RecName: Full=Uncharacterized protein C9orf78 homolog
gi|55727739|emb|CAH90620.1| hypothetical protein [Pongo abelii]
Length = 289
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 29/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLT---KVSEKNE 83
RL LEE + +Q R+R +G+ A+ + + GG+ K+ E+ +
Sbjct: 28 RLKLEETREVQNLRKRPNGVSAVALLVGEKVQEETTLVDDPFQMKTGGMVDMKKLKERGK 87
Query: 84 GD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED
Sbjct: 88 DKISEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDC 145
Query: 143 LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMG 199
LY++PE+++V K+ E S Q +GI EV I+ K+KNI TE AK +LL E++
Sbjct: 146 LYELPENIRVSSAKKTEEMLSNQMLSGIPEVDQGIDAKIKNIISTEDAKARLLAEQQNKK 205
Query: 200 RAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 206 KDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNR 265
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 266 KRPANEK--ATDDYHYEKFKKMNR 287
>gi|380018473|ref|XP_003693152.1| PREDICTED: uncharacterized protein C9orf78-like [Apis florea]
Length = 296
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 43/266 (16%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GGGGLTKVSEKNEGDGEKD 89
+EE+K +QK RER G+ + AL GG + + KN + D
Sbjct: 42 VEEMKIIQKLRERPKGVNVVGLALGENVTPDVMMSDPFNVKTGGMVNMAALKNTKLKQND 101
Query: 90 --ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH--------A 139
E + F ET ED M+KY+E+EL+K++ KN D+ EN L +
Sbjct: 102 AYETGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKN---EDKTENGLNNDKGSYCSPE 158
Query: 140 EDELYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 196
E L +PEHL+ R+ E S Q +GI EV L IE K++NIE TE AK KLL ++
Sbjct: 159 EAALQAVPEHLRQSSAHRSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRH 218
Query: 197 LMGRAKSDFSIPSSYSADYFQRGR------DYAEKLRREHPELYKDRGSQDDGAGSRPTD 250
S F +P++ + ++ Q R D+ +K +++ + K +DD G R D
Sbjct: 219 RKKDGPSQF-VPTNMAVNFVQHNRFNIEDTDF-QKSKQDSDDRKKIIAPRDDYKGKR-KD 275
Query: 251 NSTDAAGSRQAATDQFMLERFRKRER 276
N + ATD + ERF+K+ R
Sbjct: 276 NG-------EKATDDYHYERFKKQFR 294
>gi|427797465|gb|JAA64184.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 326
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 114/231 (49%), Gaps = 26/231 (11%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEE--RRLALEEIKFLQKQRERKSGIPAI--------- 60
K +KR + ++++ DEEE R L++ K +QK R+R +G+ I
Sbjct: 22 KSGLKKRKCFRQHKSSDGSESDEEEGVSREILQDTKEIQKLRKRPNGVSVIGLNLGKKLT 81
Query: 61 ---------PSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNML 111
P L++ L E E D + L +TF+ ET ED +M+
Sbjct: 82 PKEELVIDDPFKLKTGGMIDMKALKGKRVTME---ELDAVNLGNTFSVETNQRDEDADMM 138
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK--VKKRNSEESSTQWTTGIA 169
KY+E+ELAK+RG+ + +N + +D L+ +PEHL+ K++ E S Q +GI
Sbjct: 139 KYIEEELAKRRGRVQEPQPTPQNTVDE-KDVLFHVPEHLRKSTSKKSEEMLSNQMLSGIP 197
Query: 170 EVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 220
EV L IE +++NIE TE AK L R+ + + +P++ + ++ Q R
Sbjct: 198 EVDLGIEERIRNIEATEEAKLKLIRDRMARKERETSFVPTNMAVNFVQHNR 248
>gi|194864932|ref|XP_001971179.1| GG14814 [Drosophila erecta]
gi|190652962|gb|EDV50205.1| GG14814 [Drosophila erecta]
Length = 294
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/236 (32%), Positives = 116/236 (49%), Gaps = 33/236 (13%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +E ET E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDETET---------EEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMQQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRG---KNIDVNDRVENDLKHAED-ELYKIPEHLK--VKKRNSEESSTQWT 165
KY+EQEL K++G +++ +D N ED LY +P+HL+ R+ E S Q
Sbjct: 129 KYIEQELQKRKGGGTEDVPEDDGDMNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQML 188
Query: 166 TGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 220
GI EV L I K++NIE TE AK KL+Q+ + S F +P++ + ++ Q R
Sbjct: 189 NGIPEVDLGIVAKIRNIEATEEAKQKLMQDAKNKKDGPSQF-VPTNMAVNFMQHNR 243
>gi|449266760|gb|EMC77776.1| hypothetical protein A306_15006, partial [Columba livia]
Length = 275
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 132/264 (50%), Gaps = 35/264 (13%)
Query: 35 EERRLALEEIKFLQKQRERKSGIPAIP-----------SALQSAAAAGGGGLTKVSEKNE 83
EE RL LEE K +Q R+R +G+ A+ + + GG+ + + E
Sbjct: 23 EEVRLKLEEAKEVQSLRKRPNGVSAVALLVGEKVQEEATLVDDPFKMKSGGMVDMKKLKE 82
Query: 84 GD----GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHA 139
E+++L L +F+ ET ED +M+KY+E EL K++G I N+ + LK+A
Sbjct: 83 RGKDRINEEEDLNLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVENEEQKVKLKNA 140
Query: 140 EDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLM 198
ED LY++PE+++V SS + T E+ L I+ K+KNI TE AK KLL E++
Sbjct: 141 EDSLYELPENIRV-------SSAKKTE---EMLLGIDAKIKNIISTEEAKAKLLAEQQNK 190
Query: 199 GRAKSDFSIPSSYSADYFQRGRDYAEKLR---REHPELYKDRGSQDDGAGSRPTDNSTDA 255
+ +P++ + +Y Q R Y E+L R + E K R + G RP +
Sbjct: 191 KKDSETSFVPTNMAVNYVQHNRFYHEELNAPVRRNKEEPKPRPLR-VGDTERPEPERSPP 249
Query: 256 AGSR---QAATDQFMLERFRKRER 276
R + ATD + E+F+K R
Sbjct: 250 NRKRPLNEKATDDYHYEKFKKMNR 273
>gi|350396974|ref|XP_003484725.1| PREDICTED: uncharacterized protein C9orf78-like [Bombus impatiens]
Length = 296
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 139/299 (46%), Gaps = 40/299 (13%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL 64
I KK+ +K RKR +E+ ++E R +EE+K +QK RER GI + AL
Sbjct: 9 IEFKKKSRKPIRKRQVSSDEDDN---ENEEASVREKVEEMKTIQKLRERPKGINVVGLAL 65
Query: 65 QSAAAA-----------GGGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNML 111
GG + KN + D E + F ET ED M+
Sbjct: 66 GENVTPDVMTSDPFNVKTGGMVNMTVLKNTKLKQNDAYETGIGTQFNAETNKRDEDEEMV 125
Query: 112 KYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLK--VKKRNSEESSTQW 164
KY+E+EL+K++ K + N+ K + E L +PEHL+ R+ E S Q
Sbjct: 126 KYIEEELSKRKSKTEGTTENGSNNDKGSYCSPEEAALQAVPEHLRQSSAHRSEEMLSNQM 185
Query: 165 TTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR--- 220
+GI EV L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R
Sbjct: 186 LSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNRFNI 244
Query: 221 ---DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
D+ +K +++ E K +DD R DN + ATD + ERF+K+ R
Sbjct: 245 EDTDF-QKSKQDSDERKKVAAPRDDYKSKR-KDNG-------EKATDDYHYERFKKQFR 294
>gi|54400388|ref|NP_001005945.1| chromosome 9 open reading frame 78 [Danio rerio]
gi|53734462|gb|AAH83464.1| Zgc:103692 [Danio rerio]
gi|148725502|emb|CAN88766.1| novel protein (zgc:103692) [Danio rerio]
Length = 289
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 129/264 (48%), Gaps = 28/264 (10%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSALQS---------------AAAAGGGGLTKVSEKN 82
R L+E K LQ R+R+ G+ +I + L G + KV +++
Sbjct: 27 RSKLDEAKELQSLRKRQHGV-SIATLLVGEKLPLEAELEDDPFKLKTGGVVDMKKVKDRS 85
Query: 83 -EGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED 141
+ ++++L L +F+ ET ED +M+KY+E EL KK+G +V+ +K+ ED
Sbjct: 86 RDMTVDENDLNLGTSFSAETNRRDEDADMMKYIETELKKKKGMVEAEEQKVK--VKNPED 143
Query: 142 ELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLM 198
LY++PE++ V K+ E S Q +GI EV L I+ K+KNI TE AK KLL E+R
Sbjct: 144 LLYELPENINVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIINTEEAKAKLLAEQRNK 203
Query: 199 GRAKSDFSIPSSYSADYFQRGRDYAE------KLRREHPELYKDRGSQDDGAGSRPTDNS 252
+ +P++ + +Y Q R Y E + RE P+ R + + +
Sbjct: 204 KKDSGTSFVPTNIAVNYVQHNRFYHEDSNAPQRRNREEPKARPLRVGDTEKPAPEASPPN 263
Query: 253 TDAAGSRQAATDQFMLERFRKRER 276
+ + ATD + E+F+K R
Sbjct: 264 FRKRPNNEKATDDYHYEKFKKMNR 287
>gi|391325854|ref|XP_003737442.1| PREDICTED: uncharacterized protein C9orf78 homolog [Metaseiulus
occidentalis]
Length = 277
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 110/221 (49%), Gaps = 23/221 (10%)
Query: 27 TNKLSDDEEER--RLALEEIKFLQKQRERKSGIPAIPSALQSAAAA------------GG 72
T L DDE++ R LE+ K LQK R+R G+ L A G
Sbjct: 37 TTPLIDDEDDHVDRSVLEDTKELQKLRKRPHGVSVEALILGKPVADTEEKVSDPFKIDSG 96
Query: 73 GGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRV 132
GGLT + + + +V+ + FA ET ED +M+KY+E EL K++G
Sbjct: 97 GGLTDMKASS-----TETIVIGNQFASETNERDEDADMMKYIEAELKKRQGTQQQTEAEA 151
Query: 133 ENDLKHAEDELYKI-PEHLKVKK--RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 189
+ +ED L +I P HL+ + +N E S Q GI EV L +E +++NIE TE AK
Sbjct: 152 KPLSLKSEDLLMQILPNHLERSQGQKNEEMLSNQMLAGIPEVDLGMEERIRNIEATEEAK 211
Query: 190 KLLQEKRLMGRAKSDFSIPSSYSADY-FQRGRDYAEKLRRE 229
+ +R+ G+ K +P++ S ++ Q+ + E++RRE
Sbjct: 212 MKMLHERMSGKRKETSLVPTNISVNFESQQKKPKKEQVRRE 252
>gi|7106830|gb|AAF36140.1|AF151054_1 HSPC220 [Homo sapiens]
Length = 176
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 95/165 (57%), Gaps = 7/165 (4%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 16 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 73
Query: 146 IPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 202
+PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 74 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 133
Query: 203 SDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR 247
+P++ + +Y Q R Y E+L H E K R +Q SR
Sbjct: 134 ETSFVPTNMAVNYVQHNRFYHEELNCAHTE--KQRRAQGPALESR 176
>gi|119608315|gb|EAW87909.1| chromosome 9 open reading frame 78, isoform CRA_a [Homo sapiens]
Length = 219
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 14/201 (6%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 21 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 78
Query: 146 IPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 202
+PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 79 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 138
Query: 203 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 255
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 139 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 198
Query: 256 AGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 199 ANEK--ATDDYHYEKFKKMNR 217
>gi|320169949|gb|EFW46848.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 388
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 133/281 (47%), Gaps = 57/281 (20%)
Query: 31 SDD----EEERRLALEEIKFLQKQRERKSGI-PA-------IPSALQSAAAA-------G 71
SDD EE R LE +K LQ+ R+RKSG+ PA + S + AA
Sbjct: 131 SDDADASEESHRERLERMKELQRFRQRKSGVTPAGLALGQRVKSVAEELLAASDPFKLKS 190
Query: 72 GGGL-------TKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK 124
GGGL K +++ + L TF+ ET ED ML Y++++LA +RG
Sbjct: 191 GGGLVDKAAIRVKDRDRDRDADDDKNFSLTSTFSHETKARDEDKMMLSYIDEQLAIRRGT 250
Query: 125 NIDVNDRVENDLKHAEDELYKIPEHLKVK---KRNSEES-STQWTTGIAEVQLPIEYKLK 180
N + N+ +LY +P++L+ K SE+S S+ +GI EV L ++ +++
Sbjct: 251 N---ANDNANNANDPTAQLYVVPKNLEATSALKNVSEDSISSALLSGIPEVDLGVQSRIR 307
Query: 181 NIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYK--DRG 238
NIEETE A+ L++KRL G S+ + H + DR
Sbjct: 308 NIEETEKARIELEQKRLSGNQSSNVVL---------------------HHHRFFNTADRS 346
Query: 239 SQDDGAGSRPTDNSTDAAGSRQA-ATDQFMLERFRKRERHR 278
+ D AG++ + +A +RQ ATDQ + ++F+K + R
Sbjct: 347 DRSDSAGAQQHGSQNSSAAARQPRATDQLVFDKFKKAQLGR 387
>gi|281349488|gb|EFB25072.1| hypothetical protein PANDA_003241 [Ailuropoda melanoleuca]
Length = 201
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 14/201 (6%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 3 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 60
Query: 146 IPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 202
+PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 61 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 120
Query: 203 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 255
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 121 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 180
Query: 256 AGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 181 ANEK--ATDDYHYEKFKKMNR 199
>gi|349603477|gb|AEP99304.1| Uncharacterized protein C9orf78-like protein, partial [Equus
caballus]
Length = 253
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 14/201 (6%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 55 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 112
Query: 146 IPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 202
+PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 113 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 172
Query: 203 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 255
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 173 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 232
Query: 256 AGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 233 ANEK--ATDDYHYEKFKKMNR 251
>gi|6808233|emb|CAB70805.1| hypothetical protein [Homo sapiens]
Length = 241
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 14/201 (6%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 43 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 100
Query: 146 IPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 202
+PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 101 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 160
Query: 203 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 255
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 161 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 220
Query: 256 AGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 221 ANEK--ATDDYHYEKFKKMNR 239
>gi|444517774|gb|ELV11788.1| hypothetical protein TREES_T100018213 [Tupaia chinensis]
Length = 269
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 14/201 (6%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 71 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 128
Query: 146 IPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 202
+PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 129 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 188
Query: 203 SDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDA 255
+P++ + +Y Q R Y E+L +E P+ R G + R N
Sbjct: 189 ETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRP 248
Query: 256 AGSRQAATDQFMLERFRKRER 276
A + ATD + E+F+K R
Sbjct: 249 ANEK--ATDDYHYEKFKKMNR 267
>gi|346468277|gb|AEO33983.1| hypothetical protein [Amblyomma maculatum]
Length = 336
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 80/136 (58%), Gaps = 2/136 (1%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKI 146
E D + L +TF+ ET ED +M+KY+E+ELAK+RG+ + + + +D L+ +
Sbjct: 123 ELDAVNLGNTFSVETNQRDEDADMMKYIEEELAKRRGRVQETPAEEKTQVVDEKDVLFHV 182
Query: 147 PEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 204
PEHL+ K++ E S Q +GI EV L IE +++NIE TE AK L +R+ + +
Sbjct: 183 PEHLRKSTSKKSEEMLSNQMLSGIPEVDLGIEERIRNIEATEEAKLKLIRERMARKERET 242
Query: 205 FSIPSSYSADYFQRGR 220
+P++ + ++ Q R
Sbjct: 243 SFVPTNMAVNFVQHNR 258
>gi|240849477|ref|NP_001155632.1| uncharacterized protein LOC100164612 [Acyrthosiphon pisum]
gi|239792059|dbj|BAH72414.1| ACYPI005606 [Acyrthosiphon pisum]
Length = 321
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 125/267 (46%), Gaps = 42/267 (15%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAA------------AGGGGLTKVSEKNEGD--- 85
LEE+K +QK R+R +G+ I AL + GGL ++ G
Sbjct: 64 LEEMKTMQKLRDRPNGVNIISLALGEKLSQEEEKLMVDPFKVKTGGLINMNALKTGQVTQ 123
Query: 86 -GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVND--RVENDLKHAEDE 142
+ + + F+ ET ED M+KY+++++A + G+ +D++D N + E
Sbjct: 124 VDDAYDTGIGTQFSAETNKRDEDEEMMKYIDEQVAVRTGRTVDIDDDNVSLNKSNYCPPE 183
Query: 143 LYK---IPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKR 196
L +P HL+ R+ E S Q +GI EV L I+ K+KNIE TE AK KL+++KR
Sbjct: 184 LAALQAVPSHLRNSTTHRSEEMLSNQMLSGIPEVDLGIDAKIKNIEATEEAKMKLIRDKR 243
Query: 197 LMGRAKSDFSIPSSYSADYFQRGRDYAE-------KLRREHPELYKDRGSQDDGAGSRPT 249
S F +P++ + ++ Q R E K ++ P + +D + + +
Sbjct: 244 NKKDGPSQF-VPTNMAVNFVQHNRFNIEITGPDGKKNYKQQPAVKQDHSDEKNIDKRKKK 302
Query: 250 DNSTDAAGSRQAATDQFMLERFRKRER 276
DN ATD F ERF+K+ R
Sbjct: 303 DN----------ATDDFHYERFKKQFR 319
>gi|340716340|ref|XP_003396657.1| PREDICTED: uncharacterized protein C9orf78-like isoform 1 [Bombus
terrestris]
gi|340716342|ref|XP_003396658.1| PREDICTED: uncharacterized protein C9orf78-like isoform 2 [Bombus
terrestris]
Length = 296
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 135/292 (46%), Gaps = 40/292 (13%)
Query: 12 KKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAA- 70
+K RKR +E+ ++E R +EE+K +QK RER GI + AL
Sbjct: 16 RKPIRKRQVSSDEDDN---ENEEASVREKVEEMKTIQKLRERPKGINVVGLALGENVTPD 72
Query: 71 ----------GGGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQEL 118
GG + KN + D E + F ET ED M+KY+E+EL
Sbjct: 73 VMTSDPFNVKTGGMVNMTVLKNTKLKQNDAYETGIGTQFNAETNKRDEDEEMVKYIEEEL 132
Query: 119 AKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLK--VKKRNSEESSTQWTTGIAEV 171
+K++ K + N+ K + E L +PEHL+ R+ E S Q +GI EV
Sbjct: 133 SKRKSKTEGTTENGSNNDKGSYCSPEEAALQAVPEHLRQSSAHRSEEMLSNQMLSGIPEV 192
Query: 172 QLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR------DYAE 224
L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R D+ +
Sbjct: 193 DLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNRFNIEDTDF-Q 250
Query: 225 KLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
K +++ E K +DD R DN + ATD + ERF+K+ R
Sbjct: 251 KSKQDSDERKKVAAPRDDYKSKR-KDNG-------EKATDDYHYERFKKQFR 294
>gi|403298580|ref|XP_003940093.1| PREDICTED: uncharacterized protein C9orf78-like [Saimiri
boliviensis boliviensis]
Length = 358
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 108/200 (54%), Gaps = 14/200 (7%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKI 146
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY++
Sbjct: 78 EEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYEL 135
Query: 147 PEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKS 203
PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 136 PENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSE 195
Query: 204 DFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-GSQDDGAGSRPTDNSTDAA 256
+P++ + +Y Q R Y E+L +E P+ R G + R N A
Sbjct: 196 TSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVGDTEKPEPERSPPNRKRPA 255
Query: 257 GSRQAATDQFMLERFRKRER 276
+ ATD + E+F+K R
Sbjct: 256 NEK--ATDDYHYEKFKKMNR 273
>gi|48772899|gb|AAT46619.1| hepatocellular carcinoma-associated antigen 59 [Homo sapiens]
Length = 195
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/144 (37%), Positives = 87/144 (60%), Gaps = 5/144 (3%)
Query: 86 GEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYK 145
E+++L L +F+ ET ED +M+KY+E EL K++G I ++ + K+AED LY+
Sbjct: 12 SEEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEHEEQKVKPKNAEDCLYE 69
Query: 146 IPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAK 202
+PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 70 LPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDS 129
Query: 203 SDFSIPSSYSADYFQRGRDYAEKL 226
+P++ + +Y Q R Y E+L
Sbjct: 130 ETSFVPTNMAVNYVQHNRFYHEEL 153
>gi|428169346|gb|EKX38281.1| hypothetical protein GUITHDRAFT_115622 [Guillardia theta CCMP2712]
Length = 299
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 83/158 (52%), Gaps = 18/158 (11%)
Query: 139 AEDELYKIPEHLKVKKRNSE--ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
AEDELY IPE +VK + + +++ W TGI EV+LP+E KLKNIEETE AKK + E+R
Sbjct: 142 AEDELYVIPEEYRVKSKTRQLGDAAETWLTGIVEVELPLEEKLKNIEETEKAKKKILEER 201
Query: 197 LMGRAKSDFSIPSSYSADYFQ---RGRDYAEKLRREHPELYKDRGSQDDGAGSRPTD--- 250
+ G +S F + ++ Y + R + + +++ L G G+ +
Sbjct: 202 INGVRQSTFVVDTASEKGYMRMEGEMRKGKKVMAKQNIPLISKEAEAVGGVGNFNANYIK 261
Query: 251 -NSTDAAGSRQAA---------TDQFMLERFRKRERHR 278
N + G AA +D ++ERF+KR +HR
Sbjct: 262 RNGKEREGRPAAASEVKRPDLSSDDLVMERFKKRLKHR 299
>gi|334311928|ref|XP_001369366.2| PREDICTED: uncharacterized protein C9orf78-like [Monodelphis
domestica]
Length = 240
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 95/178 (53%), Gaps = 22/178 (12%)
Query: 13 KNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA---- 68
K FRKR + E E+ + D EE RL LEE K +Q R R +G+ A+ +
Sbjct: 5 KTFRKRRDDSESESDEQ---DSEEVRLKLEETKEVQSLRRRPNGVSAVALLVGEKVQEET 61
Query: 69 ----------AAGGGGLTKVSEKNEGD-GEKDELVLQDTFAQETAVMVEDPNMLKYVEQE 117
A G + K+ E+N+ E+++L L +F+ ET ED +M+KY+E E
Sbjct: 62 TLVDDPFKIKAGGMVDMKKLKERNKDRINEEEDLNLGTSFSAETNRRDEDADMMKYIETE 121
Query: 118 LAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQL 173
L K++G I N+ + LK+AED LY++PE+++V K+ E S Q +GI EV L
Sbjct: 122 LKKRKG--IVENEEQKVKLKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDL 177
>gi|332375184|gb|AEE62733.1| unknown [Dendroctonus ponderosae]
Length = 290
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 136/297 (45%), Gaps = 54/297 (18%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLA--LEEIKFLQKQRERKSGIPAIPSALQ 65
K KK+N R++ + SDDEE +++ L E+K Q R+R G+ I AL
Sbjct: 19 KSVKKRNLRQKVKSD--------SDDEETAQISNKLGEMKERQNLRKRPHGVSVIGLALG 70
Query: 66 SAAAAGGGGLTKVSEKNEGDGEKDELVLQ-----------DT-----FAQETAVMVEDPN 109
+ +AG +K K E G + L+ DT F+ ET ED
Sbjct: 71 TKFSAGDEASSKDPFKVEAGGMVNMQALKSGKVKQVDDAYDTGIGTQFSVETNKRDEDEE 130
Query: 110 MLKYVEQELAKKRGK----NIDVNDRVENDLKHAEDELYKIPEHLK--VKKRNSEESSTQ 163
M+K++E EL+KK+GK + + + L E L +P+HL+ KR+ E S Q
Sbjct: 131 MMKFIENELSKKKGKVGQEEPILPTKKSSYLSPEEAALQAVPDHLRESSTKRSEEMLSNQ 190
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR-- 220
+GI EV L IE K+KNIE TE AK +LL E + S F +P++ + ++ Q R
Sbjct: 191 MLSGIPEVDLGIEAKIKNIEATEEAKLRLLWESQNKKNGPSQF-VPTNMAVNFVQHKRYN 249
Query: 221 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
D AE R++ T+ + ATD + E+F+K+ R
Sbjct: 250 NDRAEMARKKA-----------------KTEVEDKQKKKDEKATDDYHFEKFKKQFR 289
>gi|345495075|ref|XP_001606209.2| PREDICTED: uncharacterized protein C9orf78-like [Nasonia
vitripennis]
Length = 297
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 139/297 (46%), Gaps = 38/297 (12%)
Query: 5 IPQKKEKKKNFRKRSYEEEEETTNKLSDDEEER--RLALEEIKFLQKQRERKSGIP---- 58
I KK+ +K RKR +E + DEEE R LEE+K LQ+ RER G+
Sbjct: 12 IEFKKKSRKPLRKRRASSDESNS-----DEEETGVRSKLEELKTLQRLRERPKGVNIAGL 66
Query: 59 AIPSALQSAAAAG------GGGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNM 110
A+ + + AA GG+ ++ + D + + F ET ED M
Sbjct: 67 ALGEVVNDSIAASDPFNVKTGGMVNMAALKNVSKQDDAYDTGIGTQFNAETNKRDEDEEM 126
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLKVKKRNSEES--STQ 163
+KY+E++L+K++ KN + N K E L +PEHL+ + E S Q
Sbjct: 127 VKYIEEQLSKRKNKNNGEKEDESNKNKPTYCSPEEAALQAVPEHLRQSSTHKSEEMLSNQ 186
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR-- 220
+GI EV L IE K++NIE TE AK KLL ++ S F +P++ + ++ Q R
Sbjct: 187 MLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNFVQHNRFN 245
Query: 221 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
+ A+ +R+ K +Q + + DN + ATD + ERF+K+ R
Sbjct: 246 IEDADAQKRKQDAAAKRHAAQASHSKEKRKDND-------EKATDDYHYERFKKQFR 295
>gi|56757920|gb|AAW27100.1| SJCHGC04993 protein [Schistosoma japonicum]
Length = 312
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 106/213 (49%), Gaps = 29/213 (13%)
Query: 39 LALEEIKFLQKQRERKSGI-----------PAIPSALQS----AAAAGGGGLTKVSEKNE 83
+ + I+ LQK R+R +GI P + A+ + G L +S +
Sbjct: 39 VVVGAIRELQKLRKRPAGISLSALATGKEVPDVNLAIANDPFKLKTGGLVDLNSISSTKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+ + D E L TFA ET ED M++Y+E+ELAK++G DR E+ D
Sbjct: 99 AEEDDDVEAHLAKTFATETNKRDEDAEMIRYIEEELAKRKGLTKPSLDRAED-----SDL 153
Query: 143 LYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL--- 197
L +PE+LK + ++ + S Q GI E+ L +E K+KNIE TE AK++L +KR
Sbjct: 154 LQDVPEYLKPSIGQQKEDMLSNQMLCGIPEIDLGVEAKMKNIEATEEAKQILLKKRFNRK 213
Query: 198 MGRAKSDFSIPSSYSADYFQRGR--DYAEKLRR 228
G + + + P + + ++ Q R Y +++ R
Sbjct: 214 HGHSVDEIA-PINMALNFVQHSRWNSYTDRVSR 245
>gi|226469096|emb|CAX70027.1| hypothetical protein [Schistosoma japonicum]
Length = 312
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 105/213 (49%), Gaps = 29/213 (13%)
Query: 39 LALEEIKFLQKQRERKSGIPA--------IPSALQSAA-------AAGGGGLTKVSEKNE 83
+ + I+ LQK R+R +GI +P + A G L +S +
Sbjct: 39 VVVGAIRELQKLRKRPAGISLSALATGKEVPDVNLAIANDPFKLKTGGLVDLNSISSTKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+ + D E L TFA ET ED M++Y+E+ELAK++G DR E+ D
Sbjct: 99 AEEDDDVEAHLAKTFATETNKRDEDAEMIRYIEEELAKRKGLTKPSLDRAED-----SDL 153
Query: 143 LYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL--- 197
L +PE+LK + ++ + S Q GI E+ L +E K+KNIE TE AK++L +KR
Sbjct: 154 LQDVPEYLKPSIGQQKEDMLSNQMLCGIPEIDLGVEAKMKNIEATEEAKQILLKKRFNRK 213
Query: 198 MGRAKSDFSIPSSYSADYFQRGR--DYAEKLRR 228
G + + + P + + ++ Q R Y +++ R
Sbjct: 214 HGHSVDEIA-PINMALNFVQHSRWNSYTDRVSR 245
>gi|157127876|ref|XP_001655062.1| hypothetical protein AaeL_AAEL010964 [Aedes aegypti]
gi|108872762|gb|EAT36987.1| AAEL010964-PA [Aedes aegypti]
Length = 289
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 136/297 (45%), Gaps = 51/297 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLA-LEEIKFLQKQRERKSGIPAIPSAL-- 64
K + +K FRKR E+E D +E L+ LEE K QK R + +G+ + A+
Sbjct: 14 KSKARKQFRKRIKSEDE-------DKPDEDILSKLEETKEKQKLRNKPNGVNILTLAVGK 66
Query: 65 ----------QSAAAAGGGGLTKV----SEKNEGDGEKDELVLQDTFAQETAVMVEDPNM 110
+ A GG+ + S K + + + + F+ ET ED M
Sbjct: 67 KITVEEEVTNKDLFNAKAGGMVNMQALKSGKIKAVDDAYDTGIGTQFSAETNKRDEDEEM 126
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHA-----EDELYKIPEHLK--VKKRNSEESSTQ 163
+KY+E++L+KK+G D E + H E L +P HL +R+ E S Q
Sbjct: 127 MKYIEEQLSKKKGVAKDTTKEPEAESSHKYLSPEEAALLSLPAHLSQTSTQRSEEMLSNQ 186
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR-- 220
+GI EV L IE K+KNIE TE AK K LQE++ S F +PS+ + ++ Q R
Sbjct: 187 MLSGIPEVDLGIEAKIKNIEATEDAKIKFLQEQQRKKDLPSHF-VPSNMAVNFMQHNRFK 245
Query: 221 -DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
D + +R + E+ + R G P + ATD + ++F+K+ R
Sbjct: 246 IDQPVQQKRRYTEVQQHRS----GDEKIP-----------KKATDDYHFDKFKKQYR 287
>gi|340378972|ref|XP_003388001.1| PREDICTED: uncharacterized protein C9orf78 homolog, partial
[Amphimedon queenslandica]
Length = 237
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 108/238 (45%), Gaps = 52/238 (21%)
Query: 89 DELV--LQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVND-----------RVEND 135
DE+V L F+ ET ++ +MLKY++ E+A+++GK + R E
Sbjct: 4 DEVVKRLTSQFSAETQTRDDETHMLKYIDDEIARRKGKQDEETLQLYLKLLPLFYRYEAK 63
Query: 136 LKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQ 193
+ LYKIPE +V+ KR+ + S Q +GI EV L ++ K KNIEETE AKK +
Sbjct: 64 IA----SLYKIPEKYQVEDSKRSEDMLSNQMLSGIPEVDLGLDAKFKNIEETEIAKKKMA 119
Query: 194 EKRLMGRAKSDFSIPSSYSADYFQ----------------------RGRDYAEKLRREHP 231
E +L + K IP+++++++ R ++ E R+ P
Sbjct: 120 EDKLKMKDKQTSMIPTNFASNFTHHSLRFFKDRGRGHHRRGGGGGKRSQEEEETESRDQP 179
Query: 232 ELYKDRGS----------QDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER-HR 278
GS +D+G G T A Q TD + ++FRK+ + HR
Sbjct: 180 SFIPVVGSFDEPELKPTTRDEGGGGPNTKKRKPGADHSQLPTDDYHFDKFRKKAKSHR 237
>gi|91078372|ref|XP_974116.1| PREDICTED: similar to CG7974 CG7974-PA [Tribolium castaneum]
gi|270003886|gb|EFA00334.1| hypothetical protein TcasGA2_TC003173 [Tribolium castaneum]
Length = 299
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 108/213 (50%), Gaps = 26/213 (12%)
Query: 32 DDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDEL 91
+D EE LEE+K LQ R+R G+ A+ AL + ++K K + G +
Sbjct: 41 EDLEEVSTKLEEMKELQNLRKRPHGVNALGLALGTKITIEDECISKDPFKVKSGGMVNMQ 100
Query: 92 VLQ-----------DT-----FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVEND 135
L+ DT F+ ET ED M+K++E+EL+KK+ K ++ ++ E +
Sbjct: 101 ALKSGKVKQVDDAYDTGIGTQFSVETNKRDEDEEMMKFIEEELSKKKRK-VEPQEQAEAE 159
Query: 136 LKHA-----EDELYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAA 188
K A E L +P+HL+ KR+ E S Q GI EV L IE K+KNIE TE A
Sbjct: 160 NKSAYTSPEEAALRAVPDHLRESSTKRSEEMLSNQMLNGIPEVDLGIEAKIKNIEATEEA 219
Query: 189 K-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 220
K +LL EK+ S F +P++ + ++ Q R
Sbjct: 220 KLRLLWEKQNKKDGPSPF-VPTNMAVNFVQHNR 251
>gi|353228848|emb|CCD75019.1| hypothetical protein Smp_035150 [Schistosoma mansoni]
Length = 315
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 100/204 (49%), Gaps = 28/204 (13%)
Query: 39 LALEEIKFLQKQRERKSGIPA--------IPSALQSAA-------AAGGGGLTKVSEKNE 83
+ +E I+ LQK R+R +G+ +P + A G L +S +
Sbjct: 39 VVVEAIRELQKLRKRPAGVSLSALATGKEVPDINLTIANDPFRLKTGGLVDLNSISSAKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK-NIDVNDRVENDLKHAED 141
+ + D E L TF ET ED M+KY+E+E+AK++G DR E+ D
Sbjct: 99 SEEDDDVEARLAKTFTTETNKRDEDAEMIKYIEEEVAKRKGLIKPSTLDRDED-----SD 153
Query: 142 ELYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMG 199
L +PE+LK + ++ + S Q GI EV L +E K+KNIE TE AK+ L KRL G
Sbjct: 154 LLQDVPEYLKPSIGQQKEDMLSNQMLCGIPEVDLGVEAKMKNIEATEEAKQTLFRKRL-G 212
Query: 200 RA---KSDFSIPSSYSADYFQRGR 220
R ++ P+S + ++ Q R
Sbjct: 213 RKHGYSTNHIAPTSMAVNFVQHSR 236
>gi|226486444|emb|CAX74351.1| hypothetical protein [Schistosoma japonicum]
Length = 312
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 105/213 (49%), Gaps = 29/213 (13%)
Query: 39 LALEEIKFLQKQRERKSGI-----------PAIPSALQS----AAAAGGGGLTKVSEKNE 83
+ + I+ LQK R+R +GI P + A+ + G L +S +
Sbjct: 39 VVVGAIRELQKLRKRPAGISLSALATGKEVPDVNLAIANDPFKLKTGGLVDLNSISSTKQ 98
Query: 84 GDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDE 142
+ + D E L TFA ET ED M++Y+E+ELAK++G DR E+ D
Sbjct: 99 AEEDDDVEAHLAKTFATETNKRDEDAEMIRYIEEELAKRKGLTKPSLDRAED-----SDL 153
Query: 143 LYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL--- 197
L +PE+LK + ++ + S Q GI E+ L +E K+KNIE E AK++L +KR
Sbjct: 154 LQDVPEYLKPSIGQQKEDMLSNQMLCGIPEIDLGVEAKMKNIEAPEEAKQILLKKRFNRK 213
Query: 198 MGRAKSDFSIPSSYSADYFQRGR--DYAEKLRR 228
G + + + P + + ++ Q R Y +++ R
Sbjct: 214 HGHSVDEIA-PINMALNFVQHSRWNSYTDRVSR 245
>gi|342318949|gb|EGU10904.1| Hypothetical Protein RTG_03298 [Rhodotorula glutinis ATCC 204091]
Length = 359
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 110/231 (47%), Gaps = 34/231 (14%)
Query: 71 GGGGLTKVSEKNEG-DGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
G GG ++ + +EG + + +++ D F +T + D +ML Y+E ELAKKRG+ +
Sbjct: 129 GTGGADRIRDDSEGPEAKARKIIKTDNFTGQTNTVDVDKHMLAYIEAELAKKRGEASGSS 188
Query: 130 DRVENDLKHAE--DELYKIPEHLKV----------KKRNSEESSTQWTTG----IAEVQL 173
D N + + DELY++ E K K+R+ EE + +TG I EV L
Sbjct: 189 DPSSNPSRPYDPRDELYRVAEKYKFADIAEQEGKKKERDEEEGNVTLSTGMLMGIPEVDL 248
Query: 174 PIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFS-IPSS---YSADYFQRGRDYAEKLRRE 229
I+ KLKNIE TE AK+ L+E G + + +P ++ D F R R
Sbjct: 249 GIDTKLKNIEATEKAKRALREGSRRGSGPEEAAGLPPDKDEFAVDRFYR--------HRR 300
Query: 230 HPELYKDRGSQDDGAGSRPTDNSTDA-----AGSRQAATDQFMLERFRKRE 275
E + ++ + P + DA R+ ATD+ + RF+KR+
Sbjct: 301 PLESDQSALARARYLAANPPETDPDAELRKRKPGRETATDEMAVARFKKRQ 351
>gi|256092888|ref|XP_002582109.1| hypothetical protein [Schistosoma mansoni]
Length = 314
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 108/227 (47%), Gaps = 40/227 (17%)
Query: 22 EEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG---------- 71
EE+ N + + + +E I+ LQK R+R +G+ SA A G
Sbjct: 22 EEDHLENSTAPVVADSTVVVEAIRELQKLRKRPAGVSL------SALATGKEVPDINLTI 75
Query: 72 --------GGGLTKVSEKNEG-DGEKDELV---LQDTFAQETAVMVEDPNMLKYVEQELA 119
GGL ++ + E+D+ V L TF ET ED M+KY+E+E+A
Sbjct: 76 ANDPFRLKTGGLVDLNSISSAKQSEEDDDVEARLAKTFTTETNKRDEDAEMIKYIEEEVA 135
Query: 120 KKRG-KNIDVNDRVENDLKHAEDELYKIPEHLK--VKKRNSEESSTQWTTGIAEVQLPIE 176
K++G DR E+ D L +PE+LK + ++ + S Q GI EV L +E
Sbjct: 136 KRKGLIKPSTLDRDED-----SDLLQDVPEYLKPSIGQQKEDMLSNQMLCGIPEVDLGVE 190
Query: 177 YKLKNIEETEAAKKLLQEKRLMGRA---KSDFSIPSSYSADYFQRGR 220
K+KNIE TE AK+ L KRL GR ++ P+S + ++ Q R
Sbjct: 191 AKMKNIEATEEAKQTLFRKRL-GRKHGYSTNHIAPTSMAVNFVQHSR 236
>gi|71024789|ref|XP_762624.1| hypothetical protein UM06477.1 [Ustilago maydis 521]
gi|46100513|gb|EAK85746.1| hypothetical protein UM06477.1 [Ustilago maydis 521]
Length = 320
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 107/232 (46%), Gaps = 38/232 (16%)
Query: 64 LQSAAAAGGGGLTKVSEKNEGDGEKD----ELVLQDTFAQETAVMVEDPNMLKYVEQELA 119
+Q+AA G T +E+ E D + V ++ F ET + D +M+ Y+EQE+
Sbjct: 105 IQAAALRGSTSHTADNEQEESDDDNPTKPRRRVRKNHFQSETGTVDVDKHMMAYIEQEIK 164
Query: 120 KKRGKNI--DVN-DRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQ------------W 164
K+ G N+ D N D V +++ + +LY + E + +R+ + TQ
Sbjct: 165 KRTGTNMQSDSNSDSVSKPIQNPDHQLYAVAEKYRELQRSIQPEQTQEEREGNVALSSAM 224
Query: 165 TTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAE 224
+ I EV L I+ ++ NI++TE A++ L + R D P + D + A
Sbjct: 225 LSSIPEVDLGIDNRMHNIQQTELARRKLHQHRTSNAHHQDAHAPQAARGDAADQALANA- 283
Query: 225 KLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
R +H + RP +D + +Q ATDQ +L+RFRKR+R
Sbjct: 284 --RFQH-------------SKQRPL---SDPSARQQMATDQLVLDRFRKRQR 317
>gi|68163441|ref|NP_001020174.1| uncharacterized protein LOC311855 [Rattus norvegicus]
gi|60552111|gb|AAH91189.1| Similar to Hypothetical protein MGC11690 [Rattus norvegicus]
Length = 211
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 82/137 (59%), Gaps = 5/137 (3%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKI 146
E+++L L +F+ ET ED +M+KY+E EL K++G I + + K+AED LY++
Sbjct: 46 EEEDLHLGTSFSAETNRRDEDADMMKYIETELKKRKG--IVEQEEQKAKPKNAEDCLYEL 103
Query: 147 PEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKS 203
PE+++V K+ E S Q +GI EV L I+ K+KNI TE AK +LL E++ +
Sbjct: 104 PENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSE 163
Query: 204 DFSIPSSYSADYFQRGR 220
+P++ + +Y Q R
Sbjct: 164 TSFVPTNMAVNYVQHNR 180
>gi|195587004|ref|XP_002083257.1| GD13638 [Drosophila simulans]
gi|194195266|gb|EDX08842.1| GD13638 [Drosophila simulans]
Length = 286
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 135/293 (46%), Gaps = 50/293 (17%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA 67
KK +KN R+R +EEE +E +L L+EIK Q+ R+R +G+ + AL
Sbjct: 18 KKAGRKNLRQRKNSDEEE---------KEEQLTLDEIKERQRLRQRPNGVSLVGLALGKK 68
Query: 68 AA------------AGGGGLTKVSEKNEGD-GEKDE---LVLQDTFAQETAVMVEDPNML 111
A GGL + + G E D+ + + F+ ET ED M+
Sbjct: 69 IAPEEELAIKDPFNVKTGGLVNMKQLKSGKMKEADDAYDVGIGTQFSAETNKRDEDEEMM 128
Query: 112 KYVEQELAKKRGKNIDVNDRVEND------LKHAEDELYKIPEHLK--VKKRNSEESSTQ 163
KY+EQEL K++G + D E+D L + LY +P+HL+ R+ E S Q
Sbjct: 129 KYIEQELQKRKGGGTE--DAAEDDGDVNKYLTPEDAALYALPDHLRQSSSHRSEEMLSNQ 186
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYA 223
GI EV L I + EA +KLLQ+ + S F +P++ + ++ Q R
Sbjct: 187 MLNGIPEVDLGIRPR-------EAKQKLLQDAKNKKDGPSQF-VPTNMAVNFMQHNRFNI 238
Query: 224 EKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
E + +++ G++ + T+ G ++ ATD + ++FRK+ R
Sbjct: 239 EDNSDQRRR------KREEREGNKSAQHQTNPNGVKR-ATDDYHYDKFRKQFR 284
>gi|312375465|gb|EFR22835.1| hypothetical protein AND_14137 [Anopheles darlingi]
Length = 263
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 95/189 (50%), Gaps = 17/189 (8%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRG------KNIDVNDRVENDLKHAEDELYKIPEHL 150
F+ ET ED M+KY+E+EL+K++G K ID + L E L +P HL
Sbjct: 81 FSAETNKRDEDEEMMKYIEEELSKRKGIAQQQDKPIDGESSTKY-LSPEEAALLSLPAHL 139
Query: 151 KVKK--RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSI 207
R+ E S Q +GI E+ L IE K+KNIE TE AK K +QE++ S F +
Sbjct: 140 SQTSSLRSEEMLSNQMLSGIPEIDLGIEAKIKNIEATEEAKLKYMQEQQRKKNLPSHF-V 198
Query: 208 PSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFM 267
PS+ + ++ Q R R ++P K R D G + D D ++ ATD +
Sbjct: 199 PSNMAVNFMQHNR-----YRIDNPAPPKRRYQDDHHRGGQRGDQRNDDRIPKK-ATDDYH 252
Query: 268 LERFRKRER 276
++F+K+ R
Sbjct: 253 FDKFKKQYR 261
>gi|170035810|ref|XP_001845760.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878197|gb|EDS41580.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 309
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/238 (30%), Positives = 112/238 (47%), Gaps = 33/238 (13%)
Query: 8 KKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSAL--- 64
K + K+N R+R E+ SDD++E LEE K Q+ R + +G+ + A+
Sbjct: 15 KPKSKRNLRQRIKTED-------SDDDQEVLTKLEETKEKQRLRNKTNGVNLLSLAMGKK 67
Query: 65 ---------QSAAAAGGGGLTKVSEKNEGDGEKDE----LVLQDTFAQETAVMVEDPNML 111
+ GG+ + G + E + F+ ET ED M+
Sbjct: 68 ITIEEEVTNKDPFNTKSGGMVNMQALKSGKIKTVEDPYDTGIGTQFSAETNKRDEDEEMM 127
Query: 112 KYVEQELAKKRGKNIDV---NDRVENDLKHAEDE---LYKIPEHLK--VKKRNSEESSTQ 163
KY+EQ+L KK+G + + D E+ K+ E L +P HL +R+ E S Q
Sbjct: 128 KYIEQQLGKKKGLDKETAGDGDAGESSAKYLSPEEAALLSLPAHLSHTSSQRSEEMLSNQ 187
Query: 164 WTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR 220
+GI EV L IE K+KNIE TE AK K +QE++ S F +P++ + ++ Q R
Sbjct: 188 MLSGIPEVDLGIEAKIKNIEATEDAKLKFMQEQQRKKDMPSHF-VPTNMAVNFMQHNR 244
>gi|291414331|ref|XP_002723414.1| PREDICTED: chromosome 9 open reading frame 78-like [Oryctolagus
cuniculus]
Length = 291
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/218 (33%), Positives = 110/218 (50%), Gaps = 18/218 (8%)
Query: 73 GGLT---KVSEKN-EGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDV 128
GG+ K+ E+ E E+++L L +F+ ET ED +M+K EL KR K+I
Sbjct: 76 GGMVDMKKLKERGKEKISEEEDLHLGTSFSAETNRRDEDADMMKVHRTEL--KRRKSIVE 133
Query: 129 NDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETE 186
+ + AED LY++PE ++V+ KR E S Q +GI EV L I+ K+KNI TE
Sbjct: 134 CEEQRVKPRSAEDCLYELPESIRVRSAKRTEEMLSNQMLSGIPEVDLGIDAKIKNIISTE 193
Query: 187 AAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL------RREHPELYKDR-G 238
AK +LL E++ + +P++ + +Y Q R Y E+L +E P+ R G
Sbjct: 194 DAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELNAPIRRNKEEPKARPLRVG 253
Query: 239 SQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
+ R N A + ATD + E+F+K R
Sbjct: 254 DTEKPEPERSPPNRKRPANEK--ATDDYHYEKFKKMNR 289
>gi|322793759|gb|EFZ17143.1| hypothetical protein SINV_07529 [Solenopsis invicta]
Length = 293
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 70/213 (32%), Positives = 104/213 (48%), Gaps = 28/213 (13%)
Query: 30 LSDDEEER------RLALEEIKFLQKQRERKSGIPAIPSALQSAAAA-----------GG 72
LSDD + R +EE+K +QK RER +G+ + AL + A+ G
Sbjct: 28 LSDDNDSEGEKMSLREKVEEMKIIQKLRERPAGVDIVGLALGESVASDVITSDPFNMKTG 87
Query: 73 GGLTKVSEKNEGDGEKD--ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVND 130
G + + KN D E + F ET ED M+KY+E+EL+K++ KN +
Sbjct: 88 GMVNMTALKNTKHKPNDAYETGIGTQFNAETNKRDEDEEMVKYIEEELSKRKSKNNNDAA 147
Query: 131 RVENDLKHA-----EDELYKIPEHLKVKKRNSEES--STQWTTGIAEVQLPIEYKLKNIE 183
N+ K + E L +PEHL+ N E S Q +GI EV L IE K++NIE
Sbjct: 148 NSANNEKGSYCSPEEAALRAVPEHLRQSSANRSEEMLSNQMLSGIPEVDLGIEAKIRNIE 207
Query: 184 ETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADY 215
TE AK KLL ++ S F +P++ + ++
Sbjct: 208 ATEEAKLKLLWDRHRKKDGPSQF-VPTNMAVNF 239
>gi|58388944|ref|XP_316650.2| AGAP006620-PA [Anopheles gambiae str. PEST]
gi|55239374|gb|EAA11347.2| AGAP006620-PA [Anopheles gambiae str. PEST]
Length = 295
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 96/188 (51%), Gaps = 21/188 (11%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVEND-----LKHAEDELYKIPEHLK 151
F+ ET ED M+KY+E+EL K++G + +++ E + L E L +P HL
Sbjct: 119 FSAETNKRDEDEEMMKYIEEELGKRKGIAQEQDNQAEGESSGKYLSPEEAALLSLPAHLS 178
Query: 152 --VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIP 208
+R+ E S Q +GI E+ L IE K+KNIE TE AK K +QE++ S F +P
Sbjct: 179 QTSSQRSEEMLSNQMLSGIPEIDLGIEAKIKNIEATEDAKLKYMQEQQRKKDLPSHF-VP 237
Query: 209 SSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFML 268
S+ + ++ Q R R ++P K R Q+D R D + ATD +
Sbjct: 238 SNMAVNFMQHNR-----YRIDNPAPAKRR-YQEDHRDQRHDDRVP------KKATDDYHF 285
Query: 269 ERFRKRER 276
++F+K+ R
Sbjct: 286 DKFKKQYR 293
>gi|345306099|ref|XP_001508082.2| PREDICTED: uncharacterized protein C9orf78-like [Ornithorhynchus
anatinus]
Length = 174
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 92/176 (52%), Gaps = 14/176 (7%)
Query: 111 LKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KRNSEESSTQWTTGI 168
+KY+E EL K++G I N+ + K+AED LY++PE+++V K+ E S Q +GI
Sbjct: 1 MKYIETELKKRKG--IVENEEQKVKPKNAEDCLYELPENIRVSSAKKTEEMLSNQMLSGI 58
Query: 169 AEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKL- 226
EV L I+ K+KNI TE AK +LL E++ + +P++ + +Y Q R Y E+L
Sbjct: 59 PEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHNRFYHEELN 118
Query: 227 -----RREHPELYKDR-GSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRER 276
+E P+ R G + R N + ATD + E+F+K R
Sbjct: 119 APVRRNKEEPKARPLRVGDTEKPEPERSPPNRKRPHNEK--ATDDYHYEKFKKMNR 172
>gi|388582301|gb|EIM22606.1| hypothetical protein WALSEDRAFT_56792 [Wallemia sebi CBS 633.66]
Length = 285
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 114/240 (47%), Gaps = 42/240 (17%)
Query: 11 KKKNFRKRSYEEE--EETTNKLSDDEEERRLALEEIKFLQKQRERKSGI----------- 57
KK+N K + E+ EE+ N +DD+ +EE ++K + + +GI
Sbjct: 3 KKRNINKSTRREKSIEESNNVDNDDDSTISDTIEEKHLIRKLKRQAAGIDSEKLVQTTSN 62
Query: 58 ---PAIPSALQSAA---AAGGGGLTKVSEK-NEGDGEKDELVLQDTFAQETAVMVEDPNM 110
P + ++ A +A GG+ +EK E +D+LV F ++A + D +M
Sbjct: 63 SKKPKLDNSQSKEAHGWSASSGGIVDNTEKLKEASNPQDKLVKTSNFTGQSATIDVDKHM 122
Query: 111 LKYVEQELAKKRGK------NIDV---NDRVENDLKHAEDELYKIPEHLKVKKRNSEESS 161
+ Y+E+++ KK + N+++ N+R+ N +DELY I + K RN ++ S
Sbjct: 123 MSYIEEQMLKKHQQQGLPTDNLNLGITNERINN----PQDELYDIAQKYTYKSRNLDDGS 178
Query: 162 T----QWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQ 217
T I EV L +E KL+NI+ETE AK +R+ K P++ DY Q
Sbjct: 179 ITNSESMLTKIPEVDLGVEAKLRNIQETEKAK-----QRMRDLEKHTSRKPANTDPDYTQ 233
>gi|170573241|ref|XP_001892395.1| Hepatocellular carcinoma-associated antigen 59 family protein
[Brugia malayi]
gi|158602086|gb|EDP38774.1| Hepatocellular carcinoma-associated antigen 59 family protein
[Brugia malayi]
Length = 426
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 120/244 (49%), Gaps = 28/244 (11%)
Query: 9 KEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA 68
K+++++ R+R + ++ TT ++E E LE +K LQ+ R RK+G+ A+ AL
Sbjct: 9 KKRQRHLRERIIDNDDSTT----EEEAEIACKLEGVKELQESRIRKNGLNAVECALGKEL 64
Query: 69 AA------------GGGGLTKVSEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNMLKYV 114
AA GGG+ ++SE + ++ ++D F +E+ + E M KYV
Sbjct: 65 AAEFIAMDDDPFRQRGGGMLRLSEGRQAQMHAADIEAGIRDQFKKESFLRDEHEEMKKYV 124
Query: 115 EQELAKKRG-KNIDVNDRVENDLKHAEDEL-YKIPEHLKV--KKRNSEESSTQWTTGIAE 170
+ EL K++ ++++ ND + + ED L +K E +++ +RN E S Q GI E
Sbjct: 125 QAELRKRKAVQDLEDNDATTSKVSSMEDTLMWKAAEKVRLFRSERNDELLSNQMLAGIPE 184
Query: 171 VQLPIEYKLKNIEETEAAK----KLLQEKRLMGRAKSDFSIPSS--YSADYFQRGRDYAE 224
V L I ++ NI ETE K K + EKR S FS + + DY Q Y E
Sbjct: 185 VDLGINARMSNIIETEKKKSDMLKEVVEKRRNLAQDSLFSQDRAKDLAKDYVQHSIFYME 244
Query: 225 KLRR 228
R
Sbjct: 245 STTR 248
>gi|336373766|gb|EGO02104.1| hypothetical protein SERLA73DRAFT_132900 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386581|gb|EGO27727.1| hypothetical protein SERLADRAFT_383135 [Serpula lacrymans var.
lacrymans S7.9]
Length = 276
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 101/193 (52%), Gaps = 20/193 (10%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQEL-AKKRGKNIDVNDRVENDLKHAEDELYKIPEH 149
+V + F Q+T + D +M+ Y+E+ L + + ++ + R + +DELY + E
Sbjct: 96 VVRANNFTQQTNALDVDKHMMAYIEENLKIRSKPQSPEPTSRSSD----PQDELYNVSER 151
Query: 150 LKVKKRNSEESSTQ----WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDF 205
KV+KR +EE S T I EV L ++ +LKNIEETE AK+++ E+R R K
Sbjct: 152 WKVEKRMAEEGSVTNSLTMLTAIPEVDLGMDTRLKNIEETEKAKRMVAEER-KDRKK--- 207
Query: 206 SIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR-QAAT 263
++ ++ R Y L+ + ++ +D ++ + G P D S R Q AT
Sbjct: 208 ---ATNDEEHLAAARFYRPNLKQKSDADIMRD--AKLEAMGLPPQDESRRHHHDRPQMAT 262
Query: 264 DQFMLERFRKRER 276
D+ ++ERF+KR R
Sbjct: 263 DEAVMERFKKRMR 275
>gi|170090594|ref|XP_001876519.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648012|gb|EDR12255.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 252
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 99/195 (50%), Gaps = 20/195 (10%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
+V + F Q+T + D +M+ Y+E+ L + D +D E+ ++ LYKI EH
Sbjct: 68 VVRANNFTQQTNTLDVDKHMMAYIEENLKIRSKPREDSDD--EDKPHDPQEALYKIAEHW 125
Query: 151 KVKK------RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 204
KV K S +S T I EV L ++ +LKNIE+TE AK+++ E+R D
Sbjct: 126 KVGKPQPKTDEGSVTNSMTMLTAIPEVDLGMDTRLKNIEDTEKAKRVVAEER------HD 179
Query: 205 FSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGS--RQA 261
P++ ++ R Y +R + ++ +D ++ + G P D+S + Q
Sbjct: 180 RKKPNN-DEEHLVASRFYRPNMRAKSDADILRD--AKLEAMGMPPQDDSPQRSNQERTQM 236
Query: 262 ATDQFMLERFRKRER 276
ATD+ ++ERF+KR R
Sbjct: 237 ATDEIVMERFKKRMR 251
>gi|343426688|emb|CBQ70217.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 275
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 88/199 (44%), Gaps = 48/199 (24%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN-DRVENDLKHAEDELYKIPEH 149
LV ++ F ET + D +M+ Y+E E+AK+ G + + D V + L + D+LY + E
Sbjct: 109 LVRKNNFQGETGTVDVDKHMMAYIEAEMAKRTGTSTASSADTVRSALANPHDQLYALAEE 168
Query: 150 LKVKKRNSEESSTQ------------WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL 197
+ +R + TQ + I EV L I+ ++KNI+ TE AK+ L ++
Sbjct: 169 YRQLQRQIKPDQTQDEREGNVALSAAMLSSIPEVDLGIDERMKNIQHTEDAKRALAQRAK 228
Query: 198 MGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAG 257
A + ++++ FQ+ +
Sbjct: 229 AANADG-LGVDTAFAGARFQQ----------------------------------VSGSN 253
Query: 258 SRQAATDQFMLERFRKRER 276
SRQ ATDQ +L+RFRKR+R
Sbjct: 254 SRQMATDQLVLDRFRKRQR 272
>gi|443927092|gb|ELU45623.1| hepatocellular carcinoma-associated antigen 59 domain-containing
protein [Rhizoctonia solani AG-1 IA]
Length = 349
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 96/193 (49%), Gaps = 29/193 (15%)
Query: 31 SDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSA--------------AAAGGGGLT 76
S+ +E ++ +EE+ L+K R ++ GI + S G GL
Sbjct: 40 SEGVDEEKMTIEELLELRKLRRQRQGIDSTKLNAGSTKKKKRRDEDEEAEDENEGKYGLR 99
Query: 77 KVSEKNEGD-------GEKD---ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI 126
K ++ +GD G +D +++ + F Q+T + D +M+KY+E+EL K+RGK
Sbjct: 100 KGGQRQDGDDDEASADGAEDVAKKIIKSNNFTQQTNKLDVDKHMMKYIEEELEKRRGKPN 159
Query: 127 DVNDRVENDLKHAEDELYKIPEHLKVKKR-----NSEESSTQWTTGIAEVQLPIEYKLKN 181
D ++ EL++I E K++K+ S +S+ T I EV L ++ +LKN
Sbjct: 160 ASGDTGNSNSSDPYAELFRISEKYKLQKKQELEEGSVTNSSAMLTAIPEVDLGMDTRLKN 219
Query: 182 IEETEAAKKLLQE 194
IEETE AK+ + E
Sbjct: 220 IEETEKAKRTVSE 232
>gi|328871809|gb|EGG20179.1| hypothetical protein DFA_07299 [Dictyostelium fasciculatum]
Length = 310
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 99/206 (48%), Gaps = 21/206 (10%)
Query: 12 KKNFRKRSYEEEEETTNKLS-------DDEEERRLALEEI-KFLQKQRERKSGI------ 57
K + RK+ ++E TN S DD+++ + EI K QK RE+ GI
Sbjct: 54 KPSLRKKDGDDESSLTNDTSSIASENGDDQQQDLSTIIEITKERQKMREKGKGIIAGVLA 113
Query: 58 --PAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVE 115
P I + L+ T +EKNE + ++ + ++ ++ + + L
Sbjct: 114 EGPHIKAHLRELEHKLDDSFTIATEKNETNVHLEKFLAKEMEKKKIEIKHKLTGGLMGTT 173
Query: 116 QELAKKRGKNIDVNDRVEND----LKHAEDELYKIPEHLKVKK-RNSEESSTQWTTGIAE 170
+ ++ +N D N+ +K ED LY+ PEHL VKK R EE T W GI+E
Sbjct: 174 EHRKEEDDENKQTKDNNANNTTTKIKTDEDSLYETPEHLAVKKTRKKEEDKTNWLAGISE 233
Query: 171 VQLPIEYKLKNIEETEAAKKLLQEKR 196
V LP YK+KNI+ETE A+ +++ +
Sbjct: 234 VSLPTSYKIKNIQETEDARSKIKDSK 259
>gi|357610714|gb|EHJ67110.1| hypothetical protein KGM_02139 [Danaus plexippus]
Length = 214
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/193 (35%), Positives = 97/193 (50%), Gaps = 20/193 (10%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKR-----GKNIDVNDRVENDLKHAEDELYKIPEHLK 151
F+ ET ED M+KY+E++LAK++ K + V L E L +PEHL+
Sbjct: 27 FSAETNKRDEDEEMMKYIEEQLAKRKEGSDSSKKESDDSEVLKYLAPEEAALLSLPEHLR 86
Query: 152 VKK--RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIP 208
R+ E S Q +GI EV L I+ K+KNIE TE AK KLL EK S F +P
Sbjct: 87 SSSMHRSEEMLSNQMLSGIPEVDLGIDAKIKNIEATEEAKMKLLWEKHNKKDGPSHF-VP 145
Query: 209 SSYSADYFQRGR-----DYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAAT 263
++ + ++ Q R +++K E P + K S D + ++ A G R AT
Sbjct: 146 TNMAVNFVQHNRFNLDSIHSKKRPAERP-IQKVEVSVIDESVNKIVKK---AKGER--AT 199
Query: 264 DQFMLERFRKRER 276
D + ERFRK+ R
Sbjct: 200 DDYHYERFRKQFR 212
>gi|443897955|dbj|GAC75293.1| uncharacterized conserved protein [Pseudozyma antarctica T-34]
Length = 290
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 95/204 (46%), Gaps = 39/204 (19%)
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK-NIDVNDRVENDLKHAEDELYK 145
+K LV ++ F ET + D +M+ Y+E E+ K+ G + + + ED LY
Sbjct: 111 DKPRLVRKNNFQGETGTVDVDKHMMAYIEDEMRKRTGSADTVDAAAIVAAVNDPEDALYA 170
Query: 146 IPE-----HLKVKKRNSEES-------STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQ 193
+ E H +K ++E S+ T I EV L I+ ++ NI++TEAA++
Sbjct: 171 VAEKYKELHRSIKPEQTQEQREGNVAFSSAMLTSIPEVDLGIDARMANIQDTEAARREAS 230
Query: 194 EKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNST 253
+ + + + ++ FQR K R++H +Q GSRP
Sbjct: 231 QPK-----PAHHDVDEDFANARFQRA-----KPRQDH--------AQSSNQGSRPE---- 268
Query: 254 DAAGSRQAATDQFMLERFRKRERH 277
RQ ATDQ +L+RF+KR+R+
Sbjct: 269 ----RRQMATDQLVLDRFKKRQRN 288
>gi|148298871|ref|NP_001091802.1| uncharacterized protein LOC778507 [Bombyx mori]
gi|116272507|gb|ABJ97189.1| hypothetical protein [Bombyx mori]
Length = 226
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/132 (38%), Positives = 76/132 (57%), Gaps = 9/132 (6%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAK-KRGKNIDVNDRVEND-LKHAEDE---LYKIPEHLK 151
F+ ET ED M+KY+E++LAK K G + D D + LK+ E L +P+HL+
Sbjct: 40 FSAETNKRDEDEEMMKYIEEQLAKRKEGCDKDNKDHNHTETLKYLSPEEAALLSLPDHLR 99
Query: 152 V--KKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRAKSDFSIP 208
V +R+ E S Q +GI EV L I+ K+KNIE TE AK KL+ E++ S F +P
Sbjct: 100 VSSNQRSEEMLSNQMLSGIPEVDLGIDAKIKNIEATEEAKMKLIWERQNKKDGPSQF-VP 158
Query: 209 SSYSADYFQRGR 220
++ + ++ Q R
Sbjct: 159 TNMAVNFVQHNR 170
>gi|302693901|ref|XP_003036629.1| hypothetical protein SCHCODRAFT_49879 [Schizophyllum commune H4-8]
gi|300110326|gb|EFJ01727.1| hypothetical protein SCHCODRAFT_49879 [Schizophyllum commune H4-8]
Length = 292
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 95/197 (48%), Gaps = 25/197 (12%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
+V + F Q+T + D +M+ Y+EQ L K R + +D D + + ++ LY + +
Sbjct: 109 VVRNNNFTQQTNALDVDKHMIAYIEQNL-KVRSRPLDDEDEKKQEPLDPQEALYHLSDKW 167
Query: 151 KVKKRNSEE-----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDF 205
+ K+ E +S T I EV L ++ +LKNIEETE AK+L+ E++ GR +D
Sbjct: 168 NLNKQTHPEDGSVTNSMTMLTAIPEVDLGMDARLKNIEETEKAKRLIAEEK-QGRKLTD- 225
Query: 206 SIPSSYSADYFQRGRDYAEKLRREHPELYKD----RGSQDDGAGSRPTDNSTDAAGSR-- 259
+ A + R H + D R ++ G +P D S +
Sbjct: 226 -----------EEAHLVATRFYRPHLKTKSDADIMRDAKLAAMGMQPKDQSQRWSNHDRP 274
Query: 260 QAATDQFMLERFRKRER 276
Q ATD+ ++ERF+KR R
Sbjct: 275 QMATDEIVMERFKKRMR 291
>gi|134108060|ref|XP_777412.1| hypothetical protein CNBB2130 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260102|gb|EAL22765.1| hypothetical protein CNBB2130 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 303
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPE-- 148
LV + F Q+T + D +M+ Y+E ELAK+RG+ D D+ + + ELY+I E
Sbjct: 123 LVRVNNFTQQTNALDVDKHMMAYIEAELAKRRGQAADTTDKSAIEDNDPQAELYRIAEKY 182
Query: 149 HLKVKKRNSEE-----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
+ +K+ +++ +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 183 QFETRKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 235
>gi|242007288|ref|XP_002424473.1| protein C9orf78, putative [Pediculus humanus corporis]
gi|212507891|gb|EEB11735.1| protein C9orf78, putative [Pediculus humanus corporis]
Length = 314
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 129/278 (46%), Gaps = 52/278 (18%)
Query: 38 RLALEEIKFLQKQRERKSGIPAIPSAL--------------QSAAAAGGGGLTKVSEKNE 83
R LEE+K LQK R R +G+ + AL + GG+ +
Sbjct: 48 RTKLEEMKMLQKLRARPNGVNIVGLALGRKIGEEEEEDIDVKDPFKTKSGGMINMKTLKS 107
Query: 84 GDGEKDE----LVLQDTFAQETAVMVEDPNMLKYVEQELAKKR--------GKNIDVNDR 131
G +K + + F+ ET ED M+K++E +L+KK+ GK+ D ++
Sbjct: 108 GKIKKMDDAYDTGIGTQFSAETNKRDEDEEMMKFIEDQLSKKKGLMKEKKSGKSDDQDES 167
Query: 132 VENDLKHAEDE-LYKIPEHLKVK--KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAA 188
++ E+ L IP+HL+ +R+ E S Q +GI EV L I+ K++NIE TE A
Sbjct: 168 SKSKYCSPEEAALQAIPDHLRSSSMQRSEEMLSHQMLSGIPEVDLGIDAKIRNIEATEEA 227
Query: 189 K-KLLQEKRLMGRAKSDFSIPSSYSADYFQRGR---DYAEKLRREHPELYKDRGSQDDGA 244
K KLL + S F +PS+ + ++ Q+ + D E LR+ YK
Sbjct: 228 KLKLLWSEHNKKEGPSQF-VPSNITVNFMQQNKMNQDDLEPLRKRQKN-YK--------- 276
Query: 245 GSRPT----DNSTDAAGSRQA--ATDQFMLERFRKRER 276
RPT D++ A ++ ATD + ERF+K+ R
Sbjct: 277 --RPTVQILDDAKIAIKRKEGEKATDDYHYERFKKQFR 312
>gi|307109350|gb|EFN57588.1| hypothetical protein CHLNCDRAFT_143273 [Chlorella variabilis]
Length = 271
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 101/199 (50%), Gaps = 26/199 (13%)
Query: 11 KKKNFRK-RSYEEEEETTNKLSDDEE-ERRLALEEIKFLQKQRERKS---------GI-P 58
K++N RK R+ +E+EE + ++ E + +L ++IK LQKQR+R++ G+ P
Sbjct: 2 KQRNIRKKRALDEQEELSEGEAEGAEGQPKLTADDIKLLQKQRQRRTVRLATAETRGVQP 61
Query: 59 AIPSALQSAAAAGGGGL----TKVSEKNEGDGEKDELVLQDTFAQETAVMVED--PNMLK 112
A LQ+ + G L +V K G VLQ F +E + ED NM K
Sbjct: 62 AW--RLQAGSGVDVGSLMVADVRVERKEAGAAAVVGDVLQAAFKRERRLHSEDEDVNMKK 119
Query: 113 YVEQELAKKRGK----NIDVNDRVENDLKHAEDELYK-IPEHLKVKKRNSEESSTQWTTG 167
YVE++LAK+ G+ + +D L +PE L+ +++++E + W G
Sbjct: 120 YVEEQLAKRMGRPGQEEEEAAAAEAERRARMQDPLLAAMPEGLQKRQQDTELGPS-WVAG 178
Query: 168 IAEVQLPIEYKLKNIEETE 186
I EV L +E KL NIE TE
Sbjct: 179 ITEVPLSMEQKLANIEATE 197
>gi|58264334|ref|XP_569323.1| hypothetical protein CNB03570 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57223973|gb|AAW42016.1| hypothetical protein CNB03570 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 303
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 64/113 (56%), Gaps = 7/113 (6%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
LV + F Q+T + D +M+ Y+E ELAK+RG+ D D+ + + ELY+I E
Sbjct: 123 LVRVNNFTQQTNALDVDKHMMAYIEAELAKRRGQAADTTDKSAIEDNDPQAELYRIAEKY 182
Query: 151 KV---KKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
+ KK+ +E +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 183 QFETRKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 235
>gi|357017455|gb|AET50756.1| hypothetical protein [Eimeria tenella]
Length = 325
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 61/108 (56%), Gaps = 1/108 (0%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK 151
+L+ FA D ++ +++ + L K+ ++ + E+D+ +LY IP+HLK
Sbjct: 148 LLEKNFASGGPGSATDKHLEEFLRERLKDKQHESREEKALREHDMVDKMRDLYAIPDHLK 207
Query: 152 VKKRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLM 198
V + E + W TG+ EV+LP+E KLKNIE TE AK+ L K L+
Sbjct: 208 VADKTEEYKDQMNWVTGLVEVELPMETKLKNIEATERAKRQLLRKGLL 255
>gi|390601598|gb|EIN10992.1| hypothetical protein PUNSTDRAFT_64949 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 292
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 89/191 (46%), Gaps = 20/191 (10%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK 151
V F Q+T + D +M+ Y+E+ +AK RG + +D EL ++ + K
Sbjct: 111 VRTSNFTQQTNTLDVDRHMMAYIEENMAKLRGAK---REEKSDDPADPYAELNRLADRYK 167
Query: 152 VKKRNSEE------SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAK--S 203
K+N +E +S T I EV L ++ +LKNIEETE AK+++ E+R R K
Sbjct: 168 FSKKNEKEEEGNVTNSLAMLTAIPEVDLGMDARLKNIEETERAKRIVAEERKDNRRKRED 227
Query: 204 DFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAAT 263
+ IP S + AE LR E G RP + T Q AT
Sbjct: 228 EEQIPGSRFYRPNHNAKTDAEILRNAKLEAM---GLPPQEENRRPHNERT------QMAT 278
Query: 264 DQFMLERFRKR 274
D+ ++ERF+KR
Sbjct: 279 DEMVMERFKKR 289
>gi|358054938|dbj|GAA99005.1| hypothetical protein E5Q_05694 [Mixia osmundae IAM 14324]
Length = 307
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 64/111 (57%), Gaps = 9/111 (8%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN---IDVNDRVENDLKHAEDELYKIP 147
LV + F Q+T + D +M+ Y+E EL K+ KN +DV + + H DELY++
Sbjct: 118 LVKSNNFTQQTNTLDVDKHMMAYIEAELRKRTQKNPGQLDVEEELGKLDPH--DELYQVA 175
Query: 148 EHLKVKK----RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQE 194
E +V K +E +S+ T + EV L I+ +L+NIEETE AK+ L+E
Sbjct: 176 ERYRVAKMPVREGNETTSSAMLTAVQEVDLGIDARLRNIEETEKAKQRLRE 226
>gi|159489200|ref|XP_001702585.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280607|gb|EDP06364.1| predicted protein [Chlamydomonas reinhardtii]
Length = 261
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 95/194 (48%), Gaps = 19/194 (9%)
Query: 11 KKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA-- 68
K++N RKR E+E+ + +S + E +R L E + +Q+ R+R +G A+
Sbjct: 4 KQRNIRKRVASEDEDAPD-VSAEPECQRDKLAETRLMQQLRKRSAGTGVGALAMGGGGPG 62
Query: 69 ---AAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMV---EDPNMLKYVEQELAKKR 122
A+ G + SE EG G V+ D + + ++ V ED +M KYVE++LA +
Sbjct: 63 IGPASREGSVEPGSEGGEGAG----TVVMDAYVKAKSIAVQMDEDAHMQKYVEEQLAARL 118
Query: 123 GKNIDVNDRVEND----LKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYK 178
GK + END + E ELY +P + E + +AEV L + K
Sbjct: 119 GKTAEAEAE-ENDPEVKRRKLEQELYALPSDF-TTQLEQELVLPGMVSTLAEVPLAAKDK 176
Query: 179 LKNIEETEAAKKLL 192
LK+IE TEA K+ L
Sbjct: 177 LKSIEATEALKRSL 190
>gi|321248874|ref|XP_003191271.1| hypothetical protein CGB_A2540W [Cryptococcus gattii WM276]
gi|317457738|gb|ADV19484.1| hypothetical protein CNB03570 [Cryptococcus gattii WM276]
Length = 311
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPE-- 148
LV + F Q+T + D +M+ Y+E ELAK+RG+ D+ + + + ELY+I E
Sbjct: 125 LVRVNNFTQQTNALDVDKHMMAYIETELAKRRGQAAAPTDKSKVEDNDPQAELYRIAEKY 184
Query: 149 HLKVKKRNSEE-----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
+ KK+ +++ +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 185 QFETKKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 237
>gi|328768488|gb|EGF78534.1| hypothetical protein BATDEDRAFT_26637 [Batrachochytrium
dendrobatidis JAM81]
Length = 273
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 89/207 (42%), Gaps = 49/207 (23%)
Query: 31 SDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAGG------------------ 72
SD++E RL L+E L+K R+ K GI A + G
Sbjct: 48 SDEDEHSRLTLQEALELRKLRKPKPGISAASLETGKVSLPNGKHEVSVETLQDQDDPWKL 107
Query: 73 --GGLTKVSE------KNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGK 124
GGL +S+ EG G F + M + +M ++E+EL K+RG
Sbjct: 108 KNGGLINISDIRGRSFGEEGSG-------TGGFETASKAMDTEKHMKAFIEKELRKRRGD 160
Query: 125 NID------------VNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTG----I 168
+ND ++ ++ELY+IP+ L + + +E + +TG I
Sbjct: 161 APSTTSDTSLPSLRKLNDELKTGPTDYDEELYRIPDALTIPVKPIKEDNVTLSTGMLMSI 220
Query: 169 AEVQLPIEYKLKNIEETEAAKKLLQEK 195
EV L + KLKNIEETE AK+ L EK
Sbjct: 221 PEVDLGVSNKLKNIEETEQAKRSLLEK 247
>gi|403413736|emb|CCM00436.1| predicted protein [Fibroporia radiculosa]
Length = 294
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 99/195 (50%), Gaps = 21/195 (10%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG-KNIDVNDRVENDLKHAEDELYKIPEH 149
+V + F Q+T V+ D +M+ Y+E+ + +RG +N ++D D EL+ IP+
Sbjct: 111 VVRANNFTQQTNVLDVDKHMMAYIEENMKLRRGNQNEPMSDDGPLD---PYAELFSIPDK 167
Query: 150 LKVKKRNSEE-----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 204
++ + ++ +S T I EV L ++ +LKNIEETE AK+++ E+R + K D
Sbjct: 168 YRLTQEQEQDEGNVTNSLAMLTAIPEVDLGMDTRLKNIEETEKAKRMITEERKERKKKVD 227
Query: 205 FSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR--QA 261
++ R Y L+ + ++ +D ++ + G P D+ Q
Sbjct: 228 -------DEEHLAAARFYRPNLKMKSDADIIRD--AKLEAMGLLPEDHEYRRPQHERMQM 278
Query: 262 ATDQFMLERFRKRER 276
ATD+ ++ERF+KR R
Sbjct: 279 ATDELVMERFKKRMR 293
>gi|405118594|gb|AFR93368.1| hypothetical protein CNAG_03868 [Cryptococcus neoformans var.
grubii H99]
Length = 304
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 7/113 (6%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
LV + F Q+T + D +M+ Y+E ELAK+RG+ D + + ELY+I E
Sbjct: 124 LVRVNNFTQQTNALDVDKHMMAYIEAELAKRRGQAAATTDESALEDNDPQAELYRIAEKY 183
Query: 151 KV---KKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
+ KK+ +E +S T I EV L ++ +LKNIE TE AK+ + E+R
Sbjct: 184 QFETRKKKADDEGNVTNSLGMLTSIPEVDLGMDNRLKNIEMTEKAKRDMLEQR 236
>gi|393216067|gb|EJD01558.1| hypothetical protein FOMMEDRAFT_135745 [Fomitiporia mediterranea
MF3/22]
Length = 309
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 95/195 (48%), Gaps = 20/195 (10%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID-VNDRVENDLKHAEDELYKIPEHL 150
V + F Q+T + D +M+ Y+E+ + ++RG+ + + D ++ELY+I E
Sbjct: 124 VRTNNFTQQTNALDVDKHMMAYIEENMRQRRGERGEKIEDEEAPKPLDPQEELYRIAEKF 183
Query: 151 KVKK--RNSEESST----QWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSD 204
K + R EE S T I EV L ++ +LKNIEETE AK+ + E + R +D
Sbjct: 184 KTQNNARGQEEGSVTNSLSMLTAIPEVDLGMDARLKNIEETEKAKRAVAEAKKERRQNND 243
Query: 205 FSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNST--DAAGSRQA 261
++ R Y ++ + ++ +D ++ + G RP D Q
Sbjct: 244 --------EEHLAATRFYKPNIKQKSDADIIRD--AKLEAMGLRPDDYEPRRHHPEKVQT 293
Query: 262 ATDQFMLERFRKRER 276
ATD+ ++ERF+KR R
Sbjct: 294 ATDEMIMERFKKRMR 308
>gi|389746703|gb|EIM87882.1| hypothetical protein STEHIDRAFT_94716 [Stereum hirsutum FP-91666
SS1]
Length = 317
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 101/202 (50%), Gaps = 28/202 (13%)
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH--AEDELYKIPEH 149
V + F Q+T + D +M+ Y+E+ + + +N D E K+ ++ELY++ E
Sbjct: 126 VRTNNFTQQTNALDVDKHMMAYIEENMKLRHAQNSSTPD-PEAAPKYLDPQEELYRLSEK 184
Query: 150 LKVKKR---NSEESSTQ---WTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKS 203
KV+K+ N E S T T I EV L ++ +LKNIEETE AK+ L + R + +
Sbjct: 185 YKVEKKAQPNEEGSVTNSLAMLTAIPEVDLGMDTRLKNIEETEKAKQALTQARKDRQKRQ 244
Query: 204 DFSIPSSYSADYFQ---RGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSR- 259
+ +A +F+ R + A+ + R ++ + G P +++ G+R
Sbjct: 245 NDDEEHLAAARFFRPNTRMKSDADII----------RDAKLEAMGLPPAEDNEPHRGNRP 294
Query: 260 -----QAATDQFMLERFRKRER 276
Q ATD+ ++ERF+KR R
Sbjct: 295 RHDRPQMATDEMVMERFKKRMR 316
>gi|409079546|gb|EKM79907.1| hypothetical protein AGABI1DRAFT_39047 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426192503|gb|EKV42439.1| hypothetical protein AGABI2DRAFT_78642 [Agaricus bisporus var.
bisporus H97]
Length = 262
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 105/225 (46%), Gaps = 37/225 (16%)
Query: 73 GGLTKVSEKNEGDGEKDE------LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI 126
GGL K + E D E ++ +V + F Q+T VM D +M+ Y+E+ L K RG+
Sbjct: 52 GGLRKGAPVEEDDEEANKEAKARRVVRTNNFTQQTNVMDVDKHMMAYIEENL-KIRGRPR 110
Query: 127 D--VNDRVENDLKHAEDELYKIPEHLKVKKRN-------SEESSTQWTTGIAEVQLPIEY 177
D DR D + A LY+I + KV K S +S T I EV L ++
Sbjct: 111 DDEAKDRKPLDPQEA---LYRIVDKFKVNKEGQTKGEEGSVTNSLTMLTAIPEVDLGMDN 167
Query: 178 KLKNIEETEAAKKLLQEKRLM-GRAKSD---FSIPSSYSADYFQRGRDYAEKLRREHPEL 233
+LKNIEETE AK+ + E R +AK+D Y Y + + A+ L
Sbjct: 168 RLKNIEETEKAKRSVDEHRQQRKKAKTDEEHLIAQRFYRPGY--KAKSDADIL------- 218
Query: 234 YKDRGSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRER 276
R ++ + G P + S + Q ATD+ ++ERF+K R
Sbjct: 219 ---RDAKLEAMGLPPKEESPRRSNHERNQMATDEIVMERFKKHVR 260
>gi|291229542|ref|XP_002734736.1| PREDICTED: glycosyltransferase 25 domain containing 2-like, partial
[Saccoglossus kowalevskii]
Length = 576
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 74/138 (53%), Gaps = 28/138 (20%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQE 100
LEE K QK R+R+SG+ ++E D ++D + + TFA E
Sbjct: 464 LEETKEAQKFRKRQSGV-----------------------RSEEDTDRDVVDMGSTFAAE 500
Query: 101 TAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK---KRNS 157
T ED +M+KY+E+++AK++GK + + E LK AED LY++P L+V+ ++
Sbjct: 501 TNRRDEDADMMKYIEEQMAKRKGKAMTKEE--ERRLKSAEDLLYELPARLQVESSSQKTE 558
Query: 158 EESSTQWTTGIAEVQLPI 175
E S Q +GI EV L I
Sbjct: 559 EMMSHQMLSGIPEVDLGI 576
>gi|392568454|gb|EIW61628.1| hypothetical protein TRAVEDRAFT_163006 [Trametes versicolor
FP-101664 SS1]
Length = 285
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 97/196 (49%), Gaps = 23/196 (11%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED--ELYKIPE 148
+V ++ F +T + D +M+ Y+E+ + +RG D + D A+ E+++I E
Sbjct: 102 VVRENNFTHQTNALDVDKHMMAYIEENMKLRRG----TQDESKKDDGQADPYAEVFRITE 157
Query: 149 HLK--VKKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAK 202
K +K+ EE +S T I EV L ++ +LKNIEETE AK+ + E+R + +
Sbjct: 158 KYKPPTQKKEQEEGNVTNSLAMLTAIPEVDLGMDARLKNIEETEKAKRQIAEQRKDKQKQ 217
Query: 203 SDFSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR-Q 260
D + R Y LR + ++ +D ++ + G P D+ R Q
Sbjct: 218 GD-------DEAHLAGSRFYRPNLRAKSDADILRD--AKLEAMGLNPEDHEVRRHSDRPQ 268
Query: 261 AATDQFMLERFRKRER 276
ATD+ ++ERF+KR R
Sbjct: 269 MATDEMVMERFKKRMR 284
>gi|401406410|ref|XP_003882654.1| hypothetical protein NCLIV_024100 [Neospora caninum Liverpool]
gi|325117070|emb|CBZ52622.1| hypothetical protein NCLIV_024100 [Neospora caninum Liverpool]
Length = 344
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/56 (50%), Positives = 38/56 (67%), Gaps = 1/56 (1%)
Query: 141 DELYKIPEHLKVKKRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 195
+ELY++P+ L+V R+ E W TG+ EVQLP+ KLKNIE TE AK+ L +K
Sbjct: 224 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVQLPMTVKLKNIEATEKAKRALLKK 279
>gi|312090668|ref|XP_003146699.1| hypothetical protein LOAG_11128 [Loa loa]
Length = 288
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 102/199 (51%), Gaps = 22/199 (11%)
Query: 9 KEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA 68
K+K+++ R+R ++++E T + E E LE IK LQ+ R K+G+ A+ AL
Sbjct: 9 KKKQRHLRERIFDDDEITAEE----EAEIACKLEGIKELQESRVCKNGLNAVECALGKEL 64
Query: 69 AA------------GGGGLTKVSEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNMLKYV 114
AA GGG+ ++SE + ++ ++D F +E+ + E M KYV
Sbjct: 65 AAEFIAMDDDPFRQRGGGMLRLSEGRQAQMHAADIEAGIRDQFKKESFLRDEHEEMKKYV 124
Query: 115 EQELAKKRG-KNIDVNDRVENDLKHAEDEL-YKIPEHLKV--KKRNSEESSTQWTTGIAE 170
+ EL K++ ++++ D + + ED L +K E +++ +RN E S Q GI E
Sbjct: 125 QAELRKRKAVQDLEDGDATTSKVPSMEDSLMWKAAEKVRLFRSERNDELLSNQMLAGIPE 184
Query: 171 VQLPIEYKLKNIEETEAAK 189
V L I ++ NI ETE K
Sbjct: 185 VDLGINARMSNIIETEKKK 203
>gi|281201418|gb|EFA75630.1| hypothetical protein PPL_11136 [Polysphondylium pallidum PN500]
Length = 313
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 65/139 (46%), Gaps = 28/139 (20%)
Query: 143 LYKIPEHLKVKK-RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRA 201
LY+ P HL V K R EE T W GI+EV LP +YK+KNI ETE A R
Sbjct: 199 LYETPSHLLVNKGRKKEEDKTNWVAGISEVVLPTKYKIKNILETEEA-----------RE 247
Query: 202 KSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAA---GS 258
K D Q G + +L+ +R + ++ S ++S+D S
Sbjct: 248 KID------------QSGNTNTTTTNKNQSKLH-NRCNYNNVHASYTNESSSDTNQYEKS 294
Query: 259 RQAATDQFMLERFRKRERH 277
ATD+ + ERF+K+ R+
Sbjct: 295 SDKATDEEVYERFKKKFRY 313
>gi|299748182|ref|XP_001837525.2| hypothetical protein CC1G_01437 [Coprinopsis cinerea okayama7#130]
gi|298407852|gb|EAU84441.2| hypothetical protein CC1G_01437 [Coprinopsis cinerea okayama7#130]
Length = 289
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 67/221 (30%), Positives = 111/221 (50%), Gaps = 24/221 (10%)
Query: 73 GGLTK--VSEKNEGDGEKDE-----LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN 125
GGL K +E +E D E E +V + F Q+T + D +M+ Y+E+ L K RGK
Sbjct: 75 GGLRKPLAAEGDEDDEEAKEARARRVVRTNNFTQQTNALDVDKHMMAYIEENL-KVRGKP 133
Query: 126 IDVNDRVENDLKHAEDELYK-IPEHLKVKKRN--SEE----SSTQWTTGIAEVQLPIEYK 178
+ D + +D LY+ + + ++ K +EE +S T I EV L ++ +
Sbjct: 134 RNEEDDEDKKPLDPQDILYRQVADRFRLDKPKAATEEGNVTNSMSMLTAIPEVDLGMDTR 193
Query: 179 LKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLR-REHPELYKDR 237
LKNIEETE AK+++ E+R R K + + Y+ Y LR + ++ +D
Sbjct: 194 LKNIEETEKAKRVVAEER-QERKKVNPDEEHLVATRYWTV---YRPNLRAKSDADILRD- 248
Query: 238 GSQDDGAGSRPTDNSTDAAGS--RQAATDQFMLERFRKRER 276
++ + G P D++ + Q ATD+ ++ERF+KR R
Sbjct: 249 -AKLEAMGLPPQDDAHHRSNHDRAQTATDEIVMERFKKRMR 288
>gi|393240981|gb|EJD48505.1| hypothetical protein AURDEDRAFT_112943 [Auricularia delicata
TFB-10046 SS5]
Length = 306
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 109/237 (45%), Gaps = 36/237 (15%)
Query: 66 SAAAAGGGGLTKVSEKNEGDGEKDE-----LVLQDTFAQETAVMVEDPNMLKYVEQELAK 120
+AA GGL +V E + + E DE V + F Q+T + D +M+ Y+E+ + K
Sbjct: 73 TAAIVTPGGLREVREPVDEELEGDEAKARRTVRSNNFTQQTNALDVDKHMMNYIEENMRK 132
Query: 121 KRGKNIDVNDRVENDLKHAEDELYKI----PEHLKVK--------------KRNSEE--- 159
+ G D +D +N+ EL+++ P VK K+++EE
Sbjct: 133 RYG-GTDDDDEKKNEPWDPLAELFRVDPVFPSKPNVKPDASKSATAPVVSKKKDNEEGSV 191
Query: 160 -SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQE-KRLMGRAKSDFSIPSSYSADYFQ 217
+S T I EV L ++ +LKNIE+TE AK+ E K+ R D A F
Sbjct: 192 TNSATMLTAIPEVDLGMDTRLKNIEDTEKAKRTAAELKQERQRVDKD-----DLVAGRFY 246
Query: 218 RGRDYAEKLRREHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKR 274
RG+D + KL+ + E + + + G P + ++ ATD +ERF+KR
Sbjct: 247 RGQDKS-KLKTDE-ERLRSAMAVEMGNAPDPARRHEHRSQRKEVATDDIAMERFKKR 301
>gi|328860548|gb|EGG09654.1| hypothetical protein MELLADRAFT_74358 [Melampsora larici-populina
98AG31]
Length = 385
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 64/121 (52%), Gaps = 14/121 (11%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNI--DVNDRVENDLKHAE-------- 140
++ + F Q+T + D +M+ Y+E+EL ++R I ++ E L AE
Sbjct: 150 IIKSNNFTQQTNTLDVDKHMMHYIEEELKQRRKAAIAAGADESSEPILTGAEAVASLDPR 209
Query: 141 DELYKIPEHLKVKKRNSEES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
DELYKI E ++ ++ E S T I EV L I+ ++KNIE TE AK+ L ++R
Sbjct: 210 DELYKIAEKYRIDRKPVVEGNVTLSATMLTSIPEVDLGIDTRIKNIEATEKAKRKLADER 269
Query: 197 L 197
L
Sbjct: 270 L 270
>gi|449549468|gb|EMD40433.1| hypothetical protein CERSUDRAFT_148439 [Ceriporiopsis subvermispora
B]
Length = 287
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 97/196 (49%), Gaps = 24/196 (12%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAED--ELYKIPE 148
+V + F Q+T + D +M+ Y+E+ + ++G + E D + EL++I +
Sbjct: 105 VVRANNFTQQTNTLDVDKHMMAYIEENMKLRQG----AQNESEKDDGPLDPYAELFRIAD 160
Query: 149 HLKVKKRNSEE-------SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRA 201
K ++ + +E +S T I EV L ++ +LKNIEETE AK+L++E++ +
Sbjct: 161 KYKPRQDSEKEKEEGSVTNSLSMLTAIPEVDLGMDTRLKNIEETEKAKRLIEERKERKKQ 220
Query: 202 KSDFSIPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQ 260
D + + R Y L+ R ++ +D ++ + G P D+ Q
Sbjct: 221 ADDEA--------HLAATRFYRPNLKTRSDADIIRD--AKLEAMGLPPEDHDRPRHDRPQ 270
Query: 261 AATDQFMLERFRKRER 276
ATD+ ++ERF+KR R
Sbjct: 271 MATDEMVMERFKKRMR 286
>gi|221481742|gb|EEE20118.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 455
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 141 DELYKIPEHLKVKKRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 195
+ELY++P+ L+V R+ E W TG+ EV LP+ KLKNIE TE AK+ L +K
Sbjct: 335 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVHLPMTVKLKNIEATEKAKRALLKK 390
>gi|237832377|ref|XP_002365486.1| hypothetical protein TGME49_063860 [Toxoplasma gondii ME49]
gi|211963150|gb|EEA98345.1| hypothetical protein TGME49_063860 [Toxoplasma gondii ME49]
Length = 455
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 141 DELYKIPEHLKVKKRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 195
+ELY++P+ L+V R+ E W TG+ EV LP+ KLKNIE TE AK+ L +K
Sbjct: 335 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVHLPMTVKLKNIEATEKAKRALLKK 390
>gi|221502202|gb|EEE27940.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 455
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 37/56 (66%), Gaps = 1/56 (1%)
Query: 141 DELYKIPEHLKVKKRNSE-ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 195
+ELY++P+ L+V R+ E W TG+ EV LP+ KLKNIE TE AK+ L +K
Sbjct: 335 NELYRVPDRLQVADRSGEYREQLNWLTGLTEVHLPMTVKLKNIEATEKAKRALLKK 390
>gi|384500335|gb|EIE90826.1| hypothetical protein RO3G_15537 [Rhizopus delemar RA 99-880]
Length = 263
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 64/108 (59%), Gaps = 12/108 (11%)
Query: 95 DTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKH-----AEDELYKIPEH 149
D+F +T + D +M++Y+E E+ K++G + +E + K +ELY++P+
Sbjct: 103 DSFTTQTNKLDVDKHMMEYIESEMRKRKG--YKPQEEIEEEYKDKGFVDIYEELYRLPDQ 160
Query: 150 LKVKKRNSE-----ESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 192
LK +K+ SE + S+Q T I EV L I+ +L+NIEETE AK+ L
Sbjct: 161 LKGEKKESENEGNVQLSSQMLTAIPEVDLGIDTRLQNIEETEKAKRKL 208
>gi|66827193|ref|XP_646951.1| hypothetical protein DDB_G0268774 [Dictyostelium discoideum AX4]
gi|60475040|gb|EAL72976.1| hypothetical protein DDB_G0268774 [Dictyostelium discoideum AX4]
Length = 341
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 5/100 (5%)
Query: 143 LYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAK 202
L++ PEHLK +K EE T W GIAEVQLP YK KNI ETE AK L ++ R
Sbjct: 240 LFETPEHLKSQKGKVEEK-TNWVAGIAEVQLPDVYKYKNIVETEKAKDALDKQ---PRNY 295
Query: 203 SDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDD 242
P +++ +Y R YA ++ + + D+ + D+
Sbjct: 296 EKLLTPQNFNQNYQYHNR-YANNKKQRNEDKATDKEAMDN 334
>gi|393908085|gb|EFO17373.2| hypothetical protein LOAG_11128 [Loa loa]
Length = 266
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 102/199 (51%), Gaps = 22/199 (11%)
Query: 9 KEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAA 68
K+K+++ R+R ++++E T + E E LE IK LQ+ R K+G+ A+ AL
Sbjct: 9 KKKQRHLRERIFDDDEITAEE----EAEIACKLEGIKELQESRVCKNGLNAVECALGKEL 64
Query: 69 AA------------GGGGLTKVSEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNMLKYV 114
AA GGG+ ++SE + ++ ++D F +E+ + E M KYV
Sbjct: 65 AAEFIAMDDDPFRQRGGGMLRLSEGRQAQMHAADIEAGIRDQFKKESFLRDEHEEMKKYV 124
Query: 115 EQELAKKRG-KNIDVNDRVENDLKHAEDEL-YKIPEHLKV--KKRNSEESSTQWTTGIAE 170
+ EL K++ ++++ D + + ED L +K E +++ +RN E S Q GI E
Sbjct: 125 QAELRKRKAVQDLEDGDATTSKVPSMEDSLMWKAAEKVRLFRSERNDELLSNQMLAGIPE 184
Query: 171 VQLPIEYKLKNIEETEAAK 189
V L I ++ NI ETE K
Sbjct: 185 VDLGINARMSNIIETEKKK 203
>gi|324508095|gb|ADY43422.1| Unknown [Ascaris suum]
Length = 287
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 98/212 (46%), Gaps = 24/212 (11%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA------------GGGGLTKVSEKNEGD--G 86
L +IK +Q R R++G+ A+ AL AA GGG+ ++SE +
Sbjct: 37 LADIKEVQLSRLRRNGLNAVECALGKELAAEFVAMDDDPFRQRGGGMLRLSEGRQAQLHA 96
Query: 87 EKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVEN-DLKHAEDEL-Y 144
E ++D F +E+ + E M KYV+ EL K++ ND + + ED L +
Sbjct: 97 ADIEAGIRDQFKKESFLRDEHEEMKKYVQAELRKRKADYEPDNDESTSVKVPSVEDNLMW 156
Query: 145 KIPEHLKVKK--RNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK-KLLQEKRLMGRA 201
K E ++ K RN E S Q GI EV L I ++ NI ETE K ++L++ GR+
Sbjct: 157 KAAEKVRFFKSMRNDELLSNQMLAGIPEVDLGINARMSNIIETEKKKSEMLKDVIEHGRS 216
Query: 202 KSDFSI-----PSSYSADYFQRGRDYAEKLRR 228
++ ++ S DY Q Y E R
Sbjct: 217 LTEETLFQQERAKDLSKDYVQHSIFYMESTTR 248
>gi|238578425|ref|XP_002388713.1| hypothetical protein MPER_12237 [Moniliophthora perniciosa FA553]
gi|215450245|gb|EEB89643.1| hypothetical protein MPER_12237 [Moniliophthora perniciosa FA553]
Length = 191
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 94/189 (49%), Gaps = 22/189 (11%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRN 156
Q+T + D +M+ Y+E+ L K R + D + D + A LY++PE KV+++
Sbjct: 15 LTQQTNALDVDKHMMTYIEEHL-KIRSRPKDEEKKKPLDPQEA---LYQVPERWKVEQKK 70
Query: 157 SEE------SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSS 210
E +S T I EV L ++ +LKNIEETE AK+++ E R R ++P
Sbjct: 71 QETDDGSITNSMTMLTAIPEVDLGMDARLKNIEETEKAKRVVAEDRSDKR-----TVPK- 124
Query: 211 YSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGS--RQAATDQFM 267
++ R Y L+ + ++ +D ++ + G + D A Q ATD+ +
Sbjct: 125 -GEEHLVAARFYRPNLKAKSDADIMRD--AKLEAMGLQLQDEQPRRANQDRPQIATDELV 181
Query: 268 LERFRKRER 276
+ERF+KR R
Sbjct: 182 MERFKKRMR 190
>gi|392576231|gb|EIW69362.1| hypothetical protein TREMEDRAFT_17281, partial [Tremella
mesenterica DSM 1558]
Length = 288
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 63/113 (55%), Gaps = 7/113 (6%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRG-KNIDVNDRVENDLKHAEDEL------ 143
L+ + F Q+T + D +ML ++E+EL K+RG + +N+ + EL
Sbjct: 110 LIRTNNFTQQTNALDVDKHMLAFIEKELNKRRGAEAASKTSNTQNESFDPQSELLEVTKK 169
Query: 144 YKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
YKI +++K+++ S +S T I EV L +E +L+NIE TE AK+ + E R
Sbjct: 170 YKIEKNMKLEEEGSLTNSMGMLTTIPEVDLGMENRLRNIEATEKAKREMLESR 222
>gi|349805921|gb|AEQ18433.1| hypothetical protein [Hymenochirus curtipes]
Length = 157
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 53/88 (60%), Gaps = 4/88 (4%)
Query: 90 ELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEH 149
+L L +F+ ET ED +M+KY+E EL KK+G D +V+ LK AED LY++P+
Sbjct: 37 DLNLGTSFSAETNRRDEDADMMKYIETELKKKKGIVEDEEKKVK--LKSAEDCLYELPDS 94
Query: 150 LKVK--KRNSEESSTQWTTGIAEVQLPI 175
+KV K+ E S Q +GI EV L I
Sbjct: 95 IKVSSAKKTEEMLSNQMLSGIPEVDLGI 122
>gi|326428178|gb|EGD73748.1| hypothetical protein PTSG_11504 [Salpingoeca sp. ATCC 50818]
Length = 301
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 96/206 (46%), Gaps = 19/206 (9%)
Query: 4 PIPQKKEKKK--NFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIP 61
P PQ K+KK+ N +KR ++ D E+ ALEE QK R R+ GI A
Sbjct: 8 PAPQFKKKKRRGNLKKREKASMDDILKADEDVAEKIEAALEE----QKLRSREQGISAEE 63
Query: 62 SALQSAAAAGGG------GLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVEDPNMLKYV 114
A + GL K G + D + ++ FA+ET + D ML+Y+
Sbjct: 64 LAKRDEEIDEEEEQLIQYGLQTKKSKTSGGADDDAMKGIEKAFAEETGRLDRDKEMLQYI 123
Query: 115 EQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEES----STQWTTGIAE 170
+++L + GK + + + LYK+PE L+ +K+ E S+ GI E
Sbjct: 124 KEKLQESEGKK--PTGKQTSKYEQMMASLYKVPERLQEEKQEEEVERGMLSSAVLQGIPE 181
Query: 171 VQLPIEYKLKNIEETEAAKKLLQEKR 196
V L I+ K+KNIEETE AK + R
Sbjct: 182 VSLGIDEKIKNIEETERAKASIHRSR 207
>gi|358336454|dbj|GAA54958.1| hypothetical protein CLF_106187 [Clonorchis sinensis]
Length = 337
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 132/307 (42%), Gaps = 68/307 (22%)
Query: 34 EEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAAG-------------GGGLTKV-- 78
+E+ +E I+ LQK R+R GI SAL + AA GGL ++
Sbjct: 34 QEDSTHVVEAIRELQKVRKRPPGISL--SALSTGKAAPEETIIVSDPFKLKTGGLVEIRK 91
Query: 79 ---SEKNEGDGEKDEL--VLQDTFAQETAVMVEDPNM--------LKYVEQELAKKR--- 122
S+K E E+D++ L TFA ET ED M + + ++KK+
Sbjct: 92 AIRSKKTE---EEDDVEARLAKTFATETNKRDEDAEMFVPFHASLIPFTGLSISKKKSLD 148
Query: 123 GKNIDV----NDRVENDLKHA-----------EDELYKIPEHLK--VKKRNSEESSTQWT 165
GK+ DV N + + D+ ++ D L +PE+L+ + ++ + S Q
Sbjct: 149 GKDYDVLHLQNRKSDADVFYSAHSPDAVPNAGADLLRDVPEYLRPVIGQQKEDMLSNQML 208
Query: 166 TGIAEVQLPIEYKLKNIEETEAAKKLLQEKRL---MGRAKSDFSIPSSYSADYFQRGR-- 220
GI EV L ++ K++NIE TE AK+ L + R G A SD P++ + ++ Q R
Sbjct: 209 CGIPEVDLGVDAKMRNIEATEEAKQTLLKHRFNRGYGMA-SDGLAPTNVAVNFVQHSRWN 267
Query: 221 DYAEKLRREHPELYKDRGSQDDGAGSRPTD------NSTDAAGSRQAA---TDQFMLERF 271
+ + +D S A TD DA R A TD +L+RF
Sbjct: 268 SHNATTTFSSGDYTRDLLSIASKANPHKTDIVHQQTTGLDAERERLGAERSTDSLVLQRF 327
Query: 272 RKRERHR 278
+ R R
Sbjct: 328 KSHMRGR 334
>gi|392587037|gb|EIW76372.1| hypothetical protein CONPUDRAFT_130968 [Coniophora puteana
RWD-64-598 SS2]
Length = 188
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 90/181 (49%), Gaps = 11/181 (6%)
Query: 97 FAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRN 156
F Q+T + D +M++Y+E+ + K R D+++ + D +E + K K+ +
Sbjct: 17 FTQQTNTLDVDKHMMEYIEENIKKMRNAP-DLSEDIPADASESEHKSDKWNIEKKLAEEG 75
Query: 157 SEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYF 216
S +S T I EV L ++ +LKNIEETE AK++ E+R K ++ +A+ F
Sbjct: 76 SVTNSMAMLTAIPEVDLGMDARLKNIEETEKAKRIHAEER--KEKKRVYNDEEHLAANRF 133
Query: 217 QRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRE 275
R LR + E+ +D +Q + G P Q +TD+ ++ERF+KR
Sbjct: 134 FRP-----NLRQKTDAEIMRD--AQREAMGLPPIQERQRNYERPQMSTDEAVMERFKKRM 186
Query: 276 R 276
R
Sbjct: 187 R 187
>gi|301102378|ref|XP_002900276.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102017|gb|EEY60069.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 301
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 94/213 (44%), Gaps = 45/213 (21%)
Query: 92 VLQDTFAQETAVMVEDPN---MLKYVEQELAKKR---GK-NIDVNDRVENDLKHAEDELY 144
+L F ++A +D + M K++E+ L KKR GK N LK AED L+
Sbjct: 108 LLDGQFTGQSATTEKDQHVELMNKFIEERLQKKRKIDGKQNAGDVGDAAAALKTAEDRLF 167
Query: 145 KIPEHLKVKKRNSEESSTQWT--------TGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
++PEHL +S ++ + T TGIAEV+LP Y E TE A K E
Sbjct: 168 ELPEHLNPDVPSSSKNYDEGTEGGMLMGNTGIAEVELPSSY----AERTEKATKRALEAN 223
Query: 197 LMGRAKSDF-------SIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSRPT 249
AK D +P ++S D+ + +Y +++ + + ++RG + G
Sbjct: 224 KPRAAKLDAIGGLASSVVPGNFSTDFNRHKTNYVAEMKSLNKDEQRERGFRQVG------ 277
Query: 250 DNSTDAAGSRQAATDQFMLERFRKRE----RHR 278
+ ATD + +FRK E RHR
Sbjct: 278 ---------KNRATDNHAVSQFRKLESRKLRHR 301
>gi|83285872|ref|XP_729913.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23489078|gb|EAA21478.1| Arabidopsis thaliana At1g02330/T6A9_12-related [Plasmodium yoelii
yoelii]
Length = 209
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 39/59 (66%), Gaps = 2/59 (3%)
Query: 143 LYKIPEHLKVK-KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKK-LLQEKRLMG 199
LYK+P+ LKVK N+ + TGI EV LP+E KLKNIEETE K+ LL++ + M
Sbjct: 147 LYKLPDDLKVKTSTNNAQERLNCFTGINEVPLPLEMKLKNIEETEKIKRELLKKAKFMN 205
>gi|395329965|gb|EJF62350.1| hypothetical protein DICSQDRAFT_126606 [Dichomitus squalens
LYAD-421 SS1]
Length = 300
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 94/198 (47%), Gaps = 26/198 (13%)
Query: 91 LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
+V + F Q+T + D +M+ Y+E+ + +RG D D+ + EL+++
Sbjct: 116 VVRTNNFTQQTNALDVDKHMMAYIEENMKLRRGAKDD--DKKDEGPADPYAELFRL---T 170
Query: 151 KVKKRNSEE----SSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFS 206
K ++ EE +S T I EV L ++ +LKNIEETE AK+ + E R K+D
Sbjct: 171 KAAQKKEEEGNVTNSLAMLTAIPEVDLGMDTRLKNIEETEKAKRQISELRKERSKKADDE 230
Query: 207 IPSSYSADYFQRGRDYAEKLR-REHPELYKDRGSQDDGAGSRPTDNSTDAAGSR-QAATD 264
+ R Y L+ + ++ +D ++ + G RP D+ R Q ATD
Sbjct: 231 A-------HLAAARFYRPNLKAKSDADIMRD--AKLEAMGLRPDDHEYRRPSDRAQMATD 281
Query: 265 QFM------LERFRKRER 276
+ + +ERF+KR R
Sbjct: 282 EILTHLHQVMERFKKRMR 299
>gi|298713725|emb|CBJ48916.1| hypothetical protein (Partial) [Ectocarpus siliculosus]
Length = 463
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 42/82 (51%), Gaps = 19/82 (23%)
Query: 134 NDLKHAEDELYKIPEHLKVK------------KRNSE-----ESSTQ--WTTGIAEVQLP 174
N++ ED LY IPE K K R E S Q W TG+AE+ LP
Sbjct: 248 NNMLSEEDRLYTIPEDFKKKVEEAKVEFDTGANRGDEMDGEVGSGAQIAWNTGLAEIALP 307
Query: 175 IEYKLKNIEETEAAKKLLQEKR 196
IE+KLKN+E+T AA+ ++ R
Sbjct: 308 IEFKLKNMEDTLAARDKMESVR 329
>gi|348672270|gb|EGZ12090.1| hypothetical protein PHYSODRAFT_317356 [Phytophthora sojae]
Length = 297
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 95/212 (44%), Gaps = 43/212 (20%)
Query: 92 VLQDTFAQETAVMVEDPN---MLKYVEQELAKKR-----GKNIDVNDRVENDLKHAEDEL 143
+L F ++A +D + M +++E+ L KKR N D L+ AED+L
Sbjct: 104 LLDGQFTGQSATTDKDQHEELMNQFIEERLQKKRKTQQVSANGDGASDAAAALRTAEDKL 163
Query: 144 YKIPEHLKVKKRNSEESSTQWTT---------GIAEVQLPIEYKLKNIEETEAAKKLLQE 194
+++P++LK +S + T GIAEV+LP Y E TE A + E
Sbjct: 164 FELPDNLKPDVPSSSSAGYDDTAEGGMLMGNAGIAEVELPASY----AERTERATRTALE 219
Query: 195 KRLMGRAKSDF-------SIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDGAGSR 247
+ G K D ++P+++SAD+ + DY +++ + + ++RG + G
Sbjct: 220 QSKAGGVKRDAVGGLANSALPTNFSADFNRHKTDYVAEMKSLNKDEQRERGFRTVG---- 275
Query: 248 PTDNSTDAAGSRQAATDQFMLERFRKRERHRV 279
+ A+D + RFRK E ++
Sbjct: 276 -----------KNQASDDRAVSRFRKFESRKL 296
>gi|68066195|ref|XP_675081.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56494056|emb|CAH95798.1| conserved hypothetical protein [Plasmodium berghei]
Length = 209
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 2/59 (3%)
Query: 143 LYKIPEHLKVK-KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKK-LLQEKRLMG 199
LYK+P+ LKVK N+ + TGI EV LP+E KL+NIEETE K+ LL++ + M
Sbjct: 147 LYKLPDDLKVKTSTNNAQERLNCFTGINEVPLPLEMKLQNIEETEKIKRQLLKKAKFMS 205
>gi|115444733|ref|NP_001046146.1| Os02g0189900 [Oryza sativa Japonica Group]
gi|113535677|dbj|BAF08060.1| Os02g0189900 [Oryza sativa Japonica Group]
gi|218190225|gb|EEC72652.1| hypothetical protein OsI_06177 [Oryza sativa Indica Group]
Length = 105
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/30 (80%), Positives = 27/30 (90%), Gaps = 1/30 (3%)
Query: 146 IPEHLKVKKRNSEESSTQWTTGIAEVQLPI 175
+ +HLKV+K NSEESSTQWTTGIAEVQ PI
Sbjct: 3 VADHLKVRK-NSEESSTQWTTGIAEVQPPI 31
>gi|403169032|ref|XP_003328586.2| hypothetical protein PGTG_10545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167772|gb|EFP84167.2| hypothetical protein PGTG_10545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 399
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 100/222 (45%), Gaps = 48/222 (21%)
Query: 23 EEETTNKLSDDEEERRLALEEIKFLQKQRERKSGI---------PAIPSALQSAAAAGGG 73
+E + +++ ++E E R +EE+ L++ ++ + GI P ++ AAG G
Sbjct: 50 DEVSKDRVKEEEVEARRTVEEMIALRRLKQARVGIELQRLNAGEPKKKKKKKNPNAAGEG 109
Query: 74 G---------LTKVSEKNEGDGEKDE----------------LVLQDTFAQETAVMVEDP 108
+ ++ + D KD+ ++ + F Q+T + D
Sbjct: 110 ADGSEQGGKPVGANTDPLDDDPVKDDRLADDEPEDEDARTRKIIKSNHFTQQTNTLDVDK 169
Query: 109 NMLKYVEQELAKKRGKNIDVN--DRVENDLKHAE--------DELYKIPEHLKVKKRNSE 158
+M+ Y+E+EL ++R I + E LK E DELYKI E +++K+
Sbjct: 170 HMMAYIEEELQRRRTDAIAAGTIESSEPILKGLEAIASLDPRDELYKIAEKYRIQKKPVV 229
Query: 159 ES----STQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKR 196
E S T I EV L I+ +++N E TE AK+ L E+R
Sbjct: 230 EGNVTLSATMLTSIPEVDLGIDNRIRNFEATEKAKRQLTEQR 271
>gi|156099820|ref|XP_001615706.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804580|gb|EDL45979.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 225
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 142 ELYKIPEHLKVKKR-NSEESSTQWTTGIAEVQLPIEYKLKNIEETE 186
+LYK+ +HLKVK S TGI E+ LPIE KLKNIEETE
Sbjct: 166 DLYKLSDHLKVKSSVASNPEKLNCITGITEIPLPIEVKLKNIEETE 211
>gi|389585171|dbj|GAB67902.1| hypothetical protein PCYB_124680, partial [Plasmodium cynomolgi
strain B]
Length = 223
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 31/46 (67%), Gaps = 1/46 (2%)
Query: 142 ELYKIPEHLKVKKRN-SEESSTQWTTGIAEVQLPIEYKLKNIEETE 186
+LYK+ +HLKVK S + TGI E+ LPIE KLKNIEETE
Sbjct: 164 DLYKLSDHLKVKSSVVSNQEKLNCITGITEIPLPIEVKLKNIEETE 209
>gi|402226280|gb|EJU06340.1| hypothetical protein DACRYDRAFT_97824 [Dacryopinax sp. DJM-731 SS1]
Length = 299
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 81/179 (45%), Gaps = 17/179 (9%)
Query: 34 EEERRLALEEIKFLQKQRE--RKSGIPAI-----------PSALQSAAAAGGGGLTKVSE 80
E+ R L LE+I L+K R+ R GI P A GGL +
Sbjct: 47 EDARSLGLEDIIALRKYRQGMRHEGIDVGKLSKGEKRKRNPEGEDDGAVVEKGGLKRREF 106
Query: 81 KNEGDGEKDELVLQ----DTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDL 136
+ E + + E + + + F Q+T + + +M++Y+E E+ K+RG + +
Sbjct: 107 EGEVESSEAESIAKKLRANNFTQQTNALDVNKHMMEYIEGEIRKRRGDTASTEEGQKTGA 166
Query: 137 KHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 195
+L+K ++ + +S T I EV L +E +L+NIEETE AK+ E+
Sbjct: 167 YDPYAQLFKTDVKPDSREEAAISTSMAMLTAIPEVDLGMETRLRNIEETEKAKRQAAER 225
>gi|221059073|ref|XP_002260182.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193810255|emb|CAQ41449.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 227
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 142 ELYKIPEHLKVKKR-NSEESSTQWTTGIAEVQLPIEYKLKNIEETE 186
+LYK+ +HLKVK + TGI EV LPIE KLKNIEETE
Sbjct: 168 DLYKLSDHLKVKSSVAANPEKLNCITGITEVPLPIEVKLKNIEETE 213
>gi|124810282|ref|XP_001348824.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
gi|23497725|gb|AAN37263.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
Length = 226
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 81/166 (48%), Gaps = 33/166 (19%)
Query: 44 IKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQE-TA 102
+K LQ R +K GI + A TKV EK+ D EK +L F + T
Sbjct: 59 LKVLQHMRMKKKGI----------STANLNYETKVIEKH--DNEKK--LLDKHFTKNITE 104
Query: 103 VMVEDPNMLKYVEQ-------ELAKKRGKNID----------VNDRVENDLKHAEDELYK 145
+E+ ++ ++++ EL K+ + I+ +N + EN+ + LYK
Sbjct: 105 KEIEEAHIESFIKENMKEFYDELNNKKKQQIEQEHEHEQDQEINKKQENNDNDLINNLYK 164
Query: 146 IPEHLKVKKRNSEES-STQWTTGIAEVQLPIEYKLKNIEETEAAKK 190
+ +HLK+K + + S TGI EV +P+E K+KNIEETE K+
Sbjct: 165 LSDHLKIKTTHEDTSEKLNCITGITEVPIPLEIKMKNIEETEKFKR 210
>gi|222622342|gb|EEE56474.1| hypothetical protein OsJ_05693 [Oryza sativa Japonica Group]
Length = 135
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 19/24 (79%), Positives = 22/24 (91%)
Query: 152 VKKRNSEESSTQWTTGIAEVQLPI 175
+ ++NSEESSTQWTTGIAEVQ PI
Sbjct: 38 LVRKNSEESSTQWTTGIAEVQPPI 61
>gi|268562986|ref|XP_002638721.1| Hypothetical protein CBG00304 [Caenorhabditis briggsae]
Length = 243
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 80/186 (43%), Gaps = 36/186 (19%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA---------GGGGLTKVSEKNEGDGEKDEL 91
+ +I+ LQ+ RERK+G+ + A+ AA GGG + +K + E
Sbjct: 32 VADIRDLQRSRERKNGLTELECAVGITKAAALEDGIQMTGGGMIMTAKKKAAMEAASIEH 91
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQEL----------AKKRGKN------------IDVN 129
L+D F +ET + E + KY++ L ++ RG+ ++ +
Sbjct: 92 GLRDQFEKETMLRDEHEELRKYIDDGLTHYTKDTSTPSQPRGETPQAASASSRFSSLNAD 151
Query: 130 DRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 189
DR LK A +L K+ +E S GI EV L I ++ NI ETE K
Sbjct: 152 DRDVELLKEAAGKLKA-----NQSKKETELLSEHMLAGIPEVDLGISTRITNILETEKKK 206
Query: 190 KLLQEK 195
+ L +K
Sbjct: 207 RFLMQK 212
>gi|325182770|emb|CCA17225.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325189176|emb|CCA23700.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 267
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 6/83 (7%)
Query: 110 MLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL----KVKKRNSEESSTQWT 165
M +Y+E L + + D D+ ++ ++ +D LY + L + NS + W
Sbjct: 111 MNRYIEDRLGSIKVDSKD--DKSQDSIEKEDDALYALSTDLAPTPTTNETNSSDGVLIWN 168
Query: 166 TGIAEVQLPIEYKLKNIEETEAA 188
TGIAEV+LP YK K +E T++A
Sbjct: 169 TGIAEVELPSTYKNKIVEATKSA 191
>gi|17509215|ref|NP_492142.1| Protein T23G11.4 [Caenorhabditis elegans]
gi|3880111|emb|CAB03415.1| Protein T23G11.4 [Caenorhabditis elegans]
Length = 240
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 88/187 (47%), Gaps = 27/187 (14%)
Query: 30 LSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAA--------GGGGLTKVSEK 81
++ DEE R++ +I+ LQ+ RERK+G+ + A+ + AA GGG+ S+K
Sbjct: 23 VAADEESSRVS--DIRDLQRSRERKNGLTELECAVGISKAAALEDGIQMAGGGMVMTSKK 80
Query: 82 NEG-DGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK--------KRGKNIDVNDRV 132
+ E L++ F +ET + E + KY++ L + ++ K +
Sbjct: 81 KAAMEAASIEQGLREQFEKETMLRDEHEELRKYIDDGLQEYTADTSKIEKQKQPSSSAAA 140
Query: 133 ENDLKHAED---ELYKIPEHLKVK----KRNSEESSTQWTTGIAEVQLPIEYKLKNIEET 185
+ +AED EL K KVK K+ +E S GI EV L I ++ NI ET
Sbjct: 141 KFSSLNAEDRDVELLKQAAG-KVKGNQSKKETELLSEHMLAGIPEVDLGISTRITNILET 199
Query: 186 EAAKKLL 192
E K+ L
Sbjct: 200 EKKKRFL 206
>gi|330840141|ref|XP_003292079.1| hypothetical protein DICPUDRAFT_156758 [Dictyostelium purpureum]
gi|325077714|gb|EGC31409.1| hypothetical protein DICPUDRAFT_156758 [Dictyostelium purpureum]
Length = 265
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 116/279 (41%), Gaps = 57/279 (20%)
Query: 7 QKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
+KK+K +N RK+ +E +N + EE+ LE K QK RE+ G+ +
Sbjct: 36 KKKDKVRNLRKK----DETNSNLEVEGEEDEENILELTKEKQKLREKGKGL--------N 83
Query: 67 AAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAK------ 120
G K + + + D + QE + +E KY+ ++L
Sbjct: 84 VGILAEGPHIKQNFRELENKLDDSFTAHNEDKQEVNLHLE-----KYINEQLELKKQKQK 138
Query: 121 --KRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYK 178
K N + N+ ++ + E L++ PEHLK ++ E T W GIAEVQLP +K
Sbjct: 139 QSKTDNNDNNNNENNSNTELKESSLFETPEHLKRNEKKQNEDKTNWVAGIAEVQLPEVFK 198
Query: 179 LKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRG 238
KN+ ETE A+ +++ + + P +++ +Y + H ++
Sbjct: 199 YKNMVETEKARDAMEKDS--DKHTEKLNTPQNFNQNY------------QYHNRFINNKK 244
Query: 239 SQDDGAGSRPTDNSTDAAGSRQAATDQFMLERFRKRERH 277
DD ATDQ +E F+KR R+
Sbjct: 245 RSDD------------------KATDQEAVENFKKRFRY 265
>gi|353238535|emb|CCA70478.1| hypothetical protein PIIN_04416 [Piriformospora indica DSM 11827]
Length = 299
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 126/326 (38%), Gaps = 94/326 (28%)
Query: 9 KEKKKNFRKR--SYEEEEETTNKLSDDEEE--------RRLALEEIKFLQKQRERKSGIP 58
K++ KN RK + E++EE +S +E R+ +LE++ L+ R+ + GI
Sbjct: 5 KQRPKNQRKHEENVEKQEEQDGHVSGEETPAKEATPAIRQSSLEDMIALRNMRKARQGID 64
Query: 59 AIPSALQSAAAAGGGGLTK--------------VSEKNEGDGEKD---------ELVLQD 95
A S A GG K + E DGE D V
Sbjct: 65 A------SKLATGGQKKRKNEEEEYESEKPRYGLHTPKEDDGEDDLEGAMAKARRAVRMS 118
Query: 96 TFAQETAVMVEDPNMLKYVEQEL-AKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK- 153
F Q+T + D +M+ Y+E+ L K+ + + E+ + K E K
Sbjct: 119 NFTQQTNALDVDKHMMAYIEENLKLMKQQAGTSIEEDASKAPPTTEEGMLKFGERYKTHG 178
Query: 154 ---KRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAK-------- 202
K S +S + I EV L ++ +L+NIEETE AK++ E++ A+
Sbjct: 179 IELKEGSVGNSLAMLSAIPEVDLGMDARLRNIEETEKAKRIAAEEKRAREAQRMNPDEAR 238
Query: 203 ---SDFSIP---------SSYSADYFQRGRDYAEKLRREH--PELYKDRGSQDDGAGSRP 248
+ F P + A Y G D EK +R+H PEL
Sbjct: 239 LAATRFYNPHLRQESDQEAIKQAKYKALGIDVPEKSQRKHERPEL--------------- 283
Query: 249 TDNSTDAAGSRQAATDQFMLERFRKR 274
A+D+ ++ERFRKR
Sbjct: 284 -------------ASDEAVMERFRKR 296
>gi|341883974|gb|EGT39909.1| hypothetical protein CAEBREN_32234 [Caenorhabditis brenneri]
gi|341886393|gb|EGT42328.1| hypothetical protein CAEBREN_25065 [Caenorhabditis brenneri]
Length = 244
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 94/220 (42%), Gaps = 44/220 (20%)
Query: 7 QKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQS 66
+K +KK RK S EEE+ D++E R++ +I+ LQK RERK+G+ + A+
Sbjct: 6 RKPKKKIQQRKISAEEEQ------FDEDESTRVS--DIRDLQKSRERKNGLTELECAVGI 57
Query: 67 AAAAG--------GGGLTKVSEKNEGDGEKD-ELVLQDTFAQETAVMVEDPNMLKYVEQE 117
AA GGG+ ++K E L++ F +ET + E + KY++
Sbjct: 58 TKAAALEDGIQMSGGGMQMTAKKKAAMEAASIEHGLREQFEKETMLRDEHEELRKYIDDG 117
Query: 118 L----------------------AKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKR 155
L + +++ DR LK A ++ K+
Sbjct: 118 LTHYTKDTSKGSSSRRDPPPEQSTSSKFSSLNAEDRDVELLKQAAGKV-----RANQGKK 172
Query: 156 NSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 195
+E S GI EV L I ++ NI ETE K+ L EK
Sbjct: 173 ETELLSEHMLAGIPEVDLGIGSRITNILETEKKKRFLLEK 212
>gi|167521894|ref|XP_001745285.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776243|gb|EDQ89863.1| predicted protein [Monosiga brevicollis MX1]
Length = 482
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 84/186 (45%), Gaps = 36/186 (19%)
Query: 14 NFRKRSYEEEEETTNKLSDDEEERRL--ALEEIKFLQKQRERKSGIPA---IPSALQSA- 67
N R+R EE + D EEE +L ALE+ QK R R +GI A +P +
Sbjct: 129 NTRQREKVSLEELVSH--DAEEETKLQEALED----QKMRRRTTGIDATGEVPDKTSNTK 182
Query: 68 -------AAAGGGGLTK---VSEKNEGDGEKDELVLQD---TFAQETAVMVEDPNMLKYV 114
+ GG + K + +++ +D +++ TF+ T +D MLKY+
Sbjct: 183 TDKPEGWSLTAGGLMPKGPIIQDRDRDRDNEDARIMKGIDATFSSGTGQRDQDAEMLKYI 242
Query: 115 EQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKR----------NSEESSTQW 164
E L ++RG+ D N ++ + ELY +PE +V+KR N+ ST
Sbjct: 243 EDRLQEERGEAQDTNS-AASEYERRIKELYAVPEAFQVEKRPEVDEDAAVANAGMLSTSL 301
Query: 165 TTGIAE 170
+GI E
Sbjct: 302 LSGIPE 307
>gi|308476864|ref|XP_003100647.1| hypothetical protein CRE_20426 [Caenorhabditis remanei]
gi|308264665|gb|EFP08618.1| hypothetical protein CRE_20426 [Caenorhabditis remanei]
Length = 246
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 76/187 (40%), Gaps = 37/187 (19%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA---------GGGGLTKVSEKNEGDGEKDEL 91
+ +I+ LQ+ RERK+G+ + A+ AA GGG + +K + E
Sbjct: 32 VSDIRDLQRSRERKNGLTELECAVGITKAAALEDGIQMTGGGMMMTAKKKAAMEAASIEH 91
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN-----------------------IDV 128
L+D F +ET + E + KY++ L N ++
Sbjct: 92 GLRDQFEKETMLRDEHEELRKYIDDGLTHYTKDNSSNSTQKTEKEPKIQSTSSKFSSLNA 151
Query: 129 NDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAA 188
+DR LK A ++ K+ +E S GI EV L I ++ NI ETE
Sbjct: 152 DDRDVELLKEAATKV-----RANQGKKETELLSEHMLAGIPEVDLGISTRITNILETEKK 206
Query: 189 KKLLQEK 195
K+ L +K
Sbjct: 207 KRFLLQK 213
>gi|308459412|ref|XP_003092026.1| hypothetical protein CRE_23169 [Caenorhabditis remanei]
gi|308254444|gb|EFO98396.1| hypothetical protein CRE_23169 [Caenorhabditis remanei]
Length = 246
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 76/187 (40%), Gaps = 37/187 (19%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAAAA---------GGGGLTKVSEKNEGDGEKDEL 91
+ +I+ LQ+ RERK+G+ + A+ AA GGG + +K + E
Sbjct: 32 VSDIRDLQRSRERKNGLTELECAVGITKAAALEDGIQMTGGGMMMTAKKKAAMEAASIEH 91
Query: 92 VLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKN-----------------------IDV 128
L+D F +ET + E + KY++ L N ++
Sbjct: 92 GLRDQFEKETMLRDEHEELRKYIDDGLTHYTKDNSSNSTQKTEKEPKIQSTSYKFSSLNA 151
Query: 129 NDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAA 188
+DR LK A ++ K+ +E S GI EV L I ++ NI ETE
Sbjct: 152 DDRDVELLKEAAAKV-----RANQGKKETELLSEHMLAGIPEVDLGISTRITNILETEKK 206
Query: 189 KKLLQEK 195
K+ L +K
Sbjct: 207 KRFLLQK 213
>gi|317027296|ref|XP_001400603.2| hypothetical protein ANI_1_2038024 [Aspergillus niger CBS 513.88]
Length = 341
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 63/145 (43%), Gaps = 11/145 (7%)
Query: 50 QRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVE-D 107
QR RK GI + S AG T VS E D E + L + D F T V+ D
Sbjct: 70 QRARKGGIEF---SATSRPPAGKNSQTAVSTVAEEDQENERLRAMCDRFTAHTGQTVDVD 126
Query: 108 PNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTG 167
+M+ ++E E+AK+ +++ + + E P+ + + +
Sbjct: 127 KHMMDFIESEMAKRYRRDMPTEIAMHDTPSATE------PKAAALSSADLPQRGPASLGK 180
Query: 168 IAEVQLPIEYKLKNIEETEAAKKLL 192
+ E+ L E KL+NI TEAA K L
Sbjct: 181 LHEIDLGHETKLQNIARTEAATKRL 205
>gi|302843629|ref|XP_002953356.1| hypothetical protein VOLCADRAFT_105874 [Volvox carteri f.
nagariensis]
gi|300261453|gb|EFJ45666.1| hypothetical protein VOLCADRAFT_105874 [Volvox carteri f.
nagariensis]
Length = 562
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 55/111 (49%), Gaps = 9/111 (8%)
Query: 93 LQDTFAQETAVMV---EDPNMLKYVEQELAKKRGKN---IDVNDRVENDLK--HAEDELY 144
+ DT+ + ++ ED +M KY+E++LA + GK + ++ ++ + K E ELY
Sbjct: 195 VMDTYVKAKSIATQQDEDAHMQKYIEEQLAVRLGKTAAQVSEDEELDPEAKKRKIEAELY 254
Query: 145 KIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEK 195
+P K E + ++EV L KL +IE TEA K+ L K
Sbjct: 255 AVPADFK-NTLEQEVVLPGLVSTLSEVPLSARDKLASIEATEALKRKLLAK 304
>gi|121715222|ref|XP_001275220.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119403377|gb|EAW13794.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 336
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 87/202 (43%), Gaps = 38/202 (18%)
Query: 50 QRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVE-DP 108
QR RK GI ++ Q++ G G + + + E+ + D F T V+ D
Sbjct: 68 QRGRKGGIEFSTTSRQTSDRTGSQGAGAMVSAEDLENERIR-AMCDRFTVHTGQTVDVDK 126
Query: 109 NMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGI 168
+M+ Y+E E+AK+ ++ D ++D ++ +S + T +
Sbjct: 127 HMMAYIENEMAKR------YRPKMPTDTTGSDDAA------------SNRTTSDGFATAV 168
Query: 169 A------------EVQLPIEYKLKNIEETEAA-KKLLQEKRLMGRAKSDFSIPSSYSADY 215
A E+ L E K++NI TEAA +KL+ E G +S +S +A
Sbjct: 169 ASKREPASLGKLHEIDLGQETKMQNIARTEAATRKLVGEDMSPGLGES-----ASSTAGA 223
Query: 216 FQRGRDYAEKLRREHPELYKDR 237
+ GR + + RR ++ +DR
Sbjct: 224 GKAGRPWRNRKRRNSEDIERDR 245
>gi|357608205|gb|EHJ65877.1| hypothetical protein KGM_17821 [Danaus plexippus]
Length = 168
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 16/111 (14%)
Query: 41 LEEIKFLQKQRERKSGIPAIPSALQSAA-----------AAGGGGLTKVSEKNEGDGEKD 89
LEE K +QK RER +G+ + A A + GG+ + G ++
Sbjct: 43 LEEAKEIQKLRERPNGVSVVALATGQATISEEITCKDPFSVKSGGMINMQALKSGKVKQV 102
Query: 90 E----LVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDL 136
E + F+ ET ED M+KY+E++LAK++G+ + VN+ + +L
Sbjct: 103 EDAYDTGIGTQFSAETNKRDEDEEMMKYIEEQLAKRKGRCM-VNNSIYYNL 152
>gi|403339735|gb|EJY69129.1| hypothetical protein OXYTRI_10252 [Oxytricha trifallax]
Length = 248
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 142 ELYKIPEHLKVKKRNSEE---SSTQWTTGIAEVQLPIEYKLKNIEETE 186
+LYKIP HL V K + W G+ EV + E K++N+EE E
Sbjct: 132 DLYKIPAHLDVDKLKQDRRLIDQKTWINGLMEVPISQEKKIQNVEEAE 179
>gi|326471260|gb|EGD95269.1| hypothetical protein TESG_02759 [Trichophyton tonsurans CBS 112818]
Length = 326
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 68/155 (43%), Gaps = 9/155 (5%)
Query: 39 LALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFA 98
+ + ++ L+K R RK GI ++ ++ L +E DG + D F
Sbjct: 56 IGVADVLRLRKNRSRKGGI-----EFSTSRSSRSDALVPAAETTT-DGRLSGI--SDRFV 107
Query: 99 QETAVMVE-DPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNS 157
+ V+ D +M+ ++E E+AK+R + + + +++ + EH V +
Sbjct: 108 GHSGQKVDVDKHMMAFIEAEMAKRRHGTLPSENSNPTNPADGQNQSRQAAEHTPVAELQL 167
Query: 158 EESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 192
+ + E+ L + KL+NI TEAA + L
Sbjct: 168 PQRQPATLGKLHEIDLGPDSKLQNIARTEAATRNL 202
>gi|315041258|ref|XP_003170006.1| hypothetical protein MGYG_08185 [Arthroderma gypseum CBS 118893]
gi|311345968|gb|EFR05171.1| hypothetical protein MGYG_08185 [Arthroderma gypseum CBS 118893]
Length = 326
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 52/118 (44%), Gaps = 9/118 (7%)
Query: 92 VLQDTFAQETAVMVE-DPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHL 150
+ D F + V+ D +M+ ++E E+AK+R + + +D ++ +L EH
Sbjct: 101 AISDRFVGHSGQKVDVDKHMMAFIEAEMAKRRHGTLPSENNDPSDPAESQSQLRHGAEHT 160
Query: 151 KVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIP 208
+ + + E+ L + KL+NI TEAA + L AK D + P
Sbjct: 161 PAADLHLPQRQPATLGKLHEIDLGPDSKLQNIARTEAATRNL--------AKGDSAAP 210
>gi|261194974|ref|XP_002623891.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
gi|239587763|gb|EEQ70406.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
Length = 348
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 49/101 (48%), Gaps = 6/101 (5%)
Query: 93 LQDTFAQETAVMVE-DPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLK 151
+ D F T V+ D +M+ Y+E E+ K+ ++ + N ND + +E + + L
Sbjct: 119 ISDRFVAHTGQRVDVDKHMMAYIESEMTKRHQQHKNNNATDNNDQQVSEATVQSLNSDLV 178
Query: 152 VKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 192
+ +R + E+ L + KL+NIE TEAA + L
Sbjct: 179 LPQRQPASLGK-----LHEIDLGPDAKLRNIERTEAATRRL 214
>gi|326479356|gb|EGE03366.1| hypothetical protein TEQG_02400 [Trichophyton equinum CBS 127.97]
Length = 326
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 68/155 (43%), Gaps = 9/155 (5%)
Query: 39 LALEEIKFLQKQRERKSGIPAIPSALQSAAAAGGGGLTKVSEKNEGDGEKDELVLQDTFA 98
+ + ++ L+K R RK GI ++ ++ L +E DG + D F
Sbjct: 56 IGVADVLRLRKNRSRKGGI-----EFSTSRSSRSDALVPAAETTT-DGRLSGI--SDRFV 107
Query: 99 QETAVMVE-DPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNS 157
+ V+ D +M+ ++E E+AK+R + + + +++ + EH V +
Sbjct: 108 GHSGQKVDVDKHMMAFIEAEMAKRRHGTLPSENSNPTNPADGQNQSRQAAEHTPVAELQL 167
Query: 158 EESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLL 192
+ + E+ L + KL+NI TEAA + L
Sbjct: 168 PQRQPATLGKLHEIDLGPDSKLQNIARTEAATRNL 202
>gi|119480413|ref|XP_001260235.1| hypothetical protein NFIA_082890 [Neosartorya fischeri NRRL 181]
gi|119408389|gb|EAW18338.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 344
Score = 37.0 bits (84), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 87/204 (42%), Gaps = 40/204 (19%)
Query: 50 QRERKSGIPAIPSALQSAAAAGG-GGLTKVSEKNEGDGEKDEL-VLQDTFAQETAVMVE- 106
QR RK GI ++ Q G +T V+ + D E +++ + D F T V+
Sbjct: 74 QRARKGGIEFSNTSRQRTDKTGNQAAVTTVTAE---DLENEKIRAMCDRFTAYTGQTVDV 130
Query: 107 DPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTT 166
D +M+ Y+E E+AK+ + + N N S+ SS +T
Sbjct: 131 DKHMMAYIETEMAKRHRQQMPANTTDSN------------------ISTTSQASSDGLST 172
Query: 167 GIA------------EVQLPIEYKLKNIEETEAA-KKLLQEKRLMGRAKSDFSIPSSYSA 213
+A E+ L E KL+NI TEAA ++L+ + R + A D +S A
Sbjct: 173 TVALQREPASLGKLHEIDLGQEAKLQNIARTEAATRRLVGDDRDVSPANED---STSSIA 229
Query: 214 DYFQRGRDYAEKLRREHPELYKDR 237
+ GR + + RR ++ +DR
Sbjct: 230 ASGKDGRPWRNRKRRNSEDIERDR 253
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.127 0.340
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,414,007,254
Number of Sequences: 23463169
Number of extensions: 195162964
Number of successful extensions: 1130508
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 182
Number of HSP's successfully gapped in prelim test: 4462
Number of HSP's that attempted gapping in prelim test: 1088554
Number of HSP's gapped (non-prelim): 27583
length of query: 282
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 141
effective length of database: 9,050,888,538
effective search space: 1276175283858
effective search space used: 1276175283858
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 76 (33.9 bits)