BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019962
(333 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225441674|ref|XP_002282707.1| PREDICTED: uncharacterized protein C4orf29 homolog [Vitis vinifera]
gi|147852945|emb|CAN81264.1| hypothetical protein VITISV_030682 [Vitis vinifera]
Length = 359
Score = 540 bits (1391), Expect = e-151, Method: Compositional matrix adjust.
Identities = 273/310 (88%), Positives = 290/310 (93%), Gaps = 1/310 (0%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MVTVNLGMLH++LDHVYGAFMHRTKISPPFFSRGWGG+KL+LLER+IKQLFPE +NWP
Sbjct: 1 MVTVNLGMLHHILDHVYGAFMHRTKISPPFFSRGWGGAKLDLLERMIKQLFPE-AAENWP 59
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
PSLIQPIW+T+WET+TA LREGVF+TPCDE+L+SALPPESH ARVAFL PK VPPQKMAC
Sbjct: 60 PSLIQPIWKTVWETKTACLREGVFKTPCDERLLSALPPESHTARVAFLTPKFVPPQKMAC 119
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRP+LQRGAKLLCVSDLLLLG
Sbjct: 120 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPVLQRGAKLLCVSDLLLLG 179
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
RATIEEAR LLHWL+ EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA
Sbjct: 180 RATIEEARSLLHWLDSEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 239
Query: 241 FCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
FCEGILKH TAWEALRE+LA +K AMTLE+VRERMRNVLSLTDVTRFPIPK PNAVIFVA
Sbjct: 240 FCEGILKHATAWEALREDLAVQKAAMTLEDVRERMRNVLSLTDVTRFPIPKNPNAVIFVA 299
Query: 301 ATVSTVFDYH 310
           AT       H
Sbjct: 300 ATDDGYIPKH 309
>gi|224120236|ref|XP_002330998.1| predicted protein [Populus trichocarpa]
gi|222872928|gb|EEF10059.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 538 bits (1385), Expect = e-150, Method: Compositional matrix adjust.
Identities = 267/310 (86%), Positives = 288/310 (92%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MVTVN+GMLHYV+DH+YGAFMHRTKISPPFFSRGWGGSKLELLER+I+ LFPE+EGQNWP
Sbjct: 1 MVTVNIGMLHYVIDHIYGAFMHRTKISPPFFSRGWGGSKLELLERMIEDLFPEVEGQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+LIQPIWRT+WET+TA LREGVFRT CDEQL+SALPPESH ARVAFLAPKCVPPQK AC
Sbjct: 61 PTLIQPIWRTVWETRTACLREGVFRTTCDEQLISALPPESHTARVAFLAPKCVPPQKTAC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDHTF+RRLRLGGPLLK+NIATMVLESPFYG+RRP+LQ GAKLLCVSDLLLLG
Sbjct: 121 VVHLAGTGDHTFDRRLRLGGPLLKQNIATMVLESPFYGRRRPMLQCGAKLLCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
RATIEE R LLHWL+ E GFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA
Sbjct: 181 RATIEETRSLLHWLDSEGGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
Query: 241 FCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
FC+GIL++GTAWEALRE+LA +K MTLEE R+RMRNVLSLTDVTRFPIPK PNAVIFVA
Sbjct: 241 FCDGILRYGTAWEALREDLAVQKTVMTLEEARQRMRNVLSLTDVTRFPIPKNPNAVIFVA 300
Query: 301 ATVSTVFDYH 310
           AT       H
Sbjct: 301 ATDDGYIPKH 310
>gi|356572068|ref|XP_003554192.1| PREDICTED: uncharacterized protein C4orf29 homolog [Glycine max]
Length = 353
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 266/303 (87%), Positives = 284/303 (93%)
Query: 8 MLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPI 67
MLHYVLDHVYGAFMHRTK+S PFFSRGWGG+KLE+LER+IKQLFPE+EG NWPPS+I+P+
Sbjct: 1 MLHYVLDHVYGAFMHRTKMSTPFFSRGWGGTKLEMLERVIKQLFPEVEGHNWPPSMIEPV 60
Query: 68 WRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGT 127
WRT+WET+ A LREGVFRTPC+EQL+ ALPPESH ARVAFL PK VPP KMACVVHLAGT
Sbjct: 61 WRTVWETKMASLREGVFRTPCEEQLLGALPPESHTARVAFLMPKSVPPHKMACVVHLAGT 120
Query: 128 GDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEA 187
GDHTFERRLRLGGPL+KENIATMVLESPFYGQRRP+LQRGAKLLCVSDLLLLGRATIEEA
Sbjct: 121 GDHTFERRLRLGGPLMKENIATMVLESPFYGQRRPVLQRGAKLLCVSDLLLLGRATIEEA 180
Query: 188 RCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILK 247
R LLHWL+ EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILK
Sbjct: 181 RSLLHWLDSEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILK 240
Query: 248 HGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTVF 307
HGTAWEALR++LAA+KVAMTLEEVRERMRNVLSLTDVTRFPIPK PNAVIFVAAT
Sbjct: 241 HGTAWEALRKDLAAQKVAMTLEEVRERMRNVLSLTDVTRFPIPKNPNAVIFVAATDDGYI 300
Query: 308 DYH 310
             H
Sbjct: 301 PKH 303
>gi|449437601|ref|XP_004136580.1| PREDICTED: uncharacterized protein C4orf29 homolog [Cucumis
sativus]
gi|449515482|ref|XP_004164778.1| PREDICTED: uncharacterized protein C4orf29 homolog [Cucumis
sativus]
Length = 360
Score = 530 bits (1365), Expect = e-148, Method: Compositional matrix adjust.
Identities = 264/310 (85%), Positives = 284/310 (91%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MVTVNLGMLHYVLDHVYGAFMHRTK+SPPFFSRGWGGSKL+LLE++IKQLFP++ Q WP
Sbjct: 1 MVTVNLGMLHYVLDHVYGAFMHRTKLSPPFFSRGWGGSKLDLLEKMIKQLFPDVAAQAWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
PSLI+PIWRT+WE +TA LREG FRTPCDEQL++ALPPESHNARVAFL PK VP KM+C
Sbjct: 61 PSLIKPIWRTVWENETARLREGFFRTPCDEQLLAALPPESHNARVAFLMPKSVPTHKMSC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDH+FERRLRLGGPLLK+NIATMVLESPFYGQRRP+LQ GAKLLCVSDLLLLG
Sbjct: 121 VVHLAGTGDHSFERRLRLGGPLLKDNIATMVLESPFYGQRRPILQHGAKLLCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
RATIEEAR LLHWL+ EAGFGKMGVCGLSMGGVHAAMVGSLHPTP+ATLPFLSPHSAVVA
Sbjct: 181 RATIEEARSLLHWLDSEAGFGKMGVCGLSMGGVHAAMVGSLHPTPIATLPFLSPHSAVVA 240
Query: 241 FCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
FCEGILKHGTAWEALR +L ++ AMTLEEVRERMRNVLSLTDVTRFPIPK PNAVI VA
Sbjct: 241 FCEGILKHGTAWEALRNDLGLQQSAMTLEEVRERMRNVLSLTDVTRFPIPKNPNAVILVA 300
Query: 301 ATVSTVFDYH 310
           AT       H
Sbjct: 301 ATDDGYIPKH 310
>gi|357509847|ref|XP_003625212.1| hypothetical protein MTR_7g092680 [Medicago truncatula]
gi|124360671|gb|ABN08660.1| hypothetical protein MtrDRAFT_AC157891g33v2 [Medicago truncatula]
gi|355500227|gb|AES81430.1| hypothetical protein MTR_7g092680 [Medicago truncatula]
Length = 360
Score = 523 bits (1348), Expect = e-146, Method: Compositional matrix adjust.
Identities = 262/310 (84%), Positives = 285/310 (91%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MVTVNLGMLHYVLDHVYGAFMHRTK+S PFFSRGWGG+KL++LE +I QLFP++ Q+ P
Sbjct: 1 MVTVNLGMLHYVLDHVYGAFMHRTKMSTPFFSRGWGGTKLDMLENMINQLFPDLGRQSLP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+ IQP+W+T+WET+TA LREGVFRTPC++QL+ ALPPESH ARVAFL PK VPPQ MAC
Sbjct: 61 PTEIQPVWKTVWETRTACLREGVFRTPCEDQLLGALPPESHIARVAFLMPKSVPPQNMAC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDHTFERRLRLGGPL+KENIATMVLESPFYGQRRP+LQRGAKLLCVSDLLLLG
Sbjct: 121 VVHLAGTGDHTFERRLRLGGPLVKENIATMVLESPFYGQRRPVLQRGAKLLCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
RATIEEAR LLHWL++EAGFGKMGVCGLSMGGVHAAMVGSLHPTP+AT PFLSPHSAVVA
Sbjct: 181 RATIEEARSLLHWLDFEAGFGKMGVCGLSMGGVHAAMVGSLHPTPIATFPFLSPHSAVVA 240
Query: 241 FCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
FCEGILKHGTAWEALR +LAA+KVAMTLEEVRERMRNVLSLTDVTRFPIPK PNAVI VA
Sbjct: 241 FCEGILKHGTAWEALRNDLAAEKVAMTLEEVRERMRNVLSLTDVTRFPIPKNPNAVILVA 300
Query: 301 ATVSTVFDYH 310
           AT       H
Sbjct: 301 ATDDGYIPKH 310
>gi|297834016|ref|XP_002884890.1| hypothetical protein ARALYDRAFT_478575 [Arabidopsis lyrata subsp.
lyrata]
gi|297330730|gb|EFH61149.1| hypothetical protein ARALYDRAFT_478575 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 252/311 (81%), Positives = 285/311 (91%), Gaps = 1/311 (0%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFP-EIEGQNW 59
MV++ LGMLHYV+DHVYGAFMHRTKI+PPFFSRGWGG LELLER++++LFP E +GQNW
Sbjct: 1 MVSIKLGMLHYVIDHVYGAFMHRTKITPPFFSRGWGGPNLELLERMVQRLFPLEAQGQNW 60
Query: 60 PPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMA 119
PP L++P+WRT+WET+TA LREGVF+TPC ++L +ALPPES ARVA+L PK VPPQKMA
Sbjct: 61 PPPLVRPVWRTVWETKTATLREGVFQTPCADELTAALPPESRTARVAWLVPKNVPPQKMA 120
Query: 120 CVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLL 179
CVVHLAGTGDHT++RRLRLGGPL+K+NIATMVLESPFYGQRRP LQRGA+LLCVSDLLLL
Sbjct: 121 CVVHLAGTGDHTYDRRLRLGGPLVKQNIATMVLESPFYGQRRPFLQRGARLLCVSDLLLL 180
Query: 180 GRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVV 239
GRATIEE+R L+HWL+ E GFGKMGVCGLSMGGVHA+MVGSLHPTPVATLPFLSPHSAVV
Sbjct: 181 GRATIEESRSLIHWLDTEEGFGKMGVCGLSMGGVHASMVGSLHPTPVATLPFLSPHSAVV 240
Query: 240 AFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFV 299
AFCEGILK+GTAWEALREELAA+K+ MTL+EVRERMRNVLSLTDVTRFPIPK P+AVIFV
Sbjct: 241 AFCEGILKYGTAWEALREELAAQKITMTLDEVRERMRNVLSLTDVTRFPIPKNPDAVIFV 300
Query: 300 AATVSTVFDYH 310
           AAT       H
Sbjct: 301 AATDDGYIPKH 311
>gi|12322047|gb|AAG51070.1|AC069472_10 unknown protein; 3293-1369 [Arabidopsis thaliana]
Length = 360
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 250/311 (80%), Positives = 284/311 (91%), Gaps = 1/311 (0%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFP-EIEGQNW 59
MVT LGMLHYV+DH+YGAFMHRTK++PPFFSRGWGG LELLER++++LFP E++GQNW
Sbjct: 1 MVTTKLGMLHYVIDHIYGAFMHRTKMTPPFFSRGWGGPNLELLERMVQRLFPLEVQGQNW 60
Query: 60 PPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMA 119
PP L++P+WRT+WET+TA LREGVF+TPC ++L +ALPPES ARVA+L PK VPPQKMA
Sbjct: 61 PPPLVRPVWRTVWETKTATLREGVFQTPCADELTAALPPESRTARVAWLVPKNVPPQKMA 120
Query: 120 CVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLL 179
CVVHLAGTGDHT++RRLRLGGPL+K+NIATMVLESPFYGQRRP LQ GA+LLCVSDLLLL
Sbjct: 121 CVVHLAGTGDHTYDRRLRLGGPLVKQNIATMVLESPFYGQRRPFLQCGARLLCVSDLLLL 180
Query: 180 GRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVV 239
GRATIEE+R L+HWL+ E GFGKMGVCGLSMGGVHA+MVGSLHPTPVATLPFLSPHSAVV
Sbjct: 181 GRATIEESRSLIHWLDTEEGFGKMGVCGLSMGGVHASMVGSLHPTPVATLPFLSPHSAVV 240
Query: 240 AFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFV 299
AFCEGILK+GTAWEALREELAA+K+ MTL+EVRERMRNVLSLTDVTRFPIPK P+AVIFV
Sbjct: 241 AFCEGILKYGTAWEALREELAAQKITMTLDEVRERMRNVLSLTDVTRFPIPKNPDAVIFV 300
Query: 300 AATVSTVFDYH 310
           AAT       H
Sbjct: 301 AATDDGYIPKH 311
>gi|30682072|ref|NP_187822.2| uncharacterized protein [Arabidopsis thaliana]
gi|20260670|gb|AAM13233.1| unknown protein [Arabidopsis thaliana]
gi|31711868|gb|AAP68290.1| At3g12156 [Arabidopsis thaliana]
gi|332641638|gb|AEE75159.1| uncharacterized protein [Arabidopsis thaliana]
Length = 363
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 250/311 (80%), Positives = 284/311 (91%), Gaps = 1/311 (0%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFP-EIEGQNW 59
MVT LGMLHYV+DH+YGAFMHRTK++PPFFSRGWGG LELLER++++LFP E++GQNW
Sbjct: 4 MVTTKLGMLHYVIDHIYGAFMHRTKMTPPFFSRGWGGPNLELLERMVQRLFPLEVQGQNW 63
Query: 60 PPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMA 119
PP L++P+WRT+WET+TA LREGVF+TPC ++L +ALPPES ARVA+L PK VPPQKMA
Sbjct: 64 PPPLVRPVWRTVWETKTATLREGVFQTPCADELTAALPPESRTARVAWLVPKNVPPQKMA 123
Query: 120 CVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLL 179
CVVHLAGTGDHT++RRLRLGGPL+K+NIATMVLESPFYGQRRP LQ GA+LLCVSDLLLL
Sbjct: 124 CVVHLAGTGDHTYDRRLRLGGPLVKQNIATMVLESPFYGQRRPFLQCGARLLCVSDLLLL 183
Query: 180 GRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVV 239
GRATIEE+R L+HWL+ E GFGKMGVCGLSMGGVHA+MVGSLHPTPVATLPFLSPHSAVV
Sbjct: 184 GRATIEESRSLIHWLDTEEGFGKMGVCGLSMGGVHASMVGSLHPTPVATLPFLSPHSAVV 243
Query: 240 AFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFV 299
AFCEGILK+GTAWEALREELAA+K+ MTL+EVRERMRNVLSLTDVTRFPIPK P+AVIFV
Sbjct: 244 AFCEGILKYGTAWEALREELAAQKITMTLDEVRERMRNVLSLTDVTRFPIPKNPDAVIFV 303
Query: 300 AATVSTVFDYH 310
           AAT       H
Sbjct: 304 AATDDGYIPKH 314
>gi|224139822|ref|XP_002323293.1| predicted protein [Populus trichocarpa]
gi|222867923|gb|EEF05054.1| predicted protein [Populus trichocarpa]
Length = 304
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 252/290 (86%), Positives = 269/290 (92%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLR 80
MHRTKISPPFFSRGWGGSKLELLER+IK LFPE+EGQNWPPSLIQPIWRT+WET++A LR
Sbjct: 1 MHRTKISPPFFSRGWGGSKLELLERMIKDLFPEVEGQNWPPSLIQPIWRTVWETRSACLR 60
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGG 140
EGVFRT CDEQL+SALPPESH ARVAFLAPK VPPQKMACVVHLAGTGDH+F+RRL LGG
Sbjct: 61 EGVFRTTCDEQLISALPPESHTARVAFLAPKHVPPQKMACVVHLAGTGDHSFDRRLHLGG 120
Query: 141 PLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGF 200
PLLKENIATMVLESPFYG+RRP+LQ GAKLLCVSDLLLLGR TI+E R LLHWL+ EAGF
Sbjct: 121 PLLKENIATMVLESPFYGRRRPMLQHGAKLLCVSDLLLLGRTTIDETRSLLHWLDSEAGF 180
Query: 201 GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELA 260
GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGIL++GTAWEALRE+LA
Sbjct: 181 GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILRYGTAWEALREDLA 240
Query: 261 AKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTVFDYH 310
+K AMTLE+VRERMRNVLSLTDVTRFPIPK PNAVIFVAAT H
Sbjct: 241 VQKPAMTLEDVRERMRNVLSLTDVTRFPIPKNPNAVIFVAATDDGYIPKH 290
>gi|219886851|gb|ACL53800.1| unknown [Zea mays]
Length = 366
Score = 496 bits (1276), Expect = e-138, Method: Compositional matrix adjust.
Identities = 230/323 (71%), Positives = 273/323 (84%), Gaps = 13/323 (4%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MV+VNLG++HYVLDH+YG +HRTK+ PFFS+GWGG+KL+LLE+++KQLFPE QNWP
Sbjct: 1 MVSVNLGLVHYVLDHIYGTLLHRTKLGTPFFSKGWGGTKLDLLEKMVKQLFPEARCQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+ +QP+W+T+WET + LREGVFRT CDE+L+ ALPPESHNARVAFL PK V P+KM+C
Sbjct: 61 PTAVQPMWKTVWETNNSCLREGVFRTTCDERLIDALPPESHNARVAFLTPKNVTPEKMSC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDHTFERRLRLGGPLLK NIATMVLESP+YGQRRP +QRGAKL CVSDLLLLG
Sbjct: 121 VVHLAGTGDHTFERRLRLGGPLLKNNIATMVLESPYYGQRRPSMQRGAKLQCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
+ATI+EAR LL+WL+ EAG+GKMG+CGLSMGGVHAAMVGSLHPTPVATLPFL+PHSAVV
Sbjct: 181 KATIDEARSLLYWLQNEAGYGKMGICGLSMGGVHAAMVGSLHPTPVATLPFLAPHSAVVP 240
Query: 241 FCEGILKHGTAWEALRE-------------ELAAKKVAMTLEEVRERMRNVLSLTDVTRF 287
FCEG+ K+ TAW+ALR+ E AA+K +T+E+VR+R+R+VLSLTDVTRF
Sbjct: 241 FCEGVYKYATAWDALRKDAAVLTQDVTLLAEDAAQKSGITIEQVRDRLRSVLSLTDVTRF 300
Query: 288 PIPKIPNAVIFVAATVSTVFDYH 310
           P+PK P AVIFV AT       H
Sbjct: 301 PVPKNPQAVIFVGATDDGYIPRH 323
>gi|226505978|ref|NP_001143458.1| uncharacterized protein LOC100276119 [Zea mays]
gi|195620838|gb|ACG32249.1| hypothetical protein [Zea mays]
Length = 366
Score = 493 bits (1269), Expect = e-137, Method: Compositional matrix adjust.
Identities = 229/323 (70%), Positives = 272/323 (84%), Gaps = 13/323 (4%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MV+VNLG++HYVLDH+YG +HRTK+ PFFS+GWGG+KL+LLE+++KQLFPE QNWP
Sbjct: 1 MVSVNLGLVHYVLDHIYGTLLHRTKLGTPFFSKGWGGTKLDLLEKMVKQLFPEARCQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+ +QP+W+T+WET + LREGVFRT CDE+L+ ALPPESHNARVAFL PK V P+KM+C
Sbjct: 61 PTAVQPMWKTVWETNNSCLREGVFRTTCDERLIDALPPESHNARVAFLTPKNVTPEKMSC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDHTFERRLRLGGPLLK NIATMVLESP+YGQRRP +QRGAKL CVSDLLLLG
Sbjct: 121 VVHLAGTGDHTFERRLRLGGPLLKNNIATMVLESPYYGQRRPSMQRGAKLQCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
+ATI+EAR LL+WL+ EAG+GKMG+CGLSMGGVHAAMV SLHPTPVATLPFL+PHSAVV
Sbjct: 181 KATIDEARSLLYWLQNEAGYGKMGICGLSMGGVHAAMVESLHPTPVATLPFLAPHSAVVP 240
Query: 241 FCEGILKHGTAWEALRE-------------ELAAKKVAMTLEEVRERMRNVLSLTDVTRF 287
FCEG+ K+ TAW+ALR+ E AA+K +T+E+VR+R+R+VLSLTDVTRF
Sbjct: 241 FCEGVYKYATAWDALRKDAAVLTQDVTLLAEDAAQKSGITIEQVRDRLRSVLSLTDVTRF 300
Query: 288 PIPKIPNAVIFVAATVSTVFDYH 310
           P+PK P AVIFV AT       H
Sbjct: 301 PVPKNPQAVIFVGATDDGYIPRH 323
>gi|12322018|gb|AAG51056.1|AC069473_18 unknown protein; 3519-5443 [Arabidopsis thaliana]
Length = 375
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 248/326 (76%), Positives = 282/326 (86%), Gaps = 16/326 (4%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFP-EIEGQNW 59
MVT LGMLHYV+DH+YGAFMHRTK++PPFFSRGWGG LELLER++++LFP E++GQNW
Sbjct: 1 MVTTKLGMLHYVIDHIYGAFMHRTKMTPPFFSRGWGGPNLELLERMVQRLFPLEVQGQNW 60
Query: 60 PPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMA 119
PP L++P+WRT+WET+TA LREGVF+TPC ++L +ALPPES ARVA+L PK VPPQKMA
Sbjct: 61 PPPLVRPVWRTVWETKTATLREGVFQTPCADELTAALPPESRTARVAWLVPKNVPPQKMA 120
Query: 120 CVVHLAGT---------------GDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLL 164
CVVHLA GDHT++RRLRLGGPL+K+NIATMVLESPFYGQRRP L
Sbjct: 121 CVVHLAVVRRACLCDLNLFIALPGDHTYDRRLRLGGPLVKQNIATMVLESPFYGQRRPFL 180
Query: 165 QRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPT 224
Q GA+LLCVSDLLLLGRATIEE+R L+HWL+ E GFGKMGVCGLSMGGVHA+MVGSLHPT
Sbjct: 181 QCGARLLCVSDLLLLGRATIEESRSLIHWLDTEEGFGKMGVCGLSMGGVHASMVGSLHPT 240
Query: 225 PVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDV 284
PVATLPFLSPHSAVVAFCEGILK+GTAWEALREELAA+K+ MTL+EVRERMRNVLSLTDV
Sbjct: 241 PVATLPFLSPHSAVVAFCEGILKYGTAWEALREELAAQKITMTLDEVRERMRNVLSLTDV 300
Query: 285 TRFPIPKIPNAVIFVAATVSTVFDYH 310
           TRFPIPK P+AVIFVAAT       H
Sbjct: 301 TRFPIPKNPDAVIFVAATDDGYIPKH 326
>gi|242064370|ref|XP_002453474.1| hypothetical protein SORBIDRAFT_04g006510 [Sorghum bicolor]
gi|241933305|gb|EES06450.1| hypothetical protein SORBIDRAFT_04g006510 [Sorghum bicolor]
Length = 366
Score = 490 bits (1262), Expect = e-136, Method: Compositional matrix adjust.
Identities = 226/315 (71%), Positives = 270/315 (85%), Gaps = 13/315 (4%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MV+VN+G++HYVLDH+YG +HRTK+ PFFS+GWGG+KL+LLE+++KQLFPE QNWP
Sbjct: 1 MVSVNIGLVHYVLDHIYGTLLHRTKLGTPFFSKGWGGTKLDLLEKMVKQLFPEARCQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+ +QP+W+T+WET ++ LREGVFRT CDE+L+ ALP ESHNARVAFL PK V P+KM+C
Sbjct: 61 PTAVQPMWKTVWETNSSCLREGVFRTTCDERLIGALPLESHNARVAFLTPKNVTPEKMSC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDHTFERRLRLGGPLLK NIATMVLESP+YGQRRP +QRGAKL CVSDLLLLG
Sbjct: 121 VVHLAGTGDHTFERRLRLGGPLLKNNIATMVLESPYYGQRRPSMQRGAKLQCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
+ATI+EAR LL+WL+ EAG+ KMGVCGLSMGGVHAAMVGSLHPTP+ATLPFL+PHSAVV
Sbjct: 181 KATIDEARSLLYWLQTEAGYSKMGVCGLSMGGVHAAMVGSLHPTPIATLPFLAPHSAVVP 240
Query: 241 FCEGILKHGTAWEALRE-------------ELAAKKVAMTLEEVRERMRNVLSLTDVTRF 287
FCEG+ K+ TAW+ LRE E AA+K +T+E+VR+R+R+VLSLTDVTRF
Sbjct: 241 FCEGVYKYATAWDVLREDAAALTQDVTSLAEDAAQKTGITIEQVRDRLRSVLSLTDVTRF 300
Query: 288 PIPKIPNAVIFVAAT 302
P+PK P AVIFV AT
Sbjct: 301 PVPKNPQAVIFVGAT 315
>gi|115444807|ref|NP_001046183.1| Os02g0195000 [Oryza sativa Japonica Group]
gi|49388113|dbj|BAD25244.1| unknown protein [Oryza sativa Japonica Group]
gi|113535714|dbj|BAF08097.1| Os02g0195000 [Oryza sativa Japonica Group]
gi|215734815|dbj|BAG95537.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 366
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 224/323 (69%), Positives = 274/323 (84%), Gaps = 13/323 (4%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MV+VNLG++HYVLDH+YG +HRTK+ PFFS+GWGG+KL+LLE+++KQLFPE QNWP
Sbjct: 1 MVSVNLGLVHYVLDHIYGTVLHRTKLGTPFFSKGWGGTKLDLLEKMVKQLFPEARCQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+ +QP+W+T+WET+++ LREGVFRT CD +L+ ALPPESHNARVAFL PK V P+KMAC
Sbjct: 61 PTAVQPMWKTVWETKSSCLREGVFRTTCDPRLIEALPPESHNARVAFLTPKSVSPEKMAC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
V+HLAGTGDH+FERRLRLGGPLLK+NIATMVLESP+YGQRRP +Q G+KL CVSDLLLLG
Sbjct: 121 VIHLAGTGDHSFERRLRLGGPLLKDNIATMVLESPYYGQRRPSMQHGSKLQCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
+ATI+EAR LL+WL+ EAG+GKMG+CGLSMGGVHAAMVGSLHPTP+ATLPFL+PHSAVV
Sbjct: 181 KATIDEARSLLYWLQNEAGYGKMGICGLSMGGVHAAMVGSLHPTPIATLPFLAPHSAVVP 240
Query: 241 FCEGILKHGTAWEALREELA-------------AKKVAMTLEEVRERMRNVLSLTDVTRF 287
FC+G+ +H TAW+ALR++ A A+K +T+E+VRER+R+VLSLTDVTRF
Sbjct: 241 FCDGLYRHATAWDALRKDAATLAQDVTSLTEDMAQKSGITIEQVRERLRSVLSLTDVTRF 300
Query: 288 PIPKIPNAVIFVAATVSTVFDYH 310
           P+PK P AVIFV AT       H
Sbjct: 301 PVPKNPQAVIFVGATDDGYIPKH 323
>gi|357139350|ref|XP_003571245.1| PREDICTED: uncharacterized protein C4orf29 homolog [Brachypodium
distachyon]
Length = 366
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 226/323 (69%), Positives = 271/323 (83%), Gaps = 13/323 (4%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MV+VNLG++HYVLDH+YG +HRTK+ PFFS+GWGG++L LLER++KQLFPE QNWP
Sbjct: 1 MVSVNLGLVHYVLDHIYGTVLHRTKLGTPFFSKGWGGTRLVLLERMVKQLFPEAPSQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+ +QPIW+T+WET+ + LREGVFRT CDE+L+ ALPPESHNARVAFL PK V P+KMAC
Sbjct: 61 PTAVQPIWKTVWETKNSSLREGVFRTTCDERLIDALPPESHNARVAFLTPKSVSPEKMAC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDH+FERRLRLG PLLK+NIATMVLESP+YGQRRP +Q G+KL CVSDLLLLG
Sbjct: 121 VVHLAGTGDHSFERRLRLGAPLLKDNIATMVLESPYYGQRRPSMQHGSKLQCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
+ATI+EAR LL+WL+ EAG+GKMG+CGLSMGGVHAAMVGSLHPTP+ATLPFL+PHSAVV
Sbjct: 181 KATIDEARSLLYWLQSEAGYGKMGICGLSMGGVHAAMVGSLHPTPIATLPFLAPHSAVVP 240
Query: 241 FCEGILKHGTAWEALRE-------------ELAAKKVAMTLEEVRERMRNVLSLTDVTRF 287
FCEG+ +H TAW+AL E E AA+K +T+E+V++R+R+VLSLTDVTRF
Sbjct: 241 FCEGLYRHATAWDALMEDAAALAQDATSLTEDAAQKSGITIEQVKDRLRSVLSLTDVTRF 300
Query: 288 PIPKIPNAVIFVAATVSTVFDYH 310
           P+PK P AVIFV AT       H
Sbjct: 301 PVPKKPQAVIFVGATDDGYIPRH 323
>gi|356500465|ref|XP_003519052.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C4orf29
homolog [Glycine max]
Length = 346
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 237/310 (76%), Positives = 263/310 (84%), Gaps = 13/310 (4%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MVTVNLGMLHY+LDH+YGA MHR++IS PFFSRGWGG+KLE+LE++I
Sbjct: 1 MVTVNLGMLHYLLDHIYGALMHRSRISTPFFSRGWGGTKLEMLEKMIG------------ 48
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
+L++PIWRT+WET TA LREGVF TPCD QL++ LPP SH ARVAFL PKCV P +MAC
Sbjct: 49 -ALVRPIWRTVWETGTASLREGVFXTPCDYQLLAELPPXSHMARVAFLVPKCVAPHRMAC 107
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
V+HLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQR P+LQRGAKL CVSDLLLLG
Sbjct: 108 VLHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRHPMLQRGAKLXCVSDLLLLG 167
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
RATIEE+R LLHW+ EAGF KMG+CGLSMGGVHAAMVGSLHPTPVAT PFLSPHSA VA
Sbjct: 168 RATIEESRGLLHWMYSEAGFSKMGICGLSMGGVHAAMVGSLHPTPVATFPFLSPHSAAVA 227
Query: 241 FCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
F EGILK+GTAWEALR +LA +K MTLEEVRER+RNVLSLT+VT FPIPKIPNAVIFV+
Sbjct: 228 FREGILKYGTAWEALRGDLATQKAEMTLEEVRERLRNVLSLTEVTCFPIPKIPNAVIFVS 287
Query: 301 ATVSTVFDYH 310
           AT       H
Sbjct: 288 ATDDGYIPKH 297
>gi|116789020|gb|ABK25086.1| unknown [Picea sitchensis]
Length = 356
Score = 479 bits (1233), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/314 (72%), Positives = 267/314 (85%), Gaps = 6/314 (1%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFP----EIEG 56
MVT+NLG+ HY+LDH+YGAF+HR +++PPFFS GWGG KL+LLE++ KQL E+
Sbjct: 1 MVTLNLGVCHYILDHIYGAFVHRMRLAPPFFSAGWGGPKLDLLEKMTKQLLSQGLVEVAA 60
Query: 57 QNWPPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQ 116
Q WPP +IQP+W+T+WE++T+ L+EG+F+TPCDE+L++ALP ES+ ARVAFL PK VP
Sbjct: 61 QRWPPRIIQPLWKTVWESRTSRLQEGIFKTPCDEELLNALPIESYTARVAFLTPKYVPSH 120
Query: 117 KMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDL 176
MACVVHLAGTGDH+FERRLRLGGPLLK NIATMVLESPFYG RRP LQ GAKLLCVSDL
Sbjct: 121 NMACVVHLAGTGDHSFERRLRLGGPLLKYNIATMVLESPFYGHRRPKLQHGAKLLCVSDL 180
Query: 177 LLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHS 236
L+LGRATIEE R LL+WLE EAGF K GVCGLSMGGVHAAMVGSLHPTPVATLPFL+PHS
Sbjct: 181 LMLGRATIEETRTLLYWLETEAGFSKTGVCGLSMGGVHAAMVGSLHPTPVATLPFLAPHS 240
Query: 237 AVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAV 296
A VAFCEGILKHGTAW+ALR ++ + V MT+EEVRERMR+VLSLTDVT+FP PK P++V
Sbjct: 241 AAVAFCEGILKHGTAWDALRRDV--QSVGMTIEEVRERMRSVLSLTDVTQFPTPKCPSSV 298
Query: 297 IFVAATVSTVFDYH 310
           IFVAAT       H
Sbjct: 299 IFVAATYDGYIPKH 312
>gi|9294114|dbj|BAB01965.1| unnamed protein product [Arabidopsis thaliana]
Length = 335
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 229/286 (80%), Positives = 261/286 (91%), Gaps = 1/286 (0%)
Query: 26 ISPPFFSRGWGGSKLELLERLIKQLFP-EIEGQNWPPSLIQPIWRTIWETQTAVLREGVF 84
++PPFFSRGWGG LELLER++++LFP E++GQNWPP L++P+WRT+WET+TA LREGVF
Sbjct: 1 MTPPFFSRGWGGPNLELLERMVQRLFPLEVQGQNWPPPLVRPVWRTVWETKTATLREGVF 60
Query: 85 RTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGGPLLK 144
+TPC ++L +ALPPES ARVA+L PK VPPQKMACVVHLAGTGDHT++RRLRLGGPL+K
Sbjct: 61 QTPCADELTAALPPESRTARVAWLVPKNVPPQKMACVVHLAGTGDHTYDRRLRLGGPLVK 120
Query: 145 ENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMG 204
+NIATMVLESPFYGQRRP LQ GA+LLCVSDLLLLGRATIEE+R L+HWL+ E GFGKMG
Sbjct: 121 QNIATMVLESPFYGQRRPFLQCGARLLCVSDLLLLGRATIEESRSLIHWLDTEEGFGKMG 180
Query: 205 VCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKV 264
VCGLSMGGVHA+MVGSLHPTPVATLPFLSPHSAVVAFCEGILK+GTAWEALREELAA+K+
Sbjct: 181 VCGLSMGGVHASMVGSLHPTPVATLPFLSPHSAVVAFCEGILKYGTAWEALREELAAQKI 240
Query: 265 AMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTVFDYH 310
MTL+EVRERMRNVLSLTDVTRFPIPK P+AVIFVAAT H
Sbjct: 241 TMTLDEVRERMRNVLSLTDVTRFPIPKNPDAVIFVAATDDGYIPKH 286
>gi|297739730|emb|CBI29912.3| unnamed protein product [Vitis vinifera]
Length = 314
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 232/265 (87%), Positives = 245/265 (92%), Gaps = 1/265 (0%)
Query: 46 LIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARV 105
+IKQLFPE +NWPPSLIQPIW+T+WET+TA LREGVF+TPCDE+L+SALPPESH ARV
Sbjct: 1 MIKQLFPEA-AENWPPSLIQPIWKTVWETKTACLREGVFKTPCDERLLSALPPESHTARV 59
Query: 106 AFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQ 165
AFL PK VPPQKMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRP+LQ
Sbjct: 60 AFLTPKFVPPQKMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPVLQ 119
Query: 166 RGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTP 225
RGAKLLCVSDLLLLGRATIEEAR LLHWL+ EAGFGKMGVCGLSMGGVHAAMVGSLHPTP
Sbjct: 120 RGAKLLCVSDLLLLGRATIEEARSLLHWLDSEAGFGKMGVCGLSMGGVHAAMVGSLHPTP 179
Query: 226 VATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVT 285
VATLPFLSPHSAVVAFCEGILKH TAWEALRE+LA +K AMTLE+VRERMRNVLSLTDVT
Sbjct: 180 VATLPFLSPHSAVVAFCEGILKHATAWEALREDLAVQKAAMTLEDVRERMRNVLSLTDVT 239
Query: 286 RFPIPKIPNAVIFVAATVSTVFDYH 310
           RFPIPK PNAVIFVAAT       H
Sbjct: 240 RFPIPKNPNAVIFVAATDDGYIPKH 264
>gi|302821230|ref|XP_002992279.1| hypothetical protein SELMODRAFT_236489 [Selaginella moellendorffii]
gi|300139929|gb|EFJ06660.1| hypothetical protein SELMODRAFT_236489 [Selaginella moellendorffii]
Length = 355
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 215/306 (70%), Positives = 247/306 (80%), Gaps = 7/306 (2%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPE----IEG 56
MV NL + HY LDHVYGA M+R ++SPPFFS GWGG++L LLE+L +QL +
Sbjct: 1 MVVSNLRVAHYWLDHVYGALMYRLRLSPPFFSNGWGGARLVLLEQLTRQLISQGLANFSL 60
Query: 57 QNWPPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQ 116
+ WPP I P+WRT+WE++ A L+EG+F TPCD + LPPESH ARV L P+ VP
Sbjct: 61 KYWPPPPIDPVWRTVWESKAAKLQEGIFPTPCDPLVRECLPPESHIARVRLLMPRSVPAH 120
Query: 117 KMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDL 176
KMACVVHLAGTGDH FERRLRLGGPLLK+NIAT+VLESPFYG RRP LQRGAKLLCVSDL
Sbjct: 121 KMACVVHLAGTGDHGFERRLRLGGPLLKDNIATLVLESPFYGNRRPRLQRGAKLLCVSDL 180
Query: 177 LLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHS 236
L+LGRATIEEAR LL+WL+ +AGF K+GVCGLSMGGVHAAMVGSLHPTP+A LP LSPHS
Sbjct: 181 LVLGRATIEEARTLLYWLDKQAGFSKLGVCGLSMGGVHAAMVGSLHPTPLAVLPLLSPHS 240
Query: 237 AVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAV 296
A VAFCEGI+K+GTAWE L + A +MTL++VRERMR VLSLTDVTRFPIPK P AV
Sbjct: 241 AAVAFCEGIMKYGTAWEVLMRDEAC---SMTLDQVRERMRAVLSLTDVTRFPIPKNPRAV 297
Query: 297 IFVAAT 302
IFV AT
Sbjct: 298 IFVGAT 303
>gi|302812275|ref|XP_002987825.1| hypothetical protein SELMODRAFT_235364 [Selaginella moellendorffii]
gi|300144444|gb|EFJ11128.1| hypothetical protein SELMODRAFT_235364 [Selaginella moellendorffii]
Length = 355
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 214/306 (69%), Positives = 247/306 (80%), Gaps = 7/306 (2%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPE----IEG 56
MV NL + HY LDHVYGA M+R ++SPPFFS GWGG++L LLE+L +QL +
Sbjct: 1 MVVSNLRVAHYWLDHVYGALMYRLRLSPPFFSNGWGGARLVLLEQLTRQLISQGLANFSV 60
Query: 57 QNWPPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQ 116
+ WPP I P+WRT+WE++ A L+EG+F TPCD + LPPESH ARV L P+ VP
Sbjct: 61 KCWPPPPIDPVWRTVWESKAAKLQEGIFPTPCDPLVRECLPPESHIARVRLLMPRSVPAH 120
Query: 117 KMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDL 176
KMACVVHLAGTGDH FERRLRLGGPLLK+NIAT+VLESPFYG RRP LQRGAKLLCVSDL
Sbjct: 121 KMACVVHLAGTGDHGFERRLRLGGPLLKDNIATLVLESPFYGNRRPRLQRGAKLLCVSDL 180
Query: 177 LLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHS 236
L+LGRATIEEAR LL+WL+ +AGF K+GVCGLSMGGVHAAMVGSLHPTP+A LP LSPHS
Sbjct: 181 LVLGRATIEEARTLLYWLDKQAGFSKLGVCGLSMGGVHAAMVGSLHPTPLAVLPLLSPHS 240
Query: 237 AVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAV 296
A VAFCEGI+K+GTAW+ L + A +MTL++VRERMR VLSLTDVTRFPIPK P AV
Sbjct: 241 AAVAFCEGIMKYGTAWDVLMRDEAC---SMTLDQVRERMRAVLSLTDVTRFPIPKNPRAV 297
Query: 297 IFVAAT 302
IFV AT
Sbjct: 298 IFVGAT 303
>gi|168063632|ref|XP_001783774.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664717|gb|EDQ51426.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 374
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 207/340 (60%), Positives = 252/340 (74%), Gaps = 26/340 (7%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQL----FPEIEG 56
MV VN+G HY LDH+YGA MHR +++PPFFS GWGG KLELLE++ +QL ++
Sbjct: 1 MVAVNVGAAHYWLDHLYGAIMHRMRLAPPFFSGGWGGRKLELLEQMSRQLIAQGLAQVSL 60
Query: 57 QNWPPSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQ 116
Q+WPP I P+WRT+WE++ A L+EG+F TPC++ L LP ES ARV L+P+ VP
Sbjct: 61 QHWPPPAINPVWRTVWESRKAKLQEGIFTTPCEDMLKQVLPIESQTARVRLLSPRHVPIH 120
Query: 117 KMACVVHLAG-----------------TGDHTFERRLRLGGPLLKENIATMVLESPFYGQ 159
+ + VVHLAG TGDH F+RRLRLGGPLL++NIAT+VLESP+YG+
Sbjct: 121 ETSFVVHLAGILSAQASSSNGIVIFPGTGDHGFDRRLRLGGPLLEKNIATLVLESPYYGK 180
Query: 160 RRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVG 219
RRP LQRGA+LLCVSDLLLLGR TIEEAR LL+W E E G+ K+GVCGLSMGGVHAAMVG
Sbjct: 181 RRPPLQRGARLLCVSDLLLLGRTTIEEARALLYWAETEEGYKKVGVCGLSMGGVHAAMVG 240
Query: 220 SLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL-REELAAKKVAMTLEEVRERMRNV 278
SLH +PVA LPFL+PHSA VAFCEGIL++GTAWE L R+EL + +MT E+V ERMR V
Sbjct: 241 SLHRSPVAILPFLTPHSAAVAFCEGILQYGTAWEVLMRDELLSG--SMTREQVVERMRTV 298
Query: 279 LSLTDVTRFPIPKIPNAVIFVAATVSTVFDYHHEEVLKMD 318
LSLTDVT+FP P+ P +VIFVAAT H VLK+
Sbjct: 299 LSLTDVTQFPAPQNPKSVIFVAATDDGYVPDH--SVLKLQ 336
>gi|413936215|gb|AFW70766.1| hypothetical protein ZEAMMB73_402274 [Zea mays]
gi|413936216|gb|AFW70767.1| hypothetical protein ZEAMMB73_402274 [Zea mays]
Length = 212
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 160/211 (75%), Positives = 190/211 (90%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MV+VNLG++HYVLDH+YG +HRTK+ PFFS+GWGG+KL+LLE+++KQLFPE QNWP
Sbjct: 1 MVSVNLGLVHYVLDHIYGTLLHRTKLGTPFFSKGWGGTKLDLLEKMVKQLFPEARCQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
P+ +QP+W+T+WET + LREGVFRT CDE+L+ ALPPESHNARVAFL PK V P+KM+C
Sbjct: 61 PTAVQPMWKTVWETNNSCLREGVFRTTCDERLIDALPPESHNARVAFLTPKNVTPEKMSC 120
Query: 121 VVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
VVHLAGTGDHTFERRLRLGGPLLK NIATMVLESP+YGQRRP +QRGAKL CVSDLLLLG
Sbjct: 121 VVHLAGTGDHTFERRLRLGGPLLKNNIATMVLESPYYGQRRPSMQRGAKLQCVSDLLLLG 180
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMG 211
+ATI+EAR LL+WL+ EAG+GKMG+CGLSMG
Sbjct: 181 KATIDEARSLLYWLQNEAGYGKMGICGLSMG 211
>gi|255583266|ref|XP_002532397.1| conserved hypothetical protein [Ricinus communis]
gi|223527893|gb|EEF29982.1| conserved hypothetical protein [Ricinus communis]
Length = 243
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 169/193 (87%), Positives = 178/193 (92%)
Query: 118 MACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLL 177
MACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRP+LQ GAKLLCVSDLL
Sbjct: 1 MACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPMLQTGAKLLCVSDLL 60
Query: 178 LLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA 237
LLGRATI+EAR LLHWL+ EAGFGK GVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA
Sbjct: 61 LLGRATIDEARSLLHWLDCEAGFGKTGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA 120
Query: 238 VVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVI 297
VAFCEGIL+HGTAWEALRE+LA +K A+TL+EV+ERMRNVLSLTDVTRFPIPK PNAVI
Sbjct: 121 AVAFCEGILRHGTAWEALREDLAVQKAAITLQEVQERMRNVLSLTDVTRFPIPKNPNAVI 180
Query: 298 FVAATVSTVFDYH 310
FVAAT H
Sbjct: 181 FVAATDDGYIPKH 193
>gi|218190244|gb|EEC72671.1| hypothetical protein OsI_06224 [Oryza sativa Indica Group]
Length = 249
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 144/198 (72%), Positives = 171/198 (86%), Gaps = 13/198 (6%)
Query: 118 MACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLL 177
MACV+HLAGTGDH+FERRLRLGGPLLK+NIATMVLESP+YGQRRP +Q G+KL CVSDLL
Sbjct: 1 MACVIHLAGTGDHSFERRLRLGGPLLKDNIATMVLESPYYGQRRPSMQHGSKLQCVSDLL 60
Query: 178 LLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA 237
LLG+ATI+EAR LL+WL+ EAG+GKMG+CGLSMGGVHAAMVGSLHPTP+ATLPFL+PHSA
Sbjct: 61 LLGKATIDEARSLLYWLQNEAGYGKMGICGLSMGGVHAAMVGSLHPTPIATLPFLAPHSA 120
Query: 238 VVAFCEGILKHGTAWEALREELA-------------AKKVAMTLEEVRERMRNVLSLTDV 284
VV FC+G+ +H TAW+ALR++ A A+K +T+E+VRER+R+VLSLTDV
Sbjct: 121 VVPFCDGLYRHATAWDALRKDAATLAQDVTSLTEDTAQKSGITIEQVRERLRSVLSLTDV 180
Query: 285 TRFPIPKIPNAVIFVAAT 302
TRFP+PK P AVIFV AT
Sbjct: 181 TRFPVPKNPQAVIFVGAT 198
>gi|222622365|gb|EEE56497.1| hypothetical protein OsJ_05743 [Oryza sativa Japonica Group]
Length = 249
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 144/198 (72%), Positives = 171/198 (86%), Gaps = 13/198 (6%)
Query: 118 MACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLL 177
MACV+HLAGTGDH+FERRLRLGGPLLK+NIATMVLESP+YGQRRP +Q G+KL CVSDLL
Sbjct: 1 MACVIHLAGTGDHSFERRLRLGGPLLKDNIATMVLESPYYGQRRPSMQHGSKLQCVSDLL 60
Query: 178 LLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA 237
LLG+ATI+EAR LL+WL+ EAG+GKMG+CGLSMGGVHAAMVGSLHPTP+ATLPFL+PHSA
Sbjct: 61 LLGKATIDEARSLLYWLQNEAGYGKMGICGLSMGGVHAAMVGSLHPTPIATLPFLAPHSA 120
Query: 238 VVAFCEGILKHGTAWEALREELA-------------AKKVAMTLEEVRERMRNVLSLTDV 284
VV FC+G+ +H TAW+ALR++ A A+K +T+E+VRER+R+VLSLTDV
Sbjct: 121 VVPFCDGLYRHATAWDALRKDAATLAQDVTSLTEDMAQKSGITIEQVRERLRSVLSLTDV 180
Query: 285 TRFPIPKIPNAVIFVAAT 302
TRFP+PK P AVIFV AT
Sbjct: 181 TRFPVPKNPQAVIFVGAT 198
>gi|356537333|ref|XP_003537182.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C4orf29
homolog [Glycine max]
Length = 234
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 140/183 (76%), Positives = 151/183 (82%), Gaps = 5/183 (2%)
Query: 128 GDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEA 187
G+H+FE RLRLGGP LK NI TMVL+SPFYGQRRP+LQRGAKLLCVSDLLLL RATI+E
Sbjct: 8 GNHSFEGRLRLGGPXLKANIXTMVLKSPFYGQRRPMLQRGAKLLCVSDLLLLRRATIKEL 67
Query: 188 RCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILK 247
R LLH ++ E FGKM SMGGVHAAMVGSLHP P+AT FLSPHSAVVAFCE ILK
Sbjct: 68 RSLLHXMDSETXFGKM-----SMGGVHAAMVGSLHPRPIATFHFLSPHSAVVAFCEEILK 122
Query: 248 HGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTVF 307
HGTAWEALR +LAA+K MTLEE RER+RNVLSLTDVT FPIPKIPNAVIFVAAT
Sbjct: 123 HGTAWEALRGDLAAQKAEMTLEEXRERLRNVLSLTDVTCFPIPKIPNAVIFVAATDDGYI 182
Query: 308 DYH 310
H
Sbjct: 183 PKH 185
>gi|384245857|gb|EIE19349.1| alpha/beta-hydrolase, partial [Coccomyxa subellipsoidea C-169]
Length = 301
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 149/248 (60%), Gaps = 28/248 (11%)
Query: 83 VFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGGPL 142
+ RTPC +L ALP ES ARV L P+ + CVVHLAGTGDH FERR LG PL
Sbjct: 1 IHRTPCHGRLYDALPEESRTARVRLLLPRGAS-EATDCVVHLAGTGDHGFERRTHLGLPL 59
Query: 143 LKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGK 202
+ + +ATM LESP+YG RRP Q G+KL VSDLL LGR TIEE+ LL W + + F +
Sbjct: 60 IAKGVATMALESPYYGSRRPPWQEGSKLERVSDLLTLGRTTIEESLYLLAWAQ-QQKFRR 118
Query: 203 MGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL------- 255
+G+CG SMGGVHA MV SL+P +A +P L+P SA VAFC G L+ TAW+ L
Sbjct: 119 LGICGFSMGGVHACMVASLYPKALACVPLLAPRSAAVAFCHGALREATAWQPLLAAADEA 178
Query: 256 ------------REELAAKKVAMTLEEVRERMRNVL-SLTDVTRFPIPKIPNAVIFVAAT 302
E +AA+K+A RER+ VL + TDVTRFP P+ P+A + V A
Sbjct: 179 DKVCPSVFPSQSHETVAAQKLA------RERLDKVLETYTDVTRFPRPRRPDAAVIVGAH 232
Query: 303 VSTVFDYH 310
H
Sbjct: 233 NDAYVSAH 240
>gi|302832233|ref|XP_002947681.1| hypothetical protein VOLCADRAFT_43395 [Volvox carteri f.
nagariensis]
gi|300267029|gb|EFJ51214.1| hypothetical protein VOLCADRAFT_43395 [Volvox carteri f.
nagariensis]
Length = 235
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 105/184 (57%), Positives = 130/184 (70%), Gaps = 2/184 (1%)
Query: 119 ACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLL 178
ACVVHLA TGD TF RRLRLG PLLK+N+ ++VLESPFYG RRP QRG+KLL VSDLL
Sbjct: 1 ACVVHLAATGDQTFGRRLRLGFPLLKDNVCSLVLESPFYGARRPAAQRGSKLLRVSDLLT 60
Query: 179 LGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAV 238
LG ATI E+ LLHWL E G+G +G+CGLSMGGVHA+M L P VA P L+P SA
Sbjct: 61 LGWATIAESINLLHWLR-EEGYGALGMCGLSMGGVHASMTAGLFPGDVAVTPLLAPRSAA 119
Query: 239 VAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVL-SLTDVTRFPIPKIPNAVI 297
VA+C+G ++ AWE L +EL A + E R++ VL + TD+TR+P P+ +A +
Sbjct: 120 VAYCDGAMRAAMAWEPLLKELRAGDRRLDRPETVLRLKQVLETYTDITRYPRPRRTDAAV 179
Query: 298 FVAA 301
VAA
Sbjct: 180 IVAA 183
>gi|388513101|gb|AFK44612.1| unknown [Lotus japonicus]
Length = 98
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 73/88 (82%), Positives = 83/88 (94%)
Query: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP 60
MVTVNLGMLHYVLDHVYGAFMHRTKIS PFFS GWGG+KLE+LE++I QLFPE+ GQNWP
Sbjct: 1 MVTVNLGMLHYVLDHVYGAFMHRTKISTPFFSGGWGGTKLEMLEKMINQLFPEVAGQNWP 60
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPC 88
PSL++P+WRT+WET+TA LREGVFRTPC
Sbjct: 61 PSLVRPVWRTVWETKTACLREGVFRTPC 88
>gi|325181061|emb|CCA15470.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 449
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 28/315 (8%)
Query: 10 HYVLDHVYGAFMHRTKISPP-FFSRGWGGSKL-ELLERLIKQLFPEIEGQNWPPSLIQPI 67
H LD + H P FF GWG + E + ++K E+ + IQ
Sbjct: 88 HRFLDRMIATLTHHRVFFPNGFFGDGWGDVTVSERIRTIVKS--DEMRSIYRIKNGIQ-- 143
Query: 68 WRTI--WETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMA--CVVH 123
WR++ + L+EG F T E SALP S A + P C +KMA V+
Sbjct: 144 WRSVKVLPSLNVQLQEGSFHTTLQED--SALPECSRTAYFELVTPLCTDGKKMANAMVIS 201
Query: 124 LAGTGDHTF-ERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRA 182
L GTG+H + RR L P+ ++T+++E PFYG+R+P Q+G+KL VSDL LLG+
Sbjct: 202 LPGTGEHGYGHRRNTLAIPMALNGVSTLIVEGPFYGKRKPPNQKGSKLRRVSDLPLLGQT 261
Query: 183 TIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFC 242
TI EA+ LL + + + V G SMGG+HAAM S +P V + +L+P A AF
Sbjct: 262 TITEAKSLLVHFKEHHPYTRFVVAGTSMGGLHAAMTASTYPFDVGMVAWLAPLCASSAFA 321
Query: 243 EGILKHGTAWEALREELAAKKV---------AMTLEEVR------ERMRNVLSLTDVTRF 287
+G+L W AL E+L + A + ++ R +R+ +LS TD+T F
Sbjct: 322 DGVLSESCNWSALYEDLEGAIIDGNDSFTCSASSTDKFRGKELAKQRLVQLLSFTDITNF 381
Query: 288 PIPKIPNAVIFVAAT 302
PK P+A +FV T
Sbjct: 382 APPKRPDATVFVYGT 396
>gi|301113346|ref|XP_002998443.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111744|gb|EEY69796.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 458
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 119/364 (32%), Positives = 168/364 (46%), Gaps = 62/364 (17%)
Query: 10 HYVLDHVYGAFMHRTKISPP-FFSRGWGGSKL-----ELLE-RLIKQLFPEIEGQNWPPS 62
H LD A + P FFS GWG ELL+ R + + +G+
Sbjct: 51 HRYLDRAAAAVTQNPVLFPNGFFSDGWGDLNTPKRIRELLQSRRMSDVVSLRDGE----- 105
Query: 63 LIQPIWRTIWETQTA--VLREGVFRTPCDEQLMSALPPESHNARVAFLAP-------KCV 113
P W ++ + A LREG F + D LP ES +A + P + +
Sbjct: 106 ---PNWSSVRKLSVAKVALREGKFSSSLD-NAQQLLPAESQDAFCELVTPLEWEREDQRI 161
Query: 114 PP--QKMACVVHLAGTGDHTF-ERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKL 170
P Q VV L GTG+H F RR + PL K +AT++LE PFYG+R+P Q+G+KL
Sbjct: 162 PQGRQDRPLVVLLPGTGEHGFLHRRASIAIPLAKRGVATLILEGPFYGKRKPSKQKGSKL 221
Query: 171 LCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLP 230
VSDL +LG+ATIEEA+ LL G+ ++ V G SMGG+HAAMV S+ P V
Sbjct: 222 RRVSDLPILGQATIEEAKSLLEHFRDCHGYSQLVVAGSSMGGLHAAMVASVFPGDVGATA 281
Query: 231 FLSPHSAVVAFCEGILKHGTAWEALREE---------LAAKKVAMTLE------------ 269
+L+P SAV F +G+L W +L ++ L A + E
Sbjct: 282 WLAPPSAVPVFADGLLSGSCNWRSLYKQHELQMLDKMLTGHAAAESYEKLLGAAVDDEKE 341
Query: 270 -------------EVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTVFDYHHEEVLK 316
E ++RMR LS+TD+ F P+ +AV+FV T + + +
Sbjct: 342 RAECSELELDPVQEAKKRMRLFLSITDIDNFLPPRKSDAVVFVYGTEDEYIGFTEPQWQR 401
Query: 317 MDSQ 320
M Q
Sbjct: 402 MREQ 405
>gi|452824654|gb|EME31655.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 328
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 135/273 (49%), Gaps = 39/273 (14%)
Query: 65 QP-IWRTIWETQTAVLREGVFRTPCDEQL-----MSALPPESHNARVAFLAPKCVPPQKM 118
QP +W + E LRE F TP E L +S+ P E+ AR + P +
Sbjct: 5 QPWLWNRVAELWKLRLREACFLTPAIEWLNETGSLSSFPGETRMARFLLVEP--LYKSDS 62
Query: 119 ACVVHLAGTGDHTFERRLR-LGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLL 177
+ V+HLA TGDH + RRL PL I++++LE+P+YG R+P+ Q G+KL V DLL
Sbjct: 63 SLVIHLAATGDHGYNRRLFCFALPLANHGISSVILENPYYGSRKPVHQVGSKLAYVQDLL 122
Query: 178 LLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA 237
LLG ATI E + + + + M GLS GG+HAAM SL+P VAT+ SPHSA
Sbjct: 123 LLGFATILECMSIAKYFSEDVEYRSMCFTGLSQGGLHAAMAASLYPFSVATVAAFSPHSA 182
Query: 238 VVAFCEGILKHGTAWEALREEL------------------------------AAKKVAMT 267
V F +G+L+ +W L + ++ +
Sbjct: 183 VPVFTDGVLRQSCSWNQLAATMNEAVQSSCTIQHPEDSHERTEYDHLASSPQVSRYIDRK 242
Query: 268 LEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
+ VR ++R L ++D+ FP P PNA I +A
Sbjct: 243 EQTVRSQLRIALEMSDIRHFPQPANPNAAILLA 275
>gi|452824655|gb|EME31656.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 280
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 135/273 (49%), Gaps = 39/273 (14%)
Query: 65 QP-IWRTIWETQTAVLREGVFRTPCDEQL-----MSALPPESHNARVAFLAPKCVPPQKM 118
QP +W + E LRE F TP E L +S+ P E+ AR + P +
Sbjct: 5 QPWLWNRVAELWKLRLREACFLTPAIEWLNETGSLSSFPGETRMARFLLVEP--LYKSDS 62
Query: 119 ACVVHLAGTGDHTFERRLR-LGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLL 177
+ V+HLA TGDH + RRL PL I++++LE+P+YG R+P+ Q G+KL V DLL
Sbjct: 63 SLVIHLAATGDHGYNRRLFCFALPLANHGISSVILENPYYGSRKPVHQVGSKLAYVQDLL 122
Query: 178 LLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA 237
LLG ATI E + + + + M GLS GG+HAAM SL+P VAT+ SPHSA
Sbjct: 123 LLGFATILECMSIAKYFSEDVEYRSMCFTGLSQGGLHAAMAASLYPFSVATVAAFSPHSA 182
Query: 238 VVAFCEGILKHGTAWEALREEL------------------------------AAKKVAMT 267
V F +G+L+ +W L + ++ +
Sbjct: 183 VPVFTDGVLRQSCSWNQLAATMNEAVQSSCTIQHPEDSHERTEYDHLASSPQVSRYIDRK 242
Query: 268 LEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
+ VR ++R L ++D+ FP P PNA I +A
Sbjct: 243 EQTVRSQLRIALEMSDIRHFPQPANPNAAILLA 275
>gi|348669937|gb|EGZ09759.1| hypothetical protein PHYSODRAFT_564254 [Phytophthora sojae]
Length = 458
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 169/364 (46%), Gaps = 62/364 (17%)
Query: 10 HYVLDHVYGAFMHRTKISPP-FFSRGWGGSKL-----ELLE-RLIKQLFPEIEGQNWPPS 62
H LD A + P FFS GWG ELL+ R + + EG+
Sbjct: 51 HRYLDRAAAAVTQNPVLFPNGFFSDGWGDLNTSKRIRELLQSRRMSDVVSLKEGE----- 105
Query: 63 LIQPIWRTIWETQTA--VLREGVFRTPCDEQLMSALPPESHNARVAFLAP-------KCV 113
P W + E A LREG F++ LP +S +A + P V
Sbjct: 106 ---PSWGCVRELSVAKVALREGRFQSTLG-NAQQLLPEQSLDAFCELVTPLDWEREDGGV 161
Query: 114 P--PQKMACVVHLAGTGDHTF-ERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKL 170
P VV L GTG+H F RR + PL K+ +AT++LE PFYG+R+P Q+G+KL
Sbjct: 162 PHGGTDRPLVVLLPGTGEHGFLHRRTSIAIPLAKKGVATLILEGPFYGKRKPPQQKGSKL 221
Query: 171 LCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLP 230
VSDL +LG+ATIEEA+ LL G+ ++ + G SMGG+HAAMV S+ P V
Sbjct: 222 RRVSDLPILGQATIEEAKSLLEHFRDYHGYSQLVIAGSSMGGLHAAMVASVFPGDVGATA 281
Query: 231 FLSPHSAVVAFCEGILKHGTAWEAL--REE-------LAAKKVAMTLE------------ 269
+L+P SAV F +G+L W +L R E LA + VA + E
Sbjct: 282 WLAPPSAVPVFADGLLSGSCNWRSLYKRHELQMLDKMLAGQAVAESYEKLATAGADTGAE 341
Query: 270 -------------EVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTVFDYHHEEVLK 316
E ++RMR LS+TD+ F P+ +AV+FV T + + +
Sbjct: 342 ELDTSDAELDPVQEAKKRMRLFLSITDIDNFLPPRRSDAVVFVYGTEDEYIGFTEPQWER 401
Query: 317 MDSQ 320
M Q
Sbjct: 402 MREQ 405
>gi|428179448|gb|EKX48319.1| hypothetical protein GUITHDRAFT_68777, partial [Guillardia theta
CCMP2712]
Length = 246
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/189 (40%), Positives = 105/189 (55%), Gaps = 4/189 (2%)
Query: 141 PLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGF 200
PL I +++LESP+YG R+P QRG KL CVSDLL LG ATIEE +L + G
Sbjct: 1 PLPNTGIGSVILESPYYGHRKPRRQRGPKLQCVSDLLSLGNATIEETISILRYFN-AHGH 59
Query: 201 GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELA 260
G +G+CG SMGGVHA M + PVA + FL+P A FC+G L W+AL +
Sbjct: 60 GPLGICGFSMGGVHAIMTAGVCNLPVALVTFLAPQCAAPVFCQGALSASCDWDALSRHSS 119
Query: 261 A---KKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTVFDYHHEEVLKM 317
+ + E+V+ R+ +L +TDVTR P P P A I + A D EEV++
Sbjct: 120 SINWNEWNCEDEDVKHRLGRILRITDVTRLPPPPCPWATILIQAKEDAYIDRRSEEVIRS 179
Query: 318 DSQHFFALF 326
+ ++ F
Sbjct: 180 SWRDYWKAF 188
>gi|300120528|emb|CBK20082.2| unnamed protein product [Blastocystis hominis]
Length = 848
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 103/293 (35%), Positives = 151/293 (51%), Gaps = 27/293 (9%)
Query: 30 FFSRGWGGSKLELLERLIKQLF---PEIEGQNWPPSL-IQPIWRTIWETQTAVLREGVFR 85
FF+ GWG ++L++RL + L E + PP + I I E ++++G F+
Sbjct: 526 FFTAGWG--DIDLVDRLDEGLLLLKEETANKVHPPPININLTAPEINEEDEVIIQDGQFK 583
Query: 86 TPCDEQLMSALPPESHNARVAFLAP----KCVPPQKMACVVHLAGTGDHTFERRLR-LGG 140
T + LP ES + + P + PP K V+ L GTG+ F RR +
Sbjct: 584 TV--SRYREYLPVESEQVYIRIIKPLSWGRLDPPHK-PMVLILPGTGEKGFGRRYDGVSV 640
Query: 141 PLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGF 200
PL + I +++LE PFYG+R+P Q G KL VSDL LLG ATIEE+R LL++L E G
Sbjct: 641 PLARLGIGSIILEGPFYGRRKPKKQNGCKLRHVSDLPLLGAATIEESRSLLYYLR-EQGL 699
Query: 201 GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELA 260
G + V G+SMGG+HAAMV +L P+ T + P SAV F G++ + W+ L ++
Sbjct: 700 GPLVVGGISMGGLHAAMVAALTAFPLGTASLVGPPSAVPVFTSGLMANLIPWKRLDKDAH 759
Query: 261 AKKVAMTL------------EEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA 301
+A L ++ E M L +T++ F P +P A IF A
Sbjct: 760 YYNLADRLKNRYFDVDKPKMDKAHELMGRFLRITNIENFDPPMVPEAAIFATA 812
>gi|444913711|ref|ZP_21233860.1| hypothetical protein D187_06030 [Cystobacter fuscus DSM 2262]
gi|444715534|gb|ELW56400.1| hypothetical protein D187_06030 [Cystobacter fuscus DSM 2262]
Length = 343
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/278 (35%), Positives = 134/278 (48%), Gaps = 26/278 (9%)
Query: 28 PPFFSRGWGGSKLELLERLIKQ----LFPEIEGQNWPPSLIQPIWRTIWETQTAVLREGV 83
P FF GWG S L LE+L + FPE+ P + +++EG
Sbjct: 20 PRFFEDGWGSSAL--LEKLTRGPQGFAFPELSDVRMSPPRRE---------GHLLVQEGR 68
Query: 84 FRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGGPLL 143
F +P + +LP AR L P+ P CV LA +GD F R + G L
Sbjct: 69 FPSPAA---VGSLPAACQEARFQLLLPQGAGPLPAVCVF-LASSGDEGFGLRRFIAGKLA 124
Query: 144 KENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKM 203
+ + ++LE+P+YG RRP Q+G + V+DLLL+ RAT EA LL WL G K+
Sbjct: 125 RSGVGALLLENPYYGSRRPPSQKGPAVRTVADLLLMFRATAVEATALLGWL-LARGHPKV 183
Query: 204 GVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKK 263
G+CG SMGG AA +L P PVA +P + H+A F EG+L WE L L
Sbjct: 184 GICGYSMGGSIAAYAAALFPLPVAVIPLAAAHTAAPVFTEGVLSALPDWETLGRPLG--- 240
Query: 264 VAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA 301
+ E R+R+ +LS T P P IF+AA
Sbjct: 241 ---STEAARQRLHELLSAAGTTTLPPLPHPKRAIFMAA 275
>gi|198422598|ref|XP_002127781.1| PREDICTED: similar to CG32112 CG32112-PB [Ciona intestinalis]
Length = 445
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/259 (37%), Positives = 139/259 (53%), Gaps = 23/259 (8%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQP-----IWRTIWET 74
++R I F++RGWG K E +++L+K + I + L+ P I + I
Sbjct: 8 LIYRRLILTKFYTRGWG--KPEEMKKLLK-MQKLISNRKTCAGLVSPDYKVNIDKKIEYK 64
Query: 75 QTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPK--------CVPPQKMACVVHLAG 126
+ VLR G F TP L +P S AR + P+ V P AC+ H+AG
Sbjct: 65 ECTVLR-GSFVTPAMNILSEVVPTVSQTARFEIVMPQKELHDGNSGVRP---ACI-HMAG 119
Query: 127 TGDHTFERRLRL-GGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIE 185
TGDH F RR L G PLL+ I +++LE+PFYG R+P Q + LL V+DL ++G I
Sbjct: 120 TGDHGFHRRRELLGKPLLESGITSVLLENPFYGSRKPKDQWRSGLLHVNDLFVMGSCLIL 179
Query: 186 EARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGI 245
EA+ LLHWL+ G+G +G+ G+SMGG A++ + P P+A +P +S SA V + EG+
Sbjct: 180 EAQVLLHWLK-RNGYGPLGLTGISMGGHMASLAATNWPEPLAVIPCMSWTSASVVWTEGV 238
Query: 246 LKHGTAWEALREELAAKKV 264
L W L + A V
Sbjct: 239 LSRAIPWRVLELQYAKNPV 257
>gi|156405934|ref|XP_001640986.1| predicted protein [Nematostella vectensis]
gi|156228123|gb|EDO48923.1| predicted protein [Nematostella vectensis]
Length = 448
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 88/253 (34%), Positives = 138/253 (54%), Gaps = 9/253 (3%)
Query: 22 HRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEG-QNWPPSLIQPIWRTIWETQTAV 78
+R+ + FF++GWG +L+ ++RL + F E E + PS W+ +
Sbjct: 15 YRSLVISKFFTKGWG--ELDAVKRLFDFRLEFKEREKCASLVPSSYPVHLDKSWKRDSYY 72
Query: 79 LREGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGTGDHTFERRLR 137
+ EG F +P + L LP +S AR + P K + VHLAGTGDH + RR
Sbjct: 73 MAEGHFLSPVAKYLPGILPQQSEYARFQVIIPTKWQHRNRKPMCVHLAGTGDHFYWRRRN 132
Query: 138 -LGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ PLLKE+ I +++LE+PFYG R+P Q+ + L V DL ++G I E+ LLHW E
Sbjct: 133 FMAKPLLKEHGIGSIILENPFYGSRKPKDQQRSSLKHVVDLFIMGTGLILESSVLLHWCE 192
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
G+G + + G+SMGG A++ ++ P P+A +P LS +A F EG+++ W+ L
Sbjct: 193 -RHGYGPLALTGISMGGHMASLAATVWPKPLAVVPCLSWSTASCVFTEGVMRKSLPWDFL 251
Query: 256 REELAAKKVAMTL 268
+++L TL
Sbjct: 252 KQQLEDDNYRETL 264
>gi|328718173|ref|XP_003246411.1| PREDICTED: uncharacterized protein C4orf29 homolog [Acyrthosiphon
pisum]
Length = 461
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 86/254 (33%), Positives = 133/254 (52%), Gaps = 15/254 (5%)
Query: 20 FMHRTKISPPFFSRGWGG-SKLELLERLIKQLFP-----EIEGQNWPPSLIQPIWRTIWE 73
+++R + +F GWG KL+ L + + ++ ++P +++ E
Sbjct: 8 YLYRKLLLTKYFVNGWGDPEKLKSLFQFRNHIIDRESCFKLVSADYPVKIVKKK-----E 62
Query: 74 TQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMA-CVVHLAGTGD-HT 131
++ EGVF++P L +P ESH A L PK P + + VH+AGTGD H
Sbjct: 63 DSDSITLEGVFQSPFSYYLPDIVPKESHLAHFQVLIPKKWPSKNVKPMCVHMAGTGDQHY 122
Query: 132 FERRLRLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCL 190
+ RR L PLLKE I +++LE+PFYG+R+P Q + L VSD+ ++G I E+ L
Sbjct: 123 WRRRAMLANPLLKEAAIGSIILENPFYGKRKPNNQVRSILCNVSDIFVMGGCLILESLVL 182
Query: 191 LHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGT 250
HW E E GFG +GV GLSMGG A++ + P P+ +P LS +A F EG +
Sbjct: 183 FHWCERE-GFGPIGVTGLSMGGHMASLAAASWPKPIVLVPCLSGTTASGVFTEGAISCAI 241
Query: 251 AWEALREELAAKKV 264
W L ++ + +
Sbjct: 242 DWNLLEQQYKSNSI 255
>gi|291230350|ref|XP_002735130.1| PREDICTED: CD029 protein-like [Saccoglossus kowalevskii]
Length = 509
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/251 (35%), Positives = 135/251 (53%), Gaps = 12/251 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLI----QPIWRTIWETQT 76
++R+ + FF+RGWG K E L+R+ + F ++ G + PI+ ET+
Sbjct: 7 LYRSLVLSKFFTRGWG--KPESLKRIFE--FQKVVGSRETCQHLVDKDYPIYVDKDETRG 62
Query: 77 AV-LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ER 134
+ EG F +P L +P E A+ F+ PK + +HLAGTGDH F R
Sbjct: 63 ECRIVEGHFLSPLQVHLPGIMPKEGEIAKFQFILPKTWKTRHKPVCIHLAGTGDHYFWRR 122
Query: 135 RLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
R + PLLKE IA+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHW
Sbjct: 123 RTMMARPLLKEYGIASLLLENPYYGTRKPKDQLRSSLHNVSDLFVMGGALILESLALLHW 182
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E + GFG +G+ G+SMGG A++ + P+ +P LS +A F G+L + W+
Sbjct: 183 CERQ-GFGPLGLTGISMGGHMASLAATNWNKPITLIPCLSGTTATPVFTRGVLSNAIPWK 241
Query: 254 ALREELAAKKV 264
L+ + V
Sbjct: 242 LLQTQYECDNV 252
>gi|195160687|ref|XP_002021206.1| GL24937 [Drosophila persimilis]
gi|194118319|gb|EDW40362.1| GL24937 [Drosophila persimilis]
Length = 512
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 141/274 (51%), Gaps = 25/274 (9%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEG------QNWPPSLIQPIWRTIW 72
++R + FF +GWG K E L R+ +++ E +++P + + + I+
Sbjct: 9 LYRRMLITRFFEKGWG--KPENLRRVFQFRKIISNRESCFKLVPRDYPVEITK---KEIY 63
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKC-VPPQKMACVVHLAGTGDHT 131
T + EG F TP + L +P +S NA L P Q +HLAGTGDH
Sbjct: 64 AESTLI--EGQFITPLELHLPGVVPKKSRNAYFQLLLPNTWKNEQHKPVCIHLAGTGDHF 121
Query: 132 FERRLR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARC 189
F RR + PLLKE NI +++LE+PFYG R+P Q+ + L VSD+ ++G I E
Sbjct: 122 FWRRRNFIAKPLLKEGNIGSIILENPFYGLRKPDDQKRSNLHNVSDIFVMGGCLILECLV 181
Query: 190 LLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHG 249
LLHW E GFG +GV GLSMGG A++ + P P+ +P LS +A F G++
Sbjct: 182 LLHWCE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQS 240
Query: 250 TAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
W+ L + + + RER+ ++++ D
Sbjct: 241 INWDMLETQYFSDG------QYRERLSKMVNVID 268
>gi|198464921|ref|XP_002134878.1| GA23563 [Drosophila pseudoobscura pseudoobscura]
gi|198149937|gb|EDY73505.1| GA23563 [Drosophila pseudoobscura pseudoobscura]
Length = 512
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 141/274 (51%), Gaps = 25/274 (9%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEG------QNWPPSLIQPIWRTIW 72
++R + FF +GWG K E L R+ +++ E +++P + + + I+
Sbjct: 9 LYRRMLITRFFEKGWG--KPENLRRVFQFRKIISNRESCFKLVPRDYPVEITK---KEIY 63
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKC-VPPQKMACVVHLAGTGDHT 131
T + EG F TP + L +P +S NA L P Q +HLAGTGDH
Sbjct: 64 AESTLI--EGQFITPLELHLPGVVPKKSRNAYFQLLLPNTWKNEQHKPVCIHLAGTGDHF 121
Query: 132 FERRLR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARC 189
F RR + PLLKE NI +++LE+PFYG R+P Q+ + L VSD+ ++G I E
Sbjct: 122 FWRRRNFIAKPLLKEGNIGSIILENPFYGLRKPDDQKRSNLHNVSDIFVMGGCLILECLV 181
Query: 190 LLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHG 249
LLHW E GFG +GV GLSMGG A++ + P P+ +P LS +A F G++
Sbjct: 182 LLHWCE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQS 240
Query: 250 TAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
W+ L + + + RER+ ++++ D
Sbjct: 241 INWDMLETQYFSDG------QYRERLSKMVNVID 268
>gi|307102375|gb|EFN50663.1| hypothetical protein CHLNCDRAFT_59467 [Chlorella variabilis]
Length = 204
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 110/189 (58%), Gaps = 13/189 (6%)
Query: 12 VLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLI--QPIWR 69
++D Y A +H+ I P FF +G+G L ++Q Q WPP Q W+
Sbjct: 15 IVDLAYAALVHQLGIIPRFFPKGFGSLDLIDFHEDVQQF------QRWPPDHFPQQLPWK 68
Query: 70 TIWETQTAV----LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLA 125
+ E+ + + FRTPC ++ ALP ES A + P P CVVHLA
Sbjct: 69 KLVESSYGKHGYKVFKASFRTPCQGRVYDALPAESRAAHAMLIVPDA-PADGAPCVVHLA 127
Query: 126 GTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIE 185
TGDH + RR LG PL+++ IAT+ LESP+YGQR+P QRG+KLL VSDLLLLGRATIE
Sbjct: 128 ATGDHGYARRSHLGLPLVQQGIATLALESPYYGQRKPHYQRGSKLLHVSDLLLLGRATIE 187
Query: 186 EARCLLHWL 194
E+ LLHWL
Sbjct: 188 ESLLLLHWL 196
>gi|86160679|ref|YP_467464.1| hypothetical protein Adeh_4263 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85777190|gb|ABC84027.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
2CP-C]
Length = 359
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 106/296 (35%), Positives = 136/296 (45%), Gaps = 25/296 (8%)
Query: 11 YVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRT 70
+VLD ++G T P FF+ GWG RL+K+L P + P+ I
Sbjct: 2 HVLDVLFGL----TAAGPRFFADGWGD------RRLVKRLQP-LPLARRAPARIDVSLGP 50
Query: 71 IWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDH 130
LR+G FR+P + LP + AR+ L P P + +A VHLA +GD
Sbjct: 51 PRAAHGGTLRDGSFRSP-----EARLPGCARAARIQVLLPDG-PLRGVA--VHLAASGDQ 102
Query: 131 TFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCL 190
F RLR PLL I +VLE+ FYG RRP Q + VSDL L+G AT +E R L
Sbjct: 103 GFAMRLRFAAPLLAHGIGAIVLENAFYGARRPERQARHAVRSVSDLYLMGAATFQEGRAL 162
Query: 191 LHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGT 250
L W ++GV G SMGG AAMVG+ P PVAT+P S G+L+
Sbjct: 163 LAWAREALDAPRVGVTGYSMGGQLAAMVGASMPWPVATVPLAPSCSPDSVLLSGVLRDVP 222
Query: 251 AWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTV 306
W AL + A + E R + LS V P P P A I V V
Sbjct: 223 DWAALAGDAADR------EAARVELCAGLSRFSVCALPPPVAPGAAIVVGTAADGV 272
>gi|410913335|ref|XP_003970144.1| PREDICTED: uncharacterized protein C4orf29 homolog [Takifugu
rubripes]
Length = 459
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/250 (35%), Positives = 132/250 (52%), Gaps = 13/250 (5%)
Query: 22 HRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIW-----ETQT 76
+R + F GWG K E L+R+ + F +I G + P ++ E
Sbjct: 10 YRRLLLTKLFIGGWG--KPEDLKRIFE--FRKIIGDREKCKSLVPKDYPVYINKTEENSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ EG F +P + + LPPE+ AR F+ PK + C+ HLAGTGDH F RR
Sbjct: 66 CYIHEGYFISPLEHFVSGILPPEAVKARFQFIVPKRWQKNRPVCI-HLAGTGDHFFWRRR 124
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWL
Sbjct: 125 TLMARPMIKEAGMASLLLENPYYGYRKPKDQLRSSLKNVSDLFVMGGALILESTVLLHWL 184
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+ +G+ G+SMGG A++ + P P+ +P LS +A F G+L W
Sbjct: 185 ERE-GYWPLGMTGISMGGYMASLAVTNWPKPIPLIPCLSWSTASSVFTRGVLSKAVNWAE 243
Query: 255 LREELAAKKV 264
L ++ A V
Sbjct: 244 LEKQYAINSV 253
>gi|195435702|ref|XP_002065818.1| GK18793 [Drosophila willistoni]
gi|194161903|gb|EDW76804.1| GK18793 [Drosophila willistoni]
Length = 502
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 140/274 (51%), Gaps = 25/274 (9%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEG------QNWPPSLIQPIWRTIW 72
++R + FF +GWG K E L R+ +++ E +++P + + + I+
Sbjct: 9 LYRRMLITRFFEKGWG--KPENLRRVFQFRKIISSRETCFKLVPRDYPVEITK---KEIY 63
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGTGDHT 131
T + EG F TP + L +P + NA L P K V +HLAGTGDH
Sbjct: 64 SESTLI--EGKFMTPLELHLPGVVPKAAQNAYFQLLIPNKWVDEHHKPVCIHLAGTGDHF 121
Query: 132 FERRLR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARC 189
F RR + PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I E
Sbjct: 122 FWRRRNFIAKPLLKDANIGSIILENPFYGLRKPDDQIRSNLHNVSDIFVMGGCLILECLV 181
Query: 190 LLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHG 249
LLHW E GFG +GV GLSMGG A++ + P P+ +P LS +A F G++
Sbjct: 182 LLHWCE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQS 240
Query: 250 TAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
W+ L + + + RER+ ++++ D
Sbjct: 241 INWDMLETQYYSDGL------YRERLSKMVTVID 268
>gi|157109670|ref|XP_001650775.1| hypothetical protein AaeL_AAEL005342 [Aedes aegypti]
gi|108878959|gb|EAT43184.1| AAEL005342-PA, partial [Aedes aegypti]
Length = 497
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 93/288 (32%), Positives = 140/288 (48%), Gaps = 38/288 (13%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLR 80
++R+ + FF +GWG K E LER + + IEG S +R I + A +
Sbjct: 9 LYRSLLLTKFFCKGWG--KPENLERYLSK----IEGDEHDKSFTLFAFRKIISNRAACSK 62
Query: 81 ----------------------EGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQK 117
EG F TP + L +P NA L P K +
Sbjct: 63 LVPQDYPIEITKEEVASDCKIIEGKFITPLEIYLPGLVPDVVQNAHFQVLLPLKWNDERF 122
Query: 118 MACVVHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSD 175
+HLAGTGDH F +RR + PLLKE N+ ++LE+PFYG R+P QR + L VSD
Sbjct: 123 KPMCIHLAGTGDHYFWKRRNLIAKPLLKEANLGAIILENPFYGARKPKDQRASSLHNVSD 182
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPH 235
+ ++G + E+ LL+W E G+G +G+ GLSMGG A++ + P P+ +P LS
Sbjct: 183 IFVMGGCLVLESLVLLNWCE-RNGYGPLGITGLSMGGHMASLAATNWPKPLVLVPCLSWS 241
Query: 236 SAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
+A F EG++ H W+ L + + RER+ ++++ D
Sbjct: 242 TASSVFTEGVMSHSINWDVLETQYFSDG------NYRERLSKMVTVVD 283
>gi|347969788|ref|XP_314274.5| AGAP003371-PA [Anopheles gambiae str. PEST]
gi|333469271|gb|EAA09613.6| AGAP003371-PA [Anopheles gambiae str. PEST]
Length = 558
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 90/271 (33%), Positives = 142/271 (52%), Gaps = 19/271 (7%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAV-- 78
++R+ + FF +GWG K E LERL F +I S + P + T+ +
Sbjct: 9 LYRSLLLTKFFCKGWG--KPENLERLFA--FRKIISNRAACSQLVPRDYPVEITKEEIHS 64
Query: 79 ---LREGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGTGDHTF-E 133
+ EG F +P + + +P + NA L P K + +HLAGTGDH + +
Sbjct: 65 DCKILEGKFISPLEIYMPGLVPDVAQNAHFQILLPLKWNDERYKPICIHLAGTGDHYYWK 124
Query: 134 RRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
RR + PLLKE N+ ++LE+PFYG R+P QR + L VSD+ ++G + E+ LL+
Sbjct: 125 RRNLIAKPLLKEANLGAIILENPFYGLRKPKEQRASSLQNVSDIFVMGGCLVLESLVLLN 184
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAW 252
W E G+G +G+ GLSMGG A++ + P P+ +P LS +A F EG++ H +W
Sbjct: 185 WCE-RNGYGPLGITGLSMGGHMASLAATNWPKPLVLVPCLSWSTASSVFTEGVMSHSISW 243
Query: 253 EALREELAAKKVAMTLEEVRERMRNVLSLTD 283
+ L + A RER+ ++++ D
Sbjct: 244 DVLETQYFADG------NFRERLSKMVTVVD 268
>gi|220919497|ref|YP_002494801.1| hypothetical protein A2cp1_4418 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219957351|gb|ACL67735.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
2CP-1]
Length = 359
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 107/296 (36%), Positives = 136/296 (45%), Gaps = 25/296 (8%)
Query: 11 YVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRT 70
+VLD ++G T P FF+ GWG RL+ +L P + P+ I
Sbjct: 2 HVLDVLFGL----TAAGPHFFADGWGD------RRLVAKLRP-LPLARRAPARIDVSLGP 50
Query: 71 IWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDH 130
LR+G FR+P S LP + AR+ L P P + +A VHLA +GD
Sbjct: 51 PRGAHGGTLRDGCFRSP-----ESRLPGCARAARIQVLLPAG-PLRGVA--VHLAASGDQ 102
Query: 131 TFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCL 190
F RLR PLL + I +VLE+ FYG RRP Q + VSDL L+G AT +E R L
Sbjct: 103 GFAMRLRFAAPLLAQGIGAVVLENAFYGARRPERQARHAVRSVSDLYLMGAATFQEGRAL 162
Query: 191 LHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGT 250
L W G ++GV G SMGG AAMVG+ P PVAT+P S G+L+
Sbjct: 163 LAWAREALGAPRVGVTGYSMGGQLAAMVGASMPFPVATVPLAPSCSPDSVLLSGVLRDVP 222
Query: 251 AWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAATVSTV 306
W AL A E R ++ LS V P P P A I V V
Sbjct: 223 DWAAL------AGRAADREAARRKLCAGLSRFSVCALPPPVAPGAAIVVGTAADGV 272
>gi|197124778|ref|YP_002136729.1| hypothetical protein AnaeK_4397 [Anaeromyxobacter sp. K]
gi|196174627|gb|ACG75600.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
Length = 359
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 94/245 (38%), Positives = 123/245 (50%), Gaps = 19/245 (7%)
Query: 11 YVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRT 70
+VLD ++G T P FF+ GWG RL+++L P + P+ I
Sbjct: 2 HVLDVLFGL----TAAGPHFFADGWGD------RRLVEKLRP-LPLARRAPARIDVSLGP 50
Query: 71 IWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDH 130
Q LR+G FR+P S LP + AR+ L P P + +A VHLA +GD
Sbjct: 51 PRAAQGGTLRDGCFRSP-----ESRLPGCARAARIQVLLPAG-PLRGVA--VHLAASGDQ 102
Query: 131 TFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCL 190
F RLR PLL + + +VLE+ FYG RRP Q + VSDL L+G AT +E R L
Sbjct: 103 GFAIRLRFAAPLLAQGLGAIVLENAFYGARRPERQARHAVRSVSDLYLMGAATFQEGRAL 162
Query: 191 LHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGT 250
L W G ++GV G SMGG AAMVG+ P PVAT+P S G+L+
Sbjct: 163 LAWAREALGAPRVGVTGYSMGGQLAAMVGASMPFPVATVPLAPSCSPDSVLLSGVLRDVP 222
Query: 251 AWEAL 255
W AL
Sbjct: 223 DWAAL 227
>gi|291167800|ref|NP_001013365.2| uncharacterized protein LOC503769 [Danio rerio]
Length = 454
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/251 (36%), Positives = 137/251 (54%), Gaps = 13/251 (5%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWP-PSLIQ---PIWRTIWETQT 76
++R + F RGWG K E L+R+ + F +I G SL++ PI+ E Q
Sbjct: 9 LYRRLLLTKLFIRGWG--KPEDLKRIFE--FRKIIGDREKCKSLVERDYPIFIDKVEDQA 64
Query: 77 AV-LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ER 134
+ G F +P + + LP ES AR F+ PK + C+ HLAGTGDH F R
Sbjct: 65 DCKIHSGHFISPLEHFVPGILPAESVKARFQFIVPKRWKKHRPVCI-HLAGTGDHFFWRR 123
Query: 135 RLRLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
R + P++KE+ +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHW
Sbjct: 124 RTLMARPMVKESGMASLLLENPYYGYRKPKDQLRSSLKNVSDLFVMGGALILESAALLHW 183
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
LE + GF +G+ G+SMGG A++ + P P+ +P LS +A F G+L W
Sbjct: 184 LERD-GFWPLGMTGISMGGHMASLAVTNWPKPIPLIPCLSWTTASSVFTTGVLSRAVNWR 242
Query: 254 ALREELAAKKV 264
L ++ A V
Sbjct: 243 ELEKQYATHTV 253
>gi|387014860|gb|AFJ49549.1| Uncharacterized protein C4orf29-like protein [Crotalus adamanteus]
Length = 461
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/251 (34%), Positives = 134/251 (53%), Gaps = 12/251 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEG-----QNWPPSLIQPIWRTIWETQ 75
++R + F +GWG + E L+R+ + F +I G QN P + E
Sbjct: 9 LYRRLLLTKLFIQGWG--RPEDLKRIFE--FRKIIGNREKCQNLVPRDYPVHINKVEEQS 64
Query: 76 TAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ER 134
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH F R
Sbjct: 65 DCKILDGHFISPLAHYVPDIMPSESITARFQFIVPKRWNSKYKPVCIHLAGTGDHYFWRR 124
Query: 135 RLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
R + P++KE ++A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHW
Sbjct: 125 RTLMARPMIKEASMASLLLENPYYGYRKPKDQLRSCLKNVSDLFVMGGALVLESAALLHW 184
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
LE E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 185 LEKE-GYGPLGMTGISMGGHMASLAVSNWPKPLPLVPCLSWSTASGVFTTGVLSKAVNWR 243
Query: 254 ALREELAAKKV 264
L ++ ++ V
Sbjct: 244 ELEKQYYSQSV 254
>gi|345481655|ref|XP_001605900.2| PREDICTED: uncharacterized protein C4orf29 homolog [Nasonia
vitripennis]
Length = 468
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/261 (35%), Positives = 137/261 (52%), Gaps = 32/261 (12%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIK------------QLFPEIEGQNWP 60
LD VY R+ + FF++GWG E L R+ K +L P +++P
Sbjct: 6 LDAVY-----RSILLTKFFTKGWGNP--ENLRRIFKFRKVVANREACYKLIP----RDYP 54
Query: 61 PSLIQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMAC 120
+ + E + EG F +P ++ L +P E+ A + PK QKM
Sbjct: 55 VKITKDE-----EWSDCHVLEGQFESPFEKNLPGIMPEETKTAHFQMILPKHWESQKMKP 109
Query: 121 V-VHLAGTGDHTFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLL 177
V +HLAGTGDH F RR L PLLKE+ IA+++LE+PFYG R+P Q + L VSD+
Sbjct: 110 VCLHLAGTGDHYFWRRRNLVAKPLLKESGIASILLENPFYGLRKPKDQIRSSLHNVSDIF 169
Query: 178 LLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSA 237
++G I E+ +L+WLE + GFG +G+ GLSMGG A++ + P P+ +P LS +A
Sbjct: 170 IMGGCLIMESIVILNWLE-QQGFGPLGLTGLSMGGHMASLAATNWPKPIPLVPCLSWSTA 228
Query: 238 VVAFCEGILKHGTAWEALREE 258
F +G++ W L +
Sbjct: 229 SPVFTQGVMSASINWALLESQ 249
>gi|194870019|ref|XP_001972569.1| GG15592 [Drosophila erecta]
gi|190654352|gb|EDV51595.1| GG15592 [Drosophila erecta]
Length = 510
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 139/278 (50%), Gaps = 22/278 (7%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLI---QPIWR 69
LD +Y R + FF +GWG K E L ++ Q I + L+ P+
Sbjct: 5 LDSIY-----RRMLITRFFEKGWG--KPENLRKVF-QFRKVISSRESCFKLVPRDYPVEI 56
Query: 70 TIWET-QTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGT 127
T E + L EG F+TP + + +P ES A L P +K + +HLAGT
Sbjct: 57 TKKEIGSDSTLIEGQFKTPLELHMPGVVPEESQQAHFQLLIPNKWRNEKHKPICIHLAGT 116
Query: 128 GDHTFERRLR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIE 185
GDH F RR + PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I
Sbjct: 117 GDHFFWRRRNFIAKPLLKDANIGSIILENPFYGLRKPNNQTRSNLHNVSDIFVMGGCLIL 176
Query: 186 EARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGI 245
E L HW E GFG +GV GLSMGG A++ + P P+ +P LS +A F G+
Sbjct: 177 ECLVLFHWCE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGV 235
Query: 246 LKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
+ W+ L + + + RER+ ++++ D
Sbjct: 236 MSQSINWDMLETQYFSDG------QYRERLSKMVTVID 267
>gi|413936218|gb|AFW70769.1| hypothetical protein ZEAMMB73_974630 [Zea mays]
Length = 150
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/99 (64%), Positives = 77/99 (77%), Gaps = 13/99 (13%)
Query: 217 MVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALRE-------------ELAAKK 263
MVGSLHPTPVATLPFL+PHSAVV FCEG+ K+ TAW+ALR+ E AA+K
Sbjct: 1 MVGSLHPTPVATLPFLAPHSAVVPFCEGVYKYATAWDALRKDAAVLTQDVTLLAEDAAQK 60
Query: 264 VAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAAT 302
+T+E+VR+R+R+VLSLTDVTRFP+PK P AVIFV AT
Sbjct: 61 SGITIEQVRDRLRSVLSLTDVTRFPVPKNPQAVIFVGAT 99
>gi|45551552|ref|NP_729820.2| CG32112, isoform B [Drosophila melanogaster]
gi|442632008|ref|NP_001261779.1| CG32112, isoform E [Drosophila melanogaster]
gi|45445910|gb|AAN11852.2| CG32112, isoform B [Drosophila melanogaster]
gi|440215711|gb|AGB94472.1| CG32112, isoform E [Drosophila melanogaster]
Length = 510
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 139/278 (50%), Gaps = 22/278 (7%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLI---QPIWR 69
LD +Y R + FF +GWG K E L ++ Q I + L+ P+
Sbjct: 5 LDSIY-----RRMLITRFFEKGWG--KPENLRKVF-QFRKVISSRESCFKLVPRDYPVEI 56
Query: 70 TIWET-QTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGT 127
T E + L EG F+TP + + +P ES A L P +K + +HLAGT
Sbjct: 57 TKKEIGAESTLIEGQFKTPMELHMPGVVPEESQQAHFQLLIPNKWKNEKHKPICIHLAGT 116
Query: 128 GDHTFERRLR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIE 185
GDH F RR + PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I
Sbjct: 117 GDHFFWRRRNFIAKPLLKDANIGSIILENPFYGLRKPNNQTRSNLHNVSDIFVMGGCLIL 176
Query: 186 EARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGI 245
E L HW E GFG +GV GLSMGG A++ + P P+ +P LS +A F G+
Sbjct: 177 ECLVLFHWCE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGV 235
Query: 246 LKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
+ W+ L + + + RER+ ++++ D
Sbjct: 236 MSQSINWDMLETQYFSDG------QYRERLSKMVTVID 267
>gi|195493913|ref|XP_002094617.1| GE21919 [Drosophila yakuba]
gi|194180718|gb|EDW94329.1| GE21919 [Drosophila yakuba]
Length = 511
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 137/270 (50%), Gaps = 17/270 (6%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLI---QPIWRTIWET-QT 76
++R + FF +GWG K E L ++ Q I + L+ P+ T E
Sbjct: 8 LYRRMLITRFFEKGWG--KPENLRKVF-QFRKVISSRESCFKLVPRDYPVEITKKEIGAE 64
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDHTFERR 135
+ L EG F+TP + + +P ES A L P +K + +HLAGTGDH F RR
Sbjct: 65 STLIEGQFKTPMELHMPGVVPEESQQAHFQLLIPNKWKNEKHKPICIHLAGTGDHFFWRR 124
Query: 136 LR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
+ PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I E L HW
Sbjct: 125 RNFIAKPLLKDANIGSIILENPFYGLRKPNNQTRSNLHNVSDIFVMGGCLILECLVLFHW 184
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E GFG +GV GLSMGG A++ + P P+ +P LS +A F G++ W+
Sbjct: 185 CE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQSINWD 243
Query: 254 ALREELAAKKVAMTLEEVRERMRNVLSLTD 283
L + + + RER+ ++++ D
Sbjct: 244 MLETQYFSDG------QYRERLSKMVTVID 267
>gi|256088674|ref|XP_002580452.1| hypothetical protein [Schistosoma mansoni]
Length = 493
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 105/187 (56%), Gaps = 2/187 (1%)
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF 132
E ++ + EG F +P D + +AL ++ AR + P+ A +H +GTGD +
Sbjct: 30 EDKSTIQIEGSFISPFDSVISNALKGDNRIARFQMIIPRKWSTNYRAVCIHFSGTGDQNY 89
Query: 133 -ERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLL 191
RR+ L L+K+ IA+++L PFY +R+P Q+G+ L VSDL ++G A I E LL
Sbjct: 90 YRRRVFLASSLIKDGIASIILMHPFYSKRKPDEQQGSGLNSVSDLFIMGGALIMECSALL 149
Query: 192 HWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTA 251
W E G+G + G+SMGG +A+ ++ P P++ +P LS SA F EGIL +
Sbjct: 150 KWCE-HNGYGPFALHGISMGGYMSALCATVWPKPISLIPCLSWTSASCVFLEGILSNTVN 208
Query: 252 WEALREE 258
W L ++
Sbjct: 209 WSVLTKQ 215
>gi|27804865|gb|AAO22902.1| hypothetical protein [Myxococcus xanthus]
Length = 326
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/324 (30%), Positives = 155/324 (47%), Gaps = 35/324 (10%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIW 72
+D ++ R ++ FS+GWG E+ ++++ Q PP I P W
Sbjct: 7 VDVLFAGLSRRARL----FSQGWGD------EQFLEEVAAAAPFQQRPPP-IAPEWSAPR 55
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF 132
+ +R+G F +P ++ L + A V +L+ P + ACVV LA + + F
Sbjct: 56 LQRGLQVRDGTFPSP-----LARLDAAARTAHVRWLS-AGQGPSRGACVV-LAASREEGF 108
Query: 133 ERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
R R+ PL +E I +LE+P+YG RRP+ Q+G L VSD +L+ ++EAR LL
Sbjct: 109 SLRERMYAPLAREGIDLFLLENPYYGLRRPVGQKGGALRTVSDHVLMNLGMVDEARALLA 168
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAW 252
WL E G ++GV G SMGG AA+ ++ P P+A + S V F +G+L A+
Sbjct: 169 WLRSE-GHARLGVAGYSMGGYMAALTAAVVPEPLAVAALAAGASPVPVFTQGLLSWSIAF 227
Query: 253 EALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA----------T 302
L E+ R R+ + L ++ RFP P+ P A + VA T
Sbjct: 228 ALL------DGPRRDAEQARSRLGRIFDLANLARFPPPRQPEAAVLVACRRDGFVPGDET 281
Query: 303 VSTVFDYHHEEVLKMDSQHFFALF 326
++ + E+ +D+ H ALF
Sbjct: 282 LALHAHWPRSELRWVDAGHVTALF 305
>gi|108761544|ref|YP_634971.1| hypothetical protein MXAN_6854 [Myxococcus xanthus DK 1622]
gi|108465424|gb|ABF90609.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
Length = 329
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/324 (30%), Positives = 155/324 (47%), Gaps = 35/324 (10%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIW 72
+D ++ R ++ FS+GWG E+ ++++ Q PP I P W
Sbjct: 10 VDVLFAGLSRRARL----FSQGWGD------EQFLEEVAAAAPFQQRPPP-IAPEWSAPR 58
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF 132
+ +R+G F +P ++ L + A V +L+ P + ACVV LA + + F
Sbjct: 59 LQRGLQVRDGTFPSP-----LARLDAAARTAHVRWLS-AGQGPSRGACVV-LAASREEGF 111
Query: 133 ERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
R R+ PL +E I +LE+P+YG RRP+ Q+G L VSD +L+ ++EAR LL
Sbjct: 112 SLRERMYAPLAREGIDLFLLENPYYGLRRPVGQKGGALRTVSDHVLMNLGMVDEARALLA 171
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAW 252
WL E G ++GV G SMGG AA+ ++ P P+A + S V F +G+L A+
Sbjct: 172 WLRSE-GHARLGVAGYSMGGYMAALTAAVVPEPLAVAALAAGASPVPVFTQGLLSWSIAF 230
Query: 253 EALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA----------T 302
L E+ R R+ + L ++ RFP P+ P A + VA T
Sbjct: 231 ALL------DGPRRDAEQARSRLGRIFDLANLARFPPPRQPEAAVLVACRRDGFVPGDET 284
Query: 303 VSTVFDYHHEEVLKMDSQHFFALF 326
++ + E+ +D+ H ALF
Sbjct: 285 LALHAHWPRSELRWVDAGHVTALF 308
>gi|357631626|gb|EHJ79095.1| hypothetical protein KGM_15485 [Danaus plexippus]
Length = 570
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 143/283 (50%), Gaps = 32/283 (11%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIK---------QLFPEIEGQNWPPSL 63
LD VY R+ + FF++GWG K E L RL + + F +E +++P ++
Sbjct: 6 LDAVY-----RSILLTKFFTKGWG--KPENLRRLFEFRKVVSNRDECFKLVE-RDYPVTI 57
Query: 64 IQPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVP-PQKMACVV 122
+ T L EG F TP + L +P + A L P P P+ +
Sbjct: 58 TKEQNLT-----DCRLLEGYFLTPLERYLPGIVPEIAQKAHFQILLPVHWPDPRCKPVCL 112
Query: 123 HLAGTGDHTFERRLRLG-GPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLG 180
HLAGTGDH F RR L PLLKE + ++LE+PFYG R+P Q + L VSD+ ++G
Sbjct: 113 HLAGTGDHFFWRRRNLMVKPLLKEAGVGGIILENPFYGLRKPTDQVRSSLHNVSDIFVMG 172
Query: 181 RATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVA 240
I E+ L HW E G G +GV GLSMGG A++ + P P+ +P LS +A
Sbjct: 173 GCLILESLVLFHWCE-RNGLGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAV 231
Query: 241 FCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
F +G++ H W+ L ++ + V RE++ ++++ D
Sbjct: 232 FLQGVMSHSINWDLLEDQYMSDGV------YREKLSKMVTIVD 268
>gi|427781861|gb|JAA56382.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 392
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 77/188 (40%), Positives = 109/188 (57%), Gaps = 6/188 (3%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAPK--CVPPQKMACVVHLAGTGDHTFERRLRL 138
EG +P + L +P ESH A L PK P + C+ HLAGTGDH F RR L
Sbjct: 32 EGHLVSPLVQYLPECVPKESHKAWFQVLLPKKWVTEPLRPLCI-HLAGTGDHYFWRRRTL 90
Query: 139 GG-PLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEW 196
PLLKEN +A+++LE+PFYG R+P Q + L CVSD+ ++G + E+ LLHW E
Sbjct: 91 TCRPLLKENGVASIILENPFYGLRKPKDQVRSNLHCVSDIFVMGGCLVLESMALLHWCER 150
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALR 256
E GFG +G+ G+SMGG A++ G+ P+ +P LS +A F +G++ WE L+
Sbjct: 151 E-GFGPLGITGISMGGHMASLAGANWYKPIGIIPCLSWTTASCVFTQGVMSGAIPWELLQ 209
Query: 257 EELAAKKV 264
+ + V
Sbjct: 210 SQYFSDHV 217
>gi|195589898|ref|XP_002084686.1| GD14399 [Drosophila simulans]
gi|194196695|gb|EDX10271.1| GD14399 [Drosophila simulans]
Length = 528
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 76/210 (36%), Positives = 113/210 (53%), Gaps = 10/210 (4%)
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDHTFERR 135
+ L EG F+TP + + +P ES A L P +K + +HLAGTGDH F RR
Sbjct: 83 STLIEGQFKTPMELHMPGVVPEESQQAHFQLLIPNKWKNEKHKPICIHLAGTGDHFFWRR 142
Query: 136 LR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
+ PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I E L HW
Sbjct: 143 RNFIAKPLLKDANIGSIILENPFYGLRKPNNQTRSNLHNVSDIFVMGGCLILECLVLFHW 202
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E GFG +GV GLSMGG A++ + P P+ +P LS +A F G++ W+
Sbjct: 203 CE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQSINWD 261
Query: 254 ALREELAAKKVAMTLEEVRERMRNVLSLTD 283
L + + + RER+ ++++ D
Sbjct: 262 MLETQYFSDG------QYRERLSKMVTVID 285
>gi|195327209|ref|XP_002030314.1| GM25367 [Drosophila sechellia]
gi|194119257|gb|EDW41300.1| GM25367 [Drosophila sechellia]
Length = 528
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 76/210 (36%), Positives = 113/210 (53%), Gaps = 10/210 (4%)
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDHTFERR 135
+ L EG F+TP + + +P ES A L P +K + +HLAGTGDH F RR
Sbjct: 83 STLIEGQFKTPMELHMPGVVPEESQQAHFQLLIPNKWKNEKHKPICIHLAGTGDHFFWRR 142
Query: 136 LR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
+ PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I E L HW
Sbjct: 143 RNFIAKPLLKDANIGSIILENPFYGLRKPNNQTRSNLHNVSDIFVMGGCLILECLVLFHW 202
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E GFG +GV GLSMGG A++ + P P+ +P LS +A F G++ W+
Sbjct: 203 CE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQSINWD 261
Query: 254 ALREELAAKKVAMTLEEVRERMRNVLSLTD 283
L + + + RER+ ++++ D
Sbjct: 262 MLETQYFSDG------QYRERLSKMVTVID 285
>gi|443709823|gb|ELU04328.1| hypothetical protein CAPTEDRAFT_223901 [Capitella teleta]
Length = 429
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 90/263 (34%), Positives = 139/263 (52%), Gaps = 12/263 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQTA 77
++R + FF++GWG E L+R+ K L + Q P PI E Q+
Sbjct: 7 IYRRFLLTKFFTKGWGDP--ENLKRIFDFRKILSNRDQCQQLVPKDY-PIHIDKDEAQSE 63
Query: 78 V-LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ + G F++P +QL +P E A+ + P + +HLAGTGDH F RR
Sbjct: 64 IRILHGHFKSPFVDQLPGIMPKEVETAKFQIILPMQWKSKLKPVCLHLAGTGDHGFGRRR 123
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ L PLLKE IA+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHW
Sbjct: 124 MLLARPLLKEAGIASIILENPYYGVRKPKDQWRSSLRNVSDLFVMGGALILESLALLHWC 183
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E G+G +G+ G+SMGG A++ + P++ +P LS +A F +G+L W+
Sbjct: 184 E-RHGYGPLGITGISMGGHMASLAATNWHKPISLIPCLSWTTASGVFTQGVLSGAIPWKL 242
Query: 255 LREELAAKKVAMTLEEVRERMRN 277
L ++ + T EV R+ +
Sbjct: 243 LEDQYYMDSIYET--EVASRIES 263
>gi|242006736|ref|XP_002424203.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507544|gb|EEB11465.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 511
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 128/249 (51%), Gaps = 21/249 (8%)
Query: 22 HRTKISPPFFSRGWGGSKLELLERLIK---------QLFPEIEGQNWPPSLIQPIWRTIW 72
+R+ + FF +GWG K E L++L + FP ++ ++P ++ + I
Sbjct: 10 YRSLVISKFFKKGWG--KPENLKKLFEFRKIVSKRETCFPLVD-TDYPVTITKEI----- 61
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDHT 131
++L EG F +P +E L LP S A L PK V +HLAGTGDH
Sbjct: 62 NYSDSILLEGQFLSPFEEYLPGLLPQVSKTAYFQMLLPKKWKSSHYKPVCLHLAGTGDHF 121
Query: 132 FERRLRL-GGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARC 189
F RR L PLLKE I ++LE+PFYG R+P QR + L VSD+ ++G I E+
Sbjct: 122 FFRRRNLMAKPLLKEAGIGALLLENPFYGLRKPKDQRWSSLHNVSDIFVMGGCLILESLV 181
Query: 190 LLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHG 249
L HW + GFG G+ G+SMGG A++ + P P+ +P LS +A F G++
Sbjct: 182 LFHWCK-RNGFGPFGLTGISMGGHMASLAATNIPEPIVLVPCLSWTTASGVFTRGVMSSA 240
Query: 250 TAWEALREE 258
WE L +
Sbjct: 241 INWELLEHQ 249
>gi|291401870|ref|XP_002717290.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 464
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 130/250 (52%), Gaps = 8/250 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNREKCQNLVSSDYPVYIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRE 244
Query: 255 LREELAAKKV 264
L ++ + V
Sbjct: 245 LEKQYYTQTV 254
>gi|170031419|ref|XP_001843583.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167869843|gb|EDS33226.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 531
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 117/213 (54%), Gaps = 10/213 (4%)
Query: 74 TQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDHTF 132
T + EG F+TP + L +P +A L P ++ V +HLAGTGDH F
Sbjct: 56 TSDCRILEGRFKTPLEIYLPGLVPDAVKDAHFQILLPNEWRDERYKPVCIHLAGTGDHYF 115
Query: 133 -ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCL 190
+RR + PLLKE N+ ++LE+PFYG R+P Q+ + L VSD+ ++G + E+ L
Sbjct: 116 WKRRNLIAKPLLKEANLGAIILENPFYGMRKPKDQKASSLHNVSDIFVMGGCLVLESLVL 175
Query: 191 LHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGT 250
L+W E G+G +G+ GLSMGG A++ + P P+ +P LS +A F EG++ H
Sbjct: 176 LNWCE-RNGYGPLGITGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTEGVMSHSI 234
Query: 251 AWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
W+ L+ + + RER+ ++++ D
Sbjct: 235 NWDVLQTQYFSDG------NYRERLSKMVTVVD 261
>gi|348582131|ref|XP_003476830.1| PREDICTED: uncharacterized protein C4orf29 homolog [Cavia
porcellus]
Length = 459
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 87/251 (34%), Positives = 132/251 (52%), Gaps = 12/251 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEG-----QNWPPSLIQPIWRTIWETQ 75
++R + F RGWG + E L+RL + F ++ G QN S I E
Sbjct: 9 LYRRLLLTKLFIRGWG--RPEHLKRLFE--FRKVIGNRERCQNLVSSDYPVYIDKIEEQS 64
Query: 76 TAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ER 134
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + R
Sbjct: 65 DCKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNNKYRPVCIHLAGTGDHHYWRR 124
Query: 135 RLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
R + P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHW
Sbjct: 125 RTLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHW 184
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
LE E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 185 LERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSVNWR 243
Query: 254 ALREELAAKKV 264
L ++ + V
Sbjct: 244 ELEKQYYTQTV 254
>gi|195378755|ref|XP_002048147.1| GJ11501 [Drosophila virilis]
gi|194155305|gb|EDW70489.1| GJ11501 [Drosophila virilis]
Length = 502
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 137/274 (50%), Gaps = 25/274 (9%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEG------QNWPPSLIQPIWRTIW 72
++R + FF +GWG K E L R+ +++ E +++P + +
Sbjct: 9 LYRRMLITRFFEKGWG--KPENLHRVFQFRKIISSRETCFKLVPRDYPVEITKKK----- 61
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGTGDHT 131
+ + L EG F TP + + +P + A L P K Q +HLAGTGDH
Sbjct: 62 KYSDSTLIEGNFTTPLELHMPGVVPEAAQQAYFQLLLPNKWNNEQHKPICIHLAGTGDHF 121
Query: 132 FERRLR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARC 189
F RR + PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I E
Sbjct: 122 FWRRRNFIAKPLLKDANIGSIILENPFYGLRKPDDQIRSNLHNVSDIFVMGGCLILECLV 181
Query: 190 LLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHG 249
LLHW E GFG +G+ GLSMGG A++ + P P+ +P LS +A F G++
Sbjct: 182 LLHWCE-RNGFGPLGITGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQS 240
Query: 250 TAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
W+ L + + + RER+ ++++ D
Sbjct: 241 INWDMLETQYYSDG------QYRERLSKMVTIID 268
>gi|195019700|ref|XP_001985036.1| GH14725 [Drosophila grimshawi]
gi|193898518|gb|EDV97384.1| GH14725 [Drosophila grimshawi]
Length = 496
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 137/278 (49%), Gaps = 33/278 (11%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIK------------QLFPEIEGQNWPPSLIQPIW 68
++R + FF +GWG E L R+ + +L P +++P + +
Sbjct: 9 LYRRMLITKFFEKGWGTP--ENLHRVFQFRKVISCRETCFKLVP----RDYPVEITKKK- 61
Query: 69 RTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGT 127
+ L EG F+TP + + +P + A L P K V + +HLAGT
Sbjct: 62 ----RYSDSTLIEGNFKTPMELHMPGVVPKAAQKAYFQILLPNKWVNEEHKPICIHLAGT 117
Query: 128 GDHTFERRLR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIE 185
GDH F RR + PL+K+ NI +++LE+PFYG R+P Q + L VSD+ ++G I
Sbjct: 118 GDHFFWRRRNFIAKPLMKDANIGSIILENPFYGLRKPDDQIRSNLRNVSDIFVMGGCLIL 177
Query: 186 EARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGI 245
E LLHW E GFG +GV GLSMGG A++ + P P+ +P LS +A F G+
Sbjct: 178 ECLVLLHWCE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGV 236
Query: 246 LKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTD 283
+ W+ L + + + RER+ ++++ D
Sbjct: 237 MSQSINWDMLETQYYSDG------QYRERLSKMVTVID 268
>gi|405371582|ref|ZP_11027105.1| Hypothetical protein A176_3551 [Chondromyces apiculatus DSM 436]
gi|397088771|gb|EJJ19732.1| Hypothetical protein A176_3551 [Myxococcus sp. (contaminant ex DSM
436)]
Length = 336
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/329 (30%), Positives = 161/329 (48%), Gaps = 41/329 (12%)
Query: 11 YVLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRT 70
+++D ++ R+++ FS+GWG E+ ++ + Q+ P S + P W
Sbjct: 5 HLVDFLFAGLSRRSRL----FSQGWGN------EQFLEDVAEAAPFQHLP-SPVTPAWSE 53
Query: 71 IWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDH 130
+ +R+G F +P ++ L + A V +L+ P + AC+V LA + +
Sbjct: 54 PRLQRGLQVRDGTFLSP-----LAGLDAAAQTAHVRWLSAGNGSP-RGACIV-LASSREE 106
Query: 131 TFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCL 190
F R RL PL +E I +LE+P+YG RRPL Q+G L VSD +L+ +EEAR L
Sbjct: 107 GFSLRERLYAPLAREGIDLFLLENPYYGLRRPLGQKGGALRTVSDHVLMNLGMVEEARAL 166
Query: 191 LHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGT 250
L WL +G ++GV G SMGG AA+ ++ PVA + S V F +G+L
Sbjct: 167 LAWLR-ASGRSRLGVAGYSMGGYMAALTAAVVSEPVAVAALAAGASPVPVFTQGLLSWSI 225
Query: 251 AWEAL---REELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA------ 301
A+ L R + A + R R+ + L ++TRFP PK P+A + VA
Sbjct: 226 AFALLDGPRGDAA---------QARLRLGRIFDLANLTRFPPPKQPDAAVLVACRRDGFV 276
Query: 302 ----TVSTVFDYHHEEVLKMDSQHFFALF 326
T++ + E+ +D+ H ALF
Sbjct: 277 PGEETLALHAHWPGSELRWVDAGHVSALF 305
>gi|153007255|ref|YP_001381580.1| hypothetical protein Anae109_4418 [Anaeromyxobacter sp. Fw109-5]
gi|152030828|gb|ABS28596.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
Length = 368
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/309 (33%), Positives = 138/309 (44%), Gaps = 36/309 (11%)
Query: 30 FFSRGWGGSKL---ELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLREGVFRT 86
FF GWG + E L+++ IE + P VL +G+F +
Sbjct: 17 FFEDGWGDRAICDATDPEALLRRRARPIEVRLGPGR----------RAHCGVLHDGIFES 66
Query: 87 PCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGGPLLKEN 146
P + LP + AR+ L PK P + VHLA +GD F RLR PLL
Sbjct: 67 PEER-----LPACARRARIQLLLPKG--PVR-GVYVHLAASGDQGFGLRLRFAEPLLASG 118
Query: 147 IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVC 206
+ +VLE+ +YG RRP QR L VSD+ L+ AT+ E R LL WL E G G +GV
Sbjct: 119 VGAVVLENAYYGGRRPERQRAHALRSVSDMHLMAAATLLEGRALLRWLRDELGVGLVGVT 178
Query: 207 GLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKVAM 266
G SMGG AAMVG+ PVA +P S +G+L+H +W L E +
Sbjct: 179 GYSMGGQLAAMVGAAMSFPVAVVPIAPACSPDSVLRQGVLRHVPSWPKLAAEGEDEAA-- 236
Query: 267 TLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVA---------ATVSTVFDYHHEEVLKM 317
VRE + S VT P P P A I V + + + +Y E+ +
Sbjct: 237 ----VREVLLGRASRFSVTCLPAPVYPEAAIVVGTERDGFVPPSDMRRIAEYWGAELRWL 292
Query: 318 DSQHFFALF 326
+ H AL
Sbjct: 293 PAGHVSALL 301
>gi|426345446|ref|XP_004040424.1| PREDICTED: uncharacterized protein C4orf29 homolog [Gorilla gorilla
gorilla]
Length = 464
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 129/250 (51%), Gaps = 8/250 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRE 244
Query: 255 LREELAAKKV 264
L ++ + V
Sbjct: 245 LEKQYYTQTV 254
>gi|194747607|ref|XP_001956243.1| GF24694 [Drosophila ananassae]
gi|190623525|gb|EDV39049.1| GF24694 [Drosophila ananassae]
Length = 528
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/210 (37%), Positives = 112/210 (53%), Gaps = 10/210 (4%)
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDHTFERR 135
+ L EG F TP + L +P S A L P +K V +HLAGTGDH F RR
Sbjct: 84 STLVEGKFITPLELHLPGVVPKASQYAHFQLLIPNKWKSEKHKPVCIHLAGTGDHFFWRR 143
Query: 136 LR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
+ PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I E LLHW
Sbjct: 144 RNFIAKPLLKDANIGSIILENPFYGVRKPDDQTRSNLHNVSDIFVMGGCLILECLVLLHW 203
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E GFG +GV GLSMGG A++ + P P+ +P LS +A F G++ W+
Sbjct: 204 CE-RNGFGPLGVTGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQSINWD 262
Query: 254 ALREELAAKKVAMTLEEVRERMRNVLSLTD 283
L + + + RER+ ++++ D
Sbjct: 263 MLETQYFSDG------QFRERLSKMVTIVD 286
>gi|432961056|ref|XP_004086552.1| PREDICTED: uncharacterized protein C4orf29 homolog [Oryzias
latipes]
Length = 478
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 72/188 (38%), Positives = 107/188 (56%), Gaps = 4/188 (2%)
Query: 79 LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRLR 137
+ +G F +P + + LPPE+ AR F+ PK + C+ LAGTGDH F RR
Sbjct: 94 IHDGFFISPLEHLVPGILPPEAIKARFQFIVPKVWKKNRPVCI-QLAGTGDHFFWRRRTL 152
Query: 138 LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEW 196
+ P++KE IA+++LE+P+YG R+P Q + L VSDL ++G A I E+ LL WLE
Sbjct: 153 MARPMIKEAGIASLLLENPYYGYRKPKDQLRSSLKNVSDLFVMGGALILESTALLRWLER 212
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALR 256
E G+ +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L
Sbjct: 213 E-GYWPLGMTGISMGGYMASLAVTNWPKPIPLVPCLSWSTASSVFTTGVLSKAVNWSQLE 271
Query: 257 EELAAKKV 264
++ A V
Sbjct: 272 KQYAVNSV 279
>gi|334330750|ref|XP_003341402.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C4orf29
homolog [Monodelphis domestica]
Length = 461
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/249 (34%), Positives = 133/249 (53%), Gaps = 8/249 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQTAV 78
++R + F RGWG + E L+RL +++ E S P++ E +T
Sbjct: 9 LYRRLLLTKLFIRGWG--RPEDLKRLFEFRKVIGNRETCQNMVSRDYPVYVDKIEDETDC 66
Query: 79 -LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRL 136
+ EG F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 67 KILEGHFVSPMAHHVPDLMPIESVIARFQFIVPKEWNNKYRPVCIHLAGTGDHHYWRRRT 126
Query: 137 RLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 127 LMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
E G+G +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L
Sbjct: 187 RE-GYGPLGMTGISMGGHMASLAVTNWPKPLPLIPCLSWSTASGVFTTGVLSKSVNWREL 245
Query: 256 REELAAKKV 264
++ + V
Sbjct: 246 EKQYYTQTV 254
>gi|326918436|ref|XP_003205494.1| PREDICTED: uncharacterized protein C4orf29 homolog [Meleagris
gallopavo]
Length = 461
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/249 (34%), Positives = 133/249 (53%), Gaps = 8/249 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQTAV 78
++R + F RGWG K E L+R+ +++ E S P++ E Q+
Sbjct: 9 LYRKLLLTKLFIRGWG--KPEDLKRIFEFRKIIGNREKCQTLVSKDYPVFIDKVEEQSDC 66
Query: 79 -LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRL 136
+ EG F +P + LP ES AR F+ P+ + +HLAGTGDH F RR
Sbjct: 67 KILEGHFISPLAHYVPGILPVESLVARFQFITPRRWNGKHRPVCIHLAGTGDHHFWRRRT 126
Query: 137 RLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 127 LMARPMIKEACMASLLLENPYYGCRKPKDQIRSCLKNVSDLFVMGGALVLESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
E G+G +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L
Sbjct: 187 RE-GYGPLGMTGISMGGHMASLAVTNWPKPLPLIPCLSWSTASAVFTTGVLSKAVNWREL 245
Query: 256 REELAAKKV 264
++ + V
Sbjct: 246 EKQYYTQTV 254
>gi|167830469|ref|NP_080898.2| uncharacterized protein C4orf29 homolog precursor [Mus musculus]
gi|81898417|sp|Q8C1A9.1|CD029_MOUSE RecName: Full=Uncharacterized protein C4orf29 homolog; Flags:
Precursor
gi|26324524|dbj|BAC26016.1| unnamed protein product [Mus musculus]
gi|74142127|dbj|BAE41123.1| unnamed protein product [Mus musculus]
Length = 464
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 129/250 (51%), Gaps = 8/250 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPGIMPIESVVARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALILESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRE 244
Query: 255 LREELAAKKV 264
L ++ + V
Sbjct: 245 LEKQYYTQTV 254
>gi|195127563|ref|XP_002008238.1| GI13377 [Drosophila mojavensis]
gi|193919847|gb|EDW18714.1| GI13377 [Drosophila mojavensis]
Length = 495
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/270 (33%), Positives = 136/270 (50%), Gaps = 17/270 (6%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLI---QPIWRTIWETQT- 76
++R + FF +GWG K E L R+ Q I + L+ P+ T +T +
Sbjct: 9 LYRRMLITRFFEKGWG--KPENLHRVF-QFRKIISCRETCFKLVPRDYPVEITKKKTYSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGTGDHTFERR 135
+ L EG F TP + + +P + A L P K +HLAGTGDH F RR
Sbjct: 66 STLIEGNFTTPLELHMPGVVPEAAQQAHFQLLLPNKWNDEHHKPICIHLAGTGDHFFWRR 125
Query: 136 LR-LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
+ PLLK+ NI +++LE+PFYG R+P Q + L VSD+ ++G I E LLHW
Sbjct: 126 RNFIAKPLLKDANIGSIILENPFYGLRKPDDQIRSNLHNVSDIFVMGGCLILECLVLLHW 185
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E GFG +G+ GLSMGG A++ + P P+ +P LS +A F G++ W+
Sbjct: 186 CE-RNGFGPLGITGLSMGGHMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQSINWD 244
Query: 254 ALREELAAKKVAMTLEEVRERMRNVLSLTD 283
L + + + RER+ ++++ D
Sbjct: 245 MLETQYYSDG------QYRERLSKMVTIVD 268
>gi|354485455|ref|XP_003504899.1| PREDICTED: uncharacterized protein C4orf29 homolog [Cricetulus
griseus]
Length = 463
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 129/250 (51%), Gaps = 8/250 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPGIMPIESVIARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALILESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRE 244
Query: 255 LREELAAKKV 264
L ++ + V
Sbjct: 245 LEKQYYTQTV 254
>gi|291167802|ref|NP_001026305.2| uncharacterized protein LOC422499 [Gallus gallus]
Length = 462
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 85/243 (34%), Positives = 131/243 (53%), Gaps = 8/243 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQTAV 78
++R + F RGWG K E L+R+ +++ E S P++ E Q+
Sbjct: 9 LYRKLLLTKLFIRGWG--KPEDLKRIFEFRKIIGNREKCQTLVSKDYPVFIDKVEEQSDC 66
Query: 79 -LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRL 136
+ EG F +P + LP ES AR F+ P+ + +HLAGTGDH F RR
Sbjct: 67 KILEGHFISPLAHYVPGILPVESLVARFQFITPRRWNSKHRPVCIHLAGTGDHHFWRRRT 126
Query: 137 RLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 127 LMARPMIKEACMASLLLENPYYGCRKPKDQIRSCLKNVSDLFVMGGALVLESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
E G+G +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L
Sbjct: 187 RE-GYGPLGMTGISMGGHMASLAVTNWPKPLPLIPCLSWSTASAVFTTGVLSKAVNWREL 245
Query: 256 REE 258
++
Sbjct: 246 EKQ 248
>gi|167830472|ref|NP_001020210.2| uncharacterized protein C4orf29 homolog precursor [Rattus
norvegicus]
Length = 464
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 129/250 (51%), Gaps = 8/250 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPGIMPIESVIARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALILESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRE 244
Query: 255 LREELAAKKV 264
L ++ + V
Sbjct: 245 LEKQYYTQTV 254
>gi|321455299|gb|EFX66436.1| hypothetical protein DAPPUDRAFT_332220 [Daphnia pulex]
Length = 464
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 131/262 (50%), Gaps = 21/262 (8%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRT 70
LD VY R + FF RGWG E ++RL +++ + E SL+ P+ +
Sbjct: 6 LDRVY-----RNLVLSKFFVRGWGNP--ENIKRLFDFRKIVSDREKCQ---SLVDPMHKV 55
Query: 71 IWETQTA----VLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQK--MACVVHL 124
+ + + G F++P L +P ES A L PK VV L
Sbjct: 56 TFTKEEDHKHYKILNGHFQSPFAYHLPGLVPEESETAHFQMLLPKKWNWSNGLKPMVVQL 115
Query: 125 AGTGDHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRA 182
AGTGDH F RR+ +G PLL E N+ +++LE+PFYG R+P Q+ + L VSD+ ++G
Sbjct: 116 AGTGDHFFWRRRILMGKPLLNEWNVGSIILENPFYGLRKPKEQKLSCLHNVSDIFVMGGC 175
Query: 183 TIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFC 242
I E+ L HW E G G +GV G+SMGG A++ S P PV +P LS +A F
Sbjct: 176 LILESLVLFHWCE-RNGLGPLGVTGVSMGGHMASLAASSWPKPVVLVPCLSWSTASAVFT 234
Query: 243 EGILKHGTAWEALREELAAKKV 264
G++ W L + + ++
Sbjct: 235 RGVMSGAIDWSLLESQYFSNQI 256
>gi|395541759|ref|XP_003772806.1| PREDICTED: uncharacterized protein C4orf29 homolog [Sarcophilus
harrisii]
Length = 467
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 85/248 (34%), Positives = 131/248 (52%), Gaps = 8/248 (3%)
Query: 22 HRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIW-RTIWETQTAV 78
+R + F RGWG + E L+RL +++ E S P++ I E
Sbjct: 10 YRRLLLTKLFIRGWG--RPEDLKRLFEFRKIIGNREKCQNMVSRDYPVYIDKIEEETDCK 67
Query: 79 LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRLR 137
+ EG F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 68 ILEGHFVSPMAHYVPDLMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHYYWRRRTL 127
Query: 138 LGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEW 196
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 128 MARPMIKEACMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLER 187
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALR 256
E G+G +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L
Sbjct: 188 E-GYGPLGMTGISMGGHMASLAVTNWPKPLPLIPCLSWSTASGVFTTGVLSKSINWRELE 246
Query: 257 EELAAKKV 264
++ + V
Sbjct: 247 KQYYTQTV 254
>gi|327274080|ref|XP_003221806.1| PREDICTED: uncharacterized protein C4orf29 homolog [Anolis
carolinensis]
Length = 462
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 86/249 (34%), Positives = 132/249 (53%), Gaps = 8/249 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQTAV 78
++R + F +GWG + E L+R+ ++L E S P+ E Q+
Sbjct: 9 LYRRLLLTKLFIQGWG--RPEDLKRIFEFRKLIGNREKCQNLVSRDYPVHINKVEEQSDC 66
Query: 79 -LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRL 136
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH F RR
Sbjct: 67 KILDGHFVSPLAHYVPEIMPSESITARFQFIVPKRWNSKYRPVCIHLAGTGDHHFWRRRT 126
Query: 137 RLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 127 LMARPMIKEACMASLLLENPYYGCRKPKDQIRSCLKNVSDLFVMGGALVLESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
E G+G +GV G+SMGG A++ S P P+ +P LS +A F G+L W L
Sbjct: 187 RE-GYGPLGVTGISMGGHMASLAVSNWPKPLPLVPCLSWSTASGVFTTGVLSKAVNWREL 245
Query: 256 REELAAKKV 264
++ + V
Sbjct: 246 EKQYYTQSV 254
>gi|390342494|ref|XP_781317.3| PREDICTED: uncharacterized protein C4orf29 homolog
[Strongylocentrotus purpuratus]
Length = 576
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 83/254 (32%), Positives = 127/254 (50%), Gaps = 18/254 (7%)
Query: 30 FFSRGWGG-----SKLELLERLI-KQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLREGV 83
FF+RGWG + + L L + + +++P ++ + E + +G
Sbjct: 20 FFTRGWGNPATIKRQFDFLRVLADRSSCQRLVNEHYPVNIDSDV-----EKGDVRIVDGN 74
Query: 84 FRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRLRLGGPL 142
FR+P D L +P E AR + P +K +H+AGTGDH F RR L PL
Sbjct: 75 FRSPFDRYLPDIMPKEVKTARFQLIVPTQWRTEKKPVCIHMAGTGDHFFWRRRTFLARPL 134
Query: 143 LKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFG 201
+KE I ++++E+PFYG R+P Q + L V+DL ++G I E +LHW E + GFG
Sbjct: 135 IKEYGIGSLLIENPFYGYRKPKEQLRSSLRHVNDLFVMGGGLILEGLVMLHWCE-KQGFG 193
Query: 202 KMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAA 261
+G+ G+SMGG A++ + P+ +P LS S AF +G+L W L
Sbjct: 194 PLGLTGISMGGHMASLAATNWHKPIPLIPCLSWSSGTPAFTKGVLSGSIPWPV----LVT 249
Query: 262 KKVAMTLEEVRERM 275
+ + LE RE M
Sbjct: 250 QYLGQHLEYEREIM 263
>gi|348541369|ref|XP_003458159.1| PREDICTED: uncharacterized protein C4orf29 homolog [Oreochromis
niloticus]
Length = 450
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 86/247 (34%), Positives = 131/247 (53%), Gaps = 13/247 (5%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVL- 79
+R + F GWG K E L+R+ + F +I G + P ++ +T L
Sbjct: 9 FYRRLLLTKLFIGGWG--KPEDLKRIFE--FRKIIGDREKCKSLVPKDYPVYINKTEELA 64
Query: 80 ----REGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ER 134
+EG F +P + + LP E+ AR F+ PK + C+ HLAGTGDH F R
Sbjct: 65 DCHVQEGFFISPLEHLVPGILPQEAIKARFQFIVPKRWQKNRPVCI-HLAGTGDHFFWRR 123
Query: 135 RLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
R + P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LL W
Sbjct: 124 RTLMARPMIKEAGMASLLLENPYYGYRKPRDQLRSSLKNVSDLFVMGGALILESTVLLRW 183
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
LE E G+ +G+ G+SMGG A++ + P P+ +P LS +A F G+L W
Sbjct: 184 LERE-GYWPLGMTGISMGGYMASLAVTNWPKPIPLIPCLSWSTASSVFTTGVLSKAVNWT 242
Query: 254 ALREELA 260
L ++ A
Sbjct: 243 QLEKQYA 249
>gi|307183299|gb|EFN70168.1| Uncharacterized protein C4orf29-like protein [Camponotus
floridanus]
Length = 479
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 89/256 (34%), Positives = 133/256 (51%), Gaps = 15/256 (5%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIW 72
LD VY R+ + FF++GWG + L K++ E L P+ T
Sbjct: 6 LDAVY-----RSILLTKFFAKGWGNP-----QNLKKKVIANREACYNMIPLDYPVEITKD 55
Query: 73 ETQTAV-LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDH 130
E + + EG F TP ++ L +P ++ A + P K+ + +HLAGTGDH
Sbjct: 56 EVWSDCHIIEGQFETPFEKHLPGLMPDQAKIAYFQVILPTKWSSHKIKPICLHLAGTGDH 115
Query: 131 TFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEAR 188
F RR L PLLKE IA+++LE+PFYG R+P Q + L V D+ ++G I E+
Sbjct: 116 FFWRRRNLIAKPLLKETGIASILLENPFYGLRKPNNQIRSSLNNVCDIFIMGGCLIMESI 175
Query: 189 CLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKH 248
LL+W E + GFG +G+ GLSMGG A++ + P P++ +P LS SA F EG++
Sbjct: 176 VLLNWCE-QQGFGPLGLTGLSMGGHMASLAATNWPKPISLVPCLSWSSASPVFTEGVMSA 234
Query: 249 GTAWEALREELAAKKV 264
W L + + K+
Sbjct: 235 SINWSLLETQYFSNKI 250
>gi|449499682|ref|XP_002189591.2| PREDICTED: uncharacterized protein C4orf29 homolog [Taeniopygia
guttata]
Length = 463
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 133/249 (53%), Gaps = 8/249 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQTAV 78
++R + F RGWG K E L+R+ +++ E S P++ E Q+
Sbjct: 9 LYRKLLLTKLFIRGWG--KPEDLKRIFEFRKIIGNREKCQKLVSKDYPVFIDKVEGQSDC 66
Query: 79 -LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRL 136
+ EG F +P + LP ES AR F+ P+ + +HLAGTGDH F RR
Sbjct: 67 KILEGHFISPLAHCVPGILPVESLVARFQFIIPRRWNGKHRPVCIHLAGTGDHHFWRRRT 126
Query: 137 RLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWLE
Sbjct: 127 LMARPMIKEACMASLLLENPYYGCRKPKDQLRSCLKNVSDLFVMGGALILESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
E G+G +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L
Sbjct: 187 RE-GYGPLGMTGISMGGHMASLAVTNWPKPLPLIPCLSWSTASAVFTTGVLSKAVNWREL 245
Query: 256 REELAAKKV 264
++ + V
Sbjct: 246 EKQYFTQTV 254
>gi|213512899|ref|NP_001133359.1| CD029 protein [Salmo salar]
gi|209151902|gb|ACI33088.1| C4orf29 homolog precursor [Salmo salar]
Length = 451
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 134/250 (53%), Gaps = 11/250 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLI---QPIWRTIWETQTA 77
++R + F +GWG K + L+R+ +L + + L+ P++ E Q+
Sbjct: 9 LYRKLLLTKLFIQGWG--KPDDLKRIF-ELRKIVGNREKCKELVPKDYPVFIDKVEDQSD 65
Query: 78 V-LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ G F +P + + LP ES AR F+ PK K C+ HLAGTGDH F +RR
Sbjct: 66 CKIHNGYFISPLEHIVPGILPSESIKARFQFIVPKKWKNHKPVCI-HLAGTGDHFFWKRR 124
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWL
Sbjct: 125 TLMARPMIKEAGMASLLLENPYYGYRKPKEQVRSSLRNVSDLFVMGAALILESAVLLHWL 184
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E + +G+ G+SMGG A++ + P P+ +P LS +A F G+L W
Sbjct: 185 ERE-DYWPLGMTGISMGGHMASLAVTNWPKPIPLIPCLSWTTASNVFTTGVLSKAVNWRE 243
Query: 255 LREELAAKKV 264
L ++ A V
Sbjct: 244 LEKQYAMHSV 253
>gi|338532091|ref|YP_004665425.1| hypothetical protein LILAB_12200 [Myxococcus fulvus HW-1]
gi|337258187|gb|AEI64347.1| hypothetical protein LILAB_12200 [Myxococcus fulvus HW-1]
Length = 329
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 152/324 (46%), Gaps = 35/324 (10%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIW 72
+D ++ R ++ FS+GWG L E F E P I P W
Sbjct: 10 VDVLFAGLSRRARL----FSQGWGDEAF-LEEVAAAAPFQER------PLPIAPEWSAPR 58
Query: 73 ETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF 132
+ +R+G F +P ++ L + A V +L+ P + ACVV LA + + F
Sbjct: 59 LQRGLRVRDGTFPSP-----LARLDVAARTAHVRWLS-AGQGPSRGACVV-LAASREEGF 111
Query: 133 ERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
R R+ PL +E + +LE+P+YG RRP+ Q+G L VSD +L+ ++EAR LL
Sbjct: 112 SLRERMYAPLAREGLDLFLLENPYYGLRRPVGQKGGALRTVSDHVLMNLGMVDEARALLA 171
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAW 252
WL E G ++GV G SMGG AA+ ++ P P+A + S V F +G+L A+
Sbjct: 172 WLRGE-GHARLGVAGYSMGGYMAALTAAVVPEPLAVAALAAGASPVPVFTQGLLSWSIAF 230
Query: 253 EALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA----------T 302
L E+ R R+ + L ++TRFP P+ P A + VA T
Sbjct: 231 ALL------DGPRRDAEQARLRLGRIFDLANLTRFPPPRQPEAAVLVACRRDGFVPGDET 284
Query: 303 VSTVFDYHHEEVLKMDSQHFFALF 326
++ + E+ +D+ H ALF
Sbjct: 285 LALHAHWPRSELRWVDAGHVTALF 308
>gi|91092776|ref|XP_973805.1| PREDICTED: similar to CG32112 CG32112-PB [Tribolium castaneum]
gi|270014895|gb|EFA11343.1| hypothetical protein TcasGA2_TC010883 [Tribolium castaneum]
Length = 448
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 79/242 (32%), Positives = 126/242 (52%), Gaps = 14/242 (5%)
Query: 31 FSRGWGGSK--LELLE--RLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLREGVF 84
F++GWG + +EL +L+ + E+ +++P +I + + EG
Sbjct: 19 FTKGWGDPENLMELFNFRKLVSNRNACYELLPKDYPVDII-----NVQRFTDFQILEGRL 73
Query: 85 RTPCDEQLMSALPPESHNARVAFLAPKCVPPQKM-ACVVHLAGTGDHTFERRLR-LGGPL 142
TP L S L PE + P+ P +HLAGTGDH F RR + PL
Sbjct: 74 WTPFRMFLPSLLVPEIQQVYFQMILPRKWPSSDYRPLCIHLAGTGDHYFWRRRNFMAKPL 133
Query: 143 LKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGK 202
L+ I ++++E+P+YG R+P Q + L VSD+ ++G I E LL+W E + GFG
Sbjct: 134 LQAGIGSLIVENPYYGTRKPKDQLRSSLHYVSDIFVMGGCLILETLALLNWCE-QIGFGP 192
Query: 203 MGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAK 262
+GV G+SMGG A++ S P P+ +P LS +A F EG++ W+ L+++L +
Sbjct: 193 LGVSGISMGGHMASLAASNWPKPLVLVPCLSWSTASSVFTEGVMSESIDWDMLQKQLFSN 252
Query: 263 KV 264
K+
Sbjct: 253 KI 254
>gi|196006373|ref|XP_002113053.1| hypothetical protein TRIADDRAFT_25351 [Trichoplax adhaerens]
gi|190585094|gb|EDV25163.1| hypothetical protein TRIADDRAFT_25351 [Trichoplax adhaerens]
Length = 399
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 84/231 (36%), Positives = 124/231 (53%), Gaps = 13/231 (5%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIK-QLFPEIEGQNWPPSLIQP-----IWRTIWET 74
++R I +F+ GWG ++LL R++ + ++ W L+ P I + I
Sbjct: 10 LYRRIIIAKYFTSGWG--DVDLLRRIVSFRNDATVDASKWR-KLVSPNHPVTIDKRIKNN 66
Query: 75 QTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFER 134
VL +G F +P E ++ E AR + PK K +HLAGTGDH F R
Sbjct: 67 SYEVL-QGHFTSPIVEFFGPSMLEEIKTARFEVVLPKNWNTDKKPMCIHLAGTGDHFFWR 125
Query: 135 RLRLGG-PLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
R L PLLKE I +++LE+P+YG R+P QR + L V+DL L+G A + E+ LLH
Sbjct: 126 RRHLMAIPLLKEYGIGSILLENPYYGVRKPKEQRRSSLKYVADLFLMGIALVLESSVLLH 185
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCE 243
W + GFG + + G+SMGG A++ + P VA +P LS +A +AF E
Sbjct: 186 WCQ-NMGFGPLCLHGISMGGHMASLAATAWPESVAVVPCLSWSTASIAFTE 235
>gi|345309596|ref|XP_001521395.2| PREDICTED: uncharacterized protein C4orf29 homolog, partial
[Ornithorhynchus anatinus]
Length = 321
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 85/251 (33%), Positives = 130/251 (51%), Gaps = 12/251 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEG-----QNWPPSLIQPIWRTIWETQ 75
++R + F RGWG + E L+RL + F +I G QN + E
Sbjct: 9 LYRRLLLTKLFIRGWG--RPEDLKRLFE--FRKIIGNREKCQNLISRDYPVFIDKVGEHT 64
Query: 76 TAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ER 134
+ +G F +P + +P ES AR + PK + +HLAGTGDH + R
Sbjct: 65 DCKILDGHFVSPMAHYVPGIMPTESVIARFQLIVPKEWKSKHRPVCIHLAGTGDHYYWRR 124
Query: 135 RLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
R + P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHW
Sbjct: 125 RTLMARPMIKEAGMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHW 184
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
LE E GFG +G+ G+SMGG A++ + P P+ +P LS +A F G+L W
Sbjct: 185 LERE-GFGPLGLTGISMGGHMASLAVTNWPKPLPLVPCLSWSTASGVFTTGVLSKSINWR 243
Query: 254 ALREELAAKKV 264
L ++ + V
Sbjct: 244 ELDKQYYTQTV 254
>gi|440898181|gb|ELR49732.1| hypothetical protein M91_01156 [Bos grunniens mutus]
Length = 464
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 86/252 (34%), Positives = 130/252 (51%), Gaps = 12/252 (4%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEG-----QNWPPSLIQPIWRTIWET 74
++R + F RGWG + E L+RL + F +I G QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFE--FRKIIGNRERCQNLVSSDYPVYIDKIEEQ 63
Query: 75 QTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-E 133
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH +
Sbjct: 64 SDCKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWR 123
Query: 134 RRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
RR + P++KE +A+++LE+P+Y +P Q + L VSDL ++G A + E+ LLH
Sbjct: 124 RRTLMARPMIKEARMASLLLENPYYILLKPKDQIRSSLKNVSDLFVMGGALVLESAALLH 183
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAW 252
WLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 184 WLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINW 242
Query: 253 EALREELAAKKV 264
L ++ + V
Sbjct: 243 RELEKQYYTQTV 254
>gi|328782234|ref|XP_624391.3| PREDICTED: uncharacterized protein C4orf29 homolog [Apis mellifera]
Length = 372
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/261 (32%), Positives = 138/261 (52%), Gaps = 20/261 (7%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGG----SKLELLERLI--KQLFPEIEGQNWPPSLIQP 66
LD VY R+ + FF++GWG ++ RL+ + ++ +++P ++ +
Sbjct: 6 LDAVY-----RSILLTKFFTKGWGSPQNLKRIFEFRRLLANRDTCYKLIPRDYPINITKD 60
Query: 67 IWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLA 125
E + EG F +P ++ L +P E+ A + P+ K+ + +HLA
Sbjct: 61 E-----EWSDCHIIEGCFESPFNKHLPQIMPYETITANFQLVLPRKWYSHKIKPICLHLA 115
Query: 126 GTGDHTFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRAT 183
GTGDH F RR L PLLKE+ IA+++LE+PFYG R+P Q + L V D+ ++G
Sbjct: 116 GTGDHFFWRRRNLIAKPLLKESGIASLLLENPFYGTRKPQNQIRSCLHNVCDIFVMGGCL 175
Query: 184 IEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCE 243
I E+ LL+W E + GFG +G+ GLSMGG A++ + P P+ +P LS +A F +
Sbjct: 176 IMESIVLLNWCE-QQGFGPLGLTGLSMGGHMASLAATNWPKPIPLIPCLSWSTASPVFTQ 234
Query: 244 GILKHGTAWEALREELAAKKV 264
G++ W L + A ++
Sbjct: 235 GVMSASINWTLLENQYFANEL 255
>gi|324512972|gb|ADY45354.1| Unknown [Ascaris suum]
Length = 380
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 109/194 (56%), Gaps = 9/194 (4%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKM--ACVVHLAGTGDHTFERR-LR 137
EG F++P + + PE + + + + P+K+ A V+HLAGTGDH+F RR
Sbjct: 77 EGCFKSP-----YAWVFPEMMPDNIGWATWRGIFPKKLRRALVIHLAGTGDHSFFRREWG 131
Query: 138 LGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWE 197
LLK+ +++++LE+PFYG R+P Q + L VSDL ++G A + E LL W + +
Sbjct: 132 FANNLLKQGVSSILLENPFYGSRKPKNQFRSSLNNVSDLFVMGGALMAECNFLLKWAK-Q 190
Query: 198 AGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALRE 257
G+ +G+CG+SMGG ++ + P PVA +P LS +A F EG L WE L
Sbjct: 191 MGYCPLGLCGVSMGGHMVSLACTNSPDPVAVVPCLSWTTAAPVFVEGALSGAIPWETLTM 250
Query: 258 ELAAKKVAMTLEEV 271
E +KK + ++
Sbjct: 251 EFRSKKFQSAINQI 264
>gi|332024686|gb|EGI64879.1| Uncharacterized protein C4orf29-like protein [Acromyrmex
echinatior]
Length = 482
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/254 (33%), Positives = 134/254 (52%), Gaps = 18/254 (7%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIW 72
LD VY R+ + FF++GWG + L R+ + F ++ + PI I
Sbjct: 6 LDAVY-----RSILLTKFFTKGWGNP--QNLRRIFE--FRKVIANRKTCYNMIPINYPIE 56
Query: 73 ETQTAV-----LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAG 126
T+ V + EG F +P ++ L +P ++ A + P K+ + +HLAG
Sbjct: 57 ITKDEVWSDCHIIEGRFESPFEKHLPGIMPDKTKMAYFQMVLPSKWNSHKIKPICLHLAG 116
Query: 127 TGDHTFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATI 184
TGDH F RR L PLLKE+ IA+++LE+PFYG R+P Q+ + L V D+ ++G I
Sbjct: 117 TGDHFFWRRRNLIAKPLLKESGIASILLENPFYGLRKPDDQKRSSLHNVCDIFIMGGCLI 176
Query: 185 EEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEG 244
E+ LL+W E + GFG + + GLSMGG A++ + P P++ +P LS +A F EG
Sbjct: 177 MESIVLLNWCE-QHGFGPLALTGLSMGGHMASLAATNWPKPISLVPCLSWSTASPVFTEG 235
Query: 245 ILKHGTAWEALREE 258
++ W L +
Sbjct: 236 VMSASINWALLETQ 249
>gi|449278211|gb|EMC86145.1| Putative protein C4orf29 like protein [Columba livia]
Length = 464
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/249 (33%), Positives = 131/249 (52%), Gaps = 8/249 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQTAV 78
++R + F RGWG K E L+R+ +++ E S P++ E Q+
Sbjct: 9 LYRRLLLTKLFIRGWG--KPEDLKRIFEFRKIIGNREKCQTLVSKDYPVFIDKVEEQSDC 66
Query: 79 -LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRL 136
+ EG F +P + LP ES AR F+ P+ + +HLAGTGDH F RR
Sbjct: 67 KILEGHFISPLAHYVPGILPVESLVARFQFIIPRRWNSKHRPVCIHLAGTGDHHFWRRRT 126
Query: 137 RLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +A+++LE+P+Y +P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 127 LMARPMIKEACMASLLLENPYYILLKPKDQLRSCLKNVSDLFVMGGALVLESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
E G+G +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L
Sbjct: 187 RE-GYGPLGMTGISMGGHMASLAVTNWPKPLPLIPCLSWSTASAVFTTGVLSKAVNWREL 245
Query: 256 REELAAKKV 264
++ + V
Sbjct: 246 EKQYYTQTV 254
>gi|380020226|ref|XP_003693992.1| PREDICTED: uncharacterized protein C4orf29 homolog [Apis florea]
Length = 477
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/261 (32%), Positives = 138/261 (52%), Gaps = 20/261 (7%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGG----SKLELLERLI--KQLFPEIEGQNWPPSLIQP 66
LD VY R+ + FF++GWG ++ RL+ + ++ +++P ++ +
Sbjct: 6 LDAVY-----RSILLTKFFTKGWGSPQNLKRIFEFRRLLANRDTCYKLIPRDYPINITKD 60
Query: 67 IWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLA 125
E + EG F +P ++ L +P E+ A + P+ K+ + +HLA
Sbjct: 61 E-----EWSDCHIIEGRFESPFNKHLPQIMPYETITANFQLVLPRKWYSHKIKPICLHLA 115
Query: 126 GTGDHTFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRAT 183
GTGDH F RR L PLLKE+ IA+++LE+PFYG R+P Q + L V D+ ++G
Sbjct: 116 GTGDHFFWRRRNLIAKPLLKESGIASLLLENPFYGTRKPQNQIRSCLHNVCDIFVMGGCL 175
Query: 184 IEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCE 243
I E+ LL+W E + GFG +G+ GLSMGG A++ + P P+ +P LS +A F +
Sbjct: 176 IMESIVLLNWCE-QQGFGPLGLTGLSMGGHMASLAATNWPKPIPLIPCLSWSTASPVFTQ 234
Query: 244 GILKHGTAWEALREELAAKKV 264
G++ W L + A ++
Sbjct: 235 GVMSASINWTLLENQYFANEL 255
>gi|312381415|gb|EFR27171.1| hypothetical protein AND_06288 [Anopheles darlingi]
Length = 1653
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 111/206 (53%), Gaps = 10/206 (4%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGTGDHTF-ERRLRL 138
EG F TP + + +P + A + P K + +HLAGTGDH F +RR +
Sbjct: 868 EGKFITPLEIYMPGLVPDVAQQAHFQVVLPLKWNDERYKPICIHLAGTGDHYFWKRRNMI 927
Query: 139 GGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWE 197
PLLKE N+ ++LE+PFYG R+P QR + L VSD+ ++G + E+ LL+W E
Sbjct: 928 AKPLLKEANLGAIILENPFYGLRKPKEQRASSLQNVSDIFVMGGCLVLESLVLLNWCE-R 986
Query: 198 AGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALRE 257
G G +G+ GLSMGG A++ + P P+ +P LS +A F EG++ H W+ L
Sbjct: 987 NGLGPLGITGLSMGGHMASLAATNWPKPLVLVPCLSWSTASSVFTEGVMSHSINWDVLET 1046
Query: 258 ELAAKKVAMTLEEVRERMRNVLSLTD 283
+ A RER+ ++++ D
Sbjct: 1047 QYFADG------NFRERLSKMVTVVD 1066
>gi|307204724|gb|EFN83305.1| Uncharacterized protein C4orf29-like protein [Harpegnathos
saltator]
Length = 482
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/257 (34%), Positives = 133/257 (51%), Gaps = 12/257 (4%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGG-SKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTI 71
LD VY R+ + FF++GWG L+ + K + N P L PI T
Sbjct: 6 LDAVY-----RSILLTKFFTKGWGNPQNLKRIFEFRKVIANRETCYNMIP-LDYPIEITK 59
Query: 72 WETQTAV-LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGD 129
E + + EG F +P ++ L +P ++ A + P K+ + +HLAGTGD
Sbjct: 60 SEDWSDCHVIEGRFESPFEKHLPGIMPDKTKMAYFQVILPSKWNSHKIKPICLHLAGTGD 119
Query: 130 HTFERRLRL-GGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEA 187
H F RR L PLLKE IA+++LE+PFYG R+P Q + L V D+ ++G I E+
Sbjct: 120 HFFWRRRNLIARPLLKEAGIASILLENPFYGLRKPDDQIRSSLHNVCDIFIMGGCLIMES 179
Query: 188 RCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILK 247
LL+W E + GF +G+ GLSMGG A++ + P P++ +P LS +A F EG++
Sbjct: 180 IVLLNWCE-QQGFRPLGLTGLSMGGHMASLAATNWPKPISLVPCLSWSTASPVFTEGVMS 238
Query: 248 HGTAWEALREELAAKKV 264
W L + + KV
Sbjct: 239 ASINWALLETQYFSDKV 255
>gi|158563777|sp|Q4V7A8.2|CD029_RAT RecName: Full=Uncharacterized protein C4orf29 homolog; Flags:
Precursor
Length = 464
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/250 (33%), Positives = 127/250 (50%), Gaps = 8/250 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPGIMPIESVIARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+Y +P Q + L VSDL ++G A I E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYILLKPKDQVRSSLKNVSDLFVMGGALILESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRE 244
Query: 255 LREELAAKKV 264
L ++ + V
Sbjct: 245 LEKQYYTQTV 254
>gi|320165554|gb|EFW42453.1| CD029 protein [Capsaspora owczarzaki ATCC 30864]
Length = 533
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 90/141 (63%), Gaps = 3/141 (2%)
Query: 121 VVHLAGTGDHTF-ERRLRLGGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLL 178
++ LAGTGDH F RR L PLL ++ I +++LE+PFYG R+P Q + LL V+DL L
Sbjct: 255 LIQLAGTGDHYFWRRRHLLAKPLLHDSGIGSIILENPFYGLRKPAYQWRSSLLHVTDLFL 314
Query: 179 LGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAV 238
+G A I E LL W E E+G+G++G+ G SMGG ++ S +P P+A +P LS +A
Sbjct: 315 MGVALILETTVLLRWCE-ESGYGQLGMQGFSMGGHMTSLAASAYPKPLAIIPCLSASTAS 373
Query: 239 VAFCEGILKHGTAWEALREEL 259
F +G++ AW AL E+L
Sbjct: 374 AVFADGVMSSACAWPALTEQL 394
>gi|312073986|ref|XP_003139767.1| hypothetical protein LOAG_04182 [Loa loa]
Length = 355
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 93/158 (58%), Gaps = 2/158 (1%)
Query: 117 KMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSD 175
K A V+HLAGTGDHT F R L+K NI++++L++PFYG R+P Q + L+ VSD
Sbjct: 87 KHALVIHLAGTGDHTYFRREFGFANDLMKNNISSILLQNPFYGSRKPRDQFRSSLINVSD 146
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPH 235
L ++G A + E LL W E G+ +G+ G+SMGG A + + P P+A +P LS
Sbjct: 147 LFIMGGALVAECNFLLKWAR-EQGYWPVGLAGVSMGGHMACLACTNSPEPIALVPCLSWT 205
Query: 236 SAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRE 273
+A F +G L +W+ L EL +K+ + ++ E
Sbjct: 206 TASTVFVQGTLSKSVSWDVLTMELLSKQFQNGIRQIPE 243
>gi|393910173|gb|EFO24303.2| hypothetical protein LOAG_04182 [Loa loa]
Length = 333
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 93/158 (58%), Gaps = 2/158 (1%)
Query: 117 KMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSD 175
K A V+HLAGTGDHT F R L+K NI++++L++PFYG R+P Q + L+ VSD
Sbjct: 65 KHALVIHLAGTGDHTYFRREFGFANDLMKNNISSILLQNPFYGSRKPRDQFRSSLINVSD 124
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPH 235
L ++G A + E LL W E G+ +G+ G+SMGG A + + P P+A +P LS
Sbjct: 125 LFIMGGALVAECNFLLKWAR-EQGYWPVGLAGVSMGGHMACLACTNSPEPIALVPCLSWT 183
Query: 236 SAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRE 273
+A F +G L +W+ L EL +K+ + ++ E
Sbjct: 184 TASTVFVQGTLSKSVSWDVLTMELLSKQFQNGIRQIPE 221
>gi|148703205|gb|EDL35152.1| mCG125094 [Mus musculus]
Length = 464
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/250 (33%), Positives = 127/250 (50%), Gaps = 8/250 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPGIMPIESVVARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+Y +P Q + L VSDL ++G A I E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYILLKPKDQVRSSLKNVSDLFVMGGALILESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEA 254
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRE 244
Query: 255 LREELAAKKV 264
L ++ + V
Sbjct: 245 LEKQYYTQTV 254
>gi|427779375|gb|JAA55139.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 419
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 111/215 (51%), Gaps = 33/215 (15%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAPK--CVPPQKMACVVHLAGTGDHTFERR--- 135
EG +P + L +P ESH A L PK P + C+ HLAGTGDH F RR
Sbjct: 32 EGHLVSPLVQYLPECVPKESHKAWFQVLLPKKWVTEPLRPLCI-HLAGTGDHYFWRRRTL 90
Query: 136 -----LRLGG--------------------PLLKEN-IATMVLESPFYGQRRPLLQRGAK 169
L+ G PLLKEN +A+++LE+PFYG R+P Q +
Sbjct: 91 TCRPLLKENGVASIILENPFYILFXTLTCRPLLKENGVASIILENPFYGLRKPKDQVRSN 150
Query: 170 LLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATL 229
L CVSD+ ++G + E+ LLHW E E GFG +G+ G+SMGG A++ G+ P+ +
Sbjct: 151 LHCVSDIFVMGGCLVLESMALLHWCERE-GFGPLGITGISMGGHMASLAGANWYKPIGII 209
Query: 230 PFLSPHSAVVAFCEGILKHGTAWEALREELAAKKV 264
P LS +A F +G++ WE L+ + + V
Sbjct: 210 PCLSWTTASCVFTQGVMSGAIPWELLQSQYFSDHV 244
>gi|393910174|gb|EJD75771.1| hypothetical protein, variant [Loa loa]
Length = 309
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 93/158 (58%), Gaps = 2/158 (1%)
Query: 117 KMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSD 175
K A V+HLAGTGDHT F R L+K NI++++L++PFYG R+P Q + L+ VSD
Sbjct: 41 KHALVIHLAGTGDHTYFRREFGFANDLMKNNISSILLQNPFYGSRKPRDQFRSSLINVSD 100
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPH 235
L ++G A + E LL W E G+ +G+ G+SMGG A + + P P+A +P LS
Sbjct: 101 LFIMGGALVAECNFLLKWAR-EQGYWPVGLAGVSMGGHMACLACTNSPEPIALVPCLSWT 159
Query: 236 SAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRE 273
+A F +G L +W+ L EL +K+ + ++ E
Sbjct: 160 TASTVFVQGTLSKSVSWDVLTMELLSKQFQNGIRQIPE 197
>gi|441618033|ref|XP_003264696.2| PREDICTED: uncharacterized protein C4orf29 homolog isoform 2
[Nomascus leucogenys]
Length = 479
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 130/265 (49%), Gaps = 23/265 (8%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARV---------------AFLAPKCVPPQKMACV 121
+ +G F +P + +P ES AR+ F+ PK +
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPVESVIARIIDEMLDTNLILLPLFQFIVPKEWNSKYRPVC 125
Query: 122 VHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLL 179
+HLAGTGDH + RR + P++KE +A+++LE+P+YG R+P Q + L VSDL ++
Sbjct: 126 IHLAGTGDHHYWRRRTLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVM 185
Query: 180 GRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVV 239
G A + E+ LLHWLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A
Sbjct: 186 GGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASG 244
Query: 240 AFCEGILKHGTAWEALREELAAKKV 264
F G+L W L ++ + V
Sbjct: 245 VFTTGVLSKSINWRELEKQYYTQTV 269
>gi|350397459|ref|XP_003484884.1| PREDICTED: uncharacterized protein C4orf29 homolog [Bombus
impatiens]
Length = 477
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 84/255 (32%), Positives = 134/255 (52%), Gaps = 20/255 (7%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGG----SKLELLERLI--KQLFPEIEGQNWPPSLIQP 66
LD VY R+ + FF++GWG ++ RL+ ++ ++ +++P ++ +
Sbjct: 6 LDAVY-----RSILLTKFFTKGWGSPHNLKRIFEFRRLLANRETCYKLIPRDYPVNITKD 60
Query: 67 IWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAF-LAPKCVPPQKMACVVHLA 125
E + EG F +P ++ L +P E+ A L PK + +HLA
Sbjct: 61 E-----EWSDCHIIEGCFESPFNKHLPDIMPYETITAHFQLVLPPKWHSHKVKPICLHLA 115
Query: 126 GTGDHTFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRAT 183
GTGDH F RR L PLLKE+ IA+++LE+PFYG R+P Q + L V D+ ++G
Sbjct: 116 GTGDHFFWRRRNLIAKPLLKESGIASLLLENPFYGSRKPQNQIRSCLHNVCDIFIMGGCL 175
Query: 184 IEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCE 243
I E+ LL+W E + GFG +G+ GLSMGG A++ + P P+ +P LS +A F +
Sbjct: 176 IMESIVLLNWCE-QQGFGPLGLTGLSMGGHMASLAATNWPKPIPLIPCLSWSTASPVFTQ 234
Query: 244 GILKHGTAWEALREE 258
G++ W L +
Sbjct: 235 GVMSASINWTLLENQ 249
>gi|383855718|ref|XP_003703357.1| PREDICTED: uncharacterized protein C4orf29 homolog [Megachile
rotundata]
Length = 477
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 138/263 (52%), Gaps = 24/263 (9%)
Query: 13 LDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEG------QNWPPSLI 64
LD VY R+ + FF++GWG E L+R+ +++ E +++P ++
Sbjct: 6 LDAVY-----RSILLTKFFTKGWGSP--ENLKRIFEFRKVIANREACYNLIPRDYPINIS 58
Query: 65 QPIWRTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACV-VH 123
+ E + EG F +P + L +P E+ A + P+ K+ + +H
Sbjct: 59 KDE-----EWSDCHIIEGSFESPFHKHLPGIMPRETITAHFQLVLPRRWSSHKVKPICLH 113
Query: 124 LAGTGDHTFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGR 181
LAGTGDH F RR L PLLKE+ IA+++LE+PFYG R+P Q + L V D+ ++G
Sbjct: 114 LAGTGDHYFWRRRNLVAKPLLKESGIASLLLENPFYGSRKPENQIRSSLHNVCDIFIMGG 173
Query: 182 ATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
I E+ LL+W E + GF +G+ GLSMGG A++ + P P+ +P LS +A F
Sbjct: 174 CLIMESIVLLNWCE-QQGFAPLGLTGLSMGGHMASLAATNWPKPIPLVPCLSWSTASPVF 232
Query: 242 CEGILKHGTAWEALREELAAKKV 264
+G++ W L ++ A ++
Sbjct: 233 TQGVMSASINWTLLEDQYFANEL 255
>gi|24663415|ref|NP_729821.1| CG32112, isoform C [Drosophila melanogaster]
gi|281366154|ref|NP_729819.3| CG32112, isoform D [Drosophila melanogaster]
gi|23093576|gb|AAN11853.1| CG32112, isoform C [Drosophila melanogaster]
gi|272455182|gb|AAN11851.3| CG32112, isoform D [Drosophila melanogaster]
Length = 435
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 103/191 (53%), Gaps = 10/191 (5%)
Query: 96 LPPESHNARVAFLAPKCVPPQKMACV-VHLAGTGDHTFERRLR-LGGPLLKE-NIATMVL 152
+P ES A L P +K + +HLAGTGDH F RR + PLLK+ NI +++L
Sbjct: 9 VPEESQQAHFQLLIPNKWKNEKHKPICIHLAGTGDHFFWRRRNFIAKPLLKDANIGSIIL 68
Query: 153 ESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGG 212
E+PFYG R+P Q + L VSD+ ++G I E L HW E GFG +GV GLSMGG
Sbjct: 69 ENPFYGLRKPNNQTRSNLHNVSDIFVMGGCLILECLVLFHWCE-RNGFGPLGVTGLSMGG 127
Query: 213 VHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVR 272
A++ + P P+ +P LS +A F G++ W+ L + + + R
Sbjct: 128 HMASLAATNWPKPLVLVPCLSWSTASAVFTTGVMSQSINWDMLETQYFSDG------QYR 181
Query: 273 ERMRNVLSLTD 283
ER+ ++++ D
Sbjct: 182 ERLSKMVTVID 192
>gi|402588811|gb|EJW82744.1| hypothetical protein WUBG_06344 [Wuchereria bancrofti]
Length = 290
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 93/158 (58%), Gaps = 2/158 (1%)
Query: 117 KMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSD 175
K A V+HLAGTGDHT F R L+K NI++++L++PFYG R+P Q + L+ VSD
Sbjct: 107 KHALVIHLAGTGDHTYFRREFGFANDLMKSNISSILLQNPFYGSRKPRDQFRSSLINVSD 166
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPH 235
L ++G A + E LL W + G+ +G+ G+SMGG A + + P P+A +P LS
Sbjct: 167 LFIMGGALVAECNFLLKWAR-QQGYWPVGLAGVSMGGHMACLACTNSPEPIALVPCLSWT 225
Query: 236 SAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRE 273
+A F +G L +W+ L EL +K+ + E+ E
Sbjct: 226 TASTVFVQGTLSKSVSWDVLTMELLSKQFQDGIREIPE 263
>gi|358342729|dbj|GAA50193.1| hypothetical protein CLF_104212, partial [Clonorchis sinensis]
Length = 406
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 95/159 (59%), Gaps = 2/159 (1%)
Query: 107 FLAPKCVPPQKMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQRRPLLQ 165
+ PK P+ + LAGTGDHT + RRL L LL++ IA++++ +PFY +R+P Q
Sbjct: 3 LILPKEWIPKYRPICLQLAGTGDHTYYRRRLFLANRLLEDGIASIIIMNPFYSKRKPRDQ 62
Query: 166 RGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTP 225
RG+ L VSDL ++G A I E LL W E G+G + + G+SMGG A++ ++ P P
Sbjct: 63 RGSCLNYVSDLFIMGGALITECATLLRWCE-TNGYGPVAIHGISMGGYMASLCATVWPKP 121
Query: 226 VATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKV 264
++ +P LS +A V F +GI+ W+ L ++ A V
Sbjct: 122 ISLIPCLSWTTASVVFVDGIMAGAVDWDTLTKQYFADSV 160
>gi|170583988|ref|XP_001896811.1| CG32112-PA [Brugia malayi]
gi|158595854|gb|EDP34337.1| CG32112-PA, putative [Brugia malayi]
Length = 328
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 92/156 (58%), Gaps = 2/156 (1%)
Query: 117 KMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSD 175
K A V+HLAGTGDHT F R L+K NI++++L++PFYG R+P Q + L+ VSD
Sbjct: 59 KHALVIHLAGTGDHTYFRREFGFANDLMKSNISSILLQNPFYGSRKPRDQFRSSLINVSD 118
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPH 235
L ++G A + E LL W + G+ +G+ G+SMGG A + + P P+A +P LS
Sbjct: 119 LFIMGGALVAECNFLLKWAR-QQGYWPVGLAGVSMGGHMACLACTNSPEPIALVPCLSWT 177
Query: 236 SAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEV 271
+A F +G L +W+ L EL +K+ + E+
Sbjct: 178 TASTVFVQGTLSKSVSWDVLTMELLSKQFQDGIREI 213
>gi|410038680|ref|XP_003310503.2| PREDICTED: uncharacterized protein C4orf29 homolog [Pan
troglodytes]
Length = 479
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 129/265 (48%), Gaps = 23/265 (8%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARV---------------AFLAPKCVPPQKMACV 121
+ +G F +P + +P ES AR F+ PK +
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARTIDEMLDTNLILLPLFQFIVPKEWNSKYRPVC 125
Query: 122 VHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLL 179
+HLAGTGDH + RR + P++KE +A+++LE+P+YG R+P Q + L VSDL ++
Sbjct: 126 IHLAGTGDHHYWRRRTLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVM 185
Query: 180 GRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVV 239
G A + E+ LLHWLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A
Sbjct: 186 GGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASG 244
Query: 240 AFCEGILKHGTAWEALREELAAKKV 264
F G+L W L ++ + V
Sbjct: 245 VFTTGVLSKSINWRELEKQYYTQTV 269
>gi|344277389|ref|XP_003410484.1| PREDICTED: uncharacterized protein C4orf29-like [Loxodonta
africana]
Length = 414
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 123/226 (54%), Gaps = 8/226 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI--KQLFPEIEGQNWPPSLIQPIW-RTIWETQTA 77
++R + F RGWG + E L+RL ++L E S P++ I E
Sbjct: 9 LYRRLLLTKLFIRGWG--RPEDLKRLFEFRKLIGNRERCQHLVSSDYPVYIDKIEEQSDC 66
Query: 78 VLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLR 137
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 67 KILDGHFVSPMAHYVPDIMPVESVTARFQFIVPKEWNSKYRPVCIHLAGTGDHHYGRRRT 126
Query: 138 L-GGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
L P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 127 LMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 187 RE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|194208459|ref|XP_001915854.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
C4orf29-like [Equus caballus]
Length = 414
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 123/229 (53%), Gaps = 12/229 (5%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEG-----QNWPPSLIQPIWRTIWET 74
++R + F RGWG + E L+RL + F ++ G QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFE--FRKVIGNRERCQNLVSSDYPVYIDKIEEQ 63
Query: 75 QTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-E 133
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH +
Sbjct: 64 SDCKILDGHFVSPMAHYVPDIMPTESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWR 123
Query: 134 RRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
RR + P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLH
Sbjct: 124 RRTLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLH 183
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
WLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 184 WLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|426247065|ref|XP_004017307.1| PREDICTED: uncharacterized protein C4orf29 homolog isoform 2 [Ovis
aries]
Length = 414
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 123/229 (53%), Gaps = 12/229 (5%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEG-----QNWPPSLIQPIWRTIWET 74
++R + F RGWG + E L+RL + F +I G QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFE--FRKIIGNRERCQNLVSSDYPVYIDKIEEQ 63
Query: 75 QTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-E 133
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH +
Sbjct: 64 SDCKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWR 123
Query: 134 RRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
RR + P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLH
Sbjct: 124 RRTLMARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLH 183
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
WLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 184 WLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|311262570|ref|XP_003129246.1| PREDICTED: uncharacterized protein C4orf29-like [Sus scrofa]
Length = 413
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 120/227 (52%), Gaps = 8/227 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVYIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPVESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|397505280|ref|XP_003823197.1| PREDICTED: uncharacterized protein C4orf29 homolog, partial [Pan
paniscus]
Length = 600
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 129/265 (48%), Gaps = 23/265 (8%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 129 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 186
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARV---------------AFLAPKCVPPQKMACV 121
+ +G F +P + +P ES AR F+ PK +
Sbjct: 187 CKILDGHFVSPMAHYVPDIMPIESVIARTIDEMLDTNLILLPLFQFIVPKEWNSKYRPVC 246
Query: 122 VHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLL 179
+HLAGTGDH + RR + P++KE +A+++LE+P+YG R+P Q + L VSDL ++
Sbjct: 247 IHLAGTGDHHYWRRRTLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVM 306
Query: 180 GRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVV 239
G A + E+ LLHWLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A
Sbjct: 307 GGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASG 365
Query: 240 AFCEGILKHGTAWEALREELAAKKV 264
F G+L W L ++ + V
Sbjct: 366 VFTTGVLSKSINWRELEKQYYTQTV 390
>gi|89363030|ref|NP_001034806.1| uncharacterized protein C4orf29 precursor [Homo sapiens]
gi|121940364|sp|Q0P651.1|CD029_HUMAN RecName: Full=Uncharacterized protein C4orf29; Flags: Precursor
gi|112180775|gb|AAH34253.1| Chromosome 4 open reading frame 29 [Homo sapiens]
Length = 414
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 120/227 (52%), Gaps = 8/227 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|390460417|ref|XP_003732481.1| PREDICTED: uncharacterized protein C4orf29 homolog isoform 2
[Callithrix jacchus]
Length = 414
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 120/227 (52%), Gaps = 8/227 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESIIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|159487231|ref|XP_001701637.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280856|gb|EDP06612.1| predicted protein [Chlamydomonas reinhardtii]
Length = 281
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 130/320 (40%), Gaps = 110/320 (34%)
Query: 12 VLDHVYGAFMHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTI 71
LDH+Y ++ PFF RGWG + + ++ L + G PP+ I+ WR +
Sbjct: 42 ALDHMYARI---GALNGPFFPRGWGNLSVVNYDEDLRHL---VAG---PPAAIRLAWRLV 92
Query: 72 ----WETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGT 127
+ +L EG FRTPC +++ ALPPES R
Sbjct: 93 ERGSRDGVDYMLYEGSFRTPCLQRVYDALPPESRTGR----------------------- 129
Query: 128 GDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEA 187
NI T+VLESPFYG RRP QRG+KLL VSDLL LG ATI E+
Sbjct: 130 ------------------NICTLVLESPFYGSRRPAAQRGSKLLRVSDLLTLGWATIAES 171
Query: 188 RCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILK 247
LLHWL E A VA+C+G ++
Sbjct: 172 INLLHWLREEG--------------------------------------AAVAYCDGAMR 193
Query: 248 HGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA------ 301
A A L + L+ V E + TDVTR+P P+ +A + VAA
Sbjct: 194 ---ALRAGDRRLDQPDTVLRLKRVLE------TYTDVTRYPKPRRTDAAVIVAARDDAYV 244
Query: 302 ---TVSTVFDYHHEEVLKMD 318
+V + Y L+MD
Sbjct: 245 SRESVQQLHQYWAGSELRMD 264
>gi|395845748|ref|XP_003795586.1| PREDICTED: uncharacterized protein C4orf29 homolog isoform 2
[Otolemur garnettii]
Length = 414
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 122/228 (53%), Gaps = 12/228 (5%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEG-----QNWPPSLIQPIWRTIWETQ 75
++R + F RGWG + E L+RL + F ++ G QN I E
Sbjct: 9 LYRRLLLTKLFIRGWG--RPEDLKRLFE--FRKVIGNRERCQNLVSKDYPVYIDKIEEQS 64
Query: 76 TAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ER 134
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + R
Sbjct: 65 DCKILDGHFVSPMAHYVPDIMPVESVIARFQFIVPKEWNGKYRPVCIHLAGTGDHYYWRR 124
Query: 135 RLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
R + P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHW
Sbjct: 125 RTLMARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHW 184
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
LE E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 185 LERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|449684676|ref|XP_002162928.2| PREDICTED: uncharacterized protein C4orf29 homolog [Hydra
magnipapillata]
Length = 442
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/251 (31%), Positives = 127/251 (50%), Gaps = 25/251 (9%)
Query: 24 TKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLR--- 80
T FFS+GWG +ELL+++ K + + ++ I + +T+ + +
Sbjct: 15 TNYVSKFFSQGWGD--IELLKKIAK-----FQASVLDKNQLRDIHSILEDTEIKLFKVVE 67
Query: 81 ----------EGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDH 130
G F +P + L ALP +S + P K A +HLAGTGDH
Sbjct: 68 DKNDSSVNIFRGQFVSPLTKLLPHALPLKSEIVNFEVVMPSG--DNKPAMCIHLAGTGDH 125
Query: 131 TFE-RRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEAR 188
F R+ + PL E I +++LE+P+YGQR+P Q + + VSD+ ++G A + E+
Sbjct: 126 HFWWRKKSMAIPLANEYKIGSILLENPYYGQRKPKNQVRSSVNYVSDIFVMGCALLVESI 185
Query: 189 CLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKH 248
L W + + GFG +G+ G+SMGG A + S P+A +P LS SA + +G+L
Sbjct: 186 TLFLWCQ-KNGFGPLGITGISMGGHMATVASSGWNKPLAIVPCLSWTSAAPIYTQGVLYG 244
Query: 249 GTAWEALREEL 259
G W+ L ++L
Sbjct: 245 GVHWKILEDQL 255
>gi|405958381|gb|EKC24514.1| B-box type zinc finger protein ncl-1 [Crassostrea gigas]
Length = 1216
Score = 113 bits (283), Expect = 1e-22, Method: Composition-based stats.
Identities = 63/179 (35%), Positives = 100/179 (55%), Gaps = 5/179 (2%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAP-KCVPPQKMACVVHLAGTGDHTFERRLRL- 138
EG F +P D+ + +P +R + P + P + C+ H GTGDH + R RL
Sbjct: 839 EGEFHSPWDKIIPGVMPSVVKKSRFQMIIPNRWQGPSRPVCI-HHGGTGDHGYLIRRRLM 897
Query: 139 GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWE 197
PLL ++ I ++++ESPFYG R+P Q + L VSDL+++G A + E LLHW E E
Sbjct: 898 AEPLLNDHGIGSIIIESPFYGSRKPKDQFRSSLQNVSDLIVMGGALMFETVVLLHWCE-E 956
Query: 198 AGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALR 256
G+G + G+SMGG +++ ++ P P+ +P L+ +A + GIL W+ L+
Sbjct: 957 EGWGPFCLTGISMGGFMSSLAATIWPKPIGLVPCLAGVTASPVYTRGILTKAVRWDVLK 1015
>gi|345784044|ref|XP_850878.2| PREDICTED: uncharacterized protein C4orf29 [Canis lupus familiaris]
Length = 414
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/227 (35%), Positives = 119/227 (52%), Gaps = 8/227 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR + PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQLIVPKEWNSKYKPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESTALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|410956888|ref|XP_003985068.1| PREDICTED: uncharacterized protein C4orf29 homolog [Felis catus]
Length = 414
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/227 (35%), Positives = 119/227 (52%), Gaps = 8/227 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR + PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQLIVPKEWNSKYKPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|301773646|ref|XP_002922241.1| PREDICTED: uncharacterized protein C4orf29-like [Ailuropoda
melanoleuca]
Length = 414
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 119/227 (52%), Gaps = 8/227 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR + PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQLIVPKEWNSKYKPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|76156195|gb|AAX27424.2| SJCHGC04095 protein [Schistosoma japonicum]
Length = 208
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 108/206 (52%), Gaps = 10/206 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLR 80
++RT + FFS+GWG + L +LI+ + + L++ E ++ ++
Sbjct: 7 LYRTMLPLKFFSKGWGAP--DTLLKLIENMKTVTNRDRF--CLLKTKTNISIEKKSETMK 62
Query: 81 ----EGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHT-FERR 135
EG F +P D + + L + AR + PK VH AGTGDH F RR
Sbjct: 63 TIEIEGSFLSPFDSVIPNVLTGNNKIARFQMIIPKVWSTNYRPICVHFAGTGDHNYFRRR 122
Query: 136 LRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
L L+ + +A++++ +PFY R+P QRG+ L VSDLL++G A I E LL W E
Sbjct: 123 FLLANRLVDDGVASLIIMNPFYATRKPKEQRGSGLNFVSDLLIMGGALIMECSALLEWCE 182
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSL 221
+G+G + + G+SMGG +A+ ++
Sbjct: 183 -NSGYGPLALHGISMGGYMSALCATV 207
>gi|351694779|gb|EHA97697.1| hypothetical protein GW7_10902 [Heterocephalus glaber]
Length = 425
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 118/226 (52%), Gaps = 8/226 (3%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQTA 77
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 9 LYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVYIDKIEEQSDC 66
Query: 78 VLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRL 136
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 67 KILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRRT 126
Query: 137 RLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +A+++LE+P+Y +P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 127 LMARPMIKEARMASLLLENPYYILLKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLE 186
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 187 RE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|391331196|ref|XP_003740036.1| PREDICTED: uncharacterized protein C4orf29 homolog [Metaseiulus
occidentalis]
Length = 439
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 135/288 (46%), Gaps = 29/288 (10%)
Query: 23 RTKISPPFFSRGWGG----SKLELLERLI--KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
RT + +F +GWG K+ +++ ++ ++ N+P + + + +
Sbjct: 14 RTYLLNQYFVKGWGDPATIHKICQFRKVVGNREKCTQLVDDNYPIHIAKEEDKGAYR--- 70
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVP-PQKMACVVHLAGTGDHTFERR 135
L EG F +P L +P ESH A L P P+ + LAGTGD F RR
Sbjct: 71 --LLEGHFTSPLVHYLPDVIPEESHKAYFEMLIPNNWKHPRLKPVCLQLAGTGDQKFWRR 128
Query: 136 LRL-GGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
L PLLKE I +++LE+P+YG R+P Q L VSD+ ++G + E+ LL W
Sbjct: 129 RTLVAKPLLKEFGIGSILLENPYYGFRKPKEQLRTVLHNVSDVFVMGGCLVLESIALLKW 188
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E G+G + + G+SMGG A++ G PV P LS +A AF +G++ WE
Sbjct: 189 CE-RQGYGPLALTGISMGGHMASLAGGSFDKPVGIAPCLSWTTASCAFTQGVMSGAIPWE 247
Query: 254 ALREELAAKKVAMTLEEVRERMRNVLSLTDVTRFPIPKIPNAVIFVAA 301
L+ + + + +RE + ++ P N +F+A
Sbjct: 248 LLQNQYIEEPI------IREELSKMIE--------TPSSGNDKVFLAG 281
>gi|66910929|gb|AAH98046.1| LOC499602 protein, partial [Rattus norvegicus]
Length = 367
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/145 (41%), Positives = 89/145 (61%), Gaps = 3/145 (2%)
Query: 122 VHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLL 179
+HLAGTGDH + RR + P++KE +A+++LE+P+YG R+P Q + L VSDL ++
Sbjct: 14 IHLAGTGDHHYWRRRTLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVM 73
Query: 180 GRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVV 239
G A I E+ LLHWLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A
Sbjct: 74 GGALILESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASG 132
Query: 240 AFCEGILKHGTAWEALREELAAKKV 264
F G+L W L ++ + V
Sbjct: 133 VFTTGVLSKSINWRELEKQYYTQTV 157
>gi|403271853|ref|XP_003927817.1| PREDICTED: uncharacterized protein C4orf29 homolog [Saimiri
boliviensis boliviensis]
Length = 433
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 120/230 (52%), Gaps = 11/230 (4%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESIIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFY---GQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLL 191
+ P++KE +A+++LE+P+Y G P+ ++ + L VSDL ++G A + E+ LL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYILLGCSEPISRQRSSLKNVSDLFVMGGALVLESAALL 185
Query: 192 HWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
HWLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 HWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 234
>gi|281352380|gb|EFB27964.1| hypothetical protein PANDA_011201 [Ailuropoda melanoleuca]
Length = 414
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 117/227 (51%), Gaps = 8/227 (3%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR + PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQLIVPKEWNSKYKPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+Y +P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 126 TLMARPMIKEARMASLLLENPYYILLKPKDQIRSSLKNVSDLFVMGGALVLESAALLHWL 185
Query: 195 EWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF 241
E E G+G +G+ G+SMGG A++ S P P+ +P LS +A F
Sbjct: 186 ERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVF 231
>gi|402913107|ref|XP_003919067.1| PREDICTED: uncharacterized protein C4orf29-like, partial [Papio
anubis]
Length = 284
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/198 (36%), Positives = 106/198 (53%), Gaps = 8/198 (4%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 89 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 146
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 147 CKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 206
Query: 136 LRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWL 194
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWL
Sbjct: 207 TLMARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWL 266
Query: 195 EWEAGFGKMGVCGLSMGG 212
E E G+G +G+ G+SMGG
Sbjct: 267 ERE-GYGPLGMTGISMGG 283
>gi|119625596|gb|EAX05191.1| hypothetical protein FLJ21106, isoform CRA_c [Homo sapiens]
Length = 433
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 115/249 (46%), Gaps = 21/249 (8%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLE 195
+ P++KE +L S L VSDL ++G A + E+ LLHWLE
Sbjct: 126 TLMARPMIKEARMASLLRS--------------SLKNVSDLFVMGGALVLESAALLHWLE 171
Query: 196 WEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEAL 255
E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L
Sbjct: 172 RE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWREL 230
Query: 256 REELAAKKV 264
++ + V
Sbjct: 231 EKQYYTQTV 239
>gi|308474164|ref|XP_003099304.1| hypothetical protein CRE_09637 [Caenorhabditis remanei]
gi|308267443|gb|EFP11396.1| hypothetical protein CRE_09637 [Caenorhabditis remanei]
Length = 379
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 101/192 (52%), Gaps = 6/192 (3%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHT-FERRLRLG 139
EG F +P +P + R F A +P + +HLAGTGDH+ F R+ L
Sbjct: 81 EGFFTSPHATLFPDHMP--GNVGRAHFRA--WLPERPSPVCIHLAGTGDHSYFRRQYLLV 136
Query: 140 GPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAG 199
+LK+ + ++++++PFYG R+P Q + L V+DL ++G + I E L HW E G
Sbjct: 137 EDMLKDGVGSILVQNPFYGDRKPPNQFRSSLENVTDLFVMGASLIAECNHLFHWAE-TIG 195
Query: 200 FGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREEL 259
+G + G+SMGG A + GS P++ +P L+ +A A+ EG + ++ L+++L
Sbjct: 196 YGPFAISGVSMGGFMAQLAGSNSQRPISIIPILAWTTAGPAYTEGAIAPAVNYKLLQKQL 255
Query: 260 AAKKVAMTLEEV 271
L+ +
Sbjct: 256 EDPHYVEKLKRI 267
>gi|17506223|ref|NP_492206.1| Protein C54G4.7 [Caenorhabditis elegans]
gi|3875258|emb|CAA99819.1| Protein C54G4.7 [Caenorhabditis elegans]
Length = 378
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 98/180 (54%), Gaps = 6/180 (3%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERR-LRLG 139
EG F +P + +P + R F A +P + +HLAGTGDH++ RR L
Sbjct: 81 EGFFASPHATLFPNHMP--GNVGRAHFRA--YLPQKPGPVCIHLAGTGDHSYFRRHYLLV 136
Query: 140 GPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAG 199
+LK+ + ++++++PFYG R+P Q + L V+DL ++G A I E L +W E G
Sbjct: 137 DDMLKDGVGSILIQNPFYGDRKPPNQFRSSLENVTDLFVMGAALIAECNHLFNWSE-TLG 195
Query: 200 FGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREEL 259
+G + G+SMGG A + GS P++ +P L+ +A ++ EG + + L+++L
Sbjct: 196 YGPFAISGVSMGGFMAQLAGSNSQRPISIVPILAWTTASPSYTEGAISPAVNYSLLQKQL 255
>gi|268567095|ref|XP_002639889.1| Hypothetical protein CBG08211 [Caenorhabditis briggsae]
Length = 342
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 100/192 (52%), Gaps = 6/192 (3%)
Query: 81 EGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHT-FERRLRLG 139
EG F +P +P + R F A +P + +HLAGTGDH+ F R+ L
Sbjct: 40 EGFFTSPHATLFPDHMP--GNVGRAHFKA--WLPDKPSPVCIHLAGTGDHSYFRRQYLLV 95
Query: 140 GPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAG 199
+LK + ++++++PFYG R+P Q + L V+DL ++G + I E L +W E G
Sbjct: 96 DDMLKVGVGSILIQNPFYGDRKPPNQFRSSLENVTDLFVMGASLIAECNHLFNWAE-TLG 154
Query: 200 FGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREEL 259
+G + G+SMGG A + GS P++ +P LS +A ++ EG + ++ L+++L
Sbjct: 155 YGPFAISGVSMGGFMAQLAGSNSQRPISIIPILSWTTASPSYTEGAIAPAVNYKLLQKQL 214
Query: 260 AAKKVAMTLEEV 271
++ +
Sbjct: 215 EDPNYTDKIKNI 226
>gi|118764195|gb|AAI28144.1| C4orf29 protein [Homo sapiens]
Length = 332
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 85/137 (62%), Gaps = 3/137 (2%)
Query: 107 FLAPKCVPPQKMACVVHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLL 164
F+ PK + +HLAGTGDH + RR + P++KE +A+++LE+P+YG R+P
Sbjct: 14 FIVPKEWNSKYRPVCIHLAGTGDHHYWRRRTLMARPMIKEARMASLLLENPYYGCRKPKD 73
Query: 165 QRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPT 224
Q + L VSDL ++G A + E+ LLHWLE E G+G +G+ G+SMGG A++ S P
Sbjct: 74 QVRSSLKNVSDLFVMGGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPK 132
Query: 225 PVATLPFLSPHSAVVAF 241
P+ +P LS +A F
Sbjct: 133 PMPLIPCLSWSTASGVF 149
>gi|341898299|gb|EGT54234.1| hypothetical protein CAEBREN_19215 [Caenorhabditis brenneri]
Length = 380
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 121/235 (51%), Gaps = 20/235 (8%)
Query: 42 LLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLR-EGVFRTPCDEQLMSALPPES 100
L+ +K+L P+I+ +++ I T+ V+ EG F +P +P
Sbjct: 54 LVMNYVKELNPKID-------IVKKI------TKNGVISYEGFFPSPHALLFPDHMP--G 98
Query: 101 HNARVAFLAPKCVPPQKMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQ 159
+ R F A +P + +HLAGTGDH+ F R+ L +LK+ + ++++++PFYG
Sbjct: 99 NVGRAHFRA--YLPEKPGPVCIHLAGTGDHSYFRRQYLLVEDMLKDGVGSILVQNPFYGD 156
Query: 160 RRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVG 219
R+P Q + L V+DL ++G A I E L +W E G+G + G+SMGG A + G
Sbjct: 157 RKPPNQFRSSLENVTDLFVMGAALIAECNHLFNWSE-TLGYGPFAISGVSMGGFMAQLAG 215
Query: 220 SLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRER 274
S P++ +P L+ +A ++ EG + + L+++L + + ++ ++
Sbjct: 216 SNSLRPISIVPILAWTTASPSYTEGAIAPAVNYPLLQKQLEDPQYTEKIRKIPDQ 270
>gi|341893412|gb|EGT49347.1| hypothetical protein CAEBREN_17935 [Caenorhabditis brenneri]
Length = 380
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 121/235 (51%), Gaps = 20/235 (8%)
Query: 42 LLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLR-EGVFRTPCDEQLMSALPPES 100
L+ +K+L P+I+ +++ I T+ V+ EG F +P +P
Sbjct: 54 LVMNYVKELNPKID-------MVKKI------TKNGVISYEGFFPSPHALLFPDHMP--G 98
Query: 101 HNARVAFLAPKCVPPQKMACVVHLAGTGDHT-FERRLRLGGPLLKENIATMVLESPFYGQ 159
+ R F A +P + +HLAGTGDH+ F R+ L +LK+ + ++++++PFYG
Sbjct: 99 NVGRAHFRA--YLPEKPGPVCIHLAGTGDHSYFRRQYLLVEDMLKDGVGSILVQNPFYGD 156
Query: 160 RRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVG 219
R+P Q + L V+DL ++G A I E L +W E G+G + G+SMGG A + G
Sbjct: 157 RKPPNQFRSSLENVTDLFVMGAALIAECNHLFNWSE-TLGYGPFAISGVSMGGFMAQLAG 215
Query: 220 SLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRER 274
S P++ +P L+ +A ++ EG + + L+++L + + ++ ++
Sbjct: 216 SNSLRPISIVPILAWTTASPSYTEGAIAPAVNYPLLQKQLEDPQYTEKIRKIPDQ 270
>gi|442319786|ref|YP_007359807.1| hypothetical protein MYSTI_02807 [Myxococcus stipitatus DSM 14675]
gi|441487428|gb|AGC44123.1| hypothetical protein MYSTI_02807 [Myxococcus stipitatus DSM 14675]
Length = 319
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 111/224 (49%), Gaps = 19/224 (8%)
Query: 79 LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRL 138
+EGVF +P SALP E RV +L+ P++ ACVV LA + D + R L
Sbjct: 58 FQEGVFASP-----WSALPSEVRQGRVRWLS-SGRGPRRDACVV-LAASRDEGYRLRTWL 110
Query: 139 GGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEA 198
L+ E + +LE+PFYG RR QRG + V + L + ATIEEAR L+ + +
Sbjct: 111 FASLVDEGMDLFLLENPFYGARRATGQRGPHIRTVGEQLHMNIATIEEARGLVAYARRQ- 169
Query: 199 GFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREE 258
G+ ++ V G SMGG AA+ + P PV + S F +G+ HG
Sbjct: 170 GYQRVAVAGYSMGGYMAALSAATMPEPVGVAALAAGASPAPVFTKGV--HG-------RS 220
Query: 259 LAAKKVAMTLEEV--RERMRNVLSLTDVTRFPIPKIPNAVIFVA 300
+ K++ + +E R+R+ +L + + P P P A I VA
Sbjct: 221 IDFKRLGGSPDETVARQRLATLLDMANACLLPPPAKPGAAIIVA 264
>gi|313247095|emb|CBY35923.1| unnamed protein product [Oikopleura dioica]
Length = 460
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/145 (33%), Positives = 78/145 (53%), Gaps = 2/145 (1%)
Query: 119 ACVVHLAGTGDHTFERRLRL-GGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDL 176
+ AGTGDH F+RR + PL++E NIA+ ++E+P+Y +R+P Q+ + L DL
Sbjct: 139 GIAIQTAGTGDHGFKRRREIIAKPLIQEHNIASCIMENPYYARRKPDKQQYSGLRSFVDL 198
Query: 177 LLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHS 236
+ L E L WL+ E G ++ V GLS+GG AA+ S+ P P+A P + +
Sbjct: 199 ITLSMGVGIECNALAKWLKEELGVERICVTGLSLGGHTAALAASISPVPIAAAPGFAWST 258
Query: 237 AVVAFCEGILKHGTAWEALREELAA 261
+ + G L + W L ++ A
Sbjct: 259 STGVWTTGALSNRVDWANLESDINA 283
>gi|229576963|ref|NP_001153277.1| uncharacterized LOC100294620 [Pongo abelii]
gi|55728820|emb|CAH91149.1| hypothetical protein [Pongo abelii]
Length = 337
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 76/125 (60%), Gaps = 2/125 (1%)
Query: 141 PLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAG 199
P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE E G
Sbjct: 4 PMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLERE-G 62
Query: 200 FGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREEL 259
+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L ++
Sbjct: 63 YGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELEKQY 122
Query: 260 AAKKV 264
+ V
Sbjct: 123 YTQTV 127
>gi|296195594|ref|XP_002745407.1| PREDICTED: uncharacterized protein C4orf29 homolog isoform 1
[Callithrix jacchus]
Length = 337
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 77/128 (60%), Gaps = 2/128 (1%)
Query: 138 LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEW 196
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 1 MARPMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLER 60
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALR 256
E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L
Sbjct: 61 E-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELE 119
Query: 257 EELAAKKV 264
++ + V
Sbjct: 120 KQYYTQTV 127
>gi|118763584|gb|AAI28143.1| C4orf29 protein [Homo sapiens]
Length = 321
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 76/125 (60%), Gaps = 2/125 (1%)
Query: 141 PLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAG 199
P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE E G
Sbjct: 4 PMIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLERE-G 62
Query: 200 FGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREEL 259
+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L ++
Sbjct: 63 YGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELEKQY 122
Query: 260 AAKKV 264
+ V
Sbjct: 123 YTQTV 127
>gi|395845746|ref|XP_003795585.1| PREDICTED: uncharacterized protein C4orf29 homolog isoform 1
[Otolemur garnettii]
Length = 337
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 76/125 (60%), Gaps = 2/125 (1%)
Query: 141 PLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAG 199
P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE E G
Sbjct: 4 PMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHWLERE-G 62
Query: 200 FGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREEL 259
+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L ++
Sbjct: 63 YGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELEKQY 122
Query: 260 AAKKV 264
+ V
Sbjct: 123 YNQTV 127
>gi|156120905|ref|NP_001095599.1| uncharacterized protein C4orf29 homolog [Bos taurus]
gi|151554761|gb|AAI50054.1| MGC165715 protein [Bos taurus]
gi|296478731|tpg|DAA20846.1| TPA: hypothetical protein LOC530484 [Bos taurus]
Length = 337
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 77/128 (60%), Gaps = 2/128 (1%)
Query: 138 LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEW 196
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 1 MARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHWLER 60
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALR 256
E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L
Sbjct: 61 E-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELE 119
Query: 257 EELAAKKV 264
++ + V
Sbjct: 120 KQYYTQTV 127
>gi|426247063|ref|XP_004017306.1| PREDICTED: uncharacterized protein C4orf29 homolog isoform 1 [Ovis
aries]
Length = 337
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 77/128 (60%), Gaps = 2/128 (1%)
Query: 138 LGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEW 196
+ P++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE
Sbjct: 1 MARPMIKEARMASLLLENPYYGCRKPKDQIRSSLKNVSDLFVMGGALVLESAALLHWLER 60
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALR 256
E G+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L
Sbjct: 61 E-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELE 119
Query: 257 EELAAKKV 264
++ + V
Sbjct: 120 KQYYTQTV 127
>gi|313225938|emb|CBY21081.1| unnamed protein product [Oikopleura dioica]
Length = 460
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 78/145 (53%), Gaps = 2/145 (1%)
Query: 119 ACVVHLAGTGDHTFERRLRL-GGPLLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDL 176
+ AGTGDH F+RR + PL++E+ IA+ ++E+P+Y +R+P Q+ + L DL
Sbjct: 139 GIAIQTAGTGDHGFKRRREIIAKPLIQEHKIASCIMENPYYARRKPDKQQYSGLRSFVDL 198
Query: 177 LLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHS 236
+ L E L WL+ E G ++ V GLS+GG AA+ S+ P P+A P + +
Sbjct: 199 ITLSMGVGIECNALAKWLKEELGVERICVTGLSLGGHTAALAASISPVPIAAAPGFAWST 258
Query: 237 AVVAFCEGILKHGTAWEALREELAA 261
+ + G L + W L ++ A
Sbjct: 259 STGVWTTGALSNRIDWANLESDINA 283
>gi|260832030|ref|XP_002610961.1| hypothetical protein BRAFLDRAFT_115643 [Branchiostoma floridae]
gi|229296330|gb|EEN66971.1| hypothetical protein BRAFLDRAFT_115643 [Branchiostoma floridae]
Length = 324
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 76/125 (60%), Gaps = 5/125 (4%)
Query: 138 LGGPLLKEN-IATMVLESPFY---GQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHW 193
+ PLLKE+ IA+++LE+P+Y ++P Q + LL VSD+ ++G A I E++ LLHW
Sbjct: 1 MAKPLLKESGIASLLLENPYYILWSWQKPKDQLRSSLLNVSDIFVMGGALILESQVLLHW 60
Query: 194 LEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWE 253
E G G +G+ G+SMGG A++ S P P+A +P LS +A F +G+L W
Sbjct: 61 CE-RQGLGPLGLTGISMGGHMASLAASNWPKPIALVPCLSWSTASSVFTQGVLSRAIPWR 119
Query: 254 ALREE 258
L ++
Sbjct: 120 LLEKQ 124
>gi|219519327|gb|AAI45213.1| 3110057O12Rik protein [Mus musculus]
Length = 333
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 75/124 (60%), Gaps = 2/124 (1%)
Query: 142 LLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGF 200
++KE +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWLE E G+
Sbjct: 1 MIKEARMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALILESAALLHWLERE-GY 59
Query: 201 GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELA 260
G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L ++
Sbjct: 60 GPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELEKQYY 119
Query: 261 AKKV 264
+ V
Sbjct: 120 TQTV 123
>gi|148228150|ref|NP_001079943.1| uncharacterized protein LOC379634 [Xenopus laevis]
gi|34785883|gb|AAH57712.1| MGC68853 protein [Xenopus laevis]
Length = 331
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 75/124 (60%), Gaps = 2/124 (1%)
Query: 142 LLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGF 200
++KE +A+++LE+P+YG R+P Q + L VSDL ++G A + E+ LLHWLE E G+
Sbjct: 1 MIKEAGMASLLLENPYYGCRKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLERE-GY 59
Query: 201 GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELA 260
G +G+ G+SMGG A++ + P P+ +P LS +A F G+L W L ++
Sbjct: 60 GPLGMTGISMGGHMASLAVTNWPKPIPLIPCLSWSTASGVFTTGVLSKAVNWRELEKQYC 119
Query: 261 AKKV 264
+ V
Sbjct: 120 TQTV 123
>gi|60416119|gb|AAH90706.1| Zgc:110741 [Danio rerio]
Length = 324
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/124 (38%), Positives = 75/124 (60%), Gaps = 2/124 (1%)
Query: 142 LLKEN-IATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGF 200
++KE+ +A+++LE+P+YG R+P Q + L VSDL ++G A I E+ LLHWLE + GF
Sbjct: 1 MVKESGMASLLLENPYYGYRKPKDQLRSSLKNVSDLFVMGGALILESAALLHWLERD-GF 59
Query: 201 GKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELA 260
+G+ G+SMGG A++ + P P+ +P LS +A F G+L W L ++ A
Sbjct: 60 WPLGMTGISMGGHMASLAVTNWPKPIPLIPCLSWTTASSVFTTGVLSRAVNWRELEKQYA 119
Query: 261 AKKV 264
V
Sbjct: 120 THTV 123
>gi|148703201|gb|EDL35148.1| mCG125095, isoform CRA_a [Mus musculus]
Length = 130
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 74/119 (62%), Gaps = 3/119 (2%)
Query: 96 LPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLE 153
+P ES AR F+ PK + +HLAGTGDH + RR + P++KE +A+++LE
Sbjct: 9 MPIESVVARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRRTLMARPMIKEARMASLLLE 68
Query: 154 SPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGG 212
+P+Y + + L VSDLL++G A I E+ LLHWLE E+ +G +G+ G+SMGG
Sbjct: 69 NPYYWLQEAQGPSRSSLKDVSDLLVMGGALILESAALLHWLERES-YGPLGMTGISMGG 126
>gi|360043226|emb|CCD78638.1| hypothetical protein Smp_197480 [Schistosoma mansoni]
Length = 402
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 62/108 (57%), Gaps = 1/108 (0%)
Query: 155 PFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVH 214
PFY +R+P Q+G+ L VSDL ++G A I E LL W E G+G + G+SMGG
Sbjct: 3 PFYSKRKPDEQQGSGLNSVSDLFIMGGALIMECSALLKWCE-HNGYGPFALHGISMGGYM 61
Query: 215 AAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAK 262
+A+ ++ P P++ +P LS SA F EGIL + W L ++ +
Sbjct: 62 SALCATVWPKPISLIPCLSWTSASCVFLEGILSNTVNWSVLTKQYYSD 109
>gi|297293355|ref|XP_001082145.2| PREDICTED: uncharacterized protein C4orf29 homolog isoform 2
[Macaca mulatta]
Length = 331
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 72/125 (57%), Gaps = 8/125 (6%)
Query: 141 PLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAG 199
P++KE +A+++LE+P+Y R L+ VSDL ++G A + E+ LLHWLE E G
Sbjct: 4 PMIKEARMASLLLENPYYILLRSSLKN------VSDLFVMGGALVLESAALLHWLERE-G 56
Query: 200 FGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREEL 259
+G +G+ G+SMGG A++ S P P+ +P LS +A F G+L W L ++
Sbjct: 57 YGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTASGVFTTGVLSKSINWRELEKQY 116
Query: 260 AAKKV 264
+ V
Sbjct: 117 YTQTV 121
>gi|339233712|ref|XP_003381973.1| hypothetical protein Tsp_11075 [Trichinella spiralis]
gi|316979160|gb|EFV61988.1| hypothetical protein Tsp_11075 [Trichinella spiralis]
Length = 240
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 108/235 (45%), Gaps = 35/235 (14%)
Query: 31 FSRGWGG-SKLELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLREGVFRTPCD 89
F+ GWG + L L + +++ G+ I E + V+ G F TP
Sbjct: 2 FADGWGNPADLIKLIKFRREMVKRDAGK-------------IIENNSFVIYSGEFETPVV 48
Query: 90 EQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLR-LGGPLLKE-NI 147
+ L +P A+ + PK +H+AGTGDH F RR + L PL ++ I
Sbjct: 49 KLLPELVPLPVRYAQFEMILPKVKQANSCPLCIHMAGTGDHGFWRRRKFLALPLAQQMGI 108
Query: 148 ATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCG 207
T+ + RR L+ V+DL ++G I E+ LL+WL + G +G+ G
Sbjct: 109 GTISV-------RRSCLRY------VTDLFVMGVCLIFESAVLLNWLI-KRGNWPLGLTG 154
Query: 208 LSMGGVHAAMVGSL----HPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREE 258
+S+GG H + + SL + PVA +P LS +A F +G++ W L ++
Sbjct: 155 ISLGG-HVSQMASLAAACYSKPVAIVPCLSWTTASAVFTQGVMARAIPWNTLEKQ 208
>gi|355687600|gb|EHH26184.1| hypothetical protein EGK_16086 [Macaca mulatta]
gi|355763441|gb|EHH62171.1| hypothetical protein EGM_20397 [Macaca fascicularis]
Length = 358
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 23/149 (15%)
Query: 138 LGGPLLKE-NIATMVLESPFY------GQRRPLLQR--------GAK-------LLCVSD 175
+ P++KE +A+++LE+P+Y + L+ R G K L VSD
Sbjct: 1 MARPMIKEARMASLLLENPYYILLLFTEEEMKLMHRYKEMSNVAGNKCRGRRSSLKNVSD 60
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPH 235
L ++G A + E+ LLHWLE E G+G +G+ G+SMGG A++ S P P+ +P LS
Sbjct: 61 LFVMGGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWS 119
Query: 236 SAVVAFCEGILKHGTAWEALREELAAKKV 264
+A F G+L W L ++ + V
Sbjct: 120 TASGVFTTGVLSKSINWRELEKQYYTQTV 148
>gi|432104075|gb|ELK30905.1| hypothetical protein MDA_GLEAN10021161 [Myotis davidii]
Length = 498
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 1/102 (0%)
Query: 163 LLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLH 222
++ R + L VSDL ++G A + E+ LLHWLE E G+G +G+ G+SMGG A++ S
Sbjct: 187 IIARRSSLKNVSDLFVMGGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNW 245
Query: 223 PTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKV 264
P P+ +P LS +A F G+L W L ++ A+ V
Sbjct: 246 PKPMPLIPCLSWSTASGVFTTGVLSKSINWRELEKQYYAQTV 287
>gi|260832032|ref|XP_002610962.1| hypothetical protein BRAFLDRAFT_60900 [Branchiostoma floridae]
gi|229296331|gb|EEN66972.1| hypothetical protein BRAFLDRAFT_60900 [Branchiostoma floridae]
Length = 157
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 80/146 (54%), Gaps = 12/146 (8%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLIKQLFPEIEGQNWPPSLIQPIWRT--IWETQTAV 78
M+R + FF+RGWG L+ L+RL+ + ++ ++ L+ +R + +++V
Sbjct: 9 MYRRLLLTKFFTRGWGD--LDQLKRLL-EFKKFVQNRDLCCQLVDTHFRNYPVAIDKSSV 65
Query: 79 -----LREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF- 132
+ EG F +P L LP E+ AR + P P ++ +HLAGTGDH F
Sbjct: 66 GGDCKILEGHFTSPLTHILPGLLPREAETARFQLILPVRWPTEQRPVCIHLAGTGDHFFW 125
Query: 133 ERRLRLGGPLLKEN-IATMVLESPFY 157
RR + PLLKE+ IA+++LE+P+Y
Sbjct: 126 RRRTLMAKPLLKESGIASLLLENPYY 151
>gi|167525697|ref|XP_001747183.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774478|gb|EDQ88107.1| predicted protein [Monosiga brevicollis MX1]
Length = 340
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 108/270 (40%), Gaps = 47/270 (17%)
Query: 30 FFSRGWGGSK-LELLERLIKQLFPEIEGQNWPPSLIQPIWRTIWETQTAVLREGVFRTPC 88
FFS+G+G K + LE LI +G + Q W E R R+P
Sbjct: 32 FFSQGFGDMKRVTELETLIY-----AQGSAAFSDIDQLSWEAPKEQGVLYTRRATCRSP- 85
Query: 89 DEQLMSALPPESHN-------ARVAFLAPKCVPPQKM----------ACVVHLAGTGDHT 131
L S LP ES N +R FL P P +M +VHLA TGD
Sbjct: 86 ---LASFLPEESANMHLQLVMSRDYFL-PGLNDPDRMDATDQRQPVKGIMVHLAPTGDMG 141
Query: 132 FERRLRL-GGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCL 190
F R +L P+ ++ A+++L P+YG+R+P Q V+D L + EA L
Sbjct: 142 FAFRTKLMAEPMAQQGYASLLLIIPYYGRRKPHAQIKHYASTVADYLTCCFGSFVEAAKL 201
Query: 191 LHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAF-CEGILKHG 249
+ + +G+ G+S+GG A M L H +V C G
Sbjct: 202 TQYCRTQFSQVPVGLTGMSLGGAMACMASGLD------------HGDLVLLACVGSASPR 249
Query: 250 TAWEALREELAAKKVAMTLEEVRERMRNVL 279
AL AK +LE R+R+ VL
Sbjct: 250 VMVNAL-----AKDANCSLEAARDRLAQVL 274
>gi|12844839|dbj|BAB26518.1| unnamed protein product [Mus musculus]
Length = 147
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 55/99 (55%), Gaps = 2/99 (2%)
Query: 69 RTIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTG 128
R + E + +G F +P + +P ES AR F+ PK + +HLAGTG
Sbjct: 45 RPVEEQSDCKILDGHFVSPMAHYVPGIMPIESVVARFQFIVPKEWNSRYRPVCIHLAGTG 104
Query: 129 DHTF-ERRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQ 165
DH + RR + P++KE +A+++LE+P+YG R+P Q
Sbjct: 105 DHHYWRRRTLMARPMIKEARMASLLLENPYYGCRKPKDQ 143
>gi|255076197|ref|XP_002501773.1| predicted protein [Micromonas sp. RCC299]
gi|226517037|gb|ACO63031.1| predicted protein [Micromonas sp. RCC299]
Length = 349
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 135/301 (44%), Gaps = 45/301 (14%)
Query: 13 LDHVYGAFMHRTKI--SPPFFSRGWGGS--KLELLERLIKQLFPEIEGQNWPPSLIQPI- 67
LD Y K + FF +G+ G+ ++E L K + I G+N I+P+
Sbjct: 4 LDDWYSTLAKNLKAGSTKRFFWKGYEGAPGEMEKATSLFKDVLDTITGRN--EKKIKPLA 61
Query: 68 WRTIWE----------------TQTAVLREGVFRTPCDEQLMSALPPESHNARVAFL--A 109
R I E T ++R+ F +P + L P ES ++ ++
Sbjct: 62 LRWISERVVEPSVFSACLTPGSTSMPIIRQAEFDSPAAQYL----PKESQTGQLMYVWKT 117
Query: 110 PKCVPPQKMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFYGQRRPLLQRGAK 169
VPP+K+ C++ L TGD + R ++ LL IAT++ P+YG+R+P Q
Sbjct: 118 GGVVPPRKI-CIM-LPTTGDAFYWFRKQIALDLLSHEIATVIPMFPYYGKRKPKDQFHHI 175
Query: 170 LLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGG----VHAAMVGSLHPTP 225
+ VSD + + + EA + W E + G+SMGG V A + S +
Sbjct: 176 IPSVSDFFVQICSGVLEAAAIGSWAAAEFPEVETVFTGVSMGGSVANVAAILAASNSGSK 235
Query: 226 VATLPFLSPHSAVVAFCEGILKHGTAWEALREELAAKKVAMTLEEVRERMRNVLSLTDVT 285
V T ++ SA +F G+L + AW+ L E A + VA + ++N++++ +V+
Sbjct: 236 VGTCCVVATCSA-TSFLTGVLHNRIAWKELSE--APRGVA-------DELKNLVAVENVS 285
Query: 286 R 286
Sbjct: 286 E 286
>gi|344250240|gb|EGW06344.1| Uncharacterized protein C4orf29-like [Cricetulus griseus]
Length = 294
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 179 LGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAV 238
+G A I E+ LLHWLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A
Sbjct: 1 MGGALILESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTAS 59
Query: 239 VAFCEGILKHGTAWEALREELAAKKV 264
F G+L W L ++ + V
Sbjct: 60 GVFTTGVLSKSINWRELEKQYYTQTV 85
>gi|149048797|gb|EDM01338.1| hypothetical protein LOC499602 [Rattus norvegicus]
Length = 278
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 179 LGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAV 238
+G A I E+ LLHWLE E G+G +G+ G+SMGG A++ S P P+ +P LS +A
Sbjct: 1 MGGALILESAALLHWLERE-GYGPLGMTGISMGGHMASLAVSNWPKPMPLIPCLSWSTAS 59
Query: 239 VAFCEGILKHGTAWEALREELAAKKV 264
F G+L W L ++ + V
Sbjct: 60 GVFTTGVLSKSINWRELEKQYYTQTV 85
>gi|60098473|emb|CAH65067.1| hypothetical protein RCJMB04_2k21 [Gallus gallus]
Length = 293
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 46/80 (57%), Gaps = 1/80 (1%)
Query: 179 LGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAV 238
+G A + E+ LLHWLE E G+G +G+ G+SMGG A++ + P P+ +P LS +A
Sbjct: 1 MGGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVTNWPKPLPLIPCLSWSTAS 59
Query: 239 VAFCEGILKHGTAWEALREE 258
F G+L W L ++
Sbjct: 60 AVFTTGVLSKAVNWRELEKQ 79
>gi|380807653|gb|AFE75702.1| uncharacterized protein C4orf29 precursor, partial [Macaca mulatta]
Length = 84
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 160 RRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVG 219
R+P Q + L VSDL ++G A + E+ LLHWLE E G+G +G+ G+SMGG A++
Sbjct: 2 RKPKDQVRSSLKNVSDLFVMGGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAV 60
Query: 220 SLHPTPVATLPFLSPHSAVVAF 241
S P P+ +P LS +A F
Sbjct: 61 SNWPKPMPLIPCLSWSTASGVF 82
>gi|119625597|gb|EAX05192.1| hypothetical protein FLJ21106, isoform CRA_d [Homo sapiens]
Length = 393
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 69/143 (48%), Gaps = 7/143 (4%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 8 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKIEEQSD 65
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 66 CKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDHHYWRRR 125
Query: 136 LRLGGPLLKE-NIATMVLESPFY 157
+ P++KE +A+++LE+P+Y
Sbjct: 126 TLMARPMIKEARMASLLLENPYY 148
>gi|15030119|gb|AAH11312.1| 3110057O12Rik protein, partial [Mus musculus]
Length = 189
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 69/143 (48%), Gaps = 7/143 (4%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S + E
Sbjct: 46 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVHIDKVEEQSD 103
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH + RR
Sbjct: 104 CKILDGHFVSPMAHYVPGIMPIESVVARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRR 163
Query: 136 LRLGGPLLKE-NIATMVLESPFY 157
+ P++KE +A+++LE+P+Y
Sbjct: 164 TLMARPMIKEARMASLLLENPYY 186
>gi|219120616|ref|XP_002181043.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407759|gb|EEC47695.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 373
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/254 (22%), Positives = 109/254 (42%), Gaps = 32/254 (12%)
Query: 25 KISPPFFSRGWG------GSKLELLERLI-----KQLFPEIEGQNWPPSLIQPIWRTIWE 73
+ P FF+ GWG G++ E+L L + E+E + IQ W
Sbjct: 29 RARPKFFADGWGKYELAFGAQDEMLSMLKSSDKRNRFRTELENGS-----IQ--WSQPVV 81
Query: 74 TQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFE 133
+ + G F +P + LP ++ R ++ P +K ++ L GTG+
Sbjct: 82 KSSVSVTSGAFPSP----VAHLLPDKAKICRFYYVQPIIEEKRKTVTIIMLPGTGEAGKG 137
Query: 134 RRLRLGGPLLKE-NIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLH 192
RL++ L E +++++ +P+Y R+P Q + V DLLL A ++EA L
Sbjct: 138 DRLKMATQLADECGWSSIIVTAPYYAARKPDNQTAFFVRTVEDLLLQSVAIMQEAAILAS 197
Query: 193 WLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTP--------VATLPFLSPHSAVVAFCEG 244
+ + ++ + G S G A+ ++ T +A +P++ S + G
Sbjct: 198 YFLHRSEQQRVCITGFSWGAAMASGAAAVALTTAHKDAGRRLACVPYVGCSSPSI-LVSG 256
Query: 245 ILKHGTAWEALREE 258
+L+ W AL+++
Sbjct: 257 VLESSIDWTALQQK 270
>gi|303278470|ref|XP_003058528.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459688|gb|EEH56983.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 574
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 95/212 (44%), Gaps = 17/212 (8%)
Query: 78 VLREGVFRTPCDEQLMSALPPESHNARVAFL-APKCVPPQKMACVVHLAGTGDHTFERRL 136
+ RE F +P + + P ES + F+ PP K+A +HL TGD F R
Sbjct: 86 LYREAEFDSPGAKYM----PEESKVGNMLFIWRTGKKPPSKIA--IHLPTTGDQYFWYRK 139
Query: 137 RLGGPLLKENIATMVLESPFYGQRRPLLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEW 196
+L LLK ++A+ + P+YG+R+P Q L VS + I EA + W
Sbjct: 140 QLAKDLLKHDVASCIPMFPYYGKRKPPGQYLHLLTSVSAFITQVCGGIMEAAGIAAWANA 199
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLH----PTPVATLPFLSPHSAVVAFCEGILKHGTAW 252
K+ + G+S+GG A + + T V + P ++ SA +F G+L + AW
Sbjct: 200 AYPGAKVVMTGVSLGGSVANVAAVIAAGNCDTGVGSCPVVATSSA-TSFLTGVLHNRIAW 258
Query: 253 EALREELAA-----KKVAMTLEEVRERMRNVL 279
L E A K V +V E+ + V+
Sbjct: 259 NVLSEAEDAVADEIKAVVAASLDVGEKSKPVI 290
>gi|355736537|gb|AES12034.1| hypothetical protein [Mustela putorius furo]
Length = 200
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 66/141 (46%), Gaps = 7/141 (4%)
Query: 20 FMHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQT 76
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 62 ILYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVYIDKIEEQSD 119
Query: 77 AVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERR 135
+ +G F +P + +P ES AR + PK + +HLAGTGDH + RR
Sbjct: 120 CKILDGHFVSPMAHYVPDIMPMESVIARFQLIVPKEWNSKYKPVCIHLAGTGDHHYWRRR 179
Query: 136 LRLGGPLLKE-NIATMVLESP 155
+ P++KE +A+++LE+P
Sbjct: 180 TLMARPMIKEARMASLLLENP 200
>gi|148703202|gb|EDL35149.1| mCG125095, isoform CRA_b [Mus musculus]
Length = 75
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 2/64 (3%)
Query: 96 LPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTF-ERRLRLGGPLLKE-NIATMVLE 153
+P ES AR F+ PK + +HLAGTGDH + RR + P++KE +A+++LE
Sbjct: 9 MPIESVVARFQFIVPKEWNSRYRPVCIHLAGTGDHHYWRRRTLMARPMIKEARMASLLLE 68
Query: 154 SPFY 157
+P+Y
Sbjct: 69 NPYY 72
>gi|444721947|gb|ELW62654.1| hypothetical protein TREES_T100015077 [Tupaia chinensis]
Length = 320
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 35/50 (70%), Gaps = 1/50 (2%)
Query: 163 LLQRGAKLLCVSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGG 212
++ R + L VSDL ++G A + E+ LLHWLE E G+G +G+ G+SMGG
Sbjct: 90 VIARRSSLKNVSDLFVMGGALVLESAALLHWLERE-GYGPLGMTGISMGG 138
>gi|156717650|ref|NP_001096365.1| uncharacterized protein LOC100124957 [Xenopus (Silurana)
tropicalis]
gi|134025805|gb|AAI35880.1| LOC100124957 protein [Xenopus (Silurana) tropicalis]
Length = 258
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 1/64 (1%)
Query: 179 LGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAV 238
+G A + E+ LLHWLE E G+G +G+ G+SMGG A++ + P P+ +P LS +A
Sbjct: 1 MGGALVLESAALLHWLERE-GYGPLGMTGISMGGHMASLAVTNWPKPIPLVPCLSWSTAS 59
Query: 239 VAFC 242
F
Sbjct: 60 GVFT 63
>gi|430742482|ref|YP_007201611.1| dienelactone hydrolase-like enzyme [Singulisphaera acidiphila DSM
18658]
gi|430014202|gb|AGA25916.1| dienelactone hydrolase-like enzyme [Singulisphaera acidiphila DSM
18658]
Length = 319
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 58/130 (44%), Gaps = 6/130 (4%)
Query: 98 PESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLESPFY 157
PE++ + P + V+H+ G D R L L + +A + ++ P+Y
Sbjct: 73 PENNTVHAEYFRPVGPGRRPAVVVLHILGA-DFALSRYL--AARLAQRGVAALFVKLPYY 129
Query: 158 GQRRPLLQRGAKLLCVSDLLLLG-RATIEEARCLLHWLEWEAGFG--KMGVCGLSMGGVH 214
G+RRP L D LL R + + R WL A ++GV G+S+GG+
Sbjct: 130 GERRPAGSDKKFLSADMDRSLLSMRQGVCDVRRAAAWLAGRAEVDPKQLGVTGISLGGIV 189
Query: 215 AAMVGSLHPT 224
A++ + PT
Sbjct: 190 ASLAAANDPT 199
>gi|347754310|ref|YP_004861874.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
B]
gi|347586828|gb|AEP11358.1| hypothetical protein Cabther_A0601 [Candidatus Chloracidobacterium
thermophilum B]
Length = 383
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 41/80 (51%), Gaps = 4/80 (5%)
Query: 147 IATMVLESPFYGQRRPLLQRGAKLLCVSDL---LLLGRATIEEARCLLHWLEWEAGFGKM 203
IA++ L P++G RRP+ Q A + +L L R + E R +L WLE G+ +
Sbjct: 146 IASVRLSLPYHGSRRPVHQVRADYMVSPNLGRTLQAVRQAVHEVRLVLDWLE-SQGYHRF 204
Query: 204 GVCGLSMGGVHAAMVGSLHP 223
G+ G S+G A + + P
Sbjct: 205 GIIGTSIGSCVAFLAYAFDP 224
>gi|47207974|emb|CAF94569.1| unnamed protein product [Tetraodon nigroviridis]
Length = 57
Score = 44.3 bits (103), Expect = 0.079, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 29/40 (72%), Gaps = 1/40 (2%)
Query: 173 VSDLLLLGRATIEEARCLLHWLEWEAGFGKMGVCGLSMGG 212
VSDL ++G A I E+ LLHWLE E G+ +G+ G+SMGG
Sbjct: 7 VSDLFVMGGALILESTVLLHWLERE-GYWPLGMTGISMGG 45
>gi|380807651|gb|AFE75701.1| uncharacterized protein C4orf29 precursor, partial [Macaca mulatta]
Length = 117
Score = 44.3 bits (103), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Query: 70 TIWETQTAVLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGD 129
I E + +G F +P + +P ES AR F+ PK + +HLAGTGD
Sbjct: 33 KIEEQSDCKILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGD 92
Query: 130 HTF-ERRLRLGGPLLKE-NIATMVL 152
H + RR + P++KE +A+++L
Sbjct: 93 HHYWRRRTLMARPMIKEARMASLLL 117
>gi|412993724|emb|CCO14235.1| unknown protein [Bathycoccus prasinos]
Length = 583
Score = 43.9 bits (102), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 50/107 (46%), Gaps = 12/107 (11%)
Query: 127 TGDHTFERRLRLGGPLL----KENIA-------TMVLESPFYGQRRPLLQRGAKLLCVSD 175
TGD TF R R LL KEN A ++ E PFYG+RR + Q + VS+
Sbjct: 240 TGDTTFAFRRRTAENLLTAHYKENDAMEDEAMVVLIPEFPFYGKRRVVGQPTHVISTVSE 299
Query: 176 LLLLGRATIEEARCLLHWLEWEAGFG-KMGVCGLSMGGVHAAMVGSL 221
+L+ + EA L+ W G + + G SMGG AA G L
Sbjct: 300 YILMHLIGLREACGLIEWARDTYGDEVSIAIGGCSMGGYIAANAGIL 346
>gi|422350316|ref|ZP_16431202.1| hypothetical protein HMPREF9465_02092 [Sutterella wadsworthensis
2_1_59BFAA]
gi|404657374|gb|EKB30266.1| hypothetical protein HMPREF9465_02092 [Sutterella wadsworthensis
2_1_59BFAA]
Length = 301
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 199 GFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILKHGTAWEALREE 258
G G+ V G SMGG+ A + HP + +L LS SA G+ + GT W+ +R
Sbjct: 108 GIGRFHVAGFSMGGMIAQTLALRHPERILSLASLS--SATGNPATGLGRLGTIWKIIRP- 164
Query: 259 LAAKKVAMTLEEVRERMRNVL 279
+ A T EE RE MR +L
Sbjct: 165 ---RGPARTKEEAREEMREIL 182
>gi|431899695|gb|ELK07649.1| hypothetical protein PAL_GLEAN10013963 [Pteropus alecto]
Length = 380
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 49/113 (43%), Gaps = 5/113 (4%)
Query: 21 MHRTKISPPFFSRGWGGSKLELLERLI---KQLFPEIEGQNWPPSLIQPIWRTIWETQTA 77
++R + F RGWG + E L+RL K + QN S I E
Sbjct: 9 LYRRLLLTKLFIRGWG--RPEDLKRLFEFRKMIGNRERCQNLVSSDYPVYIDKIEEQSDC 66
Query: 78 VLREGVFRTPCDEQLMSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDH 130
+ +G F +P + +P ES AR F+ PK + +HLAGTGDH
Sbjct: 67 KILDGHFVSPMAHYVPDIMPIESVIARFQFIVPKEWNSKYRPVCIHLAGTGDH 119
>gi|283778652|ref|YP_003369407.1| hypothetical protein Psta_0862 [Pirellula staleyi DSM 6068]
gi|283437105|gb|ADB15547.1| hypothetical protein Psta_0862 [Pirellula staleyi DSM 6068]
Length = 329
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 59/135 (43%), Gaps = 7/135 (5%)
Query: 93 MSALPPESHNARVAFLAPKCVPPQKMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVL 152
M P ++ F PK + V+H+ G GD R L + A+M++
Sbjct: 77 MKTESPANNTVHGEFYRPKMAGQRPAVIVLHILG-GDLQLSRVF--CNHLAQNGSASMLI 133
Query: 153 ESPFYGQRRPLLQRGAKLLCVSDLLLLG-RATIEEARCLLHWLEWEAGF---GKMGVCGL 208
P+YG RR + + D ++G R + + R + W E + + G+ G+
Sbjct: 134 HLPYYGNRRAPGESRRMISEDPDQTVVGMRQAVMDIRRGISWFESQRETIDPQQTGIFGI 193
Query: 209 SMGGVHAAMVGSLHP 223
S+GG+ +A+ ++ P
Sbjct: 194 SLGGITSALAAAVEP 208
>gi|320104582|ref|YP_004180173.1| hypothetical protein Isop_3059 [Isosphaera pallida ATCC 43644]
gi|319751864|gb|ADV63624.1| hypothetical protein Isop_3059 [Isosphaera pallida ATCC 43644]
Length = 382
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 15/135 (11%)
Query: 99 ESHNARVAFLAPKCVPP-----QKMACVVHLAGTGDHTFERRLRLGGPLLKENIATMVLE 153
E++ + + APK P + V+H+ G D R L L + +A L+
Sbjct: 109 ENNVVPLDYFAPKVPAPGGADRRPAVVVLHILGA-DFALSRYL--CARLAQRGVAAAFLQ 165
Query: 154 SPFYGQRRPLLQRGAKLLCVSDL---LLLGRATIEEARCLLHWLEW--EAGFGKMGVCGL 208
P+YG RRP G + +D+ + R + + R + WL E ++GV G+
Sbjct: 166 LPYYGDRRP--PGGDQRFLSADIERSVAAMRQGVCDVRYAVAWLAQRPEVDPERLGVAGI 223
Query: 209 SMGGVHAAMVGSLHP 223
S+GG+ +++V + P
Sbjct: 224 SLGGIISSLVAANDP 238
>gi|164687129|ref|ZP_02211157.1| hypothetical protein CLOBAR_00755 [Clostridium bartlettii DSM
16795]
gi|164604014|gb|EDQ97479.1| glutamine-fructose-6-phosphate transaminase (isomerizing)
[Clostridium bartlettii DSM 16795]
Length = 609
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 40/71 (56%), Gaps = 13/71 (18%)
Query: 240 AFCEGILKHGTAWEALREE------LAAK-----KVAMTLEEVRERMRNVLSLTDVTRFP 288
AF G LKHGT AL EE LA + K+ ++EV+ R NV+S+T+VT
Sbjct: 497 AFAAGELKHGTI--ALIEEGVPVIVLATQQRLFEKMLSNMQEVKARGANVISITEVTNKE 554
Query: 289 IPKIPNAVIFV 299
+ K ++VI++
Sbjct: 555 VEKSSDSVIYI 565
>gi|255034251|ref|YP_003084872.1| putative esterase [Dyadobacter fermentans DSM 18053]
gi|254947007|gb|ACT91707.1| putative esterase [Dyadobacter fermentans DSM 18053]
Length = 237
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%)
Query: 197 EAGFGKMGVCGLSMGGVHAAMVGSLHPTPVATLPFLSPHSAVVAFCEGILK 247
E GK+GV G S GG HAA HP VA L +S + F +G
Sbjct: 105 ECNVGKIGVAGCSFGGFHAANFAFRHPEMVAYLVSMSGAFDIRGFLDGFYD 155
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.139 0.439
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,372,852,231
Number of Sequences: 23463169
Number of extensions: 224330052
Number of successful extensions: 680424
Number of sequences better than 100.0: 197
Number of HSP's better than 100.0 without gapping: 173
Number of HSP's successfully gapped in prelim test: 24
Number of HSP's that attempted gapping in prelim test: 679887
Number of HSP's gapped (non-prelim): 218
length of query: 333
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 190
effective length of database: 9,003,962,200
effective search space: 1710752818000
effective search space used: 1710752818000
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)