BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029860
(186 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255539042|ref|XP_002510586.1| protein with unknown function [Ricinus communis]
gi|223551287|gb|EEF52773.1| protein with unknown function [Ricinus communis]
Length = 282
Score = 326 bits (836), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 151/179 (84%), Positives = 167/179 (93%), Gaps = 1/179 (0%)
Query: 2 QDDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLA 61
Q +E+LHKLLLP+V +LP+ PPS+I+ NFV+YFAPDF+KPGHDQYVYRHANGLCVIGLA
Sbjct: 96 QVTDEDLHKLLLPDVSNLPIVPPSSIETNFVSYFAPDFMKPGHDQYVYRHANGLCVIGLA 155
Query: 62 PTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVR 121
PTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKK NAQHFESNTA+CKVCT + SYIVR
Sbjct: 156 PTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKK-NAQHFESNTALCKVCTRDASYIVR 214
Query: 122 CCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKRE 180
CCVKGSLLEVN++LIKQPGLLNSSADREGY AI+MPKP DWLKIKSSLL LE+YK+ RE
Sbjct: 215 CCVKGSLLEVNDKLIKQPGLLNSSADREGYTAIMMPKPGDWLKIKSSLLTLEEYKKLRE 273
>gi|224061971|ref|XP_002300690.1| predicted protein [Populus trichocarpa]
gi|222842416|gb|EEE79963.1| predicted protein [Populus trichocarpa]
Length = 205
Score = 306 bits (785), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 144/167 (86%), Positives = 156/167 (93%), Gaps = 1/167 (0%)
Query: 14 PNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGI 73
P++ DLP TPPS++ NFV+YFAPDF+KPGHDQYVYRHANGLCVIGLA THVAFKDEGGI
Sbjct: 38 PDIRDLPTTPPSSVQTNFVSYFAPDFVKPGHDQYVYRHANGLCVIGLARTHVAFKDEGGI 97
Query: 74 TAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNN 133
T VDFNVGKSDRSG KVTGKRKK NAQHFESNTA+CKVCT NDSYIVRCCVKGSLLEVN+
Sbjct: 98 TVVDFNVGKSDRSGIKVTGKRKK-NAQHFESNTALCKVCTQNDSYIVRCCVKGSLLEVND 156
Query: 134 RLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKRE 180
RLIKQP LLNSSADREGYIAIIMPKPADWLK+K+SLLGLE+YK+ RE
Sbjct: 157 RLIKQPELLNSSADREGYIAIIMPKPADWLKVKASLLGLEEYKKLRE 203
>gi|302142721|emb|CBI19924.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 304 bits (779), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 139/175 (79%), Positives = 163/175 (93%), Gaps = 1/175 (0%)
Query: 8 LHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAF 67
L +LL+P+VH+LPL PPSA+ NFV+YFAPDF+KPGHDQYVYRHANGLCVIGLAP+HVA
Sbjct: 739 LQRLLVPDVHNLPLFPPSAVQSNFVSYFAPDFMKPGHDQYVYRHANGLCVIGLAPSHVAL 798
Query: 68 KDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGS 127
++GG+TAVDFNVGK+DRSG KVTGKRKK NAQHFESN+A+CKVCTN+ SYIVRCCVKG+
Sbjct: 799 TEKGGVTAVDFNVGKTDRSGIKVTGKRKK-NAQHFESNSALCKVCTNDASYIVRCCVKGN 857
Query: 128 LLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREEC 182
LLEVN+RLIKQPGLLNSSADREGYIAIIMPKPADWLK+K+SL+ +E+YK+ +E C
Sbjct: 858 LLEVNDRLIKQPGLLNSSADREGYIAIIMPKPADWLKVKASLIDIEEYKKLKEVC 912
>gi|225457867|ref|XP_002269229.1| PREDICTED: protein FAM206A [Vitis vinifera]
Length = 196
Score = 303 bits (775), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 139/175 (79%), Positives = 163/175 (93%), Gaps = 1/175 (0%)
Query: 8 LHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAF 67
L +LL+P+VH+LPL PPSA+ NFV+YFAPDF+KPGHDQYVYRHANGLCVIGLAP+HVA
Sbjct: 23 LQRLLVPDVHNLPLFPPSAVQSNFVSYFAPDFMKPGHDQYVYRHANGLCVIGLAPSHVAL 82
Query: 68 KDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGS 127
++GG+TAVDFNVGK+DRSG KVTGKRKK NAQHFESN+A+CKVCTN+ SYIVRCCVKG+
Sbjct: 83 TEKGGVTAVDFNVGKTDRSGIKVTGKRKK-NAQHFESNSALCKVCTNDASYIVRCCVKGN 141
Query: 128 LLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREEC 182
LLEVN+RLIKQPGLLNSSADREGYIAIIMPKPADWLK+K+SL+ +E+YK+ +E C
Sbjct: 142 LLEVNDRLIKQPGLLNSSADREGYIAIIMPKPADWLKVKASLIDIEEYKKLKEVC 196
>gi|356517742|ref|XP_003527545.1| PREDICTED: protein FAM206A-like isoform 1 [Glycine max]
Length = 203
Score = 298 bits (762), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 139/174 (79%), Positives = 156/174 (89%), Gaps = 1/174 (0%)
Query: 7 ELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVA 66
+L KLL+P+V +LPL PPSA++ NF YFA DF+KP HDQYVYRHANGLCVIGLAP+HVA
Sbjct: 29 DLQKLLVPDVQNLPLIPPSAVETNFATYFALDFMKPAHDQYVYRHANGLCVIGLAPSHVA 88
Query: 67 FKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKG 126
FKDEGGITAVDFNVGKSDRSG KVTGKRKK NAQHFESNTA+CK+ T NDSYIVRCCVKG
Sbjct: 89 FKDEGGITAVDFNVGKSDRSGMKVTGKRKK-NAQHFESNTALCKISTKNDSYIVRCCVKG 147
Query: 127 SLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKRE 180
SLLEVN +LIKQP LLN SA REGYIAIIMPKPADWLK+K+SL+ L++YK+ RE
Sbjct: 148 SLLEVNQQLIKQPELLNVSAYREGYIAIIMPKPADWLKVKASLVSLQEYKKLRE 201
>gi|356517744|ref|XP_003527546.1| PREDICTED: protein FAM206A-like isoform 2 [Glycine max]
Length = 217
Score = 295 bits (754), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 140/187 (74%), Positives = 157/187 (83%), Gaps = 13/187 (6%)
Query: 7 ELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVA 66
+L KLL+P+V +LPL PPSA++ NF YFA DF+KP HDQYVYRHANGLCVIGLAP+HVA
Sbjct: 29 DLQKLLVPDVQNLPLIPPSAVETNFATYFALDFMKPAHDQYVYRHANGLCVIGLAPSHVA 88
Query: 67 FKDEGGITAVDFNVGKSDRSGFKVTGKRKK-------------QNAQHFESNTAVCKVCT 113
FKDEGGITAVDFNVGKSDRSG KVTGKRKK QNAQHFESNTA+CK+ T
Sbjct: 89 FKDEGGITAVDFNVGKSDRSGMKVTGKRKKLLLLLLLIGCVFVQNAQHFESNTALCKIST 148
Query: 114 NNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLE 173
NDSYIVRCCVKGSLLEVN +LIKQP LLN SA REGYIAIIMPKPADWLK+K+SL+ L+
Sbjct: 149 KNDSYIVRCCVKGSLLEVNQQLIKQPELLNVSAYREGYIAIIMPKPADWLKVKASLVSLQ 208
Query: 174 DYKRKRE 180
+YK+ RE
Sbjct: 209 EYKKLRE 215
>gi|357467055|ref|XP_003603812.1| hypothetical protein MTR_3g114470 [Medicago truncatula]
gi|355492860|gb|AES74063.1| hypothetical protein MTR_3g114470 [Medicago truncatula]
Length = 211
Score = 291 bits (746), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 135/177 (76%), Positives = 156/177 (88%), Gaps = 1/177 (0%)
Query: 4 DEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPT 63
+E++L KLLLP+V + PL PPSAI+ NFV YFA DF KP HDQY+YRHANGLCVIGLAP+
Sbjct: 17 EEDDLEKLLLPDVENFPLIPPSAIETNFVTYFALDFTKPAHDQYIYRHANGLCVIGLAPS 76
Query: 64 HVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCC 123
H+AFKDEGGIT+VDFNVGKSDR+ KV+GKRKK NAQHFESNTA+CKV T NDSYIVRCC
Sbjct: 77 HIAFKDEGGITSVDFNVGKSDRTAVKVSGKRKK-NAQHFESNTALCKVSTKNDSYIVRCC 135
Query: 124 VKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKRE 180
VKGSLLEVN +LIK P LLN SADREGYIAI+MPKPADWLK+K+S + L++YK+ RE
Sbjct: 136 VKGSLLEVNQQLIKHPELLNGSADREGYIAIMMPKPADWLKVKASFVSLQEYKKVRE 192
>gi|312282739|dbj|BAJ34235.1| unnamed protein product [Thellungiella halophila]
Length = 230
Score = 287 bits (734), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 136/178 (76%), Positives = 151/178 (84%), Gaps = 1/178 (0%)
Query: 3 DDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAP 62
+DE+EL LLL ++ DLPL+PPSA NFV+YF DF K GHDQY+YRHANGLCVIGLAP
Sbjct: 51 EDEDELRNLLLSDIGDLPLSPPSATQVNFVSYFITDFTKSGHDQYIYRHANGLCVIGLAP 110
Query: 63 THVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRC 122
TH+AFKDEGGIT +DFNVGKSDRS KV+GKRKK NA ESNTA+CKV T NDSYIVRC
Sbjct: 111 THIAFKDEGGITNIDFNVGKSDRSVLKVSGKRKK-NAMRSESNTALCKVSTANDSYIVRC 169
Query: 123 CVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKRE 180
CVKGSLLEVN RLIKQP LLNSSADREGYIAIIMP+PADW K K SL+ LE+YK K+E
Sbjct: 170 CVKGSLLEVNERLIKQPQLLNSSADREGYIAIIMPRPADWTKNKESLITLEEYKEKKE 227
>gi|449437184|ref|XP_004136372.1| PREDICTED: protein FAM206A-like [Cucumis sativus]
gi|449511231|ref|XP_004163899.1| PREDICTED: protein FAM206A-like [Cucumis sativus]
Length = 223
Score = 286 bits (732), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 138/182 (75%), Positives = 156/182 (85%), Gaps = 3/182 (1%)
Query: 1 MQDDE--EELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVI 58
Q+DE EEL KLL P+V DLPL PPSA++ NFV+YFAPDF+K GHDQYVYRHANGLCV+
Sbjct: 41 FQEDEQDEELRKLLTPDVRDLPLVPPSAVETNFVSYFAPDFIKEGHDQYVYRHANGLCVV 100
Query: 59 GLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSY 118
GLA TH+AFKD GG+TAVDFNVGKSDRS FKVTGKRKK NAQ FE N+A+CKV T + SY
Sbjct: 101 GLAATHIAFKDSGGVTAVDFNVGKSDRSSFKVTGKRKK-NAQFFEPNSALCKVYTKDASY 159
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
IVRC VKGSLLEVN+RLIKQP LLN+ REGYIAI+MPKPADWLK+K SLL LE++KR
Sbjct: 160 IVRCSVKGSLLEVNDRLIKQPELLNTLPHREGYIAIVMPKPADWLKVKESLLSLEEHKRL 219
Query: 179 RE 180
RE
Sbjct: 220 RE 221
>gi|297842337|ref|XP_002889050.1| hypothetical protein ARALYDRAFT_476748 [Arabidopsis lyrata subsp.
lyrata]
gi|297334891|gb|EFH65309.1| hypothetical protein ARALYDRAFT_476748 [Arabidopsis lyrata subsp.
lyrata]
Length = 224
Score = 283 bits (725), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 134/179 (74%), Positives = 153/179 (85%), Gaps = 1/179 (0%)
Query: 2 QDDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLA 61
+D+E+EL KLLL ++ +LP++PPSA NFV+YF DF K GHDQY+YRHANGLCVIGLA
Sbjct: 44 EDEEDELRKLLLSDIGELPISPPSATQVNFVSYFITDFTKSGHDQYIYRHANGLCVIGLA 103
Query: 62 PTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVR 121
PTH+AFKDEGGIT VDFNVGKSDRS KV+GKRKK NA ESNTA+CKV T DSYIVR
Sbjct: 104 PTHIAFKDEGGITNVDFNVGKSDRSVLKVSGKRKK-NALRSESNTALCKVSTAKDSYIVR 162
Query: 122 CCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKRE 180
CCVKGSLLEVN RLI+QP LLNSSADREGYIAIIMP+PADW K K SL+ LE+YK+K+E
Sbjct: 163 CCVKGSLLEVNERLIRQPELLNSSADREGYIAIIMPRPADWTKNKESLITLEEYKQKKE 221
>gi|15222925|ref|NP_177725.1| biotinyl-lipoyl domain-containing protein [Arabidopsis thaliana]
gi|117168225|gb|ABK32195.1| At1g75980 [Arabidopsis thaliana]
gi|332197661|gb|AEE35782.1| biotinyl-lipoyl domain-containing protein [Arabidopsis thaliana]
Length = 225
Score = 282 bits (721), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 135/181 (74%), Positives = 153/181 (84%), Gaps = 3/181 (1%)
Query: 2 QDDEEE--LHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIG 59
Q+DEEE L KLLL ++ +LP++PPSA NFV+YF DF K GHDQY+YRHANGLCVIG
Sbjct: 43 QEDEEEDELRKLLLSDIGELPISPPSATQVNFVSYFITDFTKSGHDQYIYRHANGLCVIG 102
Query: 60 LAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYI 119
LAPTH+AFKDEGGIT+VDFNVGKSDRS KV+GKRKK NA ESNTA+CKV T D+YI
Sbjct: 103 LAPTHIAFKDEGGITSVDFNVGKSDRSVLKVSGKRKK-NALRSESNTALCKVSTAKDTYI 161
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
VRCCVKGSLLEVN RLIKQP LLNS+ADREGYIAIIMP+PADW K K SL+ LE+YK K+
Sbjct: 162 VRCCVKGSLLEVNERLIKQPELLNSTADREGYIAIIMPRPADWTKNKESLITLEEYKEKK 221
Query: 180 E 180
E
Sbjct: 222 E 222
>gi|351727222|ref|NP_001238433.1| uncharacterized protein LOC100306155 [Glycine max]
gi|255627707|gb|ACU14198.1| unknown [Glycine max]
Length = 189
Score = 273 bits (697), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 127/160 (79%), Positives = 143/160 (89%), Gaps = 1/160 (0%)
Query: 7 ELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVA 66
+L KLL+P++ +LPLTP SA++ NF YFA DF+KP HDQYVYRHANGLCVIGLAP+H+A
Sbjct: 27 DLQKLLVPDLQNLPLTPASAVETNFAIYFALDFMKPAHDQYVYRHANGLCVIGLAPSHIA 86
Query: 67 FKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKG 126
F DEGGITAVDFNVGKSDRSG KVTGKRKK NAQHFESNTA+CK+ T ND+YIVRCCVKG
Sbjct: 87 FNDEGGITAVDFNVGKSDRSGIKVTGKRKK-NAQHFESNTALCKISTKNDTYIVRCCVKG 145
Query: 127 SLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIK 166
SLLE+N +LIKQP LLN SA REGYIAIIMPKPADWLK+K
Sbjct: 146 SLLEINQQLIKQPELLNVSAGREGYIAIIMPKPADWLKVK 185
>gi|222622066|gb|EEE56198.1| hypothetical protein OsJ_05157 [Oryza sativa Japonica Group]
Length = 201
Score = 266 bits (681), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 123/178 (69%), Positives = 145/178 (81%), Gaps = 1/178 (0%)
Query: 5 EEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTH 64
+EE L++P+ DLP PPSA++ NF YF DFL PGHDQYVYRH NGLCV+GLA H
Sbjct: 25 DEETRALVVPDAGDLPPFPPSAVEANFARYFVADFLNPGHDQYVYRHPNGLCVVGLASAH 84
Query: 65 VAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCV 124
+A K+EGGITAVDFNVGKSDRS KVTGKRK+ NAQH + N+A+CKVCT+++S++VRCCV
Sbjct: 85 IALKEEGGITAVDFNVGKSDRSEMKVTGKRKR-NAQHLQENSALCKVCTSSNSFVVRCCV 143
Query: 125 KGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREEC 182
KGSLLE+N+RLIKQP LLN+SADREGYIAI MPKPADWLKIK L +DYK R C
Sbjct: 144 KGSLLEINDRLIKQPDLLNTSADREGYIAIFMPKPADWLKIKDKFLSYDDYKNLRGTC 201
>gi|357138505|ref|XP_003570832.1| PREDICTED: protein FAM206A-like [Brachypodium distachyon]
Length = 201
Score = 262 bits (670), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 122/175 (69%), Positives = 141/175 (80%), Gaps = 1/175 (0%)
Query: 5 EEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTH 64
+EE L+ P+ LP P SA++ NF YFA D+L PGHDQYVYRH NGLCV+GLAP H
Sbjct: 25 DEETRALIAPDAGSLPEAPASAVEANFARYFAADYLNPGHDQYVYRHPNGLCVVGLAPGH 84
Query: 65 VAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCV 124
VA K+EGGITAVDFNVGK+DRS KVTGKRK+ NAQH + N+A+CKVCTN+ S+IVRCCV
Sbjct: 85 VALKEEGGITAVDFNVGKTDRSEIKVTGKRKR-NAQHLQENSALCKVCTNDKSFIVRCCV 143
Query: 125 KGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
KGSLLE+N+RLIKQP LLN+S+DREGYIAI MPKP DWLKIK L EDYK R
Sbjct: 144 KGSLLEINDRLIKQPNLLNTSSDREGYIAIFMPKPVDWLKIKDKFLSSEDYKSLR 198
>gi|413926805|gb|AFW66737.1| hypothetical protein ZEAMMB73_109561 [Zea mays]
Length = 201
Score = 256 bits (655), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 118/178 (66%), Positives = 140/178 (78%), Gaps = 1/178 (0%)
Query: 5 EEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTH 64
++E L++P+ LP +PPSA++ NF YF DFL PG+DQYVYRH NGLCV+GLAP H
Sbjct: 25 DDETRALVVPDAAALPASPPSAVEANFARYFVADFLNPGYDQYVYRHPNGLCVVGLAPAH 84
Query: 65 VAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCV 124
V FK+EG I+ VDFNVGKSDRS KVTGKRK+ NAQH + N+A+CKVC N+ S+IVRCCV
Sbjct: 85 VVFKEEGVISTVDFNVGKSDRSEMKVTGKRKR-NAQHLQENSALCKVCVNDKSFIVRCCV 143
Query: 125 KGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREEC 182
KGSLLE+N RLIKQP LLN++ADREGYIA+ PKPADWLKIK L EDYK R C
Sbjct: 144 KGSLLEINERLIKQPDLLNTAADREGYIAVFQPKPADWLKIKDKFLSYEDYKNLRGVC 201
>gi|294463220|gb|ADE77146.1| unknown [Picea sitchensis]
Length = 248
Score = 245 bits (625), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 116/171 (67%), Positives = 134/171 (78%), Gaps = 1/171 (0%)
Query: 12 LLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEG 71
L P VH+LPL PP AI+ N+V YF DFLKPGHDQY+YRHANGLCV+GLA THVA DE
Sbjct: 79 LCPKVHELPLRPPPAIESNYVQYFVIDFLKPGHDQYIYRHANGLCVVGLASTHVALGDEP 138
Query: 72 GITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEV 131
I +VDFNVGKS RS KV+GKRKK NA E+N+A+CKV +N+ YIVRCCVKGSLLEV
Sbjct: 139 QIKSVDFNVGKSSRSDMKVSGKRKK-NAHFLEANSALCKVISNDTFYIVRCCVKGSLLEV 197
Query: 132 NNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREEC 182
N RLIK+P LLN+S EGY+AI+MPKP DWLK KSS L E+YK+ R C
Sbjct: 198 NERLIKEPELLNTSGSSEGYVAIMMPKPDDWLKAKSSFLSREEYKKHRGLC 248
>gi|115443859|ref|NP_001045709.1| Os02g0120400 [Oryza sativa Japonica Group]
gi|113535240|dbj|BAF07623.1| Os02g0120400, partial [Oryza sativa Japonica Group]
Length = 144
Score = 226 bits (577), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 105/141 (74%), Positives = 121/141 (85%), Gaps = 1/141 (0%)
Query: 42 PGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQH 101
PGHDQYVYRH NGLCV+GLA H+A K+EGGITAVDFNVGKSDRS KVTGKRK+ NAQH
Sbjct: 5 PGHDQYVYRHPNGLCVVGLASAHIALKEEGGITAVDFNVGKSDRSEMKVTGKRKR-NAQH 63
Query: 102 FESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPAD 161
+ N+A+CKVCT+++S++VRCCVKGSLLE+N+RLIKQP LLN+SADREGYIAI MPKPAD
Sbjct: 64 LQENSALCKVCTSSNSFVVRCCVKGSLLEINDRLIKQPDLLNTSADREGYIAIFMPKPAD 123
Query: 162 WLKIKSSLLGLEDYKRKREEC 182
WLKIK L +DYK R C
Sbjct: 124 WLKIKDKFLSYDDYKNLRGTC 144
>gi|8778816|gb|AAF26769.2|AC007396_18 T4O12.20 [Arabidopsis thaliana]
Length = 217
Score = 226 bits (576), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 119/183 (65%), Positives = 137/183 (74%), Gaps = 15/183 (8%)
Query: 2 QDDEEE--LHKLLLPNVHDLPLTPPSA--IDFNFVAYFAPDFLKPGHDQYVYRHANGLCV 57
Q+DEEE L KLLL ++ +LP++PPSA I N V + + LCV
Sbjct: 43 QEDEEEDELRKLLLSDIGELPISPPSATQILLNRVMI----------NIFTVTLMGILCV 92
Query: 58 IGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDS 117
IGLAPTH+AFKDEGGIT+VDFNVGKSDRS KV+GKRKK NA ESNTA+CKV T D+
Sbjct: 93 IGLAPTHIAFKDEGGITSVDFNVGKSDRSVLKVSGKRKK-NALRSESNTALCKVSTAKDT 151
Query: 118 YIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
YIVRCCVKGSLLEVN RLIKQP LLNS+ADREGYIAIIMP+PADW K K SL+ LE+YK
Sbjct: 152 YIVRCCVKGSLLEVNERLIKQPELLNSTADREGYIAIIMPRPADWTKNKESLITLEEYKE 211
Query: 178 KRE 180
K+E
Sbjct: 212 KKE 214
>gi|125537853|gb|EAY84248.1| hypothetical protein OsI_05628 [Oryza sativa Indica Group]
Length = 179
Score = 214 bits (544), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 100/149 (67%), Positives = 123/149 (82%), Gaps = 2/149 (1%)
Query: 5 EEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTH 64
+EE L++P+ DLP PPSA++ NF YF DFL PGHDQYVYRH NGLCV+GLA H
Sbjct: 25 DEETRALVVPDAGDLPPFPPSAVEANFARYFVADFLNPGHDQYVYRHPNGLCVVGLASAH 84
Query: 65 VAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCV 124
+A K+EGGITAVDFNVGKSDRS KVTGKRK+ NAQH + N+A+CKVCT+++S++VRCCV
Sbjct: 85 IALKEEGGITAVDFNVGKSDRSEMKVTGKRKR-NAQHLQENSALCKVCTSSNSFVVRCCV 143
Query: 125 KGSLLEVNNRLIKQPGLLNSS-ADREGYI 152
KGSLLE+N+RLIKQP LLN+S +D G+I
Sbjct: 144 KGSLLEINDRLIKQPDLLNTSWSDMYGHI 172
>gi|242063832|ref|XP_002453205.1| hypothetical protein SORBIDRAFT_04g001640 [Sorghum bicolor]
gi|241933036|gb|EES06181.1| hypothetical protein SORBIDRAFT_04g001640 [Sorghum bicolor]
Length = 179
Score = 202 bits (514), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 102/178 (57%), Positives = 120/178 (67%), Gaps = 24/178 (13%)
Query: 5 EEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTH 64
++E L++P+ DL +PPSA++ NF YF AP H
Sbjct: 26 DDETRALVVPDAADLLPSPPSAVEANFARYFIA-----------------------APAH 62
Query: 65 VAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCV 124
V FK+EG ITAVDFNVGKSDRS KVTGKRK+ NAQH + N+A+CKVC N+ S+IVRCCV
Sbjct: 63 VVFKEEGAITAVDFNVGKSDRSEMKVTGKRKR-NAQHLQENSALCKVCVNDKSFIVRCCV 121
Query: 125 KGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREEC 182
KGSLLE+N RLIKQP LLN++ADREGYIAI PKPADWLKIK L EDYK R C
Sbjct: 122 KGSLLEINERLIKQPDLLNTAADREGYIAIFQPKPADWLKIKDKFLSYEDYKNLRGVC 179
>gi|224085932|ref|XP_002307744.1| predicted protein [Populus trichocarpa]
gi|222857193|gb|EEE94740.1| predicted protein [Populus trichocarpa]
Length = 128
Score = 194 bits (493), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 98/124 (79%), Positives = 107/124 (86%), Gaps = 2/124 (1%)
Query: 57 VIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNND 116
V+ LAPTHVAF D GGITAVDFNVGKSDRSG KVTGKRKK NAQHFES+TA CKVCT +D
Sbjct: 5 VVCLAPTHVAFTDAGGITAVDFNVGKSDRSGIKVTGKRKK-NAQHFESDTAPCKVCTQSD 63
Query: 117 SYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
SYIVRCCVKGSLLEVN+RLIK +REGYIAIIMPKPADWLK+K+SLLGLE+YK
Sbjct: 64 SYIVRCCVKGSLLEVNDRLIKHSKFAE-LFNREGYIAIIMPKPADWLKVKASLLGLEEYK 122
Query: 177 RKRE 180
+ RE
Sbjct: 123 KLRE 126
>gi|302753306|ref|XP_002960077.1| hypothetical protein SELMODRAFT_74174 [Selaginella moellendorffii]
gi|300171016|gb|EFJ37616.1| hypothetical protein SELMODRAFT_74174 [Selaginella moellendorffii]
Length = 185
Score = 183 bits (464), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 89/182 (48%), Positives = 119/182 (65%), Gaps = 4/182 (2%)
Query: 1 MQDDEEELHKLL--LPNVHDLPLTPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCV 57
M+D EEE +L P H+LP P A + NF YFA D K HDQY+YRH NGLCV
Sbjct: 1 MEDKEEENAAVLALYPRAHELPARAPPAHETNFTHYFALDVGGKEFHDQYLYRHGNGLCV 60
Query: 58 IGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDS 117
+GLAP+H+A + I VD++VGK R+ K +G+RKK +AQ E N+++CKV + +
Sbjct: 61 VGLAPSHLALRSPSKIKIVDYDVGKKSRADMKASGRRKK-DAQVLEENSSLCKVSSEDAE 119
Query: 118 YIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
++VRCC++G LLEVN RL P LL+ +GYIAI+MPKP W K + SLL E YK+
Sbjct: 120 FLVRCCIRGQLLEVNERLTSDPSLLSRKVPSQGYIAILMPKPEAWKKAEKSLLTREQYKQ 179
Query: 178 KR 179
+R
Sbjct: 180 QR 181
>gi|302804624|ref|XP_002984064.1| hypothetical protein SELMODRAFT_445756 [Selaginella moellendorffii]
gi|300148416|gb|EFJ15076.1| hypothetical protein SELMODRAFT_445756 [Selaginella moellendorffii]
Length = 184
Score = 181 bits (460), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 90/181 (49%), Positives = 118/181 (65%), Gaps = 3/181 (1%)
Query: 1 MQDDEEELHKL-LLPNVHDLPLTPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVI 58
M D EEE L L P H+LP P A + NF YFA D K HDQY+YRH NGLCV+
Sbjct: 1 MDDKEEENAVLALYPRAHELPPRAPPAHETNFTHYFALDVGGKEFHDQYLYRHGNGLCVV 60
Query: 59 GLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSY 118
GLAP+H+A + I VD++VGK R+ K +G+RKK +AQ E N+++CKV + + +
Sbjct: 61 GLAPSHLALRSPSKIKIVDYDVGKKSRADMKASGRRKK-DAQVLEENSSLCKVSSEDAEF 119
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+VRCC++G LLEVN RL P LL+ +GYIAI+MPKP W K + SLL E YK++
Sbjct: 120 LVRCCIRGQLLEVNERLTSDPSLLSRKPASQGYIAILMPKPEAWKKAEKSLLTREQYKQQ 179
Query: 179 R 179
R
Sbjct: 180 R 180
>gi|168045365|ref|XP_001775148.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673487|gb|EDQ60009.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 162
Score = 177 bits (450), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 86/163 (52%), Positives = 111/163 (68%), Gaps = 2/163 (1%)
Query: 18 DLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVD 77
+LPL PP AI+ NF + A D K HDQY+YRH NGLCVIGL PTH AFK +TAVD
Sbjct: 1 ELPLRPPRAIESNFTHWVALDVGKEFHDQYIYRHGNGLCVIGLLPTHAAFKPTPAVTAVD 60
Query: 78 FNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIV-RCCVKGSLLEVNNRLI 136
FNVGK +R+ KV+GKRK+ NA ++ +CKV Y++ RCCV+G +LEVN RLI
Sbjct: 61 FNVGKQNRADAKVSGKRKR-NAILLHPDSVLCKVMVGEVFYLISRCCVRGVMLEVNERLI 119
Query: 137 KQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
K+P LLN+ AD EG+I I+MP DW + + L ++ YK +R
Sbjct: 120 KEPTLLNTRADTEGHIGIMMPSSEDWSRACKTFLTMDQYKARR 162
>gi|340715238|ref|XP_003396124.1| PREDICTED: UPF0436 protein C9orf6 homolog [Bombus terrestris]
Length = 269
Score = 119 bits (297), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 90/160 (56%), Gaps = 3/160 (1%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ D F AY+ D P D + H+N +C++ LAP+H +D+ I VDF V K
Sbjct: 89 PTITDRYFTAYYKLDVQSPADDICILMHSNRICMLTLAPSHAILQDDKHIIKVDFKVSNK 148
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DR KV+GK K AQ + N+ +C + C+N +Y+++CC+ G L+EVN LI+ P L
Sbjct: 149 LDRVQNKVSGKSK-HGAQPLQMNSNICIITCSNKKTYVIKCCIIGKLVEVNEALIQNPKL 207
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
L R GY+AII+P K+K SLL E Y K E
Sbjct: 208 LLEPPHRAGYLAIILPNIKLLDKMKESLLTQEQYDLKMLE 247
>gi|350397591|ref|XP_003484925.1| PREDICTED: UPF0436 protein C9orf6 homolog [Bombus impatiens]
Length = 253
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 90/160 (56%), Gaps = 3/160 (1%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ D F AY+ D P D + H+N +C++ LAP+H +D+ I VDF V K
Sbjct: 91 PTITDRYFTAYYKLDVQSPADDICILMHSNRICMLTLAPSHAILQDDKHIIKVDFKVSNK 150
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DR KV+GK K AQ + N+ +C + C+N +Y+++CC+ G L+EVN LI+ P L
Sbjct: 151 LDRVQNKVSGK-SKHGAQPLQMNSNICIITCSNEKTYVIKCCIIGKLVEVNEALIQNPKL 209
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
L R GY+AI++P K+K SLL E Y K E
Sbjct: 210 LLEPPHRAGYLAIVLPNIKLLDKMKKSLLTQEQYDLKMLE 249
>gi|157138898|ref|XP_001647553.1| hypothetical protein AaeL_AAEL015669 [Aedes aegypti]
gi|108865363|gb|EAT32221.1| AAEL015669-PA [Aedes aegypti]
Length = 192
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 103/182 (56%), Gaps = 11/182 (6%)
Query: 1 MQDDEEELHKLLLPN-VHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYR---HANGLC 56
+Q++++ + ++P+ V D PS +D F Y+ +++PG + R H+N +C
Sbjct: 10 LQEEQQVICGEMVPSIVEDFVEDYPSTVDRFFTRYY---YVQPGKENEPLRVLFHSNRIC 66
Query: 57 VIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNND 116
+I LAP HVAF E GI AV F VG+ DRS +V+GK+K +T C +
Sbjct: 67 LISLAPEHVAF--EKGIQAVTFEVGRIDRSENQVSGKKKSGGMILQMDSTLALITCEDGS 124
Query: 117 SYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
Y VR CV+G L+EVN RL++ LL D GY+A++MPKP +K+ L+ E+Y
Sbjct: 125 VYKVRGCVQGKLVEVNKRLVENIDLLREEGD--GYVAVVMPKPEQCEAVKAKLMSQEEYD 182
Query: 177 RK 178
+K
Sbjct: 183 KK 184
>gi|157133377|ref|XP_001656228.1| hypothetical protein AaeL_AAEL002958 [Aedes aegypti]
gi|157133379|ref|XP_001656229.1| hypothetical protein AaeL_AAEL002958 [Aedes aegypti]
gi|108881565|gb|EAT45790.1| AAEL002958-PB [Aedes aegypti]
gi|108881566|gb|EAT45791.1| AAEL002958-PA [Aedes aegypti]
Length = 192
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 66/166 (39%), Positives = 93/166 (56%), Gaps = 10/166 (6%)
Query: 16 VHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYR---HANGLCVIGLAPTHVAFKDEGG 72
V D PS +D F Y+ +++PG + R H+N +C+IGLAP HVAF E G
Sbjct: 26 VADFVEDYPSTVDRFFTRYY---YVQPGKENEPLRVLFHSNRICLIGLAPEHVAF--EKG 80
Query: 73 ITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVN 132
I AV F VG+ DRS +V+GK+K +T C + Y VR CV+G L+EVN
Sbjct: 81 IQAVTFEVGRIDRSENQVSGKKKSGGMILQMDSTLALITCKDGSVYKVRGCVQGKLVEVN 140
Query: 133 NRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
RL++ LL D GY+A++MPKP +K+ L+ E+Y +K
Sbjct: 141 KRLVENIDLLREEGD--GYVAVVMPKPEQCEAVKAKLMSREEYDKK 184
>gi|322802821|gb|EFZ23020.1| hypothetical protein SINV_07402 [Solenopsis invicta]
Length = 291
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 90/154 (58%), Gaps = 3/154 (1%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ D F Y+ D G D ++ H+N +C++ LAP+HV +D I VDF + K
Sbjct: 128 PTVTDRYFTPYYKVDVQSLGDDMCIWIHSNRICMLSLAPSHVILRDNKDIKKVDFKISDK 187
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DRS KV+GK K AQ ++N+ +C + C+N +Y+++CC+ G L+EVN L++ P L
Sbjct: 188 LDRSLNKVSGK-GKHGAQPLQTNSNICTISCSNEHTYVIKCCIIGKLVEVNEALLEDPQL 246
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
L S + GY+AI++P +K SLL + Y
Sbjct: 247 LKESPHKGGYLAIVLPNLKLLESLKQSLLTHDQY 280
>gi|380020234|ref|XP_003693996.1| PREDICTED: protein FAM206A-like [Apis florea]
Length = 216
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 90/160 (56%), Gaps = 3/160 (1%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ D F AY+ + P D + H+N +C++ LAP+H + + IT V+F V K
Sbjct: 12 PTITDRYFTAYYKLNVQSPADDICILIHSNRICMLTLAPSHTILQGDKYITKVNFKVSNK 71
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DR KV+GK K AQ + N+ +C + C+N +YI++CC+ G L+EVN LI+ P L
Sbjct: 72 LDRVLNKVSGKSK-HGAQPLQMNSNICIITCSNEKTYIIKCCIIGKLVEVNEALIENPKL 130
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
L + GYIAII+P KIK SLL E Y K E
Sbjct: 131 LLEPPHKGGYIAIILPNIKLLDKIKESLLTQEQYNLKILE 170
>gi|428172610|gb|EKX41518.1| hypothetical protein GUITHDRAFT_112492 [Guillardia theta CCMP2712]
Length = 202
Score = 112 bits (281), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/159 (40%), Positives = 100/159 (62%), Gaps = 6/159 (3%)
Query: 26 AIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS- 83
A + ++ YFA + + G DQ+VY+H N LCV+GLAP+HVAF+D I AVDF+VG+S
Sbjct: 39 ARERDYTHYFATNVGGQEGADQFVYQHFNSLCVVGLAPSHVAFRDGRRIVAVDFDVGRSG 98
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCK-VCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLL 142
D++ K +GK KK + + ++ +C VC + + +R ++G LLE+N+ L P LL
Sbjct: 99 DKTEVKPSGK-KKNGSLFVKEDSRLCDVVCDDGSRFKIRAAIQGHLLEMNSNLQSNPDLL 157
Query: 143 NSSADREGYIAIIMPKPADWLKIKSS--LLGLEDYKRKR 179
N A EG+I+II P+ K++ S LL L+++ + R
Sbjct: 158 NLKASSEGFISIIQPRRELENKLRESCKLLSLDEFNKVR 196
>gi|307215249|gb|EFN90003.1| UPF0436 protein C9orf6-like protein [Harpegnathos saltator]
Length = 270
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 90/156 (57%), Gaps = 3/156 (1%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ + FVAY+ + PG D + H+N +C+ LAP+HV + + IT VDF + K
Sbjct: 98 PTVTERYFVAYYKINVQSPGDDICIRIHSNRICMFSLAPSHVILQGDRDITKVDFKINDK 157
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DR+ KV+GK K AQ + N+ +C + C++ Y+++CC+ G L+EVN L +P L
Sbjct: 158 LDRALNKVSGK-SKHGAQPLQVNSNICTIFCSDGQKYLIKCCMIGKLVEVNEMLSVKPQL 216
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
L + GY+AII+P +K SLL E+Y R
Sbjct: 217 LRELPHKGGYLAIILPNIKHLESLKQSLLTHEEYIR 252
>gi|118782767|ref|XP_312490.2| AGAP002455-PA [Anopheles gambiae str. PEST]
gi|116129723|gb|EAA08138.2| AGAP002455-PA [Anopheles gambiae str. PEST]
Length = 192
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 103/183 (56%), Gaps = 10/183 (5%)
Query: 2 QDDEEELHKLLLPNV-HDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGL 60
+ + +E+ + +P++ +D PS ++ + ++ K Q + H+N +C++GL
Sbjct: 15 EAETQEICGVKVPSILNDFIENQPSIVERFYTKLYSVPPNKTDEPQQILFHSNRICLVGL 74
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYI 119
A HVAF E GI ++ F VGK DRS KV+G RKK ++++ + V C + Y
Sbjct: 75 AKEHVAF--EKGIRSISFEVGKFDRSENKVSG-RKKSGGMILQADSTLALVTCVDGSVYK 131
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
VR CV+G L+EVN R++ +P L+ +S EGY+AI+MPK +K LL E Y R
Sbjct: 132 VRSCVQGKLVEVNERIVSRPDLMRTSG--EGYVAIVMPKIEHCDVLKEKLLSEEAY---R 186
Query: 180 EEC 182
E C
Sbjct: 187 EMC 189
>gi|307179432|gb|EFN67756.1| UPF0436 protein C9orf6-like protein [Camponotus floridanus]
Length = 277
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 89/154 (57%), Gaps = 3/154 (1%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ D F Y+ + P D + H+N +C++ LAP+H+ + + I ++F V K
Sbjct: 108 PTVTDRYFTPYYKVNVQLPEDDICIRIHSNRICMLSLAPSHIILQGDKDIKTINFKVSDK 167
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DRS KV+GK K AQ ++N+ +C + C++ +Y+++CC+ G L+EVN L++ P L
Sbjct: 168 LDRSLNKVSGK-GKHGAQPLQTNSNICSISCSDGQTYMIKCCMIGKLVEVNEMLLENPRL 226
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
L + + GY+AII+P +K SLL E Y
Sbjct: 227 LRNPPHKGGYLAIILPNLKLLENLKQSLLTHEQY 260
>gi|345481643|ref|XP_003424420.1| PREDICTED: UPF0436 protein C9orf6 homolog [Nasonia vitripennis]
Length = 273
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 86/156 (55%), Gaps = 3/156 (1%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ + Y+ D KPG D + H+N +C+I LAP+H ++ I + ++ V K
Sbjct: 85 PTVTERYLTPYYQLDAQKPGDDFCILIHSNRVCMITLAPSHPILQENKTIESCNYKVSEK 144
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DR+ KV+GK K AQ +SN+ +C + CT+ Y ++CC+ G L+EVN RL++ P L
Sbjct: 145 LDRTKNKVSGKAK-HGAQPLQSNSNLCSLTCTDGSIYAIKCCMIGKLIEVNERLVENPSL 203
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
L GYIAII+P +K LL + Y+
Sbjct: 204 LKKEPHFGGYIAIILPNLKHLDSLKEKLLTQQQYEE 239
>gi|348687789|gb|EGZ27603.1| hypothetical protein PHYSODRAFT_475208 [Phytophthora sojae]
Length = 311
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 84/140 (60%), Gaps = 7/140 (5%)
Query: 41 KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQ 100
KP DQYV+ H+N LCV+G+A +H ++E + +V+F D +VTGK+KK Q
Sbjct: 137 KPMEDQYVHMHSNRLCVVGVAESHPVMQEE--LVSVEFAQNVLDS---RVTGKKKK-GGQ 190
Query: 101 HFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKP 159
+ T +C + C + + + C++GSL+EVN RL K+P LL +GY+ II PK
Sbjct: 191 FMQPKTVLCVLKCKSGREFTLYSCIRGSLIEVNERLQKEPQLLKQKHQSDGYLVIIQPKK 250
Query: 160 ADWLKIKSSLLGLEDYKRKR 179
+ +I+ SLL ++YK+ R
Sbjct: 251 VEVAEIQESLLSKDEYKQFR 270
>gi|170055011|ref|XP_001863390.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167875134|gb|EDS38517.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 192
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 84/154 (54%), Gaps = 6/154 (3%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQ--YVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG 81
PS +D F Y+ + + V H+N +C+I LAP HVAF+ GI +V F VG
Sbjct: 35 PSTVDRFFTRYYYEHPVDKDRQEPHLVLFHSNRICLIQLAPEHVAFRL--GIKSVSFEVG 92
Query: 82 KSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
K DRS +V+GK+K +T C + + VR CV+G L+EVN R+++ GL
Sbjct: 93 KIDRSQNQVSGKKKSGGMIVQADSTLAMVTCNDESVFKVRGCVQGKLVEVNQRVVQDVGL 152
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
L D G+IA++MPKP IK L+ E++
Sbjct: 153 LGREGD--GFIAVVMPKPEQCEAIKGKLMTREEF 184
>gi|301093690|ref|XP_002997690.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109939|gb|EEY67991.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 262
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 92/154 (59%), Gaps = 12/154 (7%)
Query: 25 SAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDF--NVG 81
+ I+ + FA D + KP DQYV+ H+N LCV+G+A +H ++E + +V+F NV
Sbjct: 117 TMIERYYTEMFACDTMGKPMEDQYVHMHSNRLCVLGVAESHPVMQEE--LVSVEFTQNVL 174
Query: 82 KSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+S +VTGK+KK Q NT +C + C + + + C++G+L+EVN RL K+P
Sbjct: 175 QS-----RVTGKKKK-GGQFMLPNTVLCVLKCKSGHEFKLYSCMRGALIEVNERLQKEPE 228
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLED 174
LL +GY+AII PK + +I+ SLL ED
Sbjct: 229 LLKQKNQSDGYLAIIQPKKVEVSEIQQSLLSKED 262
>gi|332023984|gb|EGI64202.1| UPF0436 protein C9orf6-like protein [Acromyrmex echinatior]
Length = 273
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 80/136 (58%), Gaps = 3/136 (2%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-K 82
P+ D F Y+ D G+D ++ H+N +C++ LAP+H+ +D I V+F V K
Sbjct: 107 PTVTDRYFTPYYKMDAQSLGNDTCIWIHSNRICMLSLAPSHIILQDNKDIQKVNFKVSDK 166
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
DRS KV+GK K AQ ++N+ +C + C + +Y+++C + G L+EVN L + P L
Sbjct: 167 LDRSLNKVSGKS-KHGAQPLQTNSNICTILCKDGQTYVIKCYIVGKLVEVNEALSENPQL 225
Query: 142 LNSSADREGYIAIIMP 157
L + GY+AII+P
Sbjct: 226 LKKLPHKGGYLAIILP 241
>gi|148222985|ref|NP_001085336.1| protein FAM206A [Xenopus laevis]
gi|82184720|sp|Q6GR35.1|F206A_XENLA RecName: Full=Protein FAM206A
gi|49257208|gb|AAH71099.1| MGC81155 protein [Xenopus laevis]
Length = 183
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 86/159 (54%), Gaps = 4/159 (2%)
Query: 21 LTPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFN 79
L PS +D F ++ PD KP D + +H+N +C+I LA H ++E I ++ +
Sbjct: 8 LAFPSVVDRYFTRWYKPDIKGKPCEDHCILQHSNRICIITLAECHPLLQNEKTIKSISYQ 67
Query: 80 VGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKVC-TNNDSYIVRCCVKGSLLEVNNRLIK 137
+ + R KV+GK K+ AQ +C++ T+ + Y + C++G LLEVN +++
Sbjct: 68 ISANCSRLQNKVSGKSKR-GAQFLTELAPLCRISSTDGEEYTIYSCIRGRLLEVNENILQ 126
Query: 138 QPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
PGLL EGYIA+++PK + I LL +Y+
Sbjct: 127 NPGLLKEKPSTEGYIAVVLPKFEESKTITEGLLSQREYE 165
>gi|194745318|ref|XP_001955135.1| GF18619 [Drosophila ananassae]
gi|190628172|gb|EDV43696.1| GF18619 [Drosophila ananassae]
Length = 196
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 83/155 (53%), Gaps = 5/155 (3%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
PS +D F Y+ + K V H+N +C+I LAP H AF GI++V+F+VG
Sbjct: 38 PSVVDRFFTRYY---YFKGKVPYQVLYHSNRICLICLAPGHPAFTQ--GISSVNFDVGNV 92
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLN 143
DRS V GK K+ +T N ++Y V C++G L+EVN L+ QP LL
Sbjct: 93 DRSQNVVKGKAKRGGMILQAESTLALLTTENGETYKVPSCIRGKLVEVNTALVVQPKLLE 152
Query: 144 SSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+ GY AI++PK + IK+SLL E Y+ +
Sbjct: 153 QLPEGAGYFAILLPKIENCDAIKASLLTQEQYEER 187
>gi|124487245|ref|NP_001074889.1| protein FAM206A [Mus musculus]
gi|123797704|sp|Q80ZQ9.2|F206A_MOUSE RecName: Full=Protein FAM206A
gi|112180374|gb|AAH26590.1| BC026590 protein [Mus musculus]
gi|112180499|gb|AAH48548.2| BC026590 protein [Mus musculus]
gi|148670308|gb|EDL02255.1| mCG7387 [Mus musculus]
Length = 193
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 83/157 (52%), Gaps = 4/157 (2%)
Query: 23 PPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG 81
PPS +D F ++ D KP D + +H+N +CVI LA +H + I + + +
Sbjct: 24 PPSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKAIQRISYQIS 83
Query: 82 KS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQP 139
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ QP
Sbjct: 84 NNCSRLENKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHQP 142
Query: 140 GLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+++PK + + LL + Y+
Sbjct: 143 SLLQEKPSTEGYIAVVLPKFEESKSVTEGLLTQQQYE 179
>gi|432881590|ref|XP_004073855.1| PREDICTED: protein FAM206A-like [Oryzias latipes]
Length = 181
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 87/162 (53%), Gaps = 4/162 (2%)
Query: 18 DLPLTPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAV 76
D L PPS ID + ++ D +P D + +H+N +CV+ LA TH ++ I +
Sbjct: 8 DTELEPPSIIDRYYTRWYRADMKGQPCEDHCILQHSNRICVVTLAETHPVLQNGRTIKTI 67
Query: 77 DFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNR 134
++ + R KV+GK K+ Q +C++ CT+ Y + C++G LLEVN
Sbjct: 68 NYQISNGCSRLNNKVSGKSKR-GGQFLTDFAPLCRITCTDETKYTIYSCIRGRLLEVNEN 126
Query: 135 LIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
++++P LL EGYIA+I+PK + + +LL ++++
Sbjct: 127 ILERPNLLLEKPSTEGYIAVILPKFEESKSVTENLLSRKEFE 168
>gi|26342845|dbj|BAC35079.1| unnamed protein product [Mus musculus]
Length = 225
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 83/157 (52%), Gaps = 4/157 (2%)
Query: 23 PPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG 81
PPS +D F ++ D KP D + +H+N +CVI LA +H + I + + +
Sbjct: 56 PPSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKAIQRISYQIS 115
Query: 82 KS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQP 139
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ QP
Sbjct: 116 NNCSRLENKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHQP 174
Query: 140 GLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+++PK + + LL + Y+
Sbjct: 175 SLLQEKPSTEGYIAVVLPKFEESKSVTEGLLTQQQYE 211
>gi|126335097|ref|XP_001364006.1| PREDICTED: UPF0436 protein C9orf6-like [Monodelphis domestica]
Length = 188
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 87/156 (55%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KPG D + +H+N +CVI LA +H ++ I ++++ +
Sbjct: 20 PSLVDRYFTRWYKADVKGKPGEDHCILQHSNRICVITLAESHPLLQNGKTIKSINYQISA 79
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +C++ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 80 NCSRLQNKVSGKFKR-GAQFLTELAPLCRISCSDGEEYTVSSCVRGRLMEVNENILDKPS 138
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL ++Y+
Sbjct: 139 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKEYE 174
>gi|354477182|ref|XP_003500801.1| PREDICTED: UPF0436 protein C9orf6 homolog [Cricetulus griseus]
gi|344236388|gb|EGV92491.1| UPF0436 protein C9orf6-like [Cricetulus griseus]
Length = 192
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 25 PSLVDRYFTRWYKADVKGKPSEDHCILQHSNRICVITLAGSHPVLQSGKTIKSISYQIST 84
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ QP
Sbjct: 85 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIHCSDGEEYTISSCVRGRLMEVNENILHQPS 143
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+++PK + I LL + Y+
Sbjct: 144 LLQEKPSTEGYIAVVLPKFEESKSITEGLLTQQQYE 179
>gi|417408434|gb|JAA50768.1| Putative glycine cleavage system h protein, partial [Desmodus
rotundus]
Length = 183
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 84/158 (53%), Gaps = 4/158 (2%)
Query: 22 TPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV 80
T PS +D F ++ PD +P D + +H+N +CVI LA H + I ++ + +
Sbjct: 13 TVPSLVDRYFTRWYKPDVKGRPCEDHCILQHSNRICVITLAEAHPVLQSGKAIKSISYQI 72
Query: 81 GKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQ 138
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +
Sbjct: 73 STNCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHK 131
Query: 139 PGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
P +L EGYIA+++PK + I LL + Y+
Sbjct: 132 PSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 169
>gi|149638600|ref|XP_001513853.1| PREDICTED: UPF0436 protein C9orf6 homolog [Ornithorhynchus
anatinus]
Length = 179
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 85/156 (54%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV-G 81
PS +D F ++ D + G D + +H+N LCVI LA H ++ I +VD+ +
Sbjct: 10 PSLVDRYFTRWYRADVKGRLGEDHCILQHSNRLCVITLAEAHPLLQNGKTIKSVDYQISA 69
Query: 82 KSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
R KV+GK K+ AQ +C++ C++ + Y + C++G L+EVN ++K+P
Sbjct: 70 HCSRLQNKVSGKSKR-GAQFLTELAPLCRISCSDGEEYTIYSCIRGRLMEVNENILKRPS 128
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL ++Y+
Sbjct: 129 ILQEKPATEGYIAVVLPKFEESKTITEGLLSQKEYE 164
>gi|321460812|gb|EFX71850.1| hypothetical protein DAPPUDRAFT_201415 [Daphnia pulex]
Length = 171
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 85/144 (59%), Gaps = 9/144 (6%)
Query: 19 LPLTPPSAIDFNFVAYFAPDFLKPG--HDQYVYRHANGLCVIGLAPTHVAFKDEGGITAV 76
L +T PS ID YF +++ +D + H+N +C++ LA +H FK + I V
Sbjct: 11 LSVTYPSVID----RYFTRRYIQRDGDNDTCLLYHSNKICLVTLAKSHAIFKKDVAIKKV 66
Query: 77 DFNVG-KSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNR 134
+F VG K DR+ KV+GK KK Q+ + ++ +C V C ++ Y VR VKG L+EVN
Sbjct: 67 NFKVGEKLDRTNNKVSGK-KKHGGQYVDEHSILCLVECEDDSVYTVRAGVKGKLVEVNEH 125
Query: 135 LIKQPGLLNSSADREGYIAIIMPK 158
LI P LL + A+ +GYIA+++ K
Sbjct: 126 LIGDPNLLLTKAETDGYIAVVLQK 149
>gi|198412893|ref|XP_002124373.1| PREDICTED: similar to MGC81155 protein [Ciona intestinalis]
Length = 204
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 87/167 (52%), Gaps = 15/167 (8%)
Query: 24 PSAIDFNFVAYFAPDFLKPGH------------DQYVYRHANGLCVIGLAPTHVAFKDEG 71
PS I+ F + + K G+ D + RH+N +C+I LAPTH +++
Sbjct: 31 PSVIERYFTTGYKVNLPKGGNLDFFKSNPSNMNDVCILRHSNSICLITLAPTHPVIENKC 90
Query: 72 GITAVDFNVG-KSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLL 129
IT++ F V K DR V+GKRK+ AQ + +C + C + + V C ++GSL+
Sbjct: 91 RITSISFKVTDKLDRLQNSVSGKRKR-GAQWLNPSGPLCVISCDDGSKHTVYCGIRGSLV 149
Query: 130 EVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
EVN RLI P LL+ EGYIA++MPK + LL E +K
Sbjct: 150 EVNERLISSPQLLHDCYKTEGYIAVVMPKHLEAKATMDDLLTEEQFK 196
>gi|293347380|ref|XP_002726576.1| PREDICTED: protein FAM206A [Rattus norvegicus]
gi|293359267|ref|XP_002729524.1| PREDICTED: protein FAM206A [Rattus norvegicus]
gi|149037158|gb|EDL91689.1| rCG31985 [Rattus norvegicus]
Length = 193
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 25 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKTIKSISYQISN 84
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + C++G L+EVN ++ QP
Sbjct: 85 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCIRGRLMEVNENILHQPS 143
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+++PK + I LL + Y+
Sbjct: 144 LLQEKPSTEGYIAVVLPKFEESKSITEGLLTQQQYE 179
>gi|351703324|gb|EHB06243.1| hypothetical protein GW7_20448 [Heterocephalus glaber]
Length = 181
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 88/168 (52%), Gaps = 6/168 (3%)
Query: 19 LPLTP--PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITA 75
+P P PS +D F ++ D KP D + +H+N +CVI LA +H K I +
Sbjct: 6 VPAEPAVPSLVDRYFTRWYKADLKGKPCEDHCILQHSNRICVITLAESHPVLKSGKTIKS 65
Query: 76 VDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNN 133
+ + + + R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN
Sbjct: 66 ISYQISTNCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNE 124
Query: 134 RLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
++ +P +L EGYIA+++PK + I LL + Y+ E+
Sbjct: 125 NILHKPSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYEELMEK 172
>gi|242024185|ref|XP_002432510.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212517948|gb|EEB19772.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 187
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/144 (37%), Positives = 77/144 (53%), Gaps = 3/144 (2%)
Query: 34 YFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV-GKSDRSGFKVTG 92
YF + P V H+N +C+I LAPTH K + IT +DF V + DR KV+G
Sbjct: 31 YFLSNKGLPSEHHCVLMHSNKICLITLAPTHPLIKSQKKITKLDFQVTNEVDRLENKVSG 90
Query: 93 KRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGY 151
K KK QH ++ +C V C++ Y + C+KG LLE+NN L+ +P L+ G+
Sbjct: 91 KSKK-GGQHLMPDSLLCHVECSDGSKYSIWSCIKGKLLEINNLLLSKPDLMVQKPLSVGF 149
Query: 152 IAIIMPKPADWLKIKSSLLGLEDY 175
+AII+P D K L+ E Y
Sbjct: 150 LAIILPSVKDIEIYKKKLITCEKY 173
>gi|344272074|ref|XP_003407861.1| PREDICTED: UPF0436 protein C9orf6 homolog [Loxodonta africana]
Length = 181
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL E Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQEQYE 167
>gi|355734626|gb|AES11396.1| hypothetical protein [Mustela putorius furo]
Length = 181
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 84/158 (53%), Gaps = 4/158 (2%)
Query: 22 TPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV 80
T PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 11 TVPSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQI 70
Query: 81 GKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQ 138
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +
Sbjct: 71 STNCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHK 129
Query: 139 PGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
P +L EGYIA+++PK + I LL + Y+
Sbjct: 130 PSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|332222444|ref|XP_003260379.1| PREDICTED: protein FAM206A [Nomascus leucogenys]
Length = 181
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + YK
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYK 167
>gi|114051912|ref|NP_001039421.1| protein FAM206A [Bos taurus]
gi|122135337|sp|Q29S16.1|F206A_BOVIN RecName: Full=Protein FAM206A
gi|88758642|gb|AAI13224.1| Chromosome 9 open reading frame 6 ortholog [Bos taurus]
gi|296484387|tpg|DAA26502.1| TPA: hypothetical protein LOC507081 [Bos taurus]
Length = 187
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 88/166 (53%), Gaps = 6/166 (3%)
Query: 16 VHDLPLTP--PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGG 72
V DL P PS +D F ++ D KP D + +H+N +CVI LA +H +
Sbjct: 9 VADLAPEPAVPSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKT 68
Query: 73 ITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLE 130
I ++ + + + R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+E
Sbjct: 69 IKSISYQISTNCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLME 127
Query: 131 VNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
VN ++ +P +L EGYIA+++PK + I LL ++Y+
Sbjct: 128 VNENILHKPSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKEYE 173
>gi|440908038|gb|ELR58107.1| hypothetical protein M91_02692, partial [Bos grunniens mutus]
Length = 191
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 88/166 (53%), Gaps = 6/166 (3%)
Query: 16 VHDLPLTP--PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGG 72
V DL P PS +D F ++ D KP D + +H+N +CVI LA +H +
Sbjct: 13 VADLAPEPAVPSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKT 72
Query: 73 ITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLE 130
I ++ + + + R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+E
Sbjct: 73 IKSISYQISTNCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLME 131
Query: 131 VNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
VN ++ +P +L EGYIA+++PK + I LL ++Y+
Sbjct: 132 VNENILHKPSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKEYE 177
>gi|156362472|ref|XP_001625801.1| predicted protein [Nematostella vectensis]
gi|156212651|gb|EDO33701.1| predicted protein [Nematostella vectensis]
Length = 203
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 87/150 (58%), Gaps = 8/150 (5%)
Query: 34 YFAPDFL-----KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-KSDRSG 87
YF+ ++ + G DQ V H+N +CV+ ++P H K++ IT+++F + K +R
Sbjct: 22 YFSRGYMTDVNGRCGEDQCVLLHSNKICVVTVSPEHPLVKEKCAITSINFQISSKINRLD 81
Query: 88 FKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSA 146
V+GK KK AQ + A+C+V C+N +Y + C+KG L+E+N L+K P L+
Sbjct: 82 NHVSGKSKK-GAQWMMPDDALCEVTCSNGTTYTLNSCIKGKLVEINQELLKTPELIKLKP 140
Query: 147 DREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+ +G+IA+++PK + + LL E Y+
Sbjct: 141 ETDGFIAVVLPKLTEVNEYFDKLLSPESYR 170
>gi|395823997|ref|XP_003785260.1| PREDICTED: protein FAM206A [Otolemur garnettii]
Length = 181
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 84/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKTIKSISYQISN 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL ++Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKEYE 167
>gi|113678602|ref|NP_001038394.1| protein FAM206A [Danio rerio]
gi|123888302|sp|Q1LU93.1|F206A_DANRE RecName: Full=Protein FAM206A
gi|116487556|gb|AAI25853.1| Si:ch211-254e15.2 [Danio rerio]
Length = 177
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS ID F ++ D KP D + +H+N +CVI LA +H F++ I +++ +
Sbjct: 10 PSVIDRYFTRWYRTDLKGKPCEDHCILQHSNRICVITLAESHPIFQNGRKIKNINYQISD 69
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
R KV+GK K+ Q +C++ CT+ + + C++G LLEVN ++ +P
Sbjct: 70 GCSRLKNKVSGKSKR-GGQFLTEFAPLCRITCTDEQEFTIFSCIRGRLLEVNEVILNKPD 128
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+I+PK + + LL E Y+
Sbjct: 129 LLMEKPSTEGYIAVILPKFEESKSVTEGLLTREQYE 164
>gi|195500911|ref|XP_002097577.1| GE26300 [Drosophila yakuba]
gi|194183678|gb|EDW97289.1| GE26300 [Drosophila yakuba]
Length = 205
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 92/177 (51%), Gaps = 5/177 (2%)
Query: 2 QDDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLA 61
Q+++E KL+ P PS +D F Y+ + K V H+N +C+I LA
Sbjct: 16 QEEQEICGKLVTPITDGFQEDYPSVVDRFFTRYY---YFKGDVPYQVLYHSNRICLICLA 72
Query: 62 PTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVR 121
P H A GI++V+F++G DRS V GK KK +T N +SY V
Sbjct: 73 PEHPALGQ--GISSVNFDIGNVDRSQNVVKGKGKKGGMILQAESTLALVTTVNGESYKVP 130
Query: 122 CCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
C++G L+EVN L+++P LL + GY AI++PK + IK+SLL E Y+ +
Sbjct: 131 SCIRGKLVEVNTSLVEEPKLLEQLPEGAGYFAILLPKIENCDAIKASLLTQEQYEER 187
>gi|348556081|ref|XP_003463851.1| PREDICTED: UPF0436 protein C9orf6-like [Cavia porcellus]
Length = 181
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H K I ++ + +
Sbjct: 13 PSLVDRYFTRWYKTDVKGKPCEDHCILQHSNRICVITLAESHPVLKSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHNPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|149739785|ref|XP_001492507.1| PREDICTED: UPF0436 protein C9orf6 homolog [Equus caballus]
Length = 181
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D V +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKPCEDHCVLQHSNRVCVITLAESHPVLQSGKTIESISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|340368380|ref|XP_003382730.1| PREDICTED: UPF0436 protein C9orf6 homolog [Amphimedon
queenslandica]
Length = 177
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 90/162 (55%), Gaps = 6/162 (3%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGG--ITAVDFNV 80
PS +D + ++A D + D + +H+N + ++ LA TH KD+ + ++F +
Sbjct: 15 PSVVDRYYTKFYAIDVKGRENEDHCILQHSNRVMIVCLAETHPLIKDQSSNEVREIEFQI 74
Query: 81 GKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCT-NNDSYIVRCCVKGSLLEVNNRLIKQP 139
GK DRS V+G+ KK Q + + + V T +N Y++ C+KG L+E+N++L+K+P
Sbjct: 75 GKVDRSDNPVSGRFKK-GGQKLTATSPIAIVHTKDNKKYVLYSCLKGMLVEINDQLLKEP 133
Query: 140 GLLNSSADREGYIAIIMP-KPADWLKIKSSLLGLEDYKRKRE 180
LL GYIAI++P K + + SLL E YK R+
Sbjct: 134 QLLQIEPQGRGYIAILLPNKLNEASTVTESLLSEESYKTLRQ 175
>gi|348513105|ref|XP_003444083.1| PREDICTED: UPF0436 protein C9orf6 homolog [Oreochromis niloticus]
Length = 182
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 84/163 (51%), Gaps = 4/163 (2%)
Query: 17 HDLPLTPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITA 75
D PS ID + ++ D KP D + +H+N +CVI LA TH ++ I
Sbjct: 7 EDTKANAPSVIDRYYTRWYRADMKGKPCEDHCILQHSNRICVITLAETHPILQNGRTIKC 66
Query: 76 VDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNN 133
+++ + R KV+GK K+ Q +C++ CT+ Y + C++G LLEVN
Sbjct: 67 INYQISNGCSRLKNKVSGKSKR-GGQFLTDFAPLCRITCTDETEYTIYSCIRGRLLEVNE 125
Query: 134 RLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+++ P LL EGYIA+I+PK + I +LL ++++
Sbjct: 126 NILETPNLLLEKPSTEGYIAVILPKFEESKSITENLLNRDEFE 168
>gi|73971936|ref|XP_532026.2| PREDICTED: protein FAM206A-like [Canis lupus familiaris]
Length = 181
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|426219738|ref|XP_004004075.1| PREDICTED: protein FAM206A [Ovis aries]
Length = 187
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 84/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 19 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKTIKSISYQIST 78
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 79 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHKPS 137
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + + LL ++Y+
Sbjct: 138 ILQEKPSTEGYIAVVLPKFEESKSVTEGLLTQKEYE 173
>gi|291382835|ref|XP_002708164.1| PREDICTED: CG9288-like [Oryctolagus cuniculus]
Length = 181
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKTDVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTEFAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHEPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|431918437|gb|ELK17661.1| hypothetical protein PAL_GLEAN10009403 [Pteropus alecto]
Length = 181
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHEPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|410978819|ref|XP_003995785.1| PREDICTED: protein FAM206A [Felis catus]
Length = 181
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITDGLLTQKQYE 167
>gi|443732668|gb|ELU17295.1| hypothetical protein CAPTEDRAFT_149184 [Capitella teleta]
Length = 215
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 88/163 (53%), Gaps = 4/163 (2%)
Query: 16 VHDLPLTPPSAIDFNFVAYFAPDFLK-PGHDQYVYRHANGLCVIGLAPTHVAFKDEGGIT 74
+ DL P S +D F + + K PG DQ V H+N LCVI +A +H + ++
Sbjct: 13 IADLTGVPESFVDRYFTRRYKLNVKKIPGEDQCVLVHSNRLCVITIADSHPIRSMDKVVS 72
Query: 75 AVDFNV-GKSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVN 132
+V+F + GK DR+ +GK K+ AQ + + +C V CT+ Y + V+G L+EVN
Sbjct: 73 SVNFKIDGKIDRAKNAASGKHKR-GAQFLQEGSPLCHVTCTDGSRYTLYAVVRGCLVEVN 131
Query: 133 NRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
+ L +P LL+ + EGYIAI+M D +LL EDY
Sbjct: 132 DCLNARPELLSLMPETEGYIAIVMTSLKDHKTQVDALLSHEDY 174
>gi|426362625|ref|XP_004048458.1| PREDICTED: protein FAM206A [Gorilla gorilla gorilla]
Length = 181
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|8923431|ref|NP_060302.1| protein FAM206A [Homo sapiens]
gi|197098938|ref|NP_001127141.1| protein FAM206A [Pongo abelii]
gi|74753037|sp|Q9NX38.1|F206A_HUMAN RecName: Full=Protein FAM206A
gi|75042690|sp|Q5RFS0.1|F206A_PONAB RecName: Full=Protein FAM206A
gi|12002684|gb|AAG43367.1|AF153417_1 chromosome 9 open reading frame 6 [Homo sapiens]
gi|7020571|dbj|BAA91183.1| unnamed protein product [Homo sapiens]
gi|16041813|gb|AAH15795.1| Chromosome 9 open reading frame 6 [Homo sapiens]
gi|55725039|emb|CAH89387.1| hypothetical protein [Pongo abelii]
gi|119579437|gb|EAW59033.1| chromosome 9 open reading frame 6, isoform CRA_b [Homo sapiens]
gi|312152256|gb|ADQ32640.1| chromosome 9 open reading frame 6 [synthetic construct]
Length = 181
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|397479257|ref|XP_003810943.1| PREDICTED: protein FAM206A [Pan paniscus]
Length = 181
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|66812542|ref|XP_640450.1| hypothetical protein DDB_G0281837 [Dictyostelium discoideum AX4]
gi|74855284|sp|Q54TD9.1|F206_DICDI RecName: Full=FAM206 family protein
gi|60468474|gb|EAL66478.1| hypothetical protein DDB_G0281837 [Dictyostelium discoideum AX4]
Length = 175
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 76/138 (55%), Gaps = 11/138 (7%)
Query: 45 DQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDF--NVGKSDRSGFKVTGKRKKQNAQHF 102
DQYV +H N L VIG+AP+H + E I+ ++F N ++ SG + TG K Q
Sbjct: 27 DQYVNQHTNELAVIGVAPSHPVLQQE--ISKIEFKENAINNEVSGVRKTGGFKLQ----- 79
Query: 103 ESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPAD 161
SNT +C + C+N Y +R C+KG LLE+N LI P LL ++ G++AI+ P D
Sbjct: 80 -SNTILCVITCSNGKEYFLRSCIKGKLLEINKELINNPSLLKTNHATSGFLAIVEPMIKD 138
Query: 162 WLKIKSSLLGLEDYKRKR 179
L+ E+Y + R
Sbjct: 139 KFMENPGLIKFEEYHKLR 156
>gi|386780722|ref|NP_001247772.1| uncharacterized protein LOC710486 [Macaca mulatta]
gi|384947872|gb|AFI37541.1| hypothetical protein LOC54942 [Macaca mulatta]
Length = 180
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|410910888|ref|XP_003968922.1| PREDICTED: protein FAM206A-like [Takifugu rubripes]
Length = 181
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 86/157 (54%), Gaps = 4/157 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D + ++ D KP D + +H+N +CV+ LA +H ++ I ++++ +
Sbjct: 14 PSVVDRYYTRWYRADMKGKPCEDHCILQHSNRICVVTLAESHPILQNGRQIKSINYQISN 73
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
R KV+GK K+ Q +C++ CT+ + Y + C++G LLEVN +++ P
Sbjct: 74 GCSRLNNKVSGKSKR-GGQFLTDFAPLCRITCTDENEYTIYSCIRGRLLEVNEAILETPS 132
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
LL +GYIA+I+PK + I +LL E++++
Sbjct: 133 LLLQKPSTDGYIAVILPKFEESKSITENLLSREEFEK 169
>gi|444723959|gb|ELW64583.1| hypothetical protein TREES_T100001553, partial [Tupaia chinensis]
Length = 170
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 2 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAESHPVLQSGKIIKSISYQIST 61
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 62 NCSRLQNKVSGKFKR-GAQFLTELAPLCKISCSDGEEYTISSCVRGRLMEVNENILHKPS 120
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 121 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 156
>gi|213510992|ref|NP_001134602.1| F54F2.7 [Salmo salar]
gi|209734598|gb|ACI68168.1| F54F2.7 [Salmo salar]
Length = 181
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 85/156 (54%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS ID + ++ D K DQ V +H+N +CVI LA +H ++ I ++++ +
Sbjct: 10 PSVIDRYYTRWYKTDIKGKDCEDQCVLQHSNRICVITLAESHPILQNGRSIKSINYQISA 69
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
R +V+GK K+ Q +C++ CT+ Y + C++G LLEVN ++++P
Sbjct: 70 GCSRLHNRVSGKSKR-GGQFLTEFAPLCRITCTDGVEYTIYSCIRGRLLEVNEDILERPE 128
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+I+PK + + + LL E+Y+
Sbjct: 129 LLLGKPSTEGYIAVILPKIEESKTVTNGLLSREEYE 164
>gi|353681769|ref|NP_001238841.1| uncharacterized protein LOC464643 [Pan troglodytes]
gi|410214646|gb|JAA04542.1| chromosome 9 open reading frame 6 [Pan troglodytes]
gi|410267872|gb|JAA21902.1| chromosome 9 open reading frame 6 [Pan troglodytes]
gi|410287550|gb|JAA22375.1| chromosome 9 open reading frame 6 [Pan troglodytes]
gi|410329595|gb|JAA33744.1| chromosome 9 open reading frame 6 [Pan troglodytes]
Length = 181
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + + LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSVTEGLLTQKQYE 167
>gi|194033862|ref|XP_001924231.1| PREDICTED: UPF0436 protein C9orf6 homolog [Sus scrofa]
Length = 185
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +CVI LA +H + I ++ + +
Sbjct: 17 PSLVDRYFTRWYKADVKGKPCEDHCILQHSNRICVITLAGSHPVLQSGKTIESISYQIST 76
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y + CV+G L+EVN ++ +P
Sbjct: 77 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTISSCVRGRLMEVNEDILHKPS 135
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + + LL + Y+
Sbjct: 136 ILQEKPSTEGYIAVVLPKFEESKSVTEGLLTQQQYE 171
>gi|402896732|ref|XP_003911443.1| PREDICTED: protein FAM206A [Papio anubis]
Length = 180
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKLIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|390352351|ref|XP_003727880.1| PREDICTED: protein FAM206A-like [Strongylocentrotus purpuratus]
Length = 205
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 85/155 (54%), Gaps = 4/155 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV-G 81
PS ++ F A + D K D + +H+N +CVI +A H +D+ + VDF V
Sbjct: 21 PSFLERYFTAKYQIDTNGKTAEDFCILQHSNKICVITVAGGHPIVRDKMKVEKVDFQVTA 80
Query: 82 KSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
++R KV GK KK AQ+ +S + +CKV C+ SY + C+KG L+EVN L P
Sbjct: 81 NTNRMDNKVIGKNKK-GAQNLQSESILCKVHCSGGKSYTIYSCIKGKLVEVNEILFSHPQ 139
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
LL + EGY+AII+ + +D+ L+ ++Y
Sbjct: 140 LLVEKPNTEGYVAIILVRLSDYDSQFQRLISEDEY 174
>gi|195453010|ref|XP_002073598.1| GK14200 [Drosophila willistoni]
gi|194169683|gb|EDW84584.1| GK14200 [Drosophila willistoni]
Length = 208
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 88/155 (56%), Gaps = 7/155 (4%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
P+ +D F Y+ ++K D V H+N +C+I LAP+H A+ GIT V+FNVG
Sbjct: 45 PNVVDRFFTRYY---YIKGNADYQVLYHSNRVCLICLAPSHPAYAQ--GITQVNFNVGNV 99
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKVCTNN-DSYIVRCCVKGSLLEVNNRLIKQPGLL 142
DRS V + K+ ++++ + + T +++ + C++G L+EVN LI +P LL
Sbjct: 100 DRSQ-NVVKGKGKKGGMILQADSTLALLTTKTGETFKIPSCIRGKLVEVNTDLIAEPKLL 158
Query: 143 NSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
++ GY AI++PK + IK+SLL + Y+
Sbjct: 159 EQLPEKAGYFAILLPKIENCEDIKASLLNQQQYEE 193
>gi|196015938|ref|XP_002117824.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190579575|gb|EDV19667.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 174
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 75/135 (55%), Gaps = 8/135 (5%)
Query: 34 YFAPDFL-----KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV-GKSDRSG 87
YF P + K D + +H+N +C++ +AP+H F+ IT VDF V K +R
Sbjct: 11 YFTPLYQLNVKGKALEDHLILQHSNRICIVAVAPSHAMFQCRRKITNVDFQVTSKLNRLD 70
Query: 88 FKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSA 146
KV+GKRK+ AQ + +C + C + Y + CVKG L+E+N LI+ P LL
Sbjct: 71 NKVSGKRKR-GAQMVNETSPLCHITCDDGSKYTISSCVKGKLIEINELLIENPDLLAEKP 129
Query: 147 DREGYIAIIMPKPAD 161
EG+IA+I+P+ +
Sbjct: 130 CTEGFIAVILPRKVN 144
>gi|123893019|sp|Q28G67.1|F206A_XENTR RecName: Full=Protein FAM206A
gi|89266816|emb|CAJ82750.1| novel protein [Xenopus (Silurana) tropicalis]
Length = 183
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 83/156 (53%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +C+I LA H ++ I + + +
Sbjct: 12 PSVVDRYFKRWYKSDVKGKPCEDHCILQHSNRICIITLAECHPLLQNGKTIKTISYQISA 71
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKVC-TNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ + +C++ T+ + Y + C++G LLEVN +++ P
Sbjct: 72 NCSRLQNKVSGKSKR-GAQFLTEHAPLCRISSTDGEEYTIYSCIRGRLLEVNENILQNPE 130
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+++PK + + LL ++Y+
Sbjct: 131 LLKEKPSTEGYIAVVLPKFEESKTVTEGLLSQQEYE 166
>gi|195399846|ref|XP_002058530.1| GJ14482 [Drosophila virilis]
gi|194142090|gb|EDW58498.1| GJ14482 [Drosophila virilis]
Length = 195
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 87/156 (55%), Gaps = 7/156 (4%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
PS +D F Y+ ++K G + H+N +C++ LAPTH A+ + GI V +NVG
Sbjct: 40 PSMVDRFFTRYY---YIKAGAAYQLLYHSNRICLVCLAPTHPAYGE--GIAKVSYNVGNV 94
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKVCTN-NDSYIVRCCVKGSLLEVNNRLIKQPGLL 142
DRS V GK KK +++T + + T+ ++Y + C++G L+EVN L+ L
Sbjct: 95 DRSQNVVKGKAKK-GGMILQADTTLAVLTTDAGNTYKIPSCIRGKLVEVNTALLADAKQL 153
Query: 143 NSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+ + GY AI++PK + IK+SLL + Y+ +
Sbjct: 154 QLAPEGAGYFAILLPKIENCDDIKASLLTQQQYEEQ 189
>gi|403266236|ref|XP_003925299.1| PREDICTED: protein FAM206A [Saimiri boliviensis boliviensis]
Length = 181
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKSCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|296190523|ref|XP_002743226.1| PREDICTED: protein FAM206A [Callithrix jacchus]
Length = 181
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKADVKGKSCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|327270193|ref|XP_003219874.1| PREDICTED: UPF0436 protein C9orf6-like [Anolis carolinensis]
Length = 177
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 82/158 (51%), Gaps = 4/158 (2%)
Query: 22 TPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV 80
T P + F ++ PD D V +H+N +CVI +A H + I ++ + +
Sbjct: 3 TVPGVTERYFTRWYKPDVKGNLCEDHCVLQHSNRICVITIADAHPVLQKGKQIESISYQI 62
Query: 81 GKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQ 138
+ R KV+GK K+ AQ +C++ CT+ D Y + C++G L+EVN +++
Sbjct: 63 SANCSRLQNKVSGKSKR-GAQFLTELAPLCRIFCTDGDEYTIYSCIRGKLIEVNENILEN 121
Query: 139 PGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
P LL EGYIA+++PK + +I LL ++Y+
Sbjct: 122 PSLLQEKPSTEGYIAVVLPKFEESKRITEGLLTRKEYE 159
>gi|346472829|gb|AEO36259.1| hypothetical protein [Amblyomma maculatum]
Length = 210
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 87/165 (52%), Gaps = 20/165 (12%)
Query: 34 YFAPDFL-----KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-KSDRSG 87
Y+ P FL + GHDQ H+N + ++ LAP+H K++ IT VD+ V K +R
Sbjct: 33 YYKPRFLLDANGQKGHDQCALFHSNRIVLVTLAPSHPILKEKKTITKVDYQVNEKLNRLH 92
Query: 88 FKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSA 146
+V+GK KK Q + +A+C + C++ +Y V V GSL+EVN L P LL
Sbjct: 93 NQVSGKWKK-GGQKLQKESALCIITCSDGQTYTVVSTVPGSLIEVNEHLTTSPELLTEKP 151
Query: 147 DREGYIAIIMPKPADWLKIKSSLLGL----EDYKRKRE----ECF 183
GYIAI++P LK+ + LL E Y + R+ EC+
Sbjct: 152 WSSGYIAIVLPP----LKLDTMLLDTMTSPEAYVKARKDVIPECY 192
>gi|21358197|ref|NP_650316.1| CG9288 [Drosophila melanogaster]
gi|74947893|sp|Q9VFR1.1|F206_DROME RecName: Full=FAM206 family protein CG9288
gi|7299810|gb|AAF54989.1| CG9288 [Drosophila melanogaster]
gi|17862500|gb|AAL39727.1| LD32260p [Drosophila melanogaster]
gi|220944272|gb|ACL84679.1| CG9288-PA [synthetic construct]
gi|220954118|gb|ACL89602.1| CG9288-PA [synthetic construct]
Length = 214
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 90/176 (51%), Gaps = 5/176 (2%)
Query: 3 DDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAP 62
+++E KL+ P PS +D F Y+ + K V H+N +C+I LAP
Sbjct: 17 EEQEICGKLVAPITDSFQEDYPSVVDRFFTRYY---YFKGDVPYQVLYHSNRICLICLAP 73
Query: 63 THVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRC 122
H A GI++V+F++G DRS V GK KK +T N +Y V
Sbjct: 74 EHPALAQ--GISSVNFDIGNVDRSQNVVKGKGKKGGMILQAESTLALLTTANGGTYKVPS 131
Query: 123 CVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
C++G L+EVN ++++P LL + GY AI++PK + IK+SLL E Y+ +
Sbjct: 132 CIRGKLVEVNTAIVEEPKLLEQLPEGAGYFAILLPKIENCDAIKASLLTQEQYEER 187
>gi|194901352|ref|XP_001980216.1| GG19832 [Drosophila erecta]
gi|190651919|gb|EDV49174.1| GG19832 [Drosophila erecta]
Length = 208
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 89/168 (52%), Gaps = 7/168 (4%)
Query: 10 KLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKD 69
KL+ P PS +D F Y+ + K V H+N +C+I LAP H A
Sbjct: 24 KLVTPITDGFQEDYPSVVDRFFTRYY---YFKGDVPYQVLYHSNRICLICLAPEHPALGQ 80
Query: 70 EGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNN-DSYIVRCCVKGSL 128
GI++V+F++G DRS V + K+ ++ + + + T N ++Y V C++G L
Sbjct: 81 --GISSVNFDIGNVDRSQ-NVVKGKGKKGGMILQAESTLALIATENGETYKVPSCIRGKL 137
Query: 129 LEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+EVN L+++P LL + GY AI++PK + IK+SLL E Y+
Sbjct: 138 VEVNTALVEEPKLLEQLPEGAGYFAILLPKIENCDAIKASLLTQEQYE 185
>gi|195329160|ref|XP_002031279.1| GM24134 [Drosophila sechellia]
gi|194120222|gb|EDW42265.1| GM24134 [Drosophila sechellia]
Length = 214
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 90/176 (51%), Gaps = 5/176 (2%)
Query: 3 DDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAP 62
+++E KL+ P P+ +D F Y+ + K V H+N +C+I LAP
Sbjct: 17 EEQEICGKLVPPITESFQEDYPTVVDRFFTRYY---YFKGDAPYQVLYHSNRICLICLAP 73
Query: 63 THVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRC 122
H A GI++V+F++G DRS V GK KK +T N +Y V
Sbjct: 74 EHPALAQ--GISSVNFDIGNVDRSQNVVKGKGKKGGMILQAESTLALLTTANGGTYKVPS 131
Query: 123 CVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
C++G L+EVN ++++P LL + GY AI++PK + IK+SLL E Y+ +
Sbjct: 132 CIRGKLVEVNTAIVEEPKLLEQLPEGAGYFAILLPKIENCDAIKASLLTQEQYEER 187
>gi|90076436|dbj|BAE87898.1| unnamed protein product [Macaca fascicularis]
Length = 180
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D K D + +H+N +CVI LA +H + I ++ + +
Sbjct: 13 PSLVDRYFTRWYKLDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSISYQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+++PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 167
>gi|125775293|ref|XP_001358898.1| GA21672 [Drosophila pseudoobscura pseudoobscura]
gi|54638639|gb|EAL28041.1| GA21672 [Drosophila pseudoobscura pseudoobscura]
Length = 219
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 5/154 (3%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
PS +D F ++ +++ V H+N +C+I LAPTH A GI ++ F+VG
Sbjct: 38 PSVVDRFFTRFY---YIRANAAYQVLYHSNRICLICLAPTHPALAQ--GIASLSFDVGNV 92
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLN 143
DRS V GK KK +T Y + C++G L+EVN L+ +P LL
Sbjct: 93 DRSQNVVKGKAKKGGMILQADSTLALLTTETGQIYKIPSCIRGKLVEVNAALVAEPKLLE 152
Query: 144 SSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
+ GY AI++PK + I+ SLL E Y+
Sbjct: 153 QLPEGAGYFAILLPKIENCDLIRESLLTQEQYEE 186
>gi|195144686|ref|XP_002013327.1| GL24085 [Drosophila persimilis]
gi|194102270|gb|EDW24313.1| GL24085 [Drosophila persimilis]
Length = 219
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 5/154 (3%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
PS +D F ++ +++ V H+N +C+I LAPTH A GI ++ F+VG
Sbjct: 38 PSVVDRFFTRFY---YIRANAAYQVLYHSNRICLICLAPTHPALAQ--GIASLSFDVGNV 92
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLN 143
DRS V GK KK +T Y + C++G L+EVN L+ +P LL
Sbjct: 93 DRSQNVVKGKAKKGGMILQADSTLALLTTETGQIYKIPSCIRGKLVEVNAALVAEPKLLE 152
Query: 144 SSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
+ GY AI++PK + I+ SLL E Y+
Sbjct: 153 QLPEGAGYFAILLPKIENCDLIRESLLTQEQYEE 186
>gi|328710080|ref|XP_003244158.1| PREDICTED: UPF0436 protein C9orf6 homolog [Acyrthosiphon pisum]
Length = 189
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 87/159 (54%), Gaps = 6/159 (3%)
Query: 23 PPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNV- 80
PP+ + F + +A + KP D V H+N LC++ LA THV I +DF V
Sbjct: 30 PPTVVQRYFESRYATNVNDKPNQDYSVLIHSNNLCILSLASTHVLMGKT--IDKIDFQVS 87
Query: 81 GKSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQP 139
G +DR +TGK K+ AQ ++ + +CKV C++ + + + + G L+++N++L P
Sbjct: 88 GNTDRLSNNMTGKGKR-GAQIVQAGSTLCKVHCSDGEEHKILSAIPGKLVQMNSKLADTP 146
Query: 140 GLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
++ D G+IAI++P+ + KI+ LL E Y +
Sbjct: 147 NIMLDKPDDLGFIAIVLPQKQRFEKIRDGLLTKEQYTNR 185
>gi|427786821|gb|JAA58862.1| Putative glycine cleavage system h protein [Rhipicephalus
pulchellus]
Length = 210
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 80/155 (51%), Gaps = 8/155 (5%)
Query: 34 YFAPDFL-----KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-KSDRSG 87
YF P F K GHDQ H+N + ++ LAP+H +++ + VD+ V K +R
Sbjct: 32 YFKPRFQLDANGKKGHDQCALFHSNRIVLVTLAPSHPIIREKKTVIKVDYQVNEKLNRLN 91
Query: 88 FKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSA 146
+V+GK KK Q + +A+C V C++ + Y V V GSL+EVN+ L P LL
Sbjct: 92 NQVSGKWKK-GGQKLQKESALCIVTCSDGEIYTVTSTVPGSLMEVNDHLTTSPELLTQKP 150
Query: 147 DREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
GYIAI++P + +++ E Y R E
Sbjct: 151 WSSGYIAIVLPPLKMDTMLLDTMMSPEAYVTARRE 185
>gi|195571063|ref|XP_002103523.1| GD18931 [Drosophila simulans]
gi|194199450|gb|EDX13026.1| GD18931 [Drosophila simulans]
Length = 214
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 93/179 (51%), Gaps = 6/179 (3%)
Query: 1 MQDDEEELHKLLLPNVHD-LPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIG 59
+ +E+E+ L+P + D P+ +D F Y+ + K V H+N +C+I
Sbjct: 14 FEHEEQEICGKLVPPITDSFQEDYPTVVDRFFTRYY---YFKGDAPYQVLYHSNRICLIC 70
Query: 60 LAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYI 119
LAP H A GI++V+F++G DRS V GK KK +T N +Y
Sbjct: 71 LAPEHPALAQ--GISSVNFDIGNVDRSQNVVKGKGKKGGMILQAESTLALLTTANGATYK 128
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V C++G L+EVN ++++P LL + GY AI++PK + IK+SLL E Y+ +
Sbjct: 129 VPSCIRGKLVEVNTAIVEEPKLLEQLPEGAGYFAILLPKIENCDAIKASLLTQEQYEER 187
>gi|355567549|gb|EHH23890.1| hypothetical protein EGK_07451 [Macaca mulatta]
Length = 180
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 81/156 (51%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSIPCQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+ +PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVELPKFEESKNITEGLLTQKQYE 167
>gi|355753126|gb|EHH57172.1| hypothetical protein EGM_06754 [Macaca fascicularis]
Length = 180
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 81/156 (51%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ PD K D + +H+N +CVI LA +H + I ++ +
Sbjct: 13 PSLVDRYFTRWYKPDVKGKFCEDHCILQHSNRICVITLAESHPVLQSGKTIKSIPCQIST 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +CK+ C++ + Y V CV+G L+EVN ++ +P
Sbjct: 73 NCSRLQNKVSGKFKR-GAQFLTELAPLCKIYCSDGEEYTVSSCVRGRLMEVNENILHKPS 131
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
+L EGYIA+ +PK + I LL + Y+
Sbjct: 132 ILQEKPSTEGYIAVELPKFEESKNITEGLLTQKQYE 167
>gi|281202532|gb|EFA76734.1| hypothetical protein PPL_09485 [Polysphondylium pallidum PN500]
Length = 172
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 81/157 (51%), Gaps = 8/157 (5%)
Query: 25 SAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
+ I+ + Y+A D K +DQY+ +H+NG+CV+GLAPTH A + +++
Sbjct: 3 TVIERYYDQYYAIDIAGKENNDQYINKHSNGVCVVGLAPTHPALTK--SMVKIEYT---G 57
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLL 142
D + G +KK A +T V + C +N +Y + CVKG LLE N + + +L
Sbjct: 58 DLENNQTQGTQKK-GAFKLHRDTIVAYITCDDNSTYPIYSCVKGKLLETNKTITQNIDIL 116
Query: 143 NSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
++ GYI I+ P D L +S LL E Y + R
Sbjct: 117 KNNYSTSGYIIIVEPFETDELSKQSELLSFEKYHQVR 153
>gi|395515395|ref|XP_003761890.1| PREDICTED: protein FAM206A, partial [Sarcophilus harrisii]
Length = 169
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 78/138 (56%), Gaps = 3/138 (2%)
Query: 41 KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNA 99
KP D + +H+N +CVI LA +H ++ I ++++ + + R KV+GK K+ A
Sbjct: 19 KPCEDHCILQHSNRICVITLAESHPLLQNGKTIKSINYQISANCSRLQNKVSGKFKR-GA 77
Query: 100 QHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPK 158
Q +C++ C++ + Y V CV+G L+EVN ++ +P +L EGYIA+++PK
Sbjct: 78 QFLTELAPLCRISCSDGEEYTVSSCVRGRLMEVNENILDKPSILQEKPSTEGYIAVVLPK 137
Query: 159 PADWLKIKSSLLGLEDYK 176
+ I LL ++Y+
Sbjct: 138 FEESKSITEGLLTQKEYE 155
>gi|195109374|ref|XP_001999262.1| GI23167 [Drosophila mojavensis]
gi|193915856|gb|EDW14723.1| GI23167 [Drosophila mojavensis]
Length = 198
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 79/155 (50%), Gaps = 5/155 (3%)
Query: 24 PSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
PS +D F Y+ ++K G V H+N +C+I LAPTH A+ + GI +V ++VG
Sbjct: 43 PSVVDRFFTRYY---YIKGGAPYQVLYHSNRICLICLAPTHPAYAE--GIASVSYDVGNM 97
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLN 143
DRS V GK KK E T Y + C++G L+EVN L L+
Sbjct: 98 DRSQNVVKGKAKKGGMILQEDTTLALLTTETGTVYKIPSCIRGKLVEVNTALQTDAKQLH 157
Query: 144 SSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+A+ GY AI++PK + I S+LL + Y +
Sbjct: 158 QAAEGAGYFAILLPKIENCSDIISNLLTQQQYDEQ 192
>gi|224045927|ref|XP_002189552.1| PREDICTED: protein FAM206A [Taeniopygia guttata]
Length = 181
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 81/156 (51%), Gaps = 4/156 (2%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
P + F ++ PD +P D V +H+N +CVI LA H + I +++ +
Sbjct: 16 PGDAERYFTRWYKPDVKGRPCEDFCVLQHSNRICVITLAEAHPLLQSGKTIKNINYQISA 75
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKVCTNN-DSYIVRCCVKGSLLEVNNRLIKQPG 140
+ R KV+GK K+ AQ +C++ +++ + Y + C++G L+EVN ++ P
Sbjct: 76 NCSRLQNKVSGKSKR-GAQFLTELAPLCRISSSDGEEYTIYSCIRGRLIEVNENILSNPA 134
Query: 141 LLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
LL EGYIA+++PK + I LL ++Y+
Sbjct: 135 LLQEKPSTEGYIAVVLPKFEESKSITQGLLTQKEYE 170
>gi|328869692|gb|EGG18069.1| hypothetical protein DFA_06736 [Dictyostelium fasciculatum]
Length = 187
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 80/159 (50%), Gaps = 8/159 (5%)
Query: 25 SAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
+ I+ + YF+ D K DQY+ +H+NG+ V+GLAPTH ++ I ++++
Sbjct: 3 TVIERYYDQYFSIDNQGKENEDQYINQHSNGVFVVGLAPTHPIISEQKSILKIEYSQAAL 62
Query: 84 DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLL 142
+ F +GK+ A TA+C + C + SY + CVKG LLE+N LI P +L
Sbjct: 63 NNV-FSGSGKK---GAVKMNKETAMCTIHCVDGTSYSIYPCVKGKLLEINKILIDNPSML 118
Query: 143 NSSADREGYIAIIMPKPADWL--KIKSSLLGLEDYKRKR 179
S YI +I P + K +S L E Y KR
Sbjct: 119 LSKHATSAYICMIDPYDRETYMDKYQSKFLTFEQYHLKR 157
>gi|57530419|ref|NP_001006383.1| protein FAM206A [Gallus gallus]
gi|82080896|sp|Q5ZHW7.1|F206A_CHICK RecName: Full=Protein FAM206A
gi|53136694|emb|CAG32676.1| hypothetical protein RCJMB04_32i4 [Gallus gallus]
Length = 173
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 79/149 (53%), Gaps = 4/149 (2%)
Query: 31 FVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGF 88
F ++ D +P D V +H+N +CVI LA H + IT++++ + + R
Sbjct: 13 FTRWYKADVKGRPCEDFCVLQHSNRICVITLAEAHPLLQPGKTITSINYQISPNCSRLQN 72
Query: 89 KVTGKRKKQNAQHFESNTAVCKVCTNN-DSYIVRCCVKGSLLEVNNRLIKQPGLLNSSAD 147
KV+GK K+ AQ +C++ +++ + Y + ++G L+EVN ++ P LL
Sbjct: 73 KVSGKSKR-GAQFLTELAPLCRIASSDGEEYTIYSFIRGRLIEVNENILSNPALLQEKPS 131
Query: 148 REGYIAIIMPKPADWLKIKSSLLGLEDYK 176
EGYIA+++PK + + LL E+YK
Sbjct: 132 TEGYIAVVLPKFEESKSVTQGLLTPEEYK 160
>gi|195061631|ref|XP_001996034.1| GH14275 [Drosophila grimshawi]
gi|193891826|gb|EDV90692.1| GH14275 [Drosophila grimshawi]
Length = 197
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 84/176 (47%), Gaps = 5/176 (2%)
Query: 2 QDDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLA 61
Q+ +E +L+ P PS +D F Y+ ++K V H+N +C+I LA
Sbjct: 17 QEKQEICGQLVSPITDCYEEKYPSVVDRFFTRYY---YIKSDAAYQVLYHSNRICLICLA 73
Query: 62 PTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVR 121
TH A+ + GI +V ++VG DRS V GK KK T Y +
Sbjct: 74 STHPAYGE--GIASVSYDVGNVDRSQNVVKGKAKKGGMILQADTTLALLTTETGKVYKIP 131
Query: 122 CCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKR 177
C++G L+EVN L+ L + + GY AI++PK + IK+SLL + Y+
Sbjct: 132 SCIRGKLVEVNTSLVADAKQLETVPEGAGYFAILLPKIENCDDIKASLLTQQQYEE 187
>gi|225719312|gb|ACO15502.1| F54F2.7 [Caligus clemensi]
Length = 197
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 71/135 (52%), Gaps = 5/135 (3%)
Query: 44 HDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVG-KSDRSGFKVTGKRKKQNAQHF 102
HDQ + H+N L +I LAP H F EG T VD+ V K DRS V GK KK AQ
Sbjct: 53 HDQIICVHSNRLVLISLAPRHGLFSKEGSFT-VDYRVTPKCDRSANSVRGKGKK-GAQEL 110
Query: 103 ESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPK-PA 160
+C + + + V+ VKG L+EVN L+ P L +G+IA+I+P P
Sbjct: 111 SPEANLCVIKPQEGEEFFVKALVKGKLVEVNKSLLNDPQALKRYPQGQGFIALILPNLPE 170
Query: 161 DWLKIKSSLLGLEDY 175
++K++ L ++Y
Sbjct: 171 GLAQLKATYLSEKEY 185
>gi|391341951|ref|XP_003745288.1| PREDICTED: protein FAM206A-like [Metaseiulus occidentalis]
Length = 156
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 67/114 (58%), Gaps = 2/114 (1%)
Query: 45 DQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFE 103
D + H+N + V+ LAP+H ++ + VDF + +S +R KV+GK KK Q +
Sbjct: 3 DLMILIHSNRVAVVSLAPSHTIVRERLRVDRVDFRINESLNRLENKVSGKWKK-GGQRVK 61
Query: 104 SNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
++++C + N + VR CV G+L+EVN RL +P LLNS+ +I +++P
Sbjct: 62 KDSSLCYIHAENRIFTVRACVPGALVEVNARLCDEPSLLNSNPFDLAHIGVVLP 115
>gi|330797610|ref|XP_003286852.1| hypothetical protein DICPUDRAFT_150882 [Dictyostelium purpureum]
gi|325083154|gb|EGC36614.1| hypothetical protein DICPUDRAFT_150882 [Dictyostelium purpureum]
Length = 184
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 19/146 (13%)
Query: 45 DQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDF--NVGKSDRSGFKVTGKRKKQNAQHF 102
D YV +H NGL +IG+AP+H K E I+ ++F N S+ SG K G K Q
Sbjct: 29 DHYVNQHTNGLAIIGVAPSHPVLKKE--ISKIEFQENSMNSEVSGIKKRGGFKLQKE--- 83
Query: 103 ESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQP--------GLLNSSADREGYIA 153
T +CK+ C++N Y +R C+KG LLE+N L + LL ++ G+IA
Sbjct: 84 ---TIICKIYCSDNTQYNIRSCIKGKLLEINKNLTENDEKHNNVNFSLLKDNSSTSGFIA 140
Query: 154 IIMPKPADWLKIKSSLLGLEDYKRKR 179
I+ P L+ +DY + R
Sbjct: 141 IVEPLIQAKFIENPGLISYDDYHKIR 166
>gi|119579438|gb|EAW59034.1| chromosome 9 open reading frame 6, isoform CRA_c [Homo sapiens]
Length = 257
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 68/126 (53%), Gaps = 3/126 (2%)
Query: 53 NGLCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV 111
G+CVI LA +H + I ++ + + + R KV+GK K+ AQ +CK+
Sbjct: 119 TGICVITLAESHPVLQSGKTIKSISYQISTNCSRLQNKVSGKFKR-GAQFLTELAPLCKI 177
Query: 112 -CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLL 170
C++ + Y V CV+G L+EVN ++ +P +L EGYIA+++PK + I LL
Sbjct: 178 YCSDGEEYTVSSCVRGRLMEVNENILHKPSILQEKPSTEGYIAVVLPKFEESKSITEGLL 237
Query: 171 GLEDYK 176
+ Y+
Sbjct: 238 TQKQYE 243
>gi|154336649|ref|XP_001564560.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061595|emb|CAM38625.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 210
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 82/153 (53%), Gaps = 15/153 (9%)
Query: 19 LPLTPPSAIDFNFVAYFAPDFLK-----PGHDQYVYRHANGLCVIGLAPTH-----VAFK 68
+PL PP+ D+ YF ++ G+D + +H+NGLCV+ L P H A
Sbjct: 33 VPLPPPTTFDYFTKRYFHQFSIRDCKKVKGNDCRILQHSNGLCVLCLDPNHELVRRCAAA 92
Query: 69 DEGGITA-VDFNVGKSD--RSGFKVTGKRKKQNAQHFESNTAVCK-VCTNNDSYIVRCCV 124
D G A V+F G++ +V GK+KK NA F+++T VC V ++ Y + CV
Sbjct: 93 DAGVTVAKVEFFKGRTAIMPESIQVVGKKKK-NALVFQNDTKVCSIVLSDMIEYTIPACV 151
Query: 125 KGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
G +LE+N L++ P ++ + EGY+A+I P
Sbjct: 152 NGYVLELNAALLEHPWMVAVAPTAEGYLAVINP 184
>gi|281345178|gb|EFB20762.1| hypothetical protein PANDA_004556 [Ailuropoda melanoleuca]
Length = 138
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 67/124 (54%), Gaps = 3/124 (2%)
Query: 55 LCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-C 112
+CVI LA +H + I +V + + + R KV+GK K+ AQ +CK+ C
Sbjct: 2 ICVITLAESHPVLQSGKTIKSVSYQISTNCSRLQNKVSGKFKR-GAQFLTELAPLCKIYC 60
Query: 113 TNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGL 172
++ + Y V CV+G L+EVN ++ +P +L EGYIA+++PK + I LL
Sbjct: 61 SDGEEYTVSSCVRGRLMEVNENILHEPSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQ 120
Query: 173 EDYK 176
+ Y+
Sbjct: 121 QQYE 124
>gi|325182045|emb|CCA16498.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 282
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 86/168 (51%), Gaps = 10/168 (5%)
Query: 14 PNVHDLPLTPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGG 72
P+V P + ++ + FA + + D + H+N LCV LAP+H F++
Sbjct: 85 PSVKAQNTRPATIVERYYTELFALNVMGNESEDISIQMHSNRLCVTTLAPSHPVFQE--S 142
Query: 73 ITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEV 131
I +V+F ++ +KK+ T +C++ C + Y + C++G+LLE+
Sbjct: 143 IISVEFG----EKLKENKVNGKKKKGGLFLIPTTILCRIRCQSGRLYEIPSCIRGALLEI 198
Query: 132 NNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
N R+I+QP LL A EGY+ I+ PK A+ I+ +L+ DY+ R
Sbjct: 199 NERVIEQPELLKHPA--EGYLVIVQPKVAEIYGIQQTLVPKSDYELLR 244
>gi|389609513|dbj|BAM18368.1| similar to CG9288 [Papilio xuthus]
Length = 175
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 75/136 (55%), Gaps = 3/136 (2%)
Query: 43 GHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHF 102
+D + H+N + +I LAP+H FK ++F++G DR V GK KK Q
Sbjct: 38 NNDIMLMFHSNRITLISLAPSHFFFKKTDQY-KINFSIGNIDRLTNSVKGKGKKG-GQML 95
Query: 103 ESNTAVCKVCTNNDS-YIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPAD 161
+N+ +CK+ ++ + + + CC+KG+L+EVN L K P LL D G+IAI++ A
Sbjct: 96 TANSVICKIEYDDGTMFAIPCCMKGTLIEVNKELTKNPELLREQPDSNGFIAIMLSSIAI 155
Query: 162 WLKIKSSLLGLEDYKR 177
KS LL E+Y +
Sbjct: 156 SDSTKSELLNHEEYVK 171
>gi|326917295|ref|XP_003204935.1| PREDICTED: UPF0436 protein C9orf6 homolog [Meleagris gallopavo]
Length = 142
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 68/124 (54%), Gaps = 3/124 (2%)
Query: 55 LCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKVCT 113
+CVI LA H + IT++++ + + R KV+GK K+ AQ +C++ +
Sbjct: 7 ICVITLAEAHPLLQHGRTITSINYQISPNCSRLQNKVSGKSKR-GAQFLTELAPLCRISS 65
Query: 114 NN-DSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGL 172
++ + Y + C++G L+EVN ++ P LL EGYIA+++PK + I LL
Sbjct: 66 SDGEEYTIYSCIRGRLIEVNENILSNPALLQEKPSTEGYIAVVLPKFEESKSITQGLLTP 125
Query: 173 EDYK 176
E+YK
Sbjct: 126 EEYK 129
>gi|47202750|emb|CAG14078.1| unnamed protein product [Tetraodon nigroviridis]
Length = 140
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D + ++ D KP D + +H+N +CV+ LA +H +DE I ++++ +
Sbjct: 13 PSVVDRYYTRWYRADMKGKPCEDHCILQHSNRICVVTLAESHPILQDERQIKSINYQISN 72
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPG 140
R KV+GK K+ Q +C++ CT+ Y + C++G LLEVN +++ P
Sbjct: 73 GCSRLNNKVSGKSKR-GGQFLTDFAPLCRITCTDETEYTIYSCIRGRLLEVNEAILETPS 131
Query: 141 LL 142
LL
Sbjct: 132 LL 133
>gi|402585653|gb|EJW79592.1| hypothetical protein WUBG_09496 [Wuchereria bancrofti]
Length = 156
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 77/138 (55%), Gaps = 12/138 (8%)
Query: 25 SAIDFNFVAYFAPDFLKPGHDQYVY-RHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
S +D + Y LK G + Y RH +G+ V+ L +H + + IT +D+ + K+
Sbjct: 6 SVVDRIYKRYV----LKNGGNGLCYLRHPSGVIVVTLCKSHELIQKQ--ITKIDWRMKKN 59
Query: 84 ---DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQP 139
DRS KV+GK KK +T +C V C + YI+R +K L+E+N RL++ P
Sbjct: 60 RGIDRSKQKVSGKAKK-GGLALLPDTKLCIVHCEDGTEYILRAGIKALLIEINERLLEDP 118
Query: 140 GLLNSSADREGYIAIIMP 157
L+ SS + +GYIAII+P
Sbjct: 119 NLMRSSPENQGYIAIIIP 136
>gi|119579436|gb|EAW59032.1| chromosome 9 open reading frame 6, isoform CRA_a [Homo sapiens]
Length = 156
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 3/124 (2%)
Query: 55 LCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKV-C 112
+CVI LA +H + I ++ + + + R KV+GK K+ AQ +CK+ C
Sbjct: 20 ICVITLAESHPVLQSGKTIKSISYQISTNCSRLQNKVSGKFKR-GAQFLTELAPLCKIYC 78
Query: 113 TNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGL 172
++ + Y V CV+G L+EVN ++ +P +L EGYIA+++PK + I LL
Sbjct: 79 SDGEEYTVSSCVRGRLMEVNENILHKPSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQ 138
Query: 173 EDYK 176
+ Y+
Sbjct: 139 KQYE 142
>gi|291229714|ref|XP_002734820.1| PREDICTED: UPF0436 protein C9orf6 homolog [Saccoglossus
kowalevskii]
Length = 198
Score = 72.8 bits (177), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 81/157 (51%), Gaps = 2/157 (1%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F + D +PG D + +H+N + ++ +A +H K+ I V F V K
Sbjct: 31 PSVVDRYFRRGYKIDVNGRPGEDLCILQHSNKVIIVTIADSHPLIKNNKTIKNVTFQVTK 90
Query: 83 S-DRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGL 141
+ +R KV+GK KK E+ C ++ Y + CV G L+EVN +L+ P L
Sbjct: 91 NLNRMDNKVSGKGKKGGQWVNETGPLCIITCDDDSQYTIYSCVNGKLVEVNEKLLSVPNL 150
Query: 142 LNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+N +G+I+II+PK + LL ++Y+++
Sbjct: 151 INEKPSTDGFISIILPKFGNSDPQMEKLLTKDEYEKE 187
>gi|312080338|ref|XP_003142557.1| hypothetical protein LOAG_06975 [Loa loa]
gi|307762278|gb|EFO21512.1| hypothetical protein LOAG_06975 [Loa loa]
Length = 164
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 77/138 (55%), Gaps = 12/138 (8%)
Query: 25 SAIDFNFVAYFAPDFLKPGHDQYVY-RHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS 83
S +D N+ Y LK G + Y RH +G+ V+ L H + + I +D+++ K+
Sbjct: 15 SVVDRNYKRYI----LKNGGNGLCYLRHPSGVIVVTLYKNHELAQKQ--IKKIDWHMKKN 68
Query: 84 ---DRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQP 139
DRS KV+GK KK +T +C V C + Y++R +K L+EVN RL++ P
Sbjct: 69 KGIDRSKQKVSGKAKK-GGLALLPDTKLCIVHCEDGTEYVLRAGIKALLIEVNERLLEDP 127
Query: 140 GLLNSSADREGYIAIIMP 157
L+ S+ + +GY+AII+P
Sbjct: 128 NLMRSAPENQGYLAIIIP 145
>gi|449271786|gb|EMC82026.1| UPF0436 protein C9orf6 like protein, partial [Columba livia]
Length = 135
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 69/124 (55%), Gaps = 3/124 (2%)
Query: 55 LCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVCKVCT 113
+CVI LA H ++ IT+V++ + + R KV+GK K+ AQ +C++ +
Sbjct: 2 ICVITLAEAHPLLQNGKKITSVNYQISANCSRLQNKVSGKSKR-GAQFLTELAPLCRISS 60
Query: 114 NN-DSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGL 172
++ + Y + C++G L+EVN ++ P +L EGYIA+++PK + + LL
Sbjct: 61 SDGEEYTIYSCIRGRLIEVNENILSNPSILQEKPSTEGYIAVVLPKFEESKSVTQGLLTQ 120
Query: 173 EDYK 176
++Y+
Sbjct: 121 KEYE 124
>gi|52345836|ref|NP_001004966.1| protein FAM206A [Xenopus (Silurana) tropicalis]
gi|49522442|gb|AAH75478.1| MGC89307 protein [Xenopus (Silurana) tropicalis]
Length = 236
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/208 (22%), Positives = 84/208 (40%), Gaps = 55/208 (26%)
Query: 24 PSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK 82
PS +D F ++ D KP D + +H+N +C+I LA H ++ I + + +
Sbjct: 12 PSVVDRYFKRWYKSDVKGKPCEDHCILQHSNRICIITLAECHPLLQNGKTIKTISYQISA 71
Query: 83 S-DRSGFKVTGKRKK--------------------------------------------- 96
+ R KV+GK K+
Sbjct: 72 NCSRLQNKVSGKSKRGRPFIFQQDNARPHSASITTSWLRRRRIRVLKWPVCSPDLSPIEN 131
Query: 97 -------QNAQHFESNTAVCKVC-TNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADR 148
+ AQ + +C++ T+ + Y + C++G LLEVN +++ P LL
Sbjct: 132 IWHIIQRKGAQFLTEHAPLCRISSTDGEEYTIYSCIRGRLLEVNENILQNPELLKEKPST 191
Query: 149 EGYIAIIMPKPADWLKIKSSLLGLEDYK 176
EGYIA+++PK + + LL ++Y+
Sbjct: 192 EGYIAVVLPKFEESKTVTEGLLSQQEYE 219
>gi|146099952|ref|XP_001468798.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398022592|ref|XP_003864458.1| hypothetical protein, conserved [Leishmania donovani]
gi|134073166|emb|CAM71887.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322502693|emb|CBZ37776.1| hypothetical protein, conserved [Leishmania donovani]
Length = 211
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 71/124 (57%), Gaps = 10/124 (8%)
Query: 43 GHDQYVYRHANGLCVIGLAPTHVAFK----DEGGITA--VDFNVGKSDRS--GFKVTGKR 94
G+D + +H+NGLCV+ L P H + + G+T V F G++ S +V GK+
Sbjct: 63 GNDCRILQHSNGLCVLCLDPNHELVRRCAPADAGVTVAKVAFFKGRTAISPENIQVVGKK 122
Query: 95 KKQNAQHFESNTAVCK-VCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIA 153
KK NA +++T +C V ++ Y V CV G +LE+N L++QP ++ + EGY+A
Sbjct: 123 KK-NALVCQNDTKLCSIVLSDGTEYTVPACVNGFVLELNATLLEQPWMVAVAPTAEGYLA 181
Query: 154 IIMP 157
II P
Sbjct: 182 IINP 185
>gi|401428789|ref|XP_003878877.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322495126|emb|CBZ30430.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 211
Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 71/124 (57%), Gaps = 10/124 (8%)
Query: 43 GHDQYVYRHANGLCVIGLAPTHVAFK----DEGGITA--VDFNVGKSD--RSGFKVTGKR 94
G+D + +H+NGLCV+ L P H + + G+T V F G++ +V GK+
Sbjct: 63 GNDCRILQHSNGLCVLCLDPNHAVVRRCAPADAGVTVAKVAFFKGRTAIVPESIQVVGKK 122
Query: 95 KKQNAQHFESNTAVCK-VCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIA 153
KK NA +++T +C V ++ Y + CV G +LE+N L++QP ++ ++ EGY+A
Sbjct: 123 KK-NALVCQNDTKLCGIVLSDGTEYTIPACVNGFVLELNATLLEQPWMVAAAPTAEGYLA 181
Query: 154 IIMP 157
II P
Sbjct: 182 IINP 185
>gi|300175978|emb|CBK22195.2| unnamed protein product [Blastocystis hominis]
Length = 142
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 72/130 (55%), Gaps = 4/130 (3%)
Query: 51 HANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFK--VTGKRKKQNAQHFESNTAV 108
H+N +C++ LA H V F GK R K V+GKRKK E +T +
Sbjct: 2 HSNTICIVSLAFGHDIITKGKRCVKVVFGKGKDHRDLVKNDVSGKRKK-GGIWVEESTIL 60
Query: 109 CKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKS 167
C++ C + Y + C++ LLEVN RL+++P LLN+ +G+IA+ PK AD + I+
Sbjct: 61 CEITCDDGSVYKIAACIRARLLEVNLRLLEEPELLNTKPRTDGFIALFAPKVADVMNIQK 120
Query: 168 SLLGLEDYKR 177
SLL E+Y R
Sbjct: 121 SLLTSEEYIR 130
>gi|294944313|ref|XP_002784193.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239897227|gb|EER15989.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 226
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 64/122 (52%), Gaps = 10/122 (8%)
Query: 43 GHDQYVYRHANGLCVIGLAPTHVAFKDEGG--ITAVDFNVGKSDRSGFKVTGKRKKQNAQ 100
GH QYV+RH+N + ++G APTHV F + G + +V + G + +VTGKRK+ A
Sbjct: 62 GHHQYVHRHSNNIFLVGAAPTHVMFTENPGVKVKSVKYREGMTTT---QVTGKRKR-GAI 117
Query: 101 HFESNTAVCKVCTNNDSYIVRC-CVKGSLLEVNNRLIKQPG---LLNSSADREGYIAIIM 156
T + V N C C++G ++E N RL + S + EG++AI++
Sbjct: 118 SVNPTTVLADVTMENGKVYPLCACIRGDIIETNTRLCGDEAAKLMTPDSCETEGFVAILL 177
Query: 157 PK 158
K
Sbjct: 178 LK 179
>gi|157876129|ref|XP_001686425.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129499|emb|CAJ08042.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 211
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 43 GHDQYVYRHANGLCVIGLAPTH-----VAFKDEG-GITAVDFNVGKSD--RSGFKVTGKR 94
G+D + +H+NGLCV+ L P H A D G +T V F G++ +V GK+
Sbjct: 63 GNDCRILQHSNGLCVLCLDPNHELVRRCAPADTGVTVTKVAFFKGRTAIVPESIQVVGKK 122
Query: 95 KKQNAQHFESNTAVCK-VCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIA 153
KK NA +++T +C V ++ Y + CV G +LE+N L++QP ++ + EGY+A
Sbjct: 123 KK-NALVCQTDTKLCGIVLSDGTEYTIPACVNGFVLELNATLLEQPWMVAMAPIAEGYLA 181
Query: 154 IIMP 157
II P
Sbjct: 182 IINP 185
>gi|167383016|ref|XP_001736371.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901279|gb|EDR27372.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 187
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 66/133 (49%), Gaps = 13/133 (9%)
Query: 30 NFVAYFAPDFLKPG------HDQYVYRHANGLCVIGLAPTHVAFK-DEGGITAVDFNVGK 82
NF+ + + P +DQY++ H+NGL ++G+A H A K E I V
Sbjct: 15 NFLQRYTTQYYVPNIDDHQYYDQYMFDHSNGLVIMGIAQCHPACKAKEISIQIV------ 68
Query: 83 SDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLL 142
D+ + + K NA +S T + K+ +N Y + G L+E+N L+K P L+
Sbjct: 69 VDQGNNRPVKGKHKTNAIKLKSKTVIVKIIADNVEYPIYAGCSGKLIELNEELLKDPSLI 128
Query: 143 NSSADREGYIAII 155
++ +G+IAII
Sbjct: 129 KKESEGKGFIAII 141
>gi|452821982|gb|EME29006.1| hypothetical protein Gasu_35760 [Galdieria sulphuraria]
Length = 163
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 76/144 (52%), Gaps = 13/144 (9%)
Query: 23 PPSAIDFNF-VAYFAPDFLKPGHDQY------VYRHANGLCVIGLAPTHVAFKDEGGITA 75
PP+ ++ F + F P +D++ V +H+N L VI LA +H + + +T
Sbjct: 17 PPTFVERYFKTSVFPYQRTNPPNDKFLCEDVLVAQHSNLLYVIFLADSHSIVQRKEIVTQ 76
Query: 76 VDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDS-YIVRCCVKGSLLEVNNR 134
+ N K D S V GKRKK + + T +C + T + + Y+V V+GSL+E N R
Sbjct: 77 I--NPIKWDPS--TVHGKRKK-GGRAVQPRTIICSIMTQSGTEYLVPAGVQGSLVEFNWR 131
Query: 135 LIKQPGLLNSSADREGYIAIIMPK 158
L +P LL EGYIA+I+PK
Sbjct: 132 LASEPSLLLDKRKTEGYIAVILPK 155
>gi|440295326|gb|ELP88239.1| hypothetical protein EIN_225720 [Entamoeba invadens IP1]
Length = 186
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 61/119 (51%), Gaps = 7/119 (5%)
Query: 43 GHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHF 102
G DQ+++ H+NGL ++GLA TH K A D V + + +V G R K AQ+
Sbjct: 34 GSDQFIFLHSNGLVLVGLARTHPVCK------AQDIKVAINTQKENQVKG-RHKFGAQNV 86
Query: 103 ESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPAD 161
+ T + V + Y V G LLE+N L P LL + ++ +G++AI+ P +
Sbjct: 87 KPKTVLITVVADGVEYKVHALTSGRLLEMNESLTDHPELLKNESEGKGFVAILQKNPNE 145
>gi|255083018|ref|XP_002504495.1| predicted protein [Micromonas sp. RCC299]
gi|226519763|gb|ACO65753.1| predicted protein [Micromonas sp. RCC299]
Length = 242
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 59/109 (54%), Gaps = 2/109 (1%)
Query: 73 ITAVDFNVGKSDRSGFKVTGK-RKKQNAQHFESNTAVCKVCTNNDSY-IVRCCVKGSLLE 130
I +VD+ GK + G +V + +KK+ A+ + N+ +C + Y I R CV+G L+E
Sbjct: 129 IASVDYTCGKGEGKGNEVNIRGKKKRGAKTLQENSGLCWLVDEAGMYWIARACVRGKLIE 188
Query: 131 VNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
N RL K+P L G++AII PAD +++ + + E Y+ R
Sbjct: 189 TNERLAKEPQLCTRFPTASGFLAIISAAPADMQRLRETGMNREAYEALR 237
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 20 PLTPPSAIDFNFVAYFAPDFLKPGH-DQYVYRHANGLCVIGLAPTH 64
P P + D + Y+ PD H DQ VY H NGLCV+ LA H
Sbjct: 3 PRGPVTTRDRTYAVYYVPDVSGHWHHDQVVYAHPNGLCVVCLAERH 48
>gi|403344131|gb|EJY71403.1| hypothetical protein OXYTRI_07723 [Oxytricha trifallax]
Length = 230
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 76/160 (47%), Gaps = 8/160 (5%)
Query: 20 PLTPPSAIDF-NFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFK--DEGGITAV 76
P T + F N A D + DQ+ + H+N L + G++ H A K D I +V
Sbjct: 67 PQTISQMMGFVNSDAQLKQDGSEKCLDQFYFMHSNQLVLFGISNRHEAIKNNDTNPIVSV 126
Query: 77 DFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGS-LLEVNNR 134
+F+ + + G + K A + T +C++ C + Y +R VK + +LE+N R
Sbjct: 127 NFDCDRKQKRGDSQVKGKSKSGAIKLDPRTMLCEIKCQDGTYYKIRSFVKNAQILEMNKR 186
Query: 135 LIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLED 174
L P LL S ++ EGYIAI K L + L +ED
Sbjct: 187 LCLNPQLLASKSESEGYIAIFSVKKGQNL---NEALQMED 223
>gi|67479491|ref|XP_655127.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472241|gb|EAL49740.1| hypothetical protein EHI_138450 [Entamoeba histolytica HM-1:IMSS]
gi|449710039|gb|EMD49184.1| protein C9orf6 family protein [Entamoeba histolytica KU27]
Length = 187
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 75/162 (46%), Gaps = 28/162 (17%)
Query: 1 MQDDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPG------HDQYVYRHANG 54
M++D +E +L PN F+ + + P +DQY++ H+NG
Sbjct: 1 MEEDSKEKEQLKFPN---------------FLQRYTTQYYVPNIDDHQYYDQYMFDHSNG 45
Query: 55 LCVIGLAPTHVAFK-DEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCT 113
L ++G+A H A K E I V D+ + + K NA +S T + K+
Sbjct: 46 LVIMGIAQCHPACKAKEISIQIV------VDQGNNRPVKGKHKTNAIKLKSKTVIVKITA 99
Query: 114 NNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
+ Y + G L+E+N L+K P L+ + ++ +G+IAII
Sbjct: 100 DGVEYPIYAGCSGKLIELNEELLKNPSLIKNESEGKGFIAII 141
>gi|407041543|gb|EKE40800.1| hypothetical protein ENU1_079030 [Entamoeba nuttalli P19]
Length = 187
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 75/162 (46%), Gaps = 28/162 (17%)
Query: 1 MQDDEEELHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPG------HDQYVYRHANG 54
M++D +E +L PN F+ + + P +DQY++ H+NG
Sbjct: 1 MEEDSKEKEQLKFPN---------------FLQRYTTQYYVPNIDDHQYYDQYMFDHSNG 45
Query: 55 LCVIGLAPTHVAFK-DEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCT 113
L ++G+A H A K E I V D+ + + K NA +S T + K+
Sbjct: 46 LVIMGIAQCHPACKAKEISIQIV------VDQGNNRPVKGKHKTNAIKLKSKTVIVKITA 99
Query: 114 NNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
+ Y + G L+E+N L+K P L+ + ++ +G+IAII
Sbjct: 100 DGVEYPIYAGCSGKLIELNEELLKNPSLIKNDSEGKGFIAII 141
>gi|170571630|ref|XP_001891799.1| hypothetical protein Bm1_01415 [Brugia malayi]
gi|158603477|gb|EDP39391.1| hypothetical protein Bm1_01415 [Brugia malayi]
Length = 165
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 67/112 (59%), Gaps = 7/112 (6%)
Query: 50 RHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS---DRSGFKVTGKRKKQNAQHFESNT 106
RH +G+ V+ L +H + + IT +D+ + K+ DRS KV+GK KK +T
Sbjct: 37 RHPSGVIVVTLCKSHELIQKQ--ITKIDWRMKKNRGIDRSKQKVSGKGKKGGLALL-PDT 93
Query: 107 AVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
+C V C + YI+R +K L+E+N RL++ P L+ SS + +GYIAI++P
Sbjct: 94 KLCIVHCEDGTEYILRAGIKALLIEINERLLEDPNLMRSSPENQGYIAIVIP 145
>gi|324518948|gb|ADY47246.1| Unknown [Ascaris suum]
Length = 108
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 57/89 (64%), Gaps = 5/89 (5%)
Query: 73 ITAVDFNVGK---SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSL 128
IT V + +GK +DRS +VTGK KK +T +C V C + Y++R +K +L
Sbjct: 5 ITKVGWQMGKHKRTDRSQRQVTGKAKK-GGLALLPDTRLCIVECEDGSQYVLRAGIKANL 63
Query: 129 LEVNNRLIKQPGLLNSSADREGYIAIIMP 157
+EVNN L ++P L+ ++++ +GYIAII+P
Sbjct: 64 VEVNNYLSEKPELIRTASENQGYIAIIIP 92
>gi|355700983|gb|EHH29004.1| hypothetical protein EGK_09303, partial [Macaca mulatta]
gi|355754686|gb|EHH58587.1| hypothetical protein EGM_08464, partial [Macaca fascicularis]
Length = 124
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 62/122 (50%), Gaps = 3/122 (2%)
Query: 51 HANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS-DRSGFKVTGKRKKQNAQHFESNTAVC 109
H+N +CV L +H + I ++ + + R KV+GK K Q AQ +C
Sbjct: 1 HSNRICVTTLTESHPVLQSGKTIKSIPCQISTNCSRLQNKVSGKCK-QGAQFLTELAPLC 59
Query: 110 KV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSS 168
K+ C++ ++Y V CV+G +EV ++ +P +L EGYIA+ +PK + I
Sbjct: 60 KIYCSDGEAYTVSSCVRGCSMEVIENILHKPSILQEKPSTEGYIAVELPKFEESKNITEG 119
Query: 169 LL 170
LL
Sbjct: 120 LL 121
>gi|307108711|gb|EFN56950.1| hypothetical protein CHLNCDRAFT_143503 [Chlorella variabilis]
Length = 242
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 85/205 (41%), Gaps = 40/205 (19%)
Query: 15 NVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGIT 74
+V LP PP+ I ++ + F D G+D H NGLC++GLAP H A + G +
Sbjct: 7 DVAALPERPPTHIQESYSSLFLLD--DGGNDTSFRLHPNGLCLVGLAPGHAAHCEHAGGS 64
Query: 75 AVD------------FNVGKSDRSGFKVTGKRKKQ-----------------NAQHFESN 105
D +V D + V G R +Q +AQ
Sbjct: 65 GGDQPGAAGQGAPTGHDVMLLDSAADGVQGARAQQAQPGEDGSQQQQGILPPSAQQQSGG 124
Query: 106 TA---------VCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIM 156
A ++ Y++RC V+G L+E+N RL + G A +GY+AI+
Sbjct: 125 DARLALAPKLLQAELSVGGAEYVLRCGVRGLLVELNARLQRLAGRELLRALEDGYLAILE 184
Query: 157 PKPADWLKIKSSLLGLEDYKRKREE 181
+ A+ ++ L E Y+++ E
Sbjct: 185 LRAAEAQQLAGRCLSEERYRQRLAE 209
>gi|72387756|ref|XP_844302.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62359454|gb|AAX79891.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70800835|gb|AAZ10743.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 197
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 80/163 (49%), Gaps = 22/163 (13%)
Query: 24 PSAIDFNFVAYFAPDFLK-----PGHDQYVYRHANGLCVIGLAPTHVAFKD--------E 70
PS D+ Y+ +K G++ + H+NG+CV+ L TH + E
Sbjct: 14 PSTFDYFTERYYHQYVVKNCKGVEGNNCRLLVHSNGICVLCLDETHRVVRAAKSSAGAVE 73
Query: 71 GGITAVDFNVGKSDR----SGFKVTGKRKKQNAQHFESNTAVCKVCTNNDS-YIVRCCVK 125
+ +V F G+ + V GKRKKQ A + +T +C + ++ + Y + CV
Sbjct: 74 TNVASVVFGSGRGNSQLSSGSIHVVGKRKKQAAV-CQVDTKICIITMSDGTVYHIPACVD 132
Query: 126 GSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSS 168
G +LE+N+ L + P LL + EGYIA+I P ++ K+K S
Sbjct: 133 GFVLELNSALQQHPNLLLDAPTAEGYIALISP---NYSKVKFS 172
>gi|320165348|gb|EFW42247.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 310
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 59/125 (47%), Gaps = 3/125 (2%)
Query: 35 FAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKR 94
+A D G QYV+ H N L ++G+AP+H A V + + +
Sbjct: 120 YAMDVQGQGCHQYVFEHYNKLQLVGIAPSHAIVTGTLAEGAEASTVVSVKFAALPLI-RP 178
Query: 95 KKQNAQHFESNTAVCKVCTNNDSYIVR--CCVKGSLLEVNNRLIKQPGLLNSSADREGYI 152
KK ++H + T +D + R C+ G+++E N RL+ QP LL + GY+
Sbjct: 179 KKMESRHAQQPLTTLAEVTMSDGRVFRPRICIYGAIVEFNERLLAQPDLLRTHPTTAGYL 238
Query: 153 AIIMP 157
A+++P
Sbjct: 239 ALVIP 243
>gi|261327457|emb|CBH10432.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 197
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 80/163 (49%), Gaps = 22/163 (13%)
Query: 24 PSAIDFNFVAYFAPDFLK-----PGHDQYVYRHANGLCVIGLAPTHVAFKD--------E 70
PS D+ Y+ +K G++ + H+NG+CV+ L TH + E
Sbjct: 14 PSTFDYFTERYYHQYVVKNCKGVEGNNCRLLVHSNGICVLCLDETHRVVRAAKSSAGAVE 73
Query: 71 GGITAVDFNVGKSDR----SGFKVTGKRKKQNAQHFESNTAVCKVCTNNDS-YIVRCCVK 125
+ +V F G+ + V GKRKKQ A + +T +C + ++ + Y + CV
Sbjct: 74 TNVASVVFGSGRGNSQLSSGSIHVVGKRKKQAAV-CQVDTKICIITMSDGTVYHIPACVD 132
Query: 126 GSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSS 168
G +LE+N+ L + P LL + EGYIA+I P ++ K+K S
Sbjct: 133 GFVLELNSVLQQHPNLLLDAPTAEGYIALISP---NYSKVKFS 172
>gi|340053167|emb|CCC47454.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 211
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 71/150 (47%), Gaps = 19/150 (12%)
Query: 24 PSAIDFNFVAYFAPDFLKP-----GHDQYVYRHANGLCVIGLAPTHVAFK------DEGG 72
PS D+ Y+ +K G++ + H+NG+CV+ L +H A + +G
Sbjct: 14 PSTFDYFTERYYHQYLVKDCKGLDGNNCRLMVHSNGICVLCLDDSHAAVQAFRCSTGDGS 73
Query: 73 ITAVDFNVGKS------DRSGFKVTGKRKKQNAQHFESNTAVCKVC-TNNDSYIVRCCVK 125
+T +G V GKRK+ NA + +T +C + +N Y V CV
Sbjct: 74 VTVSSIVLGSGRGNSQIGSGKLHVVGKRKR-NAAVCQVDTKMCTITLSNGGVYCVPACVH 132
Query: 126 GSLLEVNNRLIKQPGLLNSSADREGYIAII 155
G +LE+N L+++P L + EGYIAII
Sbjct: 133 GYVLELNTALLERPALAVEAPSGEGYIAII 162
>gi|342180544|emb|CCC90020.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 206
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 77/158 (48%), Gaps = 25/158 (15%)
Query: 24 PSAIDFNFVAYFAPDFLK-----PGHDQYVYRHANGLCVIGLAPTHVAFK---------- 68
PS D+ Y+ +K G++ + H+NG+CV+ L THVA +
Sbjct: 17 PSTFDYFTERYYYQYVVKGCKGVEGNNCRLLVHSNGICVLCLDETHVAVEAAGVAAAAKS 76
Query: 69 ----DEGGITAVDFNVGKSDR----SGFKVTGKRKKQNAQHFESNTAVCKVC-TNNDSYI 119
+ G + +V F G+ + V+GKRK+ A +++T +C + T+ Y
Sbjct: 77 GGAAENGTVASVVFGSGRGNSQISSGNIHVSGKRKRH-AAICQADTNICLITMTSGAVYR 135
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
V CV G +LE+N+ L + P L+ + EGYIA+I P
Sbjct: 136 VPACVDGFVLELNSALQQHPNLVTHAPTTEGYIALISP 173
>gi|303281180|ref|XP_003059882.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458537|gb|EEH55834.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 248
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 79/180 (43%), Gaps = 37/180 (20%)
Query: 19 LPLTPPSAIDFNFVAYFAPD-FLKPGHDQYVYRHANGLCVIGLAP-----------THVA 66
L P + D + PD + DQ ++ H NGL V+ LAP T
Sbjct: 8 LATAPRTTRDRTYARIDVPDPEGRRERDQVLFHHPNGLVVVCLAPRVGGGATRDSKTTTV 67
Query: 67 FKDEGG---------------------ITAVDFNVGKS---DRSGFKVTGKRKKQNAQHF 102
D G I +VDF++GK + G +KK+ A+
Sbjct: 68 VVDRGADDADAGDLAPSASASASGAPSIVSVDFSLGKGKGHQSNDVVKGGGKKKRGAKLL 127
Query: 103 ESNTAVCKVCTNN-DSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPAD 161
+N+ V +V T++ S+ R C KG LLEVN RLI +PGL ++ EG++ ++ P+ D
Sbjct: 128 MANSGVARVSTDDGTSWTARACAKGRLLEVNERLIDEPGLASTDPLNEGFLCVVQPRAED 187
>gi|268574994|ref|XP_002642476.1| Hypothetical protein CBG06893 [Caenorhabditis briggsae]
Length = 153
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 55/87 (63%), Gaps = 5/87 (5%)
Query: 73 ITAVDFNVGK---SDRSGFKVTGKRKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSL 128
+ +VDF K +DRS +V+GK KK A ++++ +C C + ++VR V+G+L
Sbjct: 43 VVSVDFGSTKKHGADRSTNQVSGKGKK-GALILQTDSKLCTFKCKDGTEHVVRAGVRGTL 101
Query: 129 LEVNNRLIKQPGLLNSSADREGYIAII 155
+E+N+RL P L+ ++ D +G+IAII
Sbjct: 102 VEMNDRLKTHPDLIRTAPDNQGFIAII 128
>gi|308471873|ref|XP_003098166.1| hypothetical protein CRE_12143 [Caenorhabditis remanei]
gi|308269317|gb|EFP13270.1| hypothetical protein CRE_12143 [Caenorhabditis remanei]
Length = 158
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 65/117 (55%), Gaps = 11/117 (9%)
Query: 44 HDQYVY-RHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKS---DRSGFKVTGKRKKQNA 99
+D Y H +G+ V+ L E + VDF K+ DRS +V+GK KK A
Sbjct: 21 YDNIAYLHHPSGVTVVVLRNV-----PESEVIEVDFGKTKTHGADRSMNQVSGKGKK-GA 74
Query: 100 QHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
++++ +C C + ++VR V+G+L+E+N+RL P L+ ++ D +G+IAII
Sbjct: 75 LILQTDSKLCTFKCKDGSEHVVRAGVRGTLVEMNDRLKTHPDLIRTAPDNQGFIAII 131
>gi|341900788|gb|EGT56723.1| hypothetical protein CAEBREN_12763 [Caenorhabditis brenneri]
Length = 158
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 65/117 (55%), Gaps = 11/117 (9%)
Query: 44 HDQYVY-RHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK---SDRSGFKVTGKRKKQNA 99
+D Y H +G+ V+ L E + V+F K +DRS +V+GK KK A
Sbjct: 22 YDNIAYLHHPSGVTVVVLRNI-----PESEVVEVNFGNTKKHGADRSTNQVSGKGKK-GA 75
Query: 100 QHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
++++ +C C + ++VR V+G+L+E+N+RL P L+ ++ D +GYIAII
Sbjct: 76 LVLQTDSKLCTFKCKDGSEHVVRAGVRGTLVEMNDRLKTHPDLIRTAPDNQGYIAII 132
>gi|302832359|ref|XP_002947744.1| hypothetical protein VOLCADRAFT_116496 [Volvox carteri f.
nagariensis]
gi|300267092|gb|EFJ51277.1| hypothetical protein VOLCADRAFT_116496 [Volvox carteri f.
nagariensis]
Length = 309
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 76 VDFNVGKSDRS-GFKVTGKRKKQNAQHFESNTAVCKVCTNND--SYIVRCCVKGSLLEVN 132
V+F G + + G K +GK++ + Q+ + V S+ + + LLEVN
Sbjct: 195 VNFEAGAARQQIGTKTSGKKRHRGVQNLRGGALLAAVERRGGGASFPIWFVLNSQLLEVN 254
Query: 133 NRLIKQPGLLNSSADREGYIAIIMPKPADWLK-IKSSLLGLEDYKRKR 179
RL+K PGLL REGY+AI+ P AD LK +S L+ +Y R R
Sbjct: 255 ERLLKTPGLLYDRPCREGYVAILSPT-ADALKYFESRLVSEAEYLRLR 301
Score = 39.3 bits (90), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 19 LPLTPPSAIDFNFVAYFAPDFLKPG-HDQYVYRHANGLCVIGLAPTH 64
LP P + ++ FA D HDQYV+ NGL VIGLAP+H
Sbjct: 19 LPPRPLGLTETKYIRRFAIDLHGAKCHDQYVFLAPNGLAVIGLAPSH 65
>gi|17553608|ref|NP_498946.1| Protein F54F2.7 [Caenorhabditis elegans]
gi|21431875|sp|P34452.2|YMA7_CAEEL RecName: Full=Uncharacterized protein F54F2.7
gi|373219925|emb|CCD71241.1| Protein F54F2.7 [Caenorhabditis elegans]
Length = 157
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 63/117 (53%), Gaps = 11/117 (9%)
Query: 44 HDQYVY-RHANGLCVIGLAPTHVAFKDEGGITAVDFNVGK---SDRSGFKVTGKRKKQNA 99
+D Y H +G+ V+ L E + VDF K +DRS +V+GK KK A
Sbjct: 22 YDNIAYLHHPSGVTVVVLRNI-----PESEVVEVDFGTTKKHGADRSTNQVSGKGKK-GA 75
Query: 100 QHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
+ ++ +C C + ++VR V+G+L+E+N+RL P + ++ D +G+IAII
Sbjct: 76 LILQPDSKLCTFKCKDGSEHVVRAGVRGTLVEMNDRLKTTPDFIRTAPDNQGFIAII 132
>gi|407404389|gb|EKF29864.1| hypothetical protein MOQ_006334 [Trypanosoma cruzi marinkellei]
gi|407411582|gb|EKF33586.1| hypothetical protein MOQ_002543 [Trypanosoma cruzi marinkellei]
Length = 204
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 20/126 (15%)
Query: 51 HANGLCVIGLAPTHVAFK--------DEG------GITAVDFNVGKSDR----SGFKVTG 92
H+NGLCV+ L +H A + DE I +V F G+ + + G
Sbjct: 43 HSNGLCVLCLDESHAAIQSHRKSMCTDEALVASHPKIESVTFGSGRGNAKLAPQNVCLVG 102
Query: 93 KRKKQNAQHFESNTAVCKVC-TNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGY 151
KRKK NA + +T +C + T+ Y + CV G +LE+N + ++P LL ++ EG+
Sbjct: 103 KRKK-NATKCQEDTNICVITMTDGTLYKIPACVAGFVLELNPAVQERPELLVTAPHVEGF 161
Query: 152 IAIIMP 157
IAII P
Sbjct: 162 IAIISP 167
>gi|38048423|gb|AAR10114.1| similar to Drosophila melanogaster CG9288, partial [Drosophila
yakuba]
Length = 90
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 40/63 (63%)
Query: 116 DSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
+SY V C++G L+EVN L+++P LL + GY AI++PK + IK+SLL E Y
Sbjct: 4 ESYKVPSCIRGKLVEVNTALVEEPKLLEQLPEGAGYFAILLPKIENCDAIKASLLTQEQY 63
Query: 176 KRK 178
+ +
Sbjct: 64 EER 66
>gi|339245245|ref|XP_003378548.1| conserved hypothetical protein [Trichinella spiralis]
gi|316972530|gb|EFV56207.1| conserved hypothetical protein [Trichinella spiralis]
Length = 184
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 62/116 (53%), Gaps = 4/116 (3%)
Query: 41 KPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQ 100
K D + +H NG+C++ LAPTH + + I +++ ++D+ + K++
Sbjct: 34 KEDEDFCILQHPNGICIVCLAPTHPLLRYKAEICSIEL---RADKQAAQEKPLTKREKQI 90
Query: 101 HFESNTAVCKVCTNND-SYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
+ T +C++ +ND +++R CV G L+ N+ L++ LL + G++A+I
Sbjct: 91 RVDPKTILCRINLSNDKQFVIRACVFGKLINTNSLLLEHKQLLLDEPEWRGHVAVI 146
>gi|71657580|ref|XP_817304.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70882485|gb|EAN95453.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 201
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 62/126 (49%), Gaps = 20/126 (15%)
Query: 51 HANGLCVIGLAPTHVAFKDEGG--------------ITAVDFNVGKSDR----SGFKVTG 92
H+NGLCV+ L +H A + I +V F G+ + + G
Sbjct: 43 HSNGLCVLCLDGSHAAIQSHRKSLCTDEELDASHPRIESVTFGSGRGNAKLAPQNVCLVG 102
Query: 93 KRKKQNAQHFESNTAVCKVC-TNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGY 151
KRKK NA + +T +C + T+ Y + CV G +LE+N + ++P LL ++ EG+
Sbjct: 103 KRKK-NATKCQEDTNICVITMTDGTLYKIPACVAGFVLELNPTVQERPELLVTAPHVEGF 161
Query: 152 IAIIMP 157
IA+I P
Sbjct: 162 IAVISP 167
>gi|440790209|gb|ELR11493.1| hypothetical protein ACA1_027280 [Acanthamoeba castellanii str.
Neff]
Length = 114
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
Query: 24 PSAIDFNFVAYFAPDFLK----PGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFN 79
P I ++ YF F+ GHD YVY+H+N C+IG A TH ++ +T ++F
Sbjct: 18 PDVISRYYIQYF---FMNVDGIEGHDMYVYQHSNHNCIIGAAETHPIVREGKTVTHINFQ 74
Query: 80 VGKSDRSGFKVTGKRKK 96
GK D ++ GK KK
Sbjct: 75 PGKIDHLKLQIKGKGKK 91
>gi|407851000|gb|EKG05138.1| hypothetical protein TCSYLVIO_003793 [Trypanosoma cruzi]
Length = 203
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 62/126 (49%), Gaps = 20/126 (15%)
Query: 51 HANGLCVIGLAPTHVAFKDEGG--------------ITAVDFNVGKSDR----SGFKVTG 92
H+NGLCV+ + +H A + I +V F G+ + + G
Sbjct: 43 HSNGLCVLCVDGSHAAIQSHRKGLCTDEELDASHPRIESVTFGSGRGNAKLAPQNVCLLG 102
Query: 93 KRKKQNAQHFESNTAVCKVC-TNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGY 151
KRKK NA + +T +C + T+ Y + CV G +LE+N + ++P LL ++ EG+
Sbjct: 103 KRKK-NATKCQEDTNICVITMTDGTLYKIPACVAGFVLELNPAVQERPELLATAPHVEGF 161
Query: 152 IAIIMP 157
IA+I P
Sbjct: 162 IAVISP 167
>gi|256082246|ref|XP_002577370.1| impas 3 peptidase (A22 family) [Schistosoma mansoni]
gi|353230158|emb|CCD76329.1| impas 3 peptidase (A22 family) [Schistosoma mansoni]
Length = 819
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 64/153 (41%), Gaps = 37/153 (24%)
Query: 44 HDQYVYRHANGLCVIGLAPTHVAFKD------------EGGITAVDFNVGKS-DRSGFKV 90
+D + RH+N LCV+GLA H FK I +D+ V +RS KV
Sbjct: 587 NDICILRHSNTLCVVGLACGHHIFKRCSMNSSSDCSSVTHEILGLDYQVSNGLNRSKSKV 646
Query: 91 TGKRKK-QNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRL--IKQPGLLNSSAD 147
G+RK + H ++ +C + + C + G+LLEVNN L K L SS D
Sbjct: 647 KGRRKHGGHVLHKSTDIIAYAICDREIKHPLMCGIPGNLLEVNNSLDITKNESHLPSSED 706
Query: 148 REG---------------------YIAIIMPKP 159
Y++II+PKP
Sbjct: 707 VSSLKSKVKLTPLMGSGSTVNIGTYLSIILPKP 739
>gi|335280761|ref|XP_003122114.2| PREDICTED: hypothetical protein LOC100525629 [Sus scrofa]
Length = 166
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 35/62 (56%)
Query: 115 NDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLED 174
+ S I CV+G L+EVN ++ +P +L EGYIA+++PK + + LL +
Sbjct: 91 HHSLIQSSCVRGRLMEVNEDILHKPSILQEKPSTEGYIAVVLPKFEESKSVTEGLLTQQQ 150
Query: 175 YK 176
Y+
Sbjct: 151 YE 152
>gi|432091546|gb|ELK24571.1| hypothetical protein MDA_GLEAN10018143 [Myotis davidii]
Length = 130
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 32/54 (59%)
Query: 123 CVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
CV+G L+EVN ++ +P +L EGYIA+++PK + I LL + Y+
Sbjct: 63 CVRGRLMEVNENILHKPSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 116
>gi|410719282|gb|AFV80069.1| putative glycine cleavage system H protein [Spironucleus
salmonicida]
Length = 155
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 59/117 (50%), Gaps = 14/117 (11%)
Query: 51 HANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCK 110
H NG+ V+ ++P++ + G I++V F + K+ GK K NA + N + +
Sbjct: 24 HRNGILVLFMSPSYT--QTFGKISSVVF----KNFDLQKIVGKNK-HNAHFVKINETLVQ 76
Query: 111 VCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKS 167
+ + V+ V LLE N RL +P LL + R GY+A+ PK W KI++
Sbjct: 77 I----NGIEVKSPVSFKLLEQNTRLENEPKLLENDCLRTGYLAVFAPK---WEKIET 126
>gi|254579431|ref|XP_002495701.1| ZYRO0C00902p [Zygosaccharomyces rouxii]
gi|238938592|emb|CAR26768.1| ZYRO0C00902p [Zygosaccharomyces rouxii]
Length = 167
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 58/136 (42%), Gaps = 19/136 (13%)
Query: 46 QYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESN 105
+++ HA+G +G+ T A G T V+ + E+
Sbjct: 49 EWLAAHADGTAFLGI--TKYASDALGDATYVEL-----------------PEKETQLEAG 89
Query: 106 TAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKI 165
A V + + V V G ++E N++L QP L+NS EG+IA I D L+
Sbjct: 90 EAYGSVESVKSASEVYQPVAGEIIEANDKLESQPQLINSDPLGEGWIARIKIAEPDSLET 149
Query: 166 KSSLLGLEDYKRKREE 181
+ LL LE Y+R +E
Sbjct: 150 QEGLLTLEQYERSLKE 165
>gi|358253828|dbj|GAA53827.1| UPF0436 protein C9orf6 homolog [Clonorchis sinensis]
Length = 307
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 52/114 (45%), Gaps = 15/114 (13%)
Query: 41 KPGHDQYVYRHANGLCVIGLAPTHVAF---KDEGGIT----------AVDFNVG-KSDRS 86
+P +D + RH+N LC +GLAP H F K++ + VDF + K R
Sbjct: 76 EPFNDICILRHSNTLCTVGLAPAHHLFMGCKEQLSVNGNSLTPHNLLGVDFKISDKVHRL 135
Query: 87 GFKVTGKRKKQNAQHFES-NTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQP 139
K++G+RK S N VC + + + C + G LLEVN L P
Sbjct: 136 HNKMSGRRKHGGQTVGRSPNVLAFAVCDRSVRHKLMCGIPGRLLEVNKALSYPP 189
>gi|737595|prf||1923203A H protein
Length = 131
Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 56/123 (45%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
AP+H K EG + + D G V + + + AV V +D
Sbjct: 10 APSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVXATSD-- 67
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN L +PGL+NSS +G++ I P D L+ SLLG ++Y +
Sbjct: 68 -VNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELE---SLLGAKEYTKF 123
Query: 179 REE 181
EE
Sbjct: 124 CEE 126
>gi|1065301|pdb|1HPC|A Chain A, Refined Structures At 2 Angstroms And 2.2 Angstroms Of The
Two Forms Of The H-Protein, A Lipoamide-Containing
Protein Of The Glycine Decarboxylase
gi|1065302|pdb|1HPC|B Chain B, Refined Structures At 2 Angstroms And 2.2 Angstroms Of The
Two Forms Of The H-Protein, A Lipoamide-Containing
Protein Of The Glycine Decarboxylase
gi|9955325|pdb|1DXM|A Chain A, Reduced Form Of The H Protein From Glycine Decarboxylase
Complex
gi|9955326|pdb|1DXM|B Chain B, Reduced Form Of The H Protein From Glycine Decarboxylase
Complex
gi|157831395|pdb|1HTP|A Chain A, Refined Structures At 2 Angstroms And 2.2 Angstroms Of The
Two Forms Of The H-Protein, A Lipoamide-Containing
Protein Of The Glycine Decarboxylase Complex
Length = 131
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 56/123 (45%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
AP+H K EG + + D G V + + + AV V +D
Sbjct: 10 APSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSD-- 67
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN L +PGL+NSS +G++ I P D L+ SLLG ++Y +
Sbjct: 68 -VNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELE---SLLGAKEYTKF 123
Query: 179 REE 181
EE
Sbjct: 124 CEE 126
>gi|121080|sp|P16048.1|GCSH_PEA RecName: Full=Glycine cleavage system H protein, mitochondrial;
Flags: Precursor
gi|20737|emb|CAA45978.1| H protein [Pisum sativum]
gi|169093|gb|AAA33668.1| H-protein of glycine decarboxylase precursor (EC 2.1.2.10) [Pisum
sativum]
gi|287815|emb|CAA37704.1| H-protein [Pisum sativum]
gi|62700762|emb|CAI79404.1| H-protein [Pisum fulvum]
gi|68609706|emb|CAJ13415.1| H subunit of glycine decarboxylase multi-enzyme complex [Pisum
fulvum]
gi|68609718|emb|CAJ13721.1| H-protein of the glycine decarboxylase multi-enzyme complex [Pisum
sativum var. pumilio]
gi|68609723|emb|CAJ13722.1| H-protein of the glycine decarboxylase multi-enzyme complex [Pisum
sativum var. pumilio]
gi|68609733|emb|CAJ13724.1| H-protein of the glycine decarboxylase multi-enzyme complex [Pisum
sativum]
gi|68609738|emb|CAJ13725.1| H-protein of the glycine decarboxylase multi-enzyme complex [Pisum
sativum]
gi|68609743|emb|CAJ13726.1| H-protein of the glycine decarboxylase multi-enzyme complex [Pisum
sativum]
gi|68609748|emb|CAJ13727.1| H protein of the glycine decarboxylase multi-enzyme complex [Pisum
fulvum]
gi|68609752|emb|CAJ13728.1| H protein of the glycine decarboxylase multi-enzyme complex [Pisum
fulvum]
gi|68609757|emb|CAJ13729.1| H-protein, glycine decarboxylase multi-enzyme complex [Pisum
sativum subsp. elatius]
gi|68609761|emb|CAJ13730.1| H-protein, glycine decarboxylase multi-enzyme complex [Pisum
sativum subsp. elatius]
gi|68609766|emb|CAJ13731.1| H-protein, glycine decarboxylase multi-enzyme complex [Pisum
sativum subsp. elatius]
gi|68609771|emb|CAJ13732.1| H-protein, glycine decarboxylase multi-enzyme complex [Pisum
sativum subsp. elatius]
gi|68609776|emb|CAJ13733.1| H-protein from the glycine decarboxylase multi-enzyme complex
[Pisum abyssinicum]
gi|68609784|emb|CAJ13734.1| H-protein from the glycine decarboxylase multi-enzyme complex
[Pisum abyssinicum]
gi|68609789|emb|CAJ13735.1| H-protein from the glycine decarboxylase multi-enzyme complex
[Pisum abyssinicum]
gi|68609794|emb|CAJ13736.1| H-protein from the glycine decarboxylase multi-enzyme complex
[Pisum abyssinicum]
gi|68638161|emb|CAJ13828.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638168|emb|CAJ13829.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638175|emb|CAJ13830.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638183|emb|CAJ13831.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638189|emb|CAJ13832.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638194|emb|CAJ13833.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum subsp. asiaticum]
gi|68638203|emb|CAJ13834.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638210|emb|CAJ13835.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638215|emb|CAJ13836.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum var. tibetanicum]
gi|68638223|emb|CAJ13837.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum subsp. transcaucasicum]
gi|68638228|emb|CAJ13838.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum subsp. transcaucasicum]
gi|68638233|emb|CAJ13839.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
gi|68638237|emb|CAJ13840.1| glycine decarboxylase multi-enzyme complex, H subunit [Pisum
sativum]
Length = 165
Score = 45.1 bits (105), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 56/123 (45%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
AP+H K EG + + D G V + + + AV V +D
Sbjct: 44 APSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN L +PGL+NSS +G++ I P D L+ SLLG ++Y +
Sbjct: 102 -VNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELE---SLLGAKEYTKF 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|297823331|ref|XP_002879548.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297325387|gb|EFH55807.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 161
Score = 44.7 bits (104), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 64/133 (48%), Gaps = 10/133 (7%)
Query: 52 ANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQN--AQHFESNTAVC 109
+ GL + A +H K EG + + D G V + ++N A +S AV
Sbjct: 31 STGLEGLKYANSHEWVKHEGSVATIGITAHAQDHLGEVVFVELPEENTSATKEKSFGAVE 90
Query: 110 KVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP-KPADWLKIKSS 168
V ++ + + G ++EVN +L + PGL+NSS +G++ + P PA+ S
Sbjct: 91 SVKATSE---ILSPISGEIIEVNKKLTESPGLINSSPYEDGWMIKVKPSSPAE----LES 143
Query: 169 LLGLEDYKRKREE 181
L+G ++Y + EE
Sbjct: 144 LMGPKEYTKFCEE 156
>gi|68609728|emb|CAJ13723.1| H-protein of the glycine decarboxylase multi-enzyme complex [Pisum
sativum var. pumilio]
Length = 165
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 56/123 (45%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
AP+H K EG + + D G V + + + AV V +D
Sbjct: 44 APSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN L +PGL+NSS +G++ I P D L+ SLLG ++Y +
Sbjct: 102 -VNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTIPDELE---SLLGAKEYTKF 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|217075032|gb|ACJ85876.1| unknown [Medicago truncatula]
Length = 166
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 61/123 (49%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + + + N AV V +D
Sbjct: 45 ASSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGGSVTKGNGFGAVESVKATSD-- 102
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN++L +PGL+NSS +G++ I KP+D +++ SLLG ++Y +
Sbjct: 103 -VNSPISGEIVEVNDKLTGKPGLINSSPYEDGWMIKI--KPSDPSELE-SLLGAKEYTKF 158
Query: 179 REE 181
EE
Sbjct: 159 CEE 161
>gi|357482357|ref|XP_003611464.1| Glycine cleavage system H protein [Medicago truncatula]
gi|355512799|gb|AES94422.1| Glycine cleavage system H protein [Medicago truncatula]
gi|388518315|gb|AFK47219.1| unknown [Medicago truncatula]
Length = 166
Score = 43.9 bits (102), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 61/123 (49%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + + + N AV V +D
Sbjct: 45 ASSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGGSVTKGNGFGAVESVKATSD-- 102
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN++L +PGL+NSS +G++ I KP+D +++ SLLG ++Y +
Sbjct: 103 -VNSPISGEIVEVNDKLTGKPGLINSSPYEDGWMIKI--KPSDPSELE-SLLGAKEYTKF 158
Query: 179 REE 181
EE
Sbjct: 159 CEE 161
>gi|357482359|ref|XP_003611465.1| Glycine cleavage system H protein [Medicago truncatula]
gi|355512800|gb|AES94423.1| Glycine cleavage system H protein [Medicago truncatula]
Length = 156
Score = 43.5 bits (101), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 61/123 (49%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + + + N AV V +D
Sbjct: 35 ASSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGGSVTKGNGFGAVESVKATSD-- 92
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN++L +PGL+NSS +G+ +I KP+D +++ SLLG ++Y +
Sbjct: 93 -VNSPISGEIVEVNDKLTGKPGLINSSPYEDGW--MIKIKPSDPSELE-SLLGAKEYTKF 148
Query: 179 REE 181
EE
Sbjct: 149 CEE 151
>gi|206889383|ref|YP_002248836.1| glycine cleavage system H protein [Thermodesulfovibrio yellowstonii
DSM 11347]
gi|206741321|gb|ACI20378.1| glycine cleavage system H protein [Thermodesulfovibrio yellowstonii
DSM 11347]
Length = 130
Score = 43.5 bits (101), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 101 HFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII-MPKP 159
H E++T + ++ + S V V G+++E+N LI P ++N +G+IA+I M P
Sbjct: 50 HVEADTEMAEIESTKTSSPVIAPVSGTIVEINEELIDHPEIINEDPYGKGWIAVIEMDNP 109
Query: 160 ADWLKIKSSLLGLEDYKRKREE 181
+ L+ EDY+ EE
Sbjct: 110 REL----EDLMDYEDYESYLEE 127
>gi|308158679|gb|EFO61246.1| Hypothetical protein GLP15_63 [Giardia lamblia P15]
Length = 207
Score = 43.1 bits (100), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 58/119 (48%), Gaps = 16/119 (13%)
Query: 51 HANGLCVIGLAPTH--VAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFE----- 103
HAN + ++ + P H ++ D+ IT ++ G + ++ ++G+RKK F
Sbjct: 30 HANSVGLLCIGPDHPLLSLPDDWAITKASYSDG-AHQALATMSGRRKKGAGVLFPPAPLL 88
Query: 104 --SNTAVCK------VCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAI 154
+ T VC V N S ++ ++ L+E+N+ L + P LL +S G++AI
Sbjct: 89 TVTLTRVCSPGSGDDVTPNTLSIVIYLPIRLKLIEINDSLERFPWLLKTSHTNRGFLAI 147
>gi|336388366|gb|EGO29510.1| hypothetical protein SERLADRAFT_457336 [Serpula lacrymans var.
lacrymans S7.9]
Length = 159
Score = 43.1 bits (100), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 38/77 (49%), Gaps = 12/77 (15%)
Query: 107 AVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGY---IAIIMPKPADWL 163
AV V +D Y V G +LEVN RL +PGLLN S + EG+ I + P D
Sbjct: 91 AVESVKAASDIY---APVSGEVLEVNERLNTEPGLLNRSPEDEGWLCRIKLSTPSEVD-- 145
Query: 164 KIKSSLLGLEDYKRKRE 180
SLL LE YK E
Sbjct: 146 ----SLLTLEAYKETFE 158
>gi|336375307|gb|EGO03643.1| hypothetical protein SERLA73DRAFT_130037 [Serpula lacrymans var.
lacrymans S7.3]
Length = 156
Score = 43.1 bits (100), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 38/77 (49%), Gaps = 12/77 (15%)
Query: 107 AVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGY---IAIIMPKPADWL 163
AV V +D Y V G +LEVN RL +PGLLN S + EG+ I + P D
Sbjct: 88 AVESVKAASDIY---APVSGEVLEVNERLNTEPGLLNRSPEDEGWLCRIKLSTPSEVD-- 142
Query: 164 KIKSSLLGLEDYKRKRE 180
SLL LE YK E
Sbjct: 143 ----SLLTLEAYKETFE 155
>gi|124512808|ref|XP_001349760.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|23615177|emb|CAD52167.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 320
Score = 42.7 bits (99), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 41/206 (19%), Positives = 86/206 (41%), Gaps = 32/206 (15%)
Query: 4 DEEELHKL--LLPNVHDLPLTPPSAIDFN----FVAYFAPDF------LKPGHDQYVYRH 51
DEE + KL LL HD + + +F+ + +++ F ++ +D ++ +H
Sbjct: 108 DEEFIDKLSSLLKLNHDTNIIIENKKNFDTLYKYYSFYDDTFNTSCFYIQDDYDLFLSKH 167
Query: 52 ANGLCVIGLAPTHVAFK----DEGGITAVDFNVGKSDRSGFKV-----------TGKRKK 96
NGL ++ ++ K D + +N+ + ++ +GKRK
Sbjct: 168 VNGLYILSISKYSNILKNILSDLINMNNDKYNLKDLIKQYVQIIFDPRLINADTSGKRKT 227
Query: 97 QNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIM 156
+ + + + N+ Y ++ +KG ++N ++ P LL + EG++ I+
Sbjct: 228 KTI-FINDDMTILNIIYKNNIYKIKSKIKGYYYDINQNIVHNPYLLFYNTQDEGWLIIVK 286
Query: 157 PKPADWLKIKSSLLGLEDYKRKREEC 182
K D K + DY K++ C
Sbjct: 287 NKNTDLTK----FVEATDYYNKKKLC 308
>gi|159472895|ref|XP_001694580.1| COP-II coat subunit [Chlamydomonas reinhardtii]
gi|158276804|gb|EDP02575.1| COP-II coat subunit [Chlamydomonas reinhardtii]
Length = 825
Score = 42.7 bits (99), Expect = 0.070, Method: Composition-based stats.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 19 LPLTPPSAIDFNFVAYFAPDFL-KPGHDQYVYRHANGLCVIGLAPTH 64
LP P + + ++ FA D + HDQYV+ NGL V+GLAP+H
Sbjct: 700 LPPRPHTFTETKYLRLFAVDLAGRSNHDQYVFLAPNGLAVVGLAPSH 746
>gi|270003496|gb|EEZ99943.1| hypothetical protein TcasGA2_TC002739 [Tribolium castaneum]
Length = 87
Score = 42.4 bits (98), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 39/65 (60%), Gaps = 2/65 (3%)
Query: 94 RKKQNAQHFESNTAVCKVCTNN-DSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYI 152
+ K+ AQ + + +C + T + + Y V + G LLE+N+RL + P LL + EG++
Sbjct: 4 KGKRGAQLLQPESVLCFIETEDGEKYPVHAGIYGKLLEINDRLSENPNLLVDDLN-EGFL 62
Query: 153 AIIMP 157
A+I+P
Sbjct: 63 AVILP 67
>gi|405964367|gb|EKC29864.1| Telomere-associated protein RIF1 [Crassostrea gigas]
Length = 2619
Score = 42.4 bits (98), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 42/72 (58%), Gaps = 2/72 (2%)
Query: 55 LCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKV-CT 113
+C++ +A H +++ I+ + ++V +R K++GK K+ AQ ++ +C+V CT
Sbjct: 2521 ICIVTVAEWHPLLREKKTISQISYDVEGVNRMDNKISGKGKR-GAQLLTPSSPLCEVTCT 2579
Query: 114 NNDSYIVRCCVK 125
+ YI+ C V+
Sbjct: 2580 DGSKYIITCGVR 2591
>gi|388509560|gb|AFK42846.1| unknown [Lotus japonicus]
Length = 166
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 57/127 (44%), Gaps = 16/127 (12%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKV------TGKRKKQNAQHFESNTAVCKVCTN 114
A +H K EG + + D G V +GK + + AV V
Sbjct: 45 ASSHEWVKHEGSVATIGITDHAQDHLGEAVFVELPESGKPVTKG----DGFGAVESVKAT 100
Query: 115 NDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLED 174
+D + + G ++EVN +L +QPGL+N+S +G++ + P L SLLG ++
Sbjct: 101 SD---INSPISGEIIEVNTKLSEQPGLINTSPYEDGWMIKVKPSSPSELD---SLLGPQE 154
Query: 175 YKRKREE 181
Y + EE
Sbjct: 155 YTKFCEE 161
>gi|146454538|gb|ABQ41935.1| glycine cleavage system H protein [Sonneratia caseolaris]
Length = 129
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EGG+ + D G V + Q + + AV V +D
Sbjct: 30 AESHEWIKHEGGVATIGITDHAQDHLGEVVFVDLPEPGVQVSQGGSFGAVESVKATSD-- 87
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
V V G ++EVN +L ++PGL+NSS +G++ I P
Sbjct: 88 -VNSPVSGKIMEVNTQLTEKPGLVNSSPYEDGWMIKIKP 125
>gi|146454540|gb|ABQ41936.1| glycine cleavage system H protein [Sonneratia ovata]
Length = 129
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EGG+ + D G V + Q + + AV V +D
Sbjct: 30 AESHEWVKHEGGVATIGITDHAQDHLGEVVFVDLPEPGVQVSQGGSFGAVESVKATSD-- 87
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
V V G ++EVN +L ++PGL+NSS +G++ I P
Sbjct: 88 -VNSPVSGKIMEVNTQLTEKPGLVNSSPYEDGWMIKIKP 125
>gi|146454542|gb|ABQ41937.1| glycine cleavage system H protein [Sonneratia apetala]
Length = 129
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EGG+ + D G V + Q + + AV V +D
Sbjct: 30 AESHEWVKHEGGVATIGITDHAQDHLGEVVFVDLPEPGVQVSQGGSFGAVESVKATSD-- 87
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
V V G ++EVN +L ++PGL+NSS +G++ I P
Sbjct: 88 -VNSPVSGKIMEVNTQLTEKPGLVNSSPYEDGWMIKIKP 125
>gi|3334201|sp|P93255.1|GCSH_MESCR RecName: Full=Glycine cleavage system H protein, mitochondrial;
Flags: Precursor
gi|1724106|gb|AAB38501.1| glycine cleavage system protein H precursor [Mesembryanthemum
crystallinum]
Length = 163
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 5/63 (7%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPK-PADWLKIKSSLLGLEDYKRK 178
V + G ++EVN +L PGLLNSS EG++ + P PA+ SL+G ++Y +
Sbjct: 101 VNSPISGEIVEVNTKLSDSPGLLNSSPYEEGWMVKVKPSNPAE----LESLMGSKEYTKF 156
Query: 179 REE 181
EE
Sbjct: 157 CEE 159
>gi|419759601|ref|ZP_14285893.1| glycine cleavage system protein H [Thermosipho africanus
H17ap60334]
gi|407515365|gb|EKF50128.1| glycine cleavage system protein H [Thermosipho africanus
H17ap60334]
Length = 118
Score = 41.2 bits (95), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 48/104 (46%), Gaps = 21/104 (20%)
Query: 53 NGLCVIGLAPTHVAFKDEGGITAVDF-NVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKV 111
NG+ +IG++ A +D G +T VD VGK E +C +
Sbjct: 15 NGIGIIGISKR--AAEDLGDVTYVDLPEVGKV------------------IEKGEQLCTI 54
Query: 112 CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
+ + V V G ++EVN +L P L++ SA+ EG+IA I
Sbjct: 55 ESVKAASSVYAPVSGKIIEVNEKLDTTPELISDSAEEEGWIAKI 98
>gi|349603825|gb|AEP99551.1| UPF0436 protein C9orf6-like protein, partial [Equus caballus]
Length = 65
Score = 41.2 bits (95), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 29/51 (56%)
Query: 126 GSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYK 176
G L+EVN ++ +P +L EGYIA+++PK + I LL + Y+
Sbjct: 1 GRLMEVNENILHKPSILQEKPSTEGYIAVVLPKFEESKSITEGLLTQKQYE 51
>gi|392597329|gb|EIW86651.1| glycine cleavage H-protein [Coniophora puteana RWD-64-598 SS2]
Length = 125
Score = 41.2 bits (95), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 6/74 (8%)
Query: 107 AVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIK 166
AV V +D Y V G ++EVN RL +P LLN S + EG++ I L+ +
Sbjct: 57 AVESVKAASDIY---APVSGEVVEVNERLNTEPSLLNKSPEDEGWLCKIKLSA---LEER 110
Query: 167 SSLLGLEDYKRKRE 180
+LL EDYK+ E
Sbjct: 111 DALLSAEDYKKSYE 124
>gi|118430834|gb|AAQ67414.2| mitochondrial glycine decarboxylase complex H-protein [Populus
tremuloides]
Length = 165
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSG--FKVTGKRKKQNAQHFESNTAVCKVCTNNDSY 118
A +H K EG + + D G V + + +S AV V +D
Sbjct: 45 ASSHEWVKHEGPVATIGITDHAQDHLGEVVFVDLPEPEGSVSQGKSFGAVESVKATSD-- 102
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+ + G ++EVN +L + PGL+N S EG+ +I KP+D +++ SLLG ++Y +
Sbjct: 103 -INSPISGEIVEVNTKLSETPGLINKSPYEEGW--MIKVKPSDPSELQ-SLLGPKEYTKF 158
Query: 179 REE 181
EE
Sbjct: 159 CEE 161
>gi|15226973|ref|NP_181080.1| glycine cleavage system H protein 1 [Arabidopsis thaliana]
gi|121075|sp|P25855.1|GCSH1_ARATH RecName: Full=Glycine cleavage system H protein 1, mitochondrial;
Flags: Precursor
gi|166725|gb|AAA32802.1| H-Protein precursor [Arabidopsis thaliana]
gi|861215|gb|AAA87942.1| glycine decarboxylase complex H-protein precursor [Arabidopsis
thaliana]
gi|3608151|gb|AAC36184.1| glycine decarboxylase complex H-protein [Arabidopsis thaliana]
gi|15215833|gb|AAK91461.1| At2g35370/T32F12.25 [Arabidopsis thaliana]
gi|16604472|gb|AAL24242.1| At2g35370/T32F12.25 [Arabidopsis thaliana]
gi|20453253|gb|AAM19865.1| At2g35370/T32F12.25 [Arabidopsis thaliana]
gi|21592462|gb|AAM64413.1| glycine decarboxylase complex H-protein [Arabidopsis thaliana]
gi|110736863|dbj|BAF00389.1| glycine decarboxylase complex H-protein [Arabidopsis thaliana]
gi|330254006|gb|AEC09100.1| glycine cleavage system H protein 1 [Arabidopsis thaliana]
gi|445119|prf||1908425A Gly decarboxylase:SUBUNIT=H protein
Length = 165
Score = 40.8 bits (94), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 58/130 (44%), Gaps = 22/130 (16%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKV--------TGKRKKQNAQHFESNTAVCKVC 112
A +H K EG + + D G V T K+++ ES A ++
Sbjct: 44 ANSHEWVKHEGSVATIGITAHAQDHLGEVVFVELPEDNTSVSKEKSFGAVESVKATSEIL 103
Query: 113 TNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP-KPADWLKIKSSLLG 171
+ + G ++EVN +L + PGL+NSS +G++ + P PA+ SL+G
Sbjct: 104 SP---------ISGEIIEVNKKLTESPGLINSSPYEDGWMIKVKPSSPAE----LESLMG 150
Query: 172 LEDYKRKREE 181
++Y + EE
Sbjct: 151 PKEYTKFCEE 160
>gi|384485498|gb|EIE77678.1| hypothetical protein RO3G_02382 [Rhizopus delemar RA 99-880]
Length = 142
Score = 40.8 bits (94), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 23/119 (19%)
Query: 45 DQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFES 104
D YV + N +CV+GL +K ++ FN ++ G++ K++
Sbjct: 33 DVYVRQAPNKICVLGLLEPSRDYK------SIKFNT--------ELIGEKIKRD------ 72
Query: 105 NTAVCKVCTNNDSYI--VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPAD 161
T +C++ + V+ ++G LLE++ L+ LL + + G+IA+IMPK D
Sbjct: 73 -TVLCELLDGEGQTVASVKAHMEGKLLELHTELVDNLDLLFNRSLDHGFIAVIMPKHED 130
>gi|134142794|gb|ABO61731.1| mitochondrial glycine decarboxylase complex H-protein [Populus
tremuloides]
Length = 163
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSG--FKVTGKRKKQNAQHFESNTAVCKVCTNNDSY 118
A +H K EG + + D G V + +S AV V +D
Sbjct: 44 ATSHEWVKHEGSVATIGITDHAQDHLGEVVFVDLPEADDSVSQGKSFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+ + G ++EVN +L + PGL+N S EG+ +I KP++ +I+ SLLG ++Y +
Sbjct: 102 -INSPISGEIVEVNTQLSETPGLINKSPYEEGW--MIKVKPSNPSEIE-SLLGPKEYTKL 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|85001179|ref|XP_955308.1| hypothetical protein [Theileria annulata]
gi|65303454|emb|CAI75832.1| hypothetical protein TA18595 [Theileria annulata]
Length = 235
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 7/106 (6%)
Query: 45 DQYVYRHANGLCVIGLAPTHVAFKD-EGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFE 103
D Y Y NG+ + GL TH K+ G + FN + KR+K+ Q E
Sbjct: 114 DYYCYSLQNGIVIFGLNSTHALIKEISGNNVKITFNELMCKNISTESRAKRRKKEIQTPE 173
Query: 104 SNTAVCKVCTNNDSYIVRC---CVKGSLLEVNNRLIKQPGLLNSSA 146
VC+V + +IV+ + ++E+N RL +P L+ +
Sbjct: 174 E---VCEVEVGSKKFIVKLPKSLLNLEVVELNTRLENEPELITTEV 216
>gi|258545818|ref|ZP_05706052.1| glycine cleavage system H protein [Cardiobacterium hominis ATCC
15826]
gi|258518962|gb|EEV87821.1| glycine cleavage system H protein [Cardiobacterium hominis ATCC
15826]
Length = 121
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 24/33 (72%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYI 152
V+ V G ++EVN L+ QP LLNS+A+ EG++
Sbjct: 63 VKSPVSGEVIEVNQALVDQPNLLNSAAESEGWV 95
>gi|392423134|ref|YP_006459738.1| glycine cleavage system H protein [Pseudomonas stutzeri CCUG 29243]
gi|390985322|gb|AFM35315.1| glycine cleavage system H protein [Pseudomonas stutzeri CCUG 29243]
Length = 154
Score = 40.4 bits (93), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 54/132 (40%), Gaps = 23/132 (17%)
Query: 37 PDFLK--PGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKR 94
PD L+ P H ++ A+G IGL A G I F T KR
Sbjct: 9 PDSLRYAPEHSLWLREEADGSLTIGLTAYGCALY--GQI--------------FAFTPKR 52
Query: 95 KKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAI 154
+ E + V + S R V G LL VN L+++P L+N EG++
Sbjct: 53 VGARIER-ERSFGVVEFAKAASS--ARSPVSGELLAVNEALLERPALINQDCYGEGWMVR 109
Query: 155 IMPKPADWLKIK 166
+ +PADW ++
Sbjct: 110 L--QPADWATVR 119
>gi|29841039|gb|AAP06052.1| similar to GenBank Accession Number BC015795 hypothetical protein
FLJ20457 in Homo sapiens; hypothetical protein FLJ20457
in Homo sapiens [Schistosoma japonicum]
Length = 296
Score = 40.4 bits (93), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 49/107 (45%), Gaps = 15/107 (14%)
Query: 44 HDQYVYRHANGLCVIGLAPTHVAFK--------DEGG-----ITAVDFNVGKS-DRSGFK 89
+D V RH+N LCV+GLA H FK D+ I +D+ V +R K
Sbjct: 64 NDICVLRHSNTLCVVGLAYGHHIFKRCTTNSSGDDSASKTHEILGLDYQVSSGLNRLKNK 123
Query: 90 VTGKRKKQNAQHFESNTAVC-KVCTNNDSYIVRCCVKGSLLEVNNRL 135
V G+RK +S + VC + V C + G+LLEVN L
Sbjct: 124 VKGRRKHGGHVLSKSTDILAYAVCDRGIRHPVVCGIPGNLLEVNGFL 170
>gi|3688309|emb|CAB16914.1| H-Protein precursor [Flaveria pringlei]
Length = 162
Score = 40.0 bits (92), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN +L + PGL+NSS +G++ + P L+ SL+G ++Y +
Sbjct: 99 VNSPISGEIVEVNTKLSETPGLINSSPYEDGWMIKVKPSNPSELE---SLMGAKEYTKFC 155
Query: 180 EE 181
EE
Sbjct: 156 EE 157
>gi|407923430|gb|EKG16501.1| Glycine cleavage H-protein [Macrophomina phaseolina MS6]
Length = 173
Score = 40.0 bits (92), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 45/81 (55%), Gaps = 4/81 (4%)
Query: 102 FESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII-MPKPA 160
F ++ A+ V + + + + G ++EVN L ++P +N S + EG+IA + + + A
Sbjct: 96 FAAHDAIGAVESVKSASDILTPLPGKVVEVNRVLEEKPATINKSPEGEGWIAKLELAEGA 155
Query: 161 DWLKIKSSLLGLEDYKRKREE 181
D I L+ +EDYK+ EE
Sbjct: 156 D---INEGLMSVEDYKKFTEE 173
>gi|547500|emb|CAA85759.1| H-protein [Flaveria pringlei]
Length = 161
Score = 40.0 bits (92), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN +L + PGL+NSS +G++ + P L+ SL+G ++Y +
Sbjct: 98 VNSPISGEIVEVNTKLSETPGLINSSPYEDGWMIKVKPSNPSELE---SLMGAKEYTKFC 154
Query: 180 EE 181
EE
Sbjct: 155 EE 156
>gi|224138868|ref|XP_002326710.1| precursor of carboxylase h-protein 4, glycine decarboxylase complex
[Populus trichocarpa]
gi|118489400|gb|ABK96504.1| unknown [Populus trichocarpa x Populus deltoides]
gi|222834032|gb|EEE72509.1| precursor of carboxylase h-protein 4, glycine decarboxylase complex
[Populus trichocarpa]
Length = 163
Score = 39.7 bits (91), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSG--FKVTGKRKKQNAQHFESNTAVCKVCTNNDSY 118
A +H K EG + + D G V + +S AV V +D
Sbjct: 44 ATSHEWVKHEGSLATIGITDHAQDHLGEVVFVDLPEPDGSVSQGKSFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+ + G ++EVN +L + PGL+N S EG++ + P L+ SLLG ++Y +
Sbjct: 102 -INSPISGEIVEVNTQLSETPGLINKSPYEEGWMIKVKPSNPSELE---SLLGAKEYTKL 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|225427234|ref|XP_002280707.1| PREDICTED: glycine cleavage system H protein, mitochondrial isoform
1 [Vitis vinifera]
gi|297742112|emb|CBI33899.3| unnamed protein product [Vitis vinifera]
Length = 165
Score = 39.7 bits (91), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + + + N AV V +D
Sbjct: 44 ATSHEWVKHEGSVATIGITDHAQDHLGEVVFVEIPEPGGSVSKGNGFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN++L + PGL+NSS +G++ + P L+ SL+G ++Y +
Sbjct: 102 -VNSPISGEVVEVNSKLSEAPGLINSSPYEDGWMIKVKPSNPSELE---SLMGPKEYTKF 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|437993|emb|CAA81073.1| H-protein [Flaveria cronquistii]
gi|437999|emb|CAA81074.1| H-protein [Flaveria pringlei]
gi|3688299|emb|CAB16912.1| H-protein [Flaveria pringlei]
Length = 162
Score = 39.3 bits (90), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN++L + PGL+NSS +G++ + P L SL+G ++Y +
Sbjct: 99 VNSPISGEIVEVNSKLSETPGLINSSPYEDGWMIKVKPSNPSELD---SLMGAKEYTKFC 155
Query: 180 EE 181
EE
Sbjct: 156 EE 157
>gi|88812453|ref|ZP_01127702.1| glycine cleavage system protein H [Nitrococcus mobilis Nb-231]
gi|88790239|gb|EAR21357.1| glycine cleavage system protein H [Nitrococcus mobilis Nb-231]
Length = 130
Score = 39.3 bits (90), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 10/86 (11%)
Query: 91 TGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREG 150
G+R K E+ V V +D+Y V G +++ N+RL ++P L+N S EG
Sbjct: 48 VGRRLKAG----EACAVVESVKAASDTY---APVAGEVVDTNSRLAEEPELINRSPYDEG 100
Query: 151 YIAIIMPKPADWLKIKSSLLGLEDYK 176
++ I P A+ L+ SLL EDY+
Sbjct: 101 WLMRIQPDDANALE---SLLDPEDYE 123
>gi|2887286|emb|CAB16710.1| H protein [Flaveria anomala]
Length = 162
Score = 39.3 bits (90), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN++L + PGL+NSS +G++ + P L SL+G ++Y +
Sbjct: 99 VNSPISGEIVEVNSKLSETPGLINSSPYEDGWMIKVKPSNPSELD---SLMGAKEYTKFC 155
Query: 180 EE 181
EE
Sbjct: 156 EE 157
>gi|2499417|sp|Q39732.1|GCSH_FLAAN RecName: Full=Glycine cleavage system H protein, mitochondrial;
Flags: Precursor
gi|547558|emb|CAA85761.1| H-protein [Flaveria anomala]
Length = 162
Score = 39.3 bits (90), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN++L + PGL+NSS +G++ + P L SL+G ++Y +
Sbjct: 99 VNSPISGEIVEVNSKLSETPGLINSSPYEDGWMIKVKPSNPSELD---SLMGAKEYTKFC 155
Query: 180 EE 181
EE
Sbjct: 156 EE 157
>gi|1346118|sp|P49359.1|GCSH_FLAPR RecName: Full=Glycine cleavage system H protein, mitochondrial;
Flags: Precursor
gi|438001|emb|CAA81075.1| H-protein [Flaveria pringlei]
gi|3688301|emb|CAB16913.1| H-protein [Flaveria pringlei]
Length = 162
Score = 39.3 bits (90), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN++L + PGL+NSS +G++ + P L SL+G ++Y +
Sbjct: 99 VNSPISGEIVEVNSKLSETPGLINSSPYEDGWMIKVKPSNPSELD---SLMGAKEYTKFC 155
Query: 180 EE 181
EE
Sbjct: 156 EE 157
>gi|118489297|gb|ABK96453.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 165
Score = 39.3 bits (90), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 6/75 (8%)
Query: 107 AVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIK 166
AV V +D + + G ++EVN +L + PGL+N S EG++ + P L+
Sbjct: 93 AVESVKATSD---INSPISGEIVEVNTKLSETPGLINKSPYEEGWMIKVKPSNPSELQ-- 147
Query: 167 SSLLGLEDYKRKREE 181
SLLG ++Y + EE
Sbjct: 148 -SLLGPKEYTKFCEE 161
>gi|224126315|ref|XP_002329524.1| precursor of carboxylase h-protein 3, glycine decarboxylase complex
[Populus trichocarpa]
gi|118486954|gb|ABK95310.1| unknown [Populus trichocarpa]
gi|222870233|gb|EEF07364.1| precursor of carboxylase h-protein 3, glycine decarboxylase complex
[Populus trichocarpa]
Length = 165
Score = 39.3 bits (90), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 6/75 (8%)
Query: 107 AVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIK 166
AV V +D + + G ++EVN +L + PGL+N S EG++ + P L+
Sbjct: 93 AVESVKATSD---INSPISGEIVEVNTKLSETPGLINKSPYEEGWMIKVKPSNPSELQ-- 147
Query: 167 SSLLGLEDYKRKREE 181
SLLG ++Y + EE
Sbjct: 148 -SLLGPKEYTKFCEE 161
>gi|356541408|ref|XP_003539169.1| PREDICTED: glycine cleavage system H protein, mitochondrial-like
[Glycine max]
Length = 165
Score = 39.3 bits (90), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 6/75 (8%)
Query: 107 AVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIK 166
AV V +D + + G ++EVN +L + PGL+NSS +G+ +I KP+D ++
Sbjct: 92 AVESVKATSD---INSPISGEIVEVNKKLTETPGLVNSSPYEDGW--MIKVKPSDPSELD 146
Query: 167 SSLLGLEDYKRKREE 181
SL+G ++Y + EE
Sbjct: 147 -SLMGPKEYTKFCEE 160
>gi|386018861|ref|YP_005936885.1| glycine cleavage system H protein [Pseudomonas stutzeri DSM 4166]
gi|327478833|gb|AEA82143.1| glycine cleavage system H protein [Pseudomonas stutzeri DSM 4166]
Length = 154
Score = 39.3 bits (90), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 25/39 (64%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPK 158
RC + G+LL VN L+K+PGL+N EG++ + P+
Sbjct: 75 ARCPLSGTLLAVNEALLKRPGLINQDCYGEGWMVRLQPE 113
>gi|147770018|emb|CAN76620.1| hypothetical protein VITISV_038357 [Vitis vinifera]
Length = 165
Score = 39.3 bits (90), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 57/123 (46%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + + + N AV V +D
Sbjct: 44 ATSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGGSVSKGNGFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+ + G ++EVN++L + PGL+NSS +G++ + P L+ SL+G ++Y +
Sbjct: 102 -INSPISGEVVEVNSKLSESPGLINSSPYEDGWMIKVKPGNPSELE---SLMGPKEYTKF 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|297851690|ref|XP_002893726.1| hypothetical protein ARALYDRAFT_890824 [Arabidopsis lyrata subsp.
lyrata]
gi|297339568|gb|EFH69985.1| hypothetical protein ARALYDRAFT_890824 [Arabidopsis lyrata subsp.
lyrata]
Length = 166
Score = 38.9 bits (89), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 124 VKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP-KPADWLKIKSSLLGLEDYKR 177
+ G ++EVN +L + PGL+NSS EG++ + P PA+ +L+G ++Y +
Sbjct: 107 ISGEVIEVNTKLTESPGLINSSPYEEGWMIKVKPSSPAE----LEALMGPKEYTK 157
>gi|354545784|emb|CCE42512.1| hypothetical protein CPAR2_201550 [Candida parapsilosis]
Length = 186
Score = 38.9 bits (89), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%)
Query: 124 VKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
V G ++ VN L +PGLLN EG+IA I + + +LL EDY + EE
Sbjct: 128 VSGEIVGVNEILESEPGLLNHDPTVEGWIAHIKLEDTRQITDSETLLSSEDYAKSLEE 185
>gi|15223217|ref|NP_174525.1| glycine cleavage system H protein [Arabidopsis thaliana]
gi|12644523|sp|Q9LQL0.1|GCSH2_ARATH RecName: Full=Probable glycine cleavage system H protein 2,
mitochondrial; Flags: Precursor
gi|8920623|gb|AAF81345.1|AC007767_25 Identical to a glycine cleavage system H-protein precursor from
Arabidopsis thaliana gb|P25855. It contains a glycine
cleavage H-protein domain PF|01597. ESTs gb|R90208,
gb|AI994794, gb|AA605324, gb|N38240, gb|AV533336,
gb|AV534187, gb|AA597419 and gb|AA597515 come from this
gene [Arabidopsis thaliana]
gi|12083338|gb|AAG48828.1|AF332465_1 putative glycine cleavage system H protein precursor [Arabidopsis
thaliana]
gi|14326572|gb|AAK60330.1|AF385740_1 At1g32470/F5D14_10 [Arabidopsis thaliana]
gi|18700238|gb|AAL77729.1| At1g32470/F5D14_10 [Arabidopsis thaliana]
gi|332193369|gb|AEE31490.1| glycine cleavage system H protein [Arabidopsis thaliana]
Length = 166
Score = 38.9 bits (89), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 59/124 (47%), Gaps = 10/124 (8%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + + N+ + + AV V ++
Sbjct: 45 ANSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEANSSVSKEKSFGAVESVKATSE-- 102
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP-KPADWLKIKSSLLGLEDYKR 177
+ + G ++EVN +L + PGL+NSS +G++ + P PA+ +L+G ++Y +
Sbjct: 103 -ILSPISGEVIEVNTKLTESPGLINSSPYEDGWMIKVKPSSPAE----LEALMGPKEYTK 157
Query: 178 KREE 181
EE
Sbjct: 158 FCEE 161
>gi|886868|emb|CAA88734.1| H-protein precursor of glycine cleavage system [Flaveria trinervia]
Length = 163
Score = 38.9 bits (89), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN++L K PGL+N S +G++ + P L SL+G ++Y +
Sbjct: 99 VNSPISGEIVEVNSKLTKTPGLINKSPYEDGWMIKVKPSNPSELD---SLMGPKEYTKFC 155
Query: 180 EE 181
EE
Sbjct: 156 EE 157
>gi|1169884|sp|P46485.1|GCSH_FLATR RecName: Full=Glycine cleavage system H protein, mitochondrial;
AltName: Full=Lipoyl-bearing H protein; Flags: Precursor
gi|547502|emb|CAA85760.1| H-protein [Flaveria trinervia]
Length = 165
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G ++EVN++L K PGL+N S +G++ + P L SL+G ++Y +
Sbjct: 101 VNSPISGEIVEVNSKLTKTPGLINKSPYEDGWMIKVKPSNPSELD---SLMGPKEYTKFC 157
Query: 180 EE 181
EE
Sbjct: 158 EE 159
>gi|351721381|ref|NP_001238231.1| uncharacterized protein LOC100306654 [Glycine max]
gi|255629191|gb|ACU14940.1| unknown [Glycine max]
Length = 199
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 38/58 (65%), Gaps = 3/58 (5%)
Query: 124 VKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
+ G ++EVN +L + PGL+NSS +G+ +I KP+D ++ SL+G ++Y + EE
Sbjct: 140 ISGEIVEVNKKLTETPGLVNSSPYEDGW--MIKVKPSDPSELD-SLMGPKEYTKFCEE 194
>gi|386345765|ref|YP_006044014.1| Glycine cleavage system H protein [Spirochaeta thermophila DSM
6578]
gi|339410732|gb|AEJ60297.1| Glycine cleavage system H protein [Spirochaeta thermophila DSM
6578]
Length = 128
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 8/80 (10%)
Query: 103 ESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPK-PAD 161
E+ V V T +D V V G ++E+N L ++P LLN+S EG++ + P PA+
Sbjct: 54 EAFGVVESVKTTSD---VHMPVSGKIVEINETLKERPDLLNTSPFEEGWLVKVQPTDPAE 110
Query: 162 WLKIKSSLLGLEDYKRKREE 181
LL +DY++ EE
Sbjct: 111 L----DELLSADDYRKYVEE 126
>gi|448525556|ref|XP_003869145.1| Gcv3 glycine decarboxylase subunit H [Candida orthopsilosis Co
90-125]
gi|380353498|emb|CCG23008.1| Gcv3 glycine decarboxylase subunit H [Candida orthopsilosis]
Length = 186
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 31/62 (50%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
+ V G ++ VN+ L +PGLLN EG+IA I + +LL EDY +
Sbjct: 124 IYSPVSGEVVGVNSVLESEPGLLNHDPTVEGWIAHIKLDDPKQITESETLLTTEDYAKSL 183
Query: 180 EE 181
EE
Sbjct: 184 EE 185
>gi|326505670|dbj|BAJ95506.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326515436|dbj|BAK03631.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516078|dbj|BAJ88062.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 164
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 35/64 (54%), Gaps = 8/64 (12%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP-KPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN++L + PGL+NSS EG++ + P PA+ L GL D +
Sbjct: 102 VNSPISGEVVEVNSKLSEAPGLINSSPYEEGWMIKVKPSSPAE-------LEGLLDSAKY 154
Query: 179 REEC 182
+ C
Sbjct: 155 TKHC 158
>gi|452846007|gb|EME47940.1| hypothetical protein DOTSEDRAFT_69764 [Dothistroma septosporum
NZE10]
Length = 178
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 103 ESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADW 162
ES AV V + +D + + G ++E N+ L ++PG +N S +++G++A I AD
Sbjct: 102 ESIGAVESVKSASD---IMTPISGVIVEANSVLEEKPGTINKSPEKDGWLAKIKASNADE 158
Query: 163 LKIKSSLLGLEDYKRKREEC 182
L S+L+ YK+ EE
Sbjct: 159 L---SALMDAAGYKKFTEEA 175
>gi|146454536|gb|ABQ41934.1| glycine cleavage system H protein [Sonneratia alba]
gi|241865154|gb|ACS68655.1| H-protein of the glycine decarboxylase multi-enzyme complex
[Sonneratia alba]
gi|241865386|gb|ACS68725.1| H-protein of the glycine decarboxylase multi-enzyme complex
[Sonneratia alba]
Length = 129
Score = 38.1 bits (87), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 5/99 (5%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + Q + + AV V +D
Sbjct: 30 AESHEWVKHEGEVATIGITDHAQDHLGEVVFVDLPEPGVQVSQGGSFGAVESVKATSD-- 87
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
V V G ++EVN +L ++PGL+NSS +G++ I P
Sbjct: 88 -VNSPVSGKIMEVNTQLTEKPGLVNSSPYEDGWMIKIKP 125
>gi|307717727|ref|YP_003873259.1| glycine cleavage system H protein [Spirochaeta thermophila DSM
6192]
gi|306531452|gb|ADN00986.1| glycine cleavage system H protein [Spirochaeta thermophila DSM
6192]
Length = 128
Score = 38.1 bits (87), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 8/80 (10%)
Query: 103 ESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPK-PAD 161
E+ V V T +D V V G ++E+N L ++P LLN+S EG++ + P PA+
Sbjct: 54 EAFGVVESVKTTSD---VHMPVSGKIVEINETLKERPDLLNTSPFEEGWLVKLQPTDPAE 110
Query: 162 WLKIKSSLLGLEDYKRKREE 181
LL +DY++ EE
Sbjct: 111 L----EELLSADDYRKYVEE 126
>gi|255557631|ref|XP_002519845.1| glycine cleavage system h protein, putative [Ricinus communis]
gi|223540891|gb|EEF42449.1| glycine cleavage system h protein, putative [Ricinus communis]
Length = 165
Score = 38.1 bits (87), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSG--FKVTGKRKKQNAQHFESNTAVCKVCTNNDSY 118
A +H K EG + + D G V + +S AV V +D
Sbjct: 44 ATSHEWVKHEGSVATIGITDHAQDHLGEVVFVDLPEAGTSVTQGKSFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
V + G ++EVN +L + PG +NSS EG++ + P L+ SL+G ++Y +
Sbjct: 102 -VNSPISGEVVEVNTKLTEGPGSINSSPYEEGWMIKVKPNNPSELE---SLMGPKEYTKF 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|547519|emb|CAA85755.1| H-protein [Flaveria cronquistii]
Length = 152
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
V + G ++EVN +L + PGL+NSS +G++ + P L+ SL+G ++Y
Sbjct: 99 VNSPISGEIVEVNTKLSETPGLINSSPYEDGWMIKVKPSNPSELE---SLMGAKEY 151
>gi|168033770|ref|XP_001769387.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679307|gb|EDQ65756.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 159
Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 58/128 (45%), Gaps = 26/128 (20%)
Query: 56 CVIGLAPTHVAFKDEGGITAVDF-NVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTN 114
V+G+ TH A + G + VD VGK+ G +S +V V
Sbjct: 53 AVVGI--THHAQEALGDVVFVDLPEVGKAISQG---------------DSFGSVESVKAV 95
Query: 115 NDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPK-PADWLKIKSSLLGLE 173
+D Y V G ++EVN+ L + PGL+N EG++ + K PA+ SLL E
Sbjct: 96 SDVY---SPVSGEVVEVNSELTESPGLVNKGPFEEGWMMKLKLKDPAE----VQSLLTAE 148
Query: 174 DYKRKREE 181
YK+ EE
Sbjct: 149 SYKQYIEE 156
>gi|149175893|ref|ZP_01854511.1| hypothetical protein PM8797T_24781 [Planctomyces maris DSM 8797]
gi|148845340|gb|EDL59685.1| hypothetical protein PM8797T_24781 [Planctomyces maris DSM 8797]
Length = 233
Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 73/160 (45%), Gaps = 21/160 (13%)
Query: 8 LHKLLLPNVHDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAF 67
+H+LL D LT +D F + + H ++ HA LC + ++PT V
Sbjct: 77 VHQLLTKPYSDAELT----VDLKFAGSPSTNLTFNQH-KFKGSHAGHLCRVVVSPTKVTL 131
Query: 68 KDEGGITAVDFN-VGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDS-YIVRCCVK 125
+D G T V N + K RS K+T + + + A V D Y V +K
Sbjct: 132 RD--GKTGVFSNEIFKKRRSKEKLTAEEQTI----LKKTQADFSVKLEKDKWYTVTVRIK 185
Query: 126 GSLLE--VNNRLI---KQPGLLNSSADREGYIAIIMPKPA 160
G L++ ++ RLI + PG+ +++ D+ I ++ PK +
Sbjct: 186 GDLMQAFIDGRLIGSLQSPGIAHATKDK---IGLVTPKES 222
>gi|356510047|ref|XP_003523752.1| PREDICTED: protein cereblon-like [Glycine max]
Length = 543
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 1/58 (1%)
Query: 4 DEEELHKLLLPNVH-DLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLCVIGL 60
D +E + L L H D L PPS+ +FNF Y AP G Q + H +G ++ L
Sbjct: 44 DPDEEYALFLSQYHLDASLPPPSSAEFNFDTYIAPLHTYLGGTQSRFLHLDGGAILNL 101
>gi|547521|emb|CAA85756.1| H-protein [Flaveria cronquistii]
Length = 152
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
V + G ++EVN++L + PGL+NSS +G++ + P L SL+G ++Y
Sbjct: 99 VNSPISGEIVEVNSKLSETPGLINSSPYEDGWMIKVKPSNPSELD---SLMGAKEY 151
>gi|449461527|ref|XP_004148493.1| PREDICTED: glycine cleavage system H protein, mitochondrial-like
[Cucumis sativus]
gi|449513621|ref|XP_004164375.1| PREDICTED: glycine cleavage system H protein, mitochondrial-like
[Cucumis sativus]
Length = 165
Score = 37.4 bits (85), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 8/123 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSG--FKVTGKRKKQNAQHFESNTAVCKVCTNNDSY 118
A +H K EG + V D G V + +S AV V +D
Sbjct: 44 ASSHEWVKHEGSVATVGITDHAQDHLGEVVFVDLPESGSSVSQGKSFGAVESVKATSD-- 101
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
+ + G ++EVN +L + PGL+NSS G++ + P L SLLG ++Y +
Sbjct: 102 -INSPISGEIVEVNPQLKESPGLINSSPYEGGWMIKVKPSNPGELD---SLLGPKEYTKF 157
Query: 179 REE 181
EE
Sbjct: 158 CEE 160
>gi|547496|emb|CAA85757.1| H-protein [Flaveria chlorifolia]
gi|547498|emb|CAA85758.1| H-protein [Flaveria linearis]
gi|547560|emb|CAA85766.1| H-protein [Flaveria floridana]
Length = 152
Score = 37.4 bits (85), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
V + G ++EVN++L + PGL+NSS +G++ + P L SL+G ++Y
Sbjct: 99 VNSPISGEIVEVNSKLSETPGLINSSPYEDGWMIKVKPSNPSELD---SLMGAKEY 151
>gi|1346119|sp|P49360.1|GCSH_FLAPU RecName: Full=Glycine cleavage system H protein, mitochondrial;
Flags: Precursor
gi|547564|emb|CAA85768.1| H-protein [Flaveria pubescens]
Length = 152
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
V + G ++EVN++L + PGL+NSS +G++ + P L SL+G ++Y
Sbjct: 99 VNSPISGEIVEVNSKLSETPGLINSSPYEDGWMIKVKPSNPSELD---SLMGAKEY 151
>gi|338731629|ref|YP_004661021.1| glycine cleavage system H protein [Thermotoga thermarum DSM 5069]
gi|335365980|gb|AEH51925.1| glycine cleavage system H protein [Thermotoga thermarum DSM 5069]
Length = 123
Score = 37.0 bits (84), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 2/103 (1%)
Query: 60 LAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYI 119
A TH + G I V +V ++ G V K + + V + + +
Sbjct: 4 FAKTHEWIEVSGNIATVGISVYAQEKLGDVVYVDLPKV-GKEVKKGEVVLTIESVKAAGE 62
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII-MPKPAD 161
V V G ++EVN +L QP L+N A+ EG+I I M PA+
Sbjct: 63 VYAPVSGKIVEVNEKLNNQPELINKDAEGEGWIVKIEMTNPAE 105
>gi|145482755|ref|XP_001427400.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394481|emb|CAK60002.1| unnamed protein product [Paramecium tetraurelia]
Length = 159
Score = 37.0 bits (84), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 54/123 (43%), Gaps = 10/123 (8%)
Query: 34 YFAPDFLKPGHDQYVYRHANGLCVIGLAPTHVAFKDEGGITAVDFNVGKSDRSGFKVTGK 93
YF F QYV++H+N L + G+ H K + I ++F + G K G
Sbjct: 41 YFVKSF---SEHQYVFQHSNRLYLSGINKNHPIVKSDSDIKEIEFTQKEQHVKGKKKKGG 97
Query: 94 RKKQNAQHFESNTAVCKV-CTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYI 152
K + S+T + KV N + Y V G ++E+N L K L + +G++
Sbjct: 98 LKVK------SDTVMLKVHLENGEVYNVVAGHDGYIIEMNENLQKNKQYLKQYPETKGFV 151
Query: 153 AII 155
II
Sbjct: 152 CII 154
>gi|330804345|ref|XP_003290156.1| hypothetical protein DICPUDRAFT_92440 [Dictyostelium purpureum]
gi|325079713|gb|EGC33300.1| hypothetical protein DICPUDRAFT_92440 [Dictyostelium purpureum]
Length = 148
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 37/62 (59%), Gaps = 4/62 (6%)
Query: 116 DSYIVRCC--VKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLE 173
D ++V+ + G++ ++N ++I+QP L+N+ + G++AII K D K LL +
Sbjct: 81 DQFMVKFMNPISGTVTKINEKVIEQPELVNNDCENTGWLAIIETKSND--KDNKKLLSKK 138
Query: 174 DY 175
+Y
Sbjct: 139 EY 140
>gi|547562|emb|CAA85767.1| H-protein [Flaveria palmeri]
Length = 154
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 53/117 (45%), Gaps = 8/117 (6%)
Query: 61 APTHVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNT--AVCKVCTNNDSY 118
A +H K EG + + D G V + + ++ + AV V +D
Sbjct: 43 ANSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEAGGSVTKTTSFGAVESVKATSD-- 100
Query: 119 IVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDY 175
V + G ++EVN++L + PGL+N S +G++ + P L SL+G ++Y
Sbjct: 101 -VNSPISGEIVEVNSKLTETPGLINKSPYEDGWMIKVKPSNPSELD---SLMGAKEY 153
>gi|331215111|ref|XP_003320236.1| glycine cleavage system H protein [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309299226|gb|EFP75817.1| glycine cleavage system H protein [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 163
Score = 36.6 bits (83), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 9/56 (16%)
Query: 124 VKGSLLEVNNRLIKQPGLLNSSADREGYIAIIM---PKPADWLKIKSSLLGLEDYK 176
V G++ EVN +L Q GLLN S + EG++A I P+ D +LL E YK
Sbjct: 107 VSGTVTEVNTKLADQAGLLNKSPEDEGWLAKIQLSSPQELD------NLLSEESYK 156
>gi|405963638|gb|EKC29196.1| Glycine cleavage system H protein [Crassostrea gigas]
Length = 123
Score = 36.2 bits (82), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 126 GSLLEVNNRLIKQPGLLNSSADREGYIAIIMP 157
G ++EVN RL +P L+NSSA+ +G++ I P
Sbjct: 68 GVIVEVNKRLESEPELVNSSAESDGWLVKIKP 99
>gi|315426233|dbj|BAJ47876.1| glycine cleavage system H protein [Candidatus Caldiarchaeum
subterraneum]
gi|315427886|dbj|BAJ49478.1| glycine cleavage system H protein [Candidatus Caldiarchaeum
subterraneum]
Length = 140
Score = 36.2 bits (82), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 5/59 (8%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII-MPKPADWLKIKSSLLGLEDYKR 177
V V G ++EVN++L+ P +LN S EG+IA I + P++ K LL ++Y+R
Sbjct: 72 VLAPVSGRVVEVNSKLLDHPEILNESPYTEGWIAKIELSDPSELEK----LLDADEYRR 126
>gi|409051704|gb|EKM61180.1| hypothetical protein PHACADRAFT_247626 [Phanerochaete carnosa
HHB-10118-sp]
Length = 160
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 36/58 (62%), Gaps = 3/58 (5%)
Query: 124 VKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKREE 181
+ G++ VN L KQPGLLN S + +G++ I K AD ++K +L+ E Y++ EE
Sbjct: 105 ISGTIEAVNEELNKQPGLLNKSPEGDGWLCKI--KVADEAELK-NLMTEEAYQKHCEE 159
>gi|315426158|dbj|BAJ47802.1| glycine cleavage system H protein [Candidatus Caldiarchaeum
subterraneum]
gi|343485042|dbj|BAJ50696.1| glycine cleavage system H protein [Candidatus Caldiarchaeum
subterraneum]
Length = 140
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 5/59 (8%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAII-MPKPADWLKIKSSLLGLEDYKR 177
V V G ++EVN++L+ P +LN S EG+IA I + P++ K LL ++Y+R
Sbjct: 72 VLAPVSGRVVEVNSKLLDHPEILNESPYTEGWIAKIELSDPSELEK----LLDADEYRR 126
>gi|365121518|ref|ZP_09338438.1| hypothetical protein HMPREF1033_01784 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645763|gb|EHL85023.1| hypothetical protein HMPREF1033_01784 [Tannerella sp.
6_1_58FAA_CT1]
Length = 195
Score = 36.2 bits (82), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 100 QHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKP 159
++F+ NT + + ++ + + G++L VNN L PG+LNS+A IIM KP
Sbjct: 117 EYFDKNTVFATIESISEVCDLYMPLSGTILGVNNVLEDSPGMLNSNAHE---YPIIMLKP 173
Query: 160 ADWLKIKSSL 169
D L+ + L
Sbjct: 174 DDSLEFEELL 183
>gi|221060711|ref|XP_002261925.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193811075|emb|CAQ41803.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 311
Score = 36.2 bits (82), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 58/140 (41%), Gaps = 17/140 (12%)
Query: 35 FAPDFLKPG--HDQYVYRHANGLCVIGLAPTHVAF----------KDEGGITAVDFNVGK 82
F +F G +D + + NGL V+ L F + I + NV
Sbjct: 141 FCENFYLMGDEYDLQISKQLNGLSVLSLNKCSTIFFNILSDLKADTSKSFIQLIKENVKV 200
Query: 83 SDRSGF---KVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQP 139
+V+GKRKK NA N + K+ + Y+V+ + G L++N L++ P
Sbjct: 201 EYDPRLLNARVSGKRKK-NALFVNENMTILKIFYKDKLYMVKNKICGYQLDINENLVENP 259
Query: 140 GLLNSSADREGYIAIIMPKP 159
LL +E +I I+ K
Sbjct: 260 YLL-FLGQQEAWILILKCKE 278
>gi|385805437|ref|YP_005841835.1| glycine cleavage system protein H [Fervidicoccus fontis Kam940]
gi|383795300|gb|AFH42383.1| glycine cleavage system protein H [Fervidicoccus fontis Kam940]
Length = 137
Score = 35.8 bits (81), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 3/61 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
V + G +L+VN L++ P L+NS G+IA+I + D L +SLL E Y +
Sbjct: 77 VYAPISGEILDVNETLLESPELMNSDPYESGWIALIKIENNDEL---NSLLTFEQYVKLL 133
Query: 180 E 180
E
Sbjct: 134 E 134
>gi|160901730|ref|YP_001567311.1| glycine cleavage system H protein [Petrotoga mobilis SJ95]
gi|160359374|gb|ABX30988.1| glycine cleavage system H protein [Petrotoga mobilis SJ95]
Length = 116
Score = 35.8 bits (81), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 23/93 (24%), Positives = 42/93 (45%), Gaps = 1/93 (1%)
Query: 63 THVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRC 122
TH EG I V + ++ G VT + + + +C + + + +
Sbjct: 7 THEYVTIEGNIATVGISAKAAEELG-DVTYVELPEVGKEVKKGEVLCTIESVKSAEDIYV 65
Query: 123 CVKGSLLEVNNRLIKQPGLLNSSADREGYIAII 155
+ G ++EVNN L QP ++N A+ +G+I I
Sbjct: 66 PLSGKIVEVNNDLEDQPEIINEDAENKGWIVKI 98
>gi|190348927|gb|EDK41481.2| hypothetical protein PGUG_05579 [Meyerozyma guilliermondii ATCC
6260]
Length = 397
Score = 35.8 bits (81), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 70/174 (40%), Gaps = 21/174 (12%)
Query: 17 HDLPLTPPSAIDFNFVAYFAPDFLKPGHDQYVYRHANGLC-----VIGLAPTHVAFKDEG 71
H LP+T P ++F + PD +K Q V H + V+ L +V
Sbjct: 41 HVLPITVPVTLEFGDTGFAFPDLVKATQIQ-VDSHLKSIGQERYEVVLLDNLYVDPLRNS 99
Query: 72 GITAVDFNVGKSDRSGFKVTGKR-------KKQNAQHFESNTAVCKVCTNNDSYIVRCCV 124
ITAVD N +D + V K A +F S+ A+ + N+ Y V C+
Sbjct: 100 SITAVDHNSTSNDYAVEMVHASENFVVIDSNKLKAFNFYSSAAIHE---NDIPYFVAQCI 156
Query: 125 KGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRK 178
LL + L+++P S + + D++ I + E+YKR+
Sbjct: 157 LYHLLRTDLSLLQRPNATVYSPQLHVNLVSVFLNGDDYIDIPT-----EEYKRE 205
>gi|242060658|ref|XP_002451618.1| hypothetical protein SORBIDRAFT_04g004720 [Sorghum bicolor]
gi|241931449|gb|EES04594.1| hypothetical protein SORBIDRAFT_04g004720 [Sorghum bicolor]
Length = 158
Score = 35.8 bits (81), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 39/62 (62%), Gaps = 3/62 (4%)
Query: 120 VRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIKSSLLGLEDYKRKR 179
+ V G ++EVNN+L ++PGL+N+S +G+ II K +D ++ +SL+ E Y +
Sbjct: 96 INSPVSGEVVEVNNKLSEEPGLVNASPYEKGW--IIKVKLSDSGEL-NSLMNDEKYSKFC 152
Query: 180 EE 181
EE
Sbjct: 153 EE 154
>gi|224134418|ref|XP_002321819.1| precursor of carboxylase h-protein 2, glycine decarboxylase complex
[Populus trichocarpa]
gi|222868815|gb|EEF05946.1| precursor of carboxylase h-protein 2, glycine decarboxylase complex
[Populus trichocarpa]
Length = 155
Score = 35.8 bits (81), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 107 AVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADREGYIAIIMPKPADWLKIK 166
AV V +D Y V G+++EVN L PGL+NSS +G+I + K A L+
Sbjct: 82 AVESVKATSDVY---SPVSGNVVEVNEELSSSPGLVNSSPYEKGWIMKVEIKDASELE-- 136
Query: 167 SSLLGLEDYKRKREE 181
+L +DY + EE
Sbjct: 137 -TLKNSDDYAKFCEE 150
>gi|389585971|dbj|GAB68700.1| hypothetical protein PCYB_141280, partial [Plasmodium cynomolgi
strain B]
Length = 319
Score = 35.4 bits (80), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 89 KVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRCCVKGSLLEVNNRLIKQPGLLNSSADR 148
+V+GKRKK NA N + K+ + Y+V+ + G ++N L+K P LL ++
Sbjct: 218 RVSGKRKK-NALFVNENMTILKIFYKDKLYMVKNKIFGYQHDINENLVKNPHLL-FLGEQ 275
Query: 149 EGYIAIIMPK 158
E +I I+ K
Sbjct: 276 EAWILILKCK 285
>gi|374339160|ref|YP_005095896.1| glycine cleavage system H protein [Marinitoga piezophila KA3]
gi|372100694|gb|AEX84598.1| glycine cleavage system H protein [Marinitoga piezophila KA3]
Length = 119
Score = 35.4 bits (80), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 1/90 (1%)
Query: 63 THVAFKDEGGITAVDFNVGKSDRSGFKVTGKRKKQNAQHFESNTAVCKVCTNNDSYIVRC 122
TH + EG I + ++ G +T + + + A+C V + + V
Sbjct: 7 THEWVEVEGNIATIGITAHAAEELG-DITYVDLPEEGKEIKKGEALCAVESVKSASDVYS 65
Query: 123 CVKGSLLEVNNRLIKQPGLLNSSADREGYI 152
V G ++EVN+ L P L+N A+ G+I
Sbjct: 66 PVSGKVIEVNSELDASPELINDDAENAGWI 95
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.138 0.419
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,239,268,531
Number of Sequences: 23463169
Number of extensions: 133420623
Number of successful extensions: 249824
Number of sequences better than 100.0: 261
Number of HSP's better than 100.0 without gapping: 173
Number of HSP's successfully gapped in prelim test: 88
Number of HSP's that attempted gapping in prelim test: 249481
Number of HSP's gapped (non-prelim): 267
length of query: 186
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 52
effective length of database: 9,215,130,721
effective search space: 479186797492
effective search space used: 479186797492
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)