Psyllid ID: psy16442


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110------
MKYIEEQLSKRKKGEETKEGGQGEEGGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHNKKDGPSQFVPTNMAVNFVQHN
cHHHHHHHHHHHcccccccccccccccccccccHHHHHHHcccccccccccccHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccHHHHccccc
ccHHHHHHHHHccccccccccccccccccccccHHHHHHHcHHHcccccccccHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccEEEEEEcc
MKYIEEQLSKrkkgeetkeggqgeeggvnycspeeaALQAvpdhlrqssthqSEEMLSNqmlsgipevdlGIEAKIRNIEATEEAKLKLLWDrhnkkdgpsqfvptnmavNFVQHN
mkyieeqlskrkkgeetkeggqgeegGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHNKkdgpsqfvptnmavnfvqhn
MKYIEEQLSkrkkgeetkeggqgeeggVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHNKKDGPSQFVPTNMAVNFVQHN
******************************************************************EVDLGIEAKIRNIEATEEAKLKLLWD************************
MKYIEEQ*****************************ALQAV********************LSGIPEVDLGIEAKIRNIEATE***********************NMAVNFVQ**
MKYIEEQL*****************GGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHNKKDGPSQFVPTNMAVNFVQHN
*K***EQLSKRK*****************YCSPEEAALQAVPDHLRQS**HQSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHNKKDGPSQFVPTNMAVNFV***
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKYIEEQLSKRKKGEETKEGGQGEEGGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHNKKDGPSQFVPTNMAVNFVQHN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query116 2.2.26 [Sep-21-2011]
Q3TQI7289 Uncharacterized protein C yes N/A 0.939 0.377 0.547 5e-26
Q9NZ63289 Uncharacterized protein C yes N/A 0.939 0.377 0.547 1e-25
Q5RC87289 Uncharacterized protein C yes N/A 0.939 0.377 0.538 8e-25
>sp|Q3TQI7|CI078_MOUSE Uncharacterized protein C9orf78 homolog OS=Mus musculus PE=1 SV=2 Back     alignment and function desciption
 Score =  116 bits (290), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 64/117 (54%), Positives = 83/117 (70%), Gaps = 8/117 (6%)

Query: 1   MKYIEEQLSKRKKGEETKEGGQGEEGGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQ 60
           MKYIE +L KRK G   +E  + +      C      L  +P+++R SS  ++EEMLSNQ
Sbjct: 116 MKYIETELKKRK-GIVEQEEQKAKPKNAEDC------LYELPENIRVSSAKKTEEMLSNQ 168

Query: 61  MLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHN-KKDGPSQFVPTNMAVNFVQHN 116
           MLSGIPEVDLGI+AKI+NI +TE+AK +LL ++ N KKD  + FVPTNMAVN+VQHN
Sbjct: 169 MLSGIPEVDLGIDAKIKNIISTEDAKARLLAEQQNKKKDSETSFVPTNMAVNYVQHN 225





Mus musculus (taxid: 10090)
>sp|Q9NZ63|CI078_HUMAN Uncharacterized protein C9orf78 OS=Homo sapiens GN=C9orf78 PE=1 SV=1 Back     alignment and function description
>sp|Q5RC87|CI078_PONAB Uncharacterized protein C9orf78 homolog OS=Pongo abelii PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query116
383853293 310 PREDICTED: uncharacterized protein C9orf 0.991 0.370 0.837 5e-50
350396974 296 PREDICTED: uncharacterized protein C9orf 0.991 0.388 0.846 6e-50
340716340 296 PREDICTED: uncharacterized protein C9orf 0.991 0.388 0.846 7e-50
345495075 297 PREDICTED: uncharacterized protein C9orf 1.0 0.390 0.818 2e-49
380018473 296 PREDICTED: uncharacterized protein C9orf 0.991 0.388 0.837 2e-49
307199470 295 Uncharacterized protein C9orf78 [Harpegn 0.982 0.386 0.805 7e-48
332016923 297 Uncharacterized protein C9orf78 [Acromyr 1.0 0.390 0.784 2e-46
357610714214 hypothetical protein KGM_02139 [Danaus p 1.0 0.542 0.724 2e-44
91078372 299 PREDICTED: similar to CG7974 CG7974-PA [ 0.991 0.384 0.75 7e-44
322793759 293 hypothetical protein SINV_07529 [Solenop 0.965 0.382 0.776 2e-43
>gi|383853293|ref|XP_003702157.1| PREDICTED: uncharacterized protein C9orf78-like [Megachile rotundata] Back     alignment and taxonomy information
 Score =  201 bits (511), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 98/117 (83%), Positives = 105/117 (89%), Gaps = 2/117 (1%)

Query: 1   MKYIEEQLSKRK-KGEETKEGGQGEEGGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSN 59
           +KYIEE+LSKRK K E+  E G   + G +YCSPEEAALQAVP+HLRQSS H+SEEMLSN
Sbjct: 139 VKYIEEELSKRKSKNEDKTENGSSNDKG-SYCSPEEAALQAVPEHLRQSSAHRSEEMLSN 197

Query: 60  QMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHNKKDGPSQFVPTNMAVNFVQHN 116
           QMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRH KKDGPSQFVPTNMAVNFVQHN
Sbjct: 198 QMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRHRKKDGPSQFVPTNMAVNFVQHN 254




Source: Megachile rotundata

Species: Megachile rotundata

Genus: Megachile

Family: Megachilidae

Order: Hymenoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|350396974|ref|XP_003484725.1| PREDICTED: uncharacterized protein C9orf78-like [Bombus impatiens] Back     alignment and taxonomy information
>gi|340716340|ref|XP_003396657.1| PREDICTED: uncharacterized protein C9orf78-like isoform 1 [Bombus terrestris] gi|340716342|ref|XP_003396658.1| PREDICTED: uncharacterized protein C9orf78-like isoform 2 [Bombus terrestris] Back     alignment and taxonomy information
>gi|345495075|ref|XP_001606209.2| PREDICTED: uncharacterized protein C9orf78-like [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|380018473|ref|XP_003693152.1| PREDICTED: uncharacterized protein C9orf78-like [Apis florea] Back     alignment and taxonomy information
>gi|307199470|gb|EFN80083.1| Uncharacterized protein C9orf78 [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|332016923|gb|EGI57732.1| Uncharacterized protein C9orf78 [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|357610714|gb|EHJ67110.1| hypothetical protein KGM_02139 [Danaus plexippus] Back     alignment and taxonomy information
>gi|91078372|ref|XP_974116.1| PREDICTED: similar to CG7974 CG7974-PA [Tribolium castaneum] gi|270003886|gb|EFA00334.1| hypothetical protein TcasGA2_TC003173 [Tribolium castaneum] Back     alignment and taxonomy information
>gi|322793759|gb|EFZ17143.1| hypothetical protein SINV_07529 [Solenopsis invicta] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query116
UNIPROTKB|F1P4D4293 C17H9orf78 "Uncharacterized pr 0.706 0.279 0.662 1.4e-23
UNIPROTKB|Q1JQC8265 C11H9ORF78 "Uncharacterized pr 0.706 0.309 0.638 7.8e-23
UNIPROTKB|Q9NZ63289 C9orf78 "Uncharacterized prote 0.706 0.283 0.638 7.8e-23
UNIPROTKB|F1RQE0289 C9orf78 "Uncharacterized prote 0.706 0.283 0.638 7.8e-23
MGI|MGI:2385132289 BC005624 "cDNA sequence BC0056 0.706 0.283 0.638 7.8e-23
ZFIN|ZDB-GENE-041010-15289 zgc:103692 "zgc:103692" [Danio 0.939 0.377 0.521 2.1e-22
TAIR|locus:2204888279 AT1G02330 "AT1G02330" [Arabido 0.663 0.275 0.407 1.8e-07
UNIPROTKB|F1P4D4 C17H9orf78 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
 Score = 271 (100.5 bits), Expect = 1.4e-23, P = 1.4e-23
 Identities = 55/83 (66%), Positives = 69/83 (83%)

Query:    35 EAALQAVPDHLRQSSTHQSEEMLSNQMLSGIPEVDLGIEAKIRNIEATEEAKLKLLWDRH 94
             E +L  +P+++R SS  ++EEMLSNQMLSGIPEVDLGI+AKI+NI +TEEAK KLL ++ 
Sbjct:   147 EDSLYELPENIRVSSAKKTEEMLSNQMLSGIPEVDLGIDAKIKNIISTEEAKAKLLAEQQ 206

Query:    95 NKK-DGPSQFVPTNMAVNFVQHN 116
             NKK D  + FVPTNMAVN+VQHN
Sbjct:   207 NKKKDSETSFVPTNMAVNYVQHN 229




GO:0005634 "nucleus" evidence=IEA
GO:0005737 "cytoplasm" evidence=IEA
UNIPROTKB|Q1JQC8 C11H9ORF78 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q9NZ63 C9orf78 "Uncharacterized protein C9orf78" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1RQE0 C9orf78 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
MGI|MGI:2385132 BC005624 "cDNA sequence BC005624" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-041010-15 zgc:103692 "zgc:103692" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
TAIR|locus:2204888 AT1G02330 "AT1G02330" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q5RC87CI078_PONABNo assigned EC number0.53840.93960.3771yesN/A
Q3TQI7CI078_MOUSENo assigned EC number0.54700.93960.3771yesN/A
Q9NZ63CI078_HUMANNo assigned EC number0.54700.93960.3771yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query116
pfam07052102 pfam07052, Hep_59, Hepatocellular carcinoma-associ 8e-30
>gnl|CDD|219280 pfam07052, Hep_59, Hepatocellular carcinoma-associated antigen 59 Back     alignment and domain information
 Score =  102 bits (256), Expect = 8e-30
 Identities = 48/90 (53%), Positives = 62/90 (68%), Gaps = 4/90 (4%)

Query: 1   MKYIEEQLSKRKKGEETKEGGQGEEGGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQ 60
           MKYIEE+L+KRK G+      +  +   N     E  L A+P+HL+     ++EEMLSNQ
Sbjct: 17  MKYIEEELAKRK-GQAADADAEDSDKAANNLKRAEDELYALPEHLK---VKKNEEMLSNQ 72

Query: 61  MLSGIPEVDLGIEAKIRNIEATEEAKLKLL 90
           ML GIPEVDLGIEAK++NIE TE+AK +LL
Sbjct: 73  MLGGIPEVDLGIEAKLKNIERTEKAKRRLL 102


This family represents a conserved region approximately 100 residues long within mammalian hepatocellular carcinoma-associated antigen 59 and similar proteins. Family members are found in a variety of eukaryotes, mainly as hypothetical proteins. Length = 102

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 116
KOG3345|consensus275 100.0
PF07052104 Hep_59: Hepatocellular carcinoma-associated antige 99.97
>KOG3345|consensus Back     alignment and domain information
Probab=100.00  E-value=1.9e-43  Score=281.93  Aligned_cols=110  Identities=61%  Similarity=0.871  Sum_probs=96.2

Q ss_pred             ChhHHHHHHhhhCCCCCcCCCCCCCCCCCCCChHHHhhhhcCccccccccchhhhhhhhhhhcCCCcccccHHHHhhcHH
Q psy16442          1 MKYIEEQLSKRKKGEETKEGGQGEEGGVNYCSPEEAALQAVPDHLRQSSTHQSEEMLSNQMLSGIPEVDLGIEAKIRNIE   80 (116)
Q Consensus         1 MkYIEeELaKR~g~~~~~~~~~~~~~~~~~~~~~ed~Ly~~Pe~l~~~~~~~~ee~lS~~mL~gI~EVdLgie~KlkNIe   80 (116)
                      |||||+||+||+|....+.++..     ..+...+||||.+|+||+++++++++||||+|||+|||||||||++||+||+
T Consensus       114 ~KYIeqeLqkrkg~~~~e~eeqk-----~k~~naED~Ly~lpdhlk~ssskks~EmlS~qml~GIpEVdLgI~~kikNI~  188 (275)
T KOG3345|consen  114 MKYIEQELQKRKGTSIVEHEEQK-----NKPKNAEDCLYELPDHLKVSSSKKSEEMLSNQMLSGIPEVDLGIDAKIKNIE  188 (275)
T ss_pred             HHHHHHHHHHhhccchhhhhhhh-----cccCCHHHHHHhhhhhhccccccchHHHHHHHHHcCCcccccchhhhhhhHH
Confidence            79999999999995443322211     1333568999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHhhhhhcC-CCCCCCCcccchhhhhhhcC
Q psy16442         81 ATEEAKLKLLWDRHNK-KDGPSQFVPTNMAVNFVQHN  116 (116)
Q Consensus        81 ~TE~AKr~l~~~~~~k-k~~~s~fvp~n~a~n~~qh~  116 (116)
                      +||+|| +||++++++ ++|+++|||+||+|||||||
T Consensus       189 eTE~aK-rlLqe~~~~~~~g~~~~ipt~~a~ny~qr~  224 (275)
T KOG3345|consen  189 ETEDAK-RLLQEQQNKKKDGETSFIPTNMAVNYVQRN  224 (275)
T ss_pred             hhHHHH-HHHHHHHhccCCCCCCCCccchhhhHHHhh
Confidence            999999 788888876 69999999999999999997



>PF07052 Hep_59: Hepatocellular carcinoma-associated antigen 59; InterPro: IPR010756 This family represents a conserved region approximately 100 residues long within mammalian hepatocellular carcinoma-associated antigen 59 and similar proteins Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00