Psyllid ID: psy8934


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------13
STPAEDVESVPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDYNKHNKESR
ccccccccHHHHHHHHHHHHHcccccccccccccccccccccccEEEEEccccccccccccccHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHcccccccc
cccccccccHHHHHHHHHHHHHHHcccccccccccccccHHHccEEEEEEEEEccccccccccHHHHHHHHcccHHHHHHHHHHHHHHHHcccHHccccccccccccHHHHHHHHHHHHHHHccccccc
stpaedvesvpLAQSLLNAICDllfcpdftvasnkkfnsrkKMDYILNCTlsfgglklgtqdnLFINYLSRIHRDEDFQFVLAGFSRLlnnpllqtylpnstkkIEFHQELLVLFWKMCdynkhnkesr
stpaedvesvPLAQSLLNAICDLLFCPDFTVASnkkfnsrkKMDYILNCTLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDynkhnkesr
STPAEDVESVPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDYNKHNKESR
***********LAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDYN*******
***AEDVESVPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSF**********LFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTY*PNSTKKIEFHQELLVLFWKMCDYNKHN****
*********VPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDYNKHNKESR
*******ESVPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDYNKHN****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
STPAEDVESVPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDYNKHNKESR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query129 2.2.26 [Sep-21-2011]
Q8IV36 788 Protein HID1 OS=Homo sapi yes N/A 0.457 0.074 0.822 8e-27
Q8R1F6 788 Protein HID1 OS=Mus muscu yes N/A 0.403 0.065 0.822 9e-27
Q54JJ6 905 Protein HID1 OS=Dictyoste yes N/A 0.465 0.066 0.467 8e-09
>sp|Q8IV36|HID1_HUMAN Protein HID1 OS=Homo sapiens GN=HID1 PE=1 SV=1 Back     alignment and function desciption
 Score =  118 bits (296), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 51/62 (82%), Positives = 60/62 (96%)

Query: 62  DNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDY 121
           +NLF+NYLSRIHR+EDFQF+L G +RLL+NPLLQTYLPNSTKKI+FHQELLVLFWK+CD+
Sbjct: 322 ENLFVNYLSRIHREEDFQFILKGIARLLSNPLLQTYLPNSTKKIQFHQELLVLFWKLCDF 381

Query: 122 NK 123
           NK
Sbjct: 382 NK 383




May play an important role in the development of cancers in a broad range of tissues.
Homo sapiens (taxid: 9606)
>sp|Q8R1F6|HID1_MOUSE Protein HID1 OS=Mus musculus GN=Hid1 PE=1 SV=1 Back     alignment and function description
>sp|Q54JJ6|HID1_DICDI Protein HID1 OS=Dictyostelium discoideum GN=hid1 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query129
242008660 790 conserved hypothetical protein [Pediculu 0.914 0.149 0.529 4e-28
332022774 754 UPF0663 transmembrane protein C17orf28 [ 0.906 0.155 0.528 1e-27
307187905 781 Transmembrane protein C17orf28-like prot 0.906 0.149 0.536 1e-27
322791200 543 hypothetical protein SINV_05188 [Solenop 0.542 0.128 0.794 3e-27
350398971 778 PREDICTED: UPF0663 transmembrane protein 0.891 0.147 0.521 4e-27
307195455 783 Transmembrane protein C17orf28 [Harpegna 0.906 0.149 0.550 4e-27
156548268 777 PREDICTED: UPF0663 transmembrane protein 0.906 0.150 0.514 6e-27
345486109 740 PREDICTED: UPF0663 transmembrane protein 0.906 0.158 0.514 6e-27
340712094 778 PREDICTED: UPF0663 transmembrane protein 0.891 0.147 0.521 7e-27
383861178 760 PREDICTED: protein hid-1 homolog [Megach 0.891 0.151 0.528 1e-26
>gi|242008660|ref|XP_002425120.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508794|gb|EEB12382.1| conserved hypothetical protein [Pediculus humanus corporis] Back     alignment and taxonomy information
 Score =  129 bits (324), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 71/134 (52%), Positives = 87/134 (64%), Gaps = 16/134 (11%)

Query: 2   TPAEDVESVPLAQSLLNAIC------------DLLFCPDFTVASNKKFNSRKKMDYILNC 49
           T AE+  ++P+  SLLN +C             LLF      A      + + +   L+ 
Sbjct: 233 TSAENRHALPMFTSLLNTVCAYDPVGLGVPYNHLLFS---DTAEPLVDTALQILIVTLDH 289

Query: 50  TLSFGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQ 109
             S G   + + DNLFINYLSRIHRDEDFQFVL GF+RLLNNPL+QTYLPNSTKK++FHQ
Sbjct: 290 DTSLGDENVSS-DNLFINYLSRIHRDEDFQFVLGGFTRLLNNPLVQTYLPNSTKKVQFHQ 348

Query: 110 ELLVLFWKMCDYNK 123
           ELLV FWK+CDYNK
Sbjct: 349 ELLVFFWKLCDYNK 362




Source: Pediculus humanus corporis

Species: Pediculus humanus

Genus: Pediculus

Family: Pediculidae

Order: Phthiraptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|332022774|gb|EGI63047.1| UPF0663 transmembrane protein C17orf28 [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|307187905|gb|EFN72818.1| Transmembrane protein C17orf28-like protein [Camponotus floridanus] Back     alignment and taxonomy information
>gi|322791200|gb|EFZ15737.1| hypothetical protein SINV_05188 [Solenopsis invicta] Back     alignment and taxonomy information
>gi|350398971|ref|XP_003485368.1| PREDICTED: UPF0663 transmembrane protein C17orf28-like [Bombus impatiens] Back     alignment and taxonomy information
>gi|307195455|gb|EFN77341.1| Transmembrane protein C17orf28 [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|156548268|ref|XP_001601368.1| PREDICTED: UPF0663 transmembrane protein C17orf28 homolog isoform 1 [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|345486109|ref|XP_003425403.1| PREDICTED: UPF0663 transmembrane protein C17orf28 homolog isoform 2 [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|340712094|ref|XP_003394599.1| PREDICTED: UPF0663 transmembrane protein C17orf28-like [Bombus terrestris] Back     alignment and taxonomy information
>gi|383861178|ref|XP_003706063.1| PREDICTED: protein hid-1 homolog [Megachile rotundata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query129
UNIPROTKB|F1PV04 782 C17orf28 "Uncharacterized prot 0.480 0.079 0.822 7.4e-27
UNIPROTKB|F6QK33 787 C19H17orf28 "Uncharacterized p 0.480 0.078 0.822 7.6e-27
UNIPROTKB|Q8IV36 788 HID1 "Protein HID1" [Homo sapi 0.480 0.078 0.822 7.6e-27
MGI|MGI:2445087 788 Hid1 "HID1 domain containing" 0.480 0.078 0.822 7.6e-27
UNIPROTKB|D4A0C3 788 RGD1311422 "Protein RGD1311422 0.480 0.078 0.822 7.6e-27
UNIPROTKB|F1RVV6 796 C17orf28 "Uncharacterized prot 0.480 0.077 0.822 1.6e-26
UNIPROTKB|E1C0V2 796 C18H17orf28 "Uncharacterized p 0.480 0.077 0.806 2.7e-26
ZFIN|ZDB-GENE-040718-239 804 hid1b "HID1 domain containing 0.480 0.077 0.790 2.6e-25
WB|WBGene00001844 729 hid-1 [Caenorhabditis elegans 0.503 0.089 0.641 6.9e-19
RGD|1311422272 Hid1 "HID1 domain containing" 0.310 0.147 0.8 4.8e-17
UNIPROTKB|F1PV04 C17orf28 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
 Score = 294 (108.6 bits), Expect = 7.4e-27, Sum P(2) = 7.4e-27
 Identities = 51/62 (82%), Positives = 60/62 (96%)

Query:    62 DNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLLQTYLPNSTKKIEFHQELLVLFWKMCDY 121
             +NLF+NYLSRIHR+EDFQF+L G +RLL+NPLLQTYLPNSTKKI+FHQELLVLFWK+CD+
Sbjct:   314 ENLFVNYLSRIHREEDFQFILKGIARLLSNPLLQTYLPNSTKKIQFHQELLVLFWKLCDF 373

Query:   122 NK 123
             NK
Sbjct:   374 NK 375


GO:0031001 "response to brefeldin A" evidence=IEA
GO:0006886 "intracellular protein transport" evidence=IEA
GO:0005886 "plasma membrane" evidence=IEA
GO:0005797 "Golgi medial cisterna" evidence=IEA
GO:0000139 "Golgi membrane" evidence=IEA
GO:0000138 "Golgi trans cisterna" evidence=IEA
UNIPROTKB|F6QK33 C19H17orf28 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q8IV36 HID1 "Protein HID1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:2445087 Hid1 "HID1 domain containing" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|D4A0C3 RGD1311422 "Protein RGD1311422" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|F1RVV6 C17orf28 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|E1C0V2 C18H17orf28 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-040718-239 hid1b "HID1 domain containing b" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
WB|WBGene00001844 hid-1 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
RGD|1311422 Hid1 "HID1 domain containing" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q8R1F6HID1_MOUSENo assigned EC number0.82250.40310.0659yesN/A
Q8IV36HID1_HUMANNo assigned EC number0.82250.45730.0748yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query129
pfam12722 813 pfam12722, Hid1, High-temperature-induced dauer-fo 2e-26
pfam09742 659 pfam09742, Dymeclin, Dyggve-Melchior-Clausen syndr 3e-11
pfam12722 813 pfam12722, Hid1, High-temperature-induced dauer-fo 1e-07
>gnl|CDD|221734 pfam12722, Hid1, High-temperature-induced dauer-formation protein Back     alignment and domain information
 Score =  102 bits (256), Expect = 2e-26
 Identities = 46/141 (32%), Positives = 67/141 (47%), Gaps = 26/141 (18%)

Query: 7   VESVPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDY----------ILNCTLS---- 52
            E + L  SLLN +C   + P+        +N+    D           +L   L     
Sbjct: 249 HEVLCLLCSLLNTVCR--YNPNGWKEYGLPYNNSVFKDSREILVEYSLQLLLVMLLYPIP 306

Query: 53  --------FGGLKLGTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPL--LQTYLPNST 102
                   + G K     NLF NYLSR+HR+ED QF+L G +R+L NP+    +YLPNS 
Sbjct: 307 SSTRLSFLYLGSKNSKPKNLFRNYLSRLHREEDLQFILDGLTRILKNPIQANSSYLPNSP 366

Query: 103 KKIEFHQELLVLFWKMCDYNK 123
           K + +  E+L+L W++   NK
Sbjct: 367 KVVPWAPEMLILLWELIQCNK 387


Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor. Length = 813

>gnl|CDD|220375 pfam09742, Dymeclin, Dyggve-Melchior-Clausen syndrome protein Back     alignment and domain information
>gnl|CDD|221734 pfam12722, Hid1, High-temperature-induced dauer-formation protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 129
PF12722 895 Hid1: High-temperature-induced dauer-formation pro 100.0
KOG2226|consensus 786 99.96
PF09742 678 Dymeclin: Dyggve-Melchior-Clausen syndrome protein 98.4
>PF12722 Hid1: High-temperature-induced dauer-formation protein Back     alignment and domain information
Probab=100.00  E-value=1.9e-34  Score=265.64  Aligned_cols=122  Identities=45%  Similarity=0.768  Sum_probs=108.2

Q ss_pred             ccccccchHHHHHHHHHhhccCCCcccccCcccCcchhhHHHHHhhhccccC----------------------------
Q psy8934           5 EDVESVPLAQSLLNAICDLLFCPDFTVASNKKFNSRKKMDYILNCTLSFGGL----------------------------   56 (129)
Q Consensus         5 ~~~~~~pla~~l~~~l~dllf~~~~t~~~~~~~~~~~~~~~i~~~~v~~~~~----------------------------   56 (129)
                      ++++.+|||++||++|+|||||+|||||+...++. +..+.||+.|||+..+                            
T Consensus       151 ~~~~~~pLg~~Ll~al~dLLF~~gFTv~~~~~~~~-~~~~~iWe~GIg~~~~~~~~~~~~dsNR~EvLrLLLtl~S~~lY  229 (895)
T PF12722_consen  151 DDEQSPPLGERLLDALVDLLFCPGFTVPSSSSSPD-KVSYSIWEPGIGTNTSPPSPNKEHDSNRTEVLRLLLTLFSEPLY  229 (895)
T ss_pred             cccCCCCHHHHHHHHHHHHhCCCCCcCCCccCCCC-ccceeeccCccccCCCCCCCCHHHHHHHHHHHHHHHHHhchhhc
Confidence            34448899999999999999999999999983333 4455599999999811                            


Q ss_pred             --------------------------------------------------------------------------------
Q psy8934          57 --------------------------------------------------------------------------------   56 (129)
Q Consensus        57 --------------------------------------------------------------------------------   56 (129)
                                                                                                      
T Consensus       230 ~~~~~~~~~~~~l~~l~s~~~~~~~l~L~~SLlNt~~~y~p~~~~~~~lpY~~~~~~d~r~~lV~~slQlL~vlL~y~~~  309 (895)
T PF12722_consen  230 QPPSLSSTGNKFLTYLVSCTPRQLVLTLLCSLLNTVCRYDPSGNEEWGLPYNHLVFSDSREPLVTYSLQLLNVLLDYPPP  309 (895)
T ss_pred             CCCccCCCCCeeeeeeccCcchhhhHHHHHHHHHHHHhcCCCCccccccccccccccCcccHHHHHHHHHHHHheeCCCC
Confidence                                                                                            


Q ss_pred             ---CC----------CCchhHHHHHHHhccCchhHHHHHHHHHHHhhchhh------------cccCCCCccchhhHHHH
Q psy8934          57 ---KL----------GTQDNLFINYLSRIHRDEDFQFVLAGFSRLLNNPLL------------QTYLPNSTKKIEFHQEL  111 (129)
Q Consensus        57 ---~~----------~~~~N~FR~yLsrLHR~eDf~FIl~glsriL~~Pl~------------~syLP~s~k~i~~~~El  111 (129)
                         ..          .+++|+||+|||||||++||+||++||+|||++|++            ++|+|++++++.|++|+
T Consensus       310 ~~~~~~~~~~~~~~~~~~~N~~r~yl~~lhr~~D~~fi~~gl~~iL~~pl~~~~~~es~~f~~~~~lp~s~~~~~~~~e~  389 (895)
T PF12722_consen  310 SQTNSSFSSETNLTSSKPKNLFRNYLSKLHREEDFQFILDGLTRILNNPLQSAIEQESNPFSSSSYLPGSQKPSPWAQEL  389 (895)
T ss_pred             CCCccccccccCcCCCCcchHHHHHHhccCcHHHHHHHHHHHHHHHHhhhhhhhhccccccccccccccccccchHHHHH
Confidence               01          688999999999999999999999999999999999            58999999999999999


Q ss_pred             HHHHHHHhhccccccc
Q psy8934         112 LVLFWKMCDYNKHNKE  127 (129)
Q Consensus       112 lmL~Wellq~NKrFr~  127 (129)
                      +||||++|||||+|+.
T Consensus       390 l~l~W~l~~~Nk~F~~  405 (895)
T PF12722_consen  390 LMLFWELIQCNKRFRN  405 (895)
T ss_pred             HHHHHHHHhhCHHHHH
Confidence            9999999999999985



>KOG2226|consensus Back     alignment and domain information
>PF09742 Dymeclin: Dyggve-Melchior-Clausen syndrome protein; InterPro: IPR019142 Dymeclin (Dyggve-Melchior-Clausen syndrome protein) contains a large number of leucine and isoleucine residues and a total of 17 repeated dileucine motifs Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00