Psyllid ID: psy6694


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130------
MLRLLLVCTQEVTEEQSDVTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPVENARLNPSYTTVKPPAHHPFLKRPILILKAMVPEVKVLTKLRVR
cccEEEEcccccccccccccEEEEEEEccccccEEEEEEcccccEEEEEEccccEEEEEEEEEcccccEEEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccEEcEEEEcc
ccEEEEEEcccccccccccEEEEEEccccccccEEEEEEcccccEEEEEcccccEEEEEEEEEcccccEEEEEEEEccccccEcccccccccccccccccHcccccccccccccccEEcccccccccEEEEEEEEc
MLRLLLVCTQEVTEEQSDVTILRQINrlnddgsytfgyeaadgsfkietrdvggnvkgmfgfvdengelkrisysarngtgfqasgtiatpvenarlnpsyttvkppahhpflkrpILILKAMVPEVKVLTKLRVR
MLRLLLVCTqevteeqsdvtilrqinrlnddgsYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPVENARLNPSYTTVKPPAHHPFLKRPILILKAMVPEVKVLTKLRVR
MLRLLLVCTQEVTEEQSDVTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPVENARLNPSYTTVKPPAHHPFLKRPILILKAMVPEVKVLTKLRVR
***LLLVCTQEVTEEQSDVTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPVENARLNPSYTTVKPPAHHPFLKRPILILKAMVPEVKVLTK****
MLRLLL*CTQEVT***********INRLNDDGSYTFGYEAADGS*********GNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPV***********VKPPAHHPFLKRPILILKAMVPEVKVLTKL***
MLRLLLVCTQEVTEEQSDVTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPVENARLNPSYTTVKPPAHHPFLKRPILILKAMVPEVKVLTKLRVR
MLRLLLVCTQEVTEEQSDVTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPVENARLNPSYTTVKPPAHHPFLKRPILILKAMVPEVKVLTKLRVR
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MLRLLLVCTQEVTEEQSDVTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARNGTGFQASGTIATPVENARLNPSYTTVKPPAHHPFLKRPILILKAMVPEVKVLTKLRVR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query136
242013789 733 hypothetical protein Phum_PHUM331820 [Pe 0.573 0.106 0.782 1e-30
328709224 500 PREDICTED: hypothetical protein LOC10057 0.514 0.14 0.771 4e-27
332021193 704 hypothetical protein G5I_10142 [Acromyrm 0.720 0.139 0.541 6e-25
307213608 716 hypothetical protein EAI_11787 [Harpegna 0.735 0.139 0.508 2e-23
322791509 715 hypothetical protein SINV_01405 [Solenop 0.720 0.137 0.518 2e-23
307185615 1206 hypothetical protein EAG_14231 [Camponot 0.720 0.081 0.457 4e-21
157131431 749 hypothetical protein AaeL_AAEL012088 [Ae 0.757 0.137 0.432 1e-20
321478861220 hypothetical protein DAPPUDRAFT_303105 [ 0.808 0.5 0.451 3e-20
223671090201 TPA: putative cuticle protein [Bombyx mo 0.470 0.318 0.656 6e-19
170063815 802 conserved hypothetical protein [Culex qu 0.558 0.094 0.531 1e-18
>gi|242013789|ref|XP_002427583.1| hypothetical protein Phum_PHUM331820 [Pediculus humanus corporis] gi|212511998|gb|EEB14845.1| hypothetical protein Phum_PHUM331820 [Pediculus humanus corporis] Back     alignment and taxonomy information
 Score =  137 bits (345), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 61/78 (78%), Positives = 70/78 (89%)

Query: 14  EEQSDVTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRIS 73
           E Q  V IL+QINR+NDDGSYTFGYEAADGSFKIETRDV GNVKGMFGF++E+GELKR+S
Sbjct: 67  ETQDQVAILKQINRVNDDGSYTFGYEAADGSFKIETRDVNGNVKGMFGFINEDGELKRVS 126

Query: 74  YSARNGTGFQASGTIATP 91
           YSA NGTGFQ++GT+  P
Sbjct: 127 YSASNGTGFQSTGTLNIP 144




Source: Pediculus humanus corporis

Species: Pediculus humanus

Genus: Pediculus

Family: Pediculidae

Order: Phthiraptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|328709224|ref|XP_003243902.1| PREDICTED: hypothetical protein LOC100572094 [Acyrthosiphon pisum] Back     alignment and taxonomy information
>gi|332021193|gb|EGI61578.1| hypothetical protein G5I_10142 [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|307213608|gb|EFN88994.1| hypothetical protein EAI_11787 [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|322791509|gb|EFZ15900.1| hypothetical protein SINV_01405 [Solenopsis invicta] Back     alignment and taxonomy information
>gi|307185615|gb|EFN71553.1| hypothetical protein EAG_14231 [Camponotus floridanus] Back     alignment and taxonomy information
>gi|157131431|ref|XP_001662245.1| hypothetical protein AaeL_AAEL012088 [Aedes aegypti] gi|108871541|gb|EAT35766.1| AAEL012088-PA [Aedes aegypti] Back     alignment and taxonomy information
>gi|321478861|gb|EFX89818.1| hypothetical protein DAPPUDRAFT_303105 [Daphnia pulex] Back     alignment and taxonomy information
>gi|223671090|tpd|FAA00496.1| TPA: putative cuticle protein [Bombyx mori] Back     alignment and taxonomy information
>gi|170063815|ref|XP_001867267.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167881318|gb|EDS44701.1| conserved hypothetical protein [Culex quinquefasciatus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query136
FB|FBgn0039480 366 Cpr97Ea "Cuticular protein 97E 0.713 0.265 0.457 1e-16
FB|FBgn0039481235 Cpr97Eb "Cuticular protein 97E 0.625 0.361 0.466 3.7e-16
FB|FBgn0036680 597 Cpr73D "Cuticular protein 73D" 0.536 0.122 0.346 4.2e-07
FB|FBgn0039805241 Cpr100A "Cuticular protein 100 0.580 0.327 0.353 4.3e-07
FB|FBgn0038819 381 Cpr92F "Cuticular protein 92F" 0.661 0.236 0.336 4.2e-06
FB|FBgn0002534112 Lcp3 "Larval cuticle protein 3 0.654 0.794 0.301 4.3e-06
FB|FBgn0033598214 Cpr47Eb "Cuticular protein 47E 0.632 0.401 0.343 5.3e-06
FB|FBgn0036618217 Cpr72Eb "Cuticular protein 72E 0.522 0.327 0.328 1.6e-05
FB|FBgn0002535112 Lcp4 "Larval cuticle protein 4 0.654 0.794 0.311 2.4e-05
FB|FBgn0038405354 CG8927 [Drosophila melanogaste 0.477 0.183 0.369 2.7e-05
FB|FBgn0039480 Cpr97Ea "Cuticular protein 97Ea" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 210 (79.0 bits), Expect = 1.0e-16, P = 1.0e-16
 Identities = 49/107 (45%), Positives = 65/107 (60%)

Query:    19 VTILRQINRLNDDGSYTFGYEAADGSFKIETRDVGGNVKGMFGFVDENGELKRISYSARN 78
             V IL+QIN+ N+DGS T+GYE ADGSFKIET+   G VKG +G+VDE G+++ + Y A N
Sbjct:    57 VAILKQINKHNEDGSCTYGYEGADGSFKIETKLATGEVKGKYGYVDETGKVRVVEYGA-N 115

Query:    79 GTGFQASG---TIATPV---ENARLNPSYTTVKPPAH--HPF-LKRP 116
               GF  SG   T+A P    E  +  P Y     P     P+ ++RP
Sbjct:   116 KYGFLPSGEGITVAPPTLVDETLKEEPDYADEPAPQRPQKPYRVQRP 162




GO:0005214 "structural constituent of chitin-based cuticle" evidence=ISS
FB|FBgn0039481 Cpr97Eb "Cuticular protein 97Eb" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0036680 Cpr73D "Cuticular protein 73D" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0039805 Cpr100A "Cuticular protein 100A" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0038819 Cpr92F "Cuticular protein 92F" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0002534 Lcp3 "Larval cuticle protein 3" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0033598 Cpr47Eb "Cuticular protein 47Eb" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0036618 Cpr72Eb "Cuticular protein 72Eb" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0002535 Lcp4 "Larval cuticle protein 4" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
FB|FBgn0038405 CG8927 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query136
pfam0037951 pfam00379, Chitin_bind_4, Insect cuticle protein 7e-10
>gnl|CDD|215886 pfam00379, Chitin_bind_4, Insect cuticle protein Back     alignment and domain information
 Score = 50.7 bits (122), Expect = 7e-10
 Identities = 23/52 (44%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 34 YTFGYEAADGSFKIETR---DVGGNVKGMFGFVDENGELKRISYSARNGTGF 82
          Y+FGYE +DG  + E R   D GG VKG + +VD +G+L+ ++Y A +  GF
Sbjct: 1  YSFGYETSDGKTQEEGRGTEDDGGVVKGSYSYVDPDGKLRTVTYVA-DENGF 51


Many insect cuticular proteins include a 35-36 amino acid motif known as the R&R consensus. The extensive conservation of this region led to the suggestion that it functions to bind chitin. Provocatively, it has no sequence similarity to the well-known cysteine-containing chitin-binding domain found in chitinases and some peritrophic membrane proteins. Chitin binding has been shown experimentally for this region. Thus arthropods have two distinct classes of chitin binding proteins, those with the chitin-binding domain found in lectins, chitinases and peritrophic membranes (cysCBD) and those with the cuticular protein chitin-binding domain (non-cysCBD). Length = 51

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 136
PF0037952 Chitin_bind_4: Insect cuticle protein; InterPro: I 99.67
>PF00379 Chitin_bind_4: Insect cuticle protein; InterPro: IPR000618 Insect cuticle is composed of proteins and chitin Back     alignment and domain information
Probab=99.67  E-value=2.2e-16  Score=99.91  Aligned_cols=47  Identities=40%  Similarity=0.916  Sum_probs=43.3

Q ss_pred             eeEEEecCCCCEEEEee-----CCCCeEEEEEEEECCCCcEEEEEEEeeCCCCc
Q psy6694          34 YTFGYEAADGSFKIETR-----DVGGNVKGMFGFVDENGELKRISYSARNGTGF   82 (136)
Q Consensus        34 Y~f~yet~dG~~~~E~g-----~~~~~v~GsYsy~~pdG~~~~V~Y~AD~e~Gf   82 (136)
                      |+|+|+++|| .++|++     +..++|+|+|+|++|||+.++|+|+|| ++||
T Consensus         1 Y~f~Y~~~dg-~~~e~~~~~~~~~~~~v~GsY~y~~pdG~~~~V~Y~Ad-~~Gf   52 (52)
T PF00379_consen    1 YSFGYETSDG-SRQEEGRPETEDEGGVVRGSYSYIDPDGQTRTVTYVAD-ENGF   52 (52)
T ss_pred             CeeEeEcCCC-CEEEEEEcccCCCCCEEEEEEEEECCCCCEEEEEEECC-CCCC
Confidence            7999999999 666666     488999999999999999999999999 9998



The cuticular proteins seem to be specific to the type of cuticle (flexible or stiff) that occur at stages of the insect development. The proteins found in the flexible cuticle of larva and pupa of different insects share a conserved C-terminal section [] such a region is also found in the soft endocuticle of adults insects [] as well as in other cuticular proteins including in arachnids []. In addition, cuticular proteins share hydrophobic regions dominated by tetrapeptide repeats (A-A-P-A/V), which are presumed to be functionally important [, ]. Many insect cuticle proteins also include a 35-36 amino acid motif known as the R and R consensus. An extended form of this motif has been shown [] to bind chitin. It has no sequence similiarity to the cysteine-containing chitin-binding domain of chitinases and some peritrophic membrane proteins, suggesting that arthropods have two distinct classes of chitin-binding proteins, those with the chitin-binding domains found in lectins, chitinases and peritrophic membranes (cysCBD), and those with the type of chitin-binding domains found in cuticular proteins (non-cysCBD) []. The cuticle protein signature has been found in locust cuticle proteins 7 (LM-7), 8 (LM-8), 19 (LM-19) and endocuticle structural glycoprotein ABD-4; Hyalophora cecropia (Cecropia moth) cuticle proteins 12 and 66; Drosophila melanogaster (Fruit fly) larval cuticles proteins I, II, III and IV (LCP1 to LCP4); drosophila pupal cuticle proteins PCP, EDG-78E and EDG-84E; Manduca sexta (Tobacco hawkmoth) cuticle protein LCP-14; Tenebrio molitor (Yellow mealworm) cuticle proteins ACP-20, A1A, A2B and A3A; and Araneus diadematus (Spider) cuticle proteins ACP 11.9, ACP 12.4, ACP 12.6, ACP 15.5 and ACP 15.7.; GO: 0042302 structural constituent of cuticle


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00