Psyllid ID: psy10193
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 135 | ||||||
| 47605413 | 135 | RecName: Full=Endocuticle structural gly | 0.770 | 0.770 | 0.561 | 7e-27 | |
| 389608145 | 145 | cuticular protein PxutCPR32 [Papilio xut | 0.688 | 0.641 | 0.634 | 1e-26 | |
| 357623617 | 144 | cuticular protein RR-1 motif 32 [Danaus | 0.688 | 0.645 | 0.634 | 4e-26 | |
| 290563462 | 154 | cuticular protein RR-1 motif 32 precurso | 0.688 | 0.603 | 0.602 | 4e-25 | |
| 354549523 | 154 | cuticular protein RR-1 motif 32 [Anthera | 0.688 | 0.603 | 0.612 | 6e-25 | |
| 312378093 | 158 | hypothetical protein AND_10432 [Anophele | 0.703 | 0.601 | 0.536 | 1e-22 | |
| 91083861 | 164 | PREDICTED: similar to cuticular protein | 0.725 | 0.597 | 0.540 | 3e-22 | |
| 166947671 | 145 | putative cuticle protein CP5 [Leptinotar | 0.792 | 0.737 | 0.512 | 3e-22 | |
| 312378095 | 190 | hypothetical protein AND_10431 [Anophele | 0.844 | 0.6 | 0.466 | 5e-22 | |
| 58388110 | 156 | AGAP006012-PA [Anopheles gambiae str. PE | 0.725 | 0.628 | 0.510 | 7e-22 |
| >gi|47605413|sp|Q7M4F3.1|CUD2_SCHGR RecName: Full=Endocuticle structural glycoprotein SgAbd-2 | Back alignment and taxonomy information |
|---|
Score = 125 bits (313), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 59/105 (56%), Positives = 79/105 (75%), Gaps = 1/105 (0%)
Query: 11 APPASP-IPIISYVNEPPVDGTYKFAFETGNGIAVQEQGALKNAGQKDLEAQTAQGQSSY 69
+PPA +PI+ Y NE DG+Y ++++TGNGIA QEQG LKN GQ+DLEA+ QG SY
Sbjct: 13 SPPARQQVPILQYSNEVNPDGSYAYSYQTGNGIAAQEQGYLKNPGQRDLEAENVQGTFSY 72
Query: 70 TSPDGTPIQTQWYADETGYHASGAHLPTPPPIPDEIAKAIATLPK 114
T+PDGTPI ++ ADE G+ A GAHLPTPPPIP+ IA+++ + +
Sbjct: 73 TAPDGTPISLRYVADENGFRAEGAHLPTPPPIPEAIARSLEVIAR 117
|
Source: Schistocerca gregaria Species: Schistocerca gregaria Genus: Schistocerca Family: Acrididae Order: Orthoptera Class: Insecta Phylum: Arthropoda Superkingdom: Eukaryota |
| >gi|389608145|dbj|BAM17684.1| cuticular protein PxutCPR32 [Papilio xuthus] | Back alignment and taxonomy information |
|---|
| >gi|357623617|gb|EHJ74702.1| cuticular protein RR-1 motif 32 [Danaus plexippus] | Back alignment and taxonomy information |
|---|
| >gi|290563462|ref|NP_001166719.1| cuticular protein RR-1 motif 32 precursor [Bombyx mori] gi|223671166|tpd|FAA00535.1| TPA: putative cuticle protein [Bombyx mori] | Back alignment and taxonomy information |
|---|
| >gi|354549523|gb|AER27818.1| cuticular protein RR-1 motif 32 [Antheraea yamamai] | Back alignment and taxonomy information |
|---|
| >gi|312378093|gb|EFR24758.1| hypothetical protein AND_10432 [Anopheles darlingi] | Back alignment and taxonomy information |
|---|
| >gi|91083861|ref|XP_974125.1| PREDICTED: similar to cuticular protein 80, RR-1 family (AGAP009878-PA) [Tribolium castaneum] gi|270006758|gb|EFA03206.1| hypothetical protein TcasGA2_TC013126 [Tribolium castaneum] | Back alignment and taxonomy information |
|---|
| >gi|166947671|gb|ABZ04122.1| putative cuticle protein CP5 [Leptinotarsa decemlineata] | Back alignment and taxonomy information |
|---|
| >gi|312378095|gb|EFR24759.1| hypothetical protein AND_10431 [Anopheles darlingi] | Back alignment and taxonomy information |
|---|
| >gi|58388110|ref|XP_316045.2| AGAP006012-PA [Anopheles gambiae str. PEST] gi|55238815|gb|EAA11695.3| AGAP006012-PA [Anopheles gambiae str. PEST] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 135 | ||||||
| FB|FBgn0050045 | 144 | Cpr49Aa "Cuticular protein 49A | 0.740 | 0.694 | 0.514 | 1.6e-22 | |
| FB|FBgn0035686 | 239 | Cpr65Az "Cuticular protein 65A | 0.622 | 0.351 | 0.559 | 3e-21 | |
| FB|FBgn0033603 | 610 | Cpr47Ef "Cuticular protein 47E | 0.822 | 0.181 | 0.448 | 1.4e-19 | |
| FB|FBgn0036110 | 122 | Cpr67Fb "Cuticular protein 67F | 0.592 | 0.655 | 0.516 | 1.6e-17 | |
| FB|FBgn0033597 | 135 | Cpr47Ea "Cuticular protein 47E | 0.533 | 0.533 | 0.534 | 2.5e-17 | |
| FB|FBgn0033731 | 190 | Cpr49Ah "Cuticular protein 49A | 0.829 | 0.589 | 0.401 | 1.8e-16 | |
| FB|FBgn0033728 | 134 | Cpr49Ae "Cuticular protein 49A | 0.659 | 0.664 | 0.483 | 3.7e-16 | |
| FB|FBgn0037068 | 140 | Cpr78Cb "Cuticular protein 78C | 0.637 | 0.614 | 0.427 | 5.4e-15 | |
| FB|FBgn0035737 | 127 | Cpr65Ec "Cuticular protein 65E | 0.585 | 0.622 | 0.443 | 3e-14 | |
| FB|FBgn0037069 | 119 | Cpr78Cc "Cuticular protein 78C | 0.570 | 0.647 | 0.436 | 3e-14 |
| FB|FBgn0050045 Cpr49Aa "Cuticular protein 49Aa" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
Score = 261 (96.9 bits), Expect = 1.6e-22, P = 1.6e-22
Identities = 55/107 (51%), Positives = 66/107 (61%)
Query: 29 DGTYKFAFETGNGIAVQEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGY 88
DG+YK+ +ETGNGI +E+G LKN G D Q AQG SYTSP+G PI+ + ADE G+
Sbjct: 43 DGSYKYLYETGNGINAEEEGYLKNPGT-DNAGQVAQGSFSYTSPEGIPIRITYLADENGF 101
Query: 89 HASGAHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAPGRGFGRK 135
G HLPTPPPIP I KA+A L P PQ PG GF +
Sbjct: 102 QPQGDHLPTPPPIPPAIQKALAYLATA-----PPPPQEQPG-GFNNR 142
|
|
| FB|FBgn0035686 Cpr65Az "Cuticular protein 65Az" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0033603 Cpr47Ef "Cuticular protein 47Ef" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0036110 Cpr67Fb "Cuticular protein 67Fb" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0033597 Cpr47Ea "Cuticular protein 47Ea" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0033731 Cpr49Ah "Cuticular protein 49Ah" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0033728 Cpr49Ae "Cuticular protein 49Ae" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0037068 Cpr78Cb "Cuticular protein 78Cb" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0035737 Cpr65Ec "Cuticular protein 65Ec" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
| FB|FBgn0037069 Cpr78Cc "Cuticular protein 78Cc" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
Prediction of Functionally Associated Proteins
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 135 | |||
| pfam00379 | 51 | pfam00379, Chitin_bind_4, Insect cuticle protein | 4e-11 |
| >gnl|CDD|215886 pfam00379, Chitin_bind_4, Insect cuticle protein | Back alignment and domain information |
|---|
Score = 53.8 bits (130), Expect = 4e-11
Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 6/57 (10%)
Query: 32 YKFAFETGNGIAVQEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGY 88
Y F +ET +G +E ++ G +G SY PDG + ADE G+
Sbjct: 1 YSFGYETSDGKTQEEGRGTEDDGG------VVKGSYSYVDPDGKLRTVTYVADENGF 51
|
Many insect cuticular proteins include a 35-36 amino acid motif known as the R&R consensus. The extensive conservation of this region led to the suggestion that it functions to bind chitin. Provocatively, it has no sequence similarity to the well-known cysteine-containing chitin-binding domain found in chitinases and some peritrophic membrane proteins. Chitin binding has been shown experimentally for this region. Thus arthropods have two distinct classes of chitin binding proteins, those with the chitin-binding domain found in lectins, chitinases and peritrophic membranes (cysCBD) and those with the cuticular protein chitin-binding domain (non-cysCBD). Length = 51 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 135 | |||
| PF00379 | 52 | Chitin_bind_4: Insect cuticle protein; InterPro: I | 99.71 | |
| PF06004 | 50 | DUF903: Bacterial protein of unknown function (DUF | 88.34 |
| >PF00379 Chitin_bind_4: Insect cuticle protein; InterPro: IPR000618 Insect cuticle is composed of proteins and chitin | Back alignment and domain information |
|---|
Probab=99.71 E-value=4e-17 Score=103.43 Aligned_cols=52 Identities=37% Similarity=0.700 Sum_probs=46.3
Q ss_pred eeeEEEcCCCceEEEeeeeecCCccccccceeeeEEEEECCCCcEEEEEEEeCCCCc
Q psy10193 32 YKFAFETGNGIAVQEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGY 88 (135)
Q Consensus 32 Y~f~Ye~~dG~~~~E~G~~k~~G~~~~~~~~v~GsYsy~~pDG~~~~V~YtADenGf 88 (135)
|+|+|+++|| .++|.+. .++ ++..++|+|+|+|++|||+.++|+|+||++||
T Consensus 1 Y~f~Y~~~dg-~~~e~~~---~~~-~~~~~~v~GsY~y~~pdG~~~~V~Y~Ad~~Gf 52 (52)
T PF00379_consen 1 YSFGYETSDG-SRQEEGR---PET-EDEGGVVRGSYSYIDPDGQTRTVTYVADENGF 52 (52)
T ss_pred CeeEeEcCCC-CEEEEEE---ccc-CCCCCEEEEEEEEECCCCCEEEEEEECCCCCC
Confidence 7899999999 8888877 222 56778999999999999999999999999998
|
The cuticular proteins seem to be specific to the type of cuticle (flexible or stiff) that occur at stages of the insect development. The proteins found in the flexible cuticle of larva and pupa of different insects share a conserved C-terminal section [] such a region is also found in the soft endocuticle of adults insects [] as well as in other cuticular proteins including in arachnids []. In addition, cuticular proteins share hydrophobic regions dominated by tetrapeptide repeats (A-A-P-A/V), which are presumed to be functionally important [, ]. Many insect cuticle proteins also include a 35-36 amino acid motif known as the R and R consensus. An extended form of this motif has been shown [] to bind chitin. It has no sequence similiarity to the cysteine-containing chitin-binding domain of chitinases and some peritrophic membrane proteins, suggesting that arthropods have two distinct classes of chitin-binding proteins, those with the chitin-binding domains found in lectins, chitinases and peritrophic membranes (cysCBD), and those with the type of chitin-binding domains found in cuticular proteins (non-cysCBD) []. The cuticle protein signature has been found in locust cuticle proteins 7 (LM-7), 8 (LM-8), 19 (LM-19) and endocuticle structural glycoprotein ABD-4; Hyalophora cecropia (Cecropia moth) cuticle proteins 12 and 66; Drosophila melanogaster (Fruit fly) larval cuticles proteins I, II, III and IV (LCP1 to LCP4); drosophila pupal cuticle proteins PCP, EDG-78E and EDG-84E; Manduca sexta (Tobacco hawkmoth) cuticle protein LCP-14; Tenebrio molitor (Yellow mealworm) cuticle proteins ACP-20, A1A, A2B and A3A; and Araneus diadematus (Spider) cuticle proteins ACP 11.9, ACP 12.4, ACP 12.6, ACP 15.5 and ACP 15.7.; GO: 0042302 structural constituent of cuticle |
| >PF06004 DUF903: Bacterial protein of unknown function (DUF903); InterPro: IPR010305 This family consists of several small bacterial proteins several of which are classified as putative lipoproteins | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 135 | |||
| 2rd1_A | 62 | Putative outer membrane lipoprotein; X-RAY, NESG, | 87.96 | |
| 2k57_A | 61 | Putative lipoprotein; structural genomics, PSI-2, | 86.92 | |
| 3bdu_A | 62 | Putative lipoprotein; X_RAY, NESG, Q6D8G1, structu | 85.96 | |
| 2rb6_A | 61 | Uncharacterized protein; NESG, Q8EI81_sheon, struc | 85.85 | |
| 2jn0_A | 61 | Hypothetical lipoprotein YGDR; solution structure, | 83.19 | |
| 2ra2_A | 64 | Putative lipoprotein; X-RAY, NESG, STR88A, Q7CPV8, | 82.17 |
| >2rd1_A Putative outer membrane lipoprotein; X-RAY, NESG, Q7CQI7, STR87A, structural genomics, PSI-2, protein structure initiative; 2.30A {Salmonella typhimurium LT2} SCOP: b.38.1.6 | Back alignment and structure |
|---|
Probab=87.96 E-value=1.2 Score=28.52 Aligned_cols=39 Identities=26% Similarity=0.450 Sum_probs=29.9
Q ss_pred eEEEcCCCceEEEeeeeecCCccccccceeeeEEEEECCCCcEEEEE
Q psy10193 34 FAFETGNGIAVQEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQ 80 (135)
Q Consensus 34 f~Ye~~dG~~~~E~G~~k~~G~~~~~~~~v~GsYsy~~pDG~~~~V~ 80 (135)
|--.+.||..-...|.++-. .-.|.|+|.+.+|+.+.|+
T Consensus 5 yvitt~DG~~IvT~gKP~~D--------~dTGm~sY~D~~G~~~qIn 43 (62)
T 2rd1_A 5 YVMTTKNGQTIVTQGKPQLD--------KETGMTSYTDQEGNQREIN 43 (62)
T ss_dssp EEEEETTSCEEEEESCCEEE--------TTTTEEEEECTTSCEEEEE
T ss_pred eEEEeCCCcEEEcCCCccee--------CCCCCEEEEcCCCCEEEEc
Confidence 44567888888777776532 3489999999999998875
|
| >2k57_A Putative lipoprotein; structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium, NESG; NMR {Pseudomonas syringae PV} SCOP: b.38.1.6 | Back alignment and structure |
|---|
| >3bdu_A Putative lipoprotein; X_RAY, NESG, Q6D8G1, structural genomics, PSI-2, protein structure initiative; 1.90A {Pectobacterium atrosepticum SCRI1043} SCOP: b.38.1.6 | Back alignment and structure |
|---|
| >2rb6_A Uncharacterized protein; NESG, Q8EI81_sheon, structural genomics, PSI-2, protein STRU initiative, northeast structural genomics consortium; 2.50A {Shewanella oneidensis} SCOP: b.38.1.6 | Back alignment and structure |
|---|
| >2jn0_A Hypothetical lipoprotein YGDR; solution structure, PSI-2 target, structural genomics, protein structure initiative; NMR {Escherichia coli} SCOP: b.38.1.6 PDB: 3fif_A | Back alignment and structure |
|---|
| >2ra2_A Putative lipoprotein; X-RAY, NESG, STR88A, Q7CPV8, structural genomics, PSI-2, protein structure initiative; 1.90A {Salmonella typhimurium LT2} SCOP: b.38.1.6 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 135 | |||
| d2rd1a1 | 52 | Putative outer membrane lipoprotein STM1585 {Salmo | 86.8 | |
| d2rb6a1 | 52 | Uncharacterized protein SO0963 {Shewanella oneiden | 84.79 | |
| d2ra2a1 | 53 | Uncharacterized protein YgdI {Salmonella typhimuri | 84.49 | |
| d2jn0a1 | 50 | Hypothetical lipoprotein YgdR {Escherichia coli [T | 83.39 | |
| d3bdua1 | 53 | Uncharacterized protein ECA1013 {Erwinia carotovor | 82.19 |
| >d2rd1a1 b.38.1.6 (A:22-73) Putative outer membrane lipoprotein STM1585 {Salmonella typhimurium [TaxId: 90371]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: Sm-like fold superfamily: Sm-like ribonucleoproteins family: YgdI/YgdR-like domain: Putative outer membrane lipoprotein STM1585 species: Salmonella typhimurium [TaxId: 90371]
Probab=86.80 E-value=0.8 Score=26.96 Aligned_cols=39 Identities=26% Similarity=0.450 Sum_probs=29.7
Q ss_pred eEEEcCCCceEEEeeeeecCCccccccceeeeEEEEECCCCcEEEEE
Q psy10193 34 FAFETGNGIAVQEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQ 80 (135)
Q Consensus 34 f~Ye~~dG~~~~E~G~~k~~G~~~~~~~~v~GsYsy~~pDG~~~~V~ 80 (135)
|--.+.||......|..+-. .-.|-|+|.+.+|+...++
T Consensus 3 yvm~t~dGr~I~t~gKP~~D--------~dTGm~sY~d~~G~~~qIN 41 (52)
T d2rd1a1 3 YVMTTKNGQTIVTQGKPQLD--------KETGMTSYTDQEGNQREIN 41 (52)
T ss_dssp EEEEETTSCEEEEESCCEEE--------TTTTEEEEECTTSCEEEEE
T ss_pred eEEEeCCCcEEEcCCCcccc--------CCCCCEEEEcCCCCEEEeC
Confidence 45567888887777766532 2489999999999988764
|
| >d2rb6a1 b.38.1.6 (A:25-76) Uncharacterized protein SO0963 {Shewanella oneidensis [TaxId: 70863]} | Back information, alignment and structure |
|---|
| >d2ra2a1 b.38.1.6 (A:4-56) Uncharacterized protein YgdI {Salmonella typhimurium [TaxId: 90371]} | Back information, alignment and structure |
|---|
| >d2jn0a1 b.38.1.6 (A:4-53) Hypothetical lipoprotein YgdR {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|
| >d3bdua1 b.38.1.6 (A:2-54) Uncharacterized protein ECA1013 {Erwinia carotovora [TaxId: 554]} | Back information, alignment and structure |
|---|