Psyllid ID: psy11499


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70
IVGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDKY
ccEEcccccccccccccccccccccccccccccEEEEEccccEEccccccEEEEcccccccccccccccc
ccccccccccccccccccccccccccccccccEEEEEEccccEEEEcccccEEcccccccccHccccccc
ivgfkcpdsvdphsvaakfwpyprfpvpgdrerlitcvdghprliscgdgklfdeasltclgpedytdky
ivgfkcpdsvdphsvaakfwpyprfpvpgDRERLITCVDGHPRLIscgdgklfdeasltclgpedytdky
IVGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDKY
************HSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLG********
IVGFKCPDSVDP*SVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDK*
IVGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDKY
*VGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDY**K*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
IVGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDKY
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query70
322801955 264 hypothetical protein SINV_11948 [Solenop 0.928 0.246 0.769 3e-22
307187917 231 hypothetical protein EAG_00911 [Camponot 0.928 0.281 0.784 6e-22
347964890 228 AGAP000986-PA [Anopheles gambiae str. PE 0.928 0.285 0.738 2e-21
350405633 232 PREDICTED: hypothetical protein LOC10074 0.928 0.280 0.753 2e-21
340711124 232 PREDICTED: hypothetical protein LOC10064 0.928 0.280 0.753 2e-21
288869485 232 cuticular protein analogous to peritroph 0.928 0.280 0.753 3e-21
332027355 384 hypothetical protein G5I_04084 [Acromyrm 0.928 0.169 0.753 3e-21
195345919 230 GM23015 [Drosophila sechellia] gi|194134 0.914 0.278 0.734 6e-21
195567827 230 GD17481 [Drosophila simulans] gi|1942048 0.914 0.278 0.734 6e-21
194893361 230 GG19276 [Drosophila erecta] gi|190649511 0.914 0.278 0.734 6e-21
>gi|322801955|gb|EFZ22502.1| hypothetical protein SINV_11948 [Solenopsis invicta] Back     alignment and taxonomy information
 Score =  108 bits (271), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 50/65 (76%), Positives = 55/65 (84%)

Query: 1   IVGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTC 60
           IVGFKCP  + PHS AAKFWPYPRFPVP D  RLITCVDGHPRL++CG+GKLFD  SLTC
Sbjct: 197 IVGFKCPHKLPPHSAAAKFWPYPRFPVPSDCGRLITCVDGHPRLLTCGEGKLFDSVSLTC 256

Query: 61  LGPED 65
           L PE+
Sbjct: 257 LDPEE 261




Source: Solenopsis invicta

Species: Solenopsis invicta

Genus: Solenopsis

Family: Formicidae

Order: Hymenoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|307187917|gb|EFN72830.1| hypothetical protein EAG_00911 [Camponotus floridanus] Back     alignment and taxonomy information
>gi|347964890|ref|XP_560209.2| AGAP000986-PA [Anopheles gambiae str. PEST] gi|333466525|gb|EAL41678.2| AGAP000986-PA [Anopheles gambiae str. PEST] Back     alignment and taxonomy information
>gi|350405633|ref|XP_003487500.1| PREDICTED: hypothetical protein LOC100741733 [Bombus impatiens] Back     alignment and taxonomy information
>gi|340711124|ref|XP_003394130.1| PREDICTED: hypothetical protein LOC100643152 [Bombus terrestris] Back     alignment and taxonomy information
>gi|288869485|ref|NP_001165850.1| cuticular protein analogous to peritrophins 3-D precursor [Apis mellifera] Back     alignment and taxonomy information
>gi|332027355|gb|EGI67439.1| hypothetical protein G5I_04084 [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|195345919|ref|XP_002039516.1| GM23015 [Drosophila sechellia] gi|194134742|gb|EDW56258.1| GM23015 [Drosophila sechellia] Back     alignment and taxonomy information
>gi|195567827|ref|XP_002107460.1| GD17481 [Drosophila simulans] gi|194204867|gb|EDX18443.1| GD17481 [Drosophila simulans] Back     alignment and taxonomy information
>gi|194893361|ref|XP_001977862.1| GG19276 [Drosophila erecta] gi|190649511|gb|EDV46789.1| GG19276 [Drosophila erecta] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query70
FB|FBgn0022770230 Peritrophin-A "Peritrophin A" 0.914 0.278 0.734 6.8e-24
FB|FBgn0031097237 obst-A "obstructor-A" [Drosoph 0.9 0.265 0.281 0.00026
FB|FBgn0022770 Peritrophin-A "Peritrophin A" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 274 (101.5 bits), Expect = 6.8e-24, P = 6.8e-24
 Identities = 47/64 (73%), Positives = 55/64 (85%)

Query:     1 IVGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTC 60
             +VGFKCP  VDP+SVAA+FWP+PRFPV GD  RLITCV+GHPRLISCG+ K+FDE +LTC
Sbjct:   156 VVGFKCPTKVDPNSVAARFWPFPRFPVAGDCHRLITCVEGHPRLISCGEDKVFDEHTLTC 215

Query:    61 LGPE 64
               PE
Sbjct:   216 EDPE 219




GO:0016490 "structural constituent of peritrophic membrane" evidence=ISS
GO:0008061 "chitin binding" evidence=IEA;RCA;NAS
GO:0006030 "chitin metabolic process" evidence=IEA
GO:0032504 "multicellular organism reproduction" evidence=IEP
GO:0005615 "extracellular space" evidence=IDA
FB|FBgn0031097 obst-A "obstructor-A" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query70
smart0049449 smart00494, ChtBD2, Chitin-binding domain type 2 2e-04
pfam0160753 pfam01607, CBM_14, Chitin binding Peritrophin-A do 0.002
>gnl|CDD|214696 smart00494, ChtBD2, Chitin-binding domain type 2 Back     alignment and domain information
 Score = 35.1 bits (81), Expect = 2e-04
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 24 RFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTC 60
           +P P D  +   C +G P + SC  G +F+ A+ TC
Sbjct: 10 LYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTC 46


Length = 49

>gnl|CDD|216601 pfam01607, CBM_14, Chitin binding Peritrophin-A domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 70
PF0160753 CBM_14: Chitin binding Peritrophin-A domain; Inter 99.6
smart0049456 ChtBD2 Chitin-binding domain type 2. 99.43
PF0342761 CBM_19: Carbohydrate binding domain (family 19); I 89.92
>PF01607 CBM_14: Chitin binding Peritrophin-A domain; InterPro: IPR002557 This entry represents a chitin binding domain [] Back     alignment and domain information
Probab=99.60  E-value=5.6e-16  Score=82.90  Aligned_cols=48  Identities=27%  Similarity=0.573  Sum_probs=36.7

Q ss_pred             CCCCCCccCCcccEEcCCCeeeEeeCCCCCeeecCCCcCCCCCCCCCC
Q psy11499         22 YPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDK   69 (70)
Q Consensus        22 ~~~~p~p~dC~~yy~C~~G~~~~~~Cp~Gl~Fd~~~~~C~~p~~v~~C   69 (70)
                      .++++++.+|++||+|.+|.+.++.||.|++||+.++.|+++++|..|
T Consensus         6 ~~~~~~~~~C~~Y~~C~~g~~~~~~C~~g~~fd~~~~~C~~~~~~~~C   53 (53)
T PF01607_consen    6 DGFYPHPDDCRKYYQCVNGQAVEQRCPEGLYFDPSSQRCVPPSNVVQC   53 (53)
T ss_dssp             SEEE--SS-SSEEEEEETTEEEEEE-TTS-EE-TTTSSEE-TTT-TT-
T ss_pred             CeeEeCCCCCCEEEEeeCCcEECCCCcCCCEECcCcCEEcCCccCCCC
Confidence            457899999999999999999999999999999999999999997666



It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.

>smart00494 ChtBD2 Chitin-binding domain type 2 Back     alignment and domain information
>PF03427 CBM_19: Carbohydrate binding domain (family 19); InterPro: IPR005089 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query70
1dqc_A74 Tachycitin; disulfide-rich, antimicrobial protein; 99.69
1wb0_A445 Chitinase 1, chitotriosidase 1; cyclopentapeptide 99.26
>1dqc_A Tachycitin; disulfide-rich, antimicrobial protein; NMR {Tachypleus tridentatus} SCOP: g.31.1.1 Back     alignment and structure
Probab=99.69  E-value=1.2e-17  Score=96.27  Aligned_cols=46  Identities=15%  Similarity=0.103  Sum_probs=43.7

Q ss_pred             CCCCCccCCcccEEcCCCeeeEeeCCCCCeeecCCCcCCCCCCCCCC
Q psy11499         23 PRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDK   69 (70)
Q Consensus        23 ~~~p~p~dC~~yy~C~~G~~~~~~Cp~Gl~Fd~~~~~C~~p~~v~~C   69 (70)
                      |+|+||.+|++||+|++|.++++.||.||+||++++.|+||++|. |
T Consensus        16 G~~~~p~dC~~fy~C~~G~~~~~~Cp~Gl~Fn~~~~~Cd~p~~v~-C   61 (74)
T 1dqc_A           16 GPNVNLYSCCSFYNCHKCLARLENCPKGLHYNAYLKVCDWPSKAG-C   61 (74)
T ss_dssp             EEECCSSCSSEEEEEETTEEEEEECTTSCEEETTTTEEECTTTTT-C
T ss_pred             CEeCCcccCcceeECCCCcEEEeECcCCCEEChhhCcCcCcccCC-C
Confidence            478999999999999999999999999999999999999999985 5



>1wb0_A Chitinase 1, chitotriosidase 1; cyclopentapeptide inhibitors, chitinase inhibitors, carbohyd metabolism, chitin degradation, chitin-binding; HET: VR0 MEA; 1.65A {Homo sapiens} SCOP: c.1.8.5 d.26.3.1 PDB: 1waw_A* 1guv_A 1lg2_A 1lg1_A 1lq0_A 1hki_A* 1hkj_A* 1hkm_A* 1hkk_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query70
d1dqca_73 Tachycitin {Horseshoe crab (Tachypleus tridentatus 99.78
>d1dqca_ g.31.1.1 (A:) Tachycitin {Horseshoe crab (Tachypleus tridentatus) [TaxId: 6853]} Back     information, alignment and structure
class: Small proteins
fold: Invertebrate chitin-binding proteins
superfamily: Invertebrate chitin-binding proteins
family: Tachycitin
domain: Tachycitin
species: Horseshoe crab (Tachypleus tridentatus) [TaxId: 6853]
Probab=99.78  E-value=5.8e-20  Score=104.92  Aligned_cols=61  Identities=15%  Similarity=0.139  Sum_probs=53.2

Q ss_pred             CCcccCCCCCCCCCcccccCCCCCCCCccCCcccEEcCCCeeeEeeCCCCCeeecCCCcCCCCCCCCCC
Q psy11499          1 IVGFKCPDSVDPHSVAAKFWPYPRFPVPGDRERLITCVDGHPRLISCGDGKLFDEASLTCLGPEDYTDK   69 (70)
Q Consensus         1 ~~gf~CP~~~~~~~~~~~~~~~~~~p~p~dC~~yy~C~~G~~~~~~Cp~Gl~Fd~~~~~C~~p~~v~~C   69 (70)
                      ||||+|+....-       ...|+++||.||++||+|.+|++.+++||+||+||++++.|+||++| +|
T Consensus         1 ~~~~~c~~y~~C-------~~dG~~~~p~dC~~yy~C~~g~~~~~~Cp~Gl~Fn~~~~~Cd~p~~v-~C   61 (73)
T d1dqca_           1 YLAFRCGRYSPC-------LDDGPNVNLYSCCSFYNCHKCLARLENCPKGLHYNAYLKVCDWPSKA-GC   61 (73)
T ss_dssp             CCSCCSGGGCCS-------SSSEEECCSSCSSEEEEEETTEEEEEECTTSCEEETTTTEEECTTTT-TC
T ss_pred             CccccCCCcCCC-------CCCCccCCcccccEEEEeECCEEEEEECCCCCEECCCcCEECccccC-CC
Confidence            689999765422       12568999999999999999999999999999999999999999999 45