Psyllid ID: psy13383


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-----
MEWLFGRKITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQMTLSTERHVALLLEFVIIVGPENRENFENSDLRPQSNFFAVFTIGEVSLL
ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHccc
ccHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHEEEEEccccccccccccccccccEEEEEEEcEEEcc
mewlfgrkitpeEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQMTLSTERHVALLLEFVIIvgpenrenfensdlrpqsnfFAVFTIGEVSLL
mewlfgrkitpeemlrkNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQMTLSTERHVALLLEFVIIVGPENRENfensdlrpqsnffavftigevsll
MEWLFGRKITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQMTLSTERHVALLLEFVIIVGPENRENFENSDLRPQSNFFAVFTIGEVSLL
*********************************************************LSTERHVALLLEFVIIVGPE*************SNFFAVFTIGE****
**W*****************************************************TLSTERHVALLLEFVII***********************FTIGEVSLL
MEWLFGRKITPEEMLRKNQRALNKAMRDL***********KKLIADIKKMAKEGQMTLSTERHVALLLEFVIIVGPENRENFENSDLRPQSNFFAVFTIGEVSLL
*EW*FGRKITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQMTLSTERHVALLLEFVIIVGPENR*NFENSDLRPQSNFFAVFTIGEVSLL
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEWLFGRKITPxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxGQMTLSTERHVALLLEFVIIVGPENRENFENSDLRPQSNFFAVFTIGEVSLL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query105 2.2.26 [Sep-21-2011]
Q7ZW25 220 Charged multivesicular bo yes N/A 0.533 0.254 0.821 1e-20
Q6DFS6 220 Charged multivesicular bo yes N/A 0.533 0.254 0.803 3e-20
Q6IP52 220 Charged multivesicular bo N/A N/A 0.533 0.254 0.785 7e-20
O43633 222 Charged multivesicular bo yes N/A 0.533 0.252 0.75 1e-18
Q9DB34 222 Charged multivesicular bo yes N/A 0.533 0.252 0.75 2e-18
Q54DB1198 Charged multivesicular bo yes N/A 0.514 0.272 0.629 1e-13
P36108 232 DOA4-independent degradat yes N/A 0.552 0.25 0.534 2e-12
Q5ZHN1 220 Charged multivesicular bo yes N/A 0.533 0.254 0.714 5e-12
Q9SKI2 225 Vacuolar protein sorting- yes N/A 0.533 0.248 0.535 4e-11
O14177 210 ESCRT-III complex subunit yes N/A 0.495 0.247 0.5 4e-08
>sp|Q7ZW25|CHM2A_DANRE Charged multivesicular body protein 2a OS=Danio rerio GN=chmp2a PE=2 SV=1 Back     alignment and function desciption
 Score = 98.6 bits (244), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 46/56 (82%), Positives = 53/56 (94%)

Query: 1  MEWLFGRKITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQM 56
          ME LFGR+ TPEEMLR+NQRALN+AMRDLDRE+Q +EQQEKK+IADIKKMAK+GQM
Sbjct: 1  MESLFGRRKTPEEMLRQNQRALNRAMRDLDRERQRLEQQEKKIIADIKKMAKQGQM 56




Probable core component of the endosomal sorting required for transport complex III (ESCRT-III) which is involved in multivesicular bodies (MVBs) formation and sorting of endosomal cargo proteins into MVBs. MVBs contain intraluminal vesicles (ILVs) that are generated by invagination and scission from the limiting membrane of the endosome and mostly are delivered to lysosomes enabling degradation of membrane proteins, such as stimulated growth factor receptors, lysosomal enzymes and lipids.
Danio rerio (taxid: 7955)
>sp|Q6DFS6|CHM2A_XENTR Charged multivesicular body protein 2a OS=Xenopus tropicalis GN=chmp2a PE=2 SV=1 Back     alignment and function description
>sp|Q6IP52|CHM2A_XENLA Charged multivesicular body protein 2a OS=Xenopus laevis GN=chmp2a PE=2 SV=1 Back     alignment and function description
>sp|O43633|CHM2A_HUMAN Charged multivesicular body protein 2a OS=Homo sapiens GN=CHMP2A PE=1 SV=1 Back     alignment and function description
>sp|Q9DB34|CHM2A_MOUSE Charged multivesicular body protein 2a OS=Mus musculus GN=Chmp2a PE=1 SV=1 Back     alignment and function description
>sp|Q54DB1|CM2A2_DICDI Charged multivesicular body protein 2a homolog 2 OS=Dictyostelium discoideum GN=chmp2a2 PE=3 SV=1 Back     alignment and function description
>sp|P36108|DID4_YEAST DOA4-independent degradation protein 4 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=DID4 PE=1 SV=2 Back     alignment and function description
>sp|Q5ZHN1|CHM2A_CHICK Charged multivesicular body protein 2a OS=Gallus gallus GN=CHMP2A PE=2 SV=1 Back     alignment and function description
>sp|Q9SKI2|VPS2A_ARATH Vacuolar protein sorting-associated protein 2 homolog 1 OS=Arabidopsis thaliana GN=VPS2.1 PE=1 SV=2 Back     alignment and function description
>sp|O14177|DID4_SCHPO ESCRT-III complex subunit did4 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=did4 PE=3 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query105
322790209 228 hypothetical protein SINV_04238 [Solenop 0.533 0.245 0.892 1e-21
289741155 243 vacuolar assembly/sorting protein DID4 [ 0.533 0.230 0.875 3e-21
307175574 227 Charged multivesicular body protein 2a [ 0.533 0.246 0.875 4e-21
332019427 228 Charged multivesicular body protein 2a [ 0.533 0.245 0.875 4e-21
66509926 227 PREDICTED: charged multivesicular body p 0.533 0.246 0.857 6e-21
380025498 227 PREDICTED: charged multivesicular body p 0.533 0.246 0.857 6e-21
383857259 227 PREDICTED: charged multivesicular body p 0.533 0.246 0.857 6e-21
195454116 251 GK14462 [Drosophila willistoni] gi|19417 0.533 0.223 0.857 6e-21
195349629 256 GM10303 [Drosophila sechellia] gi|194123 0.533 0.218 0.857 7e-21
195151933 256 GL21825 [Drosophila persimilis] gi|19411 0.533 0.218 0.857 7e-21
>gi|322790209|gb|EFZ15208.1| hypothetical protein SINV_04238 [Solenopsis invicta] Back     alignment and taxonomy information
 Score =  106 bits (265), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/56 (89%), Positives = 54/56 (96%)

Query: 1  MEWLFGRKITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQM 56
          MEWLFG++ITPEEMLRKNQRALNKAMRDLDREK  MEQQEKK+IADIKK+AKEGQM
Sbjct: 1  MEWLFGKRITPEEMLRKNQRALNKAMRDLDREKMRMEQQEKKIIADIKKLAKEGQM 56




Source: Solenopsis invicta

Species: Solenopsis invicta

Genus: Solenopsis

Family: Formicidae

Order: Hymenoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|289741155|gb|ADD19325.1| vacuolar assembly/sorting protein DID4 [Glossina morsitans morsitans] Back     alignment and taxonomy information
>gi|307175574|gb|EFN65486.1| Charged multivesicular body protein 2a [Camponotus floridanus] Back     alignment and taxonomy information
>gi|332019427|gb|EGI59911.1| Charged multivesicular body protein 2a [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|66509926|ref|XP_625164.1| PREDICTED: charged multivesicular body protein 2a-like [Apis mellifera] Back     alignment and taxonomy information
>gi|380025498|ref|XP_003696510.1| PREDICTED: charged multivesicular body protein 2a-like [Apis florea] Back     alignment and taxonomy information
>gi|383857259|ref|XP_003704122.1| PREDICTED: charged multivesicular body protein 2a-like [Megachile rotundata] Back     alignment and taxonomy information
>gi|195454116|ref|XP_002074094.1| GK14462 [Drosophila willistoni] gi|194170179|gb|EDW85080.1| GK14462 [Drosophila willistoni] Back     alignment and taxonomy information
>gi|195349629|ref|XP_002041345.1| GM10303 [Drosophila sechellia] gi|194123040|gb|EDW45083.1| GM10303 [Drosophila sechellia] Back     alignment and taxonomy information
>gi|195151933|ref|XP_002016893.1| GL21825 [Drosophila persimilis] gi|194111950|gb|EDW33993.1| GL21825 [Drosophila persimilis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query105
FB|FBgn0039402 256 vps2 "vps2" [Drosophila melano 0.533 0.218 0.857 7e-22
ZFIN|ZDB-GENE-040426-2922 220 bc2 "putative breast adenocarc 0.533 0.254 0.821 5.7e-20
UNIPROTKB|Q3T0U5 222 CHMP2A "Chromatin modifying pr 0.533 0.252 0.75 2.8e-18
UNIPROTKB|E2RIU2 222 CHMP2A "Uncharacterized protei 0.533 0.252 0.75 2.8e-18
UNIPROTKB|O43633 222 CHMP2A "Charged multivesicular 0.533 0.252 0.75 2.8e-18
MGI|MGI:1916203 222 Chmp2a "charged multivesicular 0.533 0.252 0.75 2.8e-18
RGD|1305050 222 Chmp2a "charged multivesicular 0.533 0.252 0.75 2.8e-18
UNIPROTKB|Q5ZHN1 220 CHMP2A "Charged multivesicular 0.533 0.254 0.714 4.1e-17
WB|WBGene00012903 237 vps-2 [Caenorhabditis elegans 0.533 0.236 0.696 2.9e-16
CGD|CAL0003689 235 VPS2 [Candida albicans (taxid: 0.533 0.238 0.642 3.3e-15
FB|FBgn0039402 vps2 "vps2" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 255 (94.8 bits), Expect = 7.0e-22, P = 7.0e-22
 Identities = 48/56 (85%), Positives = 54/56 (96%)

Query:     1 MEWLFGRKITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQM 56
             M+WLFG+KI+P+EMLRKNQRALNKAMRDLDRE+  MEQQEKK+IADIKKMAKEGQM
Sbjct:     1 MDWLFGKKISPDEMLRKNQRALNKAMRDLDRERMKMEQQEKKIIADIKKMAKEGQM 56




GO:0015031 "protein transport" evidence=IEA
ZFIN|ZDB-GENE-040426-2922 bc2 "putative breast adenocarcinoma marker" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|Q3T0U5 CHMP2A "Chromatin modifying protein 2A" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E2RIU2 CHMP2A "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|O43633 CHMP2A "Charged multivesicular body protein 2a" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1916203 Chmp2a "charged multivesicular body protein 2A" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
RGD|1305050 Chmp2a "charged multivesicular body protein 2A" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q5ZHN1 CHMP2A "Charged multivesicular body protein 2a" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
WB|WBGene00012903 vps-2 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
CGD|CAL0003689 VPS2 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q6DFS6CHM2A_XENTRNo assigned EC number0.80350.53330.2545yesN/A
Q9DB34CHM2A_MOUSENo assigned EC number0.750.53330.2522yesN/A
P36108DID4_YEASTNo assigned EC number0.53440.55230.25yesN/A
O43633CHM2A_HUMANNo assigned EC number0.750.53330.2522yesN/A
Q5ZHN1CHM2A_CHICKNo assigned EC number0.71420.53330.2545yesN/A
Q9SKI2VPS2A_ARATHNo assigned EC number0.53570.53330.2488yesN/A
Q7ZW25CHM2A_DANRENo assigned EC number0.82140.53330.2545yesN/A
Q54DB1CM2A2_DICDINo assigned EC number0.62960.51420.2727yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query105
pfam03357169 pfam03357, Snf7, Snf7 1e-06
>gnl|CDD|146145 pfam03357, Snf7, Snf7 Back     alignment and domain information
 Score = 43.8 bits (104), Expect = 1e-06
 Identities = 17/40 (42%), Positives = 31/40 (77%)

Query: 17 KNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQM 56
          +   +L KA+R+LD++++ +E++ KKL A+IKK+AK+G  
Sbjct: 1  EAILSLRKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNK 40


This family of proteins are involved in protein sorting and transport from the endosome to the vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes play an important role in the degradation of both lipids and cellular proteins. In order to perform this degradative function, vacuoles/lysosomes contain numerous hydrolases which have been transported in the form of inactive precursors via the biosynthetic pathway and are proteolytically activated upon delivery to the vacuole/lysosome. The delivery of transmembrane proteins, such as activated cell surface receptors to the lumen of the vacuole/lysosome, either for degradation/downregulation, or in the case of hydrolases, for proper localisation, requires the formation of multivesicular bodies (MVBs). These late endosomal structures are formed by invaginating and budding of the limiting membrane into the lumen of the compartment. During this process, a subset of the endosomal membrane proteins is sorted into the forming vesicles. Mature MVBs fuse with the vacuole/lysosome, thereby releasing cargo containing vesicles into its hydrolytic lumen for degradation. Endosomal proteins that are not sorted into the intralumenal MVB vesicles are either recycled back to the plasma membrane or Golgi complex, or remain in the limiting membrane of the MVB and are thereby transported to the limiting membrane of the vacuole/lysosome as a consequence of fusion. Therefore, the MVB sorting pathway plays a critical role in the decision between recycling and degradation of membrane proteins. A few archaeal sequences are also present within this family. Length = 169


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query105
2gd5_A179 Charged multivesicular BODY protein 3; CHMP3, ESCR 5e-10
3frt_A 218 Charged multivesicular BODY protein 3; ESCRT, ESCR 1e-09
>2gd5_A Charged multivesicular BODY protein 3; CHMP3, ESCRT-III, protein transport; 2.80A {Homo sapiens} PDB: 3frv_A Length = 179 Back     alignment and structure
 Score = 52.4 bits (125), Expect = 5e-10
 Identities = 14/49 (28%), Positives = 31/49 (63%)

Query: 8  KITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQM 56
          +  P+E++ +    + K MR +DR+ + ++++E+K+   +K  AK+GQ 
Sbjct: 5  EKPPKELVNEWSLKIRKEMRVVDRQIRDIQREEEKVKRSVKDAAKKGQK 53


>3frt_A Charged multivesicular BODY protein 3; ESCRT, ESCRT-111, CHMP, IST1, coiled coil, cytoplasm, lipoprotein, membrane, myristate, phosphoprotein; 4.00A {Homo sapiens} Length = 218 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query105
3frt_A 218 Charged multivesicular BODY protein 3; ESCRT, ESCR 99.9
2gd5_A179 Charged multivesicular BODY protein 3; CHMP3, ESCR 99.86
4dci_A150 Uncharacterized protein; PSI-biology, midwest cent 81.11
>3frt_A Charged multivesicular BODY protein 3; ESCRT, ESCRT-111, CHMP, IST1, coiled coil, cytoplasm, lipoprotein, membrane, myristate, phosphoprotein; 4.00A {Homo sapiens} Back     alignment and structure
Probab=99.90  E-value=2.4e-24  Score=165.29  Aligned_cols=81  Identities=20%  Similarity=0.302  Sum_probs=75.0

Q ss_pred             CCCCHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhcCccchhHhHHHHHHHHHHHhchhhhhhhhccc
Q psy13383          7 RKITPEEMLRKNQRALNKAMRDLDREKQHMEQQEKKLIADIKKMAKEGQMTLSTERHVALLLEFVIIVGPENRENFENSD   86 (105)
Q Consensus         7 kk~tPkEqlRe~kr~LRk~~R~LDRei~~LereEkKl~~~IKk~AKkgd~~~aki~~~~LAKElVr~rK~~~Rl~~~ks~   86 (105)
                      ++|||+|++|+|+|.||+++|+|||++++|+++|++++.+||++||+||+++|||    ||++|||+|+++.|+|.++|+
T Consensus         4 ~~~~p~e~~r~~~r~Lr~~~R~LdR~~~kle~eEkk~~~~IKkaakkg~~~~ari----lAkelVR~Rk~~~rl~~~kaq   79 (218)
T 3frt_A            4 QEKPPKELVNEWSLKIRKEMRVVDRQIRDIQREEEKVKRSVKDAAKKGQKDVCIV----LAKEMIRSRKAVSKLYASKAH   79 (218)
T ss_dssp             ----CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTCHHHHHH----HHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHH----HHHHHHHHHHHHHHHHHHHHH
Confidence            4689999999999999999999999999999999999999999999999999999    999999999999999999999


Q ss_pred             ccccc
Q psy13383         87 LRPQS   91 (105)
Q Consensus        87 l~~q~   91 (105)
                      |++-+
T Consensus        80 L~sV~   84 (218)
T 3frt_A           80 MNSVL   84 (218)
T ss_dssp             HHHHH
T ss_pred             HHHHH
Confidence            98643



>2gd5_A Charged multivesicular BODY protein 3; CHMP3, ESCRT-III, protein transport; 2.80A {Homo sapiens} PDB: 3frv_A Back     alignment and structure
>4dci_A Uncharacterized protein; PSI-biology, midwest center for structural genomics, MCSG, S genomics, unknown function; 2.82A {Synechococcus SP} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00