Psyllid ID: psy4061


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------12
MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWMSEHPTDPPDAAPAS
ccccccHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHccccHHHHHHHHHcccccccccccccccHHHHHHHHHHHccccccccccccccccccccc
ccccccHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHccccHHHHHHHHcccccccccHHHHHHccHHHEEEEEEcccccccccccccccccccccc
msrhpnyfGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSgipllerssdhkyrdnakyqyykkstsplipippsvyvevPKFLKFILCceyplydwmsehptdppdaapas
MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYykkstsplipipPSVYVEVPKFLKFILCCEYPLYDWMSEHPTDPPDAAPAS
MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWMSEHPTDPPDAAPAS
*****NYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWM**************
MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLER**********KYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLY*****************
MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWMSEHP**********
*SRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWMS*************
iiiiiiHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooo
iiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooo
iiiiiiHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWMSEHPTDPPDAAPAS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query119
322795657 313 hypothetical protein SINV_03336 [Solenop 0.915 0.348 0.770 1e-45
307204514 313 hypothetical protein EAI_05984 [Harpegna 0.915 0.348 0.752 5e-45
270004630175 hypothetical protein TcasGA2_TC003999 [T 0.907 0.617 0.768 7e-45
91080037210 PREDICTED: similar to AGAP005737-PA [Tri 0.899 0.509 0.775 9e-45
332027118 313 hypothetical protein G5I_04371 [Acromyrm 0.915 0.348 0.752 1e-44
383859053 313 PREDICTED: uncharacterized protein LOC10 0.915 0.348 0.743 4e-44
328784034 333 PREDICTED: hypothetical protein LOC40964 0.915 0.327 0.724 9e-43
242003778 311 conserved hypothetical protein [Pediculu 0.924 0.353 0.732 1e-42
156554867 313 PREDICTED: hypothetical protein LOC10012 0.915 0.348 0.706 3e-42
357617741 299 hypothetical protein KGM_16674 [Danaus p 0.865 0.344 0.747 1e-41
>gi|322795657|gb|EFZ18336.1| hypothetical protein SINV_03336 [Solenopsis invicta] Back     alignment and taxonomy information
 Score =  186 bits (473), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 84/109 (77%), Positives = 100/109 (91%)

Query: 1   MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKY 60
           +SRHPNYFGEIV+WWG+FIISLNVI GAE+V + SPIF TLIILFLSG+PLLE++SD +Y
Sbjct: 189 LSRHPNYFGEIVVWWGIFIISLNVIEGAEWVAIASPIFTTLIILFLSGMPLLEKASDERY 248

Query: 61  RDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWMSEHP 109
           RDNA+Y+YYK+STSPLIPIPPS+YVEVP+FLKFILCCE+PLYD +   P
Sbjct: 249 RDNAEYRYYKQSTSPLIPIPPSIYVEVPRFLKFILCCEFPLYDSLDVKP 297




Source: Solenopsis invicta

Species: Solenopsis invicta

Genus: Solenopsis

Family: Formicidae

Order: Hymenoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|307204514|gb|EFN83194.1| hypothetical protein EAI_05984 [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|270004630|gb|EFA01078.1| hypothetical protein TcasGA2_TC003999 [Tribolium castaneum] Back     alignment and taxonomy information
>gi|91080037|ref|XP_972620.1| PREDICTED: similar to AGAP005737-PA [Tribolium castaneum] Back     alignment and taxonomy information
>gi|332027118|gb|EGI67214.1| hypothetical protein G5I_04371 [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|383859053|ref|XP_003705012.1| PREDICTED: uncharacterized protein LOC100880271 [Megachile rotundata] Back     alignment and taxonomy information
>gi|328784034|ref|XP_623136.2| PREDICTED: hypothetical protein LOC409642 isoform 2 [Apis mellifera] gi|380030415|ref|XP_003698844.1| PREDICTED: uncharacterized protein LOC100865641 [Apis florea] Back     alignment and taxonomy information
>gi|242003778|ref|XP_002422856.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212505738|gb|EEB10118.1| conserved hypothetical protein [Pediculus humanus corporis] Back     alignment and taxonomy information
>gi|156554867|ref|XP_001606793.1| PREDICTED: hypothetical protein LOC100123185 [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|357617741|gb|EHJ70969.1| hypothetical protein KGM_16674 [Danaus plexippus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query119
FB|FBgn0035914316 CG6282 [Drosophila melanogaste 0.882 0.332 0.714 8.7e-40
TAIR|locus:2194090305 AT1G18180 [Arabidopsis thalian 0.848 0.331 0.524 2.4e-28
TAIR|locus:2027774302 AT1G73650 [Arabidopsis thalian 0.848 0.334 0.504 9.4e-27
UNIPROTKB|G4N6G0335 MGG_03731 "Uncharacterized pro 0.857 0.304 0.391 1.1e-16
UNIPROTKB|Q722V9257 LMOf2365_0620 "Putative membra 0.605 0.280 0.448 5.4e-15
UNIPROTKB|Q74AM9256 GSU2323 "Uncharacterized prote 0.621 0.289 0.462 5.4e-15
TIGR_CMR|GSU_2323256 GSU_2323 "membrane protein, pu 0.621 0.289 0.462 5.4e-15
ZFIN|ZDB-GENE-030131-3630288 si:ch211-210c8.6 "si:ch211-210 0.663 0.274 0.392 1.4e-13
DICTYBASE|DDB_G0279829267 DDB_G0279829 "DUF1295 family p 0.647 0.288 0.4 1.4e-12
UNIPROTKB|G4NC19328 MGG_00459 "Uncharacterized pro 0.453 0.164 0.407 3e-11
FB|FBgn0035914 CG6282 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 424 (154.3 bits), Expect = 8.7e-40, P = 8.7e-40
 Identities = 75/105 (71%), Positives = 90/105 (85%)

Query:     1 MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKY 60
             +SRHPNYFGE+V+WWG+F ISLNVI G E+V + SPIF T+IILFLSGIPL ERS+D KY
Sbjct:   192 VSRHPNYFGEVVIWWGIFAISLNVISGHEWVAIASPIFTTMIILFLSGIPLRERSADEKY 251

Query:    61 RDNAKYQYYKKSTSPLIPIPPSVYVEVPKFLKFILCCEYPLYDWM 105
             +D  +Y+ YK STSPLIP+PP+VYVEVP  LKFILCCE+P+YD M
Sbjct:   252 KDQLEYRKYKASTSPLIPVPPAVYVEVPSALKFILCCEFPIYDSM 296




GO:0016627 "oxidoreductase activity, acting on the CH-CH group of donors" evidence=IEA
GO:0005737 "cytoplasm" evidence=IEA
GO:0016021 "integral to membrane" evidence=IEA
GO:0006629 "lipid metabolic process" evidence=IEA
TAIR|locus:2194090 AT1G18180 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2027774 AT1G73650 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|G4N6G0 MGG_03731 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms
UNIPROTKB|Q722V9 LMOf2365_0620 "Putative membrane protein" [Listeria monocytogenes serotype 4b str. F2365 (taxid:265669)] Back     alignment and assigned GO terms
UNIPROTKB|Q74AM9 GSU2323 "Uncharacterized protein" [Geobacter sulfurreducens PCA (taxid:243231)] Back     alignment and assigned GO terms
TIGR_CMR|GSU_2323 GSU_2323 "membrane protein, putative" [Geobacter sulfurreducens PCA (taxid:243231)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-030131-3630 si:ch211-210c8.6 "si:ch211-210c8.6" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0279829 DDB_G0279829 "DUF1295 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|G4NC19 MGG_00459 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query119
pfam06966235 pfam06966, DUF1295, Protein of unknown function (D 1e-29
COG3752272 COG3752, COG3752, Steroid 5-alpha reductase family 7e-15
>gnl|CDD|219244 pfam06966, DUF1295, Protein of unknown function (DUF1295) Back     alignment and domain information
 Score =  105 bits (265), Expect = 1e-29
 Identities = 36/72 (50%), Positives = 54/72 (75%)

Query: 2   SRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYR 61
           SRHPNYFGE ++WWG+F+I++NV+ G E+  +  P+ +TL+++F+SGIPLLE S   KY 
Sbjct: 164 SRHPNYFGEALIWWGIFLIAINVLSGLEWWTIAGPLLMTLLLVFVSGIPLLEASMLKKYG 223

Query: 62  DNAKYQYYKKST 73
           D   Y+ Y++ T
Sbjct: 224 DREDYRAYQRRT 235


This family contains a number of bacterial and eukaryotic proteins of unknown function that are approximately 300 residues long. Length = 235

>gnl|CDD|226275 COG3752, COG3752, Steroid 5-alpha reductase family enzyme [General function prediction only] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 119
PF06966235 DUF1295: Protein of unknown function (DUF1295); In 99.88
COG3752272 Steroid 5-alpha reductase family enzyme [General f 99.83
KOG4650|consensus311 99.69
COG2020187 STE14 Putative protein-S-isoprenylcysteine methylt 99.62
PF01222432 ERG4_ERG24: Ergosterol biosynthesis ERG4/ERG24 fam 99.52
KOG1435|consensus428 99.33
KOG2628|consensus201 99.33
PF04191106 PEMT: Phospholipid methyltransferase ; InterPro: I 99.17
PF0414094 ICMT: Isoprenylcysteine carboxyl methyltransferase 98.68
PLN02392260 probable steroid reductase DET2 98.46
KOG1638|consensus257 98.26
PF02544150 Steroid_dh: 3-oxo-5-alpha-steroid 4-dehydrogenase 98.23
PLN02560308 enoyl-CoA reductase 97.98
PLN03164323 3-oxo-5-alpha-steroid 4-dehydrogenase, C-terminal 97.7
COG1755172 Uncharacterized protein conserved in bacteria [Fun 97.13
PF07298191 NnrU: NnrU protein; InterPro: IPR009915 This famil 96.98
COG4094219 Predicted membrane protein [Function unknown] 94.92
KOG1640|consensus304 94.66
KOG1639|consensus297 93.88
>PF06966 DUF1295: Protein of unknown function (DUF1295); InterPro: IPR010721 This family contains a number of bacterial and eukaryotic proteins of unknown function that are approximately 300 residues long Back     alignment and domain information
Probab=99.88  E-value=1.1e-22  Score=157.43  Aligned_cols=73  Identities=47%  Similarity=1.021  Sum_probs=67.3

Q ss_pred             CccchhHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHhcchHHHHHHHhhcCCCHHHHHHHhhc
Q psy4061           1 MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVLSPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKST   73 (119)
Q Consensus         1 ysRHPnY~Ge~l~~~g~~l~~~~~~~~~~w~~~i~pl~~~~ll~~~~~i~~~E~~~~~kyG~~~~Y~~Y~~~t   73 (119)
                      |||||||+||+++|+|+++++.+...+..++++++|+++++++++.+|+++.|+++.+|||++|+|++||++|
T Consensus       163 ~sRHPNYfGE~l~W~g~~~~a~~~~~~~~~~~~~~pl~~~~~l~~~sgip~~E~~~~~kyg~~~~Y~~Y~~~t  235 (235)
T PF06966_consen  163 YSRHPNYFGEILFWWGIYLAAISSGSGWLWWAIIGPLFMTLLLLFVSGIPLLEKRMAKKYGDRPAYQEYQRRT  235 (235)
T ss_pred             eeeCchHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHcCchHHHHHHHHhcCCCHhHHHHHhcC
Confidence            6999999999999999999998875555578899999999999999999999999999999999999999997



>COG3752 Steroid 5-alpha reductase family enzyme [General function prediction only] Back     alignment and domain information
>KOG4650|consensus Back     alignment and domain information
>COG2020 STE14 Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>PF01222 ERG4_ERG24: Ergosterol biosynthesis ERG4/ERG24 family; InterPro: IPR001171 The two fungal enzymes, C-14 sterol reductase (gene ERG24 in budding yeast and erg3 in Neurospora crassa) and C-24(28) sterol reductase (gene ERG4 in budding yeast and sts1 in fission yeast), are involved in ergosterol biosynthesis Back     alignment and domain information
>KOG1435|consensus Back     alignment and domain information
>KOG2628|consensus Back     alignment and domain information
>PF04191 PEMT: Phospholipid methyltransferase ; InterPro: IPR007318 The Saccharomyces cerevisiae (Baker's yeast) phospholipid methyltransferase (2 Back     alignment and domain information
>PF04140 ICMT: Isoprenylcysteine carboxyl methyltransferase (ICMT) family ; InterPro: IPR007269 The isoprenylcysteine o-methyltransferase (2 Back     alignment and domain information
>PLN02392 probable steroid reductase DET2 Back     alignment and domain information
>KOG1638|consensus Back     alignment and domain information
>PF02544 Steroid_dh: 3-oxo-5-alpha-steroid 4-dehydrogenase ; InterPro: IPR001104 Synonym(s): Steroid 5-alpha-reductase 3-oxo-5-alpha-steroid 4-dehydrogenases, 1 Back     alignment and domain information
>PLN02560 enoyl-CoA reductase Back     alignment and domain information
>PLN03164 3-oxo-5-alpha-steroid 4-dehydrogenase, C-terminal domain containing protein; Provisional Back     alignment and domain information
>COG1755 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PF07298 NnrU: NnrU protein; InterPro: IPR009915 This family consists of several plant and bacterial NnrU proteins Back     alignment and domain information
>COG4094 Predicted membrane protein [Function unknown] Back     alignment and domain information
>KOG1640|consensus Back     alignment and domain information
>KOG1639|consensus Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query119
4a2n_B194 Isoprenylcysteine carboxyl methyltransferase; memb 99.76
>4a2n_B Isoprenylcysteine carboxyl methyltransferase; membrane protein, RAS and RHO gtpases signallin; HET: SAH PLM CDL; 3.40A {Methanosarcina acetivorans} Back     alignment and structure
Probab=99.76  E-value=1.3e-18  Score=130.28  Aligned_cols=70  Identities=21%  Similarity=0.264  Sum_probs=54.3

Q ss_pred             CccchhHHHHHHHHHHHHHHHHhhhhhhHHHHHH-HHHHHHHHHHHHhcchHHHHHHHhhcCCCHHHHHHHhhcCCcccc
Q psy4061           1 MSRHPNYFGEIVLWWGMFIISLNVIRGAEFVCVL-SPIFITLIILFLSGIPLLERSSDHKYRDNAKYQYYKKSTSPLIPI   79 (119)
Q Consensus         1 ysRHPnY~Ge~l~~~g~~l~~~~~~~~~~w~~~i-~pl~~~~ll~~~~~i~~~E~~~~~kyG~~~~Y~~Y~~~t~~~IP~   79 (119)
                      +||||||+|++++++|+++...+      ++.++ ++++++++  +..+++.||+.+.++||  ++|++||++|+++||+
T Consensus       123 ~vRHP~Y~G~~l~~~g~~l~~~s------~~~~~~~~~~~~~~--~~~ri~~EE~~L~~~fG--~~Y~~Y~~rv~r~iP~  192 (194)
T 4a2n_B          123 NIRHPMYAHLWLWVITQGIILSN------WVVLIFGIVAWAIL--YFIRVPKEEELLIEEFG--DEYIEYMGKTGRLFPK  192 (194)
T ss_dssp             TBSSHHHHHHHHHHHHHHHHHTC------HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHT--HHHHHHHHHCBSSSCC
T ss_pred             hccCccHHHHHHHHHHHHHHhcc------HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhC--HHHHHHHHhCCeeCce
Confidence            69999999999999999998654      33322 23322222  33568999999999999  8999999999999998


Q ss_pred             C
Q psy4061          80 P   80 (119)
Q Consensus        80 ~   80 (119)
                      +
T Consensus       193 i  193 (194)
T 4a2n_B          193 V  193 (194)
T ss_dssp             -
T ss_pred             e
Confidence            5




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00