Psyllid ID: psy6159


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-
MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGERQPFREQTLSVEDKIRQERESNPRPSAYKADAQPTKLTIGCRLANFILTLPQ
ccEEEEEEEEEEEEccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccHHHHHHHHHHHHHcccccccccccccccEEEEEEEEcHHHccccc
cccEEEEEEEEcHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccEHHHHHHHHHHHcccccccccccccccEEEEEHHHHHHHHcccc
MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIynegrldrwigerqpfreqtLSVEDKIRQEresnprpsaykadaqptkltigcRLANFILTLPQ
MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGerqpfreqtlsvedkirqeresnprpsaykadaqptkltigCRLANFILTLPQ
MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGERQPFREQTLSVEDKIRQERESNPRPSAYKADAQPTKLTIGCRLANFILTLPQ
****KHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGE***********************************KLTIGCRLANFILT***
MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEG***************************************AQPTKLTIGCRLANFILTLP*
MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGERQPFREQTLSVEDKI****************AQPTKLTIGCRLANFILTLPQ
*AVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGERQPFREQTLSVEDKIRQERESNPRPSAYKADAQPTKLTIGCRLANFILTLPQ
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGERQPFREQTLSVEDKIRQERESNPRPSAYKADAQPTKLTIGCRLANFILTLPQ
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query101 2.2.26 [Sep-21-2011]
Q86BJ3514 Heparan sulfate 2-O-sulfo no N/A 0.405 0.079 0.536 2e-05
>sp|Q86BJ3|PIPE_DROME Heparan sulfate 2-O-sulfotransferase pipe OS=Drosophila melanogaster GN=pip PE=1 SV=1 Back     alignment and function desciption
 Score = 47.8 bits (112), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 28/41 (68%)

Query: 2   AVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGR 42
            VE  YAVVG  ED N+TL+V+E YIPR+F +A   Y  G+
Sbjct: 401 TVETEYAVVGTWEDTNITLSVLEAYIPRYFRNAKVAYYLGK 441




Sulfotransferase involved in dorsoventral axis patterning in early embryos. Required for the specific ventral activation of a series of proteases acting in the perivitelline space between the embryonic membrane and the eggshell. Probably acts by mediating the sulfation of some glycoprotein or glycosaminoglycan stably deposited in the egg, whose ventrally localized modification leads to spatially restricted activation of the protease cascade.
Drosophila melanogaster (taxid: 7227)
EC: 2EC: .EC: 8EC: .EC: 2EC: .EC: -

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query101
383860488 408 PREDICTED: heparan sulfate 2-O-sulfotran 0.376 0.093 0.763 2e-09
340716106 402 PREDICTED: LOW QUALITY PROTEIN: heparan 0.376 0.094 0.763 2e-09
350421797 398 PREDICTED: heparan sulfate 2-O-sulfotran 0.376 0.095 0.763 2e-09
380012956 400 PREDICTED: heparan sulfate 2-O-sulfotran 0.376 0.095 0.763 3e-09
328777404 400 PREDICTED: heparan sulfate 2-O-sulfotran 0.376 0.095 0.763 3e-09
307209570 310 Heparan sulfate 2-O-sulfotransferase pip 0.376 0.122 0.736 5e-09
332024386 333 Heparan sulfate 2-O-sulfotransferase pip 0.376 0.114 0.710 6e-09
307183100 360 Heparan sulfate 2-O-sulfotransferase pip 0.376 0.105 0.710 1e-08
242018996 296 Heparan sulfate 2-O-sulfotransferase pip 0.435 0.148 0.644 1e-08
322799531 326 hypothetical protein SINV_12448 [Solenop 0.376 0.116 0.710 2e-08
>gi|383860488|ref|XP_003705721.1| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Megachile rotundata] Back     alignment and taxonomy information
 Score = 66.6 bits (161), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 29/38 (76%), Positives = 34/38 (89%)

Query: 1   MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIY 38
           MAVEKHYAVVG+LED+N TLTV+ENYIPRFF  A+ +Y
Sbjct: 305 MAVEKHYAVVGVLEDVNATLTVLENYIPRFFQGATDVY 342




Source: Megachile rotundata

Species: Megachile rotundata

Genus: Megachile

Family: Megachilidae

Order: Hymenoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|340716106|ref|XP_003396543.1| PREDICTED: LOW QUALITY PROTEIN: heparan sulfate 2-O-sulfotransferase pipe-like [Bombus terrestris] Back     alignment and taxonomy information
>gi|350421797|ref|XP_003492960.1| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Bombus impatiens] Back     alignment and taxonomy information
>gi|380012956|ref|XP_003690538.1| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Apis florea] Back     alignment and taxonomy information
>gi|328777404|ref|XP_395133.3| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Apis mellifera] Back     alignment and taxonomy information
>gi|307209570|gb|EFN86485.1| Heparan sulfate 2-O-sulfotransferase pipe [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|332024386|gb|EGI64584.1| Heparan sulfate 2-O-sulfotransferase pipe [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|307183100|gb|EFN70017.1| Heparan sulfate 2-O-sulfotransferase pipe [Camponotus floridanus] Back     alignment and taxonomy information
>gi|242018996|ref|XP_002429954.1| Heparan sulfate 2-O-sulfotransferase pipe, putative [Pediculus humanus corporis] gi|212515002|gb|EEB17216.1| Heparan sulfate 2-O-sulfotransferase pipe, putative [Pediculus humanus corporis] Back     alignment and taxonomy information
>gi|322799531|gb|EFZ20839.1| hypothetical protein SINV_12448 [Solenopsis invicta] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query101
FB|FBgn0003089514 pip "pipe" [Drosophila melanog 0.415 0.081 0.558 8.4e-06
FB|FBgn0003089 pip "pipe" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 113 (44.8 bits), Expect = 8.4e-06, P = 8.4e-06
 Identities = 24/43 (55%), Positives = 30/43 (69%)

Query:     3 VEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDR 45
             VE  YAVVG  ED N+TL+V+E YIPR+F +A   Y  G+ DR
Sbjct:   402 VETEYAVVGTWEDTNITLSVLEAYIPRYFRNAKVAYYLGK-DR 443


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.135   0.396    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      101       101   0.00091  102 3  11 22  0.39    30
                                                     29  0.47    31


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  566 (60 KB)
  Total size of DFA:  113 KB (2075 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  11.92u 0.09s 12.01t   Elapsed:  00:00:21
  Total cpu time:  11.92u 0.09s 12.01t   Elapsed:  00:00:21
  Start:  Thu Aug 15 14:19:29 2013   End:  Thu Aug 15 14:19:50 2013


GO:0007313 "maternal specification of dorsal/ventral axis, oocyte, soma encoded" evidence=TAS
GO:0016485 "protein processing" evidence=IGI
GO:0008063 "Toll signaling pathway" evidence=TAS
GO:0004394 "heparan sulfate 2-O-sulfotransferase activity" evidence=ISS;NAS
GO:0009950 "dorsal/ventral axis specification" evidence=IGI;IMP;TAS
GO:0005794 "Golgi apparatus" evidence=IDA
GO:0005694 "chromosome" evidence=IDA
GO:0005634 "nucleus" evidence=IDA
GO:0048477 "oogenesis" evidence=NAS
GO:0016021 "integral to membrane" evidence=IEA
GO:0005515 "protein binding" evidence=IPI
GO:0008146 "sulfotransferase activity" evidence=IMP
GO:0040014 "regulation of multicellular organism growth" evidence=IMP
GO:0009880 "embryonic pattern specification" evidence=IMP
GO:0034483 "heparan sulfate sulfotransferase activity" evidence=IDA
GO:0034481 "chondroitin sulfotransferase activity" evidence=IDA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 101
KOG3922|consensus361 99.71
PF03567253 Sulfotransfer_2: Sulfotransferase family; InterPro 91.99
>KOG3922|consensus Back     alignment and domain information
Probab=99.71  E-value=2.5e-18  Score=141.14  Aligned_cols=75  Identities=23%  Similarity=0.391  Sum_probs=58.8

Q ss_pred             CcccceeeEEeeehhhhHHHHHHHHhhchhhhhHHHHHhcc-cccccccccccccCCCCCCCHHHHHHHHhC--CCcccc
Q psy6159           1 MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEG-RLDRWIGERQPFREQTLSVEDKIRQERESN--PRPSAY   77 (101)
Q Consensus         1 ~NVe~~Y~vVGilEeln~tL~VLE~~lPrfFkGA~~~Y~~~-~~~~~~~tkn~~Kp~~~~~Seev~~~lr~n--~t~E~~   77 (101)
                      .||+++|++|||+|||+++|.+||+++||||+||+++|.+. +.++.+ +++++||    ||+++.++++++  +++|..
T Consensus       246 ~nv~e~y~LVGvtEel~d~l~LLE~~lPrfFkGv~~iY~~~~~~~hlr-~t~~k~p----Ps~etvk~i~r~~~~~mE~e  320 (361)
T KOG3922|consen  246 FNVEEEYLLVGVTEELEDFLSLLERYLPRFFKGVREIYATSAKKSHLR-VTNKKKP----PSSETVKDIQRRMIYKMENE  320 (361)
T ss_pred             HHHhhhheeeeeHHHHHHHHHHHHHHhHHHHHHHHHHHhhhhhhhcee-eecccCC----CCHHHHHHHHHHHHHHHHHH
Confidence            48999999999999999999999999999999999999964 444333 3345565    777777777665  566655


Q ss_pred             ccc
Q psy6159          78 KAD   80 (101)
Q Consensus        78 ~~~   80 (101)
                      -||
T Consensus       321 fY~  323 (361)
T KOG3922|consen  321 FYD  323 (361)
T ss_pred             HHH
Confidence            554



>PF03567 Sulfotransfer_2: Sulfotransferase family; InterPro: IPR005331 This entry consists of a number of carbohydrate sulphotransferases that transfer sulphate to carbohydrate groups in glycoproteins and glycolipids Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query101
3f5f_A658 Maltose-binding periplasmic protein, heparan sulfa 3e-14
>3f5f_A Maltose-binding periplasmic protein, heparan sulfate 2-O-sulfotransferase 1; maltose binding protein, fusion, heparan sulfate biosynthesis; HET: GLC A3P; 2.65A {Escherichia coli k-12} Length = 658 Back     alignment and structure
 Score = 65.8 bits (160), Expect = 3e-14
 Identities = 14/69 (20%), Positives = 33/69 (47%), Gaps = 3/69 (4%)

Query: 3   VEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGERQPFREQTLSVED 62
           +   Y +VG+ E+L   + ++E  +PRFF  A+++Y  G+       R+   ++  +   
Sbjct: 546 LINEYFLVGVTEELEDFIMLLEAALPRFFRGATELYRTGKKSH---LRKTTEKKLPTAAT 602

Query: 63  KIRQERESN 71
             + ++   
Sbjct: 603 IAKLQQSEI 611


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query101
3f5f_A658 Maltose-binding periplasmic protein, heparan sulfa 99.39
>3f5f_A Maltose-binding periplasmic protein, heparan sulfate 2-O-sulfotransferase 1; maltose binding protein, fusion, heparan sulfate biosynthesis; HET: GLC A3P; 2.65A {Escherichia coli k-12} Back     alignment and structure
Probab=99.39  E-value=7.7e-14  Score=114.97  Aligned_cols=96  Identities=17%  Similarity=0.214  Sum_probs=68.2

Q ss_pred             CcccceeeEEeeehhhhHHHHHHHHhhchhhhhHHHHHhcccccccccccccccCCCCCCCHHHHHHHHhC--CCccccc
Q psy6159           1 MAVEKHYAVVGLLEDLNVTLTVMENYIPRFFAHASQIYNEGRLDRWIGERQPFREQTLSVEDKIRQERESN--PRPSAYK   78 (101)
Q Consensus         1 ~NVe~~Y~vVGilEeln~tL~VLE~~lPrfFkGA~~~Y~~~~~~~~~~tkn~~Kp~~~~~Seev~~~lr~n--~t~E~~~   78 (101)
                      +||+++|+|||+||++++||.+||+++|+||+||.++|.....+..+.++ . +.   +++++.++.++++  ...+.-=
T Consensus       544 ~nl~~~~~~vG~~E~~~~sl~lle~~~~~~f~~~~~~~~~~~~~~~~~~~-~-~~---~~~~~~~~~~~~~~~~~~D~~l  618 (658)
T 3f5f_A          544 YNLINEYFLVGVTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTT-E-KK---LPTAATIAKLQQSEIWKMENEF  618 (658)
T ss_dssp             HHHHHTEEEEEEGGGHHHHHHHHHHHCHHHHTTHHHHHHTCSTTSCSCBS-C-CC---CCCHHHHHHHTTSHHHHHHHHH
T ss_pred             HHHHhhhheeehHHhhHHHHHHHHHHHHHHHhhHHHHHHhcchHHHHhhh-c-CC---CCCHHHHHHHHhChHHHHHHHH
Confidence            47899999999999999999999999999999999999887655444433 2 34   5899999999877  3333322


Q ss_pred             cc-----cCccceeeeeeeehhhccCCC
Q psy6159          79 AD-----AQPTKLTIGCRLANFILTLPQ  101 (101)
Q Consensus        79 ~~-----~~~~~~~~~~~~~~~~~~~~~  101 (101)
                      ||     ++=-...+|-.-.+++-.+||
T Consensus       619 Y~~a~~~f~~~~~~~~~~~~~~~~~~~~  646 (658)
T 3f5f_A          619 YEFALEQFQFVRAHAVREKDGELYILAQ  646 (658)
T ss_dssp             HHHHHHHHHHHHHTTEEEETTEEEECCC
T ss_pred             HHHHHHHHHHHHHhhccCCCcccccchH
Confidence            22     222233445555555555554




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00