Psyllid ID: psy3908


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170
MATDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPKSEVTLGE
ccccccEEEEEEEEccccccHHHHHHcccccEEEEEEcccccccccccccccccccEEEEEEcccEEEEEEEEEEEccccEEEEcccEEEEcccccEEEEEEccccccccccHHHHHHHHHccccEEEEEEEEEEEEEEEcccccccccccEEEEcccEEEEEccccccc
cccccEEEEEEEcccccccHHHHHHHccccccEEEEEEccccccccccccccccccEEEEEcccccEEEEEEEEEEccccEEcccccEEEcccccccEEEEEcccccccccHHHHHHHHHccccccEEEEcEEEEEEEcccccccEcccccEEEEccEEEEEcccEEccc
MATDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKgqygnlgvptlpspglqsTVLLVSAGKRQKIIAQAVKLKEgrrfvgvgsridipenypgYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKydeegnlqsdgarivnigevllpksevtlge
matdgqillatakwsdegqYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQavklkegrrfvgvgsridipenyPGYFELLSEEGKSMRCCETVLEisrrkieagCLIREAIKGIVAKYdeegnlqsdgarivnigevllpksevtlge
MATDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPKSEVTLGE
******ILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLL*********
******I*LATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVA*********SDGARIVNIGEVLLPKSEVTL**
MATDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPKSEVTLGE
**TDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPKSEVTLG*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MATDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPKSEVTLGE
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query170
357618418 626 hypothetical protein KGM_04871 [Danaus p 0.994 0.269 0.674 5e-62
242015432 641 conserved hypothetical protein [Pediculu 0.988 0.262 0.648 2e-60
91085635 583 PREDICTED: similar to serrano protein [T 0.964 0.281 0.666 3e-57
328788501 644 PREDICTED: hypothetical protein LOC10057 0.982 0.259 0.601 2e-51
380025319 629 PREDICTED: uncharacterized protein LOC10 0.982 0.265 0.595 6e-51
307209197 631 hypothetical protein EAI_15572 [Harpegna 0.982 0.264 0.589 1e-50
322784888 519 hypothetical protein SINV_13076 [Solenop 0.947 0.310 0.597 1e-50
332026051 621 hypothetical protein G5I_05320 [Acromyrm 0.982 0.268 0.589 1e-50
345489915 643 PREDICTED: hypothetical protein LOC10011 0.982 0.259 0.580 3e-50
340714385 625 PREDICTED: hypothetical protein LOC10064 0.982 0.267 0.583 8e-50
>gi|357618418|gb|EHJ71403.1| hypothetical protein KGM_04871 [Danaus plexippus] Back     alignment and taxonomy information
 Score =  242 bits (617), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 114/169 (67%), Positives = 140/169 (82%)

Query: 2   ATDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLV 61
           ATDGQ+++A ++WSDEG YL+EFV KN LP V KIIKGQYG LGVPTLPSPGLQST LLV
Sbjct: 3   ATDGQVVVAASRWSDEGHYLKEFVVKNRLPNVAKIIKGQYGGLGVPTLPSPGLQSTALLV 62

Query: 62  SAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRR 121
           SAGK++KIIAQA+KLKEGRR V VG RI IPE Y GYFELLSEEG+++RC E+V EI+RR
Sbjct: 63  SAGKKKKIIAQAIKLKEGRRMVSVGPRIAIPETYKGYFELLSEEGRAVRCMESVSEIARR 122

Query: 122 KIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPKSEVTLGE 170
           K+E GCL+RE I+ I AK D  G++ +DG+R +  GEV++P+ EV LG+
Sbjct: 123 KLEDGCLVREPIRIICAKTDLNGDITADGSRTLPAGEVIMPRGEVFLGK 171




Source: Danaus plexippus

Species: Danaus plexippus

Genus: Danaus

Family: Nymphalidae

Order: Lepidoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|242015432|ref|XP_002428357.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212512969|gb|EEB15619.1| conserved hypothetical protein [Pediculus humanus corporis] Back     alignment and taxonomy information
>gi|91085635|ref|XP_970327.1| PREDICTED: similar to serrano protein [Tribolium castaneum] gi|270010087|gb|EFA06535.1| hypothetical protein TcasGA2_TC009439 [Tribolium castaneum] Back     alignment and taxonomy information
>gi|328788501|ref|XP_003251141.1| PREDICTED: hypothetical protein LOC100578865 [Apis mellifera] Back     alignment and taxonomy information
>gi|380025319|ref|XP_003696423.1| PREDICTED: uncharacterized protein LOC100871586 [Apis florea] Back     alignment and taxonomy information
>gi|307209197|gb|EFN86304.1| hypothetical protein EAI_15572 [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|322784888|gb|EFZ11668.1| hypothetical protein SINV_13076 [Solenopsis invicta] Back     alignment and taxonomy information
>gi|332026051|gb|EGI66202.1| hypothetical protein G5I_05320 [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|345489915|ref|XP_001600158.2| PREDICTED: hypothetical protein LOC100115432 [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|340714385|ref|XP_003395709.1| PREDICTED: hypothetical protein LOC100648821 [Bombus terrestris] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query170
FB|FBgn0034408 778 sano "serrano" [Drosophila mel 0.770 0.168 0.567 3e-33
FB|FBgn0034408 sano "serrano" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 371 (135.7 bits), Expect = 3.0e-33, P = 3.0e-33
 Identities = 76/134 (56%), Positives = 101/134 (75%)

Query:     2 ATDGQILLATAKWSDEGQ-YLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLL 60
             ATDGQI+LA +++ D    YLR+F SK  LP V KIIKGQ+  LGVPTL +P LQST L 
Sbjct:     3 ATDGQIILAASRFGDATPVYLRDF-SKQPLPAVAKIIKGQHQALGVPTLSAPSLQSTALF 61

Query:    61 VSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISR 120
             +SAGK+ +I+AQ +K+KEGR+   VG+++ IPE Y GYFELLSE+G+S RC ++VLE+SR
Sbjct:    62 LSAGKKYQILAQPIKIKEGRKPTNVGAKVLIPETYGGYFELLSEDGRSTRCIDSVLELSR 121

Query:   121 RKIEAGCLIREAIK 134
             R+  A  L+RE  +
Sbjct:   122 RR-NARVLVRETFR 134


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.136   0.383    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      170       170   0.00086  108 3  11 22  0.44    32
                                                     31  0.47    34


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  558 (59 KB)
  Total size of DFA:  129 KB (2082 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  16.57u 0.09s 16.66t   Elapsed:  00:00:04
  Total cpu time:  16.57u 0.09s 16.66t   Elapsed:  00:00:04
  Start:  Thu Aug 15 15:48:33 2013   End:  Thu Aug 15 15:48:37 2013


GO:0003674 "molecular_function" evidence=ND
GO:0042067 "establishment of ommatidial planar polarity" evidence=IDA
GO:0035159 "regulation of tube length, open tracheal system" evidence=IMP
GO:0045179 "apical cortex" evidence=IDA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query170
pfam12736 251 pfam12736, CABIT, Cell-cycle sustaining, positive 1e-13
>gnl|CDD|221744 pfam12736, CABIT, Cell-cycle sustaining, positive selection, Back     alignment and domain information
 Score = 65.9 bits (161), Expect = 1e-13
 Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 16/131 (12%)

Query: 30  LPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRI 89
           LP VVK+  G+Y   GV  L        +LL S  + +K++A+ V+ + GR    VG ++
Sbjct: 1   LPQVVKVTSGEYFEGGVYLLKGQ----ELLLHSGDRIKKVVARCVEREGGR----VGPKL 52

Query: 90  DIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSD 149
            IP +YPG F+LL  +    R  E+V E+  R      L+ E I+            Q  
Sbjct: 53  LIPLSYPGKFKLLPRD----RSFESVEELLARVFPGRVLVTEDIRTSSDLSSSSDVDQ-- 106

Query: 150 GARIVNIGEVL 160
              +V  G+ L
Sbjct: 107 --ILVPAGDEL 115


The 'CABIT' domain (for 'cysteine-containing, all- in Themis') is found in a newly identified gene family that has three mammalian homologues (Themis, Icb1 and 9130404H23Rik) that encode proteins with two CABIT domains and a highly conserved proline-rich region. In contrast, Fam59A, Fam59B and related proteins from mammals to cnidarians, including the insect Serrano proteins, have a single copy of the CABIT domain, a proline-rich region and often a C-terminal SAM (sterile-motif) domain. Multiple-sequence alignment has predicted that the CABIT domain adopts an all-strand structure with at least 12 strands, ie a dyad of six-stranded beta-barrel units. The CABIT domain contains a nearly absolutely conserved cysteine residue which is likely to be central to its function. CABIT domain proteins function downstream of tyrosine kinase signalling and interact with GRB2. Length = 251

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 170
PF12736 256 CABIT: Cell-cycle sustaining, positive selection, 99.88
>PF12736 CABIT: Cell-cycle sustaining, positive selection, Back     alignment and domain information
Probab=99.88  E-value=1.9e-22  Score=165.12  Aligned_cols=121  Identities=36%  Similarity=0.459  Sum_probs=110.2

Q ss_pred             CCeeEEeccCcccccCccCCCCCCcceeEeeeeccceeeeeeeehhhhcCceeccccceeeeCCCCCceeEEeccCCcee
Q psy3908          30 LPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSM  109 (170)
Q Consensus        30 LP~vvri~~Gqy~~lGvp~l~~~slq~~~Ll~sa~~r~~i~aqa~kl~Egr~v~~IGPkI~IP~~Y~G~F~LLsed~~~v  109 (170)
                      ||+|++|..|+|...+++.|+    .+.++||.+.+.++|+|++++-++|+.   +||++.||++|+|||+++..++   
T Consensus         1 lP~vv~V~~g~y~~~~~~~l~----G~~l~l~~~~~~k~i~a~~~~~~~~~~---~g~~~~IP~~y~G~F~~~~~~~---   70 (256)
T PF12736_consen    1 LPRVVKVQSGVYFQGSVYELS----GQELCLHTGDQIKKILAQSVESEKGRK---VGPHFLIPLNYPGKFKRLPRPG---   70 (256)
T ss_pred             CCeEEEEecCCCCCCccccCC----CCEEEEEecCEeeEEEEEEEeccCccc---cCCEEEECCcccceEEECCCcc---
Confidence            799999999999998888777    566899999888999999999888887   8999999999999999999988   


Q ss_pred             eeeehHHHH-HhhccCcceEEeeeeeeeeeeecCCCCeeccceeeEecCceeeec
Q psy3908         110 RCCETVLEI-SRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPK  163 (170)
Q Consensus       110 R~fesV~El-a~~fp~~rvlv~E~Ir~~~~~~~ed~~~~~~~~~Tl~aGe~L~l~  163 (170)
                       +|+||+|| ++.+| .+|++++++++.. ..+++++....+++++++||.|+++
T Consensus        71 -~f~tV~eL~~~~~p-~~v~~~~~~~~~~-~~~~~~~~~~~~~~~v~~Ge~L~l~  122 (256)
T PF12736_consen   71 -PFETVYELLSRGFP-LRVVVTKDITFSS-ASGENSSVSDYGKRTVPAGEVLRLL  122 (256)
T ss_pred             -ccCCHHHHhhccCC-CEEEEcCccceec-ccccccccccCCceEECCCCEEEEE
Confidence             99999999 99999 9999999999987 5566666777889999999999999




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00