Psyllid ID: psy12800


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130----
IGRESNESLESSGKKSRRPSFISGRSFASASRILNQHFPNLQLTKSKSGDTSKHTLSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGEIQLD
ccccccccccccccccccccccccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccEEEEEccccEEEEEEEcccccHHHHcccccccccccccEEEEEEccHHHHHccccccccc
ccccccccccccccccccccccccHHHHHHHHHHHHHcccccccEcccccccccccccccccccccccccccEEEEEEEccccccEEEEcccccccEEEccccccccccccccEEEEEccHEEEEEEcccEEcc
igresneslessgkksrrpsfisgrSFASASRILnqhfpnlqltksksgdtskhtlsadsldskgqspemhrrsfvgfnagngtrsyeyypysqrstIRDLVgrgwangfpgrhifridENIMLELTLGEIQLD
igresneslessgkksrrpsfiSGRSFASASRILNQHFPnlqltksksgdtskHTLSadsldskgqspemhrrsfvgfnagngtrsyeyypYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGEIQLD
IGRESNESLESSGKKSRRPSFISGRSFASASRILNQHFPNLQLTKSKSGDTSKHTLSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGEIQLD
**************************************************************************FVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGE****
*************************SFASASRILNQHFPNL*********************************FVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGEIQL*
*********************ISGRSFASASRILNQHFPNLQLTK************************MHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGEIQLD
**********************SGRSFASASRILNQHFPNLQLTK*********************SPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGEIQLD
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
IGRESNESLESSGKKSRRPSFISGRSFASASRILNQHFPNLQLTKSKSGDTSKHTLSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDENIMLELTLGEIQLD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query134 2.2.26 [Sep-21-2011]
Q0KHY3 1431 Extracellular domains-con no N/A 0.462 0.043 0.597 9e-19
>sp|Q0KHY3|Y1004_DROME Extracellular domains-containing protein CG31004 OS=Drosophila melanogaster GN=CG31004 PE=1 SV=1 Back     alignment and function desciption
 Score = 92.0 bits (227), Expect = 9e-19,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 5/67 (7%)

Query: 58  ADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFR 117
            D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPGRHIFR
Sbjct: 367 GDTTKGEGGVP-----AYVGFNAGNGTQAYEYNPYSQNMVIRDLANRGWANGFPGRHIFR 421

Query: 118 IDENIML 124
           +DE I++
Sbjct: 422 VDEQILI 428





Drosophila melanogaster (taxid: 7227)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query134
350417175 1267 PREDICTED: extracellular domains-contain 0.559 0.059 0.629 7e-20
340715043 1267 PREDICTED: extracellular domains-contain 0.559 0.059 0.629 7e-20
383855804 1397 PREDICTED: extracellular domains-contain 0.373 0.035 0.86 4e-19
332019409 1406 Extracellular domains-containing protein 0.380 0.036 0.843 4e-19
345482776 1358 PREDICTED: extracellular domains-contain 0.380 0.037 0.823 5e-19
357623997 1477 hypothetical protein KGM_05469 [Danaus p 0.380 0.034 0.803 6e-19
380020283 1377 PREDICTED: extracellular domains-contain 0.507 0.049 0.630 2e-18
307174209 1395 Uncharacterized protein K03H1.5 [Campono 0.380 0.036 0.823 2e-18
328788914 1265 PREDICTED: extracellular domains-contain 0.507 0.053 0.616 3e-18
157109935 1427 hypothetical protein AaeL_AAEL005432 [Ae 0.380 0.035 0.784 6e-18
>gi|350417175|ref|XP_003491293.1| PREDICTED: extracellular domains-containing protein CG31004-like [Bombus impatiens] Back     alignment and taxonomy information
 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 60/81 (74%), Gaps = 6/81 (7%)

Query: 52  SKHTLSA-DSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGF 110
           S HT +  D++  +G  P     +FVGFNAGNGTRSYEY P+SQ STIRDL GRGWANGF
Sbjct: 366 SSHTEAGGDTMYGEGGVP-----AFVGFNAGNGTRSYEYKPFSQMSTIRDLTGRGWANGF 420

Query: 111 PGRHIFRIDENIMLELTLGEI 131
           PGRH+FRIDENIM  +   +I
Sbjct: 421 PGRHMFRIDENIMPAVCNKDI 441




Source: Bombus impatiens

Species: Bombus impatiens

Genus: Bombus

Family: Apidae

Order: Hymenoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|340715043|ref|XP_003396030.1| PREDICTED: extracellular domains-containing protein CG31004-like [Bombus terrestris] Back     alignment and taxonomy information
>gi|383855804|ref|XP_003703400.1| PREDICTED: extracellular domains-containing protein CG31004-like [Megachile rotundata] Back     alignment and taxonomy information
>gi|332019409|gb|EGI59893.1| Extracellular domains-containing protein [Acromyrmex echinatior] Back     alignment and taxonomy information
>gi|345482776|ref|XP_001600000.2| PREDICTED: extracellular domains-containing protein CG31004-like [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|357623997|gb|EHJ74925.1| hypothetical protein KGM_05469 [Danaus plexippus] Back     alignment and taxonomy information
>gi|380020283|ref|XP_003694019.1| PREDICTED: extracellular domains-containing protein CG31004-like [Apis florea] Back     alignment and taxonomy information
>gi|307174209|gb|EFN64854.1| Uncharacterized protein K03H1.5 [Camponotus floridanus] Back     alignment and taxonomy information
>gi|328788914|ref|XP_392408.3| PREDICTED: extracellular domains-containing protein CG31004-like isoform 1 [Apis mellifera] Back     alignment and taxonomy information
>gi|157109935|ref|XP_001650886.1| hypothetical protein AaeL_AAEL005432 [Aedes aegypti] gi|108878861|gb|EAT43086.1| AAEL005432-PB [Aedes aegypti] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query134
FB|FBgn0051004 1431 mesh "mesh" [Drosophila melano 0.492 0.046 0.583 9.5e-17
FB|FBgn0051004 mesh "mesh" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 221 (82.9 bits), Expect = 9.5e-17, P = 9.5e-17
 Identities = 42/72 (58%), Positives = 51/72 (70%)

Query:    54 HT-LSADSLDSKGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPG 112
             HT    D+   +G  P     ++VGFNAGNGT++YEY PYSQ   IRDL  RGWANGFPG
Sbjct:   362 HTEAGGDTTKGEGGVP-----AYVGFNAGNGTQAYEYNPYSQNMVIRDLANRGWANGFPG 416

Query:   113 RHIFRIDENIML 124
             RHIFR+DE I++
Sbjct:   417 RHIFRVDEQILI 428


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.131   0.376    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      134       134   0.00091  102 3  11 23  0.46    31
                                                     30  0.44    33


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  527 (56 KB)
  Total size of DFA:  119 KB (2079 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  15.00u 0.09s 15.09t   Elapsed:  00:00:06
  Total cpu time:  15.00u 0.09s 15.09t   Elapsed:  00:00:07
  Start:  Thu Aug 15 11:13:53 2013   End:  Thu Aug 15 11:14:00 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0007160 "cell-matrix adhesion" evidence=IEA
GO:0019991 "septate junction assembly" evidence=IMP
GO:0005920 "smooth septate junction" evidence=IDA
GO:0007156 "homophilic cell adhesion" evidence=IMP
GO:0061028 "establishment of endothelial barrier" evidence=IMP

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query134
pfam0611990 pfam06119, NIDO, Nidogen-like 1e-07
smart00539152 smart00539, NIDO, Extracellular domain of unknown 0.004
>gnl|CDD|147987 pfam06119, NIDO, Nidogen-like Back     alignment and domain information
 Score = 46.3 bits (110), Expect = 1e-07
 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 76  VGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDE 120
            GF+AG+G R +   P S+  ++R+L       G PGR IFR+  
Sbjct: 47  AGFSAGDGNRFFTL-PGSRTESVRNLAQS-SNVGVPGRWIFRVGG 89


This is a nidogen-like domain (NIDO) domain and is an extracellular domain found in nidogen and hypothetical proteins of unknown function. Length = 90

>gnl|CDD|214712 smart00539, NIDO, Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 134
PF0611990 NIDO: Nidogen-like; InterPro: IPR003886 The ~180-r 99.71
smart00539152 NIDO Extracellular domain of unknown function in n 99.62
KOG4291|consensus 1043 99.52
>PF06119 NIDO: Nidogen-like; InterPro: IPR003886 The ~180-residue NIDO domain is an extracellular domain of unknown function, found in nidogen (entactin) and hypothetical proteins Back     alignment and domain information
Probab=99.71  E-value=6e-18  Score=118.12  Aligned_cols=56  Identities=39%  Similarity=0.599  Sum_probs=51.7

Q ss_pred             CCCCCCCCccceeeeeccCceeeEEeecCCCcccccccccccccCCCCcceEEeechh
Q psy12800         64 KGQSPEMHRRSFVGFNAGNGTRSYEYYPYSQRSTIRDLVGRGWANGFPGRHIFRIDEN  121 (134)
Q Consensus        64 ~g~~G~GGipAqvGFNaGnGt~~y~y~P~Sqt~~I~dL~~ts~~ng~PGRwvFRIDe~  121 (134)
                      .+.+|++++||+||||+||+++||+ +|+|++..|++|...+|+ |+||||+||||++
T Consensus        35 ~~~~~~~~~~a~aGf~~gd~~~~~~-lp~s~t~~i~~l~~~sN~-g~~G~wifrid~~   90 (90)
T PF06119_consen   35 GGYNGLGGVPAQAGFNSGDGTNYYN-LPGSGTDSIRNLDNSSNV-GVPGRWIFRIDSE   90 (90)
T ss_pred             cccccceeeeeEEEEecCCCcceEE-ecCCCccceeehhhcCCC-CCCEEEEEEeCCC
Confidence            4568999999999999999999999 899999999999999855 6999999999975



The NIDO domain is found in association with other domains, such as nidogen G2 beta-barrel (IPR006605 from INTERPRO), thyroglobulin type-1 (IPR000716 from INTERPRO), LDLRB (IPR000033 from INTERPRO), AMOP (IPR005533 from INTERPRO), EGF-like (IPR000742 from INTERPRO), VWFD, IPT/TIG, or sushi/CCP/SCR (IPR000436 from INTERPRO) [, , , ]. Some proteins known to contain a NIDO domain are listed below: Vertebrate nidogen-1 (NID-1) or entactin, a sulphated glycoprotein widely distributed in basement membranes. Vertebrate nidogen-2 (NID-2) or osteonidogen, a cell adhesion glycoprotein which is widely distributed in basement membranes. Vertebrate alpha-tectorin. Mammalian mucin-4 (MUC4), a highly glycosylated membrane-bound protein. Xenopus ID14, a putative matrix protein. ; GO: 0007160 cell-matrix adhesion

>smart00539 NIDO Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins Back     alignment and domain information
>KOG4291|consensus Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00