Psyllid ID: psy3908
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 170 | ||||||
| 357618418 | 626 | hypothetical protein KGM_04871 [Danaus p | 0.994 | 0.269 | 0.674 | 5e-62 | |
| 242015432 | 641 | conserved hypothetical protein [Pediculu | 0.988 | 0.262 | 0.648 | 2e-60 | |
| 91085635 | 583 | PREDICTED: similar to serrano protein [T | 0.964 | 0.281 | 0.666 | 3e-57 | |
| 328788501 | 644 | PREDICTED: hypothetical protein LOC10057 | 0.982 | 0.259 | 0.601 | 2e-51 | |
| 380025319 | 629 | PREDICTED: uncharacterized protein LOC10 | 0.982 | 0.265 | 0.595 | 6e-51 | |
| 307209197 | 631 | hypothetical protein EAI_15572 [Harpegna | 0.982 | 0.264 | 0.589 | 1e-50 | |
| 322784888 | 519 | hypothetical protein SINV_13076 [Solenop | 0.947 | 0.310 | 0.597 | 1e-50 | |
| 332026051 | 621 | hypothetical protein G5I_05320 [Acromyrm | 0.982 | 0.268 | 0.589 | 1e-50 | |
| 345489915 | 643 | PREDICTED: hypothetical protein LOC10011 | 0.982 | 0.259 | 0.580 | 3e-50 | |
| 340714385 | 625 | PREDICTED: hypothetical protein LOC10064 | 0.982 | 0.267 | 0.583 | 8e-50 |
| >gi|357618418|gb|EHJ71403.1| hypothetical protein KGM_04871 [Danaus plexippus] | Back alignment and taxonomy information |
|---|
Score = 242 bits (617), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 114/169 (67%), Positives = 140/169 (82%)
Query: 2 ATDGQILLATAKWSDEGQYLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLV 61
ATDGQ+++A ++WSDEG YL+EFV KN LP V KIIKGQYG LGVPTLPSPGLQST LLV
Sbjct: 3 ATDGQVVVAASRWSDEGHYLKEFVVKNRLPNVAKIIKGQYGGLGVPTLPSPGLQSTALLV 62
Query: 62 SAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISRR 121
SAGK++KIIAQA+KLKEGRR V VG RI IPE Y GYFELLSEEG+++RC E+V EI+RR
Sbjct: 63 SAGKKKKIIAQAIKLKEGRRMVSVGPRIAIPETYKGYFELLSEEGRAVRCMESVSEIARR 122
Query: 122 KIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPKSEVTLGE 170
K+E GCL+RE I+ I AK D G++ +DG+R + GEV++P+ EV LG+
Sbjct: 123 KLEDGCLVREPIRIICAKTDLNGDITADGSRTLPAGEVIMPRGEVFLGK 171
|
Source: Danaus plexippus Species: Danaus plexippus Genus: Danaus Family: Nymphalidae Order: Lepidoptera Class: Insecta Phylum: Arthropoda Superkingdom: Eukaryota |
| >gi|242015432|ref|XP_002428357.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212512969|gb|EEB15619.1| conserved hypothetical protein [Pediculus humanus corporis] | Back alignment and taxonomy information |
|---|
| >gi|91085635|ref|XP_970327.1| PREDICTED: similar to serrano protein [Tribolium castaneum] gi|270010087|gb|EFA06535.1| hypothetical protein TcasGA2_TC009439 [Tribolium castaneum] | Back alignment and taxonomy information |
|---|
| >gi|328788501|ref|XP_003251141.1| PREDICTED: hypothetical protein LOC100578865 [Apis mellifera] | Back alignment and taxonomy information |
|---|
| >gi|380025319|ref|XP_003696423.1| PREDICTED: uncharacterized protein LOC100871586 [Apis florea] | Back alignment and taxonomy information |
|---|
| >gi|307209197|gb|EFN86304.1| hypothetical protein EAI_15572 [Harpegnathos saltator] | Back alignment and taxonomy information |
|---|
| >gi|322784888|gb|EFZ11668.1| hypothetical protein SINV_13076 [Solenopsis invicta] | Back alignment and taxonomy information |
|---|
| >gi|332026051|gb|EGI66202.1| hypothetical protein G5I_05320 [Acromyrmex echinatior] | Back alignment and taxonomy information |
|---|
| >gi|345489915|ref|XP_001600158.2| PREDICTED: hypothetical protein LOC100115432 [Nasonia vitripennis] | Back alignment and taxonomy information |
|---|
| >gi|340714385|ref|XP_003395709.1| PREDICTED: hypothetical protein LOC100648821 [Bombus terrestris] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 170 | ||||||
| FB|FBgn0034408 | 778 | sano "serrano" [Drosophila mel | 0.770 | 0.168 | 0.567 | 3e-33 |
| FB|FBgn0034408 sano "serrano" [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
Score = 371 (135.7 bits), Expect = 3.0e-33, P = 3.0e-33
Identities = 76/134 (56%), Positives = 101/134 (75%)
Query: 2 ATDGQILLATAKWSDEGQ-YLREFVSKNSLPTVVKIIKGQYGNLGVPTLPSPGLQSTVLL 60
ATDGQI+LA +++ D YLR+F SK LP V KIIKGQ+ LGVPTL +P LQST L
Sbjct: 3 ATDGQIILAASRFGDATPVYLRDF-SKQPLPAVAKIIKGQHQALGVPTLSAPSLQSTALF 61
Query: 61 VSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSMRCCETVLEISR 120
+SAGK+ +I+AQ +K+KEGR+ VG+++ IPE Y GYFELLSE+G+S RC ++VLE+SR
Sbjct: 62 LSAGKKYQILAQPIKIKEGRKPTNVGAKVLIPETYGGYFELLSEDGRSTRCIDSVLELSR 121
Query: 121 RKIEAGCLIREAIK 134
R+ A L+RE +
Sbjct: 122 RR-NARVLVRETFR 134
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.316 0.136 0.383 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 170 170 0.00086 108 3 11 22 0.44 32
31 0.47 34
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 558 (59 KB)
Total size of DFA: 129 KB (2082 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 16.57u 0.09s 16.66t Elapsed: 00:00:04
Total cpu time: 16.57u 0.09s 16.66t Elapsed: 00:00:04
Start: Thu Aug 15 15:48:33 2013 End: Thu Aug 15 15:48:37 2013
|
|
Prediction of Enzyme Commission (EC) Number
Prediction of Functionally Associated Proteins
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 170 | |||
| pfam12736 | 251 | pfam12736, CABIT, Cell-cycle sustaining, positive | 1e-13 |
| >gnl|CDD|221744 pfam12736, CABIT, Cell-cycle sustaining, positive selection, | Back alignment and domain information |
|---|
Score = 65.9 bits (161), Expect = 1e-13
Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 16/131 (12%)
Query: 30 LPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRI 89
LP VVK+ G+Y GV L +LL S + +K++A+ V+ + GR VG ++
Sbjct: 1 LPQVVKVTSGEYFEGGVYLLKGQ----ELLLHSGDRIKKVVARCVEREGGR----VGPKL 52
Query: 90 DIPENYPGYFELLSEEGKSMRCCETVLEISRRKIEAGCLIREAIKGIVAKYDEEGNLQSD 149
IP +YPG F+LL + R E+V E+ R L+ E I+ Q
Sbjct: 53 LIPLSYPGKFKLLPRD----RSFESVEELLARVFPGRVLVTEDIRTSSDLSSSSDVDQ-- 106
Query: 150 GARIVNIGEVL 160
+V G+ L
Sbjct: 107 --ILVPAGDEL 115
|
The 'CABIT' domain (for 'cysteine-containing, all- in Themis') is found in a newly identified gene family that has three mammalian homologues (Themis, Icb1 and 9130404H23Rik) that encode proteins with two CABIT domains and a highly conserved proline-rich region. In contrast, Fam59A, Fam59B and related proteins from mammals to cnidarians, including the insect Serrano proteins, have a single copy of the CABIT domain, a proline-rich region and often a C-terminal SAM (sterile-motif) domain. Multiple-sequence alignment has predicted that the CABIT domain adopts an all-strand structure with at least 12 strands, ie a dyad of six-stranded beta-barrel units. The CABIT domain contains a nearly absolutely conserved cysteine residue which is likely to be central to its function. CABIT domain proteins function downstream of tyrosine kinase signalling and interact with GRB2. Length = 251 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 170 | |||
| PF12736 | 256 | CABIT: Cell-cycle sustaining, positive selection, | 99.88 |
| >PF12736 CABIT: Cell-cycle sustaining, positive selection, | Back alignment and domain information |
|---|
Probab=99.88 E-value=1.9e-22 Score=165.12 Aligned_cols=121 Identities=36% Similarity=0.459 Sum_probs=110.2
Q ss_pred CCeeEEeccCcccccCccCCCCCCcceeEeeeeccceeeeeeeehhhhcCceeccccceeeeCCCCCceeEEeccCCcee
Q psy3908 30 LPTVVKIIKGQYGNLGVPTLPSPGLQSTVLLVSAGKRQKIIAQAVKLKEGRRFVGVGSRIDIPENYPGYFELLSEEGKSM 109 (170)
Q Consensus 30 LP~vvri~~Gqy~~lGvp~l~~~slq~~~Ll~sa~~r~~i~aqa~kl~Egr~v~~IGPkI~IP~~Y~G~F~LLsed~~~v 109 (170)
||+|++|..|+|...+++.|+ .+.++||.+.+.++|+|++++-++|+. +||++.||++|+|||+++..++
T Consensus 1 lP~vv~V~~g~y~~~~~~~l~----G~~l~l~~~~~~k~i~a~~~~~~~~~~---~g~~~~IP~~y~G~F~~~~~~~--- 70 (256)
T PF12736_consen 1 LPRVVKVQSGVYFQGSVYELS----GQELCLHTGDQIKKILAQSVESEKGRK---VGPHFLIPLNYPGKFKRLPRPG--- 70 (256)
T ss_pred CCeEEEEecCCCCCCccccCC----CCEEEEEecCEeeEEEEEEEeccCccc---cCCEEEECCcccceEEECCCcc---
Confidence 799999999999998888777 566899999888999999999888887 8999999999999999999988
Q ss_pred eeeehHHHH-HhhccCcceEEeeeeeeeeeeecCCCCeeccceeeEecCceeeec
Q psy3908 110 RCCETVLEI-SRRKIEAGCLIREAIKGIVAKYDEEGNLQSDGARIVNIGEVLLPK 163 (170)
Q Consensus 110 R~fesV~El-a~~fp~~rvlv~E~Ir~~~~~~~ed~~~~~~~~~Tl~aGe~L~l~ 163 (170)
+|+||+|| ++.+| .+|++++++++.. ..+++++....+++++++||.|+++
T Consensus 71 -~f~tV~eL~~~~~p-~~v~~~~~~~~~~-~~~~~~~~~~~~~~~v~~Ge~L~l~ 122 (256)
T PF12736_consen 71 -PFETVYELLSRGFP-LRVVVTKDITFSS-ASGENSSVSDYGKRTVPAGEVLRLL 122 (256)
T ss_pred -ccCCHHHHhhccCC-CEEEEcCccceec-ccccccccccCCceEECCCCEEEEE
Confidence 99999999 99999 9999999999987 5566666777889999999999999
|
|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00