Psyllid ID: psy6917
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 132 | ||||||
| 91087275 | 307 | PREDICTED: similar to CG7598 CG7598-PA [ | 0.484 | 0.208 | 0.676 | 2e-20 | |
| 270011209 | 291 | NADH dehydrogenase (ubiquinone) 1 alpha | 0.484 | 0.219 | 0.676 | 2e-20 | |
| 113197941 | 318 | Unknown (protein for MGC:146886) [Xenopu | 0.545 | 0.226 | 0.630 | 9e-18 | |
| 395503379 | 329 | PREDICTED: complex I intermediate-associ | 0.492 | 0.197 | 0.646 | 1e-17 | |
| 62860000 | 302 | NADH dehydrogenase (ubiquinone) 1 alpha | 0.507 | 0.221 | 0.641 | 1e-17 | |
| 332375991 | 290 | unknown [Dendroctonus ponderosae] | 0.477 | 0.217 | 0.646 | 1e-17 | |
| 170048662 | 316 | chaperone protein [Culex quinquefasciatu | 0.507 | 0.212 | 0.637 | 1e-17 | |
| 157133317 | 317 | chaperone protein, putative [Aedes aegyp | 0.507 | 0.211 | 0.637 | 2e-17 | |
| 157103939 | 317 | chaperone protein, putative [Aedes aegyp | 0.507 | 0.211 | 0.637 | 2e-17 | |
| 291242841 | 273 | PREDICTED: CG7598-like [Saccoglossus kow | 0.507 | 0.245 | 0.611 | 2e-17 |
| >gi|91087275|ref|XP_975544.1| PREDICTED: similar to CG7598 CG7598-PA [Tribolium castaneum] | Back alignment and taxonomy information |
|---|
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 44/65 (67%), Positives = 57/65 (87%)
Query: 56 IPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEF 115
IPFSKFF+ SKGR+QD Q+P+ +R+++ G+S+ D+ NGPF LE+DYIGIEYDPNHTEEF
Sbjct: 220 IPFSKFFMASKGRVQDKQHPIALNRVTNFGISIGDKVNGPFSLEIDYIGIEYDPNHTEEF 279
Query: 116 AYEMY 120
AYE+Y
Sbjct: 280 AYELY 284
|
Source: Tribolium castaneum Species: Tribolium castaneum Genus: Tribolium Family: Tenebrionidae Order: Coleoptera Class: Insecta Phylum: Arthropoda Superkingdom: Eukaryota |
| >gi|270011209|gb|EFA07657.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1 [Tribolium castaneum] | Back alignment and taxonomy information |
|---|
| >gi|113197941|gb|AAI21514.1| Unknown (protein for MGC:146886) [Xenopus (Silurana) tropicalis] | Back alignment and taxonomy information |
|---|
| >gi|395503379|ref|XP_003756044.1| PREDICTED: complex I intermediate-associated protein 30, mitochondrial [Sarcophilus harrisii] | Back alignment and taxonomy information |
|---|
| >gi|62860000|ref|NP_001017153.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1 [Xenopus (Silurana) tropicalis] gi|89272920|emb|CAJ82283.1| NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1 [Xenopus (Silurana) tropicalis] | Back alignment and taxonomy information |
|---|
| >gi|332375991|gb|AEE63136.1| unknown [Dendroctonus ponderosae] | Back alignment and taxonomy information |
|---|
| >gi|170048662|ref|XP_001870724.1| chaperone protein [Culex quinquefasciatus] gi|167870702|gb|EDS34085.1| chaperone protein [Culex quinquefasciatus] | Back alignment and taxonomy information |
|---|
| >gi|157133317|ref|XP_001662831.1| chaperone protein, putative [Aedes aegypti] gi|108870867|gb|EAT35092.1| AAEL012716-PA [Aedes aegypti] | Back alignment and taxonomy information |
|---|
| >gi|157103939|ref|XP_001648190.1| chaperone protein, putative [Aedes aegypti] gi|108869298|gb|EAT33523.1| AAEL014201-PA [Aedes aegypti] | Back alignment and taxonomy information |
|---|
| >gi|291242841|ref|XP_002741312.1| PREDICTED: CG7598-like [Saccoglossus kowalevskii] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 132 | ||||||
| FB|FBgn0039689 | 296 | CG7598 [Drosophila melanogaste | 0.492 | 0.219 | 0.582 | 2.3e-29 | |
| ZFIN|ZDB-GENE-050306-9 | 304 | ndufaf1 "NADH dehydrogenase (u | 0.492 | 0.213 | 0.646 | 1e-23 | |
| UNIPROTKB|E1BZH1 | 303 | NDUFAF1 "Uncharacterized prote | 0.507 | 0.221 | 0.597 | 8.8e-23 | |
| UNIPROTKB|E2RPE7 | 328 | NDUFAF1 "Uncharacterized prote | 0.477 | 0.192 | 0.619 | 2e-21 | |
| RGD|1306202 | 330 | Ndufaf1 "NADH dehydrogenase (u | 0.477 | 0.190 | 0.634 | 1.4e-17 | |
| MGI|MGI:1916952 | 328 | Ndufaf1 "NADH dehydrogenase (u | 0.477 | 0.192 | 0.619 | 1.7e-17 | |
| UNIPROTKB|Q9Y375 | 327 | NDUFAF1 "Complex I intermediat | 0.477 | 0.192 | 0.619 | 2.3e-17 | |
| UNIPROTKB|E1BAZ2 | 328 | NDUFAF1 "Uncharacterized prote | 0.477 | 0.192 | 0.619 | 4.7e-16 |
| FB|FBgn0039689 CG7598 [Drosophila melanogaster (taxid:7227)] | Back alignment and assigned GO terms |
|---|
Score = 215 (80.7 bits), Expect = 2.3e-29, Sum P(2) = 2.3e-29
Identities = 39/67 (58%), Positives = 51/67 (76%)
Query: 56 IPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQN--NGPFQLELDYIGIEYDPNHTE 113
IPFSKFFL SKGR+QD Q + +R++ G S+ + +GPF LE+DY+G+EYDP+H E
Sbjct: 221 IPFSKFFLSSKGRVQDRQGAIPLNRVTHFGFSVAAKKGMDGPFGLEIDYVGLEYDPSHRE 280
Query: 114 EFAYEMY 120
EFAYEMY
Sbjct: 281 EFAYEMY 287
|
|
| ZFIN|ZDB-GENE-050306-9 ndufaf1 "NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|E1BZH1 NDUFAF1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|E2RPE7 NDUFAF1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] | Back alignment and assigned GO terms |
|---|
| RGD|1306202 Ndufaf1 "NADH dehydrogenase (ubiquinone) complex I, assembly factor 1" [Rattus norvegicus (taxid:10116)] | Back alignment and assigned GO terms |
|---|
| MGI|MGI:1916952 Ndufaf1 "NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, assembly factor 1" [Mus musculus (taxid:10090)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q9Y375 NDUFAF1 "Complex I intermediate-associated protein 30, mitochondrial" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|E1BAZ2 NDUFAF1 "Uncharacterized protein" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
Prediction of Functionally Associated Proteins
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 132 | |||
| pfam08547 | 156 | pfam08547, CIA30, Complex I intermediate-associate | 4e-11 |
| >gnl|CDD|219896 pfam08547, CIA30, Complex I intermediate-associated protein 30 (CIA30) | Back alignment and domain information |
|---|
Score = 56.4 bits (137), Expect = 4e-11
Identities = 22/45 (48%), Positives = 28/45 (62%)
Query: 56 IPFSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLEL 100
IPFS F +GR+ D PLD RI IGL + D+ GPF+LE+
Sbjct: 112 IPFSDFKPTFRGRVVDDAPPLDLSRIKQIGLLISDKQPGPFELEI 156
|
This protein is associated with mitochondrial Complex I intermediate-associated protein 30 (CIA30) in human and mouse. The family is also present in Schizosaccharomyces pombe which does not contain the NADH dehydrogenase component of complex I, or many of the other essential subunits. This means it is possible that this family of protein may not be directly involved in oxidative phosphorylation. Length = 156 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 132 | |||
| KOG2435|consensus | 323 | 100.0 | ||
| PF08547 | 157 | CIA30: Complex I intermediate-associated protein 3 | 99.87 | |
| PF03425 | 178 | CBM_11: Carbohydrate binding domain (family 11); I | 95.7 |
| >KOG2435|consensus | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.1e-39 Score=266.25 Aligned_cols=125 Identities=50% Similarity=0.813 Sum_probs=121.4
Q ss_pred Cchhhhhhhhc---cCceeeecCCCceEEEEEc-CCCCCCeEEeeccccCCccceEEE----------------------
Q psy6917 1 MWKDEVQDKLY---MDPVGDVYPNETKVIWKFN-ESTLDDWIVTTDSDHNEGLSKASL---------------------- 54 (132)
Q Consensus 1 ~~~~e~~~~~~---~~~~~~~~~~~~~vlf~F~-~~~l~~W~v~sD~v~GeG~S~a~l---------------------- 54 (132)
+++.|++.+|+ ++||+++++++.+|+|+|+ +++|+.|+++||+++| |+|+|+|
T Consensus 93 ~lkeeIkl~w~~eE~~pllevr~~eakvvf~F~~kEdLdkWtv~sDsd~g-G~StasLe~sd~G~~alf~G~~ss~~~kd 171 (323)
T KOG2435|consen 93 LLKEEIKLHWRGEEGHPLLEVRLEEAKVVFQFRGKEDLDKWTVTSDSDIG-GRSTASLEMSDNGQSALFYGTLSSEAPKD 171 (323)
T ss_pred HHHHHHHHHhccccCccceeecCCcceEEEEccChhhcceeEeecccccC-CeeeEEEEecCCCcceeeccccccccccC
Confidence 47899999997 8999999999999999999 9999999999999999 9999999
Q ss_pred -----------------------------------------------------------------------------EEe
Q psy6917 55 -----------------------------------------------------------------------------AIP 57 (132)
Q Consensus 55 -----------------------------------------------------------------------------~IP 57 (132)
+||
T Consensus 172 g~i~RsGyc~Mrs~~RkaF~rk~~~dw~qfn~L~LrvRGDGRsy~inihte~~~dq~wndsys~flft~gGp~wq~~KIP 251 (323)
T KOG2435|consen 172 GEITRSGYCAMRSRPRKAFERKMSYDWSQFNTLYLRVRGDGRSYMINIHTETDFDQRWNDSYSYFLFTRGGPYWQEVKIP 251 (323)
T ss_pred cceeeeeeeeeeccchhhhcceecccccccceEEEEEecCCceEEEEecCccchhhhcccceeeEEecCCCCceeEEecc
Confidence 799
Q ss_pred CCCceeccCCeecCCCCCCCccCccEEEEEEecCCCCceEEEEeEEEEEeCCCchhhhhhhcccccceE
Q psy6917 58 FSKFFLQSKGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHTEEFAYEMYLFENEV 126 (132)
Q Consensus 58 fs~F~~t~RGr~~d~q~~L~~~~I~~iGf~I~dkq~GpF~LeId~I~a~~d~~~~ee~aye~y~~~~~~ 126 (132)
||+|++|++||++|.|.+||+++|.+|||+++||.+|||.||||+|++.+|++|+|+||||+|..++|.
T Consensus 252 fSKff~t~kGriqDrq~e~nl~~vssig~sl~dk~dGpF~LEIDfIGv~~d~~H~EdFayE~y~~p~~r 320 (323)
T KOG2435|consen 252 FSKFFFTNKGRIQDRQHELNLDKVSSIGFSLADKVDGPFFLEIDFIGVFTDPAHTEDFAYENYPEPNPR 320 (323)
T ss_pred hhhheeccccceeecccccCccceeeEeEEEeeccCCcceeeEEEEEEecCCCcccceeeecccccccc
Confidence 999999999999999999999999999999999999999999999999999999999999999999765
|
|
| >PF08547 CIA30: Complex I intermediate-associated protein 30 (CIA30); InterPro: IPR013857 Mitochondrial complex I intermediate-associated protein 30 (CIA30) is present in human and mouse, and also in Schizosaccharomyces pombe (Fission yeast) which does not contain the NADH dehydrogenase component of complex I, or many of the other essential subunits | Back alignment and domain information |
|---|
| >PF03425 CBM_11: Carbohydrate binding domain (family 11); InterPro: IPR005087 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 132 | |||
| 1v0a_A | 178 | Endoglucanase H; carbohydrate binding module, cell | 97.95 |
| >1v0a_A Endoglucanase H; carbohydrate binding module, cellulosome, cellulose degradation, hydrolase, glycosidase; 1.98A {Clostridium thermocellum} SCOP: b.18.1.30 | Back alignment and structure |
|---|
Probab=97.95 E-value=1.1e-05 Score=62.43 Aligned_cols=56 Identities=29% Similarity=0.397 Sum_probs=46.0
Q ss_pred EEEEeCCCceeccCCeecCCCC----CCCccCccEEEEEEecCCCCceEEEEeEEEEEeCCCch
Q psy6917 53 SLAIPFSKFFLQSKGRIQDIQN----PLDTDRISSIGLSLVDQNNGPFQLELDYIGIEYDPNHT 112 (132)
Q Consensus 53 ~l~IPfs~F~~t~RGr~~d~q~----~L~~~~I~~iGf~I~dkq~GpF~LeId~I~a~~d~~~~ 112 (132)
.++|||++|. ||-+.|+..+ +||+++|++|+|+++...+|. |.||+|+++.-..|.
T Consensus 115 ~IeIPFs~F~--~r~~~~P~~~~~~~~~d~~~i~si~~m~G~~~~g~--~~id~I~l~~~~~~~ 174 (178)
T 1v0a_A 115 TIEIPFSSFR--RRLDYQPPGQDMSGTLDLDNIDSIHFMYANNKSGK--FVVDNIKLIGALEHH 174 (178)
T ss_dssp EEEEEGGGCE--ECCSCCCTTCCCCSSCCTTSEEEEEEEESSSCCEE--EEEEEEEEECBCCCC
T ss_pred EEEEEHHHhc--cccccCCCCcccCCCcChhHeEEEEEEEcCCCcce--EEEEEEEEEeeeccc
Confidence 3499999999 7777766544 899999999999988766666 899999999777654
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 132 | ||||
| d1v0aa1 | 167 | b.18.1.30 (A:4-170) Endoglucanase H {Clostridium t | 2e-05 |
| >d1v0aa1 b.18.1.30 (A:4-170) Endoglucanase H {Clostridium thermocellum [TaxId: 1515]} Length = 167 | Back information, alignment and structure |
|---|
class: All beta proteins fold: Galactose-binding domain-like superfamily: Galactose-binding domain-like family: CBM11 domain: Endoglucanase H species: Clostridium thermocellum [TaxId: 1515]
Score = 40.0 bits (93), Expect = 2e-05
Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 3/52 (5%)
Query: 56 IPFSKFFLQS--KGRIQDIQNPLDTDRISSIGLSLVDQNNGPFQLE-LDYIG 104
IPFS F + + QD+ LD D I SI + +G F ++ + IG
Sbjct: 115 IPFSSFRRRLDYQPPGQDMSGTLDLDNIDSIHFMYANNKSGKFVVDNIKLIG 166
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 132 | |||
| d1v0aa1 | 167 | Endoglucanase H {Clostridium thermocellum [TaxId: | 99.0 | |
| d1wmxa_ | 173 | Endoglucanase CelJ {Clostridium thermocellum [TaxI | 91.04 |
| >d1v0aa1 b.18.1.30 (A:4-170) Endoglucanase H {Clostridium thermocellum [TaxId: 1515]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: Galactose-binding domain-like superfamily: Galactose-binding domain-like family: CBM11 domain: Endoglucanase H species: Clostridium thermocellum [TaxId: 1515]
Probab=99.00 E-value=3.7e-10 Score=82.48 Aligned_cols=51 Identities=31% Similarity=0.487 Sum_probs=45.2
Q ss_pred EEEeCCCceeccCCeec--CCCCCCCccCccEEEEEEecCCCCceEEEEeEEEEE
Q psy6917 54 LAIPFSKFFLQSKGRIQ--DIQNPLDTDRISSIGLSLVDQNNGPFQLELDYIGIE 106 (132)
Q Consensus 54 l~IPfs~F~~t~RGr~~--d~q~~L~~~~I~~iGf~I~dkq~GpF~LeId~I~a~ 106 (132)
++|||++|.++.+++.+ +..++||+++|.+|+|+++++++|+|+ ||+|+++
T Consensus 113 V~IPF~~F~~~~~~~~~~~~~~~~ldls~i~~~~~~~~~g~~G~f~--iD~i~l~ 165 (167)
T d1v0aa1 113 IEIPFSSFRRRLDYQPPGQDMSGTLDLDNIDSIHFMYANNKSGKFV--VDNIKLI 165 (167)
T ss_dssp EEEEGGGCEECCSCCCTTCCCCSSCCTTSEEEEEEEESSSCCEEEE--EEEEEEE
T ss_pred EEEEHHHcccccccCcccCCCccccChHHeEEEEEEECCCCCccEE--EEEEEEe
Confidence 39999999999888765 467799999999999999999999885 8999886
|
| >d1wmxa_ b.18.1.24 (A:) Endoglucanase CelJ {Clostridium thermocellum [TaxId: 1515]} | Back information, alignment and structure |
|---|