Psyllid ID: psy4652
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 291 | ||||||
| 322802279 | 265 | hypothetical protein SINV_06642 [Solenop | 0.704 | 0.773 | 0.462 | 1e-56 | |
| 307172448 | 265 | UPF0293 protein C16orf42 [Camponotus flo | 0.505 | 0.554 | 0.560 | 6e-55 | |
| 332028977 | 235 | UPF0293 protein C16orf42 [Acromyrmex ech | 0.512 | 0.634 | 0.554 | 8e-55 | |
| 380012742 | 662 | PREDICTED: uncharacterized protein LOC10 | 0.505 | 0.222 | 0.539 | 4e-54 | |
| 383864286 | 264 | PREDICTED: probable ribosome biogenesis | 0.505 | 0.556 | 0.544 | 7e-54 | |
| 328788025 | 267 | PREDICTED: UPF0293 protein C16orf42-like | 0.505 | 0.550 | 0.534 | 1e-53 | |
| 242015816 | 387 | conserved hypothetical protein [Pediculu | 0.687 | 0.516 | 0.427 | 4e-53 | |
| 157125951 | 269 | hypothetical protein AaeL_AAEL010333 [Ae | 0.512 | 0.553 | 0.539 | 5e-53 | |
| 357617627 | 249 | hypothetical protein KGM_13042 [Danaus p | 0.508 | 0.594 | 0.542 | 1e-52 | |
| 389614886 | 250 | simila to CG4338, partial [Papilio polyt | 0.505 | 0.588 | 0.550 | 1e-52 |
| >gi|322802279|gb|EFZ22675.1| hypothetical protein SINV_06642 [Solenopsis invicta] | Back alignment and taxonomy information |
|---|
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 135/292 (46%), Positives = 163/292 (55%), Gaps = 87/292 (29%)
Query: 1 MWDMKHCDPKKCSGRKLARHNLIKILKLGERFRGFPKEAKFYLDKFKWGRTFLTLNEELL 60
MWDM+HCDP+KCSGRKLARH LIKIL+LG RF G
Sbjct: 58 MWDMEHCDPRKCSGRKLARHGLIKILRLGARFPG-------------------------- 91
Query: 61 DLYKACDTVLFIFLMRICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKT 120
+CLTPVG +CVS D +I++E G AVVDCSWA++EETPFNKM+T
Sbjct: 92 ----------------LCLTPVGDKCVSPTDCEIVREYGCAVVDCSWARLEETPFNKMRT 135
Query: 121 NHPRLLPFLIAANPINYGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNE 180
HPRLLPFL+AANPINYGKPC+LSCVEA+AAT II GFP EAK YL KF WG +FL LN
Sbjct: 136 PHPRLLPFLVAANPINYGKPCQLSCVEAIAATLIITGFPDEAKLYLGKFSWGHSFLELNS 195
Query: 181 ELLDLYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVV 240
ELL+ Y C +S +V+
Sbjct: 196 ELLEKY--------------------------------------------ALCKSSEEVI 211
Query: 241 KVQNEYLKKIEEER-NMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
Q EYLK +E+ + P +PDFP S++E EEEEE+++E I ELG I
Sbjct: 212 AAQEEYLKNAWQEKLDRPALPDFPQSDTESEEEEEKKDENAATKITKELGDI 263
|
Source: Solenopsis invicta Species: Solenopsis invicta Genus: Solenopsis Family: Formicidae Order: Hymenoptera Class: Insecta Phylum: Arthropoda Superkingdom: Eukaryota |
| >gi|307172448|gb|EFN63901.1| UPF0293 protein C16orf42 [Camponotus floridanus] | Back alignment and taxonomy information |
|---|
| >gi|332028977|gb|EGI68992.1| UPF0293 protein C16orf42 [Acromyrmex echinatior] | Back alignment and taxonomy information |
|---|
| >gi|380012742|ref|XP_003690436.1| PREDICTED: uncharacterized protein LOC100869694 [Apis florea] | Back alignment and taxonomy information |
|---|
| >gi|383864286|ref|XP_003707610.1| PREDICTED: probable ribosome biogenesis protein C16orf42 homolog [Megachile rotundata] | Back alignment and taxonomy information |
|---|
| >gi|328788025|ref|XP_625167.2| PREDICTED: UPF0293 protein C16orf42-like [Apis mellifera] | Back alignment and taxonomy information |
|---|
| >gi|242015816|ref|XP_002428543.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212513177|gb|EEB15805.1| conserved hypothetical protein [Pediculus humanus corporis] | Back alignment and taxonomy information |
|---|
| >gi|157125951|ref|XP_001654463.1| hypothetical protein AaeL_AAEL010333 [Aedes aegypti] gi|108873473|gb|EAT37698.1| AAEL010333-PA [Aedes aegypti] | Back alignment and taxonomy information |
|---|
| >gi|357617627|gb|EHJ70898.1| hypothetical protein KGM_13042 [Danaus plexippus] | Back alignment and taxonomy information |
|---|
| >gi|389614886|dbj|BAM20446.1| simila to CG4338, partial [Papilio polytes] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 291 | ||||||
| ZFIN|ZDB-GENE-040718-63 | 297 | tsr3 "TSR3, 20S rRNA accumulat | 0.549 | 0.538 | 0.470 | 2e-47 | |
| UNIPROTKB|D4A6X9 | 263 | RGD1565744 "Probable ribosome | 0.388 | 0.429 | 0.601 | 8.6e-47 | |
| MGI|MGI:1915577 | 323 | Tsr3 "TSR3 20S rRNA accumulati | 0.388 | 0.349 | 0.592 | 1.4e-46 | |
| UNIPROTKB|F1RFZ6 | 316 | TSR3 "Probable ribosome biogen | 0.388 | 0.357 | 0.601 | 1.8e-46 | |
| UNIPROTKB|A5PK98 | 315 | C25H16ORF42 "Probable ribosome | 0.446 | 0.412 | 0.553 | 2.3e-46 | |
| UNIPROTKB|Q9UJK0 | 312 | TSR3 "Ribosome biogenesis prot | 0.388 | 0.362 | 0.584 | 6e-46 | |
| UNIPROTKB|F1PWP9 | 312 | TSR3 "Probable ribosome biogen | 0.388 | 0.362 | 0.592 | 4.1e-45 | |
| TAIR|locus:2184128 | 266 | AT5G10070 "AT5G10070" [Arabido | 0.391 | 0.428 | 0.543 | 2.5e-43 | |
| CGD|CAL0002246 | 346 | orf19.5802 [Candida albicans ( | 0.450 | 0.378 | 0.471 | 1.1e-42 | |
| UNIPROTKB|Q5A0K9 | 346 | TSR3 "Ribosome biogenesis prot | 0.450 | 0.378 | 0.471 | 1.1e-42 |
| ZFIN|ZDB-GENE-040718-63 tsr3 "TSR3, 20S rRNA accumulation, homolog (S. cerevisiae)" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
Score = 379 (138.5 bits), Expect = 2.0e-47, Sum P(2) = 2.0e-47
Identities = 79/168 (47%), Positives = 108/168 (64%)
Query: 77 ICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKTNHPRLLPFLIAANPIN 136
+ L+P+G + V+ AD +I+ G AV+DCSWA++E+TPF+KM +HPRLLP+L+AANP+N
Sbjct: 95 LILSPMGTKYVTPADREIVAHGGLAVIDCSWARLEDTPFSKMIGSHPRLLPYLVAANPVN 154
Query: 137 YGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNEELLDLY----KECDTI 192
YGKPCKLSCVEA AATF I GF A L KFKWG FL LN+ LL+ Y E + +
Sbjct: 155 YGKPCKLSCVEAYAATFCIVGFRDLAVLLLRKFKWGLGFLELNKTLLERYASSQSEEELL 214
Query: 193 FYEDLDGERFPKEAKFYLDKFKW--GRTFLTLNEELLDLYKECDTSTD 238
E P+E + +D F GR F+ LN + Y + D +++
Sbjct: 215 AVEKEFLSATPQEEE--IDPFDVDSGRDFVNLNRPVQAKYDDSDNTSE 260
|
|
| UNIPROTKB|D4A6X9 RGD1565744 "Probable ribosome biogenesis protein C16orf42 homolog" [Rattus norvegicus (taxid:10116)] | Back alignment and assigned GO terms |
|---|
| MGI|MGI:1915577 Tsr3 "TSR3 20S rRNA accumulation" [Mus musculus (taxid:10090)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1RFZ6 TSR3 "Probable ribosome biogenesis protein C16orf42 homolog" [Sus scrofa (taxid:9823)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|A5PK98 C25H16ORF42 "Probable ribosome biogenesis protein C16orf42 homolog" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q9UJK0 TSR3 "Ribosome biogenesis protein TSR3 homolog" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1PWP9 TSR3 "Probable ribosome biogenesis protein C16orf42 homolog" [Canis lupus familiaris (taxid:9615)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2184128 AT5G10070 "AT5G10070" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| CGD|CAL0002246 orf19.5802 [Candida albicans (taxid:5476)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q5A0K9 TSR3 "Ribosome biogenesis protein TSR3" [Candida albicans SC5314 (taxid:237561)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
Prediction of Functionally Associated Proteins
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 291 | |||
| COG2042 | 179 | COG2042, COG2042, Uncharacterized conserved protei | 2e-56 | |
| pfam04034 | 128 | pfam04034, DUF367, Domain of unknown function (DUF | 2e-54 | |
| PRK02287 | 171 | PRK02287, PRK02287, hypothetical protein; Provisio | 3e-52 | |
| pfam04068 | 35 | pfam04068, RLI, Possible Fer4-like domain in RNase | 1e-08 | |
| pfam04034 | 128 | pfam04034, DUF367, Domain of unknown function (DUF | 7e-08 | |
| pfam02459 | 548 | pfam02459, Adeno_terminal, Adenoviral DNA terminal | 0.001 | |
| TIGR00927 | 1096 | TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger | 0.003 |
| >gnl|CDD|224953 COG2042, COG2042, Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
Score = 179 bits (456), Expect = 2e-56
Identities = 86/251 (34%), Positives = 113/251 (45%), Gaps = 87/251 (34%)
Query: 1 MWDMKHCDPKKCSGRKLARHNLIKILKLGERFRGFPKEAKFYLDKFKWGRTFLTLNEELL 60
++ CDPKKC+G+KL R L ++L++G +F G
Sbjct: 16 IYHAGQCDPKKCTGKKLERFGLARLLRVGGKFPG-------------------------- 49
Query: 61 DLYKACDTVLFIFLMRICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKT 120
+ LTP + +S AD DI++ G VVDCSW ++E F K++
Sbjct: 50 ----------------VVLTPFAEKALSPADRDIVERFGITVVDCSWNRVERV-FKKLRG 92
Query: 121 NHPRLLPFLIAANPINYGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNE 180
R LPFL+AANP+NYGKP KLS EALAA I GF EA
Sbjct: 93 REHRRLPFLVAANPVNYGKPFKLSSAEALAAALYIVGFKDEA-----------------S 135
Query: 181 ELLDLYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVV 240
EL L KFKWG TFL LN+ELL+ Y + S +VV
Sbjct: 136 EL---------------------------LSKFKWGHTFLELNKELLEEYSNAEDSAEVV 168
Query: 241 KVQNEYLKKIE 251
++Q EYL+K E
Sbjct: 169 EIQEEYLEKGE 179
|
Length = 179 |
| >gnl|CDD|112833 pfam04034, DUF367, Domain of unknown function (DUF367) | Back alignment and domain information |
|---|
| >gnl|CDD|235024 PRK02287, PRK02287, hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >gnl|CDD|217870 pfam04068, RLI, Possible Fer4-like domain in RNase L inhibitor, RLI | Back alignment and domain information |
|---|
| >gnl|CDD|112833 pfam04034, DUF367, Domain of unknown function (DUF367) | Back alignment and domain information |
|---|
| >gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein | Back alignment and domain information |
|---|
| >gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 291 | |||
| KOG3154|consensus | 263 | 100.0 | ||
| COG2042 | 179 | Uncharacterized conserved protein [Function unknow | 100.0 | |
| PRK02287 | 171 | hypothetical protein; Provisional | 100.0 | |
| PF04034 | 127 | DUF367: Domain of unknown function (DUF367); Inter | 100.0 | |
| PF04034 | 127 | DUF367: Domain of unknown function (DUF367); Inter | 99.88 | |
| KOG3154|consensus | 263 | 99.85 | ||
| COG2042 | 179 | Uncharacterized conserved protein [Function unknow | 99.84 | |
| PRK02287 | 171 | hypothetical protein; Provisional | 99.83 | |
| PF04068 | 35 | RLI: Possible Fer4-like domain in RNase L inhibito | 98.65 | |
| PF03942 | 203 | DTW: DTW domain; InterPro: IPR005636 This presumed | 93.71 |
| >KOG3154|consensus | Back alignment and domain information |
|---|
Probab=100.00 E-value=2.7e-76 Score=535.23 Aligned_cols=171 Identities=58% Similarity=1.141 Sum_probs=166.4
Q ss_pred CCCCCCCCCCCcchhhhhhhchHHHHhhcccccCChhhHHHHhcccCcchhHHHhhHHHHHHHhhccchhhhhcceEEeC
Q psy4652 1 MWDMKHCDPKKCSGRKLARHNLIKILKLGERFRGFPKEAKFYLDKFKWGRTFLTLNEELLDLYKACDTVLFIFLMRICLT 80 (291)
Q Consensus 1 m~d~~~cDpkkcsg~kL~KLnCvEALaa~l~I~G~~~~A~~lL~~F~WG~~F~~iN~elLd~Y~~C~~~~ev~~~~i~l~ 80 (291)
||||+||||||||||||.||+||..|..+..+-| ||||
T Consensus 47 mwdf~hCdpkrCsGkKL~rlglv~~lr~g~kF~G------------------------------------------vVlS 84 (263)
T KOG3154|consen 47 MWDFGHCDPKRCSGKKLARLGLVRNLRVGQKFGG------------------------------------------VVLS 84 (263)
T ss_pred hhcccCCCccccchHHHHhhhhHHHhhccCccCc------------------------------------------eEEC
Confidence 9999999999999999999999999999999999 9999
Q ss_pred CCCCcccCccchHHHHhcCeEEEecCcccccccccccCCCCCCCcCcchhhcCCCCCCCCCcccHHHHHHhHHHHcCChH
Q psy4652 81 PVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKTNHPRLLPFLIAANPINYGKPCKLSCVEALAATFIIAGFPK 160 (291)
Q Consensus 81 P~~~~~vs~~D~~~i~~~G~~vvdcSW~~~~~~~~~~~~~~~~R~LP~LiAaNpvnygkp~kLs~vEAlaa~l~i~g~~~ 160 (291)
|+|++||||+|+++|.+.|++||||||++|+++||.||+..|+||||||||||||||||||+|||||||||+|||+||++
T Consensus 85 P~gk~~vsp~D~d~v~~~G~avvdCSWArLdevpF~kl~~~h~RLLP~lVAANpVNYGrP~rLnCvEAlaA~l~I~G~~e 164 (263)
T KOG3154|consen 85 PVGKQCVSPADRDVVERSGAAVVDCSWARLDEVPFGKLRGRHERLLPYLVAANPVNYGRPWRLNCVEALAACLYICGFPE 164 (263)
T ss_pred CCCCcccCHHHHHHHHhcCceEEecchhhccccchhhcCCCcccccchhhhcCccccCCCceecHHHHHHhHeeeecChh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhccCCCCcchhhhcHHHHHHhhhhcccccccccCCCcHHHHHHHhhccCCCchhhhhcHHHHHHHhcCCCcHHHH
Q psy4652 161 EAKFYLDKFKWGRTFLTLNEELLDLYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVV 240 (291)
Q Consensus 161 ~a~~~l~~fkwG~~FleLN~elLe~Y~a~~si~ye~l~~~~f~eqA~~lL~kFkWG~~Fl~lN~ELLE~Ya~c~~~~Ev~ 240 (291)
+|..+|++|||||.||+|| +++|+.|++|.+++||+
T Consensus 165 ~A~~lL~~F~wG~~Fl~lN--------------------------------------------~~lLd~Ya~C~~s~ev~ 200 (263)
T KOG3154|consen 165 EARELLDKFKWGHAFLELN--------------------------------------------KDLLDEYAKCASSAEVV 200 (263)
T ss_pred HHHHHHhcCcchHHHHHHh--------------------------------------------HHHHHHHHhhCCHHHHH
Confidence 9999999999999999999 89999999999999999
Q ss_pred HHHHHHHHHHHHhhCCC
Q psy4652 241 KVQNEYLKKIEEERNMP 257 (291)
Q Consensus 241 ~iq~eyl~~~~~~~~~~ 257 (291)
++|++|++++++..+.+
T Consensus 201 ~~qn~~Le~~~~~~Qe~ 217 (263)
T KOG3154|consen 201 EVQNEFLESQEKSPQEP 217 (263)
T ss_pred HHHHHHHHHhhhcccCC
Confidence 99999999888764443
|
|
| >COG2042 Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
| >PRK02287 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PF04034 DUF367: Domain of unknown function (DUF367); InterPro: IPR007177 This domain is found in a family of proteins of unknown function | Back alignment and domain information |
|---|
| >PF04034 DUF367: Domain of unknown function (DUF367); InterPro: IPR007177 This domain is found in a family of proteins of unknown function | Back alignment and domain information |
|---|
| >KOG3154|consensus | Back alignment and domain information |
|---|
| >COG2042 Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
| >PRK02287 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PF04068 RLI: Possible Fer4-like domain in RNase L inhibitor, RLI; InterPro: IPR007209 This is a possible metal-binding domain in endoribonuclease RNase L inhibitor | Back alignment and domain information |
|---|
| >PF03942 DTW: DTW domain; InterPro: IPR005636 This presumed domain is found in bacterial and eukaryotic proteins | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00