Psyllid ID: psy146


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250---
MSHSNIIRKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSAPKVVSTNPLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDTKLGKVGF
ccHHHHHHHHHHcccccccccccccHHHHHHHHHHHHHccccccccccHHHHcccccccHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHccHHHHHHcHHHHHHccccccccccccccccccccccHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHccHHHHccccccccccccccccccccccccccHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHcccccccccccccc
cccHHHHHHHHHHccccccccccccHHHHHHHHHHHHHcHEEEcccccHHHHcccccccHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHcccHHHHHHccHHHHcccccccccccccccccccccccHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHccHHHHHcccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccc
MSHSNIIRKLktlgyaspesfnpirekEFRNMVVWLEDQKIRIYKIEDRAelrklesphwmnsfhqyctdvglpitllstPLEQIEWLLGHAsrleyadnvdqyKSQTAEfvssnqssapkvvstnpldnlnfdspefkKGVNSLAEklkirphpnhltTLEACCKLISARLNedalahpefvipkgkpfpfleidngftlpdpvLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTvtanpktdtklgkvgf
mshsniirklktlgyaspesfnpirekefrnmVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEfvssnqssapkvVSTNPLDNLNFDSPEFKKGVNSLAEKlkirphpnhlTTLEACCKLISARLNEDALAHPEFVIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQtvtanpktdtklgkvgf
MSHSNIIRKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSAPKVVSTNPLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDTKLGKVGF
*********LKTLGYA****FNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVD**********************************************LKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVT**************
****NII*KLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYK*************HWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADN*******************************NFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARL********************LEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDTKLGKVGF
MSHSNIIRKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTA*************VSTNPLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDTKLGKVGF
***SNIIRKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVS************NPLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPK****L*****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSHSNIIRKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSAPKVVSTNPLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDTKLGKVGF
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query253 2.2.26 [Sep-21-2011]
Q5R8I2244 UPF0568 protein C14orf166 yes N/A 0.944 0.979 0.479 1e-53
Q9Y224244 UPF0568 protein C14orf166 yes N/A 0.944 0.979 0.479 1e-53
Q9CQE8244 UPF0568 protein C14orf166 yes N/A 0.948 0.983 0.473 3e-53
Q63ZS0240 UPF0568 protein C14orf166 N/A N/A 0.932 0.983 0.473 1e-51
Q7ZUH1242 UPF0568 protein C14orf166 yes N/A 0.940 0.983 0.461 2e-50
Q553E8271 UPF0568 protein OS=Dictyo yes N/A 0.885 0.826 0.257 7e-15
Q09526253 Uncharacterized protein E yes N/A 0.940 0.940 0.25 1e-09
>sp|Q5R8I2|CN166_PONAB UPF0568 protein C14orf166 homolog OS=Pongo abelii PE=2 SV=1 Back     alignment and function desciption
 Score =  209 bits (532), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 118/246 (47%), Positives = 152/246 (61%), Gaps = 7/246 (2%)

Query: 8   RKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQY 67
           RKL  L Y +P  FN   E EFRN +VWLEDQKIR YKIEDR  LR + S  W   F +Y
Sbjct: 4   RKLTALDYHNPAGFNCKDETEFRNFIVWLEDQKIRHYKIEDRGNLRNIHSSDWPKFFEKY 63

Query: 68  CTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSAPKVV-STN 126
             DV  P  +     E I+WLLG A RLEY DN ++YK    + V  N  +A     +  
Sbjct: 64  LRDVNCPFKIQDRQ-EAIDWLLGLAVRLEYGDNAEKYK----DLVPDNSKTADNATKNAE 118

Query: 127 PLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPK 186
           PL NL+ ++P+FK GV +LA  L+I+ H ++L  L+A   L+  RL +DA+A       +
Sbjct: 119 PLINLDVNNPDFKAGVMALANLLQIQRHDDYLVMLKAIRILVQERLTQDAVAKAN-QTKE 177

Query: 187 GKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDT 246
           G P    +   GF   D VLN+AA+IL LL+I +LR+LQTKINEAIVAVQ + A+PKTD 
Sbjct: 178 GLPVALDKHILGFDTGDAVLNEAAQILRLLHIEELRELQTKINEAIVAVQAIIADPKTDH 237

Query: 247 KLGKVG 252
           +LGKVG
Sbjct: 238 RLGKVG 243




Involved in modulation of mRNA transcription by Polymerase II.
Pongo abelii (taxid: 9601)
>sp|Q9Y224|CN166_HUMAN UPF0568 protein C14orf166 OS=Homo sapiens GN=C14orf166 PE=1 SV=1 Back     alignment and function description
>sp|Q9CQE8|CN166_MOUSE UPF0568 protein C14orf166 homolog OS=Mus musculus PE=2 SV=1 Back     alignment and function description
>sp|Q63ZS0|CN166_XENLA UPF0568 protein C14orf166 homolog OS=Xenopus laevis PE=2 SV=1 Back     alignment and function description
>sp|Q7ZUH1|CN166_DANRE UPF0568 protein C14orf166 homolog OS=Danio rerio GN=zgc:56576 PE=2 SV=1 Back     alignment and function description
>sp|Q553E8|U568_DICDI UPF0568 protein OS=Dictyostelium discoideum GN=DDB_G0275847 PE=2 SV=1 Back     alignment and function description
>sp|Q09526|YQN5_CAEEL Uncharacterized protein E02H1.5 OS=Caenorhabditis elegans GN=E02H1.5 PE=4 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query253
307095150245 hypothetical conserved protein [Triatoma 0.952 0.983 0.562 2e-72
156551153247 PREDICTED: UPF0568 protein C14orf166 hom 0.956 0.979 0.556 1e-70
383852573249 PREDICTED: UPF0568 protein C14orf166 hom 0.960 0.975 0.542 1e-67
350418375249 PREDICTED: UPF0568 protein C14orf166 hom 0.960 0.975 0.522 4e-64
427786363246 Putative riken cdna 2700060e02 protein [ 0.956 0.983 0.502 6e-64
380023932252 PREDICTED: UPF0568 protein C14orf166 hom 0.960 0.964 0.506 8e-64
340726572249 PREDICTED: UPF0568 protein C14orf166 hom 0.960 0.975 0.518 9e-64
307202994249 UPF0568 protein C14orf166-like protein [ 0.960 0.975 0.506 9e-64
307176766249 UPF0568 protein C14orf166-like protein [ 0.960 0.975 0.510 7e-63
332027096250 UPF0568 protein C14orf166-like protein [ 0.960 0.972 0.487 3e-59
>gi|307095150|gb|ADN29881.1| hypothetical conserved protein [Triatoma matogrossensis] Back     alignment and taxonomy information
 Score =  278 bits (710), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 139/247 (56%), Positives = 180/247 (72%), Gaps = 6/247 (2%)

Query: 8   RKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQY 67
           RKL+ LGY S ++F+    ++FR +V+WLEDQ IR YKIEDR +LR ++S  W +SF +Y
Sbjct: 4   RKLEALGYHSADTFDINDSRQFRALVLWLEDQVIRHYKIEDRDDLRNIDSKLWDSSFEKY 63

Query: 68  CTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSAPKVVSTNP 127
           C D+  PI       E +EW++G A +LEY D+  +Y  Q ++ +  ++SSAP V S+NP
Sbjct: 64  CKDLECPIAFTEQK-ECLEWVVGLAVKLEYNDDKQKYSKQVSDVIKESKSSAPVVKSSNP 122

Query: 128 LDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARL-NEDALAHPEFVIPK 186
           LDNL+F S EF+KGVN+LA+ L+I PHPNHL TLEA C LI   L +E+ L   +    +
Sbjct: 123 LDNLDFYSDEFRKGVNALADMLQITPHPNHLITLEAICTLIIQSLASENNLTSSK----E 178

Query: 187 GKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDT 246
           GKPFP +E D GF + D +LN+AAK+L LLYIHDLRDLQTKINE IVAVQT+TANPKTDT
Sbjct: 179 GKPFPIMESDLGFDMGDYILNQAAKVLRLLYIHDLRDLQTKINECIVAVQTITANPKTDT 238

Query: 247 KLGKVGF 253
           KLGKVGF
Sbjct: 239 KLGKVGF 245




Source: Triatoma matogrossensis

Species: Triatoma matogrossensis

Genus: Triatoma

Family: Reduviidae

Order: Hemiptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|156551153|ref|XP_001604251.1| PREDICTED: UPF0568 protein C14orf166 homolog [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|383852573|ref|XP_003701801.1| PREDICTED: UPF0568 protein C14orf166 homolog [Megachile rotundata] Back     alignment and taxonomy information
>gi|350418375|ref|XP_003491839.1| PREDICTED: UPF0568 protein C14orf166 homolog [Bombus impatiens] Back     alignment and taxonomy information
>gi|427786363|gb|JAA58633.1| Putative riken cdna 2700060e02 protein [Rhipicephalus pulchellus] Back     alignment and taxonomy information
>gi|380023932|ref|XP_003695763.1| PREDICTED: UPF0568 protein C14orf166 homolog [Apis florea] Back     alignment and taxonomy information
>gi|340726572|ref|XP_003401630.1| PREDICTED: UPF0568 protein C14orf166 homolog [Bombus terrestris] Back     alignment and taxonomy information
>gi|307202994|gb|EFN82210.1| UPF0568 protein C14orf166-like protein [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|307176766|gb|EFN66166.1| UPF0568 protein C14orf166-like protein [Camponotus floridanus] Back     alignment and taxonomy information
>gi|332027096|gb|EGI67192.1| UPF0568 protein C14orf166-like protein [Acromyrmex echinatior] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query253
UNIPROTKB|F1NEI4244 C5H14orf166 "Uncharacterized p 0.948 0.983 0.477 4.4e-52
UNIPROTKB|Q3T0S7245 C10H14ORF166 "Uncharacterized 0.952 0.983 0.477 6.5e-51
UNIPROTKB|Q9Y224244 C14orf166 "UPF0568 protein C14 0.944 0.979 0.479 1.1e-50
UNIPROTKB|F1SFF4244 C14orf166 "Uncharacterized pro 0.948 0.983 0.481 1.1e-50
UNIPROTKB|Q5R8I2244 Q5R8I2 "UPF0568 protein C14orf 0.944 0.979 0.479 1.1e-50
MGI|MGI:1915295244 2700060E02Rik "RIKEN cDNA 2700 0.948 0.983 0.481 1.1e-50
UNIPROTKB|Q63ZS0240 Q63ZS0 "UPF0568 protein C14orf 0.932 0.983 0.473 2.3e-48
ZFIN|ZDB-GENE-040426-1082242 zgc:56576 "zgc:56576" [Danio r 0.940 0.983 0.461 4.7e-48
RGD|1304704296 RGD1304704 "similar to Hypothe 0.889 0.760 0.462 8.6e-47
FB|FBgn0051249250 CG31249 [Drosophila melanogast 0.940 0.952 0.438 9.1e-45
UNIPROTKB|F1NEI4 C5H14orf166 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 117/245 (47%), Positives = 154/245 (62%)

Query:     8 RKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQY 67
             RKL  L Y +P  FN   E EFRN +VWLEDQKIR YKIEDR  LR + S  W  SF +Y
Sbjct:     4 RKLSALDYHNPGGFNCRDETEFRNFIVWLEDQKIRHYKIEDRGNLRNIHSDDWPKSFEKY 63

Query:    68 CTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSAPKVVSTNP 127
               DV  P  +     E ++WLLG A RLEY DN D+YK  T +   +  ++A    +  P
Sbjct:    64 MKDVNCPFKMQERQ-ETVDWLLGLAVRLEYGDNADKYKDSTPDGAKNTDNTAK---NAEP 119

Query:   128 LDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALAHPEFVIPKG 187
             L NL+ ++P+FK GV +LA  L+I+ H ++L  L+A   L+  RL +DA+A       +G
Sbjct:   120 LINLDVNNPDFKAGVMALANLLQIQRHDDYLVMLKAIRILVQERLTQDAIAKAN-QSKEG 178

Query:   188 KPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANPKTDTK 247
              P    +   GF   D V+N+AA+IL LL+I +LR+LQTKINEAIVAVQ + A+PKTD +
Sbjct:   179 LPVALEKHILGFDTGDAVINEAAQILRLLHIEELRELQTKINEAIVAVQAIIADPKTDHR 238

Query:   248 LGKVG 252
             LGKVG
Sbjct:   239 LGKVG 243




GO:0000993 "RNA polymerase II core binding" evidence=IEA
GO:0005634 "nucleus" evidence=IEA
GO:0042802 "identical protein binding" evidence=IEA
GO:0045944 "positive regulation of transcription from RNA polymerase II promoter" evidence=IEA
GO:0072669 "tRNA-splicing ligase complex" evidence=IEA
UNIPROTKB|Q3T0S7 C10H14ORF166 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q9Y224 C14orf166 "UPF0568 protein C14orf166" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1SFF4 C14orf166 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|Q5R8I2 Q5R8I2 "UPF0568 protein C14orf166 homolog" [Pongo abelii (taxid:9601)] Back     alignment and assigned GO terms
MGI|MGI:1915295 2700060E02Rik "RIKEN cDNA 2700060E02 gene" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|Q63ZS0 Q63ZS0 "UPF0568 protein C14orf166 homolog" [Xenopus laevis (taxid:8355)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-040426-1082 zgc:56576 "zgc:56576" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
RGD|1304704 RGD1304704 "similar to Hypothetical protein CGI-99" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
FB|FBgn0051249 CG31249 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q553E8U568_DICDINo assigned EC number0.25770.88530.8265yesN/A
Q9Y224CN166_HUMANNo assigned EC number0.47960.94460.9795yesN/A
Q09526YQN5_CAEELNo assigned EC number0.250.94070.9407yesN/A
Q7ZUH1CN166_DANRENo assigned EC number0.46120.94070.9834yesN/A
Q5R8I2CN166_PONABNo assigned EC number0.47960.94460.9795yesN/A
Q9CQE8CN166_MOUSENo assigned EC number0.47340.94860.9836yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query253
pfam10036245 pfam10036, RLL, Putative carnitine deficiency-asso 4e-80
>gnl|CDD|220534 pfam10036, RLL, Putative carnitine deficiency-associated protein Back     alignment and domain information
 Score =  240 bits (615), Expect = 4e-80
 Identities = 119/251 (47%), Positives = 154/251 (61%), Gaps = 12/251 (4%)

Query: 7   IRKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQ 66
            RKL  LGY +P+  N    KEFR++++WLEDQKIR+YKIEDR +LR ++S  W  +  +
Sbjct: 3   RRKLAALGYPNPDLLNIDDPKEFRSLILWLEDQKIRLYKIEDREKLRNVDSADWTEALEK 62

Query: 67  YCTDVGLPITLLSTPLEQIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSA-PKVVST 125
           Y  D+ +PI L  T  EQ++WLL  A RLEY DN     S   +           +  S 
Sbjct: 63  YLKDLEIPIDL-KTRQEQLDWLLSLAIRLEYKDNTQYDLSGARDLPEEESKRTKLQDDSA 121

Query: 126 NPLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTLEACCKLISARLNEDALA---HPEF 182
           N    L+F SPEF++GV +LA+ L I  HP+HL  L A CK+I  RL+ +A A       
Sbjct: 122 NDN--LDFTSPEFEEGVRALAKLLGIAHHPDHLVLLLAACKVIQERLSTEAKAENPIEGK 179

Query: 183 VIPKGKPFPFLEIDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTANP 242
             P+ K FP      GF + D  LN+AA+IL LL+I  LR+LQTKINE +VAVQ +TANP
Sbjct: 180 PFPEDKLFPL-----GFDVGDAELNEAARILRLLHIEQLRELQTKINETLVAVQNLTANP 234

Query: 243 KTDTKLGKVGF 253
           KTDTKLGKVG 
Sbjct: 235 KTDTKLGKVGR 245


This family of proteins conserved from nematodes to humans is of approximately 250 amino acids. It is purported to be carnitine deficiency-associated protein but this could not be confirmed. It carries a characteristic RLL sequence-motif. The function is unknown. Length = 245

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 253
PF10036249 RLL: Putative carnitine deficiency-associated prot 100.0
KOG4380|consensus244 100.0
PF10239318 DUF2465: Protein of unknown function (DUF2465); In 96.67
KOG3973|consensus 465 91.89
PF03801157 Ndc80_HEC: HEC/Ndc80p family; InterPro: IPR005550 80.78
>PF10036 RLL: Putative carnitine deficiency-associated protein; InterPro: IPR019265 This family of proteins conserved from nematodes to humans is of approximately 250 amino acids Back     alignment and domain information
Probab=100.00  E-value=1.3e-95  Score=657.21  Aligned_cols=245  Identities=55%  Similarity=0.882  Sum_probs=226.2

Q ss_pred             hHHHHHhhhCCCCCCCCCCCCHHHHHHHHHHHhhchhccCCccchhhhcCCCCccHHHHHHHHHhhcCCCCCCCCChHHH
Q psy146            5 NIIRKLKTLGYASPESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLEQ   84 (253)
Q Consensus         5 m~~rkL~aL~Y~~~~~~n~~d~~efr~lI~WLEdqkIR~y~iedR~~Lr~i~s~~W~~~~~kYl~dl~cP~~~~~~~~~~   84 (253)
                      ||+|||+|||||++++||++|++|||+||+|||||||||||||+|++||+++|++|+++|+|||+|++||++..+ ++++
T Consensus         1 m~~rkL~aL~Yp~~~~~n~~d~~~fr~lVvWLEDqKIR~Y~iedR~~LR~i~s~~W~~~~~kYl~dl~cP~~~~~-~~~~   79 (249)
T PF10036_consen    1 MFRRKLKALGYPKPDSFNIDDEEEFRSLVVWLEDQKIRHYKIEDREKLRNIDSSDWPKAFEKYLKDLGCPFSSES-RQEQ   79 (249)
T ss_pred             CcHHHHHHcCCCCCCCCCCCCHHHHHHHHHHHhhhhhccCCHhhHHHHhcCCcchHHHHHHHHHHhcCCCCcchh-HHHH
Confidence            999999999999999999999999999999999999999999999999999999999999999999999998888 9999


Q ss_pred             HHHHHHHHHHHHHhhchHHhhhhhHHhh---hhcCCCCCCccCCCCCCCCCCCChHHHHHHHHHHHHhCCCCCCChhHHH
Q psy146           85 IEWLLGHASRLEYADNVDQYKSQTAEFV---SSNQSSAPKVVSTNPLDNLNFDSPEFKKGVNSLAEKLKIRPHPNHLTTL  161 (253)
Q Consensus        85 ldWLl~~AV~leY~D~~~~y~~~~~~~~---~~~~~~~~~~~~~~~l~~ld~~s~~f~~gv~~La~~L~I~~hpD~lv~L  161 (253)
                      ||||||+||++||+|++  |....+...   .+.+...+..++++|++++||+|++|++||++||++||||+||||+|+|
T Consensus        80 ldWLL~~AV~leY~Dn~--~~~~~~~~~~~~~~~~~~~~~~~~~~pl~~ld~~s~~f~~gv~~La~lL~i~~h~d~~v~l  157 (249)
T PF10036_consen   80 LDWLLGLAVRLEYKDNA--YDDDSSYKSAKEEQNKETSPQCKSTNPLDNLDFNSPEFKAGVRALASLLQIPRHPDHLVTL  157 (249)
T ss_pred             HHHHHHHHHHHHHhccc--cccccccccchhhhcccccccccccCcccccCCCCHHHHHHHHHHHHHhCCCCCchHHHHH
Confidence            99999999999999999  222222211   1122356677899999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHhcChhhhcCCCcCCCCCCCCCccc-CCCCCCCCchhHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHhhhc
Q psy146          162 EACCKLISARLNEDALAHPEFVIPKGKPFPFLE-IDNGFTLPDPVLNKAAKILNLLYIHDLRDLQTKINEAIVAVQTVTA  240 (253)
Q Consensus       162 ~A~~~~i~~~l~~~a~~~~~~~~~~g~~~~l~~-~~lGf~~~D~~l~~aa~iLRLL~I~~LR~LQt~INe~IVaVQ~lTA  240 (253)
                      +|||++|+++|++++++++..+. .|.|+.... |++||++||++|++||+|||||||++||+|||+|||+||+||+|||
T Consensus       158 ~a~~~~i~~~l~~~~~~~~~~~~-~~~~~~~~~~~~lG~~~~d~~~~~aa~iLRlL~i~~LR~lQt~Ine~iv~vQ~~Ta  236 (249)
T PF10036_consen  158 KAACKLIKEKLSPEAISEANIEQ-SGKPFLDDKLFPLGFDVKDPALNEAARILRLLHIEDLRELQTKINETIVAVQNLTA  236 (249)
T ss_pred             HHHHHHHHHHhchhhhccccccc-cCCcccccccCCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            99999999999999999877665 488888666 9999999999999999999999999999999999999999999999


Q ss_pred             CCccccccccCCC
Q psy146          241 NPKTDTKLGKVGF  253 (253)
Q Consensus       241 dPkTD~kLGKVGr  253 (253)
                      |||||+|||||||
T Consensus       237 dPktD~~LGkVGr  249 (249)
T PF10036_consen  237 DPKTDTKLGKVGR  249 (249)
T ss_pred             CCcccccccccCC
Confidence            9999999999998



It is purported to be carnitine deficiency-associated protein but this could not be confirmed. It carries a characteristic RLL sequence-motif. The function is unknown.

>KOG4380|consensus Back     alignment and domain information
>PF10239 DUF2465: Protein of unknown function (DUF2465); InterPro: IPR018797 FAM98A, B and C are glycine-rich proteins found from worms to humans whose function is unknown Back     alignment and domain information
>KOG3973|consensus Back     alignment and domain information
>PF03801 Ndc80_HEC: HEC/Ndc80p family; InterPro: IPR005550 Members of this family are components of the mitotic spindle Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query253
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-05
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 8e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 44.5 bits (104), Expect = 2e-05
 Identities = 42/253 (16%), Positives = 80/253 (31%), Gaps = 68/253 (26%)

Query: 29  FRNMV-VWLED--QKIRIYKIED--RAELRKLESPHWMNSFHQYCTDVGLPITLLSTPLE 83
           +++++ V+ +          ++D  ++ L K E  H + S       + L  TLLS   E
Sbjct: 18  YKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQEE 77

Query: 84  QIEWLLGHASRLEYADNVDQYKSQTAEFVSSNQSSAPKVVSTNPLDNLNFDSPEFKKG-- 141
            ++  +    R+ Y   +   K++        Q S    +     D L  D+  F K   
Sbjct: 78  MVQKFVEEVLRINYKFLMSPIKTEQ------RQPSMMTRMYIEQRDRLYNDNQVFAKYNV 131

Query: 142 -----VNSLAEKL-KIRPHPN---H---------LTTLEACCKLISARLNEDALAHPEFV 183
                   L + L ++RP  N             +  L+ C       L+       +F 
Sbjct: 132 SRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWV-ALDVC-------LSYKVQCKMDFK 183

Query: 184 I------PKGKPFPFLEI----------------DNGFTLPDPVLNKAAKILNLLYIHD- 220
           I          P   LE+                D+   +   + +  A++  LL     
Sbjct: 184 IFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPY 243

Query: 221 ------LRDLQTK 227
                 L ++Q  
Sbjct: 244 ENCLLVLLNVQNA 256


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query253
2igp_A120 Retinoblastoma-associated protein HEC; calponin ho 88.71
>2igp_A Retinoblastoma-associated protein HEC; calponin homology (CH) domain, alpha helices, cell cycle; 1.80A {Homo sapiens} Back     alignment and structure
Probab=88.71  E-value=2.2  Score=33.59  Aligned_cols=77  Identities=19%  Similarity=0.364  Sum_probs=55.8

Q ss_pred             HHHhhhCCCCC---CCCCCCCHHHHHHHHHHHhhchhccCCccchhhhcCCCCccHHHHHHHHHhhcCCCCCCCC-----
Q psy146            8 RKLKTLGYASP---ESFNPIREKEFRNMVVWLEDQKIRIYKIEDRAELRKLESPHWMNSFHQYCTDVGLPITLLS-----   79 (253)
Q Consensus         8 rkL~aL~Y~~~---~~~n~~d~~efr~lI~WLEdqkIR~y~iedR~~Lr~i~s~~W~~~~~kYl~dl~cP~~~~~-----   79 (253)
                      .-|..-||+.+   ..+.--.-++|.+++.||=.+--..|+..         ...+++.+-..+++|+||+....     
T Consensus        25 ~fL~~~~~~~~is~k~L~~Pt~kdF~~if~fL~~~idp~~~~~---------~~~~eeev~~~lK~L~YP~~isKS~L~a   95 (120)
T 2igp_A           25 EFLTENGYAHNVSMKSLQAPSVKDFLKIFTFLYGFLCPSYELP---------DTKFEEEVPRIFKDLGYPFALSKSSMYT   95 (120)
T ss_dssp             HHHHHTTCSSCCCTTTTSSCCHHHHHHHHHHHHTTTCTTCCCC---------CTTHHHHHHHHHHHTTCCSCCCHHHHHT
T ss_pred             HHHHHcCCCCCCCccccCCCCHHHHHHHHHHHHHhcCCCCccC---------CCChHHHHHHHHHHcCCCceeCHHHhcC
Confidence            45777888752   23444467899999999987655555442         46799999999999999997521     


Q ss_pred             ---C-----hHHHHHHHHHHHH
Q psy146           80 ---T-----PLEQIEWLLGHAS   93 (253)
Q Consensus        80 ---~-----~~~~ldWLl~~AV   93 (253)
                         .     --.+|.||.-++.
T Consensus        96 ~g~pHsWp~~L~~L~WLvel~~  117 (120)
T 2igp_A           96 VGAPHTWPHIVAALVWLIDCIK  117 (120)
T ss_dssp             TTSTTTHHHHHHHHHHHHHHHH
T ss_pred             CCCCccHHHHHHHHHHHHHHHH
Confidence               0     5678999987763




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00