Citrus Sinensis ID: 041780


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420--
MEKTVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIYKQPALDHPALKNHTIQLSPSFDIPAEKVDRRNESSRLPVTIQTWQKSGSCPNGTVPIRRIRRQDLLRASSLQQFGRKVPEVSYAANRTDATHSKFESINNKTIHLGPLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTAMGSGSFSHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNPICK
cccHHHHHHHHHHHHHHcccEEEcccccccHHHHHHHHHHccccccEEEEcccccEEEEEEccccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccccEEcccccHHcccHHHHccccccccccccccccccccccccccccccccccccccEEEEEEEEccEEEEEEEEEEEEcccccccccccEEEEEHHccccccccEEEEcEEEcccccccccEEEEEEEEEcccccccccccccccEEEcccccccccccccccccccEEEEEEEEEEccccccEEEEEcccEEEEEcccccccccccccEEEEEEEEEEcccccccccccccccccccccccccccEEEEEEEEEccccccccccccEEEccccccEEEEEcccccccccEEEEEccccccccc
ccHHHHHHEEHHHHHHHccccccccHcHHHHHHHHHHHHHHccccEEEEEcccccEEEEEccccccccccccHcccccccccccccccccccccccccccccccEEEccccccccccEEEEEccHHHHHcHHHHHHHcccccccccccccccccccccccccccccccccccccEEEEEEEcccEEEEEEEEEEEccccccccccccHEEEEEEccccccccEEEEEEEEcHHHccccccEEEEEEEcccccccccEccccccEEEEEEEEEccEEcccccccccEEEEEEEEEEccccccEEEEEcccEEEEcccHHHHHHHHccccEEEEccEEEccccccccccccccccccccccccccEEEEEEEEEEEccccccccHHcEEEccccccEEEEEccccccccEEEEEcccccccccc
MEKTVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKllnrpavksiksedgdiidcvdiykqpaldhpalknhtiqlspsfdipaekvdrrnessrlpvTIQTwqksgscpngtvpirRIRRQDLLRASSlqqfgrkvpevsyaanrtdathskfesinnktihlgplvdrSAAVLVTVGFnyigaqgdinvwnprvespddyttAQIWlkggpgdnfesveggwvvnpklygdklTRLFVYWTkdsykstgcfdaiCSGFVQTGQVALGaaigpwsisegpqyylpvgiyldpntknwwlKVNGNIVvgywpgslFSYLSYSAILVEwggqvyspnvkktphtktamgsgsfshglqgsacsiehVRIIDFSLqlkypqwvgtwadeyycydaynyvkgyttepvfffggpgqnpick
MEKTVYLFLLFGAISTLCNGVEVNAKarktlpeidrklkllnrpavksiksedgdiiDCVDIYKQPALDHPALKNHTIQLSPSFDIPAEKVDRRNESSRLpvtiqtwqksgscpngtvpIRRIRRQDLLRasslqqfgrkvpevsyaanrtdathskfesinnktihlgpLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTAMGSGSFSHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNPICK
MEKTVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIYKQPALDHPALKNHTIQLSPSFDIPAEKVDRRNESSRLPVTIQTWQKSGSCPNGTVPIRRIRRQDLLRASSLQQFGRKVPEVSYAANRTDATHSKFESINNKTIHLGPLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTAMGSGSFSHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNPICK
***TVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIYKQPALDHPALKNHTI************************TIQTWQ*******GTVPIRRIRRQDLL****************Y**********KFESINNKTIHLGPLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNV****************HGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGG********
**KTVYLFLLFGAISTLCNG*******************LLNRPAVKSIKSEDGDIIDCVDIYKQPALDH*A**********************************WQKSGSCPNGTVPIRRIR******************************HSKFESINNKTIHLGPLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKK*****TA**SGSFSHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNP***
MEKTVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIYKQPALDHPALKNHTIQLSPSFDIPAEKVDRRNESSRLPVTIQTWQKSGSCPNGTVPIRRIRRQDLLRASSLQQFGRKVPEVSYAANRTDATHSKFESINNKTIHLGPLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTAMGSGSFSHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNPICK
*EKTVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIYKQPALDHPALKNHTIQLSPS*****************PVTIQTWQKSGSCPNGTVPIRRIRRQDLLRASSLQQFG****************************HLGPLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTA*GSGSFSHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNPIC*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHiiii
SSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEKTVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIYKQPALDHPALKNHTIQLSPSFDIPAEKVDRRNESSRLPVTIQTWQKSGSCPNGTVPIRRIRRQDLLRASSLQQFGRKVPEVSYAANRTDATHSKFESINNKTIHLGPLVDRSAAVLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGFVQTGQVALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTAMGSGSFSHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNPICK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query422
449447325409 PREDICTED: uncharacterized protein LOC10 0.943 0.973 0.538 1e-127
449517126409 PREDICTED: uncharacterized protein LOC10 0.943 0.973 0.536 1e-126
388514999416 unknown [Lotus japonicus] 0.947 0.961 0.556 1e-119
357465653438 hypothetical protein MTR_3g104000 [Medic 0.947 0.913 0.565 1e-119
224088948418 predicted protein [Populus trichocarpa] 0.978 0.988 0.574 1e-119
357465649435 hypothetical protein MTR_3g103980 [Medic 0.954 0.926 0.550 1e-119
357465663434 hypothetical protein MTR_3g104070 [Medic 0.959 0.933 0.531 1e-118
357465657424 Carboxyl-terminal proteinase [Medicago t 0.969 0.964 0.558 1e-118
357496467410 hypothetical protein MTR_6g012620 [Medic 0.905 0.931 0.560 1e-115
297812809413 hypothetical protein ARALYDRAFT_351609 [ 0.969 0.990 0.516 1e-113
>gi|449447325|ref|XP_004141419.1| PREDICTED: uncharacterized protein LOC101213587 [Cucumis sativus] Back     alignment and taxonomy information
 Score =  460 bits (1183), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 229/425 (53%), Positives = 292/425 (68%), Gaps = 27/425 (6%)

Query: 2   EKTVYLFLLFGAIS--TLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDC 59
           ++ V +  + G +    + N    N +   +L +I+ KLKLLN+P++K+I SEDGDI++C
Sbjct: 6   DEVVMVLAMVGLVVGVAIVNVNATNVEMNLSLLQIENKLKLLNKPSIKTIYSEDGDIMEC 65

Query: 60  VDIYKQPALDHPALKNHTIQLSPSFDIPAEKVDRRNES--SRLPVTIQTWQKSGSCPNGT 117
           VD+YKQPA DHP LKNHTIQ+ P   I  +    +NES  SRL    QTWQKSGSCP GT
Sbjct: 66  VDVYKQPAFDHPLLKNHTIQMKPDLSIDLKMSSTQNESFGSRL-YPFQTWQKSGSCPKGT 124

Query: 118 VPIRRIRRQDLLRASSLQQFGRKVPEVSYAANRTDATHSKFESINNKTIHLGPLVDRSAA 177
           +PIRR+ R+DLLRA+SL  FG+K P   Y  ++                 LG   +RS A
Sbjct: 125 IPIRRVGREDLLRANSLHHFGKKFP---YGDSK-----------------LGQEFNRSTA 164

Query: 178 VLVTVGFNYIGAQGDINVWNPRVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGD 237
           +L+T G NYIGA G+INVWNP+V+ P+D+T +++WLK GP + FESVE GW+VNPKLYGD
Sbjct: 165 ILITTGVNYIGASGNINVWNPKVDLPNDFTASKVWLKNGPSEKFESVEAGWMVNPKLYGD 224

Query: 238 KLTRLFVYWTKDSYKSTGCFDAICSGFVQTG-QVALGAAIGPWSISEGPQYYLPVGIYLD 296
             TRL +YWT DSYK+TGCFD  CSGFVQT   VA+GA I P S + G QY + +GI+ D
Sbjct: 225 AKTRLSLYWTVDSYKTTGCFDLTCSGFVQTNPSVAIGAVIEPLSSTNGQQYTISIGIFQD 284

Query: 297 PNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTAMGSGSF 356
           P++ NWWLK  GN  VGYWP +LF YL +SA LVEWGG+V+S N+K  PHT T MGSG +
Sbjct: 285 PHSGNWWLKFQGN-PVGYWPSTLFGYLDHSATLVEWGGEVFSSNIKIVPHTGTGMGSGDY 343

Query: 357 SHGLQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPG 416
           + GL   A  ++  RI+D+SLQLKYP  VGTWADE  CY   NY + YTTEPVF+FGGPG
Sbjct: 344 ASGLYEYASFVKEPRIVDYSLQLKYPDRVGTWADEPSCYSVDNYQQSYTTEPVFYFGGPG 403

Query: 417 QNPIC 421
            +  C
Sbjct: 404 LSRDC 408




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449517126|ref|XP_004165597.1| PREDICTED: uncharacterized protein LOC101231919 [Cucumis sativus] Back     alignment and taxonomy information
>gi|388514999|gb|AFK45561.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|357465653|ref|XP_003603111.1| hypothetical protein MTR_3g104000 [Medicago truncatula] gi|355492159|gb|AES73362.1| hypothetical protein MTR_3g104000 [Medicago truncatula] Back     alignment and taxonomy information
>gi|224088948|ref|XP_002308584.1| predicted protein [Populus trichocarpa] gi|222854560|gb|EEE92107.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357465649|ref|XP_003603109.1| hypothetical protein MTR_3g103980 [Medicago truncatula] gi|355492157|gb|AES73360.1| hypothetical protein MTR_3g103980 [Medicago truncatula] Back     alignment and taxonomy information
>gi|357465663|ref|XP_003603116.1| hypothetical protein MTR_3g104070 [Medicago truncatula] gi|355492164|gb|AES73367.1| hypothetical protein MTR_3g104070 [Medicago truncatula] Back     alignment and taxonomy information
>gi|357465657|ref|XP_003603113.1| Carboxyl-terminal proteinase [Medicago truncatula] gi|355492161|gb|AES73364.1| Carboxyl-terminal proteinase [Medicago truncatula] Back     alignment and taxonomy information
>gi|357496467|ref|XP_003618522.1| hypothetical protein MTR_6g012620 [Medicago truncatula] gi|355493537|gb|AES74740.1| hypothetical protein MTR_6g012620 [Medicago truncatula] Back     alignment and taxonomy information
>gi|297812809|ref|XP_002874288.1| hypothetical protein ARALYDRAFT_351609 [Arabidopsis lyrata subsp. lyrata] gi|297320125|gb|EFH50547.1| hypothetical protein ARALYDRAFT_351609 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query422
TAIR|locus:2180647432 AT5G25950 "AT5G25950" [Arabido 0.969 0.946 0.521 8.4e-113
TAIR|locus:2180542352 AT5G25960 [Arabidopsis thalian 0.563 0.676 0.542 1.8e-101
TAIR|locus:2050579402 AT2G44240 [Arabidopsis thalian 0.552 0.579 0.424 2.5e-83
TAIR|locus:2167121420 AT5G56530 "AT5G56530" [Arabido 0.938 0.942 0.403 1.3e-82
TAIR|locus:2050528403 AT2G44220 [Arabidopsis thalian 0.606 0.635 0.392 2.5e-79
TAIR|locus:2050487445 AT2G44210 [Arabidopsis thalian 0.850 0.806 0.410 1.1e-78
TAIR|locus:2035666422 AT1G55360 [Arabidopsis thalian 0.874 0.874 0.395 2.7e-77
TAIR|locus:2092870419 AT3G13510 [Arabidopsis thalian 0.931 0.937 0.374 9e-77
TAIR|locus:2026815465 AT1G70550 [Arabidopsis thalian 0.568 0.516 0.392 6.8e-71
TAIR|locus:2019933467 AT1G10750 [Arabidopsis thalian 0.573 0.518 0.391 8.6e-71
TAIR|locus:2180647 AT5G25950 "AT5G25950" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1113 (396.9 bits), Expect = 8.4e-113, P = 8.4e-113
 Identities = 220/422 (52%), Positives = 281/422 (66%)

Query:     4 TVYLFLLFGAISTLCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIY 63
             T+  FL+F     LC G    A  + +L +ID KLK LN+PA+K+IKSEDGDIIDC+DIY
Sbjct:    19 TMRFFLIF---VILC-GFYNEAYGKGSL-DIDLKLKALNKPALKTIKSEDGDIIDCIDIY 73

Query:    64 KQPALDHPALKNHTIQLSPSFDIPAEKVDRRNESSRLPVTIQTWQKSGSCPNGTVPIRRI 123
             KQ A DHPALKNH IQ+ PS     +K    N  S   +  Q W KSG CP GT+P+RR+
Sbjct:    74 KQHAFDHPALKNHKIQMKPSVKFGTKKTTIPNNGSSEHIKSQIWSKSGKCPMGTIPVRRV 133

Query:   124 RRQDLLRASSLQQFGRKVPEVSYAANRTDATHS-KFESINNKTIHLGPLVDRSAAVLVTV 182
              R+D+ RASS   FGRK P   Y+       H   F     K     P + RS A +V +
Sbjct:   134 SREDISRASSPSHFGRKTPH-KYSFLDNALQHKGNFNITPAKINEAQPRL-RSEAFIVAL 191

Query:   183 GFNYIGAQGDINVWNP-RVESPDDYTTAQIWLKGGPGDNFESVEGGWVVNPKLYGDKLTR 241
             GFN++GAQ DIN+WNP RVE+ D Y+TAQIWL GG  +NFESVEGGW+VNP ++GD  TR
Sbjct:   192 GFNFVGAQSDINIWNPPRVEATD-YSTAQIWLVGGLSENFESVEGGWMVNPAVFGDSRTR 250

Query:   242 LFVYWTKDSYKSTGCFDAICSGFVQTGQ-VALGAAIGPWSISEGPQYYLPVGIYLDPNTK 300
             LF+ WT D Y  TGC + +C+GFVQT +  ALGA + P S S   QY++ V I+LDPN+ 
Sbjct:   251 LFISWTTDGYTKTGCINLLCAGFVQTSKKFALGATVEPVSSSSSTQYHITVSIFLDPNSG 310

Query:   301 NWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVK-KTPHTKTAMGSGSFSHG 359
             NWWL    N V+GYWPG+LF+YL +SA  V+WGG+V+SPNV  K PHT TAMGSG ++  
Sbjct:   311 NWWLTCENN-VLGYWPGTLFNYLKHSATAVQWGGEVHSPNVVLKKPHTTTAMGSGQWASY 369

Query:   360 LQGSACSIEHVRIIDFSLQLKYPQWVGTWADEYYCYDAYNYVKGYTTEPVFFFGGPGQNP 419
             +   AC   ++RI D+S+QLKYPQ++  +ADEY CY    + K Y +EP F+FGGPG+N 
Sbjct:   370 IWAEACFHTNLRIKDYSMQLKYPQFLSEYADEYNCYSTLLHRKTYMSEPHFYFGGPGRNS 429

Query:   420 IC 421
              C
Sbjct:   430 RC 431




GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2180542 AT5G25960 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2050579 AT2G44240 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2167121 AT5G56530 "AT5G56530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2050528 AT2G44220 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2050487 AT2G44210 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2035666 AT1G55360 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2092870 AT3G13510 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026815 AT1G70550 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2019933 AT1G10750 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query422
pfam03080230 pfam03080, DUF239, Domain of unknown function (DUF 2e-84
pfam14365117 pfam14365, DUF4409, Domain of unknown function (DU 9e-36
>gnl|CDD|217358 pfam03080, DUF239, Domain of unknown function (DUF239) Back     alignment and domain information
 Score =  257 bits (659), Expect = 2e-84
 Identities = 102/234 (43%), Positives = 138/234 (58%), Gaps = 8/234 (3%)

Query: 186 YIGAQGDINVWNPRVESPDDYTTAQIWLKGGP-GDNFESVEGGWVVNPKLYGDKLTRLFV 244
           Y GAQ  INV+NP+V++P+ ++ +QIW+  GP   +  S++ GW VNP LYGD  TRLF 
Sbjct: 1   YYGAQATINVYNPKVQNPNQFSLSQIWIVNGPENGDLNSIQAGWQVNPSLYGDNRTRLFT 60

Query: 245 YWTKDSYKSTGCFDAICSGFVQTGQ-VALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWW 303
           YWT D Y  TGC++ +C GFVQ    +ALG AI P S+  G QY + + I+ DP T NWW
Sbjct: 61  YWT-DGYNKTGCYNLLCPGFVQVSSKIALGGAISPVSVYGGKQYDITLLIWKDPKTGNWW 119

Query: 304 LKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKKTPHTKTAMGSGSFSHGLQGS 363
           L    N V+GYWP SLF++LS  A +VEWGG+VYSP      H    MGSG F     G 
Sbjct: 120 LTYGNNTVIGYWPSSLFTHLSSGATMVEWGGEVYSP---PGGHPSPPMGSGHFPEEGFGK 176

Query: 364 ACSIEHVRIIDFSLQLKYPQWVG--TWADEYYCYDAYNYVKGYTTEPVFFFGGP 415
           A    +++++D   +L  P      T+AD   CY+  +          F++GGP
Sbjct: 177 AAYFRNIKVVDEDNKLVKPDEDKLETFADSPNCYNVKDGGYTGEWGHAFYYGGP 230


This is a family of plant and bacterial proteins, a small number of which are putative carboxy-terminal peptidases. Length = 230

>gnl|CDD|206533 pfam14365, DUF4409, Domain of unknown function (DUF4409) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 422
PF03080229 DUF239: Domain of unknown function (DUF239); Inter 100.0
PF14365117 DUF4409: Domain of unknown function (DUF4409) 100.0
>PF03080 DUF239: Domain of unknown function (DUF239); InterPro: IPR004314 This is a family of plant proteins, a small number of which are putative peptidases (see for example Q9XIN9 from SWISSPROT) Back     alignment and domain information
Probab=100.00  E-value=4.7e-83  Score=612.56  Aligned_cols=225  Identities=46%  Similarity=0.912  Sum_probs=214.4

Q ss_pred             EeeeEEEEeeecCCcCCCCCceeEEEEeccCC-CCCcceEEEEEEEcccccCCCceEEEEEEeecCcccCcccCCCCCce
Q 041780          186 YIGAQGDINVWNPRVESPDDYTTAQIWLKGGP-GDNFESVEGGWVVNPKLYGDKLTRLFVYWTKDSYKSTGCFDAICSGF  264 (422)
Q Consensus       186 ~~Ga~a~inV~~p~v~~~~q~S~sqiwv~~g~-~~~~n~IeaGW~V~P~lYgD~~~rlf~yWt~d~y~~tGCyNl~CpGF  264 (422)
                      |||++|+||||+|+|+.++|||++||||++++ .+.+|+|||||+|+|+||||++||||+|||+|+|++||||||+||||
T Consensus         1 y~G~~a~i~v~~p~v~~~~q~S~~~i~i~~g~~~~~~~~i~~GW~V~P~lygd~~~~lf~~wt~d~~~~tgCyN~~CpGF   80 (229)
T PF03080_consen    1 YYGARATISVWNPKVQQPDQFSLSQIWISNGSDDDSLNSIEAGWQVYPSLYGDSRTRLFVYWTADGYQKTGCYNLDCPGF   80 (229)
T ss_pred             CeeeEEEEECcCCccCCccceeheeEEEEecCCCCCCcEEEEeeeccccccCCCceEEEEEEEccCCCCcceeCCCCCcE
Confidence            79999999999999997779999999999999 78899999999999999999999999999999999999999999999


Q ss_pred             EEecc-cccCccccccccCCCceeEEeEEEEEecCCCCeEEEecCceeeeeecchhchhcccCceEEEEeeEEecCCCCC
Q 041780          265 VQTGQ-VALGAAIGPWSISEGPQYYLPVGIYLDPNTKNWWLKVNGNIVVGYWPGSLFSYLSYSAILVEWGGQVYSPNVKK  343 (422)
Q Consensus       265 VQvs~-i~lG~~i~pvS~~~G~q~~i~i~I~kD~~tgnWWL~~~~~~~IGYwP~sLF~~l~~~A~~V~wGGeV~~~~~~~  343 (422)
                      ||+++ |+||++|+|+|+++|+|++|+|+|+||+.+|||||+++++ .|||||++||++|+++|+.|+|||||++++   
T Consensus        81 Vq~s~~i~~G~~~~~~S~~gG~q~~i~~~i~kD~~~gnWWL~~~~~-~IGYwP~sLF~~l~~~A~~v~wGGeV~~~~---  156 (229)
T PF03080_consen   81 VQVSSSIALGAAISPVSTYGGKQYEITLSIFKDPKSGNWWLYYGGE-PIGYWPKSLFTSLADGATEVEWGGEVYSPP---  156 (229)
T ss_pred             EEeCCccccceeeCCCccCCCceEEEEEEEEecCCCccEEEEEecc-eeeeehHHhhhhhhcCceEEEEEEEEeCCC---
Confidence            99999 9999999999999999999999999999999999999886 899999999999999999999999999985   


Q ss_pred             CCCCCCCCCCCcCCCCCCCccEEEeecEEEcCCCCccCCC--cceecccCCcceeEeecccCcCcCCeEEEeCC
Q 041780          344 TPHTKTAMGSGSFSHGLQGSACSIEHVRIIDFSLQLKYPQ--WVGTWADEYYCYDAYNYVKGYTTEPVFFFGGP  415 (422)
Q Consensus       344 ~~~~sp~MGSG~fp~~g~~~Aay~~ni~ivd~~~~~~~p~--~~~~~~d~~~CY~v~~~~~~~~~g~~f~yGGP  415 (422)
                      ++|++|||||||||++++++|||||||+++|.++....+.  .+++++|+|+||++..... .+||.+||||||
T Consensus       157 ~~~~sppMGSG~fp~~g~~~aAy~~~i~~~d~~~~~~~~~~~~~~~~~~~~~CY~~~~~~~-~~~g~~f~yGGP  229 (229)
T PF03080_consen  157 GRHTSPPMGSGHFPSEGFGKAAYFRNIQVVDSNGQFVDPNDDLLEVFADNPSCYDVSYIGD-GDWGYYFFYGGP  229 (229)
T ss_pred             CCCCCCCccCCcCCCCCCCccEEEEEEEEEcCCCCCcCCcccceeEccCCCCceeEeeccC-CCcccEEEeeCC
Confidence            3589999999999999999999999999999999888774  4778999999999998733 679999999999



However, the structure of the protein PDB3:3eu8 has an alpha-alpha toroid fold and is similar to a glucoamylase, PDB:1ayx. Such glucoamylases are involved in breaking down complex sugars (e.g. starch). The biologically relevant state is likely to be monomeric. The putative active site is located at the centre of the toroid with a well defined large cavity. Further structural comparisons also show relationships with other glycohydrolases.

>PF14365 DUF4409: Domain of unknown function (DUF4409) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query422
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 5e-09
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 57.6 bits (138), Expect = 5e-09
 Identities = 64/447 (14%), Positives = 128/447 (28%), Gaps = 145/447 (32%)

Query: 17  LCNGVEVNAKARKTLPEIDRKLKLLNRPAVKSIKSEDGDIIDCVDIY------KQPALDH 70
           L N  +V AK    +  +   LKL  R A+  ++         V I       K   +  
Sbjct: 119 LYNDNQVFAK--YNVSRLQPYLKL--RQALLELRPAKN-----VLIDGVLGSGKT-WVAL 168

Query: 71  PALKNHTIQLSPSFDIPAEKVDRRNESSRLPVTIQTWQKSGSCPNGTVPIRRIRRQDLLR 130
               ++ +Q    F I                    W    +C            + +L 
Sbjct: 169 DVCLSYKVQCKMDFKI-------------------FWLNLKNC---------NSPETVLE 200

Query: 131 ASSLQQFGRKV-PEVSYAANRTDATHSKFESINNKTIHLGPLVDRSAAVLVTVGFNYIGA 189
              LQ+   ++ P  +  ++ +     +  SI  +   L         +LV         
Sbjct: 201 M--LQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENCLLVL-------- 250

Query: 190 QGDINVWNPRVESPDDY------TTAQI----WLKGGPGDNFESVEGGWVVNPKLYGDKL 239
               NV N +  +  +       TT       +L      +         + P    D++
Sbjct: 251 --L-NVQNAKAWNAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTP----DEV 303

Query: 240 TRLFVYWTKDSYKSTGCFD---AICSG--FVQTGQVALGAAIGPWSISEGP---QYYLPV 291
             L + +           D    + +         +++ A     SI +G      +  V
Sbjct: 304 KSLLLKYL-----DCRPQDLPREVLTTNPRR----LSIIAE----SIRDGLATWDNWKHV 350

Query: 292 GI------------YLDPNT--KNWW-LKV---NGNIVVGYWPGSLFSYLSYSAILVEWG 333
                          L+P    K +  L V   + +I     P  L S +        W 
Sbjct: 351 NCDKLTTIIESSLNVLEPAEYRKMFDRLSVFPPSAHI-----PTILLSLI--------WF 397

Query: 334 GQVYS--PNVKKTPHTKTAMG----SGSFS-HGLQ-GSACSIE-----HVRIID-FSLQL 379
             + S    V    H  + +       + S   +       +E     H  I+D +++  
Sbjct: 398 DVIKSDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALHRSIVDHYNIPK 457

Query: 380 KYPQW--VGTWADEYYCYDAYNYVKGY 404
            +     +  + D+Y+    Y+++ G+
Sbjct: 458 TFDSDDLIPPYLDQYF----YSHI-GH 479


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00