Citrus Sinensis ID: 044596


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640----
MNAKPKPMESLLSLFSFFFVLLYSFLLLSVKVTLAGGANTSGDVLKGKENPFSPKAYLMRYWDKTVSKNSPKPQFLLSKASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYASHSNVVDDSFNTYGTGATGGAGDFKSYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNSGGESFSSYGKNGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNSGIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDYAKTGVTFASYSKNGSGSTSGSSVNKNNNKWVEPGKFFRESMLKTGTVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECERQPSKGETKRCIGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVDRMV
cccccccccccHHHHHHHHHHHHHHHHHHHHEEcccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccccEEEEccccccccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccccHHHHHHHcccccccHHHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHcccccEEEEEEccccccccEEEEEEEEEcccccccEEEEcccccccEEEEEEEccccEEEEEEEEcccccccEEEEEEEEEccccccccccHHHHHcccccccccEEEEEEcccEEEEEccccc
ccccccccccHHHHHHHHHHHHHHHHHHHHHEEEEccccccccccccccccccHHHHHHHHHHHccccccccccHHHcccccccHHHHHHHHHHHcccccccHcHHHHHHHcccccccccccccccccccccEEcccccccccccccccccccccccccccccccccHHHcccccccccccccccccccccccccccccccccccccccEEccccccccccccccEccccccccEEEEEEEccccccccccEEEcccccccccccEEEcccccccccccccccccccccccccccEccccccccccccccccccccccccccccccccccccccccHHHccccccccccEEEccccccccccEEEEcccccccccccEEEEcccccccccccccccHHHHcccccccccccccccccccccccccEEEHHHcccccEEEcccccccccccccccHHHHHcccccHHHHHHHHHHcccccccHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHcccEEEEEcccccccccEEEEEEEEEccccccccEEEEccccccEEEEEEEcccccEEEEEEEEcccccccEEEEEEEEEcccccccccccEEEEEcccccccEEEEEcccccEEEEEccccc
MNAKPKPMESLLSLFSFFFVLLYSFLLLSVKVTLaggantsgdvlkgkenpfspkayLMRYWdktvsknspkpqfllskasplnavetATYSKLAEqktlssvlpsfcssaklfcfpdlspslekhgadanfavyenknftnygtaglggvnsfknysedgnvevdsfrrysrdsaghndkfsayashsnvvddsfntygtgatggagdfksynrevnvpnlrftsysdagngraqtfssytentnsggesfssygkngnglpneFMSYAKESNVIQSnftsygeagnaandtfksygsdgnvpeqnfksygdggnsgiesftsynpqsnvgannfksyakssntekvnfanyqedfnsvshrfsgygenadgqkigfkvygpdrsfkdyaktgvtfasyskngsgstsgssvnknnnkwvepgKFFRESmlktgtvmpmpdirdkmpqrsflpraivsklpfsssnVNVLKEIFHASENSSMESIIKDALSecerqpskgetkrcigSAEDMIDFATSvlgnsvtlrttqnvqgskqnimigsvkginggkvtqsvschqslfpyllyychsvpkvrvyeadlldpktkakinhgvaichidtsawsqTHGAFLalgsgpgriEVCHWIFENDLTWTIVDRMV
MNAKPKPMESLLSLFSFFFVLLYSFLLLSVKVTLAGgantsgdvlkgkenpfSPKAYLMRYWDKTVSKNSPKPQFLLSKASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYASHSNVVDDSFNTYGTGAtggagdfksynREVNVPNLRftsysdagngRAQTFSSYTENTNSGGESFSSYGKNGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNSGIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDYAKTGVTFASyskngsgstsgssvnknnnkwvepGKFFRESMLKTGTVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFhasenssmESIIKDALsecerqpskgeTKRCIGSAEDMIDFATSVLGNSVTLrttqnvqgskqniMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENdltwtivdrmv
MNAKPKPMEsllslfsfffvllysflllsVKVTLAGGANTSGDVLKGKENPFSPKAYLMRYWDKTVSKNSPKPQFLLSKASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYASHSNVVDDSFNTYgtgatggagDFKSYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNSGGESFSSYGKNGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNSGIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDYAKTGVTFAsyskngsgstsgssvnknnnkWVEPGKFFRESMLKTGTVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECERQPSKGETKRCIGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVDRMV
**********LLSLFSFFFVLLYSFLLLSVKVTLAGGANTSGDVLK*****FSPKAYLMRYWDKTV*********************TATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGVNSFKNYS*******************************NVVDDSFNTYGTGATGGAGDFKSYNREVNVPNLRFT****************************************************************************************************************************FANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDYAKTGVTFA********************************************************RAIVSKLPFS**NVNVLKEIF*******************************IGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVD***
**********LLSLFSFFFVLLYSFLLLSVKVTLAG***************FSPKAYLMRYWDKTVSKN**********ASP*********************LPSFCSSAKLFC**********************************************************************************************************************************************************AK***V*****************T***Y*S*******************************************************************************KVYGPDRSFKDYAKTGV**********************NKWVEPGKFFRESMLKTGTVMPMPD****M***SFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECE*************SAEDMIDFATSVLGNS**************************GKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVDRMV
********ESLLSLFSFFFVLLYSFLLLSVKVTLAGGANTSGDVLKGKENPFSPKAYLMRYWDKTVSKNSPKPQFLLSKASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYASHSNVVDDSFNTYGTGATGGAGDFKSYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNSGGESFSSYGKNGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNSGIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDYAKTGVTFASYS************NKNNNKWVEPGKFFRESMLKTGTVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSE************CIGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVDRMV
*****KPMESLLSLFSFFFVLLYSFLLLSVKVTLAGGA*********KENPFSPKAYLMRYWDKTVSKNSPKPQFLLSKASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSA*HNDKFSAY*SHSNVVDDSFNTYGTGATGGAGDFKSYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNSGGESFSSYGKNGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNSGIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDYAKTGVTFASYSKN************NNNKWVEPGKFFRESMLKTGTVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECERQPSKGETKRCIGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVD***
iiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MNAKPKPMESLLSLFSFFFVLLYSFLLLSVKVTLAGGANTSGDVLKGKENPFSPKAYLMRYWDKTVSKNSPKPQFLLSKASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYASHSNVVDDSFNTYGTGATGGAGDFKSYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNSGGESFSSYGKNGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNSGIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDYAKTGVTFASYSKNGSGSTSGSSVNKNNNKWVEPGKFFRESMLKTGTVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECERQPSKGETKRCIGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVDRMV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query644 2.2.26 [Sep-21-2011]
P92990626 Probable polygalacturonas yes no 0.961 0.988 0.637 0.0
P92982622 Probable polygalacturonas no no 0.958 0.991 0.636 0.0
O80760624 Probable polygalacturonas no no 0.958 0.988 0.612 0.0
Q40161630 Polygalacturonase-1 non-c N/A no 0.917 0.938 0.612 0.0
P93217629 Polygalacturonase non-cat N/A no 0.914 0.936 0.610 0.0
P93218632 Polygalacturonase non-cat N/A no 0.928 0.946 0.590 0.0
Q6ZA27627 BURP domain-containing pr yes no 0.902 0.926 0.493 1e-167
Q6K2M1690 BURP domain-containing pr no no 0.909 0.849 0.438 1e-139
Q7XES5344 BURP domain-containing pr no no 0.349 0.654 0.530 6e-72
Q0JEP3324 BURP domain-containing pr no no 0.324 0.645 0.316 7e-26
>sp|P92990|JP650_ARATH Probable polygalacturonase non-catalytic subunit JP650 OS=Arabidopsis thaliana GN=JP650 PE=2 SV=2 Back     alignment and function desciption
 Score =  842 bits (2175), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/626 (63%), Positives = 492/626 (78%), Gaps = 7/626 (1%)

Query: 19  FVLLYSFLLLSVKVTLAGGANTSGDVLKGKENPFSPKAYLMRYWDKTVSKNSPKPQFLLS 78
           F+LL SF      V + GG  T G     +ENPF+PKA L+RYW+K +   SP+ +FL+S
Sbjct: 5   FLLLQSFSFFLFNVVIVGG-RTFGGGFSAEENPFTPKASLVRYWNKEIRGQSPRSEFLIS 63

Query: 79  KASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENK 138
           KASPLNAV++AT+SKLA   +L +  P FCS+A LFCFPDL  SLEKH  D  F+VY+ K
Sbjct: 64  KASPLNAVDSATFSKLAAANSLPTRFPDFCSAANLFCFPDLGASLEKHDDDVKFSVYDQK 123

Query: 139 NFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYASHSNVVDDSFNT 198
           NFTNYG A  GG +SFKNYS+DGNV  DSFRRYSR++AGH+DKF+ Y  +SNVV++ FN+
Sbjct: 124 NFTNYGNARAGGADSFKNYSKDGNVVTDSFRRYSRNAAGHDDKFTVYGENSNVVEEGFNS 183

Query: 199 YGTGATGGAGDFKSYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNSG-GESFSSYGK 257
           YGT  TGGAGDF +Y   VN P  RFT+YSD GNGR+QTF +YT   N+G G+SF+SYGK
Sbjct: 184 YGTFGTGGAGDFTNYQNNVNNPTSRFTAYSDGGNGRSQTFKTYTHEANAGNGQSFTSYGK 243

Query: 258 NGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNS 317
           NGNG+PNEF SY   SNVI S F++YGE+GNAANDTF SYGSDGNVP+ NF +YG  GN+
Sbjct: 244 NGNGVPNEFTSYGVSSNVIGSGFSNYGESGNAANDTFTSYGSDGNVPQNNFNNYGASGNA 303

Query: 318 GIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIG 377
            +++F +Y  ++NVG ++F SYAK SN+EKVNF NY + FN  S  F+GYG+ A+G K+ 
Sbjct: 304 AVDTFANYRDKANVGDDSFSSYAKDSNSEKVNFVNYGQSFNPGSETFTGYGKGAEGSKLS 363

Query: 378 FKVYGPDRSFKDYAKTGVTFASYSKNGSGSTSGSSVN--KNNNKWVEPGKFFRESMLKTG 435
           FK Y P+ +FKDYAK GV FA Y+ +   +T+ ++V   K  NKW+EPGKFFRES LK G
Sbjct: 364 FKTYTPNSTFKDYAKKGVAFAKYNVS---TTTANTVGDGKTVNKWIEPGKFFRESSLKEG 420

Query: 436 TVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECE 495
           TV+PMPDI+DKMP+RSFLPR+I++KLPFS+S +  +K IFHA ENS+M  II DA++ECE
Sbjct: 421 TVIPMPDIKDKMPKRSFLPRSIITKLPFSTSKLGEIKRIFHAVENSTMGGIITDAVTECE 480

Query: 496 RQPSKGETKRCIGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQ 555
           R PS GETKRC+GSAEDMIDFATSVLG SV LRTT+NV GSK+ ++IG V GINGGK+T+
Sbjct: 481 RPPSVGETKRCVGSAEDMIDFATSVLGRSVVLRTTENVAGSKEKVVIGKVNGINGGKLTK 540

Query: 556 SVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFL 615
           +VSCHQSL+PYLLYYCHSVPKVRVYEADLL+  +K KINHG+AICH+DTS+W  +HGAFL
Sbjct: 541 AVSCHQSLYPYLLYYCHSVPKVRVYEADLLELNSKKKINHGIAICHMDTSSWGPSHGAFL 600

Query: 616 ALGSGPGRIEVCHWIFENDLTWTIVD 641
           ALGS PGRIEVCHWIFEND+ W I D
Sbjct: 601 ALGSKPGRIEVCHWIFENDMNWAIAD 626




Non-catalytic subunit of polygalacturonase.
Arabidopsis thaliana (taxid: 3702)
>sp|P92982|JP630_ARATH Probable polygalacturonase non-catalytic subunit JP630 OS=Arabidopsis thaliana GN=JP630 PE=2 SV=2 Back     alignment and function description
>sp|O80760|Y639_ARATH Probable polygalacturonase non-catalytic subunit At1g60390 OS=Arabidopsis thaliana GN=At1g60390 PE=2 SV=1 Back     alignment and function description
>sp|Q40161|GP1_SOLLC Polygalacturonase-1 non-catalytic subunit beta OS=Solanum lycopersicum GN=GP1 PE=1 SV=1 Back     alignment and function description
>sp|P93217|GP2_SOLLC Polygalacturonase non-catalytic subunit AroGP2 OS=Solanum lycopersicum GN=GP2 PE=3 SV=1 Back     alignment and function description
>sp|P93218|GP3_SOLLC Polygalacturonase non-catalytic subunit AroGP3 OS=Solanum lycopersicum GN=GP3 PE=3 SV=1 Back     alignment and function description
>sp|Q6ZA27|BURPC_ORYSJ BURP domain-containing protein 12 OS=Oryza sativa subsp. japonica GN=BURP12 PE=2 SV=1 Back     alignment and function description
>sp|Q6K2M1|BURPE_ORYSJ BURP domain-containing protein 14 OS=Oryza sativa subsp. japonica GN=BURP14 PE=2 SV=2 Back     alignment and function description
>sp|Q7XES5|BURPG_ORYSJ BURP domain-containing protein 16 OS=Oryza sativa subsp. japonica GN=BURP16 PE=2 SV=1 Back     alignment and function description
>sp|Q0JEP3|BURP5_ORYSJ BURP domain-containing protein 5 OS=Oryza sativa subsp. japonica GN=BURP5 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query644
225425505633 PREDICTED: probable polygalacturonase no 0.975 0.992 0.666 0.0
224100181598 predicted protein [Populus trichocarpa] 0.916 0.986 0.697 0.0
225425509624 PREDICTED: probable polygalacturonase no 0.962 0.993 0.648 0.0
224107839615 predicted protein [Populus trichocarpa] 0.933 0.977 0.668 0.0
15223133626 putative polygalacturonase non-catalytic 0.961 0.988 0.637 0.0
297841779626 hypothetical protein ARALYDRAFT_894840 [ 0.962 0.990 0.641 0.0
1762584626 polygalacturonase isoenzyme 1 beta subun 0.961 0.988 0.635 0.0
449450462623 PREDICTED: probable polygalacturonase no 0.959 0.991 0.626 0.0
449533308612 PREDICTED: probable polygalacturonase no 0.950 1.0 0.630 0.0
15220856622 putative polygalacturonase non-catalytic 0.958 0.991 0.636 0.0
>gi|225425505|ref|XP_002266341.1| PREDICTED: probable polygalacturonase non-catalytic subunit JP630-like [Vitis vinifera] Back     alignment and taxonomy information
 Score =  876 bits (2264), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/639 (66%), Positives = 497/639 (77%), Gaps = 11/639 (1%)

Query: 8   MESLLSLFSFFFVLLYSFLLLSVKVTLAGGANTSGDVLKGKENPFSPKAYLMRYWDKTVS 67
           M     L S  F L++ F      V L G    SG  + G ENPF+P+A L+RYW+K + 
Sbjct: 1   MHRQHKLLSHLF-LIFLFSASFPNVALGGAGEESGKPVAG-ENPFTPRASLLRYWNKQIG 58

Query: 68  KNSPKPQFLLSKASPLNAVETATYSKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHG 127
              PK  FLLSKASPL+AVE+AT++KLA Q  LS  LP+FC SA L CFPDL  SLEKH 
Sbjct: 59  SGLPKSTFLLSKASPLSAVESATFTKLAAQNALSDNLPAFCKSANLLCFPDLGQSLEKHD 118

Query: 128 ADANFAVYENKNFTNYGTAGLGGVNSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYAS 187
           A +NFAVY NKNFTNYGT GLGG ++F  YSE  N+ VD+F+RYSRDS G+NDKF  Y  
Sbjct: 119 ASSNFAVYSNKNFTNYGTDGLGGTDAFTKYSEGVNLPVDAFKRYSRDSVGNNDKFDNYGR 178

Query: 188 HSNVVDDSFNTYGTGATGGAGDFKSYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNS 247
             NVVD SF+ YG GATGG+G+FK YN EVNVPNLRF SY+D GNGR Q+F+SYT  TNS
Sbjct: 179 EGNVVDQSFSGYGAGATGGSGEFKKYNEEVNVPNLRFNSYTDDGNGRQQSFTSYTNETNS 238

Query: 248 GGESFSSYGKNGNGLPNEFMSYAKESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQN 307
           G ESFSSYGKNGNG PNEF SY   SNVI S FT YGE GNAANDTFKSYG DGNVPE N
Sbjct: 239 GDESFSSYGKNGNGSPNEFTSYGSSSNVIGSTFTGYGETGNAANDTFKSYGFDGNVPENN 298

Query: 308 FKSYGDGGNSGIESFTSYNPQSNVGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGY 367
           FK YGDGGN+G ++F SY  QSNVG ++FKSYAK+SN+ +V+F NY + FN  +  F+GY
Sbjct: 299 FKKYGDGGNAGTDTFISYRDQSNVGDDSFKSYAKNSNSAEVDFVNYGKSFNEGTDTFAGY 358

Query: 368 GENADGQKIGFKVYGPDRSFKDYAKTGVTFASYSKNGSGS-----TSGSSVNKNNNKWVE 422
           G+ A   KI FK+YG + +F DYAK G++F+ Y+   S +      SGSSV    N+WVE
Sbjct: 359 GKGATNHKINFKIYGVNNTFTDYAKKGISFSRYTNKSSETMTSMAVSGSSV----NRWVE 414

Query: 423 PGKFFRESMLKTGTVMPMPDIRDKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSS 482
           PGKFFRESMLK GTVMPMPDIRDKMP+RSFLPR I SK PFS+S +  +K+IFHA++NSS
Sbjct: 415 PGKFFRESMLKKGTVMPMPDIRDKMPKRSFLPRTISSKFPFSTSKLEEMKKIFHAADNSS 474

Query: 483 MESIIKDALSECERQPSKGETKRCIGSAEDMIDFATSVLGNSVTLRTTQNVQGSKQNIMI 542
           ME +  +AL +CER PSKGETKRC+ S EDMIDFAT+VLG +V +RTTQ+V+GSKQN+MI
Sbjct: 475 MEHMFTEALDDCERAPSKGETKRCVPSIEDMIDFATTVLGRNVVVRTTQSVEGSKQNLMI 534

Query: 543 GSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINHGVAICHI 602
           GSVKGINGG+VT+SVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKA +NHGVAICH+
Sbjct: 535 GSVKGINGGQVTKSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKANVNHGVAICHL 594

Query: 603 DTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIVD 641
           DTS WS  HGAF+ALGSGPGRIEVCHWIFEND+TWTI D
Sbjct: 595 DTSDWSAGHGAFVALGSGPGRIEVCHWIFENDMTWTIAD 633




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224100181|ref|XP_002311777.1| predicted protein [Populus trichocarpa] gi|222851597|gb|EEE89144.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225425509|ref|XP_002266815.1| PREDICTED: probable polygalacturonase non-catalytic subunit JP650-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|224107839|ref|XP_002314620.1| predicted protein [Populus trichocarpa] gi|222863660|gb|EEF00791.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|15223133|ref|NP_177194.1| putative polygalacturonase non-catalytic subunit JP650 [Arabidopsis thaliana] gi|334183807|ref|NP_001185361.1| putative polygalacturonase non-catalytic subunit JP650 [Arabidopsis thaliana] gi|82592920|sp|P92990.2|JP650_ARATH RecName: Full=Probable polygalacturonase non-catalytic subunit JP650; AltName: Full=Aromatic-rich glycoprotein JP650; Flags: Precursor gi|3176680|gb|AAC18803.1| Identical to polygalacuronase isoenzyme 1 beta subunit homolog mRNA gb|U63373. EST gb|AA404878 comes from this gene [Arabidopsis thaliana] gi|110742068|dbj|BAE98965.1| hypothetical protein [Arabidopsis thaliana] gi|332196929|gb|AEE35050.1| putative polygalacturonase non-catalytic subunit JP650 [Arabidopsis thaliana] gi|332196930|gb|AEE35051.1| putative polygalacturonase non-catalytic subunit JP650 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297841779|ref|XP_002888771.1| hypothetical protein ARALYDRAFT_894840 [Arabidopsis lyrata subsp. lyrata] gi|297334612|gb|EFH65030.1| hypothetical protein ARALYDRAFT_894840 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|1762584|gb|AAB39546.1| polygalacturonase isoenzyme 1 beta subunit homolog [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|449450462|ref|XP_004142981.1| PREDICTED: probable polygalacturonase non-catalytic subunit JP650-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449533308|ref|XP_004173618.1| PREDICTED: probable polygalacturonase non-catalytic subunit JP650-like, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|15220856|ref|NP_173788.1| putative polygalacturonase non-catalytic subunit JP630 [Arabidopsis thaliana] gi|82592919|sp|P92982.2|JP630_ARATH RecName: Full=Probable polygalacturonase non-catalytic subunit JP630; AltName: Full=Aromatic-rich glycoprotein JP630; Flags: Precursor gi|4056458|gb|AAC98031.1| Identical to gb|ATU59467 aromatic rich glycoprotein which is strongly similar to gb|U63373 polygalacturonase isozyme 1 from Arabidopsis thaliana. EST gb|AA395212 comes from this gene [Arabidopsis thaliana] gi|332192308|gb|AEE30429.1| putative polygalacturonase non-catalytic subunit JP630 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query644
TAIR|locus:2016194626 PG2 "AT1G70370" [Arabidopsis t 0.944 0.971 0.623 7.3e-217
TAIR|locus:2034823622 JP630 "AT1G23760" [Arabidopsis 0.937 0.971 0.621 6.1e-213
TAIR|locus:2195593624 PG1 "AT1G60390" [Arabidopsis t 0.936 0.966 0.600 3.4e-203
TAIR|locus:2179424392 RD22 "RESPONSIVE TO DESSICATIO 0.319 0.525 0.327 9.1e-22
UNIPROTKB|Q70KG3362 RAFTIN1B "Protein RAFTIN 1B" [ 0.322 0.574 0.291 5.7e-20
UNIPROTKB|Q70KG5389 RAFTIN1A "Protein RAFTIN 1A" [ 0.324 0.537 0.283 1.7e-19
TAIR|locus:2010237280 USPL1 "AT1G49320" [Arabidopsis 0.327 0.753 0.280 1.7e-19
UNIPROTKB|Q7F8U7412 BURP13 "BURP domain-containing 0.324 0.507 0.267 7.5e-16
SGD|S000004784430 DDR48 "DNA damage-responsive p 0.392 0.588 0.271 3.4e-10
DICTYBASE|DDB_G0288501986 ddx42 "DEAD/DEAH box helicase" 0.330 0.216 0.266 2.5e-07
TAIR|locus:2016194 PG2 "AT1G70370" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 2095 (742.5 bits), Expect = 7.3e-217, P = 7.3e-217
 Identities = 381/611 (62%), Positives = 467/611 (76%)

Query:    32 VTLAGGANTSGDVLKGKENPFSPKAYLMRYWDKTVSKNSPKPQFLLSKASPLNAVETATY 91
             V + GG  T G     +ENPF+PKA L+RYW+K +   SP+ +FL+SKASPLNAV++AT+
Sbjct:    18 VVIVGG-RTFGGGFSAEENPFTPKASLVRYWNKEIRGQSPRSEFLISKASPLNAVDSATF 76

Query:    92 SKLAEQKTLSSVLPSFCSSAKLFCFPDLSPSLEKHGADANFAVYENKNFTNYGTAGLGGV 151
             SKLA   +L +  P FCS+A LFCFPDL  SLEKH  D  F+VY+ KNFTNYG A  GG 
Sbjct:    77 SKLAAANSLPTRFPDFCSAANLFCFPDLGASLEKHDDDVKFSVYDQKNFTNYGNARAGGA 136

Query:   152 NSFKNYSEDGNVEVDSFRRYSRDSAGHNDKFSAYASHSNVVDDSFNTYXXXXXXXXXDFK 211
             +SFKNYS+DGNV  DSFRRYSR++AGH+DKF+ Y  +SNVV++ FN+Y         DF 
Sbjct:   137 DSFKNYSKDGNVVTDSFRRYSRNAAGHDDKFTVYGENSNVVEEGFNSYGTFGTGGAGDFT 196

Query:   212 SYNREVNVPNLRFTSYSDAGNGRAQTFSSYTENTNSG-GESFSSYGKNGNGLPNEFMSYA 270
             +Y   VN P  RFT+YSD GNGR+QTF +YT   N+G G+SF+SYGKNGNG+PNEF SY 
Sbjct:   197 NYQNNVNNPTSRFTAYSDGGNGRSQTFKTYTHEANAGNGQSFTSYGKNGNGVPNEFTSYG 256

Query:   271 KESNVIQSNFTSYGEAGNAANDTFKSYGSDGNVPEQNFKSYGDGGNSGIESFTSYNPQSN 330
               SNVI S F++YGE+GNAANDTF SYGSDGNVP+ NF +YG  GN+ +++F +Y  ++N
Sbjct:   257 VSSNVIGSGFSNYGESGNAANDTFTSYGSDGNVPQNNFNNYGASGNAAVDTFANYRDKAN 316

Query:   331 VGANNFKSYAKSSNTEKVNFANYQEDFNSVSHRFSGYGENADGQKIGFKVYGPDRSFKDY 390
             VG ++F SYAK SN+EKVNF NY + FN  S  F+GYG+ A+G K+ FK Y P+ +FKDY
Sbjct:   317 VGDDSFSSYAKDSNSEKVNFVNYGQSFNPGSETFTGYGKGAEGSKLSFKTYTPNSTFKDY 376

Query:   391 AKTGVTFAXXXXXXXXXXXXXXXXXXXXXWVEPGKFFRESMLKTGTVMPMPDIRDKMPQR 450
             AK GV FA                     W+EPGKFFRES LK GTV+PMPDI+DKMP+R
Sbjct:   377 AKKGVAFAKYNVSTTTANTVGDGKTVNK-WIEPGKFFRESSLKEGTVIPMPDIKDKMPKR 435

Query:   451 SFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECERQPSKGETKRCIGSA 510
             SFLPR+I++KLPFS+S +  +K IFHA ENS+M  II DA++ECER PS GETKRC+GSA
Sbjct:   436 SFLPRSIITKLPFSTSKLGEIKRIFHAVENSTMGGIITDAVTECERPPSVGETKRCVGSA 495

Query:   511 EDMIDFATSVLGNSVTLRTTQNVQGSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYY 570
             EDMIDFATSVLG SV LRTT+NV GSK+ ++IG V GINGGK+T++VSCHQSL+PYLLYY
Sbjct:   496 EDMIDFATSVLGRSVVLRTTENVAGSKEKVVIGKVNGINGGKLTKAVSCHQSLYPYLLYY 555

Query:   571 CHSVPKVRVYEADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWI 630
             CHSVPKVRVYEADLL+  +K KINHG+AICH+DTS+W  +HGAFLALGS PGRIEVCHWI
Sbjct:   556 CHSVPKVRVYEADLLELNSKKKINHGIAICHMDTSSWGPSHGAFLALGSKPGRIEVCHWI 615

Query:   631 FENDLTWTIVD 641
             FEND+ W I D
Sbjct:   616 FENDMNWAIAD 626




GO:0004650 "polygalacturonase activity" evidence=ISS
GO:0005576 "extracellular region" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0000271 "polysaccharide biosynthetic process" evidence=RCA
GO:0000272 "polysaccharide catabolic process" evidence=RCA
GO:0005982 "starch metabolic process" evidence=RCA
GO:0007389 "pattern specification process" evidence=RCA
GO:0008361 "regulation of cell size" evidence=RCA
GO:0009664 "plant-type cell wall organization" evidence=RCA
GO:0009825 "multidimensional cell growth" evidence=RCA
GO:0009832 "plant-type cell wall biogenesis" evidence=RCA
GO:0009926 "auxin polar transport" evidence=RCA
GO:0009932 "cell tip growth" evidence=RCA
GO:0010015 "root morphogenesis" evidence=RCA
GO:0010075 "regulation of meristem growth" evidence=RCA
GO:0010817 "regulation of hormone levels" evidence=RCA
GO:0040007 "growth" evidence=RCA
GO:0043481 "anthocyanin accumulation in tissues in response to UV light" evidence=RCA
GO:0048653 "anther development" evidence=RCA
GO:0048767 "root hair elongation" evidence=RCA
GO:0071555 "cell wall organization" evidence=RCA
TAIR|locus:2034823 JP630 "AT1G23760" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2195593 PG1 "AT1G60390" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2179424 RD22 "RESPONSIVE TO DESSICATION 22" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q70KG3 RAFTIN1B "Protein RAFTIN 1B" [Triticum aestivum (taxid:4565)] Back     alignment and assigned GO terms
UNIPROTKB|Q70KG5 RAFTIN1A "Protein RAFTIN 1A" [Triticum aestivum (taxid:4565)] Back     alignment and assigned GO terms
TAIR|locus:2010237 USPL1 "AT1G49320" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q7F8U7 BURP13 "BURP domain-containing protein 13" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
SGD|S000004784 DDR48 "DNA damage-responsive protein" [Saccharomyces cerevisiae (taxid:4932)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0288501 ddx42 "DEAD/DEAH box helicase" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q6ZA27BURPC_ORYSJNo assigned EC number0.49340.90210.9266yesno
P93218GP3_SOLLCNo assigned EC number0.59080.92850.9462N/Ano
P93217GP2_SOLLCNo assigned EC number0.61030.91450.9364N/Ano
P92990JP650_ARATHNo assigned EC number0.63730.96110.9888yesno
Q40161GP1_SOLLCNo assigned EC number0.61200.91770.9380N/Ano

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query644
smart01045222 smart01045, BURP, The BURP domain is found at the 1e-104
pfam03181216 pfam03181, BURP, BURP domain 3e-89
>gnl|CDD|214992 smart01045, BURP, The BURP domain is found at the C-terminus of several different plant proteins Back     alignment and domain information
 Score =  316 bits (810), Expect = e-104
 Identities = 131/224 (58%), Positives = 158/224 (70%), Gaps = 11/224 (4%)

Query: 424 GKFFRESMLKTGTVMPMPDIRDK-MPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSS 482
           GKFFRE+ LK GT+M MP I+D  MP+R FLPR I   LPFSSS ++ +  +F A++NS 
Sbjct: 1   GKFFRENDLKEGTLMLMPFIKDDLMPKRPFLPRQIADLLPFSSSKIDEILRVFSATKNSP 60

Query: 483 MESIIKDALSECERQPSKGETKRCIGSAEDMIDFATSVLGNS-VTLRTTQNVQGSKQ--- 538
           M  IIK+ + ECE    +GETKRC+ S E MIDFATSVLG   V +RTT+ V GSK    
Sbjct: 61  MAGIIKETVGECEAPAIEGETKRCVTSLESMIDFATSVLGRYVVKVRTTEVVVGSKNKAL 120

Query: 539 ---NIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVYEADLLDPKTKAKINH 595
               ++I  VKG+NG   T+SVSCHQSL+PY +YYCHSVP VRVYE DLLDPK   KIN 
Sbjct: 121 HNYTVVIAKVKGLNG---TKSVSCHQSLYPYAVYYCHSVPGVRVYEVDLLDPKGMRKINV 177

Query: 596 GVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTI 639
           G A+CH+DTSAW   HGAF  L S PG+I VCH+I END+ W I
Sbjct: 178 GPAVCHMDTSAWDANHGAFKVLKSEPGQIPVCHFIPENDMVWVI 221


It was named after the proteins in which it was first identified: the BNM2 clone-derived protein from Brassica napus; USPs and USP-like proteins; RD22 from Arabidopsis thaliana; and PG1beta from Lycopersicon esculentum. This domain is around 230 amino acid residues long. It possesses the following conserved features: two phenylalanine residues at its N-terminus; two cysteine residues; and four repeated cysteine-histidine motifs, arranged as: CH-X(10)-CH-X(25-27)-CH-X(25-26)-CH, where X can be any amino acid. The function of this domain is unknown. Length = 222

>gnl|CDD|202568 pfam03181, BURP, BURP domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 644
PF03181216 BURP: BURP domain; InterPro: IPR004873 The BURP do 100.0
>PF03181 BURP: BURP domain; InterPro: IPR004873 The BURP domain is a ~230-residue module, which has been named for the four members of the group initially identified, BNM2, USP, RD22, and PG1beta Back     alignment and domain information
Probab=100.00  E-value=5.9e-81  Score=619.30  Aligned_cols=213  Identities=40%  Similarity=0.742  Sum_probs=199.9

Q ss_pred             CccccccCCCCCceecCCCCC-CCCCCCCCCchhcccCCCCCcccHHHHHHHhcCCCCchHHHHHHHHHHHhccCCCCCC
Q 044596          424 GKFFRESMLKTGTVMPMPDIR-DKMPQRSFLPRAIVSKLPFSSSNVNVLKEIFHASENSSMESIIKDALSECERQPSKGE  502 (644)
Q Consensus       424 g~FF~E~dL~pG~~M~l~~i~-d~~p~~~FLPR~vAdsIPFSs~kL~eIL~~Fsi~~~S~~A~~m~~TL~~CE~~pi~GE  502 (644)
                      .+||+|+||+||++|+|++.. +..+.++||||++|++||||+++|++||++|+|+++|+||++|++||++||.+|++||
T Consensus         1 ~~fF~e~dL~~G~~m~l~f~~~~~~~~~~fLpr~~A~siPfss~~l~~iL~~Fsi~~~S~~A~~m~~Tl~~Ce~~~~~GE   80 (216)
T PF03181_consen    1 ALFFLEKDLHPGKKMPLYFPKSDNSAKRPFLPRQVADSIPFSSSKLPEILQMFSIPPGSPMAKAMKNTLEECESPPIKGE   80 (216)
T ss_pred             CcccCHHHCCCCceeeecCCCCCCCcccccCCHHHhccCCcCHHHHHHHHHHhcCCCCCHHHHHHHHHHHHhhcCCCCCc
Confidence            379999999999999987643 3346789999999999999999999999999999999999999999999999999999


Q ss_pred             cccccCchhhHHHHHhhhcCC-ceEEEEeeccc-CCcceEEEeeeeeecCCCccceeeecccCCcceeeEeeeeCceeEE
Q 044596          503 TKRCIGSAEDMIDFATSVLGN-SVTLRTTQNVQ-GSKQNIMIGSVKGINGGKVTQSVSCHQSLFPYLLYYCHSVPKVRVY  580 (644)
Q Consensus       503 ~K~CaTSLESMiDFa~S~LG~-nv~~~ST~~~~-~s~q~~~v~~V~~i~gg~~~k~V~CH~~~YPYaVfYCH~v~~TrvY  580 (644)
                      +|+|||||||||||++|+||+ +|++++|+... ...|+|+|++|++|.++  +++|+||+|+|||+|||||.++.||||
T Consensus        81 ~k~CaTSLESMvdF~~s~LG~~~v~a~st~~~~~~~~~~y~V~~v~~i~~~--~~~V~CH~~~yPYaVyyCH~~~~t~~y  158 (216)
T PF03181_consen   81 TKYCATSLESMVDFAVSKLGTRNVRALSTEVPKSTPLQNYTVEGVKKIGGG--DKSVVCHKMPYPYAVYYCHSIPPTRVY  158 (216)
T ss_pred             CccCcCCHHHHHHHHHHhcCCCccEEEeccccCCCCCccEEEEeeeeecCC--CceEEEcccCCceeEEEeeecCceeEE
Confidence            999999999999999999999 79999998876 45789999999999873  589999999999999999999999999


Q ss_pred             EEEeeCCcCccccceEEEEeeccCCCCCCcchHHHhhCCCCCCcceeeeeeCCceEEeec
Q 044596          581 EADLLDPKTKAKINHGVAICHIDTSAWSQTHGAFLALGSGPGRIEVCHWIFENDLTWTIV  640 (644)
Q Consensus       581 ~V~L~g~d~~~ki~~AvAVCH~DTS~WnP~H~AF~~L~vkPG~~pVCHfi~~~~ivWvpa  640 (644)
                      +|+|++.|++  +++||||||+|||.|||+|+||++||+|||++||||||++++|+|||.
T Consensus       159 ~V~l~g~dg~--~~~avavCH~DTS~W~p~h~aF~~L~vkPG~~~VCHf~~~~~ivWv~~  216 (216)
T PF03181_consen  159 MVPLVGEDGT--KVEAVAVCHLDTSGWNPDHPAFQVLGVKPGTVPVCHFLPNDHIVWVPN  216 (216)
T ss_pred             EEEEeecCCc--eEEEEEEEecCCCCCCcchHHHHHhCCCCCCcceEEEeeCCeEEEccC
Confidence            9999999987  368999999999999999999999999999999999999999999983



It is found in the C-terminal part of a number of plant cell wall proteins, which are defined not only by the BURP domain, but also by the overall similarity in their modular construction. The BURP domain proteins consists of either three or four modules: (i) an N-terminal hydrophobic domain - a presumptive transit peptide, joined to (ii) a short conserved segment or other short segment, (iii) an optional segment consisting of repeated units which is unique to each member, and (iv) the C-terminal BURP domain. Although the BURP domain proteins share primary structural features, their expression patterns and the conditions under which they are expressed differ. The presence of the conserved BURP domain in diverse plant proteins suggests an important and fundamental functional role for this domain []. It is possible that the BURP domain represents a general motif for localization of proteins within the cell wall matrix. The other structural domains associated with the BURP domain may specify other target sites for intermolecular interactions []. Some proteins known to contain a BURP domain are listed below [, , ]: Brassica protein BNM2, which is expressed during the induction of microspore embryogenesis. Field bean USPs, abundant non-storage seed proteins with unknown function. Soybean USP-like proteins ADR6 (or SALI5-4A), an auxin-repressible, aluminium-inducible protein and SALI3-2, a protein that is up-regulated by aluminium. Soybean seed coat BURP-domain protein 1 (SCB1). It might play a role in the differentiation of the seed coat parenchyma cells. Arabidopsis RD22 drought induced protein. Maize ZRP2, a protein of unknown function in cortex parenchyma. Tomato PG1beta, the beta-subunit of polygalacturonase isozyme 1 (PG1), which is expressed in ripening fruits. Cereal RAFTIN. It is essential specifically for the maturation phase of pollen development.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00