Citrus Sinensis ID: 003369


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-----
MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRRANRQPANLSSSVLSSDSMHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGWDESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQADDDASDLDNVFKRTMPWIGDDFGVKDSQSLPGLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQNLNGGLSQMPQQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSRQNMITQNLPSGPVQAQVLQPQNLVQTSNILQQQPSIQNPQLPANLPQNLQQQQQQQHIMGQNQQQNLMQTQLPDPINQNLQMSDKQIQLHLLQKLQQQRQSLLSQQSALQQPAQLIQLQDQQRQLLDASQSFSRSGTPTQMLEMHQVTPTSLPQSNIMSQQIANSGSLNNVQFSQPPQQPKLEQQQPGILPQMPGHMGLPASHIINPVSTAGNSALTGAAGVGQSVITDDNPSCSTSPSTNNCQRLIQPTINSRTHRSAGIGEEVAQSASALLNPSALETMPSNANLVKDLPHKSDVKPSVNISKTQNQGFFTPQTYLNGAATQTDYLDTSSSTTSVCLSQNDVHLQQNNSLSYNPQSTLRDASQDGEVPVDPRSNIPYGANIDGPLGSMNPDPLLTKGMMGLGKDFSNNISSGAMLANYENSKDAQQELSSSIVSQSFGVPDMAFNSIDSTINDSSFLNGGPWAPPPQFPQRMRTYTKV
ccccccEEEEEEcccccEEEEEEEccccccccccccccccccccccccccEEEEEEcccccEEEEEEEEccccccccccccccccccHHHHHHHHHHHHccccEEEEEcccccccccEEEHHHHHHHHHccccccccEEEEEccccccccEEEEEEEEEEccccccccccccccEEEEEccccccccccccccccEEEEccccccccccccccccccccccccccccHHHccccccccccccccccccccccccHHHHccccccccHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcccccccccccccccccccccccccccccHHHHHHHHHHHHHHcHHHHHHHHHHccccHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHcccccccccccccccccccccccccccHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccc
ccccHHHHHHHHHccccEEEEEEEccccccEEEccccEEEEEccEEEcccEEEEEEcccccEEEEEEEccccccccccccEccccHHHHHHHHHHHHHHccccEEEEEccccccccEEEEHHHHHHHHHcccEEEEEEEEEEEcccccccEEEEEEEEEccccccccccccccEEEEEEccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHccccccccccccccccccccccccccccccHcccccccccccccccccHcccccccHHHcccccccccccccHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcHHcccccccccccccccccccHcccccccHHccccccccccHcccccHHHHHHcccccccccHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEccccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccHHHHcccccccccccccccccccccccccHccccccccccccccccccccEcccccccccccccccccccccccccccccccEEEccccccccEEcccccccccccHHHHHccccccccccccccccccccccccccccHHHHHHHHHHHccccccccccccccccEccccccccccccccccccccEEEEccc
MQPPAQELMARDLHDNIWTFrhiyrgqpkrhlltTGWSLFvsgkrlfagdsvLFIRDEKQQLLLGIRRanrqpanlsssvlssdsMHIGILAAAAHaaannspftvfynpraspsefvVPLAKYYKAVHSNQISLGMRFRMMFEteesgtrrymgtitgisdldpvrwknsqwrnlqvgwdestagekrnrvsiweiepvtapfficpppffrskhprqadddasdLDNVFKrtmpwigddfgvkdsqslpglsLVQWMNMQQNPSLANAMQSsymhslpgsilqnlngglsqmpqqnnlqytgqslpqqvpqidqlaklpstvnplgsnilpqqplgdisqQSRQNMitqnlpsgpvqaqvlqpqnlvqtsnilqqqpsiqnpqlpanlpqNLQQQQQQQHIMGQNQQQNLmqtqlpdpinqnlqmSDKQIQLHLLQKLQQQRQSLLSQQSALQQPAQLIQLQDQQRQLLDasqsfsrsgtptqmlemhqvtptslpqsnimsqqiansgslnnvqfsqppqqpkleqqqpgilpqmpghmglpashiinpvstagnsaltgaagvgqsvitddnpscstspstnncqrliqptinsrthrsagiGEEVAQSASAllnpsaletmpsnanlvkdlphksdvkpsvnisktqnqgfftpqtylngaatqtdyldtsssttsvclsqndvhlqqnnslsynpqstlrdasqdgevpvdprsnipyganidgplgsmnpdplltkgmmglgkdfsnnissgamlanyensKDAQQELSSSIvsqsfgvpdmafnsidstindssflnggpwapppqfpqrmrtytkv
MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRRANRQPANLSSSVLSSDSMHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHsnqislgmrFRMMFETEesgtrrymGTITgisdldpvrwkNSQWRnlqvgwdestagekrnRVSIWEIEPVTAPFFICPPPFFRSKHPRQADDDASDLDNVFKRTMPWIGDDFGVKDSQSLPGLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQNLNGGLSQMPQQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSRQNMITQNLPSGPVQAQVLQPQNLVQTSNILQQQPSIQNPQLPANLPQNLQQQQQQQHIMGQNQQQNLMQTQLPDPINQNLQMSDKQIQLHLLQKLQQQRQSLLSQQSALQQPAQLIQLQDQQRQLLDASQSFSRSGTPTQMLEMHQVTPTSLPQSNIMSQQIANSGSLNNVQFSQPPQQPKLEQQQPGILPQMPGHMGLPASHIINPVSTAGNSALTGAAGVGQSVITDDNPSCSTSPSTNNCQRLIQPTINSRTHRSAGIGEEVAQSASALLNPSALETMPSNANLVKDLPHKSDVKPSVNISKTQNQGFFTPQTYLNGAATQTDYLDTSSSTTSVCLSQNDVHLQQNNSLSYNPQSTLRDASQDGEVPVDPRSNIPYGANIDGPLGSMNPDPLLTKGMMGLGKDFSNNISSGAMLANYENSKDAQQELSSSIVSQSFGVPDMAFNSIDSTINdssflnggpwapppqfpqrmrtytkv
MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRRANRQPANlsssvlssdsMHIGILaaaahaaaNNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGWDESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQADDDASDLDNVFKRTMPWIGDDFGVKDSQSLPGLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQNLNGGLSQMPQQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSRQNMITQNLPSGPVQAQVLQPQNLVQTSNILQQQPSIqnpqlpanlpqnlqqqqqqqhimgqnqqqnlmqtqlPDPINQNLQMSDkqiqlhllqklqqqrqsllsqqsalqqpaqliqlqdqqrqlldasqsFSRSGTPTQMLEMHQVTPTSLPQSNIMSQQIANSGSLNNVQFSqppqqpkleqqqpGILPQMPGHMGLPASHIINPVSTAGNSALTGAAGVGQSVItddnpscstspstnncQRLIQPTINSRTHRSAGIGEEVAQSASALLNPSALETMPSNANLVKDLPHKSDVKPSVNISKTQNQGFFTPQTYLNGAATQTDYLDtsssttsVCLSQNDVHLQQNNSLSYNPQSTLRDASQDGEVPVDPRSNIPYGANIDGPLGSMNPDPLLTKGMMGLGKDFSNNISSGAMLANYENSKDAQQELSSSIVSQSFGVPDMAFNSIDSTINDSSFLNGGPWAPPPQFPQRMRTYTKV
**********RDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRR*****************MHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGWDESTAGEKRNRVSIWEIEPVTAPFFICPPPFFR**************DNVFKRTMPWIGDDFGVKDSQSLPGLSLVQWM************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************FFTPQTYLNGAATQ**Y*********************************************************************************************************************************************************
**PPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRR*******************IGILAAAAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGWDESTAGEKRNRVSIWEIEPVT********************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************AATQTDYLDTSSSTT*VC******************************VPVDPRSNIPYGANIDGPLGSMNPDPLLTKGMMGLGKDF*******************************FGVPDMAFNSIDSTINDSSFLNGGPW*****FPQRMRTYT**
********MARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRRANRQPANLSSSVLSSDSMHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGWDESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQADDDASDLDNVFKRTMPWIGDDFGVKDSQSLPGLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQNLNGGLSQMPQQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSRQNMITQNLPSGPVQAQVLQPQNLVQTSNILQQQPSIQNPQLPANLP*****************QQNLMQTQLPDPINQNLQMSDKQIQLHLLQKLQ**************QPAQLIQLQDQQRQLLDASQSFSRSGTPTQMLEMHQVTPTSLPQSNIMSQQIANSGSLNNVQFSQPPQQPKLEQQQPGILPQMPGHMGLPASHIINPVSTAGNSALTGAAGVGQSVITD***********NNCQRLIQPTINSRTHRSAGIGEEVAQSASALLNPSALETMPSNANLVKDLPHKSDVKPSVNISKTQNQGFFTPQTYLNGAATQTDY********SVCLSQNDVHLQQNNSLSYNPQ************PVDPRSNIPYGANIDGPLGSMNPDPLLTKGMMGLGKDFSNNISSGAMLANYEN**********SIVSQSFGVPDMAFNSIDSTINDSSFLNGGPWAPPPQFPQ********
*QPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRRANRQ***LS*S**SSD*MHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGWDESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHP**********************************GLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQNLNGGL**************************************************************************************************************************************************************************************************MLE*******S*****************************************************************************D**PSCST******CQRLI*************************LNPSALETMPSNANLVKDLPHKSDVKPSVNIS*T*NQGFFTPQTYLNGAATQTDYLDTSSSTTSVCLSQNDVHLQQNNSLSYNPQSTLRDASQDGEVPVDPRSNIPYGANIDGPLGSMNPDPLLTKGMMGLGKDFSNNISSGAMLANYENSKDAQQELSSSIVSQSFGVPDMAFNSIDSTINDSSFLNGGPWAPPPQFPQRMRTYTKV
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQQLLLGIRRANRQPANLSSSVLSSDSMHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGWDESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQADDDASDLDNVFKRTMPWIGDDFGVKDSQSLPGLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQNLNGGLSQMPQQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSRQNMITQNLPSGPVQAQVLQPQNLVQTSNILQQQPSIQNPQLPANLPQNLQQQQQQQHIMGQNQQQNLMQTQLPDPINQNLQMSDKQIQLHLLQKLQQQRQSLLSQQSALQQPAQLIQLQDQQRQLLDASQSFSRSGTPTQMLEMHQVTPTSLPQSNIMSQQIANSGSLNNVQFSQPPQQPKLEQQQPGILPQMPGHMGLPASHIINPVSTAGNSALTGAAGVGQSVITDDNPSCSTSPSTNNCQRLIQPTINSRTHRSAGIGEEVAQSASALLNPSALETMPSNANLVKDLPHKSDVKPSVNISKTQNQGFFTPQTYLNGAATQTDYLDTSSSTTSVCLSQNDVHLQQNNSLSYNPQSTLRDASQDGEVPVDPRSNIPYGANIDGPLGSMNPDPLLTKGMMGLGKDFSNNISSGAMLANYENSKDAQQELSSSIVSQSFGVPDMAFNSIDSTINDSSFLNGGPWAPPPQFPQRMRTYTKV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query825 2.2.26 [Sep-21-2011]
Q6Z2W3 1142 Auxin response factor 5 O yes no 0.464 0.335 0.660 1e-138
Q0D9R7 1161 Auxin response factor 19 yes no 0.455 0.323 0.690 1e-129
Q6YZW0 1116 Auxin response factor 21 no no 0.425 0.314 0.663 1e-128
Q8RYC81086 Auxin response factor 19 yes no 0.326 0.247 0.777 1e-121
A3B9A01055 Auxin response factor 16 no no 0.334 0.261 0.742 1e-117
A2YAA51055 Auxin response factor 16 N/A no 0.334 0.261 0.742 1e-117
P93022 1164 Auxin response factor 7 O no no 0.386 0.274 0.647 1e-111
Q6H6V4908 Auxin response factor 6 O no no 0.333 0.302 0.610 2e-90
A2X1A1908 Auxin response factor 6 O N/A no 0.333 0.302 0.610 2e-90
Q9ZTX8935 Auxin response factor 6 O no no 0.339 0.299 0.593 2e-90
>sp|Q6Z2W3|ARFE_ORYSJ Auxin response factor 5 OS=Oryza sativa subsp. japonica GN=ARF5 PE=2 SV=2 Back     alignment and function desciption
 Score =  494 bits (1272), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 265/401 (66%), Positives = 312/401 (77%), Gaps = 18/401 (4%)

Query: 1   MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQ 60
           MQPPAQEL ARD+HDN+WTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSV+ +RDEK 
Sbjct: 181 MQPPAQELQARDIHDNVWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVIVVRDEKH 240

Query: 61  QLLLGIRRANRQPANLSSSVLSSDSMHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVP 120
           QLLLGIRRANRQP N+SSSVLSSDSMHIG+LAAAAHAAAN+SPFT+FYNPRASP+EFV+P
Sbjct: 241 QLLLGIRRANRQPTNISSSVLSSDSMHIGVLAAAAHAAANSSPFTIFYNPRASPTEFVIP 300

Query: 121 LAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGW 180
            AKY KA++SNQISLGMRFRMMFETEE GTRRYMGTITGISDLDPVRWKNSQWRNLQVGW
Sbjct: 301 FAKYQKALYSNQISLGMRFRMMFETEELGTRRYMGTITGISDLDPVRWKNSQWRNLQVGW 360

Query: 181 DESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQADDDASDLDNVFKRTMPWIGD 240
           DES AGE+RNRVSIWEIEPV APFF+CP PFF  K PRQ DD+ S+++N+FKR MPW+G+
Sbjct: 361 DESAAGERRNRVSIWEIEPVAAPFFLCPQPFFGVKRPRQLDDE-SEMENLFKRAMPWLGE 419

Query: 241 DFGVKDSQ----SLPGLSLVQWMNM--QQNPSLAN-AMQSSYMHSLPGSILQNLNGGL-- 291
           +  +KD+Q    + PGLSLVQWMNM  QQ+ SLAN A QS Y+ +L    +QNL      
Sbjct: 420 EVCIKDTQNQNSTAPGLSLVQWMNMNRQQSSSLANTAAQSEYLQALGNPAMQNLAADELA 479

Query: 292 ------SQMPQQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSR 345
                 + + QQN +Q+    LPQQ+  ++ L+K    +N LG+ I PQ    D     R
Sbjct: 480 RQLYVQNNLLQQNCIQFNSPKLPQQMQTMNDLSKAAIPLNQLGAIINPQDQKQDAVNHQR 539

Query: 346 QNMITQNLPSGPVQAQVLQPQNLVQTSNILQQQPS-IQNPQ 385
           Q    Q +P    Q+ ++Q Q +VQ + + QQ+PS  QNPQ
Sbjct: 540 QQNSIQVIPLSQAQSNLVQAQVIVQ-NQMQQQKPSPTQNPQ 579




Auxin response factors (ARFs) are transcriptional factors that binds specifically to the DNA sequence 5'-TGTCTC-3' found in the auxin-responsive promoter elements (AuxREs).
Oryza sativa subsp. japonica (taxid: 39947)
>sp|Q0D9R7|ARFS_ORYSJ Auxin response factor 19 OS=Oryza sativa subsp. japonica GN=ARF19 PE=2 SV=2 Back     alignment and function description
>sp|Q6YZW0|ARFU_ORYSJ Auxin response factor 21 OS=Oryza sativa subsp. japonica GN=ARF21 PE=2 SV=2 Back     alignment and function description
>sp|Q8RYC8|ARFS_ARATH Auxin response factor 19 OS=Arabidopsis thaliana GN=ARF19 PE=1 SV=2 Back     alignment and function description
>sp|A3B9A0|ARFP_ORYSJ Auxin response factor 16 OS=Oryza sativa subsp. japonica GN=ARF16 PE=2 SV=1 Back     alignment and function description
>sp|A2YAA5|ARFP_ORYSI Auxin response factor 16 OS=Oryza sativa subsp. indica GN=ARF16 PE=2 SV=1 Back     alignment and function description
>sp|P93022|ARFG_ARATH Auxin response factor 7 OS=Arabidopsis thaliana GN=ARF7 PE=1 SV=2 Back     alignment and function description
>sp|Q6H6V4|ARFF_ORYSJ Auxin response factor 6 OS=Oryza sativa subsp. japonica GN=ARF6 PE=1 SV=1 Back     alignment and function description
>sp|A2X1A1|ARFF_ORYSI Auxin response factor 6 OS=Oryza sativa subsp. indica GN=ARF6 PE=2 SV=1 Back     alignment and function description
>sp|Q9ZTX8|ARFF_ARATH Auxin response factor 6 OS=Arabidopsis thaliana GN=ARF6 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query825
224118764 1113 predicted protein [Populus trichocarpa] 1.0 0.741 0.754 0.0
255557566 1109 Auxin response factor, putative [Ricinus 0.989 0.735 0.717 0.0
73697836 1099 auxin response factor 2 [Gossypium arbor 0.986 0.740 0.690 0.0
359484941 1117 PREDICTED: auxin response factor 5-like 0.996 0.735 0.742 0.0
449470376 1107 PREDICTED: LOW QUALITY PROTEIN: uncharac 0.987 0.736 0.670 0.0
47496694 1107 auxin response factor 2 [Cucumis sativus 0.987 0.736 0.670 0.0
449522288 1107 PREDICTED: uncharacterized LOC101205542 0.987 0.736 0.670 0.0
356520887 1110 PREDICTED: auxin response factor 25-like 0.991 0.736 0.681 0.0
297736017 1034 unnamed protein product [Vitis vinifera] 0.915 0.730 0.698 0.0
356496641 1104 PREDICTED: auxin response factor 6-like 0.992 0.741 0.679 0.0
>gi|224118764|ref|XP_002331440.1| predicted protein [Populus trichocarpa] gi|222873654|gb|EEF10785.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score = 1081 bits (2796), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 637/844 (75%), Positives = 700/844 (82%), Gaps = 19/844 (2%)

Query: 1    MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQ 60
            MQPPAQEL+ARDLHDN+WTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLF+RDEKQ
Sbjct: 163  MQPPAQELVARDLHDNVWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFMRDEKQ 222

Query: 61   QLLLGIRRANRQPANLSSSVLSSDSMHIGILAAAAHAAANNSPFTVFYNPRASPSEFVVP 120
            QLLLGIRRANRQP NLSSSVLSSDSMHIGILAAAAHAAANNSPFTV+YNPRASPSEFV+P
Sbjct: 223  QLLLGIRRANRQPTNLSSSVLSSDSMHIGILAAAAHAAANNSPFTVYYNPRASPSEFVIP 282

Query: 121  LAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGW 180
            LAKYYKAV+SNQISLGMRFRMMFETEESGTRR+MGTITGISDLD VRWKNSQWRNLQVGW
Sbjct: 283  LAKYYKAVYSNQISLGMRFRMMFETEESGTRRHMGTITGISDLDAVRWKNSQWRNLQVGW 342

Query: 181  DESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQ---ADDDASDLDNVFKRTMPW 237
            DESTAGE+RNRVSIWEIEPVTAPFFICPPPFFRSKHPRQ    DDD++D D++FKRTMPW
Sbjct: 343  DESTAGERRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQPGMPDDDSTDFDSLFKRTMPW 402

Query: 238  IGDDFGVKDSQSLPGLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQNLNGG-LS---- 292
            +GDD  +KD Q LPGLSL Q MNMQQNPSLAN+MQ +YM SL GS+LQNL GG LS    
Sbjct: 403  LGDDIYMKDPQVLPGLSLAQRMNMQQNPSLANSMQPNYMQSLSGSVLQNLPGGDLSRQLG 462

Query: 293  ----QMPQQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSRQNM 348
                QMPQ NNLQ+  Q LPQQ  Q+DQL KL S +NPLGS I  QQ +GDI+QQSRQNM
Sbjct: 463  LSSPQMPQPNNLQFNAQRLPQQAQQLDQLPKLQSLLNPLGSIIQSQQQMGDITQQSRQNM 522

Query: 349  ITQNLPSGPVQAQVLQPQNLVQTSNILQQQPSIQNPQLPANLPQNLQ--QQQQQQHIMGQ 406
            + Q LPS  VQAQ+LQPQ L QT+NILQQQPSIQ+ QL  NLPQ L   QQ QQQHIMGQ
Sbjct: 523  MAQTLPSSQVQAQLLQPQTLAQTNNILQQQPSIQSHQLLRNLPQTLHQQQQNQQQHIMGQ 582

Query: 407  NQQQNLMQTQLPDPINQNLQMSDKQIQLHLLQKLQQQRQSLLSQQSALQQPAQLIQLQDQ 466
            NQQQ+LMQ+QL D +NQ++QMSD QIQ  L+QKLQQQ+QS+ +QQSA+ Q  QL QLQD 
Sbjct: 583  NQQQSLMQSQLSDQVNQHMQMSDNQIQSQLMQKLQQQQQSVSAQQSAMHQAGQLGQLQDS 642

Query: 467  QRQLLDASQSFSRSGTPTQMLEMHQVTPTSLPQSNIMSQQIANSGSLNNVQFSQPPQQPK 526
            QRQLLDASQSFSRS TP+QMLE+ Q TPTSLPQ N + QQ+  + +  N +FS  PQQ K
Sbjct: 643  QRQLLDASQSFSRSMTPSQMLEIPQTTPTSLPQPNTIPQQMTKNNNQTNTRFSHLPQQLK 702

Query: 527  LEQQQPGI--LPQMPGHMGLPASHIINPVSTAGNSALTGAAGVGQSVITDDNPSCSTSPS 584
             +QQ  GI  L +M GHMGLP S + N +STAG+S LT AAG GQS ITDD PSCSTSPS
Sbjct: 703  PQQQHSGIMLLSEMAGHMGLPPSSMANQLSTAGSSILTAAAGPGQSGITDDVPSCSTSPS 762

Query: 585  TNNCQRLIQPTINSRTHRSAGIGEEVAQSASALLNPSALETMPSNANLVKDLPHKSDVKP 644
            TNNC  ++QP IN   HRS  +GE++AQSA  L +PSALET+ SN NLVKDL  KS+VKP
Sbjct: 763  TNNCPNIVQPMINGWAHRSTAMGEDMAQSAVTLFSPSALETVSSNGNLVKDLLQKSEVKP 822

Query: 645  SVNISKTQNQGFFTPQTYLNGAATQTDYLDTSSSTTSVCLSQNDVHLQQ-NNSLSYNPQS 703
            S+NISK QN G F+ QTYLNG A Q DYLDTSSSTTSVCLSQNDVHLQQ NNSLSYNPQS
Sbjct: 823  SLNISKNQNPGLFSSQTYLNGVAAQIDYLDTSSSTTSVCLSQNDVHLQQNNNSLSYNPQS 882

Query: 704  T-LRDASQDGEVPVDPRSNIPYGANIDGPLGS-MNPDPLLTKGMMGLGKDFSNNISSGAM 761
              LRDAS DGE+  DPR+NI YG NID  L   +N D LLTKGMMGLGKDFSNN SSG M
Sbjct: 883  VLLRDASHDGELQGDPRNNILYGTNIDSQLVMPINSDHLLTKGMMGLGKDFSNNFSSGGM 942

Query: 762  LANYENSKDAQQELSSSIVSQSFGVPDMAFNSIDSTINDSSFLNGGPWAPPPQFPQRMRT 821
            L N ENSKD QQELSS+IVS+SFGVPDM FNSIDSTINDSS LN G WAPP Q  QRMRT
Sbjct: 943  LTNCENSKDPQQELSSAIVSKSFGVPDMPFNSIDSTINDSSLLNRGSWAPPQQQFQRMRT 1002

Query: 822  YTKV 825
            YTKV
Sbjct: 1003 YTKV 1006




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255557566|ref|XP_002519813.1| Auxin response factor, putative [Ricinus communis] gi|223541052|gb|EEF42609.1| Auxin response factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|73697836|gb|AAZ81522.1| auxin response factor 2 [Gossypium arboreum] Back     alignment and taxonomy information
>gi|359484941|ref|XP_002266603.2| PREDICTED: auxin response factor 5-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|449470376|ref|XP_004152893.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101205542 [Cucumis sativus] Back     alignment and taxonomy information
>gi|47496694|dbj|BAD19062.1| auxin response factor 2 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449522288|ref|XP_004168159.1| PREDICTED: uncharacterized LOC101205542 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356520887|ref|XP_003529091.1| PREDICTED: auxin response factor 25-like [Glycine max] Back     alignment and taxonomy information
>gi|297736017|emb|CBI24055.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356496641|ref|XP_003517174.1| PREDICTED: auxin response factor 6-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query825
TAIR|locus:2202205 1086 ARF19 "auxin response factor 1 0.442 0.336 0.587 2.2e-129
TAIR|locus:2204237935 ARF6 "auxin response factor 6" 0.444 0.392 0.476 1.8e-88
TAIR|locus:2035454902 MP "MONOPTEROS" [Arabidopsis t 0.443 0.405 0.471 6.5e-83
TAIR|locus:2152642811 ARF8 "auxin response factor 8" 0.431 0.438 0.475 8.1e-76
TAIR|locus:2174013859 ARF2 "auxin response factor 2" 0.261 0.251 0.502 8.7e-61
TAIR|locus:2175098788 ARF4 "auxin response factor 4" 0.299 0.313 0.462 8.4e-53
TAIR|locus:2138096638 ARF9 "auxin response factor 9" 0.327 0.423 0.401 3.1e-51
TAIR|locus:2025991665 ARF1 "auxin response factor 1" 0.261 0.324 0.465 4e-51
TAIR|locus:2039124622 ARF11 "auxin response factor 1 0.247 0.327 0.466 6.8e-49
TAIR|locus:2076765602 ARF18 "auxin response factor 1 0.244 0.335 0.475 4.9e-48
TAIR|locus:2202205 ARF19 "auxin response factor 19" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1119 (399.0 bits), Expect = 2.2e-129, Sum P(3) = 2.2e-129
 Identities = 225/383 (58%), Positives = 268/383 (69%)

Query:     1 MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQ 60
             MQPPAQE++A+DLHD  WTFRHIYRGQPKRHLLTTGWS+FVS KRLFAGDSVLF+RDEK 
Sbjct:   159 MQPPAQEIVAKDLHDTTWTFRHIYRGQPKRHLLTTGWSVFVSTKRLFAGDSVLFVRDEKS 218

Query:    61 QLLLGIRRANRQPANXXXXXXXXXXMHIGILXXXXXXXXNNSPFTVFYNPRASPSEFVVP 120
             QL+LGIRRANRQ             MHIGIL        N+SPFT+F+NPRASPSEFVVP
Sbjct:   219 QLMLGIRRANRQTPTLSSSVISSDSMHIGILAAAAHANANSSPFTIFFNPRASPSEFVVP 278

Query:   121 LAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQWRNLQVGW 180
             LAKY KA+++ Q+SLGMRFRMMFETE+ G RRYMGT+TGISDLDPVRWK SQWRNLQVGW
Sbjct:   279 LAKYNKALYA-QVSLGMRFRMMFETEDCGVRRYMGTVTGISDLDPVRWKGSQWRNLQVGW 337

Query:   181 DESTAGEKRNRVSIWEIEPVTAPFFICPPPFFRSKHPRQAD--DDASDLDNVFKRTMPWI 238
             DESTAG++ +RVSIWEIEPV  PF+ICPPPFFR K+PRQ    DD  D++N FKR MPW+
Sbjct:   338 DESTAGDRPSRVSIWEIEPVITPFYICPPPFFRPKYPRQPGMPDDELDMENAFKRAMPWM 397

Query:   239 GDDFGVKDSQS--LPGLSLVQWMNMQQNPSLANAMQSSYMHSLPGSILQ-NLNGGLSQMP 295
             G+DFG+KD+QS   PGLSLVQWM+MQQN    N +  S    LP ++   NL    +   
Sbjct:   398 GEDFGMKDAQSSMFPGLSLVQWMSMQQN----NPLSGSATPQLPSALSSFNLPNNFASND 453

Query:   296 QQNNLQYTGQSLPQQVPQIDQLAKLPSTVNPLGSNILPQQPLGDISQQSRQNMITQNLPS 355
                 L +   +L     Q ++    P+TVN + S  +  QP    SQQ +Q    Q+   
Sbjct:   454 PSKLLNFQSPNLSSANSQFNK----PNTVNHI-SQQMQAQPAMVKSQQQQQQQQQQHQHQ 508

Query:   356 GPVQAQVLQPQNLVQTSNILQQQ 378
                Q Q+ Q Q L  +   +QQQ
Sbjct:   509 ---QQQLQQQQQLQMSQQQVQQQ 528


GO:0005634 "nucleus" evidence=ISM;IEA;IDA;TAS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0009725 "response to hormone stimulus" evidence=IEA
GO:0009733 "response to auxin stimulus" evidence=RCA;IMP;NAS
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0048366 "leaf development" evidence=IGI
GO:0048527 "lateral root development" evidence=IGI;RCA
GO:0009723 "response to ethylene stimulus" evidence=IMP
GO:0003677 "DNA binding" evidence=IDA
GO:0010311 "lateral root formation" evidence=IGI
GO:0007155 "cell adhesion" evidence=RCA
GO:0009741 "response to brassinosteroid stimulus" evidence=RCA
GO:0010090 "trichome morphogenesis" evidence=RCA
GO:0010583 "response to cyclopentenone" evidence=RCA
GO:0045010 "actin nucleation" evidence=RCA
GO:0048765 "root hair cell differentiation" evidence=RCA
GO:0071555 "cell wall organization" evidence=RCA
GO:0043565 "sequence-specific DNA binding" evidence=IDA
GO:0046983 "protein dimerization activity" evidence=IDA
TAIR|locus:2204237 ARF6 "auxin response factor 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2035454 MP "MONOPTEROS" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2152642 ARF8 "auxin response factor 8" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2174013 ARF2 "auxin response factor 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2175098 ARF4 "auxin response factor 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2138096 ARF9 "auxin response factor 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2025991 ARF1 "auxin response factor 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2039124 ARF11 "auxin response factor 11" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2076765 ARF18 "auxin response factor 18" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pg.C_1640064
hypothetical protein (1113 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query825
pfam0650783 pfam06507, Auxin_resp, Auxin response factor 7e-46
pfam0236297 pfam02362, B3, B3 DNA binding domain 6e-17
smart0101996 smart01019, B3, B3 DNA binding domain 4e-15
cd1001798 cd10017, B3_DNA, Plant-specific B3-DNA binding dom 2e-09
pfam09770 804 pfam09770, PAT1, Topoisomerase II-associated prote 2e-08
pfam09606768 pfam09606, Med15, ARC105 or Med15 subunit of Media 3e-07
pfam09770 804 pfam09770, PAT1, Topoisomerase II-associated prote 3e-06
pfam09606 768 pfam09606, Med15, ARC105 or Med15 subunit of Media 4e-06
pfam09770 804 pfam09770, PAT1, Topoisomerase II-associated prote 3e-05
pfam13779820 pfam13779, DUF4175, Domain of unknown function (DU 4e-05
pfam09770804 pfam09770, PAT1, Topoisomerase II-associated prote 3e-04
pfam09606768 pfam09606, Med15, ARC105 or Med15 subunit of Media 4e-04
pfam03153332 pfam03153, TFIIA, Transcription factor IIA, alpha/ 5e-04
pfam13779820 pfam13779, DUF4175, Domain of unknown function (DU 0.002
pfam03153332 pfam03153, TFIIA, Transcription factor IIA, alpha/ 0.003
>gnl|CDD|191545 pfam06507, Auxin_resp, Auxin response factor Back     alignment and domain information
 Score =  158 bits (401), Expect = 7e-46
 Identities = 58/84 (69%), Positives = 70/84 (83%), Gaps = 1/84 (1%)

Query: 94  AAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRY 153
           AAHAA+  SPF VFYNPRAS SEFVVP AKY KA+ +N  S+GMRF+M FETE+S  RR+
Sbjct: 1   AAHAASTGSPFHVFYNPRASTSEFVVPYAKYLKAM-NNPFSVGMRFKMRFETEDSSERRF 59

Query: 154 MGTITGISDLDPVRWKNSQWRNLQ 177
            GTI+G+SDLDP+RW NS+WR+LQ
Sbjct: 60  SGTISGVSDLDPIRWPNSKWRSLQ 83


A conserved region of auxin-responsive transcription factors. Length = 83

>gnl|CDD|216995 pfam02362, B3, B3 DNA binding domain Back     alignment and domain information
>gnl|CDD|214977 smart01019, B3, B3 DNA binding domain Back     alignment and domain information
>gnl|CDD|197383 cd10017, B3_DNA, Plant-specific B3-DNA binding domain Back     alignment and domain information
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1 Back     alignment and domain information
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex non-fungal Back     alignment and domain information
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1 Back     alignment and domain information
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex non-fungal Back     alignment and domain information
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1 Back     alignment and domain information
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175) Back     alignment and domain information
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1 Back     alignment and domain information
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex non-fungal Back     alignment and domain information
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit Back     alignment and domain information
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175) Back     alignment and domain information
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 825
PF0650783 Auxin_resp: Auxin response factor; InterPro: IPR01 100.0
KOG06441113 consensus Uncharacterized conserved protein, conta 99.58
PF02362100 B3: B3 DNA binding domain; InterPro: IPR003340 Two 99.15
>PF06507 Auxin_resp: Auxin response factor; InterPro: IPR010525 This pattern represents a conserved region of auxin-responsive transcription factors Back     alignment and domain information
Probab=100.00  E-value=5.7e-36  Score=263.64  Aligned_cols=83  Identities=67%  Similarity=1.128  Sum_probs=81.7

Q ss_pred             HHHHHhcCCceEEEEcCCCCCCccccchHHHHHhhhhcCccccceeeeeeeccCCCcceeeEEEEEeecCCCCCCCCCCc
Q 003369           94 AAHAAANNSPFTVFYNPRASPSEFVVPLAKYYKAVHSNQISLGMRFRMMFETEESGTRRYMGTITGISDLDPVRWKNSQW  173 (825)
Q Consensus        94 AahaAatgspFtV~Y~PRas~sEFVVp~~kY~kAm~~~~ws~GMRFRM~FEtEDss~rry~GTItgVsd~DPvrWP~S~W  173 (825)
                      |+|||++|++|+|+||||++++|||||++||++||+ ++|++||||||.||+||+++++|+|||+||+|+||+|||+|+|
T Consensus         1 A~~aa~~~~~F~V~Y~PRa~~sEFVV~~~k~~~al~-~~~~~GmRfkM~fE~eds~~~~~~GtI~~v~~~dp~~w~~S~W   79 (83)
T PF06507_consen    1 AAHAAATGSPFEVFYYPRASPSEFVVPASKYDKALN-HPWSVGMRFKMRFETEDSSERRWQGTIVGVSDLDPIRWPGSKW   79 (83)
T ss_pred             ChhHhhcCCeEEEEECCCCCCcceEEEHHHHHHHhc-CCCCCCcEEEEEeccCCCccceeeeEEeEeeccCCCCCCCCCc
Confidence            689999999999999999999999999999999999 9999999999999999999999999999999999999999999


Q ss_pred             ccee
Q 003369          174 RNLQ  177 (825)
Q Consensus       174 R~Lk  177 (825)
                      ||||
T Consensus        80 R~Lq   83 (83)
T PF06507_consen   80 RMLQ   83 (83)
T ss_pred             ccCc
Confidence            9997



The plant hormone auxin (indole-3-acetic acid) can regulate the gene expression of several families, including Aux/IAA, GH3 and SAUR families. Two related families of proteins, Aux/IAA proteins (IPR003311 from INTERPRO) and the auxin response factors (ARF), are key regulators of auxin-modulated gene expression []. There are multiple ARF proteins, some of which activate, while others repress transcription. ARF proteins bind to auxin-responsive cis-acting promoter elements (AuxREs) using an N-terminal DNA-binding domain. It is thought that Aux/IAA proteins activate transcription by modifying ARF activity through the C-terminal protein-protein interaction domains (IPR011525 from INTERPRO) found in both Aux/IAA and ARF proteins. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0009725 response to hormone stimulus, 0005634 nucleus

>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only] Back     alignment and domain information
>PF02362 B3: B3 DNA binding domain; InterPro: IPR003340 Two DNA binding proteins, RAV1 and RAV2 from Arabidopsis thaliana contain two distinct amino acid sequence domains found only in higher plant species Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query825
1wid_A130 Solution Structure Of The B3 Dna-Binding Domain Of 9e-05
>pdb|1WID|A Chain A, Solution Structure Of The B3 Dna-Binding Domain Of Rav1 Length = 130 Back     alignment and structure

Iteration: 1

Score = 45.8 bits (107), Expect = 9e-05, Method: Composition-based stats. Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 2/56 (3%) Query: 12 DLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRD--EKQQLLLG 65 D++ +W FR+ Y + ++LT GWS FV K L AGD V F R + QQL +G Sbjct: 58 DVNGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDVVSFSRSNGQDQQLYIG 113 Database: pdbaa

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query825
1wid_A130 DNA-binding protein RAV1; DNA-binding domain, stru 2e-24
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-10
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-04
1m2v_B 926 SEC24, protein transport protein SEC24, SEC24P, SE 5e-05
3lj5_A725 Portal protein, protein GP1; DNA ejection, molecul 4e-04
>1wid_A DNA-binding protein RAV1; DNA-binding domain, structural genomics, riken structural genomics/proteomics initiative, RSGI; NMR {Arabidopsis thaliana} SCOP: b.142.1.2 Length = 130 Back     alignment and structure
 Score = 98.3 bits (244), Expect = 2e-24
 Identities = 29/83 (34%), Positives = 41/83 (49%)

Query: 1   MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQ 60
           +      L   D++  +W FR+ Y    + ++LT GWS FV  K L AGD V F R   Q
Sbjct: 47  VSVKGVLLNFEDVNGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDVVSFSRSNGQ 106

Query: 61  QLLLGIRRANRQPANLSSSVLSS 83
              L I   +R  ++L +S  SS
Sbjct: 107 DQQLYIGWKSRSGSDLDASGPSS 129


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1m2v_B SEC24, protein transport protein SEC24, SEC24P, SEC24 protein, abnormal nuclear; zinc-finger, beta barrel, VWA domain, gelsolin domain,; 2.75A {Saccharomyces cerevisiae} SCOP: a.71.2.1 b.2.8.1 c.62.1.2 d.109.2.1 g.41.10.1 Length = 926 Back     alignment and structure
>3lj5_A Portal protein, protein GP1; DNA ejection, molecular motor, DNA packaging podoviridae, virus assembly, tail tube, trunk domain; 7.50A {Enterobacteria phage P22} Length = 725 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query825
1wid_A130 DNA-binding protein RAV1; DNA-binding domain, stru 99.73
4i1k_A146 B3 domain-containing transcription factor VRN1; B3 99.0
1yel_A104 AT1G16640; CESG, protein structure initiative, str 98.72
>1wid_A DNA-binding protein RAV1; DNA-binding domain, structural genomics, riken structural genomics/proteomics initiative, RSGI; NMR {Arabidopsis thaliana} SCOP: b.142.1.2 Back     alignment and structure
Probab=99.73  E-value=3.2e-18  Score=158.65  Aligned_cols=71  Identities=35%  Similarity=0.560  Sum_probs=65.8

Q ss_pred             CCccEEEEeecCCCeeEeeEEEcCCCcccccccccceeecCCCcCCCCeEEEEEcCC--CcEEEEEEeccCCC
Q 003369            3 PPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEK--QQLLLGIRRANRQP   73 (825)
Q Consensus         3 pP~QeLvarDlhGk~W~FRhiYRG~PkRhlLTtGWS~FVk~KrL~AGDsVVF~R~e~--geL~vGIRRA~R~~   73 (825)
                      ++.++|.++|.+|++|+|||+||+++++|+||+||+.||++|+|++||.|+|+|.++  ++|+|+|||+.+..
T Consensus        49 ~~~~~l~l~D~~Gk~W~fr~~~~~~~~~~~Lt~GW~~FV~~~~L~~GD~~~F~~~~~~~~~l~I~~rr~~~~~  121 (130)
T 1wid_A           49 VKGVLLNFEDVNGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDVVSFSRSNGQDQQLYIGWKSRSGSD  121 (130)
T ss_dssp             SCCEEEEEEETTTEEEEEEEEEETTTTEEEEESSHHHHHHHTTCCTTCEEEEEECCSSSCCEEEEEECCCSCS
T ss_pred             CCcEEEEEEeCCCCEEEEEEEEECCCCceEEcCChHHHHHHcCCCCCCEEEEEEecCCCcEEEEEEEECCCCC
Confidence            466899999999999999999999999999999999999999999999999999874  67999999988753



>4i1k_A B3 domain-containing transcription factor VRN1; B3 domain beta-barrel, DNA binding protein; 1.60A {Arabidopsis thaliana} Back     alignment and structure
>1yel_A AT1G16640; CESG, protein structure initiative, structural genomics, center for eukaryotic structural genomics, unknown function; NMR {Arabidopsis thaliana} SCOP: b.142.1.2 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 825
d1wida_117 b.142.1.2 (A:) DNA-binding protein RAV1 {Thale cre 3e-18
d1yela1102 b.142.1.2 (A:1-102) At1g16640 {Thale cress (Arabid 2e-09
d1na6a1175 b.142.1.1 (A:4-178) Restriction endonuclease EcoRI 7e-07
>d1wida_ b.142.1.2 (A:) DNA-binding protein RAV1 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 117 Back     information, alignment and structure

class: All beta proteins
fold: DNA-binding pseudobarrel domain
superfamily: DNA-binding pseudobarrel domain
family: B3 DNA binding domain
domain: DNA-binding protein RAV1
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 79.3 bits (195), Expect = 3e-18
 Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 1   MQPPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDEKQ 60
           +      L   D++  +W FR+ Y    + ++LT GWS FV  K L AGD V F R   Q
Sbjct: 40  VSVKGVLLNFEDVNGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDVVSFSRSNGQ 99

Query: 61  --QLLLGIRRANRQ 72
             QL +G +  +  
Sbjct: 100 DQQLYIGWKSRSGS 113


>d1yela1 b.142.1.2 (A:1-102) At1g16640 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 102 Back     information, alignment and structure
>d1na6a1 b.142.1.1 (A:4-178) Restriction endonuclease EcoRII, N-terminal domain {Escherichia coli [TaxId: 562]} Length = 175 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query825
d1wida_117 DNA-binding protein RAV1 {Thale cress (Arabidopsis 99.57
d1yela1102 At1g16640 {Thale cress (Arabidopsis thaliana) [Tax 98.77
d1na6a1175 Restriction endonuclease EcoRII, N-terminal domain 98.68
>d1wida_ b.142.1.2 (A:) DNA-binding protein RAV1 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All beta proteins
fold: DNA-binding pseudobarrel domain
superfamily: DNA-binding pseudobarrel domain
family: B3 DNA binding domain
domain: DNA-binding protein RAV1
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.57  E-value=1e-15  Score=135.95  Aligned_cols=69  Identities=36%  Similarity=0.586  Sum_probs=63.9

Q ss_pred             CCccEEEEeecCCCeeEeeEEEcCCCcccccccccceeecCCCcCCCCeEEEEEcC--CCcEEEEEEeccC
Q 003369            3 PPAQELMARDLHDNIWTFRHIYRGQPKRHLLTTGWSLFVSGKRLFAGDSVLFIRDE--KQQLLLGIRRANR   71 (825)
Q Consensus         3 pP~QeLvarDlhGk~W~FRhiYRG~PkRhlLTtGWS~FVk~KrL~AGDsVVF~R~e--~geL~vGIRRA~R   71 (825)
                      ++..++.++|.+|++|.|+|+||+..++|+|++||..||++++|++||.|+|+|.+  +++++|++||+..
T Consensus        42 ~~~~~~~~~d~~g~~W~~~~~~~~~~~~~~l~~GW~~Fv~~~~Lk~GD~~~F~~~~~~~~~~~i~~r~~~~  112 (117)
T d1wida_          42 VKGVLLNFEDVNGKVWRFRYSYWNSSQSYVLTKGWSRFVKEKNLRAGDVVSFSRSNGQDQQLYIGWKSRSG  112 (117)
T ss_dssp             SCCEEEEEEETTTEEEEEEEEEETTTTEEEEESSHHHHHHHTTCCTTCEEEEEECCSSSCCEEEEEECCCS
T ss_pred             CCcEEEEEEeCCCCEEEEEEEEECCCCceEEecCHHHHHHHcCCCCCCEEEEEEEeCCCCEEEEEEEECCC
Confidence            45778999999999999999999888999999999999999999999999999974  6799999999875



>d1yela1 b.142.1.2 (A:1-102) At1g16640 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
>d1na6a1 b.142.1.1 (A:4-178) Restriction endonuclease EcoRII, N-terminal domain {Escherichia coli [TaxId: 562]} Back     information, alignment and structure