Citrus Sinensis ID: 000633


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------840-------850-------860-------870-------880-------890-------900-------910-------920-------930-------940-------950-------960-------970-------980-------990------1000------1010------1020------1030------1040------1050------1060------1070------1080------1090------1100------1110------1120------1130------1140------1150------1160------1170------1180------1190------1200------1210------1220------1230------1240------1250------1260------1270------1280------1290------1300------1310------1320------1330------1340------1350------1360------1370------1380
MAPSRRKGASKAAAAAAARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPSSEEELKSSLQTEEKPSKDTAVNLVEHISPSLNNYDEGFTEKGPSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQSSSSSPYARSQYKKSLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAYSRRSSRTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAVTPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQDVPPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPVGIRAPGAWRPR
cccccccccHHHHHHHHHHcccccccEEEEEccccccccccccccccccccccccEEEEEEEccccEEEcccccccccHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcccccHHHcccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHccccccccccccccccHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHccccccccccEEEHHHHHHHHccccHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
cccccccccHHHHHHHHHHHHcccccEEEEEEccccccccEEccccccccccccccEEEEEEccccEEEccHHHcccccHHHHHHHccccccccccHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEcccccccccccccccccccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccEEEHcccccccccccccccccccHccccccccccHcccccccccccccccccccccccccccccccHHccccccccccHcccccccccccccccccccccHccccccccccccccccccccccccccccccccccccccccccccEcccccccccccccHHccccccHHHHHHHHHHccccHHHccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEcccccccccccccccccccccccccccccccccccccccccEcccccccccccccccccccccccccccEccccccccccccccccccccHHHHHHHHHHHHHHccccccccccccHHcccccccccccccccccccccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHccccccHHEEHHHHHHHHHHHcccHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHccccccccccccccccccccccccccHHHHcccEEcccccccccccccccccHHcccccccccccccccccccccccccccccccccccHHcccEEEEEcccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
MAPSRRKGASKAAAAAAARRQWKVGDLVLAKvkgfpawpatvsepekwgysadWKKVLVFFFGtqqiafcnpadveaFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKkqdqvdsnsgdeltvanggnsvnsishlkdrteaseatldsqmkpsnstagdglnlptedspagrqldalpakeplpeqpsenlvakatpvlttyssrkrsggsrlqstqrmapstrrsrsstmVESCRLqnlmmpynnegknaegisaksildgslirnkrtrkspdgsecndldssalmsngsiednsseivtvesdafslnegstvdsgckvedsetvlecldgdemLSKRLDFQIKAVVVKkkrkpnrkrvcndavdpparintatevdvstrntchssentggnlderdfkedgdehlPLVKRARvrmgkpsseeELKSslqteekpskdTAVNLVEhispslnnydegftekgpslvkestdsvspskvcsevsgnrpqlwkgttnqsfgcsadgeaalppsKRLHRALEAMSANAAEEGQASVQASSLintsingccvnsickcshetvdsrersgsglqnvptcdqlsencnsqkqesfrddvgsvdnvdgkdlpgspfsvhTIQTAVqtqtpvnilpdpkkrhssFQLYQNSldqlslkdegsaedlqlkdsrvenvdkefNTSALVelsldpvsgadesvklspqngsnelqysvqgmsyenseslksqiddnchINARCEAVEEIKQNEKqkemssvsisddhlgdkgVASVlfssspaegvdsparvsppntslchvstsesanivqssssspyarsqykkslgapvadegkvdttltqrpksvgkwsscsEAHAALSSFEAVLGSLTRTKESIGRATRIAIdcakfgvsSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRllsaaappgnvaqeNRRQCLKVLRLWLERRILPESIIRHHMreldtvtcsssavaysrrssrteralddpvrdmEGMLVdeygsnssfqlpgfcmprmlkddddgsdsdggsfeavtpehnseipeerdanpsmkkHRHILEEvdgelemedvaptcdnemssTVLVDIAQTshdqllsfvpplpqdvppsspplpsspppvlppppsiphscafsdsysngasmqnmqndgqqsvtqqsvaprinstvstnavhyhasecrdhqmpmqmpestssfgcysmrpsnnfqqtdgprfhnkpypprpphapqsnhfsyvqasqsaksrreaphpsnshrfhphpnfdggnfynnhdrmkpgpyehreswrfsapsfsgprypdearecypsgsyggplrepprysnrgwaypprpmnhrhmrppsggavpvgirapgawrpr
mapsrrkgaSKAAAAAAARRQWKVGDLVLAKVKgfpawpatvsepekwgysADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSlltkrqgrgadFVRAVQEIIDSYEklkkqdqvdsnsgdeltvanggnsvnsisHLKDRTEASEatldsqmkpsnstagdglnLPTEDSPAGRQLDALPAKeplpeqpsenlvakatpvlttyssrkrsggsrlqstqrmapstrrsrsstmvESCRLQNLMMPYNNEgknaegisaksildgslirnkrtrkspdgsecndlDSSALMSNGSIEDNSSEIVTVESDAFslnegstvdsgckvedsETVLECLDGDEMLSKRLDFQIKavvvkkkrkpnrkrvcndavdpparintatevdvstrntchssentggnlderdfkedgdehlplvkrarvrmgkpsseeelksslqteekpskdTAVNLVEHispslnnydeGFTEKGPSLVkestdsvspskvcsevsgnrpqlwkgttnqsfgcSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADEsvklspqngsnELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTsesanivqssssspyarSQYKKSlgapvadegkvdttltqrpksvgkwSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIaidcakfgvsSKVVEIVARHlesesslyrrVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAappgnvaqenrrQCLKVLRLWLERRILPesiirhhmreldtvtcsssavaysrrssrteralddpvrDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFeavtpehnseipeerdanpsmkkHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQDVPPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHpsnshrfhphpnFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYpsgsyggplrepprySNRGWAYPPRPMNHRHmrppsggavpvgirapgawrpr
MapsrrkgaskaaaaaaarrQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIkavvvkkkrkpnrkrvCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPsseeelksslqteekPSKDTAVNLVEHISPSLNNYDEGFTEKGPSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQsssssPYARSQYKKSLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCsssavaysrrssrTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKddddgsdsdggsFEAVTPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFvpplpqdvppsspplpsspppvlppppsipHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYpprpmnhrhmrppSGGAVPVGIRAPGAWRPR
*****************ARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFT**************GADFVRAVQEIIDS*******************************************************************************************************************************************************************************************************************************ETVLECLDGDEMLSKRLDFQIKAVVVK*********************************************************************************************************************************************************************************************LINTSINGCCVNSICKCSH***************************************************************************************************************************************************************************HI***********************************************************************************************************************HAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCS******************************************************************************************************************VLVDIA******************************************************************************************************************************************************************************************************************************************************************************
************************GDLVLAKVKGFPAWPATVSEP******ADWKKVLVFFFGTQQIAFCNPADVEAFTEEKK***********ADFVRAVQEIIDSY*******************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************ALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVT**********************VRDMEGMLVDEYGSNSSFQLPGFCMPRML*************************************HRHILEEVDGELEMEDVAP*********************************************************************************************************************************************************************************************************************************************************************************************
********************QWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKL***********DELTVANGGNSVNSISHL*********************AGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTT*******************************ESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNK************DLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARV************************TAVNLVEHISPSLNNYDEGFTEKG**********************NRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAM*************ASSLINTSINGCCVNSICKCSH************LQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEE*****************DHLGDKGVASVLFSSS**************NTSL*************************************KVDTTL**************EAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTV******************ALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDD********************************KKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQD**************VLPPPPSIPHSCAFSDSYSNGASMQN**************APRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNH**********************HRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPVGIRAPGAWRPR
******************RRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLK**********************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************FGCSADGEAALPPSKRL*RALEAMSAN***********************************************************************************************************************************************************************************************************************************************************************************************************************************AHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTV*****************RALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRML**********************************MKKHRHILEEVDGELEMEDV************************************************VLPPPPSIPHSCAF*DSYSNG**********QQS**QQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPV***AP******
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAPSRRKGASKAAAAAAARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPSSEEELKSSLQTEEKPSKDTAVNLVEHISPSLNNYDEGFTEKGPSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQSSSSSPYARSQYKKSLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAYSRRSSRTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAVTPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQDVPPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPVGIRAPGAWRPR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query1380 2.2.26 [Sep-21-2011]
F4IZM81347 HUA2-like protein 2 OS=Ar yes no 0.923 0.945 0.400 0.0
F4IN781366 HUA2-like protein 3 OS=Ar no no 0.436 0.440 0.462 1e-124
Q9XER91392 ENHANCER OF AG-4 protein no no 0.200 0.198 0.477 7e-63
Q9LEY41445 HUA2-like protein 1 OS=Ar no no 0.176 0.168 0.452 5e-56
Q9Y3E1203 Hepatoma-derived growth f yes no 0.092 0.630 0.320 3e-09
Q99JF8528 PC4 and SFRS1-interacting yes no 0.052 0.138 0.418 7e-09
P51859237 Hepatoma-derived growth f no no 0.059 0.345 0.379 7e-09
Q923W4202 Hepatoma-derived growth f yes no 0.092 0.633 0.302 7e-09
Q8VHK7237 Hepatoma-derived growth f no no 0.059 0.345 0.390 8e-09
Q9JMG7202 Hepatoma-derived growth f no no 0.092 0.628 0.313 9e-09
>sp|F4IZM8|HUAL2_ARATH HUA2-like protein 2 OS=Arabidopsis thaliana GN=At3g63070 PE=2 SV=1 Back     alignment and function desciption
 Score =  747 bits (1928), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 582/1453 (40%), Positives = 793/1453 (54%), Gaps = 179/1453 (12%)

Query: 1    MAPSRRKGASKAAAAAAARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVF 60
            MAPSR++G  +AAAA++ARR+WKVGDLVLAKVKGFPAWPA V EPEKWG+SAD KKV V 
Sbjct: 1    MAPSRKRGGGRAAAASSARREWKVGDLVLAKVKGFPAWPAVVDEPEKWGHSADSKKVTVH 60

Query: 61   FFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDS- 119
            FFGTQQIAFCN  DVE+FTEEKKQSLLT+R  +G+DFVRAV+EI +SYEKLK+QDQ    
Sbjct: 61   FFGTQQIAFCNHGDVESFTEEKKQSLLTRRHAKGSDFVRAVKEITESYEKLKQQDQASGP 120

Query: 120  NSGDELTVANGGNSVNSISHLKDRTEASEAT-LDSQMKPSNSTAGDGLNLPTEDSPAGRQ 178
               +E T  + GN+    S L    E    + LD+Q++ S+S   D L L +ED+ A  Q
Sbjct: 121  KYAEETTAGSSGNT----SQLPQACENLIGSRLDTQIESSSSHGRDELTLLSEDASAAEQ 176

Query: 179  LDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRL--QSTQRMAPSTRRSRSSTM 236
            + AL           ++  AK    + TYSSR+R+   R    + Q +      S+ S+ 
Sbjct: 177  MLALRHNTLAHNGACDSAAAKDLCEIATYSSRRRNERVRALKYAPQSIILPVEHSKISSR 236

Query: 237  VESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNG 296
            +E  R+Q  M+  ++ G +   I      +G  IR ++  ++   SE +D+ SS L  +G
Sbjct: 237  LELDRVQRSMLQCSDGGPSVNSI------NGKAIRRRKRIRTSGQSESDDVVSSDLNLHG 290

Query: 297  SIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVV 356
            S EDN+SEI TVES+  S NEG+ VDSG KVE S+ V E  DG   L+K LDF I  +V 
Sbjct: 291  SDEDNASEIATVESNNNSRNEGNGVDSGSKVEYSDAVGEGCDGGHELNKGLDFHISTMVK 350

Query: 357  KKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLP 416
            +KKRKP RKR  +D +DPPA++  A  +  +  ++C  S+N+   L+ER  +E+GDEHLP
Sbjct: 351  RKKRKPTRKRETSDIIDPPAKVE-AEGLGPNACDSCQRSQNSHERLNERPCEENGDEHLP 409

Query: 417  LVKRARVRMGKP-SSEEELKSSLQTEEKPSKDTAVNLVEHISPSLNN-------YDEGFT 468
            LVKRARVRM +   ++E++ +S Q EE+ SKDT ++     SPS+N+       +D    
Sbjct: 410  LVKRARVRMSRAFYADEKVNASSQVEERSSKDTLLSAALQTSPSVNHENGIGSGHDTSAA 469

Query: 469  EKG-----------------PSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSA 511
            E+                  PS +++ +D +SPS  C +  G+R        N+ F  + 
Sbjct: 470  EEFNSFELSAKLSGVMVDVVPSHMEKPSDRMSPSVACVQTVGDRQTAVNFHENE-FTMTL 528

Query: 512  DGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHET---- 567
            D E     S +L   +E   A   E  Q   + S   N  I+      I +CSH++    
Sbjct: 529  DDEVTRAQSNQLSSLVET-EARVPEVVQGCSEESQTGNCLISETDPIDI-QCSHQSEKHE 586

Query: 568  -------VDSRERSGSGLQNVPTCDQLSENCNSQKQES-FRDDVGSVDNVD------GKD 613
                   VDS      GL     C  L         +S  +  +   D+ D      G  
Sbjct: 587  TPLNPDIVDSSANKSPGL-----CSSLDMTTTVVPAQSPHQHKIQEYDSSDHSLVIVGDS 641

Query: 614  LPGSPFSV-HTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKD 672
            L G    + + +   VQ+Q    + P P        L+ + ++   +++    E+   K+
Sbjct: 642  LNGKCEKIDYCMTQVVQSQA---LEPPPP-------LFCSVVNYQEVENLQETENTLWKE 691

Query: 673  SRVENVDKEFNTSALVELSLDPVSGADESVKL----SPQNGSNELQYSVQGMSYENSESL 728
            ++  +  KE ++     +  +PV  A ES  +     PQ    E  YS    + EN E  
Sbjct: 692  NQ-GSPGKELDSDKQAHMIQNPVLSATESEMIVDDAEPQY---ETVYSHCADAVENRELE 747

Query: 729  KSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSP 788
            KS           CE  E   Q E+ +  +S+S+S++   +K       +SSPA G    
Sbjct: 748  KS-----------CEVDE---QKEQMQATNSISVSENFSREK------LNSSPARG---- 783

Query: 789  ARVSPPNTSLCHVSTSESANIVQSSSS-SPYARSQYKKSLGA-PVADEGKVDTTLTQRPK 846
               +P   S+C +ST+ES N +Q++S  S   +    KSL    V +E KV+T  TQ  K
Sbjct: 784  ---TPNCNSVCRISTAESENAMQNNSYYSTNVQYGENKSLNVDTVKEESKVETGTTQVKK 840

Query: 847  SVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLE 906
             V      S+    + SFE  L SL RTKE+IGRATR+A+D AKFGVS+K +EI+A  LE
Sbjct: 841  VVS-----SDVQCTVESFETALDSLVRTKETIGRATRLAMDLAKFGVSAKAMEILAHTLE 895

Query: 907  SESSLYRRVDLFFLVDSIMQCSRGMK---------AILTVLPRLLSAAAPPGNVAQENRR 957
            SES+L RRVDLFFLVDSI QCS+G+          +I  +LPRLL+AA P G   QENR+
Sbjct: 896  SESNLQRRVDLFFLVDSIAQCSKGLNGDAGGVYLSSIQAMLPRLLTAAVPAGATTQENRK 955

Query: 958  QCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAYSRRSSRTERALDDPVRDMEGM 1017
            QCLKVLRLWLERRILPESI+RHH+RELD+++ +  A  YSRRS+RTERALDDPVRDMEG+
Sbjct: 956  QCLKVLRLWLERRILPESIVRHHIRELDSLS-NVPACLYSRRSARTERALDDPVRDMEGI 1014

Query: 1018 LVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAVTPEHNSEIPEERDANPSMKKH 1077
            LVDEYGSNS+ QL GFC+P +L+D+D+GSDSDGG FE+VTPEH S   EE       ++H
Sbjct: 1015 LVDEYGSNSTLQLHGFCIPPILRDEDEGSDSDGGDFESVTPEHESRSLEEHVTPSITERH 1074

Query: 1078 RHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLP-QDVPPSSPPLP 1136
              ILE+VDGELEMEDVAP  +   S++ + D A          VP    Q+V  SSP   
Sbjct: 1075 TRILEDVDGELEMEDVAPPWEGGSSASAITDQADNRESANCLLVPGTSHQNVTSSSP--- 1131

Query: 1137 SSPPPVLPPPPSIPHSCAFSDSYSNGASMQ---NMQNDGQQSVTQQSVAPRINSTVSTNA 1193
                   P  PS     A S+SYSNG   +   +MQ D        +  PR+N       
Sbjct: 1132 -------PARPSQNAQLAMSNSYSNGFDYRRNPSMQGD------YHAGPPRMNP-----P 1173

Query: 1194 VHYHASECRDHQMPMQMPESTSSFGCYSMRP---SNNFQQTDGPRFHNKPYPPRPPHAPQ 1250
            +HY + E          P  +S        P    +NFQ       H       PP  P 
Sbjct: 1174 MHYGSPE----------PSYSSRVSLSKSMPRGEGSNFQ-------HRPYPSSHPPPPPP 1216

Query: 1251 SNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFS 1310
            S+H+SY++     KSRRE    S  HR H    FD  N+ ++++RM+P P E+R++WR+ 
Sbjct: 1217 SHHYSYMEPDHHIKSRREG--LSYPHRSHYTLEFDERNYQDSYERMRPEPCENRDNWRYH 1274

Query: 1311 APSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHR---HMRPPSGGAV 1367
             PS  GPRY D  +  + S SY G  R+  R  N  W+  PR  N+R   H +  S G V
Sbjct: 1275 PPSSHGPRYHDRHKGPHQSSSYSGHHRDSGRLQNNRWSDSPRAYNNRHSYHYKQHSEGPV 1334

Query: 1368 PVGIRAPGAWRPR 1380
            PVG+R PG W  R
Sbjct: 1335 PVGMRDPGTWHQR 1347




Probable transcrition factor that seems to be involved in mRNA processing.
Arabidopsis thaliana (taxid: 3702)
>sp|F4IN78|HUAL3_ARATH HUA2-like protein 3 OS=Arabidopsis thaliana GN=At2g48160 PE=2 SV=2 Back     alignment and function description
>sp|Q9XER9|HUA2_ARATH ENHANCER OF AG-4 protein 2 OS=Arabidopsis thaliana GN=HUA2 PE=2 SV=1 Back     alignment and function description
>sp|Q9LEY4|HUAL1_ARATH HUA2-like protein 1 OS=Arabidopsis thaliana GN=At5g08230 PE=2 SV=1 Back     alignment and function description
>sp|Q9Y3E1|HDGR3_HUMAN Hepatoma-derived growth factor-related protein 3 OS=Homo sapiens GN=HDGFRP3 PE=1 SV=1 Back     alignment and function description
>sp|Q99JF8|PSIP1_MOUSE PC4 and SFRS1-interacting protein OS=Mus musculus GN=Psip1 PE=1 SV=1 Back     alignment and function description
>sp|P51859|HDGF_MOUSE Hepatoma-derived growth factor OS=Mus musculus GN=Hdgf PE=1 SV=2 Back     alignment and function description
>sp|Q923W4|HDGR3_RAT Hepatoma-derived growth factor-related protein 3 OS=Rattus norvegicus GN=Hdgfrp3 PE=2 SV=1 Back     alignment and function description
>sp|Q8VHK7|HDGF_RAT Hepatoma-derived growth factor OS=Rattus norvegicus GN=Hdgf PE=1 SV=2 Back     alignment and function description
>sp|Q9JMG7|HDGR3_MOUSE Hepatoma-derived growth factor-related protein 3 OS=Mus musculus GN=Hdgfrp3 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1380
255584708 1558 glutathione peroxidase, putative [Ricinu 0.940 0.833 0.545 0.0
2254369581479 PREDICTED: uncharacterized protein LOC10 0.994 0.927 0.522 0.0
2960867161459 unnamed protein product [Vitis vinifera] 0.975 0.922 0.478 0.0
3574408091396 Hepatoma-derived growth factor-related p 0.949 0.938 0.469 0.0
356536756 1561 PREDICTED: uncharacterized protein LOC10 0.955 0.844 0.470 0.0
356503527 1536 PREDICTED: uncharacterized protein LOC10 0.923 0.830 0.456 0.0
449436812 1580 PREDICTED: HUA2-like protein 3-like [Cuc 0.955 0.834 0.457 0.0
306957191347 Tudor/PWWP/MBT domain-containing protein 0.923 0.945 0.400 0.0
2978249251379 hypothetical protein ARALYDRAFT_484006 [ 0.754 0.754 0.427 0.0
75734361261 putative protein [Arabidopsis thaliana] 0.789 0.864 0.415 0.0
>gi|255584708|ref|XP_002533075.1| glutathione peroxidase, putative [Ricinus communis] gi|223527139|gb|EEF29314.1| glutathione peroxidase, putative [Ricinus communis] Back     alignment and taxonomy information
 Score = 1295 bits (3350), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 772/1416 (54%), Positives = 939/1416 (66%), Gaps = 118/1416 (8%)

Query: 21   QWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTE 80
            QWKVGDLVLAKVKGFPAWPATVSEPEKWGY+ADWKKVLV+FFGTQQIAFCNPADVEAFTE
Sbjct: 22   QWKVGDLVLAKVKGFPAWPATVSEPEKWGYAADWKKVLVYFFGTQQIAFCNPADVEAFTE 81

Query: 81   EKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDS-NSGDELTVANGGNSVNSISH 139
            EKKQSLL KRQG+GADFVRAVQEII+SYEKLKK DQVD  NSG+E+T+ NGG+S+ S ++
Sbjct: 82   EKKQSLLVKRQGKGADFVRAVQEIIESYEKLKKSDQVDDRNSGEEITLVNGGHSMESSAY 141

Query: 140  --LKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLV 197
              LK +TE SEAT+  +  P     G  +++P        Q   L  KE   EQP++N+ 
Sbjct: 142  FELKGQTETSEATVTGRDDP-----GLAVDVP--------QSGTLHDKEDSTEQPADNMA 188

Query: 198  AKATPVLTTYSSRKRSGG--SRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKN 255
                P + TY+SRKRS G  SR  + Q+   S  RS S   +ES R QN M+P N   K+
Sbjct: 189  VPVKPGIATYTSRKRSLGLRSRKHAAQKNDSSVERSGSLPRLESSRFQNFMLPSNEGSKS 248

Query: 256  AEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSL 315
            A   S   I D +L RNKRTR+SPD SE +D+DSSA +SNGSIED+ SEIVTV+SD+ SL
Sbjct: 249  AGDASTDVISDRALRRNKRTRRSPDASEWDDVDSSAFVSNGSIEDDGSEIVTVDSDSLSL 308

Query: 316  NEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPP 375
            NEGST+DS  K E SET +ECL+GD  LSK LDFQIKAV +KKKRK NRKRV N+A +PP
Sbjct: 309  NEGSTIDSASKPEHSETFVECLEGDVELSKGLDFQIKAVFIKKKRKQNRKRVSNEAAEPP 368

Query: 376  AR-INTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPSSEEEL 434
            AR + T   +D  T ++  + +N G NL+ER  KEDGDEHLPLVKRARVRMGK SS +E 
Sbjct: 369  ARRLETEVYLDAETHSSSQNLKNAGDNLNERHNKEDGDEHLPLVKRARVRMGKLSSLQEH 428

Query: 435  KSSLQTEEKPSKDTAVN----------------------LVEHISPSLNNYDEGFTEKGP 472
             S  Q EEK S + A+N                       +E I PS N  D+   +K  
Sbjct: 429  ASFSQDEEKISNEVAINPTEVNNGLCQVEERTLNEVAVATLERIVPSDNLSDDCSADKDS 488

Query: 473  SLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSA 532
              +K + D  SP K  +++  NRPQ+     +QSFGC+ADGEAALPPSKRLHRALEAMSA
Sbjct: 489  FSLKGALDIASPPKAHAQIPVNRPQILILKESQSFGCTADGEAALPPSKRLHRALEAMSA 548

Query: 533  NAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQ----LS 588
            NAAEEG A  + +S+  TS+N     S+   S   V+ +E +GSG Q            S
Sbjct: 549  NAAEEGHACAE-TSIKKTSMNDGSTFSMKGSSGMVVERKENNGSGEQITEGLSHGASAFS 607

Query: 589  ENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQ 648
             + N   +ES R                SP         +++ T         +RH    
Sbjct: 608  SSSNRVLEESVR----------------SPLDRENCNELIESST--------SQRHHKDA 643

Query: 649  L---YQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLS 705
            L   + N  D      EG AED +L+ +  EN  ++ + S    L+   +S A+E  K +
Sbjct: 644  LALGFHNGRDVNGSCIEGHAEDAELRVAGGENRVEDVSISESSRLNASLISLANEGTKGT 703

Query: 706  PQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDD 765
              NGS+ LQ +    + EN+E+L++Q+DDN             + N  +KE    S S+D
Sbjct: 704  SLNGSDALQNTADDTACENTETLRTQVDDNS------------RDNGIRKESCYASSSED 751

Query: 766  HLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQSSSSSPYARSQYKK 825
            HLG +    V  S  PA+G++SPA+ SPP TS+CHVST+ESAN +Q+S  S    SQ K 
Sbjct: 752  HLGGRDGLGVGSSPVPADGMESPAQTSPPTTSICHVSTAESANFIQNSGCSSPNHSQQKT 811

Query: 826  SLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIA 885
            ++   V DE K+++   QRPKSVGKWSS +EAHAALSSFE +LGSLTRTKESIGRATRIA
Sbjct: 812  TVCTSVVDEEKIESVAPQRPKSVGKWSSYAEAHAALSSFEGMLGSLTRTKESIGRATRIA 871

Query: 886  IDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMK---------AILT 936
            IDCAKFGVS+KVV+I+AR LESES+L+RRVDLFFLVDSI QCSRG+K         AI  
Sbjct: 872  IDCAKFGVSAKVVDILARTLESESNLHRRVDLFFLVDSITQCSRGLKGDVGGIYPSAIQA 931

Query: 937  VLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAY 996
            VLPRLLSAAAPPG+ AQENRRQCLKVLRLWLERRILPE ++RHHMRE+D++  SSS  AY
Sbjct: 932  VLPRLLSAAAPPGSFAQENRRQCLKVLRLWLERRILPEPVVRHHMREIDSLGGSSSGGAY 991

Query: 997  SRRSSRTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAV 1056
            SRRS+RTER LDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKD+D+GSDSDG SFEAV
Sbjct: 992  SRRSARTERPLDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDEDEGSDSDGESFEAV 1051

Query: 1057 TPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSS------TVLVDIA 1110
            TPEHNSE PEE D+ P+++KH HILE+VDGELEMEDVAP+C+ E SS         V I 
Sbjct: 1052 TPEHNSETPEEHDSAPAIEKHTHILEDVDGELEMEDVAPSCEIEASSAGGIGGVNAVHIP 1111

Query: 1111 QTSHDQLLS--FVPPLPQDV--PPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGA--- 1163
            Q+  +Q  S  F PPLPQDV       P    PPP  PPPP+I  S    D Y NG    
Sbjct: 1112 QSQLEQHFSLPFAPPLPQDVPPSSPPLPTSPPPPPPPPPPPAILPSSGMPDPYVNGVDSK 1171

Query: 1164 ---SMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCY 1220
               +   M +D +++V Q   APRI S++ T+ VHYHA+ECRD QM MQ+ +STSSF  Y
Sbjct: 1172 LYTNSHYMHDDLRETVAQPLAAPRITSSI-TDGVHYHATECRD-QMQMQLCDSTSSFSSY 1229

Query: 1221 SMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHP 1280
               P NN Q  D P FH+K Y PRPPH P SN FSYVQA Q  KSRR +P PS+ HR+  
Sbjct: 1230 PACPVNNVQHADSPNFHHKAYAPRPPHHPPSNQFSYVQAGQHVKSRRASPPPSHHHRYQS 1289

Query: 1281 HPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPP 1340
              N DGGN+YNNH+RM+P PY+  ESWR+  P F GPRYPD++R  YP G YGGP REP 
Sbjct: 1290 SHNTDGGNYYNNHERMRPAPYD--ESWRYPPPPFPGPRYPDKSRASYPRGPYGGPPREPT 1347

Query: 1341 RYSNRGWAYPPRPMNHRH---MRPPSGGAVPVGIRA 1373
            R  ++GW+YP + M+HR+    RPPS  AVPV  R 
Sbjct: 1348 RMPHQGWSYPSQDMHHRNFMPFRPPS-DAVPVSNRV 1382




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225436958|ref|XP_002271866.1| PREDICTED: uncharacterized protein LOC100261323 [Vitis vinifera] Back     alignment and taxonomy information
>gi|296086716|emb|CBI32351.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357440809|ref|XP_003590682.1| Hepatoma-derived growth factor-related protein [Medicago truncatula] gi|355479730|gb|AES60933.1| Hepatoma-derived growth factor-related protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|356536756|ref|XP_003536901.1| PREDICTED: uncharacterized protein LOC100803427 [Glycine max] Back     alignment and taxonomy information
>gi|356503527|ref|XP_003520559.1| PREDICTED: uncharacterized protein LOC100799006 [Glycine max] Back     alignment and taxonomy information
>gi|449436812|ref|XP_004136186.1| PREDICTED: HUA2-like protein 3-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|30695719|ref|NP_191866.3| Tudor/PWWP/MBT domain-containing protein [Arabidopsis thaliana] gi|403399430|sp|F4IZM8.1|HUAL2_ARATH RecName: Full=HUA2-like protein 2 gi|332646910|gb|AEE80431.1| Tudor/PWWP/MBT domain-containing protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297824925|ref|XP_002880345.1| hypothetical protein ARALYDRAFT_484006 [Arabidopsis lyrata subsp. lyrata] gi|297326184|gb|EFH56604.1| hypothetical protein ARALYDRAFT_484006 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|7573436|emb|CAB87752.1| putative protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1380
TAIR|locus:20992421347 AT3G63070 "AT3G63070" [Arabido 0.910 0.933 0.362 5.4e-166
TAIR|locus:20393511366 AT2G48160 [Arabidopsis thalian 0.757 0.765 0.348 8.5e-162
TAIR|locus:21782921392 HUA2 "ENHANCER OF AG-4 2" [Ara 0.197 0.196 0.464 1e-72
TAIR|locus:21815691445 AT5G08230 "AT5G08230" [Arabido 0.305 0.291 0.336 2.3e-64
UNIPROTKB|Q9Y3E1203 HDGFRP3 "Hepatoma-derived grow 0.113 0.768 0.307 8.9e-13
RGD|628773202 Hdgfrp3 "hepatoma-derived grow 0.086 0.594 0.343 9e-13
MGI|MGI:1352760202 Hdgfrp3 "hepatoma-derived grow 0.086 0.594 0.343 1.8e-12
CGD|CAL00038741404 PGA55 [Candida albicans (taxid 0.566 0.556 0.184 9.4e-12
UNIPROTKB|Q59SG91404 PGA55 "Flocculin-like protein" 0.566 0.556 0.184 9.4e-12
ZFIN|ZDB-GENE-050213-2201 hdgfrp3 "hepatoma-derived grow 0.092 0.636 0.335 2.5e-11
TAIR|locus:2099242 AT3G63070 "AT3G63070" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1615 (573.6 bits), Expect = 5.4e-166, P = 5.4e-166
 Identities = 506/1395 (36%), Positives = 697/1395 (49%)

Query:    21 QWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTE 80
             +WKVGDLVLAKVKGFPAWPA V EPEKWG+SAD KKV V FFGTQQIAFCN  DVE+FTE
Sbjct:    21 EWKVGDLVLAKVKGFPAWPAVVDEPEKWGHSADSKKVTVHFFGTQQIAFCNHGDVESFTE 80

Query:    81 EKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHL 140
             EKKQSLLT+R  +G+DFVRAV+EI +SYEKLK+QDQ       E T A  G+S N+ S L
Sbjct:    81 EKKQSLLTRRHAKGSDFVRAVKEITESYEKLKQQDQASGPKYAEETTA--GSSGNT-SQL 137

Query:   141 KDRTE-ASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAK 199
                 E    + LD+Q++ S+S   D L L +ED+ A  Q+ AL           ++  AK
Sbjct:   138 PQACENLIGSRLDTQIESSSSHGRDELTLLSEDASAAEQMLALRHNTLAHNGACDSAAAK 197

Query:   200 ATPVLTTYSSRKRSGGSRL--QSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAE 257
                 + TYSSR+R+   R    + Q +      S+ S+ +E  R+Q  M+  ++ G +  
Sbjct:   198 DLCEIATYSSRRRNERVRALKYAPQSIILPVEHSKISSRLELDRVQRSMLQCSDGGPSVN 257

Query:   258 GISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNE 317
              I+ K+I      R KR R S   SE +D+ SS L  +GS EDN+SEI TVES+  S NE
Sbjct:   258 SINGKAIR-----RRKRIRTSGQ-SESDDVVSSDLNLHGSDEDNASEIATVESNNNSRNE 311

Query:   318 GSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIXXXXXXXXXXXXXXXXCNDAVDPPAR 377
             G+ VDSG KVE S+ V E  DG   L+K LDF I                 +D +DPPA+
Sbjct:   312 GNGVDSGSKVEYSDAVGEGCDGGHELNKGLDFHISTMVKRKKRKPTRKRETSDIIDPPAK 371

Query:   378 INTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPXXXXXXXXX 437
             +  A  +  +  ++C  S+N+   L+ER  +E+GDEHLPLVKRARVRM +          
Sbjct:   372 VE-AEGLGPNACDSCQRSQNSHERLNERPCEENGDEHLPLVKRARVRMSRAFYADEKVNA 430

Query:   438 XXXXXX-PSKDTAVNLVEHISPSLNNYDEGF-TEKGPSLVKESTD---SVSPSKVCSEVS 492
                     SKDT ++     SPS+N ++ G  +    S  +E      S   S V  +V 
Sbjct:   431 SSQVEERSSKDTLLSAALQTSPSVN-HENGIGSGHDTSAAEEFNSFELSAKLSGVMVDVV 489

Query:   493 GNRPQLWKGTTNQSFGCS---ADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLIN 549
              +  +      + S  C     D + A+      H     M+ +       S Q SSL+ 
Sbjct:   490 PSHMEKPSDRMSPSVACVQTVGDRQTAV----NFHENEFTMTLDDEVTRAQSNQLSSLVE 545

Query:   550 TSINGCCVNSICK-CSHETVDSRERSGSGLQNVPTCDQLSENCN--SQKQES-FRDDVGS 605
             T      V  + + CS E+     ++G+ L  +   D +   C+  S+K E+    D+  
Sbjct:   546 TEAR---VPEVVQGCSEES-----QTGNCL--ISETDPIDIQCSHQSEKHETPLNPDI-- 593

Query:   606 VDNVDGKDLPGSPFSVHTIQTAVQTQTP-VNILPDPKKRHSSFQLYQNSLDQLSLK-DEG 663
             VD+   K  PG   S+    T V  Q+P  + + +      S  +  +SL+    K D  
Sbjct:   594 VDSSANKS-PGLCSSLDMTTTVVPAQSPHQHKIQEYDSSDHSLVIVGDSLNGKCEKIDYC 652

Query:   664 SAEDLQ--------------LKDSRVENVDKEFNTSALVELSLDPVSGADESVKL----S 705
               + +Q              +    VEN+ +  NT    E    P    D   +     +
Sbjct:   653 MTQVVQSQALEPPPPLFCSVVNYQEVENLQETENT-LWKENQGSPGKELDSDKQAHMIQN 711

Query:   706 PQNGSNELQYSVQGMS--YENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSIS 763
             P   + E +  V      YE   S  +   +N  +   CE V+E  Q E+ +  +S+S+S
Sbjct:   712 PVLSATESEMIVDDAEPQYETVYSHCADAVENRELEKSCE-VDE--QKEQMQATNSISVS 768

Query:   764 DDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQXXXXXPYARS-Q 822
             ++   +K       +SSPA G       +P   S+C +ST+ES N +Q      Y+ + Q
Sbjct:   769 ENFSREK------LNSSPARG-------TPNCNSVCRISTAESENAMQNNSY--YSTNVQ 813

Query:   823 Y--KKSLGAP-VADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIG 879
             Y   KSL    V +E KV+T  TQ  K V      S+    + SFE  L SL RTKE+IG
Sbjct:   814 YGENKSLNVDTVKEESKVETGTTQVKKVVS-----SDVQCTVESFETALDSLVRTKETIG 868

Query:   880 RATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMK------- 932
             RATR+A+D AKFGVS+K +EI+A  LESES+L RRVDLFFLVDSI QCS+G+        
Sbjct:   869 RATRLAMDLAKFGVSAKAMEILAHTLESESNLQRRVDLFFLVDSIAQCSKGLNGDAGGVY 928

Query:   933 --AILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCX 990
               +I  +LPRLL+AA P G   QENR+QCLKVLRLWLERRILPESI+RHH+RELD+++  
Sbjct:   929 LSSIQAMLPRLLTAAVPAGATTQENRKQCLKVLRLWLERRILPESIVRHHIRELDSLS-N 987

Query:   991 XXXXXXXXXXXXTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKXXXXXXXXXX 1050
                         TERALDDPVRDMEG+LVDEYGSNS+ QL GFC+P +L+          
Sbjct:   988 VPACLYSRRSARTERALDDPVRDMEGILVDEYGSNSTLQLHGFCIPPILRDEDEGSDSDG 1047

Query:  1051 XXFEAVTPEHNSEIPEERDANPSM-KKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDI 1109
               FE+VTPEH S   EE    PS+ ++H  ILE+VDGELEMEDVAP  +   S++ + D 
Sbjct:  1048 GDFESVTPEHESRSLEEH-VTPSITERHTRILEDVDGELEMEDVAPPWEGGSSASAITDQ 1106

Query:  1110 AQTSHDQLLSFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHSCAFSDSYSNGASMQNMQ 1169
             A  + +                                   +S  F   Y    SMQ   
Sbjct:  1107 AD-NRESANCLLVPGTSHQNVTSSSPPARPSQNAQLAMSNSYSNGFD--YRRNPSMQGDY 1163

Query:  1170 NDGQQSVTQQ----SVAPRINSTVSTNAV--HYHASECRDHQMPMQMPESTSSFGCYS-M 1222
             + G   +       S  P  +S VS +        S  +    P   P        YS M
Sbjct:  1164 HAGPPRMNPPMHYGSPEPSYSSRVSLSKSMPRGEGSNFQHRPYPSSHPPPPPPSHHYSYM 1223

Query:  1223 RPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHP 1282
              P ++ +     R     YP R  +  + +  +Y  + +  + R E     ++ R+HP P
Sbjct:  1224 EPDHHIKS----RREGLSYPHRSHYTLEFDERNYQDSYE--RMRPEPCENRDNWRYHP-P 1276

Query:  1283 NFDGGNFYNNHDRMKPGPYE-------HRESWRFSAPSFS-GPR-YPDEARECYPSGSYG 1333
             +  G  +   HDR K GP++       HR+S R     +S  PR Y +     Y   S G
Sbjct:  1277 SSHGPRY---HDRHK-GPHQSSSYSGHHRDSGRLQNNRWSDSPRAYNNRHSYHYKQHSEG 1332

Query:  1334 G-P--LREPPRYSNR 1345
               P  +R+P  +  R
Sbjct:  1333 PVPVGMRDPGTWHQR 1347


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2039351 AT2G48160 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2178292 HUA2 "ENHANCER OF AG-4 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2181569 AT5G08230 "AT5G08230" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q9Y3E1 HDGFRP3 "Hepatoma-derived growth factor-related protein 3" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
RGD|628773 Hdgfrp3 "hepatoma-derived growth factor, related protein 3" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
MGI|MGI:1352760 Hdgfrp3 "hepatoma-derived growth factor, related protein 3" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
CGD|CAL0003874 PGA55 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q59SG9 PGA55 "Flocculin-like protein" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-050213-2 hdgfrp3 "hepatoma-derived growth factor, related protein 3" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
F4IZM8HUAL2_ARATHNo assigned EC number0.40050.92310.9458yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1380
cd0583483 cd05834, HDGF_related, The PWWP domain is an essen 8e-31
smart00582124 smart00582, RPR, domain present in proteins, which 6e-28
cd0516287 cd05162, PWWP, The PWWP domain, named for a conser 1e-13
pfam0085574 pfam00855, PWWP, PWWP domain 1e-09
cd0583686 cd05836, N_Pac_NP60, The PWWP domain is an essenti 8e-09
smart0029363 smart00293, PWWP, domain with conserved PWWP motif 3e-08
cd0584093 cd05840, SPBC215_ISWI_like, The PWWP domain is a c 2e-07
cd0583587 cd05835, Dnmt3b_related, The PWWP domain is an ess 1e-06
cd0584183 cd05841, BS69_related, The PWWP domain is part of 7e-04
cd0608080 cd06080, MUM1_like, Mutated melanoma-associated an 0.001
pfam09770 804 pfam09770, PAT1, Topoisomerase II-associated prote 0.002
>gnl|CDD|99895 cd05834, HDGF_related, The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF Back     alignment and domain information
 Score =  116 bits (292), Expect = 8e-31
 Identities = 42/85 (49%), Positives = 52/85 (61%), Gaps = 5/85 (5%)

Query: 20  RQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFT 79
           +Q+K GDLV AKVKG+PAWPA V EPE W      KK  V+FFGT + AF  P D+  +T
Sbjct: 1   KQFKAGDLVFAKVKGYPAWPARVDEPEDWKPPG--KKYPVYFFGTHETAFLKPEDLFPYT 58

Query: 80  EEKKQSLLTKRQGRGADFVRAVQEI 104
           E KK+    K++     F  AV EI
Sbjct: 59  ENKKKFGKPKKR---KGFNEAVWEI 80


This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. Length = 83

>gnl|CDD|214731 smart00582, RPR, domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>gnl|CDD|99894 cd05162, PWWP, The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids Back     alignment and domain information
>gnl|CDD|216154 pfam00855, PWWP, PWWP domain Back     alignment and domain information
>gnl|CDD|99897 cd05836, N_Pac_NP60, The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha Back     alignment and domain information
>gnl|CDD|214603 smart00293, PWWP, domain with conserved PWWP motif Back     alignment and domain information
>gnl|CDD|99901 cd05840, SPBC215_ISWI_like, The PWWP domain is a component of the S Back     alignment and domain information
>gnl|CDD|99896 cd05835, Dnmt3b_related, The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis Back     alignment and domain information
>gnl|CDD|99902 cd05841, BS69_related, The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions Back     alignment and domain information
>gnl|CDD|99903 cd06080, MUM1_like, Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA) Back     alignment and domain information
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1 Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 1380
KOG1904496 consensus Transcription coactivator [Transcription 99.89
cd0583483 HDGF_related The PWWP domain is an essential part 99.87
cd0583686 N_Pac_NP60 The PWWP domain is an essential part of 99.86
PF12243139 CTK3: CTD kinase subunit gamma CTK3 99.86
cd0584093 SPBC215_ISWI_like The PWWP domain is a component o 99.85
cd0583587 Dnmt3b_related The PWWP domain is an essential com 99.84
cd0583895 WHSC1_related The PWWP domain was first identified 99.83
cd0516287 PWWP The PWWP domain, named for a conserved Pro-Tr 99.83
PF0085586 PWWP: PWWP domain; InterPro: IPR000313 Upon charac 99.82
cd0584183 BS69_related The PWWP domain is part of BS69 prote 99.81
cd05837110 MSH6_like The PWWP domain is present in MSH6, a mi 99.72
smart0029363 PWWP domain with conserved PWWP motif. conservatio 99.72
cd0608080 MUM1_like Mutated melanoma-associated antigen 1 (M 99.63
smart00582121 RPR domain present in proteins, which are involved 99.6
cd05839111 BR140_related The PWWP domain is found in the BR14 99.58
cd03562114 CID CID (CTD-Interacting Domain) domain family; CI 98.92
KOG1924 1102 consensus RhoA GTPase effector DIA/Diaphanous [Sig 98.92
KOG1924 1102 consensus RhoA GTPase effector DIA/Diaphanous [Sig 98.82
KOG0132 894 consensus RNA polymerase II C-terminal domain-bind 98.55
PF0481864 CTD_bind: RNA polymerase II-binding domain.; Inter 98.29
KOG2669325 consensus Regulator of nuclear mRNA [RNA processin 97.79
KOG0151877 consensus Predicted splicing regulator, contains R 97.77
KOG1830518 consensus Wiskott Aldrich syndrome proteins [Cytos 97.15
KOG1081463 consensus Transcription factor NSD1 and related SE 97.0
PHA03247 3151 large tegument protein UL36; Provisional 96.42
PHA03247 3151 large tegument protein UL36; Provisional 96.08
KOG3671569 consensus Actin regulatory protein (Wiskott-Aldric 95.65
KOG0132894 consensus RNA polymerase II C-terminal domain-bind 95.54
KOG3671569 consensus Actin regulatory protein (Wiskott-Aldric 94.05
KOG4368 757 consensus Predicted RNA binding protein, contains 93.76
KOG4849498 consensus mRNA cleavage factor I subunit/CPSF subu 92.53
KOG1923 830 consensus Rac1 GTPase effector FRL [Signal transdu 90.17
smart0033357 TUDOR Tudor domain. Domain of unknown function pre 89.9
KOG1830518 consensus Wiskott Aldrich syndrome proteins [Cytos 89.53
COG5178 2365 PRP8 U5 snRNP spliceosome subunit [RNA processing 88.21
smart0074361 Agenet Tudor-like domain present in plant sequence 87.55
KOG0119554 consensus Splicing factor 1/branch point binding p 87.15
KOG1923 830 consensus Rac1 GTPase effector FRL [Signal transdu 86.66
COG5178 2365 PRP8 U5 snRNP spliceosome subunit [RNA processing 85.34
KOG4849498 consensus mRNA cleavage factor I subunit/CPSF subu 85.31
KOG4672487 consensus Uncharacterized conserved low complexity 84.63
PRK15319 2039 AIDA autotransporter-like protein ShdA; Provisiona 84.34
KOG2071579 consensus mRNA cleavage and polyadenylation factor 84.06
KOG4672487 consensus Uncharacterized conserved low complexity 82.31
>KOG1904 consensus Transcription coactivator [Transcription] Back     alignment and domain information
Probab=99.89  E-value=3.8e-23  Score=239.00  Aligned_cols=234  Identities=34%  Similarity=0.467  Sum_probs=166.3

Q ss_pred             CCCCcCCEEEEecCCCCCCCeeeeCCCCCCCCCCCCeEEEEEeCCCCeeeecCCCccccchHHHHhhhhhcCCChHHHHH
Q 000633           20 RQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVR   99 (1380)
Q Consensus        20 ~~Fk~GDLVWAKVKGYPwWPArI~~Pe~~~~K~~~nkylV~FFGT~EyAWV~pkdLkPFsE~kkesflnKrkgK~K~Fre   99 (1380)
                      ..|++||||||||||||.|||+|.++..+..++..++|.||||||+++|||.+++|++|..++.. +..-.+...+.|++
T Consensus        11 ~~~~~GDLV~AKlkgyp~WParI~~~~~~~~kp~pkky~V~FfGT~e~Afl~p~dlqpy~~~k~~-~g~~~k~~~k~F~~   89 (496)
T KOG1904|consen   11 GNFKCGDLVFAKLKGYPPWPARIRNGPDGAVKPPPKKYTVFFFGTKETAFLKPKDLQPYMLNKEK-LGKPNKRVWKGFIE   89 (496)
T ss_pred             CCCCCCceeeecccCCCCCcccccCcccccccCCCceeEEEEeccCcccccchhhccchhhhhhh-cccchhhhhHHHHH
Confidence            58999999999999999999999999999999999999999999999999999999999988763 33212113499999


Q ss_pred             HHHHHHHHHHHHhhhcccCcCCCcccccccCCCCCCccccccccccchhhcccccCCCCCCCCCCCCCCCCCCCcccccc
Q 000633          100 AVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQL  179 (1380)
Q Consensus       100 ALeEAee~~e~LK~qeq~~~~~aee~~~a~~~~~~~s~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~  179 (1380)
                      |+++|.+++     +... .++...+.         +.                        +..+.-+..         
T Consensus        90 av~eI~~a~-----~np~-~~~~~~~~---------~~------------------------~~~~~~~~~---------  121 (496)
T KOG1904|consen   90 AVEEIREAF-----NNPK-SESDGIFF---------SA------------------------TATGRLLGP---------  121 (496)
T ss_pred             HHHHHHHHh-----cCCC-ccccCccc---------cc------------------------ccccccccc---------
Confidence            999999988     1000 11100000         00                        000000000         


Q ss_pred             cCCCCCCCCCCCCCcCcccccCCCccccccccccCCCccccccccCCccccccCcccchhhhhccccccccccCCCCCCC
Q 000633          180 DALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGI  259 (1380)
Q Consensus       180 ~~~~~~~~~~~~p~en~~~~~~~~~~t~S~~kr~~~~~~~~~~~r~~s~rrsrSs~~~~s~r~q~~~~p~~d~gk~~g~~  259 (1380)
                                            +++                    .           +.+.       ..+|.|++-   
T Consensus       122 ----------------------~~~--------------------~-----------~~~~-------~~~~~~~~~---  138 (496)
T KOG1904|consen  122 ----------------------ENL--------------------F-----------VASS-------VLNDTGNSE---  138 (496)
T ss_pred             ----------------------ccc--------------------c-----------cccc-------ccccccccc---
Confidence                                  010                    0           0000       011233111   


Q ss_pred             cccccccCceeccccccCCCCCCCCCccCccccccCCccCCCCCceeeeecCccccCCCCccCCCCCccccchhhhcccc
Q 000633          260 SAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDG  339 (1380)
Q Consensus       260 s~~~~~d~~~~~~k~~~~S~~~s~~ddv~Ss~~~~ngs~edn~sei~t~~sd~~slNegs~vds~~~~E~Se~~~e~~e~  339 (1380)
                                                       ...|..+.+.++|.+.++++. -+++..+|+..+.|.+...=+...|
T Consensus       139 ---------------------------------~~~~~~~~~~s~~as~e~~~~-~~~~~~~~~~~~~d~~~~~d~~~~~  184 (496)
T KOG1904|consen  139 ---------------------------------GEPGADENNASEIATPESDNN-RPEGNGEDSSSKVDYSDSTDHFELG  184 (496)
T ss_pred             ---------------------------------ccccccccccccccccccccc-CCccCcccccccccccccccccccc
Confidence                                             223334444555666655554 6788899999999987776677888


Q ss_pred             ccccccccccceeeeeeccccCCCCcccccCCCCCccccCccccccccccccccCCcccCCCCCCCCCccCCCCCcc
Q 000633          340 DEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLP  416 (1380)
Q Consensus       340 ~~~l~~~~d~~~k~vv~kkKRKpnrkr~tnd~~~p~~~~~~~~~l~~~~~~~~~~s~N~~~ns~e~~~k~DGDEHLP  416 (1380)
                      ..+|++++.++|.+.+.++|||..|||.|.+             +...    ++.++|-..-+.++.++++|++|+|
T Consensus       185 ~~el~~~~~~~~~~~~~r~~~kg~rkr~ts~-------------~~~~----~~~~~a~~~k~t~~~~~~~~s~krP  244 (496)
T KOG1904|consen  185 GDELNKGKRKPIDTMVKRKKRKGTRKRKTSD-------------AEPK----VQRSQASHEKLTERPCESNGSEKRP  244 (496)
T ss_pred             chhhccccccccccccccccCcccccccccc-------------cccc----ccccccchhhccCCCcccccccccc
Confidence            9999999999999999999999999999888             1223    4477777888999999999999999



>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF Back     alignment and domain information
>cd05836 N_Pac_NP60 The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha Back     alignment and domain information
>PF12243 CTK3: CTD kinase subunit gamma CTK3 Back     alignment and domain information
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S Back     alignment and domain information
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis Back     alignment and domain information
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS) Back     alignment and domain information
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids Back     alignment and domain information
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified [] Back     alignment and domain information
>cd05841 BS69_related The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions Back     alignment and domain information
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS Back     alignment and domain information
>smart00293 PWWP domain with conserved PWWP motif Back     alignment and domain information
>cd06080 MUM1_like Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA) Back     alignment and domain information
>smart00582 RPR domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2 Back     alignment and domain information
>cd03562 CID CID (CTD-Interacting Domain) domain family; CID is present in several RNA-processing factors such as Pcf11 and Nrd1 Back     alignment and domain information
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG0132 consensus RNA polymerase II C-terminal domain-binding protein RA4, contains RPR and RRM domains [RNA processing and modification; Transcription] Back     alignment and domain information
>PF04818 CTD_bind: RNA polymerase II-binding domain Back     alignment and domain information
>KOG2669 consensus Regulator of nuclear mRNA [RNA processing and modification] Back     alignment and domain information
>KOG0151 consensus Predicted splicing regulator, contains RRM, SWAP and RPR domains [General function prediction only] Back     alignment and domain information
>KOG1830 consensus Wiskott Aldrich syndrome proteins [Cytoskeleton] Back     alignment and domain information
>KOG1081 consensus Transcription factor NSD1 and related SET domain proteins [Transcription] Back     alignment and domain information
>PHA03247 large tegument protein UL36; Provisional Back     alignment and domain information
>PHA03247 large tegument protein UL36; Provisional Back     alignment and domain information
>KOG3671 consensus Actin regulatory protein (Wiskott-Aldrich syndrome protein) [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG0132 consensus RNA polymerase II C-terminal domain-binding protein RA4, contains RPR and RRM domains [RNA processing and modification; Transcription] Back     alignment and domain information
>KOG3671 consensus Actin regulatory protein (Wiskott-Aldrich syndrome protein) [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG4368 consensus Predicted RNA binding protein, contains SWAP, RPR and G-patch domains [General function prediction only] Back     alignment and domain information
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification] Back     alignment and domain information
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>smart00333 TUDOR Tudor domain Back     alignment and domain information
>KOG1830 consensus Wiskott Aldrich syndrome proteins [Cytoskeleton] Back     alignment and domain information
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification] Back     alignment and domain information
>smart00743 Agenet Tudor-like domain present in plant sequences Back     alignment and domain information
>KOG0119 consensus Splicing factor 1/branch point binding protein (RRM superfamily) [RNA processing and modification] Back     alignment and domain information
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification] Back     alignment and domain information
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification] Back     alignment and domain information
>KOG4672 consensus Uncharacterized conserved low complexity protein [Function unknown] Back     alignment and domain information
>PRK15319 AIDA autotransporter-like protein ShdA; Provisional Back     alignment and domain information
>KOG2071 consensus mRNA cleavage and polyadenylation factor I/II complex, subunit Pcf11 [RNA processing and modification] Back     alignment and domain information
>KOG4672 consensus Uncharacterized conserved low complexity protein [Function unknown] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1380
4fu6_A153 Crystal Structure Of The Psip1 Pwwp Domain Length = 1e-08
2nlu_A100 Domain-Swapped Dimer Of The Pwwp Module Of Human He 1e-08
1ri0_A110 Nmr Structure Of The N-Terminal Hath Domain Of Huma 2e-08
2b8a_A110 High Resolution Structure Of The Hdgf Pwwp Domain L 2e-08
2m16_A97 P75/ledgf Pwwp Domain Length = 97 2e-08
1n27_A96 Solution Structure Of The Pwwp Domain Of Mouse Hepa 7e-08
3qby_A94 Crystal Structure Of The Pwwp Domain Of Human Hepat 3e-06
3eae_A93 Pwwp Domain Of Human Hepatoma-Derived Growth Factor 3e-06
>pdb|4FU6|A Chain A, Crystal Structure Of The Psip1 Pwwp Domain Length = 153 Back     alignment and structure

Iteration: 1

Score = 59.7 bits (143), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 1/72 (1%) Query: 22 WKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEE 81 +K GDL+ AK+KG+P WPA V E K+ +FFFGT + AF P D+ ++E Sbjct: 23 FKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFPYSEN 82 Query: 82 K-KQSLLTKRQG 92 K K KR+G Sbjct: 83 KEKYGKPNKRKG 94
>pdb|2NLU|A Chain A, Domain-Swapped Dimer Of The Pwwp Module Of Human Hepatoma- Derived Growth Factor Length = 100 Back     alignment and structure
>pdb|1RI0|A Chain A, Nmr Structure Of The N-Terminal Hath Domain Of Human Hdgf Length = 110 Back     alignment and structure
>pdb|2B8A|A Chain A, High Resolution Structure Of The Hdgf Pwwp Domain Length = 110 Back     alignment and structure
>pdb|2M16|A Chain A, P75/ledgf Pwwp Domain Length = 97 Back     alignment and structure
>pdb|1N27|A Chain A, Solution Structure Of The Pwwp Domain Of Mouse Hepatoma- Derived Growth Factor, Related Protein 3 Length = 96 Back     alignment and structure
>pdb|3QBY|A Chain A, Crystal Structure Of The Pwwp Domain Of Human Hepatoma-Derived Growth Factor 2 Length = 94 Back     alignment and structure
>pdb|3EAE|A Chain A, Pwwp Domain Of Human Hepatoma-Derived Growth Factor 2 (Hdgf2) Length = 93 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1380
2l89_A108 PWWP domain-containing protein 1; histone binding, 3e-24
2daq_A110 WHSC1L1 protein, isoform long; PWWP domain, struct 3e-23
1h3z_A109 Hypothetical 62.8 kDa protein C215.07C; nuclear pr 2e-22
3qby_A94 Hepatoma-derived growth factor-related protein 2; 2e-21
1ri0_A110 Hepatoma-derived growth factor; HDGF, HATH domain, 3e-21
1khc_A147 DNA cytosine-5 methyltransferase 3B2; five beta-sh 2e-18
4flb_A132 Regulation of nuclear PRE-mRNA domain-containing; 4e-18
4fld_A135 Regulation of nuclear PRE-mRNA domain-containing 1 1e-17
3llr_A154 DNA (cytosine-5)-methyltransferase 3A; DNA methylt 6e-17
2gfu_A134 DNA mismatch repair protein MSH6; PWWP domain, tud 1e-15
3d9j_A145 RNA-binding protein 16; SCAF8, RNA polymerase II C 1e-14
2km4_A142 Regulator of TY1 transposition protein 103; CTD-in 8e-14
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-13
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-08
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-06
1vt4_I1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-05
3l42_A130 Peregrin; transcription regulation, histone H3 ace 1e-04
3pfs_A158 Bromodomain and PHD finger-containing protein 3; s 8e-04
>2l89_A PWWP domain-containing protein 1; histone binding, protein binding; NMR {Schizosaccharomyces pombe} Length = 108 Back     alignment and structure
 Score = 97.8 bits (243), Expect = 3e-24
 Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 6/108 (5%)

Query: 17  AARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKK-----VLVFFFGTQQIAFCN 71
           +A  +   GD +L K  G+P WPA +   ++   S +          V FF     A+  
Sbjct: 1   SADDRLNFGDRILVKAPGYPWWPALLLRRKETKDSLNTNSSFNVLYKVLFFPDFNFAWVK 60

Query: 72  PADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDS 119
              V+   + +    L   + +  + + A +        LK++   D 
Sbjct: 61  RNSVKPLLDSEIAKFLGSSKRKSKELIEAYEASKT-PPDLKEESSTDL 107


>2daq_A WHSC1L1 protein, isoform long; PWWP domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: b.34.9.2 Length = 110 Back     alignment and structure
>3qby_A Hepatoma-derived growth factor-related protein 2; HDGF2, structural genomics consortium, SGC, protein binding; HET: M3L; 1.95A {Homo sapiens} PDB: 3qj6_A* 3eae_A 1n27_A Length = 94 Back     alignment and structure
>1ri0_A Hepatoma-derived growth factor; HDGF, HATH domain, PWWP domain, heparin-binding, hormone/growth factor complex; NMR {Homo sapiens} SCOP: b.34.9.2 PDB: 2b8a_A 2nlu_A Length = 110 Back     alignment and structure
>1khc_A DNA cytosine-5 methyltransferase 3B2; five beta-sheets barrel followed by five-helix bundle; HET: DNA; 1.80A {Mus musculus} SCOP: b.34.9.2 PDB: 3flg_A* 3qkj_A* Length = 147 Back     alignment and structure
>4flb_A Regulation of nuclear PRE-mRNA domain-containing; structural genomics consortium, SGC, protein binding; 1.80A {Homo sapiens} Length = 132 Back     alignment and structure
>3llr_A DNA (cytosine-5)-methyltransferase 3A; DNA methyltransferase, methylysine binding, STR genomics consortium, SGC, alternative promoter usage; HET: DNA BTB; 2.30A {Homo sapiens} Length = 154 Back     alignment and structure
>2gfu_A DNA mismatch repair protein MSH6; PWWP domain, tudor domain, DNA binding, DNA binding protein; HET: DNA; NMR {Homo sapiens} Length = 134 Back     alignment and structure
>3d9j_A RNA-binding protein 16; SCAF8, RNA polymerase II CTD interacting domain, arm repeats phospho-CTD, phosphoprotein, transcription; 1.60A {Homo sapiens} PDB: 3d9k_A* 3d9l_A* 3d9m_A* 3d9n_A* 3d9p_A* 3d9i_A 3d9o_A* 2diw_A Length = 145 Back     alignment and structure
>2km4_A Regulator of TY1 transposition protein 103; CTD-interacting domain, RNA polymerase II binding protein, phosphoprotein; NMR {Saccharomyces cerevisiae} PDB: 2l0i_A* Length = 142 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>3l42_A Peregrin; transcription regulation, histone H3 acetylation, chromatin modification, structural genomics, structural genomics CONS SGC, activator; 1.30A {Homo sapiens} PDB: 3mo8_A* 2x4w_A* 2x35_A* 2x4x_A* 2x4y_A* Length = 130 Back     alignment and structure
>3pfs_A Bromodomain and PHD finger-containing protein 3; structural genomics, structural genomics consortium, SGC, PW domain, protein binding; 1.90A {Homo sapiens} PDB: 3lyi_A* Length = 158 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 1380
d2nlua1100 b.34.9.2 (A:1-100) Hepatoma-derived growth factor, 2e-21
d2daqa197 b.34.9.2 (A:8-104) Histone-lysine N-methyltransfer 9e-20
d1h3za_108 b.34.9.2 (A:) Hypothetical protein SPBC215.07c {Fi 2e-18
d1khca_137 b.34.9.2 (A:) DNA methyltransferase DNMT3B {Mouse 4e-15
d1szaa_144 a.118.9.4 (A:) PCF11 protein {Baker's yeast (Sacch 8e-10
>d2nlua1 b.34.9.2 (A:1-100) Hepatoma-derived growth factor, HDGF {Human (Homo sapiens) [TaxId: 9606]} Length = 100 Back     information, alignment and structure

class: All beta proteins
fold: SH3-like barrel
superfamily: Tudor/PWWP/MBT
family: PWWP domain
domain: Hepatoma-derived growth factor, HDGF
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 88.1 bits (218), Expect = 2e-21
 Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 19  RRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAF 78
           ++++K GDLV AK+KG+P WPA + E  +    +   K  VFFFGT + AF  P D+  +
Sbjct: 7   QKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGPKDLFPY 66

Query: 79  TEEKKQSLLTKRQGRGADFVRAVQEI 104
            E K++     +  +   F   + EI
Sbjct: 67  EESKEKFG---KPNKRKGFSEGLWEI 89


>d2daqa1 b.34.9.2 (A:8-104) Histone-lysine N-methyltransferase NSD3 {Human (Homo sapiens) [TaxId: 9606]} Length = 97 Back     information, alignment and structure
>d1h3za_ b.34.9.2 (A:) Hypothetical protein SPBC215.07c {Fission yeast (Schizosaccharomyces pombe) [TaxId: 4896]} Length = 108 Back     information, alignment and structure
>d1khca_ b.34.9.2 (A:) DNA methyltransferase DNMT3B {Mouse (Mus musculus) [TaxId: 10090]} Length = 137 Back     information, alignment and structure
>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 144 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query1380
d2nlua1100 Hepatoma-derived growth factor, HDGF {Human (Homo 99.86
d2daqa197 Histone-lysine N-methyltransferase NSD3 {Human (Ho 99.84
d1h3za_108 Hypothetical protein SPBC215.07c {Fission yeast (S 99.84
d1khca_137 DNA methyltransferase DNMT3B {Mouse (Mus musculus) 99.82
d1szaa_144 PCF11 protein {Baker's yeast (Saccharomyces cerevi 99.41
d1mhna_59 Survival motor neuron protein 1, smn {Human (Homo 81.67
>d2nlua1 b.34.9.2 (A:1-100) Hepatoma-derived growth factor, HDGF {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All beta proteins
fold: SH3-like barrel
superfamily: Tudor/PWWP/MBT
family: PWWP domain
domain: Hepatoma-derived growth factor, HDGF
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.86  E-value=2.1e-22  Score=166.82  Aligned_cols=85  Identities=35%  Similarity=0.659  Sum_probs=73.9

Q ss_pred             CCCCCCCCEEEEECCCCCCCCEEEECCCCCCCCCCCCEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHHHCCCCHHHHH
Q ss_conf             07987697999921789889715629898787888975999993799866402797534626789766430479807899
Q 000633           19 RRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFV   98 (1380)
Q Consensus        19 ~~~FkpGDLVWAKLKGYPwWPArIvdPe~~a~Ka~~kky~V~FFGThEyAWV~pkdLkPFsE~keesfl~KrkgK~K~Fk   98 (1380)
                      .+.|++||||||||+|||||||+|+++.....+...++|+|+|||+++|+||..++|++|++++.. +..+  .|.+.|+
T Consensus         7 ~~~f~~GdlVWaK~~gypwWPa~V~~~~~~~~~~~~~~~~V~Ffg~~~~~wv~~~~l~~f~e~~~~-~~~~--~k~~~f~   83 (100)
T d2nlua1           7 QKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGPKDLFPYEESKEK-FGKP--NKRKGFS   83 (100)
T ss_dssp             CCCCSTTCEEEEEETTEEEEEEEEECCTTSSCSCCCSCEEEEETTTTEEEEECGGGEECHHHHHHH-HSCC--SCCHHHH
T ss_pred             CCCCCCCCEEEEECCCCCCCCEEEECCHHHCCCCCCCEEEEEECCCCCEEEECHHHCCCHHHHHHH-HHCC--CCHHHHH
T ss_conf             775876889999668979998798360230346777879999868998799838885470343788-7033--3349999


Q ss_pred             HHHHHHHH
Q ss_conf             99999999
Q 000633           99 RAVQEIID  106 (1380)
Q Consensus        99 eALeEAee  106 (1380)
                      +||+||++
T Consensus        84 ~Av~e~e~   91 (100)
T d2nlua1          84 EGLWEIEN   91 (100)
T ss_dssp             HHHHHHHT
T ss_pred             HHHHHHHH
T ss_conf             99999885



>d2daqa1 b.34.9.2 (A:8-104) Histone-lysine N-methyltransferase NSD3 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1h3za_ b.34.9.2 (A:) Hypothetical protein SPBC215.07c {Fission yeast (Schizosaccharomyces pombe) [TaxId: 4896]} Back     information, alignment and structure
>d1khca_ b.34.9.2 (A:) DNA methyltransferase DNMT3B {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1mhna_ b.34.9.1 (A:) Survival motor neuron protein 1, smn {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure