Citrus Sinensis ID: 000616


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------840-------850-------860-------870-------880-------890-------900-------910-------920-------930-------940-------950-------960-------970-------980-------990------1000------1010------1020------1030------1040------1050------1060------1070------1080------1090------1100------1110------1120------1130------1140------1150------1160------1170------1180------1190------1200------1210------1220------1230------1240------1250------1260------1270------1280------1290------1300------1310------1320------1330------1340------1350------1360------1370------1380------139
MAPSRRKGASKAAAAAAARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPSSEEELKSSLQTEEKPSKDTAVNLVEHISPSLNNYDEGFTEKGPSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQSSSSSPYARSQYKKSLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAYSRRSSRTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAVTPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQDVPPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPVGIRAPGAWRPR
ccccccccHHHHHHHHHHHcccccccEEEEEccccccccccccccccccccccccEEEEEEEccccEEEcccccccccHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHccccccccccccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHcHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHccccccccccccccccHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHccccccccccEEEEHHHHHHHccccccccccccHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
cccccccccHHHHHHHHHHHHcccccEEEEEEccccccccEEccccccccccccccEEEEEEccccEEEccHHHcccccHHHHHHHccccccccccHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccEEEHccccccccccccccccccHHccccccccccccccccccccccccccccccccccccccccccHccccccccccHHHccccccccccccccccccccHccccccccccccccccccccccccccccccccccccccccccccEcccccccccccccHHccccccHHHHHHHHHHccccHHHccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEcccccccccccccccccccccccccccccccccccccccccEcccccccccccccccccccccccccccEccccccccccccccccccccHHHHHHHHHHHHHHccccccccccccHHcccccccccccccccccccccccccccccccccEEcccccccHcccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHccccccHHEEEEEEHHHHHHHccccccccccccHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHccccccccccccccccccccccccccHHHHcccEEcccccccccccccccccHHcccccccccccccccccccccccccccccccccccHHcccEEEEEcccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
MAPSRRKGASKAAAAAAARRQWKVGDLVLAKvkgfpawpatvsepekwgysadWKKVLVFFFGtqqiafcnpadveaFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKkqdqvdsnsgdeltvanggnsvnsishlkdrteaseatldsqmkpsnstagdglnlptedspagrqldalpakeplpeqpsenlvakatpvlttyssrkrsggsrlqstqrmapstrrsrsstmVESCRLqnlmmpynnegknaegisaksildgslirnkrtrkspdgsecndldssalmsngsiednsseivtvesdafslnegstvdsgckvedsetvlecldgdemLSKRLDFQIKAVVVKkkrkpnrkrvcndavdpparintatevdvstrntchssentggnlderdfkedgdehlPLVKRARvrmgkpsseeELKSslqteekpskdTAVNLVEhispslnnydegftekgpslvkestdsvspskvcsevsgnrpqlwkgttnqsfgcsadgeaalppsKRLHRALEAMSANAAEEGQASVQASSLintsingccvnsickcshetvdsrersgsglqnvptcdqlsencnsqkqesfrddvgsvdnvdgkdlpgspfsvhTIQTAVqtqtpvnilpdpkkrhssFQLYQNSldqlslkdegsaedlqlkdsrvenvdkefNTSALVelsldpvsgadesvklspqngsnelqysvqgmsyenseslksqiddnchINARCEAVEEIKQNEKqkemssvsisddhlgdkgVASVlfssspaegvdsparvsppntslchvstsesanivqssssspyarsqykkslgapvadegkvdttltqrpksvgkwsscsEAHAALSSFEAVLGSLTRTKESIGRATRIAIdcakfgvsSKVVEIVARHLESESSLYRRVDLFFLVDSIMQcsrgmkgdvsgiipSAILTVLPrllsaaappgnvaqeNRRQCLKVLRLWLERRILPESIIRHHMreldtvtcsssavaysrrssrteralddpvrdmEGMLVdeygsnssfqlpgfcmprmlkddddgsdsdggsfeavtpehnseipeerdanpsmkkHRHILEEvdgelemedvaptcdnemssTVLVDIAQTshdqllsfvpplpqdvppsspplpsspppvlppppsiphscafsdsysngasmqnmqndgqqsvtqqsvaprinstvstnavhyhasecrdhqmpmqmpestssfgcysmrpsnnfqqtdgprfhnkpypprpphapqsnhfsyvqasqsaksrreaphpsnshrfhphpnfdggnfynnhdrmkpgpyehreswrfsapsfsgprypdearecypsgsyggplrepprysnrgwaypprpmnhrhmrppsggavpvgirapgawrpr
mapsrrkgaSKAAAAAAARRQWKVGDLVLAKVKgfpawpatvsepekwgysADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSlltkrqgrgadFVRAVQEIIDSYEklkkqdqvdsnsgdeltvanggnsvnsisHLKDRTEASEatldsqmkpsnstagdglnLPTEDSPAGRQLDALPAKeplpeqpsenlvakatpvlttyssrkrsggsrlqstqrmapstrrsrsstmvESCRLQNLMMPYNNEgknaegisaksildgslirnkrtrkspdgsecndlDSSALMSNGSIEDNSSEIVTVESDAFslnegstvdsgckvedsETVLECLDGDEMLSKRLDFQIKavvvkkkrkpnrkrvcndavdpparintatevdvstrntchssentggnlderdfkedgdehlplvkrarvrmgkpsseeelksslqteekpskdTAVNLVEHispslnnydeGFTEKGPSLVkestdsvspskvcsevsgnrpqlwkgttnqsfgcSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADEsvklspqngsnELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTsesanivqssssspyarSQYKKSlgapvadegkvdttltqrpksvgkwSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIaidcakfgvsSKVVEIVARHlesesslyrrVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAappgnvaqenrrQCLKVLRLWLERRILPesiirhhmreldtvtcsssavaysrrssrteralddpvrDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFeavtpehnseipeerdanpsmkkHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQDVPPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHpsnshrfhphpnFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYpsgsyggplrepprySNRGWAYPPRPMNHRHmrppsggavpvgirapgawrpr
MapsrrkgaskaaaaaaarrQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIkavvvkkkrkpnrkrvCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPsseeelksslqteekPSKDTAVNLVEHISPSLNNYDEGFTEKGPSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQsssssPYARSQYKKSLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCsssavaysrrssrTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKddddgsdsdggsFEAVTPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFvpplpqdvppsspplpsspppvlppppsipHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYpprpmnhrhmrppSGGAVPVGIRAPGAWRPR
*****************ARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFT**************GADFVRAVQEIIDS*******************************************************************************************************************************************************************************************************************************ETVLECLDGDEMLSKRLDFQIKAVVVK*********************************************************************************************************************************************************************************************LINTSINGCCVNSICKCSH***************************************************************************************************************************************************************************HI***********************************************************************************************************************HAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCS******************************************************************************************************************VLVDIA******************************************************************************************************************************************************************************************************************************************************************************
************************GDLVLAKVKGFPAWPATVSEP******ADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQ**********ADFVRAVQEII*********************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************AALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGN***ENRRQCLKVLRLWLERRILPESIIRHHMRELDTVT**********************VRDMEGMLVDEYGSNSSFQLPGFCMPRML*************************************HRHILEEVDGELEMEDVAP*********************************************************************************************************************************************************************************************************************************************************************************************
********************QWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKL***********DELTVANGGNSVNSISHL*********************AGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTT*******************************ESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNK************DLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARV************************TAVNLVEHISPSLNNYDEGFTEKG**********************NRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAM*************ASSLINTSINGCCVNSICKCSH************LQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEE*****************DHLGDKGVASVLFSSS**************NTSL*************************************KVDTTL**************EAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTV******************ALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDD********************************KKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQD**************VLPPPPSIPHSCAFSDSYSNGASMQN**************APRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNH**********************HRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPVGIRAPGAWRPR
******************RRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLK**********************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************FGCSADGEAALPPSKRL*RALEAMSAN***********************************************************************************************************************************************************************************************************************************************************************************************************************************AHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTV*****************RALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRML**********************************MKKHRHILEEVDGELEMEDV************************************************VLPPPP************************GQQS**QQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPV***AP******
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAPSRRKGASKAAAAAAARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPSSEEELKSSLQTEEKPSKDTAVNLVEHISPSLNNYDEGFTEKGPSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQLSENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLSPQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQSSSSSPYARSQYKKSLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAYSRRSSRTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAVTPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLPQDVPPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGASMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCYSMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHRHMRPPSGGAVPVGIRAPGAWRPR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query1389 2.2.26 [Sep-21-2011]
F4IZM81347 HUA2-like protein 2 OS=Ar yes no 0.923 0.952 0.403 0.0
F4IN781366 HUA2-like protein 3 OS=Ar no no 0.439 0.447 0.467 1e-128
Q9XER91392 ENHANCER OF AG-4 protein no no 0.205 0.205 0.477 3e-64
Q9LEY41445 HUA2-like protein 1 OS=Ar no no 0.182 0.175 0.456 4e-58
Q9Y3E1203 Hepatoma-derived growth f yes no 0.092 0.630 0.320 3e-09
P51859237 Hepatoma-derived growth f yes no 0.059 0.345 0.379 7e-09
Q99JF8528 PC4 and SFRS1-interacting no no 0.052 0.138 0.418 8e-09
Q923W4202 Hepatoma-derived growth f yes no 0.092 0.633 0.302 8e-09
Q9XSK7239 Hepatoma-derived growth f yes no 0.059 0.343 0.390 8e-09
Q8VHK7237 Hepatoma-derived growth f no no 0.059 0.345 0.390 9e-09
>sp|F4IZM8|HUAL2_ARATH HUA2-like protein 2 OS=Arabidopsis thaliana GN=At3g63070 PE=2 SV=1 Back     alignment and function desciption
 Score =  762 bits (1968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 586/1453 (40%), Positives = 798/1453 (54%), Gaps = 170/1453 (11%)

Query: 1    MAPSRRKGASKAAAAAAARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVF 60
            MAPSR++G  +AAAA++ARR+WKVGDLVLAKVKGFPAWPA V EPEKWG+SAD KKV V 
Sbjct: 1    MAPSRKRGGGRAAAASSARREWKVGDLVLAKVKGFPAWPAVVDEPEKWGHSADSKKVTVH 60

Query: 61   FFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDS- 119
            FFGTQQIAFCN  DVE+FTEEKKQSLLT+R  +G+DFVRAV+EI +SYEKLK+QDQ    
Sbjct: 61   FFGTQQIAFCNHGDVESFTEEKKQSLLTRRHAKGSDFVRAVKEITESYEKLKQQDQASGP 120

Query: 120  NSGDELTVANGGNSVNSISHLKDRTEASEAT-LDSQMKPSNSTAGDGLNLPTEDSPAGRQ 178
               +E T  + GN+    S L    E    + LD+Q++ S+S   D L L +ED+ A  Q
Sbjct: 121  KYAEETTAGSSGNT----SQLPQACENLIGSRLDTQIESSSSHGRDELTLLSEDASAAEQ 176

Query: 179  LDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRL--QSTQRMAPSTRRSRSSTM 236
            + AL           ++  AK    + TYSSR+R+   R    + Q +      S+ S+ 
Sbjct: 177  MLALRHNTLAHNGACDSAAAKDLCEIATYSSRRRNERVRALKYAPQSIILPVEHSKISSR 236

Query: 237  VESCRLQNLMMPYNNEGKNAEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNG 296
            +E  R+Q  M+  ++ G +   I      +G  IR ++  ++   SE +D+ SS L  +G
Sbjct: 237  LELDRVQRSMLQCSDGGPSVNSI------NGKAIRRRKRIRTSGQSESDDVVSSDLNLHG 290

Query: 297  SIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVV 356
            S EDN+SEI TVES+  S NEG+ VDSG KVE S+ V E  DG   L+K LDF I  +V 
Sbjct: 291  SDEDNASEIATVESNNNSRNEGNGVDSGSKVEYSDAVGEGCDGGHELNKGLDFHISTMVK 350

Query: 357  KKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLP 416
            +KKRKP RKR  +D +DPPA++  A  +  +  ++C  S+N+   L+ER  +E+GDEHLP
Sbjct: 351  RKKRKPTRKRETSDIIDPPAKVE-AEGLGPNACDSCQRSQNSHERLNERPCEENGDEHLP 409

Query: 417  LVKRARVRMGKP-SSEEELKSSLQTEEKPSKDTAVNLVEHISPSLNN-------YDEGFT 468
            LVKRARVRM +   ++E++ +S Q EE+ SKDT ++     SPS+N+       +D    
Sbjct: 410  LVKRARVRMSRAFYADEKVNASSQVEERSSKDTLLSAALQTSPSVNHENGIGSGHDTSAA 469

Query: 469  EKG-----------------PSLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSA 511
            E+                  PS +++ +D +SPS  C +  G+R        N+ F  + 
Sbjct: 470  EEFNSFELSAKLSGVMVDVVPSHMEKPSDRMSPSVACVQTVGDRQTAVNFHENE-FTMTL 528

Query: 512  DGEAALPPSKRLHRALEAMSANAAEEGQASVQASSLINTSINGCCVNSICKCSHET---- 567
            D E     S +L   +E   A   E  Q   + S   N  I+      I +CSH++    
Sbjct: 529  DDEVTRAQSNQLSSLVET-EARVPEVVQGCSEESQTGNCLISETDPIDI-QCSHQSEKHE 586

Query: 568  -------VDSRERSGSGLQNVPTCDQLSENCNSQKQES-FRDDVGSVDNVD------GKD 613
                   VDS      GL     C  L         +S  +  +   D+ D      G  
Sbjct: 587  TPLNPDIVDSSANKSPGL-----CSSLDMTTTVVPAQSPHQHKIQEYDSSDHSLVIVGDS 641

Query: 614  LPGSPFSV-HTIQTAVQTQTPVNILPDPKKRHSSFQLYQNSLDQLSLKDEGSAEDLQLKD 672
            L G    + + +   VQ+Q    + P P        L+ + ++   +++    E+   K+
Sbjct: 642  LNGKCEKIDYCMTQVVQSQA---LEPPPP-------LFCSVVNYQEVENLQETENTLWKE 691

Query: 673  SRVENVDKEFNTSALVELSLDPVSGADESVKL----SPQNGSNELQYSVQGMSYENSESL 728
            ++  +  KE ++     +  +PV  A ES  +     PQ    E  YS    + EN E  
Sbjct: 692  NQ-GSPGKELDSDKQAHMIQNPVLSATESEMIVDDAEPQY---ETVYSHCADAVENRELE 747

Query: 729  KSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDDHLGDKGVASVLFSSSPAEGVDSP 788
            KS           CE  E   Q E+ +  +S+S+S++   +K       +SSPA G    
Sbjct: 748  KS-----------CEVDE---QKEQMQATNSISVSENFSREK------LNSSPARG---- 783

Query: 789  ARVSPPNTSLCHVSTSESANIVQSSSS-SPYARSQYKKSLGA-PVADEGKVDTTLTQRPK 846
               +P   S+C +ST+ES N +Q++S  S   +    KSL    V +E KV+T  TQ  K
Sbjct: 784  ---TPNCNSVCRISTAESENAMQNNSYYSTNVQYGENKSLNVDTVKEESKVETGTTQVKK 840

Query: 847  SVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIAIDCAKFGVSSKVVEIVARHLE 906
             V      S+    + SFE  L SL RTKE+IGRATR+A+D AKFGVS+K +EI+A  LE
Sbjct: 841  VVS-----SDVQCTVESFETALDSLVRTKETIGRATRLAMDLAKFGVSAKAMEILAHTLE 895

Query: 907  SESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILTVLPRLLSAAAPPGNVAQENRR 966
            SES+L RRVDLFFLVDSI QCS+G+ GD  G+  S+I  +LPRLL+AA P G   QENR+
Sbjct: 896  SESNLQRRVDLFFLVDSIAQCSKGLNGDAGGVYLSSIQAMLPRLLTAAVPAGATTQENRK 955

Query: 967  QCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAYSRRSSRTERALDDPVRDMEGM 1026
            QCLKVLRLWLERRILPESI+RHH+RELD+++ +  A  YSRRS+RTERALDDPVRDMEG+
Sbjct: 956  QCLKVLRLWLERRILPESIVRHHIRELDSLS-NVPACLYSRRSARTERALDDPVRDMEGI 1014

Query: 1027 LVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAVTPEHNSEIPEERDANPSMKKH 1086
            LVDEYGSNS+ QL GFC+P +L+D+D+GSDSDGG FE+VTPEH S   EE       ++H
Sbjct: 1015 LVDEYGSNSTLQLHGFCIPPILRDEDEGSDSDGGDFESVTPEHESRSLEEHVTPSITERH 1074

Query: 1087 RHILEEVDGELEMEDVAPTCDNEMSSTVLVDIAQTSHDQLLSFVPPLP-QDVPPSSPPLP 1145
              ILE+VDGELEMEDVAP  +   S++ + D A          VP    Q+V  SSP   
Sbjct: 1075 TRILEDVDGELEMEDVAPPWEGGSSASAITDQADNRESANCLLVPGTSHQNVTSSSP--- 1131

Query: 1146 SSPPPVLPPPPSIPHSCAFSDSYSNGASMQ---NMQNDGQQSVTQQSVAPRINSTVSTNA 1202
                   P  PS     A S+SYSNG   +   +MQ D        +  PR+N       
Sbjct: 1132 -------PARPSQNAQLAMSNSYSNGFDYRRNPSMQGD------YHAGPPRMNP-----P 1173

Query: 1203 VHYHASECRDHQMPMQMPESTSSFGCYSMRP---SNNFQQTDGPRFHNKPYPPRPPHAPQ 1259
            +HY + E          P  +S        P    +NFQ       H       PP  P 
Sbjct: 1174 MHYGSPE----------PSYSSRVSLSKSMPRGEGSNFQ-------HRPYPSSHPPPPPP 1216

Query: 1260 SNHFSYVQASQSAKSRREAPHPSNSHRFHPHPNFDGGNFYNNHDRMKPGPYEHRESWRFS 1319
            S+H+SY++     KSRRE    S  HR H    FD  N+ ++++RM+P P E+R++WR+ 
Sbjct: 1217 SHHYSYMEPDHHIKSRREG--LSYPHRSHYTLEFDERNYQDSYERMRPEPCENRDNWRYH 1274

Query: 1320 APSFSGPRYPDEARECYPSGSYGGPLREPPRYSNRGWAYPPRPMNHR---HMRPPSGGAV 1376
             PS  GPRY D  +  + S SY G  R+  R  N  W+  PR  N+R   H +  S G V
Sbjct: 1275 PPSSHGPRYHDRHKGPHQSSSYSGHHRDSGRLQNNRWSDSPRAYNNRHSYHYKQHSEGPV 1334

Query: 1377 PVGIRAPGAWRPR 1389
            PVG+R PG W  R
Sbjct: 1335 PVGMRDPGTWHQR 1347




Probable transcrition factor that seems to be involved in mRNA processing.
Arabidopsis thaliana (taxid: 3702)
>sp|F4IN78|HUAL3_ARATH HUA2-like protein 3 OS=Arabidopsis thaliana GN=At2g48160 PE=2 SV=2 Back     alignment and function description
>sp|Q9XER9|HUA2_ARATH ENHANCER OF AG-4 protein 2 OS=Arabidopsis thaliana GN=HUA2 PE=2 SV=1 Back     alignment and function description
>sp|Q9LEY4|HUAL1_ARATH HUA2-like protein 1 OS=Arabidopsis thaliana GN=At5g08230 PE=2 SV=1 Back     alignment and function description
>sp|Q9Y3E1|HDGR3_HUMAN Hepatoma-derived growth factor-related protein 3 OS=Homo sapiens GN=HDGFRP3 PE=1 SV=1 Back     alignment and function description
>sp|P51859|HDGF_MOUSE Hepatoma-derived growth factor OS=Mus musculus GN=Hdgf PE=1 SV=2 Back     alignment and function description
>sp|Q99JF8|PSIP1_MOUSE PC4 and SFRS1-interacting protein OS=Mus musculus GN=Psip1 PE=1 SV=1 Back     alignment and function description
>sp|Q923W4|HDGR3_RAT Hepatoma-derived growth factor-related protein 3 OS=Rattus norvegicus GN=Hdgfrp3 PE=2 SV=1 Back     alignment and function description
>sp|Q9XSK7|HDGF_BOVIN Hepatoma-derived growth factor OS=Bos taurus GN=HDGF PE=2 SV=1 Back     alignment and function description
>sp|Q8VHK7|HDGF_RAT Hepatoma-derived growth factor OS=Rattus norvegicus GN=Hdgf PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1389
255584708 1558 glutathione peroxidase, putative [Ricinu 0.940 0.838 0.550 0.0
2254369581479 PREDICTED: uncharacterized protein LOC10 0.994 0.933 0.527 0.0
2960867161459 unnamed protein product [Vitis vinifera] 0.975 0.928 0.482 0.0
3574408091396 Hepatoma-derived growth factor-related p 0.949 0.944 0.472 0.0
356503527 1536 PREDICTED: uncharacterized protein LOC10 0.924 0.835 0.459 0.0
356536756 1561 PREDICTED: uncharacterized protein LOC10 0.955 0.850 0.472 0.0
449436812 1580 PREDICTED: HUA2-like protein 3-like [Cuc 0.955 0.839 0.460 0.0
306957191347 Tudor/PWWP/MBT domain-containing protein 0.923 0.952 0.403 0.0
2978249251379 hypothetical protein ARALYDRAFT_484006 [ 0.755 0.761 0.430 0.0
75734361261 putative protein [Arabidopsis thaliana] 0.791 0.871 0.418 0.0
>gi|255584708|ref|XP_002533075.1| glutathione peroxidase, putative [Ricinus communis] gi|223527139|gb|EEF29314.1| glutathione peroxidase, putative [Ricinus communis] Back     alignment and taxonomy information
 Score = 1315 bits (3404), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 779/1416 (55%), Positives = 946/1416 (66%), Gaps = 109/1416 (7%)

Query: 21   QWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTE 80
            QWKVGDLVLAKVKGFPAWPATVSEPEKWGY+ADWKKVLV+FFGTQQIAFCNPADVEAFTE
Sbjct: 22   QWKVGDLVLAKVKGFPAWPATVSEPEKWGYAADWKKVLVYFFGTQQIAFCNPADVEAFTE 81

Query: 81   EKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDS-NSGDELTVANGGNSVNSISH 139
            EKKQSLL KRQG+GADFVRAVQEII+SYEKLKK DQVD  NSG+E+T+ NGG+S+ S ++
Sbjct: 82   EKKQSLLVKRQGKGADFVRAVQEIIESYEKLKKSDQVDDRNSGEEITLVNGGHSMESSAY 141

Query: 140  --LKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLV 197
              LK +TE SEAT+  +  P     G  +++P        Q   L  KE   EQP++N+ 
Sbjct: 142  FELKGQTETSEATVTGRDDP-----GLAVDVP--------QSGTLHDKEDSTEQPADNMA 188

Query: 198  AKATPVLTTYSSRKRSGG--SRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKN 255
                P + TY+SRKRS G  SR  + Q+   S  RS S   +ES R QN M+P N   K+
Sbjct: 189  VPVKPGIATYTSRKRSLGLRSRKHAAQKNDSSVERSGSLPRLESSRFQNFMLPSNEGSKS 248

Query: 256  AEGISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSL 315
            A   S   I D +L RNKRTR+SPD SE +D+DSSA +SNGSIED+ SEIVTV+SD+ SL
Sbjct: 249  AGDASTDVISDRALRRNKRTRRSPDASEWDDVDSSAFVSNGSIEDDGSEIVTVDSDSLSL 308

Query: 316  NEGSTVDSGCKVEDSETVLECLDGDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPP 375
            NEGST+DS  K E SET +ECL+GD  LSK LDFQIKAV +KKKRK NRKRV N+A +PP
Sbjct: 309  NEGSTIDSASKPEHSETFVECLEGDVELSKGLDFQIKAVFIKKKRKQNRKRVSNEAAEPP 368

Query: 376  AR-INTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPSSEEEL 434
            AR + T   +D  T ++  + +N G NL+ER  KEDGDEHLPLVKRARVRMGK SS +E 
Sbjct: 369  ARRLETEVYLDAETHSSSQNLKNAGDNLNERHNKEDGDEHLPLVKRARVRMGKLSSLQEH 428

Query: 435  KSSLQTEEKPSKDTAVN----------------------LVEHISPSLNNYDEGFTEKGP 472
             S  Q EEK S + A+N                       +E I PS N  D+   +K  
Sbjct: 429  ASFSQDEEKISNEVAINPTEVNNGLCQVEERTLNEVAVATLERIVPSDNLSDDCSADKDS 488

Query: 473  SLVKESTDSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSA 532
              +K + D  SP K  +++  NRPQ+     +QSFGC+ADGEAALPPSKRLHRALEAMSA
Sbjct: 489  FSLKGALDIASPPKAHAQIPVNRPQILILKESQSFGCTADGEAALPPSKRLHRALEAMSA 548

Query: 533  NAAEEGQASVQASSLINTSINGCCVNSICKCSHETVDSRERSGSGLQNVPTCDQ----LS 588
            NAAEEG A  + +S+  TS+N     S+   S   V+ +E +GSG Q            S
Sbjct: 549  NAAEEGHACAE-TSIKKTSMNDGSTFSMKGSSGMVVERKENNGSGEQITEGLSHGASAFS 607

Query: 589  ENCNSQKQESFRDDVGSVDNVDGKDLPGSPFSVHTIQTAVQTQTPVNILPDPKKRHSSFQ 648
             + N   +ES R                SP         +++ T         +RH    
Sbjct: 608  SSSNRVLEESVR----------------SPLDRENCNELIESST--------SQRHHKDA 643

Query: 649  L---YQNSLDQLSLKDEGSAEDLQLKDSRVENVDKEFNTSALVELSLDPVSGADESVKLS 705
            L   + N  D      EG AED +L+ +  EN  ++ + S    L+   +S A+E  K +
Sbjct: 644  LALGFHNGRDVNGSCIEGHAEDAELRVAGGENRVEDVSISESSRLNASLISLANEGTKGT 703

Query: 706  PQNGSNELQYSVQGMSYENSESLKSQIDDNCHINARCEAVEEIKQNEKQKEMSSVSISDD 765
              NGS+ LQ +    + EN+E+L++Q+DDN             + N  +KE    S S+D
Sbjct: 704  SLNGSDALQNTADDTACENTETLRTQVDDNS------------RDNGIRKESCYASSSED 751

Query: 766  HLGDKGVASVLFSSSPAEGVDSPARVSPPNTSLCHVSTSESANIVQSSSSSPYARSQYKK 825
            HLG +    V  S  PA+G++SPA+ SPP TS+CHVST+ESAN +Q+S  S    SQ K 
Sbjct: 752  HLGGRDGLGVGSSPVPADGMESPAQTSPPTTSICHVSTAESANFIQNSGCSSPNHSQQKT 811

Query: 826  SLGAPVADEGKVDTTLTQRPKSVGKWSSCSEAHAALSSFEAVLGSLTRTKESIGRATRIA 885
            ++   V DE K+++   QRPKSVGKWSS +EAHAALSSFE +LGSLTRTKESIGRATRIA
Sbjct: 812  TVCTSVVDEEKIESVAPQRPKSVGKWSSYAEAHAALSSFEGMLGSLTRTKESIGRATRIA 871

Query: 886  IDCAKFGVSSKVVEIVARHLESESSLYRRVDLFFLVDSIMQCSRGMKGDVSGIIPSAILT 945
            IDCAKFGVS+KVV+I+AR LESES+L+RRVDLFFLVDSI QCSRG+KGDV GI PSAI  
Sbjct: 872  IDCAKFGVSAKVVDILARTLESESNLHRRVDLFFLVDSITQCSRGLKGDVGGIYPSAIQA 931

Query: 946  VLPRLLSAAAPPGNVAQENRRQCLKVLRLWLERRILPESIIRHHMRELDTVTCSSSAVAY 1005
            VLPRLLSAAAPPG+ AQENRRQCLKVLRLWLERRILPE ++RHHMRE+D++  SSS  AY
Sbjct: 932  VLPRLLSAAAPPGSFAQENRRQCLKVLRLWLERRILPEPVVRHHMREIDSLGGSSSGGAY 991

Query: 1006 SRRSSRTERALDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDDDDGSDSDGGSFEAV 1065
            SRRS+RTER LDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKD+D+GSDSDG SFEAV
Sbjct: 992  SRRSARTERPLDDPVRDMEGMLVDEYGSNSSFQLPGFCMPRMLKDEDEGSDSDGESFEAV 1051

Query: 1066 TPEHNSEIPEERDANPSMKKHRHILEEVDGELEMEDVAPTCDNEMSS------TVLVDIA 1119
            TPEHNSE PEE D+ P+++KH HILE+VDGELEMEDVAP+C+ E SS         V I 
Sbjct: 1052 TPEHNSETPEEHDSAPAIEKHTHILEDVDGELEMEDVAPSCEIEASSAGGIGGVNAVHIP 1111

Query: 1120 QTSHDQLLS--FVPPLPQDV--PPSSPPLPSSPPPVLPPPPSIPHSCAFSDSYSNGA--- 1172
            Q+  +Q  S  F PPLPQDV       P    PPP  PPPP+I  S    D Y NG    
Sbjct: 1112 QSQLEQHFSLPFAPPLPQDVPPSSPPLPTSPPPPPPPPPPPAILPSSGMPDPYVNGVDSK 1171

Query: 1173 ---SMQNMQNDGQQSVTQQSVAPRINSTVSTNAVHYHASECRDHQMPMQMPESTSSFGCY 1229
               +   M +D +++V Q   APRI S++ T+ VHYHA+ECRD QM MQ+ +STSSF  Y
Sbjct: 1172 LYTNSHYMHDDLRETVAQPLAAPRITSSI-TDGVHYHATECRD-QMQMQLCDSTSSFSSY 1229

Query: 1230 SMRPSNNFQQTDGPRFHNKPYPPRPPHAPQSNHFSYVQASQSAKSRREAPHPSNSHRFHP 1289
               P NN Q  D P FH+K Y PRPPH P SN FSYVQA Q  KSRR +P PS+ HR+  
Sbjct: 1230 PACPVNNVQHADSPNFHHKAYAPRPPHHPPSNQFSYVQAGQHVKSRRASPPPSHHHRYQS 1289

Query: 1290 HPNFDGGNFYNNHDRMKPGPYEHRESWRFSAPSFSGPRYPDEARECYPSGSYGGPLREPP 1349
              N DGGN+YNNH+RM+P PY+  ESWR+  P F GPRYPD++R  YP G YGGP REP 
Sbjct: 1290 SHNTDGGNYYNNHERMRPAPYD--ESWRYPPPPFPGPRYPDKSRASYPRGPYGGPPREPT 1347

Query: 1350 RYSNRGWAYPPRPMNHRH---MRPPSGGAVPVGIRA 1382
            R  ++GW+YP + M+HR+    RPPS  AVPV  R 
Sbjct: 1348 RMPHQGWSYPSQDMHHRNFMPFRPPS-DAVPVSNRV 1382




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225436958|ref|XP_002271866.1| PREDICTED: uncharacterized protein LOC100261323 [Vitis vinifera] Back     alignment and taxonomy information
>gi|296086716|emb|CBI32351.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357440809|ref|XP_003590682.1| Hepatoma-derived growth factor-related protein [Medicago truncatula] gi|355479730|gb|AES60933.1| Hepatoma-derived growth factor-related protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|356503527|ref|XP_003520559.1| PREDICTED: uncharacterized protein LOC100799006 [Glycine max] Back     alignment and taxonomy information
>gi|356536756|ref|XP_003536901.1| PREDICTED: uncharacterized protein LOC100803427 [Glycine max] Back     alignment and taxonomy information
>gi|449436812|ref|XP_004136186.1| PREDICTED: HUA2-like protein 3-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|30695719|ref|NP_191866.3| Tudor/PWWP/MBT domain-containing protein [Arabidopsis thaliana] gi|403399430|sp|F4IZM8.1|HUAL2_ARATH RecName: Full=HUA2-like protein 2 gi|332646910|gb|AEE80431.1| Tudor/PWWP/MBT domain-containing protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297824925|ref|XP_002880345.1| hypothetical protein ARALYDRAFT_484006 [Arabidopsis lyrata subsp. lyrata] gi|297326184|gb|EFH56604.1| hypothetical protein ARALYDRAFT_484006 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|7573436|emb|CAB87752.1| putative protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1389
TAIR|locus:20393511366 AT2G48160 [Arabidopsis thalian 0.404 0.411 0.384 6.4e-192
TAIR|locus:20992421347 AT3G63070 "AT3G63070" [Arabido 0.920 0.948 0.362 2.4e-170
TAIR|locus:21782921392 HUA2 "ENHANCER OF AG-4 2" [Ara 0.203 0.202 0.457 9.9e-75
TAIR|locus:21815691445 AT5G08230 "AT5G08230" [Arabido 0.247 0.238 0.359 2.4e-65
UNIPROTKB|Q9Y3E1203 HDGFRP3 "Hepatoma-derived grow 0.112 0.768 0.307 9.1e-13
RGD|628773202 Hdgfrp3 "hepatoma-derived grow 0.086 0.594 0.343 9.2e-13
MGI|MGI:1352760202 Hdgfrp3 "hepatoma-derived grow 0.086 0.594 0.343 1.9e-12
CGD|CAL00038741404 PGA55 [Candida albicans (taxid 0.562 0.556 0.184 9.6e-12
UNIPROTKB|Q59SG91404 PGA55 "Flocculin-like protein" 0.562 0.556 0.184 9.6e-12
ZFIN|ZDB-GENE-050213-2201 hdgfrp3 "hepatoma-derived grow 0.092 0.636 0.335 2.6e-11
TAIR|locus:2039351 AT2G48160 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 836 (299.3 bits), Expect = 6.4e-192, Sum P(3) = 6.4e-192
 Identities = 233/606 (38%), Positives = 326/606 (53%)

Query:    23 KVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEK 82
             KVGDLVLAKVKGFPAWPA VSEPEKW  S D KKV V FFGTQQIAFCNP DVEAFTEE+
Sbjct:    23 KVGDLVLAKVKGFPAWPAVVSEPEKWDASPDSKKVFVHFFGTQQIAFCNPGDVEAFTEER 82

Query:    83 KQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQV-DSNSGDELTVANGGNSVNSISHLK 141
             KQSLLT+R  +G+DFVRAV+EII+SYEKLK+Q++  D  S +E T+ +  N+    + + 
Sbjct:    83 KQSLLTRRHAKGSDFVRAVKEIIESYEKLKQQERASDPKSAEEGTLGSAENT----TLMP 138

Query:   142 DRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQLDALPAKEPLPEQPSENLVAKAT 201
                E   AT  +QM    S   D   L  ED+ A  Q+ AL        +  ++ V K  
Sbjct:   139 QVIEIPTATSLTQMNSDPSHGRDESTLLNEDASAAEQMLALRDNSGPRNKACDSAVVKEP 198

Query:   202 PVLTTYSSRKRSGGSRLQST--QRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEGI 259
               + TYSSRKR+GG R Q+   Q      +RS+S + +++ +LQ+ M+  N++G    G 
Sbjct:   199 RKIATYSSRKRNGGVRSQNCAPQNETCPVQRSKSPSRLQTEKLQSSMLQ-NSDG----GQ 253

Query:   260 SAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGS 319
             +   + DG+L R KR R+S   SE +D+ +S+L S+GS E+N+SEI TVESD  + NEG+
Sbjct:   254 TIDDVEDGALRREKRIRRSSGHSESDDVATSSLNSHGSDEENASEIATVESDN-NRNEGN 312

Query:   320 TVDSGCKVEDSETVLECLDGDEMLSKRLDFQIXXXXXXXXXXXXXXXXCNDAVDPPARIN 379
              VDSG KVE  +   + L+GD  L+K L+FQI                 +D VDP A++ 
Sbjct:   313 GVDSGSKVEQIDIGGKFLEGDYDLNKGLNFQINIMVKRKKRKPTRKRGTSDVVDPQAKVE 372

Query:   380 TATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLPLVKRARVRMGKPXXXXXXXXXXX 439
                  +   RN   +S+N+     ER  +E+GDEHLPLVKRARVRM +            
Sbjct:   373 GEAVPEAGARNNVQTSQNSHEKFTERPCEENGDEHLPLVKRARVRMSRAFYGNHEANSSL 432

Query:   440 XXXXPS-KDTAVNLVEHISPS--LNNYDEGFTEKG-----------------PSLVKEST 479
                  S KDT V+     SPS  ++++D    E+                  PS V++S 
Sbjct:   433 QAEERSPKDTVVSATAQTSPSDIISSHDTFAVEESKFFEVSAKLSGDMVNVAPSPVEKSH 492

Query:   480 DSVSPSKVCSEVSGNRPQLWKGTTNQSFGCSADGEAALPPSKRLHRALEAMSANAAEEGQ 539
             D +SPS+ C +    R +   G    S   + D ++A P   ++  +L A  A  A   +
Sbjct:   493 DGMSPSEACVQTVRER-EYAMGWNELSK--TPDDKSAGPQYNQVS-SLPAGEAQTASVPE 548

Query:   540 ASV-QASSLINT-----SINGCCVNSI-CKCSHETVDSRERSGSGLQNVPTCDQLSENCN 592
             A   +   L+ +     ++  C V  I       TVDS   + S + ++    QLS    
Sbjct:   549 AVCPEVLKLLTSESDLPAVQYCQVAKIEPSMDPNTVDSSANNASEICSLSIPSQLSGQDR 608

Query:   593 SQKQES 598
             S  Q++
Sbjct:   609 SNDQDA 614


GO:0005634 "nucleus" evidence=ISM
TAIR|locus:2099242 AT3G63070 "AT3G63070" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2178292 HUA2 "ENHANCER OF AG-4 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2181569 AT5G08230 "AT5G08230" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q9Y3E1 HDGFRP3 "Hepatoma-derived growth factor-related protein 3" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
RGD|628773 Hdgfrp3 "hepatoma-derived growth factor, related protein 3" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
MGI|MGI:1352760 Hdgfrp3 "hepatoma-derived growth factor, related protein 3" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
CGD|CAL0003874 PGA55 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q59SG9 PGA55 "Flocculin-like protein" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-050213-2 hdgfrp3 "hepatoma-derived growth factor, related protein 3" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
F4IZM8HUAL2_ARATHNo assigned EC number0.40330.92360.9524yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1389
cd0583483 cd05834, HDGF_related, The PWWP domain is an essen 8e-31
smart00582124 smart00582, RPR, domain present in proteins, which 9e-28
cd0516287 cd05162, PWWP, The PWWP domain, named for a conser 1e-13
pfam0085574 pfam00855, PWWP, PWWP domain 1e-09
cd0583686 cd05836, N_Pac_NP60, The PWWP domain is an essenti 8e-09
smart0029363 smart00293, PWWP, domain with conserved PWWP motif 3e-08
cd0584093 cd05840, SPBC215_ISWI_like, The PWWP domain is a c 2e-07
cd0583587 cd05835, Dnmt3b_related, The PWWP domain is an ess 1e-06
cd0584183 cd05841, BS69_related, The PWWP domain is part of 7e-04
cd0608080 cd06080, MUM1_like, Mutated melanoma-associated an 0.001
pfam09770 804 pfam09770, PAT1, Topoisomerase II-associated prote 0.002
>gnl|CDD|99895 cd05834, HDGF_related, The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF Back     alignment and domain information
 Score =  116 bits (292), Expect = 8e-31
 Identities = 42/85 (49%), Positives = 52/85 (61%), Gaps = 5/85 (5%)

Query: 20  RQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFT 79
           +Q+K GDLV AKVKG+PAWPA V EPE W      KK  V+FFGT + AF  P D+  +T
Sbjct: 1   KQFKAGDLVFAKVKGYPAWPARVDEPEDWKPPG--KKYPVYFFGTHETAFLKPEDLFPYT 58

Query: 80  EEKKQSLLTKRQGRGADFVRAVQEI 104
           E KK+    K++     F  AV EI
Sbjct: 59  ENKKKFGKPKKR---KGFNEAVWEI 80


This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. Length = 83

>gnl|CDD|214731 smart00582, RPR, domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>gnl|CDD|99894 cd05162, PWWP, The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids Back     alignment and domain information
>gnl|CDD|216154 pfam00855, PWWP, PWWP domain Back     alignment and domain information
>gnl|CDD|99897 cd05836, N_Pac_NP60, The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha Back     alignment and domain information
>gnl|CDD|214603 smart00293, PWWP, domain with conserved PWWP motif Back     alignment and domain information
>gnl|CDD|99901 cd05840, SPBC215_ISWI_like, The PWWP domain is a component of the S Back     alignment and domain information
>gnl|CDD|99896 cd05835, Dnmt3b_related, The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis Back     alignment and domain information
>gnl|CDD|99902 cd05841, BS69_related, The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions Back     alignment and domain information
>gnl|CDD|99903 cd06080, MUM1_like, Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA) Back     alignment and domain information
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1 Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 1389
KOG1904496 consensus Transcription coactivator [Transcription 99.9
cd0583483 HDGF_related The PWWP domain is an essential part 99.87
cd0583686 N_Pac_NP60 The PWWP domain is an essential part of 99.87
PF12243139 CTK3: CTD kinase subunit gamma CTK3 99.87
cd0584093 SPBC215_ISWI_like The PWWP domain is a component o 99.85
cd0583587 Dnmt3b_related The PWWP domain is an essential com 99.85
cd0583895 WHSC1_related The PWWP domain was first identified 99.83
cd0516287 PWWP The PWWP domain, named for a conserved Pro-Tr 99.83
PF0085586 PWWP: PWWP domain; InterPro: IPR000313 Upon charac 99.82
cd0584183 BS69_related The PWWP domain is part of BS69 prote 99.81
cd05837110 MSH6_like The PWWP domain is present in MSH6, a mi 99.72
smart0029363 PWWP domain with conserved PWWP motif. conservatio 99.71
smart00582121 RPR domain present in proteins, which are involved 99.66
cd0608080 MUM1_like Mutated melanoma-associated antigen 1 (M 99.62
cd05839111 BR140_related The PWWP domain is found in the BR14 99.57
KOG0151877 consensus Predicted splicing regulator, contains R 99.52
cd03562114 CID CID (CTD-Interacting Domain) domain family; CI 99.01
KOG1924 1102 consensus RhoA GTPase effector DIA/Diaphanous [Sig 98.91
KOG0132 894 consensus RNA polymerase II C-terminal domain-bind 98.7
PF0481864 CTD_bind: RNA polymerase II-binding domain.; Inter 98.53
KOG2669325 consensus Regulator of nuclear mRNA [RNA processin 97.96
KOG1081463 consensus Transcription factor NSD1 and related SE 96.92
PHA03247 3151 large tegument protein UL36; Provisional 96.43
PHA03247 3151 large tegument protein UL36; Provisional 96.13
KOG1830518 consensus Wiskott Aldrich syndrome proteins [Cytos 96.08
KOG3671569 consensus Actin regulatory protein (Wiskott-Aldric 95.66
KOG0132894 consensus RNA polymerase II C-terminal domain-bind 95.41
KOG4368 757 consensus Predicted RNA binding protein, contains 95.22
KOG3671569 consensus Actin regulatory protein (Wiskott-Aldric 94.06
KOG4849498 consensus mRNA cleavage factor I subunit/CPSF subu 92.58
KOG1923 830 consensus Rac1 GTPase effector FRL [Signal transdu 89.93
smart0033357 TUDOR Tudor domain. Domain of unknown function pre 89.81
KOG1830518 consensus Wiskott Aldrich syndrome proteins [Cytos 89.51
COG5178 2365 PRP8 U5 snRNP spliceosome subunit [RNA processing 88.51
smart0074361 Agenet Tudor-like domain present in plant sequence 88.15
KOG1923 830 consensus Rac1 GTPase effector FRL [Signal transdu 87.79
KOG0119554 consensus Splicing factor 1/branch point binding p 87.12
KOG4849498 consensus mRNA cleavage factor I subunit/CPSF subu 85.79
KOG4672487 consensus Uncharacterized conserved low complexity 85.06
PRK15319 2039 AIDA autotransporter-like protein ShdA; Provisiona 84.87
KOG4672487 consensus Uncharacterized conserved low complexity 84.52
cd03561133 VHS VHS domain family; The VHS domain is present i 83.36
KOG2071579 consensus mRNA cleavage and polyadenylation factor 83.24
PF1171755 Tudor-knot: RNA binding activity-knot of a chromod 81.28
>KOG1904 consensus Transcription coactivator [Transcription] Back     alignment and domain information
Probab=99.90  E-value=2.7e-23  Score=240.67  Aligned_cols=235  Identities=34%  Similarity=0.463  Sum_probs=166.8

Q ss_pred             cCCCccCCEEEEccCCCCCCCeeeeCCCCCCCCCCCCEEEEEEecCCCeeeeccCccccccHHHHHHhhhhcCCCcHHHH
Q 000616           19 RRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADFV   98 (1389)
Q Consensus        19 ~~kFk~GDLVWAKVKGYPwWPArI~dPe~~~~K~~~nkylV~FFGT~EyAWV~~kdLkPFsE~k~esfl~KrkgK~KdF~   98 (1389)
                      ...|++||||||||||||.|||+|.++..+..++..++|.||||||+++|||.+++|+||..++. .+..-.+...+.|+
T Consensus        10 ~~~~~~GDLV~AKlkgyp~WParI~~~~~~~~kp~pkky~V~FfGT~e~Afl~p~dlqpy~~~k~-~~g~~~k~~~k~F~   88 (496)
T KOG1904|consen   10 AGNFKCGDLVFAKLKGYPPWPARIRNGPDGAVKPPPKKYTVFFFGTKETAFLKPKDLQPYMLNKE-KLGKPNKRVWKGFI   88 (496)
T ss_pred             cCCCCCCceeeecccCCCCCcccccCcccccccCCCceeEEEEeccCcccccchhhccchhhhhh-hcccchhhhhHHHH
Confidence            46899999999999999999999999999988999999999999999999999999999999876 34422211349999


Q ss_pred             HHHHHHHHHHHHHhhhcccccCCCcccccccCCCCCCccccccccccchhhccccCCCCCCCCCCCCCCCCCCCCccccc
Q 000616           99 RAVQEIIDSYEKLKKQDQVDSNSGDELTVANGGNSVNSISHLKDRTEASEATLDSQMKPSNSTAGDGLNLPTEDSPAGRQ  178 (1389)
Q Consensus        99 eAmeEAeE~~E~LK~qeq~~~~~a~e~~~a~v~~~~~s~~~~~~~t~~~~~~~~~~m~~~~~~~~~~~~~~~~~~~a~~~  178 (1389)
                      +|+++|++++     +... .++...+.         +.                        +..+.-+..        
T Consensus        89 ~av~eI~~a~-----~np~-~~~~~~~~---------~~------------------------~~~~~~~~~--------  121 (496)
T KOG1904|consen   89 EAVEEIREAF-----NNPK-SESDGIFF---------SA------------------------TATGRLLGP--------  121 (496)
T ss_pred             HHHHHHHHHh-----cCCC-ccccCccc---------cc------------------------ccccccccc--------
Confidence            9999999988     1100 11100000         00                        000000000        


Q ss_pred             ccCCCCCCCCCCCCCcCcccccCCCccccccccccCCCccccccccCCccccccCcccchhhhhccccccccccCCCCCC
Q 000616          179 LDALPAKEPLPEQPSENLVAKATPVLTTYSSRKRSGGSRLQSTQRMAPSTRRSRSSTMVESCRLQNLMMPYNNEGKNAEG  258 (1389)
Q Consensus       179 ~~~~~~~~~~~~~p~en~~~~~~~~~~t~S~~kr~~~~~~~~~~~r~~s~rrsrSs~~~~s~r~q~~~~p~~d~gk~~g~  258 (1389)
                                             +++                    .           +.+.       ..+|.|++-  
T Consensus       122 -----------------------~~~--------------------~-----------~~~~-------~~~~~~~~~--  138 (496)
T KOG1904|consen  122 -----------------------ENL--------------------F-----------VASS-------VLNDTGNSE--  138 (496)
T ss_pred             -----------------------ccc--------------------c-----------cccc-------ccccccccc--
Confidence                                   010                    0           0000       011233111  


Q ss_pred             CcccccccCceeccccccCCCCCCCCCccCccccccCCccCCCCCceeeeecCccccCCCCccCCCCCcccccchhhccc
Q 000616          259 ISAKSILDGSLIRNKRTRKSPDGSECNDLDSSALMSNGSIEDNSSEIVTVESDAFSLNEGSTVDSGCKVEDSETVLECLD  338 (1389)
Q Consensus       259 ~s~~~~~d~~~~~~k~~~~S~~~s~~ddv~Ss~~~~ngs~edn~~ei~t~~sd~~slNegs~vds~~~~E~Se~~~~~~e  338 (1389)
                                                        ...|..+.+.++|.+.++++. -+++..+|+..+.|.+...=+...
T Consensus       139 ----------------------------------~~~~~~~~~~s~~as~e~~~~-~~~~~~~~~~~~~d~~~~~d~~~~  183 (496)
T KOG1904|consen  139 ----------------------------------GEPGADENNASEIATPESDNN-RPEGNGEDSSSKVDYSDSTDHFEL  183 (496)
T ss_pred             ----------------------------------ccccccccccccccccccccc-CCccCccccccccccccccccccc
Confidence                                              223334444555666655554 678889999999998777666778


Q ss_pred             cccccccccccceeeeeeccccCCCCcccccCCCCCccccCccccccccccccccCCcccCCCCCCCCCccCCCCCcc
Q 000616          339 GDEMLSKRLDFQIKAVVVKKKRKPNRKRVCNDAVDPPARINTATEVDVSTRNTCHSSENTGGNLDERDFKEDGDEHLP  416 (1389)
Q Consensus       339 ~e~~l~~~~d~~~k~vv~kkKRKpnrk~~~nd~~~p~~~~~~~~~l~~~~~~~~~~s~N~~~ns~e~~~k~DGDEHLP  416 (1389)
                      |..+|++++.++|.+.+.++|||..|||.|.+             +...    ++.++|-..-+.++.++++|++|+|
T Consensus       184 ~~~el~~~~~~~~~~~~~r~~~kg~rkr~ts~-------------~~~~----~~~~~a~~~k~t~~~~~~~~s~krP  244 (496)
T KOG1904|consen  184 GGDELNKGKRKPIDTMVKRKKRKGTRKRKTSD-------------AEPK----VQRSQASHEKLTERPCESNGSEKRP  244 (496)
T ss_pred             cchhhccccccccccccccccCcccccccccc-------------cccc----ccccccchhhccCCCcccccccccc
Confidence            89999999999999999999999999999888             1223    4477777888999999999999999



>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF Back     alignment and domain information
>cd05836 N_Pac_NP60 The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha Back     alignment and domain information
>PF12243 CTK3: CTD kinase subunit gamma CTK3 Back     alignment and domain information
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S Back     alignment and domain information
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis Back     alignment and domain information
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS) Back     alignment and domain information
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids Back     alignment and domain information
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified [] Back     alignment and domain information
>cd05841 BS69_related The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions Back     alignment and domain information
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS Back     alignment and domain information
>smart00293 PWWP domain with conserved PWWP motif Back     alignment and domain information
>smart00582 RPR domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>cd06080 MUM1_like Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA) Back     alignment and domain information
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2 Back     alignment and domain information
>KOG0151 consensus Predicted splicing regulator, contains RRM, SWAP and RPR domains [General function prediction only] Back     alignment and domain information
>cd03562 CID CID (CTD-Interacting Domain) domain family; CID is present in several RNA-processing factors such as Pcf11 and Nrd1 Back     alignment and domain information
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG0132 consensus RNA polymerase II C-terminal domain-binding protein RA4, contains RPR and RRM domains [RNA processing and modification; Transcription] Back     alignment and domain information
>PF04818 CTD_bind: RNA polymerase II-binding domain Back     alignment and domain information
>KOG2669 consensus Regulator of nuclear mRNA [RNA processing and modification] Back     alignment and domain information
>KOG1081 consensus Transcription factor NSD1 and related SET domain proteins [Transcription] Back     alignment and domain information
>PHA03247 large tegument protein UL36; Provisional Back     alignment and domain information
>PHA03247 large tegument protein UL36; Provisional Back     alignment and domain information
>KOG1830 consensus Wiskott Aldrich syndrome proteins [Cytoskeleton] Back     alignment and domain information
>KOG3671 consensus Actin regulatory protein (Wiskott-Aldrich syndrome protein) [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG0132 consensus RNA polymerase II C-terminal domain-binding protein RA4, contains RPR and RRM domains [RNA processing and modification; Transcription] Back     alignment and domain information
>KOG4368 consensus Predicted RNA binding protein, contains SWAP, RPR and G-patch domains [General function prediction only] Back     alignment and domain information
>KOG3671 consensus Actin regulatory protein (Wiskott-Aldrich syndrome protein) [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification] Back     alignment and domain information
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>smart00333 TUDOR Tudor domain Back     alignment and domain information
>KOG1830 consensus Wiskott Aldrich syndrome proteins [Cytoskeleton] Back     alignment and domain information
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification] Back     alignment and domain information
>smart00743 Agenet Tudor-like domain present in plant sequences Back     alignment and domain information
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton] Back     alignment and domain information
>KOG0119 consensus Splicing factor 1/branch point binding protein (RRM superfamily) [RNA processing and modification] Back     alignment and domain information
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification] Back     alignment and domain information
>KOG4672 consensus Uncharacterized conserved low complexity protein [Function unknown] Back     alignment and domain information
>PRK15319 AIDA autotransporter-like protein ShdA; Provisional Back     alignment and domain information
>KOG4672 consensus Uncharacterized conserved low complexity protein [Function unknown] Back     alignment and domain information
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule) Back     alignment and domain information
>KOG2071 consensus mRNA cleavage and polyadenylation factor I/II complex, subunit Pcf11 [RNA processing and modification] Back     alignment and domain information
>PF11717 Tudor-knot: RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1389
4fu6_A153 Crystal Structure Of The Psip1 Pwwp Domain Length = 1e-08
2nlu_A100 Domain-Swapped Dimer Of The Pwwp Module Of Human He 1e-08
1ri0_A110 Nmr Structure Of The N-Terminal Hath Domain Of Huma 1e-08
2b8a_A110 High Resolution Structure Of The Hdgf Pwwp Domain L 2e-08
2m16_A97 P75/ledgf Pwwp Domain Length = 97 3e-08
1n27_A96 Solution Structure Of The Pwwp Domain Of Mouse Hepa 7e-08
3qby_A94 Crystal Structure Of The Pwwp Domain Of Human Hepat 3e-06
3eae_A93 Pwwp Domain Of Human Hepatoma-Derived Growth Factor 3e-06
>pdb|4FU6|A Chain A, Crystal Structure Of The Psip1 Pwwp Domain Length = 153 Back     alignment and structure

Iteration: 1

Score = 59.7 bits (143), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 1/72 (1%) Query: 22 WKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEE 81 +K GDL+ AK+KG+P WPA V E K+ +FFFGT + AF P D+ ++E Sbjct: 23 FKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFPYSEN 82 Query: 82 K-KQSLLTKRQG 92 K K KR+G Sbjct: 83 KEKYGKPNKRKG 94
>pdb|2NLU|A Chain A, Domain-Swapped Dimer Of The Pwwp Module Of Human Hepatoma- Derived Growth Factor Length = 100 Back     alignment and structure
>pdb|1RI0|A Chain A, Nmr Structure Of The N-Terminal Hath Domain Of Human Hdgf Length = 110 Back     alignment and structure
>pdb|2B8A|A Chain A, High Resolution Structure Of The Hdgf Pwwp Domain Length = 110 Back     alignment and structure
>pdb|2M16|A Chain A, P75/ledgf Pwwp Domain Length = 97 Back     alignment and structure
>pdb|1N27|A Chain A, Solution Structure Of The Pwwp Domain Of Mouse Hepatoma- Derived Growth Factor, Related Protein 3 Length = 96 Back     alignment and structure
>pdb|3QBY|A Chain A, Crystal Structure Of The Pwwp Domain Of Human Hepatoma-Derived Growth Factor 2 Length = 94 Back     alignment and structure
>pdb|3EAE|A Chain A, Pwwp Domain Of Human Hepatoma-Derived Growth Factor 2 (Hdgf2) Length = 93 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1389
2l89_A108 PWWP domain-containing protein 1; histone binding, 3e-24
2daq_A110 WHSC1L1 protein, isoform long; PWWP domain, struct 3e-23
1h3z_A109 Hypothetical 62.8 kDa protein C215.07C; nuclear pr 2e-22
3qby_A94 Hepatoma-derived growth factor-related protein 2; 2e-21
1ri0_A110 Hepatoma-derived growth factor; HDGF, HATH domain, 2e-21
4flb_A132 Regulation of nuclear PRE-mRNA domain-containing; 1e-18
1khc_A147 DNA cytosine-5 methyltransferase 3B2; five beta-sh 2e-18
4fld_A135 Regulation of nuclear PRE-mRNA domain-containing 1 2e-17
3d9j_A145 RNA-binding protein 16; SCAF8, RNA polymerase II C 4e-17
3llr_A154 DNA (cytosine-5)-methyltransferase 3A; DNA methylt 6e-17
2gfu_A134 DNA mismatch repair protein MSH6; PWWP domain, tud 1e-15
2km4_A142 Regulator of TY1 transposition protein 103; CTD-in 4e-14
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-13
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-08
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-05
1vt4_I1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-05
3l42_A130 Peregrin; transcription regulation, histone H3 ace 1e-04
3pfs_A158 Bromodomain and PHD finger-containing protein 3; s 7e-04
>2l89_A PWWP domain-containing protein 1; histone binding, protein binding; NMR {Schizosaccharomyces pombe} Length = 108 Back     alignment and structure
 Score = 97.8 bits (243), Expect = 3e-24
 Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 6/108 (5%)

Query: 17  AARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKK-----VLVFFFGTQQIAFCN 71
           +A  +   GD +L K  G+P WPA +   ++   S +          V FF     A+  
Sbjct: 1   SADDRLNFGDRILVKAPGYPWWPALLLRRKETKDSLNTNSSFNVLYKVLFFPDFNFAWVK 60

Query: 72  PADVEAFTEEKKQSLLTKRQGRGADFVRAVQEIIDSYEKLKKQDQVDS 119
              V+   + +    L   + +  + + A +        LK++   D 
Sbjct: 61  RNSVKPLLDSEIAKFLGSSKRKSKELIEAYEASKT-PPDLKEESSTDL 107


>2daq_A WHSC1L1 protein, isoform long; PWWP domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: b.34.9.2 Length = 110 Back     alignment and structure
>3qby_A Hepatoma-derived growth factor-related protein 2; HDGF2, structural genomics consortium, SGC, protein binding; HET: M3L; 1.95A {Homo sapiens} PDB: 3qj6_A* 3eae_A 1n27_A Length = 94 Back     alignment and structure
>1ri0_A Hepatoma-derived growth factor; HDGF, HATH domain, PWWP domain, heparin-binding, hormone/growth factor complex; NMR {Homo sapiens} SCOP: b.34.9.2 PDB: 2b8a_A 2nlu_A Length = 110 Back     alignment and structure
>4flb_A Regulation of nuclear PRE-mRNA domain-containing; structural genomics consortium, SGC, protein binding; 1.80A {Homo sapiens} Length = 132 Back     alignment and structure
>1khc_A DNA cytosine-5 methyltransferase 3B2; five beta-sheets barrel followed by five-helix bundle; HET: DNA; 1.80A {Mus musculus} SCOP: b.34.9.2 PDB: 3flg_A* 3qkj_A* Length = 147 Back     alignment and structure
>3d9j_A RNA-binding protein 16; SCAF8, RNA polymerase II CTD interacting domain, arm repeats phospho-CTD, phosphoprotein, transcription; 1.60A {Homo sapiens} PDB: 3d9k_A* 3d9l_A* 3d9m_A* 3d9n_A* 3d9p_A* 3d9i_A 3d9o_A* 2diw_A Length = 145 Back     alignment and structure
>3llr_A DNA (cytosine-5)-methyltransferase 3A; DNA methyltransferase, methylysine binding, STR genomics consortium, SGC, alternative promoter usage; HET: DNA BTB; 2.30A {Homo sapiens} Length = 154 Back     alignment and structure
>2gfu_A DNA mismatch repair protein MSH6; PWWP domain, tudor domain, DNA binding, DNA binding protein; HET: DNA; NMR {Homo sapiens} Length = 134 Back     alignment and structure
>2km4_A Regulator of TY1 transposition protein 103; CTD-interacting domain, RNA polymerase II binding protein, phosphoprotein; NMR {Saccharomyces cerevisiae} PDB: 2l0i_A* Length = 142 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>3l42_A Peregrin; transcription regulation, histone H3 acetylation, chromatin modification, structural genomics, structural genomics CONS SGC, activator; 1.30A {Homo sapiens} PDB: 3mo8_A* 2x4w_A* 2x35_A* 2x4x_A* 2x4y_A* Length = 130 Back     alignment and structure
>3pfs_A Bromodomain and PHD finger-containing protein 3; structural genomics, structural genomics consortium, SGC, PW domain, protein binding; 1.90A {Homo sapiens} PDB: 3lyi_A* Length = 158 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 1389
d2nlua1100 b.34.9.2 (A:1-100) Hepatoma-derived growth factor, 2e-21
d2daqa197 b.34.9.2 (A:8-104) Histone-lysine N-methyltransfer 8e-20
d1h3za_108 b.34.9.2 (A:) Hypothetical protein SPBC215.07c {Fi 1e-18
d1khca_137 b.34.9.2 (A:) DNA methyltransferase DNMT3B {Mouse 4e-15
d1szaa_144 a.118.9.4 (A:) PCF11 protein {Baker's yeast (Sacch 1e-08
>d2nlua1 b.34.9.2 (A:1-100) Hepatoma-derived growth factor, HDGF {Human (Homo sapiens) [TaxId: 9606]} Length = 100 Back     information, alignment and structure

class: All beta proteins
fold: SH3-like barrel
superfamily: Tudor/PWWP/MBT
family: PWWP domain
domain: Hepatoma-derived growth factor, HDGF
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 88.1 bits (218), Expect = 2e-21
 Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 19  RRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAF 78
           ++++K GDLV AK+KG+P WPA + E  +    +   K  VFFFGT + AF  P D+  +
Sbjct: 7   QKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGPKDLFPY 66

Query: 79  TEEKKQSLLTKRQGRGADFVRAVQEI 104
            E K++     +  +   F   + EI
Sbjct: 67  EESKEKFG---KPNKRKGFSEGLWEI 89


>d2daqa1 b.34.9.2 (A:8-104) Histone-lysine N-methyltransferase NSD3 {Human (Homo sapiens) [TaxId: 9606]} Length = 97 Back     information, alignment and structure
>d1h3za_ b.34.9.2 (A:) Hypothetical protein SPBC215.07c {Fission yeast (Schizosaccharomyces pombe) [TaxId: 4896]} Length = 108 Back     information, alignment and structure
>d1khca_ b.34.9.2 (A:) DNA methyltransferase DNMT3B {Mouse (Mus musculus) [TaxId: 10090]} Length = 137 Back     information, alignment and structure
>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 144 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query1389
d2nlua1100 Hepatoma-derived growth factor, HDGF {Human (Homo 99.88
d1h3za_108 Hypothetical protein SPBC215.07c {Fission yeast (S 99.87
d2daqa197 Histone-lysine N-methyltransferase NSD3 {Human (Ho 99.87
d1khca_137 DNA methyltransferase DNMT3B {Mouse (Mus musculus) 99.85
d1szaa_144 PCF11 protein {Baker's yeast (Saccharomyces cerevi 99.57
d2f5ka183 Mortality factor 4-like protein 1, MRG15 {Human (H 89.13
d1mhna_59 Survival motor neuron protein 1, smn {Human (Homo 86.5
d2d9ta160 Tudor domain-containing protein 3, TDRD3 {Mouse (M 82.88
>d2nlua1 b.34.9.2 (A:1-100) Hepatoma-derived growth factor, HDGF {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All beta proteins
fold: SH3-like barrel
superfamily: Tudor/PWWP/MBT
family: PWWP domain
domain: Hepatoma-derived growth factor, HDGF
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.88  E-value=1.5e-23  Score=167.07  Aligned_cols=86  Identities=35%  Similarity=0.643  Sum_probs=74.7

Q ss_pred             HCCCCCCCCEEEECCCCCCCCCEEEECCCCCCCCCCCCEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHHHCCCCCHHH
Q ss_conf             21898659799880589988970662889878888897799999368976542268643541889987653147990779
Q 000616           18 ARRQWKVGDLVLAKVKGFPAWPATVSEPEKWGYSADWKKVLVFFFGTQQIAFCNPADVEAFTEEKKQSLLTKRQGRGADF   97 (1389)
Q Consensus        18 a~~~FkvGDLVWAKLKGYPwWPArI~dPe~~~~K~~~kkYlV~FFGT~EyAWV~~kdLkPFsE~kkesfl~KrkgK~KdF   97 (1389)
                      ....|++||||||||+|||||||+|+++.....+...++|+|+|||+++|+||..++|++|++++. .+..+  .|.+.|
T Consensus         6 ~~~~f~~GdlVWaK~~gypwWPa~V~~~~~~~~~~~~~~~~V~Ffg~~~~~wv~~~~l~~f~e~~~-~~~~~--~k~~~f   82 (100)
T d2nlua1           6 RQKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGPKDLFPYEESKE-KFGKP--NKRKGF   82 (100)
T ss_dssp             CCCCCSTTCEEEEEETTEEEEEEEEECCTTSSCSCCCSCEEEEETTTTEEEEECGGGEECHHHHHH-HHSCC--SCCHHH
T ss_pred             CCCCCCCCCEEEEECCCCCCCCEEEECCHHHCCCCCCCEEEEEECCCCCEEEECHHHCCCHHHHHH-HHHCC--CCHHHH
T ss_conf             777587688999966897999879836023034677787999986899879983888547034378-87033--334999


Q ss_pred             HHHHHHHHH
Q ss_conf             999999999
Q 000616           98 VRAVQEIID  106 (1389)
Q Consensus        98 ~eALeEAee  106 (1389)
                      ++||+||++
T Consensus        83 ~~Av~e~e~   91 (100)
T d2nlua1          83 SEGLWEIEN   91 (100)
T ss_dssp             HHHHHHHHT
T ss_pred             HHHHHHHHH
T ss_conf             999999885



>d1h3za_ b.34.9.2 (A:) Hypothetical protein SPBC215.07c {Fission yeast (Schizosaccharomyces pombe) [TaxId: 4896]} Back     information, alignment and structure
>d2daqa1 b.34.9.2 (A:8-104) Histone-lysine N-methyltransferase NSD3 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1khca_ b.34.9.2 (A:) DNA methyltransferase DNMT3B {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d2f5ka1 b.34.13.3 (A:6-88) Mortality factor 4-like protein 1, MRG15 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1mhna_ b.34.9.1 (A:) Survival motor neuron protein 1, smn {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2d9ta1 b.34.9.1 (A:8-67) Tudor domain-containing protein 3, TDRD3 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure