Citrus Sinensis ID: 000465


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-------580-------590-------600-------610-------620-------630-------640-------650-------660-------670-------680-------690-------700-------710-------720-------730-------740-------750-------760-------770-------780-------790-------800-------810-------820-------830-------840-------850-------860-------870-------880-------890-------900-------910-------920-------930-------940-------950-------960-------970-------980-------990------1000------1010------1020------1030------1040------1050------1060------1070------1080------1090------1100------1110------1120------1130------1140------1150------1160------1170------1180------1190------1200------1210------1220------1230------1240------1250------1260------1270------1280------1290------1300------1310------1320------1330------1340------1350------1360------1370------1380------1390------1400------1410------1420------1430------1440------1450------1460------1470------148
MSKSNPRPNLSQNPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV
cccccccccccccccHHHHHHcHHHHHHHHHHHHHcHHHccccccccccccccccHHHHHHHHHcccHHHHHHccccccHHHHHHHHHHHHHHHHccccEEEEccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHcHHHHHHHHHHHcccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHcccccHHHccccHHHHHHHHHHHHcccccHHHHHHHHHHHcccccccEEEcccccccccccccccccEEEEEccccccccccccccccccccHHHHHHcccccHHHHHHHHHHccccccccccccEEccccccccHHHHHHHHHHccEEEEEEcccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccEEEEccccccccccccccccccccccHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccEEEcccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHcccHHHccHHHHHHHHHHHHHHHHHccccEEEEEcccccccccccccccEEEEcccccccccccccccccccccHHHHHHHHHHHHHHHcccccccccEEEEcccccEEEEEccccccccccccccccccHHHHHHcccccccccccccccccccccccccccccccccccccEEEEEEccccccccHHHHHHHHHHHHcccccHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHcccccccccccc
ccccccccccccccHHHcccccHHHHHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHcccHHHHHHHHHHccHHHHHHHHHHHHHHHHcccccEEEEcccccccccccccHHEEccccHHHHHHHccHHHHHHHHHHHHccccccccccccccccccHcHHHHHHHHHHcHHHHHHHHHHHccccccccccccccccccHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHccccHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccEEEEEcccccccEEccccccccEEEccccccHHcccccccccccccHHHHHHHccHHHHHHHHHHHcccccccccHcEEEEcccccHHHHHHHHHHHHHccEEEEEHHHcEEEHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccEEcccccccccccccccccccccccccccccccccccccccccEccccccEEccccccHccccccccccccccccEEEEEccccccccccccccccccccEcccccccccccccccccccccEccccccccccccccccccccccccccccccccccccccccccccccEEcccccccccccccccEEEEEccccEEEEEcccccEEEEcccccccccccccccccccccccccccccccHcccEcccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccEcccccccccccccccccccccccEEEcccccccEEEccEHHHcccccEEEEccccccccccccccccccccccHHHHHccHHHHHHHHcccccccccccccccccccccccccccccccccEEEEEEcccccccEccccccccccccccccEEEcccccccEEEEccccccccccccEEEccccccccHHHHcccccccccccccccccccccccccccEEEEccccccccccccccHcHHcccccEEEcccccccccccccEccccccccccEEEEccccEEccccccccccccccccccccHHHHHHHEEHHHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHccccEEEEEEcccccccccccccEEEEEcccccccccccccccccccHHHHHHHHHHHHcccccccccccccEEEEccccccEEEEEEEccccccccccccccccccccccccccccccEEEEEEEEEccccccEEEEccccccEEEEEEEEEccccccccHHHHHHHHHHHHHHccccHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHccccccccccc
msksnprpnlsqnplaeyfTNSLMAQNQLIDSLTSHISlyhshslssnpnpssnprssILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLranghgffiilpdlpsrdppylpglcykksrgllSRVAESNESGRWVFESTRlfssregekieewscpvncldtfTVSVEFVENVDRFIDIMDEisnggflrgeeselagdwveFDWLKAKGYYSIEAFIVNRLEVGLRLAWLncnngkkrGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAgmkqssrfyhskslqRTISTLSVDVEcglaispaslsgipaSLATVFSGLFVLQDITTMVLSsqhneydieKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDctklelfgegnfksspnkskekpstigrrkkcracstkrqnplpksaldelsldklpkdpegaltdtekvdlmgsdkvpgisngkdinretstsEMEMVVCHQEHARALVAgkgrtnarktktvknknknctynnpvpvkdpkvsvletsssislqdevekYDKLSAqnvsvdnstcsnvlasnqssctsasvparegiatqstqedcvvnsvnsecrrfsngridnqtqhflqettdskvecniispdmpardldnafgnsisginfqnsfhesetgaisvlpdkgiEALEIKKesavtqdqrneSFFGTALKsslecpsyewptiapvyfpsisshllpatdrlhldvghnwhnhvrqpfvptlhqarnhpfdggcnqilsqplpmsldwppmvqnvsgiapsvtcnydsgfissrqsgfqqnfatkgmqfnaktsddegkcsgdfmdlpeptttqeqgdecdshwlseeelevhtvsgidynqyfgggvmywntsdhpgtgfsrppslssddsswawHEADIKRAVDDMVAFSssystngltsptaasfcspfdplgpghqafsyvvpgnevpgkvlhssstttdvateeeisgsfaslsgdvdskaldtlpcpilrpiiipnlsrersrsdfkrshehkspcvppsrreqprikrppspvvlcvpraprppppspvsdsrktrgfptvrsgsssprhwgvrgwyhegttseegcvrmdgsevvwpswrnknlsahpmiqplsGALLQDHLIAISQLardqehpdvafplqplevqncptrkASLSLMHSLLHEEIDSFCKQVAAEntarkpyiNWAVKRVTRSLQvlwprsrtnifgsnatglslpssdvdlvvclppvrnlepikeagilegrngiKETCLQHAARYLANQewvksdslktventaIPIIMLVVEVPHDLIASAassvqspkedaahttlkhdnhvhsdmvalddsaspkcshtssdnikaatsvrldisfkspshtglqTTDLVKELteqfpastpLALVLKQFLAdrsldqsysgglSSYCLMLLITRFLQhehhlgrpinqv
msksnprpnlsqnPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVaesnesgrwvfesTRLFssregekieewscpvnCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLElfgegnfksspnkskekpstigrrkkcracstkrqnplpksaldelsldklpkdpegaltdtekvdlmgsdkvpgisngkdiNRETSTSEMEMVVCHQEHARalvagkgrtnarktktvknknknctynnpvpvkdpkvSVLETSSsislqdevekYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVPAREgiatqstqedcvvnSVNSECRRfsngridnqtqhflqettdskVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKesavtqdqrneSFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGsfaslsgdvdskaldtlpcpilrpiiipnlsrersrsdfkrshehkspcvppsrreqprikrppspvvlcvpraprppppspvsdsrktrgfptvrsgsssprhwgvrgwyhegttseegcvRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTarkpyinwavKRVTRSLQVLWPRSRTNIFGsnatglslpssdVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANqewvksdslktVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQhehhlgrpinqv
MSKSNPRPNLSQNPLAEYFTNSLMAQNQLIDsltshislyhshslssnpnpssnprssILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFksspnkskekpsTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRtnarktktvknknknctynnPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFsrppslssddssWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTlpcpilrpiiipnlSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSpvvlcvpraprppppspvsdsrKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKAslslmhsllhEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV
*****************YFTNSLMAQNQLIDSLTSHISLY******************ILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAE***SGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGE******************************************************************************************VC***********************************************************************************************************VV******************************VECNII**********NAFGNSISGINFQNSF******AISVL****I*******************FFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFIS**************************************************HWLSEEELEVHTVSGIDYNQYFGGGVMYWNT********************WAWHEADIKRAVDDMVAFSSSY*********AASFCSPFDPLGPGHQAFSYVVPG*************************************ALDTLPCPILRPIIIP***************************************************************************HWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIA**********************************************************************DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH********
*****************************IDSLTS**************************WFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDL************YKKS****************VFESTRLFS*********WSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESE*AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNG***********NAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGM****************STLSVDVECGLAISP**************SGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELF**************************************************************************************************************************************************************************************************************************************************************************************************************************YEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTL*************QILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQS*****FA**GMQFNAKTSDDEGKCSGDFMDLPEPTTT*EQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHP**********************DIKRAVDDM********************************************************************************TLPCPILRPII***********************************************************************************WYHEGTTSEEGCVR**********************************************************************MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLE**********RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH***********
**********SQNPLAEYFTNSLMAQNQLIDSLTSHISLYH**************RSSILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFK****************************PLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLAS***************IATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEP***********SHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPG***************WAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLH************EISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSR*****************************RPPSPVVLCVPRAP***************************RHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAA**************LKHDNHVHSDMVAL*************DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV
********************NSLMAQNQLIDSLTSHISLYHSH*************SSILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVFESTRLFS*******EEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGE************************************************************************************SEMEMVVCHQEHARALVA*********************************SVLETSSS*SL**EVE*YDKLSAQNVSVDNSTCSNV*ASNQSSCT***********************************************************************NSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRP*SLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGK****************ISGSFASLSGDVDSKALDTLPCPILRPIIIPNL**************************QPRIKRPPSPVVLCVPRAPRPP********************SSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGR*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSKSNPRPNLSQNPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query1478 2.2.26 [Sep-21-2011]
Q8NDF8 572 PAP-associated domain-con yes no 0.115 0.298 0.258 2e-11
Q68ED3 633 PAP-associated domain-con yes no 0.115 0.270 0.258 3e-11
G5EFL0 845 Poly(A) RNA polymerase gl yes no 0.046 0.081 0.436 2e-09
Q9UTN3 684 Poly(A) RNA polymerase ci yes no 0.046 0.100 0.422 1e-08
Q5XG87 542 DNA polymerase sigma OS=H no no 0.064 0.177 0.339 1e-07
Q6PB75 542 DNA polymerase sigma OS=M no no 0.064 0.177 0.339 1e-07
P53632 584 Poly(A) RNA polymerase pr yes no 0.106 0.270 0.235 2e-06
P48561 642 Poly(A) RNA polymerase pr no no 0.104 0.241 0.234 2e-05
>sp|Q8NDF8|PAPD5_HUMAN PAP-associated domain-containing protein 5 OS=Homo sapiens GN=PAPD5 PE=1 SV=2 Back     alignment and function desciption
 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 120  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 178

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 179  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 214

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 215  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 236

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 237  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 276

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 277  FTGGIGSYSLFLMAVSFLQ 295




Plays a role in replication-dependent histone mRNA degradation. May be involved in the terminal uridylation of mature histone mRNAs before their degradation is initiated. DNA polymerase, probably involved in DNA repair. May play a role in sister chromatid cohesion.
Homo sapiens (taxid: 9606)
EC: 2EC: .EC: 7EC: .EC: 7EC: .EC: -
>sp|Q68ED3|PAPD5_MOUSE PAP-associated domain-containing protein 5 OS=Mus musculus GN=Papd5 PE=2 SV=2 Back     alignment and function description
>sp|G5EFL0|GLD4_CAEEL Poly(A) RNA polymerase gld-4 OS=Caenorhabditis elegans GN=gld-4 PE=1 SV=1 Back     alignment and function description
>sp|Q9UTN3|CID14_SCHPO Poly(A) RNA polymerase cid14 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=cid14 PE=1 SV=2 Back     alignment and function description
>sp|Q5XG87|PAPD7_HUMAN DNA polymerase sigma OS=Homo sapiens GN=PAPD7 PE=1 SV=2 Back     alignment and function description
>sp|Q6PB75|PAPD7_MOUSE DNA polymerase sigma OS=Mus musculus GN=Papd7 PE=2 SV=1 Back     alignment and function description
>sp|P53632|PAP2_YEAST Poly(A) RNA polymerase protein 2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PAP2 PE=1 SV=1 Back     alignment and function description
>sp|P48561|TRF5_YEAST Poly(A) RNA polymerase protein 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TRF5 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1478
2241299261566 predicted protein [Populus trichocarpa] 0.963 0.909 0.618 0.0
1478190781500 hypothetical protein VITISV_000637 [Viti 0.906 0.893 0.598 0.0
2977345321331 unnamed protein product [Vitis vinifera] 0.821 0.912 0.631 0.0
4494329961526 PREDICTED: uncharacterized protein LOC10 0.938 0.908 0.561 0.0
4494782461514 PREDICTED: LOW QUALITY PROTEIN: uncharac 0.938 0.916 0.561 0.0
3341862401481 Nucleotidyltransferase family protein [A 0.917 0.915 0.561 0.0
2555411761420 nucleotidyltransferase, putative [Ricinu 0.688 0.716 0.615 0.0
3565064281436 PREDICTED: uncharacterized protein LOC10 0.698 0.719 0.592 0.0
297810181840 hypothetical protein ARALYDRAFT_912246 [ 0.492 0.866 0.677 0.0
19699089839 AT4g00060/F6N15_10 [Arabidopsis thaliana 0.486 0.856 0.686 0.0
>gi|224129926|ref|XP_002320705.1| predicted protein [Populus trichocarpa] gi|222861478|gb|EEE99020.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score = 1729 bits (4479), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 925/1495 (61%), Positives = 1097/1495 (73%), Gaps = 71/1495 (4%)

Query: 1    MSKSNPRPNLSQNPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSIL 60
            MS  NP  N S++   +   N+LM QN LI SLTSHISLYHS S   +    +  RSSIL
Sbjct: 1    MSVPNPNNNFSKH---QNIPNALMTQNHLIGSLTSHISLYHSQSNPPSSPNPNP-RSSIL 56

Query: 61   KWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCY 120
            KWF SL+VHQRQ+HLT VD KF Q+L+QML KL ++GH  FIILPDL SRD   LP LC+
Sbjct: 57   KWFKSLSVHQRQSHLTTVDFKFTQILLQMLAKLHSHGHCRFIILPDLLSRD---LPSLCF 113

Query: 121  KKSRGLLSRVAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDR 180
            KKSRGLLSR+AESNES R +FESTRLFSSREGEK+++       LD+ TVS + +ENV++
Sbjct: 114  KKSRGLLSRIAESNESERLIFESTRLFSSREGEKVDDCRSGAEGLDSVTVSEDLIENVEK 173

Query: 181  FIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNN 240
            F+++MD+ISNGGFLRGEESEL  DWVE +WLK +GYY IEAF+ N+LEV LRLAWLNC N
Sbjct: 174  FVELMDDISNGGFLRGEESELGTDWVELEWLKVRGYYCIEAFLANKLEVALRLAWLNCGN 233

Query: 241  GKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVL 300
            GKKRGVKLKEKL+AAG+AANV+WR+KGCVDWW NLD  +RRKVL   LGKAAKSLT E+L
Sbjct: 234  GKKRGVKLKEKLSAAGVAANVFWRRKGCVDWWRNLDAEVRRKVLNFALGKAAKSLTREIL 293

Query: 301  KEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLA 360
            K+ S    D + LF AG+++  R  H++S QR    L  D E GLA  P S SG  AS A
Sbjct: 294  KDVSGVSGDELSLFRAGVQRPWRDLHAESRQRIFLKLPADAEFGLAPKP-SFSGKDASFA 352

Query: 361  TVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCT 420
             +F+ LFVLQDI ++VL  Q +EYD   IFFS L  + T +DC+LRKLRGL+MV+SLDCT
Sbjct: 353  NIFNSLFVLQDIVSLVLPDQGSEYDTSHIFFSMLGSLGTLSDCILRKLRGLVMVISLDCT 412

Query: 421  KLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKR-QNPLPKSALDELSLDKLPKDPE 479
            +LEL GEG   SS NK  EK     RRKK +  + K+  NP P  ++DE S  KL +D +
Sbjct: 413  RLELLGEGTSNSSANKPSEKLGAGSRRKKGKTQNMKKLMNPTPVKSVDESSFKKLAEDIK 472

Query: 480  GALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKT 539
             A    +K +LM S+++PGI +  + +R+ S+  +EM     EH + LV  K RT  RK 
Sbjct: 473  CAPACIKKTELMESNEMPGIPHENENHRDISSPTVEM-----EHTQGLVHEKKRTAGRKN 527

Query: 540  KTVKNKNKNCTYNNPVPVKDPKVSVLETSS-SISLQDEVEKYDKLSAQNVSVDNSTCSNV 598
            +  +NK K  +++NPV V+ P+++V E  S S+   DE  K  +LS      DN T    
Sbjct: 528  RKGRNKKKKSSFSNPVEVRKPEIAVSEAPSFSVCSSDEEAKLCRLS------DNLTTQK- 580

Query: 599  LASNQSSCT-SASVPAREGIATQSTQEDCVVNSVNS------ECRRFSNGRIDNQTQHFL 651
             ASN S    S + P R+ I      ED  V           E  R SNG +DN++    
Sbjct: 581  -ASNDSLIDPSINEPTRKEIDALGIPEDHAVGCTEGISDAGLEHYRSSNGFVDNKSMPSR 639

Query: 652  QETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEI 711
            +ET    V  NII      ++L     N   G +F N   E +      + +K +   E+
Sbjct: 640  RETR-CGVGQNIIYQVATTKELITVSSNE--GTSFLNKKTEVKLD----VGNKLVRTHEV 692

Query: 712  KKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHN 771
            K+   + + + +E+F  +  K   +C SYEWP++ PVYFPSI+SHL PAT RLHLDVGHN
Sbjct: 693  KEVPTLNRREESENFHESGSKGLSDCLSYEWPSLGPVYFPSINSHLPPATYRLHLDVGHN 752

Query: 772  WHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSG 831
            WHNH+ QPF+PT+HQARN P +GG N++LSQPLPMSLDWPPMV++  G+AP++TCNYDSG
Sbjct: 753  WHNHIHQPFLPTVHQARNSPIEGGSNRMLSQPLPMSLDWPPMVRSNCGLAPTMTCNYDSG 812

Query: 832  FISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEE 891
            FIS  QS FQ+++  K MQ+ +KT DDE +CSGD +D  E T++QE  DE ++HW+SEEE
Sbjct: 813  FISRWQSTFQKSYTAKNMQYISKTFDDERRCSGDAIDFTEATSSQELMDEYENHWISEEE 872

Query: 892  LEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMV 951
             EVH VSGIDYNQ+FGGGVMYW+ SDHPGTGFSRPPSLSSDDS W WHEA++ RAVDDMV
Sbjct: 873  YEVHAVSGIDYNQHFGGGVMYWDPSDHPGTGFSRPPSLSSDDSGWPWHEAELNRAVDDMV 932

Query: 952  AFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEE 1011
            AFSSSYST GLTSPTAASFCS FDPL PGHQA  YV+ GNEVPGK +  SST TD A EE
Sbjct: 933  AFSSSYSTTGLTSPTAASFCSAFDPLVPGHQALGYVMSGNEVPGKAM-LSSTVTDAAAEE 991

Query: 1012 EISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRRE 1071
            ++SGS ASLS DV+ KA D+LP PILRPIIIPN+SRERSRSDFKRS +HKSPCVPP+RRE
Sbjct: 992  DVSGSLASLSSDVEGKAGDSLPYPILRPIIIPNMSRERSRSDFKRSLDHKSPCVPPTRRE 1051

Query: 1072 QPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTT 1131
             PRIKRPPSPVVLCVPRAPRPPPPSPVSDSRK RGFPTVRSGSSSPR WGVRGWYH+GT 
Sbjct: 1052 HPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRQWGVRGWYHDGTN 1111

Query: 1132 SEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEH------- 1184
             EE C RMDG+EVVWPSWRNK LS HPM+QPL GALLQD LIA+S LARDQ+H       
Sbjct: 1112 LEEACGRMDGAEVVWPSWRNKKLSTHPMVQPLPGALLQDRLIAMSHLARDQDHVSVLLYC 1171

Query: 1185 --PDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRV 1242
              PDV FPLQ  E+QNCPTRKASL L+ SLLH+EIDSFCKQVAA N ARKP+INWAVKRV
Sbjct: 1172 AIPDVLFPLQRAEIQNCPTRKASLCLVQSLLHDEIDSFCKQVAAANMARKPFINWAVKRV 1231

Query: 1243 TRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1302
            TRSLQVLWPRSR NIFGS+ATGL+LP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET
Sbjct: 1232 TRSLQVLWPRSRINIFGSSATGLALPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1291

Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
            CLQ                        IP+IMLVVEVP DLI SAAS+VQSPKE+  H T
Sbjct: 1292 CLQ------------------------IPVIMLVVEVPTDLITSAASNVQSPKEEPIHLT 1327

Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
             +HD  V S+MV L+DS SPKC+  + D+ +   S+RLDISFKSPSHTGLQTT LVK+LT
Sbjct: 1328 GEHDIQVQSNMVVLEDSISPKCTQLNCDSKRDVKSIRLDISFKSPSHTGLQTTQLVKDLT 1387

Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            EQFPA+TPLALVLKQFLADRSLDQSYSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1388 EQFPAATPLALVLKQFLADRSLDQSYSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1442




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|147819078|emb|CAN65347.1| hypothetical protein VITISV_000637 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297734532|emb|CBI16583.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449432996|ref|XP_004134284.1| PREDICTED: uncharacterized protein LOC101221970 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449478246|ref|XP_004155262.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101221970 [Cucumis sativus] Back     alignment and taxonomy information
>gi|334186240|ref|NP_191917.3| Nucleotidyltransferase family protein [Arabidopsis thaliana] gi|332656419|gb|AEE81819.1| Nucleotidyltransferase family protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|255541176|ref|XP_002511652.1| nucleotidyltransferase, putative [Ricinus communis] gi|223548832|gb|EEF50321.1| nucleotidyltransferase, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|356506428|ref|XP_003521985.1| PREDICTED: uncharacterized protein LOC100809291 [Glycine max] Back     alignment and taxonomy information
>gi|297810181|ref|XP_002872974.1| hypothetical protein ARALYDRAFT_912246 [Arabidopsis lyrata subsp. lyrata] gi|297318811|gb|EFH49233.1| hypothetical protein ARALYDRAFT_912246 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|19699089|gb|AAL90911.1| AT4g00060/F6N15_10 [Arabidopsis thaliana] gi|25090419|gb|AAN72296.1| At4g00060/F6N15_10 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query1478
DICTYBASE|DDB_G0274847 1880 DDB_G0274847 "nucleotidyltrans 0.053 0.042 0.452 1.1e-15
UNIPROTKB|I3LUY3 500 PAPD5 "Uncharacterized protein 0.064 0.192 0.368 1.8e-13
UNIPROTKB|Q8NDF8 572 PAPD5 "PAP-associated domain-c 0.064 0.167 0.368 2.8e-13
UNIPROTKB|E9PC61 619 PAPD5 "PAP-associated domain-c 0.064 0.155 0.368 3.6e-13
MGI|MGI:1917820 633 Papd5 "PAP associated domain c 0.064 0.151 0.368 3.8e-13
UNIPROTKB|F1PHH1 673 PAPD5 "Uncharacterized protein 0.064 0.142 0.368 4.5e-13
UNIPROTKB|B4DV38 698 PAPD5 "PAP-associated domain-c 0.064 0.137 0.368 5e-13
FB|FBgn0030049 1001 Trf4-1 [Drosophila melanogaste 0.046 0.068 0.450 6.4e-13
UNIPROTKB|F1NCH7 501 PAPD5 "Uncharacterized protein 0.064 0.191 0.349 8.5e-13
FB|FBgn0039251 407 Trf4-2 [Drosophila melanogaste 0.067 0.245 0.370 1.3e-12
DICTYBASE|DDB_G0274847 DDB_G0274847 "nucleotidyltransferase domain-containing protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
 Score = 177 (67.4 bits), Expect = 1.1e-15, Sum P(5) = 1.1e-15
 Identities = 38/84 (45%), Positives = 56/84 (66%)

Query:  1392 IKAATS--VRLDISFK-SPS--HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
             IK  T+  + +DISF+ SP+  HTG+   DLV +   +     PLA+VLK FL +R L+ 
Sbjct:  1582 IKLVTTENISVDISFRESPTSVHTGIAARDLVADCVREIIGLYPLAIVLKWFLRERGLNN 1641

Query:  1447 SYSGGLSSYCLMLLITRFLQHEHH 1470
             +Y+GGLSSYCL+L++  FL++  H
Sbjct:  1642 TYTGGLSSYCLVLMLISFLKNNEH 1665


GO:0016779 "nucleotidyltransferase activity" evidence=IEA
UNIPROTKB|I3LUY3 PAPD5 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|Q8NDF8 PAPD5 "PAP-associated domain-containing protein 5" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|E9PC61 PAPD5 "PAP-associated domain-containing protein 5" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1917820 Papd5 "PAP associated domain containing 5" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|F1PHH1 PAPD5 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|B4DV38 PAPD5 "PAP-associated domain-containing protein 5" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
FB|FBgn0030049 Trf4-1 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|F1NCH7 PAPD5 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
FB|FBgn0039251 Trf4-2 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer2.7.7LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_LG_XIV000111
hypothetical protein (1566 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1478
cd05402114 cd05402, NT_PAP_TUTase, Nucleotidyltransferase (NT 1e-08
COG5260 482 COG5260, TRF4, DNA polymerase sigma [DNA replicati 4e-07
COG5260 482 COG5260, TRF4, DNA polymerase sigma [DNA replicati 2e-06
pfam0190992 pfam01909, NTP_transf_2, Nucleotidyltransferase do 5e-06
>gnl|CDD|143392 cd05402, NT_PAP_TUTase, Nucleotidyltransferase (NT) domain of poly(A) polymerases and terminal uridylyl transferases Back     alignment and domain information
 Score = 54.1 bits (131), Expect = 1e-08
 Identities = 25/96 (26%), Positives = 40/96 (41%), Gaps = 17/96 (17%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRN 1297
             + R+   ++  +P ++   FGS  TGL LP SD+DL +  P  R               
Sbjct: 5    VLDRLQELIKEWFPGAKLYPFGSYVTGLGLPGSDIDLCLLGPNHRVDR------------ 52

Query: 1298 GIKETCLQHAARYLANQEWVKSDSLKTVENTAIPII 1333
               E  L+  A+ L     V    ++ + N  +PII
Sbjct: 53   ---EDFLRKLAKLLKKSGEVV--EVEPIINARVPII 83


Poly(A) polymerases (PAPs) catalyze mRNA poly(A) tail synthesis, and terminal uridylyl transferases (TUTases) uridylate RNA. PAPs in this subgroup include human PAP alpha, mouse testis-specific cytoplasmic PAP beta, human nuclear PAP gamma, Saccharomyces cerevisiae PAP1, TRF4 and-5, Schizosaccharomyces pombe caffeine-induced death proteins -1, and -14, Caenorhabditis elegans Germ Line Development-2, and Chlamydomonas reinhardtii MUT68. This family also includes human U6 snRNA-specific TUTase1, and Trypanosoma brucei 3'-TUTase-1,-2, and 4. This family belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. For the majority of proteins in this family, these carboxylate residues are conserved. Length = 114

>gnl|CDD|227585 COG5260, TRF4, DNA polymerase sigma [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|227585 COG5260, TRF4, DNA polymerase sigma [DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|216779 pfam01909, NTP_transf_2, Nucleotidyltransferase domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 1478
COG5260 482 TRF4 DNA polymerase sigma [DNA replication, recomb 99.97
KOG1906 514 consensus DNA polymerase sigma [Replication, recom 99.96
cd05402114 NT_PAP_TUTase Nucleotidyltransferase (NT) domain o 99.82
PTZ00418 593 Poly(A) polymerase; Provisional 99.67
KOG2277 596 consensus S-M checkpoint control protein CID1 and 99.61
KOG2245 562 consensus Poly(A) polymerase and related nucleotid 99.46
COG5186 552 PAP1 Poly(A) polymerase [RNA processing and modifi 99.04
TIGR03671 408 cca_archaeal CCA-adding enzyme. 98.82
PRK13300 447 tRNA CCA-pyrophosphorylase; Provisional 98.8
KOG1906514 consensus DNA polymerase sigma [Replication, recom 98.46
COG1746 443 CCA1 tRNA nucleotidyltransferase (CCA-adding enzym 98.23
PF0190993 NTP_transf_2: Nucleotidyltransferase domain A subs 97.88
cd0539749 NT_Pol-beta-like Nucleotidyltransferase (NT) domai 97.6
PF04928254 PAP_central: Poly(A) polymerase central domain; In 97.22
cd0540393 NT_KNTase_like Nucleotidyltransferase (NT) domain 97.17
cd05400143 NT_2-5OAS_ClassI-CCAase Nucleotidyltransferase (NT 96.43
COG1708128 Predicted nucleotidyltransferases [General functio 95.94
COG166997 Predicted nucleotidyltransferases [General functio 95.57
PRK13746262 aminoglycoside resistance protein; Provisional 94.03
PF03813 972 Nrap: Nrap protein; InterPro: IPR005554 Members of 91.37
PF14792112 DNA_pol_B_palm: DNA polymerase beta palm ; PDB: 1R 87.28
PF10421190 OAS1_C: 2'-5'-oligoadenylate synthetase 1, domain 85.76
PRK02098221 phosphoribosyl-dephospho-CoA transferase; Provisio 84.82
TIGR03135202 malonate_mdcG holo-ACP synthase, malonate decarbox 83.89
PF09249114 tRNA_NucTransf2: tRNA nucleotidyltransferase, seco 83.78
smart00572246 DZF domain in DSRM or ZnF_C2H2 domain containing p 81.17
>COG5260 TRF4 DNA polymerase sigma [DNA replication, recombination, and repair] Back     alignment and domain information
Probab=99.97  E-value=1.7e-30  Score=297.61  Aligned_cols=185  Identities=28%  Similarity=0.444  Sum_probs=158.0

Q ss_pred             HHHHHHHHHHHHHHHhhhhHHhhhHHHHHHHHHHHHHHHhhCCCCeEEEecccccCCCCCCCCeeEEEEcCCCCCCcchh
Q 000465         1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIK 1288 (1478)
Q Consensus      1209 m~s~LheEI~dF~kyVaPEeeeRK~~In~AVkRIe~ILQsLwPdArV~pFGS~ATGLaLPsSDIDLVV~~PpvrNlEPiK 1288 (1478)
                      ....|+.||.+|+.+++|..++-+ .|..++++|+.+|++.||++.+++|||+.|||++|.|||||||..++....    
T Consensus        53 ~~~~lt~el~~~y~~I~ps~eEl~-~R~~~leklr~~lk~~~pda~l~vFGS~~t~L~l~~SDiDl~I~s~~~~~~----  127 (482)
T COG5260          53 ESDELTSELLEFYDYIAPSDEELK-RRKALLEKLRTLLKKEFPDADLKVFGSTETGLALPKSDIDLCIISDPRGYK----  127 (482)
T ss_pred             hHHHHHHHHHHHHHhhCCCHHHHH-HHHHHHHHHHHHHHHhCCccceeEecccccccccCcccccEEEecCCcccc----
Confidence            346899999999999999543222 245799999999999999999999999999999999999999998764432    


Q ss_pred             cccccccccchhHHHHHHHHHHHHhcCCcccCCeeeccCCCceEEEEEEcccchhhhhhcccCCCCcccccccccccCCc
Q 000465         1289 EAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368 (1478)
Q Consensus      1289 eAGiLE~RNgIKEt~LqkLAR~LRrqg~vk~~sVevI~nARVPIIKLv~eVP~Dli~~yas~vqS~keeSi~I~~earsP 1368 (1478)
                           +.+++     ...+..++.+..|.   .+++|.+|||||||+++                               
T Consensus       128 -----et~~~-----~~l~~~l~~~~~~~---~~~~v~tarVPIIKl~d-------------------------------  163 (482)
T COG5260         128 -----ETRNA-----GSLASHLFKKNLAK---EVVVVSTARVPIIKLVD-------------------------------  163 (482)
T ss_pred             -----ccccH-----HHHHHHHHHhccCe---eeEEEEecccceEEEec-------------------------------
Confidence                 23332     12233344445554   68899999999999986                               


Q ss_pred             cccccccccCCCCCCCCCCCCcccccccceEEEEEeeCCCccHHHHHHHHHHHHhhCCCchHHHHHHHHHHHhcCCCCCC
Q 000465         1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448 (1478)
Q Consensus      1369 ~~SD~~~~EgS~sP~Csp~n~d~~dk~tgI~VDISFNNPshnGLrNSeLIreYleqfPrLRPLVLVLK~WLKqRGLNdpy 1448 (1478)
                                               ...++.|||+|++  .+|+.|+++++.|...+|++|||+++||+||++|.||+++
T Consensus       164 -------------------------~~s~l~~Disfn~--~~~~~~akl~~~~~~~~P~lrpLvliIKhwl~~R~ln~~~  216 (482)
T COG5260         164 -------------------------PQSGLHCDISFNN--TNGIVNAKLIRSYLKEDPRLRPLVLIIKHWLKRRALNDVA  216 (482)
T ss_pred             -------------------------CccceEEEeecCc--hhHHHHHHHHHHHHhcCcccchHHHHHHHHHHHHhhcccc
Confidence                                     2478999999999  8999999999999999999999999999999999999999


Q ss_pred             CCCCChHHHHHHHHHHHhhcC
Q 000465         1449 SGGLSSYCLMLLITRFLQHEH 1469 (1478)
Q Consensus      1449 sGGLSSYALVLMVI~FLQ~~~ 1469 (1478)
                      .|||+||++++||+.|||+++
T Consensus       217 ~GtL~sy~i~cmV~sfLq~~~  237 (482)
T COG5260         217 TGTLSSYTISCMVLSFLQMHP  237 (482)
T ss_pred             cCcchhhhhHHHHHHHHHhCC
Confidence            999999999999999999997



>KOG1906 consensus DNA polymerase sigma [Replication, recombination and repair] Back     alignment and domain information
>cd05402 NT_PAP_TUTase Nucleotidyltransferase (NT) domain of poly(A) polymerases and terminal uridylyl transferases Back     alignment and domain information
>PTZ00418 Poly(A) polymerase; Provisional Back     alignment and domain information
>KOG2277 consensus S-M checkpoint control protein CID1 and related nucleotidyltransferases [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
>KOG2245 consensus Poly(A) polymerase and related nucleotidyltransferases [RNA processing and modification] Back     alignment and domain information
>COG5186 PAP1 Poly(A) polymerase [RNA processing and modification] Back     alignment and domain information
>TIGR03671 cca_archaeal CCA-adding enzyme Back     alignment and domain information
>PRK13300 tRNA CCA-pyrophosphorylase; Provisional Back     alignment and domain information
>KOG1906 consensus DNA polymerase sigma [Replication, recombination and repair] Back     alignment and domain information
>COG1746 CCA1 tRNA nucleotidyltransferase (CCA-adding enzyme) [Translation, ribosomal structure and biogenesis] Back     alignment and domain information
>PF01909 NTP_transf_2: Nucleotidyltransferase domain A subset of this Pfam family; InterPro: IPR002934 A small region that overlaps with a nuclear localization signal and binds to the RNA primer contains three aspartates that are essential for catalysis Back     alignment and domain information
>cd05397 NT_Pol-beta-like Nucleotidyltransferase (NT) domain of DNA polymerase beta and similar proteins Back     alignment and domain information
>PF04928 PAP_central: Poly(A) polymerase central domain; InterPro: IPR007012 In eukaryotes, polyadenylation of pre-mRNA plays an essential role in the initiation step of protein synthesis, as well as in the export and stability of mRNAs Back     alignment and domain information
>cd05403 NT_KNTase_like Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins Back     alignment and domain information
>cd05400 NT_2-5OAS_ClassI-CCAase Nucleotidyltransferase (NT) domain of 2'5'-oligoadenylate (2-5A)synthetase (2-5OAS) and class I CCA-adding enzyme Back     alignment and domain information
>COG1708 Predicted nucleotidyltransferases [General function prediction only] Back     alignment and domain information
>COG1669 Predicted nucleotidyltransferases [General function prediction only] Back     alignment and domain information
>PRK13746 aminoglycoside resistance protein; Provisional Back     alignment and domain information
>PF03813 Nrap: Nrap protein; InterPro: IPR005554 Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human Back     alignment and domain information
>PF14792 DNA_pol_B_palm: DNA polymerase beta palm ; PDB: 1RZT_A 3PML_A 2PFN_A 3HX0_K 3HWT_A 2GWS_E 2BCQ_A 3UPQ_A 2BCS_A 3UQ2_A Back     alignment and domain information
>PF10421 OAS1_C: 2'-5'-oligoadenylate synthetase 1, domain 2, C-terminus ; InterPro: IPR018952 This is the largely alpha-helical, C-terminal half of 2'-5'-oligoadenylate synthetase 1, being described as domain 2 of the enzyme and homologous to a tandem ubiquitin repeat Back     alignment and domain information
>PRK02098 phosphoribosyl-dephospho-CoA transferase; Provisional Back     alignment and domain information
>TIGR03135 malonate_mdcG holo-ACP synthase, malonate decarboxylase-specific Back     alignment and domain information
>PF09249 tRNA_NucTransf2: tRNA nucleotidyltransferase, second domain; InterPro: IPR015329 This domain adopts a structure consisting of a five helical bundle core Back     alignment and domain information
>smart00572 DZF domain in DSRM or ZnF_C2H2 domain containing proteins Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1478
3nyb_A 323 Structure And Function Of The Polymerase Core Of Tr 1e-06
>pdb|3NYB|A Chain A, Structure And Function Of The Polymerase Core Of Tramp, A Rna Surveillance Complex Length = 323 Back     alignment and structure

Iteration: 1

Score = 53.1 bits (126), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 55/234 (23%), Positives = 85/234 (36%), Gaps = 76/234 (32%) Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295 N + + +++ LWP + ++FGS +T L LP SD+D VV E G E Sbjct: 43 NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 92 Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355 RN + + LA + ++ V +PII V PH I A Sbjct: 93 RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPHSGIHIA-------- 135 Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415 S ++ I+AA +R Sbjct: 136 ----------------------------VSFERTNGIEAAKLIR---------------- 151 Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469 E + P L L++KQFL R L+ ++GGL + ++ L+ FL H H Sbjct: 152 ----EWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 200

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query1478
3nyb_A 323 Poly(A) RNA polymerase protein 2; polya RNA polyme 5e-24
4ep7_A 340 Poly(A) RNA polymerase protein CID1; poly(U) polym 8e-22
4e8f_A 405 Poly(A) RNA polymerase protein CID1; beta polymera 4e-19
3pq1_A 464 Poly(A) RNA polymerase; nucleotidyl transferase, R 9e-18
2ikf_A 353 RNA uridylyl transferase; tutase, nucleotidyltrans 1e-14
2hhp_A 530 Poly(A) polymerase; template-independent RNA polym 2e-13
2b4v_A 468 RNA editing complex protein MP57; tbret2, TBMP57, 9e-13
1q79_A 514 Poly(A) polymerase alpha; mRNA processing, nucleot 3e-10
3hj4_A 384 Minor editosome-associated tutase; nucleotidyltran 4e-10
3hj4_A 384 Minor editosome-associated tutase; nucleotidyltran 1e-09
1vt4_I1221 APAF-1 related killer DARK; drosophila apoptosome, 7e-08
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-04
2rff_A111 Putative nucleotidyltransferase; NP_343093.1, nucl 5e-04
>3nyb_A Poly(A) RNA polymerase protein 2; polya RNA polymerase, zinc knuckle protein, RNA surveillance binds to TRF4P/AIR2P heterodimer; 2.70A {Saccharomyces cerevisiae} Length = 323 Back     alignment and structure
 Score =  104 bits (259), Expect = 5e-24
 Identities = 51/266 (19%), Positives = 89/266 (33%), Gaps = 82/266 (30%)

Query: 1209 MHSLLHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            +   L  EI  F   +     E   R    N  +  +  +++ LWP +  ++FGS +T L
Sbjct: 17   ISDWLTFEIKDFVAYISPSREEIEIR----NQTISTIREAVKQLWPDADLHVFGSYSTDL 72

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
             LP SD+D VV                 E RN +        A +L  +       ++ V
Sbjct: 73   YLPGSDIDCVVTSELGG----------KESRNNLYSL-----ASHLKKKNLATE--VEVV 115

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
                +PI                                                  K  
Sbjct: 116  AKARVPI-------------------------------------------------IKFV 126

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
                      + + + +SF+     G++   L++E  +  P    L L++KQFL  R L+
Sbjct: 127  EP-------HSGIHIAVSFER--TNGIEAAKLIREWLDDTPGLRELVLIVKQFLHARRLN 177

Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHL 1471
              ++GGL  + ++ L+  FL     +
Sbjct: 178  NVHTGGLGGFSIICLVFSFLHMHPRI 203


>4ep7_A Poly(A) RNA polymerase protein CID1; poly(U) polymerase, UTP binding, transferase; HET: UTP; 2.28A {Schizosaccharomyces pombe} Length = 340 Back     alignment and structure
>4e8f_A Poly(A) RNA polymerase protein CID1; beta polymerase-like nucleotidyl transferase, terminal uridi transferase, UTP, cytoplasmic; 2.60A {Schizosaccharomyces pombe 972h-} PDB: 4e7x_A* 4e80_A Length = 405 Back     alignment and structure
>3pq1_A Poly(A) RNA polymerase; nucleotidyl transferase, RNP-type RNA binding domain, poly(A polymerase, mitochondria, transferase; 3.10A {Homo sapiens} Length = 464 Back     alignment and structure
>2ikf_A RNA uridylyl transferase; tutase, nucleotidyltransferase, UTP-binding, RNA editing; HET: UTP; 2.00A {Trypanosoma brucei} PDB: 2nom_A* 2q0c_A* 2q0d_A* 2q0e_A* 2q0f_A* 2q0g_A* Length = 353 Back     alignment and structure
>2hhp_A Poly(A) polymerase; template-independent RNA polymerase, transferase; HET: FLC; 1.80A {Saccharomyces cerevisiae} SCOP: a.160.1.1 d.218.1.3 d.58.16.1 PDB: 1fa0_A* 3c66_A* 2o1p_A 2q66_A* Length = 530 Back     alignment and structure
>2b4v_A RNA editing complex protein MP57; tbret2, TBMP57, terminal uridylyl transferase, editosome, transferase/RNA binding protein complex; 1.80A {Trypanosoma brucei} SCOP: a.160.1.4 d.218.1.10 PDB: 2b51_A* 2b56_A* Length = 468 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>2rff_A Putative nucleotidyltransferase; NP_343093.1, nucleotidyltransferase domain, structural genomics; HET: MSE; 1.40A {Sulfolobus solfataricus P2} Length = 111 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 1478
d2b4va1 183 a.160.1.4 (A:289-471) RNA editing terminal uridyl 9e-08
d1q79a2196 d.218.1.3 (A:19-214) Poly(A) polymerase, PAP, N-te 3e-07
d2q66a2197 d.218.1.3 (A:5-201) Poly(A) polymerase, PAP, N-ter 4e-05
d1no5a_100 d.218.1.5 (A:) Hypothetical protein HI0073 {Haemop 5e-04
>d2b4va1 a.160.1.4 (A:289-471) RNA editing terminal uridyl transferase 2, TUTase 2, RET2 {Trypanosoma brucei [TaxId: 5691]} Length = 183 Back     information, alignment and structure

class: All alpha proteins
fold: PAP/OAS1 substrate-binding domain
superfamily: PAP/OAS1 substrate-binding domain
family: RNA editing terminal uridyl transferase 2, RET2, domain 2
domain: RNA editing terminal uridyl transferase 2, TUTase 2, RET2
species: Trypanosoma brucei [TaxId: 5691]
 Score = 51.4 bits (122), Expect = 9e-08
 Identities = 7/53 (13%), Positives = 19/53 (35%)

Query: 1426 PASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
             A+   A+ +K +    ++       L+SY + ++   +L     +       
Sbjct: 2    VAARHTAMAVKAWGKATNVGAGSGAMLTSYAVTVMFIYYLLVTRQVLWVDPWS 54


>d1q79a2 d.218.1.3 (A:19-214) Poly(A) polymerase, PAP, N-terminal domain {Cow (Bos taurus) [TaxId: 9913]} Length = 196 Back     information, alignment and structure
>d2q66a2 d.218.1.3 (A:5-201) Poly(A) polymerase, PAP, N-terminal domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 197 Back     information, alignment and structure
>d1no5a_ d.218.1.5 (A:) Hypothetical protein HI0073 {Haemophilus influenzae [TaxId: 727]} Length = 100 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query1478
d2q66a2197 Poly(A) polymerase, PAP, N-terminal domain {Baker' 99.75
d1q79a2196 Poly(A) polymerase, PAP, N-terminal domain {Cow (B 99.73
d2b4va2259 RNA editing terminal uridyl transferase 2, TUTase 99.34
d1no5a_100 Hypothetical protein HI0073 {Haemophilus influenza 98.26
d1ylqa190 Putative nucleotidyltransferase AF0614 {Archaeon A 97.42
d1r89a2142 tRNA nucleotidyltransferase, N-terminal domain {Ar 96.85
d1px5a2200 2'-5'-oligoadenylate synthetase 1, OAS1, N-termina 96.71
d1wota_98 Unnamed putative nucleotidyltransferase {Thermus t 96.36
d2q66a1150 Poly(A) polymerase, PAP, middle domain {Baker's ye 96.35
d1q79a1150 Poly(A) polymerase, PAP, middle domain {Cow (Bos t 96.35
d1knya2125 Kanamycin nucleotidyltransferase (KNTase), N-termi 96.12
d2bcqa3190 DNA polymerase lambda {Human (Homo sapiens) [TaxId 93.78
d1jmsa4208 Terminal deoxynucleotidyl transferase {Mouse (Mus 92.66
d2fmpa3187 DNA polymerase beta, catalytic (31 kD) fragment {H 86.21
>d2q66a2 d.218.1.3 (A:5-201) Poly(A) polymerase, PAP, N-terminal domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Nucleotidyltransferase
superfamily: Nucleotidyltransferase
family: Poly(A) polymerase, PAP, N-terminal domain
domain: Poly(A) polymerase, PAP, N-terminal domain
species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=99.75  E-value=8.7e-18  Score=135.13  Aligned_cols=118  Identities=19%  Similarity=0.193  Sum_probs=83.4

Q ss_pred             CCCCCCCCCCCCHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHH----------------CCC
Q ss_conf             866768875100123789999999999998----822239961199999999999999711----------------899
Q 000465         1193 PLEVQNCPTRKASLSLMHSLLHEEIDSFCK----QVAAENTARKPYINWAVKRVTRSLQVL----------------WPR 1252 (1478)
Q Consensus      1193 pP~m~~CP~~k~sLs~i~s~LheEI~aF~k----yVaPEeeeRK~~Ie~AIkRIerIIQsL----------------fPd 1252 (1478)
                      +|.-..-|+.+      ...++++|..|.+    |-.+++.++   |+.++++|+++++.+                +++
T Consensus         7 ~Pis~~~Pt~~------d~~~s~~L~~~L~~~~~~~s~ee~~~---R~~vl~~l~~iv~~~v~~~~~~~~~~~~~~~~~~   77 (197)
T d2q66a2           7 GPVSTVGATAA------ENKLNDSLIQELKKEGSFETEQETAN---RVQVLKILQELAQRFVYEVSKKKNMSDGMARDAG   77 (197)
T ss_dssp             CCSCCCCCCHH------HHHHHHHHHHHHHHTTCSCCHHHHHH---HHHHHHHHHHHHHHHHHHHHHHTTCCHHHHHHTB
T ss_pred             CCCCCCCCCHH------HHHHHHHHHHHHHHCCCCCCHHHHHH---HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
T ss_conf             98546898848------89989999999998089988789988---9999999999999998887663068732234689


Q ss_pred             CEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCEE
Q ss_conf             80888334446998899970699974898897610000222355430689999999999706874467736505787008
Q 000465         1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332 (1478)
Q Consensus      1253 ARV~IFGSsATGLaLPsSDIDLVV~lPpvrNlEPiKeAGIlE~RNgIKEtiLqkLAR~LRr~g~vk~~sVkvI~kARVPI 1332 (1478)
                      ++|++||||.+|+++|+||||+++..|...+                .+.++..++..|++...+.  +++.|.+|+|||
T Consensus        78 ~~v~~fGS~~lg~~~p~SDID~~~~~p~~~~----------------~~~ff~~l~~~L~~~~~v~--~i~~I~~A~VPI  139 (197)
T d2q66a2          78 GKIFTYGSYRLGVHGPGSDIDTLVVVPKHVT----------------REDFFTVFDSLLRERKELD--EIAPVPDAFVPI  139 (197)
T ss_dssp             CEEEEEHHHHHTCCCTTCCEEEEEEECTTCC----------------HHHHHHHHHHHHHTCTTEE--EEEEECSSSSCE
T ss_pred             CEEEECCCCCCCCCCCCCCCCEEEECCCCCC----------------HHHHHHHHHHHHHHCCCCC--EEEEEECCCCCE
T ss_conf             6799524512355278877246996487689----------------8999999999998187776--168830245657


Q ss_pred             EEEEE
Q ss_conf             99998
Q 000465         1333 IMLVV 1337 (1478)
Q Consensus      1333 IKLv~ 1337 (1478)
                      |||+.
T Consensus       140 IK~~~  144 (197)
T d2q66a2         140 IKIKF  144 (197)
T ss_dssp             EEEEE
T ss_pred             EEEEE
T ss_conf             99998



>d1q79a2 d.218.1.3 (A:19-214) Poly(A) polymerase, PAP, N-terminal domain {Cow (Bos taurus) [TaxId: 9913]} Back     information, alignment and structure
>d2b4va2 d.218.1.10 (A:30-288) RNA editing terminal uridyl transferase 2, TUTase 2, RET2 {Trypanosoma brucei [TaxId: 5691]} Back     information, alignment and structure
>d1no5a_ d.218.1.5 (A:) Hypothetical protein HI0073 {Haemophilus influenzae [TaxId: 727]} Back     information, alignment and structure
>d1ylqa1 d.218.1.5 (A:1-90) Putative nucleotidyltransferase AF0614 {Archaeon Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure
>d1r89a2 d.218.1.7 (A:1-142) tRNA nucleotidyltransferase, N-terminal domain {Archaeon Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure
>d1px5a2 d.218.1.6 (A:1-200) 2'-5'-oligoadenylate synthetase 1, OAS1, N-terminal domain {Pig (Sus scrofa) [TaxId: 9823]} Back     information, alignment and structure
>d1wota_ d.218.1.5 (A:) Unnamed putative nucleotidyltransferase {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure
>d2q66a1 a.160.1.1 (A:202-351) Poly(A) polymerase, PAP, middle domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1q79a1 a.160.1.1 (A:215-364) Poly(A) polymerase, PAP, middle domain {Cow (Bos taurus) [TaxId: 9913]} Back     information, alignment and structure
>d1knya2 d.218.1.1 (A:1-125) Kanamycin nucleotidyltransferase (KNTase), N-terminal domain {Staphylococcus aureus [TaxId: 1280]} Back     information, alignment and structure
>d2bcqa3 d.218.1.2 (A:386-575) DNA polymerase lambda {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1jmsa4 d.218.1.2 (A:303-510) Terminal deoxynucleotidyl transferase {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2fmpa3 d.218.1.2 (A:149-335) DNA polymerase beta, catalytic (31 kD) fragment {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure