Citrus Sinensis ID: 013971


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430---
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLE
cccccccccEEEEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEEccccccEEEEEEEEEEEcccEEEEEcccccccccccccccEEEEEEEEEcEEcccEEEEEEEEEEcccccEEEEEEEEEEEccccEEEEEEEEEEcccEEEEEEEEEcccccEEEEEEEEEEcccccEEEcccccccccccccccccccccEEEEccccEEEEEEEEEEcccccccccccccccccEEEEEEEEcccccccEEEEEEEEccccccccEEEEEEEcccEEEEcccEEEEEEEEEccccccccEEEEEEccccccccEEEEccccEEEEcccccccEEEEEEEEEEcccEEEEEccEEEEEccccEEEccccccc
ccccccccEEEEEEEEcccccccccccccccccccccccccccccccccccHHHccccccccccccccccccccccccccccccccccccccccccEEEEEEEEEEEEEEcccccEEEEEEEEEEEEccccEEEcccccccccccccccccccEEEEEEEHHcccEEEEEEEEEEcccccEEEEEEEEEEEEccccEEEEEEEcccccEEEEEEEEEcccccEEEEEEEEccccccEEEEcccccccccccccccccccccccEEccccEEEEEEEEEcccccccccccccccEEEEEEEEEEEcccccccEEEcccccccccccccEEEEEEEcccEEEEEEcEEEEEEEEEccccccccEEEEEEccccccEEEEEEcccEccccccccccccEEEEEEEEEccccEEEEcEEEEEEcccccEcccccccc
msstpgthsLAFRVMRlcrpslhvepplrvdptdlfigedifddpiaasnlpplissdvttnkssdltyRSRFLLhdsadsiglsgllvlpqafgaiylgeTFCSYISinnsstlevRDVVIKAEIQTDKQRILLLdtskspvesiraggrydfIVEHDVKELGAHTLVCTAlysdgegerkyLPQFFKFIvsnplsvrtKVRVVKEITFLEACIENhtksnlymdqvefepsqnwsatmlkadgphsdynaqsreifkppvlirsgggiHNYLYQLKmlshgssspvkvqgsnvlgklqitwrtnlgepgrlqtQQILGTTITSKEIElnvvevpsvvgidkpfllklkltnqtdkeqgpfeiwlsqndsdeekVVMINGLRIMalapveafgstdFHLNLIATKLGVQRITGITVFDKLekitydslpdle
msstpgthslAFRVMRLCrpslhvepplrvDPTDLFIGEDIFDDPIAasnlpplissdvttnksSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAeiqtdkqrillldtskspvesiRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFivsnplsvrtKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWrtnlgepgrlqtQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLtnqtdkeqgpfeiwlsqndsdEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKlgvqritgitvfdklekitydslpdle
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRlqtqqilgttitSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLE
*********LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS*********DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEF*************************EIFKPPVLIRSGGGIHNYLYQLKMLSHG****VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN*********EIW**********VVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITY*******
******T**LAFRVMRLCRPSLHVEPPLRVDPTDLFIGE*****************************************SIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD********************GRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM********************PVLIRSGGGIHNYLYQLK***************NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPD**
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH********QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLE
*****GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGE**************************************SADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA***********REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSL****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLE
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query433 2.2.26 [Sep-21-2011]
Q6PBY7412 UPF0533 protein C5orf44 h yes no 0.872 0.917 0.330 1e-52
Q5RCG0417 UPF0533 protein C5orf44 h yes no 0.870 0.904 0.321 6e-52
A5PLN9417 UPF0533 protein C5orf44 O yes no 0.870 0.904 0.318 7e-52
A7MB76417 UPF0533 protein C5orf44 h yes no 0.870 0.904 0.314 1e-50
Q3TIR1417 UPF0533 protein C5orf44 h yes no 0.879 0.913 0.319 1e-50
Q5M887418 UPF0533 protein C5orf44 h yes no 0.882 0.913 0.319 3e-50
Q0VFT9412 UPF0533 protein C5orf44 h yes no 0.879 0.924 0.323 8e-50
Q6GPR5414 UPF0533 protein C5orf44 h N/A no 0.884 0.925 0.320 3e-48
A8WX89401 UPF0533 protein CBG04321 N/A no 0.868 0.937 0.303 1e-46
Q95QQ2401 UPF0533 protein C56C10.7 yes no 0.870 0.940 0.294 2e-46
>sp|Q6PBY7|CE044_DANRE UPF0533 protein C5orf44 homolog OS=Danio rerio GN=zgc:73187 PE=2 SV=2 Back     alignment and function desciption
 Score =  207 bits (527), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 141/427 (33%), Positives = 222/427 (51%), Gaps = 49/427 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF+                L+  D +T K
Sbjct: 10  HLLALKVMRLTKPTLFTNMPVTCEDRDLPGDLFLR---------------LMKDDPSTVK 54

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                          A+++ L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++K
Sbjct: 55  G--------------AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S S V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVK-EITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 242
             +FFKF V  PL V+TK    + +  FLEA I+N T S ++M++V  EPS  ++ T L 
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVTELN 219

Query: 243 --ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 300
             A G  S  +   +  +  P+  R       YLY LK     +     ++G  V+GKL 
Sbjct: 220 NVASGDESSESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLD 273

Query: 301 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 360
           I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++   
Sbjct: 274 IVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNCSERT-- 331

Query: 361 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 420
             ++ L   ++       ++G ++  L+P     S    L L+++  G+Q I+G+ + D 
Sbjct: 332 -MDLLLEMCNTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISGLRLTDT 387

Query: 421 LEKITYD 427
             K TY+
Sbjct: 388 FLKRTYE 394





Danio rerio (taxid: 7955)
>sp|Q5RCG0|CE044_PONAB UPF0533 protein C5orf44 homolog OS=Pongo abelii PE=2 SV=1 Back     alignment and function description
>sp|A5PLN9|CE044_HUMAN UPF0533 protein C5orf44 OS=Homo sapiens GN=C5orf44 PE=2 SV=2 Back     alignment and function description
>sp|A7MB76|CE044_BOVIN UPF0533 protein C5orf44 homolog OS=Bos taurus PE=2 SV=1 Back     alignment and function description
>sp|Q3TIR1|CE044_MOUSE UPF0533 protein C5orf44 homolog OS=Mus musculus PE=2 SV=1 Back     alignment and function description
>sp|Q5M887|CE044_RAT UPF0533 protein C5orf44 homolog OS=Rattus norvegicus PE=2 SV=2 Back     alignment and function description
>sp|Q0VFT9|CE044_XENTR UPF0533 protein C5orf44 homolog OS=Xenopus tropicalis PE=2 SV=1 Back     alignment and function description
>sp|Q6GPR5|CE044_XENLA UPF0533 protein C5orf44 homolog OS=Xenopus laevis PE=2 SV=2 Back     alignment and function description
>sp|A8WX89|U533_CAEBR UPF0533 protein CBG04321 OS=Caenorhabditis briggsae GN=CBG04321 PE=3 SV=2 Back     alignment and function description
>sp|Q95QQ2|U533_CAEEL UPF0533 protein C56C10.7 OS=Caenorhabditis elegans GN=C56C10.7 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query433
255556003434 expressed protein, putative [Ricinus com 0.986 0.983 0.774 0.0
225470348438 PREDICTED: UPF0533 protein C5orf44 [Viti 0.997 0.986 0.773 0.0
356548745440 PREDICTED: UPF0533 protein C5orf44 homol 0.976 0.961 0.758 0.0
449457717440 PREDICTED: UPF0533 protein C5orf44-like 1.0 0.984 0.723 0.0
224079249450 predicted protein [Populus trichocarpa] 0.983 0.946 0.739 0.0
356521339435 PREDICTED: UPF0533 protein C5orf44-like 0.967 0.963 0.742 0.0
388496064437 unknown [Medicago truncatula] 0.965 0.956 0.725 1e-180
358346667446 hypothetical protein MTR_084s0010 [Medic 0.965 0.937 0.710 1e-177
297824907443 hypothetical protein ARALYDRAFT_483987 [ 0.993 0.970 0.680 1e-168
18407493442 uncharacterized protein [Arabidopsis tha 0.990 0.970 0.679 1e-168
>gi|255556003|ref|XP_002519036.1| expressed protein, putative [Ricinus communis] gi|223541699|gb|EEF43247.1| expressed protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  699 bits (1803), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/434 (77%), Positives = 380/434 (87%), Gaps = 7/434 (1%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+TPGTHSLAFRVMRLCRPS HV+  L VDP+DL +GEDIFDDP+AAS LPPLI S +T
Sbjct: 1   MSTTPGTHSLAFRVMRLCRPSFHVDAQLLVDPSDLIVGEDIFDDPVAASRLPPLIDSHIT 60

Query: 61  T-NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
               +SDL+YR+RFL    +DS GL+GLLVLPQAFGAIYLGETFCSYISINNSS  EVRD
Sbjct: 61  KLTDTSDLSYRTRFLHQHPSDSFGLTGLLVLPQAFGAIYLGETFCSYISINNSSNFEVRD 120

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
           V+IKAEIQT++QRILLLDTSK+PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG+G
Sbjct: 121 VIIKAEIQTERQRILLLDTSKNPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGDG 180

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 239
           ERKYLPQFFKFIV+NPLSVRTKVRVVKE T+LEACIENHTK+NLYMDQVEFEP+Q+WSA 
Sbjct: 181 ERKYLPQFFKFIVANPLSVRTKVRVVKETTYLEACIENHTKTNLYMDQVEFEPAQHWSAK 240

Query: 240 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 299
           ++K D   S+ ++ +REIFKPPVLIRSGGGIHNYLYQL++ +HG++       SNVLGKL
Sbjct: 241 IIKDDEKQSEKDSLTREIFKPPVLIRSGGGIHNYLYQLRLSAHGAAQ------SNVLGKL 294

Query: 300 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 359
           QITWRTNLGEPGRLQTQQILGT IT KEIEL + +VP+V+ +DKPF + LKLTN TDKE 
Sbjct: 295 QITWRTNLGEPGRLQTQQILGTPITRKEIELCIAKVPAVINLDKPFSVHLKLTNHTDKEL 354

Query: 360 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 419
           GPFE+WLSQ+ S EEK V INGL+ M L+ +EAFG+TDFHLNLIATKLGVQRITGITVFD
Sbjct: 355 GPFEVWLSQDGSVEEKAVTINGLQTMELSQLEAFGTTDFHLNLIATKLGVQRITGITVFD 414

Query: 420 KLEKITYDSLPDLE 433
           K EK TYD LPDLE
Sbjct: 415 KSEKKTYDPLPDLE 428




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225470348|ref|XP_002269604.1| PREDICTED: UPF0533 protein C5orf44 [Vitis vinifera] gi|296090651|emb|CBI41051.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356548745|ref|XP_003542760.1| PREDICTED: UPF0533 protein C5orf44 homolog [Glycine max] Back     alignment and taxonomy information
>gi|449457717|ref|XP_004146594.1| PREDICTED: UPF0533 protein C5orf44-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224079249|ref|XP_002305809.1| predicted protein [Populus trichocarpa] gi|222848773|gb|EEE86320.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356521339|ref|XP_003529314.1| PREDICTED: UPF0533 protein C5orf44-like [Glycine max] Back     alignment and taxonomy information
>gi|388496064|gb|AFK36098.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|358346667|ref|XP_003637387.1| hypothetical protein MTR_084s0010 [Medicago truncatula] gi|355503322|gb|AES84525.1| hypothetical protein MTR_084s0010 [Medicago truncatula] Back     alignment and taxonomy information
>gi|297824907|ref|XP_002880336.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp. lyrata] gi|297326175|gb|EFH56595.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18407493|ref|NP_566117.1| uncharacterized protein [Arabidopsis thaliana] gi|16226796|gb|AAL16264.1|AF428334_1 At2g47960/T9J23.10 [Arabidopsis thaliana] gi|18377797|gb|AAL67048.1| unknown protein [Arabidopsis thaliana] gi|20197311|gb|AAC63650.2| expressed protein [Arabidopsis thaliana] gi|20197565|gb|AAM15133.1| expressed protein [Arabidopsis thaliana] gi|21281259|gb|AAM45021.1| unknown protein [Arabidopsis thaliana] gi|330255823|gb|AEC10917.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query433
TAIR|locus:2043433442 AT2G47960 "AT2G47960" [Arabido 0.990 0.970 0.661 9e-148
ZFIN|ZDB-GENE-030131-9775412 trappc13 "trafficking protein 0.775 0.815 0.338 1.1e-52
MGI|MGI:1914225417 Trappc13 "trafficking protein 0.778 0.808 0.325 1.4e-48
DICTYBASE|DDB_G0269062511 DDB_G0269062 "DUF974 family pr 0.542 0.459 0.348 1.2e-47
FB|FBgn0032204438 CG4953 [Drosophila melanogaste 0.785 0.776 0.316 2.3e-44
UNIPROTKB|G4NC96339 MGG_01105 "Uncharacterized pro 0.219 0.280 0.324 0.00048
TAIR|locus:2043433 AT2G47960 "AT2G47960" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1443 (513.0 bits), Expect = 9.0e-148, P = 9.0e-148
 Identities = 287/434 (66%), Positives = 338/434 (77%)

Query:     2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
             + T G HSLAFRVMRLC+PS HV+PPLR+DP DL  GED  DDP +AS     +SS    
Sbjct:     6 TQTHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADAV 65

Query:    62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
             +  SDL+YR+RFLL+   D IGLSGLL+LPQ+FGAIYLGETFCSYIS+NNSST EVRDV 
Sbjct:    66 D--SDLSYRNRFLLNHPTDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDVT 123

Query:   122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
             IKAEIQT++QRILLLDTSKSPVESIR GGRYDFIVEHDVKELGAHTLVC+ALY+D +GER
Sbjct:   124 IKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEHDVKELGAHTLVCSALYNDADGER 183

Query:   182 KYLPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 241
             KYLPQFFKF+V+NPLSVRTKVRVVKE TFLEACIENHTK+NL+MDQV+FEP++ WSA  L
Sbjct:   184 KYLPQFFKFVVANPLSVRTKVRVVKETTFLEACIENHTKANLFMDQVDFEPAKQWSAVRL 243

Query:   242 KADGPHSD--YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 299
             + +    D   +  S  I KPPV+IRSGGGIHNYLY+L   S   S   K QGSN+LGK 
Sbjct:   244 QNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNP-SADVSGQTKFQGSNILGKF 302

Query:   300 QITWRTNLGEPGRXXXXXXXXXXXXSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 359
             QITWRTNLGEPGR             KEI + VVEVP+V+ +++PF   L LTNQTD++ 
Sbjct:   303 QITWRTNLGEPGRLQTQQILGAPVSRKEINMRVVEVPAVIHLNRPFRAYLNLTNQTDRQL 362

Query:   360 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 419
             GPFE+ LSQ+++  EK V INGL+ + L  +EAFGS DF LNLIA+KLGVQ+I GIT  D
Sbjct:   363 GPFEVSLSQDETQLEKPVGINGLQTLMLPRIEAFGSNDFQLNLIASKLGVQKIAGITALD 422

Query:   420 KLEKITYDSLPDLE 433
               EK TY+ +PD+E
Sbjct:   423 TREKKTYELVPDME 436




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0006635 "fatty acid beta-oxidation" evidence=RCA
GO:0016558 "protein import into peroxisome matrix" evidence=RCA
ZFIN|ZDB-GENE-030131-9775 trappc13 "trafficking protein particle complex 13" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
MGI|MGI:1914225 Trappc13 "trafficking protein particle complex 13" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0269062 DDB_G0269062 "DUF974 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
FB|FBgn0032204 CG4953 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|G4NC96 MGG_01105 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q0VFT9CE044_XENTRNo assigned EC number0.32310.87990.9247yesno
Q95TN1U533_DROMENo assigned EC number0.30570.89600.8858yesno
Q5RCG0CE044_PONABNo assigned EC number0.32100.87060.9040yesno
Q3TIR1CE044_MOUSENo assigned EC number0.31930.87990.9136yesno
Q5M887CE044_RATNo assigned EC number0.31930.88220.9138yesno
A7MB76CE044_BOVINNo assigned EC number0.31400.87060.9040yesno
Q6PBY7CE044_DANRENo assigned EC number0.33020.87290.9174yesno
A5PLN9CE044_HUMANNo assigned EC number0.31870.87060.9040yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00010269001
SubName- Full=Chromosome undetermined scaffold_252, whole genome shotgun sequence; (440 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query433
pfam06159235 pfam06159, DUF974, Protein of unknown function (DU 1e-102
>gnl|CDD|218917 pfam06159, DUF974, Protein of unknown function (DUF974) Back     alignment and domain information
 Score =  302 bits (775), Expect = e-102
 Identities = 115/236 (48%), Positives = 151/236 (63%), Gaps = 7/236 (2%)

Query: 88  LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
           L LPQ+FG+IYLGETF SY+ +NN S+ EVRDV IKAE+QT  QR+ L D+  +PVE++R
Sbjct: 1   LTLPQSFGSIYLGETFSSYLCVNNESSKEVRDVSIKAELQTPSQRLNLSDSVDAPVETLR 60

Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKE 207
            G   DF+V  DVKE G H LVCT  Y++  GE +Y  +FFKFIV NPLSVRTK   +++
Sbjct: 61  PGESLDFVVSFDVKEEGTHILVCTVSYTEASGETRYFRKFFKFIVKNPLSVRTKFYQLED 120

Query: 208 I----TFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVL 263
           +     +LEA IEN T+ NL++++V  EPS  + AT L  +    D +     + K PVL
Sbjct: 121 LSRRRVYLEAQIENITEDNLFLEKVTLEPSPGYKATSLNWEPSLGDVDGLDGGMDKRPVL 180

Query: 264 IRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 319
               G I  YL+ LK    G+   +K+ G   LGKL I WRT +GE GRLQT Q+ 
Sbjct: 181 --KPGDIRQYLFCLKP-KEGALEELKLDGRTNLGKLDIVWRTAMGEKGRLQTSQLQ 233


Family of uncharacterized eukaryotic proteins. Length = 235

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 433
KOG2625348 consensus Uncharacterized conserved protein [Funct 100.0
PF06159249 DUF974: Protein of unknown function (DUF974); Inte 100.0
PF07919554 Gryzun: Gryzun, putative trafficking through Golgi 99.91
KOG4386809 consensus Uncharacterized conserved protein [Funct 99.47
PF12735306 Trs65: TRAPP trafficking subunit Trs65; InterPro: 98.88
PF086261185 TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC 98.32
PF1274257 Gryzun-like: Gryzun, putative Golgi trafficking 97.75
PF12584147 TRAPPC10: Trafficking protein particle complex sub 97.68
PF07705101 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic 96.87
PF00927107 Transglut_C: Transglutaminase family, C-terminal i 96.7
PF07919 554 Gryzun: Gryzun, putative trafficking through Golgi 96.11
PF1063378 NPCBM_assoc: NPCBM-associated, NEW3 domain of alph 95.14
PF14874102 PapD-like: Flagellar-associated PapD-like 94.48
PF07705101 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic 93.93
PF05753181 TRAP_beta: Translocon-associated protein beta (TRA 93.27
PF1063378 NPCBM_assoc: NPCBM-associated, NEW3 domain of alph 90.28
PF05753181 TRAP_beta: Translocon-associated protein beta (TRA 87.95
PF11797140 DUF3324: Protein of unknown function C-terminal (D 87.41
PF0020792 A2M: Alpha-2-macroglobulin family; InterPro: IPR00 86.0
smart00809104 Alpha_adaptinC2 Adaptin C-terminal domain. Adaptin 85.99
PF02883115 Alpha_adaptinC2: Adaptin C-terminal domain; InterP 85.52
PF14874102 PapD-like: Flagellar-associated PapD-like 85.49
PF13584 484 BatD: Oxygen tolerance 82.3
PF00927107 Transglut_C: Transglutaminase family, C-terminal i 81.39
>KOG2625 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=6.2e-94  Score=661.34  Aligned_cols=332  Identities=31%  Similarity=0.556  Sum_probs=308.0

Q ss_pred             cccccccccceeecceeeEEEEEEcCCCCceeeEEEEEEEeCCCceeeccCCCCCCccccCCCCeeeEEEEEEcccccce
Q 013971           87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAH  166 (433)
Q Consensus        87 ~L~LP~sfG~iylGEtF~~~i~v~N~s~~~v~~V~ikvelqT~s~r~~L~~~~~~~~~~L~pg~~ld~iv~~~lke~G~h  166 (433)
                      +|.+||.||+|||||||++||+++|+|++.|++|.+|+||||.+||+.|... ...+.+++|.++.+.||+||+||+|+|
T Consensus         1 ~l~~pq~f~niflgetfs~yinv~nds~k~v~~i~lk~dlqtssqrl~l~~s-~~~~aei~~~~c~~~vi~hevkeig~h   79 (348)
T KOG2625|consen    1 MLIAPQMFENIFLGETFSFYINVHNDSEKTVKDILLKADLQTSSQRLNLPAS-NAAAAEIEPDCCEDDVIHHEVKEIGQH   79 (348)
T ss_pred             CccchhhhcceeeccceEEEEEEecchhhhhhhheeeecccccceeeccccc-hhhhhhcCccccchhhhhHHHHhhccE
Confidence            4789999999999999999999999999999999999999999999999643 244668999999999999999999999


Q ss_pred             EEEEEEEEEcCCCceeecCeeEEEEeecCeEEEEEEEEe-------CCeeEEEEEEEecCcccEEEEeEEeeecCCccee
Q 013971          167 TLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVV-------KEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT  239 (433)
Q Consensus       167 ~L~c~V~Y~~~~ge~~~frK~ykF~v~~Pl~VrTK~~~~-------~~~~~LEaqiqN~s~~pl~le~v~lep~~~~~~~  239 (433)
                      +|+|+|+|++.+||.++|||||||+|.+|++||||||++       .+++||||||||+|..+|+||+|+|+|+.+|.++
T Consensus        80 ilicavny~tq~ge~myfrkffkf~v~kpidvktkfynaesdlssv~~dvfleaqien~s~a~mflekv~ldps~~ynvt  159 (348)
T KOG2625|consen   80 ILICAVNYKTQAGEKMYFRKFFKFPVLKPIDVKTKFYNAESDLSSVNDDVFLEAQIENMSNANMFLEKVELDPSIHYNVT  159 (348)
T ss_pred             EEEEEEeeeccCccchhHHhhccccccccccccceeecccccccccchhhhhhhhhhcccccchhhhhhccCchheecce
Confidence            999999999999999999999999999999999999986       5789999999999999999999999999999999


Q ss_pred             eecCCCCCCCCCcccccccCCceEEeCCCCeeeEEEEEeecCCCCCCCccccCceeeEEEEEEEEcCCCCCeeeeEEeee
Q 013971          240 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL  319 (433)
Q Consensus       240 ~ln~~~~~~~~~~~~~~~~~~~~l~l~~gd~~q~lf~L~p~~~~~~~~~~~~g~~~lG~L~I~WRs~~Ge~G~L~Ts~l~  319 (433)
                      +++.+.+.++..++    |.. ..+++|.|+|||||||+||.+..++..-.++.+.+|||||.||++|||+||||||+|+
T Consensus       160 ~i~~~~e~gdcvst----fg~-~~~lkp~d~rq~l~cl~pk~d~~~~~gi~k~lt~igkldi~wktnlgekgrlqts~lq  234 (348)
T KOG2625|consen  160 EIAHEDEAGDCVST----FGS-GALLKPKDIRQFLFCLKPKADFAEKAGIIKDLTSIGKLDISWKTNLGEKGRLQTSALQ  234 (348)
T ss_pred             eecchhhccccccc----ccc-ccccCccchhhheeecCchHHHHHhhccccccceeeeeEEEeeccccccccchHHHHH
Confidence            99888777665443    332 2346789999999999999877645555678899999999999999999999999999


Q ss_pred             eccCccCCeEEEEEecCceEEeCCcEEEEEEEEeCCCCccccEEEEEEeCCCCCceeEEEecccceeeeeeCCCCeEEEE
Q 013971          320 GTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH  399 (433)
Q Consensus       320 ~~~~~~~dl~l~v~~~P~~v~v~~PF~v~~~v~N~s~r~~~~l~v~l~~~~~~~~~~~~~~G~~~~~Lg~L~P~~s~~~~  399 (433)
                      |.+|+++|++|+++.+|+.|.+++||.++|+|+|+|+|.|+ |.+.|++..   ..-++|||+++++||+|.|.+...|.
T Consensus       235 riapgygdvrlsle~~p~~vdleepf~iscki~ncserald-l~l~l~~~n---nrhi~~c~~sg~qlgkl~ps~~l~~a  310 (348)
T KOG2625|consen  235 RIAPGYGDVRLSLEAIPACVDLEEPFEISCKITNCSERALD-LQLELCNPN---NRHIHFCGISGRQLGKLHPSQHLCFA  310 (348)
T ss_pred             hhcCCCCceEEEeeccccccccCCCeEEEEEEcccchhhhh-hhhhhcCCC---CceeEEeccccccccCCCCcceeeeE
Confidence            99999999999999999999999999999999999999999 999998864   35799999999999999999999999


Q ss_pred             EEEEecccceEEeCceEEEeCCCCeEEee
Q 013971          400 LNLIATKLGVQRITGITVFDKLEKITYDS  428 (433)
Q Consensus       400 L~l~pl~~Glq~isgI~l~D~~~~~~y~~  428 (433)
                      |+++|...|+|+|+||+|+|+++||+|||
T Consensus       311 l~l~~~~~giqsisgiritdtf~kr~ye~  339 (348)
T KOG2625|consen  311 LNLFPSTQGIQSISGIRITDTFLKRIYEH  339 (348)
T ss_pred             EeeccchhcceeecceEeehhhhhhhhcc
Confidence            99999999999999999999999999997



>PF06159 DUF974: Protein of unknown function (DUF974); InterPro: IPR010378 This is a family of uncharacterised eukaryotic proteins Back     alignment and domain information
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function Back     alignment and domain information
>KOG4386 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>PF12735 Trs65: TRAPP trafficking subunit Trs65; InterPro: IPR024662 This family is one of the subunits of the TRAPP Golgi trafficking complex [] Back     alignment and domain information
>PF08626 TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC9, TRAPP II complex subunit; InterPro: IPR013935 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway Back     alignment and domain information
>PF12742 Gryzun-like: Gryzun, putative Golgi trafficking Back     alignment and domain information
>PF12584 TRAPPC10: Trafficking protein particle complex subunit 10, TRAPPC10; InterPro: IPR022233 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway Back     alignment and domain information
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins Back     alignment and domain information
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds Back     alignment and domain information
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function Back     alignment and domain information
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known Back     alignment and domain information
>PF14874 PapD-like: Flagellar-associated PapD-like Back     alignment and domain information
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins Back     alignment and domain information
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins Back     alignment and domain information
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known Back     alignment and domain information
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins Back     alignment and domain information
>PF11797 DUF3324: Protein of unknown function C-terminal (DUF3324); InterPro: IPR021759 This family consists of several hypothetical bacterial proteins of unknown function Back     alignment and domain information
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins Back     alignment and domain information
>smart00809 Alpha_adaptinC2 Adaptin C-terminal domain Back     alignment and domain information
>PF02883 Alpha_adaptinC2: Adaptin C-terminal domain; InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment Back     alignment and domain information
>PF14874 PapD-like: Flagellar-associated PapD-like Back     alignment and domain information
>PF13584 BatD: Oxygen tolerance Back     alignment and domain information
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query433
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 7e-07
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 6e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 50.6 bits (120), Expect = 7e-07
 Identities = 72/392 (18%), Positives = 111/392 (28%), Gaps = 130/392 (33%)

Query: 115 LEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY 174
            + +DV        D  + +L  + +            D I+       G   L  T L 
Sbjct: 33  FDCKDV-------QDMPKSIL--SKEE----------IDHIIMSKDAVSGTLRLFWTLLS 73

Query: 175 SDGEGERKY----LPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEF 230
              E  +K+    L   +KF++S P+    +   +    ++E       +  LY D   F
Sbjct: 74  KQEEMVQKFVEEVLRINYKFLMS-PIKTEQRQPSMMTRMYIE------QRDRLYNDNQVF 126

Query: 231 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK---------MLS 281
                              YN  SR     P L      +   L +L+         +L 
Sbjct: 127 AK-----------------YNV-SRL---QPYL-----KLRQALLELRPAKNVLIDGVLG 160

Query: 282 HGSSSPVK--VQGSNVLGKL--QITWRTNLGEPGR----LQTQQILGTTIT--------- 324
            G +           V  K+  +I W  NL         L+  Q L   I          
Sbjct: 161 SGKTWVALDVCLSYKVQCKMDFKIFW-LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDH 219

Query: 325 SKEIELNVVEV-------------------------PSVVGIDKPFLLKLK--LT----N 353
           S  I+L +  +                                  F L  K  LT     
Sbjct: 220 SSNIKLRIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAW---NAFNLSCKILLTTRFKQ 276

Query: 354 QTDKEQGPFEIWLSQNDS------DEEKVVMIN--GLRIMALAPVEAFGSTDFHLNLIAT 405
            TD         +S +        DE K +++     R   L P E   +    L++IA 
Sbjct: 277 VTDFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDL-PREVLTTNPRRLSIIAE 335

Query: 406 KL--GVQRITGI--TVFDKLEKITYDSLPDLE 433
            +  G+           DKL  I   SL  LE
Sbjct: 336 SIRDGLATWDNWKHVNCDKLTTIIESSLNVLE 367


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query433
2xzz_A102 Protein-glutamine gamma-glutamyltransferase K; 2.3 96.79
3hrz_B252 Cobra venom factor; serine protease, glycosilated, 96.33
1vjj_A692 Protein-glutamine glutamyltransferase E; transglut 96.24
1ex0_A731 Coagulation factor XIII A chain; transglutaminase, 96.21
3idu_A127 Uncharacterized protein; all beta-protein, structu 95.27
1g0d_A695 Protein-glutamine gamma-glutamyltransferase; tissu 93.57
2q3z_A687 Transglutaminase 2; transglutaminase 2, tissue tra 92.82
3es6_B118 Prolactin-inducible protein; major histocompatibil 92.54
4acq_A 1451 Alpha-2-macroglobulin; hydrolase inhibitor, protei 91.29
4fxk_B 767 Complement C4-A alpha chain; immune system, proteo 91.02
3prx_B 1642 Cobra venom factor; immune system, complement, imm 91.02
2hr0_B 915 Complement C3 alpha' chain; complement component C 90.34
2qsv_A220 Uncharacterized protein; MCSG, structural genomics 89.99
2ys4_A122 Hydrocephalus-inducing protein homolog; hydin, PAP 88.89
2b39_A 1661 C3; thioester, immune defense, immune system; HET: 88.83
2pn5_A 1325 TEP1R, thioester-containing protein I; FULL-length 87.45
3qbt_B140 Inositol polyphosphate 5-phosphatase OCRL-1; prote 85.7
2xzz_A102 Protein-glutamine gamma-glutamyltransferase K; 2.3 85.25
2l0d_A114 Cell surface protein; structural genomics, northea 84.17
1vjj_A 692 Protein-glutamine glutamyltransferase E; transglut 83.21
1ex0_A 731 Coagulation factor XIII A chain; transglutaminase, 81.2
3cu7_A 1676 Complement C5; Mg domain, inflammation, anaphylato 81.16
>2xzz_A Protein-glutamine gamma-glutamyltransferase K; 2.30A {Homo sapiens} Back     alignment and structure
Probab=96.79  E-value=0.0029  Score=52.17  Aligned_cols=72  Identities=6%  Similarity=0.138  Sum_probs=60.9

Q ss_pred             ecCceEEeCCcEEEEEEEEeCCCCccccEEEEEEeCCCCCceeEEEecccceeeeeeCCCCeEEEEEEEEecccceEEeC
Q 013971          334 EVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT  413 (433)
Q Consensus       334 ~~P~~v~v~~PF~v~~~v~N~s~r~~~~l~v~l~~~~~~~~~~~~~~G~~~~~Lg~L~P~~s~~~~L~l~pl~~Glq~is  413 (433)
                      +++...++++|+.+++.++|--...+....+.++...      + ..+ ....++.+.|+++..+.+.+.|..+|-++|-
T Consensus        11 ~v~g~~~v~~~l~v~vsf~NPL~~~L~~c~~~vEG~G------L-~~~-~~~~~~~v~pg~~~~~~~~~~P~~~G~~~L~   82 (102)
T 2xzz_A           11 TLLGAAVVGQECEVQIVFKNPLPVTLTNVVFRLEGSG------L-QRP-KILNVGDIGGNETVTLRQSFVPVRPGPRQLI   82 (102)
T ss_dssp             EESSCCCSSSCEEEEEEEECCSSSCBCSEEEEEEETT------T-EEE-EEEEECCBCTTCEEEEEEEECCCSCSSCCCE
T ss_pred             EECCCcccCCeEEEEEEEECCCCCcccCEEEEEECCC------C-Ccc-eEEEcCcCCCCCEEEEEEEEecCcccceEEE
Confidence            5677778999999999999997777777888888764      3 344 5677899999999999999999999998874



>3hrz_B Cobra venom factor; serine protease, glycosilated, multi-domain, complement SYST convertase, complement alternate pathway; HET: NAG P6G; 2.20A {Naja kaouthia} PDB: 3frp_G* 3hs0_B* Back     alignment and structure
>1vjj_A Protein-glutamine glutamyltransferase E; transglutaminase 3, X-RAY crystallography, metalloenzyme, calcium ION; HET: GDP; 1.90A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1sgx_A* 1l9m_A 1l9n_A* 1nud_A 1nuf_A 1nug_A 1rle_A* Back     alignment and structure
>1ex0_A Coagulation factor XIII A chain; transglutaminase, blood coagulation, mutant, W279F, oxyanion, transferase; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1evu_A 1fie_A 1f13_A 1ggt_A 1ggu_A 1ggy_A 1qrk_A Back     alignment and structure
>3idu_A Uncharacterized protein; all beta-protein, structural genomics, PSI-2, protein structure initiative; 1.70A {Pyrococcus furiosus} PDB: 2kl6_A Back     alignment and structure
>1g0d_A Protein-glutamine gamma-glutamyltransferase; tissue transglutaminase,acyltransferase; 2.50A {Pagrus major} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 Back     alignment and structure
>2q3z_A Transglutaminase 2; transglutaminase 2, tissue transglutaminase, TG2, transferas; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1kv3_A 3ly6_A* Back     alignment and structure
>3es6_B Prolactin-inducible protein; major histocompatibility complex, protein-protein complex, P inducible protein, zinc 2-glycoprotein, ZAG-PIP complex; HET: NDG NAG BMA MAN P6G; 3.23A {Homo sapiens} SCOP: b.1.18.23 Back     alignment and structure
>4acq_A Alpha-2-macroglobulin; hydrolase inhibitor, proteinase inhibitor, irreversible PROT inhibitor, conformational change, blood plasma inhibitor; HET: MEQ NAG MAN; 4.30A {Homo sapiens} Back     alignment and structure
>4fxk_B Complement C4-A alpha chain; immune system, proteolytic cascade; HET: NAG BMA; 3.60A {Homo sapiens} PDB: 4fxg_B* Back     alignment and structure
>3prx_B Cobra venom factor; immune system, complement, immune SYS complex; HET: NAG; 4.30A {Naja kaouthia} PDB: 3pvm_B* Back     alignment and structure
>2hr0_B Complement C3 alpha' chain; complement component C3B, immune system; HET: THC; 2.26A {Homo sapiens} PDB: 2icf_B* 2wii_B* 2win_B* 3g6j_B 3l5n_B* 2a73_B* 2i07_B* 2xwj_B* 2xwb_B* 2a74_C* 2ice_C* 2qki_C* 3l3o_F* 3nms_C* 3nsa_C* 3ohx_C* 3t4a_C 2ice_B* 3l3o_B* 3nms_B* ... Back     alignment and structure
>2qsv_A Uncharacterized protein; MCSG, structural genomics, porphyromonas gingivalis W83, PSI protein structure initiative; 2.10A {Porphyromonas gingivalis} Back     alignment and structure
>2ys4_A Hydrocephalus-inducing protein homolog; hydin, PAPD-like, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>2b39_A C3; thioester, immune defense, immune system; HET: NAG BMA; 3.00A {Bos taurus} Back     alignment and structure
>2pn5_A TEP1R, thioester-containing protein I; FULL-length mature peptide, immune system; HET: NAG; 2.70A {Anopheles gambiae} Back     alignment and structure
>3qbt_B Inositol polyphosphate 5-phosphatase OCRL-1; protein transport, vesicular trafficking, GTPase, LOWE syndr immunoglobulin fold, RAB8A, endocytosis; HET: GNP; 2.00A {Homo sapiens} Back     alignment and structure
>2xzz_A Protein-glutamine gamma-glutamyltransferase K; 2.30A {Homo sapiens} Back     alignment and structure
>2l0d_A Cell surface protein; structural genomics, northeast structural genomics consortiu PSI-2, protein structure initiative; NMR {Methanosarcina acetivorans} Back     alignment and structure
>1vjj_A Protein-glutamine glutamyltransferase E; transglutaminase 3, X-RAY crystallography, metalloenzyme, calcium ION; HET: GDP; 1.90A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1sgx_A* 1l9m_A 1l9n_A* 1nud_A 1nuf_A 1nug_A 1rle_A* Back     alignment and structure
>1ex0_A Coagulation factor XIII A chain; transglutaminase, blood coagulation, mutant, W279F, oxyanion, transferase; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1evu_A 1fie_A 1f13_A 1ggt_A 1ggu_A 1ggy_A 1qrk_A Back     alignment and structure
>3cu7_A Complement C5; Mg domain, inflammation, anaphylatoxin, cleavage of basic residues, complement alternate pathway, glycoprotein, immune response; HET: NAG; 3.10A {Homo sapiens} PDB: 3kls_A* 3km9_A* 4e0s_A* 3prx_A* 3pvm_A* 4a5w_A* 1xwe_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query433
d1vjja399 Transglutaminase, two C-terminal domains {Human (H 97.01
d1ex0a3100 Transglutaminase, two C-terminal domains {Human (H 96.89
d1g0da3101 Transglutaminase, two C-terminal domains {Red sea 96.82
d2q3za398 Transglutaminase, two C-terminal domains {Human (H 96.53
d1vjja2115 Transglutaminase, two C-terminal domains {Human (H 96.49
d1ex0a2112 Transglutaminase, two C-terminal domains {Human (H 96.45
d2q3za2114 Transglutaminase, two C-terminal domains {Human (H 95.77
d1g0da2112 Transglutaminase, two C-terminal domains {Red sea 95.55
d2q3za398 Transglutaminase, two C-terminal domains {Human (H 95.07
d1vjja399 Transglutaminase, two C-terminal domains {Human (H 94.71
d3es6b1118 Prolactin-inducible protein, PIP {Human (Homo sapi 94.57
d1g0da3101 Transglutaminase, two C-terminal domains {Red sea 93.19
d1ex0a3100 Transglutaminase, two C-terminal domains {Human (H 90.88
>d1vjja3 b.1.5.1 (A:594-692) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]} Back     information, alignment and structure
class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: Transglutaminase, two C-terminal domains
family: Transglutaminase, two C-terminal domains
domain: Transglutaminase, two C-terminal domains
species: Human (Homo sapiens), TGase E3 [TaxId: 9606]
Probab=97.01  E-value=0.00072  Score=53.60  Aligned_cols=75  Identities=11%  Similarity=0.257  Sum_probs=61.6

Q ss_pred             EecCceEEeCCcEEEEEEEEeCCCCccccEEEEEEeCCCCCceeEEEecccceeeeeeCCCCeEEEEEEEEecccceEEe
Q 013971          333 VEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI  412 (433)
Q Consensus       333 ~~~P~~v~v~~PF~v~~~v~N~s~r~~~~l~v~l~~~~~~~~~~~~~~G~~~~~Lg~L~P~~s~~~~L~l~pl~~Glq~i  412 (433)
                      +++|....+++|+.+++.++|--+..+.+-.+.++...       ++.+.+...++.+.|+++.++.+.+.|..+|.++|
T Consensus         5 I~v~~~~~v~~~~~v~vsf~NPL~~~L~~c~f~vEG~G-------L~~~~~~~~~~~v~p~~~~~~~~~~~P~~~G~~~l   77 (99)
T d1vjja3           5 LEVLNEARVRKPVNVQMLFSNPLDEPVRDCVLMVEGSG-------LLLGNLKIDVPTLGPKERSRVRFDILPSRSGTKQL   77 (99)
T ss_dssp             EEECSCCBTTSCEEEEEEEECCSSSCBCSEEEEEECTT-------TSSSCEEEEECCBCTTCEEEEEEEECCCSCEEEEE
T ss_pred             EEeCCCcCcCCeEEEEEEEECCCCCchhCEEEEEEeCC-------CCCccEEEecCccCCCCEEEEEEEEEcCCcccEEE
Confidence            36788889999999999999997777776888887653       12334455788899999999999999999999997


Q ss_pred             Cc
Q 013971          413 TG  414 (433)
Q Consensus       413 sg  414 (433)
                      -.
T Consensus        78 ~a   79 (99)
T d1vjja3          78 LA   79 (99)
T ss_dssp             EE
T ss_pred             EE
Confidence            54



>d1ex0a3 b.1.5.1 (A:628-727) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1g0da3 b.1.5.1 (A:584-684) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]} Back     information, alignment and structure
>d2q3za3 b.1.5.1 (A:586-683) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1vjja2 b.1.5.1 (A:479-593) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]} Back     information, alignment and structure
>d1ex0a2 b.1.5.1 (A:516-627) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]} Back     information, alignment and structure
>d2q3za2 b.1.5.1 (A:472-585) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1g0da2 b.1.5.1 (A:472-583) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]} Back     information, alignment and structure
>d2q3za3 b.1.5.1 (A:586-683) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1vjja3 b.1.5.1 (A:594-692) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]} Back     information, alignment and structure
>d3es6b1 b.1.18.23 (B:1-118) Prolactin-inducible protein, PIP {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1g0da3 b.1.5.1 (A:584-684) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]} Back     information, alignment and structure
>d1ex0a3 b.1.5.1 (A:628-727) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]} Back     information, alignment and structure