Citrus Sinensis ID: 013275


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440------
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD
cccccccccEEEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEEEccccccEEEEEEEEEEEcccEEEEEcccccccccccccccEEEEEEEEEEEEEccEEEEEEEEEEcccccEEEEEEEEEEEEEccEEEEEEEEEEcccccccccEEEEEEEEEccccccEEEEEEEEEEcccccEEEcccccccccccccccccccccEEEEccccEEEEEEEEEEccccccccccccccEEcEEEEEEEEcccccccEEEEEEEEEEccccccEEEEEEEcccEEEEcccEEEEEEEEEccccccccEEEEEEccccccccEEEEEcccEEEEcccccccEEEEEEEEEEccccEEEEccEEEEEccccEEEcccccEEEEEccc
ccccccccEEEEEEEEccccccccccccccccccccccccccccccccccccHHccccccccccccccccccccccccccccccccccccccccccEEEEEEEEEEEEEEcccccEEEEEEEEEEEEccccEEEcccccccccccccccccccEEEEEEEHHcccEEEEEEEEEEcccccEEEEEEEEEEEEccccEEEEEEEcccccccccccEEEEEEEEEEcccccEEEEEEEEccccccEEEEcccccccccccccccccccccccEEccccEEEEEEEEEcccccccccccccccEEEEEEEEEEEccccccccEEcccccccccccccEEEEEEEcccEEEEEEcEEEEEEEEEccccccccEEEEEEccccccEEEEEEcccEccccccccccccEEEEEEEEEccccEEEEcEEEEEEccccEEEEccccEEEEEccc
msstpgthsLAFRVMRlcrpslhvepplrvdptdlfigedifddpiaasnlpplissdvttnkssdltyRSRFLLhdsadsiglsgllvlpqafgaiylgeTFCSYISinnsstlevRDVVIKAEIQTDKQRILLLdtskspvesiraggrydfIVEHDVKELGAHTLVCTAlysdgegerkyLPQFFKFIvsnplsvrtKVRVVKVGATHFQEITFLEACIENHtksnlymdqvefepsqnwsatmlkadgphsdynaqsreifkppvlirsgggiHNYLYQLKmlshgssspvkvqgsnvlgklqitwrtnlgepgrlqtQQILGTTITSKEIElnvvevpsvvgidkpfllklkltnqtdkeqgpfeiwlsqndsdeekVVMINGLRIMalapveafgstdFHLNLIATKLGVQRITGITVFDKLekitydslpdleifvdqd
msstpgthslAFRVMRLCrpslhvepplrvDPTDLFIGEDIFDDPIAasnlpplissdvttnksSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAeiqtdkqrillldtskspvesiRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIvsnplsvrTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWrtnlgepgrlqtQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLtnqtdkeqgpfeiwlsqndsdEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKlekitydslpdleifvdqd
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRlqtqqilgttitSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD
*********LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS*********DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF*************************EIFKPPVLIRSGGGIHNYLYQLKMLSHG****VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN*********EIW**********VVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV***
*****GT**LAFRVMRLCRPSLHVEPPLRVDPTDLFIG******************************************SIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD********************GRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKV************ITFLEACIENHTKSNLYMDQVEFEPSQNWSATM********************PVLIRSGGGIHNYLYQLKML************SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH********QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD
*****GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGE**************************************SADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA***********REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQ*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query446 2.2.26 [Sep-21-2011]
Q5RCG0417 UPF0533 protein C5orf44 h yes no 0.860 0.920 0.318 1e-52
A5PLN9417 UPF0533 protein C5orf44 O yes no 0.860 0.920 0.316 1e-52
Q6PBY7412 UPF0533 protein C5orf44 h yes no 0.849 0.919 0.325 3e-52
A7MB76417 UPF0533 protein C5orf44 h yes no 0.860 0.920 0.311 2e-51
Q3TIR1417 UPF0533 protein C5orf44 h yes no 0.869 0.930 0.317 4e-51
Q5M887418 UPF0533 protein C5orf44 h yes no 0.872 0.930 0.317 6e-51
Q0VFT9412 UPF0533 protein C5orf44 h yes no 0.856 0.927 0.318 3e-49
Q6GPR5414 UPF0533 protein C5orf44 h N/A no 0.860 0.927 0.316 1e-47
A8WX89401 UPF0533 protein CBG04321 N/A no 0.872 0.970 0.306 2e-47
Q95QQ2401 UPF0533 protein C56C10.7 yes no 0.874 0.972 0.298 3e-47
>sp|Q5RCG0|CE044_PONAB UPF0533 protein C5orf44 homolog OS=Pongo abelii PE=2 SV=1 Back     alignment and function desciption
 Score =  207 bits (528), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 138/433 (31%), Positives = 217/433 (50%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L   +    S     SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVTELNSVSQAGESVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399





Pongo abelii (taxid: 9601)
>sp|A5PLN9|CE044_HUMAN UPF0533 protein C5orf44 OS=Homo sapiens GN=C5orf44 PE=2 SV=2 Back     alignment and function description
>sp|Q6PBY7|CE044_DANRE UPF0533 protein C5orf44 homolog OS=Danio rerio GN=zgc:73187 PE=2 SV=2 Back     alignment and function description
>sp|A7MB76|CE044_BOVIN UPF0533 protein C5orf44 homolog OS=Bos taurus PE=2 SV=1 Back     alignment and function description
>sp|Q3TIR1|CE044_MOUSE UPF0533 protein C5orf44 homolog OS=Mus musculus PE=2 SV=1 Back     alignment and function description
>sp|Q5M887|CE044_RAT UPF0533 protein C5orf44 homolog OS=Rattus norvegicus PE=2 SV=2 Back     alignment and function description
>sp|Q0VFT9|CE044_XENTR UPF0533 protein C5orf44 homolog OS=Xenopus tropicalis PE=2 SV=1 Back     alignment and function description
>sp|Q6GPR5|CE044_XENLA UPF0533 protein C5orf44 homolog OS=Xenopus laevis PE=2 SV=2 Back     alignment and function description
>sp|A8WX89|U533_CAEBR UPF0533 protein CBG04321 OS=Caenorhabditis briggsae GN=CBG04321 PE=3 SV=2 Back     alignment and function description
>sp|Q95QQ2|U533_CAEEL UPF0533 protein C56C10.7 OS=Caenorhabditis elegans GN=C56C10.7 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query446
255556003434 expressed protein, putative [Ricinus com 0.970 0.997 0.760 0.0
225470348438 PREDICTED: UPF0533 protein C5orf44 [Viti 0.977 0.995 0.763 0.0
356548745440 PREDICTED: UPF0533 protein C5orf44 homol 0.961 0.975 0.747 0.0
449457717440 PREDICTED: UPF0533 protein C5orf44-like 0.979 0.993 0.712 0.0
224079249450 predicted protein [Populus trichocarpa] 0.964 0.955 0.733 0.0
356521339435 PREDICTED: UPF0533 protein C5orf44-like 0.952 0.977 0.731 0.0
388496064437 unknown [Medicago truncatula] 0.950 0.970 0.715 1e-180
358346667446 hypothetical protein MTR_084s0010 [Medic 0.950 0.950 0.700 1e-178
18407493442 uncharacterized protein [Arabidopsis tha 0.975 0.984 0.668 1e-168
297824907443 hypothetical protein ARALYDRAFT_483987 [ 0.970 0.977 0.667 1e-167
>gi|255556003|ref|XP_002519036.1| expressed protein, putative [Ricinus communis] gi|223541699|gb|EEF43247.1| expressed protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/447 (76%), Positives = 384/447 (85%), Gaps = 14/447 (3%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+TPGTHSLAFRVMRLCRPS HV+  L VDP+DL +GEDIFDDP+AAS LPPLI S +T
Sbjct: 1   MSTTPGTHSLAFRVMRLCRPSFHVDAQLLVDPSDLIVGEDIFDDPVAASRLPPLIDSHIT 60

Query: 61  T-NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
               +SDL+YR+RFL    +DS GL+GLLVLPQAFGAIYLGETFCSYISINNSS  EVRD
Sbjct: 61  KLTDTSDLSYRTRFLHQHPSDSFGLTGLLVLPQAFGAIYLGETFCSYISINNSSNFEVRD 120

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
           V+IKAEIQT++QRILLLDTSK+PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG+G
Sbjct: 121 VIIKAEIQTERQRILLLDTSKNPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGDG 180

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           ERKYLPQFFKFIV+NPLSVRTKVRVVK       E T+LEACIENHTK+NLYMDQVEFEP
Sbjct: 181 ERKYLPQFFKFIVANPLSVRTKVRVVK-------ETTYLEACIENHTKTNLYMDQVEFEP 233

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           +Q+WSA ++K D   S+ ++ +REIFKPPVLIRSGGGIHNYLYQL++ +HG++       
Sbjct: 234 AQHWSAKIIKDDEKQSEKDSLTREIFKPPVLIRSGGGIHNYLYQLRLSAHGAAQ------ 287

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
           SNVLGKLQITWRTNLGEPGRLQTQQILGT IT KEIEL + +VP+V+ +DKPF + LKLT
Sbjct: 288 SNVLGKLQITWRTNLGEPGRLQTQQILGTPITRKEIELCIAKVPAVINLDKPFSVHLKLT 347

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N TDKE GPFE+WLSQ+ S EEK V INGL+ M L+ +EAFG+TDFHLNLIATKLGVQRI
Sbjct: 348 NHTDKELGPFEVWLSQDGSVEEKAVTINGLQTMELSQLEAFGTTDFHLNLIATKLGVQRI 407

Query: 420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
           TGITVFDK EK TYD LPDLEIFV  D
Sbjct: 408 TGITVFDKSEKKTYDPLPDLEIFVAID 434




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225470348|ref|XP_002269604.1| PREDICTED: UPF0533 protein C5orf44 [Vitis vinifera] gi|296090651|emb|CBI41051.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356548745|ref|XP_003542760.1| PREDICTED: UPF0533 protein C5orf44 homolog [Glycine max] Back     alignment and taxonomy information
>gi|449457717|ref|XP_004146594.1| PREDICTED: UPF0533 protein C5orf44-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224079249|ref|XP_002305809.1| predicted protein [Populus trichocarpa] gi|222848773|gb|EEE86320.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356521339|ref|XP_003529314.1| PREDICTED: UPF0533 protein C5orf44-like [Glycine max] Back     alignment and taxonomy information
>gi|388496064|gb|AFK36098.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|358346667|ref|XP_003637387.1| hypothetical protein MTR_084s0010 [Medicago truncatula] gi|355503322|gb|AES84525.1| hypothetical protein MTR_084s0010 [Medicago truncatula] Back     alignment and taxonomy information
>gi|18407493|ref|NP_566117.1| uncharacterized protein [Arabidopsis thaliana] gi|16226796|gb|AAL16264.1|AF428334_1 At2g47960/T9J23.10 [Arabidopsis thaliana] gi|18377797|gb|AAL67048.1| unknown protein [Arabidopsis thaliana] gi|20197311|gb|AAC63650.2| expressed protein [Arabidopsis thaliana] gi|20197565|gb|AAM15133.1| expressed protein [Arabidopsis thaliana] gi|21281259|gb|AAM45021.1| unknown protein [Arabidopsis thaliana] gi|330255823|gb|AEC10917.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297824907|ref|XP_002880336.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp. lyrata] gi|297326175|gb|EFH56595.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query446
TAIR|locus:2043433442 AT2G47960 "AT2G47960" [Arabido 0.975 0.984 0.651 9e-148
MGI|MGI:1914225417 Trappc13 "trafficking protein 0.771 0.824 0.322 1.6e-49
ZFIN|ZDB-GENE-030131-9775412 trappc13 "trafficking protein 0.612 0.662 0.364 2.3e-48
DICTYBASE|DDB_G0269062511 DDB_G0269062 "DUF974 family pr 0.526 0.459 0.338 9.7e-46
FB|FBgn0032204438 CG4953 [Drosophila melanogaste 0.605 0.616 0.337 2.2e-40
UNIPROTKB|G4NC96339 MGG_01105 "Uncharacterized pro 0.213 0.280 0.324 0.00052
TAIR|locus:2043433 AT2G47960 "AT2G47960" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1443 (513.0 bits), Expect = 9.0e-148, P = 9.0e-148
 Identities = 291/447 (65%), Positives = 343/447 (76%)

Query:     2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
             + T G HSLAFRVMRLC+PS HV+PPLR+DP DL  GED  DDP +AS     +SS    
Sbjct:     6 TQTHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADAV 65

Query:    62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
             +  SDL+YR+RFLL+   D IGLSGLL+LPQ+FGAIYLGETFCSYIS+NNSST EVRDV 
Sbjct:    66 D--SDLSYRNRFLLNHPTDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDVT 123

Query:   122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
             IKAEIQT++QRILLLDTSKSPVESIR GGRYDFIVEHDVKELGAHTLVC+ALY+D +GER
Sbjct:   124 IKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEHDVKELGAHTLVCSALYNDADGER 183

Query:   182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
             KYLPQFFKF+V+NPLSVRTKVRVVK       E TFLEACIENHTK+NL+MDQV+FEP++
Sbjct:   184 KYLPQFFKFVVANPLSVRTKVRVVK-------ETTFLEACIENHTKANLFMDQVDFEPAK 236

Query:   242 NWSATMLKADGPHSD--YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
              WSA  L+ +    D   +  S  I KPPV+IRSGGGIHNYLY+L   S   S   K QG
Sbjct:   237 QWSAVRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNP-SADVSGQTKFQG 295

Query:   300 SNVLGKLQITWRTNLGEPGRXXXXXXXXXXXXSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
             SN+LGK QITWRTNLGEPGR             KEI + VVEVP+V+ +++PF   L LT
Sbjct:   296 SNILGKFQITWRTNLGEPGRLQTQQILGAPVSRKEINMRVVEVPAVIHLNRPFRAYLNLT 355

Query:   360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
             NQTD++ GPFE+ LSQ+++  EK V INGL+ + L  +EAFGS DF LNLIA+KLGVQ+I
Sbjct:   356 NQTDRQLGPFEVSLSQDETQLEKPVGINGLQTLMLPRIEAFGSNDFQLNLIASKLGVQKI 415

Query:   420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
              GIT  D  EK TY+ +PD+EIFV+ D
Sbjct:   416 AGITALDTREKKTYELVPDMEIFVETD 442




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0006635 "fatty acid beta-oxidation" evidence=RCA
GO:0016558 "protein import into peroxisome matrix" evidence=RCA
MGI|MGI:1914225 Trappc13 "trafficking protein particle complex 13" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-030131-9775 trappc13 "trafficking protein particle complex 13" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0269062 DDB_G0269062 "DUF974 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
FB|FBgn0032204 CG4953 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|G4NC96 MGG_01105 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q0VFT9CE044_XENTRNo assigned EC number0.31860.85650.9271yesno
Q95TN1U533_DROMENo assigned EC number0.30220.87440.8904yesno
Q5RCG0CE044_PONABNo assigned EC number0.31870.86090.9208yesno
Q3TIR1CE044_MOUSENo assigned EC number0.31700.86990.9304yesno
Q5M887CE044_RATNo assigned EC number0.31700.87210.9306yesno
A7MB76CE044_BOVINNo assigned EC number0.31170.86090.9208yesno
Q6PBY7CE044_DANRENo assigned EC number0.32560.84970.9199yesno
A5PLN9CE044_HUMANNo assigned EC number0.31630.86090.9208yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.IV.3206.1
hypothetical protein (423 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query446
pfam06159235 pfam06159, DUF974, Protein of unknown function (DU 1e-101
>gnl|CDD|218917 pfam06159, DUF974, Protein of unknown function (DUF974) Back     alignment and domain information
 Score =  300 bits (771), Expect = e-101
 Identities = 115/239 (48%), Positives = 150/239 (62%), Gaps = 6/239 (2%)

Query: 88  LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
           L LPQ+FG+IYLGETF SY+ +NN S+ EVRDV IKAE+QT  QR+ L D+  +PVE++R
Sbjct: 1   LTLPQSFGSIYLGETFSSYLCVNNESSKEVRDVSIKAELQTPSQRLNLSDSVDAPVETLR 60

Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
            G   DF+V  DVKE G H LVCT  Y++  GE +Y  +FFKFIV NPLSVRTK   ++ 
Sbjct: 61  PGESLDFVVSFDVKEEGTHILVCTVSYTEASGETRYFRKFFKFIVKNPLSVRTKFYQLED 120

Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
            +       +LEA IEN T+ NL++++V  EPS  + AT L  +    D +     + K 
Sbjct: 121 LSR---RRVYLEAQIENITEDNLFLEKVTLEPSPGYKATSLNWEPSLGDVDGLDGGMDKR 177

Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
           PVL    G I  YL+ LK    G+   +K+ G   LGKL I WRT +GE GRLQT Q+ 
Sbjct: 178 PVL--KPGDIRQYLFCLKP-KEGALEELKLDGRTNLGKLDIVWRTAMGEKGRLQTSQLQ 233


Family of uncharacterized eukaryotic proteins. Length = 235

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 446
KOG2625348 consensus Uncharacterized conserved protein [Funct 100.0
PF06159249 DUF974: Protein of unknown function (DUF974); Inte 100.0
PF07919554 Gryzun: Gryzun, putative trafficking through Golgi 99.91
KOG4386809 consensus Uncharacterized conserved protein [Funct 99.78
PF12735306 Trs65: TRAPP trafficking subunit Trs65; InterPro: 98.82
PF086261185 TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC 98.34
PF1274257 Gryzun-like: Gryzun, putative Golgi trafficking 97.72
PF12584147 TRAPPC10: Trafficking protein particle complex sub 97.58
PF07705101 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic 96.59
PF00927107 Transglut_C: Transglutaminase family, C-terminal i 96.53
PF1063378 NPCBM_assoc: NPCBM-associated, NEW3 domain of alph 95.32
PF07919 554 Gryzun: Gryzun, putative trafficking through Golgi 95.03
PF14874102 PapD-like: Flagellar-associated PapD-like 94.92
PF05753181 TRAP_beta: Translocon-associated protein beta (TRA 93.84
PF07705101 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic 93.38
PF1063378 NPCBM_assoc: NPCBM-associated, NEW3 domain of alph 90.47
PF05753181 TRAP_beta: Translocon-associated protein beta (TRA 90.1
smart00809104 Alpha_adaptinC2 Adaptin C-terminal domain. Adaptin 86.52
PF11797140 DUF3324: Protein of unknown function C-terminal (D 86.22
PF02883115 Alpha_adaptinC2: Adaptin C-terminal domain; InterP 84.7
PF14874102 PapD-like: Flagellar-associated PapD-like 84.38
PF0020792 A2M: Alpha-2-macroglobulin family; InterPro: IPR00 81.76
PF00927107 Transglut_C: Transglutaminase family, C-terminal i 81.21
>KOG2625 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=1.3e-97  Score=687.14  Aligned_cols=348  Identities=30%  Similarity=0.533  Sum_probs=322.2

Q ss_pred             cccccccccceeecceEEEEEEEEcCCCcceEEEEEEEEEeCCCceEeccCCCCCCccccCCCCeeeEEEEEEccccCce
Q 013275           87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAH  166 (446)
Q Consensus        87 ~L~LP~sfG~iylGEtFs~~i~v~N~s~~~v~~V~ikaelqT~s~r~~L~~~~~~~~~~L~pg~~ld~iv~~~lke~G~h  166 (446)
                      +|.+||.||+|||||||+.||+|||+|++.|++|.+||||||.+||+.|... .....+++|.++.+.+|+||+||+|+|
T Consensus         1 ~l~~pq~f~niflgetfs~yinv~nds~k~v~~i~lk~dlqtssqrl~l~~s-~~~~aei~~~~c~~~vi~hevkeig~h   79 (348)
T KOG2625|consen    1 MLIAPQMFENIFLGETFSFYINVHNDSEKTVKDILLKADLQTSSQRLNLPAS-NAAAAEIEPDCCEDDVIHHEVKEIGQH   79 (348)
T ss_pred             CccchhhhcceeeccceEEEEEEecchhhhhhhheeeecccccceeeccccc-hhhhhhcCccccchhhhhHHHHhhccE
Confidence            4789999999999999999999999999999999999999999999999653 344678999999999999999999999


Q ss_pred             EEEEEEEEEcCCCceeeeceEEEEEeecCeEEEEEeEEccccccccCCeeEEEEEEEecccccEEEEeEEeeecCCceee
Q 013275          167 TLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT  246 (446)
Q Consensus       167 ~L~c~VsY~~~~Ge~~~frK~fkF~v~~Pl~VrtK~~~~~~~~~~~~~~~~LEaqiqN~s~~~l~le~v~Lep~~~~~~~  246 (446)
                      +|+|+|+|++++||.++|||||||+|.+|++||||||+++..-...++++||||||||+|..+|+||+|+|+|+.+|.++
T Consensus        80 ilicavny~tq~ge~myfrkffkf~v~kpidvktkfynaesdlssv~~dvfleaqien~s~a~mflekv~ldps~~ynvt  159 (348)
T KOG2625|consen   80 ILICAVNYKTQAGEKMYFRKFFKFPVLKPIDVKTKFYNAESDLSSVNDDVFLEAQIENMSNANMFLEKVELDPSIHYNVT  159 (348)
T ss_pred             EEEEEEeeeccCccchhHHhhccccccccccccceeecccccccccchhhhhhhhhhcccccchhhhhhccCchheecce
Confidence            99999999999999999999999999999999999999964444557899999999999999999999999999999999


Q ss_pred             eecCCCCCCCCCcccccccCCceEEeCCCCeeeEEEEEeecCCCCCCCccccCceeeEEEEEEEEcCCCCCceeeEEeee
Q 013275          247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL  326 (446)
Q Consensus       247 ~ln~~~~~~~~~~~~~~~~~~~~l~~~~gd~~q~lf~l~~~~~~~~~~~~~~g~~~lGkL~I~WRs~~Ge~G~L~Ts~l~  326 (446)
                      +++.+.+.++.-++    |.... +++|.|+|||||||+|+.+..++.+-.++.+.+|||||.||++|||+|||||++||
T Consensus       160 ~i~~~~e~gdcvst----fg~~~-~lkp~d~rq~l~cl~pk~d~~~~~gi~k~lt~igkldi~wktnlgekgrlqts~lq  234 (348)
T KOG2625|consen  160 EIAHEDEAGDCVST----FGSGA-LLKPKDIRQFLFCLKPKADFAEKAGIIKDLTSIGKLDISWKTNLGEKGRLQTSALQ  234 (348)
T ss_pred             eecchhhccccccc----ccccc-ccCccchhhheeecCchHHHHHhhccccccceeeeeEEEeeccccccccchHHHHH
Confidence            99988777665433    33332 46789999999999999887656666788999999999999999999999999999


Q ss_pred             eecCcCCCeEEEEEecCceEeeCCcEEEEEEEEeCCCCCcccEEEEEeeCCCCCcceEEEecccceeecccCCCCeeEEE
Q 013275          327 GTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH  406 (446)
Q Consensus       327 ~~~~~~~dl~l~v~~~P~~v~l~~pF~v~~~v~N~s~r~~~~l~l~l~~~~~~~~~~~~~~G~s~~~Lg~L~P~~s~~~~  406 (446)
                      |.+|+++|++|+++.+|+.|.+++||.++|+++|||+|.|| |++.+++..   ..-++|||+++++||+|.|.+...|.
T Consensus       235 riapgygdvrlsle~~p~~vdleepf~iscki~ncserald-l~l~l~~~n---nrhi~~c~~sg~qlgkl~ps~~l~~a  310 (348)
T KOG2625|consen  235 RIAPGYGDVRLSLEAIPACVDLEEPFEISCKITNCSERALD-LQLELCNPN---NRHIHFCGISGRQLGKLHPSQHLCFA  310 (348)
T ss_pred             hhcCCCCceEEEeeccccccccCCCeEEEEEEcccchhhhh-hhhhhcCCC---CceeEEeccccccccCCCCcceeeeE
Confidence            99999999999999999999999999999999999999999 999998763   35799999999999999999999999


Q ss_pred             EEEEecccceEEeCceEEEecCCCeeeccCCCeeeEee
Q 013275          407 LNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVD  444 (446)
Q Consensus       407 L~l~pl~~Glq~isgi~l~D~~~~r~y~~~~~~~vfV~  444 (446)
                      |+++|...|+|+|+||+|+|+++||+|||+|++||||.
T Consensus       311 l~l~~~~~giqsisgiritdtf~kr~ye~ddiaqi~v~  348 (348)
T KOG2625|consen  311 LNLFPSTQGIQSISGIRITDTFLKRIYEHDDIAQICVS  348 (348)
T ss_pred             EeeccchhcceeecceEeehhhhhhhhcccchHHhhcC
Confidence            99999999999999999999999999999999999984



>PF06159 DUF974: Protein of unknown function (DUF974); InterPro: IPR010378 This is a family of uncharacterised eukaryotic proteins Back     alignment and domain information
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function Back     alignment and domain information
>KOG4386 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
>PF12735 Trs65: TRAPP trafficking subunit Trs65; InterPro: IPR024662 This family is one of the subunits of the TRAPP Golgi trafficking complex [] Back     alignment and domain information
>PF08626 TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC9, TRAPP II complex subunit; InterPro: IPR013935 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway Back     alignment and domain information
>PF12742 Gryzun-like: Gryzun, putative Golgi trafficking Back     alignment and domain information
>PF12584 TRAPPC10: Trafficking protein particle complex subunit 10, TRAPPC10; InterPro: IPR022233 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway Back     alignment and domain information
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins Back     alignment and domain information
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds Back     alignment and domain information
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known Back     alignment and domain information
>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function Back     alignment and domain information
>PF14874 PapD-like: Flagellar-associated PapD-like Back     alignment and domain information
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins Back     alignment and domain information
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins Back     alignment and domain information
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known Back     alignment and domain information
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins Back     alignment and domain information
>smart00809 Alpha_adaptinC2 Adaptin C-terminal domain Back     alignment and domain information
>PF11797 DUF3324: Protein of unknown function C-terminal (DUF3324); InterPro: IPR021759 This family consists of several hypothetical bacterial proteins of unknown function Back     alignment and domain information
>PF02883 Alpha_adaptinC2: Adaptin C-terminal domain; InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment Back     alignment and domain information
>PF14874 PapD-like: Flagellar-associated PapD-like Back     alignment and domain information
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins Back     alignment and domain information
>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query446
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-06
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 49.5 bits (117), Expect = 2e-06
 Identities = 60/390 (15%), Positives = 103/390 (26%), Gaps = 123/390 (31%)

Query: 162 ELGAHTLVCTALYSDGEGERKYLPQF-----FKFIVSNPLSVRTKVRVVK--VGATHFQE 214
           E G H      + S       +   F      K +   P S+ +K   +   + +     
Sbjct: 10  ETGEHQYQYKDILSV------FEDAFVDNFDCKDVQDMPKSILSK-EEIDHIIMSKDAVS 62

Query: 215 IT-FLEACIENHTKSNLYMDQVE--FEPSQNWSATMLKAD-----GPHSDYNAQ------ 260
            T  L   + +  +  +    VE     +  +  + +K +          Y  Q      
Sbjct: 63  GTLRLFWTLLSK-QEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYN 121

Query: 261 SREIFKP-----PVLIRSGGGIHNYLYQLK---------MLSHGSSSPVK--VQGSNVLG 304
             ++F                +   L +L+         +L  G +           V  
Sbjct: 122 DNQVFAKYNVSRLQPYLK---LRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQC 178

Query: 305 KL--QITWRTNLGEPGR----LQTQQILGTTIT---------SKEIELNVVEV------- 342
           K+  +I W  NL         L+  Q L   I          S  I+L +  +       
Sbjct: 179 KMDFKIFW-LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRL 237

Query: 343 ------------------PSVVGIDKPFLLKLK--LT----NQTDKEQGPFEIWLSQNDS 378
                                      F L  K  LT      TD         +S +  
Sbjct: 238 LKSKPYENCLLVLLNVQNAKAW---NAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHH 294

Query: 379 ------DEEKVVMIN--GLRIMALAPVEAFGSTDFHLNLIATKL--GVQRITGI--TVFD 426
                 DE K +++     R   L P E   +    L++IA  +  G+           D
Sbjct: 295 SMTLTPDEVKSLLLKYLDCRPQDL-PREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCD 353

Query: 427 KLEKI---TYDSLP---------DLEIFVD 444
           KL  I   + + L           L +F  
Sbjct: 354 KLTTIIESSLNVLEPAEYRKMFDRLSVFPP 383


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query446
2xzz_A102 Protein-glutamine gamma-glutamyltransferase K; 2.3 96.71
1ex0_A731 Coagulation factor XIII A chain; transglutaminase, 95.97
3hrz_B252 Cobra venom factor; serine protease, glycosilated, 95.52
3idu_A127 Uncharacterized protein; all beta-protein, structu 93.94
1vjj_A692 Protein-glutamine glutamyltransferase E; transglut 93.79
1g0d_A695 Protein-glutamine gamma-glutamyltransferase; tissu 93.11
2q3z_A687 Transglutaminase 2; transglutaminase 2, tissue tra 92.35
2qsv_A220 Uncharacterized protein; MCSG, structural genomics 89.43
2ys4_A122 Hydrocephalus-inducing protein homolog; hydin, PAP 89.34
4fxk_B 767 Complement C4-A alpha chain; immune system, proteo 89.17
4acq_A 1451 Alpha-2-macroglobulin; hydrolase inhibitor, protei 88.06
2hr0_B 915 Complement C3 alpha' chain; complement component C 87.89
3prx_B 1642 Cobra venom factor; immune system, complement, imm 87.8
3es6_B118 Prolactin-inducible protein; major histocompatibil 86.88
2b39_A 1661 C3; thioester, immune defense, immune system; HET: 85.96
2xzz_A102 Protein-glutamine gamma-glutamyltransferase K; 2.3 85.79
2pn5_A 1325 TEP1R, thioester-containing protein I; FULL-length 85.61
2ys4_A122 Hydrocephalus-inducing protein homolog; hydin, PAP 81.88
1vjj_A 692 Protein-glutamine glutamyltransferase E; transglut 80.67
2l0d_A114 Cell surface protein; structural genomics, northea 80.58
1ex0_A 731 Coagulation factor XIII A chain; transglutaminase, 80.21
>2xzz_A Protein-glutamine gamma-glutamyltransferase K; 2.30A {Homo sapiens} Back     alignment and structure
Probab=96.71  E-value=0.0038  Score=51.66  Aligned_cols=73  Identities=5%  Similarity=0.138  Sum_probs=60.5

Q ss_pred             ecCceEeeCCcEEEEEEEEeCCCCCcccEEEEEeeCCCCCcceEEEecccceeecccCCCCeeEEEEEEEecccceEEeC
Q 013275          341 EVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT  420 (446)
Q Consensus       341 ~~P~~v~l~~pF~v~~~v~N~s~r~~~~l~l~l~~~~~~~~~~~~~~G~s~~~Lg~L~P~~s~~~~L~l~pl~~Glq~is  420 (446)
                      +++...++++++.+++.++|--...+....+.++...      + ..+ ....++.+.||++..+.+.+.|..+|.++|-
T Consensus        11 ~v~g~~~v~~~l~v~vsf~NPL~~~L~~c~~~vEG~G------L-~~~-~~~~~~~v~pg~~~~~~~~~~P~~~G~~~L~   82 (102)
T 2xzz_A           11 TLLGAAVVGQECEVQIVFKNPLPVTLTNVVFRLEGSG------L-QRP-KILNVGDIGGNETVTLRQSFVPVRPGPRQLI   82 (102)
T ss_dssp             EESSCCCSSSCEEEEEEEECCSSSCBCSEEEEEEETT------T-EEE-EEEEECCBCTTCEEEEEEEECCCSCSSCCCE
T ss_pred             EECCCcccCCeEEEEEEEECCCCCcccCEEEEEECCC------C-Ccc-eEEEcCcCCCCCEEEEEEEEecCcccceEEE
Confidence            4566668999999999999997777777888998753      2 344 5567899999999999999999999998874


Q ss_pred             c
Q 013275          421 G  421 (446)
Q Consensus       421 g  421 (446)
                      .
T Consensus        83 a   83 (102)
T 2xzz_A           83 A   83 (102)
T ss_dssp             E
T ss_pred             E
Confidence            3



>1ex0_A Coagulation factor XIII A chain; transglutaminase, blood coagulation, mutant, W279F, oxyanion, transferase; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1evu_A 1fie_A 1f13_A 1ggt_A 1ggu_A 1ggy_A 1qrk_A Back     alignment and structure
>3hrz_B Cobra venom factor; serine protease, glycosilated, multi-domain, complement SYST convertase, complement alternate pathway; HET: NAG P6G; 2.20A {Naja kaouthia} PDB: 3frp_G* 3hs0_B* Back     alignment and structure
>3idu_A Uncharacterized protein; all beta-protein, structural genomics, PSI-2, protein structure initiative; 1.70A {Pyrococcus furiosus} PDB: 2kl6_A Back     alignment and structure
>1vjj_A Protein-glutamine glutamyltransferase E; transglutaminase 3, X-RAY crystallography, metalloenzyme, calcium ION; HET: GDP; 1.90A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1sgx_A* 1l9m_A 1l9n_A* 1nud_A 1nuf_A 1nug_A 1rle_A* Back     alignment and structure
>1g0d_A Protein-glutamine gamma-glutamyltransferase; tissue transglutaminase,acyltransferase; 2.50A {Pagrus major} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 Back     alignment and structure
>2q3z_A Transglutaminase 2; transglutaminase 2, tissue transglutaminase, TG2, transferas; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1kv3_A 3ly6_A* Back     alignment and structure
>2qsv_A Uncharacterized protein; MCSG, structural genomics, porphyromonas gingivalis W83, PSI protein structure initiative; 2.10A {Porphyromonas gingivalis} Back     alignment and structure
>2ys4_A Hydrocephalus-inducing protein homolog; hydin, PAPD-like, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>4fxk_B Complement C4-A alpha chain; immune system, proteolytic cascade; HET: NAG BMA; 3.60A {Homo sapiens} PDB: 4fxg_B* Back     alignment and structure
>4acq_A Alpha-2-macroglobulin; hydrolase inhibitor, proteinase inhibitor, irreversible PROT inhibitor, conformational change, blood plasma inhibitor; HET: MEQ NAG MAN; 4.30A {Homo sapiens} Back     alignment and structure
>2hr0_B Complement C3 alpha' chain; complement component C3B, immune system; HET: THC; 2.26A {Homo sapiens} PDB: 2icf_B* 2wii_B* 2win_B* 3g6j_B 3l5n_B* 2a73_B* 2i07_B* 2xwj_B* 2xwb_B* 2a74_C* 2ice_C* 2qki_C* 3l3o_F* 3nms_C* 3nsa_C* 3ohx_C* 3t4a_C 2ice_B* 3l3o_B* 3nms_B* ... Back     alignment and structure
>3prx_B Cobra venom factor; immune system, complement, immune SYS complex; HET: NAG; 4.30A {Naja kaouthia} PDB: 3pvm_B* Back     alignment and structure
>3es6_B Prolactin-inducible protein; major histocompatibility complex, protein-protein complex, P inducible protein, zinc 2-glycoprotein, ZAG-PIP complex; HET: NDG NAG BMA MAN P6G; 3.23A {Homo sapiens} SCOP: b.1.18.23 Back     alignment and structure
>2b39_A C3; thioester, immune defense, immune system; HET: NAG BMA; 3.00A {Bos taurus} Back     alignment and structure
>2xzz_A Protein-glutamine gamma-glutamyltransferase K; 2.30A {Homo sapiens} Back     alignment and structure
>2pn5_A TEP1R, thioester-containing protein I; FULL-length mature peptide, immune system; HET: NAG; 2.70A {Anopheles gambiae} Back     alignment and structure
>2ys4_A Hydrocephalus-inducing protein homolog; hydin, PAPD-like, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>1vjj_A Protein-glutamine glutamyltransferase E; transglutaminase 3, X-RAY crystallography, metalloenzyme, calcium ION; HET: GDP; 1.90A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1sgx_A* 1l9m_A 1l9n_A* 1nud_A 1nuf_A 1nug_A 1rle_A* Back     alignment and structure
>2l0d_A Cell surface protein; structural genomics, northeast structural genomics consortiu PSI-2, protein structure initiative; NMR {Methanosarcina acetivorans} Back     alignment and structure
>1ex0_A Coagulation factor XIII A chain; transglutaminase, blood coagulation, mutant, W279F, oxyanion, transferase; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1evu_A 1fie_A 1f13_A 1ggt_A 1ggu_A 1ggy_A 1qrk_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query446
d1vjja399 Transglutaminase, two C-terminal domains {Human (H 96.96
d1ex0a3100 Transglutaminase, two C-terminal domains {Human (H 96.82
d1g0da3101 Transglutaminase, two C-terminal domains {Red sea 96.75
d2q3za398 Transglutaminase, two C-terminal domains {Human (H 96.47
d1ex0a2112 Transglutaminase, two C-terminal domains {Human (H 96.34
d1vjja2115 Transglutaminase, two C-terminal domains {Human (H 95.95
d1g0da2112 Transglutaminase, two C-terminal domains {Red sea 95.61
d2q3za2114 Transglutaminase, two C-terminal domains {Human (H 95.19
d1vjja399 Transglutaminase, two C-terminal domains {Human (H 94.54
d2q3za398 Transglutaminase, two C-terminal domains {Human (H 94.52
d1g0da3101 Transglutaminase, two C-terminal domains {Red sea 92.93
d3es6b1118 Prolactin-inducible protein, PIP {Human (Homo sapi 92.52
d1ex0a3100 Transglutaminase, two C-terminal domains {Human (H 91.01
>d1vjja3 b.1.5.1 (A:594-692) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]} Back     information, alignment and structure
class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: Transglutaminase, two C-terminal domains
family: Transglutaminase, two C-terminal domains
domain: Transglutaminase, two C-terminal domains
species: Human (Homo sapiens), TGase E3 [TaxId: 9606]
Probab=96.96  E-value=0.00089  Score=53.21  Aligned_cols=75  Identities=11%  Similarity=0.252  Sum_probs=60.7

Q ss_pred             EecCceEeeCCcEEEEEEEEeCCCCCcccEEEEEeeCCCCCcceEEEecccceeecccCCCCeeEEEEEEEecccceEEe
Q 013275          340 VEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI  419 (446)
Q Consensus       340 ~~~P~~v~l~~pF~v~~~v~N~s~r~~~~l~l~l~~~~~~~~~~~~~~G~s~~~Lg~L~P~~s~~~~L~l~pl~~Glq~i  419 (446)
                      .++|...++++++.++++++|--+..+.+-.+.++...      + +.+.....++.+.||++.++.+.+.|..+|.++|
T Consensus         5 I~v~~~~~v~~~~~v~vsf~NPL~~~L~~c~f~vEG~G------L-~~~~~~~~~~~v~p~~~~~~~~~~~P~~~G~~~l   77 (99)
T d1vjja3           5 LEVLNEARVRKPVNVQMLFSNPLDEPVRDCVLMVEGSG------L-LLGNLKIDVPTLGPKERSRVRFDILPSRSGTKQL   77 (99)
T ss_dssp             EEECSCCBTTSCEEEEEEEECCSSSCBCSEEEEEECTT------T-SSSCEEEEECCBCTTCEEEEEEEECCCSCEEEEE
T ss_pred             EEeCCCcCcCCeEEEEEEEECCCCCchhCEEEEEEeCC------C-CCccEEEecCccCCCCEEEEEEEEEcCCcccEEE
Confidence            35677788999999999999998888876888887652      1 2233345688899999999999999999999997


Q ss_pred             Cc
Q 013275          420 TG  421 (446)
Q Consensus       420 sg  421 (446)
                      -.
T Consensus        78 ~a   79 (99)
T d1vjja3          78 LA   79 (99)
T ss_dssp             EE
T ss_pred             EE
Confidence            43



>d1ex0a3 b.1.5.1 (A:628-727) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1g0da3 b.1.5.1 (A:584-684) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]} Back     information, alignment and structure
>d2q3za3 b.1.5.1 (A:586-683) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1ex0a2 b.1.5.1 (A:516-627) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1vjja2 b.1.5.1 (A:479-593) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]} Back     information, alignment and structure
>d1g0da2 b.1.5.1 (A:472-583) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]} Back     information, alignment and structure
>d2q3za2 b.1.5.1 (A:472-585) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1vjja3 b.1.5.1 (A:594-692) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]} Back     information, alignment and structure
>d2q3za3 b.1.5.1 (A:586-683) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]} Back     information, alignment and structure
>d1g0da3 b.1.5.1 (A:584-684) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]} Back     information, alignment and structure
>d3es6b1 b.1.18.23 (B:1-118) Prolactin-inducible protein, PIP {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1ex0a3 b.1.5.1 (A:628-727) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]} Back     information, alignment and structure