Citrus Sinensis ID: 013275

Local Sequence Feature Prediction

Prediction and (Method)	Result

Residue Number Marker

Protein Sequence

Secondary Structure (PSIPRED)

Secondary Structure Prediction (SSPRO)

Coil and Loop (DISEMBL)

Flexible Loop (DISEMBL)

Low Complexity Region (SEG)

Disordered region (IsUnstruct)

Disordered Region (DISOPRED)

Disordered Region (DISEMBL)

Disordered Region (DISPRO)

Transmembrane Helix (TMHMM)

Transmembrane Helix (HMMTOP)

Transmembrane Helix (MEMSAT)

TM Helix, Signal Peptide (MEMSAT_SVM)

TM Helix, Signal Peptide (Phobius)

Signal Peptide (SignalP HMM Mode)

Signal Peptide (SignalP NN Mode)

Coiled Coils (COILS)

Positional Conservation

--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440------

MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD

cccccccccEEEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEEEccccccEEEEEEEEEEEcccEEEEEcccccccccccccccEEEEEEEEEEEEEccEEEEEEEEEEcccccEEEEEEEEEEEEEccEEEEEEEEEEcccccccccEEEEEEEEEccccccEEEEEEEEEEcccccEEEcccccccccccccccccccccEEEEccccEEEEEEEEEEccccccccccccccEEcEEEEEEEEcccccccEEEEEEEEEEccccccEEEEEEEcccEEEEcccEEEEEEEEEccccccccEEEEEEccccccccEEEEEcccEEEEcccccccEEEEEEEEEEccccEEEEccEEEEEccccEEEcccccEEEEEccc

ccccccccEEEEEEEEccccccccccccccccccccccccccccccccccccHHccccccccccccccccccccccccccccccccccccccccccEEEEEEEEEEEEEEcccccEEEEEEEEEEEEccccEEEcccccccccccccccccccEEEEEEEHHcccEEEEEEEEEEcccccEEEEEEEEEEEEccccEEEEEEEcccccccccccEEEEEEEEEEcccccEEEEEEEEccccccEEEEcccccccccccccccccccccccEEccccEEEEEEEEEcccccccccccccccEEEEEEEEEEEccccccccEEcccccccccccccEEEEEEEcccEEEEEEcEEEEEEEEEccccccccEEEEEEccccccEEEEEEcccEccccccccccccEEEEEEEEEccccEEEEcEEEEEEccccEEEEccccEEEEEccc

msstpgthsLAFRVMRlcrpslhvepplrvdptdlfigedifddpiaasnlpplissdvttnkssdltyRSRFLLhdsadsiglsgllvlpqafgaiylgeTFCSYISinnsstlevRDVVIKAEIQTDKQRILLLdtskspvesiraggrydfIVEHDVKELGAHTLVCTAlysdgegerkyLPQFFKFIvsnplsvrtKVRVVKVGATHFQEITFLEACIENHtksnlymdqvefepsqnwsatmlkadgphsdynaqsreifkppvlirsgggiHNYLYQLKmlshgssspvkvqgsnvlgklqitwrtnlgepgrlqtQQILGTTITSKEIElnvvevpsvvgidkpfllklkltnqtdkeqgpfeiwlsqndsdeekVVMINGLRIMalapveafgstdFHLNLIATKLGVQRITGITVFDKLekitydslpdleifvdqd

msstpgthslAFRVMRLCrpslhvepplrvDPTDLFIGEDIFDDPIAasnlpplissdvttnksSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAeiqtdkqrillldtskspvesiRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIvsnplsvrTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWrtnlgepgrlqtQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLtnqtdkeqgpfeiwlsqndsdEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKlekitydslpdleifvdqd

MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRlqtqqilgttitSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD

*********LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS*********DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF*************************EIFKPPVLIRSGGGIHNYLYQLKMLSHG****VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN*********EIW**********VVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV***

*****GT**LAFRVMRLCRPSLHVEPPLRVDPTDLFIG******************************************SIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD********************GRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKV************ITFLEACIENHTKSNLYMDQVEFEPSQNWSATM********************PVLIRSGGGIHNYLYQLKML************SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD

MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH********QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD

*****GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGE**************************************SADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA***********REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQ*

oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooo

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

Original result of BLAST against SWISS-PROT Database

ID	Alignment graph	Length	Definition	RBH(Q2H)	RBH(H2Q)	Q cover	H cover	Identity	E-value
Query		446	2.2.26 [Sep-21-2011]
Q5RCG0		417	UPF0533 protein C5orf44 h	yes	no	0.860	0.920	0.318	1e-52
A5PLN9		417	UPF0533 protein C5orf44 O	yes	no	0.860	0.920	0.316	1e-52
Q6PBY7		412	UPF0533 protein C5orf44 h	yes	no	0.849	0.919	0.325	3e-52
A7MB76		417	UPF0533 protein C5orf44 h	yes	no	0.860	0.920	0.311	2e-51
Q3TIR1		417	UPF0533 protein C5orf44 h	yes	no	0.869	0.930	0.317	4e-51
Q5M887		418	UPF0533 protein C5orf44 h	yes	no	0.872	0.930	0.317	6e-51
Q0VFT9		412	UPF0533 protein C5orf44 h	yes	no	0.856	0.927	0.318	3e-49
Q6GPR5		414	UPF0533 protein C5orf44 h	N/A	no	0.860	0.927	0.316	1e-47
A8WX89		401	UPF0533 protein CBG04321	N/A	no	0.872	0.970	0.306	2e-47
Q95QQ2		401	UPF0533 protein C56C10.7	yes	no	0.874	0.972	0.298	3e-47

>sp\|Q5RCG0\|CE044_PONAB UPF0533 protein C5orf44 homolog OS=Pongo abelii PE=2 SV=1	Back alignment and function desciption

 Score =  207 bits (528), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 138/433 (31%), Positives = 217/433 (50%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L   +    S     SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVTELNSVSQAGESVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399

Pongo abelii (taxid: 9601)

>sp\|A5PLN9\|CE044_HUMAN UPF0533 protein C5orf44 OS=Homo sapiens GN=C5orf44 PE=2 SV=2	Back alignment and function description

>sp\|Q6PBY7\|CE044_DANRE UPF0533 protein C5orf44 homolog OS=Danio rerio GN=zgc:73187 PE=2 SV=2	Back alignment and function description

>sp\|A7MB76\|CE044_BOVIN UPF0533 protein C5orf44 homolog OS=Bos taurus PE=2 SV=1	Back alignment and function description

>sp\|Q3TIR1\|CE044_MOUSE UPF0533 protein C5orf44 homolog OS=Mus musculus PE=2 SV=1	Back alignment and function description

>sp\|Q5M887\|CE044_RAT UPF0533 protein C5orf44 homolog OS=Rattus norvegicus PE=2 SV=2	Back alignment and function description

>sp\|Q0VFT9\|CE044_XENTR UPF0533 protein C5orf44 homolog OS=Xenopus tropicalis PE=2 SV=1	Back alignment and function description

>sp\|Q6GPR5\|CE044_XENLA UPF0533 protein C5orf44 homolog OS=Xenopus laevis PE=2 SV=2	Back alignment and function description

>sp\|A8WX89\|U533_CAEBR UPF0533 protein CBG04321 OS=Caenorhabditis briggsae GN=CBG04321 PE=3 SV=2	Back alignment and function description

>sp\|Q95QQ2\|U533_CAEEL UPF0533 protein C56C10.7 OS=Caenorhabditis elegans GN=C56C10.7 PE=1 SV=1	Back alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST

Original result of BLAST against Nonredundant Database

GI	Alignment Graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		446
255556003		434	expressed protein, putative [Ricinus com	0.970	0.997	0.760	0.0
225470348		438	PREDICTED: UPF0533 protein C5orf44 [Viti	0.977	0.995	0.763	0.0
356548745		440	PREDICTED: UPF0533 protein C5orf44 homol	0.961	0.975	0.747	0.0
449457717		440	PREDICTED: UPF0533 protein C5orf44-like	0.979	0.993	0.712	0.0
224079249		450	predicted protein [Populus trichocarpa]	0.964	0.955	0.733	0.0
356521339		435	PREDICTED: UPF0533 protein C5orf44-like	0.952	0.977	0.731	0.0
388496064		437	unknown [Medicago truncatula]	0.950	0.970	0.715	1e-180
358346667		446	hypothetical protein MTR_084s0010 [Medic	0.950	0.950	0.700	1e-178
18407493		442	uncharacterized protein [Arabidopsis tha	0.975	0.984	0.668	1e-168
297824907		443	hypothetical protein ARALYDRAFT_483987 [	0.970	0.977	0.667	1e-167

>gi\|255556003\|ref\|XP_002519036.1\| expressed protein, putative [Ricinus communis] gi\|223541699\|gb\|EEF43247.1\| expressed protein, putative [Ricinus communis]	Back alignment and taxonomy information

 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/447 (76%), Positives = 384/447 (85%), Gaps = 14/447 (3%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+TPGTHSLAFRVMRLCRPS HV+  L VDP+DL +GEDIFDDP+AAS LPPLI S +T
Sbjct: 1   MSTTPGTHSLAFRVMRLCRPSFHVDAQLLVDPSDLIVGEDIFDDPVAASRLPPLIDSHIT 60

Query: 61  T-NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
               +SDL+YR+RFL    +DS GL+GLLVLPQAFGAIYLGETFCSYISINNSS  EVRD
Sbjct: 61  KLTDTSDLSYRTRFLHQHPSDSFGLTGLLVLPQAFGAIYLGETFCSYISINNSSNFEVRD 120

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
           V+IKAEIQT++QRILLLDTSK+PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG+G
Sbjct: 121 VIIKAEIQTERQRILLLDTSKNPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGDG 180

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           ERKYLPQFFKFIV+NPLSVRTKVRVVK       E T+LEACIENHTK+NLYMDQVEFEP
Sbjct: 181 ERKYLPQFFKFIVANPLSVRTKVRVVK-------ETTYLEACIENHTKTNLYMDQVEFEP 233

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           +Q+WSA ++K D   S+ ++ +REIFKPPVLIRSGGGIHNYLYQL++ +HG++       
Sbjct: 234 AQHWSAKIIKDDEKQSEKDSLTREIFKPPVLIRSGGGIHNYLYQLRLSAHGAAQ------ 287

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
           SNVLGKLQITWRTNLGEPGRLQTQQILGT IT KEIEL + +VP+V+ +DKPF + LKLT
Sbjct: 288 SNVLGKLQITWRTNLGEPGRLQTQQILGTPITRKEIELCIAKVPAVINLDKPFSVHLKLT 347

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N TDKE GPFE+WLSQ+ S EEK V INGL+ M L+ +EAFG+TDFHLNLIATKLGVQRI
Sbjct: 348 NHTDKELGPFEVWLSQDGSVEEKAVTINGLQTMELSQLEAFGTTDFHLNLIATKLGVQRI 407

Query: 420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
           TGITVFDK EK TYD LPDLEIFV  D
Sbjct: 408 TGITVFDKSEKKTYDPLPDLEIFVAID 434

Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi\|225470348\|ref\|XP_002269604.1\| PREDICTED: UPF0533 protein C5orf44 [Vitis vinifera] gi\|296090651\|emb\|CBI41051.3\| unnamed protein product [Vitis vinifera]	Back alignment and taxonomy information

>gi\|356548745\|ref\|XP_003542760.1\| PREDICTED: UPF0533 protein C5orf44 homolog [Glycine max]	Back alignment and taxonomy information

>gi\|449457717\|ref\|XP_004146594.1\| PREDICTED: UPF0533 protein C5orf44-like [Cucumis sativus]	Back alignment and taxonomy information

>gi\|224079249\|ref\|XP_002305809.1\| predicted protein [Populus trichocarpa] gi\|222848773\|gb\|EEE86320.1\| predicted protein [Populus trichocarpa]	Back alignment and taxonomy information

>gi\|356521339\|ref\|XP_003529314.1\| PREDICTED: UPF0533 protein C5orf44-like [Glycine max]	Back alignment and taxonomy information

>gi\|388496064\|gb\|AFK36098.1\| unknown [Medicago truncatula]	Back alignment and taxonomy information

>gi\|358346667\|ref\|XP_003637387.1\| hypothetical protein MTR_084s0010 [Medicago truncatula] gi\|355503322\|gb\|AES84525.1\| hypothetical protein MTR_084s0010 [Medicago truncatula]	Back alignment and taxonomy information

>gi|18407493|ref|NP_566117.1| uncharacterized protein [Arabidopsis thaliana] gi|16226796|gb|AAL16264.1|AF428334_1 At2g47960/T9J23.10 [Arabidopsis thaliana] gi|18377797|gb|AAL67048.1| unknown protein [Arabidopsis thaliana] gi|20197311|gb|AAC63650.2| expressed protein [Arabidopsis thaliana] gi|20197565|gb|AAM15133.1| expressed protein [Arabidopsis thaliana] gi|21281259|gb|AAM45021.1| unknown protein [Arabidopsis thaliana] gi|330255823|gb|AEC10917.1| uncharacterized protein [Arabidopsis thaliana]

Back alignment and taxonomy information

>gi\|297824907\|ref\|XP_002880336.1\| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp. lyrata] gi\|297326175\|gb\|EFH56595.1\| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp. lyrata]	Back alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

Original result of BLAST against Gene Ontology (AMIGO)

ID	Alignment graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		446
TAIR\|locus:2043433		442	AT2G47960 "AT2G47960" [Arabido	0.975	0.984	0.651	9e-148
MGI\|MGI:1914225		417	Trappc13 "trafficking protein	0.771	0.824	0.322	1.6e-49
ZFIN\|ZDB-GENE-030131-9775		412	trappc13 "trafficking protein	0.612	0.662	0.364	2.3e-48
DICTYBASE\|DDB_G0269062		511	DDB_G0269062 "DUF974 family pr	0.526	0.459	0.338	9.7e-46
FB\|FBgn0032204		438	CG4953 [Drosophila melanogaste	0.605	0.616	0.337	2.2e-40
UNIPROTKB\|G4NC96		339	MGG_01105 "Uncharacterized pro	0.213	0.280	0.324	0.00052

TAIR\|locus:2043433 AT2G47960 "AT2G47960" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

 Score = 1443 (513.0 bits), Expect = 9.0e-148, P = 9.0e-148
 Identities = 291/447 (65%), Positives = 343/447 (76%)

Query:     2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
             + T G HSLAFRVMRLC+PS HV+PPLR+DP DL  GED  DDP +AS     +SS    
Sbjct:     6 TQTHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADAV 65

Query:    62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
             +  SDL+YR+RFLL+   D IGLSGLL+LPQ+FGAIYLGETFCSYIS+NNSST EVRDV 
Sbjct:    66 D--SDLSYRNRFLLNHPTDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDVT 123

Query:   122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
             IKAEIQT++QRILLLDTSKSPVESIR GGRYDFIVEHDVKELGAHTLVC+ALY+D +GER
Sbjct:   124 IKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEHDVKELGAHTLVCSALYNDADGER 183

Query:   182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
             KYLPQFFKF+V+NPLSVRTKVRVVK       E TFLEACIENHTK+NL+MDQV+FEP++
Sbjct:   184 KYLPQFFKFVVANPLSVRTKVRVVK-------ETTFLEACIENHTKANLFMDQVDFEPAK 236

Query:   242 NWSATMLKADGPHSD--YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
              WSA  L+ +    D   +  S  I KPPV+IRSGGGIHNYLY+L   S   S   K QG
Sbjct:   237 QWSAVRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNP-SADVSGQTKFQG 295

Query:   300 SNVLGKLQITWRTNLGEPGRXXXXXXXXXXXXSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
             SN+LGK QITWRTNLGEPGR             KEI + VVEVP+V+ +++PF   L LT
Sbjct:   296 SNILGKFQITWRTNLGEPGRLQTQQILGAPVSRKEINMRVVEVPAVIHLNRPFRAYLNLT 355

Query:   360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
             NQTD++ GPFE+ LSQ+++  EK V INGL+ + L  +EAFGS DF LNLIA+KLGVQ+I
Sbjct:   356 NQTDRQLGPFEVSLSQDETQLEKPVGINGLQTLMLPRIEAFGSNDFQLNLIASKLGVQKI 415

Query:   420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
              GIT  D  EK TY+ +PD+EIFV+ D
Sbjct:   416 AGITALDTREKKTYELVPDMEIFVETD 442

GO:0003674 "molecular_function" evidence=ND

GO:0008150 "biological_process" evidence=ND

GO:0009507 "chloroplast" evidence=ISM

GO:0006635 "fatty acid beta-oxidation" evidence=RCA

GO:0016558 "protein import into peroxisome matrix" evidence=RCA

MGI\|MGI:1914225 Trappc13 "trafficking protein particle complex 13" [Mus musculus (taxid:10090)]	Back alignment and assigned GO terms

ZFIN\|ZDB-GENE-030131-9775 trappc13 "trafficking protein particle complex 13" [Danio rerio (taxid:7955)]	Back alignment and assigned GO terms

DICTYBASE\|DDB_G0269062 DDB_G0269062 "DUF974 family protein" [Dictyostelium discoideum (taxid:44689)]	Back alignment and assigned GO terms

FB\|FBgn0032204 CG4953 [Drosophila melanogaster (taxid:7227)]	Back alignment and assigned GO terms

UNIPROTKB\|G4NC96 MGG_01105 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)]	Back alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

Original result of BLAST against SWISS-PROT

ID	Name	Annotated EC number	Identity	Query coverage	Hit coverage	RBH(Q2H)	RBH(H2Q)
Q0VFT9	CE044_XENTR	No assigned EC number	0.3186	0.8565	0.9271	yes	no
Q95TN1	U533_DROME	No assigned EC number	0.3022	0.8744	0.8904	yes	no
Q5RCG0	CE044_PONAB	No assigned EC number	0.3187	0.8609	0.9208	yes	no
Q3TIR1	CE044_MOUSE	No assigned EC number	0.3170	0.8699	0.9304	yes	no
Q5M887	CE044_RAT	No assigned EC number	0.3170	0.8721	0.9306	yes	no
A7MB76	CE044_BOVIN	No assigned EC number	0.3117	0.8609	0.9208	yes	no
Q6PBY7	CE044_DANRE	No assigned EC number	0.3256	0.8497	0.9199	yes	no
A5PLN9	CE044_HUMAN	No assigned EC number	0.3163	0.8609	0.9208	yes	no

EC Number Prediction by Ezypred Server

Original result from Ezypred Server

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software

No EC number assignment, probably not an enzyme!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Original result from the STRING server

Your Input:
	gw1.IV.3206.1	hypothetical protein (423 aa)
		(Populus trichocarpa)
Predicted Functional Partners:
		Sorry, there are no predicted associations at the current settings.

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

Original result of RPS-BLAST against CDD database part I

ID	Alignment Graph	Length	Definition	E-value
Query		446
pfam06159		235	pfam06159, DUF974, Protein of unknown function (DU	1e-101

>gnl\|CDD\|218917 pfam06159, DUF974, Protein of unknown function (DUF974)	Back alignment and domain information

 Score =  300 bits (771), Expect = e-101
 Identities = 115/239 (48%), Positives = 150/239 (62%), Gaps = 6/239 (2%)

Query: 88  LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
           L LPQ+FG+IYLGETF SY+ +NN S+ EVRDV IKAE+QT  QR+ L D+  +PVE++R
Sbjct: 1   LTLPQSFGSIYLGETFSSYLCVNNESSKEVRDVSIKAELQTPSQRLNLSDSVDAPVETLR 60

Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
            G   DF+V  DVKE G H LVCT  Y++  GE +Y  +FFKFIV NPLSVRTK   ++ 
Sbjct: 61  PGESLDFVVSFDVKEEGTHILVCTVSYTEASGETRYFRKFFKFIVKNPLSVRTKFYQLED 120

Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
            +       +LEA IEN T+ NL++++V  EPS  + AT L  +    D +     + K 
Sbjct: 121 LSR---RRVYLEAQIENITEDNLFLEKVTLEPSPGYKATSLNWEPSLGDVDGLDGGMDKR 177

Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
           PVL    G I  YL+ LK    G+   +K+ G   LGKL I WRT +GE GRLQT Q+ 
Sbjct: 178 PVL--KPGDIRQYLFCLKP-KEGALEELKLDGRTNLGKLDIVWRTAMGEKGRLQTSQLQ 233

Family of uncharacterized eukaryotic proteins. Length = 235

Conserved Domains Detected by HHsearch

Original result of HHsearch against CDD database

ID	Alignment Graph	Length	Definition	Probability
Query		446
KOG2625		348	consensus Uncharacterized conserved protein [Funct	100.0
PF06159		249	DUF974: Protein of unknown function (DUF974); Inte	100.0
PF07919		554	Gryzun: Gryzun, putative trafficking through Golgi	99.91
KOG4386		809	consensus Uncharacterized conserved protein [Funct	99.78
PF12735		306	Trs65: TRAPP trafficking subunit Trs65; InterPro:	98.82
PF08626		1185	TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC	98.34
PF12742		57	Gryzun-like: Gryzun, putative Golgi trafficking	97.72
PF12584		147	TRAPPC10: Trafficking protein particle complex sub	97.58
PF07705		101	CARDB: CARDB; InterPro: IPR011635 The APHP (acidic	96.59
PF00927		107	Transglut_C: Transglutaminase family, C-terminal i	96.53
PF10633		78	NPCBM_assoc: NPCBM-associated, NEW3 domain of alph	95.32
PF07919		554	Gryzun: Gryzun, putative trafficking through Golgi	95.03
PF14874		102	PapD-like: Flagellar-associated PapD-like	94.92
PF05753		181	TRAP_beta: Translocon-associated protein beta (TRA	93.84
PF07705		101	CARDB: CARDB; InterPro: IPR011635 The APHP (acidic	93.38
PF10633		78	NPCBM_assoc: NPCBM-associated, NEW3 domain of alph	90.47
PF05753		181	TRAP_beta: Translocon-associated protein beta (TRA	90.1
smart00809		104	Alpha_adaptinC2 Adaptin C-terminal domain. Adaptin	86.52
PF11797		140	DUF3324: Protein of unknown function C-terminal (D	86.22
PF02883		115	Alpha_adaptinC2: Adaptin C-terminal domain; InterP	84.7
PF14874		102	PapD-like: Flagellar-associated PapD-like	84.38
PF00207		92	A2M: Alpha-2-macroglobulin family; InterPro: IPR00	81.76
PF00927		107	Transglut_C: Transglutaminase family, C-terminal i	81.21

>KOG2625 consensus Uncharacterized conserved protein [Function unknown]	Back alignment and domain information

Probab=100.00  E-value=1.3e-97  Score=687.14  Aligned_cols=348  Identities=30%  Similarity=0.533  Sum_probs=322.2

Q ss_pred             cccccccccceeecceEEEEEEEEcCCCcceEEEEEEEEEeCCCceEeccCCCCCCccccCCCCeeeEEEEEEccccCce
Q 013275           87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAH  166 (446)
Q Consensus        87 ~L~LP~sfG~iylGEtFs~~i~v~N~s~~~v~~V~ikaelqT~s~r~~L~~~~~~~~~~L~pg~~ld~iv~~~lke~G~h  166 (446)
                      +|.+||.||+|||||||+.||+|||+|++.|++|.+||||||.+||+.|... .....+++|.++.+.+|+||+||+|+|
T Consensus         1 ~l~~pq~f~niflgetfs~yinv~nds~k~v~~i~lk~dlqtssqrl~l~~s-~~~~aei~~~~c~~~vi~hevkeig~h   79 (348)
T KOG2625|consen    1 MLIAPQMFENIFLGETFSFYINVHNDSEKTVKDILLKADLQTSSQRLNLPAS-NAAAAEIEPDCCEDDVIHHEVKEIGQH   79 (348)
T ss_pred             CccchhhhcceeeccceEEEEEEecchhhhhhhheeeecccccceeeccccc-hhhhhhcCccccchhhhhHHHHhhccE
Confidence            4789999999999999999999999999999999999999999999999653 344678999999999999999999999


Q ss_pred             EEEEEEEEEcCCCceeeeceEEEEEeecCeEEEEEeEEccccccccCCeeEEEEEEEecccccEEEEeEEeeecCCceee
Q 013275          167 TLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT  246 (446)
Q Consensus       167 ~L~c~VsY~~~~Ge~~~frK~fkF~v~~Pl~VrtK~~~~~~~~~~~~~~~~LEaqiqN~s~~~l~le~v~Lep~~~~~~~  246 (446)
                      +|+|+|+|++++||.++|||||||+|.+|++||||||+++..-...++++||||||||+|..+|+||+|+|+|+.+|.++
T Consensus        80 ilicavny~tq~ge~myfrkffkf~v~kpidvktkfynaesdlssv~~dvfleaqien~s~a~mflekv~ldps~~ynvt  159 (348)
T KOG2625|consen   80 ILICAVNYKTQAGEKMYFRKFFKFPVLKPIDVKTKFYNAESDLSSVNDDVFLEAQIENMSNANMFLEKVELDPSIHYNVT  159 (348)
T ss_pred             EEEEEEeeeccCccchhHHhhccccccccccccceeecccccccccchhhhhhhhhhcccccchhhhhhccCchheecce
Confidence            99999999999999999999999999999999999999964444557899999999999999999999999999999999


Q ss_pred             eecCCCCCCCCCcccccccCCceEEeCCCCeeeEEEEEeecCCCCCCCccccCceeeEEEEEEEEcCCCCCceeeEEeee
Q 013275          247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL  326 (446)
Q Consensus       247 ~ln~~~~~~~~~~~~~~~~~~~~l~~~~gd~~q~lf~l~~~~~~~~~~~~~~g~~~lGkL~I~WRs~~Ge~G~L~Ts~l~  326 (446)
                      +++.+.+.++.-++    |.... +++|.|+|||||||+|+.+..++.+-.++.+.+|||||.||++|||+|||||++||
T Consensus       160 ~i~~~~e~gdcvst----fg~~~-~lkp~d~rq~l~cl~pk~d~~~~~gi~k~lt~igkldi~wktnlgekgrlqts~lq  234 (348)
T KOG2625|consen  160 EIAHEDEAGDCVST----FGSGA-LLKPKDIRQFLFCLKPKADFAEKAGIIKDLTSIGKLDISWKTNLGEKGRLQTSALQ  234 (348)
T ss_pred             eecchhhccccccc----ccccc-ccCccchhhheeecCchHHHHHhhccccccceeeeeEEEeeccccccccchHHHHH
Confidence            99988777665433    33332 46789999999999999887656666788999999999999999999999999999


Q ss_pred             eecCcCCCeEEEEEecCceEeeCCcEEEEEEEEeCCCCCcccEEEEEeeCCCCCcceEEEecccceeecccCCCCeeEEE
Q 013275          327 GTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH  406 (446)
Q Consensus       327 ~~~~~~~dl~l~v~~~P~~v~l~~pF~v~~~v~N~s~r~~~~l~l~l~~~~~~~~~~~~~~G~s~~~Lg~L~P~~s~~~~  406 (446)
                      |.+|+++|++|+++.+|+.|.+++||.++|+++|||+|.|| |++.+++..   ..-++|||+++++||+|.|.+...|.
T Consensus       235 riapgygdvrlsle~~p~~vdleepf~iscki~ncserald-l~l~l~~~n---nrhi~~c~~sg~qlgkl~ps~~l~~a  310 (348)
T KOG2625|consen  235 RIAPGYGDVRLSLEAIPACVDLEEPFEISCKITNCSERALD-LQLELCNPN---NRHIHFCGISGRQLGKLHPSQHLCFA  310 (348)
T ss_pred             hhcCCCCceEEEeeccccccccCCCeEEEEEEcccchhhhh-hhhhhcCCC---CceeEEeccccccccCCCCcceeeeE
Confidence            99999999999999999999999999999999999999999 999998763   35799999999999999999999999


Q ss_pred             EEEEecccceEEeCceEEEecCCCeeeccCCCeeeEee
Q 013275          407 LNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVD  444 (446)
Q Consensus       407 L~l~pl~~Glq~isgi~l~D~~~~r~y~~~~~~~vfV~  444 (446)
                      |+++|...|+|+|+||+|+|+++||+|||+|++||||.
T Consensus       311 l~l~~~~~giqsisgiritdtf~kr~ye~ddiaqi~v~  348 (348)
T KOG2625|consen  311 LNLFPSTQGIQSISGIRITDTFLKRIYEHDDIAQICVS  348 (348)
T ss_pred             EeeccchhcceeecceEeehhhhhhhhcccchHHhhcC
Confidence            99999999999999999999999999999999999984

>PF06159 DUF974: Protein of unknown function (DUF974); InterPro: IPR010378 This is a family of uncharacterised eukaryotic proteins	Back alignment and domain information

>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function	Back alignment and domain information

>KOG4386 consensus Uncharacterized conserved protein [Function unknown]	Back alignment and domain information

>PF12735 Trs65: TRAPP trafficking subunit Trs65; InterPro: IPR024662 This family is one of the subunits of the TRAPP Golgi trafficking complex []	Back alignment and domain information

>PF08626 TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC9, TRAPP II complex subunit; InterPro: IPR013935 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway	Back alignment and domain information

>PF12742 Gryzun-like: Gryzun, putative Golgi trafficking	Back alignment and domain information

>PF12584 TRAPPC10: Trafficking protein particle complex subunit 10, TRAPPC10; InterPro: IPR022233 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway	Back alignment and domain information

>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins	Back alignment and domain information

>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds	Back alignment and domain information

>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known	Back alignment and domain information

>PF07919 Gryzun: Gryzun, putative trafficking through Golgi; InterPro: IPR012880 The proteins featured in this family are all hypothetical eukaryotic proteins of unknown function	Back alignment and domain information

>PF14874 PapD-like: Flagellar-associated PapD-like	Back alignment and domain information

>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins	Back alignment and domain information

>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins	Back alignment and domain information

>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known	Back alignment and domain information

>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins	Back alignment and domain information

>smart00809 Alpha_adaptinC2 Adaptin C-terminal domain	Back alignment and domain information

>PF11797 DUF3324: Protein of unknown function C-terminal (DUF3324); InterPro: IPR021759 This family consists of several hypothetical bacterial proteins of unknown function	Back alignment and domain information

>PF02883 Alpha_adaptinC2: Adaptin C-terminal domain; InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment	Back alignment and domain information

>PF14874 PapD-like: Flagellar-associated PapD-like	Back alignment and domain information

>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins	Back alignment and domain information

>PF00927 Transglut_C: Transglutaminase family, C-terminal ig like domain; InterPro: IPR008958 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Transglutaminases catalyse the post-translational modification of proteins at glutamine residues, with formation of isopeptide bonds	Back alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST

Original result of BLAST against Protein Data Bank

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST

Original result of RPS-BLAST against PDB70 database

ID	Alignment Graph	Length	Definition	E-value
Query		446
1vt4_I		1221	APAF-1 related killer DARK; drosophila apoptosome,	2e-06
1vt4_I		1221	APAF-1 related killer DARK; drosophila apoptosome,	3e-04

>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221	Back alignment and structure

 Score = 49.5 bits (117), Expect = 2e-06
 Identities = 60/390 (15%), Positives = 103/390 (26%), Gaps = 123/390 (31%)

Query: 162 ELGAHTLVCTALYSDGEGERKYLPQF-----FKFIVSNPLSVRTKVRVVK--VGATHFQE 214
           E G H      + S       +   F      K +   P S+ +K   +   + +     
Sbjct: 10  ETGEHQYQYKDILSV------FEDAFVDNFDCKDVQDMPKSILSK-EEIDHIIMSKDAVS 62

Query: 215 IT-FLEACIENHTKSNLYMDQVE--FEPSQNWSATMLKAD-----GPHSDYNAQ------ 260
            T  L   + +  +  +    VE     +  +  + +K +          Y  Q      
Sbjct: 63  GTLRLFWTLLSK-QEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYN 121

Query: 261 SREIFKP-----PVLIRSGGGIHNYLYQLK---------MLSHGSSSPVK--VQGSNVLG 304
             ++F                +   L +L+         +L  G +           V  
Sbjct: 122 DNQVFAKYNVSRLQPYLK---LRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQC 178

Query: 305 KL--QITWRTNLGEPGR----LQTQQILGTTIT---------SKEIELNVVEV------- 342
           K+  +I W  NL         L+  Q L   I          S  I+L +  +       
Sbjct: 179 KMDFKIFW-LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRL 237

Query: 343 ------------------PSVVGIDKPFLLKLK--LT----NQTDKEQGPFEIWLSQNDS 378
                                      F L  K  LT      TD         +S +  
Sbjct: 238 LKSKPYENCLLVLLNVQNAKAW---NAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHH 294

Query: 379 ------DEEKVVMIN--GLRIMALAPVEAFGSTDFHLNLIATKL--GVQRITGI--TVFD 426
                 DE K +++     R   L P E   +    L++IA  +  G+           D
Sbjct: 295 SMTLTPDEVKSLLLKYLDCRPQDL-PREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCD 353

Query: 427 KLEKI---TYDSLP---------DLEIFVD 444
           KL  I   + + L           L +F  
Sbjct: 354 KLTTIIESSLNVLEPAEYRKMFDRLSVFPP 383

PyMOL of 1vt4

>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221	Back alignment and structure

Structure Templates Detected by HHsearch

Original result of HHsearch against PDB70 database

ID	Alignment Graph	Length	Definition	Probability
Query		446
2xzz_A		102	Protein-glutamine gamma-glutamyltransferase K; 2.3	96.71
1ex0_A		731	Coagulation factor XIII A chain; transglutaminase,	95.97
3hrz_B		252	Cobra venom factor; serine protease, glycosilated,	95.52
3idu_A		127	Uncharacterized protein; all beta-protein, structu	93.94
1vjj_A		692	Protein-glutamine glutamyltransferase E; transglut	93.79
1g0d_A		695	Protein-glutamine gamma-glutamyltransferase; tissu	93.11
2q3z_A		687	Transglutaminase 2; transglutaminase 2, tissue tra	92.35
2qsv_A		220	Uncharacterized protein; MCSG, structural genomics	89.43
2ys4_A		122	Hydrocephalus-inducing protein homolog; hydin, PAP	89.34
4fxk_B		767	Complement C4-A alpha chain; immune system, proteo	89.17
4acq_A		1451	Alpha-2-macroglobulin; hydrolase inhibitor, protei	88.06
2hr0_B		915	Complement C3 alpha' chain; complement component C	87.89
3prx_B		1642	Cobra venom factor; immune system, complement, imm	87.8
3es6_B		118	Prolactin-inducible protein; major histocompatibil	86.88
2b39_A		1661	C3; thioester, immune defense, immune system; HET:	85.96
2xzz_A		102	Protein-glutamine gamma-glutamyltransferase K; 2.3	85.79
2pn5_A		1325	TEP1R, thioester-containing protein I; FULL-length	85.61
2ys4_A		122	Hydrocephalus-inducing protein homolog; hydin, PAP	81.88
1vjj_A		692	Protein-glutamine glutamyltransferase E; transglut	80.67
2l0d_A		114	Cell surface protein; structural genomics, northea	80.58
1ex0_A		731	Coagulation factor XIII A chain; transglutaminase,	80.21

>2xzz_A Protein-glutamine gamma-glutamyltransferase K; 2.30A {Homo sapiens}	Back alignment and structure

Probab=96.71  E-value=0.0038  Score=51.66  Aligned_cols=73  Identities=5%  Similarity=0.138  Sum_probs=60.5

Q ss_pred             ecCceEeeCCcEEEEEEEEeCCCCCcccEEEEEeeCCCCCcceEEEecccceeecccCCCCeeEEEEEEEecccceEEeC
Q 013275          341 EVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT  420 (446)
Q Consensus       341 ~~P~~v~l~~pF~v~~~v~N~s~r~~~~l~l~l~~~~~~~~~~~~~~G~s~~~Lg~L~P~~s~~~~L~l~pl~~Glq~is  420 (446)
                      +++...++++++.+++.++|--...+....+.++...      + ..+ ....++.+.||++..+.+.+.|..+|.++|-
T Consensus        11 ~v~g~~~v~~~l~v~vsf~NPL~~~L~~c~~~vEG~G------L-~~~-~~~~~~~v~pg~~~~~~~~~~P~~~G~~~L~   82 (102)
T 2xzz_A           11 TLLGAAVVGQECEVQIVFKNPLPVTLTNVVFRLEGSG------L-QRP-KILNVGDIGGNETVTLRQSFVPVRPGPRQLI   82 (102)
T ss_dssp             EESSCCCSSSCEEEEEEEECCSSSCBCSEEEEEEETT------T-EEE-EEEEECCBCTTCEEEEEEEECCCSCSSCCCE
T ss_pred             EECCCcccCCeEEEEEEEECCCCCcccCEEEEEECCC------C-Ccc-eEEEcCcCCCCCEEEEEEEEecCcccceEEE
Confidence            4566668999999999999997777777888998753      2 344 5567899999999999999999999998874


Q ss_pred             c
Q 013275          421 G  421 (446)
Q Consensus       421 g  421 (446)
                      .
T Consensus        83 a   83 (102)
T 2xzz_A           83 A   83 (102)
T ss_dssp             E
T ss_pred             E
Confidence            3

PyMOL of 2xzz

>1ex0_A Coagulation factor XIII A chain; transglutaminase, blood coagulation, mutant, W279F, oxyanion, transferase; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1evu_A 1fie_A 1f13_A 1ggt_A 1ggu_A 1ggy_A 1qrk_A	Back alignment and structure

>3hrz_B Cobra venom factor; serine protease, glycosilated, multi-domain, complement SYST convertase, complement alternate pathway; HET: NAG P6G; 2.20A {Naja kaouthia} PDB: 3frp_G* 3hs0_B*	Back alignment and structure

>3idu_A Uncharacterized protein; all beta-protein, structural genomics, PSI-2, protein structure initiative; 1.70A {Pyrococcus furiosus} PDB: 2kl6_A	Back alignment and structure

>1vjj_A Protein-glutamine glutamyltransferase E; transglutaminase 3, X-RAY crystallography, metalloenzyme, calcium ION; HET: GDP; 1.90A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1sgx_A* 1l9m_A 1l9n_A* 1nud_A 1nuf_A 1nug_A 1rle_A*	Back alignment and structure

>1g0d_A Protein-glutamine gamma-glutamyltransferase; tissue transglutaminase,acyltransferase; 2.50A {Pagrus major} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4	Back alignment and structure

>2q3z_A Transglutaminase 2; transglutaminase 2, tissue transglutaminase, TG2, transferas; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1kv3_A 3ly6_A*	Back alignment and structure

>2qsv_A Uncharacterized protein; MCSG, structural genomics, porphyromonas gingivalis W83, PSI protein structure initiative; 2.10A {Porphyromonas gingivalis}	Back alignment and structure

>2ys4_A Hydrocephalus-inducing protein homolog; hydin, PAPD-like, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens}	Back alignment and structure

>4fxk_B Complement C4-A alpha chain; immune system, proteolytic cascade; HET: NAG BMA; 3.60A {Homo sapiens} PDB: 4fxg_B*	Back alignment and structure

>4acq_A Alpha-2-macroglobulin; hydrolase inhibitor, proteinase inhibitor, irreversible PROT inhibitor, conformational change, blood plasma inhibitor; HET: MEQ NAG MAN; 4.30A {Homo sapiens}	Back alignment and structure

>2hr0_B Complement C3 alpha' chain; complement component C3B, immune system; HET: THC; 2.26A {Homo sapiens} PDB: 2icf_B* 2wii_B* 2win_B* 3g6j_B 3l5n_B* 2a73_B* 2i07_B* 2xwj_B* 2xwb_B* 2a74_C* 2ice_C* 2qki_C* 3l3o_F* 3nms_C* 3nsa_C* 3ohx_C* 3t4a_C 2ice_B* 3l3o_B* 3nms_B* ...	Back alignment and structure

>3prx_B Cobra venom factor; immune system, complement, immune SYS complex; HET: NAG; 4.30A {Naja kaouthia} PDB: 3pvm_B*	Back alignment and structure

>3es6_B Prolactin-inducible protein; major histocompatibility complex, protein-protein complex, P inducible protein, zinc 2-glycoprotein, ZAG-PIP complex; HET: NDG NAG BMA MAN P6G; 3.23A {Homo sapiens} SCOP: b.1.18.23	Back alignment and structure

>2b39_A C3; thioester, immune defense, immune system; HET: NAG BMA; 3.00A {Bos taurus}	Back alignment and structure

>2xzz_A Protein-glutamine gamma-glutamyltransferase K; 2.30A {Homo sapiens}	Back alignment and structure

>2pn5_A TEP1R, thioester-containing protein I; FULL-length mature peptide, immune system; HET: NAG; 2.70A {Anopheles gambiae}	Back alignment and structure

>2ys4_A Hydrocephalus-inducing protein homolog; hydin, PAPD-like, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens}	Back alignment and structure

>1vjj_A Protein-glutamine glutamyltransferase E; transglutaminase 3, X-RAY crystallography, metalloenzyme, calcium ION; HET: GDP; 1.90A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1sgx_A* 1l9m_A 1l9n_A* 1nud_A 1nuf_A 1nug_A 1rle_A*	Back alignment and structure

>2l0d_A Cell surface protein; structural genomics, northeast structural genomics consortiu PSI-2, protein structure initiative; NMR {Methanosarcina acetivorans}	Back alignment and structure

>1ex0_A Coagulation factor XIII A chain; transglutaminase, blood coagulation, mutant, W279F, oxyanion, transferase; 2.00A {Homo sapiens} SCOP: b.1.18.9 b.1.5.1 b.1.5.1 d.3.1.4 PDB: 1evu_A 1fie_A 1f13_A 1ggt_A 1ggu_A 1ggy_A 1qrk_A	Back alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

Original result of RPS-BLAST against SCOP70(version1.75) database

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch

Original result of HHsearch against SCOP70(version1.75) database

ID	Alignment Graph	Length	Definition	Probability
Query		446
d1vjja3		99	Transglutaminase, two C-terminal domains {Human (H	96.96
d1ex0a3		100	Transglutaminase, two C-terminal domains {Human (H	96.82
d1g0da3		101	Transglutaminase, two C-terminal domains {Red sea	96.75
d2q3za3		98	Transglutaminase, two C-terminal domains {Human (H	96.47
d1ex0a2		112	Transglutaminase, two C-terminal domains {Human (H	96.34
d1vjja2		115	Transglutaminase, two C-terminal domains {Human (H	95.95
d1g0da2		112	Transglutaminase, two C-terminal domains {Red sea	95.61
d2q3za2		114	Transglutaminase, two C-terminal domains {Human (H	95.19
d1vjja3		99	Transglutaminase, two C-terminal domains {Human (H	94.54
d2q3za3		98	Transglutaminase, two C-terminal domains {Human (H	94.52
d1g0da3		101	Transglutaminase, two C-terminal domains {Red sea	92.93
d3es6b1		118	Prolactin-inducible protein, PIP {Human (Homo sapi	92.52
d1ex0a3		100	Transglutaminase, two C-terminal domains {Human (H	91.01

>d1vjja3 b.1.5.1 (A:594-692) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]}	Back information, alignment and structure

class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: Transglutaminase, two C-terminal domains
family: Transglutaminase, two C-terminal domains
domain: Transglutaminase, two C-terminal domains
species: Human (Homo sapiens), TGase E3 [TaxId: 9606]

Probab=96.96  E-value=0.00089  Score=53.21  Aligned_cols=75  Identities=11%  Similarity=0.252  Sum_probs=60.7

Q ss_pred             EecCceEeeCCcEEEEEEEEeCCCCCcccEEEEEeeCCCCCcceEEEecccceeecccCCCCeeEEEEEEEecccceEEe
Q 013275          340 VEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI  419 (446)
Q Consensus       340 ~~~P~~v~l~~pF~v~~~v~N~s~r~~~~l~l~l~~~~~~~~~~~~~~G~s~~~Lg~L~P~~s~~~~L~l~pl~~Glq~i  419 (446)
                      .++|...++++++.++++++|--+..+.+-.+.++...      + +.+.....++.+.||++.++.+.+.|..+|.++|
T Consensus         5 I~v~~~~~v~~~~~v~vsf~NPL~~~L~~c~f~vEG~G------L-~~~~~~~~~~~v~p~~~~~~~~~~~P~~~G~~~l   77 (99)
T d1vjja3           5 LEVLNEARVRKPVNVQMLFSNPLDEPVRDCVLMVEGSG------L-LLGNLKIDVPTLGPKERSRVRFDILPSRSGTKQL   77 (99)
T ss_dssp             EEECSCCBTTSCEEEEEEEECCSSSCBCSEEEEEECTT------T-SSSCEEEEECCBCTTCEEEEEEEECCCSCEEEEE
T ss_pred             EEeCCCcCcCCeEEEEEEEECCCCCchhCEEEEEEeCC------C-CCccEEEecCccCCCCEEEEEEEEEcCCcccEEE
Confidence            35677788999999999999998888876888887652      1 2233345688899999999999999999999997


Q ss_pred             Cc
Q 013275          420 TG  421 (446)
Q Consensus       420 sg  421 (446)
                      -.
T Consensus        78 ~a   79 (99)
T d1vjja3          78 LA   79 (99)
T ss_dssp             EE
T ss_pred             EE
Confidence            43

PyMOL of d1vjja3

>d1ex0a3 b.1.5.1 (A:628-727) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]}	Back information, alignment and structure

>d1g0da3 b.1.5.1 (A:584-684) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]}	Back information, alignment and structure

>d2q3za3 b.1.5.1 (A:586-683) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]}	Back information, alignment and structure

>d1ex0a2 b.1.5.1 (A:516-627) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]}	Back information, alignment and structure

>d1vjja2 b.1.5.1 (A:479-593) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]}	Back information, alignment and structure

>d1g0da2 b.1.5.1 (A:472-583) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]}	Back information, alignment and structure

>d2q3za2 b.1.5.1 (A:472-585) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]}	Back information, alignment and structure

>d1vjja3 b.1.5.1 (A:594-692) Transglutaminase, two C-terminal domains {Human (Homo sapiens), TGase E3 [TaxId: 9606]}	Back information, alignment and structure

>d2q3za3 b.1.5.1 (A:586-683) Transglutaminase, two C-terminal domains {Human (Homo sapiens), tissue isozyme [TaxId: 9606]}	Back information, alignment and structure

>d1g0da3 b.1.5.1 (A:584-684) Transglutaminase, two C-terminal domains {Red sea bream (Chrysophrys major) [TaxId: 143350]}	Back information, alignment and structure

>d3es6b1 b.1.18.23 (B:1-118) Prolactin-inducible protein, PIP {Human (Homo sapiens) [TaxId: 9606]}	Back information, alignment and structure

>d1ex0a3 b.1.5.1 (A:628-727) Transglutaminase, two C-terminal domains {Human (Homo sapiens), blood isozyme [TaxId: 9606]}	Back information, alignment and structure