Citrus Sinensis ID: 047859
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 214 | ||||||
| 225440274 | 250 | PREDICTED: uncharacterized protein LOC10 | 0.925 | 0.792 | 0.67 | 2e-76 | |
| 224089857 | 255 | predicted protein [Populus trichocarpa] | 0.976 | 0.819 | 0.647 | 5e-74 | |
| 147807424 | 255 | hypothetical protein VITISV_012853 [Viti | 0.934 | 0.784 | 0.671 | 5e-71 | |
| 255565374 | 253 | conserved hypothetical protein [Ricinus | 0.925 | 0.782 | 0.671 | 1e-70 | |
| 357509113 | 252 | hypothetical protein MTR_7g088210 [Medic | 0.953 | 0.809 | 0.655 | 6e-67 | |
| 15226837 | 261 | uncharacterized protein [Arabidopsis tha | 0.915 | 0.750 | 0.585 | 4e-63 | |
| 363814547 | 248 | uncharacterized protein LOC100785613 [Gl | 0.789 | 0.681 | 0.682 | 8e-62 | |
| 297827749 | 261 | hypothetical protein ARALYDRAFT_483180 [ | 0.794 | 0.651 | 0.653 | 4e-61 | |
| 449437650 | 247 | PREDICTED: uncharacterized protein LOC10 | 0.915 | 0.793 | 0.577 | 5e-60 | |
| 115483975 | 275 | Os11g0130200 [Oryza sativa Japonica Grou | 0.789 | 0.614 | 0.593 | 1e-55 |
| >gi|225440274|ref|XP_002278905.1| PREDICTED: uncharacterized protein LOC100267181 [Vitis vinifera] gi|297741746|emb|CBI32878.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 290 bits (743), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 134/200 (67%), Positives = 162/200 (81%), Gaps = 2/200 (1%)
Query: 13 HTTNFKSFRVLYRYSAKEEDDNDGENCSFDEAVALFNERAYYKCHDCLESLWYTAEEPTR 72
HT+ F + YR+SA E+++ D C+FD+AVALFN R YY+CHD LE +W AE+P R
Sbjct: 49 HTSTFHLLPISYRFSAVEDENED--ECNFDDAVALFNSRDYYRCHDFLEEIWNRAEDPVR 106
Query: 73 TLIHGVLQCAVGFYHLFNQNHKGAMMELGEGLGKLRKMNLRSGPFHEFENEISAALEFIY 132
TL+HG+LQCAVG +HLFNQNH+GAMMELGEGL KLRKMN +GPF +FE EISA LEFIY
Sbjct: 107 TLVHGILQCAVGLHHLFNQNHRGAMMELGEGLCKLRKMNFETGPFLQFEQEISAVLEFIY 166
Query: 133 RTQIELAACADDICLAMDQSERSYQLLGDYAAGQQLYHLESDHNQIMYIVFDPQRSYGSD 192
+TQ+ELAAC DD C+AMDQSERSYQLLG Y AGQ+LYHLE D N+I+YIVF P+RS G+
Sbjct: 167 QTQLELAACTDDFCIAMDQSERSYQLLGGYGAGQRLYHLEGDPNEIVYIVFSPERSQGTG 226
Query: 193 DKSIKVKLPTLNATEEHLMA 212
+ +VKLP LNATEE+LMA
Sbjct: 227 ETPRRVKLPILNATEEYLMA 246
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224089857|ref|XP_002308839.1| predicted protein [Populus trichocarpa] gi|222854815|gb|EEE92362.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|147807424|emb|CAN70760.1| hypothetical protein VITISV_012853 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255565374|ref|XP_002523678.1| conserved hypothetical protein [Ricinus communis] gi|223537078|gb|EEF38713.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|357509113|ref|XP_003624845.1| hypothetical protein MTR_7g088210 [Medicago truncatula] gi|87162857|gb|ABD28652.1| Protein of unknown function DUF309 [Medicago truncatula] gi|355499860|gb|AES81063.1| hypothetical protein MTR_7g088210 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|15226837|ref|NP_181645.1| uncharacterized protein [Arabidopsis thaliana] gi|3402705|gb|AAD11999.1| hypothetical protein [Arabidopsis thaliana] gi|38564300|gb|AAR23729.1| At2g41120 [Arabidopsis thaliana] gi|38604032|gb|AAR24759.1| At2g41120 [Arabidopsis thaliana] gi|330254840|gb|AEC09934.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|363814547|ref|NP_001242712.1| uncharacterized protein LOC100785613 [Glycine max] gi|255647305|gb|ACU24119.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297827749|ref|XP_002881757.1| hypothetical protein ARALYDRAFT_483180 [Arabidopsis lyrata subsp. lyrata] gi|297327596|gb|EFH58016.1| hypothetical protein ARALYDRAFT_483180 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|449437650|ref|XP_004136604.1| PREDICTED: uncharacterized protein LOC101210744 [Cucumis sativus] gi|449511488|ref|XP_004163968.1| PREDICTED: uncharacterized protein LOC101226155 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|115483975|ref|NP_001065649.1| Os11g0130200 [Oryza sativa Japonica Group] gi|77548513|gb|ABA91310.1| expressed protein [Oryza sativa Japonica Group] gi|113644353|dbj|BAF27494.1| Os11g0130200 [Oryza sativa Japonica Group] gi|215765304|dbj|BAG87001.1| unnamed protein product [Oryza sativa Japonica Group] gi|218185160|gb|EEC67587.1| hypothetical protein OsI_34952 [Oryza sativa Indica Group] gi|222615445|gb|EEE51577.1| hypothetical protein OsJ_32810 [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 214 | ||||||
| TAIR|locus:2063213 | 261 | AT2G41120 "AT2G41120" [Arabido | 0.915 | 0.750 | 0.589 | 4.3e-61 | |
| UNIPROTKB|Q74CK9 | 137 | GSU1664 "Uncharacterized prote | 0.317 | 0.496 | 0.397 | 4.2e-08 | |
| TIGR_CMR|GSU_1664 | 137 | GSU_1664 "conserved hypothetic | 0.317 | 0.496 | 0.397 | 4.2e-08 |
| TAIR|locus:2063213 AT2G41120 "AT2G41120" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 625 (225.1 bits), Expect = 4.3e-61, P = 4.3e-61
Identities = 128/217 (58%), Positives = 159/217 (73%)
Query: 14 TTNFKSFRVLY------RYS---AKEEDDNDG---ENCSFDEAVALFNERAYYKCHDCLE 61
+T F S RV + RY+ ++E+D+D E SF+EAV LFN+R YYK HD LE
Sbjct: 41 STKFSSIRVRFSRRSIVRYNDHAGEDEEDSDEDKEEYWSFEEAVTLFNKRDYYKSHDALE 100
Query: 62 SLWYTAEEPTRTLIHGVLQCAVGFYHLFNQNHKGAMMELGEGLGKLRKMNLRSGPFHEFE 121
+LW AEEPTRTLIHG+LQCAVGF+HLFN NHKGAMMELGEG+ KLRKMN GPFHEFE
Sbjct: 101 ALWIQAEEPTRTLIHGILQCAVGFHHLFNNNHKGAMMELGEGVCKLRKMNFEDGPFHEFE 160
Query: 122 NEISAALEFIYRTQIELAACADDICLAMDQSERSYQLLGDYAAGQQLYHLES--DHNQIM 179
++SA LEF+Y+TQ+ELAAC++D+CL MDQS+RSYQLLG YAAG+ +Y LE+ D N M
Sbjct: 161 RDVSAVLEFVYQTQLELAACSEDMCLTMDQSDRSYQLLGGYAAGESIYSLETVLDFNNGM 220
Query: 180 Y----IVFDPQRSYGSDDKSIKVKLPTLNATEEHLMA 212
I+F P S + +VKLPTL AT++HL+A
Sbjct: 221 SETTSILFSPLHS---SSEPTRVKLPTLTATDKHLLA 254
|
|
| UNIPROTKB|Q74CK9 GSU1664 "Uncharacterized protein" [Geobacter sulfurreducens PCA (taxid:243231)] | Back alignment and assigned GO terms |
|---|
| TIGR_CMR|GSU_1664 GSU_1664 "conserved hypothetical protein" [Geobacter sulfurreducens PCA (taxid:243231)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 214 | |||
| pfam03745 | 55 | pfam03745, DUF309, Domain of unknown function (DUF | 4e-18 | |
| COG1547 | 156 | COG1547, COG1547, Predicted metal-dependent hydrol | 2e-05 |
| >gnl|CDD|202755 pfam03745, DUF309, Domain of unknown function (DUF309) | Back alignment and domain information |
|---|
Score = 74.5 bits (184), Expect = 4e-18
Identities = 23/54 (42%), Positives = 30/54 (55%)
Query: 40 SFDEAVALFNERAYYKCHDCLESLWYTAEEPTRTLIHGVLQCAVGFYHLFNQNH 93
+ E V LFN Y++ H+ LE LW A P R L+ G++Q AV YHL N
Sbjct: 2 ALREGVELFNAGDYFEAHEVLEELWKAAPGPERDLLQGLIQLAVALYHLRRGNL 55
|
This domain is found in eubacterial and archaebacterial proteins of unknown function. The proteins contain a motif HXXXEXX(W/Y) where X can be any amino acid. This motif is likely to be functionally important and may be involved in metal binding. Length = 55 |
| >gnl|CDD|224464 COG1547, COG1547, Predicted metal-dependent hydrolase [General function prediction only] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 214 | |||
| PF03745 | 62 | DUF309: Domain of unknown function (DUF309); Inter | 99.93 | |
| COG1547 | 156 | Predicted metal-dependent hydrolase [General funct | 99.44 | |
| COG1547 | 156 | Predicted metal-dependent hydrolase [General funct | 96.93 | |
| PF09670 | 379 | Cas_Cas02710: CRISPR-associated protein (Cas_Cas02 | 92.52 | |
| PF13432 | 65 | TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL | 91.57 | |
| cd00189 | 100 | TPR Tetratricopeptide repeat domain; typically con | 88.71 | |
| PF12862 | 94 | Apc5: Anaphase-promoting complex subunit 5 | 87.19 | |
| TIGR03302 | 235 | OM_YfiO outer membrane assembly lipoprotein YfiO. | 85.61 | |
| TIGR02710 | 380 | CRISPR-associated protein, TIGR02710 family. Membe | 85.34 | |
| PRK02603 | 172 | photosystem I assembly protein Ycf3; Provisional | 84.2 | |
| PF13525 | 203 | YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M | 82.56 | |
| PF13424 | 78 | TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15 | 81.2 | |
| TIGR02795 | 119 | tol_pal_ybgF tol-pal system protein YbgF. Members | 80.1 |
| >PF03745 DUF309: Domain of unknown function (DUF309); InterPro: IPR005500 This family consists of eubacterial and archaebacterial proteins of unknown function | Back alignment and domain information |
|---|
Probab=99.93 E-value=7.4e-26 Score=162.36 Aligned_cols=62 Identities=37% Similarity=0.724 Sum_probs=57.3
Q ss_pred hhHHHHHHHhccCCchhhhhhhHHhhccCCcchhhHHHHHHHHHHHHHHHhcCCHHHHHHHH
Q 047859 39 CSFDEAVALFNERAYYKCHDCLESLWYTAEEPTRTLIHGVLQCAVGFYHLFNQNHKGAMMEL 100 (214)
Q Consensus 39 ~~l~~gi~LFN~G~YfEAHEVLEe~Wk~~~g~er~~lqGLIQlAvAl~H~~rGN~~GA~~Ll 100 (214)
+.|..|+.+||+|+|||||||||++|+..+++++.+||||||+|||+||+++||++||++|+
T Consensus 1 ~~~~~~~~l~n~g~f~EaHEvlE~~W~~~~~~~~~~lqglIq~A~a~~h~~~gn~~gA~~l~ 62 (62)
T PF03745_consen 1 EALEEGIELFNAGDFFEAHEVLEELWKAAPGPERDFLQGLIQLAVALYHLRRGNPRGARRLL 62 (62)
T ss_dssp -HHHHHHHHHHTT-HHHHHHHHHHHCCCT-CCHHHHHHHHHHHHHHHHHHHCTSHHHHHHHH
T ss_pred CHHHHHHHHHcCCCHHHhHHHHHHHHHHCCcchHHHHHHHHHHHHHHHHHHhCCHHHHHHhC
Confidence 47899999999999999999999999999888999999999999999999999999999986
|
The proteins contain a motif HXXXEXX(W/Y) where X can be any amino acid. This motif is likely to be functionally important and may be involved in metal binding.; PDB: 2CXD_B 2CWY_A 2IJQ_B. |
| >COG1547 Predicted metal-dependent hydrolase [General function prediction only] | Back alignment and domain information |
|---|
| >COG1547 Predicted metal-dependent hydrolase [General function prediction only] | Back alignment and domain information |
|---|
| >PF09670 Cas_Cas02710: CRISPR-associated protein (Cas_Cas02710) | Back alignment and domain information |
|---|
| >PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B | Back alignment and domain information |
|---|
| >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C | Back alignment and domain information |
|---|
| >PF12862 Apc5: Anaphase-promoting complex subunit 5 | Back alignment and domain information |
|---|
| >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO | Back alignment and domain information |
|---|
| >TIGR02710 CRISPR-associated protein, TIGR02710 family | Back alignment and domain information |
|---|
| >PRK02603 photosystem I assembly protein Ycf3; Provisional | Back alignment and domain information |
|---|
| >PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A | Back alignment and domain information |
|---|
| >PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H | Back alignment and domain information |
|---|
| >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 214 | |||
| 2cwy_A | 94 | Hypothetical protein TTHA0068; structural genomics | 1e-20 | |
| 2ijq_A | 161 | Hypothetical protein; structural genomics, PSI, pr | 1e-17 |
| >2cwy_A Hypothetical protein TTHA0068; structural genomics, conserved hypothetical protein, NPPSFA; 1.85A {Thermus thermophilus} SCOP: a.246.2.1 PDB: 2cxd_A Length = 94 | Back alignment and structure |
|---|
Score = 81.8 bits (202), Expect = 1e-20
Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 4/91 (4%)
Query: 41 FDEAVALFNERAYYKCHDCLESLWYTAEEPTRTLIHGVLQCAVGFYHLFNQNHKGAMMEL 100
++E + L+ YY+ H+ LE W A R L+ GV+ A + + + + L
Sbjct: 5 WEEVLGLWRAGRYYEVHEVLEPYWLKATGEERRLLQGVILLAAALHQ--RRLGRPGLRNL 62
Query: 101 GEGLGKLRKMNLRSGPF--HEFENEISAALE 129
+ +L + E L
Sbjct: 63 RKAEARLEGLPCPLMGLDWRSLLQEARRRLG 93
|
| >2ijq_A Hypothetical protein; structural genomics, PSI, protein structure initiative, NEW YORK SGX research center for structural genomics; 1.88A {Haloarcula marismortui} SCOP: a.246.2.1 Length = 161 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 214 | |||
| 2ijq_A | 161 | Hypothetical protein; structural genomics, PSI, pr | 100.0 | |
| 2cwy_A | 94 | Hypothetical protein TTHA0068; structural genomics | 99.96 | |
| 3ro3_A | 164 | PINS homolog, G-protein-signaling modulator 2; asy | 92.82 | |
| 2v5f_A | 104 | Prolyl 4-hydroxylase subunit alpha-1; endoplasmic | 91.79 | |
| 3ro3_A | 164 | PINS homolog, G-protein-signaling modulator 2; asy | 91.71 | |
| 2kc7_A | 99 | BFR218_protein; tetratricopeptide repeat, all-alph | 90.74 | |
| 3gw4_A | 203 | Uncharacterized protein; structural genomics, PSI- | 90.55 | |
| 1hxi_A | 121 | PEX5, peroxisome targeting signal 1 receptor PEX5; | 90.5 | |
| 1elw_A | 118 | TPR1-domain of HOP; HOP, TPR-domain, peptide-compl | 90.47 | |
| 3gw4_A | 203 | Uncharacterized protein; structural genomics, PSI- | 90.02 | |
| 2l6j_A | 111 | TPR repeat-containing protein associated with HSP; | 89.9 | |
| 2dba_A | 148 | Smooth muscle cell associated protein-1, isoform 2 | 89.63 | |
| 1na3_A | 91 | Designed protein CTPR2; de novo protein; HET: IPT; | 89.56 | |
| 2lni_A | 133 | Stress-induced-phosphoprotein 1; structural genomi | 89.23 | |
| 3sz7_A | 164 | HSC70 cochaperone (SGT); TPR domain, GET4, GET5, G | 89.13 | |
| 3upv_A | 126 | Heat shock protein STI1; TPR-fold, adaptor protein | 89.04 | |
| 3nf1_A | 311 | KLC 1, kinesin light chain 1; TPR, structural geno | 88.86 | |
| 2fbn_A | 198 | 70 kDa peptidylprolyl isomerase, putative; sulfur | 88.63 | |
| 3q49_B | 137 | STIP1 homology and U box-containing protein 1; E3 | 88.49 | |
| 3qky_A | 261 | Outer membrane assembly lipoprotein YFIO; membrane | 88.46 | |
| 1elr_A | 131 | TPR2A-domain of HOP; HOP, TPR-domain, peptide-comp | 88.14 | |
| 2kck_A | 112 | TPR repeat; tetratricopeptide repeat, structural g | 88.09 | |
| 2xev_A | 129 | YBGF; tetratricopeptide, alpha-helical, metal bind | 87.85 | |
| 2xcb_A | 142 | PCRH, regulatory protein PCRH; protein transport, | 87.81 | |
| 3ro2_A | 338 | PINS homolog, G-protein-signaling modulator 2; TPR | 87.73 | |
| 2vyi_A | 131 | SGTA protein; chaperone, TPR repeat, phosphoprotei | 87.72 | |
| 2vgx_A | 148 | Chaperone SYCD; alternative dimer assembly, tetrat | 86.69 | |
| 3ma5_A | 100 | Tetratricopeptide repeat domain protein; NESG, str | 86.58 | |
| 1elr_A | 131 | TPR2A-domain of HOP; HOP, TPR-domain, peptide-comp | 85.75 | |
| 3k9i_A | 117 | BH0479 protein; putative protein binding protein, | 84.82 | |
| 4a1s_A | 411 | PINS, partner of inscuteable; cell cycle, LGN, mit | 84.28 | |
| 1na0_A | 125 | Designed protein CTPR3; de novo protein; HET: IPT; | 84.15 | |
| 3rkv_A | 162 | Putative peptidylprolyl isomerase; structural geno | 83.47 | |
| 3ulq_A | 383 | Response regulator aspartate phosphatase F; tetrat | 83.22 | |
| 2fo7_A | 136 | Synthetic consensus TPR protein; tetratricopeptide | 82.95 | |
| 2ho1_A | 252 | Type 4 fimbrial biogenesis protein PILF; type IV p | 82.64 | |
| 3ro2_A | 338 | PINS homolog, G-protein-signaling modulator 2; TPR | 82.56 | |
| 3gyz_A | 151 | Chaperone protein IPGC; asymmetric homodimer, tetr | 82.53 | |
| 1a17_A | 166 | Serine/threonine protein phosphatase 5; hydrolase, | 82.4 | |
| 2xev_A | 129 | YBGF; tetratricopeptide, alpha-helical, metal bind | 82.39 | |
| 4gcn_A | 127 | Protein STI-1; structural genomics, PSI-biology, m | 82.36 | |
| 3urz_A | 208 | Uncharacterized protein; tetratricopeptide repeats | 81.74 | |
| 2kat_A | 115 | Uncharacterized protein; NESG, structure, structur | 81.65 | |
| 2vyi_A | 131 | SGTA protein; chaperone, TPR repeat, phosphoprotei | 81.34 | |
| 3sf4_A | 406 | G-protein-signaling modulator 2; tetratricopeptide | 81.32 | |
| 1na0_A | 125 | Designed protein CTPR3; de novo protein; HET: IPT; | 81.29 | |
| 4gco_A | 126 | Protein STI-1; structural genomics, PSI-biology, m | 80.81 | |
| 3q15_A | 378 | PSP28, response regulator aspartate phosphatase H; | 80.37 | |
| 3u4t_A | 272 | TPR repeat-containing protein; structural genomics | 80.04 | |
| 3cv0_A | 327 | Peroxisome targeting signal 1 receptor PEX5; TPR m | 80.03 |
| >2ijq_A Hypothetical protein; structural genomics, PSI, protein structure initiative, NEW YORK SGX research center for structural genomics; 1.88A {Haloarcula marismortui} SCOP: a.246.2.1 | Back alignment and structure |
|---|
Probab=100.00 E-value=3.6e-34 Score=237.78 Aligned_cols=120 Identities=16% Similarity=0.265 Sum_probs=110.3
Q ss_pred CCcchhHHHHHHHhccCCchhhhhhhHHhhccCCcc--hhhHHHHHHH-HHHHHHHHhcCCHHHHHHHHHHHHHHHhhcc
Q 047859 35 DGENCSFDEAVALFNERAYYKCHDCLESLWYTAEEP--TRTLIHGVLQ-CAVGFYHLFNQNHKGAMMELGEGLGKLRKMN 111 (214)
Q Consensus 35 ~e~~~~l~~gi~LFN~G~YfEAHEVLEe~Wk~~~g~--er~~lqGLIQ-lAvAl~H~~rGN~~GA~~Ll~rAl~~L~~~~ 111 (214)
.+..++|.+|+.|||+|+||||||+||++|+..+++ +|.||||||| +|||+||+++||++||++|+++|+.+|+.++
T Consensus 30 ~~~~~~~~~~i~lFn~g~yfeaHEvLEe~W~~~~~~~~er~~lqGLIQ~lAvAl~H~~rgN~~GA~~ll~~Al~~L~~~~ 109 (161)
T 2ijq_A 30 ETLRRAVVHGVRLYNSGEFHESHDCFEDEWYNYGRGNTESKFLHGMVQVAAGAYKHFDFEDDDGMRSLFRTSLQYFRGVP 109 (161)
T ss_dssp HHHHHHHHHHHHHHHTTCHHHHHHHHHHHTTTTCSSSHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHTTSC
T ss_pred CCccHHHHHHHHHHhCCCchHHHHHHHHHHHhcCCCchHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhCC
Confidence 457789999999999999999999999999999876 9999999999 9999999999999999999999999999984
Q ss_pred CCCCCCcccccCHHHHHHHHHHhhHHHHhcccCceeeecCcchhHHhhh
Q 047859 112 LRSGPFHEFENEISAALEFIYRTQIELAACADDICLAMDQSERSYQLLG 160 (214)
Q Consensus 112 ~~~~p~~~~GvDV~aLl~~i~~l~~Elaa~~~d~~l~~dgs~~~~~~Lg 160 (214)
..++|||+..|+.++.++..+...+ ++|||++||+++++++|+
T Consensus 110 -----~~~~GIDv~~L~~~~~~~l~dp~~~-~~P~l~~dg~~~~~~~~~ 152 (161)
T 2ijq_A 110 -----NDYYGVDLLDVRTTVTNALSDPSAL-HGWQIRLDGEYPTCRPED 152 (161)
T ss_dssp -----TTGGGBCHHHHHHHHHHHHHCGGGG-TTCCCCBTTBCCCCCHHH
T ss_pred -----CccCCcCHHHHHHHHHHHhhCCccC-CCCeEEecCCCCCCccCC
Confidence 3579999999999999988654444 499999999999999997
|
| >2cwy_A Hypothetical protein TTHA0068; structural genomics, conserved hypothetical protein, NPPSFA; 1.85A {Thermus thermophilus} SCOP: a.246.2.1 PDB: 2cxd_A | Back alignment and structure |
|---|
| >3ro3_A PINS homolog, G-protein-signaling modulator 2; asymmetric cell division, protein binding; 1.10A {Mus musculus} | Back alignment and structure |
|---|
| >2v5f_A Prolyl 4-hydroxylase subunit alpha-1; endoplasmic reticulum, metal-binding, oxidoreductase; 2.03A {Homo sapiens} PDB: 1tjc_A | Back alignment and structure |
|---|
| >3ro3_A PINS homolog, G-protein-signaling modulator 2; asymmetric cell division, protein binding; 1.10A {Mus musculus} | Back alignment and structure |
|---|
| >2kc7_A BFR218_protein; tetratricopeptide repeat, all-alpha, GFT-structural genomics, PSI-2, protein structure initiative; NMR {Bacteroides fragilis} | Back alignment and structure |
|---|
| >3gw4_A Uncharacterized protein; structural genomics, PSI-2, protein structure initiative, no structural genomics consortium, NESG, DRR162B; 2.49A {Deinococcus radiodurans R1} | Back alignment and structure |
|---|
| >1hxi_A PEX5, peroxisome targeting signal 1 receptor PEX5; alpha helical, transport protein; 1.60A {Trypanosoma brucei} SCOP: a.118.8.1 | Back alignment and structure |
|---|
| >1elw_A TPR1-domain of HOP; HOP, TPR-domain, peptide-complex, helical repeat, HSP70, protein binding, chaperone; 1.60A {Homo sapiens} SCOP: a.118.8.1 | Back alignment and structure |
|---|
| >3gw4_A Uncharacterized protein; structural genomics, PSI-2, protein structure initiative, no structural genomics consortium, NESG, DRR162B; 2.49A {Deinococcus radiodurans R1} | Back alignment and structure |
|---|
| >2l6j_A TPR repeat-containing protein associated with HSP; tetratricopeptide repeat (TPR), HSP90 CO-factor, protein BIN; NMR {Saccharomyces cerevisiae} | Back alignment and structure |
|---|
| >2dba_A Smooth muscle cell associated protein-1, isoform 2; tetratricopeptide repeat, structural genomics, NPPSFA; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >1na3_A Designed protein CTPR2; de novo protein; HET: IPT; 1.55A {Unidentified} SCOP: k.38.1.1 PDB: 2avp_A | Back alignment and structure |
|---|
| >2lni_A Stress-induced-phosphoprotein 1; structural genomics, northeast structural genomics consortiu PSI-biology, protein structure initiative, chaperone; NMR {Homo sapiens} | Back alignment and structure |
|---|
| >3sz7_A HSC70 cochaperone (SGT); TPR domain, GET4, GET5, GET3, MDY2, SSA1, SSE1, chaperone regulator; 1.72A {Aspergillus fumigatus} | Back alignment and structure |
|---|
| >3upv_A Heat shock protein STI1; TPR-fold, adaptor protein for HSP70 and HSP90, C-terminal PA HSP70, peptide binding protein; 1.60A {Saccharomyces cerevisiae} | Back alignment and structure |
|---|
| >3nf1_A KLC 1, kinesin light chain 1; TPR, structural genomics consortium (SGC), motor PR transport protein; 2.80A {Homo sapiens} | Back alignment and structure |
|---|
| >2fbn_A 70 kDa peptidylprolyl isomerase, putative; sulfur SAD, PFL2275C, TPR-containing domain, structural genomics; 1.63A {Plasmodium falciparum} SCOP: a.118.8.1 | Back alignment and structure |
|---|
| >3q49_B STIP1 homology and U box-containing protein 1; E3 ubiquitin ligase, ligase-chaperone complex; 1.54A {Mus musculus} PDB: 3q47_B 3q4a_B* | Back alignment and structure |
|---|
| >3qky_A Outer membrane assembly lipoprotein YFIO; membrane protein; 2.15A {Rhodothermus marinus} | Back alignment and structure |
|---|
| >1elr_A TPR2A-domain of HOP; HOP, TPR-domain, peptide-complex, helical repeat, protein binding, chaperone; 1.90A {Homo sapiens} SCOP: a.118.8.1 PDB: 3esk_A 3fwv_A | Back alignment and structure |
|---|
| >2kck_A TPR repeat; tetratricopeptide repeat, structural genomics, unknown function, PSI-2, protein structure initiative; NMR {Methanococcus maripaludis} | Back alignment and structure |
|---|
| >2xev_A YBGF; tetratricopeptide, alpha-helical, metal binding; 1.57A {Xanthomonas campestris} | Back alignment and structure |
|---|
| >2xcb_A PCRH, regulatory protein PCRH; protein transport, bacterial toxin, type III secretion, protein binding; 1.85A {Pseudomonas aeruginosa} PDB: 2xcc_A | Back alignment and structure |
|---|
| >3ro2_A PINS homolog, G-protein-signaling modulator 2; TPR repeat, protein-protein interaction, protein-binding, PR binding; 2.30A {Mus musculus} | Back alignment and structure |
|---|
| >2vyi_A SGTA protein; chaperone, TPR repeat, phosphoprotein, tetratricopeptide repeat protein, HOST-virus interaction; 2.4A {Homo sapiens} SCOP: k.38.1.1 | Back alignment and structure |
|---|
| >2vgx_A Chaperone SYCD; alternative dimer assembly, tetratricopeptide repeat, type III secretion; HET: MLY; 1.95A {Yersinia enterocolitica} SCOP: k.38.1.1 PDB: 2vgx_B* 2vgy_A* | Back alignment and structure |
|---|
| >3ma5_A Tetratricopeptide repeat domain protein; NESG, structural genomics, PSI-2, protein structure initiative; 2.80A {Salinibacter ruber} PDB: 2kcl_A 2kcv_A | Back alignment and structure |
|---|
| >1elr_A TPR2A-domain of HOP; HOP, TPR-domain, peptide-complex, helical repeat, protein binding, chaperone; 1.90A {Homo sapiens} SCOP: a.118.8.1 PDB: 3esk_A 3fwv_A | Back alignment and structure |
|---|
| >3k9i_A BH0479 protein; putative protein binding protein, structural genomics, joint for structural genomics, JCSG; 2.71A {Bacillus halodurans} | Back alignment and structure |
|---|
| >4a1s_A PINS, partner of inscuteable; cell cycle, LGN, mitotic spindle orientation, asymmetric CEL divisions; 2.10A {Drosophila melanogaster} | Back alignment and structure |
|---|
| >1na0_A Designed protein CTPR3; de novo protein; HET: IPT; 1.60A {Unidentified} SCOP: k.38.1.1 PDB: 2wqh_A 3kd7_A | Back alignment and structure |
|---|
| >3rkv_A Putative peptidylprolyl isomerase; structural genomics, APC102156, PSI-biology, midwest center structural genomics, MCSG; 2.41A {Caenorhabditis elegans} | Back alignment and structure |
|---|
| >3ulq_A Response regulator aspartate phosphatase F; tetratricopeptide repeat, response regulator helix-turn-HELX binding, 3-helix bundle; 2.30A {Bacillus subtilis} | Back alignment and structure |
|---|
| >2fo7_A Synthetic consensus TPR protein; tetratricopeptide repeat, consensus protein, superhelix, de novo protein; 2.30A {Synthetic} SCOP: k.38.1.1 PDB: 2hyz_A | Back alignment and structure |
|---|
| >2ho1_A Type 4 fimbrial biogenesis protein PILF; type IV pilus biogenesis, TPR, superhelix, protein binding; HET: MSE; 2.00A {Pseudomonas aeruginosa} PDB: 2fi7_A | Back alignment and structure |
|---|
| >3ro2_A PINS homolog, G-protein-signaling modulator 2; TPR repeat, protein-protein interaction, protein-binding, PR binding; 2.30A {Mus musculus} | Back alignment and structure |
|---|
| >3gyz_A Chaperone protein IPGC; asymmetric homodimer, tetratricopeptide repeat, TPR, chapero virulence; 2.15A {Shigella flexneri} PDB: 3gz1_A 3gz2_A 3ks2_A | Back alignment and structure |
|---|
| >1a17_A Serine/threonine protein phosphatase 5; hydrolase, protein-protein interactions, TPR, S helix; 2.45A {Homo sapiens} SCOP: a.118.8.1 PDB: 2bug_A | Back alignment and structure |
|---|
| >2xev_A YBGF; tetratricopeptide, alpha-helical, metal binding; 1.57A {Xanthomonas campestris} | Back alignment and structure |
|---|
| >4gcn_A Protein STI-1; structural genomics, PSI-biology, midwest center for structu genomics, MCSG, tetratricopeptide repeat domain; HET: PGE; 1.85A {Caenorhabditis elegans} | Back alignment and structure |
|---|
| >3urz_A Uncharacterized protein; tetratricopeptide repeats (TPR) containing protein, structur genomics, joint center for structural genomics, JCSG; HET: PG4; 2.19A {Bacteroides ovatus} | Back alignment and structure |
|---|
| >2kat_A Uncharacterized protein; NESG, structure, structural genomics, PSI-2, protein structure initiative; NMR {Bordetella parapertussis} | Back alignment and structure |
|---|
| >2vyi_A SGTA protein; chaperone, TPR repeat, phosphoprotein, tetratricopeptide repeat protein, HOST-virus interaction; 2.4A {Homo sapiens} SCOP: k.38.1.1 | Back alignment and structure |
|---|
| >3sf4_A G-protein-signaling modulator 2; tetratricopeptide repeat, TPR, cell polarity, asymmetric CEL division, mitotic spindle orientation; 2.60A {Homo sapiens} | Back alignment and structure |
|---|
| >1na0_A Designed protein CTPR3; de novo protein; HET: IPT; 1.60A {Unidentified} SCOP: k.38.1.1 PDB: 2wqh_A 3kd7_A | Back alignment and structure |
|---|
| >4gco_A Protein STI-1; structural genomics, PSI-biology, midwest center for structu genomics, MCSG, tetratricopeptide repeat domain; 1.60A {Caenorhabditis elegans} | Back alignment and structure |
|---|
| >3q15_A PSP28, response regulator aspartate phosphatase H; tetratricopeptide repeat, 3-helix bundle, phosphorelay signa transduction, phosphatase; 2.19A {Bacillus subtilis} | Back alignment and structure |
|---|
| >3u4t_A TPR repeat-containing protein; structural genomics, PSI- protein structure initiative, northeast structural genomics consortium, NESG; 2.28A {Cytophaga hutchinsonii} | Back alignment and structure |
|---|
| >3cv0_A Peroxisome targeting signal 1 receptor PEX5; TPR motifs, TPR protein, peroxin 5, PEX5, PTS1 binding domain, protein-peptide complex, receptor; 2.00A {Trypanosoma brucei} PDB: 3cvl_A 3cvn_A 3cvp_A 3cvq_A | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 214 | ||||
| d2cwya1 | 94 | a.246.2.1 (A:1-94) Hypothetical protein TTHA0068 { | 2e-19 | |
| d2ijqa1 | 145 | a.246.2.1 (A:14-158) Hypothetical protein rrnAC103 | 1e-17 |
| >d2cwya1 a.246.2.1 (A:1-94) Hypothetical protein TTHA0068 {Thermus thermophilus [TaxId: 274]} Length = 94 | Back information, alignment and structure |
|---|
class: All alpha proteins fold: Hyaluronidase domain-like superfamily: TTHA0068-like family: TTHA0068-like domain: Hypothetical protein TTHA0068 species: Thermus thermophilus [TaxId: 274]
Score = 77.1 bits (190), Expect = 2e-19
Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 2/69 (2%)
Query: 41 FDEAVALFNERAYYKCHDCLESLWYTAEEPTRTLIHGVLQCAVGFYHLFNQNHKGAMMEL 100
++E + L+ YY+ H+ LE W A R L+ GV+ A + + + + L
Sbjct: 5 WEEVLGLWRAGRYYEVHEVLEPYWLKATGEERRLLQGVILLAAALHQR--RLGRPGLRNL 62
Query: 101 GEGLGKLRK 109
+ +L
Sbjct: 63 RKAEARLEG 71
|
| >d2ijqa1 a.246.2.1 (A:14-158) Hypothetical protein rrnAC1037 {Haloarcula marismortui [TaxId: 2238]} Length = 145 | Back information, alignment and structure |
|---|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 214 | |||
| d2ijqa1 | 145 | Hypothetical protein rrnAC1037 {Haloarcula marismo | 99.97 | |
| d2cwya1 | 94 | Hypothetical protein TTHA0068 {Thermus thermophilu | 99.96 | |
| d1tjca_ | 95 | Prolyl 4-hydroxylase alpha-1 subunit, P4HA1 {Human | 94.53 | |
| d1hz4a_ | 366 | Transcription factor MalT domain III {Escherichia | 90.86 | |
| d1hxia_ | 112 | Peroxin pex5 (peroxisomal targeting signal 1 (PTS1 | 89.81 | |
| d1fcha_ | 323 | Peroxin pex5 (peroxisomal targeting signal 1 (PTS1 | 86.26 | |
| d1hz4a_ | 366 | Transcription factor MalT domain III {Escherichia | 82.17 |
| >d2ijqa1 a.246.2.1 (A:14-158) Hypothetical protein rrnAC1037 {Haloarcula marismortui [TaxId: 2238]} | Back information, alignment and structure |
|---|
class: All alpha proteins fold: Hyaluronidase domain-like superfamily: TTHA0068-like family: TTHA0068-like domain: Hypothetical protein rrnAC1037 species: Haloarcula marismortui [TaxId: 2238]
Probab=99.97 E-value=1.6e-31 Score=216.89 Aligned_cols=117 Identities=17% Similarity=0.295 Sum_probs=103.4
Q ss_pred CcchhHHHHHHHhccCCchhhhhhhHHhhccCC--cchhhHHHHHHHHHHHHHH-HhcCCHHHHHHHHHHHHHHHhhccC
Q 047859 36 GENCSFDEAVALFNERAYYKCHDCLESLWYTAE--EPTRTLIHGVLQCAVGFYH-LFNQNHKGAMMELGEGLGKLRKMNL 112 (214)
Q Consensus 36 e~~~~l~~gi~LFN~G~YfEAHEVLEe~Wk~~~--g~er~~lqGLIQlAvAl~H-~~rGN~~GA~~Ll~rAl~~L~~~~~ 112 (214)
..+.+|.+|+.|||+|+||||||+||++|+..+ +++|.|||||||+|||+|| +++||++||++|+++|+.+|+.++
T Consensus 16 t~~~a~~~Gi~lfN~g~y~eaHE~lEe~W~~~~~~~~er~~~QGLIQ~Aaa~~hh~~~gN~~GA~~l~~~al~~L~~~~- 94 (145)
T d2ijqa1 16 TLRRAVVHGVRLYNSGEFHESHDCFEDEWYNYGRGNTESKFLHGMVQVAAGAYKHFDFEDDDGMRSLFRTSLQYFRGVP- 94 (145)
T ss_dssp HHHHHHHHHHHHHHTTCHHHHHHHHHHHTTTTCSSSHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHTTSC-
T ss_pred hHHHHHHHHHHHHhCCCHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhCC-
Confidence 567899999999999999999999999999976 4789999999999999885 899999999999999999999884
Q ss_pred CCCCCcccccCHHHHHHHHHHhhHHHHhcccCceeeecCcchhHHh
Q 047859 113 RSGPFHEFENEISAALEFIYRTQIELAACADDICLAMDQSERSYQL 158 (214)
Q Consensus 113 ~~~p~~~~GvDV~aLl~~i~~l~~Elaa~~~d~~l~~dgs~~~~~~ 158 (214)
..++||||+.|+.++.++..+.... .|+.++.|+..++...
T Consensus 95 ----~~~~Gidv~~l~~~~~~~~~d~~~~-~p~ri~ld~~~p~~~p 135 (145)
T d2ijqa1 95 ----NDYYGVDLLDVRTTVTNALSDPSAL-HGWQIRLDGEYPTCRP 135 (145)
T ss_dssp ----TTGGGBCHHHHHHHHHHHHHCGGGG-TTCCCCBTTBCCCCCH
T ss_pred ----CccCCccHHHHHHHHHHHHhchhhc-CCcceecCCCCCCCCc
Confidence 3479999999999999998765544 5789999998776543
|
| >d2cwya1 a.246.2.1 (A:1-94) Hypothetical protein TTHA0068 {Thermus thermophilus [TaxId: 274]} | Back information, alignment and structure |
|---|
| >d1tjca_ a.118.8.1 (A:) Prolyl 4-hydroxylase alpha-1 subunit, P4HA1 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1hz4a_ a.118.8.2 (A:) Transcription factor MalT domain III {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|
| >d1hxia_ a.118.8.1 (A:) Peroxin pex5 (peroxisomal targeting signal 1 (PTS1) receptor) {Trypanosoma brucei [TaxId: 5691]} | Back information, alignment and structure |
|---|
| >d1fcha_ a.118.8.1 (A:) Peroxin pex5 (peroxisomal targeting signal 1 (PTS1) receptor) {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1hz4a_ a.118.8.2 (A:) Transcription factor MalT domain III {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|