Citrus Sinensis ID: 029119


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------20
MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSRSNEGQSSSMSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNTVALKG
ccEEEEEccccccccEEEcccccEEEEcccccccccccccccHHHHHHHHHHHccHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHEEEccccccHHHHHHHHHHHHHcccEEcc
ccEEEEEcccccHHHEEEcccccEEEEEcccccccccHEEEHcHEEEEEHHHHccHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHEccccHHHHHHHHHHHHHHHHHHHcc
meyrcvkcgfriKTLFVqyspgnirlmkceNCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSrsnegqsssmSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILlntsagassFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSsntvalkg
meyrcvkcgfriktlfvqyspgnirlMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSRSNEGQSSSMSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNTVALKG
MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFllldayrslllsrsNEGQSSSMSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNTVALKG
**YRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLS**********SFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNT*****
MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLL***************LLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNTVALKG
MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLS*********MSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNTVALKG
MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSRSNEGQSSSMSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNTVALKG
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSRSNEGQSSSMSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYIIDLFVLSSNTVALKG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query198 2.2.26 [Sep-21-2011]
Q54GD9246 Protein arv1 homolog OS=D yes no 0.949 0.764 0.307 7e-23
Q9H2C2271 Protein ARV1 OS=Homo sapi yes no 0.944 0.690 0.325 5e-17
Q9HDX5266 Protein arv1 OS=Schizosac yes no 0.939 0.699 0.306 2e-16
Q9D0U9266 Protein ARV1 OS=Mus muscu yes no 0.949 0.706 0.315 5e-15
Q3SZW3282 Protein ARV1 OS=Bos tauru yes no 0.944 0.663 0.310 2e-14
Q750Q7 299 Protein ARV1 OS=Ashbya go yes no 0.333 0.220 0.439 3e-07
Q06541 321 Protein ARV1 OS=Saccharom yes no 0.318 0.196 0.396 8e-05
>sp|Q54GD9|ARV1_DICDI Protein arv1 homolog OS=Dictyostelium discoideum GN=arv1 PE=3 SV=1 Back     alignment and function desciption
 Score =  107 bits (266), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 105/208 (50%), Gaps = 20/208 (9%)

Query: 5   CVKCGFRIKTLFVQYSP---GNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRH 61
           C++CG  +  ++ ++     GNIRL +C +C   AD+Y+E + +I+ +DL LHK QAYRH
Sbjct: 3   CIECGRPVNDVYKEFGKAGSGNIRLTRCASCNQTADKYVEYDFIIVFLDLFLHKAQAYRH 62

Query: 62  LLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSRSNEGQSSSMSFSLLAWIFQKMLK 121
           LL+N        +     K  V ++  ++Y   L  +  E   S  +F  + W       
Sbjct: 63  LLFNRQPYRDFGIPIQYIKVLVVYIFFESYIKWLRFKEYEQHPSGPAFYYIDW-----QD 117

Query: 122 DVVLGNVMFLGVFLHAS------------RILLNTSAGASSFKDFLLAVLISSYFKIFLV 169
           DV      F+ V   A             R +  +      +   ++A+++SS+ K FLV
Sbjct: 118 DVPYDRYWFIFVTAIAEFAVYILSIILSVRFIYESRYPIIKYNYLIMAIILSSFGKGFLV 177

Query: 170 AMMVWNFPSSVIYIIDLFVLSSNTVALK 197
            MM+W++P S   I+++FVLSSN VA+K
Sbjct: 178 LMMIWDYPFSFGSILNIFVLSSNVVAIK 205




Mediator of sterol homeostasis involved in sterol uptake, trafficking and distribution into membranes. Regulates also the sphingolipid metabolism.
Dictyostelium discoideum (taxid: 44689)
>sp|Q9H2C2|ARV1_HUMAN Protein ARV1 OS=Homo sapiens GN=ARV1 PE=2 SV=1 Back     alignment and function description
>sp|Q9HDX5|ARV1_SCHPO Protein arv1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=arv1 PE=3 SV=3 Back     alignment and function description
>sp|Q9D0U9|ARV1_MOUSE Protein ARV1 OS=Mus musculus GN=Arv1 PE=2 SV=1 Back     alignment and function description
>sp|Q3SZW3|ARV1_BOVIN Protein ARV1 OS=Bos taurus GN=ARV1 PE=2 SV=1 Back     alignment and function description
>sp|Q750Q7|ARV1_ASHGO Protein ARV1 OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=ARV1 PE=3 SV=1 Back     alignment and function description
>sp|Q06541|ARV1_YEAST Protein ARV1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=ARV1 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query198
225453664248 PREDICTED: protein arv1 homolog [Vitis v 0.994 0.794 0.725 2e-78
224128085232 predicted protein [Populus trichocarpa] 0.989 0.844 0.712 5e-75
255548091187 arv1, putative [Ricinus communis] gi|223 0.904 0.957 0.668 2e-64
356504684250 PREDICTED: protein arv1 homolog [Glycine 0.994 0.788 0.619 3e-61
356570412247 PREDICTED: protein arv1 homolog [Glycine 0.994 0.797 0.614 5e-60
297843020245 hypothetical protein ARALYDRAFT_470181 [ 0.959 0.775 0.571 4e-57
449431936277 PREDICTED: protein arv1 homolog [Cucumis 0.969 0.693 0.576 3e-56
242032783237 hypothetical protein SORBIDRAFT_01g00613 0.969 0.810 0.551 7e-53
79332834245 Arv1-like protein [Arabidopsis thaliana] 0.959 0.775 0.561 1e-52
297814177228 hypothetical protein ARALYDRAFT_490425 [ 0.939 0.815 0.575 9e-52
>gi|225453664|ref|XP_002268438.1| PREDICTED: protein arv1 homolog [Vitis vinifera] gi|296089029|emb|CBI38732.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  297 bits (760), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 143/197 (72%), Positives = 168/197 (85%)

Query: 1   MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYR 60
           ME RC+ CGF IKTLF+QYSPGNIRLMKCENC+AVADEYIECE MI+LIDLILHKP+AYR
Sbjct: 1   MELRCIHCGFPIKTLFLQYSPGNIRLMKCENCKAVADEYIECESMIVLIDLILHKPKAYR 60

Query: 61  HLLYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSRSNEGQSSSMSFSLLAWIFQKML 120
           HLL+N+LN +T+N +G+LWKS +GFLLLDAYR L+L+RS E    SMSFS L W F K+L
Sbjct: 61  HLLFNLLNPQTLNFQGLLWKSALGFLLLDAYRILVLTRSEEEWGLSMSFSSLFWRFSKIL 120

Query: 121 KDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSV 180
            DV+ GN+ FL +FL A+R LLNT  GAS +K+ LLA+LISSYFKIFL+AMMVW  PSSV
Sbjct: 121 MDVLFGNLAFLSIFLLATRNLLNTVGGASRYKNLLLAILISSYFKIFLIAMMVWELPSSV 180

Query: 181 IYIIDLFVLSSNTVALK 197
           I+IID+FVLSSNTVALK
Sbjct: 181 IFIIDVFVLSSNTVALK 197




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224128085|ref|XP_002320240.1| predicted protein [Populus trichocarpa] gi|222861013|gb|EEE98555.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255548091|ref|XP_002515102.1| arv1, putative [Ricinus communis] gi|223545582|gb|EEF47086.1| arv1, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|356504684|ref|XP_003521125.1| PREDICTED: protein arv1 homolog [Glycine max] Back     alignment and taxonomy information
>gi|356570412|ref|XP_003553382.1| PREDICTED: protein arv1 homolog [Glycine max] Back     alignment and taxonomy information
>gi|297843020|ref|XP_002889391.1| hypothetical protein ARALYDRAFT_470181 [Arabidopsis lyrata subsp. lyrata] gi|297335233|gb|EFH65650.1| hypothetical protein ARALYDRAFT_470181 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449431936|ref|XP_004133756.1| PREDICTED: protein arv1 homolog [Cucumis sativus] Back     alignment and taxonomy information
>gi|242032783|ref|XP_002463786.1| hypothetical protein SORBIDRAFT_01g006130 [Sorghum bicolor] gi|241917640|gb|EER90784.1| hypothetical protein SORBIDRAFT_01g006130 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|79332834|ref|NP_171610.2| Arv1-like protein [Arabidopsis thaliana] gi|56540523|gb|AAV92715.1| ARV1 [Arabidopsis thaliana] gi|332189096|gb|AEE27217.1| Arv1-like protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297814177|ref|XP_002874972.1| hypothetical protein ARALYDRAFT_490425 [Arabidopsis lyrata subsp. lyrata] gi|297320809|gb|EFH51231.1| hypothetical protein ARALYDRAFT_490425 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query198
TAIR|locus:2117012228 ARV2 "AT4G01510" [Arabidopsis 0.939 0.815 0.507 1.6e-47
TAIR|locus:2200940245 ARV1 "AT1G01020" [Arabidopsis 0.959 0.775 0.505 8.8e-47
DICTYBASE|DDB_G0290221246 arv1 "arv1-like family protein 0.308 0.247 0.437 6.2e-24
POMBASE|SPAPB1A10.15266 arv1 "Arv1-like family protein 0.343 0.255 0.471 1.1e-20
UNIPROTKB|Q9H2C2271 ARV1 "Protein ARV1" [Homo sapi 0.954 0.697 0.293 2.8e-18
UNIPROTKB|F1RGS6281 ARV1 "Uncharacterized protein" 0.368 0.259 0.389 3.9e-18
UNIPROTKB|E2QRS2281 ARV1 "Uncharacterized protein" 0.954 0.672 0.288 9.5e-18
MGI|MGI:1916115266 Arv1 "ARV1 homolog (yeast)" [M 0.959 0.714 0.282 8.6e-17
RGD|1310264267 Arv1 "ARV1 homolog (S. cerevis 0.959 0.711 0.277 1.1e-16
UNIPROTKB|F1MCN1282 ARV1 "Protein ARV1" [Bos tauru 0.939 0.659 0.303 1.4e-16
TAIR|locus:2117012 ARV2 "AT4G01510" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 497 (180.0 bits), Expect = 1.6e-47, P = 1.6e-47
 Identities = 98/193 (50%), Positives = 133/193 (68%)

Query:     5 CVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLY 64
             CV+CG ++K+LF+QYSPGN RLMKCENC  VADEY+ECE++I+ IDLILHK +AYRHLLY
Sbjct:     8 CVECGHKVKSLFIQYSPGNFRLMKCENCEEVADEYVECELLIIFIDLILHKTKAYRHLLY 67

Query:    65 NVLNSETVNLKGILWKSTVGFXXXXXXXXXXXXXXNEGQSSSMSFSLLAWIFQKMLKDVV 124
             NV+N E+ N++ +LWK  + +              N+G + SMSF L   +  ++L +V+
Sbjct:    68 NVVNQESANVQHLLWKLVLAYLLLDTYRSLLLRRTNDGSNVSMSF-LFESL--EVLVNVL 124

Query:   125 LGNVMFLGVFLHASRILLNTSAGASSFKDFLLAVLISSYFKIFLVAMMVWNFPSSVIYII 184
               N  F+  F  A++++L    G    K+ LL +LISSY KIFL AM VW FP SVI+I+
Sbjct:   125 SANFAFVFSFAFAAKLMLVMPRG----KEILLTILISSYVKIFLFAMPVWEFPVSVIFIV 180

Query:   185 DLFVLSSNTVALK 197
             D+ VL+SN VALK
Sbjct:   181 DMLVLTSNAVALK 193




GO:0003674 "molecular_function" evidence=ND
GO:0016020 "membrane" evidence=ISS
GO:0005783 "endoplasmic reticulum" evidence=IDA
GO:0006665 "sphingolipid metabolic process" evidence=IMP
GO:0016125 "sterol metabolic process" evidence=IMP
GO:0006661 "phosphatidylinositol biosynthetic process" evidence=RCA
TAIR|locus:2200940 ARV1 "AT1G01020" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0290221 arv1 "arv1-like family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
POMBASE|SPAPB1A10.15 arv1 "Arv1-like family protein Arv1 (predicted)" [Schizosaccharomyces pombe (taxid:4896)] Back     alignment and assigned GO terms
UNIPROTKB|Q9H2C2 ARV1 "Protein ARV1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1RGS6 ARV1 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|E2QRS2 ARV1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
MGI|MGI:1916115 Arv1 "ARV1 homolog (yeast)" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
RGD|1310264 Arv1 "ARV1 homolog (S. cerevisiae)" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|F1MCN1 ARV1 "Protein ARV1" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_Genewise1_v1.C_LG_XIV1968
hypothetical protein (232 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query198
pfam04161194 pfam04161, Arv1, Arv1-like family 2e-48
COG5254239 COG5254, ARV1, Predicted membrane protein [Functio 2e-19
>gnl|CDD|217936 pfam04161, Arv1, Arv1-like family Back     alignment and domain information
 Score =  155 bits (395), Expect = 2e-48
 Identities = 76/201 (37%), Positives = 109/201 (54%), Gaps = 14/201 (6%)

Query: 3   YRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHL 62
             C++CG  +K+L+ +YS GNI+L  C NC  VAD+YIE + +IL IDL+L K QAYRHL
Sbjct: 1   MICIECGRPVKSLYKKYSSGNIKLTDCPNCNEVADKYIEYDNVILFIDLLLLKKQAYRHL 60

Query: 63  LYNVLNSETVNLKGILWKSTVGFLLLDAYRSLLLSRSNEGQSSSMSFSLLAWIFQK---- 118
           LYNVL     N    LW+  +  +L + Y  L   R  +    S++ ++L+         
Sbjct: 61  LYNVL-----NKYDRLWRLFLLLILFEVY--LTWRREEKKSQHSLTRNVLSQSPYLQYLF 113

Query: 119 MLKDVVLGNVMFLGVFLHASRILLNTSAGASSFKDFLL-AVLISSYFKIFLVAMMVWNFP 177
            L  ++L NV+         R LL    G       LL  VL+S Y K+F + M++W + 
Sbjct: 114 FLAYLLLENVLLHISTAILLRKLLKWGKGLRYRTLILLYTVLLSYYGKLFPILMLIWPYD 173

Query: 178 SSVIY--IIDLFVLSSNTVAL 196
           +++I   IIDL VL SN  AL
Sbjct: 174 TTLISMSIIDLIVLLSNIEAL 194


Arv1 is a transmembrane protein with potential zinc-binding motifs. ARV1 is a novel mediator of eukaryotic sterol homeostasis. Length = 194

>gnl|CDD|227579 COG5254, ARV1, Predicted membrane protein [Function unknown] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 198
PF04161208 Arv1: Arv1-like family ; InterPro: IPR007290 Arv1 100.0
KOG3134225 consensus Predicted membrane protein [Function unk 100.0
COG5254239 ARV1 Predicted membrane protein [Function unknown] 100.0
smart0083441 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_C 93.53
PF0827143 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013 93.24
PRK0039846 rpoP DNA-directed RNA polymerase subunit P; Provis 92.75
smart0065944 RPOLCX RNA polymerase subunit CX. present in RNA p 91.57
PF0972342 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR0134 91.2
KOG3134225 consensus Predicted membrane protein [Function unk 90.55
TIGR0260552 CxxC_CxxC_SSSS putative regulatory protein, FmdB f 90.29
COG5254239 ARV1 Predicted membrane protein [Function unknown] 90.18
PF14319111 Zn_Tnp_IS91: Transposase zinc-binding domain 89.63
PF1324826 zf-ribbon_3: zinc-ribbon domain 89.46
PF0360432 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa 88.57
COG199649 RPC10 DNA-directed RNA polymerase, subunit RPC10 ( 88.42
PF0518066 zf-DNL: DNL zinc finger; InterPro: IPR007853 Zinc 87.52
cd0072934 rubredoxin_SM Rubredoxin, Small Modular nonheme ir 86.46
PF0728269 OrfB_Zn_ribbon: Putative transposase DNA-binding d 84.09
PF04216290 FdhE: Protein involved in formate dehydrogenase fo 83.31
PF0128634 XPA_N: XPA protein N-terminal; InterPro: IPR022652 82.09
PF1277350 DZR: Double zinc ribbon 82.03
PRK00464154 nrdR transcriptional regulator NrdR; Validated 81.25
PF1264792 RNHCP: RNHCP domain; InterPro: IPR024439 This doma 81.11
PF01927147 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 T 80.2
>PF04161 Arv1: Arv1-like family ; InterPro: IPR007290 Arv1 is a transmembrane protein, with potential zinc-binding motifs, that mediates sterol homeostasis Back     alignment and domain information
Probab=100.00  E-value=6.1e-60  Score=399.15  Aligned_cols=194  Identities=41%  Similarity=0.643  Sum_probs=169.3

Q ss_pred             ccccccCcccccceeeecCCceEeecCCCccccccccccchhHHHHHHHHhcCcchheeeeccccccccc--------ch
Q 029119            3 YRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECEIMILLIDLILHKPQAYRHLLYNVLNSETV--------NL   74 (198)
Q Consensus         3 ~~Ci~C~~~v~~l~~~y~~~~i~l~~C~~C~~~~DkYiE~d~~i~~iDl~L~k~~ayRHllfN~~~~~~~--------~~   74 (198)
                      ++|||||+||++||++|||||||+|+|++||++||||||+|+++++||++|+|||||||++||+.+++..        +.
T Consensus         1 miCIeCg~~v~~Ly~~Ys~~~irLt~C~~C~~vaDkYiE~d~vil~IDLlLlK~~AYRHllfN~~~~~~~~~~~~~~~~~   80 (208)
T PF04161_consen    1 MICIECGHPVKSLYRQYSPGNIRLTKCPNCGKVADKYIEYDNVILFIDLLLLKPQAYRHLLFNRLEPELSKFQVKNWFSK   80 (208)
T ss_pred             CEeccCCCcchhhhhccCCCcEEEeeccccCCcccceeccccHHHHHHHHHcchhhHHHhhccCCccccccchhhhhhhh
Confidence            5899999999999999999999999999999999999999999999999999999999999999887765        44


Q ss_pred             hhHHHHHHHHHHHHHHHHHHhhhccCCCCCC---cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCch
Q 029119           75 KGILWKSTVGFLLLDAYRSLLLSRSNEGQSS---SMSFSLLAWIFQKMLKDVVLGNVMFLGVFLHASRILLNTS-AGASS  150 (198)
Q Consensus        75 ~~~l~kl~~~~ll~eay~~w~~~~~~~~~~~---s~~~~~~~~~~~~~l~~~~l~~~~f~~~~~l~~~~~l~~~-~~~~~  150 (198)
                      .+.+||++++++++|||+.|...+.+.+.+.   .....+...++..++...++++++|+.++.+..+..++++ ....+
T Consensus        81 ~~~~~rl~il~ll~eayl~w~~~~~~~~~~~~~~~~~~~~~~~q~~~f~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~  160 (208)
T PF04161_consen   81 FKSLWRLVILLLLFEAYLRWASEEKSSQSSQLMSSILSQSIYMQYLFFLIYCLLENLLFHLFIALLIRFWLKWGQQYKYR  160 (208)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHhccccccchhhhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchh
Confidence            5689999999999999999987665543321   1111233555667778888899999999999999888887 55668


Q ss_pred             HHHHHHHHHHHhhhHHHhhhhhccCCCCchhh--hhheeEEeeccccc
Q 029119          151 FKDFLLAVLISSYFKIFLVAMMVWNFPSSVIY--IIDLFVLSSNTVAL  196 (198)
Q Consensus       151 ~~~ll~alllSs~~KLF~ilmlIW~yd~s~~~--~i~~~vl~sN~~Al  196 (198)
                      ++.+++|+++||++|+|+++|+||+||.++..  +++++|++||++||
T Consensus       161 ~~~l~~alllSs~~Klf~ilmlIW~~~~~~~~~~ii~~~v~~~~~~aL  208 (208)
T PF04161_consen  161 FRVLLTALLLSSYGKLFPILMLIWPYDSSPISLSIIDWFVLLSNIEAL  208 (208)
T ss_pred             HHHHHHHHHHHhhHHHHHHHHHHhCCCCcHHHHHHHHHHHHHHHHhcC
Confidence            99999999999999999999999999987766  99999999999997



Its action is important in lipid homeostasis, which prevents free sterol toxicity []. Arv1 contains a homology domain (AHD), which consists of an N-terminal cysteine-rich subdomain with a putative zinc-binding motif, followed by a C-terminal subdomain of 33 amino acids. The C-terminal subdomain of the AHD is critical for the protein's function []. In yeast, Arv1p is important for the delivery of an early glycosylphosphatidylinositol GPI intermediate, GlcN-acylPI, to the first mannosyltransferase of GPI synthesis in the ER lumen []. It is important for the traffic of sterol in yeast and in humans. In eukaryotic cells, it may fuction in the sphingolipid metabolic pathway as a transporter of ceramides between the ER and Golgi [].

>KOG3134 consensus Predicted membrane protein [Function unknown] Back     alignment and domain information
>COG5254 ARV1 Predicted membrane protein [Function unknown] Back     alignment and domain information
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein Back     alignment and domain information
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional Back     alignment and domain information
>smart00659 RPOLCX RNA polymerase subunit CX Back     alignment and domain information
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria Back     alignment and domain information
>KOG3134 consensus Predicted membrane protein [Function unknown] Back     alignment and domain information
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family Back     alignment and domain information
>COG5254 ARV1 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PF14319 Zn_Tnp_IS91: Transposase zinc-binding domain Back     alignment and domain information
>PF13248 zf-ribbon_3: zinc-ribbon domain Back     alignment and domain information
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates Back     alignment and domain information
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription] Back     alignment and domain information
>PF05180 zf-DNL: DNL zinc finger; InterPro: IPR007853 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase Back     alignment and domain information
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins Back     alignment and domain information
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex [] Back     alignment and domain information
>PF01286 XPA_N: XPA protein N-terminal; InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer Back     alignment and domain information
>PF12773 DZR: Double zinc ribbon Back     alignment and domain information
>PRK00464 nrdR transcriptional regulator NrdR; Validated Back     alignment and domain information
>PF12647 RNHCP: RNHCP domain; InterPro: IPR024439 This domain is found in uncharacterised bacterial proteins Back     alignment and domain information
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query198
1twf_L70 ABC10-alpha, DNA-directed RNA polymerases I, II, a 89.96
3h0g_L63 DNA-directed RNA polymerases I, II, and III subuni 87.03
>1twf_L ABC10-alpha, DNA-directed RNA polymerases I, II, and III 7.7 K polypeptide; transcription, mRNA, multiprotein complex; HET: UTP; 2.30A {Saccharomyces cerevisiae} SCOP: g.41.9.2 PDB: 1i3q_L 1i6h_L 1k83_L* 1nik_L 1nt9_L 1pqv_L 1r5u_L 1r9s_L* 1r9t_L* 1sfo_L* 1twa_L* 1twc_L* 1i50_L* 1twg_L* 1twh_L* 1wcm_L 1y1v_L 1y1w_L 1y1y_L 1y77_L* ... Back     alignment and structure
Probab=89.96  E-value=0.12  Score=36.12  Aligned_cols=27  Identities=19%  Similarity=0.785  Sum_probs=20.1

Q ss_pred             CCccccccCcccccceeeecCCceEeecCCCccc
Q 029119            1 MEYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRA   34 (198)
Q Consensus         1 ~~~~Ci~C~~~v~~l~~~y~~~~i~l~~C~~C~~   34 (198)
                      .+|+|-+||..++..    ..+.+   .|++||.
T Consensus        27 v~Y~C~~CG~~~e~~----~~d~i---rCp~CG~   53 (70)
T 1twf_L           27 LKYICAECSSKLSLS----RTDAV---RCKDCGH   53 (70)
T ss_dssp             CCEECSSSCCEECCC----TTSTT---CCSSSCC
T ss_pred             EEEECCCCCCcceeC----CCCCc---cCCCCCc
Confidence            368999999997654    23333   7999997



>3h0g_L DNA-directed RNA polymerases I, II, and III subunit rpabc4; transcription, multi-protein complex, DNA- binding, magnesium; 3.65A {Schizosaccharomyces pombe} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query198
d1nnqa237 Rubrerythrin, C-terminal domain {Archaeon Pyrococc 93.68
d1d4ua236 DNA repair factor XPA DNA- and RPA-binding domain, 85.18
d1yuza236 Nigerythrin, C-terminal domain {Desulfovibrio vulg 85.07
d1y0ja139 Erythroid transcription factor GATA-1 {Mouse (Mus 83.96
d1dl6a_58 Transcription initiation factor TFIIB, N-terminal 83.91
d1pfta_50 Transcription initiation factor TFIIB, N-terminal 81.85
d1k81a_36 Zinc-binding domain of translation initiation fact 80.02
>d1nnqa2 g.41.5.1 (A:135-171) Rubrerythrin, C-terminal domain {Archaeon Pyrococcus furiosus [TaxId: 2261]} Back     information, alignment and structure
class: Small proteins
fold: Rubredoxin-like
superfamily: Rubredoxin-like
family: Rubredoxin
domain: Rubrerythrin, C-terminal domain
species: Archaeon Pyrococcus furiosus [TaxId: 2261]
Probab=93.68  E-value=0.012  Score=35.03  Aligned_cols=33  Identities=27%  Similarity=0.667  Sum_probs=26.0

Q ss_pred             CccccccCcccccceeeecCCceEeecCCCccccccccccch
Q 029119            2 EYRCVKCGFRIKTLFVQYSPGNIRLMKCENCRAVADEYIECE   43 (198)
Q Consensus         2 ~~~Ci~C~~~v~~l~~~y~~~~i~l~~C~~C~~~~DkYiE~d   43 (198)
                      .++|..||+-...    -.|     .+||-|+...|+|.++|
T Consensus         5 ~~~C~vCG~i~~g----~~P-----~~CPvCg~~~~~F~~~e   37 (37)
T d1nnqa2           5 VYICPICGYTAVD----EAP-----EYCPVCGAPKEKFVVFE   37 (37)
T ss_dssp             EEECTTTCCEEES----CCC-----SBCTTTCCBGGGSEEEC
T ss_pred             EEECCCCCCEecC----CCC-----CcCCCCCCcHHHcEEcC
Confidence            3689999987643    133     47999999999999875



>d1d4ua2 g.39.1.5 (A:1-36) DNA repair factor XPA DNA- and RPA-binding domain, N-terminal subdomain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1yuza2 g.41.5.1 (A:167-202) Nigerythrin, C-terminal domain {Desulfovibrio vulgaris [TaxId: 881]} Back     information, alignment and structure
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1dl6a_ g.41.3.1 (A:) Transcription initiation factor TFIIB, N-terminal domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1pfta_ g.41.3.1 (A:) Transcription initiation factor TFIIB, N-terminal domain {Archaeon Pyrococcus furiosus [TaxId: 2261]} Back     information, alignment and structure
>d1k81a_ g.59.1.1 (A:) Zinc-binding domain of translation initiation factor 2 beta {Archaeon Methanococcus jannaschii [TaxId: 2190]} Back     information, alignment and structure