Citrus Sinensis ID: 028747


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200----
MGLEVRVSRMSECISSIPLSNSVPVTKIRTLSGSKWRPKNGSVRNGNLQYAMCTFKPPSKNQELFDELGFKDKFSVNSAEENSREIAEVKEEKREEEKDGSDGSSEKVKLRRGRQVMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVRLDTLDTEL
cccEEEEEEEccccccccccccccEEEEEEcccccccccccccccccccccEEEEccccccccccccccEEEccccccccccHHHHHHHHccccccccccccccccHHHcccccHHHHHHHHcccEEEEcccccEEEEEEEEEEEccccEEEEEEEEccccccccEEEEcccccccccEEEEcccccccccccccccccccccc
cccEEEEEccccccccccccccccEEcccccccccccccEEEEEcccccEEEEEEEEcccccccccccccccccccccEEEcccccccccHHHEEEEcccccccccEEEEccccEEEEHHHHHccEEEEccccHcccEEEEEEEEccccEEEEEEEcHHHccccHHEEEHHcccEcccEEEEcccHHHHHHHHEEccccccccc
MGLEVRVSRMSEcissiplsnsvpvtkirtlsgskwrpkngsvrngnlqyamctfkppsknqelfdelgfkdkfsvnsaEENSREIAEVKEEkreeekdgsdgssekvkLRRGRQVMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVrpnllsgepeRLLLEDICQVGDVVLIedesvmendfkMVRLDTLDTEL
MGLEVRVSRMSEcissiplsnsvpvtkirtlsgskwrpkngsvrnGNLQYAMCTFKPPSKNQELFDELGFKDkfsvnsaeensreiaevkeekreeekdgsdgssekvklrrgrqvmrrsNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPnllsgeperLLLEDICQVGDVVLIedesvmendfkmvrldtldtel
MGLEVRVSRMSECISSIPLSNSVPVTKIRTLSGSKWRPKNGSVRNGNLQYAMCTFKPPSKNQELFDELGFKDKFSVNSAEENSReiaevkeekreeekdgsdgssekVKLRRGRQVMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVRLDTLDTEL
*********************************************GNLQYAMCTF*****************************************************************NLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVR********
****VRVSRMSECISSIPLSNSVPVTKIRTL*********************CTFKPPSKNQELFDELG***************************************************NLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVRLDTL****
**********SECISSIPLSNSVPVTKIRTLSGSKWRPKNGSVRNGNLQYAMCTFKPPSKNQELFDELGFKDKFSV**************************************QVMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVRLDTLDTEL
*GLEVRVSRMSECISSIPLSNSVPVTKIRTLSGSKWRPKNGSVRNGNLQYAMCTFKPPSKNQELFDELGFKDKFSVNSAEENSREIAEVKEEKREEEKDGSDGSSEKVKLRRGRQVMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVRLDTL****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGLEVRVSRMSECISSIPLSNSVPVTKIRTLSGSKWRPKNGSVRNGNLQYAMCTFKPPSKNQELFDELGFKDKFSxxxxxxxxxxxxxxxxxxxxxxxxGSDGSSEKVKLRRGRQVMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVRLDTLDTEL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query204
225439715 377 PREDICTED: uncharacterized protein LOC10 0.901 0.488 0.434 3e-37
359481313 332 PREDICTED: uncharacterized protein LOC10 0.911 0.560 0.425 3e-37
147840953 377 hypothetical protein VITISV_011642 [Viti 0.901 0.488 0.434 5e-37
356568220 382 PREDICTED: uncharacterized protein LOC10 0.514 0.274 0.681 3e-34
449502678 473 PREDICTED: uncharacterized LOC101206450 0.612 0.264 0.583 1e-33
449449956 473 PREDICTED: uncharacterized protein LOC10 0.612 0.264 0.583 1e-33
224091128 347 predicted protein [Populus trichocarpa] 0.568 0.334 0.644 2e-33
118487679257 unknown [Populus trichocarpa] 0.568 0.451 0.644 2e-33
255575602149 hypothetical protein RCOM_1058350 [Ricin 0.446 0.610 0.780 9e-33
115452855229 Os03g0334100 [Oryza sativa Japonica Grou 0.441 0.393 0.7 3e-28
>gi|225439715|ref|XP_002268101.1| PREDICTED: uncharacterized protein LOC100261869 isoform 1 [Vitis vinifera] gi|297735530|emb|CBI18024.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  160 bits (404), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 106/244 (43%), Positives = 140/244 (57%), Gaps = 60/244 (24%)

Query: 10  MSECISSIPLSNSVPVTKIRTLSGSK--WRPKNGS------------VRNGN-------- 47
           M +C  S  L NS    +IR L  +   W P+N +             RNG         
Sbjct: 1   MCDCAPSTLLLNST--VRIRNLGFTTRDWNPRNHAKLSTFCNSHLIFHRNGGSRLISAEI 58

Query: 48  --LQYAMCTFKPPSKNQELFDELGFKDKF---------SVNSAEENSREIAEVKE-EKRE 95
             L    C    P++      ELGFKD+           V+S+  ++ +  E++E E R 
Sbjct: 59  DALNSTTCVRAIPNRG-----ELGFKDEGKRDFGFETKGVDSSTSSAFDFLELEEQEGRT 113

Query: 96  EEK-------------------DGSDGSSEKVKLRRGRQVMRRSNLLAKQVISIQSAMSL 136
            EK                   +G++   E++ LR GRQVMRRSN+LAKQVISI+SA+SL
Sbjct: 114 GEKSAGNPEDEDLVRVGGGSELEGAEKVDERLGLRSGRQVMRRSNMLAKQVISIRSALSL 173

Query: 137 GFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLLEDICQVGDVVLIEDESVMENDFKMVR 196
           GFVSQLWVDTTSW+VLV+EVRPNLLSGE ER LLED+ +VGDVVL++DESVM+N++KM+ 
Sbjct: 174 GFVSQLWVDTTSWMVLVIEVRPNLLSGELERFLLEDVSRVGDVVLVQDESVMDNEYKMLG 233

Query: 197 LDTL 200
           L+TL
Sbjct: 234 LETL 237




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359481313|ref|XP_003632606.1| PREDICTED: uncharacterized protein LOC100261869 isoform 2 [Vitis vinifera] Back     alignment and taxonomy information
>gi|147840953|emb|CAN71020.1| hypothetical protein VITISV_011642 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356568220|ref|XP_003552311.1| PREDICTED: uncharacterized protein LOC100806448 [Glycine max] Back     alignment and taxonomy information
>gi|449502678|ref|XP_004161711.1| PREDICTED: uncharacterized LOC101206450 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449449956|ref|XP_004142730.1| PREDICTED: uncharacterized protein LOC101206450 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224091128|ref|XP_002309189.1| predicted protein [Populus trichocarpa] gi|222855165|gb|EEE92712.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|118487679|gb|ABK95664.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255575602|ref|XP_002528701.1| hypothetical protein RCOM_1058350 [Ricinus communis] gi|223531873|gb|EEF33690.1| hypothetical protein RCOM_1058350 [Ricinus communis] Back     alignment and taxonomy information
>gi|115452855|ref|NP_001050028.1| Os03g0334100 [Oryza sativa Japonica Group] gi|108707999|gb|ABF95794.1| expressed protein [Oryza sativa Japonica Group] gi|113548499|dbj|BAF11942.1| Os03g0334100 [Oryza sativa Japonica Group] gi|215767854|dbj|BAH00083.1| unnamed protein product [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query204
TAIR|locus:2064184 328 AT2G38570 "AT2G38570" [Arabido 0.436 0.271 0.688 2.4e-28
TAIR|locus:2064184 AT2G38570 "AT2G38570" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 316 (116.3 bits), Expect = 2.4e-28, P = 2.4e-28
 Identities = 62/90 (68%), Positives = 80/90 (88%)

Query:   112 RG-RQVMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPNLLSGEPERLLL 170
             RG +Q   RSNL+AKQV+SIQSA+SLGF+SQLWVDTTSW+VLVV+V+P+LLSGE ER LL
Sbjct:   112 RGFKQSTTRSNLVAKQVVSIQSALSLGFISQLWVDTTSWLVLVVDVKPSLLSGESERFLL 171

Query:   171 EDICQVGDVVLIEDESVMENDFKMVRLDTL 200
              DI +VGDVVL+++E+V++ +FKMV L+TL
Sbjct:   172 TDIVRVGDVVLVDNETVLDTEFKMVGLETL 201


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.132   0.375    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      204       181   0.00095  109 3  11 22  0.43    32
                                                     31  0.40    35


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  592 (63 KB)
  Total size of DFA:  152 KB (2092 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  14.58u 0.16s 14.74t   Elapsed:  00:00:01
  Total cpu time:  14.58u 0.16s 14.74t   Elapsed:  00:00:01
  Start:  Sat May 11 09:25:00 2013   End:  Sat May 11 09:25:01 2013


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 204
TIGR0288876 spore_YlmC_YmxH sporulation protein, YlmC/YmxH fam 99.87
COG187387 Protein implicated in RNA metabolism, contains PRC 99.45
PF0523979 PRC: PRC-barrel domain; InterPro: IPR007903 The PR 99.31
COG3881176 PRC-barrel domain containing protein [General func 98.03
COG3881176 PRC-barrel domain containing protein [General func 97.73
cd00226246 PRCH Photosynthetic reaction center (RC) complex, 93.89
>TIGR02888 spore_YlmC_YmxH sporulation protein, YlmC/YmxH family Back     alignment and domain information
Probab=99.87  E-value=2e-22  Score=148.95  Aligned_cols=67  Identities=27%  Similarity=0.441  Sum_probs=60.5

Q ss_pred             EeccccCceeEeccCcceeeee--eeEEEECCCCEEEEEEECCC-----CCCCCCe-EEeccccceec-cEEEEec
Q 028747          118 RRSNLLAKQVISIQSAMSLGFV--SQLWVDTTSWVVLVVEVRPN-----LLSGEPE-RLLLEDICQVG-DVVLIED  184 (204)
Q Consensus       118 k~SeL~~KEVINi~dG~RLG~I--~DL~VD~~sG~V~AIiVp~~-----~f~ge~~-~IPWe~IkKIG-DVILVe~  184 (204)
                      |+|||++|||||++||+|||+|  .|+++|+++|+|.||++|++     ||+++.+ .|||++|+||| |+|||++
T Consensus         1 r~seL~~keVIni~~G~~lG~v~~~Dl~iD~~~G~I~aiIi~~~~~~~~~~~~~~~~~Ipw~~I~kIG~DvIlV~~   76 (76)
T TIGR02888         1 RLSDLRGKEIINVNDGERLGVIGNIDLEIDEEDGRILSLIIPGKGKKFGLFSKGEEIEIPWDAIKKIGSDVILVDV   76 (76)
T ss_pred             CHHHccCCCEEECCCCcEeeccccceEEEECCCCEEEEEEEeCCCcEEEeecCCcEEEEEhhhccEECCCEEEEeC
Confidence            5799999999999999999999  79999999999999999983     4565554 58999999999 9999984



Members of this family belong to the broader family of PRC-barrel domain proteins (see Pfam model pfam05239), but are found only in endospore-forming bacteria of the Firmicutes lineage. Most such species have exactly two members of this family and all have at least one; the function is unknown. One of two members from Bacillus subtilis, YmxH, is strongly induced by the mother cell-specific sigma-E factor.

>COG1873 Protein implicated in RNA metabolism, contains PRC-barrel domain [General function prediction only] Back     alignment and domain information
>PF05239 PRC: PRC-barrel domain; InterPro: IPR007903 The PRC-barrel is an all beta barrel domain found in photosynthetic reaction centre subunit H of the purple bacteria Back     alignment and domain information
>COG3881 PRC-barrel domain containing protein [General function prediction only] Back     alignment and domain information
>COG3881 PRC-barrel domain containing protein [General function prediction only] Back     alignment and domain information
>cd00226 PRCH Photosynthetic reaction center (RC) complex, subunit H; RC is an integral membrane protein-pigment complex which catalyzes light-induced reduction of ubiquinone to ubiquinol, generating a transmembrane electrochemical gradient of protons used to produce ATP by ATP synthase Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query204
d1pm3a_78 MTH1895 {Archaeon Methanobacterium thermoautotroph 98.7
d1rzhh1213 Photosynthetic reaction centre {Rhodobacter sphaer 97.29
d2i5nh1222 Photosynthetic reaction centre {Rhodopseudomonas v 97.19
d1eysh1201 Photosynthetic reaction centre {Thermochromatium t 97.12
d2f1la175 16S rRNA processing protein RimM, C-terminal domai 82.34
>d1pm3a_ b.41.1.2 (A:) MTH1895 {Archaeon Methanobacterium thermoautotrophicum [TaxId: 145262]} Back     information, alignment and structure
class: All beta proteins
fold: PRC-barrel domain
superfamily: PRC-barrel domain
family: MTH1895
domain: MTH1895
species: Archaeon Methanobacterium thermoautotrophicum [TaxId: 145262]
Probab=98.70  E-value=2.4e-08  Score=64.36  Aligned_cols=70  Identities=21%  Similarity=0.278  Sum_probs=59.2

Q ss_pred             EEEECCCCCCEEEECCCCCEEEEEEEEEEECCCCEEEEEEECCC-C-----CCCCCEEEECCCCCEECCEEEEECCC
Q ss_conf             26703366740575348612344302899778877999998899-8-----89998288153421112389991497
Q 028747          116 VMRRSNLLAKQVISIQSAMSLGFVSQLWVDTTSWVVLVVEVRPN-L-----LSGEPERLLLEDICQVGDVVLIEDES  186 (204)
Q Consensus       116 mmrrSeL~~KEVIni~dG~RLG~I~DL~VD~esG~V~AIiVp~~-l-----f~ge~~~IPWe~IkKIGDVILVe~~~  186 (204)
                      |....||.+|+|++- +++-.|-|.|+++|+++|.|.++++..+ .     +.+.+..+|.++|++|||.||+..++
T Consensus         2 mRi~~dliGk~VlD~-~A~~IGkV~DvEiD~esg~vtsLiI~k~Gis~~igl~~~e~~Vp~e~V~~IGD~VlL~~~~   77 (78)
T d1pm3a_           2 MRIVEEMVGKEVLDS-SAKVIGKVKDVEVDIESQAIESLVLGKGGISEGLGLSKGETIVPYEMVKKIGDKILLKGPE   77 (78)
T ss_dssp             EETTTTSSSCEEECT-TSCEEEEEEEEEEETTTTEEEEEEEECCC---------CEEEEEGGGEEEESSSEEECCTT
T ss_pred             CCHHHHHHCCHHCCC-CCCEEEEEEEEEECCCCCEEEEEEECCCCCCHHHCCCCCEEECCHHHHHHHCCEEEEECCC
T ss_conf             245875728500066-4101123545798677780689997376411010004870881799974322489983799



>d1rzhh1 b.41.1.1 (H:36-248) Photosynthetic reaction centre {Rhodobacter sphaeroides [TaxId: 1063]} Back     information, alignment and structure
>d2i5nh1 b.41.1.1 (H:37-258) Photosynthetic reaction centre {Rhodopseudomonas viridis [TaxId: 1079]} Back     information, alignment and structure
>d1eysh1 b.41.1.1 (H:59-259) Photosynthetic reaction centre {Thermochromatium tepidum [TaxId: 1050]} Back     information, alignment and structure
>d2f1la1 b.41.1.4 (A:101-175) 16S rRNA processing protein RimM, C-terminal domain {Pseudomonas aeruginosa [TaxId: 287]} Back     information, alignment and structure