Citrus Sinensis ID: 028341


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210
MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKARAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFSKTSSSSVSQVNGKGNIEVEAAAITGEADGVA
cccHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHccccccccHHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHccccccccccHHHHHHHHHHHHHccccccccHHHHHHHHHHHcccccccccHHccccccccHHHHHHHHccccccc
ccccHHEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHccccEEEcHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHccccccccccccccccccccEEEEEccccccc
MAADAEKLATVATttdnakasaegnpknnvgpmdleqnpgadvnssAVEVkkngndsktaVTITavspvsgdadlvTDTQKKIRRAerfgmpvqmseeeKRNTraerfgtgsktqgsevsKTSEELKRKARAerfglpvpssvsEEEAKRKARLarfapypktdsveeDKRKARALRFsktssssvsqvngkgnIEVEAAAItgeadgva
maadaeklatvatttdnakasaegnpknnvgpMDLEQNPGADVNSSAVEVKkngndsktavtitavspvsgdadlvtdTQKKIRraerfgmpvqmseeekrntraerfgtgsktqgsevsktseelkrkaraerfglpvpssvseeeakrkarlarfapypktdsveedkrKARALrfsktssssvsqvngkgnieVEAAAITGEADGVA
MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKARAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFsktssssvsQVNGKGNIEVEAAAITGEADGVA
******************************************************************************************************************************************************************************************************************
*********************************************************************************************************************************************************************************************************ITGEAD***
***********************GNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPV**********************************RKARAERFGLP**************RLARFAPYP*****************************GKGNIEVEAAAITGEADGVA
****AEKLATVATTTDNAK*********************************************************TDTQKKIRRAERFGMPVQMSEE**********************************************************FAP***************************************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKARAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFSKTSSSSVSQVNGKGNIEVEAAAITGEADGVA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query210 2.2.26 [Sep-21-2011]
Q9D1J3210 SAP domain-containing rib yes no 0.471 0.471 0.4 0.0006
Q498U4210 SAP domain-containing rib yes no 0.471 0.471 0.4 0.0008
>sp|Q9D1J3|SARNP_MOUSE SAP domain-containing ribonucleoprotein OS=Mus musculus GN=Sarnp PE=1 SV=3 Back     alignment and function desciption
 Score = 44.3 bits (103), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 46/115 (40%), Positives = 59/115 (51%), Gaps = 16/115 (13%)

Query: 76  VTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGS---KTQGSEVSKTSEELKRKARA 132
           +  T++  +RAERF +PV +  E K+  RA RFG  S   K   S+        K K RA
Sbjct: 97  IPQTERMQKRAERFNVPVSL--ESKKAARAARFGISSVPTKGLSSDTKPMVNLDKLKERA 154

Query: 133 ERFGLPVPSSV---SEEEAKRKARLARF------APYPKTDSVEEDKRKARALRF 178
           +RFGL V SS+   SE++ K K R  RF      A    T+  E  KRK RA RF
Sbjct: 155 QRFGLNV-SSISRKSEDDEKLKKRKERFGIVTSSAGTGTTEDTEAKKRK-RAERF 207




Binds both single-stranded and double-stranded DNA with higher affinity for the single-stranded form. Specifically binds to scaffold/matrix attachment region DNA. Also binds single-stranded RNA. Enhances RNA unwinding activity of DDX39A. May participate in important transcriptional or translational control of cell growth, metabolism and carcinogenesis.
Mus musculus (taxid: 10090)
>sp|Q498U4|SARNP_RAT SAP domain-containing ribonucleoprotein OS=Rattus norvegicus GN=Sarnp PE=1 SV=3 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query210
356505170189 PREDICTED: uncharacterized protein LOC10 0.842 0.936 0.553 1e-45
225439793218 PREDICTED: uncharacterized protein LOC10 0.757 0.729 0.611 2e-43
356572381198 PREDICTED: uncharacterized protein LOC10 0.880 0.934 0.532 8e-43
297741508120 unnamed protein product [Vitis vinifera] 0.528 0.925 0.666 1e-34
449448693204 PREDICTED: uncharacterized protein LOC10 0.728 0.75 0.540 4e-34
224069298192 predicted protein [Populus trichocarpa] 0.566 0.619 0.672 8e-34
297810363214 hypothetical protein ARALYDRAFT_487051 [ 0.647 0.635 0.608 2e-33
15241954214 uncharacterized protein [Arabidopsis tha 0.961 0.943 0.470 7e-32
357510501196 hypothetical protein MTR_7g100260 [Medic 0.585 0.627 0.658 8e-32
224140141178 predicted protein [Populus trichocarpa] 0.738 0.870 0.489 2e-29
>gi|356505170|ref|XP_003521365.1| PREDICTED: uncharacterized protein LOC100816435 [Glycine max] Back     alignment and taxonomy information
 Score =  188 bits (477), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 103/186 (55%), Positives = 134/186 (72%), Gaps = 9/186 (4%)

Query: 11  VATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVS 70
           + T TDN    A+ NP   + P + +  P  + +++A        D+K +   T+  P S
Sbjct: 1   MTTPTDNNNNPAQENPNKTLDPTEPDPIPATEDDAAA--------DTKDSDAKTSPPPDS 52

Query: 71  GDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKA 130
           G+   ++D QKK+RRAERFG+ VQ+SE+EKRN+RAERFGTGS +QGSE SK SEELKRKA
Sbjct: 53  GNDAPLSDIQKKMRRAERFGISVQLSEKEKRNSRAERFGTGSASQGSEPSK-SEELKRKA 111

Query: 131 RAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFSKTSSSSVSQVN 190
           RAERFG+P P++ S+EEAK+KARLARFAP  K D  EEDKRKARALRF+  SS+S++ VN
Sbjct: 112 RAERFGMPSPTTTSDEEAKKKARLARFAPASKVDPQEEDKRKARALRFANPSSTSIANVN 171

Query: 191 GKGNIE 196
           G+G IE
Sbjct: 172 GEGKIE 177




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225439793|ref|XP_002276541.1| PREDICTED: uncharacterized protein LOC100249849 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356572381|ref|XP_003554347.1| PREDICTED: uncharacterized protein LOC100791970 [Glycine max] Back     alignment and taxonomy information
>gi|297741508|emb|CBI32640.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449448693|ref|XP_004142100.1| PREDICTED: uncharacterized protein LOC101221301 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224069298|ref|XP_002326323.1| predicted protein [Populus trichocarpa] gi|222833516|gb|EEE71993.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297810363|ref|XP_002873065.1| hypothetical protein ARALYDRAFT_487051 [Arabidopsis lyrata subsp. lyrata] gi|297318902|gb|EFH49324.1| hypothetical protein ARALYDRAFT_487051 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|15241954|ref|NP_195897.1| uncharacterized protein [Arabidopsis thaliana] gi|14190431|gb|AAK55696.1|AF378893_1 AT5g02770/F9G14_80 [Arabidopsis thaliana] gi|7413552|emb|CAB86031.1| putative protein [Arabidopsis thaliana] gi|15215889|gb|AAK91488.1| AT5g02770/F9G14_80 [Arabidopsis thaliana] gi|62320777|dbj|BAD95440.1| putative protein [Arabidopsis thaliana] gi|332003132|gb|AED90515.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|357510501|ref|XP_003625539.1| hypothetical protein MTR_7g100260 [Medicago truncatula] gi|355500554|gb|AES81757.1| hypothetical protein MTR_7g100260 [Medicago truncatula] Back     alignment and taxonomy information
>gi|224140141|ref|XP_002323443.1| predicted protein [Populus trichocarpa] gi|222868073|gb|EEF05204.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query210
TAIR|locus:2151271214 MOS11 "AT5G02770" [Arabidopsis 0.971 0.953 0.437 7.4e-34
UNIPROTKB|P82979210 SARNP "SAP domain-containing r 0.719 0.719 0.352 1.5e-10
UNIPROTKB|F1PVE3210 SARNP "Uncharacterized protein 0.719 0.719 0.347 2e-10
UNIPROTKB|F1SPJ5210 SARNP "Uncharacterized protein 0.719 0.719 0.352 2e-10
UNIPROTKB|Q567R9150 CIP29 "CIP29 protein" [Homo sa 0.595 0.833 0.392 2.5e-10
UNIPROTKB|Q2TBX1210 CIP29 "Uncharacterized protein 0.719 0.719 0.341 5e-10
UNIPROTKB|H0YHG0523 H0YHG0 "Uncharacterized protei 0.719 0.288 0.352 1.5e-09
RGD|1305692210 Sarnp "SAP domain containing r 0.719 0.719 0.347 2.7e-09
UNIPROTKB|H9L0M1199 SARNP "Uncharacterized protein 0.714 0.753 0.327 3.2e-09
UNIPROTKB|H9L222210 SARNP "Uncharacterized protein 0.714 0.714 0.327 7.1e-09
TAIR|locus:2151271 MOS11 "AT5G02770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 368 (134.6 bits), Expect = 7.4e-34, P = 7.4e-34
 Identities = 94/215 (43%), Positives = 121/215 (56%)

Query:     1 MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTA 60
             MA + EK+   AT  +    S   NPK  V     E +   D+    V+   +  + K  
Sbjct:     1 MATNGEKVT--ATVVNGGGLSTGENPKKIVDLNTTELDRTDDILDGEVKGFSDSGEKKEE 58

Query:    61 VTITAVSPV----SGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGT--GSKT 114
                  +       SGD   V D QKKIRRAERFG+ V+++EEEKRN+RAERFGT   +  
Sbjct:    59 TDSNGIGSTAGVDSGDISPVDDIQKKIRRAERFGVSVKLTEEEKRNSRAERFGTVAAAVV 118

Query:   115 QGSEVSKTSEELKRKARAERFGLPVPSSVS---EEEAKRKARLARFAPYPKTDSVEEDKR 171
              GSE +K +EELKRKARA+RFG+P  +S +   EEEAK+KARLARF    K DS EE+KR
Sbjct:   119 NGSEGTKKAEELKRKARADRFGVPSATSTTDKTEEEAKKKARLARFGKETKVDSAEENKR 178

Query:   172 KARALRFXXXXXXXXXQVNGKGNIEVEAAAITGEA 206
             KARALRF             +  I  + AA++G A
Sbjct:   179 KARALRFSKEASADASSDLPEKQIIGKEAAVSGSA 213




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0016973 "poly(A)+ mRNA export from nucleus" evidence=IMP
UNIPROTKB|P82979 SARNP "SAP domain-containing ribonucleoprotein" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1PVE3 SARNP "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F1SPJ5 SARNP "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|Q567R9 CIP29 "CIP29 protein" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q2TBX1 CIP29 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|H0YHG0 H0YHG0 "Uncharacterized protein" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
RGD|1305692 Sarnp "SAP domain containing ribonucleoprotein" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|H9L0M1 SARNP "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|H9L222 SARNP "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00023087001
SubName- Full=Chromosome chr8 scaffold_29, whole genome shotgun sequence; (218 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 210
KOG4259260 consensus Putative nucleic acid-binding protein Hc 99.79
KOG4259260 consensus Putative nucleic acid-binding protein Hc 99.5
>KOG4259 consensus Putative nucleic acid-binding protein Hcc-1/proliferation associated cytokine-inducible protein, contains SAP domain [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
Probab=99.79  E-value=3.7e-19  Score=156.19  Aligned_cols=125  Identities=34%  Similarity=0.421  Sum_probs=80.5

Q ss_pred             CCCCCcceeccCCCCCCCCCCCCChHHHHHHHHh---hhCCCCCCCH---HHHHHHHHhhhCCCCCCCC-------C---
Q 028341           54 GNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAE---RFGMPVQMSE---EEKRNTRAERFGTGSKTQG-------S---  117 (210)
Q Consensus        54 ~~~~kk~vkIt~~~~~~~~~~~lSd~EKk~~RAe---RFGip~~lse---~eKkk~RAeRFGl~~~~~~-------s---  117 (210)
                      ...++++|.|+++.    -............|++   ||+.|+++-+   .+++.+||+||||+.....       +   
T Consensus       108 ~~~Eks~v~~tstg----k~~E~paet~~~srae~~~rf~~Pvvae~k~a~e~laaRAkRFgIp~d~t~i~sadnKas~a  183 (260)
T KOG4259|consen  108 ISKEKSQVPETSTG----KEAEEPAETTEESRAEVSNRFSSPVVAEEKTAQEKLAARAKRFGIPVDDTQIKSADNKASSA  183 (260)
T ss_pred             hhhccccccccccc----cccccchhhhhhhhcccccccCCCcccccccchHHHHHHHHhcCCCCchHHHHhhccchhhh
Confidence            44577777777743    1123344555666666   7777665533   3666777777776643210       0   


Q ss_pred             -----CcccchHHHHHHHHHHhhCCCCCCC--cc---hHHHHHHHHhhhcCCCCCCCchHHHHH-HHHHhccCCCCC
Q 028341          118 -----EVSKTSEELKRKARAERFGLPVPSS--VS---EEEAKRKARLARFAPYPKTDSVEEDKR-KARALRFSKTSS  183 (210)
Q Consensus       118 -----~~~K~~eeeKlKkRAERFG~~~~~~--~~---~eeeKkkkRaERFG~~~~~d~~ee~Kk-kkRaeRFg~~~~  183 (210)
                           ........+++|.||+|||+++++.  ..   +-..|+++|++|||. .-.+...|+|+ ++|+||||+..+
T Consensus       184 ~~fG~~~~~p~a~dklk~rAqrfg~~v~s~Sr~s~~de~~~kl~arkkRfgg-titde~tEAKKaRaRaERFgtA~~  259 (260)
T KOG4259|consen  184 ANFGNKIQQPLASDKLKNRAQRFGPQVRSNSRSSQRDENAPKLSARKKRFGG-TITDEPTEAKKARARAERFGTAAK  259 (260)
T ss_pred             hhcCCcccchhhhHHHHHHHHhcCCCCCcccccCccccccchhhhhHHhcCC-CCCCchhhHHHHHHHHHHhcccCC
Confidence                 0111245679999999999999865  12   223599999999994 23455677777 999999998765



>KOG4259 consensus Putative nucleic acid-binding protein Hcc-1/proliferation associated cytokine-inducible protein, contains SAP domain [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00