Citrus Sinensis ID: 028285


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-
MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKARAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFSKTSSSSVSQVNGKGNIEVEAKAAITGEADGVA
cccHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHcccccccHHHHHHHHHHHHccccccccccccccHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHccccccccHHHHHHHHHHHHHccccccccccccccccccHHHHHHHHcccccccc
ccccHHEEEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHccccEEEcHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHcccccccccccccccccccccEEEEEccccccc
MAADAEKLATVATttdnakasaegnpknnvgpmdleqnpgadvnssAVEVkkngndsktaVTITavspvsgdadlvTDTQKKIRRAerfgmpvqmseeeKRNTraerfgtgsktqgsevsKTSEELKRKARAerfglpvpssvsEEEAKRKARLarfapypktdsveeDKRKARALRFsktssssvsqvngkgnIEVEAKAAItgeadgva
maadaeklatvatttdnakasaegnpknnvgpMDLEQNPGADVNSSAVEVKkngndsktavtitavspvsgdadlvtdTQKKIRraerfgmpvqmseeekrntraerfgtgsktqgsevsktseelkrkaraerfglpvpssvseeeakrkarlarfapypktdsveedkrKARAlrfsktssssvsqvngkgnieveakaaitgeadgva
MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKARAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFsktssssvsQVNGKGNIEVEAKAAITGEADGVA
*******************************************************************************************************************************************************************************************************************
****************************************************************************************F****************************************************************************************************************AITGEADG**
***********************GNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPV**********************************RKARAERFGLP**************RLARFAPYP******************************KGNIEVEAKAAITGEADGVA
****AEKLATVATTTDNAK********************************************************VTDTQKKIRRAERFGMPVQMSEE**********************************************************FAP*************************************VEAKAAITGEA****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKARAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFSKTSSSSVSQVNGKGNIEVEAKAAITGEADGVA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query211 2.2.26 [Sep-21-2011]
Q9D1J3210 SAP domain-containing rib yes no 0.469 0.471 0.4 0.0006
Q498U4210 SAP domain-containing rib yes no 0.469 0.471 0.4 0.0007
>sp|Q9D1J3|SARNP_MOUSE SAP domain-containing ribonucleoprotein OS=Mus musculus GN=Sarnp PE=1 SV=3 Back     alignment and function desciption
 Score = 44.3 bits (103), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 46/115 (40%), Positives = 59/115 (51%), Gaps = 16/115 (13%)

Query: 76  VTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGS---KTQGSEVSKTSEELKRKARA 132
           +  T++  +RAERF +PV +  E K+  RA RFG  S   K   S+        K K RA
Sbjct: 97  IPQTERMQKRAERFNVPVSL--ESKKAARAARFGISSVPTKGLSSDTKPMVNLDKLKERA 154

Query: 133 ERFGLPVPSSV---SEEEAKRKARLARF------APYPKTDSVEEDKRKARALRF 178
           +RFGL V SS+   SE++ K K R  RF      A    T+  E  KRK RA RF
Sbjct: 155 QRFGLNV-SSISRKSEDDEKLKKRKERFGIVTSSAGTGTTEDTEAKKRK-RAERF 207




Binds both single-stranded and double-stranded DNA with higher affinity for the single-stranded form. Specifically binds to scaffold/matrix attachment region DNA. Also binds single-stranded RNA. Enhances RNA unwinding activity of DDX39A. May participate in important transcriptional or translational control of cell growth, metabolism and carcinogenesis.
Mus musculus (taxid: 10090)
>sp|Q498U4|SARNP_RAT SAP domain-containing ribonucleoprotein OS=Rattus norvegicus GN=Sarnp PE=1 SV=3 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query211
356505170189 PREDICTED: uncharacterized protein LOC10 0.838 0.936 0.553 1e-45
225439793218 PREDICTED: uncharacterized protein LOC10 0.753 0.729 0.619 5e-45
356572381198 PREDICTED: uncharacterized protein LOC10 0.876 0.934 0.532 7e-43
297741508120 unnamed protein product [Vitis vinifera] 0.568 1.0 0.677 3e-36
449448693204 PREDICTED: uncharacterized protein LOC10 0.691 0.715 0.556 5e-34
224069298192 predicted protein [Populus trichocarpa] 0.563 0.619 0.672 9e-34
297810363214 hypothetical protein ARALYDRAFT_487051 [ 0.644 0.635 0.604 2e-33
15241954214 uncharacterized protein [Arabidopsis tha 0.928 0.915 0.464 7e-32
357510501196 hypothetical protein MTR_7g100260 [Medic 0.582 0.627 0.658 8e-32
224140141178 predicted protein [Populus trichocarpa] 0.734 0.870 0.489 2e-29
>gi|356505170|ref|XP_003521365.1| PREDICTED: uncharacterized protein LOC100816435 [Glycine max] Back     alignment and taxonomy information
 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 103/186 (55%), Positives = 134/186 (72%), Gaps = 9/186 (4%)

Query: 11  VATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTAVTITAVSPVS 70
           + T TDN    A+ NP   + P + +  P  + +++A        D+K +   T+  P S
Sbjct: 1   MTTPTDNNNNPAQENPNKTLDPTEPDPIPATEDDAAA--------DTKDSDAKTSPPPDS 52

Query: 71  GDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGTGSKTQGSEVSKTSEELKRKA 130
           G+   ++D QKK+RRAERFG+ VQ+SE+EKRN+RAERFGTGS +QGSE SK SEELKRKA
Sbjct: 53  GNDAPLSDIQKKMRRAERFGISVQLSEKEKRNSRAERFGTGSASQGSEPSK-SEELKRKA 111

Query: 131 RAERFGLPVPSSVSEEEAKRKARLARFAPYPKTDSVEEDKRKARALRFSKTSSSSVSQVN 190
           RAERFG+P P++ S+EEAK+KARLARFAP  K D  EEDKRKARALRF+  SS+S++ VN
Sbjct: 112 RAERFGMPSPTTTSDEEAKKKARLARFAPASKVDPQEEDKRKARALRFANPSSTSIANVN 171

Query: 191 GKGNIE 196
           G+G IE
Sbjct: 172 GEGKIE 177




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225439793|ref|XP_002276541.1| PREDICTED: uncharacterized protein LOC100249849 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356572381|ref|XP_003554347.1| PREDICTED: uncharacterized protein LOC100791970 [Glycine max] Back     alignment and taxonomy information
>gi|297741508|emb|CBI32640.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449448693|ref|XP_004142100.1| PREDICTED: uncharacterized protein LOC101221301 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224069298|ref|XP_002326323.1| predicted protein [Populus trichocarpa] gi|222833516|gb|EEE71993.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297810363|ref|XP_002873065.1| hypothetical protein ARALYDRAFT_487051 [Arabidopsis lyrata subsp. lyrata] gi|297318902|gb|EFH49324.1| hypothetical protein ARALYDRAFT_487051 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|15241954|ref|NP_195897.1| uncharacterized protein [Arabidopsis thaliana] gi|14190431|gb|AAK55696.1|AF378893_1 AT5g02770/F9G14_80 [Arabidopsis thaliana] gi|7413552|emb|CAB86031.1| putative protein [Arabidopsis thaliana] gi|15215889|gb|AAK91488.1| AT5g02770/F9G14_80 [Arabidopsis thaliana] gi|62320777|dbj|BAD95440.1| putative protein [Arabidopsis thaliana] gi|332003132|gb|AED90515.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|357510501|ref|XP_003625539.1| hypothetical protein MTR_7g100260 [Medicago truncatula] gi|355500554|gb|AES81757.1| hypothetical protein MTR_7g100260 [Medicago truncatula] Back     alignment and taxonomy information
>gi|224140141|ref|XP_002323443.1| predicted protein [Populus trichocarpa] gi|222868073|gb|EEF05204.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query211
TAIR|locus:2151271214 MOS11 "AT5G02770" [Arabidopsis 0.966 0.953 0.435 1.2e-33
UNIPROTKB|P82979210 SARNP "SAP domain-containing r 0.715 0.719 0.352 1.5e-10
UNIPROTKB|F1PVE3210 SARNP "Uncharacterized protein 0.715 0.719 0.347 2e-10
UNIPROTKB|F1SPJ5210 SARNP "Uncharacterized protein 0.715 0.719 0.352 2e-10
UNIPROTKB|Q567R9150 CIP29 "CIP29 protein" [Homo sa 0.592 0.833 0.392 2.5e-10
UNIPROTKB|Q2TBX1210 CIP29 "Uncharacterized protein 0.715 0.719 0.341 6.2e-10
UNIPROTKB|H0YHG0523 H0YHG0 "Uncharacterized protei 0.715 0.288 0.352 1.5e-09
RGD|1305692210 Sarnp "SAP domain containing r 0.715 0.719 0.347 3e-09
UNIPROTKB|H9L0M1199 SARNP "Uncharacterized protein 0.710 0.753 0.327 3.5e-09
UNIPROTKB|H9L222210 SARNP "Uncharacterized protein 0.710 0.714 0.327 7.8e-09
TAIR|locus:2151271 MOS11 "AT5G02770" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 366 (133.9 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 94/216 (43%), Positives = 122/216 (56%)

Query:     1 MAADAEKLATVATTTDNAKASAEGNPKNNVGPMDLEQNPGADVNSSAVEVKKNGNDSKTA 60
             MA + EK+   AT  +    S   NPK  V     E +   D+    V+   +  + K  
Sbjct:     1 MATNGEKVT--ATVVNGGGLSTGENPKKIVDLNTTELDRTDDILDGEVKGFSDSGEKKEE 58

Query:    61 VTITAVSPV----SGDADLVTDTQKKIRRAERFGMPVQMSEEEKRNTRAERFGT--GSKT 114
                  +       SGD   V D QKKIRRAERFG+ V+++EEEKRN+RAERFGT   +  
Sbjct:    59 TDSNGIGSTAGVDSGDISPVDDIQKKIRRAERFGVSVKLTEEEKRNSRAERFGTVAAAVV 118

Query:   115 QGSEVSKTSEELKRKARAERFGLPVPSSVS---EEEAKRKARLARFAPYPKTDSVEEDKR 171
              GSE +K +EELKRKARA+RFG+P  +S +   EEEAK+KARLARF    K DS EE+KR
Sbjct:   119 NGSEGTKKAEELKRKARADRFGVPSATSTTDKTEEEAKKKARLARFGKETKVDSAEENKR 178

Query:   172 KARALRFXXXXXXXXXQVNGKGNIEVEAKAAITGEA 207
             KARALRF             +  I +  +AA++G A
Sbjct:   179 KARALRFSKEASADASSDLPEKQI-IGKEAAVSGSA 213




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0016973 "poly(A)+ mRNA export from nucleus" evidence=IMP
UNIPROTKB|P82979 SARNP "SAP domain-containing ribonucleoprotein" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1PVE3 SARNP "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F1SPJ5 SARNP "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|Q567R9 CIP29 "CIP29 protein" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q2TBX1 CIP29 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|H0YHG0 H0YHG0 "Uncharacterized protein" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
RGD|1305692 Sarnp "SAP domain containing ribonucleoprotein" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|H9L0M1 SARNP "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|H9L222 SARNP "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 211
KOG4259260 consensus Putative nucleic acid-binding protein Hc 99.79
KOG4259260 consensus Putative nucleic acid-binding protein Hc 99.48
>KOG4259 consensus Putative nucleic acid-binding protein Hcc-1/proliferation associated cytokine-inducible protein, contains SAP domain [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information
Probab=99.79  E-value=3.9e-19  Score=156.15  Aligned_cols=127  Identities=35%  Similarity=0.415  Sum_probs=80.5

Q ss_pred             ccCCCCCcceeecCCCCCCCCCCCCChHHHHHHHHh---hhCCCCCCCH---HHHHHHHHhhhCCCCCCCC-------C-
Q 028285           52 KNGNDSKTAVTITAVSPVSGDADLVTDTQKKIRRAE---RFGMPVQMSE---EEKRNTRAERFGTGSKTQG-------S-  117 (211)
Q Consensus        52 k~~~~~kk~vkIt~~~~~~~~~~~lSe~EKk~~RAe---RFGip~~lse---~eKkk~RAeRFGl~~~~~~-------s-  117 (211)
                      +....++++|.|+++.    -............|++   ||+.|+++.+   .+++..||+||||+.....       + 
T Consensus       106 k~~~~Eks~v~~tstg----k~~E~paet~~~srae~~~rf~~Pvvae~k~a~e~laaRAkRFgIp~d~t~i~sadnKas  181 (260)
T KOG4259|consen  106 KIISKEKSQVPETSTG----KEAEEPAETTEESRAEVSNRFSSPVVAEEKTAQEKLAARAKRFGIPVDDTQIKSADNKAS  181 (260)
T ss_pred             hhhhhccccccccccc----cccccchhhhhhhhcccccccCCCcccccccchHHHHHHHHhcCCCCchHHHHhhccchh
Confidence            3344577888887743    1123333445556666   6666665433   3566666666666533210       0 


Q ss_pred             -------ccccchHHHHHHHHHhhhCCCCCCC--cc---hHHHHHHHHhhhcCCCCCCCchHHHHH-HHHHhccCCCCC
Q 028285          118 -------EVSKTSEELKRKARAERFGLPVPSS--VS---EEEAKRKARLARFAPYPKTDSVEEDKR-KARALRFSKTSS  183 (211)
Q Consensus       118 -------~~~k~~eeeKlKkRAERFG~~~~~~--~~---~~eeKkkkRaERFG~~s~~d~~eE~Kk-kkRaeRFg~~~s  183 (211)
                             ........+++|.||+|||+++++.  ..   +...|+++|++|||. ...+...|+|| ++|+||||+..+
T Consensus       182 ~a~~fG~~~~~p~a~dklk~rAqrfg~~v~s~Sr~s~~de~~~kl~arkkRfgg-titde~tEAKKaRaRaERFgtA~~  259 (260)
T KOG4259|consen  182 SAANFGNKIQQPLASDKLKNRAQRFGPQVRSNSRSSQRDENAPKLSARKKRFGG-TITDEPTEAKKARARAERFGTAAK  259 (260)
T ss_pred             hhhhcCCcccchhhhHHHHHHHHhcCCCCCcccccCccccccchhhhhHHhcCC-CCCCchhhHHHHHHHHHHhcccCC
Confidence                   0011235679999999999999864  11   223599999999993 34566778887 999999998765



>KOG4259 consensus Putative nucleic acid-binding protein Hcc-1/proliferation associated cytokine-inducible protein, contains SAP domain [Cell cycle control, cell division, chromosome partitioning] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00