Citrus Sinensis ID: 024112


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270--
MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVTASASMPQSEPVKRKRGRPRKYGPDGSVSLALSPSVSTHPGTISPTQKRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYEGRFEILCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCLSS
ccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccHHHHHHHHHHHHccccEEEEEEccEEEEEEEEccccccccEEEEEEEEEEEEEEEEEEccccccccccccEEEEEEccccEEEcEEccEEEEEcccEEEEEEEcccc
ccccccccccccccEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHccccccccccEEEEEcccHHHHHHHHHHHHccccEEEEEEcccEEEEEEEEcccccccEEEEEcEEEEEEEccccccccccccccccccEEEEEEcccccEEEccEHHHHEccccEEEEEEEEccc
mdrrdglalpgsasfymqrgmtgsgsgtqpslhgspgihplsnpslqfqsniggstigstlsvdpssaisphgvnvtasasmpqsepvkrkrgrprkygpdgsvslalspsvsthpgtisptqkrgrgrppgtgrkqQVSSlgeslsgsagmgftphVITVAVGEDIAMKLLSFSQQGPRAICVLSangaistatlrqpsssggsvtyeGRFEILCLSgsyllsgnggsrnrsgglsvslaspdgrvigggVGGMLIAANNVQVTFRLCLSS
mdrrdglalpgSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGvnvtasasmpqsepvkrkrgrprkygpdgsvslalspsvsthpgtisptqkrgrgrppgTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAIStatlrqpsssggsVTYEGRFEILCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCLSS
MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVTASASMPQSEpvkrkrgrprkygpdgSVSLALSPSVSTHPGTISptqkrgrgrppgtgrkqqVsslgeslsgsAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYEGRFEIlclsgsyllsgnggsrnrsgglsvslASPDgrvigggvggMLIAANNVQVTFRLCLSS
*******************************************************************************************************************************************************MGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTA***********VTYEGRFEILCLSGSYLLS*******************DGRVIGGGVGGMLIAANNVQVTFRLCL**
******************************************************************************************************************************************************GMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYEGRFEILCLSGS***************LSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCLS*
********LPGSASFYMQRGM*************SPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVT**************************VSLALSP*************************************GSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTAT**********VTYEGRFEILCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCLSS
**************FY**RGMTGSGSGTQPSLHGSPGIHPLSNPSLQ*******************************************************S**L********************************VSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYEGRFEILCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCLSS
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVTASASMPQSEPVKRKRGRPRKYGPDGSVSLALSPSVSTHPGTISPTQKRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYEGRFEILCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCLSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query272 2.2.26 [Sep-21-2011]
Q9S7C9311 Putative DNA-binding prot no no 0.352 0.308 0.379 8e-08
>sp|Q9S7C9|ESCA_ARATH Putative DNA-binding protein ESCAROLA OS=Arabidopsis thaliana GN=ESC PE=2 SV=1 Back     alignment and function desciption
 Score = 57.8 bits (138), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 58/108 (53%), Gaps = 12/108 (11%)

Query: 124 KRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAIC 183
           KR RGRPPG+  K +   +    S +A      HV+ V+ G DI   + +++++  R + 
Sbjct: 86  KRPRGRPPGSKNKAKPPIIVTRDSPNA---LRSHVLEVSPGADIVESVSTYARRRGRGVS 142

Query: 184 VLSANGAISTATLRQPSS---------SGGSVTYEGRFEILCLSGSYL 222
           VL  NG +S  TLRQP +          GG VT  GRFEIL L+G+ L
Sbjct: 143 VLGGNGTVSNVTLRQPVTPGNGGGVSGGGGVVTLHGRFEILSLTGTVL 190





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query272
255541558340 DNA binding protein, putative [Ricinus c 0.970 0.776 0.803 1e-109
133907524340 AT-hook DNA-binding protein [Gossypium h 0.970 0.776 0.766 1e-107
225454180345 PREDICTED: uncharacterized protein LOC10 0.970 0.765 0.795 1e-102
224130232336 predicted protein [Populus trichocarpa] 0.952 0.770 0.754 9e-96
358249184341 uncharacterized protein LOC100814615 [Gl 0.959 0.765 0.717 1e-91
224067876328 predicted protein [Populus trichocarpa] 0.930 0.771 0.708 2e-90
356568374342 PREDICTED: uncharacterized protein LOC10 0.959 0.763 0.698 5e-89
356504535340 PREDICTED: uncharacterized protein LOC10 0.970 0.776 0.687 3e-86
356520420340 PREDICTED: uncharacterized protein LOC10 0.970 0.776 0.676 9e-86
449441474334 PREDICTED: uncharacterized protein LOC10 0.871 0.709 0.666 4e-85
>gi|255541558|ref|XP_002511843.1| DNA binding protein, putative [Ricinus communis] gi|223549023|gb|EEF50512.1| DNA binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 213/265 (80%), Positives = 243/265 (91%), Gaps = 1/265 (0%)

Query: 1   MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGST 60
           MDRRD +A+ GSASFYMQRGMTGSGSGTQ  L+ S GI+PL++ ++ FQSN+G +TIGST
Sbjct: 1   MDRRDAMAMSGSASFYMQRGMTGSGSGTQSGLNVSSGINPLTSTNVSFQSNVGANTIGST 60

Query: 61  LSVDPSSAISPHGVNVTASASMPQ-SEPVKRKRGRPRKYGPDGSVSLALSPSVSTHPGTI 119
           L ++ S+AI PHGVNV AS+ MP   EPVKRKRGRPRKYGPDG+VSLALSPS+STHPGTI
Sbjct: 61  LPLETSTAIPPHGVNVGASSLMPPPGEPVKRKRGRPRKYGPDGTVSLALSPSLSTHPGTI 120

Query: 120 SPTQKRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGP 179
           +PTQKRGRGRPPGTGRKQQ++SLGE LSGSAGMGFTPH+IT+AVGEDIA K++SFSQQGP
Sbjct: 121 TPTQKRGRGRPPGTGRKQQLASLGEWLSGSAGMGFTPHIITIAVGEDIATKIMSFSQQGP 180

Query: 180 RAICVLSANGAISTATLRQPSSSGGSVTYEGRFEILCLSGSYLLSGNGGSRNRSGGLSVS 239
           RAIC+LSANGA+ST TLRQPS+SGGSVTYEGRFEILCLSGSYL++ NGGSRNR+GGLSVS
Sbjct: 181 RAICILSANGAVSTVTLRQPSTSGGSVTYEGRFEILCLSGSYLVTSNGGSRNRTGGLSVS 240

Query: 240 LASPDGRVIGGGVGGMLIAANNVQV 264
           LASPDGRVIGGGVGGMLIAA+ VQV
Sbjct: 241 LASPDGRVIGGGVGGMLIAASPVQV 265




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|133907524|gb|ABO42262.1| AT-hook DNA-binding protein [Gossypium hirsutum] Back     alignment and taxonomy information
>gi|225454180|ref|XP_002272142.1| PREDICTED: uncharacterized protein LOC100265498 [Vitis vinifera] gi|297745264|emb|CBI40344.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224130232|ref|XP_002320785.1| predicted protein [Populus trichocarpa] gi|222861558|gb|EEE99100.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|358249184|ref|NP_001239751.1| uncharacterized protein LOC100814615 [Glycine max] gi|255636132|gb|ACU18409.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|224067876|ref|XP_002302577.1| predicted protein [Populus trichocarpa] gi|222844303|gb|EEE81850.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356568374|ref|XP_003552386.1| PREDICTED: uncharacterized protein LOC100802542 isoform 1 [Glycine max] gi|356568376|ref|XP_003552387.1| PREDICTED: uncharacterized protein LOC100802542 isoform 2 [Glycine max] Back     alignment and taxonomy information
>gi|356504535|ref|XP_003521051.1| PREDICTED: uncharacterized protein LOC100783475 [Glycine max] Back     alignment and taxonomy information
>gi|356520420|ref|XP_003528860.1| PREDICTED: uncharacterized protein LOC100799791 [Glycine max] Back     alignment and taxonomy information
>gi|449441474|ref|XP_004138507.1| PREDICTED: uncharacterized protein LOC101203138 [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query272
TAIR|locus:2050766348 AT2G45850 [Arabidopsis thalian 0.944 0.738 0.397 2.7e-36
TAIR|locus:2098861354 AT3G61310 [Arabidopsis thalian 0.955 0.734 0.384 5.1e-35
TAIR|locus:2031306361 AT1G63480 [Arabidopsis thalian 0.419 0.315 0.426 4.3e-26
TAIR|locus:2051038351 AT2G33620 [Arabidopsis thalian 0.419 0.324 0.456 1.3e-25
TAIR|locus:2031321378 AT1G63470 [Arabidopsis thalian 0.419 0.301 0.408 5.2e-25
TAIR|locus:2118091356 AHL1 "AT-hook motif nuclear-lo 0.419 0.320 0.508 5.9e-25
TAIR|locus:2126946318 AT4G00200 [Arabidopsis thalian 0.419 0.358 0.5 4.2e-24
TAIR|locus:2153142 419 AHL4 "AT-HOOK MOTIF NUCLEAR LO 0.408 0.264 0.450 1.1e-22
TAIR|locus:2132599334 AT4G22770 [Arabidopsis thalian 0.422 0.344 0.452 5.5e-22
TAIR|locus:2122684 404 AHL3 "AT-HOOK MOTIF NUCLEAR LO 0.419 0.282 0.447 1.7e-20
TAIR|locus:2050766 AT2G45850 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 391 (142.7 bits), Expect = 2.7e-36, P = 2.7e-36
 Identities = 111/279 (39%), Positives = 137/279 (49%)

Query:     1 MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSP----GIHPLSNPSLQFQSNIGGST 56
             MDRRD + L GS S+Y+ RG++GSG    P+ HGSP    G+  L N +  F S  G + 
Sbjct:     1 MDRRDAMGLSGSGSYYIHRGLSGSGP---PTFHGSPQQQQGLRHLPNQNSPFGS--GSTG 55

Query:    57 IGS-TLSVDPSSAIS-------PH--GVNVTASASMPQSEXXXXXXXXXXXXXXXXSVSL 106
              GS +L  DPS A +       PH  GVN+ A    P                   SVSL
Sbjct:    56 FGSPSLHGDPSLATAAGGAGALPHHIGVNMIAPPPPPSETPMKRKRGRPRKYGQDGSVSL 115

Query:   107 ALSPS-VSTHPGTISXXXXXXXXXXXXXXXXXXVXXXXXXXXXXAGMGFTPHVITVAVGE 165
             ALS S VST   T +                  +          +GM FTPHVI V++GE
Sbjct:   116 ALSSSSVSTI--TPNNSNKRGRGRPPGSGKKQRMASVGELMPSSSGMSFTPHVIAVSIGE 173

Query:   166 DIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYEGRFEIXXXXXXXXXXX 225
             DIA K+++FSQQGPRAICVLSA+GA+STATL QPS+S G++ YEGRFEI           
Sbjct:   174 DIASKVIAFSQQGPRAICVLSASGAVSTATLIQPSASPGAIKYEGRFEILALSTSYIVAT 233

Query:   226 XXXXXXXXXXXXXXXASPDXXXXXXXXXXMLIAANNVQV 264
                            ASPD           LIAA+ VQV
Sbjct:   234 DGSFRNRTGNLSVSLASPDGRVIGGAIGGPLIAASPVQV 272




GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0008150 "biological_process" evidence=ND
GO:0010103 "stomatal complex morphogenesis" evidence=RCA
TAIR|locus:2098861 AT3G61310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031306 AT1G63480 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051038 AT2G33620 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031321 AT1G63470 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2118091 AHL1 "AT-hook motif nuclear-localized protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2126946 AT4G00200 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2153142 AHL4 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2132599 AT4G22770 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2122684 AHL3 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query272
pfam03479120 pfam03479, DUF296, Domain of unknown function (DUF 4e-34
cd11378113 cd11378, DUF296, Domain of unknown function found 7e-29
COG1661141 COG1661, COG1661, Predicted DNA-binding protein wi 0.001
>gnl|CDD|217587 pfam03479, DUF296, Domain of unknown function (DUF296) Back     alignment and domain information
 Score =  119 bits (301), Expect = 4e-34
 Identities = 53/116 (45%), Positives = 65/116 (56%), Gaps = 9/116 (7%)

Query: 154 FTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPS---SSGGSVTYEG 210
             PHV+ +  GED+   L +F++Q      VLS  GA+S  TLRQP     S G VT EG
Sbjct: 1   GRPHVLRLEPGEDLVESLEAFARQRGIGAAVLSGIGAVSNVTLRQPDEEAKSYGVVTLEG 60

Query: 211 RFEILCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGV-GGMLIAANNVQVT 265
           RFEIL LSG+    G       SG L VSLA PDG+V+GG +  G + A   V VT
Sbjct: 61  RFEILSLSGTISPGG-----KPSGHLHVSLADPDGQVVGGHLAEGTVFATGEVVVT 111


This putative domain is found in proteins that contain AT-hook motifs pfam02178, which strongly suggests a DNA-binding function for the proteins as a whole. There are three highly conserved histidine residues, eg at 117, 119 and 133 in Reut_B5223, which should be a structurally conserved metal-binding unit, based on structural comparison with known metal-binding structures. The proteins should work as trimers. Length = 120

>gnl|CDD|211390 cd11378, DUF296, Domain of unknown function found in archaea, bacteria, and plants Back     alignment and domain information
>gnl|CDD|224575 COG1661, COG1661, Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 272
PF03479120 DUF296: Domain of unknown function (DUF296); Inter 99.95
COG1661141 Predicted DNA-binding protein with PD1-like DNA-bi 99.87
PF0217813 AT_hook: AT hook motif; InterPro: IPR017956 AT hoo 96.42
smart0038426 AT_hook DNA binding domain with preference for A/T 96.37
PF14621219 RFX5_DNA_bdg: RFX5 DNA-binding domain 84.09
>PF03479 DUF296: Domain of unknown function (DUF296); InterPro: IPR005175 This putative conserved domain is found in proteins that contain AT-hook motifs IPR000637 from INTERPRO, suggesting a DNA-binding function for the proteins as a whole, however, the function of this domain is unknown Back     alignment and domain information
Probab=99.95  E-value=1.3e-27  Score=195.29  Aligned_cols=113  Identities=31%  Similarity=0.388  Sum_probs=98.9

Q ss_pred             ceEEEEEecCCCcHHHHHHHHHHhCCceEEEEeeeceeeeEEEecCCC--CCCceeEeeceEEEEeeceeeeCCCCCCCC
Q 024112          154 FTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSS--SGGSVTYEGRFEILCLSGSYLLSGNGGSRN  231 (272)
Q Consensus       154 ~~phVIrV~~GEDV~~kI~~faqq~~~aicILSa~GsVSnVTLRqp~s--~~~tvt~eG~FEILSLsGt~~~~~~~~~~~  231 (272)
                      |++|++||++||||+++|++||+++++..|+++++|++++|+|++++.  .....+|+|+|||+||+|||...++    .
T Consensus         1 ~r~~~~rl~~Gedl~~~l~~~~~~~~i~~~~is~iGsl~~~~l~~~~~~~~~~~~~~~g~~Ei~sl~G~i~~~~g----~   76 (120)
T PF03479_consen    1 GRVFVIRLDPGEDLLESLEAFAREHGIRSGVISGIGSLSNVTLGYYDPPSYYEPLEFEGPFEIISLSGTISPEDG----K   76 (120)
T ss_dssp             EEEEEEEEETTSBHHHHHHHHHHHHT-SSEEEEEEEEEEEEEEEEEETTTEEEEEEEESEEEEEEEEEEEEEETT----E
T ss_pred             CcEEEEEECCCCHHHHHHHHHHHHCCCcEEEEEEEeEEeEEEEEEecccCCcceEEecccEEEEEeEEEEECCCC----C
Confidence            789999999999999999999999999888999999999999999843  3457899999999999999998433    2


Q ss_pred             CCCceEEEEeCCCCeEEeeecC-cceEeecceEEEEEEcc
Q 024112          232 RSGGLSVSLASPDGRVIGGGVG-GMLIAANNVQVTFRLCL  270 (272)
Q Consensus       232 ~~~hLhISLAg~dGqViGGhV~-G~LIAAtpVqVvv~sF~  270 (272)
                      ++.||||+|+|.||+|+||||. |.++++..|+|+..++.
T Consensus        77 ~~~HlHisl~~~~g~v~gGHl~~g~v~~t~Ev~i~~~~~~  116 (120)
T PF03479_consen   77 PFVHLHISLADPDGQVFGGHLLEGTVFATAEVVITELSGI  116 (120)
T ss_dssp             EEEEEEEEEE-TTSEEEEEEEEEEEEEEEEEEEEEEETTE
T ss_pred             CcceEEEEEECCCCeEEeeEeCCCEEeEEEEEEEEEecCc
Confidence            6679999999999999999999 88878888888877664



Overexpression of a protein containing this domain, Q9S7C9 from SWISSPROT, in Arabidopsis thaliana causes late flowering and modified leaf development []. ; PDB: 2DT4_A 2P6Y_A 3HWU_A 3HTN_A 2NMU_A 2H6L_A 2HX0_A.

>COG1661 Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Back     alignment and domain information
>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions Back     alignment and domain information
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions Back     alignment and domain information
>PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query272
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 4e-21
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 4e-21
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 2e-14
3htn_A149 Putative DNA binding protein; DUF269 family protei 2e-07
2h6l_A146 Hypothetical protein; NESG GR103, structural genom 5e-06
1qzv_F154 Plant photosystem I: subunit PSAF; photosynthesis, 2e-05
3hwu_A147 Putative DNA-binding protein; YP_299413.1, structu 1e-04
>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Length = 142 Back     alignment and structure
 Score = 85.8 bits (212), Expect = 4e-21
 Identities = 24/113 (21%), Positives = 43/113 (38%), Gaps = 10/113 (8%)

Query: 156 PHVITVAVGEDIAMKLLSF-SQQGPRAICVLSANGAISTATLRQPSSSGGSVTYEGRFEI 214
              + +  G D+  +++    Q    A  + S  G +ST  +R    S  ++     FEI
Sbjct: 4   LIALRLTRGMDLKQQIVQLVQQHRIHAGSIASCVGCLSTLHIRLA-DSVSTLQVSAPFEI 62

Query: 215 LCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFR 267
           L LSG+               L +++A   GRV GG +    +     ++   
Sbjct: 63  LSLSGTLTYQHC--------HLHIAVADAQGRVWGGHLLEGNLINTTAELMIH 107


>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Length = 154 Back     alignment and structure
>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Length = 143 Back     alignment and structure
>3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} Length = 149 Back     alignment and structure
>2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 Length = 146 Back     alignment and structure
>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant photosynthetic reaction center, peripheral antenna; HET: CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1 Length = 154 Back     alignment and structure
>3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} Length = 147 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query272
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 99.94
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 99.94
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 99.94
2h6l_A146 Hypothetical protein; NESG GR103, structural genom 99.93
3htn_A149 Putative DNA binding protein; DUF269 family protei 99.92
3hwu_A147 Putative DNA-binding protein; YP_299413.1, structu 99.88
2ezd_A26 High mobility group protein HMG-I/HMG-Y; DNA bindi 97.22
>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Back     alignment and structure
Probab=99.94  E-value=2.5e-26  Score=191.80  Aligned_cols=108  Identities=23%  Similarity=0.319  Sum_probs=97.0

Q ss_pred             ceEEEEEecCCCcHHHHHHHHHHhCCc-eEEEEeeeceeeeEEEecCCCCCCceeEeeceEEEEeeceeeeCCCCCCCCC
Q 024112          154 FTPHVITVAVGEDIAMKLLSFSQQGPR-AICVLSANGAISTATLRQPSSSGGSVTYEGRFEILCLSGSYLLSGNGGSRNR  232 (272)
Q Consensus       154 ~~phVIrV~~GEDV~~kI~~faqq~~~-aicILSa~GsVSnVTLRqp~s~~~tvt~eG~FEILSLsGt~~~~~~~~~~~~  232 (272)
                      |++|++||++||||.++|.+||+++++ +.||++++|++++++||+++.. .+++|+|+|||+||+|||.+. .      
T Consensus         2 ~r~~~lrL~~Gedl~~~i~~~~~~~~i~~a~v~~~iGsl~~~~l~~~~~~-~~~~~~g~~EIlsl~Gti~~~-~------   73 (142)
T 2p6y_A            2 IHLIALRLTRGMDLKQQIVQLVQQHRIHAGSIASCVGCLSTLHIRLADSV-STLQVSAPFEILSLSGTLTYQ-H------   73 (142)
T ss_dssp             CEEEEEEECTTCBHHHHHHHHHHHTTCSSEEEEEEEEEEEEEEEECTTSS-CEEEECSCEEEEEEEEEECSS-C------
T ss_pred             CcEEEEEECCCCcHHHHHHHHHHHhCCCEEEEEEeEEEEEeEEEECCCCC-ccEecCCcEEEEEeEEEEeCC-C------
Confidence            789999999999999999999999886 8999999999999999999964 478899999999999999985 1      


Q ss_pred             CCceEEEEeCCCCeEEeeecCcceEeecceEEEEEEcc
Q 024112          233 SGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCL  270 (272)
Q Consensus       233 ~~hLhISLAg~dGqViGGhV~G~LIAAtpVqVvv~sF~  270 (272)
                       .||||+++|+||+|+||||.+..+...++||++..|.
T Consensus        74 -~HlHisl~~~~G~v~GGHl~~g~~V~~t~Ev~i~~~~  110 (142)
T 2p6y_A           74 -CHLHIAVADAQGRVWGGHLLEGNLINTTAELMIHHYP  110 (142)
T ss_dssp             -EEEEEEEECTTSCEEEEEECTTCEECC-EEEEEEECT
T ss_pred             -CEEEEEEECCCCCEEccccCCCCeEEEEEEEEEEEcc
Confidence             5999999999999999999964555789999998764



>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Back     alignment and structure
>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Back     alignment and structure
>2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 Back     alignment and structure
>3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} SCOP: d.290.1.0 Back     alignment and structure
>3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} Back     alignment and structure
>2ezd_A High mobility group protein HMG-I/HMG-Y; DNA binding protein, minor groove DNA binding, transcriptional CO-activator, architectural factor; HET: DNA; NMR {Homo sapiens} SCOP: j.10.1.1 PDB: 2eze_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 272
d2hx0a1136 d.290.1.3 (A:6-141) Hypothetical protein STM3071 { 6e-18
d2h6la1138 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {A 2e-13
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Length = 136 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
 Score = 75.7 bits (186), Expect = 6e-18
 Identities = 22/119 (18%), Positives = 52/119 (43%), Gaps = 13/119 (10%)

Query: 147 SGSAGMGFTPHVITVAVGEDIAMKLLSFS-QQGPRAICVLSANGAISTATLRQPSSSGGS 205
           + S       + + +  G+++  +L +F  Q   RA  +    G+++   LR       +
Sbjct: 2   NASTA---RFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQ-EAT 57

Query: 206 VTYEGRFEILCLSGSYLLSGNGGSRNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQV 264
            +  G FE++ L+G+  L+G          L ++++ P G ++GG +         +++
Sbjct: 58  TSLTGTFEVISLNGTLELTGEH--------LHLAVSDPYGVMLGGHMMPGCTVRTTLEL 108


>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Length = 138 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query272
d2hx0a1136 Hypothetical protein STM3071 {Salmonella typhimuri 99.94
d2h6la1138 Hypothetical protein AF0104 {Archaeoglobus fulgidu 99.88
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
Probab=99.94  E-value=2.5e-26  Score=188.76  Aligned_cols=111  Identities=19%  Similarity=0.341  Sum_probs=102.5

Q ss_pred             CCCceEEEEEecCCCcHHHHHHHHHHhCCc-eEEEEeeeceeeeEEEecCCCCCCceeEeeceEEEEeeceeeeCCCCCC
Q 024112          151 GMGFTPHVITVAVGEDIAMKLLSFSQQGPR-AICVLSANGAISTATLRQPSSSGGSVTYEGRFEILCLSGSYLLSGNGGS  229 (272)
Q Consensus       151 g~~~~phVIrV~~GEDV~~kI~~faqq~~~-aicILSa~GsVSnVTLRqp~s~~~tvt~eG~FEILSLsGt~~~~~~~~~  229 (272)
                      ++..+.|++||++||||+++|.+||+++++ +.||++++|++++|+|++++. .....++|+|||+||+|||.+.+.   
T Consensus         3 ~~~~R~~~lrl~~Gedl~~~i~~~~~~~~I~~a~V~~~iGs~~~~~~~~~~~-~~~~~~~g~~Ei~sl~G~I~~~~~---   78 (136)
T d2hx0a1           3 ASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQ-EATTSLTGTFEVISLNGTLELTGE---   78 (136)
T ss_dssp             CCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTC-SSCEEEEEEEEEEEEEEEEETTEE---
T ss_pred             CCCCcEEEEEECCCChHHHHHHHHHHHhCCCEEEEEEEeeeeEEEEEEeCCC-CCcEEecCcEEEEEEEEEeccCCC---
Confidence            467899999999999999999999999885 899999999999999999985 456689999999999999998763   


Q ss_pred             CCCCCceEEEEeCCCCeEEeeecCcceEeecceEEEEEEcc
Q 024112          230 RNRSGGLSVSLASPDGRVIGGGVGGMLIAANNVQVTFRLCL  270 (272)
Q Consensus       230 ~~~~~hLhISLAg~dGqViGGhV~G~LIAAtpVqVvv~sF~  270 (272)
                           |||++|+|.||+|+||||.+.++++.++||++..|.
T Consensus        79 -----HlH~~~a~~~g~v~gGhL~~g~~v~~t~Eivi~~l~  114 (136)
T d2hx0a1          79 -----HLHLAVSDPYGVMLGGHMMPGCTVRTTLELVIGELP  114 (136)
T ss_dssp             -----EEEEEEECTTSCEEEEEECTTCEEEEEEEEEEEECT
T ss_pred             -----eEEEEEECCCCcEEeEEecCCcEEEEEEEEEEEEcc
Confidence                 999999999999999999988999999999999874



>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure