Citrus Sinensis ID: 022052


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300---
MDPREPPPQLHQHQHQHQQPNIMMGPTSYHTNAMMPPNAAAGAAARFSFNPLSSSQSQSQSQSESQSQLQPKQPLDSLPHGGVFDGSPSLRTGGGSFSIDPAKKKRGRPRKYTPDGNIALRLATTAQSPGSLADSGGGGGGAAGSASEPSAKRHRGRPPGSGKKQLDALGGVGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWFSCRNAHGSFSSAGNCGQFYCGGEKVELKFFEIWAFFSTNAPYVKLRCTYDHI
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEcccccHHHHHHHHHHHcccEEEEEcccccEEEEEEcccccccccEEEccEEEEEcccEEEcccccccccccccEEEEEEccccEEEEEEcEEEEEEEEEHHHHcccccEEEEEcccccc
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccHccccccccccccccccccccccccccccEEEEEcccHHHHHHHHHHHHccccEEEEEEEccEEEEEEEEcccccccEEEEEEEEEEEEccccccccccccccccccEEEEEEcccccEEEccEEEccccEEEEEEEEccccEEEEEEccccc
mdprepppqlhqhqhqhqqpnimmgptsyhtnammppnaaagaaarfsfnplsssqsqsqsqsesqsqlqpkqpldslphggvfdgspslrtgggsfsidpakkkrgrprkytpdgniALRLAttaqspgsladsggggggaagsasepsakrhrgrppgsgkkqldalggvggvgftpHVITVKAGEDISSKIFAfsqqgprtVCILSASGaicnvtlrqptmsggtvtYEKWWVecvpgrvrwpsfgwfscrnahgsfssagncgqfycggekvELKFFEIWAFfstnapyvklrctydhi
MDPREPPPQLHQHQHQHQQPNIMMGPTSYHTNAMMPPNAAAGAAARFSFNPLSSSQSQSQSQSESQSQLQPKQPLDSLPHGGVFDgspslrtgggsfsidpakkkrgrprkytpdgNIALRLATTAQSPGSLADSGGGGGGAagsasepsakrhrgrppgsgKKQLDALGGVGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWFSCRNAHGSFSSAGNCGQFYCGGEKVELKFFEIWAFfstnapyvkLRCTYDHI
MDPREpppqlhqhqhqhqqpNIMMGPTSYHTnammppnaaagaaaRFSFNplsssqsqsqsqsesqsqlqpkqpldslpHGGVFDGSPSLRTGGGSFSIDPAKKKRGRPRKYTPDGNIALRLATTAQspgsladsggggggaagsasepsaKRHRGRPPGSGKKQLDALGGVGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWFSCRNAHGSFSSAGNCGQFYCGGEKVELKFFEIWAFFSTNAPYVKLRCTYDHI
***********************************************************************************************************************************************************************ALGGVGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWFSCRNAHGSFSSAGNCGQFYCGGEKVELKFFEIWAFFSTNAPYVKLRCTY***
***************************************************************************************************************YT**********************************************************VGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVR**************SFSSAGNCGQFYCGGEKVELKFFEIWAFFSTNAPYVKLRCT****
*******************PNIMMGPTSYHTNAMMPPNAAAGAAARFSFN************************LDSLPHGGVFDGSPSLRTGGGSFSIDP*********KYTPDGNIALRLATTA************************************KKQLDALGGVGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWFSCRNAHGSFSSAGNCGQFYCGGEKVELKFFEIWAFFSTNAPYVKLRCTYDHI
*******************PNI**********************A*F*******************************************************************DGNIALRLAT********************************************LGGVGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWP***WFSCRNAHGSFSSAGNCGQFYCGGEKVELKFFEIWAFFSTNAPYVKLRCTYDHI
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDPREPPPQLHQHQHQHQQPNIMMGPTSYHTNAMMPPNAAAGAAARFSFNPLSSSQSQSQSQSESQSQLQPKQPLDSLPHGGVFDGSPSLRTGGGSFSIDPAKKKRGRPRKYTPDGNIALRLATTAQSPGSLADSGGGGGGAAGSASEPSAKRHRGRPPGSGKKQLDALGGVGGVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWFSCRNAHGSFSSAGNCGQFYCGGEKVELKFFEIWAFFSTNAPYVKLRCTYDHI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query303 2.2.26 [Sep-21-2011]
Q9S7C9311 Putative DNA-binding prot no no 0.260 0.254 0.345 0.0002
>sp|Q9S7C9|ESCA_ARATH Putative DNA-binding protein ESCAROLA OS=Arabidopsis thaliana GN=ESC PE=2 SV=1 Back     alignment and function desciption
 Score = 47.0 bits (110), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 2/81 (2%)

Query: 145 SASEPSAKRHRGRPPGSGKKQLDALGGVGGV--GFTPHVITVKAGEDISSKIFAFSQQGP 202
           ++S    KR RGRPPGS  K    +            HV+ V  G DI   +  ++++  
Sbjct: 79  TSSSAPGKRPRGRPPGSKNKAKPPIIVTRDSPNALRSHVLEVSPGADIVESVSTYARRRG 138

Query: 203 RTVCILSASGAICNVTLRQPT 223
           R V +L  +G + NVTLRQP 
Sbjct: 139 RGVSVLGGNGTVSNVTLRQPV 159





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query303
224126489375 predicted protein [Populus trichocarpa] 0.792 0.64 0.519 1e-57
255557601376 DNA binding protein, putative [Ricinus c 0.772 0.622 0.537 1e-53
225427270353 PREDICTED: uncharacterized protein LOC10 0.735 0.631 0.530 7e-53
224138096286 predicted protein [Populus trichocarpa] 0.584 0.618 0.644 6e-52
449462009362 PREDICTED: uncharacterized protein LOC10 0.689 0.577 0.505 6e-49
356512006288 PREDICTED: putative DNA-binding protein 0.564 0.593 0.609 2e-39
357513671247 hypothetical protein MTR_8g017860, parti 0.627 0.769 0.487 2e-39
297794575391 hypothetical protein ARALYDRAFT_494313 [ 0.841 0.652 0.458 3e-39
448872670362 putative AT-hook DNA-binding protein [El 0.574 0.480 0.552 1e-38
297800288 404 hypothetical protein ARALYDRAFT_914905 [ 0.597 0.448 0.502 5e-37
>gi|224126489|ref|XP_002329567.1| predicted protein [Populus trichocarpa] gi|222870276|gb|EEF07407.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 146/281 (51%), Positives = 175/281 (62%), Gaps = 41/281 (14%)

Query: 6   PPPQLHQHQHQHQQPNIMMGPTSYHTNA---------MMPPNAAAGAAARFSFNPLSSSQ 56
           PPPQL        QP++++ PTS + +          + P NAA G    F FN +S ++
Sbjct: 22  PPPQLQS------QPSMILVPTSSYPSTTSHLINNPNISPQNAALGGG--FPFNTMSGNR 73

Query: 57  SQSQSQSESQSQLQPKQPLDSLPHGGVFDGSPSLRTGGGSFSIDPAKKKRGRPRKYTPDG 116
            QS+ +                   G FDGS    + G  FSI+PAKKKRGRPRKYTPDG
Sbjct: 74  LQSKPE-------------------GAFDGSSPTSSSGMRFSIEPAKKKRGRPRKYTPDG 114

Query: 117 NIALRLATTAQSPG---SLADSGGGGGGAAGSASEPSAKRHRGRPPGSGKKQLDALGGVG 173
           NIAL L+ T    G     ADSGGGG     +ASE  +K++RGRPPGSGKKQLDALGGVG
Sbjct: 115 NIALGLSPTPVPSGISAGHADSGGGGV-THDAASEHPSKKNRGRPPGSGKKQLDALGGVG 173

Query: 174 GVGFTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYE- 232
           GVGFTPHVITVKAGEDI+SKI AFSQQGPRTVCILSA+GAICNVTLRQP MSGG+VTYE 
Sbjct: 174 GVGFTPHVITVKAGEDIASKIMAFSQQGPRTVCILSANGAICNVTLRQPAMSGGSVTYEG 233

Query: 233 KWWVECVPGRVRWPSFGWFSCRNAHGSFSSAGNCGQFYCGG 273
           ++ +  + G            R+   S S AG+ G+   GG
Sbjct: 234 RFEIISLSGSFLLSESNGSRSRSGGLSVSLAGSDGRVLGGG 274




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255557601|ref|XP_002519830.1| DNA binding protein, putative [Ricinus communis] gi|223540876|gb|EEF42434.1| DNA binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|225427270|ref|XP_002281340.1| PREDICTED: uncharacterized protein LOC100245362 [Vitis vinifera] gi|297742130|emb|CBI33917.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224138096|ref|XP_002326517.1| predicted protein [Populus trichocarpa] gi|222833839|gb|EEE72316.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449462009|ref|XP_004148734.1| PREDICTED: uncharacterized protein LOC101204243 [Cucumis sativus] gi|449511145|ref|XP_004163876.1| PREDICTED: uncharacterized LOC101204243 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356512006|ref|XP_003524712.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Glycine max] Back     alignment and taxonomy information
>gi|357513671|ref|XP_003627124.1| hypothetical protein MTR_8g017860, partial [Medicago truncatula] gi|355521146|gb|AET01600.1| hypothetical protein MTR_8g017860, partial [Medicago truncatula] Back     alignment and taxonomy information
>gi|297794575|ref|XP_002865172.1| hypothetical protein ARALYDRAFT_494313 [Arabidopsis lyrata subsp. lyrata] gi|297311007|gb|EFH41431.1| hypothetical protein ARALYDRAFT_494313 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|448872670|gb|AGE46020.1| putative AT-hook DNA-binding protein [Elaeis guineensis] Back     alignment and taxonomy information
>gi|297800288|ref|XP_002868028.1| hypothetical protein ARALYDRAFT_914905 [Arabidopsis lyrata subsp. lyrata] gi|297313864|gb|EFH44287.1| hypothetical protein ARALYDRAFT_914905 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query303
TAIR|locus:2178505386 AT5G46640 [Arabidopsis thalian 0.633 0.497 0.475 1.2e-35
TAIR|locus:2141045439 AT4G17950 [Arabidopsis thalian 0.541 0.373 0.471 1.3e-31
TAIR|locus:2051038351 AT2G33620 [Arabidopsis thalian 0.419 0.361 0.520 1.2e-28
TAIR|locus:2050766348 AT2G45850 [Arabidopsis thalian 0.386 0.336 0.474 5.2e-26
TAIR|locus:2126946318 AT4G00200 [Arabidopsis thalian 0.343 0.327 0.518 3e-24
TAIR|locus:2153142 419 AHL4 "AT-HOOK MOTIF NUCLEAR LO 0.320 0.231 0.5 4.9e-24
TAIR|locus:2098861354 AT3G61310 [Arabidopsis thalian 0.432 0.370 0.434 1.1e-23
TAIR|locus:2167988 404 AT5G62260 [Arabidopsis thalian 0.330 0.247 0.445 1.6e-23
TAIR|locus:2118091356 AHL1 "AT-hook motif nuclear-lo 0.554 0.471 0.384 2.9e-23
TAIR|locus:2031321378 AT1G63470 [Arabidopsis thalian 0.541 0.433 0.378 2.1e-22
TAIR|locus:2178505 AT5G46640 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 385 (140.6 bits), Expect = 1.2e-35, P = 1.2e-35
 Identities = 98/206 (47%), Positives = 115/206 (55%)

Query:    82 GVFDGSPSLRTGGGSFSIDP------AKKKRGRPRKYTPDGNIALRLATTAQXXXXXXXX 135
             G  DGSPS +     F ID        KKKRGRPRKYTPDG+IAL LA T+         
Sbjct:    77 GFGDGSPSSQPM--RFGIDDQNQQLQVKKKRGRPRKYTPDGSIALGLAPTSPLLSAASNS 134

Query:   136 XXXXXXXXXXXXXXXX----KRHRGRPPGSGKKQLDALGGVGGVGFTPHVITVKAGEDIS 191
                                 KR+RGRPPGS KKQLDALGG  GVGFTPHVI V  GEDI+
Sbjct:   135 YGEGGVGDSGGNGNSVDPPVKRNRGRPPGSSKKQLDALGGTSGVGFTPHVIEVNTGEDIA 194

Query:   192 SKIFAFSQQGPRTVCILSASGAICNVTLRQPTMSGGTVTYE-KWWVECVPGRVRWPSFGW 250
             SK+ AFS QG RT+CILSASGA+  V LRQ + S G VTYE ++ +  + G V       
Sbjct:   195 SKVMAFSDQGSRTICILSASGAVSRVMLRQASHSSGIVTYEGRFEIITLSGSVLNYEVNG 254

Query:   251 FSCRNAHGSFSSAGNCGQFYCGGEKV 276
              + R+ + S + AG  G    GG  V
Sbjct:   255 STNRSGNLSVALAGPDGGIV-GGSVV 279




GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
TAIR|locus:2141045 AT4G17950 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051038 AT2G33620 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2050766 AT2G45850 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2126946 AT4G00200 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2153142 AHL4 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2098861 AT3G61310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2167988 AT5G62260 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2118091 AHL1 "AT-hook motif nuclear-localized protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031321 AT1G63470 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pm.C_1180010
hypothetical protein (375 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query303
pfam03479120 pfam03479, DUF296, Domain of unknown function (DUF 5e-13
cd11378113 cd11378, DUF296, Domain of unknown function found 6e-10
>gnl|CDD|217587 pfam03479, DUF296, Domain of unknown function (DUF296) Back     alignment and domain information
 Score = 64.1 bits (157), Expect = 5e-13
 Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 3/59 (5%)

Query: 177 FTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQP---TMSGGTVTYE 232
             PHV+ ++ GED+   + AF++Q      +LS  GA+ NVTLRQP     S G VT E
Sbjct: 1   GRPHVLRLEPGEDLVESLEAFARQRGIGAAVLSGIGAVSNVTLRQPDEEAKSYGVVTLE 59


This putative domain is found in proteins that contain AT-hook motifs pfam02178, which strongly suggests a DNA-binding function for the proteins as a whole. There are three highly conserved histidine residues, eg at 117, 119 and 133 in Reut_B5223, which should be a structurally conserved metal-binding unit, based on structural comparison with known metal-binding structures. The proteins should work as trimers. Length = 120

>gnl|CDD|211390 cd11378, DUF296, Domain of unknown function found in archaea, bacteria, and plants Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 303
PF03479120 DUF296: Domain of unknown function (DUF296); Inter 99.93
COG1661141 Predicted DNA-binding protein with PD1-like DNA-bi 99.87
PF0217813 AT_hook: AT hook motif; InterPro: IPR017956 AT hoo 96.5
smart0038426 AT_hook DNA binding domain with preference for A/T 96.47
PF14621219 RFX5_DNA_bdg: RFX5 DNA-binding domain 87.78
>PF03479 DUF296: Domain of unknown function (DUF296); InterPro: IPR005175 This putative conserved domain is found in proteins that contain AT-hook motifs IPR000637 from INTERPRO, suggesting a DNA-binding function for the proteins as a whole, however, the function of this domain is unknown Back     alignment and domain information
Probab=99.93  E-value=2.9e-26  Score=189.97  Aligned_cols=115  Identities=20%  Similarity=0.140  Sum_probs=100.5

Q ss_pred             ceEEEEEecCCCcHHHHHHHHHhhCCceEEEEeecceeeeEEEcCCC--CCCCeeeEEEEEEEecccceecCCCCCCccC
Q 022052          177 FTPHVITVKAGEDISSKIFAFSQQGPRTVCILSASGAICNVTLRQPT--MSGGTVTYEKWWVECVPGRVRWPSFGWFSCR  254 (303)
Q Consensus       177 ftpHVIrV~~GEDV~~kI~~faqq~~raicILSa~GaVSnVTLRqp~--ssg~tvt~eG~FEILSLsGTi~p~~g~~~~~  254 (303)
                      ||+|++||++||||.++|.+||+++++..|+|||+|+|++|+|++++  ......+|+|+|||+||+||+...++   .+
T Consensus         1 ~r~~~~rl~~Gedl~~~l~~~~~~~~i~~~~is~iGsl~~~~l~~~~~~~~~~~~~~~g~~Ei~sl~G~i~~~~g---~~   77 (120)
T PF03479_consen    1 GRVFVIRLDPGEDLLESLEAFAREHGIRSGVISGIGSLSNVTLGYYDPPSYYEPLEFEGPFEIISLSGTISPEDG---KP   77 (120)
T ss_dssp             EEEEEEEEETTSBHHHHHHHHHHHHT-SSEEEEEEEEEEEEEEEEEETTTEEEEEEEESEEEEEEEEEEEEEETT---EE
T ss_pred             CcEEEEEECCCCHHHHHHHHHHHHCCCcEEEEEEEeEEeEEEEEEecccCCcceEEecccEEEEEeEEEEECCCC---CC
Confidence            68999999999999999999999999988999999999999999984  34458899999999999999997544   45


Q ss_pred             CCeeEEEEeCCCCceeeceeE---EeeeEEEEeeecCCCCceee
Q 022052          255 NAHGSFSSAGNCGQFYCGGEK---VELKFFEIWAFFSTNAPYVK  295 (303)
Q Consensus       255 ~~HlHISLAg~~GqViGGglv---v~~tv~vI~a~f~~n~~f~R  295 (303)
                      +.|+||+|+|.+|+|+|||++   +..++|+++-.+ ....|+|
T Consensus        78 ~~HlHisl~~~~g~v~gGHl~~g~v~~t~Ev~i~~~-~~~~~~~  120 (120)
T PF03479_consen   78 FVHLHISLADPDGQVFGGHLLEGTVFATAEVVITEL-SGINFTR  120 (120)
T ss_dssp             EEEEEEEEE-TTSEEEEEEEEEEEEEEEEEEEEEEE-TTEEEEE
T ss_pred             cceEEEEEECCCCeEEeeEeCCCEEeEEEEEEEEEe-cCccccC
Confidence            799999999999999999997   345688888888 7888877



Overexpression of a protein containing this domain, Q9S7C9 from SWISSPROT, in Arabidopsis thaliana causes late flowering and modified leaf development []. ; PDB: 2DT4_A 2P6Y_A 3HWU_A 3HTN_A 2NMU_A 2H6L_A 2HX0_A.

>COG1661 Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Back     alignment and domain information
>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions Back     alignment and domain information
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions Back     alignment and domain information
>PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query303
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 99.94
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 99.94
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 99.94
2h6l_A146 Hypothetical protein; NESG GR103, structural genom 99.94
3htn_A149 Putative DNA binding protein; DUF269 family protei 99.93
3hwu_A147 Putative DNA-binding protein; YP_299413.1, structu 99.88
2ezd_A26 High mobility group protein HMG-I/HMG-Y; DNA bindi 96.58
>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Back     alignment and structure
Probab=99.94  E-value=7.7e-27  Score=197.65  Aligned_cols=115  Identities=12%  Similarity=0.080  Sum_probs=104.3

Q ss_pred             ceEEEEEecCCCcHHHHHHHHHhhCCc-eEEEEeecceeeeEEEcCCCCCCCeeeEEEEEEEecccceecCCCCCCccCC
Q 022052          177 FTPHVITVKAGEDISSKIFAFSQQGPR-TVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWFSCRN  255 (303)
Q Consensus       177 ftpHVIrV~~GEDV~~kI~~faqq~~r-aicILSa~GaVSnVTLRqp~ssg~tvt~eG~FEILSLsGTi~p~~g~~~~~~  255 (303)
                      |++|++||++||||.++|.+||+++++ +.||++++|++++++||+++. ..+++|+|+|||+||+||+.++.       
T Consensus         2 ~r~~~lrL~~Gedl~~~i~~~~~~~~i~~a~v~~~iGsl~~~~l~~~~~-~~~~~~~g~~EIlsl~Gti~~~~-------   73 (142)
T 2p6y_A            2 IHLIALRLTRGMDLKQQIVQLVQQHRIHAGSIASCVGCLSTLHIRLADS-VSTLQVSAPFEILSLSGTLTYQH-------   73 (142)
T ss_dssp             CEEEEEEECTTCBHHHHHHHHHHHTTCSSEEEEEEEEEEEEEEEECTTS-SCEEEECSCEEEEEEEEEECSSC-------
T ss_pred             CcEEEEEECCCCcHHHHHHHHHHHhCCCEEEEEEeEEEEEeEEEECCCC-CccEecCCcEEEEEeEEEEeCCC-------
Confidence            789999999999999999999999885 799999999999999999995 44789999999999999999862       


Q ss_pred             CeeEEEEeCCCCceeeceeE----EeeeEEEEeeecCCCCceeeccCCC
Q 022052          256 AHGSFSSAGNCGQFYCGGEK----VELKFFEIWAFFSTNAPYVKLRCTY  300 (303)
Q Consensus       256 ~HlHISLAg~~GqViGGglv----v~~tv~vI~a~f~~n~~f~Rlp~~~  300 (303)
                      .|+||+++|.+|+|+|||++    |..++|+++..| .++.|+|++|++
T Consensus        74 ~HlHisl~~~~G~v~GGHl~~g~~V~~t~Ev~i~~~-~~~~~~R~~D~e  121 (142)
T 2p6y_A           74 CHLHIAVADAQGRVWGGHLLEGNLINTTAELMIHHY-PQHHFTREFDPN  121 (142)
T ss_dssp             EEEEEEEECTTSCEEEEEECTTCEECC-EEEEEEEC-TTEEEEEEEETT
T ss_pred             CEEEEEEECCCCCEEccccCCCCeEEEEEEEEEEEc-cCCeEEEeeCCC
Confidence            89999999999999999875    456799999999 898999999974



>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Back     alignment and structure
>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Back     alignment and structure
>2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 Back     alignment and structure
>3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} SCOP: d.290.1.0 Back     alignment and structure
>3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} Back     alignment and structure
>2ezd_A High mobility group protein HMG-I/HMG-Y; DNA binding protein, minor groove DNA binding, transcriptional CO-activator, architectural factor; HET: DNA; NMR {Homo sapiens} SCOP: j.10.1.1 PDB: 2eze_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 303
d2hx0a1136 d.290.1.3 (A:6-141) Hypothetical protein STM3071 { 1e-06
d2h6la1138 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {A 2e-05
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Length = 136 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
 Score = 44.9 bits (106), Expect = 1e-06
 Identities = 12/57 (21%), Positives = 26/57 (45%), Gaps = 2/57 (3%)

Query: 180 HVITVKAGEDISSKIFAFS-QQGPRTVCILSASGAICNVTLRQPTMSGGTVTYEKWW 235
           + + +  G+++ S++ AF  Q   R   I   +G++ +V LR       T +    +
Sbjct: 9   YALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAG-QEATTSLTGTF 64


>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Length = 138 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query303
d2hx0a1136 Hypothetical protein STM3071 {Salmonella typhimuri 99.93
d2h6la1138 Hypothetical protein AF0104 {Archaeoglobus fulgidu 99.89
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
Probab=99.93  E-value=3.9e-26  Score=190.28  Aligned_cols=119  Identities=15%  Similarity=0.175  Sum_probs=108.9

Q ss_pred             CCCCceEEEEEecCCCcHHHHHHHHHhhCC-ceEEEEeecceeeeEEEcCCCCCCCeeeEEEEEEEecccceecCCCCCC
Q 022052          173 GGVGFTPHVITVKAGEDISSKIFAFSQQGP-RTVCILSASGAICNVTLRQPTMSGGTVTYEKWWVECVPGRVRWPSFGWF  251 (303)
Q Consensus       173 ~g~~ftpHVIrV~~GEDV~~kI~~faqq~~-raicILSa~GaVSnVTLRqp~ssg~tvt~eG~FEILSLsGTi~p~~g~~  251 (303)
                      +++..|.|++||++||||+++|.+||++++ ++.||++++|++++|+|++++ ......++|+|||+||+||+...+   
T Consensus         2 ~~~~~R~~~lrl~~Gedl~~~i~~~~~~~~I~~a~V~~~iGs~~~~~~~~~~-~~~~~~~~g~~Ei~sl~G~I~~~~---   77 (136)
T d2hx0a1           2 NASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAG-QEATTSLTGTFEVISLNGTLELTG---   77 (136)
T ss_dssp             SCCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTT-CSSCEEEEEEEEEEEEEEEEETTE---
T ss_pred             CCCCCcEEEEEECCCChHHHHHHHHHHHhCCCEEEEEEEeeeeEEEEEEeCC-CCCcEEecCcEEEEEEEEEeccCC---
Confidence            456789999999999999999999998887 689999999999999999998 456778999999999999998875   


Q ss_pred             ccCCCeeEEEEeCCCCceeecee----EEeeeEEEEeeecCCCCceeeccCCC
Q 022052          252 SCRNAHGSFSSAGNCGQFYCGGE----KVELKFFEIWAFFSTNAPYVKLRCTY  300 (303)
Q Consensus       252 ~~~~~HlHISLAg~~GqViGGgl----vv~~tv~vI~a~f~~n~~f~Rlp~~~  300 (303)
                          .|+|++++|.+|+|+|||+    ++.+++|+++..| .+..|+|.+|+.
T Consensus        78 ----~HlH~~~a~~~g~v~gGhL~~g~~v~~t~Eivi~~l-~~~~~~R~~D~~  125 (136)
T d2hx0a1          78 ----EHLHLAVSDPYGVMLGGHMMPGCTVRTTLELVIGEL-PALTFSRQPCAI  125 (136)
T ss_dssp             ----EEEEEEEECTTSCEEEEEECTTCEEEEEEEEEEEEC-TTEEEEEEECTT
T ss_pred             ----CeEEEEEECCCCcEEeEEecCCcEEEEEEEEEEEEc-cCCceEEccCCC
Confidence                7999999999999999986    5677899999999 899999999875



>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure