Citrus Sinensis ID: 026716


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230----
MDGREAMVASGGVSGPYYIQQHRGAFSGSHSATQSGLMQTPPGFRSSSNLNIPTQPNVGSTFAVEPKHVNFGHNMSVPSGVPLSDPVKKKRGRPRKYAPDGQVSLGLSPLPARPKRSPASDSQALKRSRGRPPGTGRKQQLATLGEWMNSSAGIAFAPHVISIGVGERPRVVCILSGRGTVSSVTLRQPATSVPTVTYEVEALCLYLFLFWKKKIHQKNFHFCTLPFLCLLLVP
cccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHccccccccccccccccccccccccccccccccEEEEEccccccEEEEEEEcccEEEEEEEccccccccEEEEEEEEEEEEEEEEEcccccccccEEEEEcHHcccccc
cccccccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHccccccccccccccccccccccccccccccccccccHcccccccccccccccHHHHccccccccccccccEEEEEcccccccEEEEEEcccEEEEEEEEcccccccEEEEccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
MDGREAMVasggvsgpyyiqqhrgafsgshsatqsglmqtppgfrsssnlniptqpnvgstfavepkhvnfghnmsvpsgvplsdpvkkkrgrprkyapdgqvslglsplparpkrspasdsqalkrsrgrppgtgrKQQLATLGEwmnssagiafaphvisigvgerpRVVCILsgrgtvssvtlrqpatsvptvtYEVEALCLYLFLFWKkkihqknfhfctlpflclllvp
MDGREAMVASGGVSGPYYIQQHRGAFSGSHSATQSGLMQTPPGFRSSSNLNIPTQPNVGSTFAVEPKHVNFghnmsvpsgvplsdpvKKKRgrprkyapdgqvslglsplparpkrspasdsqalkrsrgrppgtgRKQQLATLGEWMNSSAGIAFAPHVISIGVGERPRVVCILSgrgtvssvtlrqpatsvptvtYEVEALCLYLFLFWKKKIHQKNFHFCTLPFLCLLLVP
MDGREAMVASGGVSGPYYIQQHRGAFSGSHSATQSGLMQTPPGFRSSSNLNIPTQPNVGSTFAVEPKHVNFGHNMSVPSGVPLSDPVKKKRGRPRKYAPDGQVSLGLSPLPARPKRSPASDSQALKRSRGRPPGTGRKQQLATLGEWMNSSAGIAFAPHVISIGVGERPRVVCILSGRGTVSSVTLRQPATSVPTVTYEVEALCLYLFLFWKKKIHQKNFHFCTLPFLCLLLVP
**********************************************************************************************************************************************TLGEWMNSSAGIAFAPHVISIGVGERPRVVCILSGRGTVSSVTLRQPATSVPTVTYEVEALCLYLFLFWKKKIHQKNFHFCTLPFLCLLLV*
*************************************************************************************************AP**************************************************SSAGIAFAPHVISIGVGERPRVVCILSGRGTVSSVTLRQPATSVPTVTYEVEALCLYLFLF*********FHFCTLPFLCLLLVP
*********SGGVSGPYYIQQHRGA*************QTPPGFRSSSNLNIPTQPNVGSTFAVEPKHVNFGHNMSVPSGVPLSD************APDGQVSLGLSPL****************************QQLATLGEWMNSSAGIAFAPHVISIGVGERPRVVCILSGRGTVSSVTLRQPATSVPTVTYEVEALCLYLFLFWKKKIHQKNFHFCTLPFLCLLLVP
*****************Y**QHRGAFSGSHSATQSGLMQTPPGFRSSSNLNIPTQPNVGSTF***************************************QVSLGL************************************LGEWMNSSAGIAFAPHVISIGVGERPRVVCILSGRGTVSSVTLRQPATSVPTVTYEVEALCLYLFLFWKKKIHQKNFHFCTLPFLCLLLVP
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDGREAMVASGGVSGPYYIQQHRGAFSGSHSATQSGLMQTPPGFRSSSNLNIPTQPNVGSTFAVEPKHVNFGHNMSVPSGVPLSDPVKKKRGRPRKYAPDGQVSLGLSPLPARPKRSPASDSQALKRSRGRPPGTGRKQQLATLGEWMNSSAGIAFAPHVISIGVGERPRVVCILSGRGTVSSVTLRQPATSVPTVTYEVEALCLYLFLFWKKKIHQKNFHFCTLPFLCLLLVP
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query234
255537455 347 DNA binding protein, putative [Ricinus c 0.867 0.585 0.588 3e-59
225426407 346 PREDICTED: uncharacterized protein LOC10 0.858 0.580 0.606 8e-59
224053919 343 predicted protein [Populus trichocarpa] 0.846 0.577 0.594 3e-53
224074919 346 predicted protein [Populus trichocarpa] 0.867 0.586 0.575 6e-53
356540448 352 PREDICTED: uncharacterized protein LOC10 0.841 0.559 0.549 2e-50
297742528309 unnamed protein product [Vitis vinifera] 0.735 0.556 0.556 1e-48
356528260 352 PREDICTED: uncharacterized protein LOC10 0.833 0.553 0.5 6e-47
356513399 352 PREDICTED: uncharacterized protein LOC10 0.833 0.553 0.5 1e-46
356497236 357 PREDICTED: uncharacterized protein LOC10 0.841 0.551 0.510 1e-44
449452330 343 PREDICTED: uncharacterized protein LOC10 0.867 0.591 0.539 7e-44
>gi|255537455|ref|XP_002509794.1| DNA binding protein, putative [Ricinus communis] gi|223549693|gb|EEF51181.1| DNA binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  233 bits (595), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 133/226 (58%), Positives = 157/226 (69%), Gaps = 23/226 (10%)

Query: 1   MDGREAMVASGGVSGPYYIQQHRGAFSGSHSATQSGLMQTPPGFRSSSNLNIPTQPNV-- 58
           MDGREAM  + G S PYYI +  G       +       +PPGFR  +N N+    N   
Sbjct: 1   MDGREAMALASG-STPYYIHRGGGVGGSGSGSQAG-GFHSPPGFRPLANPNLLAHSNTRP 58

Query: 59  ---GSTFAVEPKHVNFGH--NMSVPSGVPLSDPVKKKRGRPRKYAPDGQVSLGLSPLPAR 113
              GS+F++EP ++NF H  N++VPSG+P+ +PVKKKRGRPRKYAPDGQVSLGLSPLP +
Sbjct: 59  GSSGSSFSIEPSNINFVHGMNVAVPSGLPVGEPVKKKRGRPRKYAPDGQVSLGLSPLPVK 118

Query: 114 PKRSPASDSQALKRSRGRPPGTGRKQQLATLGEWMNSSAGIAFAPHVISIGVGE------ 167
           PK S   D  + KR+RGRPPGTGRKQQLA LGEWMNSSAGIAF+PHVI IGVGE      
Sbjct: 119 PKPSSGQDPLSPKRARGRPPGTGRKQQLALLGEWMNSSAGIAFSPHVIRIGVGEDIVAKV 178

Query: 168 ------RPRVVCILSGRGTVSSVTLRQPATSVPTVTYE--VEALCL 205
                 RPR +CILSG GTVSSVTLRQPA+S PT+T+E   E LCL
Sbjct: 179 LSFAQQRPRALCILSGTGTVSSVTLRQPASSGPTLTFEGRFEILCL 224




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225426407|ref|XP_002273061.1| PREDICTED: uncharacterized protein LOC100249560 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224053919|ref|XP_002298038.1| predicted protein [Populus trichocarpa] gi|222845296|gb|EEE82843.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224074919|ref|XP_002304491.1| predicted protein [Populus trichocarpa] gi|222841923|gb|EEE79470.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356540448|ref|XP_003538701.1| PREDICTED: uncharacterized protein LOC100790569 [Glycine max] Back     alignment and taxonomy information
>gi|297742528|emb|CBI34677.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356528260|ref|XP_003532722.1| PREDICTED: uncharacterized protein LOC100813888 [Glycine max] Back     alignment and taxonomy information
>gi|356513399|ref|XP_003525401.1| PREDICTED: uncharacterized protein LOC100798706 [Glycine max] Back     alignment and taxonomy information
>gi|356497236|ref|XP_003517468.1| PREDICTED: uncharacterized protein LOC100795781 [Glycine max] Back     alignment and taxonomy information
>gi|449452330|ref|XP_004143912.1| PREDICTED: uncharacterized protein LOC101219973 [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query234
TAIR|locus:2031321 378 AT1G63470 [Arabidopsis thalian 0.504 0.312 0.637 1.6e-49
TAIR|locus:2031306 361 AT1G63480 [Arabidopsis thalian 0.799 0.518 0.475 4.4e-36
TAIR|locus:2050766348 AT2G45850 [Arabidopsis thalian 0.803 0.540 0.416 2.3e-30
TAIR|locus:2098861354 AT3G61310 [Arabidopsis thalian 0.807 0.533 0.410 7.3e-27
TAIR|locus:2051038351 AT2G33620 [Arabidopsis thalian 0.474 0.316 0.480 7.8e-23
TAIR|locus:2122684 404 AHL3 "AT-HOOK MOTIF NUCLEAR LO 0.688 0.398 0.278 4.7e-15
TAIR|locus:2141045 439 AT4G17950 [Arabidopsis thalian 0.329 0.175 0.442 1.5e-14
TAIR|locus:2153142 419 AHL4 "AT-HOOK MOTIF NUCLEAR LO 0.410 0.229 0.411 2.1e-14
TAIR|locus:2178505 386 AT5G46640 [Arabidopsis thalian 0.316 0.191 0.448 2e-12
TAIR|locus:2084983 411 AT3G04590 [Arabidopsis thalian 0.559 0.318 0.347 8.4e-11
TAIR|locus:2031321 AT1G63470 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 412 (150.1 bits), Expect = 1.6e-49, Sum P(2) = 1.6e-49
 Identities = 86/135 (63%), Positives = 101/135 (74%)

Query:    82 PLSDP-VKKKRGRPRKYAPDGQVSLGLSPLPARPKRSP----ASDSQALKRSRGRPPGTG 136
             P   P VKKKRGRPRKY PDGQVSLGLSP+P   K+S      SD  A KR+RGRPPGTG
Sbjct:    98 PSEQPMVKKKRGRPRKYVPDGQVSLGLSPMPCVSKKSKDSSSMSDPNAPKRARGRPPGTG 157

Query:   137 RKQQLATLGEWMNSSAGIAFAPHVISIGVGE------------RPRVVCILSGRGTVSSV 184
             RKQ+LA LGEWMN+SAG+AFAPHVIS+G GE            RPR +CI+SG GTVSSV
Sbjct:   158 RKQRLANLGEWMNTSAGLAFAPHVISVGSGEDIVSKVLSFSQKRPRALCIMSGTGTVSSV 217

Query:   185 TLRQPATSVPTVTYE 199
             TLR+PA++ P++T+E
Sbjct:   218 TLREPASTTPSLTFE 232


GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0000226 "microtubule cytoskeleton organization" evidence=RCA
GO:0000280 "nuclear division" evidence=RCA
GO:0000911 "cytokinesis by cell plate formation" evidence=RCA
GO:0006260 "DNA replication" evidence=RCA
GO:0006306 "DNA methylation" evidence=RCA
GO:0006342 "chromatin silencing" evidence=RCA
GO:0006346 "methylation-dependent chromatin silencing" evidence=RCA
GO:0007000 "nucleolus organization" evidence=RCA
GO:0007267 "cell-cell signaling" evidence=RCA
GO:0008283 "cell proliferation" evidence=RCA
GO:0009616 "virus induced gene silencing" evidence=RCA
GO:0009909 "regulation of flower development" evidence=RCA
GO:0010267 "production of ta-siRNAs involved in RNA interference" evidence=RCA
GO:0016246 "RNA interference" evidence=RCA
GO:0016458 "gene silencing" evidence=RCA
GO:0016570 "histone modification" evidence=RCA
GO:0016571 "histone methylation" evidence=RCA
GO:0016572 "histone phosphorylation" evidence=RCA
GO:0031047 "gene silencing by RNA" evidence=RCA
GO:0031048 "chromatin silencing by small RNA" evidence=RCA
GO:0034968 "histone lysine methylation" evidence=RCA
GO:0035196 "production of miRNAs involved in gene silencing by miRNA" evidence=RCA
GO:0048449 "floral organ formation" evidence=RCA
GO:0051225 "spindle assembly" evidence=RCA
GO:0051567 "histone H3-K9 methylation" evidence=RCA
TAIR|locus:2031306 AT1G63480 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2050766 AT2G45850 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2098861 AT3G61310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051038 AT2G33620 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2122684 AHL3 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2141045 AT4G17950 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2153142 AHL4 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2178505 AT5G46640 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2084983 AT3G04590 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.I.8978.1
hypothetical protein (349 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query234
pfam03479120 pfam03479, DUF296, Domain of unknown function (DUF 6e-04
>gnl|CDD|217587 pfam03479, DUF296, Domain of unknown function (DUF296) Back     alignment and domain information
 Score = 38.3 bits (90), Expect = 6e-04
 Identities = 23/59 (38%), Positives = 27/59 (45%), Gaps = 15/59 (25%)

Query: 156 FAPHVISIGVGE------------RPRVVCILSGRGTVSSVTLRQP---ATSVPTVTYE 199
             PHV+ +  GE            R     +LSG G VS+VTLRQP   A S   VT E
Sbjct: 1   GRPHVLRLEPGEDLVESLEAFARQRGIGAAVLSGIGAVSNVTLRQPDEEAKSYGVVTLE 59


This putative domain is found in proteins that contain AT-hook motifs pfam02178, which strongly suggests a DNA-binding function for the proteins as a whole. There are three highly conserved histidine residues, eg at 117, 119 and 133 in Reut_B5223, which should be a structurally conserved metal-binding unit, based on structural comparison with known metal-binding structures. The proteins should work as trimers. Length = 120

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 234
PF03479120 DUF296: Domain of unknown function (DUF296); Inter 98.57
PF0217813 AT_hook: AT hook motif; InterPro: IPR017956 AT hoo 96.56
smart0038426 AT_hook DNA binding domain with preference for A/T 96.51
COG1661141 Predicted DNA-binding protein with PD1-like DNA-bi 90.35
PF14621219 RFX5_DNA_bdg: RFX5 DNA-binding domain 86.33
PF13546273 DDE_5: DDE superfamily endonuclease 82.9
>PF03479 DUF296: Domain of unknown function (DUF296); InterPro: IPR005175 This putative conserved domain is found in proteins that contain AT-hook motifs IPR000637 from INTERPRO, suggesting a DNA-binding function for the proteins as a whole, however, the function of this domain is unknown Back     alignment and domain information
Probab=98.57  E-value=3.8e-08  Score=78.34  Aligned_cols=56  Identities=25%  Similarity=0.218  Sum_probs=47.6

Q ss_pred             ceeEEEEEcCCC------------CceeEEEecCCCceeEEEEcCCC--CCCCceEEE--EEeEeeeccccc
Q 026716          156 FAPHVISIGVGE------------RPRVVCILSGRGTVSSVTLRQPA--TSVPTVTYE--VEALCLYLFLFW  211 (234)
Q Consensus       156 ftPHVItV~~GE------------g~raICILSA~G~VSnVTLRQP~--ssggtvTyE--FEILSLSGSflp  211 (234)
                      |+++++.+.+||            .....|++|+.|++++|+|++++  ....+.+||  |||+||+|++.+
T Consensus         1 ~r~~~~rl~~Gedl~~~l~~~~~~~~i~~~~is~iGsl~~~~l~~~~~~~~~~~~~~~g~~Ei~sl~G~i~~   72 (120)
T PF03479_consen    1 GRVFVIRLDPGEDLLESLEAFAREHGIRSGVISGIGSLSNVTLGYYDPPSYYEPLEFEGPFEIISLSGTISP   72 (120)
T ss_dssp             EEEEEEEEETTSBHHHHHHHHHHHHT-SSEEEEEEEEEEEEEEEEEETTTEEEEEEEESEEEEEEEEEEEEE
T ss_pred             CcEEEEEECCCCHHHHHHHHHHHHCCCcEEEEEEEeEEeEEEEEEecccCCcceEEecccEEEEEeEEEEEC
Confidence            789999999999            34667888999999999999974  334567788  999999999997



Overexpression of a protein containing this domain, Q9S7C9 from SWISSPROT, in Arabidopsis thaliana causes late flowering and modified leaf development []. ; PDB: 2DT4_A 2P6Y_A 3HWU_A 3HTN_A 2NMU_A 2H6L_A 2HX0_A.

>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions Back     alignment and domain information
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions Back     alignment and domain information
>COG1661 Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Back     alignment and domain information
>PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain Back     alignment and domain information
>PF13546 DDE_5: DDE superfamily endonuclease Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query234
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 98.87
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 98.84
2h6l_A146 Hypothetical protein; NESG GR103, structural genom 98.42
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 98.41
3htn_A149 Putative DNA binding protein; DUF269 family protei 97.99
2ezd_A26 High mobility group protein HMG-I/HMG-Y; DNA bindi 96.75
3hwu_A147 Putative DNA-binding protein; YP_299413.1, structu 95.77
4i95_A142 Putative uncharacterized protein; lipocalin-like d 83.96
>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Back     alignment and structure
Probab=98.87  E-value=3.4e-09  Score=86.84  Aligned_cols=61  Identities=18%  Similarity=0.175  Sum_probs=54.5

Q ss_pred             CCCCCceeEEEEEcCCC-------------CceeEEEecCCCceeEEEEcCCCCCCCceEEE--EEeEeeecccccc
Q 026716          151 SAGIAFAPHVISIGVGE-------------RPRVVCILSGRGTVSSVTLRQPATSVPTVTYE--VEALCLYLFLFWK  212 (234)
Q Consensus       151 s~g~~ftPHVItV~~GE-------------g~raICILSA~G~VSnVTLRQP~ssggtvTyE--FEILSLSGSflp~  212 (234)
                      +....+++|+|++.+||             +-++.|||++.|++++|+||+++... +++|+  ||||||+|++.+.
T Consensus        13 ~~~~~~r~~vlrL~~Gedl~~~i~~~~~~~gi~~a~v~s~iGsl~~~~l~~~~~~~-~~~~~g~~EIlsl~Gti~~~   88 (154)
T 2hx0_A           13 HNASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTLELT   88 (154)
T ss_dssp             CSCCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTCSS-CEEEEEEEEEEEEEEEEETT
T ss_pred             CCCCCccEEEEEECCCCcHHHHHHHHHHHhCCCEEEEEEeEeeeEEeEEEccCCCc-cEecCCcEEEEEeeeeEeCC
Confidence            55678999999999999             67789999999999999999998765 67888  9999999999873



>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Back     alignment and structure
>2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 Back     alignment and structure
>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Back     alignment and structure
>3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} SCOP: d.290.1.0 Back     alignment and structure
>2ezd_A High mobility group protein HMG-I/HMG-Y; DNA binding protein, minor groove DNA binding, transcriptional CO-activator, architectural factor; HET: DNA; NMR {Homo sapiens} SCOP: j.10.1.1 PDB: 2eze_A* Back     alignment and structure
>3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} Back     alignment and structure
>4i95_A Putative uncharacterized protein; lipocalin-like domain of PF13924 family, structural genomics center for structural genomics, JCSG; HET: MSE; 1.81A {Bacteroides eggerthii} PDB: 4gzv_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query234
d2hx0a1136 Hypothetical protein STM3071 {Salmonella typhimuri 98.08
d2h6la1138 Hypothetical protein AF0104 {Archaeoglobus fulgidu 97.52
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
Probab=98.08  E-value=2.9e-06  Score=66.20  Aligned_cols=60  Identities=18%  Similarity=0.175  Sum_probs=52.0

Q ss_pred             CCCceeEEEEEcCCC-------------CceeEEEecCCCceeEEEEcCCCCCCCceEEE--EEeEeeeccccccC
Q 026716          153 GIAFAPHVISIGVGE-------------RPRVVCILSGRGTVSSVTLRQPATSVPTVTYE--VEALCLYLFLFWKK  213 (234)
Q Consensus       153 g~~ftPHVItV~~GE-------------g~raICILSA~G~VSnVTLRQP~ssggtvTyE--FEILSLSGSflp~~  213 (234)
                      ++..+-+++.+.+||             +-++.||+++-|++++|+|+++.... ..+++  |||+||+|++.+.+
T Consensus         3 ~~~~R~~~lrl~~Gedl~~~i~~~~~~~~I~~a~V~~~iGs~~~~~~~~~~~~~-~~~~~g~~Ei~sl~G~I~~~~   77 (136)
T d2hx0a1           3 ASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTLELTG   77 (136)
T ss_dssp             CCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTCSS-CEEEEEEEEEEEEEEEEETTE
T ss_pred             CCCCcEEEEEECCCChHHHHHHHHHHHhCCCEEEEEEEeeeeEEEEEEeCCCCC-cEEecCcEEEEEEEEEeccCC
Confidence            467899999999999             66899999999999999999987544 45677  99999999997654



>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure